Garcia, Nelson; Messing, Joachim
2017-01-01
The TEL2, TTI1, and TTI2 proteins are co-chaperones for heat shock protein 90 (HSP90) that regulate the folding and maturation of phosphatidylinositol 3-kinase-related kinases (PIKKs). Together they are referred to as the TTT complex, and the genes that encode them are highly conserved from man to maize. TTT complex and PIKK genes exist mostly as single-copy genes in organisms where they have been characterized. Members of this interacting protein network in maize were identified and synteny analyses were performed to study their evolution. As in other species, there is only one copy of each of these genes in maize, which was due to loss of the duplicated copy created by ancient allotetraploidy. Moreover, the retained copies of the TTT complex and PIKK genes tolerated extensive retrotransposon insertion in their introns, which resulted in increased gene lengths and gene body methylation without apparent effect on normal gene expression and function. The results raise an interesting question of whether the reversion to single copy was due to selection against deleterious unbalanced gene duplications between members of the complex, as predicted by the gene balance hypothesis, or due to neutral loss of extra copies. Uneven alteration of dosage, either by adding extra copies or by modulating gene expression of complex members, is proposed as a means to investigate whether the data support the gene balance hypothesis.
Marian, Ali J.; van Rooij, Eva; Roberts, Robert
2016-01-01
This is the first of 2 review papers on genetics and genomics appearing as part of the series on “omics.” Genomics pertains to all components of an organism’s genes, whereas genetics involves analysis of a specific gene(s) in the context of heredity. The paper provides introductory comments, describes the basis of human genetic diversity, and addresses the phenotypic consequences of genetic variants. Rare variants with large effect sizes are responsible for single-gene disorders, whereas complex polygenic diseases are typically due to multiple genetic variants, each exerting a modest effect size. To illustrate the clinical implications of genetic variants with large effect sizes, 3 common forms of hereditary cardiomyopathies are discussed as prototypic examples of single-gene disorders, including their genetics, clinical manifestations, pathogenesis, and treatment. The genetic basis of complex traits is discussed in a separate paper. PMID:28007145
Identification of essential genes and synthetic lethal gene combinations in Escherichia coli K-12.
Mori, Hirotada; Baba, Tomoya; Yokoyama, Katsushi; Takeuchi, Rikiya; Nomura, Wataru; Makishi, Kazuichi; Otsuka, Yuta; Dose, Hitomi; Wanner, Barry L
2015-01-01
Here we describe the systematic identification of single genes and gene pairs whose knockout causes lethality in Escherichia coli K-12. During construction of a precise single-gene knockout library of E. coli K-12, we identified 328 essential gene candidates for growth in complex (LB) medium. Upon establishment of the Keio single-gene deletion library, we undertook the development of the ASKA single-gene deletion library carrying a different antibiotic resistance. In addition, we developed tools for identification of synthetic lethal gene combinations by systematic construction of double-gene knockout mutants. We introduce these methods herein.
Fighting fire with fire: attacking the complexity of human tumors with armed therapeutic viruses.
Hermiston, Terry
2002-08-01
Cancer gene therapies have centered on the use of a single gene, directed against a particular property or single aspect of tumor biology, to treat neoplastic disease. These therapies have met with limited clinical success. This is, perhaps, not surprising given the complex and heterogeneous nature of solid tumors. Treatments targeted at confronting multiple dimensions of human tumors are needed. Armed therapeutic viruses (oncolytic viruses carrying therapeutic genes) represent a system where the concerted action of multiple therapeutics can be joined into a single agent, and represent a promising avenue for developing future cancer therapies.
Pathological mechanisms underlying single large‐scale mitochondrial DNA deletions
Rocha, Mariana C.; Rosa, Hannah S.; Grady, John P.; Blakely, Emma L.; He, Langping; Romain, Nadine; Haller, Ronald G.; Newman, Jane; McFarland, Robert; Ng, Yi Shiau; Gorman, Grainne S.; Schaefer, Andrew M.; Tuppen, Helen A.; Taylor, Robert W.
2018-01-01
Objective: Single, large‐scale deletions in mitochondrial DNA (mtDNA) are a common cause of mitochondrial disease. This study aimed to investigate the relationship between the genetic defect and molecular phenotype to improve understanding of pathogenic mechanisms associated with single, large‐scale mtDNA deletions in skeletal muscle. Methods: We investigated 23 muscle biopsies taken from adult patients (6 males/17 females with a mean age of 43 years) with characterized single, large‐scale mtDNA deletions. Mitochondrial respiratory chain deficiency in skeletal muscle biopsies was quantified by immunoreactivity levels for complex I and complex IV proteins. Single muscle fibers with varying degrees of deficiency were selected from 6 patient biopsies for determination of mtDNA deletion level and copy number by quantitative polymerase chain reaction. Results: We have defined 3 “classes” of single, large‐scale deletion with distinct patterns of mitochondrial deficiency, determined by the size and location of the deletion. Single fiber analyses showed that fibers with greater respiratory chain deficiency harbored higher levels of mtDNA deletion with an increase in total mtDNA copy number. For the first time, we have demonstrated that threshold levels for complex I and complex IV deficiency differ based on deletion class. Interpretation: Combining genetic and immunofluorescent assays, we conclude that thresholds for complex I and complex IV deficiency are modulated by the deletion of complex‐specific protein‐encoding genes. Furthermore, removal of mt‐tRNA genes impacts specific complexes only at high deletion levels, when complex‐specific protein‐encoding genes remain. These novel findings provide valuable insight into the pathogenic mechanisms associated with these mutations. Ann Neurol 2018;83:115–130 PMID:29283441
Gradia, Scott D; Ishida, Justin P; Tsai, Miaw-Sheue; Jeans, Chris; Tainer, John A; Fuss, Jill O
2017-01-01
Recombinant expression of large, multiprotein complexes is essential and often rate limiting for determining structural, biophysical, and biochemical properties of DNA repair, replication, transcription, and other key cellular processes. Baculovirus-infected insect cell expression systems are especially well suited for producing large, human proteins recombinantly, and multigene baculovirus systems have facilitated studies of multiprotein complexes. In this chapter, we describe a multigene baculovirus system called MacroBac that uses a Biobricks-type assembly method based on restriction and ligation (Series 11) or ligation-independent cloning (Series 438). MacroBac cloning and assembly is efficient and equally well suited for either single subcloning reactions or high-throughput cloning using 96-well plates and liquid handling robotics. MacroBac vectors are polypromoter with each gene flanked by a strong polyhedrin promoter and an SV40 poly(A) termination signal that minimize gene order expression level effects seen in many polycistronic assemblies. Large assemblies are robustly achievable, and we have successfully assembled as many as 10 genes into a single MacroBac vector. Importantly, we have observed significant increases in expression levels and quality of large, multiprotein complexes using a single, multigene, polypromoter virus rather than coinfection with multiple, single-gene viruses. Given the importance of characterizing functional complexes, we believe that MacroBac provides a critical enabling technology that may change the way that structural, biophysical, and biochemical research is done. © 2017 Elsevier Inc. All rights reserved.
Convergent evolution of gene networks by single-gene duplications in higher eukaryotes.
Amoutzias, Gregory D; Robertson, David L; Oliver, Stephen G; Bornberg-Bauer, Erich
2004-03-01
By combining phylogenetic, proteomic and structural information, we have elucidated the evolutionary driving forces for the gene-regulatory interaction networks of basic helix-loop-helix transcription factors. We infer that recurrent events of single-gene duplication and domain rearrangement repeatedly gave rise to distinct networks with almost identical hub-based topologies, and multiple activators and repressors. We thus provide the first empirical evidence for scale-free protein networks emerging through single-gene duplications, the dominant importance of molecular modularity in the bottom-up construction of complex biological entities, and the convergent evolution of networks.
Strategic approaches to unraveling genetic causes of cardiovascular diseases
USDA-ARS's Scientific Manuscript database
DNA sequence variants are major components of the "causal field" for virtually all medical phenotypes, whether single gene familial disorders or complex traits without a clear familial aggregation. The causal variants in single gene disorders are necessary and sufficient to impart large effects. In ...
Fourie, Gerda; van der Merwe, Nicolaas A; Wingfield, Brenda D; Bogale, Mesfin; Tudzynski, Bettina; Wingfield, Michael J; Steenkamp, Emma T
2013-09-08
The availability of mitochondrial genomes has allowed for the resolution of numerous questions regarding the evolutionary history of fungi and other eukaryotes. In the Gibberella fujikuroi species complex, the exact relationships among the so-called "African", "Asian" and "American" Clades remain largely unresolved, irrespective of the markers employed. In this study, we considered the feasibility of using mitochondrial genes to infer the phylogenetic relationships among Fusarium species in this complex. The mitochondrial genomes of representatives of the three Clades (Fusarium circinatum, F. verticillioides and F. fujikuroi) were characterized and we determined whether or not the mitochondrial genomes of these fungi have value in resolving the higher level evolutionary relationships in the complex. Overall, the mitochondrial genomes of the three species displayed a high degree of synteny, with all the genes (protein coding genes, unique ORFs, ribosomal RNA and tRNA genes) in identical order and orientation, as well as introns that share similar positions within genes. The intergenic regions and introns generally contributed significantly to the size differences and diversity observed among these genomes. Phylogenetic analysis of the concatenated protein-coding dataset separated members of the Gibberella fujikuroi complex from other Fusarium species and suggested that F. fujikuroi ("Asian" Clade) is basal in the complex. However, individual mitochondrial gene trees were largely incongruent with one another and with the concatenated gene tree, because six distinct phylogenetic trees were recovered from the various single gene datasets. The mitochondrial genomes of Fusarium species in the Gibberella fujikuroi complex are remarkably similar to those of the previously characterized Fusarium species and Sordariomycetes. 
Despite apparently representing a single replicative unit, all of the genes encoded on the mitochondrial genomes of these fungi do not share the same evolutionary history. This incongruence could be due to biased selection on some genes or recombination among mitochondrial genomes. The results thus suggest that the use of individual mitochondrial genes for phylogenetic inference could mask the true relationships between species in this complex.
A kernel regression approach to gene-gene interaction detection for case-control studies.
Larson, Nicholas B; Schaid, Daniel J
2013-11-01
Gene-gene interactions are increasingly being addressed as a potentially important contributor to the variability of complex traits. Consequently, attention has moved beyond single-locus analysis of association to more complex genetic models. Although several single-marker approaches toward interaction analysis have been developed, such methods suffer from very high testing dimensionality and do not take advantage of existing information, notably the definition of genes as functional units. Here, we propose a comprehensive family of gene-level score tests for identifying genetic elements of disease risk, in particular pairwise gene-gene interactions. Using kernel machine methods, we devise score-based variance component tests under a generalized linear mixed model framework. We conducted simulations based upon coalescent genetic models to evaluate the performance of our approach under a variety of disease models. These simulations indicate that our methods are generally higher powered than alternative gene-level approaches and at worst competitive with exhaustive SNP-level (where SNP is single-nucleotide polymorphism) analyses. Furthermore, we observe that simulated epistatic effects resulted in significant marginal testing results for the involved genes regardless of whether or not true main effects were present. We detail the benefits of our methods and discuss potential genome-wide analysis strategies for gene-gene interaction analysis in a case-control study design. © 2013 WILEY PERIODICALS, INC.
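As a rough illustration of the kernel-machine idea in the abstract above, the following Python sketch computes a quadratic score statistic Q = r^T K r for case-control labels and calibrates it by permutation. It is not the authors' test: the intercept-only null model, the linear kernel, the elementwise-product interaction kernel, the permutation calibration, and all function names and toy genotypes are assumptions made for this example. In particular, the sketch does not adjust for main effects of either gene.

```python
import random

def linear_kernel(genotypes):
    """Return the n x n linear kernel K = G G^T for per-subject genotype vectors."""
    n = len(genotypes)
    return [[sum(a * b for a, b in zip(genotypes[i], genotypes[j]))
             for j in range(n)] for i in range(n)]

def interaction_kernel(kernel_a, kernel_b):
    """Elementwise product of two gene kernels (an assumed choice of interaction kernel)."""
    n = len(kernel_a)
    return [[kernel_a[i][j] * kernel_b[i][j] for j in range(n)] for i in range(n)]

def score_statistic(y, kernel):
    """Q = r^T K r, with r = y - mean(y) the residuals under an intercept-only null."""
    mean_y = sum(y) / len(y)
    r = [v - mean_y for v in y]
    n = len(y)
    return sum(r[i] * kernel[i][j] * r[j] for i in range(n) for j in range(n))

def permutation_p_value(y, kernel, n_perm=2000, seed=0):
    """Permutation p-value for Q: shuffle case-control labels, keep genotypes fixed."""
    rng = random.Random(seed)
    observed = score_statistic(y, kernel)
    shuffled = list(y)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if score_statistic(shuffled, kernel) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly positive.
    return observed, (hits + 1) / (n_perm + 1)

# Hypothetical toy data: 8 subjects, two SNPs per gene, coded as minor-allele counts.
gene_a = [[0, 1], [1, 0], [2, 1], [0, 0], [1, 2], [2, 2], [0, 1], [1, 1]]
gene_b = [[1, 0], [0, 0], [2, 2], [0, 1], [1, 1], [2, 1], [0, 0], [1, 2]]
status = [0, 0, 1, 0, 1, 1, 0, 1]  # 1 = case, 0 = control

k_ab = interaction_kernel(linear_kernel(gene_a), linear_kernel(gene_b))
q_ab, p_ab = permutation_p_value(status, k_ab)
```

Working at the gene level means one test per gene pair rather than one per SNP pair, which is the reduction in testing dimensionality that the abstract points to.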
Bialk, Pawel; Rivera-Torres, Natalia; Strouse, Bryan; Kmiec, Eric B.
2015-01-01
Single-stranded DNA oligonucleotides (ssODNs) can direct the repair of a single base mutation in human genes. While the regulation of this gene editing reaction has been partially elucidated, the low frequency with which repair occurs has hampered development toward clinical application. In this work a CRISPR/Cas9 complex is employed to induce double strand DNA breakage at specific sites surrounding the nucleotide designated for exchange. The result is a significant elevation in ssODN-directed gene repair, validated by a phenotypic readout. By analysing reaction parameters, we have uncovered restrictions on gene editing activity involving CRISPR/Cas9 complexes. First, ssODNs that hybridize to the non-transcribed strand direct a higher level of gene repair than those that hybridize to the transcribed strand. Second, cleavage must be proximal to the targeted mutant base to enable higher levels of gene editing. Third, DNA cleavage enables a higher level of gene editing activity as compared to single-stranded DNA nicks, created by modified Cas9 (Nickases). Fourth, we calculated the hybridization potential and free energy levels of ssODNs that are complementary to the guide RNA sequences of CRISPRs used in this study. We find a correlation between free energy potential and the capacity of single-stranded oligonucleotides to inhibit specific DNA cleavage activity, thereby indirectly reducing gene editing activity. Our data provide novel information that might be taken into consideration in the design and usage of CRISPR/Cas9 systems with ssODNs for gene editing. PMID:26053390
[Identification and polymorphism of pectinase genes PGU in the Saccharomyces bayanus complex].
Shalamitskiy, M Yu; Naumov, G I
2016-05-01
Pectinase (endo-polygalacturonase) is the key enzyme that degrades plant pectin. The corresponding single gene, PGU1, is documented for the yeast S. cerevisiae. On the basis of phylogenetic analysis of the PGU nucleotide sequences available in GenBank, a family of divergent PGU genes was found in the species complex S. bayanus: S. bayanus var. uvarum, S. eubayanus, and the hybrid taxon S. pastorianus. The PGU genes have different chromosomal localizations.
Spatial reconstruction of single-cell gene expression data.
Satija, Rahul; Farrell, Jeffrey A; Gennert, David; Schier, Alexander F; Regev, Aviv
2015-05-01
Spatial localization is a key determinant of cellular fate and behavior, but methods for spatially resolved, transcriptome-wide gene expression profiling across complex tissues are lacking. RNA staining methods assay only a small number of transcripts, whereas single-cell RNA-seq, which measures global gene expression, separates cells from their native spatial context. Here we present Seurat, a computational strategy to infer cellular localization by integrating single-cell RNA-seq data with in situ RNA patterns. We applied Seurat to spatially map 851 single cells from dissociated zebrafish (Danio rerio) embryos and generated a transcriptome-wide map of spatial patterning. We confirmed Seurat's accuracy using several experimental approaches, then used the strategy to identify a set of archetypal expression patterns and spatial markers. Seurat correctly localizes rare subpopulations, accurately mapping both spatially restricted and scattered groups. Seurat will be applicable to mapping cellular localization within complex patterned tissues in diverse systems.
An efficient platform for genetic selection and screening of gene switches in Escherichia coli
Muranaka, Norihito; Sharma, Vandana; Nomura, Yoko; Yokobayashi, Yohei
2009-01-01
Engineered gene switches and circuits that can sense various biochemical and physical signals, perform computation, and produce predictable outputs are expected to greatly advance our ability to program complex cellular behaviors. However, rational design of gene switches and circuits that function in living cells is challenging due to the complex intracellular milieu. Consequently, most successful designs of gene switches and circuits have relied, to some extent, on high-throughput screening and/or selection from combinatorial libraries of gene switch and circuit variants. In this study, we describe a generic and efficient platform for selection and screening of gene switches and circuits in Escherichia coli from large libraries. The single-gene dual selection marker tetA was translationally fused to green fluorescent protein (gfpuv) via a flexible peptide linker and used as a dual selection and screening marker for laboratory evolution of gene switches. Single-cycle (sequential positive and negative selections) enrichment efficiencies of >7000 were observed in mock selections of model libraries containing functional riboswitches in liquid culture. The technique was applied to optimize various parameters affecting the selection outcome, and to isolate novel thiamine pyrophosphate riboswitches from a complex library. Artificial riboswitches with excellent characteristics were isolated that exhibit up to 58-fold activation as measured by fluorescent reporter gene assay. PMID:19190095
Martin, Eden R.; Lai, Eric H.; Gilbert, John R.; Rogala, Allison R.; Afshari, A. J.; Riley, John; Finch, K. L.; Stevens, J. F.; Livak, K. J.; Slotterbeck, Brandon D.; Slifer, Susan H.; Warren, Liling L.; Conneally, P. Michael; Schmechel, Donald E.; Purvis, Ian; Pericak-Vance, Margaret A.; Roses, Allen D.; Vance, Jeffery M.
2000-01-01
There has been great interest in the prospects of using single-nucleotide polymorphisms (SNPs) in the search for complex disease genes, and several initiatives devoted to the identification and mapping of SNPs throughout the human genome are currently underway. However, actual data investigating the use of SNPs for identification of complex disease genes are scarce. To begin to look at issues surrounding the use of SNPs in complex disease studies, we have initiated a collaborative SNP mapping study around APOE, the well-established susceptibility gene for late-onset Alzheimer disease (AD). Sixty SNPs in a 1.5-Mb region surrounding APOE were genotyped in samples of unrelated cases of AD, in controls, and in families with AD. Standard tests were conducted to look for association of SNP alleles with AD, in cases and controls. We also used family-based association analyses, including recently developed methods to look for haplotype association. Evidence of association (P⩽.05) was identified for 7 of 13 SNPs, including the APOE-4 polymorphism, spanning 40 kb on either side of APOE. As expected, very strong evidence for association with AD was seen for the APOE-4 polymorphism, as well as for two other SNPs that lie <16 kb from APOE. Haplotype analysis using family data increased significance over that seen in single-locus tests for some of the markers, and, for these data, improved localization of the gene. Our results demonstrate that associations can be detected at SNPs near a complex disease gene. We found that a high density of markers will be necessary in order to have a good chance of including SNPs with detectable levels of allelic association with the disease mutation, and statistical analysis based on haplotypes can provide additional information with respect to tests of significance and fine localization of complex disease genes. PMID:10869235
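The abstract above refers to "standard tests" of SNP allele association in cases and controls without naming them. One common choice is an allelic chi-square test on a 2x2 table of allele counts; the Python sketch below shows that calculation under the assumption that this is the kind of test meant. The function name and the counts in the usage line are hypothetical and are not data from the study.

```python
import math

def allelic_chi_square(case_alt, case_ref, control_alt, control_ref):
    """Pearson chi-square test (1 degree of freedom) on a 2x2 allele-count table.

    Assumes every row and column total is non-zero.
    """
    table = [[case_alt, case_ref], [control_alt, control_ref]]
    total = case_alt + case_ref + control_alt + control_ref
    row_totals = [sum(row) for row in table]
    col_totals = [case_alt + control_alt, case_ref + control_ref]
    chi2 = 0.0
    for i in range(2):
        for j in range(2):
            expected = row_totals[i] * col_totals[j] / total
            chi2 += (table[i][j] - expected) ** 2 / expected
    # For 1 degree of freedom, the upper-tail probability is erfc(sqrt(chi2 / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value

# Hypothetical allele counts: 30/70 alt/ref alleles in cases, 15/85 in controls.
chi2, p_value = allelic_chi_square(30, 70, 15, 85)
```

Each SNP in a region would get its own table, so with many markers the resulting p-values need a multiple-testing correction before being compared against a threshold such as .05.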
Multiple origins of interdependent endosymbiotic complexes in a genus of cicadas.
Łukasik, Piotr; Nazario, Katherine; Van Leuven, James T; Campbell, Matthew A; Meyer, Mariah; Michalik, Anna; Pessacq, Pablo; Simon, Chris; Veloso, Claudio; McCutcheon, John P
2018-01-09
Bacterial endosymbionts that provide nutrients to hosts often have genomes that are extremely stable in structure and gene content. In contrast, the genome of the endosymbiont Hodgkinia cicadicola has fractured into multiple distinct lineages in some species of the cicada genus Tettigades. To better understand the frequency, timing, and outcomes of Hodgkinia lineage splitting throughout this cicada genus, we sampled cicadas over three field seasons in Chile and performed genomics and microscopy on representative samples. We found that a single ancestral Hodgkinia lineage has split at least six independent times in Tettigades over the last 4 million years, resulting in complexes of between two and six distinct Hodgkinia lineages per host. Individual genomes in these symbiotic complexes differ dramatically in relative abundance, genome size, organization, and gene content. Each Hodgkinia lineage retains a small set of core genes involved in genetic information processing, but the high level of gene loss experienced by all genomes suggests that extensive sharing of gene products among symbiont cells must occur. In total, Hodgkinia complexes that consist of multiple lineages encode nearly complete sets of genes present on the ancestral single lineage and presumably perform the same functions as symbionts that have not undergone splitting. However, differences in the timing of the splits, along with dissimilar gene loss patterns on the resulting genomes, have led to very different outcomes of lineage splitting in extant cicadas.
Rivera-Torres, Natalia; Banas, Kelly; Bialk, Pawel; Bloh, Kevin M; Kmiec, Eric B
2017-01-01
CRISPR/Cas9 and single-stranded DNA oligonucleotides (ssODNs) have been used to direct the repair of a single base mutation in human genes. Here, we examine a method designed to increase the precision of RNA guided genome editing in human cells by utilizing a CRISPR/Cas9 ribonucleoprotein (RNP) complex to initiate DNA cleavage. The RNP is assembled in vitro and induces a double stranded break at a specific site surrounding the mutant base designated for correction by the ssODN. We use an integrated mutant eGFP gene, bearing a single base change rendering the expressed protein nonfunctional, as a single copy target in HCT 116 cells. We observe significant gene correction activity of the mutant base, promoted by the RNP and single-stranded DNA oligonucleotide with validation through genotypic and phenotypic readout. We demonstrate that all individual components must be present to obtain successful gene editing. Importantly, we examine the genotype of individually sorted corrected and uncorrected clonally expanded cell populations for the mutagenic footprint left by the action of these gene editing tools. While the DNA sequence of the corrected population is exact with no adjacent sequence modification, the uncorrected population exhibits heterogeneous mutagenicity with a wide variety of deletions and insertions surrounding the target site. We designate this type of DNA aberration as on-site mutagenicity. Analyses of two clonal populations bearing specific DNA insertions surrounding the target site, indicate that point mutation repair has occurred at the level of the gene. The phenotype, however, is not rescued because a section of the single-stranded oligonucleotide has been inserted altering the reading frame and generating truncated proteins. These data illustrate the importance of analysing mutagenicity in uncorrected cells. 
Our results also form the basis of a simple model for point mutation repair directed by a short single-stranded DNA oligonucleotides and CRISPR/Cas9 ribonucleoprotein complex. PMID:28052104
Capomaccio, Stefano; Milanesi, Marco; Bomba, Lorenzo; Cappelli, Katia; Nicolazzi, Ezequiel L; Williams, John L; Ajmone-Marsan, Paolo; Stefanon, Bruno
2015-08-01
Genome-wide association studies (GWAS) have been widely applied to disentangle the genetic basis of complex traits. In cattle breeds, classical GWAS approaches with medium-density marker panels are far from conclusive, especially for complex traits. This is due to the intrinsic limitations of GWAS and the assumptions that are made to step from the association signals to the functional variations. Here, we applied a gene-based strategy to prioritize genotype-phenotype associations found for milk production and quality traits with classical approaches in three Italian dairy cattle breeds with different sample sizes (Italian Brown n = 745; Italian Holstein n = 2058; Italian Simmental n = 477). Although classical regression on single markers revealed only a single genome-wide significant genotype-phenotype association, for Italian Holstein, the gene-based approach identified specific genes in each breed that are associated with milk physiology and mammary gland development. As no standard method has yet been established to step from variation to functional units (i.e., genes), the strategy proposed here may contribute to revealing new genes that play significant roles in complex traits, such as those investigated here, amplifying low association signals using a gene-centric approach. © 2015 Stichting International Foundation for Animal Genetics.
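The abstract above does not say how single-marker signals were aggregated into a gene-level result. As one possible illustration of "amplifying low association signals using a gene-centric approach", the Python sketch below groups SNP p-values by gene and combines them with Fisher's method. This is a swapped-in, generic technique, not the authors' procedure. Fisher's method assumes independent tests, which SNPs in linkage disequilibrium violate, so the combined values here are only indicative. All names and numbers are hypothetical.

```python
import math

def fisher_combined_p(p_values):
    """Fisher's method: X = -2 * sum(ln p) follows chi-square with 2k df under the null.

    Requires every p-value to be strictly greater than 0.
    """
    k = len(p_values)
    statistic = -2.0 * sum(math.log(p) for p in p_values)
    half = statistic / 2.0
    # Closed-form upper tail for even degrees of freedom 2k:
    # exp(-x/2) * sum_{i=0}^{k-1} (x/2)^i / i!
    term = 1.0
    total = 1.0
    for i in range(1, k):
        term *= half / i
        total += term
    return statistic, math.exp(-half) * total

def gene_level_p_values(snp_p_values, snp_to_gene):
    """Group SNP p-values by their annotated gene and combine each group."""
    by_gene = {}
    for snp, p in snp_p_values.items():
        gene = snp_to_gene.get(snp)
        if gene is not None:  # SNPs without a gene annotation are skipped
            by_gene.setdefault(gene, []).append(p)
    return {gene: fisher_combined_p(ps)[1] for gene, ps in by_gene.items()}

# Hypothetical single-marker results and SNP-to-gene annotation.
snp_p = {"snp1": 0.05, "snp2": 0.05, "snp3": 0.40, "snp4": 0.90}
annotation = {"snp1": "GENE_A", "snp2": "GENE_A", "snp3": "GENE_B"}
gene_p = gene_level_p_values(snp_p, annotation)
```

In this toy input neither SNP in GENE_A would pass a stringent single-marker threshold, yet their combined evidence is stronger than either alone, which is the intuition behind gene-based prioritization.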
Regan, Kelly; Wang, Kanix; Doughty, Emily; Li, Haiquan; Li, Jianrong; Lee, Younghee; Kann, Maricel G
2012-01-01
Objective Although trait-associated genes identified as complex versus single-gene inheritance differ substantially in odds ratio, the authors nonetheless posit that their mechanistic concordance can reveal fundamental properties of the genetic architecture, allowing the automated interpretation of unique polymorphisms within a personal genome. Materials and methods An analytical method, SPADE-gen, spanning three biological scales was developed to demonstrate the mechanistic concordance between Mendelian and complex inheritance of Alzheimer's disease (AD) genes: biological functions (BP), protein interaction modeling, and protein domain implicated in the disease-associated polymorphism. Results Among Gene Ontology (GO) biological processes (BP) enriched at a false detection rate <5% in 15 AD genes of Mendelian inheritance (Online Mendelian Inheritance in Man) and independently in those of complex inheritance (25 host genes of intragenic AD single-nucleotide polymorphisms confirmed in genome-wide association studies), 16 overlapped (empirical p=0.007) and 45 were similar (empirical p<0.009; information theory). SPAN network modeling extended the canonical pathway of AD (KEGG) with 26 new protein interactions (empirical p<0.0001). Discussion The study prioritized new AD-associated biological mechanisms and focused the analysis on previously unreported interactions associated with the biological processes of polymorphisms that affect specific protein domains within characterized AD genes and their direct interactors using (1) concordant GO-BP and (2) domain interactions within STRING protein–protein interactions corresponding to the genomic location of the AD polymorphism (eg, EPHA1, APOE, and CD2AP). Conclusion These results are in line with unique-event polymorphism theory, indicating how disease-associated polymorphisms of Mendelian or complex inheritance relate genetically to those observed as ‘unique personal variants’. 
They also provide insight for identifying novel targets, for repositioning drugs, and for personal therapeutics. PMID:22319180
Ewen-Campen, Ben; Mohr, Stephanie E; Hu, Yanhui; Perrimon, Norbert
2017-10-09
Single-gene knockout experiments can fail to reveal function in the context of redundancy, which is frequently observed among duplicated genes (paralogs) with overlapping functions. We discuss the complexity associated with studying paralogs and outline how recent advances in CRISPR will help address the "phenotype gap" and impact biomedical research. Copyright © 2017 Elsevier Inc. All rights reserved.
Mitchelson, K R
1996-01-01
The small single-copy region (SSCR) of the chloroplast genome of many higher plants typically contains ndh genes encoding proteins that share homology with subunits of the respiratory-chain reduced nicotinamide adenine dinucleotide (NADH) dehydrogenase complex of mitochondria. A map of the lettuce chloroplast SSCR has been determined by Southern cross-hybridization, taking advantage of the high degree of homology between a tobacco small single-copy fragment and a corresponding lettuce chloroplast fragment. The gene order of the SSCR of lettuce and tobacco chloroplasts is similar. The cross-hybridization method can rapidly create a primary gene map of unknown chloroplast fragments, thus providing detailed information on the localization and arrangement of genes and conserved open reading frame regions.
Lymphocyte signaling: beyond knockouts.
Saveliev, Alexander; Tybulewicz, Victor L J
2009-04-01
The analysis of lymphocyte signaling was greatly enhanced by the advent of gene targeting, which allows the selective inactivation of a single gene. Although this gene 'knockout' approach is often informative, in many cases, the phenotype resulting from gene ablation might not provide a complete picture of the function of the corresponding protein. If a protein has multiple functions within a single or several signaling pathways, or stabilizes other proteins in a complex, the phenotypic consequences of a gene knockout may manifest as a combination of several different perturbations. In these cases, gene targeting to 'knock in' subtle point mutations might provide more accurate insight into protein function. However, to be informative, such mutations must be carefully designed based on structural and biophysical data.
Efficient Credit Assignment through Evaluation Function Decomposition
NASA Technical Reports Server (NTRS)
Agogino, Adrian; Turner, Kagan; Mikkulainen, Risto
2005-01-01
Evolutionary methods are powerful tools for discovering solutions for difficult continuous tasks. When such a solution is encoded over multiple genes, a genetic algorithm faces the difficult credit assignment problem of evaluating how a single gene in a chromosome contributes to the full solution. Typically a single evaluation function is used for the entire chromosome, implicitly giving each gene in the chromosome the same evaluation. This method is inefficient because a gene will get credit for the contribution of all the other genes as well. Accurately measuring the fitness of individual genes in such a large search space requires many trials. This paper instead proposes turning this single complex search problem into a multi-agent search problem, where each agent has the simpler task of discovering a suitable gene. Gene-specific evaluation functions can then be created that have better theoretical properties than a single evaluation function over all genes. This method is tested in the difficult double-pole balancing problem, showing that agents using gene-specific evaluation functions can create a successful control policy in 20 percent fewer trials than the best existing genetic algorithms. The method is extended to more distributed problems, achieving 95 percent performance gains over traditional methods in the multi-rover domain.
A Nonlinear Model for Gene-Based Gene-Environment Interaction.
Sa, Jian; Liu, Xu; He, Tao; Liu, Guifen; Cui, Yuehua
2016-06-04
A vast amount of literature has confirmed the role of gene-environment (G×E) interaction in the etiology of complex human diseases. Traditional methods are predominantly focused on the analysis of interaction between a single nucleotide polymorphism (SNP) and an environmental variable. Given that genes are the functional units, it is crucial to understand how gene effects (rather than single SNP effects) are influenced by an environmental variable to affect disease risk. Motivated by the increasing awareness of the power of gene-based association analysis over single-variant-based approaches, in this work we propose a sparse principal component regression (sPCR) model to understand the gene-based G×E interaction effect on complex disease. We first extracted the sparse principal components for SNPs in a gene, then the effect of each principal component was modeled by a varying-coefficient (VC) model. The model can jointly model variants in a gene whose effects are nonlinearly influenced by an environmental variable. In addition, the varying-coefficient sPCR (VC-sPCR) model is readily interpretable, since the sparsity of the principal component loadings indicates the relative importance of the corresponding SNPs in each component. We applied our method to a human birth weight dataset from a Thai population. We analyzed 12,005 genes across 22 chromosomes and found one significant interaction effect using the Bonferroni correction method and one suggestive interaction. The model performance was further evaluated through simulation studies. Our model provides a system approach to evaluate gene-based G×E interaction.
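As a rough illustration of the Bonferroni step mentioned in this abstract, the sketch below computes the per-gene significance threshold when 12,005 genes are tested. The family-wise error rate of 0.05 is an assumption (the abstract does not state the level used), and the function name `bonferroni_threshold` is hypothetical.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold under the Bonferroni correction."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


# 12,005 genes is the count reported in the abstract;
# alpha = 0.05 is an assumed family-wise error rate.
N_GENES = 12005
threshold = bonferroni_threshold(0.05, N_GENES)
print(f"per-gene threshold: {threshold:.3e}")
```

Under that assumed level, a gene-level interaction p-value would need to fall below roughly 4.2e-6 to be declared significant.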
Four Linked Genes Participate in Controlling Sporulation Efficiency in Budding Yeast
Ben-Ari, Giora; Zenvirth, Drora; Sherman, Amir; David, Lior; Klutstein, Michael; Lavi, Uri; Hillel, Jossi; Simchen, Giora
2006-01-01
Quantitative traits are conditioned by several genetic determinants. Since such genes influence many important complex traits in various organisms, the identification of quantitative trait loci (QTLs) is of major interest, but still encounters serious difficulties. We detected four linked genes within one QTL, which participate in controlling sporulation efficiency in Saccharomyces cerevisiae. Following the identification of single nucleotide polymorphisms by comparing the sequences of 145 genes between the parental strains SK1 and S288c, we analyzed the segregating progeny of the cross between them. Through reciprocal hemizygosity analysis, four genes, RAS2, PMS1, SWS2, and FKH2, located in a region of 60 kilobases on Chromosome 14, were found to be associated with sporulation efficiency. Three of the four “high” sporulation alleles are derived from the “low” sporulating strain. Two of these sporulation-related genes were verified through allele replacements. For RAS2, the causative variation was suggested to be a single nucleotide difference in the upstream region of the gene. This quantitative trait nucleotide accounts for sporulation variability among a set of ten closely related winery yeast strains. Our results provide a detailed view of genetic complexity in one “QTL region” that controls a quantitative trait and report a single nucleotide polymorphism-trait association in wild strains. Moreover, these findings have implications for QTL identification in higher eukaryotes. PMID:17112318
NASA Technical Reports Server (NTRS)
Stuart, J. J.; Brown, S. J.; Beeman, R. W.; Denell, R. E.; Spooner, B. S. (Principal Investigator)
1993-01-01
The Abdominal gene is a member of the single homeotic complex of the beetle, Tribolium castaneum. An integrated developmental genetic and molecular analysis shows that Abdominal is homologous to the abdominal-A gene of the bithorax complex of Drosophila. abdominal-A mutant embryos display strong homeotic transformations of the anterior abdomen (parasegments 7-9) to PS6, whereas developmental commitments in the posterior abdomen depend primarily on Abdominal-B. In beetle embryos lacking Abdominal function, parasegments throughout the abdomen are transformed to PS6. This observation demonstrates the general functional significance of parasegmental expression among insects and shows that the control of determinative decisions in the posterior abdomen by homeotic selector genes has undergone considerable evolutionary modification.
Genes for normal sleep and sleep disorders.
Tafti, Mehdi; Maret, Stéphanie; Dauvilliers, Yves
2005-01-01
Sleep and wakefulness are complex behaviors that are influenced by many genetic and environmental factors, which are beginning to be discovered. The contribution of genetic components to sleep disorders is also increasingly recognized as important. Point mutations in the prion protein, period 2, and the prepro-hypocretin/orexin gene have been found as the cause of a few sleep disorders but the possibility that other gene defects may contribute to the pathophysiology of major sleep disorders is worth in-depth investigations. However, single gene disorders are rare and most common disorders are complex in terms of their genetic susceptibility, environmental effects, gene-gene, and gene-environment interactions. We review here the current progress in the genetics of normal and pathological sleep.
Incorporating gene-environment interaction in testing for association with rare genetic variants.
Chen, Han; Meigs, James B; Dupuis, Josée
2014-01-01
The incorporation of gene-environment interactions could improve the ability to detect genetic associations with complex traits. For common genetic variants, single-marker interaction tests and joint tests of genetic main effects and gene-environment interaction have been well-established and used to identify novel association loci for complex diseases and continuous traits. For rare genetic variants, however, single-marker tests are severely underpowered due to the low minor allele frequency, and only a few gene-environment interaction tests have been developed. We aimed at developing powerful and computationally efficient tests for gene-environment interaction with rare variants. In this paper, we propose interaction and joint tests for testing gene-environment interaction of rare genetic variants. Our approach is a generalization of existing gene-environment interaction tests for multiple genetic variants under certain conditions. We show in our simulation studies that our interaction and joint tests have correct type I errors, and that the joint test is a powerful approach for testing genetic association, allowing for gene-environment interaction. We also illustrate our approach in a real data example from the Framingham Heart Study. Our approach is powerful and computationally efficient, and it can be applied to both binary and continuous traits.
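The distinction this abstract draws between an interaction test and a joint test can be written out for the familiar single-marker case. The formulation below is a generic textbook sketch, not the rare-variant statistic proposed by the authors; the symbols (link function g, genotype G_i, exposure E_i, and the beta coefficients) are assumed notation.

```latex
% Generic single-marker G x E model (assumed notation):
%   Y_i : trait of subject i (binary or continuous)
%   G_i : genotype,  E_i : environmental exposure,  g : link function
\[
  g\bigl(\mathbb{E}[Y_i]\bigr)
    = \beta_0 + \beta_G G_i + \beta_E E_i + \beta_{GE}\, G_i E_i
\]
% Interaction test: only the interaction coefficient is constrained.
\[
  H_0^{\text{int}} : \beta_{GE} = 0
\]
% Joint test: genetic main effect and interaction are tested together.
\[
  H_0^{\text{joint}} : \beta_G = \beta_{GE} = 0
\]
```

In this form the joint test can detect a variant whose effect shows up as a main effect, as an interaction, or as both, which is consistent with the abstract describing it as a test of genetic association that allows for gene-environment interaction.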
Wang, Zhuo; Jin, Shuilin; Liu, Guiyou; Zhang, Xiurui; Wang, Nan; Wu, Deliang; Hu, Yang; Zhang, Chiping; Jiang, Qinghua; Xu, Li; Wang, Yadong
2017-05-23
The development of single-cell RNA sequencing has enabled profound discoveries in biology, ranging from the dissection of the composition of complex tissues to the identification of novel cell types and dynamics in some specialized cellular environments. However, the large-scale generation of single-cell RNA-seq (scRNA-seq) data collected at multiple time points remains a challenge to the effective measurement of gene expression patterns in transcriptome analysis. We present an algorithm based on the Dynamic Time Warping score (DTWscore) combined with time-series data that enables the detection of gene expression changes across scRNA-seq samples and the recovery of potential cell types from complex mixtures of multiple cell types. The DTWscore successfully classifies cells of different types with the most highly variable genes from time-series scRNA-seq data. The study was confined to methods that are implemented and available within the R framework. Sample datasets and R packages are available at https://github.com/xiaoxiaoxier/DTWscore .
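For readers unfamiliar with dynamic time warping, the sketch below shows the classic dynamic-programming recurrence for the DTW distance between two expression time series. It is a minimal illustration of the underlying technique only, assuming an absolute-difference local cost; it is not the DTWscore package's implementation (which is in R), and the function name `dtw_distance` is hypothetical.

```python
import math
from typing import Sequence


def dtw_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Classic DTW distance with absolute difference as the local cost."""
    n, m = len(a), len(b)
    if n == 0 or m == 0:
        raise ValueError("both series must be non-empty")
    # cost[i][j] holds the minimal cumulative cost of aligning a[:i] with b[:j].
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of the three admissible predecessor alignments.
            cost[i][j] = local + min(
                cost[i - 1][j],      # a advances, b repeats its last point
                cost[i][j - 1],      # b advances, a repeats its last point
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


# A time-shifted copy of a profile aligns at zero cost, unlike a pointwise comparison.
gene_t = [0.0, 1.0, 2.0]
gene_shifted = [0.0, 0.0, 1.0, 2.0]
print(dtw_distance(gene_t, gene_shifted))
```

Because the warping path may repeat time points, two genes with the same expression shape but different timing score as similar, which is the property that makes DTW attractive for time-series expression data.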
NASA Astrophysics Data System (ADS)
Werthmann, Britta; Marwan, Wolfgang
2017-11-01
The developmental switch to sporulation in Physarum polycephalum is a phytochrome-mediated far-red light-induced cell fate decision that synchronously encompasses the entire multinucleate plasmodial cell and is associated with extensive reprogramming of the transcriptome. By repeatedly taking samples of single cells after delivery of a light stimulus pulse, we analysed differential gene expression in two mutant strains and in a heterokaryon of the two strains, all of which display a different propensity for making the cell fate decision. Multidimensional scaling of the gene expression data revealed individually different single cell trajectories eventually leading to sporulation. Characterization of the trajectories as walks through states of gene expression discretized by hierarchical clustering allowed the reconstruction of Petri nets that model and predict the observed behavior. Structural analyses of the Petri nets indicated stimulus- and genotype-dependence of both the single cell trajectories and the quasipotential landscape through which these trajectories are taken. The Petri net-based approach to the analysis and decomposition of complex cellular responses and of complex mutant phenotypes may provide a scaffold for the data-driven reconstruction of causal molecular mechanisms that shape the topology of the quasipotential landscape.
Methods for detection of ataxia telangiectasia mutations
Gatti, Richard A.
2005-10-04
The present invention is directed to a method of screening large, complex, polyexonic eukaryotic genes such as the ATM gene for mutations and polymorphisms by an improved version of single strand conformation polymorphism (SSCP) electrophoresis that allows electrophoresis of two or three amplified segments in a single lane. The present invention also is directed to new mutations and polymorphisms in the ATM gene that are useful in performing more accurate screening of human DNA samples for mutations and in distinguishing mutations from polymorphisms, thereby improving the efficiency of automated screening methods.
Modified SSCP method using sequential electrophoresis of multiple nucleic acid segments
Gatti, Richard A.
2002-10-01
The present invention is directed to a method of screening large, complex, polyexonic eukaryotic genes such as the ATM gene for mutations and polymorphisms by an improved version of single strand conformation polymorphism (SSCP) electrophoresis that allows electrophoresis of two or three amplified segments in a single lane. The present invention also is directed to new mutations and polymorphisms in the ATM gene that are useful in performing more accurate screening of human DNA samples for mutations and in distinguishing mutations from polymorphisms, thereby improving the efficiency of automated screening methods.
Saeed, Mohammad
2017-05-01
Systemic lupus erythematosus (SLE) is a complex disorder. Genetic association studies of complex disorders suffer from the following three major issues: phenotypic heterogeneity, false positive (type I error), and false negative (type II error) results. Hence, genes with low to moderate effects are missed in standard analyses, especially after statistical corrections. OASIS is a novel linkage disequilibrium clustering algorithm that can potentially address false positives and negatives in genome-wide association studies (GWAS) of complex disorders such as SLE. OASIS was applied to two SLE dbGAP GWAS datasets (6077 subjects; ∼0.75 million single-nucleotide polymorphisms). OASIS identified three known SLE genes viz. IFIH1, TNIP1, and CD44, not previously reported using these GWAS datasets. In addition, 22 novel loci for SLE were identified and the 5 SLE genes previously reported using these datasets were verified. OASIS methodology was validated using single-variant replication and gene-based analysis with GATES. This led to the verification of 60% of OASIS loci. New SLE genes that OASIS identified and were further verified include TNFAIP6, DNAJB3, TTF1, GRIN2B, MON2, LATS2, SNX6, RBFOX1, NCOA3, and CHAF1B. This study presents the OASIS algorithm, software, and the meta-analyses of two publicly available SLE GWAS datasets along with the novel SLE genes. Hence, OASIS is a novel linkage disequilibrium clustering method that can be universally applied to existing GWAS datasets for the identification of new genes.
Routine Discovery of Complex Genetic Models using Genetic Algorithms
Moore, Jason H.; Hahn, Lance W.; Ritchie, Marylyn D.; Thornton, Tricia A.; White, Bill C.
2010-01-01
Simulation studies are useful in various disciplines for a number of reasons including the development and evaluation of new computational and statistical methods. This is particularly true in human genetics and genetic epidemiology where new analytical methods are needed for the detection and characterization of disease susceptibility genes whose effects are complex, nonlinear, and partially or solely dependent on the effects of other genes (i.e. epistasis or gene-gene interaction). Despite this need, the development of complex genetic models that can be used to simulate data is not always intuitive. In fact, only a few such models have been published. We have previously developed a genetic algorithm approach to discovering complex genetic models in which two single nucleotide polymorphisms (SNPs) influence disease risk solely through nonlinear interactions. In this paper, we extend this approach for the discovery of high-order epistasis models involving three to five SNPs. We demonstrate that the genetic algorithm is capable of routinely discovering interesting high-order epistasis models in which each SNP influences risk of disease only through interactions with the other SNPs in the model. This study opens the door for routine simulation of complex gene-gene interactions among SNPs for the development and evaluation of new statistical and computational approaches for identifying common, complex multifactorial disease susceptibility genes. PMID:20948983
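To make concrete what it means for SNPs to influence risk "solely through nonlinear interactions", the sketch below checks a hand-written two-SNP penetrance table for marginal effects. The XOR-style table is a standard illustrative model and is not one of the models discovered by the authors' genetic algorithm; Hardy-Weinberg genotype frequencies, a minor allele frequency of 0.5, and the helper names are all assumptions.

```python
from typing import List, Tuple


def genotype_freqs(maf: float) -> List[float]:
    """Hardy-Weinberg frequencies for genotypes with 0, 1, 2 minor alleles."""
    q = maf
    p = 1.0 - q
    return [p * p, 2.0 * p * q, q * q]


def marginal_penetrances(
    penetrance: List[List[float]], maf_a: float, maf_b: float
) -> Tuple[List[float], List[float]]:
    """Disease risk per genotype of each SNP, averaged over the other SNP."""
    freq_a = genotype_freqs(maf_a)
    freq_b = genotype_freqs(maf_b)
    # Rows index SNP A genotypes, columns index SNP B genotypes.
    marg_a = [sum(penetrance[i][j] * freq_b[j] for j in range(3)) for i in range(3)]
    marg_b = [sum(penetrance[i][j] * freq_a[i] for i in range(3)) for j in range(3)]
    return marg_a, marg_b


# Illustrative XOR-style penetrance table: risk depends on the genotype combination.
XOR_MODEL = [
    [0.0, 1.0, 0.0],
    [1.0, 0.0, 1.0],
    [0.0, 1.0, 0.0],
]

marg_a, marg_b = marginal_penetrances(XOR_MODEL, maf_a=0.5, maf_b=0.5)
print(marg_a, marg_b)
```

Every marginal penetrance comes out equal, so neither SNP shows any effect when examined alone, even though the genotype combination fully determines risk in this toy table.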
Hox genes and study of Hox genes in crustacean
NASA Astrophysics Data System (ADS)
Hou, Lin; Chen, Zhijuan; Xu, Mingyu; Lin, Shengguo; Wang, Lu
2004-12-01
Homeobox genes have been discovered in many species. These genes are known to play a major role in specifying regional identity along the anterior-posterior axis of animals from a wide range of phyla. The products of the homeotic genes are a set of evolutionarily conserved transcription factors that control elaborate developmental processes and specify cell fates in metazoans. Crustaceans, which present a variety of body plans not encountered in any other class or phylum of the Metazoa, have been shown to possess a single set of homologous Hox genes, as do insects. The ancestral crustacean Hox gene complex comprised ten genes: eight homologous to the homeotic Hox genes and two related to nonhomeotic genes present within the insect Hox complexes. Crustaceans in particular exhibit an abundant diversity of segment specialization and tagmosis. This morphological diversity relates to the Hox genes. In the crustacean body plan, different Hox genes control different segments and tagmosis.
Lymphocyte signaling: beyond knockouts
Saveliev, Alexander; Tybulewicz, Victor L. J.
2016-01-01
The analysis of lymphocyte signaling was greatly enhanced by the advent of gene targeting, which allows the selective inactivation of a single gene. Whereas this gene ‘knockout’ approach is often informative, in many cases the phenotype resulting from gene ablation might not provide a complete picture of the function of the corresponding protein. If a protein has multiple functions within a single or several signaling pathways, or stabilizes other proteins in a complex, the phenotypic consequences of a gene knockout may manifest as a combination of several different perturbations. In these cases, gene targeting to ‘knockin’ subtle point mutations might provide more accurate insight into protein function. However, to be informative, such mutations must be carefully designed based on structural and biophysical data. PMID:19295633
TRACING CO-REGULATORY NETWORK DYNAMICS IN NOISY, SINGLE-CELL TRANSCRIPTOME TRAJECTORIES.
Cordero, Pablo; Stuart, Joshua M
2017-01-01
The availability of gene expression data at the single cell level makes it possible to probe the molecular underpinnings of complex biological processes such as differentiation and oncogenesis. Promising new methods have emerged for reconstructing a progression 'trajectory' from static single-cell transcriptome measurements. However, it remains unclear how to adequately model the appreciable level of noise in these data to elucidate gene regulatory network rewiring. Here, we present a framework called Single Cell Inference of MorphIng Trajectories and their Associated Regulation (SCIMITAR) that infers progressions from static single-cell transcriptomes by employing a continuous parametrization of Gaussian mixtures in high-dimensional curves. SCIMITAR yields rich models from the data that highlight genes with expression and co-expression patterns that are associated with the inferred progression. Further, SCIMITAR extracts regulatory states from the implicated trajectory-evolving co-expression networks. We benchmark the method on simulated data to show that it yields accurate cell ordering and gene network inferences. Applied to the interpretation of a single-cell human fetal neuron dataset, SCIMITAR finds progression-associated genes in cornerstone neural differentiation pathways missed by standard differential expression tests. Finally, by leveraging the rewiring of gene-gene co-expression relations across the progression, the method reveals the rise and fall of co-regulatory states and trajectory-dependent gene modules. These analyses implicate new transcription factors in neural differentiation including putative co-factors for the multi-functional NFAT pathway.
Proglucagons in vertebrates: Expression and processing of multiple genes in a bony fish.
Busby, Ellen R; Mommsen, Thomas P
2016-09-01
In contrast to mammals, where a single proglucagon (PG) gene encodes three peptides: glucagon, glucagon-like peptide 1 and glucagon-like peptide 2 (GLP-1; GLP-2), many non-mammalian vertebrates carry multiple PG genes. Here, we investigate proglucagon mRNA sequences, their tissue expression and processing in a diploid bony fish. Copper rockfish (Sebastes caurinus) express two independent genes coding for distinct proglucagon sequences (PG I, PG II), with PG II lacking the GLP-2 sequence. These genes are differentially transcribed in the endocrine pancreas, the brain, and the gastrointestinal tract. Alternative splicing identified in rockfish is only one part of this complex regulation of the PG transcripts: the system has the potential to produce two glucagons, four GLP-1s and a single GLP-2, or any combination of these peptides. Mass spectrometric analysis of partially purified PG-derived peptides in endocrine pancreas confirms translation of both PG transcripts and differential processing of the resulting peptides. The complex differential regulation of the two PG genes and their continued presence in this extant teleostean fish strongly suggests unique and, as yet largely unidentified, roles for the peptide products encoded in each gene. Copyright © 2016 Elsevier Inc. All rights reserved.
2012-01-01
High-dimensional gene expression data provide a rich source of information because they capture the expression level of genes in dynamic states that reflect the biological functioning of a cell. For this reason, such data are suitable to reveal systems related properties inside a cell, e.g., in order to elucidate molecular mechanisms of complex diseases like breast or prostate cancer. However, this is not only strongly dependent on the sample size and the correlation structure of a data set, but also on the statistical hypotheses tested. Many different approaches have been developed over the years to analyze gene expression data to (I) identify changes in single genes, (II) identify changes in gene sets or pathways, and (III) identify changes in the correlation structure in pathways. In this paper, we review statistical methods for all three types of approaches, including subtypes, in the context of cancer data and provide links to software implementations and tools and address also the general problem of multiple hypotheses testing. Further, we provide recommendations for the selection of such analysis methods. Reviewers This article was reviewed by Arcady Mushegian, Byung-Soo Kim and Joel Bader. PMID:23227854
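The multiple hypothesis testing problem raised in this abstract is commonly handled in gene expression studies by controlling the false discovery rate. The sketch below implements the Benjamini-Hochberg step-up procedure as one widely used example; the abstract does not say which corrections the review recommends, so the choice of procedure, the function name `benjamini_hochberg`, and the example p-values are all assumptions.

```python
from typing import List, Sequence


def benjamini_hochberg(p_values: Sequence[float], alpha: float = 0.05) -> List[bool]:
    """Return a reject/keep flag per hypothesis using the BH step-up rule."""
    m = len(p_values)
    # Indices of the hypotheses ordered from smallest to largest p-value.
    order = sorted(range(m), key=lambda i: p_values[i])
    # Find the largest rank k whose ordered p-value satisfies p_(k) <= (k / m) * alpha.
    max_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * alpha:
            max_rank = rank
    # Reject every hypothesis ranked at or below that k, even if it missed its own cutoff.
    rejected = [False] * m
    for idx in order[:max_rank]:
        rejected[idx] = True
    return rejected


# Made-up per-gene p-values for illustration only.
example_p = [0.001, 0.008, 0.039, 0.041, 0.042, 0.06, 0.074, 0.205, 0.212, 0.216]
flags = benjamini_hochberg(example_p, alpha=0.05)
print(sum(flags), "of", len(example_p), "genes flagged")
```

Compared with a Bonferroni cutoff, this step-up rule is less conservative when many genes are tested, which is one reason it is popular for differential expression screens.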
Tamminen, Manu V; Virta, Marko P J
2015-01-01
Recent progress in environmental microbiology has revealed vast populations of microbes in any given habitat that cannot be detected by conventional culturing strategies. The use of sensitive genetic detection methods such as CARD-FISH and in situ PCR has been limited by the cell wall permeabilization requirement, which cannot be met uniformly across all cell types without lysing some cells and leaving others nonpermeabilized. Furthermore, the detection of low copy targets such as genes present in single copies in the microbial genomes has remained problematic. We describe an emulsion-based procedure to trap individual microbial cells into picoliter-volume polyacrylamide droplets that provide a rigid support for genetic material and therefore allow complete degradation of cellular material to expose the individual genomes. The polyacrylamide droplets are subsequently converted into picoliter-scale reactors for genome amplification. The amplified genomes are labeled based on the presence of a target gene and differentiated from those that do not contain the gene by flow cytometry. Using the Escherichia coli strains XL1 and MC1061, which differ with respect to the presence (XL1) or absence (MC1061) of a single copy of a tetracycline resistance gene per genome, we demonstrate that XL1 genomes present at 0.1% of MC1061 genomes can be differentiated using this method. Using a spiked sediment microbial sample, we demonstrate that the method is applicable to highly complex environmental microbial communities as a target gene-based screen for individual microbes. The method provides a novel tool for enumerating functional cell populations in complex microbial communities. We envision that the method could be optimized for fluorescence-activated cell sorting to enrich genetic material of interest from complex environmental samples.
Mammalian monogamy is not controlled by a single gene
Fink, Sabine; Excoffier, Laurent; Heckel, Gerald
2006-01-01
Complex social behavior in Microtus voles and other mammals has been postulated to be under the direct genetic control of a single locus: the arginine vasopressin 1a receptor (avpr1a) gene. Using a phylogenetic approach, we show that a repetitive element in the promoter region of avpr1a, which reportedly causes social monogamy, is actually widespread in nonmonogamous Microtus and other rodents. There was no evidence for intraspecific polymorphism in regard to the presence or absence of the repetitive element. Among 25 rodent species studied, the element was absent in only two closely related nonmonogamous species, indicating that this absence is certainly the result of an evolutionarily recent loss. Our analyses further demonstrate that the repetitive structures upstream of the avpr1a gene in humans and primates, which have been associated with social bonding, are evolutionarily distinct from those in rodents. Our evolutionary approach reveals that monogamy in rodents is not controlled by a single polymorphism in the promoter region of the avpr1a gene. We thus resolve the contradiction between the claims for an evolutionarily conserved genetic programming of social behavior in mammals and the vast evidence for highly complex and flexible mating systems. PMID:16832060
Shirts, Brian H; Salipante, Stephen J; Casadei, Silvia; Ryan, Shawnia; Martin, Judith; Jacobson, Angela; Vlaskin, Tatyana; Koehler, Karen; Livingston, Robert J; King, Mary-Claire; Walsh, Tom; Pritchard, Colin C
2014-10-01
Single-exon inversions have rarely been described in clinical syndromes and are challenging to detect using Sanger sequencing. We report the case of a 40-year-old woman with adenomatous colon polyps too numerous to count and who had a complex inversion spanning the entire exon 10 in APC (the gene encoding adenomatous polyposis coli), causing exon skipping and resulting in a frameshift and premature protein truncation. In this study, we employed complete APC gene sequencing using high-coverage next-generation sequencing by ColoSeq, analysis with BreakDancer and SLOPE software, and confirmatory transcript analysis. ColoSeq identified a complex small genomic rearrangement consisting of an inversion that results in translational skipping of exon 10 in the APC gene. This mutation would not have been detected by traditional sequencing or gene-dosage methods. We report a case of adenomatous polyposis resulting from a complex single-exon inversion. Our report highlights the benefits of large-scale sequencing methods that capture intronic sequences with high enough depth of coverage, as well as the use of informatics tools, to enable detection of small pathogenic structural rearrangements.
Meslin, Camille; Plakke, Melissa S.; Deutsch, Aaron B.; Small, Brandon S.; Morehouse, Nathan I.; Clark, Nathan L.
2015-01-01
Persistent adaptive challenges are often met with the evolution of novel physiological traits. Although there are specific examples of single genes providing new physiological functions, studies on the origin of complex organ functions are lacking. One such derived set of complex functions is found in the Lepidopteran bursa copulatrix, an organ within the female reproductive tract that digests nutrients from the male ejaculate or spermatophore. Here, we characterized bursa physiology and the evolutionary mechanisms by which it was equipped with digestive and absorptive functionality. By studying the transcriptome of the bursa and eight other tissues, we revealed a suite of highly expressed and secreted gene products providing the bursa with a combination of stomach-like traits for mechanical and enzymatic digestion of the male spermatophore. By subsequently placing these bursa genes in an evolutionary framework, we found that the vast majority of their novel digestive functions were co-opted by borrowing genes that continue to be expressed in nonreproductive tissues. However, a number of bursa-specific genes have also arisen, some of which represent unique gene families restricted to Lepidoptera and may provide novel bursa-specific functions. This pattern of promiscuous gene borrowing and relatively infrequent evolution of tissue-specific duplicates stands in contrast to studies of the evolution of novelty via single gene co-option. Our results suggest that the evolution of complex organ-level phenotypes may often be enabled (and subsequently constrained) by changes in tissue specificity that allow expression of existing genes in novel contexts, such as reproduction. The extent to which the selective pressures encountered in these novel roles require resolution via duplication and sub/neofunctionalization is likely to be determined by the need for specialized reproductive functionality. 
Thus, complex physiological phenotypes such as that found in the bursa offer important opportunities for understanding the relative role of pleiotropy and specialization in adaptive evolution. PMID:25725432
Ito, Teruyo; Ma, Xiao Xue; Takeuchi, Fumihiko; Okuma, Keiko; Yuzawa, Harumi; Hiramatsu, Keiichi
2004-01-01
Staphylococcal cassette chromosome mec (SCCmec) is a mobile genetic element composed of the mec gene complex, which encodes methicillin resistance, and the ccr gene complex, which encodes the recombinases responsible for its mobility. The mec gene complex has been classified into four classes, and the ccr gene complex has been classified into three allotypes. Different combinations of mec gene complex classes and ccr gene complex types have so far defined four types of SCCmec elements. Now we introduce the fifth allotype of SCCmec, which was found on the chromosome of a community-acquired methicillin-resistant Staphylococcus aureus strain (strain WIS [WBG8318]) isolated in Australia. The element shared the same chromosomal integration site with the four extant types of SCCmec and the characteristic nucleotide sequences at the chromosome-SCCmec junction regions. The novel SCCmec carried mecA bracketed by IS431 (IS431-mecA-ΔmecR1-IS431), which is designated the class C2 mec gene complex; and instead of ccrA and ccrB genes, it carried a single copy of a gene homologue that encoded cassette chromosome recombinase. Since the open reading frame (ORF) was found to encode an enzyme which catalyzes the precise excision as well as site- and orientation-specific integration of the element, we designated the ORF cassette chromosome recombinase C (ccrC), and we designated the element type V SCCmec. Type V SCCmec is a small SCCmec element (28 kb) and does not carry any antibiotic resistance genes besides mecA. Unlike the extant SCCmec types, it carries a set of foreign genes encoding a restriction-modification system that might play a role in the stabilization of the element on the chromosome. PMID:15215121
A survey of human brain transcriptome diversity at the single cell level.
Darmanis, Spyros; Sloan, Steven A; Zhang, Ye; Enge, Martin; Caneda, Christine; Shuer, Lawrence M; Hayden Gephart, Melanie G; Barres, Ben A; Quake, Stephen R
2015-06-09
The human brain is a tissue of vast complexity in terms of the cell types it comprises. Conventional approaches to classifying cell types in the human brain at single cell resolution have been limited to exploring relatively few markers and therefore have provided a limited molecular characterization of any given cell type. We used single cell RNA sequencing on 466 cells to capture the cellular complexity of the adult and fetal human brain at a whole transcriptome level. Healthy adult temporal lobe tissue was obtained during surgical procedures where otherwise normal tissue was removed to gain access to deeper hippocampal pathology in patients with medically refractory seizures. We were able to classify individual cells into all of the major neuronal, glial, and vascular cell types in the brain. We were able to divide neurons into individual communities and show that these communities preserve the categorization of interneuron subtypes that is typically observed with the use of classic interneuron markers. We then used single cell RNA sequencing on fetal human cortical neurons to identify genes that are differentially expressed between fetal and adult neurons and those genes that display an expression gradient that reflects the transition between replicating and quiescent fetal neuronal populations. Finally, we observed the expression of major histocompatibility complex type I genes in a subset of adult neurons, but not fetal neurons. The work presented here demonstrates the applicability of single cell RNA sequencing to the study of the adult human brain and constitutes a first step toward a comprehensive cellular atlas of the human brain.
Adamson, Britt; Norman, Thomas M.; Jost, Marco; Cho, Min Y.; Nuñez, James K.; Chen, Yuwen; Villalta, Jacqueline E.; Gilbert, Luke A.; Horlbeck, Max A.; Hein, Marco Y.; Pak, Ryan A.; Gray, Andrew N.; Gross, Carol A.; Dixit, Atray; Parnas, Oren; Regev, Aviv; Weissman, Jonathan S.
2016-01-01
SUMMARY Functional genomics efforts face tradeoffs between number of perturbations examined and complexity of phenotypes measured. We bridge this gap with Perturb-seq, which combines droplet-based single-cell RNA-seq with a strategy for barcoding CRISPR-mediated perturbations, allowing many perturbations to be profiled in pooled format. We applied Perturb-seq to dissect the mammalian unfolded protein response (UPR) using single and combinatorial CRISPR perturbations. Two genome-scale CRISPR interference (CRISPRi) screens identified genes whose repression perturbs ER homeostasis. Subjecting ~100 hits to Perturb-seq enabled high-precision functional clustering of genes. Single-cell analyses decoupled the three UPR branches, revealed bifurcated UPR branch activation among cells subject to the same perturbation, and uncovered differential activation of the branches across hits, including an isolated feedback loop between the translocon and IRE1α. These studies provide insight into how the three sensors of ER homeostasis monitor distinct types of stress and highlight the ability of Perturb-seq to dissect complex cellular responses. PMID:27984733
Plessy, Charles; Desbois, Linda; Fujii, Teruo; Carninci, Piero
2013-02-01
Tissues contain complex populations of cells. Like countries, which are comprised of mixed populations of people, tissues are not homogeneous. Gene expression studies that analyze entire populations of cells from tissues as a mixture are blind to this diversity. Thus, critical information is lost when studying samples rich in specialized but diverse cells such as tumors, iPS colonies, or brain tissue. High throughput methods are needed to address, model and understand the constitutive and stochastic differences between individual cells. Here, we describe microfluidics technologies that utilize a combination of molecular biology and miniaturized labs on chips to study gene expression at the single cell level. We discuss how the characterization of the transcriptome of each cell in a sample will open a new field in gene expression analysis, population transcriptomics, that will change the academic and biomedical analysis of complex samples by defining them as quantified populations of single cells. Copyright © 2013 WILEY Periodicals, Inc.
Linkage mapping of a mouse gene, iv, that controls left-right asymmetry of the heart and viscera.
Brueckner, M; D'Eustachio, P; Horwich, A L
1989-01-01
Inherited single gene defects have been identified in both humans and mice that lead to loss of developmental control over the left-right asymmetry of the heart and viscera. In mice the recessively inherited mutation iv leads to such apparent loss of control over situs: 50% of iv/iv mice exhibit situs inversus and 50% exhibit normal situs. The affected gene product has not been identified in these animals. To study the normal function of iv, we have taken an approach directed to the gene itself. As a first step, we have mapped iv genetically, by examining its segregation in backcrosses with respect to markers defined by restriction fragment length polymorphisms. The iv locus lies 3 centimorgans (cM) from the immunoglobulin heavy-chain constant-region gene complex (Igh-C) on chromosome 12. A multilocus map of the region suggests the gene order centromere-Aat (alpha 1-antitrypsin gene complex)-(11 cM)-iv-(3 cM)-Igh-C-(1 cM)-Igh-V (immunoglobulin heavy-chain variable-region gene complex). PMID:2740340
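The gene order and intervals reported in this abstract can be turned into cumulative map positions by simple addition. The following is a minimal sketch, assuming that map distances are additive over these short intervals and taking Aat as an arbitrary zero point; the function and variable names are illustrative and do not come from the paper.

```python
from itertools import accumulate

# Intervals (in cM) between adjacent loci, as reported in the abstract:
# Aat -(11)- iv -(3)- Igh-C -(1)- Igh-V
LOCI = ["Aat", "iv", "Igh-C", "Igh-V"]
INTERVALS_CM = [11, 3, 1]


def cumulative_positions(loci, intervals):
    """Return {locus: position in cM}, with the first locus placed at 0.

    Assumes map distances are additive, which is a reasonable
    approximation only for short intervals.
    """
    if len(intervals) != len(loci) - 1:
        raise ValueError("need exactly one interval between each pair of adjacent loci")
    positions = [0, *accumulate(intervals)]
    return dict(zip(loci, positions))


positions = cumulative_positions(LOCI, INTERVALS_CM)
```

Under these assumptions, iv sits 11 cM from Aat and Igh-V lies 15 cM from Aat; the zero point is a convenience and says nothing about the distance to the centromere, which the abstract does not give.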
The Neuroanatomy and Neuroendocrinology of Fragile X Syndrome
ERIC Educational Resources Information Center
Hessl, David; Rivera, Susan M.; Reiss, Allan L.
2004-01-01
Fragile X syndrome (FXS), caused by a single gene mutation on the X chromosome, offers a unique opportunity for investigation of gene-brain-behavior relationships. Recent advances in molecular genetics, human brain imaging, and behavioral studies have started to unravel the complex pathways leading to the cognitive, psychiatric, and physical…
Schoeman, Elizna M; Lopez, Genghis H; McGowan, Eunike C; Millard, Glenda M; O'Brien, Helen; Roulis, Eileen V; Liew, Yew-Wah; Martin, Jacqueline R; McGrath, Kelli A; Powley, Tanya; Flower, Robert L; Hyland, Catherine A
2017-04-01
Blood group single nucleotide polymorphism genotyping probes for a limited range of polymorphisms. This study investigated whether massively parallel sequencing (also known as next-generation sequencing), with a targeted exome strategy, provides an extended blood group genotype and the extent to which massively parallel sequencing correctly genotypes in homologous gene systems, such as RH and MNS. Donor samples (n = 28) that were extensively phenotyped and genotyped using single nucleotide polymorphism typing, were analyzed using the TruSight One Sequencing Panel and MiSeq platform. Genes for 28 protein-based blood group systems, GATA1, and KLF1 were analyzed. Copy number variation analysis was used to characterize complex structural variants in the GYPC and RH systems. The average sequencing depth per target region was 66.2 ± 39.8. Each sample harbored on average 43 ± 9 variants, of which 10 ± 3 were used for genotyping. For the 28 samples, massively parallel sequencing variant sequences correctly matched expected sequences based on single nucleotide polymorphism genotyping data. Copy number variation analysis defined the Rh C/c alleles and complex RHD hybrids. Hybrid RHD*D-CE-D variants were correctly identified, but copy number variation analysis did not confidently distinguish between D and CE exon deletion versus rearrangement. The targeted exome sequencing strategy employed extended the range of blood group genotypes detected compared with single nucleotide polymorphism typing. This single-test format included detection of complex MNS hybrid cases and, with copy number variation analysis, defined RH hybrid genes along with the RHCE*C allele hitherto difficult to resolve by variant detection. The approach is economical compared with whole-genome sequencing and is suitable for a red blood cell reference laboratory setting. © 2017 AABB.
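A summary such as "66.2 ± 39.8" for sequencing depth is a mean and a standard deviation taken over target regions. The sketch below shows that calculation on invented numbers; it assumes the sample standard deviation is intended (the abstract does not say which estimator was used), and the depth values are hypothetical, not data from the study.

```python
from statistics import mean, stdev


def summarize_depth(depths):
    """Return (mean, sample standard deviation) of per-region read depths."""
    if len(depths) < 2:
        raise ValueError("need at least two regions to estimate a standard deviation")
    return mean(depths), stdev(depths)


# Hypothetical per-target-region depths, invented for illustration only.
example_depths = [20, 40, 60, 80, 100]
avg, sd = summarize_depth(example_depths)
summary = f"{avg:.1f} ± {sd:.1f}"
```

A standard deviation that is large relative to the mean, as in the reported figures, indicates uneven coverage across targets, which is worth checking before trusting genotype calls in low-depth regions.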
Zhao, Dejian; Lin, Mingyan; Pedrosa, Erika; Lachman, Herbert M; Zheng, Deyou
2017-11-10
Monoallelic expression of autosomal genes has been implicated in human psychiatric disorders. However, there is a paucity of allelic expression studies in human brain cells at the single cell and genome wide levels. In this report, we reanalyzed a previously published single-cell RNA-seq dataset from several postmortem human brains and observed pervasive monoallelic expression in individual cells, largely in a random manner. Examining single nucleotide variants with a predicted functional disruption, we found that the "damaged" alleles were overall expressed in fewer brain cells than their counterparts, and at a lower level in cells where their expression was detected. We also identified many brain cell type-specific monoallelically expressed genes. Interestingly, many of these cell type-specific monoallelically expressed genes were enriched for functions important for those brain cell types. In addition, function analysis showed that genes displaying monoallelic expression and correlated expression across neuronal cells from different individual brains were implicated in the regulation of synaptic function. Our findings suggest that monoallelic gene expression is prevalent in human brain cells, which may play a role in generating cellular identity and neuronal diversity and thus increasing the complexity and diversity of brain cell functions.
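Calling monoallelic expression in a single cell typically comes down to comparing read counts for the two alleles at a heterozygous site. The sketch below is one simple way to express that decision; the coverage floor and the 95% cutoff are assumptions chosen for illustration, not the criteria used in the study, and the function name is hypothetical.

```python
def classify_allelic_expression(ref_reads, alt_reads, min_reads=10, threshold=0.95):
    """Classify one heterozygous site in one cell from allele-specific read counts.

    min_reads and threshold are illustrative assumptions, not values
    taken from the study.
    """
    total = ref_reads + alt_reads
    if total < min_reads:
        # Too few reads to distinguish true monoallelic expression from sampling noise.
        return "insufficient coverage"
    major_fraction = max(ref_reads, alt_reads) / total
    return "monoallelic" if major_fraction >= threshold else "biallelic"


calls = [
    classify_allelic_expression(30, 0),
    classify_allelic_expression(12, 10),
    classify_allelic_expression(3, 0),
]
```

The coverage floor matters because low read counts in single-cell data can make a biallelically expressed gene look monoallelic by chance.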
A high-throughput screen for single gene activities: isolation of apoptosis inducers.
Albayrak, Timur; Grimm, Stefan
2003-05-16
We describe a novel genetic screen that is performed by transfecting every individual clone of an expression library into a separate population of cells in a high-throughput mode. The screen allows one to achieve a hitherto unattained sensitivity in expression cloning which was exploited in a first read-out to clone apoptosis-inducing genes. This led to the isolation of several genes whose proteins induce distinct phenotypes of apoptosis in 293T cells. One of the isolated genes is the tumor suppressor cytochrome b(L) (cybL), a component of the respiratory chain complex II, that diminishes the activity of this complex for apoptosis induction. This gene is more efficient and specific for causing cell death than a drug with the same activity. These results suggest further applications, both of the isolated genes and the screen.
[Research advance of dosage compensation and MSL complex].
Sun, Min-Qiu; Lin, Peng; Chen, Yun; Wang, Yi-Lei; Zhang, Zi-Ping
2012-05-01
Dosage compensation, which is widespread in sexually reproducing eukaryotes, is an essential biological process that equalizes the level of gene expression between the sexes based on sex determination. In Drosophila, the male-specific lethal (MSL) complex mediates dosage compensation by acetylating histone H4 lysine 16 (H4K16) on nucleosomes at specific sites on the male X chromosome; it globally upregulates expression of active X-linked genes twofold, compensating for the fact that male Drosophila have only a single X chromosome. To date, the structure of the basic components of the MSL complex, which consists of at least five protein subunits and two non-coding RNAs, has been revealed, and the interaction sites among these components have been generally identified. Furthermore, numerous studies on the recognition mechanism of the complex have been published. In contrast, many studies have revealed that mammalian dosage compensation functions by silencing gene expression from one of the two X chromosomes in females. The main components of the mammalian MSL complex have already been identified, but knowledge of their function is limited. So far, MSLs in teleosts have scarcely been studied. This review summarizes the similarities and differences among the dosage compensation mechanisms of nematodes, fruit flies and mammals, introduces recent research advances on the MSL complex as well as the molecular mechanism of dosage compensation in the fruit fly, and finally addresses some problems to be resolved. In addition, synteny analysis reveals diversity of the msl3 gene in fishes. This information might provide insightful directions for future research on the mechanisms of dosage compensation in various species.
Carver-Brown, Rachel K.; Reis, Arthur H.; Rice, Lisa M.; Czajka, John W.; Wangh, Lawrence J.
2012-01-01
Aims. The goal of this study was to construct a single tube molecular diagnostic multiplex assay for the detection of microbial pathogens commonly associated with septicemia, using LATE-PCR and Lights-On/Lights-Off probe technology. Methods and Results. The assay described here identified pathogens associated with sepsis by amplification and analysis of the 16S ribosomal DNA gene sequence for bacteria and specific gene sequences for fungi. A sequence from an unidentified gene in Lactococcus lactis subsp. cremoris served as a positive control for assay function. LATE-PCR was used to generate single-stranded amplicons that were then analyzed at endpoint over a wide temperature range in a specific fluorescent color. Each bacterial target was identified by its pattern of hybridization to Lights-On/Lights-Off probes derived from molecular beacons. Complex mixtures of targets were also detected. Conclusions. All microbial targets were identified in samples containing low starting copy numbers of pathogen genomic DNA, both as individual targets and in complex mixtures. Significance and Impact of the Study. This assay uses new technology to achieve an advance in the field of molecular diagnostics: a single-tube multiplex assay for identification of pathogens commonly associated with sepsis. PMID:23326668
The evolution of resistance genes in multi-protein plant resistance systems.
Friedman, Aaron R; Baker, Barbara J
2007-12-01
The genomic perspective aids in integrating the analysis of single resistance (R-) genes into a higher order model of complex plant resistance systems. The majority of R-genes encode a class of proteins with nucleotide binding (NB) and leucine-rich repeat (LRR) domains. Several R-proteins act in multi-protein R-complexes that mediate interaction with pathogen effectors to induce resistance signaling. The complexity of these systems seems to have resulted from multiple rounds of plant-pathogen co-evolution. R-gene evolution is thought to be facilitated by the formation of R-gene clusters, which permit sequence exchanges via recombinatorial mispairing and generate high haplotypic diversity. This pattern of evolution may also generate diversity at other loci that contribute to the R-complex. The rate of recombination at R-clusters is not necessarily homogeneous or consistent over evolutionary time: recent evidence suggests that recombination at R-clusters is increased following pathogen infection, suggesting a mechanism that induces temporary genome instability in response to extreme stress. DNA methylation and chromatin modifications may allow this instability to be conditionally regulated and targeted to specific genome regions. Knowledge of natural R-gene evolution may contribute to strategies for artificial evolution of novel resistance specificities.
Yang, Yong; Wu, Zhihong; Zhao, Taimao; Wang, Hai; Zhao, Dong; Zhang, Jianguo; Wang, Yipeng; Ding, Yaozhong; Qiu, Guixing
2009-06-01
The etiology of adolescent idiopathic scoliosis is undetermined despite years of research. A number of hypotheses have been postulated to explain its development, including growth abnormalities. The irregular expression of growth hormone and insulin-like growth factor-1 (IGF-1) may disturb hormone metabolism, result in a gross asymmetry, and promote the progress of adolescent idiopathic scoliosis. Initial association studies in complex diseases have demonstrated the power of candidate gene association. Prior to our study, one study in this field had reported a negative result, and replication is vital for reliability. To determine the relationship of the growth hormone receptor and IGF-1 genes with adolescent idiopathic scoliosis, a population-based association study was performed. Single nucleotide polymorphisms with potential function were selected from the candidate genes and a distribution analysis was performed. We conclude that there is insufficient evidence for an association between adolescent idiopathic scoliosis and the single-nucleotide polymorphisms of the growth hormone receptor and IGF-1 genes in Han Chinese.
2011-01-01
Background In Drosophila, the Enhancer of split complex (E(spl)-C) comprises 11 bHLH and Bearded genes that function during Notch signaling to repress proneural identity in the developing peripheral nervous system. Comparison with other insects indicates that the basal state for Diptera is a single bHLH and Bearded homolog and that the expansion of the gene complex occurred in the lineage leading to Drosophila. However, comparative genomic data from other fly species that would elucidate the origin and sequence of gene duplication for the complex is lacking. Therefore, in order to examine the evolutionary history of the complex within Diptera, we reconstructed, using several fosmid clones, the entire E(spl)-complex in the stalk-eyed fly, Teleopsis dalmanni and collected additional homologs of E(spl)-C genes from searches of dipteran EST databases and the Glossina morsitans genome assembly. Results Comparison of the Teleopsis E(spl)-C gene organization with Drosophila indicates complete conservation in gene number and orientation between the species except that T. dalmanni contains a duplicated copy of E(spl)m5 that is not present in Drosophila. Phylogenetic analysis of E(spl)-complex bHLH and Bearded genes for several dipteran species clearly demonstrates that all members of the complex were present prior to the diversification of schizophoran flies. Comparison of upstream regulatory elements and 3' UTR domains between the species also reveals strong conservation for many of the genes and identifies several novel characteristics of E(spl)-C regulatory evolution including the discovery of a previously unidentified, highly conserved SPS+A domain between E(spl)mγ and E(spl)mβ. Conclusion Identifying the phylogenetic origin of E(spl)-C genes and their associated regulatory DNA is essential to understanding the functional significance of this well-studied gene complex. 
Results from this study provide numerous insights into the evolutionary history of the complex and will help refine the focus of studies examining the adaptive consequences of this gene expansion. PMID:22151427
Li, Wan; Zhu, Lina; Huang, Hao; He, Yuehan; Lv, Junjie; Li, Weimin; Chen, Lina; He, Weiming
2017-10-01
Complex chronic diseases are caused by the effects of genetic and environmental factors. Single nucleotide polymorphisms (SNPs), one common type of genetic variation, play vital roles in disease. We hypothesized that disease risk functional SNPs in coding regions and protein interaction network modules were more likely to contribute to the identification of disease susceptibility genes for complex chronic diseases. This could help to further reveal the pathogenesis of complex chronic diseases. Disease risk SNPs were first recognized from public SNP data for coronary heart disease (CHD), hypertension (HT) and type 2 diabetes (T2D). SNPs in coding regions that were classified as nonsense or missense by integrating several SNP functional annotation databases were treated as functional SNPs. Then, regions significantly associated with each disease were screened using random permutations for disease risk functional SNPs. Corresponding to these regions, 155, 169 and 173 potential disease susceptibility genes were identified for CHD, HT and T2D, respectively. A disease-related gene product interaction network in environmental context was constructed for interacting gene products of both disease genes and potential disease susceptibility genes for these diseases. After functional enrichment analysis for disease-associated modules, 5 CHD susceptibility genes, 7 HT susceptibility genes and 3 T2D susceptibility genes were finally identified, some of which had pleiotropic effects. Most of these genes were verified to be related to these diseases in the literature. The same held for disease genes identified by a different method proposed by Lee et al. This research could provide novel perspectives for the diagnosis and treatment of complex chronic diseases and for the identification of susceptibility genes for other diseases. Copyright © 2017 Elsevier Inc. All rights reserved.
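The abstract does not spell out how regions were screened by random permutation, so the following is only a generic sketch of the idea: ask how often a region would contain at least the observed number of risk SNPs if the same number of risk SNPs were scattered at random among all SNPs. The function name, the indexing of SNPs by integer, and every parameter value are assumptions made for illustration, not the authors' procedure.

```python
import random


def permutation_p_value(observed_count, n_risk_snps, n_total_snps,
                        region_size, n_permutations=10_000, seed=0):
    """Empirical p-value for seeing >= observed_count risk SNPs in a region.

    Each permutation draws n_risk_snps of the n_total_snps SNP indices at
    random, without replacement, and counts how many fall inside the region
    (taken here to be indices 0 .. region_size - 1).
    """
    rng = random.Random(seed)
    hits = 0
    for _ in range(n_permutations):
        sample = rng.sample(range(n_total_snps), n_risk_snps)
        in_region = sum(1 for idx in sample if idx < region_size)
        if in_region >= observed_count:
            hits += 1
    # Add-one correction keeps the estimate away from an impossible p = 0.
    return (hits + 1) / (n_permutations + 1)


# Hypothetical numbers: 8 of 50 risk SNPs land in a 200-SNP region out of 10,000 SNPs.
p = permutation_p_value(observed_count=8, n_risk_snps=50, n_total_snps=10_000,
                        region_size=200, n_permutations=2_000)
```

With these invented numbers the expected count in the region is 1, so observing 8 yields a small empirical p-value; in a real screen the threshold would also need correction for the number of regions tested.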
Large Scale Single Nucleotide Polymorphism Study of PD Susceptibility
2005-03-01
identification of eight genetic loci in the familial PD, the results of intensive investigations of polymorphisms in dozens of genes related to sporadic, late...1) investigate the association between classical, sporadic PD and 2386 SNPs in 23 genes implicated in the pathogenesis of PD; (2) construct...addition, experiences derived from this study may be applied in other complex disorders for the identification of susceptibility genes, as well as in genome
Komatsu, Masanori; Nishino, Kagetomo; Fujimori, Yuki; Haga, Yasutoshi; Iwama, Nagako; Arakawa, Aisaku; Aihara, Yoshito; Takeda, Hisato; Takahashi, Hideaki
2018-02-01
Growth hormone secretagogue receptor 1a (GHSR1a), growth hormone (GH), growth hormone receptor (GHR), non-SMC condensin I complex, subunit G (NCAPG) and stearoyl-CoA desaturase (SCD) are known to play important roles in growth and lipid metabolism. Single and epistatic effects of the five genes on carcass, price-related and fatty acid (FA) composition traits were analyzed in a commercial Japanese Black cattle population of Ibaraki Prefecture. A total of 650 steers and 116 heifers for carcass and price-related traits, and 158 steers for FA composition traits were used in this study. Epistatic effects between pairs of the five genes were found in several traits. Alleles showing strain-specific differences in the five genes had significant single and epistatic effects in some traits. The data suggest that a TG-repeat polymorphism of the GHSR1a.5'UTR-(TG)n locus plays a central role in gene-gene epistatic interaction of FA composition traits in the adipose tissue of Japanese Black cattle. © 2017 Japanese Society of Animal Science.
Merhej, Vicky; Raoult, Didier
2012-01-01
Darwin's theory about the evolution of species has been the object of considerable dispute. In this review, we have described seven key principles in Darwin's book The Origin of Species and tried to present how genomics challenges each of these concepts and improves our knowledge about evolution. Darwin believed that species evolution consists of a positive directional selection ensuring the “survival of the fittest.” The most developed state of the species is characterized by increasing complexity. Darwin proposed the theory of “descent with modification” according to which all species evolve from a single common ancestor through a gradual process of small modification of their vertical inheritance. Finally, the process of evolution can be depicted in the form of a tree. However, microbial genomics showed that evolution is better described as the “biological changes over time.” The mode of change is not unidirectional and does not necessarily favor advantageous mutations to increase fitness; it is rather subject to random selection as a result of catastrophic stochastic processes. Complexity is not necessarily the completion of development: several complex organisms have gone extinct and many microbes including bacteria with intracellular lifestyle have streamlined highly effective genomes. Genomes evolve through large events of gene deletions, duplications, insertions, and genome rearrangements rather than a gradual adaptive process. Genomes are dynamic and chimeric entities with gene repertoires that result from vertical and horizontal acquisitions as well as de novo gene creation. The chimeric character of microbial genomes excludes the possibility of finding a single common ancestor for all the genes recorded currently. Genomes are collections of genes with different evolutionary histories that cannot be represented by a single tree of life (TOL). 
A forest, a network or a rhizome of life may be more accurate to represent evolutionary relationships among species. PMID:22973559
Evolution of the Class IV HD-Zip Gene Family in Streptophytes
Zalewski, Christopher S.; Floyd, Sandra K.; Furumizu, Chihiro; Sakakibara, Keiko; Stevenson, Dennis W.; Bowman, John L.
2013-01-01
Class IV homeodomain leucine zipper (C4HDZ) genes are plant-specific transcription factors that, based on phenotypes in Arabidopsis thaliana, play an important role in epidermal development. In this study, we sampled all major extant lineages and their closest algal relatives for C4HDZ homologs and phylogenetic analyses result in a gene tree that mirrors land plant evolution with evidence for gene duplications in many lineages, but minimal evidence for gene losses. Our analysis suggests an ancestral C4HDZ gene originated in an algal ancestor of land plants and a single ancestral gene was present in the last common ancestor of land plants. Independent gene duplications are evident within several lineages including mosses, lycophytes, euphyllophytes, seed plants, and, most notably, angiosperms. In recently evolved angiosperm paralogs, we find evidence of pseudogenization via mutations in both coding and regulatory sequences. The increasing complexity of the C4HDZ gene family through the diversification of land plants correlates to increasing complexity in epidermal characters. PMID:23894141
Allcock, Richard J N; Barrow, Alexander D; Forbes, Simon; Beck, Stephan; Trowsdale, John
2003-02-01
We have characterized a cluster of single immunoglobulin variable (IgV) domain receptors centromeric of the major histocompatibility complex (MHC) on human chromosome 6. In addition to triggering receptor expressed on myeloid cells (TREM)-1 and TREM2, the cluster contains NKp44, a triggering receptor whose expression is limited to NK cells. We identified three new related genes and two gene fragments within a cluster of approximately 200 kb. Two of the three new genes lack charged residues in their transmembrane domain tails. Further, one of the genes contains two potential immunotyrosine inhibitory motifs in its cytoplasmic tail, suggesting that it delivers inhibitory signals. The human and mouse TREM clusters appear to have diverged such that there are unique sequences in each species. Finally, each gene in the TREM cluster was expressed in a different range of cell types.
Genome-scale transcriptional activation by an engineered CRISPR-Cas9 complex.
Konermann, Silvana; Brigham, Mark D; Trevino, Alexandro E; Joung, Julia; Abudayyeh, Omar O; Barcena, Clea; Hsu, Patrick D; Habib, Naomi; Gootenberg, Jonathan S; Nishimasu, Hiroshi; Nureki, Osamu; Zhang, Feng
2015-01-29
Systematic interrogation of gene function requires the ability to perturb gene expression in a robust and generalizable manner. Here we describe structure-guided engineering of a CRISPR-Cas9 complex to mediate efficient transcriptional activation at endogenous genomic loci. We used these engineered Cas9 activation complexes to investigate single-guide RNA (sgRNA) targeting rules for effective transcriptional activation, to demonstrate multiplexed activation of ten genes simultaneously, and to upregulate long intergenic non-coding RNA (lincRNA) transcripts. We also synthesized a library consisting of 70,290 guides targeting all human RefSeq coding isoforms to screen for genes that, upon activation, confer resistance to a BRAF inhibitor. The top hits included genes previously shown to be able to confer resistance, and novel candidates were validated using individual sgRNA and complementary DNA overexpression. A gene expression signature based on the top screening hits correlated with markers of BRAF inhibitor resistance in cell lines and patient-derived samples. These results collectively demonstrate the potential of Cas9-based activators as a powerful genetic perturbation technology.
Molas, M; Bartrons, R; Perales, J C
2002-08-15
Nonviral gene transfer vectors have been actively studied in the past years in order to obtain structural entities with minimum size and defined shape. The final size of a gene transfer vector, which is compacted into unimolecular complexes, is directly proportional to the mass of the nucleic acid to be compacted. Thus, the purpose of this study was to assess the possibility of producing ssDNA vectors and their biophysical and biological characterization. We have obtained ssDNA/poly-L-lysine complexes that are significantly smaller than their double-stranded counterparts. We have also identified a lesser aggregative behavior of compacted single-stranded vs. double-stranded DNA vectors in the presence of physiological NaCl concentrations. Expression of compacted ssDNA is observed in hepatoma cell lines. Moreover, we have successfully delivered galactosylated ssDNA complexes into cells that express the asialoglycoprotein receptor via receptor-mediated endocytosis. The reduced size and biophysical behavior of ssDNA vectors may provide an advantage for transfection of eukaryotic cells.
Rapid screening for nuclear genes mutations in isolated respiratory chain complex I defects.
Pagniez-Mammeri, Hélène; Lombes, Anne; Brivet, Michèle; Ogier-de Baulny, Hélène; Landrieu, Pierre; Legrand, Alain; Slama, Abdelhamid
2009-04-01
Complex I or reduced nicotinamide adenine dinucleotide (NADH):ubiquinone oxidoreductase deficiency is the most common cause of respiratory chain defects. Molecular bases of complex I deficiencies are rarely identified because of the dual genetic origin of this multi-enzymatic complex (nuclear DNA and mitochondrial DNA) and the lack of phenotype-genotype correlation. We used a rapid method to screen patients with isolated complex I deficiencies for nuclear gene mutations by Surveyor nuclease digestion of cDNAs. Eight complex I nuclear genes, among the most frequently mutated (NDUFS1, NDUFS2, NDUFS3, NDUFS4, NDUFS7, NDUFS8, NDUFV1 and NDUFV2), were studied in 22 cDNA fragments spanning their coding sequences in 8 patients with a biochemically proven complex I deficiency. Single nucleotide polymorphisms and missense mutations were detected in 18.7% of the cDNA fragments by Surveyor nuclease treatment. Molecular defects were detected in 3 patients. Surveyor nuclease screening is a reliable method for genotyping nuclear complex I deficiencies, easy to interpret, and limits the number of sequence reactions. Its use will enhance the possibility of prenatal diagnosis and contribute to a better understanding of complex I molecular defects.
Network Medicine: From Cellular Networks to the Human Diseasome
NASA Astrophysics Data System (ADS)
Barabasi, Albert-Laszlo
2014-03-01
Given the functional interdependencies between the molecular components in a human cell, a disease is rarely a consequence of an abnormality in a single gene, but reflects the perturbations of the complex intracellular network. The tools of network science offer a platform to explore systematically not only the molecular complexity of a particular disease, leading to the identification of disease modules and pathways, but also the molecular relationships between apparently distinct (patho)phenotypes. Advances in this direction not only enrich our understanding of complex systems, but are also essential to identify new disease genes, to uncover the biological significance of disease-associated mutations identified by genome-wide association studies and full genome sequencing, and to identify drug targets and biomarkers for complex diseases.
Gao, Yan; Ni, Xiaohui; Guo, Hua; Su, Zhe; Ba, Yi; Tong, Zhongsheng; Guo, Zhi; Yao, Xin; Chen, Xixi; Yin, Jian; Yan, Zhao; Guo, Lin; Liu, Ying; Bai, Fan; Xie, X Sunney; Zhang, Ning
2017-08-01
Copy number alteration (CNA) is a major contributor to genome instability, a hallmark of cancer. Here, we studied genomic alterations in single primary tumor cells and circulating tumor cells (CTCs) from the same patient. Single-nucleotide variants (SNVs) in single cells from both samples occurred sporadically, whereas CNAs among primary tumor cells emerged accumulatively rather than abruptly, converging toward the CNA in CTCs. Focal CNAs affecting the MYC gene and the PTEN gene were observed only in a minor portion of primary tumor cells but were present in all CTCs, suggesting a strong selection toward metastasis. Single-cell structural variant (SV) analyses revealed a two-step mechanism, a complex rearrangement followed by gene amplification, for the simultaneous formation of anomalous CNAs in multiple chromosome regions. Integrative CNA analyses of 97 CTCs from 23 patients confirmed the convergence of CNAs and revealed single, concurrent, and mutually exclusive CNAs that could be the driving events in cancer metastasis. © 2017 Gao et al.; Published by Cold Spring Harbor Laboratory Press.
Single cell genome analysis of an uncultured heterotrophic stramenopile
NASA Astrophysics Data System (ADS)
Roy, Rajat S.; Price, Dana C.; Schliep, Alexander; Cai, Guohong; Korobeynikov, Anton; Yoon, Hwan Su; Yang, Eun Chan; Bhattacharya, Debashish
2014-04-01
A broad swath of eukaryotic microbial biodiversity cannot be cultivated in the lab and is therefore inaccessible to conventional genome-wide comparative methods. One promising approach to study these lineages is single cell genomics (SCG), whereby an individual cell is captured from nature and genome data are produced from the amplified total DNA. Here we tested the efficacy of SCG to generate a draft genome assembly from a single sample, in this case a cell belonging to the broadly distributed MAST-4 uncultured marine stramenopiles. Using de novo gene prediction, we identified 6,996 protein-encoding genes in the MAST-4 genome. This genetic inventory was sufficient to place the cell within the ToL using multigene phylogenetics and provided preliminary insights into the complex evolutionary history of horizontal gene transfer (HGT) in the MAST-4 lineage.
Urushibara, Noriko; Paul, Shyamal Kumar; Hossain, Mohammad Akram; Kawaguchiya, Mitsuyo; Kobayashi, Nobumichi
2011-06-01
Methicillin resistance in staphylococci is conferred by the chromosomal acquisition of the mecA gene, which is located on a mobile genetic element called staphylococcal cassette chromosome mec (SCCmec). The genetic type of SCCmec is defined by the combination of mec gene complex class and cassette chromosome recombinase gene (ccr) allotype. In this study, we analyzed the genetic diversity of SCCmec in 11 Staphylococcus haemolyticus strains and a Staphylococcus sciuri strain, which were recently isolated from clinical specimens in Bangladesh. Among these strains, only two S. haemolyticus strains were shown to carry known types of SCCmec, that is, SCCmec V (class C2 mec-ccrC) and VII (class C1 mec-ccrC). Five S. haemolyticus strains were assigned to two unique mec-ccr gene complex combinations; that is, class C1 mec-ccrA4B4 (four isolates) and class A mec-ccrC (one isolate). In the remaining four S. haemolyticus strains with class C1 mec, no known ccr allotypes could be detected. A single S. sciuri strain with class A mec complex carried a ccrA gene belonging to a novel allotype designated ccrA7, together with ccrB3. The ccrA7 gene in the S. sciuri strain showed 61.7%-82.7% sequence identity to the ccrA gene sequences published so far, and 75.3% identity to ccrA3, which is a component of the type 3 ccr complex (ccrA3-ccrB3) in methicillin-resistant Staphylococcus aureus. The results of the present study indicated that the mec gene complex and ccr genes in coagulase-negative staphylococci are highly divergent, and distinct from those of common methicillin-resistant S. aureus. Identification of the novel ccrA7 allotype combined with ccrB3 suggested the occurrence of recombination between different ccr complexes in nature.
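The typing rule stated in this abstract (an SCCmec type is defined by the pair of mec gene complex class and ccr allotype) can be sketched as a simple lookup. This is only an illustrative sketch: the table holds just the two known types named in the abstract, and the names `KNOWN_SCCMEC_TYPES` and `sccmec_type` are hypothetical, not part of any published typing tool.

```python
# Hypothetical lookup table: only the two known SCCmec types named in the
# abstract are listed (V = class C2 mec + ccrC, VII = class C1 mec + ccrC).
KNOWN_SCCMEC_TYPES = {
    ("C2", "ccrC"): "V",
    ("C1", "ccrC"): "VII",
}


def sccmec_type(mec_class, ccr_allotype):
    """Return the SCCmec type for a (mec class, ccr allotype) pair.

    Returns None when the pair is not in this small table, as would happen
    for the combinations the abstract describes as unique.
    """
    return KNOWN_SCCMEC_TYPES.get((mec_class, ccr_allotype))
```

Under these assumptions, a combination such as class C1 mec with ccrA4B4 yields no match, which mirrors how the study flags that combination as not corresponding to a known type.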
Large Scale Single Nucleotide Polymorphism Study of PD Susceptibility
2006-03-01
familial PD, the results of intensive investigations of polymorphisms in dozens of genes related to sporadic, late onset, typical PD have not shown...association between classical, sporadic PD and 2386 SNPs in 23 genes implicated in the pathogenesis of PD; (2) construct haplotypes based on the SNP...derived from this study may be applied in other complex disorders for the identification of susceptibility genes, as well as in genome-wide SNP
A deficiency of the homeotic complex of the beetle Tribolium
NASA Technical Reports Server (NTRS)
Stuart, J. J.; Brown, S. J.; Beeman, R. W.; Denell, R. E.; Spooner, B. S. (Principal Investigator)
1991-01-01
In Drosophila, the establishment of regional commitments along most of the anterior/posterior axis of the developing embryo depends on two clusters of homeotic genes: the Antennapedia complex (ANT-C) and the bithorax complex (BX-C). The red flour beetle has a single complex (HOM-C) representing the homologues of the ANT-C and BX-C in juxtaposition. Beetles trans-heterozygous for two particular HOM-C mutations spontaneously generate a large deficiency, presumably by an exchange within the common region of two overlapping inversions. Genetic and molecular results indicate that this deficiency spans at least the interval between the Deformed and abdominal-A homologues. In deficiency homozygous embryos, all gnathal, thoracic and abdominal segments develop antennal appendages, suggesting that a gene(s) has been deleted that acts to distinguish trunk from head. There is no evidence that beetles have a homologue of the segmentation gene fushi tarazu of similar genomic location and function. On the basis of the genetic tractability, convenient genome size and organization of Tribolium, and its relatively long phylogenetic divergence from Drosophila (>300 million years), we have integrated developmental genetic and molecular analyses of the HOM-C. We isolated about 70 mutations in the complex representing at least six complementation groups. The homeotic phenotypes of adults and lethal embryos lead us to believe that these beetle genes are homologous with the Drosophila genes indicated in Fig. 1 (see text).
A deficiency of the homeotic complex of the beetle Tribolium.
Stuart, J J; Brown, S J; Beeman, R W; Denell, R E
1991-03-07
In Drosophila, the establishment of regional commitments along most of the anterior/posterior axis of the developing embryo depends on two clusters of homeotic genes: the Antennapedia complex (ANT-C) and the bithorax complex (BX-C). The red flour beetle has a single complex (HOM-C) representing the homologues of the ANT-C and BX-C in juxtaposition. Beetles trans-heterozygous for two particular HOM-C mutations spontaneously generate a large deficiency, presumably by an exchange within the common region of two overlapping inversions. Genetic and molecular results indicate that this deficiency spans at least the interval between the Deformed and abdominal-A homologues. In deficiency homozygous embryos, all gnathal, thoracic and abdominal segments develop antennal appendages, suggesting that a gene(s) has been deleted that acts to distinguish trunk from head. There is no evidence that beetles have a homologue of the segmentation gene fushi tarazu of similar genomic location and function. On the basis of the genetic tractability, convenient genome size and organization of Tribolium, and its relatively long phylogenetic divergence from Drosophila (>300 million years), we have integrated developmental genetic and molecular analyses of the HOM-C. We isolated about 70 mutations in the complex representing at least six complementation groups. The homeotic phenotypes of adults and lethal embryos lead us to believe that these beetle genes are homologous with the Drosophila genes indicated in Fig. 1 (see text).
Patterns of homoeologous gene expression shown by RNA sequencing in hexaploid bread wheat
2014-01-01
Background Bread wheat (Triticum aestivum) has a large, complex and hexaploid genome consisting of A, B and D homoeologous chromosome sets. Therefore each wheat gene potentially exists as a trio of A, B and D homoeoloci, each of which may contribute differentially to wheat phenotypes. We describe a novel approach combining wheat cytogenetic resources (chromosome substitution ‘nullisomic-tetrasomic’ lines) with next generation deep sequencing of gene transcripts (RNA-Seq), to directly and accurately identify homoeologue-specific single nucleotide variants and quantify the relative contribution of individual homoeoloci to gene expression. Results We discover, based on a sample comprising ~5-10% of the total wheat gene content, that at least 45% of wheat genes are expressed from all three distinct homoeoloci. Most of these genes show strikingly biased expression patterns in which expression is dominated by a single homoeolocus. The remaining ~55% of wheat genes are expressed from either one or two homoeoloci only, through a combination of extensive transcriptional silencing and homoeolocus loss. Conclusions We conclude that wheat is tending towards functional diploidy, through a variety of mechanisms causing single homoeoloci to become the predominant source of gene transcripts. This discovery has profound consequences for wheat breeding and our understanding of wheat evolution. PMID:24726045
Morino, Masato; Natsui, Shinsuke; Swartz, Talia H; Krulwich, Terry A; Ito, Masahiro
2008-06-01
Mrp antiporters catalyze secondary Na(+)(Li(+))/H(+) antiport and/or K(+)/H(+) antiport that is physiologically important in diverse bacteria. An additional capacity for anion flux has been observed for a few systems. Mrp is unique among antiporters in that it requires all six or seven hydrophobic gene products (MrpA to MrpG) of the mrp operon for full antiporter activity, but MrpE has been reported to be dispensable. Here, the membrane complexes formed by Mrp proteins were examined using a cloned mrp operon from alkaliphilic Bacillus pseudofirmus OF4. The operon was engineered so that the seven Mrp proteins could be detected in single samples. Membrane extracts of an antiporter-deficient Escherichia coli strain expressing this construct were analyzed by blue native-sodium dodecyl sulfate-polyacrylamide gel electrophoresis. Mrp complexes of two sizes were identified containing all seven Mrp proteins. Studies of the single nonpolar mrp gene deletions in the construct showed that a subcomplex of MrpA, MrpB, MrpC, and MrpD was formed in the absence of MrpE, MrpF, or MrpG. By contrast, MrpE, MrpF, and MrpG were not observed in membranes lacking MrpA, MrpB, MrpC, or MrpD. Although MrpA and MrpD have been hypothesized to be the antiporter proteins, the MrpA-to-D complex was inactive. Every Mrp protein was required for an activity level near that of the wild-type Na(+)/H(+) antiporter, but a very low activity level was observed in the absence of MrpE. The introduction of an MrpE(P114G) mutation into the full Mrp complex led to antiport activity with a greatly increased apparent K(m) value for Na(+). The results suggested that interactions among the proteins of heterooligomeric Mrp complexes strongly impact antiporter properties.
Book, Adam J; Smalle, Jan; Lee, Kwang-Hee; Yang, Peizhen; Walker, Joseph M; Casper, Sarah; Holmes, James H; Russo, Laura A; Buzzinotti, Zachri W; Jenik, Pablo D; Vierstra, Richard D
2009-02-01
The 26S proteasome is an essential multicatalytic protease complex that degrades a wide range of intracellular proteins, especially those modified with ubiquitin. Arabidopsis thaliana and other plants use pairs of genes to encode most of the core subunits, with both of the isoforms often incorporated into the mature complex. Here, we show that the gene pair encoding the regulatory particle non-ATPase subunit (RPN5) has a unique role in proteasome function and Arabidopsis development. Homozygous rpn5a rpn5b mutants could not be generated due to a defect in male gametogenesis. While single rpn5b mutants appear wild-type, single rpn5a mutants display a host of morphogenic defects, including abnormal embryogenesis, partially deetiolated development in the dark, a severely dwarfed phenotype when grown in the light, and infertility. Proteasome complexes missing RPN5a are less stable in vitro, suggesting that some of the rpn5a defects are caused by altered complex integrity. The rpn5a phenotype could be rescued by expression of either RPN5a or RPN5b, indicating functional redundancy. However, abnormal phenotypes generated by overexpression implied that paralog-specific functions also exist. Collectively, the data point to a specific role for RPN5 in the plant 26S proteasome and suggest that its two paralogous genes in Arabidopsis have both redundant and unique roles in development.
Multi-functional regulation of 4E-BP gene expression by the Ccr4-Not complex.
Okada, Hirokazu; Schittenhelm, Ralf B; Straessle, Anna; Hafen, Ernst
2015-01-01
The mechanistic target of rapamycin (mTOR) signaling pathway is highly conserved from yeast to humans. It senses various environmental cues to regulate cellular growth and homeostasis. Deregulation of the pathway has been implicated in many pathological conditions including cancer. Phosphorylation cascades through the pathway have been extensively studied but not much is known about the regulation of gene expression of the pathway components. Here, we report that the mRNA level of eukaryotic translation initiation factor (eIF) subunit 4E-binding protein (4E-BP) gene, one of the key mTOR signaling components, is regulated by the highly conserved Ccr4-Not complex. RNAi knockdown of Not1, a putative scaffold protein of this protein complex, increases the mRNA level of 4E-BP in Drosophila Kc cells. Examination of the gene expression mechanism using reporter swap constructs reveals that Not1 depletion increases reporter mRNAs with the 3'UTR of 4E-BP gene, but decreases the ones with the 4E-BP promoter region, suggesting that Ccr4-Not complex regulates both degradation and transcription of 4E-BP mRNA. These results indicate that the Ccr4-Not complex controls expression of a single gene at multiple levels and adjusts the magnitude of the total effect. Thus, our study reveals a novel regulatory mechanism of a key component of the mTOR signaling pathway at the level of gene expression.
High-frequency promoter firing links THO complex function to heavy chromatin formation.
Mouaikel, John; Causse, Sébastien Z; Rougemaille, Mathieu; Daubenton-Carafa, Yves; Blugeon, Corinne; Lemoine, Sophie; Devaux, Frédéric; Darzacq, Xavier; Libri, Domenico
2013-11-27
The THO complex is involved in transcription, genome stability, and messenger ribonucleoprotein (mRNP) formation, but its precise molecular function remains enigmatic. Under heat shock conditions, THO mutants accumulate large protein-DNA complexes that alter the chromatin density of target genes (heavy chromatin), defining a specific biochemical facet of THO function and a powerful tool of analysis. Here, we show that heavy chromatin distribution is dictated by gene boundaries and that the gene promoter is necessary and sufficient to convey THO sensitivity in these conditions. Single-molecule fluorescence in situ hybridization measurements show that heavy chromatin formation correlates with an unusually high firing pace of the promoter with more than 20 transcription events per minute. Heavy chromatin formation closely follows the modulation of promoter firing and strongly correlates with polymerase occupancy genome wide. We propose that the THO complex is required for tuning the dynamic of gene-nuclear pore association and mRNP release to the same high pace of transcription initiation. Copyright © 2013 The Authors. Published by Elsevier Inc. All rights reserved.
Jalili, Mahdi; Gebhardt, Tom; Wolkenhauer, Olaf; Salehzadeh-Yazdi, Ali
2018-06-01
Decoding health and disease phenotypes is one of the fundamental objectives in biomedicine. Whereas high-throughput omics approaches are available, it is evident that any single omics approach might not be adequate to capture the complexity of phenotypes. Therefore, integrated multi-omics approaches have been used to unravel genotype-phenotype relationships such as global regulatory mechanisms and complex metabolic networks in different eukaryotic organisms. Some of the progress and challenges associated with integrated omics studies have been reviewed previously in comprehensive studies. In this work, we highlight and review the progress, challenges and advantages associated with emerging approaches, integrating gene expression and protein-protein interaction networks to unravel network-based functional features. This includes identifying disease-related genes, prioritizing genes, clustering protein interactions, developing modules, and extracting active subnetworks and static or dynamic/temporal protein complexes. We also discuss how these approaches contribute to our understanding of the biology of complex traits and diseases. This article is part of a Special Issue entitled: Cardiac adaptations to obesity, diabetes and insulin resistance, edited by Professors Jan F.C. Glatz, Jason R.B. Dyck and Christine Des Rosiers. Copyright © 2018 Elsevier B.V. All rights reserved.
Imaging dynamic and selective low-complexity domain interactions that control gene transcription.
Chong, Shasha; Dugast-Darzacq, Claire; Liu, Zhe; Dong, Peng; Dailey, Gina M; Cattoglio, Claudia; Heckert, Alec; Banala, Sambashiva; Lavis, Luke; Darzacq, Xavier; Tjian, Robert
2018-06-21
Many eukaryotic transcription factors (TFs) contain intrinsically disordered low-complexity domains (LCDs), but how they drive transactivation remains unclear. Here, live-cell single-molecule imaging reveals that TF-LCDs form local high-concentration interaction hubs at synthetic and endogenous genomic loci. TF-LCD hubs stabilize DNA binding, recruit RNA polymerase II (Pol II), and activate transcription. LCD-LCD interactions within hubs are highly dynamic, display selectivity with binding partners, and are differentially sensitive to disruption by hexanediols. Under physiological conditions, rapid and reversible LCD-LCD interactions occur between TFs and the Pol II machinery without detectable phase separation. Our findings reveal fundamental mechanisms underpinning transcriptional control and suggest a framework for developing single-molecule imaging screens for novel drugs targeting gene regulatory interactions implicated in disease. Copyright © 2018, American Association for the Advancement of Science.
de Groot, Reinoud; Lüthi, Joel; Lindsay, Helen; Holtackers, René; Pelkmans, Lucas
2018-01-23
High-content imaging using automated microscopy and computer vision allows multivariate profiling of single-cell phenotypes. Here, we present methods for the application of the CRISPR-Cas9 system in large-scale, image-based, gene perturbation experiments. We show that CRISPR-Cas9-mediated gene perturbation can be achieved in human tissue culture cells in a timeframe that is compatible with image-based phenotyping. We developed a pipeline to construct a large-scale arrayed library of 2,281 sequence-verified CRISPR-Cas9 targeting plasmids and profiled this library for genes affecting cellular morphology and the subcellular localization of components of the nuclear pore complex (NPC). We conceived a machine-learning method that harnesses genetic heterogeneity to score gene perturbations and identify phenotypically perturbed cells for in-depth characterization of gene perturbation effects. This approach enables genome-scale image-based multivariate gene perturbation profiling using CRISPR-Cas9. © 2018 The Authors. Published under the terms of the CC BY 4.0 license.
Polonikov, Alexey V.; Ivanov, Vladimir P.; Bogomazov, Alexey D.; Freidin, Maxim B.; Illig, Thomas; Solodilova, Maria A.
2014-01-01
Oxidative stress resulting from an increased amount of reactive oxygen species and an imbalance between oxidants and antioxidants plays an important role in the pathogenesis of asthma. The present study tested the hypothesis that genetic susceptibility to allergic and nonallergic variants of asthma is determined by complex interactions between genes encoding antioxidant defense enzymes (ADE). We carried out a comprehensive analysis of the associations between adult asthma and 46 single nucleotide polymorphisms of 34 ADE genes and 12 other candidate genes of asthma in a Russian population using set association analysis and multifactor dimensionality reduction approaches. We found for the first time epistatic interactions between ADE genes underlying asthma susceptibility and the genetic heterogeneity between allergic and nonallergic variants of the disease. We identified GSR (glutathione reductase) and PON2 (paraoxonase 2) as novel candidate genes for asthma susceptibility. We observed gender-specific effects of ADE genes on the risk of asthma. The results of the study demonstrate the complexity and diversity of interactions between genes involved in oxidative stress underlying susceptibility to allergic and nonallergic asthma. PMID:24895604
Singh, Ravi; Pantarotto, Davide; McCarthy, David; Chaloin, Olivier; Hoebeke, Johan; Partidos, Charalambos D; Briand, Jean-Paul; Prato, Maurizio; Bianco, Alberto; Kostarelos, Kostas
2005-03-30
Carbon nanotubes (CNTs) constitute a class of nanomaterials that possess characteristics suitable for a variety of possible applications. Their compatibility with aqueous environments has been made possible by the chemical functionalization of their surface, allowing for exploration of their interactions with biological components including mammalian cells. Functionalized CNTs (f-CNTs) are being intensively explored in advanced biotechnological applications ranging from molecular biosensors to cellular growth substrates. We have been exploring the potential of f-CNTs as delivery vehicles of biologically active molecules in view of possible biomedical applications, including vaccination and gene delivery. Recently we reported the capability of ammonium-functionalized single-walled CNTs to penetrate human and murine cells and facilitate the delivery of plasmid DNA leading to expression of marker genes. To optimize f-CNTs as gene delivery vehicles, it is essential to characterize their interactions with DNA. In the present report, we study the interactions of three types of f-CNTs, ammonium-functionalized single-walled and multiwalled carbon nanotubes (SWNT-NH3+; MWNT-NH3+), and lysine-functionalized single-walled carbon nanotubes (SWNT-Lys-NH3+), with plasmid DNA. Nanotube-DNA complexes were analyzed by scanning electron microscopy, surface plasmon resonance, PicoGreen dye exclusion, and agarose gel shift assay. The results indicate that all three types of cationic carbon nanotubes are able to condense DNA to varying degrees, indicating that both nanotube surface area and charge density are critical parameters that determine the interaction and electrostatic complex formation between f-CNTs and DNA. All three different f-CNT types in this study exhibited upregulation of marker gene expression over naked DNA using a mammalian (human) cell line.
Differences in the levels of gene expression were correlated with the structural and biophysical data obtained for the f-CNT:DNA complexes to suggest that large surface area leading to very efficient DNA condensation is not necessary for effective gene transfer. However, it will require further investigation to determine whether the degree of binding and tight association between DNA and nanotubes is a desirable trait to increase gene expression efficiency in vitro or in vivo. This study constitutes the first thorough investigation into the physicochemical interactions between cationic functionalized carbon nanotubes and DNA toward construction of carbon nanotube-based gene transfer vector systems.
Zhang, Qin; Bai, Bao-Ling; Liu, Xiao-Zhen; Miao, Chun-Yue; Li, Hui-Li
2014-08-01
To explore the association of polymorphisms in folate metabolism genes, methionine synthase reductase (MTRR) gene and 5,10-methylenetetrahydrofolate reductase (MTHFR) gene, with complex congenital abnormalities and to further investigate its association with complex congenital abnormalities derived from three germ layers. A total of 250 cases of birth defects (with complex congenital abnormalities including congenital heart disease, neural tube defects, and craniofacial anomalies) in Shanxi Province, China were included in the study. MTRR single nucleotide polymorphism (SNP) (rs1801394) and MTHFR SNP (rs1801133) were genotyped by the SNaPshot method, and the genotyping results were compared with those of controls (n=420). SNPs rs1801394 and rs1801133 were associated with multiple birth defects. For the recessive model, individuals with GG genotype at rs1801394 and CC genotype at rs1801133 had a relatively low risk of developing birth defects, so the two genotypes were protective factors against birth defects. The homozygous recessive genotype at rs1801133, which served as a protective factor, was associated with ectoderm- or endoderm-derived complex congenital abnormalities, while the homozygous recessive genotype at rs1801394, which served as a protective factor, was associated with ectoderm-, mesoderm- or endoderm-derived complex congenital abnormalities. Among the Chinese population in Shanxi Province, the SNPs in folate metabolism genes (MTRR and MTHFR) are associated with complex congenital abnormalities and related to ectoderm, mesoderm or endoderm development.
Chattaway, Marie Anne; Day, Michaela; Mtwale, Julia; White, Emma; Rogers, James; Day, Martin; Powell, David; Ahmad, Marwa; Harris, Ross; Talukder, Kaisar Ali; Wain, John; Jenkins, Claire; Cravioto, Alejandro
2017-10-01
This study investigates the virulence and antimicrobial resistance in association with common clonal complexes (CCs) of enteroaggregative Escherichia coli (EAEC) isolated from Bangladesh. The aim was to determine whether specific CCs were more likely to be associated with putative virulence genes and/or antimicrobial resistance. The presence of 15 virulence genes (by PCR) and susceptibility to 18 antibiotics were determined for 151 EAEC isolated from cases and controls during an intestinal infectious disease study carried out between 2007 and 2011 in the rural setting of Mirzapur, Bangladesh (Kotloff KL, Blackwelder WC, Nasrin D, Nataro JP, Farag TH et al. Clin Infect Dis 2012;55:S232-S245). These data were then analysed in the context of previously determined serotypes and clonal complexes defined by multi-locus sequence typing. Overall there was no association between the presence of virulence or antimicrobial resistance genes in isolates of EAEC from cases versus controls. However, when stratified by clonal complex (CC) one CC associated with cases harboured more virulence factors (CC40) and one CC harboured more resistance genes (CC38) than the average. There was no direct link between the virulence gene content and antibiotic resistance. Strains within a single CC had variable virulence and resistance gene content indicating independent and multiple gene acquisitions over time. In Bangladesh, there are multiple clonal complexes of EAEC harbouring a variety of virulence and resistance genes. The emergence of two of the most successful clones appeared to be linked to either increased virulence (CC40) or antimicrobial resistance (CC38), but increased resistance and virulence were not found in the same clonal complexes.
Network Medicine: A Network-based Approach to Human Disease
Barabási, Albert-László; Gulbahce, Natali; Loscalzo, Joseph
2011-01-01
Given the functional interdependencies between the molecular components in a human cell, a disease is rarely a consequence of an abnormality in a single gene, but reflects the perturbations of the complex intracellular network. The emerging tools of network medicine offer a platform to explore systematically not only the molecular complexity of a particular disease, leading to the identification of disease modules and pathways, but also the molecular relationships between apparently distinct (patho)phenotypes. Advances in this direction are essential to identify new disease genes, to uncover the biological significance of disease-associated mutations identified by genome-wide association studies and full genome sequencing, and to identify drug targets and biomarkers for complex diseases. PMID:21164525
Cheering for Team Science | Office of Cancer Genomics
As a graduate student, my PhD thesis focused on the function of a single human gene, within a genome of some 20,000 genes. Although this sometimes made my work seem insignificant, I was reminded of how important one small piece of a large puzzle can be when I discovered all the ways the gene knockout cells were disadvantaged. Studying the basic biology of our cells made me appreciate the beautiful complexity of human biology.
Marais, Thomas L Des; Kluz, Thomas; Xu, Dazhong; Zhang, Xiaoru; Gesumaria, Lisa; Matsui, Mary S; Costa, Max; Sun, Hong
2017-10-19
Ultraviolet radiation (UVR) from sunlight is the major effector for skin aging and carcinogenesis. However, genes and pathways altered by solar-simulated UVR (ssUVR), a mixture of UVA and UVB, are not well characterized. Here we report global changes in gene expression as well as associated pathways and upstream transcription factors in human keratinocytes exposed to ssUVR. Human HaCaT keratinocytes were exposed to either a single dose or 5 repetitive doses of ssUVR. Comprehensive analyses of gene expression profiles as well as functional annotation were performed at 24 hours post irradiation. Our results revealed that ssUVR modulated genes with diverse cellular functions changed in a dose-dependent manner. Gene expression in cells exposed to a single dose of ssUVR differed significantly from those that underwent repetitive exposures. While single ssUVR caused a significant inhibition in genes involved in cell cycle progression, especially G2/M checkpoint and mitotic regulation, repetitive ssUVR led to extensive changes in genes related to cell signaling and metabolism. We have also identified a panel of ssUVR target genes that exhibited persistent changes in gene expression even at 1 week after irradiation. These results revealed a complex network of transcriptional regulators and pathways that orchestrate the cellular response to ssUVR.
Whole-genome association studies of alcoholism with loci linked to schizophrenia susceptibility.
Namkung, Junghyun; Kim, Youngchul; Park, Taesung
2005-12-30
Alcoholism is a complex disease. There have been many reports of significant comorbidity between alcoholism and schizophrenia. For the genetic study of complex diseases, association analysis has been recommended because it has higher power than linkage analysis for detecting genes with modest effects on disease. To identify alcoholism susceptibility loci, we performed genome-wide single-nucleotide polymorphism (SNP) association tests, which yielded 489 significant SNPs at the 1% significance level. The association tests showed that tsc0593964 (P-value 0.000013) on chromosome 7 was most significantly associated with alcoholism. From the 489 SNPs, 74 genes were identified. Among these genes, GABRA1 belongs to the same gene family as GABRA2, which was recently reported as an alcoholism susceptibility gene. By comparing the 74 genes to the published results of various linkage studies of schizophrenia, we identified 13 alcoholism-associated genes located in regions reported to be linked to schizophrenia. These 13 genes can be important candidates for studying the genetic mechanism underlying the co-occurrence of both diseases.
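To illustrate what a single-SNP association test at the 1% level can look like, here is a minimal sketch of a Pearson chi-square test on a 2x2 table of allele counts. The abstract does not state which test statistic was used, so the allelic test, the function name `allelic_chi_square`, and the counts below are all assumptions made purely for illustration.

```python
import math

def allelic_chi_square(case_alt, case_ref, ctrl_alt, ctrl_ref):
    """Pearson chi-square test (1 degree of freedom) on a 2x2 allele-count table.

    Assumes every row and column total is non-zero.
    """
    n = case_alt + case_ref + ctrl_alt + ctrl_ref
    numerator = n * (case_alt * ctrl_ref - case_ref * ctrl_alt) ** 2
    denominator = ((case_alt + case_ref) * (ctrl_alt + ctrl_ref)
                   * (case_alt + ctrl_alt) * (case_ref + ctrl_ref))
    chi2 = numerator / denominator
    # For 1 degree of freedom the upper-tail probability is erfc(sqrt(chi2 / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value

# Hypothetical counts: alternate/reference alleles in cases, then in controls.
chi2, p_value = allelic_chi_square(30, 70, 10, 90)
significant = p_value < 0.01  # the 1% level mentioned in the abstract
```

In a genome-wide setting the same function would be applied to each SNP in turn; how the resulting P-values are corrected for multiple testing is a separate design choice that this sketch does not address.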
Eystathioy, Theophany; Chan, Edward K. L.; Tenenbaum, Scott A.; Keene, Jack D.; Griffith, Kevin; Fritzler, Marvin J.
2002-01-01
A novel human cellular structure has been identified that contains a unique autoimmune antigen and multiple messenger RNAs. This complex was discovered using an autoimmune serum from a patient with motor and sensory neuropathy and contains a protein of 182 kDa. The gene and cDNA encoding the protein indicated an open reading frame with glycine-tryptophan (GW) repeats and a single RNA recognition motif. Both the patient's serum and a rabbit serum raised against the recombinant GW protein costained discrete cytoplasmic speckles designated as GW bodies (GWBs) that do not overlap with the Golgi complex, endosomes, lysosomes, or peroxisomes. The mRNAs associated with GW182 represent a clustered set of transcripts that are presumed to reside within the GW complexes. We propose that the GW ribonucleoprotein complex is involved in the posttranscriptional regulation of gene expression by sequestering a specific subset of gene transcripts involved in cell growth and homeostasis. PMID:11950943
Pena, S D; Barreto, G; Vago, A R; De Marco, L; Reinach, F C; Dias Neto, E; Simpson, A J
1994-01-01
Low-stringency single specific primer PCR (LSSP-PCR) is an extremely simple PCR-based technique that detects single or multiple mutations in gene-sized DNA fragments. A purified DNA fragment is subjected to PCR using high concentrations of a single specific oligonucleotide primer, large amounts of Taq polymerase, and a very low annealing temperature. Under these conditions the primer hybridizes specifically to its complementary region and nonspecifically to multiple sites within the fragment, in a sequence-dependent manner, producing a heterogeneous set of reaction products resolvable by electrophoresis. The complex banding pattern obtained is significantly altered by even a single-base change and thus constitutes a unique "gene signature." Therefore LSSP-PCR will have almost unlimited application in all fields of genetics and molecular medicine where rapid and sensitive detection of mutations and sequence variations is important. The usefulness of LSSP-PCR is illustrated by applications in the study of mutants of smooth muscle myosin light chain, analysis of a family with X-linked nephrogenic diabetes insipidus, and identity testing using human mitochondrial DNA. PMID:8127912
Producing a functional eukaryotic messenger RNA (mRNA) requires the coordinated activity of several large protein complexes to initiate transcription, elongate nascent transcripts, splice together exons, and cleave and polyadenylate the 3’ end. Kinetic competition between these various processes has been proposed to regulate mRNA maturation, but this model could lead to multiple, randomly determined, or stochastic, pathways or outcomes. Regulatory checkpoints have been suggested as a means of ensuring quality control. However, current methods have been unable to tease apart the contributions of these processes at a single gene or on a time scale that could provide mechanistic insight. To begin to investigate the kinetic relationship between transcription and splicing, Daniel Larson, Ph.D., of CCR’s Laboratory of Receptor Biology and Gene Expression, and his colleagues employed a single-molecule RNA imaging approach to monitor production and processing of a human β-globin reporter gene in living cells.
A Multiomics Approach to Identify Genes Associated with Childhood Asthma Risk and Morbidity.
Forno, Erick; Wang, Ting; Yan, Qi; Brehm, John; Acosta-Perez, Edna; Colon-Semidey, Angel; Alvarez, Maria; Boutaoui, Nadia; Cloutier, Michelle M; Alcorn, John F; Canino, Glorisa; Chen, Wei; Celedón, Juan C
2017-10-01
Childhood asthma is a complex disease. In this study, we aim to identify genes associated with childhood asthma through a multiomics "vertical" approach that integrates multiple analytical steps using linear and logistic regression models. In a case-control study of childhood asthma in Puerto Ricans (n = 1,127), we used adjusted linear or logistic regression models to evaluate associations between several analytical steps of omics data, including genome-wide (GW) genotype data, GW methylation, GW expression profiling, cytokine levels, asthma-intermediate phenotypes, and asthma status. At each point, only the top genes/single-nucleotide polymorphisms/probes/cytokines were carried forward for subsequent analysis. In step 1, asthma modified the gene expression-protein level association for 1,645 genes; pathway analysis showed an enrichment of these genes in the cytokine signaling system (n = 269 genes). In steps 2-3, expression levels of 40 genes were associated with intermediate phenotypes (asthma onset age, forced expiratory volume in 1 second, exacerbations, eosinophil counts, and skin test reactivity); of those, methylation of seven genes was also associated with asthma. Of these seven candidate genes, IL5RA was also significant in analytical steps 4-8. We then measured plasma IL-5 receptor α levels, which were associated with asthma age of onset and moderate-severe exacerbations. In addition, in silico database analysis showed that several of our identified IL5RA single-nucleotide polymorphisms are associated with transcription factors related to asthma and atopy. This approach integrates several analytical steps and is able to identify biologically relevant asthma-related genes, such as IL5RA. It differs from other methods that rely on complex statistical models with various assumptions.
Functional Conservation of MIKC*-Type MADS Box Genes in Arabidopsis and Rice Pollen Maturation
Liu, Yuan; Cui, Shaojie; Wu, Feng; Yan, Shuo; Lin, Xuelei; Du, Xiaoqiu; Chong, Kang; Schilling, Susanne; Theißen, Günter; Meng, Zheng
2013-01-01
There are two groups of MADS intervening keratin-like and C-terminal (MIKC)-type MADS box genes, MIKC^C type and MIKC* type. In seed plants, the MIKC^C type shows considerable diversity, but the MIKC* type has only two subgroups, P- and S-clade, which show conserved expression in the gametophyte. To examine the functional conservation of MIKC*-type genes, we characterized all three rice (Oryza sativa) MIKC*-type genes. All three genes are specifically expressed late in pollen development. The single knockdown or knockout lines, respectively, of the S-clade MADS62 and MADS63 did not show a mutant phenotype, but lines in which both S-clade genes were affected showed severe defects in pollen maturation and germination, as did knockdown lines of MADS68, the only P-clade gene in rice. The rice MIKC*-type proteins form strong heterodimeric complexes solely with partners from the other subclade; these complexes specifically bind to N10-type C-A-rich-G-boxes in vitro and regulate downstream gene expression by binding to N10-type promoter motifs. The rice MIKC* genes have a much lower degree of functional redundancy than the Arabidopsis thaliana MIKC* genes. Nevertheless, our data indicate that the function of heterodimeric MIKC*-type protein complexes in pollen development has been conserved since the divergence of monocots and eudicots, roughly 150 million years ago. PMID:23613199
Life-cycle and growth-phase-dependent regulation of the ubiquitin genes of Trypanosoma cruzi.
Manning-Cela, Rebeca; Jaishankar, Sobha; Swindle, John
2006-07-01
Trypanosoma cruzi, the causative agent of Chagas disease, exhibits a complex life cycle that is accompanied by the stage-specific gene expression. At the molecular level, very little is known about gene regulation in trypanosomes. Complex gene organizations coupled with polycistronic transcription units make the analysis of regulated gene expression difficult in trypanosomes. The ubiquitin genes of T. cruzi are a good example of this complexity. They are organized as a single cluster containing five ubiquitin fusion (FUS) and five polyubiquitin (PUB) genes that are polycistronically transcribed but expressed differently in response to developmental and environmental changes. Gene replacements were used to study FUS and PUB gene expression at different stages of growth and at different points in the life cycle of T. cruzi. Based on the levels of reporter gene expression, it was determined that FUS1 expression was downregulated as the parasites approached stationary phase, whereas PUB12.5 polyubiquitin gene expression increased. Conversely, FUS1 expression increases when epimastigotes and amastigotes differentiate into trypomastigotes, whereas the expression of PUB12.5 decreases when epimastigotes differentiate into amastigotes and trypomastigotes. Although the level of CAT activity in logarithmic growing epimastigotes is six- to seven-fold higher when the gene was expressed from the FUS1 locus than when expressed from the PUB12.5 locus, the rate of transcription from the two loci was the same implying that post-transcriptional mechanisms play a dominant role in the regulation of gene expression.
Single-molecule RNA observation in vivo reveals dynamics of co-transcriptional splicing
NASA Astrophysics Data System (ADS)
Ferguson, M. L.; Coulon, A.; de Turris, V.; Palangat, M.; Chow, C. C.; Singer, R. H.; Larson, D. R.
2013-03-01
The synthesis of pre-mRNA and the splicing of that pre-mRNA to form completed transcripts requires coordination between two large multi-subunit complexes (the transcription elongation complex and the spliceosome). How this coordination occurs in vivo is unknown. Here we report the first experimental observation of transcription and splicing occurring at the same gene in living cells. By utilizing the PP7/MS2 fluorescent RNA reporter system, we can directly observe two distinct regions of the nascent RNA, allowing us to measure the rise and fall time of the intron and exon of a reporter gene stably integrated into a human cell line. The reporter gene consists of a beta globin gene where we have inserted a 24 RNA hairpin cassette into the intron/exon. Upon synthesis, the RNA hairpins are tightly bound by fluorescently-labeled PP7/MS2 bacteriophage coat proteins. After gene induction, a single locus of active transcription in the nucleus shows fluorescence intensity changes characteristic of the synthesis and excision of the intron/exon. Using fluctuation analysis, we determine the elongation rate to be 1.5 kb/min. From the temporal cross correlation function, we determine that splicing of this gene must be co-transcriptional with a splicing time of ~100 seconds before termination and a ~200 second pause at termination. We propose that dual-color RNA imaging may be extended to investigate other mechanisms of transcription, gene regulation, and RNA processing.
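The temporal cross-correlation mentioned above can be illustrated with a small sketch: given two fluorescence time traces, the lag at which their normalized cross-correlation peaks estimates the delay between the two signals. This is only a toy version under stated assumptions, not the authors' fluctuation analysis; the trace names, the synthetic values, and the 3-frame delay are hypothetical, and converting a lag in frames to seconds would require the imaging frame interval, which the abstract does not give.

```python
def cross_correlation(a, b, max_lag):
    """Return {lag: normalized covariance between a[t] and b[t + lag]}.

    Both traces must have the same length and non-zero variance.
    """
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    da = [x - mean_a for x in a]
    db = [x - mean_b for x in b]
    norm = (sum(x * x for x in da) * sum(x * x for x in db)) ** 0.5
    result = {}
    for lag in range(-max_lag, max_lag + 1):
        total = 0.0
        for t in range(n):
            u = t + lag
            if 0 <= u < n:  # only use frames where both traces are defined
                total += da[t] * db[u]
        result[lag] = total / norm
    return result

# Hypothetical traces: the second channel repeats the first, 3 frames later.
first_channel = [0, 0, 1, 3, 1, 0, 0, 0, 0, 0, 0, 0]
second_channel = [0, 0, 0, 0, 0, 1, 3, 1, 0, 0, 0, 0]

cc = cross_correlation(first_channel, second_channel, max_lag=5)
peak_lag = max(cc, key=cc.get)  # lag (in frames) with the strongest correlation
```

A positive `peak_lag` means the second trace follows the first; real single-locus traces are noisy, so in practice the correlation would typically be averaged over many transcription sites before reading off a delay.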
Jiang, Yiwei
2013-01-01
Drought is a major environmental stress limiting growth of perennial grasses in temperate regions. Plant drought tolerance is a complex trait that is controlled by multiple genes. Candidate gene association mapping provides a powerful tool for dissection of complex traits. Candidate gene association mapping of drought tolerance traits was conducted in 192 diverse perennial ryegrass (Lolium perenne L.) accessions from 43 countries. The panel showed significant variations in leaf wilting, leaf water content, canopy and air temperature difference, and chlorophyll fluorescence under well-watered and drought conditions across six environments. Analysis of 109 simple sequence repeat markers revealed five population structures in the mapping panel. A total of 2520 expression-based sequence readings were obtained for a set of candidate genes involved in antioxidant metabolism, dehydration, water movement across membranes, and signal transduction, from which 346 single nucleotide polymorphisms were identified. Significant associations were identified between a putative LpLEA3 encoding late embryogenesis abundant group 3 protein and a putative LpFeSOD encoding iron superoxide dismutase and leaf water content, as well as between a putative LpCyt Cu-ZnSOD encoding cytosolic copper-zinc superoxide dismutase and chlorophyll fluorescence under drought conditions. Four of these identified significantly associated single nucleotide polymorphisms from these three genes were also translated to amino acid substitutions in different genotypes. These results indicate that allelic variation in these genes may affect whole-plant response to drought stress in perennial ryegrass. PMID:23386684
Kelemen, Arpad; Vasilakos, Athanasios V; Liang, Yulan
2009-09-01
Comprehensive evaluation of common genetic variations through association of single-nucleotide polymorphism (SNP) structure with common complex disease in the genome-wide scale is currently a hot area in human genome research due to the recent development of the Human Genome Project and HapMap Project. Computational science, which includes computational intelligence (CI), has recently become the third method of scientific enquiry besides theory and experimentation. There have been fast growing interests in developing and applying CI in disease mapping using SNP and haplotype data. Some of the recent studies have demonstrated the promise and importance of CI for common complex diseases in genomic association study using SNP/haplotype data, especially for tackling challenges, such as gene-gene and gene-environment interactions, and the notorious "curse of dimensionality" problem. This review provides coverage of recent developments of CI approaches for complex diseases in genetic association study with SNP/haplotype data.
Genomic signatures of evolutionary transitions from solitary to group living
Kapheim, Karen M.; Pan, Hailin; Li, Cai; Salzberg, Steven L.; Puiu, Daniela; Magoc, Tanja; Robertson, Hugh M.; Hudson, Matthew E.; Venkat, Aarti; Fischman, Brielle J.; Hernandez, Alvaro; Yandell, Mark; Ence, Daniel; Holt, Carson; Yocum, George D.; Kemp, William P.; Bosch, Jordi; Waterhouse, Robert M.; Zdobnov, Evgeny M.; Stolle, Eckart; Kraus, F. Bernhard; Helbing, Sophie; Moritz, Robin F. A.; Glastad, Karl M.; Hunt, Brendan G.; Goodisman, Michael A. D.; Hauser, Frank; Grimmelikhuijzen, Cornelis J. P.; Pinheiro, Daniel Guariz; Nunes, Francis Morais Franco; Soares, Michelle Prioli Miranda; Tanaka, Érica Donato; Simões, Zilá Luz Paulino; Hartfelder, Klaus; Evans, Jay D.; Barribeau, Seth M.; Johnson, Reed M.; Massey, Jonathan H.; Southey, Bruce R.; Hasselmann, Martin; Hamacher, Daniel; Biewer, Matthias; Kent, Clement F.; Zayed, Amro; Blatti, Charles; Sinha, Saurabh; Johnston, J. Spencer; Hanrahan, Shawn J.; Kocher, Sarah D.; Wang, Jun; Robinson, Gene E.; Zhang, Guojie
2017-01-01
The evolution of eusociality is one of the major transitions in evolution, but the underlying genomic changes are unknown. We compared the genomes of 10 bee species that vary in social complexity, representing multiple independent transitions in social evolution, and report three major findings. First, many important genes show evidence of neutral evolution as a consequence of relaxed selection with increasing social complexity. Second, there is no single road map to eusociality; independent evolutionary transitions in sociality have independent genetic underpinnings. Third, though clearly independent in detail, these transitions do have similar general features, including an increase in constrained protein evolution accompanied by increases in the potential for gene regulation and decreases in diversity and abundance of transposable elements. Eusociality may arise through different mechanisms each time, but would likely always involve an increase in the complexity of gene networks. PMID:25977371
Social evolution. Genomic signatures of evolutionary transitions from solitary to group living.
Kapheim, Karen M; Pan, Hailin; Li, Cai; Salzberg, Steven L; Puiu, Daniela; Magoc, Tanja; Robertson, Hugh M; Hudson, Matthew E; Venkat, Aarti; Fischman, Brielle J; Hernandez, Alvaro; Yandell, Mark; Ence, Daniel; Holt, Carson; Yocum, George D; Kemp, William P; Bosch, Jordi; Waterhouse, Robert M; Zdobnov, Evgeny M; Stolle, Eckart; Kraus, F Bernhard; Helbing, Sophie; Moritz, Robin F A; Glastad, Karl M; Hunt, Brendan G; Goodisman, Michael A D; Hauser, Frank; Grimmelikhuijzen, Cornelis J P; Pinheiro, Daniel Guariz; Nunes, Francis Morais Franco; Soares, Michelle Prioli Miranda; Tanaka, Érica Donato; Simões, Zilá Luz Paulino; Hartfelder, Klaus; Evans, Jay D; Barribeau, Seth M; Johnson, Reed M; Massey, Jonathan H; Southey, Bruce R; Hasselmann, Martin; Hamacher, Daniel; Biewer, Matthias; Kent, Clement F; Zayed, Amro; Blatti, Charles; Sinha, Saurabh; Johnston, J Spencer; Hanrahan, Shawn J; Kocher, Sarah D; Wang, Jun; Robinson, Gene E; Zhang, Guojie
2015-06-05
The evolution of eusociality is one of the major transitions in evolution, but the underlying genomic changes are unknown. We compared the genomes of 10 bee species that vary in social complexity, representing multiple independent transitions in social evolution, and report three major findings. First, many important genes show evidence of neutral evolution as a consequence of relaxed selection with increasing social complexity. Second, there is no single road map to eusociality; independent evolutionary transitions in sociality have independent genetic underpinnings. Third, though clearly independent in detail, these transitions do have similar general features, including an increase in constrained protein evolution accompanied by increases in the potential for gene regulation and decreases in diversity and abundance of transposable elements. Eusociality may arise through different mechanisms each time, but would likely always involve an increase in the complexity of gene networks. Copyright © 2015, American Association for the Advancement of Science.
Gupta, Gagan D.; Howes, Mark T.; Chandran, Ruma; Das, Anupam; Menon, Sindhu; Parton, Robert G.; Sowdhamini, R.; Thattai, Mukund; Mayor, Satyajit
2014-01-01
Single-cell-resolved measurements reveal heterogeneous distributions of clathrin-dependent (CD) and -independent (CLIC/GEEC: CG) endocytic activity in Drosophila cell populations. dsRNA-mediated knockdown of core versus peripheral endocytic machinery induces strong changes in the mean, or subtle changes in the shapes of these distributions, respectively. By quantifying these subtle shape changes for 27 single-cell features which report on endocytic activity and cell morphology, we organize 1072 Drosophila genes into a tree-like hierarchy. We find that tree nodes contain gene sets enriched in functional classes and protein complexes, providing a portrait of core and peripheral control of CD and CG endocytosis. For 470 genes we obtain additional features from separate assays and classify them into early- or late-acting genes of the endocytic pathways. Detailed analyses of specific genes at intermediate levels of the tree suggest that Vacuolar ATPase and lysosomal genes involved in vacuolar biogenesis play an evolutionarily conserved role in CG endocytosis. PMID:24971745
Lee, Seungyeoun; Kim, Yongkang; Kwon, Min-Seok; Park, Taesung
2015-01-01
Genome-wide association studies (GWAS) have extensively analyzed single SNP effects on a wide variety of common and complex diseases and found many genetic variants associated with diseases. However, a large portion of the heritability remains unexplained. This missing heritability problem might be due to an analytical strategy that limits analyses to single SNPs. One possible approach to the missing heritability problem is to identify multi-SNP effects or gene-gene interactions. The multifactor dimensionality reduction (MDR) method has been widely used to detect gene-gene interactions. It is based on constructive induction: high-dimensional genotype combinations are classified into a one-dimensional variable with two attributes, high risk and low risk, in case-control studies. Many modifications of MDR have been proposed, and the method has also been extended to the survival phenotype. In this study, we propose several extensions of MDR for the survival phenotype and compare the proposed extensions with earlier MDR through comprehensive simulation studies. PMID:26339630
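The core MDR step for a case-control phenotype, collapsing multi-locus genotype combinations into a single high-risk/low-risk variable, can be sketched as follows. This is a minimal illustration of the classical case-control idea only, not the survival extensions proposed in the study; the function names, the default threshold (the overall case:control ratio), and the toy data are assumptions.

```python
from collections import Counter

def mdr_labels(genotypes, status, threshold=None):
    """Label each multi-locus genotype combination 'high' or 'low' risk.

    genotypes: list of tuples, one tuple of genotype codes per subject
    status:    list of 1 (case) / 0 (control), same length as genotypes
    threshold: case:control ratio cut-off; defaults to the overall ratio
    """
    cases = Counter(g for g, s in zip(genotypes, status) if s == 1)
    controls = Counter(g for g, s in zip(genotypes, status) if s == 0)
    if threshold is None:
        threshold = sum(cases.values()) / sum(controls.values())
    labels = {}
    for combo in set(cases) | set(controls):
        n_case, n_ctrl = cases[combo], controls[combo]
        # A cell that contains cases but no controls is treated as high risk.
        ratio = n_case / n_ctrl if n_ctrl else float("inf")
        labels[combo] = "high" if ratio > threshold else "low"
    return labels

def balanced_accuracy(genotypes, status, labels):
    """Mean of sensitivity and specificity of the high/low-risk variable."""
    tp = fn = tn = fp = 0
    for g, s in zip(genotypes, status):
        predicted_case = labels[g] == "high"
        if s == 1:
            tp += predicted_case
            fn += not predicted_case
        else:
            fp += predicted_case
            tn += not predicted_case
    return 0.5 * (tp / (tp + fn) + tn / (tn + fp))

# Hypothetical two-SNP data set (genotype codes 0/1), 5 cases and 5 controls.
genotypes = [(0, 0), (0, 0), (0, 0), (0, 1), (0, 1),
             (0, 1), (1, 0), (1, 0), (1, 1), (1, 1)]
status = [0, 0, 1, 1, 1, 0, 1, 1, 0, 0]

labels = mdr_labels(genotypes, status)
accuracy = balanced_accuracy(genotypes, status, labels)
```

In a full MDR analysis this labelling would be repeated for every candidate SNP combination inside a cross-validation loop, and the combination with the best held-out accuracy would be reported; the sketch evaluates a single combination on its own training data only.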
Single cell gene expression profiling of cortical osteoblast lineage cells.
Flynn, James M; Spusta, Steven C; Rosen, Clifford J; Melov, Simon
2013-03-01
In tissues with complex architectures such as bone, it is often difficult to purify and characterize specific cell types via molecular profiling. Single cell gene expression profiling is an emerging technology useful for characterizing transcriptional profiles of individual cells isolated from heterogeneous populations. In this study we describe a novel procedure for the isolation and characterization of gene expression profiles of single osteoblast lineage cells derived from cortical bone. Mixed populations of different cell types were isolated from adult long bones of C57BL/6J mice by enzymatic digestion, and subsequently subjected to FACS to purify and characterize osteoblast lineage cells via a selection strategy using antibodies against CD31, CD45, and alkaline phosphatase (AP), specific for mature osteoblasts. The purified individual osteoblast lineage cells were then profiled at the single cell level via nanofluidic PCR. This method permits robust gene expression profiling on single osteoblast lineage cells derived from mature bone, potentially from anatomically distinct sites. In conjunction with this technique, we have also shown that it is possible to carry out single cell profiling on cells purified from fixed and frozen bone samples without compromising the gene expression signal. The latter finding means the technique can be extended to biopsies of bone from diseased individuals. Our approach for single cell expression profiling provides a new dimension to the transcriptional profile of the primary osteoblast lineage population in vivo, and has the capacity to greatly expand our understanding of how these cells may function in vivo under normal and diseased states. Copyright © 2012 Elsevier Inc. All rights reserved.
Miró-Bueno, Jesús M.; Rodríguez-Patón, Alfonso
2011-01-01
Negative and positive transcriptional feedback loops are present in natural and synthetic genetic oscillators. A single gene with negative transcriptional feedback needs a time delay and sufficiently strong nonlinearity in the transmission of the feedback signal in order to produce biochemical rhythms. A single gene with only positive transcriptional feedback does not produce oscillations. Here, we demonstrate that this single-gene network in conjunction with a simple negative interaction can also easily produce rhythms. We examine a model comprised of two well-differentiated parts. The first is a positive feedback created by a protein that binds to the promoter of its own gene and activates the transcription. The second is a negative interaction in which a repressor molecule prevents this protein from binding to its promoter. A stochastic study shows that the system is robust to noise. A deterministic study identifies that the dynamics of the oscillator are mainly driven by two types of biomolecules: the protein, and the complex formed by the repressor and this protein. The main conclusion of this paper is that a simple and usual negative interaction, such as degradation, sequestration or inhibition, acting on the positive transcriptional feedback of a single gene is a sufficient condition to produce reliable oscillations. One gene is enough and the positive transcriptional feedback signal does not need to activate a second repressor gene. This means that at the genetic level an explicit negative feedback loop is not necessary. The model needs neither cooperative binding reactions nor the formation of protein multimers. Therefore, our findings could help to clarify the design principles of cellular clocks and constitute a new efficient tool for engineering synthetic genetic oscillators. PMID:22205920
Ishikawa, Akira
2017-11-27
Large numbers of quantitative trait loci (QTL) affecting complex diseases and other quantitative traits have been reported in humans and model animals. However, the genetic architecture of these traits remains elusive due to the difficulty in identifying causal quantitative trait genes (QTGs) for common QTL with relatively small phenotypic effects. A traditional strategy based on techniques such as positional cloning does not always enable identification of a single candidate gene for a QTL of interest because it is difficult to narrow down a target genomic interval of the QTL to a very small interval harboring only one gene. A combination of gene expression analysis and statistical causal analysis can greatly reduce the number of candidate genes. This integrated approach provides causal evidence that one of the candidate genes is a putative QTG for the QTL. Using this approach, I have recently succeeded in identifying a single putative QTG for resistance to obesity in mice. Here, I outline the integration approach and discuss its usefulness using my studies as an example.
Synergistic effect of amino acids modified on dendrimer surface in gene delivery.
Wang, Fei; Wang, Yitong; Wang, Hui; Shao, Naimin; Chen, Yuanyuan; Cheng, Yiyun
2014-11-01
Design of an efficient gene vector based on dendrimer remains a great challenge due to the presence of multiple barriers in gene delivery. Single-functionalization on dendrimer cannot overcome all the barriers. In this study, we synthesized a list of single-, dual- and triple-functionalized dendrimers with arginine, phenylalanine and histidine for gene delivery using a one-pot approach. The three amino acids play different roles in gene delivery: arginine is essential in formation of stable complexes, phenylalanine improves cellular uptake efficacy, and histidine increases pH-buffering capacity and minimizes cytotoxicity of the cationic dendrimer. A combination of these amino acids on dendrimer generates a synergistic effect in gene delivery. The dual- and triple-functionalized dendrimers show minimal cytotoxicity on the transfected NIH 3T3 cells. Using this combination strategy, we can obtain triple-functionalized dendrimers with comparable transfection efficacy to several commercial transfection reagents. Such a combination strategy should be applicable to the design of efficient and biocompatible gene vectors for gene delivery. Copyright © 2014 Elsevier Ltd. All rights reserved.
Tsai, Yu-Shuen; Aguan, Kripamoy; Pal, Nikhil R.; Chung, I-Fang
2011-01-01
Informative genes from microarray data can be used to construct prediction models and investigate biological mechanisms. Differentially expressed genes, the main targets of most gene selection methods, can be classified as single- and multiple-class specific signature genes. Here, we present a novel gene selection algorithm based on a Group Marker Index (GMI), which is intuitive, of low computational complexity, and efficient in identifying both types of genes. Most gene selection methods identify only single-class specific signature genes and cannot easily identify multiple-class specific signature genes. Our algorithm can detect de novo certain conditions of multiple-class specificity of a gene and makes use of a novel non-parametric indicator to assess the discrimination ability between classes. Our method is effective even when the sample size is small as well as when the class sizes are significantly different. To compare effectiveness and robustness, we formulate an intuitive template-based method and use four well-known datasets. We demonstrate that our algorithm outperforms the template-based method in difficult cases with unbalanced distribution. Moreover, the multiple-class specific genes are good biomarkers and play important roles in biological pathways. Our literature survey supports that the proposed method identifies unique multiple-class specific marker genes (not reported earlier to be related to cancer) in the Central Nervous System data. It also discovers unique biomarkers indicating the intrinsic difference between subtypes of lung cancer. We also associate the pathway information with the multiple-class specific signature genes and cross-reference to published studies. We find that the identified genes participate in the pathways directly involved in cancer development in leukemia data. Our method offers a promising way to find genes that may be involved in pathways of multiple diseases, and hence opens up the possibility of using an existing drug on other diseases as well as designing a single drug for multiple diseases. PMID:21909426
Cooperative Adaptive Responses in Gene Regulatory Networks with Many Degrees of Freedom
Inoue, Masayo; Kaneko, Kunihiko
2013-01-01
Cells generally adapt to environmental changes by first exhibiting an immediate response and then gradually returning to their original state to achieve homeostasis. Although simple network motifs consisting of a few genes have been shown to exhibit such adaptive dynamics, they do not reflect the complexity of real cells, where the expression of a large number of genes activates or represses other genes, permitting adaptive behaviors. Here, we investigated the responses of gene regulatory networks containing many genes that have undergone numerical evolution to achieve high fitness due to the adaptive response of only a single target gene; this single target gene responds to changes in external inputs and later returns to basal levels. Despite setting a single target, most genes showed adaptive responses after evolution. Such adaptive dynamics were not due to common motifs within a few genes; even without such motifs, almost all genes showed adaptation, albeit sometimes partial adaptation, in the sense that expression levels did not always return to original levels. The genes split into two groups: genes in the first group exhibited an initial increase in expression and then returned to basal levels, while genes in the second group exhibited the opposite changes in expression. In this model, genes in the first group received positive input from other genes within the first group, but negative input from genes in the second group, and vice versa. Thus, the adaptation dynamics of genes from both groups were consolidated. This cooperative adaptive behavior was commonly observed if the number of genes involved was larger than the order of ten. These results have implications for the collective responses of gene expression networks observed in microarray measurements of the yeast Saccharomyces cerevisiae, and for the biological homeostasis of systems with many components. PMID:23592959
Single-Cell RT-PCR in Microfluidic Droplets with Integrated Chemical Lysis.
Kim, Samuel C; Clark, Iain C; Shahi, Payam; Abate, Adam R
2018-01-16
Droplet microfluidics can identify and sort cells using digital reverse transcription polymerase chain reaction (RT-PCR) signals from individual cells. However, current methods require multiple microfabricated devices for enzymatic cell lysis and PCR reagent addition, making the process complex and prone to failure. Here, we describe a new approach that integrates all components into a single device. The method enables controlled exposure of isolated single cells to a high pH buffer, which lyses cells and inactivates reaction inhibitors but can be instantly neutralized with RT-PCR buffer. Using our chemical lysis approach, we distinguish individual cells' gene expression with data quality equivalent to more complex two-step workflows. Our system accepts cells and produces droplets ready for amplification, making single-cell droplet RT-PCR faster and more reliable.
Noninvasive prenatal diagnosis for single gene disorders.
Allen, Stephanie; Young, Elizabeth; Bowns, Benjamin
2017-04-01
Noninvasive prenatal diagnosis for single gene disorders is coming to fruition in its clinical utility. The presence of cell-free DNA in maternal plasma has been recognized for many years, and a number of applications have developed from this. Noninvasive prenatal diagnosis for single gene disorders has lagged behind due to complexities of technology development, lack of investment and the need for validation samples for rare disorders. Publications are emerging demonstrating a variety of technical approaches and feasibility of clinical application. Techniques for analysis of cell-free DNA including digital PCR, next-generation sequencing and relative haplotype dosage have been used most often for assay development. Analysis of circulating fetal cells in the maternal blood is still being investigated as a viable alternative, as, more recently, is analysis of transcervical trophoblast cells. Studies exploring ethical and social issues are generally positive but raise concerns around the routinization of prenatal testing. Further work is necessary to make testing available to all patients with a pregnancy at risk of a single gene disorder, and it remains to be seen if the development of more powerful technologies such as isolation and analysis of single cells will shift the emphasis of noninvasive prenatal diagnosis. As testing becomes possible for a wider range of conditions, more ethical questions will become relevant.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Noethen, M.M.; Eggermann, K.; Propping, P.
1995-10-01
It is well accepted that association studies are a major tool in investigating the contribution of single genes to the development of diseases that do not follow a simple Mendelian inheritance pattern (so-called complex traits). Major psychiatric diseases such as bipolar affective disorder and schizophrenia clearly fall into this category of diseases. 7 refs., 1 tab.
Genetic complexity underlying hybrid male sterility in Drosophila.
Sawamura, Kyoichi; Roote, John; Wu, Chung-I; Yamamoto, Masa-Toshi
2004-01-01
Recent genetic analyses of closely related species of Drosophila have indicated that hybrid male sterility is the consequence of highly complex synergistic effects among multiple genes, both conspecific and heterospecific. On the contrary, much evidence suggests the presence of major genes causing hybrid female sterility and inviability in the less-related species, D. melanogaster and D. simulans. Does this contrast reflect the genetic distance between species? Or, generally, is the genetic basis of hybrid male sterility more complex than that of hybrid female sterility and inviability? To clarify this point, the D. simulans introgression of the cytological region 34D-36A to the D. melanogaster genome, which causes recessive male sterility, was dissected by recombination, deficiency, and complementation mapping. The 450-kb region between two genes, Suppressor of Hairless and snail, exhibited a strong effect on the sterility. Males are (semi-)sterile if this region of the introgression is made homozygous or hemizygous. But no genes in the region singly cause the sterility; this region has at least two genes, which in combination result in male sterility. Further, the males are less fertile when heterozygous with a larger introgression, which suggests that dominant modifiers enhance the effects of recessive genes of male sterility. Such an epistatic view, even in the less-related species, suggests that the genetic complexity is special to hybrid male sterility. PMID:15020468
Localization of migraine susceptibility genes in human brain by single-cell RNA sequencing.
Renthal, William
2018-01-01
Background: Migraine is a debilitating disorder characterized by severe headaches and associated neurological symptoms. A key challenge to understanding migraine has been the cellular complexity of the human brain and the multiple cell types implicated in its pathophysiology. The present study leverages recent advances in single-cell transcriptomics to localize the specific human brain cell types in which putative migraine susceptibility genes are expressed. Methods: The cell-type specific expression of both familial and common migraine-associated genes was determined bioinformatically using data from 2,039 individual human brain cells across two published single-cell RNA sequencing datasets. Enrichment of migraine-associated genes was determined for each brain cell type. Results: Analysis of single-brain cell RNA sequencing data from five major subtypes of cells in the human cortex (neurons, oligodendrocytes, astrocytes, microglia, and endothelial cells) indicates that over 40% of known migraine-associated genes are enriched in the expression profiles of a specific brain cell type. Further analysis of neuronal migraine-associated genes demonstrated that approximately 70% were significantly enriched in inhibitory neurons and 30% in excitatory neurons. Conclusions: This study takes the next step in understanding the human brain cell types in which putative migraine susceptibility genes are expressed. Both familial and common migraine may arise from dysfunction of discrete cell types within the neurovascular unit, and localization of the affected cell type(s) in an individual patient may provide insight into their susceptibility to migraine.
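The abstract above does not say which statistical test was used to call a gene set "enriched" in a cell type. As a minimal sketch, assuming a one-sided hypergeometric (over-representation) test, which is a common choice for this kind of question, the calculation could look like the following. The function name and all counts in the usage line are hypothetical and are not taken from the study.

```python
from math import comb


def hypergeom_enrichment_p(total_genes, marker_genes, set_size, overlap):
    """One-sided hypergeometric tail probability P(X >= overlap).

    total_genes  : number of genes in the background (assumed universe)
    marker_genes : number of background genes marking one cell type
    set_size     : size of the gene set of interest (e.g. disease genes)
    overlap      : genes that are in both the set and the marker list
    """
    max_overlap = min(marker_genes, set_size)
    denom = comb(total_genes, set_size)
    # Sum the probabilities of observing `overlap` or more marker genes.
    tail = sum(
        comb(marker_genes, k) * comb(total_genes - marker_genes, set_size - k)
        for k in range(overlap, max_overlap + 1)
    )
    return tail / denom


# Hypothetical counts, for illustration only.
p_value = hypergeom_enrichment_p(
    total_genes=20000, marker_genes=500, set_size=40, overlap=6
)
print(f"enrichment p-value: {p_value:.3g}")
```

In practice such a test would be repeated for each cell type and the resulting p-values corrected for multiple testing; that step is omitted here.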
Chattaway, Marie Anne; Day, Michaela; Mtwale, Julia; White, Emma; Rogers, James; Day, Martin; Powell, David; Ahmad, Marwa; Harris, Ross; Talukder, Kaisar Ali; Wain, John; Jenkins, Claire; Cravioto, Alejandro
2017-01-01
Purpose: This study investigates the virulence and antimicrobial resistance in association with common clonal complexes (CCs) of enteroaggregative Escherichia coli (EAEC) isolated from Bangladesh. The aim was to determine whether specific CCs were more likely to be associated with putative virulence genes and/or antimicrobial resistance. Methodology: The presence of 15 virulence genes (by PCR) and susceptibility to 18 antibiotics were determined for 151 EAEC isolated from cases and controls during an intestinal infectious disease study carried out between 2007 and 2011 in the rural setting of Mirzapur, Bangladesh (Kotloff KL, Blackwelder WC, Nasrin D, Nataro JP, Farag TH et al. Clin Infect Dis 2012;55:S232–S245). These data were then analysed in the context of previously determined serotypes and clonal complexes defined by multi-locus sequence typing. Results: Overall there was no association between the presence of virulence or antimicrobial resistance genes in isolates of EAEC from cases versus controls. However, when stratified by clonal complex (CC), one CC associated with cases harboured more virulence factors (CC40) and one CC harboured more resistance genes (CC38) than the average. There was no direct link between the virulence gene content and antibiotic resistance. Strains within a single CC had variable virulence and resistance gene content, indicating independent and multiple gene acquisitions over time. Conclusion: In Bangladesh, there are multiple clonal complexes of EAEC harbouring a variety of virulence and resistance genes. The emergence of two of the most successful clones appeared to be linked to either increased virulence (CC40) or antimicrobial resistance (CC38), but increased resistance and virulence were not found in the same clonal complexes. PMID:28945190
Calvete, Oriol; González, Josefa; Betrán, Esther; Ruiz, Alfredo
2012-01-01
Chromosomal inversions are usually portrayed as simple two-breakpoint rearrangements changing gene order but not gene number or structure. However, increasing evidence suggests that inversion breakpoints may often have a complex structure and entail gene duplications with potential functional consequences. Here, we used a combination of different techniques to investigate the breakpoint structure and the functional consequences of a complex rearrangement fixed in Drosophila buzzatii and comprising two tandemly arranged inversions sharing the middle breakpoint: 2m and 2n. By comparing the sequence in the breakpoint regions between D. buzzatii (inverted chromosome) and D. mojavensis (noninverted chromosome), we corroborate the breakpoint reuse at the molecular level and infer that inversion 2m was associated with a duplication of a ∼13 kb segment and likely generated by staggered breaks plus repair by nonhomologous end joining. The duplicated segment contained the gene CG4673, involved in nuclear transport, and its two nested genes CG5071 and CG5079. Interestingly, we found that other than the inversion and the associated duplication, both breakpoints suffered additional rearrangements, that is, the proximal breakpoint experienced a microinversion event associated at both ends with a 121-bp long duplication that contains a promoter. As a consequence of all these different rearrangements, CG5079 has been lost from the genome, CG5071 is now a single copy nonnested gene, and CG4673 has a transcript ∼9 kb shorter and seems to have acquired a more complex gene regulation. Our results illustrate the complex effects of chromosomal rearrangements and highlight the need of complementing genomic approaches with detailed sequence-level and functional analyses of breakpoint regions if we are to fully understand genome structure, function, and evolutionary dynamics. PMID:22328714
Functional Regression Models for Epistasis Analysis of Multiple Quantitative Traits.
Zhang, Futao; Xie, Dan; Liang, Meimei; Xiong, Momiao
2016-04-01
To date, most genetic analyses of phenotypes have focused on analyzing single traits or analyzing each phenotype independently. However, joint epistasis analysis of multiple complementary traits will increase statistical power and improve our understanding of the complicated genetic structure of complex diseases. Despite their importance in uncovering the genetic structure of complex traits, statistical methods for identifying epistasis in multiple phenotypes remain fundamentally unexplored. To fill this gap, we formulate a test for interaction between two genes in multiple quantitative trait analysis as a multiple functional regression (MFRG) in which the genotype functions (genetic variant profiles) are defined as a function of the genomic position of the genetic variants. We use large-scale simulations to calculate Type I error rates for testing interaction between two genes with multiple phenotypes and to compare the power with multivariate pairwise interaction analysis and single trait interaction analysis by a single variate functional regression model. To further evaluate performance, the MFRG for epistasis analysis is applied to five phenotypes of exome sequence data from the NHLBI's Exome Sequencing Project (ESP) to detect pleiotropic epistasis. A total of 267 pairs of genes that formed a genetic interaction network showed significant evidence of epistasis influencing five traits. The results demonstrate that the joint interaction analysis of multiple phenotypes has a much higher power to detect interaction than the interaction analysis of a single trait and may open a new direction to fully uncovering the genetic structure of multiple phenotypes.
Systems Genetics as a Tool to Identify Master Genetic Regulators in Complex Disease.
Moreno-Moral, Aida; Pesce, Francesco; Behmoaras, Jacques; Petretto, Enrico
2017-01-01
Systems genetics stems from systems biology and similarly employs integrative modeling approaches to describe the perturbations and phenotypic effects observed in a complex system. However, in the case of systems genetics the main source of perturbation is naturally occurring genetic variation, which can be analyzed at the systems-level to explain the observed variation in phenotypic traits. In contrast with conventional single-variant association approaches, the success of systems genetics has been in the identification of gene networks and molecular pathways that underlie complex disease. In addition, systems genetics has proven useful in the discovery of master trans-acting genetic regulators of functional networks and pathways, which in many cases revealed unexpected gene targets for disease. Here we detail the central components of a fully integrated systems genetics approach to complex disease, starting from assessment of genetic and gene expression variation, linking DNA sequence variation to mRNA (expression QTL mapping), gene regulatory network analysis and mapping the genetic control of regulatory networks. By summarizing a few illustrative (and successful) examples, we highlight how different data-modeling strategies can be effectively integrated in a systems genetics study.
Lin, Eugene; Pei, Dee; Huang, Yi-Jen; Hsieh, Chang-Hsun; Wu, Lawrence Shih-Hsin
2009-08-01
Recent studies indicate that obesity may play a key role in modulating genetic predispositions to type 2 diabetes (T2D). This study examines the main effects of both single-locus and multilocus interactions among genetic variants in Taiwanese obese and nonobese individuals to test the hypothesis that obesity-related genes may contribute to the etiology of T2D independently and/or through such complex interactions. We genotyped 11 single nucleotide polymorphisms for 10 obesity candidate genes including adrenergic beta-2-receptor surface, adrenergic beta-3-receptor surface, angiotensinogen, fat mass and obesity associated gene, guanine nucleotide binding protein beta polypeptide 3 (GNB3), interleukin 6 receptor, proprotein convertase subtilisin/kexin type 1 (PCSK1), uncoupling protein 1, uncoupling protein 2, and uncoupling protein 3. There were 389 patients diagnosed with T2D and 186 age- and sex-matched controls. Single-locus analyses showed significant main effects of the GNB3 and PCSK1 genes on the risk of T2D among the nonobese group (p = 0.002 and 0.047, respectively). Further, interactions involving GNB3 and PCSK1 were suggested among the nonobese population using the generalized multifactor dimensionality reduction method (p = 0.001). In addition, interactions among angiotensinogen, fat mass and obesity associated gene, GNB3, and uncoupling protein 3 genes were found in a significant four-locus generalized multifactor dimensionality reduction model among the obese population (p = 0.001). The results suggest that the single nucleotide polymorphisms from the obesity candidate genes may contribute to the risk of T2D independently and/or in an interactive manner according to the presence or absence of obesity.
Hill, W D; Davies, G; van de Lagemaat, L N; Christoforou, A; Marioni, R E; Fernandes, C P D; Liewald, D C; Croning, M D R; Payton, A; Craig, L C A; Whalley, L J; Horan, M; Ollier, W; Hansell, N K; Wright, M J; Martin, N G; Montgomery, G W; Steen, V M; Le Hellard, S; Espeseth, T; Lundervold, A J; Reinvang, I; Starr, J M; Pendleton, N; Grant, S G N; Bates, T C; Deary, I J
2014-01-01
Differences in general cognitive ability (intelligence) account for approximately half of the variation in any large battery of cognitive tests and are predictive of important life events including health. Genome-wide analyses of common single-nucleotide polymorphisms indicate that they jointly tag between a quarter and a half of the variance in intelligence. However, no single polymorphism has been reliably associated with variation in intelligence. It remains possible that these many small effects might be aggregated in networks of functionally linked genes. Here, we tested a network of 1461 genes in the postsynaptic density and associated complexes for an enriched association with intelligence. These were ascertained in 3511 individuals (the Cognitive Ageing Genetics in England and Scotland (CAGES) consortium) phenotyped for general cognitive ability, fluid cognitive ability, crystallised cognitive ability, memory and speed of processing. By analysing the results of a genome-wide association study (GWAS) using Gene Set Enrichment Analysis, a significant enrichment was found for fluid cognitive ability for the proteins of the N-methyl-D-aspartate receptor complex (P=0.002). Replication was sought in two additional cohorts (N=670 and 2062). A meta-analytic P-value of 0.003 was found when these were combined with the CAGES consortium. The results suggest that genetic variation in the macromolecular machines formed by membrane-associated guanylate kinase (MAGUK) scaffold proteins and their interaction partners contributes to variation in intelligence. PMID:24399044
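The core statistic behind Gene Set Enrichment Analysis is a running sum over a ranked gene list. The sketch below shows the unweighted (Kolmogorov-Smirnov-like) form of that statistic; it is an illustration of the general idea, not a reconstruction of the exact settings used in the study above, and the gene names in the usage example are hypothetical.

```python
def enrichment_score(ranked_genes, gene_set):
    """Unweighted running-sum enrichment score.

    ranked_genes : genes ordered from most to least associated
    gene_set     : genes belonging to the set being tested
    Returns the running-sum value with the largest absolute deviation
    from zero (positive when set members cluster near the top).
    """
    gene_set = set(gene_set)
    n_hits = sum(1 for gene in ranked_genes if gene in gene_set)
    n_miss = len(ranked_genes) - n_hits
    if n_hits == 0 or n_miss == 0:
        raise ValueError("gene set must contain some, but not all, ranked genes")

    running = 0.0
    best = 0.0
    for gene in ranked_genes:
        # Step up on a set member, step down otherwise.
        if gene in gene_set:
            running += 1.0 / n_hits
        else:
            running -= 1.0 / n_miss
        if abs(running) > abs(best):
            best = running
    return best


# Hypothetical ranked list and gene set, for illustration only.
ranking = ["g1", "g2", "g3", "g4", "g5", "g6"]
print(enrichment_score(ranking, {"g1", "g2", "g5"}))
```

Significance is usually assessed by comparing the observed score with scores obtained from permuted data; that permutation step is left out of this sketch.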
Grindberg, Rashel V.; Ishoey, Thomas; Brinza, Dumitru; Esquenazi, Eduardo; Coates, R. Cameron; Liu, Wei-ting; Gerwick, Lena; Dorrestein, Pieter C.; Pevzner, Pavel; Lasken, Roger; Gerwick, William H.
2011-01-01
Filamentous marine cyanobacteria are extraordinarily rich sources of structurally novel, biomedically relevant natural products. To understand their biosynthetic origins as well as produce increased supplies and analog molecules, access to the clustered biosynthetic genes that encode for the assembly enzymes is necessary. Complicating these efforts is the universal presence of heterotrophic bacteria in the cell wall and sheath material of cyanobacteria obtained from the environment and those grown in uni-cyanobacterial culture. Moreover, the high similarity in genetic elements across disparate secondary metabolite biosynthetic pathways renders current gene cluster targeting strategies imprecise and contributes sequence complexity resulting in partial genome coverage. Thus, it was necessary to use a dual-method approach of single-cell genomic sequencing based on multiple displacement amplification (MDA) and metagenomic library screening. Here, we report the identification of the putative biosynthetic gene cluster for apratoxin A, a potent cancer cell cytotoxin with promise for medicinal applications. The roughly 58 kb biosynthetic gene cluster is composed of 12 open reading frames and has a type I modular mixed polyketide synthase/nonribosomal peptide synthetase (PKS/NRPS) organization and features loading and off-loading domain architecture never previously described. Moreover, this work represents the first successful isolation of a complete biosynthetic gene cluster from Lyngbya bouillonii, a tropical marine cyanobacterium renowned for its production of diverse bioactive secondary metabolites. PMID:21533272
Yamanaka, Yuki; Winardhi, Ricksen S; Yamauchi, Erika; Nishiyama, So-Ichiro; Sowa, Yoshiyuki; Yan, Jie; Kawagishi, Ikuro; Ishihama, Akira; Yamamoto, Kaneyoshi
2018-06-15
The bacterial nucleoid-associated protein H-NS is a DNA-binding protein, playing a major role in gene regulation. To regulate transcription, H-NS silences genes, including horizontally acquired foreign genes. Escherichia coli H-NS is 137 residues long and consists of two discrete and independent structural domains: an N-terminal oligomerization domain and a C-terminal DNA-binding domain, joined by a flexible linker. The N-terminal oligomerization domain is composed of two dimerization sites, dimerization sites 1 and 2, which are both required for H-NS oligomerization, but the exact role of dimerization site 2 in gene silencing is unclear. To this end, we constructed a whole set of single amino acid substitution variants spanning residues 2 to 137. Using a well-characterized H-NS target, the slp promoter of the glutamic acid-dependent acid resistance (GAD) cluster promoters, we screened for any variants defective in gene silencing. Focusing on the function of dimerization site 2, we analyzed four variants, I70C/I70A and L75C/L75A, which all could actively bind DNA but are defective in gene silencing. Atomic force microscopy analysis of DNA-H-NS complexes revealed that all of these four variants formed condensed complexes on DNA, whereas WT H-NS formed rigid and extended nucleoprotein filaments, a conformation required for gene silencing. Single-molecule stretching experiments confirmed that the four variants had lost the ability to form stiffened filaments. We conclude that dimerization site 2 of H-NS plays a key role in the formation of rigid H-NS nucleoprotein filament structures required for gene silencing. © 2018 by The American Society for Biochemistry and Molecular Biology, Inc.
Jang, Sumin; Choubey, Sandeep; Furchtgott, Leon; Zou, Ling-Nan; Doyle, Adele; Menon, Vilas; Loew, Ethan B; Krostag, Anne-Rachel; Martinez, Refugio A; Madisen, Linda; Levi, Boaz P; Ramanathan, Sharad
2017-01-01
The complexity of gene regulatory networks that lead multipotent cells to acquire different cell fates makes a quantitative understanding of differentiation challenging. Using a statistical framework to analyze single-cell transcriptomics data, we infer the gene expression dynamics of early mouse embryonic stem (mES) cell differentiation, uncovering discrete transitions across nine cell states. We validate the predicted transitions across discrete states using flow cytometry. Moreover, using live-cell microscopy, we show that individual cells undergo abrupt transitions from a naïve to primed pluripotent state. Using the inferred discrete cell states to build a probabilistic model for the underlying gene regulatory network, we further predict and experimentally verify that these states have unique responses to perturbations, thus defining them functionally. Our study provides a framework to infer the dynamics of differentiation from single-cell transcriptomics data and to build predictive models of the gene regulatory networks that drive the sequence of cell fate decisions during development. DOI: http://dx.doi.org/10.7554/eLife.20487.001 PMID:28296635
Evolutionary interplay between sister cytochrome P450 genes shapes plasticity in plant metabolism.
Liu, Zhenhua; Tavares, Raquel; Forsythe, Evan S; André, François; Lugan, Raphaël; Jonasson, Gabriella; Boutet-Mercey, Stéphanie; Tohge, Takayuki; Beilstein, Mark A; Werck-Reichhart, Danièle; Renault, Hugues
2016-10-07
Expansion of the cytochrome P450 gene family is often proposed to have a critical role in the evolution of metabolic complexity, in particular in microorganisms, insects and plants. However, the molecular mechanisms underlying the evolution of this complexity are poorly understood. Here we describe the evolutionary history of a plant P450 retrogene, which emerged and underwent fixation in the common ancestor of Brassicales, before undergoing tandem duplication in the ancestor of Brassicaceae. Duplication leads first to gain of dual functions in one of the copies. Both sister genes are retained through subsequent speciation but eventually return to a single copy in two of three diverging lineages. In the lineage in which both copies are maintained, the ancestral functions are split between paralogs and a novel function arises in the copy under relaxed selection. Our work illustrates how retrotransposition and gene duplication can favour the emergence of novel metabolic functions.
Model-based design of RNA hybridization networks implemented in living cells
Rodrigo, Guillermo; Prakash, Satya; Shen, Shensi; Majer, Eszter
2017-01-01
Synthetic gene circuits allow the behavior of living cells to be reprogrammed, and non-coding small RNAs (sRNAs) are increasingly being used as programmable regulators of gene expression. However, sRNAs (natural or synthetic) are generally used to regulate single target genes, while complex dynamic behaviors would require networks of sRNAs regulating each other. Here, we report a strategy for implementing such networks that exploits hybridization reactions carried out exclusively by multifaceted sRNAs that are both targets of and triggers for other sRNAs. These networks are ultimately coupled to the control of gene expression. We relied on a thermodynamic model of the different stable conformational states underlying this system at the nucleotide level. To test our model, we designed five different RNA hybridization networks with a linear architecture, and we implemented them in Escherichia coli. We validated the network architecture at the molecular level by native polyacrylamide gel electrophoresis, as well as the network function at the bacterial population and single-cell levels with a fluorescent reporter. Our results suggest that it is possible to engineer complex cellular programs based on RNA from first principles. Because these networks are mainly based on physical interactions, our designs could be expanded to other organisms as portable regulatory resources or to implement biological computations. PMID:28934501
Hu, Qianni; Sun, Genlou
2017-06-01
Two single-copy nuclear genes, the second largest subunit of RNA polymerase II (RPB2) and thioredoxin-like gene (HTL), were used to explore the phylogeny and origin of polyploid species in Hordeum. Our results were partly in accord with previous studies, but disclosed additional complexity. Both RPB2 and HTL trees confirmed the presence of Xa genome in H. capense and H. secalinum, and that H. depressum originated from H. californicum together with other American diploids, either H. intercedens or H. pusillum. American diploids solely contributed to the origin of H. depressum. The Asian diploids, either H. bogdanii or H. brevisubulatum, contributed to the formation of American polyploids except H. depressum. RPB2 and HTL sequences showed that H. roshevitzii did not contribute to the origin of American tetraploids. Our data showed a close relationship between the hexaploids H. procerum and H. parodii and the tetraploids H. brachyantherum, H. fuegianum, H. guatemalense, H. jubatum, and H. tetraploidum. The involvement of the diploid H. pusillum and the tetraploid H. jubatum in the formation of H. arizonicum was also indicated in the HTL phylogeny. Our results suggested a possible gene introgression of W- and P-genome species into the tetraploid H. jubatum and the hexaploid H. procerum.
Genetic heterogeneity in autism: From single gene to a pathway perspective.
An, Joon Yong; Claudianos, Charles
2016-09-01
The extreme genetic heterogeneity of autism spectrum disorder (ASD) represents a major challenge. Recent advances in genetic screening and systems biology approaches have extended our knowledge of the genetic etiology of ASD. In this review, we discuss the paradigm shift from a single gene causation model to pathway perturbation model as a guide to better understand the pathophysiology of ASD. We discuss recent genetic findings obtained through next-generation sequencing (NGS) and examine various integrative analyses using systems biology and complex networks approaches that identify convergent patterns of genetic elements associated with ASD. Copyright © 2016 Elsevier Ltd. All rights reserved.
MicroRNAs are tightly associated with RNA-induced gene silencing complexes in vivo.
Tang, Fuchou; Hajkova, Petra; O'Carroll, Dónal; Lee, Caroline; Tarakhovsky, Alexander; Lao, Kaiqin; Surani, M Azim
2008-07-18
Previous work has shown that synthesized siRNA/miRNA is tightly associated with RNA-induced Gene Silencing Complexes (RISCs) in vitro. However, it is unknown whether endogenous miRNAs are also stably bound to RISC complexes in vivo in cells under physiological conditions. Here we describe the use of the looped real-time PCR-based method to trace the location of endogenous miRNAs in intact cells. We found that most of the endogenous miRNAs are tightly bound to RISC complexes, and only a very small proportion of them are free in cells. Furthermore, synthesized single-stranded mature miRNA or hairpin miRNA precursor cannot replace endogenous miRNAs already present in RISC complexes. However, we found that modified 2'-O-methyl-ribonucleotides were able to dissociate the target miRNA specifically from the RISC complex. These findings have important implications for understanding the basis for the stability and metabolism of miRNAs in living cells.
Spatial reconstruction of single-cell gene expression
Satija, Rahul; Farrell, Jeffrey A.; Gennert, David; Schier, Alexander F.; Regev, Aviv
2015-01-01
Spatial localization is a key determinant of cellular fate and behavior, but spatial RNA assays traditionally rely on staining for a limited number of RNA species. In contrast, single-cell RNA-seq allows for deep profiling of cellular gene expression, but established methods separate cells from their native spatial context. Here we present Seurat, a computational strategy to infer cellular localization by integrating single-cell RNA-seq data with in situ RNA patterns. We applied Seurat to spatially map 851 single cells from dissociated zebrafish (Danio rerio) embryos, inferring a transcriptome-wide map of spatial patterning. We confirmed Seurat’s accuracy using several experimental approaches, and used it to identify a set of archetypal expression patterns and spatial markers. Additionally, Seurat correctly localizes rare subpopulations, accurately mapping both spatially restricted and scattered groups. Seurat will be applicable to mapping cellular localization within complex patterned tissues in diverse systems. PMID:25867923
GeneNetFinder2: Improved Inference of Dynamic Gene Regulatory Relations with Multiple Regulators.
Han, Kyungsook; Lee, Jeonghoon
2016-01-01
A gene involved in complex regulatory interactions may have multiple regulators since gene expression in such interactions is often controlled by more than one gene. Another thing that makes gene regulatory interactions complicated is that regulatory interactions are not static, but change over time during the cell cycle. Most research so far has focused on identifying gene regulatory relations between individual genes in a particular stage of the cell cycle. In this study we developed a method for identifying dynamic gene regulations of several types from the time-series gene expression data. The method can find gene regulations with multiple regulators that work in combination or individually as well as those with single regulators. The method has been implemented as the second version of GeneNetFinder (hereafter called GeneNetFinder2) and tested on several gene expression datasets. Experimental results with gene expression data revealed the existence of genes that are not regulated by individual genes but rather by a combination of several genes. Such gene regulatory relations cannot be found by conventional methods. Our method finds such regulatory relations as well as those with multiple, independent regulators or single regulators, and represents gene regulatory relations as a dynamic network in which different gene regulatory relations are shown in different stages of the cell cycle. GeneNetFinder2 is available at http://bclab.inha.ac.kr/GeneNetFinder and will be useful for modeling dynamic gene regulations with multiple regulators.
One-plasmid tunable coexpression for mycobacterial protein–protein interaction studies
Chang, Yong; Mead, David; Dhodda, Vinay; Brumm, Phil; Fox, Brian G
2009-01-01
A single plasmid that allows controlled coexpression has been developed for use in mycobacteria. The tetracycline inducible promoter, PtetO, was used to provide tetracycline-dependent induction of one gene, while the Psmyc, Pimyc, or Phsp promoters were used to provide three different levels of constitutive expression of a second gene. The functions of these four individual promoters were established using green fluorescent protein (GFP) and a newly identified red fluorescence inducible protein from Geobacillus stearothermophilus strain G1.13 (RFIP) as reporters. The tandem use of GFP and RFIP as reporter genes allowed optimization of the tunable coexpression in Mycobacterium smegmatis; either time at a fixed inducer concentration or changes in inducer concentration could be used to control the protein:protein ratio. This single vector system was used to coexpress the two-protein Mycobacterium tuberculosis stearoyl-CoA Δ9 desaturase complex (integral membrane desaturase Rv3229c and NADPH oxidoreductase Rv3230c) in M. smegmatis. The catalytic activity was found to increase in a manner corresponding to increasing the level of Rv3230c relative to a fixed level of Rv3229c. This system, which can yield finely tuned coexpression of the fatty acid desaturase complex in mycobacteria, may be useful for study of other multicomponent complexes. Furthermore, the tunable coexpression strategy used herein should also be applicable in other species with minor modifications. PMID:19760663
Identification of type 2 diabetes-associated combination of SNPs using support vector machine.
Ban, Hyo-Jeong; Heo, Jee Yeon; Oh, Kyung-Soo; Park, Keun-Joon
2010-04-23
Type 2 diabetes mellitus (T2D), a metabolic disorder characterized by insulin resistance and relative insulin deficiency, is a complex disease of major public health importance. Its incidence is rapidly increasing in the developed countries. Complex diseases are caused by interactions between multiple genes and environmental factors. Most association studies aim to identify individual susceptibility single markers using a simple disease model. Recent studies are trying to estimate the effects of multiple genes and multi-locus in genome-wide association. However, estimating the effects of association is very difficult. We aim to assess the rules for classifying diseased and normal subjects by evaluating potential gene-gene interactions in the same or distinct biological pathways. We analyzed the importance of gene-gene interactions in T2D susceptibility by investigating 408 single nucleotide polymorphisms (SNPs) in 87 genes involved in major T2D-related pathways in 462 T2D patients and 456 healthy controls from the Korean cohort studies. We evaluated the support vector machine (SVM) method to differentiate between cases and controls using SNP information in a 10-fold cross-validation test. We achieved a 65.3% prediction rate with a combination of 14 SNPs in 12 genes by using the radial basis function (RBF)-kernel SVM. Similarly, we investigated subpopulation data sets of men and women and identified different SNP combinations with the prediction rates of 70.9% and 70.6%, respectively. As the high-throughput technology for genome-wide SNPs improves, it is likely that a much higher prediction rate with biologically more interesting combination of SNPs can be acquired by using this method. Support Vector Machine based feature selection method in this research found novel association between combinations of SNPs and T2D in a Korean population.
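The two ingredients named in the abstract above, an RBF kernel and a 10-fold cross-validation split, can be sketched in a few lines. This is only an illustration under stated assumptions: the genotype coding (0/1/2 minor-allele counts), the `gamma` value, the example vectors, and the helper names `rbf_kernel` and `k_fold_indices` are hypothetical and are not taken from the study, which does not report its implementation here.

```python
import math
import random

def rbf_kernel(x, y, gamma=0.1):
    """RBF kernel k(x, y) = exp(-gamma * ||x - y||^2)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)

def k_fold_indices(n_samples, k=10, seed=0):
    """Shuffle sample indices and deal them into k near-equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]

# Hypothetical genotype vectors: one 0/1/2 minor-allele count per SNP.
subject_a = [0, 1, 2, 0, 1]
subject_b = [0, 1, 1, 0, 2]
similarity = rbf_kernel(subject_a, subject_b)

# 462 cases + 456 controls, the cohort sizes given in the abstract.
n_subjects = 462 + 456
folds = k_fold_indices(n_subjects, k=10)
```

In each of the ten rounds, one fold would be held out for testing and the remaining nine used for training; the prediction rate is then averaged over the rounds.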
Extreme heterogeneity of influenza virus infection in single cells
Russell, Alistair B; Trapnell, Cole
2018-01-01
Viral infection can dramatically alter a cell’s transcriptome. However, these changes have mostly been studied by bulk measurements on many cells. Here we use single-cell mRNA sequencing to examine the transcriptional consequences of influenza virus infection. We find extremely wide cell-to-cell variation in the productivity of viral transcription – viral transcripts comprise less than a percent of total mRNA in many infected cells, but a few cells derive over half their mRNA from virus. Some infected cells fail to express at least one viral gene, but this gene absence only partially explains variation in viral transcriptional load. Despite variation in viral load, the relative abundances of viral mRNAs are fairly consistent across infected cells. Activation of innate immune pathways is rare, but some cellular genes co-vary in abundance with the amount of viral mRNA. Overall, our results highlight the complexity of viral infection at the level of single cells. PMID:29451492
Towards a Molecular Understanding of the Fanconi Anemia Core Complex
Hodson, Charlotte; Walden, Helen
2012-01-01
Fanconi Anemia (FA) is a genetic disorder characterized by the inability of patient cells to repair DNA damage caused by interstrand crosslinking agents. There are currently 14 verified FA genes, where mutation of any single gene prevents repair of DNA interstrand crosslinks (ICLs). The accumulation of ICL damage results in genome instability and patients having a high predisposition to cancers. The key event of the FA pathway is dependent on an eight-protein core complex (CC), required for the monoubiquitination of each member of the FANCD2-FANCI complex. Interestingly, the majority of patient mutations reside in the CC. The molecular mechanisms underlying the requirement for such a large complex to carry out a monoubiquitination event remain a mystery. This paper documents the extensive efforts of researchers so far to understand the molecular roles of the CC proteins with regard to its main function in the FA pathway, the monoubiquitination of FANCD2 and FANCI. PMID:22675617
Riahi, Reza; Wang, Shue; Long, Min; Li, Na; Chiou, Pei-Yu; Zhang, Donna D; Wong, Pak Kin
2014-04-22
The photothermal effect of plasmonic nanostructures has numerous applications, such as cancer therapy, photonic gene circuit, large cargo delivery, and nanostructure-enhanced laser tweezers. The photothermal operation can also induce unwanted physical and biochemical effects, which potentially alter the cell behaviors. However, there is a lack of techniques for characterizing the dynamic cell responses near the site of photothermal operation with high spatiotemporal resolution. In this work, we show that the incorporation of locked nucleic acid probes with gold nanorods allows photothermal manipulation and real-time monitoring of gene expression near the area of irradiation in living cells and animal tissues. The multimodal gold nanorod serves as an endocytic delivery reagent to transport the probes into the cells, a fluorescence quencher and a binding competitor to detect intracellular mRNA, and a plasmonic photothermal transducer to induce cell ablation. We demonstrate the ability of the gold nanorod-locked nucleic acid complex for detecting the spatiotemporal gene expression in viable cells and tissues and inducing photothermal ablation of single cells. Using the gold nanorod-locked nucleic acid complex, we systematically characterize the dynamic cellular heat shock responses near the site of photothermal operation. The gold nanorod-locked nucleic acid complex enables mapping of intracellular gene expressions and analyzes the photothermal effects of nanostructures toward various biomedical applications.
Wang, Xiyin; Guo, Hui; Wang, Jinpeng; Lei, Tianyu; Liu, Tao; Wang, Zhenyi; Li, Yuxian; Lee, Tae-Ho; Li, Jingping; Tang, Haibao; Jin, Dianchuan; Paterson, Andrew H
2016-02-01
The 'apparently' simple genomes of many angiosperms mask complex evolutionary histories. The reference genome sequence for cotton (Gossypium spp.) revealed a ploidy change of a complexity unprecedented to date, indeed that could not be distinguished as to its exact dosage. Herein, by developing several comparative, computational and statistical approaches, we revealed a 5× multiplication in the cotton lineage of an ancestral genome common to cotton and cacao, and proposed evolutionary models to show how such a decaploid ancestor formed. The c. 70% gene loss necessary to bring the ancestral decaploid to its current gene count appears to fit an approximate geometrical model; that is, although many genes may be lost by single-gene deletion events, some may be lost in groups of consecutive genes. Gene loss following cotton decaploidy has largely just reduced gene copy numbers of some homologous groups. We designed a novel approach to deconvolute layers of chromosome homology, providing definitive information on gene orthology and paralogy across broad evolutionary distances, both of fundamental value and serving as an important platform to support further studies in and beyond cotton and genomics communities.
Evidence for a Complex Class of Nonadenylated mRNA in Drosophila
Zimmerman, J. Lynn; Fouts, David L.; Manning, Jerry E.
1980-01-01
The amount, by mass, of poly(A+) mRNA present in the polyribosomes of third-instar larvae of Drosophila melanogaster, and the relative contribution of the poly(A+) mRNA to the sequence complexity of total polysomal RNA, has been determined. Selective removal of poly(A+) mRNA from total polysomal RNA by use of either oligo-dT-cellulose, or poly(U)-sepharose affinity chromatography, revealed that only 0.15% of the mass of the polysomal RNA was present as poly(A+) mRNA. The present study shows that this RNA hybridized at saturation with 3.3% of the single-copy DNA in the Drosophila genome. After correction for asymmetric transcription and reactability of the DNA, 7.4% of the single-copy DNA in the Drosophila genome is represented in larval poly(A+) mRNA. This corresponds to 6.73 × 10^6 nucleotides of mRNA coding sequences, or approximately 5,384 diverse RNA sequences of average size 1,250 nucleotides. However, total polysomal RNA hybridizes at saturation to 10.9% of the single-copy DNA sequences. After correcting this value for asymmetric transcription and tracer DNA reactability, 24% of the single-copy DNA in Drosophila is represented in total polysomal RNA. This corresponds to 2.18 × 10^7 nucleotides of RNA coding sequences or 17,440 diverse RNA molecules of size 1,250 nucleotides. This value is 3.2 times greater than that observed for poly(A+) mRNA, and indicates that ≃69% of the polysomal RNA sequence complexity is contributed by nonadenylated RNA. Furthermore, if the number of different structural genes represented in total polysomal RNA is ≃1.7 × 10^4, then the number of genes expressed in third-instar larvae exceeds the number of chromomeres in Drosophila by about a factor of three. This numerology indicates that the number of chromomeres observed in polytene chromosomes does not reflect the number of structural gene sequences in the Drosophila genome. PMID:6777246
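The figures quoted in the abstract above follow from simple arithmetic, which the short check below reproduces. The input values are taken from the abstract; the variable names are illustrative only.

```python
AVG_RNA_LENGTH = 1250  # nucleotides, average size stated in the abstract

poly_a_complexity = 6.73e6  # nucleotides represented in poly(A+) mRNA
total_complexity = 2.18e7   # nucleotides represented in total polysomal RNA

# Number of distinct average-sized sequences in each RNA population.
poly_a_species = poly_a_complexity / AVG_RNA_LENGTH
total_species = total_complexity / AVG_RNA_LENGTH

# How much more complex total polysomal RNA is than poly(A+) mRNA.
fold_difference = total_complexity / poly_a_complexity

# Share of the total complexity not accounted for by poly(A+) mRNA.
nonadenylated_fraction = 1 - poly_a_complexity / total_complexity

print(round(poly_a_species), round(total_species))
print(round(fold_difference, 1), round(nonadenylated_fraction, 2))
```

The results match the abstract: about 5,384 and 17,440 sequences, a ratio of about 3.2, and roughly 69% of the complexity contributed by nonadenylated RNA.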
Nuclear Localization of Schizosaccharomyces pombe Mcm2/Cdc19p Requires MCM Complex Assembly
Pasion, Sally G.; Forsburg, Susan L.
1999-01-01
The minichromosome maintenance (MCM) proteins MCM2–MCM7 are conserved eukaryotic replication factors that assemble in a heterohexameric complex. In fission yeast, these proteins are nuclear throughout the cell cycle. In studying the mechanism that regulates assembly of the MCM complex, we analyzed the cis and trans elements required for nuclear localization of a single subunit, Mcm2p. Mutation of any single mcm gene leads to redistribution of wild-type MCM subunits to the cytoplasm, and this redistribution depends on an active nuclear export system. We identified the nuclear localization signal sequences of Mcm2p and showed that these are required for nuclear targeting of other MCM subunits. In turn, Mcm2p must associate with other MCM proteins for its proper localization; nuclear localization of MCM proteins thus requires assembly of MCM proteins in a complex. We suggest that coupling complex assembly to nuclear targeting and retention ensures that only intact heterohexameric MCM complexes remain nuclear. PMID:10588642
Bizarro, Cristina; Eide, Marta; Hitchcock, Daniel J; Goksøyr, Anders; Ortiz-Zarragoitia, Maren
2016-08-01
At the low concentrations found in the aquatic environment, most contaminants may not individually affect the normal function of organisms. However, when combined, complex mixtures may provoke unexpected effects even at low amounts. Selected aquatic micropollutants such as chlorpyrifos, bis-(2-ethylhexyl)-phthalate (DEHP), perfluorooctanoic acid (PFOA) and 17α-ethinylestradiol (EE2) were tested singly and in mixtures at nM to μM concentrations using precision-cut liver slices (PCLS) of Atlantic cod (Gadus morhua). Fish liver is a target organ for contaminants due to its crucial role in detoxification processes. In order to understand the effects on distinct key liver metabolic pathways, transcription levels of various genes were measured, including cyp1a1 and cyp3a, involved in the metabolism of organic compounds, including toxic ones, and the catabolism of bile acids and steroid hormones; cyp7a1, fabp and hmg-CoA, involved in lipid and cholesterol homeostasis; cyp24a1, involved in vitamin D metabolism; and vtg, a key gene in xenoestrogenic response. Only EE2 had significant effects on gene expression in cod liver slices when exposed singly at the concentrations tested. However, when exposed in combinations, effects not detected in single exposure conditions arose, suggesting complex interactions between studied pollutants that could not be predicted from the results of individual exposure scenarios. Thus, the present work highlights the importance of assessing mixtures when describing the toxic effects of micropollutants to fish liver metabolism.
duVerle, David A; Yotsukura, Sohiya; Nomura, Seitaro; Aburatani, Hiroyuki; Tsuda, Koji
2016-09-13
Single-cell RNA sequencing is fast becoming one of the standard methods for gene expression measurement, providing unique insights into cellular processes. A number of methods, based on general dimensionality reduction techniques, have been suggested to help infer and visualise the underlying structure of cell populations from single-cell expression levels, yet their models generally lack proper biological grounding and struggle at identifying complex differentiation paths. Here we introduce cellTree: an R/Bioconductor package that uses a novel statistical approach, based on document analysis techniques, to produce tree structures outlining the hierarchical relationship between single-cell samples, while identifying latent groups of genes that can provide biological insights. With cellTree, we provide experimentalists with an easy-to-use tool, based on statistically and biologically-sound algorithms, to efficiently explore and visualise single-cell RNA data. The cellTree package is publicly available in the online Bioconductor repository at: http://bioconductor.org/packages/cellTree/ .
Pitchiaya, Sethuramasundaram; Krishnan, Vishalakshi; Custer, Thomas C.; Walter, Nils G.
2013-01-01
Non-coding RNAs (ncRNAs) recently were discovered to outnumber their protein-coding counterparts, yet their diverse functions are still poorly understood. Here we report on a method for the intracellular Single-molecule High Resolution Localization and Counting (iSHiRLoC) of microRNAs (miRNAs), a conserved, ubiquitous class of regulatory ncRNAs that controls the expression of over 60% of all mammalian protein coding genes post-transcriptionally, by a mechanism shrouded by seemingly contradictory observations. We present protocols to execute single particle tracking (SPT) and single-molecule counting of functional microinjected, fluorophore-labeled miRNAs and thereby extract diffusion coefficients and molecular stoichiometries of micro-ribonucleoprotein (miRNP) complexes from living and fixed cells, respectively. This probing of miRNAs at the single molecule level sheds new light on the intracellular assembly/disassembly of miRNPs, thus beginning to unravel the dynamic nature of this important gene regulatory pathway and facilitating the development of a parsimonious model for their obscured mechanism of action. PMID:23820309
Expression and assembly of a fully active antibody in algae
NASA Astrophysics Data System (ADS)
Mayfield, Stephen P.; Franklin, Scott E.; Lerner, Richard A.
2003-01-01
Although combinatorial antibody libraries have solved the problem of access to large immunological repertoires, efficient production of these complex molecules remains a problem. Here we demonstrate the efficient expression of a unique large single-chain (lsc) antibody in the chloroplast of the unicellular, green alga, Chlamydomonas reinhardtii. We achieved high levels of protein accumulation by synthesizing the lsc gene in chloroplast codon bias and by driving expression of the chimeric gene using either of two C. reinhardtii chloroplast promoters and 5' and 3' RNA elements. This lsc antibody, directed against glycoprotein D of the herpes simplex virus, is produced in a soluble form by the alga and assembles into higher order complexes in vivo. Aside from dimerization by disulfide bond formation, the antibody undergoes no detectable posttranslational modification. We further demonstrate that accumulation of the antibody can be modulated by the specific growth regime used to culture the alga, and by the choice of 5' and 3' elements used to drive expression of the antibody gene. These results demonstrate the utility of alga as an expression platform for recombinant proteins, and describe a new type of single chain antibody containing the entire heavy chain protein, including the Fc domain.
Kogelman, Lisette J A; Zhernakova, Daria V; Westra, Harm-Jan; Cirera, Susanna; Fredholm, Merete; Franke, Lude; Kadarmideen, Haja N
2015-10-20
Obesity is a multi-factorial health problem in which genetic factors play an important role. Limited results have been obtained in single-gene studies using either genomic or transcriptomic data. RNA sequencing technology has shown its potential in gaining accurate knowledge about the transcriptome, and may reveal novel genes affecting complex diseases. Integration of genomic and transcriptomic variation (expression quantitative trait loci [eQTL] mapping) has identified causal variants that affect complex diseases. We integrated transcriptomic data from adipose tissue and genomic data from a porcine model to investigate the mechanisms involved in obesity using a systems genetics approach. Using a selective gene expression profiling approach, we selected 36 animals based on a previously created genomic Obesity Index for RNA sequencing of subcutaneous adipose tissue. Differential expression analysis was performed using the Obesity Index as a continuous variable in a linear model. eQTL mapping was then performed to integrate 60 K porcine SNP chip data with the RNA sequencing data. Results were restricted based on genome-wide significant single nucleotide polymorphisms, detected differentially expressed genes, and previously detected co-expressed gene modules. Further data integration was performed by detecting co-expression patterns among eQTLs and integration with protein data. Differential expression analysis of RNA sequencing data revealed 458 differentially expressed genes. The eQTL mapping resulted in 987 cis-eQTLs and 73 trans-eQTLs (false discovery rate < 0.05), of which the cis-eQTLs were associated with metabolic pathways. We reduced the eQTL search space by focusing on differentially expressed and co-expressed genes and disease-associated single nucleotide polymorphisms to detect obesity-related genes and pathways. Building a co-expression network using eQTLs resulted in the detection of a module strongly associated with lipid pathways. 
Furthermore, we detected several obesity candidate genes, for example, ENPP1, CTSL, and ABHD12B. To our knowledge, this is the first study to perform an integrated genomics and transcriptomics (eQTL) study using, and modeling, genomic and subcutaneous adipose tissue RNA sequencing data on obesity in a porcine model. We detected several pathways and potential causal genes for obesity. Further validation and investigation may reveal their exact function and association with obesity.
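The abstract above reports eQTLs at a false discovery rate below 0.05 but does not say which procedure was used. As one common possibility, the Benjamini-Hochberg step-up procedure can be sketched as follows; the choice of procedure, the function name `benjamini_hochberg`, and the example p-values are all assumptions made for illustration.

```python
def benjamini_hochberg(p_values, alpha=0.05):
    """Return one boolean per test: True where it is significant at FDR alpha."""
    m = len(p_values)
    # Indices of the tests, ordered from smallest to largest p-value.
    order = sorted(range(m), key=lambda i: p_values[i])
    # Find the largest rank k whose p-value satisfies p_(k) <= (k / m) * alpha.
    max_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * alpha:
            max_rank = rank
    # Every test ranked at or below max_rank is declared significant.
    significant = [False] * m
    for idx in order[:max_rank]:
        significant[idx] = True
    return significant

# Hypothetical p-values for five SNP-gene pairs.
example_p = [0.001, 0.008, 0.039, 0.035, 0.60]
calls = benjamini_hochberg(example_p)
print(calls)
```

Note the step-up behaviour in this example: the third-smallest p-value (0.035) exceeds its own threshold of 0.03, yet it is still called significant because the fourth-smallest (0.039) falls under its threshold of 0.04.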
Hindman, Ryan; Gollnick, Paul
2016-01-01
Vaccinia virus early genes are transcribed immediately upon infection. Nucleoside triphosphate phosphohydrolase I (NPH I) is an essential component of the early gene transcription complex. NPH I hydrolyzes ATP to release transcripts during transcription termination. The ATPase activity of NPH I requires single-stranded (ss) DNA as a cofactor; however, the source of this cofactor within the transcription complex is not known. Based on available structures of transcription complexes it has been hypothesized that the ssDNA cofactor is obtained from the unpaired non-template strand within the transcription bubble. In vitro transcription on templates that lack portions of the non-template strand within the transcription bubble showed that the upstream portion of the transcription bubble is required for efficient NPH I-mediated transcript release. Complementarity between the template and non-template strands in this region is also required for NPH I-mediated transcript release. This observation complicates locating the source of the ssDNA cofactor within the transcription complex because removal of the non-template strand also disrupts transcription bubble reannealing. Prior studies have shown that ssRNA binds to NPH I, but it does not activate ATPase activity. Chimeric transcription templates with RNA in the non-template strand confirm that the source of the ssDNA cofactor for NPH I is the upstream portion of the non-template strand in the transcription bubble. Consistent with this conclusion we also show that isolated NPH I acts as a 5′ to 3′ translocase on single-stranded DNA. PMID:27189950
Development of functional ectopic compound eyes in scarabaeid beetles by knockdown of orthodenticle.
Zattara, Eduardo E; Macagno, Anna L M; Busey, Hannah A; Moczek, Armin P
2017-11-07
Complex traits like limbs, brains, or eyes form through coordinated integration of diverse cell fates across developmental space and time, yet understanding how complexity and integration emerge from uniform, undifferentiated precursor tissues remains limited. Here, we use ectopic eye formation as a paradigm to investigate the emergence and integration of novel complex structures following massive ontogenetic perturbation. We show that down-regulation via RNAi of a single head patterning gene, orthodenticle, induces ectopic structures externally resembling compound eyes at the middorsal adult head of both basal and derived scarabaeid beetle species (Onthophagini and Oniticellini). Scanning electron microscopy documents ommatidial organization of these induced structures, while immunohistochemistry reveals the presence of rudimentary ommatidial lenses, crystalline cones, and associated neural-like tissue within them. Further, RNA-sequencing experiments show that after orthodenticle down-regulation, the transcriptional signature of the middorsal head (the location of ectopic eye induction) converges onto that of regular compound eyes, including up-regulation of several retina-specific genes. Finally, a light-aversion behavioral assay to assess functionality reveals that ectopic compound eyes can rescue the ability to respond to visual stimuli when wild-type eyes are surgically removed. Combined, our results show that knockdown of a single gene is sufficient for the middorsal head to acquire the competence to ectopically generate a functional compound eye-like structure. These findings highlight the buffering capacity of developmental systems, allowing massive genetic perturbations to be channeled toward orderly and functional developmental outcomes, and render ectopic eye formation a widely accessible paradigm to study the evolution of complex systems.
Ohta, Takahisa; Hashida, Yasuhiko; Higuchi, Yuriko; Yamashita, Fumiyoshi; Hashida, Mitsuru
2017-03-01
Single-walled carbon nanotubes (SWCNTs) attract great interest in biomedical fields, including application in drug delivery systems. In this study, we developed a novel gene delivery system employing SWCNTs associated with pegylated polycationic and amphiphilic H-(-Lys-Trp-Lys-Gly-)7-OH [(KWKG)7] peptides. SWCNTs wrapped with (KWKG)7 formed a complex with plasmid DNA (pDNA) in aqueous solution based on polyionic interaction but later underwent aggregation. On the other hand, a complex of pDNA and SWCNT-(KWKG)7 modified with polyethylene glycol (PEG) chains of 12 units [SWCNT-(KWKG)7-(PEG)12] afforded good dispersion stability for 24 h even in a cell culture medium. The in vitro cellular uptake of SWCNT-(KWKG)7-(PEG)12/pDNA complex prepared with fluorescence-labeled pDNA was evaluated with fluorescent microscopic observation and flow cytometry. The uptake by A549 human lung adenocarcinoma epithelial cells increased along with the extent of pegylation, suggesting the importance of dispersion stability in addition to the cationic charge which facilitates ionic cellular interaction. The expression of pDNA encoding the monomeric Kusabira-Orange 2 fluorescent protein in the form of the SWCNT-(KWKG)7-(PEG)12/pDNA complex demonstrated remarkable enhancement of transfection depending also on the extent of pegylation and the N/P ratio. The potential of the SWCNT composite wrapped with polycationic and amphiphilic (KWKG)7 with pegylation as a carrier for gene delivery was demonstrated.
Homology-dependent Gene Silencing in Paramecium
Ruiz, Françoise; Vayssié, Laurence; Klotz, Catherine; Sperling, Linda; Madeddu, Luisa
1998-01-01
Microinjection at high copy number of plasmids containing only the coding region of a gene into the Paramecium somatic macronucleus led to a marked reduction in the expression of the corresponding endogenous gene(s). The silencing effect, which is stably maintained throughout vegetative growth, has been observed for all Paramecium genes examined so far: a single-copy gene (ND7), as well as members of multigene families (centrin genes and trichocyst matrix protein genes) in which all closely related paralogous genes appeared to be affected. This phenomenon may be related to posttranscriptional gene silencing in transgenic plants and quelling in Neurospora and allows the efficient creation of specific mutant phenotypes thus providing a potentially powerful tool to study gene function in Paramecium. For the two multigene families that encode proteins that coassemble to build up complex subcellular structures the analysis presented herein provides the first experimental evidence that the members of these gene families are not functionally redundant. PMID:9529389
Aukema, Sietse M; Kreuz, Markus; Kohler, Christian W; Rosolowski, Maciej; Hasenclever, Dirk; Hummel, Michael; Küppers, Ralf; Lenze, Dido; Ott, German; Pott, Christiane; Richter, Julia; Rosenwald, Andreas; Szczepanowski, Monika; Schwaenen, Carsten; Stein, Harald; Trautmann, Heiko; Wessendorf, Swen; Trümper, Lorenz; Loeffler, Markus; Spang, Rainer; Kluin, Philip M; Klapper, Wolfram; Siebert, Reiner
2014-04-01
Chromosomal translocations affecting the MYC oncogene are the biological hallmark of Burkitt lymphomas but also occur in a subset of other mature B-cell lymphomas. If accompanied by a chromosomal break targeting the BCL2 and/or BCL6 oncogene these MYC translocation-positive (MYC(+)) lymphomas are called double-hit lymphomas, otherwise the term single-hit lymphomas is applied. In order to characterize the biological features of these MYC(+) lymphomas other than Burkitt lymphoma we explored, after exclusion of molecular Burkitt lymphoma as defined by gene expression profiling, the molecular, pathological and clinical aspects of 80 MYC-translocation-positive lymphomas (31 single-hit, 46 double-hit and 3 MYC(+)-lymphomas with unknown BCL6 status). Comparison of single-hit and double-hit lymphomas revealed no difference in MYC partner (IG/non-IG), genomic complexity, MYC expression or gene expression profile. Double-hit lymphomas more frequently showed a germinal center B-cell-like gene expression profile and had higher IGH and MYC mutation frequencies. Gene expression profiling revealed 130 differentially expressed genes between BCL6(+)/MYC(+) and BCL2(+)/MYC(+) double-hit lymphomas. BCL2(+)/MYC(+) double-hit lymphomas more frequently showed a germinal center B-like gene expression profile. Analysis of all lymphomas according to MYC partner (IG/non-IG) revealed no substantial differences. In this series of lymphomas, in which immunochemotherapy was administered in only a minority of cases, single-hit and double-hit lymphomas had a similar poor outcome in contrast to the outcome of molecular Burkitt lymphoma and lymphomas without the MYC break. Our data suggest that, after excluding molecular Burkitt lymphoma and pediatric cases, MYC(+) lymphomas are biologically quite homogeneous with single-hit and double-hit lymphomas as well as IG-MYC and non-IG-MYC(+) lymphomas sharing various molecular characteristics.
Transcriptomic insights into phenological development and cold tolerance of wheat grown in the field
USDA-ARS's Scientific Manuscript database
Low temperature (LT) acclimation and winter survival in cereal species is determined by complicated environmentally regulated gene expression. However, studies investigating these complex LT responses are mostly conducted in controlled environments that only consider the responses to single environm...
Application of advanced cytometric and molecular technologies to minimal residual disease monitoring
NASA Astrophysics Data System (ADS)
Leary, James F.; He, Feng; Reece, Lisa M.
2000-04-01
Minimal residual disease monitoring presents a number of theoretical and practical challenges. Recently it has been possible to meet some of these challenges by combining a number of new advanced biotechnologies. Monitoring the number of residual tumor cells requires complex cocktails of molecular probes that collectively provide sensitivities of detection on the order of one residual tumor cell per million total cells. Ultra-high-speed, multiparameter flow cytometry is capable of analyzing cells at rates in excess of 100,000 cells/sec. Residual tumor selection marker cocktails can be optimized by use of receiver operating characteristic analysis. New data minimizing techniques, when combined with multivariate statistical or neural network classifications of tumor cells, can more accurately predict residual tumor cell frequencies. The combination of these techniques can, under at least some circumstances, detect frequencies of tumor cells as low as one cell in a million with an accuracy of over 98 percent correct classification. Detection of mutations in tumor suppressor genes requires isolation of these rare tumor cells and single-cell DNA sequencing. Rare residual tumor cells can be isolated at the single-cell level by high-resolution single-cell sorting. Molecular characterization of tumor suppressor gene mutations can be accomplished using a combination of single-cell polymerase chain reaction amplification of specific gene sequences followed by TA cloning techniques and DNA sequencing. Mutations as small as a single base pair in a tumor suppressor gene of a single sorted tumor cell have been detected using these methods. Using new amplification procedures and DNA microarrays it should be possible to extend the capabilities shown in this paper to screening of multiple DNA mutations in tumor suppressor and other genes on small numbers of sorted metastatic tumor cells.
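The throughput and sensitivity figures quoted in this abstract imply a minimum acquisition time per sample. The following is a minimal back-of-the-envelope sketch in Python, not the authors' method. The function name and the idea of targeting a chosen expected number of tumor cells are illustrative assumptions; only the rate (100,000 cells/sec) and the frequency (one tumor cell per million cells) come from the abstract.

```python
# Figures taken from the abstract; everything else is an illustrative assumption.
CELLS_PER_SECOND = 100_000        # quoted analysis rate (stated as a lower bound)
CELLS_PER_TUMOR_CELL = 1_000_000  # one residual tumor cell per million total cells


def seconds_to_expect(tumor_cells, cells_per_tumor_cell=CELLS_PER_TUMOR_CELL,
                      rate=CELLS_PER_SECOND):
    """Time needed to analyze enough cells to expect `tumor_cells` tumor cells."""
    total_cells = tumor_cells * cells_per_tumor_cell
    return total_cells / rate


# Expecting a single tumor cell means screening one million cells.
print(seconds_to_expect(1))
# Expecting ten tumor cells means screening ten million cells.
print(seconds_to_expect(10))
```

These are expected values only; the number of rare cells actually observed in a given run will fluctuate around them.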
Targeted Re-Sequencing Emulsion PCR Panel for Myopathies: Results in 94 Cases.
Punetha, Jaya; Kesari, Akanchha; Uapinyoying, Prech; Giri, Mamta; Clarke, Nigel F; Waddell, Leigh B; North, Kathryn N; Ghaoui, Roula; O'Grady, Gina L; Oates, Emily C; Sandaradura, Sarah A; Bönnemann, Carsten G; Donkervoort, Sandra; Plotz, Paul H; Smith, Edward C; Tesi-Rocha, Carolina; Bertorini, Tulio E; Tarnopolsky, Mark A; Reitter, Bernd; Hausmanowa-Petrusewicz, Irena; Hoffman, Eric P
2016-05-27
Molecular diagnostics in the genetic myopathies often requires testing of the largest and most complex transcript units in the human genome (DMD, TTN, NEB). Iteratively targeting single genes for sequencing has traditionally entailed high costs and long turnaround times. Exome sequencing has begun to supplant single targeted genes, but there are concerns regarding coverage and needed depth of the very large and complex genes that frequently cause myopathies. Our objective was to evaluate the efficiency of next-generation sequencing technologies in providing molecular diagnostics for patients with previously undiagnosed myopathies. We tested a targeted re-sequencing approach, using a 45 gene emulsion PCR myopathy panel, with subsequent sequencing on the Illumina platform in 94 undiagnosed patients. We compared the targeted re-sequencing approach to exome sequencing for 10 of these patients. We detected likely pathogenic mutations in 33 out of 94 patients, a molecular diagnostic rate of approximately 35%. The remaining patients showed variants of unknown significance (35/94 patients) or no mutations detected in the 45 genes tested (26/94 patients). Mutation detection rates were similar for targeted re-sequencing and whole exome sequencing; however, exome sequencing showed better distribution of reads and fewer exon dropouts. Given that costs of highly parallel re-sequencing and whole exome sequencing are similar, and that exome sequencing now takes considerably less laboratory processing time than targeted re-sequencing, we recommend exome sequencing as the standard approach for molecular diagnostics of myopathies.
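The diagnostic rate reported in this abstract can be checked directly from the patient counts. The short Python sketch below does that arithmetic; the dictionary keys and the function name are illustrative labels, while the counts (33, 35, 26 of 94) are those given in the abstract.

```python
# Patient counts as reported in the abstract; the labels are illustrative.
TOTAL_PATIENTS = 94
outcomes = {
    "likely_pathogenic": 33,
    "variant_of_unknown_significance": 35,
    "no_mutation_detected": 26,
}


def outcome_rates(counts, total):
    """Return each outcome as a fraction of the whole cohort."""
    return {name: n / total for name, n in counts.items()}


rates = outcome_rates(outcomes, TOTAL_PATIENTS)
for name, rate in rates.items():
    print(f"{name}: {rate:.1%}")
```

The three categories account for all 94 patients, and 33/94 rounds to the approximately 35% quoted as the diagnostic rate.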
Bipartite Community Structure of eQTLs.
Platig, John; Castaldi, Peter J; DeMeo, Dawn; Quackenbush, John
2016-09-01
Genome Wide Association Studies (GWAS) and expression quantitative trait locus (eQTL) analyses have identified genetic associations with a wide range of human phenotypes. However, many of these variants have weak effects and understanding their combined effect remains a challenge. One hypothesis is that multiple SNPs interact in complex networks to influence functional processes that ultimately lead to complex phenotypes, including disease states. Here we present CONDOR, a method that represents both cis- and trans-acting SNPs and the genes with which they are associated as a bipartite graph and then uses the modular structure of that graph to place SNPs into a functional context. In applying CONDOR to eQTLs in chronic obstructive pulmonary disease (COPD), we found the global network "hub" SNPs were devoid of disease associations through GWAS. However, the network was organized into 52 communities of SNPs and genes, many of which were enriched for genes in specific functional classes. We identified local hubs within each community ("core SNPs") and these were enriched for GWAS SNPs for COPD and many other diseases. These results speak to our intuition: rather than single SNPs influencing single genes, we see groups of SNPs associated with the expression of families of functionally related genes and that disease SNPs are associated with the perturbation of those functions. These methods are not limited in their application to COPD and can be used in the analysis of a wide variety of disease processes and other phenotypic traits.
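The bipartite representation described in this abstract can be illustrated with a small Python sketch. It is not the CONDOR implementation: it only builds the SNP-gene graph from a list of associations and uses node degree as a crude stand-in for "hub" status, whereas the abstract describes community detection based on the graph's modular structure, which is not shown here. All SNP and gene identifiers are made up.

```python
from collections import defaultdict

# Hypothetical eQTL associations as (SNP, gene) pairs; identifiers are invented.
eqtl_pairs = [
    ("snp1", "geneA"), ("snp1", "geneB"), ("snp1", "geneC"),
    ("snp2", "geneA"),
    ("snp3", "geneC"), ("snp3", "geneD"),
]


def build_bipartite(pairs):
    """Build both adjacency maps of a bipartite SNP-gene graph."""
    snp_to_genes = defaultdict(set)
    gene_to_snps = defaultdict(set)
    for snp, gene in pairs:
        snp_to_genes[snp].add(gene)
        gene_to_snps[gene].add(snp)
    return snp_to_genes, gene_to_snps


snp_to_genes, gene_to_snps = build_bipartite(eqtl_pairs)

# Degree = number of genes a SNP is associated with (a simple hub proxy only).
snp_degree = {snp: len(genes) for snp, genes in snp_to_genes.items()}
top_snp = max(snp_degree, key=snp_degree.get)
print(snp_degree, top_snp)
```

Keeping the two node types in separate maps preserves the bipartite constraint: edges only ever connect a SNP to a gene, never two SNPs or two genes.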
Carrier screening for single gene disorders.
Rose, Nancy C; Wick, Myra
2018-04-01
Screening for genetic disorders began in 1963 with the initiation of newborn screening for phenylketonuria. Advances in molecular technology have made both newborn screening for newborns affected with serious disorders, and carrier screening of individuals at risk for offspring with genetic disorders, more complex and more widely available. Carrier screening today can be performed secondary to family history-based screening, ethnic-based screening, and expanded carrier screening (ECS). ECS is panel-based screening, which analyzes carrier status for hundreds of genetic disorders irrespective of patient race or ethnicity. In this article, we review the historical and current aspects of carrier screening for single gene disorders, including future research directions. Copyright © 2017 Elsevier Ltd. All rights reserved.
Reavey, Caitlin T; Hickman, Mark J; Dobi, Krista C; Botstein, David; Winston, Fred
2015-10-01
Studies of natural populations of many organisms have shown that traits are often complex, caused by contributions of mutations in multiple genes. In contrast, genetic studies in the laboratory primarily focus on studying the phenotypes caused by mutations in a single gene. However, the single mutation approach may be limited with respect to the breadth and degree of new phenotypes that can be found. We have taken the approach of isolating complex, or polygenic mutants in the lab to study the regulation of transcriptional activation distance in yeast. While most aspects of eukaryotic transcription are conserved from yeast to human, transcriptional activation distance is not. In Saccharomyces cerevisiae, the upstream activating sequence (UAS) is generally found within 450 base pairs of the transcription start site (TSS) and when the UAS is moved too far away, activation no longer occurs. In contrast, metazoan enhancers can activate from as far as several hundred kilobases from the TSS. Previously, we identified single mutations that allow transcription activation to occur at a greater-than-normal distance from the GAL1 UAS. As the single mutant phenotypes were weak, we have now isolated polygenic mutants that possess strong long-distance phenotypes. By identification of the causative mutations we have accounted for most of the heritability of the phenotype in each strain and have provided evidence that the Mediator coactivator complex plays both positive and negative roles in the regulation of transcription activation distance. Copyright © 2015 by the Genetics Society of America.
A Fast Multiple-Kernel Method With Applications to Detect Gene-Environment Interaction.
Marceau, Rachel; Lu, Wenbin; Holloway, Shannon; Sale, Michèle M; Worrall, Bradford B; Williams, Stephen R; Hsu, Fang-Chi; Tzeng, Jung-Ying
2015-09-01
Kernel machine (KM) models are a powerful tool for exploring associations between sets of genetic variants and complex traits. Although most KM methods use a single kernel function to assess the marginal effect of a variable set, KM analyses involving multiple kernels have become increasingly popular. Multikernel analysis allows researchers to study more complex problems, such as assessing gene-gene or gene-environment interactions, incorporating variance-component based methods for population substructure into rare-variant association testing, and assessing the conditional effects of a variable set adjusting for other variable sets. The KM framework is robust, powerful, and provides efficient dimension reduction for multifactor analyses, but requires the estimation of high dimensional nuisance parameters. Traditional estimation techniques, including regularization and the "expectation-maximization (EM)" algorithm, have a large computational cost and are not scalable to large sample sizes needed for rare variant analysis. Therefore, under the context of gene-environment interaction, we propose a computationally efficient and statistically rigorous "fastKM" algorithm for multikernel analysis that is based on a low-rank approximation to the nuisance effect kernel matrices. Our algorithm is applicable to various trait types (e.g., continuous, binary, and survival traits) and can be implemented using any existing single-kernel analysis software. Through extensive simulation studies, we show that our algorithm has similar performance to an EM-based KM approach for quantitative traits while running much faster. We also apply our method to the Vitamin Intervention for Stroke Prevention (VISP) clinical trial, examining gene-by-vitamin effects on recurrent stroke risk and gene-by-age effects on change in homocysteine level. © 2015 WILEY PERIODICALS, INC.
Levine, Mia T; Holloway, Alisha K; Arshad, Umbreen; Begun, David J
2007-11-01
Dosage compensation refers to the equalization of X-linked gene transcription among heterogametic and homogametic sexes. In Drosophila, the dosage compensation complex (DCC) mediates the twofold hypertranscription of the single male X chromosome. Loss-of-function mutations at any DCC protein-coding gene are male lethal. Here we report a population genetic analysis suggesting that four of the five core DCC proteins--MSL1, MSL2, MSL3, and MOF--are evolving under positive selection in D. melanogaster. Within these four proteins, several domains that range in function from X chromosome localization to protein-protein interactions have elevated, D. melanogaster-specific, amino acid divergence.
Dietzgen, Ralf G.; Kondo, Hideki; Goodin, Michael M.; Kurath, Gael; Vasilakis, Nikos
2017-01-01
The family Rhabdoviridae consists of mostly enveloped, bullet-shaped or bacilliform viruses with a negative-sense, single-stranded RNA genome that infect vertebrates, invertebrates or plants. This ecological diversity is reflected by the diversity and complexity of their genomes. Five canonical structural protein genes are conserved in all rhabdoviruses, but may be overprinted, overlapped or interspersed with several novel and diverse accessory genes. This review gives an overview of the characteristics and diversity of rhabdoviruses, their taxonomic classification, replication mechanism, properties of classical rhabdoviruses such as rabies virus and rhabdoviruses with complex genomes, rhabdoviruses infecting aquatic species, and plant rhabdoviruses with both mono- and bipartite genomes.
Identification, inheritance, and linkage of B-G-like and MHC class I genes in cranes
Jarvi, S.I.; Goto, R.M.; Gee, G.F.; Briles, W.E.; Miller, M.M.
1999-01-01
We identified B-G-like genes in the whooping and Florida sandhill cranes and linked them to the major histocompatibility complex (MHC). We evaluated the inheritance of B-G-like genes in families of whooping and Florida sandhill cranes using restriction fragment patterns (RFPs). Two B-G-like genes, designated wcbg1 and wcbg2, were located within 8 kb of one another. The fully sequenced wcbg2 gene encodes a B-G IgV-like domain, an additional Ig-like domain, a transmembrane domain, and a single heptad domain typical of alpha-helical coiled coils. Patterns of restriction fragments in DNA from the whooping crane and from a number of other species indicate that the B-G-like gene families of cranes are large with diverse sequences. Segregation of RFPs in families of Florida sandhill cranes provides evidence for genetic polymorphism in the B-G-like genes. The restriction fragments generally segregated in concert with MHC haplotypes assigned by serological typing and by single stranded conformational polymorphism (SSCP) assays based on the second exon of the crane MHC class I genes. This study supports the concept of a long-term association of polymorphic B-G-like genes with the MHC. It also establishes SSCP as a means for evaluating MHC genetic variability in cranes.
Functional genomics efforts face tradeoffs between number of perturbations examined and complexity of phenotypes measured. We bridge this gap with Perturb-seq, which combines droplet-based single-cell RNA-seq with a strategy for barcoding CRISPR-mediated perturbations, allowing many perturbations to be profiled in pooled format. We applied Perturb-seq to dissect the mammalian unfolded protein response (UPR) using single and combinatorial CRISPR perturbations. Two genome-scale CRISPR interference (CRISPRi) screens identified genes whose repression perturbs ER homeostasis.
Miyamoto, T; Koh, E; Tsujimura, A; Miyagawa, Y; Saijo, Y; Namiki, M; Sengoku, K
2014-04-01
Genetic mechanisms have been implicated as a cause of some cases of male infertility. Recently, ten novel genes involved in human spermatogenesis, including human LRWD1, have been identified by expression microarray analysis of human testicular tissue. The human LRWD1 protein mediates the origin recognition complex in chromatin, which is critical for the initiation of pre-replication complex assembly in G1 and chromatin organization in post-G1 cells. Lrwd1 gene expression is specific to the testis in mice. Therefore, we hypothesized that mutations or polymorphisms of LRWD1 participate in male infertility, especially azoospermia. To investigate whether LRWD1 gene defects are associated with azoospermia caused by Sertoli cell-only syndrome (SCOS) and meiotic arrest (MA), mutational analysis was performed by direct sequencing of the coding regions in 100 Japanese patients with SCOS and 30 with MA. Statistical analysis was performed for patients with SCOS and MA and for 100 healthy control men. No mutations were found in LRWD1; however, three coding single-nucleotide polymorphisms (SNP1-SNP3) could be detected in the patients. The genotype and allele frequencies of SNP1 and SNP2 were notably higher in the SCOS group than in the control group (P < 0.05). These results suggest the critical role of LRWD1 in human spermatogenesis. © 2013 Blackwell Verlag GmbH.
Model-based design of RNA hybridization networks implemented in living cells.
Rodrigo, Guillermo; Prakash, Satya; Shen, Shensi; Majer, Eszter; Daròs, José-Antonio; Jaramillo, Alfonso
2017-09-19
Synthetic gene circuits allow the behavior of living cells to be reprogrammed, and non-coding small RNAs (sRNAs) are increasingly being used as programmable regulators of gene expression. However, sRNAs (natural or synthetic) are generally used to regulate single target genes, while complex dynamic behaviors would require networks of sRNAs regulating each other. Here, we report a strategy for implementing such networks that exploits hybridization reactions carried out exclusively by multifaceted sRNAs that are both targets of and triggers for other sRNAs. These networks are ultimately coupled to the control of gene expression. We relied on a thermodynamic model of the different stable conformational states underlying this system at the nucleotide level. To test our model, we designed five different RNA hybridization networks with a linear architecture, and we implemented them in Escherichia coli. We validated the network architecture at the molecular level by native polyacrylamide gel electrophoresis, as well as the network function at the bacterial population and single-cell levels with a fluorescent reporter. Our results suggest that it is possible to engineer complex cellular programs based on RNA from first principles. Because these networks are mainly based on physical interactions, our designs could be expanded to other organisms as portable regulatory resources or to implement biological computations. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Minireview: Genetic basis of heterogeneity and severity in sickle cell disease
Habara, Alawi
2016-01-01
Sickle cell disease, a common single gene disorder, has a complex pathophysiology that at its root is initiated by the polymerization of deoxy sickle hemoglobin. Sickle vasoocclusion and hemolytic anemia drive the development of disease complications. In this review, we focus on the genetic modifiers of disease heterogeneity. The phenotypic heterogeneity of disease is only partially explained by genetic variability of fetal hemoglobin gene expression and co-inheritance of α thalassemia. Given the complexity of pathophysiology, many different definitions of severity are possible, complicating a full understanding of its genetic foundation. The pathophysiological complexity and the interlocking nature of the biological processes underpinning disease severity are becoming better understood. Nevertheless, useful genetic signatures of severity, regardless of how this is defined, are insufficiently developed to be used for treatment decisions and for counseling. PMID:26936084
Kim, Dongkyu; Ku, Sook Hee; Kim, Hyosuk; Jeong, Ji Hoon; Lee, Minhyung; Kwon, Ick Chan; Choi, Donghoon; Kim, Sun Hwa
2016-12-10
Gene therapy is aimed at selectively up-regulating or knocking down the target genes involved in the development of diseases. In many human diseases, dysregulation of disease-associated genes occurs concurrently: some genes are abnormally turned up and some are turned down. In the field of non-viral gene therapy, plasmid DNA (pDNA) and small interfering RNA (siRNA) are suggested as representative regulation tools for activating and silencing the expression of genes of interest, respectively. Herein, we simultaneously loaded both siRNA (Src homology region 2 domain-containing tyrosine phosphatase-1 siRNA, siSHP-1) for anti-apoptosis and pDNA (hypoxia-inducible vascular endothelial growth factor expression vector, pHI-VEGF) for angiogenesis in a single polymeric nanocarrier and used it to synergistically attenuate ischemia-reperfusion (IR)-induced myocardial infarction, which is mainly caused by dysregulation of cardiac apoptosis and angiogenesis. For dual-modality cardiac gene delivery, siSHP-1 and pHI-VEGF were sequentially incorporated into a stable nanocomplex by using deoxycholic acid-modified polyethylenimine (DA-PEI). The resulting DA-PEI/siSHP-1/pHI-VEGF complexes exhibited high structural stability against polyanion competition and improved resistance to digestion by nucleases. The cardiac administration of DA-PEI/siSHP-1/pHI-VEGF reduced cardiomyocyte apoptosis and enhanced cardiac microvessel formation, thereby reducing infarct size in a rat ischemia-reperfusion model. The simultaneous anti-apoptotic and angiogenic gene therapies synergized the cardioprotective effects of each strategy; thus our dual-modal single-carrier gene delivery system can be considered a promising candidate for treating ischemic heart diseases. Copyright © 2016 Elsevier B.V. All rights reserved.
IMGT/GeneInfo: enhancing V(D)J recombination database accessibility
Baum, Thierry-Pascal; Pasqual, Nicolas; Thuderoz, Florence; Hierle, Vivien; Chaume, Denys; Lefranc, Marie-Paule; Jouvin-Marche, Evelyne; Marche, Patrice-Noël; Demongeot, Jacques
2004-01-01
IMGT/GeneInfo is a user-friendly online information system that provides information on data resulting from the complex mechanisms of immunoglobulin (IG) and T cell receptor (TR) V(D)J recombinations. For the first time, it is possible to visualize all the rearrangement parameters on a single page. IMGT/GeneInfo is part of the international ImMunoGeneTics information system® (IMGT), a high-quality integrated knowledge resource specializing in IG, TR, major histocompatibility complex (MHC), and related proteins of the immune system of human and other vertebrate species. The IMGT/GeneInfo system was developed by the TIMC and ICH laboratories (with the collaboration of LIGM), and is the first example of an external system being incorporated into IMGT. In this paper, we report the first part of this work. IMGT/GeneInfo_TR deals with the human and mouse TRA/TRD and TRB loci of the TR. Data handling and visualization are complementary to the current data and tools in IMGT, and will subsequently allow the modelling of V(D)J gene use and thus the prediction of non-standard recombination profiles, which may eventually be found in conditions such as leukaemias or lymphomas. Access to IMGT/GeneInfo is free and can be found at http://imgt.cines.fr/GeneInfo. PMID:14681357
Structural variation within the potato Ve gene locus and correlation with molecular marker analysis
USDA-ARS's Scientific Manuscript database
The disconnect between single genotype model systems and plant breeding using wide crosses of diverse germplasm is often too great to affect progress in understanding complex phenotypes. Whole genome sequencing allows researchers and breeders to quickly and inexpensively resequence interesting indiv...
Lamara, Mebarek; Raherison, Elie; Lenz, Patrick; Beaulieu, Jean; Bousquet, Jean; MacKay, John
2016-04-01
Association studies are widely utilized to analyze complex traits but their ability to disclose genetic architectures is often limited by statistical constraints, and functional insights are usually minimal in nonmodel organisms like forest trees. We developed an approach to integrate association mapping results with co-expression networks. We tested single nucleotide polymorphisms (SNPs) in 2652 candidate genes for statistical associations with wood density, stiffness, microfibril angle and ring width in a population of 1694 white spruce trees (Picea glauca). Association mapping identified 229-292 genes per wood trait using a statistical significance level of P < 0.05 to maximize discovery. Over-representation of genes associated with nearly all traits was found in a xylem preferential co-expression group developed in independent experiments. A xylem co-expression network was reconstructed with 180 wood associated genes and several known MYB and NAC regulators were identified as network hubs. The network revealed a link between the gene PgNAC8, wood stiffness and microfibril angle, as well as considerable within-season variation for both genetic control of wood traits and gene expression. Trait associations were distributed throughout the network suggesting complex interactions and pleiotropic effects. Our findings indicate that integration of association mapping and co-expression networks enhances our understanding of complex wood traits. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.
Deng, Huai; Kerppola, Tom K.
2014-01-01
Interactions among transcription factors control their physiological functions by regulating their binding specificities and transcriptional activities. We implement a strategy to visualize directly the genomic loci that are bound by multi-protein complexes in single cells in Drosophila. This method is based on bimolecular fluorescence complementation (BiFC) analysis of protein interactions on polytene chromosomes. Drosophila Keap1 (dKeap1)-CncC complexes localized to the nucleus and bound chromatin loci that were not bound preferentially by dKeap1 or CncC when they were expressed separately. dKeap1 and CncC binding at these loci was enhanced by phenobarbital, but not by tert-butylhydroquinone (tBHQ) or paraquat. Endogenous dKeap1 and CncC activated transcription of the Jheh (Jheh1, Jheh2, Jheh3) and dKeap1 genes at these loci, whereas CncC alone activated other xenobiotic response genes. Ectopic dKeap1 expression increased CncC binding at the Jheh and dKeap1 gene loci and activated their transcription, whereas dKeap1 inhibited CncC binding at other xenobiotic response gene loci and suppressed their transcription. The combinatorial chromatin-binding specificities and transcriptional activities of dKeap1-CncC complexes mediated the selective activation of different sets of genes by different xenobiotic compounds, in part through feed-forward activation of dKeap1 transcription. PMID:25063457
Molee, A.; Kongroi, K.; Kuadsantia, P.; Poompramun, C.; Likitdecharote, B.
2016-01-01
The aim of the present study was to investigate the effect of single nucleotide polymorphisms in the major histocompatibility complex (MHC) class II gene on resistance to Newcastle disease virus and body weight of the Thai indigenous chicken, Leung Hang Khao (Gallus gallus domesticus). Blood samples were collected for single nucleotide polymorphism analysis from 485 chickens. Polymerase chain reaction sequencing was used to classify single nucleotide polymorphisms of class II MHC. Body weights were measured at the ages of 3, 4, 5, and 7 months. Titres of Newcastle disease virus at 2 weeks to 7 months were determined and the correlation between body weight and titre was analysed. The association between single nucleotide polymorphisms and body weight and titre were analysed by a generalized linear model. Seven single nucleotide polymorphisms were identified: C125T, A126T, C209G, C242T, A243T, C244T, and A254T. Significant correlations between log titre and body weight were found at 2 and 4 weeks. Associations between single nucleotide polymorphisms and titre were found for C209G and A254T, and between all single nucleotide polymorphisms (except A243T) and body weight. The results showed that class II MHC is associated with both titre of Newcastle disease virus and body weight in Leung Hang Khao chickens. This is of concern because improved growth traits are the main goal of breeding selection. Moreover, the results suggested that MHC has a pleiotropic effect on the titre and growth performance. This mechanism should be investigated in a future study. PMID:26732325
Photomodulating Gene Expression by Using Caged siRNAs with Single-Aptamer Modification.
Zhang, Liangliang; Chen, Changmai; Fan, Xinli; Tang, Xinjing
2018-06-18
Caged siRNAs incorporating terminal modification were rationally designed for photochemical regulation of gene silencing induced by RNA interference (RNAi). Through the conjugation of a single oligonucleotide aptamer at the 5' terminus of the antisense RNA strand, enhancement of the blocking effect for RNA-induced silencing complex (RISC) formation/processing was expected, due to the aptamers themselves, to their interaction with large binding proteins, or to both. Two oligonucleotide aptamers (AS1411 and MUC-1) were chosen for aptamer-siRNA conjugation through a photolabile linker. This caging strategy was successfully used to photoregulate gene expression of both firefly luciferase and green fluorescent protein (GFP) in cells. Further patterning experiments revealed that spatial regulation of GFP expression was successfully achieved by using the aptamer-modified caged siRNA and light activation. We expect that further optimized caged siRNAs featuring aptamer conjugation will be promising for practical applications to spatiotemporal photoregulation of gene expression in the future. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Structural Heterogeneity and Functional Domains of Murine Immunoglobulin G Fc Receptors
NASA Astrophysics Data System (ADS)
Ravetch, Jeffrey V.; Luster, Andrew D.; Weinshank, Richard; Kochan, Jarema; Pavlovec, Amalia; Portnoy, Daniel A.; Hulmes, Jeffrey; Pan, Yu-Ching E.; Unkeless, Jay C.
1986-11-01
Binding of antibodies to effector cells by way of receptors to their constant regions (Fc receptors) is central to the pathway that leads to clearance of antigens by the immune system. The structure and function of this important class of receptors on immune cells is addressed through the molecular characterization of Fc receptors (FcR) specific for the murine immunoglobulin G isotype. Structural diversity is encoded by two genes that by alternative splicing result in expression of molecules with highly conserved extracellular domains and different transmembrane and intracytoplasmic domains. The proteins encoded by these genes are members of the immunoglobulin supergene family, most homologous to the major histocompatibility complex molecule Eβ. Functional reconstitution of ligand binding by transfection of individual FcR genes demonstrates that the requirements for ligand binding are encoded in a single gene. These studies demonstrate the molecular basis for the functional heterogeneity of FcR's, accounting for the possible transduction of different signals in response to a single ligand.
Validation of high-throughput single cell analysis methodology.
Devonshire, Alison S; Baradez, Marc-Olivier; Morley, Gary; Marshall, Damian; Foy, Carole A
2014-05-01
High-throughput quantitative polymerase chain reaction (qPCR) approaches enable profiling of multiple genes in single cells, bringing new insights to complex biological processes and offering opportunities for single cell-based monitoring of cancer cells and stem cell-based therapies. However, workflows with well-defined sources of variation are required for clinical diagnostics and testing of tissue-engineered products. In a study of neural stem cell lines, we investigated the performance of lysis, reverse transcription (RT), preamplification (PA), and nanofluidic qPCR steps at the single cell level in terms of efficiency, precision, and limit of detection. We compared protocols using a separate lysis buffer with cell capture directly in RT-PA reagent. The two methods were found to have similar lysis efficiencies, whereas the direct RT-PA approach showed improved precision. Digital PCR was used to relate preamplified template copy numbers to Cq values and reveal where low-quality signals may affect the analysis. We investigated the impact of calibration and data normalization strategies as a means of minimizing the impact of inter-experimental variation on gene expression values and found that both approaches can improve data comparability. This study provides validation and guidance for the application of high-throughput qPCR workflows for gene expression profiling of single cells. Copyright © 2014 Elsevier Inc. All rights reserved.
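Relating template copy numbers to Cq values, as this abstract describes, is commonly done with a log-linear standard curve. The Python sketch below fits such a curve by ordinary least squares and inverts it; it is a generic illustration under stated assumptions, not the workflow validated in the study. The calibration points are invented, and the function names are hypothetical.

```python
import math

# Hypothetical calibration points: (template copies, measured Cq). Invented data.
calibration = [(10, 30.00), (100, 26.68), (1000, 23.36), (10000, 20.04)]


def fit_standard_curve(points):
    """Least-squares fit of Cq = slope * log10(copies) + intercept."""
    xs = [math.log10(copies) for copies, _ in points]
    ys = [cq for _, cq in points]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def copies_from_cq(cq, slope, intercept):
    """Invert the standard curve to estimate template copies from a Cq value."""
    return 10 ** ((cq - intercept) / slope)


slope, intercept = fit_standard_curve(calibration)
# Amplification efficiency implied by the slope (1.0 means perfect doubling).
efficiency = 10 ** (-1 / slope) - 1
print(slope, intercept, efficiency)
print(copies_from_cq(26.68, slope, intercept))
```

A slope near -3.32 cycles per tenfold dilution corresponds to an efficiency close to 1.0, which is why the slope is often reported alongside the curve itself.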
Community of protein complexes impacts disease association
Wang, Qianghu; Liu, Weisha; Ning, Shangwei; Ye, Jingrun; Huang, Teng; Li, Yan; Wang, Peng; Shi, Hongbo; Li, Xia
2012-01-01
One important challenge in the post-genomic era is uncovering the relationships among distinct pathophenotypes by using molecular signatures. Given the complex functional interdependencies between cellular components, a disease is seldom the consequence of a defect in a single gene product, instead reflecting the perturbations of a group of closely related gene products that carry out specific functions together. Therefore, it is meaningful to explore how the community of protein complexes impacts disease associations. Here, by integrating a large amount of information from protein complexes and the cellular basis of diseases, we built a human disease network in which two diseases are linked if they share a common disease-related protein complex. A systemic analysis revealed that linked disease pairs exhibit higher comorbidity than those that have no links, and that the stronger the association between two diseases based on protein complexes, the higher the comorbidity they are prone to display. Moreover, more connected diseases tend to be malignant and to have high prevalence. We provide novel disease associations that cannot be identified through previous analysis. These findings will potentially provide biologists and clinicians new insights into the etiology, classification and treatment of diseases. PMID:22549411
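The network construction described above (two diseases are linked if they share a disease-related protein complex) can be sketched as follows. This is an illustrative sketch only: the function name, the input layout and the disease and complex labels are hypothetical, and using the number of shared complexes as the edge weight is an assumption standing in for the association strength used in the study.

```python
from itertools import combinations

def build_disease_network(disease_to_complexes):
    """Link two diseases when they share at least one protein complex.

    Returns a dict mapping (disease_a, disease_b) to the number of shared
    complexes, used here as a simple stand-in for association strength.
    """
    edges = {}
    for a, b in combinations(sorted(disease_to_complexes), 2):
        shared = disease_to_complexes[a] & disease_to_complexes[b]
        if shared:
            edges[(a, b)] = len(shared)
    return edges

# Hypothetical toy input, not data from the study.
toy_annotations = {
    "disease_A": {"complex_1", "complex_2"},
    "disease_B": {"complex_2", "complex_3"},
    "disease_C": {"complex_4"},
}
toy_network = build_disease_network(toy_annotations)
```

In this toy input only disease_A and disease_B are linked, through complex_2.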
A powerful score-based test statistic for detecting gene-gene co-association.
Xu, Jing; Yuan, Zhongshang; Ji, Jiadong; Zhang, Xiaoshuai; Li, Hongkai; Wu, Xuesen; Xue, Fuzhong; Liu, Yanxun
2016-01-29
The genetic variants identified by genome-wide association studies (GWAS) can only account for a small proportion of the total heritability for complex diseases. The existence of gene-gene joint effects, which contain the main effects and their co-association, is one of the possible explanations for the "missing heritability" problem. Gene-gene co-association refers to the extent to which the joint effects of two genes differ from the main effects, not only due to the traditional interaction under nearly independent condition but also due to the correlation between genes. Generally, genes tend to work collaboratively within a specific pathway or network contributing to the disease, and the specific disease-associated loci will often be highly correlated (e.g. single nucleotide polymorphisms (SNPs) in linkage disequilibrium). Therefore, we proposed a novel score-based statistic (SBS) as a gene-based method for detecting gene-gene co-association. Various simulations illustrate that, under different sample sizes, marginal effects of causal SNPs and co-association levels, the proposed SBS performs better than other existing methods, including single SNP-based and principal component analysis (PCA)-based logistic regression models, the statistics based on canonical correlations (CCU), kernel canonical correlation analysis (KCCU), partial least squares path modeling (PLSPM) and the delta-square (δ²) statistic. The real data analysis of rheumatoid arthritis (RA) further confirmed its advantages in practice. SBS is a powerful and efficient gene-based method for detecting gene-gene co-association.
Booher, Nicholas J.; Carpenter, Sara C. D.; Sebra, Robert P.; Wang, Li; Salzberg, Steven L.; Leach, Jan E.
2015-01-01
Pathogen-injected, direct transcriptional activators of host genes, TAL (transcription activator-like) effectors play determinative roles in plant diseases caused by Xanthomonas spp. A large domain of nearly identical, 33–35 aa repeats in each protein mediates DNA recognition. This modularity makes TAL effectors customizable and thus important also in biotechnology. However, the repeats render TAL effector (tal) genes nearly impossible to assemble using next-generation, short reads. Here, we demonstrate that long-read, single molecule real-time (SMRT) sequencing solves this problem. Taking an ensemble approach to first generate local, tal gene contigs, we correctly assembled de novo the genomes of two strains of the rice pathogen X. oryzae completed previously using the Sanger method and even identified errors in those references. Sequencing two more strains revealed a dynamic genome structure and a striking plasticity in tal gene content. Our results pave the way for population-level studies to inform resistance breeding, improve biotechnology and probe TAL effector evolution. PMID:27148456
Modeling stochastic noise in gene regulatory systems
Meister, Arwen; Du, Chao; Li, Ye Henry; Wong, Wing Hung
2014-01-01
The Master equation is considered the gold standard for modeling the stochastic mechanisms of gene regulation in molecular detail, but it is too complex to solve exactly in most cases, so approximation and simulation methods are essential. However, there is still a lack of consensus about the best way to carry these out. To help clarify the situation, we review Master equation models of gene regulation, theoretical approximations based on an expansion method due to N.G. van Kampen and R. Kubo, and simulation algorithms due to D.T. Gillespie and P. Langevin. Expansion of the Master equation shows that for systems with a single stable steady-state, the stochastic model reduces to a deterministic model in a first-order approximation. Additional theory, also due to van Kampen, describes the asymptotic behavior of multistable systems. To support and illustrate the theory and provide further insight into the complex behavior of multistable systems, we perform a detailed simulation study comparing the various approximation and simulation methods applied to synthetic gene regulatory systems with various qualitative characteristics. The simulation studies show that for large stochastic systems with a single steady-state, deterministic models are quite accurate, since the probability distribution of the solution has a single peak tracking the deterministic trajectory whose variance is inversely proportional to the system size. In multistable stochastic systems, large fluctuations can cause individual trajectories to escape from the domain of attraction of one steady-state and be attracted to another, so the system eventually reaches a multimodal probability distribution in which all stable steady-states are represented proportional to their relative stability. However, since the escape time scales exponentially with system size, this process can take a very long time in large systems. PMID:25632368
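As a concrete illustration of the Gillespie simulation approach mentioned above, here is a minimal sketch for a single-species birth-death model of gene expression. The model, the function name and the parameter names (k_prod, k_deg, omega for system size) are assumptions chosen for illustration; the review itself covers far more general regulatory systems.

```python
import random

def gillespie_birth_death(k_prod, k_deg, omega, t_end, seed=0):
    """Gillespie's direct method for production (rate k_prod * omega)
    and first-order degradation (rate k_deg * n) of one molecular species.

    Returns the event times and the molecule count after each event.
    """
    rng = random.Random(seed)
    t, n = 0.0, 0
    times, counts = [0.0], [0]
    while True:
        a_birth = k_prod * omega          # propensity of production
        a_death = k_deg * n               # propensity of degradation
        a_total = a_birth + a_death
        t += rng.expovariate(a_total)     # exponential waiting time to next event
        if t >= t_end:
            break
        # Choose which reaction fires, with probability proportional to propensity.
        if rng.random() * a_total < a_birth:
            n += 1
        else:
            n -= 1
        times.append(t)
        counts.append(n)
    return times, counts

times, counts = gillespie_birth_death(k_prod=10.0, k_deg=1.0, omega=5, t_end=50.0)
```

For this model the deterministic steady state is n = k_prod * omega / k_deg, and the stationary distribution of n is Poisson with that mean, so the variance of the concentration n / omega scales as 1 / omega. That matches the abstract's statement that, for systems with a single steady state, the variance around the deterministic trajectory is inversely proportional to system size.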
Pleiotropic and Epistatic Network-Based Discovery: Integrated Networks for Target Gene Discovery
DOE Office of Scientific and Technical Information (OSTI.GOV)
Weighill, Deborah; Jones, Piet; Shah, Manesh
Biological organisms are complex systems that are composed of functional networks of interacting molecules and macro-molecules. Complex phenotypes are the result of orchestrated, hierarchical, heterogeneous collections of expressed genomic variants. However, the effects of these variants are the result of historic selective pressure and current environmental and epigenetic signals, and, as such, their co-occurrence can be seen as genome-wide correlations in a number of different manners. Biomass recalcitrance (i.e., the resistance of plants to degradation or deconstruction, which ultimately enables access to a plant's sugars) is a complex polygenic phenotype of high importance to biofuels initiatives. This study makes use of data derived from the re-sequenced genomes from over 800 different Populus trichocarpa genotypes in combination with metabolomic and pyMBMS data across this population, as well as co-expression and co-methylation networks in order to better understand the molecular interactions involved in recalcitrance, and identify target genes involved in lignin biosynthesis/degradation. A Lines Of Evidence (LOE) scoring system is developed to integrate the information in the different layers and quantify the number of lines of evidence linking genes to target functions. This new scoring system was applied to quantify the lines of evidence linking genes to lignin-related genes and phenotypes across the network layers, and allowed for the generation of new hypotheses surrounding potential new candidate genes involved in lignin biosynthesis in P. trichocarpa, including various AGAMOUS-LIKE genes. Lastly, the resulting Genome Wide Association Study networks, integrated with Single Nucleotide Polymorphism (SNP) correlation, co-methylation, and co-expression networks through the LOE scores are proving to be a powerful approach to determine the pleiotropic and epistatic relationships underlying cellular functions and, as such, the molecular basis for complex phenotypes, such as recalcitrance.
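The idea of counting lines of evidence across network layers can be sketched as below. This is a deliberately simplified illustration, not the LOE scoring system from the study: the function name, the layer names and the gene labels are hypothetical, and the score here is just the number of layers in which a gene is directly connected to at least one target gene.

```python
def lines_of_evidence(layers, targets):
    """Count, for each non-target gene, the number of network layers
    in which it has an edge to at least one target gene."""
    scores = {}
    for edges in layers.values():
        linked = set()
        for u, v in edges:
            if u in targets and v not in targets:
                linked.add(v)
            if v in targets and u not in targets:
                linked.add(u)
        # Each layer contributes at most one line of evidence per gene.
        for gene in linked:
            scores[gene] = scores.get(gene, 0) + 1
    return scores

# Hypothetical toy layers, not data from the study.
toy_layers = {
    "co_expression": [("gene_A", "lignin_1"), ("gene_B", "gene_C")],
    "co_methylation": [("lignin_2", "gene_A")],
    "snp_correlation": [("gene_B", "lignin_1"), ("gene_A", "lignin_1")],
}
toy_scores = lines_of_evidence(toy_layers, targets={"lignin_1", "lignin_2"})
```

Genes supported by more layers (gene_A here, with three) would rank higher as candidates than genes supported by a single layer.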
Pleiotropic and Epistatic Network-Based Discovery: Integrated Networks for Target Gene Discovery
Weighill, Deborah; Jones, Piet; Shah, Manesh; ...
2018-05-11
Biological organisms are complex systems that are composed of functional networks of interacting molecules and macro-molecules. Complex phenotypes are the result of orchestrated, hierarchical, heterogeneous collections of expressed genomic variants. However, the effects of these variants are the result of historic selective pressure and current environmental and epigenetic signals, and, as such, their co-occurrence can be seen as genome-wide correlations in a number of different manners. Biomass recalcitrance (i.e., the resistance of plants to degradation or deconstruction, which ultimately enables access to a plant's sugars) is a complex polygenic phenotype of high importance to biofuels initiatives. This study makes use of data derived from the re-sequenced genomes from over 800 different Populus trichocarpa genotypes in combination with metabolomic and pyMBMS data across this population, as well as co-expression and co-methylation networks in order to better understand the molecular interactions involved in recalcitrance, and identify target genes involved in lignin biosynthesis/degradation. A Lines Of Evidence (LOE) scoring system is developed to integrate the information in the different layers and quantify the number of lines of evidence linking genes to target functions. This new scoring system was applied to quantify the lines of evidence linking genes to lignin-related genes and phenotypes across the network layers, and allowed for the generation of new hypotheses surrounding potential new candidate genes involved in lignin biosynthesis in P. trichocarpa, including various AGAMOUS-LIKE genes. Lastly, the resulting Genome Wide Association Study networks, integrated with Single Nucleotide Polymorphism (SNP) correlation, co-methylation, and co-expression networks through the LOE scores are proving to be a powerful approach to determine the pleiotropic and epistatic relationships underlying cellular functions and, as such, the molecular basis for complex phenotypes, such as recalcitrance.
The putative oncogene Pim-1 in the mouse: its linkage and variation among t haplotypes.
Nadeau, J H; Phillips, S J
1987-11-01
Pim-1, a putative oncogene involved in T-cell lymphomagenesis, was mapped between the pseudo-alpha globin gene Hba-4ps and the alpha-crystallin gene Crya-1 on mouse chromosome 17 and therefore within the t complex. Pim-1 restriction fragment variants were identified among t haplotypes. Analysis of restriction fragment sizes obtained with 12 endonucleases demonstrated that the Pim-1 genes in some t haplotypes were indistinguishable from the sizes for the Pim-1b allele in BALB/c inbred mice. There are now three genes, Pim-1, Crya-1 and H-2 I-E, that vary among independently derived t haplotypes and that have indistinguishable alleles in t haplotypes and inbred strains. These genes are closely linked within the distal inversion of the t complex. Because it is unlikely that these variants arose independently in t haplotypes and their wild-type homologues, we propose that an exchange of chromosomal segments, probably through double crossingover, was responsible for indistinguishable Pim-1 genes shared by certain t haplotypes and their wild-type homologues. There was, however, no apparent association between variant alleles of these three genes among t haplotypes as would be expected if a single exchange introduced these alleles into t haplotypes. If these variant alleles can be shown to be identical to the wild-type allele, then lack of association suggests that multiple exchanges have occurred during the evolution of the t complex.
Benoit, Isabelle; Zhou, Miaomiao; Vivas Duarte, Alexandra; Downes, Damien J.; Todd, Richard B.; Kloezen, Wendy; Post, Harm; Heck, Albert J. R.; Maarten Altelaar, A. F.; de Vries, Ronald P.
2015-01-01
Degradation of plant biomass to fermentable sugars is of critical importance for the use of plant materials for biofuels. Filamentous fungi are ubiquitous organisms and major plant biomass degraders. Single colonies of some fungal species can colonize massive areas as large as five soccer stadia. During growth, the mycelium encounters heterogeneous carbon sources. Here we assessed whether substrate heterogeneity is a major determinant of spatial gene expression in colonies of Aspergillus niger. We analyzed whole-genome gene expression in five concentric zones of 5-day-old colonies utilizing sugar beet pulp as a complex carbon source. Growth, protein production and secretion occurred throughout the colony. Genes involved in carbon catabolism were expressed uniformly from the centre to the periphery whereas genes encoding plant biomass degrading enzymes and nitrate utilization were expressed differentially across the colony. A combined adaptive response of carbon-catabolism and enzyme production to locally available monosaccharides was observed. Finally, our results demonstrate that A. niger employs different enzymatic tools to adapt its metabolism as it colonizes complex environments. PMID:26314379
Benoit, Isabelle; Zhou, Miaomiao; Vivas Duarte, Alexandra; Downes, Damien J; Todd, Richard B; Kloezen, Wendy; Post, Harm; Heck, Albert J R; Maarten Altelaar, A F; de Vries, Ronald P
2015-08-28
Degradation of plant biomass to fermentable sugars is of critical importance for the use of plant materials for biofuels. Filamentous fungi are ubiquitous organisms and major plant biomass degraders. Single colonies of some fungal species can colonize massive areas as large as five soccer stadia. During growth, the mycelium encounters heterogeneous carbon sources. Here we assessed whether substrate heterogeneity is a major determinant of spatial gene expression in colonies of Aspergillus niger. We analyzed whole-genome gene expression in five concentric zones of 5-day-old colonies utilizing sugar beet pulp as a complex carbon source. Growth, protein production and secretion occurred throughout the colony. Genes involved in carbon catabolism were expressed uniformly from the centre to the periphery whereas genes encoding plant biomass degrading enzymes and nitrate utilization were expressed differentially across the colony. A combined adaptive response of carbon-catabolism and enzyme production to locally available monosaccharides was observed. Finally, our results demonstrate that A. niger employs different enzymatic tools to adapt its metabolism as it colonizes complex environments.
Hulse-Kemp, Amanda M; Maheshwari, Shamoni; Stoffel, Kevin; Hill, Theresa A; Jaffe, David; Williams, Stephen R; Weisenfeld, Neil; Ramakrishnan, Srividya; Kumar, Vijay; Shah, Preyas; Schatz, Michael C; Church, Deanna M; Van Deynze, Allen
2018-01-01
Linked-Read sequencing technology has recently been employed successfully for de novo assembly of human genomes; however, the utility of this technology for complex plant genomes is unproven. We evaluated the technology for this purpose by sequencing the 3.5-gigabase (Gb) diploid pepper (Capsicum annuum) genome with a single Linked-Read library. Plant genomes, including pepper, are characterized by long, highly similar repetitive sequences. Accordingly, significant effort is used to ensure that the sequenced plant is highly homozygous and the resulting assembly is a haploid consensus. With a phased assembly approach, we targeted a heterozygous F1 derived from a wide cross to assess the ability to derive both haplotypes and characterize a pungency gene with a large insertion/deletion. The Supernova software generated a highly ordered, more contiguous sequence assembly than all currently available C. annuum reference genomes. Over 83% of the final assembly was anchored and oriented using four publicly available de novo linkage maps. A comparison of the annotation of conserved eukaryotic genes indicated the completeness of assembly. The validity of the phased assembly is further demonstrated with the complete recovery of both 2.5-Kb insertion/deletion haplotypes of the PUN1 locus in the F1 sample that represents pungent and nonpungent peppers, as well as nearly full recovery of the BUSCO2 gene set within each of the two haplotypes. The most contiguous pepper genome assembly to date has been generated, which demonstrates that Linked-Read library technology provides a tool to de novo assemble complex, highly repetitive, heterozygous plant genomes. This technology can provide an opportunity to cost-effectively develop high-quality genome assemblies for other complex plants and compare structural and gene differences through accurate haplotype reconstruction.
Tiezzi, Francesco; Parker-Gaddis, Kristen L.; Cole, John B.; Clay, John S.; Maltecca, Christian
2015-01-01
Clinical mastitis (CM) is one of the health disorders with large impacts on dairy farming profitability and animal welfare. The objective of this study was to perform a genome-wide association study (GWAS) for CM in first-lactation Holsteins. Producer-recorded mastitis event information for 103,585 first-lactation cows was used, together with genotype information on 1,361 bulls from the Illumina BovineSNP50 BeadChip. Single-step genomic-BLUP methodology was used to incorporate genomic data into a threshold-liability model. Association analysis confirmed that CM follows a highly polygenic mode of inheritance. However, 10-adjacent-SNP windows showed that regions on chromosomes 2, 14 and 20 have impacts on genetic variation for CM. Some of the genes located on chromosome 14 (LY6K, LY6D, LYNX1, LYPD2, SLURP1, PSCA) are part of the lymphocyte-antigen-6 complex (LY6) known for its neutrophil regulation function linked to the major histocompatibility complex. Other genes on chromosome 2 were also involved in regulating immune response (IFIH1, LY75, and DPP4), or are themselves regulated in the presence of specific pathogens (ITGB6, NR4A2). Other genes annotated on chromosome 20 are involved in mammary gland metabolism (GHR, OXCT1), antibody production and phagocytosis of bacterial cells (C6, C7, C9, C1QTNF3), tumor suppression (DAB2), involution of mammary epithelium (OSMR) and cytokine regulation (PRLR). DAVID enrichment analysis revealed 5 KEGG pathways. The JAK-STAT signaling pathway (cell proliferation and apoptosis) and the ‘Cytokine-cytokine receptor interaction’ (cytokine and interleukin response to infectious agents) are co-regulated and linked to the ‘ABC transporters’ pathway also found here. Gene network analysis performed using GeneMania revealed a co-expression network where 665 interactions existed among 145 of the genes reported above. 
Clinical mastitis is a complex trait and the different genes regulating immune response are known to be pathogen-specific. Despite the lack of information in this study, candidate QTL for CM were identified in the US Holstein population. PMID:25658712
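The 10-adjacent-SNP window summary mentioned in the abstract above can be illustrated with a simple sliding-window sum. This is a sketch under assumptions: the function name is hypothetical, the input is taken to be a per-SNP share of genetic variance in map order along one chromosome, and the exact way the study aggregated SNP effects into windows is not given in the abstract.

```python
def window_variance(snp_variance, window=10):
    """Sum a per-SNP quantity over every run of `window` adjacent SNPs.

    snp_variance is ordered by map position along one chromosome.
    Returns one total per window start position.
    """
    if len(snp_variance) < window:
        return []
    current = sum(snp_variance[:window])
    totals = [current]
    for i in range(window, len(snp_variance)):
        # Slide the window by one SNP: add the new SNP, drop the oldest one.
        current += snp_variance[i] - snp_variance[i - window]
        totals.append(current)
    return totals

# Toy values with a window of 2 for readability (not study data).
toy_totals = window_variance([1, 2, 3, 4, 5], window=2)
```

Windows whose totals stand out against the polygenic background would be the ones flagged as candidate regions.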
Apicella, Coren L.; Cesarini, David; Johannesson, Magnus; Dawes, Christopher T.; Lichtenstein, Paul; Wallace, Björn; Beauchamp, Jonathan; Westberg, Lars
2010-01-01
Background: Oxytocin (OXT) has been implicated in a suite of complex social behaviors including observed choices in economic laboratory experiments. However, actual studies of associations between oxytocin receptor (OXTR) gene variants and experimentally elicited social preferences are rare. Methodology/Principal Findings: We test hypotheses of associations between social preferences, as measured by behavior in two economic games, and 9 single nucleotide polymorphisms (SNPs) of the OXTR gene in a sample of Swedish twins (n = 684). Two standard economic games, the dictator game and the trust game, both involving real monetary consequences, were used to elicit such preferences. After correction for multiple hypothesis testing, we found no significant associations between any of the 9 single nucleotide polymorphisms (SNPs) and behavior in either of the games. Conclusion: We were unable to replicate the most significant association reported in previous research between the amount donated in a dictator game and an OXTR genetic variant. PMID:20585395
Apicella, Coren L; Cesarini, David; Johannesson, Magnus; Dawes, Christopher T; Lichtenstein, Paul; Wallace, Björn; Beauchamp, Jonathan; Westberg, Lars
2010-06-16
Oxytocin (OXT) has been implicated in a suite of complex social behaviors including observed choices in economic laboratory experiments. However, actual studies of associations between oxytocin receptor (OXTR) gene variants and experimentally elicited social preferences are rare. We test hypotheses of associations between social preferences, as measured by behavior in two economic games, and 9 single nucleotide polymorphisms (SNPs) of the OXTR gene in a sample of Swedish twins (n = 684). Two standard economic games, the dictator game and the trust game, both involving real monetary consequences, were used to elicit such preferences. After correction for multiple hypothesis testing, we found no significant associations between any of the 9 single nucleotide polymorphisms (SNPs) and behavior in either of the games. We were unable to replicate the most significant association reported in previous research between the amount donated in a dictator game and an OXTR genetic variant.
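The abstract above reports results after correction for multiple hypothesis testing but does not name the procedure. As an illustration only, the sketch below uses the Bonferroni correction, which is an assumption and not necessarily what the authors applied; the function name and the p-values are hypothetical.

```python
def bonferroni_reject(p_values, alpha=0.05):
    """Flag each test as significant only if p < alpha / number_of_tests."""
    threshold = alpha / len(p_values)
    return [p < threshold for p in p_values]

# Hypothetical p-values for 9 SNP tests (illustrative, not study results).
toy_p_values = [0.012, 0.20, 0.47, 0.03, 0.81, 0.09, 0.66, 0.35, 0.52]
toy_decisions = bonferroni_reject(toy_p_values)
```

With 9 tests the per-test threshold drops to about 0.0056, so nominally significant values such as 0.012 or 0.03 no longer count as significant, which is the kind of outcome the abstract describes.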
Congenital Chromosomal Syndromes—A Model for Pathogenesis
Rohde, Russell A.
1965-01-01
The origin of anomalies in the chromosomal syndromes is believed to be both polyetiologic and polypathogenetic. Whereas some malformations quite clearly appear to result from single gene mutations or from genic imbalance due to individual monosomic or trisomic loci, other anomalies (singly or in complex patterns) are better interpreted as originating from disturbances in particular biochemical pathways which affect the development of a variety of traits. Additional phenogenetic studies and the use of sophisticated biochemical analysis in persons with complex patterns of anomalies should provide a truer understanding of disease mechanisms and provide guidance for future studies which are aimed at the treatment and prevention of these intriguing misadventures of Nature. PMID:5318572
Osborne, Peter W; Benoit, Gérard; Laudet, Vincent; Schubert, Michael; Ferrier, David E K
2009-03-01
The ParaHox cluster is the evolutionary sister to the Hox cluster. Like the Hox cluster, the ParaHox cluster displays spatial and temporal regulation of the component genes along the anterior/posterior axis in a manner that correlates with the gene positions within the cluster (a feature called collinearity). The ParaHox cluster is however a simpler system to study because it is composed of only three genes. We provide a detailed analysis of the amphioxus ParaHox cluster and, for the first time in a single species, examine the regulation of the cluster in response to a single developmental signalling molecule, retinoic acid (RA). Embryos treated with either RA or RA antagonist display altered ParaHox gene expression: AmphiGsx expression shifts in the neural tube, and the endodermal boundary between AmphiXlox and AmphiCdx shifts its anterior/posterior position. We identified several putative retinoic acid response elements and in vitro assays suggest some may participate in RA regulation of the ParaHox genes. By comparison to vertebrate ParaHox gene regulation we explore the evolutionary implications. This work highlights how insights into the regulation and evolution of more complex vertebrate arrangements can be obtained through studies of a simpler, unduplicated amphioxus gene cluster.
A reduced transcriptome approach to assess environmental toxicants using zebrafish embryo tests
This paper reports on the pilot testing of a new bioassay platform that monitors expression of 1600 genes in zebrafish embryos exposed to either single chemicals or complex water samples. The method provides a more cost effective, high throughput means to broadly evaluate the pot...
Maruyama, Kohei; Takeyama, Haruko; Nemoto, Etsuo; Tanaka, Tsuyoshi; Yoda, Kiyoshi; Matsunaga, Tadashi
2004-09-20
Single nucleotide polymorphism (SNP) detection for aldehyde dehydrogenase 2 (ALDH2) gene based on DNA thermal dissociation curve analysis was successfully demonstrated using an automated system with bacterial magnetic particles (BMPs) by developing a new method for avoiding light scattering caused by nanometer-size particles when using commercially available fluorescent dyes such as FITC, Cy3, and Cy5 as labeling chromophores. Biotin-labeled PCR products in ALDH2, two allele-specific probes (Cy3-labeled detection probe for ALDH2*1 and Cy5-labeled detection probe for ALDH2*2), streptavidin-immobilized BMPs (SA-BMPs) were simultaneously mixed. The mixture was denatured at 70 degrees C for 3 min, cooled slowly to 25 degrees C, and incubated for 10 min, allowing the DNA duplex to form between Cy3- or Cy5-labeled detection probes and biotin-labeled PCR products on SA-BMPs. Then duplex DNA-BMP complex was heated to 58 degrees C, a temperature determined by dissociation curve analysis and a dissociated single-base mismatched detection probe was removed at the same temperature under precise control. Furthermore, fluorescence signal from the detection probe was liberated into the supernatant from completely matched duplex DNA-BMP complex by heating to 80 degrees C and measured. In the homozygote target DNA (ALDH2*1/*1 and ALDH2*2/*2), the fluorescence signals from single-base mismatched were decreased to background level, indicating that mismatched hybridization was efficiently removed by the washing process. In the heterozygote target DNA (ALDH2*1/*2), each fluorescence signals was at a similar level. Therefore, three genotypes of SNP in ALDH2 gene were detected using the automated detection system with BMPs. Copyright 2004 Wiley Periodicals, Inc.
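The genotype readout described above follows a simple decision rule: a Cy3 signal above background indicates the ALDH2*1 allele, a Cy5 signal above background indicates ALDH2*2, and both together indicate a heterozygote. The sketch below encodes that rule; the function name, the single shared background threshold and the "no call" outcome are assumptions added for illustration.

```python
def call_aldh2_genotype(cy3_signal, cy5_signal, background):
    """Call the ALDH2 genotype from the two probe fluorescence signals.

    cy3_signal: fluorescence from the ALDH2*1 detection probe
    cy5_signal: fluorescence from the ALDH2*2 detection probe
    background: level at or below which a signal is treated as absent
    """
    has_allele_1 = cy3_signal > background
    has_allele_2 = cy5_signal > background
    if has_allele_1 and has_allele_2:
        return "ALDH2*1/*2"
    if has_allele_1:
        return "ALDH2*1/*1"
    if has_allele_2:
        return "ALDH2*2/*2"
    return "no call"
```

For example, call_aldh2_genotype(850, 40, background=100) yields the ALDH2*1/*1 homozygote under these assumed units.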
2010-01-01
Background: Horizontal gene transfer (HGT) is relatively common in plant mitochondrial genomes but the mechanisms, extent and consequences of transfer remain largely unknown. Previous results indicate that parasitic plants are often involved as either transfer donors or recipients, suggesting that direct contact between parasite and host facilitates genetic transfer among plants. Results: In order to uncover the mechanistic details of plant-to-plant HGT, the extent and evolutionary fate of transfer was investigated between two groups: the parasitic genus Cuscuta and a small clade of Plantago species. A broad polymerase chain reaction (PCR) survey of mitochondrial genes revealed that at least three genes (atp1, atp6 and matR) were recently transferred from Cuscuta to Plantago. Quantitative PCR assays show that these three genes have a mitochondrial location in the one species line of Plantago examined. Patterns of sequence evolution suggest that these foreign genes degraded into pseudogenes shortly after transfer and reverse transcription (RT)-PCR analyses demonstrate that none are detectably transcribed. Three cases of gene conversion were detected between native and foreign copies of the atp1 gene. The identical phylogenetic distribution of the three foreign genes within Plantago and the retention of cytidines at ancestral positions of RNA editing indicate that these genes were probably acquired via a single, DNA-mediated transfer event. However, samplings of multiple individuals from two of the three species in the recipient Plantago clade revealed complex and perplexing phylogenetic discrepancies and patterns of sequence divergence for all three of the foreign genes. Conclusions: This study reports the best evidence to date that multiple mitochondrial genes can be transferred via a single HGT event and that transfer occurred via a strictly DNA-level intermediate. 
The discovery of gene conversion between co-resident foreign and native mitochondrial copies suggests that transferred genes may be evolutionarily important in generating mitochondrial genetic diversity. Finally, the complex relationships within each lineage of transferred genes imply a surprisingly complicated history of these genes in Plantago subsequent to their acquisition via HGT and this history probably involves some combination of additional transfers (including intracellular transfer), gene duplication, differential loss and mutation-rate variation. Unravelling this history will probably require sequencing multiple mitochondrial and nuclear genomes from Plantago. See Commentary: http://www.biomedcentral.com/1741-7007/8/147. PMID:21176201
Chacón-Sánchez, María I.; Martínez-Castillo, Jaime
2017-01-01
Plant domestication can be seen as a long-term process that involves a complex interplay among demographic processes and evolutionary forces. Previous studies have suggested two domestication scenarios for Lima bean in Mesoamerica: two separate domestication events, one from gene pool MI in central-western Mexico and another one from gene pool MII in the area Guatemala-Costa Rica, or a single domestication from gene pool MI in central-western Mexico followed by post-domestication gene flow with wild populations. In this study we evaluated the genetic structure of the wild gene pool and tested these two competing domestication scenarios of Lima bean in Mesoamerica by applying an ABC approach to a set of genome-wide SNP markers. The results confirm the existence of three gene pools in wild Lima bean, two Mesoamerican gene pools (MI and MII) and the Andean gene pool (AI), and suggest the existence of another gene pool in central Colombia. The results indicate that although both domestication scenarios may be supported by genetic data, higher statistical support was given to the single domestication scenario in central-western Mexico followed by admixture with wild populations. Domestication would have involved strong founder effects reflected in loss of genetic diversity and increased LD levels in landraces. Genomic regions affected by selection were detected and these may harbor candidate genes related to domestication. PMID:28955351
Protein import and the origin of red complex plastids.
Gould, Sven B; Maier, Uwe-G; Martin, William F
2015-06-15
The number and nature of endosymbioses involving red algal endosymbionts are debated. Gene phylogenies have become the most popular tool to untangle this issue, but they deliver conflicting results. As gene and lineage sampling has increased, so have both the number of conflicting trees and the number of suggestions in the literature for multiple tertiary, and even quaternary, symbioses that might reconcile the tree conflicts. Independent lines of evidence that can address the issue are needed. Here we summarize the mechanism and machinery of protein import into complex red plastids. The process involves protein translocation machinery, known as SELMA, that arose once in evolution, that facilitates protein import across the second outermost of the four plastid membranes, and that is always targeted specifically to that membrane, regardless of where it is encoded today. It is widely accepted that the unity of protein import across the two membranes of primary plastids is strong evidence for their single cyanobacterial origin. Similarly, the unity of SELMA-dependent protein import across the second outermost plastid membrane constitutes strong evidence for the existence of a single red secondary endosymbiotic event at the common origin of all red complex plastids. We furthermore propose that the two outer membranes of red complex plastids are derived from host endoplasmic reticulum in the initial red secondary endosymbiotic event. Copyright © 2015 Elsevier Ltd. All rights reserved.
Improved maize reference genome with single-molecule technologies.
Jiao, Yinping; Peluso, Paul; Shi, Jinghua; Liang, Tiffany; Stitzer, Michelle C; Wang, Bo; Campbell, Michael S; Stein, Joshua C; Wei, Xuehong; Chin, Chen-Shan; Guill, Katherine; Regulski, Michael; Kumari, Sunita; Olson, Andrew; Gent, Jonathan; Schneider, Kevin L; Wolfgruber, Thomas K; May, Michael R; Springer, Nathan M; Antoniou, Eric; McCombie, W Richard; Presting, Gernot G; McMullen, Michael; Ross-Ibarra, Jeffrey; Dawe, R Kelly; Hastie, Alex; Rank, David R; Ware, Doreen
2017-06-22
Complete and accurate reference genomes and annotations provide fundamental tools for characterization of genetic and functional variation. These resources facilitate the determination of biological processes and support translation of research findings into improved and sustainable agricultural technologies. Many reference genomes for crop plants have been generated over the past decade, but these genomes are often fragmented and missing complex repeat regions. Here we report the assembly and annotation of a reference genome of maize, a genetic and agricultural model species, using single-molecule real-time sequencing and high-resolution optical mapping. Relative to the previous reference genome, our assembly features a 52-fold increase in contig length and notable improvements in the assembly of intergenic spaces and centromeres. Characterization of the repetitive portion of the genome revealed more than 130,000 intact transposable elements, allowing us to identify transposable element lineage expansions that are unique to maize. Gene annotations were updated using 111,000 full-length transcripts obtained by single-molecule real-time sequencing. In addition, comparative optical mapping of two other inbred maize lines revealed a prevalence of deletions in regions of low gene density and maize lineage-specific genes.
Andrés-Benito, Pol; Moreno, Jesús; Aso, Ester; Povedano, Mónica; Ferrer, Isidro
2017-01-01
Transcriptome arrays identify 747 genes differentially expressed in the anterior horn of the spinal cord and 2,300 genes differentially expressed in frontal cortex area 8 in a single group of typical sALS cases without frontotemporal dementia compared with age-matched controls. Main up-regulated clusters in the anterior horn are related to inflammation and apoptosis; down-regulated clusters are linked to axoneme structures and protein synthesis. In contrast, up-regulated gene clusters in frontal cortex area 8 involve neurotransmission, synaptic proteins and vesicle trafficking, whereas main down-regulated genes cluster into oligodendrocyte function and myelin-related proteins. RT-qPCR validates the expression of 58 of 66 assessed genes from different clusters. The present results: a. reveal regional differences in de-regulated gene expression between the anterior horn of the spinal cord and frontal cortex area 8 in the same individuals suffering from sALS; b. validate and extend our knowledge about the complexity of the inflammatory response in the anterior horn of the spinal cord; and c. identify for the first time extensive gene up-regulation of neurotransmission and synaptic-related genes, together with significant down-regulation of oligodendrocyte- and myelin-related genes, as important contributors to the pathogenesis of frontal cortex alterations in the sALS/frontotemporal lobar degeneration spectrum complex at stages with no apparent cognitive impairment. PMID:28283675
Nutrigenetics and prostate cancer: 2011 and beyond.
Yuan, Yinan; Ferguson, Lynnette R
2011-01-01
Prostate cancer runs in families and shows a clear dietary involvement. Until recently, the key risk gene(s) have proved elusive. We summarise current understandings of nutrient-gene interactions in prostate cancer risk and progression. A MEDLINE-based literature search was conducted. Hypothesis-directed candidate gene approaches provide plausible, albeit statistically weak, nutrient-gene interactions. These are based on early understandings of factors likely to impact on carcinogenesis, including both nutrient and genetic effects on androgen biosynthesis and action, xenobiotic metabolism, DNA damage and DNA repair. Non-hypothesis-directed genome-wide association studies provide much stronger evidence for other genes, not hitherto suspected for involvement. Although only a few of these have been formally tested for dietary associations in well-designed epidemiologic studies, the nature of many of the genes suggests that their activity may be regulated by nutrients. These effects may not only be relevant to prostate cancer susceptibility, but also to disease progression. It will be important to move beyond studying single nucleotide polymorphisms, into more complex chromosomal rearrangements and to epigenetic changes. For future progress, large international cohorts will not only need to provide proof of individual nutrient-gene interactions, but also to relate these to more complex nutrient-gene-gene interactions, as parts of pathways. Bioinformatics and biostatistics will be increasingly important tools in nutrigenetic studies beyond 2011. Copyright © 2011 S. Karger AG, Basel.
Sidorenko, Lyudmila; Dorweiler, Jane E; Cigan, A Mark; Arteaga-Vazquez, Mario; Vyas, Meenal; Kermicle, Jerry; Jurcin, Diane; Brzeski, Jan; Cai, Yu; Chandler, Vicki L
2009-11-01
Paramutation involves homologous sequence communication that leads to meiotically heritable transcriptional silencing. We demonstrate that mop2 (mediator of paramutation2), which alters paramutation at multiple loci, encodes a gene similar to Arabidopsis NRPD2/E2, the second-largest subunit of plant-specific RNA polymerases IV and V. In Arabidopsis, Pol-IV and Pol-V play major roles in RNA-mediated silencing and a single second-largest subunit is shared between Pol-IV and Pol-V. Maize encodes three second-largest subunit genes: all three genes potentially encode full length proteins with highly conserved polymerase domains, and each is expressed in multiple overlapping tissues. The isolation of a recessive paramutation mutation in mop2 from a forward genetic screen suggests limited or no functional redundancy of these three genes. Potential alternative Pol-IV/Pol-V-like complexes could provide maize with a greater diversification of RNA-mediated transcriptional silencing machinery relative to Arabidopsis. Mop2-1 disrupts paramutation at multiple loci when heterozygous, whereas previously silenced alleles are only up-regulated when Mop2-1 is homozygous. The dramatic reduction in b1 tandem repeat siRNAs, but no disruption of silencing in Mop2-1 heterozygotes, suggests the major role for tandem repeat siRNAs is not to maintain silencing. Instead, we hypothesize the tandem repeat siRNAs mediate the establishment of the heritable silent state, a process fully disrupted in Mop2-1 heterozygotes. The dominant Mop2-1 mutation, which has a single nucleotide change in a domain highly conserved among all polymerases (E. coli to eukaryotes), disrupts both siRNA biogenesis (Pol-IV-like) and potentially processes downstream (Pol-V-like). These results suggest either the wild-type protein is a subunit in both complexes or the dominant mutant protein disrupts both complexes. Dominant mutations in the same domain in E. coli RNA polymerase suggest a model for Mop2-1 dominance: complexes containing Mop2-1 subunits are non-functional and compete with wild-type complexes.
Wood Utilization Is Dependent on Catalase Activities in the Filamentous Fungus Podospora anserina
Bourdais, Anne; Bidard, Frederique; Zickler, Denise; Berteaux-Lecellier, Veronique; Silar, Philippe; Espagne, Eric
2012-01-01
Catalases are enzymes that play critical roles in protecting cells against the toxic effects of hydrogen peroxide. They are implicated in various physiological and pathological conditions but some of their functions remain unclear. In order to decipher the role(s) of catalases during the life cycle of Podospora anserina, we analyzed the role of the four monofunctional catalases and one bifunctional catalase-peroxidase genes present in its genome. The five genes were deleted and the phenotypes of each single and all multiple mutants were investigated. Intriguingly, although the genes are differently expressed during the life cycle, catalase activity is dispensable during both vegetative growth and sexual reproduction in laboratory conditions. Catalases are also not essential for cellulose or fatty acid assimilation. In contrast, they are strictly required for efficient utilization of more complex biomass like wood shavings by allowing growth in the presence of lignin. The secreted CATB and cytosolic CAT2 are the major catalases implicated in peroxide resistance, while CAT2 is the major player during complex biomass assimilation. Our results suggest that P. anserina produces external H2O2 to assimilate complex biomass and that catalases are necessary to protect the cells during this process. In addition, the phenotypes of strains lacking only one catalase gene suggest that a decrease of catalase activity improves the capacity of the fungus to degrade complex biomass. PMID:22558065
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rinchik, E.M.; Carpenter, D.A.; Handel, M.A.
1995-07-03
Variability and complexity of phenotypes observed in microdeletion syndromes can be due to deletion of a single gene whose product participates in several aspects of development or can be due to the deletion of a number of tightly linked genes, each adding its own effect to the syndrome. The p^6H deletion in mouse chromosome 7 presents a good model with which to address this question of multigene vs. single-gene pleiotropy. Mice homozygous for the p^6H deletion are diluted in pigmentation, are smaller than their littermates, and manifest a nervous jerky-gait phenotype. Male homozygotes are sterile and exhibit profound abnormalities in spermiogenesis. By using N-ethyl-N-nitrosourea (EtNU) mutagenesis and a breeding protocol designed to recover recessive mutations expressed hemizygously opposite a large p-locus deletion, we have generated three noncomplementing mutations that map to the p^6H deletion. Each of these EtNU-induced mutations has adverse effects on the size, nervous behavior, and progression of spermiogenesis that characterize p^6H deletion homozygotes. Because EtNU is thought to induce primarily intragenic (point) mutations in mouse stem-cell spermatogonia, we propose that the trio of phenotypes (runtiness, nervous jerky gait, and male sterility) expressed in p^6H deletion homozygotes is the result of deletion of a single highly pleiotropic gene. We also predict that a homologous single locus, quite possibly tightly linked and distal to the D15S12 (P) locus in human chromosome 15q11-q13, may be associated with similar developmental abnormalities in humans. 29 refs., 3 figs., 1 tab.
A single-gene explanation for the probability of having idiopathic talipes equinovarus
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rebbeck, T.R.; Buetow, K.H.; Dietz, F.R.
1993-11-01
It has been hypothesized that the pathogenesis of idiopathic talipes equinovarus (ITEV, or clubfoot) is explained by genetic regulation of development and growth. The objective of the present study was to determine whether a single Mendelian gene explains the probability of having ITEV in a sample of 143 Caucasian pedigrees from Iowa. These pedigrees were ascertained through probands with ITEV. Complex segregation analyses were undertaken using a regressive logistic model. The results of these analyses strongly rejected the hypotheses that the probability of having ITEV in these pedigrees was explained by a non-Mendelian pattern of transmission with residual sibling correlation, a nontransmitted (environmental) factor with residual sibling correlation, or residual sibling correlation alone. These results were consistent with the hypothesis that the probability of having ITEV was explained by the Mendelian segregation of a single gene with two alleles plus the effects of some unmeasured factor(s) shared among siblings. The segregation of alleles at this single Mendelian gene indicated that the disease allele A was incompletely dominant to the nondisease allele B. The disease allele A, associated with ITEV affection, was estimated to occur in the population of inference with a frequency of 0.007. After adjusting for sex-specific population incidences of ITEV, the conditional probability (penetrance) of ITEV affection given the AA, AB, and BB genotypes was computed to be 1.0, 0.039, and 0.0006, respectively. Individual pedigrees in this sample that most strongly supported the single Mendelian gene hypothesis were identified. These pedigrees are candidates for genetic linkage analyses or DNA association studies. 35 refs., 2 figs., 7 tabs.
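As a rough consistency check on the figures quoted in this abstract, the allele frequency and penetrances can be combined into an expected population prevalence. The sketch below assumes Hardy-Weinberg genotype proportions and ignores the sex-specific adjustment the authors mention; the function name `population_prevalence` is hypothetical and is not taken from the study.

```python
def population_prevalence(q, pen_AA, pen_AB, pen_BB):
    """Expected prevalence of a trait for a biallelic locus.

    q       -- frequency of the disease allele A
    pen_XX  -- penetrance (probability of being affected) for each genotype
    Assumes Hardy-Weinberg proportions: AA = q^2, AB = 2pq, BB = p^2.
    """
    p = 1.0 - q
    return q * q * pen_AA + 2 * p * q * pen_AB + p * p * pen_BB


# Values as reported in the abstract (allele frequency and penetrances).
prevalence = population_prevalence(q=0.007, pen_AA=1.0, pen_AB=0.039, pen_BB=0.0006)
print(f"Expected prevalence: {prevalence:.6f}")
```

Under these assumptions the estimate comes to roughly 1.2 affected individuals per 1,000. Nearly all of it comes from the AB and BB terms, because AA homozygotes are very rare at this allele frequency.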
Lochlainn, Seosamh Ó; Amoah, Stephen; Graham, Neil S; Alamer, Khalid; Rios, Juan J; Kurup, Smita; Stoute, Andrew; Hammond, John P; Østergaard, Lars; King, Graham J; White, Phillip J; Broadley, Martin R
2011-12-08
Targeted Induced Loci Lesions IN Genomes (TILLING) is increasingly being used to generate and identify mutations in target genes of crop genomes. TILLING populations of several thousand lines have been generated in a number of crop species including Brassica rapa. Genetic analysis of mutants identified by TILLING requires an efficient, high-throughput and cost-effective genotyping method to track the mutations through numerous generations. High resolution melt (HRM) analysis has been used in a number of systems to identify single nucleotide polymorphisms (SNPs) and insertion/deletions (IN/DELs) enabling the genotyping of different types of samples. HRM is ideally suited to high-throughput genotyping of multiple TILLING mutants in complex crop genomes. To date it has been used to identify mutants and genotype single mutations. The aim of this study was to determine if HRM can facilitate downstream analysis of multiple mutant lines identified by TILLING in order to characterise allelic series of EMS-induced mutations in target genes across a number of generations in complex crop genomes. We demonstrate that HRM can be used to genotype allelic series of mutations in two genes, BraA.CAX1.a and BraA.MET1.a in Brassica rapa. We analysed 12 mutations in BraA.CAX1.a and five in BraA.MET1.a over two generations including a back-cross to the wild-type. Using a commercially available HRM kit and the Lightscanner™ system we were able to detect mutations in heterozygous and homozygous states for both genes. Using HRM genotyping on TILLING-derived mutants, it is possible to generate an allelic series of mutations within multiple target genes rapidly. Lines suitable for phenotypic analysis can be isolated approximately 8-9 months (3 generations) from receiving M3 seed of Brassica rapa from the RevGenUK TILLING service. PMID:22152063
Peredo, Elena L.; King, Ursula M.; Les, Donald H.
2013-01-01
The re-colonization of aquatic habitats by angiosperms has presented a difficult challenge to plants whose long evolutionary history primarily reflects adaptations to terrestrial conditions. Many aquatics must complete vital stages of their life cycle on the water surface by means of floating or emergent leaves and flowers. Only a few species, mainly within the order Alismatales, are able to complete all aspects of their life cycle including pollination, entirely underwater. Water-pollinated Alismatales include seagrasses and water nymphs (Najas), the latter being the only freshwater genus in the family Hydrocharitaceae with subsurface water-pollination. We have determined the complete nucleotide sequence of the plastid genome of Najas flexilis. The plastid genome of N. flexilis is a circular AT-rich DNA molecule of 156 kb, which displays a quadripartite structure with two inverted repeats (IR) separating the large single copy (LSC) from the small single copy (SSC) regions. In N. flexilis, as in other Alismatales, the rps19 and trnH genes are localized in the LSC region instead of within the IR regions as in other monocots. However, the N. flexilis plastid genome presents some anomalous modifications. The size of the SSC region is only one third of that reported for closely related species. The number of genes in the plastid is considerably less. Both features are due to loss of the eleven ndh genes in the Najas flexilis plastid. In angiosperms, the absence of ndh genes has been related mainly to the loss of photosynthetic function in parasitic plants. The ndh genes encode the NAD(P)H dehydrogenase complex, believed essential in terrestrial environments, where it increases photosynthetic efficiency in variable light intensities. The modified structure of the N. flexilis plastid genome suggests that adaptation to submersed environments, where light is scarce, has involved the loss of the NDH complex in at least some photosynthetic angiosperms. PMID:23861923
Kao, Katy C.; Schwartz, Katja; Sherlock, Gavin
2010-01-01
The Dobzhansky-Muller (D-M) model of speciation by genic incompatibility is widely accepted as the primary cause of interspecific postzygotic isolation. Since the introduction of this model, there have been theoretical and experimental data supporting the existence of such incompatibilities. However, speciation genes have been largely elusive, with only a handful of candidate genes identified in a few organisms. The Saccharomyces sensu stricto yeasts, which have small genomes and can mate interspecifically to produce sterile hybrids, are thus an ideal model for studying postzygotic isolation. Among them, only a single D-M pair, comprising a mitochondrially targeted product of a nuclear gene and a mitochondrially encoded locus, has been found. Thus far, no D-M pair of nuclear genes has been identified between any sensu stricto yeasts. We report here the first detailed genome-wide analysis of rare meiotic products from an otherwise sterile hybrid and show that no classic D-M pairs of speciation genes exist between the nuclear genomes of the closely related yeasts S. cerevisiae and S. paradoxus. Instead, our analyses suggest that more complex interactions, likely involving multiple loci having weak effects, may be responsible for their post-zygotic separation. The lack of a nuclear encoded classic D-M pair between these two yeasts, yet the existence of multiple loci that may each exert a small effect through complex interactions suggests that initial speciation events might not always be mediated by D-M pairs. An alternative explanation may be that the accumulation of polymorphisms leads to gamete inviability due to the activities of anti-recombination mechanisms and/or incompatibilities between the species' transcriptional and metabolic networks, with no single pair at least initially being responsible for the incompatibility. After such a speciation event, it is possible that one or more D-M pairs might subsequently arise following isolation. PMID:20686707
Zakharova, Irina; Teteryatnikova, Natalya; Toporkov, Andrey; Viktorov, Dmitry
2017-10-01
Two species of Burkholderia pseudomallei complex (Bpc), B. pseudomallei and B. mallei, can cause severe life-threatening infections. Rapidly discerning individual species within the group and separating them from other opportunistic pathogens of the Burkholderia cepacia complex (Bcc) is essential to establish a correct diagnosis and for epidemiological surveillance. In this study, a multiplex PCR assay based on the detection of an individual set of chromosomal beta-lactamase genes for single-step identification and differentiation of B. pseudomallei, B. mallei, B. thailandensis, and Bcc was developed. Two pairs of primers specific to a distinct class of B metallo-beta-lactamase genes and a pair of primers specific to the oxacillin-hydrolyzing class D beta-lactamase gene were demonstrated to successfully discriminate species within Bpc and from Bcc. The assay sensitivity was 9561 genomic equivalents (GE) for B. pseudomallei, 7827 GE for B. mallei, 8749 GE for B. thailandensis and 6023 GE for B. cepacia. Copyright © 2017 Elsevier B.V. All rights reserved.
Pediatric Multiple Sclerosis: Genes, Environment, and a Comprehensive Therapeutic Approach.
Cappa, Ryan; Theroux, Liana; Brenton, J Nicholas
2017-10-01
Pediatric multiple sclerosis is an increasingly recognized and studied disorder that accounts for 3% to 10% of all patients with multiple sclerosis. The risk for pediatric multiple sclerosis is thought to reflect a complex interplay between environmental and genetic risk factors. Environmental exposures, including sunlight (ultraviolet radiation, vitamin D levels), infections (Epstein-Barr virus), passive smoking, and obesity, have been identified as potential risk factors in youth. Genetic predisposition contributes to the risk of multiple sclerosis, and the major histocompatibility complex on chromosome 6 makes the single largest contribution to susceptibility to multiple sclerosis. With the use of large-scale genome-wide association studies, other non-major histocompatibility complex alleles have been identified as independent risk factors for the disease. The bridge between environment and genes likely lies in the study of epigenetic processes, which are environmentally-influenced mechanisms through which gene expression may be modified. This article will review these topics to provide a framework for discussion of a comprehensive approach to counseling and ultimately treating the pediatric patient with multiple sclerosis. Copyright © 2017 Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Willette, Demian A.; Iñiguez, Abril R.; Kupriyanova, Elena K.; Starger, Craig J.; Varman, Tristan; Toha, Abdul Hamid; Maralit, Benedict A.; Barber, Paul H.
2015-09-01
Christmas tree worm is the common name of a group of colorful serpulid polychaetes from the genus Spirobranchus that are symbionts of hermatypic corals. As is increasingly common with reef-associated organisms, Spirobranchus is arranged as a complex of species with overlapping geographic ranges. Current species delimitations based largely on opercular morphology are problematic because of high intraspecific variation. Here, a multi-gene phylogeny of the Spirobranchus corniculatus complex, which tentatively includes S. corniculatus, S. cruciger, and S. gaymardi, sampled from the Coral Triangle, Australia, and Fiji, was reconstructed to test whether the complex includes three genetically distinct lineages identifiable by their opercula. Maximum-likelihood analyses of nuclear and mitochondrial markers revealed a single, monophyletic clade for the S. corniculatus complex. Furthermore, the genetic and morphological variation observed is not geographically based, indicating that the former S. corniculatus complex of three morphospecies is a single, morphologically variable species across the Central Indo-Pacific. Resolving the taxonomy of S. corniculatus presents novel opportunities to utilize this tentative bio-indicator species for monitoring reef health.
Dietzgen, Ralf G; Kondo, Hideki; Goodin, Michael M; Kurath, Gael; Vasilakis, Nikos
2017-01-02
The family Rhabdoviridae consists of mostly enveloped, bullet-shaped or bacilliform viruses with a negative-sense, single-stranded RNA genome that infect vertebrates, invertebrates or plants. This ecological diversity is reflected by the diversity and complexity of their genomes. Five canonical structural protein genes are conserved in all rhabdoviruses, but may be overprinted, overlapped or interspersed with several novel and diverse accessory genes. This review gives an overview of the characteristics and diversity of rhabdoviruses, their taxonomic classification, replication mechanism, properties of classical rhabdoviruses such as rabies virus and rhabdoviruses with complex genomes, rhabdoviruses infecting aquatic species, and plant rhabdoviruses with both mono- and bipartite genomes. Copyright © 2016 Elsevier B.V. All rights reserved.
Kampmann, Martin; Bassik, Michael C.; Weissman, Jonathan S.
2013-01-01
A major challenge of the postgenomic era is to understand how human genes function together in normal and disease states. In microorganisms, high-density genetic interaction (GI) maps are a powerful tool to elucidate gene functions and pathways. We have developed an integrated methodology based on pooled shRNA screening in mammalian cells for genome-wide identification of genes with relevant phenotypes and systematic mapping of all GIs among them. We recently demonstrated the potential of this approach in an application to pathways controlling the susceptibility of human cells to the toxin ricin. Here we present the complete quantitative framework underlying our strategy, including experimental design, derivation of quantitative phenotypes from pooled screens, robust identification of hit genes using ultra-complex shRNA libraries, parallel measurement of tens of thousands of GIs from a single double-shRNA experiment, and construction of GI maps. We describe the general applicability of our strategy. Our pooled approach enables rapid screening of the same shRNA library in different cell lines and under different conditions to determine a range of different phenotypes. We illustrate this strategy here for single- and double-shRNA libraries. We compare the roles of genes for susceptibility to ricin and Shiga toxin in different human cell lines and reveal both toxin-specific and cell line-specific pathways. We also present GI maps based on growth and ricin-resistance phenotypes, and we demonstrate how such a comparative GI mapping strategy enables functional dissection of physical complexes and context-dependent pathways. PMID:23739767
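The abstract does not spell out how phenotypes or genetic interactions are quantified, so the following is only an illustrative sketch of one widely used convention: a phenotype expressed as a log2 enrichment of read counts, and a GI score defined as the deviation of the double-knockdown phenotype from the sum of the two single-knockdown phenotypes. The function names, the pseudocount, and the additive expectation are all assumptions made for illustration and are not necessarily the framework used by the authors.

```python
import math


def log2_enrichment(count_selected, count_unselected, pseudocount=1.0):
    """Illustrative phenotype: log2 ratio of shRNA read counts between two pools.

    The pseudocount (an assumption) avoids division by zero for dropped-out shRNAs.
    """
    return math.log2((count_selected + pseudocount) / (count_unselected + pseudocount))


def gi_score(pheno_a, pheno_b, pheno_ab):
    """Illustrative genetic interaction score under an additive expectation.

    Positive values mean the double knockdown is more resistant (or grows better)
    than expected from the two singles; negative values mean the opposite.
    """
    expected = pheno_a + pheno_b
    return pheno_ab - expected


# Hypothetical phenotypes for two genes knocked down alone and together.
score = gi_score(pheno_a=1.0, pheno_b=0.5, pheno_ab=2.0)
print(score)
```

In a real pooled screen the expected double-knockdown phenotype is usually estimated from the data rather than fixed in advance, so the additive model here should be read as a placeholder.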
The developmental transcriptome atlas of the spoon worm Urechis unicinctus (Echiurida: Annelida).
Park, Chungoo; Han, Yong-Hee; Lee, Sung-Gwon; Ry, Kyoung-Bin; Oh, Jooseong; Kern, Elizabeth M A; Park, Joong-Ki; Cho, Sung-Jin
2018-03-01
Echiurida is one of the most intriguing major subgroups of Annelida because, unlike most other annelids, echiurids lack metameric body segmentation as adults. For this reason, transcriptome analyses from various developmental stages of echiurid species can be of substantial value for understanding precise expression levels and the complex regulatory networks during early and larval development. A total of 914 million raw RNA-Seq reads were produced from 14 developmental stages of Urechis unicinctus and were de novo assembled into contigs spanning 63,928,225 bp with an N50 length of 2700 bp. The resulting comprehensive transcriptome database of the early developmental stages of U. unicinctus consists of 20,305 representative functional protein-coding transcripts. Approximately 66% of unigenes were assigned to superphylum-level taxa, including Lophotrochozoa (40%). The completeness of the transcriptome assembly was assessed using benchmarking universal single-copy orthologs; 75.7% of the single-copy orthologs were present in our transcriptome database. We observed 3 distinct patterns of global transcriptome profiles from 14 developmental stages and identified 12,705 genes that showed dynamic regulation patterns during the differentiation and maturation of U. unicinctus cells. We present the first large-scale developmental transcriptome dataset of U. unicinctus and provide a general overview of the dynamics of global gene expression changes during its early developmental stages. The analysis of time-course gene expression data is a first step toward understanding the complex developmental gene regulatory networks in U. unicinctus and will furnish a valuable resource for analyzing the functions of gene repertoires in various developmental phases.
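The N50 length quoted in this abstract is a standard assembly statistic: the contig length L such that contigs of length L or longer together contain at least half of the assembled bases. A minimal sketch of the calculation follows; the contig lengths used are toy values chosen for illustration, not data from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    Contigs are visited from longest to shortest; the N50 is the length of the
    contig at which the running total first reaches half of the assembly size.
    """
    if not lengths:
        raise ValueError("n50 requires at least one contig length")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Toy assembly of five contigs (hypothetical lengths in bp).
toy_contigs = [100, 200, 300, 400, 500]
print(n50(toy_contigs))
```

For the toy assembly the total is 1,500 bp, and the two longest contigs (500 and 400 bp) already cover 900 bp, so the N50 is 400 bp.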
Genetic Heterogeneity in Algerian Human Populations
Deba, Tahria; Calafell, Francesc; Benhamamouch, Soraya; Comas, David
2015-01-01
The demographic history of human populations in North Africa has been characterized by complex processes of admixture and isolation that have modeled its current gene pool. Diverse genetic ancestral components with different origins (autochthonous, European, Middle Eastern, and sub-Saharan) and genetic heterogeneity in the region have been described. In this complex genetic landscape, Algeria, the largest country in Africa, has been poorly covered, with most of the studies using a single Algerian sample. In order to evaluate the genetic heterogeneity of Algeria, Y-chromosome, mtDNA and autosomal genome-wide markers have been analyzed in several Berber- and Arab-speaking groups. Our results show that the genetic heterogeneity found in Algeria is not correlated with geography or linguistics, challenging the idea of Berber groups being genetically isolated and Arab groups open to gene flow. In addition, we have found that external sources of gene flow into North Africa have been carried more often by females than males, while the North African autochthonous component is more frequent in paternally transmitted genome regions. Our results highlight the different demographic history revealed by different markers and urge caution when deriving general conclusions from partial genomic information or from single samples as representatives of the total population of a region. PMID:26402429
The oculocerebrorenal syndrome gene product is a 105-kD protein localized to the Golgi complex.
Olivos-Glander, I M; Jänne, P A; Nussbaum, R L
1995-01-01
The oculocerebrorenal syndrome of Lowe (OCRL) is a multisystem disorder affecting the lens, kidney, and CNS. The predicted amino acid sequence of the OCRL gene, OCRL-1, was used to develop antibodies against the OCRL-1 protein. Western blot analysis using affinity-purified serum against the amino terminus of the OCRL-1 gene product (ocrl-1) demonstrates a single protein of 105 kD in fibroblasts of a normal individual that is absent in fibroblasts of an OCRL patient who lacks OCRL-1 transcript. A single protein with the same electrophoretic mobility is found by western analysis in various human cultured cell lines, and approximately the same size protein is also found in all mouse tissues tested. Northern analysis of various human and mouse tissues demonstrates that OCRL-1 transcript is expressed in nearly all tissues examined. By immunofluorescence, the ocrl-1 antibody stains a juxtanuclear region in normal fibroblast cells, while no specific staining is evident in the OCRL patient who produces no transcript. Colocalization of the ocrl-1 protein to the Golgi complex was demonstrated using a known monoclonal antibody against a Golgi-specific coat protein, beta-COP (beta coatomer protein). PMID:7573041
Evstiukhina, T A; Alekseeva, E A; Fedorov, D V; Peshekhonov, V T; Korolev, V G
2017-02-01
Chromatin remodelers are special multiprotein machines capable of transforming the structure, constitution, and positioning of nucleosomes on DNA. Biochemical activities of remodeling complexes CHD1 and ISWI from the SWI2/SNF2 family are well established. They ensure correct positioning of nucleosomes along the genome, which is probably critical for genome stability, in particular, after action of polymerases, repair enzymes, and transcription. In this paper, we show that single mutations in genes ISW1, ISW2, and CHD1 weakly affect repair and mutagenic processes in yeast cells. At the same time, there are differences in the effect of these mutations on spontaneous mutation levels, which indicates certain specificity of action of protein complexes ISW1, ISW2, and CHD1 on expression of different genes that control repair and mutation processes in yeast.
Positional cloning in mice and its use for molecular dissection of inflammatory arthritis.
Abe, Koichiro; Yu, Philipp
2009-02-01
One of the next challenges in genetics may be the molecular dissection of the genetic and environmental components of human complex diseases. In humans, however, there are certain experimental limitations for identification of a single component of the complex interactions by genetic analyses. Experimental animals offer simplified models for genetic and environmental interactions in human complex diseases. In particular, mice are the best mammalian models because of a long history and ample experience of genetic analyses. Forward genetics, which includes genetic screens and subsequent positional cloning of the causative genes, is a powerful strategy to dissect a complex phenomenon without prior molecular knowledge of the process. In this review, first, we describe a general scheme of positional cloning in mice. Next, recent accomplishments on the patho-mechanisms of inflammatory arthritis by forward genetics approaches are introduced; positional cloning efforts for skg, Ali5, Ali18, cmo, and lupo mutants are provided as examples for the application to human complex diseases. As seen in the examples, the identification of genetic factors by positional cloning in the mouse has the potential to resolve the molecular complexity of gene-environment interactions in human complex diseases.
Estrada, Beatriz; Maeland, Anne D; Gisselbrecht, Stephen S; Bloor, James W; Brown, Nicholas H; Michelson, Alan M
2007-07-15
Multinucleated myotubes develop by the sequential fusion of individual myoblasts. Using a convergence of genomic and classical genetic approaches, we have discovered a novel gene, singles bar (sing), that is essential for myoblast fusion. sing encodes a small multipass transmembrane protein containing a MARVEL domain, which is found in vertebrate proteins involved in processes such as tight junction formation and vesicle trafficking where--as in myoblast fusion--membrane apposition occurs. sing is expressed in both founder cells and fusion competent myoblasts preceding and during myoblast fusion. Examination of embryos injected with double-stranded sing RNA or embryos homozygous for ethane methyl sulfonate-induced sing alleles revealed an identical phenotype: replacement of multinucleated myofibers by groups of single, myosin-expressing myoblasts at a stage when formation of the mature muscle pattern is complete in wild-type embryos. Unfused sing mutant myoblasts form clusters, suggesting that early recognition and adhesion of these cells are unimpaired. To further investigate this phenotype, we undertook electron microscopic ultrastructural studies of fusing myoblasts in both sing and wild-type embryos. These experiments revealed that more sing mutant myoblasts than wild-type contain pre-fusion complexes, which are characterized by electron-dense vesicles paired on either side of the fusing plasma membranes. In contrast, embryos mutant for another muscle fusion gene, blown fuse (blow), have a normal number of such complexes. Together, these results lead to the hypothesis that sing acts at a step distinct from that of blow, and that sing is required on both founder cell and fusion-competent myoblast membranes to allow progression past the pre-fusion complex stage of myoblast fusion, possibly by mediating fusion of the electron-dense vesicles to the plasma membrane.
Genomic organization of the rat alpha 2u-globulin gene cluster.
McFadyen, D A; Addison, W; Locke, J
1999-05-01
The alpha 2u-globulin are a group of similar proteins, belonging to the lipocalin superfamily of proteins, that are synthesized in a subset of secretory tissues in rats. The many alpha 2u-globulin isoforms are encoded by a multigene family that exhibits extensive homology. Despite a high degree of sequence identity, individual family members show diverse expression patterns involving complex hormonal, tissue-specific, and developmental regulation. Analysis suggests that there are approximately 20 alpha 2u-globulin genes in the rat genome. We have used fluorescence in situ hybridization (FISH) to show that the alpha 2u-globulin genes are clustered at a single site on rat Chromosome (Chr) 5 (5q22-24). Southern blots of rat genomic DNA separated by pulsed field gel electrophoresis indicated that the alpha 2u-globulin genes are contained on two NruI fragments with a total size of 880 kbp. Analysis of three P1 clones containing alpha 2u-globulin genes indicated that the alpha 2u-globulin genes are tandemly arranged in a head-to-tail fashion. The organization of the alpha 2u-globulin genes in the rat as a tandem array of single genes differs from the homologous major urinary protein genes in the mouse, which are organized as tandem arrays of divergently oriented gene pairs. The structure of these gene clusters may have consequences for the proposed function, as a pheromone transporter, for the protein products encoded by these genes.
Röschmann, K I L; van Kuijen, A-M; Luiten, S; Jonker, M J; Breit, T M; Fokkens, W J; Petersen, A; van Drunen, C M
2012-10-01
Seasonal allergic rhinitis (AR) is a global health problem and its prevalence has increased considerably in the last decades. As the allergic response with its clinical manifestations is triggered by only a few proteins within natural extracts, there is an increasing tendency for single-component-resolved diagnosis and immunotherapy. As natural exposure is not to single proteins, but to complex mixtures of molecules, we were interested in comparing the activation of respiratory epithelial cells induced by the purified major allergen Phl p 1 with the induction caused by a complete extract of Timothy grass pollen (GPE). NCI-H292 cells were exposed to GPE or Phl p 1 for 24 h; isolated RNA and cell culture supernatants were used for microarray analysis, multiplex enzyme-linked immunosorbent assay (ELISA) and subsequent analysis. We found 262 genes that showed a GPE-induced change of at least 3-fold, whereas Phl p 1-stimulation resulted in 71 genes with a fold induction of more than 3-fold. Besides genes that were regulated by both stimuli, we also detected genes displaying an opposite response after stimulation, suggesting that GPE might be more than purified major allergens with regard to induced immune responses. Additional components within GPE and the resulting modulation of general processes affecting gene transcription and signalling pathways might be crucial to maintain/overcome the diseased phenotype and to induce the influx of cells contributing to late-phase allergic responses. When the initial process of sensitization is the matter of interest or late-phase allergic responses, one might miss important immune modulatory molecules and their interaction with allergens by applying single components only. © 2012 Blackwell Publishing Ltd.
Meta-analysis of gene-level associations for rare variants based on single-variant statistics.
Hu, Yi-Juan; Berndt, Sonja I; Gustafsson, Stefan; Ganna, Andrea; Hirschhorn, Joel; North, Kari E; Ingelsson, Erik; Lin, Dan-Yu
2013-08-08
Meta-analysis of genome-wide association studies (GWASs) has led to the discoveries of many common variants associated with complex human diseases. There is a growing recognition that identifying "causal" rare variants also requires large-scale meta-analysis. The fact that association tests with rare variants are performed at the gene level rather than at the variant level poses unprecedented challenges in the meta-analysis. First, different studies may adopt different gene-level tests, so the results are not compatible. Second, gene-level tests require multivariate statistics (i.e., components of the test statistic and their covariance matrix), which are difficult to obtain. To overcome these challenges, we propose to perform gene-level tests for rare variants by combining the results of single-variant analysis (i.e., p values of association tests and effect estimates) from participating studies. This simple strategy is possible because of an insight that multivariate statistics can be recovered from single-variant statistics, together with the correlation matrix of the single-variant test statistics, which can be estimated from one of the participating studies or from a publicly available database. We show both theoretically and numerically that the proposed meta-analysis approach provides accurate control of the type I error and is as powerful as joint analysis of individual participant data. This approach accommodates any disease phenotype and any study design and produces all commonly used gene-level tests. An application to the GWAS summary results of the Genetic Investigation of ANthropometric Traits (GIANT) consortium reveals rare and low-frequency variants associated with human height. The relevant software is freely available. Copyright © 2013 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
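The idea that multivariate statistics can be recovered from single-variant results can be illustrated with a short sketch. It assumes the common score-statistic approximation, in which each variant's score is U_j = beta_j / se_j^2, its variance is V_jj = 1 / se_j^2, and the covariances are filled in from the correlation matrix as V_jk = R_jk / (se_j * se_k). The function names, the weights, and the numbers are hypothetical, and this is not the authors' released software:

```python
import math


def score_stats(betas, ses, corr):
    """Recover score vector U and covariance matrix V from single-variant
    effect estimates, their standard errors, and a correlation matrix
    (assumed approximation: U_j = beta_j / se_j^2, V_jk = R_jk / (se_j * se_k))."""
    n = len(betas)
    u = [betas[j] / ses[j] ** 2 for j in range(n)]
    v = [[corr[j][k] / (ses[j] * ses[k]) for k in range(n)] for j in range(n)]
    return u, v


def burden_test(u, v, weights):
    """Weighted burden test: Q = (w'U)^2 / (w'Vw), chi-square with 1 df."""
    n = len(u)
    numerator = sum(weights[j] * u[j] for j in range(n))
    variance = sum(weights[j] * weights[k] * v[j][k]
                   for j in range(n) for k in range(n))
    q = numerator ** 2 / variance
    # Upper tail of a 1-df chi-square: P(X > q) = erfc(sqrt(q / 2)).
    p_value = math.erfc(math.sqrt(q / 2.0))
    return q, p_value


# Hypothetical summary statistics for two uncorrelated rare variants.
u, v = score_stats(betas=[0.2, 0.2], ses=[0.1, 0.1], corr=[[1.0, 0.0], [0.0, 1.0]])
print(burden_test(u, v, weights=[1.0, 1.0]))
```

In a meta-analysis setting, U and V recovered in this way for each study would be summed across studies before the gene-level test is formed; with a single variant the burden statistic reduces to the familiar (beta / se)^2.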
Han, Xuelei; Jiang, Tengfei; Yang, Huawei; Zhang, Qingde; Wang, Weimin; Fan, Bin; Liu, Bang
2012-06-01
Meat quality traits are economically important traits of swine, and are controlled by multiple genes as complex quantitative traits. In the present study four genes, H-FABP (heart fatty acid-binding protein), MASTR (MEF2 activating motif and SAP domain containing transcriptional regulator), UCP3 (uncoupling protein 3) and MYOD1 (myogenic differentiation 1) were researched in Large White pigs. The polymorphisms H-FABP T/C of 5'UTR, MYOD1 g.257 A>C, UCP3 g.1406 G>A in exon 3 and MASTR c.187 C>T have been reported to be associated with meat quality traits in pigs. The aim of this study was to analyze the effect of single and multiple markers for single traits in Large White pigs. The single marker association analysis showed that the H-FABP and MASTR genes were associated with IMF (intramuscular fat content) (P < 0.05), and that the g.257 A>C of MYOD1 gene was most significantly related to muscle pH value (P < 0.01). The multiple markers for IMF were analyzed by combining the markers and quantitative trait modes into the linear regression. The results revealed that H-FABP and MASTR integrate gene networks for IMF. Thus, our study results suggested that H-FABP and MASTR polymorphisms could be used as genetic markers in the marker-assisted selection towards the improvement of IMF in Large White pigs.
Balandyté, Lina; Brodard, Isabelle; Frey, Joachim; Oevermann, Anna; Abril, Carlos
2011-01-01
Listeria monocytogenes is among the most important food-borne pathogens and is well adapted to persist in the environment. To gain insight into the genetic relatedness and potential virulence of L. monocytogenes strains causing central nervous system (CNS) infections, we used multilocus variable-number tandem-repeat analysis (MLVA) to subtype 183 L. monocytogenes isolates, most from ruminant rhombencephalitis and some from human patients, food, and the environment. Allelic-profile-based comparisons grouped L. monocytogenes strains mainly into three clonal complexes and linked single-locus variants (SLVs). Clonal complex A essentially consisted of isolates from human and ruminant brain samples. All but one rhombencephalitis isolate from cattle were located in clonal complex A. In contrast, food and environmental isolates mainly clustered into clonal complex C, and none was classified as clonal complex A. Isolates of the two main clonal complexes (A and C) obtained by MLVA were analyzed by PCR for the presence of 11 virulence-associated genes (prfA, actA, inlA, inlB, inlC, inlD, inlE, inlF, inlG, inlJ, and inlC2H). Virulence gene analysis revealed significant differences in the actA, inlF, inlG, and inlJ allelic profiles between clinical isolates (complex A) and nonclinical isolates (complex C). The association of particular alleles of actA, inlF, and newly described alleles of inlJ with isolates from CNS infections (particularly rhombencephalitis) suggests that these virulence genes participate in neurovirulence of L. monocytogenes. The overall absence of inlG in clinical complex A and its presence in complex C isolates suggests that the InlG protein is more relevant for the survival of L. monocytogenes in the environment. PMID:21984240
Sinha, Pallavi; Gupta, Anamika; Prakash, Pradyot; Anupurba, Shampa; Tripathi, Rajneesh; Srivastava, G N
2016-03-12
Control of the global burden of tuberculosis is obstructed due to lack of simple, rapid and cost effective diagnostic techniques that can be used in resource-poor settings. To facilitate the early diagnosis of TB directly from clinical specimens, we have standardized and validated the use of nested multiplex PCR, targeting gene fragments IS6110, MTP40 and 32kD α-antigen encoding genes specific for Mycobacterium tuberculosis complex and non-tubercular mycobacteria (NTM), in comparison to smear microscopy, solid culture and single step multiplex PCR. The results were evaluated in comparison to a composite reference standard (CRS) comprising microbiological results (smear and culture), clinical, radiological and cytopathological findings, clinical treatment and response to anti-tubercular therapy. The nested multiplex PCR (nMPCR) assay was evaluated to test its utility in 600 (535 pulmonary and 65 extra-pulmonary specimens) clinically suspected TB cases. All specimens were processed for smear, culture, single step multiplex PCR and nested multiplex PCR testing. Out of 535 screened pulmonary and 65 extra-pulmonary specimens, 329 (61.5%) and 19 (29.2%) cases were culture positive for M. tuberculosis. Based on CRS, 450 patients had "clinical TB" (definitive-TB, probable-TB and possible-TB). The remaining 150 were confirmed "non-TB" cases. For culture, the sensitivity was low, 79.3% for pulmonary and 54.3% for extra-pulmonary cases. The sensitivity and specificity of the nMPCR test were evaluated taking the composite reference standard as the gold standard. The sensitivity of the nMPCR assay was 97.1% for pulmonary and 91.4% for extra-pulmonary TB cases with specificity of 100% and 93.3% respectively. Nested multiplex PCR using three gene primers is a rapid, reliable and highly sensitive and specific diagnostic technique for the detection and differentiation of M. tuberculosis complex from NTM genome and will be useful in diagnosing paucibacillary samples.
Nested multiplex PCR assay was found to be better than single step multiplex PCR for assessing the diagnosis of TB.
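For readers less familiar with the diagnostic accuracy measures reported above, the following minimal sketch shows how sensitivity and specificity are derived from a comparison against a reference standard. The function name and the counts are hypothetical placeholders; the abstract does not report the underlying 2x2 tables, so its percentages are not reproduced here:

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) from counts against a reference standard.

    tp: reference-positive cases the assay calls positive
    fn: reference-positive cases the assay calls negative
    tn: reference-negative cases the assay calls negative
    fp: reference-negative cases the assay calls positive
    """
    sensitivity = tp / (tp + fn)  # fraction of true cases detected
    specificity = tn / (tn + fp)  # fraction of non-cases correctly excluded
    return sensitivity, specificity


# Hypothetical counts, for illustration only.
sens, spec = sensitivity_specificity(tp=90, fn=10, tn=95, fp=5)
print(f"sensitivity={sens:.1%}, specificity={spec:.1%}")
```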
Gribble, Kristin E; Mark Welch, David B
2012-08-01
Chemically mediated prezygotic barriers to reproduction likely play an important role in speciation. In facultatively sexual monogonont rotifers from the Brachionus plicatilis cryptic species complex, mate recognition of females by males is mediated by the Mate Recognition Protein (MRP), a globular glycoprotein on the surface of females, encoded by the mmr-b gene family. In this study, we sequenced mmr-b copies from 27 isolates representing 11 phylotypes of the B. plicatilis species complex, examined the mode of evolution and selection of mmr-b, and determined the relationship between mmr-b genetic distance and mate recognition among isolates. Isolates of the B. plicatilis species complex have 1-4 copies of mmr-b, each composed of 2-9 nearly identical tandem repeats. The repeats within a gene copy are generally more similar than are gene copies among phylotypes, suggesting concerted evolution. Compared to housekeeping genes from the same isolates, mmr-b has accumulated only half as many synonymous differences but twice as many non-synonymous differences. Most of the amino acid differences between repeats appear to occur on the outer face of the protein, and these often result in changes in predicted patterns of phosphorylation. However, we found no evidence of positive selection driving these differences. Isolates with the most divergent copies were unable to mate with other isolates and rarely self-crossed. Overall the degree of mate recognition was significantly correlated with the genetic distance of mmr-b. Discrimination of compatible mates in the B. plicatilis species complex is determined by proteins encoded by closely related copies of a single gene, mmr-b. While concerted evolution of the tandem repeats in mmr-b may function to maintain identity, it can also lead to the rapid spread of a mutation through all copies in the genome and thus to reproductive isolation. 
The mmr-b gene is evolving rapidly, and novel alleles may be maintained and increase in frequency via asexual reproduction. Our analyses indicate that mate recognition, controlled by MMR-B, may drive reproductive isolation and allow saltational sympatric speciation within the B. plicatilis cryptic species complex, and that this process may be largely neutral.
Kurbasic, Azra; Poveda, Alaitz; Chen, Yan; Ågren, Åsa; Engberg, Elisabeth; Hu, Frank B.; Johansson, Ingegerd; Barroso, Ines; Brändström, Anders; Hallmans, Göran; Renström, Frida; Franks, Paul W.
2014-01-01
Most complex diseases have well-established genetic and non-genetic risk factors. In some instances, these risk factors are likely to interact, whereby their joint effects convey a level of risk that is either significantly more or less than the sum of these risks. Characterizing these gene-environment interactions may help elucidate the biology of complex diseases, as well as to guide strategies for their targeted prevention. In most cases, the detection of gene-environment interactions will require sample sizes in excess of those needed to detect the marginal effects of the genetic and environmental risk factors. Although many consortia have been formed, comprising multiple diverse cohorts to detect gene-environment interactions, few robust examples of such interactions have been discovered. This may be because combining data across studies, usually through meta-analysis of summary data from the contributing cohorts, is often a statistically inefficient approach for the detection of gene-environment interactions. Ideally, single, very large and well-genotyped prospective cohorts, with validated measures of environmental risk factor and disease outcomes should be used to study interactions. The presence of strong founder effects within those cohorts might further strengthen the capacity to detect novel genetic effects and gene-environment interactions. Access to accurate genealogical data would also aid in studying the diploid nature of the human genome, such as genomic imprinting (parent-of-origin effects). Here we describe two studies from northern Sweden (the GLACIER and VIKING studies) that fulfill these characteristics. PMID:25396097
Kurbasic, Azra; Poveda, Alaitz; Chen, Yan; Agren, Asa; Engberg, Elisabeth; Hu, Frank B; Johansson, Ingegerd; Barroso, Ines; Brändström, Anders; Hallmans, Göran; Renström, Frida; Franks, Paul W
2014-12-01
Most complex diseases have well-established genetic and non-genetic risk factors. In some instances, these risk factors are likely to interact, whereby their joint effects convey a level of risk that is either significantly more or less than the sum of these risks. Characterizing these gene-environment interactions may help elucidate the biology of complex diseases, as well as to guide strategies for their targeted prevention. In most cases, the detection of gene-environment interactions will require sample sizes in excess of those needed to detect the marginal effects of the genetic and environmental risk factors. Although many consortia have been formed, comprising multiple diverse cohorts to detect gene-environment interactions, few robust examples of such interactions have been discovered. This may be because combining data across studies, usually through meta-analysis of summary data from the contributing cohorts, is often a statistically inefficient approach for the detection of gene-environment interactions. Ideally, single, very large and well-genotyped prospective cohorts, with validated measures of environmental risk factor and disease outcomes should be used to study interactions. The presence of strong founder effects within those cohorts might further strengthen the capacity to detect novel genetic effects and gene-environment interactions. Access to accurate genealogical data would also aid in studying the diploid nature of the human genome, such as genomic imprinting (parent-of-origin effects). Here we describe two studies from northern Sweden (the GLACIER and VIKING studies) that fulfill these characteristics.
Duconge, Jorge; Cadilla, Carmen L; Windemuth, Andreas; Kocherla, Mohan; Gorowski, Krystyna; Seip, Richard L; Bogaard, Kali; Renta, Jessica Y; Piovanetti, Paola; D'Agostino, Darrin; Santiago-Borrero, Pedro J; Ruaño, Gualberto
2009-01-01
Polymorphisms in the cytochrome P450 2C9 (CYP2C9) and vitamin K epoxide reductase complex subunit 1 (VKORC1) genes significantly alter the effective warfarin dose. We determined the frequencies of alleles, single carriers, and double carriers of single nucleotide polymorphisms (SNPs) in the CYP2C9 and VKORC1 genes in a Puerto Rican cohort and gauged the impact of these polymorphisms on warfarin dosage using a published algorithm. A total of 92 DNA samples were genotyped using Luminex x-MAP technology. The polymorphism frequencies were 6.52%, 5.43% and 28.8% for CYP2C9 *2, *3 and VKORC1-1639 C>A polymorphisms, respectively. The prevalence of combinatorial genotypes was 16% for carriers of both the CYP2C9 and VKORC1 polymorphisms, 9% for carriers of CYP2C9 polymorphisms, 35% for carriers of the VKORC1 polymorphism, and the remaining 40% were non-carriers for either gene. Based on a published warfarin dosing algorithm, single, double and triple carriers of functionally deficient polymorphisms predict reductions of 1.0-1.6, 2.0-2.9, and 2.9-3.7 mg/day, respectively, in warfarin dose. Overall, 60% of the population carried at least a single polymorphism predicting deficient warfarin metabolism or responsiveness and 13% were double carriers with polymorphisms in both genes studied. Combinatorial genotyping of CYP2C9 and VKORC1 can allow for individualized dosing of warfarin among patients with gene polymorphisms, potentially reducing the risk of stroke or bleeding.
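The reported polymorphism frequencies are allele frequencies, that is, the number of variant alleles divided by the number of chromosomes (two per genotyped individual). A minimal sketch of that arithmetic follows; the variant allele counts of 12, 10 and 53 are back-calculated assumptions that happen to match the published percentages for 92 samples, not figures stated in the abstract:

```python
def allele_frequency(variant_alleles, n_individuals):
    """Allele frequency in a diploid sample: variant alleles / (2 * individuals)."""
    return variant_alleles / (2 * n_individuals)


N_SAMPLES = 92  # genotyped DNA samples, as stated in the abstract

# Assumed (back-calculated) variant allele counts per polymorphism.
assumed_counts = {"CYP2C9*2": 12, "CYP2C9*3": 10, "VKORC1 -1639 C>A": 53}

for name, count in assumed_counts.items():
    freq = allele_frequency(count, N_SAMPLES)
    print(f"{name}: {100 * freq:.2f}%")
```

With 184 chromosomes in total, counts of 12, 10 and 53 give approximately 6.52%, 5.43% and 28.80%, consistent with the frequencies quoted above.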
A strategy to apply quantitative epistasis analysis on developmental traits.
Labocha, Marta K; Yuan, Wang; Aleman-Meza, Boanerges; Zhong, Weiwei
2017-05-15
Genetic interactions are key to understanding complex traits and evolution. Epistasis analysis is an effective method to map genetic interactions. Large-scale quantitative epistasis analysis has been well established for single cells. However, there is a substantial lack of such studies in multicellular organisms and their complex phenotypes such as development. Here we present a method to extend quantitative epistasis analysis to developmental traits. In the nematode Caenorhabditis elegans, we applied RNA interference on mutants to inactivate two genes, used an imaging system to quantitatively measure phenotypes, and developed a set of statistical methods to extract genetic interactions from phenotypic measurement. Using two different C. elegans developmental phenotypes, body length and sex ratio, as examples, we showed that this method could accommodate various metazoan phenotypes with performances comparable to those methods in single cell growth studies. Compared with qualitative observations, this method of quantitative epistasis enabled detection of new interactions involving subtle phenotypes. For example, several sex-ratio genes were found to interact with brc-1 and brd-1, the orthologs of the human breast cancer genes BRCA1 and BARD1, respectively. We confirmed the brc-1 interactions with the following genes in DNA damage response: C34F6.1, him-3 (ortholog of HORMAD1, HORMAD2), sdc-1, and set-2 (ortholog of SETD1A, SETD1B, KMT2C, KMT2D), validating the effectiveness of our method in detecting genetic interactions. We developed a reliable, high-throughput method for quantitative epistasis analysis of developmental phenotypes.
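As an illustration of what a quantitative epistasis score can look like, the sketch below uses the multiplicative null model that is common in single-cell growth studies: phenotypes are normalized to wild type, and the interaction score is the deviation of the double perturbation from the product of the two single perturbations. The abstract does not state which model the authors used, so the model, the function name, and the example values are all assumptions:

```python
def epistasis_score(w_a, w_b, w_ab):
    """Deviation of the double-perturbation phenotype from the multiplicative
    expectation. All phenotypes are expressed relative to wild type (= 1.0).

    A score near 0 suggests no interaction; negative values indicate an
    aggravating interaction and positive values an alleviating one.
    """
    expected = w_a * w_b
    return w_ab - expected


# Hypothetical normalized body-length measurements.
print(epistasis_score(w_a=0.8, w_b=0.5, w_ab=0.4))  # matches the expectation
print(epistasis_score(w_a=0.8, w_b=0.5, w_ab=0.2))  # shorter than expected
```

In practice the score would be computed from replicate measurements and tested against its sampling variability before an interaction is called.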
Oh, Binna; Han, Jaesik; Choi, Eunji; Tan, Xiaonan; Lee, Minhyung
2015-04-01
Bevacizumab (Avastin) is an angiogenesis inhibitor used as a treatment for various cancers. In this study, the combination therapy of Avastin and glioblastoma-specific thymidine kinase gene [pEpo-NI2-SV-herpes simplex virus thymidine kinase(HSVtk)] was evaluated in a glioblastoma animal model. The R7L10 peptide was used as a gene carrier of pEpo-NI2-SV-HSVtk. Gel retardation assays confirmed that R7L10 formed stable complexes with pEpo-NI2-SV-HSVtk. R7L10 protected DNA from nuclease digestion. R7L10 had lower transfection efficiency than polyethylenimine (PEI; 25 kDa). However, the in vitro and in vivo toxicity assays showed that R7L10 had lower cytotoxicity than PEI, suggesting that R7L10 is safer than PEI. For the combination therapy, Avastin was injected intravenously and the pEpo-NI2-SV-HSVtk/R7L10 complexes were injected intratumorally in the glioblastoma animal model. Tumor growth was most effectively inhibited by the combination therapy of Avastin and the gene. The immunostaining results confirmed that the HSVtk genes were expressed in the groups with the pEpo-NI2-SV-HSVtk/R7L10 complex. The terminal deoxynucleotidyl transferase dUTP nick end labeling assay showed a higher level of apoptotic cells in the combination group than the pEpo-NI2-SV-HSVtk/R7L10 complex or Avastin group. In conclusion, the combination of Avastin and the glioblastoma-specific HSVtk gene has a higher antitumor effect than single therapy of Avastin or HSVtk after intratumoral administration in glioblastoma animal model. © 2015 Wiley Periodicals, Inc. and the American Pharmacists Association.
The genetics of mental illness: implications for practice.
Hyman, S. E.
2000-01-01
Many of the comfortable and relatively simple models of the nature of mental disorders, their causes and their neural substrates now appear quite frayed. Gone is the idea that symptom clusters, course of illness, family history and treatment response would coalesce in a simple way to yield valid diagnoses. Also too simple was the concept, born of early pharmacological successes, that abnormal levels of one or more neurotransmitters would satisfactorily explain the pathogenesis of depression or schizophrenia. Gone is the notion that there is a single gene that causes any mental disorder or determines any behavioural variant. The concept of the causative gene has been replaced by that of genetic complexity, in which multiple genes act in concert with non-genetic factors to produce a risk of mental disorder. Discoveries in genetics and neuroscience can be expected to lead to better models that provide improved representation of the complexity of the brain and behaviour and the development of both. There are likely to be profound implications for clinical practice. The complex genetics of risk should reinvigorate research on the epidemiology and classification of mental disorders and explain the complex patterns of disease transmission within families. Knowledge of the timing of the expression of risk genes during brain development and of their function should not only contribute to an understanding of gene action and the pathophysiology of disease but should also help to direct the search for modifiable environmental risk factors that convert risk into illness. The function of risk genes can only become comprehensible in the context of advances at the molecular, cellular and systems levels in neuroscience and the behavioural sciences. Genetics should yield new therapies aimed not just at symptoms but also at pathogenic processes, thus permitting the targeting of specific therapies to individual patients. PMID:10885164
Chen, L P; E, G X; Zhao, Y J; Na, R S; Zhao, Z Q; Zhang, J H; Ma, Y H; Sun, Y W; Zhong, T; Zhang, H P; Huang, Y F
2015-06-18
DRA encodes the alpha chain of the DR heterodimer, is closely linked to DRB, and is considered almost monomorphic within the major histocompatibility complex region. In this study, we characterized exon 2 of DRA to evaluate the immunogenetic diversity of indigenous goats from southern China. Two single nucleotide polymorphisms in an untranslated region and one synonymous substitution in the coding region were identified. These data suggest high immunodiversity in the native Chinese goat population.
MaizeGDB: Global support for maize research through open access information [abstract]
USDA-ARS's Scientific Manuscript database
MaizeGDB is the open-access global repository for maize genetic and genomic information – from single genes that determine nutritional quality to whole genome-scale data for complex traits including yield and drought tolerance. The data and tools at MaizeGDB enable researchers from Ethiopia to Ghan...
Single-cell sequencing and tumorigenesis: improved understanding of tumor evolution and metastasis.
Ellsworth, Darrell L; Blackburn, Heather L; Shriver, Craig D; Rabizadeh, Shahrooz; Soon-Shiong, Patrick; Ellsworth, Rachel E
2017-12-01
Extensive genomic and transcriptomic heterogeneity in human cancer often negatively impacts treatment efficacy and survival, thus posing a significant ongoing challenge for modern treatment regimens. State-of-the-art DNA- and RNA-sequencing methods now provide high-resolution genomic and gene expression portraits of individual cells, facilitating the study of complex molecular heterogeneity in cancer. Important developments in single-cell sequencing (SCS) technologies over the past 5 years provide numerous advantages over traditional sequencing methods for understanding the complexity of carcinogenesis, but significant hurdles must be overcome before SCS can be clinically useful. In this review, we: (1) highlight current methodologies and recent technological advances for isolating single cells, single-cell whole-genome and whole-transcriptome amplification using minute amounts of nucleic acids, and SCS, (2) summarize research investigating molecular heterogeneity at the genomic and transcriptomic levels and how this heterogeneity affects clonal evolution and metastasis, and (3) discuss the promise for integrating SCS in the clinical care arena for improved patient care.
Kuznetsova, A A; Lukyanets, E A; Solovyeva, L I; Knorre, D G; Fedorova, O S
2008-12-01
Design of chemically modified oligonucleotides for regulation of gene expression has attracted considerable attention over the past decades. One actively pursued approach involves antisense or antigene oligonucleotide constructs carrying reactive groups, many of these based on transition metal complexes. The complexes of Fe(II) and Co(II) with phthalocyanines are extremely good catalysts of oxidation of organic compounds with molecular oxygen and hydrogen peroxide. The binding of positively charged Fe(II) and Co(II) phthalocyanines with single- and double-stranded DNA was investigated. It was shown that these phthalocyanines interact with nucleic acids through an outside binding mode. The site-directed modification of single-stranded DNA by O2 and H2O2 in the presence of dimeric complexes of negatively and positively charged Fe(II) and Co(II) phthalocyanines was investigated. These complexes were formed directly on single-stranded DNA through interaction between negatively charged phthalocyanine in conjugate and positively charged phthalocyanine in solution. The resulting oppositely charged phthalocyanine complexes showed significant increase of catalytic activity compared with monomeric forms of phthalocyanines Fe(II) and Co(II). These complexes catalyzed the DNA oxidation with high efficacy and led to direct DNA strand cleavage. It was determined that oxidation of DNA by molecular oxygen catalyzed by complex of Fe(II)-phthalocyanines proceeds with higher rate than in the case of Co(II)-phthalocyanines but the latter led to a greater extent of target DNA modification.
Telonis-Scott, Marina; Sgrò, Carla M.; Hoffmann, Ary A.; Griffin, Philippa C.
2016-01-01
Repeated attempts to map the genomic basis of complex traits often yield different outcomes because of the influence of genetic background, gene-by-environment interactions, and/or statistical limitations. However, where repeatability is low at the level of individual genes, overlap often occurs in gene ontology categories, genetic pathways, and interaction networks. Here we report on the genomic overlap for natural desiccation resistance from a Pool-genome-wide association study experiment and a selection experiment in flies collected from the same region in southeastern Australia in different years. We identified over 600 single nucleotide polymorphisms associated with desiccation resistance in flies derived from almost 1,000 wild-caught genotypes, a similar number of loci to that observed in our previous genomic study of selected lines, demonstrating the genetic complexity of this ecologically important trait. By harnessing the power of cross-study comparison, we narrowed the candidates from almost 400 genes in each study to a core set of 45 genes, enriched for stimulus, stress, and defense responses. In addition to gene-level overlap, there was higher order congruence at the network and functional levels, suggesting genetic redundancy in key stress sensing, stress response, immunity, signaling, and gene expression pathways. We also identified variants linked to different molecular aspects of desiccation physiology previously verified from functional experiments. Our approach provides insight into the genomic basis of a complex and ecologically important trait and predicts candidate genetic pathways to explore in multiple genetic backgrounds and related species within a functional framework. PMID:26733490
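The cross-study narrowing described above (almost 400 candidate genes per study reduced to a shared core of 45) amounts to intersecting two gene lists and comparing the overlap with what chance alone would give. A minimal Python sketch, assuming each study yields a plain list of gene identifiers; the function names, the toy identifiers, and the gene-universe size of 14,000 are illustrative assumptions, not values taken from the study:

```python
def core_candidates(study_a, study_b):
    """Return the sorted gene identifiers reported by both studies."""
    return sorted(set(study_a) & set(study_b))


def expected_overlap(n_a, n_b, universe_size):
    """Mean overlap if both lists were drawn at random from the same
    gene universe (mean of the hypergeometric distribution)."""
    return n_a * n_b / universe_size


# Toy identifiers, purely illustrative.
gwas_hits = ["geneA", "geneB", "geneC", "geneD"]
selection_hits = ["geneC", "geneD", "geneE"]
shared = core_candidates(gwas_hits, selection_hits)

# Chance expectation for two lists of 400 genes, ASSUMING a universe
# of 14,000 genes (hypothetical figure, not from the abstract).
chance = expected_overlap(400, 400, 14000)
```

An observed overlap well above the chance expectation is what makes a shared core set informative; a formal test would use the full hypergeometric distribution rather than its mean.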
Laarits, T; Bordalo, P; Lemos, B
2016-08-01
Regulatory networks play a central role in the modulation of gene expression, the control of cellular differentiation, and the emergence of complex phenotypes. Regulatory networks could constrain or facilitate evolutionary adaptation in gene expression levels. Here, we model the adaptation of regulatory networks and gene expression levels to a shift in the environment that alters the optimal expression level of a single gene. Our analyses show signatures of natural selection on regulatory networks that both constrain and facilitate rapid evolution of gene expression level towards new optima. The analyses are interpreted from the standpoint of neutral expectations and illustrate the challenge to making inferences about network adaptation. Furthermore, we examine the consequence of variable stabilizing selection across genes on the strength and direction of interactions in regulatory networks and in their subsequent adaptation. We observe that directional selection on a highly constrained gene previously under strong stabilizing selection was more efficient when the gene was embedded within a network of partners under relaxed stabilizing selection pressure. The observation leads to the expectation that evolutionarily resilient regulatory networks will contain optimal ratios of genes whose expression is under weak and strong stabilizing selection. Altogether, our results suggest that the variable strengths of stabilizing selection across genes within regulatory networks might itself contribute to the long-term adaptation of complex phenotypes. Journal of Evolutionary Biology © 2016 European Society For Evolutionary Biology.
Negre, Bárbara; Casillas, Sònia; Suzanne, Magali; Sánchez-Herrero, Ernesto; Akam, Michael; Nefedov, Michael; Barbadilla, Antonio; de Jong, Pieter; Ruiz, Alfredo
2005-01-01
Homeotic (Hox) genes are usually clustered and arranged in the same order as they are expressed along the anteroposterior body axis of metazoans. The mechanistic explanation for this colinearity has been elusive, and it may well be that a single and universal cause does not exist. The Hox-gene complex (HOM-C) has been rearranged differently in several Drosophila species, producing a striking diversity of Hox gene organizations. We investigated the genomic and functional consequences of the two HOM-C splits present in Drosophila buzzatii. Firstly, we sequenced two regions of the D. buzzatii genome, one containing the genes labial and abdominal A, and another one including proboscipedia, and compared their organization with that of D. melanogaster and D. pseudoobscura in order to map precisely the two splits. Then, a plethora of conserved noncoding sequences, which are putative enhancers, were identified around the three Hox genes closer to the splits. The position and order of these enhancers are conserved, with minor exceptions, between the three Drosophila species. Finally, we analyzed the expression patterns of the same three genes in embryos and imaginal discs of four Drosophila species with different Hox-gene organizations. The results show that their expression patterns are conserved despite the HOM-C splits. We conclude that, in Drosophila, Hox-gene clustering is not an absolute requirement for proper function. Rather, the organization of Hox genes is modular, and their clustering seems the result of phylogenetic inertia more than functional necessity. PMID:15867430
Functional Dynamics within the Human Ribosome Regulate the Rate of Active Protein Synthesis.
Ferguson, Angelica; Wang, Leyi; Altman, Roger B; Terry, Daniel S; Juette, Manuel F; Burnett, Benjamin J; Alejo, Jose L; Dass, Randall A; Parks, Matthew M; Vincent, C Theresa; Blanchard, Scott C
2015-11-05
The regulation of protein synthesis contributes to gene expression in both normal physiology and disease, yet kinetic investigations of the human translation mechanism are currently lacking. Using single-molecule fluorescence imaging methods, we have quantified the nature and timing of structural processes in human ribosomes during single-turnover and processive translation reactions. These measurements reveal that functional complexes exhibit dynamic behaviors and thermodynamic stabilities distinct from those observed for bacterial systems. Structurally defined sub-states of pre- and post-translocation complexes were sensitive to specific inhibitors of the eukaryotic ribosome, demonstrating the utility of this platform to probe drug mechanism. The application of three-color single-molecule fluorescence resonance energy transfer (smFRET) methods further revealed a long-distance allosteric coupling between distal tRNA binding sites within ribosomes bearing three tRNAs, which contributed to the rate of processive translation. Copyright © 2015 Elsevier Inc. All rights reserved.
Functional dynamics within the human ribosome regulate the rate of active protein synthesis
Ferguson, Angelica; Wang, Leyi; Altman, Roger B.; Terry, Daniel S.; Juette, Manuel F.; Burnett, Benjamin J.; Alejo, Jose L.; Dass, Randall A.; Parks, Matthew M.; Vincent, Theresa C.; Blanchard, Scott C.
2015-01-01
SUMMARY The regulation of protein synthesis contributes to gene expression in both normal physiology and disease, yet kinetic investigations of the human translation mechanism are currently lacking. Using single-molecule fluorescence imaging methods, we have quantified the nature and timing of structural processes in human ribosomes during single-turnover and processive translation reactions. These measurements reveal that functional complexes exhibit dynamic behaviors and thermodynamic stabilities distinct from those observed for bacterial systems. Structurally defined sub-states of pre- and post-translocation complexes were sensitive to specific inhibitors of the eukaryotic ribosome demonstrating the utility of this platform to probe drug mechanism. The application of three-color single-molecule FRET methods further revealed a long-distance allosteric coupling between distal tRNA binding sites within ribosomes bearing three tRNAs, which contributed to the rate of processive translation. PMID:26593721
Autism biomarkers: challenges, pitfalls and possibilities.
Anderson, George M
2015-04-01
Network perspectives, in their emphasis on components and their interactions, might afford the best approach to the complexities of the ASD realm. Categorical approaches are unlikely to be fruitful as one should not expect to find a single or even predominant underlying cause of autism behavior across individuals. It is possible that the complex, highly interactive, heterogeneous and individualistic nature of the autism realm is intractable in terms of identifying clinically useful biomarker tests. It is hopeful from an emergenic perspective that small corrective changes in a single component of a deleterious network/configuration might have large beneficial consequences on developmental trajectories and in later treatment. It is suggested that the relationship between ASD and intellectual disability might be fundamentally different in single-gene versus nonsyndromic ASD. It is strongly stated that available biomarker "tests" for autism/ASD will do more harm than good. Finally, the serotonin-melatonin-oxidative stress-placental intersection might be an especially fruitful area of biological investigation.
Colloquium paper: bioenergetics, the origins of complexity, and the ascent of man.
Wallace, Douglas C
2010-05-11
Complex structures are generated and maintained through energy flux. Structures embody information, and biological information is stored in nucleic acids. The progressive increase in biological complexity over geologic time is thus the consequence of the information-generating power of energy flow plus the information-accumulating capacity of DNA, winnowed by natural selection. Consequently, the most important component of the biological environment is energy flow: the availability of calories and their use for growth, survival, and reproduction. Animals can exploit and adapt to available energy resources at three levels. They can evolve different anatomical forms through nuclear DNA (nDNA) mutations permitting exploitation of alternative energy reservoirs, resulting in new species. They can evolve modified bioenergetic physiologies within a species, primarily through the high mutation rate of mitochondrial DNA (mtDNA)-encoded bioenergetic genes, permitting adjustment to regional energetic environments. They can alter the epigenomic regulation of the thousands of dispersed bioenergetic genes via mitochondrially generated high-energy intermediates permitting individual accommodation to short-term environmental energetic fluctuations. Because medicine pertains to a single species, Homo sapiens, functional human variation often involves sequence changes in bioenergetic genes, most commonly mtDNA mutations, plus changes in the expression of bioenergetic genes mediated by the epigenome. Consequently, common nDNA polymorphisms in anatomical genes may represent only a fraction of the genetic variation associated with the common "complex" diseases, and the ascent of man has been the product of 3.5 billion years of information generation by energy flow, accumulated and preserved in DNA and edited by natural selection.
L(3)mbt and the LINT complex safeguard cellular identity in the Drosophila ovary.
Coux, Rémi-Xavier; Teixeira, Felipe Karam; Lehmann, Ruth
2018-04-04
Maintenance of cellular identity is essential for tissue development and homeostasis. At the molecular level, cell identity is determined by the coordinated activation and repression of defined sets of genes. The tumor suppressor L(3)mbt has been shown to secure cellular identity in Drosophila larval brains by repressing germline-specific genes. Here, we interrogate the temporal and spatial requirements for L(3)mbt in the Drosophila ovary, and show that it safeguards the integrity of both somatic and germline tissues. l(3)mbt mutant ovaries exhibit multiple developmental defects, which we find to be largely caused by the inappropriate expression of a single gene, nanos, a key regulator of germline fate, in the somatic ovarian cells. In the female germline, we find that L(3)mbt represses testis-specific and neuronal genes. At the molecular level, we show that L(3)mbt function in the ovary is mediated through its co-factor Lint-1 but independently of the dREAM complex. Together, our work uncovers a more complex role for L(3)mbt than previously understood and demonstrates that L(3)mbt secures tissue identity by preventing the simultaneous expression of original identity markers and tissue-specific misexpression signatures. © 2018. Published by The Company of Biologists Ltd.
Development of an ELA-DRA gene typing method based on pyrosequencing technology.
Díaz, S; Echeverría, M G; It, V; Posik, D M; Rogberg-Muñoz, A; Pena, N L; Peral-García, P; Vega-Pla, J L; Giovambattista, G
2008-11-01
The polymorphism of the equine lymphocyte antigen (ELA) class II DRA gene has been detected by polymerase chain reaction-single-strand conformational polymorphism (PCR-SSCP) and reference strand-mediated conformation analysis. These methodologies allowed the identification of 11 ELA-DRA exon 2 sequences, three of which are widely distributed among domestic horse breeds. Herein, we describe the development of a pyrosequencing-based method applicable to ELA-DRA typing, by screening samples from eight different horse breeds previously typed by PCR-SSCP. This sequence-based method would be useful in high-throughput genotyping of major histocompatibility complex genes in horses and other animal species, making this system interesting as a rapid screening method for animal genotyping of immune-related genes.
Single-cell transcriptional analysis of taste sensory neuron pair in Caenorhabditis elegans.
Takayama, Jun; Faumont, Serge; Kunitomo, Hirofumi; Lockery, Shawn R; Iino, Yuichi
2010-01-01
The nervous system is composed of a wide variety of neurons. A description of the transcriptional profiles of each neuron would yield enormous information about the molecular mechanisms that define morphological or functional characteristics. Here we show that RNA isolation from single neurons is feasible by using an optimized mRNA tagging method. This method extracts transcripts in the target cells by co-immunoprecipitation of the complexes of RNA and epitope-tagged poly(A) binding protein expressed specifically in the cells. With this method and genome-wide microarray, we compared the transcriptional profiles of two functionally different neurons in the main C. elegans gustatory neuron class ASE. Eight of the 13 known subtype-specific genes were successfully detected. Additionally, we identified nine novel genes including a receptor guanylyl cyclase, secreted proteins, a TRPC channel and uncharacterized genes conserved among nematodes, suggesting the two neurons are more different than previously thought. The expression of these novel genes was controlled by the previously known regulatory network for subtype differentiation. We also describe unique motif organization within individual gene groups classified by the expression patterns in ASE. Our study paves the way to the complete catalog of the expression profiles of individual C. elegans neurons.
Walkowiak, Sean; Rowland, Owen; Rodrigue, Nicolas; Subramaniam, Rajagopal
2016-12-09
The Fusarium graminearum species complex is composed of many distinct fungal species that cause several diseases in economically important crops, including Fusarium Head Blight of wheat. Despite being closely related, these species and individuals within species have distinct phenotypic differences in toxin production and pathogenicity, with some isolates reported as non-pathogenic on certain hosts. In this report, we compare genomes and gene content of six new isolates from the species complex, including the first available genomes of F. asiaticum and F. meridionale, with four other genomes reported in previous studies. A comparison of genome structure and gene content revealed a 93-99% overlap across all ten genomes. We identified more than 700 k base pairs (kb) of single nucleotide polymorphisms (SNPs), insertions, and deletions (indels) within common regions of the genome, which validated the species and genetic populations reported within species. We constructed a non-redundant pan gene list containing 15,297 genes from the ten genomes and among them 1827 genes or 12% were absent in at least one genome. These genes were co-localized in telomeric regions and select regions within chromosomes with a corresponding increase in SNPs and indels. Many are also predicted to encode for proteins involved in secondary metabolism and other functions associated with disease. Genes that were common between isolates contained high levels of nucleotide variation and may be pseudogenes, allelic, or under diversifying selection. The genomic resources we have contributed will be useful for the identification of genes that contribute to the phenotypic variation and niche specialization that have been reported among members of the F. graminearum species complex.
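The pan-gene figures quoted above (15,297 genes in the non-redundant list, of which 1827 are absent from at least one genome) can be reproduced from a gene presence/absence table. A minimal Python sketch, assuming the table is held as a mapping from gene ID to the set of genomes that carry it; the function name, data layout, and toy entries are hypothetical and are not the authors' pipeline:

```python
def pan_genome_summary(presence, genomes):
    """Count pan, core, and variable genes.

    presence: dict mapping gene ID -> set of genome names carrying it.
    genomes:  iterable of all genome names in the comparison.
    """
    all_genomes = set(genomes)
    pan = len(presence)
    # A core gene is present in every genome of the comparison.
    core = sum(1 for carriers in presence.values() if carriers >= all_genomes)
    variable = pan - core  # absent in at least one genome
    fraction = variable / pan if pan else 0.0
    return {"pan": pan, "core": core, "variable": variable,
            "variable_fraction": fraction}


# Toy table with three genomes, purely illustrative.
toy_genomes = ["iso1", "iso2", "iso3"]
toy_presence = {
    "g1": {"iso1", "iso2", "iso3"},
    "g2": {"iso1", "iso2"},
    "g3": {"iso3"},
    "g4": {"iso1", "iso2", "iso3"},
}
toy_summary = pan_genome_summary(toy_presence, toy_genomes)

# Consistency check on the figures stated in the abstract.
reported_percent = round(100 * 1827 / 15297)
```

The last line only checks the arithmetic of the reported values: 1827 of 15,297 is about 11.9%, which rounds to the 12% given in the abstract.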
Baurens, Franc-Christophe; Bocs, Stéphanie; Rouard, Mathieu; Matsumoto, Takashi; Miller, Robert N G; Rodier-Goud, Marguerite; MBéguié-A-MBéguié, Didier; Yahiaoui, Nabila
2010-07-16
Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M. balbisiana haplotypes. 
A large RGA08 gene cluster identified in wild banana corresponds to a highly variable genomic region between haplotypes surrounded by conserved flanking regions. High level of sequence identity (70 to 99%) of the genic and intergenic regions suggests a recent and rapid evolution of this cluster in M. balbisiana.
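The divergence-time estimate mentioned in the abstract above rests on the standard molecular-clock relation between synonymous divergence and time. The abstract does not state the substitution rate or the observed divergence, so the numbers below are illustrative assumptions chosen only to show how an estimate of about 1 million years can arise:

```latex
% K_s : synonymous substitutions per synonymous site between the two haplotypes
% r   : synonymous substitution rate per site per year (assumed constant)
% The factor 2 accounts for both lineages accumulating changes independently.
T = \frac{K_s}{2r}

% Illustrative values only (not from the source):
% K_s = 0.009, \quad r = 4.5 \times 10^{-9}
T = \frac{0.009}{2 \times 4.5 \times 10^{-9}} = 1 \times 10^{6}\ \text{years}
```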
Davey, Mark W; Graham, Neil S; Vanholme, Bartel; Swennen, Rony; May, Sean T; Keulemans, Johan
2009-01-01
Background 'Systems-wide' approaches such as microarray RNA-profiling are ideally suited to the study of the complex overlapping responses of plants to biotic and abiotic stresses. However, commercial microarrays are only available for a limited number of plant species and development costs are so substantial as to be prohibitive for most research groups. Here we evaluate the use of cross-hybridisation to Affymetrix oligonucleotide GeneChip® microarrays to profile the response of the banana (Musa spp.) leaf transcriptome to drought stress using a genomic DNA (gDNA)-based probe-selection strategy to improve the efficiency of detection of differentially expressed Musa transcripts. Results Following cross-hybridisation of Musa gDNA to the Rice GeneChip® Genome Array, ~33,700 gene-specific probe-sets had a sufficiently high degree of homology to be retained for transcriptomic analyses. In a proof-of-concept approach, pooled RNA representing a single biological replicate of control and drought stressed leaves of the Musa cultivar 'Cachaco' were hybridised to the Affymetrix Rice Genome Array. A total of 2,910 Musa gene homologues with a >2-fold difference in expression levels were subsequently identified. These drought-responsive transcripts included many functional classes associated with plant biotic and abiotic stress responses, as well as a range of regulatory genes known to be involved in coordinating abiotic stress responses. This latter group included members of the ERF, DREB, MYB, bZIP and bHLH transcription factor families. Fifty-two of these drought-sensitive Musa transcripts were homologous to genes underlying QTLs for drought and cold tolerance in rice, including in 2 instances QTLs associated with a single underlying gene. The list of drought-responsive transcripts also included genes identified in publicly-available comparative transcriptomics experiments. 
Conclusion Our results demonstrate that despite the general paucity of nucleotide sequence data in Musa and only distant phylogenetic relations to rice, gDNA probe-based cross-hybridisation to the Rice GeneChip® is a highly promising strategy to study complex biological responses and illustrates the potential of such strategies for gene discovery in non-model species. PMID:19758430
Complex Adaptive System Models and the Genetic Analysis of Plasma HDL-Cholesterol Concentration
Rea, Thomas J.; Brown, Christine M.; Sing, Charles F.
2006-01-01
Despite remarkable advances in diagnosis and therapy, ischemic heart disease (IHD) remains a leading cause of morbidity and mortality in industrialized countries. Recent efforts to estimate the influence of genetic variation on IHD risk have focused on predicting individual plasma high-density lipoprotein cholesterol (HDL-C) concentration. Plasma HDL-C concentration (mg/dl), a quantitative risk factor for IHD, has a complex multifactorial etiology that involves the actions of many genes. Single gene variations may be necessary but are not individually sufficient to predict a statistically significant increase in risk of disease. The complexity of phenotype-genotype-environment relationships involved in determining plasma HDL-C concentration has challenged commonly held assumptions about genetic causation and has led to the question of which combination of variations, in which subset of genes, in which environmental strata of a particular population significantly improves our ability to predict high or low risk phenotypes. We document the limitations of inferences from genetic research based on commonly accepted biological models, consider how evidence for real-world dynamical interactions between HDL-C determinants challenges the simplifying assumptions implicit in traditional linear statistical genetic models, and conclude by considering research options for evaluating the utility of genetic information in predicting traits with complex etiologies. PMID:17146134
Xue, Huping; Wu, Zhaowei; Li, Longping; Li, Fan; Wang, Yiqing
2015-01-01
The structure of a composite staphylococcal cassette chromosome (SCC) carried by a methicillin-resistant Staphylococcus haemolyticus (NW19A) isolated from a bovine milk sample was analyzed. The formation of the circular forms of both single SCC elements and composite SCC elements was detected in NW19A. Twenty heavy metal and antibiotic resistance-related genes coexisted in this composite SCC, suggesting that these genes might be coselected under environmental pressure. The mec gene complex in NW19A, designated type C3, is different from classic C1 or C2 gene complexes structurally and likely evolves differently. Furthermore, results from alignment of the SCC composite island of NW19A with 50 related sequences from different staphylococcal strains provided additional evidence to support the notion that coagulase-negative staphylococci (CoNS) are the original host of heavy metal resistance genes among staphylococci. Given that a SCC composite island could transfer freely among different staphylococcal species from different hosts, more attention should be paid to contamination with heavy metals and antibiotics in dairy farming environments, including wastewater, soil, feces, and feed. PMID:26169408
Xue, Huping; Wu, Zhaowei; Li, Longping; Li, Fan; Wang, Yiqing; Zhao, Xin
2015-09-01
The structure of a composite staphylococcal cassette chromosome (SCC) carried by a methicillin-resistant Staphylococcus haemolyticus (NW19A) isolated from a bovine milk sample was analyzed. The formation of the circular forms of both single SCC elements and composite SCC elements was detected in NW19A. Twenty heavy metal and antibiotic resistance-related genes coexisted in this composite SCC, suggesting that these genes might be coselected under environmental pressure. The mec gene complex in NW19A, designated type C3, is different from classic C1 or C2 gene complexes structurally and likely evolves differently. Furthermore, results from alignment of the SCC composite island of NW19A with 50 related sequences from different staphylococcal strains provided additional evidence to support the notion that coagulase-negative staphylococci (CoNS) are the original host of heavy metal resistance genes among staphylococci. Given that a SCC composite island could transfer freely among different staphylococcal species from different hosts, more attention should be paid to contamination with heavy metals and antibiotics in dairy farming environments, including wastewater, soil, feces, and feed. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Zhang, Honglian; Gan, Haiyun; Wang, Zhiquan; Lee, Jeong-Heon; Zhou, Hui; Ordog, Tamas; Wold, Marc S; Ljungman, Mats; Zhang, Zhiguo
2017-01-19
The histone chaperone HIRA is involved in depositing histone variant H3.3 into distinct genic regions, including promoters, enhancers, and gene bodies. However, how HIRA deposits H3.3 to these regions remains elusive. Through a short hairpin RNA (shRNA) screening, we identified single-stranded DNA binding protein replication protein A (RPA) as a regulator of the deposition of newly synthesized H3.3 into chromatin. We show that RPA physically interacts with HIRA to form RPA-HIRA-H3.3 complexes, and it co-localizes with HIRA and H3.3 at gene promoters and enhancers. Depletion of RPA1, the largest subunit of the RPA complex, dramatically reduces both HIRA association with chromatin and the deposition of newly synthesized H3.3 at promoters and enhancers and leads to altered transcription at gene promoters. These results support a model whereby RPA, best known for its role in DNA replication and repair, recruits HIRA to promoters and enhancers and regulates deposition of newly synthesized H3.3 to these regulatory elements for gene regulation. Copyright © 2017 Elsevier Inc. All rights reserved.
Bock, Sylvia; Ortelt, Jennifer; Link, Gerhard
2014-01-01
Plants contain a nuclear gene family for plastid sigma factors, i.e., proteins that associate with the “bacterial-type” organellar RNA polymerase and confer the ability for correct promoter binding and transcription initiation. Questions that are still unresolved relate to the “division of labor” among members of the sigma family, both in terms of their range of target genes and their temporal and spatial activity during development. Clues to the in vivo role of individual sigma genes have mainly come from studies of sigma knockout lines. Despite its obvious strengths, however, this strategy does not necessarily trace-down causal relationships between mutant phenotype and a single sigma gene, if other family members act in a redundant and/or compensatory manner. We made efforts to reduce the complexity by genetic crosses of Arabidopsis single mutants (with focus on a chlorophyll-deficient sig6 line) to generate double knockout lines. The latter typically had a similar visible phenotype as the parental lines, but tended to be more strongly affected in the transcript patterns of both plastid and sigma genes. Because triple mutants were lethal under our growth conditions, we exploited a strategy of transformation of single and double mutants with RNAi constructs that contained sequences from the unconserved sigma region (UCR). These RNAi/knockout lines phenotypically resembled their parental lines, but were even more strongly affected in their plastid transcript patterns. Expression patterns of sigma genes revealed both similarities and differences compared to the parental lines, with transcripts at reduced or unchanged amounts and others that were found to be present in higher (perhaps compensatory) amounts. Together, our results reveal considerable flexibility of gene activity at the levels of both sigma and plastid gene expression. A (still viable) “basal state” seems to be reached, if 2–3 of the 6 Arabidopsis sigma genes are functionally compromised. PMID:25505479
Structure of novel rat major histocompatibility complex class II genes RT1.Ha and Hb
DOE Office of Scientific and Technical Information (OSTI.GOV)
Arimura, Yutaka; Tang, Wei Ran; Koda, Toshiaki
1995-03-01
We have cloned the novel rat MHC class II genes, RT1.Ha and Hb, which are homologous to human HLA-DPA and DPB. RT1.Hb is a pseudogene, whereas RT1.Ha is apparently intact and may have transcriptional potential. In addition, with an RT1.Ha probe, we detected a single Southern hybridization band in the genome of the mouse. This finding may afford an opportunity to analyze the HLA-DPA homologue in the mouse genome. 18 refs., 4 figs., 1 tab.
Programming Morphogenesis through Systems and Synthetic Biology.
Velazquez, Jeremy J; Su, Emily; Cahan, Patrick; Ebrahimkhani, Mo R
2018-04-01
Mammalian tissue development is an intricate, spatiotemporal process of self-organization that emerges from gene regulatory networks of differentiating stem cells. A major goal in stem cell biology is to gain a sufficient understanding of gene regulatory networks and cell-cell interactions to enable the reliable and robust engineering of morphogenesis. Here, we review advances in synthetic biology, single cell genomics, and multiscale modeling, which, when synthesized, provide a framework to achieve the ambitious goal of programming morphogenesis in complex tissues and organoids. Copyright © 2017 Elsevier Ltd. All rights reserved.
Mitochondrial introgression and complex biogeographic history of the genus Picea.
Ran, Jin-Hua; Shen, Ting-Ting; Liu, Wen-Juan; Wang, Pei-Pei; Wang, Xiao-Quan
2015-12-01
The biogeographic history of plants is much more complex in the Northern Hemisphere than in the Southern Hemisphere because both the Bering and the North Atlantic land bridges contributed to floristic exchanges in the Cenozoic, which led to hybridization between congeneric species from different continents. It would be interesting to know how intercontinental gene flow and introgression have affected plant phylogenetic reconstruction and biogeographic inference. In this study, we reinvestigated the phylogenetic and biogeographic history of Picea, a main component of the Northern Hemisphere forest with many species that originated from recent radiation, using two chloroplast (cp), one mitochondrial (mt) and three single-copy nuclear gene markers. The generated gene trees are topologically highly discordant and the geographically closely related species generally show a close affinity of mtDNA rather than cp- or nuclear DNA, suggesting that inter- and intra-continental gene flow and mtDNA introgression might have occurred commonly. However, all gene trees resolved Picea breweriana as the basal-most lineage, which, together with fossil evidence, supports the North American origin hypothesis for the genus. Both dispersal and vicariance have played important roles in the evolution of Picea, and the Bering Land Bridge could have mediated the "North America to Eurasia" dispersal at least two times during the Miocene and Pliocene. Our study again demonstrates the importance of applying data from three genomes for a clear understanding of evolutionary histories in the pine family. Any markers from a single genome alone will not reveal a clear picture of the phylogenetic relationships among closely related congeneric species.
In particular, mtDNA markers should be cautiously used, considering that introgression of the maternally inherited mtDNA with a lower rate of gene flow (by seeds) could have occurred much more frequently than that of the paternally inherited cpDNA with a higher rate of gene flow (by pollen) in Pinaceae. Copyright © 2015 Elsevier Inc. All rights reserved.
eQTL networks unveil enriched mRNA master integrators downstream of complex disease-associated SNPs.
Li, Haiquan; Pouladi, Nima; Achour, Ikbel; Gardeux, Vincent; Li, Jianrong; Li, Qike; Zhang, Hao Helen; Martinez, Fernando D; 'Skip' Garcia, Joe G N; Lussier, Yves A
2015-12-01
The causal and interplay mechanisms of Single Nucleotide Polymorphisms (SNPs) associated with complex diseases (complex disease SNPs) investigated in genome-wide association studies (GWAS) at the transcriptional level (mRNA) are poorly understood despite recent advancements such as discoveries reported in the Encyclopedia of DNA Elements (ENCODE) and Genotype-Tissue Expression (GTEx). Protein interaction network analyses have successfully improved our understanding of both single gene diseases (Mendelian diseases) and complex diseases. Whether the mRNAs downstream of complex disease genes are central or peripheral in the genetic information flow relating DNA to mRNA remains unclear and may be disease-specific. Using expression Quantitative Trait Loci (eQTL) that provide DNA to mRNA associations and network centrality metrics, we hypothesize that we can unveil the systems properties of information flow between SNPs and the transcriptomes of complex diseases. We compare different conditions such as naïve SNP assignments and stringent linkage disequilibrium (LD) free assignments for transcripts to remove confounders from LD. Additionally, we compare the results from eQTL networks between lymphoblastoid cell lines and liver tissue. Empirical permutation resampling (p<0.001) and theoretic Mann-Whitney U test (p<10^-30) statistics indicate that mRNAs corresponding to complex disease SNPs via eQTL associations are likely to be regulated by a larger number of SNPs than expected. We name this novel property mRNA hubness in eQTL networks, and further term mRNAs with high hubness as master integrators. mRNA master integrators receive and coordinate the perturbation signals from large numbers of polymorphisms and respond to the personal genetic architecture integratively.
This genetic signal integration contrasts with the mechanism underlying some Mendelian diseases, where a genetic polymorphism affecting a single protein hub produces a divergent signal that affects a large number of downstream proteins. Indeed, we verify that this property is independent of the hubness in protein networks for which these mRNAs are transcribed. Our findings provide novel insights into the pleiotropy of mRNAs targeted by complex disease polymorphisms and the architecture of the information flow between the genetic polymorphisms and transcriptomes of complex diseases. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
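The hubness idea in the abstract above can be illustrated with a minimal sketch. It assumes that "hubness" is the number of distinct SNPs linked to an mRNA by eQTL associations and that the permutation test compares mean hubness against random mRNA sets of equal size; the function names, the toy edges and the exact test statistic are illustrative assumptions, not the authors' implementation.

```python
import random
from collections import defaultdict

def mrna_hubness(eqtl_edges):
    """Count the distinct SNPs associated with each mRNA (its in-degree)."""
    snps_per_mrna = defaultdict(set)
    for snp, mrna in eqtl_edges:
        snps_per_mrna[mrna].add(snp)
    return {mrna: len(snps) for mrna, snps in snps_per_mrna.items()}

def permutation_p_value(hubness, disease_mrnas, n_permutations=10000, seed=0):
    """One-sided empirical p-value: is the mean hubness of the disease set
    at least as large as that of random mRNA sets of the same size?"""
    rng = random.Random(seed)
    all_mrnas = sorted(hubness)
    k = len(disease_mrnas)
    observed = sum(hubness[m] for m in disease_mrnas) / k
    hits = 0
    for _ in range(n_permutations):
        sample = rng.sample(all_mrnas, k)
        if sum(hubness[m] for m in sample) / k >= observed:
            hits += 1
    # Add-one correction keeps the estimate away from an impossible p = 0.
    return (hits + 1) / (n_permutations + 1)

# Toy, made-up eQTL edges (SNP -> mRNA); not data from the study.
edges = [("rs1", "A"), ("rs2", "A"), ("rs3", "A"), ("rs4", "A"),
         ("rs5", "B"), ("rs6", "C"), ("rs7", "D"), ("rs1", "E")]
hubness = mrna_hubness(edges)
p = permutation_p_value(hubness, disease_mrnas=["A"], n_permutations=2000)
```

With only five mRNAs in the toy network, a single hub cannot reach a small p-value; the sketch shows the mechanics of the comparison rather than a realistic effect size.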
A powerful approach reveals numerous expression quantitative trait haplotypes in multiple tissues.
Ying, Dingge; Li, Mulin Jun; Sham, Pak Chung; Li, Miaoxin
2018-04-26
Recently many studies showed single nucleotide polymorphisms (SNPs) affect gene expression and contribute to development of complex traits/diseases in a tissue context-dependent manner. However, little is known about haplotype's influence on gene expression and complex traits, which reflects the interaction effect between SNPs. In the present study, we first propose a regulatory region guided eQTL haplotype association analysis approach, and then systematically investigate the expression quantitative trait loci (eQTL) haplotypes in 20 different tissues with this approach. The approach has a powerful design of reducing computational burden by the utilization of regulatory predictions for candidate SNP selection and multiple testing corrections on non-independent haplotypes. The application results in multiple tissues showed that haplotype-based eQTLs not only increased the number of eQTL genes in a tissue specific manner, but were also enriched in loci that associated with complex traits in a tissue-matched manner. In addition, we found that tag SNPs of eQTL haplotypes from whole blood were selectively enriched in certain combination of regulatory elements (e.g. promoters and enhancers) according to predicted chromatin states. In summary, this eQTL haplotype detection approach, together with the application results, sheds light on the synergistic effect of sequence variants on gene expression and their susceptibility to complex diseases. The executable application "eHaplo" is implemented in Java and is publicly available at http://grass.cgs.hku.hk/limx/ehaplo/. Contact: jonsonfox@gmail.com, limiaoxin@mail.sysu.edu.cn. Supplementary data are available at Bioinformatics online.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rao, V.S.; Auerbach, S.A.; Farrer, L.A.
1996-09-01
Apolipoprotein E (APOE) genotype is the single most important determinant to the common form of Alzheimer disease (AD) yet identified. Several studies show that family history of AD is not entirely accounted for by APOE genotype. Also, there is evidence for an interaction between APOE genotype and gender. We carried out a complex segregation analysis in 636 nuclear families of consecutively ascertained and rigorously diagnosed probands in the Multi-Institutional Research in Alzheimer Genetic Epidemiology study in order to derive models of disease transmission which account for the influences of APOE genotype of the proband and gender. In the total group of families, models postulating sporadic occurrence, no major gene effect, random environmental transmission, and Mendelian inheritance were rejected. Transmission of AD in families of probands with at least one ε4 allele best fit a dominant model. Moreover, single gene inheritance best explained clustering of the disorder in families of probands lacking E4, but a more complex genetic model or multiple genetic models may ultimately account for risk in this group of families. Our results also suggest that susceptibility to AD differs between men and women regardless of the proband's APOE status. Assuming a dominant model, AD appears to be completely penetrant in women, whereas only 62%-65% of men with predisposing genotypes develop AD. However, parameter estimates from the arbitrary major gene model suggest that AD is expressed dominantly in women and additively in men. These observations, taken together with epidemiologic data, are consistent with the hypothesis of an interaction between genes and other biological factors affecting disease susceptibility. 76 refs., 4 tabs.
The search for the genetic basis of hypertension.
Yagil, Yoram; Yagil, Chana
2005-03-01
This review surveys the literature on the search for the genetic basis of hypertension during the 10 months since November 2003. The goals set forth by this search are defined and the highlights of the work accomplished are provided. The search for the genetic basis of hypertension is ongoing, generating an abundance of new data. These data consist of a large number of candidate genes, association of previously known and novel candidate genes with various facets of hypertension, detection of new quantitative trait loci and identification of genes that mediate susceptibility to hypertension. The renin-angiotensin-aldosterone system continues to dominate the interest of investigators. Other gene systems are also emerging but a single-gene system cannot be singled out beyond the renin-angiotensin-aldosterone system and the data are mostly sporadic and do not reflect a guided or coordinated effort to resolve unanswered issues. The notion that hypertension is polygenic is reinforced, yet few data are provided as to the actual number of genes involved, gene-gene interaction or gene-environment interaction. Advanced biotechnological tools involving transcriptomics and proteomics are underused. Research on the genetic basis of hypertension has generated over the past year a large number of candidate genes and tied them to various aspects of hypertension. How these genes fit into the complex pathophysiological network that induces hypertension remains unclear. The task of putting together these genes into a cohesive framework still lies ahead, but promises to enlighten us as to the true nature of hypertension, the pathogenic mechanisms involved and improved therapeutic and preventive measures.
Chaouch, Melek; Fathallah-Mili, Akila; Driss, Mehdi; Lahmadi, Ramzi; Ayari, Chiraz; Guizani, Ikram; Ben Said, Moncef; Benabderrazak, Souha
2013-03-01
Discrimination of the Old World Leishmania parasites is important for diagnosis and epidemiological studies of leishmaniasis. We have developed PCR assays that allow the discrimination between Leishmania major, Leishmania tropica and Leishmania infantum Tunisian species. The identification was performed by a simple PCR targeting cysteine protease B (cpb) gene copies. These PCR assays can serve as routine molecular biology tools for discrimination of Leishmania spp. from different geographical origins and different clinical forms. Our assays can also inform studies of the cpb gene in drug, diagnostics and vaccine research. The PCR products of the cpb gene and the N-acetylglucosamine-1-phosphate transferase (nagt) Leishmania gene were sequenced and aligned. Phylogenetic trees of Leishmania based on cpb and nagt sequences are close in topology and present the classic distribution of Leishmania in the Old World. The phylogenetic analysis has enabled the characterization and identification of different strains, using both multicopy (cpb) and single copy (nagt) genes. Indeed, the cpb phylogenetic analysis allowed us to identify the Tunisian Leishmania killicki species, and a group which gathers the least evolved isolates of the Leishmania donovani complex, which originated from East Africa. This clustering confirms the African origin for the visceralizing species of the L. donovani complex. Copyright © 2012 Elsevier B.V. All rights reserved.
Genomic analysis reveals extensive gene duplication within the bovine TRB locus
Connelley, Timothy; Aerts, Jan; Law, Andy; Morrison, W Ivan
2009-01-01
Background Diverse TR and IG repertoires are generated by V(D)J somatic recombination. Genomic studies have been pivotal in cataloguing the V, D, J and C genes present in the various TR/IG loci and describing how duplication events have expanded the number of these genes. Such studies have also provided insights into the evolution of these loci and the complex mechanisms that regulate TR/IG expression. In this study we analyze the sequence of the third bovine genome assembly to characterize the germline repertoire of bovine TRB genes and compare the organization, evolution and regulatory structure of the bovine TRB locus with that of humans and mice. Results The TRB locus in the third bovine genome assembly is distributed over 5 scaffolds, extending to ~730 Kb. The available sequence contains 134 TRBV genes, assigned to 24 subgroups, and 3 clusters of DJC genes, each comprising a single TRBD gene, 5–7 TRBJ genes and a single TRBC gene. Seventy-nine of the TRBV genes are predicted to be functional. Comparison with the human and murine TRB loci shows that the gene order, as well as the sequences of non-coding elements that regulate TRB expression, are highly conserved in the bovine. Dot-plot analyses demonstrate that expansion of the genomic TRBV repertoire has occurred via a complex and extensive series of duplications, predominantly involving DNA blocks containing multiple genes. These duplication events have resulted in massive expansion of several TRBV subgroups, most notably TRBV6, 9 and 21 which contain 40, 35 and 16 members respectively. Similarly, duplication has led to the generation of a third DJC cluster. Analysis of cDNA data confirms the diversity of the TRBV genes and, in addition, identifies a substantial number of TRBV genes, predominantly from the larger subgroups, which are still absent from the genome assembly.
The observed gene duplication within the bovine TRB locus has created a repertoire of phylogenetically diverse functional TRBV genes, which is substantially larger than that described for humans and mice. Conclusion The analyses completed in this study reveal that, although the gene content and organization of the bovine TRB locus are broadly similar to that of humans and mice, multiple duplication events have led to a marked expansion in the number of TRB genes. Similar expansions in other ruminant TR loci suggest strong evolutionary pressures in this lineage have selected for the development of enlarged sets of TR genes that can contribute to diverse TR repertoires. PMID:19393068
Intelligence: shared genetic basis between Mendelian disorders and a polygenic trait.
Franić, Sanja; Groen-Blokhuis, Maria M; Dolan, Conor V; Kattenberg, Mathijs V; Pool, René; Xiao, Xiangjun; Scheet, Paul A; Ehli, Erik A; Davies, Gareth E; van der Sluis, Sophie; Abdellaoui, Abdel; Hansell, Narelle K; Martin, Nicholas G; Hudziak, James J; van Beijsterveldt, Catherina E M; Swagerman, Suzanne C; Hulshoff Pol, Hilleke E; de Geus, Eco J C; Bartels, Meike; Ropers, H Hilger; Hottenga, Jouke-Jan; Boomsma, Dorret I
2015-10-01
Multiple inquiries into the genetic etiology of human traits indicated an overlap between genes underlying monogenic disorders (eg, skeletal growth defects) and those affecting continuous variability of related quantitative traits (eg, height). Extending the idea of a shared genetic basis between a Mendelian disorder and a classic polygenic trait, we performed an association study to examine the effect of 43 genes implicated in autosomal recessive cognitive disorders on intelligence in an unselected Dutch population (N=1316). Using both single-nucleotide polymorphism (SNP)- and gene-based association testing, we detected an association between intelligence and the genes of interest, with genes ELP2, TMEM135, PRMT10, and RGS7 showing the strongest associations. This is a demonstration of the relevance of genes implicated in monogenic disorders of intelligence to normal-range intelligence, and a corroboration of the utility of employing knowledge on monogenic disorders in identifying the genetic variability underlying complex traits.
Carrasco-Rando, Marta; Tutor, Antonio S.; Prieto-Sánchez, Silvia; González-Pérez, Esther; Barrios, Natalia; Letizia, Annalisa; Martín, Paloma; Campuzano, Sonsoles; Ruiz-Gómez, Mar
2011-01-01
A central issue of myogenesis is the acquisition of identity by individual muscles. In Drosophila, at the time muscle progenitors are singled out, they already express unique combinations of muscle identity genes. This muscle code results from the integration of positional and temporal signalling inputs. Here we identify, by means of loss-of-function and ectopic expression approaches, the Iroquois Complex homeobox genes araucan and caupolican as novel muscle identity genes that confer lateral transverse muscle identity. The acquisition of this fate requires that Araucan/Caupolican repress other muscle identity genes such as slouch and vestigial. In addition, we show that Caupolican-dependent slouch expression depends on the activation state of the Ras/Mitogen Activated Protein Kinase cascade. This provides a comprehensive insight into the way Iroquois genes integrate in muscle progenitors, signalling inputs that modulate gene expression and protein activity. PMID:21811416
USDA-ARS's Scientific Manuscript database
Genetic variants detected from sequence have been used to successfully identify causal variants and map complex traits in several organisms. High and moderate impact variants, those expected to alter or disrupt the protein coded by a gene and those that regulate protein production, likely have a mor...
USDA-ARS's Scientific Manuscript database
Viroids are the smallest known agents of infectious disease – small, single-stranded, highly structured, circular RNAs that lack detectable messenger RNA activity yet are able to replicate autonomously in susceptible plant species. Potato spindle tuber viroid (PSTVd) infection in tomato is accompan...
USDA-ARS's Scientific Manuscript database
The Zamia pumila L. complex (Cycadales: Zamiaceae) is a monophyletic, diploid (2n = 16) and distinctive assemblage of cycad populations restricted to the West Indies and southeastern U. S. that has been treated as comprising one to nine species. Our research project seeks to simultaneously investi...
Quantitative imaging of single mRNA splice variants in living cells
NASA Astrophysics Data System (ADS)
Lee, Kyuwan; Cui, Yi; Lee, Luke P.; Irudayaraj, Joseph
2014-06-01
Alternative messenger RNA (mRNA) splicing is a fundamental process of gene regulation, and errors in RNA splicing are known to be associated with a variety of different diseases. However, there is currently a lack of quantitative technologies for monitoring mRNA splice variants in cells. Here, we show that a combination of plasmonic dimer probes and hyperspectral imaging can be used to detect and quantify mRNA splice variants in living cells. The probes are made from gold nanoparticles functionalized with oligonucleotides and can hybridize to specific mRNA sequences, forming nanoparticle dimers that exhibit distinct spectral shifts due to plasmonic coupling. With this approach, we show that the spatial and temporal distribution of three selected splice variants of the breast cancer susceptibility gene, BRCA1, can be monitored at single-copy resolution by measuring the hybridization dynamics of the nanoplasmonic dimers. Our study provides insights into RNA and its transport in living cells, which could improve our understanding of cellular protein complexes, pharmacogenomics, genetic diagnosis and gene therapies.
Kiefer, Christiane; Koch, Marcus A.
2012-01-01
74 of the currently accepted 111 taxa of the North American genus Boechera (Brassicaceae) were subject to phylogenetic reconstruction and network analysis. The dataset comprised 911 accessions for which ITS sequences were analyzed. Phylogenetic analyses yielded largely unresolved trees. Together with the network analysis confirming this result, this can be interpreted as an indication for multiple, independent, and rapid diversification events. Network analyses were superimposed with datasets describing i) geographical distribution, ii) taxonomy, iii) reproductive mode, and iv) distribution history based on phylogeographic evidence. Our results provide first direct evidence for enormous reticulate evolution in the entire genus and give further insights into the evolutionary history of this complex genus on a continental scale. In addition, two novel single-copy gene markers, orthologues of the Arabidopsis thaliana genes At2g25920 and At3g18900, were analyzed for subsets of taxa and confirmed the findings obtained through the ITS data. PMID:22606266
Spatial transcriptomic survey of human embryonic cerebral cortex by single-cell RNA-seq analysis.
Fan, Xiaoying; Dong, Ji; Zhong, Suijuan; Wei, Yuan; Wu, Qian; Yan, Liying; Yong, Jun; Sun, Le; Wang, Xiaoye; Zhao, Yangyu; Wang, Wei; Yan, Jie; Wang, Xiaoqun; Qiao, Jie; Tang, Fuchou
2018-06-04
The cellular complexity of human brain development has been intensively investigated, although a regional characterization of the entire human cerebral cortex based on single-cell transcriptome analysis has not been reported. Here, we performed RNA-seq on over 4,000 individual cells from 22 brain regions of human mid-gestation embryos. We identified 29 cell sub-clusters, which showed different proportions in each region, with the pons showing an especially high percentage of astrocytes. Embryonic neurons were not as diverse as adult neurons, although they possessed important features of their destinies in adults. Neuron development was unsynchronized in the cerebral cortex, as dorsal regions appeared to be more mature than ventral regions at this stage. Region-specific genes were comprehensively identified in each neuronal sub-cluster, and a large proportion of these genes were neural disease related. Our results present a systematic landscape of the regionalized gene expression and neuron maturation of the human cerebral cortex.
Single molecule fluorescence microscopy for ultra-sensitive RNA expression profiling
NASA Astrophysics Data System (ADS)
Hesse, Jan; Jacak, Jaroslaw; Regl, Gerhard; Eichberger, Thomas; Aberger, Fritz; Schlapak, Robert; Howorka, Stefan; Muresan, Leila; Frischauf, Anna-Maria; Schütz, Gerhard J.
2007-02-01
We developed a microarray analysis platform for ultra-sensitive RNA expression profiling of minute samples. It utilizes a novel scanning system for single molecule fluorescence detection on cm² size samples in combination with specialized biochips, optimized for low autofluorescence and weak unspecific adsorption. 20 μg total RNA was extracted from 10^6 cells of a human keratinocyte cell line (HaCaT) and reversely transcribed in the presence of Alexa647-aha-dUTP. 1% of the resulting labeled cDNA was used for complex hybridization to a custom-made oligonucleotide microarray representing a set of 125 different genes. For low abundant genes, individual cDNA molecules hybridized to the microarray spots could be resolved. Single cDNA molecules hybridized to the chip surface appeared as diffraction limited features in the fluorescence images. The à trous wavelet method was utilized for localization and counting of the separated cDNA signals. Subsequently, the degree of labeling of the localized cDNA molecules was determined by brightness analysis for the different genes. Variations by factors up to 6 were found, which in conventional microarray analysis would result in a misrepresentation of the relative abundance of mRNAs.
High-coverage methylation data of a gene model before and after DNA damage and homologous repair.
Pezone, Antonio; Russo, Giusi; Tramontano, Alfonso; Florio, Ermanno; Scala, Giovanni; Landi, Rosaria; Zuchegna, Candida; Romano, Antonella; Chiariotti, Lorenzo; Muller, Mark T; Gottesman, Max E; Porcellini, Antonio; Avvedimento, Enrico V
2017-04-11
Genome-wide methylation analysis is limited by its low coverage and the inability to detect single variants below 10%. Quantitative analysis provides accurate information on the extent of methylation of single CpG dinucleotide, but it does not measure the actual polymorphism of the methylation profiles of single molecules. To understand the polymorphism of DNA methylation and to decode the methylation signatures before and after DNA damage and repair, we have deep sequenced in bisulfite-treated DNA a reporter gene undergoing site-specific DNA damage and homologous repair. In this paper, we provide information on the data generation, the rationale for the experiments and the type of assays used, such as cytofluorimetry and immunoblot data derived during a previous work published in Scientific Reports, describing the methylation and expression changes of a model gene (GFP) before and after formation of a double-strand break and repair by homologous-recombination or non-homologous-end-joining. These data provide: 1) a reference for the analysis of methylation polymorphism at selected loci in complex cell populations; 2) a platform and the tools to compare transcription and methylation profiles.
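The distinction drawn above between per-CpG methylation levels and the polymorphism of single-molecule methylation profiles can be made concrete with a small sketch. It assumes each bisulfite read is encoded as a string with "1" for a methylated CpG and "0" for an unmethylated one; the encoding, function names and toy reads are hypothetical and are not taken from the dataset.

```python
from collections import Counter

def per_cpg_methylation(reads):
    """Fraction of reads methylated at each CpG position."""
    n = len(reads)
    return [sum(r[i] == "1" for r in reads) / n for i in range(len(reads[0]))]

def epiallele_counts(reads):
    """Count how many molecules carry each distinct methylation profile."""
    return Counter(reads)

# Two made-up cell populations covering the same four CpGs.
pop_a = ["1100", "1100", "0011", "0011"]
pop_b = ["1010", "0101", "1001", "0110"]

avg_a = per_cpg_methylation(pop_a)
avg_b = per_cpg_methylation(pop_b)
profiles_a = epiallele_counts(pop_a)
profiles_b = epiallele_counts(pop_b)
```

Both toy populations are 50% methylated at every CpG, yet the first contains two distinct molecular profiles and the second four, which is the kind of difference that averaged measurements cannot show.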
High-coverage methylation data of a gene model before and after DNA damage and homologous repair
Pezone, Antonio; Russo, Giusi; Tramontano, Alfonso; Florio, Ermanno; Scala, Giovanni; Landi, Rosaria; Zuchegna, Candida; Romano, Antonella; Chiariotti, Lorenzo; Muller, Mark T.; Gottesman, Max E.; Porcellini, Antonio; Avvedimento, Enrico V.
2017-01-01
Genome-wide methylation analysis is limited by its low coverage and the inability to detect single variants below 10%. Quantitative analysis provides accurate information on the extent of methylation of single CpG dinucleotide, but it does not measure the actual polymorphism of the methylation profiles of single molecules. To understand the polymorphism of DNA methylation and to decode the methylation signatures before and after DNA damage and repair, we have deep sequenced in bisulfite-treated DNA a reporter gene undergoing site-specific DNA damage and homologous repair. In this paper, we provide information on the data generation, the rationale for the experiments and the type of assays used, such as cytofluorimetry and immunoblot data derived during a previous work published in Scientific Reports, describing the methylation and expression changes of a model gene (GFP) before and after formation of a double-strand break and repair by homologous-recombination or non-homologous-end-joining. These data provide: 1) a reference for the analysis of methylation polymorphism at selected loci in complex cell populations; 2) a platform and the tools to compare transcription and methylation profiles. PMID:28398335
Evolution of the F-Box Gene Family in Euarchontoglires: Gene Number Variation and Selection Patterns
Wang, Ailan; Fu, Mingchuan; Jiang, Xiaoqian; Mao, Yuanhui; Li, Xiangchen; Tao, Shiheng
2014-01-01
F-box proteins are substrate adaptors used by the SKP1–CUL1–F-box protein (SCF) complex, a type of E3 ubiquitin ligase complex in the ubiquitin proteasome system (UPS). SCF-mediated ubiquitylation regulates proteolysis of hundreds of cellular proteins involved in key signaling and disease systems. However, our knowledge of the evolution of the F-box gene family in Euarchontoglires is limited. In the present study, 559 F-box genes and nine related pseudogenes were identified in eight genomes. Lineage-specific gene gain and loss events occurred during the evolution of Euarchontoglires, resulting in varying F-box gene numbers ranging from 66 to 81 among the eight species. Both tandem duplication and retrotransposition were found to have contributed to the increase of F-box gene number, whereas mutation in the F-box domain was the main mechanism responsible for reduction in the number of F-box genes, resulting in a balance of expansion and contraction in the F-box gene family. Thus, the Euarchontoglire F-box gene family evolved under a birth-and-death model. Signatures of positive selection were detected in substrate-recognizing domains of multiple F-box proteins, and adaptive changes played a role in evolution of the Euarchontoglire F-box gene family. In addition, single nucleotide polymorphism (SNP) distributions were found to be highly non-random among different regions of F-box genes in 1092 human individuals, with domain regions having a significantly lower number of non-synonymous SNPs. PMID:24727786
Human genetics of infectious diseases: a unified theory
Casanova, Jean-Laurent; Abel, Laurent
2007-01-01
Since the early 1950s, the dominant paradigm in the human genetics of infectious diseases postulates that rare monogenic immunodeficiencies confer vulnerability to multiple infectious diseases (one gene, multiple infections), whereas common infections are associated with the polygenic inheritance of multiple susceptibility genes (one infection, multiple genes). Recent studies, since 1996 in particular, have challenged this view. A newly recognised group of primary immunodeficiencies predisposing the individual to a principal or single type of infection is emerging. In parallel, several common infections have been shown to reflect the inheritance of one major susceptibility gene, at least in some populations. This novel causal relationship (one gene, one infection) blurs the distinction between patient-based Mendelian genetics and population-based complex genetics, and provides a unified conceptual frame for exploring the molecular genetic basis of infectious diseases in humans. PMID:17255931
Co-expression networks reveal the tissue-specific regulation of transcription and splicing
Saha, Ashis; Kim, Yungil; Gewirtz, Ariel D.H.; Jo, Brian; Gao, Chuan; McDowell, Ian C.; Engelhardt, Barbara E.
2017-01-01
Gene co-expression networks capture biologically important patterns in gene expression data, enabling functional analyses of genes, discovery of biomarkers, and interpretation of genetic variants. Most network analyses to date have been limited to assessing correlation between total gene expression levels in a single tissue or small sets of tissues. Here, we built networks that additionally capture the regulation of relative isoform abundance and splicing, along with tissue-specific connections unique to each of a diverse set of tissues. We used the Genotype-Tissue Expression (GTEx) project v6 RNA sequencing data across 50 tissues and 449 individuals. First, we developed a framework called Transcriptome-Wide Networks (TWNs) for combining total expression and relative isoform levels into a single sparse network, capturing the interplay between the regulation of splicing and transcription. We built TWNs for 16 tissues and found that hubs in these networks were strongly enriched for splicing and RNA binding genes, demonstrating their utility in unraveling regulation of splicing in the human transcriptome. Next, we used a Bayesian biclustering model that identifies network edges unique to a single tissue to reconstruct Tissue-Specific Networks (TSNs) for 26 distinct tissues and 10 groups of related tissues. Finally, we found genetic variants associated with pairs of adjacent nodes in our networks, supporting the estimated network structures and identifying 20 genetic variants with distant regulatory impact on transcription and splicing. Our networks provide an improved understanding of the complex relationships of the human transcriptome across tissues. PMID:29021288
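For orientation, the baseline that the abstract above contrasts itself with, a network built from correlations between total expression levels, can be sketched as follows. This is not the Transcriptome-Wide Network method; the 0.9 threshold, the gene names and the expression values are assumptions chosen for illustration.

```python
from itertools import combinations
from math import sqrt

def pearson(x, y):
    """Pearson correlation of two equal-length expression vectors."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def coexpression_edges(expression, threshold=0.9):
    """Connect gene pairs whose absolute correlation reaches the threshold."""
    edges = []
    for g1, g2 in combinations(sorted(expression), 2):
        r = pearson(expression[g1], expression[g2])
        if abs(r) >= threshold:
            edges.append((g1, g2, round(r, 3)))
    return edges

# Made-up expression values for three genes across five samples.
expression = {
    "geneA": [1.0, 2.0, 3.0, 4.0, 5.0],
    "geneB": [2.1, 3.9, 6.2, 8.0, 9.9],
    "geneC": [5.0, 1.0, 4.0, 2.0, 3.0],
}
edges = coexpression_edges(expression)
```

Only the geneA-geneB pair passes the threshold here. Capturing relative isoform abundance or tissue-specific edges, as the abstract describes, requires models beyond this pairwise correlation sketch.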
Mechanisms of radiation-induced gene responses
DOE Office of Scientific and Technical Information (OSTI.GOV)
Woloschak, G.E.; Paunesku, T.
1996-10-01
In the process of identifying genes differentially expressed in cells exposed to ultraviolet radiation, we have identified a transcript having a 26-bp region that is highly conserved in a variety of species including Bacillus circulans, yeast, pumpkin, Drosophila, mouse, and man. When in the 5′ region (flanking region or UTR) of a gene, the sequence is predominantly in the +/+ orientation with respect to the coding DNA strand, while in the coding region and the 3′ region (UTR), the sequence is most frequently in the +/- orientation with respect to the coding DNA strand. In two genes, the element is split into two parts; however, in most cases, it is found only once but with a minimum of 11 consecutive nucleotides precisely depicting the original sequence. The element is found in a large number of different genes with diverse functions (from human ras p21 to B. circulans chitinase). Gel shift assays demonstrated the presence of a protein in HeLa cell extracts that binds to the sense and antisense single-stranded consensus oligomers, as well as to the double-stranded oligonucleotide. When the double-stranded oligomer was used, the size shift demonstrated an additional protein-oligomer complex larger than the one bound to either sense or antisense single-stranded consensus oligomers alone. It is speculated either that this element binds to protein(s) important in maintaining DNA in a single-stranded orientation for transcription or, alternatively, that this element is important in the transcription-coupled DNA repair process.
Application of machine learning on brain cancer multiclass classification
NASA Astrophysics Data System (ADS)
Panca, V.; Rustam, Z.
2017-07-01
Classification of brain cancer is a problem of multiclass classification. One approach to solving this problem is to first transform it into several binary problems. The microarray gene expression dataset has the two main characteristics of medical data: a very large number of features (genes) and only a small number of samples. The application of machine learning to a microarray gene expression dataset mainly consists of two steps: feature selection and classification. In this paper, the features are selected using a method based on the support vector machine recursive feature elimination (SVM-RFE) principle, which is improved to solve multiclass classification and is called multiple multiclass SVM-RFE. Instead of using only the selected features on a single classifier, this method combines the results of multiple classifiers. The features are divided into subsets and SVM-RFE is used on each subset. Then, the selected features of each subset are passed to separate classifiers. This method enhances the feature selection ability of each single SVM-RFE. Twin support vector machine (TWSVM) is used as the classifier to reduce computational complexity. While ordinary SVM finds a single optimum hyperplane, the main objective of twin SVM is to find two non-parallel optimum hyperplanes. The experiment on the brain cancer microarray gene expression dataset shows that this method could classify 71.4% of the overall test data correctly, using 100 and 1000 genes selected by the multiple multiclass SVM-RFE feature selection method. Furthermore, the per-class results show that this method could classify data of the normal and MD classes with 100% accuracy.
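The step of turning one multiclass problem into several binary problems can be illustrated with a minimal sketch. The abstract does not say which decomposition the authors used, so a one-vs-rest scheme is assumed here; the function names, the "other" class, and the classifier scores are hypothetical, and no SVM is trained.

```python
def one_vs_rest_labels(labels):
    """Map each class to a binary label vector (1 = that class, 0 = any other)."""
    classes = sorted(set(labels))
    return {c: [1 if y == c else 0 for y in labels] for c in classes}

def predict_one_vs_rest(scores_by_class):
    """Pick the class whose binary classifier reports the highest score."""
    return max(scores_by_class, key=scores_by_class.get)

# "normal" and "MD" are class names from the abstract; "other" is made up.
toy_labels = ["normal", "MD", "other", "MD", "normal"]
binary_problems = one_vs_rest_labels(toy_labels)

# Hypothetical decision scores from three already-trained binary classifiers.
winner = predict_one_vs_rest({"normal": -0.4, "MD": 1.2, "other": 0.1})
```

Each entry of `binary_problems` is a separate two-class training target, and the final prediction is the class with the highest-scoring binary classifier.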
Duchi, Diego; Gryte, Kristofer; Robb, Nicole C; Morichaud, Zakia; Sheppard, Carol; Wigneshweraraj, Sivaramesh
2018-01-01
Transcription initiation is a major step in gene regulation for all organisms. In bacteria, the promoter DNA is first recognized by RNA polymerase (RNAP) to yield an initial closed complex. This complex subsequently undergoes conformational changes resulting in DNA strand separation to form a transcription bubble and an RNAP-promoter open complex; however, the series and sequence of conformational changes, and the factors that influence them are unclear. To address the conformational landscape and transitions in transcription initiation, we applied single-molecule Förster resonance energy transfer (smFRET) on immobilized Escherichia coli transcription open complexes. Our results revealed the existence of two stable states within RNAP–DNA complexes in which the promoter DNA appears to adopt closed and partially open conformations, and we observed large-scale transitions in which the transcription bubble fluctuated between open and closed states; these transitions, which occur roughly on the 0.1 s timescale, are distinct from the millisecond-timescale dynamics previously observed within diffusing open complexes. Mutational studies indicated that the σ70 region 3.2 of the RNAP significantly affected the bubble dynamics. Our results have implications for many steps of transcription initiation, and support a bend-load-open model for the sequence of transitions leading to bubble opening during open complex formation. PMID:29177430
Zhang, Jing; Zhang, Lu; Zhang, Yan; Yang, Jing; Guo, Mengbiao; Sun, Liangdan; Pan, Hai-Feng; Hirankarn, Nattiya; Ying, Dingge; Zeng, Shuai; Lee, Tsz Leung; Lau, Chak Sing; Chan, Tak Mao; Leung, Alexander Moon Ho; Mok, Chi Chiu; Wong, Sik Nin; Lee, Ka Wing; Ho, Marco Hok Kung; Lee, Pamela Pui Wah; Chung, Brian Hon-Yin; Chong, Chun Yin; Wong, Raymond Woon Sing; Mok, Mo Yin; Wong, Wilfred Hing Sang; Tong, Kwok Lung; Tse, Niko Kei Chiu; Li, Xiang-Pei; Avihingsanon, Yingyos; Rianthavorn, Pornpimol; Deekajorndej, Thavatchai; Suphapeetiporn, Kanya; Shotelersuk, Vorasuk; Ying, Shirley King Yee; Fung, Samuel Ka Shun; Lai, Wai Ming; Garcia-Barceló, Maria-Mercè; Cherny, Stacey S; Sham, Pak Chung; Cui, Yong; Yang, Sen; Ye, Dong Qing; Zhang, Xue-Jun; Lau, Yu Lung; Yang, Wanling
2015-11-01
Previous genome-wide association studies (GWAS), which were mainly based on single-variant analysis, have identified many systemic lupus erythematosus (SLE) susceptibility loci. However, the genetic architecture of this complex disease is far from being understood. The aim of this study was to investigate whether using a gene-based analysis may help to identify novel loci, by considering global evidence of association from a gene or a genomic region rather than focusing on evidence for individual variants. Based on the results of a meta-analysis of 2 GWAS of SLE conducted in 2 Asian cohorts, we performed an in-depth gene-based analysis followed by replication in a total of 4,626 patients and 7,466 control subjects of Asian ancestry. Differential allelic expression was measured by pyrosequencing. More than one-half of the reported SLE susceptibility loci showed evidence of independent effects, and this finding is important for understanding the mechanisms of association and explaining disease heritability. ANXA6 was detected as a novel SLE susceptibility gene, with several single-nucleotide polymorphisms (SNPs) contributing independently to the association with disease. The risk allele of rs11960458 correlated significantly with increased expression of ANXA6 in peripheral blood mononuclear cells from heterozygous healthy control subjects. Several other associated SNPs may also regulate ANXA6 expression, according to data obtained from public databases. Higher expression of ANXA6 in patients with SLE was also reported previously. Our study demonstrated the merit of using gene-based analysis to identify novel susceptibility loci, especially those with independent effects, and also demonstrated the widespread presence of loci with independent effects in SLE susceptibility genes. © 2015, American College of Rheumatology.
Macronuclear Genome Sequence of the Ciliate Tetrahymena thermophila, a Model Eukaryote
Eisen, Jonathan A; Coyne, Robert S; Wu, Martin; Wu, Dongying; Thiagarajan, Mathangi; Wortman, Jennifer R; Badger, Jonathan H; Ren, Qinghu; Amedeo, Paolo; Jones, Kristie M; Tallon, Luke J; Delcher, Arthur L; Salzberg, Steven L; Silva, Joana C; Haas, Brian J; Majoros, William H; Farzad, Maryam; Carlton, Jane M; Smith, Roger K; Garg, Jyoti; Pearlman, Ronald E; Karrer, Kathleen M; Sun, Lei; Manning, Gerard; Elde, Nels C; Turkewitz, Aaron P; Asai, David J; Wilkes, David E; Wang, Yufeng; Cai, Hong; Collins, Kathleen; Stewart, B. Andrew; Lee, Suzanne R; Wilamowska, Katarzyna; Weinberg, Zasha; Ruzzo, Walter L; Wloga, Dorota; Gaertig, Jacek; Frankel, Joseph; Tsao, Che-Chia; Gorovsky, Martin A; Keeling, Patrick J; Waller, Ross F; Patron, Nicola J; Cherry, J. Michael; Stover, Nicholas A; Krieger, Cynthia J; del Toro, Christina; Ryder, Hilary F; Williamson, Sondra C; Barbeau, Rebecca A; Hamilton, Eileen P; Orias, Eduardo
2006-01-01
The ciliate Tetrahymena thermophila is a model organism for molecular and cellular biology. Like other ciliates, this species has separate germline and soma functions that are embodied by distinct nuclei within a single cell. The germline-like micronucleus (MIC) has its genome held in reserve for sexual reproduction. The soma-like macronucleus (MAC), which possesses a genome processed from that of the MIC, is the center of gene expression and does not directly contribute DNA to sexual progeny. We report here the shotgun sequencing, assembly, and analysis of the MAC genome of T. thermophila, which is approximately 104 Mb in length and composed of approximately 225 chromosomes. Overall, the gene set is robust, with more than 27,000 predicted protein-coding genes, 15,000 of which have strong matches to genes in other organisms. The functional diversity encoded by these genes is substantial and reflects the complexity of processes required for a free-living, predatory, single-celled organism. This is highlighted by the abundance of lineage-specific duplications of genes with predicted roles in sensing and responding to environmental conditions (e.g., kinases), using diverse resources (e.g., proteases and transporters), and generating structural complexity (e.g., kinesins and dyneins). In contrast to the other lineages of alveolates (apicomplexans and dinoflagellates), no compelling evidence could be found for plastid-derived genes in the genome. UGA, the only T. thermophila stop codon, is used in some genes to encode selenocysteine, thus making this organism the first known with the potential to translate all 64 codons in nuclear genes into amino acids. We present genomic evidence supporting the hypothesis that the excision of DNA from the MIC to generate the MAC specifically targets foreign DNA as a form of genome self-defense. 
The combination of the genome sequence, the functional diversity encoded therein, and the presence of some pathways missing from other model organisms makes T. thermophila an ideal model for functional genomic studies to address biological, biomedical, and biotechnological questions of fundamental importance. PMID:16933976
Duconge, Jorge; Cadilla, Carmen L.; Windemuth, Andreas; Kocherla, Mohan; Gorowski, Krystyna; Seip, Richard L.; Bogaard, Kali; Renta, Jessica Y.; Piovanetti, Paola; D’Agostino, Darrin; Santiago-Borrero, Pedro J.; Ruaño, Gualberto
2010-01-01
Polymorphisms in the cytochrome P450 2C9 (CYP2C9) and vitamin K epoxide reductase complex subunit 1 (VKORC1) genes significantly alter the effective warfarin dose. We determined the frequencies of alleles, single carriers, and double carriers of single nucleotide polymorphisms (SNPs) in the CYP2C9 and VKORC1 genes in a Puerto Rican cohort and gauged the impact of these polymorphisms on warfarin dosage using a published algorithm. A total of 92 DNA samples were genotyped using Luminex® x-MAP technology. The polymorphism frequencies were 6.52%, 5.43% and 28.8% for CYP2C9 *2, *3 and VKORC1-1639 G>A polymorphisms, respectively. The prevalence of combinatorial genotypes was 16% for carriers of both the CYP2C9 and VKORC1 polymorphisms, 9% for carriers of CYP2C9 polymorphisms, 35% for carriers of the VKORC1 polymorphism, and the remaining 40% were non-carriers for either gene. Based on a published warfarin dosing algorithm, single, double and triple carriers of functionally deficient polymorphisms predict reductions of 1.0–1.6, 2.0–2.9, and 2.9–3.7 mg/day, respectively, in warfarin dose. Overall, 60% of the population carried at least a single polymorphism predicting deficient warfarin metabolism or responsiveness and 13% were double carriers with polymorphisms in both genes studied. Combinatorial genotyping of CYP2C9 and VKORC1 can allow for individualized dosing of warfarin among patients with gene polymorphisms, potentially reducing the risk of stroke or bleeding. PMID:20073138
Lim, Byung Chan; Lee, Seungbok; Shin, Jong-Yeon; Kim, Jong-Il; Hwang, Hee; Kim, Ki Joong; Hwang, Yong Seung; Seo, Jeong-Sun; Chae, Jong Hee
2011-11-01
Duchenne muscular dystrophy or Becker muscular dystrophy might be a suitable candidate disease for application of next-generation sequencing in the genetic diagnosis because the complex mutational spectrum and the large size of the dystrophin gene require two or more analytical methods and have a high cost. The authors tested whether large deletions/duplications or small mutations, such as point mutations or short insertions/deletions of the dystrophin gene, could be predicted accurately in a single platform using next-generation sequencing technology. A custom solution-based target enrichment kit was designed to capture whole genomic regions of the dystrophin gene and other muscular-dystrophy-related genes. A multiplexing strategy, wherein four differently bar-coded samples were captured and sequenced together in a single lane of the Illumina Genome Analyser, was applied. The study subjects were 25 patients: 16 with deficient dystrophin expression without a large deletion/duplication and 9 with a known large deletion/duplication. Nearly 100% of the exonic region of the dystrophin gene was covered by at least eight reads with a mean read depth of 107. Pathogenic small mutations were identified in 15 of the 16 patients without a large deletion/duplication. Using these 16 patients as the standard, the authors' method accurately predicted the deleted or duplicated exons in the 9 patients with known mutations. Inclusion of non-coding regions and paired-end sequence analysis enabled accurate identification by increasing the read depth and providing information about the breakpoint junction. The current method has an advantage for the genetic diagnosis of Duchenne muscular dystrophy and Becker muscular dystrophy wherein a comprehensive mutational search may be feasible using a single platform.
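The two coverage statistics quoted above (the fraction of positions covered by at least eight reads, and the mean read depth) can be computed from per-base depths as in the sketch below. The depth values are hypothetical and are not data from the study; the function name and the `min_reads` parameter are assumptions made for illustration.

```python
def coverage_summary(depths, min_reads=8):
    """Return (fraction of positions with depth >= min_reads, mean depth)."""
    if not depths:
        raise ValueError("depths must not be empty")
    covered = sum(1 for d in depths if d >= min_reads)
    return covered / len(depths), sum(depths) / len(depths)

# Hypothetical per-base depths for a short exonic stretch.
toy_depths = [120, 95, 110, 7, 130, 101, 88, 150, 99, 100]
fraction_covered, mean_depth = coverage_summary(toy_depths)
```

In this toy input one position falls below the eight-read cutoff, so nine of the ten positions count as covered.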
MIDAS: A Modular DNA Assembly System for Synthetic Biology.
van Dolleweerd, Craig J; Kessans, Sarah A; Van de Bittner, Kyle C; Bustamante, Leyla Y; Bundela, Rudranuj; Scott, Barry; Nicholson, Matthew J; Parker, Emily J
2018-04-20
A modular and hierarchical DNA assembly platform for synthetic biology based on Golden Gate (Type IIS restriction enzyme) cloning is described. This enabling technology, termed MIDAS (for Modular Idempotent DNA Assembly System), can be used to precisely assemble multiple DNA fragments in a single reaction using a standardized assembly design. It can be used to build genes from libraries of sequence-verified, reusable parts and to assemble multiple genes in a single vector, with full user control over gene order and orientation, as well as control of the direction of growth (polarity) of the multigene assembly, a feature that allows genes to be nested between other genes or genetic elements. We describe the detailed design and use of MIDAS, exemplified by the reconstruction, in the filamentous fungus Penicillium paxilli, of the metabolic pathway for production of paspaline and paxilline, key intermediates in the biosynthesis of a range of indole diterpenes, a class of secondary metabolites produced by several species of filamentous fungi. MIDAS was used to efficiently assemble a 25.2 kb plasmid from 21 different modules (seven genes, each composed of three basic parts). By using a parts library-based system for construction of complex assemblies, and a unique set of vectors, MIDAS can provide a flexible route to assembling tailored combinations of genes and other genetic elements, thereby supporting synthetic biology applications in a wide range of expression hosts.
Rai1 duplication causes physical and behavioral phenotypes in a mouse model of dup(17)(p11.2p11.2)
Walz, Katherina; Paylor, Richard; Yan, Jiong; Bi, Weimin; Lupski, James R.
2006-01-01
Genomic disorders are conditions that result from DNA rearrangements, such as deletions or duplications. The identification of the dosage-sensitive gene(s) within the rearranged genomic interval is important for the elucidation of genes responsible for complex neurobehavioral phenotypes. Smith-Magenis syndrome is associated with a 3.7-Mb deletion in 17p11.2, and its clinical presentation is caused by retinoic acid inducible 1 (RAI1) haploinsufficiency. The reciprocal microduplication syndrome, dup(17)(p11.2p11.2), manifests several neurobehavioral abnormalities, but the responsible dosage-sensitive gene(s) remain undefined. We previously generated a mouse model for dup(17)(p11.2p11.2), Dp(11)17/+, that recapitulated most of the phenotypes observed in human patients. We have now analyzed compound heterozygous mice carrying a duplication [Dp(11)17] in one chromosome 11 along with a null allele of Rai1 in the other chromosome 11 homologue [Dp(11)17/Rai1– mice] in order to study the relationship between Rai1 gene copy number and the Dp(11)17/+ phenotypes. Normal disomic Rai1 gene dosage was sufficient to rescue the complex physical and behavioral phenotypes observed in Dp(11)17/+ mice, despite altered trisomic copy number of the other 18 genes present in the rearranged genomic interval. These data provide a model for variation in copy number of single genes that could influence common traits such as obesity and behavior. PMID:17024248
Zheng, Kaijie; Tian, Hainan; Hu, Qingnan; Guo, Hongyan; Yang, Li; Cai, Ling; Wang, Xutong; Liu, Bao; Wang, Shucai
2016-01-01
In Arabidopsis, a MYB-bHLH-WD40 (MBW) transcriptional activator complex activates the homeodomain protein gene GLABRA2 (GL2), leading to the promotion of trichome formation and inhibition of root hair formation. The same MBW complex also activates single-repeat R3 MYB genes. R3 MYBs, in turn, play a negative feedback role by competing with R2R3 MYB proteins for binding bHLH proteins, thus blocking the formation of the MBW complex. By BLASTing the rice (Oryza sativa) protein database using the entire amino acid sequence of the Arabidopsis R3 MYB transcription factor TRICHOMELESS1 (TCL1), we found that there are two genes in the rice genome encoding R3 MYB transcription factors, namely Oryza sativa TRICHOMELESS1 (OsTCL1) and OsTCL2. Expressing OsTCL1 in Arabidopsis inhibited trichome formation and promoted root hair formation, and OsTCL1 interacted with GL3 when tested in Arabidopsis protoplasts. Consistent with these observations, expression levels of GL2, the R2R3 MYB transcription factor gene GLABRA1 (GL1) and several R3 MYB genes were greatly reduced, indicating that OsTCL1 is a functional R3 MYB. However, trichome and root hair formation in transgenic rice plants overexpressing OsTCL1 remained largely unchanged, and elevated expression of OsGL2 was observed in the transgenic rice plants, indicating that rice may use different mechanisms to regulate trichome formation. PMID:26758286
A linear framework for time-scale separation in nonlinear biochemical systems.
Gunawardena, Jeremy
2012-01-01
Cellular physiology is implemented by formidably complex biochemical systems with highly nonlinear dynamics, presenting a challenge for both experiment and theory. Time-scale separation has been one of the few theoretical methods for distilling general principles from such complexity. It has provided essential insights in areas such as enzyme kinetics, allosteric enzymes, G-protein coupled receptors, ion channels, gene regulation and post-translational modification. In each case, internal molecular complexity has been eliminated, leading to rational algebraic expressions among the remaining components. This has yielded familiar formulas such as those of Michaelis-Menten in enzyme kinetics, Monod-Wyman-Changeux in allostery and Ackers-Johnson-Shea in gene regulation. Here we show that these calculations are all instances of a single graph-theoretic framework. Despite the biochemical nonlinearity to which it is applied, this framework is entirely linear, yet requires no approximation. We show that elimination of internal complexity is feasible when the relevant graph is strongly connected. The framework provides a new methodology with the potential to subdue combinatorial explosion at the molecular level.
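As a small worked instance of how a linear, graph-based steady-state calculation can reproduce a familiar nonlinear formula, the sketch below treats the enzyme as a two-vertex graph (free E and bound ES) and recovers the Michaelis-Menten rate. This is a textbook quasi-steady-state example written for illustration, not the paper's general construction; the rate-constant names `k_on`, `k_off`, `k_cat` and the numerical values are assumptions.

```python
def bound_fraction(s, k_on, k_off, k_cat):
    """Steady-state fraction of enzyme in the ES state for a two-vertex graph."""
    label_e_to_es = k_on * s        # edge E -> ES, substrate folded into the label
    label_es_to_e = k_off + k_cat   # edge ES -> E: dissociation plus catalysis
    # With two vertices, each vertex's steady-state weight is the label of the
    # single edge pointing into it; normalising gives the occupied fraction.
    return label_e_to_es / (label_e_to_es + label_es_to_e)

def rate_from_graph(s, e_total, k_on, k_off, k_cat):
    """Product formation rate computed from the graph steady state."""
    return k_cat * e_total * bound_fraction(s, k_on, k_off, k_cat)

def michaelis_menten(s, e_total, k_on, k_off, k_cat):
    """Classical form v = Vmax * S / (Km + S) with Km = (k_off + k_cat) / k_on."""
    k_m = (k_off + k_cat) / k_on
    return k_cat * e_total * s / (k_m + s)

# Hypothetical parameters chosen so that Km = 2.0 and Vmax = 3.0.
params = dict(e_total=1.0, k_on=2.0, k_off=1.0, k_cat=3.0)
v_graph = rate_from_graph(2.0, **params)
v_formula = michaelis_menten(2.0, **params)
```

The dynamics on the graph are linear in the enzyme states because the substrate concentration is absorbed into an edge label; at a substrate level equal to Km, both routes give half of the maximal rate.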
2013-01-01
Background Histone methyltransferase enhancer of zeste homologue 2 (EZH2) forms an obligate repressive complex with suppressor of zeste 12 and embryonic ectoderm development, which is thought, along with EZH1, to be primarily responsible for mediating Polycomb-dependent gene silencing. Polycomb-mediated repression influences gene expression across the entire gamut of biological processes, including development, differentiation and cellular proliferation. Deregulation of EZH2 expression is implicated in numerous complex human diseases. To date, most EZH2-mediated function has been primarily ascribed to a single protein product of the EZH2 locus. Results We report that the EZH2 locus undergoes alternative splicing to yield at least two structurally and functionally distinct EZH2 methyltransferases. The longest protein encoded by this locus is the conventional enzyme, which we refer to as EZH2α, whereas EZH2β, characterized here, represents a novel isoform. We find that EZH2β localizes to the cell nucleus, complexes with embryonic ectoderm development and suppressor of zeste 12, trimethylates histone 3 at lysine 27, and mediates silencing of target promoters. At the cell biological level, we find that increased EZH2β induces cell proliferation, demonstrating that this protein is functional in the regulation of processes previously attributed to EZH2α. Biochemically, through the use of genome-wide expression profiling, we demonstrate that EZH2β governs a pattern of gene repression that is often ontologically redundant from that of EZH2α, but also divergent for a wide variety of specific target genes. Conclusions Combined, these results demonstrate that an expanded repertoire of EZH2 writers can modulate histone code instruction during histone 3 lysine 27-mediated gene silencing. 
These data support the notion that the regulation of EZH2-mediated gene silencing is more complex than previously anticipated and should guide the design and interpretation of future studies aimed at understanding the biochemical and biological roles of this important family of epigenomic regulators. PMID:23448518
Rong, Junkang; Feltus, F. Alex; Waghmare, Vijay N.; Pierce, Gary J.; Chee, Peng W.; Draye, Xavier; Saranga, Yehoshua; Wright, Robert J.; Wilkins, Thea A.; May, O. Lloyd; Smith, C. Wayne; Gannaway, John R.; Wendel, Jonathan F.; Paterson, Andrew H.
2007-01-01
QTL mapping experiments yield heterogeneous results due to the use of different genotypes, environments, and sampling variation. Compilation of QTL mapping results yields a more complete picture of the genetic control of a trait and reveals patterns in organization of trait variation. A total of 432 QTL mapped in one diploid and 10 tetraploid interspecific cotton populations were aligned using a reference map and depicted in a CMap resource. Early demonstrations that genes from the non-fiber-producing diploid ancestor contribute to tetraploid lint fiber genetics gain further support from multiple populations and environments and advanced-generation studies detecting QTL of small phenotypic effect. Both tetraploid subgenomes contribute QTL at largely non-homeologous locations, suggesting divergent selection acting on many corresponding genes before and/or after polyploid formation. QTL correspondence across studies was only modest, suggesting that additional QTL for the target traits remain to be discovered. Crosses between closely-related genotypes differing by single-gene mutants yield profoundly different QTL landscapes, suggesting that fiber variation involves a complex network of interacting genes. Members of the lint fiber development network appear clustered, with cluster members showing heterogeneous phenotypic effects. Meta-analysis linked to synteny-based and expression-based information provides clues about specific genes and families involved in QTL networks. PMID:17565937
Complexity and specificity of the maize (Zea mays L.) root hair transcriptome.
Hey, Stefan; Baldauf, Jutta; Opitz, Nina; Lithio, Andrew; Pasha, Asher; Provart, Nicholas; Nettleton, Dan; Hochholdinger, Frank
2017-04-01
Root hairs are tubular extensions of epidermis cells. Transcriptome profiling demonstrated that the single cell-type root hair transcriptome was less complex than the transcriptome of multiple cell-type primary roots without root hairs. In total, 831 genes were exclusively and 5585 genes were preferentially expressed in root hairs [false discovery rate (FDR) ≤1%]. Among those, the most significantly enriched Gene Ontology (GO) functional terms were related to energy metabolism, highlighting the high energy demand for the development and function of root hairs. Subsequently, the maize homologs for 138 Arabidopsis genes known to be involved in root hair development were identified and their phylogenetic relationship and expression in root hairs were determined. This study indicated that the genetic regulation of root hair development in Arabidopsis and maize is controlled by common genes, but also shows differences which need to be dissected in future genetic experiments. Finally, a maize root view of the eFP browser was implemented including the root hair transcriptome of the present study and several previously published maize root transcriptome data sets. The eFP browser provides color-coded expression levels for these root types and tissues for any gene of interest, thus providing a novel resource to study gene expression and function in maize roots. © The Author 2017. Published by Oxford University Press on behalf of the Society for Experimental Biology.
Mutations in the ABCA4 (ABCR) gene are the major cause of autosomal recessive cone-rod dystrophy.
Maugeri, A; Klevering, B J; Rohrschneider, K; Blankenagel, A; Brunner, H G; Deutman, A F; Hoyng, C B; Cremers, F P
2000-10-01
The photoreceptor cell-specific ATP-binding cassette transporter gene (ABCA4; previously denoted "ABCR") is mutated in most patients with autosomal recessive (AR) Stargardt disease (STGD1) or fundus flavimaculatus (FFM). In addition, a few cases with AR retinitis pigmentosa (RP) and AR cone-rod dystrophy (CRD) have been found to have ABCA4 mutations. To evaluate the importance of the ABCA4 gene as a cause of AR CRD, we selected 5 patients with AR CRD and 15 patients from Germany and The Netherlands with isolated CRD. Single-strand conformation-polymorphism analysis and sequencing revealed 19 ABCA4 mutations in 13 (65%) of 20 patients. In six patients, mutations were identified in both ABCA4 alleles; in seven patients, mutations were detected in one allele. One complex ABCA4 allele (L541P;A1038V) was found exclusively in German patients with CRD; one patient carried this complex allele homozygously, and five others were compound heterozygous. These findings suggest that mutations in the ABCA4 gene are the major cause of AR CRD. A primary role of the ABCA4 gene in STGD1/FFM and AR CRD, together with the gene's involvement in an as-yet-unknown proportion of cases with AR RP, strengthens the idea that mutations in the ABCA4 gene could be the most frequent cause of inherited retinal dystrophy in humans.
Massively parallel nanowell-based single-cell gene expression profiling.
Goldstein, Leonard D; Chen, Ying-Jiun Jasmine; Dunne, Jude; Mir, Alain; Hubschle, Hermann; Guillory, Joseph; Yuan, Wenlin; Zhang, Jingli; Stinson, Jeremy; Jaiswal, Bijay; Pahuja, Kanika Bajaj; Mann, Ishminder; Schaal, Thomas; Chan, Leo; Anandakrishnan, Sangeetha; Lin, Chun-Wah; Espinoza, Patricio; Husain, Syed; Shapiro, Harris; Swaminathan, Karthikeyan; Wei, Sherry; Srinivasan, Maithreyan; Seshagiri, Somasekar; Modrusan, Zora
2017-07-07
Technological advances have enabled transcriptome characterization of cell types at the single-cell level, providing new biological insights. New methods that enable simple yet high-throughput single-cell expression profiling are highly desirable. Here we report a novel nanowell-based single-cell RNA sequencing system, ICELL8, which enables processing of thousands of cells per sample. The system employs a 5,184-nanowell-containing microchip to capture ~1,300 single cells and process them. Each nanowell contains preprinted oligonucleotides encoding poly-d(T), a unique well barcode, and a unique molecular identifier. The ICELL8 system uses imaging software to identify nanowells containing viable single cells and only wells with single cells are processed into sequencing libraries. Here, we report the performance and utility of ICELL8 using samples of increasing complexity from cultured cells to mouse solid tissue samples. Our assessment of the system to discriminate between mixed human and mouse cells showed that ICELL8 has a low cell multiplet rate (< 3%) and low cross-cell contamination. We characterized single-cell transcriptomes of more than a thousand cultured human and mouse cells as well as 468 mouse pancreatic islet cells. We were able to identify distinct cell types in pancreatic islets, including alpha, beta, delta and gamma cells. Overall, ICELL8 provides efficient and cost-effective single-cell expression profiling of thousands of cells, allowing researchers to decipher single-cell transcriptomes within complex biological samples.
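The role of the well barcode and the unique molecular identifier (UMI) can be illustrated with a minimal counting sketch: reads are grouped by well and gene, and reads sharing a UMI are counted as one molecule. The abstract does not describe the ICELL8 analysis pipeline, so this is only the generic idea with exact UMI matching and no sequencing-error correction; the barcode sequences, UMIs, and gene labels are illustrative.

```python
from collections import defaultdict

def count_molecules(reads):
    """Count distinct UMIs per (well barcode, gene), collapsing duplicate reads."""
    umis = defaultdict(set)
    for well_barcode, umi, gene in reads:
        umis[(well_barcode, gene)].add(umi)
    return {key: len(values) for key, values in umis.items()}

# Hypothetical reads given as (well barcode, UMI, gene).
toy_reads = [
    ("ACGT", "AAAA", "Ins1"),
    ("ACGT", "AAAA", "Ins1"),  # same well, gene and UMI: counted once
    ("ACGT", "CCCC", "Ins1"),
    ("TTGA", "AAAA", "Gcg"),
]
counts = count_molecules(toy_reads)
```

Counting UMIs rather than raw reads keeps amplification duplicates from inflating the expression estimate for a well.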
Biogenesis of the yeast cytochrome bc1 complex.
Zara, Vincenzo; Conte, Laura; Trumpower, Bernard L
2009-01-01
The mitochondrial respiratory chain is composed of four different protein complexes that cooperate in electron transfer and proton pumping across the inner mitochondrial membrane. The cytochrome bc1 complex, or complex III, is a component of the mitochondrial respiratory chain. This review will focus on the biogenesis of the bc1 complex in the mitochondria of the yeast Saccharomyces cerevisiae. In wild type yeast mitochondrial membranes the major part of the cytochrome bc1 complex was found in association with one or two copies of the cytochrome c oxidase complex. The analysis of several yeast mutant strains in which single genes or pairs of genes encoding bc1 subunits had been deleted revealed the presence of a common set of bc1 sub-complexes. These sub-complexes are represented by the central core of the bc1 complex, consisting of cytochrome b bound to subunit 7 and subunit 8, by the two core proteins associated with each other, by the Rieske protein associated with subunit 9, and by those deriving from the unexpected interaction of each of the two core proteins with cytochrome c1. Furthermore, a higher molecular mass sub-complex is that composed of cytochrome b, cytochrome c1, core protein 1 and 2, subunit 6, subunit 7 and subunit 8. The identification and characterization of all these sub-complexes may help in defining the steps and the molecular events leading to bc1 assembly in yeast mitochondria.
Kiemnec-Tyburczy, Karen M.; Richmond, Jonathan Q.; Savage, Anna E.; Zamudio, Kelly R.
2010-01-01
Genes encoded by the major histocompatibility complex (MHC) play key roles in the vertebrate immune system. However, our understanding of the evolutionary processes and underlying genetic mechanisms shaping these genes is limited in many taxa, including amphibians, a group currently impacted by emerging infectious diseases. To further elucidate the evolution of the MHC in frogs (anurans) and develop tools for population genetics, we surveyed allelic diversity of the MHC class II β1 domain in both genomic and complementary DNA of seven New World species in the genus Rana (Lithobates). To assign locus affiliation to our alleles, we used a "gene walking" technique to obtain intron 2 sequences that flanked MHC class IIβ exon 2. Two distinct intron sequences were recovered, suggesting the presence of at least two class IIβ loci in Rana. We designed a primer pair that successfully amplified an orthologous locus from all seven Rana species. In total, we recovered 13 alleles and documented trans-species polymorphism for four of the alleles. We also found quantitative evidence of selection acting on amino acid residues that are putatively involved in peptide binding and structural stability of the β1 domain of anurans. Our results indicated that primer mismatch can result in polymerase chain reaction (PCR) bias, which influences the number of alleles that are recovered. Using a single locus may minimize PCR bias caused by primer mismatch, and the gene walking technique was an effective approach for generating single-copy orthologous markers necessary for future studies of MHC allelic variation in natural amphibian populations. © 2010 Springer-Verlag.
Circadian Enhancers Coordinate Multiple Phases of Rhythmic Gene Transcription In Vivo
Fang, Bin; Everett, Logan J.; Jager, Jennifer; Briggs, Erika; Armour, Sean M.; Feng, Dan; Roy, Ankur; Gerhart-Hines, Zachary; Sun, Zheng; Lazar, Mitchell A.
2014-01-01
SUMMARY Mammalian transcriptomes display complex circadian rhythms with multiple phases of gene expression that cannot be accounted for by current models of the molecular clock. We have determined the underlying mechanisms by measuring nascent RNA transcription around the clock in mouse liver. Unbiased examination of eRNAs that cluster in specific circadian phases identified functional enhancers driven by distinct transcription factors (TFs). We further identify on a global scale the components of the TF cistromes that function to orchestrate circadian gene expression. Integrated genomic analyses also revealed novel mechanisms by which a single circadian factor controls opposing transcriptional phases. These findings shed new light on the diversity and specificity of TF function in the generation of multiple phases of circadian gene transcription in a mammalian organ. PMID:25416951
Metagenomic discovery of biomass-degrading genes and genomes from cow rumen.
Hess, Matthias; Sczyrba, Alexander; Egan, Rob; Kim, Tae-Wan; Chokhawala, Harshal; Schroth, Gary; Luo, Shujun; Clark, Douglas S; Chen, Feng; Zhang, Tao; Mackie, Roderick I; Pennacchio, Len A; Tringe, Susannah G; Visel, Axel; Woyke, Tanja; Wang, Zhong; Rubin, Edward M
2011-01-28
The paucity of enzymes that efficiently deconstruct plant polysaccharides represents a major bottleneck for industrial-scale conversion of cellulosic biomass into biofuels. Cow rumen microbes specialize in degradation of cellulosic plant material, but most members of this complex community resist cultivation. To characterize biomass-degrading genes and genomes, we sequenced and analyzed 268 gigabases of metagenomic DNA from microbes adherent to plant fiber incubated in cow rumen. From these data, we identified 27,755 putative carbohydrate-active genes and expressed 90 candidate proteins, of which 57% were enzymatically active against cellulosic substrates. We also assembled 15 uncultured microbial genomes, which were validated by complementary methods including single-cell genome sequencing. These data sets provide a substantially expanded catalog of genes and genomes participating in the deconstruction of cellulosic biomass.
Circadian enhancers coordinate multiple phases of rhythmic gene transcription in vivo.
Fang, Bin; Everett, Logan J; Jager, Jennifer; Briggs, Erika; Armour, Sean M; Feng, Dan; Roy, Ankur; Gerhart-Hines, Zachary; Sun, Zheng; Lazar, Mitchell A
2014-11-20
Mammalian transcriptomes display complex circadian rhythms with multiple phases of gene expression that cannot be accounted for by current models of the molecular clock. We have determined the underlying mechanisms by measuring nascent RNA transcription around the clock in mouse liver. Unbiased examination of enhancer RNAs (eRNAs) that cluster in specific circadian phases identified functional enhancers driven by distinct transcription factors (TFs). We further identify on a global scale the components of the TF cistromes that function to orchestrate circadian gene expression. Integrated genomic analyses also revealed mechanisms by which a single circadian factor controls opposing transcriptional phases. These findings shed light on the diversity and specificity of TF function in the generation of multiple phases of circadian gene transcription in a mammalian organ.
Cloning and expression of the gene for bacteriophage T7 RNA polymerase
Studier, F.W.; Davanloo, P.; Rosenberg, A.H.
1984-03-30
This application describes a means to clone a functional gene for bacteriophage T7 RNA polymerase. Active T7 RNA polymerase is produced from the cloned gene, and a plasmid has been constructed that can produce the active enzyme in large amounts. T7 RNA polymerase transcribes DNA very efficiently and is highly selective for a relatively long promoter sequence. This enzyme is useful for synthesizing large amounts of RNA in vivo or in vitro, and is capable of producing a single RNA selectively from a complex mixture of DNAs. The procedure used to obtain a clone of the T7 RNA polymerase gene can be applied to other T7-like phages to obtain clones that produce RNA polymerases having different promoter specificities, different bacterial hosts, or other desirable properties.
Tähtiharju, Sari; Rijpkema, Anneke S; Vetterli, Adrien; Albert, Victor A; Teeri, Teemu H; Elomaa, Paula
2012-04-01
Plant-specific TCP domain transcription factors have been shown to regulate morphological novelties during plant evolution, including the complex architecture of the Asteraceae inflorescence that involves different types of flowers. We conducted comparative analysis of the CYCLOIDEA/TEOSINTE BRANCHED1 (CYC/TB1) gene family in Gerbera hybrida (gerbera) and Helianthus annuus (sunflower), two species that represent distant tribes within Asteraceae. Our data confirm that the CYC/TB1 gene family has expanded in Asteraceae, a condition that appears to be connected with the increased developmental complexity and evolutionary success of this large plant family. Phylogenetic analysis of the CYC/TB1 gene family revealed both shared and lineage-specific duplications in gerbera and sunflower, corresponding to the three gene lineages previously identified as specific to core eudicots: CYC1, CYC2, and CYC3. Expression analyses of early stages of flower primordia development indicated that especially within the CYC2 clade, with the greatest number of secondary gene duplications, gene expression patterns are conserved between the species and associated with flower and inflorescence development. All sunflower and gerbera CYC2 clade genes showed differential expression between developing flower types, being upregulated in marginal ray (and trans) flowers. One gene in gerbera (GhCYC3) and two in sunflower (HaCYC2d and HaCYC2c) were indicated to be strong candidates as regulators of ray flower identity, a function that is specific for Asteraceae. Our data further showed that other CYC2 clade genes are likely to have more specialized functions at the level of single flowers, including the late functions in floral reproductive organs that may be more conserved across plant families. The expression patterns of CYC1 and CYC3 clade genes showed more differences between the two species but still pointed to possible conserved functions during vegetative plant development. 
Pairwise protein-protein interaction assays gave the first molecular evidence that CYC/TB1-like proteins function in complexes. Compared with sunflower, the gerbera proteins showed higher capacity for dimerization, between as well as within CYC clades. Our data from two distant species within the Asteraceae suggest that the expansion and the apparent conservation of especially the CYC2 clade CYC/TB1-like genes are associated with the evolution of the increased complexity of the Asteraceae inflorescence architecture.
Yu, Simei; Jordán-Pla, Antonio; Gañez-Zapater, Antoni; Jain, Shruti; Rolicka, Anna; Östlund Farrants, Ann-Kristin; Visa, Neus
2018-05-31
SWI/SNF complexes associate with genes and regulate transcription by altering the chromatin at the promoter. It has recently been shown that these complexes play a role in pre-mRNA processing by associating at alternative splice sites. Here, we show that SWI/SNF complexes are involved also in pre-mRNA 3' end maturation by facilitating 3' end cleavage of specific pre-mRNAs. Comparative proteomics show that SWI/SNF ATPases interact physically with subunits of the cleavage and polyadenylation complexes in fly and human cells. In Drosophila melanogaster, the SWI/SNF ATPase Brahma (dBRM) interacts with the CPSF6 subunit of cleavage factor I. We have investigated the function of dBRM in 3' end formation in S2 cells by RNA interference, single-gene analysis and RNA sequencing. Our data show that dBRM facilitates pre-mRNA cleavage in two different ways: by promoting the association of CPSF6 to the cleavage region and by stabilizing positioned nucleosomes downstream of the cleavage site. These findings show that SWI/SNF complexes play a role also in the cleavage of specific pre-mRNAs in animal cells.
Charles, J. P.; Chihara, C.; Nejad, S.; Riddiford, L. M.
1997-01-01
A 36-kb genomic DNA segment of the Drosophila melanogaster genome containing 12 clustered cuticle genes has been mapped and partially sequenced. The cluster maps at 65A 5-6 on the left arm of the third chromosome, in agreement with the previously determined location of a putative cluster encompassing the genes for the third instar larval cuticle proteins LCP5, LCP6 and LCP8. This cluster is the largest cuticle gene cluster discovered to date and shows a number of surprising features that explain in part the genetic complexity of the LCP5, LCP6 and LCP8 loci. The genes encoding LCP5 and LCP8 are multiple copy genes and the presence of extensive similarity in their coding regions gives the first evidence for gene conversion in cuticle genes. In addition, five genes in the cluster are intronless. Four of these five have arisen by retroposition. The other genes in the cluster have a single intron located at an unusual location for insect cuticle genes. PMID:9383064
Gaj, Thomas; Staahl, Brett T; Rodrigues, Gonçalo M C; Limsirichai, Prajit; Ekman, Freja K; Doudna, Jennifer A; Schaffer, David V
2017-06-20
Realizing the full potential of genome editing requires the development of efficient and broadly applicable methods for delivering programmable nucleases and donor templates for homology-directed repair (HDR). The RNA-guided Cas9 endonuclease can be introduced into cells as a purified protein in complex with a single guide RNA (sgRNA). Such ribonucleoproteins (RNPs) can facilitate the high-fidelity introduction of single-base substitutions via HDR following co-delivery with a single-stranded DNA oligonucleotide. However, combining RNPs with transgene-containing donor templates for targeted gene addition has proven challenging, which in turn has limited the capabilities of the RNP-mediated genome editing toolbox. Here, we demonstrate that combining RNP delivery with naturally recombinogenic adeno-associated virus (AAV) donor vectors enables site-specific gene insertion by homology-directed genome editing. Compared to conventional plasmid-based expression vectors and donor templates, we show that combining RNP and AAV donor delivery increases the efficiency of gene addition by up to 12-fold, enabling the creation of lineage reporters that can be used to track the conversion of striatal neurons from human fibroblasts in real time. These results thus illustrate the potential for unifying nuclease protein delivery with AAV donor vectors for homology-directed genome editing. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Hendrischk, Anne-Kathrin; Frühwirth, Sebastian Walter; Moldt, Julia; Pokorny, Richard; Metz, Sebastian; Kaiser, Gebhard; Jäger, Andreas; Batschauer, Alfred; Klug, Gabriele
2009-11-01
Blue light receptors belonging to the cryptochrome/photolyase family are found in all kingdoms of life. The functions of photolyases in repair of UV-damaged DNA as well as of cryptochromes in the light-dependent regulation of photomorphogenetic processes and in the circadian clock in plants and animals are well analysed. In prokaryotes, the only role of members of this protein family that could be demonstrated is DNA repair. Recently, we identified a gene for a cryptochrome-like protein (CryB) in the alpha-proteobacterium Rhodobacter sphaeroides. The protein lacks the typical C-terminal extension of cryptochromes, and is not related to the Cry DASH family. Here we demonstrate that CryB binds flavin adenine dinucleotide that can be photoreduced by blue light. CryB binds single-stranded DNA with very high affinity (Kd approximately 10^-8 M) but double-stranded DNA and single-stranded RNA with far lower affinity (Kd approximately 10^-6 M). Despite this, no in vitro repair activity for pyrimidine dimers in single-stranded DNA could be detected. However, we show that CryB clearly affects the expression of genes for pigment-binding proteins and consequently the amount of photosynthetic complexes in R. sphaeroides. Thus, for the first time a role of a bacterial cryptochrome in gene regulation together with a biological function is demonstrated.
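To put the two reported dissociation constants in perspective, the sketch below compares expected occupancy at a single nucleic acid concentration. It assumes a simple one-site binding model with the ligand in excess over protein, which the abstract does not state; the function name and the chosen concentration are illustrative only.

```python
def fraction_bound(ligand_molar, kd_molar):
    """One-site binding: fraction of protein bound at a given free ligand concentration."""
    return ligand_molar / (kd_molar + ligand_molar)


KD_SSDNA = 1e-8  # approximate Kd for single-stranded DNA, from the abstract
KD_DSDNA = 1e-6  # approximate Kd for double-stranded DNA and ssRNA, from the abstract

ligand = 1e-8  # 10 nM, an arbitrary concentration chosen for illustration

ss_bound = fraction_bound(ligand, KD_SSDNA)  # half-saturated when ligand equals Kd
ds_bound = fraction_bound(ligand, KD_DSDNA)  # roughly 1% bound under the same conditions
```

Under this simplified model, a hundredfold difference in Kd means the protein is half-occupied by single-stranded DNA at a concentration where double-stranded DNA occupies only about 1% of it.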
Besmer, Eva; Market, Eleonora; Papavasiliou, F. Nina
2006-01-01
Activation-induced cytidine deaminase (AID) is a single-stranded DNA deaminase required for somatic hypermutation of immunoglobulin (Ig) genes, a key process in the development of adaptive immunity. Transcription provides a single-stranded DNA substrate for AID, both in vivo and in vitro. We present here an assay which can faithfully replicate all of the molecular features of the initiation of hypermutation of Ig genes in vivo. In this assay, which detects AID-mediated deamination in the context of transcription by Escherichia coli RNA polymerase, deamination targets either strand and declines in efficiency as the distance from the promoter increases. We show that AID binds DNA exposed by the transcribing polymerase, implicating the polymerase itself as the vehicle which distributes AID on DNA as it moves away from the promoter. PMID:16705187
Novel approaches in function-driven single-cell genomics.
Doud, Devin F R; Woyke, Tanja
2017-07-01
Deeper sequencing and improved bioinformatics in conjunction with single-cell and metagenomic approaches continue to illuminate undercharacterized environmental microbial communities. This has propelled the 'who is there, and what might they be doing' paradigm to the uncultivated and has already radically changed the topology of the tree of life and provided key insights into the microbial contribution to biogeochemistry. While characterization of 'who' based on marker genes can describe a large fraction of the community, answering 'what are they doing' remains the elusive pinnacle for microbiology. Function-driven single-cell genomics provides a solution by using a function-based screen to subsample complex microbial communities in a targeted manner for the isolation and genome sequencing of single cells. This enables single-cell sequencing to be focused on cells with specific phenotypic or metabolic characteristics of interest. Recovered genomes are conclusively implicated for both encoding and exhibiting the feature of interest, improving downstream annotation and revealing activity levels within that environment. This emerging approach has already improved our understanding of microbial community functioning and facilitated the experimental analysis of uncharacterized gene product space. Here we provide a comprehensive review of strategies that have been applied for function-driven single-cell genomics and the future directions we envision. © FEMS 2017.
Novel approaches in function-driven single-cell genomics
DOE Office of Scientific and Technical Information (OSTI.GOV)
Doud, Devin F. R.; Woyke, Tanja
Deeper sequencing and improved bioinformatics in conjunction with single-cell and metagenomic approaches continue to illuminate undercharacterized environmental microbial communities. This has propelled the 'who is there, and what might they be doing' paradigm to the uncultivated and has already radically changed the topology of the tree of life and provided key insights into the microbial contribution to biogeochemistry. While characterization of 'who' based on marker genes can describe a large fraction of the community, answering 'what are they doing' remains the elusive pinnacle for microbiology. Function-driven single-cell genomics provides a solution by using a function-based screen to subsample complex microbial communities in a targeted manner for the isolation and genome sequencing of single cells. This enables single-cell sequencing to be focused on cells with specific phenotypic or metabolic characteristics of interest. Recovered genomes are conclusively implicated for both encoding and exhibiting the feature of interest, improving downstream annotation and revealing activity levels within that environment. This emerging approach has already improved our understanding of microbial community functioning and facilitated the experimental analysis of uncharacterized gene product space. Here we provide a comprehensive review of strategies that have been applied for function-driven single-cell genomics and the future directions we envision.
Novel approaches in function-driven single-cell genomics
Doud, Devin F. R.; Woyke, Tanja
2017-06-07
Deeper sequencing and improved bioinformatics in conjunction with single-cell and metagenomic approaches continue to illuminate undercharacterized environmental microbial communities. This has propelled the 'who is there, and what might they be doing' paradigm to the uncultivated and has already radically changed the topology of the tree of life and provided key insights into the microbial contribution to biogeochemistry. While characterization of 'who' based on marker genes can describe a large fraction of the community, answering 'what are they doing' remains the elusive pinnacle for microbiology. Function-driven single-cell genomics provides a solution by using a function-based screen to subsample complex microbial communities in a targeted manner for the isolation and genome sequencing of single cells. This enables single-cell sequencing to be focused on cells with specific phenotypic or metabolic characteristics of interest. Recovered genomes are conclusively implicated for both encoding and exhibiting the feature of interest, improving downstream annotation and revealing activity levels within that environment. This emerging approach has already improved our understanding of microbial community functioning and facilitated the experimental analysis of uncharacterized gene product space. Here we provide a comprehensive review of strategies that have been applied for function-driven single-cell genomics and the future directions we envision.
Sisakhtnezhad, Sajjad; Heshmati, Parvin
2018-07-01
Identifying effective internal factors for regulating germline commitment during development and for maintaining spermatogonial stem cell (SSC) self-renewal is important for understanding the molecular basis of spermatogenesis and for developing new protocols for the production of germline cells from other cell sources. Therefore, this study was designed to investigate single-cell RNA-sequencing data for identification of differentially expressed genes (DEGs) in 12 mouse-derived single SSCs (mSSCs) in comparison with 16 mouse-derived single mesenchymal stem cells. We also aimed to find transcriptional regulators of DEGs. Collectively, 1,584 up-regulated DEGs were identified that are associated with 32 biological processes. Moreover, investigation of the expression profiles of genes involved in spermatogenesis revealed that Dazl, Ddx4, Sall4, Fkbp6, Tex15, Tex19.1, Rnf17, Piwil2, Taf7l, Zbtb16, and Cadm1 are among the first 30 up-regulated DEGs. We also found 12 basal transcription factors (TFs) and three sequence-specific TFs that control the expression of DEGs. Our findings also indicated that MEIS1, SMC3, TAF1, KAT2A, STAT3, GTF3C2, SIN3A, BDP1, PHC1, and EGR1 are the main central regulators of DEGs in mSSCs. In addition, we detected two significant protein complexes in the protein-protein interaction network for DEG regulators. Finally, this study introduces the major upstream kinases for the main central regulators of DEGs and the components of core protein complexes. In conclusion, this study provides a molecular blueprint to uncover the molecular mechanisms behind the biology of SSCs and offers a list of candidate factors for cell type conversion approaches and production of germ cells. © 2017 Wiley Periodicals, Inc.
2012-01-01
Background Chemically mediated prezygotic barriers to reproduction likely play an important role in speciation. In facultatively sexual monogonont rotifers from the Brachionus plicatilis cryptic species complex, mate recognition of females by males is mediated by the Mate Recognition Protein (MRP), a globular glycoprotein on the surface of females, encoded by the mmr-b gene family. In this study, we sequenced mmr-b copies from 27 isolates representing 11 phylotypes of the B. plicatilis species complex, examined the mode of evolution and selection of mmr-b, and determined the relationship between mmr-b genetic distance and mate recognition among isolates. Results Isolates of the B. plicatilis species complex have 1–4 copies of mmr-b, each composed of 2–9 nearly identical tandem repeats. The repeats within a gene copy are generally more similar than are gene copies among phylotypes, suggesting concerted evolution. Compared to housekeeping genes from the same isolates, mmr-b has accumulated only half as many synonymous differences but twice as many non-synonymous differences. Most of the amino acid differences between repeats appear to occur on the outer face of the protein, and these often result in changes in predicted patterns of phosphorylation. However, we found no evidence of positive selection driving these differences. Isolates with the most divergent copies were unable to mate with other isolates and rarely self-crossed. Overall the degree of mate recognition was significantly correlated with the genetic distance of mmr-b. Conclusions Discrimination of compatible mates in the B. plicatilis species complex is determined by proteins encoded by closely related copies of a single gene, mmr-b. While concerted evolution of the tandem repeats in mmr-b may function to maintain identity, it can also lead to the rapid spread of a mutation through all copies in the genome and thus to reproductive isolation. 
The mmr-b gene is evolving rapidly, and novel alleles may be maintained and increase in frequency via asexual reproduction. Our analyses indicate that mate recognition, controlled by MMR-B, may drive reproductive isolation and allow saltational sympatric speciation within the B. plicatilis cryptic species complex, and that this process may be largely neutral. PMID:22852831
Brauburger, Kristina; Boehmann, Yannik; Tsuda, Yoshimi; Hoenen, Thomas; Olejnik, Judith; Schümann, Michael; Ebihara, Hideki
2014-01-01
ABSTRACT Ebola virus (EBOV) belongs to the group of nonsegmented negative-sense RNA viruses. The seven EBOV genes are separated by variable gene borders, including short (4- or 5-nucleotide) intergenic regions (IRs), a single long (144-nucleotide) IR, and gene overlaps, where the neighboring gene end and start signals share five conserved nucleotides. The unique structure of the gene overlaps and the presence of a single long IR are conserved among all filoviruses. Here, we sought to determine the impact of the EBOV gene borders during viral transcription. We show that readthrough mRNA synthesis occurs in EBOV-infected cells irrespective of the structure of the gene border, indicating that the gene overlaps do not promote recognition of the gene end signal. However, two consecutive gene end signals at the VP24 gene might improve termination at the VP24-L gene border, ensuring efficient L gene expression. We further demonstrate that the long IR is not essential for but regulates transcription reinitiation in a length-dependent but sequence-independent manner. Mutational analysis of bicistronic minigenomes and recombinant EBOVs showed no direct correlation between IR length and reinitiation rates but demonstrated that specific IR lengths not found naturally in filoviruses profoundly inhibit downstream gene expression. Intriguingly, although truncation of the 144-nucleotide-long IR to 5 nucleotides did not substantially affect EBOV transcription, it led to a significant reduction of viral growth. IMPORTANCE Our current understanding of EBOV transcription regulation is limited due to the requirement for high-containment conditions to study this highly pathogenic virus. EBOV is thought to share many mechanistic features with well-analyzed prototype nonsegmented negative-sense RNA viruses. 
A single polymerase entry site at the 3′ end of the genome determines that transcription of the genes is mainly controlled by gene order and cis-acting signals found at the gene borders. Here, we examined the regulatory role of the structurally unique EBOV gene borders during viral transcription. Our data suggest that transcriptional regulation in EBOV is highly complex and differs from that in prototype viruses and further the understanding of this most fundamental process in the filovirus replication cycle. Moreover, our results with recombinant EBOVs suggest a novel role of the long IR found in all filovirus genomes during the viral replication cycle. PMID:25142600
Brauburger, Kristina; Boehmann, Yannik; Tsuda, Yoshimi; Hoenen, Thomas; Olejnik, Judith; Schümann, Michael; Ebihara, Hideki; Mühlberger, Elke
2014-11-01
Ebola virus (EBOV) belongs to the group of nonsegmented negative-sense RNA viruses. The seven EBOV genes are separated by variable gene borders, including short (4- or 5-nucleotide) intergenic regions (IRs), a single long (144-nucleotide) IR, and gene overlaps, where the neighboring gene end and start signals share five conserved nucleotides. The unique structure of the gene overlaps and the presence of a single long IR are conserved among all filoviruses. Here, we sought to determine the impact of the EBOV gene borders during viral transcription. We show that readthrough mRNA synthesis occurs in EBOV-infected cells irrespective of the structure of the gene border, indicating that the gene overlaps do not promote recognition of the gene end signal. However, two consecutive gene end signals at the VP24 gene might improve termination at the VP24-L gene border, ensuring efficient L gene expression. We further demonstrate that the long IR is not essential for but regulates transcription reinitiation in a length-dependent but sequence-independent manner. Mutational analysis of bicistronic minigenomes and recombinant EBOVs showed no direct correlation between IR length and reinitiation rates but demonstrated that specific IR lengths not found naturally in filoviruses profoundly inhibit downstream gene expression. Intriguingly, although truncation of the 144-nucleotide-long IR to 5 nucleotides did not substantially affect EBOV transcription, it led to a significant reduction of viral growth. Our current understanding of EBOV transcription regulation is limited due to the requirement for high-containment conditions to study this highly pathogenic virus. EBOV is thought to share many mechanistic features with well-analyzed prototype nonsegmented negative-sense RNA viruses. A single polymerase entry site at the 3' end of the genome determines that transcription of the genes is mainly controlled by gene order and cis-acting signals found at the gene borders. 
Here, we examined the regulatory role of the structurally unique EBOV gene borders during viral transcription. Our data suggest that transcriptional regulation in EBOV is highly complex and differs from that in prototype viruses and further the understanding of this most fundamental process in the filovirus replication cycle. Moreover, our results with recombinant EBOVs suggest a novel role of the long IR found in all filovirus genomes during the viral replication cycle. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Li, Yaqian; Du, Xilin; Lu, Zhi John; Wu, Daqiang; Zhao, Yilei; Ren, Bin; Huang, Jiaofang; Huang, Xianqing; Xu, Yuhong; Xu, Yuquan
2011-01-01
Background Phenazines are important compounds produced by pseudomonads and other bacteria. Two phz gene clusters, phzA1-G1 and phzA2-G2, were found in the genome of Pseudomonas sp. M18, an effective biocontrol agent that is highly homologous to the opportunistic human pathogen P. aeruginosa PAO1. However, little is known about the correlation between the expression of the two phz gene clusters. Methodology/Principal Findings Chromosomal insertion-inactivated mutants were constructed for each of the two gene clusters, and the correlation between the expression of the two phz gene clusters was investigated in strain M18. Phenazine-1-carboxylic acid (PCA) molecules produced from the phzA2-G2 gene cluster are able to auto-regulate their own expression and activate the expression of the phzA1-G1 gene cluster in a circulated amplification pattern. However, the post-transcriptional expression of the phzA1-G1 transcript was blocked, principally through its 5′-untranslated region (UTR). In contrast, the phzA2-G2 gene cluster was transcribed to a lesser extent and translated efficiently, and was negatively regulated by the GacA signal transduction pathway, mainly at a post-transcriptional level. Conclusions/Significance A single molecule, PCA, produced in different quantities by the two phz gene clusters acted as the functional mediator, and the two phz gene clusters developed a specific regulatory mechanism that acts through the 5′-UTR to transfer a single, but complex, bacterial signaling event in Pseudomonas sp. strain M18. PMID:21559370
Using a periclinal chimera to unravel layer-specific gene expression in plants
Filippis, Ioannis; Lopez-Cobollo, Rosa; Abbott, James; Butcher, Sarah; Bishop, Gerard J
2013-01-01
Plant organs are made from multiple cell types, and defining the expression level of a gene in any one cell or group of cells from a complex mixture is difficult. Dicotyledonous plants normally have three distinct layers of cells, L1, L2 and L3. Layer L1 is the single layer of cells making up the epidermis, layer L2 is the single-cell sub-epidermal layer, and layer L3 constitutes the rest of the internal cells. Here we show how it is possible to harvest an organ and characterise the level of layer-specific expression by using a periclinal chimera that has its L1 layer from Solanum pennellii and its L2 and L3 layers from Solanum lycopersicum. This is possible by measuring the frequency of species-specific transcripts. RNA-seq analysis enabled the genome-wide assessment of whether a gene is expressed in the L1 or L2/L3 layers. From 13 277 genes that are expressed in both the chimera and the parental lines and with at least one polymorphism between the parental alleles, we identified 382 genes that are preferentially expressed in L1 in contrast to 1159 genes in L2/L3. Gene ontology analysis shows that many genes preferentially expressed in L1 are involved in cutin and wax biosynthesis, whereas numerous genes that are preferentially expressed in L2/L3 tissue are associated with chloroplastic processes. These data demonstrate the usefulness of such chimeras and provide detailed information on the level of layer-specific expression of genes. PMID:23725542
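The idea of inferring layer-specific expression from the frequency of species-specific transcripts can be sketched as follows. This is an illustrative simplification, not the authors' pipeline: the function names, the 0.75/0.25 cutoffs, and the example read counts are assumptions, and a real analysis would also need statistical testing and normalisation against the parental lines.

```python
def l1_fraction(pennellii_reads, lycopersicum_reads):
    """Share of a gene's allele-specific reads carrying the S. pennellii (L1) allele."""
    total = pennellii_reads + lycopersicum_reads
    if total == 0:
        return None  # no informative reads for this gene
    return pennellii_reads / total


def layer_call(fraction, high=0.75, low=0.25):
    """Assign a preferred layer; `high` and `low` are hypothetical cutoffs."""
    if fraction is None:
        return "uninformative"
    if fraction >= high:
        return "L1"
    if fraction <= low:
        return "L2/L3"
    return "no preference"


# Invented (pennellii_reads, lycopersicum_reads) counts for three hypothetical genes.
counts = {"gene_a": (900, 100), "gene_b": (30, 970), "gene_c": (480, 520)}
calls = {gene: layer_call(l1_fraction(p, l)) for gene, (p, l) in counts.items()}
```

Because L1 contributes only a single cell layer to the organ, a raw 50:50 split would itself imply enrichment in L1 on a per-cell basis, so any cutoff should be chosen with the relative tissue contribution of each layer in mind.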
Miller, Marcia M.; Taylor, Robert L.
2016-01-01
Nearly all genes presently mapped to chicken chromosome 16 (GGA 16) have either a demonstrated role in immune responses or are considered to serve in immunity by reason of sequence homology with immune system genes defined in other species. The genes are best described in regional units. Among these, the best known is the polymorphic major histocompatibility complex-B (MHC-B) region containing genes for classical peptide antigen presentation. Near MHC-B is a small region containing two CD1 genes, which encode molecules known to bind lipid antigens and which will likely be found in chickens to present lipids to specialized T cells, as occurs with CD1 molecules in other species. Another region is the MHC-Y region, separated from MHC-B by an intervening region of tandem repeats. Like MHC-B, MHC-Y is polymorphic. It contains specialized class I and class II genes and c-type lectin-like genes. Yet another region, separated from MHC-Y by the single nucleolar organizing region (NOR) in the chicken genome, contains olfactory receptor genes and scavenger receptor genes, which are also thought to contribute to immunity. The structure, distribution, linkages and patterns of polymorphism in these regions suggest that GGA 16 evolves as a microchromosome devoted to immune defense. Many GGA 16 genes are polymorphic and polygenic. At the moment, most disease associations are at the haplotype level. Roles of individual MHC genes in disease resistance are documented in only a very few instances. Provided suitable experimental stocks persist, the availability of increasingly detailed maps of GGA 16 genes combined with new means for detecting genetic variability will lead to investigations defining the contributions of individual loci and to more applications for immunogenetics in breeding healthy poultry. PMID:26740135
Herranz, Mari Carmen; Niehl, Annette; Rosales, Marlene; Fiore, Nicola; Zamorano, Alan; Granell, Antonio; Pallas, Vicente
2013-05-28
Microarray profiling is a powerful technique to investigate expression changes of large amounts of genes in response to specific environmental conditions. The majority of the studies investigating gene expression changes in virus-infected plants are limited to interactions between a virus and a model host plant, which usually is Arabidopsis thaliana or Nicotiana benthamiana. In the present work, we performed microarray profiling to explore changes in the expression profile of field-grown Prunus persica (peach) originating from Chile upon single and double infection with Prunus necrotic ringspot virus (PNRSV) and Peach latent mosaic viroid (PLMVd), worldwide natural pathogens of peach trees. Upon single PLMVd or PNRSV infection, the number of statistically significant gene expression changes was relatively low. By contrast, doubly-infected fruits presented a high number of differentially regulated genes. Among these, down-regulated genes were prevalent. Functional categorization of the gene expression changes upon double PLMVd and PNRSV infection revealed protein modification and degradation as the functional category with the highest percentage of repressed genes whereas induced genes encoded mainly proteins related to phosphate, C-compound and carbohydrate metabolism and also protein modification. Overrepresentation analysis upon double infection with PLMVd and PNRSV revealed specific functional categories over- and underrepresented among the repressed genes indicating active counter-defense mechanisms of the pathogens during infection. Our results identify a novel synergistic effect of PLMVd and PNRSV on the transcriptome of peach fruits. We demonstrate that mixed infections, which occur frequently in field conditions, result in a more complex transcriptional response than that observed in single infections. 
Thus, our data demonstrate for the first time that the simultaneous infection of a viroid and a plant virus synergistically affect the host transcriptome in infected peach fruits. These field studies can help to fully understand plant-pathogen interactions and to develop appropriate crop protection strategies. PMID:23710752
Gene- and pathway-based association tests for multiple traits with GWAS summary statistics.
Kwak, Il-Youp; Pan, Wei
2017-01-01
To identify novel genetic variants associated with complex traits and to gain new insights into the underlying biology, in addition to the most popular single SNP-single trait association analysis, it would be useful to explore multiple correlated (intermediate) traits at the gene- or pathway-level by mining existing single GWAS or meta-analyzed GWAS data. For this purpose, we present an adaptive gene-based test and a pathway-based test for association analysis of multiple traits with GWAS summary statistics. The proposed tests are adaptive at both the SNP- and trait-levels; that is, they account for possibly varying association patterns (e.g. signal sparsity levels) across SNPs and traits, thus maintaining high power across a wide range of situations. Furthermore, the proposed methods are general: they can be applied to mixed types of traits, and to Z-statistics or P-values as summary statistics obtained from either a single GWAS or a meta-analysis of multiple GWAS. Our numerical studies with simulated and real data demonstrated the promising performance of the proposed methods. The methods are implemented in R package aSPU, freely and publicly available at: https://cran.r-project.org/web/packages/aSPU/ Contact: weip@biostat.umn.edu. Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
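The adaptive principle described in this abstract can be illustrated with a much simpler single-trait version: compute sum-of-powered-score (SPU) statistics from the Z-scores for several powers, then take the smallest p-value and calibrate it by Monte Carlo simulation. The sketch below is an assumption-laden illustration in Python, not the aSPU package's code or the paper's multi-trait test. The function names, the default powers, the simulation count and the use of a user-supplied SNP correlation matrix `corr` are all choices made for this example.

```python
import bisect
import math
import random


def cholesky(corr):
    """Lower-triangular L with L * L^T == corr (corr must be positive definite)."""
    n = len(corr)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(low[i][k] * low[j][k] for k in range(j))
            if i == j:
                low[i][j] = math.sqrt(corr[i][i] - s)
            else:
                low[i][j] = (corr[i][j] - s) / low[j][j]
    return low


def spu(z, gamma):
    """|sum of Z_j^gamma|; gamma = inf is taken as the largest |Z_j|."""
    if gamma == math.inf:
        return max(abs(v) for v in z)
    return abs(sum(v ** gamma for v in z))


def aspu_pvalue(z, corr, gammas=(1, 2, 4, 8, math.inf), n_sim=2000, seed=0):
    """Monte Carlo p-value of the minimum-p (adaptive) statistic over gammas."""
    rng = random.Random(seed)
    low = cholesky(corr)
    n = len(z)
    null = []  # null[b][k]: statistic of simulation b for power gammas[k]
    for _ in range(n_sim):
        e = [rng.gauss(0.0, 1.0) for _ in range(n)]
        # Correlated null Z-scores: z0 = L * e
        z0 = [sum(low[i][k] * e[k] for k in range(i + 1)) for i in range(n)]
        null.append([spu(z0, g) for g in gammas])
    cols = [sorted(row[k] for row in null) for k in range(len(gammas))]

    def tail(k, x):
        # Number of null statistics for power k that are >= x
        return n_sim - bisect.bisect_left(cols[k], x)

    obs = [spu(z, g) for g in gammas]
    p_obs = min((tail(k, obs[k]) + 1) / (n_sim + 1) for k in range(len(gammas)))
    # The same simulations give the null distribution of the minimum p-value.
    null_min_p = [min(tail(k, row[k]) / n_sim for k in range(len(gammas)))
                  for row in null]
    return (sum(p <= p_obs for p in null_min_p) + 1) / (n_sim + 1)


if __name__ == "__main__":
    identity = [[1.0 if i == j else 0.0 for j in range(4)] for i in range(4)]
    print(aspu_pvalue([5.0, 0.2, -0.3, 0.1], identity))
```

Small powers favour many weak signals and large powers favour a few strong ones, so taking the minimum p-value lets one test adapt to the unknown sparsity. Reusing the same simulations for both layers of the calculation avoids a nested Monte Carlo loop.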
Menon, Debashish U; Coarfa, Cristian; Xiao, Weimin; Gunaratne, Preethi H; Meller, Victoria H
2014-11-18
Highly differentiated sex chromosomes create a lethal imbalance in gene expression in one sex. To accommodate hemizygosity of the X chromosome in male fruit flies, expression of X-linked genes increases twofold. This is achieved by the male-specific lethal (MSL) complex, which modifies chromatin to increase expression. Mutations that disrupt the X localization of this complex decrease the expression of X-linked genes and reduce male survival. The mechanism that restricts the MSL complex to X chromatin is not understood. We recently reported that the siRNA pathway contributes to localization of the MSL complex, raising questions about the source of the siRNAs involved. The X-linked 1.688 g/cm³ satellite related repeats (1.688(X) repeats) are restricted to the X chromosome and produce small RNA, making them an attractive candidate. We tested RNA from these repeats for a role in dosage compensation and found that ectopic expression of single-stranded RNAs from 1.688(X) repeats enhanced the male lethality of mutants with defective X recognition. In contrast, expression of double-stranded hairpin RNA from a 1.688(X) repeat generated abundant siRNA and dramatically increased male survival. Consistent with improved survival, X localization of the MSL complex was largely restored in these males. The striking distribution of 1.688(X) repeats, which are nearly exclusive to the X chromosome, suggests that these are cis-acting elements contributing to identification of X chromatin.
Single proteins that serve linked functions in intracellular and extracellular microenvironments
DOE Office of Scientific and Technical Information (OSTI.GOV)
Radisky, Derek C.; Stallings-Mann, Melody; Hirai, Yohei
2009-06-03
Maintenance of organ homeostasis and control of appropriate response to environmental alterations requires intimate coordination of cellular function and tissue organization. An important component of this coordination may be provided by proteins that can serve distinct, but linked, functions on both sides of the plasma membrane. Here we present a novel hypothesis in which non-classical secretion can provide a mechanism through which single proteins can integrate complex tissue functions. Single genes can exert a complex, dynamic influence through a number of different processes that act to multiply the function of the gene product(s). Alternative splicing can create many different transcripts that encode proteins of diverse, even antagonistic, function from a single gene. Posttranslational modifications can alter the stability, activity, localization, and even basic function of proteins. A protein can exist in different subcellular localizations. More recently, it has become clear that single proteins can function both inside and outside the cell. These proteins often lack defined secretory signal sequences, and transit the plasma membrane by mechanisms separate from the classical ER/Golgi secretory process. When examples of such proteins are examined individually, the multifunctionality and lack of a signal sequence are puzzling - why should a protein with a well known function in one context function in such a distinct fashion in another? We propose that one reason for a single protein to perform intracellular and extracellular roles is to coordinate organization and maintenance of a global tissue function. Here, we describe in detail three specific examples of proteins that act in this fashion, outlining their specific functions in the extracellular space and in the intracellular space, and we discuss how these functions may be linked.
We present epimorphin/syntaxin-2, which may coordinate morphogenesis of secretory organs (as epimorphin) with control of protein secretion (as syntaxin-2), amphoterin/high mobility group box-1 (HMGB1), which may link inflammation (as amphoterin) with regulation of gene expression (as HMGB1), and tissue transglutaminase, which affects delivery of and response to apoptotic signals by serving a related function on both sides of the plasma membrane. As it is notable that all three of these proteins have been reported to transit the plasma membrane through non-classical secretory mechanisms, we will also discuss why coordinated inside/outside functions may be found in some examples of proteins which transit the plasma membrane through non-classical mechanisms and how this relationship can be used to identify additional proteins that share these characteristics.
Grigat, Mathias; Jäschke, Yvonne; Kliewe, Felix; Pfeifer, Matthias; Walz, Susanne; Schüller, Hans-Joachim
2012-06-01
Yeast genes of phospholipid biosynthesis are negatively regulated by repressor protein Opi1 when precursor molecules inositol and choline (IC) are available. Opi1-triggered gene repression is mediated by recruitment of the Sin3 corepressor complex. In this study, we systematically investigated the regulatory contribution of subunits of Sin3 complexes and identified Pho23 as important for IC-dependent gene repression. Two non-overlapping regions within Pho23 mediate its direct interaction with Sin3. Previous work has shown that Sin3 recruits the histone deacetylase (HDAC) Rpd3 to execute gene repression. While deletion of SIN3 strongly alleviates gene repression by IC, an rpd3 null mutant shows almost normal regulation. We thus hypothesized that various HDACs may contribute to Sin3-mediated repression of IC-regulated genes. Indeed, a triple mutant lacking HDACs, Rpd3, Hda1 and Hos1, could phenocopy a sin3 single mutant. We show that these proteins are able to contact Sin3 in vitro and in vivo and mapped three distinct HDAC interaction domains, designated HID1, HID2 and HID3. HID3, which is identical to the previously described structural motif PAH4 (paired amphipathic helix), can bind all HDACs tested. Chromatin immunoprecipitation studies finally confirmed that Hda1 and Hos1 are recruited to promoters of phospholipid biosynthetic genes INO1 and CHO2.
Molecular evolution: breakthroughs and mysteries in Batesian mimicry.
Booker, Tom; Ness, Rob W; Charlesworth, Deborah
2015-06-15
Recent studies appear to overthrow the hypothesis that, in butterfly species exhibiting Batesian mimicry, a multi-gene complex or 'supergene' controls the multiple differences between mimetic and non-mimetic individuals, suggesting instead that near-perfect mimicry can be produced by a set of changes within a single locus, together with changes in the genetic background. Copyright © 2015 Elsevier Ltd. All rights reserved.
Miller, Hilary C.; O’Meally, Denis; Ezaz, Tariq; Amemiya, Chris; Marshall-Graves, Jennifer A.; Edwards, Scott
2015-01-01
Major histocompatibility complex (MHC) genes are a central component of the vertebrate immune system and usually exist in a single genomic region. However, considerable differences in MHC organization and size exist between different vertebrate lineages. Reptiles occupy a key evolutionary position for understanding how variation in MHC structure evolved in vertebrates, but information on the structure of the MHC region in reptiles is limited. In this study, we investigate the organization and cytogenetic location of MHC genes in the tuatara (Sphenodon punctatus), the sole extant representative of the early-diverging reptilian order Rhynchocephalia. Sequencing and mapping of 12 clones containing class I and II MHC genes from a bacterial artificial chromosome library indicated that the core MHC region is located on chromosome 13q. However, duplication and translocation of MHC genes outside of the core region was evident, because additional class I MHC genes were located on chromosome 4p. We found a total of seven class I sequences and 11 class II β sequences, with evidence for duplication and pseudogenization of genes within the tuatara lineage. The tuatara MHC is characterized by high repeat content and low gene density compared with other species and we found no antigen processing or MHC framework genes on the MHC gene-containing clones. Our findings indicate substantial differences in MHC organization in tuatara compared with mammalian and avian MHCs and highlight the dynamic nature of the MHC. Further sequencing and annotation of tuatara and other reptile MHCs will determine if the tuatara MHC is representative of nonavian reptiles in general. PMID:25953959
Microprocessor dynamics and interactions at endogenous imprinted C19MC microRNA genes.
Bellemer, Clément; Bortolin-Cavaillé, Marie-Line; Schmidt, Ute; Jensen, Stig Mølgaard Rask; Kjems, Jørgen; Bertrand, Edouard; Cavaillé, Jérôme
2012-06-01
Nuclear primary microRNA (pri-miRNA) processing catalyzed by the DGCR8-Drosha (Microprocessor) complex is highly regulated. Little is known, however, about how microRNA biogenesis is spatially organized within the mammalian nucleus. Here, we image for the first time, in living cells and at the level of a single microRNA cluster, the intranuclear distribution of untagged, endogenously-expressed pri-miRNAs generated at the human imprinted chromosome 19 microRNA cluster (C19MC), from the environment of transcription sites to single molecules of fully released DGCR8-bound pri-miRNAs dispersed throughout the nucleoplasm. We report that a large fraction of Microprocessor concentrates onto unspliced C19MC pri-miRNA deposited in close proximity to their genes. Our live-cell imaging studies provide direct visual evidence that DGCR8 and Drosha are targeted post-transcriptionally to C19MC pri-miRNAs as a preformed complex but dissociate separately. These dynamics support the view that, upon pri-miRNA loading and most probably concomitantly with Drosha-mediated cleavages, Microprocessor undergoes conformational changes that trigger the release of Drosha while DGCR8 remains stably bound to pri-miRNA.
Pendergrass, Sarah A; Verma, Shefali S; Holzinger, Emily R; Moore, Carrie B; Wallace, John; Dudek, Scott M; Huggins, Wayne; Kitchner, Terrie; Waudby, Carol; Berg, Richard; McCarty, Catherine A; Ritchie, Marylyn D
2013-01-01
Investigating the association between biobank derived genomic data and the information of linked electronic health records (EHRs) is an emerging area of research for dissecting the architecture of complex human traits, where cases and controls for study are defined through the use of electronic phenotyping algorithms deployed in large EHR systems. For our study, 2580 cataract cases and 1367 controls were identified within the Marshfield Personalized Medicine Research Project (PMRP) Biobank and linked EHR, which is a member of the NHGRI-funded electronic Medical Records and Genomics (eMERGE) Network. Our goal was to explore potential gene-gene and gene-environment interactions within these data for 529,431 single nucleotide polymorphisms (SNPs) with minor allele frequency > 1%, in order to explore higher level associations with cataract risk beyond investigations of single SNP-phenotype associations. To build our SNP-SNP interaction models we utilized a prior-knowledge driven filtering method called Biofilter to minimize the multiple testing burden of exploring the vast array of interaction models possible from our extensive number of SNPs. Using the Biofilter, we developed 57,376 prior-knowledge directed SNP-SNP models to test for association with cataract status. We selected models that required 6 sources of external domain knowledge. We identified 5 statistically significant models with an interaction term with p-value < 0.05, as well as an overall model with p-value < 0.05 associated with cataract status. We also conducted gene-environment interaction analyses for all GWAS SNPs and a set of environmental factors from the PhenX Toolkit: smoking, UV exposure, and alcohol use; these environmental factors have been previously associated with the formation of cataracts. We found a total of 288 models that exhibit an interaction term with a p-value ≤ 1×10⁻⁴ associated with cataract status.
Our results show these approaches enable advanced searches for epistasis and gene-environment interactions beyond GWAS, and that the EHR based approach provides an additional source of data for seeking these advanced explanatory models of the etiology of complex disease/outcome such as cataracts.
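To give a sense of the multiple testing burden mentioned in this abstract, the short sketch below counts how many pairwise SNP-SNP models an exhaustive search over 529,431 SNPs would involve and compares that with the 57,376 knowledge-filtered models. The Bonferroni thresholds are shown purely as an illustration of scale. The abstract does not state which correction the authors applied, and the function names are hypothetical.

```python
def n_pairwise_models(n_snps):
    """Number of unordered SNP pairs, i.e. n choose 2."""
    return n_snps * (n_snps - 1) // 2


def bonferroni_threshold(alpha, n_tests):
    """Per-test significance level that keeps the family-wise error at alpha."""
    return alpha / n_tests


if __name__ == "__main__":
    n_snps = 529_431      # SNPs with minor allele frequency > 1% (from the abstract)
    n_filtered = 57_376   # Biofilter-derived SNP-SNP models (from the abstract)
    exhaustive = n_pairwise_models(n_snps)
    print("exhaustive pairwise models:", exhaustive)
    print("reduction factor:", exhaustive / n_filtered)
    print("illustrative Bonferroni level, exhaustive:",
          bonferroni_threshold(0.05, exhaustive))
    print("illustrative Bonferroni level, filtered:",
          bonferroni_threshold(0.05, n_filtered))
```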
Wang, Lu-Yong; Fasulo, D
2006-01-01
Genome-wide association studies of complex diseases generate massive amounts of single nucleotide polymorphism (SNP) data. Univariate statistical tests (e.g. the Fisher exact test) have been used to screen out non-associated SNPs. However, disease-susceptibility SNPs may have little marginal effect in the population and are unlikely to be retained after univariate tests. Also, model-based methods are impractical for large-scale datasets. Moreover, genetic heterogeneity makes it harder for traditional methods to identify the genetic causes of diseases. A more recent random forest method provides a more robust way of screening SNPs at the scale of thousands. However, for larger-scale data, e.g. Affymetrix Human Mapping 100K GeneChip data, a faster method is required to screen SNPs in whole-genome association analysis with genetic heterogeneity. We propose a boosting-based method for rapid screening in large-scale analysis of complex traits in the presence of genetic heterogeneity. It provides a relatively fast and reasonably effective tool for screening and limiting the candidate SNPs for further, more complex computational modeling.
De novo gene mutations highlight patterns of genetic and neural complexity in schizophrenia
Xu, Bin; Ionita-Laza, Iuliana; Roos, J. Louw; Boone, Braden; Woodrick, Scarlet; Sun, Yan; Levy, Shawn; Gogos, Joseph A.; Karayiorgou, Maria
2013-01-01
To evaluate evidence for de novo etiologies in schizophrenia, we sequenced at high coverage the exomes of families recruited from two populations with distinct demographic structure and history. We sequenced a total of 795 exomes from 231 parent-proband trios enriched for sporadic schizophrenia cases, as well as 34 unaffected trios. We observed in cases an excess of non-synonymous single nucleotide variants as well as a higher prevalence of gene-disruptive de novo mutations. We found four genes (LAMA2, DPYD, TRRAP and VPS39) affected by recurrent de novo events within or across the two populations, a finding unlikely to have occurred by chance. We show that de novo mutations affect genes with diverse functions and developmental profiles but we also find a substantial contribution of mutations in genes with higher expression in early fetal life. Our results help define the pattern of genomic and neural architecture of schizophrenia. PMID:23042115
Cloning and characterization of the canine receptor for advanced glycation end products.
Murua Escobar, Hugo; Soller, Jan T; Sterenczak, Katharina A; Sperveslage, Jan D; Schlueter, Claudia; Burchardt, Birgit; Eberle, Nina; Fork, Melanie; Nimzyk, Rolf; Winkler, Susanne; Nolte, Ingo; Bullerdiek, Jörn
2006-03-15
Metastasis is one of the major problems when dealing with malignant neoplasias. Accordingly, the finding of molecular targets, which can be addressed to reduce tumour metastasising, will have significant impact on the development of new therapeutic approaches. Recently, the receptor for advanced glycation end products (RAGE)-high mobility group B1 (HMGB1) protein complex has been shown to have significant influence on invasiveness, growth and motility of tumour cells, which are essential characteristics required for metastatic behaviour. A set of in vitro and in vivo approaches showed that blocking of this complex resulted in drastic suppression of tumour cell growth. Due to the similarities of human and canine cancer the dog has joined the common rodent animal model for therapeutic and preclinical studies. However, complete characterisation of the protein complex is a precondition to a therapeutic approach based on the blocking of the RAGE-HMGB1 complex to spontaneously occurring tumours in dogs. We recently characterised the canine HMGB1 gene and protein completely. Here we present the complete characterisation of the canine RAGE gene including its 1384 bp mRNA, the 1215 bp protein coding sequence, the 2835 bp genomic structure, chromosomal localisation, gene expression pattern, and its 404 amino acid protein. Furthermore we compared the CDS of six different canine breeds and screened them for single nucleotide polymorphisms.
From the ORFeome concept to highly comprehensive, full-genome screening libraries.
Rid, Raphaela; Abdel-Hadi, Omar; Maier, Richard; Wagner, Martin; Hundsberger, Harald; Hintner, Helmut; Bauer, Johann; Onder, Kamil
2013-02-01
Recombination-based cloning techniques have in recent times facilitated the establishment of genome-scale single-gene ORFeome repositories. Their further handling and downstream application in systematic fashion is, however, practically impeded because of logistical plus economic challenges. At this juncture, simultaneously transferring entire gene collections in compiled pool format could represent an advanced compromise between systematic ORFeome (an organism's entire set of protein-encoding open reading frames) projects and traditional random library approaches, but has not yet been considered in great detail. In our endeavor to merge the comprehensiveness of ORFeomes with a basically simple, streamlined, and easily executable single-tube design, we have here produced five different pooled screening-ready libraries for both Staphylococcus aureus and Homo sapiens. By evaluating the parallel transfer efficiencies of differentially sized genes from initial polymerase chain reaction (PCR) product amplification to entry and final destination library construction via quantitative real-time PCR, we found that the complexity of the gene population is fairly stably maintained once an entry resource has been successfully established, and that no apparent size-selection bias loss of large inserts takes place. Recombinational transfer processes are hence robust enough for straightforwardly achieving such pooled screening libraries.
IGF-I gene variability is associated with an increased risk for AD.
Vargas, Teo; Martinez-Garcia, Ana; Antequera, Desiree; Vilella, Elisabet; Clarimon, Jordi; Mateo, Ignacio; Sanchez-Juan, Pascual; Rodriguez-Rodriguez, Eloy; Frank, Ana; Rosich-Estrago, Marcel; Lleo, Alberto; Molina-Porcel, Laura; Blesa, Rafael; Gomez-Isla, Teresa; Combarros, Onofre; Bermejo-Pareja, Felix; Valdivieso, Fernando; Bullido, Maria Jesus; Carro, Eva
2011-03-01
Insulin-like growth factor I (IGF-I), a neuroprotective factor with a wide spectrum of actions in the adult brain, is involved in the pathogenesis of Alzheimer's disease (AD). Circulating levels of IGF-I change in AD patients and are implicated in the clearance of brain amyloid beta (Aβ) complexes. To investigate this hypothesis, we screened the IGF-I gene for various well known single nucleotide polymorphisms (SNPs) covering % of the gene variability in a population of 2352 individuals. Genetic analysis indicated a different distribution of genotypes of 1 single nucleotide polymorphism and 1 extended haplotype in the AD population compared with healthy control subjects. In particular, the frequency of the rs972936 GG genotype was significantly greater in AD patients than in control subjects (63% vs. 55%). The rs972936 GG genotype was associated with an increased risk for disease, independently of apolipoprotein E genotype, and with enhanced circulating levels of IGF-I. These findings suggest that polymorphisms within the IGF-I gene could confer greater risk for AD through their effect on IGF-I levels, and confirm the physiological role of IGF-I in the pathogenesis of AD. Copyright © 2011 IBRO. Published by Elsevier Inc. All rights reserved.
Baum, AE; Akula, N; Cabanero, M; Cardona, I; Corona, W; Klemens, B; Schulze, TG; Cichon, S; Rietschel, M; Nöthen, MM; Georgi, A; Schumacher, J; Schwarz, M; Jamra, R Abou; Höfels, S; Propping, P; Satagopan, J; Detera-Wadleigh, SD; Hardy, J; McMahon, FJ
2008-01-01
The genetic basis of bipolar disorder has long been thought to be complex, with the potential involvement of multiple genes, but methods to analyze populations with respect to this complexity have only recently become available. We have carried out a genome-wide association study of bipolar disorder by genotyping over 550,000 SNPs in two independent case-control samples of European origin. The initial association screen was performed using pooled DNA; selected SNPs were confirmed by individual genotyping. While DNA pooling reduces power to detect genetic associations, there is a substantial cost savings and gain in efficiency. A total of 88 SNPs representing 80 different genes met the prior criteria for replication in both samples. Effect sizes were modest: no single SNP of large effect was detected. Of 37 SNPs selected for individual genotyping, the strongest association signal was detected at a marker within the first intron of DGKH (p = 1.5 × 10⁻⁸, experiment-wide p < 0.01, OR = 1.59). This gene encodes diacylglycerol kinase eta, a key protein in the lithium-sensitive phosphatidyl inositol pathway. This first genome-wide association study of bipolar disorder shows that several genes, each of modest effect, reproducibly influence disease risk. Bipolar disorder may be a polygenic disease. PMID:17486107
The Complex Transcriptional Response of Acaryochloris marina to Different Oxygen Levels.
Hernández-Prieto, Miguel A; Lin, Yuankui; Chen, Min
2017-02-09
Ancient oxygenic photosynthetic prokaryotes produced oxygen as a waste product, but existed for a long time under an oxygen-free (anoxic) atmosphere, before an oxic atmosphere emerged. The change in oxygen levels in the atmosphere influenced the chemistry and structure of many enzymes that contained prosthetic groups that were inactivated by oxygen. In the genome of Acaryochloris marina, multiple gene copies exist for proteins that are normally encoded by a single gene copy in other cyanobacteria. Using high throughput RNA sequencing to profile transcriptome responses from cells grown under microoxic and hyperoxic conditions, we detected 8446 transcripts out of the 8462 annotated genes in the Cyanobase database. Two-thirds of the 50 most abundant transcripts are key proteins in photosynthesis. Microoxic conditions negatively affected the levels of expression of genes encoding photosynthetic complexes, with the exception of some subunits. In addition to the known regulation of the multiple copies of psbA, we detected a similar transcriptional pattern for psbJ and psbU, which might play a key role in the altered components of photosystem II. Furthermore, regulation of genes encoding proteins important for reactive oxygen species-scavenging is discussed at genome level, including, for the first time, specific small RNAs having possible regulatory roles under varying oxygen levels. Copyright © 2017 Hernandez-Prieto et al. PMID:27974439
Modarai, Shirin R; Man, Dula; Bialk, Pawel; Rivera-Torres, Natalia; Bloh, Kevin; Kmiec, Eric B
2018-06-01
CD34+ cells are prime targets for therapeutic strategies for gene editing, because modified progenitor cells have the capacity to differentiate through an erythropoietic lineage. Although experimental advances have been reported, the associated experimental protocols have largely been less than clear or robust. As such, we evaluated the relationships among cellular delivery; nuclear uptake, often viewed as the benchmark metric of successful gene editing; and single base repair. We took a combinatorial approach using single-stranded oligonucleotide and a CRISPR/Cas9 ribonucleoprotein to convert wild-type HBB into the sickle cell genotype by evaluating conditions for two common delivery strategies of gene editing tools into CD34+ cells. Confocal microscopy data show that the CRISPR/Cas9 ribonucleoprotein tends to accumulate at the outer membrane of the CD34+ cell nucleus when the Neon Transfection System is employed, while the ribonucleoproteins do pass into the cell nucleus when nucleofection is used. Despite the high efficiency of cellular transformation, and the traditional view of success in efficient nuclear uptake, neither delivery methodology enabled gene editing activity. Our results indicate that more stringent criteria must be established to facilitate the clinical translation and scientific robustness of gene editing for sickle cell disease. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Proteomics to study DNA-bound and chromatin-associated gene regulatory complexes
Wierer, Michael; Mann, Matthias
2016-01-01
High-resolution mass spectrometry (MS)-based proteomics is a powerful method for the identification of soluble protein complexes and large-scale affinity purification screens can decode entire protein interaction networks. In contrast, protein complexes residing on chromatin have been much more challenging, because they are difficult to purify and often of very low abundance. However, this is changing due to recent methodological and technological advances in proteomics. Proteins interacting with chromatin marks can directly be identified by pulldowns with synthesized histone tails containing posttranslational modifications (PTMs). Similarly, pulldowns with DNA baits harbouring single nucleotide polymorphisms or DNA modifications reveal the impact of those DNA alterations on the recruitment of transcription factors. Accurate quantitation – either isotope-based or label free – unambiguously pinpoints proteins that are significantly enriched over control pulldowns. In addition, protocols that combine classical chromatin immunoprecipitation (ChIP) methods with mass spectrometry (ChIP-MS) target gene regulatory complexes in their in-vivo context. Similar to classical ChIP, cells are crosslinked with formaldehyde and chromatin sheared by sonication or nuclease digested. ChIP-MS baits can be proteins in tagged or endogenous form, histone PTMs, or lncRNAs. Locus-specific ChIP-MS methods would allow direct purification of a single genomic locus and the proteins associated with it. There, loci can be targeted either by artificial DNA-binding sites and corresponding binding proteins or via proteins with sequence specificity such as TAL or nuclease deficient Cas9 in combination with a specific guide RNA. We predict that advances in MS technology will soon make such approaches generally applicable tools in epigenetics. PMID:27402878
González-Thuillier, Irene; Venegas-Calerón, Mónica; Sánchez, Rosario; Garcés, Rafael; von Wettstein-Knowles, Penny; Martínez-Force, Enrique
2016-02-01
Two sunflower hydroxyacyl-[acyl carrier protein] dehydratases evolved into two different isoenzymes showing distinctive expression levels and kinetic efficiencies. β-Hydroxyacyl-[acyl carrier protein (ACP)]-dehydratase (HAD) is a component of the type II fatty acid synthase complex involved in 'de novo' fatty acid biosynthesis in plants. This complex, formed by four intraplastidial proteins, is responsible for the sequential condensation of two-carbon units, leading to 16- and 18-C acyl-ACP. HAD dehydrates 3-hydroxyacyl-ACP generating trans-2-enoyl-ACP. With the aim of a further understanding of fatty acid biosynthesis in sunflower (Helianthus annuus) seeds, two β-hydroxyacyl-[ACP] dehydratase genes have been cloned from developing seeds, HaHAD1 (GenBank HM044767) and HaHAD2 (GenBank GU595454). Genomic DNA gel blot analyses suggest that both are single copy genes. Differences in their expression patterns across plant tissues were detected. Higher levels of HaHAD2 in the initial stages of seed development suggest a key role in seed storage fatty acid synthesis. That HaHAD1 expression levels remained constant across most tissues suggests a housekeeping function. Heterologous expression of these genes in E. coli confirmed both proteins were functional and able to interact with the bacterial complex 'in vivo'. The large increase of saturated fatty acids in cells expressing HaHAD1 and HaHAD2 supports the idea that these HAD genes are closely related to the E. coli FabZ gene. The proposed three-dimensional models of HaHAD1 and HaHAD2 revealed differences at the entrance to the catalytic tunnel attributable to Phe166/Val1159, respectively. HaHAD1 F166V was generated to study the function of this residue. The 'in vitro' enzymatic characterization of the three HAD proteins demonstrated all were active, with the mutant having Km and Vmax values intermediate between those of the wild-type proteins.
Yi, Zhenzhen; Song, Weibo; Clamp, John C; Chen, Zigui; Gao, Shan; Zhang, Qianqian
2009-03-01
Comprehensive molecular analyses of phylogenetic relationships within euplotid ciliates are relatively rare, and the relationships among some families remain questionable. We performed phylogenetic analyses of the order Euplotida based on new sequences of the gene coding for small-subunit rRNA (SSrRNA) from a variety of taxa across the entire order as well as sequences from some of these taxa of other genes (ITS1-5.8S-ITS2 region and histone H4) that have not been included in previous analyses. Phylogenetic trees based on SSrRNA gene sequences constructed with four different methods had a consistent branching pattern that included the following features: (1) the "typical" euplotids comprised a paraphyletic assemblage composed of two divergent clades (family Uronychiidae and families Euplotidae-Certesiidae-Aspidiscidae-Gastrocirrhidae), (2) in the family Uronychiidae, the genera Uronychia and Paradiophrys formed a clearly outlined, well-supported clade that seemed to be rather divergent from Diophrys and Diophryopsis, suggesting that the Diophrys-complex may have had a longer and more separate evolutionary history than previously supposed, (3) inclusion of 12 new SSrRNA sequences in analyses of Euplotidae revealed two new clades of species within the family and cast additional doubt on the present classification of genera within the family, and (4) the intraspecific divergence among five species of Aspidisca was far greater than those of closely related genera. The ITS1-5.8S-ITS2 coding regions and partial histone H4 genes of six morphospecies in the Diophrys-complex were sequenced along with their SSrRNA genes and used to compare phylogenies constructed from single data sets to those constructed from combined sets. Results indicated that combined analyses could be used to construct more reliable, less ambiguous phylogenies of complex groups like the order Euplotida, because they provide a greater amount and diversity of information.
Physiogenomic analysis of localized FMRI brain activity in schizophrenia.
Windemuth, Andreas; Calhoun, Vince D; Pearlson, Godfrey D; Kocherla, Mohan; Jagannathan, Kanchana; Ruaño, Gualberto
2008-06-01
The search for genetic factors associated with disease is complicated by the complexity of the biological pathways linking genotype and phenotype. This analytical complexity is particularly concerning in diseases historically lacking reliable diagnostic biological markers, such as schizophrenia and other mental disorders. We investigate the use of functional magnetic resonance imaging (fMRI) as an intermediate phenotype (endophenotype) to identify physiogenomic associations with schizophrenia. We screened 99 subjects (30 diagnosed with schizophrenia, 13 unaffected relatives of schizophrenia patients, and 56 unrelated controls) for gene polymorphisms associated with fMRI activation patterns at two locations in temporal and frontal lobes previously implicated in schizophrenia. A total of 22 single nucleotide polymorphisms (SNPs) in 15 genes from the dopamine and serotonin neurotransmission pathways were genotyped in all subjects. We identified three SNPs in genes that are significantly associated with fMRI activity. SNPs of the dopamine beta-hydroxylase (DBH) gene and of the dopamine receptor D4 (DRD4) were associated with activity in the temporal and frontal lobes, respectively. One SNP of serotonin-3A receptor (HTR3A) was associated with temporal lobe activity. The results of this study support the physiogenomic analysis of neuroimaging data to discover associations between genotype and disease-related phenotypes.
Yu, Xuefei; Zheng, Wei; Bhat, Somanath; Aquilina, J. Andrew
2015-01-01
Bacillus sp. CDB3 possesses a novel eight-gene ars cluster (ars1, arsRYCDATorf7orf8) with some unusual features in regard to expression regulation. This study demonstrated that the cluster is a single operon but can also produce a short three-gene arsRYC transcript. A hairpin structure formed by internal inverted repeats between arsC and arsD was shown to diminish the expression of the full operon, thereby probably acting as a transcription attenuator. A degradation product of the arsRYC transcript was also identified. Electrophoretic mobility shift analysis demonstrated that ArsR interacts with the ars1 promoter forming a protein-DNA complex that could be impaired by arsenite. However, no interaction was detected between ArsD and the ars1 promoter, suggesting that the CDB3 ArsD protein may not play a regulatory role. Compared to other ars gene clusters, regulation of the Bacillus sp. CDB3 ars1 operon is more complex. It represents another example of specific mRNA degradation in the transporter gene region and possibly the first case of attenuator-mediated regulation of ars operons. PMID:26355338
Lamiquiz-Moneo, Itziar; Pérez-Ruiz, María Rosario; Jarauta, Estíbaliz; Tejedor, María Teresa; Bea, Ana M; Mateo-Gallego, Rocío; Pérez-Calahorra, Sofía; Baila-Rueda, Lucía; Marco-Benedí, Victoria; de Castro-Orós, Isabel; Cenarro, Ana; Civeira, Fernando
2018-05-01
Approximately 20% to 40% of clinically defined familial hypercholesterolemia cases do not show a causative mutation in candidate genes, and some of them may have a polygenic origin. A cholesterol gene risk score for the diagnosis of polygenic hypercholesterolemia has been demonstrated to be valuable to differentiate polygenic and monogenic hypercholesterolemia. The aim of this study was to determine the contribution to low-density lipoprotein cholesterol (LDL-C) of the single nucleotide variants associated with polygenic hypercholesterolemia in probands with genetic hypercholesterolemia without mutations in candidate genes (nonfamilial hypercholesterolemia genetic hypercholesterolemia) and the genetic score in cascade screening in their family members. We recruited 49 nonfamilial hypercholesterolemia genetic hypercholesterolemia families (294 participants) and calculated cholesterol gene scores, derived from single nucleotide variants in SORT1, APOB, ABCG8, APOE and LDLR and lipoprotein(a) plasma concentration. Risk alleles in SORT1, ABCG8, APOE, and LDLR showed a statistically significantly higher frequency in blood relatives than in the 1000 Genomes Project. However, there were no differences between affected and nonaffected members. The contribution of the cholesterol gene score to LDL-C was significantly higher in affected than in nonaffected participants (P = .048). The percentage of the LDL-C variation explained by the score was 3.1%, and this percentage increased to 6.9% in those families with the highest genetic score in the proband. Nonfamilial hypercholesterolemia genetic hypercholesterolemia families concentrate risk alleles for high LDL-C. Their contribution varies greatly among families, indicating the complexity and heterogeneity of these forms of hypercholesterolemias. The gene score explains a small percentage of LDL-C, which limits its use in diagnosis. Copyright © 2017 Sociedad Española de Cardiología. Published by Elsevier España, S.L.U. 
All rights reserved.
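A gene score of the kind described above is, in general, a weighted count of risk alleles. The following Python sketch illustrates that idea only: the per-allele weights are hypothetical placeholders (the abstract does not report them), genotypes are assumed to be coded as 0, 1, or 2 copies of the risk allele, and genes missing from the input are treated as carrying no risk alleles.

```python
# Hypothetical per-allele weights, for illustration only.
# The abstract lists the genes but does not report the actual weights.
WEIGHTS = {"SORT1": 0.10, "APOB": 0.05, "ABCG8": 0.07, "APOE": 0.20, "LDLR": 0.08}

def gene_score(genotype):
    """Weighted sum of risk-allele counts (0, 1 or 2 copies per gene)."""
    score = 0.0
    for gene, weight in WEIGHTS.items():
        count = genotype.get(gene, 0)  # assumption: missing gene = 0 risk alleles
        if count not in (0, 1, 2):
            raise ValueError(f"invalid allele count for {gene}: {count}")
        score += weight * count
    return score

# Example: two SORT1 risk alleles and one APOE risk allele.
example = gene_score({"SORT1": 2, "APOE": 1})
```

With real weights, a score like this could be compared between affected and nonaffected relatives, which is the comparison the abstract reports.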
Vojinovic, Dina; Brison, Nathalie; Ahmad, Shahzad; Noens, Ilse; Pappa, Irene; Karssen, Lennart C; Tiemeier, Henning; van Duijn, Cornelia M; Peeters, Hilde; Amin, Najaf
2017-08-01
Autism spectrum disorder (ASD) is a highly heritable neurodevelopmental disorder with a complex genetic architecture. To identify genetic variants underlying ASD, we performed single-variant and gene-based genome-wide association studies using a dense genotyping array containing over 2.3 million single-nucleotide variants in a discovery sample of 160 families with at least one child affected with non-syndromic ASD using a binary (ASD yes/no) phenotype and a quantitative autistic trait. Replication of the top findings was performed in Psychiatric Genomics Consortium and Erasmus Rucphen Family (ERF) cohort study. Significant association of quantitative autistic trait was observed with the TTC25 gene at 17q21.2 (effect size=10.2, P-value=3.4 × 10⁻⁷) in the gene-based analysis. The gene also showed nominally significant association in the cohort-based ERF study (effect=1.75, P-value=0.05). Meta-analysis of discovery and replication improved the association signal (meta-analysis P-value=1.5 × 10⁻⁸). No genome-wide significant signal was observed in the single-variant analysis of either the binary ASD phenotype or the quantitative autistic trait. Our study has identified a novel gene TTC25 to be associated with quantitative autistic trait in patients with ASD. The replication of association in a cohort-based study and the effect estimate suggest that variants in TTC25 may also be relevant for broader ASD phenotype in the general population. TTC25 is overexpressed in frontal cortex and testis and is known to be involved in cilium movement and thus an interesting candidate gene for autistic trait.
Diffusion dynamics of the Keap1–Cullin3 interaction in single live cells
DOE Office of Scientific and Technical Information (OSTI.GOV)
Baird, Liam; Dinkova-Kostova, Albena T., E-mail: a.dinkovakostova@dundee.ac.uk; Department of Pharmacology and Molecular Sciences, Johns Hopkins University School of Medicine, Baltimore, MD
2013-03-29
Highlights: ► We developed a quantitative FRAP-based system to study the Keap1–Cul3 interaction. ► We show that Keap1–EGFP and mCherry–Cul3 interact in single live cells. ► We used inducers which target distinct cysteine sensors of Keap1 and differ 4000-fold in potency. ► Inducers cause Nrf2 stabilization, nuclear translocation, and target gene expression. ► Inducers of four different types do not dissociate the Keap1–EGFP:mCherry–Cul3 complex. -- Abstract: Transcription factor NF-E2 p45-related factor 2 (Nrf2) regulates the expression of a network of genes encoding drug-detoxification, anti-inflammatory, and metabolic enzymes, as well as proteins involved in the regulation of cellular redox homeostasis. Under basal conditions, Kelch-like ECH associated protein 1 (Keap1) targets Nrf2 for ubiquitination and proteasomal degradation via association with Cullin3 (Cul3)-based Rbx1 E3 ubiquitin ligase. Various small molecules (inducers) activate Nrf2 leading to upregulation of cytoprotective gene expression. Inducers chemically modify specific cysteine residues of Keap1 which ultimately loses its ability to target Nrf2 for degradation. Dissociation of the Keap1–Cul3 complex by inducers is one possible mechanism, but evidence in single live cells is lacking. To investigate the diffusion dynamics of the Keap1–Cul3 interaction and the effect of inducers, we developed a quantitative fluorescence recovery after photobleaching (FRAP)-based system using Keap1–EGFP and mCherry–Cul3 fusion proteins. We show that Keap1–EGFP and mCherry–Cul3 interact in single live cells. Exposure for 1 h to small-molecule inducers of 4 different types, the oleanane triterpenoid CDDO, the isothiocyanate sulforaphane, the sulfoxythiocarbamate STCA, and the oxidant hydrogen peroxide which target distinct cysteine sensors within Keap1 with potencies which differ by nearly 4000-fold, does not dissociate the Keap1–Cul3 complex. 
As inducers cause conformational changes in Keap1, we conclude that changes in conformation rather than dissociation from Cul3 inactivate the repressor function of Keap1 leading to Nrf2 stabilization.
Ranade, Sonali Sachin; García-Gil, María Rosario; Rosselló, Josep A
2016-04-01
Many genes have been lost from the prokaryote plastidial genome during the early events of endosymbiosis in eukaryotes. Some of them were definitively lost, but others were relocated and functionally integrated into the host nuclear genomes through serial events of gene transfer during plant evolution. In gymnosperms, plastid genome sequencing has revealed the loss of ndh genes from several species of Gnetales and Pinaceae, including Norway spruce (Picea abies). This study aims to trace the ndh genes in the nuclear and organellar Norway spruce genomes. The plastid genomes of higher plants contain 11 ndh genes which are homologues of mitochondrial genes encoding subunits of the proton-pumping NADH-dehydrogenase (nicotinamide adenine dinucleotide dehydrogenase) or complex I (electron transport chain). Ndh genes encode 11 NDH polypeptides forming the Ndh complex (analogous to complex I) which seems to be primarily involved in chloro-respiration processes. We considered ndh genes from the plastidial genome of four gymnosperms (Cryptomeria japonica, Cycas revoluta, Ginkgo biloba, Podocarpus totara) and a single angiosperm species (Arabidopsis thaliana) to trace putative homologs in the nuclear and organellar Norway spruce genomes using tBLASTn to assess the evolutionary fate of ndh genes in Norway spruce and to address their genomic location(s), structure, integrity and functionality. The results obtained from tBLASTn were subsequently analyzed by searching for ndh-specific conserved domains using conserved domain search. We report the presence of non-functional plastid ndh gene fragments, excepting ndhE and ndhG genes, in the nuclear genome of Norway spruce. Regulatory transcriptional elements like promoters, TATA boxes and enhancers were detected in the upstream regions of some ndh fragments. We also found transposable elements in the flanking regions of a few ndh fragments, suggesting nuclear rearrangements in those regions. 
This evidence supports the hypothesis that, at least in Picea, ndh translocations from the plastid to the nuclear genome have occurred, and that there might have been a functional machinery at some time during evolution to accommodate them within a nuclear-encoded environment, or attempts to form it.
Greenberg, David A; Zhang, Junying; Shmulewitz, Dvora; Strug, Lisa J; Zimmerman, Regina; Singh, Veena; Marathe, Sudhir
2005-12-30
The Genetic Analysis Workshop 14 simulated dataset was designed 1) To test the ability to find genes related to a complex disease (such as alcoholism). Such a disease may be given a variety of definitions by different investigators, have associated endophenotypes that are common in the general population, and is likely to be not one disease but a heterogeneous collection of clinically similar, but genetically distinct, entities. 2) To observe the effect on genetic analysis and gene discovery of a complex set of gene x gene interactions. 3) To allow comparison of microsatellite vs. large-scale single-nucleotide polymorphism (SNP) data. 4) To allow testing of association to identify the disease gene and the effect of moderate marker x marker linkage disequilibrium. 5) To observe the effect of different ascertainment/disease definition schemes on the analysis. Data was distributed in two forms. Data distributed to participants contained about 1,000 SNPs and 400 microsatellite markers. Internet-obtainable data consisted of a finer 10,000 SNP map, which also contained data on controls. While disease characteristics and parameters were constant, four "studies" used varying ascertainment schemes based on differing beliefs about disease characteristics. One of the studies contained multiplex two- and three-generation pedigrees with at least four affected members. The simulated disease was a psychiatric condition with many associated behaviors (endophenotypes), almost all of which were genetic in origin. The underlying disease model contained four major genes and two modifier genes. The four major genes interacted with each other to produce three different phenotypes, which were themselves heterogeneous. The population parameters were calibrated so that the major genes could be discovered by linkage analysis in most datasets. The association evidence was more difficult to calibrate but was designed to find statistically significant association in 50% of datasets. 
We also simulated some marker x marker linkage disequilibrium around some of the genes and also in areas without disease genes. We tried two different methods to simulate the linkage disequilibrium.
Wongabel rhabdovirus accessory protein U3 targets the SWI/SNF chromatin remodeling complex.
Joubert, D Albert; Rodriguez-Andres, Julio; Monaghan, Paul; Cummins, Michelle; McKinstry, William J; Paradkar, Prasad N; Moseley, Gregory W; Walker, Peter J
2015-01-15
Wongabel virus (WONV) is an arthropod-borne rhabdovirus that infects birds. It is one of the growing array of rhabdoviruses with complex genomes that encode multiple accessory proteins of unknown function. In addition to the five canonical rhabdovirus structural protein genes (N, P, M, G, and L), the 13.2-kb negative-sense single-stranded RNA (ssRNA) WONV genome contains five uncharacterized accessory genes, one overlapping the N gene (Nx or U4), three located between the P and M genes (U1 to U3), and a fifth one overlapping the G gene (Gx or U5). Here we show that WONV U3 is expressed during infection in insect and mammalian cells and is required for efficient viral replication. A yeast two-hybrid screen against a mosquito cell cDNA library identified that WONV U3 interacts with the 83-amino-acid (aa) C-terminal domain of SNF5, a component of the SWI/SNF chromatin remodeling complex. The interaction was confirmed by affinity chromatography, and nuclear colocalization was established by confocal microscopy. Gene expression studies showed that SNF5 transcripts are upregulated during infection of mosquito cells with WONV, as well as West Nile virus (Flaviviridae) and bovine ephemeral fever virus (Rhabdoviridae), and that SNF5 knockdown results in increased WONV replication. WONV U3 also inhibits SNF5-regulated expression of the cytokine gene CSF1. The data suggest that WONV U3 targets the SWI/SNF complex to block the host response to infection. The rhabdoviruses comprise a large family of RNA viruses infecting plants, vertebrates, and invertebrates. In addition to the major structural proteins (N, P, M, G, and L), many rhabdoviruses encode a diverse array of accessory proteins of largely unknown function. Understanding the role of these proteins may reveal much about host-pathogen interactions in infected cells. Here we examine accessory protein U3 of Wongabel virus, an arthropod-borne rhabdovirus that infects birds. 
We show that U3 enters the nucleus and interacts with SNF5, a component of the chromatin remodeling complex that is upregulated in response to infection and restricts viral replication. We also show that U3 inhibits SNF5-regulated expression of the cytokine colony-stimulating factor 1 (CSF1), suggesting that it targets the chromatin remodeling complex to block the host response to infection. This study appears to provide the first evidence of a virus targeting SNF5 to inhibit host gene expression. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Wongabel Rhabdovirus Accessory Protein U3 Targets the SWI/SNF Chromatin Remodeling Complex
Joubert, D. Albert; Rodriguez-Andres, Julio; Monaghan, Paul; Cummins, Michelle; McKinstry, William J.; Paradkar, Prasad N.; Moseley, Gregory W.
2014-01-01
ABSTRACT Wongabel virus (WONV) is an arthropod-borne rhabdovirus that infects birds. It is one of the growing array of rhabdoviruses with complex genomes that encode multiple accessory proteins of unknown function. In addition to the five canonical rhabdovirus structural protein genes (N, P, M, G, and L), the 13.2-kb negative-sense single-stranded RNA (ssRNA) WONV genome contains five uncharacterized accessory genes, one overlapping the N gene (Nx or U4), three located between the P and M genes (U1 to U3), and a fifth one overlapping the G gene (Gx or U5). Here we show that WONV U3 is expressed during infection in insect and mammalian cells and is required for efficient viral replication. A yeast two-hybrid screen against a mosquito cell cDNA library identified that WONV U3 interacts with the 83-amino-acid (aa) C-terminal domain of SNF5, a component of the SWI/SNF chromatin remodeling complex. The interaction was confirmed by affinity chromatography, and nuclear colocalization was established by confocal microscopy. Gene expression studies showed that SNF5 transcripts are upregulated during infection of mosquito cells with WONV, as well as West Nile virus (Flaviviridae) and bovine ephemeral fever virus (Rhabdoviridae), and that SNF5 knockdown results in increased WONV replication. WONV U3 also inhibits SNF5-regulated expression of the cytokine gene CSF1. The data suggest that WONV U3 targets the SWI/SNF complex to block the host response to infection. IMPORTANCE The rhabdoviruses comprise a large family of RNA viruses infecting plants, vertebrates, and invertebrates. In addition to the major structural proteins (N, P, M, G, and L), many rhabdoviruses encode a diverse array of accessory proteins of largely unknown function. Understanding the role of these proteins may reveal much about host-pathogen interactions in infected cells. Here we examine accessory protein U3 of Wongabel virus, an arthropod-borne rhabdovirus that infects birds. 
We show that U3 enters the nucleus and interacts with SNF5, a component of the chromatin remodeling complex that is upregulated in response to infection and restricts viral replication. We also show that U3 inhibits SNF5-regulated expression of the cytokine colony-stimulating factor 1 (CSF1), suggesting that it targets the chromatin remodeling complex to block the host response to infection. This study appears to provide the first evidence of a virus targeting SNF5 to inhibit host gene expression. PMID:25392228
Co-expression networks reveal the tissue-specific regulation of transcription and splicing.
Saha, Ashis; Kim, Yungil; Gewirtz, Ariel D H; Jo, Brian; Gao, Chuan; McDowell, Ian C; Engelhardt, Barbara E; Battle, Alexis
2017-11-01
Gene co-expression networks capture biologically important patterns in gene expression data, enabling functional analyses of genes, discovery of biomarkers, and interpretation of genetic variants. Most network analyses to date have been limited to assessing correlation between total gene expression levels in a single tissue or small sets of tissues. Here, we built networks that additionally capture the regulation of relative isoform abundance and splicing, along with tissue-specific connections unique to each of a diverse set of tissues. We used the Genotype-Tissue Expression (GTEx) project v6 RNA sequencing data across 50 tissues and 449 individuals. First, we developed a framework called Transcriptome-Wide Networks (TWNs) for combining total expression and relative isoform levels into a single sparse network, capturing the interplay between the regulation of splicing and transcription. We built TWNs for 16 tissues and found that hubs in these networks were strongly enriched for splicing and RNA binding genes, demonstrating their utility in unraveling regulation of splicing in the human transcriptome. Next, we used a Bayesian biclustering model that identifies network edges unique to a single tissue to reconstruct Tissue-Specific Networks (TSNs) for 26 distinct tissues and 10 groups of related tissues. Finally, we found genetic variants associated with pairs of adjacent nodes in our networks, supporting the estimated network structures and identifying 20 genetic variants with distant regulatory impact on transcription and splicing. Our networks provide an improved understanding of the complex relationships of the human transcriptome across tissues. © 2017 Saha et al.; Published by Cold Spring Harbor Laboratory Press.
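The basic idea of a co-expression network, as opposed to the sparse Transcriptome-Wide Network framework the abstract describes, can be illustrated with a much simpler technique: connect two genes when the Pearson correlation of their expression profiles exceeds a cutoff. The Python sketch below uses that simplified correlation-threshold approach, not the authors' method; the gene names, expression values, and threshold are all made up for illustration.

```python
from itertools import combinations
from math import sqrt

def pearson(x, y):
    """Pearson correlation of two equal-length profiles (neither may be constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / sqrt(vx * vy)

def coexpression_edges(expr, threshold=0.9):
    """Return (gene1, gene2, r) for every gene pair with |r| >= threshold."""
    edges = []
    for g1, g2 in combinations(sorted(expr), 2):
        r = pearson(expr[g1], expr[g2])
        if abs(r) >= threshold:
            edges.append((g1, g2, round(r, 3)))
    return edges

# Toy expression matrix: gene -> expression across four samples (invented values).
toy = {
    "geneA": [1.0, 2.0, 3.0, 4.0],
    "geneB": [2.0, 4.0, 6.0, 8.0],
    "geneC": [4.0, 1.0, 3.0, 2.0],
}
edges = coexpression_edges(toy)
```

Here only geneA and geneB are linked. A hard threshold on pairwise correlation tends to produce dense networks on real data, which is one motivation for sparse estimation approaches such as the one described in the abstract.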
Pathway analyses and understanding disease associations
Liu, Yu; Chance, Mark R
2013-01-01
High throughput technologies have been applied to investigate the underlying mechanisms of complex diseases, identify disease associations, and help to improve treatment. However, it is challenging to derive biological insight from conventional single gene based analysis of “omics” data from high throughput experiments due to sample and patient heterogeneity. To address these challenges, many novel pathway and network based approaches were developed to integrate various “omics” data, such as gene expression, copy number alteration, Genome Wide Association Studies, and interaction data. This review will cover recent methodological developments in pathway analysis for the detection of dysregulated interactions and disease-associated subnetworks, prioritization of candidate disease genes, and disease classifications. For each application, we will also discuss the associated challenges and potential future directions. PMID:24319650
Tay, Wee Tek; Elfekih, Samia; Court, Leon N; Gordon, Karl H J; Delatte, Hélène; De Barro, Paul J
2017-10-01
Molecular species identification using suboptimal PCR primers can over-estimate species diversity due to coamplification of nuclear mitochondrial (NUMT) DNA/pseudogenes. For the agriculturally important whitefly Bemisia tabaci cryptic pest species complex, species identification depends primarily on characterization of the mitochondrial DNA cytochrome oxidase I (mtDNA COI) gene. The lack of robust PCR primers for the mtDNA COI gene can undermine correct species identification which in turn compromises management strategies. This problem is identified in the B. tabaci Africa/Middle East/Asia Minor clade which comprises the globally invasive Mediterranean (MED) and Middle East Asia Minor I (MEAM1) species, Middle East Asia Minor 2 (MEAM2), and the Indian Ocean (IO) species. Initially identified from the Indian Ocean island of Réunion, MEAM2 has since been reported from Japan, Peru, Turkey and Iraq. We identified MEAM2 individuals from a Peruvian population via Sanger sequencing of the mtDNA COI gene. In attempting to characterize the MEAM2 mitogenome, we instead characterized mitogenomes of MEAM1. We also report on the mitogenomes of MED, AUS, and IO thereby increasing genomic resources for members of this complex. Gene synteny (i.e., same gene composition and orientation) was observed with published B. tabaci cryptic species mitogenomes. Pseudogene fragments matching MEAM2 partial mtDNA COI gene exhibited low frequency single nucleotide polymorphisms that matched low copy number DNA fragments (<3%) of MEAM1 genomes, whereas presence of internal stop codons, loss of expected stop codons and poor primer annealing sites, all suggested MEAM2 as a pseudogene artifact and so not a real species. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
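One of the pseudogene signals mentioned above, the presence of internal stop codons, is straightforward to screen for. The Python sketch below is an illustrative check rather than the authors' pipeline: it assumes the invertebrate mitochondrial genetic code (NCBI translation table 5), in which TAA and TAG are stop codons and TGA encodes tryptophan, and it assumes the input is a complete coding sequence whose final codon is the expected stop. The example sequences are invented.

```python
# Stop codons under the invertebrate mitochondrial code (NCBI translation table 5).
# TGA is deliberately absent: it encodes tryptophan in this code.
STOP_CODONS = {"TAA", "TAG"}

def internal_stop_positions(seq, frame=0):
    """Return 0-based nucleotide positions of in-frame stop codons,
    ignoring the final complete codon of the reading frame."""
    seq = seq.upper()
    codons = [seq[i:i + 3] for i in range(frame, len(seq) - 2, 3)]
    return [frame + 3 * idx
            for idx, codon in enumerate(codons[:-1])
            if codon in STOP_CODONS]

# Invented sequences: the first has a premature TAA at position 3.
suspect = internal_stop_positions("ATGTAAGGATAA")
clean = internal_stop_positions("ATGTGAGGATAA")
```

A non-empty result flags a candidate NUMT or pseudogene for closer inspection; it is not proof on its own, since sequencing errors or a wrong reading frame produce the same pattern.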
Growth condition dependency is the major cause of non-responsiveness upon genetic perturbation
Amini, Saman; Holstege, Frank C. P.
2017-01-01
Investigating the role and interplay between individual proteins in biological processes is often performed by assessing the functional consequences of gene inactivation or removal. Depending on the sensitivity of the assay used for determining phenotype, between 66% (growth) and 53% (gene expression) of Saccharomyces cerevisiae gene deletion strains show no defect when analyzed under a single condition. Although it is well known that this non-responsive behavior is caused by different types of redundancy mechanisms or by growth condition/cell type dependency, it is not known what the relative contribution of these different causes is. Understanding the underlying causes of and their relative contribution to non-responsive behavior upon genetic perturbation is extremely important for designing efficient strategies aimed at elucidating gene function and unraveling complex cellular systems. Here, we provide a systematic classification of the underlying causes of and their relative contribution to non-responsive behavior upon gene deletion. The overall contribution of redundancy to non-responsive behavior is estimated at 29%, of which approximately 17% is due to homology-based redundancy and 12% is due to pathway-based redundancy. The major determinant of non-responsiveness is condition dependency (71%). For approximately 14% of protein complexes, just-in-time assembly can be put forward as a potential mechanistic explanation for how proteins can be regulated in a condition dependent manner. Taken together, the results underscore the large contribution of growth condition requirement to non-responsive behavior, which needs to be taken into account for strategies aimed at determining gene function. The classification provided here can also be further harnessed in systematic analyses of complex cellular systems. PMID:28257504
Klumplerova, Marie; Vychodilova, Leona; Bobrova, Olga; Cvanova, Michaela; Futas, Jan; Janova, Eva; Vyskocil, Mirko; Vrtkova, Irena; Putnova, Lenka; Dusek, Ladislav; Marti, Eliane; Horin, Petr
2013-04-01
Insect bite hypersensitivity (IBH) is an allergic dermatitis of horses caused by bites of insects. IBH is a multifactorial disease with contribution of genetic and environmental factors. Candidate gene association analysis of IBH was performed in a group of 89 Icelandic horses all born in Iceland and imported to Europe. Horses were classified in IBH-affected and non-affected based on clinical signs and history of recurrent dermatitis, and on the results of an in vitro sulfidoleukotriene (sLT)-release assay with Culicoides nubeculosus and Simulium vittatum extract. Different genetic markers were tested for association with IBH by the Fisher's exact test. The effect of the major histocompatibility complex (MHC) gene region was studied by genotyping five microsatellites spanning the MHC region (COR112, COR113, COR114, UM011 and UMN-JH34-2), and exon 2 polymorphisms of the class II Eqca-DRA gene. Associations with Eqca-DRA and COR113 were identified (p < 0.05). In addition, a panel of 20 single nucleotide polymorphisms (SNPs) in 17 candidate allergy-related genes was tested. During the initial screen, no marker from the panel was significantly (p < 0.05) associated with IBH. Five SNPs associated with IBH at p < 0.10 were therefore used for analysis of combined genotypes. Out of them, SNPs located in the genes coding for the CD14 receptor (CD14), interleukin 23 receptor (IL23R), thymic stromal lymphopoietin (TSLP) and transforming growth factor beta 3 (TGFB3) molecules were associated with IBH as parts of complex genotypes. These results are supported by similar associations and by expression data from different horse populations and from human studies.
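The marker-by-marker association testing described above relies on Fisher's exact test. The following is a minimal sketch of a two-sided test on a 2x2 table (marker allele present/absent by IBH-affected/non-affected), written with the Python standard library only. The function name and the example counts are hypothetical and are not taken from the study.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact p-value for the 2x2 table [[a, b], [c, d]].

    Rows could be, e.g., affected / non-affected horses and columns
    carriers / non-carriers of a marker allele (illustrative layout only).
    """
    row1, row2 = a + b, c + d
    col1 = a + c
    total_tables = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x in the top-left cell,
        # with all row and column totals held fixed.
        return comb(row1, x) * comb(row2, col1 - x) / total_tables

    p_observed = prob(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    # Sum the probabilities of every table that is no more likely
    # than the observed one (small tolerance for float comparison).
    return sum(
        p
        for p in (prob(x) for x in range(lowest, highest + 1))
        if p <= p_observed * (1 + 1e-9)
    )


if __name__ == "__main__":
    # Hypothetical counts, for illustration only.
    print(fisher_exact_two_sided(3, 1, 1, 3))
```

An exact test is a natural choice for a cohort of this size (89 horses), because expected cell counts can be too small for a chi-square approximation to be reliable.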
2010-01-01
Background Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Results Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M. balbisiana haplotypes. Conclusions A large RGA08 gene cluster identified in wild banana corresponds to a highly variable genomic region between haplotypes surrounded by conserved flanking regions. High level of sequence identity (70 to 99%) of the genic and intergenic regions suggests a recent and rapid evolution of this cluster in M. balbisiana. PMID:20637079
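The divergence-time estimate from the synonymous substitution rate can be illustrated with the standard molecular-clock relation T = Ks / (2r), where Ks is the number of synonymous substitutions per synonymous site between the two haplotypes and r is the substitution rate per site per year. The sketch below is a generic illustration under that assumption. The abstract reports neither Ks nor r, so both input values are hypothetical placeholders chosen only to land on the scale of one million years.

```python
def divergence_time_years(ks, rate_per_site_per_year):
    """Molecular-clock divergence time T = Ks / (2 * r).

    The factor 2 accounts for substitutions accumulating independently
    along both lineages since they split.
    """
    if rate_per_site_per_year <= 0:
        raise ValueError("substitution rate must be positive")
    return ks / (2.0 * rate_per_site_per_year)


if __name__ == "__main__":
    # Hypothetical inputs, not values reported in the abstract.
    example_ks = 0.009
    example_rate = 4.5e-9
    print(divergence_time_years(example_ks, example_rate))
```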
Jones, Kaylie D; Wheaton, Dianna K; Bowne, Sara J; Sullivan, Lori S; Birch, David G; Chen, Rui; Daiger, Stephen P
2017-01-01
With recent availability of next-generation sequencing (NGS), it is becoming more common to pursue disease-targeted panel testing rather than traditional sequential gene-by-gene dideoxy sequencing. In this report, we describe using NGS to identify multiple disease-causing mutations that contribute concurrently or independently to retinal dystrophy in three relatively small families. Family members underwent comprehensive visual function evaluations, and genetic counseling including a detailed family history. A preliminary genetic inheritance pattern was assigned and updated as additional family members were tested. Family 1 (FAM1) and Family 2 (FAM2) were clinically diagnosed with retinitis pigmentosa (RP) and had a suspected autosomal dominant pedigree with non-penetrance (n.p.). Family 3 (FAM3) consisted of a large family with a diagnosis of RP and an overall dominant pedigree, but the proband had phenotypically cone-rod dystrophy. Initial genetic analysis was performed on one family member with traditional Sanger single gene sequencing and/or panel-based testing, and ultimately, retinal gene-targeted NGS was required to identify the underlying cause of disease for individuals within the three families. Results obtained in these families necessitated further genetic and clinical testing of additional family members to determine the complex genetic and phenotypic etiology of each family. Genetic testing of FAM1 (n = 4 affected; 1 n.p.) identified a dominant mutation in RP1 (p.Arg677Ter) that was present for two of the four affected individuals but absent in the proband and the presumed non-penetrant individual. Retinal gene-targeted NGS in the fourth affected family member revealed compound heterozygous mutations in USH2A (p.Cys419Phe, p.Glu767Serfs*21). Genetic testing of FAM2 (n = 3 affected; 1 n.p.) identified three retinal dystrophy genes (PRPH2, PRPF8, and USH2A) with disease-causing mutations in varying combinations among the affected family members.
Genetic testing of FAM3 (n = 7 affected) identified a mutation in PRPH2 (p.Pro216Leu) tracking with disease in six of the seven affected individuals. Additional retinal gene-targeted NGS testing determined that the proband also harbored a multiple exon deletion in the CRX gene likely accounting for her cone-rod phenotype; her son harbored only the mutation in CRX, not the familial mutation in PRPH2. Multiple genes contributing to the retinal dystrophy genotypes within a family were discovered using retinal gene-targeted NGS. Families with noted examples of phenotypic variation or apparent non-penetrant individuals may offer a clue to suspect complex inheritance. Furthermore, this finding underscores that caution should be taken when attributing a single gene disease-causing mutation (or inheritance pattern) to a family as a whole. Identification of a disease-causing mutation in a proband, even with a clear inheritance pattern in hand, may not be sufficient for targeted, known mutation analysis in other family members.
Gene editing for cell engineering: trends and applications.
Gupta, Sanjeev K; Shukla, Pratyoosh
2017-08-01
Gene editing has made it easy to manipulate various production hosts, thanks to the discovery and implementation of modern gene editing tools such as CRISPR (clustered regularly interspaced short palindromic repeats), TALENs (transcription activator-like effector nucleases) and ZFNs (zinc finger nucleases). With the advent of these tools, it is now possible to manipulate the genomes of industrial production hosts such as yeast and mammalian cells, which allows the development of cost-effective recombinant therapeutic proteins. These tools also allow one or multiple genes to be knocked in or knocked out of a host genome quickly and efficiently. A recent study on "multiplexed" gene editing revolutionized knock-out and knock-in events in the genomes of yeast and CHO mammalian cells for metabolic engineering as well as for high, stable, and consistent expression of a transgene encoding a complex therapeutic protein such as a monoclonal antibody. The gene of interest can be either integrated or deleted at single or multiple loci depending on the strategy and production requirement. This review gives a brief description of the modern tools and of advances in genetic manipulation using the three major tools being implemented for the modification of such hosts, with emphasis on the use of CRISPR-Cas9 for the "multiplexing gene-editing approach" in yeast and CHO mammalian hosts, which ultimately leads to fast-track product development with consistent, improved product yield and quality, and thus affordability for the population at large.
Single cell RNA Seq reveals dynamic paracrine control of cellular variation
Shalek, Alex K.; Satija, Rahul; Shuga, Joe; Trombetta, John J.; Gennert, Dave; Lu, Diana; Chen, Peilin; Gertner, Rona S.; Gaublomme, Jellert T.; Yosef, Nir; Schwartz, Schraga; Fowler, Brian; Weaver, Suzanne; Wang, Jing; Wang, Xiaohui; Ding, Ruihua; Raychowdhury, Raktima; Friedman, Nir; Hacohen, Nir; Park, Hongkun; May, Andrew P.; Regev, Aviv
2014-01-01
High-throughput single-cell transcriptomics offers an unbiased approach for understanding the extent, basis, and function of gene expression variation between seemingly identical cells. Here, we sequence single-cell RNA-Seq libraries prepared from over 1,700 primary mouse bone marrow derived dendritic cells (DCs) spanning several experimental conditions. We find substantial variation between identically stimulated DCs, in both the fraction of cells detectably expressing a given mRNA and the transcript’s level within expressing cells. Distinct gene modules are characterized by different temporal heterogeneity profiles. In particular, a “core” module of antiviral genes is expressed very early by a few “precocious” cells, but is later activated in all cells. By stimulating cells individually in sealed microfluidic chambers, analyzing DCs from knockout mice, and modulating secretion and extracellular signaling, we show that this response is coordinated via interferon-mediated paracrine signaling. Surprisingly, preventing cell-to-cell communication also substantially reduces variability in the expression of an early-induced “peaked” inflammatory module, suggesting that paracrine signaling additionally represses part of the inflammatory program. Our study highlights the importance of cell-to-cell communication in controlling cellular heterogeneity and reveals general strategies that multicellular populations use to establish complex dynamic responses. PMID:24919153
Caoduro, Cécile; Hervouet, Eric; Girard-Thernier, Corine; Gharbi, Tijani; Boulahdour, Hatem; Delage-Mourroux, Régis; Pudlo, Marc
2017-02-01
Carbon nanotubes represent promising transporters for delivery of DNA and other biomolecules into living cells. Various methods of CNTs surface functionalization have been developed. These are essential to improve CNTs dispersibility and permit their interactions with biological structures that broaden their use in advanced biomedical applications. The present review discusses the different single walled carbon nanotubes and multiwalled carbon nanotubes functionalization methods, leading to the formation of optimized and functionalized-CNT complexes with DNA. F-CNTs are recognized as efficient and promising gene carriers. Emphasis is then placed on the processes used by f-CNTs/DNA complexes to cross cell membranes. Energy independent pathways and uptake mechanisms dependent on energy, such as endocytosis or phagocytosis, are reported by many studies, and if these mechanisms seem contradictory at first sight, a detailed review of the literature illustrates that they are rather complementary. Preferential use of one or the other depends on the DNA and CNTs chemical nature and physical parameters, experimental procedures and cell types. Efficient non-viral gene delivery is desirable, yet challenging. CNTs appear as a promising solution to penetrate into cells and successfully deliver DNA. Moreover, the field of use of CNTs as gene carrier is large and is currently growing. This critical review summarizes the development and evaluation of CNTs as intracellular gene delivery system and provides an overview of functionalized CNTs/DNA cellular uptake mechanisms, depending on several parameters of CNTs/DNA complexes. Copyright © 2016 Acta Materialia Inc. Published by Elsevier Ltd. All rights reserved.
Effect of misspecification of gene frequency on the two-point LOD score.
Pal, D K; Durner, M; Greenberg, D A
2001-11-01
In this study, we used computer simulation of simple and complex models to ask: (1) What is the penalty in evidence for linkage when the assumed gene frequency is far from the true gene frequency? (2) If the assumed model for gene frequency and inheritance are misspecified in the analysis, can this lead to a higher maximum LOD score than that obtained under the true parameters? Linkage data simulated under simple dominant, recessive, dominant and recessive with reduced penetrance, and additive models, were analysed assuming a single locus with both the correct and incorrect dominance model and assuming a range of different gene frequencies. We found that misspecifying the analysis gene frequency led to little penalty in maximum LOD score in all models examined, especially if the assumed gene frequency was lower than the generating one. Analysing linkage data assuming a gene frequency of the order of 0.01 for a dominant gene, and 0.1 for a recessive gene, appears to be a reasonable tactic in the majority of realistic situations because underestimating the gene frequency, even when the true gene frequency is high, leads to little penalty in the LOD score.
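For readers unfamiliar with the statistic under study, the two-point LOD score compares the likelihood of the data at a recombination fraction theta with the likelihood under no linkage (theta = 0.5). The sketch below covers only the simplest textbook case: phase-known, fully informative meioses, where the likelihood reduces to a binomial in the number of recombinants. It deliberately does not model disease gene frequency, penetrance, or dominance, which are the parameters the study varied, so it illustrates the statistic and does not reproduce the simulations. All function names are hypothetical.

```python
from math import log10


def _count_times_log10(count, p):
    """Return count * log10(p), treating 0 * log10(0) as 0."""
    if count == 0:
        return 0.0
    if p == 0.0:
        return float("-inf")
    return count * log10(p)


def two_point_lod(recombinants, meioses, theta):
    """LOD(theta) = log10[ L(theta) / L(0.5) ] for phase-known meioses."""
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie in [0, 0.5]")
    if not 0 <= recombinants <= meioses:
        raise ValueError("recombinants must lie in [0, meioses]")
    non_recombinants = meioses - recombinants
    log_l_theta = (_count_times_log10(recombinants, theta)
                   + _count_times_log10(non_recombinants, 1.0 - theta))
    log_l_null = meioses * log10(0.5)
    return log_l_theta - log_l_null


def max_lod(recombinants, meioses, thetas):
    """Return (maximum LOD, theta at which it occurs) over a grid."""
    return max((two_point_lod(recombinants, meioses, t), t) for t in thetas)


if __name__ == "__main__":
    grid = [i / 10 for i in range(6)]  # theta = 0.0, 0.1, ..., 0.5
    print(max_lod(1, 10, grid))
```

In a full parametric analysis the likelihood also depends on the assumed gene frequency and inheritance model, and the maximum LOD is taken over theta for each assumed model. That dependence is what the abstract evaluates.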
Mutations in the ABCA4 (ABCR) Gene Are the Major Cause of Autosomal Recessive Cone-Rod Dystrophy
Maugeri, Alessandra; Klevering, B. Jeroen; Rohrschneider, Klaus; Blankenagel, Anita; Brunner, Han G.; Deutman, August F.; Hoyng, Carel B.; Cremers, Frans P. M.
2000-01-01
The photoreceptor cell–specific ATP-binding cassette transporter gene (ABCA4; previously denoted “ABCR”) is mutated in most patients with autosomal recessive (AR) Stargardt disease (STGD1) or fundus flavimaculatus (FFM). In addition, a few cases with AR retinitis pigmentosa (RP) and AR cone-rod dystrophy (CRD) have been found to have ABCA4 mutations. To evaluate the importance of the ABCA4 gene as a cause of AR CRD, we selected 5 patients with AR CRD and 15 patients with isolated CRD, all from Germany and The Netherlands. Single-strand conformation–polymorphism analysis and sequencing revealed 19 ABCA4 mutations in 13 (65%) of 20 patients. In six patients, mutations were identified in both ABCA4 alleles; in seven patients, mutations were detected in one allele. One complex ABCA4 allele (L541P;A1038V) was found exclusively in German patients with CRD; one patient carried this complex allele homozygously, and five others were compound heterozygous. These findings suggest that mutations in the ABCA4 gene are the major cause of AR CRD. A primary role of the ABCA4 gene in STGD1/FFM and AR CRD, together with the gene's involvement in an as-yet-unknown proportion of cases with AR RP, strengthens the idea that mutations in the ABCA4 gene could be the most frequent cause of inherited retinal dystrophy in humans. PMID:10958761
Single-Cell Resolution of Temporal Gene Expression during Heart Development.
DeLaughter, Daniel M; Bick, Alexander G; Wakimoto, Hiroko; McKean, David; Gorham, Joshua M; Kathiriya, Irfan S; Hinson, John T; Homsy, Jason; Gray, Jesse; Pu, William; Bruneau, Benoit G; Seidman, J G; Seidman, Christine E
2016-11-21
Activation of complex molecular programs in specific cell lineages governs mammalian heart development, from a primordial linear tube to a four-chamber organ. To characterize lineage-specific, spatiotemporal developmental programs, we performed single-cell RNA sequencing of >1,200 murine cells isolated at seven time points spanning embryonic day 9.5 (primordial heart tube) to postnatal day 21 (mature heart). Using unbiased transcriptional data, we classified cardiomyocytes, endothelial cells, and fibroblast-enriched cells, thus identifying markers for temporal and chamber-specific developmental programs. By harnessing these datasets, we defined developmental ages of human and mouse pluripotent stem-cell-derived cardiomyocytes and characterized lineage-specific maturation defects in hearts of mice with heterozygous mutations in Nkx2.5 that cause human heart malformations. This spatiotemporal transcriptome analysis of heart development reveals lineage-specific gene programs underlying normal cardiac development and congenital heart disease. Copyright © 2016 Elsevier Inc. All rights reserved.
2007-02-01
antisense RNA for suppressing gene expression in nematode worms (Caenorhabditis elegans) 2. This was followed by the introduction of dsRNA into worms...When single-stranded antisense RNA and double-stranded RNA were introduced into worms, they found that dsRNA was more effective than either strand...RISC (RNA-induced silencing complex), which contains helicase activity that unwinds the two strands 3 of RNA molecules, allowing the antisense
Moretti, Stefano; van Leeuwen, Danitsja; Gmuender, Hans; Bonassi, Stefano; van Delft, Joost; Kleinjans, Jos; Patrone, Fioravante; Merlo, Domenico Franco
2008-01-01
Background In gene expression analysis, statistical tests for differential gene expression provide lists of candidate genes having, individually, a sufficiently low p-value. However, the interpretation of each single p-value within complex systems involving several interacting genes is problematic. In parallel, in the last sixty years, game theory has been applied to political and social problems to assess the power of interacting agents in forcing a decision and, more recently, to represent the relevance of genes in response to certain conditions. Results In this paper we introduce a Bootstrap procedure to test the null hypothesis that each gene has the same relevance between two conditions, where the relevance is represented by the Shapley value of a particular coalitional game defined on a microarray data-set. This method, which is called Comparative Analysis of Shapley value (shortly, CASh), is applied to data concerning the gene expression in children differentially exposed to air pollution. The results provided by CASh are compared with the results from a parametric statistical test for testing differential gene expression. Both lists of genes provided by CASh and t-test are informative enough to discriminate exposed subjects on the basis of their gene expression profiles. While many genes are selected in common by CASh and the parametric test, it turns out that the biological interpretation of the differences between these two selections is more interesting, suggesting a different interpretation of the main biological pathways in gene expression regulation for exposed individuals. A simulation study suggests that CASh offers more power than t-test for the detection of differential gene expression variability. Conclusion CASh is successfully applied to gene expression analysis of a data-set where the joint expression behavior of genes may be critical to characterize the expression response to air pollution. 
We demonstrate a synergistic effect between coalitional games and statistics that resulted in a selection of genes with a potential impact in the regulation of complex pathways. PMID:18764936
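The Shapley value at the core of CASh assigns each player (here, each gene) its average marginal contribution over all orders in which players can join a coalition. The sketch below computes exact Shapley values for a small toy game by enumerating orderings. It is not the microarray game or the Bootstrap test from the paper: the gene labels, the characteristic function `toy_value`, and the function names are hypothetical and chosen only to make the definition concrete.

```python
from itertools import permutations
from math import factorial


def shapley_values(players, value):
    """Exact Shapley values for a coalitional game.

    `value` maps a frozenset of players to the worth of that coalition.
    Enumerating all orderings is only feasible for a handful of players.
    """
    totals = {p: 0.0 for p in players}
    for order in permutations(players):
        coalition = frozenset()
        for p in order:
            enlarged = coalition | {p}
            # Marginal contribution of p when joining this coalition.
            totals[p] += value(enlarged) - value(coalition)
            coalition = enlarged
    n_orders = factorial(len(players))
    return {p: t / n_orders for p, t in totals.items()}


def toy_value(coalition):
    """Hypothetical game: a coalition 'explains' the condition (worth 1)
    only if it contains g1 together with at least one of g2 or g3."""
    return 1.0 if "g1" in coalition and ({"g2", "g3"} & coalition) else 0.0


if __name__ == "__main__":
    print(shapley_values(["g1", "g2", "g3"], toy_value))
```

In this toy game g1 receives the largest share because no winning coalition can form without it, which mirrors the idea of ranking genes by their relevance within interacting groups instead of by individual p-values.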
Mapping cis- and trans-regulatory effects across multiple tissues in twins
Grundberg, Elin; Small, Kerrin S.; Hedman, Åsa K.; Nica, Alexandra C.; Buil, Alfonso; Keildson, Sarah; Bell, Jordana T.; Yang, Tsun-Po; Meduri, Eshwar; Barrett, Amy; Nisbett, James; Sekowska, Magdalena; Wilk, Alicja; Shin, So-Youn; Glass, Daniel; Travers, Mary; Min, Josine L.; Ring, Sue; Ho, Karen; Thorleifsson, Gudmar; Kong, Augustine; Thorsteindottir, Unnur; Ainali, Chrysanthi; Dimas, Antigone S.; Hassanali, Neelam; Ingle, Catherine; Knowles, David; Krestyaninova, Maria; Lowe, Christopher E.; Di Meglio, Paola; Montgomery, Stephen B.; Parts, Leopold; Potter, Simon; Surdulescu, Gabriela; Tsaprouni, Loukia; Tsoka, Sophia; Bataille, Veronique; Durbin, Richard; Nestle, Frank O.; O’Rahilly, Stephen; Soranzo, Nicole; Lindgren, Cecilia M.; Zondervan, Krina T.; Ahmadi, Kourosh R.; Schadt, Eric E.; Stefansson, Kari; Smith, George Davey; McCarthy, Mark I.; Deloukas, Panos; Dermitzakis, Emmanouil T.; Spector, Tim D.
2013-01-01
Sequence-based variation in gene expression is a key driver of disease risk. Common variants regulating expression in cis have been mapped in many eQTL studies typically in single tissues from unrelated individuals. Here, we present a comprehensive analysis of gene expression across multiple tissues conducted in a large set of mono- and dizygotic twins that allows systematic dissection of genetic (cis and trans) and non-genetic effects on gene expression. Using identity-by-descent estimates, we show that at least 40% of the total heritable cis-effect on expression cannot be accounted for by common cis-variants, a finding which exposes the contribution of low frequency and rare regulatory variants with respect to both transcriptional regulation and complex trait susceptibility. We show that a substantial proportion of gene expression heritability is trans to the structural gene and identify several replicating trans-variants which act predominantly in a tissue-restricted manner and may regulate the transcription of many genes. PMID:22941192
Ventura, Marco; Kenny, John G; Zhang, Ziding; Fitzgerald, Gerald F; van Sinderen, Douwe
2005-09-01
The so-called clp genes, which encode components of the Clp proteolytic complex, are widespread among bacteria. The Bifidobacterium breve UCC 2003 genome contains a clpB gene with significant homology to predicted clpB genes from other members of the Actinobacteridae group. The heat- and osmotic-inducibility of the B. breve UCC 2003 clpB homologue was verified by slot-blot analysis, while Northern blot and primer extension analyses showed that the clpB gene is transcribed as a monocistronic unit with a single promoter. The role of a hspR homologue, known to control the regulation of clpB and dnaK gene expression in other high G+C content bacteria, was investigated by gel mobility shift assays. Moreover, the predicted 3D structure of HspR provides further insight into the binding mode of this protein to the clpB promoter region, and highlights the key amino acid residues believed to be involved in the protein-DNA interaction.
Pineda, Sandy S; Sollod, Brianna L; Wilson, David; Darling, Aaron; Sunagar, Kartik; Undheim, Eivind A B; Kely, Laurence; Antunes, Agostinho; Fry, Bryan G; King, Glenn F
2014-03-05
Spiders have evolved pharmacologically complex venoms that serve to rapidly subdue prey and deter predators. The major toxic factors in most spider venoms are small, disulfide-rich peptides. While there is abundant evidence that snake venoms evolved by recruitment of genes encoding normal body proteins followed by extensive gene duplication accompanied by explosive structural and functional diversification, the evolutionary trajectory of spider-venom peptides is less clear. Here we present evidence of a spider-toxin superfamily encoding a high degree of sequence and functional diversity that has evolved via accelerated duplication and diversification of a single ancestral gene. The peptides within this toxin superfamily are translated as prepropeptides that are posttranslationally processed to yield the mature toxin. The N-terminal signal sequence, as well as the protease recognition site at the junction of the propeptide and mature toxin are conserved, whereas the remainder of the propeptide and mature toxin sequences are variable. All toxin transcripts within this superfamily exhibit a striking cysteine codon bias. We show that different pharmacological classes of toxins within this peptide superfamily evolved under different evolutionary selection pressures. Overall, this study reinforces the hypothesis that spiders use a combinatorial peptide library strategy to evolve a complex cocktail of peptide toxins that target neuronal receptors and ion channels in prey and predators. We show that the ω-hexatoxins that target insect voltage-gated calcium channels evolved under the influence of positive Darwinian selection in an episodic fashion, whereas the κ-hexatoxins that target insect calcium-activated potassium channels appear to be under negative selection. A majority of the diversifying sites in the ω-hexatoxins are concentrated on the molecular surface of the toxins, thereby facilitating neofunctionalisation leading to new toxin pharmacology.
2013-01-01
Background Complex diseases are often difficult to diagnose, treat and study due to the multi-factorial nature of the underlying etiology. Large data sets are now widely available that can be used to define novel, mechanistically distinct disease subtypes (endotypes) in a completely data-driven manner. However, significant challenges exist with regard to how to segregate individuals into suitable subtypes of the disease and understand the distinct biological mechanisms of each when the goal is to maximize the discovery potential of these data sets. Results A multi-step decision tree-based method is described for defining endotypes based on gene expression, clinical covariates, and disease indicators using childhood asthma as a case study. We attempted to use alternative approaches such as the Student’s t-test, single data domain clustering and the Modk-prototypes algorithm, which incorporates multiple data domains into a single analysis, but none performed as well as the novel multi-step decision tree method. This new method gave the best segregation of asthmatics and non-asthmatics, and it provides easy access to all genes and clinical covariates that distinguish the groups. Conclusions The multi-step decision tree method described here will lead to better understanding of complex disease in general by allowing purely data-driven disease endotypes to facilitate the discovery of new mechanisms underlying these diseases. This application should be considered a complement to ongoing efforts to better define and diagnose known endotypes. When coupled with existing methods developed to determine the genetics of gene expression, these methods provide a mechanism for linking genetics and exposomics data and thereby accounting for both major determinants of disease. PMID:24188919
Blakely, Collin M.; Watkins, Thomas B.K.; Wu, Wei; Gini, Beatrice; Chabon, Jacob J.; McCoach, Caroline E.; McGranahan, Nicholas; Wilson, Gareth A.; Birkbak, Nicolai J.; Olivas, Victor R.; Rotow, Julia; Maynard, Ashley; Wang, Victoria; Gubens, Matthew A.; Banks, Kimberly C.; Lanman, Richard B.; Caulin, Aleah F.; John, John St.; Cordero, Anibal R.; Giannikopoulos, Petros; Simmons, Andrew D.; Mack, Philip C.; Gandara, David R.; Husain, Hatim; Doebele, Robert C.; Riess, Jonathan W.; Diehn, Maximilian; Swanton, Charles; Bivona, Trever G.
2017-01-01
A widespread approach to modern cancer therapy is to identify a single oncogenic driver gene and target its mutant protein product (e.g. EGFR inhibitor treatment in EGFR-mutant lung cancers). However, genetically-driven resistance to targeted therapy limits patient survival. Through genomic analysis of 1122 EGFR-mutant lung cancer cell-free DNA samples and whole exome analysis of seven longitudinally collected tumor samples from an EGFR-mutant lung cancer patient, we identify critical co-occurring oncogenic events present in most advanced-stage EGFR-mutant lung cancers. We define new pathways limiting EGFR inhibitor response, including WNT/β-catenin and cell cycle gene (e.g. CDK4, CDK6) alterations. Tumor genomic complexity increases with EGFR inhibitor treatment, and co-occurring alterations in CTNNB1 and PIK3CA exhibit non-redundant functions that cooperatively promote tumor metastasis or limit EGFR inhibitor response. This study challenges the prevailing single-gene driver oncogene view and links clinical outcomes to co-occurring genetic alterations in advanced-stage EGFR-mutant lung cancer patients. PMID:29106415
Characterization of a novel variant of Mycobacterium chimaera.
van Ingen, J; Hoefsloot, W; Buijtels, P C A M; Tortoli, E; Supply, P; Dekhuijzen, P N R; Boeree, M J; van Soolingen, D
2012-09-01
In this study, nonchromogenic mycobacteria were isolated from pulmonary samples of three patients in the Netherlands. All isolates had identical, unique 16S rRNA gene and 16S-23S ITS sequences, which were closely related to those of Mycobacterium chimaera and Mycobacterium marseillense. The biochemical features of the isolates differed slightly from those of M. chimaera, suggesting that the isolates may represent a possible separate species within the Mycobacterium avium complex (MAC). However, the cell-wall mycolic acid pattern, analysed by HPLC, and the partial sequences of the hsp65 and rpoB genes were identical to those of M. chimaera. We concluded that the isolates represent a novel variant of M. chimaera. The results of this analysis have led us to question the currently used methods of species definition for members of the genus Mycobacterium, which are based largely on 16S rRNA or rpoB gene sequencing. Definitions based on a single genetic target are likely to be insufficient. Genetic divergence, especially in the MAC, yields strains that cannot be confidently assigned to a specific species based on the analysis of a single genetic target.
Shite, Masato; Yamamura, Yoshimi; Hayashi, Toshimitsu; Kurosaki, Fumiya
2008-11-01
A homology-based cloning strategy yielded Sdga, a cDNA clone presumably encoding the alpha-subunit of the heterotrimeric guanosine 5'-triphosphate-binding protein complex, from leaf tissues of Scoparia dulcis. Phylogenetic tree analysis of G-protein alpha-subunits from various biological sources suggested that, unlike in animal cells, classification of Galpha-proteins into specific subfamilies is not applicable to the proteins from higher plants. Restriction digests of genomic DNA of S. dulcis showed a single hybridized signal in Southern blot analysis, suggesting that Sdga is the sole gene encoding the Galpha-subunit in this plant. The expression level of Sdga appeared to be maintained at an almost constant level after exposure of the leaves to methyl jasmonate, as analyzed by reverse-transcription polymerase chain reaction. These results suggest that Sdga plays roles in methyl jasmonate-induced responses of S. dulcis without a notable change in the transcriptional level.
Modular probes for enriching and detecting complex nucleic acid sequences
NASA Astrophysics Data System (ADS)
Wang, Juexiao Sherry; Yan, Yan Helen; Zhang, David Yu
2017-12-01
Complex DNA sequences are difficult to detect and profile, but are important contributors to human health and disease. Existing hybridization probes lack the capability to selectively bind and enrich hypervariable, long or repetitive sequences. Here, we present a generalized strategy for constructing modular hybridization probes (M-Probes) that overcomes these challenges. We demonstrate that M-Probes can tolerate sequence variations of up to 7 nt at prescribed positions while maintaining single nucleotide sensitivity at other positions. M-Probes are also shown to be capable of sequence-selectively binding a continuous DNA sequence of more than 500 nt. Furthermore, we show that M-Probes can detect genes with triplet repeats exceeding a programmed threshold. As a demonstration of this technology, we have developed a hybrid capture method to determine the exact triplet repeat expansion number in the Huntington's gene of genomic DNA using quantitative PCR.
From Biophysics to Evolutionary Genetics: Statistical Aspects of Gene Regulation
NASA Astrophysics Data System (ADS)
Lässig, Michael
Genomic functions often cannot be understood at the level of single genes but require the study of gene networks. This systems biology credo is nearly commonplace by now. Evidence comes from the comparative analysis of entire genomes: current estimates put, for example, the number of human genes at around 22,000, hardly more than the 14,000 of the fruit fly, and not even an order of magnitude higher than the 6,000 of baker's yeast. The complexity and diversity of higher animals, therefore, cannot be explained in terms of their gene numbers. If, however, a biological function requires the concerted action of several genes, and conversely, a gene takes part in several functional contexts, an organism may be defined less by its individual genes than by their interactions. The emerging picture of the genome as a strongly interacting system with many degrees of freedom brings new challenges for experiment and theory, many of which are of a statistical nature. And indeed, this picture continues to make the subject attractive to a growing number of statistical physicists.
Cystic fibrosis gene therapy: a mutation-independent treatment.
Griesenbach, Uta; Davies, Jane C; Alton, Eric
2016-11-01
Since cloning of the disease-causing gene 27 years ago, the development of cystic fibrosis (CF) gene therapy has been pursued. Here, we will summarize key findings with a particular focus on recent developments. Almost 3 decades of research have highlighted the complexity of lung gene transfer and have generated a body of data that has recently led to the completion of a large phase IIB study. This trial has, for the first time, shown that nonviral gene transfer can, albeit modestly, stabilize lung function in CF and provides the impetus for further development of more potent gene transfer agents. Lentiviral vectors, specifically pseudotyped to enable entry into airway epithelial cells, have most recently been developed. Persistent expression after a single dose and the ability to be administered repeatedly suggest that these viral vectors hold promise for the treatment of CF; a first-in-man clinical trial will shortly be initiated. Although the development of CF gene therapy has been slower than initially anticipated, recent progress has been encouraging and has renewed the interest of academics and industry in pursuing lung gene therapy.
Using a periclinal chimera to unravel layer-specific gene expression in plants.
Filippis, Ioannis; Lopez-Cobollo, Rosa; Abbott, James; Butcher, Sarah; Bishop, Gerard J
2013-09-01
Plant organs are made from multiple cell types, and defining the expression level of a gene in any one cell or group of cells from a complex mixture is difficult. Dicotyledonous plants normally have three distinct layers of cells, L1, L2 and L3. Layer L1 is the single layer of cells making up the epidermis, layer L2 the single cell sub-epidermal layer and layer L3 constitutes the rest of the internal cells. Here we show how it is possible to harvest an organ and characterise the level of layer-specific expression by using a periclinal chimera that has its L1 layer from Solanum pennellii and its L2 and L3 layers from Solanum lycopersicum. This is possible by measuring the frequency of species-specific transcripts. RNA-seq analysis enabled the genome-wide assessment of whether a gene is expressed in the L1 or L2/L3 layers. From 13 277 genes that are expressed in both the chimera and the parental lines and with at least one polymorphism between the parental alleles, we identified 382 genes that are preferentially expressed in L1 in contrast to 1159 genes in L2/L3. Gene ontology analysis shows that many genes preferentially expressed in L1 are involved in cutin and wax biosynthesis, whereas numerous genes that are preferentially expressed in L2/L3 tissue are associated with chloroplastic processes. These data demonstrate the usefulness of such chimeras and provide detailed information on the level of layer-specific expression of genes. © 2013 East Malling Research The Plant Journal © 2013 John Wiley & Sons Ltd.
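The read-counting idea in the abstract above can be sketched in a few lines. Because the L1 layer carries S. pennellii alleles and the L2/L3 layers carry S. lycopersicum alleles, the share of reads bearing the S. pennellii allele at a polymorphic site gives a rough indication of how much of a transcript comes from L1. The function names, cut-offs, minimum read depth and gene labels below are illustrative assumptions, not the authors' pipeline, which would also need to account for layer size, the parental lines and statistical testing.

```python
def l1_fraction(pennellii_reads, lycopersicum_reads):
    """Share of reads carrying the S. pennellii (L1) allele, or None if no reads."""
    total = pennellii_reads + lycopersicum_reads
    return pennellii_reads / total if total else None


def classify_genes(counts, l1_cut=0.8, inner_cut=0.2, min_reads=20):
    """Label each gene by the layer its species-specific reads point to.

    counts: dict mapping gene -> (pennellii_reads, lycopersicum_reads).
    The cut-offs and the minimum depth are illustrative assumptions only.
    """
    calls = {}
    for gene, (pen, lyc) in counts.items():
        if pen + lyc < min_reads:
            calls[gene] = "low coverage"  # too few reads to say anything
            continue
        frac = l1_fraction(pen, lyc)
        if frac >= l1_cut:
            calls[gene] = "L1"
        elif frac <= inner_cut:
            calls[gene] = "L2/L3"
        else:
            calls[gene] = "both"
    return calls


# Hypothetical read counts for four made-up genes.
example = {"geneA": (90, 10), "geneB": (5, 95), "geneC": (50, 50), "geneD": (3, 2)}
calls = classify_genes(example)
```

A fixed cut-off is the crudest possible rule; it is used here only to keep the sketch short.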
Arsovski, Andrej A.; Villota, Maria M.; Rowland, Owen; Subramaniam, Rajagopal; Western, Tamara L.
2009-01-01
Pollination triggers not only embryo development but also the differentiation of the ovule integuments to form a specialized seed coat. The mucilage secretory cells of the Arabidopsis thaliana seed coat undergo a complex differentiation process in which cell growth is followed by the synthesis and secretion of pectinaceous mucilage. A number of genes have been identified affecting mucilage secretory cell differentiation, including MUCILAGE-MODIFIED4 (MUM4). mum4 mutants produce a reduced amount of mucilage and cloning of MUM4 revealed that it encodes a UDP-L-rhamnose synthase that is developmentally up-regulated to provide rhamnose for mucilage pectin synthesis. To identify additional genes acting in mucilage synthesis and secretion, a screen for enhancers of the mum4 phenotype was performed. Eight mum enhancers (men) have been identified, two of which result from defects in known mucilage secretory cell genes (MUM2 and MYB61). Our results show that, in a mum4 background, mutations in MEN1, MEN4, and MEN5 lead to further reductions in mucilage compared to mum4 single mutants, suggesting that they are involved in mucilage synthesis or secretion. Conversely, mutations in MEN2 and MEN6 appear to affect mucilage release rather than quantity. With the exception of men4, whose single mutant exhibits reduced mucilage, none of these genes have a single mutant phenotype, suggesting that they would not have been identified outside the compromised mum4 background. PMID:19401413
Single locus typing of MHC class I and class II B loci in a population of red jungle fowl.
Worley, K; Gillingham, M; Jensen, P; Kennedy, L J; Pizzari, T; Kaufman, J; Richardson, D S
2008-05-01
In species with duplicated major histocompatibility complex (MHC) genes, estimates of genetic variation often rely on multilocus measures of diversity. It is possible that such measures might not always detect more detailed patterns of selection at individual loci. Here, we describe a method that allows us to investigate classical MHC diversity in red jungle fowl (Gallus gallus), the wild ancestor of the domestic chicken, using a single locus approach. This is possible due to the well-characterised gene organisation of the 'minimal essential' MHC (BF/BL region) of the domestic chicken, which comprises two differentially expressed duplicated class I (BF) and two class II B (BLB) genes. Using a combination of reference strand-mediated conformation analysis, cloning and sequencing, we identify nine BF and ten BLB alleles in a captive population of jungle fowl. We show that six BF and five BLB alleles are from the more highly expressed locus of each gene, BF2 and BLB2, respectively. An excess of non-synonymous substitutions across the jungle fowl BF/BL region suggests that diversifying selection has acted on this population. Importantly, single locus screening reveals that the strength of selection is greatest on the highly expressed BF2 locus. This is the first time that a population of red jungle fowl has been typed at the MHC region, laying the basis for further research into the underlying processes acting to maintain MHC diversity in this and other species.
Genome Wide Identification of SARS-CoV Susceptibility Loci Using the Collaborative Cross
Gralinski, Lisa E.; Ferris, Martin T.; Aylor, David L.; Whitmore, Alan C.; Green, Richard; Frieman, Matthew B.; Deming, Damon; Menachery, Vineet D.; Miller, Darla R.; Buus, Ryan J.; Bell, Timothy A.; Churchill, Gary A.; Threadgill, David W.; Katze, Michael G.; McMillan, Leonard; Valdar, William; Heise, Mark T.; Pardo-Manuel de Villena, Fernando; Baric, Ralph S.
2015-01-01
New systems genetics approaches are needed to rapidly identify host genes and genetic networks that regulate complex disease outcomes. Using genetically diverse animals from incipient lines of the Collaborative Cross mouse panel, we demonstrate a greatly expanded range of phenotypes relative to classical mouse models of SARS-CoV infection including lung pathology, weight loss and viral titer. Genetic mapping revealed several loci contributing to differential disease responses, including an 8.5Mb locus associated with vascular cuffing on chromosome 3 that contained 23 genes and 13 noncoding RNAs. Integrating phenotypic and genetic data narrowed this region to a single gene, Trim55, an E3 ubiquitin ligase with a role in muscle fiber maintenance. Lung pathology and transcriptomic data from mice genetically deficient in Trim55 were used to validate its role in SARS-CoV-induced vascular cuffing and inflammation. These data establish the Collaborative Cross platform as a powerful genetic resource for uncovering genetic contributions of complex traits in microbial disease severity, inflammation and virus replication in models of outbred populations. PMID:26452100
Epilepsy with auditory features
Licchetta, Laura; Baldassari, Sara; Palombo, Flavia; Menghi, Veronica; D'Aurizio, Romina; Leta, Chiara; Stipa, Carlotta; Boero, Giovanni; d'Orsi, Giuseppe; Magi, Alberto; Scheffer, Ingrid; Seri, Marco; Tinuper, Paolo; Bisulli, Francesca
2015-01-01
Objective: To identify novel genes implicated in epilepsy with auditory features (EAF) in phenotypically heterogeneous families with unknown molecular basis. Methods: We identified 15 probands with EAF in whom an LGI1 mutation had been excluded. We performed electroclinical phenotyping on all probands and available affected relatives. We used whole-exome sequencing (WES) in 20 individuals with EAF (including all the probands and 5 relatives) to identify single nucleotide variants, small insertions/deletions, and copy number variants. Results: WES revealed likely pathogenic variants in genes that had not been previously associated with EAF: a CNTNAP2 intragenic deletion, 2 truncating mutations of DEPDC5, and a missense SCN1A change. Conclusions: EAF is a clinically and molecularly heterogeneous disease. The association of EAF with CNTNAP2, DEPDC5, and SCN1A mutations widens the phenotypic spectrum related to these genes. CNTNAP2 encodes CASPR2, a member of the voltage-gated potassium channel complex in which LGI1 plays a role. The finding of a CNTNAP2 deletion emphasizes the importance of this complex in EAF and shows biological convergence. PMID:27066544
Recent developments in the genetics of schizophrenia.
Shastry, B S
1999-09-01
Schizophrenia, which is also called "split personality," is a complex and multifactorial mental disorder with variable clinical manifestations. It perhaps represents several diseases and occurs throughout the world. It is a more-prevalent disorder among homeless people and is clinically characterized by hallucinations and delusions. The pathophysiology of schizophrenia is not localized to a single region of the brain and the etiology of this illness is not understood. Because of its complex pattern of inheritance, genetic techniques are not readily applicable in identifying the genes responsible for this disorder. Family, twin, and adoption studies, however, provide strong but indirect support for genetic components in the etiology of schizophrenia. Extensive linkage analyses now suggest that susceptibility genes may be present on chromosomes 5q, 6p, 8p, 13q, 18p, and 22q. Identification and characterization of these and other genes, as well as non-genetic factors, is one of the greatest challenges in biomedicine. This may ultimately lead to the development of a new line of effective and safe drugs or treatments for its prevention or cure.
Functional analysis of human foamy virus accessory reading frames.
Baunach, G; Maurer, B; Hahn, H; Kranz, M; Rethwilm, A
1993-01-01
Foamy viruses belong to the retroviruses which possess a complex genome structure. The human foamy virus (HFV) isolate bears three open reading frames (the so-called bel genes) in the 3' region of the genome which have been reported to give rise to possibly six different proteins via alternative splicing (W. Muranyi and R. M. Flügel, J. Virol. 65:727-735, 1991). In order to analyze the requirements of these proteins for HFV replication in vitro, we constructed a set of single and combinatory bel gene mutants of an infectious molecular clone of HFV. The mutant which lacked the transacting activator, bel-1, was found to be replication incompetent. All other mutants replicated equally well and gave rise to comparable titers of infectious cell-free virus. When HFV proviruses were put under the control of a heterologous promoter (simian virus 40), none of the accessory gene products was found to be required for expression of structural (gag) proteins. There was no evidence for a posttranscriptional regulatory protein that is present in other complex retroviruses. PMID:8394455
Rohner, Sabine; Kalck, Veronique; Wang, Xuefei; Ikegami, Kohta; Lieb, Jason D.; Meister, Peter
2013-01-01
Some inducible yeast genes relocate to nuclear pores upon activation, but the general relevance of this phenomenon has remained largely unexplored. Here we show that the bidirectional hsp-16.2/41 promoter interacts with the nuclear pore complex upon activation by heat shock in the nematode Caenorhabditis elegans. Direct pore association was confirmed by both super-resolution microscopy and chromatin immunoprecipitation. The hsp-16.2 promoter was sufficient to mediate perinuclear positioning under basal level conditions of expression, both in integrated transgenes carrying from 1 to 74 copies of the promoter and in a single-copy genomic insertion. Perinuclear localization of the uninduced gene depended on promoter elements essential for induction and required the heat-shock transcription factor HSF-1, RNA polymerase II, and ENY-2, a factor that binds both SAGA and the THO/TREX mRNA export complex. After induction, colocalization with nuclear pores increased significantly at the promoter and along the coding sequence, dependent on the same promoter-associated factors, including active RNA polymerase II, and correlated with nascent transcripts. PMID:23460676
Decoding the role of regulatory element polymorphisms in complex disease.
Vockley, Christopher M; Barrera, Alejandro; Reddy, Timothy E
2017-04-01
Genetic variation in gene regulatory elements contributes to diverse human diseases, ranging from rare and severe developmental defects to common and complex diseases such as obesity and diabetes. Early examples of regulatory mechanisms of human diseases involve large chromosomal rearrangements that change the regulatory connections within the genome. Single nucleotide variants in regulatory elements can also contribute to disease, potentially via demonstrated associations with changes in transcription factor binding, enhancer activity, post-translational histone modifications, long-range enhancer-promoter interactions, or RNA polymerase recruitment. Establishing causality between non-coding genetic variants, gene regulation, and disease has recently become more feasible with advances in genome-editing and epigenome-editing technologies. As establishing causal regulatory mechanisms of diseases becomes routine, functional annotation of target genes is likely to emerge as a major bottleneck for translation into patient benefits. In this review, we discuss the history and recent advances in understanding the regulatory mechanisms of human disease, and new challenges likely to be encountered once establishing those mechanisms becomes rote. Copyright © 2016 Elsevier Ltd. All rights reserved.
Miller, Hilary C; O'Meally, Denis; Ezaz, Tariq; Amemiya, Chris; Marshall-Graves, Jennifer A; Edwards, Scott
2015-05-07
Major histocompatibility complex (MHC) genes are a central component of the vertebrate immune system and usually exist in a single genomic region. However, considerable differences in MHC organization and size exist between different vertebrate lineages. Reptiles occupy a key evolutionary position for understanding how variation in MHC structure evolved in vertebrates, but information on the structure of the MHC region in reptiles is limited. In this study, we investigate the organization and cytogenetic location of MHC genes in the tuatara (Sphenodon punctatus), the sole extant representative of the early-diverging reptilian order Rhynchocephalia. Sequencing and mapping of 12 clones containing class I and II MHC genes from a bacterial artificial chromosome library indicated that the core MHC region is located on chromosome 13q. However, duplication and translocation of MHC genes outside of the core region was evident, because additional class I MHC genes were located on chromosome 4p. We found a total of seven class I sequences and 11 class II β sequences, with evidence for duplication and pseudogenization of genes within the tuatara lineage. The tuatara MHC is characterized by high repeat content and low gene density compared with other species and we found no antigen processing or MHC framework genes on the MHC gene-containing clones. Our findings indicate substantial differences in MHC organization in tuatara compared with mammalian and avian MHCs and highlight the dynamic nature of the MHC. Further sequencing and annotation of tuatara and other reptile MHCs will determine if the tuatara MHC is representative of nonavian reptiles in general. Copyright © 2015 Miller et al.
Fabre, Michel; Koeck, Jean-Louis; Le Flèche, Philippe; Simon, Fabrice; Hervé, Vincent; Vergnaud, Gilles; Pourcel, Christine
2004-01-01
We have analyzed, using complementary molecular methods, the diversity of 43 strains of “Mycobacterium canettii” originating from the Republic of Djibouti, on the Horn of Africa, from 1998 to 2003. Genotyping by multiple-locus variable-number tandem repeat analysis shows that all the strains belong to a single but very distant group when compared to strains of the Mycobacterium tuberculosis complex (MTBC). Thirty-one strains cluster into one large group with little variability and five strains form another group, whereas the other seven are more diverged. In total, 14 genotypes are observed. The DR locus analysis reveals additional variability, some strains being devoid of a direct repeat locus and others having unique spacers. The hsp65 gene polymorphism was investigated by restriction enzyme analysis and sequencing of PCR amplicons. Four new single nucleotide polymorphisms were discovered. One strain was characterized by three nucleotide changes in 441 bp, creating new restriction enzyme polymorphisms. As no sequence variability was found for hsp65 in the whole MTBC, and as a single point mutation separates M. tuberculosis from the closest “M. canettii” strains, this diversity within “M. canettii” subspecies strongly suggests that it is the most probable source species of the MTBC rather than just another branch of the MTBC. PMID:15243089
Genome-Wide Analysis of Gene-Gene and Gene-Environment Interactions Using Closed-Form Wald Tests.
Yu, Zhaoxia; Demetriou, Michael; Gillen, Daniel L
2015-09-01
Despite the successful discovery of hundreds of variants for complex human traits using genome-wide association studies, the degree to which genes and environmental risk factors jointly affect disease risk is largely unknown. One obstacle toward this goal is that the computational effort required for testing gene-gene and gene-environment interactions is enormous. As a result, numerous computationally efficient tests were recently proposed. However, the validity of these methods often relies on unrealistic assumptions such as additive main effects, main effects at only one variable, no linkage disequilibrium between the two single-nucleotide polymorphisms (SNPs) in a pair or gene-environment independence. Here, we derive closed-form and consistent estimates for interaction parameters and propose to use Wald tests for testing interactions. The Wald tests are asymptotically equivalent to the likelihood ratio tests (LRTs), largely considered to be the gold standard tests but generally too computationally demanding for genome-wide interaction analysis. Simulation studies show that the proposed Wald tests perform very similarly to the LRTs but are much more computationally efficient. Applying the proposed tests to a genome-wide study of multiple sclerosis, we identify interactions within the major histocompatibility complex region. In this application, we find that (1) focusing on pairs where both SNPs are marginally significant leads to more significant interactions when compared to focusing on pairs where at least one SNP is marginally significant; and (2) parsimonious parameterization of interaction effects might decrease, rather than increase, statistical power. © 2015 WILEY PERIODICALS, INC.
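To illustrate why a closed-form Wald test is cheap, the sketch below assumes the simplest possible setting, which is not taken from the paper: a case-control study with two binary factors (for example, carrier status at two SNPs) and a saturated logistic model. In that setting the interaction estimate is a difference of differences of log odds, and its large-sample variance is the sum of the reciprocal cell counts, so no iterative model fitting is needed. The function name, the table layout and the counts are hypothetical.

```python
import math


def wald_interaction(cases, controls):
    """Closed-form Wald test for a multiplicative (logit-scale) interaction.

    cases, controls: 2x2 nested lists of counts indexed [a][b], where a and b
    are the 0/1 levels of the two binary factors (hypothetical layout).
    Returns (beta, z, two-sided p-value).
    """
    cells = [cases[a][b] for a in (0, 1) for b in (0, 1)]
    cells += [controls[a][b] for a in (0, 1) for b in (0, 1)]
    if min(cells) <= 0:
        raise ValueError("all eight cells must be positive")

    def log_odds(a, b):
        return math.log(cases[a][b] / controls[a][b])

    # Interaction on the logit scale: difference of differences of log odds.
    beta = log_odds(1, 1) - log_odds(1, 0) - log_odds(0, 1) + log_odds(0, 0)
    # Large-sample variance of beta: sum of reciprocal cell counts.
    var = sum(1.0 / n for n in cells)
    z = beta / math.sqrt(var)
    # Two-sided p-value from the standard normal distribution.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return beta, z, p


# Hypothetical counts: the odds of disease are halved by either factor alone
# but unchanged when both are present.
beta, z, p = wald_interaction([[100, 50], [50, 100]], [[100, 100], [100, 100]])
```

Each SNP pair costs only a handful of logarithms and additions, which is the kind of saving that makes genome-wide pair scans feasible; the paper's estimators cover more general settings than this two-by-two example.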
Windhorst, Dafna A; Mileva-Seitz, Viara R; Rippe, Ralph C A; Tiemeier, Henning; Jaddoe, Vincent W V; Verhulst, Frank C; van IJzendoorn, Marinus H; Bakermans-Kranenburg, Marian J
2016-08-01
In a longitudinal cohort study, we investigated the interplay of harsh parenting and genetic variation across a set of functionally related dopamine genes, in association with children's externalizing behavior. This is one of the first studies to employ gene-based and gene-set approaches in tests of Gene by Environment (G × E) effects on complex behavior. This approach can offer an important alternative or complement to candidate gene and genome-wide environmental interaction (GWEI) studies in the search for genetic variation underlying individual differences in behavior. Genetic variants in 12 autosomal dopaminergic genes were available in an ethnically homogenous part of a population-based cohort. Harsh parenting was assessed with maternal (n = 1881) and paternal (n = 1710) reports at age 3. Externalizing behavior was assessed with the Child Behavior Checklist (CBCL) at age 5 (71 ± 3.7 months). We conducted gene-set analyses of the association between variation in dopaminergic genes and externalizing behavior, stratified for harsh parenting. The association was statistically significant or approached significance for children without harsh parenting experiences, but was absent in the group with harsh parenting. Similarly, significant associations between single genes and externalizing behavior were only found in the group without harsh parenting. Effect sizes in the groups with and without harsh parenting did not differ significantly. Gene-environment interaction tests were conducted for individual genetic variants, resulting in two significant interaction effects (rs1497023 and rs4922132) after correction for multiple testing. Our findings are suggestive of G × E interplay, with associations between dopamine genes and externalizing behavior present in children without harsh parenting, but not in children with harsh parenting experiences. Harsh parenting may overrule the role of genetic factors in externalizing behavior. 
Gene-based and gene-set analyses offer promising new alternatives to analyses focusing on single candidate polymorphisms when examining the interplay between genetic and environmental factors.
Garza-Brenner, E; Sifuentes-Rincón, A M; Randel, R D; Paredes-Sánchez, F A; Parra-Bracamonte, G M; Arellano Vera, W; Rodríguez Almeida, F A; Segura Cabrera, A
2017-08-01
Cattle temperament is a complex trait, and molecular studies aimed at defining this trait are scarce. We used an interaction networks approach to identify new genes (interacting genes) and to estimate their effects and those of 19 dopamine- and serotonin-related genes on the temperament traits of Charolais cattle. The genes proopiomelanocortin (POMC), neuropeptide Y (NPY), solute carrier family 18, member 2 (SLC18A2) and FBJ murine osteosarcoma viral oncogene homologue (FOSFBJ) were identified as new candidates. Their potential to be associated with temperament was estimated according to their reported biological activities, which included interactions with neural activity, receptor function, targeting or synthesis of neurotransmitters and association with behaviour. Pen score (PS) and exit velocity (EV) measures were determined from 412 Charolais cows to calculate their temperament score (TS). Based on the TS, calm (n = 55; TS, 1.09 ± 0.33) and temperamental (n = 58; TS, 2.27 ± 0.639) cows were selected and genotyped using a 248 single-nucleotide variation (SNV) panel. Of the 248 variations in the panel, only 151 were confirmed to be polymorphic (single-nucleotide polymorphisms; SNPs) in the tested population. Single-marker association analyses between genotypes and temperament measures (EV, PS and/or TS) indicated significant associations of six SNPs from four candidate genes. The markers rs109576799 and rs43696138, located in the DRD3 and HTR2A genes, respectively, were significantly associated with both EV and TS traits. Four markers, rs110365063 and rs137756569 from the POMC gene and rs110365063 and rs135155082 located in SLC18A2 and DRD2, respectively, were associated with PS. The variant rs110365063 located in bovine SLC18A2 causes a change in the amino acid sequence from Ala to Thr. 
Further studies are needed to confirm the association of genetic profile with cattle temperament; however, our study represents important progress in understanding the regulation of cattle temperament by different genes with divergent functions.
Disabling Cas9 by an anti-CRISPR DNA mimic.
Shin, Jiyung; Jiang, Fuguo; Liu, Jun-Jie; Bray, Nicolas L; Rauch, Benjamin J; Baik, Seung Hyun; Nogales, Eva; Bondy-Denomy, Joseph; Corn, Jacob E; Doudna, Jennifer A
2017-07-01
CRISPR (clustered regularly interspaced short palindromic repeats)-Cas9 gene editing technology is derived from a microbial adaptive immune system, where bacteriophages are often the intended target. Natural inhibitors of CRISPR-Cas9 enable phages to evade immunity and show promise in controlling Cas9-mediated gene editing in human cells. However, the mechanism of CRISPR-Cas9 inhibition is not known, and the potential applications for Cas9 inhibitor proteins in mammalian cells have not been fully established. We show that the anti-CRISPR protein AcrIIA4 binds only to assembled Cas9-single-guide RNA (sgRNA) complexes and not to Cas9 protein alone. A 3.9 Å resolution cryo-electron microscopy structure of the Cas9-sgRNA-AcrIIA4 complex revealed that the surface of AcrIIA4 is highly acidic and binds with a 1:1 stoichiometry to a region of Cas9 that normally engages the DNA protospacer adjacent motif. Consistent with this binding mode, order-of-addition experiments showed that AcrIIA4 interferes with DNA recognition but has no effect on preformed Cas9-sgRNA-DNA complexes. Timed delivery of AcrIIA4 into human cells as either protein or expression plasmid allows on-target Cas9-mediated gene editing while reducing off-target edits. These results provide a mechanistic understanding of AcrIIA4 function and demonstrate that inhibitors can modulate the extent and outcomes of Cas9-mediated gene editing.
Lobach, Iryna; Fan, Ruzong; Carroll, Raymond J.
2011-01-01
With the advent of dense single nucleotide polymorphism genotyping, population-based association studies have become the major tools for identifying human disease genes and for fine gene mapping of complex traits. We develop a genotype-based approach for association analysis of case-control studies of gene-environment interactions in the case when environmental factors are measured with error and genotype data are available on multiple genetic markers. To directly use the observed genotype data, we propose two genotype-based models: genotype effect and additive effect models. Our approach offers several advantages. First, the proposed risk functions can directly incorporate the observed genotype data while modeling the linkage disequilibrium information in the regression coefficients, thus eliminating the need to infer haplotype phase. Compared with the haplotype-based approach, an estimating procedure based on the proposed methods can be much simpler and significantly faster. In addition, there is no potential risk due to haplotype phase estimation. Further, by fitting the proposed models, it is possible to analyze the risk alleles/variants of complex diseases, including their dominant or additive effects. To model measurement error, we adopt the pseudo-likelihood method by Lobach et al. [2008]. Performance of the proposed method is examined using simulation experiments. An application of our method is illustrated using a population-based case-control study of the association between calcium intake and the risk of colorectal adenoma development. PMID:21031455
Yu, Bin; Xu, Jia-Meng; Li, Shan; Chen, Cheng; Chen, Rui-Xin; Wang, Lei; Zhang, Yan; Wang, Ming-Hui
2017-01-01
Gene regulatory networks (GRNs) research reveals complex life phenomena from the perspective of gene interaction, which is an important research field in systems biology. Traditional Bayesian networks have a high computational complexity, and the network structure scoring model has a single feature. Information-based approaches cannot identify the direction of regulation. In order to make up for the shortcomings of the above methods, this paper presents a novel hybrid learning method (DBNCS) based on dynamic Bayesian network (DBN) to construct the multiple time-delayed GRNs for the first time, combining the comprehensive score (CS) with the DBN model. The DBNCS algorithm first uses the CMI2NI (conditional mutual inclusive information-based network inference) algorithm to learn network structure profiles, that is, to construct the search space. Redundant regulations are then removed using the recursive optimization (RO) algorithm, thereby reducing the false positive rate. Next, the network structure profiles are decomposed into a set of cliques without loss, which can significantly reduce the computational complexity. Finally, the DBN model is used to identify the direction of gene regulation within the cliques and search for the optimal network structure. The performance of the DBNCS algorithm is evaluated on the benchmark GRN datasets from the DREAM challenge as well as the SOS DNA repair network in Escherichia coli, and compared with other state-of-the-art methods. The experimental results show the rationality of the algorithm design and the outstanding performance of the GRNs. PMID:29113310
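The first step described above, using an information measure to build a search space of candidate regulations, can be illustrated with a much simpler stand-in. The sketch below uses plain pairwise mutual information under a Gaussian assumption, where MI = -0.5 ln(1 - r^2) for Pearson correlation r, and keeps gene pairs above a threshold. This is not CMI2NI's conditional mutual inclusive information and not the DBNCS implementation; the function names, input layout and threshold are assumptions made for illustration.

```python
import math
from itertools import combinations


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)


def gaussian_mi(x, y):
    """Mutual information in nats, assuming x and y are jointly Gaussian."""
    r = pearson(x, y)
    r2 = min(r * r, 1.0 - 1e-12)  # avoid log(0) for perfectly correlated series
    return -0.5 * math.log(1.0 - r2)


def candidate_edges(expr, threshold=0.1):
    """Undirected gene pairs whose estimated MI exceeds the threshold.

    expr: dict mapping gene name -> list of expression values (hypothetical
    input layout). The threshold is an arbitrary illustrative value.
    """
    return [(g, h) for g, h in combinations(sorted(expr), 2)
            if gaussian_mi(expr[g], expr[h]) > threshold]


# Made-up expression profiles: a and b move together, c is unrelated.
expr = {"a": [1, 2, 3, 4, 5], "b": [2, 4, 6, 8, 11], "c": [3, 1, 3, 1, 3]}
edges = candidate_edges(expr)
```

The pairs returned here are undirected, which matches the abstract's point that information-based approaches alone cannot assign a direction of regulation; that is the role given to the DBN step.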
Yu, Bin; Xu, Jia-Meng; Li, Shan; Chen, Cheng; Chen, Rui-Xin; Wang, Lei; Zhang, Yan; Wang, Ming-Hui
2017-10-06
Gene regulatory networks (GRNs) research reveals complex life phenomena from the perspective of gene interaction, which is an important research field in systems biology. Traditional Bayesian networks have a high computational complexity, and the network structure scoring model has a single feature. Information-based approaches cannot identify the direction of regulation. In order to make up for the shortcomings of the above methods, this paper presents a novel hybrid learning method (DBNCS) based on dynamic Bayesian network (DBN) to construct the multiple time-delayed GRNs for the first time, combining the comprehensive score (CS) with the DBN model. The DBNCS algorithm first uses the CMI2NI (conditional mutual inclusive information-based network inference) algorithm for network structure profiles learning, namely the construction of search space. Then the redundant regulations are removed by using the recursive optimization algorithm (RO), thereby reducing the false positive rate. Secondly, the network structure profiles are decomposed into a set of cliques without loss, which can significantly reduce the computational complexity. Finally, the DBN model is used to identify the direction of gene regulation within the cliques and search for the optimal network structure. The performance of the DBNCS algorithm is evaluated by the benchmark GRN datasets from DREAM challenge as well as the SOS DNA repair network in Escherichia coli, and compared with other state-of-the-art methods. The experimental results show the rationality of the algorithm design and the outstanding performance of the GRNs.
Vogt, Richard G; Große-Wilde, Ewald; Zhou, Jing-Jiang
2015-07-01
Butterflies and moths differ significantly in their daily activities: butterflies are diurnal while moths are largely nocturnal or crepuscular. This life history difference is presumably reflected in their sensory biology, and especially the balance between the use of chemical versus visual signals. Odorant Binding Proteins (OBP) are a class of insect proteins, at least some of which are thought to orchestrate the transfer of odor molecules within an olfactory sensillum (olfactory organ), between the air and odor receptor proteins (ORs) on the olfactory neurons. A Lepidoptera specific subclass of OBPs are the GOBPs and PBPs; these were the first OBPs studied and have well documented associations with olfactory sensilla. We have used the available genomes of two moths, Manduca sexta and Bombyx mori, and two butterflies, Danaus plexippus and Heliconius melpomene, to characterize the GOBP/PBP genes, attempting to identify gene orthologs and document specific gene gain and loss. First, we identified the full repertoire of OBPs in the M. sexta genome, and compared these with the full repertoire of OBPs from the other three lepidopteran genomes, the OBPs of Drosophila melanogaster and select OBPs from other Lepidoptera. We also evaluated the tissue specific expression of the M. sexta OBPs using available RNAseq databases. In the four lepidopteran species, GOBP2 and all PBPs reside in single gene clusters; in two species GOBP1 is documented to be nearby, about 100 kb from the cluster; all GOBP/PBP genes share a common gene structure indicating a common origin. As such, the GOBP/PBP genes form a gene complex.
Our findings suggest that (1) the lepidopteran GOBP/PBP complex is a monophyletic lineage with origins deep within Lepidoptera phylogeny, (2) within this lineage PBP gene evolution is much more dynamic than GOBP gene evolution, and (3) butterflies may have lost a PBP gene that plays an important role in moth pheromone detection, correlating with a shift from olfactory (moth) to visual (butterfly) communication, at least regarding long distance mate recognition. These findings will be clarified by additional lepidopteran genomic data, but the observation that moths and butterflies share most of the PBP/GOBP genes suggests that they also share common chemosensory-based behavioral pathways. Copyright © 2015 Elsevier Ltd. All rights reserved.
Sewell, Holly L; Kaster, Anne-Kristin; Spormann, Alfred M
2017-12-19
The deep marine subsurface is one of the largest unexplored biospheres on Earth and is widely inhabited by members of the phylum Chloroflexi. In this report, we investigated genomes of single cells obtained from deep-sea sediments of the Peruvian Margin, which are enriched in such Chloroflexi. 16S rRNA gene sequence analysis placed two of these single-cell-derived genomes (DscP3 and Dsc4) in a clade of subphylum I Chloroflexi which were previously recovered from deep-sea sediment in the Okinawa Trough and a third (DscP2-2) as a member of the previously reported DscP2 population from Peruvian Margin site 1230. The presence of genes encoding enzymes of a complete Wood-Ljungdahl pathway, glycolysis/gluconeogenesis, a Rhodobacter nitrogen fixation (Rnf) complex, glycosyltransferases, and formate dehydrogenases in the single-cell genomes of DscP3 and Dsc4 and the presence of an NADH-dependent reduced ferredoxin:NADP oxidoreductase (Nfn) and Rnf in the genome of DscP2-2 imply a homoacetogenic lifestyle of these abundant marine Chloroflexi. We also report here the first complete pathway for anaerobic benzoate oxidation to acetyl coenzyme A (CoA) in the phylum Chloroflexi (DscP3 and Dsc4), including a class I benzoyl-CoA reductase. Of remarkable evolutionary significance, we discovered a gene encoding a formate dehydrogenase (FdnI) with reciprocal closest identity to the formate dehydrogenase-like protein (complex iron-sulfur molybdoenzyme [CISM], DET0187) of terrestrial Dehalococcoides/Dehalogenimonas spp. This formate dehydrogenase-like protein has been shown to lack formate dehydrogenase activity in Dehalococcoides/Dehalogenimonas spp. and is instead hypothesized to couple HupL hydrogenase to a reductive dehalogenase in the catabolic reductive dehalogenation pathway. This finding of a close functional homologue provides an important missing link for understanding the origin and the metabolic core of terrestrial Dehalococcoides/Dehalogenimonas spp. and of reductive dehalogenation, as well as the biology of abundant deep-sea Chloroflexi. IMPORTANCE The deep marine subsurface is one of the largest unexplored biospheres on Earth and is widely inhabited by members of the phylum Chloroflexi. In this report, we investigated genomes of single cells obtained from deep-sea sediments and provide evidence for a homoacetogenic lifestyle of these abundant marine Chloroflexi. Moreover, genome signature and key metabolic genes indicate an evolutionary relationship between these deep-sea sediment microbes and terrestrial, reductively dehalogenating Dehalococcoides.
Copyright © 2017 Sewell et al.
Loveridge, E Joel; Jones, Cerith; Bull, Matthew J; Moody, Suzy C; Kahl, Małgorzata W; Khan, Zainab; Neilson, Louis; Tomeva, Marina; Adams, Sarah E; Wood, Andrew C; Rodriguez-Martin, Daniel; Pinel, Ingrid; Parkhill, Julian; Mahenthiralingam, Eshwar; Crosby, John
2017-07-01
Pseudomonas mesoacidophila ATCC 31433 is a Gram-negative bacterium, first isolated from Japanese soil samples, that produces the monobactam isosulfazecin and the β-lactam-potentiating bulgecins. To characterize the biosynthetic potential of P. mesoacidophila ATCC 31433, its complete genome was determined using single-molecule real-time DNA sequence analysis. The 7.8-Mb genome comprised four replicons, three chromosomal (each encoding rRNA) and one plasmid. Phylogenetic analysis demonstrated that P. mesoacidophila ATCC 31433 was misclassified at the time of its deposition and is a member of the Burkholderia cepacia complex, most closely related to Burkholderia ubonensis. The sequenced genome shows considerable additional biosynthetic potential; known gene clusters for malleilactone, ornibactin, isosulfazecin, alkylhydroxyquinoline, and pyrrolnitrin biosynthesis and several uncharacterized biosynthetic gene clusters for polyketides, nonribosomal peptides, and other metabolites were identified. Furthermore, P. mesoacidophila ATCC 31433 harbors many genes associated with environmental resilience and antibiotic resistance and was resistant to a range of antibiotics and metal ions. In summary, this bioactive strain should be designated B. cepacia complex strain ATCC 31433, pending further detailed taxonomic characterization. IMPORTANCE This work reports the complete genome sequence of Pseudomonas mesoacidophila ATCC 31433, a known producer of bioactive compounds. Large numbers of both known and novel biosynthetic gene clusters were identified, indicating that P. mesoacidophila ATCC 31433 is an untapped resource for discovery of novel bioactive compounds. Phylogenetic analysis demonstrated that P. mesoacidophila ATCC 31433 is in fact a member of the Burkholderia cepacia complex, most closely related to the species Burkholderia ubonensis. Further investigation of the classification and biosynthetic potential of P. mesoacidophila ATCC 31433 is warranted.
Copyright © 2017 Loveridge et al.
Gao, Feng; Song, Weibo; Katz, Laura A.
2014-01-01
In most lineages, diversity among gene family members results from gene duplication followed by sequence divergence. Because of the genome rearrangements during the development of somatic nuclei, gene family evolution in ciliates involves more complex processes. Previous work on the ciliate Chilodonella uncinata revealed that macronuclear β-tubulin gene family members are generated by alternative processing, in which germline regions are alternatively used in multiple macronuclear chromosomes. To further study genome evolution in this ciliate, we analyzed its transcriptome and found that: 1) alternative processing is extensive among gene families; and 2) such gene families are likely to be C. uncinata-specific. We characterized additional macronuclear and micronuclear copies of one candidate alternatively processed gene family -- a protein kinase domain containing protein (PKc) -- from two C. uncinata strains. Analysis of the PKc sequences reveals: 1) multiple PKc gene family members in the macronucleus share some identical regions flanked by divergent regions; and 2) the shared identical regions are processed from a single micronuclear chromosome. We discuss analogous processes in lineages across the eukaryotic tree of life to provide further insights on the impact of genome structure on gene family evolution in eukaryotes. PMID:24749903
Autism genetics: Methodological issues and experimental design.
Sacco, Roberto; Lintas, Carla; Persico, Antonio M
2015-10-01
Autism is a complex neuropsychiatric disorder of developmental origin, where multiple genetic and environmental factors likely interact resulting in a clinical continuum between "affected" and "unaffected" individuals in the general population. During the last two decades, relevant progress has been made in identifying chromosomal regions and genes in linkage or association with autism, but no single gene has emerged as a major cause of disease in a large number of patients. The purpose of this paper is to discuss specific methodological issues and experimental strategies in autism genetic research, based on fourteen years of experience in patient recruitment and association studies of autism spectrum disorder in Italy.
Findley, Seth D; Mormile, Melanie R; Sommer-Hurley, Andrea; Zhang, Xue-Cheng; Tipton, Peter; Arnett, Krista; Porter, James H; Kerley, Monty; Stacey, Gary
2011-11-01
The rumen, the foregut of herbivorous ruminant animals such as cattle, functions as a bioreactor to process complex plant material. Among the numerous and diverse microbes involved in ruminal digestion are the ruminal protozoans, which are single-celled, ciliated eukaryotic organisms. An activity-based screen was executed to identify genes encoding fibrolytic enzymes present in the metatranscriptome of a bovine ruminal protozoan-enriched cDNA expression library. Of the four novel genes identified, two were characterized in biochemical assays. Our results provide evidence for the effective use of functional metagenomics to retrieve novel enzymes from microbial populations that cannot be maintained in axenic cultures.
Mitochondrial genome deletions and minicircles are common in lice (Insecta: Phthiraptera)
2011-01-01
Background The gene composition, gene order and structure of the mitochondrial genome are remarkably stable across bilaterian animals. Lice (Insecta: Phthiraptera) are a major exception to this genomic stability in that the canonical single chromosome with 37 genes found in almost all other bilaterians has been lost in multiple lineages in favour of multiple, minicircular chromosomes with fewer than 37 genes on each chromosome. Results Minicircular mt genomes are found in six of the ten louse species examined to date and three types of minicircles were identified: heteroplasmic minicircles which coexist with full sized mt genomes (type 1); multigene chromosomes with short, simple control regions, from which we infer that the genome consists of several such chromosomes (type 2); and multiple, single to three gene chromosomes with large, complex control regions (type 3). Mapping minicircle types onto a phylogenetic tree of lice fails to show a pattern of their occurrence consistent with an evolutionary series of minicircle types. Analysis of the nuclear-encoded, mitochondrially-targeted genes inferred from the body louse, Pediculus, suggests that the loss of mitochondrial single-stranded binding protein (mtSSB) may be responsible for the presence of minicircles at least in species with the most derived type 3 minicircles (Pediculus, Damalinia). Conclusions Minicircular mt genomes are common in lice and appear to have arisen multiple times within the group. Life history adaptive explanations which attribute minicircular mt genomes in lice to the adoption of blood-feeding in the Anoplura are not supported by this expanded data set as minicircles are found in multiple non-blood feeding louse groups but are not found in the blood-feeding genus Heterodoxus. In contrast, a mechanistic explanation based on the loss of mtSSB suggests that minicircles may be selectively favoured due to the incapacity of the mt replisome to synthesize long replicative products without mtSSB and thus the loss of this gene led to the formation of minicircles in lice. PMID:21813020
Mitochondrial genome deletions and minicircles are common in lice (Insecta: Phthiraptera).
Cameron, Stephen L; Yoshizawa, Kazunori; Mizukoshi, Atsushi; Whiting, Michael F; Johnson, Kevin P
2011-08-04
The gene composition, gene order and structure of the mitochondrial genome are remarkably stable across bilaterian animals. Lice (Insecta: Phthiraptera) are a major exception to this genomic stability in that the canonical single chromosome with 37 genes found in almost all other bilaterians has been lost in multiple lineages in favour of multiple, minicircular chromosomes with fewer than 37 genes on each chromosome. Minicircular mt genomes are found in six of the ten louse species examined to date and three types of minicircles were identified: heteroplasmic minicircles which coexist with full sized mt genomes (type 1); multigene chromosomes with short, simple control regions, from which we infer that the genome consists of several such chromosomes (type 2); and multiple, single to three gene chromosomes with large, complex control regions (type 3). Mapping minicircle types onto a phylogenetic tree of lice fails to show a pattern of their occurrence consistent with an evolutionary series of minicircle types. Analysis of the nuclear-encoded, mitochondrially-targeted genes inferred from the body louse, Pediculus, suggests that the loss of mitochondrial single-stranded binding protein (mtSSB) may be responsible for the presence of minicircles at least in species with the most derived type 3 minicircles (Pediculus, Damalinia). Minicircular mt genomes are common in lice and appear to have arisen multiple times within the group. Life history adaptive explanations which attribute minicircular mt genomes in lice to the adoption of blood-feeding in the Anoplura are not supported by this expanded data set as minicircles are found in multiple non-blood feeding louse groups but are not found in the blood-feeding genus Heterodoxus. In contrast, a mechanistic explanation based on the loss of mtSSB suggests that minicircles may be selectively favoured due to the incapacity of the mt replisome to synthesize long replicative products without mtSSB and thus the loss of this gene led to the formation of minicircles in lice.
NASA Astrophysics Data System (ADS)
Duan, Lingxun; Bagasra, Omar; Laughlin, Mark A.; Oakes, Joseph W.; Pomerantz, Roger J.
1994-05-01
Human immunodeficiency virus type 1 (HIV-1) has a complex life cycle, which has made it a difficult target for conventional therapeutic modalities. A single-chain antibody moiety, directed against the HIV-1 regulatory protein Rev, which rescues unspliced viral RNA from the nucleus of infected cells, has now been developed. This anti-Rev single-chain construct (SFv) consists of both light and heavy chain variable regions of an anti-Rev monoclonal antibody, which, when expressed intracellularly within human cells, potently inhibits HIV-1 replication. This intracellular SFv molecule is demonstrated to specifically antagonize Rev function. Thus, intracellular SFv expression, against a retroviral regulatory protein, may be useful as a gene therapeutic approach to combat HIV-1 infections.
Huzil, John Torin; Saliaj, Evi; Ivanova, Marina V; Gharagozloo, Marjan; Loureiro, Maria Jimena; Lamprecht, Constanze; Korinek, Andreas; Chen, Ding Wen; Foldvari, Marianna
2015-01-01
Background: The potential use of carbon nanotubes (CNTs) in gene therapy as delivery systems for nucleic acids has been recently recognized. Here, we describe that metallic versus semiconducting single-wall CNTs can produce significant differences in transfection rate and cellular distribution of siRNA in murine PAM212 keratinocytes. Results/Methodology: The results of cell interaction studies, coupled with supportive computational simulations and ultrastructural studies revealed that the use of metallic single wall CNTs resulted in siRNA delivery into both the cytoplasm and nucleus of keratinocytes, whereas semiconducting CNTs resulted in delivery only to the cytoplasm. Conclusion: Using enriched fractions of metallic or semiconducting CNTs for siRNA complex preparation may provide specific subcellular targeting advantages. PMID:28031892
Functional conservation and structural diversification of silk sericins in two moth species.
Zurovec, Michal; Kludkiewicz, Barbara; Fedic, Robert; Sulitkova, Jitka; Mach, Vaclav; Kucerova, Lucie; Sehnal, Frantisek
2013-06-10
Sericins are hydrophilic structural proteins produced by caterpillars in the middle section of silk glands and layered over fibroin proteins secreted in the posterior section. In the process of spinning, fibroins form strong solid filaments, while sericins seal the pair of filaments into a single fiber and glue the fiber into a cocoon. Galleria mellonella and the previously examined Bombyx mori harbor three sericin genes that encode proteins containing long repetitive regions. Galleria sericin genes are similar to each other and the protein repeats are built from short and extremely serine-rich motifs, while Bombyx sericin genes are diversified and encode proteins with long and complex repeats. Developmental changes in sericin properties are controlled at the level of gene expression and splicing. In Galleria, MG-1 sericin is produced throughout larval life until the wandering stage, while the production of MG-2 and MG-3 reaches a peak during cocoon spinning.
Therapeutic applications of CRISPR/Cas9 system in gene therapy.
Mollanoori, Hasan; Teimourian, Shahram
2018-06-01
Gene therapy is based on the principle of the genetic manipulation of DNA or RNA for treating and preventing human diseases. The clustered regularly interspaced short palindromic repeats/CRISPR-associated nuclease 9 (CRISPR/Cas9) system, derived from the acquired immune system in bacteria and archaea, has provided a new tool for accurate manipulation of genomic sequence to attain a therapeutic result. The advantage of CRISPR, which has made it an easy and flexible tool for diverse genome editing purposes, is that a single protein (Cas9) in complex with two short RNA sequences functions as a site-specific endonuclease. Recently, application of the CRISPR/Cas9 system has become popular for therapeutic aims such as gene therapy. In this article, we review the fundamental mechanisms of CRISPR-Cas9 function and summarize preclinical CRISPR-mediated gene therapy reports on a wide variety of disorders.
Rawat, Suman R.; Männistö, Minna K.; Starovoytov, Valentin; Goodwin, Lynne; Nolan, Matt; Hauser, Loren J.; Land, Miriam; Davenport, Karen Walston; Woyke, Tanja; Häggblom, Max M.
2013-01-01
Granulicella mallensis MP5ACTX8T is a novel species of the genus Granulicella in subdivision 1 of Acidobacteria. G. mallensis is of ecological interest being a member of the dominant soil bacterial community active at low temperatures and nutrient limiting conditions in Arctic alpine tundra. G. mallensis is a cold-adapted acidophile and a versatile heterotroph that hydrolyzes a suite of sugars and complex polysaccharides. Genome analysis revealed metabolic versatility with genes involved in metabolism and transport of carbohydrates. These include gene modules encoding the carbohydrate-active enzyme (CAZyme) family involved in breakdown, utilization and biosynthesis of diverse structural and storage polysaccharides including plant based carbon polymers. The genome of Granulicella mallensis MP5ACTX8T consists of a single replicon of 6,237,577 base pairs (bp) with 4,907 protein-coding genes and 53 RNA genes. PMID:24501646
Genetics of schizophrenia in the context of integrative psychiatry.
Sagud, Marina; Mihaljević-Peles, Alma; Pivac, Nela; Muck-Seler, Dorotea; Simunović, Ivona; Jakovljević, Miro
2008-09-01
Epidemiological studies suggest a strong heritability in schizophrenia. Positive family history is the greatest risk factor for developing schizophrenia. However, regarding the genetic factors in schizophrenia, there is considerable inconsistency (i.e. non-replication) in the literature on the associations of different genes with schizophrenia. The presence of a single gene is neither sufficient nor necessary to cause schizophrenia. The understanding of the genetic basis of schizophrenia is complex. Besides different gene polymorphisms, numerous environmental factors, interacting with genes, contribute to susceptibility to schizophrenia. Such factors include the use of street drugs, childhood head injury, maternal infection during pregnancy, paternal age at conception, stressful life events and urban upbringing. While knowing genetic risks, integrative psychiatry may have a role in reducing other modifiable risk factors, including reduction of stress level, stress management strategies, family consultation/education, education against street drugs use, treatment of prodromal symptoms and development of social skills.
Pandey, Daya Shankar; Pan, Indranil; Das, Saptarshi; Leahy, James J; Kwapinski, Witold
2015-03-01
A multi-gene genetic programming technique is proposed as a new method to predict syngas yield production and the lower heating value for municipal solid waste gasification in a fluidized bed gasifier. The study shows that the predicted outputs of the municipal solid waste gasification process are in good agreement with the experimental dataset and also generalise well to validation (untrained) data. Published experimental datasets are used for model training and validation purposes. The results show the effectiveness of the genetic programming technique for solving complex nonlinear regression problems. The multi-gene genetic programming model is also compared with a single-gene genetic programming model to show the relative merits and demerits of the technique. This study demonstrates that the genetic programming based data-driven modelling strategy can be a good candidate for developing models for other types of fuels as well. Copyright © 2014 Elsevier Ltd. All rights reserved.
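The abstract contrasts multi-gene and single-gene genetic programming without describing either model. In the usual formulation of multi-gene symbolic regression, the prediction is a bias term plus a weighted sum of several evolved expression trees ("genes"), with the weights fitted by ordinary least squares. The sketch below illustrates only that weight-fitting step, assuming this usual formulation applies; the two fixed genes, the data and all function names are hypothetical stand-ins, and no evolutionary search is performed.

```python
from typing import Callable, List, Sequence

Gene = Callable[[float], float]


def solve_linear(a: List[List[float]], b: List[float]) -> List[float]:
    """Solve a * w = b by Gaussian elimination with partial pivoting.

    Assumes the system is non-singular (no zero-pivot handling).
    """
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    w = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(m[r][c] * w[c] for c in range(r + 1, n))
        w[r] = (m[r][n] - tail) / m[r][r]
    return w


def fit_gene_weights(genes: Sequence[Gene], xs: Sequence[float],
                     ys: Sequence[float]) -> List[float]:
    """Least-squares weights [bias, w1, ..., wG] via the normal equations."""
    rows = [[1.0] + [g(x) for g in genes] for x in xs]
    k = len(genes) + 1
    ata = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    aty = [sum(r[i] * y for r, y in zip(rows, ys)) for i in range(k)]
    return solve_linear(ata, aty)


def predict(genes: Sequence[Gene], weights: Sequence[float], x: float) -> float:
    """Multi-gene prediction: bias plus weighted sum of gene outputs."""
    return weights[0] + sum(w * g(x) for w, g in zip(weights[1:], genes))


# Hypothetical genes standing in for evolved trees, and synthetic data.
genes = [lambda x: x, lambda x: x * x]
xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0 + 2.0 * x + 3.0 * x * x for x in xs]
weights = fit_gene_weights(genes, xs, ys)
print([round(w, 6) for w in weights])
```

A single-gene model corresponds to passing one gene only, which is the comparison the abstract refers to.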
Cloning and expression of the gene for bacteriophage T7 RNA polymerase
Studier, F. William; Davanloo, Parichehre; Rosenberg, Alan H.; Moffatt, Barbara A.; Dunn, John J.
1999-02-09
This application describes a means to clone a functional gene for bacteriophage T7 RNA polymerase. Active T7 RNA polymerase is produced from the cloned gene, and a plasmid has been constructed that can produce the active enzyme in large amounts. T7 RNA polymerase transcribes DNA very efficiently and is highly selective for a relatively long promoter sequence. This enzyme is useful for synthesizing large amounts of RNA in vivo or in vitro, and is capable of producing a single RNA selectively from a complex mixture of DNAs. The procedure used to obtain a clone of the T7 RNA polymerase gene can be applied to other T7-like phages to obtain clones that produce RNA polymerases having different promoter specificities, different bacterial hosts, or other desirable properties. T7 RNA polymerase is also used in a system for selective, high-level synthesis of RNAs and proteins in suitable host cells.
Cloning and expression of the gene for bacteriophage T7 RNA polymerase
Studier, F. William; Davanloo, Parichehre; Rosenberg, Alan H.; Moffatt, Barbara A.; Dunn, John J.
1997-12-02
This application describes a means to clone a functional gene for bacteriophage T7 RNA polymerase. Active T7 RNA polymerase is produced from the cloned gene, and a plasmid has been constructed that can produce the active enzyme in large amounts. T7 RNA polymerase transcribes DNA very efficiently and is highly selective for a relatively long promoter sequence. This enzyme is useful for synthesizing large amounts of RNA in vivo or in vitro, and is capable of producing a single RNA selectively from a complex mixture of DNAs. The procedure used to obtain a clone of the T7 RNA polymerase gene can be applied to other T7-like phages to obtain clones that produce RNA polymerases having different promoter specificities, different bacterial hosts, or other desirable properties. T7 RNA polymerase is also used in a system for selective, high-level synthesis of RNAs and proteins in suitable host cells.
Cloning and expression of the gene for bacteriophage T7 RNA polymerase
Studier, F. William; Davanloo, Parichehre; Rosenberg, Alan H.; Moffatt, Barbara A.; Dunn, John J.
1990-01-01
This application describes a means to clone a functional gene for bacteriophage T7 RNA polymerase. Active T7 RNA polymerase is produced from the cloned gene, and a plasmid has been constructed that can produce the active enzyme in large amounts. T7 RNA polymerase transcribes DNA very efficiently and is highly selective for a relatively long promoter sequence. This enzyme is useful for synthesizing large amounts of RNA in vivo or in vitro, and is capable of producing a single RNA selectively from a complex mixture of DNAs. The procedure used to obtain a clone of the T7 RNA polymerase gene can be applied to other T7-like phages to obtain clones that produce RNA polymerases having different promoter specificities, different bacterial hosts, or other desirable properties. T7 RNA polymerase is also used in a system for selective, high-level synthesis of RNAs and proteins in suitable host cells.
Contribution of rare inherited and de novo variants in 2,871 congenital heart disease probands
Jin, Sheng Chih; Homsy, Jason; Zaidi, Samir; Lu, Qiongshi; Morton, Sarah; DePalma, Steven R.; Zeng, Xue; Qi, Hongjian; Chang, Weni; Sierant, Michael C.; Hung, Wei-Chien; Haider, Shozeb; Zhang, Junhui; Knight, James; Bjornson, Robert D.; Castaldi, Christopher; Tikhonova, Irina R.; Bilguvar, Kaya; Mane, Shrikant M.; Sanders, Stephan J.; Mital, Seema; Russell, Mark; Gaynor, William; Deanfield, John; Giardini, Alessandro; Porter, George A.; Srivastava, Deepak; Lo, Cecelia W.; Shen, Yufeng; Watkins, W. Scott; Yandell, Mark; Yost, H. Joseph; Tristani-Firouzi, Martin; Newburger, Jane W.; Roberts, Amy E.; Kim, Richard; Zhao, Hongyu; Kaltman, Jonathan R.; Goldmuntz, Elizabeth; Chung, Wendy K.; Seidman, Jonathan G.; Gelb, Bruce D.; Seidman, Christine E.; Lifton, Richard P.; Brueckner, Martina
2017-01-01
Congenital heart disease (CHD) is the leading cause of mortality from birth defects. Exome sequencing of a single cohort of 2,871 CHD probands including 2,645 parent-offspring trios implicated rare inherited mutations in 1.8%, including a recessive founder mutation in GDF1 accounting for ~5% of severe CHD in Ashkenazim, recessive genotypes in MYH6 accounting for ~11% of Shone complex, and dominant FLT4 mutations accounting for 2.3% of Tetralogy of Fallot. De novo mutations (DNMs) accounted for 8% of cases, including ~3% of isolated CHD patients and ~28% with both neurodevelopmental and extra-cardiac congenital anomalies. Seven genes surpassed thresholds for genome-wide significance and 12 genes not previously implicated in CHD had > 70% probability of being disease-related; DNMs in ~440 genes are inferred to contribute to CHD. There was striking overlap between genes with damaging DNMs in probands with CHD and autism. PMID:28991257
Gonzaga-Jauregui, Claudia; Harel, Tamar; Gambin, Tomasz; Kousi, Maria; Griffin, Laurie B.; Francescatto, Ludmila; Ozes, Burcak; Karaca, Ender; Jhangiani, Shalini; Bainbridge, Matthew N.; Lawson, Kim S.; Pehlivan, Davut; Okamoto, Yuji; Withers, Marjorie; Mancias, Pedro; Slavotinek, Anne; Reitnauer, Pamela J; Goksungur, Meryem T.; Shy, Michael; Crawford, Thomas O.; Koenig, Michel; Willer, Jason; Flores, Brittany N.; Pediaditrakis, Igor; Us, Onder; Wiszniewski, Wojciech; Parman, Yesim; Antonellis, Anthony; Muzny, Donna M.; Katsanis, Nicholas; Battaloglu, Esra; Boerwinkle, Eric; Gibbs, Richard A.; Lupski, James R.
2015-01-01
Charcot-Marie-Tooth (CMT) disease is a clinically and genetically heterogeneous distal symmetric polyneuropathy. Whole-exome sequencing (WES) of 40 individuals from 37 unrelated families with CMT-like peripheral neuropathy refractory to molecular diagnosis identified apparent causal mutations in ~45% (17/37) of families. Three candidate disease genes are proposed, supported by a combination of genetic and in vivo studies. Aggregate analysis of mutation data revealed a significantly increased number of rare variants across 58 neuropathy associated genes in subjects versus controls; confirmed in a second ethnically discrete neuropathy cohort, suggesting mutation burden potentially contributes to phenotypic variability. Neuropathy genes shown to have highly penetrant Mendelizing variants (HMPVs) and implicated by burden in families were shown to interact genetically in a zebrafish assay exacerbating the phenotype established by the suppression of single genes. Our findings suggest that the combinatorial effect of rare variants contributes to disease burden and variable expressivity. PMID:26257172
A Bayesian Supertree Model for Genome-Wide Species Tree Reconstruction
De Oliveira Martins, Leonardo; Mallo, Diego; Posada, David
2016-01-01
Current phylogenomic data sets highlight the need for species tree methods able to deal with several sources of gene tree/species tree incongruence. At the same time, we need to make the most use of all available data. Most species tree methods deal with single processes of phylogenetic discordance, namely, gene duplication and loss, incomplete lineage sorting (ILS) or horizontal gene transfer. In this manuscript, we address the problem of species tree inference from multilocus, genome-wide data sets regardless of the presence of gene duplication and loss and ILS, and therefore without the need to identify orthologs or to use a single individual per species. We do this by extending the idea of Maximum Likelihood (ML) supertrees to a hierarchical Bayesian model where several sources of gene tree/species tree disagreement can be accounted for in a modular manner. We implemented this model in a computer program called guenomu whose inputs are posterior distributions of unrooted gene tree topologies for multiple gene families, and whose output is the posterior distribution of rooted species tree topologies. We conducted extensive simulations to evaluate the performance of our approach in comparison with other species tree approaches able to deal with more than one leaf from the same species. Our method ranked best under simulated data sets, in spite of ignoring branch lengths, and performed well on empirical data, as well as being fast enough to analyze relatively large data sets. Our Bayesian supertree method was also very successful in obtaining better estimates of gene trees, by reducing the uncertainty in their distributions. In addition, our results show that under complex simulation scenarios, gene tree parsimony is also a competitive approach once we consider its speed, in contrast to more sophisticated models. PMID:25281847
The function of dog models in developing gene therapy strategies for human health.
Nowend, Keri L; Starr-Moss, Alison N; Murphy, Keith E
2011-08-01
The domestic dog is of great benefit to humankind, not only through companionship and working activities cultivated through domestication and selective breeding, but also as a model for biomedical research. Many single-gene traits have been well-characterized at the genomic level, and recent advances in whole-genome association studies will allow for better understanding of complex, multigenic hereditary diseases. Additionally, the dog serves as an invaluable large animal model for assessment of novel therapeutic agents. Thus, the dog has filled a crucial step in the translation of basic research to new treatment regimens for various human diseases. Four well-characterized diseases in canine models are discussed as they relate to other animal model availability, novel therapeutic approach, and extrapolation to human gene therapy trials.
Cracking the egg: virtual embryogenesis of real robots.
Cussat-Blanc, Sylvain; Pollack, Jordan
2014-01-01
All multicellular living beings are created from a single cell. A developmental process, called embryogenesis, takes this first fertilized cell down a complex path of reproduction, migration, and specialization into a complex organism adapted to its environment. In most cases, the first steps of the embryogenesis take place in a protected environment such as in an egg or in utero. Starting from this observation, we propose a new approach to the generation of real robots, strongly inspired by living systems. Our robots are composed of tens of specialized cells, grown from a single cell using a bio-inspired virtual developmental process. Virtual cells, controlled by gene regulatory networks, divide, migrate, and specialize to produce the robot's body plan (morphology), and then the robot is manually built from this plan. Because the robot is as easy to assemble as Lego, the building process could be easily automated.
Genomic Regions in Local Endangered Sheep Encode Potentially Favorable Genes.
Moioli, Bianca; Steri, Roberto; Catillo, Gennaro
2018-01-02
The economic evaluation of farm animal genetic resources plays a key role in developing conservation programs. However, to date, the link between diversity as assessed by neutral genetic markers and the functional diversity is not yet understood. Two genome-wide comparisons, using over 44,000 Single Nucleotide Polymorphisms, identified the markers with the highest difference in allele frequency between the Alpago endangered breed and two clusters, composed of four specialized dairy sheep, and four meat breeds respectively. The genes in proximity of these markers were mapped to known pathways of the Gene Ontology to determine which ones were most represented. Our results indicated that the differences of the Alpago breed from the more productive sheep rely upon genes involved in cellular defense and repair mechanisms. A higher number of different markers and genes were detected in the comparison with the specialized dairy sheep. These genes play a role in complex biological processes: metabolic, homeostatic, neurological system, and macromolecular organization; such processes may possibly explain the evolution of gene function as a result of selection to improve milk yield.
Wu, Wenming; Loan, Kieu The Loan; Lee, Nae Yoon
2012-05-07
Consistent temperature control in an on-chip flow-through polymerase chain reaction (PCR) employing two or more heaters is one of the main obstacles for device miniaturization and integration when realizing micro total analysis systems (μTAS), and also leads to operational complexity. In this study, we propose a qiandu (right triangular prism)-shaped polydimethylsiloxane (PDMS) microdevice with serpentine microchannels fabricated on its slanted plane, and apply the device for an on-chip flow-through PCR employing a single heater. The inclined nature of the qiandu-shaped microdevice enables the formation of a surface temperature gradient along the slanted plane of the microdevice in a height-dependent manner by the use of a single heater, and enables liquid to traverse over wide ranges of temperatures, including the three temperature zones--denaturation, annealing, and extension temperatures--required in a typical PCR. The feasibility of the qiandu-shaped PDMS microdevice as a versatile platform for performing a flow-through PCR was examined by employing multiple templates and varying the inclination angle of the device. In addition, the potential of performing a multiplex PCR using a single qiandu-shaped PDMS microdevice was explored. A 409 bp long gene fragment effective as a marker for diagnosing lung cancer and a 230 bp long gene fragment from a plasmid vector were simultaneously amplified in less than 25 min on a single microdevice, paving the way for a microscale, multiplex PCR on a single device employing a single heater.
2018-01-01
Effect-directed analysis (EDA) is a commonly used approach for effect-based identification of endocrine disruptive chemicals in complex (environmental) mixtures. However, for routine toxicity assessment of, for example, water samples, current EDA approaches are considered time-consuming and laborious. We achieved faster EDA and identification by downscaling of sensitive cell-based hormone reporter gene assays and increasing fractionation resolution to allow testing of smaller fractions with reduced complexity. The high-resolution EDA approach is demonstrated by analysis of four environmental passive sampler extracts. Downscaling of the assays to a 384-well format allowed analysis of 64 fractions in triplicate (or 192 fractions without technical replicates) without affecting sensitivity compared to the standard 96-well format. Through a parallel exposure method, agonistic and antagonistic androgen and estrogen receptor activity could be measured in a single experiment following a single fractionation. From 16 selected candidate compounds, identified through nontargeted analysis, 13 could be confirmed chemically and 10 were found to be biologically active, of which the most potent nonsteroidal estrogens were identified as oxybenzone and piperine. The increased fractionation resolution and the higher throughput that downscaling provides allow for future application in routine high-resolution screening of large numbers of samples in order to accelerate identification of (emerging) endocrine disruptors. PMID:29547277
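The plate arithmetic quoted in the abstract above (64 fractions in triplicate, or 192 fractions without technical replicates) can be checked with a short sketch. The function name and the 384-well constant are illustrative; the abstract does not state how the remaining wells of the plate are used, so the sketch only counts sample wells.

```python
# Minimal sketch: count the sample wells needed for a fractionated extract.
# PLATE_WELLS and wells_needed are illustrative names, not from the study.

PLATE_WELLS = 384  # wells in the downscaled plate format named in the abstract


def wells_needed(fractions: int, replicates: int) -> int:
    """Return the number of wells occupied by `fractions` tested `replicates` times each."""
    if fractions < 0 or replicates < 1:
        raise ValueError("fractions must be >= 0 and replicates >= 1")
    return fractions * replicates


if __name__ == "__main__":
    triplicate = wells_needed(64, 3)   # 64 fractions, three technical replicates
    singlet = wells_needed(192, 1)     # 192 fractions, no technical replicates
    print(triplicate, singlet, triplicate <= PLATE_WELLS)
```

Both layouts occupy the same number of sample wells, which is why the abstract presents them as interchangeable options for one plate.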
Functionality of In vitro Reconstituted Group II Intron RmInt1-Derived Ribonucleoprotein Particles
Molina-Sánchez, Maria D.; García-Rodríguez, Fernando M.; Toro, Nicolás
2016-01-01
The functional unit of mobile group II introns is a ribonucleoprotein particle (RNP) consisting of the intron-encoded protein (IEP) and the excised intron RNA. The IEP has reverse transcriptase activity but also promotes RNA splicing, and the RNA-protein complex triggers site-specific DNA insertion by reverse splicing, in a process called retrohoming. In vitro reconstituted ribonucleoprotein complexes from the Lactococcus lactis group II intron Ll.LtrB, which produce a double strand break, have recently been studied as a means of developing group II intron-based gene targeting methods for higher organisms. The Sinorhizobium meliloti group II intron RmInt1 is an efficient mobile retroelement, the dispersal of which appears to be linked to transient single-stranded DNA during replication. The RmInt1 IEP lacks the endonuclease domain (En) and cannot cut the bottom strand to generate the 3′ end to initiate reverse transcription. We used an Escherichia coli expression system to produce soluble and active RmInt1 IEP and reconstituted RNPs with purified components in vitro. The RNPs generated were functional and reverse-spliced into a single-stranded DNA target. This work constitutes the starting point for the use of group II introns lacking DNA endonuclease domain-derived RNPs for highly specific gene targeting methods. PMID:27730127
Normanno, Davide; Vanzi, Francesco; Pavone, Francesco Saverio
2008-01-01
Gene expression regulation is a fundamental biological process which deploys specific sets of genomic information depending on physiological or environmental conditions. Several transcription factors (including lac repressor, LacI) are present in the cell at very low copy number and increase their local concentration by binding to multiple sites on DNA and looping the intervening sequence. In this work, we employ single-molecule manipulation to experimentally address the role of DNA supercoiling in the dynamics and stability of LacI-mediated DNA looping. We performed measurements over a range of degrees of supercoiling between −0.026 and +0.026, in the absence of axial stretching forces. A supercoiling-dependent modulation of the lifetimes of both the looped and unlooped states was observed. Our experiments also provide evidence for multiple structural conformations of the LacI–DNA complex, depending on torsional constraints. The supercoiling-dependent modulation demonstrated here adds an important element to the model of the lac operon. In fact, the complex network of proteins acting on the DNA in a living cell constantly modifies its topological and mechanical properties: our observations demonstrate the possibility of establishing a signaling pathway from factors affecting DNA supercoiling to transcription factors responsible for the regulation of specific sets of genes. PMID:18310101
Flores-Rozas, H; Kolodner, R D
1998-10-13
The Saccharomyces cerevisiae genome encodes four MutL homologs. Of these, MLH1 and PMS1 are known to act in the MSH2-dependent pathway that repairs DNA mismatches. We have investigated the role of MLH3 in mismatch repair. Mutations in MLH3 increased the rate of reversion of the hom3-10 allele by increasing the rate of deletion of a single T in a run of 7 Ts. Combination of mutations in MLH3 and MSH6 caused a synergistic increase in the hom3-10 reversion rate, whereas the hom3-10 reversion rate in an mlh3 msh3 double mutant was the same as in the respective single mutants. Similar results were observed when the accumulation of mutations at frameshift hot spots in the LYS2 gene was analyzed, although mutation of MLH3 did not cause the same extent of effect at every LYS2 frameshift hot spot. MLH3 interacted with MLH1 in a two-hybrid system. These data are consistent with the idea that a proportion of the repair of specific insertion/deletion mispairs by the MSH3-dependent mismatch repair pathway uses a heterodimeric MLH1-MLH3 complex in place of the MLH1-PMS1 complex.
Jia, Peilin; Wang, Lily; Fanous, Ayman H.; Pato, Carlos N.; Edwards, Todd L.; Zhao, Zhongming
2012-01-01
With the recent success of genome-wide association studies (GWAS), a wealth of association data has been accumulated for more than 200 complex diseases/traits, creating a strong demand for data integration and interpretation. A combinatory analysis of multiple GWAS datasets, or an integrative analysis of GWAS data and other high-throughput data, has been particularly promising. In this study, we proposed an integrative analysis framework of multiple GWAS datasets by overlaying association signals onto the protein-protein interaction network, and demonstrated it using schizophrenia datasets. Building on a dense module search algorithm, we first searched for significantly enriched subnetworks for schizophrenia in each single GWAS dataset and then implemented a discovery-evaluation strategy to identify module genes with consistent association signals. We validated the module genes in an independent dataset, and also examined them through meta-analysis of the related SNPs using multiple GWAS datasets. As a result, we identified 205 module genes with a joint effect significantly associated with schizophrenia; these module genes included a number of well-studied candidate genes such as DISC1, GNA12, GNA13, GNAI1, GPR17, and GRIN2B. Further functional analysis suggested these genes are involved in neuron-related processes. Additionally, meta-analysis found that 18 SNPs in 9 module genes had P_meta < 1×10^−4, including the gene HLA-DQA1 located in the MHC region on chromosome 6, which was reported in previous studies using the largest cohort of schizophrenia patients to date. These results demonstrated that our bi-directional network-based strategy is efficient for identifying disease-associated genes with modest signals in GWAS datasets. This approach can be applied to any other complex diseases/traits where multiple GWAS datasets are available. PMID:22792057
A forward genetic screen reveals essential and non-essential RNAi factors in Paramecium tetraurelia
Marker, Simone; Carradec, Quentin; Tanty, Véronique; Arnaiz, Olivier; Meyer, Eric
2014-01-01
In most eukaryotes, small RNA-mediated gene silencing pathways form complex interacting networks. In the ciliate Paramecium tetraurelia, at least two RNA interference (RNAi) mechanisms coexist, involving distinct but overlapping sets of protein factors and producing different types of short interfering RNAs (siRNAs). One is specifically triggered by high-copy transgenes, and the other by feeding cells with double-stranded RNA (dsRNA)-producing bacteria. In this study, we designed a forward genetic screen for mutants deficient in dsRNA-induced silencing, and a powerful method to identify the relevant mutations by whole-genome sequencing. We present a set of 47 mutant alleles for five genes, revealing two previously unknown RNAi factors: a novel Paramecium-specific protein (Pds1) and a Cid1-like nucleotidyl transferase. Analyses of allelic diversity distinguish non-essential and essential genes and suggest that the screen is saturated for non-essential, single-copy genes. We show that non-essential genes are specifically involved in dsRNA-induced RNAi while essential ones are also involved in transgene-induced RNAi. One of the latter, the RNA-dependent RNA polymerase RDR2, is further shown to be required for all known types of siRNAs, as well as for sexual reproduction. These results open the way for the dissection of the genetic complexity, interconnection, mechanisms and natural functions of RNAi pathways in P. tetraurelia. PMID:24860163
CarD uses a minor groove wedge mechanism to stabilize the RNA polymerase open promoter complex.
Bae, Brian; Chen, James; Davis, Elizabeth; Leon, Katherine; Darst, Seth A; Campbell, Elizabeth A
2015-09-08
A key point to regulate gene expression is at transcription initiation, and activators play a major role. CarD, an essential activator in Mycobacterium tuberculosis, is found in many bacteria, including Thermus species, but absent in Escherichia coli. To delineate the molecular mechanism of CarD, we determined crystal structures of Thermus transcription initiation complexes containing CarD. The structures show CarD interacts with the unique DNA topology presented by the upstream double-stranded/single-stranded DNA junction of the transcription bubble. We confirm that our structures correspond to functional activation complexes, and extend our understanding of the role of a conserved CarD Trp residue that serves as a minor groove wedge, preventing collapse of the transcription bubble to stabilize the transcription initiation complex. Unlike E. coli RNAP, many bacterial RNAPs form unstable promoter complexes, explaining the need for CarD.
Cohen, M M
1989-12-01
The role of chance using a stochastic single gene model has been shown to generate a continuous liability curve resembling that obtained from a multifactorial threshold model. Segregation of some malformations may be explained by a single defective gene that predisposes to, but does not necessarily result in, the malformation. Low penetrance and remarkably variable expressivity that characterize a number of presumed autosomal dominant malformation syndromes are possibly reflections of specific stochastic influences that are intrinsic to the embryonic process itself. Gene analysis is discussed and illustrated. Using polymorphic DNA probes to study cleft palate and ankyloglossia in males and ankyloglossia only in females in a large Icelandic family, the responsible gene was found to be located on the long arm of the X chromosome in the Xq21.1 region. In addition to gene analysis, some of the implications of transgenic analysis using mice are discussed. Among disorders of collagen metabolism, both the osteogenesis imperfectas and the Ehlers-Danlos syndromes are shown to represent genetically heterogeneous groups of connective tissue disorders. The days of thinking about osteogenesis imperfecta as one disorder and the Ehlers-Danlos syndrome as another are a thing of the past; persistence of such thinking is erroneous and misleading. Of the many disorders affecting bone mineral, the complexities of hypophosphatasia and pseudohypoparathyroidism are singled out for discussion. For lysosomal storage disorders, an overview of the mucopolysaccharidoses is provided. Finally, the recently delineated peroxisomal disorders--hyperpipecolic acidemia, rhizomelic chondrodysplasia, neonatal adrenoleukodystrophy, Zellweger syndrome, and infantile Refsum disease--are known to share a distinctive biochemical phenotype, although fibroblast complementation analysis suggests that some of these disorders are etiologically distinct.
Draft genomes of two blister beetles Hycleus cichorii and Hycleus phaleratus
Wu, Yuan-Ming; Li, Jiang
2018-01-01
Background: Commonly known as blister beetles or Spanish fly, there are more than 1500 species in the Meloidae family (Hexapoda: Coleoptera: Tenebrionoidea) that produce the potent defensive blistering agent cantharidin. Cantharidin and its derivatives have been used to treat cancers such as liver, stomach, lung, and esophageal cancers. Hycleus cichorii and Hycleus phaleratus are the most commercially important blister beetles in China due to their ability to biosynthesize this potent vesicant. However, there is a lack of genome reference, which has hindered development of studies on the biosynthesis of cantharidin and a better understanding of its biology and pharmacology. Results: We report 2 draft genomes and quantified gene sets for the blister beetles H. cichorii and H. phaleratus, 2 complex genomes with >72% repeats and approximately 1% heterozygosity, using Illumina sequencing data. An integrated assembly pipeline was performed for assembly, and most of the coding regions were obtained. Benchmarking universal single-copy orthologs (BUSCO) assessment showed that our assembly obtained more than 98% of the Endopterygota universal single-copy orthologs. Comparison analysis showed that the completeness of coding genes in our assembly was comparable to other beetle genomes such as Dendroctonus ponderosae and Agrilus planipennis. Gene annotation yielded 13 813 and 13 725 protein-coding genes in H. cichorii and H. phaleratus, of which approximately 89% were functionally annotated. BUSCO assessment showed that approximately 86% and 84% of the Endopterygota universal single-copy orthologs were annotated completely in these 2 gene sets, whose completeness is comparable to that of D. ponderosae and A. planipennis. Conclusions: Assembly of both blister beetle genomes provides a valuable resource for future biosynthesis of cantharidin and comparative genomic studies of blister beetles and other beetles. PMID:29444297
Draft genomes of two blister beetles Hycleus cichorii and Hycleus phaleratus.
Wu, Yuan-Ming; Li, Jiang; Chen, Xiang-Sheng
2018-03-01
Commonly known as blister beetles or Spanish fly, there are more than 1500 species in the Meloidae family (Hexapoda: Coleoptera: Tenebrionoidea) that produce the potent defensive blistering agent cantharidin. Cantharidin and its derivatives have been used to treat cancers such as liver, stomach, lung, and esophageal cancers. Hycleus cichorii and Hycleus phaleratus are the most commercially important blister beetles in China due to their ability to biosynthesize this potent vesicant. However, there is a lack of genome reference, which has hindered development of studies on the biosynthesis of cantharidin and a better understanding of its biology and pharmacology. We report 2 draft genomes and quantified gene sets for the blister beetles H. cichorii and H. phaleratus, 2 complex genomes with >72% repeats and approximately 1% heterozygosity, using Illumina sequencing data. An integrated assembly pipeline was performed for assembly, and most of the coding regions were obtained. Benchmarking universal single-copy orthologs (BUSCO) assessment showed that our assembly obtained more than 98% of the Endopterygota universal single-copy orthologs. Comparison analysis showed that the completeness of coding genes in our assembly was comparable to other beetle genomes such as Dendroctonus ponderosae and Agrilus planipennis. Gene annotation yielded 13 813 and 13 725 protein-coding genes in H. cichorii and H. phaleratus, of which approximately 89% were functionally annotated. BUSCO assessment showed that approximately 86% and 84% of the Endopterygota universal single-copy orthologs were annotated completely in these 2 gene sets, whose completeness is comparable to that of D. ponderosae and A. planipennis. Assembly of both blister beetle genomes provides a valuable resource for future biosynthesis of cantharidin and comparative genomic studies of blister beetles and other beetles.
Sugino, N; Kawahara, M; Tatsumi, G; Kanai, A; Matsui, H; Yamamoto, R; Nagai, Y; Fujii, S; Shimazu, Y; Hishizawa, M; Inaba, T; Andoh, A; Suzuki, T; Takaori-Kondo, A
2017-11-01
Lysine-specific demethylase 1 (LSD1) regulates gene expression by affecting histone modifications and is a promising target for acute myeloid leukemia (AML) with specific genetic abnormalities. Novel LSD1 inhibitors, NCD25 and NCD38, inhibited growth of MLL-AF9 leukemia as well as erythroleukemia, megakaryoblastic leukemia and myelodysplastic syndromes (MDSs) overt leukemia cells at concentrations that spared normal hematopoiesis. NCD25 and NCD38 invoked the myeloid development programs, hindered the MDS and AML oncogenic programs, and commonly upregulated 62 genes in several leukemia cells. NCD38 elevated H3K27ac level on enhancers of these LSD1 signature genes and newly activated ~500 super-enhancers. Upregulated genes with super-enhancer activation in erythroleukemia cells were enriched in leukocyte differentiation. Eleven genes including GFI1 and ERG, but not CEBPA, were identified as the LSD1 signature with super-enhancer activation. Super-enhancers of these genes were activated prior to induction of the transcripts and myeloid differentiation. Depletion of GFI1 attenuated myeloid differentiation by NCD38. Finally, a single administration of NCD38 caused the in vivo eradication of primary MDS-related leukemia cells with a complex karyotype. Together, NCD38 derepresses super-enhancers of hematopoietic regulators that are silenced abnormally by LSD1, attenuates leukemogenic programs and consequently exerts an anti-leukemic effect against MDS-related leukemia with adverse outcome.
Nutrigenomics and nutrigenetics in inflammatory bowel diseases.
Gruber, Lisa; Lichti, Pia; Rath, Eva; Haller, Dirk
2012-10-01
Inflammatory bowel diseases (IBD) including ulcerative colitis and Crohn's disease are chronically relapsing, immune-mediated disorders of the gastrointestinal tract. A major challenge in the treatment of IBD is the heterogeneous nature of these pathologies. Both ulcerative colitis and Crohn's disease are of multifactorial etiology and feature a complex interaction of host genetic susceptibility and environmental factors such as diet and gut microbiota. Genome-wide association studies identified disease-relevant single-nucleotide polymorphisms in approximately 100 genes, but at the same time twin studies also clearly indicated a strong environmental impact in disease development. However, attempts to link dietary factors to the risk of developing IBD, based on epidemiological observations, have shown controversial outcomes. Yet, emerging high-throughput technologies implying complete biological systems might allow taking nutrient-gene interactions into account for a better classification of patient subsets in the future. In this context, 2 new scientific fields, "nutrigenetics" and "nutrigenomics," have been established. "Nutrigenetics" studies the effect of genetic variations on nutrient-gene interactions, and "nutrigenomics" describes the impact of nutrition on physiology and health status on the level of gene transcription, protein expression, and metabolism. It is hoped that the integration of both research areas will promote the understanding of the complex gene-environment interaction in IBD etiology and in the long term will lead to personalized nutrition for disease prevention and treatment. This review briefly summarizes data on the impact of nutrients on intestinal inflammation, highlights nutrient-gene interactions, and addresses the potential of applying "omic" technologies in the context of IBD.
Oikkonen, J; Huang, Y; Onkamo, P; Ukkola-Vuoti, L; Raijas, P; Karma, K; Vieland, V J; Järvelä, I
2015-02-01
Humans have developed the perception, production and processing of sounds into the art of music. A genetic contribution to these skills of musical aptitude has long been suggested. We performed a genome-wide scan in 76 pedigrees (767 individuals) characterized for the ability to discriminate pitch (SP), duration (ST) and sound patterns (KMT), which are primary capacities for music perception. Using the Bayesian linkage and association approach implemented in program package KELVIN, especially designed for complex pedigrees, several single nucleotide polymorphisms (SNPs) near genes affecting the functions of the auditory pathway and neurocognitive processes were identified. The strongest association was found at 3q21.3 (rs9854612) with combined SP, ST and KMT test scores (COMB). This region is located a few dozen kilobases upstream of the GATA binding protein 2 (GATA2) gene. GATA2 regulates the development of cochlear hair cells and the inferior colliculus (IC), which are important in tonotopic mapping. The highest probability of linkage was obtained for phenotype SP at 4p14, located next to the region harboring the protocadherin 7 gene, PCDH7. Two SNPs rs13146789 and rs13109270 of PCDH7 showed strong association. PCDH7 has been suggested to play a role in cochlear and amygdaloid complexes. Functional class analysis showed that inner ear and schizophrenia-related genes were enriched inside the linked regions. This study is the first to show the importance of auditory pathway genes in musical aptitude.
Sorrentino, F S; Gallenga, C E; Bonifazzi, C; Perri, P
2016-01-01
Retinitis pigmentosa (RP) is a group of inherited retinal disorders characterized by a complex association between tremendous genotypic multiplicity and great phenotypic heterogeneity. The severity of the clinical manifestation depends on penetrance and expressivity of the disease gene. Also, various interactions between gene expression and environmental factors have been hypothesized. More than 250 genes with ~4500 causative mutations have been reported to be involved in different RP-related mechanisms. At present, no more than 50% of RP cases are attributable to identified genes, whereas the remaining molecular defects are still undetectable, especially in populations where few genetic screenings have been performed. Therefore, new genetic strategies can be a remarkably useful tool to aid clinical diagnosis, potentially modifying treatment options and family counseling. Genome-wide analytical techniques (array comparative genomic hybridization and single-nucleotide polymorphism genotyping) and DNA sequencing strategies (arrayed primer extension, Sanger sequencing, and ultra high-throughput sequencing) are successfully used to make an early molecular diagnosis by detecting single or multiple mutations within the large heterogeneity of RP. Further research is needed to better investigate the genotype/phenotype correlation, combining genetic and clinical findings to provide detailed information concerning the risk of RP development and novel effective treatments. PMID:27564722
Jobst-Schwan, Tilman; Schmidt, Johanna Magdalena; Schneider, Ronen; Hoogstraten, Charlotte A; Ullmann, Jeremy F P; Schapiro, David; Majmundar, Amar J; Kolb, Amy; Eddy, Kaitlyn; Shril, Shirlee; Braun, Daniela A; Poduri, Annapurna; Hildebrandt, Friedhelm
2018-01-01
Until recently, morpholino oligonucleotides have been widely employed in zebrafish as an acute and efficient loss-of-function assay. However, off-target effects and reproducibility issues when compared to stable knockout lines have compromised their further use. Here we employed an acute CRISPR/Cas approach using multiple single guide RNAs (sgRNAs) simultaneously targeting different positions in two exemplar genes (osgep or tprkb) to increase the likelihood of generating mutations on both alleles in the injected F0 generation and to achieve a similar effect as morpholinos but with the reproducibility of stable lines. This multi-sgRNA approach resulted in median likelihoods for at least one mutation on each allele of >99% and sgRNA-specific insertion/deletion profiles as revealed by deep sequencing. Immunoblot showed a significant reduction for Osgep and Tprkb proteins. For both genes, the acute multi-sgRNA knockout recapitulated the microcephaly phenotype and reduction in survival that we observed previously in stable knockout lines, though milder in the acute multi-sgRNA knockout. Finally, we quantified the degree of mutagenesis by deep sequencing, and provide a mathematical model to quantitate the chance for a biallelic loss-of-function mutation. Our findings can be generalized to acute and stable CRISPR/Cas targeting for any zebrafish gene of interest.
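The abstract above does not give the mathematical model itself. As a rough illustration only, the sketch below assumes that each sgRNA mutates a given allele independently with its own efficiency and that the two alleles are hit independently; under those assumptions the chance that both alleles carry at least one mutation follows from the complement rule. The function names and the example efficiencies are hypothetical and are not taken from the study.

```python
from math import prod

# Illustrative sketch, NOT the authors' model. Assumptions (labeled loudly):
#   1. each sgRNA mutates an allele independently with probability p_i;
#   2. the two alleles of a diploid locus are mutated independently.


def allele_hit_probability(efficiencies):
    """P(at least one of the sgRNAs mutates a single allele)."""
    for p in efficiencies:
        if not 0.0 <= p <= 1.0:
            raise ValueError("each efficiency must lie in [0, 1]")
    # Complement rule: 1 - P(every sgRNA fails on this allele).
    return 1.0 - prod(1.0 - p for p in efficiencies)


def biallelic_hit_probability(efficiencies):
    """P(both alleles carry at least one mutation), under assumption 2."""
    return allele_hit_probability(efficiencies) ** 2


if __name__ == "__main__":
    example = [0.8, 0.7, 0.9, 0.6]  # hypothetical per-sgRNA efficiencies
    print(f"per allele: {allele_hit_probability(example):.4f}")
    print(f"both alleles: {biallelic_hit_probability(example):.4f}")
```

The sketch shows why adding guides helps: each extra sgRNA multiplies the per-allele failure probability by another factor below one. A mutation is not necessarily loss-of-function, so this quantity is an upper bound on the biallelic loss-of-function chance under the same assumptions.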
Madampage, Claudia Avis; Rawlyk, Neil; Crockford, Gordon; Van Donkersgoed, Joyce; Dorin, Craig; Potter, Andrew
2015-07-01
Histophilus somni, a causative agent of the bovine respiratory disease complex, can also cause a variety of systemic disorders, including bronchopneumonia, myocarditis, pericarditis, arthritis, pleuritis, and infectious thrombotic meningoencephalitis. The purpose of this study was to determine if currently circulating strains differ from those of the 1980s by identifying genomic changes. Single nucleotide polymorphisms (SNPs) and insertion and deletion (INDEL) sites were examined by whole-genome sequencing in 12 samples, 6 old and 6 new. The 31 028 SNP/INDELs recorded were compared against the reference genome sequence of the pathogenic H. somni strain 2336. The distribution of about 75% of these SNPs within a specified gene differed between old and new isolates and did not follow any particular pattern. The other 25% clustered into 2 groups containing the same SNPs in various genes: group I included 5 old isolates and 1 new isolate; group II included 5 new isolates and 1 old isolate. For putative virulence genes there were more SNPs in group I compared with strain 2336, itself an older isolate, than in group II. Although only 25% of all the SNPs formed 2 clusters, the results suggest some genetic difference in various genes between old and new strains.
Madampage, Claudia Avis; Rawlyk, Neil; Crockford, Gordon; Van Donkersgoed, Joyce; Dorin, Craig; Potter, Andrew
2015-01-01
Histophilus somni, a causative agent of the bovine respiratory disease complex, can also cause a variety of systemic disorders, including bronchopneumonia, myocarditis, pericarditis, arthritis, pleuritis, and infectious thrombotic meningoencephalitis. The purpose of this study was to determine if currently circulating strains differ from those of the 1980s by identifying genomic changes. Single nucleotide polymorphisms (SNPs) and insertion and deletion (INDEL) sites were examined by whole-genome sequencing in 12 samples, 6 old and 6 new. The 31 028 SNP/INDELs recorded were compared against the reference genome sequence of the pathogenic H. somni strain 2336. The distribution of about 75% of these SNPs within a specified gene differed between old and new isolates and did not follow any particular pattern. The other 25% clustered into 2 groups containing the same SNPs in various genes: group I included 5 old isolates and 1 new isolate; group II included 5 new isolates and 1 old isolate. For putative virulence genes there were more SNPs in group I compared with strain 2336, itself an older isolate, than in group II. Although only 25% of all the SNPs formed 2 clusters, the results suggest some genetic difference in various genes between old and new strains. PMID:26130851
Single and multiple phenotype QTL analyses of downy mildew resistance in interspecific grapevines.
Divilov, Konstantin; Barba, Paola; Cadle-Davidson, Lance; Reisch, Bruce I
2018-05-01
Downy mildew resistance across days post-inoculation, experiments, and years in two interspecific grapevine F1 families was investigated using linear mixed models and Bayesian networks, and five new QTL were identified. Breeding grapevines for downy mildew disease resistance has traditionally relied on qualitative gene resistance, which can be overcome by pathogen evolution. Analyzing two interspecific F1 families, both having ancestry derived from Vitis vinifera and wild North American Vitis species, across 2 years and multiple experiments, we found multiple loci associated with downy mildew sporulation and hypersensitive response in both families using a single phenotype model. The loci explained between 7 and 17% of the variance for either phenotype, suggesting a complex genetic architecture for these traits in the two families studied. For two loci, we used RNA-Seq to detect differentially transcribed genes and found that the candidate genes at these loci were likely not NBS-LRR genes. Additionally, using a multiple phenotype Bayesian network analysis, we found effects between the leaf trichome density, hypersensitive response, and sporulation phenotypes. Moderate-high heritabilities were found for all three phenotypes, suggesting that selection for downy mildew resistance is an achievable goal by breeding for either physical- or non-physical-based resistance mechanisms, with the combination of the two possibly providing durable resistance.
Aggressive behavior, related conduct problems, and variation in genes affecting dopamine turnover.
Grigorenko, Elena L; De Young, Colin G; Eastman, Maria; Getchell, Marya; Haeffel, Gerald J; Klinteberg, Britt af; Koposov, Roman A; Oreland, Lars; Pakstis, Andrew J; Ponomarev, Oleg A; Ruchkin, Vladislav V; Singh, Jay P; Yrigollen, Carolyn M
2010-01-01
A number of dopamine-related genes have been implicated in the etiology of violent behavior and conduct problems. Of these genes, the ones that code for the enzymes that influence the turnover of dopamine (DA) have received the most attention. In this study, we investigated 12 genetic polymorphisms in four genes involved with DA functioning (COMT, MAOA and MAOB, and DbetaH) in 179 incarcerated male Russian adolescents and two groups of matched controls: boys without criminal records referred to by their teachers as (a) "troubled-behavior-free" boys, n=182; and (b) "troubled-behavior" boys, n=60. The participants were classified as (1) being incarcerated or not, (2) having the DSM-IV diagnosis of conduct disorder (CD) or not, and (3) having committed violent or nonviolent crimes (for the incarcerated individuals only). The findings indicate that, although no single genetic variant in any of the four genes differentiated individuals in the investigated groups, various linear combinations (i.e., haplotypes) and nonlinear combinations (i.e., interactions between variants within and across genes) of genetic variants resulted in informative and robust classifications for two of the three groupings. These combinations of genetic variants differentiated individuals in incarceration vs. nonincarcerated and CD vs. no-CD groups; no informative combinations were established consistently for the grouping by crime within the incarcerated individuals. This study underscores the importance of considering multiple rather than single markers within candidate genes and their additive and interactive combinations, both with themselves and with nongenetic indicators, while attempting to understand the genetic background of such complex behaviors as serious conduct problems. (c) 2010 Wiley-Liss, Inc.
Examination of association to autism of common genetic variation in genes related to dopamine.
Anderson, B M; Schnetz-Boutaud, N; Bartlett, J; Wright, H H; Abramson, R K; Cuccaro, M L; Gilbert, J R; Pericak-Vance, M A; Haines, J L
2008-12-01
Autism is a severe neurodevelopmental disorder characterized by a triad of complications. Autistic individuals display significant disturbances in language and reciprocal social interactions, combined with repetitive and stereotypic behaviors. Prevalence studies suggest that autism is more common than originally believed, with recent estimates citing a rate of one in 150. Although multiple genetic linkage and association studies have yielded multiple suggestive genes or chromosomal regions, a specific risk locus has yet to be identified and widely confirmed. Because many etiologies have been suggested for this complex syndrome, we hypothesize that one of the difficulties in identifying autism genes is that multiple genetic variants may be required to significantly increase the risk of developing autism. Thus, we took the alternative approach of examining 14 prominent dopamine pathway candidate genes for detailed study by genotyping 28 single nucleotide polymorphisms. Although we did observe a nominally significant association for rs2239535 (P=0.008) on chromosome 20, single-locus analysis did not reveal any results as significant after correction for multiple comparisons. No significant interaction was identified when Multifactor Dimensionality Reduction was employed to test specifically for multilocus effects. Although genome-wide linkage scans in autism have provided support for linkage to various loci along the dopamine pathway, our study does not provide strong evidence of linkage or association to any specific gene or combination of genes within the pathway. These results demonstrate that common genetic variation within the tested genes located within this pathway plays at most a minor to moderate role in overall autism pathogenesis.
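The abstract above reports a nominal P=0.008 that does not survive correction for multiple comparisons across 28 SNPs, without naming the correction method. As a hedged illustration of why that can happen, the sketch below applies a Bonferroni correction at a family-wise alpha of 0.05. Both the method and the alpha level are assumptions made for this example, not details taken from the study, and the function names are hypothetical.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold under a Bonferroni correction."""
    if n_tests < 1:
        raise ValueError("n_tests must be at least 1")
    return alpha / n_tests


def survives_bonferroni(p_value, alpha, n_tests):
    """True if a nominal p-value remains significant after correction."""
    return p_value < bonferroni_threshold(alpha, n_tests)


# Values from the abstract: 28 genotyped SNPs, nominal P = 0.008.
# Assumed for illustration: Bonferroni correction, alpha = 0.05.
n_snps = 28
nominal_p = 0.008
threshold = bonferroni_threshold(0.05, n_snps)
print(f"threshold: {threshold:.5f}")
print(f"significant after correction: {survives_bonferroni(nominal_p, 0.05, n_snps)}")
```

With these assumed settings the per-test threshold is 0.05/28, roughly 0.0018, so a nominal P of 0.008 falls short. Less conservative procedures could give a different threshold.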
Physiogenomic Analysis of Localized fMRI Brain Activity in Schizophrenia
Windemuth, Andreas; Calhoun, Vince D.; Pearlson, Godfrey D.; Kocherla, Mohan; Jagannathan, Kanchana; Ruaño, Gualberto
2009-01-01
The search for genetic factors associated with disease is complicated by the complexity of the biological pathways linking genotype and phenotype. This analytical complexity is particularly concerning in diseases historically lacking reliable diagnostic biological markers, such as schizophrenia and other mental disorders. We investigate the use of functional magnetic resonance imaging (fMRI) as an intermediate phenotype (endophenotype) to identify physiogenomic associations to schizophrenia. We screened 99 subjects (30 diagnosed with schizophrenia, 13 unaffected relatives of schizophrenia patients, and 56 unrelated controls) for gene polymorphisms associated with fMRI activation patterns at two locations in temporal and frontal lobes previously implicated in schizophrenia. A total of 22 single nucleotide polymorphisms (SNPs) in 15 genes from the dopamine and serotonin neurotransmission pathways were genotyped in all subjects. We identified three SNPs in genes that are significantly associated with fMRI activity. SNPs of the dopamine beta-hydroxylase (DBH) gene and of the dopamine receptor D4 (DRD4) were associated with activity in the temporal and frontal lobes, respectively. One SNP of serotonin-3A receptor (HTR3A) was associated with temporal lobe activity. The results of this study support the physiogenomic analysis of neuroimaging data to discover associations between genotype and disease-related phenotypes. PMID:18330705
High-Content Analysis of CRISPR-Cas9 Gene-Edited Human Embryonic Stem Cells.
Carlson-Stevermer, Jared; Goedland, Madelyn; Steyer, Benjamin; Movaghar, Arezoo; Lou, Meng; Kohlenberg, Lucille; Prestil, Ryan; Saha, Krishanu
2016-01-12
CRISPR-Cas9 gene editing of human cells and tissues holds much promise to advance medicine and biology, but standard editing methods require weeks to months of reagent preparation and selection where much or all of the initial edited samples are destroyed during analysis. ArrayEdit, a simple approach utilizing surface-modified multiwell plates containing one-pot transcribed single-guide RNAs, separates thousands of edited cell populations for automated, live, high-content imaging and analysis. The approach lowers the time and cost of gene editing and produces edited human embryonic stem cells at high efficiencies. Edited genes can be expressed in both pluripotent stem cells and differentiated cells. This preclinical platform adds important capabilities to observe editing and selection in situ within complex structures generated by human cells, ultimately enabling optical and other molecular perturbations in the editing workflow that could refine the specificity and versatility of gene editing. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
Yang, Xinan Holly; Li, Meiyi; Wang, Bin; Zhu, Wanqi; Desgardin, Aurelie; Onel, Kenan; de Jong, Jill; Chen, Jianjun; Chen, Luonan; Cunningham, John M
2015-03-24
Genes that regulate stem cell function are suspected to exert adverse effects on prognosis in malignancy. However, diverse cancer stem cell signatures are difficult for physicians to interpret and apply clinically. To connect the transcriptome and stem cell biology, with potential clinical applications, we propose a novel computational "gene-to-function, snapshot-to-dynamics, and biology-to-clinic" framework to uncover core functional gene-set signatures. This framework incorporates three function-centric gene-set analysis strategies: a meta-analysis of both microarray and RNA-seq data, novel dynamic network mechanism (DNM) identification, and a personalized prognostic indicator analysis. This work uses the complex disease acute myeloid leukemia (AML) as a research platform. We introduced an adjustable "soft threshold" to a functional gene-set algorithm and found that two different analysis methods identified distinct gene-set signatures from the same samples. We identified a 30-gene cluster that characterizes leukemic stem cell (LSC)-depleted cells and a 25-gene cluster that characterizes LSC-enriched cells in parallel; both mark favorable prognosis in AML. Genes within each signature significantly share common biological processes and/or molecular functions (empirical p = 6e-5 and 0.03 respectively). The 25-gene signature reflects the abnormal development of stem cells in AML, such as AURKA over-expression. We subsequently determined that the clinical relevance of both signatures is independent of known clinical risk classifications in 214 patients with cytogenetically normal AML. We successfully validated the prognosis of both signatures in two independent cohorts of 91 and 242 patients respectively (log-rank p < 0.0015 and 0.05; empirical p < 0.015 and 0.08). The proposed algorithms and computational framework will harness systems biology research because they efficiently translate gene sets (rather than single genes) into biological discoveries about AML and other complex diseases.
Core histone genes of Giardia intestinalis: genomic organization, promoter structure, and expression
Yee, Janet; Tang, Anita; Lau, Wei-Ling; Ritter, Heather; Delport, Dewald; Page, Melissa; Adam, Rodney D; Müller, Miklós; Wu, Gang
2007-01-01
Background Giardia intestinalis is a protist found in freshwaters worldwide, and is the most common cause of parasitic diarrhea in humans. The phylogenetic position of this parasite is still much debated. Histones are small, highly conserved proteins that associate tightly with DNA to form chromatin within the nucleus. There are two classes of core histone genes in higher eukaryotes: DNA replication-independent histones and DNA replication-dependent ones. Results We identified two copies each of the core histone H2a, H2b and H3 genes, and three copies of the H4 gene, at separate locations on chromosomes 3, 4 and 5 within the genome of Giardia intestinalis, but no gene encoding a H1 linker histone could be recognized. The copies of each gene share extensive DNA sequence identities throughout their coding and 5' noncoding regions, which suggests these copies have arisen from relatively recent gene duplications or gene conversions. The transcription start sites are at triplet A sequences 1–27 nucleotides upstream of the translation start codon for each gene. We determined that a 50 bp region upstream from the start of the histone H4 coding region is the minimal promoter, and a highly conserved 15 bp sequence called the histone motif (him) is essential for its activity. The Giardia core histone genes are constitutively expressed at approximately equivalent levels and their mRNAs are polyadenylated. Competition gel-shift experiments suggest that a factor within the protein complex that binds him may also be a part of the protein complexes that bind other promoter elements described previously in Giardia. Conclusion In contrast to other eukaryotes, the Giardia genome has only a single class of core histone genes that encode replication-independent histones. Our inability to locate a gene encoding the linker histone H1 leads us to speculate that the H1 protein may not be required for the compaction of Giardia's small and gene-rich genome. PMID:17425802
Evolution of genome size and complexity in the rhabdoviridae.
Walker, Peter J; Firth, Cadhla; Widen, Steven G; Blasdell, Kim R; Guzman, Hilda; Wood, Thomas G; Paradkar, Prasad N; Holmes, Edward C; Tesh, Robert B; Vasilakis, Nikos
2015-02-01
RNA viruses exhibit substantial structural, ecological and genomic diversity. However, genome size in RNA viruses is likely limited by a high mutation rate, resulting in the evolution of various mechanisms to increase complexity while minimising genome expansion. Here we conduct a large-scale analysis of the genome sequences of 99 animal rhabdoviruses, including 45 genomes which we determined de novo, to identify patterns of genome expansion and the evolution of genome complexity. All but seven of the rhabdoviruses clustered into 17 well-supported monophyletic groups, of which eight corresponded to established genera, seven were assigned as new genera, and two were taxonomically ambiguous. We show that the acquisition and loss of new genes appears to have been a central theme of rhabdovirus evolution, and has been associated with the appearance of alternative, overlapping and consecutive ORFs within the major structural protein genes, and the insertion and loss of additional ORFs in each gene junction in a clade-specific manner. Changes in the lengths of gene junctions accounted for as much as 48.5% of the variation in genome size from the smallest to the largest genome, and the frequency with which new ORFs were observed increased in the 3' to 5' direction along the genome. We also identify several new families of accessory genes encoded in these regions, and show that non-canonical expression strategies involving TURBS-like termination-reinitiation, ribosomal frame-shifts and leaky ribosomal scanning appear to be common. We conclude that rhabdoviruses have an unusual capacity for genomic plasticity that may be linked to their discontinuous transcription strategy from the negative-sense single-stranded RNA genome, and propose a model that accounts for the regular occurrence of genome expansion and contraction throughout the evolution of the Rhabdoviridae.
Cox, Liza L; Cox, Timothy C; Moreno Uribe, Lina M; Zhu, Ying; Richter, Chika T; Nidey, Nichole; Standley, Jennifer M; Deng, Mei; Blue, Elizabeth; Chong, Jessica X; Yang, Yueqin; Carstens, Russ P; Anand, Deepti; Lachke, Salil A; Smith, Joshua D; Dorschner, Michael O; Bedell, Bruce; Kirk, Edwin; Hing, Anne V; Venselaar, Hanka; Valencia-Ramirez, Luz C; Bamshad, Michael J; Glass, Ian A; Cooper, Jonathan A; Haan, Eric; Nickerson, Deborah A; van Bokhoven, Hans; Zhou, Huiqing; Krahn, Katy N; Buckley, Michael F; Murray, Jeffrey C; Lidral, Andrew C; Roscioli, Tony
2018-06-07
Non-syndromic cleft lip with or without cleft palate (NS-CL/P) is one of the most common human birth defects and is generally considered a complex trait. Despite numerous loci identified by genome-wide association studies, the effect sizes of common variants are relatively small, with much of the presumed genetic contribution remaining elusive. We report exome-sequencing results in 209 people from 72 multi-affected families with pedigree structures consistent with autosomal-dominant inheritance and variable penetrance. Herein, pathogenic variants are described in four genes encoding components of the p120-catenin complex (CTNND1, PLEKHA7, PLEKHA5) and an epithelial splicing regulator (ESRP2), in addition to the known CL/P-associated gene, CDH1, which encodes E-cadherin. The findings were also validated in a second cohort of 497 people with NS-CL/P, comprising small families and singletons with pathogenic variants in these genes identified in 14% of multi-affected families and 2% of the replication cohort of smaller families. Enriched expression of each gene/protein in human and mouse embryonic oro-palatal epithelia, demonstration of functional impact of CTNND1 and ESRP2 variants, and recapitulation of the CL/P spectrum in Ctnnd1 knockout mice support a causative role in CL/P pathogenesis. These data show that primary defects in regulators of epithelial cell adhesion are the most significant contributors to NS-CL/P identified to date and that inherited and de novo single gene variants explain a substantial proportion of NS-CL/P. Copyright © 2018 American Society of Human Genetics. All rights reserved.
Kong, Fenfen; Liu, Fei; Li, Wei; Guo, Xiaomeng; Wang, Zuhua; Zhang, Hanbo; Li, Qingpo; Luo, Lihua; Du, Yongzhong; Jin, Yi; You, Jian
2016-12-01
Near-infrared (NIR) laser-controlled gene delivery offers benefits in gene therapy, including enhanced gene transfection efficiency. In this study, a "photothermal transfection" agent is obtained by wrapping poly(ethylenimine)-cholesterol derivatives (PEI-Chol) around single-walled carbon nanotubes (SWNTs). The PEI-Chol modified SWNTs (PCS) are effective in compressing DNA molecules and protecting them from DNaseI degradation. Compared to the complexes formed by PEI with DNA (PEI/DNA), the complexes formed by PCS with DNA (PCS/DNA) exhibit slightly lower toxicity to HEK293 and HeLa cells at the same PEI molecular weight and weight ratios. Notably, caveolae-mediated cellular uptake of PCS/DNA occurs, which results in a safer intracellular transport of the gene due to decreased lysosomal degradation in comparison with that of PEI/DNA, whose internalization mainly depends on clathrin rather than caveolae. Furthermore, unlike PEI/DNA, PCS/DNA exhibits a photothermal conversion ability, which promotes DNA release from PCS under NIR laser irradiation. The NIR laser-mediated photothermal transfection of PCS10K/plasmid TP53 (pTP53) results in more apoptosis and necrosis of HeLa cells in vitro than other groups, and achieves a higher tumor-growth inhibition in vivo than naked pTP53, PEI25K/pTP53, and PCS10K/pTP53 alone. The enhanced transfection efficiency of PCS/DNA can be attributed to more efficient DNA internalization into the tumor cells, promoted detachment of DNA from PCS under NIR laser mediation, and higher DNA stability in the cells due to caveolae-mediated cellular uptake of the complexes. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Evolution of Genome Size and Complexity in the Rhabdoviridae
Walker, Peter J.; Firth, Cadhla; Widen, Steven G.; Blasdell, Kim R.; Guzman, Hilda; Wood, Thomas G.; Paradkar, Prasad N.; Holmes, Edward C.; Tesh, Robert B.; Vasilakis, Nikos
2015-01-01
RNA viruses exhibit substantial structural, ecological and genomic diversity. However, genome size in RNA viruses is likely limited by a high mutation rate, resulting in the evolution of various mechanisms to increase complexity while minimising genome expansion. Here we conduct a large-scale analysis of the genome sequences of 99 animal rhabdoviruses, including 45 genomes which we determined de novo, to identify patterns of genome expansion and the evolution of genome complexity. All but seven of the rhabdoviruses clustered into 17 well-supported monophyletic groups, of which eight corresponded to established genera, seven were assigned as new genera, and two were taxonomically ambiguous. We show that the acquisition and loss of new genes appears to have been a central theme of rhabdovirus evolution, and has been associated with the appearance of alternative, overlapping and consecutive ORFs within the major structural protein genes, and the insertion and loss of additional ORFs in each gene junction in a clade-specific manner. Changes in the lengths of gene junctions accounted for as much as 48.5% of the variation in genome size from the smallest to the largest genome, and the frequency with which new ORFs were observed increased in the 3’ to 5’ direction along the genome. We also identify several new families of accessory genes encoded in these regions, and show that non-canonical expression strategies involving TURBS-like termination-reinitiation, ribosomal frame-shifts and leaky ribosomal scanning appear to be common. We conclude that rhabdoviruses have an unusual capacity for genomic plasticity that may be linked to their discontinuous transcription strategy from the negative-sense single-stranded RNA genome, and propose a model that accounts for the regular occurrence of genome expansion and contraction throughout the evolution of the Rhabdoviridae. PMID:25679389
microRNAs Databases: Developmental Methodologies, Structural and Functional Annotations.
Singh, Nagendra Kumar
2017-09-01
microRNA (miRNA) is an endogenous, evolutionarily conserved non-coding RNA that acts post-transcriptionally, repressing genes and cleaving mRNA through formation of the RNA-induced silencing complex (RISC). In RISC, the miRNA, together with the Argonaute protein complex, binds the targeted mRNA by complementary base pairing, causing gene repression or endonucleolytic cleavage of the mRNA, which can result in many diseases and syndromes. After the discovery of the miRNAs lin-4 and let-7, large numbers of miRNAs involved in various biological and metabolic processes were discovered by low-throughput and high-throughput experimental techniques together with computational methods. Because miRNAs control gene regulation, they are important non-coding RNAs for understanding complex biological phenomena in organisms. This paper reviews miRNA databases with structural and functional annotations developed by various researchers. These databases contain structural and functional information on animal, plant and virus miRNAs, including miRNA-associated diseases, stress resistance in plants, the roles of miRNAs in various biological processes, the effect of miRNA interactions on drugs and the environment, the effect of variants on miRNAs, miRNA gene expression analysis, and the sequences and structures of miRNAs. This review focuses on the developmental methodology of miRNA databases, such as the computational tools and methods used to extract miRNA annotations from different resources or from experiments. It also discusses the efficiency of the user interface design of each database, along with its current entries and miRNA annotations (pathways, gene ontology, disease ontology, etc.). An integrated schematic diagram of the database construction process is also presented, along with tabular and graphical comparisons of the various types of entries in different databases. The aim of this paper is to present the important miRNA-related resources in a single place.
The evolutionary ecology of complex lifecycle parasites: linking phenomena with mechanisms
Auld, S KJR; Tinsley, M C
2015-01-01
Many parasitic infections, including those of humans, are caused by complex lifecycle parasites (CLPs): parasites that sequentially infect different hosts over the course of their lifecycle. CLPs come from a wide range of taxonomic groups—from single-celled bacteria to multicellular flatworms—yet share many common features in their life histories. Theory tells us when CLPs should be favoured by selection, but more empirical studies are required in order to quantify the costs and benefits of having a complex lifecycle, especially in parasites that facultatively vary their lifecycle complexity. In this article, we identify ecological conditions that favour CLPs over their simple lifecycle counterparts and highlight how a complex lifecycle can alter transmission rate and trade-offs between growth and reproduction. We show that CLPs participate in dynamic host–parasite coevolution, as more mobile hosts can fuel CLP adaptation to less mobile hosts. Then, we argue that a more general understanding of the evolutionary ecology of CLPs is essential for the development of effective frameworks to manage the many diseases they cause. More research is needed identifying the genetics of infection mechanisms used by CLPs, particularly into the role of gene duplication and neofunctionalisation in lifecycle evolution. We propose that testing for signatures of selection in infection genes will reveal much about how and when complex lifecycles evolved, and will help quantify complex patterns of coevolution between CLPs and their various hosts. Finally, we emphasise four key areas where new research approaches will provide fertile opportunities to advance this field. PMID:25227255
Martín, Iker; Ruiz, María F; Sánchez, Lucas
2011-03-15
The gene transformer-2, which is involved in sex determination, has been studied in Drosophila, Musca, Ceratitis, Anastrepha and Lucilia. All these members of Diptera belong to the suborder Brachycera. This work reports the isolation and characterisation of the transformer-2 genes of the dipterans Sciara ocellaris and Bradysia coprophila (formerly Sciara coprophila), which belong to the much less extensively analysed family Sciaridae of the suborder Nematocera, which is paraphyletic with respect to the suborder Brachycera. The transformer-2 genes of the studied Sciara species were found to be transcribed in both sexes during development and adult life, in both the soma and germ lines. They produced a single primary transcript, which underwent the same alternative splicing in both sexes, giving rise to different mRNA isoforms. In S. ocellaris the most abundant mRNA isoform encoded a full-length protein of 251 amino acids, while that of B. coprophila encoded a protein of 246 amino acids. Both showed the features of the SR protein family. The minor mRNA isoforms of both species encoded truncated, presumably non-functional Transformer-2 proteins. The comparison of the functional Sciara Transformer-2 proteins among themselves and with those of other insects revealed the greatest degree of conservation in the RRM domain and linker region. In contrast, the RS1 and RS2 domains showed extensive variation with respect to their number of amino acids and their arginine-serine (RS) dipeptide content. The expression of S. ocellaris Transformer-2 protein in Drosophila XX pseudomales lacking the endogenous transformer-2 function caused their partial feminisation. The transformer-2 genes of both Sciaridae species encode a single protein in both sexes that shares the characteristics of the Transformer-2 proteins of other insects. These proteins showed conserved sex-determination function in Drosophila; i.e., they were able to form a complex with the endogenous Drosophila Transformer protein that controls the female-specific splicing of the Drosophila doublesex pre-mRNA. However, it appears that the complex formed between the Drosophila Transformer protein and the Sciara Transformer-2 protein is less effective at inducing the female-specific splicing of the endogenous Drosophila doublesex pre-mRNA than the Drosophila Transformer-Transformer-2 complex. This suggests the existence of species-specific co-evolution of the Transformer and Transformer-2 proteins.
2011-01-01
Background The gene transformer-2, which is involved in sex determination, has been studied in Drosophila, Musca, Ceratitis, Anastrepha and Lucilia. All these members of Diptera belong to the suborder Brachycera. This work reports the isolation and characterisation of the transformer-2 genes of the dipterans Sciara ocellaris and Bradysia coprophila (formerly Sciara coprophila), which belong to the much less extensively analysed family Sciaridae of the suborder Nematocera, which is paraphyletic with respect to the suborder Brachycera. Results The transformer-2 genes of the studied Sciara species were found to be transcribed in both sexes during development and adult life, in both the soma and germ lines. They produced a single primary transcript, which underwent the same alternative splicing in both sexes, giving rise to different mRNA isoforms. In S. ocellaris the most abundant mRNA isoform encoded a full-length protein of 251 amino acids, while that of B. coprophila encoded a protein of 246 amino acids. Both showed the features of the SR protein family. The minor mRNA isoforms of both species encoded truncated, presumably non-functional Transformer-2 proteins. The comparison of the functional Sciara Transformer-2 proteins among themselves and with those of other insects revealed the greatest degree of conservation in the RRM domain and linker region. In contrast, the RS1 and RS2 domains showed extensive variation with respect to their number of amino acids and their arginine-serine (RS) dipeptide content. The expression of S. ocellaris Transformer-2 protein in Drosophila XX pseudomales lacking the endogenous transformer-2 function caused their partial feminisation. Conclusions The transformer-2 genes of both Sciaridae species encode a single protein in both sexes that shares the characteristics of the Transformer-2 proteins of other insects. These proteins showed conserved sex-determination function in Drosophila; i.e., they were able to form a complex with the endogenous Drosophila Transformer protein that controls the female-specific splicing of the Drosophila doublesex pre-mRNA. However, it appears that the complex formed between the Drosophila Transformer protein and the Sciara Transformer-2 protein is less effective at inducing the female-specific splicing of the endogenous Drosophila doublesex pre-mRNA than the Drosophila Transformer-Transformer-2 complex. This suggests the existence of species-specific co-evolution of the Transformer and Transformer-2 proteins. PMID:21406087
Biochemical and genetic analysis of Leigh syndrome patients in Korea.
Chae, Jong-Hee; Lee, Jin Sook; Kim, Ki Joong; Hwang, Yong Seung; Hirano, Michio
2008-06-01
Sixteen Korean patients with Leigh syndrome were identified at the Seoul National University Children's Hospital in 2001-2006. Biochemical or molecular defects were identified in 14 patients (87.5%). Thirteen patients had respiratory chain enzyme defects; 9 had complex I deficiency, and 4 had combined defects of complex I+III+IV. Based on the biochemical defects, targeted genetic studies in 4 patients with complex I deficiency revealed two heteroplasmic mitochondrial DNA mutations in ND genes. One patient had the mitochondrial DNA T8993G point mutation. No mitochondrial DNA defects were identified in 11 (68.7%) of our LS patients, who probably have mutations in nuclear DNA. Although a limited study based in a single tertiary medical center, our findings suggest that isolated complex I deficiency may be the most common cause of Leigh syndrome in Korea.
Das, Shouvik; Upadhyaya, Hari D.; Bajaj, Deepak; Kujur, Alice; Badoni, Saurabh; Laxmi; Kumar, Vinod; Tripathi, Shailesh; Gowda, C. L. Laxmipathi; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.
2015-01-01
A rapid high-resolution genome-wide strategy for molecular mapping of major QTL(s)/gene(s) regulating important agronomic traits is vital for in-depth dissection of complex quantitative traits and genetic enhancement in chickpea. The present study for the first time employed a NGS-based whole-genome QTL-seq strategy to identify one major genomic region harbouring a robust 100-seed weight QTL using an intra-specific 221 chickpea mapping population (desi cv. ICC 7184 × desi cv. ICC 15061). The QTL-seq-derived major SW QTL (CaqSW1.1) was further validated by single-nucleotide polymorphism (SNP) and simple sequence repeat (SSR) marker-based traditional QTL mapping (47.6% R2 at higher LOD >19). This reflects the reliability and efficacy of QTL-seq as a strategy for rapid genome-wide scanning and fine mapping of major trait regulatory QTLs in chickpea. The use of QTL-seq and classical QTL mapping in combination narrowed down the 1.37 Mb (comprising 177 genes) major SW QTL (CaqSW1.1) region into a 35 kb genomic interval on desi chickpea chromosome 1 containing six genes. One coding SNP (G/A)-carrying constitutive photomorphogenic9 (COP9) signalosome complex subunit 8 (CSN8) gene of these exhibited seed-specific expression, including pronounced differential up-/down-regulation in low and high seed weight mapping parents and homozygous individuals during seed development. The coding SNP mined in this potential seed weight-governing candidate CSN8 gene was found to be present exclusively in all cultivated species/genotypes, but not in any wild species/genotypes of primary, secondary and tertiary gene pools. This indicates the effect of strong artificial and/or natural selection pressure on target SW locus during chickpea domestication. The proposed QTL-seq-driven integrated genome-wide strategy has potential to delineate major candidate gene(s) harbouring a robust trait regulatory QTL rapidly with optimal use of resources. 
This will further assist us to extrapolate the molecular mechanism underlying complex quantitative traits at a genome-wide scale leading to fast-paced marker-assisted genetic improvement in diverse crop plants, including chickpea. PMID:25922536
Sewell, Holly L.; Kaster, Anne-Kristin
2017-01-01
ABSTRACT The deep marine subsurface is one of the largest unexplored biospheres on Earth and is widely inhabited by members of the phylum Chloroflexi. In this report, we investigated genomes of single cells obtained from deep-sea sediments of the Peruvian Margin, which are enriched in such Chloroflexi. 16S rRNA gene sequence analysis placed two of these single-cell-derived genomes (DscP3 and Dsc4) in a clade of subphylum I Chloroflexi which were previously recovered from deep-sea sediment in the Okinawa Trough and a third (DscP2-2) as a member of the previously reported DscP2 population from Peruvian Margin site 1230. The presence of genes encoding enzymes of a complete Wood-Ljungdahl pathway, glycolysis/gluconeogenesis, a Rhodobacter nitrogen fixation (Rnf) complex, glycosyltransferases, and formate dehydrogenases in the single-cell genomes of DscP3 and Dsc4 and the presence of an NADH-dependent reduced ferredoxin:NADP oxidoreductase (Nfn) and Rnf in the genome of DscP2-2 imply a homoacetogenic lifestyle of these abundant marine Chloroflexi. We also report here the first complete pathway for anaerobic benzoate oxidation to acetyl coenzyme A (CoA) in the phylum Chloroflexi (DscP3 and Dsc4), including a class I benzoyl-CoA reductase. Of remarkable evolutionary significance, we discovered a gene encoding a formate dehydrogenase (FdnI) with reciprocal closest identity to the formate dehydrogenase-like protein (complex iron-sulfur molybdoenzyme [CISM], DET0187) of terrestrial Dehalococcoides/Dehalogenimonas spp. This formate dehydrogenase-like protein has been shown to lack formate dehydrogenase activity in Dehalococcoides/Dehalogenimonas spp. and is instead hypothesized to couple HupL hydrogenase to a reductive dehalogenase in the catabolic reductive dehalogenation pathway. This finding of a close functional homologue provides an important missing link for understanding the origin and the metabolic core of terrestrial Dehalococcoides/Dehalogenimonas spp. 
and of reductive dehalogenation, as well as the biology of abundant deep-sea Chloroflexi. PMID:29259088
Anderson, Olin D; Coleman-Derr, Devin; Gu, Yong Q; Heath, Sekou
2010-06-16
Among the dietary essential amino acids, the most severely limiting in the cereals is lysine. Since cereals make up half of the human diet, lysine limitation has quality/nutritional consequences. The breakdown of lysine is controlled mainly by the catabolic bifunctional enzyme lysine ketoglutarate reductase - saccharopine dehydrogenase (LKR/SDH). The LKR/SDH gene has been reported to produce transcripts for the bifunctional enzyme and separate monofunctional transcripts. In addition to lysine metabolism, this gene has been implicated in a number of metabolic and developmental pathways, which along with its production of multiple transcript types and complex exon/intron structure suggest an important node in plant metabolism. Understanding more about the LKR/SDH gene is thus interesting both from applied standpoint and for basic plant metabolism. The current report describes a wheat genomic fragment containing an LKR/SDH gene and adjacent genes. The wheat LKR/SDH genomic segment was found to originate from the A-genome of wheat, and EST analysis indicates all three LKR/SDH genes in hexaploid wheat are transcriptionally active. A comparison of a set of plant LKR/SDH genes suggests regions of greater sequence conservation likely related to critical enzymatic functions and metabolic controls. Although most plants contain only a single LKR/SDH gene per genome, poplar contains at least two functional bifunctional genes in addition to a monofunctional LKR gene. Analysis of ESTs finds evidence for monofunctional LKR transcripts in switchgrass, and monofunctional SDH transcripts in wheat, Brachypodium, and poplar. The analysis of a wheat LKR/SDH gene and comparative structural and functional analyses among available plant genes provides new information on this important gene. 
Both the structure of the LKR/SDH gene and the immediately adjacent genes show lineage-specific differences between monocots and dicots, and findings suggest variation in activity of LKR/SDH genes among plants. Although most plant genomes seem to contain a single conserved LKR/SDH gene per genome, poplar possesses multiple contiguous genes. A preponderance of SDH transcripts suggests the LKR region may be more rate-limiting. Only switchgrass has EST evidence for LKR monofunctional transcripts. Evidence for monofunctional SDH transcripts shows a novel intron in wheat, Brachypodium, and poplar.
The complexity of gene expression dynamics revealed by permutation entropy
2010-01-01
Background High complexity is considered a hallmark of living systems. Here we investigate the complexity of temporal gene expression patterns using the concept of Permutation Entropy (PE) first introduced in dynamical systems theory. The analysis of gene expression data has so far focused primarily on the identification of differentially expressed genes, or on the elucidation of pathway and regulatory relationships. We aim to study gene expression time series data from the viewpoint of complexity. Results Applying the PE complexity metric to abiotic stress response time series data in Arabidopsis thaliana, genes involved in stress response and signaling were found to be associated with the highest complexity not only under stress, but surprisingly, also under reference, non-stress conditions. Genes with house-keeping functions exhibited lower PE complexity. Compared to reference conditions, the PE of temporal gene expression patterns generally increased upon stress exposure. High-complexity genes were found to have longer upstream intergenic regions and more cis-regulatory motifs in their promoter regions indicative of a more complex regulatory apparatus needed to orchestrate their expression, and to be associated with higher correlation network connectivity degree. Arabidopsis genes also present in other plant species were observed to exhibit decreased PE complexity compared to Arabidopsis specific genes. Conclusions We show that Permutation Entropy is a simple yet robust and powerful approach to identify temporal gene expression profiles of varying complexity that is equally applicable to other types of molecular profile data. PMID:21176199
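As an illustration of the complexity metric named in the abstract above, the following is a minimal sketch of permutation entropy in its standard ordinal-pattern form. The abstract does not state which pattern length, delay, or tie-handling the authors used, so the `order` parameter, the tie-breaking rule, and the normalisation by log2(order!) are assumptions made for this example, and the function name is hypothetical.

```python
import math
from collections import Counter


def permutation_entropy(series, order=3, normalize=True):
    """Shannon entropy (in bits) of the ordinal patterns found in `series`.

    `order` is the pattern length; consecutive points are used (delay of 1).
    Both choices are assumptions for illustration, not taken from the paper.
    """
    if order < 2 or len(series) < order:
        raise ValueError("need order >= 2 and at least `order` data points")

    patterns = Counter()
    for start in range(len(series) - order + 1):
        window = series[start:start + order]
        # Ordinal pattern: positions of the window sorted by value,
        # with ties broken by position so that every window maps to one pattern.
        pattern = tuple(sorted(range(order), key=lambda k: (window[k], k)))
        patterns[pattern] += 1

    total = sum(patterns.values())
    entropy = sum(-(n / total) * math.log2(n / total) for n in patterns.values())
    if normalize:
        # Divide by the maximum possible entropy, log2(order!), to land in [0, 1].
        entropy /= math.log2(math.factorial(order))
    return entropy


# A strictly rising profile shows a single pattern, so its entropy is zero;
# a profile alternating up and down uses both order-2 patterns equally.
flat_complexity = permutation_entropy([1, 2, 3, 4, 5, 6], order=3)
high_complexity = permutation_entropy([1, 2, 1, 2, 1], order=2)
```

Under these assumptions, a gene whose expression changes monotonically over the time course scores near 0, while an irregular profile scores closer to 1.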
Unprecedented high-resolution view of bacterial operon architecture revealed by RNA sequencing.
Conway, Tyrrell; Creecy, James P; Maddox, Scott M; Grissom, Joe E; Conkle, Trevor L; Shadid, Tyler M; Teramoto, Jun; San Miguel, Phillip; Shimada, Tomohiro; Ishihama, Akira; Mori, Hirotada; Wanner, Barry L
2014-07-08
We analyzed the transcriptome of Escherichia coli K-12 by strand-specific RNA sequencing at single-nucleotide resolution during steady-state (logarithmic-phase) growth and upon entry into stationary phase in glucose minimal medium. To generate high-resolution transcriptome maps, we developed an organizational schema which showed that in practice only three features are required to define operon architecture: the promoter, terminator, and deep RNA sequence read coverage. We precisely annotated 2,122 promoters and 1,774 terminators, defining 1,510 operons with an average of 1.98 genes per operon. Our analyses revealed an unprecedented view of E. coli operon architecture. A large proportion (36%) of operons are complex with internal promoters or terminators that generate multiple transcription units. For 43% of operons, we observed differential expression of polycistronic genes, despite being in the same operons, indicating that E. coli operon architecture allows fine-tuning of gene expression. We found that 276 of 370 convergent operons terminate inefficiently, generating complementary 3' transcript ends which overlap on average by 286 nucleotides, and 136 of 388 divergent operons have promoters arranged such that their 5' ends overlap on average by 168 nucleotides. We found 89 antisense transcripts of 397-nucleotide average length, 7 unannotated transcripts within intergenic regions, and 18 sense transcripts that completely overlap operons on the opposite strand. Of 519 overlapping transcripts, 75% correspond to sequences that are highly conserved in E. coli (>50 genomes). Our data extend recent studies showing unexpected transcriptome complexity in several bacteria and suggest that antisense RNA regulation is widespread. Importance: We precisely mapped the 5' and 3' ends of RNA transcripts across the E. coli K-12 genome by using a single-nucleotide analytical approach. Our resulting high-resolution transcriptome maps show that ca. one-third of E. 
coli operons are complex, with internal promoters and terminators generating multiple transcription units and allowing differential gene expression within these operons. We discovered extensive antisense transcription that results from more than 500 operons, which fully overlap or extensively overlap adjacent divergent or convergent operons. The genomic regions corresponding to these antisense transcripts are highly conserved in E. coli (including Shigella species), although it remains to be proven whether or not they are functional. Our observations of features unearthed by single-nucleotide transcriptome mapping suggest that deeper layers of transcriptional regulation in bacteria are likely to be revealed in the future. Copyright © 2014 Conway et al.
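The proportions quoted in the abstract above can be cross-checked with a few lines of arithmetic. This is only a sketch that recomputes figures from the counts stated in the abstract; the variable names are hypothetical, and the operon counts derived from percentages are approximate because the abstract reports rounded percentages.

```python
def percent(part, whole):
    """Return `part` as a percentage of `whole`."""
    return 100.0 * part / whole


TOTAL_OPERONS = 1510  # operons defined in the study

# Convergent operons that terminate inefficiently: 276 of 370.
convergent_pct = round(percent(276, 370), 1)   # 74.6
# Divergent operons with overlapping 5' ends: 136 of 388.
divergent_pct = round(percent(136, 388), 1)    # 35.1

# Approximate counts implied by the rounded percentages in the abstract.
complex_operons = round(0.36 * TOTAL_OPERONS)        # 544
differential_operons = round(0.43 * TOTAL_OPERONS)   # 649
genes_in_operons = round(1.98 * TOTAL_OPERONS)       # 2990
```

The figure of roughly 544 complex operons out of 1,510 agrees with the statement that about one-third of E. coli operons are complex.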
Morita, Yuko; Nakamori, Shigeru; Takagi, Hiroshi
2003-01-01
We previously isolated a mutant which showed a high tolerance to freezing that correlated with higher levels of intracellular l-proline derived from l-proline analogue-resistant mutants. The mutation responsible for the analogue resistance and l-proline accumulation was a single nuclear dominant mutation. By introducing the mutant-derived genomic library into a non-l-proline-utilizing strain, the mutant was found to carry an allele of the wild-type PRO1 gene encoding γ-glutamyl kinase, which resulted in a single amino acid replacement; Asp (GAC) at position 154 was replaced by Asn (AAC). Interestingly, the allele of PRO1 was shown to enhance the activities of γ-glutamyl kinase and γ-glutamyl phosphate reductase, both of which catalyze the first two steps of l-proline synthesis from l-glutamate and which together may form a complex in vivo. When cultured in liquid minimal medium, yeast cells expressing the mutated γ-glutamyl kinase were found to accumulate intracellular l-proline and showed a prominent increase in cell viability after freezing at −20°C compared to the viability of cells harboring the wild-type PRO1 gene. These results suggest that the altered γ-glutamyl kinase results in stabilization of the complex or has an indirect effect on γ-glutamyl phosphate reductase activity, which leads to an increase in l-proline production in Saccharomyces cerevisiae. The approach described in this paper could be a practical method for breeding novel freeze-tolerant yeast strains. PMID:12513997
COLE-TOBIAN, JENNIFER L.; ZIMMERMAN, PETER A.; KING, CHRISTOPHER L.
2013-01-01
Individuals living in malaria endemic areas are often infected with multiple parasite clones. Currently used single nucleotide polymorphism (SNP) genotyping methods for malaria parasites are cumbersome; furthermore, few methods currently exist that can rapidly determine the most abundant clone in these complex infections. Here we describe an oligonucleotide ligation assay (OLA) to distinguish SNPs in the Plasmodium vivax Duffy binding protein gene (Pvdbp) at 14 polymorphic residues simultaneously. Allele abundance is determined by the highest mean fluorescent intensity of each allele. Using mixtures of plasmids encoding known haplotypes of the Pvdbp, single clones of P. vivax parasites from infected Aotus monkeys, and well-defined mixed infections from field samples, we were able to identify the predominant Pvdbp genotype with > 93% accuracy when the dominant clone is twice as abundant as a lesser genotype and > 97% of the time if the ratio was 5:1 or greater. Thus, the OLA can accurately, reproducibly, and rapidly determine the predominant parasite haplotype in complex blood stage infections. PMID:17255222
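The allele-calling rule stated in the abstract above (the allele with the highest mean fluorescent intensity at each polymorphic site is taken as the predominant one) can be sketched as follows. The site labels, allele letters, and intensity values are invented for illustration and do not come from the paper, and the function name is hypothetical; the actual assay covers 14 polymorphic residues of Pvdbp.

```python
def predominant_haplotype(mfi_by_site):
    """Pick, for each site, the allele with the highest mean fluorescent intensity.

    `mfi_by_site` maps a site label to a dict of {allele: mean intensity}.
    """
    calls = {}
    for site, intensities in mfi_by_site.items():
        if not intensities:
            raise ValueError(f"no intensity readings for site {site!r}")
        calls[site] = max(intensities, key=intensities.get)
    return calls


# Hypothetical readings for two sites in a mixed infection.
example_readings = {
    "site_1": {"G": 1200.0, "A": 300.0},
    "site_2": {"C": 150.0, "T": 900.0},
}
example_calls = predominant_haplotype(example_readings)
```

This sketch ignores background subtraction and any intensity threshold the authors may have applied; it only shows the "highest intensity wins" step.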
Patel, Vidushi S; Cooper, Steven J B; Deakin, Janine E; Fulton, Bob; Graves, Tina; Warren, Wesley C; Wilson, Richard K; Graves, Jennifer A M
2008-07-25
Vertebrate alpha (alpha)- and beta (beta)-globin gene families exemplify the way in which genomes evolve to produce functional complexity. From tandem duplication of a single globin locus, the alpha- and beta-globin clusters expanded, and then were separated onto different chromosomes. The previous finding of a fossil beta-globin gene (omega) in the marsupial alpha-cluster, however, suggested that duplication of the alpha-beta cluster onto two chromosomes, followed by lineage-specific gene loss and duplication, produced paralogous alpha- and beta-globin clusters in birds and mammals. Here we analyse genomic data from an egg-laying monotreme mammal, the platypus (Ornithorhynchus anatinus), to explore haemoglobin evolution at the stem of the mammalian radiation. The platypus alpha-globin cluster (chromosome 21) contains embryonic and adult alpha- globin genes, a beta-like omega-globin gene, and the GBY globin gene with homology to cytoglobin, arranged as 5'-zeta-zeta'-alphaD-alpha3-alpha2-alpha1-omega-GBY-3'. The platypus beta-globin cluster (chromosome 2) contains single embryonic and adult globin genes arranged as 5'-epsilon-beta-3'. Surprisingly, all of these globin genes were expressed in some adult tissues. Comparison of flanking sequences revealed that all jawed vertebrate alpha-globin clusters are flanked by MPG-C16orf35 and LUC7L, whereas all bird and mammal beta-globin clusters are embedded in olfactory genes. Thus, the mammalian alpha- and beta-globin clusters are orthologous to the bird alpha- and beta-globin clusters respectively. We propose that alpha- and beta-globin clusters evolved from an ancient MPG-C16orf35-alpha-beta-GBY-LUC7L arrangement 410 million years ago. 
A copy of the original beta (represented by omega in marsupials and monotremes) was inserted into an array of olfactory genes before the amniote radiation (>315 million years ago), then duplicated and diverged to form orthologous clusters of beta-globin genes with different expression profiles in different lineages.
Verhoef, Talitha I; Hill, Melissa; Drury, Suzanne; Mason, Sarah; Jenkins, Lucy; Morris, Stephen; Chitty, Lyn S
2016-07-01
Evaluate the costs of offering non-invasive prenatal diagnosis (NIPD) for single gene disorders compared to traditional invasive testing to inform NIPD implementation into clinical practice. Total costs of diagnosis using NIPD or invasive testing pathways were compared for a representative set of single gene disorders. For autosomal dominant conditions, where NIPD molecular techniques are straightforward, NIPD cost £314 less than invasive testing. NIPD for autosomal recessive and X-linked conditions requires more complicated technical approaches and total costs were more than invasive testing, e.g. NIPD for spinal muscular atrophy was £1090 more than invasive testing. Impact of test uptake on costs was assessed using sickle cell disorder as an example. Anticipated high uptake of NIPD resulted in an incremental cost of NIPD over invasive testing of £48 635 per 100 pregnancies at risk of sickle cell disorder. Total costs of NIPD are dependent upon the complexity of the testing technique required. Anticipated increased demand for testing may have economic implications for prenatal diagnostic services. Ethical issues requiring further consideration are highlighted including directing resources to NIPD when used for information only and restricting access to safe tests if it is not cost-effective to develop NIPD for rare conditions. © 2016 The Authors. Prenatal Diagnosis published by John Wiley & Sons, Ltd.
Hemoglobin genetics: recent contributions of GWAS and gene editing
Smith, Elenoe C.; Orkin, Stuart H.
2016-01-01
The β-hemoglobinopathies are inherited disorders resulting from altered coding potential or expression of the adult β-globin gene. Impaired expression of β-globin reduces adult hemoglobin (α2β2) production, the hallmark of β-thalassemia. A single-base mutation at codon 6 leads to formation of HbS (α2βS2) and sickle cell disease. While the basis of these diseases is known, therapy remains largely supportive. Bone marrow transplantation is the only curative therapy. Patients with elevated levels of fetal hemoglobin (HbF, α2γ2) as adults exhibit reduced symptoms and enhanced survival. The β-globin gene locus is a paradigm of cell- and developmental stage-specific regulation. Although the principal erythroid cell transcription factors are known, mechanisms responsible for silencing of the γ-globin gene were obscure until application of genome-wide association studies (GWAS). Here, we review findings in the field. GWAS identified BCL11A as a candidate negative regulator of γ-globin expression. Subsequent studies have established BCL11A as a quantitative repressor. GWAS-related single-nucleotide polymorphisms lie within an essential erythroid enhancer of the BCL11A gene. Disruption of a discrete region within the enhancer reduces BCL11A expression and induces HbF expression, providing the basis for gene therapy using gene editing tools. A recently identified, second silencing factor, leukemia/lymphoma-related factor/Pokemon, shares features with BCL11A, including interaction with the nucleosome remodeling deacetylase repressive complex. These findings suggest involvement of a common pathway for HbF silencing. In addition, we discuss other factors that may be involved in γ-globin gene silencing and their potential manipulation for therapeutic benefit in treating the β-hemoglobinopathies. PMID:27340226
Haplotypes and gene expression implicate the MAPT region for Parkinson disease
Tobin, J.E.; Latourelle, J.C.; Lew, M.F.; Klein, C.; Suchowersky, O.; Shill, H.A.; Golbe, L.I.; Mark, M.H.; Growdon, J.H.; Wooten, G.F.; Racette, B.A.; Perlmutter, J.S.; Watts, R.; Guttman, M.; Baker, K.B.; Goldwurm, S.; Pezzoli, G.; Singer, C.; Saint-Hilaire, M.H.; Hendricks, A.E.; Williamson, S.; Nagle, M.W.; Wilk, J.B.; Massood, T.; Laramie, J.M.; DeStefano, A.L.; Litvan, I.; Nicholson, G.; Corbett, A.; Isaacson, S.; Burn, D.J.; Chinnery, P.F.; Pramstaller, P.P.; Sherman, S.; Al-hinti, J.; Drasby, E.; Nance, M.; Moller, A.T.; Ostergaard, K.; Roxburgh, R.; Snow, B.; Slevin, J.T.; Cambi, F.; Gusella, J.F.; Myers, R.H.
2009-01-01
Background Microtubule-associated protein tau (MAPT) has been associated with several neurodegenerative disorders including forms of parkinsonism and Parkinson disease (PD). We evaluated the association of the MAPT region with PD in a large cohort of familial PD cases recruited by the GenePD Study. In addition, postmortem brain samples from patients with PD and neurologically normal controls were used to evaluate whether the expression of the 3-repeat and 4-repeat isoforms of MAPT, and neighboring genes Saitohin (STH) and KIAA1267, are altered in PD cerebellum. Methods Twenty-one single-nucleotide polymorphisms (SNPs) in the region of MAPT on chromosome 17q21 were genotyped in the GenePD Study. Single SNPs and haplotypes, including the H1 haplotype, were evaluated for association to PD. Relative quantification of gene expression was performed using real-time RT-PCR. Results After adjusting for multiple comparisons, SNP rs1800547 was significantly associated with PD affection. While the H1 haplotype was associated with a significantly increased risk for PD, a novel H1 subhaplotype was identified that predicted a greater increased risk for PD. The expression of 4-repeat MAPT, STH, and KIAA1267 was significantly increased in PD brains relative to controls. No difference in expression was observed for 3-repeat MAPT. Conclusions This study supports a role for MAPT in the pathogenesis of familial and idiopathic Parkinson disease (PD). Interestingly, the results of the gene expression studies suggest that other genes in the vicinity of MAPT, specifically STH and KIAA1267, may also have a role in PD and suggest complex effects for the genes in this region on PD risk. PMID:18509094
Novel Two-Step Hierarchical Screening of Mutant Pools Reveals Mutants under Selection in Chicks
Yang, Hee-Jeong; Bogomolnaya, Lydia M.; Elfenbein, Johanna R.; Endicott-Yazdani, Tiana; Reynolds, M. Megan; Porwollik, Steffen; Cheng, Pui; Xia, Xiao-Qin
2016-01-01
Contaminated chicken/egg products are major sources of human salmonellosis, yet the strategies used by Salmonella to colonize chickens are poorly understood. We applied a novel two-step hierarchical procedure to identify new genes important for colonization and persistence of Salmonella enterica serotype Typhimurium in chickens. A library of 182 S. Typhimurium mutants each containing a targeted deletion of a group of contiguous genes (for a total of 2,069 genes deleted) was used to identify regions under selection at 1, 3, and 9 days postinfection in chicks. Mutants in 11 regions were under selection at all assayed times (colonization mutants), and mutants in 15 regions were under selection only at day 9 (persistence mutants). We assembled a pool of 92 mutants, each deleted for a single gene, representing nearly all genes in nine regions under selection. Twelve single gene deletion mutants were under selection in this assay, and we confirmed 6 of 9 of these candidate mutants via competitive infections and complementation analysis in chicks. STM0580, STM1295, STM1297, STM3612, STM3615, and STM3734 are needed for Salmonella to colonize and persist in chicks and were not previously associated with this ability. One of these key genes, STM1297 (selD), is required for anaerobic growth and supports the ability to utilize formate under these conditions, suggesting that metabolism of formate is important during infection. We report a hierarchical screening strategy to interrogate large portions of the genome during infection of animals using pools of mutants of low complexity. Using this strategy, we identified six genes not previously known to be needed during infection in chicks, and one of these (STM1297) suggests an important role for formate metabolism during infection. PMID:26857572
Deep sampling of the Palomero maize transcriptome by a high throughput strategy of pyrosequencing.
Vega-Arreguín, Julio C; Ibarra-Laclette, Enrique; Jiménez-Moraila, Beatriz; Martínez, Octavio; Vielle-Calzada, Jean Philippe; Herrera-Estrella, Luis; Herrera-Estrella, Alfredo
2009-07-06
In-depth sequencing analysis has not been able to determine the overall complexity of transcriptional activity of a plant organ or tissue sample. In some cases, deep parallel sequencing of Expressed Sequence Tags (ESTs), although not yet optimized for the sequencing of cDNAs, has represented an efficient procedure for validating gene prediction and estimating overall gene coverage. This approach could be very valuable for complex plant genomes. In addition, little emphasis has been given to efforts aiming at an estimation of the overall transcriptional universe found in a multicellular organism at a specific developmental stage. To explore, in depth, the transcriptional diversity in an ancient maize landrace, we developed a protocol to optimize the sequencing of cDNAs and performed 4 consecutive GS20-454 pyrosequencing runs of a cDNA library obtained from 2 week-old Palomero Toluqueño maize plants. The protocol reported here allowed obtaining over 90% of informative sequences. These GS20-454 runs generated over 1.5 Million reads, representing the largest amount of sequences reported from a single plant cDNA library. A collection of 367,391 quality-filtered reads (30.09 Mb) from a single run was sufficient to identify transcripts corresponding to 34% of public maize ESTs databases; total sequences generated after 4 filtered runs increased this coverage to 50%. Comparisons of all 1.5 Million reads to the Maize Assembled Genomic Islands (MAGIs) provided evidence for the transcriptional activity of 11% of MAGIs. We estimate that 5.67% (86,069 sequences) do not align with public ESTs or annotated genes, potentially representing new maize transcripts. Following the assembly of 74.4% of the reads in 65,493 contigs, real-time PCR of selected genes confirmed a predicted correlation between the abundance of GS20-454 sequences and corresponding levels of gene expression. 
A protocol was developed that significantly increases the number, length and quality of cDNA reads using massive 454 parallel sequencing. We show that recurrent 454 pyrosequencing of a single cDNA sample is necessary to attain a thorough representation of the transcriptional universe present in maize, that can also be used to estimate transcript abundance of specific genes. This data suggests that the molecular and functional diversity contained in the vast native landraces remains to be explored, and that large-scale transcriptional sequencing of a presumed ancestor of the modern maize varieties represents a valuable approach to characterize the functional diversity of maize for future agricultural and evolutionary studies.
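Two of the figures in the abstract above can be related to each other with simple arithmetic. The sketch below assumes that "Mb" means 10^6 bases and that the 5.67% refers to the full set of reads from all four runs; both are assumptions for this example, and the variable names are hypothetical.

```python
# Single filtered run: 367,391 reads totalling 30.09 Mb (assumed 1 Mb = 1e6 bases).
filtered_reads = 367_391
filtered_bases = 30.09e6
mean_read_length = filtered_bases / filtered_reads  # roughly 82 bases per read

# 86,069 sequences are reported as 5.67% of the reads; back out the implied total.
novel_sequences = 86_069
novel_fraction = 0.0567
implied_total_reads = novel_sequences / novel_fraction  # a little over 1.5 million
```

Under these assumptions the implied total is consistent with the "over 1.5 million reads" stated in the abstract, and the mean read length is in the short-read range expected for the GS20 platform.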
Kersting, Anna R.; Bornberg-Bauer, Erich; Moore, Andrew D.; Grath, Sonja
2012-01-01
Plant genomes are generally very large, mostly paleopolyploid, and have numerous gene duplicates and complex genomic features such as repeats and transposable elements. Many of these features have been hypothesized to enable plants, which cannot easily escape environmental challenges, to rapidly adapt. Another mechanism, which has recently been well described as a major facilitator of rapid adaptation in bacteria, animals, and fungi but not yet for plants, is modular rearrangement of protein-coding genes. Due to the high precision of profile-based methods, rearrangements can be well captured at the protein level by characterizing the emergence, loss, and rearrangements of protein domains, their structural, functional, and evolutionary building blocks. Here, we study the dynamics of domain rearrangements and explore their adaptive benefit in 27 plant and 3 algal genomes. We use a phylogenomic approach by which we can explain the formation of 88% of all arrangements by single-step events, such as fusion, fission, and terminal loss of domains. We find many domains are lost along every lineage, but at least 500 domains are novel, that is, they are unique to green plants and emerged more or less recently. These novel domains duplicate and rearrange more readily within their genomes than ancient domains and are overproportionally involved in stress response and developmental innovations. Novel domains more often affect regulatory proteins and show a higher degree of structural disorder than ancient domains. Whereas a relatively large and well-conserved core set of single-domain proteins exists, long multi-domain arrangements tend to be species-specific. We find that duplicated genes are more often involved in rearrangements. Although fission events typically impact metabolic proteins, fusion events often create new signaling proteins essential for environmental sensing. 
Taken together, the high volatility of single domains and complex arrangements in plant genomes demonstrate the importance of modularity for environmental adaptability of plants. PMID:22250127
A pH-responsive carboxylic β-1,3-glucan polysaccharide for complexation with polymeric guests.
Lien, Le Thi Ngoc; Shiraki, Tomohiro; Dawn, Arnab; Tsuchiya, Youichi; Tokunaga, Daisuke; Tamaru, Shun-ichi; Enomoto, Naoya; Hojo, Junichi; Shinkai, Seiji
2011-06-07
The helix-forming nature of β-1,3-glucan polysaccharides is a characteristic that has potential for producing gene carriers, bio-nanomaterials and other chiral nanowires. Herein, carboxylic curdlan (CurCOOH) bearing the β-1,3-polyglucuronic acid structure was successfully prepared from β-1,3-glucan polysaccharide curdlan (Cur) by one-step oxidation using a 4-acetamido-TEMPO/NaClO/NaClO2 system as the oxidant. The resulting high-molecular-weight CurCOOH was proved to bear the 6-COOH group in 100% purity. The optical rotatory dispersion (ORD) spectra indicated that the obtained CurCOOH behaves as a water-soluble single strand in aqueous media of various pH. This advantage has allowed us to use CurCOOH as a polymeric host to form various macromolecular complexes. For example, complexation of CurCOOH with single-walled carbon nanotubes (SWNTs) resulted in a water-soluble one-dimensional architecture, which formed a dispersion in aqueous solution that was stable for several months, and much more stable than SWNT complexes of the similarly negatively charged polyacrylic acid (PAA) and polymethacrylic acid (PMAA). It was shown that in the complex, SWNTs are effectively wrapped by a small amount of CurCOOH, enabling them to avoid electrostatic repulsion. This pH-responsive CurCOOH formed a very stable complex with cationic water-soluble polythiophenes (PT-1), which was stabilized not only by the hydrophobic interaction but also by the electrostatic attraction between trimethylammonium cations in PT-1 and dissociated anionic COO− groups in CurCOOH. The included PT-1 became CD-active only in the neutral to basic pH region, and the positive Cotton effect suggested that the conjugated main chain is twisted in the right-handed direction. We also found that CurCOOH can interact with polycytidylic acid (poly(C)) only under high NaCl concentrations, the binding and release of which could be controlled by a change in the salt concentration. 
We believe, therefore, that CurCOOH bearing a dissociable COOH group can act as a new potential polymeric host to construct novel polymeric complexes applicable for gene carriers, biosensors, chiral polymer assemblies, etc.
A System-Level Pathway-Phenotype Association Analysis Using Synthetic Feature Random Forest
Pan, Qinxin; Hu, Ting; Malley, James D.; Andrew, Angeline S.; Karagas, Margaret R.; Moore, Jason H.
2015-01-01
As the cost of genome-wide genotyping decreases, the number of genome-wide association studies (GWAS) has increased considerably. However, the transition from GWAS findings to the underlying biology of various phenotypes remains challenging. As a result, due to its system-level interpretability, pathway analysis has become a popular tool for gaining insight into the underlying biology from high-throughput genetic association data. In pathway analyses, gene sets representing particular biological processes are tested for significant associations with a given phenotype. Most existing pathway analysis approaches rely on single-marker statistics and assume that pathways are independent of each other. As biological systems are driven by complex biomolecular interactions, the complex relationships between single-nucleotide polymorphisms (SNPs) and pathways need to be addressed. To incorporate the complexity of gene-gene interactions and pathway-pathway relationships, we propose a system-level pathway analysis approach, synthetic feature random forest (SF-RF), which is designed to detect pathway-phenotype associations without making assumptions about the relationships among SNPs or pathways. In our approach, the genotypes of SNPs in a particular pathway are aggregated into a synthetic feature representing that pathway via Random Forest (RF). Multiple synthetic features are analyzed using RF simultaneously and the significance of a synthetic feature indicates the significance of the corresponding pathway. We further complement SF-RF with pathway-based Statistical Epistasis Network (SEN) analysis that evaluates interactions among pathways. By investigating the pathway SEN, we hope to gain additional insights into the genetic mechanisms contributing to the pathway-phenotype association. We apply SF-RF to a population-based genetic study of bladder cancer and further investigate the mechanisms that help explain the pathway-phenotype associations using SEN.
The bladder cancer-associated pathways we found are consistent with existing biological knowledge and also suggest novel and plausible hypotheses for future biological validation. PMID:24535726
Chadderton, Naomi; Palfi, Arpad; Millington-Ward, Sophia; Gobbo, Oliverio; Overlack, Nora; Carrigan, Matthew; O'Reilly, Mary; Campbell, Matthew; Ehrhardt, Carsten; Wolfrum, Uwe; Humphries, Peter; Kenna, Paul F; Farrar, G Jane
2013-01-01
Leber hereditary optic neuropathy (LHON) is a mitochondrially inherited form of visual dysfunction caused by mutations in several genes encoding subunits of the mitochondrial respiratory NADH-ubiquinone oxidoreductase complex (complex I). Development of gene therapies for LHON has been impeded by genetic heterogeneity and the need to deliver therapies to the mitochondria of retinal ganglion cells (RGCs), the cells primarily affected in LHON. The therapy under development entails intraocular injection of a nuclear yeast gene, NADH-quinone oxidoreductase (NDI1), that encodes a single-subunit complex I equivalent and as such is mutation independent. NDI1 is imported into mitochondria due to an endogenous mitochondrial localisation signal. Intravitreal injection represents a clinically relevant route of delivery to RGCs not previously used for NDI1. In this study, recombinant adeno-associated virus (AAV) serotype 2 expressing NDI1 (AAV-NDI1) was shown to protect RGCs in a rotenone-induced murine model of LHON. AAV-NDI1 significantly reduced RGC death by 1.5-fold and optic nerve atrophy by 1.4-fold. This led to a significant preservation of retinal function as assessed by manganese-enhanced magnetic resonance imaging and optokinetic responses. Intraocular injection of AAV-NDI1 overcomes many barriers previously associated with developing therapies for LHON and holds great therapeutic promise for a mitochondrial disorder for which there are no effective therapies.
Bartsevich, V V; Pakrasi, H B
1995-01-01
During photosynthesis, the photosystem II (PSII) pigment-protein complex catalyzes oxygen evolution, a reaction in which a four-manganese ensemble plays a crucial role. Using a newly developed selection scheme, we have isolated BP13, a random photosynthesis-deficient mutant strain of the cyanobacterium, Synechocystis 6803. This mutant grew slowly under photoautotrophic conditions, and had a low oxygen evolution activity. Biochemical analysis revealed that the lesion in this mutant strain had specifically affected the Mn ensemble in PSII. Interestingly, incubation of BP13 cells with micromolar levels of added Mn induced rapid recovery of oxygen evolution activity. The mutant could be complemented with a fragment of wild-type chromosomal DNA containing three closely linked genes, mntA, mntB and mntC. These gene products showed significant sequence similarities with polypeptide components of bacterial permeases that are members of the 'ABC (ATP binding cassette) superfamily' of transporter proteins. We determined that in the BP13 strain, a single nucleotide change had resulted in the replacement of an alanine by an aspartic acid residue in MntA, a soluble protein containing ATP binding motifs. These results suggest that the mntCAB gene cluster encodes polypeptide components of a Mn transporter, the first such protein complex identified in any organism. PMID:7743991
Novel organization of the common nodulation genes in Rhizobium leguminosarum bv. phaseoli strains.
Vázquez, M; Dávalos, A; de las Peñas, A; Sánchez, F; Quinto, C
1991-01-01
Nodulation by Rhizobium, Bradyrhizobium, and Azorhizobium species in the roots of legumes and nonlegumes requires the proper expression of plant genes and of both common and specific bacterial nodulation genes. The common nodABC genes form an operon or are physically mapped together in all species studied thus far. Rhizobium leguminosarum bv. phaseoli strains are classified in two groups. The type I group has reiterated nifHDK genes and a narrow host range of nodulation. The type II group has a single copy of the nifHDK genes and a wide host range of nodulation. We have found by genetic and nucleotide sequence analysis that in type I strain CE-3, the functional common nodA gene is separated from the nodBC genes by 20 kb and thus is transcriptionally separated from the latter genes. This novel organization could be the result of a complex rearrangement, as we found zones of identity between the two separated nodA and nodBC regions. Moreover, this novel organization of the common nodABC genes seems to be a general characteristic of R. leguminosarum bv. phaseoli type I strains. Despite the separation, the coordination of the expression of these genes seems not to be altered. PMID:1991718
Evidence for complete epistasis of null mutations in murine Fanconi anemia genes Fanca and Fancg.
van de Vrugt, Henri J; Koomen, Mireille; Bakker, Sietske; Berns, Mariska A D; Cheng, Ngan Ching; van der Valk, Martin A; de Vries, Yne; Rooimans, Martin A; Oostra, Anneke B; Hoatlin, Maureen E; Te Riele, Hein; Joenje, Hans; Arwert, Fré
2011-12-10
Fanconi anemia (FA) is a heritable disease characterized by bone marrow failure, congenital abnormalities, and cancer predisposition. The 15 identified FA genes operate in a molecular pathway to preserve genomic integrity. Within this pathway the FA core complex operates as a ubiquitin ligase that activates the complex of FANCD2 and FANCI to coordinate DNA repair. The FA core complex is formed by at least 12 proteins. However, only the FANCL subunit displays ubiquitin ligase activity. FANCA and FANCG are members of the FA core complex for which no functions have been described other than participation in protein interactions. In this study we generated mice with combined null alleles for Fanca and Fancg to identify extended functions for these genes by characterizing the double mutant mice and cells. Double mutant a(-/-)/g(-/-) mice were born at near Mendelian frequencies without apparent developmental abnormalities. Histological analysis of a(-/-)/g(-/-) mice revealed Leydig cell hyperplasia and frequent vacuolization of Sertoli cells in testes, while ovaries were depleted of developing follicles and displayed interstitial cell hyperplasia. These gonadal aberrations were associated with compromised fertility of a(-/-)/g(-/-) males and females. During the first year of life a(-/-)/g(-/-) mice did not develop malignancies or bone marrow failure. At the cellular level a(-/-)/g(-/-), Fanca(-/-), and Fancg(-/-) cells proved equally compromised in DNA crosslink and homology-directed repair. Overall the phenotype of a(-/-)/g(-/-) double knockout mice and cells appeared highly similar to the phenotype of Fanca or Fancg single knockouts. The lack of an augmented phenotype suggests that null mutations in Fanca or Fancg are fully epistatic, making additional important functions outside of the FA core complex highly unlikely. © 2011 Elsevier B.V. All rights reserved.
A Complex 6p25 Rearrangement in a Child With Multiple Epiphyseal Dysplasia
Bedoyan, Jirair K.; Lesperance, Marci M.; Ackley, Todd; Iyer, Ramaswamy K.; Innis, Jeffrey W.; Misra, Vinod K.
2015-01-01
Genomic rearrangements are increasingly recognized as important contributors to human disease. Here we report on an 11½-year-old child with myopia, Duane retraction syndrome, bilateral mixed hearing loss, skeletal anomalies including multiple epiphyseal dysplasia, and global developmental delay, and a complex 6p25 genomic rearrangement. We have employed oligonucleotide-based comparative genomic hybridization arrays (aCGH) of different resolutions (44 and 244K) as well as a 1 M single nucleotide polymorphism (SNP) array to analyze this complex rearrangement. Our analyses reveal a complex rearrangement involving a ~2.21 Mb interstitial deletion, a ~240 kb terminal deletion, and a 70–80 kb region in between these two deletions that shows maintenance of genomic copy number. The interstitial deletion contains eight known genes, including three Forkhead box containing (FOX) transcription factors (FOXQ1, FOXF2, and FOXC1). The region maintaining genomic copy number partly overlaps the dual specificity protein phosphatase 22 (DUSP22) gene. Array analyses suggest a homozygous loss of genomic material at the 5′ end of DUSP22, which was corroborated using TaqMan® copy number analysis. It is possible that this homozygous genomic loss may render both copies of DUSP22 or its products non-functional. Our analysis suggests a rearrangement mechanism distinct from a previously reported replication-based error-prone mechanism without template switching for a specific 6p25 rearrangement with a 1.22 Mb interstitial deletion. Our study demonstrates the utility and limitations of using oligonucleotide-based aCGH and SNP array technologies of increasing resolutions in order to identify complex DNA rearrangements and gene disruptions. PMID:21204225
Roepke, Elizabeth W.; Hua, An An; Flood, Beverly E.; Bailey, Jake V.
2017-01-01
We report the closed and annotated genome sequence of Sulfuriferula sp. strain AH1. Strain AH1 has a 2,877,007-bp chromosome that includes a partial Sox system for inorganic sulfur oxidation and a complete nitrogen fixation pathway. It also has a single 39,138-bp plasmid with genes for arsenic and mercury resistance. PMID:28798167
SELF-STERILE AUXOTROPHS AND THEIR RELATION TO HETEROTHALLISM IN SORDARIA FIMICOLA.
EL-ANI, A S
1964-09-04
Eighty morphological mutants in the homothallic fungus Sordaria fimicola were tested on liquid minimal medium for nutritional requirements. Five had nutritional requirements, one for adenine, three for arginine, and one for lysine. All five were from among the eighty single gene mutants that were also partially or completely self-sterile. Nutritional requirements and centromere-locus intervals provide better criteria than morphological characters for selecting self-sterile mutants at complex loci governing heterothallism.
The Chloroplast atpA Gene Cluster in Chlamydomonas reinhardtii
Drapier, Dominique; Suzuki, Hideki; Levy, Haim; Rimbault, Blandine; Kindle, Karen L.; Stern, David B.; Wollman, Francis-André
1998-01-01
Most chloroplast genes in vascular plants are organized into polycistronic transcription units, which generate a complex pattern of mono-, di-, and polycistronic transcripts. In contrast, most Chlamydomonas reinhardtii chloroplast transcripts characterized to date have been monocistronic. This paper describes the atpA gene cluster in the C. reinhardtii chloroplast genome, which includes the atpA, psbI, cemA, and atpH genes, encoding the α-subunit of the coupling-factor-1 (CF1) ATP synthase, a small photosystem II polypeptide, a chloroplast envelope membrane protein, and subunit III of the CF0 ATP synthase, respectively. We show that promoters precede the atpA, psbI, and atpH genes, but not the cemA gene, and that cemA mRNA is present only as part of di-, tri-, or tetracistronic transcripts. Deletions introduced into the gene cluster reveal, first, that CF1-α can be translated from di- or polycistronic transcripts, and, second, that substantial reductions in mRNA quantity have minimal effects on protein synthesis rates. We suggest that posttranscriptional mRNA processing is common in C. reinhardtii chloroplasts, permitting the expression of multiple genes from a single promoter. PMID:9625716
Genetics of Mitochondrial Disease.
Saneto, Russell P
2017-01-01
Mitochondria are intracellular organelles responsible for adenosine triphosphate production. The strict control of intracellular energy needs requires proper mitochondrial functioning. The mitochondria are under the dual control of mitochondrial DNA (mtDNA) and nuclear DNA (nDNA). Mitochondrial dysfunction can arise from changes in either mtDNA or nDNA genes regulating function. There are an estimated ∼1500 proteins in the mitoproteome, whereas the mtDNA genome encodes 37 genes. There are, to date, ∼275 genes shown to give rise to disease. The unique physiology of mitochondrial functioning contributes to diverse gene expression. The onset and range of phenotypic expression of disease are diverse, with onset from the neonatal period to the seventh decade of life. The range of dysfunction is heterogeneous, ranging from single organ to multisystem involvement. The complexity of disease expression has severely limited gene discovery. Combining phenotypes with improvements in gene sequencing strategies is improving the diagnostic process. This chapter focuses on the interplay of the unique physiology and gene discovery in the current knowledge of genetically derived mitochondrial disease. Copyright © 2017 Elsevier Inc. All rights reserved.
Purrello, M; Di Pietro, C; Rapisarda, A; Viola, A; Corsaro, C; Motta, S; Grzeschik, K H; Sichel, G
1996-01-01
Dr1 is a nuclear protein of 19 kDa that exists in the nucleoplasm as a homotetramer. By binding to TBP (the DNA-binding subunit of TFIID, and also a subunit of SL1 and TFIIIB), the protein blocks class II and class III preinitiation complex assembly, thus repressing the activity of the corresponding promoters. Since transcription of class I genes is unaffected by Dr1, it has been proposed that the protein may coordinate the expression of class I, class II and class III genes. By somatic cell genetics and fluorescence in situ hybridization, we have localized the gene (DR1), present in the genome of higher eukaryotes as a single copy, to human chromosome region 1p21→p13. The nucleotide sequence conservation of the coding segment of the gene, as determined by Noah's ark blot analysis, and its ubiquitous transcription suggest that Dr1 has an important biological role, which could be related to the negative control of cell proliferation.
Common variants of xeroderma pigmentosum genes and prostate cancer risk.
Mirecka, Aneta; Paszkowska-Szczur, Katarzyna; Scott, Rodney J; Górski, Bohdan; van de Wetering, Thierry; Wokołorczyk, Dominika; Gromowski, Tomasz; Serrano-Fernandez, Pablo; Cybulski, Cezary; Kashyap, Aniruddh; Gupta, Satish; Gołąb, Adam; Słojewski, Marcin; Sikorski, Andrzej; Lubiński, Jan; Dębniak, Tadeusz
2014-08-10
The genetic basis of prostate cancer (PC) is complex and appears to involve multiple susceptibility genes. A number of studies have evaluated a possible correlation between several nucleotide excision repair (NER) gene polymorphisms and PC risk, but most of them evaluated only single SNPs among XP genes and the results remain inconsistent. Out of 94 SNPs located in seven XP genes (XPA-XPG) a total of 15 SNPs were assayed in 720 unselected patients with PC and compared to 1121 healthy adults. An increased risk of disease was associated with the XPD SNP, rs1799793 (Asp312Asn) AG genotype (OR=2.60; p<0.001) and with the AA genotype (OR=531; p<0.0001) compared to the control population. Haplotype analysis of XPD revealed one protective haplotype and four associated with an increased disease risk, which showed that the A allele (XPD rs1799793) appeared to drive the main effect on promoting prostate cancer risk. Polymorphism in the XPD gene appears to be associated with the risk of prostate cancer. Copyright © 2014. Published by Elsevier B.V.
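For readers unfamiliar with the statistic reported in the preceding abstract, the following is a minimal sketch of how a genotype odds ratio and an approximate 95% confidence interval (Woolf's log method) can be computed from a 2×2 case-control table. The counts used here are hypothetical and are not taken from the study; the function name `odds_ratio` is likewise only illustrative.

```python
import math


def odds_ratio(case_exposed, case_unexposed, control_exposed, control_unexposed):
    """Return (OR, lower, upper) for a 2x2 table using Woolf's log-based 95% CI."""
    # Odds of carrying the genotype among cases divided by the odds among controls.
    or_value = (case_exposed * control_unexposed) / (case_unexposed * control_exposed)
    # Standard error of ln(OR) is the square root of the summed reciprocal counts.
    se = math.sqrt(
        1 / case_exposed + 1 / case_unexposed
        + 1 / control_exposed + 1 / control_unexposed
    )
    lower = math.exp(math.log(or_value) - 1.96 * se)
    upper = math.exp(math.log(or_value) + 1.96 * se)
    return or_value, lower, upper


# Hypothetical counts (NOT from the study): genotype carriers vs non-carriers.
or_value, lower, upper = odds_ratio(
    case_exposed=40, case_unexposed=60,
    control_exposed=20, control_unexposed=80,
)
print(f"OR = {or_value:.2f}, 95% CI {lower:.2f} to {upper:.2f}")
```

An interval that excludes 1 corresponds to an association that is nominally significant at the 5% level; the abstract itself reports only point estimates and p values.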
2015-01-01
In a companion paper (DOI: 10.1021/ja410934b) we demonstrate that the C-rich strand of the cis-regulatory element in the BCL2 promoter element is highly dynamic in nature and can form either an i-motif or a flexible hairpin. Under physiological conditions these two secondary DNA structures are found in an equilibrium mixture, which can be shifted by the addition of small molecules that trap out either the i-motif (IMC-48) or the flexible hairpin (IMC-76). In cellular experiments we demonstrate that the addition of these molecules has opposite effects on BCL2 gene expression and furthermore that these effects are antagonistic. In this contribution we have identified a transcription factor that recognizes and binds to the BCL2 i-motif to activate transcription. The molecular basis for the recognition of the i-motif by hnRNP LL is determined, and we demonstrate that the protein unfolds the i-motif structure to form a stable single-stranded complex. In subsequent experiments we show that IMC-48 and IMC-76 have opposite, antagonistic effects on the formation of the hnRNP LL–i-motif complex as well as on the transcription factor occupancy at the BCL2 promoter. For the first time we propose that the i-motif acts as a molecular switch that controls gene expression and that small molecules that target the dynamic equilibrium of the i-motif and the flexible hairpin can differentially modulate gene expression. PMID:24559432
Chin, Yoon-Ming; Tan, Lu Ping; Abdul Aziz, Norazlin; Mushiroda, Taisei; Kubo, Michiaki; Mohd Kornain, Noor Kaslina; Tan, Geok Wee; Khoo, Alan Soo-Beng; Krishnan, Gopala; Pua, Kin-Choo; Yap, Yoke-Yeow; Teo, Soo-Hwang; Lim, Paul Vey-Hong; Nakamura, Yusuke; Lum, Chee Lun; Ng, Ching-Ching
2016-10-15
Nasopharyngeal carcinoma (NPC) is an epithelial squamous cell carcinoma on the mucosal lining of the nasopharynx. The etiology of NPC remains elusive despite many reported studies. Most studies employ a single platform approach, neglecting the cumulative influence of both the genome and transcriptome toward NPC development. We aim to employ an integrated pathway approach to identify dysregulated pathways linked to NPC. Our approach combines imputation NPC GWAS data from a Malaysian cohort as well as published expression data GSE12452 from both NPC and non-NPC nasopharynx tissues. Pathway association for GWAS data was performed using MAGENTA while for expression data, GSA-SNP was used with gene p values derived from differential expression values from GEO2R. Our study identified NPC association in the gene ontology (GO) axonemal dynein complex pathway (pGWAS-GSEA = 1.98 × 10⁻²; pExpr-GSEA = 1.27 × 10⁻²⁴; pBonf-Combined = 4.15 × 10⁻²¹). This association was replicated in a separate cohort using gene expression data from NPC and non-NPC nasopharynx tissues (pAmpliSeq-GSEA = 6.56 × 10⁻⁴). Loss of function in the axonemal dynein complex causes impaired cilia function, leading to poor mucociliary clearance and subsequently upper or lower respiratory tract infection, the former of which includes the nasopharynx. Our approach illustrates the potential use of integrated pathway analysis in detecting gene sets involved in the development of NPC in the Malaysian cohort. © 2016 UICC.
Salas-Leiva, Dayana E; Meerow, Alan W; Calonje, Michael; Francisco-Ortega, Javier; Griffith, M Patrick; Nakamura, Kyoko; Sánchez, Vanessa; Knowles, Lindy; Knowles, David
2017-05-01
The Bahamas archipelago is formed by young, tectonically stable carbonate banks that harbor direct geological evidence of global ice-volume changes. We sought to detect signatures of major changes on gene flow patterns and reconstruct the phylogeographic history of the monophyletic Zamia pumila complex across the Bahamas. Nuclear molecular markers with both high and low mutation rates were used to capture two different time scale signatures and test several gene flow and demographic hypotheses. Single-copy nuclear genes unveiled apparent ancestral admixture on Andros, suggesting a significant role of this island as the main hub of diversity of the archipelago. We detected demographic and spatial expansion of the Zamia pumila complex on both paleo-provinces around the Piacenzian (Pliocene)/Gelasian (Pleistocene). Populations evidenced signatures of different migration models that have occurred at two different times. Populations on Long Island (Z. lucayana) may either represent a secondary colonization of the Bahamas by Zamia or a rapid and early-divergence event of at least one population on the Bahamas. Despite changes in migration patterns with global climate, expected heterozygosity with both marker systems remains within the range reported for cycads, but with significant levels of increased inbreeding detected by the microsatellites. This finding is likely associated with reduced gene flow between and within paleo-provinces, accompanied by genetic drift, as rising seas enforced isolation. Our study highlights the importance of the maintenance of the predominant direction of genetic exchange and the role of overseas dispersion among the islands during climate oscillations. © 2017 Botanical Society of America.
Lee, Wonbae; Gillies, John P.; Jose, Davis; Israels, Brett A.; von Hippel, Peter H.; Marcus, Andrew H.
2016-01-01
Gene 32 protein (gp32) is the single-stranded (ss) DNA binding protein of the bacteriophage T4. It binds transiently and cooperatively to ssDNA sequences exposed during the DNA replication process and regulates the interactions of the other sub-assemblies of the replication complex during the replication cycle. We here use single-molecule FRET techniques to build on previous thermodynamic studies of gp32 binding to initiate studies of the dynamics of the isolated and cooperative binding of gp32 molecules within the replication complex. DNA primer/template (p/t) constructs are used as models to determine the effects of ssDNA lattice length, gp32 concentration, salt concentration, binding cooperativity and binding polarity at p/t junctions. Hidden Markov models (HMMs) and transition density plots (TDPs) are used to characterize the dynamics of the multi-step assembly pathway of gp32 at p/t junctions of differing polarity, and show that isolated gp32 molecules bind to their ssDNA targets weakly and dissociate quickly, while cooperatively bound dimeric or trimeric clusters of gp32 bind much more tightly, can ‘slide’ on ssDNA sequences, and exhibit binding dynamics that depend on p/t junction polarities. The potential relationships of these binding dynamics to interactions with other components of the T4 DNA replication complex are discussed. PMID:27694621
Fornander, Louise H; Frykholm, Karolin; Reymer, Anna; Renodon-Cornière, Axelle; Takahashi, Masayuki; Nordén, Bengt
2012-06-01
Human RAD51 protein (HsRad51) catalyses the DNA strand exchange reaction for homologous recombination. To clarify the molecular mechanism of the reaction in vitro being more effective in the presence of Ca²⁺ than of Mg²⁺, we have investigated the effect of these ions on the structure of HsRad51 filament complexes with single- and double-stranded DNA, the reaction intermediates. Flow linear dichroism spectroscopy shows that the two ionic conditions induce significantly different structures in the HsRad51/single-stranded DNA complex, while the HsRad51/double-stranded DNA complex does not demonstrate this ionic dependence. In the HsRad51/single-stranded DNA filament, the primary intermediate of the strand exchange reaction, ATP/Ca²⁺ induces an ordered conformation of DNA, with preferentially perpendicular orientation of nucleobases relative to the filament axis, while the presence of ATP/Mg²⁺, ADP/Mg²⁺ or ADP/Ca²⁺ does not. A high strand exchange activity is observed for the filament formed with ATP/Ca²⁺, whereas the other filaments exhibit lower activity. Molecular modelling suggests that the structural variation is caused by the divalent cation interfering with the L2 loop close to the DNA-binding site. It is proposed that the larger Ca²⁺ stabilizes the loop conformation and thereby the protein-DNA interaction. A tight binding of DNA, with bases perpendicularly oriented, could facilitate strand exchange.
Gardeux, Vincent; David, Fabrice P. A.; Shajkofci, Adrian; Schwalie, Petra C.; Deplancke, Bart
2017-01-01
Motivation: Single-cell RNA-sequencing (scRNA-seq) allows whole transcriptome profiling of thousands of individual cells, enabling the molecular exploration of tissues at the cellular level. Such analytical capacity is of great interest to many research groups in the world, yet these groups often lack the expertise to handle complex scRNA-seq datasets. Results: We developed a fully integrated, web-based platform aimed at the complete analysis of scRNA-seq data post genome alignment: from the parsing, filtering and normalization of the input count data files, to the visual representation of the data, identification of cell clusters, differentially expressed genes (including cluster-specific marker genes), and functional gene set enrichment. This Automated Single-cell Analysis Pipeline (ASAP) combines a wide range of commonly used algorithms with sophisticated visualization tools. Compared with existing scRNA-seq analysis platforms, researchers (including those lacking computational expertise) are able to interact with the data in a straightforward fashion and in real time. Furthermore, given the overlap between scRNA-seq and bulk RNA-seq analysis workflows, ASAP should conceptually be broadly applicable to any RNA-seq dataset. As a validation, we demonstrate how we can use ASAP to simply reproduce the results from a single-cell study of 91 mouse cells involving five distinct cell types. Availability and implementation: The tool is freely available at asap.epfl.ch and R/Python scripts are available at github.com/DeplanckeLab/ASAP. Contact: bart.deplancke@epfl.ch. Supplementary information: Supplementary data are available at Bioinformatics online. PMID:28541377
Gardeux, Vincent; David, Fabrice P A; Shajkofci, Adrian; Schwalie, Petra C; Deplancke, Bart
2017-10-01
Single-cell RNA-sequencing (scRNA-seq) allows whole transcriptome profiling of thousands of individual cells, enabling the molecular exploration of tissues at the cellular level. Such analytical capacity is of great interest to many research groups in the world, yet these groups often lack the expertise to handle complex scRNA-seq datasets. We developed a fully integrated, web-based platform aimed at the complete analysis of scRNA-seq data post genome alignment: from the parsing, filtering and normalization of the input count data files, to the visual representation of the data, identification of cell clusters, differentially expressed genes (including cluster-specific marker genes), and functional gene set enrichment. This Automated Single-cell Analysis Pipeline (ASAP) combines a wide range of commonly used algorithms with sophisticated visualization tools. Compared with existing scRNA-seq analysis platforms, researchers (including those lacking computational expertise) are able to interact with the data in a straightforward fashion and in real time. Furthermore, given the overlap between scRNA-seq and bulk RNA-seq analysis workflows, ASAP should conceptually be broadly applicable to any RNA-seq dataset. As a validation, we demonstrate how we can use ASAP to simply reproduce the results from a single-cell study of 91 mouse cells involving five distinct cell types. The tool is freely available at asap.epfl.ch and R/Python scripts are available at github.com/DeplanckeLab/ASAP. Contact: bart.deplancke@epfl.ch. Supplementary data are available at Bioinformatics online. © The Author(s) 2017. Published by Oxford University Press.
Betz-Stablein, B. D.; Töpfer, A.; Littlejohn, M.; Yuen, L.; Colledge, D.; Sozzi, V.; Angus, P.; Thompson, A.; Revill, P.; Beerenwinkel, N.; Warner, N.
2016-01-01
Chronic hepatitis B (CHB) is prevalent worldwide. The infectious agent, hepatitis B virus (HBV), replicates via an RNA intermediate and is error prone, leading to the rapid generation of closely related but not identical viral variants, including those that can escape host immune responses and antiviral treatments. The complexity of CHB can be further enhanced by the presence of HBV variants with large deletions in the genome generated via splicing (spHBV variants). Although spHBV variants are incapable of autonomous replication, their replication is rescued by wild-type HBV. spHBV variants have been shown to enhance wild-type virus replication, and their prevalence increases with liver disease progression. Single-molecule deep sequencing was performed on whole HBV genomes extracted from samples, including the liver explant, longitudinally collected from a subject with CHB over a 15-year period after liver transplantation. By employing novel bioinformatics methods, this analysis showed that the dynamics of the viral population across a period of changing treatment regimens was complex. The spHBV variants detected in the liver explant remained present posttransplantation, and a highly diverse novel spHBV population as well as variants with multiple deletions in the pre-S genes emerged. The identification of novel mutations outside the HBV reverse transcriptase gene that co-occurred with known drug resistance-associated mutations highlights the relevance of using full-genome deep sequencing and supports the hypothesis that drug resistance involves interactions across the full length of the HBV genome. Importance: Single-molecule sequencing allowed the characterization, in unprecedented detail, of the evolution of HBV populations and offered unique insights into the dynamics of defective and spHBV variants following liver transplantation and complex treatment regimens.
This analysis also showed the rapid adaptation of HBV populations to treatment regimens with evolving drug resistance phenotypes and evidence of purifying selection across the whole genome. Finally, the new open-source bioinformatics tools with the capacity to easily identify potential spliced variants from deep sequencing data are freely available. PMID:27252524
Knowledge-driven genomic interactions: an application in ovarian cancer.
Kim, Dokyoon; Li, Ruowang; Dudek, Scott M; Frase, Alex T; Pendergrass, Sarah A; Ritchie, Marylyn D
2014-01-01
Effective cancer clinical outcome prediction for understanding of the mechanism of various types of cancer has been pursued using molecular-based data such as gene expression profiles, an approach that has promise for providing better diagnostics and supporting further therapies. However, clinical outcome prediction based on gene expression profiles varies between independent data sets. Further, single-gene expression outcome prediction is limited for cancer evaluation since genes do not act in isolation, but rather interact with other genes in complex signaling or regulatory networks. In addition, since pathways are more likely to cooperate, it would be desirable to incorporate expert knowledge to combine pathways in a useful and informative manner. Thus, we propose a novel approach for identifying knowledge-driven genomic interactions and applying it to discover models associated with cancer clinical phenotypes using grammatical evolution neural networks (GENN). In order to demonstrate the utility of the proposed approach, an ovarian cancer data set from The Cancer Genome Atlas (TCGA) was used for predicting clinical stage as a pilot project. We identified knowledge-driven genomic interactions associated with cancer stage not only from single knowledge bases such as sources of pathway-pathway interaction, but also across different sets of knowledge bases such as pathway-protein family interactions by integrating different types of information. Notably, an integration model from different sources of biological knowledge achieved 78.82% balanced accuracy and outperformed the top models with gene expression or single knowledge-based data types alone. Furthermore, the results from the models are more interpretable because they are framed in the context of specific biological pathways or other expert knowledge.
The success of the pilot study we have presented herein will allow us to pursue further identification of models predictive of clinical cancer survival and recurrence. Understanding the underlying tumorigenesis and progression in ovarian cancer through the global view of interactions within/between different biological knowledge sources has the potential for providing more effective screening strategies and therapeutic targets for many types of cancer.
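The abstract above reports model quality as balanced accuracy. As a minimal sketch, assuming the usual definition (the mean of per-class recall), the metric can be computed as follows. The labels and predictions are invented toy values, not TCGA data, and the function name is hypothetical.

```python
def balanced_accuracy(y_true, y_pred):
    """Mean of per-class recall (assumed definition; not taken from the paper)."""
    classes = sorted(set(y_true))
    recalls = []
    for c in classes:
        # Indices of all samples whose true label is class c
        idx = [i for i, t in enumerate(y_true) if t == c]
        correct = sum(1 for i in idx if y_pred[i] == c)
        recalls.append(correct / len(idx))
    return sum(recalls) / len(recalls)


# Toy, imbalanced example: 2 "early" and 8 "late" stage samples (made up).
y_true = ["early"] * 2 + ["late"] * 8
y_pred = ["early", "late"] + ["late"] * 8

score = balanced_accuracy(y_true, y_pred)
plain = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
print(f"balanced accuracy = {score:.2f}, plain accuracy = {plain:.2f}")
```

On imbalanced classes such as this toy set, plain accuracy rewards predicting the majority class, whereas balanced accuracy weights each class equally.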
Genetics of Migraine: Insights into the Molecular Basis of Migraine Disorders.
Sutherland, Heidi G; Griffiths, Lyn R
2017-04-01
Migraine is a complex, debilitating neurovascular disorder, typically characterized by recurring, incapacitating attacks of severe headache often accompanied by nausea and neurological disturbances. It has a strong genetic basis demonstrated by rare migraine disorders caused by mutations in single genes (monogenic), as well as familial clustering of common migraine, which is associated with polymorphisms in many genes (polygenic). Hemiplegic migraine is a dominantly inherited, severe form of migraine with associated motor weakness. Family studies have found that mutations in three different ion channel genes, CACNA1A, ATP1A2, and SCN1A, can be causal. Functional studies of these mutations have shown that they can result in defective regulation of glutamatergic neurotransmission and the excitatory/inhibitory balance in the brain, which lowers the threshold for cortical spreading depression, a wave of cortical depolarization thought to be involved in headache initiation mechanisms. Other putative genes for monogenic migraine include KCNK18, PRRT2, and CSNK1D, which can also be involved in other disorders. There are a number of primarily vascular disorders caused by mutations in single genes, which are often accompanied by migraine symptoms. Mutations in NOTCH3 cause cerebral autosomal dominant arteriopathy with subcortical infarcts and leukoencephalopathy (CADASIL), a hereditary cerebrovascular disease that leads to ischemic strokes and dementia, but in which migraine is often present, sometimes long before the onset of other symptoms. Mutations in TREX1 and COL4A1 also cause vascular disorders that often feature migraine. With respect to common polygenic migraine, genome-wide association studies have now identified single nucleotide polymorphisms at 38 loci significantly associated with migraine risk. 
Functions assigned to the genes in proximity to these loci suggest that both neuronal and vascular pathways also contribute to the pathophysiology of common migraine. Further studies are required to fully understand these findings and translate them into treatment options for migraine patients. © 2017 American Headache Society.
Pompey, Justine M; Foda, Bardees; Singh, Upinder
2015-01-01
Dicer enzymes process double-stranded RNA (dsRNA) into small RNAs that target gene silencing through the RNA interference (RNAi) pathway. Dicer enzymes are complex, multi-domain RNaseIII proteins; however, structural minimalism of this protein has recently emerged in parasitic and fungal systems. The most minimal Dicer, Saccharomyces castellii Dicer1, has a single RNaseIII domain and two double-stranded RNA-binding domains. In the protozoan parasite Entamoeba histolytica, 27nt small RNAs are abundant and mediate silencing, yet no canonical Dicer enzyme has been identified. Although EhRNaseIII does not exhibit robust dsRNA cleavage in vitro, it can process dsRNA in the RNAi-negative background of Saccharomyces cerevisiae, and in conjunction with S. castellii Argonaute1 can partially reconstitute the RNAi pathway. Thus, although EhRNaseIII lacks the domain architecture of canonical or minimal Dicer enzymes, it has dsRNA processing activity that contributes to gene silencing via RNAi. Our data advance the understanding of small RNA biogenesis in Entamoeba as well as broaden the spectrum of non-canonical Dicer enzymes that contribute to the RNAi pathway. PMID:26230096
Kim, Seong U; Batule, Bhagwan S; Mun, Hyoyoung; Byun, Ju-Young; Shim, Won-Bo; Kim, Min-Gon
2018-02-07
We have developed a novel strategy for the colorimetric detection of PCR products by utilizing a target-specific primer modified at the 5'-end with an anti-DNAzyme sequence. A single-stranded DNAzyme sequence folds into a G-quadruplex structure with hemin and shows strong peroxidase activity. When the complementary strand binds to the DNAzyme sequence, it blocks the formation of the G-quadruplex structure, and the peroxidase activity is lost. In the presence of the target gene, PCR amplification proceeds, and the anti-DNAzyme sequence-modified primers present in the reaction mixture form a double strand through primer extension. Therefore, they do not block the DNAzyme sequence. Further, a colorimetric signal is generated by the addition of 2,2'-azino-bis(3-ethylbenzothiazoline-6-sulfonate) (ABTS) and H2O2 at the end of the reaction. We have successfully detected a single copy of the HIV type 1 gag gene in buffer and 10 copies in human serum. The strategy developed could be used to detect DNA and RNA in complex biological samples by simple primer design that includes a DNAzyme and a DNA extended primer.
An Empirically Calibrated Model of Cell Fate Decision Following Viral Infection
NASA Astrophysics Data System (ADS)
Coleman, Seth; Igoshin, Oleg; Golding, Ido
The life cycle of the virus (phage) lambda is an established paradigm for the way genetic networks drive cell fate decisions. But despite decades of interrogation, we are still unable to theoretically predict whether the infection of a given cell will result in cell death or viral dormancy. The poor predictive power of current models reflects the absence of quantitative experimental data describing the regulatory interactions between different lambda genes. To address this gap, we are constructing a theoretical model that captures the known interactions in the lambda network. Model assumptions and parameters are calibrated using new single-cell data from our lab, describing the activity of lambda genes at single-molecule resolution. We began with a mean-field model, aimed at exploring the population-averaged gene-expression trajectories under different initial conditions. Next, we will develop a stochastic formulation to capture the differences between individual cells within the population. The eventual goal is to identify how the post-infection decision is driven by the interplay between network topology, initial conditions, and stochastic effects. The insights gained here will inform our understanding of cell fate choices in more complex cellular systems.
Wolfe, Benjamin E.; Tulloss, Rodham E.; Pringle, Anne
2012-01-01
Microbial symbioses have evolved repeatedly across the tree of life, but the genetic changes underlying transitions to symbiosis are largely unknown, especially for eukaryotic microbial symbionts. We used the genus Amanita, an iconic group of mushroom-forming fungi engaged in ectomycorrhizal symbioses with plants, to identify both the origins and potential genetic changes maintaining the stability of this mutualism. A multi-gene phylogeny reveals one origin of the symbiosis within Amanita, with a single transition from saprotrophic decomposition of dead organic matter to biotrophic dependence on host plants for carbon. Associated with this transition are the losses of two cellulase genes, each of which plays a critical role in extracellular decomposition of organic matter. However, a third gene, which acts at later stages in cellulose decomposition, is retained by many, but not all, ectomycorrhizal species. Experiments confirm that symbiotic Amanita species have lost the ability to grow on complex organic matter and have therefore lost the capacity to live in forest soils without carbon supplied by a host plant. Irreversible losses of decomposition pathways are likely to play key roles in the evolutionary stability of these ubiquitous mutualisms. PMID:22815710
Complete sequence and comparative analysis of the chloroplast genome of Plinia trunciflora
Eguiluz, Maria; Yuyama, Priscila Mary; Guzman, Frank; Rodrigues, Nureyev Ferreira; Margis, Rogerio
2017-01-01
Plinia trunciflora is a Brazilian native fruit tree from the Myrtaceae family, also known as jaboticaba. This species has great potential for fruit production. Due to the high content of essential oils in its leaves and of anthocyanins in its fruits, there is also increasing interest from the pharmaceutical industry. Nevertheless, there are few studies focusing on its molecular biology and genetic characterization. We herein report the complete chloroplast (cp) genome of P. trunciflora using high-throughput sequencing and compare it to other previously sequenced Myrtaceae genomes. The cp genome of P. trunciflora is 159,512 bp in size, comprising inverted repeats of 26,414 bp and single-copy regions of 88,097 bp (LSC) and 18,587 bp (SSC). The genome contains 111 single-copy genes (77 protein-coding, 30 tRNA and four rRNA genes). Phylogenetic analysis using 57 cp protein-coding genes demonstrated that P. trunciflora, Eugenia uniflora and Acca sellowiana form a cluster more closely related to Syzygium cumini than to Eucalyptus. The complete cp sequence reported here can be used in evolutionary and population genetics studies, helping to resolve the complex taxonomy of this species and fill the gap in genetic characterization. PMID:29111566
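The region sizes quoted in the abstract above can be cross-checked with simple arithmetic. The sketch below assumes the standard quadripartite chloroplast layout, in which the inverted repeat occurs twice; that assumption is ours, although it agrees with the abstract's plural "inverted repeats".

```python
# Region sizes in bp, copied from the abstract.
LSC = 88_097   # large single-copy region
SSC = 18_587   # small single-copy region
IR = 26_414    # one inverted-repeat copy

# Assumption: two IR copies (IRa and IRb), as in a typical chloroplast genome.
total = LSC + SSC + 2 * IR
print(f"reconstructed genome size: {total:,} bp")
```

The sum matches the reported genome size of 159,512 bp, so the figures in the abstract are internally consistent under this assumption.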
Zhang, Hao; Liu, Shiyu; Zhu, Bin; Xu, Qiu; Ding, Yin; Jin, Yan
2016-11-14
Tissue-engineering strategies based on mesenchymal stem cells (MSCs) and cell sheets have been widely used for periodontal tissue regeneration. However, given the complexity of periodontal structure, regeneration methods using a single species of MSC cannot fulfill the requirements for periodontal regeneration. We investigated the interaction between periodontal ligament stem cells (PDLSCs) and jaw bone marrow-derived mesenchymal stem cells (JBMMSCs), and constructed a composite cell sheet comprising both MSC types to regenerate complex periodontium-like structures in nude mice. Our results show that by co-culturing PDLSCs and JBMMSCs, the expression of bone and extracellular matrix (ECM)-related genes and proteins was significantly improved in both MSCs. Further investigations showed that, compared to the cell sheet using PDLSCs or JBMMSCs alone, the composite stem cell sheet (CSCS), which comprises these two MSCs, expressed higher levels of bone- and ECM-related genes and proteins, and generated a composite structure physiologically more similar to native periodontal tissue in vivo. In conclusion, our results demonstrate that the crosstalk between PDLSCs and JBMMSCs in cell sheets facilitates regeneration of complex periodontium-like structures, providing a promising new strategy for physiological and functional regeneration of periodontal tissue.
Mechanisms and pathways of growth failure in primordial dwarfism.
Klingseisen, Anna; Jackson, Andrew P
2011-10-01
The greatest difference between species is size; however, the developmental mechanisms determining organism growth remain poorly understood. Primordial dwarfism is a group of human single-gene disorders with extreme global growth failure (which includes Seckel syndrome, microcephalic osteodysplastic primordial dwarfism [MOPD] types I and II, and Meier-Gorlin syndrome). Ten genes have now been identified for microcephalic primordial dwarfism, encoding proteins involved in fundamental cellular processes including genome replication (ORC1 [origin recognition complex 1], ORC4, ORC6, CDT1, and CDC6), DNA damage response (ATR [ataxia-telangiectasia and Rad3-related]), mRNA splicing (U4atac), and centrosome function (CEP152, PCNT, and CPAP). Here, we review the cellular and developmental mechanisms underlying the pathogenesis of these conditions and address whether further study of these genes could provide novel insight into the physiological regulation of organism growth.
Molecular genetic basis of pod corn (Tunicate maize)
Wingen, Luzie U.; Münster, Thomas; Faigl, Wolfram; Deleu, Wim; Sommer, Hans; Saedler, Heinz; Theißen, Günter
2012-01-01
Pod corn is a classic morphological mutant of maize in which the mature kernels of the cob are covered by glumes, in contrast to generally grown maize varieties in which kernels are naked. Pod corn, known since pre-Columbian times, is the result of a dominant gain-of-function mutation at the Tunicate (Tu) locus. Some classic articles of 20th century maize genetics reported that the mutant Tu locus is complex, but molecular details remained elusive. Here, we show that pod corn is caused by a cis-regulatory mutation and duplication of the ZMM19 MADS-box gene. Although the WT locus contains a single-copy gene that is expressed in vegetative organs only, mutation and duplication of ZMM19 in Tu lead to ectopic expression of the gene in the inflorescences, thus conferring vegetative traits to reproductive organs. PMID:22517751
[A study of PDE6B gene mutation and phenotype in Chinese cases with retinitis pigmentosa].
Cui, Yun; Zhao, Kan-xing; Wang, Li; Wang, Qing; Zhang, Wei; Chen, Wei-ying; Wang, Li-ming
2003-01-01
To identify the mutation spectrum of the phosphodiesterase beta subunit (PDE6B) gene, its incidence in Chinese patients with retinitis pigmentosa (RP), and the associated clinical phenotypic characteristics. Screening for mutations within the PDE6B gene was performed using polymerase chain reaction-heteroduplex-single strand conformation polymorphism (PCR-SSCP) analysis and DNA sequencing in 35 autosomal recessive (AR) RP and 55 sporadic RP cases. The phenotypes of the patients carrying mutations were examined and analyzed. Novel complex heterozygous variants of the PDE6B gene were found in one sporadic case: a T to C transition in codon 323 resulting in the substitution of Gly by Ser, and a 2-bp (GT) insertion between the 27th and 28th bp upstream of the 5'-end of exon 10, both present in the same isolated RP case. Neither was found in 100 unrelated healthy individuals. Ocular findings showed diffuse pigmentary retinal degeneration in the midperipheral and peripheral fundi, optic atrophy and vessel attenuation. Multi-focal ERG indicated that rod function was more severely deteriorated. In one RP case from an ARRP family, a G to A transition was found at the 19th base upstream of the 5'-end of exon 11 (within intron 10) of the PDE6B gene. One sporadic RP case carried a sequence variant of the PDE6B gene, a G to C transversion at the 15th base adjacent to the 3'-end of exon 18. In another isolated RP case, a 2-bp (GT) insertion was found between the 31st and 32nd bases upstream of the 5'-end of exon 4 (in intron 3) of the PDE6B gene. Novel complex heterozygous mutations of the PDE6B gene are responsible for RP in a sporadic patient in China. These mutations are associated with rod deterioration and RP. Several DNA variants were found in introns of the PDE6B gene in the Chinese population.
Limit cycles in piecewise-affine gene network models with multiple interaction loops
NASA Astrophysics Data System (ADS)
Farcot, Etienne; Gouzé, Jean-Luc
2010-01-01
In this article, we consider piecewise affine differential equations modelling gene networks. We work with arbitrary decay rates, and under a local hypothesis expressed as an alignment condition of successive focal points. The interaction graph of the system may be rather complex (multiple intricate loops of any sign, multiple thresholds, etc.). Our main result is an alternative theorem showing that if a sequence of regions is periodically visited by trajectories, then under our hypotheses, there exists either a unique stable periodic solution, or the origin attracts all trajectories in this sequence of regions. This result greatly extends our previous work on a single negative feedback loop. We give several examples and simulations illustrating different cases.
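To make the model class in the abstract above concrete, here is a minimal sketch of a piecewise-affine gene network of the generic form dx_i/dt = kappa_i(x) - gamma_i * x_i, where production switches on or off as regulators cross a threshold. The three-gene cyclic repression loop, the parameter values, and the forward-Euler integrator are all illustrative assumptions, not the authors' examples or method.

```python
def step(x, theta):
    """Heaviside-style switch: 1 above the threshold, 0 otherwise."""
    return 1.0 if x > theta else 0.0


def simulate(x0, kappa, gamma, theta, dt=0.001, n_steps=20000):
    """Forward-Euler integration of a cyclic repression loop.

    Gene i is repressed by gene i-1 (indices wrap around), so production of
    gene i is kappa[i] while x[i-1] is below theta[i-1], and 0 otherwise.
    """
    x = list(x0)
    traj = [tuple(x)]
    n = len(x)
    for _ in range(n_steps):
        prod = [kappa[i] * (1.0 - step(x[i - 1], theta[i - 1])) for i in range(n)]
        x = [x[i] + dt * (prod[i] - gamma[i] * x[i]) for i in range(n)]
        traj.append(tuple(x))
    return traj


# Hypothetical parameters: production 2, decay 1, threshold 1 for every gene.
kappa = [2.0, 2.0, 2.0]
gamma = [1.0, 1.0, 1.0]
theta = [1.0, 1.0, 1.0]
traj = simulate((0.5, 1.5, 0.2), kappa, gamma, theta)
print("final state:", tuple(round(v, 3) for v in traj[-1]))
```

Within each region bounded by the thresholds the dynamics are affine and relax toward a focal point (kappa_i / gamma_i or 0), which is why trajectories stay inside the box [0, kappa_i / gamma_i]. An event-driven solver that integrates each affine piece exactly would be more accurate than fixed-step Euler near the thresholds.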
Adiponectin gene polymorphisms: Association with childhood obesity
Fraga, Vanêssa Gomes; Gomes, Karina Braga
2014-01-01
The current childhood obesity epidemic represents a particular challenge for public health. Understanding the etiological mechanisms of obesity remains integral to treating this complex disorder. In recent years, studies have elucidated the influence of hormones secreted by adipose tissue, known as adipokines. Adiponectin is an adipokine that exhibits important anti-inflammatory, insulin-sensitizing and anti-atherogenic properties, and it is strongly associated with obesity development. It is well known that adiponectin levels decrease with obesity. Furthermore, studies show that some single nucleotide polymorphisms in the gene encoding adiponectin, ADIPOQ, may influence the expression of this protein. The objective of this paper is to provide an up-to-date review of ADIPOQ polymorphisms in the context of childhood obesity. PMID:27625863
Chromatin isolation by RNA purification (ChIRP).
Chu, Ci; Quinn, Jeffrey; Chang, Howard Y
2012-03-25
Long noncoding RNAs (lncRNAs) are key regulators of chromatin states for important biological processes such as dosage compensation, imprinting, and developmental gene expression (1,2,3,4,5,6,7). The recent discovery of thousands of lncRNAs in association with specific chromatin modification complexes, such as Polycomb Repressive Complex 2 (PRC2) that mediates histone H3 lysine 27 trimethylation (H3K27me3), suggests broad roles for numerous lncRNAs in managing chromatin states in a gene-specific fashion (8,9). While some lncRNAs are thought to work in cis on neighboring genes, other lncRNAs work in trans to regulate distantly located genes. For instance, Drosophila lncRNAs roX1 and roX2 bind numerous regions on the X chromosome of male cells, and are critical for dosage compensation (10,11). However, the exact locations of their binding sites are not known at high resolution. Similarly, human lncRNA HOTAIR can affect PRC2 occupancy on hundreds of genes genome-wide (3,12,13), but how specificity is achieved is unclear. LncRNAs can also serve as modular scaffolds to recruit the assembly of multiple protein complexes. The classic trans-acting RNA scaffold is the TERC RNA that serves as the template and scaffold for the telomerase complex (14); HOTAIR can also serve as a scaffold for PRC2 and an H3K4 demethylase complex (13). Prior studies mapping RNA occupancy at chromatin have revealed substantial insights (15,16), but only at a single gene locus at a time. The occupancy sites of most lncRNAs are not known, and the roles of lncRNAs in chromatin regulation have been mostly inferred from the indirect effects of lncRNA perturbation. Just as chromatin immunoprecipitation followed by microarray or deep sequencing (ChIP-chip or ChIP-seq, respectively) has greatly improved our understanding of protein-DNA interactions on a genomic scale, here we illustrate a recently published strategy to map long RNA occupancy genome-wide at high resolution (17). 
This method, Chromatin Isolation by RNA Purification (ChIRP) (Figure 1), is based on affinity capture of target lncRNA:chromatin complex by tiling antisense-oligos, which then generates a map of genomic binding sites at a resolution of several hundred bases with high sensitivity and low background. ChIRP is applicable to many lncRNAs because the design of affinity-probes is straightforward given the RNA sequence and requires no knowledge of the RNA's structure or functional domains.
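The abstract notes that designing tiling antisense probes needs only the RNA sequence. A minimal sketch of that idea follows: slide a window along the RNA and emit the DNA reverse complement of each window. The probe length, spacing, function names, and toy sequence are hypothetical placeholders rather than values from the ChIRP protocol, and practical design filters (GC content, repeats, off-target homology) are omitted.

```python
# RNA base -> complementary DNA base
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "U": "A"}


def antisense_dna(rna):
    """Return the DNA oligo that is reverse-complementary to an RNA segment."""
    return "".join(COMPLEMENT[base] for base in reversed(rna.upper()))


def tile_probes(rna, probe_len=20, spacing=100):
    """Pick one antisense probe every `spacing` nt along the RNA.

    probe_len and spacing are illustrative defaults, not protocol values.
    Returns (start_position, probe_sequence) pairs.
    """
    probes = []
    for start in range(0, len(rna) - probe_len + 1, spacing):
        target = rna[start:start + probe_len]
        probes.append((start, antisense_dna(target)))
    return probes


# Toy 240-nt RNA, made up purely for illustration.
rna = "AUGC" * 60
probes = tile_probes(rna)
for start, seq in probes:
    print(start, seq)
```

Because each probe is derived directly from the transcript sequence, no knowledge of RNA structure or functional domains enters the design, which is the point the abstract makes.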