Sample records for gene variable number

  1. Exploratory factor analysis of pathway copy number data with an application towards the integration with gene expression data.

    PubMed

    van Wieringen, Wessel N; van de Wiel, Mark A

    2011-05-01

    Realizing that genes often operate together, studies into the molecular biology of cancer shift focus from individual genes to pathways. In order to understand the regulatory mechanisms of a pathway, one must study its genes at all molecular levels. To facilitate such study at the genomic level, we developed exploratory factor analysis for the characterization of the variability of a pathway's copy number data. A latent variable model that describes the call probability data of a pathway is introduced and fitted with an EM algorithm. In two breast cancer data sets, it is shown that the first two latent variables of GO nodes, which inherit a clear interpretation from the call probabilities, are often related to the proportion of aberrations and a contrast of the probabilities of a loss and of a gain. Linking the latent variables to the node's gene expression data suggests that they capture the "global" effect of genomic aberrations on these transcript levels. In all, the proposed method provides a possibly insightful characterization of pathway copy number data, which may be fruitfully exploited to study the interaction between the pathway's DNA copy number aberrations and data from other molecular levels like gene expression.

  2. State Space Model with hidden variables for reconstruction of gene regulatory networks.

    PubMed

    Wu, Xi; Li, Peng; Wang, Nan; Gong, Ping; Perkins, Edward J; Deng, Youping; Zhang, Chaoyang

    2011-01-01

    The State Space Model (SSM) is a relatively new approach to inferring gene regulatory networks. It requires less computational time than Dynamic Bayesian Networks (DBN). There are two types of variables in the linear SSM: observed variables and hidden variables. SSM uses an iterative method, namely Expectation-Maximization, to infer regulatory relationships from microarray datasets. The hidden variables cannot be directly observed from experiments, and the choice of their number has a significant impact on the accuracy of network inference. In this study, we used SSM to infer gene regulatory networks (GRNs) from synthetic time series datasets, investigated Bayesian Information Criterion (BIC) and Principal Component Analysis (PCA) approaches to determining the number of hidden variables in SSM, and evaluated the performance of SSM in comparison with DBN. True GRNs and synthetic gene expression datasets were generated using GeneNetWeaver. Both DBN and linear SSM were used to infer GRNs from the synthetic datasets. The inferred networks were compared with the true networks. Our results show that inference precision varied with the number of hidden variables. For some regulatory networks, the inference precision of DBN was higher, but SSM performed better in other cases. Although the overall performance of the two approaches is comparable, SSM is much faster and capable of inferring much larger networks than DBN. This study provides useful information for handling the hidden variables and improving the inference precision.
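
    The record above mentions a PCA approach to choosing the number of hidden variables but does not describe the procedure. The sketch below illustrates one common heuristic, not necessarily the authors' method: keep the smallest number of principal components whose cumulative explained variance reaches a cutoff. The function name, the 0.9 default cutoff, and the synthetic data are all assumptions made for illustration.

```python
import numpy as np


def n_components_for_variance(data, threshold=0.9):
    """Smallest number of principal components whose cumulative
    explained variance reaches `threshold`.

    data: array of shape (time_points, genes).
    threshold: hypothetical cutoff, not taken from the abstract.
    """
    centered = data - data.mean(axis=0)
    # Squared singular values of the centered matrix are proportional
    # to the variance captured by each principal component.
    singular_values = np.linalg.svd(centered, compute_uv=False)
    explained = singular_values ** 2
    ratio = np.cumsum(explained) / explained.sum()
    # First index where the cumulative ratio reaches the threshold.
    k = int(np.searchsorted(ratio, threshold)) + 1
    return min(k, len(ratio))


# Synthetic example: 10 observed genes driven by 2 hidden factors plus noise.
rng = np.random.default_rng(0)
hidden = rng.normal(size=(50, 2))
loadings = rng.normal(size=(2, 10))
expression = hidden @ loadings + 0.01 * rng.normal(size=(50, 10))
print(n_components_for_variance(expression))
```

    A cutoff-based rule like this is cheap because it needs no model fitting, whereas a BIC-based choice would require fitting the SSM once per candidate dimension.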

  3. Gene expression variability in human hepatic drug metabolizing enzymes and transporters.

    PubMed

    Yang, Lun; Price, Elvin T; Chang, Ching-Wei; Li, Yan; Huang, Ying; Guo, Li-Wu; Guo, Yongli; Kaput, Jim; Shi, Leming; Ning, Baitang

    2013-01-01

    Interindividual variability in the expression of drug-metabolizing enzymes and transporters (DMETs) in human liver may contribute to interindividual differences in drug efficacy and adverse reactions. Published studies that analyzed variability in the expression of DMET genes were limited by sample sizes and the number of genes profiled. We systematically analyzed the expression of 374 DMETs from a microarray data set consisting of gene expression profiles derived from 427 human liver samples. The standard deviation of interindividual expression for DMET genes was much higher than that for non-DMET genes. The 20 DMET genes with the largest variability in the expression provided examples of the interindividual variation. Gene expression data were also analyzed using network analysis methods, which delineate the similarities of biological functionalities and regulation mechanisms for these highly variable DMET genes. Expression variability of human hepatic DMET genes may affect drug-gene interactions and disease susceptibility, with concomitant clinical implications.

  4. The Mhc class II of the Black grouse (Tetrao tetrix) consists of low numbers of B and Y genes with variable diversity and expression.

    PubMed

    Strand, Tanja; Westerdahl, Helena; Höglund, Jacob; Alatalo, Rauno V; Siitari, Heli

    2007-09-01

    We found that the Black grouse (Tetrao tetrix) possess low numbers of Mhc class II B (BLB) and Y (YLB) genes with variable diversity and expression. We have therefore shown, for the first time, that another bird species (in this case, a wild lek-breeding galliform) shares several features of the simple Mhc of the domestic chicken (Gallus gallus). The Black grouse BLB genes showed the same level of polymorphism that has been reported in chicken, and we also found indications of balancing selection in the peptide-binding regions. The YLB genes were less variable than the BLB genes, also in accordance with earlier studies in chicken, although their functional significance still remains obscure. We hypothesize that the YLB genes could have been under purifying selection, just as the mammal Mhc-E gene cluster.

  5. Variability among Cucurbitaceae species (melon, cucumber and watermelon) in a genomic region containing a cluster of NBS-LRR genes.

    PubMed

    Morata, Jordi; Puigdomènech, Pere

    2017-02-08

    Cucurbitaceae species contain a significantly lower number of genes coding for proteins with similarity to plant resistance genes belonging to the NBS-LRR family than other plant species of similar genome size. A large proportion of these genes are organized in clusters that appear to be hotspots of variability. The genomes of the Cucurbitaceae species measured until now are intermediate in size (between 350 and 450 Mb) and they apparently have not undergone any genome duplications beside those at the origin of eudicots. The cluster containing the largest number of NBS-LRR genes has previously been analyzed in melon and related species and showed a high degree of interspecific and intraspecific variability. It was of interest to study whether similar behavior occurred in other clusters of the same family of genes. The cluster of NBS-LRR genes located in melon chromosome 9 was analyzed and compared with the syntenic regions in other cucurbit genomes. This is the second cluster in number within this species and it contains nine sequences with an NBS-LRR annotation including two genes, Fom1 and Prv, providing resistance against Fusarium and Papaya ring-spot virus (PRSV). The variability within the melon species appears to consist essentially of single nucleotide polymorphisms. Clusters of similar genes are present in the syntenic regions of the two species of Cucurbitaceae that were sequenced, cucumber and watermelon. Most of the genes in the syntenic clusters can be aligned between species and a hypothesis of generation of the cluster is proposed. The number of genes in the watermelon cluster is similar to that in melon while a higher number of genes (12) is present in cucumber, a species with a smaller genome than melon. After comparing genome resequencing data of 115 cucumber varieties, deletion of a group of genes is observed in a group of varieties of Indian origin.
    Clusters of genes coding for NBS-LRR proteins in cucurbits appear to have specific variability in different regions of the genome and between different species. This observation is in favour of considering that the adaptation of plant species to changing environments is based upon the variability that may occur at any location in the genome and that has been produced by specific mechanisms of sequence variation acting on plant genomes. This information could be useful both to understand the evolution of species and for plant breeding.

  6. Stochastic loss and gain of symmetric divisions in the C. elegans epidermis perturbs robustness of stem cell number

    PubMed Central

    Katsanos, Dimitris; Koneru, Sneha L.; Mestek Boukhibar, Lamia; Gritti, Nicola; Ghose, Ritobrata; Appleford, Peter J.; Doitsidou, Maria; Woollard, Alison; van Zon, Jeroen S.; Poole, Richard J.

    2017-01-01

    Biological systems are subject to inherent stochasticity. Nevertheless, development is remarkably robust, ensuring the consistency of key phenotypic traits such as correct cell numbers in a certain tissue. It is currently unclear which genes modulate phenotypic variability, what their relationship is to core components of developmental gene networks, and what is the developmental basis of variable phenotypes. Here, we start addressing these questions using the robust number of Caenorhabditis elegans epidermal stem cells, known as seam cells, as a readout. We employ genetics, cell lineage tracing, and single molecule imaging to show that mutations in lin-22, a Hes-related basic helix-loop-helix (bHLH) transcription factor, increase seam cell number variability. We show that the increase in phenotypic variability is due to stochastic conversion of normally symmetric cell divisions to asymmetric and vice versa during development, which affect the terminal seam cell number in opposing directions. We demonstrate that LIN-22 acts within the epidermal gene network to antagonise the Wnt signalling pathway. However, lin-22 mutants exhibit cell-to-cell variability in Wnt pathway activation, which correlates with and may drive phenotypic variability. Our study demonstrates the feasibility of studying phenotypic trait variance in tractable model organisms using unbiased mutagenesis screens. PMID:29108019

  7. Effect of promoter architecture on the cell-to-cell variability in gene expression.

    PubMed

    Sanchez, Alvaro; Garcia, Hernan G; Jones, Daniel; Phillips, Rob; Kondev, Jané

    2011-03-01

    According to recent experimental evidence, promoter architecture, defined by the number, strength and regulatory role of the operators that control transcription, plays a major role in determining the level of cell-to-cell variability in gene expression. These quantitative experiments call for a corresponding modeling effort that addresses the question of how changes in promoter architecture affect variability in gene expression in a systematic rather than case-by-case fashion. In this article we make such a systematic investigation, based on a microscopic model of gene regulation that incorporates stochastic effects. In particular, we show how operator strength and operator multiplicity affect this variability. We examine different modes of transcription factor binding to complex promoters (cooperative, independent, simultaneous) and how each of these affects the level of variability in transcriptional output from cell-to-cell. We propose that direct comparison between in vivo single-cell experiments and theoretical predictions for the moments of the probability distribution of mRNA number per cell can be used to test kinetic models of gene regulation. The emphasis of the discussion is on prokaryotic gene regulation, but our analysis can be extended to eukaryotic cells as well.

  8. Effect of Promoter Architecture on the Cell-to-Cell Variability in Gene Expression

    PubMed Central

    Sanchez, Alvaro; Garcia, Hernan G.; Jones, Daniel; Phillips, Rob; Kondev, Jané

    2011-01-01

    According to recent experimental evidence, promoter architecture, defined by the number, strength and regulatory role of the operators that control transcription, plays a major role in determining the level of cell-to-cell variability in gene expression. These quantitative experiments call for a corresponding modeling effort that addresses the question of how changes in promoter architecture affect variability in gene expression in a systematic rather than case-by-case fashion. In this article we make such a systematic investigation, based on a microscopic model of gene regulation that incorporates stochastic effects. In particular, we show how operator strength and operator multiplicity affect this variability. We examine different modes of transcription factor binding to complex promoters (cooperative, independent, simultaneous) and how each of these affects the level of variability in transcriptional output from cell-to-cell. We propose that direct comparison between in vivo single-cell experiments and theoretical predictions for the moments of the probability distribution of mRNA number per cell can be used to test kinetic models of gene regulation. The emphasis of the discussion is on prokaryotic gene regulation, but our analysis can be extended to eukaryotic cells as well. PMID:21390269

  9. Differences in AMY1 Gene Copy Numbers Derived from Blood, Buccal Cells and Saliva Using Quantitative and Droplet Digital PCR Methods: Flagging the Pitfall.

    PubMed

    Ooi, Delicia Shu Qin; Tan, Verena Ming Hui; Ong, Siong Gim; Chan, Yiong Huak; Heng, Chew Kiat; Lee, Yung Seng

    2017-01-01

    The human salivary amylase gene (AMY1), encoding salivary α-amylase, has variable copy number variants (CNVs) in the human genome. We aimed to determine if real-time quantitative polymerase chain reaction (qPCR) and the more recently available Droplet Digital PCR (ddPCR) can provide a precise quantification of the AMY1 gene copy number in blood, buccal cells and saliva samples derived from the same individual. Seven participants were recruited and DNA was extracted from the blood, buccal cells and saliva samples provided by each participant. Taqman assay real-time qPCR and ddPCR were conducted to quantify AMY1 gene copy numbers. Statistical analysis was carried out to determine the difference in AMY1 gene copy number between the different biological specimens and different assay methods. We found a significant within-individual difference (p<0.01) in AMY1 gene copy number between different biological samples as determined by qPCR. However, there was no significant within-individual difference in AMY1 gene copy number between different biological samples as determined by ddPCR. We also found that AMY1 gene copy numbers of blood samples were comparable between qPCR and ddPCR, while there was a significant difference (p<0.01) between AMY1 gene copy numbers measured by qPCR and ddPCR for both buccal swab and saliva samples. Although buccal cells and saliva samples are possible sources of DNA, it is pertinent that ddPCR or a single biological sample, preferably a blood sample, be used for determining highly polymorphic gene copy numbers like AMY1, due to the large within-individual variability between different biological samples if real-time qPCR is employed.

  10. Origins of extrinsic variability in eukaryotic gene expression

    NASA Astrophysics Data System (ADS)

    Volfson, Dmitri; Marciniak, Jennifer; Blake, William J.; Ostroff, Natalie; Tsimring, Lev S.; Hasty, Jeff

    2006-02-01

    Variable gene expression within a clonal population of cells has been implicated in a number of important processes including mutation and evolution, determination of cell fates and the development of genetic disease. Recent studies have demonstrated that a significant component of expression variability arises from extrinsic factors thought to influence multiple genes simultaneously, yet the biological origins of this extrinsic variability have received little attention. Here we combine computational modelling with fluorescence data generated from multiple promoter-gene inserts in Saccharomyces cerevisiae to identify two major sources of extrinsic variability. One unavoidable source arising from the coupling of gene expression with population dynamics leads to a ubiquitous lower limit for expression variability. A second source, which is modelled as originating from a common upstream transcription factor, exemplifies how regulatory networks can convert noise in upstream regulator expression into extrinsic noise at the output of a target gene. Our results highlight the importance of the interplay of gene regulatory networks with population heterogeneity for understanding the origins of cellular diversity.

  11. Origins of extrinsic variability in eukaryotic gene expression

    NASA Astrophysics Data System (ADS)

    Volfson, Dmitri; Marciniak, Jennifer; Blake, William J.; Ostroff, Natalie; Tsimring, Lev S.; Hasty, Jeff

    2006-03-01

    Variable gene expression within a clonal population of cells has been implicated in a number of important processes including mutation and evolution, determination of cell fates and the development of genetic disease. Recent studies have demonstrated that a significant component of expression variability arises from extrinsic factors thought to influence multiple genes in concert, yet the biological origins of this extrinsic variability have received little attention. Here we combine computational modeling with fluorescence data generated from multiple promoter-gene inserts in Saccharomyces cerevisiae to identify two major sources of extrinsic variability. One unavoidable source arising from the coupling of gene expression with population dynamics leads to a ubiquitous noise floor in expression variability. A second source which is modeled as originating from a common upstream transcription factor exemplifies how regulatory networks can convert noise in upstream regulator expression into extrinsic noise at the output of a target gene. Our results highlight the importance of the interplay of gene regulatory networks with population heterogeneity for understanding the origins of cellular diversity.

  12. A network of epigenetic modifiers and DNA repair genes controls tissue-specific copy number alteration preference.

    PubMed

    Cramer, Dina; Serrano, Luis; Schaefer, Martin H

    2016-11-10

    Copy number alterations (CNAs) in cancer patients show a large variability in their number, length and position, but the sources of this variability are not known. CNA number and length are linked to patient survival, suggesting clinical relevance. We have identified genes that tend to be mutated in samples that have few or many CNAs, which we term CONIM genes (COpy Number Instability Modulators). CONIM proteins cluster into a densely connected subnetwork of physical interactions and many of them are epigenetic modifiers. Therefore, we investigated how the epigenome of the tissue-of-origin influences the position of CNA breakpoints and the properties of the resulting CNAs. We found that the presence of heterochromatin in the tissue-of-origin contributes to the recurrence and length of CNAs in the respective cancer type.

  13. Unifying measures of gene function and evolution.

    PubMed

    Wolf, Yuri I; Carmel, Liran; Koonin, Eugene V

    2006-06-22

    Recent genome analyses revealed intriguing correlations between variables characterizing the functioning of a gene, such as expression level (EL), connectivity of genetic and protein-protein interaction networks, and knockout effect, and variables describing gene evolution, such as sequence evolution rate (ER) and propensity for gene loss. Typically, variables within each of these classes are positively correlated, e.g. products of highly expressed genes also have a propensity to be involved in many protein-protein interactions, whereas variables between classes are negatively correlated, e.g. highly expressed genes, on average, evolve slower than weakly expressed genes. Here, we describe principal component (PC) analysis of seven genome-related variables and propose biological interpretations for the first three PCs. The first PC reflects a gene's 'importance', or the 'status' of a gene in the genomic community, with positive contributions from knockout lethality, EL, number of protein-protein interaction partners and the number of paralogues, and negative contributions from sequence ER and gene loss propensity. The next two PCs define a plane that seems to reflect the functional and evolutionary plasticity of a gene. Specifically, PC2 can be interpreted as a gene's 'adaptability' whereby genes with high adaptability readily duplicate, have many genetic interaction partners and tend to be non-essential. PC3 also might reflect the role of a gene in organismal adaptation albeit with a negative rather than a positive contribution of genetic interactions; we provisionally designate this PC 'reactivity'. The interpretation of PC2 and PC3 as measures of a gene's plasticity is compatible with the observation that genes with high values of these PCs tend to be expressed in a condition- or tissue-specific manner. 
    Functional classes of genes substantially vary in status, adaptability and reactivity, with the highest status characteristic of the translation system and cytoskeletal proteins, highest adaptability seen in cellular processes and signalling genes, and top reactivity characteristic of metabolic enzymes.

  14. Promoter architecture dictates cell-to-cell variability in gene expression.

    PubMed

    Jones, Daniel L; Brewster, Robert C; Phillips, Rob

    2014-12-19

    Variability in gene expression among genetically identical cells has emerged as a central preoccupation in the study of gene regulation; however, a divide exists between the predictions of molecular models of prokaryotic transcriptional regulation and genome-wide experimental studies suggesting that this variability is indifferent to the underlying regulatory architecture. We constructed a set of promoters in Escherichia coli in which promoter strength, transcription factor binding strength, and transcription factor copy numbers are systematically varied, and used messenger RNA (mRNA) fluorescence in situ hybridization to observe how these changes affected variability in gene expression. Our parameter-free models predicted the observed variability; hence, the molecular details of transcription dictate variability in mRNA expression, and transcriptional noise is specifically tunable and thus represents an evolutionarily accessible phenotypic parameter. Copyright © 2014, American Association for the Advancement of Science.

  15. Gene regulation and noise reduction by coupling of stochastic processes

    NASA Astrophysics Data System (ADS)

    Ramos, Alexandre F.; Hornos, José Eduardo M.; Reinitz, John

    2015-02-01

    Here we characterize the low-noise regime of a stochastic model for a negative self-regulating binary gene. The model has two stochastic variables, the protein number and the state of the gene. Each state of the gene behaves as a protein source governed by a Poisson process. The coupling between the two gene states depends on protein number. This fact has a very important implication: There exist protein production regimes characterized by sub-Poissonian noise because of negative covariance between the two stochastic variables of the model. Hence the protein numbers obey a probability distribution that has a peak that is sharper than those of the two coupled Poisson processes that are combined to produce it. Biochemically, the noise reduction in protein number occurs when the switching of the genetic state is more rapid than protein synthesis or degradation. We consider the chemical reaction rates necessary for Poisson and sub-Poisson processes in prokaryotes and eucaryotes. Our results suggest that the coupling of multiple stochastic processes in a negative covariance regime might be a widespread mechanism for noise reduction.
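
    The term "sub-Poissonian" in the record above can be made concrete with the Fano factor (variance divided by mean), which equals 1 for a Poisson distribution and falls below 1 for sub-Poissonian noise. The sketch below only illustrates that definition. It does not implement the authors' self-regulating gene model, and the binomial distribution is used purely as a convenient stand-in for a sub-Poissonian distribution.

```python
import numpy as np


def fano_factor(counts):
    """Variance-to-mean ratio of molecule counts (1 for a Poisson process)."""
    counts = np.asarray(counts, dtype=float)
    return counts.var() / counts.mean()


rng = np.random.default_rng(0)

# Poisson-distributed protein numbers: Fano factor close to 1.
poisson_counts = rng.poisson(lam=50, size=100_000)

# Binomial counts with the same mean: variance n*p*(1-p) is below the
# mean n*p, so the Fano factor is 1 - p. Illustrative stand-in only.
binomial_counts = rng.binomial(n=100, p=0.5, size=100_000)

print(fano_factor(poisson_counts))   # close to 1
print(fano_factor(binomial_counts))  # close to 0.5, i.e. sub-Poissonian
```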

  16. Gene regulation and noise reduction by coupling of stochastic processes

    PubMed Central

    Ramos, Alexandre F.; Hornos, José Eduardo M.; Reinitz, John

    2015-01-01

    Here we characterize the low noise regime of a stochastic model for a negative self-regulating binary gene. The model has two stochastic variables, the protein number and the state of the gene. Each state of the gene behaves as a protein source governed by a Poisson process. The coupling between the two gene states depends on protein number. This fact has a very important implication: there exist protein production regimes characterized by sub-Poissonian noise because of negative covariance between the two stochastic variables of the model. Hence the protein numbers obey a probability distribution that has a peak that is sharper than those of the two coupled Poisson processes that are combined to produce it. Biochemically, the noise reduction in protein number occurs when the switching of genetic state is more rapid than protein synthesis or degradation. We consider the chemical reaction rates necessary for Poisson and sub-Poisson processes in prokaryotes and eucaryotes. Our results suggest that the coupling of multiple stochastic processes in a negative covariance regime might be a widespread mechanism for noise reduction. PMID:25768447

  17. Gene regulation and noise reduction by coupling of stochastic processes.

    PubMed

    Ramos, Alexandre F; Hornos, José Eduardo M; Reinitz, John

    2015-02-01

    Here we characterize the low-noise regime of a stochastic model for a negative self-regulating binary gene. The model has two stochastic variables, the protein number and the state of the gene. Each state of the gene behaves as a protein source governed by a Poisson process. The coupling between the two gene states depends on protein number. This fact has a very important implication: There exist protein production regimes characterized by sub-Poissonian noise because of negative covariance between the two stochastic variables of the model. Hence the protein numbers obey a probability distribution that has a peak that is sharper than those of the two coupled Poisson processes that are combined to produce it. Biochemically, the noise reduction in protein number occurs when the switching of the genetic state is more rapid than protein synthesis or degradation. We consider the chemical reaction rates necessary for Poisson and sub-Poisson processes in prokaryotes and eucaryotes. Our results suggest that the coupling of multiple stochastic processes in a negative covariance regime might be a widespread mechanism for noise reduction.

  18. Gene variants associated with antisocial behaviour: A latent variable approach

    PubMed Central

    Bentley, Mary Jane; Lin, Haiqun; Fernandez, Thomas V.; Lee, Maria; Yrigollen, Carolyn M.; Pakstis, Andrew J.; Katsovich, Liliya; Olds, David L.; Grigorenko, Elena L.; Leckman, James F.

    2013-01-01

    Objective: The aim of this study was to determine if a latent variable approach might be useful in identifying shared variance across genetic risk alleles that is associated with antisocial behaviour at age 15 years. Methods: Using a conventional latent variable approach, we derived an antisocial phenotype in 328 adolescents utilizing data from a 15-year follow-up of a randomized trial of a prenatal and infancy nurse-home visitation program in Elmira, New York. We then investigated, via a novel latent variable approach, 450 informative genetic polymorphisms in 71 genes previously associated with antisocial behaviour, drug use, affiliative behaviours, and stress response in 241 consenting individuals for whom DNA was available. Haplotype and Pathway analyses were also performed. Results: Eight single-nucleotide polymorphisms (SNPs) from 8 genes contributed to the latent genetic variable that in turn accounted for 16.0% of the variance within the latent antisocial phenotype. The number of risk alleles was linearly related to the latent antisocial variable scores. Haplotypes that included the putative risk alleles for all 8 genes were also associated with higher latent antisocial variable scores. In addition, 33 SNPs from 63 of the remaining genes were also significant when added to the final model. Many of these genes interact on a molecular level, forming molecular networks. The results support a role for genes related to dopamine, norepinephrine, serotonin, glutamate, opioid, and cholinergic signaling as well as stress response pathways in mediating susceptibility to antisocial behaviour. Conclusions: This preliminary study supports use of relevant behavioural indicators and latent variable approaches to study the potential “co-action” of gene variants associated with antisocial behaviour. It also underscores the cumulative relevance of common genetic variants for understanding the etiology of complex behaviour.
    If replicated in future studies, this approach may allow the identification of a ‘shared’ variance across genetic risk alleles associated with complex neuropsychiatric dimensional phenotypes using relatively small numbers of well-characterized research participants. PMID:23822756

  19. UGT2B17 and SULT1A1 gene copy number variation (CNV) detection by LabChip microfluidic technology.

    PubMed

    Gaedigk, Andrea; Gaedigk, Roger; Leeder, J Steven

    2010-05-01

    Gene copy number variations (CNVs) are increasingly recognized to play important roles in the expression of genes and hence on their respective enzymatic activities. This has been demonstrated for a number of drug metabolizing genes, such as UDP-glucuronosyltransferases 2B17 (UGT2B17) and sulfotransferase 1A1 (SULT1A1), which are subject to genetic heterogeneity, including CNV. Quantitative assays to assess gene copy number are therefore becoming an integral part of accurate genotype assessment and phenotype prediction. In this study, we evaluated a microfluidics-based system, the Bio-Rad Experion system, to determine the power and utility of this platform to detect UGT2B17 and SULT1A1 CNV in DNA samples derived from blood and tissue. UGT2B17 is known to present with 0, 1 or 2 and SULT1A1 with up to 5 gene copies. Distinct clustering (p<0.001) into copy number groups was achieved for both genes. DNA samples derived from blood exhibited less inter-run variability compared to DNA samples obtained from liver tissue. This variability may be caused by tissue-specific PCR inhibitors as it could be overcome by using DNA from another tissue, or after the DNA had undergone whole genome amplification. This method produced results comparable to those reported for other quantitative test platforms.

  20. Recursive regularization for inferring gene networks from time-course gene expression profiles

    PubMed Central

    Shimamura, Teppei; Imoto, Seiya; Yamaguchi, Rui; Fujita, André; Nagasaki, Masao; Miyano, Satoru

    2009-01-01

    Background: Inferring gene networks from time-course microarray experiments with a vector autoregressive (VAR) model is the process of identifying functional associations between genes through multivariate time series. This problem can be cast as a variable selection problem in statistics. One of the promising methods for variable selection is the elastic net proposed by Zou and Hastie (2005). However, while VAR modeling with the elastic net succeeds in increasing the number of true positives, it also increases the number of false positives. Results: By incorporating relative importance of the VAR coefficients into the elastic net, we propose a new class of regularization, called recursive elastic net, to increase the capability of the elastic net and estimate gene networks based on the VAR model. The recursive elastic net can reduce the number of false positives gradually by updating the importance. Numerical simulations and comparisons demonstrate that the proposed method succeeds in reducing the number of false positives drastically while keeping the high number of true positives in the network inference and achieves two or more times higher true discovery rate (the proportion of true positives among the selected edges) than the competing methods even when the number of time points is small. We also compared our method with various reverse-engineering algorithms on experimental data of MCF-7 breast cancer cells stimulated with two ErbB ligands, EGF and HRG. Conclusion: The recursive elastic net is a powerful tool for inferring gene networks from time-course gene expression profiles. PMID:19386091
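
    For orientation, the variable selection problem named in the record above can be written out for the standard (non-recursive) elastic net. This is a sketch under stated assumptions: a first-order VAR model is assumed (the abstract does not give the order), the symbols are introduced here for illustration, and the importance weights of the recursive variant are not shown because the abstract does not define them.

```latex
% Assumed first-order VAR model for the expression vector x_t of p genes:
%   x_t = A x_{t-1} + \varepsilon_t, \qquad t = 2, \dots, T
% Elastic net estimate of the i-th row a_i of A (regulators of gene i):
\hat{a}_i = \arg\min_{a_i \in \mathbb{R}^p}
  \sum_{t=2}^{T} \left( x_{i,t} - a_i^{\top} x_{t-1} \right)^2
  + \lambda_1 \lVert a_i \rVert_1
  + \lambda_2 \lVert a_i \rVert_2^2
% A nonzero entry \hat{a}_{ij} is read as an edge from gene j to gene i.
```

    The L1 term sets coefficients exactly to zero, which is what performs the variable selection, while the L2 term stabilizes the estimate when regulators are correlated.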

  1. Novel variable number of tandem repeats of gibbon MAOA gene and its evolutionary significance.

    PubMed

    Choi, Yuri; Jung, Yi-Deun; Ayarpadikannan, Selvam; Koga, Akihiko; Imai, Hiroo; Hirai, Hirohisa; Roos, Christian; Kim, Heui-Soo

    2014-08-01

    Variable number of tandem repeats (VNTRs) are scattered throughout the primate genome, and genetic variation in these VNTRs has accumulated during primate radiation. Here, we analyzed VNTRs upstream of the monoamine oxidase A (MAOA) gene in 11 different gibbon species. An abundance of truncated VNTR sequences and copy number differences were observed compared to those of human VNTR sequences. To better understand the biological role of these VNTRs, a luciferase activity assay was conducted, and the results indicated that selected VNTR sequences of the MAOA gene from human and three different gibbon species (Hylobates klossii, Hylobates lar, and Nomascus concolor) showed silencing ability. Together, these data could be useful for understanding the evolutionary history and functional significance of MAOA VNTR sequences in gibbon species.

  2. 6-mercaptopurine influences TPMT gene transcription in a TPMT gene promoter variable number of tandem repeats-dependent manner.

    PubMed

    Kotur, Nikola; Stankovic, Biljana; Kassela, Katerina; Georgitsi, Marianthi; Vicha, Anna; Leontari, Iliana; Dokmanovic, Lidija; Janic, Dragana; Krstovski, Nada; Klaassen, Kristel; Radmilovic, Milena; Stojiljkovic, Maja; Nikcevic, Gordana; Simeonidis, Argiris; Sivolapenko, Gregory; Pavlovic, Sonja; Patrinos, George P; Zukic, Branka

    2012-02-01

    TPMT activity is characterized by a trimodal distribution, namely low, intermediate and high methylator. TPMT gene promoter contains a variable number of GC-rich tandem repeats (VNTRs), namely A, B and C, ranging from three to nine repeats in length in an A(n)B(m)C architecture. We have previously shown that the VNTR architecture in the TPMT gene promoter affects TPMT gene transcription. MATERIALS, METHODS & RESULTS: Here we demonstrate, using reporter assays, that 6-mercaptopurine (6-MP) treatment results in a VNTR architecture-dependent decrease of TPMT gene transcription, mediated by the binding of newly recruited protein complexes to the TPMT gene promoter, upon 6-MP treatment. We also show that acute lymphoblastic leukemia patients undergoing 6-MP treatment display a VNTR architecture-dependent response to 6-MP. These data suggest that the TPMT gene promoter VNTR architecture can be potentially used as a pharmacogenomic marker to predict toxicity due to 6-MP treatment in acute lymphoblastic leukemia patients.

  3. Intratypic variability of a tandem repeat locus within the DNA polymerase gene of human herpes simplex virus type 2.

    PubMed

    Sun, Yongjiang; Chan, Roy Kum Wah; Tan, Suat Hoon

    2004-01-01

    In this study, the intratypic variability of a tandem repeat locus within the DNA polymerase (pol) gene of human herpes simplex virus type 2 (HSV2) was uncovered. The locus contained variable numbers of tandem dodecanucleotide (5'-GAC GAG GAC GGG-3') repetitive units. Our results showed that approximately 95% of analyzed HSV2 clinical isolates and the current GenBank HSV2 strains contained two copies of the repetitive units. From genital herpes specimens, three new HSV2 strains, which respectively contained 1, 3, and 4 copies of the repetitive units, were identified. This variable number of tandem repeat (VNTR) locus is absent in HSV1, and thus it also contributes to the intertypic variability of HSV1 and HSV2. The intratypic variability of the locus may be useful for HSV2 strain genotyping and this application is discussed.
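
    Counting copies of the dodecanucleotide unit described above can be sketched as follows. This is only an illustration that assumes exact, uninterrupted repeats; the function name and the flanking bases in the example are hypothetical, and the abstract does not say how the authors determined copy number.

```python
MOTIF = "GACGAGGACGGG"  # repeat unit from the abstract, written without spaces


def max_tandem_copies(sequence, motif=MOTIF):
    """Return the largest number of back-to-back exact copies of motif."""
    sequence = sequence.upper()
    best = 0
    start = sequence.find(motif)
    while start != -1:
        copies = 0
        pos = start
        # Extend the run for as long as another full unit follows directly.
        while sequence.startswith(motif, pos):
            copies += 1
            pos += len(motif)
        best = max(best, copies)
        start = sequence.find(motif, start + 1)
    return best


# Hypothetical fragment: three units between made-up flanking bases.
fragment = "ATGC" + MOTIF * 3 + "TTAA"
print(max_tandem_copies(fragment))
```

    Real loci may contain imperfect units, so a practical genotyping tool would likely need to tolerate mismatches; this sketch does not.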

  4. Variable Copy Number, Intra-Genomic Heterogeneities and Lateral Transfers of the 16S rRNA Gene in Pseudomonas

    PubMed Central

    Bodilis, Josselin; Nsigue-Meilo, Sandrine; Besaury, Ludovic; Quillet, Laurent

    2012-01-01

    Even though the 16S rRNA gene is the most commonly used taxonomic marker in microbial ecology, its poor resolution is still not fully understood at the intra-genus level. In this work, the number of rRNA gene operons, intra-genomic heterogeneities and lateral transfers were investigated at a fine-scale resolution, throughout the Pseudomonas genus. In addition to nineteen sequenced Pseudomonas strains, we determined the 16S rRNA copy number in four other Pseudomonas strains by Southern hybridization and Pulsed-Field Gel Electrophoresis, and studied the intra-genomic heterogeneities by Denaturing Gradient Gel Electrophoresis and sequencing. Although the variable copy number (from four to seven) seems to be correlated with the evolutionary distance, some close strains in the P. fluorescens lineage showed a different number of 16S rRNA genes, whereas all the strains in the P. aeruginosa lineage displayed the same number of genes (four copies). Further study of the intra-genomic heterogeneities revealed that most of the Pseudomonas strains (15 out of 19 strains) had at least two different 16S rRNA alleles. A great difference (5 or 19 nucleotides, essentially grouped near the V1 hypervariable region) was observed only in two sequenced strains. In one of our strains studied (MFY30 strain), we found a difference of 12 nucleotides (grouped in the V3 hypervariable region) between copies of the 16S rRNA gene. Finally, occurrence of partial lateral transfers of the 16S rRNA gene was further investigated in 1803 full-length sequences of Pseudomonas available in the databases. Remarkably, we found that the two most variable regions (the V1 and V3 hypervariable regions) had probably been laterally transferred from another evolutionary distant Pseudomonas strain for at least 48.3 and 41.6% of the 16S rRNA sequences, respectively. In conclusion, we strongly recommend removing these regions of the 16S rRNA gene during the intra-genus diversity studies. PMID:22545126

  5. Integrative analysis of gene expression and copy number alterations using canonical correlation analysis.

    PubMed

    Soneson, Charlotte; Lilljebjörn, Henrik; Fioretos, Thoas; Fontes, Magnus

    2010-04-15

    With the rapid development of new genetic measurement methods, several types of genetic alterations can be quantified in a high-throughput manner. While the initial focus has been on investigating each data set separately, there is an increasing interest in studying the correlation structure between two or more data sets. Multivariate methods based on Canonical Correlation Analysis (CCA) have been proposed for integrating paired genetic data sets. The high dimensionality of microarray data imposes computational difficulties, which have been addressed for instance by studying the covariance structure of the data, or by reducing the number of variables prior to applying the CCA. In this work, we propose a new method for analyzing high-dimensional paired genetic data sets, which mainly emphasizes the correlation structure and still permits efficient application to very large data sets. The method is implemented by translating a regularized CCA to its dual form, where the computational complexity depends mainly on the number of samples instead of the number of variables. The optimal regularization parameters are chosen by cross-validation. We apply the regularized dual CCA, as well as a classical CCA preceded by a dimension-reducing Principal Components Analysis (PCA), to a paired data set of gene expression changes and copy number alterations in leukemia. Using the correlation-maximizing methods, regularized dual CCA and PCA+CCA, we show that without pre-selection of known disease-relevant genes, and without using information about clinical class membership, an exploratory analysis singles out two patient groups, corresponding to well-known leukemia subtypes. Furthermore, the variables showing the highest relevance to the extracted features agree with previous biological knowledge concerning copy number alterations and gene expression changes in these subtypes. 
    Finally, the correlation-maximizing methods are shown to yield results which are more biologically interpretable than those resulting from a covariance-maximizing method, and provide different insight compared to when each variable set is studied separately using PCA. We conclude that regularized dual CCA as well as PCA+CCA are useful methods for exploratory analysis of paired genetic data sets, and can be efficiently implemented also when the number of variables is very large.

  6. Association between the dopamine D4 receptor gene exon III variable number of tandem repeats and political attitudes in female Han Chinese

    PubMed Central

    Ebstein, Richard P.; Monakhov, Mikhail V.; Lu, Yunfeng; Jiang, Yushi; Lai, Poh San; Chew, Soo Hong

    2015-01-01

    Twin and family studies suggest that political attitudes are partially determined by an individual's genotype. The dopamine D4 receptor gene (DRD4) exon III repeat region that has been extensively studied in connection with human behaviour, is a plausible candidate to contribute to individual differences in political attitudes. A first United States study provisionally identified this gene with political attitude along a liberal–conservative axis albeit contingent upon number of friends. In a large sample of 1771 Han Chinese university students in Singapore, we observed a significant main effect of association between the DRD4 exon III variable number of tandem repeats and political attitude. Subjects with two copies of the 4-repeat allele (4R/4R) were significantly more conservative. Our results provided evidence for a role of the DRD4 gene variants in contributing to individual differences in political attitude particularly in females and more generally suggested that associations between individual genes, and neurochemical pathways, contributing to traits relevant to the social sciences can be provisionally identified. PMID:26246555

  7. Association between the dopamine D4 receptor gene exon III variable number of tandem repeats and political attitudes in female Han Chinese.

    PubMed

    Ebstein, Richard P; Monakhov, Mikhail V; Lu, Yunfeng; Jiang, Yushi; Lai, Poh San; Chew, Soo Hong

    2015-08-22

    Twin and family studies suggest that political attitudes are partially determined by an individual's genotype. The dopamine D4 receptor gene (DRD4) exon III repeat region that has been extensively studied in connection with human behaviour, is a plausible candidate to contribute to individual differences in political attitudes. A first United States study provisionally identified this gene with political attitude along a liberal-conservative axis albeit contingent upon number of friends. In a large sample of 1771 Han Chinese university students in Singapore, we observed a significant main effect of association between the DRD4 exon III variable number of tandem repeats and political attitude. Subjects with two copies of the 4-repeat allele (4R/4R) were significantly more conservative. Our results provided evidence for a role of the DRD4 gene variants in contributing to individual differences in political attitude particularly in females and more generally suggested that associations between individual genes, and neurochemical pathways, contributing to traits relevant to the social sciences can be provisionally identified. © 2015 The Author(s).

  8. Improved Sparse Multi-Class SVM and Its Application for Gene Selection in Cancer Classification

    PubMed Central

    Huang, Lingkang; Zhang, Hao Helen; Zeng, Zhao-Bang; Bushel, Pierre R.

    2013-01-01

    Background: Microarray techniques provide promising tools for cancer diagnosis using gene expression profiles. However, molecular diagnosis based on high-throughput platforms presents great challenges due to the overwhelming number of variables versus the small sample size and the complex nature of multi-type tumors. Support vector machines (SVMs) have shown superior performance in cancer classification due to their ability to handle high dimensional low sample size data. The multi-class SVM algorithm of Crammer and Singer provides a natural framework for multi-class learning. Despite its effective performance, the procedure utilizes all variables without selection. In this paper, we propose to improve the procedure by imposing shrinkage penalties in learning to enforce solution sparsity. Results: The original multi-class SVM of Crammer and Singer is effective for multi-class classification but does not conduct variable selection. We improved the method by introducing soft-thresholding type penalties to incorporate variable selection into multi-class classification for high dimensional data. The new methods were applied to simulated data and two cancer gene expression data sets. The results demonstrate that the new methods can select a small number of genes for building accurate multi-class classification rules. Furthermore, the important genes selected by the methods overlap significantly, suggesting general agreement among different variable selection schemes. Conclusions: High accuracy and sparsity make the new methods attractive for cancer diagnostics with gene expression data and defining targets of therapeutic intervention. Availability: The source MATLAB code is available from http://math.arizona.edu/~hzhang/software.html. PMID:23966761

  9. Gene and Chromosomal Copy Number Variations as an Adaptive Mechanism Towards a Parasitic Lifestyle in Trypanosomatids.

    PubMed

    Reis-Cunha, João Luís; Valdivia, Hugo O; Bartholomeu, Daniella Castanheira

    2018-02-01

    Trypanosomatids are a group of kinetoplastid parasites including some of great public health importance, causing debilitating and life-long lasting diseases that affect more than 24 million people worldwide. Among the trypanosomatids, Trypanosoma cruzi, Trypanosoma brucei and species from the Leishmania genus are the most well studied parasites, due to their high prevalence in human infections. These parasites have an extreme genomic and phenotypic variability, with a massive expansion in the copy number of species-specific multigene families enrolled in host-parasite interactions that mediate cellular invasion and immune evasion processes. As most trypanosomatids are heteroxenous, and therefore their lifecycles involve the transition between different hosts, these parasites have developed several strategies to ensure a rapid adaptation to changing environments. Among these strategies, a rapid shift in the repertoire of expressed genes, genetic variability and genome plasticity are key mechanisms. Trypanosomatid genomes are organized into large directional gene clusters that are transcribed polycistronically, where genes derived from the same polycistron may have very distinct mRNA levels. This particular mode of transcription implies that the control of gene expression operates mainly at post-transcriptional level. In this sense, gene duplications/losses were already associated with changes in mRNA levels in these parasites. Gene duplications also allow the generation of sequence variability, as the newly formed copy can diverge without loss of function of the original copy. Recently, aneuploidies have been shown to occur in several Leishmania species and T. cruzi strains. Although aneuploidies are usually associated with debilitating phenotypes in superior eukaryotes, recent data shows that it could also provide increased fitness in stress conditions and generate drug resistance in unicellular eukaryotes. 
    In this review, we will focus on gene and chromosomal copy number variations and their relevance to the evolution of trypanosomatid parasites.

  10. A Legionella pneumophila collagen-like protein encoded by a gene with a variable number of tandem repeats is involved in the adherence and invasion of host cells.

    PubMed

    Vandersmissen, Liesbeth; De Buck, Emmy; Saels, Veerle; Coil, David A; Anné, Jozef

    2010-05-01

    Legionella pneumophila is a Gram-negative, facultative intracellular pathogen and the causative agent of Legionnaires' disease, a severe pneumonia in humans. Analysis of the Legionella sequenced genomes revealed a gene with a variable number of tandem repeats (VNTRs), whose number varies between strains. We examined the strain distribution of this gene among a collection of 108 clinical, environmental and hot spring serotype I strains. Twelve variants were identified, but no correlation was observed between the number of repeat units and clinical and environmental strains. The encoded protein contains the C-terminal consensus motif of outer membrane proteins and has a large region of collagen-like repeats that is encoded by the VNTR region. We have therefore annotated this protein Lcl for Legionella collagen-like protein. Lcl was shown to contribute to the adherence and invasion of host cells and it was demonstrated that the number of repeat units present in lcl had an influence on these adhesion characteristics.

  11. Variability of CAG tandem repeats in exon 1 of the androgen receptor gene is not related with dog intersexuality.

    PubMed

    Nowacka-Woszuk, J; Switonski, M

    2010-02-01

    Numerous mutations of the human androgen receptor (AR) gene cause an intersexual phenotype, called the androgen insensitivity syndrome. The intersexual phenotype is also quite often diagnosed in dogs. The aim of this study was to conduct a comparative analysis of the entire coding sequence (eight exons) of the AR gene in healthy and four intersex dogs, as well as in three other canids (the red fox, arctic fox and Chinese raccoon dog). The coding sequence of the studied species appeared to be conserved (similarity above 97%) and polymorphism was found in exon 1 only. Altogether, 2 SNPs were identified in healthy dogs, 14 in red foxes, 16 in arctic foxes and 6 were found in Chinese raccoon dogs, respectively. Moreover, a variable number of tandem repeats (CAG and CAA), encoding an array of glutamines, was also observed in this exon. The CAA codon numbers were invariable within species, but the CAG repeats were polymorphic. The highest number of the CAG and CAA repeats was found in dogs (from 40 to 42) and the observed variability was similar in intersex and healthy dogs. In the other canids the variability fell within the following ranges: 29-37 (red fox), 37-39 (arctic fox) and 29-32 (Chinese raccoon dog). In addition, a polymorphic microsatellite marker in intron 2 was found in the dog, red fox and Chinese raccoon dog. It was concluded that the polymorphism level of the AR gene in the dog was lower than in the other canids and none of the detected polymorphisms, including variability of the CAG tandem repeats, could be related with the intersexual phenotype of the studied dogs.

  12. Unveiling the pan-genome of the SXT/R391 family of ICEs: molecular characterisation of new variable regions of SXT/R391-like ICEs detected in Pseudoalteromonas sp. and Vibrio scophthalmi.

    PubMed

    Rodríguez-Blanco, Arturo; Lemos, Manuel L; Osorio, Carlos R

    2016-08-01

    Integrating conjugative elements (ICEs) of the SXT/R391 family have been identified in fish-isolated bacterial strains collected from marine aquaculture environments of the northwestern Iberian Peninsula. Here we analysed the variable regions of two ICEs, one preliminarily characterised in a previous study (ICEVscSpa3) and one newly identified (ICEPspSpa1). Bacterial strains harboring these ICEs were phylogenetically assigned to Vibrio scophthalmi and Pseudoalteromonas sp., thus constituting the first evidence of SXT/R391-like ICEs in the genus Pseudoalteromonas to date. Variable DNA regions, which confer element-specific properties to ICEs of this family, were characterised. Interestingly, the two ICEs contained 29 genes not found in variable DNA insertions of previously described ICEs. Most notably, variable gene content for ICEVscSpa3 showed similarity to genes potentially involved in housekeeping functions of replication, nucleotide metabolism and transcription. For these genes, closest homologues were found clustered in the genome of Pseudomonas psychrotolerans L19, suggesting a transfer as a block to ICEVscSpa3. Genes encoding antibiotic resistance, restriction modification systems and toxin/antitoxin systems were absent from hotspots of ICEVscSpa3. In contrast, the variable gene content of ICEPspSpa1 included genes involved in restriction/modification functions in two different hotspots and genes related to ICE maintenance. The present study unveils a relatively large number of novel genes in SXT/R391-ICEs, and demonstrates the major role of ICE elements as contributors to horizontal gene transfer.

  13. Robust Learning of High-dimensional Biological Networks with Bayesian Networks

    NASA Astrophysics Data System (ADS)

    Nägele, Andreas; Dejori, Mathäus; Stetter, Martin

    Structure learning of Bayesian networks applied to gene expression data has become a potentially useful method to estimate interactions between genes. However, the NP-hardness of Bayesian network structure learning renders the reconstruction of the full genetic network with thousands of genes unfeasible. Consequently, the maximal network size is usually restricted dramatically to a small set of genes (corresponding with variables in the Bayesian network). Although this feature reduction step makes structure learning computationally tractable, on the downside, the learned structure might be adversely affected due to the introduction of missing genes. Additionally, gene expression data are usually very sparse with respect to the number of samples, i.e., the number of genes is much greater than the number of different observations. Given these problems, learning robust network features from microarray data is a challenging task. This chapter presents several approaches tackling the robustness issue in order to obtain a more reliable estimation of learned network features.

  14. Sources of Variance in Baseline Gene Expression in the Rodent Liver

    PubMed Central

    Corton, J. Christopher; Bushel, Pierre R.; Fostel, Jennifer; O'Lone, Raegan B.

    2012-01-01

    The use of gene expression profiling in both clinical and laboratory settings would be enhanced by better characterization of variation due to individual, environmental, and technical factors. Analysis of microarray data from untreated or vehicle-treated animals within the control arm of toxicogenomics studies has yielded useful information on baseline fluctuations in liver gene expression in the rodent. Here, studies which highlight contributions of different factors to gene expression variability in the rodent liver are discussed including a large meta-analysis of rat liver, which identified genes that vary in control animals in the absence of chemical treatment. Genes and their pathways that are the most and least variable were identified in a number of these studies. Life stage, fasting, sex, diet, circadian rhythm and liver lobe source can profoundly influence gene expression in the liver. Recognition of biological and technical factors that contribute to variability of background gene expression can help the investigator in the design of an experiment that maximizes sensitivity and reduces the influence of confounders that may lead to misinterpretation of genomic changes. The factors that contribute to variability in liver gene expression in rodents are likely analogous to those contributing to human interindividual variability in drug response and chemical toxicity. Identification of batteries of genes that are altered in a variety of background conditions could be used to predict responses to drugs and chemicals in appropriate models of the human liver. PMID:22230429

  15. Extensive Gains and Losses of Olfactory Receptor Genes in Mammalian Evolution

    PubMed Central

    Niimura, Yoshihito; Nei, Masatoshi

    2007-01-01

    Odor perception in mammals is mediated by a large multigene family of olfactory receptor (OR) genes. The number of OR genes varies extensively among different species of mammals, and most species have a substantial number of pseudogenes. To gain some insight into the evolutionary dynamics of mammalian OR genes, we identified the entire set of OR genes in platypuses, opossums, cows, dogs, rats, and macaques and studied the evolutionary change of the genes together with those of humans and mice. We found that platypuses and primates have <400 functional OR genes while the other species have 800–1,200 functional OR genes. We then estimated the numbers of gains and losses of OR genes for each branch of the phylogenetic tree of mammals. This analysis showed that (i) gene expansion occurred in the placental lineage each time after it diverged from monotremes and from marsupials and (ii) hundreds of gains and losses of OR genes have occurred in an order-specific manner, making the gene repertoires highly variable among different orders. It appears that the number of OR genes is determined primarily by the functional requirement for each species, but once the number reaches the required level, it fluctuates by random duplication and deletion of genes. This fluctuation seems to have been aided by the stochastic nature of OR gene expression. PMID:17684554

  16. Sylvatic plague reduces genetic variability in black-tailed prairie dogs.

    PubMed

    Trudeau, Kristie M; Britten, Hugh B; Restani, Marco

    2004-04-01

    Small, isolated populations are vulnerable to loss of genetic diversity through inbreeding and genetic drift. Sylvatic plague due to infection by the bacterium Yersinia pestis caused an epizootic in the early 1990s resulting in declines and extirpations of many black-tailed prairie dog (Cynomys ludovicianus) colonies in north-central Montana, USA. Plague-induced population bottlenecks may contribute to significant reductions in genetic variability. In contrast, gene flow maintains genetic variability within colonies. We investigated the impacts of the plague epizootic and distance to nearest colony on levels of genetic variability in six prairie dog colonies sampled between June 1999 and July 2001 using 24 variable randomly amplified polymorphic DNA (RAPD) markers. Number of effective alleles per locus (n(e)) and gene diversity (h) were significantly decreased in the three colonies affected by plague that were recovering from the resulting bottlenecks compared with the three colonies that did not experience plague. Genetic variability was not significantly affected by geographic distance between colonies. The majority of variance in gene frequencies was found within prairie dog colonies. Conservation of genetic variability in black-tailed prairie dogs will require the preservation of both large and small colony complexes and the gene flow among them.
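
    The two summary statistics named in this abstract have standard textbook definitions for a single locus: the effective number of alleles, n(e) = 1 / sum(p_i^2), and Nei's gene diversity, h = 1 - sum(p_i^2), where p_i are allele frequencies. The sketch below applies those formulas to made-up frequencies. It takes the frequencies as given; how the authors estimated them from dominant RAPD bands is not stated in the abstract, and the function names are assumptions.

```python
def gene_diversity(freqs):
    """Nei's gene diversity h = 1 - sum(p_i^2) for one locus."""
    return 1.0 - sum(p * p for p in freqs)


def effective_alleles(freqs):
    """Effective number of alleles n_e = 1 / sum(p_i^2) for one locus."""
    return 1.0 / sum(p * p for p in freqs)


# Hypothetical two-allele locus: balanced versus skewed frequencies.
balanced = [0.5, 0.5]
skewed = [0.9, 0.1]

for name, freqs in (("balanced", balanced), ("skewed", skewed)):
    print(name, gene_diversity(freqs), effective_alleles(freqs))
```

    Both statistics fall as one allele comes to dominate, which is the kind of reduction a population bottleneck would be expected to produce.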

  17. A single determinant dominates the rate of yeast protein evolution.

    PubMed

    Drummond, D Allan; Raval, Alpan; Wilke, Claus O

    2006-02-01

    A gene's rate of sequence evolution is among the most fundamental evolutionary quantities in common use, but what determines evolutionary rates has remained unclear. Here, we carry out the first combined analysis of seven predictors (gene expression level, dispensability, protein abundance, codon adaptation index, gene length, number of protein-protein interactions, and the gene's centrality in the interaction network) previously reported to have independent influences on protein evolutionary rates. Strikingly, our analysis reveals a single dominant variable linked to the number of translation events which explains 40-fold more variation in evolutionary rate than any other, suggesting that protein evolutionary rate has a single major determinant among the seven predictors. The dominant variable explains nearly half the variation in the rate of synonymous and protein evolution. We show that the two most commonly used methods to disentangle the determinants of evolutionary rate, partial correlation analysis and ordinary multivariate regression, produce misleading or spurious results when applied to noisy biological data. We overcome these difficulties by employing principal component regression, a multivariate regression of evolutionary rate against the principal components of the predictor variables. Our results support the hypothesis that translational selection governs the rate of synonymous and protein sequence evolution in yeast.

  18. Using variable rate models to identify genes under selection in sequence pairs: their validity and limitations for EST sequences.

    PubMed

    Church, Sheri A; Livingstone, Kevin; Lai, Zhao; Kozik, Alexander; Knapp, Steven J; Michelmore, Richard W; Rieseberg, Loren H

    2007-02-01

    Using likelihood-based variable selection models, we determined if positive selection was acting on 523 EST sequence pairs from two lineages of sunflower and lettuce. Variable rate models are generally not used for comparisons of sequence pairs due to the limited information and the inaccuracy of estimates of specific substitution rates. However, previous studies have shown that the likelihood ratio test (LRT) is reliable for detecting positive selection, even with low numbers of sequences. These analyses identified 56 genes that show a signature of selection, of which 75% were not identified by simpler models that average selection across codons. Subsequent mapping studies in sunflower show four of five of the positively selected genes identified by these methods mapped to domestication QTLs. We discuss the validity and limitations of using variable rate models for comparisons of sequence pairs, as well as the limitations of using ESTs for identification of positively selected genes.

  19. Comparative genomics of wild type yeast strains unveils important genome diversity

    PubMed Central

    Carreto, Laura; Eiriz, Maria F; Gomes, Ana C; Pereira, Patrícia M; Schuller, Dorit; Santos, Manuel AS

    2008-01-01

    Background: Genome variability generates phenotypic heterogeneity and is of relevance for adaptation to environmental change, but the extent of such variability in natural populations is still poorly understood. For example, selected Saccharomyces cerevisiae strains are variable at the ploidy level, have gene amplifications, changes in chromosome copy number, and gross chromosomal rearrangements. This suggests that genome plasticity provides important genetic diversity upon which natural selection mechanisms can operate. Results: In this study, we have used wild-type S. cerevisiae (yeast) strains to investigate genome variation in natural and artificial environments. We have used comparative genome hybridization on array (aCGH) to characterize the genome variability of 16 yeast strains, of laboratory and commercial origin, isolated from vineyards and wine cellars, and from opportunistic human infections. Interestingly, sub-telomeric instability was associated with the clinical phenotype, while Ty element insertion regions determined genomic differences of natural wine fermentation strains. Copy number depletion of ASP3 and YRF1 genes was found in all wild-type strains. Other gene families involved in transmembrane transport, sugar and alcohol metabolism or drug resistance had copy number changes, which also distinguished wine from clinical isolates. Conclusion: We have isolated and genotyped more than 1000 yeast strains from natural environments and carried out an aCGH analysis of 16 strains representative of distinct genotype clusters. Important genomic variability was identified between these strains, in particular in sub-telomeric regions and in Ty-element insertion sites, suggesting that this type of genome variability is the main source of genetic diversity in natural populations of yeast.
    The data highlights the usefulness of yeast as a model system to unravel intraspecific natural genome diversity and to elucidate how natural selection shapes the yeast genome. PMID:18983662

  20. Massively parallel rRNA gene sequencing exacerbates the potential for biased community diversity comparisons due to variable library sizes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gihring, Thomas; Green, Stefan; Schadt, Christopher Warren

    2011-01-01

    Technologies for massively parallel sequencing are revolutionizing microbial ecology and are vastly increasing the scale of ribosomal RNA (rRNA) gene studies. Although pyrosequencing has increased the breadth and depth of possible rRNA gene sampling, one drawback is that the number of reads obtained per sample is difficult to control. Pyrosequencing libraries typically vary widely in the number of sequences per sample, even within individual studies, and there is a need to revisit the behaviour of richness estimators and diversity indices with variable gene sequence library sizes. Multiple reports and review papers have demonstrated the bias in non-parametric richness estimators (e.g. Chao1 and ACE) and diversity indices when using clone libraries. However, we found that biased community comparisons are accumulating in the literature. Here we demonstrate the effects of sample size on Chao1, ACE, CatchAll, Shannon, Chao-Shen and Simpson's estimations specifically using pyrosequencing libraries. The need to equalize the number of reads being compared across libraries is reiterated, and investigators are directed towards available tools for making unbiased diversity comparisons.
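
    The effect of library size on a richness estimate can be sketched in a few lines. This is a minimal illustration, not the authors' workflow: it assumes the commonly used bias-corrected Chao1 form, S_obs + F1(F1 - 1) / (2(F2 + 1)), where F1 and F2 are the numbers of singleton and doubleton taxa. The function names `chao1` and `rarefy` and the toy libraries are hypothetical.

```python
import random
from collections import Counter


def chao1(reads):
    """Bias-corrected Chao1 richness for a list of taxon labels (one label per read)."""
    counts = Counter(reads)
    s_obs = len(counts)                                # taxa observed at least once
    f1 = sum(1 for c in counts.values() if c == 1)     # singletons
    f2 = sum(1 for c in counts.values() if c == 2)     # doubletons
    return s_obs + f1 * (f1 - 1) / (2 * (f2 + 1))


def rarefy(reads, depth, seed=0):
    """Subsample a library to a fixed number of reads, without replacement."""
    return random.Random(seed).sample(reads, depth)


# Hypothetical libraries of very different sizes (not data from the study).
small = ["a", "a", "b", "c"]
large = ["a"] * 50 + ["b"] * 20 + ["c"] * 5 + ["d", "d", "e", "f"]

# Compare the libraries only after equalizing the number of reads.
depth = min(len(small), len(large))
equalized = rarefy(large, depth)
print(chao1(small), chao1(large), chao1(equalized))
```

    Subsampling every library to the depth of the smallest one is only one way to equalize read numbers; the abstract does not say which tools the authors recommend.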

  1. DNA copy number changes define spatial patterns of heterogeneity in colorectal cancer

    PubMed Central

    Mamlouk, Soulafa; Childs, Liam Harold; Aust, Daniela; Heim, Daniel; Melching, Friederike; Oliveira, Cristiano; Wolf, Thomas; Durek, Pawel; Schumacher, Dirk; Bläker, Hendrik; von Winterfeld, Moritz; Gastl, Bastian; Möhr, Kerstin; Menne, Andrea; Zeugner, Silke; Redmer, Torben; Lenze, Dido; Tierling, Sascha; Möbs, Markus; Weichert, Wilko; Folprecht, Gunnar; Blanc, Eric; Beule, Dieter; Schäfer, Reinhold; Morkel, Markus; Klauschen, Frederick; Leser, Ulf; Sers, Christine

    2017-01-01

    Genetic heterogeneity between and within tumours is a major factor determining cancer progression and therapy response. Here we examined DNA sequence and DNA copy-number heterogeneity in colorectal cancer (CRC) by targeted high-depth sequencing of 100 most frequently altered genes. In 97 samples, with primary tumours and matched metastases from 27 patients, we observe inter-tumour concordance for coding mutations; in contrast, gene copy numbers are highly discordant between primary tumours and metastases as validated by fluorescent in situ hybridization. To further investigate intra-tumour heterogeneity, we dissected a single tumour into 68 spatially defined samples and sequenced them separately. We identify evenly distributed coding mutations in APC and TP53 in all tumour areas, yet highly variable gene copy numbers in numerous genes. 3D morpho-molecular reconstruction reveals two clusters with divergent copy number aberrations along the proximal–distal axis indicating that DNA copy number variations are a major source of tumour heterogeneity in CRC. PMID:28120820

  2. Exome sequence analysis suggests genetic burden contributes to phenotypic variability and complex neuropathy

    PubMed Central

    Gonzaga-Jauregui, Claudia; Harel, Tamar; Gambin, Tomasz; Kousi, Maria; Griffin, Laurie B.; Francescatto, Ludmila; Ozes, Burcak; Karaca, Ender; Jhangiani, Shalini; Bainbridge, Matthew N.; Lawson, Kim S.; Pehlivan, Davut; Okamoto, Yuji; Withers, Marjorie; Mancias, Pedro; Slavotinek, Anne; Reitnauer, Pamela J; Goksungur, Meryem T.; Shy, Michael; Crawford, Thomas O.; Koenig, Michel; Willer, Jason; Flores, Brittany N.; Pediaditrakis, Igor; Us, Onder; Wiszniewski, Wojciech; Parman, Yesim; Antonellis, Anthony; Muzny, Donna M.; Katsanis, Nicholas; Battaloglu, Esra; Boerwinkle, Eric; Gibbs, Richard A.; Lupski, James R.

    2015-01-01

    Charcot-Marie-Tooth (CMT) disease is a clinically and genetically heterogeneous distal symmetric polyneuropathy. Whole-exome sequencing (WES) of 40 individuals from 37 unrelated families with CMT-like peripheral neuropathy refractory to molecular diagnosis identified apparent causal mutations in ~45% (17/37) of families. Three candidate disease genes are proposed, supported by a combination of genetic and in vivo studies. Aggregate analysis of mutation data revealed a significantly increased number of rare variants across 58 neuropathy associated genes in subjects versus controls; confirmed in a second ethnically discrete neuropathy cohort, suggesting mutation burden potentially contributes to phenotypic variability. Neuropathy genes shown to have highly penetrant Mendelizing variants (HMPVs) and implicated by burden in families were shown to interact genetically in a zebrafish assay exacerbating the phenotype established by the suppression of single genes. Our findings suggest that the combinatorial effect of rare variants contributes to disease burden and variable expressivity. PMID:26257172

  3. Relationship between the rs1414334 C/G polymorphism in the HTR2C gene and smoking in patients treated with atypical antipsychotics.

    PubMed

    Rico-Gomis, José María; Palazón-Bru, Antonio; Triano-García, Irene; Mahecha-García, Luis Fabián; García-Monsalve, Ana; Navarro-Ruiz, Andrés; Villagordo-Peñalver, Berta; Martínez-Hortelano, Alicia; Gil-Guillén, Vicente Francisco

    2018-04-15

    An association has been found between the C allele of the rs1414334 polymorphism in the HTR2C gene and the metabolic syndrome in psychiatric patients. However, no study has yet evaluated whether this allele is associated with smoking. To assess this issue, therefore, we performed a cross-sectional study with a sample of 166 adult patients treated with atypical antipsychotics in 2012-2013 in a region of Spain. The primary variable was the presence of the C allele of the rs1414334 polymorphism in the HTR2C gene. Secondary variables were the number of pack-years (number of cigarettes per day x number of smoking years ÷ 20), age, gender, schizophrenia, years since diagnosis, metabolic syndrome criteria and SCORE. A stepwise binary logistic regression model was constructed to determine associations between primary and secondary variables and their area under the ROC curve (AUC) was calculated. Of the total sample, 33 patients (19.9%) had the C allele of the polymorphism analyzed. Mean cigarette consumption was 11.6 pack-years. The multivariate analysis showed the following factors as associated with the polymorphism: higher cigarette consumption, being a woman, and not having abdominal obesity. The AUC was 0.706. An association was found between increased cigarette consumption over the years and the presence of the C allele of the rs1414334 polymorphism in the HTR2C gene.
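
    The pack-year definition quoted in the abstract (cigarettes per day x smoking years ÷ 20) is simple enough to state as code. The function name `pack_years` and the example inputs are illustrative assumptions, not values taken from the study's patients.

```python
def pack_years(cigarettes_per_day, years_smoked, cigarettes_per_pack=20):
    """Pack-years as defined in the abstract: daily cigarettes x years / 20."""
    return cigarettes_per_day * years_smoked / cigarettes_per_pack


# One pack (20 cigarettes) a day for 10 years is 10 pack-years.
print(pack_years(20, 10))

# Hypothetical smoker: half a pack a day for 23.2 years gives about 11.6
# pack-years, the same size as the mean consumption reported for the sample.
print(pack_years(10, 23.2))
```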

  4. Repetitive DNA and Plant Domestication: Variation in Copy Number and Proximity to Genes of LTR-Retrotransposons among Wild and Cultivated Sunflower (Helianthus annuus) Genotypes

    PubMed Central

    Mascagni, Flavia; Barghini, Elena; Giordani, Tommaso; Rieseberg, Loren H.; Cavallini, Andrea; Natali, Lucia

    2015-01-01

    The sunflower (Helianthus annuus) genome contains a very large proportion of transposable elements, especially long terminal repeat retrotransposons. However, knowledge on the retrotransposon-related variability within this species is still limited. We used next-generation sequencing (NGS) technologies to perform a quantitative and qualitative survey of intraspecific variation of the retrotransposon fraction of the genome across 15 genotypes—7 wild accessions and 8 cultivars—of H. annuus. By mapping the Illumina reads of the 15 genotypes onto a library of sunflower long terminal repeat retrotransposons, we observed considerable variability in redundancy among genotypes, at both superfamily and family levels. In another analysis, we mapped Illumina paired reads to two sets of sequences, that is, long terminal repeat retrotransposons and protein-encoding sequences, and evaluated the extent of retrotransposon proximity to genes in the sunflower genome by counting the number of paired reads in which one read mapped to a retrotransposon and the other to a gene. Large variability among genotypes was also ascertained for retrotransposon proximity to genes. Both long terminal repeat retrotransposon redundancy and proximity to genes varied among retrotransposon families and also between cultivated and wild genotypes. Such differences are discussed in relation to the possible role of long terminal repeat retrotransposons in the domestication of sunflower. PMID:26608057

  5. Apparent polyploidization after gamma irradiation: pitfalls in the use of quantitative polymerase chain reaction (qPCR) for the estimation of mitochondrial and nuclear DNA gene copy numbers.

    PubMed

    Kam, Winnie W Y; Lake, Vanessa; Banos, Connie; Davies, Justin; Banati, Richard

    2013-05-30

    Quantitative polymerase chain reaction (qPCR) has been widely used to quantify changes in gene copy numbers after radiation exposure. Here, we show that gamma irradiation ranging from 10 to 100 Gy of cells and cell-free DNA samples significantly affects the measured qPCR yield, due to radiation-induced fragmentation of the DNA template and, therefore, introduces errors into the estimation of gene copy numbers. The radiation-induced DNA fragmentation and, thus, measured qPCR yield varies with temperature not only in living cells, but also in isolated DNA irradiated under cell-free conditions. In summary, the variability in measured qPCR yield from irradiated samples introduces a significant error into the estimation of both mitochondrial and nuclear gene copy numbers and may give spurious evidence for polyploidization.

  6. Association analysis of the functional MAOA gene promoter and MAOB gene intron 13 polymorphisms in tension type headache patients.

    PubMed

    Edgnülü, Tuba G; Özge, Aynur; Erdal, Nurten; Kuru, Oktay; Erdal, Mehmet E

    2014-01-01

    Monoamine oxidase (MAO) enzymes play an important role in the etiology of many neurological diseases. Tension type headache (TTH) treatments contain inhibitors for selective re-uptake of serotonin and monoamine oxidase inhibitors. MAO (EC 1.4.3.4) has two isoenzymes known as MAOA and MAOB. A promoter polymorphism of a variable number of tandem repeats (VNTR) in the MAOA gene seems to affect MAOA transcriptional activity in vitro. Also, a G/A polymorphism in intron 13 (rs1799836) of the MAOB gene has previously been found to be associated with the variability of MAOB enzyme activity. The aim of our study was to investigate a possible association of monoamine oxidase (MAOA and MAOB) gene polymorphisms with tension type headache. MAO gene polymorphisms were examined in a group of 120 TTH patients and in another 168 unrelated healthy volunteers (control group). MAOA promoter and MAOB intron 13 polymorphisms were genotyped using PCR-based methods. An overall comparison of MAOA and MAOB genotype and allele frequencies did not reveal any statistically significant difference between the patients and the control group (p=0.162). Factors like estrogen dosage, the limited number of male patients and other genes' neurotransmitters involved in the etiology of TTH could be responsible for our non-significant results.

  7. Analysis of variable sites between two complete South China tiger (Panthera tigris amoyensis) mitochondrial genomes.

    PubMed

    Zhang, Wenping; Yue, Bisong; Wang, Xiaofang; Zhang, Xiuyue; Xie, Zhong; Liu, Nonglin; Fu, Wenyuan; Yuan, Yaohua; Chen, Daqing; Fu, Danghua; Zhao, Bo; Yin, Yuzhong; Yan, Xiahui; Wang, Xinjing; Zhang, Rongying; Liu, Jie; Li, Maoping; Tang, Yao; Hou, Rong; Zhang, Zhihe

    2011-10-01

    In order to investigate the mitochondrial genome of Panthera tigris amoyensis, two South China tigers (P25 and P27) were analyzed following 15 cymt-specific primer sets. The entire mtDNA sequence was found to be 16,957 bp and 17,001 bp long for P25 and P27 respectively, and this difference in length between P25 and P27 occurred in the number of tandem repeats in the RS-3 segment of the control region. The structural characteristics of complete P. t. amoyensis mitochondrial genomes were also highly similar to those of P. uncia. Additionally, the rate of point mutation was only 0.3% and a total of 59 variable sites between P25 and P27 were found. Out of the 59 variable sites, 6 were located in 6 different tRNA genes, 6 in the 2 rRNA genes, 7 in non-coding regions (one located between tRNA-Asn and tRNA-Tyr and six in the D-loop), and 40 in 10 protein-coding genes. COI held the largest number of variable sites (9 sites) and Cytb had the highest rate of variable sites (0.7%) in the complete sequences. Moreover, out of the 40 variable sites located in 10 protein-coding genes, 12 sites were nonsynonymous.

  8. [Identification of potentially invasive species of black flies [Diptera: Simuliidae] from Armenia based on an analysis of variability in the mtDNA barcode of the cox1 gene and chromosomal polymorphism].

    PubMed

    Andrianov, B V; Goryacheva, I I; Vlasov, S V; Gorelova, T V; Harutyunova, M V; Harutyunova, K V; Mayilyan, K R; Zakharov, I A

    2015-03-01

    Black flies (Diptera, Simuliidae) are well known for their medical, environmental, and veterinary importance. The simuliid fauna of Armenia includes 53 species. A number of dominant species are of ecological importance. Complex analysis, which involved morphometric, cytogenetic, and molecular genetic approaches, was conducted to characterize the species status of black flies inhabiting the territory of Armenia. It was shown that the predominant simuliid species, Simulium paraequinum and Simulium kiritshenkoi, belong to a group of species with minimal variability of the cox1 gene. The recently discovered species, Simulium noelleri and Simulium [B.] erythrocephalum, which are new to Armenia, can be considered as potentially invasive, which is supported by the low level of variability of the cox1 gene.

  9. Adaptation to climate through flowering phenology: a case study in Medicago truncatula.

    PubMed

    Burgarella, Concetta; Chantret, Nathalie; Gay, Laurène; Prosperi, Jean-Marie; Bonhomme, Maxime; Tiffin, Peter; Young, Nevin D; Ronfort, Joelle

    2016-07-01

    Local climatic conditions likely constitute an important selective pressure on genes underlying important fitness-related traits such as flowering time, and in many species, flowering phenology and climatic gradients strongly covary. To test whether climate shapes the genetic variation on flowering time genes and to identify candidate flowering genes involved in the adaptation to environmental heterogeneity, we used a large Medicago truncatula core collection to examine the association between nucleotide polymorphisms at 224 candidate genes and both climate variables and flowering phenotypes. Unlike genome-wide studies, candidate gene approaches are expected to enrich for the number of meaningful trait associations because they specifically target genes that are known to affect the trait of interest. We found that flowering time mediates adaptation to climatic conditions mainly by variation at genes located upstream in the flowering pathways, close to the environmental stimuli. Variables related to the annual precipitation regime reflected selective constraints on flowering time genes better than the other variables tested (temperature, altitude, latitude or longitude). By comparing phenotype and climate associations, we identified 12 flowering genes as the most promising candidates responsible for phenological adaptation to climate. Four of these genes were located in the known flowering time QTL region on chromosome 7. However, climate and flowering associations also highlighted largely distinct gene sets, suggesting different genetic architectures for adaptation to climate and flowering onset. © 2016 John Wiley & Sons Ltd.

  10. A Genome-Wide Landscape of Retrocopies in Primate Genomes.

    PubMed

    Navarro, Fábio C P; Galante, Pedro A F

    2015-07-29

    Gene duplication is a key factor contributing to phenotype diversity across and within species. Although the availability of complete genomes has led to the extensive study of genomic duplications, the dynamics and variability of gene duplications mediated by retrotransposition are not well understood. Here, we predict mRNA retrotransposition and use comparative genomics to investigate their origin and variability across primates. Analyzing seven anthropoid primate genomes, we found a similar number of mRNA retrotranspositions (∼7,500 retrocopies) in Catarrhini (Old World Monkeys, including humans), but a surprisingly large number of retrocopies (∼10,000) in Platyrrhini (New World Monkeys), which may be a by-product of higher long interspersed nuclear element 1 activity in these genomes. By inferring retrocopy orthology, we dated most of the primate retrocopy origins, and estimated a decrease in the fixation rate in recent primate history, implying a smaller number of species-specific retrocopies. Moreover, using RNA-Seq data, we identified approximately 3,600 expressed retrocopies. As expected, most of these retrocopies are located near or within known genes, present tissue-specific and even species-specific expression patterns, and show no expression correlation with their parental genes. Taken together, our results provide further evidence that mRNA retrotransposition is an active mechanism in primate evolution and suggest that retrocopies may not only introduce great genetic variability between lineages but also create a large reservoir of potentially functional new genomic loci in primate genomes. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  11. Repetitive DNA and Plant Domestication: Variation in Copy Number and Proximity to Genes of LTR-Retrotransposons among Wild and Cultivated Sunflower (Helianthus annuus) Genotypes.

    PubMed

    Mascagni, Flavia; Barghini, Elena; Giordani, Tommaso; Rieseberg, Loren H; Cavallini, Andrea; Natali, Lucia

    2015-11-24

    The sunflower (Helianthus annuus) genome contains a very large proportion of transposable elements, especially long terminal repeat retrotransposons. However, knowledge on the retrotransposon-related variability within this species is still limited. We used next-generation sequencing (NGS) technologies to perform a quantitative and qualitative survey of intraspecific variation of the retrotransposon fraction of the genome across 15 genotypes--7 wild accessions and 8 cultivars--of H. annuus. By mapping the Illumina reads of the 15 genotypes onto a library of sunflower long terminal repeat retrotransposons, we observed considerable variability in redundancy among genotypes, at both superfamily and family levels. In another analysis, we mapped Illumina paired reads to two sets of sequences, that is, long terminal repeat retrotransposons and protein-encoding sequences, and evaluated the extent of retrotransposon proximity to genes in the sunflower genome by counting the number of paired reads in which one read mapped to a retrotransposon and the other to a gene. Large variability among genotypes was also ascertained for retrotransposon proximity to genes. Both long terminal repeat retrotransposon redundancy and proximity to genes varied among retrotransposon families and also between cultivated and wild genotypes. Such differences are discussed in relation to the possible role of long terminal repeat retrotransposons in the domestication of sunflower. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  12. Increased variability of stimulus-driven cortical responses is associated with genetic variability in children with and without dyslexia.

    PubMed

    Centanni, T M; Pantazis, D; Truong, D T; Gruen, J R; Gabrieli, J D E; Hogan, T P

    2018-05-26

    Individuals with dyslexia exhibit increased brainstem variability in response to sound. It is unknown as to whether increased variability extends to neocortical regions associated with audition and reading, extends to visual stimuli, and whether increased variability characterizes all children with dyslexia or, instead, a specific subset of children. We evaluated the consistency of stimulus-evoked neural responses in children with (N = 20) or without dyslexia (N = 12) as measured by magnetoencephalography (MEG). Approximately half of the children with dyslexia had significantly higher levels of variability in cortical responses to both auditory and visual stimuli in multiple nodes of the reading network. There was a significant and positive relationship between the number of risk alleles at rs6935076 in the dyslexia-susceptibility gene KIAA0319 and the degree of neural variability in primary auditory cortex across all participants. This gene has been linked with neural variability in rodents and in typical readers. These findings indicate that unstable representations of auditory and visual stimuli in auditory and other reading-related neocortical regions are present in a subset of children with dyslexia and support the link between the gene KIAA0319 and the auditory neural variability across children with or without dyslexia. Copyright © 2018 The Authors. Published by Elsevier Ltd. All rights reserved.

  13. An Independent Filter for Gene Set Testing Based on Spectral Enrichment.

    PubMed

    Frost, H Robert; Li, Zhigang; Asselbergs, Folkert W; Moore, Jason H

    2015-01-01

    Gene set testing has become an indispensable tool for the analysis of high-dimensional genomic data. An important motivation for testing gene sets, rather than individual genomic variables, is to improve statistical power by reducing the number of tested hypotheses. Given the dramatic growth in common gene set collections, however, testing is often performed with nearly as many gene sets as underlying genomic variables. To address the challenge to statistical power posed by large gene set collections, we have developed spectral gene set filtering (SGSF), a novel technique for independent filtering of gene set collections prior to gene set testing. The SGSF method uses as a filter statistic the p-value measuring the statistical significance of the association between each gene set and the sample principal components (PCs), taking into account the significance of the associated eigenvalues. Because this filter statistic is independent of standard gene set test statistics under the null hypothesis but dependent under the alternative, the proportion of enriched gene sets is increased without impacting the type I error rate. As shown using simulated and real gene expression data, the SGSF algorithm accurately filters gene sets unrelated to the experimental outcome resulting in significantly increased gene set testing power.

  14. A genome-wide methylation study on obesity: differential variability and differential methylation.

    PubMed

    Xu, Xiaojing; Su, Shaoyong; Barnes, Vernon A; De Miguel, Carmen; Pollock, Jennifer; Ownby, Dennis; Shi, Hidong; Zhu, Haidong; Snieder, Harold; Wang, Xiaoling

    2013-05-01

    Besides differential methylation, DNA methylation variation has recently been proposed and demonstrated to be a potential contributing factor to cancer risk. Here we aim to examine whether differential variability in methylation is also an important feature of obesity, a typical non-malignant common complex disease. We analyzed genome-wide methylation profiles of over 470,000 CpGs in peripheral blood samples from 48 obese and 48 lean African-American youth aged 14-20 y old. A substantial number of differentially variable CpG sites (DVCs), using statistics based on variances, as well as a substantial number of differentially methylated CpG sites (DMCs), using statistics based on means, were identified. Similar to the findings in cancers, DVCs generally exhibited an outlier structure and were more variable in cases than in controls. By randomly splitting the current sample into a discovery and validation set, we observed that both the DVCs and DMCs identified from the first set could independently predict obesity status in the second set. Furthermore, both the genes harboring DMCs and the genes harboring DVCs showed significant enrichment of genes identified by genome-wide association studies on obesity and related diseases, such as hypertension, dyslipidemia, type 2 diabetes and certain types of cancers, supporting their roles in the etiology and pathogenesis of obesity. We generalized the recent finding on methylation variability in cancer research to obesity and demonstrated that differential variability is also an important feature of obesity-related methylation changes. Future studies on the epigenetics of obesity will benefit from both statistics based on means and statistics based on variances.
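
    The distinction between statistics based on means (DMCs) and statistics based on variances (DVCs) can be illustrated with a toy CpG site. The abstract does not name the tests used, so the Welch t statistic and the plain variance ratio below are stand-ins chosen for illustration, and the beta values are invented.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(cases, controls):
    """Welch t statistic: compares group means (a mean-based, DMC-style statistic)."""
    se = sqrt(variance(cases) / len(cases) + variance(controls) / len(controls))
    return (mean(cases) - mean(controls)) / se


def variance_ratio(cases, controls):
    """Ratio of sample variances: compares group spread (a variance-based, DVC-style statistic)."""
    return variance(cases) / variance(controls)


# Hypothetical methylation beta values at a single CpG site.
lean = [0.48, 0.50, 0.52, 0.49, 0.51]
obese = [0.30, 0.50, 0.70, 0.45, 0.55]   # same mean as lean, much wider spread

print(welch_t(obese, lean))         # close to 0: no difference in means
print(variance_ratio(obese, lean))  # large: cases are far more variable
```

    A site like this would be missed by a mean-based screen but flagged by a variance-based one, which is the motivation the abstract gives for using both kinds of statistic.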

  15. Analysis of Copy Number Variation in the Abp Gene Regions of Two House Mouse Subspecies Suggests Divergence during the Gene Family Expansions

    PubMed Central

    Pezer, Željka; Chung, Amanda G.; Karn, Robert C.

    2017-01-01

    Abstract The Androgen-binding protein (Abp) gene region of the mouse genome contains 64 genes, some encoding pheromones that influence assortative mating between mice from different subspecies. Using CNVnator and quantitative PCR, we explored copy number variation in this gene family in natural populations of Mus musculus domesticus (Mmd) and Mus musculus musculus (Mmm), two subspecies of house mice that form a narrow hybrid zone in Central Europe. We found that copy number variation in the center of the Abp gene region is very common in wild Mmd, primarily representing the presence/absence of the final duplications described for the mouse genome. Clustering of Mmd individuals based on this variation did not reflect their geographical origin, suggesting no population divergence in the Abp gene cluster. However, copy number variation patterns differ substantially between Mmd and other mouse taxa. Large blocks of Abp genes are absent in Mmm, Mus musculus castaneus and an outgroup, Mus spretus, although with differences in variation and breakpoint locations. Our analysis calls into question the reliance on a reference genome for interpreting the detailed organization of genes in taxa more distant from the Mmd reference genome. The polymorphic nature of the gene family expansion in all four taxa suggests that the number of Abp genes, especially in the central gene region, is not critical to the survival and reproduction of the mouse. However, Abp haplotypes of variable length may serve as a source of raw genetic material for new signals influencing reproductive communication and thus speciation of mice. PMID:28575204

  16. 5p13 microduplication syndrome: a new case and better clinical definition of the syndrome.

    PubMed

    Novara, Francesca; Alfei, Enrico; D'Arrigo, Stefano; Pantaleoni, Chiara; Beri, Silvana; Achille, Valentina; Sciacca, Francesca L; Giorda, Roberto; Zuffardi, Orsetta; Ciccone, Roberto

    2013-01-01

    Chromosome 5p13 duplication syndrome (OMIM #613174), a contiguous gene syndrome involving duplication of several genes on chromosome 5p13 including NIPBL (OMIM 608667), has been described in rare patients with developmental delay and learning disability, behavioral problems and peculiar facial dysmorphisms. 5p13 duplications described so far present with variable sizes, from 0.25 to 13.6 Mb, and contain a variable number of genes. Here we report another patient with 5p13 duplication syndrome involving the NIPBL gene only. The proband's phenotype overlapped that reported in patients with 5p13 microduplication syndrome and especially that of subjects with smaller duplications. Moreover, we better define the genotype-phenotype relationship associated with this duplication and confirm that NIPBL is likely the major dosage-sensitive gene for the 5p13 microduplication phenotype. Copyright © 2012 Elsevier Masson SAS. All rights reserved.

  17. Interaction of Dopamine Transporter Gene and Observed Parenting Behaviors on Attention-Deficit/Hyperactivity Disorder: A Structural Equation Modeling Approach

    ERIC Educational Resources Information Center

    Li, James J.; Lee, Steve S.

    2013-01-01

    Emerging evidence suggests that some individuals may be simultaneously more responsive to the effects from environmental adversity "and" enrichment (i.e., differential susceptibility). Given that parenting behavior and a variable number tandem repeat polymorphism in the 3'untranslated region of the dopamine transporter (DAT1) gene are…

  18. Variability of cytokine gene expression in intestinal tissue and the impact of normalization with the use of reference genes.

    PubMed

    McGowan, Ian; Janocko, Laura; Burneisen, Shaun; Bhat, Anand; Richardson-Harman, Nicola

    2015-01-01

    To determine the intra- and inter-subject variability of mucosal cytokine gene expression in rectal biopsies from healthy volunteers and to screen cytokine and chemokine mRNA as potential biomarkers of mucosal inflammation. Rectal biopsies were collected from 8 participants (3 biopsies per participant) and 1 additional participant (10 biopsies). Quantitative reverse transcription polymerase chain reaction (RT-qPCR) was used to quantify IL-1β, IL-6, IL-12p40, IL-8, IFN-γ, MIP-1α, MIP-1β, RANTES, and TNF-α gene expression in the rectal tissue. The intra-assay, inter-biopsy and inter-subject variance was measured in the eight participants. Bootstrap re-sampling of the biopsy measurements was performed to determine the accuracy of gene expression data obtained for 10 biopsies obtained from one participant. Cytokines were both non-normalized and normalized using four reference genes (GAPDH, β-actin, β2 microglobulin, and CD45). Cytokine measurement accuracy increased with the number of biopsy samples per person; four biopsies were typically needed to produce a mean result within a 95% confidence interval of the subject's cytokine level approximately 80% of the time. Intra-assay precision (% geometric standard deviation) ranged between 8.2 and 96.9 with high variance between patients and even between different biopsies from the same patient. Variability was not greatly reduced with the use of reference genes to normalize data. The number of biopsy samples required to provide an accurate result varied by target, although 4 biopsy samples per subject and timepoint provided >77% accuracy across all targets tested. Biopsies within the same subjects and between subjects had similar levels of variance while variance within a biopsy (intra-assay) was generally lower. Normalization of inflammatory cytokines against reference genes failed to consistently reduce variance. 
The accuracy and reliability of mRNA expression of inflammatory cytokines will set a ceiling on the ability of these measures to predict mucosal inflammation. Techniques to reduce variability should be developed within a larger cohort of individuals before normative reference values can be validated. Copyright © 2014 Elsevier Ltd. All rights reserved.
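
    The bootstrap question posed in the abstract (how many biopsies are needed before the mean lands inside the subject's 95% confidence interval) can be sketched as follows. This is an assumption-laden illustration rather than the authors' procedure: the interval is taken as mean ± 1.96 standard errors computed from all ten measurements, the function name `coverage_by_sample_size` is hypothetical, and the expression values are invented.

```python
import random
from math import sqrt
from statistics import mean, stdev


def coverage_by_sample_size(values, n_boot=2000, seed=0):
    """For each k, estimate how often the mean of k resampled biopsies falls
    inside the 95% confidence interval built from all measurements."""
    rng = random.Random(seed)
    centre = mean(values)
    half_width = 1.96 * stdev(values) / sqrt(len(values))  # assumed CI definition
    coverage = {}
    for k in range(1, len(values) + 1):
        hits = 0
        for _ in range(n_boot):
            resample = rng.choices(values, k=k)   # draw k biopsies with replacement
            if abs(sum(resample) / k - centre) <= half_width:
                hits += 1
        coverage[k] = hits / n_boot
    return coverage


# Hypothetical expression values for ten biopsies from one participant.
biopsies = [2.1, 3.4, 1.8, 5.0, 2.9, 4.2, 3.1, 2.5, 3.8, 4.6]
for k, fraction in coverage_by_sample_size(biopsies).items():
    print(k, round(fraction, 2))
```

    Reading down the printed table shows how the fraction of resampled means inside the interval rises with the number of biopsies, which is the kind of curve the abstract summarizes with its "four biopsies" figure.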

  19. A Sustained Dietary Change Increases Epigenetic Variation in Isogenic Mice

    PubMed Central

    Cowley, Mark J.; Preiss, Thomas; Martin, David I. K.; Suter, Catherine M.

    2011-01-01

    Epigenetic changes can be induced by adverse environmental exposures, such as nutritional imbalance, but little is known about the nature or extent of these changes. Here we have explored the epigenomic effects of a sustained nutritional change, excess dietary methyl donors, by assessing genomic CpG methylation patterns in isogenic mice exposed for one or six generations. We find stochastic variation in methylation levels at many loci; exposure to methyl donors increases the magnitude of this variation and the number of variable loci. Several gene ontology categories are significantly overrepresented in genes proximal to these methylation-variable loci, suggesting that certain pathways are susceptible to environmental influence on their epigenetic states. Long-term exposure to the diet (six generations) results in a larger number of loci exhibiting epigenetic variability, suggesting that some of the induced changes are heritable. This finding presents the possibility that epigenetic variation within populations can be induced by environmental change, providing a vehicle for disease predisposition and possibly a substrate for natural selection. PMID:21541011

  20. Evaluation of Genetic Algorithm Concepts using Model Problems. Part 1; Single-Objective Optimization

    NASA Technical Reports Server (NTRS)

    Holst, Terry L.; Pulliam, Thomas H.

    2003-01-01

    A genetic-algorithm-based optimization approach is described and evaluated using a simple hill-climbing model problem. The model problem utilized herein allows for the broad specification of a large number of search spaces including spaces with an arbitrary number of genes or decision variables and an arbitrary number of hills or modes. In the present study, only single objective problems are considered. Results indicate that the genetic algorithm optimization approach is flexible in application and extremely reliable, providing optimal results for all problems attempted. The most difficult problems - those with large hyper-volumes and multi-mode search spaces containing a large number of genes - require a large number of function evaluations for GA convergence, but they always converge.
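
    As a rough illustration of the kind of approach this abstract describes, the sketch below runs a minimal real-coded genetic algorithm on a toy fitness landscape with two hills per gene. The fitness function, the operators (tournament selection, uniform crossover, Gaussian mutation, elitism), and all parameter values are assumptions chosen for the example; none of them are taken from the report.

```python
import math
import random


def fitness(genes):
    """Toy multi-modal landscape (hypothetical): each gene sees a low and a high hill.

    The high hill is centred at 0.75, the low hill at 0.25.
    """
    total = 0.0
    for g in genes:
        low_hill = 0.5 * math.exp(-((g - 0.25) ** 2) / 0.005)
        high_hill = 1.0 * math.exp(-((g - 0.75) ** 2) / 0.005)
        total += low_hill + high_hill
    return total / len(genes)


def tournament(population, scores, rng, k=3):
    """Return the fittest of k randomly chosen individuals."""
    picks = rng.sample(range(len(population)), k)
    best = max(picks, key=lambda i: scores[i])
    return population[best]


def run_ga(n_genes=4, pop_size=60, generations=150, mutation_sd=0.05, seed=0):
    """Evolve a population of gene vectors in [0, 1] and return (best_genes, best_fitness)."""
    rng = random.Random(seed)
    population = [[rng.random() for _ in range(n_genes)] for _ in range(pop_size)]
    for _ in range(generations):
        scores = [fitness(ind) for ind in population]
        # Elitism: carry the current best individual over unchanged.
        elite = population[max(range(pop_size), key=lambda i: scores[i])]
        children = [list(elite)]
        while len(children) < pop_size:
            mother = tournament(population, scores, rng)
            father = tournament(population, scores, rng)
            # Uniform crossover followed by Gaussian mutation, clipped to [0, 1].
            child = [m if rng.random() < 0.5 else f for m, f in zip(mother, father)]
            child = [min(1.0, max(0.0, g + rng.gauss(0.0, mutation_sd))) for g in child]
            children.append(child)
        population = children
    best = max(population, key=fitness)
    return best, fitness(best)


if __name__ == "__main__":
    best_genes, best_score = run_ga()
    print(best_genes, best_score)
```

    Because the best individual is always copied into the next generation, the best fitness in this sketch can never decrease from one generation to the next; how many function evaluations are needed to reach the highest hill depends on the number of genes and on the assumed parameters.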

  1. Diversity and population-genetic properties of copy number variations and multicopy genes in cattle

    PubMed Central

    Bickhart, Derek M.; Xu, Lingyang; Hutchison, Jana L.; Cole, John B.; Null, Daniel J.; Schroeder, Steven G.; Song, Jiuzhou; Garcia, Jose Fernando; Sonstegard, Tad S.; Van Tassell, Curtis P.; Schnabel, Robert D.; Taylor, Jeremy F.; Lewin, Harris A.; Liu, George E.

    2016-01-01

    The diversity and population genetics of copy number variation (CNV) in domesticated animals are not well understood. In this study, we analysed 75 genomes of major taurine and indicine cattle breeds (including Angus, Brahman, Gir, Holstein, Jersey, Limousin, Nelore, and Romagnola), sequenced to 11-fold coverage to identify 1,853 non-redundant CNV regions. Supported by high validation rates in array comparative genomic hybridization (CGH) and qPCR experiments, these CNV regions accounted for 3.1% (87.5 Mb) of the cattle reference genome, representing a significant increase over previous estimates of the area of the genome that is copy number variable (∼2%). Further population genetics and evolutionary genomics analyses based on these CNVs revealed the population structures of the cattle taurine and indicine breeds and uncovered potential diversely selected CNVs near important functional genes, including AOX1, ASZ1, GAT, GLYAT, and KRTAP9-1. Additionally, 121 CNV gene regions were found to be either breed specific or differentially variable across breeds, such as RICTOR in dairy breeds and PNPLA3 in beef breeds. In contrast, clusters of the PRP and PAG genes were found to be duplicated in all sequenced animals, suggesting that subfunctionalization, neofunctionalization, or overdominance play roles in diversifying those fertility-related genes. These CNV results provide a new glimpse into the diverse selection histories of cattle breeds and a basis for correlating structural variation with complex traits in the future. PMID:27085184

  2. Mitochondria and the non-genetic origins of cell-to-cell variability: More is different.

    PubMed

    Guantes, Raúl; Díaz-Colunga, Juan; Iborra, Francisco J

    2016-01-01

    Gene expression activity is heterogeneous in a population of isogenic cells. Identifying the molecular basis of this variability will improve our understanding of phenomena like tumor resistance to drugs, virus infection, or cell fate choice. The complexity of the molecular steps and machines involved in transcription and translation could introduce sources of randomness at many levels, but a common constraint on most of these processes is their energy dependence. In eukaryotic cells, most of this energy is provided by mitochondria. A clonal population of cells may show a large variability in the number and functionality of mitochondria. Here, we discuss how differences in the mitochondrial content of each cell contribute to heterogeneity in gene products. Changes in the amount of mitochondria can also entail drastic alterations of a cell's gene expression program, which ultimately leads to phenotypic diversity. Also watch the Video Abstract. © 2015 WILEY Periodicals, Inc.

  3. Genetic Algorithms Applied to Multi-Objective Aerodynamic Shape Optimization

    NASA Technical Reports Server (NTRS)

    Holst, Terry L.

    2004-01-01

    A genetic algorithm approach suitable for solving multi-objective optimization problems is described and evaluated using a series of aerodynamic shape optimization problems. Several new features including two variations of a binning selection algorithm and a gene-space transformation procedure are included. The genetic algorithm is suitable for finding Pareto optimal solutions in search spaces that are defined by any number of genes and that contain any number of local extrema. A new masking array capability is included allowing any gene or gene subset to be eliminated as decision variables from the design space. This allows determination of the effect of a single gene or gene subset on the Pareto optimal solution. Results indicate that the genetic algorithm optimization approach is flexible in application and reliable. The binning selection algorithms generally provide Pareto front quality enhancements and moderate convergence efficiency improvements for most of the problems solved.

  4. Genetic Algorithms Applied to Multi-Objective Aerodynamic Shape Optimization

    NASA Technical Reports Server (NTRS)

    Holst, Terry L.

    2005-01-01

    A genetic algorithm approach suitable for solving multi-objective problems is described and evaluated using a series of aerodynamic shape optimization problems. Several new features including two variations of a binning selection algorithm and a gene-space transformation procedure are included. The genetic algorithm is suitable for finding Pareto optimal solutions in search spaces that are defined by any number of genes and that contain any number of local extrema. A new masking array capability is included allowing any gene or gene subset to be eliminated as decision variables from the design space. This allows determination of the effect of a single gene or gene subset on the Pareto optimal solution. Results indicate that the genetic algorithm optimization approach is flexible in application and reliable. The binning selection algorithms generally provide Pareto front quality enhancements and moderate convergence efficiency improvements for most of the problems solved.

  5. Looking for variable molecular markers in the chestnut gall wasp Dryocosmus kuriphilus: first comparison across genes.

    PubMed

    Bonal, Raúl; Vargas-Osuna, Enrique; Mena, Juan Diego; Aparicio, José Miguel; Santoro, María; Martín, Angela

    2018-04-04

    The quick spread of the chestnut gall wasp Dryocosmus kuriphilus in Europe constitutes an outstanding example of recent human-aided biological invasion with dramatic economic losses. We screened for the first time a set of five nuclear and mitochondrial genes from D. kuriphilus collected in the Iberian Peninsula, and compared the sequences with those available from the native and invasive range of the species. We found no genetic variability in Iberia in any of the five genes; moreover, the three genes compared with other European samples showed no variability either. We recorded four cytochrome b haplotypes in Europe; one was genuine mitochondrial DNA and the rest were nuclear copies of mitDNA (numts), which stresses the need for careful in silico analyses. The numts formed a separate cluster in the gene tree and at least two of them might be orthologous, which suggests that the invasion might have started with more than one individual. Our results point to a low initial population size in Europe followed by quick population growth. Future studies assessing the expansion of this pest should include a large number of sampling sites and use powerful nuclear markers (e.g. Single Nucleotide Polymorphisms) to detect genetic variability.

  6. Genetic control of biennial bearing in apple

    PubMed Central

    Guitton, Baptiste; Kelner, Jean-Jacques; Velasco, Riccardo; Gardiner, Susan E.; Chagné, David; Costes, Evelyne

    2012-01-01

    Although flowering in mature fruit trees is recurrent, floral induction can be strongly inhibited by concurrent fruiting, leading to a pattern of irregular fruiting across consecutive years referred to as biennial bearing. The genetic determinants of biennial bearing in apple were investigated using the 114 flowering individuals from an F1 population of 122 genotypes, from a ‘Starkrimson’ (strong biennial bearer) × ‘Granny Smith’ (regular bearer) cross. The number of inflorescences, and the number and the mass of harvested fruit were recorded over 6 years and used to calculate 26 variables and indices quantifying yield, precocity of production, and biennial bearing. Inflorescence traits exhibited the highest genotypic effect, and three quantitative trait loci (QTLs) on linkage group (LG) 4, LG8, and LG10 explained 50% of the phenotypic variability for biennial bearing. Apple orthologues of flowering and hormone-related genes were retrieved from the whole-genome assembly of ‘Golden Delicious’ and their position was compared with QTLs. Four main genomic regions that contain floral integrator genes, meristem identity genes, and gibberellin oxidase genes co-located with QTLs. The results indicated that flowering genes are less likely to be responsible for biennial bearing than hormone-related genes. New hypotheses for the control of biennial bearing emerged from QTL and candidate gene co-locations and suggest the involvement of different physiological processes such as the regulation of flowering genes by hormones. The correlation between tree architecture and biennial bearing is also discussed. PMID:21963613

  7. Engineered promoters enable constant gene expression at any copy number in bacteria.

    PubMed

    Segall-Shapiro, Thomas H; Sontag, Eduardo D; Voigt, Christopher A

    2018-04-01

    The internal environment of growing cells is variable and dynamic, making it difficult to introduce reliable parts, such as promoters, for genetic engineering. Here, we applied control-theoretic ideas to design promoters that maintained constant levels of expression at any copy number. Theory predicts that independence from copy number can be achieved by using an incoherent feedforward loop (iFFL) if the negative regulation is perfectly non-cooperative. We engineered iFFLs into Escherichia coli promoters using transcription-activator-like effectors (TALEs). These promoters had near-identical expression in different genome locations and plasmids, even when their copy number was perturbed by genomic mutations or changes in growth medium composition. We applied the stabilized promoters to show that a three-gene metabolic pathway to produce deoxychromoviridans could retain function without re-tuning when the stabilized-promoter-driven genes were moved from a plasmid into the genome.

  8. Questioning the utility of pooling samples in microarray experiments with cell lines.

    PubMed

    Lusa, L; Cappelletti, V; Gariboldi, M; Ferrario, C; De Cecco, L; Reid, J F; Toffanin, S; Gallus, G; McShane, L M; Daidone, M G; Pierotti, M A

    2006-01-01

    We describe a microarray experiment using the MCF-7 breast cancer cell line in two different experimental conditions for which the same number of independent pools as the number of individual samples was hybridized on Affymetrix GeneChips. Unexpectedly, when using individual samples, the number of probe sets found to be differentially expressed between treated and untreated cells was about three times greater than that found using pools. These findings indicate that pooling samples in microarray experiments where the biological variability is expected to be small might not be helpful and could even decrease one's ability to identify differentially expressed genes.

  9. Fully moderated T-statistic for small sample size gene expression arrays.

    PubMed

    Yu, Lianbo; Gulati, Parul; Fernandez, Soledad; Pennell, Michael; Kirschner, Lawrence; Jarjoura, David

    2011-09-15

    Gene expression microarray experiments with few replications lead to great variability in estimates of gene variances. Several Bayesian methods have been developed to reduce this variability and to increase power. Thus far, moderated t methods assumed a constant coefficient of variation (CV) for the gene variances. We provide evidence against this assumption, and extend the method by allowing the CV to vary with gene expression. Our CV varying method, which we refer to as the fully moderated t-statistic, was compared to three other methods (ordinary t, and two moderated t predecessors). A simulation study and a familiar spike-in data set were used to assess the performance of the testing methods. The results showed that our CV varying method had higher power than the other three methods, identified a greater number of true positives in spike-in data, fit simulated data under varying assumptions very well, and in a real data set better identified higher expressing genes that were consistent with functional pathways associated with the experiments.
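
    For context, the moderated t methods that this abstract builds on shrink each gene's sample variance toward a prior value before forming the test statistic, commonly written as s_shrunk^2 = (d0*s0^2 + d*s^2) / (d0 + d). The sketch below shows only that shrinkage step for a single gene in a two-group comparison. The prior degrees of freedom and prior variance are passed in as fixed, hypothetical values rather than estimated from the data, and the sketch does not implement the authors' CV-varying extension.

```python
import math
from statistics import fmean, variance


def moderated_t(group_a, group_b, prior_df, prior_var):
    """Two-group moderated t-statistic for one gene.

    prior_df and prior_var are assumed known here; in practice they are
    estimated from all genes by an empirical Bayes step that is not shown.
    Returns the statistic and its total degrees of freedom.
    """
    n_a, n_b = len(group_a), len(group_b)
    resid_df = n_a + n_b - 2

    # Pooled within-group sample variance for this gene.
    pooled_var = ((n_a - 1) * variance(group_a) + (n_b - 1) * variance(group_b)) / resid_df

    # Shrink the gene's variance toward the prior variance.
    shrunk_var = (prior_df * prior_var + resid_df * pooled_var) / (prior_df + resid_df)

    mean_diff = fmean(group_a) - fmean(group_b)
    t_mod = mean_diff / math.sqrt(shrunk_var * (1.0 / n_a + 1.0 / n_b))
    return t_mod, prior_df + resid_df
```

    With `prior_df` set to 0 the function reduces to the ordinary pooled-variance t-statistic, which makes the effect of the prior easy to see: a larger `prior_df` pulls every gene's variance further toward `prior_var`, stabilizing estimates when there are few replicates.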

  10. How Does the Scientific Community Contribute to Gene Ontology?

    PubMed

    Lovering, Ruth C

    2017-01-01

    Collaborations between the scientific community and members of the Gene Ontology (GO) Consortium have led to an increase in the number and specificity of GO terms, as well as an increase in the number of GO annotations. A variety of approaches have been taken to encourage research scientists to contribute to the GO, but the success of these approaches has been variable. This chapter reviews both the successes and failures of engaging the scientific community in GO development and annotation, as well as providing motivation and advice to encourage individual researchers to contribute to GO.

  11. Computational Analysis of Candidate Disease Genes and Variants for Salt-Sensitive Hypertension in Indigenous Southern Africans

    PubMed Central

    Tiffin, Nicki; Meintjes, Ayton; Ramesar, Rajkumar; Bajic, Vladimir B.; Rayner, Brian

    2010-01-01

    Multiple factors underlie susceptibility to essential hypertension, including a significant genetic and ethnic component, and environmental effects. Blood pressure response of hypertensive individuals to salt is heterogeneous, but salt sensitivity appears more prevalent in people of indigenous African origin. The underlying genetics of salt-sensitive hypertension, however, are poorly understood. In this study, computational methods including text- and data-mining have been used to select and prioritize candidate aetiological genes for salt-sensitive hypertension. Additionally, we have compared allele frequencies and copy number variation for single nucleotide polymorphisms in candidate genes between indigenous Southern African and Caucasian populations, with the aim of identifying candidate genes with significant variability between the population groups: identifying genetic variability between population groups can exploit ethnic differences in disease prevalence to aid with prioritisation of good candidate genes. Our top-ranking candidate genes include parathyroid hormone precursor (PTH) and type-1 angiotensin II receptor (AGTR1). We propose that the candidate genes identified in this study warrant further investigation as potential aetiological genes for salt-sensitive hypertension. PMID:20886000

  12. Evolution of Prdm Genes in Animals: Insights from Comparative Genomics

    PubMed Central

    Vervoort, Michel; Meulemeester, David; Béhague, Julien; Kerner, Pierre

    2016-01-01

    Prdm genes encode transcription factors with a subtype of SET domain known as the PRDF1-RIZ (PR) homology domain and a variable number of zinc finger motifs. These genes are involved in a wide variety of functions during animal development. As most Prdm genes have been studied in vertebrates, especially in mice, little is known about the evolution of this gene family. We searched for Prdm genes in the fully sequenced genomes of 93 different species representative of all the main metazoan lineages. A total of 976 Prdm genes were identified in these species. The number of Prdm genes per species ranges from 2 to 19. To better understand how the Prdm gene family has evolved in metazoans, we performed phylogenetic analyses using this large set of identified Prdm genes. These analyses allowed us to define 14 different subfamilies of Prdm genes and to establish, through ancestral state reconstruction, that 11 of them are ancestral to bilaterian animals. Three additional subfamilies were acquired during early vertebrate evolution (Prdm5, Prdm11, and Prdm17). Several gene duplication and gene loss events were identified and mapped onto the metazoan phylogenetic tree. By studying a large number of nonmetazoan genomes, we confirmed that Prdm genes likely constitute a metazoan-specific gene family. Our data also suggest that Prdm genes originated before the diversification of animals through the association of a single ancestral SET domain encoding gene with one or several zinc finger encoding genes. PMID:26560352

  13. Analysis of Copy Number Variation in the Abp Gene Regions of Two House Mouse Subspecies Suggests Divergence during the Gene Family Expansions.

    PubMed

    Pezer, Željka; Chung, Amanda G; Karn, Robert C; Laukaitis, Christina M

    2017-06-01

    The Androgen-binding protein (Abp) gene region of the mouse genome contains 64 genes, some encoding pheromones that influence assortative mating between mice from different subspecies. Using CNVnator and quantitative PCR, we explored copy number variation in this gene family in natural populations of Mus musculus domesticus (Mmd) and Mus musculus musculus (Mmm), two subspecies of house mice that form a narrow hybrid zone in Central Europe. We found that copy number variation in the center of the Abp gene region is very common in wild Mmd, primarily representing the presence/absence of the final duplications described for the mouse genome. Clustering of Mmd individuals based on this variation did not reflect their geographical origin, suggesting no population divergence in the Abp gene cluster. However, copy number variation patterns differ substantially between Mmd and other mouse taxa. Large blocks of Abp genes are absent in Mmm, Mus musculus castaneus and an outgroup, Mus spretus, although with differences in variation and breakpoint locations. Our analysis calls into question the reliance on a reference genome for interpreting the detailed organization of genes in taxa more distant from the Mmd reference genome. The polymorphic nature of the gene family expansion in all four taxa suggests that the number of Abp genes, especially in the central gene region, is not critical to the survival and reproduction of the mouse. However, Abp haplotypes of variable length may serve as a source of raw genetic material for new signals influencing reproductive communication and thus speciation of mice. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  14. Improved high-dimensional prediction with Random Forests by the use of co-data.

    PubMed

    Te Beest, Dennis E; Mes, Steven W; Wilting, Saskia M; Brakenhoff, Ruud H; van de Wiel, Mark A

    2017-12-28

    Prediction in high dimensional settings is difficult due to the large number of variables relative to the sample size. We demonstrate how auxiliary 'co-data' can be used to improve the performance of a Random Forest in such a setting. Co-data are incorporated in the Random Forest by replacing the uniform sampling probabilities that are used to draw candidate variables by co-data moderated sampling probabilities. Co-data here are defined as any type of information that is available on the variables of the primary data, but does not use its response labels. Inspired by empirical Bayes, these moderated sampling probabilities are learned from the data at hand. We demonstrate the co-data moderated Random Forest (CoRF) with two examples. In the first example we aim to predict the presence of a lymph node metastasis with gene expression data. We demonstrate how a set of external p-values, a gene signature, and the correlation between gene expression and DNA copy number can improve the predictive performance. In the second example we demonstrate how the prediction of cervical (pre-)cancer with methylation data can be improved by including the location of the probe relative to the known CpG islands, the number of CpG sites targeted by a probe, and a set of p-values from a related study. The proposed method is able to utilize auxiliary co-data to improve the performance of a Random Forest.
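
    The core mechanism described in this abstract, drawing candidate variables with non-uniform rather than uniform probabilities, can be sketched as follows. This is a minimal illustration only: the function name, the weights, and the use of sequential weighted sampling without replacement are assumptions made for the example, and the empirical Bayes step that learns the probabilities from co-data is not shown.

```python
import random


def draw_candidates(weights, n_candidates, rng):
    """Draw up to n_candidates distinct variable indices, favouring high-weight variables.

    weights: non-negative sampling weights, one per variable (hypothetical
    stand-ins for co-data moderated sampling probabilities).
    rng: a random.Random instance, passed in so that draws are reproducible.
    """
    remaining = list(range(len(weights)))
    remaining_weights = [float(w) for w in weights]
    n_positive = sum(1 for w in weights if w > 0)
    chosen = []
    for _ in range(min(n_candidates, n_positive)):
        # Pick one position in proportion to its weight, then remove it from the pool.
        pick = rng.choices(range(len(remaining)), weights=remaining_weights, k=1)[0]
        chosen.append(remaining.pop(pick))
        remaining_weights.pop(pick)
    return chosen
```

    In a tree-building loop, a call such as `draw_candidates(weights, mtry, rng)` would take the place of the uniform draw of candidate variables at each split; `mtry` is the usual Random Forest name for the number of candidates and is used here only as a placeholder.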

  15. Learning Parsimonious Classification Rules from Gene Expression Data Using Bayesian Networks with Local Structure.

    PubMed

    Lustgarten, Jonathan Lyle; Balasubramanian, Jeya Balaji; Visweswaran, Shyam; Gopalakrishnan, Vanathi

    2017-03-01

    The comprehensibility of good predictive models learned from high-dimensional gene expression data is attractive because it can lead to biomarker discovery. Several good classifiers provide comparable predictive performance but differ in their abilities to summarize the observed data. We extend a Bayesian Rule Learning (BRL-GSS) algorithm, previously shown to be a significantly better predictor than other classical approaches in this domain. It searches a space of Bayesian networks using a decision tree representation of its parameters with global constraints, and infers a set of IF-THEN rules. The number of parameters and therefore the number of rules are combinatorial in the number of predictor variables in the model. We relax these global constraints to a more generalizable local structure (BRL-LSS). BRL-LSS entails a more parsimonious set of rules because it does not have to generate all combinatorial rules. The search space of local structures is much richer than the space of global structures. We design the BRL-LSS with the same worst-case time-complexity as BRL-GSS while exploring a richer and more complex model space. We measure predictive performance using Area Under the ROC curve (AUC) and Accuracy. We measure model parsimony performance by noting the average number of rules and variables needed to describe the observed data. We evaluate the predictive and parsimony performance of BRL-GSS, BRL-LSS and the state-of-the-art C4.5 decision tree algorithm, across 10-fold cross-validation using ten microarray gene-expression diagnostic datasets. In these experiments, we observe that BRL-LSS is similar to BRL-GSS in terms of predictive performance, while generating a much more parsimonious set of rules to explain the same observed data. BRL-LSS also needs fewer variables than C4.5 to explain the data with similar predictive performance. We also conduct a feasibility study to demonstrate the general applicability of our BRL methods on the newer RNA sequencing gene-expression data.

  16. Evolution of gremlin 2 in cetartiodactyl mammals: gene loss coincides with lack of upper jaw incisors in ruminants.

    PubMed

    Opazo, Juan C; Zavala, Kattina; Krall, Paola; Arias, Rodrigo A

    2017-01-01

    Understanding the processes that give rise to genomic variability in extant species is an active area of research within evolutionary biology. With the availability of whole genome sequences, it is possible to quantify different forms of variability such as variation in gene copy number, which has been described as an important source of genetic variability and, in consequence, of phenotypic variability. Most of the research on this topic has been focused on understanding the biological significance of gene duplication, and less attention has been given to the evolutionary role of gene loss. Gremlin 2 is a member of the DAN gene family and plays a significant role in tooth development by blocking the ligand-signaling pathway of BMP2 and BMP4. The goal of this study was to investigate the evolutionary history of gremlin 2 in cetartiodactyl mammals, a group that possesses highly divergent tooth morphology. Results from our analyses indicate that gremlin 2 has experienced a mixture of gene loss, gene duplication, and rate acceleration. Although the last common ancestor of cetartiodactyls possessed a single gene copy, pigs and camels are the only cetartiodactyl groups that have retained gremlin 2. According to the phyletic distribution of this gene and synteny analyses, we propose that gremlin 2 was lost in the common ancestor of ruminants and cetaceans between 56.3 and 63.5 million years ago as a product of a chromosomal rearrangement. Our analyses also indicate that the rate of evolution of gremlin 2 has been accelerated in the two groups that have retained this gene. Additionally, the lack of this gene could explain the high diversity of teeth among cetartiodactyl mammals; specifically, the presence of this gene could act as a biological constraint. Thus, our results support the notions that gene loss is a way to increase phenotypic diversity and that gremlin 2 is a dispensable gene, at least in cetartiodactyl mammals.

  17. Novel harmonic regularization approach for variable selection in Cox's proportional hazards model.

    PubMed

    Chu, Ge-Jin; Liang, Yong; Wang, Jia-Xuan

    2014-01-01

    Variable selection is an important issue in regression and a number of variable selection methods have been proposed involving nonconvex penalty functions. In this paper, we investigate a novel harmonic regularization method, which can approximate nonconvex Lq (1/2 < q < 1) regularizations, to select key risk factors in Cox's proportional hazards model using microarray gene expression data. The harmonic regularization method can be efficiently solved using our proposed direct path seeking approach, which can produce solutions that closely approximate those for the convex loss function and the nonconvex regularization. Simulation results based on artificial datasets and four real microarray gene expression datasets, including the diffuse large B-cell lymphoma (DLBCL), lung cancer, and AML datasets, show that the harmonic regularization method can be more accurate for variable selection than existing Lasso series methods.
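
    For orientation, an Lq-penalized Cox estimator of the kind the harmonic method is said to approximate can be written in the following generic form. The notation is introduced here for illustration and is not taken from the paper: x_i is the covariate vector of subject i, delta_i is the event indicator, R_i is the risk set at the event time of subject i (ties ignored), and lambda is the regularization strength. The harmonic penalty itself is not reproduced.

```latex
\hat{\beta} = \arg\min_{\beta} \left\{ -\frac{1}{n} \sum_{i=1}^{n} \delta_i \left[ x_i^{\top}\beta - \log \sum_{j \in R_i} \exp\left( x_j^{\top}\beta \right) \right] + \lambda \sum_{k=1}^{p} \lvert \beta_k \rvert^{q} \right\}, \qquad \tfrac{1}{2} < q < 1
```

    The first term is the negative log partial likelihood, which is convex in beta, and the second is the nonconvex penalty; setting q = 1 would recover the Lasso penalty used by the comparison methods.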

  18. Gene Expression Signatures Based on Variability can Robustly Predict Tumor Progression and Prognosis

    PubMed Central

    Dinalankara, Wikum; Bravo, Héctor Corrada

    2015-01-01

    Gene expression signatures are commonly used to create cancer prognosis and diagnosis methods, yet only a small number of them are successfully deployed in the clinic since many fail to replicate performance on subsequent validation. A primary reason for this lack of reproducibility is the fact that these signatures attempt to model the highly variable and unstable genomic behavior of cancer. Our group recently introduced gene expression anti-profiles as a robust methodology to derive gene expression signatures based on the observation that while gene expression measurements are highly heterogeneous across tumors of a specific cancer type relative to the normal tissue, their degree of deviation from normal tissue expression in specific genes involved in tissue differentiation is a stable tumor mark that is reproducible across experiments and cancer types. Here we show that constructing gene expression signatures based on variability and the anti-profile approach yields classifiers capable of successfully distinguishing benign growths from cancerous growths based on deviation from normal expression. We then show that this same approach generates stable and reproducible signatures that predict probability of relapse and survival based on tumor gene expression. These results suggest that using the anti-profile framework for the discovery of genomic signatures is an avenue leading to the development of reproducible signatures suitable for adoption in clinical settings. PMID:26078586

  19. Assessment of copy number variations in 120 patients with Poland syndrome.

    PubMed

    Vaccari, Carlotta Maria; Tassano, Elisa; Torre, Michele; Gimelli, Stefania; Divizia, Maria Teresa; Romanini, Maria Victoria; Bossi, Simone; Musante, Ilaria; Valle, Maura; Senes, Filippo; Catena, Nunzio; Bedeschi, Maria Francesca; Baban, Anwar; Calevo, Maria Grazia; Acquaviva, Massimo; Lerone, Margherita; Ravazzolo, Roberto; Puliti, Aldamaria

    2016-11-25

    Poland Syndrome (PS) is a rare congenital disorder presenting with agenesis/hypoplasia of the pectoralis major muscle variably associated with thoracic and/or upper limb anomalies. Most cases are sporadic, but familial recurrence, with different inheritance patterns, has been observed. The genetic etiology of PS remains unknown. Karyotyping and array-comparative genomic hybridization (CGH) analyses can identify genomic imbalances that can clarify the genetic etiology of congenital and neurodevelopmental disorders. We previously reported a chromosome 11 deletion in twin girls with pectoralis muscle hypoplasia and skeletal anomalies, and a chromosome 6 deletion in a patient presenting a complex phenotype that included pectoralis muscle hypoplasia. However, the contribution of genomic imbalances to PS remains largely unknown. To investigate the prevalence of chromosomal imbalances in PS, standard cytogenetic and array-CGH analyses were performed in 120 PS patients. Following the application of stringent filter criteria, 14 rare copy number variations (CNVs) were identified in 14 PS patients in different regions outside known common copy number variations: seven genomic duplications and seven genomic deletions, including the two previously reported PS-associated chromosomal deletions. These CNVs ranged from 0.04 to 4.71 Mb in size. Bioinformatic analysis of array-CGH data indicated gene enrichment in pathways involved in cell-cell adhesion, DNA binding and apoptosis processes. The analysis also provided a number of candidate genes possibly causing the developmental defects observed in PS patients, among others REV3L, a gene coding for an error-prone DNA polymerase previously associated with Möbius Syndrome with variable phenotypes including pectoralis muscle agenesis. A number of rare CNVs were identified in PS patients, and these involve genes that represent candidates for further evaluation. Rare inherited CNVs may contribute to, or represent risk factors for, PS in a multifactorial mode of inheritance.

  20. The Cohesive Population Genetics of Molecular Drive

    PubMed Central

    Ohta, Tomoko; Dover, Gabriel A.

    1984-01-01

    The long-term population genetics of multigene families is influenced by several biased and unbiased mechanisms of nonreciprocal exchanges (gene conversion, unequal exchanges, transposition) between member genes, often distributed on several chromosomes. These mechanisms cause fluctuations in the copy number of variant genes in an individual and lead to a gradual replacement of an original family of n genes (A) in N number of individuals by a variant gene (a). The process for spreading a variant gene through a family and through a population is called molecular drive. Consideration of the known slow rates of nonreciprocal exchanges predicts that the population variance in the copy number of gene a per individual is small at any given generation during molecular drive. Genotypes at a given generation are expected only to range over a small section of all possible genotypes from one extreme (n number of A) to the other (n number of a). A theory is developed for estimating the size of the population variance by using the concept of identity coefficients. In particular, the variance in the course of spreading of a single mutant gene of a multigene family was investigated in detail, and the theory of identity coefficients at the state of steady decay of genetic variability proved to be useful. Monte Carlo simulations and numerical analysis based on realistic rates of exchange in families of known size reveal the correctness of the theoretical prediction and also assess the effect of bias in turnover. The population dynamics of molecular drive in gradually increasing the mean copy number of a variant gene without the generation of a large variance (population cohesion) is of significance regarding potential interactions between natural selection and molecular drive. PMID:6500260

  1. The cohesive population genetics of molecular drive.

    PubMed

    Ohta, T; Dover, G A

    1984-10-01

    The long-term population genetics of multigene families is influenced by several biased and unbiased mechanisms of nonreciprocal exchanges (gene conversion, unequal exchanges, transposition) between member genes, often distributed on several chromosomes. These mechanisms cause fluctuations in the copy number of variant genes in an individual and lead to a gradual replacement of an original family of n genes (A) in N number of individuals by a variant gene (a). The process for spreading a variant gene through a family and through a population is called molecular drive. Consideration of the known slow rates of nonreciprocal exchanges predicts that the population variance in the copy number of gene a per individual is small at any given generation during molecular drive. Genotypes at a given generation are expected only to range over a small section of all possible genotypes from one extreme (n number of A) to the other (n number of a). A theory is developed for estimating the size of the population variance by using the concept of identity coefficients. In particular, the variance in the course of spreading of a single mutant gene of a multigene family was investigated in detail, and the theory of identity coefficients at the state of steady decay of genetic variability proved to be useful. Monte Carlo simulations and numerical analysis based on realistic rates of exchange in families of known size reveal the correctness of the theoretical prediction and also assess the effect of bias in turnover. The population dynamics of molecular drive in gradually increasing the mean copy number of a variant gene without the generation of a large variance (population cohesion) is of significance regarding potential interactions between natural selection and molecular drive.

  2. Skeletal muscle repair in a mouse model of nemaline myopathy

    PubMed Central

    Sanoudou, Despina; Corbett, Mark A.; Han, Mei; Ghoddusi, Majid; Nguyen, Mai-Anh T.; Vlahovich, Nicole; Hardeman, Edna C.; Beggs, Alan H.

    2012-01-01

    Nemaline myopathy (NM), the most common non-dystrophic congenital myopathy, is a variably severe neuromuscular disorder for which no effective treatment is available. Although a number of genes have been identified in which mutations can cause NM, the pathogenetic mechanisms leading to the phenotypes are poorly understood. To address this question, we examined gene expression patterns in an NM mouse model carrying the human Met9Arg mutation of alpha-tropomyosin slow (Tpm3). We assessed five different skeletal muscles from affected mice, which are representative of muscles with differing fiber-type compositions, different physiological specializations and variable degrees of pathology. Although these same muscles in non-affected mice showed marked variation in patterns of gene expression, with diaphragm being the most dissimilar, the presence of the mutant protein in nemaline muscles resulted in a more similar pattern of gene expression among the muscles. This result suggests a common process or mechanism operating in nemaline muscles independent of the variable degrees of pathology. Transcriptional and protein expression data indicate the presence of a repair process and possibly delayed maturation in nemaline muscles. Markers indicative of satellite cell number, activated satellite cells and immature fibers including M-Cadherin, MyoD, desmin, Pax7 and Myf6 were elevated by western-blot analysis or immunohistochemistry. Evidence suggesting elevated focal repair was observed in nemaline muscle in electron micrographs. This analysis reveals that NM is characterized by a novel repair feature operating in multiple different muscles. PMID:16877500

  3. Skeletal muscle repair in a mouse model of nemaline myopathy.

    PubMed

    Sanoudou, Despina; Corbett, Mark A; Han, Mei; Ghoddusi, Majid; Nguyen, Mai-Anh T; Vlahovich, Nicole; Hardeman, Edna C; Beggs, Alan H

    2006-09-01

    Nemaline myopathy (NM), the most common non-dystrophic congenital myopathy, is a variably severe neuromuscular disorder for which no effective treatment is available. Although a number of genes have been identified in which mutations can cause NM, the pathogenetic mechanisms leading to the phenotypes are poorly understood. To address this question, we examined gene expression patterns in an NM mouse model carrying the human Met9Arg mutation of alpha-tropomyosin slow (Tpm3). We assessed five different skeletal muscles from affected mice, which are representative of muscles with differing fiber-type compositions, different physiological specializations and variable degrees of pathology. Although these same muscles in non-affected mice showed marked variation in patterns of gene expression, with diaphragm being the most dissimilar, the presence of the mutant protein in nemaline muscles resulted in a more similar pattern of gene expression among the muscles. This result suggests a common process or mechanism operating in nemaline muscles independent of the variable degrees of pathology. Transcriptional and protein expression data indicate the presence of a repair process and possibly delayed maturation in nemaline muscles. Markers indicative of satellite cell number, activated satellite cells and immature fibers including M-Cadherin, MyoD, desmin, Pax7 and Myf6 were elevated by western-blot analysis or immunohistochemistry. Evidence suggesting elevated focal repair was observed in nemaline muscle in electron micrographs. This analysis reveals that NM is characterized by a novel repair feature operating in multiple different muscles.

  4. High intraspecific genome diversity in the model arbuscular mycorrhizal symbiont Rhizophagus irregularis.

    PubMed

    Chen, Eric C H; Morin, Emmanuelle; Beaudet, Denis; Noel, Jessica; Yildirir, Gokalp; Ndikumana, Steve; Charron, Philippe; St-Onge, Camille; Giorgi, John; Krüger, Manuela; Marton, Timea; Ropars, Jeanne; Grigoriev, Igor V; Hainaut, Matthieu; Henrissat, Bernard; Roux, Christophe; Martin, Francis; Corradi, Nicolas

    2018-01-22

    Arbuscular mycorrhizal fungi (AMF) are known to improve plant fitness through the establishment of mycorrhizal symbioses. Genetic and phenotypic variations among closely related AMF isolates can significantly affect plant growth, but the genomic changes underlying this variability are unclear. To address this issue, we improved the genome assembly and gene annotation of the model strain Rhizophagus irregularis DAOM197198, and compared its gene content with five isolates of R. irregularis sampled in the same field. All isolates harbor striking genome variations, with large numbers of isolate-specific genes, gene family expansions, and evidence of interisolate genetic exchange. The observed variability affects all gene ontology terms and PFAM protein domains, as well as putative mycorrhiza-induced small secreted effector-like proteins and other symbiosis differentially expressed genes. High variability is also found in active transposable elements. Overall, these findings indicate a substantial divergence in the functioning capacity of isolates harvested from the same field, and thus their genetic potential for adaptation to biotic and abiotic changes. Our data also provide a first glimpse into the genome diversity that resides within natural populations of these symbionts, and open avenues for future analyses of plant-AMF interactions that link AMF genome variation with plant phenotype and fitness. © 2018 The Authors. New Phytologist © 2018 New Phytologist Trust.

  5. MHC class I and MHC class II DRB gene variability in wild and captive Bengal tigers (Panthera tigris tigris).

    PubMed

    Pokorny, Ina; Sharma, Reeta; Goyal, Surendra Prakash; Mishra, Sudanshu; Tiedemann, Ralph

    2010-10-01

    Bengal tigers are highly endangered and knowledge on adaptive genetic variation can be essential for efficient conservation and management. Here we present the first assessment of allelic variation in major histocompatibility complex (MHC) class I and MHC class II DRB genes for wild and captive tigers from India. We amplified, cloned, and sequenced alpha-1 and alpha-2 domain of MHC class I and beta-1 domain of MHC class II DRB genes in 16 tiger specimens of different geographic origin. We detected high variability in peptide-binding sites, presumably resulting from positive selection. Tigers exhibit a low number of MHC DRB alleles, similar to other endangered big cats. Our initial assessment, admittedly with limited geographic coverage and sample size, did not reveal significant differences between captive and wild tigers with regard to MHC variability. In addition, we successfully amplified MHC DRB alleles from scat samples. Our characterization of tiger MHC alleles forms a basis for further in-depth analyses of MHC variability in this illustrative threatened mammal.

  6. Rare Cell Detection by Single-Cell RNA Sequencing as Guided by Single-Molecule RNA FISH.

    PubMed

    Torre, Eduardo; Dueck, Hannah; Shaffer, Sydney; Gospocic, Janko; Gupte, Rohit; Bonasio, Roberto; Kim, Junhyong; Murray, John; Raj, Arjun

    2018-02-28

    Although single-cell RNA sequencing can reliably detect large-scale transcriptional programs, it is unclear whether it accurately captures the behavior of individual genes, especially those that express only in rare cells. Here, we use single-molecule RNA fluorescence in situ hybridization as a gold standard to assess trade-offs in single-cell RNA-sequencing data for detecting rare cell expression variability. We quantified the gene expression distribution for 26 genes that range from ubiquitous to rarely expressed and found that the correspondence between estimates across platforms improved with both transcriptome coverage and increased number of cells analyzed. Further, by characterizing the trade-off between transcriptome coverage and number of cells analyzed, we show that when the number of genes required to answer a given biological question is small, then greater transcriptome coverage is more important than analyzing large numbers of cells. More generally, our report provides guidelines for selecting quality thresholds for single-cell RNA-sequencing experiments aimed at rare cell analyses. Copyright © 2018 Elsevier Inc. All rights reserved.

  7. Variation of gene expression in Bacillus subtilis samples of fermentation replicates.

    PubMed

    Zhou, Ying; Yu, Wen-Bang; Ye, Bang-Ce

    2011-06-01

    The application of comprehensive gene expression profiling technologies to compare wild and mutated microorganism samples or to assess molecular differences between various treatments has been widely used. However, little is known about the normal variation of gene expression in microorganisms. In this study, an Agilent customized microarray representing 4,106 genes was used to quantify transcript levels of five-repeated flasks to assess normal variation in Bacillus subtilis gene expression. CV analysis and analysis of variance were employed to investigate the normal variance of genes and the components of variance, respectively. The results showed that above 80% of the total variation was caused by biological variance. For the 12 replicates, 451 of 4,106 genes exhibited variance with CV values over 10%. The functional category enrichment analysis demonstrated that these variable genes were mainly involved in cell type differentiation, cell type localization, cell cycle and DNA processing, and spore or cyst coat. Using power analysis, the minimal biological replicate number for a B. subtilis microarray experiment was determined to be six. The results contribute to the definition of the baseline level of variability in B. subtilis gene expression and emphasize the importance of replicate microarray experiments.
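    The CV screen described in this abstract can be illustrated with a minimal sketch. It assumes the CV is the sample standard deviation divided by the mean, expressed as a percentage; the gene names, expression values, and the helper names `coefficient_of_variation` and `variable_genes` are hypothetical and are not taken from the study.

```python
from statistics import mean, stdev


def coefficient_of_variation(values):
    """Return the sample CV (standard deviation / mean) as a percentage."""
    m = mean(values)
    if m == 0:
        raise ValueError("CV is undefined when the mean is zero")
    return 100.0 * stdev(values) / m


def variable_genes(expression, threshold_pct=10.0):
    """Return the sorted names of genes whose CV across replicates exceeds the threshold."""
    return sorted(
        gene
        for gene, values in expression.items()
        if coefficient_of_variation(values) > threshold_pct
    )


# Hypothetical transcript levels for two genes across five replicate flasks.
expression = {
    "geneA": [100.0, 102.0, 98.0, 101.0, 99.0],
    "geneB": [50.0, 80.0, 65.0, 40.0, 90.0],
}

if __name__ == "__main__":
    for gene, values in expression.items():
        print(gene, round(coefficient_of_variation(values), 2))
    print("variable genes:", variable_genes(expression))
```

    The 10% default mirrors the cut-off quoted in the abstract; whether the authors computed the CV on raw or log-transformed intensities is not stated, so the sketch works on the values as given.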

  8. Constitutional trisomy 8 and Behçet syndrome.

    PubMed

    Becker, Kristin; Fitzgerald, Oliver; Green, Andrew J; Keogan, Mary; Newbury-Ecob, Ruth; Greenhalgh, Lynn; Withers, Stephen; Hollox, Edward J; Aldred, Patricia M R; Armour, John A L

    2009-05-01

    The characteristic clinical features of constitutional trisomy 8 include varying degrees of developmental delay, joint contractures and deep palmar and plantar creases. There is an established literature, which describes features of Behçet syndrome occurring in phenotypically normal individuals with myelodysplastic syndromes and trisomy 8 in their bone marrow. In this article, we describe four patients with constitutional trisomy 8, all with varying clinical phenotypes, who developed features of Behçet, in particular but not exclusively mucocutaneous ulceration. In addition, we examined gene copy numbers of the variable-number neutrophil defensin genes DEFA1A3 in one of the cases (case 1) and her parents, together with 14 cases of Behçet syndrome in comparison with 121 normal controls. The gene copy number was highest in case 1 (copy number 14) and was also increased in her parents (both copy number 9). However the mean copy number for DEFA1A3 among the 14 Behçet syndrome patients was actually lower (5.1) than among the controls (mean of 6.8 copies). Thus, we conclude that patients with constitutional trisomy 8 and those with trisomy 8 confined to the bone marrow are both at increased risk of developing features of Behçet syndrome. The mechanism may relate to increased chromosome 8 gene dosage with further analysis of candidate genes on chromosome 8 required.

  9. Distinct Trajectories of Massive Recent Gene Gains and Losses in Populations of a Microbial Eukaryotic Pathogen

    PubMed Central

    Hartmann, Fanny E.; Croll, Daniel

    2017-01-01

    Differences in gene content are a significant source of variability within species and have an impact on phenotypic traits. However, little is known about the mechanisms responsible for the most recent gene gains and losses. We screened the genomes of 123 worldwide isolates of the major pathogen of wheat Zymoseptoria tritici for robust evidence of gene copy number variation. Based on orthology relationships in three closely related fungi, we identified 599 gene gains and 1,024 gene losses that have not yet reached fixation within the focal species. Our analyses of gene gains and losses segregating in populations showed that gene copy number variation arose preferentially in subtelomeres and in proximity to transposable elements. Recently lost genes were enriched in virulence factors and secondary metabolite gene clusters. In contrast, recently gained genes encoded mostly secreted proteins lacking a conserved domain. We analyzed the frequency spectrum at loci segregating a gene presence–absence polymorphism in four worldwide populations. Recent gene losses showed a significant excess in low-frequency variants compared with genome-wide single nucleotide polymorphism, which is indicative of strong negative selection against gene losses. Recent gene gains were either under weak negative selection or neutral. We found evidence for strong divergent selection among populations at individual loci segregating a gene presence–absence polymorphism. Hence, gene gains and losses likely contributed to local adaptation. Our study shows that microbial eukaryotes harbor extensive copy number variation within populations and that functional differences among recently gained and lost genes led to distinct evolutionary trajectories. PMID:28981698

  10. Quantitative structure-activity relationships studies of CCR5 inhibitors and toxicity of aromatic compounds using gene expression programming.

    PubMed

    Shi, Weimin; Zhang, Xiaoya; Shen, Qi

    2010-01-01

    Quantitative structure-activity relationship (QSAR) study of chemokine receptor 5 (CCR5) binding affinity of substituted 1-(3,3-diphenylpropyl)-piperidinyl amides and ureas and toxicity of aromatic compounds have been performed. The gene expression programming (GEP) was used to select variables and produce nonlinear QSAR models simultaneously using the selected variables. In our GEP implementation, a simple and convenient method was proposed to infer the K-expression from the number of arguments of the function in a gene, without building the expression tree. The results were compared to those obtained by artificial neural network (ANN) and support vector machine (SVM). It has been demonstrated that the GEP is a useful tool for QSAR modeling. Copyright 2009 Elsevier Masson SAS. All rights reserved.
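    One common way to read off the coding region (K-expression) of a GEP gene from function arities alone, without building the expression tree, is sketched below. This is an illustration of the general idea and not necessarily the authors' exact procedure; the function set in `ARITY` and the example genes are hypothetical.

```python
# Hypothetical function set: symbol -> number of arguments.
# Any symbol not listed is treated as a terminal (arity 0).
ARITY = {"+": 2, "-": 2, "*": 2, "/": 2, "Q": 1}


def k_expression(gene, arity=ARITY):
    """Return the leading part of `gene` that forms a complete expression.

    The gene is read left to right (level order). `needed` counts how many
    symbols are still required: it starts at 1 for the root, and each symbol
    fills one open slot while opening as many new slots as its arity.
    """
    needed = 1
    for i, symbol in enumerate(gene):
        needed += arity.get(symbol, 0) - 1
        if needed == 0:
            return gene[: i + 1]
    raise ValueError("gene does not contain a complete expression")


if __name__ == "__main__":
    for gene in ("+*abcdaab", "Q*+abcdab", "abcab"):
        print(gene, "->", k_expression(gene))
```

    Because the scan stops as soon as every open argument slot is filled, the cost is linear in the gene length, and the symbols after the returned prefix are the non-coding remainder of the gene.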

  11. [Family-based association study of a variable number of tandem repeat polymorphism of DAT1 gene with Tourette syndrome in a Chinese Han population].

    PubMed

    Zheng, Lanlan; Han, Zhen-liang; Zhang, Xin-hua; Wang, Xue-qin; Jiang, Wei-hua; Yi, Ming-ji; Liu, Shi-guo

    2013-10-01

    To assess the association of a 40 bp variable number of tandem repeat (VNTR) polymorphism within the 3' untranslated region of the dopamine transporter gene (DAT1) with Tourette syndrome (TS) in a Chinese Han population. A total of 160 TS patients and their parents were recruited. The VNTR polymorphism was detected with polymerase chain reaction-VNTR analysis, and its association with TS and its subtypes was assessed through a family-based association study comprising transmission disequilibrium test (TDT) and haplotype relative risk (HRR) analysis. The repeat numbers at the DAT1 40 bp locus were 11, 10, 9, 7.5 and 7 among the patients and their parents, with the most common type being a 10-repeat allele. No significant association was detected between the polymorphism and TS (TDT: χ² = 0.472, df = 1, P = 0.583; HRR: χ² = 0.313, P = 0.576, OR = 0.855, 95%CI: 0.493-1.481). Our data suggested that the VNTR polymorphism of the DAT1 gene is not associated with susceptibility to TS in the Chinese Han population. However, our results are to be validated in larger sets of patients collected from other populations.
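    For readers unfamiliar with the TDT, the standard biallelic form of the statistic can be sketched as follows. The sketch assumes the alleles are collapsed into two classes (for example, the 10-repeat allele versus all others), which the abstract does not state; the transmission counts are hypothetical and the function name `tdt` is illustrative.

```python
import math


def tdt(transmitted, untransmitted):
    """Biallelic transmission disequilibrium test.

    `transmitted` (b) and `untransmitted` (c) are the numbers of times
    heterozygous parents did or did not pass the test allele to an affected
    child. Returns (chi-square with 1 df, two-sided p-value).
    """
    b, c = transmitted, untransmitted
    if b + c == 0:
        raise ValueError("no informative (heterozygous) parents")
    chi2 = (b - c) ** 2 / (b + c)
    # Survival function of the chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


if __name__ == "__main__":
    # Hypothetical counts from heterozygous parents, not data from the study.
    chi2, p = tdt(transmitted=48, untransmitted=52)
    print(f"chi2 = {chi2:.3f}, p = {p:.3f}")
```

    Only heterozygous parents are informative, which is why the statistic depends on `b` and `c` alone; under no linkage or association the two counts are expected to be equal.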

  12. Novel Harmonic Regularization Approach for Variable Selection in Cox's Proportional Hazards Model

    PubMed Central

    Chu, Ge-Jin; Liang, Yong; Wang, Jia-Xuan

    2014-01-01

    Variable selection is an important issue in regression and a number of variable selection methods have been proposed involving nonconvex penalty functions. In this paper, we investigate a novel harmonic regularization method, which can approximate nonconvex Lq (1/2 < q < 1) regularizations, to select key risk factors in the Cox's proportional hazards model using microarray gene expression data. The harmonic regularization method can be efficiently solved using our proposed direct path seeking approach, which can produce solutions that closely approximate those for the convex loss function and the nonconvex regularization. Simulation results based on the artificial datasets and four real microarray gene expression datasets, such as the diffuse large B-cell lymphoma (DLBCL), lung cancer, and AML datasets, show that the harmonic regularization method can be more accurate for variable selection than existing Lasso series methods. PMID:25506389

  13. Molecular basis of length polymorphism in the human zeta-globin gene complex.

    PubMed Central

    Goodbourn, S E; Higgs, D R; Clegg, J B; Weatherall, D J

    1983-01-01

    The length polymorphism between the human zeta-globin gene and its pseudogene is caused by an allele-specific variation in the copy number of a tandemly repeating 36-base-pair sequence. This sequence is related to a tandemly repeated 14-base-pair sequence in the 5' flanking region of the human insulin gene, which is known to cause length polymorphism, and to a repetitive sequence in intervening sequence (IVS) 1 of the pseudo-zeta-globin gene. Evidence is presented that the latter is also of variable length, probably because of differences in the copy number of the tandem repeat. The homology between the three length polymorphisms may be an indication of the presence of a more widespread group of related sequences in the human genome, which might be useful for generalized linkage studies. PMID:6308667

  14. A genetic variant of NLRP1 gene is associated with asbestos body burden in patients with malignant pleural mesothelioma.

    PubMed

    Crovella, S; Moura, R R; Cappellani, S; Celsi, F; Trevisan, E; Schneider, M; Brollo, A; Nicastro, E M; Vita, F; Finotto, L; Zabucchi, G; Borelli, V

    2018-01-01

    The presence of asbestos bodies (ABs) in lung parenchyma is considered a histopathologic hallmark of past exposure to asbestos fibers, of which there was a population of longer fibers. The mechanisms underlying AB formation are complex, involving inflammatory responses and iron (Fe) metabolism. Thus, the responsiveness to AB formation is variable, with some individuals appearing to be poor AB formers. The aim of this study was to disclose the possible role of genetic variants of genes encoding inflammasome and iron metabolism proteins in the ability to form ABs in a population of 81 individuals from North East Italy, who died after having developed malignant pleural mesothelioma (MPM). This study included 86 genetic variants distributed in 10 genes involved in Fe metabolism and 7 genetic variants in two genes encoding for inflammasome molecules. Genotypes/haplotypes were compared according to the number of lung ABs. Data showed that the NLRP1 rs12150220 missense variant (H155L) was significantly correlated with numbers of ABs in MPM patients. Specifically, a low number of ABs was detected in individuals carrying the NLRP1 rs12150220 A/T genotype. Our findings suggest that the NLRP1 inflammasome might contribute to the development of lung ABs. It is postulated that the NLRP1 missense variant may be considered as one of the possible host genetic factors contributing to individual variability in coating efficiency, which needs to be taken into account when assessing occupational exposure to asbestos.

  15. Maternal age and ovarian stimulation independently affect oocyte mtDNA copy number and cumulus cell gene expression in bovine clones.

    PubMed

    Cree, Lynsey M; Hammond, Elizabeth R; Shelling, Andrew N; Berg, Martin C; Peek, John C; Green, Mark P

    2015-06-01

    Do maternal ageing and ovarian stimulation alter mitochondrial DNA (mtDNA) copy number and gene expression of oocytes and cumulus cells from a novel bovine model for human IVF? Oocytes collected from females with identical nuclear genetics show decreased mtDNA copy number and increased expression of an endoplasmic reticulum (ER) stress gene with respect to ovarian stimulation, whilst differences in the expression of genes involved in mitochondrial function, antioxidant protection and apoptosis were evident in relation to maternal ageing and the degree of ovarian stimulation in cumulus cells. Oocyte quality declines with advancing maternal age; however, the underlying mechanism, as well as the effects of ovarian stimulation, are poorly understood. Human studies investigating these effects are often limited by differences in age and ovarian stimulation regimens within a patient cohort, as well as genetic and environmental variability. A novel bovine cross-sectional maternal age model for human IVF was undertaken. Follicles were aspirated from young (3 years of age; n = 7 females) and old (10 years of age; n = 5 females) Holstein Friesian clones following multiple unstimulated, mild and standard ovarian stimulation cycles. These bovine cloned females were generated by the process of somatic cell nuclear transfer (SCNT) from the same founder and represent a homogeneous population with reduced genetic and environmental variability. Maternal age and ovarian stimulation effects were investigated in relation to mtDNA copy number, and the expression of 19 genes involved in mitochondrial function, antioxidant protection, oocyte-cumulus cell signalling and follicle development in both oocytes and cumulus cells. Young (3 years of age; n = 7 females) and old (10 years of age; n = 5 females) Holstein Friesian bovine clones were maintained as one herd. Stimulation cycles were based on the long GnRH agonist down-regulation regimen used in human fertility clinics. 
Follicle growth rates, numbers and diameters were monitored by ultrasonography and aspirated when the lead follicles were >14 mm in diameter. Follicle characteristics were analysed using a mixed model procedure. Quantitative PCR (qPCR) was used to determine mtDNA copy number and reverse transcriptase-qPCR (RT-qPCR) was used to measure gene expression in oocytes and cumulus cells. Method of ovarian stimulation (P = 0.04), but not maternal age (P > 0.1), was associated with a lower mtDNA copy number in oocytes. Neither factor affected mtDNA copy number in cumulus cells. In oocytes, maternal age had no effect on gene expression; however, ovarian stimulation in older females increased the expression of GRP78 (P = 0.02), a gene involved in ER stress. In cumulus cells, increasing maternal age was associated with the higher expression of genes involved in mitochondrial maintenance (TXN2 P = 0.008 and TFAM P = 0.03), whereas ovarian stimulation decreased the expression of genes involved in mitochondrial oxidative stress and apoptosis (TXN2 P = 0.002, PRDX3 P = 0.03 and BAX P = 0.03). The low number of oocyte and cumulus cell samples collected from the unstimulated cycles limited the analysis. Fertilization and developmental potential of the oocytes was not assessed because these were used for mtDNA and gene expression quantification. Delineation of the independent effects of maternal age and ovarian stimulation regimen on mtDNA copy number and gene expression in oocytes and cumulus cells was enabled by the removal of genetic and environmental variability in this bovine model for human IVF. Therefore, these findings extend upon previous knowledge and provide relevant insights that are applicable for improving human ovarian stimulation regimens. Funding was provided by Fertility Associates and the University of Auckland. J.C.P. is a shareholder of Fertility Associates and M.P.G. received a fellowship from Fertility Associates. 
The other authors of this manuscript declare no conflict of interest that could be perceived as prejudicing the impartiality of the reported research. © The Author 2015. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  16. A comprehensive analysis of Helicobacter pylori plasticity zones reveals that they are integrating conjugative elements with intermediate integration specificity.

    PubMed

    Fischer, Wolfgang; Breithaupt, Ute; Kern, Beate; Smith, Stella I; Spicher, Carolin; Haas, Rainer

    2014-04-27

    The human gastric pathogen Helicobacter pylori is a paradigm for chronic bacterial infections. Its persistence in the stomach mucosa is facilitated by several mechanisms of immune evasion and immune modulation, but also by an unusual genetic variability which might account for the capability to adapt to changing environmental conditions during long-term colonization. This variability is reflected by the fact that almost every infected individual is colonized by a genetically unique strain. Strain-specific genes are dispersed throughout the genome, but clusters of genes organized as genomic islands may also collectively be present or absent. We have comparatively analysed such clusters, which are commonly termed plasticity zones, in a high number of H. pylori strains of varying geographical origin. We show that these regions contain fixed gene sets, rather than being true regions of genome plasticity, but two different types and several subtypes with partly diverging gene content can be distinguished. Their genetic diversity is incongruent with variations in the rest of the genome, suggesting that they are subject to horizontal gene transfer within H. pylori populations. We identified 40 distinct integration sites in 45 genome sequences, with a conserved heptanucleotide motif that seems to be the minimal requirement for integration. The significant number of possible integration sites, together with the requirement for a short conserved integration motif and the high level of gene conservation, indicates that these elements are best described as integrating conjugative elements (ICEs) with an intermediate integration site specificity.

  17. Analysis of individual cells identifies cell-to-cell variability following induction of cellular senescence.

    PubMed

    Wiley, Christopher D; Flynn, James M; Morrissey, Christapher; Lebofsky, Ronald; Shuga, Joe; Dong, Xiao; Unger, Marc A; Vijg, Jan; Melov, Simon; Campisi, Judith

    2017-10-01

    Senescent cells play important roles in both physiological and pathological processes, including cancer and aging. In all cases, however, senescent cells comprise only a small fraction of tissues. Senescent phenotypes have been studied largely in relatively homogeneous populations of cultured cells. In vivo, senescent cells are generally identified by a small number of markers, but whether and how these markers vary among individual cells is unknown. We therefore utilized a combination of single-cell isolation and a nanofluidic PCR platform to determine the contributions of individual cells to the overall gene expression profile of senescent human fibroblast populations. Individual senescent cells were surprisingly heterogeneous in their gene expression signatures. This cell-to-cell variability resulted in a loss of correlation among the expression of several senescence-associated genes. Many genes encoding senescence-associated secretory phenotype (SASP) factors, a major contributor to the effects of senescent cells in vivo, showed marked variability with a subset of highly induced genes accounting for the increases observed at the population level. Inflammatory genes in clustered genomic loci showed a greater correlation with senescence compared to nonclustered loci, suggesting that these genes are coregulated by genomic location. Together, these data offer new insights into how genes are regulated in senescent cells and suggest that single markers are inadequate to identify senescent cells in vivo. © 2017 The Authors. Aging Cell published by the Anatomical Society and John Wiley & Sons Ltd.

  18. Associations of GBP2 gene copy number variations with growth traits and transcriptional expression in Chinese cattle.

    PubMed

    Zhang, Gui-Min; Zheng, Li; He, Hua; Song, Cheng-Chuang; Zhang, Zi-Jing; Cao, Xiu-Kai; Lei, Chu-Zhao; Lan, Xian-Yong; Qi, Xing-Lei; Chen, Hong; Huang, Yong-Zhen

    2018-03-20

    Copy number variations (CNVs) have recently been recognized as another important source of genetic variability, following single nucleotide polymorphisms (SNPs). The guanylate binding protein 2 (GBP2) gene plays an important role in cell proliferation. This study was performed to determine the presence of GBP2 CNV (relative to Angus cattle) in 466 individuals representing six main cattle breeds from China, identify its relationship with growth, and explore the biological effects of gene expression. There were two CNV regions in the GBP2 gene, with three types; the CNV1 loss type (relative to Angus cattle) was more frequent in XN than in other breeds, and the CNV2 loss type (relative to Angus cattle) was more frequent in XN and CDM than in other breeds. Although the GBP2 gene copy number presented no correlation with the transcriptional expression of JX (P > .05), the transcriptional expression in heart was higher than in other tissues, and the copy number in muscles and fat of JX was higher than in other breeds. Statistical analysis revealed that the GBP2 gene CNV1 and CNV2 were significantly associated with growth traits (P < .05). In conclusion, this research established the correlations between CNVs of the GBP2 gene and growth traits in different cattle breeds, and our results suggested that the CNVs in the GBP2 gene may be considered markers for the molecular breeding of Chinese beef cattle. Copyright © 2018. Published by Elsevier B.V.

  19. Factors affecting interactome-based prediction of human genes associated with clinical signs.

    PubMed

    González-Pérez, Sara; Pazos, Florencio; Chagoyen, Mónica

    2017-07-17

    Clinical signs are a fundamental aspect of human pathologies. While disease diagnosis is problematic or impossible in many cases, signs are easier to perceive and categorize. Clinical signs are increasingly used, together with molecular networks, to prioritize detected variants in clinical genomics pipelines, even if the patient is still undiagnosed. Here we analyze the ability of these network-based methods to predict genes that underlie clinical signs from the human interactome. Our analysis reveals that these approaches can locate genes associated with clinical signs with variable performance that depends on the sign and associated disease. We analyzed several clinical and biological factors that explain these variable results, including number of genes involved (mono- vs. oligogenic diseases), mode of inheritance, type of clinical sign and gene product function. Our results indicate that the characteristics of the clinical signs and their related diseases should be considered for interpreting the results of network-prediction methods, such as those aimed at discovering disease-related genes and variants. These results are important due to the increasing use of clinical signs as an alternative to diseases for studying the molecular basis of human pathologies.

  20. The genotypes and methylation of MAO genes as factors behind smoking behavior.

    PubMed

    Tiili, Emmi M; Mitiushkina, Natalia V; Sukhovskaya, Olga A; Imyanitov, Evgeny N; Hirvonen, Ari P

    2017-11-01

    Smoking dependence is the main cause for tobacco-related illnesses. The addiction-causing substance in tobacco, nicotine, acts through the dopamine pathway in the brain, causing several pleasurable experiences through cigarette smoking. Thus, both genetic and epigenetic factors related to dopamine metabolism may play an important role in influencing an individual's smoking behavior. We studied the 1460 C/T variation and the variable number tandem repeat polymorphism in the MAOA gene and A/G variation in intron 13 in the MAOB gene together with four DNA methylation sites in both of these genes in relation to several smoking-related phenotypes in a study population of 1230 Whites of Russian origin. The genotypes studied were found to be associated with smoking status in women; the MAOB G variant allele was more prevalent in female smokers than nonsmokers [odds ratio (OR): 2.16, 95% confidence interval (CI): 1.08-4.33], whereas a reverse relation was observed for the MAOA 1460 T-variant allele (OR: 0.44, 95% CI: 0.21-0.91) and variable number tandem repeat low-activity alleles (OR: 0.49, 95% CI: 0.24-0.98). Moreover, the mean methylation values of the CpG sites studied in the MAOA gene were related to smoking behavior in women. Similarly, several methylation patterns in the MAOB gene were associated with a smoking history, with each CpG site showing a remarkable sex dependence. Smoking behavior seems to be related to the genetic and epigenetic profile of MAO genes, with considerable individual and sex-related differences.
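
    To make the effect sizes quoted above concrete, the sketch below shows how an unadjusted odds ratio and a Wald-type 95% confidence interval are commonly derived from a 2x2 table. The function name and the counts are hypothetical and are not taken from the study; the abstract does not say whether its estimates were adjusted for covariates, so this illustrates the statistic only, not the authors' analysis.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Unadjusted odds ratio with a Wald confidence interval.

    a: carriers among cases       b: carriers among controls
    c: non-carriers among cases   d: non-carriers among controls
    z: normal quantile (1.96 gives an approximate 95% interval)
    """
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) from the four cell counts (Woolf's method).
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts, for illustration only.
or_, lo, hi = odds_ratio_ci(a=20, b=10, c=30, d=30)
print(f"OR = {or_:.2f}, 95% CI: {lo:.2f}-{hi:.2f}")
```

    An interval that excludes 1, as in the three intervals reported in the abstract, is the usual criterion for calling the association significant at the 5% level.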

  1. Potential use of low-copy nuclear genes in DNA barcoding: a comparison with plastid genes in two Hawaiian plant radiations

    PubMed Central

    2013-01-01

    Background: DNA barcoding of land plants has relied traditionally on a small number of markers from the plastid genome. In contrast, low-copy nuclear genes have received little attention as DNA barcodes because of the absence of universal primers for PCR amplification. Results: From pooled-species 454 transcriptome data we identified two variable intron-less nuclear loci for each of two species-rich genera of the Hawaiian flora: Clermontia (Campanulaceae) and Cyrtandra (Gesneriaceae) and compared their utility as DNA barcodes with that of plastid genes. We found that nuclear genes showed an overall greater variability, but also displayed a high level of heterozygosity, intraspecific variation, and retention of ancient alleles. Thus, nuclear genes displayed fewer species-diagnostic haplotypes compared to plastid genes and no interspecies gaps. Conclusions: The apparently greater coalescence times of nuclear genes are likely to limit their utility as barcodes, as only a small proportion of their alleles were fixed and unique to individual species. In both groups, species-diagnostic markers from either genome were scarce on the youngest island; a minimum age of ca. two million years may be needed for a species flock to be barcoded. For young plant groups, nuclear genes may not be a superior alternative to slowly evolving plastid genes. PMID:23394592

  2. Learning style and concept acquisition of community college students in introductory biology

    NASA Astrophysics Data System (ADS)

    Bobick, Sandra Burin

    This study investigated the influence of learning style on concept acquisition within a sample of community college students in a general biology course. There are two subproblems within the larger problem: (1) the influence of demographic variables (age, gender, number of college credits, prior exposure to scientific information) on learning style, and (2) the correlations between prior scientific knowledge, learning style and student understanding of the concept of the gene. The sample included all students enrolled in an introductory general biology course during two consecutive semesters at an urban community college. Initial data was gathered during the first week of the semester, at which time students filled in a short questionnaire (age, gender, number of college credits, prior exposure to science information either through reading/visual sources or a prior biology course). Subjects were then given the Inventory of Learning Processes-Revised (ILP-R), which measures general preferences in five learning styles: Deep Learning, Elaborative Learning, Agentic Learning, Methodical Learning and Literal Memorization. Subjects were then given the Gene Conceptual Knowledge pretest: a 15-question objective section and an essay section. Subjects were exposed to specific concepts during lecture and laboratory exercises. At the last lab, students were given the Genetics Conceptual Knowledge Posttest. Pretest/posttest gains were correlated with demographic variables and learning styles were analyzed for significant correlations. Learning styles, as the independent variable in a simultaneous multiple regression, were significant predictors of results on the gene assessment tests, including pretest, posttest and gain. Of the learning styles, Deep Learning accounted for the greatest positive predictive value of pretest essay and pretest objective results. Literal Memorization was a significant negative predictor for posttest essay, essay gain and objective gain.
    Simultaneous multiple regression indicated that demographic variables were significant positive predictors for Methodical, Deep and Elaborative Learning Styles. Stepwise multiple regression resulted in number of credits, Read Science and gender (female) as significant predictors of learning styles. The findings of this study emphasize the importance of learning styles in conceptual understanding of the gene and the correlation of nonformal exposure to science information with learning style and conceptual understanding.

  3. rDNA Copy Number Variants Are Frequent Passenger Mutations in Saccharomyces cerevisiae Deletion Collections and de Novo Transformants

    PubMed Central

    Kwan, Elizabeth X.; Wang, Xiaobin S.; Amemiya, Haley M.; Brewer, Bonita J.; Raghuraman, M. K.

    2016-01-01

    The Saccharomyces cerevisiae ribosomal DNA (rDNA) locus is known to exhibit greater instability relative to the rest of the genome. However, wild-type cells preferentially maintain a stable number of rDNA copies, suggesting underlying genetic control of the size of this locus. We performed a screen of a subset of the Yeast Knock-Out (YKO) single gene deletion collection to identify genetic regulators of this locus and to determine if rDNA copy number correlates with yeast replicative lifespan. While we found no correlation between replicative lifespan and rDNA size, we identified 64 candidate strains with significant rDNA copy number differences. However, in the process of validating candidate rDNA variants, we observed that independent isolates of our de novo gene deletion strains had unsolicited but significant changes in rDNA copy number. Moreover, we were not able to recapitulate rDNA phenotypes from the YKO yeast deletion collection. Instead, we found that the standard lithium acetate transformation protocol is a significant source of rDNA copy number variation, with lithium acetate exposure being the treatment causing variable rDNA copy number events after transformation. As the effects of variable rDNA copy number are being increasingly reported, our finding that rDNA is affected by lithium acetate exposure suggested that rDNA copy number variants may be influential passenger mutations in standard strain construction in S. cerevisiae. PMID:27449518

  4. rDNA Copy Number Variants Are Frequent Passenger Mutations in Saccharomyces cerevisiae Deletion Collections and de Novo Transformants.

    PubMed

    Kwan, Elizabeth X; Wang, Xiaobin S; Amemiya, Haley M; Brewer, Bonita J; Raghuraman, M K

    2016-09-08

    The Saccharomyces cerevisiae ribosomal DNA (rDNA) locus is known to exhibit greater instability relative to the rest of the genome. However, wild-type cells preferentially maintain a stable number of rDNA copies, suggesting underlying genetic control of the size of this locus. We performed a screen of a subset of the Yeast Knock-Out (YKO) single gene deletion collection to identify genetic regulators of this locus and to determine if rDNA copy number correlates with yeast replicative lifespan. While we found no correlation between replicative lifespan and rDNA size, we identified 64 candidate strains with significant rDNA copy number differences. However, in the process of validating candidate rDNA variants, we observed that independent isolates of our de novo gene deletion strains had unsolicited but significant changes in rDNA copy number. Moreover, we were not able to recapitulate rDNA phenotypes from the YKO yeast deletion collection. Instead, we found that the standard lithium acetate transformation protocol is a significant source of rDNA copy number variation, with lithium acetate exposure being the treatment causing variable rDNA copy number events after transformation. As the effects of variable rDNA copy number are being increasingly reported, our finding that rDNA is affected by lithium acetate exposure suggested that rDNA copy number variants may be influential passenger mutations in standard strain construction in S. cerevisiae. Copyright © 2016 Kwan et al.

  5. Mining TCGA Data Using Boolean Implications

    PubMed Central

    Sinha, Subarna; Tsang, Emily K.; Zeng, Haoyang; Meister, Michela; Dill, David L.

    2014-01-01

    Boolean implications (if-then rules) provide a conceptually simple, uniform and highly scalable way to find associations between pairs of random variables. In this paper, we propose to use Boolean implications to find relationships between variables of different data types (mutation, copy number alteration, DNA methylation and gene expression) from the glioblastoma (GBM) and ovarian serous cystadenoma (OV) data sets from The Cancer Genome Atlas (TCGA). We find hundreds of thousands of Boolean implications from these data sets. A direct comparison of the relationships found by Boolean implications and those found by commonly used methods for mining associations show that existing methods would miss relationships found by Boolean implications. Furthermore, many relationships exposed by Boolean implications reflect important aspects of cancer biology. Examples of our findings include cis relationships between copy number alteration, DNA methylation and expression of genes, a new hierarchy of mutations and recurrent copy number alterations, loss-of-heterozygosity of well-known tumor suppressors, and the hypermethylation phenotype associated with IDH1 mutations in GBM. The Boolean implication results used in the paper can be accessed at http://crookneck.stanford.edu/microarray/TCGANetworks/. PMID:25054200
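
    The if-then rules described above can be sketched in a few lines. The example below assumes each variable has already been thresholded into high/low (True/False) per sample and accepts "A high implies B high" when few A-high samples have B low. The function name, the `max_error` cutoff and the toy data are illustrative assumptions; the paper's own sparsity test is not given in the abstract and may differ.

```python
def implication_holds(a, b, max_error=0.1):
    """Check the Boolean implication 'a high => b high'.

    a, b: equal-length lists of booleans, one entry per sample.
    max_error: largest tolerated fraction of a-high samples with b low
               (a hypothetical cutoff chosen for this sketch).
    """
    n_a_high = sum(1 for x in a if x)
    if n_a_high == 0:
        return False  # no a-high samples, so no evidence for the rule
    # Violations are samples in the (a high, b low) quadrant.
    violations = sum(1 for x, y in zip(a, b) if x and not y)
    return violations / n_a_high <= max_error


# Toy data: a copy number gain (gain) and high expression (expr) per sample.
gain = [True, True, True, False, False, False]
expr = [True, True, True, True, False, False]

print(implication_holds(gain, expr))  # gain => high expression
print(implication_holds(expr, gain))  # the converse need not hold
```

    The asymmetry in the usage example is the point of the approach: an implication can hold in one direction only, which a symmetric measure such as correlation would not report.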

  6. The evolution of highly variable immunity genes across a passerine bird radiation.

    PubMed

    O'Connor, E A; Strandh, M; Hasselquist, D; Nilsson, J-Å; Westerdahl, H

    2016-02-01

    To survive, individuals must be able to recognize and eliminate pathogens. The genes of the major histocompatibility complex (MHC) play an essential role in this process in vertebrates as their diversity affects the repertoire of pathogens that can be recognized by the immune system. Emerging evidence suggests that birds within the parvorder Passerida possess an exceptionally high number of MHC genes. However, this has yet to be directly investigated using a consistent framework, and the question of how this MHC diversity has evolved has not been addressed. We used next-generation sequencing to investigate how MHC class I gene copy number and sequence diversity varies across the Passerida radiation using twelve species chosen to represent the phylogenetic range of this group. Additionally, we performed phylogenetic analyses on this data to identify, for the first time, the evolutionary model that best describes how MHC class I gene diversity has evolved within Passerida. We found evidence of multiple MHC class I genes in every family tested, with an extremely broad range in gene copy number across Passerida. There was a strong phylogenetic signal in MHC gene copy number and diversity, and these traits appear to have evolved through a process of Brownian motion in the species studied, that is following the pattern of genetic drift or fluctuating selection, as opposed to towards a single optimal value or through evolutionary 'bursts'. By characterizing MHC class I gene diversity across Passerida in a systematic framework, this study provides a first step towards understanding this huge variation. © 2016 John Wiley & Sons Ltd.

  7. Analysis of host response to bacterial infection using error model based gene expression microarray experiments

    PubMed Central

    Stekel, Dov J.; Sarti, Donatella; Trevino, Victor; Zhang, Lihong; Salmon, Mike; Buckley, Chris D.; Stevens, Mark; Pallen, Mark J.; Penn, Charles; Falciani, Francesco

    2005-01-01

    A key step in the analysis of microarray data is the selection of genes that are differentially expressed. Ideally, such experiments should be properly replicated in order to infer both technical and biological variability, and the data should be subjected to rigorous hypothesis tests to identify the differentially expressed genes. However, in microarray experiments involving the analysis of very large numbers of biological samples, replication is not always practical. Therefore, there is a need for a method to select differentially expressed genes in a rational way from insufficiently replicated data. In this paper, we describe a simple method that uses bootstrapping to generate an error model from a replicated pilot study that can be used to identify differentially expressed genes in subsequent large-scale studies on the same platform, but in which there may be no replicated arrays. The method builds a stratified error model that includes array-to-array variability, feature-to-feature variability and the dependence of error on signal intensity. We apply this model to the characterization of the host response in a model of bacterial infection of human intestinal epithelial cells. We demonstrate the effectiveness of error model based microarray experiments and propose this as a general strategy for a microarray-based screening of large collections of biological samples. PMID:15800204
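
    A minimal sketch of the idea, assuming the pilot study supplies (mean log intensity, replicate log ratio) pairs: replicate differences are grouped into intensity bins, and a bootstrap estimate of a high quantile of their absolute value serves as the noise threshold for each bin. The names `stratified_error_model` and `is_differential`, the bin width, the quantile and the data are all hypothetical; the published model also covers array-to-array and feature-to-feature variability, which this sketch leaves out.

```python
import random
from collections import defaultdict


def stratified_error_model(pilot, bin_width=2.0, quantile=0.95,
                           n_boot=500, seed=0):
    """Return {intensity_bin: noise_threshold} from replicated pilot data.

    pilot: list of (mean_log_intensity, replicate_log_ratio) pairs.
    """
    rng = random.Random(seed)
    bins = defaultdict(list)
    for intensity, diff in pilot:
        bins[int(intensity // bin_width)].append(abs(diff))

    model = {}
    for b, diffs in bins.items():
        n = len(diffs)
        idx = min(n - 1, int(quantile * n))
        estimates = []
        for _ in range(n_boot):
            # Resample with replacement and record the chosen quantile.
            resample = sorted(rng.choice(diffs) for _ in range(n))
            estimates.append(resample[idx])
        model[b] = sum(estimates) / n_boot  # bootstrap mean of the quantile
    return model


def is_differential(intensity, log_ratio, model, bin_width=2.0):
    """Flag an unreplicated measurement that exceeds its bin's threshold."""
    threshold = model.get(int(intensity // bin_width))
    return threshold is not None and abs(log_ratio) > threshold


# Hypothetical pilot data: noisy at low intensity, tight at high intensity.
pilot = [(1.0, 0.8), (1.5, -0.9), (0.5, 1.0), (1.2, -0.7),
         (9.0, 0.1), (9.5, -0.05), (8.5, 0.08), (9.9, -0.12)]
model = stratified_error_model(pilot)
print(is_differential(9.0, 0.5, model), is_differential(1.0, 0.5, model))
```

    The same log ratio of 0.5 is flagged at high intensity but not at low intensity, which is the dependence of error on signal intensity that the abstract describes.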

  8. Distinct Trajectories of Massive Recent Gene Gains and Losses in Populations of a Microbial Eukaryotic Pathogen.

    PubMed

    Hartmann, Fanny E; Croll, Daniel

    2017-11-01

    Differences in gene content are a significant source of variability within species and have an impact on phenotypic traits. However, little is known about the mechanisms responsible for the most recent gene gains and losses. We screened the genomes of 123 worldwide isolates of the major pathogen of wheat Zymoseptoria tritici for robust evidence of gene copy number variation. Based on orthology relationships in three closely related fungi, we identified 599 gene gains and 1,024 gene losses that have not yet reached fixation within the focal species. Our analyses of gene gains and losses segregating in populations showed that gene copy number variation arose preferentially in subtelomeres and in proximity to transposable elements. Recently lost genes were enriched in virulence factors and secondary metabolite gene clusters. In contrast, recently gained genes encoded mostly secreted proteins lacking a conserved domain. We analyzed the frequency spectrum at loci segregating a gene presence-absence polymorphism in four worldwide populations. Recent gene losses showed a significant excess in low-frequency variants compared with genome-wide single nucleotide polymorphisms, which is indicative of strong negative selection against gene losses. Recent gene gains were either under weak negative selection or neutral. We found evidence for strong divergent selection among populations at individual loci segregating a gene presence-absence polymorphism. Hence, gene gains and losses likely contributed to local adaptation. Our study shows that microbial eukaryotes harbor extensive copy number variation within populations and that functional differences among recently gained and lost genes led to distinct evolutionary trajectories. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  9. VANADIUM EXPOSURE ALTERS SPONTANEOUS BEAT RATE AND GENE EXPRESSION OF CULTURED CARDIAC MYOCYTES

    EPA Science Inventory

    Ambient air pollution particulate matter (PM) exposure is associated with increased morbidity and mortality. Recent toxicological studies report PM-induced changes in a number of cardiac parameters, including heart rate variability, arrhythmias, repolarization, and internal defib...

  10. Quantification of Human Fecal Bifidobacterium Species by Use of Quantitative Real-Time PCR Analysis Targeting the groEL Gene

    PubMed Central

    Junick, Jana

    2012-01-01

    Quantitative real-time PCR assays targeting the groEL gene for the specific enumeration of 12 human fecal Bifidobacterium species were developed. The housekeeping gene groEL (HSP60 in eukaryotes) was used as a discriminative marker for the differentiation of Bifidobacterium adolescentis, B. angulatum, B. animalis, B. bifidum, B. breve, B. catenulatum, B. dentium, B. gallicum, B. longum, B. pseudocatenulatum, B. pseudolongum, and B. thermophilum. The bifidobacterial chromosome contains a single copy of the groEL gene, allowing the determination of the cell number by quantification of the groEL copy number. Real-time PCR assays were validated by comparing fecal samples spiked with known numbers of a given Bifidobacterium species. Independent of the Bifidobacterium species tested, the proportion of groEL copies recovered from fecal samples spiked with 5 to 9 log10 cells/g feces was approximately 50%. The quantification limit was 5 to 6 log10 groEL copies/g feces. The interassay variability was less than 10%, and variability between different DNA extractions was less than 23%. The method developed was applied to fecal samples from healthy adults and full-term breast-fed infants. Bifidobacterial diversity in both adults and infants was low, with mostly ≤3 Bifidobacterium species and B. longum frequently detected. The predominant species in infant and adult fecal samples were B. breve and B. adolescentis, respectively. It was possible to distinguish B. catenulatum and B. pseudocatenulatum. We conclude that the groEL gene is a suitable molecular marker for the specific and accurate quantification of human fecal Bifidobacterium species by real-time PCR. PMID:22307308
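
    The arithmetic behind the single-copy argument can be written out as follows. With one groEL copy per chromosome, copies equal cells, so a measured copy number divided by the recovery fraction estimates the cell count. The function name and the example value are hypothetical, and the abstract does not state whether the authors applied a recovery correction, so that step is an assumption of this sketch.

```python
import math


def cells_per_gram(measured_copies_per_g, copies_per_genome=1, recovery=0.5):
    """Estimate cells per gram of feces from a qPCR copy-number readout.

    copies_per_genome: 1 for groEL in bifidobacteria, per the abstract.
    recovery: fraction of spiked copies recovered (about 50% in the abstract);
              correcting for it is an assumption made for illustration.
    """
    return measured_copies_per_g / (copies_per_genome * recovery)


# Hypothetical readout of 5e7 groEL copies per gram.
cells = cells_per_gram(5e7)
print(f"{cells:.1e} cells/g = {math.log10(cells):.1f} log10 cells/g")
```

    A multi-copy target such as the 16S rRNA gene would need a species-specific `copies_per_genome`, which is one reason a single-copy housekeeping gene simplifies the conversion.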

  11. Comparative genomic analysis of six new-found integrative conjugative elements (ICEs) in Vibrio alginolyticus.

    PubMed

    Luo, Peng; He, Xiangyan; Wang, Yanhong; Liu, Qiuting; Hu, Chaoqun

    2016-05-04

    Vibrio alginolyticus is ubiquitous in marine and estuarine environments. In 2012-2013, SXT/R391-like integrative conjugative elements (ICEs) in environmental V. alginolyticus strains were discovered and found to occur in 8.9 % of 192 V. alginolyticus strains, which suggests that V. alginolyticus may be a natural pool possessing resourceful ICEs. However, complete ICE sequences originating from this bacterium have not been reported, which represents a significant barrier to characterizing the ICEs of this bacterium and exploring their relationships with other ICEs. In the present study, we acquired six ICE sequences from five V. alginolyticus strains and performed a comparative analysis of these ICE genomes. A sequence analysis showed that there were only 14 variable bases dispersed between ICEValE0601 and ICEValHN492. ICEValE0601 and ICEValHN492 were treated as the same ICE. ICEValA056-1, ICEValE0601 and ICEValHN492 integrate into the 5' end of the host's prfC gene, and their Int and Xis share at least 97 % identity with their counterparts from SXT. ICEValE0601 or ICEValHN492 contain 50 of 52 conserved core genes in the SXT/R391 ICEs (not s025 or s026). ICEValA056-2, ICEValHN396 and ICEValHN437 have a different tRNA-ser integration site and a distinct int/xis module; however, the remaining backbone genes are highly similar to their counterparts in SXT/R391 ICEs. DNA sequences inserted into hotspot and variable regions of the ICEs are of various sizes. The variable genes of six ICEs encode a large array of functions to bestow various adaptive abilities upon their hosts, and only ICEValA056-1 contains drug-resistant genes. Many variable genes have orthologous and functionally related genes to those found in SXT/R391 ICEs, such as genes coding for a toxin-antitoxin system, a restriction-modification system, helicases and endonucleases. Six ICEs also contain a large number of unique genes or gene clusters that were not found in other ICEs. 
    Six ICEs harbor more abundant transposase genes compared with other parts of their host genomes. A phylogenetic analysis indicated that transposase genes in these ICEs are highly diverse. ICEValA056-1, ICEValE0601 and ICEValHN492 are typical members of the SXT/R391 family. ICEValA056-2, ICEValHN396 and ICEValHN437 form a new atypical group belonging to the SXT/R391 family. In addition to the many genes found to be present in other ICEs, six ICEs contain a large number of unique genes or gene clusters that were not found in other ICEs. ICEs may serve as a carrier for transposable genetic elements (TEs) and largely facilitate the dissemination of TEs.

  12. Final technical report

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Edward DeLong

    2011-10-07

    Our overarching goals in this project were to: Develop and improve high-throughput sequencing methods and analytical approaches for quantitative analyses of microbial gene expression at the Hawaii Ocean Time Series Station and the Bermuda Atlantic Time Series Station; Conduct field analyses following gene expression patterns in picoplankton microbial communities in general, and Prochlorococcus flow sorted from that community, as they respond to different environmental variables (light, macronutrients, dissolved organic carbon), that are predicted to influence activity, productivity, and carbon cycling; Use the expression analyses of flow sorted Prochlorococcus to identify horizontally transferred genes and gene products, in particular those that are located in genomic islands and likely to confer habitat-specific fitness advantages; Use the microbial community gene expression data that we generate to gain insights, and test hypotheses, about the variability, genomic context, activity and function of as yet uncharacterized gene products, that appear highly expressed in the environment. We achieved the above goals, and even more over the course of the project. This includes a number of novel methodological developments, as well as the standardization of microbial community gene expression analyses in both field surveys, and experimental modalities. The availability of these methods, tools and approaches is changing current practice in microbial community analyses.

  13. Impact of strong selection for the PrP major gene on genetic variability of four French sheep breeds (Open Access publication)

    PubMed Central

    Palhiere, Isabelle; Brochard, Mickaël; Moazami-Goudarzi, Katayoun; Laloë, Denis; Amigues, Yves; Bed'hom, Bertrand; Neuts, Étienne; Leymarie, Cyril; Pantano, Thais; Cribiu, Edmond Paul; Bibé, Bernard; Verrier, Étienne

    2008-01-01

    Effective selection on the PrP gene has been implemented since October 2001 in all French sheep breeds. After four years, the ARR "resistant" allele frequency increased by about 35% in young males. The aim of this study was to evaluate the impact of this strong selection on genetic variability. It is focussed on four French sheep breeds and based on the comparison of two groups of 94 animals within each breed: the first group of animals was born before the selection began, and the second, 3–4 years later. Genetic variability was assessed using genealogical and molecular data (29 microsatellite markers). The expected loss of genetic variability on the PrP gene was confirmed. Moreover, among the five markers located in the PrP region, only the three closest ones were affected. The evolution of the number of alleles, heterozygote deficiency within population, expected heterozygosity and the Reynolds distances agreed with the criteria from pedigree and pointed out that neutral genetic variability was not much affected. This trend depended on breed, i.e. on their initial states (population size, PrP frequencies) and on the selection strategies for improving scrapie resistance while carrying out selection for production traits. PMID:18990357
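
    One of the molecular criteria named above, expected heterozygosity, has a simple closed form: one minus the sum of squared allele frequencies at a locus. The sketch below computes it from allele counts; the function name and the counts are hypothetical, and the study may have used a sample-size-corrected estimator that the abstract does not specify.

```python
def expected_heterozygosity(allele_counts):
    """Expected heterozygosity at one locus: 1 - sum(p_i ** 2).

    allele_counts: number of observed copies of each allele.
    """
    total = sum(allele_counts)
    return 1.0 - sum((count / total) ** 2 for count in allele_counts)


# Hypothetical microsatellite locus before and after selection.
before = expected_heterozygosity([25, 25, 25, 25])  # four equally common alleles
after = expected_heterozygosity([85, 5, 5, 5])      # one allele swept to high frequency
print(before, after)
```

    A marker linked to the selected gene would show a drop of this kind, whereas the neutral markers in the study reportedly changed little.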

  14. pelB gene in isolates of Colletotrichum gloeosporioides from several hosts.

    PubMed

    Medeiros, L V; Maciel, D B; Medeiros, V V; Houllou Kido, L M; Oliveira, N T

    2010-04-13

    Colletotrichum gloeosporioides is an important pathogen for a great number of economically important crops. During the necrotrophic phase of infection by Colletotrichum spp, the degradative enzymes of plant cell walls, such as pectate lyase, clearly increase. A gene pelB that expresses a pectate lyase was identified in isolates of C. gloeosporioides in avocado pathogens. Various molecular studies have identified a kind of specialization of C. gloeosporioides isolates with specific hosts; however, there have been no studies of this gene in isolates from hosts other than avocado. The same is true for other species of Colletotrichum. We examined genetic variability in order to design primers that would amplify pelB gene fragments and compared the products of this amplification in C. gloeosporioides isolates from different hosts. Genetic variability was assessed using ISSR primers; the resultant data were grouped based on the UPGMA clustering method. Primers for the pelB gene were designed from selected GenBank sequences using the Primer 3 program at an annealing temperature of 60 °C and product amplification of nearly 600 bp. The ISSR primers were efficient in demonstrating the genetic variability of the Colletotrichum isolates and in distinguishing C. gloeosporioides, C. acutatum and C. sublineolum species. The gene pelB was found in C. gloeosporioides, C. acutatum and C. sublineolum. Amplified restriction fragments using MspI did not reveal differences in pelB gene structure in isolates from the three different host species that we investigated.

  15. Effects of sample size, number of markers, and allelic richness on the detection of spatial genetic pattern

    USGS Publications Warehouse

    Landguth, Erin L.; Fedy, Bradley C.; Oyler-McCance, Sara J.; Garey, Andrew L.; Emel, Sarah L.; Mumma, Matthew; Wagner, Helene H.; Fortin, Marie-Josée; Cushman, Samuel A.

    2012-01-01

    The influence of study design on the ability to detect the effects of landscape pattern on gene flow is one of the most pressing methodological gaps in landscape genetic research. To investigate the effect of study design on landscape genetics inference, we used a spatially-explicit, individual-based program to simulate gene flow in a spatially continuous population inhabiting a landscape with gradual spatial changes in resistance to movement. We simulated a wide range of combinations of number of loci, number of alleles per locus and number of individuals sampled from the population. We assessed how these three aspects of study design influenced the statistical power to successfully identify the generating process among competing hypotheses of isolation-by-distance, isolation-by-barrier, and isolation-by-landscape resistance using a causal modelling approach with partial Mantel tests. We modelled the statistical power to identify the generating process as a response surface for equilibrium and non-equilibrium conditions after introduction of isolation-by-landscape resistance. All three variables (loci, alleles and sampled individuals) affect the power of causal modelling, but to different degrees. Stronger partial Mantel r correlations between landscape distances and genetic distances were found when more loci were used and when loci were more variable, which makes comparisons of effect size between studies difficult. Number of individuals did not affect the accuracy through mean equilibrium partial Mantel r, but larger samples decreased the uncertainty (increasing the precision) of equilibrium partial Mantel r estimates. We conclude that amplifying more (and more variable) loci is likely to increase the power of landscape genetic inferences more than increasing number of individuals.
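
    The partial Mantel r mentioned above is the correlation between two distance matrices after controlling for a third. A minimal sketch, assuming symmetric matrices given as nested lists: the upper triangles are flattened and combined with the standard first-order partial correlation formula. The function names and the toy matrices are hypothetical, and the permutation step normally used to assess significance is omitted.

```python
import math


def upper_triangle(matrix):
    """Flatten the strict upper triangle of a square distance matrix."""
    n = len(matrix)
    return [matrix[i][j] for i in range(n) for j in range(i + 1, n)]


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def partial_mantel_r(genetic, landscape, geographic):
    """Correlation of genetic and landscape distances, controlling for geographic."""
    g, l, d = (upper_triangle(m) for m in (genetic, landscape, geographic))
    r_gl, r_gd, r_ld = pearson(g, l), pearson(g, d), pearson(l, d)
    return (r_gl - r_gd * r_ld) / math.sqrt((1 - r_gd ** 2) * (1 - r_ld ** 2))


# Toy example with four individuals; genetic distance tracks landscape resistance.
landscape = [[0, 1, 2, 3], [1, 0, 4, 5], [2, 4, 0, 6], [3, 5, 6, 0]]
geographic = [[0, 1, 3, 2], [1, 0, 5, 4], [3, 5, 0, 6], [2, 4, 6, 0]]
genetic = landscape
print(partial_mantel_r(genetic, landscape, geographic))
```

    In the causal modelling framework, a hypothesis such as isolation-by-landscape resistance is supported when this statistic stays significant after partialling out the competing distance, while the reverse test does not.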

  16. Effects of sample size, number of markers, and allelic richness on the detection of spatial genetic pattern

    USGS Publications Warehouse

    Landguth, E.L.; Fedy, B.C.; Oyler-McCance, S.J.; Garey, A.L.; Emel, S.L.; Mumma, M.; Wagner, H.H.; Fortin, M.-J.; Cushman, S.A.

    2012-01-01

    The influence of study design on the ability to detect the effects of landscape pattern on gene flow is one of the most pressing methodological gaps in landscape genetic research. To investigate the effect of study design on landscape genetics inference, we used a spatially-explicit, individual-based program to simulate gene flow in a spatially continuous population inhabiting a landscape with gradual spatial changes in resistance to movement. We simulated a wide range of combinations of number of loci, number of alleles per locus and number of individuals sampled from the population. We assessed how these three aspects of study design influenced the statistical power to successfully identify the generating process among competing hypotheses of isolation-by-distance, isolation-by-barrier, and isolation-by-landscape resistance using a causal modelling approach with partial Mantel tests. We modelled the statistical power to identify the generating process as a response surface for equilibrium and non-equilibrium conditions after introduction of isolation-by-landscape resistance. All three variables (loci, alleles and sampled individuals) affect the power of causal modelling, but to different degrees. Stronger partial Mantel r correlations between landscape distances and genetic distances were found when more loci were used and when loci were more variable, which makes comparisons of effect size between studies difficult. Number of individuals did not affect the accuracy through mean equilibrium partial Mantel r, but larger samples decreased the uncertainty (increasing the precision) of equilibrium partial Mantel r estimates. We conclude that amplifying more (and more variable) loci is likely to increase the power of landscape genetic inferences more than increasing number of individuals. © 2011 Blackwell Publishing Ltd.

  17. Parameters selection in gene selection using Gaussian kernel support vector machines by genetic algorithm.

    PubMed

    Mao, Yong; Zhou, Xiao-Bo; Pi, Dao-Ying; Sun, You-Xian; Wong, Stephen T C

    2005-10-01

    In microarray-based cancer classification, gene selection is an important issue owing to the large number of variables and small number of samples as well as its non-linearity. It is difficult to get satisfying results by using conventional linear statistical methods. Recursive feature elimination based on support vector machine (SVM RFE) is an effective algorithm for gene selection and cancer classification, which are integrated into a consistent framework. In this paper, we propose a new method to select the parameters of the aforementioned algorithm implemented with Gaussian kernel SVMs, as a better alternative to the common practice of selecting the apparently best parameters: a genetic algorithm is used to search for an optimal pair of parameters. Fast implementation issues for this method are also discussed for pragmatic reasons. The proposed method was tested on two representative hereditary breast cancer and acute leukaemia datasets. The experimental results indicate that the proposed method performs well in selecting genes and achieves high classification accuracies with these genes.

  18. Association of a Monoamine Oxidase-A Gene Promoter Polymorphism with ADHD and Anxiety in Boys with Autism Spectrum Disorder

    ERIC Educational Resources Information Center

    Roohi, Jasmin; DeVincent, Carla J.; Hatchwell, Eli; Gadow, Kenneth D.

    2009-01-01

    The aim of the present study was to examine the association between a variable number tandem repeat (VNTR) functional polymorphism in the promoter region of the MAO-A gene and severity of ADHD and anxiety in boys with ASD. Parents and teachers completed a DSM-IV-referenced rating scale for 5- to 14-year-old boys with ASD (n = 43). Planned…

  19. Analysis of copy number variations in Holstein-Friesian cow genomes based on whole-genome sequence data.

    PubMed

    Mielczarek, M; Frąszczak, M; Giannico, R; Minozzi, G; Williams, John L; Wojdak-Maksymiec, K; Szyda, J

    2017-07-01

    Thirty-two whole genome DNA sequences of cows were analyzed to evaluate inter-individual variability in the distribution and length of copy number variations (CNV) and to functionally annotate CNV breakpoints. The total number of deletions per individual varied between 9,731 and 15,051, whereas the number of duplications was between 1,694 and 5,187. Most of the deletions (81%) and duplications (86%) were unique to a single cow. No relation between the pattern of variant sharing and a family relationship or disease status was found. The animal-averaged length of deletions was from 5,234 to 9,145 bp and the average length of duplications was between 7,254 and 8,843 bp. Highly significant inter-individual variation in length and number of CNV was detected for both deletions and duplications. The majority of deletion and duplication breakpoints were located in intergenic regions and introns, whereas fewer were identified in noncoding transcripts and splice regions. Only 1.35 and 0.79% of the deletion and duplication breakpoints were observed within coding regions. A gene with the highest number of deletion breakpoints codes for protein kinase cGMP-dependent type I, whereas the T-cell receptor α constant gene had the most duplication breakpoints. The functional annotation of genes with the largest incidence of deletion/duplication breakpoints identified 87/112 Kyoto Encyclopedia of Genes and Genomes pathways, but none of the pathways were significantly enriched or depleted with breakpoints. The analysis of Gene Ontology (GO) terms revealed that a cluster with the highest enrichment score among genes with many deletion breakpoints was represented by GO terms related to ion transport, whereas the GO term cluster mostly enriched among the genes with many duplication breakpoints was related to binding of macromolecules. Furthermore, when considering the number of deletion breakpoints per gene functional category, no significant differences were observed between the "housekeeping" and "strong selection" categories, but genes representing the "low selection pressure" group showed a significantly higher number of breakpoints. Copyright © 2017 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  20. ZAP-70 staining in chronic lymphocytic leukemia.

    PubMed

    Villamor, Neus

    2005-05-01

    Chronic lymphocytic leukemia (CLL) is the most common chronic leukemia in Western countries. The disease has an extremely variable clinical course, and several prognostic features have been identified to assess individual risk. The configuration of the immunoglobulin variable heavy-chain gene (IgV(H)) is a strong predictor of the outcome. CLL patients with unmutated IgV(H) status have an aggressive clinical course and a short survival. Unfortunately, analysis of IgV(H) gene configuration is not available in most clinical laboratories. A small number of genes are differentially expressed between unmutated IgV(H) and mutated IgV(H) clinical forms of CLL. One of these genes is ZAP-70, which is detected in leukemic cells from patients with the unmutated IgV(H) form of CLL. Flow cytometry presents advantages over other methods to detect ZAP-70, and its quantification by flow cytometry has proved its predictive value. This unit focuses on protocols to quantify ZAP-70 by flow cytometry in CLL.

  1. 5S rRNA gene arrangements in protists: a case of nonadaptive evolution.

    PubMed

    Drouin, Guy; Tsang, Corey

    2012-06-01

    Given their high copy number and high level of expression, one might expect that both the sequence and organization of eukaryotic ribosomal RNA genes would be conserved during evolution. Although the organization of 18S, 5.8S and 28S ribosomal RNA genes is indeed relatively well conserved, that of 5S rRNA genes is much more variable. Here, we review the different types of 5S rRNA gene arrangements which have been observed in protists. This includes linkages to the other ribosomal RNA genes as well as linkages to ubiquitin, splice-leader, snRNA and tRNA genes. Mapping these linkages to independently derived phylogenies shows that these diverse linkages have repeatedly been gained and lost during evolution. This argues against such linkages being the primitive condition not only in protists but also in other eukaryote species. Because the only characteristic shared by the diverse genes to which 5S rRNA genes are found linked is that they are tandemly repeated, these arrangements are unlikely to provide any selective advantage. Rather, the observed high variability in 5S rRNA gene arrangements is likely the result of the fact that 5S rRNA genes contain internal promoters, that these genes are often transposed by diverse recombination mechanisms and that these new gene arrangements are rapidly homogenized by unequal crossing-overs and/or by gene conversion events in species with short generation times and frequent founder events.

  2. Multiple-locus variable number of tandem repeats fingerprinting (MLVF) and virulence factor analysis of methicillin resistant Staphylococcus aureus SCCmec type III.

    PubMed

    Emaneini, Mohammad; Jabalameli, Leila; Iman-Eini, Hossein; Aligholi, Marzieh; Ghasemi, Amir; Nakhjavani, Farrokh Akbari; Taherikalani, Morovat; Khoramian, Babak; Asadollahi, Parisa; Jabalameli, Fereshteh

    2011-01-01

    Methicillin resistant Staphylococcus aureus (MRSA), particularly strains with type III staphylococcal cassette chromosome mec (SCCmec), represent a serious human pathogen in Tehran, Iran. The disease-causing capability depends on their ability to produce a wide variety of virulence factors. The prevalence of exotoxin genes and multiple-locus variable number of tandem repeats fingerprinting (MLVF) profile among MRSA isolates, from patients in Tehran, was evaluated by PCR and Multiplex-PCR. The MLVF typing of 144 MRSA isolates with type III SCCmec produced 5 different MLVF types. Generally, 97.2% (140/144) of all the isolates were positive for at least one of the tested exotoxin genes. The most prevalent genes were hld, found in 87.5% (126/144) of the isolates, followed by lukE-lukD and hla, found in 72.9% (105/144) and 70.1% (101/144) of the isolates, respectively. The tst gene, belonging to MLVF types I, IV and V, was found among three of the isolates from blood and wound samples. The sea gene was detected in 58.3% (84/144) of the isolates and the sed and see genes were found in one isolate with MLVF type V. The coexistence of genes was observed in 87.5% (126/144) of the isolates. The rates of coexistence of hld with lukE-lukD, hla with lukE-lukD and sea with lukE-lukD were 66.7% (96/144), 44.4% (64/144) and 44.4% (64/144), respectively. The present study demonstrated that MRSA strains with type III SCCmec show different MLVF patterns and exotoxin profiles.

  3. Single-cell analysis of transcription kinetics across the cell cycle

    PubMed Central

    Skinner, Samuel O; Xu, Heng; Nagarkar-Jaiswal, Sonal; Freire, Pablo R; Zwaka, Thomas P; Golding, Ido

    2016-01-01

    Transcription is a highly stochastic process. To infer transcription kinetics for a gene-of-interest, researchers commonly compare the distribution of mRNA copy-number to the prediction of a theoretical model. However, the reliability of this procedure is limited because the measured mRNA numbers represent integration over the mRNA lifetime, contribution from multiple gene copies, and mixing of cells from different cell-cycle phases. We address these limitations by simultaneously quantifying nascent and mature mRNA in individual cells, and incorporating cell-cycle effects in the analysis of mRNA statistics. We demonstrate our approach on Oct4 and Nanog in mouse embryonic stem cells. Both genes follow similar two-state kinetics. However, Nanog exhibits slower ON/OFF switching, resulting in increased cell-to-cell variability in mRNA levels. Early in the cell cycle, the two copies of each gene exhibit independent activity. After gene replication, the probability of each gene copy to be active diminishes, resulting in dosage compensation. DOI: http://dx.doi.org/10.7554/eLife.12175.001 PMID:26824388

  4. Potential efficacy of mitochondrial genes for animal DNA barcoding: a case study using eutherian mammals.

    PubMed

    Luo, Arong; Zhang, Aibing; Ho, Simon Yw; Xu, Weijun; Zhang, Yanzhou; Shi, Weifeng; Cameron, Stephen L; Zhu, Chaodong

    2011-01-28

    A well-informed choice of genetic locus is central to the efficacy of DNA barcoding. Current DNA barcoding in animals involves the use of the 5' half of the mitochondrial cytochrome oxidase 1 gene (CO1) to diagnose and delimit species. However, there is no compelling a priori reason for the exclusive focus on this region, and it has been shown that it performs poorly for certain animal groups. To explore alternative mitochondrial barcoding regions, we compared the efficacy of the universal CO1 barcoding region with the other mitochondrial protein-coding genes in eutherian mammals. Four criteria were used for this comparison: the number of recovered species, sequence variability within and between species, resolution to taxonomic levels above that of species, and the degree of mutational saturation. Based on 1,179 mitochondrial genomes of eutherians, we found that the universal CO1 barcoding region is a good representative of mitochondrial genes as a whole because the high species-recovery rate (> 90%) was similar to that of other mitochondrial genes, and there were no significant differences in intra- or interspecific variability among genes. However, an overlap between intra- and interspecific variability was still problematic for all mitochondrial genes. Our results also demonstrated that any choice of mitochondrial gene for DNA barcoding failed to offer significant resolution at higher taxonomic levels. We suggest that the CO1 barcoding region, the universal DNA barcode, is preferred among the mitochondrial protein-coding genes as a molecular diagnostic at least for eutherian species identification. Nevertheless, DNA barcoding with this marker may still be problematic for certain eutherian taxa and our approach can be used to test potential barcoding loci for such groups.

  5. Potential efficacy of mitochondrial genes for animal DNA barcoding: a case study using eutherian mammals

    PubMed Central

    2011-01-01

    Background: A well-informed choice of genetic locus is central to the efficacy of DNA barcoding. Current DNA barcoding in animals involves the use of the 5' half of the mitochondrial cytochrome oxidase 1 gene (CO1) to diagnose and delimit species. However, there is no compelling a priori reason for the exclusive focus on this region, and it has been shown that it performs poorly for certain animal groups. To explore alternative mitochondrial barcoding regions, we compared the efficacy of the universal CO1 barcoding region with the other mitochondrial protein-coding genes in eutherian mammals. Four criteria were used for this comparison: the number of recovered species, sequence variability within and between species, resolution to taxonomic levels above that of species, and the degree of mutational saturation. Results: Based on 1,179 mitochondrial genomes of eutherians, we found that the universal CO1 barcoding region is a good representative of mitochondrial genes as a whole because the high species-recovery rate (> 90%) was similar to that of other mitochondrial genes, and there were no significant differences in intra- or interspecific variability among genes. However, an overlap between intra- and interspecific variability was still problematic for all mitochondrial genes. Our results also demonstrated that any choice of mitochondrial gene for DNA barcoding failed to offer significant resolution at higher taxonomic levels. Conclusions: We suggest that the CO1 barcoding region, the universal DNA barcode, is preferred among the mitochondrial protein-coding genes as a molecular diagnostic at least for eutherian species identification. Nevertheless, DNA barcoding with this marker may still be problematic for certain eutherian taxa and our approach can be used to test potential barcoding loci for such groups. PMID:21276253

  6. Correlation between Hox code and vertebral morphology in archosaurs.

    PubMed

    Böhmer, Christine; Rauhut, Oliver W M; Wörheide, Gert

    2015-07-07

    The relationship between developmental genes and phenotypic variation is of central interest in evolutionary biology. An excellent example is the role of Hox genes in the anteroposterior regionalization of the vertebral column in vertebrates. Archosaurs (crocodiles, dinosaurs including birds) are highly variable both in vertebral morphology and number. Nevertheless, functionally equivalent Hox genes are active in the axial skeleton during embryonic development, indicating that the morphological variation across taxa is likely owing to modifications in the pattern of Hox gene expression. By using geometric morphometrics, we demonstrate a correlation between vertebral Hox code and quantifiable vertebral morphology in modern archosaurs, in which the boundaries between morphological subgroups of vertebrae can be linked to anterior Hox gene expression boundaries. Our findings reveal homologous units of cervical vertebrae in modern archosaurs, each with their specific Hox gene pattern, enabling us to trace these homologies in the extinct sauropodomorph dinosaurs, a group with highly variable vertebral counts. Based on the quantifiable vertebral morphology, this allows us to infer the underlying genetic mechanisms in vertebral evolution in fossils, which represents not only an important case study, but will lead to a better understanding of the origin of morphological disparity in recent archosaur vertebral columns.

  7. Correlation between Hox code and vertebral morphology in archosaurs

    PubMed Central

    Böhmer, Christine; Rauhut, Oliver W. M.; Wörheide, Gert

    2015-01-01

    The relationship between developmental genes and phenotypic variation is of central interest in evolutionary biology. An excellent example is the role of Hox genes in the anteroposterior regionalization of the vertebral column in vertebrates. Archosaurs (crocodiles, dinosaurs including birds) are highly variable both in vertebral morphology and number. Nevertheless, functionally equivalent Hox genes are active in the axial skeleton during embryonic development, indicating that the morphological variation across taxa is likely owing to modifications in the pattern of Hox gene expression. By using geometric morphometrics, we demonstrate a correlation between vertebral Hox code and quantifiable vertebral morphology in modern archosaurs, in which the boundaries between morphological subgroups of vertebrae can be linked to anterior Hox gene expression boundaries. Our findings reveal homologous units of cervical vertebrae in modern archosaurs, each with their specific Hox gene pattern, enabling us to trace these homologies in the extinct sauropodomorph dinosaurs, a group with highly variable vertebral counts. Based on the quantifiable vertebral morphology, this allows us to infer the underlying genetic mechanisms in vertebral evolution in fossils, which represents not only an important case study, but will lead to a better understanding of the origin of morphological disparity in recent archosaur vertebral columns. PMID:26085583

  8. Somatic diversification in the heavy chain variable region genes expressed by human autoantibodies bearing a lupus-associated nephritogenic anti-DNA idiotype

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Demaison, C.; Chastagner, P.; Theze, J.

    1994-01-18

    Monoclonal anti-DNA antibodies bearing a lupus nephritis-associated idiotype were derived from five patients with systemic lupus erythematosus (SLE). Genes encoding their heavy (H)-chain variable (V[sub H]) regions were cloned and sequenced. When compared with their closest V[sub H] germ-line gene relatives, these sequences exhibit a number of silent (S) and replacement (R) substitutions. The ratios of R/S mutations were much higher in the complementarity-determining regions (CDRs) of the antibodies than in the framework regions. Molecular amplification of genomic V[sub H] genes and Southern hybridization with somatic CDR2-specific oligonucleotide probes showed that the configuration of the V[sub H] genes corresponding to V[sub H] sequences in the nephritogenic antibodies is not present in the patient's own germ-line DNA, implying that the B-cell clones underwent somatic mutation in vivo. These findings, together with the characteristics of the diversity and junctional gene elements utilized to form the antibody, indicate that these autoantibodies have been driven through somatic selection processes reminiscent of those that govern antibody responses triggered by exogenous stimuli.

  9. [Polymorphism of KPI-A genes from plants of the subgenus Potatoe (sect. Petota, Estolonifera and Lycopersicum) and subgenus Solanum].

    PubMed

    Krinitsyna, A A; Mel'nikova, N V; Belenikin, M S; Poltronieri, P; Santino, A; Kudriavtseva, A V; Savilova, A M; Speranskaia, A S

    2013-01-01

    Kunitz-type proteinase inhibitor proteins of group A (KPI-A) are involved in the protection of potato plants from pathogens and pests. Although sequences of a large number of the KPI-A genes from different species of cultivated potato (Solanum tuberosum subsp. tuberosum) and a few genes from tomato (Solanum lycopersicum) are known to date, information about the allelic diversity of these genes in other species of the genus Solanum is lacking. In our work, the consensus sequences of the KPI-A genes were established in two species of subgenus Potatoe sect. Petota (Solanum tuberosum subsp. andigenum--5 genes and Solanum stoloniferum--2 genes) and in the subgenus Solanum (Solanum nigrum--5 genes) by amplification, cloning, sequencing and subsequent analysis. The determined sequences of KPI-A genes were 97-100% identical to known sequences of the cultivated potato of sect. Petota (cultivated potato Solanum tuberosum subsp. tuberosum) and sect. Etuberosum (S. palustre). The interspecific variability of these genes did not exceed the intraspecific variability for all studied species except Solanum lycopersicum. The distribution of highly variable and conserved sequences in the mature protein-encoding regions was uniform for all investigated KPI-A genes. However, our attempts to amplify the homologous genes using the same primers and the genomes of Solanum dulcamarum, Solanum lycopersicum and Mandragora officinarum resulted in no product formation. Phylogenetic analysis of KPI-A diversity showed that the sequences of S. lycopersicum form an independent cluster, whereas KPI-A of S. nigrum and species of sect. Etuberosum and sect. Petota are closely related and do not form species-specific subclusters. Although Solanum nigrum is resistant to all known races of the oomycete Phytophthora infestans, which causes one of the most economically important diseases of solanaceous plants, the amino acid sequences encoded by KPI-A genes from its genome differ little or not at all from those encoded in the genomes of cultivated potatoes affected by P. infestans.

  10. Evidence of oligogenic sex determination in the apple snail Pomacea canaliculata.

    PubMed

    Yusa, Yoichi; Kumagai, Natsumi

    2018-06-01

    A small number of genes may interact to determine sex, but few such examples have been demonstrated in animals, especially through comprehensive mating experiments. The highly invasive apple snail Pomacea canaliculata is gonochoristic and shows a large variation in brood sex ratio, and the involvement of multiple genes has been suggested for this phenomenon. We conducted mating experiments to determine whether their sex determination involves a few or many genes (i.e., oligogenic or polygenic sex determination, respectively). Full-sib females or males that were born from the same parents were mated to an adult of the opposite sex, and the brood sex ratios of the parents and their offspring were investigated. Analysis of a total of 4288 offspring showed that the sex ratios of offspring from the full-sib females were variable but clustered into only a few values. Similar patterns were observed for the full-sib males, although the effect was less clear because fewer offspring were used (n = 747). Notably, the offspring sex ratios of all full-sib females in some families were nearly 0.5 (proportion of males) with little variation. These results indicate that the number of genotypes of the full-sibs, and hence genes involved in sex determination, is small in this snail. Such oligogenic systems may be a major sex-determining system among animals, especially those with variable sex ratios.

  11. HLA-E regulatory and coding region variability and haplotypes in a Brazilian population sample.

    PubMed

    Ramalho, Jaqueline; Veiga-Castelli, Luciana C; Donadi, Eduardo A; Mendes-Junior, Celso T; Castelli, Erick C

    2017-11-01

    The HLA-E gene is characterized by low but wide expression on different tissues. HLA-E is considered a conserved gene, being one of the least polymorphic class I HLA genes. The HLA-E molecule interacts with Natural Killer cell receptors and T lymphocyte receptors, and might activate or inhibit immune responses depending on the peptide associated with HLA-E and on which receptors HLA-E interacts with. Variable sites within the HLA-E regulatory and coding segments may influence the gene function by modifying its expression pattern or encoded molecule, thus influencing its interaction with receptors and the peptide. Here we propose an approach to evaluate the gene structure, haplotype pattern and the complete HLA-E variability, including regulatory (promoter and 3'UTR) and coding segments (with introns), by using massively parallel sequencing. We investigated the variability of 420 samples from a highly admixed population, Brazilians, by using this approach. Considering a segment of about 7kb, 63 variable sites were detected, arranged into 75 extended haplotypes. We detected 37 different promoter sequences (but few frequent ones), 27 different coding sequences (15 representing new HLA-E alleles) and 12 haplotypes at the 3'UTR segment, two of them presenting a summed frequency of 90%. Despite the number of coding alleles, they encode mainly two different full-length molecules, known as E*01:01 and E*01:03, which together correspond to about 90% of all. In addition, unlike what has been previously observed for other non-classical HLA genes, the relationship among the HLA-E promoter, coding and 3'UTR haplotypes is not straightforward because the same promoter and 3'UTR haplotypes were often associated with different HLA-E coding haplotypes. These data reinforce the presence of only two main full-length HLA-E molecules encoded by the many HLA-E alleles detected in our population sample. In addition, these data indicate that the distal HLA-E promoter is by far the most variable segment. Further analyses involving the binding of transcription factors and non-coding RNAs, as well as the HLA-E expression in different tissues, are necessary to evaluate whether these variable sites at regulatory segments (or even at the coding sequence) may influence the gene expression profile. Copyright © 2017 Elsevier Ltd. All rights reserved.

  12. Phylogenetic analysis of ionotropic L-glutamate receptor genes in the Bilateria, with special notes on Aplysia californica.

    PubMed

    Greer, Justin B; Khuri, Sawsan; Fieber, Lynne A

    2017-01-11

    The neurotransmitter L-Glutamate (L-Glu) acting at ionotropic L-Glu receptors (iGluR) conveys fast excitatory signal transmission in the nervous systems of all animals. iGluR-dependent neurotransmission is a key component of the synaptic plasticity that underlies learning and memory. During learning, two subtypes of iGluR, α-Amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid receptors (AMPAR) and N-methyl-D-aspartate receptors (NMDAR), are dynamically regulated postsynaptically in vertebrates. Invertebrate organisms such as Aplysia californica (Aplysia) are well-studied models for iGluR-mediated function, yet no studies to date have analyzed the evolutionary relationships between iGluR genes in these species and those in vertebrates, to identify genes that may mediate plasticity. We conducted a thorough phylogenetic analysis spanning Bilateria to elucidate these relationships. The expression status of iGluR genes in the Aplysia nervous system was also examined. Our analysis shows that ancestral genes for both NMDAR and AMPAR subtypes were present in the common bilaterian ancestor. NMDAR genes show very high conservation in motifs responsible for forming the conductance pore of the ion channel. The number of NMDAR subunits is greater in vertebrates due to an increased number of splice variants and an increased number of genes, likely due to gene duplication events. AMPAR subunits form an orthologous group, and there is high variability in the number of AMPAR genes in each species due to extensive taxon-specific gene gain and loss. qPCR results show that all 12 Aplysia iGluR subunits are expressed in all nervous system ganglia. The presence of orthologous NMDAR subunits in all species studied suggests conserved function across Bilateria, and potentially a conserved mechanism of neuroplasticity and learning. Vertebrates display an increased number of NMDAR genes and splice variants, which may play a role in their greater diversity of physiological responses. Extensive gene gain and loss of AMPAR genes may result in different physiological properties that are taxon-specific. Our results suggest a significant role for L-Glu-mediated responses throughout the Aplysia nervous system, consistent with L-Glu's role as the primary excitatory neurotransmitter.

  13. Mitochondrial Genome Variation after Hybridization and Differences in the First and Second Generation Hybrids of Bream Fishes

    PubMed Central

    Zhang, Wei-Zhuo; Xiong, Xue-Mei; Zhang, Xiu-Jie; Wan, Shi-Ming; Guan, Ning-Nan; Nie, Chun-Hong; Zhao, Bo-Wen; Hsiao, Chung-Der; Wang, Wei-Min; Gao, Ze-Xia

    2016-01-01

    Hybridization plays an important role in fish breeding. Bream fishes contribute a lot to aquaculture in China due to their economically valuable characteristics, and the present study included five bream species: Megalobrama amblycephala, Megalobrama skolkovii, Megalobrama pellegrini, Megalobrama terminalis and Parabramis pekinensis. As maternal inheritance of the mitochondrial genome (mitogenome) involves species-specific regulation, we aimed to investigate how the inheritance of the mitogenome is affected by hybridization in these fish species. With complete mitogenomes of 7 hybrid groups of bream species reported for the first time in the present study, a comparative analysis of 17 mitogenomes was conducted, including representatives of these 5 bream species, 6 first generation hybrids and 6 second generation hybrids. The results showed that these 17 mitogenomes shared the same gene arrangement, and had similar gene size and base composition. According to the phylogenetic analyses, all mitogenomes of the hybrids were consistent with a maternal inheritance. However, a certain number of variable sites were detected in all F1 hybrid groups compared to their female parents, especially in the group of M. terminalis (♀) × M. amblycephala (♂) (MT×MA), with a total of 86 variable sites between MT×MA and its female parent. Among the mitogenome genes, the protein-coding gene nd5 displayed the highest variability. The number of variable sites was found to be related to the phylogenetic relationship of the parents: the closer they are, the fewer variable sites their hybrids have. The second generation hybrids showed less mitogenome variation than the first generation hybrids. The ratios of non-synonymous to synonymous substitution rates (dN/dS) were calculated between all the hybrids and their own female parents, and the results indicated that most PCGs were under negative selection. PMID:27391325

  14. [Serotonin receptor (5-HTR2A) and dysbindin (DTNBP1) genes and component process variables of short-term verbal memory in schizophrenia].

    PubMed

    Alfimova, M V; Monakhov, M V; Abramova, L I; Golubev, S A; Golimbet, V E

    2009-01-01

    An association study of variations in the DTNBP1 (P1763 and P1578) and 5-HTR2A (T102C and A-1438G) genes with short-term verbal memory efficiency and its component process variables was carried out in 405 patients with schizophrenia and 290 healthy controls. All subjects were asked to recall immediately two sets of 10 words. Total recall, List 1 recall, immediate recall or attention span, proactive interference and the number of intrusions were measured. Patients differed significantly from controls on all memory variables. The efficiency of test performance, efficiency of immediate memory, effect of proactive interference as well as number of intrusions were decreased in the group of patients. Both 5-HTR2A polymorphisms were associated with short-term verbal memory efficiency in the combined sample, with the worst performance observed in carriers of homozygous CC (T102C) and GG (A-1438G) genotypes. A significant effect of the P1763 (DTNBP1) marker on the component process variables (proactive interference and intrusions) was found, while its effect on the total recall was non-significant. The homozygotes for GG (P1763) had the worst scores. Overall, the data obtained are in line with the concept of DTNBP1 and 5-HTR2A involvement in different component process variables of memory in healthy subjects and patients with schizophrenia.

  15. High Genetic Diversity Revealed by Variable-Number Tandem Repeat Genotyping and Analysis of hsp65 Gene Polymorphism in a Large Collection of “Mycobacterium canettii” Strains Indicates that the M. tuberculosis Complex Is a Recently Emerged Clone of “M. canettii”

    PubMed Central

    Fabre, Michel; Koeck, Jean-Louis; Le Flèche, Philippe; Simon, Fabrice; Hervé, Vincent; Vergnaud, Gilles; Pourcel, Christine

    2004-01-01

    We have analyzed, using complementary molecular methods, the diversity of 43 strains of “Mycobacterium canettii” originating from the Republic of Djibouti, on the Horn of Africa, from 1998 to 2003. Genotyping by multiple-locus variable-number tandem repeat analysis shows that all the strains belong to a single but very distant group when compared to strains of the Mycobacterium tuberculosis complex (MTBC). Thirty-one strains cluster into one large group with little variability and five strains form another group, whereas the other seven are more diverged. In total, 14 genotypes are observed. The DR locus analysis reveals additional variability, some strains being devoid of a direct repeat locus and others having unique spacers. The hsp65 gene polymorphism was investigated by restriction enzyme analysis and sequencing of PCR amplicons. Four new single nucleotide polymorphisms were discovered. One strain was characterized by three nucleotide changes in 441 bp, creating new restriction enzyme polymorphisms. As no sequence variability was found for hsp65 in the whole MTBC, and as a single point mutation separates M. tuberculosis from the closest “M. canettii” strains, this diversity within “M. canettii” subspecies strongly suggests that it is the most probable source species of the MTBC rather than just another branch of the MTBC. PMID:15243089

  16. Deep sequencing reveals cell-type-specific patterns of single-cell transcriptome variation.

    PubMed

    Dueck, Hannah; Khaladkar, Mugdha; Kim, Tae Kyung; Spaethling, Jennifer M; Francis, Chantal; Suresh, Sangita; Fisher, Stephen A; Seale, Patrick; Beck, Sheryl G; Bartfai, Tamas; Kuhn, Bernhard; Eberwine, James; Kim, Junhyong

    2015-06-09

    Differentiation of metazoan cells requires execution of different gene expression programs, but recent single-cell transcriptome profiling has revealed considerable variation within cells of seemingly identical phenotype. This brings into question the relationship between transcriptome states and cell phenotypes. Additionally, single-cell transcriptomics presents unique analysis challenges that need to be addressed to answer this question. We present high quality deep read-depth single-cell RNA sequencing for 91 cells from five mouse tissues and 18 cells from two rat tissues, along with 30 control samples of bulk RNA diluted to single-cell levels. We find that transcriptomes differ globally across tissues with regard to the number of genes expressed, the average expression patterns, and within-cell-type variation patterns. We develop methods to filter genes for reliable quantification and to calibrate biological variation. All cell types include genes with high variability in expression, in a tissue-specific manner. We also find evidence that single-cell variability of neuronal genes in mice is correlated with that in rats, consistent with the hypothesis that levels of variation may be conserved. Single-cell RNA-sequencing data provide a unique view of transcriptome function; however, careful analysis is required in order to use single-cell RNA-sequencing measurements for this purpose. Technical variation must be considered in single-cell RNA-sequencing studies of expression variation. For a subset of genes, biological variability within each cell type appears to be regulated in order to perform dynamic functions, rather than reflecting solely molecular noise.

  17. Identification of ANKRD11 and ZNF778 as candidate genes for autism and variable cognitive impairment in the novel 16q24.3 microdeletion syndrome

    PubMed Central

    Willemsen, Marjolein H; Fernandez, Bridget A; Bacino, Carlos A; Gerkes, Erica; de Brouwer, Arjan PM; Pfundt, Rolph; Sikkema-Raddatz, Birgit; Scherer, Stephen W; Marshall, Christian R; Potocki, Lorraine; van Bokhoven, Hans; Kleefstra, Tjitske

    2010-01-01

    The clinical use of array comparative genomic hybridization in the evaluation of patients with multiple congenital anomalies and/or mental retardation has recently led to the discovery of a number of novel microdeletion and microduplication syndromes. We present four male patients with overlapping molecularly defined de novo microdeletions of 16q24.3. The clinical features observed in these patients include facial dysmorphisms comprising prominent forehead, large ears, smooth philtrum, pointed chin and wide mouth, variable cognitive impairment, autism spectrum disorder, structural anomalies of the brain, seizures and neonatal thrombocytopenia. Although deletions vary in size, the common region of overlap is only 90 kb and comprises two known genes, Ankyrin Repeat Domain 11 (ANKRD11) (MIM 611192) and Zinc Finger 778 (ZNF778), and is located approximately 10 kb distally to Cadherin 15 (CDH15) (MIM 114019). This region is not found as a copy number variation in controls. We propose that these patients represent a novel and distinctive microdeletion syndrome, characterized by autism spectrum disorder, variable cognitive impairment, facial dysmorphisms and brain abnormalities. We suggest that haploinsufficiency of ANKRD11 and/or ZNF778 contribute to this phenotype and speculate that further investigation of non-deletion patients who have features suggestive of this 16q24.3 microdeletion syndrome might uncover other mutations in one or both of these genes. PMID:19920853

  18. Genetic Variability and Distribution of Mating Type Alleles in Field Populations of Leptosphaeria maculans from France

    PubMed Central

    Gout, Lilian; Eckert, Maria; Rouxel, Thierry; Balesdent, Marie-Hélène

    2006-01-01

    Leptosphaeria maculans is the most ubiquitous fungal pathogen of Brassica crops and causes the devastating stem canker disease of oilseed rape worldwide. We used minisatellite markers to determine the genetic structure of L. maculans in four field populations from France. Isolates were collected at three different spatial scales (leaf, 2-m² field plot, and field) enabling the evaluation of spatial distribution of the mating type alleles and of genetic variability within and among field populations. Within each field population, no gametic disequilibrium between the minisatellite loci was detected and the mating type alleles were present at equal frequencies. Both sexual and asexual reproduction occur in the field, but the genetic structure of these populations is consistent with annual cycles of randomly mating sexual reproduction. All L. maculans field populations had a high level of gene diversity (H = 0.68 to 0.75) and genotypic diversity. Within each field population, the number of genotypes often was very close to the number of isolates. Analysis of molecular variance indicated that >99.5% of the total genetic variability was distributed at a small spatial scale, i.e., within 2-m² field plots. Population differentiation among the four field populations was low (GST < 0.02), suggesting a high degree of gene exchange between these populations. The high gene flow evidenced here in French populations of L. maculans suggests a rapid countrywide diffusion of novel virulence alleles whenever novel resistance sources are used. PMID:16391041
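
    The gene diversity (H) and differentiation (GST) statistics quoted above can be illustrated with a short sketch. It assumes Nei's textbook definitions, H = 1 - sum(p_i^2) and GST = (HT - HS) / HT, with no small-sample correction; the abstract does not say which estimator or software the authors used, and the function names and allele labels below are hypothetical.

```python
from collections import Counter


def gene_diversity(alleles):
    """Nei's gene diversity at one locus: H = 1 - sum(p_i^2).

    `alleles` is a list of allele labels, one per haploid isolate.
    No small-sample correction is applied (an assumption of this sketch).
    """
    n = len(alleles)
    counts = Counter(alleles)
    return 1.0 - sum((c / n) ** 2 for c in counts.values())


def g_st(populations):
    """G_ST = (H_T - H_S) / H_T at one locus, populations weighted equally.

    H_S is the mean within-population diversity; H_T is the diversity
    computed from allele frequencies averaged over populations.
    Undefined (division by zero) if the locus is monomorphic overall.
    """
    k = len(populations)
    h_s = sum(gene_diversity(p) for p in populations) / k
    labels = {a for p in populations for a in p}
    mean_freqs = [sum(p.count(a) / len(p) for p in populations) / k for a in labels]
    h_t = 1.0 - sum(f ** 2 for f in mean_freqs)
    return (h_t - h_s) / h_t


# Hypothetical minisatellite allele calls for two small populations.
pop_1 = ["a", "a", "b", "b"]
pop_2 = ["a", "b", "a", "b"]
print(gene_diversity(pop_1))   # 0.5
print(g_st([pop_1, pop_2]))    # 0.0: identical allele frequencies
```

    A GST near zero, as reported in the abstract, means that almost all diversity is found within populations rather than between them.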

  19. Fine Analysis of Genetic Diversity of the tpr Gene Family among Treponemal Species, Subspecies and Strains

    PubMed Central

    Centurion-Lara, Arturo; Giacani, Lorenzo; Godornes, Charmie; Molini, Barbara J.; Brinck Reid, Tara; Lukehart, Sheila A.

    2013-01-01

    Background The pathogenic non-cultivable treponemes include three subspecies of Treponema pallidum (pallidum, pertenue, endemicum), T. carateum, T. paraluiscuniculi, and the unclassified Fribourg-Blanc treponeme (Simian isolate). These treponemes are morphologically indistinguishable and antigenically and genetically highly similar, yet cross-immunity is variable or non-existent. Although all of these organisms cause chronic, multistage skin and systemic disease, they have historically been classified by mode of transmission, clinical presentations and host ranges. Whole genome studies underscore the high degree of sequence identity among species, subspecies and strains, pinpointing a limited number of genomic regions for variation. Many of these “hot spots” include members of the tpr gene family, composed of 12 paralogs encoding candidate virulence factors. We hypothesize that the distinct clinical presentations, host specificity, and variable cross-immunity might reside on virulence factors such as the tpr genes. Methodology/Principal Findings Sequence analysis of 11 tpr loci (excluding tprK) from 12 strains demonstrated an impressive heterogeneity, including SNPs, indels, chimeric genes, truncated gene products and large deletions. Comparative analyses of sequences and 3D models of predicted proteins in Subfamily I highlight the striking co-localization of discrete variable regions with predicted surface-exposed loops. A hallmark of Subfamily II is the presence of chimeric genes in the tprG and J loci. Diversity in Subfamily III is limited to tprA and tprL. Conclusions/Significance An impressive sequence variability was found in tpr sequences among the Treponema isolates examined in this study, with most of the variation being consistent within subspecies or species, or between syphilis vs. non-syphilis strains. Variability was seen in the pallidum subspecies, which can be divided into 5 genogroups. 
    These findings support a genetic basis for the classification of these organisms into their respective subspecies and species. Future functional studies will determine whether the identified genetic differences relate to cross-immunity, clinical differences, or host ranges. PMID:23696912

  20. Rigor of cell fate decision by variable p53 pulses and roles of cooperative gene expression by p53

    PubMed Central

    Murakami, Yohei; Takada, Shoji

    2012-01-01

    Upon DNA damage, the cell fate decision between survival and apoptosis is largely regulated by p53-related networks. Recent experiments found a series of discrete p53 pulses in individual cells, which led to the hypothesis that the cell fate decision upon DNA damage is controlled by counting the number of p53 pulses. Under this hypothesis, Sun et al. (2009) modeled the Bax activation switch in the apoptosis signal transduction pathway that can rigorously “count” the number of uniform p53 pulses. Based on experimental evidence, here we use variable p53 pulses with Sun et al.’s model to investigate how the variability in p53 pulses affects the rigor of the cell fate decision by the pulse number. Our calculations showed that the experimentally anticipated variability in the pulse sizes reduces the rigor of the cell fate decision. In addition, we tested the roles of the cooperativity in PUMA expression by p53, finding that lower cooperativity is plausible for more rigorous cell fate decision. This is because the variability in the p53 pulse height is more amplified in PUMA expressions with more cooperative cases. PMID:27857606

  1. Quantifying Intrinsic and Extrinsic Variability in Stochastic Gene Expression Models

    PubMed Central

    Singh, Abhyudai; Soltani, Mohammad

    2013-01-01

    Genetically identical cell populations exhibit considerable intercellular variation in the level of a given protein or mRNA. Both intrinsic and extrinsic sources of noise drive this variability in gene expression. More specifically, extrinsic noise is the expression variability that arises from cell-to-cell differences in cell-specific factors such as enzyme levels, cell size and cell cycle stage. In contrast, intrinsic noise is the expression variability that is not accounted for by extrinsic noise, and typically arises from the inherent stochastic nature of biochemical processes. Two-color reporter experiments are employed to decompose expression variability into its intrinsic and extrinsic noise components. Analytical formulas for intrinsic and extrinsic noise are derived for a class of stochastic gene expression models, where variations in cell-specific factors cause fluctuations in model parameters, in particular, transcription and/or translation rate fluctuations. Assuming mRNA production occurs in random bursts, transcription rate is represented by either the burst frequency (how often the bursts occur) or the burst size (number of mRNAs produced in each burst). Our analysis shows that fluctuations in the transcription burst frequency enhance extrinsic noise but do not affect the intrinsic noise. On the contrary, fluctuations in the transcription burst size or mRNA translation rate dramatically increase both intrinsic and extrinsic noise components. Interestingly, simultaneous fluctuations in transcription and translation rates arising from randomness in ATP abundance can decrease intrinsic noise measured in a two-color reporter assay. Finally, we discuss how these formulas can be combined with single-cell gene expression data from two-color reporter experiments for estimating model parameters. PMID:24391934
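
    As a rough illustration of how two-color reporter data are split into intrinsic and extrinsic components, the sketch below uses the widely used dual-reporter estimators: intrinsic noise squared is <(x - y)^2> / (2<x><y>) and extrinsic noise squared is (<xy> - <x><y>) / (<x><y>). These are an assumption of the sketch, not the model-specific analytical formulas derived in the paper, and the function name and sample values are hypothetical.

```python
def decompose_noise(x, y):
    """Split dual-reporter expression noise into intrinsic and extrinsic parts.

    x[i] and y[i] are the two reporter levels measured in the same cell i.
    Returns (intrinsic^2, extrinsic^2), i.e. squared coefficients of variation.
    """
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    mean_xy = sum(a * b for a, b in zip(x, y)) / n
    mean_sq_diff = sum((a - b) ** 2 for a, b in zip(x, y)) / n
    norm = mean_x * mean_y
    intrinsic_sq = mean_sq_diff / (2.0 * norm)   # uncorrelated variation
    extrinsic_sq = (mean_xy - norm) / norm       # correlated variation
    return intrinsic_sq, extrinsic_sq


# Hypothetical cells in which both reporters always agree:
# all variability is shared, so the intrinsic component is zero.
intrinsic_sq, extrinsic_sq = decompose_noise([1.0, 3.0], [1.0, 3.0])
print(intrinsic_sq, extrinsic_sq)  # 0.0 0.25
```

    The two components sum to the total squared noise under these estimators, which is why the abstract can discuss how a parameter fluctuation shifts variability from one component to the other.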

  3. Lupin nad9 and nad6 genes and their expression: 5' termini of the nad9 gene transcripts differentiate lupin species.

    PubMed

    Rurek, Michał; Nuc, Katarzyna; Raczyńska, Katarzyna Dorota; Augustyniak, Halina

    2003-10-02

    The mitochondrial nad9 and nad6 genes were analyzed in four lupin species: Lupinus luteus, Lupinus angustifolius, Lupinus albus and Lupinus mutabilis. The nucleotide sequences of these genes confirmed their high conservation; however, a higher number of nucleotide substitutions was observed in the L. albus genes. Southern hybridizations confirmed the presence of a single copy of these genes in L. luteus, L. albus and L. angustifolius. The expression of the nad9 and nad6 genes was analyzed by Northern blotting in different tissue types of the analyzed lupin species. Transcription analyses of the two nad genes revealed a single predominant mRNA species of about 0.6 kb in L. luteus and L. angustifolius. The L. albus transcripts were larger in size. The nad9 and nad6 transcripts were modified by RNA editing at 8 and 11 positions, in L. luteus and L. angustifolius, respectively. The gene order, rps3-rpl16-nad9, found in Arabidopsis thaliana is also conserved in L. luteus and L. angustifolius mitochondria. L. luteus and L. angustifolius showed some variability in the sequence of the nad9 promoter region. This feature, along with the differences observed in the nad9 mRNA 5' termini of the two lupins, differentiates L. luteus from L. angustifolius.

  4. Lack of Association of Estrogen Receptor Alpha Gene Polymorphisms with Cardiorespiratory and Metabolic Variables in Young Women

    PubMed Central

    Rebelo, Ana Cristina; Verlengia, Rozangela; Kunz, Vandeni; Tamburus, Nayara; Cerda, Alvaro; Hirata, Rosario; Hirata, Mario; Silva, Ester

    2012-01-01

    This study examined the association of estrogen receptor alpha gene (ESR1) polymorphisms with cardiorespiratory and metabolic parameters in young women. In total, 354 healthy women were selected for cardiopulmonary exercise testing and short-term heart rate (HR) variability (HRV) evaluation. The HRV analysis was determined by the temporal indices rMSSD (square root of the mean squared differences of successive R–R intervals (RRi) divided by the number of RRi minus one), SDNN (root mean square of differences from mean RRi, divided by the number of RRi) and power spectrum components by low frequency (LF), high frequency (HF) and LF/HF ratio. Blood samples were obtained for serum lipids, estradiol and DNA extraction. ESR1 rs2234693 and rs9340799 polymorphisms were analyzed by PCR and fragment restriction analysis. HR and oxygen uptake (VO2) values did not differ between the ESR1 polymorphisms with respect to autonomic modulation. We did not find a relationship between ESR1 T–A, T–G, C–A and C–G haplotypes and cardiorespiratory and metabolic variables. Multiple linear regression analysis demonstrated that VO2, total cholesterol and triglycerides influence HRV (p < 0.05). The results suggest that ESR1 variants have no effect on cardiorespiratory and metabolic variables, while HRV indices are influenced by aerobic capacity and lipids in healthy women. PMID:23202974
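
    The two time-domain HRV indices defined in the abstract can be written out as a minimal sketch. It follows the definitions as worded there (rMSSD normalized by the number of intervals minus one, SDNN by the number of intervals); the function names and the R–R values in milliseconds are hypothetical.

```python
import math


def rmssd(rr_intervals):
    """Root mean square of successive R-R interval differences.

    N intervals give N - 1 successive differences, so the sum of
    squared differences is divided by N - 1.
    """
    diffs = [b - a for a, b in zip(rr_intervals, rr_intervals[1:])]
    return math.sqrt(sum(d * d for d in diffs) / len(diffs))


def sdnn(rr_intervals):
    """Root mean square deviation of R-R intervals from their mean,
    dividing by N as the abstract words it (population form)."""
    n = len(rr_intervals)
    mean_rr = sum(rr_intervals) / n
    return math.sqrt(sum((r - mean_rr) ** 2 for r in rr_intervals) / n)


# Hypothetical short recording, R-R intervals in milliseconds.
rr = [800.0, 810.0, 790.0, 800.0]
print(round(rmssd(rr), 2))  # 14.14
print(round(sdnn(rr), 2))   # 7.07
```

    Some HRV tools divide SDNN by N - 1 instead of N, so values from different software may differ slightly for short recordings.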

  5. Quantification of hookworm ova from wastewater matrices using quantitative PCR.

    PubMed

    Gyawali, Pradip; Ahmed, Warish; Sidhu, Jatinder P; Jagals, Paul; Toze, Simon

    2017-07-01

    A quantitative PCR (qPCR) assay was used to quantify Ancylostoma caninum ova in wastewater and sludge samples. We estimated the average gene copy numbers for a single ovum using a mixed population of ova. The average gene copy numbers derived from the mixed population were used to estimate numbers of hookworm ova in A. caninum seeded and unseeded wastewater and sludge samples. The newly developed qPCR assay estimated an average of 3.7×10³ gene copies per ovum, which was then validated by seeding known numbers of hookworm ova into treated wastewater. The qPCR estimated an average of (1.1±0.1), (8.6±2.9) and (67.3±10.4) ova for treated wastewater that was seeded with (1±0), (10±2) and (100±21) ova, respectively. The further application of the qPCR assay for the quantification of A. caninum ova was assessed by seeding known numbers of ova into the wastewater matrices. The qPCR results indicated that 50%, 90% and 67% of treated wastewater (1 L), raw wastewater (1 L) and sludge (~4 g) samples had variable numbers of A. caninum gene copies. After conversion of the qPCR-estimated gene copy numbers to ova, treated wastewater, raw wastewater, and sludge samples had an average of 0.02, 1.24 and 67 ova, respectively. The results of this study indicated that qPCR can be used for the quantification of hookworm ova from wastewater and sludge samples; however, caution is advised in interpreting qPCR-generated data for health risk assessment. Copyright © 2017. Published by Elsevier B.V.
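
    The conversion from gene copies to ova described above is a simple division, sketched below using the 3.7×10³ copies-per-ovum figure and the seeding results quoted in the abstract. The function names are hypothetical, and the sketch ignores DNA extraction losses and the ovum-to-ovum variability that the authors caution about.

```python
GENE_COPIES_PER_OVUM = 3.7e3  # average reported in the abstract


def estimated_ova(gene_copies):
    """Convert a qPCR gene copy estimate into an equivalent number of ova."""
    return gene_copies / GENE_COPIES_PER_OVUM


def recovery_percent(estimated, seeded):
    """qPCR-estimated ova as a percentage of the number of ova seeded."""
    return 100.0 * estimated / seeded


# Seeding results from the abstract: (seeded ova, qPCR-estimated ova).
for seeded, estimated in [(1, 1.1), (10, 8.6), (100, 67.3)]:
    print(seeded, f"{recovery_percent(estimated, seeded):.1f}%")
```

    On these figures the estimate tracks the seeded number closely at low counts and falls to roughly two thirds at 100 ova, which is consistent with the abstract's call for caution when using such data for risk assessment.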

  6. Gene expression signature of cerebellar hypoplasia in a mouse model of Down syndrome during postnatal development

    PubMed Central

    Laffaire, Julien; Rivals, Isabelle; Dauphinot, Luce; Pasteau, Fabien; Wehrle, Rosine; Larrat, Benoit; Vitalis, Tania; Moldrich, Randal X; Rossier, Jean; Sinkus, Ralph; Herault, Yann; Dusart, Isabelle; Potier, Marie-Claude

    2009-01-01

    Background Down syndrome is a chromosomal disorder caused by the presence of three copies of chromosome 21. The mechanisms by which this aneuploidy produces the complex and variable phenotype observed in people with Down syndrome are still under discussion. Recent studies have demonstrated an increased transcript level of the three-copy genes with some dosage compensation or amplification for a subset of them. The impact of this gene dosage effect on the whole transcriptome is still debated and longitudinal studies assessing the variability among samples, tissues and developmental stages are needed. Results We thus designed a large scale gene expression study in mice (the Ts1Cje Down syndrome mouse model) in which we could measure the effects of trisomy 21 on a large number of samples (74 in total) in a tissue that is affected in Down syndrome (the cerebellum) and where we could quantify the defect during postnatal development in order to correlate gene expression changes to the phenotype observed. Statistical analysis of microarray data revealed a major gene dosage effect: for the three-copy genes as well as for a 2 Mb segment from mouse chromosome 12 that we show for the first time as being deleted in the Ts1Cje mice. This gene dosage effect impacts moderately on the expression of euploid genes (2.4 to 7.5% differentially expressed). Only 13 genes were significantly dysregulated in Ts1Cje mice at all four postnatal development stages studied from birth to 10 days after birth, and among them are 6 three-copy genes. The decrease in granule cell proliferation demonstrated in newborn Ts1Cje cerebellum was correlated with a major gene dosage effect on the transcriptome in dissected cerebellar external granule cell layer. Conclusion High throughput gene expression analysis in the cerebellum of a large number of samples of Ts1Cje and euploid mice has revealed a prevailing gene dosage effect on triplicated genes. 
    Moreover, using an enriched cell population that is thought to be responsible for the cerebellar hypoplasia in Down syndrome, a global destabilization of gene expression was not detected. Altogether these results strongly suggest that the three-copy genes are directly responsible for the phenotype present in cerebellum. We provide here a short list of candidate genes. PMID:19331679

  7. Spinal Muscular Atrophy: Current Therapeutic Strategies

    NASA Astrophysics Data System (ADS)

    Kiselyov, Alex S.; Gurney, Mark E.

    Proximal spinal muscular atrophy (SMA) is an autosomal recessive disorder characterized by death of motor neurons in the spinal cord. SMA is caused by deletion and/or mutation of the survival motor neuron gene (SMN1) on chromosome 5q13. There are variable numbers of copies of a second, related gene named SMN2 located in proximity to SMN1. Both genes encode the same protein (Smn). Loss of SMN1 and incorrect splicing of SMN2 affect cellular levels of Smn, triggering death of motor neurons. The severity of SMA is directly related to the normal number of copies of SMN2 carried by the patient. A considerable effort has been dedicated to identifying modalities, including both biological and small molecule agents, that increase SMN2 promoter activity to upregulate gene transcription and produce increased quantities of full-length Smn protein. This review summarizes recent progress in the area and suggests a potential target product profile for an SMA therapeutic.

  8. Spectrum of Phenylalanine Hydroxylase Gene Mutations in Hamadan and Lorestan Provinces of Iran and Their Associations with Variable Number of Tandem Repeat Alleles.

    PubMed

    Alibakhshi, Reza; Moradi, Keivan; Biglari, Mostafa; Shafieenia, Samaneh

    2018-05-01

    Phenylketonuria (PKU) is one of the most common known inherited metabolic diseases. The present study aimed to investigate the status of molecular defects in the phenylalanine hydroxylase (PAH) gene in western Iranian PKU patients (predominantly from Kermanshah, Hamadan, and Lorestan provinces) during 2014-2016. Additionally, the results were compared with similar studies in Iran. Nucleotide sequence analysis of all 13 exons and their flanking intronic regions of the PAH gene was performed in 18 western Iranian PKU patients. Moreover, a variable number of tandem repeat (VNTR) locus located in the PAH gene was studied. The results revealed a mutational spectrum encompassing 11 distinct mutations distributed along the PAH gene sequence on 34 of the 36 mutant alleles (diagnostic efficiency of 94.4%). Also, four PAH VNTR alleles (with repeats of 3, 7, 8 and 9) were detected. The three most frequent mutations were IVS9+5G>A, IVS7-5T>C, and p.P281L, with frequencies of 27.8%, 11%, and 11%, respectively. The results showed that there is not only a consanguineous relation, but also a difference in the characteristics of PAH mutations between Kermanshah and the other two parts of western Iran (Hamadan and Lorestan). Also, it seems that the spectrum of mutations in western Iran is relatively distinct from other parts of the country, suggesting that this region might be a special PAH gene distribution region. Moreover, our findings can be useful in identifying genotype-phenotype relationships in patients, and can support future confirmatory diagnostic testing, prognosis, and prediction of disease severity in PKU patients.
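
    The diagnostic efficiency quoted above is the share of mutant alleles on which a mutation was identified. The arithmetic is reproduced below as a small sketch using the counts from the abstract (18 patients, two PAH alleles each, 34 alleles characterized); the function name is hypothetical.

```python
def detection_rate(characterized_alleles, total_alleles):
    """Percentage of mutant alleles on which a mutation was identified."""
    return 100.0 * characterized_alleles / total_alleles


patients = 18
total_alleles = 2 * patients  # PKU is recessive: two mutant PAH alleles per patient
rate = detection_rate(34, total_alleles)
print(f"{rate:.1f}%")  # 94.4%
```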

  9. The number of genes encoding repeat domain-containing proteins positively correlates with genome size in amoebal giant viruses

    PubMed Central

    Shukla, Avi; Chatterjee, Anirvan

    2018-01-01

    Curiously, in viruses, the virion volume appears to be predominantly driven by genome length rather than the number of proteins it encodes or geometric constraints. With their large genome and giant particle size, amoebal viruses (AVs) are ideally suited to study the relationship between genome and virion size and explore the role of genome plasticity in their evolutionary success. Different genomic regions of AVs exhibit distinct genealogies. Although the vertically transferred core genes and their functions are universally conserved across the nucleocytoplasmic large DNA virus (NCLDV) families and are essential for their replication, the horizontally acquired genes are variable across families and are lineage-specific. When compared with other giant virus families, we observed a near-linear increase in the number of genes encoding repeat domain-containing proteins (RDCPs) with the increase in the genome size of AVs. From what is known about the functions of RDCPs in bacteria and eukaryotes and their prevalence in the AV genomes, we envisage important roles for RDCPs in the life cycle of AVs, their genome expansion, and plasticity. This observation also supports the evolution of AVs from a smaller viral ancestor by the acquisition of diverse gene families from the environment, including RDCPs that might have helped in host adaptation. PMID:29308275

  10. Biodemographic Modeling of the Links Between Fertility Motivation and Fertility Outcomes in the NLSY79

    PubMed Central

    MILLER, WARREN B.; BARD, DAVID E.; PASTA, DAVID J.; RODGERS, JOSEPH LEE

    2010-01-01

    In spite of long-held beliefs that traits related to reproductive success tend to become fixed by evolution with little or no genetic variation, there is now considerable evidence that the natural variation of fertility within populations is genetically influenced and that a portion of that influence is related to the motivational precursors to fertility. We conduct a two-stage analysis to examine these inferences in a time-ordered multivariate context. First, using data from the National Longitudinal Survey of Youth, 1979, and LISREL analysis, we develop a structural equation model in which five hypothesized motivational precursors to fertility, measured in 1979–1982, predict both a child-timing and a child-number outcome, measured in 2002. Second, having chosen two time-ordered sequences of six variables from the SEM to represent our phenotypic models, we use Mx to conduct both univariate and multivariate behavioral genetic analyses with the selected variables. Our results indicate that one or more genes acting within a gene network have additive effects that operate through child-number desires to affect both the timing of the next child born and the final number of children born, that one or more genes acting through a separate network may have additive effects operating through gender role attitudes to produce downstream effects on the two fertility outcomes, and that no genetic variance is associated with either child-timing intentions or educational intentions. PMID:20608103

  11. Comparing machine learning and logistic regression methods for predicting hypertension using a combination of gene expression and next-generation sequencing data.

    PubMed

    Held, Elizabeth; Cape, Joshua; Tintle, Nathan

    2016-01-01

    Machine learning methods continue to show promise in the analysis of data from genetic association studies because of the high number of variables relative to the number of observations. However, few best practices exist for the application of these methods. We extend a recently proposed supervised machine learning approach for predicting disease risk by genotypes to be able to incorporate gene expression data and rare variants. We then apply 2 different versions of the approach (radial and linear support vector machines) to simulated data from Genetic Analysis Workshop 19 and compare performance to logistic regression. Method performance was not radically different across the 3 methods, although the linear support vector machine tended to show small gains in predictive ability relative to a radial support vector machine and logistic regression. Importantly, as the number of genes in the models was increased, even when those genes contained causal rare variants, model predictive ability showed a statistically significant decrease in performance for both the radial support vector machine and logistic regression. The linear support vector machine showed more robust performance to the inclusion of additional genes. Further work is needed to evaluate machine learning approaches on larger samples and to evaluate the relative improvement in model prediction from the incorporation of gene expression data.

  12. Hybrid stochastic simplifications for multiscale gene networks.

    PubMed

    Crudu, Alina; Debussche, Arnaud; Radulescu, Ovidiu

    2009-09-07

    Stochastic simulation of gene networks by Markov processes has important applications in molecular biology. The complexity of exact simulation algorithms scales with the number of discrete jumps to be performed. Approximate schemes reduce the computational time by reducing the number of simulated discrete events. Also, answering important questions about the relation between network topology and intrinsic noise generation and propagation should be based on general mathematical results. These general results are difficult to obtain for exact models. We propose a unified framework for hybrid simplifications of Markov models of multiscale stochastic gene networks dynamics. We discuss several possible hybrid simplifications, and provide algorithms to obtain them from pure jump processes. In hybrid simplifications, some components are discrete and evolve by jumps, while other components are continuous. Hybrid simplifications are obtained by partial Kramers-Moyal expansion [1-3] which is equivalent to the application of the central limit theorem to a sub-model. By averaging and variable aggregation we drastically reduce simulation time and eliminate non-critical reactions. Hybrid and averaged simplifications can be used for more effective simulation algorithms and for obtaining general design principles relating noise to topology and time scales. The simplified models reproduce with good accuracy the stochastic properties of the gene networks, including waiting times in intermittence phenomena, fluctuation amplitudes and stationary distributions. The methods are illustrated on several gene network examples. Hybrid simplifications can be used for onion-like (multi-layered) approaches to multi-scale biochemical systems, in which various descriptions are used at various scales. Sets of discrete and continuous variables are treated with different methods and are coupled together in a physically justified approach.
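
    To make concrete why the cost of exact simulation scales with the number of discrete jumps, here is a minimal sketch of Gillespie's direct method for a single-species birth-death model of mRNA (constant production, first-order degradation). This is the exact baseline that the abstract contrasts with, not the authors' hybrid scheme; the model, function name and parameter values are assumptions chosen for illustration.

```python
import random


def gillespie_birth_death(k_prod, k_deg, t_end, n0=0, seed=0):
    """Exact stochastic simulation of: 0 -> mRNA (rate k_prod),
    mRNA -> 0 (rate k_deg * n).

    Returns the trajectory [(time, copy_number), ...] and the number of
    jumps simulated. Each loop iteration simulates exactly one reaction
    event, so run time grows with the number of jumps.
    """
    rng = random.Random(seed)
    t, n = 0.0, n0
    jumps = 0
    trajectory = [(t, n)]
    while True:
        a_prod = k_prod            # propensity of production
        a_deg = k_deg * n          # propensity of degradation
        a_total = a_prod + a_deg
        if a_total == 0.0:
            break                  # no reaction can fire
        t += rng.expovariate(a_total)  # waiting time to the next event
        if t > t_end:
            break
        # Choose which reaction fires, proportionally to its propensity.
        if rng.random() * a_total < a_prod:
            n += 1
        else:
            n -= 1
        jumps += 1
        trajectory.append((t, n))
    return trajectory, jumps


trajectory, jumps = gillespie_birth_death(k_prod=10.0, k_deg=1.0, t_end=5.0)
print(jumps, trajectory[-1])
```

    Raising k_prod increases the number of events per unit time and therefore the run time, which is the cost that approximate and hybrid schemes aim to reduce by treating abundant species as continuous variables.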

  13. Genome-Wide Analysis of NBS-LRR Genes in Sorghum Genome Revealed Several Events Contributing to NBS-LRR Gene Evolution in Grass Species

    PubMed Central

    Yang, Xiping; Wang, Jianping

    2016-01-01

    The nucleotide-binding site (NBS)–leucine-rich repeat (LRR) gene family is crucially important for offering resistance to pathogens. To explore evolutionary conservation and variability of NBS-LRR genes across grass species, we identified 88, 107, 24, and 44 full-length NBS-LRR genes in sorghum, rice, maize, and Brachypodium, respectively. A comprehensive analysis was performed on classification, genome organization, evolution, expression, and regulation of these NBS-LRR genes using sorghum as a representative of grass species. In general, the full-length NBS-LRR genes are highly clustered and duplicated in the sorghum genome, mainly due to local duplications. NBS-LRR genes have basal expression levels and are potentially highly targeted by miRNA. The number of NBS-LRR genes in the four grass species is positively correlated with the gene clustering rate. The results provided a valuable genomic resource and insights for functional and evolutionary studies of NBS-LRR genes in grass species. PMID:26792976

  14. Relationships between Gene Structure and Genome Instability in Flowering Plants.

    PubMed

    Bennetzen, Jeffrey L; Wang, Xuewen

    2018-03-05

    Flowering plant (angiosperm) genomes are exceptional in their variability with respect to genome size, ploidy, chromosome number, gene content, and gene arrangement. Gene movement, although observed in some of the earliest plant genome comparisons, has been relatively underinvestigated. We present herein a description of several interesting properties of plant gene and genome structure that are pertinent to the successful movement of a gene to a new location. These considerations lead us to propose a model that can explain the frequent success of plant gene mobility, namely that Small Insulated Genes Move Around (SIGMAR). The SIGMAR model is then compared with known processes for gene mobilization, and predictions of the SIGMAR model are formulated to encourage future experimentation. The overall results indicate that the frequent gene movement in angiosperm genomes is partly an outcome of the unusual properties of angiosperm genes, especially their small size and insulation from epigenetic silencing. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.

  15. Identification of genome regions determining semen quality in Holstein-Friesian bulls using information theory.

    PubMed

    Borowska, Alicja; Szwaczkowski, Tomasz; Kamiński, Stanisław; Hering, Dorota M; Kordan, Władysław; Lecewicz, Marek

    2018-05-01

    Use of information theory can be an alternative statistical approach to detect genome regions and candidate genes that are associated with livestock traits. The aim of this study was to verify the validity of SNP effects on some semen quality variables of bulls using entropy analysis. Records from 288 Holstein-Friesian bulls from one AI station were included. The following semen quality variables were analyzed: CASA kinematic variables of sperm (total motility, average path velocity, straight line velocity, curvilinear velocity, amplitude of lateral head displacement, beat cross frequency, straightness, linearity), sperm membrane integrity (plasmolemma, mitochondrial function), and sperm ATP content. Molecular data included 48,192 SNPs. After filtering (call rate = 0.95 and MAF = 0.05), 34,794 SNPs were included in the entropy analysis. The entropy and conditional entropy were estimated for each SNP. Conditional entropy quantifies the remaining uncertainty about values of the variable with the knowledge of SNP. The most informative SNPs for each variable were determined. The computations were performed using the R statistical package. A majority of the loci had relatively small contributions. The most informative SNPs for all variables were mainly located on chromosomes 3, 4, 5 and 16. The results from the study indicate that important genome regions and candidate genes that determine semen quality variables in bulls are located on a number of chromosomes. Some detected clusters of SNPs were located in RNA (U6 and 5S_rRNA) for all the variables for which analysis occurred. Associations between PARK2 as well as GALNT13 genes and some semen characteristics were also detected. Copyright © 2018 Elsevier B.V. All rights reserved.
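
    The entropy and conditional entropy mentioned in this abstract can be illustrated with a minimal Python sketch. The abstract reports that the original computations were done in R and does not say how continuous semen variables were binned, so the discretised trait classes, the genotype coding (0/1/2) and all names below are hypothetical assumptions rather than the authors' procedure.

```python
from collections import Counter
from math import log2


def entropy(values):
    """Shannon entropy H(X), in bits, of a sequence of discrete values."""
    n = len(values)
    return -sum((c / n) * log2(c / n) for c in Counter(values).values())


def conditional_entropy(trait, genotype):
    """H(trait | genotype): uncertainty about the trait left once the SNP is known."""
    n = len(trait)
    groups = {}
    for t, g in zip(trait, genotype):
        groups.setdefault(g, []).append(t)
    # Weighted average of the trait entropy within each genotype class
    return sum(len(ts) / n * entropy(ts) for ts in groups.values())


# Hypothetical toy data: a trait already binned into classes, and two SNPs
trait = ["low", "low", "high", "high", "high", "low", "high", "low"]
snp_informative = [0, 0, 2, 2, 2, 0, 2, 0]    # genotype fully determines the class
snp_uninformative = [0, 1, 0, 1, 0, 0, 1, 1]  # genotype says nothing about the class

for name, snp in [("informative", snp_informative), ("uninformative", snp_uninformative)]:
    h_cond = conditional_entropy(trait, snp)
    print(name, "H(trait|SNP) =", round(h_cond, 3), "gain =", round(entropy(trait) - h_cond, 3))
```

    Under these assumptions, a SNP is more informative the further H(trait | SNP) falls below H(trait); ranking SNPs by that difference is one simple way to pick the "most informative" loci.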

  16. Deletions spanning the neurofibromatosis I gene: Identification and phenotype of five patients

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kayes, L.M.; Burke, W.; Bennett, R.

    Neurofibromatosis type 1 (NF1) is an autosomal dominant disorder characterized by marked variation in clinical severity. To investigate the contribution to variability by genes either contiguous to or contained within the NF1 gene, the authors screened six NF1 patients with mild facial dysmorphology, mental retardation, and/or learning disabilities, for DNA rearrangement of the NF1 region. Five of the six patients had NF1 gene deletions on the basis of quantitative densitometry, locus hemizygosity, and analysis of somatic cell hybrid lines. Analysis of hybrid lines carrying each of the patients' chromosomes 17, with 15 regional DNA markers, demonstrated that each of the five patients carried a deletion >700 kb in size. Minimally, each of the deletions involved the entire 350-kb NF1 gene; the three genes (EVI2A, EVI2B, and OMG) that are contained within an NF1 intron; and considerable flanking DNA. For four of the patients, the deletions mapped to the same interval; the deletion in the fifth patient was larger, extending farther in both directions. The remaining NF1 allele presumably produced functional neurofibromin; no gene rearrangements were detected, and RNA-PCR demonstrated that it was transcribed. These data provide compelling evidence that the NF1 disorder results from haploid insufficiency of neurofibromin. Of the three documented de novo deletion cases, two involved the paternal NF1 allele and one the maternal allele. The parental origin of the single remaining expressed NF1 allele had no dramatic effect on patient phenotype. The deletion patients exhibited a variable number of physical anomalies that were not correlated with the extent of their deletion. All five patients with deletions were remarkable for exhibiting a large number of neurofibromas for their age, suggesting that deletion of an unknown gene in the NF1 region may affect tumor initiation or development. 69 refs., 5 figs., 1 tab.

  17. ALK gene copy number gain and its clinical significance in hepatocellular carcinoma.

    PubMed

    Jia, Shou-Wei; Fu, Sha; Wang, Fang; Shao, Qiong; Huang, Hong-Bing; Shao, Jian-Yong

    2014-01-07

    To examine the status and clinical significance of anaplastic lymphoma kinase (ALK) gene alterations in hepatocellular carcinoma (HCC) patients. A total of 213 cases of HCC were examined by fluorescent in situ hybridization using dual color break-apart ALK probes for the detection of chromosomal translocation and gene copy number gain. HCC tissue microarrays were constructed, and the correlation between the ALK status and clinicopathological variables was assessed by χ² test or Fisher's exact test. Survival analysis was estimated using the Kaplan-Meier approach with a Log-rank test. Univariate and multivariate analyses of clinical variables were performed using the Cox proportional hazards regression model. ALK gene translocation was not observed in any of the HCC cases included in the present study. ALK gene copy number gain (ALK/CNG) (≥ 4 copies/cell) was detected in 28 (13.15%) of the 213 HCC patients. The 3-year progression-free-survival (PFS) rate for ALK/CNG-positive HCC patients was significantly poorer than ALK/CNG-negative patients (27.3% vs 42.5%, P = 0.048), especially for patients with advanced stage III/IV (0% vs 33.5%, P = 0.007), and patients with grade III disease (24.8% vs 49.9%, P = 0.023). ALK/CNG-positive HCC patients had a significantly poorer prognosis than ALK/CNG-negative patients in the subgroup that was negative for serum hepatitis B virus DNA, with significantly different 3-year overall survival rates (18.2% vs 63.6%, P = 0.021) and PFS rates (18.2% vs 46.9%, P = 0.019). Multivariate Cox proportional hazards regression analysis suggested that ALK/CNG prevalence can predict death in HCC (HR = 1.596; 95%CI: 1.008-2.526, P = 0.046). ALK/CNG, but not translocation of ALK, is present in HCC and may be an unfavorable prognostic predictor.

  18. ALK gene copy number gain and its clinical significance in hepatocellular carcinoma

    PubMed Central

    Jia, Shou-Wei; Fu, Sha; Wang, Fang; Shao, Qiong; Huang, Hong-Bing; Shao, Jian-Yong

    2014-01-01

    AIM: To examine the status and clinical significance of anaplastic lymphoma kinase (ALK) gene alterations in hepatocellular carcinoma (HCC) patients. METHODS: A total of 213 cases of HCC were examined by fluorescent in situ hybridization using dual color break-apart ALK probes for the detection of chromosomal translocation and gene copy number gain. HCC tissue microarrays were constructed, and the correlation between the ALK status and clinicopathological variables was assessed by χ² test or Fisher’s exact test. Survival analysis was estimated using the Kaplan-Meier approach with a Log-rank test. Univariate and multivariate analyses of clinical variables were performed using the Cox proportional hazards regression model. RESULTS: ALK gene translocation was not observed in any of the HCC cases included in the present study. ALK gene copy number gain (ALK/CNG) (≥ 4 copies/cell) was detected in 28 (13.15%) of the 213 HCC patients. The 3-year progression-free-survival (PFS) rate for ALK/CNG-positive HCC patients was significantly poorer than ALK/CNG-negative patients (27.3% vs 42.5%, P = 0.048), especially for patients with advanced stage III/IV (0% vs 33.5%, P = 0.007), and patients with grade III disease (24.8% vs 49.9%, P = 0.023). ALK/CNG-positive HCC patients had a significantly poorer prognosis than ALK/CNG-negative patients in the subgroup that was negative for serum hepatitis B virus DNA, with significantly different 3-year overall survival rates (18.2% vs 63.6%, P = 0.021) and PFS rates (18.2% vs 46.9%, P = 0.019). Multivariate Cox proportional hazards regression analysis suggested that ALK/CNG prevalence can predict death in HCC (HR = 1.596; 95%CI: 1.008-2.526, P = 0.046). CONCLUSION: ALK/CNG, but not translocation of ALK, is present in HCC and may be an unfavorable prognostic predictor. PMID:24415871

  19. Diversity of ARSACS mutations in French-Canadians.

    PubMed

    Thiffault, I; Dicaire, M J; Tetreault, M; Huang, K N; Demers-Lamarche, J; Bernard, G; Duquette, A; Larivière, R; Gehring, K; Montpetit, A; McPherson, P S; Richter, A; Montermini, L; Mercier, J; Mitchell, G A; Dupré, N; Prévost, C; Bouchard, J P; Mathieu, J; Brais, B

    2013-01-01

    The growing number of spastic ataxia of Charlevoix-Saguenay (SACS) gene mutations reported worldwide has broadened the clinical phenotype of autosomal recessive spastic ataxia of Charlevoix-Saguenay (ARSACS). The identification of Quebec ARSACS cases without two known SACS mutations led to the development of a multi-modal genomic strategy to uncover mutations in this large gene and explore phenotype variability. We searched for SACS mutations by combining various methods on 20 cases with a classical French-Canadian ARSACS phenotype without two mutations and a group of 104 sporadic or recessive spastic ataxia cases of unknown cause. Western blot on lymphoblast protein from cases with different genotypes was probed to establish if they still expressed sacsin. A total of 12 mutations, including 7 novel ones, were uncovered in Quebec ARSACS cases. The screening of 104 spastic ataxia cases of unknown cause for 98 SACS mutations did not uncover carriers of two mutations. Compound heterozygotes for one missense SACS mutation were found to minimally express sacsin. The large number of SACS mutations present even in Quebec suggests that the size of the gene alone may explain the great genotypic diversity. This study does not support an expanding ARSACS phenotype in the French-Canadian population. Most mutations lead to loss of function, though phenotypic variability in other populations may reflect partial loss of function with preservation of some sacsin expression. Our results also highlight the challenge of SACS mutation screening and the necessity to develop new generation sequencing methods to ensure low cost complete gene sequencing.

  20. Low frequency of broadly neutralizing HIV antibodies during chronic infection even in quaternary epitope targeting antibodies containing large numbers of somatic mutations.

    PubMed

    Hicar, Mark D; Chen, Xuemin; Kalams, Spyros A; Sojar, Hakimuddin; Landucci, Gary; Forthal, Donald N; Spearman, Paul; Crowe, James E

    2016-02-01

    Neutralizing antibodies (Abs) are thought to be a critical component of an appropriate HIV vaccine response. It has been proposed that Abs recognizing conformationally dependent quaternary epitopes on the HIV envelope (Env) trimer may be necessary to neutralize diverse HIV strains. A number of recently described broadly neutralizing monoclonal Abs (mAbs) recognize complex and quaternary epitopes. Generally, many such Abs exhibit extensive numbers of somatic mutations and unique structural characteristics. We sought to characterize the native antibody (Ab) response against circulating HIV focusing on such conformational responses, without a prior selection based on neutralization. Using a capture system based on VLPs incorporating cleaved envelope protein, we identified a selection of B cells that produce quaternary epitope targeting Abs (QtAbs). Similar to a number of broadly neutralizing Abs, the Ab genes encoding these QtAbs showed extensive numbers of somatic mutations. However, when expressed as recombinant molecules, these Abs failed to neutralize virus or mediate ADCVI activity. Molecular analysis showed unusually high numbers of mutations in the Ab heavy chain framework 3 region of the variable genes. The analysis suggests that large numbers of somatic mutations occur in Ab genes encoding HIV Abs in chronically infected individuals in a non-directed, stochastic, manner. Copyright © 2015 Elsevier Ltd. All rights reserved.

  1. Molecular characterization of FXI deficiency.

    PubMed

    Berber, Ergul

    2011-02-01

    Factor XI (FXI) deficiency is a rare autosomal bleeding disease associated with genetic defects in the FXI gene. It is a heterogeneous disorder with a variable bleeding tendency and variable causative FXI gene mutations. It is characterized as a cross-reacting material-negative (CRM-) FXI deficiency due to decreased FXI levels or cross-reacting material-positive (CRM+) FXI deficiency due to impaired FXI function. An increasing number of mutations have been reported in the FXI mutation database, and most of them affect the serine protease (SP) domain of the protein. Functional characterization of the mutations helps to better understand the molecular basis of FXI deficiency. Prevalence of the disease is higher in certain populations such as Ashkenazi Jews. The purpose of this review is to give an overview of the molecular basis of congenital FXI deficiency.

  2. Synthetic maps of human gene frequencies in Europeans.

    PubMed

    Menozzi, P; Piazza, A; Cavalli-Sforza, L

    1978-09-01

    Multivariate techniques can be used to condense the information for a large number of loci and alleles into one or a few synthetic variables. The geographic distribution of synthetic variables can be plotted by the same technique used in mapping the gene frequency of a single allele. Synthetic maps were constructed for Europe and the Near East, with the use of principal components to condense the information of 38 independent alleles from ten loci. The first principal component summarizes close to 30% of the total information and shows gradients. Maps thus constructed show clines in remarkable agreement with those expected on the basis of the spread of early farming in Europe, thus supporting the hypothesis that this spread was a demic spread rather than a cultural diffusion of farming technology.

  3. [The influence of immobilized fibronectin on karyotypic variability of two rat kangaroo kidney cell lines].

    PubMed

    Polianskaia, G G; Goriachaia, T S; Pinaev, G P

    2007-01-01

    The numerical and structural karyotypic variability was investigated in "markerless" rat kangaroo kidney cell lines NBL-3-17 and NBL-3-11 cultivated on a fibronectin-coated surface. In cell line NBL-3-17, cultivated on the fibronectin-coated surface for 1, 2, 4 and 8 days, the distribution of cells by chromosome number changed. These changes involve a significant decrease in the frequency of cells with the modal number of chromosomes, and an increase in the frequency of cells with a lower chromosome number. Many new additional structural variants of the karyotype (SVK) appear. The observed alterations seem to be due to preferential adhesion of cells with a lower chromosome number, disturbances of the mitotic apparatus and selection of SVK that are better adapted to changes in culture conditions. Detachment of cells from the fibronectin-coated surface, followed by 5 days of cultivation on a hydrophilic surface, restored the control distribution. In cell line NBL-3-11, cultivated on the fibronectin-coated surface for 1, 2, 4 and 8 days, the character of numerical karyotypic variability did not change compared to control variants. In cell line NBL-3-17 the frequency of chromosomal aberrations under cultivation on the fibronectin-coated surface for 1, 2, 4 and 8 days did not change relative to control variants. In cell line NBL-3-11 the frequency of chromosomal aberrations under the same conditions increased significantly relative to control variants, mainly at the expense of chromosomal and chromatid breaks and dicentrics (telomeric associations). We discuss possible reasons for the differences in the character of numerical and structural karyotypic variability between cell lines NBL-3-17 (hypotriploid) and NBL-3-11 (hypodiploid) under cultivation on fibronectin. The reasons for the observed interline karyotypic differences possibly lie in the peculiarities of the karyotypic structure of cell line NBL-3-11 and in a change of gene expression, namely in the dose of certain functioning genes in the hypotriploid cell line NBL-3-17.

  4. Stochastic model for gene transcription on Drosophila melanogaster embryos

    NASA Astrophysics Data System (ADS)

    Prata, Guilherme N.; Hornos, José Eduardo M.; Ramos, Alexandre F.

    2016-02-01

    We examine immunostaining experimental data for the formation of stripe 2 of even-skipped (eve) transcripts on D. melanogaster embryos. An estimate of the factor converting immunofluorescence intensity units into molecular numbers is given. The analysis of the eve dynamics at the region of stripe 2 suggests that the promoter site of the gene has two distinct regimes: an earlier phase when it is predominantly activated until a critical time when it becomes mainly repressed. That suggests proposing a stochastic binary model for gene transcription on D. melanogaster embryos. Our model has two random variables: the transcripts number and the state of the source of mRNAs given as active or repressed. We are able to reproduce available experimental data for the average number of transcripts. An analysis of the random fluctuations on the number of eves and their consequences on the spatial precision of stripe 2 is presented. We show that the position of the anterior or posterior border fluctuates around its average position by ~1% of the embryo length, which is similar to what is found experimentally. The fitting of data by such a simple model suggests that it can be useful to understand the functions of randomness during developmental processes.
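
    A two-variable model of this kind (transcript number plus an active/repressed promoter state) can be explored numerically with a Gillespie-type simulation. The sketch below is only an illustration under assumed constant rates; the rate names (k_on, k_off, k_tx, k_deg) and their values are hypothetical and are not taken from the paper, whose analysis may well be analytical rather than simulation-based.

```python
import random


def simulate_binary_gene(k_on, k_off, k_tx, k_deg, t_end, seed=0):
    """Gillespie simulation of a gene switching between active and repressed states.

    Returns a list of (time, state, mrna) tuples; state is 1 (active) or 0 (repressed).
    """
    rng = random.Random(seed)
    t, state, mrna = 0.0, 0, 0
    path = [(t, state, mrna)]
    while True:
        # Propensities: promoter switch, transcription (active state only), mRNA decay
        a_switch = k_on if state == 0 else k_off
        a_tx = k_tx if state == 1 else 0.0
        a_deg = k_deg * mrna
        total = a_switch + a_tx + a_deg
        t += rng.expovariate(total)  # waiting time to the next reaction
        if t > t_end:
            break
        u = rng.random() * total     # choose which reaction fires
        if u < a_switch:
            state = 1 - state
        elif u < a_switch + a_tx:
            mrna += 1
        else:
            mrna -= 1
        path.append((t, state, mrna))
    return path


# Hypothetical rates, chosen only to produce a visible trajectory
path = simulate_binary_gene(k_on=0.5, k_off=0.5, k_tx=10.0, k_deg=1.0, t_end=50.0, seed=1)
print("events:", len(path) - 1, "final (time, state, mRNA):", path[-1])
```

    Averaging many such trajectories gives the mean transcript number over time, and the spread between trajectories gives a rough picture of the fluctuations the abstract discusses.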

  5. A statistical approach to identify, monitor, and manage incomplete curated data sets.

    PubMed

    Howe, Douglas G

    2018-04-02

    Many biological knowledge bases gather data through expert curation of published literature. High data volume, selective partial curation, delays in access, and publication of data prior to the ability to curate it can result in incomplete curation of published data. Knowing which data sets are incomplete and how incomplete they are remains a challenge. Awareness that a data set may be incomplete is important for proper interpretation, to avoid flawed hypothesis generation, and can justify further exploration of published literature for additional relevant data. Computational methods to assess data set completeness are needed. One such method is presented here. In this work, a multivariate linear regression model was used to identify genes in the Zebrafish Information Network (ZFIN) Database having incomplete curated gene expression data sets. Starting with 36,655 gene records from ZFIN, data aggregation, cleansing, and filtering reduced the set to 9870 gene records suitable for training and testing the model to predict the number of expression experiments per gene. Feature engineering and selection identified the following predictive variables: the number of journal publications; the number of journal publications already attributed for gene expression annotation; the percent of journal publications already attributed for expression data; the gene symbol; and the number of transgenic constructs associated with each gene. Twenty-five percent of the gene records (2483 genes) were used to train the model. The remaining 7387 genes were used to test the model. One hundred and twenty-two and 165 of the 7387 tested genes were identified as missing expression annotations based on their residuals being outside the model lower or upper 95% confidence interval respectively. The model had precision of 0.97 and recall of 0.71 at the negative 95% confidence interval and precision of 0.76 and recall of 0.73 at the positive 95% confidence interval. This method can be used to identify data sets that are incompletely curated, as demonstrated using the gene expression data set from ZFIN. This information can help both database resources and data consumers gauge when it may be useful to look further for published data to augment the existing expertly curated information.
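
    The residual-based flagging described in this abstract can be sketched in a much simplified form. The example below assumes a single predictor (publication count), fits and flags on the same toy data, and uses ±1.96 residual standard deviations as a stand-in for the 95% interval; the paper itself used a multivariate model with a separate training set, so every name, number and threshold here is a hypothetical illustration.

```python
from statistics import mean, stdev


def fit_line(x, y):
    """Ordinary least squares for y = a + b*x with a single predictor."""
    mx, my = mean(x), mean(y)
    b = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / sum((xi - mx) ** 2 for xi in x)
    return my - b * mx, b


def flag_outliers(x, y, a, b, z=1.96):
    """Return (low, high): indices whose residual is beyond -z or +z residual SDs."""
    residuals = [yi - (a + b * xi) for xi, yi in zip(x, y)]
    sd = stdev(residuals)
    low = [i for i, r in enumerate(residuals) if r < -z * sd]
    high = [i for i, r in enumerate(residuals) if r > z * sd]
    return low, high


# Hypothetical toy data: publications per gene vs. curated expression experiments.
# Gene 5 has far fewer curated experiments than its publication count suggests.
publications = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11]
experiments = [2, 4, 6, 8, 10, 3, 14, 16, 18, 20, 22]

a, b = fit_line(publications, experiments)
low, high = flag_outliers(publications, experiments, a, b)
print("slope:", round(b, 3), "flagged below:", low, "flagged above:", high)
```

    In this toy setting, a gene flagged on the low side has fewer recorded experiments than the fit predicts, which is the kind of record one might revisit for missed literature.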

  6. Correlation of Metabolic Variables with the Number of ORFs in Human Pathogenic and Phylogenetically Related Non- or Less-Pathogenic Bacteria.

    PubMed

    Brambila-Tapia, Aniel Jessica Leticia; Poot-Hernández, Augusto Cesar; Garcia-Guevara, Jose Fernando; Rodríguez-Vázquez, Katya

    2016-06-01

    To date, a few works have performed a correlation of metabolic variables in bacteria; however, specific correlations with these variables have not been reported. In this work, we included 36 human pathogenic bacteria and 18 non- or less-pathogenic-related bacteria and obtained all metabolic variables, including enzymes, metabolic pathways, enzymatic steps and specific metabolic pathways, and enzymatic steps of particular metabolic processes, from a reliable metabolic database (KEGG). Then, we correlated the number of open reading frames (ORFs) with these variables and with the proportions of these variables, and we observed a negative correlation with the proportion of enzymes (r = -0.506, p < 0.0001), metabolic pathways (r = -0.871, p < 0.0001), enzymatic reactions (r = -0.749, p < 0.0001), and with the proportions of central metabolism variables as well as a positive correlation with the proportions of multistep reactions (r = 0.650, p < 0.0001) and secondary metabolism variables. The proportion of multifunctional reactions (r = -0.114, p = 0.41) and the proportion of enzymatic steps (r = -0.205, p = 0.14) did not present a significant correlation. These correlations indicate that as the size of a genome (measured in the number of ORFs) increases, the proportion of genes that encode enzymes significantly diminishes (especially those related to central metabolism), suggesting that when essential metabolic pathways are complete, an increase in the number of ORFs does not require a similar increase in the metabolic pathways and enzymes, but only a slight increase is sufficient to cope with a large genome.

  7. Heme oxygenase-1 gene promoter microsatellite polymorphism is associated with progressive atherosclerosis and incident cardiovascular disease.

    PubMed

    Pechlaner, Raimund; Willeit, Peter; Summerer, Monika; Santer, Peter; Egger, Georg; Kronenberg, Florian; Demetz, Egon; Weiss, Günter; Tsimikas, Sotirios; Witztum, Joseph L; Willeit, Karin; Iglseder, Bernhard; Paulweber, Bernhard; Kedenko, Lyudmyla; Haun, Margot; Meisinger, Christa; Gieger, Christian; Müller-Nurasyid, Martina; Peters, Annette; Willeit, Johann; Kiechl, Stefan

    2015-01-01

    The enzyme heme oxygenase-1 (HO-1) exerts cytoprotective effects in response to various cellular stressors. A variable number tandem repeat polymorphism in the HO-1 gene promoter region has previously been linked to cardiovascular disease. We examined this association prospectively in the general population. Incidence of stroke, myocardial infarction, or vascular death was registered between 1995 and 2010 in 812 participants of the Bruneck Study aged 45 to 84 years (49.4% males). Carotid atherosclerosis progression was quantified by high-resolution ultrasound. HO-1 variable number tandem repeat length was determined by polymerase chain reaction. Subjects with ≥32 tandem repeats on both HO-1 alleles compared with the rest of the population (recessive trait) featured substantially increased cardiovascular disease risk (hazard ratio [95% confidence interval], 5.45 [2.39, 12.42]; P<0.0001), enhanced atherosclerosis progression (median difference in atherosclerosis score [interquartile range], 2.1 [0.8, 5.6] versus 0.0 [0.0, 2.2] mm; P=0.0012), and a trend toward higher levels of oxidized phospholipids on apolipoprotein B-100 (median oxidized phospholipids/apolipoprotein B level [interquartile range], 11364 [4160, 18330] versus 4844 [3174, 12284] relative light units; P=0.0554). Increased cardiovascular disease risk in those homozygous for ≥32 repeats was also detected in a pooled analysis of 7848 participants of the Bruneck, SAPHIR, and KORA prospective studies (hazard ratio [95% confidence interval], 3.26 [1.50, 7.33]; P=0.0043). This study found a strong association between the HO-1 variable number tandem repeat polymorphism and cardiovascular disease risk confined to subjects with a high number of repeats on both HO-1 alleles and provides evidence for accelerated atherogenesis and decreased antioxidant defense in this vascular high-risk group. © 2014 American Heart Association, Inc.

  8. Analysis of the GRNs Inference by Using Tsallis Entropy and a Feature Selection Approach

    NASA Astrophysics Data System (ADS)

    Lopes, Fabrício M.; de Oliveira, Evaldo A.; Cesar, Roberto M.

    An important problem in the bioinformatics field is to understand how genes are regulated and interact through gene networks. This knowledge can be helpful for many applications, such as disease treatment design and drug development. For this reason, it is very important to uncover the functional relationship among genes and then to construct the gene regulatory network (GRN) from temporal expression data. However, this task usually involves data with a large number of variables and a small number of observations. There is therefore a strong motivation to use pattern recognition and dimensionality reduction approaches. In particular, feature selection is especially important in order to select the most important predictor genes that can explain some phenomena associated with the target genes. This work presents a first study of the sensitivity of entropy methods to the entropy functional form, applied to the problem of topology recovery of GRNs. The generalized entropy proposed by Tsallis is used to study this sensitivity. The inference process is based on a feature selection approach, which is applied to simulated temporal expression data generated by an artificial gene network (AGN) model. The inferred GRNs are validated in terms of global network measures. Some interesting conclusions can be drawn from the experimental results, as reported for the first time in the present paper.
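
    For readers unfamiliar with the generalized entropy named here, the following sketch computes the standard Tsallis form S_q = (1 - Σ p_i^q) / (q - 1), which tends to the Shannon entropy (in nats) as q approaches 1. The function name and the example distribution are hypothetical, and the sketch does not reproduce the paper's feature selection procedure.

```python
from math import log


def tsallis_entropy(probs, q):
    """Tsallis entropy S_q = (1 - sum(p_i**q)) / (q - 1).

    For q == 1 the Shannon entropy in nats is returned, which is the q -> 1 limit.
    """
    if abs(q - 1.0) < 1e-12:
        return -sum(p * log(p) for p in probs if p > 0)
    return (1.0 - sum(p ** q for p in probs if p > 0)) / (q - 1.0)


# Hypothetical distribution of a discretised expression signal
probs = [0.5, 0.5]
for q in (0.5, 1.0, 2.0):
    print("q =", q, "S_q =", round(tsallis_entropy(probs, q), 4))
```

    Varying q changes how strongly rare and common states are weighted, which is the functional-form dependence the abstract sets out to study.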

  9. Spectral gene set enrichment (SGSE).

    PubMed

    Frost, H Robert; Li, Zhigang; Moore, Jason H

    2015-03-03

    Gene set testing is typically performed in a supervised context to quantify the association between groups of genes and a clinical phenotype. In many cases, however, a gene set-based interpretation of genomic data is desired in the absence of a phenotype variable. Although methods exist for unsupervised gene set testing, they predominantly compute enrichment relative to clusters of the genomic variables with performance strongly dependent on the clustering algorithm and number of clusters. We propose a novel method, spectral gene set enrichment (SGSE), for unsupervised competitive testing of the association between gene sets and empirical data sources. SGSE first computes the statistical association between gene sets and principal components (PCs) using our principal component gene set enrichment (PCGSE) method. The overall statistical association between each gene set and the spectral structure of the data is then computed by combining the PC-level p-values using the weighted Z-method with weights set to the PC variance scaled by Tracy-Widom test p-values. Using simulated data, we show that the SGSE algorithm can accurately recover spectral features from noisy data. To illustrate the utility of our method on real data, we demonstrate the superior performance of the SGSE method relative to standard cluster-based techniques for testing the association between MSigDB gene sets and the variance structure of microarray gene expression data. Unsupervised gene set testing can provide important information about the biological signal held in high-dimensional genomic data sets. Because it uses the association between gene sets and sample PCs to generate a measure of unsupervised enrichment, the SGSE method is independent of cluster or network creation algorithms and, most importantly, is able to utilize the statistical significance of PC eigenvalues to ignore elements of the data most likely to represent noise.
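
    The p-value combination step named in this abstract, the weighted Z-method, can be sketched generically as follows. The sketch assumes one-sided p-values strictly between 0 and 1 and takes the weights as given; in SGSE the weights are described as PC variances scaled by Tracy-Widom p-values, which are not computed here, so the weights and p-values below are purely hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def weighted_z(p_values, weights):
    """Combine one-sided p-values with the weighted Z-method (weighted Stouffer)."""
    nd = NormalDist()
    # Convert each p-value to a standard normal score
    z = [nd.inv_cdf(1.0 - p) for p in p_values]
    # Weighted sum of scores, rescaled so the result is again standard normal
    combined = sum(w * zi for w, zi in zip(weights, z)) / sqrt(sum(w * w for w in weights))
    return 1.0 - nd.cdf(combined)


# Hypothetical PC-level p-values for one gene set, with hypothetical PC weights
pc_p_values = [0.01, 0.50]
print("signal on heavily weighted PC:", round(weighted_z(pc_p_values, [0.9, 0.1]), 4))
print("signal on lightly weighted PC:", round(weighted_z(pc_p_values, [0.1, 0.9]), 4))
```

    The two calls show the intended effect of weighting: the same small p-value counts for much more when it sits on a component that carries a large weight.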

  10. Human genetic factors in tuberculosis: an update.

    PubMed

    van Tong, Hoang; Velavan, Thirumalaisamy P; Thye, Thorsten; Meyer, Christian G

    2017-09-01

    Tuberculosis (TB) is a major threat to human health, especially in many developing countries. Human genetic variability has been recognised to be of great relevance in host responses to Mycobacterium tuberculosis infection and in regulating both the establishment and the progression of the disease. An increasing number of candidate gene and genome-wide association studies (GWAS) have focused on human genetic factors contributing to susceptibility or resistance to TB. To update previous reviews on human genetic factors in TB we searched the MEDLINE database and PubMed for articles from 1 January 2014 through 31 March 2017 and reviewed the role of human genetic variability in TB. Search terms applied in various combinations were 'tuberculosis', 'human genetics', 'candidate gene studies', 'genome-wide association studies' and 'Mycobacterium tuberculosis'. Articles in English retrieved and relevant references cited in these articles were reviewed. Abstracts and reports from meetings were also included. This review provides a recent summary of associations of polymorphisms of human genes with susceptibility/resistance to TB. © 2017 John Wiley & Sons Ltd.

  11. Performance of Glutamate Dehydrogenase and Triose Phosphate Isomerase Genes in the Analysis of Genotypic Variability of Isolates of Giardia duodenalis from Livestocks

    PubMed Central

    Fava, Natália M. N.; Soares, Rodrigo M.; Scalia, Luana A. M.; Kalapothakis, Evanguedes; Pena, Isabella F.; Vieira, Carlos U.; Faria, Elaine S. M.; Cunha, Maria J.; Couto, Talles R.; Cury, Márcia Cristina

    2013-01-01

    Giardia duodenalis is a small intestinal protozoan parasite of several terrestrial vertebrates. This work aims to assess the genotypic variability of Giardia duodenalis isolates from cattle, sheep and pigs in the Southeast of Brazil, by comparing the standard characterization between glutamate dehydrogenase (gdh) and triose phosphate isomerase (tpi) primers. Fecal samples from the three groups of animals were analyzed using the zinc sulphate centrifugal flotation technique. Out of 59 positive samples, 30 were from cattle, 26 from sheep and 3 from pigs. Cyst pellets were stored and submitted to PCR and nested-PCR reactions with gdh and tpi primers. Fragment amplification of gdh and tpi genes was observed in 25 (42.4%) and 36 (61.0%) samples, respectively. Regarding the sequencing, 24 sequences were obtained with gdh and 20 with tpi. For both genes, there was a prevalence of E specific species assemblage, although some isolates were identified as A and B by tpi sequencing. This has also shown a larger number of heterogeneous sequences, which have been attributed to mixed infections between assemblages B and E. The largest inter-assemblage variability, associated with the frequency of heterogeneity provided by tpi sequencing, reinforces the polymorphic nature of this gene and makes it an excellent target for studies on molecular epidemiology. PMID:24308010

  12. Microevolution Analysis of Bacillus coahuilensis Unveils Differences in Phosphorus Acquisition Strategies and Their Regulation.

    PubMed

    Gómez-Lunar, Zulema; Hernández-González, Ismael; Rodríguez-Torres, María-Dolores; Souza, Valeria; Olmedo-Álvarez, Gabriela

    2016-01-01

    Bacterial genomes undergo numerous events of gene losses and gains that generate genome variability among strains of the same species (microevolution). Our aim was to compare the genomes and relevant phenotypes of three Bacillus coahuilensis strains from two oligotrophic hydrological systems in the Cuatro Ciénegas Basin (México), to unveil the environmental challenges that this species copes with, and the microevolutionary differences in these genotypes. Since the strains were isolated from a low P environment, we placed emphasis on the search for different phosphorus acquisition strategies. The three B. coahuilensis strains exhibited similar numbers of coding DNA sequences, of which 82% (2,893) constituted the core genome, and 18% corresponded to accessory genes. Most of the genes in this last group were associated with mobile genetic elements (MGEs) or were annotated as hypothetical proteins. Ten percent of the pangenome consisted of strain-specific genes. Alignment of the three B. coahuilensis genomes indicated a high level of synteny and revealed the presence of several genomic islands. Unexpectedly, one of these islands contained genes that encode the 2-keto-3-deoxymannooctulosonic acid (Kdo) biosynthesis enzymes, a feature associated with the cell walls of Gram-negative bacteria. Some microevolutionary changes were clearly associated with MGEs. Our analysis revealed inconsistencies between phenotype and genotype, which we suggest result from the impossibility of mapping regulatory features to genome analysis. Experimental results revealed variability in the types and numbers of auxotrophies between the strains that could not consistently be explained by in silico metabolic models. Several intraspecific differences in preferences for carbohydrate and phosphorus utilization were observed. Regarding phosphorus recycling, scavenging, and storage, variations were found between the three genomes. The three strains exhibited differences regarding alkaline phosphatase that revealed that, in addition to gene gain and loss, regulatory adjustment of gene expression has also contributed to the intraspecific diversity of B. coahuilensis.

  13. Evolution of genes and repeats in the Nimrod superfamily.

    PubMed

    Somogyi, Kálmán; Sipos, Botond; Pénzes, Zsolt; Kurucz, Eva; Zsámboki, János; Hultmark, Dan; Andó, István

    2008-11-01

    The recently identified Nimrod superfamily is characterized by the presence of a special type of EGF repeat, the NIM repeat, located right after a typical CCXGY/W amino acid motif. On the basis of structural features, nimrod genes can be divided into three types. The proteins encoded by Draper-type genes have an EMI domain at the N-terminal part and only one copy of the NIM motif, followed by a variable number of EGF-like repeats. The products of Nimrod B-type and Nimrod C-type genes (including the eater gene) have different kinds of N-terminal domains, and lack EGF-like repeats but contain a variable number of NIM repeats. Draper and Nimrod C-type (but not Nimrod B-type) proteins carry a transmembrane domain. Several members of the superfamily were claimed to function as receptors in phagocytosis and/or binding of bacteria, which indicates an important role in cellular immunity and the elimination of apoptotic cells. In this paper, the evolution of the Nimrod superfamily is studied with various methods on the level of genes and repeats. A hypothesis is presented in which the NIM repeat, along with the EMI domain, emerged by structural reorganizations at the end of an EGF-like repeat chain, suggesting a mechanism for the formation of novel types of repeats. The analyses revealed diverse evolutionary patterns in the sequences containing multiple NIM repeats. Although the Nimrod B and Nimrod C proteins show characteristics of independent evolution, many internal NIM repeats in Eater sequences seem to have undergone concerted evolution. An analysis of the nimrod genes has been performed using phylogenetic and other methods and an evolutionary scenario of the origin and diversification of the Nimrod superfamily is proposed. Our study presents an intriguing example of how the evolution of multigene families may contribute to the complexity of the innate immune response.

  14. Detection and Characteristics of Rifampicin-Resistant Isolates of Mycobacterium tuberculosis.

    PubMed

    Cherednichenko, A G; Dymova, M A; Solodilova, O A; Petrenko, T I; Prozorov, A I; Filipenko, M L

    2016-03-01

    Genotyping and analysis of the drug resistance of 59 isolates of M. tuberculosis obtained from patients living in Altai Territory were performed using a BACTEC MGIT 960 fluorometric system by means of VNTR typing (variable number tandem repeat), PCR-RFLP analysis, and sequence analysis. The occurrence frequency was highest for isolates of the Beijing family (n=30, 50.8%). Analysis of the mutation spectrum in the rpoB gene associated with rifampicin resistance revealed the major mutation (codon 531 of the rpoB gene) in 93% of samples, which allows us to use rapid test systems.

  15. Associations between period 3 gene polymorphisms and sleep-/chronotype-related variables in patients with late-life insomnia.

    PubMed

    Mansour, Hader A; Wood, Joel; Chowdari, Kodavali V; Tumuluru, Divya; Bamne, Mikhil; Monk, Timothy H; Hall, Martica H; Buysse, Daniel J; Nimgaonkar, Vishwajit L

    2017-01-01

    A variable number tandem repeat polymorphism (VNTR) in the period 3 (PER3) gene has been associated with heritable sleep and circadian variables, including self-rated chronotypes, polysomnographic (PSG) variables, insomnia and circadian sleep-wake disorders. This report describes novel molecular and clinical analyses of PER3 VNTR polymorphisms to better define their functional consequences. As the PER3 VNTR is located in the exonic (protein coding) region of PER3, we initially investigated whether both alleles (variants) are transcribed into messenger RNA in human fibroblasts. The VNTR showed bi-allelic gene expression. We next investigated genetic associations in relation to clinical variables in 274 older adult Caucasian individuals. Independent variables included genotypes for the PER3 VNTR as well as a representative set of single nucleotide polymorphisms (SNPs) that tag common variants at the PER3 locus (linkage disequilibrium (LD) between genetic variants < 0.5). In order to comprehensively evaluate variables analyzed individually in prior analyses, dependent measures included PSG total sleep time and sleep latency, self-rated chronotype, estimated with the Composite Scale (CS), and lifestyle regularity, estimated using the social rhythm metric (SRM). Initially, genetic polymorphisms were individually analyzed in relation to each outcome variable using analysis of variance (ANOVA). Nominally significant associations were further tested using regression analyses that incorporated individual ANOVA-associated DNA variants as potential predictors and each of the selected sleep/circadian variables as outcomes. The covariates included age, gender, body mass index and an index of medical co-morbidity. Significant genetic associations with the VNTR were not detected with the sleep or circadian variables. Nominally significant associations were detected between SNP rs1012477 and CS scores (p = 0.003) and between rs10462021 and SRM (p = 0.047); rs11579477 and average delta power (p = 0.043) (analyses uncorrected for multiple comparisons). In conclusion, alleles of the VNTR are expressed at the transcript level and may have a functional effect in cells expressing the PER3 gene. PER3 polymorphisms had a modest impact on selected sleep/circadian variables in our sample, suggesting that PER3 is associated with sleep and circadian function beyond VNTR polymorphisms. Further replicate analyses in larger, independent samples are recommended.

  16. Hybrid stochastic simplifications for multiscale gene networks

    PubMed Central

    Crudu, Alina; Debussche, Arnaud; Radulescu, Ovidiu

    2009-01-01

    Background: Stochastic simulation of gene networks by Markov processes has important applications in molecular biology. The complexity of exact simulation algorithms scales with the number of discrete jumps to be performed. Approximate schemes reduce the computational time by reducing the number of simulated discrete events. Also, answering important questions about the relation between network topology and intrinsic noise generation and propagation should be based on general mathematical results. These general results are difficult to obtain for exact models. Results: We propose a unified framework for hybrid simplifications of Markov models of multiscale stochastic gene network dynamics. We discuss several possible hybrid simplifications, and provide algorithms to obtain them from pure jump processes. In hybrid simplifications, some components are discrete and evolve by jumps, while other components are continuous. Hybrid simplifications are obtained by partial Kramers-Moyal expansion [1-3] which is equivalent to the application of the central limit theorem to a sub-model. By averaging and variable aggregation we drastically reduce simulation time and eliminate non-critical reactions. Hybrid and averaged simplifications can be used for more effective simulation algorithms and for obtaining general design principles relating noise to topology and time scales. The simplified models reproduce with good accuracy the stochastic properties of the gene networks, including waiting times in intermittence phenomena, fluctuation amplitudes and stationary distributions. The methods are illustrated on several gene network examples. Conclusion: Hybrid simplifications can be used for onion-like (multi-layered) approaches to multi-scale biochemical systems, in which various descriptions are used at various scales. Sets of discrete and continuous variables are treated with different methods and are coupled together in a physically justified approach. PMID:19735554
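
    To make concrete why the cost of exact simulation grows with the number of discrete jumps, the following minimal sketch runs an exact (Gillespie-type) simulation of a pure jump process. It is not the hybrid scheme proposed in the paper: the one-species birth-death model, the function name gillespie_birth_death and all rate values are assumptions chosen purely for illustration.

```python
import random


def gillespie_birth_death(k_prod, k_deg, t_end, n0=0, seed=0):
    """Exact stochastic simulation of a birth-death process.

    Hypothetical toy model: molecules are produced at constant rate k_prod
    and each molecule is degraded at rate k_deg. Returns the jump times and
    the molecule count after each jump.
    """
    rng = random.Random(seed)
    t, n = 0.0, n0
    times, counts = [t], [n]
    while True:
        a_prod = k_prod          # propensity of the production reaction
        a_deg = k_deg * n        # propensity of the degradation reaction
        a_total = a_prod + a_deg
        if a_total == 0:
            break                # no reaction can fire any more
        # Waiting time to the next jump is exponential with rate a_total.
        t += rng.expovariate(a_total)
        if t > t_end:
            break
        # Choose which reaction fires, proportionally to its propensity.
        if rng.random() * a_total < a_prod:
            n += 1
        else:
            n -= 1
        times.append(t)
        counts.append(n)
    return times, counts
```

    Every loop iteration simulates exactly one jump, so len(times) - 1 is the amount of work done. Raising k_prod while keeping t_end fixed increases that count roughly in proportion, which is the cost that approximate and hybrid schemes aim to reduce by treating abundant species as continuous.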

  17. Amplification of the EGFR gene can be maintained and modulated by variation of EGF concentrations in in vitro models of glioblastoma multiforme

    PubMed Central

    Mokri, Poroshista; Lamp, Nora; Linnebacher, Michael; Classen, Carl Friedrich; Erbersdobler, Andreas; Schneider, Björn

    2017-01-01

    Glioblastoma multiforme (GBM) is the most common and lethal brain tumor in adults. It is known that amplification of the epidermal growth factor receptor gene (EGFR) occurs in approximately 40% of GBM, leading to enhanced activation of the EGFR signaling pathway and promoting tumor growth. Although GBM mutations are stably maintained in GBM in vitro models, rapid loss of EGFR gene amplification is a common observation during cell culture. To maintain EGFR amplification in vitro, heterotopic GBM xenografts with elevated EGFR copy number were cultured under varying serum conditions and EGF concentrations. EGFR copy numbers were assessed over several passages by quantitative PCR and chromogenic in situ hybridization. As expected, in control assays with 10% FCS, cells lost EGFR amplification with increasing passage numbers. However, cells cultured under serum free conditions stably maintained elevated copy numbers. Furthermore, EGFR protein expression positively correlated with genomic amplification levels. Although elevated EGFR copy numbers could be maintained over several passages in vitro, levels of EGFR amplification were variable and dependent on the EGF concentration in the medium. In vitro cultures of GBM cells with elevated EGFR copy number and corresponding EGFR protein expression should prove valuable preclinical tools to gain a better understanding of EGFR driven glioblastoma and assist in the development of new improved therapies. PMID:28934307

  18. Genome complexity in the coelacanth is reflected in its adaptive immune system

    USGS Publications Warehouse

    Saha, Nil Ratan; Ota, Tatsuya; Litman, Gary W.; Hansen, John; Parra, Zuly; Hsu, Ellen; Buonocore, Francesco; Canapa, Adriana; Cheng, Jan-Fang; Amemiya, Chris T.

    2014-01-01

    We have analyzed the available genome and transcriptome resources from the coelacanth in order to characterize genes involved in adaptive immunity. Two highly distinctive IgW-encoding loci have been identified that exhibit a unique genomic organization, including a multiplicity of tandemly repeated constant region exons. The overall organization of the IgW loci precludes typical heavy chain class switching. A locus encoding IgM could not be identified either computationally or by using several different experimental strategies. Four distinct sets of genes encoding Ig light chains were identified. This includes a variant sigma-type Ig light chain previously identified only in cartilaginous fishes and which is now provisionally denoted sigma-2. Genes encoding α/β and γ/δ T-cell receptors, and CD3, CD4, and CD8 co-receptors also were characterized. Ig heavy chain variable region genes and TCR components are interspersed within the TCR α/δ locus; this organization previously was reported only in tetrapods and raises questions regarding evolution and functional cooption of genes encoding variable regions. The composition, organization and syntenic conservation of the major histocompatibility complex locus have been characterized. We also identified large numbers of genes encoding cytokines and their receptors, and other genes associated with adaptive immunity. In terms of sequence identity and organization, the adaptive immune genes of the coelacanth more closely resemble orthologous genes in tetrapods than those in teleost fishes, consistent with current phylogenomic interpretations. Overall, the work described herein highlights the complexity inherent in the coelacanth genome and provides a rich catalog of immune genes for future investigations.

  19. An Integrative Framework for Bayesian Variable Selection with Informative Priors for Identifying Genes and Pathways

    PubMed Central

    Ander, Bradley P.; Zhang, Xiaoshuai; Xue, Fuzhong; Sharp, Frank R.; Yang, Xiaowei

    2013-01-01

    The discovery of genetic or genomic markers plays a central role in the development of personalized medicine. A notable challenge exists when dealing with the high dimensionality of the data sets, as thousands of genes or millions of genetic variants are collected on a relatively small number of subjects. Traditional gene-wise selection methods using univariate analyses have difficulty incorporating correlational, structural, or functional structures amongst the molecular measures. For microarray gene expression data, we first summarize solutions in dealing with ‘large p, small n’ problems, and then propose an integrative Bayesian variable selection (iBVS) framework for simultaneously identifying causal or marker genes and regulatory pathways. A novel partial least squares (PLS) g-prior for iBVS is developed to allow the incorporation of prior knowledge on gene-gene interactions or functional relationships. From the point of view of systems biology, iBVS enables users to directly target the joint effects of multiple genes and pathways in a hierarchical modeling diagram to predict disease status or phenotype. The estimated posterior selection probabilities offer probabilistic and biological interpretations. Both simulated data and a set of microarray data in predicting stroke status are used in validating the performance of iBVS in a Probit model with binary outcomes. iBVS offers a general framework for effective discovery of various molecular biomarkers by combining data-based statistics and knowledge-based priors. Guidelines on making posterior inferences, determining Bayesian significance levels, and improving computational efficiencies are also discussed. PMID:23844055

  20. An integrative framework for Bayesian variable selection with informative priors for identifying genes and pathways.

    PubMed

    Peng, Bin; Zhu, Dianwen; Ander, Bradley P; Zhang, Xiaoshuai; Xue, Fuzhong; Sharp, Frank R; Yang, Xiaowei

    2013-01-01

    The discovery of genetic or genomic markers plays a central role in the development of personalized medicine. A notable challenge exists when dealing with the high dimensionality of the data sets, as thousands of genes or millions of genetic variants are collected on a relatively small number of subjects. Traditional gene-wise selection methods using univariate analyses have difficulty incorporating correlational, structural, or functional structures amongst the molecular measures. For microarray gene expression data, we first summarize solutions in dealing with 'large p, small n' problems, and then propose an integrative Bayesian variable selection (iBVS) framework for simultaneously identifying causal or marker genes and regulatory pathways. A novel partial least squares (PLS) g-prior for iBVS is developed to allow the incorporation of prior knowledge on gene-gene interactions or functional relationships. From the point of view of systems biology, iBVS enables users to directly target the joint effects of multiple genes and pathways in a hierarchical modeling diagram to predict disease status or phenotype. The estimated posterior selection probabilities offer probabilistic and biological interpretations. Both simulated data and a set of microarray data in predicting stroke status are used in validating the performance of iBVS in a Probit model with binary outcomes. iBVS offers a general framework for effective discovery of various molecular biomarkers by combining data-based statistics and knowledge-based priors. Guidelines on making posterior inferences, determining Bayesian significance levels, and improving computational efficiencies are also discussed.

  1. The shaping and functional consequences of the dosage effect landscape in multiple myeloma.

    PubMed

    Samur, Mehmet K; Shah, Parantu K; Wang, Xujun; Minvielle, Stéphane; Magrangeas, Florence; Avet-Loiseau, Hervé; Munshi, Nikhil C; Li, Cheng

    2013-10-02

    Multiple myeloma (MM) is a malignant proliferation of plasma B cells. Based on recurrent aneuploidy such as copy number alterations (CNAs), myeloma is divided into two subtypes with different CNA patterns and patient survival outcomes. How aneuploidy events arise, and whether they contribute to cancer cell evolution, are actively studied. The large number of transcriptomic changes resulting from CNAs (dosage effect) poses big challenges for identifying functional consequences of CNAs in myeloma in terms of specific driver genes and pathways. In this study, we hypothesize that gene-wise dosage effect varies as a result of complex regulatory networks that translate the impact of CNAs to gene expression, and studying this variation can provide insights into functional effects of CNAs. We propose gene-wise dosage effect score and genome-wide karyotype plot as tools to measure and visualize concordant copy number and expression changes across cancer samples. We find that dosage effect in myeloma is widespread yet variable, and it is correlated with gene expression level and CNA frequencies in different chromosomes. Our analysis suggests that despite the enrichment of differentially expressed genes between hyperdiploid MM and non-hyperdiploid MM in the trisomy chromosomes, the chromosomal proportion of dosage sensitive genes is higher in the non-trisomy chromosomes. Dosage-sensitive genes are enriched by genes with protein translation and localization functions, and dosage resistant genes are enriched by apoptosis genes. These results point to future studies on differential dosage sensitivity and resistance of pro- and anti-proliferation pathways and their variation across patients as therapeutic targets and prognosis markers. Our findings support the hypothesis that recurrent CNAs in myeloma are selected by their functional consequences. The novel dosage effect score defined in this work will facilitate integration of copy number and expression data for identifying driver genes in cancer genomics studies. The accompanying R code is available at http://www.canevolve.org/dosageEffect/.

  2. Genome-wide patterns of copy number variation in the diversified chicken genomes using next-generation sequencing.

    PubMed

    Yi, Guoqiang; Qu, Lujiang; Liu, Jianfeng; Yan, Yiyuan; Xu, Guiyun; Yang, Ning

    2014-11-07

    Copy number variation (CNV) is important and widespread in the genome, and is a major cause of disease and phenotypic diversity. Herein, we performed a genome-wide CNV analysis in 12 diversified chicken genomes based on whole genome sequencing. A total of 8,840 CNV regions (CNVRs) covering 98.2 Mb and representing 9.4% of the chicken genome were identified, ranging in size from 1.1 to 268.8 kb with an average of 11.1 kb. Sequencing-based predictions were confirmed at a high validation rate by two independent approaches, including array comparative genomic hybridization (aCGH) and quantitative PCR (qPCR). The Pearson's correlation coefficients between sequencing and aCGH results ranged from 0.435 to 0.755, and qPCR experiments revealed a positive validation rate of 91.71% and a false negative rate of 22.43%. In total, 2,214 (25.0%) predicted CNVRs span 2,216 (36.4%) RefSeq genes associated with specific biological functions. Besides two previously reported copy number variable genes EDN3 and PRLR, we also found some promising genes with potential in phenotypic variation. Two genes, FZD6 and LIMS1, related to disease susceptibility/resistance are covered by CNVRs. The highly duplicated SOCS2 may lead to higher bone mineral density. Entire or partial duplication of some genes like POPDC3 may have great economic importance in poultry breeding. Our results based on extensive genetic diversity provide a more refined chicken CNV map and genome-wide gene copy number estimates, and warrant future CNV association studies for important traits in chickens.
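
    The summary figures in this abstract can be cross-checked with simple arithmetic. The short sketch below is only a reader's consistency check under the assumption that the quoted totals are exact; the helper names are hypothetical and nothing here comes from the authors' pipeline.

```python
def mean_region_size_kb(total_mb, n_regions):
    """Average region size in kb, given total covered length in Mb."""
    return total_mb * 1000.0 / n_regions


def percent(part, whole):
    """Share of `part` in `whole`, expressed as a percentage."""
    return 100.0 * part / whole


# Figures quoted in the abstract.
N_CNVR = 8840          # CNV regions identified
COVERED_MB = 98.2      # total length covered by CNVRs, in Mb
GENIC_CNVR = 2214      # CNVRs that span RefSeq genes

# 98.2 Mb spread over 8,840 regions should match the reported 11.1 kb mean.
average_kb = mean_region_size_kb(COVERED_MB, N_CNVR)

# 2,214 of 8,840 CNVRs should match the reported 25.0%.
genic_share = percent(GENIC_CNVR, N_CNVR)

print(f"mean CNVR size: {average_kb:.1f} kb")
print(f"CNVRs spanning genes: {genic_share:.1f}%")
```

    Both derived values agree with the abstract to the precision it reports, which suggests the quoted totals, mean and percentage are internally consistent.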

  3. The ubiquitous mitochondrial creatine kinase gene maps to a conserved region on human chromosome 15q15 and mouse chromosome 2 bands F1-F3

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Steeghs, K.; Wieringa, B.; Merkx, G.

    1994-11-01

    Members of the creatine kinase isoenzyme family (CKs; EC 2.7.3.2) are found in mitochondria and specialized subregions of the cytoplasm and catalyze the reversible exchange of high-energy phosphoryl between ATP and phosphocreatine. At least four functionally active genes, which encode the distinct CK subunits CKB, CKM, CKMT1 (ubiquitous), and CKMT2 (sarcomeric), and a variable number of CKB pseudogenes have been identified. Here, we report the use of a CKMT1-containing phage to map the CKMT1 gene by in situ hybridization on both human and mouse chromosomes.

  4. Fluorescent protein-mediated colour polymorphism in reef corals: multicopy genes extend the adaptation/acclimatization potential to variable light environments.

    PubMed

    Gittins, John R; D'Angelo, Cecilia; Oswald, Franz; Edwards, Richard J; Wiedenmann, Jörg

    2015-01-01

    The genomic framework that enables corals to adjust to unfavourable conditions is crucial for coral reef survival in a rapidly changing climate. We have explored the striking intraspecific variability in the expression of coral pigments from the green fluorescent protein (GFP) family to elucidate the genomic basis for the plasticity of stress responses among reef corals. We show that multicopy genes can greatly increase the dynamic range over which corals can modulate transcript levels in response to the light environment. Using the red fluorescent protein amilFP597 in the coral Acropora millepora as a model, we demonstrate that its expression increases with light intensity, but both the minimal and maximal gene transcript levels vary markedly among colour morphs. The pigment concentration in the tissue of different morphs is strongly correlated with the number of gene copies with a particular promoter type. These findings indicate that colour polymorphism in reef corals can be caused by the environmentally regulated expression of multicopy genes. High-level expression of amilFP597 is correlated with reduced photodamage of zooxanthellae under acute light stress, supporting a photoprotective function of this pigment. The cluster of light-regulated pigment genes can enable corals to invest either in expensive high-level pigmentation, offering benefits under light stress, or to rely on low tissue pigment concentrations and use the conserved resources for other purposes, which is preferable in less light-exposed environments. The genomic framework described here allows corals to pursue different strategies to succeed in habitats with highly variable light stress levels. In summary, our results suggest that the intraspecific plasticity of reef corals' stress responses is larger than previously thought. © 2014 The Authors Molecular Ecology Published by John Wiley & Sons Ltd.

  5. Gene Expression Signatures Characterized by Longitudinal Stability and Interindividual Variability Delineate Baseline Phenotypic Groups with Distinct Responses to Immune Stimulation.

    PubMed

    Scheid, Adam D; Van Keulen, Virginia P; Felts, Sara J; Neier, Steven C; Middha, Sumit; Nair, Asha A; Techentin, Robert W; Gilbert, Barry K; Jen, Jin; Neuhauser, Claudia; Zhang, Yuji; Pease, Larry R

    2018-03-01

    Human immunity exhibits remarkable heterogeneity among individuals, which engenders variable responses to immune perturbations in human populations. Population studies reveal that, in addition to interindividual heterogeneity, systemic immune signatures display longitudinal stability within individuals, and these signatures may reliably dictate how given individuals respond to immune perturbations. We hypothesize that analyzing relationships among these signatures at the population level may uncover baseline immune phenotypes that correspond with response outcomes to immune stimuli. To test this, we quantified global gene expression in peripheral blood CD4+ cells from healthy individuals at baseline and following CD3/CD28 stimulation at two time points 1 mo apart. Systemic CD4+ cell baseline and poststimulation molecular immune response signatures (MIRS) were defined by identifying genes expressed at levels that were stable between time points within individuals and differential among individuals in each state. Iterative differential gene expression analyses between all possible phenotypic groupings of at least three individuals using the baseline and stimulated MIRS gene sets revealed shared baseline and response phenotypic groupings, indicating the baseline MIRS contained determinants of immune responsiveness. Furthermore, significant numbers of shared phenotype-defining sets of determinants were identified in baseline data across independent healthy cohorts. Combining the cohorts and repeating the analyses resulted in identification of over 6000 baseline immune phenotypic groups, implying that the MIRS concept may be useful in many immune perturbation contexts. These findings demonstrate that patterns in complex gene expression variability can be used to define immune phenotypes and discover determinants of immune responsiveness. Copyright © 2018 by The American Association of Immunologists, Inc.

  6. The phosphotransferase system-dependent sucrose utilization regulon in enteropathogenic Escherichia coli strains is located in a variable chromosomal region containing iap sequences.

    PubMed

    Treviño-Quintanilla, Luis Gerardo; Escalante, Adelfo; Caro, Alma Delia; Martínez, Alfredo; González, Ricardo; Puente, José Luis; Bolívar, Francisco; Gosset, Guillermo

    2007-01-01

    The capacity to utilize sucrose as a carbon and energy source (Scr(+) phenotype) is a highly variable trait among Escherichia coli strains. In this study, seven enteropathogenic E. coli (EPEC) strains from different sources were studied for their capacity to grow using sucrose. Liquid media cultures showed that all analyzed strains have the Scr(+) phenotype and two distinct groups were defined: one of five and another of two strains displaying doubling times of 67 and 125 min, respectively. The genes conferring the Scr(+) phenotype in one of the fast-growing strains (T19) were cloned and sequenced. Comparative sequence analysis revealed that this strain possesses the scr regulon genes scrKYABR, encoding phosphoenolpyruvate:phosphotransferase system-dependent sucrose transport and utilization activities. Transcript level quantification revealed sucrose-dependent induction of scrK and scrR genes in fast-growing strains, whereas no transcripts were detected in slow-growing strains. Sequence comparison analysis revealed that the scr genes in strain T19 are almost identical to those present in the scr regulon of prototype EPEC E2348/69 and in both strains, the scr genes are inserted in the chromosomal intergenic region of hypothetical genes ygcE and ygcF. Comparison of the ygcE-ygcF intergenic region sequence of strains MG1655, enterohemorrhagic EDL933, uropathogenic ECFT073 and EPEC T19-E2348/69 revealed that the number of extragenic highly repeated iap sequences corresponded to nine, four, two and none, respectively. These results show that the iap sequence-containing chromosomal ygcE-ygcF intergenic region is highly variable in E. coli. Copyright (c) 2007 S. Karger AG, Basel.

  7. Does human activity impact the natural antibiotic resistance background? Abundance of antibiotic resistance genes in 21 Swiss lakes.

    PubMed

    Czekalski, Nadine; Sigdel, Radhika; Birtel, Julia; Matthews, Blake; Bürgmann, Helmut

    2015-08-01

    Antibiotic resistance genes (ARGs) are emerging environmental contaminants, known to be continuously discharged into the aquatic environment via human and animal waste. Freshwater aquatic environments represent potential reservoirs for ARGs and potentially allow sewage-derived ARGs to persist and spread in the environment. This may create increased opportunities for an eventual contact with, and gene transfer to, human and animal pathogens via the food chain or drinking water. However, assessment of this risk requires a better understanding of the level and variability of the natural resistance background and the extent of the human impact. We collected water samples from 21 Swiss lakes at sampling points that were not under the direct influence of local contamination sources and analyzed the relative abundance of ARGs using quantitative real-time PCR. Copy numbers of genes mediating resistance to three different broad-spectrum antibiotic classes (sulfonamides: sul1, sul2; tetracyclines: tet(B), tet(M), tet(W); and fluoroquinolones: qnrA) were normalized to copy numbers of bacterial 16S rRNA genes. We used multiple linear regression to assess whether ARG abundance is related to human activities in the catchment, microbial community composition and the eutrophication status of the lakes. Sul genes were detected in all sampled lakes, whereas only four lakes contained quantifiable numbers of tet genes, and qnrA remained below detection in all lakes. Our data indicate higher abundance of sul1 in lakes with increasing number and capacity of wastewater treatment plants (WWTPs) in the catchment. The abundance of sul2 was instead related to long water residence times and eutrophication status. Our study demonstrates the potential of freshwater lakes to preserve antibiotic resistance genes, and provides a reference for ARG abundance from lake systems with low human impact as a baseline for assessing ARG contamination in lake water. Copyright © 2015 Elsevier Ltd. All rights reserved.
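
The normalization step described in this abstract (ARG copy numbers divided by bacterial 16S rRNA gene copy numbers) can be sketched as follows. This is a minimal illustration only: the function name, the sample names, and the copy-number values are hypothetical and are not taken from the study.

```python
def relative_abundance(arg_copies, rrna_16s_copies):
    """Return ARG copies per 16S rRNA gene copy for one sample."""
    if rrna_16s_copies <= 0:
        raise ValueError("16S rRNA copy number must be positive")
    return arg_copies / rrna_16s_copies


# Hypothetical qPCR copy numbers per mL of lake water (illustrative values only).
samples = {
    "lake_a": {"sul1": 1.2e3, "16S": 4.0e6},
    "lake_b": {"sul1": 3.0e2, "16S": 6.0e6},
}

# Relative sul1 abundance per sample, comparable across lakes with different
# total bacterial loads.
ratios = {
    name: relative_abundance(values["sul1"], values["16S"])
    for name, values in samples.items()
}
```

Dividing by the 16S rRNA gene count expresses resistance-gene load relative to total bacterial abundance, which is why lakes of different productivity can be compared on one scale.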

  8. On the Interplay of Telomeres, Nevi and the Risk of Melanoma

    PubMed Central

    Bodelon, Clara; Pfeiffer, Ruth M.; Bollati, Valentina; Debbache, Julien; Calista, Donato; Ghiorzo, Paola; Fargnoli, Maria Concetta; Bianchi-Scarra, Giovanna; Peris, Ketty; Hoxha, Mirjam; Hutchinson, Amy; Burdette, Laurie; Burke, Laura; Fang, Shenying; Tucker, Margaret A.; Goldstein, Alisa M.; Lee, Jeffrey E.; Wei, Qingyi; Savage, Sharon A.; Yang, Xiaohong R.; Amos, Christopher; Landi, Maria Teresa

    2012-01-01

    The relationship between telomeres, nevi and melanoma is complex. Shorter telomeres have been found to be associated with many cancers and with number of nevi, a known risk factor for melanoma. However, shorter telomeres have also been found to decrease melanoma risk. We performed a systematic analysis of telomere-related genes and tagSNPs within these genes, in relation to the risk of melanoma, dysplastic nevi, and nevus count combining data from four studies conducted in Italy. In addition, we examined whether telomere length measured in peripheral blood leukocytes is related to the risk of melanoma, dysplastic nevi, number of nevi, or telomere-related SNPs. A total of 796 cases and 770 controls were genotyped for 517 SNPs in 39 telomere-related genes using a custom-made array. Replication of the top SNPs was conducted in two American populations consisting of 488 subjects from 53 melanoma-prone families and 1,086 cases and 1,024 controls from a case-control study. We estimated odds ratios for associations with SNPs and combined SNP P-values to compute gene region-specific, functional group-specific, and overall P-values using an adaptive rank-truncated product algorithm. In the Mediterranean population, we found suggestive evidence that RECQL4, a gene involved in genome stability, RTEL1, a gene regulating telomere elongation, and TERF2, a gene implicated in the protection of telomeres, were associated with melanoma, the presence of dysplastic nevi and number of nevi, respectively. However, these associations were not found in the American samples, suggesting variable melanoma susceptibility for these genes across populations or chance findings in our discovery sample. Larger studies across different populations are necessary to clarify these associations. PMID:23300679

  9. Colonia Tovar: the history of a semi-isolated Venezuelan population of German ancestry described by HLA class I genes.

    PubMed

    Gendzekhadze, K; Montagnani, S; Ogando, V; Balbas, O; Mendez-Castellano, H; Layrisse, Z

    2003-11-01

    The history of Colonia Tovar is very complex, being the home of descendants of only a small fraction of immigrants arriving to the South American continent from a specific region of Germany, with a restricted number of founders, small population size and consanguineous mating, experiencing isolation for 100 years, with later migrations, a low rate of population growth and a high mean number of children per couple. How complex is its genetic structure? Do the highly polymorphic HLA genes reflect its history and confirm the story of this population described by other genes? Several studies have been made in this population, but we describe for the first time the HLA Class I variability in the population of Colonia Tovar using PCR-SSOP. Random genetic drift, founder effect and gene flow could explain the HLA allele and haplotype frequencies observed in this population but alleles at the class I loci were insufficient to identify the German origin of the community established through history. This agrees with findings obtained testing other genetic systems (ACP, AK, ESD, G6PD, GLO, PGM, PGD, ALB, CP, HP, TF), but the HLA-typing results indicate that the original gene pool has been diluted due to gene flow from the surrounding Mestizo population.

  10. A map of human microRNA variation uncovers unexpectedly high levels of variability

    PubMed Central

    2012-01-01

    Background: MicroRNAs (miRNAs) are key components of the gene regulatory network in many species. During the past few years, these regulatory elements have been shown to be involved in an increasing number and range of diseases. Consequently, the compilation of a comprehensive map of natural variability in a healthy population seems an obvious requirement for future research on miRNA-related pathologies. Methods: Data on 14 populations from the 1000 Genomes Project were analyzed, along with new data extracted from 60 exomes of healthy individuals from a population from southern Spain, sequenced in the context of the Medical Genome Project, to derive an accurate map of miRNA variability. Results: Despite the common belief that miRNAs are highly conserved elements, analysis of the sequences of the 1,152 individuals indicated that the observed level of variability is double what was expected. A total of 527 variants were found. Among these, 45 variants affected the recognition region of the corresponding miRNA and were found in 43 different miRNAs, 26 of which are known to be involved in 57 diseases. Different parts of the mature structure of the miRNA were affected to different degrees by variants, which suggests the existence of a selective pressure related to the relative functional impact of the change. Moreover, 41 variants showed a significant deviation from the Hardy-Weinberg equilibrium, which supports the existence of a selective process against some alleles. The average number of variants per individual in miRNAs was 28. Conclusions: Despite an expectation that miRNAs would be highly conserved genomic elements, our study reports a level of variability comparable to that observed for coding genes. PMID:22906193
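
The abstract above reports variants that deviate from Hardy-Weinberg equilibrium but does not say which test was used. As a hedged illustration, the sketch below applies the common chi-square goodness-of-fit test to genotype counts at one biallelic site. The function name and the genotype counts in the usage note are hypothetical. The p-value uses the closed form for one degree of freedom, `erfc(sqrt(chi2 / 2))`.

```python
import math


def hwe_chi_square(n_aa, n_ab, n_bb):
    """Chi-square test for Hardy-Weinberg equilibrium at a biallelic site.

    Takes observed genotype counts (AA, AB, BB) and returns (chi2, p_value),
    where the p-value assumes one degree of freedom.
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("at least one genotype must be observed")
    p = (2 * n_aa + n_ab) / (2 * n)  # frequency of allele A
    q = 1.0 - p                      # frequency of allele B
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_aa, n_ab, n_bb)
    # Skip classes with zero expectation (monomorphic sites).
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
    # Survival function of the chi-square distribution with 1 df.
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value
```

For example, `hwe_chi_square(25, 50, 25)` matches the expected proportions exactly and gives a statistic of zero, whereas a sample with no heterozygotes, such as `hwe_chi_square(50, 0, 50)`, gives a very large statistic. For rare variants an exact test is usually preferred over this approximation.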

  11. Confirmation of chromosomal microarray as a first-tier clinical diagnostic test for individuals with developmental delay, intellectual disability, autism spectrum disorders and dysmorphic features.

    PubMed

    Battaglia, Agatino; Doccini, Viola; Bernardini, Laura; Novelli, Antonio; Loddo, Sara; Capalbo, Anna; Filippi, Tiziana; Carey, John C

    2013-11-01

    Submicroscopic chromosomal rearrangements are the most common identifiable causes of intellectual disability and autism spectrum disorders associated with dysmorphic features. Chromosomal microarray (CMA) can detect copy number variants <1 Mb and identifies size and presence of known genes. The aim of this study was to demonstrate the usefulness of CMA, as a first-tier tool in detecting the etiology of unexplained intellectual disability/autism spectrum disorders (ID/ASDs) associated with dysmorphic features in a large cohort of pediatric patients. We studied 349 individuals; 223 males, 126 females, aged 5 months-19 years. Blood samples were analyzed with CMA at a resolution ranging from 1 Mb to 40 Kb. The imbalance was confirmed by FISH or qPCR. We considered copy number variants (CNVs) causative if the variant was responsible for a known syndrome, encompassed gene/s of known function, occurred de novo or, if inherited, the parent was variably affected, and/or the involved gene/s had been reported in association with ID/ASDs in dedicated databases. 91 CNVs were detected in 77 (22.06%) patients: 5 (6.49%) of those presenting with borderline cognitive impairment, 54 (70.13%) with a variable degree of DD/ID, and 18/77 (23.38%) with ID of variable degree and ASDs. 16/77 (20.8%) patients had two different rearrangements. Deletions exceeded duplications (58 versus 33); 45.05% (41/91) of the detected CNVs were de novo, 45.05% (41/91) inherited, and 9.9% (9/91) unknown. The CNVs caused the phenotype in 57/77 (74%) patients; 12/57 (21.05%) had ASDs/ID, and 45/57 (78.95%) had DD/ID. Our study provides further evidence of the high diagnostic yield of CMA for genetic testing in children with unexplained ID/ASDs who had dysmorphic features. We confirm the value of CMA as the first-tier tool in the assessment of those conditions in the pediatric setting. Copyright © 2013 European Paediatric Neurology Society. Published by Elsevier Ltd. All rights reserved.
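
The proportions reported in this abstract can be re-derived from the stated counts. The short check below is only a reader's aid: the helper `pct` and the list layout are illustrative and not part of the study, while the counts and percentages are copied from the abstract.

```python
def pct(part, whole, digits=2):
    """Percentage of `part` in `whole`, rounded to `digits` decimals."""
    return round(100 * part / whole, digits)


# (numerator, denominator, percentage as reported in the abstract)
reported = [
    (77, 349, 22.06),  # patients with at least one CNV
    (5, 77, 6.49),     # borderline cognitive impairment
    (54, 77, 70.13),   # variable degree of DD/ID
    (18, 77, 23.38),   # ID of variable degree and ASDs
    (41, 91, 45.05),   # de novo CNVs (same count for inherited CNVs)
    (12, 57, 21.05),   # causative CNVs with ASDs/ID
    (45, 57, 78.95),   # causative CNVs with DD/ID
]

# Any entry whose recomputed value differs from the reported one.
mismatches = [(p, w, r) for p, w, r in reported if pct(p, w) != r]

# Deletions plus duplications should equal the total number of CNVs.
total_cnvs = 58 + 33
```

With these inputs the recomputed values agree with the reported ones, and deletions plus duplications sum to the 91 CNVs stated.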

  12. Copy number variation of human AMY1 is a minor contributor to variation in salivary amylase expression and activity.

    PubMed

    Carpenter, Danielle; Mitchell, Laura M; Armour, John A L

    2017-02-20

    Salivary amylase in humans is encoded by the copy variable gene AMY1 in the amylase gene cluster on chromosome 1. Although the role of salivary amylase is well established, the consequences of the copy number variation (CNV) at AMY1 on salivary amylase protein production are less well understood. The amylase gene cluster is highly structured with a fundamental difference between odd and even AMY1 copy number haplotypes. In this study, we aimed to explore, in samples from 119 unrelated individuals, not only the effects of AMY1 CNV on salivary amylase protein expression and amylase enzyme activity but also whether there is any evidence for underlying difference between the common haplotypes containing odd numbers of AMY1 and even copy number haplotypes. AMY1 copy number was significantly correlated with the variation observed in salivary amylase production (11.7% of variance, P < 0.0005) and enzyme activity (13.6% of variance, P < 0.0005) but did not explain the majority of observed variation between individuals. AMY1-odd and AMY1-even haplotypes showed a different relationship between copy number and expression levels, but the difference was not statistically significant (P = 0.052). Production of salivary amylase is correlated with AMY1 CNV, but the majority of interindividual variation comes from other sources. Long-range haplotype structure may affect expression, but this was not significant in our data.

  13. The MAOA promoter polymorphism, disruptive behavior disorders, and early onset substance use disorder: gene-environment interaction.

    PubMed

    Vanyukov, Michael M; Maher, Brion S; Devlin, Bernie; Kirillova, Galina P; Kirisci, Levent; Yu, Ling-Mei; Ferrell, Robert E

    2007-12-01

    Conduct, oppositional defiant, and attention deficit hyperactivity disorders, reflecting early antisociality and behavior dysregulation, are predictive of substance use disorders. Liabilities to these disorders share genetic and environmental variance. Parenting characteristics have been shown to influence development of antisociality, moderated by variation at the MAOA gene, which has also been associated with the risk for substance use disorders. To extend these findings, we tested the relationships between the MAOA promoter polymorphism (variable number tandem repeat), indices of child's perception of paternal and maternal parenting, and disruptive behavior disorders and substance use disorders. A sample of 148 European-American males was assessed prospectively at ages from 10-12 to 18-19 years and genotyped for the monoamine oxidase A variable number tandem repeat. Diagnostic and Statistical Manual of Mental Disorders-III-R diagnoses were obtained using standard methodology. Parenting was assessed using a scale summarizing the child's evaluation of the parenting style (parent's behavior toward him, parental emotional distance and involvement). Correlation, logistic regression, and Cox proportional hazard regression analyses were used to determine the relationships between the variables. The strength of association between parenting index and conduct and attention deficit hyperactivity disorders depended on the MAOA genotype. Unlike earlier findings, the parenting-risk relationships were observed in the 'high-' rather than 'low-activity' genotypes. The strength and direction of relationships depended on the parental sex. The MAOA polymorphism's association with the risk for substance use disorders was detected when parenting was controlled for. The results are consistent with the contribution of the MAOA gene, parenting style and their interactions to variation in the risk for early onset behavior disorders and liability to substance use disorders.

  14. Tumor gene expression and prognosis in breast cancer patients with 10 or more positive lymph nodes.

    PubMed

    Cobleigh, Melody A; Tabesh, Bita; Bitterman, Pincas; Baker, Joffre; Cronin, Maureen; Liu, Mei-Lan; Borchik, Russell; Mosquera, Juan-Miguel; Walker, Michael G; Shak, Steven

    2005-12-15

    This study, along with two others, was done to develop the 21-gene Recurrence Score assay (Oncotype DX) that was validated in a subsequent independent study and is used to aid decision making about chemotherapy in estrogen receptor (ER)-positive, node-negative breast cancer patients. Patients with ≥10 nodes diagnosed from 1979 to 1999 were identified. RNA was extracted from paraffin blocks, and expression of 203 candidate genes was quantified using reverse transcription-PCR (RT-PCR). Seventy-eight patients were studied. As of August 2002, 77% of patients had distant recurrence or breast cancer death. Univariate Cox analysis of clinical and immunohistochemistry variables indicated that HER2/immunohistochemistry, number of involved nodes, progesterone receptor (PR)/immunohistochemistry (% cells), and ER/immunohistochemistry (% cells) were significantly associated with distant recurrence-free survival (DRFS). Univariate Cox analysis identified 22 genes associated with DRFS. Higher expression correlated with shorter DRFS for the HER2 adaptor GRB7 and the macrophage marker CD68. Higher expression correlated with longer DRFS for tumor protein p53-binding protein 2 (TP53BP2) and the ER axis genes PR and Bcl2. Multivariate methods, including stepwise variable selection and bootstrap resampling of the Cox proportional hazards regression model, identified several genes, including TP53BP2 and Bcl2, as significant predictors of DRFS. Tumor gene expression profiles of archival tissues, some more than 20 years old, provide significant information about risk of distant recurrence even among patients with 10 or more nodes.

  15. Determinism and randomness in the evolution of introns and sine inserts in mouse and human mitochondrial solute carrier and cytokine receptor genes.

    PubMed

    Cianciulli, Antonia; Calvello, Rosa; Panaro, Maria A

    2015-04-01

    In the homologous genes studied, the exons and introns alternated in the same order in mouse and human. We studied, in both species: corresponding short segments of introns, whole corresponding introns and complete homologous genes. We considered the total number of nucleotides and the number and orientation of the SINE inserts. Comparisons of mouse and human data series showed that at the level of individual relatively short segments of intronic sequences the stochastic variability prevails in the local structuring, but at higher levels of organization a deterministic component emerges, conserved in mouse and human during the divergent evolution, despite the ample re-editing of the intronic sequences and the fact that processes such as SINE spread had taken place in an independent way in the two species. Intron conservation is negatively correlated with the SINE occupancy, suggesting that virus inserts interfere with the conservation of the sequences inherited from the common ancestor. Copyright © 2015 Elsevier Ltd. All rights reserved.

  16. Generation of the first Autosomal Dominant Osteopetrosis Type II (ADO2) disease models

    PubMed Central

    Alam, Imranul; Gray, Amie K.; Chu, Kang; Ichikawa, Shoji; Mohammad, Khalid S.; Capannolo, Marta; Capulli, Mattia; Maurizi, Antonio; Muraca, Maurizio; Teti, Anna; Econs, Michael J.; Fattore, Andrea Del

    2013-01-01

    Autosomal Dominant Osteopetrosis Type II (ADO2) is a heritable osteosclerotic disorder dependent on osteoclast impairment. In most patients it results from heterozygous missense mutations in the chloride channel 7 (CLCN7) gene, encoding a 2Cl−/1H+ antiporter. By a knock-in strategy inserting a missense mutation in the Clcn7 gene, our two research groups independently generated mouse models of ADO2 on different genetic backgrounds carrying the homolog of the most frequent heterozygous mutation (p.G213R) in the Clcn7 gene found in humans. Our results demonstrate that the heterozygous model holds true, presenting with higher bone mass, increased numbers of poorly resorbing osteoclasts and a lethal phenotype in the homozygous state. Considerable variability is observed in the heterozygous mice depending on the mouse background, suggesting that modifier genes could influence the penetrance of the disease gene. PMID:24185277

  17. Copy number variation in the region harboring SOX9 gene in dogs with testicular/ovotesticular disorder of sex development (78,XX; SRY-negative).

    PubMed

    Marcinkowska-Swojak, Malgorzata; Szczerbal, Izabela; Pausch, Hubert; Nowacka-Woszuk, Joanna; Flisikowski, Krzysztof; Dzimira, Stanislaw; Nizanski, Wojciech; Payan-Carreira, Rita; Fries, Ruedi; Kozlowski, Piotr; Switonski, Marek

    2015-10-01

    Although the disorder of sex development in dogs with female karyotype (XX DSD) is quite common, its molecular basis is still unclear. Among mutations underlying XX DSD in mammals are duplication of a long sequence upstream of the SOX9 gene (RevSex) and duplication of the SOX9 gene (also observed in dogs). We performed a comparative analysis of 16 XX DSD and 30 control female dogs, using FISH and MLPA approaches. Our study was focused on a region harboring SOX9 and a region orthologous to the human RevSex (CanRevSex), which was located by in silico analysis downstream of SOX9. Two highly polymorphic copy number variable regions (CNVRs): CNVR1 upstream of SOX9 and CNVR2 encompassing CanRevSex were identified. Although none of the detected copy number variants were specific to either affected or control animals, we observed that the average number of copies in CNVR1 was higher in XX DSD. No copy variation of SOX9 was observed. Our extensive studies have excluded duplication of SOX9 as the common cause of XX DSD in analyzed samples. However, it remains possible that the causative mutation is hidden in highly polymorphic CNVR1.

  18. Copy number variation in the region harboring SOX9 gene in dogs with testicular/ovotesticular disorder of sex development (78,XX; SRY-negative)

    PubMed Central

    Marcinkowska-Swojak, Malgorzata; Szczerbal, Izabela; Pausch, Hubert; Nowacka-Woszuk, Joanna; Flisikowski, Krzysztof; Dzimira, Stanislaw; Nizanski, Wojciech; Payan-Carreira, Rita; Fries, Ruedi; Kozlowski, Piotr; Switonski, Marek

    2015-01-01

    Although the disorder of sex development in dogs with female karyotype (XX DSD) is quite common, its molecular basis is still unclear. Among mutations underlying XX DSD in mammals are duplication of a long sequence upstream of the SOX9 gene (RevSex) and duplication of the SOX9 gene (also observed in dogs). We performed a comparative analysis of 16 XX DSD and 30 control female dogs, using FISH and MLPA approaches. Our study was focused on a region harboring SOX9 and a region orthologous to the human RevSex (CanRevSex), which was located by in silico analysis downstream of SOX9. Two highly polymorphic copy number variable regions (CNVRs): CNVR1 upstream of SOX9 and CNVR2 encompassing CanRevSex were identified. Although none of the detected copy number variants were specific to either affected or control animals, we observed that the average number of copies in CNVR1 was higher in XX DSD. No copy variation of SOX9 was observed. Our extensive studies have excluded duplication of SOX9 as the common cause of XX DSD in analyzed samples. However, it remains possible that the causative mutation is hidden in highly polymorphic CNVR1. PMID:26423656

  19. A large-scale survey of genetic copy number variations among Han Chinese residing in Taiwan

    PubMed Central

    Lin, Chien-Hsing; Li, Ling-Hui; Ho, Sheng-Feng; Chuang, Tzu-Po; Wu, Jer-Yuarn; Chen, Yuan-Tsong; Fann, Cathy SJ

    2008-01-01

    Background: Copy number variations (CNVs) have recently been recognized as important structural variations in the human genome. CNVs can affect gene expression and thus may contribute to phenotypic differences. The copy number inferring tool (CNIT) is an effective hidden Markov model-based algorithm for estimating allele-specific copy number and predicting chromosomal alterations from single nucleotide polymorphism microarrays. The CNIT algorithm, which was constructed using data from 270 HapMap multi-ethnic individuals, was applied to identify CNVs from 300 unrelated Han Chinese individuals in Taiwan. Results: Using stringent selection criteria, 230 regions with variable copy numbers were identified in the Han Chinese population; 133 (57.83%) had been reported previously, 64 displayed greater than 1% CNV allele frequency. The average size of the CNV regions was 322 kb (ranging from 1.48 kb to 5.68 Mb) and covered a total of 2.47% of the human genome. A total of 196 of the CNV regions were simple deletions and 27 were simple amplifications. There were 449 genes and 5 microRNAs within these CNV regions; some of these genes are known to be associated with diseases. Conclusion: The identified CNVs are characteristic of the Han Chinese population and should be considered when genetic studies are conducted. The CNV distribution in the human genome is still poorly characterized, and there is much diversity among different ethnic populations. PMID:19108714

  20. Using the gene ontology to scan multilevel gene sets for associations in genome wide association studies.

    PubMed

    Schaid, Daniel J; Sinnwell, Jason P; Jenkins, Gregory D; McDonnell, Shannon K; Ingle, James N; Kubo, Michiaki; Goss, Paul E; Costantino, Joseph P; Wickerham, D Lawrence; Weinshilboum, Richard M

    2012-01-01

    Gene-set analyses have been widely used in gene expression studies, and some of the developed methods have been extended to genome wide association studies (GWAS). Yet, complications due to linkage disequilibrium (LD) among single nucleotide polymorphisms (SNPs), and variable numbers of SNPs per gene and genes per gene-set, have plagued current approaches, often leading to ad hoc "fixes." To overcome some of the current limitations, we developed a general approach to scan GWAS SNP data for both gene-level and gene-set analyses, building on score statistics for generalized linear models, and taking advantage of the directed acyclic graph structure of the gene ontology when creating gene-sets. However, other types of gene-set structures can be used, such as the popular Kyoto Encyclopedia of Genes and Genomes (KEGG). Our approach combines SNPs into genes, and genes into gene-sets, but assures that positive and negative effects of genes on a trait do not cancel. To control for multiple testing of many gene-sets, we use an efficient computational strategy that accounts for LD and provides accurate step-down adjusted P-values for each gene-set. Application of our methods to two different GWAS provide guidance on the potential strengths and weaknesses of our proposed gene-set analyses. © 2011 Wiley Periodicals, Inc.

  1. Individual Responsiveness to Exercise-Induced Fat Loss and Improvement of Metabolic Profile in Young Women is Associated with Polymorphisms of Adrenergic Receptor Genes

    PubMed Central

    Leońska-Duniec, Agata; Jastrzębski, Zbigniew; Jażdżewska, Aleksandra; Moska, Waldemar; Lulińska-Kuklik, Ewelina; Sawczuk, Marek; Gubaydullina, Svetlana I.; Shakirova, Alsu T.; Cięszczyk, Pawel; Maszczyk, Adam; Ahmetov, Ildus I.

    2018-01-01

    The effectiveness of physical exercise on fat loss and improvement of aerobic capacity varies considerably between individuals. A strong linkage exists between common allelic variants of the adrenergic receptor genes and weight gain, as well as changes in body composition. Therefore we aimed to check if body composition and metabolic variables were modulated by the ADRB2 (Gly16Arg and Glu27Gln), ADRB3 (Trp64Arg) and ADRA2A (rs553668 G/A) gene polymorphisms in 163 Polish sedentary women (age 19-24; body mass index (BMI) 21.7 ± 0.2 kg·m-2) involved in a 12-week aerobic training program. Only 74.8% of participants lost fat mass. On average, participants lost 5.8 (10.4)% of their relative fat mass with training (range: +28.3 to -63.6%). The improvement of VO2max was significantly greater in women who could lose their fat mass compared to women who were unsuccessful in fat loss (4.5 (5.6)% vs. 1.5 (3.8)%; p = 0.0045). The carriers of a low number (0-3) of obesity-related risk alleles (ADRB2 Gly16, ADRB2 Glu27, ADRA2A rs553668 G) were more successful in fat mass loss compared to the carriers of a high number (5-6) of risk alleles (7.7 (9.8) vs 4.0 (9.4)%, p = 0.0362). The presented results support the assumption that variation within adrenergic receptor genes contributes to interindividual changes of body composition in response to physical exercise. Key points There is a wide range of individual variability in the change of relative fat mass and BMI in response to a 12-week aerobic training program. The efficiency of fat loss was inversely correlated with the improvement of VO2max in response to a 12-week aerobic training. The carriers of a low number of obesity-related risk alleles were more successful in fat mass loss compared to the carriers of a high number of risk alleles. PMID:29535587

  2. Distribution and survival of Vibrio vulnificus genotypes in postharvest Gulf Coast (USA) oysters under refrigeration.

    PubMed

    Wood, R R; Arias, C R

    2012-07-01

    The effect of refrigeration on the seafood-borne pathogen Vibrio vulnificus was investigated in terms of genotype selection and persistence in refrigerated oysters. Naturally occurring numbers of V. vulnificus in oysters from two different locations were compared during a 2-week period under refrigeration conditions. At different time points, V. vulnificus isolates were recovered from oysters and ascribed to 16S rRNA gene type A, B or AB using restriction fragment length polymorphism. Initial V. vulnificus numbers were higher than 10^4 most probable number (MPN) g^-1 and remained unchanged throughout the duration of the study. 16S rRNA gene type B isolates accounted for 53% of the isolates recovered. Amplified fragment length polymorphism analysis confirmed the high genetic variability previously observed within this species but revealed the presence of two main genetic groups within the species that matched 16S rRNA gene ascription. Vibrio vulnificus numbers in oysters did not significantly decline over the shelf life of the product and refrigeration did not select for specific V. vulnificus types. The prevalence of V. vulnificus 16S rRNA gene type B in oysters was higher than previously reported from the same geographic area and was not significantly reduced during the storage period. Vibrio vulnificus is divided into two clear genotypes, regardless of the genetic marker used. © 2012 The Authors. Journal of Applied Microbiology © 2012 The Society for Applied Microbiology.

  3. Two different secondary metabolism gene clusters occupied the same ancestral locus in fungal dermatophytes of the arthrodermataceae.

    PubMed

    Zhang, Han; Rokas, Antonis; Slot, Jason C

    2012-01-01

    Dermatophyte fungi of the family Arthrodermataceae (Eurotiomycetes) colonize keratinized tissue, such as skin, frequently causing superficial mycoses in humans and other mammals, reptiles, and birds. Competition with native microflora likely underlies the propensity of these dermatophytes to produce a diversity of antibiotics and compounds for scavenging iron, which is extremely scarce, as well as the presence of an unusually large number of putative secondary metabolism gene clusters, most of which contain non-ribosomal peptide synthetases (NRPS), in their genomes. To better understand the historical origins and diversification of NRPS-containing gene clusters we examined the evolution of a variable locus (VL) that exists in one of three alternative conformations among the genomes of seven dermatophyte species. The first conformation of the VL (termed VLA) contains only 539 base pairs of sequence and lacks protein-coding genes, whereas the other two conformations (termed VLB and VLC) span 36 Kb and 27 Kb and contain 12 and 10 genes, respectively. Interestingly, both VLB and VLC appear to contain distinct secondary metabolism gene clusters; VLB contains a NRPS gene as well as four porphyrin metabolism genes never found to be physically linked in the genomes of 128 other fungal species, whereas VLC also contains a NRPS gene as well as several others typically found associated with secondary metabolism gene clusters. Phylogenetic evidence suggests that the VL locus was present in the ancestor of all seven species achieving its present distribution through subsequent differential losses or retentions of specific conformations. We propose that the existence of variable loci, similar to the one we studied, in fungal genomes could potentially explain the dramatic differences in secondary metabolic diversity between closely related species of filamentous fungi, and contribute to host adaptation and the generation of metabolic diversity.

  4. Association of Higher Defensin β-4 Genomic Copy Numbers with Behçet's Disease in Iraqi Patients.

    PubMed

    Hameed, Ammar F; Jaradat, Sameh; Al-Musawi, Bassam M; Sharquie, Khalifa; Ibrahim, Mazin J; Hayani, Raafa K; Norgauer, Johannes

    2015-11-01

    Behçet's disease (BD) is an immune-mediated small vessel systemic vasculitis. Human β-defensins are antimicrobial peptides associated with many inflammatory diseases and are encoded by the β-defensin family of multiple-copy genes. However, their role in BD necessitates further investigation. The aim of the present study was to investigate the possible association of BD in its various clinical forms with defensin β-4 (DEFB4) genomic copy numbers. This case-control study was conducted from January to September 2011 and included 50 control subjects and 27 unrelated Iraqi BD patients registered at Baghdad Teaching Hospital, Baghdad, Iraq. Copy numbers of the DEFB4 gene were determined using the comparative cycle threshold method by duplex real-time polymerase chain reaction technology at the Department of Dermatology of Jena University Hospital, Jena, Germany. DEFB4 genomic copy numbers were significantly higher in the BD group compared to the control group (P = 0.010). However, no statistically significant association was found between copy numbers and clinical variables within the BD group. The DEFB4 copy number polymorphism may be associated with BD; however, it is not associated with different clinical manifestations of the disease.
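
The comparative cycle threshold method named in this abstract is commonly computed as a 2^(-ΔΔCt) ratio against a calibrator sample of known copy number. The sketch below shows that generic calculation under stated assumptions: perfect amplification efficiency (a doubling per cycle), a single-copy reference gene, and a calibrator whose copy number is known. The function name, the parameters, and the Ct values in the usage note are hypothetical and do not come from the study.

```python
def copy_number_ddct(ct_target, ct_ref, cal_ct_target, cal_ct_ref,
                     calibrator_copies=2):
    """Estimate gene copy number with the comparative Ct (2^-ddCt) method.

    ct_target, ct_ref:         Ct of target and reference gene in the test sample
    cal_ct_target, cal_ct_ref: the same two Ct values in the calibrator sample
    calibrator_copies:         assumed known copy number of the calibrator
    Assumes 100% PCR efficiency for both amplicons.
    """
    delta_sample = ct_target - ct_ref
    delta_calibrator = cal_ct_target - cal_ct_ref
    ddct = delta_sample - delta_calibrator
    return calibrator_copies * 2 ** (-ddct)
```

For example, a sample whose target gene crosses the threshold one cycle earlier than in a two-copy calibrator, with identical reference Ct values, would be estimated at four copies: `copy_number_ddct(24.0, 25.0, 25.0, 25.0)`. In practice the result is usually rounded to the nearest integer and checked against replicate wells.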

  5. Gene Expression Profile Analysis is Directly Affected by the Selected Reference Gene: The Case of Leaf-Cutting Atta Sexdens

    PubMed Central

    Máximo, Wesley P. F.; Zanetti, Ronald; Paiva, Luciano V.

    2018-01-01

    Although several ant species are important targets for the development of molecular control strategies, only a few studies focus on identifying and validating reference genes for quantitative reverse transcription polymerase chain reaction (RT-qPCR) data normalization. We provide here an extensive study to identify and validate suitable reference genes for gene expression analysis in the ant Atta sexdens, a threatening agricultural pest in South America. The optimal number of reference genes varied by sample, and the results generated by RefFinder differed as to which reference gene was the most suitable. Results suggest that the RPS16, NADH and SDHB genes were the best reference genes in the sample pool according to stability values. The SNF7 gene expression pattern was stable in all evaluated sample sets. In contrast, when less stable reference genes were used for normalization, a large variability in SNF7 gene expression was recorded. There is no universal reference gene suitable for all conditions under analysis, since these genes can also participate in different cellular functions, thus requiring a systematic validation of possible reference genes for each specific condition. The effect of reference gene choice on SNF7 gene normalization confirmed that unstable reference genes might drastically change the expression profile analysis of target candidate genes. PMID:29419794

  6. Antibiotic Susceptibility and Molecular Diversity of Bacillus anthracis Strains in Chad: Detection of a New Phylogenetic Subgroup

    PubMed Central

    Maho, Angaya; Rossano, Alexandra; Hächler, Herbert; Holzer, Anita; Schelling, Esther; Zinsstag, Jakob; Hassane, Mahamat H.; Toguebaye, Bhen S.; Akakpo, Ayayi J.; Van Ert, Matthew; Keim, Paul; Kenefic, Leo; Frey, Joachim; Perreten, Vincent

    2006-01-01

    We genotyped 15 Bacillus anthracis isolates from Chad, Africa, using multiple-locus variable-number tandem repeat analysis and three additional direct-repeat markers. We identified two unique genotypes that represent a novel genetic lineage in the A cluster. Chadian isolates were susceptible to 11 antibiotics and free of 94 antibiotic resistance genes. PMID:16954291

  7. Dopamine D4 receptor gene polymorphism and personality traits in healthy volunteers.

    PubMed

    Persson, M L; Wasserman, D; Geijer, T; Frisch, A; Rockah, R; Michaelovsky, E; Apter, A; Weizman, A; Jönsson, E G; Bergman, H

    2000-01-01

    An association between long alleles of a variable number tandem repeat (VNTR) polymorphism in the dopamine receptor D4 gene and the extraversion related personality traits Excitement and Novelty Seeking has been reported in healthy subjects. In an attempt to replicate the previous findings, 256 healthy Caucasian volunteers were analysed for a potential relationship between the dopamine receptor D4 exon III VNTR polymorphism and Extraversion as assessed by the Revised Neo Personality Inventory (NEO PI-R). The present study did not yield evidence for an association between Extraversion and the dopamine receptor D4 polymorphism.

  8. A genome-wide detection of copy number variation using SNP genotyping arrays in Beijing-You chickens.

    PubMed

    Zhou, Wei; Liu, Ranran; Zhang, Jingjing; Zheng, Maiqing; Li, Peng; Chang, Guobin; Wen, Jie; Zhao, Guiping

    2014-10-01

    Copy number variation (CNV) has been recently examined in many species and is recognized as being a source of genetic variability, especially for disease-related phenotypes. In this study, the PennCNV software, a genome-wide CNV detection system based on the 60 K SNP BeadChip, was used on a total sample size of 1,310 Beijing-You chickens (a Chinese local breed). After quality control, 137 high-confidence CNVRs were identified, covering 27.31 Mb and corresponding to 2.61% of the whole chicken genome. Within these regions, 131 known genes or coding sequences were involved. Q-PCR was applied to verify some of the genes related to disease development. Results showed that the copy number of genes such as phosphatidylinositol-5-phosphate 4-kinase II alpha, PHD finger protein 14, RHACD8 (a CD8α-like messenger RNA), MHC B-G, zinc finger protein, sarcosine dehydrogenase and ficolin 2 varied between individual chickens, which also supports the reliability of chip-detection of the CNVs. As one source of genomic variation, CNVs may provide new insight into the relationship between the genome and phenotypic characteristics.

  9. Assembly and comparison of two closely related Brassica napus genomes.

    PubMed

    Bayer, Philipp E; Hurgobin, Bhavna; Golicz, Agnieszka A; Chan, Chon-Kit Kenneth; Yuan, Yuxuan; Lee, HueyTyng; Renton, Michael; Meng, Jinling; Li, Ruiyuan; Long, Yan; Zou, Jun; Bancroft, Ian; Chalhoub, Boulos; King, Graham J; Batley, Jacqueline; Edwards, David

    2017-12-01

    As an increasing number of plant genome sequences become available, it is clear that gene content varies between individuals, and the challenge arises to predict the gene content of a species. However, genome comparison is often confounded by variation in assembly and annotation. Differentiating between true gene absence and variation in assembly or annotation is essential for the accurate identification of conserved and variable genes in a species. Here, we present the de novo assembly of the B. napus cultivar Tapidor and comparison with an improved assembly of the Brassica napus cultivar Darmor-bzh. Both cultivars were annotated using the same method to allow comparison of gene content. We identified genes unique to each cultivar and differentiate these from artefacts due to variation in the assembly and annotation. We demonstrate that using a common annotation pipeline can result in different gene predictions, even for closely related cultivars, and repeat regions which collapse during assembly impact whole genome comparison. After accounting for differences in assembly and annotation, we demonstrate that the genome of Darmor-bzh contains a greater number of genes than the genome of Tapidor. Our results are the first step towards comparison of the true differences between B. napus genomes and highlight the potential sources of error in future production of a B. napus pangenome. © 2017 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.

  10. Repeat-Associated Plasticity in the Helicobacter pylori RD Gene Family

    PubMed Central

    Shak, Joshua R.; Dick, Jonathan J.; Meinersmann, Richard J.; Perez-Perez, Guillermo I.; Blaser, Martin J.

    2009-01-01

    The bacterium Helicobacter pylori is remarkable for its ability to persist in the human stomach for decades without provoking sterilizing immunity. Since repetitive DNA can facilitate adaptive genomic flexibility via increased recombination, insertion, and deletion, we searched the genomes of two H. pylori strains for nucleotide repeats. We discovered a family of genes with extensive repetitive DNA that we have termed the H. pylori RD gene family. Each gene of this family is composed of a conserved 3′ region, a variable mid-region encoding 7 and 11 amino acid repeats, and a 5′ region containing one of two possible alleles. Analysis of five complete genome sequences and PCR genotyping of 42 H. pylori strains revealed extensive variation between strains in the number, location, and arrangement of RD genes. Furthermore, examination of multiple strains isolated from a single subject's stomach revealed intrahost variation in repeat number and composition. Despite prior evidence that the protein products of this gene family are expressed at the bacterial cell surface, enzyme-linked immunosorbent assay and immunoblot studies revealed no consistent seroreactivity to a recombinant RD protein by H. pylori-positive hosts. The pattern of repeats uncovered in the RD gene family appears to reflect slipped-strand mispairing or domain duplication, allowing for redundancy and subsequent diversity in genotype and phenotype. This novel family of hypervariable genes with conserved, repetitive, and allelic domains may represent an important locus for understanding H. pylori persistence in its natural host. PMID:19749042

  11. Repeat-associated plasticity in the Helicobacter pylori RD gene family.

    PubMed

    Shak, Joshua R; Dick, Jonathan J; Meinersmann, Richard J; Perez-Perez, Guillermo I; Blaser, Martin J

    2009-11-01

    The bacterium Helicobacter pylori is remarkable for its ability to persist in the human stomach for decades without provoking sterilizing immunity. Since repetitive DNA can facilitate adaptive genomic flexibility via increased recombination, insertion, and deletion, we searched the genomes of two H. pylori strains for nucleotide repeats. We discovered a family of genes with extensive repetitive DNA that we have termed the H. pylori RD gene family. Each gene of this family is composed of a conserved 3' region, a variable mid-region encoding 7 and 11 amino acid repeats, and a 5' region containing one of two possible alleles. Analysis of five complete genome sequences and PCR genotyping of 42 H. pylori strains revealed extensive variation between strains in the number, location, and arrangement of RD genes. Furthermore, examination of multiple strains isolated from a single subject's stomach revealed intrahost variation in repeat number and composition. Despite prior evidence that the protein products of this gene family are expressed at the bacterial cell surface, enzyme-linked immunosorbent assay and immunoblot studies revealed no consistent seroreactivity to a recombinant RD protein by H. pylori-positive hosts. The pattern of repeats uncovered in the RD gene family appears to reflect slipped-strand mispairing or domain duplication, allowing for redundancy and subsequent diversity in genotype and phenotype. This novel family of hypervariable genes with conserved, repetitive, and allelic domains may represent an important locus for understanding H. pylori persistence in its natural host.

  12. Some Like It Hot, Some Like It Warm: Phenotyping to Explore Thermotolerance Diversity

    PubMed Central

    Yeh, Ching-Hui; Kaplinsky, Nicholas J.; Hu, Catherine; Charng, Yee-yung

    2012-01-01

    Plants have evolved overlapping but distinct cellular responses to different aspects of high temperature stress. These responses include basal thermotolerance, short- and long-term acquired thermotolerance, and thermotolerance to moderately high temperatures. This ‘thermotolerance diversity’ means that multiple phenotypic assays are essential for fully describing the functions of genes involved in heat stress responses. A large number of genes with potential roles in heat stress responses have been identified using genetic screens and genome wide expression studies. We examine the range of phenotypic assays that have been used to characterize thermotolerance phenotypes in both Arabidopsis and crop plants. Three major variables differentiate thermotolerance assays: 1) the heat stress regime used, 2) the developmental stage of the plants being studied, and 3) the actual phenotype which is scored. Consideration of these variables will be essential for deepening our understanding of the molecular genetics of plant thermotolerance. PMID:22920995

  13. Chronic lymphocytic leukemia patients exposed to ionizing radiation due to the Chernobyl NPP accident--with focus on immunoglobulin heavy chain gene analysis.

    PubMed

    Abramenko, Iryna; Bilous, Nadia; Chumak, Anatoliy; Davidova, Ekaterina; Kryachok, Iryna; Martina, Zoya; Nechaev, Stanislav; Dyagil, Iryna; Bazyka, Dmytriy; Bebeshko, Vladimir

    2008-04-01

    Clinical data and immunoglobulin variable heavy chain (IgVH) gene configuration were analyzed in 47 CLL patients exposed to ionizing radiation (IR) due to the Chernobyl NPP accident, and in 141 non-exposed patients. Clean-up workers of the second quarter of 1986 (n=19) were singled out as a separate group with the highest number of unmutated cases (94.4%), increased usage of IgVH1-69 (33.3%) and IgVH3-21 (16.7%) genes, and a high frequency of secondary solid tumors (6 cases) and Richter transformation (4 cases). These preliminary data suggest that CLL in the group most affected by the Chernobyl NPP accident might have some specific features.

  14. HvFT1 polymorphism and effect—survey of barley germplasm and expression analysis

    PubMed Central

    Loscos, Jorge; Igartua, Ernesto; Contreras-Moreira, Bruno; Gracia, M. Pilar; Casas, Ana M.

    2014-01-01

    Flowering time in plants is a tightly regulated process. In barley (Hordeum vulgare L.), HvFT1, ortholog of FLOWERING LOCUS T, is the main integrator of the photoperiod and vernalization signals leading to the transition from vegetative to reproductive state of the plant. This gene presents sequence polymorphisms affecting flowering time in the first intron and in the promoter. Recently, copy number variation (CNV) has been described for this gene. An allele with more than one copy was linked to higher gene expression, earlier flowering, and an overriding effect of the vernalization mechanism. This study aims at (1) surveying the distribution of HvFT1 polymorphisms across barley germplasm and (2) assessing gene expression and phenotypic effects of HvFT1 alleles. We analyzed HvFT1 CNV in 109 winter, spring, and facultative barley lines. There was more than one copy of the gene (2–5) only in spring or facultative barleys without a functional vernalization VrnH2 allele. CNV was investigated in several regions inside and around HvFT1. Two models of the gene were found: one with the same number of promoters and transcribed regions, and another with one promoter and a variable number of transcribed regions. This last model was found in Nordic barleys only. Analysis of HvFT1 expression showed that association between known polymorphisms at the HvFT1 locus and the expression of the gene was highly dependent on the genetic background. Under long day conditions the earliest flowering lines carried a sensitive PpdH1 allele. Among spring cultivars with different numbers of copies, no clear relation was found between CNV, gene expression and flowering time. This was confirmed in a set of doubled haploid lines of a population segregating for HvFT1 CNV. Earlier flowering in the presence of several copies of HvFT1 was only seen in cultivar Tammi, which carries one promoter, suggesting a relation of gene structure with its regulation. HvCEN also affected flowering time to a large extent. PMID:24936204

  15. A Genome-Scale Investigation of How Sequence, Function, and Tree-Based Gene Properties Influence Phylogenetic Inference.

    PubMed

    Shen, Xing-Xing; Salichos, Leonidas; Rokas, Antonis

    2016-09-02

    Molecular phylogenetic inference is inherently dependent on choices in both methodology and data. Many insightful studies have shown how choices in methodology, such as the model of sequence evolution or optimality criterion used, can strongly influence inference. In contrast, much less is known about the impact of choices in the properties of the data, typically genes, on phylogenetic inference. We investigated the relationships between 52 gene properties (24 sequence-based, 19 function-based, and 9 tree-based) with each other and with three measures of phylogenetic signal in two assembled data sets of 2,832 yeast and 2,002 mammalian genes. We found that most gene properties, such as evolutionary rate (measured through the percent average of pairwise identity across taxa) and total tree length, were highly correlated with each other. Similarly, several gene properties, such as gene alignment length, Guanine-Cytosine content, and the proportion of tree distance on internal branches divided by relative composition variability (treeness/RCV), were strongly correlated with phylogenetic signal. Analysis of partial correlations between gene properties and phylogenetic signal, in which gene evolutionary rate and alignment length were simultaneously controlled, showed similar patterns of correlations, albeit weaker in strength. Examination of the relative importance of each gene property on phylogenetic signal identified gene alignment length, along with the number of parsimony-informative sites and variable sites, as the most important predictors. Interestingly, the subsets of gene properties that optimally predicted phylogenetic signal differed considerably across our three phylogenetic measures and two data sets; however, gene alignment length and RCV were consistently included as predictors of all three phylogenetic measures in both yeasts and mammals. These results suggest that a handful of sequence-based gene properties are reliable predictors of phylogenetic signal and could be useful in guiding the choice of phylogenetic markers. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
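
    The abstract mentions partial correlations with two properties controlled at the same time but does not say how they were computed. As an illustration only, the sketch below uses the textbook recursive formula for partial correlation. The function names and the toy data are hypothetical, and the authors may well have used a different estimator.

```python
from math import sqrt


def pearson(x, y):
    """Plain Pearson correlation; assumes both inputs have non-zero variance."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def partial_corr(x, y, controls):
    """Correlation of x and y after controlling for every series in `controls`.

    Applies the recursion
        r_xy.z = (r_xy - r_xz * r_yz) / sqrt((1 - r_xz^2) * (1 - r_yz^2)),
    peeling off one control variable at a time.
    """
    if not controls:
        return pearson(x, y)
    z, rest = controls[0], controls[1:]
    r_xy = partial_corr(x, y, rest)
    r_xz = partial_corr(x, z, rest)
    r_yz = partial_corr(y, z, rest)
    return (r_xy - r_xz * r_yz) / sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# Toy data: x and y are related only through the shared driver z,
# so the raw correlation is positive but the partial correlation vanishes.
z = [1, -1, 1, -1, 1, -1, 1, -1]
x = [2, 0, 0, -2, 2, 0, 0, -2]
y = [2, 0, 2, 0, 0, -2, 0, -2]
raw = pearson(x, y)
controlled = partial_corr(x, y, [z])
```

    Passing two series in `controls` gives the second-order case described in the abstract, for example a gene property against a signal measure with evolutionary rate and alignment length both held fixed.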

  16. A preliminary study of genetic diversity of MSP-1 types in Plasmodium falciparum in southern province of Sistan Baluchistan of Iran.

    PubMed

    Zahra, Zamani; Reza, Razavi Mohammad; Mehdi, Assmar; Sedigheh, Sadeghi; Fatemeh, Pourfallah; Nikoo, Nasoohi; Ashraf, Sheibani; Mohammad, Raisi

    2007-02-01

    Plasmodium falciparum merozoite surface protein-1 (MSP-1) shows extensive antigenic diversity. This is due to the presence of seven variable blocks, five semi-conserved and also five conserved blocks. The variable blocks in the MSP-1 gene are principally dimorphic, displaying either K1 or MAD20 type, except for the block 2 region, which is represented by three alleles, an RO33 type in addition to the other two. Allelic diversity is reported to be generated by intra-genic recombination between the variable blocks. A study of allelic variation of the MSP-1 gene in Plasmodium falciparum was carried out in the southern province of Sistan Baluchistan in Iran in 2001-2003. Samples were obtained from 30 febrile patients, DNA was extracted, and association types between blocks 2 and 6 were identified using primers specific to each block and compared with those from Vietnam, Brazil and Africa. The association types obtained were similar to, though fewer in number than, the ones from Vietnam, but more numerous than those from Africa and Brazil.

  17. Inter-laboratory analysis of selected genetically modified plant reference materials with digital PCR.

    PubMed

    Dobnik, David; Demšar, Tina; Huber, Ingrid; Gerdes, Lars; Broeders, Sylvia; Roosens, Nancy; Debode, Frederic; Berben, Gilbert; Žel, Jana

    2018-01-01

    Digital PCR (dPCR), as a new technology in the field of genetically modified (GM) organism (GMO) testing, enables determination of absolute target copy numbers. The purpose of our study was to test the transferability of methods designed for quantitative PCR (qPCR) to dPCR and to carry out an inter-laboratory comparison of the performance of two different dPCR platforms when determining the absolute GM copy numbers and GM copy number ratio in reference materials certified for GM content in mass fraction. Overall results in terms of measured GM% were within acceptable variation limits for both tested dPCR systems. However, the determined absolute copy numbers for individual genes or events showed higher variability between laboratories in one third of the cases, most likely due to variability in the technical work, droplet size variability, and analysis of the raw data. GMO quantification with dPCR and qPCR was comparable. As methods originally designed for qPCR performed well in dPCR systems, already validated qPCR assays can generally be used for dPCR technology with the purpose of GMO detection. Graphical abstract: The output of three different PCR-based platforms was assessed in an inter-laboratory comparison.

  18. Biasogram: Visualization of Confounding Technical Bias in Gene Expression Data

    PubMed Central

    Krzystanek, Marcin; Szallasi, Zoltan; Eklund, Aron C.

    2013-01-01

    Gene expression profiles of clinical cohorts can be used to identify genes that are correlated with a clinical variable of interest such as patient outcome or response to a particular drug. However, expression measurements are susceptible to technical bias caused by variation in extraneous factors such as RNA quality and array hybridization conditions. If such technical bias is correlated with the clinical variable of interest, the likelihood of identifying false positive genes is increased. Here we describe a method to visualize an expression matrix as a projection of all genes onto a plane defined by a clinical variable and a technical nuisance variable. The resulting plot indicates the extent to which each gene is correlated with the clinical variable or the technical variable. We demonstrate this method by applying it to three clinical trial microarray data sets, one of which identified genes that may have been driven by a confounding technical variable. This approach can be used as a quality control step to identify data sets that are likely to yield false positive results. PMID:23613961
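
    The abstract describes placing every gene on a plane spanned by a clinical variable and a technical nuisance variable. One simple way to approximate that idea, assuming Pearson correlation is used as the coordinate on each axis (the abstract does not specify the statistic), is sketched below. All names and values are hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length sequences.

    Assumes both sequences have non-zero variance; a constant gene would
    cause a division by zero and should be filtered out beforehand.
    """
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def biasogram_coordinates(expression, clinical, technical):
    """Map each gene to (correlation with clinical, correlation with technical).

    `expression` is a dict of gene name -> per-sample expression values,
    ordered the same way as `clinical` and `technical`.
    """
    return {
        gene: (pearson(values, clinical), pearson(values, technical))
        for gene, values in expression.items()
    }


# Hypothetical four-sample cohort: geneA tracks the clinical variable,
# geneB tracks the technical variable (for example an RNA quality score).
clinical = [1, 2, 3, 4]
technical = [4, 1, 3, 2]
expression = {"geneA": [1, 2, 3, 4], "geneB": [4, 1, 3, 2]}
coords = biasogram_coordinates(expression, clinical, technical)
```

    Plotting the two coordinates as a scatter gives the visual check the abstract refers to: genes that sit far along the technical axis are candidates for bias-driven false positives.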

  19. Intercellular Variability in Protein Levels from Stochastic Expression and Noisy Cell Cycle Processes

    PubMed Central

    Soltani, Mohammad; Vargas-Garcia, Cesar A.; Antunes, Duarte; Singh, Abhyudai

    2016-01-01

    Inside individual cells, expression of genes is inherently stochastic and manifests as cell-to-cell variability or noise in protein copy numbers. Since protein half-lives can be comparable to the cell-cycle length, randomness in cell-division times generates additional intercellular variability in protein levels. Moreover, as many mRNA/protein species are expressed at low-copy numbers, errors incurred in partitioning of molecules between two daughter cells are significant. We derive analytical formulas for the total noise in protein levels when the cell-cycle duration follows a general class of probability distributions. Using a novel hybrid approach, the total noise is decomposed into components arising from i) stochastic expression; ii) partitioning errors at the time of cell division and iii) random cell-division events. These formulas reveal that random cell-division times not only generate additional extrinsic noise, but also critically affect the mean protein copy numbers and intrinsic noise components. Counterintuitively, in some parameter regimes, noise in protein levels can decrease as cell-division times become more stochastic. Computations are extended to consider genome duplication, where transcription rate is increased at a random point in the cell cycle. We systematically investigate how the timing of genome duplication influences different protein noise components. Intriguingly, results show that noise contribution from stochastic expression is minimized at an optimal genome-duplication time. Our theoretical results motivate new experimental methods for decomposing protein noise levels from synchronized and asynchronized single-cell expression data. Characterizing the contributions of individual noise mechanisms will lead to precise estimates of gene expression parameters and techniques for altering stochasticity to change phenotype of individual cells. PMID:27536771

  20. Resurgence of Pertussis and Emergence of the Ptxp3 Toxin Promoter Allele in South Italy.

    PubMed

    Loconsole, Daniela; De Robertis, Anna Lisa; Morea, Anna; Metallo, Angela; Lopalco, Pier Luigi; Chironna, Maria

    2018-05-01

    Despite universal immunization programs, pertussis remains a major public health concern. This study aimed to describe the pertussis epidemiology in the Puglia region in 2006-2015 and to identify recent polymorphisms in Bordetella pertussis virulence-associated genes. The pertussis cases in 2006-2015 were identified from the National Hospital Discharge Database and the Information System of Infectious Diseases. Samples of pertussis cases in 2014-2016 that were confirmed by the Regional Reference Laboratory were subjected to ptxA, ptxP and prn gene sequencing and, in 10 cases, multiple-locus variable-number tandem repeat analysis. In Puglia in 2006-2015, the pertussis incidence rose from an average of 1.39/100,000 inhabitants in 2006-2013 to 2.56-2.54/100,000 in 2014-2015. In infants <1 year of age, the incidence rose from an average of 60.4/100,000 infants in 2006-2013 to 149.9/100,000 in 2015. Of the 661 cases recorded in 2006-2015, 80.3% required hospitalization; of these, 45.4% were <1 year of age. Of the 80 sequenced samples, the allelic profile ptxA1-ptxP3-prn2 was detected in 74. This variant was detected in both vaccinated and unvaccinated people. Six Bordetella pertussis samples were prn deficient. The multiple-locus variable-number tandem repeat analysis cases exhibited multiple-locus variable-number tandem repeat analysis-type 27. The pertussis incidence in Puglia has risen. The hypervirulent strain was also found in vaccinated people. This suggests bacterial adaptation to the vaccine and raises questions about acellular vaccine effectiveness. Prevention of infant pertussis cases is best achieved by immunizing the pregnant mother. Enhanced surveillance and systematic laboratory confirmation of pertussis should be improved in Italy.

  1. Pseudomonas stutzeri Nitrite Reductase Gene Abundance in Environmental Samples Measured by Real-Time PCR

    PubMed Central

    Grüntzig, Verónica; Nold, Stephen C.; Zhou, Jizhong; Tiedje, James M.

    2001-01-01

    We used real-time PCR to quantify the denitrifying nitrite reductase gene (nirS), a functional gene of biogeochemical significance. The assay was tested in vitro and applied to environmental samples. The primer-probe set selected was specific for nirS sequences that corresponded approximately to the Pseudomonas stutzeri species. The assay was linear from 1 to 10^6 gene copies (r^2 = 0.999). Variability at low gene concentrations did not allow detection of twofold differences in gene copy number at less than 100 copies. DNA spiking and cell-addition experiments gave predicted results, suggesting that this assay provides an accurate measure of P. stutzeri nirS abundance in environmental samples. Although P. stutzeri abundance was high in lake sediment and groundwater samples, we detected low or no abundance of this species in marine sediment samples from Puget Sound (Wash.) and from the Washington ocean margin. These results suggest that P. stutzeri may not be a dominant marine denitrifier. PMID:11157241
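
    The linearity claim in this abstract refers to a standard curve of threshold cycle against the logarithm of the template copy number. A minimal sketch of how such a curve is typically fitted and then inverted to quantify an unknown sample follows. The function names and the slope and intercept used in the example are hypothetical, not values reported by the study.

```python
from math import log10


def fit_standard_curve(copies, ct_values):
    """Least-squares fit of Ct = slope * log10(copies) + intercept.

    Returns (slope, intercept, r_squared).
    """
    x = [log10(c) for c in copies]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(ct_values) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in ct_values)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, ct_values))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r_squared = sxy ** 2 / (sxx * syy)
    return slope, intercept, r_squared


def copies_from_ct(ct, slope, intercept):
    """Invert the standard curve to estimate copies in an unknown sample."""
    return 10 ** ((ct - intercept) / slope)


def amplification_efficiency(slope):
    """Efficiency implied by the slope; 1.0 means perfect doubling per cycle."""
    return 10 ** (-1.0 / slope) - 1.0


# Hypothetical dilution series spanning 1 to 10^6 copies.
standards = [10.0 ** k for k in range(7)]
measured_ct = [38.0 - 3.32 * k for k in range(7)]
slope, intercept, r_squared = fit_standard_curve(standards, measured_ct)
```

    With a slope near -3.32 a twofold difference in template shifts Ct by only about one cycle, which is why replicate scatter at low copy numbers can mask twofold differences, as the abstract notes.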

  2. The computational core and fixed point organization in Boolean networks

    NASA Astrophysics Data System (ADS)

    Correale, L.; Leone, M.; Pagnani, A.; Weigt, M.; Zecchina, R.

    2006-03-01

    In this paper, we analyse large random Boolean networks in terms of a constraint satisfaction problem. We first develop an algorithmic scheme which allows us to prune simple logical cascades and underdetermined variables, returning thereby the computational core of the network. Second, we apply the cavity method to analyse the number and organization of fixed points. We find in particular a phase transition between an easy and a complex regulatory phase, the latter being characterized by the existence of an exponential number of macroscopically separated fixed point clusters. The different techniques developed are reinterpreted as algorithms for the analysis of single Boolean networks, and they are applied in the analysis of and in silico experiments on the gene regulatory networks of baker's yeast (Saccharomyces cerevisiae) and the segment-polarity genes of the fruitfly Drosophila melanogaster.
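
    To make the notion of a fixed point concrete, the sketch below enumerates the fixed points of a small Boolean network by brute force. This is not the pruning scheme or the cavity method used in the paper: it checks all 2^n states and is only practical for small n. The three-node network is a hypothetical example.

```python
from itertools import product


def fixed_points(update_rules):
    """Return every state that the synchronous update maps onto itself.

    `update_rules[i]` is a function taking the full state tuple and
    returning the next Boolean value of node i.
    """
    n = len(update_rules)
    found = []
    for state in product((False, True), repeat=n):
        # A fixed point satisfies x_i = f_i(x) for every node i.
        if all(rule(state) == state[i] for i, rule in enumerate(update_rules)):
            found.append(state)
    return found


# Hypothetical network: nodes 0 and 1 copy each other,
# node 2 is on only when node 0 is on and node 1 is off.
example_rules = [
    lambda s: s[1],
    lambda s: s[0],
    lambda s: s[0] and not s[1],
]
example_fixed_points = fixed_points(example_rules)
```

    Here each fixed-point condition acts as one constraint on the state, which is the sense in which the abstract treats the problem as constraint satisfaction.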

  3. The evaluation of angiotensin-converting enzyme (ACE) gene I/D and IL-4 gene intron 3 VNTR polymorphisms in coronary artery disease.

    PubMed

    Basol, Nursah; Celik, Atac; Karakus, Nevin; Ozturk, Sibel Demir; Ozsoy, Sibel Demir; Yigit, Serbulent

    2014-01-01

    Genetic polymorphism is a strong risk factor for coronary artery disease (CAD). In the present study, our aim was to evaluate angiotensin-converting enzyme (ACE) gene I/D polymorphism and interleukin-4 (IL-4) gene intron 3 variable number of tandem repeat (VNTR) polymorphism in CAD. One hundred and twenty-four CAD patients and one hundred and twenty-three controls were enrolled. Genomic DNA was isolated and genotyped using polymerase chain reaction (PCR) analyses. The risk associated with inheriting the combined genotypes for the two polymorphisms was evaluated, and it was found that individuals who were P2P2-homozygous at IL-4 gene intron 3 VNTR and DD-homozygous at ACE gene I/D have a higher risk of developing CAD. Although neither the IL-4 VNTR polymorphism nor the ACE gene polymorphism alone was correlated with CAD, there is a strong association between CAD and co-existence of IL-4 VNTR and ACE gene polymorphisms in the Turkish population. Copyright © 2014 International Institute of Anticancer Research (Dr. John G. Delinassios), All rights reserved.

  4. Genetic Relatedness of Clostridium difficile Isolates from Various Origins Determined by Triple-Locus Sequence Analysis Based on Toxin Regulatory Genes tcdC, tcdR, and cdtR

    PubMed Central

    Bouvet, Philippe J. M.; Popoff, Michel R.

    2008-01-01

    A triple-locus nucleotide sequence analysis based on toxin regulatory genes tcdC, tcdR and cdtR was initiated to assess the sequence variability of these genes among Clostridium difficile isolates and to study the genetic relatedness between isolates. A preliminary investigation of the variability of the tcdC gene was done with 57 clinical and veterinary isolates. Twenty-three isolates representing nine main clusters were selected for tcdC, tcdR, and cdtR analysis. The numbers of alleles found for tcdC, tcdR and cdtR were nine, six, and five, respectively. All strains possessed the cdtR gene except toxin A-negative toxin B-positive variants. All but one binary toxin CDT-positive isolate harbored a deletion (>1 bp) in the tcdC gene. The combined analyses of the three genes allowed us to distinguish five lineages correlated with the different types of deletion in tcdC, i.e., 18 bp (associated or not with a deletion at position 117), 36 bp, 39 bp, and 54 bp, and with the wild-type tcdC (no deletion). The tcdR and tcdC genes, though located within the same pathogenicity locus, were found to have evolved separately. Coevolution of the three genes was noted only with strains harboring a 39-bp or a 54-bp deletion in tcdC that formed two homogeneous, separate divergent clusters. Our study supported the existence of the known clones (PCR ribotype 027 isolates and toxin A-negative toxin B-positive C. difficile variants) and evidence for clonality of isolates with a 39-bp deletion (toxinotype V, PCR ribotype 078) that are frequently isolated worldwide from human infections and from food animals. PMID:18832125

  5. Recurrent Rearrangements of Human Amylase Genes Create Multiple Independent CNV Series.

    PubMed

    Shwan, Nzar A A; Louzada, Sandra; Yang, Fengtang; Armour, John A L

    2017-05-01

    The human amylase gene cluster includes the human salivary (AMY1) and pancreatic amylase genes (AMY2A and AMY2B), and is a highly variable and dynamic region of the genome. Copy number variation (CNV) of AMY1 has been implicated in human dietary adaptation, and in population association with obesity, but neither of these findings has been independently replicated. Despite these functional implications, the structural genomic basis of CNV has only been defined in detail very recently. In this work, we use high-resolution analysis of copy number, and analysis of segregation in trios, to define new, independent allelic series of amylase CNVs in sub-Saharan Africans, including a series of higher-order expansions of a unit consisting of one copy each of AMY1, AMY2A, and AMY2B. We use fiber-FISH (fluorescence in situ hybridization) to define unexpected complexity in the accompanying rearrangements. These findings demonstrate recurrent involvement of the amylase gene region in genomic instability, involving at least five independent rearrangements of the pancreatic amylase genes (AMY2A and AMY2B). Structural features shared by fundamentally distinct lineages strongly suggest that the common ancestral state for the human amylase cluster contained more than one, and probably three, copies of AMY1. © 2017 WILEY PERIODICALS, INC.

  6. Simulated maximum likelihood method for estimating kinetic rates in gene expression.

    PubMed

    Tian, Tianhai; Xu, Songlin; Gao, Junbin; Burrage, Kevin

    2007-01-01

    Kinetic rate in gene expression is a key measurement of the stability of gene products and gives important information for the reconstruction of genetic regulatory networks. Recent developments in experimental technologies have made it possible to measure the numbers of transcripts and protein molecules in single cells. Although estimation methods based on deterministic models have been proposed for evaluating kinetic rates from experimental observations, these methods cannot tackle noise in gene expression that may arise from discrete processes of gene expression, small numbers of mRNA transcripts, fluctuations in the activity of transcriptional factors and variability in the experimental environment. In this paper, we develop effective methods for estimating kinetic rates in genetic regulatory networks. The simulated maximum likelihood method is used to evaluate parameters in stochastic models described by either stochastic differential equations or discrete biochemical reactions. Different types of non-parametric density functions are used to measure the transitional probability of experimental observations. For stochastic models described by biochemical reactions, we propose to use the simulated frequency distribution to evaluate the transitional density based on the discrete nature of stochastic simulations. The genetic optimization algorithm is used as an efficient tool to search for optimal reaction rates. Numerical results indicate that the proposed methods can give robust estimations of kinetic rates with good accuracy.

  7. DM-BLD: differential methylation detection using a hierarchical Bayesian model exploiting local dependency.

    PubMed

    Wang, Xiao; Gu, Jinghua; Hilakivi-Clarke, Leena; Clarke, Robert; Xuan, Jianhua

    2017-01-15

    The advent of high-throughput DNA methylation profiling techniques has enabled accurate identification of differentially methylated genes for cancer research. The large number of measured loci facilitates whole-genome methylation studies, yet poses great challenges for differential methylation detection due to the high variability in tumor samples. We have developed a novel probabilistic approach, Differential Methylation detection using a hierarchical Bayesian model exploiting Local Dependency (DM-BLD), to detect differentially methylated genes based on a Bayesian framework. The DM-BLD approach features a joint model to capture both the local dependency of measured loci and the dependency of methylation change in samples. Specifically, the local dependency is modeled by a Leroux conditional autoregressive structure; the dependency of methylation changes is modeled by a discrete Markov random field. A hierarchical Bayesian model is developed to fully take into account the local dependency for differential analysis, in which differential states are embedded as hidden variables. Simulation studies demonstrate that DM-BLD outperforms existing methods for differential methylation detection, particularly when the methylation change is moderate and the variability of methylation in samples is high. DM-BLD has been applied to breast cancer data to identify important methylated genes (such as polycomb target genes and genes involved in transcription factor activity) associated with breast cancer recurrence. A Matlab package of DM-BLD is available at http://www.cbil.ece.vt.edu/software.htm (contact: Xuan@vt.edu). Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved.

  8. Copy Number Alterations and Methylation in Ewing's Sarcoma

    PubMed Central

    Jahromi, Mona S.; Jones, Kevin B.; Schiffman, Joshua D.

    2011-01-01

    Ewing's sarcoma is the second most common bone malignancy affecting children and young adults. The prognosis is especially poor in metastatic or relapsed disease. The cell of origin remains elusive, but the EWS-FLI1 fusion oncoprotein is present in the majority of cases. The understanding of the molecular basis of Ewing's sarcoma continues to progress slowly. EWS-FLI1 affects gene expression, but other factors must also be at work such as mutations, gene copy number alterations, and promoter methylation. This paper explores in depth two molecular aspects of Ewing's sarcoma: copy number alterations (CNAs) and methylation. While CNAs consistently have been reported in Ewing's sarcoma, their clinical significance has been variable, most likely due to small sample size and tumor heterogeneity. Methylation is thought to be important in oncogenesis and balanced karyotype cancers such as Ewing's, yet it has received only minimal attention in prior studies. Future CNA and methylation studies will help to understand the molecular basis of this disease. PMID:21437220

  9. Involvement of the major histocompatibility complex region in the genetic regulation of circulating CD8 T-cell numbers in humans.

    PubMed

    Cruz, E; Vieira, J; Gonçalves, R; Alves, H; Almeida, S; Rodrigues, P; Lacerda, R; Porto, G

    2004-07-01

    Variability in T-lymphocyte numbers is partially explained by genetic regulation. From studies in animal models, it is known that the Major Histocompatibility Complex (MHC) is involved in this regulation. In humans, this has not been shown yet. The objective of the present study was to test the hypothesis that genes in the MHC region influence the regulation of T-lymphocyte numbers. Two approaches were used. The first comprised association studies between T-cell counts (CD4(+) and CD8(+)) or total lymphocyte counts and HLA class I alleles (A and B) or mutations in HFE (C282Y and H63D), the hemochromatosis gene, in an unrelated population (n = 264). The second was a sibpair correlation analysis of the same T-cell counts in relation to HLA-HFE haplotypes in subjects belonging to 48 hemochromatosis families (n = 456 sibpairs). In the normal population, results showed a strong statistically significant association of HLA-A*01 with high numbers of CD8(+) T cells and a weaker association of HLA-A*24 with low numbers of CD8(+) T cells. Sibpair correlations revealed the most significant correlation for CD8(+) T-cell numbers for sibpairs with HLA-HFE-identical haplotypes. This was not observed for CD4(+) T cells. These results show that the MHC region is involved in the genetic regulation of CD8(+) T-cell numbers in humans. Identification of genes responsible for this control may have important biological and clinical implications.

  10. Whole-genome sequencing identifies recurrent mutations in chronic lymphocytic leukaemia

    PubMed Central

    Puente, Xose S.; Pinyol, Magda; Quesada, Víctor; Conde, Laura; Ordóñez, Gonzalo R.; Villamor, Neus; Escaramis, Georgia; Jares, Pedro; Beà, Sílvia; González-Díaz, Marcos; Bassaganyas, Laia; Baumann, Tycho; Juan, Manel; López-Guerra, Mónica; Colomer, Dolors; Tubío, José M. C.; López, Cristina; Navarro, Alba; Tornador, Cristian; Aymerich, Marta; Rozman, María; Hernández, Jesús M.; Puente, Diana A.; Freije, José M. P.; Velasco, Gloria; Gutiérrez-Fernández, Ana; Costa, Dolors; Carrió, Anna; Guijarro, Sara; Enjuanes, Anna; Hernández, Lluís; Yagüe, Jordi; Nicolás, Pilar; Romeo-Casabona, Carlos M.; Himmelbauer, Heinz; Castillo, Ester; Dohm, Juliane C.; de Sanjosé, Silvia; Piris, Miguel A.; de Alava, Enrique; Miguel, Jesús San; Royo, Romina; Gelpí, Josep L.; Torrents, David; Orozco, Modesto; Pisano, David G.; Valencia, Alfonso; Guigó, Roderic; Bayés, Mónica; Heath, Simon; Gut, Marta; Klatt, Peter; Marshall, John; Raine, Keiran; Stebbings, Lucy A.; Futreal, P. Andrew; Stratton, Michael R.; Campbell, Peter J.; Gut, Ivo; López-Guillermo, Armando; Estivill, Xavier; Montserrat, Emili; López-Otín, Carlos; Campo, Elías

    2012-01-01

    Chronic lymphocytic leukaemia (CLL), the most frequent leukaemia in adults in Western countries, is a heterogeneous disease with variable clinical presentation and evolution [1,2]. Two major molecular subtypes can be distinguished, characterized respectively by a high or low number of somatic hypermutations in the variable region of immunoglobulin genes [3,4]. The molecular changes leading to the pathogenesis of the disease are still poorly understood. Here we performed whole-genome sequencing of four cases of CLL and identified 46 somatic mutations that potentially affect gene function. Further analysis of these mutations in 363 patients with CLL identified four genes that are recurrently mutated: notch 1 (NOTCH1), exportin 1 (XPO1), myeloid differentiation primary response gene 88 (MYD88) and kelch-like 6 (KLHL6). Mutations in MYD88 and KLHL6 are predominant in cases of CLL with mutated immunoglobulin genes, whereas NOTCH1 and XPO1 mutations are mainly detected in patients with unmutated immunoglobulins. The patterns of somatic mutation, supported by functional and clinical analyses, strongly indicate that the recurrent NOTCH1, MYD88 and XPO1 mutations are oncogenic changes that contribute to the clinical evolution of the disease. To our knowledge, this is the first comprehensive analysis of CLL combining whole-genome sequencing with clinical characteristics and clinical outcomes. It highlights the usefulness of this approach for the identification of clinically relevant mutations in cancer. PMID:21642962

  11. Complete mitochondrial genomes of Trisidos kiyoni and Potiarca pilula: Varied mitochondrial genome size and highly rearranged gene order in Arcidae

    PubMed Central

    Sun, Shao’e; Li, Qi; Kong, Lingfeng; Yu, Hong

    2016-01-01

    We present the complete mitochondrial genomes (mitogenomes) of Trisidos kiyoni and Potiarca pilula, both important species from the family Arcidae (Arcoida: Arcacea). Typical bivalve mtDNA features were described, such as the relatively conserved gene number (36 and 37), a high A + T content (62.73% and 61.16%), the preference for A + T-rich codons, and the evidence of non-optimal codon usage. The mitogenomes of Arcidae species are exceptional for their extraordinarily large and variable sizes and substantial gene rearrangements. The mitogenomes of T. kiyoni (19,614 bp) and P. pilula (28,470 bp) are the two smallest Arcidae mitogenomes. The compact mitogenomes are weakly associated with gene number and primarily reflect shrinkage of the non-coding regions. The varied size of Arcidae mitogenomes reflects a dynamic history of expansion. A significant positive correlation is observed between mitogenome size and the combined length of cox1-3, the length of Cytb, and the combined length of rRNAs (rrnS and rrnL) (P < 0.001). Rearrangements of both protein coding genes (PCGs) and tRNAs are observed in the P. pilula and T. kiyoni mitogenomes. This analysis implies that the complicated gene rearrangement in the mitochondrial genome could be considered one of the key characters in inferring higher-level phylogenetic relationships of Arcidae. PMID:27653979

  12. IgVH gene analysis suggests that peritoneal B cells do not contribute to the gut immune system in man.

    PubMed

    Boursier, Laurent; Farstad, Inger Nina; Mellembakken, Jan Roar; Brandtzaeg, Per; Spencer, Jo

    2002-09-01

    The contribution of peritoneal B cells to the intestinal lamina propria plasma cell population is well documented in mice, but unknown in humans. We have analyzed immunoglobulin (Ig) genes of human peritoneal B cells, because such genes show distinctive characteristics in mucosal B cells, particularly highly mutated variable regions. Here, we report the characteristics of variable region genes used by IgM, IgA and IgG in peritoneal cells. We focused on the properties of IgV(H)4-34 to allow comparisons of like-with-like between different isotypes and cells from different immune compartments. We observed that the IgM genes were mostly unmutated, and that the mutated subset had fewer mutations than would be expected in a mucosal B cell population. Likewise, the IgV(H)4-34 genes used by IgA and IgG from peritoneal B cells had significantly lower numbers of mutations than observed in the mucosal counterparts. Other trends observed, while not reaching statistical significance, followed the trend of peripheral B cells. The peritoneal B cell population had more IgA1 than IgA2 sequences, and there was no dominance of J(H)4 in the IgA from peritoneum or spleen, in contrast to the mucosal sequences. Overall, this study suggested that human peritoneal B cells are either peripheral or mixed in origin; they are unlikely to represent an inductive compartment for the mucosal B cell system.

  13. Gene expression models for prediction of longitudinal dispersion coefficient in streams

    NASA Astrophysics Data System (ADS)

    Sattar, Ahmed M. A.; Gharabaghi, Bahram

    2015-05-01

    Longitudinal dispersion is the key hydrologic process that governs transport of pollutants in natural streams. It is critical for spill action centers to be able to predict the pollutant travel time and break-through curves accurately following accidental spills in urban streams. This study presents a novel gene expression model for longitudinal dispersion developed using 150 published data sets of geometric and hydraulic parameters in natural streams in the United States, Canada, Europe, and New Zealand. The training and testing of the model were accomplished using randomly-selected 67% (100 data sets) and 33% (50 data sets) of the data sets, respectively. Gene expression programming (GEP) is used to develop empirical relations between the longitudinal dispersion coefficient and various control variables, including the Froude number which reflects the effect of reach slope, aspect ratio, and the bed material roughness on the dispersion coefficient. Two GEP models have been developed, and the prediction uncertainties of the developed GEP models are quantified and compared with those of existing models, showing improved prediction accuracy in favor of GEP models. Finally, a parametric analysis is performed for further verification of the developed GEP models. The main reason for the higher accuracy of the GEP models compared to the existing regression models is that exponents of the key variables (aspect ratio and bed material roughness) are not constants but a function of the Froude number. The proposed relations are both simple and accurate and can be effectively used to predict the longitudinal dispersion coefficients in natural streams.
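
    The random 67%/33% partition described above (100 training and 50 testing data sets out of 150) could be reproduced along the following lines. This is a minimal sketch: the integer identifiers stand in for the published data sets, and the function name and fixed seed are assumptions made for illustration, not details from the study.

```python
import random


def split_data_sets(records, n_train, seed=0):
    """Shuffle a copy of the records and cut it into training and testing parts."""
    rng = random.Random(seed)   # fixed seed so the split is repeatable
    shuffled = list(records)    # copy, so the caller's list is left untouched
    rng.shuffle(shuffled)
    return shuffled[:n_train], shuffled[n_train:]


# Placeholder identifiers standing in for the 150 published data sets.
records = list(range(150))
train, test = split_data_sets(records, n_train=100)
print(len(train), len(test))
```

    Fixing the seed keeps the partition stable between runs, which matters when prediction uncertainties of several models are compared on the same held-out data.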

  14. Rare copy number variants and congenital heart defects in the 22q11.2 deletion syndrome.

    PubMed

    Mlynarski, Elisabeth E; Xie, Michael; Taylor, Deanne; Sheridan, Molly B; Guo, Tingwei; Racedo, Silvia E; McDonald-McGinn, Donna M; Chow, Eva W C; Vorstman, Jacob; Swillen, Ann; Devriendt, Koen; Breckpot, Jeroen; Digilio, Maria Cristina; Marino, Bruno; Dallapiccola, Bruno; Philip, Nicole; Simon, Tony J; Roberts, Amy E; Piotrowicz, Małgorzata; Bearden, Carrie E; Eliez, Stephan; Gothelf, Doron; Coleman, Karlene; Kates, Wendy R; Devoto, Marcella; Zackai, Elaine; Heine-Suñer, Damian; Goldmuntz, Elizabeth; Bassett, Anne S; Morrow, Bernice E; Emanuel, Beverly S

    2016-03-01

    The 22q11.2 deletion syndrome (22q11DS; velocardiofacial/DiGeorge syndrome; VCFS/DGS; MIM #192430; 188400) is the most common microdeletion syndrome. The phenotypic presentation of 22q11DS is highly variable; approximately 60-75% of 22q11DS patients have been reported to have a congenital heart defect (CHD), mostly of the conotruncal type, and/or aortic arch defect. The etiology of the cardiac phenotypic variability is not currently known for the majority of patients. We hypothesized that rare copy number variants (CNVs) outside the 22q11.2 deleted region may modify the risk of being born with a CHD in this sensitized population. Rare CNV analysis was performed using Affymetrix SNP Array 6.0 data from 946 22q11DS subjects with CHDs (n = 607) or with normal cardiac anatomy (n = 339). Although there was no significant difference in the overall burden of rare CNVs, an overabundance of CNVs affecting cardiac-related genes was detected in 22q11DS individuals with CHDs. When the rare CNVs were examined with regard to gene interactions, specific cardiac networks, such as Wnt signaling, appeared to be overrepresented in 22q11DS CHD cases but not in 22q11DS controls with a normal heart. Collectively, these data suggest that CNVs outside the 22q11.2 region may contain genes that modify risk for CHDs in some 22q11DS patients.

  15. The complete sequences and gene organisation of the mitochondrial genomes of the heterodont bivalves Acanthocardia tuberculata and Hiatella arctica – and the first record for a putative Atpase subunit 8 gene in marine bivalves

    PubMed Central

    Dreyer, Hermann; Steiner, Gerhard

    2006-01-01

    Background: Mitochondrial (mt) gene arrangement is highly variable among molluscs and especially among bivalves. Of the 30 complete molluscan mt-genomes published to date, only one is of a heterodont bivalve, although this is the most diverse taxon in terms of species numbers. We determined the complete sequence of the mitochondrial genomes of Acanthocardia tuberculata and Hiatella arctica (Mollusca, Bivalvia, Heterodonta) and describe their gene contents and genome organisations to assess the variability of these features among the Bivalvia and their value for phylogenetic inference. Results: The size of the mt-genome in Acanthocardia tuberculata is 16,104 base pairs (bp), and in Hiatella arctica 18,244 bp. The Acanthocardia mt-genome contains 12 of the typical protein coding genes, lacking the Atpase subunit 8 (atp8) gene, as in all published marine bivalves. In contrast, a complete atp8 gene is present in Hiatella arctica. In addition, we found a putative truncated atp8 gene when re-annotating the mt-genome of Venerupis philippinarum. Both mt-genomes reported here encode all genes on the same strand and have an additional trnM. In Acanthocardia several large non-coding regions are present. One of these contains 3.5 nearly identical copies of a 167 bp motif. In Hiatella, the 3' end of the NADH dehydrogenase subunit (nad)6 gene is duplicated together with the adjacent non-coding region. The gene arrangement of Hiatella is markedly different from all other known molluscan mt-genomes; that of Acanthocardia shows few identities with that of Venerupis philippinarum. Phylogenetic analyses on amino acid and nucleotide levels robustly support the Heterodonta and the sister group relationship of Acanthocardia and Venerupis. Monophyletic Bivalvia are resolved only by a Bayesian inference of the nucleotide data set. In all other analyses the two unionid species, being the only ones with genes located on both strands, do not group with the remaining bivalves. Conclusion: The two mt-genomes reported here add to and underline the high variability of gene order and presence of duplications in bivalve and molluscan taxa. Some genomic traits like the loss of the atp8 gene or the encoding of all genes on the same strand are homoplastic among the Bivalvia. These characters, gene order, and the nucleotide sequence data show considerable potential for resolving phylogenetic patterns at lower taxonomic levels. PMID:16948842

  16. ITEP: an integrated toolkit for exploration of microbial pan-genomes.

    PubMed

    Benedict, Matthew N; Henriksen, James R; Metcalf, William W; Whitaker, Rachel J; Price, Nathan D

    2014-01-03

    Comparative genomics is a powerful approach for studying variation in physiological traits as well as the evolution and ecology of microorganisms. Recent technological advances have enabled sequencing large numbers of related genomes in a single project, requiring computational tools for their integrated analysis. In particular, accurate annotations and identification of gene presence and absence are critical for understanding and modeling the cellular physiology of newly sequenced genomes. Although many tools are available to compare the gene contents of related genomes, new tools are necessary to enable close examination and curation of protein families from large numbers of closely related organisms, to integrate curation with the analysis of gain and loss, and to generate metabolic networks linking the annotations to observed phenotypes. We have developed ITEP, an Integrated Toolkit for Exploration of microbial Pan-genomes, to curate protein families, compute similarities to externally-defined domains, analyze gene gain and loss, and generate draft metabolic networks from one or more curated reference network reconstructions in groups of related microbial species among which the combination of core and variable genes constitutes their "pan-genomes". The ITEP toolkit consists of: (1) a series of modular command-line scripts for identification, comparison, curation, and analysis of protein families and their distribution across many genomes; (2) a set of Python libraries for programmatic access to the same data; and (3) pre-packaged scripts to perform common analysis workflows on a collection of genomes. ITEP's capabilities include de novo protein family prediction, ortholog detection, analysis of functional domains, identification of core and variable genes and gene regions, sequence alignments and tree generation, annotation curation, and the integration of cross-genome analysis and metabolic networks for study of metabolic network evolution. ITEP is a powerful, flexible toolkit for generation and curation of protein families. ITEP's modular design allows for straightforward extension as analysis methods and tools evolve. By integrating comparative genomics with the development of draft metabolic networks, ITEP harnesses the power of comparative genomics to build confidence in links between genotype and phenotype and helps disambiguate gene annotations when they are evaluated in both evolutionary and metabolic network contexts.
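
    The distinction between core and variable genes can be made concrete with a short sketch. It does not use ITEP's own scripts or Python libraries; the function name, the genome labels and the protein family identifiers below are hypothetical, and the only assumption is that each genome has already been reduced to the set of protein families it contains.

```python
def core_and_variable(families_by_genome):
    """Split a pan-genome into core and variable protein families.

    families_by_genome maps a genome name to the set of family IDs found in it.
    Core families occur in every genome; variable families occur in only some.
    """
    family_sets = [set(families) for families in families_by_genome.values()]
    if not family_sets:
        return set(), set()
    pan_genome = set().union(*family_sets)   # everything seen in any genome
    core = set.intersection(*family_sets)    # families present in all genomes
    variable = pan_genome - core             # the remainder
    return core, variable


# Hypothetical presence/absence data for three related genomes.
example = {
    "genome_A": {"fam1", "fam2", "fam3"},
    "genome_B": {"fam1", "fam2", "fam4"},
    "genome_C": {"fam1", "fam2"},
}
core, variable = core_and_variable(example)
print(sorted(core), sorted(variable))
```

    In practice the hard part is the upstream step of assigning genes to families; once that is curated, the core/variable split is a plain set operation as shown.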

  17. Efficient mitochondrial biogenesis drives incomplete penetrance in Leber’s hereditary optic neuropathy

    PubMed Central

    Iommarini, Luisa; Giordano, Luca; Maresca, Alessandra; Pisano, Annalinda; Valentino, Maria Lucia; Caporali, Leonardo; Liguori, Rocco; Deceglie, Stefania; Roberti, Marina; Fanelli, Francesca; Fracasso, Flavio; Ross-Cisneros, Fred N.; D’Adamo, Pio; Hudson, Gavin; Pyle, Angela; Yu-Wai-Man, Patrick; Chinnery, Patrick F.; Zeviani, Massimo; Salomao, Solange R.; Berezovsky, Adriana; Belfort, Rubens; Ventura, Dora Fix; Moraes, Milton; Moraes Filho, Milton; Barboni, Piero; Sadun, Federico; De Negri, Annamaria; Sadun, Alfredo A.; Tancredi, Andrea; Mancini, Massimiliano; d’Amati, Giulia; Loguercio Polosa, Paola; Cantatore, Palmiro

    2014-01-01

    Leber’s hereditary optic neuropathy is a maternally inherited blinding disease caused by homoplasmic point mutations in complex I subunit genes of mitochondrial DNA. It is characterized by incomplete penetrance, as only some mutation carriers become affected. Thus, the mitochondrial DNA mutation is necessary but not sufficient to cause optic neuropathy. Environmental triggers and genetic modifying factors have been considered to explain its variable penetrance. We measured the mitochondrial DNA copy number and mitochondrial mass indicators in blood cells from affected and carrier individuals, screening three large pedigrees and 39 independently collected smaller families with Leber’s hereditary optic neuropathy, as well as muscle biopsies and cells isolated by laser capture from post-mortem specimens of retina and optic nerves, the latter being the disease targets. We show that unaffected mutation carriers have a significantly higher mitochondrial DNA copy number and mitochondrial mass compared with their affected relatives and control individuals. Comparative studies of fibroblasts from affected individuals, carriers and controls, under different paradigms of metabolic demand, show that carriers display the highest capacity for activating mitochondrial biogenesis. Therefore we postulate that the increased mitochondrial biogenesis in carriers may overcome some of the pathogenic effect of mitochondrial DNA mutations. Screening of a few selected genetic variants in candidate genes involved in mitochondrial biogenesis failed to reveal any significant association. Our study provides a valuable mechanism to explain variability of penetrance in Leber’s hereditary optic neuropathy and clues for high throughput genetic screening to identify the nuclear modifying gene(s), opening an avenue to develop predictive genetic tests on disease risk and therapeutic strategies. PMID:24369379

  18. The (CA)n polymorphism of ERβ gene is associated with FtM transsexualism.

    PubMed

    Fernández, Rosa; Esteva, Isabel; Gómez-Gil, Esther; Rumbo, Teresa; Almaraz, Mari Cruz; Roda, Ester; Haro-Mora, Juan-Jesús; Guillamón, Antonio; Pásaro, Eduardo

    2014-03-01

    Transsexualism is a gender identity disorder with a multifactorial etiology. Neurodevelopmental processes and genetic factors seem to be implicated. The aim of this study was to investigate the possible influence of the sex hormone-related genes ERβ (estrogen receptor β), AR (androgen receptor), and CYP19A1 (aromatase) in the etiology of female-to-male (FtM) transsexualism. In 273 FtMs and 371 control females, we carried out a molecular analysis of three variable regions: the CA repeats in intron 5 of ERβ; the CAG repeats in exon 1 of AR, and the TTTA repeats in intron 4 of CYP19A1. We investigated the possible influence of genotype on transsexualism by performing a molecular analysis of the variable regions of genes ERβ, AR, and CYP19A1 in 644 individuals (FtMs and control females). FtMs differed significantly from the control group with respect to the median repeat length polymorphism ERβ (P = 0.002) but not with respect to the length of the other two studied polymorphisms. The repeat numbers in ERβ were significantly higher in FtMs than in the control group, and the likelihood of developing transsexualism was higher (odds ratio: 2.001 [1.15-3.46]) in the subjects with the genotype homozygous for long alleles. There is an association between the ERβ gene and FtM transsexualism. Our data support the finding that ERβ function is directly proportional to the size of the analyzed polymorphism, so a greater number of repeats implies greater transcription activation, possibly by increasing the function of the hormone-ERβ receptor complex and thereby encouraging less feminization or a defeminization of the female brain and behavior. © 2013 International Society for Sexual Medicine.
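
    An odds ratio with a bracketed interval, as reported above, is typically derived from a 2 × 2 table of genotype by group. The sketch below shows the standard calculation with a 95% confidence interval on the log scale (Woolf's method). The abstract does not state how its interval was obtained, and the counts used here are hypothetical, chosen only to show the arithmetic; they are not the study's data.

```python
import math


def odds_ratio_with_ci(a, b, c, d, z=1.96):
    """Odds ratio and confidence interval from a 2 x 2 table.

    a, b: cases with / without the genotype of interest
    c, d: controls with / without the genotype of interest
    z:    normal quantile (1.96 gives an approximate 95% interval)
    """
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) under Woolf's approximation.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts, NOT taken from the study.
estimate, lower, upper = odds_ratio_with_ci(a=30, b=20, c=20, d=30)
print(estimate, lower, upper)
```

    An interval whose lower bound stays above 1, as in the reported 1.15-3.46, indicates that the association is unlikely to be explained by sampling variation alone at the chosen confidence level.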

  19. High intralocus variability and interlocus recombination promote immunological diversity in a minimal major histocompatibility system.

    PubMed

    Wilson, Anthony B; Whittington, Camilla M; Bahr, Angela

    2014-12-20

    The genes of the major histocompatibility complex (MHC/MH) have attracted considerable scientific interest due to their exceptional levels of variability and important function as part of the adaptive immune system. Despite a large number of studies on MH class II diversity of both model and non-model organisms, most research has focused on patterns of genetic variability at individual loci, failing to capture the functional diversity of the biologically active dimeric molecule. Here, we take a systematic approach to the study of MH variation, analyzing patterns of genetic variation at MH class IIα and IIβ loci of the seahorse, which together form the immunologically active peptide binding cleft of the MH class II molecule. The seahorse carries a minimal class II system, consisting of single copies of both MH class IIα and IIβ, which are physically linked and inherited in a Mendelian fashion. Both genes are ubiquitously expressed and detectable in the brood pouch of male seahorses throughout pregnancy. Genetic variability of the two genes is high, dominated by non-synonymous variation concentrated in their peptide-binding regions. Coding variation outside these regions is negligible, a pattern thought to be driven by intra- and interlocus recombination. Despite the tight physical linkage of MH IIα and IIβ loci, recombination has produced novel composite alleles, increasing functional diversity at sites responsible for antigen recognition. Antigen recognition by the adaptive immune system of the seahorse is enhanced by high variability at both MH class IIα and IIβ loci. Strong positive selection on sites involved in pathogen recognition, coupled with high levels of intra- and interlocus recombination, produces a patchwork pattern of genetic variation driven by genetic hitchhiking. Studies focusing on variation at individual MH loci may unintentionally overlook an important component of ecologically relevant variation.

  20. Characterization of global loss of imprinting in fetal overgrowth syndrome induced by assisted reproduction

    PubMed Central

    Chen, Zhiyuan; Hagen, Darren E.; Elsik, Christine G.; Ji, Tieming; Morris, Collin James; Moon, Laura Emily; Rivera, Rocío Melissa

    2015-01-01

    Embryos generated with the use of assisted reproductive technologies (ART) can develop overgrowth syndromes. In ruminants, the condition is referred to as large offspring syndrome (LOS) and exhibits variable phenotypic abnormalities including overgrowth, enlarged tongue, and abdominal wall defects. These characteristics recapitulate those observed in the human loss-of-imprinting (LOI) overgrowth syndrome Beckwith–Wiedemann (BWS). We have recently shown LOI at the KCNQ1 locus in LOS, the most common epimutation in BWS. Although the first case of ART-induced LOS was reported in 1995, studies have not yet determined the extent of LOI in this condition. Here, we determined allele-specific expression of imprinted genes previously identified in human and/or mouse in day ∼105 Bos taurus indicus × Bos taurus taurus F1 hybrid control and LOS fetuses using RNAseq. Our analysis allowed us to determine the monoallelic expression of 20 genes in tissues of control fetuses. LOS fetuses displayed variable LOI compared with controls. Biallelic expression of imprinted genes in LOS was associated with tissue-specific hypomethylation of the normally methylated parental allele. In addition, a positive correlation was observed between body weight and the number of biallelically expressed imprinted genes in LOS fetuses. Furthermore, not only was there loss of allele-specific expression of imprinted genes in LOS, but also differential transcript amounts of these genes between control and overgrown fetuses. In summary, we characterized previously unidentified imprinted genes in bovines and identified misregulation of imprinting at multiple loci in LOS. We concluded that LOS is a multilocus LOI syndrome, as is BWS. PMID:25825726

  1. Characterization of global loss of imprinting in fetal overgrowth syndrome induced by assisted reproduction.

    PubMed

    Chen, Zhiyuan; Hagen, Darren E; Elsik, Christine G; Ji, Tieming; Morris, Collin James; Moon, Laura Emily; Rivera, Rocío Melissa

    2015-04-14

    Embryos generated with the use of assisted reproductive technologies (ART) can develop overgrowth syndromes. In ruminants, the condition is referred to as large offspring syndrome (LOS) and exhibits variable phenotypic abnormalities including overgrowth, enlarged tongue, and abdominal wall defects. These characteristics recapitulate those observed in the human loss-of-imprinting (LOI) overgrowth syndrome Beckwith-Wiedemann (BWS). We have recently shown LOI at the KCNQ1 locus in LOS, the most common epimutation in BWS. Although the first case of ART-induced LOS was reported in 1995, studies have not yet determined the extent of LOI in this condition. Here, we determined allele-specific expression of imprinted genes previously identified in human and/or mouse in day ∼105 Bos taurus indicus × Bos taurus taurus F1 hybrid control and LOS fetuses using RNAseq. Our analysis allowed us to determine the monoallelic expression of 20 genes in tissues of control fetuses. LOS fetuses displayed variable LOI compared with controls. Biallelic expression of imprinted genes in LOS was associated with tissue-specific hypomethylation of the normally methylated parental allele. In addition, a positive correlation was observed between body weight and the number of biallelically expressed imprinted genes in LOS fetuses. Furthermore, not only was there loss of allele-specific expression of imprinted genes in LOS, but also differential transcript amounts of these genes between control and overgrown fetuses. In summary, we characterized previously unidentified imprinted genes in bovines and identified misregulation of imprinting at multiple loci in LOS. We concluded that LOS is a multilocus LOI syndrome, as is BWS.

  2. Evolution of the genetic variability of eight French dairy cattle breeds assessed by pedigree analysis.

    PubMed

    Danchin-Burge, C; Leroy, G; Brochard, M; Moureaux, S; Verrier, E

    2012-06-01

    A pedigree analysis was performed on eight French dairy cattle breeds to assess their change in genetic variability since a first analysis completed in 1996. The Holstein, Normande and Montbéliarde breeds are selected internationally, with hundreds of thousands of cows registered in the performance recording system. Three breeds are internationally selected but with limited numbers of cows in France (Brown Swiss, French Simmental and French Red Pied). The two remaining breeds (Abondance and Tarentaise) are raised at regional level. The effective numbers of ancestors of cows born between 2004 and 2007 varied between 15 (Abondance and Tarentaise) and 51 (French Red Pied). The effective population sizes (classical approach) varied between 53 (Abondance) and 197 (French Red Pied). This article also compares the genetic variability of the ex situ (collections of the French National Cryobank) and in situ populations. The results are discussed with regard to the recent history of gene flows in the different breeds as well as the existence of more or less stringent bottlenecks. Our results showed that, whatever the size of the breeds, their genetic diversity has become impoverished quite rapidly since 1996 and they can all be considered quite poor from a genetic diversity point of view. This shows the need for setting up cryobanks as gene reservoirs as well as sustainable breeding programmes that include loss of genetic diversity as an integrated control parameter. © 2011 Blackwell Verlag GmbH.
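
    The two summary statistics quoted above can be illustrated with their usual textbook definitions. The abstract does not spell out its formulas, so the sketch assumes the common ones: an effective number of ancestors computed as the reciprocal of the sum of squared marginal genetic contributions, and an effective population size computed from the rate of inbreeding per generation as 1 / (2 ΔF). Function names and input values are hypothetical.

```python
def effective_number_of_ancestors(marginal_contributions):
    """Reciprocal of the sum of squared contributions (which should sum to 1)."""
    return 1.0 / sum(p * p for p in marginal_contributions)


def effective_population_size(delta_f):
    """Effective size from the rate of inbreeding per generation: 1 / (2 * delta_f)."""
    return 1.0 / (2.0 * delta_f)


# Four ancestors contributing equally behave like four effective ancestors.
print(effective_number_of_ancestors([0.25, 0.25, 0.25, 0.25]))

# Unequal contributions lower the effective number below the head count.
print(effective_number_of_ancestors([0.7, 0.1, 0.1, 0.1]))

# A hypothetical inbreeding rate of 0.5% per generation.
print(effective_population_size(0.005))
```

    Both measures fall when a few animals dominate the pedigree, which is why breeds with very large census sizes can still show low values.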

  3. NKG2C zygosity influences CD94/NKG2C receptor function and the NK-cell compartment redistribution in response to human cytomegalovirus.

    PubMed

    Muntasell, Aura; López-Montañés, María; Vera, Andrea; Heredia, Gemma; Romo, Neus; Peñafiel, Judith; Moraru, Manuela; Vila, Joan; Vilches, Carlos; López-Botet, Miguel

    2013-12-01

    Human cytomegalovirus (HCMV) infection promotes a persistent expansion of a functionally competent NK-cell subset expressing the activating CD94/NKG2C receptor. Factors underlying the wide variability of this effect observed in HCMV-seropositive healthy individuals and exacerbated in immunocompromised patients are uncertain. A deletion of the NKG2C gene has been reported, and an apparent relation of NKG2C genotype with circulating NKG2C(+) NK-cell numbers was observed in HCMV(+) children. We have assessed the influence of NKG2C gene dose on the NK-cell repertoire in a cohort of young healthy adults (N = 130, median age 19 years). Our results revealed a relation of NKG2C copy number with surface receptor levels and with NKG2C(+) NK-cell numbers in HCMV(+) subjects, independently of HLA-E dimorphism. Functional studies showed quantitative differences in signaling (i.e. iCa(2+) influx), degranulation, and IL-15-dependent proliferation, in response to NKG2C engagement, between NK cells from NKG2C(+/+) and hemizygous subjects. These observations provide a mechanistic interpretation of the way the NKG2C genotype influences steady-state NKG2C(+) NK-cell numbers, further supporting an active involvement of the receptor in the HCMV-induced reconfiguration of the NK-cell compartment. The putative implications of NKG2C zygosity for viral control and other clinical variables deserve attention. © 2013 WILEY‐VCH Verlag GmbH & Co. KGaA, Weinheim.

  4. Copy Number Alterations Associated with Acute Lymphoblastic Leukemia in Mexican Children. A report from The Mexican Inter-Institutional Group for the identification of the causes of childhood leukemia.

    PubMed

    Rosales-Rodríguez, Beatriz; Fernández-Ramírez, Fernando; Núñez-Enríquez, Juan Carlos; Velázquez-Wong, Ana Claudia; Medina-Sansón, Aurora; Jiménez-Hernández, Elva; Flores-Lujano, Janet; Peñaloza-González, José Gabriel; Espinosa-Elizondo, Rosa Martha; Pérez-Saldívar, María Luisa; Torres-Nava, José Refugio; Martín-Trejo, Jorge Alfonso; Martínez-Morales, Gabriela Bibiana; Bekker-Méndez, Vilma Carolina; Mejía-Aranguré, Juan Manuel; Rosas-Vargas, Haydee

    2016-11-01

    B-cell precursor acute lymphocytic leukemia (B-ALL) represents a worldwide public health issue. In particular, Mexico is one of the countries with the highest incidence of ALL in children. Among the multiple factors involved in ALL etiology, genetic alterations are clearly one of the most relevant features. In this work, a group of 24 B-ALL patients, all negative for the four most frequent gene fusions (ETV6-RUNX1, BCR-ABL1, TCF3-PBX1 and MLL-AF4), were included in a high-resolution microarray analysis in order to evaluate genomic copy-number alterations (CNAs). The results of this preliminary report showed a broad genomic heterogeneity among the studied samples: 58% of the patients were hyperdiploid and 33% displayed a chromosome 9p deletion of variable length affecting genes CDKN2A/B; two patients displayed genomic instability with a high number of focal CNAs; three patients presented unique duplications affecting 2q, 12p and 1q, respectively; and one patient displayed no copy number imbalances. The copy-number profile of 44 genes previously related to B-ALL was heterogeneous as well. Overall results highlight the need for a detailed description of the genetic alterations in ALL cancer cells in order to understand the molecular pathogenesis of the disease and to identify any prognostic markers with clinical significance. Copyright © 2016 IMSS. Published by Elsevier Inc. All rights reserved.

  5. CRISPR/Cas9-mediated gene knockout is insensitive to target copy number but is dependent on guide RNA potency and Cas9/sgRNA threshold expression level

    PubMed Central

    Yuen, Garmen; Khan, Fehad J.; Gao, Shaojian; Stommel, Jayne M.; Batchelor, Eric; Wu, Xiaolin

    2017-01-01

    CRISPR/Cas9 is a powerful gene editing tool for gene knockout studies and functional genomic screens. Successful implementation of CRISPR often requires Cas9 to elicit efficient target knockout in a population of cells. In this study, we investigated the role of several key factors, including variation in target copy number, inherent potency of sgRNA guides, and expression level of Cas9 and sgRNA, in determining CRISPR knockout efficiency. Using isogenic, clonal cell lines with variable copy numbers of an EGFP transgene, we discovered that CRISPR knockout is relatively insensitive to target copy number, but is highly dependent on the potency of the sgRNA guide sequence. Kinetic analysis revealed that most target mutation occurs between 5 and 10 days following Cas9/sgRNA transduction, while sgRNAs with different potencies differ by their knockout time course and by their terminal-phase knockout efficiency. We showed that prolonged, low level expression of Cas9 and sgRNA often fails to elicit target mutation, particularly if the potency of the sgRNA is also low. Our findings provide new insights into the behavior of CRISPR/Cas9 in mammalian cells that could be used for future improvement of this platform. PMID:29036671

  6. CRISPR/Cas9-mediated gene knockout is insensitive to target copy number but is dependent on guide RNA potency and Cas9/sgRNA threshold expression level.

    PubMed

    Yuen, Garmen; Khan, Fehad J; Gao, Shaojian; Stommel, Jayne M; Batchelor, Eric; Wu, Xiaolin; Luo, Ji

    2017-11-16

    CRISPR/Cas9 is a powerful gene editing tool for gene knockout studies and functional genomic screens. Successful implementation of CRISPR often requires Cas9 to elicit efficient target knockout in a population of cells. In this study, we investigated the role of several key factors, including variation in target copy number, inherent potency of sgRNA guides, and expression level of Cas9 and sgRNA, in determining CRISPR knockout efficiency. Using isogenic, clonal cell lines with variable copy numbers of an EGFP transgene, we discovered that CRISPR knockout is relatively insensitive to target copy number, but is highly dependent on the potency of the sgRNA guide sequence. Kinetic analysis revealed that most target mutation occurs between 5 and 10 days following Cas9/sgRNA transduction, while sgRNAs with different potencies differ by their knockout time course and by their terminal-phase knockout efficiency. We showed that prolonged, low level expression of Cas9 and sgRNA often fails to elicit target mutation, particularly if the potency of the sgRNA is also low. Our findings provide new insights into the behavior of CRISPR/Cas9 in mammalian cells that could be used for future improvement of this platform. Published by Oxford University Press on behalf of Nucleic Acids Research 2017.

  7. Molecular characterization of phosphorylcholine expression on the lipooligosaccharide of Histophilus somni.

    PubMed

    Elswaifi, Shaadi F; St Michael, Frank; Sreenivas, Avula; Cox, Andrew; Carman, George M; Inzana, Thomas J

    2009-10-01

    Histophilus somni (Haemophilus somnus) is an important pathogen of cattle that is responsible for respiratory disease, septicemia, and systemic diseases such as thrombotic meningoencephalitis, myocarditis, and abortion. A variety of virulence factors have been identified in H. somni, including compositional and antigenic variation of the lipooligosaccharide (LOS). Phosphorylcholine (ChoP) has been identified as one of the components of H. somni LOS that undergoes antigenic variation. In this study, five genes (lic1ABCD(Hs) and glpQ) with homology to genes responsible for ChoP expression in Haemophilus influenzae LOS were identified in the H. somni genome. An H. somni open reading frame (ORF) with homology to H. influenzae lic1A (lic1A(Hi)) contained a variable number of tandem repeats (VNTR). However, whereas the tetranucleotide repeat 5'-CAAT-3' is present in lic1A(Hi), the VNTR in H. somni lic1A (lic1A(Hs)) consisted of 5'-AACC-3'. Due to the propensity of VNTR to vary during replication and cause the ORF to shift in and out of frame with the upstream start codon, the VNTR were deleted from lic1A(Hs) to maintain the gene constitutively on. This construct was cloned into Escherichia coli, and functional enzyme assays confirmed that lic1A(Hs) encoded a choline kinase, and that the VNTR were not required for expression of a functional gene product. Variation in the number of VNTR in lic1A(Hs) correlated with antigenic variation of ChoP expression in H. somni strain 124P. However, antigenic variation of ChoP expression in strain 738 predominately occurred through variable extension/truncation of the LOS outer core. These results indicated that the lic1(Hs) genes controlled expression of ChoP on the LOS, but that in H. somni there are two potential mechanisms that account for antigenic variation of ChoP.
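
    The frame-shifting argument in this abstract can be illustrated with a small sketch. It assumes only what the abstract states (a tetranucleotide unit, 5'-AACC-3', whose copy number varies); the toy sequence, the reference repeat count of 18, and the function names are hypothetical and chosen purely for illustration. Because each unit adds 4 nt, the downstream reading frame is preserved only when the repeat count changes by a multiple of three.

```python
def count_tandem_repeats(seq: str, unit: str, start: int = 0) -> int:
    """Count consecutive copies of `unit` in `seq`, beginning at index `start`."""
    n = 0
    i = start
    while seq.startswith(unit, i):
        n += 1
        i += len(unit)
    return n


def frame_offset(n_repeats: int, n_reference: int, unit_len: int = 4) -> int:
    """Offset (0, 1 or 2 nt) of the downstream frame relative to a reference
    allele with `n_reference` repeats. 0 means the ORF stays in frame."""
    return ((n_repeats - n_reference) * unit_len) % 3


# Hypothetical toy ORF: start codon, five AACC units, then one more codon.
toy_orf = "ATG" + "AACC" * 5 + "GGT"
n_units = count_tandem_repeats(toy_orf, "AACC", start=3)

# Hypothetical in-frame reference allele with 18 repeats.
for n in (17, 18, 19, 21):
    state = "in frame" if frame_offset(n, 18) == 0 else "out of frame"
    print(f"{n} repeats: {state}")
```

    Under these assumptions, gaining or losing a single unit switches the gene off, while a change of three units restores the original frame, which is the kind of on/off switching the abstract attributes to the VNTR.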

  8. Molecular characterization of phosphorylcholine expression on the lipooligosaccharide of Histophilus somni

    PubMed Central

    Elswaifi, Shaadi F.; St. Michael, Frank; Sreenivas, Avula; Cox, Andrew; Carman, George M.; Inzana, Thomas J.

    2013-01-01

    Histophilus somni (Haemophilus somnus) is an important pathogen of cattle that is responsible for respiratory disease, septicemia, and systemic diseases such as thrombotic meningoencephalitis, myocarditis, and abortion. A variety of virulence factors have been identified in H. somni, including compositional and antigenic variation of the lipooligosaccharide (LOS). Phosphorylcholine (ChoP) has been identified as one of the components of H. somni LOS that undergoes antigenic variation. In this study, five genes (lic1ABCDHs and glpQ) with homology to genes responsible for ChoP expression in Haemophilus influenzae LOS were identified in the H. somni genome. An H. somni open reading frame (ORF) with homology to H. influenzae lic1A (lic1AHi) contained a variable number of tandem repeats (VNTR). However, whereas the tetranucleotide repeat 5′-CAAT-3′ is present in lic1AHi, the VNTR in H. somni lic1A (lic1AHs) consisted of 5′-AACC-3′. Due to the propensity of VNTR to vary during replication and cause the ORF to shift in and out of frame with the upstream start codon, the VNTR were deleted from lic1AHs to maintain the gene constitutively on. This construct was cloned into Escherichia coli, and functional enzyme assays confirmed that lic1AHs encoded a choline kinase, and that the VNTR were not required for expression of a functional gene product. Variation in the number of VNTR in lic1AHs correlated with antigenic variation of ChoP expression in H. somni strain 124P. However, antigenic variation of ChoP expression in strain 738 predominately occurred through variable extension/truncation of the LOS outer core. These results indicated that the lic1Hs genes controlled expression of ChoP on the LOS, but that in H. somni there are two potential mechanisms that account for antigenic variation of ChoP. PMID:19682567

  9. Comparison of seven techniques for typing international epidemic strains of Clostridium difficile: restriction endonuclease analysis, pulsed-field gel electrophoresis, PCR-ribotyping, multilocus sequence typing, multilocus variable-number tandem-repeat analysis, amplified fragment length polymorphism, and surface layer protein A gene sequence typing.

    PubMed

    Killgore, George; Thompson, Angela; Johnson, Stuart; Brazier, Jon; Kuijper, Ed; Pepin, Jacques; Frost, Eric H; Savelkoul, Paul; Nicholson, Brad; van den Berg, Renate J; Kato, Haru; Sambol, Susan P; Zukowski, Walter; Woods, Christopher; Limbago, Brandi; Gerding, Dale N; McDonald, L Clifford

    2008-02-01

    Using 42 isolates contributed by laboratories in Canada, The Netherlands, the United Kingdom, and the United States, we compared the results of analyses done with seven Clostridium difficile typing techniques: multilocus variable-number tandem-repeat analysis (MLVA), amplified fragment length polymorphism (AFLP), surface layer protein A gene sequence typing (slpAST), PCR-ribotyping, restriction endonuclease analysis (REA), multilocus sequence typing (MLST), and pulsed-field gel electrophoresis (PFGE). We assessed the discriminating ability and typeability of each technique as well as the agreement among techniques in grouping isolates by allele profile A (AP-A) through AP-F, which are defined by toxinotype, the presence of the binary toxin gene, and deletion in the tcdC gene. We found that all isolates were typeable by all techniques and that discrimination index scores for the techniques tested ranged from 0.964 to 0.631 in the following order: MLVA, REA, PFGE, slpAST, PCR-ribotyping, MLST, and AFLP. All the techniques were able to distinguish the current epidemic strain of C. difficile (BI/027/NAP1) from other strains. All of the techniques showed multiple types for AP-A (toxinotype 0, binary toxin negative, and no tcdC gene deletion). REA, slpAST, MLST, and PCR-ribotyping all included AP-B (toxinotype III, binary toxin positive, and an 18-bp deletion in tcdC) in a single group that excluded other APs. PFGE, AFLP, and MLVA grouped two, one, and two different non-AP-B isolates, respectively, with their AP-B isolates. All techniques appear to be capable of detecting outbreak strains, but only REA and MLVA showed sufficient discrimination to distinguish strains from different outbreaks.
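
    The abstract reports discrimination index scores without naming the formula. A common choice for typing schemes is Simpson's index of diversity in the form used by Hunter and Gaston, D = 1 - Σ n_j(n_j - 1) / (N(N - 1)), where n_j is the number of isolates assigned to type j and N is the total. The sketch below assumes that formula; the type labels are made up and are not data from the study.

```python
from collections import Counter
from typing import Hashable, Sequence


def discrimination_index(type_labels: Sequence[Hashable]) -> float:
    """Simpson's index of diversity (Hunter-Gaston form): the probability that
    two isolates drawn without replacement are assigned to different types."""
    n = len(type_labels)
    if n < 2:
        raise ValueError("at least two isolates are required")
    counts = Counter(type_labels).values()
    same_type_pairs = sum(c * (c - 1) for c in counts)
    return 1.0 - same_type_pairs / (n * (n - 1))


# Hypothetical assignments of four isolates by two typing methods.
fine_method = ["t1", "t2", "t3", "t4"]    # every isolate distinct
coarse_method = ["a", "a", "b", "c"]      # two isolates share a type

print(discrimination_index(fine_method))
print(discrimination_index(coarse_method))
```

    A value near 1 means the method separates almost every pair of isolates, and a value near 0 means it lumps them together, which is how a ranking such as the one reported here (0.964 down to 0.631) would be read.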

  10. Frequency of 3' VNTR Polymorphism in the Dopamine Transporter Gene SLC6A3 in Humans Predisposed to Antisocial Behavior.

    PubMed

    Cherepkova, E V; Aftanas, L I; Maksimov, N; Menshanov, P N

    2016-11-01

    Predisposition to antisocial behavior can be related to the presence of certain polymorphic variants of genes encoding dopaminergic system proteins. We studied the frequencies of allele variants and genotypes of the variable number tandem repeat polymorphism in the 3' untranslated region (3' VNTR) of the dopamine transporter gene SLC6A3 in Caucasian men who had committed socially dangerous violent and non-violent crimes. Alleles with 9 and 10 repeats were most frequent in both the control group and the group of men predisposed to antisocial behavior. At the same time, the 10/10 genotype was more frequently observed in the group of men prone to antisocial non-violent behavior. Hence, the presence of certain variants of the 3' VNTR polymorphism of the SLC6A3 gene in men is associated with predisposition to certain forms of antisocial behavior.

  11. Loss of function of 1-FEH IIb has more impact on post-harvest inulin degradation in Cichorium intybus than copy number variation of its close paralog 1-FEH IIa

    PubMed Central

    Dauchot, Nicolas; Raulier, Pierre; Maudoux, Olivier; Notté, Christine; Draye, Xavier; Van Cutsem, Pierre

    2015-01-01

    Key Message: The loss of mini-exon 2 in the 1-FEH IIb glycosyl-hydrolase results in a putative non-functional allele. This loss of function has a strong impact on the susceptibility to post-harvest inulin depolymerization. Significant variation of copy number was identified in its close paralog 1-FEH IIa, but no quantitative effect of copy number on carbohydrates-related phenotypes was detected. Inulin polyfructan is the second most abundant storage carbohydrate in flowering plants. After harvest, it is depolymerized by fructan exohydrolases (FEHs) as an adaptive response to end-season cold temperatures. In chicory, the intensity of this depolymerization differs between cultivars but also between individuals within a cultivar. Regarding this phenotypic variability, we recently identified statistically significant associations between inulin degradation and genetic polymorphisms located in three FEHs. We present here new results of a systematic analysis of copy number variation (CNV) in five key members of the chicory (Cichorium intybus) GH32 multigenic family, including three FEH genes and the two inulin biosynthesis genes: 1-SST and 1-FFT. qPCR analysis identified a significant variability of relative copy number only in the 1-FEH IIa gene. However, this CNV had no quantitative effect. Instead, cloning of the full length gDNA of a close paralogous sequence (1-FEH IIb) identified a 1028 bp deletion in lines less susceptible to post-harvest inulin depolymerization. This region comprises a 9 bp mini-exon containing one of the three conserved residues of the active site. This results in a putative non-functional 1-FEH IIb allele and an observed lower inulin depolymerization. Extensive genotyping confirmed that the loss of mini-exon 2 in 1-FEH IIb and the previously identified 47 bp duplication located in the 3′UTR of 1-FEH IIa belong to a single haplotype, both being statistically associated with reduced susceptibility to post-harvest inulin depolymerization. 
Emergence of these haplotypes is discussed. PMID:26157446

  12. Part mutual information for quantifying direct associations in networks.

    PubMed

    Zhao, Juan; Zhou, Yiwei; Zhang, Xiujun; Chen, Luonan

    2016-05-03

    Quantitatively identifying direct dependencies between variables is an important task in data analysis, in particular for reconstructing various types of networks and causal relations in science and engineering. One of the most widely used criteria is partial correlation, but it can only measure linearly direct association and miss nonlinear associations. However, based on conditional independence, conditional mutual information (CMI) is able to quantify nonlinearly direct relationships among variables from the observed data, superior to linear measures, but suffers from a serious problem of underestimation, in particular for those variables with tight associations in a network, which severely limits its applications. In this work, we propose a new concept, "partial independence," with a new measure, "part mutual information" (PMI), which not only can overcome the problem of CMI but also retains the quantification properties of both mutual information (MI) and CMI. Specifically, we first defined PMI to measure nonlinearly direct dependencies between variables and then derived its relations with MI and CMI. Finally, we used a number of simulated data as benchmark examples to numerically demonstrate PMI features and further real gene expression data from Escherichia coli and yeast to reconstruct gene regulatory networks, which all validated the advantages of PMI for accurately quantifying nonlinearly direct associations in networks.
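
    The underestimation problem attributed to CMI here can be seen in a toy case. The sketch below uses plug-in (frequency-count) estimators for discrete data; the estimators, the variable names, and the data are illustrative assumptions, not the authors' PMI method. When X and Y are both exact copies of Z, mutual information between X and Y is maximal, yet conditioning on Z leaves nothing to measure and the CMI is zero.

```python
import math
from collections import Counter


def entropy(samples):
    """Plug-in Shannon entropy (bits) of a sequence of hashable outcomes."""
    n = len(samples)
    return -sum((c / n) * math.log2(c / n) for c in Counter(samples).values())


def mutual_information(x, y):
    """I(X;Y) = H(X) + H(Y) - H(X,Y)."""
    return entropy(x) + entropy(y) - entropy(list(zip(x, y)))


def conditional_mutual_information(x, y, z):
    """I(X;Y|Z) = H(X,Z) + H(Y,Z) - H(X,Y,Z) - H(Z)."""
    return (entropy(list(zip(x, z))) + entropy(list(zip(y, z)))
            - entropy(list(zip(x, y, z))) - entropy(z))


# Hypothetical tightly coupled variables: X and Y both equal Z.
z = [0, 1, 0, 1, 1, 0, 1, 0]
x = list(z)
y = list(z)

mi = mutual_information(x, y)
cmi = conditional_mutual_information(x, y, z)
print(f"MI = {mi:.3f} bits, CMI = {cmi:.3f} bits")
```

    In this extreme case the CMI reports no association at all between X and Y, which is one way to picture why a measure that behaves better for tightly associated variables was sought.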

  13. The challenge for genetic epidemiologists: how to analyze large numbers of SNPs in relation to complex diseases.

    PubMed

    Heidema, A Geert; Boer, Jolanda M A; Nagelkerke, Nico; Mariman, Edwin C M; van der A, Daphne L; Feskens, Edith J M

    2006-04-21

    Genetic epidemiologists have taken the challenge to identify genetic polymorphisms involved in the development of diseases. Many have collected data on large numbers of genetic markers but are not familiar with available methods to assess their association with complex diseases. Statistical methods have been developed for analyzing the relation between large numbers of genetic and environmental predictors to disease or disease-related variables in genetic association studies. In this commentary we discuss logistic regression analysis, neural networks, including the parameter decreasing method (PDM) and genetic programming optimized neural networks (GPNN) and several non-parametric methods, which include the set association approach, combinatorial partitioning method (CPM), restricted partitioning method (RPM), multifactor dimensionality reduction (MDR) method and the random forests approach. The relative strengths and weaknesses of these methods are highlighted. Logistic regression and neural networks can handle only a limited number of predictor variables, depending on the number of observations in the dataset. Therefore, they are less useful than the non-parametric methods to approach association studies with large numbers of predictor variables. GPNN on the other hand may be a useful approach to select and model important predictors, but its performance to select the important effects in the presence of large numbers of predictors needs to be examined. Both the set association approach and random forests approach are able to handle a large number of predictors and are useful in reducing these predictors to a subset of predictors with an important contribution to disease. The combinatorial methods give more insight in combination patterns for sets of genetic and/or environmental predictor variables that may be related to the outcome variable. 
As the non-parametric methods have different strengths and weaknesses we conclude that to approach genetic association studies using the case-control design, the application of a combination of several methods, including the set association approach, MDR and the random forests approach, will likely be a useful strategy to find the important genes and interaction patterns involved in complex diseases.

  14. Family-based association study between monoamine oxidase A (MAOA) gene promoter VNTR polymorphism and Tourette's syndrome in Chinese Han population.

    PubMed

    Liu, Shiguo; Wang, Xueqin; Xu, Longqiang; Zheng, Lanlan; Ge, Yinlin; Ma, Xu

    2015-02-01

    To clarify the association of the monoamine oxidase A gene promoter variable number of tandem repeats polymorphism (MAOA-pVNTR) with susceptibility to Tourette's syndrome (TS) in the Chinese Han population, we examined the genetic contribution of MAOA-pVNTR in 141 TS patients together with their parents using a transmission disequilibrium test (TDT) design. No significant association was found between the MAOA gene promoter VNTR polymorphism and TS in this population (TDT = 1.515, df = 1, p > 0.05). The negative result may be mainly due to the small sample size, and it does not rule out a role for genes encoding serotonergic or monoaminergic structures in the etiology of TS.
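
    For readers unfamiliar with the reported statistic, the following sketch shows the standard biallelic TDT, which compares how often heterozygous parents transmit one allele (b) versus the other (c) using (b - c)^2 / (b + c), referred to a chi-square distribution with 1 degree of freedom. The transmission counts in the example are hypothetical, and the study's exact handling of the multi-allelic, X-linked MAOA marker is not described in the abstract. The last lines only convert the reported value of 1.515 into a tail probability.

```python
import math


def tdt_statistic(b: int, c: int) -> float:
    """McNemar-type TDT chi-square for a biallelic marker.
    b, c: counts of each allele transmitted by heterozygous parents."""
    if b + c == 0:
        raise ValueError("no informative (heterozygous) transmissions")
    return (b - c) ** 2 / (b + c)


def chi2_sf_df1(x: float) -> float:
    """Upper-tail probability of a chi-square variable with 1 df:
    P(X > x) = erfc(sqrt(x / 2))."""
    return math.erfc(math.sqrt(x / 2.0))


# Hypothetical transmission counts, for illustration only.
stat = tdt_statistic(30, 20)
print(f"TDT = {stat:.3f}, p = {chi2_sf_df1(stat):.3f}")

# Tail probability for the statistic reported in the abstract.
p_reported = chi2_sf_df1(1.515)
print(f"p for TDT = 1.515, df = 1: {p_reported:.3f}")
```

    The tail probability for 1.515 comes out at roughly 0.2, consistent with the abstract's statement that p > 0.05.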

  15. Detection of susceptibility genes as modifiers due to subgroup differences in complex disease.

    PubMed

    Bergen, Sarah E; Maher, Brion S; Fanous, Ayman H; Kendler, Kenneth S

    2010-08-01

    Complex diseases invariably involve multiple genes and often exhibit variable symptom profiles. The extent to which disease symptoms, course, and severity differ between affected individuals may result from underlying genetic heterogeneity. Genes with modifier effects may or may not also influence disease susceptibility. In this study, we have simulated data in which a subset of cases differ by some effect size (ES) on a quantitative trait and are also enriched for a risk allele. Power to detect this 'pseudo-modifier' gene in case-only and case-control designs was explored blind to case substructure. Simulations involved 1000 iterations and calculations for 80% power at P<0.01 while varying the risk allele frequency (RAF), sample size (SS), ES, odds ratio (OR), and proportions of the case subgroups. With realistic values for the RAF (0.20), SS (3000) and ES (1), an OR of 1.7 is necessary to detect a pseudo-modifier gene. Unequal numbers of subjects in the case groups result in little decrement in power until the group enriched for the risk allele is <30% or >70% of the total case population. In practice, greater numbers of subjects and selection of a quantitative trait with a large range will provide researchers with greater power to detect a pseudo-modifier gene. However, even under ideal conditions, studies involving alleles with low frequencies or low ORs are usually underpowered for detection of a modifier or susceptibility gene. This may explain some of the inconsistent association results for many candidate gene studies of complex diseases.

  16. Genes commonly deleted in childhood B-cell precursor acute lymphoblastic leukemia: association with cytogenetics and clinical features

    PubMed Central

    Schwab, Claire J.; Chilton, Lucy; Morrison, Heather; Jones, Lisa; Al-Shehhi, Halima; Erhorn, Amy; Russell, Lisa J.; Moorman, Anthony V.; Harrison, Christine J.

    2013-01-01

    In childhood B-cell precursor acute lymphoblastic leukemia, cytogenetics is important in diagnosis and as an indicator of response to therapy, thus playing a key role in risk stratification of patients for treatment. Little is known of the relationship between different cytogenetic subtypes in B-cell precursor acute lymphoblastic leukemia and the recently reported copy number abnormalities affecting significant leukemia associated genes. In a consecutive series of 1427 childhood B-cell precursor acute lymphoblastic leukemia patients, we have determined the incidence and type of copy number abnormalities using multiplex ligation-dependent probe amplification. We have shown strong links between certain deletions and cytogenetic subtypes, including the novel association between RB1 deletions and intrachromosomal amplification of chromosome 21. In this study, we characterized the different copy number abnormalities and show heterogeneity of PAX5 and IKZF1 deletions and the recurrent nature of RB1 deletions. Whole gene losses are often indicative of larger deletions, visible by conventional cytogenetics. An increased number of copy number abnormalities is associated with NCI high risk, specifically deletions of IKZF1 and CDKN2A/B, which occur more frequently among these patients. IKZF1 deletions and rearrangements of CRLF2 among patients with undefined karyotypes may point to the poor risk BCR-ABL1-like group. In conclusion, this study has demonstrated in a large representative cohort of children with B-cell precursor acute lymphoblastic leukemia that the pattern of copy number abnormalities is highly variable according to the primary genetic abnormality. PMID:23508010

  17. Gene-Gene-Environment Interactions of Serotonin Transporter, Monoamine Oxidase A and Childhood Maltreatment Predict Aggressive Behavior in Chinese Adolescents

    PubMed Central

    Zhang, Yun; Ming, Qing-sen; Yi, Jin-yao; Wang, Xiang; Chai, Qiao-lian; Yao, Shu-qiao

    2017-01-01

    Gene-environment interactions that moderate aggressive behavior have been identified independently in the serotonin transporter (5-HTT) gene and monoamine oxidase A gene (MAOA). The aim of the present study was to investigate epistatic interactions between the MAOA variable number tandem repeat (MAOA-VNTR), the 5-HTT-linked polymorphic region (5-HTTLPR) and child abuse, and their effects on aggressive tendencies in a group of otherwise healthy adolescents. A group of 546 Chinese male adolescents completed the Child Trauma Questionnaire and Youth self-report of the Child Behavior Checklist. Buccal cells were collected for DNA analysis. The effects of childhood abuse, MAOA-VNTR, 5-HTTLPR genotypes and their interactive gene-gene-environmental effects on aggressive behavior were analyzed using a linear regression model. The effect of child maltreatment was significant, and a three-way interaction among MAOA-VNTR, 5-HTTLPR and sexual abuse (SA) relating to aggressive behaviors was identified. Chinese male adolescents with high expression of the MAOA-VNTR allele and 5-HTTLPR “SS” genotype exhibited the highest aggression tendencies with an increase in SA during childhood. The findings reported support aggression being a complex behavior involving the synergistic effects of gene-gene-environment interactions. PMID:28203149

  18. Isolation of NotI sites from chromosome 22q11

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ten Hoeve, J.; Groffen, J.; Heisterkamp, N.

    1993-12-01

    Chromosome 22q11 contains a large number of interesting loci, including genes associated with cancer and developmental defects. The region is also the site of the lambda immunoglobulin variable and constant regions and the BCR, γ-glutamyl transpeptidase, and GGT-like activity multigene families. Because of the complexities associated with mapping highly related gene families, the authors have examined the utility of mapping large areas of DNA using a defined approach. A total of 21 complete NotI sites from band q11 were cloned and ordered into six noncontiguous clusters of sites using a combination of somatic cell hybrid panels, NotI jumping and linking libraries, and fluorescence in situ hybridization. The largest cluster spanned an estimated 2 Mb of NotI fragments, the smallest 115 kb. Approximately 3.5 Mb of band q11 could be examined for rearrangements in NotI restriction enzyme fragments. A number of conserved sequences, two genes, and a minimum of two families of related sequences were identified adjacent to NotI sites. 51 refs., 5 figs., 4 tabs.

  19. FISH and AgNOR mapping of the 45S and 5S rRNA genes in wild and cultivated species of Capsicum (Solanaceae).

    PubMed

    Scaldaferro, Marisel A; da Cruz, M Victoria Romero; Cecchini, Nicolás M; Moscone, Eduardo A

    2016-02-01

    Chromosome number and position of rDNA were studied in 12 wild and cultivated species of the genus Capsicum with chromosome numbers x = 12 and x = 13 (22 samples). For the first time in these species, the 5S and 45S rRNA loci were localized and physically mapped using two-color fluorescence in situ hybridization and AgNOR banding. We focused on comparing the results obtained with both methods, with the aim of accurately identifying the functional rRNA genes. The analyses were based on a previous work that reported that the 18S-5.8S-25S loci mostly coincide with GC-rich heterochromatic regions and likely have given rise to satellite DNAs, which are not active genes. These data show the variability of rDNA within karyotypes of the genus Capsicum, providing anchor points for (comparative) genetic maps. In addition, the obtained information might be useful for studies on evolution of repetitive DNA.

  20. Animal Mitochondrial DNA as We Do Not Know It: mt-Genome Organization and Evolution in Nonbilaterian Lineages

    PubMed Central

    Pett, Walker

    2016-01-01

    Animal mitochondrial DNA (mtDNA) is commonly described as a small, circular molecule that is conserved in size, gene content, and organization. Data collected in the last decade have challenged this view by revealing considerable diversity in animal mitochondrial genome organization. Much of this diversity has been found in nonbilaterian animals (phyla Cnidaria, Ctenophora, Placozoa, and Porifera), which, from a phylogenetic perspective, form the main branches of the animal tree along with Bilateria. Within these groups, mt-genomes are characterized by varying numbers of both linear and circular chromosomes, extra genes (e.g. atp9, polB, tatC), large variation in the number of encoded mitochondrial transfer RNAs (tRNAs) (0–25), at least seven different genetic codes, presence/absence of introns, tRNA and mRNA editing, fragmented ribosomal RNA genes, translational frameshifting, highly variable substitution rates, and a large range of genome sizes. This newly discovered diversity allows a better understanding of the evolutionary plasticity and conservation of animal mtDNA and provides insights into the molecular and evolutionary mechanisms shaping mitochondrial genomes. PMID:27557826

  1. Predicting Viral Infection From High-Dimensional Biomarker Trajectories

    PubMed Central

    Chen, Minhua; Zaas, Aimee; Woods, Christopher; Ginsburg, Geoffrey S.; Lucas, Joseph; Dunson, David; Carin, Lawrence

    2013-01-01

    There is often interest in predicting an individual’s latent health status based on high-dimensional biomarkers that vary over time. Motivated by time-course gene expression array data that we have collected in two influenza challenge studies performed with healthy human volunteers, we develop a novel time-aligned Bayesian dynamic factor analysis methodology. The time course trajectories in the gene expressions are related to a relatively low-dimensional vector of latent factors, which vary dynamically starting at the latent initiation time of infection. Using a nonparametric cure rate model for the latent initiation times, we allow selection of the genes in the viral response pathway, variability among individuals in infection times, and a subset of individuals who are not infected. As we demonstrate using held-out data, this statistical framework allows accurate predictions of infected individuals in advance of the development of clinical symptoms, without labeled data and even when the number of biomarkers vastly exceeds the number of individuals under study. Biological interpretation of several of the inferred pathways (factors) is provided. PMID:23704802

  2. Heteroduplex analysis can increase the informativeness of PCR-amplified VNTR markers: Application using a marker tightly linked to the COL2A1 gene

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wilkin, D.J.; Cohn, D.H.; Koprivnikar, K.E.

    1993-02-01

    Variable number of tandem repeat (VNTR) polymorphisms provide a high degree of informativeness in linkage studies. Whether performed by standard methods or by polymerase chain reaction (PCR), analysis of these markers involves assessment of the length of each allele. VNTR alleles usually differ in the number of tandem repeats. During PCR amplification of a VNTR closely linked to the type II collagen gene (COL2A1), we identified allelic microheterogeneity through the analysis of unique heteroduplexes between amplified strands of the two alleles. In one large pedigree, heteroduplex analysis identified only three distinct alleles. The identification of these heteroduplexes allowed the determination of the COL2A1 inheritance pattern in the family, which otherwise would have been noninformative. 26 refs., 3 figs.
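
    As a rough illustration of the length-based allele scoring this abstract refers to, the Python sketch below counts the longest run of a repeat unit in a sequence. The flanking sequences, the repeat unit, and the function name are hypothetical and are not taken from the cited study. The third toy allele has the same length as the first but an internal base change, which is the kind of microheterogeneity that length measurement alone cannot resolve.

```python
def count_tandem_repeats(seq: str, unit: str) -> int:
    """Largest number of back-to-back copies of `unit` found anywhere in `seq`."""
    if not unit:
        raise ValueError("repeat unit must be non-empty")
    best = 0
    for start in range(len(seq)):
        run, pos = 0, start
        # Extend the run while another full copy of the unit follows.
        while seq.startswith(unit, pos):
            run += 1
            pos += len(unit)
        best = max(best, run)
    return best


# Hypothetical flanks and repeat unit, chosen only for illustration.
FLANK5, FLANK3, UNIT = "GGATCC", "TTAAGC", "CAGT"

allele_a = FLANK5 + UNIT * 5 + FLANK3                      # 5 repeats
allele_b = FLANK5 + UNIT * 7 + FLANK3                      # 7 repeats
allele_c = FLANK5 + UNIT * 2 + "CAGA" + UNIT * 2 + FLANK3  # same length as allele_a

for name, allele in (("a", allele_a), ("b", allele_b), ("c", allele_c)):
    print(name, len(allele), count_tandem_repeats(allele, UNIT))
```

    Alleles a and c would co-migrate in a purely length-based assay even though their sequences differ.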

  3. Expression and phylogenetic analyses reveal paralogous lineages of putatively classical and non-classical MHC-I genes in three sparrow species (Passer).

    PubMed

    Drews, Anna; Strandh, Maria; Råberg, Lars; Westerdahl, Helena

    2017-06-26

    The Major Histocompatibility Complex (MHC) plays a central role in immunity and has been given considerable attention by evolutionary ecologists due to its associations with fitness-related traits. Songbirds have unusually high numbers of MHC class I (MHC-I) genes, but it is not known whether all are expressed and equally important for immune function. Classical MHC-I genes are highly expressed, polymorphic and present peptides to T-cells whereas non-classical MHC-I genes have lower expression, are more monomorphic and do not present peptides to T-cells. To get a better understanding of the highly duplicated MHC genes in songbirds, we studied gene expression in a phylogenetic framework in three species of sparrows (house sparrow, tree sparrow and Spanish sparrow), using high-throughput sequencing. We hypothesize that sparrows could have classical and non-classical genes, as previously indicated though never tested using gene expression. The phylogenetic analyses reveal two distinct types of MHC-I alleles among the three sparrow species, one with high and one with low level of polymorphism, thus resembling classical and non-classical genes, respectively. All individuals had both types of alleles, but there was copy number variation both within and among the sparrow species. However, the number of highly polymorphic alleles that were expressed did not vary between species, suggesting that the structural genomic variation is counterbalanced by conserved gene expression. Overall, 50% of the MHC-I alleles were expressed in sparrows. Expression of the highly polymorphic alleles was very variable, whereas the alleles with low polymorphism had uniformly low expression. Interestingly, within an individual only one or two alleles from the polymorphic genes were highly expressed, indicating that only a single copy of these is highly expressed. 
Taken together, the phylogenetic reconstruction and the analyses of expression suggest that sparrows have both classical and non-classical MHC-I genes, and that the evolutionary origin of these genes predates the split of the three investigated sparrow species 7 million years ago. Because only the classical MHC-I genes are involved in antigen presentation, the function of different MHC-I genes should be considered in future ecological and evolutionary studies of MHC-I in sparrows and other songbirds.

  4. Seasonal Changes in Bacterial and Archaeal Gene Expression Patterns across Salinity Gradients in the Columbia River Coastal Margin

    PubMed Central

    Smith, Maria W.; Herfort, Lydie; Tyrol, Kaitlin; Suciu, Dominic; Campbell, Victoria; Crump, Byron C.; Peterson, Tawnya D.; Zuber, Peter; Baptista, Antonio M.; Simon, Holly M.

    2010-01-01

    Through their metabolic activities, microbial populations mediate the impact of high gradient regions on ecological function and productivity of the highly dynamic Columbia River coastal margin (CRCM). A 2226-probe oligonucleotide DNA microarray was developed to investigate expression patterns for microbial genes involved in nitrogen and carbon metabolism in the CRCM. Initial experiments with the environmental microarrays were directed toward validation of the platform and yielded high reproducibility in multiple tests. Bioinformatic and experimental validation also indicated that >85% of the microarray probes were specific for their corresponding target genes and for a few homologs within the same microbial family. The validated probe set was used to query gene expression responses by microbial assemblages to environmental variability. Sixty-four samples from the river, estuary, plume, and adjacent ocean were collected in different seasons and analyzed to correlate the measured variability in chemical, physical and biological water parameters to differences in global gene expression profiles. The method produced robust seasonal profiles corresponding to pre-freshet spring (April) and late summer (August). Overall relative gene expression was high in both seasons and was consistent with high microbial abundance measured by total RNA, heterotrophic bacterial production, and chlorophyll a. Both seasonal patterns involved large numbers of genes that were highly expressed relative to background, yet each produced very different gene expression profiles. April patterns revealed high differential gene expression in the coastal margin samples (estuary, plume and adjacent ocean) relative to freshwater, while little differential gene expression was observed along the river-to-ocean transition in August. Microbial gene expression profiles appeared to relate, in part, to seasonal differences in nutrient availability and potential resource competition. 
Furthermore, our results suggest that highly active particle-attached microbiota in the Columbia River water column may perform dissimilatory nitrate reduction (both denitrification and DNRA) within anoxic particle microniches. PMID:20967204

  5. A Comparative In Silico Study of the Antioxidant Defense Gene Repertoire of Distinct Lifestyle Trypanosomatid Species

    PubMed Central

    Beltrame-Botelho, Ingrid Thaís; Talavera-López, Carlos; Andersson, Björn; Grisard, Edmundo Carlos; Stoco, Patricia Hermes

    2016-01-01

    Kinetoplastids are an ancestral group of protists that contains free-living species and parasites with distinct mechanisms in response to stress. Here, we compared genes involved in antioxidant defense (AD), proposing an evolution model among trypanosomatids. All genes were identified in Bodo saltans, suggesting that AD mechanisms have evolved prior to adaptation for parasitic lifestyles. While most of the monoxenous and dixenous parasites revealed minor differences from B. saltans, the endosymbiont-bearing species have an increased number of genes. The absence of these genes was mainly observed in the extracellular parasites of the genera Phytomonas and Trypanosoma. In trypanosomes, a distinction was observed between stercorarian and salivarian parasites, except for Trypanosoma rangeli. Our analyses indicate that the variability of AD among trypanosomatids at the genomic level is not solely due to the geographical isolation, being mainly related to specific adaptations of their distinct biological cycles within insect vectors and to a parasitism of a wide range of hosts. PMID:27840574

  6. Knowledge Discovery in Biological Databases for Revealing Candidate Genes Linked to Complex Phenotypes.

    PubMed

    Hassani-Pak, Keywan; Rawlings, Christopher

    2017-06-13

    Genetics and "omics" studies designed to uncover genotype to phenotype relationships often identify large numbers of potential candidate genes, among which the causal genes are hidden. Scientists generally lack the time and technical expertise to review all relevant information available from the literature, from key model species and from a potentially wide range of related biological databases in a variety of data formats with variable quality and coverage. Computational tools are needed for the integration and evaluation of heterogeneous information in order to prioritise candidate genes and components of interaction networks that, if perturbed through potential interventions, have a positive impact on the biological outcome in the whole organism without producing negative side effects. Here we review several bioinformatics tools and databases that play an important role in biological knowledge discovery and candidate gene prioritization. We conclude with several key challenges that need to be addressed in order to facilitate biological knowledge discovery in the future.

  7. Intergenic Variable-Number Tandem-Repeat Polymorphism Upstream of rocA Alters Toxin Production and Enhances Virulence in Streptococcus pyogenes.

    PubMed

    Zhu, Luchang; Olsen, Randall J; Horstmann, Nicola; Shelburne, Samuel A; Fan, Jia; Hu, Ye; Musser, James M

    2016-07-01

    Variable-number tandem-repeat (VNTR) polymorphisms are ubiquitous in bacteria. However, only a small fraction of them has been functionally studied. Here, we report an intergenic VNTR polymorphism that confers an altered level of toxin production and increased virulence in Streptococcus pyogenes. The nature of the polymorphism is a one-unit deletion in a three-tandem-repeat locus upstream of the rocA gene encoding a sensor kinase. S. pyogenes strains with this type of polymorphism cause human infection and produce significantly larger amounts of the secreted cytotoxins S. pyogenes NADase (SPN) and streptolysin O (SLO). Using isogenic mutant strains, we demonstrate that deleting one or more units of the tandem repeats abolished RocA production, reduced CovR phosphorylation, derepressed multiple CovR-regulated virulence factors (such as SPN and SLO), and increased virulence in a mouse model of necrotizing fasciitis. The phenotypic effect of the VNTR polymorphism was nearly the same as that of inactivating the rocA gene. In summary, we identified and characterized an intergenic VNTR polymorphism in S. pyogenes that affects toxin production and virulence. These new findings enhance understanding of rocA biology and the function of VNTR polymorphisms in S. pyogenes. Copyright © 2016, American Society for Microbiology. All Rights Reserved.

  8. Genic Variability and Strategies of Adaptation in Animals

    PubMed Central

    Selander, Robert K.; Kaufman, Donald W.

    1973-01-01

    Levels of genic heterozygosity, as measured by surveys of allozymic variation, are much lower in populations of large, mobile animals (most vertebrates) than in those of small, relatively immobile animals (most invertebrates). This difference is not consistent with theories relating variability to population size (species number) or dispersal ability (gene flow), but it is predicted by Levins' theory of adaptive strategies in relation to environmental uncertainty (“grain”). Mobility and degree of homeostatic control apparently are important factors influencing levels of genic heterozygosity in natural populations. The results argue indirectly that at least a major proportion of allozymic variation is maintained by natural selection. PMID:4515944

  9. A novel DNMT1 mutation associated with early onset hereditary sensory and autonomic neuropathy, cataplexy, cerebellar atrophy, scleroderma, endocrinopathy, and common variable immune deficiency.

    PubMed

    Fox, Robin; Ealing, John; Murphy, Helen; Gow, David P; Gosal, David

    2016-09-01

    DNA methyltransferase 1 (DNMT1) is an enzyme which has a role in methylation of DNA, gene regulation, and chromatin stability. Missense mutations in the DNMT1 gene have been previously associated with two neurological syndromes: hereditary sensory and autonomic neuropathy type 1 with dementia and deafness (HSAN1E) and autosomal dominant cerebellar ataxia, deafness, and narcolepsy (ADCA-DN). We report a case showing overlap of both of these syndromes plus associated clinical features of common variable immune deficiency, scleroderma, and endocrinopathy that could also be mutation associated. Our patient was found to be heterozygous for a previously unreported frameshift mutation, c.1635_1637delCAA p.(Asn545del) in the DNMT1 gene exon 20. This case displays both the first frameshift mutation described in the literature which is associated with a phenotype with a high degree of overlap between HSAN1E and ADCA-DN and early age of onset (c. 8 years). Our case is also of interest as the patient displays a number of new non-neurological features, which could also be DNMT1 mutation related. © 2016 Peripheral Nerve Society.

  10. Molecular identification and typing of Burkholderia pseudomallei and Burkholderia mallei: when is enough enough?

    PubMed

    Antonov, Valery A; Tkachenko, Galina A; Altukhova, Viktoriya V; Savchenko, Sergey S; Zinchenko, Olga V; Viktorov, Dmitry V; Zamaraev, Valery S; Ilyukhin, Vladimir I; Alekseev, Vladimir V

    2008-12-01

    Burkholderia mallei and B. pseudomallei are highly pathogenic microorganisms for both humans and animals. Moreover, they are regarded as potential agents of bioterrorism. Thus, rapid and unequivocal detection and identification of these dangerous pathogens is critical. In the present study, we describe the use of an optimized protocol for the early diagnosis of experimental glanders and melioidosis and for the rapid differentiation and typing of Burkholderia strains. This experience with PCR-based identification methods indicates that single PCR targets (23S and 16S rRNA genes, 16S-23S intergenic region, fliC and type III secretion gene cluster) should be used with caution for identification of B. mallei and B. pseudomallei, and need to be used alongside molecular methods such as gene sequencing. Several molecular typing procedures have been used to identify genetically related B. pseudomallei and B. mallei isolates, including ribotyping, pulsed-field gel electrophoresis and multilocus sequence typing. However, these methods are time consuming and technically challenging for many laboratories. RAPD, variable amplicon typing scheme, Rep-PCR, BOX-PCR and multiple-locus variable-number tandem repeat analysis have been recommended by us for the rapid differentiation of B. mallei and B. pseudomallei strains.

  11. Alteration of gene expression by alcohol exposure at early neurulation.

    PubMed

    Zhou, Feng C; Zhao, Qianqian; Liu, Yunlong; Goodlett, Charles R; Liang, Tiebing; McClintick, Jeanette N; Edenberg, Howard J; Li, Lang

    2011-02-21

    We have previously demonstrated that alcohol exposure at early neurulation induces growth retardation, neural tube abnormalities, and alteration of DNA methylation. To explore the global gene expression changes which may underlie these developmental defects, microarray analyses were performed in a whole embryo mouse culture model that allows control over alcohol and embryonic variables. Alcohol caused teratogenesis in brain, heart, forelimb, and optic vesicle; a subset of the embryos also showed cranial neural tube defects. In microarray analysis (accession number GSM9545), adopting hypothesis-driven Gene Set Enrichment Analysis (GSEA) informatics and intersection analysis of two independent experiments, we found that there was a collective reduction in expression of neural specification genes (neurogenin, Sox5, Bhlhe22), neural growth factor genes [Igf1, Efemp1, Klf10 (Tieg), and Edil3], and alteration of genes involved in cell growth, apoptosis, histone variants, eye and heart development. There was also a reduction of retinol binding protein 1 (Rbp1), and de novo expression of aldehyde dehydrogenase 1B1 (Aldh1B1). Remarkably, four key hematopoiesis genes (glycophorin A, adducin 2, beta-2 microglobulin, and ceruloplasmin) were absent after alcohol treatment, and histone variant genes were reduced. The down-regulation of the neurospecification and the neurotrophic genes was further confirmed by quantitative RT-PCR. Furthermore, the gene expression profile demonstrated distinct subgroups which corresponded with two distinct alcohol-related neural tube phenotypes: an open (ALC-NTO) and a closed neural tube (ALC-NTC). Further, the epidermal growth factor signaling pathway and histone variants were specifically altered in ALC-NTO, and a greater number of neurotrophic/growth factor genes were down-regulated in the ALC-NTO than in the ALC-NTC embryos. 
This study revealed a set of genes vulnerable to alcohol exposure and genes that were associated with neural tube defects during early neurulation.

  12. Recurrent 15q11.2 BP1-BP2 microdeletions and microduplications in the etiology of neurodevelopmental disorders.

    PubMed

    Picinelli, Chiara; Lintas, Carla; Piras, Ignazio Stefano; Gabriele, Stefano; Sacco, Roberto; Brogna, Claudia; Persico, Antonio Maria

    2016-12-01

    Rare and common CNVs can contribute to the etiology of neurodevelopmental disorders. One of the recurrent genomic aberrations associated with these phenotypes and proposed as a susceptibility locus is the 15q11.2 BP1-BP2 CNV encompassing TUBGCP5, CYFIP1, NIPA2, and NIPA1. Characterizing by array-CGH a cohort of 243 families with various neurodevelopmental disorders, we identified five patients carrying the 15q11.2 duplication and one carrying the deletion. All CNVs were confirmed by qPCR and were inherited, except for one duplication where parents were not available. The phenotypic spectrum of CNV carriers was broad but mainly neurodevelopmental, in line with all four genes being implicated in axonal growth and neural connectivity. Phenotypically normal and mildly affected carriers complicate the interpretation of this aberration. This variability may be due to reduced penetrance or altered gene dosage on a particular genetic background. We evaluated the expression levels of the four genes in peripheral blood RNA and found the expected reduction in the deleted case, while duplicated carriers displayed high interindividual variability. These data suggest that differential expression of these genes could partially account for differences in clinical phenotypes, especially among duplication carriers. Furthermore, urinary Mg2+ levels appear negatively correlated with NIPA2 gene copy number, suggesting they could potentially represent a useful biomarker, whose reliability will need replication in larger samples. © 2016 Wiley Periodicals, Inc.

  13. Exploring internal features of 16S rRNA gene for identification of clinically relevant species of the genus Streptococcus

    PubMed Central

    2011-01-01

    Background Streptococcus is an economically important genus as a number of species belonging to this genus are human and animal pathogens. The genus has been divided into different groups based on 16S rRNA gene sequence similarity. The variability observed among the members of these groups is low and it is difficult to distinguish them. The present study was taken up to explore the 16S rRNA gene sequence to develop methods that can be used for preliminary identification and can supplement the existing methods for identification of clinically relevant isolates of the genus Streptococcus. Methods 16S rRNA gene sequences belonging to the isolates of S. dysgalactiae, S. equi, S. pyogenes, S. agalactiae, S. bovis, S. gallolyticus, S. mutans, S. sobrinus, S. mitis, S. pneumoniae, S. thermophilus and S. anginosus were analyzed with the purpose to define genetic variability within each species to generate a phylogenetic framework, to identify species-specific signatures and in-silico restriction enzyme analysis. Results The framework-based analysis was used to segregate Streptococcus spp. previously identified up to the genus level. This segregation was validated using species-specific signatures and in-silico restriction enzyme analysis. 43 uncharacterized Streptococcus spp. could be identified using this approach. Conclusions The markers generated by exploring 16S rRNA gene sequences provide a useful tool that can be further used for identification of different species of the genus Streptococcus. PMID:21702978
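
    The in-silico restriction enzyme analysis mentioned in this abstract can be pictured with a minimal Python sketch. The abstract does not name the enzymes used, so EcoRI (recognition site GAATTC, cutting after the first G) is chosen here purely as a familiar example. The two toy sequences and the function name are hypothetical and do not represent real 16S rRNA genes.

```python
def digest(seq: str, site: str, cut_offset: int) -> list:
    """Fragment lengths after cutting a linear sequence at every `site`.

    `cut_offset` is the number of bases of the recognition site that stay
    on the upstream fragment (1 for EcoRI, which cuts G^AATTC).
    """
    seq, site = seq.upper(), site.upper()
    cuts = []
    pos = seq.find(site)
    while pos != -1:
        cuts.append(pos + cut_offset)
        pos = seq.find(site, pos + 1)
    bounds = [0] + cuts + [len(seq)]
    return [end - start for start, end in zip(bounds, bounds[1:])]


# Toy sequences (assumptions): they differ by one base inside the site.
isolate_a = "ATGC" * 5 + "GAATTC" + "ATGC" * 10
isolate_b = "ATGC" * 5 + "GAGTTC" + "ATGC" * 10

print(digest(isolate_a, "GAATTC", 1))  # site present: two fragments
print(digest(isolate_b, "GAATTC", 1))  # site lost: one uncut fragment
```

    Differing fragment patterns of this kind are what would let a species-specific signature be read off a simulated digest.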

  14. Two Different Secondary Metabolism Gene Clusters Occupied the Same Ancestral Locus in Fungal Dermatophytes of the Arthrodermataceae

    PubMed Central

    Zhang, Han; Rokas, Antonis; Slot, Jason C.

    2012-01-01

    Background Dermatophyte fungi of the family Arthrodermataceae (Eurotiomycetes) colonize keratinized tissue, such as skin, frequently causing superficial mycoses in humans and other mammals, reptiles, and birds. Competition with native microflora likely underlies the propensity of these dermatophytes to produce a diversity of antibiotics and compounds for scavenging iron, which is extremely scarce, as well as the presence of an unusually large number of putative secondary metabolism gene clusters, most of which contain non-ribosomal peptide synthetases (NRPS), in their genomes. To better understand the historical origins and diversification of NRPS-containing gene clusters we examined the evolution of a variable locus (VL) that exists in one of three alternative conformations among the genomes of seven dermatophyte species. Results The first conformation of the VL (termed VLA) contains only 539 base pairs of sequence and lacks protein-coding genes, whereas the other two conformations (termed VLB and VLC) span 36 Kb and 27 Kb and contain 12 and 10 genes, respectively. Interestingly, both VLB and VLC appear to contain distinct secondary metabolism gene clusters; VLB contains a NRPS gene as well as four porphyrin metabolism genes never found to be physically linked in the genomes of 128 other fungal species, whereas VLC also contains a NRPS gene as well as several others typically found associated with secondary metabolism gene clusters. Phylogenetic evidence suggests that the VL locus was present in the ancestor of all seven species achieving its present distribution through subsequent differential losses or retentions of specific conformations. Conclusions We propose that the existence of variable loci, similar to the one we studied, in fungal genomes could potentially explain the dramatic differences in secondary metabolic diversity between closely related species of filamentous fungi, and contribute to host adaptation and the generation of metabolic diversity. 
PMID:22860027

  15. Gene selection heuristic algorithm for nutrigenomics studies.

    PubMed

    Valour, D; Hue, I; Grimard, B; Valour, B

    2013-07-15

    Large datasets from -omics studies need to be deeply investigated. The aim of this paper is to provide a new method (LEM method) for the search of transcriptome and metabolome connections. The heuristic algorithm described here extends the classical canonical correlation analysis (CCA) to a high number of variables (without regularization) and combines well-conditioning and fast computing in "R." Reduced CCA models are summarized in PageRank matrices, the product of which gives a stochastic matrix that summarizes the self-avoiding walk covered by the algorithm. Then, a homogeneous Markov process applied to this stochastic matrix drives the probabilities of interconnection between genes to convergence, providing a selection of disjoint subsets of genes. This is an alternative to regularized generalized CCA for the determination of blocks within the structure matrix. Each gene subset is thus linked to the whole metabolic or clinical dataset that represents the biological phenotype of interest. Moreover, this selection process meets the needs of biologists, who often require small sets of genes for further validation or extended phenotyping. The algorithm is shown to work efficiently on three published datasets, resulting in meaningfully broadened gene networks.
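
    The convergence step in this abstract can be pictured with a small power-iteration sketch in Python. This is not the LEM implementation, which the abstract says is written in R. The 3x3 matrix and the function name are invented for illustration. Only the generic idea is shown: a homogeneous Markov chain is iterated on a row-stochastic matrix until the probabilities stop changing.

```python
def stationary_distribution(P, tol=1e-12, max_iter=10_000):
    """Iterate pi <- pi * P from a uniform start until pi stops changing."""
    n = len(P)
    for row in P:
        if len(row) != n or min(row) < 0 or abs(sum(row) - 1.0) > 1e-9:
            raise ValueError("P must be a square row-stochastic matrix")
    pi = [1.0 / n] * n
    for _ in range(max_iter):
        nxt = [sum(pi[i] * P[i][j] for i in range(n)) for j in range(n)]
        if max(abs(a - b) for a, b in zip(nxt, pi)) < tol:
            return nxt
        pi = nxt
    raise RuntimeError("power iteration did not converge")


# Hypothetical transition matrix between three "genes" (assumption).
P = [
    [0.50, 0.50, 0.00],
    [0.25, 0.50, 0.25],
    [0.00, 0.50, 0.50],
]

pi = stationary_distribution(P)
print([round(p, 6) for p in pi])
```

    For this toy matrix the limiting probabilities are 0.25, 0.5 and 0.25. How the published method turns such limits into disjoint gene subsets is not described in the abstract and is not modelled here.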

  16. Expression of activation-induced cytidine deaminase gene in B lymphocytes of patients with common variable immunodeficiency.

    PubMed

    Abolhassani, Hassan; Farrokhi, Amir Salek; Pourhamdi, Shabnam; Mohammadinejad, Payam; Sadeghi, Bamdad; Moazzeni, Seyed-Mohammad; Aghamohammadi, Asghar

    2013-08-01

    Common variable immunodeficiency (CVID) is a heterogeneous disorder characterized by reduced serum levels of IgG, IgA or IgM and recurrent bacterial infections. Class switch recombination (CSR), a critical process in immunoglobulin production, is defective in a group of CVID patients. Activation-induced cytidine deaminase (AID) protein is an important molecule involved in the CSR process. The aim of this study was to investigate AID gene mRNA production in a group of CVID patients in order to assess the possible role of this molecule in the disorder. Peripheral blood mononuclear cells (PBMC) of 29 CVID patients and 21 healthy controls were isolated and stimulated by CD40L and IL-4 to induce AID gene expression. After 5 days, AID gene mRNA production was investigated by real-time polymerase chain reaction. The AID gene was expressed in all of the studied patients. However, the mean density of extracted AID mRNA was higher in CVID patients (230.95±103.04 ng/ml) than in controls (210.00±44.72 ng/ml; P=0.5). CVID cases with a lower level of AID had decreased total IgE (P=0.04) and stimulated IgE production (P=0.02), while cases with an increased level of AID presented higher levels of IgA (P=0.04), higher numbers of B cells (P=0.02) and more autoimmune disease (P=0.02). Different levels of AID gene expression may have important roles in dysregulation of the immune system and in the final clinical presentation of CVID patients. Therefore, investigating the expression of the AID gene can help in classifying CVID patients.

  17. Nucleotide variability of protamine genes influencing bull sperm motility variables.

    PubMed

    H M, Yathish; Kumar, Subodh; Chaudhary, Rajni; Mishra, Chinmoy; A, Sivakumar; Kumar, Amit; Chauhan, Anuj; Ghosh, S K; Mitra, Abhijit

    2018-06-01

    Protamines (PRMs), important proteins of chromatin condensation in spermiogenesis, are promising candidate genes to explore markers of sperm motility. The coding and in-silico predicted promoter regions of these genes were investigated in 102 crossbred and 32 purebred cattle. Also, mRNA quantification was done to explore its possibility as diagnostic tool of infertility. The PCR-SSCP analysis indicated there were two band patterns only in fragment I of the PRM1 and fragment II of the PRM2 gene. The sequence analysis revealed A152G and G179A transitions in the PRM1 gene. Similarly, G35A, A49G and A64G transitions were identified in the PRM2 gene which resulted in altered amino acid sequences from arginine (R) to glutamine (Q), from arginine (R) to glycine (G) and from arginine (R) to glycine (G), respectively. This caused the reduction in molecular weight of PRM2 from 2157.66 to 1931.33 Da due to reduction in the number of basic amino acids. These altered properties of the PRM2 protein led to the reduction in Mass Motility (MM: P < 0.01), Initial Progressive Motility (IPM; P < 0.05) and Post Thaw Motility (PTM; P < 0.05) in crossbred bulls. The least squares analysis of variance indicated there was an effect of PRM2 haplotypes on MM (P = 0.0069), IPM (P = 0.0306) and PTM (P = 0.0500) in crossbred cattle and on PTM (P = 0.0408) in the overall cattle population. Based on the RT-qPCR analysis, however, there was not any significant variation of PRM1 and PRM2 gene expression among sperm of Vrindavani bulls with relatively lesser and greater sperm motility. Copyright © 2018 Elsevier B.V. All rights reserved.

  18. Primary cutaneous B-cell lymphoma is associated with somatically hypermutated immunoglobulin variable genes and frequent use of VH1-69 and VH4-59 segments.

    PubMed

    Perez, M; Pacchiarotti, A; Frontani, M; Pescarmona, E; Caprini, E; Lombardo, G A; Russo, G; Faraggiana, T

    2010-03-01

    Accurate assessment of the somatic mutational status of clonal immunoglobulin variable region (IgV) genes is relevant in elucidating tumour cell origin in B-cell lymphoma; virgin B cells bear unmutated IgV genes, while germinal centre and postfollicular B cells carry mutated IgV genes. Furthermore, biases in the IgV repertoire and distribution pattern of somatic mutations indicate a possible antigen role in the pathogenesis of B-cell malignancies. This work investigates the cellular origin and antigenic selection in primary cutaneous B-cell lymphoma (PCBCL). We analysed the nucleotide sequence of clonal IgV heavy-chain gene (IgVH) rearrangements in 51 cases of PCBCL (25 follicle centre, 19 marginal zone and seven diffuse large B-cell lymphoma, leg-type) and compared IgVH sequences with their closest germline segment in the GenBank database. Molecular data were then correlated with histopathological features. We showed that all but one of the 51 IgVH sequences analysed exhibited extensive somatic hypermutations. The detected mutation rate ranged from 1.6% to 21%, with a median rate of 9.8% and was independent of PCBCL histotype. Calculation of antigen-selection pressure showed that 39% of the mutated IgVH genes displayed a number of replacement mutations and silent mutations in a pattern consistent with antigenic selection. Furthermore, two segments, VH1-69 (12%) and VH4-59 (14%), were preferentially used in our case series. Data indicate that neoplastic B cells of PCBCL have experienced germinal centre reaction and also suggest that the involvement of IgVH genes is not entirely random in PCBCL and that common antigen epitopes could be pathologically relevant in cutaneous lymphomagenesis.

  19. Immunoglobulin heavy variable (IGHV) genes and alleles: new entities, new names and implications for research and prognostication in chronic lymphocytic leukaemia.

    PubMed

    Xochelli, Aliki; Agathangelidis, Andreas; Kavakiotis, Ioannis; Minga, Evangelia; Sutton, Lesley Ann; Baliakas, Panagiotis; Chouvarda, Ioanna; Giudicelli, Véronique; Vlahavas, Ioannis; Maglaveras, Nikos; Bonello, Lisa; Trentin, Livio; Tedeschi, Alessandra; Panagiotidis, Panagiotis; Geisler, Christian; Langerak, Anton W; Pospisilova, Sarka; Jelinek, Diane F; Oscier, David; Chiorazzi, Nicholas; Darzentas, Nikos; Davi, Fred; Ghia, Paolo; Rosenquist, Richard; Hadzidimitriou, Anastasia; Belessi, Chrysoula; Lefranc, Marie-Paule; Stamatopoulos, Kostas

    2015-01-01

    Next generation sequencing studies in Homo sapiens have identified novel immunoglobulin heavy variable (IGHV) genes and alleles necessitating changes in the international ImMunoGeneTics information system (IMGT) GENE-DB and reference directories of IMGT/V-QUEST. In chronic lymphocytic leukaemia (CLL), the somatic hypermutation (SHM) status of the clonotypic rearranged IGHV gene is strongly associated with patient outcome. Correct determination of this parameter strictly depends on the comparison of the nucleotide sequence of the clonotypic rearranged IGHV gene with that of the closest germline counterpart. Consequently, changes in the reference directories could, in principle, affect the correct interpretation of the IGHV mutational status in CLL. To this end, we analyzed 8066 productive IG heavy chain (IGH) rearrangement sequences from our consortium both before and after the latest update of the IMGT/V-QUEST reference directory. Differences were identified in 405 cases (5 % of the cohort). In 291/405 sequences (71.9 %), changes concerned only the IGHV gene or allele name, whereas a change in the percent germline identity (%GI) was noted in 114/405 (28.1 %) sequences; in 50/114 (43.8 %) sequences, changes in the %GI led to a change in the mutational set. In conclusion, recent changes in the IMGT reference directories affected the interpretation of SHM in a sizeable number of IGH rearrangement sequences from CLL patients. This indicates that both physicians and researchers should consider a re-evaluation of IG sequence data, especially for those IGH rearrangement sequences that, up to date, have a GI close to 98 %, where caution is warranted.
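
    The dependence of mutational status on the reference directory comes down to a simple ratio, which the Python sketch below makes concrete. It computes percent germline identity for two pre-aligned sequences and applies the 98 % cut-off conventionally used in CLL. The toy sequences, the function names, and the gap handling are assumptions for illustration and do not reproduce IMGT/V-QUEST, which performs its own alignment.

```python
def percent_germline_identity(rearranged: str, germline: str) -> float:
    """Percent of compared positions at which two pre-aligned sequences match."""
    if len(rearranged) != len(germline):
        raise ValueError("sequences must be aligned to the same length")
    compared = matches = 0
    for r, g in zip(rearranged.upper(), germline.upper()):
        if r in "-." or g in "-.":  # skip alignment gaps (assumed convention)
            continue
        compared += 1
        matches += r == g
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


def mutational_status(gi: float, cutoff: float = 98.0) -> str:
    """Conventional CLL rule: identity at or above the cut-off is 'unmutated'."""
    return "unmutated" if gi >= cutoff else "mutated"


def substitute(seq: str, pos: int, base: str) -> str:
    return seq[:pos] + base + seq[pos + 1:]


# Toy 100-nt "germline" and a rearranged sequence with three differences.
germline_old = "ACGT" * 25
rearranged = substitute(substitute(substitute(germline_old, 10, "A"), 40, "C"), 70, "T")
# Hypothetical updated allele that already carries one of those differences.
germline_new = substitute(germline_old, 10, "A")

gi_old = percent_germline_identity(rearranged, germline_old)
gi_new = percent_germline_identity(rearranged, germline_new)
print(gi_old, mutational_status(gi_old))
print(gi_new, mutational_status(gi_new))
```

    In this toy case a single newly described allele moves the sequence from 97 % to 98 % identity, which is enough to flip its category. This is why sequences near the cut-off are the ones the abstract flags for re-evaluation.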

  20. Spatial Variability of Cyanobacteria and Heterotrophic Bacteria in Lake Taihu (China).

    PubMed

    Qian, Haifeng; Lu, Tao; Song, Hao; Lavoie, Michel; Xu, Jiahui; Fan, Xiaoji; Pan, Xiangliang

    2017-09-01

    Cyanobacterial blooms frequently occur in Lake Taihu (China), but the intertwined relationships between biotic and abiotic factors modulating the frequency and duration of the blooms remain enigmatic. To better understand the relationships between the key abiotic and biotic factors and cyanobacterial blooms, we measured the abundance and diversity of prokaryotic organisms by high-throughput sequencing, the abundance of key genes involved in microcystin production and nitrogen fixation or loss as well as several physicochemical parameters at several stations in Lake Taihu during a cyanobacterial bloom of Microcystis sp. Measurements of the copy number of denitrification-related genes and 16S rRNA analyses show that denitrification potential and denitrifying bacteria abundance increased in concert with non-diazotrophic cyanobacteria (Microcystis sp.), suggesting limited competition between cyanobacteria and heterotrophic denitrifiers for nutrients, although potential bacteria-mediated N loss may hamper Microcystis growth. The present study provides insight into the importance of different abiotic and biotic factors in controlling cyanobacteria and heterotrophic bacteria spatial variability in Lake Taihu.

  1. Differential impact of transplantation on peripheral and tissue-associated viral reservoirs: Implications for HIV gene therapy

    PubMed Central

    Peterson, Christopher W.; Wang, Jianbin; Deleage, Claire; Reddy, Sowmya; Kaur, Jasbir; Polacino, Patricia; Reik, Andreas; Huang, Meei-Li; Holmes, Michael C.; Estes, Jacob D.

    2018-01-01

    Autologous transplantation and engraftment of HIV-resistant cells in sufficient numbers should recapitulate the functional cure of the Berlin Patient, with applicability to a greater number of infected individuals and with a superior safety profile. A robust preclinical model of suppressed HIV infection is critical in order to test such gene therapy-based cure strategies, both alone and in combination with other cure strategies. Here, we present a nonhuman primate (NHP) model of latent infection using simian/human immunodeficiency virus (SHIV) and combination antiretroviral therapy (cART) in pigtail macaques. We demonstrate that transplanted CCR5 gene-edited hematopoietic stem/progenitor cells (HSPCs) persist in infected and suppressed animals, and that protected cells expand through virus-dependent positive selection. CCR5 gene-edited cells are readily detectable in tissues, namely those closely associated with viral reservoirs such as lymph nodes and gastrointestinal tract. Following autologous transplantation, tissue-associated SHIV DNA and RNA levels in suppressed animals are significantly reduced (p ≤ 0.05), relative to suppressed, untransplanted control animals. In contrast, the size of the peripheral reservoir, measured by QVOA, is variably impacted by transplantation. Our studies demonstrate that CCR5 gene editing is equally feasible in infected and uninfected animals, that edited cells persist, traffic to, and engraft in tissue reservoirs, and that this approach significantly reduces secondary lymphoid tissue viral reservoir size. Our robust NHP model of HIV gene therapy and viral persistence can be immediately applied to the investigation of combinatorial approaches that incorporate anti-HIV gene therapy, immune modulators, therapeutic vaccination, and latency reversing agents. PMID:29672640

  2. Multilocus genotyping of Giardia duodenalis in Brazilian children.

    PubMed

    Scalia, Luana A M; Fava, Natália M N; Soares, Rodrigo M; Limongi, Jean E; da Cunha, Maria Júlia R; Pena, Isabella F; Kalapothakis, Evanguedes; Cury, Márcia C

    2016-06-01

    Giardia duodenalis is a parasite of several mammalian species, including humans, distributed worldwide. This research aimed to identify the molecular assemblages/sub-assemblages of G. duodenalis and to determine the intra-assemblage genetic variation of the different genes of assemblages A and B in pre-school children in the cities of Araguari and Uberlândia, Minas Gerais, Brazil. The molecular characterization followed β-giardin (bg), glutamate dehydrogenase (gdh) and triose phosphate isomerase (tpi) protocols. Of 226 stool samples, G. duodenalis cysts were found in 45 (19.9%). The tpi gene was amplified in 34 samples: 16 assemblage A, 14 B and four mixed samples A/B. The gdh gene was amplified in 32 samples, including 14 A, 16 B and two A/B. For the bg gene, 19 samples were sequenced: nine assemblage A, five B, three E, and two mixed, A/E and B/E. Animal-specific assemblage E was identified by bg, but was not confirmed for other genes. Twelve samples were characterized by full agreement of the three genes. Two new multilocus genotypes (MLGs) for assemblage A and two new MLGs for assemblage B were also described. These findings substantiate the importance of using more than one gene protocol since the sensitivity and genetic variability change with the locus used. Accession numbers: The GenBank accession numbers for the nucleotide sequences reported in this article are: JQ794877-JQ794890, JX033113-JX033118.

  3. Sampling gene diversity across the supergroup Amoebozoa: large EST data sets from Acanthamoeba castellanii, Hartmannella vermiformis, Physarum polycephalum, Hyperamoeba dachnaya and Hyperamoeba sp.

    PubMed

    Watkins, Russell F; Gray, Michael W

    2008-04-01

    From comparative analysis of EST data for five taxa within the eukaryotic supergroup Amoebozoa, including two free-living amoebae (Acanthamoeba castellanii, Hartmannella vermiformis) and three slime molds (Physarum polycephalum, Hyperamoeba dachnaya and Hyperamoeba sp.), we obtained new broad-range perspectives on the evolution and biosynthetic capacity of this assemblage. Together with genome sequences for the amoebozoans Dictyostelium discoideum and Entamoeba histolytica, and including partial genome sequence available for A. castellanii, we used the EST data to identify genes that appear to be exclusive to the supergroup, and to specific clades therein. Many of these genes are likely involved in cell-cell communication or differentiation. In examining on a broad scale a number of characters that previously have been considered in simpler cross-species comparisons, typically between Dictyostelium and Entamoeba, we find that Amoebozoa as a whole exhibits striking variation in the number and distribution of biosynthetic pathways, for example, ones for certain critical stress-response molecules, including trehalose and mannitol. Finally, we report additional compelling cases of lateral gene transfer within Amoebozoa, further emphasizing that although this process has influenced genome evolution in all examined amoebozoan taxa, it has done so to a variable extent.

  4. Mitochondrial genome sequences and comparative genomics of Phytophthora ramorum and P. sojae

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Martin, Frank N.; Douda, Bensasson; Tyler, Brett M.

    The complete sequences of the mitochondrial genomes of the oomycetes Phytophthora ramorum and P. sojae were determined during the course of their complete nuclear genome sequencing (Tyler, et al. 2006). Both are circular, with sizes of 39,314 bp for P. ramorum and 42,975 bp for P. sojae. Each contains a total of 37 identifiable protein-encoding genes, 25 or 26 tRNAs (P. sojae and P. ramorum, respectively) specifying 19 amino acids, and a variable number of ORFs (7 for P. ramorum and 12 for P. sojae) which are potentially additional functional genes. Non-coding regions comprise approximately 11.5 percent and 18.4 percent of the genomes of P. ramorum and P. sojae, respectively. Relative to P. sojae, there is an inverted repeat of 1,150 bp in P. ramorum that includes an unassigned unique ORF, a tRNA gene, and adjacent non-coding sequences, but otherwise the gene order in both species is identical. Comparisons of these genomes with published sequences of the P. infestans mitochondrial genome reveal a number of similarities, but the gene order in P. infestans differs in two adjacent locations due to inversions. Sequence alignments of the three genomes indicated sequence conservation ranging from 75 to 85 percent and that specific regions were more variable than others.

  5. The distribution and impact of common copy-number variation in the genome of the domesticated apple, Malus x domestica Borkh.

    PubMed

    Boocock, James; Chagné, David; Merriman, Tony R; Black, Michael A

    2015-10-23

    Copy number variation (CNV) is a common feature of eukaryotic genomes, and a growing body of evidence suggests that genes affected by CNV are enriched in processes that are associated with environmental responses. Here we use next generation sequence (NGS) data to detect copy-number variable regions (CNVRs) within the Malus x domestica genome, as well as to examine their distribution and impact. CNVRs were detected using NGS data derived from 30 accessions of M. x domestica analyzed using the read-depth method, as implemented in the CNVrd2 software. To improve the reliability of our results, we developed a quality control and analysis procedure that involved checking for organelle DNA, not repeat masking, and the determination of CNVR identity using a permutation testing procedure. Overall, we identified 876 CNVRs, which spanned 3.5 % of the apple genome. To verify that detected CNVRs were not artifacts, we analyzed the B-allele frequencies (BAF) within a single nucleotide polymorphism (SNP) array dataset derived from a screening of 185 individual apple accessions and found the CNVRs were enriched for SNPs having aberrant BAFs (P < 1e-13, Fisher's exact test). Putative CNVRs overlapped 845 gene models and were enriched for resistance (R) gene models (P < 1e-22, Fisher's exact test). Of note was a cluster of resistance gene models on chromosome 2 near a region containing multiple major gene loci conferring resistance to apple scab. We present the first analysis and catalogue of CNVRs in the M. x domestica genome. The enrichment of the CNVRs with R gene models and their overlap with gene loci of agricultural significance draw attention to a form of unexplored genetic variation in apple. This research will underpin further investigation of the role that CNV plays within the apple genome.
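
    The enrichment tests named in this abstract are Fisher's exact tests on 2x2 tables. As a minimal sketch of how such a test works, the Python function below computes a one-sided p-value from the hypergeometric distribution using only the standard library. The abstract does not say whether the tests were one- or two-sided and does not report the underlying counts, so the one-sided form and the function name are assumptions and no table from the study is reproduced.

```python
from math import comb


def fisher_exact_greater(a: int, b: int, c: int, d: int) -> float:
    """One-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Returns P(X >= a), where X is the top-left cell under the hypergeometric
    null with all row and column totals held fixed.
    """
    row1 = a + b          # e.g. items inside CNVRs
    col1 = a + c          # e.g. items carrying the feature of interest
    n = a + b + c + d     # all items
    denom = comb(n, col1)
    upper = min(row1, col1)
    # Sum the probabilities of tables at least as extreme as the observed one.
    tail = sum(comb(row1, k) * comb(n - row1, col1 - k)
               for k in range(a, upper + 1))
    return tail / denom
```

    For a perfectly separated table such as [[3, 0], [0, 3]] this returns 0.05, the probability of drawing all three featured items into the first row by chance.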

  6. CCL3L1 copy number and susceptibility to malaria

    PubMed Central

    Carpenter, Danielle; Färnert, Anna; Rooth, Ingegerd; Armour, John A.L.; Shaw, Marie-Anne

    2012-01-01

    Copy number variation can contribute to the variation observed in susceptibility to complex diseases. Here we present the first study to investigate copy number variation of the chemokine gene CCL3L1 with susceptibility to malaria. We present a family-based genetic analysis of a Tanzanian population (n = 922), using parasite load, mean number of clinical infections of malaria and haemoglobin levels as phenotypes. Copy number of CCL3L1 was measured using the paralogue ratio test (PRT) and the dataset exhibited copy numbers ranging between 1 and 10 copies per diploid genome (pdg). Association between copy number and phenotypes was assessed. Furthermore, we were able to identify copy number haplotypes in some families, using microsatellites within the copy variable region, for transmission disequilibrium testing. We identified a high level of copy number haplotype diversity and find some evidence for an association of low CCL3L1 copy number with protection from anaemia. PMID:22484763
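
    For readers unfamiliar with transmission disequilibrium testing, the classic biallelic form can be sketched in a few lines of Python. This is a simplification and not the study's analysis: copy number haplotypes are multi-allelic, and the abstract does not describe the exact test variant used. The function name is hypothetical, and the inputs are counts of heterozygous parents who did or did not transmit a given haplotype to an affected child.

```python
from math import erfc, sqrt


def tdt(transmitted: int, untransmitted: int) -> tuple[float, float]:
    """McNemar-type TDT statistic and its p-value (chi-square, 1 df).

    chi2 = (b - c)^2 / (b + c), with b and c the transmitted and
    untransmitted counts from heterozygous parents.
    """
    total = transmitted + untransmitted
    if total == 0:
        raise ValueError("no informative transmissions")
    chi2 = (transmitted - untransmitted) ** 2 / total
    # Survival function of the chi-square distribution with 1 degree of freedom.
    p_value = erfc(sqrt(chi2 / 2.0))
    return chi2, p_value
```

    Under the null hypothesis of no linkage or association, a heterozygous parent transmits either haplotype with probability one half, so a large imbalance between the two counts yields a small p-value.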

  7. CCL3L1 copy number and susceptibility to malaria.

    PubMed

    Carpenter, Danielle; Färnert, Anna; Rooth, Ingegerd; Armour, John A L; Shaw, Marie-Anne

    2012-07-01

    Copy number variation can contribute to the variation observed in susceptibility to complex diseases. Here we present the first study to investigate copy number variation of the chemokine gene CCL3L1 with susceptibility to malaria. We present a family-based genetic analysis of a Tanzanian population (n=922), using parasite load, mean number of clinical infections of malaria and haemoglobin levels as phenotypes. Copy number of CCL3L1 was measured using the paralogue ratio test (PRT) and the dataset exhibited copy numbers ranging between 1 and 10 copies per diploid genome (pdg). Association between copy number and phenotypes was assessed. Furthermore, we were able to identify copy number haplotypes in some families, using microsatellites within the copy variable region, for transmission disequilibrium testing. We identified a high level of copy number haplotype diversity and find some evidence for an association of low CCL3L1 copy number with protection from anaemia.

  8. Gene sequence variability of the three surface proteins of human respiratory syncytial virus (HRSV) in Texas.

    PubMed

    Tapia, Lorena I; Shaw, Chad A; Aideyan, Letisha O; Jewell, Alan M; Dawson, Brian C; Haq, Taha R; Piedra, Pedro A

    2014-01-01

    Human respiratory syncytial virus (HRSV) has three surface glycoproteins: small hydrophobic (SH), attachment (G) and fusion (F), encoded by three consecutive genes (SH-G-F). A 270-nt fragment of the G gene is used to genotype HRSV isolates. This study genotyped and investigated the variability of the gene and amino acid sequences of the three surface proteins of HRSV strains collected from 1987 to 2005 from one center. Sixty original clinical isolates and 5 prototype strains were analyzed. Sequences containing SH, F and G genes were generated, and multiple alignments and phylogenetic trees were analyzed. Genetic variability by protein domains comparing virus genotypes was assessed. Complete sequences of the SH-G-F genes were obtained for all 65 samples: HRSV-A = 35; HRSV-B = 30. In group A strains, genotypes GA5 and GA2 were predominant. For HRSV-B strains, the genotype GB4 was predominant from 1992 to 1994 and only genotype BA viruses were detected in 2004-2005. Different genetic variability at nucleotide level was detected between the genes, with G gene being the most variable and the highest variability detected in the 270-nt G fragment that is frequently used to genotype the virus. High variability (>10%) was also detected in the signal peptide and transmembrane domains of the F gene of HRSV A strains. Variability among the HRSV strains resulting in non-synonymous changes was detected in hypervariable domains of G protein, the signal peptide of the F protein, a not previously defined domain in the F protein, and the antigenic site Ø in the pre-fusion F. Divergent trends were observed between HRSV -A and -B groups for some functional domains. A diverse population of HRSV -A and -B genotypes circulated in Houston during an 18 year period. We hypothesize that diverse sequence variation of the surface protein genes provide HRSV strains a survival advantage in a partially immune-protected community.

  9. Gene Sequence Variability of the Three Surface Proteins of Human Respiratory Syncytial Virus (HRSV) in Texas

    PubMed Central

    Tapia, Lorena I.; Shaw, Chad A.; Aideyan, Letisha O.; Jewell, Alan M.; Dawson, Brian C.; Haq, Taha R.; Piedra, Pedro A.

    2014-01-01

    Human respiratory syncytial virus (HRSV) has three surface glycoproteins: small hydrophobic (SH), attachment (G) and fusion (F), encoded by three consecutive genes (SH-G-F). A 270-nt fragment of the G gene is used to genotype HRSV isolates. This study genotyped and investigated the variability of the gene and amino acid sequences of the three surface proteins of HRSV strains collected from 1987 to 2005 from one center. Sixty original clinical isolates and 5 prototype strains were analyzed. Sequences containing SH, F and G genes were generated, and multiple alignments and phylogenetic trees were analyzed. Genetic variability by protein domains comparing virus genotypes was assessed. Complete sequences of the SH-G-F genes were obtained for all 65 samples: HRSV-A = 35; HRSV-B = 30. In group A strains, genotypes GA5 and GA2 were predominant. For HRSV-B strains, the genotype GB4 was predominant from 1992 to 1994 and only genotype BA viruses were detected in 2004–2005. Different genetic variability at nucleotide level was detected between the genes, with G gene being the most variable and the highest variability detected in the 270-nt G fragment that is frequently used to genotype the virus. High variability (>10%) was also detected in the signal peptide and transmembrane domains of the F gene of HRSV A strains. Variability among the HRSV strains resulting in non-synonymous changes was detected in hypervariable domains of G protein, the signal peptide of the F protein, a not previously defined domain in the F protein, and the antigenic site Ø in the pre-fusion F. Divergent trends were observed between HRSV -A and -B groups for some functional domains. A diverse population of HRSV -A and -B genotypes circulated in Houston during an 18 year period. We hypothesize that diverse sequence variation of the surface protein genes provide HRSV strains a survival advantage in a partially immune-protected community. PMID:24625544

  10. Introduced T cell receptor variable region gene segments recombine in pre-B cells: evidence that B and T cells use a common recombinase.

    PubMed

    Yancopoulos, G D; Blackwell, T K; Suh, H; Hood, L; Alt, F W

    1986-01-31

    We have recently proposed that a common recombinase performs all of the many variable region gene assembly events in B and T cells, and that the specificity of these joining events is mediated by regulating the "accessibility" of the involved gene segments. To test this possibility, we have introduced "accessible" T cell receptor (TCR) variable region gene segments into a pre-B cell line capable of recombining endogenous and transfected immunoglobulin (Ig) variable region gene segments. Although the corresponding "inaccessible" endogenous TCR gene segments do not rearrange in this line or in B cells in general, the introduced TCR gene segments join very frequently and, in fact, closely resemble introduced Ig gene segments in their recombination characteristics. These observations suggest a new role for conventional Ig transcriptional enhancers--recombinational enhancement. Our studies provide insight into additional aspects of the joining mechanism such as N region insertion, aberrant joining, and recombination-recognition sequence requirements for joining.

  11. Evaluation of the ability of Streptococcus agalactiae strains isolated from genital and neonatal specimens to bind to human fibrinogen and correlation with characteristics of the fbsA and fbsB genes.

    PubMed

    Rosenau, Agnès; Martins, Karine; Amor, Souheila; Gannier, François; Lanotte, Philippe; van der Mee-Marquet, Nathalie; Mereghetti, Laurent; Quentin, Roland

    2007-03-01

    The ability of 111 Streptococcus agalactiae strains to bind to human fibrinogen was quantified. We correlated the percentages of bacteria that bound to immobilized fibrinogen with fibrinogen-binding (fbs) gene characteristics of strains and with clinical origin, serotypes, and phylogenetic positions of strains. Percentages varied from 0.4 to 29.9%. Fifty-five strains (49.5%) had the fbsB gene sensu stricto described by Gutekunst et al. (Infect. Immun., 72:3495-3504, 2004), allowing adhesion to human fibrinogen, and all of the other strains had an fgag variant gene. Ninety strains (81.1%) had a fbsA gene and 55 of them also had the fbsB gene. The other 21 strains (18.9%) had a truncated form of fbsA without the fbsB gene sensu stricto. The numbers of 48-nucleotide repeat sequences (rs) in the fbsA gene varied from 2 to 26. The population of strains with the highest ability to bind to human fibrinogen significantly more frequently had the fbsB gene sensu stricto and 4 to 7 rs in the fbsA gene (P < 0.05). However, the single strain that carried the highest number of rs (26 rs) in the fbsA gene showed high fibrinogen-binding activity (24.3%). Strains exhibiting significantly higher levels of binding to human fibrinogen belonged to a phylogenetic group of strains associated with neonatal meningitis, currently known as the ST-17 clone, that is mostly composed of serotype III strains. These findings indicate that S. agalactiae strains possess a wide variety of fbs gene content that markedly influences the ability of strains to bind to human fibrinogen. Variations in the configuration and the expression of the Fbs proteins may therefore partly explain the variability of virulence in S. agalactiae species.

  12. Gene expression of commensal Lactobacillus johnsonii strain NCC533 during in vitro growth and in the murine gut.

    PubMed

    Denou, Emmanuel; Berger, Bernard; Barretto, Caroline; Panoff, Jean-Michel; Arigoni, Fabrizio; Brüssow, Harald

    2007-11-01

    Work with pathogens like Vibrio cholerae has shown major differences between genes expressed in bacteria grown in vitro and in vivo. To explore this subject for commensals, we investigated the transcription of the Lactobacillus johnsonii NCC533 genome during in vitro and in vivo growth using the microarray technology. During broth growth, 537, 626, and 277 of the 1,756 tested genes were expressed during exponential phase, "adaptation" (early stationary phase), and stationary phase, respectively. One hundred one, 150, and 33 genes, respectively, were specifically transcribed in these three phases. To explore the in vivo transcription program, we fed L. johnsonii containing a resistance plasmid to antibiotic-treated mice. After a 2-day washout phase, we determined the viable-cell counts of lactobacilli that were in the lumina and associated with the mucosae of different gut segments. While the cell counts showed a rather uniform distribution along the gut, we observed marked differences with respect to the expression of the Lactobacillus genome. The largest number of transcribed genes was in the stomach (n = 786); the next-largest numbers occurred in the cecum (n = 391) and the jejunum (n = 296), while only 26 Lactobacillus genes were transcribed in the colon. In vitro and in vivo transcription programs overlapped only partially. One hundred ninety-one of the transcripts from the lactobacilli in the stomach were not detected during in vitro growth; 202 and 213 genes, respectively, were transcribed under all in vitro and in vivo conditions; but the core transcriptome for all growth conditions comprised only 103 genes. Forty-four percent of the NCC533 genes were not detectably transcribed under any of the investigated conditions. Nontranscribed genes were clustered on the genome and enriched in the variable-genome part. Our data revealed not only major differences between in vitro- and in vivo-expressed genes in a Lactobacillus gut commensal organism but also marked changes in the expression of genes along the digestive tract.

  13. Human beta-globin gene polymorphisms characterized in DNA extracted from ancient bones 12,000 years old.

    PubMed

    Béraud-Colomb, E; Roubin, R; Martin, J; Maroc, N; Gardeisen, A; Trabuchet, G; Goosséns, M

    1995-12-01

    Analyzing the nuclear DNA from ancient human bones is an essential step to the understanding of genetic diversity in current populations, provided that such systematic studies are experimentally feasible. This article reports the successful extraction and amplification of nuclear DNA from the beta-globin region from 5 of 10 bone specimens up to 12,000 years old. These have been typed for beta-globin frameworks by sequencing through two variable positions and for a polymorphic (AT)x(T)y microsatellite 500 bp upstream of the beta-globin gene. These specimens of human remains are somewhat older than those analyzed in previous nuclear gene sequencing reports and considerably older than those used to study high-copy-number human mtDNA. These results show that the systematic study of nuclear DNA polymorphisms of ancient populations is feasible.

  14. Global sequence variation in the histidine-rich proteins 2 and 3 of Plasmodium falciparum: implications for the performance of malaria rapid diagnostic tests

    PubMed Central

    2010-01-01

    Background: Accurate diagnosis is essential for prompt and appropriate treatment of malaria. While rapid diagnostic tests (RDTs) offer great potential to improve malaria diagnosis, the sensitivity of RDTs has been reported to be highly variable. One possible factor contributing to variable test performance is the diversity of parasite antigens. This is of particular concern for Plasmodium falciparum histidine-rich protein 2 (PfHRP2)-detecting RDTs since PfHRP2 has been reported to be highly variable in isolates of the Asia-Pacific region. Methods: The pfhrp2 exon 2 fragment from 458 isolates of P. falciparum collected from 38 countries was amplified and sequenced. For a subset of 80 isolates, the exon 2 fragment of histidine-rich protein 3 (pfhrp3) was also amplified and sequenced. DNA sequence and statistical analysis of the variation observed in these genes was conducted. The potential impact of the pfhrp2 variation on RDT detection rates was examined by analysing the relationship between sequence characteristics of this gene and the results of the WHO product testing of malaria RDTs: Round 1 (2008), for 34 PfHRP2-detecting RDTs. Results: Sequence analysis revealed extensive variations in the number and arrangement of various repeats encoded by the genes in parasite populations world-wide. However, no statistically robust correlation between gene structure and RDT detection rate for P. falciparum parasites at 200 parasites per microlitre was identified. Conclusions: The results suggest that despite extreme sequence variation, diversity of PfHRP2 does not appear to be a major cause of RDT sensitivity variation. PMID:20470441

  15. Desynapsis and spontaneous trisomy in jute (Corchorus olitorius L.).

    PubMed

    Basak, S L; Paria, P

    1980-11-01

    Cytological studies in desynaptic plants, isolated at the F6 generation of an intervarietal cross of Corchorus olitorius L., have shown variable numbers of bivalents and univalents in the PMCs at metaphase I, resulting in irregular distribution of chromosomes at anaphase I. The progenies of the desynaptic plants consisted of 9.24 percent of all possible primary trisomics except trisomic 6. The desynaptic condition is controlled by a pair of simple recessive genes.

  16. Copy number variation of the APC gene is associated with regulation of bone mineral density

    PubMed Central

    Chew, Shelby; Dastani, Zari; Brown, Suzanne J.; Lewis, Joshua R.; Dudbridge, Frank; Soranzo, Nicole; Surdulescu, Gabriela L.; Richards, J. Brent; Spector, Tim D.; Wilson, Scott G.

    2012-01-01

    Introduction: Genetic studies of osteoporosis have commonly examined SNPs in candidate genes or whole genome analyses, but insertions and deletions of DNA, collectively called copy number variations (CNVs), also comprise a large amount of the genetic variability between individuals. Previously, SNPs in the APC gene have been strongly associated with femoral neck and lumbar spine volumetric bone mineral density in older men. In addition, familial adenomatous polyposis patients carrying heterozygous mutations in the APC gene have been shown to have significantly higher mean bone mineral density than age- and sex-matched controls suggesting the importance of this gene in regulating bone mineral density. We examined CNV within the APC gene region to test for association with bone mineral density. Methods: DNA was extracted from venous blood, genotyped using the Human Hap610 arrays and CNV determined from the fluorescence intensity data in 2070 Caucasian men and women aged 47.0 ± 13.0 (mean ± SD) years, to assess the effects of the CNV on bone mineral density at the forearm, spine and total hip sites. Results: Data for covariate adjusted bone mineral density from subjects grouped by APC CNV genotype showed significant difference (P = 0.02–0.002). Subjects with a single copy loss of APC had a 7.95%, 13.10% and 13.36% increase in bone mineral density at the forearm, spine and total hip sites respectively, compared to subjects with two copies of the APC gene. Conclusions: These data support previous findings of APC regulating bone mineral density and demonstrate that a novel CNV of the APC gene is significantly associated with bone mineral density in Caucasian men and women. PMID:22884971

  17. CNV-RF Is a Random Forest-Based Copy Number Variation Detection Method Using Next-Generation Sequencing.

    PubMed

    Onsongo, Getiria; Baughn, Linda B; Bower, Matthew; Henzler, Christine; Schomaker, Matthew; Silverstein, Kevin A T; Thyagarajan, Bharat

    2016-11-01

    Simultaneous detection of small copy number variations (CNVs) (<0.5 kb) and single-nucleotide variants in clinically significant genes is of great interest for clinical laboratories. The analytical variability in next-generation sequencing (NGS) and artifacts in coverage data because of issues with mappability along with lack of robust bioinformatics tools for CNV detection have limited the utility of targeted NGS data to identify CNVs. We describe the development and implementation of a bioinformatics algorithm, copy number variation-random forest (CNV-RF), that incorporates a machine learning component to identify CNVs from targeted NGS data. Using CNV-RF, we identified 12 of 13 deletions in samples with known CNVs, two cases with duplications, and identified novel deletions in 22 additional cases. Furthermore, no CNVs were identified among 60 genes in 14 cases with normal copy number and no CNVs were identified in another 104 patients with clinical suspicion of CNVs. All positive deletions and duplications were confirmed using a quantitative PCR method. CNV-RF also detected heterozygous deletions and duplications with a specificity of 50% across 4813 genes. The ability of CNV-RF to detect clinically relevant CNVs with a high degree of sensitivity along with confirmation using a low-cost quantitative PCR method provides a framework for providing comprehensive NGS-based CNV/single-nucleotide variant detection in a clinical molecular diagnostics laboratory.
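
    The "12 of 13 deletions" figure corresponds to a sensitivity of about 92 % for known deletions. Because the validation set is small, an interval estimate is informative alongside the point value. The Python sketch below computes both, using a Wilson score interval. The choice of the Wilson interval and the function names are assumptions made for illustration and are not part of the published evaluation.

```python
from math import sqrt


def sensitivity(true_pos: int, false_neg: int) -> float:
    """Fraction of known positives that were detected."""
    return true_pos / (true_pos + false_neg)


def wilson_interval(successes: int, n: int, z: float = 1.96) -> tuple[float, float]:
    """Wilson score interval for a binomial proportion (z = 1.96 for ~95 %)."""
    if n <= 0:
        raise ValueError("n must be positive")
    p = successes / n
    denom = 1.0 + z * z / n
    center = (p + z * z / (2 * n)) / denom
    half = z * sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return center - half, center + half


# Counts taken from the abstract: 12 of 13 known deletions were identified.
point = sensitivity(12, 1)
low, high = wilson_interval(12, 13)
```

    With only 13 known deletions the interval is wide, spanning roughly two thirds to nearly all of the deletions, so the point estimate alone overstates how precisely the sensitivity is known.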

  18. Low-copy nuclear primers and ycf1 primers in Cactaceae.

    PubMed

    Franck, Alan R; Cochrane, Bruce J; Garey, James R

    2012-10-01

    To increase the number of variable regions available for phylogenetic study in the Cactaceae, primers were developed for a portion of the plastid ycf1 gene and intron-spanning regions of two low-copy nuclear genes (isi1, nhx1). Primers were tested on several families within Caryophyllales, focusing on the Cactaceae. Gel electrophoresis indicated positive amplification in most samples. Sequences of these three regions (isi1, nhx1, ycf1) from Harrisia exhibited variation similar to or greater than two plastid regions (atpB-rbcL intergenic spacer and rpl16 intron). The isi1, nhx1, and ycf1 primers amplify phylogenetically useful information applicable to the Cactaceae and other families in the Caryophyllales.

  19. Methodological requirements for valid tissue-based biomarker studies that can be used in clinical practice.

    PubMed

    True, Lawrence D

    2014-03-01

    Paralleling the growth of ever more cost efficient methods to sequence the whole genome in minute fragments of tissue has been the identification of increasingly numerous molecular abnormalities in cancers--mutations, amplifications, insertions and deletions of genes, and patterns of differential gene expression, i.e., overexpression of growth factors and underexpression of tumor suppressor genes. These abnormalities can be translated into assays to be used in clinical decision making. In general terms, the result of such an assay is subject to a large number of variables regarding the characteristics of the available sample, particularities of the used assay, and the interpretation of the results. This review discusses the effects of these variables on assays of tissue-based biomarkers, classified by macromolecule--DNA, RNA (including micro RNA, messenger RNA, long noncoding RNA), protein, and phosphoprotein. Since the majority of clinically applicable biomarkers are immunohistochemically detectable proteins this review focuses on protein biomarkers. However, the principles outlined are mostly applicable to any other analyte. A variety of preanalytical variables impacts on the results obtained, including analyte stability (which is different for different analytes, i.e., DNA, RNA, or protein), period of warm and of cold ischemia, fixation time, tissue processing, sample storage time, and storage conditions. In addition, assay variables play an important role, including reagent specificity (notably but not uniquely an issue concerning antibodies used in immunohistochemistry), technical components of the assay, quantitation, and assay interpretation. Finally, appropriateness of an assay for clinical application is an important issue. Reference is made to publicly available guidelines to improve on biomarker development in general and requirements for clinical use in particular. Strategic goals are formulated in order to improve on the quality of biomarker reporting, including issues of analyte quality, experimental detail, assay efficiency and precision, and assay appropriateness.

  20. Analysis of the Variability of Epstein-Barr Virus Genes in Infectious Mononucleosis: Investigation of the Potential Correlation with Biochemical Parameters of Hepatic Involvement.

    PubMed

    Banko, Ana; Lazarevic, Ivana; Stevanovic, Goran; Cirkovic, Andja; Karalic, Danijela; Cupic, Maja; Banko, Bojan; Milovanovic, Jovica; Jovanovic, Tanja

    2016-09-01

    Primary Epstein-Barr virus (EBV) infection is usually asymptomatic, although at times it results in the benign lymphoproliferative disease, infectious mononucleosis (IM), during which almost half of patients develop hepatitis. The aims of the present study are to evaluate polymorphisms of EBV genes circulating in IM isolates from this geographic region and to investigate the correlation of viral sequence patterns with the available IM biochemical parameters. The study included plasma samples from 128 IM patients. The genes EBNA2, LMP1, and EBNA1 were amplified using nested-PCR. EBNA2 genotyping was performed by visualization of PCR products using gel electrophoresis. Investigation of LMP1 and EBNA1 included sequence, phylogenetic, and statistical analyses. The presence of EBV DNA in plasma samples showed correlation with patients' necessity for hospitalization (p=0.034). The majority of EBV isolates was genotype 1. LMP1 variability showed 4 known variants, and two new deletions (27-bp and 147-bp). Of the 3 analyzed attributes of LMP1 isolates, the number of 33-bp repeats less than the reference 4.5 was the only one that absolutely correlated with the elevated levels of transaminases. EBNA1 variability was presented by prototype subtypes. A particular combination of EBNA2, LMP1, and EBNA1 polymorphisms, deleted LMP1/P-thr and non-deleted LMP1/P-ala, as well as genotype 1/ 4.5 33-bp LMP1 repeats or genotype 2/ 4.5 33-bp LMP1 repeats showed correlation with elevated AST (aspartate aminotransferase) and ALT (alanine transaminase). This is the first study which identified the association between EBV variability and biochemical parameters in IM patients. These results showed a possibility for the identification of hepatic related diagnostic EBV markers.

  1. Early transcriptomic changes induced by magnesium deficiency in Arabidopsis thaliana reveal the alteration of circadian clock gene expression in roots and the triggering of abscisic acid-responsive genes.

    PubMed

    Hermans, Christian; Vuylsteke, Marnik; Coppens, Frederik; Craciun, Adrian; Inzé, Dirk; Verbruggen, Nathalie

    2010-07-01

    Plant growth and development ultimately depend on environmental variables such as the availability of essential minerals. Unravelling how nutrients affect gene expression will help to understand how they regulate plant growth. This study reports the early transcriptomic response to magnesium (Mg) deprivation in Arabidopsis. Whole-genome transcriptome was studied in the roots and young mature leaves 4, 8 and 28 h after the removal of Mg from the nutrient solution. The highest number of regulated genes was first observed in the roots. Contrary to other mineral deficiencies, Mg depletion did not induce a higher expression of annotated genes in Mg uptake. Remarkable responses include the perturbation of the central oscillator of the circadian clock in roots and the triggering of abscisic acid (ABA) signalling, with half of the up-regulated Mg genes in leaves being ABA-responsive. However, no change in ABA content was observed. The specificity of the response of some Mg-regulated genes was challenged by studying their expression after other mineral deficiencies and environmental stresses. The possibility to develop markers for Mg incipient deficiency is discussed here.

  2. Identification, inheritance, and linkage of B-G-like and MHC class I genes in cranes

    USGS Publications Warehouse

    Jarvi, S.I.; Goto, R.M.; Gee, G.F.; Briles, W.E.; Miller, M.M.

    1999-01-01

    We identified B-G-like genes in the whooping and Florida sandhill cranes and linked them to the major histocompatibility complex (MHC). We evaluated the inheritance of B-G-like genes in families of whooping and Florida sandhill cranes using restriction fragment patterns (RFPs). Two B-G-like genes, designated wcbg1 and wcbg2, were located within 8 kb of one another. The fully sequenced wcbg2 gene encodes a B-G IgV-like domain, an additional Ig-like domain, a transmembrane domain, and a single heptad domain typical of alpha-helical coiled coils. Patterns of restriction fragments in DNA from the whooping crane and from a number of other species indicate that the B-G-like gene families of cranes are large with diverse sequences. Segregation of RFPs in families of Florida sandhill cranes provide evidence for genetic polymorphism in the B-G-like genes. The restriction fragments generally segregated in concert with MHC haplotypes assigned by serological typing and by single stranded conformational polymorphism (SSCP) assays based in the second exon of the crane MHC class I genes. This study supports the concept of a long-term association of polymorphic B-G-like genes with the MHC. It also establishes SSCP as a means for evaluating MHC genetic variability in cranes.

  3. Identification, inheritance, and linkage of B-G-like and MHC class I genes in cranes.

    PubMed

    Jarvi, S I; Goto, R M; Gee, G F; Briles, W E; Miller, M M

    1999-01-01

    We identified B-G-like genes in the whooping and Florida sandhill cranes and linked them to the major histocompatibility complex (MHC). We evaluated the inheritance of B-G-like genes in families of whooping and Florida sandhill cranes using restriction fragment patterns (RFPs). Two B-G-like genes, designated wcbg1 and wcbg2, were located within 8 kb of one another. The fully sequenced wcbg2 gene encodes a B-G IgV-like domain, an additional Ig-like domain, a transmembrane domain, and a single heptad domain typical of alpha-helical coiled coils. Patterns of restriction fragments in DNA from the whooping crane and from a number of other species indicate that the B-G-like gene families of cranes are large with diverse sequences. Segregation of RFPs in families of Florida sandhill cranes provide evidence for genetic polymorphism in the B-G-like genes. The restriction fragments generally segregated in concert with MHC haplotypes assigned by serological typing and by single stranded conformational polymorphism (SSCP) assays based in the second exon of the crane MHC class I genes. This study supports the concept of a long-term association of polymorphic B-G-like genes with the MHC. It also establishes SSCP as a means for evaluating MHC genetic variability in cranes.

  4. spa typing for epidemiological surveillance of Staphylococcus aureus.

    PubMed

    Hallin, Marie; Friedrich, Alexander W; Struelens, Marc J

    2009-01-01

    The spa typing method is based on sequencing of the polymorphic X region of the protein A gene (spa), present in all strains of Staphylococcus aureus. The X region is constituted of a variable number of 24-bp repeats flanked by well-conserved regions. This single-locus sequence-based typing method combines a number of technical advantages, such as rapidity, reproducibility, and portability. Moreover, due to its repeat structure, the spa locus simultaneously indexes micro- and macrovariations, enabling the use of spa typing in both local and global epidemiological studies. These studies are facilitated by the establishment of standardized spa type nomenclature and Internet shared databases.

  5. Somatic diversification of chicken immunoglobulin light chains by point mutations.

    PubMed

    Parvari, R; Ziv, E; Lantner, F; Heller, D; Schechter, I

    1990-04-01

    The light-chain locus of chicken has 1 functional V lambda 1 gene, 1 J gene, and 25 pseudo-V lambda-genes (where V = variable and J = joining). A major problem is which somatic mechanisms expand this extremely limited germ-line information to generate many different antibodies. Weill's group [Reynaud, C. A., Anquez, V., Grimal, H. & Weill, J. C. (1987) Cell 48, 379-388] has shown that the pseudo-V lambda-genes diversify the rearranged V lambda 1 by gene conversion. Here we demonstrate that chicken light chains are further diversified by somatic point mutations and by V lambda 1-J flexible joining. Somatic point mutations were identified in the J and 3' noncoding DNA of rearranged light-chain genes of chicken. These regions were analyzed because point mutations in V lambda 1 are obscured by gene conversion; the J and 3' noncoding DNA are presented in one copy per haploid genome and are not subject to gene conversion. In rodents point mutations occur as frequently in the V-J coding regions as in the adjacent flanking DNA. Therefore, we conclude that somatic point mutations diversify the V lambda 1 of chicken. The frequency (0-1%) and distribution of the mutations (decreasing in number with increased distance from the V lambda 1 segment) in chicken were as observed in rodents. Sequence variability at the V lambda 1-J junctions could be attributed to imprecise joining of the V lambda 1 and J genes. The modification by gene conversion of rearranged V lambda 1 genes in the bursa was similar in chicken aged 3 months (9.5%) or 3 weeks (9.1%)--i.e., gene conversion that generates the preimmune repertoire in the bursa seems to level off around 3 weeks of age. This preimmune repertoire can be further diversified by somatic point mutations that presumably lead to the formation of antibodies with increased affinity. 
A segment with structural features of a matrix association region [(A + T)-rich and four topoisomerase II binding sites] was identified in the middle of the J-C lambda intron (where C = constant).

  6. OGEE v2: an update of the online gene essentiality database with special focus on differentially essential genes in human cancer cell lines.

    PubMed

    Chen, Wei-Hua; Lu, Guanting; Chen, Xiao; Zhao, Xing-Ming; Bork, Peer

    2017-01-04

    OGEE is an Online GEne Essentiality database. To enhance our understanding of the essentiality of genes, in OGEE we collected experimentally tested essential and non-essential genes, as well as associated gene properties known to contribute to gene essentiality. We focus on large-scale experiments, and complement our data with text-mining results. We organized tested genes into data sets according to their sources, and tagged those with variable essentiality statuses across data sets as conditionally essential genes, intending to highlight the complex interplay between gene functions and environments/experimental perturbations. Developments since the last public release include increased numbers of species and gene essentiality data sets, inclusion of non-coding essential sequences and genes with intermediate essentiality statuses. In addition, we included 16 essentiality data sets from cancer cell lines, corresponding to 9 human cancers; with OGEE, users can easily explore the shared and differentially essential genes within and between cancer types. These genes, especially those derived from cell lines that are similar to tumor samples, could reveal the oncogenic drivers, paralogous gene expression pattern and chromosomal structure of the corresponding cancer types, and can be further screened to identify targets for cancer therapy and/or new drug development. OGEE is freely available at http://ogee.medgenius.info. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  7. Genomic analysis of differentiation between soil types reveals candidate genes for local adaptation in Arabidopsis lyrata.

    PubMed

    Turner, Thomas L; von Wettberg, Eric J; Nuzhdin, Sergey V

    2008-09-11

    Serpentine soil, which is naturally high in heavy metal content and has low calcium to magnesium ratios, comprises a difficult environment for most plants. An impressive number of species are endemic to serpentine, and a wide range of non-endemic plant taxa have been shown to be locally adapted to these soils. Locating genomic polymorphisms which are differentiated between serpentine and non-serpentine populations would provide candidate loci for serpentine adaptation. We have used the Arabidopsis thaliana tiling array, which has 2.85 million probes throughout the genome, to measure genetic differentiation between populations of Arabidopsis lyrata growing on granitic soils and those growing on serpentinic soils. The significant overrepresentation of genes involved in ion transport and other functions provides a starting point for investigating the molecular basis of adaptation to soil ion content, water retention, and other ecologically and economically important variables. One gene in particular, calcium-exchanger 7, appears to be an excellent candidate gene for adaptation to low Ca:Mg ratio in A. lyrata.

  8. Analysis of promoter polymorphism in monoamine oxidase A (MAOA) gene in completed suicide on Slovenian population.

    PubMed

    Uršič, Katarina; Zupanc, Tomaž; Paska, Alja Videtič

    2018-04-23

    Suicide is a well-defined public health problem and is a complex phenomenon influenced by a number of different risk factors, including genetic ones. Numerous studies have examined serotonin system genes. Monoamine oxidase A (MAO-A) is an outer mitochondrial membrane enzyme which is involved in the metabolic pathway of serotonin degradation. An upstream variable number of tandem repeats (uVNTR) polymorphism in the promoter region of the MAOA gene affects transcriptional activity. In the present study we genotyped the MAOA-uVNTR polymorphism in 266 suicide victims and 191 control subjects from the Slovenian population, which ranks among the European and world populations with the highest suicide rate. Genotyping was performed with polymerase chain reaction and agarose gel electrophoresis. Using a separate statistical analysis for female and male subjects we determined the differences in genotype distributions of the MAOA-uVNTR polymorphism between the studied groups. Statistical analysis showed a trend towards an association between the 3R allele and suicide, and associated the 3R allele with a non-violent suicide method in stratified data (20 suicide victims). This is the first study associating the highly suicidal Slovenian population with the MAOA-uVNTR polymorphism. Copyright © 2018 Elsevier B.V. All rights reserved.

  9. Sequencing chromosomal abnormalities reveals neurodevelopmental loci that confer risk across diagnostic boundaries

    PubMed Central

    Talkowski, Michael E.; Rosenfeld, Jill A.; Blumenthal, Ian; Pillalamarri, Vamsee; Chiang, Colby; Heilbut, Adrian; Ernst, Carl; Hanscom, Carrie; Rossin, Elizabeth; Lindgren, Amelia; Pereira, Shahrin; Ruderfer, Douglas; Kirby, Andrew; Ripke, Stephan; Harris, David; Lee, Ji-Hyun; Ha, Kyungsoo; Kim, Hyung-Goo; Solomon, Benjamin D.; Gropman, Andrea L.; Lucente, Diane; Sims, Katherine; Ohsumi, Toshiro K.; Borowsky, Mark L.; Loranger, Stephanie; Quade, Bradley; Lage, Kasper; Miles, Judith; Wu, Bai-Lin; Shen, Yiping; Neale, Benjamin; Shaffer, Lisa G.; Daly, Mark J.; Morton, Cynthia C.; Gusella, James F.

    2012-01-01

    SUMMARY: Balanced chromosomal abnormalities (BCAs) represent a reservoir of single gene disruptions in neurodevelopmental disorders (NDD). We sequenced BCAs in autism and related NDDs, revealing disruption of 33 loci in four general categories: 1) genes associated with abnormal neurodevelopment (e.g., AUTS2, FOXP1, CDKL5), 2) single gene contributors to microdeletion syndromes (MBD5, SATB2, EHMT1, SNURF-SNRPN), 3) novel risk loci (e.g., CHD8, KIRREL3, ZNF507), and 4) genes associated with later onset psychiatric disorders (e.g., TCF4, ZNF804A, PDE10A, GRIN2B, ANK3). We also discovered profoundly increased burden of copy number variants among 19,556 neurodevelopmental cases compared to 13,991 controls (p = 2.07×10^−47) and enrichment of polygenic risk alleles from autism and schizophrenia genome-wide association studies (p = 0.0018 and 0.0009, respectively). Our findings suggest a polygenic risk model of autism incorporating loci of strong effect and indicate that some neurodevelopmental genes are sensitive to perturbation by multiple mutational mechanisms, leading to variable phenotypic outcomes that manifest at different life stages. PMID:22521361

  10. A New Chicken Genome Assembly Provides Insight into Avian Genome Structure.

    PubMed

    Warren, Wesley C; Hillier, LaDeana W; Tomlinson, Chad; Minx, Patrick; Kremitzki, Milinn; Graves, Tina; Markovic, Chris; Bouk, Nathan; Pruitt, Kim D; Thibaud-Nissen, Francoise; Schneider, Valerie; Mansour, Tamer A; Brown, C Titus; Zimin, Aleksey; Hawken, Rachel; Abrahamsen, Mitch; Pyrkosz, Alexis B; Morisson, Mireille; Fillon, Valerie; Vignal, Alain; Chow, William; Howe, Kerstin; Fulton, Janet E; Miller, Marcia M; Lovell, Peter; Mello, Claudio V; Wirthlin, Morgan; Mason, Andrew S; Kuo, Richard; Burt, David W; Dodgson, Jerry B; Cheng, Hans H

    2017-01-05

    The importance of the Gallus gallus (chicken) as a model organism and agricultural animal merits a continuation of sequence assembly improvement efforts. We present a new version of the chicken genome assembly (Gallus_gallus-5.0; GCA_000002315.3), built from combined long single molecule sequencing technology, finished BACs, and improved physical maps. In overall assembled bases, we see a gain of 183 Mb, including 16.4 Mb in placed chromosomes with a corresponding gain in the percentage of intact repeat elements characterized. Of the 1.21 Gb genome, we include three previously missing autosomes, GGA30, 31, and 33, and improve sequence contig length 10-fold over the previous Gallus_gallus-4.0. Despite the significant base representation improvements made, 138 Mb of sequence is not yet located to chromosomes. When annotated for gene content, Gallus_gallus-5.0 shows an increase of 4679 annotated genes (2768 noncoding and 1911 protein-coding) over those in Gallus_gallus-4.0. We also revisited the question of what genes are missing in the avian lineage, as assessed by the highest quality avian genome assembly to date, and found that a large fraction of the original set of missing genes are still absent in sequenced bird species. Finally, our new data support a detailed map of MHC-B, encompassing two segments: one with a highly stable gene copy number and another in which the gene copy number is highly variable. The chicken model has been a critical resource for many other fields of study, and this new reference assembly will substantially further these efforts. Copyright © 2017 Warren et al.

  11. Gene features selection for three-class disease classification via multiple orthogonal partial least square discriminant analysis and S-plot using microarray data.

    PubMed

    Yang, Mingxing; Li, Xiumin; Li, Zhibin; Ou, Zhimin; Liu, Ming; Liu, Suhuan; Li, Xuejun; Yang, Shuyu

    2013-01-01

    DNA microarray analysis is characterized by obtaining a large number of gene variables from a small number of observations. Cluster analysis is widely used to analyze DNA microarray data to make classification and diagnosis of disease. Because there are so many irrelevant and insignificant genes in a dataset, a feature selection approach must be employed in data analysis. The performance of cluster analysis of this high-throughput data depends on whether the feature selection approach chooses the most relevant genes associated with disease classes. Here we proposed a new method using multiple Orthogonal Partial Least Squares-Discriminant Analysis (mOPLS-DA) models and S-plots to select the most relevant genes to conduct three-class disease classification and prediction. We tested our method using Golub's leukemia microarray data. For three classes with subtypes, we proposed hierarchical orthogonal partial least squares-discriminant analysis (OPLS-DA) models and S-plots to select features for two main classes and their subtypes. For three classes in parallel, we employed three OPLS-DA models and S-plots to choose marker genes for each class. The power of feature selection to classify and predict three-class disease was evaluated using cluster analysis. Further, the general performance of our method was tested using four public datasets and compared with those of four other feature selection methods. The results revealed that our method effectively selected the most relevant features for disease classification and prediction, and its performance was better than that of the other methods.

  12. Relationship between Organic Carbon and Opportunistic Pathogens in Simulated Glass Water Heaters.

    PubMed

    Williams, Krista; Pruden, Amy; Falkinham, Joseph O; Edwards, Marc

    2015-06-09

    Controlling organic carbon levels in municipal water has been hypothesized to limit downstream growth of bacteria and opportunistic pathogens in premise plumbing (OPPPs). Here, the relationships between influent organic carbon (0-15,000 µg ozonated fulvic acid/L) and the number of total bacteria [16S rRNA genes and heterotrophic plate counts (HPCs)] and a wide range of OPPPs (gene copy numbers of Acanthamoeba polyphaga, Vermamoeba vermiformis, Legionella pneumophila, and Mycobacterium avium) were examined in the bulk water of 120-mL simulated glass water heaters (SGWHs). The SGWHs were operated at 32-37 °C, which is representative of conditions encountered at the bottom of electric water heaters, with water changes of 80% three times per week to simulate low use. This design presented advantages of controlled and replicated (triplicate) conditions and avoided other potential limitations to OPPP growth in order to isolate the variable of organic carbon. Over seventeen months, strong correlations were observed between total organic carbon (TOC) and both 16S rRNA gene copy numbers and HPC counts (avg. R^2 > 0.89). Although M. avium gene copies were occasionally correlated with TOC (avg. R^2 = 0.82 to 0.97, for 2 out of 4 time points) and over a limited TOC range (0-1000 µg/L), no other correlations were identified between other OPPPs and added TOC. These results suggest that reducing organic carbon in distributed water is not adequate as a sole strategy for controlling OPPPs, although it may have promise in conjunction with other approaches.

  13. Rare Copy Number Variants Are a Common Cause of Short Stature

    PubMed Central

    Zahnleiter, Diana; Uebe, Steffen; Ekici, Arif B.; Hoyer, Juliane; Wiesener, Antje; Wieczorek, Dagmar; Kunstmann, Erdmute; Reis, André; Doerr, Helmuth-Guenther; Rauch, Anita; Thiel, Christian T.

    2013-01-01

    Human growth has an estimated heritability of about 80%–90%. Nevertheless, the underlying cause of shortness of stature remains unknown in the majority of individuals. Genome-wide association studies (GWAS) showed that both common single nucleotide polymorphisms and copy number variants (CNVs) contribute to height variation under a polygenic model, although explaining only a small fraction of overall genetic variability in the general population. Under the hypothesis that severe forms of growth retardation might also be caused by major gene effects, we searched for rare CNVs in 200 families, 92 sporadic and 108 familial, with idiopathic short stature compared to 820 control individuals. Although similar in number, patients had overall significantly larger CNVs (p-value < 1×10^−7). In a gene-based analysis of all non-polymorphic CNVs > 50 kb for gene function, tissue expression, and murine knock-out phenotypes, we identified 10 duplications and 10 deletions ranging in size from 109 kb to 14 Mb, of which 7 were de novo (p < 0.03) and 13 inherited from the likewise affected parent but absent in controls. Patients with these likely disease causing 20 CNVs were smaller than the remaining group (p < 0.01). Eleven (55%) of these CNVs either overlapped with known microaberration syndromes associated with short stature or contained GWAS loci for height. Haploinsufficiency (HI) score and further expression profiling suggested dosage sensitivity of major growth-related genes at these loci. Overall 10% of patients carried a disease-causing CNV indicating that, like in neurodevelopmental disorders, rare CNVs are a frequent cause of severe growth retardation. PMID:23516380

  14. Relationship between Organic Carbon and Opportunistic Pathogens in Simulated Glass Water Heaters

    PubMed Central

    Williams, Krista; Pruden, Amy; Falkinham, Joseph O.; Edwards, Marc

    2015-01-01

    Controlling organic carbon levels in municipal water has been hypothesized to limit downstream growth of bacteria and opportunistic pathogens in premise plumbing (OPPPs). Here, the relationships between influent organic carbon (0–15,000 µg ozonated fulvic acid/L) and the number of total bacteria [16S rRNA genes and heterotrophic plate counts (HPCs)] and a wide range of OPPPs (gene copy numbers of Acanthamoeba polyphaga, Vermamoeba vermiformis, Legionella pneumophila, and Mycobacterium avium) were examined in the bulk water of 120-mL simulated glass water heaters (SGWHs). The SGWHs were operated at 32–37 °C, which is representative of conditions encountered at the bottom of electric water heaters, with water changes of 80% three times per week to simulate low use. This design presented advantages of controlled and replicated (triplicate) conditions and avoided other potential limitations to OPPP growth in order to isolate the variable of organic carbon. Over seventeen months, strong correlations were observed between total organic carbon (TOC) and both 16S rRNA gene copy numbers and HPC counts (avg. R^2 > 0.89). Although M. avium gene copies were occasionally correlated with TOC (avg. R^2 = 0.82 to 0.97, for 2 out of 4 time points) and over a limited TOC range (0–1000 µg/L), no other correlations were identified between other OPPPs and added TOC. These results suggest that reducing organic carbon in distributed water is not adequate as a sole strategy for controlling OPPPs, although it may have promise in conjunction with other approaches. PMID:26066310

  15. Haplotype Phasing and Inheritance of Copy Number Variants in Nuclear Families

    PubMed Central

    Palta, Priit; Kaplinski, Lauris; Nagirnaja, Liina; Veidenberg, Andres; Möls, Märt; Nelis, Mari; Esko, Tõnu; Metspalu, Andres; Laan, Maris; Remm, Maido

    2015-01-01

    DNA copy number variants (CNVs) that alter the copy number of a particular DNA segment in the genome play an important role in human phenotypic variability and disease susceptibility. A number of CNVs overlapping with genes have been shown to confer risk for a variety of human diseases, thus highlighting the relevance of addressing the variability of CNVs at a higher resolution. So far, it has not been possible to deterministically infer the allelic composition of different haplotypes present within the CNV regions. We have developed a novel computational method, called PiCNV, which resolves the haplotype sequence composition within CNV regions in nuclear families based on SNP genotyping microarray data. The algorithm makes it possible to i) phase normal and CNV-carrying haplotypes in the copy number variable regions, ii) resolve the allelic copies of rearranged DNA sequence within the haplotypes and iii) infer the heritability of identified haplotypes in trios or larger nuclear families. To our knowledge this is the first program available that can deterministically phase null, mono-, di-, tri- and tetraploid genotypes in CNV loci. We applied our method to study the composition and inheritance of haplotypes in CNV regions of 30 HapMap Yoruban trios and 34 Estonian families. For 93.6% of the CNV loci, PiCNV unambiguously phased normal and CNV-carrying haplotypes and followed their transmission in the corresponding families. Furthermore, allelic composition analysis identified the co-occurrence of alternative allelic copies within 66.7% of haplotypes carrying copy number gains. We also observed less frequent transmission of CNV-carrying haplotypes from parents to children compared to normal haplotypes and identified the emergence of several de novo deletions and duplications in the offspring. PMID:25853576

  16. Haplotype phasing and inheritance of copy number variants in nuclear families.

    PubMed

    Palta, Priit; Kaplinski, Lauris; Nagirnaja, Liina; Veidenberg, Andres; Möls, Märt; Nelis, Mari; Esko, Tõnu; Metspalu, Andres; Laan, Maris; Remm, Maido

    2015-01-01

    DNA copy number variants (CNVs) that alter the copy number of a particular DNA segment in the genome play an important role in human phenotypic variability and disease susceptibility. A number of CNVs overlapping with genes have been shown to confer risk for a variety of human diseases, thus highlighting the relevance of addressing the variability of CNVs at a higher resolution. So far, it has not been possible to deterministically infer the allelic composition of different haplotypes present within the CNV regions. We have developed a novel computational method, called PiCNV, which resolves the haplotype sequence composition within CNV regions in nuclear families based on SNP genotyping microarray data. The algorithm makes it possible to i) phase normal and CNV-carrying haplotypes in the copy number variable regions, ii) resolve the allelic copies of rearranged DNA sequence within the haplotypes and iii) infer the heritability of identified haplotypes in trios or larger nuclear families. To our knowledge this is the first program available that can deterministically phase null, mono-, di-, tri- and tetraploid genotypes in CNV loci. We applied our method to study the composition and inheritance of haplotypes in CNV regions of 30 HapMap Yoruban trios and 34 Estonian families. For 93.6% of the CNV loci, PiCNV unambiguously phased normal and CNV-carrying haplotypes and followed their transmission in the corresponding families. Furthermore, allelic composition analysis identified the co-occurrence of alternative allelic copies within 66.7% of haplotypes carrying copy number gains. We also observed less frequent transmission of CNV-carrying haplotypes from parents to children compared to normal haplotypes and identified the emergence of several de novo deletions and duplications in the offspring.

  17. Characterization and probiotic potential of Lactobacillus plantarum strains isolated from cheeses.

    PubMed

    Zago, Miriam; Fornasari, Maria Emanuela; Carminati, Domenico; Burns, Patricia; Suàrez, Viviana; Vinderola, Gabriel; Reinheimer, Jorge; Giraffa, Giorgio

    2011-08-01

    Ninety-eight Lactobacillus plantarum strains isolated from Italian and Argentinean cheeses were evaluated for probiotic potential. After a preliminary subtractive screening based on the presence of msa and bsh genes, 27 strains were characterized. In general, the selected strains showed high resistance to lysozyme, good adaptation to simulated gastric juice, and a moderate to low bile tolerance. The capacity to agglutinate yeast cells in a mannose-specific manner, as well as the cell surface hydrophobicity was found to be variable among strains. Very high β-galactosidase activity was shown by a considerable number of the tested strains, whereas variable prebiotic utilization ability was observed. Only tetracycline resistance was observed in two highly resistant strains which harbored the tetM gene, whereas none of the strains showed β-glucuronidase activity or was capable of inhibiting pathogens. Three strains (Lp790, Lp813, and Lp998) were tested by in vivo trials. A considerable heterogeneity was found among a number of L. plantarum strains screened in this study, leading to the design of multiple cultures to cooperatively link strains showing the widest range of useful traits. Among the selected strains, Lp790, Lp813, and Lp998 showed the best probiotic potential and would be promising candidates for inclusion as starter cultures for the manufacture of probiotic fermented foods. Copyright © 2011 Elsevier Ltd. All rights reserved.

  18. Effect of genotype and environment on branching in weedy green millet (Setaria viridis) and domesticated foxtail millet (Setaria italica) (Poaceae).

    PubMed

    Doust, Andrew N; Kellogg, Elizabeth A

    2006-04-01

    Many domesticated crops are derived from species whose life history includes weedy characteristics, such as the ability to vary branching patterns in response to environmental conditions. However, domesticated crop plants are characterized by less variable plant architecture, as well as by a general reduction in vegetative branching compared to their progenitor species. Here we examine weedy green millet and its domesticate foxtail millet that differ in the number of tillers (basal branches) and axillary branches along each tiller. Branch number in F(2:3) progeny of a cross between the two species varies with genotype, planting density, and other environmental variables, with significant genotype-environment interactions (GEI). This is shown by a complex pattern of reaction norms and by variation in the pattern of significant quantitative trait loci (QTL) amongst trials. Individual and joint analyses of high and low density trials indicate that most QTL have significant GEI. Dominance and epistasis also explain some variation in branching. Likely candidate genes underlying the QTL (based on map position and phenotypic effect) include teosinte branched1 and barren stalk1. Phytochrome B, which has been found to affect response to shading in other plants, explains little or no variation. Much variation in branching is explained by QTL that do not have obvious candidate genes from maize or rice.

  19. Unscrambling butterfly oogenesis

    PubMed Central

    2013-01-01

    Background Butterflies are popular model organisms to study physiological mechanisms underlying variability in oogenesis and egg provisioning in response to environmental conditions. Nothing is known, however, about the developmental mechanisms governing butterfly oogenesis, how polarity in the oocyte is established, or which particular maternal effect genes regulate early embryogenesis. To gain insights into these developmental mechanisms and to identify the conserved and divergent aspects of butterfly oogenesis, we analysed a de novo ovarian transcriptome of the Speckled Wood butterfly Pararge aegeria (L.), and compared the results with known model organisms such as Drosophila melanogaster and Bombyx mori. Results A total of 17306 contigs were annotated, with 30% possibly novel or highly divergent sequences observed. Pararge aegeria females expressed 74.5% of the genes that are known to be essential for D. melanogaster oogenesis. We discuss in great detail the genes involved in all aspects of oogenesis, including vitellogenesis and choriogenesis, plus those implicated in hormonal control of oogenesis and transgenerational hormonal effects. Compared to other insects, a number of significant differences were observed in the genes involved in stem cell maintenance and differentiation in the germarium, establishment of oocyte polarity, and in several aspects of maternal regulation of zygotic development. Conclusions This study provides valuable resources to investigate a number of divergent aspects of butterfly oogenesis requiring further research. In order to fully unscramble butterfly oogenesis, we now also have the resources to investigate expression patterns of oogenesis genes under a range of environmental conditions, and to establish their function. PMID:23622113

  20. Comparison of normalization methods for differential gene expression analysis in RNA-Seq experiments

    PubMed Central

    Maza, Elie; Frasse, Pierre; Senin, Pavel; Bouzayen, Mondher; Zouine, Mohamed

    2013-01-01

    In recent years, RNA-Seq technologies have become a powerful tool for transcriptome studies. However, computational methods dedicated to the analysis of high-throughput sequencing data are yet to be standardized. In particular, it is known that the choice of a normalization procedure leads to great variability in the results of differential gene expression analysis. The present study compares the most widespread normalization procedures and proposes a novel one aiming at removing an inherent bias of studied transcriptomes related to their relative size. Comparisons of the normalization procedures are performed on real and simulated data sets. Analyses of real RNA-Seq data sets, performed with all the different normalization methods, show that only 50% of significantly differentially expressed genes are common. This result highlights the influence of the normalization step on the differential expression analysis. Analyses of real and simulated data sets give similar results, showing 3 different groups of procedures having the same behavior. The group including the novel method named “Median Ratio Normalization” (MRN) gives the lowest number of false discoveries. Within this group the MRN method is less sensitive to the modification of parameters related to the relative size of transcriptomes such as the number of down- and upregulated genes and the gene expression levels. The newly proposed MRN method efficiently deals with intrinsic bias resulting from the relative size of studied transcriptomes. Validation with real and simulated data sets confirmed that MRN is more consistent and robust than existing methods. PMID:26442135

  1. Conditional clustering of temporal expression profiles

    PubMed Central

    Wang, Ling; Montano, Monty; Rarick, Matt; Sebastiani, Paola

    2008-01-01

    Background Many microarray experiments produce temporal profiles in different biological conditions but common cluster techniques are not able to analyze the data conditional on the biological conditions. Results This article presents a novel technique to cluster data from time course microarray experiments performed across several experimental conditions. Our algorithm uses polynomial models to describe the gene expression patterns over time, a full Bayesian approach with proper conjugate priors to make the algorithm invariant to linear transformations, and an iterative procedure to identify genes that have a common temporal expression profile across two or more experimental conditions, and genes that have a unique temporal profile in a specific condition. Conclusion We use simulated data to evaluate the effectiveness of this new algorithm in finding the correct number of clusters and in identifying genes with common and unique profiles. We also use the algorithm to characterize the response of human T cells to stimulations of antigen-receptor signaling gene expression temporal profiles measured in six different biological conditions and we identify common and unique genes. These studies suggest that the methodology proposed here is useful in identifying and distinguishing uniquely stimulated genes from commonly stimulated genes in response to variable stimuli. Software for using this clustering method is available from the project home page. PMID:18334028

  2. B cell Variable genes have evolved their codon usage to focus the targeted patterns of somatic mutation on the complementarity determining regions

    PubMed Central

    Saini, Jasmine; Hershberg, Uri

    2015-01-01

    The exceptional ability of B cells to diversify through somatic mutation and improve affinity of the repertoire towards the antigens is the cornerstone of adaptive immunity. Somatic mutation is not evenly distributed and exhibits certain micro-sequence specificities. We show here that the combination of somatic mutation targeting and the codon usage in human B cell receptor (BCR) Variable (V) genes creates expected patterns of mutation and post mutation changes that are focused on their complementarity determining regions (CDR). T cell V genes are also skewed in targeting mutations but to a lesser extent and are lacking the codon usage bias observed in BCRs. This suggests that the observed skew in T cell receptors is due to their amino acid usage, which is similar to that of BCRs. The mutation targeting and the codon bias allow B cell CDRs to diversify by specifically accumulating nonconservative changes. We counted the distribution of mutations to CDR in 4 different human datasets. In all four cases we found that the number of actual mutations in the CDR correlated significantly with the V gene mutation biases to the CDR predicted by our models. Finally, it appears that the mutation bias in V genes indeed relates to their long-term survival in actual human repertoires. We observed that resting repertoires of B cells overexpressed V genes that were especially biased towards focused mutation and change in the CDR. This bias in V gene usage was somewhat relaxed at the height of the immune response to a vaccine, presumably because of the need for a wider diversity in a primary response. However, older patients did not retain this flexibility and were biased towards using only highly skewed V genes at all stages of their response. PMID:25660968

  3. B cell variable genes have evolved their codon usage to focus the targeted patterns of somatic mutation on the complementarity determining regions.

    PubMed

    Saini, Jasmine; Hershberg, Uri

    2015-05-01

    The exceptional ability of B cells to diversify through somatic mutation and improve affinity of the repertoire toward the antigens is the cornerstone of adaptive immunity. Somatic mutation is not evenly distributed and exhibits certain micro-sequence specificities. We show here that the combination of somatic mutation targeting and the codon usage in human B cell receptor (BCR) Variable (V) genes creates expected patterns of mutation and post mutation changes that are focused on their complementarity determining regions (CDR). T cell V genes are also skewed in targeting mutations but to a lesser extent and are lacking the codon usage bias observed in BCRs. This suggests that the observed skew in T cell receptors is due to their amino acid usage, which is similar to that of BCRs. The mutation targeting and the codon bias allow B cell CDRs to diversify by specifically accumulating nonconservative changes. We counted the distribution of mutations to CDR in 4 different human datasets. In all four cases we found that the number of actual mutations in the CDR correlated significantly with the V gene mutation biases to the CDR predicted by our models. Finally, it appears that the mutation bias in V genes indeed relates to their long-term survival in actual human repertoires. We observed that resting repertoires of B cells overexpressed V genes that were especially biased toward focused mutation and change in the CDR. This bias in V gene usage was somewhat relaxed at the height of the immune response to a vaccine, presumably because of the need for a wider diversity in a primary response. However, older patients did not retain this flexibility and were biased toward using only highly skewed V genes at all stages of their response. Copyright © 2015 Elsevier Ltd. All rights reserved.

  4. Whole-genome sequencing reveals mutational landscape underlying phenotypic differences between two widespread Chinese cattle breeds.

    PubMed

    Xu, Yao; Jiang, Yu; Shi, Tao; Cai, Hanfang; Lan, Xianyong; Zhao, Xin; Plath, Martin; Chen, Hong

    2017-01-01

    Whole-genome sequencing provides a powerful tool to obtain more genetic variability that could produce a range of benefits for the cattle breeding industry. Nanyang (Bos indicus) and Qinchuan (Bos taurus) are two important Chinese indigenous cattle breeds with distinct phenotypes. To identify the genetic characteristics responsible for variation in phenotypes between the two breeds, in the present study we sequenced, for the first time, the genomes of four Nanyang and four Qinchuan cattle at 10- to 12-fold depth, covering on average 97.86% and 98.98% of the genome, respectively. Comparison with the Bos_taurus_UMD_3.1 reference assembly yielded 9,010,096 SNPs for Nanyang, and 6,965,062 for Qinchuan cattle, 51% and 29% of which were novel SNPs, respectively. A total of 154,934 and 115,032 small indels (1 to 3 bp) were found in the Nanyang and Qinchuan genomes, respectively. The SNP and indel distribution revealed that Nanyang showed higher genetic diversity than Qinchuan cattle. Furthermore, a total of 2,907 putative cases of copy number variation (CNV) were identified by aligning the Nanyang to the Qinchuan genome, 783 of which (27%) encompassed the coding regions of 495 functional genes. The gene ontology (GO) analysis revealed that many CNV genes were enriched in the immune system and environment adaptability. Among several CNV genes related to lipid transport and fat metabolism, the leptin receptor gene (LEPR) overlapping with CNV_1815 showed remarkably higher copy number in Qinchuan than Nanyang (log2 (ratio) = -2.34988; P value = 1.53E-102). Further qPCR and association analyses showed that the copy number of the LEPR gene was positively correlated with transcriptional expression and phenotypic traits, suggesting the LEPR CNV may contribute to the higher fat deposition in muscles of Qinchuan cattle. Our findings provide evidence that the distinct phenotypes of Nanyang and Qinchuan breeds may be due to the different genetic variations including SNPs, indels and CNV.
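
    As a reading aid for the log2 (ratio) value quoted above, the following minimal Python sketch converts it back to a linear copy-number ratio. It assumes the ratio is Nanyang relative to Qinchuan, which would be consistent with the stated higher copy number in Qinchuan; the abstract does not spell out the orientation, and the variable names are illustrative only.

```python
# Reported in the abstract for CNV_1815 (LEPR).
log2_ratio = -2.34988

# Linear ratio implied by the log2 value. ASSUMPTION: Nanyang / Qinchuan.
linear_ratio = 2 ** log2_ratio

# Fold difference in the other direction (Qinchuan relative to Nanyang).
fold_difference = 2 ** abs(log2_ratio)

print(f"linear ratio    = {linear_ratio:.3f}")
print(f"fold difference = {fold_difference:.2f}")
```

    Under that assumption the value corresponds to roughly a fivefold difference in copy number between the two breeds.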

  5. Whole-genome sequencing reveals mutational landscape underlying phenotypic differences between two widespread Chinese cattle breeds

    PubMed Central

    Jiang, Yu; Shi, Tao; Cai, Hanfang; Lan, Xianyong; Zhao, Xin; Plath, Martin; Chen, Hong

    2017-01-01

    Whole-genome sequencing provides a powerful tool to obtain more genetic variability that could produce a range of benefits for the cattle breeding industry. Nanyang (Bos indicus) and Qinchuan (Bos taurus) are two important Chinese indigenous cattle breeds with distinct phenotypes. To identify the genetic characteristics responsible for variation in phenotypes between the two breeds, in the present study we sequenced, for the first time, the genomes of four Nanyang and four Qinchuan cattle at 10- to 12-fold depth, covering on average 97.86% and 98.98% of the genome, respectively. Comparison with the Bos_taurus_UMD_3.1 reference assembly yielded 9,010,096 SNPs for Nanyang, and 6,965,062 for Qinchuan cattle, 51% and 29% of which were novel SNPs, respectively. A total of 154,934 and 115,032 small indels (1 to 3 bp) were found in the Nanyang and Qinchuan genomes, respectively. The SNP and indel distribution revealed that Nanyang showed higher genetic diversity than Qinchuan cattle. Furthermore, a total of 2,907 putative cases of copy number variation (CNV) were identified by aligning the Nanyang to the Qinchuan genome, 783 of which (27%) encompassed the coding regions of 495 functional genes. The gene ontology (GO) analysis revealed that many CNV genes were enriched in the immune system and environment adaptability. Among several CNV genes related to lipid transport and fat metabolism, the leptin receptor gene (LEPR) overlapping with CNV_1815 showed remarkably higher copy number in Qinchuan than Nanyang (log2 (ratio) = -2.34988; P value = 1.53E-102). Further qPCR and association analyses showed that the copy number of the LEPR gene was positively correlated with transcriptional expression and phenotypic traits, suggesting the LEPR CNV may contribute to the higher fat deposition in muscles of Qinchuan cattle. Our findings provide evidence that the distinct phenotypes of Nanyang and Qinchuan breeds may be due to the different genetic variations including SNPs, indels and CNV. PMID:28841720

  6. Clinical and genetic diversity of SMN1-negative proximal spinal muscular atrophies

    PubMed Central

    Jordanova, Albena

    2014-01-01

    Hereditary spinal muscular atrophy is a motor neuron disorder characterized by muscle weakness and atrophy due to degeneration of the anterior horn cells of the spinal cord. Initially, the disease was considered purely as an autosomal recessive condition caused by loss-of-function SMN1 mutations on 5q13. Recent developments in next generation sequencing technologies, however, have unveiled a growing number of clinical conditions designated as non-5q forms of spinal muscular atrophy. At present, 16 different genes and one unresolved locus are associated with proximal non-5q forms, having high phenotypic variability and diverse inheritance patterns. This review provides an overview of the current knowledge regarding the phenotypes, causative genes, and disease mechanisms associated with proximal SMN1-negative spinal muscular atrophies. We describe the molecular and cellular functions enriched among causative genes, and discuss the challenges in the post-genomics era of spinal muscular atrophy research. PMID:24970098

  7. Diurnal and developmental differences in gene expression between adult dispersing and flightless morphs of the wing polymorphic cricket, Gryllus firmus: Implications for life-history evolution.

    PubMed

    Zera, Anthony J; Vellichirammal, Neetha Nanoth; Brisson, Jennifer A

    2018-04-12

    The functional basis of life history adaptation is a key topic of research in life history evolution. Studies of wing-polymorphism in the cricket Gryllus firmus have played a prominent role in this field. However, prior in-depth investigations of morph specialization have primarily focused on a single hormone, juvenile hormone, and a single aspect of intermediary metabolism, the fatty-acid biosynthetic component of lipid metabolism. Moreover, the role of diurnal variation in life history adaptation in G. firmus has been understudied, as is the case for organisms in general. Here, we identify genes whose expression differs consistently between the morphs independent of time-of-day during early adulthood, as well as genes that exhibit a strong pattern of morph-specific diurnal expression. We find strong, consistent, morph-specific differences in the expression of genes involved in endocrine regulation, carbohydrate and lipid metabolism, and immunity - in particular, in the expression of an insulin-like-peptide precursor gene and genes involved in triglyceride production. We also find that the flight-capable morph exhibited a substantially greater number of genes exhibiting diurnal change in gene expression compared with the flightless morph, correlated with the greater circadian change in the hemolymph juvenile hormone titer in the dispersing morph. In fact, diurnal differences in expression within the dispersing morph at different times of the day were significantly greater in magnitude than differences between dispersing and flightless morphs at the same time-of-day. These results provide important baseline information regarding the potential role of variable gene expression on life history specialization in morphs of G. firmus, and the first information on genetically-variable, diurnal change in gene expression, associated with a key life history polymorphism. These results also suggest the existence of prominent morph-specific circadian differences in gene expression in G. firmus, possibly caused by the morph-specific circadian rhythm in the juvenile hormone titer. Copyright © 2018 Elsevier Ltd. All rights reserved.

  8. Ruminant Rhombencephalitis-Associated Listeria monocytogenes Alleles Linked to a Multilocus Variable-Number Tandem-Repeat Analysis Complex

    PubMed Central

    Balandyté, Lina; Brodard, Isabelle; Frey, Joachim; Oevermann, Anna; Abril, Carlos

    2011-01-01

    Listeria monocytogenes is among the most important food-borne pathogens and is well adapted to persist in the environment. To gain insight into the genetic relatedness and potential virulence of L. monocytogenes strains causing central nervous system (CNS) infections, we used multilocus variable-number tandem-repeat analysis (MLVA) to subtype 183 L. monocytogenes isolates, most from ruminant rhombencephalitis and some from human patients, food, and the environment. Allelic-profile-based comparisons grouped L. monocytogenes strains mainly into three clonal complexes and linked single-locus variants (SLVs). Clonal complex A essentially consisted of isolates from human and ruminant brain samples. All but one rhombencephalitis isolate from cattle were located in clonal complex A. In contrast, food and environmental isolates mainly clustered into clonal complex C, and none was classified as clonal complex A. Isolates of the two main clonal complexes (A and C) obtained by MLVA were analyzed by PCR for the presence of 11 virulence-associated genes (prfA, actA, inlA, inlB, inlC, inlD, inlE, inlF, inlG, inlJ, and inlC2H). Virulence gene analysis revealed significant differences in the actA, inlF, inlG, and inlJ allelic profiles between clinical isolates (complex A) and nonclinical isolates (complex C). The association of particular alleles of actA, inlF, and newly described alleles of inlJ with isolates from CNS infections (particularly rhombencephalitis) suggests that these virulence genes participate in neurovirulence of L. monocytogenes. The overall absence of inlG in clinical complex A and its presence in complex C isolates suggests that the InlG protein is more relevant for the survival of L. monocytogenes in the environment. PMID:21984240

  9. A large population-based association study between HLA and KIR genotypes and measles vaccine antibody responses.

    PubMed

    Ovsyannikova, Inna G; Schaid, Daniel J; Larrabee, Beth R; Haralambieva, Iana H; Kennedy, Richard B; Poland, Gregory A

    2017-01-01

    Human antibody response to measles vaccine is highly variable in the population. Host genes contribute to inter-individual antibody response variation. The killer cell immunoglobulin-like receptors (KIR) are recognized to interact with HLA molecules and possibly influence humoral immune response to viral antigens. To expand on and improve our previous work with HLA genes, and to explore the genetic contribution of KIR genes to the inter-individual variability in measles vaccine-induced antibody responses, we performed a large population-based study in 2,506 healthy immunized subjects (ages 11 to 41 years) to identify HLA and KIR associations with measles vaccine-induced neutralizing antibodies. After correcting for the large number of statistical tests of allele effects on measles-specific neutralizing antibody titers, no statistically significant associations were found for either HLA or KIR loci. However, suggestive associations worthy of follow-up in other cohorts include B*57:01, DQB1*06:02, and DRB1*15:05 alleles. Specifically, the B*57:01 allele (1,040 mIU/mL; p = 0.0002) was suggestive of an association with lower measles antibody titer. In contrast, the DQB1*06:02 (1,349 mIU/mL; p = 0.0004) and DRB1*15:05 (2,547 mIU/mL; p = 0.0004) alleles were suggestive of an association with higher measles antibodies. Notably, the associations with KIR genotypes were strongly nonsignificant, suggesting that KIR loci in terms of copy number and haplotypes are not likely to play a major role in antibody response to measles vaccination. These findings refine our knowledge of the role of HLA and KIR alleles in measles vaccine-induced immunity.

  10. Variable number of tandem repeat polymorphisms of DRD4: re-evaluation of selection hypothesis and analysis of association with schizophrenia

    PubMed Central

    Hattori, Eiji; Nakajima, Mizuho; Yamada, Kazuo; Iwayama, Yoshimi; Toyota, Tomoko; Saitou, Naruya; Yoshikawa, Takeo

    2009-01-01

    Associations have been reported between the variable number of tandem repeat (VNTR) polymorphisms in exon 3 of the dopamine D4 receptor gene and multiple psychiatric illnesses/traits. We examined the distribution of VNTR alleles of different length in a Japanese cohort and found that, as reported earlier, the allele ‘7R’ was much rarer (0.5%) in Japanese than in Caucasian populations (∼20%). This presents a challenge to an earlier proposed hypothesis that positive selection favoring the allele 7R has contributed to its high frequency. To further address the issue of selection, we carried out sequencing of the VNTR region not only from human but also from chimpanzee samples, and made inference on the ancestral repeat motif and haplotype by use of a phylogenetic analysis program. The most common 4R variant was considered to be the ancestral haplotype as earlier proposed. However, in a gene tree of VNTR constructed on the basis of this inferred ancestral haplotype, the allele 7R had five descendent haplotypes in relatively long lineage, where genetic drift can have major influence. We also tested this length polymorphism for association with schizophrenia, studying two Japanese sample sets (one with 570 cases and 570 controls, and the other with 124 pedigrees). No evidence of association between the allele 7R and schizophrenia was found in either of the two data sets. Collectively, this study suggests that the VNTR variation does not have an effect large enough to cause either selection or a detectable association with schizophrenia in a study of samples of moderate size. PMID:19092778

  11. Identification and Characterisation of a Hyper-Variable Apoplastic Effector Gene Family of the Potato Cyst Nematodes

    PubMed Central

    Eves-van den Akker, Sebastian; Lilley, Catherine J.; Jones, John T.; Urwin, Peter E.

    2014-01-01

    Sedentary endoparasitic nematodes are obligate biotrophs that modify host root tissues, using a suite of effector proteins to create and maintain a feeding site that is their sole source of nutrition. Using assumptions about the characteristics of genes involved in plant-nematode biotrophic interactions to inform the identification strategy, we provide a description and characterisation of a novel group of hyper-variable extracellular effectors termed HYP, from the potato cyst nematode Globodera pallida. HYP effectors comprise a large gene family, with a modular structure, and have unparalleled diversity between individuals of the same population: no two nematodes tested had the same genetic complement of HYP effectors. Individuals vary in the number, size, and type of effector subfamilies. HYP effectors are expressed throughout the biotrophic stages in large secretory cells associated with the amphids of parasitic stage nematodes as confirmed by in situ hybridisation. The encoded proteins are secreted into the host roots where they are detectable by immunochemistry in the apoplasm, between the anterior end of the nematode and the feeding site. We have identified HYP effectors in three genera of plant parasitic nematodes capable of infecting a broad range of mono- and dicotyledon crop species. In planta RNAi targeted to all members of the effector family causes a reduction in successful parasitism. PMID:25255291

  12. Identification and characterisation of a hyper-variable apoplastic effector gene family of the potato cyst nematodes.

    PubMed

    Eves-van den Akker, Sebastian; Lilley, Catherine J; Jones, John T; Urwin, Peter E

    2014-09-01

    Sedentary endoparasitic nematodes are obligate biotrophs that modify host root tissues, using a suite of effector proteins to create and maintain a feeding site that is their sole source of nutrition. Using assumptions about the characteristics of genes involved in plant-nematode biotrophic interactions to inform the identification strategy, we provide a description and characterisation of a novel group of hyper-variable extracellular effectors termed HYP, from the potato cyst nematode Globodera pallida. HYP effectors comprise a large gene family, with a modular structure, and have unparalleled diversity between individuals of the same population: no two nematodes tested had the same genetic complement of HYP effectors. Individuals vary in the number, size, and type of effector subfamilies. HYP effectors are expressed throughout the biotrophic stages in large secretory cells associated with the amphids of parasitic stage nematodes as confirmed by in situ hybridisation. The encoded proteins are secreted into the host roots where they are detectable by immunochemistry in the apoplasm, between the anterior end of the nematode and the feeding site. We have identified HYP effectors in three genera of plant parasitic nematodes capable of infecting a broad range of mono- and dicotyledon crop species. In planta RNAi targeted to all members of the effector family causes a reduction in successful parasitism.

  13. Prevalence of neurotoxic Clostridium botulinum type C in the gastrointestinal tracts of tilapia (Oreochromis mossambicus) in the Salton Sea.

    PubMed

    Nol, P; Rocke, T E; Gross, K; Yuill, T M

    2004-07-01

    Tilapia (Oreochromis mossambicus) have been implicated as the source of type C toxin in avian botulism outbreaks in pelicans (Pelecanus erythrorhynchos, Pelecanus occidentalis californicus) at the Salton Sea in southern California (USA). We collected sick, dead, and healthy fish from various sites throughout the Sea during the summers of 1999 through 2001 and tested them for the presence of Clostridium botulinum type C cells by polymerase chain reaction targeting the C(1) neurotoxin gene. Four of 96 (4%), 57 of 664 (9%), and five of 355 (1%) tilapia tested were positive for C. botulinum type C toxin gene in 1999, 2000, and 2001, respectively. The total number of positive fish was significantly greater in 2000 than in 2001 (P<0.0001). No difference in numbers of positives was detected between sick and dead fish compared with live fish. In 2000, no significant relationships were revealed among the variables studied, such as location and date of collection.
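
    The yearly prevalences and the 2000 versus 2001 comparison quoted above can be checked with a few lines of Python. This is only a sketch: the abstract does not state which statistical test the authors used, so a pooled two-proportion z-test is assumed here purely for illustration, and the function names are hypothetical.

```python
import math

# (positive, tested) per year, as reported in the abstract.
counts = {1999: (4, 96), 2000: (57, 664), 2001: (5, 355)}


def prevalence(positive, tested):
    """Fraction of tested fish that were positive."""
    return positive / tested


def two_proportion_z(pos_a, n_a, pos_b, n_b):
    """Pooled two-proportion z-test; returns (z, two-sided p-value).

    ASSUMPTION: chosen for illustration, not necessarily the authors' test.
    """
    p_a, p_b = pos_a / n_a, pos_b / n_b
    pooled = (pos_a + pos_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_a - p_b) / se
    # Two-sided tail probability of the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


for year, (pos, n) in counts.items():
    print(f"{year}: {pos}/{n} = {prevalence(pos, n):.1%}")

z, p = two_proportion_z(*counts[2000], *counts[2001])
print(f"2000 vs 2001: z = {z:.2f}, two-sided p = {p:.1e}")
```

    With the reported counts, the rounded prevalences match the 4%, 9%, and 1% given in the abstract, and the assumed test gives a p-value below 0.0001 for 2000 versus 2001.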

  14. Prevalence of neurotoxic Clostridium botulinum type C in the gastrointestinal tracts of tilapia (Oreochromis mossambicus) in the Salton Sea

    USGS Publications Warehouse

    Nol, P.J.; Rocke, T.E.; Gross, K.; Yuill, Thomas M.

    2004-01-01

    Tilapia (Oreochromis mossambicus) have been implicated as the source of type C toxin in avian botulism outbreaks in pelicans (Pelecanus erythrorhynchos, Pelecanus occidentalis californicus) at the Salton Sea in southern California (USA). We collected sick, dead, and healthy fish from various sites throughout the Sea during the summers of 1999 through 2001 and tested them for the presence of Clostridium botulinum type C cells by polymerase chain reaction targeting the C1 neurotoxin gene. Four of 96 (4%), 57 of 664 (9%), and five of 355 (1%) tilapia tested were positive for C. botulinum type C toxin gene in 1999, 2000, and 2001, respectively. The total number of positive fish was significantly greater in 2000 than in 2001 (P<0.0001). No difference in numbers of positives was detected between sick and dead fish compared with live fish. In 2000, no significant relationships were revealed among the variables studied, such as location and date of collection.

  15. Refining the 22q11.2 deletion breakpoints in DiGeorge syndrome by aCGH

    PubMed Central

    Bittel, D.C.; Yu, S.; Newkirk, H.; Kibiryeva, N.; Holt, S.; Butler, M.G.; Cooley, L.D.

    2009-01-01

    Hemizygous deletions of the chromosome 22q11.2 region result in the 22q11.2 deletion syndrome also referred to as DiGeorge, Velocardiofacial or Shprintzen syndromes. The phenotype is variable but commonly includes conotruncal cardiac defects, palatal abnormalities, learning and behavioral problems, immune deficiency, and facial anomalies. Four distinct highly homologous blocks of low copy number repeat sequences (LCRs) flank the deletion region. Mispairing of LCRs during meiosis with unequal meiotic exchange is assumed to cause the recurrent and consistent deletions. The proximal LCR is reportedly located at 22q11.2 from 17.037 to 17.083 Mb while the distal LCR is located from 19.835 to 19.880 Mb. Although the chromosome breakpoints are thought to localize to the LCRs, the positions of the breakpoints have been investigated in only a few individuals. Therefore, we used high resolution oligonucleotide-based 244K microarray comparative genomic hybridization (aCGH) to resolve the breakpoints in a cohort of 20 subjects with known 22q11.2 deletions. We also investigated copy number variation (CNV) in the rest of the genome. The 22q11.2 breaks occurred on either side of the LCR in our subjects, although more commonly on the distal side of the reported proximal LCR. The proximal breakpoints in our subjects spanned the region from 17.036 to 17.398 Mb. This region includes the genes DGCR6 (DiGeorge syndrome critical region protein 6) and PRODH (proline dehydrogenase 1), along with three open reading frames that may encode proteins of unknown function. The distal breakpoints spanned the region from 19.788 to 20.122 Mb. This region includes the genes GGT2 (gamma-glutamyltransferase-like protein 2), HIC2 (hypermethylated in cancer 2), and multiple transcripts of unknown function. The genes in these two breakpoint regions are variably hemizygous depending on the location of the breakpoints. Our 20 subjects had 254 CNVs throughout the genome, 94 duplications and 160 deletions, ranging in size from 1 kb to 2.4 Mb. The presence or absence of genes at the breakpoints depending on the size of the deletion plus variation in the rest of the genome due to CNVs likely contribute to the variable phenotype associated with the 22q11.2 deletion or DiGeorge syndrome. PMID:19420922

  16. Computational Tools and Algorithms for Designing Customized Synthetic Genes

    PubMed Central

    Gould, Nathan; Hendy, Oliver; Papamichail, Dimitris

    2014-01-01

    Advances in DNA synthesis have enabled the construction of artificial genes, gene circuits, and genomes of bacterial scale. Freedom in de novo design of synthetic constructs provides significant power in studying the impact of mutations in sequence features, and verifying hypotheses on the functional information that is encoded in nucleic and amino acids. To aid this goal, a large number of software tools of variable sophistication have been implemented, enabling the design of synthetic genes for sequence optimization based on rationally defined properties. The first generation of tools dealt predominantly with singular objectives such as codon usage optimization and unique restriction site incorporation. Recent years have seen the emergence of sequence design tools that aim to evolve sequences toward combinations of objectives. The design of optimal protein-coding sequences adhering to multiple objectives is computationally hard, and most tools rely on heuristics to sample the vast sequence design space. In this review, we study some of the algorithmic issues behind gene optimization and the approaches that different tools have adopted to redesign genes and optimize desired coding features. We utilize test cases to demonstrate the efficiency of each approach, as well as identify their strengths and limitations. PMID:25340050

  17. A Distance Measure for Genome Phylogenetic Analysis

    NASA Astrophysics Data System (ADS)

    Cao, Minh Duc; Allison, Lloyd; Dix, Trevor

    Phylogenetic analyses of species based on single genes or parts of the genomes are often inconsistent because of factors such as variable rates of evolution and horizontal gene transfer. The availability of more and more sequenced genomes allows phylogeny construction from complete genomes that is less sensitive to such inconsistency. For such long sequences, construction methods like maximum parsimony and maximum likelihood are often not possible due to their intensive computational requirements. Another class of tree construction methods, namely distance-based methods, requires a measure of distance between any two genomes. Some measures, such as evolutionary edit distance of gene order and gene content, are computationally expensive or do not perform well when the gene content of the organisms is similar. This study presents an information theoretic measure of genetic distances between genomes based on the biological compression algorithm expert model. We demonstrate that our distance measure can be applied to reconstruct the consensus phylogenetic tree of a number of Plasmodium parasites from their genomes, the statistical bias of which would mislead conventional analysis methods. Our approach is also used to successfully construct a plausible evolutionary tree for the γ-Proteobacteria group whose genomes are known to contain many horizontally transferred genes.
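
    The idea of a compression-based distance can be illustrated with a minimal sketch. This is not the authors' method: it swaps in the general-purpose zlib compressor for the expert model named in the abstract and uses the normalized compression distance as a stand-in for their measure. The function names are hypothetical.

```python
import zlib


def compressed_size(data: bytes) -> int:
    """Length in bytes of the zlib-compressed input (maximum compression level)."""
    return len(zlib.compress(data, 9))


def compression_distance(seq_x: str, seq_y: str) -> float:
    """Normalized compression distance between two sequences.

    Assumption: zlib stands in for a DNA-specific compressor, so the values
    are only a rough proxy for shared information between the sequences.
    """
    x, y = seq_x.encode("ascii"), seq_y.encode("ascii")
    cx, cy = compressed_size(x), compressed_size(y)
    cxy = compressed_size(x + y)
    # If y is largely predictable from x, compressing the concatenation
    # costs little more than compressing x alone, and the distance is small.
    return (cxy - min(cx, cy)) / max(cx, cy)
```

    A matrix of such pairwise distances could then be passed to any distance-based tree construction method, such as neighbor joining. Note that zlib only looks back over a 32 KB window, so this sketch is suitable for short sequences rather than whole genomes.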

  18. Copy number variations of genes involved in stress responses reflect the redox state and DNA damage in brewing yeasts.

    PubMed

    Adamczyk, Jagoda; Deregowska, Anna; Skoneczny, Marek; Skoneczna, Adrianna; Natkanska, Urszula; Kwiatkowska, Aleksandra; Rawska, Ewa; Potocki, Leszek; Kuna, Ewelina; Panek, Anita; Lewinska, Anna; Wnuk, Maciej

    2016-09-01

    The yeast strains of the Saccharomyces sensu stricto complex involved in beer production are a heterogeneous group whose genetic and genomic features are not adequately determined. Thus, the aim of the present study was to provide a genetic characterization of a selected group of commercially available brewing yeasts, both ale top-fermenting and lager bottom-fermenting strains. Molecular karyotyping revealed the diversity of chromosome patterns, and four strains with the most accented genetic variabilities were selected and subjected to genome-wide array-based comparative genomic hybridization (array-CGH) analysis. The differences in the gene copy number were found in five functional gene categories: (1) maltose metabolism and transport, (2) response to toxin, (3) siderophore transport, (4) cellular aldehyde metabolic process, and (5) L-iditol 2-dehydrogenase activity (p < 0.05). In the Saflager W-34/70 strain (Fermentis) with the most affected array-CGH profile, loss of aryl-alcohol dehydrogenase (AAD) gene dosage correlated with an imbalanced redox state, oxidative DNA damage and breaks, lower levels of nucleolar proteins Nop1 and Fob1, and diminished tolerance to fermentation-associated stress stimuli compared to other strains. We suggest that compromised stress response may not only promote oxidant-based changes in the nucleolus state that may affect fermentation performance but also provide novel directions for future strain improvement.

  19. Single-step method for β-galactosidase assays in Escherichia coli using a 96-well microplate reader.

    PubMed

    Schaefer, Jorrit; Jovanovic, Goran; Kotta-Loizou, Ioly; Buck, Martin

    2016-06-15

    Historically, the lacZ gene is one of the most universally used reporters of gene expression in molecular biology. Its activity can be quantified using an artificial substrate, o-nitrophenyl-β-d-galactopyranoside (ONPG). However, the traditional method for measuring LacZ activity (first described by J. H. Miller in 1972) can be challenging for a large number of samples, is prone to variability, and involves hazardous compounds for lysis (e.g., chloroform, toluene). Here we describe a single-step assay using a 96-well microplate reader with a proven alternative cell permeabilization method. This modified protocol reduces handling time by 90%. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.

  20. Monoamine Oxidase A Promoter Variable Number of Tandem Repeats (MAOA-uVNTR) in Alcoholics According to Lesch Typology

    PubMed Central

    Samochowiec, Agnieszka; Chęć, Magdalena; Kopaczewska, Edyta; Samochowiec, Jerzy; Lesch, Otto; Grochans, Elżbieta; Jasiewicz, Andrzej; Bienkowski, Przemyslaw; Łukasz, Kołodziej; Grzywacz, Anna

    2015-01-01

    Background: The aim of this study was to examine the association of the MAOA-uVNTR gene polymorphism with alcohol dependence in homogeneous subgroups of patients categorized according to Lesch’s typology. Methods: DNA was provided from alcohol dependent (AD) patients (n = 370) and healthy control subjects (n = 168), all of Polish descent. The history of alcoholism was obtained using the Polish version of the Semi-Structured Assessment for the Genetics of Alcoholism (SSAGA). Samples were genotyped using PCR methods. Results: We found no association between alcohol dependence and MAOA gene polymorphism. Conclusions: Lesch typology is a clinical consequence of the disease and its phenotypic description is too complex for a simple genetic analysis. PMID:25809512

  1. [Prognostic factors of early breast cancer].

    PubMed

    Almagro, Elena; González, Cynthia S; Espinosa, Enrique

    2016-02-19

    Decisions about the administration of adjuvant therapy for early breast cancer depend on the evaluation of prognostic factors. Lymph node status, tumor size and grade of differentiation are classical variables in this regard, and can be complemented by hormonal receptor status and HER2 expression. These factors can be combined into prognostic indexes to better estimate the risk of relapse or death. Other factors are less important. Gene profiles have emerged in recent years to identify low-risk patients who can forgo adjuvant chemotherapy. A number of profiles are available and can be used in selected cases. In the future, gene profiling will be used to select patients for treatment with new targeted therapies. Copyright © 2015 Elsevier España, S.L.U. All rights reserved.

  2. Flexibility in the structure of spiral flowers and its underlying mechanisms.

    PubMed

    Wang, Peipei; Liao, Hong; Zhang, Wengen; Yu, Xianxian; Zhang, Rui; Shan, Hongyan; Duan, Xiaoshan; Yao, Xu; Kong, Hongzhi

    2015-12-07

    Spiral flowers usually bear a variable number of organs, suggestive of the flexibility in structure. The mechanisms underlying the flexibility, however, remain unclear. Here we show that in Nigella damascena, a species with spiral flowers, different types of floral organs show different ranges of variation in number. We also show that the total number of organs per flower is largely dependent on the initial size of the floral meristem, whereas the respective numbers of different types of floral organs are determined by the functional domains of corresponding genetic programmes. By conducting extensive expression and functional studies, we further elucidate the genetic programmes that specify the identities of different types of floral organs. Notably, the AGL6-lineage member NdAGL6, rather than the AP1-lineage members NdFL1/2, is an A-function gene, whereas petaloidy of sepals is not controlled by AP3- or PI-lineage members. Moreover, owing to the formation of a regulatory network, some floral organ identity genes also regulate the boundaries between different types of floral organs. On the basis of these results, we propose that the floral organ identity determination programme is highly dynamic and shows considerable flexibility. Transitions from spiral to whorled flowers, therefore, may be explained by evolution of the mechanisms that reduce the flexibility.

  3. Microarray image analysis: background estimation using quantile and morphological filters.

    PubMed

    Bengtsson, Anders; Bengtsson, Henrik

    2006-02-28

    In a microarray experiment the difference in expression between genes on the same slide is up to 10³-fold or more. At low expression, even a small error in the estimate will have great influence on the final test and reference ratios. In addition to the true spot intensity, the scanned signal consists of different kinds of noise referred to as background. In order to assess the true spot intensity, background must be subtracted. The standard approach to estimate background intensities is to assume they are equal to the intensity levels between spots. In the literature, morphological opening is suggested to be one of the best methods for estimating background this way. This paper examines fundamental properties of rank and quantile filters, which include morphological filters at the extremes, with focus on their ability to estimate between-spot intensity levels. The bias and variance of these filter estimates are driven by the number of background pixels used and their distributions. A new rank-filter algorithm is implemented and compared to methods available in Spot by CSIRO and GenePix Pro by Axon Instruments. Spot's morphological opening has a mean bias between -47 and -248 compared to a bias between 2 and -2 for the rank filter, and the variability of the morphological opening estimate is 3 times higher than for the rank filter. The mean bias of Spot's second method, morph.close.open, is between -5 and -16 and the variability is approximately the same as for morphological opening. The variability of GenePix Pro's region-based estimate is more than ten times higher than the variability of the rank-filter estimate and with slightly more bias. The large variability is because the size of the background window changes with spot size. To overcome this, a non-adaptive region-based method is implemented. Its bias and variability are comparable to those of the rank filter. The performance of more advanced rank filters is equal to the best region-based methods. However, in order to get unbiased estimates these filters have to be implemented with great care. The performance of morphological opening is in general poor with a substantial spatial-dependent bias.
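
    The relationship between quantile filters and morphological filters described above can be sketched in a few lines. This is an illustrative toy, not the paper's rank-filter algorithm: the function names, the square window clipped at the image edges, and the floor-based choice of rank are all assumptions made here.

```python
def quantile_filter(image, size, quantile):
    """Sliding-window quantile filter on a 2-D list of intensities.

    quantile=0.0 gives a minimum filter (erosion), quantile=1.0 a maximum
    filter (dilation), and quantile=0.5 a median-like filter.
    Assumption: a size x size square window, clipped at the image border.
    """
    rows, cols = len(image), len(image[0])
    half = size // 2
    out = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            window = sorted(
                image[rr][cc]
                for rr in range(max(0, r - half), min(rows, r + half + 1))
                for cc in range(max(0, c - half), min(cols, c + half + 1))
            )
            # Pick the order statistic at the requested quantile (floor of the rank).
            out[r][c] = window[int(quantile * (len(window) - 1))]
    return out


def morphological_opening(image, size):
    """Opening = erosion (minimum filter) followed by dilation (maximum filter)."""
    return quantile_filter(quantile_filter(image, size, 0.0), size, 1.0)
```

    For a spot that covers fewer than half of the pixels in the window, the median-like filter returns the surrounding background level at the spot position, which is the quantity the between-spot approach tries to estimate.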

  4. Polymorphisms of vitamin K-related genes (EPHX1 and VKORC1L1) and stable warfarin doses.

    PubMed

    Chung, Jee-Eun; Lee, Kyung Eun; Chang, Byung Chul; Gwak, Hye Sun

    2018-01-30

    The aim of this study was to investigate the possible effects of EPHX1 and VKORC1L1 polymorphisms on variability of responses to warfarin. Sixteen single nucleotide polymorphisms (SNPs) in 201 patients with stable warfarin doses were analyzed including genes of VKORC1, CYP2C9, CYP4F2, GGCX, EPHX1 and VKORC1L1. Univariate analysis was conducted for the association of genotypes with stable warfarin doses. Multiple linear regression analysis was used to investigate factors that independently affected the inter-individual variability of warfarin dose requirements. The rs4072879 of VKORC1L1 (A>G) was significantly associated with stable warfarin doses; wild homozygote carriers (AA) required significantly lower stable warfarin doses than those with the variant G allele (5.02±1.56 vs. 5.96±2.01mg; p=0.001). Multivariate analysis showed that EPHX1 rs1877724 and VKORC1L1 rs4072879 accounted for 1.5% and 1.3% of the warfarin dose variability. Adding EPHX1 and VKORC1L1 SNPs to the base model including non-genetic variables (operation age, body weight and the therapy of ACEI or ARB) and genetic variables (VKORC1 rs9934438, CYP2C9 rs1057910, and CYP4F2 rs2108622) gave a number needed to genotype of 34. This study showed that polymorphisms of EPHX1 and VKORC1L1 could be determinants of stable warfarin doses. Copyright © 2017. Published by Elsevier B.V.

  5. Reduced MHC and neutral variation in the Galápagos hawk, an island endemic

    PubMed Central

    2011-01-01

    Background: Genes at the major histocompatibility complex (MHC) are known for high levels of polymorphism maintained by balancing selection. In small or bottlenecked populations, however, genetic drift may be strong enough to overwhelm the effect of balancing selection, resulting in reduced MHC variability. In this study we investigated MHC evolution in two recently diverged bird species: the endemic Galápagos hawk (Buteo galapagoensis), which occurs in small, isolated island populations, and its widespread mainland relative, the Swainson's hawk (B. swainsoni). Results: We amplified at least two MHC class II B gene copies in each species. We recovered only three different sequences from 32 Galápagos hawks, while we amplified 20 unique sequences in 20 Swainson's hawks. Most of the sequences clustered into two groups in a phylogenetic network, with one group likely representing pseudogenes or nonclassical loci. Neutral genetic diversity at 17 microsatellite loci was also reduced in the Galápagos hawk compared to the Swainson's hawk. Conclusions: The corresponding loss in neutral diversity suggests that the reduced variability present at Galápagos hawk MHC class II B genes compared to the Swainson's hawk is primarily due to a founder event followed by ongoing genetic drift in small populations. However, purifying selection could also explain the low number of MHC alleles present. This lack of variation at genes involved in the adaptive immune response could be cause for concern should novel diseases reach the archipelago. PMID:21612651

  6. Variability of the caprine whey protein genes and their association with milk yield, composition and renneting properties in the Sarda breed: 2. The BLG gene.

    PubMed

    Dettori, Maria Luisa; Pazzola, Michele; Pira, Emanuela; Puggioni, Ornella; Vacca, Giuseppe Massimo

    2015-11-01

    The variability of the promoter region and the 3'UTR (exon-7) of the BLG gene, encoding the β-lactoglobulin, was investigated by sequencing in 263 lactating Sarda goats in order to assess its association with milk traits. Milk traits included: milk yield, fat, total protein and lactose content, pH, daily fat and protein yield (DFPY), freezing point, milk energy, somatic cell count, total microbial mesophilic count, rennet coagulation time (RCT), curd firming rate (k20) and curd firmness (a30). A total of 7 polymorphic sites were detected and the sequence analysed was given accession number KM817769. Only three SNPs (c.-381C>T, c.-323C>T and c.*420C>A) had minor allele frequency higher than 0.05. The effects of farm, stage of lactation and the interaction farm × stage of lactation significantly influenced all the milk traits (P T and c.*420C>A (P T (P < 0.001). The c.-381TT homozygous goats showed lower pH, RCT and k20 than c.-381CT (P < 0.05). In conclusion the polymorphism of the goat BLG gene did not affect the total protein content of the Sarda goat milk, and only weakly influenced RCT and k20. On the other hand, an interesting effect on milk yields and DFPY emerged in two SNPs. This information might be useful in dairy goat breeding programs.

  7. Gene-Environment Interplay in Physical, Psychological, and Cognitive Domains in Mid to Late Adulthood: Is APOE a Variability Gene?

    PubMed

    Reynolds, Chandra A; Gatz, Margaret; Christensen, Kaare; Christiansen, Lene; Dahl Aslan, Anna K; Kaprio, Jaakko; Korhonen, Tellervo; Kremen, William S; Krueger, Robert; McGue, Matt; Neiderhiser, Jenae M; Pedersen, Nancy L

    2016-01-01

    Despite emerging interest in gene-environment interaction (G×E) effects, there is a dearth of studies evaluating its potential relevance apart from specific hypothesized environments and biometrical variance trends. Using a monozygotic within-pair approach, we evaluated evidence of G×E for body mass index (BMI), depressive symptoms, and cognition (verbal, spatial, attention, working memory, perceptual speed) in twin studies from four countries. We also evaluated whether APOE is a 'variability gene' across these measures and whether it partly represents the 'G' in G×E effects. In all three domains, G×E effects were pervasive across country and gender, with small-to-moderate effects. Age-cohort trends were generally stable for BMI and depressive symptoms; however, they were variable (with both increasing and decreasing age-cohort trends) for different cognitive measures. Results also suggested that APOE may represent a 'variability gene' for depressive symptoms and spatial reasoning, but not for BMI or other cognitive measures. Hence, additional genes are salient beyond APOE.

  8. Copy number variability in Parkinson's disease: assembling the puzzle through a systems biology approach.

    PubMed

    La Cognata, Valentina; Morello, Giovanna; D'Agata, Velia; Cavallaro, Sebastiano

    2017-01-01

    Parkinson's disease (PD), the second most common progressive neurodegenerative disorder of aging, was long believed to be a sporadic syndrome of non-genetic origin. The proof that several genetic loci are responsible for rare Mendelian forms has represented a revolutionary breakthrough, making it possible to reveal the molecular mechanisms underlying this debilitating, still incurable condition. While single nucleotide polymorphisms (SNPs) and small indels constitute the most commonly investigated DNA variations accounting for only a limited number of PD cases, larger genomic molecular rearrangements have emerged as significant PD-causing mutations, including submicroscopic Copy Number Variations (CNVs). CNVs constitute a prevalent source of genomic variations and substantially participate in each individual's genomic makeup and phenotypic outcome. However, the majority of genetic studies have focused their attention on single candidate-gene mutations or on common variants reaching a significant statistical level of acceptance. This gene-centric approach is insufficient to uncover the genetic background of polygenic multifactorial disorders like PD, and potentially masks rare individual CNVs that together might contribute to disease development or progression. In this review, we will discuss literature and bioinformatic data describing the involvement of CNVs in PD pathobiology. We will analyze the most frequent copy number changes in familial PD genes and provide a "systems biology" overview of rare individual rearrangements that could functionally act on commonly deregulated molecular pathways. Assessing the global genome-wide burden of CNVs in PD patients may reveal new disease-related molecular mechanisms, and open the window to a new possible genetic scenario in the unsolved PD puzzle.

  9. Links among nitrification, nitrifier communities, and edaphic properties in contrasting soils receiving dairy slurry.

    PubMed

    Fortuna, Ann-Marie; Honeycutt, C Wayne; Vandemark, George; Griffin, Timothy S; Larkin, Robert P; He, Zhongqi; Wienhold, Brian J; Sistani, Karamat R; Albrecht, Stephan L; Woodbury, Bryan L; Torbert, Henry A; Powell, J Mark; Hubbard, Robert K; Eigenberg, Roger A; Wright, Robert J; Alldredge, J Richard; Harsh, James B

    2012-01-01

    Soil biotic and abiotic factors strongly influence nitrogen (N) availability and increases in nitrification rates associated with the application of manure. In this study, we examine the effects of edaphic properties and a dairy (Bos taurus) slurry amendment on N availability, nitrification rates and nitrifier communities. Soils of variable texture and clay mineralogy were collected from six USDA-ARS research sites and incubated for 28 d with and without dairy slurry applied at a rate of ~300 kg N ha⁻¹. Periodically, subsamples were removed for analyses of 2 M KCl extractable N and nitrification potential, as well as gene copy numbers of ammonia-oxidizing bacteria (AOB) and archaea (AOA). Spearman coefficients for nitrification potentials and AOB copy number were positively correlated with total soil C, total soil N, cation exchange capacity, and clay mineralogy in treatments with and without slurry application. Our data show that the quantity and type of clay minerals present in a soil affect nitrifier populations, nitrification rates, and the release of inorganic N. Nitrogen mineralization, nitrification potentials, and edaphic properties were positively correlated with AOB gene copy numbers. On average, AOA gene copy numbers were an order of magnitude lower than those of AOB across the six soils and did not increase with slurry application. Our research suggests that the two nitrifier communities overlap but have different optimum environmental conditions for growth and activity that are partly determined by the interaction of manure-derived ammonium with soil properties. Copyright © by the American Society of Agronomy, Crop Science Society of America, and Soil Science Society of America, Inc.

  10. Chromosome mapping, molecular cloning and expression analysis of a novel gene response for leaf width in rice.

    PubMed

    Wu, Yahui; Luo, Lixin; Chen, Likai; Tao, Xingxing; Huang, Ming; Wang, Hui; Chen, Zhiqiang; Xiao, Wuming

    2016-11-18

    Genetic analysis revealed that the narrow leaf, small panicle, thin and slender stems, and low fertility rate of an Indica rice variety were recessive traits controlled by a single gene. Applying a map-based cloning strategy, a novel narrow leaf gene, named nal11, was delimited to an interval of 58.3 kb between the InDel markers N10 and InD5016. There are 9 genes in the mapping interval, and only a gene encoding a heat shock DNAJ protein (Os07g09450) has a specific G to T SNP, which occurred at the last base of the second exon of Os07g09450 in ZYX. 5' and 3' RACE results showed that there were two transcripts of NAL11, and that the SNP in nal11 leads to alternative splicing of the mRNA. In addition, this alternative splicing, together with a stop codon closely following the SNP that terminates translation, destroyed the DNAJ domain of the nal11 product. These results suggested that the heat shock DNAJ gene was most likely the candidate gene for nal11. The results of RT-PCR and real-time PCR further verified that the SNP in the ZYX-nal11 gene affects the mRNA splicing pattern. The phenotype of ZYX may be caused by a statistically significant reduction in the total number of small veins in leaves and in the size and number of small vascular bundles and cells in stems, similar to several previously reported mutations. The basic molecular information we provide here will be useful for further investigations of the physiological function of the heat shock DNAJ gene, which will be helpful in better understanding the role of the DNAJ family in regulation of plant type traits such as leaf width of rice. Copyright © 2016 Elsevier Inc. All rights reserved.

  11. Autosomal Dominant Cataract: Intrafamilial Phenotypic Variability, Interocular Asymmetry, and Variable Progression in Four Chilean Families

    PubMed Central

    Shafie, Suraiya M.; Barria von-Bischhoffshausen, Fernando R.; Bateman, J. Bronwyn

    2006-01-01

    PURPOSE: To document intrafamilial and interocular phenotypic variability of autosomal dominant cataract (ADC). DESIGN: Prospective observational case series. METHODS: We performed ophthalmologic examination in four Chilean ADC families. RESULTS: The families exhibited variability with respect to morphology, location within the lens, color and density of cataracts among affected members. We documented asymmetry between eyes in the morphology, location within the lens, color and density of cataracts, and a variable rate of progression. CONCLUSIONS: The cataracts in these families exhibit wide intrafamilial and interocular phenotypic variability, supporting the premise that the mutated genes are expressed differentially in individuals and between eyes; other genes or environmental factors may be the bases for this variability. Marked progression among some family members underscores the variable clinical course of a common mutation within a family. Like retinitis pigmentosa, classification of ADC will be most useful if based on the gene and specific mutation. PMID:16564818

  12. Draft genome assembly of the Bengalese finch, Lonchura striata domestica, a model for motor skill variability and learning

    PubMed Central

    Mets, David G; Brainard, Michael S

    2018-01-01

    Background: Vocal learning in songbirds has emerged as a powerful model for sensorimotor learning. Neurobehavioral studies of Bengalese finch (Lonchura striata domestica) song, naturally more variable and plastic than songs of other finch species, have demonstrated the importance of behavioral variability for initial learning, maintenance, and plasticity of vocalizations. However, the molecular and genetic underpinnings of this variability and the learning it supports are poorly understood. Findings: To establish a platform for the molecular analysis of behavioral variability and plasticity, we generated an initial draft assembly of the Bengalese finch genome from a single male animal to 151× coverage and an N50 of 3.0 MB. Furthermore, we developed an initial set of gene models using RNA-seq data from 8 samples that comprise liver, muscle, cerebellum, brainstem/midbrain, and forebrain tissue from juvenile and adult Bengalese finches of both sexes. Conclusions: We provide a draft Bengalese finch genome and gene annotation to facilitate the study of the molecular-genetic influences on behavioral variability and the process of vocal learning. These data will directly support many avenues for the identification of genes involved in learning, including differential expression analysis, comparative genomic analysis (through comparison to existing avian genome assemblies), and derivation of genetic maps for linkage analysis. Bengalese finch gene models and sequences will be essential for subsequent manipulation (molecular or genetic) of genes and gene products, enabling novel mechanistic investigations into the role of variability in learned behavior. PMID:29618046

  13. Draft genome assembly of the Bengalese finch, Lonchura striata domestica, a model for motor skill variability and learning.

    PubMed

    Colquitt, Bradley M; Mets, David G; Brainard, Michael S

    2018-03-01

    Vocal learning in songbirds has emerged as a powerful model for sensorimotor learning. Neurobehavioral studies of Bengalese finch (Lonchura striata domestica) song, naturally more variable and plastic than songs of other finch species, have demonstrated the importance of behavioral variability for initial learning, maintenance, and plasticity of vocalizations. However, the molecular and genetic underpinnings of this variability and the learning it supports are poorly understood. To establish a platform for the molecular analysis of behavioral variability and plasticity, we generated an initial draft assembly of the Bengalese finch genome from a single male animal to 151× coverage and an N50 of 3.0 MB. Furthermore, we developed an initial set of gene models using RNA-seq data from 8 samples that comprise liver, muscle, cerebellum, brainstem/midbrain, and forebrain tissue from juvenile and adult Bengalese finches of both sexes. We provide a draft Bengalese finch genome and gene annotation to facilitate the study of the molecular-genetic influences on behavioral variability and the process of vocal learning. These data will directly support many avenues for the identification of genes involved in learning, including differential expression analysis, comparative genomic analysis (through comparison to existing avian genome assemblies), and derivation of genetic maps for linkage analysis. Bengalese finch gene models and sequences will be essential for subsequent manipulation (molecular or genetic) of genes and gene products, enabling novel mechanistic investigations into the role of variability in learned behavior.
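
    The N50 statistic quoted for the assembly can be computed from a list of contig or scaffold lengths. The sketch below uses the common definition (the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the assembly); the abstract does not say which tool or exact variant the authors used, so the function name and tie handling are assumptions.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Sequences are taken from longest to shortest until they cover at least
    half of the total assembly length; the last length added is the N50.
    """
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    raise ValueError("n50() requires at least one sequence length")
```

    For example, lengths of 2 through 10 sum to 54, and the three longest (10, 9 and 8) already reach 27, so the N50 of that set is 8.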

  14. Genetic and epigenetic contributions to the cortical phenotype in mammals

    PubMed Central

    Larsen, DeLaine D.; Krubitzer, Leah

    2008-01-01

    One aspect of cortical organization, cortical field size, is variable both within and across species. The observed variability arises from a variety of sources, including genes intrinsic to the neocortex and a number of extrinsic and epigenetic factors. Genes intrinsic to the cortex are directly involved in the development and specification of cortical fields and are regulated from both signaling centers located outside of the neocortex, which secrete diffusible molecules, and the expression of transcription factors within the neocortex. In addition, extrinsic factors such as the type, location and density of sensory receptor arrays and how these receptor arrays are utilized, are also strongly related to cortical field size. Epigenetic factors including the relative activity patterns generated by the different types of physical stimuli in a given environment also contribute to differences in cortical organization, including cortical field size. Since both genetic and epigenetic factors contribute to cortical organization, some aspects of the cortical phenotype evolve, while other aspects of the cortical phenotype persist only if the environment in which an individual develops is relatively stable. PMID:18331904

  15. The nucleotide sequence of the entire ribosomal DNA operon and the structure of the large subunit rRNA of Giardia muris.

    PubMed

    van Keulen, H; Gutell, R R; Campbell, S R; Erlandsen, S L; Jarroll, E L

    1992-10-01

    The total nucleotide sequence of the rDNA of Giardia muris, an intestinal protozoan parasite of rodents, has been determined. The repeat unit is 7668 basepairs (bp) in size and consists of a spacer of 3314 bp, a small-subunit rRNA (SSU-rRNA) gene of 1429 bp, and a large-subunit rRNA (LSU-rRNA) gene of 2698 bp. The spacer contains long direct repeats and is heterogeneous in size. The LSU-rRNA of G. muris was compared to that of the human intestinal parasite Giardia duodenalis, to the bird parasite Giardia ardeae, and to that of Escherichia coli. The LSU-rRNA has a size comparable to the 23S rRNA of E. coli but shows structural features typical for eukaryotes. Some variable regions are typically small and account for the overall smaller size of this rRNA. The structure of the G. muris LSU-rRNA is similar to that of the other Giardia rRNA, but each rRNA has characteristic features residing in a number of variable regions.

  16. Regional differentiation among populations of the Diamondback terrapin (Malaclemys terrapin)

    USGS Publications Warehouse

    Hart, Kristen M.; Hunter, Margaret E.; King, Tim L.

    2014-01-01

    The Diamondback terrapin (Malaclemys terrapin) is a brackish-water turtle species whose populations have been fragmented due to anthropogenic activity such as development of coastal habitat and entrapment in commercial blue crab (Callinectes sapidus) fishing gear. Genetic analyses can improve conservation efforts for the long-term protection of the species. We used microsatellite DNA analysis to investigate levels of gene flow among and genetic variability within 21 geographically separate collections of the species distributed from Massachusetts to Texas. Quantified levels of genetic variability (allelic diversity, genotypic frequencies, and heterozygosity) revealed three zones of genetic discontinuity, resulting in four discrete populations: Northeast Atlantic, Coastal Mid-Atlantic, Florida and Texas/Louisiana. The average number of alleles and expected heterozygosity for the four genetic clusters were NA = 6.54 and HE = 0.050, respectively. However, the geographic boundaries of the populations did not correspond to accepted terrapin subspecies limits. Our results illuminate not only the need to sample terrapins in additional sites, specifically in the southeast, but also the necessity for allowing uninterrupted gene flow among population groupings to preserve current levels of genetic diversity.
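    The diversity measures named in this abstract (number of alleles and expected heterozygosity) can be illustrated with a minimal sketch. It assumes the standard single-locus definition HE = 1 - sum(p_i^2); the abstract does not state which estimator was used. The function name and the allele frequencies are hypothetical illustration values, not data from the study.

```python
def expected_heterozygosity(allele_freqs):
    """Expected heterozygosity at one locus: 1 minus the sum of squared allele frequencies."""
    total = sum(allele_freqs)
    if abs(total - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    return 1.0 - sum(p * p for p in allele_freqs)


# Hypothetical microsatellite locus with four alleles (made-up frequencies).
freqs = [0.4, 0.3, 0.2, 0.1]
n_alleles = len(freqs)                 # number of alleles at this locus
he = expected_heterozygosity(freqs)    # expected heterozygosity at this locus
print(n_alleles, round(he, 2))
```

    Averaging both quantities over loci and collections would give summary values of the kind reported as NA and HE.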

  17. Minding the gap: Frequency of indels in mtDNA control region sequence data and influence on population genetic analyses

    USGS Publications Warehouse

    Pearce, J.M.

    2006-01-01

    Insertions and deletions (indels) result in sequences of various lengths when homologous gene regions are compared among individuals or species. Although indels are typically phylogenetically informative, occurrence and incorporation of these characters as gaps in intraspecific population genetic data sets are rarely discussed. Moreover, the impact of gaps on estimates of fixation indices, such as FST, has not been reviewed. Here, I summarize the occurrence and population genetic signal of indels among 60 published studies that involved alignments of multiple sequences from the mitochondrial DNA (mtDNA) control region of vertebrate taxa. Among 30 studies observing indels, an average of 12% of both variable and parsimony-informative sites were composed of these sites. There was no consistent trend between levels of population differentiation and the number of gap characters in a data block. Across all studies, the average influence on estimates of ΦST was small, explaining only an additional 1.8% of among population variance (range 0.0-8.0%). Studies most likely to observe an increase in ΦST with the inclusion of gap characters were those with < 20 variable sites, but a near equal number of studies with few variable sites did not show an increase. In contrast to studies at interspecific levels, the influence of indels for intraspecific population genetic analyses of control region DNA appears small, dependent upon total number of variable sites in the data block, and related to species-specific characteristics and the spatial distribution of mtDNA lineages that contain indels. © 2006 Blackwell Publishing Ltd.

  18. Genetic variability in captive populations of the stingless bee Tetragonisca angustula.

    PubMed

    Santiago, Leandro R; Francisco, Flávio O; Jaffé, Rodolfo; Arias, Maria C

    2016-08-01

    Low genetic variability has normally been considered a consequence of animal husbandry and a major contributing factor to declining bee populations. Here, we performed a molecular analysis of captive and wild populations of the stingless bee Tetragonisca angustula, one of the most commonly kept species across South America. Microsatellite analyses showed similar genetic variability between wild and captive populations. However, captive populations showed lower mitochondrial genetic variability. Male-mediated gene flow, transport and division of nests are suggested as the most probable explanations for the observed patterns of genetic structure. We conclude that increasing the number of colonies kept through nest divisions does not negatively affect nuclear genetic variability, which seems to be maintained by small-scale male dispersal and human-mediated nest transport. However, the transport of nests from distant localities should be practiced with caution given the high genetic differentiation observed between samples from western and eastern areas. The high genetic structure verified is the result of a long-term evolutionary process, and bees from distant localities may represent unique evolutionary lineages.

  19. Normalization of High Dimensional Genomics Data Where the Distribution of the Altered Variables Is Skewed

    PubMed Central

    Landfors, Mattias; Philip, Philge; Rydén, Patrik; Stenberg, Per

    2011-01-01

    Genome-wide analysis of gene expression or protein binding patterns using different array or sequencing based technologies is now routinely performed to compare different populations, such as treatment and reference groups. It is often necessary to normalize the data obtained to remove technical variation introduced in the course of conducting experimental work, but standard normalization techniques are not capable of eliminating technical bias in cases where the distribution of the truly altered variables is skewed, i.e. when a large fraction of the variables are either positively or negatively affected by the treatment. However, several experiments are likely to generate such skewed distributions, including ChIP-chip experiments for the study of chromatin, gene expression experiments for the study of apoptosis, and SNP-studies of copy number variation in normal and tumour tissues. A preliminary study using spike-in array data established that the capacity of an experiment to identify altered variables and generate unbiased estimates of the fold change decreases as the fraction of altered variables and the skewness increases. We propose the following work-flow for analyzing high-dimensional experiments with regions of altered variables: (1) Pre-process raw data using one of the standard normalization techniques. (2) Investigate if the distribution of the altered variables is skewed. (3) If the distribution is not believed to be skewed, no additional normalization is needed. Otherwise, re-normalize the data using a novel HMM-assisted normalization procedure. (4) Perform downstream analysis. Here, ChIP-chip data and simulated data were used to evaluate the performance of the work-flow. It was found that skewed distributions can be detected by using the novel DSE-test (Detection of Skewed Experiments). Furthermore, applying the HMM-assisted normalization to experiments where the distribution of the truly altered variables is skewed results in considerably higher sensitivity and lower bias than can be attained using standard and invariant normalization methods. PMID:22132175

  20. Horizontal gene transfer in Histophilus somni and its role in the evolution of pathogenic strain 2336, as determined by comparative genomic analyses

    PubMed Central

    2011-01-01

    Background Pneumonia and myocarditis are the most commonly reported diseases due to Histophilus somni, an opportunistic pathogen of the reproductive and respiratory tracts of cattle. Thus far only a few genes involved in metabolic and virulence functions have been identified and characterized in H. somni using traditional methods. Analyses of the genome sequences of several Pasteurellaceae species have provided insights into their biology and evolution. In view of the economic and ecological importance of H. somni, the genome sequence of pneumonia strain 2336 has been determined and compared to that of commensal strain 129Pt and other members of the Pasteurellaceae. Results The chromosome of strain 2336 (2,263,857 bp) contained 1,980 protein coding genes, whereas the chromosome of strain 129Pt (2,007,700 bp) contained only 1,792 protein coding genes. Although the chromosomes of the two strains differ in size, their average GC content, gene density (total number of genes predicted on the chromosome), and percentage of sequence (number of genes) that encodes proteins were similar. The chromosomes of these strains also contained a number of discrete prophage regions and genomic islands. One of the genomic islands in strain 2336 contained genes putatively involved in copper, zinc, and tetracycline resistance. Using the genome sequence data and comparative analyses with other members of the Pasteurellaceae, several H. somni genes that may encode proteins involved in virulence (e.g., filamentous haemagglutinins, adhesins, and polysaccharide biosynthesis/modification enzymes) were identified. The two strains contained a total of 17 ORFs that encode putative glycosyltransferases and some of these ORFs had characteristic simple sequence repeats within them. Most of the genes/loci common to both the strains were located in different regions of the two chromosomes and occurred in opposite orientations, indicating genome rearrangement since their divergence from a common ancestor. Conclusions Since the genome of strain 129Pt was ~256,000 bp smaller than that of strain 2336, these genomes provide yet another paradigm for studying evolutionary gene loss and/or gain in regard to virulence repertoire and pathogenic ability. Analyses of the complete genome sequences revealed that bacteriophage- and transposon-mediated horizontal gene transfer had occurred at several loci in the chromosomes of strains 2336 and 129Pt. It appears that these mobile genetic elements have played a major role in creating genomic diversity and phenotypic variability among the two H. somni strains. PMID:22111657

  1. Horizontal gene transfer in Histophilus somni and its role in the evolution of pathogenic strain 2336, as determined by comparative genomic analyses.

    PubMed

    Siddaramappa, Shivakumara; Challacombe, Jean F; Duncan, Alison J; Gillaspy, Allison F; Carson, Matthew; Gipson, Jenny; Orvis, Joshua; Zaitshik, Jeremy; Barnes, Gentry; Bruce, David; Chertkov, Olga; Detter, J Chris; Han, Cliff S; Tapia, Roxanne; Thompson, Linda S; Dyer, David W; Inzana, Thomas J

    2011-11-23

    Pneumonia and myocarditis are the most commonly reported diseases due to Histophilus somni, an opportunistic pathogen of the reproductive and respiratory tracts of cattle. Thus far only a few genes involved in metabolic and virulence functions have been identified and characterized in H. somni using traditional methods. Analyses of the genome sequences of several Pasteurellaceae species have provided insights into their biology and evolution. In view of the economic and ecological importance of H. somni, the genome sequence of pneumonia strain 2336 has been determined and compared to that of commensal strain 129Pt and other members of the Pasteurellaceae. The chromosome of strain 2336 (2,263,857 bp) contained 1,980 protein coding genes, whereas the chromosome of strain 129Pt (2,007,700 bp) contained only 1,792 protein coding genes. Although the chromosomes of the two strains differ in size, their average GC content, gene density (total number of genes predicted on the chromosome), and percentage of sequence (number of genes) that encodes proteins were similar. The chromosomes of these strains also contained a number of discrete prophage regions and genomic islands. One of the genomic islands in strain 2336 contained genes putatively involved in copper, zinc, and tetracycline resistance. Using the genome sequence data and comparative analyses with other members of the Pasteurellaceae, several H. somni genes that may encode proteins involved in virulence (e.g., filamentous haemagglutinins, adhesins, and polysaccharide biosynthesis/modification enzymes) were identified. The two strains contained a total of 17 ORFs that encode putative glycosyltransferases and some of these ORFs had characteristic simple sequence repeats within them. Most of the genes/loci common to both the strains were located in different regions of the two chromosomes and occurred in opposite orientations, indicating genome rearrangement since their divergence from a common ancestor. Since the genome of strain 129Pt was ~256,000 bp smaller than that of strain 2336, these genomes provide yet another paradigm for studying evolutionary gene loss and/or gain in regard to virulence repertoire and pathogenic ability. Analyses of the complete genome sequences revealed that bacteriophage- and transposon-mediated horizontal gene transfer had occurred at several loci in the chromosomes of strains 2336 and 129Pt. It appears that these mobile genetic elements have played a major role in creating genomic diversity and phenotypic variability among the two H. somni strains.

  2. A Mitochondrial Mutator System in Maize

    PubMed Central

    Kuzmin, Evgeny V.; Duvick, Donald N.; Newton, Kathleen J.

    2005-01-01

    The P2 line of maize (Zea mays) is characterized by mitochondrial genome destabilization, initiated by recessive nuclear mutations. These alleles alter copy number control of mitochondrial subgenomes and disrupt normal transfer of mitochondrial genomic components to progeny, resulting in differences in mitochondrial DNA profiles among sibling plants and between parents and progeny. The mitochondrial DNA changes are often associated with variably defective phenotypes, reflecting depletion of essential mitochondrial genes. The P2 nuclear genotype can be considered a natural mutagenesis system for maize mitochondria. It dramatically accelerates mitochondrial genomic divergence by increasing low copy-number subgenomes, by rapidly amplifying aberrant recombination products, and by causing the random loss of normal components of the mitochondrial genomes. PMID:15681663

  3. Variability of nitrifying communities in surface coastal waters of the Eastern South Pacific (∼36° S).

    PubMed

    Levipan, Héctor A; Molina, Verónica; Anguita, Cristóbal; Rain-Franco, Angel; Belmar, Lucy; Fernandez, Camila

    2016-08-03

    We report the seasonal and single-diurnal variability of potentially active members of the prokaryote community in coastal surface waters off central Chile and the relationship between nitrifiers and solar radiation by combining 16S cDNA-based pyrosequencing, RT-qPCR of specific gene markers for nitrifiers (amoA, for general AOA, AOA-A, AOA-B, Nitrosopumilus maritimus and beta-AOB; and 16S rRNA gene for Nitrospina-like NOB), and solar irradiance measurements. We also evaluated the effects of artificial UVA-PAR and PAR spectra on nitrifiers by RT-qPCR. All nitrifiers (except AOA-B ecotype) were detected via RT-qPCR but AOA was the only group detected by pyrosequencing. Results showed high variability in their transcriptional levels during the day, which could be associated with sunlight intensity thresholds in winter, although AOA and Nitrospina-like NOB transcript numbers were also potentially related to environmental substrate availability. Only N. maritimus amoA transcripts showed a significant negative correlation with solar irradiances in both periods. During spring-summer, Nitrospina transcripts decreased at higher sunlight intensities, whereas the opposite was found during winter under natural (in situ) and artificial light experiments. In summary, a nitrifying community with variable tolerance to solar radiation is responsible for daily nitrification, and was particularly diverse during winter in the study area. © 2016 Society for Applied Microbiology and John Wiley & Sons Ltd.

  4. A large-scale study of the random variability of a coding sequence: a study on the CFTR gene.

    PubMed

    Modiano, Guido; Bombieri, Cristina; Ciminelli, Bianca Maria; Belpinati, Francesca; Giorgi, Silvia; Georges, Marie des; Scotet, Virginie; Pompei, Fiorenza; Ciccacci, Cinzia; Guittard, Caroline; Audrézet, Marie Pierre; Begnini, Angela; Toepfer, Michael; Macek, Milan; Ferec, Claude; Claustres, Mireille; Pignatti, Pier Franco

    2005-02-01

    Coding single nucleotide substitutions (cSNSs) have been studied on hundreds of genes using small samples (n(g) approximately 100-150 genes). In the present investigation, a large random European population sample (average n(g) approximately 1500) was studied for a single gene, the CFTR (Cystic Fibrosis Transmembrane conductance Regulator). The nonsynonymous (NS) substitutions exhibited, in accordance with previous reports, a mean probability of being polymorphic (q > 0.005), much lower than that of the synonymous (S) substitutions, but they showed a similar rate of subpolymorphic (q < 0.005) variability. This indicates that, in autosomal genes that may have harmful recessive alleles (nonduplicated genes with important functions), genetic drift overwhelms selection in the subpolymorphic range of variability, making disadvantageous alleles behave as neutral. These results imply that the majority of the subpolymorphic nonsynonymous alleles of these genes are selectively negative or even pathogenic.

  5. Influence of NR3C1 and VDR polymorphisms on stable warfarin dose in patients with mechanical cardiac valves.

    PubMed

    Lee, Kyung Eun; Chung, Jee Eun; Yi, Boram; Cho, Yoon Jeong; Kim, Hyun Jeong; Lee, Gwan Yung; Kim, Joo Hee; Chang, Byung Chul; Gwak, Hye Sun

    2017-06-01

    The aim of this study was to evaluate the associations between polymorphisms of VKORC1, CYP2C9, CYP4F2, NR3C1 and VDR genes and stable warfarin doses in Korean patients with mechanical heart valves. Seventeen single-nucleotide polymorphisms (SNPs) in 204 patients with stable warfarin dose were analyzed: VKORC1 (rs9934438), CYP2C9 (rs1057910), CYP4F2 (rs2108622), NR3C1 (rs41423247, rs1800445, rs56149945, rs10052957, rs6198, rs33388, rs6196, and rs244465), and VDR (rs1544410, rs11568820, rs731236, rs757343, rs7975232, and rs2228570). Statistical analyses were conducted to evaluate the associations of gene variations with stable warfarin dose. Number needed to genotype was obtained by calculating the percentage of patients whose predicted dose was at least 20% higher or lower than the actual stable dose. The combined genotypes of rs7975232 and rs2228570 of the VDR gene revealed a significant association with stable warfarin dose, along with VKORC1, CYP2C9, and CYP4F2 polymorphisms. Patients with the genotype combination GT,TT/CT,CC of VDR rs7975232/rs2228570 required significantly higher stable warfarin dose (5.79 ± 2.02 mg) than those with the other genotypic combinations (5.19 ± 1.78 mg, p = 0.034). Multivariate analysis showed that VDR rs7975232/rs2228570 explained 2.0% of the 47.5% variability in overall warfarin dose. Adding VDR SNP combinations to the base model including non-genetic variables (age, sex, and body weight) and genetic variables (VKORC1 rs9934438, CYP2C9 rs1057910, and CYP4F2 rs2108622) gave a number needed to genotype of 41. This study showed that stable warfarin dose is associated with VDR SNPs along with VKORC1, CYP2C9, and CYP4F2 SNPs. Copyright © 2017 Elsevier B.V. All rights reserved.
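    The number-needed-to-genotype calculation described in this abstract can be sketched as follows. The abstract only states that the figure is derived from the percentage of patients whose predicted dose is at least 20% above or below the actual stable dose; taking the reciprocal of the reduction in that percentage between two models is an assumption made here for illustration. The function names and dose values are hypothetical and are not study data.

```python
def fraction_mispredicted(predicted, actual, tolerance=0.20):
    """Share of patients whose predicted dose deviates from the actual dose by at least `tolerance`."""
    off = sum(1 for p, a in zip(predicted, actual) if abs(p - a) >= tolerance * a)
    return off / len(actual)


def number_needed_to_genotype(frac_base, frac_extended):
    """Assumed definition: reciprocal of the absolute reduction in mispredicted patients."""
    reduction = frac_base - frac_extended
    if reduction <= 0:
        raise ValueError("extended model does not reduce mispredictions")
    return 1.0 / reduction


# Made-up stable doses (mg) and predictions from two hypothetical models.
actual = [5.0, 4.0, 6.0, 3.0, 7.0]
base_pred = [6.5, 4.1, 4.5, 3.2, 7.1]      # base model
extended_pred = [5.4, 4.1, 4.5, 3.2, 7.1]  # base model plus extra SNPs

frac_base = fraction_mispredicted(base_pred, actual)
frac_ext = fraction_mispredicted(extended_pred, actual)
nng = number_needed_to_genotype(frac_base, frac_ext)
print(frac_base, frac_ext, round(nng, 1))
```

    Under this assumed definition, a reported value of 41 would correspond to a reduction of roughly 2.4 percentage points in mispredicted patients.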

  6. Response variables for evaluation of the effectiveness of conservation corridors.

    PubMed

    Gregory, Andrew J; Beier, Paul

    2014-06-01

    Many studies have evaluated effectiveness of corridors by measuring species presence in and movement through small structural corridors. However, few studies have assessed whether these response variables are adequate for assessing whether the conservation goals of the corridors have been achieved or considered the costs or lag times involved in measuring the response variables. We examined 4 response variables (presence of the focal species in the corridor, interpatch movement via the corridor, gene flow, and patch occupancy) with respect to 3 criteria: relevance to conservation goals, lag time (fewest generations at which a positive response to the corridor might be evident with a particular variable), and the cost of a study when applying a particular variable. The presence variable had the least relevance to conservation goals, no lag time advantage compared with interpatch movement, and only a moderate cost advantage over interpatch movement or gene flow. Movement of individual animals between patches was the most appropriate response variable for a corridor intended to provide seasonal migration, but it was not an appropriate response variable for corridor dwellers, and for passage species it was only moderately relevant to the goals of gene flow, demographic rescue, and recolonization. Response variables related to gene flow provided a good trade-off among cost, relevance to conservation goals, and lag time. Nonetheless, the lag time of 10-20 generations means that evaluation of conservation corridors cannot occur until a few decades after a corridor has been established. Response variables related to occupancy were most relevant to conservation goals, but the lag time and costs to detect corridor effects on occupancy were much greater than the lag time and costs to detect corridor effects on gene flow. © 2014 Society for Conservation Biology.

  7. A Genetic Basis for Functional Hypothalamic Amenorrhea

    PubMed Central

    Caronia, Lisa M.; Martin, Cecilia; Welt, Corrine K.; Sykiotis, Gerasimos P.; Quinton, Richard; Thambundit, Apisadaporn; Avbelj, Magdalena; Dhruvakumar, Sadhana; Plummer, Lacey; Hughes, Virginia A.; Seminara, Stephanie B.; Boepple, Paul A.; Sidis, Yisrael; Crowley, William F.; Martin, Kathryn A.; Hall, Janet E.; Pitteloud, Nelly

    2011-01-01

    BACKGROUND Functional hypothalamic amenorrhea is a reversible form of gonadotropin-releasing hormone (GnRH) deficiency commonly triggered by stressors such as excessive exercise, nutritional deficits, or psychological distress. Women vary in their susceptibility to inhibition of the reproductive axis by such stressors, but it is unknown whether this variability reflects a genetic predisposition to hypothalamic amenorrhea. We hypothesized that mutations in genes involved in idiopathic hypogonadotropic hypogonadism, a congenital form of GnRH deficiency, are associated with hypothalamic amenorrhea. METHODS We analyzed the coding sequence of genes associated with idiopathic hypogonadotropic hypogonadism in 55 women with hypothalamic amenorrhea and performed in vitro studies of the identified mutations. RESULTS Six heterozygous mutations were identified in 7 of the 55 patients with hypothalamic amenorrhea: two variants in the fibroblast growth factor receptor 1 gene FGFR1 (G260E and R756H), two in the prokineticin receptor 2 gene PROKR2 (R85H and L173R), one in the GnRH receptor gene GNRHR (R262Q), and one in the Kallmann syndrome 1 sequence gene KAL1 (V371I). No mutations were found in a cohort of 422 controls with normal menstrual cycles. In vitro studies showed that FGFR1 G260E, FGFR1 R756H, and PROKR2 R85H are loss-of-function mutations, as has been previously shown for PROKR2 L173R and GNRHR R262Q. CONCLUSIONS Rare variants in genes associated with idiopathic hypogonadotropic hypogonadism are found in women with hypothalamic amenorrhea, suggesting that these mutations may contribute to the variable susceptibility of women to the functional changes in GnRH secretion that characterize hypothalamic amenorrhea. Our observations provide evidence for the role of rare variants in common multifactorial disease. (Funded by the Eunice Kennedy Shriver National Institute of Child Health and Human Development and others; ClinicalTrials.gov number, NCT00494169.) PMID:21247312

  8. Selection of Reference Gene Expression in a Schizophrenia Brain Cohort

    PubMed Central

    Weickert, Cynthia Shannon; Sheedy, Donna; Rothmond, Debora A.; Dedova, Irina; Fung, Samantha; Garrick, Therese; Wong, Jenny; Harding, Antony J.; Sivagnanansundaram, Sinthuja; Hunt, Clare; Duncan, Carlotta; Sundqvist, Nina; Tsai, Shan-Yuan; Anand, Jasna; Draganic, Daren; Harper, Clive

    2010-01-01

    Objective To conduct postmortem human brain research into the neuropathological basis of schizophrenia, it is critical to establish cohorts that are well-characterised and well-matched. Our objective was to determine whether specimen characteristics, including diagnosis, age, postmortem interval (PMI), brain acidity (pH), and/or the agonal state of the subject at death, were related to RNA quality, and to determine the most appropriate reference gene mRNAs. Methods We selected a matched cohort of 74 cases (37 schizophrenia / schizoaffective disorder cases and 37 control cases). Middle frontal gyrus tissue was pulverised, tissue pH was measured, RNA was isolated for cDNA from each case, and RNA integrity number (RIN) measurements were assessed. Using RT-PCR, we measured nine housekeeper genes and calculated a geomean in each diagnostic group. Results We found that the RINs were very good (mean 7.3) and all nine housekeeper control genes were significantly correlated with RIN. Seven of nine housekeeper genes were also correlated with pH, and two clinical variables, agonal state and duration of illness, did have an effect on some control mRNAs. No major impact of PMI or freezer time on housekeeper mRNAs was detected. Our results show that people with schizophrenia had significantly less PPIA and SDHA mRNA and tended to have less GUSB and B2M mRNA, suggesting that these control genes may not be good candidates for normalisation. Conclusions In our cohort, less than 10% variability in RIN values was detected and the diagnostic groups were well matched overall. Our cohort was adequately powered (0.80–0.90) to detect mRNA differences (25%) due to disease. Our study suggests that multiple factors should be considered in mRNA expression studies of human brain tissues. When schizophrenia cases are adequately matched to control cases, subtle differences in gene expression can be reliably detected. PMID:20073568
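    The geomean of housekeeper genes mentioned in this abstract can be illustrated with a minimal sketch. It assumes the common practice of dividing a target transcript level by the geometric mean of several reference-gene levels from the same sample; the abstract does not spell out the exact procedure. The function name and expression values are hypothetical.

```python
import math


def geometric_mean(values):
    """Geometric mean of strictly positive values, computed via logarithms."""
    if not values or any(v <= 0 for v in values):
        raise ValueError("values must be non-empty and strictly positive")
    return math.exp(sum(math.log(v) for v in values) / len(values))


# Made-up relative expression levels of three reference genes in one sample.
housekeepers = [1.0, 2.0, 4.0]
target = 3.0  # made-up level of a gene of interest in the same sample

reference = geometric_mean(housekeepers)
normalised = target / reference
print(round(reference, 3), round(normalised, 3))
```

    The geometric mean is less sensitive than the arithmetic mean to a single reference gene with an unusually high level, which is one reason it is often chosen for this purpose.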

  9. A genetic basis for functional hypothalamic amenorrhea.

    PubMed

    Caronia, Lisa M; Martin, Cecilia; Welt, Corrine K; Sykiotis, Gerasimos P; Quinton, Richard; Thambundit, Apisadaporn; Avbelj, Magdalena; Dhruvakumar, Sadhana; Plummer, Lacey; Hughes, Virginia A; Seminara, Stephanie B; Boepple, Paul A; Sidis, Yisrael; Crowley, William F; Martin, Kathryn A; Hall, Janet E; Pitteloud, Nelly

    2011-01-20

    Functional hypothalamic amenorrhea is a reversible form of gonadotropin-releasing hormone (GnRH) deficiency commonly triggered by stressors such as excessive exercise, nutritional deficits, or psychological distress. Women vary in their susceptibility to inhibition of the reproductive axis by such stressors, but it is unknown whether this variability reflects a genetic predisposition to hypothalamic amenorrhea. We hypothesized that mutations in genes involved in idiopathic hypogonadotropic hypogonadism, a congenital form of GnRH deficiency, are associated with hypothalamic amenorrhea. We analyzed the coding sequence of genes associated with idiopathic hypogonadotropic hypogonadism in 55 women with hypothalamic amenorrhea and performed in vitro studies of the identified mutations. Six heterozygous mutations were identified in 7 of the 55 patients with hypothalamic amenorrhea: two variants in the fibroblast growth factor receptor 1 gene FGFR1 (G260E and R756H), two in the prokineticin receptor 2 gene PROKR2 (R85H and L173R), one in the GnRH receptor gene GNRHR (R262Q), and one in the Kallmann syndrome 1 sequence gene KAL1 (V371I). No mutations were found in a cohort of 422 controls with normal menstrual cycles. In vitro studies showed that FGFR1 G260E, FGFR1 R756H, and PROKR2 R85H are loss-of-function mutations, as has been previously shown for PROKR2 L173R and GNRHR R262Q. Rare variants in genes associated with idiopathic hypogonadotropic hypogonadism are found in women with hypothalamic amenorrhea, suggesting that these mutations may contribute to the variable susceptibility of women to the functional changes in GnRH secretion that characterize hypothalamic amenorrhea. Our observations provide evidence for the role of rare variants in common multifactorial disease. (Funded by the Eunice Kennedy Shriver National Institute of Child Health and Human Development and others; ClinicalTrials.gov number, NCT00494169.).

  10. Chromosomal Copy Number Variation in Saccharomyces pastorianus Is Evidence for Extensive Genome Dynamics in Industrial Lager Brewing Strains.

    PubMed

    van den Broek, M; Bolat, I; Nijkamp, J F; Ramos, E; Luttik, M A H; Koopman, F; Geertman, J M; de Ridder, D; Pronk, J T; Daran, J-M

    2015-09-01

    Lager brewing strains of Saccharomyces pastorianus are natural interspecific hybrids originating from the spontaneous hybridization of Saccharomyces cerevisiae and Saccharomyces eubayanus. Over the past 500 years, S. pastorianus has been domesticated to become one of the most important industrial microorganisms. Production of lager-type beers requires a set of essential phenotypes, including the ability to ferment maltose and maltotriose at low temperature, the production of flavors and aromas, and the ability to flocculate. Understanding of the molecular basis of complex brewing-related phenotypic traits is a prerequisite for rational strain improvement. While genome sequences have been reported, the variability and dynamics of S. pastorianus genomes have not been investigated in detail. Here, using deep sequencing and chromosome copy number analysis, we showed that S. pastorianus strain CBS1483 exhibited extensive aneuploidy. This was confirmed by quantitative PCR and by flow cytometry. As a direct consequence of this aneuploidy, a massive number of sequence variants was identified, leading to at least 1,800 additional protein variants in S. pastorianus CBS1483. Analysis of eight additional S. pastorianus strains revealed that the previously defined group I strains showed comparable karyotypes, while group II strains showed large interstrain karyotypic variability. Comparison of three strains with nearly identical genome sequences revealed substantial chromosome copy number variation, which may contribute to strain-specific phenotypic traits. The observed variability of lager yeast genomes demonstrates that systematic linking of genotype to phenotype requires a three-dimensional genome analysis encompassing physical chromosomal structures, the copy number of individual chromosomes or chromosomal regions, and the allelic variation of copies of individual genes. Copyright © 2015, van den Broek et al.

  11. Chromosomal Copy Number Variation in Saccharomyces pastorianus Is Evidence for Extensive Genome Dynamics in Industrial Lager Brewing Strains

    PubMed Central

    van den Broek, M.; Bolat, I.; Nijkamp, J. F.; Ramos, E.; Luttik, M. A. H.; Koopman, F.; Geertman, J. M.; de Ridder, D.; Pronk, J. T.

    2015-01-01

    Lager brewing strains of Saccharomyces pastorianus are natural interspecific hybrids originating from the spontaneous hybridization of Saccharomyces cerevisiae and Saccharomyces eubayanus. Over the past 500 years, S. pastorianus has been domesticated to become one of the most important industrial microorganisms. Production of lager-type beers requires a set of essential phenotypes, including the ability to ferment maltose and maltotriose at low temperature, the production of flavors and aromas, and the ability to flocculate. Understanding of the molecular basis of complex brewing-related phenotypic traits is a prerequisite for rational strain improvement. While genome sequences have been reported, the variability and dynamics of S. pastorianus genomes have not been investigated in detail. Here, using deep sequencing and chromosome copy number analysis, we showed that S. pastorianus strain CBS1483 exhibited extensive aneuploidy. This was confirmed by quantitative PCR and by flow cytometry. As a direct consequence of this aneuploidy, a massive number of sequence variants was identified, leading to at least 1,800 additional protein variants in S. pastorianus CBS1483. Analysis of eight additional S. pastorianus strains revealed that the previously defined group I strains showed comparable karyotypes, while group II strains showed large interstrain karyotypic variability. Comparison of three strains with nearly identical genome sequences revealed substantial chromosome copy number variation, which may contribute to strain-specific phenotypic traits. The observed variability of lager yeast genomes demonstrates that systematic linking of genotype to phenotype requires a three-dimensional genome analysis encompassing physical chromosomal structures, the copy number of individual chromosomes or chromosomal regions, and the allelic variation of copies of individual genes. PMID:26150454

  12. Genetic architecture, inter-relationship and selection criteria for yield improvement in rice (Oryza sativa L.).

    PubMed

    Yadav, S K; Pandey, P; Kumar, B; Suresh, B G

    2011-05-01

    This study was conducted to determine the extent of genetic association between the yield of rice (Oryza sativa L.) and its components. The experiment was carried out with 40 rice genotypes, which were evaluated in a randomized block design with 3 replications during the wet seasons of 2007 and 2008. Results showed that a sufficient amount of variability was found in the entire gene pool for all traits studied. A higher magnitude of genotypic and phenotypic coefficients of variation was recorded for seed yield, harvest index, biological yield, number of spikelets per panicle, flag leaf length, plant height and number of tillers, indicating that these characters are least influenced by environment. High heritability coupled with high genetic advance as percent of mean was registered for seed yield, harvest index, number of spikelets per panicle, biological yield and flag leaf length, suggesting a preponderance of additive gene action in the expression of these characters. Grain yield was significantly and positively associated with harvest index, number of tillers per hill, number of panicles per plant, panicle length, number of spikelets per panicle and test weight at both genotypic and phenotypic levels. Path coefficient analysis revealed that harvest index, biological yield, number of tillers per hill, panicle length, number of spikelets per panicle, plant height and test weight had direct positive effects on seed yield, indicating that these are the main contributors to yield. From this study it may be concluded that harvest index, number of tillers per hill, panicle length, number of spikelets per panicle and test weight are the most important characters contributing directly to yield. Thus, these characters may serve as selection criteria for improving the genetic potential of rice.

  13. Multivariate Analysis of Genotype-Phenotype Association.

    PubMed

    Mitteroecker, Philipp; Cheverud, James M; Pavlicev, Mihaela

    2016-04-01

    With the advent of modern imaging and measurement technology, complex phenotypes are increasingly represented by large numbers of measurements, which may not bear biological meaning one by one. For such multivariate phenotypes, studying the pairwise associations between all measurements and all alleles is highly inefficient and prevents insight into the genetic pattern underlying the observed phenotypes. We present a new method for identifying patterns of allelic variation (genetic latent variables) that are maximally associated, in terms of effect size, with patterns of phenotypic variation (phenotypic latent variables). This multivariate genotype-phenotype mapping (MGP) separates phenotypic features under strong genetic control from less genetically determined features and thus permits an analysis of the multivariate structure of genotype-phenotype association, including its dimensionality and the clustering of genetic and phenotypic variables within this association. Different variants of MGP maximize different measures of genotype-phenotype association: genetic effect, genetic variance, or heritability. In an application to a mouse sample, scored for 353 SNPs and 11 phenotypic traits, the first dimension of genetic and phenotypic latent variables accounted for >70% of genetic variation present in all 11 measurements; 43% of variation in this phenotypic pattern was explained by the corresponding genetic latent variable. The first three dimensions together sufficed to account for almost 90% of genetic variation in the measurements and for all the interpretable genotype-phenotype association. Each dimension can be tested as a whole against the hypothesis of no association, thereby reducing the number of statistical tests from 7766 to 3, the maximal number of meaningful independent tests. Important alleles can be selected based on their effect size (additive or nonadditive effect on the phenotypic latent variable). This low dimensionality of the genotype-phenotype map has important consequences for gene identification and may shed light on the evolvability of organisms. Copyright © 2016 by the Genetics Society of America.

  14. The nuclear 18S ribosomal RNA gene as a source of phylogenetic information in the genus Taenia.

    PubMed

    Yan, Hongbin; Lou, Zhongzi; Li, Li; Ni, Xingwei; Guo, Aijiang; Li, Hongmin; Zheng, Yadong; Dyachenko, Viktor; Jia, Wanzhong

    2013-03-01

    Most species of the genus Taenia are of considerable medical and veterinary significance. In this study, complete nuclear 18S rRNA gene sequences were obtained from seven members of genus Taenia [Taenia multiceps, Taenia saginata, Taenia asiatica, Taenia solium, Taenia pisiformis, Taenia hydatigena, and Taenia taeniaeformis] and a phylogeny inferred using these sequences. Most of the variable sites fall within the variable regions, V1-V5. We show that sequences from the nuclear 18S ribosomal RNA gene have considerable promise as sources of phylogenetic information within the genus Taenia. Furthermore, given that almost all the variable sites lie within defined variable portions of that gene, it will be appropriate and economical to sequence only those regions for additional species of Taenia.

  15. Diversity of Clostridium perfringens isolates from various sources and prevalence of conjugative plasmids.

    PubMed

    Park, Miseon; Deck, Joanna; Foley, Steven L; Nayak, Rajesh; Songer, J Glenn; Seibel, Janice R; Khan, Saeed A; Rooney, Alejandro P; Hecht, David W; Rafii, Fatemeh

    2016-04-01

    Clostridium perfringens is an important pathogen, causing food poisoning and other mild to severe infections in humans and animals. Some strains of C. perfringens contain conjugative plasmids, which may carry antimicrobial resistance and toxin genes. We studied genomic and plasmid diversity of 145 C. perfringens type A strains isolated from soils, foods, chickens, clinical samples, and domestic animals (porcine, bovine and canine), from different geographic areas in the United States between 1994 and 2006, using multiple-locus variable-number tandem repeat analysis (MLVA) and/or pulsed-field gel electrophoresis (PFGE). MLVA detected the genetic diversity in a majority of the isolates. PFGE, using SmaI and KspI, confirmed the MLVA results but also detected differences among the strains that could not be differentiated by MLVA. All of the PFGE profiles of the strains were different, except for a few of the epidemiologically related strains, which were identical. The PFGE profiles of strains isolated from the same domestic animal species were clustered more closely with each other than with other strains. However, a variety of C. perfringens strains with distinct genetic backgrounds were found among the clinical isolates. Variation was also observed in the size and number of plasmids in the strains. Primers for the internal fragment of a conjugative tcpH gene of C. perfringens plasmid pCPF4969 amplified identical size fragments from a majority of strains tested; and this gene hybridized to the various-sized plasmids of these strains. The sequences of the PCR-amplified tcpH genes from 12 strains showed diversity among the tcpH genes. Regardless of the sources of the isolates, the genetic diversity of C. perfringens extended to the plasmids carrying conjugative genes. Published by Elsevier Ltd.

  16. Gene-body hypermethylation of ATM in peripheral blood DNA of bilateral breast cancer patients

    PubMed Central

    Flanagan, James M.; Munoz-Alegre, Marta; Henderson, Stephen; Tang, Thomas; Sun, Ping; Johnson, Nichola; Fletcher, Olivia; dos Santos Silva, Isabel; Peto, Julian; Boshoff, Chris; Narod, Steven; Petronis, Arturas

    2009-01-01

    Bilaterality of breast cancer is an indicator of constitutional cancer susceptibility; however, the molecular causes underlying this predisposition in the majority of cases are not known. We hypothesize that epigenetic misregulation of cancer-related genes could partially account for this predisposition. We have performed methylation microarray analysis of peripheral blood DNA from 14 women with bilateral breast cancer compared with 14 unaffected matched controls throughout 17 candidate breast cancer susceptibility genes including BRCA1, BRCA2, CHEK2, ATM, ESR1, SFN, CDKN2A, TP53, GSTP1, CDH1, CDH13, HIC1, PGR, SFRP1, MLH1, RARB and HSD17B4. We show that the majority of methylation variability is associated with intragenic repetitive elements. Detailed validation of the tiled region around ATM was performed by bisulphite modification and pyrosequencing of the same samples and in a second set of peripheral blood DNA from 190 bilateral breast cancer patients compared with 190 controls. We show significant hypermethylation of one intragenic repetitive element in breast cancer cases compared with controls (P = 0.0017), with the highest quartile of methylation associated with a 3-fold increased risk of breast cancer (OR 3.20, 95% CI 1.78–5.86, P = 0.000083). Increased methylation of this locus is associated with lower steady-state ATM mRNA level and correlates with age of cancer patients but not controls, suggesting a combined age–phenotype-related association. This research demonstrates the potential for gene-body epigenetic misregulation of ATM and other cancer-related genes in peripheral blood DNA that may be useful as a novel marker to estimate breast cancer risk. Accession numbers: The microarray data and associated .BED and .WIG files can be accessed through Gene Expression Omnibus accession number: GSE14603. PMID:19153073

  17. The Spike-and-Slab Lasso Generalized Linear Models for Prediction and Associated Genes Detection.

    PubMed

    Tang, Zaixiang; Shen, Yueping; Zhang, Xinyan; Yi, Nengjun

    2017-01-01

    Large-scale "omics" data have been increasingly used as an important resource for prognostic prediction of diseases and detection of associated genes. However, there are considerable challenges in analyzing high-dimensional molecular data, including the large number of potential molecular predictors, limited number of samples, and small effect of each predictor. We propose new Bayesian hierarchical generalized linear models, called spike-and-slab lasso GLMs, for prognostic prediction and detection of associated genes using large-scale molecular data. The proposed model employs a spike-and-slab mixture double-exponential prior for coefficients that can induce weak shrinkage on large coefficients, and strong shrinkage on irrelevant coefficients. We have developed a fast and stable algorithm to fit large-scale hierarchical GLMs by incorporating expectation-maximization (EM) steps into the fast cyclic coordinate descent algorithm. The proposed approach integrates nice features of two popular methods, i.e., penalized lasso and Bayesian spike-and-slab variable selection. The performance of the proposed method is assessed via extensive simulation studies. The results show that the proposed approach can provide not only more accurate estimates of the parameters, but also better prediction. We demonstrate the proposed procedure on two cancer data sets: a well-known breast cancer data set consisting of 295 tumors, and expression data of 4919 genes; and the ovarian cancer data set from TCGA with 362 tumors, and expression data of 5336 genes. Our analyses show that the proposed procedure can generate powerful models for predicting outcomes and detecting associated genes. The methods have been implemented in a freely available R package BhGLM (http://www.ssg.uab.edu/bhglm/). Copyright © 2017 by the Genetics Society of America.
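
    The adaptive shrinkage described in this abstract can be illustrated with a minimal Python sketch. It is not the BhGLM implementation: it assumes a two-component double-exponential (Laplace) mixture with a narrow "spike" scale and a wide "slab" scale, and computes the posterior slab probability and the expected L1 penalty weight that an EM-style E-step would hand to a lasso solver. The names s0, s1 and theta and their default values are illustrative assumptions.

```python
import math


def de_density(beta, scale):
    """Double-exponential (Laplace) density centred at zero."""
    return math.exp(-abs(beta) / scale) / (2.0 * scale)


def slab_probability(beta, s0=0.05, s1=1.0, theta=0.5):
    """Probability that a coefficient belongs to the wide 'slab' component.

    s0 (spike scale), s1 (slab scale) and theta (prior slab weight) are
    hypothetical values chosen only for illustration.
    """
    slab = theta * de_density(beta, s1)
    spike = (1.0 - theta) * de_density(beta, s0)
    return slab / (slab + spike)


def expected_penalty(beta, s0=0.05, s1=1.0, theta=0.5):
    """Expected inverse scale, usable as a per-coefficient L1 penalty weight."""
    p = slab_probability(beta, s0, s1, theta)
    return (1.0 - p) / s0 + p / s1


if __name__ == "__main__":
    for beta in (0.0, 0.1, 0.5, 2.0):
        print(beta, slab_probability(beta), expected_penalty(beta))
```

    Under these assumed settings, a coefficient near zero receives a penalty weight close to 1/s0 (strong shrinkage), while a large coefficient receives a weight close to 1/s1 (weak shrinkage).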

  18. Variability and population genetic structure in Achyrocline flaccida (Weinm.) DC., a species with high value in folk medicine in South America.

    PubMed

    Rosa, Juliana da; Weber, Gabriela Gomes; Cardoso, Rafaela; Górski, Felipe; Da-Silva, Paulo Roberto

    2017-01-01

    Better knowledge of medicinal plant species and their conservation is an urgent need worldwide. Decision making for conservation strategies can be based on the knowledge of the variability and population genetic structure of the species and on the events that may influence these genetic parameters. Achyrocline flaccida (Weinm.) DC. is a native plant from the grassy fields of South America with high value in folk medicine. In spite of its importance, no genetic and conservation studies are available for the species. In this work, microsatellite and ISSR (inter-simple sequence repeat) markers were used to estimate the genetic variability and structure of seven populations of A. flaccida from southern Brazil. The microsatellite markers were inefficient in A. flaccida owing to a high number of null alleles. After the evaluation of 42 ISSR primers on one population, 10 were selected for further analysis of seven A. flaccida populations. The results of ISSR showed that the high number of exclusive absence of loci might contribute to the inter-population differentiation. Genetic variability of the species was high (Nei's diversity of 0.23 and Shannon diversity of 0.37). AMOVA indicated higher genetic variability within (64.7%) than among (33.96%) populations, and the variability was unevenly distributed (FST 0.33). Gene flow among populations ranged from 1.68 to 5.2 migrants per generation, with an average of 1.39. The results of PCoA and Bayesian analyses corroborated and indicated that the populations are structured. The observed genetic variability and population structure of A. flaccida are discussed in the context of the vegetation formation history in southern Brazil, as well as the possible anthropogenic effects. Additionally, we discuss the implications of the results in the conservation of the species.

  19. The ace-1 Locus Is Amplified in All Resistant Anopheles gambiae Mosquitoes: Fitness Consequences of Homogeneous and Heterogeneous Duplications

    PubMed Central

    Djogbénou, Luc S.; Berthomieu, Arnaud; Makoundou, Patrick; Baba-Moussa, Lamine S.; Fiston-Lavier, Anna-Sophie; Belkhir, Khalid; Labbé, Pierrick; Weill, Mylène

    2016-01-01

    Gene copy-number variations are widespread in natural populations, but investigating their phenotypic consequences requires contemporary duplications under selection. Such duplications have been found at the ace-1 locus (encoding the organophosphate and carbamate insecticides’ target) in the mosquito Anopheles gambiae (the major malaria vector); recent studies have revealed their intriguing complexity, consistent with the involvement of various numbers and types (susceptible or resistant to insecticide) of copies. We used an integrative approach, from genome to phenotype level, to investigate the influence of duplication architecture and gene-dosage on mosquito fitness. We found that both heterogeneous (i.e., one susceptible and one resistant ace-1 copy) and homogeneous (i.e., identical resistant copies) duplications segregated in field populations. The number of copies in homogeneous duplications was variable and positively correlated with acetylcholinesterase activity and resistance level. Determining the genomic structure of the duplicated region revealed that, in both types of duplication, ace-1 and 11 other genes formed tandem 203-kb amplicons. We developed a diagnostic test for duplications, which showed that ace-1 was amplified in all 173 resistant mosquitoes analyzed (field-collected in several African countries), in heterogeneous or homogeneous duplications. Each type was associated with different fitness trade-offs: heterogeneous duplications conferred an intermediate phenotype (lower resistance and fitness costs), whereas homogeneous duplications tended to increase both resistance and fitness cost, in a complex manner. The type of duplication selected seemed thus to depend on the intensity and distribution of selection pressures. This versatility of trade-offs available through gene duplication highlights the importance of large mutation events in adaptation to environmental variation. This impressive adaptability could have a major impact on vector control in Africa. PMID:27918584

  20. Pharmacogenetics of tacrolimus and sirolimus in renal transplant patients: from retrospective analyses to prospective studies.

    PubMed

    Anglicheau, D; Legendre, C; Thervet, E

    2007-09-01

    The promise of pharmacogenetics is to elucidate the inherited basis of differences between individual responses to drugs in order to identify the right drug and dose for each patient. The recent identification of genetic polymorphisms in drug-metabolizing enzymes and drug transporters led to the hypothesis that genetic factors may be implicated in the interindividual variability of the pharmacokinetic or pharmacodynamic characteristics of immunosuppressive drugs, major side effects, and efficacy. The purpose of this study was to provide a short overview of recent results obtained in the field of pharmacogenetics of tacrolimus and sirolimus, both substrates of the cytochrome P450 3A (CYP3A) enzymes and of the efflux pump P-glycoprotein, the product of the Multidrug Resistance-1 (MDR1) gene. A number of retrospective studies have demonstrated a link between the polymorphisms governing CYP3A5 protein expression and the daily dose necessary to achieve adequate blood tacrolimus levels, with more conflicting results for the MDR1 gene polymorphisms. The CYP3A5 polymorphisms have also been associated with sirolimus pharmacokinetics. One challenge is to investigate the combined effect of a number of different polymorphisms in various genes to define genetic backgrounds with different pharmacokinetic profiles using high throughput technologies. Another challenge is to move toward prospective randomized studies to explore whether a pharmacogenetic approach, taking into account a limited number of polymorphisms prior to drug treatment, could be used on an individual basis to guide initial dosing of a given drug. The last challenge is based on "target" pharmacogenetics to investigate the role of the polymorphisms of other genes implicated in the efficacy and/or safety of the drug.

  1. Analysis of continuous-time switching networks

    NASA Astrophysics Data System (ADS)

    Edwards, R.

    2000-11-01

    Models of a number of biological systems, including gene regulation and neural networks, can be formulated as switching networks, in which the interactions between the variables depend strongly on thresholds. An idealized class of such networks in which the switching takes the form of Heaviside step functions but variables still change continuously in time has been proposed as a useful simplification to gain analytic insight. These networks, called here Glass networks after their originator, are simple enough mathematically to allow significant analysis without restricting the range of dynamics found in analogous smooth systems. A number of results have been obtained before, particularly regarding existence and stability of periodic orbits in such networks, but important cases were not considered. Here we present a coherent method of analysis that summarizes previous work and fills in some of the gaps as well as including some new results. Furthermore, we apply this analysis to a number of examples, including surprisingly long and complex limit cycles involving sequences of hundreds of threshold transitions. Finally, we show how the above methods can be extended to investigate aperiodic behaviour in specific networks, though a complete analysis will have to await new results in matrix theory and symbolic dynamics.
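
    The kind of switching network this abstract describes can be sketched in a few lines of Python. The example below is an illustration under stated assumptions, not a model taken from the paper: it uses a hypothetical ring of three mutually repressing variables, a common threshold of 0, focal values of +1 and -1, unit decay rates, and a simple forward-Euler integrator. Each variable relaxes continuously toward a target that depends only on the Heaviside (on/off) state of its repressor.

```python
import numpy as np


def heaviside_state(x, theta=0.0):
    """Boolean state of each variable: 1 if above the threshold, else 0."""
    return (x > theta).astype(int)


def focal_point(state):
    """Targets for an illustrative 3-variable ring of repressors.

    Variable i is driven toward +1 while its repressor is OFF and toward -1
    while it is ON. np.roll makes variable 2 repress 0, 0 repress 1, and
    1 repress 2.
    """
    repressor = np.roll(state, 1)
    return np.where(repressor == 0, 1.0, -1.0)


def simulate(x0, t_end=10.0, dt=1e-3):
    """Forward-Euler integration of dx/dt = -x + F(H(x)).

    Returns the sequence of distinct Boolean states visited.
    """
    x = np.asarray(x0, dtype=float)
    visited = [tuple(heaviside_state(x).tolist())]
    for _ in range(int(round(t_end / dt))):
        x = x + dt * (-x + focal_point(heaviside_state(x)))
        state = tuple(heaviside_state(x).tolist())
        if state != visited[-1]:
            visited.append(state)
    return visited


if __name__ == "__main__":
    for state in simulate([0.5, -0.5, -0.5])[:8]:
        print(state)
```

    In this assumed ring, only one variable heads toward its threshold at any time, so the trajectory keeps switching through the same six Boolean states even though every variable changes continuously.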

  2. Genetic Characterization of Circulating African Swine Fever Viruses in Nigeria (2007-2015).

    PubMed

    Luka, P D; Achenbach, J E; Mwiine, F N; Lamien, C E; Shamaki, D; Unger, H; Erume, J

    2017-10-01

    Sequencing and analysis of three discrete genome regions of African swine fever viruses (ASFV) from archival samples collected in 2007-2011 and active and passive surveillance between 2012 and 2015 in Nigeria were carried out. Analysis was conducted by genotyping of three single-copy African swine fever (ASF) genes. The E183L and B646L genes that encode structural proteins p54 and p72, respectively, were utilized to delineate genotypes before intragenotypic resolution by characterization of the tetrameric amino acid repeat region within the hypervariable central variable region of the B602L gene. The results showed no variation in the p72 and p54 gene regions sequenced. Phylogeny of p72 sequences revealed that all the Nigerian isolates belonged to genotype I, while that of the p54 recovered the Ia genotype. Analysis of B602L gene revealed the differences in the number of tetrameric repeats. Four new variants (Tet-15, Tet-17a, Tet-17b and Tet-48) were recovered, while a fifth variant (Tet-20) was the most widely distributed in the country displacing Tet-36 reported previously in 2003-2006. The viruses responsible for ASF outbreaks in Nigeria are from very closely related but mutated variants of the virus that have been circulating since 1997. A practical implication of the genetic variability of the Nigerian viral isolates in this study is the need for continuous sampling and analysis of circulating viruses, which will provide epidemiological information on the evolution of ASFV in the field versus new incursion for informed strategic control of the disease in the country. © 2016 Blackwell Verlag GmbH.

  3. Lifestyle Evolution in Cyanobacterial Symbionts of Sponges

    PubMed Central

    Burgsdorf, Ilia; Slaby, Beate M.; Handley, Kim M.; Haber, Markus; Blom, Jochen; Marshall, Christopher W.; Gilbert, Jack A.; Hentschel, Ute

    2015-01-01

    The “Candidatus Synechococcus spongiarum” group includes different clades of cyanobacteria with high 16S rRNA sequence identity (~99%) and is the most abundant and widespread cyanobacterial symbiont of marine sponges. The first draft genome of a “Ca. Synechococcus spongiarum” group member was recently published, providing evidence of genome reduction by loss of genes involved in several nonessential functions. However, “Ca. Synechococcus spongiarum” includes a variety of clades that may differ widely in genomic repertoire and consequently in physiology and symbiotic function. Here, we present three additional draft genomes of “Ca. Synechococcus spongiarum,” each from a different clade. By comparing all four symbiont genomes to those of free-living cyanobacteria, we revealed general adaptations to life inside sponges and specific adaptations of each phylotype. Symbiont genomes shared about half of their total number of coding genes. Common traits of “Ca. Synechococcus spongiarum” members were a high abundance of DNA modification and recombination genes and a reduction in genes involved in inorganic ion transport and metabolism, cell wall biogenesis, and signal transduction mechanisms. Moreover, these symbionts were characterized by a reduced number of antioxidant enzymes and low-weight peptides of photosystem II compared to their free-living relatives. Variability within the “Ca. Synechococcus spongiarum” group was mostly related to immune system features, potential for siderophore-mediated iron transport, and dependency on methionine from external sources. The common absence of genes involved in synthesis of residues, typical of the O antigen of free-living Synechococcus species, suggests a novel mechanism utilized by these symbionts to avoid sponge predation and phage attack. PMID:26037118

  4. Lifestyle Evolution in Cyanobacterial Symbionts of Sponges

    DOE PAGES

    Burgsdorf, Ilia; Slaby, Beate M.; Handley, Kim M.; ...

    2015-06-02

    The “Candidatus Synechococcus spongiarum” group includes different clades of cyanobacteria with high 16S rRNA sequence identity (~99%) and is the most abundant and widespread cyanobacterial symbiont of marine sponges. The first draft genome of a “Ca. Synechococcus spongiarum” group member was recently published, providing evidence of genome reduction by loss of genes involved in several nonessential functions. However, “Ca. Synechococcus spongiarum” includes a variety of clades that may differ widely in genomic repertoire and consequently in physiology and symbiotic function. Here, we present three additional draft genomes of “Ca. Synechococcus spongiarum,” each from a different clade. By comparing all four symbiont genomes to those of free-living cyanobacteria, we revealed general adaptations to life inside sponges and specific adaptations of each phylotype. Symbiont genomes shared about half of their total number of coding genes. Common traits of “Ca. Synechococcus spongiarum” members were a high abundance of DNA modification and recombination genes and a reduction in genes involved in inorganic ion transport and metabolism, cell wall biogenesis, and signal transduction mechanisms. Moreover, these symbionts were characterized by a reduced number of antioxidant enzymes and low-weight peptides of photosystem II compared to their free-living relatives. Variability within the “Ca. Synechococcus spongiarum” group was mostly related to immune system features, potential for siderophore-mediated iron transport, and dependency on methionine from external sources. The common absence of genes involved in synthesis of residues, typical of the O antigen of free-living Synechococcus species, suggests a novel mechanism utilized by these symbionts to avoid sponge predation and phage attack.

  5. Lifestyle Evolution in Cyanobacterial Symbionts of Sponges

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Burgsdorf, Ilia; Slaby, Beate M.; Handley, Kim M.

    The “Candidatus Synechococcus spongiarum” group includes different clades of cyanobacteria with high 16S rRNA sequence identity (~99%) and is the most abundant and widespread cyanobacterial symbiont of marine sponges. The first draft genome of a “Ca. Synechococcus spongiarum” group member was recently published, providing evidence of genome reduction by loss of genes involved in several nonessential functions. However, “Ca. Synechococcus spongiarum” includes a variety of clades that may differ widely in genomic repertoire and consequently in physiology and symbiotic function. Here, we present three additional draft genomes of “Ca. Synechococcus spongiarum,” each from a different clade. By comparing all four symbiont genomes to those of free-living cyanobacteria, we revealed general adaptations to life inside sponges and specific adaptations of each phylotype. Symbiont genomes shared about half of their total number of coding genes. Common traits of “Ca. Synechococcus spongiarum” members were a high abundance of DNA modification and recombination genes and a reduction in genes involved in inorganic ion transport and metabolism, cell wall biogenesis, and signal transduction mechanisms. Moreover, these symbionts were characterized by a reduced number of antioxidant enzymes and low-weight peptides of photosystem II compared to their free-living relatives. Variability within the “Ca. Synechococcus spongiarum” group was mostly related to immune system features, potential for siderophore-mediated iron transport, and dependency on methionine from external sources. The common absence of genes involved in synthesis of residues, typical of the O antigen of free-living Synechococcus species, suggests a novel mechanism utilized by these symbionts to avoid sponge predation and phage attack.

  6. Comprehensive Analysis of Mouse Bitter Taste Receptors Reveals Different Molecular Receptive Ranges for Orthologous Receptors in Mice and Humans

    PubMed Central

    Lossow, Kristina; Hübner, Sandra; Roudnitzky, Natacha; Slack, Jay P.; Pollastro, Federica; Behrens, Maik; Meyerhof, Wolfgang

    2016-01-01

    One key to animal survival is the detection and avoidance of potentially harmful compounds by their bitter taste. Variable numbers of taste 2 receptor genes expressed in the gustatory end organs enable bony vertebrates (Euteleostomi) to recognize numerous bitter chemicals. It is believed that the receptive ranges of bitter taste receptor repertoires match the profiles of bitter chemicals that the species encounter in their diets. Human and mouse genomes contain pairs of orthologous bitter receptor genes that have been conserved throughout evolution. Moreover, expansions in both lineages generated species-specific sets of bitter taste receptor genes. It is assumed that the orthologous bitter taste receptor genes mediate the recognition of bitter toxins relevant for both species, whereas the lineage-specific receptors enable the detection of substances differently encountered by mice and humans. By challenging 34 mouse bitter taste receptors with 128 prototypical bitter substances in a heterologous expression system, we identified cognate compounds for 21 receptors, 19 of which were previously orphan receptors. We have demonstrated that mouse taste 2 receptors, like their human counterparts, vary greatly in their breadth of tuning, ranging from very broadly to extremely narrowly tuned receptors. However, when compared with humans, mice possess fewer broadly tuned receptors and an elevated number of narrowly tuned receptors, supporting the idea that a large receptor repertoire is the basis for the evolution of specialized receptors. Moreover, we have demonstrated that sequence-orthologous bitter taste receptors have distinct agonist profiles. Species-specific gene expansions have enabled further diversification of bitter substance recognition spectra. PMID:27226572

  7. BASiCS: Bayesian Analysis of Single-Cell Sequencing Data

    PubMed Central

    Vallejos, Catalina A.; Marioni, John C.; Richardson, Sylvia

    2015-01-01

    Single-cell mRNA sequencing can uncover novel cell-to-cell heterogeneity in gene expression levels in seemingly homogeneous populations of cells. However, these experiments are prone to high levels of unexplained technical noise, creating new challenges for identifying genes that show genuine heterogeneous expression within the population of cells under study. BASiCS (Bayesian Analysis of Single-Cell Sequencing data) is an integrated Bayesian hierarchical model where: (i) cell-specific normalisation constants are estimated as part of the model parameters, (ii) technical variability is quantified based on spike-in genes that are artificially introduced to each analysed cell’s lysate and (iii) the total variability of the expression counts is decomposed into technical and biological components. BASiCS also provides an intuitive detection criterion for highly (or lowly) variable genes within the population of cells under study. This is formalised by means of tail posterior probabilities associated to high (or low) biological cell-to-cell variance contributions, quantities that can be easily interpreted by users. We demonstrate our method using gene expression measurements from mouse Embryonic Stem Cells. Cross-validation and meaningful enrichment of gene ontology categories within genes classified as highly (or lowly) variable supports the efficacy of our approach. PMID:26107944

  8. BASiCS: Bayesian Analysis of Single-Cell Sequencing Data.

    PubMed

    Vallejos, Catalina A; Marioni, John C; Richardson, Sylvia

    2015-06-01

    Single-cell mRNA sequencing can uncover novel cell-to-cell heterogeneity in gene expression levels in seemingly homogeneous populations of cells. However, these experiments are prone to high levels of unexplained technical noise, creating new challenges for identifying genes that show genuine heterogeneous expression within the population of cells under study. BASiCS (Bayesian Analysis of Single-Cell Sequencing data) is an integrated Bayesian hierarchical model where: (i) cell-specific normalisation constants are estimated as part of the model parameters, (ii) technical variability is quantified based on spike-in genes that are artificially introduced to each analysed cell's lysate and (iii) the total variability of the expression counts is decomposed into technical and biological components. BASiCS also provides an intuitive detection criterion for highly (or lowly) variable genes within the population of cells under study. This is formalised by means of tail posterior probabilities associated to high (or low) biological cell-to-cell variance contributions, quantities that can be easily interpreted by users. We demonstrate our method using gene expression measurements from mouse Embryonic Stem Cells. Cross-validation and meaningful enrichment of gene ontology categories within genes classified as highly (or lowly) variable supports the efficacy of our approach.

  9. Genetic variation in lipoprotein (a) levels in families enriched for coronary artery disease is determined almost entirely by the apolipoprotein (a) gene locus.

    PubMed Central

    DeMeester, C A; Bu, X; Gray, R J; Lusis, A J; Rotter, J I

    1995-01-01

    Lipoprotein (a) (Lp[a]) is a cholesterol-rich lipoprotein resembling LDL but also containing a large polypeptide designated apolipoprotein (a) (apo[a]). Its levels are highly variable among individuals and, in a number of studies, are strongly correlated with the risk of coronary artery disease (CAD). In an effort to determine which genes control Lp(a) levels, we have studied 25 multiplex families (comprising 298 members) enriched for CAD. The apo(a) gene was genotyped among the families, using a highly informative pulsed-field gel electrophoresis procedure. In addition, polymorphisms of the gene for the other major protein of Lp(a), apolipoprotein B (apoB), were examined. Quantitative sib-pair linkage analysis indicates that apo(a) is the major gene controlling Lp(a) levels in this CAD population (P = .001; 99 sib pairs), whereas the apoB gene demonstrated no significant quantitative linkage effect. We estimate that the apo(a) locus accounts for ≤98% of the variance of Lp(a) serum levels. Approximately 43% of this variation is explained by size polymorphisms within the apo(a) gene. These results indicate that the apo(a) gene is the major determinant of Lp(a) serum levels not only in the general population but also in a high-risk CAD population. PMID:7825589

  10. Effects of AAV-mediated knockdown of nNOS and GPx-1 gene expression in rat hippocampus after traumatic brain injury.

    PubMed

    Boone, Deborah R; Leek, Jeanna M; Falduto, Michael T; Torres, Karen E O; Sell, Stacy L; Parsley, Margaret A; Cowart, Jeremy C; Uchida, Tatsuo; Micci, Maria-Adelaide; DeWitt, Douglas S; Prough, Donald S; Hellmich, Helen L

    2017-01-01

    Virally mediated RNA interference (RNAi) to knock down injury-induced genes could improve functional outcome after traumatic brain injury (TBI); however, little is known about the consequences of gene knockdown on downstream cell signaling pathways and how RNAi influences neurodegeneration and behavior. Here, we assessed the effects of adeno-associated virus (AAV) siRNA vectors that target two genes with opposing roles in TBI pathogenesis: the allegedly detrimental neuronal nitric oxide synthase (nNOS) and the potentially protective glutathione peroxidase 1 (GPx-1). In rat hippocampal progenitor cells, three siRNAs that target different regions of each gene (nNOS, GPx-1) effectively knocked down gene expression. However, in vivo, in our rat model of fluid percussion brain injury, the consequences of AAV-siRNA were variable. One nNOS siRNA vector significantly reduced the number of degenerating hippocampal neurons and showed a tendency to improve working memory. GPx-1 siRNA treatment did not alter TBI-induced neurodegeneration or working memory deficits. Nevertheless, microarray analysis of laser captured, virus-infected neurons showed that knockdown of nNOS or GPx-1 was specific and had broad effects on downstream genes. Since nNOS knockdown only modestly ameliorated TBI-induced working memory deficits, despite widespread genomic changes, manipulating expression levels of single genes may not be sufficient to alter functional outcome after TBI.

  11. Occurrence of the structural enterocin A, P, B, L50B genes in enterococci of different origin.

    PubMed

    Strompfová, Viola; Lauková, Andrea; Simonová, Monika; Marcináková, Miroslava

    2008-12-10

    Enterococci are well-known producers of antimicrobial peptides, the bacteriocins (enterocins), and the number of characterized enterocins has increased significantly. Recently, enterocins have attracted great interest for their potential as biopreservatives in food or feed, while research on enterocins as alternative antimicrobials in humans and animals is only beginning. The present study surveys the occurrence of the enterocin structural genes A, P, B and L50B, determined by PCR, in a set of 427 strains of Enterococcus faecium (368) and Enterococcus faecalis (59) from different sources (animal isolates, food and feed). Based on our results, 234 strains possessed one or more enterocin structural gene(s). The genes of enterocin P and enterocin A were the most frequently detected structural genes among the PCR-positive strains (170 and 155 strains, respectively). The frequency of enterocin genes differed according to strain origin; strains from horses and silage showed the highest frequency. All possible combinations of the tested genes occurred at least twice except the combination of the enterocin B and L50B genes, which no strain possessed. The gene of enterocin A was detected exclusively among E. faecium strains, while the genes of enterocin P, B and L50B were detected in strains of both species, E. faecium and E. faecalis. In conclusion, enterocin structural genes occur at high frequency and with high variability among enterococci of different origin, which offers a good chance of finding effective bacteriocin-producing strains for application in veterinary medicine.

  12. Nucleotide variability at its limit? Insights into the number and evolutionary dynamics of the sex-determining specificities of the honey bee Apis mellifera.

    PubMed

    Lechner, Sarah; Ferretti, Luca; Schöning, Caspar; Kinuthia, Wanja; Willemsen, David; Hasselmann, Martin

    2014-02-01

    Deciphering the evolutionary processes driving nucleotide variation in multiallelic genes is limited by the number of genetic systems in which such genes occur. The complementary sex determiner (csd) gene in the honey bee Apis mellifera is an informative example for studying allelic diversity and the underlying evolutionary forces in a well-described model of balancing selection. Acting as the primary signal of sex determination, diploid individuals heterozygous for csd develop into females, whereas csd homozygotes are diploid males that have zero fitness. Examining 77 of the functional heterozygous csd allele pairs, we established combinatorial criteria that provide insights into the minimum number of amino acid differences among those pairs. Given a data set of 244 csd sequences, we show that the total number of csd alleles found in A. mellifera ranges from 53 (locally) to 87 (worldwide), which is much higher than was previously reported (20). Using a coupon-collector model, we extrapolate the presence of a total of 116-145 csd alleles worldwide. The hypervariable region (HVR) is of particular importance in determining csd allele specificity, and we provide for this region evidence of high evolutionary rate for length differences exceeding those of microsatellites. The proportion of amino acids driven by positive selection and the rate of nonsynonymous substitutions in the HVR-flanking regions reach values close to 1 but differ with respect to the HVR length. Using a model of csd coalescence, we identified the high originating rate of csd specificities as a major evolutionary force, leading to an origin of a novel csd allele every 400,000 years. The csd polymorphism frequencies in natural populations indicate an excess of new mutations, whereas signs of ancestral transspecies polymorphism can still be detected. This study provides a comprehensive view of the enormous diversity and the evolutionary forces shaping a multiallelic gene.
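
    As a rough illustration of the coupon-collector reasoning mentioned in this abstract, the Python sketch below estimates a total number of alleles from a sample. It assumes that all alleles are equally frequent and that the sequences are independent draws. These assumptions, the function names, and the use of 87 distinct alleles among 244 sequences as inputs are chosen here for illustration only; this is not the estimator used in the study, and its result need not agree with the reported range.

```python
def expected_distinct(total_alleles: float, sample_size: int) -> float:
    """Expected number of distinct alleles seen in `sample_size` draws
    when `total_alleles` equally frequent alleles exist (assumption)."""
    return total_alleles * (1.0 - (1.0 - 1.0 / total_alleles) ** sample_size)


def estimate_total_alleles(observed: int, sample_size: int, tol: float = 1e-6) -> float:
    """Solve expected_distinct(K, sample_size) == observed for K by bisection.

    Hypothetical helper: valid only under the equal-frequency assumption.
    """
    if not 0 < observed < sample_size:
        raise ValueError("need 0 < observed < sample_size")
    lo = float(observed)          # at K = observed the expectation is below observed
    hi = 2.0 * observed
    while expected_distinct(hi, sample_size) < observed:
        hi *= 2.0                 # widen the bracket until it contains the root
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if expected_distinct(mid, sample_size) < observed:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    # Illustrative inputs taken from the abstract's counts.
    print(estimate_total_alleles(observed=87, sample_size=244))
```

    Unequal allele frequencies would push such an estimate upward, which is one reason a simple equal-frequency calculation should be read as a lower-end sketch rather than a reproduction of the published figure.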

  13. Nucleotide Variability at Its Limit? Insights into the Number and Evolutionary Dynamics of the Sex-Determining Specificities of the Honey Bee Apis mellifera

    PubMed Central

    Lechner, Sarah; Ferretti, Luca; Schöning, Caspar; Kinuthia, Wanja; Willemsen, David; Hasselmann, Martin

    2014-01-01

    Deciphering the evolutionary processes driving nucleotide variation in multiallelic genes is limited by the number of genetic systems in which such genes occur. The complementary sex determiner (csd) gene in the honey bee Apis mellifera is an informative example for studying allelic diversity and the underlying evolutionary forces in a well-described model of balancing selection. Acting as the primary signal of sex determination, diploid individuals heterozygous for csd develop into females, whereas csd homozygotes are diploid males that have zero fitness. Examining 77 of the functional heterozygous csd allele pairs, we established combinatorial criteria that provide insights into the minimum number of amino acid differences among those pairs. Given a data set of 244 csd sequences, we show that the total number of csd alleles found in A. mellifera ranges from 53 (locally) to 87 (worldwide), which is much higher than was previously reported (20). Using a coupon-collector model, we extrapolate the presence of a total of 116–145 csd alleles worldwide. The hypervariable region (HVR) is of particular importance in determining csd allele specificity, and we provide for this region evidence of high evolutionary rate for length differences exceeding those of microsatellites. The proportion of amino acids driven by positive selection and the rate of nonsynonymous substitutions in the HVR-flanking regions reach values close to 1 but differ with respect to the HVR length. Using a model of csd coalescence, we identified the high originating rate of csd specificities as a major evolutionary force, leading to an origin of a novel csd allele every 400,000 years. The csd polymorphism frequencies in natural populations indicate an excess of new mutations, whereas signs of ancestral transspecies polymorphism can still be detected. This study provides a comprehensive view of the enormous diversity and the evolutionary forces shaping a multiallelic gene. PMID:24170493

  14. Latent feature decompositions for integrative analysis of multi-platform genomic data

    PubMed Central

    Gregory, Karl B.; Momin, Amin A.; Coombes, Kevin R.; Baladandayuthapani, Veerabhadran

    2015-01-01

    Increased availability of multi-platform genomics data on matched samples has sparked research efforts to discover how diverse molecular features interact both within and between platforms. In addition, simultaneous measurements of genetic and epigenetic characteristics illuminate the roles their complex relationships play in disease progression and outcomes. However, integrative methods for diverse genomics data are faced with the challenges of ultra-high dimensionality and the existence of complex interactions both within and between platforms. We propose a novel modeling framework for integrative analysis based on decompositions of the large number of platform-specific features into a smaller number of latent features. Subsequently we build a predictive model for clinical outcomes accounting for both within- and between-platform interactions based on Bayesian model averaging procedures. Principal components, partial least squares and non-negative matrix factorization as well as sparse counterparts of each are used to define the latent features, and the performance of these decompositions is compared both on real and simulated data. The latent feature interactions are shown to preserve interactions between the original features and not only aid prediction but also allow explicit selection of outcome-related features. The methods are motivated by and applied to, a glioblastoma multiforme dataset from The Cancer Genome Atlas to predict patient survival times integrating gene expression, microRNA, copy number and methylation data. For the glioblastoma data, we find a high concordance between our selected prognostic genes and genes with known associations with glioblastoma. In addition, our model discovers several relevant cross-platform interactions such as copy number variation associated gene dosing and epigenetic regulation through promoter methylation. 
On simulated data, we show that our proposed method successfully incorporates interactions within and between genomic platforms to aid accurate prediction and variable selection. Our methods perform best when principal components are used to define the latent features. PMID:26146492

  15. Functional regression method for whole genome eQTL epistasis analysis with sequencing data.

    PubMed

    Xu, Kelin; Jin, Li; Xiong, Momiao

    2017-05-18

    Epistasis plays an essential role in understanding the regulation mechanisms and is an essential component of the genetic architecture of the gene expressions. However, interaction analysis of gene expressions remains fundamentally unexplored due to great computational challenges and data availability. Due to variation in splicing, transcription start sites, polyadenylation sites, post-transcriptional RNA editing across the entire gene, and transcription rates of the cells, RNA-seq measurements generate large expression variability and collectively create the observed position-level read count curves. A single number for measuring gene expression, which is widely used for microarray-measured gene expression analysis, is highly unlikely to sufficiently account for large expression variation across the gene. Simultaneously analyzing epistatic architecture using the RNA-seq and whole genome sequencing (WGS) data poses enormous challenges. We develop a nonlinear functional regression model (FRGM) with functional responses where the position-level read counts within a gene are taken as a function of genomic position, and functional predictors where genotype profiles are viewed as a function of genomic position, for epistasis analysis with RNA-seq data. Instead of testing the interaction of all possible pairwise SNPs, the FRGM takes a gene as a basic unit for epistasis analysis, which tests for the interaction of all possible pairs of genes and uses all the information that can be accessed to collectively test interaction between all possible pairs of SNPs within two genome regions. By large-scale simulations, we demonstrate that the proposed FRGM for epistasis analysis can achieve the correct type 1 error and has higher power to detect the interactions between genes than the existing methods. The proposed methods are applied to the RNA-seq and WGS data from the 1000 Genomes Project. 
The numbers of pairs of significantly interacting genes after Bonferroni correction identified using FRGM, RPKM and DESeq were 16,2361, 260 and 51, respectively, from the 350 European samples. The proposed FRGM for epistasis analysis of RNA-seq can capture isoform and position-level information and will have broad application. Both simulations and real data analysis highlight the potential for the FRGM to be a good choice for epistasis analysis with sequencing data.

  16. Alcohol exposure alters DNA methylation profiles in mouse embryos at early neurulation

    PubMed Central

    Liu, Yunlong; Balaraman, Yokesh; Wang, Guohua; Nephew, Kenneth P.; Zhou, Feng C.

    2009-01-01

    Alcohol exposure during development can cause variable neurofacial deficit and growth retardation known as fetal alcohol spectrum disorders (FASD). The mechanism underlying FASD is not fully understood. However, alcohol, which is known to affect methyl donor metabolism, may induce aberrant epigenetic changes contributing to FASD. Using a tightly controlled whole-embryo culture, we investigated the effect of alcohol exposure (88 mM) at early embryonic neurulation on genome-wide DNA methylation and gene expression in the C57BL/6 mouse. The DNA methylation landscape around promoter CpG islands at early mouse development was analyzed using MeDIP (methylated DNA immunoprecipitation) coupled with microarray (MeDIP-chip). At early neurulation, genes associated with high CpG promoters (HCP) had a lower ratio of methylation but a greater ratio of expression. Alcohol-induced alterations in DNA methylation were observed, particularly in genes on chromosomes 7, 10 and X; remarkably, a >10 fold increase in the number of genes with increased methylation on chromosomes 10 and X was observed in alcohol-exposed embryos with a neural tube defect phenotype compared to embryos without a neural tube defect. Significant changes in methylation were seen in imprinted genes, genes known to play roles in cell cycle, growth, apoptosis, cancer, and in a large number of genes associated with olfaction. Altered methylation was associated with significant (p < 0.01) changes in expression for 84 genes. Sequenom EpiTYPER DNA methylation analysis was used for validation of the MeDIP-chip data. Increased methylation of genes known to play a role in metabolism (Cyp4f13) and decreased methylation of genes associated with development (Nlgn3, Elavl2, Sox21 and Sim1), imprinting (Igf2r) and chromatin (Hist1h3d) was confirmed. 
In a mouse model for FASD, we show for the first time that alcohol exposure during early neurulation can induce aberrant changes in DNA methylation patterns with associated changes in gene expression, which together may contribute to the observed abnormal fetal development. PMID:20009564

  17. Alcohol exposure alters DNA methylation profiles in mouse embryos at early neurulation.

    PubMed

    Liu, Yunlong; Balaraman, Yokesh; Wang, Guohua; Nephew, Kenneth P; Zhou, Feng C

    2009-10-01

    Alcohol exposure during development can cause variable neurofacial deficit and growth retardation known as fetal alcohol spectrum disorders (FASD). The mechanism underlying FASD is not fully understood. However, alcohol, which is known to affect methyl donor metabolism, may induce aberrant epigenetic changes contributing to FASD. Using a tightly controlled whole-embryo culture, we investigated the effect of alcohol exposure (88mM) at early embryonic neurulation on genome-wide DNA methylation and gene expression in the C57BL/6 mouse. The DNA methylation landscape around promoter CpG islands at early mouse development was analyzed using MeDIP (methylated DNA immunoprecipitation) coupled with microarray (MeDIP-chip). At early neurulation, genes associated with high CpG promoters (HCP) had a lower ratio of methylation but a greater ratio of expression. Alcohol-induced alterations in DNA methylation were observed, particularly in genes on chromosomes 7, 10, and X; remarkably, a >10 fold increase in the number of genes with increased methylation on chromosomes 10 and X was observed in alcohol-exposed embryos with a neural tube defect phenotype compared to embryos without a neural tube defect. Significant changes in methylation were seen in imprinted genes, genes known to play roles in cell cycle, growth, apoptosis, cancer, and in a large number of genes associated with olfaction. Altered methylation was associated with significant (p<0.01) changes in expression for 84 genes. Sequenom EpiTYPER DNA methylation analysis was used for validation of the MeDIP-chip data. Increased methylation of genes known to play a role in metabolism (Cyp4f13) and decreased methylation of genes associated with development (Nlgn3, Elavl2, Sox21 and Sim1), imprinting (Igf2r) and chromatin (Hist1h3d) was confirmed. 
In a mouse model for FASD, we show for the first time that alcohol exposure during early neurulation can induce aberrant changes in DNA methylation patterns with associated changes in gene expression, which together may contribute to the observed abnormal fetal development.

  18. GeoChip-Based Analysis of the Functional Gene Diversity and Metabolic Potential of Microbial Communities in Acid Mine Drainage

    PubMed Central

    Xie, Jianping; He, Zhili; Liu, Xinxing; Liu, Xueduan; Van Nostrand, Joy D.; Deng, Ye; Wu, Liyou; Zhou, Jizhong; Qiu, Guanzhou

    2011-01-01

    Acid mine drainage (AMD) is an extreme environment, usually with low pH and high concentrations of metals. Although the phylogenetic diversity of AMD microbial communities has been examined extensively, little is known about their functional gene diversity and metabolic potential. In this study, a comprehensive functional gene array (GeoChip 2.0) was used to analyze the functional diversity, composition, structure, and metabolic potential of AMD microbial communities from three copper mines in China. GeoChip data indicated that these microbial communities were functionally diverse as measured by the number of genes detected, gene overlapping, unique genes, and various diversity indices. Almost all key functional gene categories targeted by GeoChip 2.0 were detected in the AMD microbial communities, including carbon fixation, carbon degradation, methane generation, nitrogen fixation, nitrification, denitrification, ammonification, nitrogen reduction, sulfur metabolism, metal resistance, and organic contaminant degradation, which suggested that the functional gene diversity was higher than was previously thought. Mantel test results indicated that AMD microbial communities are shaped largely by surrounding environmental factors (e.g., S, Mg, and Cu). Functional genes (e.g., narG and norB) and several key functional processes (e.g., methane generation, ammonification, denitrification, sulfite reduction, and organic contaminant degradation) were significantly (P < 0.10) correlated with environmental variables. This study presents an overview of functional gene diversity and the structure of AMD microbial communities and also provides insights into our understanding of metabolic potential in AMD ecosystems. PMID:21097602

  19. A Robust Unified Approach to Analyzing Methylation and Gene Expression Data

    PubMed Central

    Khalili, Abbas; Huang, Tim; Lin, Shili

    2009-01-01

    Microarray technology has made it possible to investigate expression levels, and more recently methylation signatures, of thousands of genes simultaneously, in a biological sample. Since more and more data from different biological systems or technological platforms are being generated at an incredible rate, there is an increasing need to develop statistical methods that are applicable to multiple data types and platforms. Motivated by such a need, a flexible finite mixture model that is applicable to methylation, gene expression, and potentially data from other biological systems, is proposed. Two major thrusts of this approach are to allow for a variable number of components in the mixture to capture non-biological variation and small biases, and to use a robust procedure for parameter estimation and probe classification. The method was applied to the analysis of methylation signatures of three breast cancer cell lines. It was also tested on three sets of expression microarray data to study its power and type I error rates. Comparison with a number of existing methods in the literature yielded very encouraging results; lower type I error rates and comparable/better power were achieved based on the limited study. Furthermore, the method also leads to more biologically interpretable results for the three breast cancer cell lines. PMID:20161265

  20. Interactions between Early Parenting and a Polymorphism of the Child’s Dopamine Transporter Gene in Predicting Future Child Conduct Disorder Symptoms

    PubMed Central

    Lahey, Benjamin B.; Rathouz, Paul J.; Lee, Steve S.; Chronis-Tuscano, Andrea; Pelham, William E.; Waldman, Irwin D.; Cook, Edwin H.

    2010-01-01

    Mounting evidence suggests that genetic risks for mental disorders often interact with the social environment, but most studies still ignore environmental moderation of genetic influences. We tested interactions between maternal parenting and the variable number tandem repeat (VNTR) polymorphism in the 3′ untranslated region (UTR) of the dopamine transporter gene in the child to increase understanding of gene-environment interactions involving early parenting. Participants were part of a 9-year longitudinal study of 4–6-year-old children who met criteria for attention-deficit/hyperactivity disorder (ADHD) and demographically matched controls. Maternal parenting was observed during standard mother-child interactions in wave 1. The child’s conduct disorder (CD) symptoms 5–8 years later were measured using separate structured diagnostic interviews of the mother and youth. Controlling for ADHD symptoms and child disruptive behavior during the mother-child interaction, there was a significant inverse relation between levels of both positive and negative parenting at 4–6 years and the number of later CD symptoms, but primarily among children with two copies of the 9-repeat allele of the VNTR. The significant interaction with negative parenting was replicated in parent and youth reports of CD symptoms separately. PMID:21171728

  1. The Escherichia coli Cpx envelope stress response regulates genes of diverse function that impact antibiotic resistance and membrane integrity.

    PubMed

    Raivio, Tracy L; Leblanc, Shannon K D; Price, Nancy L

    2013-06-01

    The Cpx envelope stress response mediates adaptation to stresses that cause envelope protein misfolding. Adaptation is partly conferred through increased expression of protein folding and degradation factors. The Cpx response also plays a conserved role in the regulation of virulence determinant expression and impacts antibiotic resistance. We sought to identify adaptive mechanisms that may be involved in these important functions by characterizing changes in the transcriptome of two different Escherichia coli strains when the Cpx response is induced. We show that, while there is considerable strain- and condition-specific variability in the Cpx response, the regulon is enriched for proteins and functions that are inner membrane associated under all conditions. Genes that were changed by Cpx pathway induction under all conditions were involved in a number of cellular functions and included several intergenic regions, suggesting that posttranscriptional regulation is important during Cpx-mediated adaptation. Some Cpx-regulated genes are centrally involved in energetics and play a role in antibiotic resistance. We show that a number of small, uncharacterized envelope proteins are Cpx regulated and at least two of these affect phenotypes associated with membrane integrity. Altogether, our work suggests new mechanisms of Cpx-mediated envelope stress adaptation and antibiotic resistance.

  2. Alternative-splicing-mediated gene expression

    NASA Astrophysics Data System (ADS)

    Wang, Qianliang; Zhou, Tianshou

    2014-01-01

    Alternative splicing (AS) is a fundamental process during gene expression and has been found to be ubiquitous in eukaryotes. However, how AS impacts gene expression levels both quantitatively and qualitatively remains to be fully explored. Here, we analyze two common models of gene expression, each incorporating a simple splicing mechanism in which a pre-mRNA is spliced into two mature mRNA isoforms in a probabilistic manner. In the constitutive expression case, we show that the steady-state molecular numbers of the two mature mRNA isoforms follow mutually independent Poisson distributions. In the bursting expression case, we demonstrate that the tail decay of the steady-state distribution of both mature mRNA isoforms, which in general are not mutually independent, can be characterized by the product of mean burst size and splicing probability. In both cases, we find that AS can efficiently modulate both the variability (measured by variance) and the noise level of the total mature mRNA, and in particular, the latter is always lower than the noise level of the pre-mRNA, implying that AS always reduces the noise. Altogether, these results reveal that AS is a mechanism for efficiently controlling gene expression noise.
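
    The independence claim for the constitutive case can be made plausible with a simplified static analogue, sketched below in Python: if a Poisson-distributed pool of transcripts is split molecule by molecule into two isoforms with a fixed probability, the joint distribution of the two counts factorises into two Poisson distributions. This is only an illustration of Poisson thinning; it is not the dynamic model analysed in the paper, and the function names and the parameter values (mean 8, splicing probability 0.3) are assumptions made here.

```python
from math import comb, exp, factorial


def poisson_pmf(k: int, mean: float) -> float:
    """Probability that a Poisson(mean) variable equals k."""
    return exp(-mean) * mean ** k / factorial(k)


def joint_isoform_pmf(a: int, b: int, mean_total: float, q: float) -> float:
    """P(isoform1 = a, isoform2 = b) when a Poisson(mean_total) pool is split,
    each molecule becoming isoform 1 with probability q (binomial thinning)."""
    n = a + b
    return poisson_pmf(n, mean_total) * comb(n, a) * q ** a * (1.0 - q) ** b


def max_factorisation_error(mean_total: float, q: float, upto: int = 15) -> float:
    """Largest gap between the joint pmf and the product of two independent
    Poisson pmfs with means mean_total*q and mean_total*(1-q)."""
    worst = 0.0
    for a in range(upto):
        for b in range(upto):
            joint = joint_isoform_pmf(a, b, mean_total, q)
            product = (poisson_pmf(a, mean_total * q)
                       * poisson_pmf(b, mean_total * (1.0 - q)))
            worst = max(worst, abs(joint - product))
    return worst


if __name__ == "__main__":
    # Hypothetical parameters; the gap should be at floating-point rounding level.
    print(max_factorisation_error(mean_total=8.0, q=0.3))
```

    The bursting case does not factorise in this way, consistent with the abstract's remark that the isoforms are then in general not mutually independent.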

  3. Practical applications of the bioinformatics toolbox for narrowing quantitative trait loci.

    PubMed

    Burgess-Herbert, Sarah L; Cox, Allison; Tsaih, Shirng-Wern; Paigen, Beverly

    2008-12-01

    Dissecting the genes involved in complex traits can be confounded by multiple factors, including extensive epistatic interactions among genes, the involvement of epigenetic regulators, and the variable expressivity of traits. Although quantitative trait locus (QTL) analysis has been a powerful tool for localizing the chromosomal regions underlying complex traits, systematically identifying the causal genes remains challenging. Here, through its application to plasma levels of high-density lipoprotein cholesterol (HDL) in mice, we demonstrate a strategy for narrowing QTL that utilizes comparative genomics and bioinformatics techniques. We show how QTL detected in multiple crosses are subjected to both combined cross analysis and haplotype block analysis; how QTL from one species are mapped to the concordant regions in another species; and how genomewide scans associating haplotype groups with their phenotypes can be used to prioritize the narrowed regions. Then we illustrate how these individual methods for narrowing QTL can be systematically integrated for mouse chromosomes 12 and 15, resulting in a significantly reduced number of candidate genes, often from hundreds to <10. Finally, we give an example of how additional bioinformatics resources can be combined with experiments to determine the most likely quantitative trait genes.

  4. The Genetics of Pulmonary Arterial Hypertension

    PubMed Central

    Austin, Eric D.; Loyd, James E.

    2014-01-01

    Pulmonary arterial hypertension (PAH) is a progressive and fatal disease for which there is an ever-expanding body of genetic and related pathophysiological information on disease pathogenesis. A number of germline gene mutations have now been described, including mutations in the gene coding bone morphogenic protein receptor type 2 (BMPR2) and related genes. Recent advanced gene sequencing methods have facilitated the discovery of additional genes with mutations among those with and without familial forms of PAH (CAV1, KCNK3, EIF2AK4). The reduced penetrance, variable expressivity, and female predominance of PAH suggest that genetic, genomic and other factors modify disease expression. These multi-faceted variations are an active area of investigation in the field, including but not limited to common genetic variants and epigenetic processes, and may provide novel opportunities for pharmacologic intervention in the near future. They also highlight the need for a systems-oriented multi-level approach to incorporate the multitude of biologic variations now associated with PAH. Ultimately, improved understanding provides the opportunity for improved patient and family counseling about this devastating disease, but this requires in-depth understanding of the genetic factors relevant to PAH. PMID:24951767

  5. Exclusion of the APC gene as the cause of a variant form of familial adenomatous polyposis (FAP)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Stella, A.; Resta, N.; Susca, F.

    Familial adenomatous polyposis (FAP) is a premalignant disease inherited as an autosomal dominant trait, characterized by hundreds to thousands of polyps in the colorectal tract. Recently, the syndrome has been shown to be caused by mutations in the APC (adenomatous polyposis coli) gene located on chromosome 5q21. The authors studied two families that both presented a phenotype different from that of the classical form of FAP. The most important findings observed in these two kindreds are (a) low and variable number of colonic polyps (from 5 to 100) and (b) a slower evolution of the disease, with colon cancer occurring at a more advanced age than in FAP in spite of the early onset of intestinal manifestations. To determine whether mutations of the APC gene are also responsible for this variant syndrome, linkage studies were performed by using a series of markers both intragenic and tightly linked to the APC gene. The results provide evidence for exclusion of the APC gene as the cause of the variant form of polyposis present in the two families described. 30 refs., 1 fig., 1 tab.

  6. Study on the association between drug‑resistance and gene mutations of the active efflux pump acrAB‑tolC gene and its regulatory genes.

    PubMed

    Ma, Quan-Ping; Su, Liang; Liu, Jing-Wen; Yao, Ming-Xiao; Yuan, Guang-Ying

    2018-06-01

    The aim of the present study was to investigate the correlation between the multi‑drug resistance of Shigella flexneri and the drug‑resistant gene cassette carried by integrons, and to detect the associations between drug‑resistance and gene mutations of the active efflux pump acrAB‑tolC gene and its regulatory genes, including marOR, acrR and soxS. A total of 158 isolates were isolated from the stool samples of 1,026 children with diarrhoea aged 14 years old between May 2012 and October 2015 in Henan. The K‑B method was applied for the determination of drug resistance of Shigella flexneri, and polymerase chain reaction amplification was used for class 1, 2 and 3 integrase genes. Enzyme digestion and sequence analysis were performed for the variable regions of positive strains. Based on the drug sensitivity assessment, multi‑drug resistant strains that were resistant to five or more antibiotics, and sensitive strains were selected for amplification. Their active efflux pump genes, acrA and acrB, and regulatory genes, marOR, acrR and soxS, were selected for sequencing. The results revealed that 91.1% of the 158 strains were multi‑resistant to ampicillin, chloramphenicol, tetracycline and streptomycin, and 69.6% of the strains were multi‑resistant to sulfamethoxazole/trimethoprim. The resistance to ceftazidime, ciprofloxacin and levofloxacin was <32.9%. All strains (100%) were sensitive to cefoxitin, cefoperazone/sulbactam and imipenem. The rate of the class 1 integron positivity was 91.9% (144/158). Among these class 1 integron‑positive strains, 18 strains exhibited the resistance gene cassette dfrV in the variable region of the strain, four strains exhibited dfrA17‑aadA5 in the variable region and 140 strains exhibited blaOXA‑30‑aadA1 in the variable region. Four strains showed no resistance gene in the variable regions. The rate of class 2 integron positivity was 86.1% (136/158), and all positive strains harboured the dfrA1‑sat1‑aadA resistance gene cassette in the variable region. The class 3 integrase gene was not detected in these strains. The gene sequencing showed the deletion of base CATT in the 36, 37, 38, 39 site in the marOR gene, which is a regulatory gene of the active efflux pump, AcrAB‑TolC. Taken together, the multi‑drug resistance of Shigella flexneri was closely associated with gene mutations of class 1 and 2 integrons and the marOR gene.

  7. Increased Grik4 Gene Dosage Causes Imbalanced Circuit Output and Human Disease-Related Behaviors.

    PubMed

    Arora, Vineet; Pecoraro, Valeria; Aller, M Isabel; Román, Celia; Paternain, Ana V; Lerma, Juan

    2018-06-26

    Altered glutamatergic neurotransmission is thought to contribute to mental disorders and neurodegenerative diseases. Copy-number variation in genes associated with glutamatergic synapses represents a source of genetic variability, possibly underlying neurological and mental disease susceptibility. The GRIK4 gene encodes a high-affinity kainate receptor subunit of essentially unknown function, although de novo duplication of the 11q23.3-q24.1 locus to which it maps has been detected in autism and other disorders. To determine how changes in the dose of Grik4 affect synaptic activity, we studied mice overexpressing this gene in the forebrain. A mild gain in Grik4 enhances synaptic transmission, causing a persistent imbalance in inhibitory and excitatory activity and disturbing the circuits responsible for the main amygdala outputs. These changes in glutamatergic activity reverse when Grik4 levels are normalized; thus, they may account for the behavioral abnormalities in disorders like autism or schizophrenia. Copyright © 2018 Agencia Estatal Consejo Superior de Investigaciones Científicas. Published by Elsevier Inc. All rights reserved.

  8. Characterization of extensively drug-resistant Mycobacterium tuberculosis in Nepal.

    PubMed

    Poudel, Ajay; Maharjan, Bhagwan; Nakajima, Chie; Fukushima, Yukari; Pandey, Basu D; Beneke, Antje; Suzuki, Yasuhiko

    2013-01-01

    The emergence of extensively drug-resistant tuberculosis (XDR-TB) has raised public health concern for global control of TB. Although molecular characterization of drug resistance-associated mutations in multidrug-resistant isolates in Nepal has been made, mutations in XDR isolates and their genotypes have not been reported previously. In this study, we identified and characterized 13 XDR Mycobacterium tuberculosis isolates from clinical isolates in Nepal. The most prevalent mutations involved in rifampicin, isoniazid, ofloxacin, and kanamycin/capreomycin resistance were Ser531Leu in rpoB gene (92.3%), Ser315Thr in katG gene (92.3%), Asp94Gly in gyrA gene (53.9%) and A1400G in rrs gene (61.5%), respectively. Spoligotyping and multilocus sequence typing revealed that 69% belonged to Beijing family, especially modern types. Further typing with 26-loci variable number of tandem repeats suggested the current spread of XDR M. tuberculosis. Our result highlights the need to reinforce the TB policy in Nepal with regard to control and detection strategies. Copyright © 2012 Elsevier Ltd. All rights reserved.

  9. Recent amplification and impact of MITEs on the genome of grapevine (Vitis vinifera L.)

    PubMed Central

    Benjak, Andrej; Boué, Stéphanie; Forneck, Astrid

    2009-01-01

    Miniature inverted-repeat transposable elements (MITEs) are a particular type of defective class II transposons present in genomes as highly homogeneous populations of small elements. Their high copy number and close association to genes make their potential impact on gene evolution particularly relevant. Here, we present a detailed analysis of the MITE families directly related to grapevine “cut-and-paste” transposons. Our results show that grapevine MITEs have transduplicated and amplified genomic sequences, including gene sequences and fragments of other mobile elements. Our results also show that although some of the MITE families were already present in the ancestor of the European and American Vitis wild species, they have been amplified and have been actively transposing accompanying grapevine domestication and breeding. We show that MITEs are abundant in grapevine and some of them are frequently inserted within the untranslated regions of grapevine genes. MITE insertions are highly polymorphic among grapevine cultivars, which frequently generate transcript variability. The data presented here show that MITEs have greatly contributed to the grapevine genetic diversity which has been used for grapevine domestication and breeding. PMID:20333179

  10. Genomic Analysis of Differentiation between Soil Types Reveals Candidate Genes for Local Adaptation in Arabidopsis lyrata

    PubMed Central

    Turner, Thomas L.; von Wettberg, Eric J.; Nuzhdin, Sergey V.

    2008-01-01

    Serpentine soil, which is naturally high in heavy metal content and has low calcium to magnesium ratios, comprises a difficult environment for most plants. An impressive number of species are endemic to serpentine, and a wide range of non-endemic plant taxa have been shown to be locally adapted to these soils. Locating genomic polymorphisms which are differentiated between serpentine and non-serpentine populations would provide candidate loci for serpentine adaptation. We have used the Arabidopsis thaliana tiling array, which has 2.85 million probes throughout the genome, to measure genetic differentiation between populations of Arabidopsis lyrata growing on granitic soils and those growing on serpentinic soils. The significant overrepresentation of genes involved in ion transport and other functions provides a starting point for investigating the molecular basis of adaptation to soil ion content, water retention, and other ecologically and economically important variables. One gene in particular, calcium-exchanger 7, appears to be an excellent candidate gene for adaptation to low Ca∶Mg ratio in A. lyrata. PMID:18784841

  11. [Gene method for inconsistent hydrological frequency calculation. I: Inheritance, variability and evolution principles of hydrological genes].

    PubMed

    Xie, Ping; Wu, Zi Yi; Zhao, Jiang Yan; Sang, Yan Fang; Chen, Jie

    2018-04-01

    A stochastic hydrological process is influenced by both stochastic and deterministic factors. A hydrological time series contains not only pure random components reflecting its inheritance characteristics, but also deterministic components reflecting variability characteristics, such as jump, trend, period, and stochastic dependence. As a result, the stochastic hydrological process presents complicated evolution phenomena and rules. To better understand these complicated phenomena and rules, this study described the inheritance and variability characteristics of an inconsistent hydrological series from two aspects: stochastic process simulation and time series analysis. In addition, several frequency analysis approaches for inconsistent time series were compared to reveal the main problems in inconsistency study. Then, we proposed a new concept of hydrological genes, derived from biological genes, to describe the inconsistent hydrological processes. The hydrological genes were constructed using moments methods, such as general moments, weight function moments, probability weight moments and L-moments. Meanwhile, the five components of a stochastic hydrological process (jump, trend, periodic, dependence and pure random components) were defined as five hydrological bases. With this method, the inheritance and variability of inconsistent hydrological time series were synthetically considered and the inheritance, variability and evolution principles were fully described. Our study helps to reveal the inheritance, variability and evolution principles in the probability distribution of hydrological elements.

  12. A novel 'splice site' HCN4 Gene mutation, c.1737+1 G>T, causes familial bradycardia, reduced heart rate response, impaired chronotropic competence and increased short-term heart rate variability.

    PubMed

    Hategan, Lidia; Csányi, Beáta; Ördög, Balázs; Kákonyi, Kornél; Tringer, Annamária; Kiss, Orsolya; Orosz, Andrea; Sághy, László; Nagy, István; Hegedűs, Zoltán; Rudas, László; Széll, Márta; Varró, András; Forster, Tamás; Sepp, Róbert

    2017-08-15

    The most important molecular determinant of heart rate regulation in sino-atrial pacemaker cells includes hyperpolarization-activated, cyclic nucleotide-gated ion channels, the major isoform of which is encoded by the HCN4 gene. Mutations affecting the HCN4 gene are associated primarily with sick sinus syndrome. A novel c.1737+1 G>T 'splice-site' HCN4 mutation was identified in a large family with familial bradycardia which co-segregated with the disease providing a two-point LOD score of 4.87. Twelve out of the 22 investigated family members [4 males, 8 females; average age 36 (SD 6) years] were considered as clinically affected (heart rate <60/min on resting ECG). Minimum [36 (SD 7) vs. 47 (SD 5) bpm, p=0.0087] and average heart rates [62 (SD 8) vs. 73 (SD 8) bpm, p=0.0168] were significantly lower in carriers on 24-hour Holter recordings. During the maximum exercise test, carriers achieved significantly lower heart rates than non-carrier family members, and percent heart rate reserve and percent corrected heart rate reserve were significantly lower in carriers. Applying rigorous criteria for chronotropic incompetence, a higher number of carriers exhibited chronotropic incompetence. Parameters characterizing short-term variability of heart rate (i.e. rMSSD and pNN50%) were increased in carrier family members, even after normalization for heart rate, in the 24-hour ECG recordings with the same relative increase in 5-minute recordings. The identified novel 'splice site' HCN4 gene mutation, c.1737+1 G>T, causes familial bradycardia and leads to reduced heart rate response, impaired chronotropic competence and increased short-term heart rate variability in the mutation carriers. Copyright © 2017 Elsevier B.V. All rights reserved.

  13. Genotype and cardiovascular phenotype correlations with TBX1 in 1,022 velo-cardio-facial/DiGeorge/22q11.2 deletion syndrome patients

    PubMed Central

    Guo, Tingwei; McGinn, Donna McDonald; Blonska, Anna; Shanske, Alan; Bassett, Anne; Chow, Eva; Bowser, Mark; Sheridan, Molly; Beemer, Frits; Devriendt, Koen; Swillen, Ann; Breckpot, Jeroen; Digilio, M. Cristina; Marino, Bruno; Dallapiccola, Bruno; Carpenter, Courtney; Zheng, Xin; Johnson, Jacob; Chung, Jonathan; Higgins, Anne Marie; Philip, Nicole; Simon, Tony J.; Coleman, Karlene; Heine-Suner, Damian; Rosell, Jordi; Kates, Wendy; Devoto, Marcella; Goldmuntz, Elizabeth; Zackai, Elaine; Wang, Tao; Shprintzen, Robert; Emanuel, Beverly; Morrow, Bernice

    2011-01-01

    Haploinsufficiency of TBX1, encoding a T-box transcription factor, is largely responsible for the physical malformations in velo-cardio-facial/DiGeorge/22q11.2 deletion syndrome (22q11DS) patients. Cardiovascular malformations in these patients are highly variable, raising the question as to whether DNA variations in the TBX1 locus on the remaining allele of 22q11.2, could be responsible. To test this, a large sample size is needed. The TBX1 gene was sequenced in 360 consecutive 22q11DS patients. Rare and common variations were identified. We did not detect enrichment in rare SNP number in those with or without a congenital heart defect. One exception was that there was increased number of very rare SNPs between those with normal heart anatomy compared to those with right-sided aortic arch or persistent truncus arteriosus, suggesting potentially protective roles in the SNPs for these phenotype enrichment groups. Nine common SNPs (MAF >0.05) were chosen and used to genotype the entire cohort of 1,022 22q11DS subjects. We did not find a correlation between common SNPs or haplotypes and cardiovascular phenotype. This work demonstrates that common DNA variations in TBX1 do not explain variable cardiovascular expression in 22q11DS patients, implicating existence of modifiers in other genes on 22q11.2 or elsewhere in the genome. PMID:21796729

  14. Frequent loss of lineages and deficient duplications accounted for low copy number of disease resistance genes in Cucurbitaceae

    PubMed Central

    2013-01-01

    Background: The sequenced genomes of cucumber, melon and watermelon have relatively few R-genes, with 70, 75 and 55 copies only, respectively. The mechanism for low copy number of R-genes in Cucurbitaceae genomes remains unknown. Results: Manual annotation of R-genes in the sequenced genomes of Cucurbitaceae species showed that approximately half of them are pseudogenes. Comparative analysis of R-genes showed frequent loss of R-gene loci in different Cucurbitaceae species. Phylogenetic analysis, data mining and PCR cloning using degenerate primers indicated that Cucurbitaceae has a limited number of R-gene lineages (subfamilies). Comparison between R-genes from Cucurbitaceae and those from poplar and soybean suggested frequent loss of R-gene lineages in Cucurbitaceae. Furthermore, the average number of R-genes per lineage in Cucurbitaceae species is approximately 1/3 that in soybean or poplar. Therefore, both loss of lineages and deficient duplications in extant lineages accounted for the low copy number of R-genes in Cucurbitaceae. No extensive chimeras of R-genes were found in any of the sequenced Cucurbitaceae genomes. Nevertheless, one lineage of R-genes from Trichosanthes kirilowii, a wild Cucurbitaceae species, exhibits chimeric structures caused by gene conversions, and may contain a large number of distinct R-genes in natural populations. Conclusions: Cucurbitaceae species have a limited number of R-gene lineages and each genome harbors relatively few R-genes. The scarcity of R-genes in Cucurbitaceae species was due to frequent loss of R-gene lineages and infrequent duplications in extant lineages. The evolutionary mechanisms for large variation of copy number of R-genes in different plant species were discussed. PMID:23682795

  15. Genetic variation and population structure in Jamunapari goats using microsatellites, mitochondrial DNA, and milk protein genes.

    PubMed

    Rout, P K; Thangraj, K; Mandal, A; Roy, R

    2012-01-01

    Jamunapari, a dairy goat breed of India, has been gradually declining in numbers in its home tract over the years. We have analysed genetic variation and population history in Jamunapari goats based on 17 microsatellite loci, 2 milk protein loci, mitochondrial hypervariable region I (HVRI) sequencing, and three Y-chromosomal gene sequencing. We used the mitochondrial DNA (mtDNA) mismatch distribution, microsatellite data, and bottleneck tests to infer the population history and demography. The mean number of alleles per locus was 9.0 indicating that the allelic variation was high in all the loci and the mean heterozygosity was 0.769 at nuclear loci. Although the population size is smaller than 8,000 individuals, the amount of variability both in terms of allelic richness and gene diversity was high in all the microsatellite loci except ILST 005. The gene diversity and effective number of alleles at milk protein loci were higher than the 10 other Indian goat breeds that they were compared to. Mismatch analysis was carried out and the analysis revealed that the population curve was unimodal indicating the expansion of population. The genetic diversity of Y-chromosome genes was low in the present study. The observed mean M ratio in the population was above the critical significance value (Mc) and close to one indicating that it has maintained a slowly changing population size. The mode-shift test did not detect any distortion of allele frequency and the heterozygosity excess method showed that there was no significant departure from mutation-drift equilibrium detected in the population. However, the effects of genetic bottlenecks were observed in some loci due to decreased heterozygosity and lower level of M ratio. There were two observed genetic subdivisions in the population supporting the observations of farmers in different areas. This baseline information on genetic diversity, bottleneck analysis, and mismatch analysis was obtained to assist the conservation decision and management of the breed.

  16. Genetic Variation and Population Structure in Jamunapari Goats Using Microsatellites, Mitochondrial DNA, and Milk Protein Genes

    PubMed Central

    Rout, P. K.; Thangraj, K.; Mandal, A.; Roy, R.

    2012-01-01

    Jamunapari, a dairy goat breed of India, has been gradually declining in numbers in its home tract over the years. We have analysed genetic variation and population history in Jamunapari goats based on 17 microsatellite loci, 2 milk protein loci, mitochondrial hypervariable region I (HVRI) sequencing, and three Y-chromosomal gene sequencing. We used the mitochondrial DNA (mtDNA) mismatch distribution, microsatellite data, and bottleneck tests to infer the population history and demography. The mean number of alleles per locus was 9.0 indicating that the allelic variation was high in all the loci and the mean heterozygosity was 0.769 at nuclear loci. Although the population size is smaller than 8,000 individuals, the amount of variability both in terms of allelic richness and gene diversity was high in all the microsatellite loci except ILST 005. The gene diversity and effective number of alleles at milk protein loci were higher than the 10 other Indian goat breeds that they were compared to. Mismatch analysis was carried out and the analysis revealed that the population curve was unimodal indicating the expansion of population. The genetic diversity of Y-chromosome genes was low in the present study. The observed mean M ratio in the population was above the critical significance value (Mc) and close to one indicating that it has maintained a slowly changing population size. The mode-shift test did not detect any distortion of allele frequency and the heterozygosity excess method showed that there was no significant departure from mutation-drift equilibrium detected in the population. However, the effects of genetic bottlenecks were observed in some loci due to decreased heterozygosity and lower level of M ratio. There were two observed genetic subdivisions in the population supporting the observations of farmers in different areas. This baseline information on genetic diversity, bottleneck analysis, and mismatch analysis was obtained to assist the conservation decision and management of the breed. PMID:22606053

  17. Overlapping 16p13.11 deletion and gain of copies variations associated with childhood onset psychosis include genes with mechanistic implications for autism associated pathways: Two case reports.

    PubMed

    Brownstein, Catherine A; Kleiman, Robin J; Engle, Elizabeth C; Towne, Meghan C; D'Angelo, Eugene J; Yu, Timothy W; Beggs, Alan H; Picker, Jonathan; Fogler, Jason M; Carroll, Devon; Schmitt, Rachel C O; Wolff, Robert R; Shen, Yiping; Lip, Va; Bilguvar, Kaya; Kim, April; Tembulkar, Sahil; O'Donnell, Kyle; Gonzalez-Heydrich, Joseph

    2016-05-01

    Copy number variability at 16p13.11 has been associated with intellectual disability, autism, schizophrenia, epilepsy, and attention-deficit hyperactivity disorder. Adolescent/adult-onset psychosis has been reported in a subset of these cases. Here, we report on two children with CNVs in 16p13.11 that developed psychosis before the age of 7. The genotype and neuropsychiatric abnormalities of these patients highlight several overlapping genes that have possible mechanistic relevance to pathways previously implicated in Autism Spectrum Disorders, including the mTOR signaling and the ubiquitin-proteasome cascades. A careful screening of the 16p13.11 region is warranted in patients with childhood onset psychosis. © 2016 Wiley Periodicals, Inc.

  18. Overlapping 16p13.11 Deletion and Gain of Copies Variations Associated with Childhood Onset Psychosis Include Genes with Mechanistic Implications for Autism Associated Pathways: Two Case Reports

    PubMed Central

    Brownstein, Catherine A.; Kleiman, Robin J.; Engle, Elizabeth C.; Towne, Meghan C.; D’Angelo, Eugene J.; Yu, Timothy W.; Beggs, Alan H.; Picker, Jonathan; Fogler, Jason M.; Carroll, Devon; Schmitt, Rachel C. O.; Wolff, Robert R.; Shen, Yiping; Lip, Va; Bilguvar, Kaya; Kim, April; Tembulkar, Sahil; O’Donnell, Kyle; Gonzalez-Heydrich, Joseph

    2016-01-01

    Copy number variability at 16p13.11 has been associated with intellectual disability, autism, schizophrenia, epilepsy and attention-deficit hyperactivity disorder. Adolescent/adult-onset psychosis has been reported in a subset of these cases. Here, we report on two children with CNVs in 16p13.11 that developed psychosis before the age of 7. The genotype and neuropsychiatric abnormalities of these patients highlight several overlapping genes that have possible mechanistic relevance to pathways previously implicated in Autism Spectrum Disorders, including the mTOR signaling and the ubiquitin-proteasome cascades. A careful screening of the 16p13.11 region is warranted in patients with childhood onset psychosis. PMID:26887912

  19. Genetic studies on the ghrelin, growth hormone secretagogue receptor (GHSR) and ghrelin O-acyl transferase (GOAT) genes.

    PubMed

    Liu, Boyang; Garcia, Edwin A; Korbonits, Márta

    2011-11-01

    Ghrelin is a 28 amino acid peptide hormone that is produced both centrally and peripherally. Regulated by the ghrelin O-acyl transferase enzyme, ghrelin exerts its action through the growth hormone secretagogue receptor, and is implicated in a diverse range of physiological processes. These implications have placed the ghrelin signaling pathway at the center of a large number of candidate gene and genome-wide studies which aim to identify the genetic basis of human heterogeneity. In this review we summarize the available data on the genetic variability of ghrelin, its receptor and its regulatory enzyme, and their association with obesity, stature, type 2 diabetes, cardiovascular disease, eating disorders, and reward seeking behavior. Copyright © 2011 Elsevier Inc. All rights reserved.

  20. Epidermal growth factor receptor and AKT1 gene copy numbers by multi-gene fluorescence in situ hybridization impact on prognosis in breast cancer.

    PubMed

    Li, Jiao; Su, Wei; Zhang, Sheng; Hu, Yunhui; Liu, Jingjing; Zhang, Xiaobei; Bai, Jingchao; Yuan, Weiping; Hu, Linping; Cheng, Tao; Zetterberg, Anders; Lei, Zhenmin; Zhang, Jin

    2015-05-01

    The epidermal growth factor receptor (EGFR)/PI3K/AKT signaling pathway aberrations play significant roles in breast cancer occurrence and development. However, the status of EGFR and AKT1 gene copy numbers remains unclear. In this study, we showed that the rates of EGFR and AKT1 gene copy number alterations were associated with the prognosis of breast cancer. Among 205 patients, high EGFR and AKT1 gene copy numbers were observed in 34.6% and 27.8% of cases by multi-gene fluorescence in situ hybridization, respectively. Co-heightened EGFR/AKT1 gene copy numbers were identified in 11.7% of cases. No changes were found in 49.3% of patients. Although changes in EGFR and AKT1 gene copy numbers had no correlation with patients' age, tumor stage, histological grade and the expression status of other molecular markers, high EGFR (P = 0.0002) but not AKT1 (P = 0.1177) gene copy numbers correlated with poor 5-year overall survival. The patients with co-heightened EGFR/AKT1 gene copy numbers displayed a poorer prognosis than those with tumors with only high EGFR gene copy numbers (P = 0.0383). Both univariate (U) and Cox multivariate (C) analyses revealed that high EGFR and AKT1 gene copy numbers (P = 0.000 [U], P = 0.0001 [C]), similar to histological grade (P = 0.001 [U], P = 0.012 [C]) and lymph node metastasis (P = 0.046 [U], P = 0.158 [C]), were independent prognostic indicators of 5-year overall survival. These results indicate that high EGFR and AKT1 gene copy numbers were relatively frequent in breast cancer. Patients with co-heightened EGFR/AKT1 gene copy numbers had a worse outcome than those with only high EGFR gene copy numbers, suggesting that evaluation of these two genes together may be useful for selecting patients for anti-EGFR-targeted therapy or anti-EGFR/AKT1-targeted therapy and for predicting outcomes. © 2015 The Authors. Cancer Science published by Wiley Publishing Asia Pty Ltd on behalf of Japanese Cancer Association.

  1. Genome-wide survey and characterization of the WRKY gene family in Populus trichocarpa.

    PubMed

    He, Hongsheng; Dong, Qing; Shao, Yuanhua; Jiang, Haiyang; Zhu, Suwen; Cheng, Beijiu; Xiang, Yan

    2012-07-01

    WRKY transcription factors participate in diverse physiological and developmental processes in plants. They have highly conserved WRKYGQK amino acid sequences in their N-termini, followed by the novel zinc-finger-like motifs, Cys₂His₂ or Cys₂HisCys. To date, numerous WRKY genes have been identified and characterized in a number of herbaceous species. Survey and characterization of WRKY genes in a ligneous species would facilitate a better understanding of the evolutionary processes and functions of this gene family. In this study, 104 poplar WRKY genes (PtWRKY) were identified in the latest poplar genome sequence. According to their structural features, the predicted members were divided into the previously defined groups I-III, as described in rice. In addition, chromosomal localization of the genes demonstrated that there might be WRKY gene hot spots in 2.3 Mb regions on chromosome 14. Furthermore, approximately 83% (86 out of 104) of the WRKY genes participated in gene duplication events, including 69% (29 out of 42) of gene pairs which exhibited segmental duplication. Using semi-quantitative RT-PCR, the expression patterns of subgroup III genes were investigated under different stresses [cold, drought, salinity and salicylic acid (SA)]. The data revealed that these genes presented different expression levels in response to various stress conditions. Expression analysis showed that the PtWRKY76 gene was markedly induced by 0.1 mM SA or 25% PEG-6000 treatment. The results presented here provide a fundamental clue for cloning specific function genes in further studies and applications. This study identified 104 poplar WRKY genes and demonstrated WRKY gene hot spots on chromosome 14. Furthermore, semi-quantitative RT-PCR showed variable stress responses in subgroup III.

  2. Molecular Typing and Virulence Gene Profiles of Enterotoxin Gene Cluster (egc)-Positive Staphylococcus aureus Isolates Obtained from Various Food and Clinical Specimens.

    PubMed

    Song, Minghui; Shi, Chunlei; Xu, Xuebing; Shi, Xianming

    2016-11-01

    The enterotoxin gene cluster (egc) has been proposed to contribute to the Staphylococcus aureus colonization, which highlights the need to evaluate genetic diversity and virulence gene profiles of the egc-positive population. Here, a total of 43 egc-positive isolates (16.2%) were identified from 266 S. aureus isolates that were obtained from various food and clinical specimens in Shanghai. Seven different egc profiles were found based on the polymerase chain reaction (PCR) result for egc genes. Then, these 43 egc-positive isolates were further typed by multilocus sequence typing, pulsed-field gel electrophoresis (PFGE), multiple-locus variable-number tandem-repeat analysis (MLVA), and accessory gene regulatory (agr) typing. It showed that the 43 egc-positive isolates displayed 17 sequence types, 28 PFGE patterns, 29 MLVA types, and 4 agr types, respectively. Among them, the dominant clonal lineage was CC5-agr II (48.84%). Thirty toxin and 20 adhesion-associated genes were detected by PCR in egc-positive isolates. Notably, invasive toxin genes showed a high prevalence, such as 76.7% for Panton-Valentine leukocidin encoding genes, 27.9% for sec, and 23.3% for tsst-1. Most of the examined adhesion-associated genes were found to be conserved (76.7-100%), whereas the fnbB gene was only found in 8 (18.6%) isolates. In addition, 33 toxin gene profiles and 13 adhesion gene profiles were identified, respectively. Our results imply that isolates belonging to the same clonal lineage harbored similar adhesion gene profiles but diverse toxin gene profiles. Overall, the high prevalence of invasive virulence genes increases the potential risk of egc-positive isolates in S. aureus infection.

  3. Natural selection on marine carnivores elaborated a diverse family of classical MHC class I genes exhibiting haplotypic gene content variation and allelic polymorphism

    PubMed Central

    Norman, Paul J.; Parham, Peter

    2012-01-01

    Pinnipeds, marine carnivores, diverged from terrestrial carnivores ~45 million years ago, before their adaptation to marine environments. This lifestyle change exposed pinnipeds to different microbiota and pathogens, with probable impact on their MHC class I genes. Investigating this question, genomic sequences were determined for 71 MHC class I variants: 27 from harbor seal and 44 from gray seal. These variants form three MHC class I gene lineages, one comprising a pseudogene. The second, a candidate nonclassical MHC class I gene, comprises a nonpolymorphic transcribed gene related to dog DLA-79 and giant panda Aime-1906. The third is the diversity lineage, which includes 62 of the 71 seal MHC class I variants. All are transcribed, and they minimally represent six harbor and 12 gray seal MHC class I genes. Besides species-specific differences in gene number, seal MHC class I haplotypes exhibit gene content variation and allelic polymorphism. Patterns of sequence variation, and of positions for positively selected sites, indicate the diversity lineage genes are the seals’ classical MHC class I genes. Evidence that expansion of diversity lineage genes began before gray and harbor seals diverged is the presence in both species of two distinctive sublineages of diversity lineage genes. Pointing to further expansion following the divergence are the presence of species-specific genes and greater MHC class I diversity in gray seals than harbor seals. The elaboration of a complex variable family of classical MHC class I genes in pinnipeds contrasts with the single, highly polymorphic classical MHC class I gene of dog and giant panda, terrestrial carnivores. PMID:23001684

  4. Distinct Genetic Signatures for Variability in Total and Free Serum Thyroxine Levels in Four Sets of Recombinant Inbred Mice

    PubMed Central

    Lu, Lu; Aliesky, Holly A.; Williams, Robert W.; Rapoport, Basil

    2011-01-01

    C3H/He and BALB/c mice have elevated serum thyroxine levels associated with low deiodinase type-1 activity whereas C57BL/6 (B6) mice have low thyroxine levels and elevated deiodinase type-1 activity. High-resolution genetic maps are available for four sets of recombinant inbred (RI) mice derived from B6 parents bred to C3H/He, BALB/c, DBA/2, or A strains. Total and free T4 (T-T4 and F-T4) levels in females from these RI sets (BXH, CXB, BXD, and AXBXA) were analyzed to test two hypotheses: first, serum T4 variability is linked to the deiodinase type-1 gene; second, because of their shared B6 parent, the RI sets will share linkages responsible for T-T4 or F-T4 variability. A number of chromosomes (Chr) and loci were linked to T-T4 (Chr 1, 4, 13, 11) or F-T4 (Chr 1, 6, 13, 18, 19). Linkage between T-T4 and Chr 4 was limited to CXB and BXH strains, but the locus was distinct from the deiodinase type-1 gene. Surprisingly, many linkages were unique providing “genetic signatures” for T-T4 or F-T4 in each set of RI mice. Indeed, the strongest linkage between T-T4 (or F-T4) and a Chr 2 locus (logarithm of the odds scores >4.4) was only observed in AXBXA strains. Some loci corresponded to genes/Chr associated in humans with variable TSH or T-T4 levels. Unlike inbred mice, human populations are extremely diverse. Consequently, our data suggest that the contributions of unique chromosomes/loci controlling T-T4 and F-T4 in distinct human subgroups are likely to be “buried” in genetic analyses of heterogeneous human populations. PMID:21209025

  5. Analysis of the Variability of Epstein-Barr Virus Genes in Infectious Mononucleosis: Investigation of the Potential Correlation with Biochemical Parameters of Hepatic Involvement

    PubMed Central

    Lazarevic, Ivana; Stevanovic, Goran; Cirkovic, Andja; Karalic, Danijela; Cupic, Maja; Banko, Bojan; Milovanovic, Jovica; Jovanovic, Tanja

    2016-01-01

    Background: Primary Epstein-Barr virus (EBV) infection is usually asymptomatic, although at times it results in the benign lymphoproliferative disease, infectious mononucleosis (IM), during which almost half of patients develop hepatitis. The aims of the present study are to evaluate polymorphisms of EBV genes circulating in IM isolates from this geographic region and to investigate the correlation of viral sequence patterns with the available IM biochemical parameters. Methods: The study included plasma samples from 128 IM patients. The genes EBNA2, LMP1, and EBNA1 were amplified using nested-PCR. EBNA2 genotyping was performed by visualization of PCR products using gel electrophoresis. Investigation of LMP1 and EBNA1 included sequence, phylogenetic, and statistical analyses. Results: The presence of EBV DNA in plasma samples showed correlation with patients’ necessity for hospitalization (p=0.034). The majority of EBV isolates was genotype 1. LMP1 variability showed 4 known variants, and two new deletions (27-bp and 147-bp). Of the 3 analyzed attributes of LMP1 isolates, the number of 33-bp repeats less than the reference 4.5 was the only one that absolutely correlated with the elevated levels of transaminases. EBNA1 variability was presented by prototype subtypes. A particular combination of EBNA2, LMP1, and EBNA1 polymorphisms, deleted LMP1/P-thr and non-deleted LMP1/P-ala, as well as genotype 1/ 4.5 33-bp LMP1 repeats or genotype 2/ 4.5 33-bp LMP1 repeats showed correlation with elevated AST (aspartate aminotransferase) and ALT (alanine transaminase). Conclusions: This is the first study which identified the association between EBV variability and biochemical parameters in IM patients. These results showed a possibility for the identification of hepatic related diagnostic EBV markers. PMID:28356886

  6. Clinical presentations of 23 half-siblings from a mosaic neurofibromatosis type 1 sperm donor.

    PubMed

    Ejerskov, C; Farholt, S; Skovby, F; Vestergaard, E M; Haagerup, A

    2016-03-01

    The Danish sperm donor number 7042 has fathered several offspring with neurofibromatosis type 1 (NF1) worldwide. NF1 is caused by loss-of-function mutations in the NF1 gene and more than 1000 NF1 mutations are identified. Analysis of the donor sperm demonstrated gonosomal mosaicism with an intragenic deletion involving exons 15-29 in the NF1 gene. At the two Danish reference centres for NF1 patients, we evaluated 23 half-siblings from the donor. Nine were diagnosed with NF1. The severity grade of NF1 progressed from minimal to mild/moderate within 3 years of follow-up. The NF1 phenotype shows great variability in intra- and inter-family expressivity and to date only two NF1 genotype-phenotype correlations have been established. This rare possibility of a long-term follow-up of a cohort of half-siblings with NF1 makes further studies including phenotypic variability and search for modifier genes possible. To achieve this goal, we have initiated The International Donor 7042 NF1 Offspring Registry. Research facilitated via this registry may reveal important new knowledge of clinical characteristics and prognostics for the specific NF1 genotype and thereby contribute to future individualised targeted clinical follow-up and treatment. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  7. Holoprosencephaly: signalling interactions between the brain and the face, the environment and the genes, and the phenotypic variability in animal models and humans

    PubMed Central

    Graf, Daniel; Marcucio, Ralph

    2014-01-01

    Holoprosencephaly (HPE) is the most common developmental defect of the forebrain characterized by inadequate or absent midline division of the forebrain into cerebral hemispheres, with concomitant midline facial defects in the majority of cases. Understanding the pathogenesis of HPE requires knowledge of the relationship between the developing brain and the facial structures during embryogenesis. A number of signalling pathways control and coordinate the development of the brain and face, including Sonic hedgehog (SHH), Bone Morphogenetic Protein (BMP), Fibroblast Growth Factor (FGF), and Nodal signalling. Mutations in these pathways have been identified in animal models of HPE and human patients. Due to incomplete penetrance and variable expressivity of HPE, patients carrying defined mutations may not manifest the disease at all, or have a spectrum of defects. It is currently unknown what drives manifestation of HPE in genetically at risk individuals, but it has been speculated that other gene mutations and environmental factors may combine as cumulative insults. HPE can be diagnosed in utero by a high-resolution prenatal ultrasound or a fetal magnetic resonance imaging, sometimes in combination with molecular testing from chorionic villi or amniotic fluid sampling. Currently, there are no effective preventive methods for HPE. Better understanding of the mechanisms of gene-environment interactions in HPE would provide avenues for such interventions. PMID:25339593

  8. Seasonality and the Response of the Thecosome Pteropod Limacina retroversa to CO2 in the Gulf of Maine

    NASA Astrophysics Data System (ADS)

    Maas, A.; Tarrant, A. M.; Bergan, A. J.; Wang, A. Z.; Lawson, G. L.

    2016-02-01

    Limacina retroversa is a thecosomatous pteropod found year round in the Gulf of Maine. Because carbonate chemistry within this shelf system is spatially variable and exhibits seasonal cycles, pteropods in this region may already be exposed to under-saturated, and hence corrosive, waters during certain seasons. To understand the implications of this variability, we have explored the physiological responses of L. retroversa at four time points over the course of a year to determine whether pteropods vary seasonally in their sensitivity to CO2 exposure on time-scales relevant to acclimation responses. In the laboratory, these animals were exposed to CO2 (ambient, 800, 1200 ppm) for 7-14 days and their response was assessed using an integrated set of metabolic, gene-expression and shell condition metrics. Similar to previous work with this species and others, pronounced changes in shell condition of exposed adults were discernible after less than 3 days of exposure, while changes to respiration rate were not consistently apparent. There were, however, seasonal variations in respiration rate indicative of an acclimation response. Differential expression analyses (RNAseq) revealed pronounced changes in gene expression among seasons, while laboratory CO2 exposure resulted in a lower number of differentially expressed transcripts. These gene expression studies, together with both respiration rate and shell condition metrics provide an integrated picture of the seasonal effect of CO2 on this sentinel species.

  9. Genotypic variability-based genome-wide association study identifies non-additive loci HLA-C and IL12B for psoriasis.

    PubMed

    Wei, Wen-Hua; Massey, Jonathan; Worthington, Jane; Barton, Anne; Warren, Richard B

    2018-03-01

    Genome-wide association studies (GWASs) have identified a number of loci for psoriasis but largely ignored non-additive effects. We report a genotypic variability-based GWAS (vGWAS) that can prioritize non-additive loci without requiring prior knowledge of interaction types or interacting factors. It proceeds in two steps: first, a mixed model partitions dichotomous phenotypes into an additive component and non-additive environmental residuals on the liability scale; then, Levene's (Brown-Forsythe) test assesses equality of the residual variances across genotype groups genome-wide. The vGWAS identified two genome-wide significant (P < 5.0e-08) non-additive loci, HLA-C and IL12B, that were also genome-wide significant in an accompanying GWAS in the discovery cohort. Both loci were statistically replicated in vGWAS of an independent cohort with a small sample size. HLA-C and IL12B have been reported in moderate gene-gene and/or gene-environment interactions on several occasions. We found a moderate interaction with age-of-onset of psoriasis, which was replicated indirectly. The vGWAS also revealed five suggestive loci (P < 6.76e-05) including FUT2 that was associated with psoriasis with environmental aspects triggered by virus infection and/or metabolic factors. Replication and functional investigation are needed to validate the suggestive vGWAS loci.
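
    The second step described above (a Brown-Forsythe test on residual variances across genotype groups) can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `brown_forsythe` and the residual values are hypothetical, the mixed-model step is assumed to have been run already, and only the test statistic W is computed. Under the null hypothesis W is approximately F-distributed with (k - 1, N - k) degrees of freedom; the p-value lookup is left out because the Python standard library has no F distribution.

```python
from statistics import mean, median


def brown_forsythe(groups):
    """Return the Brown-Forsythe W statistic for equality of variances.

    groups: one list of residuals per genotype group (e.g. AA, AB, BB).
    """
    # Absolute deviation of every residual from its own group's median.
    z = [[abs(x - median(g)) for x in g] for g in groups]
    n_total = sum(len(g) for g in z)
    k = len(z)
    grand_mean = sum(sum(g) for g in z) / n_total
    group_means = [mean(g) for g in z]
    # One-way ANOVA on the absolute deviations.
    between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(z, group_means))
    within = sum((v - m) ** 2 for g, m in zip(z, group_means) for v in g)
    return (n_total - k) / (k - 1) * between / within


# Made-up residuals for three genotype groups at one hypothetical SNP.
equal_spread = [[-2, -1, 0, 1, 2], [8, 9, 10, 11, 12], [-7, -6, -5, -4, -3]]
unequal_spread = [[-2, -1, 0, 1, 2], [8, 9, 10, 11, 12], [-10, -5, 0, 5, 10]]

print(brown_forsythe(equal_spread))    # group means differ, spreads do not
print(brown_forsythe(unequal_spread))  # third group has a larger spread
```

    Because deviations are taken from each group's median, a shift in group means alone leaves W at zero; only a difference in spread, which is what a non-additive locus is expected to produce, raises the statistic.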

  10. Identification of Genes Involved in Breast Cancer Metastasis by Integrating Protein-Protein Interaction Information with Expression Data.

    PubMed

    Tian, Xin; Xin, Mingyuan; Luo, Jian; Liu, Mingyao; Jiang, Zhenran

    2017-02-01

    The selection of relevant genes for breast cancer metastasis is critical for the treatment and prognosis of cancer patients. Although much effort has been devoted to the gene selection procedures by use of different statistical analysis methods or computational techniques, the interpretation of the variables in the resulting survival models has been limited so far. This article proposes a new Random Forest (RF)-based algorithm to identify important variables highly related to breast cancer metastasis, which is based on the importance scores of two variable selection algorithms, including the mean decrease Gini (MDG) criterion of Random Forest and the GeneRank algorithm with protein-protein interaction (PPI) information. The new gene selection algorithm is called PPIRF. The improved prediction accuracy fully illustrated the reliability and high interpretability of the gene list selected by the PPIRF approach.

  11. Influence of flanking sequences on variability in expression levels of an introduced gene in transgenic tobacco plants.

    PubMed Central

    Dean, C; Jones, J; Favreau, M; Dunsmuir, P; Bedbrook, J

    1988-01-01

    The petunia rbcS gene SSU301 was introduced into tobacco using Agrobacterium tumefaciens-mediated transformation. The time at which rbcS expression was maximal after transfer of the tobacco plants to the greenhouse was determined. The expression level of the SSU301 gene varied up to 9-fold between individual tobacco plants which had been standardized physiologically as much as possible. The presence of adjacent pUC plasmid sequences did not affect the expression of the SSU301 gene. In an attempt to reduce the between-transformant variability in expression, the SSU301 gene was introduced into tobacco surrounded by 10 kb of 5' and 13 kb of 3' DNA sequences which normally flank SSU301 in petunia. The longer flanking regions did not reduce the between-transformant variability of SSU301 gene expression. PMID:3174450

  12. Analysis, Characterization, and Loci of the tuf Genes in Lactobacillus and Bifidobacterium Species and Their Direct Application for Species Identification

    PubMed Central

    Ventura, Marco; Canchaya, Carlos; Meylan, Valèrie; Klaenhammer, Todd R.; Zink, Ralf

    2003-01-01

    We analyzed the tuf gene, encoding elongation factor Tu, from 33 strains representing 17 Lactobacillus species and 8 Bifidobacterium species. The tuf sequences were aligned and used to infer phylogenesis among species of lactobacilli and bifidobacteria. We demonstrated that the synonymous substitution affecting this gene renders elongation factor Tu a reliable molecular clock for investigating evolutionary distances of lactobacilli and bifidobacteria. In fact, the phylogeny generated by these tuf sequences is consistent with that derived from 16S rRNA analysis. The investigation of a multiple alignment of tuf sequences revealed regions conserved among strains belonging to the same species but distinct from those of other species. PCR primers complementary to these regions allowed species-specific identification of closely related species, such as Lactobacillus casei group members. These tuf gene-based assays developed in this study provide an alternative to present methods for the identification for lactic acid bacterial species. Since a variable number of tuf genes have been described for bacteria, the presence of multiple genes was examined. Southern analysis revealed one tuf gene in the genomes of lactobacilli and bifidobacteria, but the tuf gene was arranged differently in the genomes of these two taxa. Our results revealed that the tuf gene in bifidobacteria is flanked by the same gene constellation as the str operon, as originally reported for Escherichia coli. In contrast, bioinformatic and transcriptional analyses of the DNA region flanking the tuf gene in four Lactobacillus species indicated the same four-gene unit and suggested a novel tuf operon specific for the genus Lactobacillus. PMID:14602655

  13. Discrimination of germline V genes at different sequencing lengths and mutational burdens: A new tool for identifying and evaluating the reliability of V gene assignment.

    PubMed

    Zhang, Bochao; Meng, Wenzhao; Prak, Eline T Luning; Hershberg, Uri

    2015-12-01

    Immune repertoires are collections of lymphocytes that express diverse antigen receptor gene rearrangements consisting of Variable (V), Diversity (D; in the case of heavy chains), and Joining (J) gene segments. Clonally related cells typically share the same germline gene segments and have highly similar junctional sequences within their third complementarity determining regions. Identifying clonal relatedness of sequences is a key step in the analysis of immune repertoires. The V gene is the most important for clone identification because it has the longest sequence and the greatest number of sequence variants. However, accurate identification of a clone's germline V gene source is challenging because there is a high degree of similarity between different germline V genes. This difficulty is compounded in antibodies, which can undergo somatic hypermutation. Furthermore, high-throughput sequencing experiments often generate partial sequences and have significant error rates. To address these issues, we describe a novel method to estimate which germline V genes (or alleles) cannot be discriminated under different conditions (read lengths, sequencing errors or somatic hypermutation frequencies). Starting with any set of germline V genes, this method measures their similarity using different sequencing lengths and calculates their likelihood of unambiguous assignment under different levels of mutation. Hence, one can identify, under different experimental and biological conditions, the germline V genes (or alleles) that cannot be uniquely identified and bundle them together into groups of specific V genes with highly similar sequences. Copyright © 2015 Elsevier B.V. All rights reserved.
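
    The bundling idea in this abstract can be illustrated with a deliberately simplified sketch. It is not the published tool: it only groups germline sequences that become identical once truncated to a given read length, and it ignores sequencing error and somatic hypermutation, which the described method does model. The function name `bundle_by_read`, the toy sequences, and the choice to keep the 3' end of each sequence are all assumptions made for illustration.

```python
from collections import defaultdict


def bundle_by_read(germlines, read_length):
    """Group germline genes that are indistinguishable at a given read length.

    germlines: dict mapping gene name -> nucleotide sequence.
    read_length: number of bases kept, counted from the 3' end (an assumption).
    """
    groups = defaultdict(list)
    for name, seq in germlines.items():
        # Genes sharing the same truncated sequence cannot be told apart.
        groups[seq[-read_length:]].append(name)
    return sorted(sorted(names) for names in groups.values())


# Toy sequences, not real germline V genes.
toy_germlines = {
    "V1": "ACGTACGTAA",
    "V2": "TCGTACGTAA",  # differs from V1 only at the first base
    "V3": "ACGTACGTCC",
}

print(bundle_by_read(toy_germlines, 5))   # [['V1', 'V2'], ['V3']]
print(bundle_by_read(toy_germlines, 10))  # [['V1'], ['V2'], ['V3']]
```

    In this toy case the short read merges V1 and V2 into one bundle, while the full-length read separates all three, which mirrors the dependence on sequencing length that the abstract describes.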

  14. The immunoglobulin heavy chain locus of the duck. Genomic organization and expression of D, J, and C region genes.

    PubMed

    Lundqvist, M L; Middleton, D L; Hazard, S; Warr, G W

    2001-12-14

    The region of the duck IgH locus extending from upstream of the proximal diversity (D) segment to downstream of the constant gene cluster has been cloned and mapped. A sequence contig of 48,796 base pairs established that the organization of the genes is D-J(H)-mu-alpha-upsilon. No evidence for a functional homologue (or remnant) of a delta gene was found. The alpha gene is in inverted transcriptional orientation; class switch to IgA expression thus requires inversion of the approximately 27-kilobase pair region that includes both mu and alpha genes. The secreted forms of duck alpha and mu are each encoded by 4 constant region exons, and the hydrophobic C-terminal regions of the membrane receptor forms of alpha and mu are encoded by one and two transmembrane exons, respectively. Putative switch (S) regions were identified for duck mu and upsilon by comparison with chicken Smu and Supsilon sequences and for duck alpha by comparison with mouse Salpha. The duck IgH locus is rich in complex variable number tandem repeats, which occupy approximately 60% of the sequenced region, and occur at a much higher frequency in the IgH locus than in other sequenced regions of the duck genome.

  15. Garlic Influences Gene Expression In Vivo and In Vitro.

    PubMed

    Charron, Craig S; Dawson, Harry D; Novotny, Janet A

    2016-02-01

    There is a large body of preclinical research aimed at understanding the roles of garlic and garlic-derived preparations in the promotion of human health. Most of this research has targeted the possible functions of garlic in maintaining cardiovascular health and in preventing and treating cancer. A wide range of outcome variables has been used to investigate the bioactivity of garlic, ranging from direct measures of health status such as cholesterol concentrations, blood pressure, and changes in tumor size and number, to molecular and biochemical measures such as mRNA gene expression, protein concentration, enzyme activity, and histone acetylation status. Determination of how garlic influences mRNA gene expression has proven to be a valuable approach to elucidating the mechanisms of garlic bioactivity. Preclinical studies investigating the health benefits of garlic far outnumber human studies and have made frequent use of mRNA gene expression measurement. There is an immediate need to understand mRNA gene expression in humans as well. Although safety and ethical constraints limit the types of available human tissue, peripheral whole blood is readily accessible, and measuring mRNA gene expression in whole blood may provide a unique window to understanding how garlic intake affects human health. © 2016 American Society for Nutrition.

  16. Drug metabolising enzyme polymorphisms in Middle- and Eastern-European Slavic populations.

    PubMed

    Hubacek, Jaroslav A

    2014-01-01

    Inter-individual differences in genes for drug metabolising enzymes and drug transporters are important for understanding efficacy in drug therapy. These differences are important both for the timely estimation of the dosage that should be prescribed to a patient and for the detection of individuals who are prone to side effects from the drug at normal doses. This review summarises the literature concerning the gene variants within nine major drug metabolising enzymes and drug transporters (i.e., CYP1A2, CYP2A6, CYP2C9, CYP2C19, CYP2D6, CYP2E1, CYP3A4, CYP3A5, and MDR-1) in the Middle European region. Notably, published data are not extensive, and most studies were performed on relatively low numbers of individuals. No country has a complete coverage of all genes. Two variants (C2677T/A and C3435T) within the multidrug resistance-1 (MDR-1) gene and variants within the CYP2C9 gene were analysed within most Slavic populations. Nevertheless, even from this incomplete coverage (where unexpectedly high variability was at times seen both between and within populations), it could be extrapolated that the variants within the drug metabolising enzyme genes are present in roughly the same frequencies as in neighbouring countries.

  17. Analysis of the cytochrome c oxidase subunit 1 (COX1) gene reveals the unique evolution of the giant panda.

    PubMed

    Hu, Yao-Dong; Pang, Hui-Zhong; Li, De-Sheng; Ling, Shan-Shan; Lan, Dan; Wang, Ye; Zhu, Yun; Li, Di-Yan; Wei, Rong-Ping; Zhang, He-Min; Wang, Cheng-Dong

    2016-11-05

    As the rate-limiting enzyme of the mitochondrial respiratory chain, cytochrome c oxidase (COX) plays a crucial role in biological metabolism. "Living fossil" giant panda (Ailuropoda melanoleuca) is well-known for its special bamboo diet. In an effort to explore functional variation of COX1 in the energy metabolism behind giant panda's low-energy bamboo diet, we looked at genetic variation of COX1 gene in giant panda, and tested for its selection effect. In 1545 base pairs of the gene from 15 samples, 9 positions were variable and 1 mutation led to an amino acid sequence change. The COX1 gene produces six haplotypes; the nucleotide diversity (pi), haplotype diversity (Hd), and average number of nucleotide differences (k) are 0.001629±0.001036, 0.8083±0.0694 and 2.517, respectively. Also, dN/dS ratio is significantly below 1. These results indicated that giant panda had a low population genetic diversity, and an obvious purifying selection of the COX1 gene which reduces synthesis of ATP determines giant panda's low-energy bamboo diet. Phylogenetic trees based on the COX1 gene were constructed to demonstrate that giant panda is the sister group of other Ursidae. Copyright © 2016 Elsevier B.V. All rights reserved.
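
    The three summary statistics quoted above are linked by standard definitions, which the sketch below makes concrete. The abstract does not say which software or estimators were used, so the formulas here are assumptions: haplotype diversity as Hd = n/(n - 1) × (1 - Σ p_i²), k as the mean number of pairwise differences, and pi as k divided by the alignment length. The function name `diversity_stats` and the four toy sequences are hypothetical. The reported values are at least consistent with the last relation, since 0.001629 × 1545 ≈ 2.517.

```python
from collections import Counter
from itertools import combinations


def diversity_stats(seqs):
    """Return (haplotype count, Hd, k, pi) for equal-length aligned sequences."""
    n = len(seqs)
    length = len(seqs[0])
    # Haplotype diversity: chance that two sampled sequences differ.
    counts = Counter(seqs)
    hd = n / (n - 1) * (1 - sum((c / n) ** 2 for c in counts.values()))
    # k: average number of differing sites over all sequence pairs.
    pairs = list(combinations(seqs, 2))
    k = sum(sum(a != b for a, b in zip(s, t)) for s, t in pairs) / len(pairs)
    # pi: the same quantity expressed per site.
    pi = k / length
    return len(counts), hd, k, pi


# Toy alignment, not panda data.
toy = ["AAAA", "AAAT", "AAAT", "CAAT"]
n_hap, hd, k, pi = diversity_stats(toy)
print(n_hap, round(hd, 4), k, pi)

# Consistency check on the values reported in the abstract.
print(round(0.001629 * 1545, 3))
```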

  18. RNA-seq: technical variability and sampling

    PubMed Central

    2011-01-01

    Background: RNA-seq is revolutionizing the way we study transcriptomes. mRNA can be surveyed without prior knowledge of gene transcripts. Alternative splicing of transcript isoforms and the identification of previously unknown exons are being reported. Initial reports of differences in exon usage, and splicing between samples as well as quantitative differences among samples are beginning to surface. Biological variation has been reported to be larger than technical variation. In addition, technical variation has been reported to be in line with expectations due to random sampling. However, strategies for dealing with technical variation will differ depending on the magnitude. The size of technical variance, and the role of sampling are examined in this manuscript. Results: In this study three independent Solexa/Illumina experiments containing technical replicates are analyzed. When coverage is low, large disagreements between technical replicates are apparent. Exon detection between technical replicates is highly variable when the coverage is less than 5 reads per nucleotide and estimates of gene expression are more likely to disagree when coverage is low, although large disagreements in the estimates of expression are observed at all levels of coverage. Conclusions: Technical variability is too high to ignore. Technical variability results in inconsistent detection of exons at low levels of coverage. Further, the estimate of the relative abundance of a transcript can substantially disagree, even when coverage levels are high. This may be due to the low sampling fraction and if so, it will persist as an issue needing to be addressed in experimental design even as the next wave of technology produces larger numbers of reads. We provide practical recommendations for dealing with the technical variability, without dramatic cost increases. PMID:21645359

  19. Transcriptome-Level Signatures in Gene Expression and Gene Expression Variability during Bacterial Adaptive Evolution.

    PubMed

    Erickson, Keesha E; Otoupal, Peter B; Chatterjee, Anushree

    2017-01-01

    Antibiotic-resistant bacteria are an increasingly serious public health concern, as strains emerge that demonstrate resistance to almost all available treatments. One factor that contributes to the crisis is the adaptive ability of bacteria, which exhibit remarkable phenotypic and gene expression heterogeneity in order to gain a survival advantage in damaging environments. This high degree of variability in gene expression across biological populations makes it a challenging task to identify key regulators of bacterial adaptation. Here, we research the regulation of adaptive resistance by investigating transcriptome profiles of Escherichia coli upon adaptation to disparate toxins, including antibiotics and biofuels. We locate potential target genes via conventional gene expression analysis as well as using a new analysis technique examining differential gene expression variability. By investigating trends across the diverse adaptation conditions, we identify a focused set of genes with conserved behavior, including those involved in cell motility, metabolism, membrane structure, and transport, and several genes of unknown function. To validate the biological relevance of the observed changes, we synthetically perturb gene expression using clustered regularly interspaced short palindromic repeat (CRISPR)-dCas9. Manipulation of select genes in combination with antibiotic treatment promotes adaptive resistance as demonstrated by an increased degree of antibiotic tolerance and heterogeneity in MICs. We study the mechanisms by which identified genes influence adaptation and find that select differentially variable genes have the potential to impact metabolic rates, mutation rates, and motility. Overall, this work provides evidence for a complex nongenetic response, encompassing shifts in gene expression and gene expression variability, which underlies adaptive resistance. 
IMPORTANCE Even initially sensitive bacteria can rapidly thwart antibiotic treatment through stress response processes known as adaptive resistance. Adaptive resistance fosters transient tolerance increases and the emergence of mutations conferring heritable drug resistance. In order to extend the applicable lifetime of new antibiotics, we must seek to hinder the occurrence of bacterial adaptive resistance; however, the regulation of adaptation is difficult to identify due to immense heterogeneity emerging during evolution. This study specifically seeks to generate heterogeneity by adapting bacteria to different stresses and then examines gene expression trends across the disparate populations in order to pinpoint key genes and pathways associated with adaptive resistance. The targets identified here may eventually inform strategies for impeding adaptive resistance and prolonging the effectiveness of antibiotic treatment.

  20. Extensive Copy Number Variation in Fermentation-Related Genes Among Saccharomyces cerevisiae Wine Strains.

    PubMed

    Steenwyk, Jacob; Rokas, Antonis

    2017-05-05

    Due to the importance of Saccharomyces cerevisiae in wine-making, the genomic variation of wine yeast strains has been extensively studied. One of the major insights stemming from these studies is that wine yeast strains harbor low levels of genetic diversity in the form of single nucleotide polymorphisms (SNPs). Genomic structural variants, such as copy number (CN) variants, are another major type of variation segregating in natural populations. To test whether genetic diversity in CN variation is also low across wine yeast strains, we examined genome-wide levels of CN variation in 132 whole-genome sequences of S. cerevisiae wine strains. We found an average of 97.8 CN variable regions (CNVRs) affecting ∼4% of the genome per strain. Using two different measures of CN diversity, we found that gene families involved in fermentation-related processes such as copper resistance (CUP), flocculation (FLO), and glucose metabolism (HXT), as well as the SNO gene family whose members are expressed before or during the diauxic shift, showed substantial CN diversity across the 132 strains examined. Importantly, these same gene families have been shown, through comparative transcriptomic and functional assays, to be associated with adaptation to the wine fermentation environment. Our results suggest that CN variation is a substantial contributor to the genomic diversity of wine yeast strains, and identify several candidate loci whose levels of CN variation may affect the adaptation and performance of wine yeast strains during fermentation. Copyright © 2017 Steenwyk and Rokas.

  1. Distribution of virulence genes and genotyping of CTX-M-15-producing Klebsiella pneumoniae isolated from patients with community-acquired urinary tract infection (CA-UTI).

    PubMed

    Ranjbar, Reza; Memariani, Hamed; Sorouri, Rahim; Memariani, Mojtaba

    2016-11-01

    Klebsiella pneumoniae is one of the most important agents of community-acquired urinary tract infection (CA-UTI). In addition to extended-spectrum β-lactamases (ESBLs), a number of virulence factors have been shown to play an important role in the pathogenesis of K. pneumoniae, including capsule, siderophores, and adhesins. Little is known about the genetic diversity and virulence content of the CTX-M-15-producing K. pneumoniae isolated from CA-UTI in Iran. A total of 152 K. pneumoniae isolates were collected from CA-UTI patients in Tehran from September 2015 through April 2016. Out of 152 isolates, 40 (26.3%) carried blaCTX-M-15. PCR was performed for detection of virulence genes in CTX-M-15-producing isolates. Furthermore, all of these isolates were subjected to multiple-locus variable-number of tandem repeat (VNTR) analysis (MLVA). Using the MLVA method, 36 types were identified. CTX-M-15-producing K. pneumoniae isolates were grouped into 5 clonal complexes (CCs). Of these isolates, mrkD was the most prevalent virulence gene (95%), followed by kpn (60%), rmpA (37.5%), irp (35%), and magA (2.5%). No correlation between MLVA types or CCs and virulence genes or antibiotic resistance patterns was observed. Overall, it is thought that CTX-M-15-producing K. pneumoniae strains isolated from CA-UTI have arisen from different clones. Copyright © 2016 Elsevier Ltd. All rights reserved.

  2. Diversity in the Toll-Like Receptor Genes of the African Penguin (Spheniscus demersus).

    PubMed

    Dalton, Desiré Lee; Vermaak, Elaine; Roelofse, Marli; Kotze, Antoinette

    2016-01-01

    The African penguin, Spheniscus demersus, is listed as Endangered by the IUCN Red List of Threatened Species due to the drastic reduction in population numbers over the last 20 years. To date, the only studies on immunogenetic variation in penguins have been conducted on the major histocompatibility complex (MHC) genes. It was shown in humans that up to half of the genetic variability in immune responses to pathogens is located in non-MHC genes. Toll-like receptors (TLRs) are now increasingly being studied in a variety of taxa as a broader approach to determine functional genetic diversity. In this study, we confirm low genetic diversity in the innate immune region of African penguins similar to that observed in the New Zealand robin, which has undergone several severe population bottlenecks. Single nucleotide polymorphism (SNP) diversity across TLRs varied between ex situ and in situ penguins with the number of non-synonymous alterations in ex situ populations (n = 14) being reduced in comparison to in situ populations (n = 16). Maintaining adaptive diversity is of vital importance in the assurance populations as these animals may potentially be used in the future for re-introductions. Therefore, this study provides essential data on immune gene diversity in penguins and will assist in providing an additional monitoring tool for African penguins in the wild, as well as to monitor diversity in ex situ populations and to ensure that diversity found in the in situ populations is captured in the assurance populations.

  3. DNA Damage Response and Repair Gene Alterations Are Associated with Improved Survival in Patients with Platinum-Treated Advanced Urothelial Carcinoma.

    PubMed

    Teo, Min Yuen; Bambury, Richard M; Zabor, Emily C; Jordan, Emmet; Al-Ahmadie, Hikmat; Boyd, Mariel E; Bouvier, Nancy; Mullane, Stephanie A; Cha, Eugene K; Roper, Nitin; Ostrovnaya, Irina; Hyman, David M; Bochner, Bernard H; Arcila, Maria E; Solit, David B; Berger, Michael F; Bajorin, Dean F; Bellmunt, Joaquim; Iyer, Gopakumar; Rosenberg, Jonathan E

    2017-07-15

    Purpose: Platinum-based chemotherapy remains the standard treatment for advanced urothelial carcinoma by inducing DNA damage. We hypothesize that somatic alterations in DNA damage response and repair (DDR) genes are associated with improved sensitivity to platinum-based chemotherapy. Experimental Design: Patients with a diagnosis of locally advanced and metastatic urothelial carcinoma treated with platinum-based chemotherapy who had exon sequencing with the Memorial Sloan Kettering-Integrated Mutation Profiling of Actionable Cancer Targets (MSK-IMPACT) assay were identified. Patients were dichotomized based on the presence/absence of alterations in a panel of 34 DDR genes. DDR alteration status was correlated with clinical outcomes and disease features. Results: One hundred patients were identified, of which 47 harbored alterations in DDR genes. Patients with DDR alterations had improved progression-free survival (9.3 vs. 6.0 months, log-rank P = 0.007) and overall survival (23.7 vs. 13.0 months, log-rank P = 0.006). DDR alterations were also associated with a higher number of mutations and copy-number alterations. A trend toward positive correlation between DDR status and nodal metastases and inverse correlation with visceral metastases was observed. Different DDR pathways also suggested variable impact on clinical outcomes. Conclusions: Somatic DDR alteration is associated with improved clinical outcomes in platinum-treated patients with advanced urothelial carcinoma. Once validated, it can improve patient selection for clinical practice and future study enrollment. Clin Cancer Res; 23(14); 3610-8. ©2017 American Association for Cancer Research.

  4. Epilepsy genetics: the ongoing revolution.

    PubMed

    Lesca, G; Depienne, C

    2015-01-01

    Epilepsies have long remained refractory to gene identification due to several obstacles, including a highly variable inter- and intrafamilial expressivity of the phenotypes, a high frequency of phenocopies, and a huge genetic heterogeneity. Recent technological breakthroughs, such as array comparative genomic hybridization and next generation sequencing, have led, in the past few years, to the identification of an increasing number of genomic regions and genes in which mutations or copy-number variations cause various epileptic disorders, revealing an enormous diversity of pathophysiological mechanisms. The field that has undergone the most striking revolution is that of epileptic encephalopathies, for which most of the causative genes have been discovered since the year 2012. Some examples are the continuous spike-and-waves during slow-wave sleep and Landau-Kleffner syndromes for which the recent discovery of the role of GRIN2A mutations has finally confirmed the genetic bases. These new technologies are beginning to be used for diagnostic applications, and the main challenge now resides in the interpretation of the huge mass of variants detected by these methods. The identification of causative mutations in epilepsies provides definitive confirmation of the clinical diagnosis, allows accurate genetic counselling, and sometimes permits the development of new appropriate and specific antiepileptic therapies. Future challenges include the identification of the genetic or environmental factors that modify the epileptic phenotypes caused by mutations in a given gene and the understanding of the role of somatic mutations in sporadic epilepsies. Copyright © 2015 Elsevier Masson SAS. All rights reserved.

  5. The role of clinical variables, neuropsychological performance and SLC6A4 and COMT gene polymorphisms on the prediction of early response to fluoxetine in major depressive disorder.

    PubMed

    Gudayol-Ferré, Esteve; Herrera-Guzmán, Ixchel; Camarena, Beatriz; Cortés-Penagos, Carlos; Herrera-Abarca, Jorge E; Martínez-Medina, Patricia; Cruz, David; Hernández, Sandra; Genis, Alma; Carrillo-Guerrero, Mariana Y; Avilés Reyes, Rubén; Guàrdia-Olmos, Joan

    2010-12-01

    Major depressive disorder (MDD) is treated with antidepressants, but only between 50% and 70% of the patients respond to the initial treatment. Several authors suggested different factors that could predict antidepressant response, including clinical, psychophysiological, neuropsychological, neuroimaging, and genetic variables. However, these different predictors present poor prognostic sensitivity and specificity by themselves. The aim of our work is to study the possible role of clinical variables, neuropsychological performance, and the 5HTTLPR, rs25531, and val108/58Met COMT polymorphisms in the prediction of the response to fluoxetine after 4 weeks of treatment in a sample of patients with MDD. 64 patients with MDD were genotyped according to the above-mentioned polymorphisms, and were clinically and neuropsychologically assessed before a 4-week fluoxetine treatment. Fluoxetine response was assessed by using the Hamilton Depression Rating Scale. We carried out a binary logistic regression model for the potential predictive variables. Out of the clinical variables studied, only the number of anxiety disorders comorbid with MDD predicted a poor response to the treatment. A combination of a good performance in variables of attention and low performance in planning could predict a good response to fluoxetine in patients with MDD. None of the genetic variables studied had predictive value in our model. The possible placebo effect has not been controlled. Our study is focused on response prediction, not on remission prediction. Our work suggests that the combination of the number of comorbid anxiety disorders, an attentional variable, and two planning variables makes it possible to correctly classify 82% of the depressed patients who responded to the treatment with fluoxetine, and 74% of the patients who did not respond to that treatment. Copyright © 2010 Elsevier B.V. All rights reserved.

  6. Higher-order organisation of extremely amplified, potentially functional and massively methylated 5S rDNA in European pikes (Esox sp.).

    PubMed

    Symonová, Radka; Ocalewicz, Konrad; Kirtiklis, Lech; Delmastro, Giovanni Battista; Pelikánová, Šárka; Garcia, Sonia; Kovařík, Aleš

    2017-05-18

    Pikes represent an important genus (Esox) harbouring a pre-duplication karyotype (2n = 2x = 50) of economically important salmonid pseudopolyploids. Here, we have characterized the 5S ribosomal RNA genes (rDNA) in Esox lucius and its closely related E. cisalpinus using cytogenetic, molecular and genomic approaches. Intragenomic homogeneity and copy number estimation were carried out using Illumina reads. The higher-order structure of rDNA arrays was investigated by the analysis of long PacBio reads. Position of loci on chromosomes was determined by FISH. DNA methylation was analysed by methylation-sensitive restriction enzymes. The 5S rDNA loci occupy exclusively (peri)centromeric regions on 30-38 acrocentric chromosomes in both E. lucius and E. cisalpinus. The large number of loci is accompanied by extreme amplification of genes (>20,000 copies), which is to the best of our knowledge one of the highest copy numbers of rRNA genes in animals ever reported. Conserved secondary structures of predicted 5S rRNAs indicate that most of the amplified genes are potentially functional. Only a few SNPs were found in genic regions, indicating their high homogeneity, while intergenic spacers were more heterogeneous and several families were identified. Analysis of 10-30 kb-long molecules sequenced by the PacBio technology (containing about 40% of total 5S rDNA) revealed that the vast majority (96%) of genes are organised in large several kilobase-long blocks. Dispersed genes or short tandems were less common (4%). The adjacent 5S blocks were directly linked, separated by intervening DNA and even inverted. The 5S units differing in the intergenic spacers formed both homogeneous and heterogeneous (mixed) blocks indicating variable degree of homogenisation between the loci. Both E. lucius and E. cisalpinus 5S rDNA was heavily methylated at CG dinucleotides.
    Extreme amplification of 5S rRNA genes in the Esox genome occurred in the absence of significant pseudogenisation suggesting its recent origin and/or intensive homogenisation processes. The dense methylation of units indicates that powerful epigenetic mechanisms have evolved in this group of fish to silence amplified genes. We discuss how the higher-order repeat structures impact on homogenisation of 5S rDNA in the genome.

  7. Selection of housekeeping genes for gene expression studies in the adult rat submandibular gland under normal, inflamed, atrophic and regenerative states

    PubMed Central

    Silver, Nicholas; Cotroneo, Emanuele; Proctor, Gordon; Osailan, Samira; Paterson, Katherine L; Carpenter, Guy H

    2008-01-01

    Background: Real-time PCR is a reliable tool with which to measure mRNA transcripts, and provides valuable information on gene expression profiles. Endogenous controls such as housekeeping genes are used to normalise mRNA levels between samples for sensitive comparisons of mRNA transcription. Selection of the most stable control gene(s) is therefore critical for the reliable interpretation of gene expression data. For the purpose of this study, 7 commonly used housekeeping genes were investigated in salivary submandibular glands under normal, inflamed, atrophic and regenerative states. Results: The program NormFinder identified the suitability of HPRT to use as a single gene for normalisation within the normal, inflamed and regenerative states, and GAPDH in the atrophic state. For normalisation to multiple housekeeping genes, for each individual state, the optimal number of housekeeping genes as given by geNorm was: ACTB/UBC in the normal, ACTB/YWHAZ in the inflamed, ACTB/HPRT in the atrophic and ACTB/GAPDH in the regenerative state. The most stable housekeeping gene identified between states (compared to normal) was UBC. However, ACTB, identified as one of the most stably expressed genes within states, was found to be one of the most variable between states. Furthermore we demonstrated that normalising between states to ACTB, rather than UBC, introduced an approximately 3 fold magnitude of error. Conclusion: Using NormFinder, our studies demonstrated the suitability of HPRT to use as a single gene for normalisation within the normal, inflamed and regenerative groups and GAPDH in the atrophic group. However, if normalising to multiple housekeeping genes, we recommend normalising to those identified by geNorm. For normalisation across the physiological states, we recommend the use of UBC. PMID:18637167
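
    To make the idea of ranking control genes by expression stability more concrete, the sketch below computes a geNorm-style stability measure M: for each candidate gene, the standard deviation of its log2 expression ratio to every other candidate is taken across samples, and M is the mean of those standard deviations (lower M suggesting a more stable gene). This is an illustrative reimplementation under that definition, not the geNorm or NormFinder program itself; the gene labels and expression values are hypothetical.

```python
import math
from statistics import mean, stdev


def stability_m(expr):
    """Return a geNorm-style stability value M for each candidate gene.

    expr maps gene name -> list of relative expression quantities,
    one value per sample, in the same sample order for every gene.
    """
    m = {}
    for j in expr:
        pairwise_variation = []
        for k in expr:
            if k == j:
                continue
            # log2 ratio of gene j to gene k in each sample
            log_ratios = [math.log2(a / b) for a, b in zip(expr[j], expr[k])]
            pairwise_variation.append(stdev(log_ratios))
        m[j] = mean(pairwise_variation)
    return m


# Hypothetical relative quantities for three candidate genes in four samples.
expr = {
    "GENE_A": [1.0, 1.1, 0.9, 1.0],
    "GENE_B": [2.0, 2.2, 1.8, 2.0],  # tracks GENE_A at a constant 2x ratio
    "GENE_C": [1.0, 3.0, 0.5, 2.0],  # varies independently of the others
}
m_values = stability_m(expr)
ranking = sorted(m_values, key=m_values.get)  # most stable first
```

    In this toy data set GENE_A and GENE_B keep a constant ratio to each other, so they receive lower M values than GENE_C and would be preferred as normalisers.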

  8. Discordant expression and variable numbers of neighboring GGA- and GAA-rich triplet repeats in the 3' untranslated regions of two groups of messenger RNAs encoded by the rat polymeric immunoglobulin receptor gene.

    PubMed Central

    Koch, K S; Gleiberman, A S; Aoki, T; Leffert, H L; Feren, A; Jones, A L; Fodor, E J

    1995-01-01

    An unusual S1-nuclease sensitive microsatellite (STMS) has been found in the single copy, rat polymeric immunoglobulin receptor gene (PIGR) terminal exon. In Fisher rats, elements within or beyond the STMS are expressed variably in the 3' untranslated regions (3'UTRs) of two 'Groups' of PIGR-encoded hepatic mRNAs (pIg-R) during liver regeneration. STMS elements include neighboring constant regions (a 60-bp d[GA]-rich tract with a chi-like octamer, followed by 15 tandem d[GGA] repeats) that merge directly with 36 or 39 tandem d[GAA] repeats (Fisher or Wistar strains, respectively) interrupted by d[AA] between their 5th-6th repeat units. The Wistar STMS is flanked upstream by two regions of nearly contiguous d[CA] or d[CT] repeats in the 3' end of intron 8; and downstream, by a 283 bp 'unit' containing several inversions at its 5' end, and two polyadenylation signals at its 3' end. The 283 nt unit is expressed in Group 1 pIg-R mRNAs; but it is absent in the Group 2 family so that their GAA repeats merge with their poly A tails. In contrast to genomic sequence, GGA triplet repeats are amplified (n ≥ 24-26), whereas GAA triplet repeats are truncated variably (n ≤ 9-37) and expressed uninterruptedly in both mRNA Groups. These results suggest that 3' end processing of the rat PIGR gene may involve misalignment, slippage and premature termination of RNA polymerase II. The function of this unusual processing and possible roles of chi-like octamers in quiescent or extrahepatic tissues are discussed. PMID:7739889

  9. DNA Barcodes of Asian Houbara Bustard (Chlamydotis undulata macqueenii)

    PubMed Central

    Arif, Ibrahim A.; Khan, Haseeb A.; Williams, Joseph B.; Shobrak, Mohammad; Arif, Waad I.

    2012-01-01

    Populations of Houbara Bustards have dramatically declined in recent years. Captive breeding and reintroduction programs have had limited success in reviving population numbers and thus new technological solutions involving molecular methods are essential for the long term survival of this species. In this study, we sequenced the 694 bp segment of COI gene of the four specimens of Asian Houbara Bustard (Chlamydotis undulata macqueenii). We also compared these sequences with earlier published barcodes of 11 individuals comprising different families of the orders Gruiformes, Ciconiiformes, Podicipediformes and Crocodylia (out group). The pair-wise sequence comparison showed a total of 254 variable sites across all the 15 sequences from different taxa. Three of the four specimens of Houbara Bustard had an identical sequence of COI gene and one individual showed a single nucleotide difference (G > A transition at position 83). Within the bustard family (Otididae), comparison among the three species (Asian Houbara Bustard, Great Bustard (Otis tarda) and the Little Bustard (Tetrax tetrax)), representing three different genera, showed 116 variable sites. For another family (Rallidae), the intra-family variable sites among the individuals of four different genera were found to be 146. The COI genetic distances among the 15 individuals varied from 0.000 to 0.431. Phylogenetic analysis using 619 bp nucleotide segment of COI clearly discriminated all the species representing different genera, families and orders. All the four specimens of Houbara Bustard formed a single clade and are clearly separated from other two individuals of the same family (Otis tarda and Tetrax tetrax). The nucleotide sequence of partial segment of COI gene effectively discriminated the closely related species. 
    This is the first study reporting the barcodes of Houbara Bustard and would be helpful in future molecular studies, particularly for the conservation of this threatened bird in Saudi Arabia. PMID:22408462
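
    The pair-wise comparison described above can be sketched as follows: a variable site is an alignment column where the sequences are not all identical, and a simple genetic distance is the proportion of differing sites (p-distance). The abstract does not say which distance model was used, so the p-distance is an assumption here, and the short sequences are invented toy data rather than COI barcodes.

```python
from itertools import combinations


def variable_sites(alignment):
    """Return 1-based positions where aligned sequences are not all identical."""
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("sequences must be aligned to equal length")
    return [i + 1 for i in range(length)
            if len({seq[i] for seq in alignment}) > 1]


def p_distance(a, b):
    """Proportion of sites at which two aligned sequences differ."""
    return sum(x != y for x, y in zip(a, b)) / len(a)


# Toy alignment: three identical sequences and one with a single G > A change.
toy = [
    "ACGTACGTAC",
    "ACGTACGTAC",
    "ACGTACGTAC",
    "ACATACGTAC",
]
sites = variable_sites(toy)
distances = {
    (i, j): p_distance(toy[i], toy[j])
    for i, j in combinations(range(len(toy)), 2)
}
```

    Here the only variable site is position 3, identical sequences have distance 0, and the divergent sequence differs from the others at 1 of 10 sites.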

  10. Characteristics of functional enrichment and gene expression level of human putative transcriptional target genes.

    PubMed

    Osato, Naoki

    2018-01-19

    Transcriptional target genes show functional enrichment of genes. However, how many and how significantly transcriptional target genes include functional enrichments are still unclear. To address these issues, I predicted human transcriptional target genes using open chromatin regions, ChIP-seq data and DNA binding sequences of transcription factors in databases, and examined functional enrichment and gene expression level of putative transcriptional target genes. Gene Ontology annotations showed four times larger numbers of functional enrichments in putative transcriptional target genes than gene expression information alone, independent of transcriptional target genes. To compare the number of functional enrichments of putative transcriptional target genes between cells or search conditions, I normalized the number of functional enrichment by calculating its ratios in the total number of transcriptional target genes. With this analysis, native putative transcriptional target genes showed the largest normalized number of functional enrichments, compared with target genes including 5-60% of randomly selected genes. The normalized number of functional enrichments was changed according to the criteria of enhancer-promoter interactions such as distance from transcriptional start sites and orientation of CTCF-binding sites. Forward-reverse orientation of CTCF-binding sites showed significantly higher normalized number of functional enrichments than the other orientations. Journal papers showed that the top five frequent functional enrichments were related to the cellular functions in the three cell types. The median expression level of transcriptional target genes changed according to the criteria of enhancer-promoter assignments (i.e. interactions) and was correlated with the changes of the normalized number of functional enrichments of transcriptional target genes. Human putative transcriptional target genes showed significant functional enrichments. 
    Functional enrichments were related to the cellular functions. The normalized number of functional enrichments of human putative transcriptional target genes changed according to the criteria of enhancer-promoter assignments and correlated with the median expression level of the target genes. These analyses and characters of human putative transcriptional target genes would be useful to examine the criteria of enhancer-promoter assignments and to predict the novel mechanisms and factors such as DNA binding proteins and DNA sequences of enhancer-promoter interactions.

  11. Analysis of sequence variability in the macronuclear DNA of Paramecium tetraurelia: A somatic view of the germline

    PubMed Central

    Duret, Laurent; Cohen, Jean; Jubin, Claire; Dessen, Philippe; Goût, Jean-François; Mousset, Sylvain; Aury, Jean-Marc; Jaillon, Olivier; Noël, Benjamin; Arnaiz, Olivier; Bétermier, Mireille; Wincker, Patrick; Meyer, Eric; Sperling, Linda

    2008-01-01

    Ciliates are the only unicellular eukaryotes known to separate germinal and somatic functions. Diploid but silent micronuclei transmit the genetic information to the next sexual generation. Polyploid macronuclei express the genetic information from a streamlined version of the genome but are replaced at each sexual generation. The macronuclear genome of Paramecium tetraurelia was recently sequenced by a shotgun approach, providing access to the gene repertoire. The 72-Mb assembly represents a consensus sequence for the somatic DNA, which is produced after sexual events by reproducible rearrangements of the zygotic genome involving elimination of repeated sequences, precise excision of unique-copy internal eliminated sequences (IES), and amplification of the cellular genes to high copy number. We report use of the shotgun sequencing data (>10^6 reads representing 13× coverage of a completely homozygous clone) to evaluate variability in the somatic DNA produced by these developmental genome rearrangements. Although DNA amplification appears uniform, both of the DNA elimination processes produce sequence heterogeneity. The variability that arises from IES excision allowed identification of hundreds of putative new IESs, compared to 42 that were previously known, and revealed cases of erroneous excision of segments of coding sequences. We demonstrate that IESs in coding regions are under selective pressure to introduce premature termination of translation in case of excision failure. PMID:18256234

  12. Prospecting for pig single nucleotide polymorphisms in the human genome: have we struck gold?

    PubMed

    Grapes, L; Rudd, S; Fernando, R L; Megy, K; Rocha, D; Rothschild, M F

    2006-06-01

    Gene-to-gene variation in the frequency of single nucleotide polymorphisms (SNPs) has been observed in humans, mice, rats, primates and pigs, but a relationship across species in this variation has not been described. Here, the frequency of porcine coding SNPs (cSNPs) identified by in silico methods, and the frequency of murine cSNPs, were compared with the frequency of human cSNPs across homologous genes. From 150,000 porcine expressed sequence tag (EST) sequences, a total of 452 SNP-containing sequence clusters were found, totalling 1394 putative SNPs. All the clustered porcine EST annotations and SNP data have been made publicly available at http://sputnik.btk.fi/project?name=swine. Human and murine cSNPs were identified from dbSNP and were characterized as either validated or total number of cSNPs (validated plus non-validated) for comparison purposes. The correlation between in silico pig cSNP and validated human cSNP densities was found to be 0.77 (p < 0.00001) for a set of 25 homologous genes, while a correlation of 0.48 (p < 0.0005) was found for a primarily random sample of 50 homologous human and mouse genes. This is the first evidence of conserved gene-to-gene variability in cSNP frequency across species and indicates that site-directed screening of porcine genes that are homologous to cSNP-rich human genes may rapidly advance cSNP discovery in pigs.

  13. Multilocus Association Mapping Using Variable-Length Markov Chains

    PubMed Central

    Browning, Sharon R.

    2006-01-01

    I propose a new method for association-based gene mapping that makes powerful use of multilocus data, is computationally efficient, and is straightforward to apply over large genomic regions. The approach is based on the fitting of variable-length Markov chain models, which automatically adapt to the degree of linkage disequilibrium (LD) between markers to create a parsimonious model for the LD structure. Edges of the fitted graph are tested for association with trait status. This approach can be thought of as haplotype testing with sophisticated windowing that accounts for extent of LD to reduce degrees of freedom and number of tests while maximizing information. I present analyses of two published data sets that show that this approach can have better power than single-marker tests or sliding-window haplotypic tests. PMID:16685642

  14. Multilocus association mapping using variable-length Markov chains.

    PubMed

    Browning, Sharon R

    2006-06-01

    I propose a new method for association-based gene mapping that makes powerful use of multilocus data, is computationally efficient, and is straightforward to apply over large genomic regions. The approach is based on the fitting of variable-length Markov chain models, which automatically adapt to the degree of linkage disequilibrium (LD) between markers to create a parsimonious model for the LD structure. Edges of the fitted graph are tested for association with trait status. This approach can be thought of as haplotype testing with sophisticated windowing that accounts for extent of LD to reduce degrees of freedom and number of tests while maximizing information. I present analyses of two published data sets that show that this approach can have better power than single-marker tests or sliding-window haplotypic tests.

  15. Recurrent chromosomal gains and heterogeneous driver mutations characterise papillary renal cancer evolution

    PubMed Central

    Kovac, Michal; Navas, Carolina; Horswell, Stuart; Salm, Max; Bardella, Chiara; Rowan, Andrew; Stares, Mark; Castro-Giner, Francesc; Fisher, Rosalie; de Bruin, Elza C.; Kovacova, Monika; Gorman, Maggie; Makino, Seiko; Williams, Jennet; Jaeger, Emma; Jones, Angela; Howarth, Kimberley; Larkin, James; Pickering, Lisa; Gore, Martin; Nicol, David L.; Hazell, Steven; Stamp, Gordon; O’Brien, Tim; Challacombe, Ben; Matthews, Nik; Phillimore, Benjamin; Begum, Sharmin; Rabinowitz, Adam; Varela, Ignacio; Chandra, Ashish; Horsfield, Catherine; Polson, Alexander; Tran, Maxine; Bhatt, Rupesh; Terracciano, Luigi; Eppenberger-Castori, Serenella; Protheroe, Andrew; Maher, Eamonn; El Bahrawy, Mona; Fleming, Stewart; Ratcliffe, Peter; Heinimann, Karl; Swanton, Charles; Tomlinson, Ian

    2015-01-01

    Papillary renal cell carcinoma (pRCC) is an important subtype of kidney cancer with a problematic pathological classification and highly variable clinical behaviour. Here we sequence the genomes or exomes of 31 pRCCs, and in four tumours, multi-region sequencing is undertaken. We identify BAP1, SETD2, ARID2 and Nrf2 pathway genes (KEAP1, NFE2L2 and CUL3) as probable drivers, together with at least eight other possible drivers. However, only ~10% of tumours harbour detectable pathogenic changes in any one driver gene, and where present, the mutations are often predicted to be present within cancer sub-clones. We specifically detect parallel evolution of multiple SETD2 mutations within different sub-regions of the same tumour. By contrast, large copy number gains of chromosomes 7, 12, 16 and 17 are usually early, monoclonal changes in pRCC evolution. The predominance of large copy number variants as the major drivers for pRCC highlights an unusual mode of tumorigenesis that may challenge precision medicine approaches. PMID:25790038

  16. Slugs: potential novel vectors of Escherichia coli O157.

    PubMed

    Sproston, Emma L; Macrae, M; Ogden, Iain D; Wilson, Michael J; Strachan, Norval J C

    2006-01-01

    Field and laboratory studies were performed to determine whether slugs could act as novel vectors for pathogen (e.g., Escherichia coli O157) transfer from animal feces to salad vegetables. Escherichia coli O157 was isolated from 0.21% of field slugs from an Aberdeenshire sheep farm. These isolates carried the verocytotoxin genes (vt1 and vt2) and the attaching and effacing gene (eae), suggesting that they are potentially pathogenic to humans. Strain typing using multilocus variable number tandem repeats analysis showed that slug and sheep isolates were indistinguishable. Laboratory experiments using an E. coli mutant resistant to nalidixic acid showed that the ubiquitous slug species Deroceras reticulatum could carry viable E. coli on its external surface for up to 14 days. Slugs that had been fed E. coli shed viable bacteria in their feces with numbers showing a short but statistically significant linear log decline. Further, it was found that E. coli persisted for up to 3 weeks in excreted slug feces, and hence, we conclude that slugs have the potential to act as novel vectors of E. coli O157.

  17. Slugs: Potential Novel Vectors of Escherichia coli O157

    PubMed Central

    Sproston, Emma L.; Macrae, M.; Ogden, Iain D.; Wilson, Michael J.; Strachan, Norval J. C.

    2006-01-01

    Field and laboratory studies were performed to determine whether slugs could act as novel vectors for pathogen (e.g., Escherichia coli O157) transfer from animal feces to salad vegetables. Escherichia coli O157 was isolated from 0.21% of field slugs from an Aberdeenshire sheep farm. These isolates carried the verocytotoxin genes (vt1 and vt2) and the attaching and effacing gene (eae), suggesting that they are potentially pathogenic to humans. Strain typing using multilocus variable number tandem repeats analysis showed that slug and sheep isolates were indistinguishable. Laboratory experiments using an E. coli mutant resistant to nalidixic acid showed that the ubiquitous slug species Deroceras reticulatum could carry viable E. coli on its external surface for up to 14 days. Slugs that had been fed E. coli shed viable bacteria in their feces with numbers showing a short but statistically significant linear log decline. Further, it was found that E. coli persisted for up to 3 weeks in excreted slug feces, and hence, we conclude that slugs have the potential to act as novel vectors of E. coli O157. PMID:16391036

  18. Tracing phylogenomic events leading to diversity of Haemophilus influenzae and the emergence of Brazilian Purpuric Fever (BPF)-associated clones

    PubMed Central

    Papazisi, Leka; Ratnayake, Shashikala; Remortel, Brian G.; Bock, Geoffrey R.; Liang, Wei; Saeed, Alexander I.; Liu, Jia; Fleischmann, Robert D.; Kilian, Mogens; Peterson, Scott N.

    2010-01-01

    Here we report the use of a multi-genome DNA microarray to elucidate the genomic events associated with the emergence of the clonal variants of H. influenzae biogroup aegyptius causing Brazilian Purpuric Fever (BPF), an important pediatric disease with a high mortality rate. We performed directed genome sequencing of strain HK1212 unique loci to construct a species DNA microarray. Comparative genome hybridization using this microarray enabled us to determine and compare gene complements, and infer reliable phylogenomic relationships among members of the species. The higher genomic variability observed in the genomes of BPF-related strains (clones) and their close relatives may be characterized by significant gene flux related to a subset of functional role categories. We found that the acquisition of a large number of virulence determinants featuring numerous cell membrane proteins coupled to the loss of genes involved in transport, central biosynthetic pathways and in particular, energy production pathways to be characteristics of the BPF genomic variants. PMID:20654709

  19. Alu element insertion in PKLR gene as a novel cause of pyruvate kinase deficiency in Middle Eastern patients.

    PubMed

    Lesmana, Harry; Dyer, Lisa; Li, Xia; Denton, James; Griffiths, Jenna; Chonat, Satheesh; Seu, Katie G; Heeney, Matthew M; Zhang, Kejian; Hopkin, Robert J; Kalfa, Theodosia A

    2018-03-01

    Pyruvate kinase deficiency (PKD) is the most frequent red blood cell enzyme abnormality of the glycolytic pathway and the most common cause of hereditary nonspherocytic hemolytic anemia. Over 250 PKLR-gene mutations have been described, including missense/nonsense, splicing and regulatory mutations, small insertions, small and gross deletions, causing PKD and hemolytic anemia of variable severity. Alu retrotransposons are the most abundant mobile DNA sequences in the human genome, contributing to almost 11% of its mass. Alu insertions have been associated with a number of human diseases either by disrupting a coding region or a splice signal. Here, we report on two unrelated Middle Eastern patients, both born to consanguineous parents, with transfusion-dependent hemolytic anemia, where sequence analysis revealed a homozygous insertion of AluYb9 within exon 6 of the PKLR gene, causing a precipitous decrease of PKLR RNA levels. This Alu element insertion constitutes a previously unrecognized mechanism underlying pathogenesis of PKD. © 2017 Wiley Periodicals, Inc.

  20. SMARCB1/INI1 germline mutations contribute to 10% of sporadic schwannomatosis.

    PubMed

    Rousseau, Guillaume; Noguchi, Tetsuro; Bourdon, Violaine; Sobol, Hagay; Olschwang, Sylviane

    2011-01-24

    Schwannomatosis is a disease characterized by multiple non-vestibular schwannomas. Although biallelic NF2 mutations are found in schwannomas, no germ line event is detected in schwannomatosis patients. In contrast, germline mutations of the SMARCB1 (INI1) tumor suppressor gene were described in familial and sporadic schwannomatosis patients. To delineate the SMARCB1 gene contribution, the nine coding exons were sequenced in a series of 56 patients affected with a variable number of non-vestibular schwannomas. Nine variants scattered along the sequence of SMARCB1 were identified. Five of them were classified as deleterious. All five patients carrying a SMARCB1 mutation had more multiple schwannomas, corresponding to 10.2% of patients with schwannomatosis. They were also diagnosed before 35 years of age. These results suggest that patients with schwannomas have a significant probability of carrying a SMARCB1 mutation. Combined with data available from other studies, they confirm the clinical indications for genetic screening of the SMARCB1 gene.

  1. SMARCB1/INI1 germline mutations contribute to 10% of sporadic schwannomatosis

    PubMed Central

    2011-01-01

    Background Schwannomatosis is a disease characterized by multiple non-vestibular schwannomas. Although biallelic NF2 mutations are found in schwannomas, no germ line event is detected in schwannomatosis patients. In contrast, germline mutations of the SMARCB1 (INI1) tumor suppressor gene were described in familial and sporadic schwannomatosis patients. Methods To delineate the SMARCB1 gene contribution, the nine coding exons were sequenced in a series of 56 patients affected with a variable number of non-vestibular schwannomas. Results Nine variants scattered along the sequence of SMARCB1 were identified. Five of them were classified as deleterious. All five patients carrying a SMARCB1 mutation had more multiple schwannomas, corresponding to 10.2% of patients with schwannomatosis. They were also diagnosed before 35 years of age. Conclusions These results suggest that patients with schwannomas have a significant probability of carrying a SMARCB1 mutation. Combined with data available from other studies, they confirm the clinical indications for genetic screening of the SMARCB1 gene. PMID:21255467

  2. Expression Analysis of the Theileria parva Subtelomere-Encoded Variable Secreted Protein Gene Family

    PubMed Central

    Schmied, Stéfanie; Affentranger, Sarah; Parvanova, Iana; Kang'a, Simon; Nene, Vishvanath; Katzer, Frank; McKeever, Declan; Müller, Joachim; Bishop, Richard; Pain, Arnab; Dobbelaere, Dirk A. E.

    2009-01-01

    Background The intracellular protozoan parasite Theileria parva transforms bovine lymphocytes inducing uncontrolled proliferation. Proteins released from the parasite are assumed to contribute to phenotypic changes of the host cell and parasite persistence. With 85 members, genes encoding subtelomeric variable secreted proteins (SVSPs) form the largest gene family in T. parva. The majority of SVSPs contain predicted signal peptides, suggesting secretion into the host cell cytoplasm. Methodology/Principal Findings We analysed SVSP expression in T. parva-transformed cell lines established in vitro by infection of T or B lymphocytes with cloned T. parva parasites. Microarray and quantitative real-time PCR analysis revealed mRNA expression for a wide range of SVSP genes. The pattern of mRNA expression was largely defined by the parasite genotype and not by host background or cell type, and found to be relatively stable in vitro over a period of two months. Interestingly, immunofluorescence analysis carried out on cell lines established from a cloned parasite showed that expression of a single SVSP encoded by TP03_0882 is limited to only a small percentage of parasites. Epitope-tagged TP03_0882 expressed in mammalian cells was found to translocate into the nucleus, a process that could be attributed to two different nuclear localisation signals. Conclusions Our analysis reveals a complex pattern of Theileria SVSP mRNA expression, which depends on the parasite genotype. Whereas in cell lines established from a cloned parasite transcripts can be found corresponding to a wide range of SVSP genes, only a minority of parasites appear to express a particular SVSP protein. The fact that a number of SVSPs contain functional nuclear localisation signals suggests that proteins released from the parasite could contribute to phenotypic changes of the host cell. 
    This initial characterisation will facilitate future studies on the regulation of SVSP gene expression and the potential biological role of these enigmatic proteins. PMID:19325907

  3. Heterologous Production of a Novel Cyclic Peptide Compound, KK-1, in Aspergillus oryzae.

    PubMed

    Yoshimi, Akira; Yamaguchi, Sigenari; Fujioka, Tomonori; Kawai, Kiyoshi; Gomi, Katsuya; Machida, Masayuki; Abe, Keietsu

    2018-01-01

    A novel cyclic peptide compound, KK-1, was originally isolated from the plant-pathogenic fungus Curvularia clavata. It consists of 10 amino acid residues, including five N-methylated amino acid residues, and has potent antifungal activity. Recently, the genome-sequencing analysis of C. clavata was completed, and the biosynthetic genes involved in KK-1 production were predicted by using a novel gene cluster mining tool, MIDDAS-M. These genes form an approximately 75-kb cluster, which includes nine open reading frames, containing a non-ribosomal peptide synthetase (NRPS) gene. To determine whether the predicted genes were responsible for the biosynthesis of KK-1, we performed heterologous production of KK-1 in Aspergillus oryzae by introduction of the cluster genes into the genome of A. oryzae. The NRPS gene was split in two fragments and then reconstructed in the A. oryzae genome, because the gene was quite large (approximately 40 kb). The remaining seven genes in the cluster, excluding the regulatory gene kkR, were simultaneously introduced into the strain of A. oryzae in which NRPS had already been incorporated. To evaluate the heterologous production of KK-1 in A. oryzae, gene expression was analyzed by RT-PCR and KK-1 productivity was quantified by HPLC. KK-1 was produced in variable quantities by a number of transformed strains, along with expression of the cluster genes. The amount of KK-1 produced by the strain with the greatest expression of all genes was lower than that produced by the original producer, C. clavata. Therefore, expression of the cluster genes is necessary and sufficient for the heterologous production of KK-1 in A. oryzae, although there may be unknown factors limiting productivity in this species.

  4. Heterologous Production of a Novel Cyclic Peptide Compound, KK-1, in Aspergillus oryzae

    PubMed Central

    Yoshimi, Akira; Yamaguchi, Sigenari; Fujioka, Tomonori; Kawai, Kiyoshi; Gomi, Katsuya; Machida, Masayuki; Abe, Keietsu

    2018-01-01

    A novel cyclic peptide compound, KK-1, was originally isolated from the plant-pathogenic fungus Curvularia clavata. It consists of 10 amino acid residues, including five N-methylated amino acid residues, and has potent antifungal activity. Recently, the genome-sequencing analysis of C. clavata was completed, and the biosynthetic genes involved in KK-1 production were predicted by using a novel gene cluster mining tool, MIDDAS-M. These genes form an approximately 75-kb cluster, which includes nine open reading frames, containing a non-ribosomal peptide synthetase (NRPS) gene. To determine whether the predicted genes were responsible for the biosynthesis of KK-1, we performed heterologous production of KK-1 in Aspergillus oryzae by introduction of the cluster genes into the genome of A. oryzae. The NRPS gene was split in two fragments and then reconstructed in the A. oryzae genome, because the gene was quite large (approximately 40 kb). The remaining seven genes in the cluster, excluding the regulatory gene kkR, were simultaneously introduced into the strain of A. oryzae in which NRPS had already been incorporated. To evaluate the heterologous production of KK-1 in A. oryzae, gene expression was analyzed by RT-PCR and KK-1 productivity was quantified by HPLC. KK-1 was produced in variable quantities by a number of transformed strains, along with expression of the cluster genes. The amount of KK-1 produced by the strain with the greatest expression of all genes was lower than that produced by the original producer, C. clavata. Therefore, expression of the cluster genes is necessary and sufficient for the heterologous production of KK-1 in A. oryzae, although there may be unknown factors limiting productivity in this species. PMID:29686660

  5. Genome-wide Identification and Expression Analysis of the CDPK Gene Family in Grape, Vitis spp.

    PubMed

    Zhang, Kai; Han, Yong-Tao; Zhao, Feng-Li; Hu, Yang; Gao, Yu-Rong; Ma, Yan-Fei; Zheng, Yi; Wang, Yue-Jin; Wen, Ying-Qiang

    2015-06-30

    Calcium-dependent protein kinases (CDPKs) play vital roles in plant growth and development, biotic and abiotic stress responses, and hormone signaling. Little is known about the CDPK gene family in grapevine. In this study, we performed a genome-wide analysis of the 12X grape genome (Vitis vinifera) and identified nineteen CDPK genes. Comparison of the structures of grape CDPK genes allowed us to examine their functional conservation and differentiation. Segmentally duplicated grape CDPK genes showed high structural conservation and contributed to gene family expansion. Additional comparisons between grape and Arabidopsis thaliana demonstrated that several grape CDPK genes occurred in the corresponding syntenic blocks of Arabidopsis, suggesting that these genes arose before the divergence of grapevine and Arabidopsis. Phylogenetic analysis divided the grape CDPK genes into four groups. Furthermore, we examined the expression of the corresponding nineteen homologous CDPK genes in the Chinese wild grape (Vitis pseudoreticulata) under various conditions, including biotic stress, abiotic stress, and hormone treatments. The expression profiles derived from reverse transcription and quantitative PCR suggested that a large number of VpCDPKs responded to various stimuli on the transcriptional level, indicating their versatile roles in the responses to biotic and abiotic stresses. Moreover, we examined the subcellular localization of VpCDPKs by transiently expressing six VpCDPK-GFP fusion proteins in Arabidopsis mesophyll protoplasts; this revealed high variability consistent with potential functional differences. Taken as a whole, our data provide significant insights into the evolution and function of grape CDPKs and a framework for future investigation of grape CDPK genes.

  6. UNUSUAL FLORAL ORGANS Controls Meristem Identity and Organ Primordia Fate in Arabidopsis.

    PubMed

    Wilkinson, M. D.; Haughn, G. W.

    1995-09-01

    A novel gene that is involved in regulating flower initiation and development has been identified in Arabidopsis. This gene has been designated UNUSUAL FLORAL ORGANS (UFO), with five corresponding nuclear recessive alleles designated ufo-1 to ufo-5. Under short day-length conditions, ufo homozygotes generate more coflorescences than do the wild type, and coflorescences often appear apical to the first floral shoot, resulting in a period of inflorescence development in which regions of floral and coflorescence shoots are produced alternately. ufo enhances the phenotype of weak leafy alleles, and the double mutant Ufo-1 Apetala1-1 produces only coflorescence-like shoots, suggesting that these two genes control different aspects of floral initiation. Floral development was also altered in Ufo plants. Ufo flowers have an altered organ number in all whorls, and organs in the first, second, and third whorls exhibit variable homeotic transformations. Ufo single and double mutant phenotypes suggest that the floral changes result from reduction in class B floral homeotic gene expression and fluctuations in the expression boundaries of class C function and FLO10. Surprisingly, in situ hybridization analysis revealed no obvious differences in expression pattern or level in developing Ufo flowers compared with that of the wild type for any class B or C gene studied. We propose that UFO acts in concert with known floral initiation genes and regulates the domains of floral homeotic gene function.

  7. A promoter polymorphism in the monoamine oxidase A gene is associated with the pineal MAOA activity in Alzheimer's disease patients.

    PubMed

    Wu, Ying-Hui; Fischer, David F; Swaab, Dick F

    2007-09-05

    Monoamine oxidase A (MAOA) is involved in the pathogenesis of mood disorders and Alzheimer's disease (AD). MAOA activity and gene expression have been found to be up-regulated in different brain areas of AD patients, including the pineal gland. Increased pineal MAOA activity might contribute to the reduced pineal melatonin production in AD. A variable number tandem repeat (VNTR) promoter polymorphism in the MAOA gene has been shown to affect MAOA transcriptional activity in vitro. Here we examined in 63 aged controls and 44 AD patients the effects of the MAOA-VNTR on MAOA gene expression and activity in the pineal gland as endophenotypes, and on melatonin production. AD patients carrying the long MAOA-VNTR genotype (consisting of 3.5- or 4-repeat alleles) showed higher MAOA gene expression and activity than the short-genotyped (i.e., 3-repeat allele) AD patients. Moreover, the AD-related up-regulation of MAOA showed up only among long-genotype bearing subjects. There was no significant effect of the MAOA-VNTR on MAOA activity or gene expression in controls, or on melatonin production in either controls or AD patients. Our data suggest that the MAOA-VNTR affects the activity and gene expression of MAOA in the brain of AD patients, and is involved in the changes of monoamine metabolism.

  8. Positive and negative regulation of V(D)J recombination by the E2A proteins.

    PubMed

    Bain, G; Romanow, W J; Albers, K; Havran, W L; Murre, C

    1999-01-18

    A key feature of B and T lymphocyte development is the generation of antigen receptors through the rearrangement and assembly of the germline variable (V), diversity (D), and joining (J) gene segments. However, the mechanisms responsible for regulating developmentally ordered gene rearrangements are largely unknown. Here we show that the E2A gene products are essential for the proper coordinated temporal regulation of V(D)J rearrangements within the T cell receptor (TCR) gamma and delta loci. Specifically, we show that E2A is required during adult thymocyte development to inhibit rearrangements to the gamma and delta V regions that normally recombine almost exclusively during fetal thymocyte development. The continued rearrangement of the fetal Vgamma3 gene segment in E2A-deficient adult thymocytes correlates with increased levels of Vgamma3 germline transcripts and increased levels of double-stranded DNA breaks at the recombination signal sequence bordering Vgamma3. Additionally, rearrangements to a number of Vgamma and Vdelta gene segments used predominantly during adult development are significantly reduced in E2A-deficient thymocytes. Interestingly, at distinct stages of T lineage development, both the increased and decreased rearrangement of particular Vdelta gene segments is highly sensitive to the dosage of the E2A gene products, suggesting that the concentration of the E2A proteins is rate limiting for the recombination reaction involving these Vdelta regions.

  9. Foundational Principles for Large-Scale Inference: Illustrations Through Correlation Mining.

    PubMed

    Hero, Alfred O; Rajaratnam, Bala

    2016-01-01

    When can reliable inference be drawn in the "Big Data" context? This paper presents a framework for answering this fundamental question in the context of correlation mining, with implications for general large scale inference. In large scale data applications like genomics, connectomics, and eco-informatics the dataset is often variable-rich but sample-starved: a regime where the number n of acquired samples (statistical replicates) is far fewer than the number p of observed variables (genes, neurons, voxels, or chemical constituents). Much of recent work has focused on understanding the computational complexity of proposed methods for "Big Data". Sample complexity however has received relatively less attention, especially in the setting when the sample size n is fixed, and the dimension p grows without bound. To address this gap, we develop a unified statistical framework that explicitly quantifies the sample complexity of various inferential tasks. Sampling regimes can be divided into several categories: 1) the classical asymptotic regime where the variable dimension is fixed and the sample size goes to infinity; 2) the mixed asymptotic regime where both variable dimension and sample size go to infinity at comparable rates; 3) the purely high dimensional asymptotic regime where the variable dimension goes to infinity and the sample size is fixed. Each regime has its niche but only the latter regime applies to exascale data dimension. We illustrate this high dimensional framework for the problem of correlation mining, where it is the matrix of pairwise and partial correlations among the variables that are of interest. Correlation mining arises in numerous applications and subsumes the regression context as a special case. We demonstrate various regimes of correlation mining based on the unifying perspective of high dimensional learning rates and sample complexity for different structured covariance models and different inference tasks.
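
    The sample-starved regime described above (n fixed, p growing) can be illustrated with a small simulation. This is only a sketch under stated assumptions, not the authors' framework: the function names, the Gaussian noise model, and the values of n and p are all hypothetical. It draws p mutually independent variables with n samples each and reports the largest absolute pairwise sample correlation, which tends to approach 1 as p grows even though no true correlation exists.

```python
import math
import random


def pearson(x, y):
    """Sample Pearson correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def max_spurious_correlation(n, p, seed=0):
    """Largest |r| among all pairs of p independent Gaussian variables,
    each observed n times. Any large value is spurious by construction.
    (Hypothetical helper for illustration only.)"""
    rng = random.Random(seed)
    # One row per variable; rows are independent of each other.
    data = [[rng.gauss(0.0, 1.0) for _ in range(n)] for _ in range(p)]
    best = 0.0
    for i in range(p):
        for j in range(i + 1, p):
            best = max(best, abs(pearson(data[i], data[j])))
    return best


if __name__ == "__main__":
    for p in (10, 50, 200):
        print(p, round(max_spurious_correlation(n=5, p=p), 3))
```

    With the same seed, the variables generated for a smaller p are a subset of those generated for a larger p, so the reported maximum cannot decrease as p increases.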

  10. Importance of Viral Sequence Length and Number of Variable and Informative Sites in Analysis of HIV Clustering.

    PubMed

    Novitsky, Vlad; Moyo, Sikhulile; Lei, Quanhong; DeGruttola, Victor; Essex, M

    2015-05-01

    To improve the methodology of HIV cluster analysis, we addressed how analysis of HIV clustering is associated with parameters that can affect the outcome of viral clustering. The extent of HIV clustering and tree certainty was compared between 401 HIV-1C near full-length genome sequences and subgenomic regions retrieved from the LANL HIV Database. Sliding window analysis was based on 99 windows of 1,000 bp and 45 windows of 2,000 bp. Potential associations between the extent of HIV clustering and sequence length and the number of variable and informative sites were evaluated. The near full-length genome HIV sequences showed the highest extent of HIV clustering and the highest tree certainty. At the bootstrap threshold of 0.80 in maximum likelihood (ML) analysis, 58.9% of near full-length HIV-1C sequences but only 15.5% of partial pol sequences (ViroSeq) were found in clusters. Among HIV-1 structural genes, pol showed the highest extent of clustering (38.9% at a bootstrap threshold of 0.80), although it was significantly lower than in the near full-length genome sequences. The extent of HIV clustering was significantly higher for sliding windows of 2,000 bp than 1,000 bp. We found a strong association between the sequence length and proportion of HIV sequences in clusters, and a moderate association between the number of variable and informative sites and the proportion of HIV sequences in clusters. In HIV cluster analysis, the extent of detectable HIV clustering is directly associated with the length of viral sequences used, as well as the number of variable and informative sites. Near full-length genome sequences could provide the most informative HIV cluster analysis. Selected subgenomic regions with a high extent of HIV clustering and high tree certainty could also be considered as a second choice.

  11. Importance of Viral Sequence Length and Number of Variable and Informative Sites in Analysis of HIV Clustering

    PubMed Central

    Novitsky, Vlad; Moyo, Sikhulile; Lei, Quanhong; DeGruttola, Victor

    2015-01-01

    Abstract To improve the methodology of HIV cluster analysis, we addressed how analysis of HIV clustering is associated with parameters that can affect the outcome of viral clustering. The extent of HIV clustering and tree certainty was compared between 401 HIV-1C near full-length genome sequences and subgenomic regions retrieved from the LANL HIV Database. Sliding window analysis was based on 99 windows of 1,000 bp and 45 windows of 2,000 bp. Potential associations between the extent of HIV clustering and sequence length and the number of variable and informative sites were evaluated. The near full-length genome HIV sequences showed the highest extent of HIV clustering and the highest tree certainty. At the bootstrap threshold of 0.80 in maximum likelihood (ML) analysis, 58.9% of near full-length HIV-1C sequences but only 15.5% of partial pol sequences (ViroSeq) were found in clusters. Among HIV-1 structural genes, pol showed the highest extent of clustering (38.9% at a bootstrap threshold of 0.80), although it was significantly lower than in the near full-length genome sequences. The extent of HIV clustering was significantly higher for sliding windows of 2,000 bp than 1,000 bp. We found a strong association between the sequence length and proportion of HIV sequences in clusters, and a moderate association between the number of variable and informative sites and the proportion of HIV sequences in clusters. In HIV cluster analysis, the extent of detectable HIV clustering is directly associated with the length of viral sequences used, as well as the number of variable and informative sites. Near full-length genome sequences could provide the most informative HIV cluster analysis. Selected subgenomic regions with a high extent of HIV clustering and high tree certainty could also be considered as a second choice. PMID:25560745

  12. Quantity-activity relationship of denitrifying bacteria and environmental scaling in streams of a forested watershed

    USGS Publications Warehouse

    O'Connor, B.L.; Hondzo, Miki; Dobraca, D.; LaPara, T.M.; Finlay, J.A.; Brezonik, P.L.

    2006-01-01

    The spatial variability of subreach denitrification rates in streams was evaluated with respect to controlling environmental conditions, molecular examination of denitrifying bacteria, and dimensional analysis. Denitrification activities ranged from 0 to 800 ng-N g_sed^-1 d^-1 with large variations observed within short distances (<50 m) along stream reaches. A log-normal probability distribution described the range in denitrification activities and was used to define low (16% of the probability distribution), medium (68%), and high (16%) denitrification potential groups. Denitrifying bacteria were quantified using a competitive polymerase chain reaction (cPCR) technique that amplified the nirK gene that encodes nitrite reductase. Results showed a range of nirK quantities from 10^3 to 10^7 gene-copy-number g_sed^-1. A nonparametric statistical test showed no significant difference in nirK quantities among stream reaches, but revealed that samples with a high denitrification potential had significantly higher nirK quantities. Denitrification activity was positively correlated with nirK quantities with scatter in the data that can be attributed to varying environmental conditions along stream reaches. Dimensional analysis was used to evaluate denitrification activities according to environmental variables that describe fluid-flow properties, nitrate and organic material quantities, and dissolved oxygen flux. Buckingham's pi theorem was used to generate dimensionless groupings and field data were used to determine scaling parameters. The resulting expressions between dimensionless NO3^- flux and dimensionless groupings of environmental variables showed consistent scaling, which indicates that the subreach variability in denitrification rates can be predicted by the controlling physical, chemical, and microbiological conditions. Copyright 2006 by the American Geophysical Union.
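
    The 16% / 68% / 16% split matches, approximately, the probability mass below, within, and above one standard deviation of the mean of a normal distribution. One plausible reading, which the abstract does not confirm, is therefore that the groups were cut at the log-mean plus or minus one log-standard deviation. The sketch below implements that assumed rule; the function name and the example rates are hypothetical, and zero rates (which the abstract reports but which have no logarithm) are assumed to be handled separately.

```python
import math
import statistics


def classify_denitrification(rates):
    """Label each strictly positive rate as 'low', 'medium' or 'high'.

    Assumed rule (not stated in the source): cut points are the mean
    of the log-transformed rates minus/plus one sample standard
    deviation of the log-transformed rates.
    """
    logs = [math.log(r) for r in rates]
    mu = statistics.mean(logs)
    sigma = statistics.stdev(logs)
    groups = []
    for value in logs:
        if value < mu - sigma:
            groups.append("low")
        elif value > mu + sigma:
            groups.append("high")
        else:
            groups.append("medium")
    return groups


# Hypothetical rates in ng-N per gram of sediment per day.
example_rates = [5.0, 40.0, 55.0, 60.0, 75.0, 90.0, 700.0]
example_groups = classify_denitrification(example_rates)

if __name__ == "__main__":
    for rate, group in zip(example_rates, example_groups):
        print(rate, group)
```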

  13. Insights on the functional impact of microRNAs present in autism-associated copy number variants.

    PubMed

    Vaishnavi, Varadarajan; Manikandan, Mayakannan; Tiwary, Basant K; Munirajan, Arasambattu Kannan

    2013-01-01

    Autism spectrum disorder is a complex neurodevelopmental disorder that appears during the first three years of infancy and lasts throughout a person's life. Recently a large category of genomic structural variants, denoted as copy number variants (CNVs), were established to be a major contributor to the pathophysiology of autism. To date almost all studies have focussed only on the genes present in the CNV loci, but the impact of non-coding regulatory microRNAs (miRNAs) present in these regions remains largely unexplored. Hence we attempted to elucidate the biological and functional significance of miRNAs present in autism-associated CNV loci and their target genes by using a series of computational tools. We demonstrate that nearly 11% of the CNV loci harbor miRNAs and a few of these miRNAs were previously reported to be associated with autism. A systematic analysis of the CNV-miRNAs based on their interactions with the target genes enabled the identification of the top 10 miRNAs, namely hsa-miR-590-3p, hsa-miR-944, hsa-miR-570, hsa-miR-34a, hsa-miR-124, hsa-miR-548f, hsa-miR-429, hsa-miR-200b, hsa-miR-195 and hsa-miR-497, as hub molecules. Further, the CNV-miRNAs formed a regulatory loop with transcription factors and their downstream target genes, and annotation of these target genes indicated their functional involvement in neurodevelopment and synapse. Moreover, miRNAs present in deleted and duplicated CNV loci may explain the difference in dosage of the crucial genes controlled by them. These CNV-miRNAs can also impair the global processing and biogenesis of all miRNAs by targeting key molecules in the miRNA pathway. To our knowledge, this is the first report to highlight the significance of CNV-microRNAs and their target genes in contributing to the genetic heterogeneity and phenotypic variability of autism.

  14. Successful Recovery of Nuclear Protein-Coding Genes from Small Insects in Museums Using Illumina Sequencing.

    PubMed

    Kanda, Kojun; Pflug, James M; Sproul, John S; Dasenko, Mark A; Maddison, David R

    2015-01-01

    In this paper we explore high-throughput Illumina sequencing of nuclear protein-coding, ribosomal, and mitochondrial genes in small, dried insects stored in natural history collections. We sequenced one tenebrionid beetle and 12 carabid beetles ranging in size from 3.7 to 9.7 mm in length that have been stored in various museums for 4 to 84 years. Although we chose a number of old, small specimens for which we expected low sequence recovery, we successfully recovered at least some low-copy nuclear protein-coding genes from all specimens. For example, in one 56-year-old beetle, 4.4 mm in length, our de novo assembly recovered about 63% of approximately 41,900 nucleotides in a target suite of 67 nuclear protein-coding gene fragments, and 70% using a reference-based assembly. Even in the least successfully sequenced carabid specimen, reference-based assembly yielded fragments that were at least 50% of the target length for 34 of 67 nuclear protein-coding gene fragments. Exploration of alternative references for reference-based assembly revealed few signs of bias created by the reference. For all specimens we recovered almost complete copies of ribosomal and mitochondrial genes. We verified the general accuracy of the sequences through comparisons with sequences obtained from PCR and Sanger sequencing, including of conspecific, fresh specimens, and through phylogenetic analysis that tested the placement of sequences in predicted regions. A few possible inaccuracies in the sequences were detected, but these rarely affected the phylogenetic placement of the samples. 
    Although our sample sizes are low, an exploratory regression study suggests that the dominant factor in predicting success at recovering nuclear protein-coding genes is a high number of Illumina reads, with success at PCR of COI and killing by immersion in ethanol being secondary factors; in analyses of only high-read samples, the primary significant explanatory variable was body length, with small beetles being more successfully sequenced.

  15. Genomic Porosity between Invasive Chondrostoma nasus and Endangered Endemic Parachondrostoma toxostoma (Cyprinidae): The Evolution of MHC IIB Genes

    PubMed Central

    Šimková, Andrea; Civáňová, Kristína; Gettová, Lenka; Gilles, André

    2013-01-01

    Two cyprinid species, Parachondrostoma toxostoma, an endemic threatened species, and Chondrostoma nasus, an invasive species, live in sympatry in southern France and form two sympatric zones where the presence of intergeneric hybrids is reported. To estimate the potential threat to endemic species linked to the introduction of invasive species, we focused on the DAB genes (functional MHC IIB genes) because of their adaptive significance and role in parasite resistance. More specifically, we investigated (1) the variability of MHC IIB genes, (2) the selection pattern shaping MHC polymorphism, and (3) the extent to which trans-species evolution and intergeneric hybridization affect MHC polymorphism. In sympatric areas, the native species has more diversified MHC IIB genes when compared to the invasive species, probably resulting from the different origins and dispersal of both species. A similar level of MHC polymorphism was found at population level in both species, suggesting similar mechanisms generating MHC diversity. In contrast, a higher number of DAB-like alleles per specimen were found in invasive species. Invasive species tended to express the alleles of two DAB lineages, whilst native species tended to express the alleles of only the DAB3 lineage. Hybrids have a pattern of MHC expression intermediate between both species. Whilst positive selection acting on peptide binding sites (PBS) was demonstrated in both species, a slightly higher number of positively selected sites were identified in C. nasus, which could result from parasite-mediated selection. Bayesian clustering analysis revealed a similar pattern of structuring for the genetic variation when using microsatellites or the MHC approach. We confirmed the importance of trans-species evolution for MHC polymorphism. In addition, we demonstrated bidirectional gene flow for MHC IIB genes in sympatric areas. 
    The positive significant correlation between MHC and microsatellites suggests that demographic factors may contribute to MHC variation on a short time scale. PMID:23824831

  16. Successful Recovery of Nuclear Protein-Coding Genes from Small Insects in Museums Using Illumina Sequencing

    PubMed Central

    Dasenko, Mark A.

    2015-01-01

    In this paper we explore high-throughput Illumina sequencing of nuclear protein-coding, ribosomal, and mitochondrial genes in small, dried insects stored in natural history collections. We sequenced one tenebrionid beetle and 12 carabid beetles ranging in size from 3.7 to 9.7 mm in length that have been stored in various museums for 4 to 84 years. Although we chose a number of old, small specimens for which we expected low sequence recovery, we successfully recovered at least some low-copy nuclear protein-coding genes from all specimens. For example, in one 56-year-old beetle, 4.4 mm in length, our de novo assembly recovered about 63% of approximately 41,900 nucleotides in a target suite of 67 nuclear protein-coding gene fragments, and 70% using a reference-based assembly. Even in the least successfully sequenced carabid specimen, reference-based assembly yielded fragments that were at least 50% of the target length for 34 of 67 nuclear protein-coding gene fragments. Exploration of alternative references for reference-based assembly revealed few signs of bias created by the reference. For all specimens we recovered almost complete copies of ribosomal and mitochondrial genes. We verified the general accuracy of the sequences through comparisons with sequences obtained from PCR and Sanger sequencing, including of conspecific, fresh specimens, and through phylogenetic analysis that tested the placement of sequences in predicted regions. A few possible inaccuracies in the sequences were detected, but these rarely affected the phylogenetic placement of the samples. 
    Although our sample sizes are low, an exploratory regression study suggests that the dominant factor in predicting success at recovering nuclear protein-coding genes is a high number of Illumina reads, with success at PCR of COI and killing by immersion in ethanol being secondary factors; in analyses of only high-read samples, the primary significant explanatory variable was body length, with small beetles being more successfully sequenced. PMID:26716693

  17. The carriers of the A/G-G/G allelic combination of the c.2039 A>G and c.-29 G>A FSH receptor polymorphisms retrieve the highest number of oocytes in IVF/ICSI cycles.

    PubMed

    Allegra, Adolfo; Marino, Angelo; Raimondo, Stefania; Maiorana, Antonio; Gullo, Salvatore; Scaglione, Piero; Volpes, Aldo; Alessandro, Riccardo

    2017-02-01

    The objective of this study was the elucidation of the possible role of the single-nucleotide polymorphisms (SNP) at position -29 and 2039 of the FSH receptor gene (FSHR) as independent predictive markers of ovarian response. Indeed, the tailoring of reproductive treatments is crucial for both maximizing the success of IVF patients and obtaining a reduction in hypo- or hyper-response rates. This prospective, observational study analyzed the association of -29 and 2039 FSHR polymorphisms with the number of retrieved oocytes in 140 patients attending an IVF/ICSI cycle for severe male factors (≤5,000,000 spermatozoa/mL) or tubal factors at the ANDROS Day Surgery Clinic, Palermo, Italy. The results of this study demonstrate that the genetic combination of A/G for polymorphism c.2039 A>G with G/G for polymorphism c.-29 G>A is significantly associated with the highest number of collected oocytes (p = 0.03). This association was significant even after controlling for the effect of other clinical variables. The A/G-G/G allelic variant, identified as an independent variable, if confirmed in a larger number of patients, could be considered as a new genetic biomarker, which could increase the efficacy of prediction models for ovarian stimulation.

  18. Variegated clonality and rapid emergence of new molecular lesions in xenografts of acute lymphoblastic leukemia are associated with drug resistance.

    PubMed

    Nowak, Daniel; Liem, Natalia L M; Mossner, Maximilian; Klaumünzer, Marion; Papa, Rachael A; Nowak, Verena; Jann, Johann C; Akagi, Tadayuki; Kawamata, Norihiko; Okamoto, Ryoko; Thoennissen, Nils H; Kato, Motohiro; Sanada, Masashi; Hofmann, Wolf-Karsten; Ogawa, Seishi; Marshall, Glenn M; Lock, Richard B; Koeffler, H Phillip

    2015-01-01

    The use of genome-wide copy-number analysis and massive parallel sequencing has revolutionized the understanding of the clonal architecture of pediatric acute lymphoblastic leukemia (ALL) by demonstrating that this disease is composed of highly variable clonal ancestries following the rules of Darwinian selection. The current study aimed to analyze the molecular composition of childhood ALL biopsies and patient-derived xenografts with particular emphasis on mechanisms associated with acquired chemoresistance. Genomic DNA from seven primary pediatric ALL patient samples, 29 serially passaged xenografts, and six in vivo selected chemoresistant xenografts were analyzed with 250K single-nucleotide polymorphism arrays. Copy-number analysis of non-drug-selected xenografts confirmed a highly variable molecular pattern of variegated subclones. Whereas primary patient samples from initial diagnosis displayed a mean of 5.7 copy-number alterations per sample, serially passaged xenografts contained a mean of 8.2 and chemoresistant xenografts a mean of 10.5 copy-number alterations per sample, respectively. Resistance to cytarabine was explained by a new homozygous deletion of the DCK gene, whereas methotrexate resistance was associated with monoallelic deletion of FPGS and mutation of the remaining allele. This study demonstrates that selecting for chemoresistance in xenografted human ALL cells can reveal novel mechanisms associated with drug resistance. Copyright © 2015 ISEH - International Society for Experimental Hematology. Published by Elsevier Inc. All rights reserved.

  19. A comparison of bootstrap methods and an adjusted bootstrap approach for estimating the prediction error in microarray classification.

    PubMed

    Jiang, Wenyu; Simon, Richard

    2007-12-20

    This paper first provides a critical review on some existing methods for estimating the prediction error in classifying microarray data where the number of genes greatly exceeds the number of specimens. Special attention is given to the bootstrap-related methods. When the sample size n is small, we find that all the reviewed methods suffer from either substantial bias or variability. We introduce a repeated leave-one-out bootstrap (RLOOB) method that predicts for each specimen in the sample using bootstrap learning sets of size ln. We then propose an adjusted bootstrap (ABS) method that fits a learning curve to the RLOOB estimates calculated with different bootstrap learning set sizes. The ABS method is robust across the situations we investigate and provides a slightly conservative estimate for the prediction error. Even with small samples, it does not suffer from large upward bias as the leave-one-out bootstrap and the 0.632+ bootstrap, and it does not suffer from large variability as the leave-one-out cross-validation in microarray applications. Copyright (c) 2007 John Wiley & Sons, Ltd.
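    The resampling idea described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the nearest-centroid classifier, the function names, the `learn_size` parameter (standing in for the learning-set size), and the toy data are all hypothetical. For each specimen, bootstrap learning sets are drawn with replacement from the remaining specimens, a classifier is trained on each, and the held-out specimen is predicted.

```python
import random


def nearest_centroid_predict(train_x, train_y, x):
    """Predict the label of x as the class with the closest centroid."""
    sums, counts = {}, {}
    for features, label in zip(train_x, train_y):
        acc = sums.setdefault(label, [0.0] * len(features))
        for j, value in enumerate(features):
            acc[j] += value
        counts[label] = counts.get(label, 0) + 1
    best_label, best_dist = None, float("inf")
    for label, acc in sums.items():
        centroid = [s / counts[label] for s in acc]
        dist = sum((a - b) ** 2 for a, b in zip(x, centroid))
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label


def repeated_loo_bootstrap_error(xs, ys, learn_size, n_boot=50, seed=0):
    """Average misclassification rate over specimens, each predicted from
    bootstrap learning sets drawn from the other specimens only."""
    rng = random.Random(seed)
    n = len(xs)
    per_specimen = []
    for i in range(n):
        others = [k for k in range(n) if k != i]
        errors = 0
        for _ in range(n_boot):
            # Sample with replacement, never including specimen i.
            idx = [rng.choice(others) for _ in range(learn_size)]
            pred = nearest_centroid_predict(
                [xs[k] for k in idx], [ys[k] for k in idx], xs[i]
            )
            errors += int(pred != ys[i])
        per_specimen.append(errors / n_boot)
    return sum(per_specimen) / n


# Hypothetical toy data: two well-separated classes.
toy_x = [[0.0, 0.0], [0.1, 0.0], [0.0, 0.1], [5.0, 5.0], [5.1, 5.0], [5.0, 5.1]]
toy_y = [0, 0, 0, 1, 1, 1]
toy_error = repeated_loo_bootstrap_error(toy_x, toy_y, learn_size=5)
```

    Varying `learn_size` and recording the resulting estimates would give the points to which a learning curve could then be fitted, which is the step the adjusted bootstrap adds.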

  20. The human clinical phenotypes of altered CHRNA7 copy number.

    PubMed

    Gillentine, Madelyn A; Schaaf, Christian P

    2015-10-15

    Copy number variants (CNVs) have been implicated in multiple neuropsychiatric conditions, including autism spectrum disorder (ASD), schizophrenia, and intellectual disability (ID). Chromosome 15q13 is a hotspot for such CNVs due to the presence of low copy repeat (LCR) elements, which facilitate non-allelic homologous recombination (NAHR). Several of these CNVs have been overrepresented in individuals with neuropsychiatric disorders; yet variable expressivity and incomplete penetrance are commonly seen. Dosage sensitivity of the CHRNA7 gene, which encodes for the α7 nicotinic acetylcholine receptor in the human brain, has been proposed to have a major contribution to the observed cognitive and behavioral phenotypes, as it represents the smallest region of overlap to all the 15q13.3 deletions and duplications. Individuals with zero to four copies of CHRNA7 have been reported in the literature, and represent a range of clinical severity, with deletions causing generally more severe and more highly penetrant phenotypes. Potential mechanisms to account for the variable expressivity within each group of 15q13.3 CNVs will be discussed. Copyright © 2015 Elsevier Inc. All rights reserved.

  1. Molecular characterization of Shiga-toxigenic Escherichia coli isolated from diverse sources from India by multi-locus variable number tandem repeat analysis (MLVA).

    PubMed

    Kumar, A; Taneja, N; Sharma, R K; Sharma, H; Ramamurthy, T; Sharma, M

    2014-12-01

    In a first study from India, a diverse collection of 140 environmental and clinical non-O157 Shiga-toxigenic Escherichia coli strains from a large geographical area in north India was typed by multi-locus variable number tandem repeat analysis (MLVA). The distribution of major virulence genes stx1, stx2 and eae was found to be 78%, 70% and 10%, respectively; 15 isolates were enterohaemorrhagic E. coli (stx1 +/stx2 + and eae +). By MLVA analysis, 44 different alleles were obtained. Dendrogram analysis revealed 104 different genotypes and 19 MLVA-type complexes divided into two main lineages, i.e. mutton and animal stool. Human isolates presented a statistically significant greater odds ratio for clustering with mutton samples compared to animal stool isolates. Five human isolates clustered with animal stool strains suggesting that some of the human infections may be from cattle, perhaps through milk, contact or the environment. Further epidemiological studies are required to explore these sources in context with occurrence of human cases.

  2. Copy-number and gene dependency analysis reveals partial copy loss of wild-type SF3B1 as a novel cancer vulnerability. | Office of Cancer Genomics

    Cancer.gov

    Genomic instability is a hallmark of human cancer, and results in widespread somatic copy number alterations. We used a genome-scale shRNA viability screen in human cancer cell lines to systematically identify genes that are essential in the context of particular copy-number alterations (copy-number associated gene dependencies). The most enriched class of copy-number associated gene dependencies was CYCLOPS (Copy-number alterations Yielding Cancer Liabilities Owing to Partial losS) genes, and spliceosome components were the most prevalent.

  3. Application of droplet digital PCR to determine copy number of endogenous genes and transgenes in sugarcane.

    PubMed

    Sun, Yue; Joyce, Priya Aiyar

    2017-11-01

    Droplet digital PCR, combined with the low-copy ACT allele as an endogenous reference gene, makes accurate and rapid estimation of gene copy number in Q208 A and Q240 A attainable. Sugarcane is an important cultivated crop with both high polyploidy and aneuploidy in its 10 Gb genome. Without a known copy number reference gene, it is difficult to accurately estimate the copy number of any gene of interest by PCR-based methods in sugarcane. Recently, a new technology known as droplet digital PCR (ddPCR) has been developed, which can measure the absolute amount of the target DNA in a given sample. In this study, we deduced the true copy number of three endogenous genes, actin depolymerizing factor (ADF), adenine phosphoribosyltransferase (APRT) and actin (ACT), in three Australian sugarcane varieties, using ddPCR by comparing the absolute amounts of the above genes with a transgene of known copy number. A single copy of the ACT allele was detected in Q208 A and two copies in Q240 A, but it was absent in Q117. Copy number variation was also observed for both APRT and ADF, and ranged from 9 to 11 in the three tested varieties. Using this newly developed ddPCR method, transgene copy number was successfully determined in 19 transgenic Q208 A and Q240 A events using ACT as the reference endogenous gene. Our study demonstrates that ddPCR can be used for high-throughput genetic analysis and is a quick, accurate and reliable alternative method for gene copy number determination in sugarcane. The ACT allele discovered here would be a suitable endogenous reference gene for future gene copy number variation and dosage studies of functional genes in Q208 A and Q240 A.
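    The abstract does not spell out the arithmetic, but ddPCR quantification is commonly based on a Poisson correction of the fraction of positive droplets, and a copy number follows from the ratio of target to reference concentrations. The sketch below illustrates that general calculation only; the function names and the droplet counts are hypothetical and are not taken from the study.

```python
import math


def copies_per_droplet(positive, total):
    """Mean template copies per droplet from the Poisson correction:
    lambda = -ln(1 - positive / total)."""
    if total <= 0 or not 0 <= positive < total:
        raise ValueError("need 0 <= positive < total")
    return -math.log(1.0 - positive / total)


def estimate_copy_number(target_pos, ref_pos, total, ref_copies=1):
    """Copy number of the target relative to a reference of known copy number,
    assuming both assays were read from the same number of droplets."""
    ref_lambda = copies_per_droplet(ref_pos, total)
    if ref_lambda == 0:
        raise ValueError("reference has no positive droplets")
    return ref_copies * copies_per_droplet(target_pos, total) / ref_lambda


# Hypothetical counts: 10,000 droplets, single-copy reference.
example_copies = estimate_copy_number(target_pos=3600, ref_pos=2000, total=10000)
```

    With these made-up counts the target concentration is twice that of the reference, so the estimate is close to two copies.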

  4. Role of Cell-to-Cell Variability in Activating a Positive Feedback Antiviral Response in Human Dendritic Cells

    PubMed Central

    Hu, Jianzhong; Nudelman, German; Shimoni, Yishai; Kumar, Madhu; Ding, Yaomei; López, Carolina; Hayot, Fernand; Wetmur, James G.; Sealfon, Stuart C.

    2011-01-01

    In the first few hours following Newcastle disease viral infection of human monocyte-derived dendritic cells, the induction of IFNB1 is extremely low and the secreted type I interferon response is below the detection limit of ELISA. However, many interferon-induced genes are activated at this time, for example DDX58 (RIGI), which in response to viral RNA induces IFNB1. We investigated whether the early induction of IFNB1 in only a small percentage of infected cells leads to low-level IFN secretion that then induces IFN-responsive genes in all cells. We developed an agent-based mathematical model to explore the IFNB1 and DDX58 temporal dynamics. Simulations showed that a small number of early responder cells provide a mechanism for efficient and controlled activation of the DDX58-IFNB1 positive feedback loop. The model predicted distributions of single cell responses that were confirmed by single cell mRNA measurements. The results suggest that large cell-to-cell variation plays an important role in the early innate immune response, and that the variability is essential for the efficient activation of the IFNB1-based feedback loop. PMID:21347441

  5. Careful accounting of extrinsic noise in protein expression reveals correlations among its sources

    NASA Astrophysics Data System (ADS)

    Cole, John A.; Luthey-Schulten, Zaida

    2017-06-01

    In order to grow and replicate, living cells must express a diverse array of proteins, but the process by which proteins are made includes a great deal of inherent randomness. Understanding this randomness—whether it arises from the discrete stochastic nature of chemical reactivity ("intrinsic" noise), or from cell-to-cell variability in the concentrations of molecules involved in gene expression, or from the timings of important cell-cycle events like DNA replication and cell division ("extrinsic" noise)—remains a challenge. In this article we analyze a model of gene expression that accounts for several extrinsic sources of noise, including those associated with chromosomal replication, cell division, and variability in the numbers of RNA polymerase, ribonuclease E, and ribosomes. We then attempt to fit our model to a large proteomics and transcriptomics data set and find that only through the introduction of a few key correlations among the extrinsic noise sources can we accurately recapitulate the experimental data. These include significant correlations between the rate of mRNA degradation (mediated by ribonuclease E) and the rates of both transcription (RNA polymerase) and translation (ribosomes) and, strikingly, an anticorrelation between the transcription and the translation rates themselves.

  6. Common polymorphic variation in the genetically diverse African insulin gene and its association with size at birth.

    PubMed

    Petry, Clive J; Rayco-Solon, Pura; Fulford, Anthony J C; Stead, John D H; Wingate, Dianne L; Ong, Ken K; Sirugo, Giorgio; Prentice, Andrew M; Dunger, David B

    2009-09-01

    The insulin variable number of tandem repeats (INS VNTR) has been variably associated with size at birth in non-African populations. Small size at birth is a major determinant of neonatal mortality, so the INS VNTR may influence survival. We therefore tested the hypothesis that genetic variation around the INS VNTR in a rural Gambian population, who experience seasonal variation in nutrition and subsequently birth weight, may be associated with foetal and early growth. Six polymorphisms flanking the INS VNTR were genotyped in over 2,500 people. Significant associations were detected between the maternally inherited SNP 27 (rs689) allele and birth length [effect size 17.5 (5.2-29.8) mm; P = 0.004; n = 361]. Significant associations were also found between the maternally inherited African-specific SNP 28 (rs5506) allele and post-natal weight gain [effect size 0.19 (0.05-0.32) z score points/year; P = 0.005; n = 728]. These results suggest that in the Gambian population studied there are associations between polymorphic variation in the genetically diverse INS gene and foetal and early growth characteristics, which contribute to overall polygenic associations with these traits.

  7. Microbial community structure and diversity in the soil spatial profile of 5-year-old Robinia pseudoacacia 'Idaho,' determined by 454 sequencing of the 16S RNA gene.

    PubMed

    Chang, Yanping; Bu, Xiangpan; Niu, Weibo; Xiu, Yu; Wang, Huafang

    2013-01-01

    Relatively little information is available regarding the variability of microbial communities inhabiting deeper soil layers. We investigated the distribution of soil microbial communities down to 1.2 m in 5-year-old Robinia pseudoacacia 'Idaho' soil by 454 sequencing of the 16S RNA gene. The average number of sequences per sample was 12,802. The Shannon and Chao 1 indices revealed various relative microbial abundances and even distribution of microbial diversity for all evaluated sample depths. The predicted diversity in the topsoil exceeded that of the corresponding subsoil. The changes in the relative abundance of the major soil bacterial phyla showed decreasing, increasing, or no consistent trends with respect to sampling depth. Despite their novelty, members of the new candidate phyla OD1 and TM7 were widespread. Environmental variables affecting the bacterial community within the environment appeared to differ from those reported previously, especially the lack of detectable effect from pH. Overall, we found that the overall relative abundance fluctuated with the physical and chemical properties of the soil, root system, and sampling depth. Such information may facilitate forest soil management.
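    For readers unfamiliar with the two indices named above, the sketch below computes them from a vector of per-taxon sequence counts using their standard textbook definitions. It is an illustration only: the abstract does not state which variants the authors used, the bias-corrected form of Chao1 is an assumption chosen here because it stays defined when there are no doubletons, and the function names and counts are hypothetical.

```python
import math


def shannon_index(counts):
    """Shannon diversity H = -sum(p_i * ln p_i) over taxa with nonzero counts."""
    total = sum(counts)
    if total == 0:
        raise ValueError("no sequences")
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)


def chao1_richness(counts):
    """Bias-corrected Chao1: S_obs + F1 * (F1 - 1) / (2 * (F2 + 1)),
    where F1 and F2 are the numbers of singleton and doubleton taxa."""
    observed = sum(1 for c in counts if c > 0)
    singletons = sum(1 for c in counts if c == 1)
    doubletons = sum(1 for c in counts if c == 2)
    return observed + singletons * (singletons - 1) / (2 * (doubletons + 1))


# Hypothetical counts for four taxa in one sample.
example_counts = [5, 1, 1, 2]
example_shannon = shannon_index(example_counts)
example_chao1 = chao1_richness(example_counts)
```

    Shannon reflects both richness and evenness of the observed taxa, whereas Chao1 uses the rare taxa to extrapolate how many taxa remain unseen.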

  8. The evolution of an osmotically inducible dps in the genus Streptomyces.

    PubMed

    Facey, Paul D; Hitchings, Matthew D; Williams, Jason S; Skibinski, David O F; Dyson, Paul J; Del Sol, Ricardo

    2013-01-01

    Dps proteins are found almost ubiquitously in bacterial genomes and there is now an appreciation of their multifaceted roles in various stress responses. Previous studies have shown that this family of proteins assembles into dodecamers and that their quaternary structure is critical to their function. Moreover, the number of dps genes per bacterial genome is variable, even amongst closely related species; however, for many genera this enigma is yet to be satisfactorily explained. We reconstruct the most probable evolutionary history of Dps in Streptomyces genomes. Typically, these bacteria encode more than one Dps protein. We offer the explanation that variation in the number of dps per genome among closely related Streptomyces can be explained by gene duplication or lateral acquisition, and the former preceded a subsequent shift in expression patterns for one of the resultant paralogs. We show that the genome of S. coelicolor encodes three Dps proteins including a tailless Dps. Our in vivo observations show that the tailless protein, unlike the other two Dps in S. coelicolor, does not readily oligomerise. Phylogenetic and bioinformatic analyses combined with expression studies indicate that in several Streptomyces species at least one Dps is significantly over-expressed during osmotic shock, but the identity of the ortholog varies. In silico analysis of dps promoter regions coupled with gene expression studies of duplicated dps genes shows that paralogous gene pairs are expressed differentially and this correlates with the presence of a sigB promoter. Lastly, we identify a rare novel clade of Dps and show that a representative of these proteins in S. coelicolor possesses a dodecameric quaternary structure of high stability.

  9. Expression of phytoene synthase1 and carotene desaturase crtI genes result in an increase in the total carotenoids content in transgenic elite wheat (Triticum aestivum L.).

    PubMed

    Cong, Ling; Wang, Cheng; Chen, Ling; Liu, Huijuan; Yang, Guangxiao; He, Guangyuan

    2009-09-23

    Dietary micronutrient deficiencies, such as the lack of vitamin A, are a major source of morbidity and mortality worldwide. Carotenoids in food can function as provitamin A in humans, while grains of Chinese elite wheat cultivars generally have low carotenoid contents. To increase the carotenoid contents in common wheat endosperm, transgenic wheat has been generated by expressing the maize y1 gene encoding phytoene synthase driven by an endosperm-specific 1Dx5 promoter in the elite wheat (Triticum aestivum L.) variety EM12, together with the bacterial phytoene desaturase crtI gene from Erwinia uredovora under the constitutive CaMV 35S promoter control. A clear increase of the carotenoid content was detected in the endosperms of transgenic wheat that visually showed a light yellow color. The total carotenoid content was increased up to 10.8-fold as compared with the nontransgenic EM12 cultivar. To test whether the variability of total carotenoid content in different transgenic lines was due to differences in the transgene copy number or expression pattern, Southern hybridization and semiquantitative reverse transcriptase polymerase chain reaction analyses were carried out. The results showed that transgene copy numbers and transcript levels did not associate well with carotenoid contents. The expression patterns of endogenous carotenoid genes, such as the phytoene synthases and carotene desaturases, were also investigated in wild-type and transgenic wheat lines. No significant changes in expression levels of these genes were detected in the transgenic endosperms, indicating that the increase in carotenoids in transgenic wheat endosperms resulted from the expression of transgenes.

  10. Community Composition of Nitrous Oxide Consuming Bacteria in the Oxygen Minimum Zone of the Eastern Tropical South Pacific

    PubMed Central

    Sun, Xin; Jayakumar, Amal; Ward, Bess B.

    2017-01-01

    The ozone-depleting and greenhouse gas, nitrous oxide (N2O), is mainly consumed by the microbially mediated anaerobic process, denitrification. N2O consumption is the last step in canonical denitrification, and is also the least O2 tolerant step. Community composition of total and active N2O consuming bacteria was analyzed based on total (DNA) and transcriptionally active (RNA) nitrous oxide reductase (nosZ) genes using a functional gene microarray. The total and active nosZ communities were dominated by a limited number of nosZ archetypes, affiliated with bacteria from marine, soil and marsh environments. In addition to nosZ genes related to those of known marine denitrifiers, atypical nosZ genes, related to those of soil bacteria that do not possess a complete denitrification pathway, were also detected, especially in surface waters. The community composition of the total nosZ assemblage was significantly different from the active assemblage. The community composition of the total nosZ assemblage was significantly different between coastal and off-shore stations. The low oxygen assemblages from both stations were similar to each other, while the higher oxygen assemblages were more variable. Community composition of the active nosZ assemblage was also significantly different between stations, and varied with N2O concentration but not O2. Notably, nosZ assemblages were not only present but also active in oxygenated seawater: the abundance of total and active nosZ bacteria from oxygenated surface water (indicated by nosZ gene copy number) was similar to or even larger than in anoxic waters, implying the potential for N2O consumption even in the oxygenated surface water. PMID:28702012

  11. Comprehensive Analysis of Mouse Bitter Taste Receptors Reveals Different Molecular Receptive Ranges for Orthologous Receptors in Mice and Humans.

    PubMed

    Lossow, Kristina; Hübner, Sandra; Roudnitzky, Natacha; Slack, Jay P; Pollastro, Federica; Behrens, Maik; Meyerhof, Wolfgang

    2016-07-15

    One key to animal survival is the detection and avoidance of potentially harmful compounds by their bitter taste. Variable numbers of taste 2 receptor genes expressed in the gustatory end organs enable bony vertebrates (Euteleostomi) to recognize numerous bitter chemicals. It is believed that the receptive ranges of bitter taste receptor repertoires match the profiles of bitter chemicals that the species encounter in their diets. Human and mouse genomes contain pairs of orthologous bitter receptor genes that have been conserved throughout evolution. Moreover, expansions in both lineages generated species-specific sets of bitter taste receptor genes. It is assumed that the orthologous bitter taste receptor genes mediate the recognition of bitter toxins relevant for both species, whereas the lineage-specific receptors enable the detection of substances differently encountered by mice and humans. By challenging 34 mouse bitter taste receptors with 128 prototypical bitter substances in a heterologous expression system, we identified cognate compounds for 21 receptors, 19 of which were previously orphan receptors. We have demonstrated that mouse taste 2 receptors, like their human counterparts, vary greatly in their breadth of tuning, ranging from very broadly to extremely narrowly tuned receptors. However, when compared with humans, mice possess fewer broadly tuned receptors and an elevated number of narrowly tuned receptors, supporting the idea that a large receptor repertoire is the basis for the evolution of specialized receptors. Moreover, we have demonstrated that sequence-orthologous bitter taste receptors have distinct agonist profiles. Species-specific gene expansions have enabled further diversification of bitter substance recognition spectra. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.

  12. Community Composition of Nitrous Oxide Consuming Bacteria in the Oxygen Minimum Zone of the Eastern Tropical South Pacific.

    PubMed

    Sun, Xin; Jayakumar, Amal; Ward, Bess B

    2017-01-01

    The ozone-depleting and greenhouse gas, nitrous oxide (N2O), is mainly consumed by the microbially mediated anaerobic process, denitrification. N2O consumption is the last step in canonical denitrification, and is also the least O2 tolerant step. Community composition of total and active N2O consuming bacteria was analyzed based on total (DNA) and transcriptionally active (RNA) nitrous oxide reductase (nosZ) genes using a functional gene microarray. The total and active nosZ communities were dominated by a limited number of nosZ archetypes, affiliated with bacteria from marine, soil and marsh environments. In addition to nosZ genes related to those of known marine denitrifiers, atypical nosZ genes, related to those of soil bacteria that do not possess a complete denitrification pathway, were also detected, especially in surface waters. The community composition of the total nosZ assemblage was significantly different from the active assemblage. The community composition of the total nosZ assemblage was significantly different between coastal and off-shore stations. The low oxygen assemblages from both stations were similar to each other, while the higher oxygen assemblages were more variable. Community composition of the active nosZ assemblage was also significantly different between stations, and varied with N2O concentration but not O2. Notably, nosZ assemblages were not only present but also active in oxygenated seawater: the abundance of total and active nosZ bacteria from oxygenated surface water (indicated by nosZ gene copy number) was similar to or even larger than in anoxic waters, implying the potential for N2O consumption even in the oxygenated surface water.

  13. T7. PHARMACOGENETIC OF TARDIVE DYSKINESIA -- A FOLLOW-UP ON THE VALBENAZINE TARGET VMAT2/SLC18A2

    PubMed Central

    Zai, Clement; Tiwari, Arun; Mueller, Daniel; Voineskos, Aristotle; Potkin, Steven G; Lieberman, Jeffrey; Meltzer, Herbert; Remington, Gary; Kennedy, James

    2018-01-01

    Background: Tardive dyskinesia (TD) is a motor side effect that may arise after long-term treatment with antipsychotic drugs. Its etiology is not well understood, but a number of risk factors have been associated with TD. TD occurrence appears to be familial, thus suggesting a genetic component. We previously reported an association between TD and the SLC18A2 gene, which codes for the vesicular monoamine transporter 2 (VMAT2) that packages monoamines including dopamine from the cytoplasm into synaptic vesicles (Zai et al, 2013). In the present study, we examined the dopamine transporter gene SLC6A3 by itself and in conjunction with SLC18A2 for possible association with TD. Methods: We genotyped and analyzed the variable-number tandem repeat (VNTR) polymorphism in the 3’ untranslated region of the SLC6A3 gene in our European sample of 187 schizophrenia/schizoaffective disorder patients assessed for TD occurrence based on the Abnormal Involuntary Movement Scale (AIMS). We also explored the interaction between the VNTR and the TD-associated SLC18A2 marker rs363224. Results: Our preliminary analysis did not show the SLC6A3 VNTR to be associated with TD occurrence or severity. There also appeared to be no significant interaction between SLC6A3 VNTR and SLC18A2 rs363224 in TD occurrence or severity (p>0.05). Discussion: Our findings did not support a major role of the dopamine transporter gene in TD risk or severity, but we will examine additional putative functional markers in this gene.

  14. Gene Introgression in Weeds Depends on Initial Gene Location in the Crop: Brassica napus-Raphanus raphanistrum Model.

    PubMed

    Adamczyk-Chauvat, Katarzyna; Delaunay, Sabrina; Vannier, Anne; François, Caroline; Thomas, Gwenaëlle; Eber, Frédérique; Lodé, Maryse; Gilet, Marie; Huteau, Virginie; Morice, Jérôme; Nègre, Sylvie; Falentin, Cyril; Coriton, Olivier; Darmency, Henri; Alrustom, Bachar; Jenczewski, Eric; Rousseau-Gueutin, Mathieu; Chèvre, Anne-Marie

    2017-07-01

    The effect of gene location within a crop genome on its transfer to a weed genome remains an open question for gene flow assessment. To elucidate this question, we analyzed advanced generations of intergeneric hybrids, derived from an initial pollination of known oilseed rape varieties (Brassica napus, AACC, 2n = 38) by a local population of wild radish (Raphanus raphanistrum, RrRr, 2n = 18). After five generations of recurrent pollination, 307 G5 plants with a chromosome number similar to wild radish were genotyped using 105 B. napus specific markers well distributed along the chromosomes. They revealed that 49.8% of G5 plants carried at least one B. napus genomic region. According to the frequency of B. napus markers (0-28%), four classes were defined: Class 1 (near zero frequency), with 75 markers covering ∼70% of oilseed rape genome; Class 2 (low frequency), with 20 markers located on 11 genomic regions; Class 3 (high frequency), with eight markers on three genomic regions; and Class 4 (higher frequency), with two adjacent markers detected on A10. Therefore, some regions of the oilseed rape genome are more prone than others to be introgressed into wild radish. Inheritance and growth of plant progeny revealed that genomic regions of oilseed rape could be stably introduced into wild radish and variably impact the plant fitness (plant height and seed number). Our results pinpoint that novel technologies enabling the targeted insertion of transgenes should select genomic regions that are less likely to be introgressed into the weed genome, thereby reducing gene flow. Copyright © 2017 by the Genetics Society of America.

  15. Social Context–Induced Song Variation Affects Female Behavior and Gene Expression

    PubMed Central

    Woolley, Sarah C; Doupe, Allison J

    2008-01-01

    Social cues modulate the performance of communicative behaviors in a range of species, including humans, and such changes can make the communication signal more salient. In songbirds, males use song to attract females, and song organization can differ depending on the audience to which a male sings. For example, male zebra finches (Taeniopygia guttata) change their songs in subtle ways when singing to a female (directed song) compared with when they sing in isolation (undirected song), and some of these changes depend on altered neural activity from a specialized forebrain-basal ganglia circuit, the anterior forebrain pathway (AFP). In particular, variable activity in the AFP during undirected song is thought to actively enable syllable variability, whereas the lower and less-variable AFP firing during directed singing is associated with more stereotyped song. Consequently, directed song has been suggested to reflect a “performance” state, and undirected song a form of vocal motor “exploration.” However, this hypothesis predicts that directed–undirected song differences, despite their subtlety, should matter to female zebra finches, which is a question that has not been investigated. We tested female preferences for this natural variation in song in a behavioral approach assay, and we found that both mated and socially naive females could discriminate between directed and undirected song, and strongly preferred directed song. These preferences, which appeared to reflect attention especially to aspects of song variability controlled by the AFP, were enhanced by experience, as they were strongest for mated females responding to their mate's directed songs. We then measured neural activity using expression of the immediate early gene product ZENK, and found that social context and song familiarity differentially modulated the number of ZENK-expressing cells in telencephalic auditory areas. Specifically, the number of ZENK-expressing cells in the caudomedial mesopallium (CMM) was most affected by whether a song was directed or undirected, whereas the caudomedial nidopallium (NCM) was most affected by whether a song was familiar or unfamiliar. Together these data demonstrate that females detect and prefer the features of directed song and suggest that high-level auditory areas including the CMM are involved in this social perception. PMID:18351801

  16. Understanding genetic regulatory networks

    NASA Astrophysics Data System (ADS)

    Kauffman, Stuart

    2003-04-01

    Random Boolean networks (RBNs) were introduced about 35 years ago as first crude models of genetic regulatory networks. RBNs consist of N on-off genes, connected by a randomly assigned regulatory wiring diagram where each gene has K inputs, and each gene is controlled by a randomly assigned Boolean function. This procedure samples at random from the ensemble of all possible NK Boolean networks. The central ideas are to study the typical, or generic, properties of this ensemble, and see 1) whether characteristic differences appear as K and biases in Boolean functions are introduced, and 2) whether a subclass of this ensemble has properties matching real cells. Such networks behave in an ordered or a chaotic regime, with a phase transition, "the edge of chaos", between the two regimes. Networks with continuous variables exhibit the same two regimes. Substantial evidence suggests that real cells are in the ordered regime. A key concept is that of an attractor. This is a reentrant trajectory of states of the network, called a state cycle. The central biological interpretation is that cell types are attractors. A number of properties differentiate the ordered and chaotic regimes. These include the size and number of attractors; the existence in the ordered regime of a percolating "sea" of genes frozen in the on or off state, with a remainder of isolated twinkling islands of genes; a power law distribution of avalanches of gene activity changes following perturbation to a single gene in the ordered regime, versus a similar power law distribution plus a spike of enormous avalanches of gene changes in the chaotic regime; and the existence of branching pathways of "differentiation" between attractors induced by perturbations in the ordered regime. Noise is a serious issue, since noise disrupts attractors. But numerical evidence suggests that attractors can be made very stable to noise, and meanwhile, metaplasias may be a biological manifestation of noise. As we learn more about the wiring diagram and constraints on rules controlling real genes, we can build refined ensembles reflecting these properties, study the generic properties of the refined ensembles, and hope to gain insight into the dynamics of real cells.
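    The construction described in this abstract (N on-off genes, K randomly chosen inputs per gene, one random Boolean function per gene, attractors as state cycles) can be illustrated with a minimal sketch. It assumes synchronous updating and distinct inputs per gene, neither of which is stated in the abstract, and the function names are illustrative only.

```python
import random


def make_rbn(n, k, seed=0):
    """Build a random Boolean network with n genes and k inputs per gene.

    Assumption: each gene draws k distinct input genes (requires k <= n).
    """
    rng = random.Random(seed)
    # Wiring diagram: for each gene, the indices of its k input genes.
    inputs = [rng.sample(range(n), k) for _ in range(n)]
    # One random truth table of length 2**k per gene.
    tables = [[rng.randint(0, 1) for _ in range(2 ** k)] for _ in range(n)]
    return inputs, tables


def step(state, inputs, tables):
    """Synchronously update every gene once (an assumed update scheme)."""
    new_state = []
    for gene_inputs, table in zip(inputs, tables):
        index = 0
        for j in gene_inputs:
            # Pack the input gene values into a truth-table index.
            index = (index << 1) | state[j]
        new_state.append(table[index])
    return tuple(new_state)


def find_attractor(state, inputs, tables):
    """Iterate until a state repeats; return (transient length, cycle length)."""
    first_seen = {}
    t = 0
    while state not in first_seen:
        first_seen[state] = t
        state = step(state, inputs, tables)
        t += 1
    transient = first_seen[state]
    return transient, t - transient


if __name__ == "__main__":
    inputs, tables = make_rbn(n=12, k=2, seed=1)
    start = tuple(random.Random(2).randint(0, 1) for _ in range(12))
    print(find_attractor(start, inputs, tables))
```

    Because the state space is finite and the update is deterministic, every trajectory must eventually enter a state cycle, which is why the loop in find_attractor always terminates.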

  17. Clustering Genes of Common Evolutionary History

    PubMed Central

    Gori, Kevin; Suchan, Tomasz; Alvarez, Nadir; Goldman, Nick; Dessimoz, Christophe

    2016-01-01

    Phylogenetic inference can potentially result in a more accurate tree using data from multiple loci. However, if the loci are incongruent—due to events such as incomplete lineage sorting or horizontal gene transfer—it can be misleading to infer a single tree. To address this, many previous contributions have taken a mechanistic approach, by modeling specific processes. Alternatively, one can cluster loci without assuming how these incongruencies might arise. Such “process-agnostic” approaches typically infer a tree for each locus and cluster these. There are, however, many possible combinations of tree distance and clustering methods; their comparative performance in the context of tree incongruence is largely unknown. Furthermore, because standard model selection criteria such as AIC cannot be applied to problems with a variable number of topologies, the issue of inferring the optimal number of clusters is poorly understood. Here, we perform a large-scale simulation study of phylogenetic distances and clustering methods to infer loci of common evolutionary history. We observe that the best-performing combinations are distances accounting for branch lengths followed by spectral clustering or Ward’s method. We also introduce two statistical tests to infer the optimal number of clusters and show that they strongly outperform the silhouette criterion, a general-purpose heuristic. We illustrate the usefulness of the approach by 1) identifying errors in a previous phylogenetic analysis of yeast species and 2) identifying topological incongruence among newly sequenced loci of the globeflower fly genus Chiastocheta. We release treeCl, a new program to cluster genes of common evolutionary history (http://git.io/treeCl). PMID:26893301
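    For orientation, the silhouette criterion that the abstract uses as its general-purpose baseline can be sketched as follows. This is a plain implementation of the standard mean silhouette over a precomputed distance matrix (for example, pairwise tree distances). It is not treeCl's interface and not the authors' statistical tests, and the toy distances are invented for illustration.

```python
def mean_silhouette(dist, labels):
    """Mean silhouette width for a clustering, given a symmetric distance matrix.

    Requires at least two clusters. Points in singleton clusters score 0.
    """
    clusters = {}
    for i, label in enumerate(labels):
        clusters.setdefault(label, []).append(i)

    total = 0.0
    for i, label in enumerate(labels):
        own = clusters[label]
        if len(own) == 1:
            continue  # silhouette of a singleton is defined as 0
        # a: mean distance to the other members of the same cluster
        a = sum(dist[i][j] for j in own if j != i) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster
        b = min(
            sum(dist[i][j] for j in members) / len(members)
            for other, members in clusters.items()
            if other != label
        )
        if max(a, b) > 0:
            total += (b - a) / max(a, b)
    return total / len(labels)


# Toy distances between four loci: {0, 1} and {2, 3} form two tight groups.
toy = [
    [0, 1, 10, 10],
    [1, 0, 10, 10],
    [10, 10, 0, 1],
    [10, 10, 1, 0],
]
good = mean_silhouette(toy, [0, 0, 1, 1])
bad = mean_silhouette(toy, [0, 1, 0, 1])
```

    Under this heuristic, the candidate clustering with the highest mean silhouette would be chosen. The abstract reports that the authors' two tests outperform it for selecting the number of clusters.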

  18. Sequence polymorphisms at the growth hormone GH1/GH2-N and GH2-Z gene copies and their relationship with dairy traits in domestic sheep (Ovis aries).

    PubMed

    Vacca, G M; Dettori, M L; Balia, F; Luridiana, S; Mura, M C; Carcangiu, V; Pazzola, M

    2013-09-01

    The purpose was to analyze the growth hormone GH1/GH2-N and GH2-Z gene copies and to assess their possible association with milk traits in Sarda sheep. Two hundred multiparous lactating ewes were monitored. The two gene copies were amplified separately and each was used as template for a nested PCR, to investigate single-strand conformation polymorphism (SSCP) of the 5'UTR, exon-1, exon-5 and 3'UTR DNA regions. SSCP analysis revealed marked differences in the number of polymorphic patterns between the two genes. Sequencing revealed five nucleotide changes at the GH1/GH2-N gene. Five nucleotide changes occurred at the GH2-Z gene: one was located in exon-5 (c.556G > A) and resulted in a putative amino acid substitution G186S. All the nucleotide changes were copy-specific, except c.*30delT, which was common to both GH1/GH2-N and GH2-Z. Variability in the promoter regions of each gene might affect the expression level, because the variable sites lie in potential transcription factor binding sites. Both gene copies influenced milk yield. A correlation with milk protein and casein content was also observed. These results may be useful for future breeding strategies in dairy sheep.
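    As a consistency check on the reported substitution, the sketch below maps coding position 556 to its codon and tests which glycine codons give serine after a G > A change at that position. It assumes HGVS-style numbering (c.1 is the A of the start codon) and the standard genetic code. The helper names are illustrative and do not come from the paper.

```python
def codon_position(cds_pos):
    """Map a 1-based coding-sequence position to (codon number, base within codon)."""
    codon = (cds_pos - 1) // 3 + 1
    base_in_codon = (cds_pos - 1) % 3 + 1
    return codon, base_in_codon


# Standard genetic code, restricted to the codons needed here.
GLYCINE_CODONS = ["GGT", "GGC", "GGA", "GGG"]
AGN_CODONS = {"AGT": "S", "AGC": "S", "AGA": "R", "AGG": "R"}


def first_base_g_to_a(codon):
    """Apply a G>A change at the first base of a codon that starts with G."""
    assert codon[0] == "G"
    return "A" + codon[1:]


codon, base = codon_position(556)
# Amino acid produced from each possible glycine codon after the change.
outcomes = {c: AGN_CODONS[first_base_g_to_a(c)] for c in GLYCINE_CODONS}
```

    Position 556 is the first base of codon 186, which matches the residue number in G186S. A G > A change at the first base gives serine only from GGT or GGC (GGA and GGG would give arginine), so the report is consistent with one of those two codons. The abstract does not state which.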

  19. The Mitochondrial Cytochrome Oxidase Subunit I Gene Occurs on a Minichromosome with Extensive Heteroplasmy in Two Species of Chewing Lice, Geomydoecus aurei and Thomomydoecus minor

    PubMed Central

    Pietan, Lucas L.; Spradling, Theresa A.

    2016-01-01

    In animals, mitochondrial DNA (mtDNA) typically occurs as a single circular chromosome with 13 protein-coding genes and 22 tRNA genes. The various species of lice examined previously, however, have shown mitochondrial genome rearrangements with a range of chromosome sizes and numbers. Our research demonstrates that the mitochondrial genomes of two species of chewing lice found on pocket gophers, Geomydoecus aurei and Thomomydoecus minor, are fragmented with the 1,536 base-pair (bp) cytochrome-oxidase subunit I (cox1) gene occurring as the only protein-coding gene on a 1,916–1,964 bp minicircular chromosome in the two species, respectively. The cox1 gene of T. minor begins with an atypical start codon, while that of G. aurei does not. Components of the non-protein coding sequence of G. aurei and T. minor include a tRNA (isoleucine) gene, inverted repeat sequences consistent with origins of replication, and an additional non-coding region that is smaller than the non-coding sequence of other lice with such fragmented mitochondrial genomes. Sequences of cox1 minichromosome clones for each species reveal extensive length and sequence heteroplasmy in both coding and noncoding regions. The highly variable non-gene regions of G. aurei and T. minor have little sequence similarity with one another except for a 19-bp region of phylogenetically conserved sequence with unknown function. PMID:27589589

  20. Independence of heritable influences on the food intake of free-living humans.

    PubMed

    de Castro, John M

    2002-01-01

    The time of day of meal ingestion, the number of people present at the meal, the subjective state of hunger, and the estimated before-meal contents in the stomach have been established as influences on the amount eaten in a meal and these influences have been shown to be heritable. Because these factors intercorrelate, the calculated heritabilities for some of these variables might result indirectly from their covariation with one of the other heritable variables. The independence of the heritability of the influence of these four factors was investigated with 110 identical and 102 fraternal same-sex and 53 fraternal mixed-sex adult twin pairs who were paid to maintain 7-d food-intake diaries. From the diary reports, the meal sizes were calculated and subjected to multiple regression analysis using the estimated before-meal stomach contents, the reported number of other people present, the subjective hunger ratings, and the time of day of the meal as predictors. Linear structural modeling was applied to the beta-coefficients from the multiple regression to investigate whether the heritability of the influences of these four variables was independent. Significant genetic effects were found for the beta-coefficients for all four variables, indicating that their relationships with intake are heritable and to some extent independent of one another. This suggests that the influences of multiple factors on intake are affected by genes and become part of the total package of genetically determined physiologic, sociocultural, and psychological processes that regulate energy balance.
