Science.gov

Sample records for family genome-wide chip

  1. Genome-wide family-based linkage analysis of exome chip variants and cardiometabolic risk.

    PubMed

    Hellwege, Jacklyn N; Palmer, Nicholette D; Raffield, Laura M; Ng, Maggie C Y; Hawkins, Gregory A; Long, Jirong; Lorenzo, Carlos; Norris, Jill M; Ida Chen, Y-D; Speliotes, Elizabeth K; Rotter, Jerome I; Langefeld, Carl D; Wagenknecht, Lynne E; Bowden, Donald W

    2014-05-01

    Linkage analysis of complex traits has had limited success in identifying trait-influencing loci. Recently, coding variants have been implicated as the basis for some biomedical associations. We tested whether coding variants are the basis for linkage peaks of complex traits in 42 African-American (n = 596) and 90 Hispanic (n = 1,414) families in the Insulin Resistance Atherosclerosis Family Study (IRASFS) using Illumina HumanExome Beadchips. A total of 92,157 variants in African Americans (34%) and 81,559 (31%) in Hispanics were polymorphic and tested using two-point linkage and association analyses with 37 cardiometabolic phenotypes. In African Americans 77 LOD scores greater than 3 were observed. The highest LOD score was 4.91 with the APOE SNP rs7412 (MAF = 0.13) with plasma apolipoprotein B (ApoB). This SNP was associated with ApoB (P-value = 4 × 10(-19)) and accounted for 16.2% of the variance in African Americans. In Hispanic families, 104 LOD scores were greater than 3. The strongest evidence of linkage (LOD = 4.29) was with rs5882 (MAF = 0.46) in CETP with HDL. CETP variants were strongly associated with HDL (0.00049 < P-value <4.6 × 10(-12)), accounting for up to 4.5% of the variance. These loci have previously been shown to have effects on the biomedical traits evaluated here. Thus, evidence of strong linkage in this genome wide survey of primarily coding variants was uncommon. Loci with strong evidence of linkage was characterized by large contributions to the variance, and, in these cases, are common variants. Less compelling evidence of linkage and association was observed with additional loci that may require larger family sets to confirm.

  2. Family-Based Genome-Wide Association Scan of Attention-Deficit/Hyperactivity Disorder

    PubMed Central

    Mick, Eric; Todorov, Alexandre; Smalley, Susan; Hu, Xiaolan; Loo, Sandra; Todd, Richard D.; Biederman, Joseph; Byrne, Deirdre; Dechairo, Bryan; Guiney, Allan; McCracken, James; McGough, James; Nelson, Stanley F.; Reiersen, Angela M.; Wilens, Timothy E.; Wozniak, Janet; Neale, Benjamin M.; Faraone, Stephen V.

    2013-01-01

    Objective . Genes likely play a substantial role in the etiology of attention-deficit hyperactivity disorder (ADHD). However, the genetic architecture of the disorder is unknown, and prior genome-wide association studies have not identified a genome-wide significant association. We have conducted a third, independent multi-site GWAS of DSM-IV-TR ADHD. Method . Families were ascertained at Massachusetts General Hospital (MGH, N=309 trios), Washington University at St Louis (WASH-U, N=272 trios), and University of California at Los Angeles (UCLA, N=156 trios). Genotyping was conducted with the Illumina Human1M or Human1M-Duo BeadChip platforms. After applying quality control filters, association with ADHD was tested with 835,136 SNPs in 735 DSM-IV ADHD trios from 732 families. Results . Our smallest p-value (6.7E-07) did not reach the threshold for genome-wide statistical significance (5.0E-08) but one of the 20 most significant associations was located in a candidate gene of interest for ADHD, (SLC9A9, rs9810857, p=6.4E-6). We also conducted gene-based tests of candidate genes identified in the literature and found additional evidence of association with SLC9A9. Conclusion . We and our colleagues in the Psychiatric GWAS Consortium are working to pool together GWAS samples to establish the large data sets needed to follow-up on these results and to identify genes for ADHD and other disorders. PMID:20732626

  3. Genome-wide linkage analysis is a powerful prenatal diagnostic tool in families with unknown genetic defects.

    PubMed

    Arélin, Maria; Schulze, Bernt; Müller-Myhsok, Bertram; Horn, Denise; Diers, Alexander; Uhlenberg, Birgit; Nürnberg, Peter; Nürnberg, Gudrun; Becker, Christian; Mundlos, Stefan; Lindner, Tom H; Sperling, Karl; Hoffmann, Katrin

    2013-04-01

    Genome-wide linkage analysis is an established tool to map inherited diseases. To our knowledge it has not been used in prenatal diagnostics of any genetic disorder. We present a family with a severe recessive mental retardation syndrome, where the mother wished pregnancy termination to avoid delivering another affected child. By genome-wide scanning using the Affymetrix (Santa Clara, CA, USA) 10k chip we were able to establish the disease haplotype. Without knowing the exact genetic defect, we excluded the condition in the fetus. The woman finally gave birth to a healthy baby. We suggest that genome-wide linkage analysis--based on either SNP mapping or full-genome sequencing--is a very useful tool in prenatal diagnostics of diseases.

  4. Genome-wide analysis of TCP family in tobacco.

    PubMed

    Chen, L; Chen, Y Q; Ding, A M; Chen, H; Xia, F; Wang, W F; Sun, Y H

    2016-01-01

    The TCP family is a transcription factor family, members of which are extensively involved in plant growth and development as well as in signal transduction in the response against many physiological and biochemical stimuli. In the present study, 61 TCP genes were identified in tobacco (Nicotiana tabacum) genome. Bioinformatic methods were employed for predicting and analyzing the gene structure, gene expression, phylogenetic analysis, and conserved domains of TCP proteins in tobacco. The 61 NtTCP genes were divided into three diverse groups, based on the division of TCP genes in tomato and Arabidopsis, and the results of the conserved domain and sequence analyses further confirmed the classification of the NtTCP genes. The expression pattern of NtTCP also demonstrated that majority of these genes play important roles in all the tissues, while some special genes exercise their functions only in specific tissues. In brief, the comprehensive and thorough study of the TCP family in other plants provides sufficient resources for studying the structure and functions of TCPs in tobacco. PMID:27323069

  5. Spotting and validation of a genome wide oligonucleotide chip with duplicate measurement of each gene

    SciTech Connect

    Thomassen, Mads . E-mail: mads.thomassen@ouh.fyns-amt.dk; Skov, Vibe; Eiriksdottir, Freyja; Tan, Qihua; Jochumsen, Kirsten; Fritzner, Niels; Brusgaard, Klaus; Dahlgaard, Jesper; Kruse, Torben A.

    2006-06-16

    The quality of DNA microarray based gene expression data relies on the reproducibility of several steps in a microarray experiment. We have developed a spotted genome wide microarray chip with oligonucleotides printed in duplicate in order to minimise undesirable biases, thereby optimising detection of true differential expression. The validation study design consisted of an assessment of the microarray chip performance using the MessageAmp and FairPlay labelling kits. Intraclass correlation coefficient (ICC) was used to demonstrate that MessageAmp was significantly more reproducible than FairPlay. Further examinations with MessageAmp revealed the applicability of the system. The linear range of the chips was three orders of magnitude, the precision was high, as 95% of measurements deviated less than 1.24-fold from the expected value, and the coefficient of variation for relative expression was 13.6%. Relative quantitation was more reproducible than absolute quantitation and substantial reduction of variance was attained with duplicate spotting. An analysis of variance (ANOVA) demonstrated no significant day-to-day variation.

  6. A genome-wide quantitative trait loci scan of neurocognitive performances in families with schizophrenia.

    PubMed

    Lien, Y-J; Liu, C-M; Faraone, S V; Tsuang, M T; Hwu, H-G; Hsiao, P-C; Chen, W J

    2010-10-01

    Patients with schizophrenia frequently display neurocognitive dysfunction, and genetic studies suggest it to be an endophenotype for schizophrenia. Genetic studies of such traits may thus help elucidate the biological pathways underlying genetic susceptibility to schizophrenia. This study aimed to identify loci influencing neurocognitive performance in schizophrenia. The sample comprised of 1207 affected individuals and 1035 unaffected individuals of Han Chinese ethnicity from 557 sib-pair families co-affected with DSM-IV (Diagnostic and Statistical Manual, Fourth Edition) schizophrenia. Subjects completed a face-to-face semi-structured interview, the continuous performance test (CPT) and the Wisconsin card sorting test (WCST), and were genotyped with 386 microsatellite markers across the genome. A series of autosomal genome-wide multipoint nonparametric quantitative trait loci (QTL) linkage analysis were performed in affected individuals only. Determination of genome-wide empirical significance was performed using 1000 simulated genome scans. One linkage peak attaining genome-wide significance was identified: 12q24.32 for undegraded CPT hit rate [nonparametric linkage z (NPL-Z) scores = 3.32, genome-wide empirical P = 0.03]. This result was higher than the peak linkage signal obtained in the previous genome-wide scan using a dichotomous diagnosis of schizophrenia. The identification of 12q24.32 as a QTL has not been consistently implicated in previous linkage studies on schizophrenia, which suggests that the analysis of endophenotypes provides additional information from what is seen in analyses that rely on diagnoses. This region with linkage to a particular neurocognitive feature may inform functional hypotheses for further genetic studies for schizophrenia.

  7. Genome-wide analysis and identification of genes related to expansin gene family in indica rice.

    PubMed

    Hemalatha, N; Rajesh, M K; Narayanan, N K

    2011-01-01

    In this study, we carried out genome-wide analyses to explore expansin gene family in the genome of indica rice. Reference nucleotides were chosen as query sequences for searches in the indica rice genome database. Clones having genomic sequences similar to expansin were taken and converted to amino acid sequences. Putative sequences were subjected to PROSITE and Pfam databases, and 21 signature-sequences-related expansin gene family was obtained. The presence of transmembrane domains was also predicted for all 21 expansin proteins. A phylogenetic tree was generated from the alignments of the proteins sequences to examine the phylogenetic relationship of indica rice expansin proteins.

  8. Genome-wide analysis of homeobox gene family in legumes: identification, gene duplication and expression profiling.

    PubMed

    Bhattacharjee, Annapurna; Ghangal, Rajesh; Garg, Rohini; Jain, Mukesh

    2015-01-01

    Homeobox genes encode transcription factors that are known to play a major role in different aspects of plant growth and development. In the present study, we identified homeobox genes belonging to 14 different classes in five legume species, including chickpea, soybean, Medicago, Lotus and pigeonpea. The characteristic differences within homeodomain sequences among various classes of homeobox gene family were quite evident. Genome-wide expression analysis using publicly available datasets (RNA-seq and microarray) indicated that homeobox genes are differentially expressed in various tissues/developmental stages and under stress conditions in different legumes. We validated the differential expression of selected chickpea homeobox genes via quantitative reverse transcription polymerase chain reaction. Genome duplication analysis in soybean indicated that segmental duplication has significantly contributed in the expansion of homeobox gene family. The Ka/Ks ratio of duplicated homeobox genes in soybean showed that several members of this family have undergone purifying selection. Moreover, expression profiling indicated that duplicated genes might have been retained due to sub-functionalization. The genome-wide identification and comprehensive gene expression profiling of homeobox gene family members in legumes will provide opportunities for functional analysis to unravel their exact role in plant growth and development.

  9. Joint Analysis for Genome-Wide Association Studies in Family-Based Designs

    PubMed Central

    Sha, Qiuying; Zhang, Zhaogong; Zhang, Shuanglin

    2011-01-01

    In family-based data, association information can be partitioned into the between-family information and the within-family information. Based on this observation, Steen et al. (Nature Genetics. 2005, 683–691) proposed an interesting two-stage test for genome-wide association (GWA) studies under family-based designs which performs genomic screening and replication using the same data set. In the first stage, a screening test based on the between-family information is used to select markers. In the second stage, an association test based on the within-family information is used to test association at the selected markers. However, we learn from the results of case-control studies (Skol et al. Nature Genetics. 2006, 209–213) that this two-stage approach may be not optimal. In this article, we propose a novel two-stage joint analysis for GWA studies under family-based designs. For this joint analysis, we first propose a new screening test that is based on the between-family information and is robust to population stratification. This new screening test is used in the first stage to select markers. Then, a joint test that combines the between-family information and within-family information is used in the second stage to test association at the selected markers. By extensive simulation studies, we demonstrate that the joint analysis always results in increased power to detect genetic association and is robust to population stratification. PMID:21799758

  10. Genome-wide linkage analysis for celiac disease in North American families.

    PubMed

    Neuhausen, Susan L; Feolo, Mike; Camp, Nicola J; Farnham, James; Book, Linda; Zone, John J

    2002-07-22

    Celiac disease (CD) is an autoimmune disease caused by sensitivity to the dietary protein gluten. It has a prevalence of 1 in 250 in the United States. Multiple-case families are common with a risk to siblings from 10-12%. Previous linkage studies have found no significant evidence for linkage other than to HLA. In this study, we performed a genome-wide search on 62 families with at least two cases of CD to identify non-HLA loci for CD. Two-point and multipoint parametric and nonparametric analyses were performed on the entire set of families and on sets stratified by the HLA genotype. Accounting for multiple testing, we found genome-wide intermediate linkage evidence at 18q (heterogeneity LOD (HLOD) = 3.6) and at 3p (HLOD = 3.2) and suggestive evidence at 5p (HLOD = 2.7). Good consensus between two-point and multipoint evidence was not found, and after genotyping with new markers in these regions, our results were inconclusive. The 18q region had intermediate two-point evidence, but weak multipoint evidence. At 3p and 5p, the addition of follow-up markers added flanking support, yet multipoint evidence was still lacking. Our results indicate that multipoint analyses may be hindered by the complexity of CD. Multipoint analyses are not robust to model misspecification, and further development of models is needed. Additional study of these and other families is necessary to validate or rule out the regions implicated in this study.

  11. Genome-wide sequencing to identify the cause of hereditary cancer syndromes: with examples from familial pancreatic cancer.

    PubMed

    Roberts, Nicholas J; Klein, Alison P

    2013-11-01

    Advances in our understanding of the human genome and next-generation technologies have facilitated the use of genome-wide sequencing to decipher the genetic basis of Mendelian disease and hereditary cancer syndromes. However, the application of genome-wide sequencing in hereditary cancer syndromes has had mixed success, in part, due to complex nature of the underlying genetic architecture. In this review we discuss the use of genome-wide sequencing in both Mendelian diseases and hereditary cancer syndromes, highlighting the potential and challenges of this approach using familial pancreatic cancer as an example. PMID:23196058

  12. [Genome-wide analysis and functional prediction of the Trihelix transcription factor family in rice].

    PubMed

    Jianhui, Ji; Yingjun, Zhou; Hehe, Wu; Liming, Yang

    2015-12-01

    The Trihelix transcription factor family plays an essential role in plant growth, development and stress response. However, the studies about identification and analysis of this gene family in rice on the genome-wide level have not been reported. In this study, 31 members of the Trihelix family, which contain highly conserved and characteristic trihelix domain through sequence clustering and functional domains analysis, were identified in rice genome database using bioinformatic tools. These members could be classified into 5 subfamilies (I~V) based on the evolutionary relationship and domain characteristics. Clustering analyses of the Trihelix family in rice, Arabidopsis, Brachypodium distachyom and Sorghum bicolor showed that each species contained different members of subfamily although the classification of the Trihelix family were consistent in these four species, which indicated that the differentiation of the Trihelix gene family occur earlier than that of these species. The conserved motifs in the Trihelix family of rice analyzed using the MEME program were highly consistent with the results of clustering analyses. Intraspecific and interspecific chromosomal replication in partial Trihelix family members were found to exist in rice and between rice and other species through chromosome replication analysis. Microarray data analysis revealed diverse expression patterns of Trihelix family genes in different tissues of rice or in response to six different phytohormones. Moreover, 20 members of the Trihelix transcription factor family were found to interact with other proteins in rice using RiceFRIEND online database analysis. Therefore, our results preliminarily identified the evolution, chromosome distribution and replication, expression patterns, phytohormones response of the Trihelix transcription factor family and the interaction between trihelix family proteins and other proteins in rice, which will provide a basis to further reveal the molecular evolution

  13. Genome-Wide Characterization and Expression Profiles of the Superoxide Dismutase Gene Family in Gossypium.

    PubMed

    Zhang, Jingbo; Li, Bo; Yang, Yang; Hu, Wenran; Chen, Fangyuan; Xie, Lixia; Fan, Ling

    2016-01-01

    Superoxide dismutase (SOD) as a group of significant and ubiquitous enzymes plays a critical function in plant growth and development. Previously this gene family has been investigated in Arabidopsis and rice; it has not yet been characterized in cotton. In our study, it was the first time for us to perform a genome-wide analysis of SOD gene family in cotton. Our results showed that 10 genes of SOD gene family were identified in Gossypium arboreum and Gossypium raimondii, including 6 Cu-Zn-SODs, 2 Fe-SODs, and 2 Mn-SODs. The chromosomal distribution analysis revealed that SOD genes are distributed across 7 chromosomes in Gossypium arboreum and 8 chromosomes in Gossypium raimondii. Segmental duplication is predominant duplication event and major contributor for expansion of SOD gene family. Gene structure and protein structure analysis showed that SOD genes have conserved exon/intron arrangement and motif composition. Microarray-based expression analysis revealed that SOD genes have important function in abiotic stress. Moreover, the tissue-specific expression profile reveals the functional divergence of SOD genes in different organs development of cotton. Taken together, this study has imparted new insights into the putative functions of SOD gene family in cotton. Findings of the present investigation could help in understanding the role of SOD gene family in various aspects of the life cycle of cotton. PMID:27660755

  14. Genome-Wide Characterization and Expression Profiles of the Superoxide Dismutase Gene Family in Gossypium

    PubMed Central

    Zhang, Jingbo; Li, Bo; Yang, Yang; Hu, Wenran; Chen, Fangyuan; Xie, Lixia

    2016-01-01

    Superoxide dismutase (SOD) as a group of significant and ubiquitous enzymes plays a critical function in plant growth and development. Previously this gene family has been investigated in Arabidopsis and rice; it has not yet been characterized in cotton. In our study, it was the first time for us to perform a genome-wide analysis of SOD gene family in cotton. Our results showed that 10 genes of SOD gene family were identified in Gossypium arboreum and Gossypium raimondii, including 6 Cu-Zn-SODs, 2 Fe-SODs, and 2 Mn-SODs. The chromosomal distribution analysis revealed that SOD genes are distributed across 7 chromosomes in Gossypium arboreum and 8 chromosomes in Gossypium raimondii. Segmental duplication is predominant duplication event and major contributor for expansion of SOD gene family. Gene structure and protein structure analysis showed that SOD genes have conserved exon/intron arrangement and motif composition. Microarray-based expression analysis revealed that SOD genes have important function in abiotic stress. Moreover, the tissue-specific expression profile reveals the functional divergence of SOD genes in different organs development of cotton. Taken together, this study has imparted new insights into the putative functions of SOD gene family in cotton. Findings of the present investigation could help in understanding the role of SOD gene family in various aspects of the life cycle of cotton.

  15. Genome-Wide Characterization and Expression Profiles of the Superoxide Dismutase Gene Family in Gossypium

    PubMed Central

    Zhang, Jingbo; Li, Bo; Yang, Yang; Hu, Wenran; Chen, Fangyuan; Xie, Lixia

    2016-01-01

    Superoxide dismutase (SOD) as a group of significant and ubiquitous enzymes plays a critical function in plant growth and development. Previously this gene family has been investigated in Arabidopsis and rice; it has not yet been characterized in cotton. In our study, it was the first time for us to perform a genome-wide analysis of SOD gene family in cotton. Our results showed that 10 genes of SOD gene family were identified in Gossypium arboreum and Gossypium raimondii, including 6 Cu-Zn-SODs, 2 Fe-SODs, and 2 Mn-SODs. The chromosomal distribution analysis revealed that SOD genes are distributed across 7 chromosomes in Gossypium arboreum and 8 chromosomes in Gossypium raimondii. Segmental duplication is predominant duplication event and major contributor for expansion of SOD gene family. Gene structure and protein structure analysis showed that SOD genes have conserved exon/intron arrangement and motif composition. Microarray-based expression analysis revealed that SOD genes have important function in abiotic stress. Moreover, the tissue-specific expression profile reveals the functional divergence of SOD genes in different organs development of cotton. Taken together, this study has imparted new insights into the putative functions of SOD gene family in cotton. Findings of the present investigation could help in understanding the role of SOD gene family in various aspects of the life cycle of cotton. PMID:27660755

  16. Genome-wide identification and expression analysis of MAPK and MAPKK gene family in Malus domestica.

    PubMed

    Zhang, Shizhong; Xu, Ruirui; Luo, Xiaocui; Jiang, Zesheng; Shu, Huairui

    2013-12-01

    MAPK signal transduction modules play crucial roles in regulating many biological processes in plants, which are composed of three classes of hierarchically organized protein kinases, namely MAPKKKs, MAPKKs, and MAPKs. Although genome-wide analysis of this family has been carried out in some species, little is known about MAPK and MAPKK genes in apple (Malus domestica). In this study, a total of 26 putative apple MAPK genes (MdMPKs) and 9 putative apple MAPKK genes (MdMKKs) have been identified and located within the apple genome. Phylogenetic analysis revealed that MdMAPKs and MdMAPKKs could be divided into 4 subfamilies (groups A, B, C and D), respectively. The predicted MdMAPKs and MdMAPKKs were distributed across 13 out of 17 chromosomes with different densities. In addition, analysis of exon-intron junctions and of intron phase inside the predicted coding region of each candidate gene has revealed high levels of conservation within and between phylogenetic groups. According to the microarray and expressed sequence tag (EST) analysis, the different expression patterns indicate that they may play different roles during fruit development and rootstock-scion interaction process. Moreover, MAPK and MAPKK genes were performed expression profile analyses in different tissues (root, stem, leaf, flower and fruit), and all of the selected genes were expressed in at least one of the tissues tested, indicating that the MAPKs and MAPKKs are involved in various aspects of physiological and developmental processes of apple. To our knowledge, this is the first report of a genome-wide analysis of the apple MAPK and MAPKK gene family. This study provides valuable information for understanding the classification and putative functions of the MAPK signal in apple.

  17. Genome-wide identification and phylogenetic analysis of the SBP-box gene family in melons.

    PubMed

    Ma, Y; Guo, J W; Bade, R; Men, Z H; Hasi, A

    2014-10-27

    The SBP-box gene family is specific to plants and encodes a class of zinc finger-containing transcription factors with a broad range of functions. Although SBP-box genes have been identified in numerous plants, including green algae, moss, silver birch, snapdragon, Arabidopsis, rice, and maize, there is little information concerning SBP-box genes, or the corresponding miR156/157, function in melon. Using the highly conserved sequence of the Arabidopsis thaliana SBP-box domain protein as a probe of information sequence, the genome-wide protein database of melon was explored to obtain 13 SBP-box protein sequences, which were further divided into 4 groups, based on phylogenetic analysis. A further analysis centered on the melon SBP-box genetic family's phylogenetic evolution, sequence similarities, gene structure, and miR156 target sequence was also conducted. Analysis of all the expression patterns of melon SBP-box family genes showed that the SBP-box genes were detected in 7 kinds of tissue, and fruit had the highest expression level. CmSBP11 tends to present its specific expression in melon fruit and root. CmSBP09 expression was the highest in flower. Overall, the molecular evolution and expression pattern of the melon SBP-box gene family, revealed by these results, suggest its function differentiation that followed gene duplication.

  18. Genome-wide identification and comparison of legume MLO gene family.

    PubMed

    Rispail, Nicolas; Rubiales, Diego

    2016-01-01

    MLO proteins are highly conserved proteins with seven trans-membrane domains. Specific MLO genes have been linked to plant disease susceptibility. Others are involved in plant reproduction and in root thigmomorphogenesis. Functions of the remaining MLOs are still unknown. Here we performed a genome-wide survey of the MLO family in eight legume species from different clades of the Papillionoideae sub-family. A total of 118 MLO sequences were identified and characterized. Their deduced protein sequences shared the characteristics of MLO proteins. The total number of MLO genes per legume species varied from 13 to 20 depending on the species. Legume MLOs were evenly distributed over their genomes and tended to localize within syntenic blocks conserved across legume genomes. Phylogenetic analysis indicated that these sequences clustered in seven well-defined clades. Comparison of MLO protein sequences revealed 34 clade-specific motifs in the variable regions of the proteins. Comparative analyses of the MLO family between legume species also uncovered several evolutionary differences between the tropical legume species from the Phaseoloid clades and the other legume species. Altogether, this study provides interesting new features on the evolution of the MLO family. It also provides valuable clues to identify additional MLO genes from non-sequenced species. PMID:27596925

  19. Genome-wide identification and comparison of legume MLO gene family

    PubMed Central

    Rispail, Nicolas; Rubiales, Diego

    2016-01-01

    MLO proteins are highly conserved proteins with seven trans-membrane domains. Specific MLO genes have been linked to plant disease susceptibility. Others are involved in plant reproduction and in root thigmomorphogenesis. Functions of the remaining MLOs are still unknown. Here we performed a genome-wide survey of the MLO family in eight legume species from different clades of the Papillionoideae sub-family. A total of 118 MLO sequences were identified and characterized. Their deduced protein sequences shared the characteristics of MLO proteins. The total number of MLO genes per legume species varied from 13 to 20 depending on the species. Legume MLOs were evenly distributed over their genomes and tended to localize within syntenic blocks conserved across legume genomes. Phylogenetic analysis indicated that these sequences clustered in seven well-defined clades. Comparison of MLO protein sequences revealed 34 clade-specific motifs in the variable regions of the proteins. Comparative analyses of the MLO family between legume species also uncovered several evolutionary differences between the tropical legume species from the Phaseoloid clades and the other legume species. Altogether, this study provides interesting new features on the evolution of the MLO family. It also provides valuable clues to identify additional MLO genes from non-sequenced species. PMID:27596925

  20. Genome-wide Linkage Analysis of Carotid Artery Lumen Diameter: The Strong Heart Family Study

    PubMed Central

    Bella, Jonathan N.; Cole, Shelley A.; Laston, Sandy; Almasy, Laura; Comuzzie, Anthony; Lee, Elisa T.; Best, Lyle G.; Fabsitz, Richard R.; Howard, Barbara V.; MacCluer, Jean W.; Roman, Mary J.; Devereux, Richard B.; Göring, Harald H.H.

    2014-01-01

    Background A significant proportion of the variability in carotid artery lumen diameter is attributable to genetic factors. Methods Carotid ultrasonography and genotyping were performed in the 3,300 American Indian participants in the Strong Heart Family Study (SHFS) to identify chromosomal regions harboring novel genes associated with inter-individual variation in carotid artery lumen diameter. Genome-wide linkage analysis was conducted using standard variance component linkage methods, implemented in SOLAR, based on multipoint identity-by-descent matrices. Results Genome-wide linkage analysis revealed a significant evidence for linkage for a locus for left carotid artery diastolic and systolic lumen diameter in Arizona SHFS participants on chromosome 7 at 120 cM (lod=4.85 and 3.77, respectively, after sex and age adjustment, and lod=3.12 and 2.72, respectively, after adjustment for sex, age, height, weight, systolic and diastolic blood pressure, diabetes mellitus and current smoking). Other regions with suggestive evidence of linkage for left carotid artery diastolic and systolic lumen diameter was found on chromosome 12 at 153 cM (lod=2.20 and 2.60, respectively, after sex and age adjustment, and lod=2.44 and 2.16, respectively, after full covariate adjustment) in Oklahoma SHFS participants; suggestive linkage for right carotid artery diastolic and systolic lumen diameter was found on chromosome 9 at 154 cM (lod=2.72 and 3.19, respectively after sex and age adjustment, and lod=2.36 and 2.21, respectively, after full covariate adjustment) in Oklahoma SHFS participants. Conclusion We found significant evidence for loci influencing carotid artery lumen diameter on chromosome 7q and suggestive linkage on chromosomes 12q and 9q. PMID:23871337

  1. A genome-wide search for linkage to allergic rhinitis in Danish sib-pair families.

    PubMed

    Kruse, Lisbeth Venø; Nyegaard, Mette; Christensen, Ulla; Møller-Larsen, Steffen; Haagerup, Annette; Deleuran, Mette; Hansen, Lars Gudmund; Venø, Stine Krogh; Goossens, Dirk; Del-Favero, Jurgen; Børglum, Anders Dupont

    2012-09-01

    Allergic rhinitis (AR) is a complex disorder with a polygenic, multifactorial aetiology. Twin studies have found the genetic contribution to be substantial. We collected and clinically characterised a sample consisting of 127 Danish nuclear families with at least two siblings suffering from AR or allergic conjunctivitis including 540 individuals (286 children and 254 parents). A whole-genome linkage scan, using 424 microsatellite markers, was performed on both this sample and an earlier collected sample consisting of 130 families with atopic dermatitis and other atopic disorders. A third sib-pair family sample, which was previously collected and genotyped, was added to the analysis increasing the total sample size to 357 families consisting of 1508 individuals. In total, 190 families with AR was included. The linkage analysis software Genehunter NPL, Genehunter MOD, and Genehunter Imprinting were used to obtain nonparametric and parametric linkage results. Family-based association analysis of positional candidate SNPs was carried out using the FBAT program. We obtained genome-wide significant linkage to a novel AR locus at 1p13 and suggestive linkage to two novel regions at 1q31-q32 and 20p12, respectively. Family-based association analysis of SNPs in the candidate locus DNND1B/CRB1 at 1q31 showed no significant association and could not explain the linkage signal observed. Suggestive evidence of linkage was also obtained at three AR loci previously reported (2q14-q23, 2q23, and 12p13) and indication of linkage was observed at a number of additional loci. Likely maternal imprinting was observed at 2q23, and possible maternal imprinting at 3q28. PMID:22419170

  2. Genome-Wide Identification and Expression Analysis of WRKY Gene Family in Capsicum annuum L.

    PubMed Central

    Diao, Wei-Ping; Snyder, John C.; Wang, Shu-Bin; Liu, Jin-Bing; Pan, Bao-Gui; Guo, Guang-Jun; Wei, Ge

    2016-01-01

    The WRKY family of transcription factors is one of the most important families of plant transcriptional regulators with members regulating multiple biological processes, especially in regulating defense against biotic and abiotic stresses. However, little information is available about WRKYs in pepper (Capsicum annuum L.). The recent release of completely assembled genome sequences of pepper allowed us to perform a genome-wide investigation for pepper WRKY proteins. In the present study, a total of 71 WRKY genes were identified in the pepper genome. According to structural features of their encoded proteins, the pepper WRKY genes (CaWRKY) were classified into three main groups, with the second group further divided into five subgroups. Genome mapping analysis revealed that CaWRKY were enriched on four chromosomes, especially on chromosome 1, and 15.5% of the family members were tandemly duplicated genes. A phylogenetic tree was constructed depending on WRKY domain' sequences derived from pepper and Arabidopsis. The expression of 21 selected CaWRKY genes in response to seven different biotic and abiotic stresses (salt, heat shock, drought, Phytophtora capsici, SA, MeJA, and ABA) was evaluated by quantitative RT-PCR; Some CaWRKYs were highly expressed and up-regulated by stress treatment. Our results will provide a platform for functional identification and molecular breeding studies of WRKY genes in pepper. PMID:26941768

  3. Genome-Wide Identification and Expression of Xenopus F-Box Family of Proteins

    PubMed Central

    Saritas-Yildirim, Banu; Pliner, Hannah A.; Ochoa, Angelica; Silva, Elena M.

    2015-01-01

    Protein degradation via the multistep ubiquitin/26S proteasome pathway is a rapid way to alter the protein profile and drive cell processes and developmental changes. Many key regulators of embryonic development are targeted for degradation by E3 ubiquitin ligases. The most studied family of E3 ubiquitin ligases is the SCF ubiquitin ligases, which use F-box adaptor proteins to recognize and recruit target proteins. Here, we used a bioinformatics screen and phylogenetic analysis to identify and annotate the family of F-box proteins in the Xenopus tropicalis genome. To shed light on the function of the F-box proteins, we analyzed expression of F-box genes during early stages of Xenopus development. Many F-box genes are broadly expressed with expression domains localized to diverse tissues including brain, spinal cord, eye, neural crest derivatives, somites, kidneys, and heart. All together, our genome-wide identification and expression profiling of the Xenopus F-box family of proteins provide a foundation for future research aimed to identify the precise role of F-box dependent E3 ubiquitin ligases and their targets in the regulatory circuits of development. PMID:26327321

  4. Genome-wide analysis and expression profiling of the phospholipase D gene family in Gossypium arboreum.

    PubMed

    Tang, Kai; Dong, Chunjuan; Liu, Jinyuan

    2016-02-01

    The plant phospholipase D (PLD) plays versatile functions in multiple aspects of plant growth, development, and stress responses. However, until now, our knowledge concerning the PLD gene family members and their expression patterns in cotton has been limited. In this study, we performed for the first time the genome-wide analysis and expression profiling of PLD gene family in Gossypium arboretum, and finally, a total of 19 non-redundant PLD genes (GaPLDs) were identified. Based on the phylogenetic analysis, they were divided into six well-supported clades (α, β/γ, δ, ε, ζ and φ). Most of the GaPLD genes within the same clade showed the similar exon-intron organization and highly conserved motif structures. Additionally, the chromosomal distribution pattern revealed that GaPLD genes were unevenly distributed across 10 of the 13 cotton chromosomes. Segmental duplication is the major contributor to the expansion of GaPLD gene family and estimated to have occurred from 19.61 to 20.44 million years ago when a recent large-scale genome duplication occurred in cotton. Moreover, the expression profiling provides the functional divergence of GaPLD genes in cotton and provides some new light on the molecular mechanisms of GaPLDα1 and GaPLDδ2 in fiber development. PMID:26718354

  5. Family-Based Genome-Wide Association Scan of Attention-Deficit/Hyperactivity Disorder

    ERIC Educational Resources Information Center

    Mick, Eric; Todorov, Alexandre; Smalley, Susan; Hu, Xiaolan; Loo, Sandra; Todd, Richard D.; Biederman, Joseph; Byrne, Deirdre; Dechairo, Bryan; Guiney, Allan; McCracken, James; McGough, James; Nelson, Stanley F.; Reiersen, Angela M.; Wilens, Timothy E.; Wozniak, Janet; Neale, Benjamin M.; Faraone, Stephen V.

    2010-01-01

    Objective: Genes likely play a substantial role in the etiology of attention-deficit/hyperactivity disorder (ADHD). However, the genetic architecture of the disorder is unknown, and prior genome-wide association studies (GWAS) have not identified a genome-wide significant association. We have conducted a third, independent, multisite GWAS of…

  6. Genome-wide identification of the expansin gene family in tobacco (Nicotiana tabacum).

    PubMed

    Ding, Anming; Marowa, Prince; Kong, Yingzhen

    2016-10-01

    Expansins are pH-dependent cell wall loosening proteins which form a large family in plants. They have been shown to be involved in various developmental processes and been implicated in enabling plants' ability to absorb nutrients from the soil as well as conferring biotic and abiotic stress resistances. It is therefore clear that they can be potential targets in genetic engineering for crop improvement. Tobacco (Nicotiana tabacum) is a major crop species as well as a model organism. Considering that only a few tobacco expansins have been studied, a genome-wide analysis of the tobacco expansin gene family is necessary. In this study, we identified 52 expansins in tobacco, which were classified into four subfamilies: 36 NtEXPAs, 6 NtEXPBs, 3 NtEXLAs and 7 NtEXLBs. Compared to other species, the NtEXLB subfamily size was relatively larger. Phylogenetic analysis showed that the 52 tobacco expansins were divided into 13 subgroups. Gene structure analysis revealed that genes within subfamilies/subgroups exhibited similar characteristics such as gene structure and protein motif arrangement. Whole-genome duplication and tandem duplication events may have played important roles in the expanding of tobacco expansins. Cis-Acting element analysis revealed that each expansin gene was regulated or several expansin genes were co-regulated by both internal and environmental factors. 35 of these genes were identified as being expressed according to a microarray analysis. In contrast to most NtEXPAs which had higher expression levels in young organs, NtEXLAs and NtEXLBs were preferentially expressed in mature or senescent tissues, suggesting that they might play different roles in different organs or at different developmental stages. As the first step towards genome-wide analysis of the tobacco expansin gene family, our work provides solid background information related to structure, evolution and expression as well as regulatory cis-acting elements of the tobacco expansins. This

  7. Genome-wide identification of the expansin gene family in tobacco (Nicotiana tabacum).

    PubMed

    Ding, Anming; Marowa, Prince; Kong, Yingzhen

    2016-10-01

    Expansins are pH-dependent cell wall loosening proteins which form a large family in plants. They have been shown to be involved in various developmental processes and been implicated in enabling plants' ability to absorb nutrients from the soil as well as conferring biotic and abiotic stress resistances. It is therefore clear that they can be potential targets in genetic engineering for crop improvement. Tobacco (Nicotiana tabacum) is a major crop species as well as a model organism. Considering that only a few tobacco expansins have been studied, a genome-wide analysis of the tobacco expansin gene family is necessary. In this study, we identified 52 expansins in tobacco, which were classified into four subfamilies: 36 NtEXPAs, 6 NtEXPBs, 3 NtEXLAs and 7 NtEXLBs. Compared to other species, the NtEXLB subfamily size was relatively larger. Phylogenetic analysis showed that the 52 tobacco expansins were divided into 13 subgroups. Gene structure analysis revealed that genes within subfamilies/subgroups exhibited similar characteristics such as gene structure and protein motif arrangement. Whole-genome duplication and tandem duplication events may have played important roles in the expanding of tobacco expansins. Cis-Acting element analysis revealed that each expansin gene was regulated or several expansin genes were co-regulated by both internal and environmental factors. 35 of these genes were identified as being expressed according to a microarray analysis. In contrast to most NtEXPAs which had higher expression levels in young organs, NtEXLAs and NtEXLBs were preferentially expressed in mature or senescent tissues, suggesting that they might play different roles in different organs or at different developmental stages. As the first step towards genome-wide analysis of the tobacco expansin gene family, our work provides solid background information related to structure, evolution and expression as well as regulatory cis-acting elements of the tobacco expansins. This

  8. Genome-Wide Identification, Characterization and Expression Profiling of ADF Family Genes in Solanum lycopersicum L.

    PubMed Central

    Khatun, Khadiza; Robin, Arif Hasan Khan; Park, Jong-In; Kim, Chang Kil; Lim, Ki-Byung; Kim, Min-Bae; Lee, Do-Jin; Nou, Ill Sup; Chung, Mi-Young

    2016-01-01

    The actin depolymerizing factor (ADF) proteins have growth, development, defense-related and growth regulatory functions in plants. The present study used genome-wide analysis to investigate ADF family genes in tomato. Eleven tomato ADF genes were identified and differential expression patterns were found in different organs. SlADF6 was preferentially expressed in roots, suggesting its function in root development. SlADF1, SlADF3 and SlADF10 were predominately expressed in the flowers compared to the other organs and specifically in the stamen compared to other flower parts, indicating their potential roles in pollen development. The comparatively higher expression of SlADF3 and SlADF11 at early fruit developmental stages might implicate them in determining final fruit size. SlADF5 and SlADF8 had relatively higher levels of expression five days after the breaker stage of fruit development, suggesting their possible role in fruit ripening. Notably, six genes were induced by cold and heat, seven by drought, five by NaCl, and four each by abscisic acid (ABA), jasmonic acid (JA) and wounding treatments. The differential expression patterns of the SlADF genes under different types of stresses suggested their function in stress tolerance in tomato plants. Our results will be helpful for the functional characterization of ADF genes during organ and fruit development of tomato under different stresses. PMID:27690110

  9. Genome-Wide Identification and Characterization of WRKY Gene Family in Peanut

    PubMed Central

    Song, Hui; Wang, Pengfei; Lin, Jer-Young; Zhao, Chuanzhi; Bi, Yuping; Wang, Xingjun

    2016-01-01

    WRKY, an important transcription factor family, is widely distributed in the plant kingdom. Many reports focused on analysis of phylogenetic relationship and biological function of WRKY protein at the whole genome level in different plant species. However, little is known about WRKY proteins in the genome of Arachis species and their response to salicylic acid (SA) and jasmonic acid (JA) treatment. In this study, we identified 77 and 75 WRKY proteins from the two wild ancestral diploid genomes of cultivated tetraploid peanut, Arachis duranensis and Arachis ipaënsis, using bioinformatics approaches. Most peanut WRKY coding genes were located on A. duranensis chromosome A6 and A. ipaënsis chromosome B3, while the least number of WRKY genes was found in chromosome 9. The WRKY orthologous gene pairs in A. duranensis and A. ipaënsis chromosomes were highly syntenic. Our analysis indicated that segmental duplication events played a major role in AdWRKY and AiWRKY genes, and strong purifying selection was observed in gene duplication pairs. Furthermore, we translate the knowledge gained from the genome-wide analysis result of wild ancestral peanut to cultivated peanut to reveal that gene activities of specific cultivated peanut WRKY gene were changed due to SA and JA treatment. Peanut WRKY7, 8 and 13 genes were down-regulated, whereas WRKY1 and 12 genes were up-regulated with SA and JA treatment. These results could provide valuable information for peanut improvement. PMID:27200012

  10. Genome-wide analysis of the omega-3 fatty acid desaturase gene family in Gossypium

    DOE PAGES

    Yurchenko, Olga P.; Park, Sunjung; Ilut, Daniel C.; Inmon, Jay J.; Millhollon, Jon C.; Liechty, Zach; Page, Justin T.; Jenks, Matthew A.; Chapman, Kent D.; Udall, Joshua A.; et al

    2014-11-18

    The majority of commercial cotton varieties planted worldwide are derived from Gossypium hirsutum, which is a naturally occurring allotetraploid produced by interspecific hybridization of A- and D-genome diploid progenitor species. While most cotton species are adapted to warm, semi-arid tropical and subtropical regions, and thus perform well in these geographical areas, cotton seedlings are sensitive to cold temperature, which can significantly reduce crop yields. One of the common biochemical responses of plants to cold temperatures is an increase in omega-3 fatty acids, which protects cellular function by maintaining membrane integrity. The purpose of our study was to identify and characterizemore » the omega-3 fatty acid desaturase (FAD) gene family in G. hirsutum, with an emphasis on identifying omega-3 FADs involved in cold temperature adaptation. Results: Eleven omega-3 FAD genes were identified in G. hirsutum, and characterization of the gene family in extant A and D diploid species (G. herbaceum and G. raimondii, respectively) allowed for unambiguous genome assignment of all homoeologs in tetraploid G. hirsutum. The omega-3 FAD family of cotton includes five distinct genes, two of which encode endoplasmic reticulum-type enzymes (FAD3-1 and FAD3-2) and three that encode chloroplast-type enzymes (FAD7/8-1, FAD7/8-2, and FAD7/8-3). The FAD3-2 gene was duplicated in the A genome progenitor species after the evolutionary split from the D progenitor, but before the interspecific hybridization event that gave rise to modern tetraploid cotton. RNA-seq analysis revealed conserved, gene-specific expression patterns in various organs and cell types and semi-quantitative RT-PCR further revealed that FAD7/8-1 was specifically induced during cold temperature treatment of G. hirsutum seedlings. Conclusions: The omega-3 FAD gene family in cotton was characterized at the genome-wide level in three species, showing relatively ancient establishment of the gene family prior

  11. Genome-Wide Identification, Characterization and Expression Analysis of the TCP Gene Family in Prunus mume

    PubMed Central

    Zhou, Yuzhen; Xu, Zongda; Zhao, Kai; Yang, Weiru; Cheng, Tangren; Wang, Jia; Zhang, Qixiang

    2016-01-01

    TCP proteins, belonging to a plant-specific transcription factors family, are known to have great functions in plant development, especially flower and leaf development. However, there is little information about this gene family in Prunus mume, which is widely cultivated in China as an ornamental and fruit tree. Here a genome-wide analysis of TCP genes was performed to explore their evolution in P. mume. Nineteen PmTCPs were identified and three of them contained putative miR319 target sites. Phylogenetic and comprehensive bioinformatics analyses of these genes revealed that different types of TCP genes had undergone different evolutionary processes and the genes in the same clade had similar chromosomal location, gene structure, and conserved domains. Expression analysis of these PmTCPs indicated that there were diverse expression patterns among different clades. Most TCP genes were predominantly expressed in flower, leaf, and stem, and showed high expression levels in the different stages of flower bud differentiation, especially in petal formation stage and gametophyte development. Genes in TCP-P subfamily had main roles in both flower development and gametophyte development. The CIN genes in double petal cultivars might have key roles in the formation of petal, while they were correlated with gametophyte development in the single petal cultivar. The CYC/TB1 type genes were highly detected in the formation of petal and pistil. The less-complex flower types of P. mume might result from the fact that there were only two CYC type genes present in P. mume and a lack of CYC2 genes to control the identity of flower types. These results lay the foundation for further study on the functions of TCP genes during flower development.

  12. Genome-Wide Identification, Characterization and Expression Analysis of the TCP Gene Family in Prunus mume

    PubMed Central

    Zhou, Yuzhen; Xu, Zongda; Zhao, Kai; Yang, Weiru; Cheng, Tangren; Wang, Jia; Zhang, Qixiang

    2016-01-01

    TCP proteins, belonging to a plant-specific transcription factors family, are known to have great functions in plant development, especially flower and leaf development. However, there is little information about this gene family in Prunus mume, which is widely cultivated in China as an ornamental and fruit tree. Here a genome-wide analysis of TCP genes was performed to explore their evolution in P. mume. Nineteen PmTCPs were identified and three of them contained putative miR319 target sites. Phylogenetic and comprehensive bioinformatics analyses of these genes revealed that different types of TCP genes had undergone different evolutionary processes and the genes in the same clade had similar chromosomal location, gene structure, and conserved domains. Expression analysis of these PmTCPs indicated that there were diverse expression patterns among different clades. Most TCP genes were predominantly expressed in flower, leaf, and stem, and showed high expression levels in the different stages of flower bud differentiation, especially in petal formation stage and gametophyte development. Genes in TCP-P subfamily had main roles in both flower development and gametophyte development. The CIN genes in double petal cultivars might have key roles in the formation of petal, while they were correlated with gametophyte development in the single petal cultivar. The CYC/TB1 type genes were highly detected in the formation of petal and pistil. The less-complex flower types of P. mume might result from the fact that there were only two CYC type genes present in P. mume and a lack of CYC2 genes to control the identity of flower types. These results lay the foundation for further study on the functions of TCP genes during flower development. PMID:27630648

  13. Genome-wide analysis of SAUR gene family in Solanaceae species.

    PubMed

    Wu, Jian; Liu, Songyu; He, Yanjun; Guan, Xiaoyan; Zhu, Xiangfei; Cheng, Lin; Wang, Jie; Lu, Gang

    2012-11-01

    The plant hormone auxin plays a vital role in regulating many aspects of plant growth and development. Small auxin up-regulated RNAs (SAURs) are primary auxin response genes hypothesized to be involved in auxin signaling pathway, but their functions remain unclear. Here, a genome-wide search for SAUR gene homologues in Solanaceae species identified 99 and 134 members of SAUR gene family from tomato and potato, respectively. Phylogenetic analysis indicated that the SAUR proteins from Arabidopsis, rice, sorghum, tomato and potato were divided into four major groups with 16 subgroups. Among them, 25 histidine-rich SAURs genes with metal-binding characteristics were found in Arabidopsis, sorghum and Solanaceae species, but not in rice. Using tomato as a model, a comprehensive overview of SAUR gene family is presented, including the gene structures, phylogeny and chromosome locations. Quantitative real-time PCR analysis indicated that 11 randomly selected SlSAUR genes in tomato could be expressed at least in one of the tomato organs/tissues tested. However, different SlSAUR genes displayed distinctive expression levels. SlSAUR16 and SlSAUR71 exhibited highly tissue-specific expression patterns. Almost all of the detected SlSAURs showed an accumulating pattern of mRNA along tomato flower and fruit development. Some of them displayed differential response to exogenous IAA treatment. The abiotic (cold, salt and drought) stresses significantly modified transcript levels of SlSAURs genes. Most of them were down-regulated in response to abiotic stresses (drought, heat and salinity), but SlSAUR58, as a histidine-rich SAUR gene, was up-regulated after salt treatment, indicating that it may play a specific role in the salt signaling transduction pathway. Our comparative analysis provides some basic genomic information for the SAUR genes in the Solanaceae species and will pave the way for deciphering their function during plant development.

  14. Genome-Wide Identification, Characterization and Expression Analysis of the TCP Gene Family in Prunus mume.

    PubMed

    Zhou, Yuzhen; Xu, Zongda; Zhao, Kai; Yang, Weiru; Cheng, Tangren; Wang, Jia; Zhang, Qixiang

    2016-01-01

    TCP proteins, belonging to a plant-specific transcription factors family, are known to have great functions in plant development, especially flower and leaf development. However, there is little information about this gene family in Prunus mume, which is widely cultivated in China as an ornamental and fruit tree. Here a genome-wide analysis of TCP genes was performed to explore their evolution in P. mume. Nineteen PmTCPs were identified and three of them contained putative miR319 target sites. Phylogenetic and comprehensive bioinformatics analyses of these genes revealed that different types of TCP genes had undergone different evolutionary processes and the genes in the same clade had similar chromosomal location, gene structure, and conserved domains. Expression analysis of these PmTCPs indicated that there were diverse expression patterns among different clades. Most TCP genes were predominantly expressed in flower, leaf, and stem, and showed high expression levels in the different stages of flower bud differentiation, especially in petal formation stage and gametophyte development. Genes in TCP-P subfamily had main roles in both flower development and gametophyte development. The CIN genes in double petal cultivars might have key roles in the formation of petal, while they were correlated with gametophyte development in the single petal cultivar. The CYC/TB1 type genes were highly detected in the formation of petal and pistil. The less-complex flower types of P. mume might result from the fact that there were only two CYC type genes present in P. mume and a lack of CYC2 genes to control the identity of flower types. These results lay the foundation for further study on the functions of TCP genes during flower development. PMID:27630648

  15. Genome-wide characterization of phenylalanine ammonia-lyase gene family in watermelon (Citrullus lanatus).

    PubMed

    Dong, Chun-Juan; Shang, Qing-Mao

    2013-07-01

    Phenylalanine ammonia-lyase (PAL), the first enzyme in the phenylpropanoid pathway, plays a critical role in plant growth, development, and adaptation. PAL enzymes are encoded by a gene family in plants. Here, we report a genome-wide search for PAL genes in watermelon. A total of 12 PAL genes, designated ClPAL1-12, are identified . Nine are arranged in tandem in two duplication blocks located on chromosomes 4 and 7, and the other three ClPAL genes are distributed as single copies on chromosomes 2, 3, and 8. Both the cDNA and protein sequences of ClPALs share an overall high identity with each other. A phylogenetic analysis places 11 of the ClPALs into a separate cucurbit subclade, whereas ClPAL2, which belongs to neither monocots nor dicots, may serve as an ancestral PAL in plants. In the cucurbit subclade, seven ClPALs form homologous pairs with their counterparts from cucumber. Expression profiling reveals that 11 of the ClPAL genes are expressed and show preferential expression in the stems and male and female flowers. Six of the 12 ClPALs are moderately or strongly expressed in the fruits, particularly in the pulp, suggesting the potential roles of PAL in the development of fruit color and flavor. A promoter motif analysis of the ClPAL genes implies redundant but distinctive cis-regulatory structures for stress responsiveness. Finally, duplication events during the evolution and expansion of the ClPAL gene family are discussed, and the relationships between the ClPAL genes and their cucumber orthologs are estimated.

  16. Genome-wide identification and expression analysis of the expansin gene family in tomato.

    PubMed

    Lu, Yongen; Liu, Lifeng; Wang, Xin; Han, Zhihui; Ouyang, Bo; Zhang, Junhong; Li, Hanxia

    2016-04-01

    Plant expansins are capable of inducing pH-dependent cell wall extension and stress relaxation. They may be useful as targets for crop improvement to enhance fruit development and stress resistance. Tomato is a major agricultural crop and a model plant for studying fruit development. Because only some tomato expansins have been studied, a genome-wide analysis of the tomato expansin family is necessary. In this study, we identified 25 SlEXPAs, eight SlEXPBs, one SlEXLA, four SlEXLBs, and five short homologs in the tomato genome. 25 of these genes were identified as being expressed. Bioinformatic analysis showed that although tomato expansins share similarities with those from other plants, they also exhibit specific features regarding genetic structure and amino acid sequences, which indicates a unique evolutionary process. Segmental and tandem duplication events have played important roles in expanding the tomato expansin family. Additionally, the 3-exon/2-intron structure may form the basic organization of expansin genes. We identified new expansin genes preferentially expressed in fruits (SlEXPA8, SlEXPB8, and SlEXLB1), roots (SlEXPA9, SlEXLB2, and SlEXLB4), and floral organs. Among the analyzed genes those that were inducible by hormone or stress treatments, including SlEXPA3, SlEXPA7, SlEXPB1-B2, SlEXPB8, SlEXLB1-LB2, and SlEXLB4. Our findings may further clarify the biological activities of tomato expansins, especially those related to fruit development and stress resistance, and contribute to the genetic modification of tomato plants to improve crop quality and yield.

  17. Genome-wide linkage analysis for loci affecting pulse pressure: the Family Blood Pressure Program.

    PubMed

    Bielinski, Suzette J; Lynch, Amy I; Miller, Michael B; Weder, Alan; Cooper, Richard; Oberman, Albert; Chen, Yii-Der Ida; Turner, Stephen T; Fornage, Myriam; Province, Michael; Arnett, Donna K

    2005-12-01

    Pulse pressure, the difference between systolic and diastolic blood pressure, is an independent risk factor for cardiovascular disease. Increased pulse pressure reflects reduced compliance of arteries and is a marker of atherosclerosis. To locate genes that affect pulse pressure, a genome-wide linkage scan for quantitative trait loci influencing pulse pressure was performed using variance components methods as implemented in sequential oligogenic linkage analysis routines. The analysis sample included 10 798 participants in 3320 families who were recruited as part of the Family Blood Pressure Program and were phenotyped with an oscillometric blood pressure measurement device using a consistent protocol across centers. Pulse pressure was adjusted for the effects of sex, age, age2, age-by-sex interaction, age2-by-sex interaction, body mass index, and field center to remove sources of variation other than the genetic effects related to pulse pressure. Significant linkage was observed on chromosome 18 (logarithm of odds [LOD]=3.2) in a combined racial sample, chromosome 20 (LOD=4.4), and 17 (LOD=3.6) in Hispanics, chromosome 21 (LOD=4.3) in whites, chromosome 19 (LOD=3.1) in a combined sample of blacks and whites, and chromosome 7 (logarithm of odds [LOD]=3.1) in blacks from the GenNet Network. Our genome scan shows significant evidence for linkage for pulse pressure in multiple areas of the genome, supporting previous published linkage studies. The identification of these loci for pulse pressure and the apparent congruence with other blood pressure phenotypes provide increased support that these regions contain genes influencing blood pressure phenotypes.

  18. Genome-wide analysis of histone modifications by ChIP-chip to identify silenced genes in gastric cancer.

    PubMed

    Zhu, Xinjiang; Liu, Jian; Xu, Xiaoyang; Zhang, Chundong; Dai, Dongqiu

    2015-05-01

    The present study aimed to identify novel histone modification markers in gastric cancer (GC) by chromatin immunoprecipitation microarray (ChIP-chip) analysis and to determine whether these markers were able to discriminate between normal and GC cells. We also tested for correlations with DNA methylation. We probed a human CpG island microarray with DNA from a GC cell line (MKN45) by chromatin immunoprecipitation (ChIP). ChIP-reverse-transcriptase quantitative polymerase chain reaction PCR (RT-qPCR) was used to validate the microarray results. Additionally, mRNA expression levels and the DNA methylation of potential target genes were evaluated by RT-qPCR and methylation-specific PCR (MSP). The moults showed that 134 genes exhibited the highest signal-to-noise ratio of H3-K9 trimethylation over acetylation and 46 genes exhibited the highest signal-to-noise ratio of H3-K9 trimethylation over H3-K4 trimethylation in MKN45 cells. The ChIP-qPCR results agreed with those obtained from the ChIP-chip analysis. Aberrant DNA methylation status and mRNA expression levels were also identified for selected genes (PSD, SMARCC1 and Vps37A) in the GC cell lines. The results suggest that CpG island microarray coupled with ChIP (ChIP-chip) can identify novel targets of gene silencing in GC. Additionally, ChIP-chip is the best approach for assessing the genome-wide status of epigenetic regulation, which may allow for a broader genomic understanding compared to the knowledge that has been accumulated from single-gene studies. PMID:25738530

  19. Genome-wide analysis of histone modifications by ChIP-chip to identify silenced genes in gastric cancer.

    PubMed

    Zhu, Xinjiang; Liu, Jian; Xu, Xiaoyang; Zhang, Chundong; Dai, Dongqiu

    2015-05-01

    The present study aimed to identify novel histone modification markers in gastric cancer (GC) by chromatin immunoprecipitation microarray (ChIP-chip) analysis and to determine whether these markers were able to discriminate between normal and GC cells. We also tested for correlations with DNA methylation. We probed a human CpG island microarray with DNA from a GC cell line (MKN45) by chromatin immunoprecipitation (ChIP). ChIP-reverse-transcriptase quantitative polymerase chain reaction PCR (RT-qPCR) was used to validate the microarray results. Additionally, mRNA expression levels and the DNA methylation of potential target genes were evaluated by RT-qPCR and methylation-specific PCR (MSP). The moults showed that 134 genes exhibited the highest signal-to-noise ratio of H3-K9 trimethylation over acetylation and 46 genes exhibited the highest signal-to-noise ratio of H3-K9 trimethylation over H3-K4 trimethylation in MKN45 cells. The ChIP-qPCR results agreed with those obtained from the ChIP-chip analysis. Aberrant DNA methylation status and mRNA expression levels were also identified for selected genes (PSD, SMARCC1 and Vps37A) in the GC cell lines. The results suggest that CpG island microarray coupled with ChIP (ChIP-chip) can identify novel targets of gene silencing in GC. Additionally, ChIP-chip is the best approach for assessing the genome-wide status of epigenetic regulation, which may allow for a broader genomic understanding compared to the knowledge that has been accumulated from single-gene studies.

  20. Genome-wide association study and follow-up analysis of adiposity traits in Hispanic Americans: the IRAS Family Study.

    PubMed

    Norris, Jill M; Langefeld, Carl D; Talbert, Matthew E; Wing, Maria R; Haritunians, Talin; Fingerlin, Tasha E; Hanley, Anthony J G; Ziegler, Julie T; Taylor, Kent D; Haffner, Steven M; Chen, Yii-Der I; Bowden, Donald W; Wagenknecht, Lynne E

    2009-10-01

    We investigated candidate genomic regions associated with computed tomography (CT)-derived measures of adiposity in Hispanics from the Insulin Resistance Atherosclerosis Study Family Study (IRASFS). In 1,190 Hispanic individuals from 92 families 3 from the San Luis Valley, Colorado and San Antonio, Texas, we measured CT-derived visceral adipose tissue (VAT), subcutaneous adipose tissue (SAT), and visceral:subcutaneous ratio (VSR). A genome-wide association study (GWAS) was completed using the Illumina HumanHap 300 BeadChip (approximately 317K single-nucleotide polymorphisms (SNPs)) in 229 individuals from the San Antonio site (stage 1). In total, 297 SNPs with evidence for association with VAT, SAT, or VSR, adjusting for age and sex (P<0.001), were genotyped in the remaining 961 Hispanic samples. The entire Hispanic cohort (n=1,190) was then tested for association, adjusting for age, sex, site of recruitment, and admixture estimates (stage 2). In stage 3, additional SNPs were genotyped in four genic regions showing evidence of association in stage 2. Several SNPs were associated in the GWAS (P<1x10(-5)) and were confirmed to be significantly associated in the entire Hispanic cohort (P<0.01), including: rs7543757 for VAT, rs4754373 and rs11212913 for SAT, and rs4541696 and rs4134351 for VSR. Numerous SNPs were associated with multiple adiposity phenotypes. Targeted analysis of four genes whose SNPs were significant in stage 2 suggests candidate genes for influencing the distribution (RGS6) and amount of adiposity (NGEF). Several candidate loci, including RGS6 and NGEF, are associated with CT-derived adipose fat measures in Hispanic Americans in a three-stage genetic association study.

  1. Genome-wide identification and characterization of WRKY gene family in Salix suchowensis.

    PubMed

    Bi, Changwei; Xu, Yiqing; Ye, Qiaolin; Yin, Tongming; Ye, Ning

    2016-01-01

    WRKY proteins are the zinc finger transcription factors that were first identified in plants. They can specifically interact with the W-box, which can be found in the promoter region of a large number of plant target genes, to regulate the expressions of downstream target genes. They also participate in diverse physiological and growing processes in plants. Prior to this study, a plenty of WRKY genes have been identified and characterized in herbaceous species, but there is no large-scale study of WRKY genes in willow. With the whole genome sequencing of Salix suchowensis, we have the opportunity to conduct the genome-wide research for willow WRKY gene family. In this study, we identified 85 WRKY genes in the willow genome and renamed them from SsWRKY1 to SsWRKY85 on the basis of their specific distributions on chromosomes. Due to their diverse structural features, the 85 willow WRKY genes could be further classified into three main groups (group I-III), with five subgroups (IIa-IIe) in group II. With the multiple sequence alignment and the manual search, we found three variations of the WRKYGQK heptapeptide: WRKYGRK, WKKYGQK and WRKYGKK, and four variations of the normal zinc finger motif, which might execute some new biological functions. In addition, the SsWRKY genes from the same subgroup share the similar exon-intron structures and conserved motif domains. Further studies of SsWRKY genes revealed that segmental duplication events (SDs) played a more prominent role in the expansion of SsWRKY genes. Distinct expression profiles of SsWRKY genes with RNA sequencing data revealed that diverse expression patterns among five tissues, including tender roots, young leaves, vegetative buds, non-lignified stems and barks. With the analyses of WRKY gene family in willow, it is not only beneficial to complete the functional and annotation information of WRKY genes family in woody plants, but also provide important references to investigate the expansion and evolution of

  2. Genome-wide identification and characterization of WRKY gene family in Salix suchowensis

    PubMed Central

    Ye, Qiaolin; Yin, Tongming

    2016-01-01

    WRKY proteins are the zinc finger transcription factors that were first identified in plants. They can specifically interact with the W-box, which can be found in the promoter region of a large number of plant target genes, to regulate the expressions of downstream target genes. They also participate in diverse physiological and growing processes in plants. Prior to this study, a plenty of WRKY genes have been identified and characterized in herbaceous species, but there is no large-scale study of WRKY genes in willow. With the whole genome sequencing of Salix suchowensis, we have the opportunity to conduct the genome-wide research for willow WRKY gene family. In this study, we identified 85 WRKY genes in the willow genome and renamed them from SsWRKY1 to SsWRKY85 on the basis of their specific distributions on chromosomes. Due to their diverse structural features, the 85 willow WRKY genes could be further classified into three main groups (group I–III), with five subgroups (IIa–IIe) in group II. With the multiple sequence alignment and the manual search, we found three variations of the WRKYGQK heptapeptide: WRKYGRK, WKKYGQK and WRKYGKK, and four variations of the normal zinc finger motif, which might execute some new biological functions. In addition, the SsWRKY genes from the same subgroup share the similar exon–intron structures and conserved motif domains. Further studies of SsWRKY genes revealed that segmental duplication events (SDs) played a more prominent role in the expansion of SsWRKY genes. Distinct expression profiles of SsWRKY genes with RNA sequencing data revealed that diverse expression patterns among five tissues, including tender roots, young leaves, vegetative buds, non-lignified stems and barks. With the analyses of WRKY gene family in willow, it is not only beneficial to complete the functional and annotation information of WRKY genes family in woody plants, but also provide important references to investigate the expansion and evolution

  3. Genome-Wide Identification and Characterization of R2R3MYB Family in Cucumis sativus

    PubMed Central

    Li, Qiang; Zhang, Cunjia; Li, Jing; Wang, Lina; Ren, Zhonghai

    2012-01-01

    Background The R2R3MYB proteins comprise one of the largest families of transcription factors in plants. Although genome-wide analysis of this family has been carried out in some species, little is known about R2R3MYB genes in cucumber (Cucumis sativus L.). Principal Findings This study has identified 55 R2R3MYB genes in the latest cucumber genome and the CsR2R3MYB family contained the smallest number of identified genes compared to other species that have been studied due to the absence of recent gene duplication events. These results were also supported by genome distribution and gene duplication analysis. Phylogenetic analysis showed that they could be classified into 11 subgroups. The evolutionary relationships and the intron - exon organizations that showed similarities with Arabidopsis, Vitis and Glycine R2R3MYB proteins were also analyzed and suggested strong gene conservation but also the expansions of particular functional genes during the evolution of the plant species. In addition, we found that 8 out of 55 (∼14.54%) cucumber R2R3MYB genes underwent alternative splicing events, producing a variety of transcripts from a single gene, which illustrated the extremely high complexity of transcriptome regulation. Tissue-specific expression profiles showed that 50 cucumber R2R3MYB genes were expressed in at least one of the tissues and the other 5 genes showed very low expression in all tissues tested, which suggested that cucumber R2R3MYB genes took part in many cellular processes. The transcript abundance level analysis during abiotic conditions (NaCl, ABA and low temperature treatments) identified a group of R2R3MYB genes that responded to one or more treatments. Conclusions This study has produced a comparative genomics analysis of the cucumber R2R3MYB gene family and has provided the first steps towards the selection of CsR2R3MYB genes for cloning and functional dissection that can be used in further studies to uncover their roles in cucumber growth and

  4. Genome-wide identification and characterization of WRKY gene family in Salix suchowensis

    PubMed Central

    Ye, Qiaolin; Yin, Tongming

    2016-01-01

    WRKY proteins are the zinc finger transcription factors that were first identified in plants. They can specifically interact with the W-box, which can be found in the promoter region of a large number of plant target genes, to regulate the expressions of downstream target genes. They also participate in diverse physiological and growing processes in plants. Prior to this study, a plenty of WRKY genes have been identified and characterized in herbaceous species, but there is no large-scale study of WRKY genes in willow. With the whole genome sequencing of Salix suchowensis, we have the opportunity to conduct the genome-wide research for willow WRKY gene family. In this study, we identified 85 WRKY genes in the willow genome and renamed them from SsWRKY1 to SsWRKY85 on the basis of their specific distributions on chromosomes. Due to their diverse structural features, the 85 willow WRKY genes could be further classified into three main groups (group I–III), with five subgroups (IIa–IIe) in group II. With the multiple sequence alignment and the manual search, we found three variations of the WRKYGQK heptapeptide: WRKYGRK, WKKYGQK and WRKYGKK, and four variations of the normal zinc finger motif, which might execute some new biological functions. In addition, the SsWRKY genes from the same subgroup share the similar exon–intron structures and conserved motif domains. Further studies of SsWRKY genes revealed that segmental duplication events (SDs) played a more prominent role in the expansion of SsWRKY genes. Distinct expression profiles of SsWRKY genes with RNA sequencing data revealed that diverse expression patterns among five tissues, including tender roots, young leaves, vegetative buds, non-lignified stems and barks. With the analyses of WRKY gene family in willow, it is not only beneficial to complete the functional and annotation information of WRKY genes family in woody plants, but also provide important references to investigate the expansion and evolution

  5. Genome-Wide Profiling of Yeast DNA:RNA Hybrid Prone Sites with DRIP-Chip

    PubMed Central

    Lu, Phoebe Y. T.; Luo, Zongli; Hamza, Akil; Kobor, Michael S.; Stirling, Peter C.; Hieter, Philip

    2014-01-01

    DNA:RNA hybrid formation is emerging as a significant cause of genome instability in biological systems ranging from bacteria to mammals. Here we describe the genome-wide distribution of DNA:RNA hybrid prone loci in Saccharomyces cerevisiae by DNA:RNA immunoprecipitation (DRIP) followed by hybridization on tiling microarray. These profiles show that DNA:RNA hybrids preferentially accumulated at rDNA, Ty1 and Ty2 transposons, telomeric repeat regions and a subset of open reading frames (ORFs). The latter are generally highly transcribed and have high GC content. Interestingly, significant DNA:RNA hybrid enrichment was also detected at genes associated with antisense transcripts. The expression of antisense-associated genes was also significantly altered upon overexpression of RNase H, which degrades the RNA in hybrids. Finally, we uncover mutant-specific differences in the DRIP profiles of a Sen1 helicase mutant, RNase H deletion mutant and Hpr1 THO complex mutant compared to wild type, suggesting different roles for these proteins in DNA:RNA hybrid biology. Our profiles of DNA:RNA hybrid prone loci provide a resource for understanding the properties of hybrid-forming regions in vivo, extend our knowledge of hybrid-mitigating enzymes, and contribute to models of antisense-mediated gene regulation. A summary of this paper was presented at the 26th International Conference on Yeast Genetics and Molecular Biology, August 2013. PMID:24743342

  6. Genome-wide profiling of yeast DNA:RNA hybrid prone sites with DRIP-chip.

    PubMed

    Chan, Yujia A; Aristizabal, Maria J; Lu, Phoebe Y T; Luo, Zongli; Hamza, Akil; Kobor, Michael S; Stirling, Peter C; Hieter, Philip

    2014-04-01

    DNA:RNA hybrid formation is emerging as a significant cause of genome instability in biological systems ranging from bacteria to mammals. Here we describe the genome-wide distribution of DNA:RNA hybrid prone loci in Saccharomyces cerevisiae by DNA:RNA immunoprecipitation (DRIP) followed by hybridization on tiling microarray. These profiles show that DNA:RNA hybrids preferentially accumulated at rDNA, Ty1 and Ty2 transposons, telomeric repeat regions and a subset of open reading frames (ORFs). The latter are generally highly transcribed and have high GC content. Interestingly, significant DNA:RNA hybrid enrichment was also detected at genes associated with antisense transcripts. The expression of antisense-associated genes was also significantly altered upon overexpression of RNase H, which degrades the RNA in hybrids. Finally, we uncover mutant-specific differences in the DRIP profiles of a Sen1 helicase mutant, RNase H deletion mutant and Hpr1 THO complex mutant compared to wild type, suggesting different roles for these proteins in DNA:RNA hybrid biology. Our profiles of DNA:RNA hybrid prone loci provide a resource for understanding the properties of hybrid-forming regions in vivo, extend our knowledge of hybrid-mitigating enzymes, and contribute to models of antisense-mediated gene regulation. A summary of this paper was presented at the 26th International Conference on Yeast Genetics and Molecular Biology, August 2013. PMID:24743342

  7. Genome-wide variant analysis of simplex autism families with an integrative clinical-bioinformatics pipeline

    PubMed Central

    Jiménez-Barrón, Laura T.; O'Rawe, Jason A.; Wu, Yiyang; Yoon, Margaret; Fang, Han; Iossifov, Ivan; Lyon, Gholson J.

    2015-01-01

    Autism spectrum disorders (ASDs) are a group of developmental disabilities that affect social interaction and communication and are characterized by repetitive behaviors. There is now a large body of evidence that suggests a complex role of genetics in ASDs, in which many different loci are involved. Although many current population-scale genomic studies have been demonstrably fruitful, these studies generally focus on analyzing a limited part of the genome or use a limited set of bioinformatics tools. These limitations preclude the analysis of genome-wide perturbations that may contribute to the development and severity of ASD-related phenotypes. To overcome these limitations, we have developed and utilized an integrative clinical and bioinformatics pipeline for generating a more complete and reliable set of genomic variants for downstream analyses. Our study focuses on the analysis of three simplex autism families consisting of one affected child, unaffected parents, and one unaffected sibling. All members were clinically evaluated and widely phenotyped. Genotyping arrays and whole-genome sequencing were performed on each member, and the resulting sequencing data were analyzed using a variety of available bioinformatics tools. We searched for rare variants of putative functional impact that were found to be segregating according to de novo, autosomal recessive, X-linked, mitochondrial, and compound heterozygote transmission models. The resulting candidate variants included three small heterozygous copy-number variations (CNVs), a rare heterozygous de novo nonsense mutation in MYBBP1A located within exon 1, and a novel de novo missense variant in LAMB3. Our work demonstrates how more comprehensive analyses that include rich clinical data and whole-genome sequencing data can generate reliable results for use in downstream investigations. PMID:27148569

  8. Genome-Wide Analysis and Characterization of Aux/IAA Family Genes in Brassica rapa

    PubMed Central

    Rameneni, Jana Jeevan; Li, Xiaonan; Sivanandhan, Ganesan; Choi, Su Ryun; Pang, Wenxing; Im, Subin; Lim, Yong Pyo

    2016-01-01

    Auxins are the key players in plant growth development involving leaf formation, phototropism, root, fruit and embryo development. Auxin/Indole-3-Acetic Acid (Aux/IAA) are early auxin response genes noted as transcriptional repressors in plant auxin signaling. However, many studies focus on Aux/ARF gene families and much less is known about the Aux/IAA gene family in Brassica rapa (B. rapa). Here we performed a comprehensive genome-wide analysis and identified 55 Aux/IAA genes in B. rapa using four conserved motifs of Aux/IAA family (PF02309). Chromosomal mapping of the B. rapa Aux/IAA (BrIAA) genes facilitated understanding cluster rearrangement of the crucifer building blocks in the genome. Phylogenetic analysis of BrIAA with Arabidopsis thaliana, Oryza sativa and Zea mays identified 51 sister pairs including 15 same species (BrIAA—BrIAA) and 36 cross species (BrIAA—AtIAA) IAA genes. Among the 55 BrIAA genes, expression of 43 and 45 genes were verified using Genebank B. rapa ESTs and in home developed microarray data from mature leaves of Chiifu and RcBr lines. Despite their huge morphological difference, tissue specific expression analysis of BrIAA genes between the parental lines Chiifu and RcBr showed that the genes followed a similar pattern of expression during leaf development and a different pattern during bud, flower and siliqua development stages. The response of the BrIAA genes to abiotic and auxin stress at different time intervals revealed their involvement in stress response. Single Nucleotide Polymorphisms between IAA genes of reference genome Chiifu and RcBr were focused and identified. Our study examines the scope of conservation and divergence of Aux/IAA genes and their structures in B. rapa. Analyzing the expression and structural variation between two parental lines will significantly contribute to functional genomics of Brassica crops and we belive our study would provide a foundation in understanding the Aux/IAA genes in B. rapa. PMID

  9. Genome-Wide Analysis and Characterization of Aux/IAA Family Genes in Brassica rapa.

    PubMed

    Paul, Parameswari; Dhandapani, Vignesh; Rameneni, Jana Jeevan; Li, Xiaonan; Sivanandhan, Ganesan; Choi, Su Ryun; Pang, Wenxing; Im, Subin; Lim, Yong Pyo

    2016-01-01

    Auxins are the key players in plant growth development involving leaf formation, phototropism, root, fruit and embryo development. Auxin/Indole-3-Acetic Acid (Aux/IAA) are early auxin response genes noted as transcriptional repressors in plant auxin signaling. However, many studies focus on Aux/ARF gene families and much less is known about the Aux/IAA gene family in Brassica rapa (B. rapa). Here we performed a comprehensive genome-wide analysis and identified 55 Aux/IAA genes in B. rapa using four conserved motifs of Aux/IAA family (PF02309). Chromosomal mapping of the B. rapa Aux/IAA (BrIAA) genes facilitated understanding cluster rearrangement of the crucifer building blocks in the genome. Phylogenetic analysis of BrIAA with Arabidopsis thaliana, Oryza sativa and Zea mays identified 51 sister pairs including 15 same species (BrIAA-BrIAA) and 36 cross species (BrIAA-AtIAA) IAA genes. Among the 55 BrIAA genes, expression of 43 and 45 genes were verified using Genebank B. rapa ESTs and in home developed microarray data from mature leaves of Chiifu and RcBr lines. Despite their huge morphological difference, tissue specific expression analysis of BrIAA genes between the parental lines Chiifu and RcBr showed that the genes followed a similar pattern of expression during leaf development and a different pattern during bud, flower and siliqua development stages. The response of the BrIAA genes to abiotic and auxin stress at different time intervals revealed their involvement in stress response. Single Nucleotide Polymorphisms between IAA genes of reference genome Chiifu and RcBr were focused and identified. Our study examines the scope of conservation and divergence of Aux/IAA genes and their structures in B. rapa. Analyzing the expression and structural variation between two parental lines will significantly contribute to functional genomics of Brassica crops and we belive our study would provide a foundation in understanding the Aux/IAA genes in B. rapa.

  10. Genome-Wide Analysis and Characterization of Aux/IAA Family Genes in Brassica rapa.

    PubMed

    Paul, Parameswari; Dhandapani, Vignesh; Rameneni, Jana Jeevan; Li, Xiaonan; Sivanandhan, Ganesan; Choi, Su Ryun; Pang, Wenxing; Im, Subin; Lim, Yong Pyo

    2016-01-01

    Auxins are the key players in plant growth development involving leaf formation, phototropism, root, fruit and embryo development. Auxin/Indole-3-Acetic Acid (Aux/IAA) are early auxin response genes noted as transcriptional repressors in plant auxin signaling. However, many studies focus on Aux/ARF gene families and much less is known about the Aux/IAA gene family in Brassica rapa (B. rapa). Here we performed a comprehensive genome-wide analysis and identified 55 Aux/IAA genes in B. rapa using four conserved motifs of Aux/IAA family (PF02309). Chromosomal mapping of the B. rapa Aux/IAA (BrIAA) genes facilitated understanding cluster rearrangement of the crucifer building blocks in the genome. Phylogenetic analysis of BrIAA with Arabidopsis thaliana, Oryza sativa and Zea mays identified 51 sister pairs including 15 same species (BrIAA-BrIAA) and 36 cross species (BrIAA-AtIAA) IAA genes. Among the 55 BrIAA genes, expression of 43 and 45 genes were verified using Genebank B. rapa ESTs and in home developed microarray data from mature leaves of Chiifu and RcBr lines. Despite their huge morphological difference, tissue specific expression analysis of BrIAA genes between the parental lines Chiifu and RcBr showed that the genes followed a similar pattern of expression during leaf development and a different pattern during bud, flower and siliqua development stages. The response of the BrIAA genes to abiotic and auxin stress at different time intervals revealed their involvement in stress response. Single Nucleotide Polymorphisms between IAA genes of reference genome Chiifu and RcBr were focused and identified. Our study examines the scope of conservation and divergence of Aux/IAA genes and their structures in B. rapa. Analyzing the expression and structural variation between two parental lines will significantly contribute to functional genomics of Brassica crops and we belive our study would provide a foundation in understanding the Aux/IAA genes in B. rapa. PMID

  11. Genome-wide association study reveals greater polygenic loading for schizophrenia in cases with a family history of illness.

    PubMed

    Bigdeli, Tim B; Ripke, Stephan; Bacanu, Silviu-Alin; Lee, Sang Hong; Wray, Naomi R; Gejman, Pablo V; Rietschel, Marcella; Cichon, Sven; St Clair, David; Corvin, Aiden; Kirov, George; McQuillin, Andrew; Gurling, Hugh; Rujescu, Dan; Andreassen, Ole A; Werge, Thomas; Blackwood, Douglas H R; Pato, Carlos N; Pato, Michele T; Malhotra, Anil K; O'Donovan, Michael C; Kendler, Kenneth S; Fanous, Ayman H

    2016-03-01

    Genome-wide association studies (GWAS) of schizophrenia have yielded more than 100 common susceptibility variants, and strongly support a substantial polygenic contribution of a large number of small allelic effects. It has been hypothesized that familial schizophrenia is largely a consequence of inherited rather than environmental factors. We investigated the extent to which familiality of schizophrenia is associated with enrichment for common risk variants detectable in a large GWAS. We analyzed single nucleotide polymorphism (SNP) data for cases reporting a family history of psychotic illness (N = 978), cases reporting no such family history (N = 4,503), and unscreened controls (N = 8,285) from the Psychiatric Genomics Consortium (PGC1) study of schizophrenia. We used a multinomial logistic regression approach with model-fitting to detect allelic effects specific to either family history subgroup. We also considered a polygenic model, in which we tested whether family history positive subjects carried more schizophrenia risk alleles than family history negative subjects, on average. Several individual SNPs attained suggestive but not genome-wide significant association with either family history subgroup. Comparison of genome-wide polygenic risk scores based on GWAS summary statistics indicated a significant enrichment for SNP effects among family history positive compared to family history negative cases (Nagelkerke's R(2 ) = 0.0021; P = 0.00331; P-value threshold <0.4). Estimates of variability in disease liability attributable to the aggregate effect of genome-wide SNPs were significantly greater for family history positive compared to family history negative cases (0.32 and 0.22, respectively; P = 0.031). We found suggestive evidence of allelic effects detectable in large GWAS of schizophrenia that might be specific to particular family history subgroups. However, consideration of a polygenic risk score indicated a significant

  12. ChIP on chip and ChIP-Seq assays: genome-wide analysis of transcription factor binding and histone modifications.

    PubMed

    Pillai, Smitha; Chellappan, Srikumar P

    2015-01-01

    Deregulation of transcriptional activity of many genes has been causatively linked to human diseases including cancer. Altered patterns of gene expression in normal and cancer cells are the result of inappropriate expression of transcription factors and chromatin modifying proteins. Chromatin immunoprecipitation assay is a well-established tool for investigating the interactions between regulatory proteins and DNA at distinct stages of gene activation. ChIP coupled with DNA microarrays, known as ChIP on chip, or sequencing of DNA associated with the factors (ChIP-Seq) allow us to determine the entire spectrum of in vivo DNA binding sites for a given protein. This has been of immense value because ChIP on chip assays and ChIP-Seq experiments can provide a snapshot of the transcriptional regulatory mechanisms on a genome-wide scale. This chapter outlines the general strategies used to carry out ChIP-chip assays to study the differential recruitment of regulatory molecules based on the studies conducted in our lab as well as other published protocols; these can be easily modified to a ChIP-Seq analysis.

  13. Combined genome-wide linkage and association analyses of fasting glucose level in healthy twins and families of Korea.

    PubMed

    Suh, Young Ju; Kim, Sunghwan; Kim, So Hun; Park, Jia; Lim, Hyun Ae; Park, Hyun Ju; Choi, Hangseok; Ng, Daniel; Lee, Mi Kyeong; Nam, Moonsuk

    2013-03-01

    This study was undertaken to identify genetic polymorphisms that are associated with the risk of an elevated fasting glucose (FG) level using genome-wide analyses. We explored a quantitative trait locus (QTL) for FG level in a genome-wide study from a Korean twin-family cohort (the Healthy Twin Study) using a combined linkage and family-based association analysis approach. We investigated 1,754 individuals, which included 432 families and 219 pairs of monozygotic twins. Regions of chromosomes 2q23.3-2q31.1, 15q26.1-15q26.3, 16p12.1, and 20p13-20p12.2, were found to show evidence of linkage with FG level, and several markers in these regions were found to be significantly associated with FG level using family-based or general association tests. In particular, a single-nucleotide polymorphism (rs6138953) on the PTPRA gene in the 20p13 region (combined P = 1.8 × 10(-6)) was found to be associated with FG level, and the PRKCB1 gene (in 16p12.1) to be possibly associated with FG level. In conclusion, multiple regions of chromosomes 2q23.3-2q31.1, 15q26.1-15q26.3, 16p12.1, and 20p13-20p12.2 are associated with FG level in our Korean twin-family cohort. The combined approach of genome-wide linkage and family-based association analysis is useful to identify novel or known genetic regions concerning FG level in a family cohort study.

  14. Genome-wide identification, isolation and expression analysis of auxin response factor (ARF) gene family in sweet orange (Citrus sinensis)

    PubMed Central

    Li, Si-Bei; OuYang, Wei-Zhi; Hou, Xiao-Jin; Xie, Liang-Liang; Hu, Chun-Gen; Zhang, Jin-Zhi

    2015-01-01

    Auxin response factors (ARFs) are an important family of proteins in auxin-mediated response, with key roles in various physiological and biochemical processes. To date, a genome-wide overview of the ARF gene family in citrus was not available. A systematic analysis of this gene family in citrus was begun by carrying out a genome-wide search for the homologs of ARFs. A total of 19 nonredundant ARF genes (CiARF) were found and validated from the sweet orange. A comprehensive overview of the CiARFs was undertaken, including the gene structures, phylogenetic analysis, chromosome locations, conserved motifs of proteins, and cis-elements in promoters of CiARF. Furthermore, expression profiling using real-time PCR revealed many CiARF genes, albeit with different patterns depending on types of tissues and/or developmental stages. Comprehensive expression analysis of these genes was also performed under two hormone treatments using real-time PCR. Indole-3-acetic acid (IAA) and N-1-napthylphthalamic acid (NPA) treatment experiments revealed differential up-regulation and down-regulation, respectively, of the 19 citrus ARF genes in the callus of sweet orange. Our comprehensive analysis of ARF genes further elucidates the roles of CiARF family members during citrus growth and development process. PMID:25870601

  15. Genome-Wide Analysis of the Musa WRKY Gene Family: Evolution and Differential Expression during Development and Stress.

    PubMed

    Goel, Ridhi; Pandey, Ashutosh; Trivedi, Prabodh K; Asif, Mehar H

    2016-01-01

    The WRKY gene family plays an important role in the development and stress responses in plants. As information is not available on the WRKY gene family in Musa species, genome-wide analysis has been carried out in this study using available genomic information from two species, Musa acuminata and Musa balbisiana. Analysis identified 147 and 132 members of the WRKY gene family in M. acuminata and M. balbisiana, respectively. Evolutionary analysis suggests that the WRKY gene family expanded much before the speciation in both the species. Most of the orthologs retained in two species were from the γ duplication event which occurred prior to α and β genome-wide duplication (GWD) events. Analysis also suggests that subtle changes in nucleotide sequences during the course of evolution have led to the development of new motifs which might be involved in neo-functionalization of different WRKY members in two species. Expression and cis-regulatory motif analysis suggest possible involvement of Group II and Group III WRKY members during various stresses and growth/development including fruit ripening process respectively. PMID:27014321

  16. Genome-Wide Analysis of the Musa WRKY Gene Family: Evolution and Differential Expression during Development and Stress

    PubMed Central

    Goel, Ridhi; Pandey, Ashutosh; Trivedi, Prabodh K.; Asif, Mehar H.

    2016-01-01

    The WRKY gene family plays an important role in the development and stress responses in plants. As information is not available on the WRKY gene family in Musa species, genome-wide analysis has been carried out in this study using available genomic information from two species, Musa acuminata and Musa balbisiana. Analysis identified 147 and 132 members of the WRKY gene family in M. acuminata and M. balbisiana, respectively. Evolutionary analysis suggests that the WRKY gene family expanded much before the speciation in both the species. Most of the orthologs retained in two species were from the γ duplication event which occurred prior to α and β genome-wide duplication (GWD) events. Analysis also suggests that subtle changes in nucleotide sequences during the course of evolution have led to the development of new motifs which might be involved in neo-functionalization of different WRKY members in two species. Expression and cis-regulatory motif analysis suggest possible involvement of Group II and Group III WRKY members during various stresses and growth/development including fruit ripening process respectively. PMID:27014321

  17. Genome-Wide Analysis of Basic/Helix-Loop-Helix Transcription Factor Family in Rice and Arabidopsis1[W

    PubMed Central

    Li, Xiaoxing; Duan, Xuepeng; Jiang, Haixiong; Sun, Yujin; Tang, Yuanping; Yuan, Zheng; Guo, Jingkang; Liang, Wanqi; Chen, Liang; Yin, Jingyuan; Ma, Hong; Wang, Jian; Zhang, Dabing

    2006-01-01

    The basic/helix-loop-helix (bHLH) transcription factors and their homologs form a large family in plant and animal genomes. They are known to play important roles in the specification of tissue types in animals. On the other hand, few plant bHLH proteins have been studied functionally. Recent completion of whole genome sequences of model plants Arabidopsis (Arabidopsis thaliana) and rice (Oryza sativa) allows genome-wide analysis and comparison of the bHLH family in flowering plants. We have identified 167 bHLH genes in the rice genome, and their phylogenetic analysis indicates that they form well-supported clades, which are defined as subfamilies. In addition, sequence analysis of potential DNA-binding activity, the sequence motifs outside the bHLH domain, and the conservation of intron/exon structural patterns further support the evolutionary relationships among these proteins. The genome distribution of rice bHLH genes strongly supports the hypothesis that genome-wide and tandem duplication contributed to the expansion of the bHLH gene family, consistent with the birth-and-death theory of gene family evolution. Bioinformatics analysis suggests that rice bHLH proteins can potentially participate in a variety of combinatorial interactions, endowing them with the capacity to regulate a multitude of transcriptional programs. In addition, similar expression patterns suggest functional conservation between some rice bHLH genes and their close Arabidopsis homologs. PMID:16896230

  18. Genome-wide identification, isolation and expression analysis of auxin response factor (ARF) gene family in sweet orange (Citrus sinensis).

    PubMed

    Li, Si-Bei; OuYang, Wei-Zhi; Hou, Xiao-Jin; Xie, Liang-Liang; Hu, Chun-Gen; Zhang, Jin-Zhi

    2015-01-01

    Auxin response factors (ARFs) are an important family of proteins in auxin-mediated response, with key roles in various physiological and biochemical processes. To date, a genome-wide overview of the ARF gene family in citrus was not available. A systematic analysis of this gene family in citrus was begun by carrying out a genome-wide search for the homologs of ARFs. A total of 19 nonredundant ARF genes (CiARF) were found and validated from the sweet orange. A comprehensive overview of the CiARFs was undertaken, including the gene structures, phylogenetic analysis, chromosome locations, conserved motifs of proteins, and cis-elements in promoters of CiARF. Furthermore, expression profiling using real-time PCR revealed many CiARF genes, albeit with different patterns depending on types of tissues and/or developmental stages. Comprehensive expression analysis of these genes was also performed under two hormone treatments using real-time PCR. Indole-3-acetic acid (IAA) and N-1-napthylphthalamic acid (NPA) treatment experiments revealed differential up-regulation and down-regulation, respectively, of the 19 citrus ARF genes in the callus of sweet orange. Our comprehensive analysis of ARF genes further elucidates the roles of CiARF family members during citrus growth and development process.

  19. Genome-wide Study of Families with Absolute Pitch Reveals Linkage to 8q24.21 and Locus Heterogeneity

    PubMed Central

    Theusch, Elizabeth; Basu, Analabha; Gitschier, Jane

    2009-01-01

    Absolute pitch (AP) is the rare ability to instantaneously recognize and label tones with their musical note names without using a reference pitch for comparison. The etiology of AP is complex. Prior studies have implicated both genetic and environmental factors in its genesis, yet the molecular basis for AP remains unknown. To locate regions of the human genome that may harbor AP-predisposing genetic variants, we performed a genome-wide linkage study on 73 multiplex AP families by genotyping them with 6090 SNP markers. Nonparametric multipoint linkage analyses were conducted, and the strongest evidence for linkage was observed on chromosome 8q24.21 in the subset of 45 families with European ancestry (exponential LOD score = 3.464, empirical genome-wide p = 0.03). Other regions with suggestive LOD scores included chromosomes 7q22.3, 8q21.11, and 9p21.3. Of these four regions, only the 7q22.3 linkage peak was also evident when 19 families with East Asian ancestry were analyzed separately. Though only one of these regions has yet reached statistical significance individually, we detected a larger number of independent linkage peaks than expected by chance overall, indicating that AP is genetically heterogeneous. PMID:19576568

  20. Family-based genome-wide association study of frontal θ oscillations identifies potassium channel gene KCNJ6.

    PubMed

    Kang, S J; Rangaswamy, M; Manz, N; Wang, J-C; Wetherill, L; Hinrichs, T; Almasy, L; Brooks, A; Chorlian, D B; Dick, D; Hesselbrock, V; Kramer, J; Kuperman, S; Nurnberger, J; Rice, J; Schuckit, M; Tischfield, J; Bierut, L J; Edenberg, H J; Goate, A; Foroud, T; Porjesz, B

    2012-08-01

    Event-related oscillations (EROs) represent highly heritable neuroelectric correlates of cognitive processes that manifest deficits in alcoholics and in offspring at high risk to develop alcoholism. Theta ERO to targets in the visual oddball task has been shown to be an endophenotype for alcoholism. A family-based genome-wide association study was performed for the frontal theta ERO phenotype using 634 583 autosomal single nucleotide polymorphisms (SNPs) genotyped in 1560 family members from 117 families densely affected by alcohol use disorders, recruited in the Collaborative Study on the Genetics of Alcoholism. Genome-wide significant association was found with several SNPs on chromosome 21 in KCNJ6 (a potassium inward rectifier channel; KIR3.2/GIRK2), with the most significant SNP at P = 4.7 × 10(-10)). The same SNPs were also associated with EROs from central and parietal electrodes, but with less significance, suggesting that the association is frontally focused. One imputed synonymous SNP in exon four, highly correlated with our top three SNPs, was significantly associated with the frontal theta ERO phenotype. These results suggest KCNJ6 or its product GIRK2 account for some of the variations in frontal theta band oscillations. GIRK2 receptor activation contributes to slow inhibitory postsynaptic potentials that modulate neuronal excitability, and therefore influence neuronal networks. PMID:22554406

  1. Genome-wide identification and characterization of aquaporin gene family in moso bamboo (Phyllostachys edulis).

    PubMed

    Sun, Huayu; Li, Lichao; Lou, Yongfeng; Zhao, Hansheng; Gao, Zhimin

    2016-05-01

    Aquaporins (AQPs) are known to play a major role in maintaining water and hydraulic conductivity balance in the plant system. Numerous studies have showed AQPs execute multi-function throughout plant growth and development, including water transport, nitrogen, carbon, and micronutrient acquisition etc. However, little information on AQPs is known in bamboo. In this study, we present the first genome-wide identification and characterization of AQP genes in moso bamboo (Phyllostachys edulis) using bioinformatics. In total, 26 AQP genes were identified by homologous analysis, which were divided into four groups (PIPs, TIPs, NIPs, and SIPs) based on the phylogenetic analysis. All the genes were located on 26 different scaffolds respectively on basis of the gene mapped to bamboo genome. Evolutionary analysis indicated that Ph. edulis was more close to Oryza sativa than Zea mays in the genetic relationship. Besides, qRT-PCR was used to analyze gene expression profiles, which revealed that AQP genes were expressed constitutively in all the detected tissues, and were all responsive to the environmental cues such as drought, water, and NaCl stresses. This data suggested that AQPs may play fundamental roles in maintaining normal growth and development of bamboo, which would contribute to better understanding for the complex regulation mechanism involved in the fast-growing process of bamboo. Furthermore, the result could provide valuable information for further research on bamboo functional genomics. PMID:26993482

  2. Family based genome-wide copy number scan identifies complex rearrangements at 17q21.31 in dyslexics.

    PubMed

    Veerappa, Avinash M; Saldanha, Marita; Padakannaya, Prakash; Ramachandra, Nallur B

    2014-10-01

    Developmental dyslexia (DD) is a complex heritable disorder with unexpected difficulty in learning to read and spell despite adequate intelligence, education, environment, and normal senses. We performed genome-wide screening for copy number variations (CNVs) in 10 large Indian dyslexic families using Affymetrix Genome-Wide Human SNP Array 6.0. Results revealed the complex genomic rearrangements due to one non-contiguous deletion and five contiguous micro duplications and micro deletions at 17q21.31 region in three dyslexic families. CNVs in this region harbor the genes KIAA1267, LRRC37A, ARL17A/B, NSFP1, and NSF. The CNVs in case 1 and case 2 at this locus were found to be in homozygous state and case 3 was a de novo CNV. These CNVs were found with at least one CNV having a common break and end points in the parents. This cluster of genes containing NSF is implicated in learning, cognition, and memory, though not formally associated with dyslexia. Molecular network analysis of these and other dyslexia related module genes suggests NSF and other genes to be associated with cellular/vesicular membrane fusion and synaptic transmission. Thus, we suggest that NSF in this cluster would be the nearest gene responsible for the learning disability phenotype.

  3. Genome-wide association study of vitamin D concentrations in Hispanic Americans: the IRAS family study.

    PubMed

    Engelman, Corinne D; Meyers, Kristin J; Ziegler, Julie T; Taylor, Kent D; Palmer, Nicholette D; Haffner, Steven M; Fingerlin, Tasha E; Wagenknecht, Lynne E; Rotter, Jerome I; Bowden, Donald W; Langefeld, Carl D; Norris, Jill M

    2010-10-01

    Vitamin D deficiency is associated with many adverse health outcomes. There are several well established environmental predictors of vitamin D concentrations, yet studies of the genetic determinants of vitamin D concentrations are in their infancy. Our objective was to conduct a pilot genome-wide association (GWA) study of 25-hydroxyvitamin D (25[OH]D) and 1,25-dihydroxyvitamin D (1,25[OH](2)D) concentrations in a subset of 229 Hispanic subjects, followed by replication genotyping of 50 single nucleotide polymorphisms (SNPs) in the entire sample of 1190 Hispanics from San Antonio, Texas and San Luis Valley, Colorado. Of the 309,200 SNPs that met all quality control criteria, three SNPs in high linkage disequilibrium (LD) with each other were significantly associated with 1,25[OH](2)D (rs6680429, rs9970802, and rs10889028) at a Bonferroni corrected P-value threshold of 1.62 × 10(-7), however none met the threshold for 25[OH]D. Of the 50 SNPs selected for replication genotyping, five for 25[OH]D (rs2806508, rs10141935, rs4778359, rs1507023, and rs9937918) and eight for 1,25[OH](2)D (rs6680429, rs1348864, rs4559029, rs12667374, rs7781309, rs10505337, rs2486443, and rs2154175) were replicated in the entire sample of Hispanics (P<0.01). In conclusion, we identified several SNPs that were associated with vitamin D metabolite concentrations in Hispanics. These candidate polymorphisms merit further investigation in independent populations and other ethnicities.

  4. Genome-wide analysis and molecular dissection of the SPL gene family in Salvia miltiorrhiza.

    PubMed

    Zhang, Linsu; Wu, Bin; Zhao, Degang; Li, Caili; Shao, Fenjuan; Lu, Shanfa

    2014-01-01

    SQUAMOSA promoter binding protein-likes (SPLs) are plant-specific transcription factors playing vital regulatory roles in plant growth and development. There is no information about SPLs in Salvia miltiorrhiza (Danshen), a significant medicinal plant widely used in Traditional Chinese medicine (TCM) for >1,700 years and an emerging model plant for TCM studies. Through genome-wide identification and subsequent molecular cloning, we identified a total 15 SmSPLs with divergent sequence features, gene structures, and motifs. Comparative analysis showed sequence conservation between SmSPLs and their Arabidopsis counterparts. A phylogenetic tree clusters SmSPLs into six groups. Many of the motifs identified commonly exist in a group/subgroup, implying their functional redundancy. Eight SmSPLs were predicted and experimentally validated to be targets of miR156/157. SmSPLs were differentially expressed in various tissues of S. milltiorrhiza. The expression of miR156/157-targeted SmSPLs was increased with the maturation of S. miltiorrhiza, whereas the expression of miR156/157 was decreased, confirming the regulatory roles of miR156/157 in SmSPLs and suggesting the functions of SmSPLs in S. miltiorrhiza development. The expression of miR156/157 was negatively correlated with miR172 during the maturation of S. miltiorrhiza. The results indicate the significance and complexity of SmSPL-, miR156-, and miR172-mediated regulation of developmental timing in S. miltiorrhiza. PMID:24112769

  5. Genome-Wide Identification and Expression Profile of Dof Transcription Factor Gene Family in Pepper (Capsicum annuum L.).

    PubMed

    Wu, Zhiming; Cheng, Jiaowen; Cui, Junjie; Xu, Xiaowan; Liang, Guansheng; Luo, Xirong; Chen, Xiaocui; Tang, Xiangqun; Hu, Kailin; Qin, Cheng

    2016-01-01

    Dof (DNA-binding One Zinc Finger) transcription factor family is unique to plants and has diverse roles associated with plant-specific phenomena, such as light, phytohormone and defense responses as well as seed development and germination. Although, genome-wide analysis of this family has been performed in many species, information regarding Dof genes in the pepper, Capsicum annuum L., is extremely limited. In this study, exhaustive searches of pepper genome revealed 33 potential CaDofs that were phylogenetically clustered into four subgroups. Twenty-nine of the 33 Dof genes could be mapped on 11 chromosomes, except for chromosome 7. The intron/exon organizations and conserved motif compositions of these genes were also analyzed. Additionally, phylogenetic analysis and classification of the Dof transcription factor family in eight plant species revealed that S. lycopersicum and C. annuum as well as O. sativa and S. bicolor Dof proteins may have evolved conservatively. Moreover, comprehensive expression analysis of CaDofs using a RNA-seq atlas and quantitative real-time polymerase chain reaction (qRT-PCR) revealed that these genes exhibit a variety of expression patterns. Most of the CaDofs were expressed in at least one of the tissues tested, whereas several genes were identified as being highly responsive to heat and salt stresses. Overall, this study describes the first genome-wide analysis of the pepper Dof family, whose genes exhibited different expression patterns in all primary fruit developmental stages and tissue types, as in response to abiotic stress. In particular, some Dof genes might be used as biomarkers for heat and salt stress. The results could expand our understanding of the roles of Dof genes in pepper.

  6. Genome-Wide Identification and Expression Profile of Dof Transcription Factor Gene Family in Pepper (Capsicum annuum L.)

    PubMed Central

    Wu, Zhiming; Cheng, Jiaowen; Cui, Junjie; Xu, Xiaowan; Liang, Guansheng; Luo, Xirong; Chen, Xiaocui; Tang, Xiangqun; Hu, Kailin; Qin, Cheng

    2016-01-01

    Dof (DNA-binding One Zinc Finger) transcription factor family is unique to plants and has diverse roles associated with plant-specific phenomena, such as light, phytohormone and defense responses as well as seed development and germination. Although, genome-wide analysis of this family has been performed in many species, information regarding Dof genes in the pepper, Capsicum annuum L., is extremely limited. In this study, exhaustive searches of pepper genome revealed 33 potential CaDofs that were phylogenetically clustered into four subgroups. Twenty-nine of the 33 Dof genes could be mapped on 11 chromosomes, except for chromosome 7. The intron/exon organizations and conserved motif compositions of these genes were also analyzed. Additionally, phylogenetic analysis and classification of the Dof transcription factor family in eight plant species revealed that S. lycopersicum and C. annuum as well as O. sativa and S. bicolor Dof proteins may have evolved conservatively. Moreover, comprehensive expression analysis of CaDofs using a RNA-seq atlas and quantitative real-time polymerase chain reaction (qRT-PCR) revealed that these genes exhibit a variety of expression patterns. Most of the CaDofs were expressed in at least one of the tissues tested, whereas several genes were identified as being highly responsive to heat and salt stresses. Overall, this study describes the first genome-wide analysis of the pepper Dof family, whose genes exhibited different expression patterns in all primary fruit developmental stages and tissue types, as in response to abiotic stress. In particular, some Dof genes might be used as biomarkers for heat and salt stress. The results could expand our understanding of the roles of Dof genes in pepper. PMID:27200047

  7. Genome-Wide Identification and Expression Profile of Dof Transcription Factor Gene Family in Pepper (Capsicum annuum L.).

    PubMed

    Wu, Zhiming; Cheng, Jiaowen; Cui, Junjie; Xu, Xiaowan; Liang, Guansheng; Luo, Xirong; Chen, Xiaocui; Tang, Xiangqun; Hu, Kailin; Qin, Cheng

    2016-01-01

    Dof (DNA-binding One Zinc Finger) transcription factor family is unique to plants and has diverse roles associated with plant-specific phenomena, such as light, phytohormone and defense responses as well as seed development and germination. Although, genome-wide analysis of this family has been performed in many species, information regarding Dof genes in the pepper, Capsicum annuum L., is extremely limited. In this study, exhaustive searches of pepper genome revealed 33 potential CaDofs that were phylogenetically clustered into four subgroups. Twenty-nine of the 33 Dof genes could be mapped on 11 chromosomes, except for chromosome 7. The intron/exon organizations and conserved motif compositions of these genes were also analyzed. Additionally, phylogenetic analysis and classification of the Dof transcription factor family in eight plant species revealed that S. lycopersicum and C. annuum as well as O. sativa and S. bicolor Dof proteins may have evolved conservatively. Moreover, comprehensive expression analysis of CaDofs using a RNA-seq atlas and quantitative real-time polymerase chain reaction (qRT-PCR) revealed that these genes exhibit a variety of expression patterns. Most of the CaDofs were expressed in at least one of the tissues tested, whereas several genes were identified as being highly responsive to heat and salt stresses. Overall, this study describes the first genome-wide analysis of the pepper Dof family, whose genes exhibited different expression patterns in all primary fruit developmental stages and tissue types, as in response to abiotic stress. In particular, some Dof genes might be used as biomarkers for heat and salt stress. The results could expand our understanding of the roles of Dof genes in pepper. PMID:27200047

  8. The Use of Multiplicity Corrections, Order Statistics and Generalized Family-Wise Statistics with Application to Genome-Wide Studies

    PubMed Central

    Schrodi, Steven J.

    2016-01-01

    The most important decision faced by large-scale studies, such as those presently encountered in human genetics, is to distinguish between those tests that are true positives from those that are not. In the context of genetics, this entails the determination of genetic markers that actually underlie medically-relevant phenotypes from a vast number of makers typically interrogated in genome-wide studies. A critical part of these decisions relies on the appropriate statistical assessment of data obtained from tests across numerous markers. Several methods have been developed to aid with such analyses, with family-wise approaches, such as the Bonferroni and Dunn-Šidàk corrections, being popular. Conditions that motivate the use of family-wise corrections are explored. Although simple to implement, one major limitation of these approaches is that they assume that p-values are i.i.d. uniformly distributed under the null hypothesis. However, several factors may violate this assumption in genome-wide studies including effects from confounding by population stratification, the presence of related individuals, the correlational structure among genetic markers, and the use of limiting distributions for test statistics. Even after adjustment for such effects, the distribution of p-values can substantially depart from a uniform distribution under the null hypothesis. In this work, I present a decision theory for the use of family-wise corrections for multiplicity and a generalization of the Dunn-Šidàk correction that relaxes the assumption of uniformly-distributed null p-values. The independence assumption is also relaxed and handled through calculating the effective number of independent tests. I also explicitly show the relationship between order statistics and family-wise correction procedures. This generalization may be applicable to multiplicity problems outside of genomics. PMID:27128491

  9. The Use of Multiplicity Corrections, Order Statistics and Generalized Family-Wise Statistics with Application to Genome-Wide Studies.

    PubMed

    Schrodi, Steven J

    2016-01-01

    The most important decision faced by large-scale studies, such as those presently encountered in human genetics, is to distinguish between those tests that are true positives from those that are not. In the context of genetics, this entails the determination of genetic markers that actually underlie medically-relevant phenotypes from a vast number of makers typically interrogated in genome-wide studies. A critical part of these decisions relies on the appropriate statistical assessment of data obtained from tests across numerous markers. Several methods have been developed to aid with such analyses, with family-wise approaches, such as the Bonferroni and Dunn-Šidàk corrections, being popular. Conditions that motivate the use of family-wise corrections are explored. Although simple to implement, one major limitation of these approaches is that they assume that p-values are i.i.d. uniformly distributed under the null hypothesis. However, several factors may violate this assumption in genome-wide studies including effects from confounding by population stratification, the presence of related individuals, the correlational structure among genetic markers, and the use of limiting distributions for test statistics. Even after adjustment for such effects, the distribution of p-values can substantially depart from a uniform distribution under the null hypothesis. In this work, I present a decision theory for the use of family-wise corrections for multiplicity and a generalization of the Dunn-Šidàk correction that relaxes the assumption of uniformly-distributed null p-values. The independence assumption is also relaxed and handled through calculating the effective number of independent tests. I also explicitly show the relationship between order statistics and family-wise correction procedures. This generalization may be applicable to multiplicity problems outside of genomics. PMID:27128491

  10. Genome-wide identification and characterization of R2R3MYB family in Rosaceae.

    PubMed

    González, Máximo; Carrasco, Basilio; Salazar, Erika

    2016-09-01

    Transcription factors R2R3MYB family have been associated with the control of secondary metabolites, development of structures, cold tolerance and response to biotic and abiotic stress, among others. In recent years, genomes of Rosaceae botanical family are available. Although this information has been used to study the karyotype evolution of these species from an ancestral genome, there are no studies that treat the evolution and diversity of gene families present in these species or in the botanical family. Here we present the first comparative study of the R2R3MYB subfamily of transcription factors in three species of Rosaceae family (Malus domestica, Prunus persica and Fragaria vesca). We described 186, 98 and 86 non-redundant gene models for apple, peach and strawberry, respectively. In this research, we analyzed the intron-exon structure and genomic distribution of R2R3MYB families mentioned above. The phylogenetic comparisons revealed putative functions of some R2R3MYB transcription factors. This analysis found 44 functional subgroups, seven of which were unique for Rosaceae. In addition, our results showed a highly collinearity among some genes revealing the existence of conserved gene models between the three species studied. Although some gene models in these species have been validated under several approaches, more research in the Rosaceae family is necessary to determine gene expression patterns in specific tissues and development stages to facilitate understanding of the regulatory and biochemical mechanism in this botanical family. PMID:27408811

  11. Cyclic nucleotide gated channel gene family in tomato: genome-wide identification and functional analyses in disease resistance

    PubMed Central

    Saand, Mumtaz A.; Xu, You-Ping; Li, Wen; Wang, Ji-Peng; Cai, Xin-Zhong

    2015-01-01

    The cyclic nucleotide gated channel (CNGC) is suggested to be one of the important calcium conducting channels. Nevertheless, genome-wide identification and systemic functional analysis of CNGC gene family in crop plant species have not yet been conducted. In this study, we performed genome-wide identification of CNGC gene family in the economically important crop tomato (Solanum lycopersicum L.) and analyzed function of the group IVb SlCNGC genes in disease resistance. Eighteen CNGC genes were identified in tomato genome, and four CNGC loci that were misannotated at database were corrected by cloning and sequencing. Detailed bioinformatics analyses on gene structure, domain composition and phylogenetic relationship of the SlCNGC gene family were conducted and the group-specific feature was revealed. Comprehensive expression analyses demonstrated that SlCNGC genes were highly, widely but differently responsive to diverse stimuli. Pharmacological assays showed that the putative CNGC activators cGMP and cAMP enhanced resistance against Sclerotinia sclerotiorum. Silencing of group IVb SlCNGC genes significantly enhanced resistance to fungal pathogens Pythium aphanidermatum and S. sclerotiorum, strongly reduced resistance to viral pathogen Tobacco rattle virus, while attenuated PAMP- and DAMP-triggered immunity as shown by obvious decrease of the flg22- and AtPep1-elicited hydrogen peroxide accumulation in SlCNGC-silenced plants. Additionally, silencing of these SlCNGC genes significantly altered expression of a set of Ca2+ signaling genes including SlCaMs, SlCDPKs, and SlCAMTA3. Collectively, our results reveal that group IV SlCNGC genes regulate a wide range of resistance in tomato probably by affecting Ca2+ signaling. PMID:25999969

  12. Genome-wide prediction and annotation of Burkholderia pseudomallei AraC/XylS family transcription regulator.

    PubMed

    Lim, Boon-San; Chong, Chan-Eng; Zamrod, Zulkeflie; Nathan, Sheila; Mohamed, Rahmah

    2007-01-01

    Many members of the AraC/XylS family transcription regulator have been proven to play a critical role in regulating bacterial virulence factors in response to environmental stress. By using the Hidden Markov Model (HMM) profile built from the alignment of a 99 amino acid conserved domain sequence of 273 AraC/XylS family transcription regulators, we detected a total of 45 AraC/XylS family transcription regulators in the genome of the Gram-negative pathogen, Burkholderia pseudomallei. Further in silico analysis of each detected AraC/XylS family transcription regulatory protein and its neighboring genes allowed us to make a first-order guess on the role of some of these transcription regulators in regulating important virulence factors such as those involved in three type III secretion systems and biosynthesis of pyochelin, exopolysaccharide (EPS) and phospholipase C. This paper has demonstrated an efficient and systematic genome-wide scale prediction of the AraC/XylS family that can be applied to other protein families. PMID:18391231

  13. Genome-Wide Identification, Characterization and Expression Analysis of the Chalcone Synthase Family in Maize

    PubMed Central

    Han, Yahui; Ding, Ting; Su, Bo; Jiang, Haiyang

    2016-01-01

    Members of the chalcone synthase (CHS) family participate in the synthesis of a series of secondary metabolites in plants, fungi and bacteria. The metabolites play important roles in protecting land plants against various environmental stresses during the evolutionary process. Our research was conducted on comprehensive investigation of CHS genes in maize (Zea mays L.), including their phylogenetic relationships, gene structures, chromosomal locations and expression analysis. Fourteen CHS genes (ZmCHS01–14) were identified in the genome of maize, representing one of the largest numbers of CHS family members identified in one organism to date. The gene family was classified into four major classes (classes I–IV) based on their phylogenetic relationships. Most of them contained two exons and one intron. The 14 genes were unevenly located on six chromosomes. Two segmental duplication events were identified, which might contribute to the expansion of the maize CHS gene family to some extent. In addition, quantitative real-time PCR and microarray data analyses suggested that ZmCHS genes exhibited various expression patterns, indicating functional diversification of the ZmCHS genes. Our results will contribute to future studies of the complexity of the CHS gene family in maize and provide valuable information for the systematic analysis of the functions of the CHS gene family. PMID:26828478

  14. Genome-Wide Identification, Characterization and Expression Analysis of the Chalcone Synthase Family in Maize.

    PubMed

    Han, Yahui; Ding, Ting; Su, Bo; Jiang, Haiyang

    2016-01-01

    Members of the chalcone synthase (CHS) family participate in the synthesis of a series of secondary metabolites in plants, fungi and bacteria. The metabolites play important roles in protecting land plants against various environmental stresses during the evolutionary process. Our research was conducted on comprehensive investigation of CHS genes in maize (Zea mays L.), including their phylogenetic relationships, gene structures, chromosomal locations and expression analysis. Fourteen CHS genes (ZmCHS01-14) were identified in the genome of maize, representing one of the largest numbers of CHS family members identified in one organism to date. The gene family was classified into four major classes (classes I-IV) based on their phylogenetic relationships. Most of them contained two exons and one intron. The 14 genes were unevenly located on six chromosomes. Two segmental duplication events were identified, which might contribute to the expansion of the maize CHS gene family to some extent. In addition, quantitative real-time PCR and microarray data analyses suggested that ZmCHS genes exhibited various expression patterns, indicating functional diversification of the ZmCHS genes. Our results will contribute to future studies of the complexity of the CHS gene family in maize and provide valuable information for the systematic analysis of the functions of the CHS gene family. PMID:26828478

  15. Genome-wide analysis of Aux/IAA and ARF gene families in Populus trichocarpa

    SciTech Connect

    Kalluri, Udaya C; DiFazio, Stephen P; Brunner, A.; Tuskan, Gerald A

    2007-01-01

    Auxin/Indole-3-Acetic Acid (Aux/IAA) and Auxin Response Factor (ARF) transcription factors are key regulators of auxin responses in plants. A total of 35 Aux/IAA and 39 ARF genes were identified in the Populus genome. Comparative phylogenetic analysis revealed that the subgroups PoptrARF2, 6, 9 and 16 and PoptrIAA3, 16, 27 and 29 have differentially expanded in Populus relative to Arabidopsis. Activator ARFs were found to be two fold-overrepresented in the Populus genome. PoptrIAA and PoptrARF gene families appear to have expanded due to high segmental and low tandem duplication events. Furthermore, expression studies showed that genes in the expanded PoptrIAA3 subgroup display differential expression. The gene-family analysis reported here will be useful in conducting future functional genomics studies to understand how the molecular roles of these large gene families translate into a diversity of biologically meaningful auxin effects.

  16. Variation in genome-wide mutation rates within and between human families.

    PubMed

    Conrad, Donald F; Keebler, Jonathan E M; DePristo, Mark A; Lindsay, Sarah J; Zhang, Yujun; Casals, Ferran; Idaghdour, Youssef; Hartl, Chris L; Torroja, Carlos; Garimella, Kiran V; Zilversmit, Martine; Cartwright, Reed; Rouleau, Guy A; Daly, Mark; Stone, Eric A; Hurles, Matthew E; Awadalla, Philip

    2011-07-01

    J.B.S. Haldane proposed in 1947 that the male germline may be more mutagenic than the female germline. Diverse studies have supported Haldane's contention of a higher average mutation rate in the male germline in a variety of mammals, including humans. Here we present, to our knowledge, the first direct comparative analysis of male and female germline mutation rates from the complete genome sequences of two parent-offspring trios. Through extensive validation, we identified 49 and 35 germline de novo mutations (DNMs) in two trio offspring, as well as 1,586 non-germline DNMs arising either somatically or in the cell lines from which the DNA was derived. Most strikingly, in one family, we observed that 92% of germline DNMs were from the paternal germline, whereas, in contrast, in the other family, 64% of DNMs were from the maternal germline. These observations suggest considerable variation in mutation rates within and between families. PMID:21666693

  17. Variation in genome-wide mutation rates within and between human families.

    PubMed

    Conrad, Donald F; Keebler, Jonathan E M; DePristo, Mark A; Lindsay, Sarah J; Zhang, Yujun; Casals, Ferran; Idaghdour, Youssef; Hartl, Chris L; Torroja, Carlos; Garimella, Kiran V; Zilversmit, Martine; Cartwright, Reed; Rouleau, Guy A; Daly, Mark; Stone, Eric A; Hurles, Matthew E; Awadalla, Philip

    2011-06-12

    J.B.S. Haldane proposed in 1947 that the male germline may be more mutagenic than the female germline. Diverse studies have supported Haldane's contention of a higher average mutation rate in the male germline in a variety of mammals, including humans. Here we present, to our knowledge, the first direct comparative analysis of male and female germline mutation rates from the complete genome sequences of two parent-offspring trios. Through extensive validation, we identified 49 and 35 germline de novo mutations (DNMs) in two trio offspring, as well as 1,586 non-germline DNMs arising either somatically or in the cell lines from which the DNA was derived. Most strikingly, in one family, we observed that 92% of germline DNMs were from the paternal germline, whereas, in contrast, in the other family, 64% of DNMs were from the maternal germline. These observations suggest considerable variation in mutation rates within and between families.

  18. Family-based genome-wide copy number scan identifies five new genes of dyslexia involved in dendritic spinal plasticity.

    PubMed

    Veerappa, Avinash M; Saldanha, Marita; Padakannaya, Prakash; Ramachandra, Nallur B

    2013-08-01

    Genome-wide screening for copy number variations (CNVs) in ten Indian dyslexic families revealed the presence of five de novo CNVs in regions harboring GABARAP, NEGR1, ACCN1, DCDC5, and one in already known candidate gene CNTNAP2. These genes are located on regions of chromosomes 17p13.1, 1p31.1, 17q11.21, 11p14.1 and 7q35, respectively, and are implicated in learning, cognition and memory processes through dendritic spinal plasticity, though not formally associated with dyslexia. Molecular network analysis of these and other dyslexia-related module genes suggests them to be associated with synaptic transmission, axon guidance and cell adhesion. Thus, we suggest that dyslexia may also be caused by neuronal disconnection in addition to the earlier view that it is due to neuronal migrational disorder.

  19. Genome-wide identification, characterization, and expression analysis of the MLO gene family in Cucumis sativus.

    PubMed

    Zhou, S J; Jing, Z; Shi, J L

    2013-12-11

    Mildew resistance locus o (MLO) is a plant-specific seven-transmembrane (TM) gene family. Several studies have revealed that certain members of the MLO gene family mediate powdery mildew susceptibility in three plant species, namely, Arabidopsis, barley, and tomato. The sequenced cucumber genome provides an opportunity to conduct a comprehensive overview of the MLO gene family. Fourteen genes (designated CsMLO01 through CsMLO14) have been identified within the Cucumis sativus genome by using an in silico cloning method with the MLO amino acid sequences of Arabidopsis thaliana and rice as probes. Sequence alignment revealed that numerous features of the gene family, such as TMs, a calmodulin-binding domain, peptide domains I and II, and 30 important amino acid residues for MLO function, are well conserved. Phylogenetic analysis of the MLO genes from cucumber and other plant species reveals seven different clades (I through VII). Three of these clades comprised MLO genes from A. thaliana, rice, maize, and cucumber, suggesting that these genes may have evolved after the divergence of monocots and dicots. In silico mapping showed that these CsMLOs were located on chromosomes 1, 2, 3, 4, 5, and 6 without any obvious clustering, except CsMLO01. To our knowledge, this paper is the first comprehensive report on MLO genes in C. sativus. These findings will facilitate the functional characterization of the MLOs related to powdery mildew susceptibility and assist in the development of disease resistance in cucumber.

  20. Genome-Wide Identification of the Invertase Gene Family in Populus

    PubMed Central

    Su, Xiaoxing; Rao, Pian; An, Xinmin

    2015-01-01

    Invertase plays a crucial role in carbohydrate partitioning and plant development as it catalyses the irreversible hydrolysis of sucrose into glucose and fructose. The invertase family in plants is composed of two sub-families: acid invertases, which are targeted to the cell wall and vacuole; and neutral/alkaline invertases, which function in the cytosol. In this study, 5 cell wall invertase genes (PtCWINV1-5), 3 vacuolar invertase genes (PtVINV1-3) and 16 neutral/alkaline invertase genes (PtNINV1-16) were identified in the Populus genome and found to be distributed on 14 chromosomes. A comprehensive analysis of poplar invertase genes was performed, including structures, chromosome location, phylogeny, evolutionary pattern and expression profiles. Phylogenetic analysis indicated that the two sub-families were both divided into two clades. Segmental duplication is contributed to neutral/alkaline sub-family expansion. Furthermore, the Populus invertase genes displayed differential expression in roots, stems, leaves, leaf buds and in response to salt/cold stress and pathogen infection. In addition, the analysis of enzyme activity and sugar content revealed that invertase genes play key roles in the sucrose metabolism of various tissues and organs in poplar. This work lays the foundation for future functional analysis of the invertase genes in Populus and other woody perennials. PMID:26393355

  1. Genome-Wide Identification of the Invertase Gene Family in Populus.

    PubMed

    Chen, Zhong; Gao, Kai; Su, Xiaoxing; Rao, Pian; An, Xinmin

    2015-01-01

    Invertase plays a crucial role in carbohydrate partitioning and plant development as it catalyses the irreversible hydrolysis of sucrose into glucose and fructose. The invertase family in plants is composed of two sub-families: acid invertases, which are targeted to the cell wall and vacuole; and neutral/alkaline invertases, which function in the cytosol. In this study, 5 cell wall invertase genes (PtCWINV1-5), 3 vacuolar invertase genes (PtVINV1-3) and 16 neutral/alkaline invertase genes (PtNINV1-16) were identified in the Populus genome and found to be distributed on 14 chromosomes. A comprehensive analysis of poplar invertase genes was performed, including structures, chromosome location, phylogeny, evolutionary pattern and expression profiles. Phylogenetic analysis indicated that the two sub-families were both divided into two clades. Segmental duplication is contributed to neutral/alkaline sub-family expansion. Furthermore, the Populus invertase genes displayed differential expression in roots, stems, leaves, leaf buds and in response to salt/cold stress and pathogen infection. In addition, the analysis of enzyme activity and sugar content revealed that invertase genes play key roles in the sucrose metabolism of various tissues and organs in poplar. This work lays the foundation for future functional analysis of the invertase genes in Populus and other woody perennials. PMID:26393355

  2. Genome-wide analysis of the MADS-box gene family in Brassica rapa (Chinese cabbage).

    PubMed

    Duan, Weike; Song, Xiaoming; Liu, Tongkun; Huang, Zhinan; Ren, Jun; Hou, Xilin; Li, Ying

    2015-02-01

    The MADS-box gene family is an ancient and well-studied transcription factor family that functions in almost every developmental process in plants. There are a number of reports about the MADS-box family in different plant species, but systematic analysis of the MADS-box transcription factor family in Brassica rapa (Chinese cabbage) is still lacking. In this study, 160 MADS-box transcription factors were identified from the entire Chinese cabbage genome and compared with the MADS-box factors from 21 other representative plant species. A detailed list of MADS proteins from these 22 species was sorted. Phylogenetic analysis of the BrMADS genes, together with their Arabidopsis and rice counterparts, showed that the BrMADS genes were categorised into type I (Mα, Mβ, Mγ) and type II (MIKC(C), MIKC*) groups, and the MIKC(C) proteins were further divided into 13 subfamilies. The Chinese cabbage type II group has 95 members, which is twice as much as the Arabidopsis type II group, indicating that the Chinese cabbage type II genes have been retained more frequently than the type I genes. Finally, RNA-seq transcriptome data and quantitative real-time PCR analysis revealed that BrMADS genes are expressed in a tissue-specific manner similar to Arabidopsis. Interestingly, a number of BrMIKC genes showed responses to different abiotic stress treatments, suggesting a function for some of the genes in these processes as well. Taken together, the characterization of the B. rapa MADS-box family presented here, will certainly help in the selection of appropriate candidate genes and further facilitate functional studies in Chinese cabbage.

  3. Genome-wide analysis of the MADS-box gene family in Brassica rapa (Chinese cabbage).

    PubMed

    Duan, Weike; Song, Xiaoming; Liu, Tongkun; Huang, Zhinan; Ren, Jun; Hou, Xilin; Li, Ying

    2015-02-01

    The MADS-box gene family is an ancient and well-studied transcription factor family that functions in almost every developmental process in plants. There are a number of reports about the MADS-box family in different plant species, but systematic analysis of the MADS-box transcription factor family in Brassica rapa (Chinese cabbage) is still lacking. In this study, 160 MADS-box transcription factors were identified from the entire Chinese cabbage genome and compared with the MADS-box factors from 21 other representative plant species. A detailed list of MADS proteins from these 22 species was sorted. Phylogenetic analysis of the BrMADS genes, together with their Arabidopsis and rice counterparts, showed that the BrMADS genes were categorised into type I (Mα, Mβ, Mγ) and type II (MIKC(C), MIKC*) groups, and the MIKC(C) proteins were further divided into 13 subfamilies. The Chinese cabbage type II group has 95 members, which is twice as much as the Arabidopsis type II group, indicating that the Chinese cabbage type II genes have been retained more frequently than the type I genes. Finally, RNA-seq transcriptome data and quantitative real-time PCR analysis revealed that BrMADS genes are expressed in a tissue-specific manner similar to Arabidopsis. Interestingly, a number of BrMIKC genes showed responses to different abiotic stress treatments, suggesting a function for some of the genes in these processes as well. Taken together, the characterization of the B. rapa MADS-box family presented here, will certainly help in the selection of appropriate candidate genes and further facilitate functional studies in Chinese cabbage. PMID:25216934

  4. Genome-Wide Identification, Phylogenetic and Expression Analyses of the Ubiquitin-Conjugating Enzyme Gene Family in Maize

    PubMed Central

    Jue, Dengwei; Sang, Xuelian; Lu, Shengqiao; Dong, Chen; Zhao, Qiufang; Chen, Hongliang; Jia, Liqiang

    2015-01-01

    Background Ubiquitination is a post-translation modification where ubiquitin is attached to a substrate. Ubiquitin-conjugating enzymes (E2s) play a major role in the ubiquitin transfer pathway, as well as a variety of functions in plant biological processes. To date, no genome-wide characterization of this gene family has been conducted in maize (Zea mays). Methodology/Principal Findings In the present study, a total of 75 putative ZmUBC genes have been identified and located in the maize genome. Phylogenetic analysis revealed that ZmUBC proteins could be divided into 15 subfamilies, which include 13 ubiquitin-conjugating enzymes (ZmE2s) and two independent ubiquitin-conjugating enzyme variant (UEV) groups. The predicted ZmUBC genes were distributed across 10 chromosomes at different densities. In addition, analysis of exon-intron junctions and sequence motifs in each candidate gene has revealed high levels of conservation within and between phylogenetic groups. Tissue expression analysis indicated that most ZmUBC genes were expressed in at least one of the tissues, indicating that these are involved in various physiological and developmental processes in maize. Moreover, expression profile analyses of ZmUBC genes under different stress treatments (4°C, 20% PEG6000, and 200 mM NaCl) and various expression patterns indicated that these may play crucial roles in the response of plants to stress. Conclusions Genome-wide identification, chromosome organization, gene structure, evolutionary and expression analyses of ZmUBC genes have facilitated in the characterization of this gene family, as well as determined its potential involvement in growth, development, and stress responses. This study provides valuable information for better understanding the classification and putative functions of the UBC-encoding genes of maize. PMID:26606743

  5. Genome-Wide Comparative Analysis of 20 Miniature Inverted-Repeat Transposable Element Families in Brassica rapa and B. oleracea

    PubMed Central

    Sampath, Perumal; Murukarthick, Jayakodi; Izzah, Nur Kholilatul; Lee, Jonghoon; Choi, Hong-Il; Shirasawa, Kenta; Choi, Beom-Soon; Liu, Shengyi; Nou, Ill-Sup; Yang, Tae-Jin

    2014-01-01

    Miniature inverted-repeat transposable elements (MITEs) are ubiquitous, non-autonomous class II transposable elements. Here, we conducted genome-wide comparative analysis of 20 MITE families in B. rapa, B. oleracea, and Arabidopsis thaliana. A total of 5894 and 6026 MITE members belonging to the 20 families were found in the whole genome pseudo-chromosome sequences of B. rapa and B. oleracea, respectively. Meanwhile, only four of the 20 families, comprising 573 members, were identified in the Arabidopsis genome, indicating that most of the families were activated in the Brassica genus after divergence from Arabidopsis. Copy numbers varied from 4 to 1459 for each MITE family, and there was up to 6-fold variation between B. rapa and B. oleracea. In particular, analysis of intact members showed that whereas eleven families were present in similar copy numbers in B. rapa and B. oleracea, nine families showed copy number variation ranging from 2- to 16-fold. Four of those families (BraSto-3, BraTo-3, 4, 5) were more abundant in B. rapa, and the other five (BraSto-1, BraSto-4, BraTo-1, 7 and BraHAT-1) were more abundant in B. oleracea. Overall, 54% and 51% of the MITEs resided in or within 2 kb of a gene in the B. rapa and B. oleracea genomes, respectively. Notably, 92 MITEs were found within the CDS of annotated genes, suggesting that MITEs might play roles in diversification of genes in the recently triplicated Brassica genome. MITE insertion polymorphism (MIP) analysis of 289 MITE members showed that 52% and 23% were polymorphic at the inter- and intra-species levels, respectively, indicating that there has been recent MITE activity in the Brassica genome. These recently activated MITE families with abundant MIP will provide useful resources for molecular breeding and identification of novel functional genes arising from MITE insertion. PMID:24747717

  6. Genome-wide comparative analysis of 20 miniature inverted-repeat transposable element families in Brassica rapa and B. oleracea.

    PubMed

    Sampath, Perumal; Murukarthick, Jayakodi; Izzah, Nur Kholilatul; Lee, Jonghoon; Choi, Hong-Il; Shirasawa, Kenta; Choi, Beom-Soon; Liu, Shengyi; Nou, Ill-Sup; Yang, Tae-Jin

    2014-01-01

    Miniature inverted-repeat transposable elements (MITEs) are ubiquitous, non-autonomous class II transposable elements. Here, we conducted genome-wide comparative analysis of 20 MITE families in B. rapa, B. oleracea, and Arabidopsis thaliana. A total of 5894 and 6026 MITE members belonging to the 20 families were found in the whole genome pseudo-chromosome sequences of B. rapa and B. oleracea, respectively. Meanwhile, only four of the 20 families, comprising 573 members, were identified in the Arabidopsis genome, indicating that most of the families were activated in the Brassica genus after divergence from Arabidopsis. Copy numbers varied from 4 to 1459 for each MITE family, and there was up to 6-fold variation between B. rapa and B. oleracea. In particular, analysis of intact members showed that whereas eleven families were present in similar copy numbers in B. rapa and B. oleracea, nine families showed copy number variation ranging from 2- to 16-fold. Four of those families (BraSto-3, BraTo-3, 4, 5) were more abundant in B. rapa, and the other five (BraSto-1, BraSto-4, BraTo-1, 7 and BraHAT-1) were more abundant in B. oleracea. Overall, 54% and 51% of the MITEs resided in or within 2 kb of a gene in the B. rapa and B. oleracea genomes, respectively. Notably, 92 MITEs were found within the CDS of annotated genes, suggesting that MITEs might play roles in diversification of genes in the recently triplicated Brassica genome. MITE insertion polymorphism (MIP) analysis of 289 MITE members showed that 52% and 23% were polymorphic at the inter- and intra-species levels, respectively, indicating that there has been recent MITE activity in the Brassica genome. These recently activated MITE families with abundant MIP will provide useful resources for molecular breeding and identification of novel functional genes arising from MITE insertion.

  7. Genome-wide comparative analysis of 20 miniature inverted-repeat transposable element families in Brassica rapa and B. oleracea.

    PubMed

    Sampath, Perumal; Murukarthick, Jayakodi; Izzah, Nur Kholilatul; Lee, Jonghoon; Choi, Hong-Il; Shirasawa, Kenta; Choi, Beom-Soon; Liu, Shengyi; Nou, Ill-Sup; Yang, Tae-Jin

    2014-01-01

    Miniature inverted-repeat transposable elements (MITEs) are ubiquitous, non-autonomous class II transposable elements. Here, we conducted genome-wide comparative analysis of 20 MITE families in B. rapa, B. oleracea, and Arabidopsis thaliana. A total of 5894 and 6026 MITE members belonging to the 20 families were found in the whole genome pseudo-chromosome sequences of B. rapa and B. oleracea, respectively. Meanwhile, only four of the 20 families, comprising 573 members, were identified in the Arabidopsis genome, indicating that most of the families were activated in the Brassica genus after divergence from Arabidopsis. Copy numbers varied from 4 to 1459 for each MITE family, and there was up to 6-fold variation between B. rapa and B. oleracea. In particular, analysis of intact members showed that whereas eleven families were present in similar copy numbers in B. rapa and B. oleracea, nine families showed copy number variation ranging from 2- to 16-fold. Four of those families (BraSto-3, BraTo-3, 4, 5) were more abundant in B. rapa, and the other five (BraSto-1, BraSto-4, BraTo-1, 7 and BraHAT-1) were more abundant in B. oleracea. Overall, 54% and 51% of the MITEs resided in or within 2 kb of a gene in the B. rapa and B. oleracea genomes, respectively. Notably, 92 MITEs were found within the CDS of annotated genes, suggesting that MITEs might play roles in diversification of genes in the recently triplicated Brassica genome. MITE insertion polymorphism (MIP) analysis of 289 MITE members showed that 52% and 23% were polymorphic at the inter- and intra-species levels, respectively, indicating that there has been recent MITE activity in the Brassica genome. These recently activated MITE families with abundant MIP will provide useful resources for molecular breeding and identification of novel functional genes arising from MITE insertion. PMID:24747717

  8. Genome-wide analysis of the GRAS gene family in Chinese cabbage (Brassica rapa ssp. pekinensis).

    PubMed

    Song, Xiao-Ming; Liu, Tong-Kun; Duan, Wei-Ke; Ma, Qing-Hua; Ren, Jun; Wang, Zhen; Li, Ying; Hou, Xi-Lin

    2014-01-01

    The GRAS gene family is one of the most important families of transcriptional regulators. In this study, 48 GRAS genes are identified from Chinese cabbage, and they are classified into eight groups according to the classification of Arabidopsis. The characterization, classification, gene structure and phylogenetic construction of GRAS proteins are performed. Distribution mapping shows that GRAS proteins are nonrandomly localized in 10 chromosomes. Fifty-five orthologous gene pairs are shared by Chinese cabbage and Arabidopsis, and interaction networks of these orthologous genes are constructed. The expansion of GRAS genes in Chinese cabbage results from genome triplication. Among the 17 species examined, 14 higher plants carry the GRAS genes, whereas two lower plants and one fungi species do not. Furthermore, the expression patterns of GRAS genes exhibit differences in three tissues based on RNA-seq data. Taken together, this comprehensive analysis will provide rich resources for studying GRAS protein functions in Chinese cabbage.

  9. Genome-wide identification and analysis of MAPK and MAPKK gene families in Brachypodium distachyon.

    PubMed

    Chen, Lihong; Hu, Wei; Tan, Shenglong; Wang, Min; Ma, Zhanbing; Zhou, Shiyi; Deng, Xiaomin; Zhang, Yang; Huang, Chao; Yang, Guangxiao; He, Guangyuan

    2012-01-01

    MAPK cascades are universal signal transduction modules and play important roles in plant growth, development and in response to a variety of biotic and abiotic stresses. Although MAPKs and MAPKKs have been systematically investigated in several plant species including Arabidopsis, rice and poplar, no systematic analysis has been conducted in the emerging monocot model plant Brachypodium distachyon. In the present study, a total of 16 MAPK genes and 12 MAPKK genes were identified from B. distachyon. An analysis of the genomic evolution showed that both tandem and segment duplications contributed significantly to the expansion of MAPK and MAPKK families. Evolutionary relationships within subfamilies were supported by exon-intron organizations and the architectures of conserved protein motifs. Synteny analysis between B. distachyon and the other two plant species of rice and Arabidopsis showed that only one homolog of B. distachyon MAPKs was found in the corresponding syntenic blocks of Arabidopsis, while 13 homologs of B. distachyon MAPKs and MAPKKs were found in that of rice, which was consistent with the speciation process of the three species. In addition, several interactive protein pairs between the two families in B. distachyon were found through yeast two hybrid assay, whereas their orthologs of a pair in Arabidopsis and other plant species were not found to interact with each other. Finally, expression studies of closely related family members among B. distachyon, Arabidopsis and rice showed that even recently duplicated representatives may fulfill different functions and be involved in different signal pathways. Taken together, our data would provide a foundation for evolutionary and functional characterization of MAPK and MAPKK gene families in B. distachyon and other plant species to unravel their biological roles.

  10. [Genome-wide identification and bioinformatic analysis of PPR gene family in tomato].

    PubMed

    Ding, Anming; Li, Ling; Qu, Xu; Sun, Tingting; Chen, Yaqiong; Zong, Peng; Li, Zunqiang; Gong, Daping; Sun, Yuhe

    2014-01-01

    Pentatricopeptide repeats (PPRs) genes constitute one of the largest gene families in plants, which play a broad and essential role in plant growth and development. In this study, the protein sequences annotated by the tomato (S. lycopersicum L.) genome project were screened with the Pfam PPR sequences. A total of 471 putative PPR-encoding genes were identified. Based on the motifs defined in A. thaliana L., protein structure and conserved sequences for each tomato motif were analyzed. We also analyzed phylogenetic relationship, subcellular localization, expression and GO analysis of the identified gene sequences. Our results demonstrate that tomato PPR gene family contains two subfamilies, P and PLS, each accounting for half of the family. PLS subfamily can be divided into four subclasses i.e., PLS, E, E+ and DYW. Each subclass of sequences forms a clade in the phylogenetic tree. The PPR motifs were found highly conserved among plants. The tomato PPR genes were distributed over 12 chromosomes and most of them lack introns. The majority of PPR proteins harbor mitochondrial or chloroplast localization sequences, whereas GO analysis showed that most PPR proteins participate in RNA-related biological processes.

  11. Genome-wide analysis of the AP2/ERF family in Musa species reveals divergence and neofunctionalisation during evolution.

    PubMed

    Lakhwani, Deepika; Pandey, Ashutosh; Dhar, Yogeshwar Vikram; Bag, Sumit Kumar; Trivedi, Prabodh Kumar; Asif, Mehar Hasan

    2016-01-01

    AP2/ERF domain containing transcription factor super family is one of the important regulators in the plant kingdom. The involvement of AP2/ERF family members has been elucidated in various processes associated with plant growth, development as well as in response to hormones, biotic and abiotic stresses. In this study, we carried out genome-wide analysis to identify members of AP2/ERF family in Musa acuminata (A genome) and Musa balbisiana (B genome) and changes leading to neofunctionalisation of genes. Analysis identified 265 and 318 AP2/ERF encoding genes in M. acuminata and M. balbisiana respectively which were further classified into ERF, DREB, AP2, RAV and Soloist groups. Comparative analysis indicated that AP2/ERF family has undergone duplication, loss and divergence during evolution and speciation of the Musa A and B genomes. We identified nine genes which are up-regulated during fruit ripening and might be components of the regulatory machinery operating during ethylene-dependent ripening in banana. Tissue-specific expression analysis of the genes suggests that different regulatory mechanisms might be involved in peel and pulp ripening process through recruiting specific ERFs in these tissues. Analysis also suggests that MaRAV-6 and MaERF026 have structurally diverged from their M. balbisiana counterparts and have attained new functions during ripening. PMID:26733055

  12. Genome-wide analysis of the AP2/ERF family in Musa species reveals divergence and neofunctionalisation during evolution

    PubMed Central

    Lakhwani, Deepika; Pandey, Ashutosh; Dhar, Yogeshwar Vikram; Bag, Sumit Kumar; Trivedi, Prabodh Kumar; Asif, Mehar Hasan

    2016-01-01

    AP2/ERF domain containing transcription factor super family is one of the important regulators in the plant kingdom. The involvement of AP2/ERF family members has been elucidated in various processes associated with plant growth, development as well as in response to hormones, biotic and abiotic stresses. In this study, we carried out genome-wide analysis to identify members of AP2/ERF family in Musa acuminata (A genome) and Musa balbisiana (B genome) and changes leading to neofunctionalisation of genes. Analysis identified 265 and 318 AP2/ERF encoding genes in M. acuminata and M. balbisiana respectively which were further classified into ERF, DREB, AP2, RAV and Soloist groups. Comparative analysis indicated that AP2/ERF family has undergone duplication, loss and divergence during evolution and speciation of the Musa A and B genomes. We identified nine genes which are up-regulated during fruit ripening and might be components of the regulatory machinery operating during ethylene-dependent ripening in banana. Tissue-specific expression analysis of the genes suggests that different regulatory mechanisms might be involved in peel and pulp ripening process through recruiting specific ERFs in these tissues. Analysis also suggests that MaRAV-6 and MaERF026 have structurally diverged from their M. balbisiana counterparts and have attained new functions during ripening. PMID:26733055

  13. Genome-wide identification and expression profiling of the SnRK2 gene family in Malus prunifolia.

    PubMed

    Shao, Yun; Qin, Yuan; Zou, Yangjun; Ma, Fengwang

    2014-11-15

    Sucrose non-fermenting-1-related protein kinase 2 (SnRK2) constitutes a small plant-specific serine/threonine kinase family with essential roles in the abscisic acid (ABA) signal pathway and in responses to osmotic stress. Although a genome-wide analysis of this family has been conducted in some species, little is known about SnRK2 genes in apple (Malus domestica). We identified 14 putative sequences encoding 12 deduced SnRK2 proteins within the apple genome. Gene chromosomal location and synteny analysis of the apple SnRK2 genes indicated that tandem and segmental duplications have likely contributed to the expansion and evolution of these genes. All 12 full-length coding sequences were confirmed by cloning from Malus prunifolia. The gene structure and motif compositions of the apple SnRK2 genes were analyzed. Phylogenetic analysis showed that MpSnRK2s could be classified into four groups. Profiling of these genes presented differential patterns of expression in various tissues. Under stress conditions, transcript levels for some family members were up-regulated in the leaves in response to drought, salinity, or ABA treatments. This suggested their possible roles in plant response to abiotic stress. Our findings provide essential information about SnRK2 genes in apple and will contribute to further functional dissection of this gene family.

  14. Genome-Wide Identification and Expression Analysis of the WRKY Gene Family in Cassava

    PubMed Central

    Wei, Yunxie; Shi, Haitao; Xia, Zhiqiang; Tie, Weiwei; Ding, Zehong; Yan, Yan; Wang, Wenquan; Hu, Wei; Li, Kaimian

    2016-01-01

    The WRKY family, a large family of transcription factors (TFs) found in higher plants, plays central roles in many aspects of physiological processes and adaption to environment. However, little information is available regarding the WRKY family in cassava (Manihot esculenta). In the present study, 85 WRKY genes were identified from the cassava genome and classified into three groups according to conserved WRKY domains and zinc-finger structure. Conserved motif analysis showed that all of the identified MeWRKYs had the conserved WRKY domain. Gene structure analysis suggested that the number of introns in MeWRKY genes varied from 1 to 5, with the majority of MeWRKY genes containing three exons. Expression profiles of MeWRKY genes in different tissues and in response to drought stress were analyzed using the RNA-seq technique. The results showed that 72 MeWRKY genes had differential expression in their transcript abundance and 78 MeWRKY genes were differentially expressed in response to drought stresses in different accessions, indicating their contribution to plant developmental processes and drought stress resistance in cassava. Finally, the expression of 9 WRKY genes was analyzed by qRT-PCR under osmotic, salt, ABA, H2O2, and cold treatments, indicating that MeWRKYs may be involved in different signaling pathways. Taken together, this systematic analysis identifies some tissue-specific and abiotic stress-responsive candidate MeWRKY genes for further functional assays in planta, and provides a solid foundation for understanding of abiotic stress responses and signal transduction mediated by WRKYs in cassava. PMID:26904033

  15. Genome-Wide Identification and Expression Analysis of the WRKY Gene Family in Cassava.

    PubMed

    Wei, Yunxie; Shi, Haitao; Xia, Zhiqiang; Tie, Weiwei; Ding, Zehong; Yan, Yan; Wang, Wenquan; Hu, Wei; Li, Kaimian

    2016-01-01

    The WRKY family, a large family of transcription factors (TFs) found in higher plants, plays central roles in many aspects of physiological processes and adaption to environment. However, little information is available regarding the WRKY family in cassava (Manihot esculenta). In the present study, 85 WRKY genes were identified from the cassava genome and classified into three groups according to conserved WRKY domains and zinc-finger structure. Conserved motif analysis showed that all of the identified MeWRKYs had the conserved WRKY domain. Gene structure analysis suggested that the number of introns in MeWRKY genes varied from 1 to 5, with the majority of MeWRKY genes containing three exons. Expression profiles of MeWRKY genes in different tissues and in response to drought stress were analyzed using the RNA-seq technique. The results showed that 72 MeWRKY genes had differential expression in their transcript abundance and 78 MeWRKY genes were differentially expressed in response to drought stresses in different accessions, indicating their contribution to plant developmental processes and drought stress resistance in cassava. Finally, the expression of 9 WRKY genes was analyzed by qRT-PCR under osmotic, salt, ABA, H2O2, and cold treatments, indicating that MeWRKYs may be involved in different signaling pathways. Taken together, this systematic analysis identifies some tissue-specific and abiotic stress-responsive candidate MeWRKY genes for further functional assays in planta, and provides a solid foundation for understanding of abiotic stress responses and signal transduction mediated by WRKYs in cassava. PMID:26904033

  16. [Genome-wide identification and expression analysis of the WRKY gene family in peach].

    PubMed

    Yanbing, Gu; Zhirui, Ji; Fumei, Chi; Zhuang, Qiao; Chengnan, Xu; Junxiang, Zhang; Zongshan, Zhou; Qinglong, Dong

    2016-03-01

    The WRKY transcription factors are one of the largest families of transcriptional regulators and play diverse regulatory roles in biotic and abiotic stresses, plant growth and development processes. In this study, the WRKY DNA-binding domain (Pfam Database number: PF03106) downloaded from Pfam protein families database was exploited to identify WRKY genes from the peach (Prunus persica 'Lovell') genome using HMMER 3.0. The obtained amino acid sequences were analyzed with DNAMAN 5.0, WebLogo 3, MEGA 5.1, MapInspect and MEME bioinformatics softwares. Totally 61 peach WRKY genes were found in the peach genome. Our phylogenetic analysis revealed that peach WRKY genes were classified into three Groups: Ⅰ, Ⅱ and Ⅲ. The WRKY N-terminal and C-terminal domains of Group Ⅰ (group I-N and group I-C) were monophyletic. The Group Ⅱ was sub-divided into five distinct clades (groupⅡ-a, Ⅱ-b, Ⅱ-c, Ⅱ-d and Ⅱ-e). Our domain analysis indicated that the WRKY regions contained a highly conserved heptapeptide stretch WRKYGQK at its N-terminus followed by a zinc-finger motif. The chromosome mapping analysis showed that peach WRKY genes were distributed with different densities over 8 chromosomes. The intron-exon structure analysis revealed that structures of the WRKY gene were highly conserved in the peach. The conserved motif analysis showed that the conserved motifs 1, 2 and 3, which specify the WRKY domain, were observed in all peach WRKY proteins, motif 5 as the unknown domain was observed in group Ⅱ-d, two WRKY domains were assigned to GroupⅠ. SqRT-PCR and qRT-PCR results indicated that 16 PpWRKY genes were expressed in roots, stems, leaves, flowers and fruits at various expression levels. Our analysis thus identified the PpWRKY gene families, and future functional studies are needed to reveal its specific roles. PMID:27001479

  17. Genome-Wide Identification and Expression Analysis of the WRKY Gene Family in Cassava.

    PubMed

    Wei, Yunxie; Shi, Haitao; Xia, Zhiqiang; Tie, Weiwei; Ding, Zehong; Yan, Yan; Wang, Wenquan; Hu, Wei; Li, Kaimian

    2016-01-01

    The WRKY family, a large family of transcription factors (TFs) found in higher plants, plays central roles in many aspects of physiological processes and adaption to environment. However, little information is available regarding the WRKY family in cassava (Manihot esculenta). In the present study, 85 WRKY genes were identified from the cassava genome and classified into three groups according to conserved WRKY domains and zinc-finger structure. Conserved motif analysis showed that all of the identified MeWRKYs had the conserved WRKY domain. Gene structure analysis suggested that the number of introns in MeWRKY genes varied from 1 to 5, with the majority of MeWRKY genes containing three exons. Expression profiles of MeWRKY genes in different tissues and in response to drought stress were analyzed using the RNA-seq technique. The results showed that 72 MeWRKY genes had differential expression in their transcript abundance and 78 MeWRKY genes were differentially expressed in response to drought stresses in different accessions, indicating their contribution to plant developmental processes and drought stress resistance in cassava. Finally, the expression of 9 WRKY genes was analyzed by qRT-PCR under osmotic, salt, ABA, H2O2, and cold treatments, indicating that MeWRKYs may be involved in different signaling pathways. Taken together, this systematic analysis identifies some tissue-specific and abiotic stress-responsive candidate MeWRKY genes for further functional assays in planta, and provides a solid foundation for understanding of abiotic stress responses and signal transduction mediated by WRKYs in cassava.

  18. [Genome-wide identification and expression analysis of the WRKY gene family in peach].

    PubMed

    Yanbing, Gu; Zhirui, Ji; Fumei, Chi; Zhuang, Qiao; Chengnan, Xu; Junxiang, Zhang; Zongshan, Zhou; Qinglong, Dong

    2016-03-01

    The WRKY transcription factors are one of the largest families of transcriptional regulators and play diverse regulatory roles in biotic and abiotic stresses, plant growth and development processes. In this study, the WRKY DNA-binding domain (Pfam Database number: PF03106) downloaded from Pfam protein families database was exploited to identify WRKY genes from the peach (Prunus persica 'Lovell') genome using HMMER 3.0. The obtained amino acid sequences were analyzed with DNAMAN 5.0, WebLogo 3, MEGA 5.1, MapInspect and MEME bioinformatics softwares. Totally 61 peach WRKY genes were found in the peach genome. Our phylogenetic analysis revealed that peach WRKY genes were classified into three Groups: Ⅰ, Ⅱ and Ⅲ. The WRKY N-terminal and C-terminal domains of Group Ⅰ (group I-N and group I-C) were monophyletic. The Group Ⅱ was sub-divided into five distinct clades (groupⅡ-a, Ⅱ-b, Ⅱ-c, Ⅱ-d and Ⅱ-e). Our domain analysis indicated that the WRKY regions contained a highly conserved heptapeptide stretch WRKYGQK at its N-terminus followed by a zinc-finger motif. The chromosome mapping analysis showed that peach WRKY genes were distributed with different densities over 8 chromosomes. The intron-exon structure analysis revealed that structures of the WRKY gene were highly conserved in the peach. The conserved motif analysis showed that the conserved motifs 1, 2 and 3, which specify the WRKY domain, were observed in all peach WRKY proteins, motif 5 as the unknown domain was observed in group Ⅱ-d, two WRKY domains were assigned to GroupⅠ. SqRT-PCR and qRT-PCR results indicated that 16 PpWRKY genes were expressed in roots, stems, leaves, flowers and fruits at various expression levels. Our analysis thus identified the PpWRKY gene families, and future functional studies are needed to reveal its specific roles.

  19. Genome-wide analysis of the GRAS gene family in physic nut (Jatropha curcas L.).

    PubMed

    Wu, Z Y; Wu, P Z; Chen, Y P; Li, M R; Wu, G J; Jiang, H W

    2015-01-01

    GRAS proteins play vital roles in plant growth and development. Physic nut (Jatropha curcas L.) was found to have a total of 48 GRAS family members (JcGRAS), 15 more than those found in Arabidopsis. The JcGRAS genes were divided into 12 subfamilies or 15 ancient monophyletic lineages based on the phylogenetic analysis of GRAS proteins from both flowering and lower plants. The functions of GRAS genes in 9 subfamilies have been reported previously for several plants, while the genes in the remaining 3 subfamilies were of unknown function; we named the latter families U1 to U3. No member of U3 subfamily is present in Arabidopsis and Poaceae species according to public genome sequence data. In comparison with the number of GRAS genes in Arabidopsis, more were detected in physic nut, resulting from the retention of many ancient GRAS subfamilies and the formation of tandem repeats during evolution. No evidence of recent duplication among JcGRAS genes was observed in physic nut. Based on digital gene expression data, 21 of the 48 genes exhibited differential expression in four tissues analyzed. Two members of subfamily U3 were expressed only in buds and flowers, implying that they may play specific roles. Our results provide valuable resources for future studies on the functions of GRAS proteins in physic nut. PMID:26782574

  20. Basic leucine zipper family in barley: genome-wide characterization of members and expression analysis.

    PubMed

    Pourabed, Ehsan; Ghane Golmohamadi, Farzan; Soleymani Monfared, Peyman; Razavi, Seyed Morteza; Shobbar, Zahra-Sadat

    2015-01-01

    The basic leucine zipper (bZIP) family is one of the largest and most diverse transcription factors in eukaryotes participating in many essential plant processes. We identified 141 bZIP proteins encoded by 89 genes from the Hordeum vulgare genome. HvbZIPs were classified into 11 groups based on their DNA-binding motif. Amino acid sequence alignment of the HvbZIPs basic-hinge regions revealed some highly conserved residues within each group. The leucine zipper heptads were analyzed predicting their dimerization properties. 34 conserved motifs were identified outside the bZIP domain. Phylogenetic analysis indicated that major diversification within the bZIP family predated the monocot/dicot divergence, although intra-species duplication and parallel evolution seems to be occurred afterward. Localization of HvbZIPs on the barley chromosomes revealed that different groups have been distributed on seven chromosomes of barley. Six types of intron pattern were detected within the basic-hinge regions. Most of the detected cis-elements in the promoter and UTR sequences were involved in seed development or abiotic stress response. Microarray data analysis revealed differential expression pattern of HvbZIPs in response to ABA treatment, drought, and cold stresses and during barley grain development and germination. This information would be helpful for functional characterization of bZIP transcription factors in barley.

  1. Genome-wide analysis of the WRKY gene family in physic nut (Jatropha curcas L.).

    PubMed

    Xiong, Wangdan; Xu, Xueqin; Zhang, Lin; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Jiang, Huawu; Wu, Guojiang

    2013-07-25

    The WRKY proteins, which contain highly conserved WRKYGQK amino acid sequences and zinc-finger-like motifs, constitute a large family of transcription factors in plants. They participate in diverse physiological and developmental processes. WRKY genes have been identified and characterized in a number of plant species. We identified a total of 58 WRKY genes (JcWRKY) in the genome of the physic nut (Jatropha curcas L.). On the basis of their conserved WRKY domain sequences, all of the JcWRKY proteins could be assigned to one of the previously defined groups, I-III. Phylogenetic analysis of JcWRKY genes with Arabidopsis and rice WRKY genes, and separately with castor bean WRKY genes, revealed no evidence of recent gene duplication in JcWRKY gene family. Analysis of transcript abundance of JcWRKY gene products were tested in different tissues under normal growth condition. In addition, 47 WRKY genes responded to at least one abiotic stress (drought, salinity, phosphate starvation and nitrogen starvation) in individual tissues (leaf, root and/or shoot cortex). Our study provides a useful reference data set as the basis for cloning and functional analysis of physic nut WRKY genes. PMID:23644253

  2. Basic leucine zipper family in barley: genome-wide characterization of members and expression analysis.

    PubMed

    Pourabed, Ehsan; Ghane Golmohamadi, Farzan; Soleymani Monfared, Peyman; Razavi, Seyed Morteza; Shobbar, Zahra-Sadat

    2015-01-01

    The basic leucine zipper (bZIP) family is one of the largest and most diverse transcription factors in eukaryotes participating in many essential plant processes. We identified 141 bZIP proteins encoded by 89 genes from the Hordeum vulgare genome. HvbZIPs were classified into 11 groups based on their DNA-binding motif. Amino acid sequence alignment of the HvbZIPs basic-hinge regions revealed some highly conserved residues within each group. The leucine zipper heptads were analyzed predicting their dimerization properties. 34 conserved motifs were identified outside the bZIP domain. Phylogenetic analysis indicated that major diversification within the bZIP family predated the monocot/dicot divergence, although intra-species duplication and parallel evolution seems to be occurred afterward. Localization of HvbZIPs on the barley chromosomes revealed that different groups have been distributed on seven chromosomes of barley. Six types of intron pattern were detected within the basic-hinge regions. Most of the detected cis-elements in the promoter and UTR sequences were involved in seed development or abiotic stress response. Microarray data analysis revealed differential expression pattern of HvbZIPs in response to ABA treatment, drought, and cold stresses and during barley grain development and germination. This information would be helpful for functional characterization of bZIP transcription factors in barley. PMID:25173685

  3. Genome-Wide Identification, Evolution and Expression Analysis of mTERF Gene Family in Maize

    PubMed Central

    Zhao, Yanxin; Cai, Manjun; Zhang, Xiaobo; Li, Yurong; Zhang, Jianhua; Zhao, Hailiang; Kong, Fei; Zheng, Yonglian; Qiu, Fazhan

    2014-01-01

    Plant mitochondrial transcription termination factor (mTERF) genes comprise a large family with important roles in regulating organelle gene expression. In this study, a comprehensive database search yielded 31 potential mTERF genes in maize (Zea mays L.) and most of them were targeted to mitochondria or chloroplasts. Maize mTERF were divided into nine main groups based on phylogenetic analysis, and group IX represented the mitochondria and species-specific clade that diverged from other groups. Tandem and segmental duplication both contributed to the expansion of the mTERF gene family in the maize genome. Comprehensive expression analysis of these genes, using microarray data and RNA-seq data, revealed that these genes exhibit a variety of expression patterns. Environmental stimulus experiments revealed differential up or down-regulation expression of maize mTERF genes in seedlings exposed to light/dark, salts and plant hormones, respectively, suggesting various important roles of maize mTERF genes in light acclimation and stress-related responses. These results will be useful for elucidating the roles of mTERF genes in the growth, development and stress response of maize. PMID:24718683

  4. Comparative analysis of genome-wide Mlo gene family in Cajanus cajan and Phaseolus vulgaris.

    PubMed

    Deshmukh, Reena; Singh, V K; Singh, B D

    2016-04-01

    The Mlo gene was discovered in barley because the mutant 'mlo' allele conferred broad-spectrum, non-race-specific resistance to powdery mildew caused by Blumeria graminis f. sp. hordei. The Mlo genes also play important roles in growth and development of plants, and in responses to biotic and abiotic stresses. The Mlo gene family has been characterized in several crop species, but only a single legume species, soybean (Glycine max L.), has been investigated so far. The present report describes in silico identification of 18 CcMlo and 20 PvMlo genes in the important legume crops Cajanus cajan (L.) Millsp. and Phaseolus vulgaris L., respectively. In silico analysis of gene organization, protein properties and conserved domains revealed that the C. cajan and P. vulgaris Mlo gene paralogs are more divergent from each other than from their orthologous pairs. The comparative phylogenetic analysis classified CcMlo and PvMlo genes into three major clades. A comparative analysis of CcMlo and PvMlo proteins with the G. max Mlo proteins indicated close association of one CcMlo, one PvMlo with two GmMlo genes, indicating that there was no further expansion of the Mlo gene family after the separation of these species. Thus, most of the diploid species of eudicots might be expected to contain 15-20 Mlo genes. The genes CcMlo12 and 14, and PvMlo11 and 12 are predicted to participate in powdery mildew resistance. If this prediction were verified, these genes could be targeted by TILLING or CRISPR to isolate powdery mildew resistant mutants. PMID:26961357

  5. Comparative analysis of genome-wide Mlo gene family in Cajanus cajan and Phaseolus vulgaris.

    PubMed

    Deshmukh, Reena; Singh, V K; Singh, B D

    2016-04-01

    The Mlo gene was discovered in barley because the mutant 'mlo' allele conferred broad-spectrum, non-race-specific resistance to powdery mildew caused by Blumeria graminis f. sp. hordei. The Mlo genes also play important roles in growth and development of plants, and in responses to biotic and abiotic stresses. The Mlo gene family has been characterized in several crop species, but only a single legume species, soybean (Glycine max L.), has been investigated so far. The present report describes in silico identification of 18 CcMlo and 20 PvMlo genes in the important legume crops Cajanus cajan (L.) Millsp. and Phaseolus vulgaris L., respectively. In silico analysis of gene organization, protein properties and conserved domains revealed that the C. cajan and P. vulgaris Mlo gene paralogs are more divergent from each other than from their orthologous pairs. The comparative phylogenetic analysis classified CcMlo and PvMlo genes into three major clades. A comparative analysis of CcMlo and PvMlo proteins with the G. max Mlo proteins indicated close association of one CcMlo, one PvMlo with two GmMlo genes, indicating that there was no further expansion of the Mlo gene family after the separation of these species. Thus, most of the diploid species of eudicots might be expected to contain 15-20 Mlo genes. The genes CcMlo12 and 14, and PvMlo11 and 12 are predicted to participate in powdery mildew resistance. If this prediction were verified, these genes could be targeted by TILLING or CRISPR to isolate powdery mildew resistant mutants.

  6. Genome-wide identification, classification, and analysis of NADP-ME family members from 12 crucifer species.

    PubMed

    Tao, Peng; Guo, Weiling; Li, Biyuan; Wang, Wuhong; Yue, Zhichen; Lei, Juanli; Zhao, Yanting; Zhong, Xinmin

    2016-06-01

    NADP-dependent malic enzymes (NADP-MEs) play essential roles in both normal development and stress responses in plants. Here, genome-wide analysis was performed to identify 65 putative NADP-ME genes from 12 crucifer species. These NADP-ME genes were grouped into five categories of syntenic orthologous genes and were divided into three clades of a phylogenic tree. Promoter motif analysis showed that NADP-ME1 genes in Group IV were more conserved with each other than the other NADP-ME genes in Groups I and II. A nucleotide motif involved in ABA responses, desiccation and seed development was found in the promoters of most NADP-ME1 genes. Generally, the NADP-ME genes of Brassica rapa, B. oleracea and B. napus had less introns than their corresponding Arabidopsis orthologs. In these three Brassica species, the NADP-ME genes derived from the least fractionated subgenome have lost less introns than those from the medium fractionated and most fractionated subgenomes. BrNADP-ME1 showed the highest expression in petals and mature embryos. Two paralogous NADP-ME2 genes (BrNADP-ME2a and BrNADP-ME2b) shared similar expression profiles and differential expression levels. BrNADP-ME3 showed down-regulation during embryogenesis and reached its lowest expression in early cotyledonary embryos. BrNADP-ME4 was expressed widely in multiple organs and showed high expression during the whole embryogenesis process. Different NADP-ME genes of B. rapa showed differential gene expression profiles in young leaves after ABA treatment or cold stress. Our genome-wide identification and characterization of NADP-ME genes extend our understanding of the evolution or function of this family in Brassicaceae.

  7. Genome-wide identification, classification, and analysis of NADP-ME family members from 12 crucifer species.

    PubMed

    Tao, Peng; Guo, Weiling; Li, Biyuan; Wang, Wuhong; Yue, Zhichen; Lei, Juanli; Zhao, Yanting; Zhong, Xinmin

    2016-06-01

    NADP-dependent malic enzymes (NADP-MEs) play essential roles in both normal development and stress responses in plants. Here, genome-wide analysis was performed to identify 65 putative NADP-ME genes from 12 crucifer species. These NADP-ME genes were grouped into five categories of syntenic orthologous genes and were divided into three clades of a phylogenic tree. Promoter motif analysis showed that NADP-ME1 genes in Group IV were more conserved with each other than the other NADP-ME genes in Groups I and II. A nucleotide motif involved in ABA responses, desiccation and seed development was found in the promoters of most NADP-ME1 genes. Generally, the NADP-ME genes of Brassica rapa, B. oleracea and B. napus had less introns than their corresponding Arabidopsis orthologs. In these three Brassica species, the NADP-ME genes derived from the least fractionated subgenome have lost less introns than those from the medium fractionated and most fractionated subgenomes. BrNADP-ME1 showed the highest expression in petals and mature embryos. Two paralogous NADP-ME2 genes (BrNADP-ME2a and BrNADP-ME2b) shared similar expression profiles and differential expression levels. BrNADP-ME3 showed down-regulation during embryogenesis and reached its lowest expression in early cotyledonary embryos. BrNADP-ME4 was expressed widely in multiple organs and showed high expression during the whole embryogenesis process. Different NADP-ME genes of B. rapa showed differential gene expression profiles in young leaves after ABA treatment or cold stress. Our genome-wide identification and characterization of NADP-ME genes extend our understanding of the evolution or function of this family in Brassicaceae. PMID:26839002

  8. Genome-Wide Identification and Expression Analysis of the NAC Transcription Factor Family in Cassava.

    PubMed

    Hu, Wei; Wei, Yunxie; Xia, Zhiqiang; Yan, Yan; Hou, Xiaowan; Zou, Meiling; Lu, Cheng; Wang, Wenquan; Peng, Ming

    2015-01-01

    NAC [no apical meristem (NAM), Arabidopsis transcription activation factor [ATAF1/2] and cup-shaped cotyledon (CUC2)] proteins is one of the largest groups of plant specific transcription factors and plays a crucial role in plant growth, development, and adaption to the environment. Currently, no information is known about the NAC family in cassava. In this study, 96 NAC genes (MeNACs) were identified from the cassava genome. Phylogenetic analysis of the NACs from cassava and Arabidopsis showed that MeNAC proteins can be clustered into 16 subgroups. Gene structure analysis found that the number of introns of MeNAC genes varied from 0 to 5, with the majority of MeNAC genes containing two introns, indicating a small gene structure diversity of cassava NAC genes. Conserved motif analysis revealed that all of the identified MeNACs had the conserved NAC domain and/or NAM domain. Global expression analysis suggested that MeNAC genes exhibited different expression profiles in different tissues between wild subspecies and cultivated varieties, indicating their involvement in the functional diversity of different accessions. Transcriptome analysis demonstrated that MeNACs had a widely transcriptional response to drought stress and that they had differential expression profiles in different accessions, implying their contribution to drought stress resistance in cassava. Finally, the expression of twelve MeNAC genes was analyzed under osmotic, salt, cold, ABA, and H2O2 treatments, indicating that cassava NACs may represent convergence points of different signaling pathways. Taken together, this work found some excellent tissue-specific and abiotic stress-responsive candidate MeNAC genes, which would provide a solid foundation for functional investigation of the NAC family, crop improvement and improved understanding of signal transduction in plants. These data bring new insight on the complexity of the transcriptional control of MeNAC genes and support the hypothesis that

  9. A flexible multi-species genome-wide 60K SNP chip developed from pooled resequencing of 240 Eucalyptus tree genomes across 12 species.

    PubMed

    Silva-Junior, Orzenil B; Faria, Danielle A; Grattapaglia, Dario

    2015-06-01

    We used whole genome resequencing of pooled individuals to develop a high-density single-nucleotide polymorphism (SNP) chip for Eucalyptus. Genomes of 240 trees of 12 species were sequenced at 3.5× each, and 46 997 586 raw SNP variants were subject to multivariable filtering metrics toward a multispecies, genome-wide distributed chip content. Of the 60 904 SNPs on the chip, 59 222 were genotyped and 51 204 were polymorphic across 14 Eucalyptus species, providing a 96% genome-wide coverage with 1 SNP/12-20 kb, and 47 069 SNPs at ≤ 10 kb from 30 444 of the 33 917 genes in the Eucalyptus genome. Given the EUChip60K multi-species genotyping flexibility, we show that both the sample size and taxonomic composition of cluster files impact heterozygous call specificity and sensitivity by benchmarking against 'gold standard' genotypes derived from deeply sequenced individual tree genomes. Thousands of SNPs were shared across species, likely representing ancient variants arisen before the split of these taxa, hinting to a recent eucalypt radiation. We show that the variable SNP filtering constraints allowed coverage of the entire site frequency spectrum, mitigating SNP ascertainment bias. The EUChip60K represents an outstanding tool with which to address population genomics questions in Eucalyptus and to empower genomic selection, GWAS and the broader study of complex trait variation in eucalypts.

  10. Genome-Wide Characterization and Expression Analysis of the Germin-Like Protein Family in Rice and Arabidopsis

    PubMed Central

    Li, Lu; Xu, Xihui; Chen, Chen; Shen, Zhenguo

    2016-01-01

    Previous studies have shown that germin-like proteins (GLPs) are present ubiquitously in rice and Arabidopsis. However, the understanding regarding their role in development and abiotic/biotic stress resistance remains limited. In the present study, we report genome-wide identification, characterisation, subcellular localization, enzyme activity, and expression analysis of the GLP gene family in rice and Arabidopsis to study their functions. In total, 43 and 32 GLPs in the rice and Arabidopsis genome were identified based on a systematic analysis, respectively. The GLP genes were clustered into six clades based on phylogenetic analysis, and many stress and developmental-related cis-elements were detected in promoters of GLP genes. In addition, subcellular location and superoxide dismutase (SOD) analysis demonstrated that the random selected OsGLP genes on chromosomes 8 and 4 of rice were expressed in the cell wall with SOD activity. Overall, our results showed that tandem duplication events, especially the clusters of tandem duplication genes on chromosome 8 in rice, play a major role in expansion of the GLP family and thus increase our understanding of the role of the GLP family in abiotic/biotic stress and development. PMID:27669230

  11. Genome-wide analysis of the SBP-box gene family in Chinese cabbage (Brassica rapa subsp. pekinensis).

    PubMed

    Tan, Hua-Wei; Song, Xiao-Ming; Duan, Wei-Ke; Wang, Yan; Hou, Xi-Lin

    2015-11-01

    The SQUAMOSA PROMOTER BINDING PROTEIN (SBP)-box gene family contains highly conserved plant-specific transcription factors that play an important role in plant development, especially in flowering. Chinese cabbage (Brassica rapa subsp. pekinensis) is a leafy vegetable grown worldwide and is used as a model crop for research in genome duplication. The present study aimed to characterize the SBP-box transcription factor genes in Chinese cabbage. Twenty-nine SBP-box genes were identified in the Chinese cabbage genome and classified into six groups. We identified 23 orthologous and 5 co-orthologous SBP-box gene pairs between Chinese cabbage and Arabidopsis. An interaction network among these genes was constructed. Sixteen SBP-box genes were expressed more abundantly in flowers than in other tissues, suggesting their involvement in flowering. We show that the MiR156/157 family members may regulate the coding regions or 3'-UTR regions of Chinese cabbage SBP-box genes. As SBP-box genes were found to potentially participate in some plant development pathways, quantitative real-time PCR analysis was performed and showed that Chinese cabbage SBP-box genes were also sensitive to the exogenous hormones methyl jasmonic acid and salicylic acid. The SBP-box genes have undergone gene duplication and loss, evolving a more refined regulation for diverse stimulation in plant tissues. Our comprehensive genome-wide analysis provides insights into the SBP-box gene family of Chinese cabbage.

  12. Genome-Wide Characterization and Expression Analysis of the Germin-Like Protein Family in Rice and Arabidopsis.

    PubMed

    Li, Lu; Xu, Xihui; Chen, Chen; Shen, Zhenguo

    2016-01-01

    Previous studies have shown that germin-like proteins (GLPs) are present ubiquitously in rice and Arabidopsis. However, the understanding regarding their role in development and abiotic/biotic stress resistance remains limited. In the present study, we report genome-wide identification, characterisation, subcellular localization, enzyme activity, and expression analysis of the GLP gene family in rice and Arabidopsis to study their functions. In total, 43 and 32 GLPs in the rice and Arabidopsis genome were identified based on a systematic analysis, respectively. The GLP genes were clustered into six clades based on phylogenetic analysis, and many stress and developmental-related cis-elements were detected in promoters of GLP genes. In addition, subcellular location and superoxide dismutase (SOD) analysis demonstrated that the random selected OsGLP genes on chromosomes 8 and 4 of rice were expressed in the cell wall with SOD activity. Overall, our results showed that tandem duplication events, especially the clusters of tandem duplication genes on chromosome 8 in rice, play a major role in expansion of the GLP family and thus increase our understanding of the role of the GLP family in abiotic/biotic stress and development. PMID:27669230

  13. Genome-Wide Identification of the Target Genes of AP2-O, a Plasmodium AP2-Family Transcription Factor

    PubMed Central

    Kaneko, Izumi; Iwanaga, Shiroh; Kato, Tomomi; Kobayashi, Issei; Yuda, Masao

    2015-01-01

    Stage-specific transcription is a fundamental biological process in the life cycle of the Plasmodium parasite. Proteins containing the AP2 DNA-binding domain are responsible for stage-specific transcriptional regulation and belong to the only known family of transcription factors in Plasmodium parasites. Comprehensive identification of their target genes will advance our understanding of the molecular basis of stage-specific transcriptional regulation and stage-specific parasite development. AP2-O is an AP2 family transcription factor that is expressed in the mosquito midgut-invading stage, called the ookinete, and is essential for normal morphogenesis of this stage. In this study, we identified the genome-wide target genes of AP2-O by chromatin immunoprecipitation-sequencing and elucidate how this AP2 family transcription factor contributes to the formation of this motile stage. The analysis revealed that AP2-O binds specifically to the upstream genomic regions of more than 500 genes, suggesting that approximately 10% of the parasite genome is directly regulated by AP2-O. These genes are involved in distinct biological processes such as morphogenesis, locomotion, midgut penetration, protection against mosquito immunity and preparation for subsequent oocyst development. This direct and global regulation by AP2-O provides a model for gene regulation in Plasmodium parasites and may explain how these parasites manage to control their complex life cycle using a small number of sequence-specific AP2 transcription factors. PMID:26018192

  14. Genome-wide identification and expression analysis of the IQD gene family in moso bamboo (Phyllostachys edulis)

    PubMed Central

    Wu, Min; Li, Yuan; Chen, Danmei; Liu, Huanlong; Zhu, Dongyue; Xiang, Yan

    2016-01-01

    Members of the plant-specific IQ67-domain (IQD) protein family are involved in various aspects of normal plant growth and developmental processes as well as basal defence response. Although hundreds of IQD proteins have been identified, only a small number of IQDs have been functionally characterized. Moreover, no systematic study has been performed on moso bamboo. In this study, we performed for the first time a genome-wide identification and expression analysis of the IQD gene family in moso bamboo. We identified 29 non-redundant PeIQD encoding genes. Analysis of the evolutionary patterns and divergence revealed that the IQD genes underwent a large-scale event around 12 million years ago and the division times of IQD family genes between moso bamboo and rice, and, between moso bamboo and Brachypodium, were found to be 20–35 MYA and 25–40 MYA, respectively. We surveyed the putative promoter regions of the PeIQD genes, which showed that largely stress-related cis-elements existed in these genes. The expression profiles of the IQD genes shed light on their functional divergence. Additionally, a yeast two-hybrid assay proved that PeIQD8 can interact with PeCaM2 and that IQ or I in the IQ motif is required for PeIQD8 to combine with CaM2. PMID:27094318

  15. Genome-wide identification, classification, and expression analysis of CDPK and its closely related gene families in poplar (Populus trichocarpa).

    PubMed

    Zuo, Ran; Hu, Ruibo; Chai, Guohua; Xu, Meiling; Qi, Guang; Kong, Yingzhen; Zhou, Gongke

    2013-03-01

    Calcium-dependent protein kinases (CDPKs) are Ca(2+)-binding proteins known to play crucial roles in Ca(2+) signal transduction pathways which have been identified throughout plant kingdom and in certain types of protists. Genome-wide analysis of CDPKs have been carried out in Arabidopsis, rice and wheat, and quite a few of CDPKs were proved to play crucial roles in plant stress responsive signature pathways. In this study, a comprehensive analysis of Populus CDPK and its closely related gene families was performed, including phylogeny, chromosome locations, gene structures, and expression profiles. Thirty Populus CDPK genes and twenty closely related kinase genes were identified, which were phylogenetically clustered into eight distinct subfamilies and predominately distributed across fifteen linkage groups (LG). Genomic organization analyses indicated that purifying selection has played a pivotal role in the retention and maintenance of Populus CDPK gene family. Furthermore, microarray analysis showed that a number of Populus CDPK and its closely related genes differentially expressed across disparate tissues and under various stresses. The expression profiles of paralogous pairs were also investigated to reveal their evolution fates. In addition, quantitative real-time RT-PCR was performed on nine selected CDPK genes to confirm their responses to drought stress treatment. These observations may lay the foundation for future functional analysis of Populus CDPK and its closely related gene families to unravel their biological roles.

  16. Genome-wide analysis of the SBP-box gene family in Chinese cabbage (Brassica rapa subsp. pekinensis).

    PubMed

    Tan, Hua-Wei; Song, Xiao-Ming; Duan, Wei-Ke; Wang, Yan; Hou, Xi-Lin

    2015-11-01

    The SQUAMOSA PROMOTER BINDING PROTEIN (SBP)-box gene family contains highly conserved plant-specific transcription factors that play an important role in plant development, especially in flowering. Chinese cabbage (Brassica rapa subsp. pekinensis) is a leafy vegetable grown worldwide and is used as a model crop for research in genome duplication. The present study aimed to characterize the SBP-box transcription factor genes in Chinese cabbage. Twenty-nine SBP-box genes were identified in the Chinese cabbage genome and classified into six groups. We identified 23 orthologous and 5 co-orthologous SBP-box gene pairs between Chinese cabbage and Arabidopsis. An interaction network among these genes was constructed. Sixteen SBP-box genes were expressed more abundantly in flowers than in other tissues, suggesting their involvement in flowering. We show that the MiR156/157 family members may regulate the coding regions or 3'-UTR regions of Chinese cabbage SBP-box genes. As SBP-box genes were found to potentially participate in some plant development pathways, quantitative real-time PCR analysis was performed and showed that Chinese cabbage SBP-box genes were also sensitive to the exogenous hormones methyl jasmonic acid and salicylic acid. The SBP-box genes have undergone gene duplication and loss, evolving a more refined regulation for diverse stimulation in plant tissues. Our comprehensive genome-wide analysis provides insights into the SBP-box gene family of Chinese cabbage. PMID:26599708

  17. Genome wide in silico characterization of Dof gene families of pigeonpea (Cajanus cajan (L) Millsp.).

    PubMed

    Malviya, N; Gupta, S; Singh, V K; Yadav, M K; Bisht, N C; Sarangi, B K; Yadav, D

    2015-02-01

    The DNA binding with One Finger (Dof) protein is a plant specific transcription factor involved in the regulation of wide range of processes. The analysis of whole genome sequence of pigeonpea has identified 38 putative Dof genes (CcDof) distributed on 8 chromosomes. A total of 17 out of 38 CcDof genes were found to be intronless. A comprehensive in silico characterization of CcDof gene family including the gene structure, chromosome location, protein motif, phylogeny, gene duplication and functional divergence has been attempted. The phylogenetic analysis resulted in 3 major clusters with closely related members in phylogenetic tree revealed common motif distribution. The in silico cis-regulatory element analysis revealed functional diversity with predominance of light responsive and stress responsive elements indicating the possibility of these CcDof genes to be associated with photoperiodic control and biotic and abiotic stress. The duplication pattern showed that tandem duplication is predominant over segmental duplication events. The comparative phylogenetic analysis of these Dof proteins along with 78 soybean, 36 Arabidopsis and 30 rice Dof proteins revealed 7 major clusters. Several groups of orthologs and paralogs were identified based on phylogenetic tree constructed. Our study provides useful information for functional characterization of CcDof genes. PMID:25344821

  18. The MAPKKK gene family in Gossypium raimondii: genome-wide identification, classification and expression analysis.

    PubMed

    Yin, Zujun; Wang, Junjuan; Wang, Delong; Fan, Weili; Wang, Shuai; Ye, Wuwei

    2013-01-01

    Mitogen-activated protein kinase (MAPK) cascades are conserved signal transduction pathways in all eukaryotic organisms. MAPKKKs (MAPK kinase kinases) operate at the top levels of these cascades. Recently, this family of genes has been systematically investigated in Arabidopsis, rice and maize, but has not yet been characterized in cotton. In this study, we identified 78 putative MAPKKK genes in the genome of the diploid cotton, Gossypium raimondii. They were classified into three subfamilies, of which 12 were ZIK, 22 were MEKK and 44 were Raf. The ZIK and MEKK genes displayed a scattered genomic distribution across 11 of the 13 chromosomes, whereas Raf genes were distributed across the entire genome. Their conserved patterns observed for introns and additional domains were consistent with the evolutionary relationships inferred from the phylogenetic analysis within subfamily. Transcriptome sequencing data were used to investigate their transcript profiles in mature leaves, 0 day and 3 days post-anthesis (DPA) ovules. Sixty MAPKKK genes were expressed, of which 41 were strongly expressed in mature leaves. Twelve MAPKKK genes were more highly expressed in 3-DPA ovules than in 0-DPA ovules. Our results provide a foundation for future evolutionary and functional characterizations of MAPKKK genes in cotton and probably other Gossypium plants.

  19. Genome-wide analysis of the MYB gene family in physic nut (Jatropha curcas L.).

    PubMed

    Zhou, Changpin; Chen, Yanbo; Wu, Zhenying; Lu, Wenjia; Han, Jinli; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Jiang, Huawu; Wu, Guojiang

    2015-11-01

    The MYB proteins comprise one of the largest transcription factor families in plants, and play key roles in regulatory networks controlling development, metabolism, and stress responses. A total of 125 MYB genes (JcMYB) have been identified in the physic nut (Jatropha curcas L.) genome, including 120 2R-type MYB, 4 3R-MYB, and 1 4R-MYB genes. Based on exon-intron arrangement of MYBs from both lower (Physcomitrella patens) and higher (physic nut, Arabidopsis, and rice) plants, we can classify plant MYB genes into ten groups (MI-X), except for MIX genes which are nonexistent in higher plants. We also observed that MVIII genes may be one of the most ancient MYB types which consist of both R2R3- and 3R-MYB genes. Most MYB genes (76.8% in physic nut) belong to the MI group which can be divided into 34 subgroups. The JcMYB genes were nonrandomly distributed on its 11 linkage groups (LGs). The expansion of MYB genes across several subgroups was observed and resulted from genome triplication of ancient dicotyledons and from both ancient and recent tandem duplication events in the physic nut genome. The expression patterns of several MYB duplicates in the physic nut showed differences in four tissues (root, stem, leaf, and seed), and 34 MYB genes responded to at least one abiotic stressor (drought, salinity, phosphate starvation, and nitrogen starvation) in leaves and/or roots based on the data analysis of digital gene expression tags. Overexpression of the JcMYB001 gene in Arabidopsis increased its sensitivity to drought and salinity stresses. PMID:26142104

  20. Genome-wide characterization of the CBF/DREB1 gene family in Brassica rapa.

    PubMed

    Lee, Sang-Choon; Lim, Myung-Ho; Yu, Jae-Gyeong; Park, Beom-Seok; Yang, Tae-Jin

    2012-12-01

    The C-repeat/dehydration-responsive element binding transcription factors (CBF/DREBs) are important proteins in involved in responses to abiotic stress in plants. We identified ten BrDREB1 genes belonging to the CBF/DREB1 gene family in the Brassica rapa whole genome sequence, whereas six genes are found in the Arabidopsis thaliana genome. The deduced amino acid sequences of the B. rapa genes showed conserved motifs shared with other known plant CBF/DREB1s. Comparative analysis revealed that nine of the BrDREB1 genes were derived from the recent genome triplication in the tribe Brassiceae and the other one was translocated. The nine genes were located in seven of the 12 macrosyntenic blocks that are triplicated counterparts of four Arabidopsis macrosyntenic blocks harboring six CBF/DREB1 genes: one gene on each of three blocks and three tandemly arrayed genes on another block. We inspected the expression patterns of eight BrDREB1 genes by RT-PCR and microarray database searches. All eight genes were highly up-regulated during cold (4 °C) treatment, and some of them were also responsive to salt (250 mM NaCl), drought (air drying), and ABA (100 μM) treatment. Microarray data for plant developmental stages revealed that BrDREB1C2 was highly expressed during a period of cold treatment for vernalization, similar to abiotic stress-inducible genes homologous to Bn28a, Bn47, Bn115, and BoRS1, but almost opposite of BrFLC genes. Taken together, the number of BrDREB1 genes increased to 10 by genome triplication and reorganization, providing additional functions in B. rapa abiotic stress responses and development, as distinct from their Arabidopsis homologs.

  1. Genome-Wide Comparative Analysis of Chemosensory Gene Families in Five Tsetse Fly Species.

    PubMed

    Macharia, Rosaline; Mireji, Paul; Murungi, Edwin; Murilla, Grace; Christoffels, Alan; Aksoy, Serap; Masiga, Daniel

    2016-02-01

    For decades, odour-baited traps have been used for control of tsetse flies (Diptera; Glossinidae), vectors of African trypanosomes. However, differential responses to known attractants have been reported in different Glossina species, hindering establishment of a universal vector control tool. Availability of full genome sequences of five Glossina species offers an opportunity to compare their chemosensory repertoire and enhance our understanding of their biology in relation to chemosensation. Here, we identified and annotated the major chemosensory gene families in Glossina. We identified a total of 118, 115, 124, and 123 chemosensory genes in Glossina austeni, G. brevipalpis, G. f. fuscipes, G. pallidipes, respectively, relative to 127 reported in G. m. morsitans. Our results show that tsetse fly genomes have fewer chemosensory genes when compared to other dipterans such as Musca domestica (n>393), Drosophila melanogaster (n = 246) and Anopheles gambiae (n>247). We also found that Glossina chemosensory genes are dispersed across distantly located scaffolds in their respective genomes, in contrast to other insects like D. melanogaster whose genes occur in clusters. Further, Glossina appears to be devoid of sugar receptors and to have expanded CO2 associated receptors, potentially reflecting Glossina's obligate hematophagy and the need to detect hosts that may be out of sight. We also identified, in all species, homologs of Ir84a; a Drosophila-specific ionotropic receptor that promotes male courtship suggesting that this is a conserved trait in tsetse flies. Notably, our selection analysis revealed that a total of four gene loci (Gr21a, GluRIIA, Gr28b, and Obp83a) were under positive selection, which confers fitness advantage to species. These findings provide a platform for studies to further define the language of communication of tsetse with their environment, and influence development of novel approaches for control. PMID:26886411

  2. Genome-wide analysis of the MYB gene family in physic nut (Jatropha curcas L.).

    PubMed

    Zhou, Changpin; Chen, Yanbo; Wu, Zhenying; Lu, Wenjia; Han, Jinli; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Jiang, Huawu; Wu, Guojiang

    2015-11-01

    The MYB proteins comprise one of the largest transcription factor families in plants, and play key roles in regulatory networks controlling development, metabolism, and stress responses. A total of 125 MYB genes (JcMYB) have been identified in the physic nut (Jatropha curcas L.) genome, including 120 2R-type MYB, 4 3R-MYB, and 1 4R-MYB genes. Based on exon-intron arrangement of MYBs from both lower (Physcomitrella patens) and higher (physic nut, Arabidopsis, and rice) plants, we can classify plant MYB genes into ten groups (MI-X), except for MIX genes which are nonexistent in higher plants. We also observed that MVIII genes may be one of the most ancient MYB types which consist of both R2R3- and 3R-MYB genes. Most MYB genes (76.8% in physic nut) belong to the MI group which can be divided into 34 subgroups. The JcMYB genes were nonrandomly distributed on its 11 linkage groups (LGs). The expansion of MYB genes across several subgroups was observed and resulted from genome triplication of ancient dicotyledons and from both ancient and recent tandem duplication events in the physic nut genome. The expression patterns of several MYB duplicates in the physic nut showed differences in four tissues (root, stem, leaf, and seed), and 34 MYB genes responded to at least one abiotic stressor (drought, salinity, phosphate starvation, and nitrogen starvation) in leaves and/or roots based on the data analysis of digital gene expression tags. Overexpression of the JcMYB001 gene in Arabidopsis increased its sensitivity to drought and salinity stresses.

  3. Genome-Wide Comparative Analysis of Chemosensory Gene Families in Five Tsetse Fly Species.

    PubMed

    Macharia, Rosaline; Mireji, Paul; Murungi, Edwin; Murilla, Grace; Christoffels, Alan; Aksoy, Serap; Masiga, Daniel

    2016-02-01

    For decades, odour-baited traps have been used for control of tsetse flies (Diptera; Glossinidae), vectors of African trypanosomes. However, differential responses to known attractants have been reported in different Glossina species, hindering establishment of a universal vector control tool. Availability of full genome sequences of five Glossina species offers an opportunity to compare their chemosensory repertoire and enhance our understanding of their biology in relation to chemosensation. Here, we identified and annotated the major chemosensory gene families in Glossina. We identified a total of 118, 115, 124, and 123 chemosensory genes in Glossina austeni, G. brevipalpis, G. f. fuscipes, G. pallidipes, respectively, relative to 127 reported in G. m. morsitans. Our results show that tsetse fly genomes have fewer chemosensory genes when compared to other dipterans such as Musca domestica (n>393), Drosophila melanogaster (n = 246) and Anopheles gambiae (n>247). We also found that Glossina chemosensory genes are dispersed across distantly located scaffolds in their respective genomes, in contrast to other insects like D. melanogaster whose genes occur in clusters. Further, Glossina appears to be devoid of sugar receptors and to have expanded CO2 associated receptors, potentially reflecting Glossina's obligate hematophagy and the need to detect hosts that may be out of sight. We also identified, in all species, homologs of Ir84a; a Drosophila-specific ionotropic receptor that promotes male courtship suggesting that this is a conserved trait in tsetse flies. Notably, our selection analysis revealed that a total of four gene loci (Gr21a, GluRIIA, Gr28b, and Obp83a) were under positive selection, which confers fitness advantage to species. These findings provide a platform for studies to further define the language of communication of tsetse with their environment, and influence development of novel approaches for control.

  4. Genome-Wide Comparative Analysis of Chemosensory Gene Families in Five Tsetse Fly Species

    PubMed Central

    Macharia, Rosaline; Mireji, Paul; Murungi, Edwin; Murilla, Grace; Christoffels, Alan; Aksoy, Serap; Masiga, Daniel

    2016-01-01

    For decades, odour-baited traps have been used for control of tsetse flies (Diptera; Glossinidae), vectors of African trypanosomes. However, differential responses to known attractants have been reported in different Glossina species, hindering establishment of a universal vector control tool. Availability of full genome sequences of five Glossina species offers an opportunity to compare their chemosensory repertoire and enhance our understanding of their biology in relation to chemosensation. Here, we identified and annotated the major chemosensory gene families in Glossina. We identified a total of 118, 115, 124, and 123 chemosensory genes in Glossina austeni, G. brevipalpis, G. f. fuscipes, G. pallidipes, respectively, relative to 127 reported in G. m. morsitans. Our results show that tsetse fly genomes have fewer chemosensory genes when compared to other dipterans such as Musca domestica (n>393), Drosophila melanogaster (n = 246) and Anopheles gambiae (n>247). We also found that Glossina chemosensory genes are dispersed across distantly located scaffolds in their respective genomes, in contrast to other insects like D. melanogaster whose genes occur in clusters. Further, Glossina appears to be devoid of sugar receptors and to have expanded CO2 associated receptors, potentially reflecting Glossina's obligate hematophagy and the need to detect hosts that may be out of sight. We also identified, in all species, homologs of Ir84a; a Drosophila-specific ionotropic receptor that promotes male courtship suggesting that this is a conserved trait in tsetse flies. Notably, our selection analysis revealed that a total of four gene loci (Gr21a, GluRIIA, Gr28b, and Obp83a) were under positive selection, which confers fitness advantage to species. These findings provide a platform for studies to further define the language of communication of tsetse with their environment, and influence development of novel approaches for control. PMID:26886411

  5. Genome-wide characterization of the CBF/DREB1 gene family in Brassica rapa.

    PubMed

    Lee, Sang-Choon; Lim, Myung-Ho; Yu, Jae-Gyeong; Park, Beom-Seok; Yang, Tae-Jin

    2012-12-01

    The C-repeat/dehydration-responsive element binding transcription factors (CBF/DREBs) are important proteins in involved in responses to abiotic stress in plants. We identified ten BrDREB1 genes belonging to the CBF/DREB1 gene family in the Brassica rapa whole genome sequence, whereas six genes are found in the Arabidopsis thaliana genome. The deduced amino acid sequences of the B. rapa genes showed conserved motifs shared with other known plant CBF/DREB1s. Comparative analysis revealed that nine of the BrDREB1 genes were derived from the recent genome triplication in the tribe Brassiceae and the other one was translocated. The nine genes were located in seven of the 12 macrosyntenic blocks that are triplicated counterparts of four Arabidopsis macrosyntenic blocks harboring six CBF/DREB1 genes: one gene on each of three blocks and three tandemly arrayed genes on another block. We inspected the expression patterns of eight BrDREB1 genes by RT-PCR and microarray database searches. All eight genes were highly up-regulated during cold (4 °C) treatment, and some of them were also responsive to salt (250 mM NaCl), drought (air drying), and ABA (100 μM) treatment. Microarray data for plant developmental stages revealed that BrDREB1C2 was highly expressed during a period of cold treatment for vernalization, similar to abiotic stress-inducible genes homologous to Bn28a, Bn47, Bn115, and BoRS1, but almost opposite of BrFLC genes. Taken together, the number of BrDREB1 genes increased to 10 by genome triplication and reorganization, providing additional functions in B. rapa abiotic stress responses and development, as distinct from their Arabidopsis homologs. PMID:23148914

  6. Genome-wide Identification of TCP Family Transcription Factors from Populus euphratica and Their Involvement in Leaf Shape Regulation.

    PubMed

    Ma, Xiaodong; Ma, Jianchao; Fan, Di; Li, Chaofeng; Jiang, Yuanzhong; Luo, Keming

    2016-01-01

    Higher plants have been shown to experience a juvenile vegetative phase, an adult vegetative phase, and a reproductive phase during its postembryonic development and distinct lateral organ morphologies have been observed at the different development stages. Populus euphratica, commonly known as a desert poplar, has developed heteromorphic leaves during its development. The TCP family genes encode a group of plant-specific transcription factors involved in several aspects of plant development. In particular, TCPs have been shown to influence leaf size and shape in many herbaceous plants. However, whether these functions are conserved in woody plants remains unknown. In the present study, we carried out genome-wide identification of TCP genes in P. euphratica and P. trichocarpa, and 33 and 36 genes encoding putative TCP proteins were found, respectively. Phylogenetic analysis of the poplar TCPs together with Arabidopsis TCPs indicated a biased expansion of the TCP gene family via segmental duplications. In addition, our results have also shown a correlation between different expression patterns of several P. euphratica TCP genes and leaf shape variations, indicating their involvement in the regulation of leaf shape development. PMID:27605130

  7. Genome-Wide Analysis, Classification, Evolution, and Expression Analysis of the Cytochrome P450 93 Family in Land Plants

    PubMed Central

    Du, Hai; Ran, Feng; Dong, Hong-Li; Wen, Jing; Li, Jia-Na; Liang, Zhe

    2016-01-01

    Cytochrome P450 93 family (CYP93) belonging to the cytochrome P450 superfamily plays important roles in diverse plant processes. However, no previous studies have investigated the evolution and expression of the members of this family. In this study, we performed comprehensive genome-wide analysis to identify CYP93 genes in 60 green plants. In all, 214 CYP93 proteins were identified; they were specifically found in flowering plants and could be classified into ten subfamilies—CYP93A–K, with the last two being identified first. CYP93A is the ancestor that was derived in flowering plants, and the remaining showed lineage-specific distribution—CYP93B and CYP93C are present in dicots; CYP93F is distributed only in Poaceae; CYP93G and CYP93J are monocot-specific; CYP93E is unique to legumes; CYP93H and CYP93K are only found in Aquilegia coerulea, and CYP93D is Brassicaceae-specific. Each subfamily generally has conserved gene numbers, structures, and characteristics, indicating functional conservation during evolution. Synonymous nucleotide substitution (dN/dS) analysis showed that CYP93 genes are under strong negative selection. Comparative expression analyses of CYP93 genes in dicots and monocots revealed that they are preferentially expressed in the roots and tend to be induced by biotic and/or abiotic stresses, in accordance with their well-known functions in plant secondary biosynthesis. PMID:27760179

  8. Genome-wide Identification of TCP Family Transcription Factors from Populus euphratica and Their Involvement in Leaf Shape Regulation

    PubMed Central

    Ma, Xiaodong; Ma, Jianchao; Fan, Di; Li, Chaofeng; Jiang, Yuanzhong; Luo, Keming

    2016-01-01

    Higher plants have been shown to experience a juvenile vegetative phase, an adult vegetative phase, and a reproductive phase during its postembryonic development and distinct lateral organ morphologies have been observed at the different development stages. Populus euphratica, commonly known as a desert poplar, has developed heteromorphic leaves during its development. The TCP family genes encode a group of plant-specific transcription factors involved in several aspects of plant development. In particular, TCPs have been shown to influence leaf size and shape in many herbaceous plants. However, whether these functions are conserved in woody plants remains unknown. In the present study, we carried out genome-wide identification of TCP genes in P. euphratica and P. trichocarpa, and 33 and 36 genes encoding putative TCP proteins were found, respectively. Phylogenetic analysis of the poplar TCPs together with Arabidopsis TCPs indicated a biased expansion of the TCP gene family via segmental duplications. In addition, our results have also shown a correlation between different expression patterns of several P. euphratica TCP genes and leaf shape variations, indicating their involvement in the regulation of leaf shape development. PMID:27605130

  9. Comprehensive analysis of CCCH-type zinc finger gene family in citrus (Clementine mandarin) by genome-wide characterization.

    PubMed

    Liu, Shengrui; Khan, Muhammad Rehman Gul; Li, Yongping; Zhang, Jinzhi; Hu, Chungen

    2014-10-01

    The CCCH-type zinc finger proteins comprise a large gene family of regulatory proteins and are widely distributed in eukaryotic organisms. The CCCH proteins have been implicated in multiple biological processes and environmental responses in plants. Little information is available, however, about CCCH genes in plants, especially in woody plants such as citrus. The release of the whole-genome sequence of citrus allowed us to perform a genome-wide analysis of CCCH genes and to compare the identified proteins with their orthologs in model plants. In this study, 62 CCCH genes and a total of 132 CCCH motifs were identified, and a comprehensive analysis including the chromosomal locations, phylogenetic relationships, functional annotations, gene structures and conserved motifs was performed. Distribution mapping revealed that 54 of the 62 CCCH genes are unevenly dispersed on the nine citrus chromosomes. Based on phylogenetic analysis and gene structural features, we constructed 5 subfamilies of 62 CCCH members and integrative subfamilies from citrus, Arabidopsis, and rice, respectively. Importantly, large numbers of SNPs and InDels in 26 CCCH genes were identified from Poncirus trifoliata and Fortunella japonica using whole-genome deep re-sequencing. Furthermore, citrus CCCH genes showed distinct temporal and spatial expression patterns in different developmental processes and in response to various stress conditions. Our comprehensive analysis of CleC3Hs is a valuable resource that further elucidates the roles of CCCH family members in plant growth and development. In addition, variants and comparative genomics analyses deepen our understanding of the evolution of the CCCH gene family and will contribute to further genetics and genomics studies of citrus and other plant species.

  10. Genome-wide analysis of auxin response factor gene family members in medicinal model plant Salvia miltiorrhiza

    PubMed Central

    Xu, Zhichao; Ji, Aijia; Chen, Shilin

    2016-01-01

    ABSTRACT Auxin response factors (ARFs) can function as transcriptional activators or repressors to regulate the expression of auxin response genes by specifically binding to auxin response elements (AuxREs) during plant development. Based on a genome-wide strategy using the medicinal model plant Salvia miltiorrhiza, 25 S. miltiorrhiza ARF (SmARF) gene family members in four classes (class Ia, IIa, IIb and III) were comprehensively analyzed to identify characteristics including gene structures, conserved domains, phylogenetic relationships and expression patterns. In a hybrid analysis of the phylogenetic tree, microRNA targets, and expression patterns of SmARFs in different organs, root tissues, and methyl jasmonate or indole-3-acetic acid treatment conditions, we screened for candidate SmARFs involved in various developmental processes of S. miltiorrhiza. Based on this analysis, we predicted that SmARF25, SmARF7, SmARF16 and SmARF20 are involved in flower, leaf, stem and root development, respectively. With the further insight into the targets of miR160 and miR167, specific SmARF genes in S. miltiorrhiza might encode products that participate in biological processes as described for ARF genes in Arabidopsis. Our results provide a foundation for understanding the molecular basis and regulatory mechanisms of SmARFs in S. miltiorrhiza. PMID:27230647

  11. Genome-wide analysis of auxin response factor gene family members in medicinal model plant Salvia miltiorrhiza.

    PubMed

    Xu, Zhichao; Ji, Aijia; Song, Jingyuan; Chen, Shilin

    2016-01-01

    Auxin response factors (ARFs) can function as transcriptional activators or repressors to regulate the expression of auxin response genes by specifically binding to auxin response elements (AuxREs) during plant development. Based on a genome-wide strategy using the medicinal model plant Salvia miltiorrhiza, 25 S. miltiorrhiza ARF (SmARF) gene family members in four classes (class Ia, IIa, IIb and III) were comprehensively analyzed to identify characteristics including gene structures, conserved domains, phylogenetic relationships and expression patterns. In a hybrid analysis of the phylogenetic tree, microRNA targets, and expression patterns of SmARFs in different organs, root tissues, and methyl jasmonate or indole-3-acetic acid treatment conditions, we screened for candidate SmARFs involved in various developmental processes of S. miltiorrhiza Based on this analysis, we predicted that SmARF25, SmARF7, SmARF16 and SmARF20 are involved in flower, leaf, stem and root development, respectively. With the further insight into the targets of miR160 and miR167, specific SmARF genes in S. miltiorrhiza might encode products that participate in biological processes as described for ARF genes in Arabidopsis Our results provide a foundation for understanding the molecular basis and regulatory mechanisms of SmARFs in S. miltiorrhiza. PMID:27230647

  12. Exploring Genome-wide DNA Methylation Profiles Altered in Kashin-Beck Disease Using Infinium Human Methylation 450 Bead Chips.

    PubMed

    Shi, Xiao Wei; Shi, Bo Hui; Lyu, Ai Li; Zhang, Feng; Zhou, Tian Tian; Guo, Xiong

    2016-07-01

    To understand how differentially methylated genes (DMGs) might affect the pathogenesis of Kashin-Beck disease (KBD). Genome-wide methylation profiling of whole blood from 12 matched KBD and controls pairs was performed using a high-resolution Infinium 450 K methylation array. In total, 97 CpG sites were differentially methylated in KBD compared to the normal controls; of these sites, 36 sites were significantly hypermethylated (covering 22 genes) and 61 sites were significantly hypomethylated (covering 34 genes). Of these genes, 14 significant pathways were identified, the most significant P value pathway was type I diabetes mellitus pathway and pathways associated with autoimmune diseases and inflammatory diseases were included in this study. Subsequently, 4 CpG sites in HLA-DRB1 were validated using bisulfite sequencing polymerase chain reaction (BSP) in articular cartilage, and the results showed significant differences in the methylation status between KBD and controls, consistent with the results of the high-resolution array. These results suggested that differences in genome-wide DNA methylation exist between KBD and the controls, and the biological pathways support the autoimmune disease and inflammatory disease hypothesis of KBD.

  13. Exploring Genome-wide DNA Methylation Profiles Altered in Kashin-Beck Disease Using Infinium Human Methylation 450 Bead Chips.

    PubMed

    Shi, Xiao Wei; Shi, Bo Hui; Lyu, Ai Li; Zhang, Feng; Zhou, Tian Tian; Guo, Xiong

    2016-07-01

    To understand how differentially methylated genes (DMGs) might affect the pathogenesis of Kashin-Beck disease (KBD). Genome-wide methylation profiling of whole blood from 12 matched KBD and controls pairs was performed using a high-resolution Infinium 450 K methylation array. In total, 97 CpG sites were differentially methylated in KBD compared to the normal controls; of these sites, 36 sites were significantly hypermethylated (covering 22 genes) and 61 sites were significantly hypomethylated (covering 34 genes). Of these genes, 14 significant pathways were identified, the most significant P value pathway was type I diabetes mellitus pathway and pathways associated with autoimmune diseases and inflammatory diseases were included in this study. Subsequently, 4 CpG sites in HLA-DRB1 were validated using bisulfite sequencing polymerase chain reaction (BSP) in articular cartilage, and the results showed significant differences in the methylation status between KBD and controls, consistent with the results of the high-resolution array. These results suggested that differences in genome-wide DNA methylation exist between KBD and the controls, and the biological pathways support the autoimmune disease and inflammatory disease hypothesis of KBD. PMID:27554126

  14. Genome-wide analysis of Aux/IAA gene family in Solanaceae species using tomato as a model.

    PubMed

    Wu, Jian; Peng, Zhen; Liu, Songyu; He, Yanjun; Cheng, Lin; Kong, Fuling; Wang, Jie; Lu, Gang

    2012-04-01

    Auxin plays key roles in a wide variety of plant activities, including embryo development, leaf formation, phototropism, fruit development and root initiation and development. Auxin/indoleacetic acid (Aux/IAA) genes, encoding short-lived nuclear proteins, are key regulators in the auxin transduction pathway. But how they work is still unknown. In order to conduct a systematic analysis of this gene family in Solanaceae species, a genome-wide search for the homologues of auxin response genes was carried out. Here, 26 and 27 non redundant AUX/IAAs were identified in tomato and potato, respectively. Using tomato as a model, a comprehensive overview of SlIAA gene family is presented, including the gene structures, phylogeny, chromosome locations, conserved motifs and cis-elements in promoter sequences. A phylogenetic tree generated from alignments of the predicted protein sequences of 31 OsIAAs, 29 AtIAAs, 31 ZmIAAs, and 26 SlIAAs revealed that these IAAs were clustered into three major groups and ten subgroups. Among them, seven subgroups were present in both monocot and dicot species, which indicated that the major functional diversification within the IAA family predated the monocot/dicot divergence. In contrast, group C and some other subgroups seemed to be species-specific. Quantitative real-time PCR (qRT-PCR) analysis showed that 19 of the 26 SlIAA genes could be detected in all tomato organs/tissues, however, seven of them were specifically expressed in some of tomato tissues. The transcript abundance of 17 SlIAA genes were increased within a few hours when the seedlings were treated with exogenous IAA. However, those of other six SlIAAs were decreased. The results of stress treatments showed that most SIIAA family genes responded to at least one of the three stress treatments, however, they exhibited diverse expression levels under different abiotic stress conditions in tomato seedlings. SlIAA20, SlIAA21 and SlIAA22 were not significantly influenced by stress

  15. Genome-wide analysis of the omega-3 fatty acid desaturase gene family in Gossypium

    SciTech Connect

    Yurchenko, Olga P.; Park, Sunjung; Ilut, Daniel C.; Inmon, Jay J.; Millhollon, Jon C.; Liechty, Zach; Page, Justin T.; Jenks, Matthew A.; Chapman, Kent D.; Udall, Joshua A.; Gore, Michael A.; Dyer, John M.

    2014-11-18

    The majority of commercial cotton varieties planted worldwide are derived from Gossypium hirsutum, which is a naturally occurring allotetraploid produced by interspecific hybridization of A- and D-genome diploid progenitor species. While most cotton species are adapted to warm, semi-arid tropical and subtropical regions, and thus perform well in these geographical areas, cotton seedlings are sensitive to cold temperature, which can significantly reduce crop yields. One of the common biochemical responses of plants to cold temperatures is an increase in omega-3 fatty acids, which protects cellular function by maintaining membrane integrity. The purpose of our study was to identify and characterize the omega-3 fatty acid desaturase (FAD) gene family in G. hirsutum, with an emphasis on identifying omega-3 FADs involved in cold temperature adaptation. Results: Eleven omega-3 FAD genes were identified in G. hirsutum, and characterization of the gene family in extant A and D diploid species (G. herbaceum and G. raimondii, respectively) allowed for unambiguous genome assignment of all homoeologs in tetraploid G. hirsutum. The omega-3 FAD family of cotton includes five distinct genes, two of which encode endoplasmic reticulum-type enzymes (FAD3-1 and FAD3-2) and three that encode chloroplast-type enzymes (FAD7/8-1, FAD7/8-2, and FAD7/8-3). The FAD3-2 gene was duplicated in the A genome progenitor species after the evolutionary split from the D progenitor, but before the interspecific hybridization event that gave rise to modern tetraploid cotton. RNA-seq analysis revealed conserved, gene-specific expression patterns in various organs and cell types and semi-quantitative RT-PCR further revealed that FAD7/8-1 was specifically induced during cold temperature treatment of G. hirsutum seedlings. Conclusions: The omega-3 FAD gene family in cotton was characterized at the genome-wide level

  16. MADS-box gene family in rice: genome-wide identification, organization and expression profiling during reproductive development and stress

    PubMed Central

    Arora, Rita; Agarwal, Pinky; Ray, Swatismita; Singh, Ashok Kumar; Singh, Vijay Pal; Tyagi, Akhilesh K; Kapoor, Sanjay

    2007-01-01

    Background MADS-box transcription factors, besides being involved in floral organ specification, have also been implicated in several aspects of plant growth and development. In recent years, there have been reports on genomic localization, protein motif structure, phylogenetic relationships, gene structure and expression of the entire MADS-box family in the model plant system, Arabidopsis. Though there have been some studies in rice as well, an analysis of the complete MADS-box family along with a comprehensive expression profiling was still awaited after the completion of rice genome sequencing. Furthermore, owing to the role of MADS-box family in flower development, an analysis involving structure, expression and functional aspects of MADS-box genes in rice and Arabidopsis was required to understand the role of this gene family in reproductive development. Results A genome-wide molecular characterization and microarray-based expression profiling of the genes encoding MADS-box transcription factor family in rice is presented. Using a thorough annotation exercise, 75 MADS-box genes have been identified in rice and categorized into MIKCc, MIKC*, Mα, Mβ and Mγ groups based on phylogeny. Chromosomal localization of these genes reveals that 16 MADS-box genes, mostly MIKCc-type, are located within the duplicated segments of the rice genome, whereas most of the M-type genes, 20 in all, seem to have resulted from tandem duplications. Nine members belonging to the Mβ group, which was considered absent in monocots, have also been identified. The expression profiles of all the MADS-box genes have been analyzed under 11 temporal stages of panicle and seed development, three abiotic stress conditions, along with three stages of vegetative development. Transcripts for 31 genes accumulate preferentially in the reproductive phase, of which, 12 genes are specifically expressed in seeds, and six genes show expression specific to panicle development. Differential expression of

  17. Genome-wide organization and expression profiling of the NAC transcription factor family in potato (Solanum tuberosum L.).

    PubMed

    Singh, Anil Kumar; Sharma, Vishal; Pal, Awadhesh Kumar; Acharya, Vishal; Ahuja, Paramvir Singh

    2013-08-01

    NAC [no apical meristem (NAM), Arabidopsis thaliana transcription activation factor [ATAF1/2] and cup-shaped cotyledon (CUC2)] proteins belong to one of the largest plant-specific transcription factor (TF) families and play important roles in plant development processes, response to biotic and abiotic cues and hormone signalling. Our genome-wide analysis identified 110 StNAC genes in potato encoding for 136 proteins, including 14 membrane-bound TFs. The physical map positions of StNAC genes on 12 potato chromosomes were non-random, and 40 genes were found to be distributed in 16 clusters. The StNAC proteins were phylogenetically clustered into 12 subgroups. Phylogenetic analysis of StNACs along with their Arabidopsis and rice counterparts divided these proteins into 18 subgroups. Our comparative analysis has also identified 36 putative TNAC proteins, which appear to be restricted to Solanaceae family. In silico expression analysis, using Illumina RNA-seq transcriptome data, revealed tissue-specific, biotic, abiotic stress and hormone-responsive expression profile of StNAC genes. Several StNAC genes, including StNAC072 and StNAC101that are orthologs of known stress-responsive Arabidopsis RESPONSIVE TO DEHYDRATION 26 (RD26) were identified as highly abiotic stress responsive. Quantitative real-time polymerase chain reaction analysis largely corroborated the expression profile of StNAC genes as revealed by the RNA-seq data. Taken together, this analysis indicates towards putative functions of several StNAC TFs, which will provide blue-print for their functional characterization and utilization in potato improvement.

  18. Genome-Wide Identification and Expression Profiling of Tomato Hsp20 Gene Family in Response to Biotic and Abiotic Stresses

    PubMed Central

    Yu, Jiahong; Cheng, Yuan; Feng, Kun; Ruan, Meiying; Ye, Qingjing; Wang, Rongqing; Li, Zhimiao; Zhou, Guozhi; Yao, Zhuping; Yang, Yuejian; Wan, Hongjian

    2016-01-01

    The Hsp20 genes are involved in the response of plants to environment stresses including heat shock and also play a vital role in plant growth and development. They represent the most abundant small heat shock proteins (sHsps) in plants, but little is known about this family in tomato (Solanum lycopersicum), an important vegetable crop in the world. Here, we characterized heat shock protein 20 (SlHsp20) gene family in tomato through integration of gene structure, chromosome location, phylogenetic relationship, and expression profile. Using bioinformatics-based methods, we identified at least 42 putative SlHsp20 genes in tomato. Sequence analysis revealed that most of SlHsp20 genes possessed no intron or a relatively short intron in length. Chromosome mapping indicated that inter-arm and intra-chromosome duplication events contributed remarkably to the expansion of SlHsp20 genes. Phylogentic tree of Hsp20 genes from tomato and other plant species revealed that SlHsp20 genes were grouped into 13 subfamilies, indicating that these genes may have a common ancestor that generated diverse subfamilies prior to the mono-dicot split. In addition, expression analysis using RNA-seq in various tissues and developmental stages of cultivated tomato and the wild relative Solanum pimpinellifolium revealed that most of these genes (83%) were expressed in at least one stage from at least one genotype. Out of 42 genes, 4 genes were expressed constitutively in almost all the tissues analyzed, implying that these genes might have specific housekeeping function in tomato cell under normal growth conditions. Two SlHsp20 genes displayed differential expression levels between cultivated tomato and S. pimpinellifolium in vegetative (leaf and root) and reproductive organs (floral bud and flower), suggesting inter-species diversification for functional specialization during the process of domestication. Based on genome-wide microarray analysis, we showed that the transcript levels of SlHsp20

  19. Genome-Wide Identification and Expression Profiling of Tomato Hsp20 Gene Family in Response to Biotic and Abiotic Stresses.

    PubMed

    Yu, Jiahong; Cheng, Yuan; Feng, Kun; Ruan, Meiying; Ye, Qingjing; Wang, Rongqing; Li, Zhimiao; Zhou, Guozhi; Yao, Zhuping; Yang, Yuejian; Wan, Hongjian

    2016-01-01

    The Hsp20 genes are involved in the response of plants to environment stresses including heat shock and also play a vital role in plant growth and development. They represent the most abundant small heat shock proteins (sHsps) in plants, but little is known about this family in tomato (Solanum lycopersicum), an important vegetable crop in the world. Here, we characterized heat shock protein 20 (SlHsp20) gene family in tomato through integration of gene structure, chromosome location, phylogenetic relationship, and expression profile. Using bioinformatics-based methods, we identified at least 42 putative SlHsp20 genes in tomato. Sequence analysis revealed that most of SlHsp20 genes possessed no intron or a relatively short intron in length. Chromosome mapping indicated that inter-arm and intra-chromosome duplication events contributed remarkably to the expansion of SlHsp20 genes. Phylogentic tree of Hsp20 genes from tomato and other plant species revealed that SlHsp20 genes were grouped into 13 subfamilies, indicating that these genes may have a common ancestor that generated diverse subfamilies prior to the mono-dicot split. In addition, expression analysis using RNA-seq in various tissues and developmental stages of cultivated tomato and the wild relative Solanum pimpinellifolium revealed that most of these genes (83%) were expressed in at least one stage from at least one genotype. Out of 42 genes, 4 genes were expressed constitutively in almost all the tissues analyzed, implying that these genes might have specific housekeeping function in tomato cell under normal growth conditions. Two SlHsp20 genes displayed differential expression levels between cultivated tomato and S. pimpinellifolium in vegetative (leaf and root) and reproductive organs (floral bud and flower), suggesting inter-species diversification for functional specialization during the process of domestication. Based on genome-wide microarray analysis, we showed that the transcript levels of SlHsp20

  20. Genome-Wide Identification and Expression Profiling of Tomato Hsp20 Gene Family in Response to Biotic and Abiotic Stresses.

    PubMed

    Yu, Jiahong; Cheng, Yuan; Feng, Kun; Ruan, Meiying; Ye, Qingjing; Wang, Rongqing; Li, Zhimiao; Zhou, Guozhi; Yao, Zhuping; Yang, Yuejian; Wan, Hongjian

    2016-01-01

    The Hsp20 genes are involved in the response of plants to environment stresses including heat shock and also play a vital role in plant growth and development. They represent the most abundant small heat shock proteins (sHsps) in plants, but little is known about this family in tomato (Solanum lycopersicum), an important vegetable crop in the world. Here, we characterized heat shock protein 20 (SlHsp20) gene family in tomato through integration of gene structure, chromosome location, phylogenetic relationship, and expression profile. Using bioinformatics-based methods, we identified at least 42 putative SlHsp20 genes in tomato. Sequence analysis revealed that most of SlHsp20 genes possessed no intron or a relatively short intron in length. Chromosome mapping indicated that inter-arm and intra-chromosome duplication events contributed remarkably to the expansion of SlHsp20 genes. Phylogentic tree of Hsp20 genes from tomato and other plant species revealed that SlHsp20 genes were grouped into 13 subfamilies, indicating that these genes may have a common ancestor that generated diverse subfamilies prior to the mono-dicot split. In addition, expression analysis using RNA-seq in various tissues and developmental stages of cultivated tomato and the wild relative Solanum pimpinellifolium revealed that most of these genes (83%) were expressed in at least one stage from at least one genotype. Out of 42 genes, 4 genes were expressed constitutively in almost all the tissues analyzed, implying that these genes might have specific housekeeping function in tomato cell under normal growth conditions. Two SlHsp20 genes displayed differential expression levels between cultivated tomato and S. pimpinellifolium in vegetative (leaf and root) and reproductive organs (floral bud and flower), suggesting inter-species diversification for functional specialization during the process of domestication. Based on genome-wide microarray analysis, we showed that the transcript levels of SlHsp20

  1. Why Breeding Values Estimated Using Familial Data Should Not Be Used for Genome-Wide Association Studies

    PubMed Central

    Ekine, Chinyere C.; Rowe, Suzanne J.; Bishop, Stephen C.; de Koning, Dirk-Jan

    2013-01-01

    In animal breeding, the genetic potential of an animal is summarized as its estimated breeding value, which is derived from its own performance as well as the performance of related individuals. Here, we illustrate why estimated breeding values are not suitable as a phenotype for genome-wide association studies. We simulated human-type and pig-type pedigrees with a range of quantitative trait loci (QTL) effects (0.5–3% of phenotypic variance) and heritabilities (0.3−0.8). We analyzed 1000 replicates of each scenario with four models: (a) a full mixed model including a polygenic effect, (b) a regression analysis using the residual of a mixed model as a trait score (so called GRAMMAR approach), (c) a regression analysis using the estimated breeding value as a trait score, and (d) a regression analysis that uses the raw phenotype as a trait score. We show that using breeding values as a trait score gives very high false-positive rates (up 14% in human pedigrees and >60% in pig pedigrees). Simulations based on a real pedigree show that additional generations of pedigree increase the type I error. Including the family relationship as a random effect provides the greatest power to detect QTL while controlling for type I error at the desired level and providing the most accurate estimates of the QTL effect. Both the use of residuals and the use of breeding values result in deflated estimates of the QTL effect. We derive the contributions of QTL effects to the breeding value and residual and show how this affects the estimates. PMID:24362310

  2. Genome-Wide Analysis and Evolution of the Pto-Like Protein Kinase (PLPK) Gene Family in Pepper

    PubMed Central

    Venkatesh, Jelli; Jahn, Molly; Kang, Byoung-Cheorl

    2016-01-01

    The tomato Pto gene, which encodes a serine/threonine kinase (STK) domain-containing protein, confers resistance to bacterial speck disease caused by Pseudomonas syringae pv. tomato (Pst). In this study, in vivo recognition assays using PVX constructs showed that AvrPto was specifically recognized in the pepper genotypes. This AvrPto recognition caused a nonhost hypersensitive response (HR) and localization of the PVX::AvrPto fusion protein to inoculated pepper leaf tissues, which indicates the presence of a similar Pto recognition mechanism in pepper as in tomato. However, genome-wide analysis in pepper revealed no Pto clade corresponding to that in tomato, suggesting an alternative system for Pto recognition in pepper. Nevertheless, 25 Pto-like protein kinases (PLPKs) with a highly conserved STK domain have been identified in the pepper genome. For the majority of the amino acid sites in the STK domain of Ptos and PLPKs, nonsynonymous (dN) to synonymous (dS) nucleotide substitution ratios (ω) were less than one, suggesting that purifying selection played a predominant role in the evolutionary process. However, some amino acid sites were found to be subjected to episodic positive selection in the course of evolution of Pto homologs, and, thus, different evolutionary processes might have shaped the Pto gene family in plants. Based on RNA-seq data, PLPK genes and other Pto pathway genes, such as Prf, Pti1, Pti5, and Pti6 were expressed in all tested pepper genotypes. Therefore, the nonhost HR against Pst in pepper may be due to the recognition of the AvrPto effector by a PLPK homolog, and subsequent action of downstream components of the Pto signaling pathway. However, the possibility remains that the recognition of AvrPto in pepper plants may involve activities of other receptor like kinases (RLKs). The identification of the PLPKs in this study will serve as a foundation for further efforts to understand the roles of PLPKs in nonhost resistance. PMID:27536870

  3. Genome-Wide Analysis and Evolution of the Pto-Like Protein Kinase (PLPK) Gene Family in Pepper.

    PubMed

    Venkatesh, Jelli; Jahn, Molly; Kang, Byoung-Cheorl

    2016-01-01

    The tomato Pto gene, which encodes a serine/threonine kinase (STK) domain-containing protein, confers resistance to bacterial speck disease caused by Pseudomonas syringae pv. tomato (Pst). In this study, in vivo recognition assays using PVX constructs showed that AvrPto was specifically recognized in the pepper genotypes. This AvrPto recognition caused a nonhost hypersensitive response (HR) and localization of the PVX::AvrPto fusion protein to inoculated pepper leaf tissues, which indicates the presence of a similar Pto recognition mechanism in pepper as in tomato. However, genome-wide analysis in pepper revealed no Pto clade corresponding to that in tomato, suggesting an alternative system for Pto recognition in pepper. Nevertheless, 25 Pto-like protein kinases (PLPKs) with a highly conserved STK domain have been identified in the pepper genome. For the majority of the amino acid sites in the STK domain of Ptos and PLPKs, nonsynonymous (dN) to synonymous (dS) nucleotide substitution ratios (ω) were less than one, suggesting that purifying selection played a predominant role in the evolutionary process. However, some amino acid sites were found to be subjected to episodic positive selection in the course of evolution of Pto homologs, and, thus, different evolutionary processes might have shaped the Pto gene family in plants. Based on RNA-seq data, PLPK genes and other Pto pathway genes, such as Prf, Pti1, Pti5, and Pti6 were expressed in all tested pepper genotypes. Therefore, the nonhost HR against Pst in pepper may be due to the recognition of the AvrPto effector by a PLPK homolog, and subsequent action of downstream components of the Pto signaling pathway. However, the possibility remains that the recognition of AvrPto in pepper plants may involve activities of other receptor like kinases (RLKs). The identification of the PLPKs in this study will serve as a foundation for further efforts to understand the roles of PLPKs in nonhost resistance. PMID:27536870

  4. Why breeding values estimated using familial data should not be used for genome-wide association studies.

    PubMed

    Ekine, Chinyere C; Rowe, Suzanne J; Bishop, Stephen C; de Koning, Dirk-Jan

    2014-02-19

    In animal breeding, the genetic potential of an animal is summarized as its estimated breeding value, which is derived from its own performance as well as the performance of related individuals. Here, we illustrate why estimated breeding values are not suitable as a phenotype for genome-wide association studies. We simulated human-type and pig-type pedigrees with a range of quantitative trait loci (QTL) effects (0.5-3% of phenotypic variance) and heritabilities (0.3-0.8). We analyzed 1000 replicates of each scenario with four models: (a) a full mixed model including a polygenic effect, (b) a regression analysis using the residual of a mixed model as a trait score (so called GRAMMAR approach), (c) a regression analysis using the estimated breeding value as a trait score, and (d) a regression analysis that uses the raw phenotype as a trait score. We show that using breeding values as a trait score gives very high false-positive rates (up 14% in human pedigrees and >60% in pig pedigrees). Simulations based on a real pedigree show that additional generations of pedigree increase the type I error. Including the family relationship as a random effect provides the greatest power to detect QTL while controlling for type I error at the desired level and providing the most accurate estimates of the QTL effect. Both the use of residuals and the use of breeding values result in deflated estimates of the QTL effect. We derive the contributions of QTL effects to the breeding value and residual and show how this affects the estimates.

  5. Comparative phylogenetic analysis of genome-wide Mlo gene family members from Glycine max and Arabidopsis thaliana.

    PubMed

    Deshmukh, Reena; Singh, V K; Singh, B D

    2014-06-01

    Powdery mildew locus O (Mlo) gene family is one of the largest seven transmembrane protein-encoding gene families. The Mlo proteins act as negative regulators of powdery mildew resistance and a loss-of-function mutation in Mlo is known to confer broad-spectrum resistance to powdery mildew. In addition, the Mlo gene family members are known to participate in various developmental and biotic and abiotic stress response-related pathways. Therefore, a genome-wide similarity search using the characterized Mlo protein sequences of Arabidopsis thaliana was carried out to identify putative Mlo genes in soybean (Glycine max) genome. This search identified 39 Mlo domain containing protein-encoding genes that were distributed on 15 of the 20 G. max chromosomes. The putative promoter regions of these Mlo genes contained response elements for different external stimuli, including different hormones and abiotic stresses. Of the 39 GmMlo proteins, 35 were rich (8.7-13.1 %) in leucine, while five were serine-rich (9.2-11.9 %). Furthermore, all the GmMlo members were localized in the plasma membrane. Phylogenetic analysis of the GmMlo and the AtMlo proteins classified them into three main clusters, and the cluster I comprised two sub-clusters. Multiple sequence alignment visualized the location of seven transmembrane domains, and a conserved CaM-binding domain. Some of the GmMlo proteins (GmMlo10, 20, 22, 23, 32, 36, 37) contained less than seven transmembrane domains. The motif analysis yielded 27 motifs; out of these, motif 2, the only motif present in all the GmMlos, was highly conserved and three amino acid residues were essentially invariant. Five of the GmMlo members were much smaller in size; presumably they originated through deletion following a gene duplication event. The presence of a large number of GmMlo members in the G. max genome may be due to its paleopolyploid nature and the large genome size as compared to that of Arabidopsis. The findings of this study may

  6. Genome-wide DNA methylation detection by MethylCap-seq and Infinium HumanMethylation450 BeadChips: an independent large-scale comparison

    PubMed Central

    De Meyer, Tim; Bady, Pierre; Trooskens, Geert; Kurscheid, Sebastian; Bloch, Jocelyne; Kros, Johan M.; Hainfellner, Johannes A.; Stupp, Roger; Delorenzi, Mauro; Hegi, Monika E.; Van Criekinge, Wim

    2015-01-01

    Two cost-efficient genome-scale methodologies to assess DNA-methylation are MethylCap-seq and Illumina’s Infinium HumanMethylation450 BeadChips (HM450). Objective information regarding the best-suited methodology for a specific research question is scant. Therefore, we performed a large-scale evaluation on a set of 70 brain tissue samples, i.e. 65 glioblastoma and 5 non-tumoral tissues. As MethylCap-seq coverages were limited, we focused on the inherent capacity of the methodology to detect methylated loci rather than a quantitative analysis. MethylCap-seq and HM450 data were dichotomized and performances were compared using a gold standard free Bayesian modelling procedure. While conditional specificity was adequate for both approaches, conditional sensitivity was systematically higher for HM450. In addition, genome-wide characteristics were compared, revealing that HM450 probes identified substantially fewer regions compared to MethylCap-seq. Although results indicated that the latter method can detect more potentially relevant DNA-methylation, this did not translate into the discovery of more differentially methylated loci between tumours and controls compared to HM450. Our results therefore indicate that both methodologies are complementary, with a higher sensitivity for HM450 and a far larger genome-wide coverage for MethylCap-seq, but also that a more comprehensive character does not automatically imply more significant results in biomarker studies. PMID:26482909

  7. First all-in-one diagnostic tool for DNA intelligence: genome-wide inference of biogeographic ancestry, appearance, relatedness, and sex with the Identitas v1 Forensic Chip.

    PubMed

    Keating, Brendan; Bansal, Aruna T; Walsh, Susan; Millman, Jonathan; Newman, Jonathan; Kidd, Kenneth; Budowle, Bruce; Eisenberg, Arthur; Donfack, Joseph; Gasparini, Paolo; Budimlija, Zoran; Henders, Anjali K; Chandrupatla, Hareesh; Duffy, David L; Gordon, Scott D; Hysi, Pirro; Liu, Fan; Medland, Sarah E; Rubin, Laurence; Martin, Nicholas G; Spector, Timothy D; Kayser, Manfred

    2013-05-01

    When a forensic DNA sample cannot be associated directly with a previously genotyped reference sample by standard short tandem repeat profiling, the investigation required for identifying perpetrators, victims, or missing persons can be both costly and time consuming. Here, we describe the outcome of a collaborative study using the Identitas Version 1 (v1) Forensic Chip, the first commercially available all-in-one tool dedicated to the concept of developing intelligence leads based on DNA. The chip allows parallel interrogation of 201,173 genome-wide autosomal, X-chromosomal, Y-chromosomal, and mitochondrial single nucleotide polymorphisms for inference of biogeographic ancestry, appearance, relatedness, and sex. The first assessment of the chip's performance was carried out on 3,196 blinded DNA samples of varying quantities and qualities, covering a wide range of biogeographic origin and eye/hair coloration as well as variation in relatedness and sex. Overall, 95 % of the samples (N = 3,034) passed quality checks with an overall genotype call rate >90 % on variable numbers of available recorded trait information. Predictions of sex, direct match, and first to third degree relatedness were highly accurate. Chip-based predictions of biparental continental ancestry were on average ~94 % correct (further support provided by separately inferred patrilineal and matrilineal ancestry). Predictions of eye color were 85 % correct for brown and 70 % correct for blue eyes, and predictions of hair color were 72 % for brown, 63 % for blond, 58 % for black, and 48 % for red hair. From the 5 % of samples (N = 162) with <90 % call rate, 56 % yielded correct continental ancestry predictions while 7 % yielded sufficient genotypes to allow hair and eye color prediction. Our results demonstrate that the Identitas v1 Forensic Chip holds great promise for a wide range of applications including criminal investigations, missing person investigations, and for national security

  8. Heritability and Genome-Wide Association Studies for Hair Color in a Dutch Twin Family Based Sample

    PubMed Central

    Lin, Bochao Danae; Mbarek, Hamdi; Willemsen, Gonneke; Dolan, Conor V.; Fedko, Iryna O.; Abdellaoui, Abdel; de Geus, Eco J.; Boomsma, Dorret I.; Hottenga, Jouke-Jan

    2015-01-01

    Hair color is one of the most visible and heritable traits in humans. Here, we estimated heritability by structural equation modeling (N = 20,142), and performed a genome wide association (GWA) analysis (N = 7091) and a GCTA study (N = 3340) on hair color within a large cohort of twins, their parents and siblings from the Netherlands Twin Register (NTR). Self-reported hair color was analyzed as five binary phenotypes, namely “blond versus non-blond”, “red versus non-red”, “brown versus non-brown”, “black versus non-black”, and “light versus dark”. The broad-sense heritability of hair color was estimated between 73% and 99% and the genetic component included non-additive genetic variance. Assortative mating for hair color was significant, except for red and black hair color. From GCTA analyses, at most 24.6% of the additive genetic variance in hair color was explained by 1000G well-imputed SNPs. Genome-wide association analysis for each hair color showed that SNPs in the MC1R region were significantly associated with red, brown and black hair, and also with light versus dark hair color. Five other known genes (HERC2, TPCN2, SLC24A4, IRF4, and KITLG) gave genome-wide significant hits for blond, brown and light versus dark hair color. We did not find and replicate any new loci for hair color. PMID:26184321

  9. CHIP-BASED DIRECT GENOTYPING OF CODING VARIANTS IN GENOME WIDE ASSOCIATION STUDIES: UTILITY, ISSUES AND PROSPECTS

    PubMed Central

    Nievergelt, Caroline M.; Wineinger, Nathan E.; Libiger, Ondrej; Pham, Phillip; Zhang, Guangfa; Baker, Dewleen G.; Schork, Nicholas J.

    2014-01-01

    There is considerable debate about the most efficient way to interrogate rare coding variants in association studies. The options include direct genotyping of specific known coding variants in genes or, alternatively, sequencing across the entire exome to capture known as well as novel variants. Each strategy has advantages and disadvantages, but the availability of cost-efficient exome arrays has made the former appealing. Here we consider the utility of a direct genotyping chip, the Illumina HumanExome array (HE), by evaluating its content based on: 1. functionality; and 2. amenability to imputation. We explored these issues by genotyping a large, ethnically diverse cohort on the HumanOmniExpressExome array (HOEE) which combines the HE with content from the GWAS array (HOE). We find that the use of the HE is likely to be a cost-effective way of expanding GWAS, but does have some drawbacks that deserve consideration when planning studies. PMID:24521671

  10. Genome-wide identification and analysis of FK506-binding protein family gene family in strawberry (Fragaria × ananassa).

    PubMed

    Leng, Xiangpeng; Liu, Dan; Zhao, Mizhen; Sun, Xin; Li, Yu; Mu, Qian; Zhu, Xudong; Li, Pengyu; Fang, Jinggui

    2014-01-25

    The FK506 binding proteins (FKBPs) are abundant and ubiquitous proteins belonging to the large peptidyl-prolylcis-trans isomerase superfamily. FKBPs are known to be involved in many biological processes including hormone signaling, plant growth, and stress responses through a chaperone or an isomerization of proline residues during protein folding. The availability of complete strawberry genome sequences allowed the identification of 23 FKBP genes by HMMER and blast analysis. Chromosome scaffold locations of these FKBP genes in the strawberry genome were determined and the protein domain and motif organization of FaFKBPs analyzed. The phylogenetic relationships between strawberry FKBPs were also assessed. The expression profiles of FaFKBPs genes results revealed that most FaFKBPs were expressed in all tissues, while a few FaFKBPs were specifically expressed in some of the tissues. These data not only contribute to some better understanding of the complex regulation of the strawberry FKBP gene family, but also provide valuable information for further research in strawberry functional genomics. PMID:24230972

  11. Genome-wide identification and analysis of FK506-binding protein family gene family in strawberry (Fragaria × ananassa).

    PubMed

    Leng, Xiangpeng; Liu, Dan; Zhao, Mizhen; Sun, Xin; Li, Yu; Mu, Qian; Zhu, Xudong; Li, Pengyu; Fang, Jinggui

    2014-01-25

    The FK506 binding proteins (FKBPs) are abundant and ubiquitous proteins belonging to the large peptidyl-prolylcis-trans isomerase superfamily. FKBPs are known to be involved in many biological processes including hormone signaling, plant growth, and stress responses through a chaperone or an isomerization of proline residues during protein folding. The availability of complete strawberry genome sequences allowed the identification of 23 FKBP genes by HMMER and blast analysis. Chromosome scaffold locations of these FKBP genes in the strawberry genome were determined and the protein domain and motif organization of FaFKBPs analyzed. The phylogenetic relationships between strawberry FKBPs were also assessed. The expression profiles of FaFKBPs genes results revealed that most FaFKBPs were expressed in all tissues, while a few FaFKBPs were specifically expressed in some of the tissues. These data not only contribute to some better understanding of the complex regulation of the strawberry FKBP gene family, but also provide valuable information for further research in strawberry functional genomics.

  12. Genome-wide analysis of tandem repeats in Tribolium castaneum genome reveals abundant and highly dynamic tandem repeat families with satellite DNA features in euchromatic chromosomal arms.

    PubMed

    Pavlek, Martina; Gelfand, Yevgeniy; Plohl, Miroslav; Meštrović, Nevenka

    2015-12-01

    Although satellite DNAs are well-explored components of heterochromatin and centromeres, little is known about emergence, dispersal and possible impact of comparably structured tandem repeats (TRs) on the genome-wide scale. Our bioinformatics analysis of assembled Tribolium castaneum genome disclosed significant contribution of TRs in euchromatic chromosomal arms and clear predominance of satellite DNA-typical 170 bp monomers in arrays of ≥5 repeats. By applying different experimental approaches, we revealed that the nine most prominent TR families Cast1-Cast9 extracted from the assembly comprise ∼4.3% of the entire genome and reside almost exclusively in euchromatic regions. Among them, seven families that build ∼3.9% of the genome are based on ∼170 and ∼340 bp long monomers. Results of phylogenetic analyses of 2500 monomers originating from these families show high-sequence dynamics, evident by extensive exchanges between arrays on non-homologous chromosomes. In addition, our analysis shows that concerted evolution acts more efficiently on longer than on shorter arrays. Efficient genome-wide distribution of nine TR families implies the role of transposition only in expansion of the most dispersed family, and involvement of other mechanisms is anticipated. Despite similarities in sequence features, FISH experiments indicate high-level compartmentalization of centromeric and euchromatic tandem repeats.

  13. Investigation of transferability of BovineSNP50 BeadChip from cattle to water buffalo for genome wide association study.

    PubMed

    Wu, Jun Jing; Song, Li Jun; Wu, Fang Jie; Liang, Xian Wei; Yang, Bing Zhuang; Wathes, D Claire; Pollott, Geoff E; Cheng, Zhangrui; Shi, De Shun; Liu, Qing You; Yang, Li Guo; Zhang, Shu Jun

    2013-02-01

    Cattle and water buffalo belong to the same subfamily Bovinae and share chromosome banding and gene order homology. In this study, we used genome-wide Illumina BovineSNP50 BeadChip to analyze 91 DNA samples from three breeds of water buffalo (Nili-Ravi, Murrah and their crossbred with local GuangXi buffalos in China), to demonstrate the genetic divergence between cattle and water buffalo through a large single nucleotide polymorphism (SNP) transferability study at the whole genome level, and performed association analysis of functional traits in water buffalo as well. A total of 40,766 (75.5 %) bovine SNPs were found in the water buffalo genome, but 49,936 (92.5 %) were with only one allele, and finally 935 were identified to be polymorphic and useful for association analysis in water buffalo. Therefore, the genome sequences of water buffalo and cattle shared a high level of homology but the polymorphic status of the bovine SNPs varied between these two species. The different patterns of mutations between species may associate with their phenotypic divergence due to genome evolution. Among 935 bovine SNPs, we identified a total of 9 and 7 SNPs significantly associated to fertility and milk production traits in water buffalo, respectively. However, more works in larger sample size are needed in future to verify these candidate SNPs for water buffalo.

  14. A Genome Wide Association Study on Age at First Calving Using High Density Single Nucleotide Polymorphism Chips in Hanwoo (Bos taurus coreanae)

    PubMed Central

    Hyeong, K.-E.; Iqbal, A.; Kim, J.-J.

    2014-01-01

    Age at first calving is an important trait for achieving earlier reproductive performance. To detect quantitative trait loci (QTL) for reproductive traits, a genome wide association study was conducted on the 96 Hanwoo cows that were born between 2008 and 2010 from 13 sires in a local farm (Juk-Am Hanwoo farm, Suncheon, Korea) and genotyped with the Illumina 50K bovine single nucleotide polymorphism (SNP) chips. Phenotypes were regressed on additive and dominance effects for each SNP using a simple linear regression model after the effects of birth-year-month and polygenes were considered. A forward regression procedure was applied to determine the best set of SNPs for age at first calving. A total of 15 QTL were detected at the comparison-wise 0.001 level. Two QTL with strong statistical evidence were found at 128.9 Mb and 111.1 Mb on bovine chromosomes (BTA) 2 and 7, respectively, each of which accounted for 22% of the phenotypic variance. Also, five significant SNPs were detected on BTAs 10, 16, 20, 26, and 29. Multiple QTL were found on BTAs 1, 2, 7, and 14. The significant QTLs may be applied via marker assisted selection to increase rate of genetic gain for the trait, after validation tests in other Hanwoo cow populations. PMID:25178291

  15. Genome-wide linkage analysis of QTL for growth and body composition employing the PorcineSNP60 BeadChip

    PubMed Central

    2012-01-01

    Background The traditional strategy to map QTL is to use linkage analysis employing a limited number of markers. These analyses report wide QTL confidence intervals, making very difficult to identify the gene and polymorphisms underlying the QTL effects. The arrival of genome-wide panels of SNPs makes available thousands of markers increasing the information content and therefore the likelihood of detecting and fine mapping QTL regions. The aims of the current study are to confirm previous QTL regions for growth and body composition traits in different generations of an Iberian x Landrace intercross (IBMAP) and especially identify new ones with narrow confidence intervals by employing the PorcineSNP60 BeadChip in linkage analyses. Results Three generations (F3, Backcross 1 and Backcross 2) of the IBMAP and their related animals were genotyped with PorcineSNP60 BeadChip. A total of 8,417 SNPs equidistantly distributed across autosomes were selected after filtering by quality, position and frequency to perform the QTL scan. The joint and separate analyses of the different IBMAP generations allowed confirming QTL regions previously identified in chromosomes 4 and 6 as well as new ones mainly for backfat thickness in chromosomes 4, 5, 11, 14 and 17 and shoulder weight in chromosomes 1, 2, 9 and 13; and many other to the chromosome-wide signification level. In addition, most of the detected QTLs displayed narrow confidence intervals, making easier the selection of positional candidate genes. Conclusions The use of higher density of markers has allowed to confirm results obtained in previous QTL scans carried out with microsatellites. Moreover several new QTL regions have been now identified in regions probably not covered by markers in previous scans, most of these QTLs displayed narrow confidence intervals. Finally, prominent putative biological and positional candidate genes underlying those QTL effects are listed based on recent porcine genome annotation. PMID

  16. Family-based Association Analyses of Imputed Genotypes Reveal Genome-Wide Significant Association of Alzheimer’s disease with OSBPL6, PTPRG and PDCL3

    PubMed Central

    Herold, Christine; Hooli, Basavaraj V.; Mullin, Kristina; Liu, Tian; Roehr, Johannes T; Mattheisen, Manuel; Parrado, Antonio R.; Bertram, Lars; Lange, Christoph; Tanzi, Rudolph E.

    2015-01-01

    The genetic basis of Alzheimer's disease (AD) is complex and heterogeneous. Over 200 highly penetrant pathogenic variants in the genes APP, PSEN1 and PSEN2 cause a subset of early-onset familial Alzheimer's disease (EOFAD). On the other hand, susceptibility to late-onset forms of AD (LOAD) is indisputably associated to the ε4 allele in the gene APOE, and more recently to variants in more than two-dozen additional genes identified in the large-scale genome-wide association studies (GWAS) and meta-analyses reports. Taken together however, although the heritability in AD is estimated to be as high as 80%, a large proportion of the underlying genetic factors still remain to be elucidated. In this study we performed a systematic family-based genome-wide association and meta-analysis on close to 15 million imputed variants from three large collections of AD families (~3,500 subjects from 1,070 families). Using a multivariate phenotype combining affection status and onset age, meta-analysis of the association results revealed three single nucleotide polymorphisms (SNPs) that achieved genome-wide significance for association with AD risk: rs7609954 in the gene PTPRG (P-value = 3.98·10−08), rs1347297 in the gene OSBPL6 (P-value = 4.53·10−08), and rs1513625 near PDCL3 (P-value = 4.28·10−08). In addition, rs72953347 in OSBPL6 (P-value = 6.36·10−07) and two SNPs in the gene CDKAL1 showed marginally significant association with LOAD (rs10456232, P-value: 4.76·10−07; rs62400067, P-value: 3.54·10−07). In summary, family-based GWAS meta-analysis of imputed SNPs revealed novel genomic variants in (or near) PTPRG, OSBPL6, and PDCL3 that influence risk for AD with genome-wide significance. PMID:26830138

  17. A genome-wide analysis of the ASYMMETRIC LEAVES2/LATERAL ORGAN BOUNDARIES (AS2/LOB) gene family in barley (Hordeum vulgare L.)* #

    PubMed Central

    Guo, Bao-jian; Wang, Jun; Lin, Shen; Tian, Zheng; Zhou, Kai; Luan, Hai-ye; Lyu, Chao; Zhang, Xin-zhong; Xu, Ru-gen

    2016-01-01

    ASYMMETRIC LEAVES2/LATERAL ORGAN BOUNDARIES (AS2/LOB) genes are a family of plant specific transcription factors, which play an important role in the regulation of plant lateral organ development and metabolism. However, a genome-wide analysis of the AS2/LOB gene family is still not available for barley. In the present study, 24 AS2-like (ASL)/LOB domain (LBD) genes were identified based on the barley (Hordeum vulgare L.) genome sequence. A phylogenetic tree of ASL/LBD proteins from barley, Arabidopsis, maize, and rice was constructed. The ASL/LBD genes were classified into two classes, class I and class II, which were divided into five and two subgroups, respectively. Genes homologous in barley and Arabidopsis were analyzed. In addition, the structure and chromosomal locations of the genes were analyzed. Expression profiles indicated that barley HvASL/LBD genes exhibit a variety of expression patterns, suggesting that they are involved in various aspects of physiological and developmental processes. This genome-wide analysis of the barley AS2/LOB gene family contributes to our understanding of the functions of the AS2/LOB gene family. PMID:27704746

  18. Integration of Genome-Wide Computation DRE Search, AhR ChIP-chip and Gene Expression Analyses of TCDD-Elicited Responses in the Mouse Liver

    PubMed Central

    2011-01-01

    Background The aryl hydrocarbon receptor (AhR) is a ligand-activated transcription factor (TF) that mediates responses to 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD). Integration of TCDD-induced genome-wide AhR enrichment, differential gene expression and computational dioxin response element (DRE) analyses further elucidate the hepatic AhR regulatory network. Results Global ChIP-chip and gene expression analyses were performed on hepatic tissue from immature ovariectomized mice orally gavaged with 30 μg/kg TCDD. ChIP-chip analysis identified 14,446 and 974 AhR enriched regions (1% false discovery rate) at 2 and 24 hrs, respectively. Enrichment density was greatest in the proximal promoter, and more specifically, within ± 1.5 kb of a transcriptional start site (TSS). AhR enrichment also occurred distal to a TSS (e.g. intergenic DNA and 3' UTR), extending the potential gene expression regulatory roles of the AhR. Although TF binding site analyses identified over-represented DRE sequences within enriched regions, approximately 50% of all AhR enriched regions lacked a DRE core (5'-GCGTG-3'). Microarray analysis identified 1,896 number of TCDD-responsive genes (|fold change| ≥ 1.5, P1(t) > 0.999). Integrating this gene expression data with our ChIP-chip and DRE analyses only identified 625 differentially expressed genes that involved an AhR interaction at a DRE. Functional annotation analysis of differentially regulated genes associated with AhR enrichment identified overrepresented processes related to fatty acid and lipid metabolism and transport, and xenobiotic metabolism, which are consistent with TCDD-elicited steatosis in the mouse liver. Conclusions Details of the AhR regulatory network have been expanded to include AhR-DNA interactions within intragenic and intergenic genomic regions. Moreover, the AhR can interact with DNA independent of a DRE core suggesting there are alternative mechanisms of AhR-mediated gene regulation. PMID:21762485

  19. Genome-wide identification of multifunctional laccase gene family in cotton (Gossypium spp.); expression and biochemical analysis during fiber development

    PubMed Central

    Balasubramanian, Vimal Kumar; Rai, Krishan Mohan; Thu, Sandi Win; Hii, Mei Mei; Mendu, Venugopal

    2016-01-01

    The single-celled cotton fibers, produced from seed coat epidermal cells are the largest natural source of textile fibers. The economic value of cotton fiber lies in its length and quality. The multifunctional laccase enzymes play important roles in cell elongation, lignification and pigmentation in plants and could play crucial role in cotton fiber quality. Genome-wide analysis of cultivated allotetraploid (G. hirsutum) and its progenitor diploid (G. arboreum and G. raimondii) cotton species identified 84, 44 and 46 laccase genes, respectively. Analysis of chromosomal location, phylogeny, conserved domain and physical properties showed highly conserved nature of laccases across three cotton species. Gene expression, enzymatic activity and biochemical analysis of developing cotton fibers was performed using G. arboreum species. Of the total 44, 40 laccases showed expression during different stages of fiber development. The higher enzymatic activity of laccases correlated with higher lignin content at 25 DPA (Days Post Anthesis). Further, analysis of cotton fiber phenolic compounds showed an overall decrease at 25 DPA indicating possible incorporation of these substrates into lignin polymer during secondary cell wall biosynthesis. Overall data indicate significant roles of laccases in cotton fiber development, and presents an excellent opportunity for manipulation of fiber development and quality. PMID:27679939

  20. Genome-Wide Comparative Analyses Reveal the Dynamic Evolution of Nucleotide-Binding Leucine-Rich Repeat Gene Family among Solanaceae Plants

    PubMed Central

    Seo, Eunyoung; Kim, Seungill; Yeom, Seon-In; Choi, Doil

    2016-01-01

    Plants have evolved an elaborate innate immune system against invading pathogens. Within this system, intracellular nucleotide-binding leucine-rich repeat (NLR) immune receptors are known play critical roles in effector-triggered immunity (ETI) plant defense. We performed genome-wide identification and classification of NLR-coding sequences from the genomes of pepper, tomato, and potato using fixed criteria. We then compared genomic duplication and evolution features. We identified intact 267, 443, and 755 NLR-encoding genes in tomato, potato, and pepper genomes, respectively. Phylogenetic analysis and classification of Solanaceae NLRs revealed that the majority of NLR super family members fell into 14 subgroups, including a TIR-NLR (TNL) subgroup and 13 non-TNL subgroups. Specific subgroups have expanded in each genome, with the expansion in pepper showing subgroup-specific physical clusters. Comparative analysis of duplications showed distinct duplication patterns within pepper and among Solanaceae plants suggesting subgroup- or species-specific gene duplication events after speciation, resulting in divergent evolution. Taken together, genome-wide analysis of NLR family members provide insights into their evolutionary history in Solanaceae. These findings also provide important foundational knowledge for understanding NLR evolution and will empower broader characterization of disease resistance genes to be used for crop breeding. PMID:27559340

  1. Genome-Wide Comparative Analyses Reveal the Dynamic Evolution of Nucleotide-Binding Leucine-Rich Repeat Gene Family among Solanaceae Plants.

    PubMed

    Seo, Eunyoung; Kim, Seungill; Yeom, Seon-In; Choi, Doil

    2016-01-01

    Plants have evolved an elaborate innate immune system against invading pathogens. Within this system, intracellular nucleotide-binding leucine-rich repeat (NLR) immune receptors are known play critical roles in effector-triggered immunity (ETI) plant defense. We performed genome-wide identification and classification of NLR-coding sequences from the genomes of pepper, tomato, and potato using fixed criteria. We then compared genomic duplication and evolution features. We identified intact 267, 443, and 755 NLR-encoding genes in tomato, potato, and pepper genomes, respectively. Phylogenetic analysis and classification of Solanaceae NLRs revealed that the majority of NLR super family members fell into 14 subgroups, including a TIR-NLR (TNL) subgroup and 13 non-TNL subgroups. Specific subgroups have expanded in each genome, with the expansion in pepper showing subgroup-specific physical clusters. Comparative analysis of duplications showed distinct duplication patterns within pepper and among Solanaceae plants suggesting subgroup- or species-specific gene duplication events after speciation, resulting in divergent evolution. Taken together, genome-wide analysis of NLR family members provide insights into their evolutionary history in Solanaceae. These findings also provide important foundational knowledge for understanding NLR evolution and will empower broader characterization of disease resistance genes to be used for crop breeding.

  2. Genome-Wide Comparative Analyses Reveal the Dynamic Evolution of Nucleotide-Binding Leucine-Rich Repeat Gene Family among Solanaceae Plants.

    PubMed

    Seo, Eunyoung; Kim, Seungill; Yeom, Seon-In; Choi, Doil

    2016-01-01

    Plants have evolved an elaborate innate immune system against invading pathogens. Within this system, intracellular nucleotide-binding leucine-rich repeat (NLR) immune receptors are known play critical roles in effector-triggered immunity (ETI) plant defense. We performed genome-wide identification and classification of NLR-coding sequences from the genomes of pepper, tomato, and potato using fixed criteria. We then compared genomic duplication and evolution features. We identified intact 267, 443, and 755 NLR-encoding genes in tomato, potato, and pepper genomes, respectively. Phylogenetic analysis and classification of Solanaceae NLRs revealed that the majority of NLR super family members fell into 14 subgroups, including a TIR-NLR (TNL) subgroup and 13 non-TNL subgroups. Specific subgroups have expanded in each genome, with the expansion in pepper showing subgroup-specific physical clusters. Comparative analysis of duplications showed distinct duplication patterns within pepper and among Solanaceae plants suggesting subgroup- or species-specific gene duplication events after speciation, resulting in divergent evolution. Taken together, genome-wide analysis of NLR family members provide insights into their evolutionary history in Solanaceae. These findings also provide important foundational knowledge for understanding NLR evolution and will empower broader characterization of disease resistance genes to be used for crop breeding. PMID:27559340

  3. Genome-wide investigation and expression analysis of AP2-ERF gene family in salt tolerant common bean

    PubMed Central

    Kavas, Musa; Kizildogan, Aslihan; Gökdemir, Gökhan; Baloglu, Mehmet Cengiz

    2015-01-01

    Apetala2-ethylene-responsive element binding factor (AP2-ERF) superfamily with common AP2-DNA binding domain have developmentally and physiologically important roles in plants. Since common bean genome project has been completed recently, it is possible to identify all of the AP2-ERF genes in the common bean genome. In this study, a comprehensive genome-wide in silico analysis identified 180 AP2-ERF superfamily genes in common bean (Phaseolus vulgaris). Based on the amino acid alignment and phylogenetic analyses, superfamily members were classified into four subfamilies: DREB (54), ERF (95), AP2 (27) and RAV (3), as well as one soloist. The physical and chemical characteristics of amino acids, interaction between AP2-ERF proteins, cis elements of promoter region of AP2-ERF genes and phylogenetic trees were predicted and analyzed. Additionally, expression levels of AP2-ERF genes were evaluated by in silico and qRT-PCR analyses. In silico micro-RNA target transcript analyses identified nearly all PvAP2-ERF genes as targets of by 44 different plant species' miRNAs were identified in this study. The most abundant target genes were PvAP2/ERF-20-25-62-78-113-173. miR156, miR172 and miR838 were the most important miRNAs found in targeting and BLAST analyses. Interactome analysis revealed that the transcription factor PvAP2-ERF78, an ortholog of Arabidopsis At2G28550, was potentially interacted with at least 15 proteins, indicating that it was very important in transcriptional regulation. Here we present the first study to identify and characterize the AP2-ERF transcription factors in common bean using whole-genome analysis, and the findings may serve as a references for future functional research on the transcription factors in common bean. PMID:27152109

  4. A genome-wide linkage scan of bipolar disorder in Latino families identifies susceptibility loci at 8q24 and 14q32.

    PubMed

    Gonzalez, Suzanne; Camarillo, Cynthia; Rodriguez, Marco; Ramirez, Mercedes; Zavala, Juan; Armas, Regina; Contreras, Salvador A; Contreras, Javier; Dassori, Albana; Almasy, Laura; Flores, Deborah; Jerez, Alvaro; Raventós, Henriette; Ontiveros, Alfonso; Nicolini, Humberto; Escamilla, Michael

    2014-09-01

    A genome-wide nonparametric linkage screen was performed to localize Bipolar Disorder (BP) susceptibility loci in a sample of 3757 individuals of Latino ancestry. The sample included 963 individuals with BP phenotype (704 relative pairs) from 686 families recruited from the US, Mexico, Costa Rica, and Guatemala. Non-parametric analyses were performed over a 5 cM grid with an average genetic coverage of 0.67 cM. Multipoint analyses were conducted across the genome using non-parametric Kong & Cox LOD scores along with Sall statistics for all relative pairs. Suggestive and significant genome-wide thresholds were calculated based on 1000 simulations. Single-marker association tests in the presence of linkage were performed assuming a multiplicative model with a population prevalence of 2%. We identified two genome-wide significant susceptibly loci for BP at 8q24 and 14q32, and a third suggestive locus at 2q13-q14. Within these three linkage regions, the top associated single marker (rs1847694, P = 2.40 × 10(-5)) is located 195 Kb upstream of DPP10 in Chromosome 2. DPP10 is prominently expressed in brain neuronal populations, where it has been shown to bind and regulate Kv4-mediated A-type potassium channels. Taken together, these results provide additional evidence that 8q24, 14q32, and 2q13-q14 are susceptibly loci for BP and these regions may be involved in the pathogenesis of BP in the Latino population. PMID:25044503

  5. Genome-wide linkage scan and association study of PARL to the expression of LHON families in Thailand.

    PubMed

    Phasukkijwatana, Nopasak; Kunhapan, Bussaraporn; Stankovich, Jim; Chuenkongkaew, Wanicha L; Thomson, Russell; Thornton, Timothy; Bahlo, Melanie; Mushiroda, Taisei; Nakamura, Yusuke; Mahasirimongkol, Surakameth; Tun, Aung Win; Srisawat, Chatchawan; Limwongse, Chanin; Peerapittayamongkol, Chayanon; Sura, Thanyachai; Suthammarak, Wichit; Lertrit, Patcharee

    2010-07-01

    Leber hereditary optic neuropathy (LHON) is the most common mitochondrially inherited disease causing blindness, preferentially in young adult males. Most of the patients carry the G11778A mitochondrial DNA (mtDNA) mutation. However, the marked incomplete penetrance and the gender bias indicate some additional genetic and/or environmental factors to disease expression. Herein, we first conducted a genome-wide linkage scan with 400 microsatellite markers in 9 large Thai LHON G11778A pedigrees. Using an affecteds-only nonparametric linkage analysis, 4 regions on chromosomes 3, 12, 13 and 18 showed Zlr scores greater than 2 (P < 0.025), which is consistently significant across several linkage statistics. The most suggestive marker D3S1565 (Zlr > 2 in 10 of 16 allele sharing models tested) was then expanded to include the region 3q26.2-3q28 covering SLC7A14 (3q26.2), MFN1 (3q26.32), MRPL47 (3q26.33), MCCC1 (3q27.1), PARL (3q27.1) and OPA1 (3q28-q29). All of these candidate genes were selected from the Maestro database and had known to be localized in mitochondria. Sixty tag SNPs were genotyped in 86 cases, 211 of their relatives and 32 unrelated Thai controls, by multiplex-PCR-based Invader assay. Analyses using a powerful association testing tool that adjusts for relatedness (the M(QLS) statistic) showed the most evidence of association between two SNPs, rs3749446 and rs1402000 (located in PARL presenilins-associated rhomboid-like) and LHON expression (both P = 8.8 x 10(-5)). The mitochondrial PARL protease has been recently known to play a role with a dynamin-related OPA1 protein in preventing apoptotic events by slowing down the release of cytochrome c out of mitochondrial cristae junctions. Moreover, PARL is required to activate the intramembranous proteolyses resulting in the degradation of an accumulated pro-apoptotic protein in the outer mitochondrial membrane. Under these circumstances, variants of PARL are suggested to influence cell death by apoptosis which

  6. Genome-wide analysis and identification of KT/HAK/KUP potassium transporter gene family in peach (Prunus persica).

    PubMed

    Song, Z Z; Ma, R J; Yu, M L

    2015-01-30

    The KT/HAK/KUP family members encoding high-affinity potassium (K(+)) transporters mediate K(+) transport across the plasma membranes of plant cells to maintain plant normal growth and metabolic activities. In this paper, we identified 16 potassium transporter genes in the peach (Prunus persica) using the Hidden Markov model scanning strategy and searching the peach genome database. Utilizing the Arabidopsis KT/HAK/KUP family as a reference, phylogenetic analysis indicates that the KT/HAK/KUP family in the peach can be classified into 3 groups. Genomic localization indicated that 16 KT/HAK/KUP family genes were well distributed on 7 scaffolds. Gene structure analysis showed that the KT/HAK/KUP family genes have 6-9 introns. In addition, all of the KT/HAK/KUP family members were hydrophobic proteins; they exhibited similar secondary structure patterns and homologous tertiary structures. Putative cis-elements involved in abiotic stress adaption, Ca(2+) response, light and circadian rhythm regulation, and seed development were observed in the promoters of the KT/HAK/KUP family genes. Subcellular localization prediction indicated that the KT/HAK/KUP members were mainly located in the plasma membrane. Expression levels of the KT/HAK/ KUP family genes were much higher in the fruit and flower than those in the other 7 tissues examined, indicating that the KT/HAK/KUP family genes may have important roles in K(+) uptake and transport, which mainly contribute to flower formation and fruit development in the peach.

  7. The easy road to genome-wide medium density SNP screening in a non-model species: development and application of a 10 K SNP-chip for the house sparrow (Passer domesticus).

    PubMed

    Hagen, Ingerid J; Billing, Anna M; Rønning, Bernt; Pedersen, Sindre A; Pärn, Henrik; Slate, Jon; Jensen, Henrik

    2013-05-01

    With the advent of next generation sequencing, new avenues have opened to study genomics in wild populations of non-model species. Here, we describe a successful approach to a genome-wide medium density Single Nucleotide Polymorphism (SNP) panel in a non-model species, the house sparrow (Passer domesticus), through the development of a 10 K Illumina iSelect HD BeadChip. Genomic DNA and cDNA derived from six individuals were sequenced on a 454 GS FLX system and generated a total of 1.2 million sequences, in which SNPs were detected. As no reference genome exists for the house sparrow, we used the zebra finch (Taeniopygia guttata) reference genome to determine the most likely position of each SNP. The 10 000 SNPs on the SNP-chip were selected to be distributed evenly across 31 chromosomes, giving on average one SNP per 100 000 bp. The SNP-chip was screened across 1968 individual house sparrows from four island populations. Of the original 10 000 SNPs, 7413 were found to be variable, and 99% of these SNPs were successfully called in at least 93% of all individuals. We used the SNP-chip to demonstrate the ability of such genome-wide marker data to detect population sub-division, and compared these results to similar analyses using microsatellites. The SNP-chip will be used to map Quantitative Trait Loci (QTL) for fitness-related phenotypic traits in natural populations.

  8. [Genome-wide identification, phylogenetic analysis and expression profiling of the WOX family genes in Solanum lycopersicum].

    PubMed

    Li, Xiaoxu; Liu, Cheng; Li, Wei; Zhang, Zenglin; Gao, Xiaoming; Zhou, Hui; Guo, Yongfeng

    2016-05-01

    Members of the plant-specific WOX transcription factor family have been reported to play important roles in cell to cell communication as well as other physiological and developmental processes. In this study, ten members of the WOX transcription factor family were identified in Solanum lycopersicum with HMMER. Neighbor-joining phylogenetic tree, maximum-likelihood tree and Bayesian-inference tree were constructed and similar topologies were shown using the protein sequences of the homeodomain. Phylogenetic study revealed that the 25 WOX family members from Arabidopsis and tomato fall into three clades and nine subfamilies. The patterns of exon-intron structures and organization of conserved domains in Arabidopsis and tomato were consistent based on the phylogenetic results. Transcriptome analysis showed that the expression patterns of SlWOXs were different in different tissue types. Gene Ontology (GO) analysis suggested that, as transcription factors, the SlWOX family members could be involved in a number of biological processes including cell to cell communication and tissue development. Our results are useful for future studies on WOX family members in tomato and other plant species. PMID:27232493

  9. Genome-Wide Identification and Expression Analysis of Homeodomain Leucine Zipper Subfamily IV (HDZ IV) Gene Family from Musa accuminata.

    PubMed

    Pandey, Ashutosh; Misra, Prashant; Alok, Anshu; Kaur, Navneet; Sharma, Shivani; Lakhwani, Deepika; Asif, Mehar H; Tiwari, Siddharth; Trivedi, Prabodh K

    2016-01-01

    The homeodomain zipper family (HD-ZIP) of transcription factors is present only in plants and plays important role in the regulation of plant-specific processes. The subfamily IV of HDZ transcription factors (HD-ZIP IV) has primarily been implicated in the regulation of epidermal structure development. Though this gene family is present in all lineages of land plants, members of this gene family have not been identified in banana, which is one of the major staple fruit crops. In the present work, we identified 21 HDZIV encoding genes in banana by the computational analysis of banana genome resource. Our analysis suggested that these genes putatively encode proteins having all the characteristic domains of HDZIV transcription factors. The phylogenetic analysis of the banana HDZIV family genes further confirmed that after separation from a common ancestor, the banana, and poales lineages might have followed distinct evolutionary paths. Further, we conclude that segmental duplication played a major role in the evolution of banana HDZIV encoding genes. All the identified banana HDZIV genes expresses in different banana tissue, however at varying levels. The transcript levels of some of the banana HDZIV genes were also detected in banana fruit pulp, suggesting their putative role in fruit attributes. A large number of genes of this family showed modulated expression under drought and salinity stress. Taken together, the present work lays a foundation for elucidation of functional aspects of the banana HDZIV encoding genes and for their possible use in the banana improvement programs.

  10. Genome-Wide Identification and Expression Analysis of Homeodomain Leucine Zipper Subfamily IV (HDZ IV) Gene Family from Musa accuminata

    PubMed Central

    Pandey, Ashutosh; Misra, Prashant; Alok, Anshu; Kaur, Navneet; Sharma, Shivani; Lakhwani, Deepika; Asif, Mehar H.; Tiwari, Siddharth; Trivedi, Prabodh K.

    2016-01-01

    The homeodomain zipper family (HD-ZIP) of transcription factors is present only in plants and plays important role in the regulation of plant-specific processes. The subfamily IV of HDZ transcription factors (HD-ZIP IV) has primarily been implicated in the regulation of epidermal structure development. Though this gene family is present in all lineages of land plants, members of this gene family have not been identified in banana, which is one of the major staple fruit crops. In the present work, we identified 21 HDZIV encoding genes in banana by the computational analysis of banana genome resource. Our analysis suggested that these genes putatively encode proteins having all the characteristic domains of HDZIV transcription factors. The phylogenetic analysis of the banana HDZIV family genes further confirmed that after separation from a common ancestor, the banana, and poales lineages might have followed distinct evolutionary paths. Further, we conclude that segmental duplication played a major role in the evolution of banana HDZIV encoding genes. All the identified banana HDZIV genes expresses in different banana tissue, however at varying levels. The transcript levels of some of the banana HDZIV genes were also detected in banana fruit pulp, suggesting their putative role in fruit attributes. A large number of genes of this family showed modulated expression under drought and salinity stress. Taken together, the present work lays a foundation for elucidation of functional aspects of the banana HDZIV encoding genes and for their possible use in the banana improvement programs. PMID:26870050

  11. Genome-wide analysis of the bHLH transcription factor family in Chinese cabbage (Brassica rapa ssp. pekinensis).

    PubMed

    Song, Xiao-Ming; Huang, Zhi-Nan; Duan, Wei-Ke; Ren, Jun; Liu, Tong-Kun; Li, Ying; Hou, Xi-Lin

    2014-02-01

    Basic helix-loop-helix (bHLH) transcription factors are widely distributed in eukaryotic organisms and are thought to be one of the largest families of regulatory proteins. This important family of transcriptional regulators plays crucial roles in plant development. However, a systematic analysis of the bHLH transcription factor family has not been reported in Chinese cabbage. In this study, 230 bHLH transcription factors were identified from the whole Chinese cabbage genome and compared with proteins from other representative plants, fungi and metazoans. The Chinese cabbage bHLH (BrabHLH) gene family could be classified into 24 subfamilies. Phylogenetic analysis of BrabHLHs along with bHLHs from Arabidopsis and rice indicated 26 subfamilies. The identification, classification, phylogenetic reconstruction, conserved motifs, chromosome distribution, functional annotation, expression patterns and interaction networks of BrabHLHs were analyzed. Distribution mapping showed that BrabHLHs were non-randomly located on the ten Chinese cabbage chromosomes. One hundred and twenty-four orthologous bHLH genes were identified between Chinese cabbage and Arabidopsis, and the interaction networks of the orthologous genes were constructed in Chinese cabbage. Quantitative RT-PCR analysis showed that expressions of BrabHLH genes varied widely under different abiotic stress treatments for different times. Thus, this comprehensive analysis of BrabHLHs represents a rich resource, aiding the elucidation of the roles of bHLH family members in plant growth and development. Furthermore, the comparative genomics analysis deepened our understanding of the evolution of this gene family after a polyploidy event.

  12. Genome-Wide Analysis of APETALA2/Ethylene-Responsive Factor (AP2/ERF) Gene Family in Barley (Hordeum vulgare L.)

    PubMed Central

    Lin, Shen; Luan, Haiye; Lv, Chao; Zhang, Xinzhong; Song, Xiyun; Xu, Rugen

    2016-01-01

    APETALA2/Ethylene-Responsive Factor (AP2/ERF) gene family is plant specific transcription factor. It plays critical roles in development process, tolerance to biotic and abiotic stresses, and responses to plant hormones. However, limited data are available on the contributions of AP2/ERF gene family in barley (Hordeum vulgare L.). In the present study, 121 HvAP2/ERF genes in barley were identified by using bioinformatics methods. A total of 118 HvAP2/ERF (97.5%) genes were located on seven chromosomes. According to phylogenetic classification of AP2/ERF family in Arabidopsis, HvAP2/ERF proteins were divided into AP2 (APETALA2), RAV (Related to ABI3/VP), DREB (dehydration responsive element binding), ERF (ethylene responsive factors) and soloist sub families. The analysis of duplication events indicated that tandem repeat and segmental duplication contributed to the expansion of the AP2/ERF family in barley. HvDREB1s/2s genes displayed various expression patterns under abiotic stress and phytohormone. Taken together, the data generated in this study will be useful for genome-wide analysis to determine the precise role of the HvAP2/ERF gene during barley development, abiotic stress and phytohormone responses with the ultimate goal of improving crop production. PMID:27598245

  13. Genome-wide identification of MAPKK and MAPKKK gene families in tomato and transcriptional profiling analysis during development and stress response.

    PubMed

    Wu, Jian; Wang, Jie; Pan, Changtian; Guan, Xiaoyan; Wang, Yan; Liu, Songyu; He, Yanjun; Chen, Jingli; Chen, Lifei; Lu, Gang

    2014-01-01

    Mitogen-activated protein kinase (MAPK) cascades have important functions in plant growth, development, and response to various stresses. The MAPKK and MAPKKK gene families in tomato have never been systematically analyzed. In this study, we performed a genome-wide analysis of the MAPKK and MAPKKK gene families in tomato and identified 5 MAPKK genes and 89 MAPKKK genes. Phylogenetic analyses of the MAPKK and MAPKKK gene families showed that all the MAPKK genes formed four groups (groups A, B, C, and D), whereas all the MAPKKK genes were classified into three subfamilies, namely, MEKK, RAF, and ZIK. Evolutionary analysis showed that whole genome or chromosomal segment duplications were the main factors responsible for the expansion of the MAPKK and MAPKKK gene families in tomato. Quantitative real-time RT-PCR analysis showed that the majority of MAPKK and MAPKKK genes were expressed in all tested organs with considerable differences in transcript levels indicating that they might be constitutively expressed. However, the expression level of most of these genes changed significantly under heat, cold, drought, salt, and Pseudomonas syringae treatment. Furthermore, their expression levels exhibited significant changes in response to salicylic acid and indole-3-acetic acid treatment, implying that these genes might have important roles in the plant hormone network. Our comparative analysis of the MAPKK and MAPKKK families would improve our understanding of the evolution and functional characterization of MAPK cascades in tomato.

  14. Genome-Wide Analysis of APETALA2/Ethylene-Responsive Factor (AP2/ERF) Gene Family in Barley (Hordeum vulgare L.).

    PubMed

    Guo, Baojian; Wei, Yafeng; Xu, Ruibin; Lin, Shen; Luan, Haiye; Lv, Chao; Zhang, Xinzhong; Song, Xiyun; Xu, Rugen

    2016-01-01

    APETALA2/Ethylene-Responsive Factor (AP2/ERF) gene family is plant specific transcription factor. It plays critical roles in development process, tolerance to biotic and abiotic stresses, and responses to plant hormones. However, limited data are available on the contributions of AP2/ERF gene family in barley (Hordeum vulgare L.). In the present study, 121 HvAP2/ERF genes in barley were identified by using bioinformatics methods. A total of 118 HvAP2/ERF (97.5%) genes were located on seven chromosomes. According to phylogenetic classification of AP2/ERF family in Arabidopsis, HvAP2/ERF proteins were divided into AP2 (APETALA2), RAV (Related to ABI3/VP), DREB (dehydration responsive element binding), ERF (ethylene responsive factors) and soloist sub families. The analysis of duplication events indicated that tandem repeat and segmental duplication contributed to the expansion of the AP2/ERF family in barley. HvDREB1s/2s genes displayed various expression patterns under abiotic stress and phytohormone. Taken together, the data generated in this study will be useful for genome-wide analysis to determine the precise role of the HvAP2/ERF gene during barley development, abiotic stress and phytohormone responses with the ultimate goal of improving crop production. PMID:27598245

  15. Genome-Wide Analysis of the Glutathione S-Transferase Gene Family in Capsella rubella: Identification, Expression, and Biochemical Functions

    PubMed Central

    He, Gang; Guan, Chao-Nan; Chen, Qiang-Xin; Gou, Xiao-Jun; Liu, Wei; Zeng, Qing-Yin; Lan, Ting

    2016-01-01

    Extensive subfunctionalization might explain why so many genes have been maintained after gene duplication, which provides the engine for gene family expansion. However, it is still a particular challenge to trace the evolutionary dynamics and features of functional divergences in a supergene family over the course of evolution. In this study, we identified 49 Glutathione S-transferase (GST) genes from the Capsella rubella, a close relative of Arabidopsis thaliana and a member of the mustard family. Capsella GSTs can be categorized into eight classes, with tau and phi GSTs being the most numerous. The expansion of the two classes mainly occurs through tandem gene duplication, which results in tandem-arrayed gene clusters on chromosomes. By integrating phylogenetic analysis, expression patterns, and biochemical functions of Capsella and Arabidopsis GSTs, functional divergence, both in gene expression and enzymatic properties, were clearly observed in paralogous gene pairs in Capsella (even the most recent duplicates), and orthologous GSTs in Arabidopsis/Capsella. This study provides functional evidence for the expansion and organization of a large gene family in closely related species.

  16. Genome-wide analysis of the R2R3-MYB transcription factor gene family in sweet orange (Citrus sinensis).

    PubMed

    Liu, Chaoyang; Wang, Xia; Xu, Yuantao; Deng, Xiuxin; Xu, Qiang

    2014-10-01

    MYB transcription factor represents one of the largest gene families in plant genomes. Sweet orange (Citrus sinensis) is one of the most important fruit crops worldwide, and recently the genome has been sequenced. This provides an opportunity to investigate the organization and evolutionary characteristics of sweet orange MYB genes from whole genome view. In the present study, we identified 100 R2R3-MYB genes in the sweet orange genome. A comprehensive analysis of this gene family was performed, including the phylogeny, gene structure, chromosomal localization and expression pattern analyses. The 100 genes were divided into 29 subfamilies based on the sequence similarity and phylogeny, and the classification was also well supported by the highly conserved exon/intron structures and motif composition. The phylogenomic comparison of MYB gene family among sweet orange and related plant species, Arabidopsis, cacao and papaya suggested the existence of functional divergence during evolution. Expression profiling indicated that sweet orange R2R3-MYB genes exhibited distinct temporal and spatial expression patterns. Our analysis suggested that the sweet orange MYB genes may play important roles in different plant biological processes, some of which may be potentially involved in citrus fruit quality. These results will be useful for future functional analysis of the MYB gene family in sweet orange.

  17. Genome-Wide Identification, Characterization and Expression Analysis of the Solute Carrier 6 Gene Family in Silkworm (Bombyx mori)

    PubMed Central

    Tang, Xin; Liu, Huawei; Chen, Quanmei; Wang, Xin; Xiong, Ying; Zhao, Ping

    2016-01-01

    The solute carrier 6 (SLC6) gene family, initially known as the neurotransmitter transporters, plays vital roles in the regulation of neurotransmitter signaling, nutrient absorption and motor behavior. In this study, a total of 16 candidate genes were identified as SLC6 family gene homologs in the silkworm (Bombyx mori) genome. Spatio-temporal expression patterns of silkworm SLC6 gene transcripts indicated that these genes were highly and specifically expressed in midgut, brain and gonads; moreover, these genes were expressed primarily at the feeding stage or adult stage. Levels of expression for most midgut-specific and midgut-enriched gene transcripts were down-regulated after starvation but up-regulated after re-feeding. In addition, we observed that expression levels of these genes except for BmSLC6-15 and BmGT1 were markedly up-regulated by a juvenile hormone analog. Moreover, brain-enriched genes showed differential expression patterns during wandering and mating processes, suggesting that these genes may be involved in modulating wandering and mating behaviors. Our results improve our understanding of the expression patterns and potential physiological functions of the SLC6 gene family, and provide valuable information for the comprehensive functional analysis of the SLC6 gene family. PMID:27706106

  18. Genome-Wide Analysis of the Glutathione S-Transferase Gene Family in Capsella rubella: Identification, Expression, and Biochemical Functions.

    PubMed

    He, Gang; Guan, Chao-Nan; Chen, Qiang-Xin; Gou, Xiao-Jun; Liu, Wei; Zeng, Qing-Yin; Lan, Ting

    2016-01-01

    Extensive subfunctionalization might explain why so many genes have been maintained after gene duplication, which provides the engine for gene family expansion. However, it is still a particular challenge to trace the evolutionary dynamics and features of functional divergences in a supergene family over the course of evolution. In this study, we identified 49 Glutathione S-transferase (GST) genes from the Capsella rubella, a close relative of Arabidopsis thaliana and a member of the mustard family. Capsella GSTs can be categorized into eight classes, with tau and phi GSTs being the most numerous. The expansion of the two classes mainly occurs through tandem gene duplication, which results in tandem-arrayed gene clusters on chromosomes. By integrating phylogenetic analysis, expression patterns, and biochemical functions of Capsella and Arabidopsis GSTs, functional divergence, both in gene expression and enzymatic properties, were clearly observed in paralogous gene pairs in Capsella (even the most recent duplicates), and orthologous GSTs in Arabidopsis/Capsella. This study provides functional evidence for the expansion and organization of a large gene family in closely related species. PMID:27630652

  19. Genome-wide identification of WRKY family genes and their response to cold stress in Vitis vinifera

    Technology Transfer Automated Retrieval System (TEKTRAN)

    WRKY transcription factors are one of the largest families of transcriptional regulators in plants. WRKY genes are not only found to play significant roles in biotic and abiotic stress response, but also regulate growth and development. Grapevine (Vitis vinifera) production is largely limited by str...

  20. Genome-Wide Analysis of the Glutathione S-Transferase Gene Family in Capsella rubella: Identification, Expression, and Biochemical Functions

    PubMed Central

    He, Gang; Guan, Chao-Nan; Chen, Qiang-Xin; Gou, Xiao-Jun; Liu, Wei; Zeng, Qing-Yin; Lan, Ting

    2016-01-01

    Extensive subfunctionalization might explain why so many genes have been maintained after gene duplication, which provides the engine for gene family expansion. However, it is still a particular challenge to trace the evolutionary dynamics and features of functional divergences in a supergene family over the course of evolution. In this study, we identified 49 Glutathione S-transferase (GST) genes from the Capsella rubella, a close relative of Arabidopsis thaliana and a member of the mustard family. Capsella GSTs can be categorized into eight classes, with tau and phi GSTs being the most numerous. The expansion of the two classes mainly occurs through tandem gene duplication, which results in tandem-arrayed gene clusters on chromosomes. By integrating phylogenetic analysis, expression patterns, and biochemical functions of Capsella and Arabidopsis GSTs, functional divergence, both in gene expression and enzymatic properties, were clearly observed in paralogous gene pairs in Capsella (even the most recent duplicates), and orthologous GSTs in Arabidopsis/Capsella. This study provides functional evidence for the expansion and organization of a large gene family in closely related species. PMID:27630652

  1. Genome-Wide Association and Trans-ethnic Meta-Analysis for Advanced Diabetic Kidney Disease: Family Investigation of Nephropathy and Diabetes (FIND).

    PubMed

    Iyengar, Sudha K; Sedor, John R; Freedman, Barry I; Kao, W H Linda; Kretzler, Matthias; Keller, Benjamin J; Abboud, Hanna E; Adler, Sharon G; Best, Lyle G; Bowden, Donald W; Burlock, Allison; Chen, Yii-Der Ida; Cole, Shelley A; Comeau, Mary E; Curtis, Jeffrey M; Divers, Jasmin; Drechsler, Christiane; Duggirala, Ravi; Elston, Robert C; Guo, Xiuqing; Huang, Huateng; Hoffmann, Michael Marcus; Howard, Barbara V; Ipp, Eli; Kimmel, Paul L; Klag, Michael J; Knowler, William C; Kohn, Orly F; Leak, Tennille S; Leehey, David J; Li, Man; Malhotra, Alka; März, Winfried; Nair, Viji; Nelson, Robert G; Nicholas, Susanne B; O'Brien, Stephen J; Pahl, Madeleine V; Parekh, Rulan S; Pezzolesi, Marcus G; Rasooly, Rebekah S; Rotimi, Charles N; Rotter, Jerome I; Schelling, Jeffrey R; Seldin, Michael F; Shah, Vallabh O; Smiles, Adam M; Smith, Michael W; Taylor, Kent D; Thameem, Farook; Thornley-Brown, Denyse P; Truitt, Barbara J; Wanner, Christoph; Weil, E Jennifer; Winkler, Cheryl A; Zager, Philip G; Igo, Robert P; Hanson, Robert L; Langefeld, Carl D

    2015-08-01

    Diabetic kidney disease (DKD) is the most common etiology of chronic kidney disease (CKD) in the industrialized world and accounts for much of the excess mortality in patients with diabetes mellitus. Approximately 45% of U.S. patients with incident end-stage kidney disease (ESKD) have DKD. Independent of glycemic control, DKD aggregates in families and has higher incidence rates in African, Mexican, and American Indian ancestral groups relative to European populations. The Family Investigation of Nephropathy and Diabetes (FIND) performed a genome-wide association study (GWAS) contrasting 6,197 unrelated individuals with advanced DKD with healthy and diabetic individuals lacking nephropathy of European American, African American, Mexican American, or American Indian ancestry. A large-scale replication and trans-ethnic meta-analysis included 7,539 additional European American, African American and American Indian DKD cases and non-nephropathy controls. Within ethnic group meta-analysis of discovery GWAS and replication set results identified genome-wide significant evidence for association between DKD and rs12523822 on chromosome 6q25.2 in American Indians (P = 5.74x10-9). The strongest signal of association in the trans-ethnic meta-analysis was with a SNP in strong linkage disequilibrium with rs12523822 (rs955333; P = 1.31x10-8), with directionally consistent results across ethnic groups. These 6q25.2 SNPs are located between the SCAF8 and CNKSR3 genes, a region with DKD relevant changes in gene expression and an eQTL with IPCEF1, a gene co-translated with CNKSR3. Several other SNPs demonstrated suggestive evidence of association with DKD, within and across populations. These data identify a novel DKD susceptibility locus with consistent directions of effect across diverse ancestral groups and provide insight into the genetic architecture of DKD. PMID:26305897

  2. Genome-wide comparative in silico analysis of the RNA helicase gene family in Zea mays and Glycine max: a comparison with Arabidopsis and Oryza sativa.

    PubMed

    Xu, Ruirui; Zhang, Shizhong; Huang, Jinguang; Zheng, Chengchao

    2013-01-01

    RNA helicases are enzymes that are thought to unwind double-stranded RNA molecules in an energy-dependent fashion through the hydrolysis of NTP. RNA helicases are associated with all processes involving RNA molecules, including nuclear transcription, editing, splicing, ribosome biogenesis, RNA export, and organelle gene expression. The involvement of RNA helicase in response to stress and in plant growth and development has been reported previously. While their importance in Arabidopsis and Oryza sativa has been partially studied, the function of RNA helicase proteins is poorly understood in Zea mays and Glycine max. In this study, we identified a total of RNA helicase genes in Arabidopsis and other crop species genome by genome-wide comparative in silico analysis. We classified the RNA helicase genes into three subfamilies according to the structural features of the motif II region, such as DEAD-box, DEAH-box and DExD/H-box, and different species showed different patterns of alternative splicing. Secondly, chromosome location analysis showed that the RNA helicase protein genes were distributed across all chromosomes with different densities in the four species. Thirdly, phylogenetic tree analyses identified the relevant homologs of DEAD-box, DEAH-box and DExD/H-box RNA helicase proteins in each of the four species. Fourthly, microarray expression data showed that many of these predicted RNA helicase genes were expressed in different developmental stages and different tissues under normal growth conditions. Finally, real-time quantitative PCR analysis showed that the expression levels of 10 genes in Arabidopsis and 13 genes in Zea mays were in close agreement with the microarray expression data. To our knowledge, this is the first report of a comparative genome-wide analysis of the RNA helicase gene family in Arabidopsis, Oryza sativa, Zea mays and Glycine max. This study provides valuable information for understanding the classification and putative functions of

  3. Genome-Wide Association and Trans-ethnic Meta-Analysis for Advanced Diabetic Kidney Disease: Family Investigation of Nephropathy and Diabetes (FIND).

    PubMed

    Iyengar, Sudha K; Sedor, John R; Freedman, Barry I; Kao, W H Linda; Kretzler, Matthias; Keller, Benjamin J; Abboud, Hanna E; Adler, Sharon G; Best, Lyle G; Bowden, Donald W; Burlock, Allison; Chen, Yii-Der Ida; Cole, Shelley A; Comeau, Mary E; Curtis, Jeffrey M; Divers, Jasmin; Drechsler, Christiane; Duggirala, Ravi; Elston, Robert C; Guo, Xiuqing; Huang, Huateng; Hoffmann, Michael Marcus; Howard, Barbara V; Ipp, Eli; Kimmel, Paul L; Klag, Michael J; Knowler, William C; Kohn, Orly F; Leak, Tennille S; Leehey, David J; Li, Man; Malhotra, Alka; März, Winfried; Nair, Viji; Nelson, Robert G; Nicholas, Susanne B; O'Brien, Stephen J; Pahl, Madeleine V; Parekh, Rulan S; Pezzolesi, Marcus G; Rasooly, Rebekah S; Rotimi, Charles N; Rotter, Jerome I; Schelling, Jeffrey R; Seldin, Michael F; Shah, Vallabh O; Smiles, Adam M; Smith, Michael W; Taylor, Kent D; Thameem, Farook; Thornley-Brown, Denyse P; Truitt, Barbara J; Wanner, Christoph; Weil, E Jennifer; Winkler, Cheryl A; Zager, Philip G; Igo, Robert P; Hanson, Robert L; Langefeld, Carl D

    2015-08-01

    Diabetic kidney disease (DKD) is the most common etiology of chronic kidney disease (CKD) in the industrialized world and accounts for much of the excess mortality in patients with diabetes mellitus. Approximately 45% of U.S. patients with incident end-stage kidney disease (ESKD) have DKD. Independent of glycemic control, DKD aggregates in families and has higher incidence rates in African, Mexican, and American Indian ancestral groups relative to European populations. The Family Investigation of Nephropathy and Diabetes (FIND) performed a genome-wide association study (GWAS) contrasting 6,197 unrelated individuals with advanced DKD with healthy and diabetic individuals lacking nephropathy of European American, African American, Mexican American, or American Indian ancestry. A large-scale replication and trans-ethnic meta-analysis included 7,539 additional European American, African American and American Indian DKD cases and non-nephropathy controls. Within ethnic group meta-analysis of discovery GWAS and replication set results identified genome-wide significant evidence for association between DKD and rs12523822 on chromosome 6q25.2 in American Indians (P = 5.74x10-9). The strongest signal of association in the trans-ethnic meta-analysis was with a SNP in strong linkage disequilibrium with rs12523822 (rs955333; P = 1.31x10-8), with directionally consistent results across ethnic groups. These 6q25.2 SNPs are located between the SCAF8 and CNKSR3 genes, a region with DKD relevant changes in gene expression and an eQTL with IPCEF1, a gene co-translated with CNKSR3. Several other SNPs demonstrated suggestive evidence of association with DKD, within and across populations. These data identify a novel DKD susceptibility locus with consistent directions of effect across diverse ancestral groups and provide insight into the genetic architecture of DKD.

  4. Genome-Wide Association and Trans-ethnic Meta-Analysis for Advanced Diabetic Kidney Disease: Family Investigation of Nephropathy and Diabetes (FIND)

    PubMed Central

    Kretzler, Matthias; Keller, Benjamin J.; Adler, Sharon G.; Best, Lyle G.; Bowden, Donald W.; Burlock, Allison; Chen, Yii-Der Ida; Cole, Shelley A.; Comeau, Mary E.; Curtis, Jeffrey M.; Divers, Jasmin; Drechsler, Christiane; Duggirala, Ravi; Elston, Robert C.; Guo, Xiuqing; Huang, Huateng; Hoffmann, Michael Marcus; Howard, Barbara V.; Ipp, Eli; Kimmel, Paul L.; Klag, Michael J.; Knowler, William C.; Kohn, Orly F.; Leak, Tennille S.; Leehey, David J.; Li, Man; Malhotra, Alka; März, Winfried; Nair, Viji; Nelson, Robert G.; Nicholas, Susanne B.; O’Brien, Stephen J.; Pahl, Madeleine V.; Parekh, Rulan S.; Pezzolesi, Marcus G.; Rasooly, Rebekah S.; Rotimi, Charles N.; Rotter, Jerome I.; Schelling, Jeffrey R.; Seldin, Michael F.; Shah, Vallabh O.; Smiles, Adam M.; Smith, Michael W.; Taylor, Kent D.; Thameem, Farook; Thornley-Brown, Denyse P.; Truitt, Barbara J.; Wanner, Christoph; Weil, E. Jennifer; Winkler, Cheryl A.; Zager, Philip G.; Igo, Robert P.; Hanson, Robert L.; Langefeld, Carl D.

    2015-01-01

    Diabetic kidney disease (DKD) is the most common etiology of chronic kidney disease (CKD) in the industrialized world and accounts for much of the excess mortality in patients with diabetes mellitus. Approximately 45% of U.S. patients with incident end-stage kidney disease (ESKD) have DKD. Independent of glycemic control, DKD aggregates in families and has higher incidence rates in African, Mexican, and American Indian ancestral groups relative to European populations. The Family Investigation of Nephropathy and Diabetes (FIND) performed a genome-wide association study (GWAS) contrasting 6,197 unrelated individuals with advanced DKD with healthy and diabetic individuals lacking nephropathy of European American, African American, Mexican American, or American Indian ancestry. A large-scale replication and trans-ethnic meta-analysis included 7,539 additional European American, African American and American Indian DKD cases and non-nephropathy controls. Within ethnic group meta-analysis of discovery GWAS and replication set results identified genome-wide significant evidence for association between DKD and rs12523822 on chromosome 6q25.2 in American Indians (P = 5.74x10-9). The strongest signal of association in the trans-ethnic meta-analysis was with a SNP in strong linkage disequilibrium with rs12523822 (rs955333; P = 1.31x10-8), with directionally consistent results across ethnic groups. These 6q25.2 SNPs are located between the SCAF8 and CNKSR3 genes, a region with DKD relevant changes in gene expression and an eQTL with IPCEF1, a gene co-translated with CNKSR3. Several other SNPs demonstrated suggestive evidence of association with DKD, within and across populations. These data identify a novel DKD susceptibility locus with consistent directions of effect across diverse ancestral groups and provide insight into the genetic architecture of DKD. PMID:26305897

  5. Genome-wide identification and analysis of FK506-binding protein gene family in peach (Prunus persica).

    PubMed

    Zhang, Yanping; Han, Jan; Liu, Dan; Wen, Xicheng; Li, Yu; Tao, Ran; Peng, Yongbin; Fang, Jinggui; Wang, Chen

    2014-02-25

    The FKBP protein family has prolyl isomerase activity and is related in function to cyclophilins. FKBPs are known to be involved in many biological processes including hormone signaling, plant growth, and stress responses through a chaperone or an isomerization of proline residues during protein folding. The availability of complete peach genome sequences allowed the identification of 21 FKBP genes by HMMER and BLAST analyses. Scaffold locations of these FKBP genes in the peach genome were determined and the protein domain and motif organization of peach FKBPs were analyzed. The phylogenetic relationships between peach FKBPs were also assessed. The expression profiles of peach FKBP gene results revealed that most peach FKBPs were expressed in all tissues, while a few peach FKBPs were specifically expressed in some of the tissues. This data could contribute to better understanding of the complex regulation of the peach FKBP gene family, and also provide valuable information for further research in peach functional genomics.

  6. Genome-wide identification, classification and expression analysis of the heat shock transcription factor family in Chinese cabbage.

    PubMed

    Song, Xiaoming; Liu, Gaofeng; Duan, Weike; Liu, Tongkun; Huang, Zhinan; Ren, Jun; Li, Ying; Hou, Xilin

    2014-08-01

    The Hsf gene family, one of the most important transcription factor families, plays crucial roles in regulating heat resistance. However, a systematic and comprehensive analysis of this gene family has not been reported in Chinese cabbage. Therefore, systematic analysis of the Hsf gene family in Chinese cabbage has profound significance. In this study, 35 BrHsf genes were identified from Chinese cabbage, which could be classified into three groups according to their structural characteristics and phylogenetic comparisons with Arabidopsis and rice. Thirty-three BrHsf genes mapped on chromosomes were further assigned to three subgenomes and eight ancestral karyotypes. Distribution mapping showed that BrHsf genes were non-randomly localized on chromosomes. Chinese cabbage and Arabidopsis shared 22 orthologous gene pairs. The expansion of BrHsf genes mainly resulted from genome triplication. Comparative analysis showed that the most Hsf genes were in Chinese cabbage among the five species analyzed. Interestingly, the number of Hsf genes of heat-resistant plants (Theobroma cacao and Musa acuminata) was fewer than that in Chinese cabbage. The expression patterns of BrHsf genes were different in six tissues, based on RNA-seq. Quantitative real-time-PCR analysis showed that the expression level of BrHsf genes varied under various abiotic stresses. In conclusion, this comprehensive analysis of BrHsf genes will provide rich resources, aiding the determination of Hsfs functions in plant heat resistance. Furthermore, the comparative genomics analysis deepened our understanding of Hsf genes' evolution accompanied by the polyploidy event of Chinese cabbage. PMID:24609322

  7. Genome-Wide Analysis and Expression Profiling of the Phospholipase C Gene Family in Soybean (Glycine max)

    PubMed Central

    Zhou, Yonggang; Dong, Jinye; Chen, Huan; Dong, Yuanyuan; Wang, Nan; Li, Xiaowei; Li, Haiyan

    2015-01-01

    Phosphatidylinositol-specific phospholipase C (PI-PLC) hydrolyses phosphatidylinositol-4,5-bisphosphate to produce diacylglycerol and inositol 1,4,5-trisphosphate. It plays an important role in plant development and abiotic stress responses. However, systematic analysis and expression profiling of the phospholipase C (PLC) gene family in soybean have not been reported. In this study, 12 putative PLC genes were identified in the soybean genome. Soybean PLCs were found on chromosomes 2, 11, 14 and 18 and encoded 58.8–70.06 kD proteins. Expression pattern analysis by RT-PCR demonstrated that expression of the GmPLCs was induced by PEG, NaCl and saline-alkali treatments in roots and leaves. GmPLC transcripts accumulated specifically in roots after ABA treatment. Furthermore, GmPLC transcripts were analyzed in various tissues. The results showed that GmPLC7 was highly expressed in most tissues, whereas GmPLC12 was expressed in early pods specifically. In addition, subcellular localization analysis was carried out and confirmed that GmPLC10 was localized in the plasma membrane in Nicotiana benthamiana. Our genomic analysis of the soybean PLC family provides an insight into the regulation of abiotic stress responses and development. It also provides a solid foundation for the functional characterization of the soybean PLC gene family. PMID:26421918

  8. Genome-Wide Characterization and Expression Profiling of the AUXIN RESPONSE FACTOR (ARF) Gene Family in Eucalyptus grandis

    PubMed Central

    Yu, Hong; Soler, Marçal; Mila, Isabelle; San Clemente, Hélène; Savelli, Bruno; Dunand, Christophe; Paiva, Jorge A. P.; Myburg, Alexander A.; Bouzayen, Mondher; Grima-Pettenati, Jacqueline; Cassan-Wang, Hua

    2014-01-01

    Auxin is a central hormone involved in a wide range of developmental processes including the specification of vascular stem cells. Auxin Response Factors (ARF) are important actors of the auxin signalling pathway, regulating the transcription of auxin-responsive genes through direct binding to their promoters. The recent availability of the Eucalyptus grandis genome sequence allowed us to examine the characteristics and evolutionary history of this gene family in a woody plant of high economic importance. With 17 members, the E. grandis ARF gene family is slightly contracted, as compared to those of most angiosperms studied hitherto, lacking traces of duplication events. In silico analysis of alternative transcripts and gene truncation suggested that these two mechanisms were preeminent in shaping the functional diversity of the ARF family in Eucalyptus. Comparative phylogenetic analyses with genomes of other taxonomic lineages revealed the presence of a new ARF clade found preferentially in woody and/or perennial plants. High-throughput expression profiling among different organs and tissues and in response to environmental cues highlighted genes expressed in vascular cambium and/or developing xylem, responding dynamically to various environmental stimuli. Finally, this study allowed identification of three ARF candidates potentially involved in the auxin-regulated transcriptional program underlying wood formation. PMID:25269088

  9. Genome-Wide Analysis and Expression Profiling of the Phospholipase C Gene Family in Soybean (Glycine max).

    PubMed

    Wang, Fawei; Deng, Yu; Zhou, Yonggang; Dong, Jinye; Chen, Huan; Dong, Yuanyuan; Wang, Nan; Li, Xiaowei; Li, Haiyan

    2015-01-01

    Phosphatidylinositol-specific phospholipase C (PI-PLC) hydrolyses phosphatidylinositol-4,5-bisphosphate to produce diacylglycerol and inositol 1,4,5-trisphosphate. It plays an important role in plant development and abiotic stress responses. However, systematic analysis and expression profiling of the phospholipase C (PLC) gene family in soybean have not been reported. In this study, 12 putative PLC genes were identified in the soybean genome. Soybean PLCs were found on chromosomes 2, 11, 14 and 18 and encoded 58.8-70.06 kD proteins. Expression pattern analysis by RT-PCR demonstrated that expression of the GmPLCs was induced by PEG, NaCl and saline-alkali treatments in roots and leaves. GmPLC transcripts accumulated specifically in roots after ABA treatment. Furthermore, GmPLC transcripts were analyzed in various tissues. The results showed that GmPLC7 was highly expressed in most tissues, whereas GmPLC12 was expressed in early pods specifically. In addition, subcellular localization analysis was carried out and confirmed that GmPLC10 was localized in the plasma membrane in Nicotiana benthamiana. Our genomic analysis of the soybean PLC family provides an insight into the regulation of abiotic stress responses and development. It also provides a solid foundation for the functional characterization of the soybean PLC gene family. PMID:26421918

  10. Genome-wide classification and evolutionary and expression analyses of citrus MYB transcription factor families in sweet orange.

    PubMed

    Hou, Xiao-Jin; Li, Si-Bei; Liu, Sheng-Rui; Hu, Chun-Gen; Zhang, Jin-Zhi

    2014-01-01

    MYB family genes are widely distributed in plants and comprise one of the largest transcription factors involved in various developmental processes and defense responses of plants. To date, few MYB genes and little expression profiling have been reported for citrus. Here, we describe and classify 177 members of the sweet orange MYB gene (CsMYB) family in terms of their genomic gene structures and similarity to their putative Arabidopsis orthologs. According to these analyses, these CsMYBs were categorized into four groups (4R-MYB, 3R-MYB, 2R-MYB and 1R-MYB). Gene structure analysis revealed that 1R-MYB genes possess relatively more introns as compared with 2R-MYB genes. Investigation of their chromosomal localizations revealed that these CsMYBs are distributed across nine chromosomes. Sweet orange includes a relatively small number of MYB genes compared with the 198 members in Arabidopsis, presumably due to a paralog reduction related to repetitive sequence insertion into promoter and non-coding transcribed region of the genes. Comparative studies of CsMYBs and Arabidopsis showed that CsMYBs had fewer gene duplication events. Expression analysis revealed that the MYB gene family has a wide expression profile in sweet orange development and plays important roles in development and stress responses. In addition, 337 new putative microsatellites with flanking sequences sufficient for primer design were also identified from the 177 CsMYBs. These results provide a useful reference for the selection of candidate MYB genes for cloning and further functional analysis forcitrus.

  11. Genome-wide identification and analysis of the SBP-box family genes in apple (Malus × domestica Borkh.).

    PubMed

    Li, Jun; Hou, Hongmin; Li, Xiaoqin; Xiang, Jiang; Yin, Xiangjing; Gao, Hua; Zheng, Yi; Bassett, Carole L; Wang, Xiping

    2013-09-01

    SQUAMOSA promoter binding protein (SBP)-box genes encode a family of plant-specific transcription factors and play many crucial roles in plant development. In this study, 27 SBP-box gene family members were identified in the apple (Malus × domestica Borkh.) genome, 15 of which were suggested to be putative targets of MdmiR156. Plant SBPs were classified into eight groups according to the phylogenetic analysis of SBP-domain proteins. Gene structure, gene chromosomal location and synteny analyses of MdSBP genes within the apple genome demonstrated that tandem and segmental duplications, as well as whole genome duplications, have likely contributed to the expansion and evolution of the SBP-box gene family in apple. Additionally, synteny analysis between apple and Arabidopsis indicated that several paired homologs of MdSBP and AtSPL genes were located in syntenic genomic regions. Tissue-specific expression analysis of MdSBP genes in apple demonstrated their diversified spatiotemporal expression patterns. Most MdmiR156-targeted MdSBP genes, which had relatively high transcript levels in stems, leaves, apical buds and some floral organs, exhibited a more differential expression pattern than most MdmiR156-nontargeted MdSBP genes. Finally, expression analysis of MdSBP genes in leaves upon various plant hormone treatments showed that many MdSBP genes were responsive to different plant hormones, indicating that MdSBP genes may be involved in responses to hormone signaling during stress or in apple development.

  12. Genome-wide gene phylogeny of CIPK family in cassava and expression analysis of partial drought-induced genes

    PubMed Central

    Hu, Wei; Xia, Zhiqiang; Yan, Yan; Ding, Zehong; Tie, Weiwei; Wang, Lianzhe; Zou, Meiling; Wei, Yunxie; Lu, Cheng; Hou, Xiaowan; Wang, Wenquan; Peng, Ming

    2015-01-01

    Cassava is an important food and potential biofuel crop that is tolerant to multiple abiotic stressors. The mechanisms underlying these tolerances are currently less known. CBL-interacting protein kinases (CIPKs) have been shown to play crucial roles in plant developmental processes, hormone signaling transduction, and in the response to abiotic stress. However, no data is currently available about the CPK family in cassava. In this study, a total of 25 CIPK genes were identified from cassava genome based on our previous genome sequencing data. Phylogenetic analysis suggested that 25 MeCIPKs could be classified into four subfamilies, which was supported by exon-intron organizations and the architectures of conserved protein motifs. Transcriptomic analysis of a wild subspecies and two cultivated varieties showed that most MeCIPKs had different expression patterns between wild subspecies and cultivatars in different tissues or in response to drought stress. Some orthologous genes involved in CIPK interaction networks were identified between Arabidopsis and cassava. The interaction networks and co-expression patterns of these orthologous genes revealed that the crucial pathways controlled by CIPK networks may be involved in the differential response to drought stress in different accessions of cassava. Nine MeCIPK genes were selected to investigate their transcriptional response to various stimuli and the results showed the comprehensive response of the tested MeCIPK genes to osmotic, salt, cold, oxidative stressors, and ABA signaling. The identification and expression analysis of CIPK family suggested that CIPK genes are important components of development and multiple signal transduction pathways in cassava. The findings of this study will help lay a foundation for the functional characterization of the CIPK gene family and provide an improved understanding of abiotic stress responses and signaling transduction in cassava. PMID:26579161

  13. Genome-Wide Classification and Evolutionary and Expression Analyses of Citrus MYB Transcription Factor Families in Sweet Orange

    PubMed Central

    Hou, Xiao-Jin; Li, Si-Bei; Liu, Sheng-Rui; Hu, Chun-Gen; Zhang, Jin-Zhi

    2014-01-01

    MYB family genes are widely distributed in plants and comprise one of the largest transcription factors involved in various developmental processes and defense responses of plants. To date, few MYB genes and little expression profiling have been reported for citrus. Here, we describe and classify 177 members of the sweet orange MYB gene (CsMYB) family in terms of their genomic gene structures and similarity to their putative Arabidopsis orthologs. According to these analyses, these CsMYBs were categorized into four groups (4R-MYB, 3R-MYB, 2R-MYB and 1R-MYB). Gene structure analysis revealed that 1R-MYB genes possess relatively more introns as compared with 2R-MYB genes. Investigation of their chromosomal localizations revealed that these CsMYBs are distributed across nine chromosomes. Sweet orange includes a relatively small number of MYB genes compared with the 198 members in Arabidopsis, presumably due to a paralog reduction related to repetitive sequence insertion into promoter and non-coding transcribed region of the genes. Comparative studies of CsMYBs and Arabidopsis showed that CsMYBs had fewer gene duplication events. Expression analysis revealed that the MYB gene family has a wide expression profile in sweet orange development and plays important roles in development and stress responses. In addition, 337 new putative microsatellites with flanking sequences sufficient for primer design were also identified from the 177 CsMYBs. These results provide a useful reference for the selection of candidate MYB genes for cloning and further functional analysis forcitrus. PMID:25375352

  14. Genome-Wide Identification, Classification, and Expression Analysis of 14-3-3 Gene Family in Populus

    PubMed Central

    Tian, Fengxia; Wang, Tan; Xie, Yuli; Zhang, Jin; Hu, Jianjun

    2015-01-01

    Background In plants, 14-3-3 proteins are encoded by a large multigene family and are involved in signaling pathways to regulate plant development and protection from stress. Although twelve Populus 14-3-3s were identified based on the Populus trichocarpa genome V1.1 in a previous study, no systematic analysis including genome organization, gene structure, duplication relationship, evolutionary analysis and expression compendium has been conducted in Populus based on the latest P. trichocarpa genome V3.0. Principal Findings Here, a comprehensive analysis of Populus 14-3-3 family is presented. Two new 14-3-3 genes were identified based on the latest P. trichocarpa genome. In P. trichocarpa, fourteen 14-3-3 genes were grouped into ε and non-ε group. Exon-intron organizations of Populus 14-3-3s are highly conserved within the same group. Genomic organization analysis indicated that purifying selection plays a pivotal role in the retention and maintenance of Populus 14-3-3 family. Protein conformational analysis indicated that Populus 14-3-3 consists of a bundle of nine α-helices (α1-α9); the first four are essential for formation of the dimer, while α3, α5, α7, and α9 form a conserved peptide-binding groove. In addition, α1, α3, α5, α7, and α9 were evolving at a lower rate, while α2, α4, and α6 were evolving at a relatively faster rate. Microarray analyses showed that most Populus 14-3-3s are differentially expressed across tissues and upon exposure to various stresses. Conclusions The gene structures and their coding protein structures of Populus 14-3-3s are highly conserved among group members, suggesting that members of the same group might also have conserved functions. Microarray and qRT-PCR analyses showed that most Populus 14-3-3s were differentially expressed in various tissues and were induced by various stresses. Our investigation provided a better understanding of the complexity of the 14-3-3 gene family in poplars. PMID:25867623

  15. Genome-wide comparison of ferritin family from Archaea, Bacteria, Eukarya, and Viruses: its distribution, characteristic motif, and phylogenetic relationship

    NASA Astrophysics Data System (ADS)

    Bai, Lina; Xie, Ting; Hu, Qingqing; Deng, Changyan; Zheng, Rong; Chen, Wanping

    2015-10-01

    Ferritins are highly conserved proteins that are widely distributed in various species from archaea to humans. The ubiquitous characteristic of these proteins reflects the pivotal contribution of ferritins to the safe storage and timely delivery of iron to achieve iron homeostasis. This study investigated the ferritin genes in 248 genomes from various species, including viruses, archaea, bacteria, and eukarya. The distribution comparison suggests that mammals and eudicots possess abundant ferritin genes, whereas fungi contain very few ferritin genes. Archaea and bacteria show considerable numbers of ferritin genes. Generally, prokaryotes possess three types of ferritin (the typical ferritin, bacterioferritin, and DNA-binding protein from starved cell), whereas eukaryotes have various subunit types of ferritin, thereby indicating the individuation of the ferritin family during evolution. The characteristic motif analysis of ferritins suggested that all key residues specifying the unique structural motifs of ferritin are highly conserved across three domains of life. Meanwhile, the characteristic motifs were also distinguishable between ferritin groups, especially phytoferritins, which show a plant-specific motif. The phylogenetic analyses show that ferritins within the same subfamily or subunits are generally clustered together. The phylogenetic relationships among ferritin members suggest that both gene duplication and horizontal transfer contribute to the wide variety of ferritins, and their possible evolutionary scenario was also proposed. The results contribute to a better understanding of the distribution, characteristic motif, and evolutionary relationship of the ferritin family.

  16. Genome-wide analysis, expression dynamics and varietal comparison of NAC gene family at various developmental stages in Morus notabilis.

    PubMed

    Baranwal, Vinay Kumar; Khurana, Paramjit

    2016-06-01

    NAC genes are important transcription factors and forms a large family in plants. They have shown to play an important role in growth and development and have also been shown to involve in regulation of stress-responsive genes. In the present study, a repertoire of NAC genes in recently published mulberry genome has been identified which consists of a total of 79 members. Structural analysis revealed that most of the NAC genes in mulberry contain two introns. The proteins encoded by them show a wide range of isoelectric points suggestive of their varied roles in varying microcellular environment. Phylogenetic and conserved motif analysis elucidate the presence of 15 sub-groups of these genes along with two novel sub-groups having distinct conserved motifs which are not present in Arabidopsis. Gene ontology term enrichment analysis and cis-element identification from their putative 1 K upstream regulatory region indicates their possible role in important biological processes like organ formation, meristem establishment, senescence, and various biotic and abiotic stresses. Expression analysis across various developmental stages led to identification of their preferential expression in diverse tissues. Taken together, this work provides a solid background information related to structure, function, expression and evolution of NAC gene family in mulberry. PMID:26942603

  17. Genome-wide characterization and analysis of bZIP transcription factor gene family related to abiotic stress in cassava.

    PubMed

    Hu, Wei; Yang, Hubiao; Yan, Yan; Wei, Yunxie; Tie, Weiwei; Ding, Zehong; Zuo, Jiao; Peng, Ming; Li, Kaimian

    2016-01-01

    The basic leucine zipper (bZIP) transcription factor family plays crucial roles in various aspects of biological processes. Currently, no information is available regarding the bZIP family in the important tropical crop cassava. Herein, 77 bZIP genes were identified from cassava. Evolutionary analysis indicated that MebZIPs could be divided into 10 subfamilies, which was further supported by conserved motif and gene structure analyses. Global expression analysis suggested that MebZIPs showed similar or distinct expression patterns in different tissues between cultivated variety and wild subspecies. Transcriptome analysis of three cassava genotypes revealed that many MebZIP genes were activated by drought in the root of W14 subspecies, indicating the involvement of these genes in the strong resistance of cassava to drought. Expression analysis of selected MebZIP genes in response to osmotic, salt, cold, ABA, and H2O2 suggested that they might participate in distinct signaling pathways. Our systematic analysis of MebZIPs reveals constitutive, tissue-specific and abiotic stress-responsive candidate MebZIP genes for further functional characterization in planta, yields new insights into transcriptional regulation of MebZIP genes, and lays a foundation for understanding of bZIP-mediated abiotic stress response. PMID:26947924

  18. Genome-wide identification and comparative analysis of the cation proton antiporters family in pear and four other Rosaceae species.

    PubMed

    Zhou, Hongsheng; Qi, Kaijie; Liu, Xing; Yin, Hao; Wang, Peng; Chen, Jianqing; Wu, Juyou; Zhang, Shaoling

    2016-08-01

    The monovalent cation proton antiporters (CPAs) play essential roles in plant nutrition, development, and signal transduction by regulating ion and pH homeostasis of the cell. The CPAs of plants include the Na(+)/H(+) exchanger, K(+) efflux antiporter, and cation/H(+) exchanger families. However, currently, little is known about the CPA genes in Rosaceae species. In this study, 220 CPA genes were identified from five Rosaceae species (Pyrus bretschneideri, Malus domestica, Prunus persica, Fragaria vesca, and Prunus mume), and 53 of which came from P. bretschneideri. Phylogenetic, structure, collinearity, and gene expression analyses were conducted on the entire CPA genes of pear. Gene expression data showed that 35 and 37 CPA genes were expressed in pear fruit and pollen tubes, respectively. The transcript analysis of some CPA genes under abiotic stress conditions revealed that CPAs may play an important role in pollen tubes growth. The results presented here will be useful in improving understanding of the complexity of the CPA gene family and will promote functional characterization in future studies. PMID:27193473

  19. Using ancestry matching to combine family-based and unrelated samples for genome-wide association studies.

    PubMed

    Crossett, Andrew; Kent, Brian P; Klei, Lambertus; Ringquist, Steven; Trucco, Massimo; Roeder, Kathryn; Devlin, Bernie

    2010-12-10

    We propose a method to analyze family-based samples together with unrelated cases and controls. The method builds on the idea of matched case-control analysis using conditional logistic regression (CLR). For each trio within the family, a case (the proband) and matched pseudo-controls are constructed, based upon the transmitted and untransmitted alleles. Unrelated controls, matched by genetic ancestry, supplement the sample of pseudo-controls; likewise unrelated cases are also paired with genetically matched controls. Within each matched stratum, the case genotype is contrasted with control/pseudo-control genotypes via CLR, using a method we call matched-CLR (mCLR). Eigenanalysis of numerous SNP genotypes provides a tool for mapping genetic ancestry. The result of such an analysis can be thought of as a multidimensional map, or eigenmap, in which the relative genetic similarities and differences amongst individuals is encoded in the map. Once constructed, new individuals can be projected onto the ancestry map based on their genotypes. Successful differentiation of individuals of distinct ancestry depends on having a diverse, yet representative sample from which to construct the ancestry map. Once samples are well-matched, mCLR yields comparable power to competing methods while ensuring excellent control over Type I error.

  20. Genome-wide identification and expression profile of homeodomain-leucine zipper Class I gene family in Cucumis sativus.

    PubMed

    Liu, Wei; Fu, Rao; Li, Qiang; Li, Jing; Wang, Lina; Ren, Zhonghai

    2013-12-01

    The HD-Zip proteins comprise one of the largest families of transcription factors in plants. HD-Zip genes have been grouped into four different classes: HD-Zip I to IV. In this study, we described the identification and structural characterization of Class I HD-Zip genes in cucumber. A complete set of 13 HD-Zip I genes were identified in the cucumber genome using Blast search tools and phylogeny. The cucumber HD-Zip I family contained a smaller number of identified genes compared to other higher plants such as Arabidopsis and maize due to the absence of recent gene duplication events. Chromosomal location of these genes revealed that they are distributed unevenly across 5 of 7 chromosomes. Tissue-specific expression profiles showed that 13 cucumber HD-Zip I genes were expressed in at least one of the tissues, which suggested that cucumber HD-Zip I genes took part in many cellular processes. The transcript abundance level analysis during abiotic stress conditions (NaCl, ABA and low temperature treatments) identified a group of HD-Zip I genes that responded to one or more treatments.

  1. Genome-wide comparison of ferritin family from Archaea, Bacteria, Eukarya, and Viruses: its distribution, characteristic motif, and phylogenetic relationship.

    PubMed

    Bai, Lina; Xie, Ting; Hu, Qingqing; Deng, Changyan; Zheng, Rong; Chen, Wanping

    2015-10-01

    Ferritins are highly conserved proteins that are widely distributed in various species from archaea to humans. The ubiquitous characteristic of these proteins reflects the pivotal contribution of ferritins to the safe storage and timely delivery of iron to achieve iron homeostasis. This study investigated the ferritin genes in 248 genomes from various species, including viruses, archaea, bacteria, and eukarya. The distribution comparison suggests that mammals and eudicots possess abundant ferritin genes, whereas fungi contain very few ferritin genes. Archaea and bacteria show considerable numbers of ferritin genes. Generally, prokaryotes possess three types of ferritin (the typical ferritin, bacterioferritin, and DNA-binding protein from starved cell), whereas eukaryotes have various subunit types of ferritin, thereby indicating the individuation of the ferritin family during evolution. The characteristic motif analysis of ferritins suggested that all key residues specifying the unique structural motifs of ferritin are highly conserved across three domains of life. Meanwhile, the characteristic motifs were also distinguishable between ferritin groups, especially phytoferritins, which show a plant-specific motif. The phylogenetic analyses show that ferritins within the same subfamily or subunits are generally clustered together. The phylogenetic relationships among ferritin members suggest that both gene duplication and horizontal transfer contribute to the wide variety of ferritins, and their possible evolutionary scenario was also proposed. The results contribute to a better understanding of the distribution, characteristic motif, and evolutionary relationship of the ferritin family.

  2. Comprehensive analysis of multi-tissue transcriptome data and the genome-wide investigation of GRAS family in Phyllostachys edulis

    PubMed Central

    Zhao, Hansheng; Dong, Lili; Sun, Huayu; Li, Lichao; Lou, Yongfeng; Wang, Lili; Li, Zuyao; Gao, Zhimin

    2016-01-01

    GRAS family is one of plant specific transcription factors and plays diverse roles in the regulation of plant growth and development as well as in the plant disease resistance and abiotic stress responses. However, the investigation of GRAS family and multi-tissue gene expression profiles still remains unavailable in bamboo (Phyllostachys edulis). Here, we applied RNA-Seq analysis to monitor global transcriptional changes and investigate expression patterns in the five tissues of Ph. edulis, and analyzed a large-scale transcriptional events and patterns. Moreover, the tissue-specific genes and DEGs in different tissues were detected. For example, DEGs in panicle and leaf tissues were abundant in photosynthesis, glutathione, porphyrin and chlorophyll metabolism, whereas those in shoot and rhizome were majority in glycerophospholipid metabolism. In the portion of Ph. edulis GRAS (PeGRAS) analyses, we performed the analysis of phylogenetic, gene structure, conserved motifs, and analyzed the expression profiles of PeGRASs in response to high light and made a co-expression analysis. Additionally, the expression profiles of PeGRASs were validated using quantitative real-time PCR. Thus, PeGRASs based on dynamics profiles of gene expression is helpful in uncovering the specific biological functions which might be of critical values for bioengineering to improve bamboo breeding in future. PMID:27325361

  3. Genome-wide characterization and analysis of bZIP transcription factor gene family related to abiotic stress in cassava.

    PubMed

    Hu, Wei; Yang, Hubiao; Yan, Yan; Wei, Yunxie; Tie, Weiwei; Ding, Zehong; Zuo, Jiao; Peng, Ming; Li, Kaimian

    2016-03-07

    The basic leucine zipper (bZIP) transcription factor family plays crucial roles in various aspects of biological processes. Currently, no information is available regarding the bZIP family in the important tropical crop cassava. Herein, 77 bZIP genes were identified from cassava. Evolutionary analysis indicated that MebZIPs could be divided into 10 subfamilies, which was further supported by conserved motif and gene structure analyses. Global expression analysis suggested that MebZIPs showed similar or distinct expression patterns in different tissues between cultivated variety and wild subspecies. Transcriptome analysis of three cassava genotypes revealed that many MebZIP genes were activated by drought in the root of W14 subspecies, indicating the involvement of these genes in the strong resistance of cassava to drought. Expression analysis of selected MebZIP genes in response to osmotic, salt, cold, ABA, and H2O2 suggested that they might participate in distinct signaling pathways. Our systematic analysis of MebZIPs reveals constitutive, tissue-specific and abiotic stress-responsive candidate MebZIP genes for further functional characterization in planta, yields new insights into transcriptional regulation of MebZIP genes, and lays a foundation for understanding of bZIP-mediated abiotic stress response.

  4. Genome-wide analysis of the homeodomain-leucine zipper (HD-ZIP) gene family in peach (Prunus persica).

    PubMed

    Zhang, C H; Ma, R J; Shen, Z J; Sun, X; Korir, N K; Yu, M L

    2014-04-08

    In this study, 33 homeodomain-leucine zipper (HD-ZIP) genes were identified in peach using the HD-ZIP amino acid sequences of Arabidopsis thaliana as a probe. Based on the phylogenetic analysis and the individual gene or protein characteristics, the HD-ZIP gene family in peach can be classified into 4 subfamilies, HD-ZIP I, II, III, and IV, containing 14, 7, 4, and 8 members, respectively. The most closely related peach HD-ZIP members within the same subfamilies shared very similar gene structure in terms of either intron/exon numbers or lengths. Almost all members of the same subfamily shared common motif compositions, thereby implying that the HD-ZIP proteins within the same subfamily may have functional similarity. The 33 peach HD-ZIP genes were distributed across scaffolds 1 to 7. Although the primary structure varied among HD-ZIP family proteins, their tertiary structures were similar. The results from this study will be useful in selecting candidate genes from specific subfamilies for functional analysis.

  5. Genome-wide analysis and expression profiling of the ERF transcription factor family in potato (Solanum tuberosum L.).

    PubMed

    Charfeddine, Mariam; Saïdi, Mohamed Najib; Charfeddine, Safa; Hammami, Asma; Gargouri Bouzid, Radhia

    2015-04-01

    The ERF transcription factors belong to the AP2/ERF superfamily, one of the largest transcription factor families in plants. They play important roles in plant development processes, as well as in the response to biotic, abiotic, and hormone signaling. In the present study, 155 putative ERF transcription factor genes were identified from the potato (Solanum tuberosum) genome database, and compared with those from Arabidopsis thaliana. The StERF proteins are divided into ten phylogenetic groups. Expression analyses of five StERFs were carried out by semi-quantitative RT-PCR and compared with published RNA-seq data. These latter analyses were used to distinguish tissue-specific, biotic, and abiotic stress genes as well as hormone-responsive StERF genes. The results are of interest to better understand the role of the AP2/ERF genes in response to diverse types of stress in potatoes. A comprehensive analysis of the physiological functions and biological roles of the ERF family genes in S. tuberosum is required to understand crop stress tolerance mechanisms.

  6. Genome-wide characterization and analysis of bZIP transcription factor gene family related to abiotic stress in cassava

    PubMed Central

    Hu, Wei; Yang, Hubiao; Yan, Yan; Wei, Yunxie; Tie, Weiwei; Ding, Zehong; Zuo, Jiao; Peng, Ming; Li, Kaimian

    2016-01-01

    The basic leucine zipper (bZIP) transcription factor family plays crucial roles in various aspects of biological processes. Currently, no information is available regarding the bZIP family in the important tropical crop cassava. Herein, 77 bZIP genes were identified from cassava. Evolutionary analysis indicated that MebZIPs could be divided into 10 subfamilies, which was further supported by conserved motif and gene structure analyses. Global expression analysis suggested that MebZIPs showed similar or distinct expression patterns in different tissues between cultivated variety and wild subspecies. Transcriptome analysis of three cassava genotypes revealed that many MebZIP genes were activated by drought in the root of W14 subspecies, indicating the involvement of these genes in the strong resistance of cassava to drought. Expression analysis of selected MebZIP genes in response to osmotic, salt, cold, ABA, and H2O2 suggested that they might participate in distinct signaling pathways. Our systematic analysis of MebZIPs reveals constitutive, tissue-specific and abiotic stress-responsive candidate MebZIP genes for further functional characterization in planta, yields new insights into transcriptional regulation of MebZIP genes, and lays a foundation for understanding of bZIP-mediated abiotic stress response. PMID:26947924

  7. Genome-Wide Identification, Evolutionary Expansion, and Expression Profile of Homeodomain-Leucine Zipper Gene Family in Poplar (Populus trichocarpa)

    PubMed Central

    Hu, Ruibo; Chi, Xiaoyuan; Chai, Guohua; Kong, Yingzhen; He, Guo; Wang, Xiaoyu; Shi, Dachuan; Zhang, Dongyuan; Zhou, Gongke

    2012-01-01

    Background Homeodomain-leucine zipper (HD-ZIP) proteins are plant-specific transcriptional factors known to play crucial roles in plant development. Although sequence phylogeny analysis of Populus HD-ZIPs was carried out in a previous study, no systematic analysis incorporating genome organization, gene structure, and expression compendium has been conducted in model tree species Populus thus far. Principal Findings In this study, a comprehensive analysis of Populus HD-ZIP gene family was performed. Sixty-three full-length HD-ZIP genes were found in Populus genome. These Populus HD-ZIP genes were phylogenetically clustered into four distinct subfamilies (HD-ZIP I–IV) and predominately distributed across 17 linkage groups (LG). Fifty genes from 25 Populus paralogous pairs were located in the duplicated blocks of Populus genome and then preferentially retained during the sequential evolutionary courses. Genomic organization analyses indicated that purifying selection has played a pivotal role in the retention and maintenance of Populus HD-ZIP gene family. Microarray analysis has shown that 21 Populus paralogous pairs have been differentially expressed across different tissues and under various stresses, with five paralogous pairs showing nearly identical expression patterns, 13 paralogous pairs being partially redundant and three paralogous pairs diversifying significantly. Quantitative real-time RT-PCR (qRT-PCR) analysis performed on 16 selected Populus HD-ZIP genes in different tissues and under both drought and salinity stresses confirms their tissue-specific and stress-inducible expression patterns. Conclusions Genomic organizations indicated that segmental duplications contributed significantly to the expansion of Populus HD-ZIP gene family. Exon/intron organization and conserved motif composition of Populus HD-ZIPs are highly conservative in the same subfamily, suggesting the members in the same subfamilies may also have conservative functionalities

  8. A Genome-Wide Analysis Reveals Stress and Hormone Responsive Patterns of TIFY Family Genes in Brassica rapa

    PubMed Central

    Saha, Gopal; Park, Jong-In; Kayum, Md. Abdul; Nou, Ill-Sup

    2016-01-01

    The TIFY family is a plant-specific group of proteins with a diversity of functions and includes four subfamilies, viz. ZML, TIFY, PPD, and JASMONATE ZIM-domain (JAZ) proteins. TIFY family members, particularly JAZ subfamily proteins, play roles in biological processes such as development and stress and hormone responses in Arabidopsis, rice, chickpea, and grape. However, there is no information about this family in any Brassica crop. This study identifies 36 TIFY genes in Brassica rapa, an economically important crop species in the Brassicaceae. An extensive in silico analysis of phylogenetic grouping, protein motif organization and intron-exon distribution confirmed that there are four subfamilies of BrTIFY proteins. Out of 36 BrTIFY genes, we identified 21 in the JAZ subfamily, seven in the TIFY subfamily, six in ZML and two in PPD. Extensive expression profiling of 21 BrTIFY JAZs in various tissues, especially in floral organs and at different flower growth stages revealed constitutive expression patterns, which suggest that BrTIFY JAZ genes are important during growth and development of B. rapa flowers. A protein interaction network analysis also pointed to association of these proteins with fertility and defense processes of B. rapa. Using a low temperature-treated whole-genome microarray data set, most of the JAZ genes were found to have variable transcript abundance between the contrasting inbred lines Chiifu and Kenshin of B. rapa. Subsequently, the expression of all 21 BrTIFY JAZs in response to cold stress was characterized in the same two lines via qPCR, demonstrating that nine genes were up-regulated. Importantly, the BrTIFY JAZs showed strong and differential expression upon JA treatment, pointing to their probable involvement in JA-mediated growth regulatory functions, especially during flower development and stress responses. Additionally, BrTIFY JAZs were induced in response to salt, drought, Fusarium, ABA, and SA treatments, and six genes (BrTIFY3

  9. A Genome-Wide Analysis Reveals Stress and Hormone Responsive Patterns of TIFY Family Genes in Brassica rapa.

    PubMed

    Saha, Gopal; Park, Jong-In; Kayum, Md Abdul; Nou, Ill-Sup

    2016-01-01

    The TIFY family is a plant-specific group of proteins with a diversity of functions and includes four subfamilies, viz. ZML, TIFY, PPD, and JASMONATE ZIM-domain (JAZ) proteins. TIFY family members, particularly JAZ subfamily proteins, play roles in biological processes such as development and stress and hormone responses in Arabidopsis, rice, chickpea, and grape. However, there is no information about this family in any Brassica crop. This study identifies 36 TIFY genes in Brassica rapa, an economically important crop species in the Brassicaceae. An extensive in silico analysis of phylogenetic grouping, protein motif organization and intron-exon distribution confirmed that there are four subfamilies of BrTIFY proteins. Out of 36 BrTIFY genes, we identified 21 in the JAZ subfamily, seven in the TIFY subfamily, six in ZML and two in PPD. Extensive expression profiling of 21 BrTIFY JAZs in various tissues, especially in floral organs and at different flower growth stages revealed constitutive expression patterns, which suggest that BrTIFY JAZ genes are important during growth and development of B. rapa flowers. A protein interaction network analysis also pointed to association of these proteins with fertility and defense processes of B. rapa. Using a low temperature-treated whole-genome microarray data set, most of the JAZ genes were found to have variable transcript abundance between the contrasting inbred lines Chiifu and Kenshin of B. rapa. Subsequently, the expression of all 21 BrTIFY JAZs in response to cold stress was characterized in the same two lines via qPCR, demonstrating that nine genes were up-regulated. Importantly, the BrTIFY JAZs showed strong and differential expression upon JA treatment, pointing to their probable involvement in JA-mediated growth regulatory functions, especially during flower development and stress responses. Additionally, BrTIFY JAZs were induced in response to salt, drought, Fusarium, ABA, and SA treatments, and six genes (BrTIFY3

  10. A Genome-Wide Analysis Reveals Stress and Hormone Responsive Patterns of TIFY Family Genes in Brassica rapa.

    PubMed

    Saha, Gopal; Park, Jong-In; Kayum, Md Abdul; Nou, Ill-Sup

    2016-01-01

    The TIFY family is a plant-specific group of proteins with a diversity of functions and includes four subfamilies, viz. ZML, TIFY, PPD, and JASMONATE ZIM-domain (JAZ) proteins. TIFY family members, particularly JAZ subfamily proteins, play roles in biological processes such as development and stress and hormone responses in Arabidopsis, rice, chickpea, and grape. However, there is no information about this family in any Brassica crop. This study identifies 36 TIFY genes in Brassica rapa, an economically important crop species in the Brassicaceae. An extensive in silico analysis of phylogenetic grouping, protein motif organization and intron-exon distribution confirmed that there are four subfamilies of BrTIFY proteins. Out of 36 BrTIFY genes, we identified 21 in the JAZ subfamily, seven in the TIFY subfamily, six in ZML and two in PPD. Extensive expression profiling of 21 BrTIFY JAZs in various tissues, especially in floral organs and at different flower growth stages revealed constitutive expression patterns, which suggest that BrTIFY JAZ genes are important during growth and development of B. rapa flowers. A protein interaction network analysis also pointed to association of these proteins with fertility and defense processes of B. rapa. Using a low temperature-treated whole-genome microarray data set, most of the JAZ genes were found to have variable transcript abundance between the contrasting inbred lines Chiifu and Kenshin of B. rapa. Subsequently, the expression of all 21 BrTIFY JAZs in response to cold stress was characterized in the same two lines via qPCR, demonstrating that nine genes were up-regulated. Importantly, the BrTIFY JAZs showed strong and differential expression upon JA treatment, pointing to their probable involvement in JA-mediated growth regulatory functions, especially during flower development and stress responses. Additionally, BrTIFY JAZs were induced in response to salt, drought, Fusarium, ABA, and SA treatments, and six genes (BrTIFY3

  11. Genome-Wide Identification and Comparative Analysis of Cytosine-5 DNA Methyltransferase and Demethylase Families in Wild and Cultivated Peanut

    PubMed Central

    Wang, Pengfei; Gao, Chao; Bian, Xiaotong; Zhao, Shuzhen; Zhao, Chuanzhi; Xia, Han; Song, Hui; Hou, Lei; Wan, Shubo; Wang, Xingjun

    2016-01-01

    DNA methylation plays important roles in genome protection, regulation of gene expression and is associated with plants development. Plant DNA methylation pattern was mediated by cytosine-5 DNA methyltransferase and demethylase. Although the genomes of AA and BB wild peanuts have been fully sequenced, these two gene families have not been studied. In this study we report the identification and analysis of putative cytosine-5 DNA methyltransferases (C5-MTases) and demethylases in AA and BB wild peanuts. Cytosine-5 DNA methyltransferases in AA and BB wild peanuts could be classified in MET, CMT, and DRM2 groups based on their domain organization. This result was supported by the gene and protein structural characteristics and phylogenetic analysis. We found that some wild peanut DRM2 members didn't contain UBA domain which was different from other plants such as Arabidopsis, maize and soybean. Five DNA demethylase encoding genes were found in AA genome and five in BB genome. The selective pressure analysis showed that wild peanut C5-MTase genes mainly underwent purifying selection but many positive selection sites can be detected. Conversely, DNA demethylase genes mainly underwent positive selection during evolution. Additionally, the expression dynamic of cytosine-5 DNA methyltransferase and demethylase genes in different cultivated peanut tissues were analyzed. Expression result showed that cold, heat or PEG stress could influence the expression level of C5-MTase and DNA demethylase genes in cultivated peanut. These results are useful for better understanding the complexity of these two gene families, and will facilitate epigenetic studies in peanut in the future. PMID:26870046

  12. Genome-Wide Comparative Analysis of the Phospholipase D Gene Families among Allotetraploid Cotton and Its Diploid Progenitors

    PubMed Central

    Tang, Kai; Dong, Chun-Juan; Liu, Jin-Yuan

    2016-01-01

    In this study, 40 phospholipase D (PLD) genes were identified from allotetraploid cotton Gossypium hirsutum, and 20 PLD genes were examined in diploid cotton Gossypium raimondii. Combining with 19 previously identified Gossypium arboreum PLD genes, a comparative analysis was performed among the PLD gene families among allotetraploid and two diploid cottons. Based on the orthologous relationships, we found that almost each G. hirsutum PLD had a corresponding homolog in the G. arboreum and G. raimondii genomes, except for GhPLDβ3A, whose homolog GaPLDβ3 may have been lost during the evolution of G. arboreum after the interspecific hybridization. Phylogenetic analysis showed that all of the cotton PLDs were unevenly classified into six numbered subgroups: α, β/γ, δ, ε, ζ and φ. An N-terminal C2 domain was found in the α, β/γ, δ and ε subgroups, while phox homology (PX) and pleckstrin homology (PH) domains were identified in the ζ subgroup. The subgroup φ possessed a single peptide instead of a functional domain. In each phylogenetic subgroup, the PLDs showed high conservation in gene structure and amino acid sequences in functional domains. The expansion of GhPLD and GrPLD gene families were mainly attributed to segmental duplication and partly attributed to tandem duplication. Furthermore, purifying selection played a critical role in the evolution of PLD genes in cotton. Quantitative RT-PCR documented that allotetraploid cotton PLD genes were broadly expressed and each had a unique spatial and developmental expression pattern, indicating their functional diversification in cotton growth and development. Further analysis of cis-regulatory elements elucidated transcriptional regulations and potential functions. Our comparative analysis provided valuable information for understanding the putative functions of the PLD genes in cotton fiber. PMID:27213891

  13. Genome-Wide Identification and Comparative Analysis of Cytosine-5 DNA Methyltransferase and Demethylase Families in Wild and Cultivated Peanut.

    PubMed

    Wang, Pengfei; Gao, Chao; Bian, Xiaotong; Zhao, Shuzhen; Zhao, Chuanzhi; Xia, Han; Song, Hui; Hou, Lei; Wan, Shubo; Wang, Xingjun

    2016-01-01

    DNA methylation plays important roles in genome protection, regulation of gene expression and is associated with plants development. Plant DNA methylation pattern was mediated by cytosine-5 DNA methyltransferase and demethylase. Although the genomes of AA and BB wild peanuts have been fully sequenced, these two gene families have not been studied. In this study we report the identification and analysis of putative cytosine-5 DNA methyltransferases (C5-MTases) and demethylases in AA and BB wild peanuts. Cytosine-5 DNA methyltransferases in AA and BB wild peanuts could be classified in MET, CMT, and DRM2 groups based on their domain organization. This result was supported by the gene and protein structural characteristics and phylogenetic analysis. We found that some wild peanut DRM2 members didn't contain UBA domain which was different from other plants such as Arabidopsis, maize and soybean. Five DNA demethylase encoding genes were found in AA genome and five in BB genome. The selective pressure analysis showed that wild peanut C5-MTase genes mainly underwent purifying selection but many positive selection sites can be detected. Conversely, DNA demethylase genes mainly underwent positive selection during evolution. Additionally, the expression dynamic of cytosine-5 DNA methyltransferase and demethylase genes in different cultivated peanut tissues were analyzed. Expression result showed that cold, heat or PEG stress could influence the expression level of C5-MTase and DNA demethylase genes in cultivated peanut. These results are useful for better understanding the complexity of these two gene families, and will facilitate epigenetic studies in peanut in the future.

  14. Genome-Wide Comparative Analysis of the Phospholipase D Gene Families among Allotetraploid Cotton and Its Diploid Progenitors.

    PubMed

    Tang, Kai; Dong, Chun-Juan; Liu, Jin-Yuan

    2016-01-01

    In this study, 40 phospholipase D (PLD) genes were identified from allotetraploid cotton Gossypium hirsutum, and 20 PLD genes were examined in diploid cotton Gossypium raimondii. Combining with 19 previously identified Gossypium arboreum PLD genes, a comparative analysis was performed among the PLD gene families among allotetraploid and two diploid cottons. Based on the orthologous relationships, we found that almost each G. hirsutum PLD had a corresponding homolog in the G. arboreum and G. raimondii genomes, except for GhPLDβ3A, whose homolog GaPLDβ3 may have been lost during the evolution of G. arboreum after the interspecific hybridization. Phylogenetic analysis showed that all of the cotton PLDs were unevenly classified into six numbered subgroups: α, β/γ, δ, ε, ζ and φ. An N-terminal C2 domain was found in the α, β/γ, δ and ε subgroups, while phox homology (PX) and pleckstrin homology (PH) domains were identified in the ζ subgroup. The subgroup φ possessed a single peptide instead of a functional domain. In each phylogenetic subgroup, the PLDs showed high conservation in gene structure and amino acid sequences in functional domains. The expansion of GhPLD and GrPLD gene families were mainly attributed to segmental duplication and partly attributed to tandem duplication. Furthermore, purifying selection played a critical role in the evolution of PLD genes in cotton. Quantitative RT-PCR documented that allotetraploid cotton PLD genes were broadly expressed and each had a unique spatial and developmental expression pattern, indicating their functional diversification in cotton growth and development. Further analysis of cis-regulatory elements elucidated transcriptional regulations and potential functions. Our comparative analysis provided valuable information for understanding the putative functions of the PLD genes in cotton fiber. PMID:27213891

  15. Genome Wide Analysis of the Apple MYB Transcription Factor Family Allows the Identification of MdoMYB121 Gene Confering Abiotic Stress Tolerance in Plants

    PubMed Central

    Wang, Rong-Kai; Zhang, Rui-Fen; Hao, Yu-Jin

    2013-01-01

    The MYB proteins comprise one of the largest families of transcription factors (TFs) in plants. Although several MYB genes have been characterized to play roles in secondary metabolism, the MYB family has not yet been identified in apple. In this study, 229 apple MYB genes were identified through a genome-wide analysis and divided into 45 subgroups. A computational analysis was conducted using the apple genomic database to yield a complete overview of the MYB family, including the intron-exon organizations, the sequence features of the MYB DNA-binding domains, the carboxy-terminal motifs, and the chromosomal locations. Subsequently, the expression of 18 MYB genes, including 12 were chosen from stress-related subgroups, while another 6 ones from other subgroups, in response to various abiotic stresses was examined. It was found that several of these MYB genes, particularly MdoMYB121, were induced by multiple stresses. The MdoMYB121 was then further functionally characterized. Its predicted protein was found to be localized in the nucleus. A transgenic analysis indicated that the overexpression of the MdoMYB121 gene remarkably enhanced the tolerance to high salinity, drought, and cold stresses in transgenic tomato and apple plants. Our results indicate that the MYB genes are highly conserved in plant species and that MdoMYB121 can be used as a target gene in genetic engineering approaches to improve the tolerance of plants to multiple abiotic stresses. PMID:23950843

  16. Genome-Wide Identification, Classification, and Expression Analysis of Amino Acid Transporter Gene Family in Glycine Max.

    PubMed

    Cheng, Lin; Yuan, Hong-Yu; Ren, Ren; Zhao, Shi-Qi; Han, Ya-Peng; Zhou, Qi-Ying; Ke, Dan-Xia; Wang, Ying-Xiang; Wang, Lei

    2016-01-01

    Amino acid transporters (AATs) play important roles in transporting amino acid across cellular membranes and are essential for plant growth and development. To date, the AAT gene family in soybean (Glycine max L.) has not been characterized. In this study, we identified 189 AAT genes from the entire soybean genomic sequence, and classified them into 12 distinct subfamilies based upon their sequence composition and phylogenetic positions. To further investigate the functions of these genes, we analyzed the chromosome distributions, gene structures, duplication patterns, phylogenetic tree, tissue expression patterns of the 189 AAT genes in soybean. We found that a large number of AAT genes in soybean were expanded via gene duplication, 46 and 36 GmAAT genes were WGD/segmental and tandemly duplicated, respectively. Further comprehensive analyses of the expression profiles of GmAAT genes in various stages of vegetative and reproductive development showed that soybean AAT genes exhibited preferential or distinct expression patterns among different tissues. Overall, our study provides a framework for further analysis of the biological functions of AAT genes in either soybean or other crops.

  17. Genome-wide copy number scan identifies IRF6 involvement in Van der Woude syndrome in an Indian family.

    PubMed

    Manjegowda, Dinesh S; Prasad, Manu; Veerappa, Avinash M; Ramachandra, Nallur B

    2014-01-01

    Summary Van der Woude syndrome (VWS) is an autosomal dominant developmental malformation presenting with bilateral lower lip pits related to cleft lip, cleft palate and other malformations. We performed a whole-genome copy number variations (CNVs) scan in an Indian family with members suffering from VWS using 2·6 million combined SNP and CNV markers. We found CNVs affecting IRF6, a known candidate gene for VWS, in all three cases, while none of the non-VWS members showed any CNVs in the IRF6 region. The duplications and deletions of the chromosomal critical region in 1q32-q41 confirm the involvement of CNVs in IRF6 in South Indian VWS patients. Molecular network analysis of these and other cleft lip/palate related module genes suggests that they are associated with cytokine-mediated signalling pathways and response to interferon-gamma mediated signalling pathways. This is a maiden study indicating the involvement of CNVs in IRF6 in causing VWS in the Indian population. PMID:25579819

  18. Genome-Wide Identification and Expression Analyses of Aquaporin Gene Family during Development and Abiotic Stress in Banana.

    PubMed

    Hu, Wei; Hou, Xiaowan; Huang, Chao; Yan, Yan; Tie, Weiwei; Ding, Zehong; Wei, Yunxie; Liu, Juhua; Miao, Hongxia; Lu, Zhiwei; Li, Meiying; Xu, Biyu; Jin, Zhiqiang

    2015-01-01

    Aquaporins (AQPs) function to selectively control the flow of water and other small molecules through biological membranes, playing crucial roles in various biological processes. However, little information is available on the AQP gene family in bananas. In this study, we identified 47 banana AQP genes based on the banana genome sequence. Evolutionary analysis of AQPs from banana, Arabidopsis, poplar, and rice indicated that banana AQPs (MaAQPs) were clustered into four subfamilies. Conserved motif analysis showed that all banana AQPs contained the typical AQP-like or major intrinsic protein (MIP) domain. Gene structure analysis suggested the majority of MaAQPs had two to four introns with a highly specific number and length for each subfamily. Expression analysis of MaAQP genes during fruit development and postharvest ripening showed that some MaAQP genes exhibited high expression levels during these stages, indicating the involvement of MaAQP genes in banana fruit development and ripening. Additionally, some MaAQP genes showed strong induction after stress treatment and therefore, may represent potential candidates for improving banana resistance to abiotic stress. Taken together, this study identified some excellent tissue-specific, fruit development- and ripening-dependent, and abiotic stress-responsive candidate MaAQP genes, which could lay a solid foundation for genetic improvement of banana cultivars. PMID:26307965

  19. Genome-wide characterization of the β-1,3-glucanase gene family in Gossypium by comparative analysis.

    PubMed

    Xu, Xiaoyang; Feng, Yue; Fang, Shuai; Xu, Jun; Wang, Xinyu; Guo, Wangzhen

    2016-01-01

    The β-1,3-glucanase gene family is involved in a wide range of plant developmental processes as well as pathogen defense mechanisms. Comprehensive analyses of β-1,3-glucanase genes (GLUs) have not been reported in cotton. Here, we identified 67, 68, 130 and 158 GLUs in four sequenced cotton species, G. raimondii (D5), G. arboreum (A2), G. hirsutum acc. TM-1 (AD1), and G. barbadense acc. 3-79 (AD2), respectively. Cotton GLUs can be classified into the eight subfamilies (A-H), and their protein domain architecture and intron/exon structure are relatively conserved within each subfamily. Sixty-seven GLUs in G. raimondii were anchored onto 13 chromosomes, with 27 genes involved in segmental duplications, and 13 in tandem duplications. Expression patterns showed highly developmental and spatial regulation of GLUs in TM-1. In particular, the expression of individual member of GLUs in subfamily E was limited to roots, leaves, floral organs or fibers. Members of subfamily E also showed more protein evolution and subgenome expression bias compared with members of other subfamilies. We clarified that GLU42 and GLU43 in subfamily E were preferentially expressed in root and leaf tissues and significantly upregulated after Verticillium dahliae inoculation. Silencing of GLU42 and GLU43 significantly increased the susceptibility of cotton to V. dahliae. PMID:27353015

  20. Genome-Wide Identification and Expression Analyses of Aquaporin Gene Family during Development and Abiotic Stress in Banana

    PubMed Central

    Hu, Wei; Hou, Xiaowan; Huang, Chao; Yan, Yan; Tie, Weiwei; Ding, Zehong; Wei, Yunxie; Liu, Juhua; Miao, Hongxia; Lu, Zhiwei; Li, Meiying; Xu, Biyu; Jin, Zhiqiang

    2015-01-01

    Aquaporins (AQPs) function to selectively control the flow of water and other small molecules through biological membranes, playing crucial roles in various biological processes. However, little information is available on the AQP gene family in bananas. In this study, we identified 47 banana AQP genes based on the banana genome sequence. Evolutionary analysis of AQPs from banana, Arabidopsis, poplar, and rice indicated that banana AQPs (MaAQPs) were clustered into four subfamilies. Conserved motif analysis showed that all banana AQPs contained the typical AQP-like or major intrinsic protein (MIP) domain. Gene structure analysis suggested the majority of MaAQPs had two to four introns with a highly specific number and length for each subfamily. Expression analysis of MaAQP genes during fruit development and postharvest ripening showed that some MaAQP genes exhibited high expression levels during these stages, indicating the involvement of MaAQP genes in banana fruit development and ripening. Additionally, some MaAQP genes showed strong induction after stress treatment and therefore, may represent potential candidates for improving banana resistance to abiotic stress. Taken together, this study identified some excellent tissue-specific, fruit development- and ripening-dependent, and abiotic stress-responsive candidate MaAQP genes, which could lay a solid foundation for genetic improvement of banana cultivars. PMID:26307965

  1. Genome-wide characterization of the β-1,3-glucanase gene family in Gossypium by comparative analysis

    PubMed Central

    Xu, Xiaoyang; Feng, Yue; Fang, Shuai; Xu, Jun; Wang, Xinyu; Guo, Wangzhen

    2016-01-01

    The β-1,3-glucanase gene family is involved in a wide range of plant developmental processes as well as pathogen defense mechanisms. Comprehensive analyses of β-1,3-glucanase genes (GLUs) have not been reported in cotton. Here, we identified 67, 68, 130 and 158 GLUs in four sequenced cotton species, G. raimondii (D5), G. arboreum (A2), G. hirsutum acc. TM-1 (AD1), and G. barbadense acc. 3–79 (AD2), respectively. Cotton GLUs can be classified into the eight subfamilies (A–H), and their protein domain architecture and intron/exon structure are relatively conserved within each subfamily. Sixty-seven GLUs in G. raimondii were anchored onto 13 chromosomes, with 27 genes involved in segmental duplications, and 13 in tandem duplications. Expression patterns showed highly developmental and spatial regulation of GLUs in TM-1. In particular, the expression of individual member of GLUs in subfamily E was limited to roots, leaves, floral organs or fibers. Members of subfamily E also showed more protein evolution and subgenome expression bias compared with members of other subfamilies. We clarified that GLU42 and GLU43 in subfamily E were preferentially expressed in root and leaf tissues and significantly upregulated after Verticillium dahliae inoculation. Silencing of GLU42 and GLU43 significantly increased the susceptibility of cotton to V. dahliae. PMID:27353015

  2. Genome-wide identification, classification, and analysis of heat shock transcription factor family in Chinese cabbage (Brassica rapa pekinensis).

    PubMed

    Huang, X Y; Tao, P; Li, B Y; Wang, W H; Yue, Z C; Lei, J L; Zhong, X M

    2015-03-27

    Chinese cabbage (Brassica rapa ssp. pekinensis) is one of the most important vegetable crops grown worldwide, and various methods exist for selection, propagation, and cultivation. The entire Chinese cabbage genome has been sequenced, and the heat shock transcription factor family (Hsfs) has been found to play a central role in plant growth and development and in the response to biotic and abiotic stress conditions, particularly in acquired thermotolerance. We analyzed heat tolerance mechanisms in Chinese cabbage. In this study, 30 Hsfs were identified from the Chinese cabbage genome database. The classification, phylogenetic reconstruction, chromosome distribution, conserved motifs, expression analysis, and interaction networks of the Hsfs were predicted and analyzed. Thirty BrHsfs were classified into 3 major classes (class A, B, and C) according to their structural characteristics and phylogenetic comparisons, and class A was further subdivided into 8 subclasses. Distribution mapping results showed that Hsf genes were located on 10 Chinese cabbage chromosomes. The expression profile indicated that Hsfs play differential roles in 5 organs in Chinese cabbage, and likely participate in the development of underground parts and regulation of reproductive growth. An orthologous gene interaction network was constructed, and included MBF1C, ROF1, TBP2, CDC2, and HSP70 5 genes, which are closely related to heat stress. Our results contribute to the understanding of the complexity of Hsfs in Chinese cabbage and provide a basis for further functional gene research.

  3. Genome-wide survey and expression analysis of the PUB family in Chinese cabbage (Brassica rapa ssp. pekinesis).

    PubMed

    Wang, Cheng; Duan, Weike; Riquicho, Ali RamuliMaquina; Jing, Zange; Liu, Tongkun; Hou, Xilin; Li, Ying

    2015-12-01

    U-box proteins are widely distributed among eukaryotic organisms and show a higher prevalence in plants than in other organisms. Plant U-box (PUB) proteins play crucial regulatory roles in various developmental and physiological processes. Previously, 64 and 77 PUB genes have been identified in Arabidopsis (Arabidopsis thaliana) and rice (Oryza sativa), respectively. In this study, 101 putative PUB genes were identified in the Chinese cabbage (Brassica rapa ssp. pekinensis line Chiifu-401-42) genome and compared with other 15 representative plants. By specific protein domains and a phylogenetic analysis, the B. rapa PUB (BrPUB) gene family was subdivided into 10 groups. Localization of BrPUB genes showed an uneven distribution on the ten chromosomes of B. rapa. The orthologous and co-orthologous PUB gene pairs were identified between B. rapa and A. thaliana. RNA-seq transcriptome data of different tissues revealed tissue-specific and differential expression profiles of the BrPUBs, and quantitative real-time PCR analysis showed inverse gene expression patterns of the BrPUB-ARMs in response to cold and heat stresses. Altogether, the identification, classification, phylogenetic analysis, chromosome distribution, conserved motifs, and expression patterns of BrPUBs were predicted and analysed. Importantly, this study of BrPUBs provides a rich resource that will aid in the determination of PUB functions in plant development.

  4. Genome-Wide Identification and Expression Analysis of the Mitogen-Activated Protein Kinase Gene Family in Cassava.

    PubMed

    Yan, Yan; Wang, Lianzhe; Ding, Zehong; Tie, Weiwei; Ding, Xupo; Zeng, Changying; Wei, Yunxie; Zhao, Hongliang; Peng, Ming; Hu, Wei

    2016-01-01

    Mitogen-activated protein kinases (MAPKs) play central roles in plant developmental processes, hormone signaling transduction, and responses to abiotic stress. However, no data are currently available about the MAPK family in cassava, an important tropical crop. Herein, 21 MeMAPK genes were identified from cassava. Phylogenetic analysis indicated that MeMAPKs could be classified into four subfamilies. Gene structure analysis demonstrated that the number of introns in MeMAPK genes ranged from 1 to 10, suggesting large variation among cassava MAPK genes. Conserved motif analysis indicated that all MeMAPKs had typical protein kinase domains. Transcriptomic analysis suggested that MeMAPK genes showed differential expression patterns in distinct tissues and in response to drought stress between wild subspecies and cultivated varieties. Interaction networks and co-expression analyses revealed that crucial pathways controlled by MeMAPK networks may be involved in the differential response to drought stress in different accessions of cassava. Expression of nine selected MAPK genes showed that these genes could comprehensively respond to osmotic, salt, cold, oxidative stressors, and abscisic acid (ABA) signaling. These findings yield new insights into the transcriptional control of MAPK gene expression, provide an improved understanding of abiotic stress responses and signaling transduction in cassava, and lead to potential applications in the genetic improvement of cassava cultivars.

  5. Genome-wide survey and expression analysis of the calcium-dependent protein kinase gene family in cassava.

    PubMed

    Hu, Wei; Hou, Xiaowan; Xia, Zhiqiang; Yan, Yan; Wei, Yunxie; Wang, Lianzhe; Zou, Meiling; Lu, Cheng; Wang, Wenquan; Peng, Ming

    2016-02-01

    Calcium-dependent protein kinases (CPKs) play important roles in regulating plant tolerance to abiotic stress and signal transduction; however, no data are currently available regarding the CPK family in cassava. Herein, we identified 27 CPK genes from cassava based on our previous genome sequencing data. Phylogenetic analysis showed that cassava CPKs could be clustered into three groups, which was further supported by gene structure and conserved protein motif analyses. Global expression analysis suggested that MeCPK genes showed distinct expression patterns in different tissues between wild subspecies and cultivated varieties, indicating their involvement in the functional diversity of different varieties. Transcriptomics, interaction networks, and co-expression assays revealed a broad transcriptional response of cassava CPKs and CPK-mediated networks to drought stress and their differential expression profiles in different varieties, implying their contribution to drought stress tolerance in cassava. Expression analysis of eight MeCPK genes suggested a comprehensive response to osmotic stress, salt, cold, abscisic acid, and H2O2, which indicated that cassava CPKs might be convergence points for different signaling pathways. This study provides a basis for crop improvements and understanding of abiotic stress responses and signal transduction mediated by CPKs in cassava.

  6. Genome-wide analysis of the cyclin family in Arabidopsis and comparative phylogenetic analysis of plant cyclin-like proteins.

    PubMed

    Wang, Guanfang; Kong, Hongzhi; Sun, Yujin; Zhang, Xiaohong; Zhang, Wei; Altman, Naomi; DePamphilis, Claude W; Ma, Hong

    2004-06-01

    Cyclins are primary regulators of the activity of cyclin-dependent kinases, which are known to play critical roles in controlling eukaryotic cell cycle progression. While there has been extensive research on cell cycle mechanisms and cyclin function in animals and yeasts, only a small number of plant cyclins have been characterized functionally. In this paper, we describe an exhaustive search for cyclin genes in the Arabidopsis genome and among available sequences from other vascular plants. Based on phylogenetic analysis, we define 10 classes of plant cyclins, four of which are plant-specific, and a fifth is shared between plants and protists but not animals. Microarray and reverse transcriptase-polymerase chain reaction analyses further provide expression profiles of cyclin genes in different tissues of wild-type Arabidopsis plants. Comparative phylogenetic studies of 174 plant cyclins were also performed. The phylogenetic results imply that the cyclin gene family in plants has experienced more gene duplication events than in animals. Expression patterns and phylogenetic analyses of Arabidopsis cyclin genes suggest potential gene redundancy among members belonging to the same group. We discuss possible divergence and conservation of some plant cyclins. Our study provides an opportunity to rapidly assess the position of plant cyclin genes in terms of evolution and classification, serving as a guide for further functional study of plant cyclins.

  7. Genome-wide scan for quantitative trait loci influencing LDL size and plasma triglyceride in familial hypertriglyceridemia.

    PubMed

    Austin, Melissa A; Edwards, Karen L; Monks, Stephanie A; Koprowicz, Kent M; Brunzell, John D; Motulsky, Arno G; Mahaney, Michael C; Hixson, James E

    2003-11-01

    Small, dense LDLs and hypertriglyceridemia, two highly correlated and genetically influenced risk factors, are known to predict for risk of coronary heart disease. The objective of this study was to perform a whole-genome scan for linkage to LDL size and triglyceride (TG) levels in 26 kindreds with familial hypertriglyceridemia (FHTG). LDL size was estimated using gradient gel electrophoresis, and genotyping was performed for 355 autosomal markers with an average heterozygosity of 76% and an average spacing of 10.2 centimorgans (cMs). Using variance components linkage analysis, one possible linkage was found for LDL size [logarithm of odds (LOD) = 2.1] on chromosome 6, peak at 140 cM distal to marker F13A1 (closest marker D6S2436). With adjustment for TG and/or HDL cholesterol, the LOD scores were reduced, but remained in exactly the same location. For TG, LOD scores of 2.56 and 2.44 were observed at two locations on chromosome 15, with peaks at 29 and 61 cM distal to marker D15S822 (closest markers D15S643 and D15S211, respectively). These peaks were retained with adjustment for LDL size and/or HDL cholesterol. These findings, if confirmed, suggest that LDL particle size and plasma TG levels could be caused by two different genetic loci in FHTG.

  8. Genome-Wide Identification, Classification, and Expression Analysis of Amino Acid Transporter Gene Family in Glycine Max

    PubMed Central

    Cheng, Lin; Yuan, Hong-Yu; Ren, Ren; Zhao, Shi-Qi; Han, Ya-Peng; Zhou, Qi-Ying; Ke, Dan-Xia; Wang, Ying-Xiang; Wang, Lei

    2016-01-01

    Amino acid transporters (AATs) play important roles in transporting amino acid across cellular membranes and are essential for plant growth and development. To date, the AAT gene family in soybean (Glycine max L.) has not been characterized. In this study, we identified 189 AAT genes from the entire soybean genomic sequence, and classified them into 12 distinct subfamilies based upon their sequence composition and phylogenetic positions. To further investigate the functions of these genes, we analyzed the chromosome distributions, gene structures, duplication patterns, phylogenetic tree, tissue expression patterns of the 189 AAT genes in soybean. We found that a large number of AAT genes in soybean were expanded via gene duplication, 46 and 36 GmAAT genes were WGD/segmental and tandemly duplicated, respectively. Further comprehensive analyses of the expression profiles of GmAAT genes in various stages of vegetative and reproductive development showed that soybean AAT genes exhibited preferential or distinct expression patterns among different tissues. Overall, our study provides a framework for further analysis of the biological functions of AAT genes in either soybean or other crops. PMID:27148336

  9. Genome-Wide Identification and Expression Analysis of the Mitogen-Activated Protein Kinase Gene Family in Cassava

    PubMed Central

    Yan, Yan; Wang, Lianzhe; Ding, Zehong; Tie, Weiwei; Ding, Xupo; Zeng, Changying; Wei, Yunxie; Zhao, Hongliang; Peng, Ming; Hu, Wei

    2016-01-01

    Mitogen-activated protein kinases (MAPKs) play central roles in plant developmental processes, hormone signaling transduction, and responses to abiotic stress. However, no data are currently available about the MAPK family in cassava, an important tropical crop. Herein, 21 MeMAPK genes were identified from cassava. Phylogenetic analysis indicated that MeMAPKs could be classified into four subfamilies. Gene structure analysis demonstrated that the number of introns in MeMAPK genes ranged from 1 to 10, suggesting large variation among cassava MAPK genes. Conserved motif analysis indicated that all MeMAPKs had typical protein kinase domains. Transcriptomic analysis suggested that MeMAPK genes showed differential expression patterns in distinct tissues and in response to drought stress between wild subspecies and cultivated varieties. Interaction networks and co-expression analyses revealed that crucial pathways controlled by MeMAPK networks may be involved in the differential response to drought stress in different accessions of cassava. Expression of nine selected MAPK genes showed that these genes could comprehensively respond to osmotic, salt, cold, oxidative stressors, and abscisic acid (ABA) signaling. These findings yield new insights into the transcriptional control of MAPK gene expression, provide an improved understanding of abiotic stress responses and signaling transduction in cassava, and lead to potential applications in the genetic improvement of cassava cultivars.

  10. Genome-Wide Identification and Expression Analysis of the Mitogen-Activated Protein Kinase Gene Family in Cassava

    PubMed Central

    Yan, Yan; Wang, Lianzhe; Ding, Zehong; Tie, Weiwei; Ding, Xupo; Zeng, Changying; Wei, Yunxie; Zhao, Hongliang; Peng, Ming; Hu, Wei

    2016-01-01

    Mitogen-activated protein kinases (MAPKs) play central roles in plant developmental processes, hormone signaling transduction, and responses to abiotic stress. However, no data are currently available about the MAPK family in cassava, an important tropical crop. Herein, 21 MeMAPK genes were identified from cassava. Phylogenetic analysis indicated that MeMAPKs could be classified into four subfamilies. Gene structure analysis demonstrated that the number of introns in MeMAPK genes ranged from 1 to 10, suggesting large variation among cassava MAPK genes. Conserved motif analysis indicated that all MeMAPKs had typical protein kinase domains. Transcriptomic analysis suggested that MeMAPK genes showed differential expression patterns in distinct tissues and in response to drought stress between wild subspecies and cultivated varieties. Interaction networks and co-expression analyses revealed that crucial pathways controlled by MeMAPK networks may be involved in the differential response to drought stress in different accessions of cassava. Expression of nine selected MAPK genes showed that these genes could comprehensively respond to osmotic, salt, cold, oxidative stressors, and abscisic acid (ABA) signaling. These findings yield new insights into the transcriptional control of MAPK gene expression, provide an improved understanding of abiotic stress responses and signaling transduction in cassava, and lead to potential applications in the genetic improvement of cassava cultivars. PMID:27625666

  11. Genome-wide survey and expression analysis of the PUB family in Chinese cabbage (Brassica rapa ssp. pekinesis).

    PubMed

    Wang, Cheng; Duan, Weike; Riquicho, Ali RamuliMaquina; Jing, Zange; Liu, Tongkun; Hou, Xilin; Li, Ying

    2015-12-01

    U-box proteins are widely distributed among eukaryotic organisms and show a higher prevalence in plants than in other organisms. Plant U-box (PUB) proteins play crucial regulatory roles in various developmental and physiological processes. Previously, 64 and 77 PUB genes have been identified in Arabidopsis (Arabidopsis thaliana) and rice (Oryza sativa), respectively. In this study, 101 putative PUB genes were identified in the Chinese cabbage (Brassica rapa ssp. pekinensis line Chiifu-401-42) genome and compared with other 15 representative plants. By specific protein domains and a phylogenetic analysis, the B. rapa PUB (BrPUB) gene family was subdivided into 10 groups. Localization of BrPUB genes showed an uneven distribution on the ten chromosomes of B. rapa. The orthologous and co-orthologous PUB gene pairs were identified between B. rapa and A. thaliana. RNA-seq transcriptome data of different tissues revealed tissue-specific and differential expression profiles of the BrPUBs, and quantitative real-time PCR analysis showed inverse gene expression patterns of the BrPUB-ARMs in response to cold and heat stresses. Altogether, the identification, classification, phylogenetic analysis, chromosome distribution, conserved motifs, and expression patterns of BrPUBs were predicted and analysed. Importantly, this study of BrPUBs provides a rich resource that will aid in the determination of PUB functions in plant development. PMID:26054324

  12. Genome-wide identification, classification, and analysis of heat shock transcription factor family in Chinese cabbage (Brassica rapa pekinensis).

    PubMed

    Huang, X Y; Tao, P; Li, B Y; Wang, W H; Yue, Z C; Lei, J L; Zhong, X M

    2015-01-01

    Chinese cabbage (Brassica rapa ssp. pekinensis) is one of the most important vegetable crops grown worldwide, and various methods exist for selection, propagation, and cultivation. The entire Chinese cabbage genome has been sequenced, and the heat shock transcription factor family (Hsfs) has been found to play a central role in plant growth and development and in the response to biotic and abiotic stress conditions, particularly in acquired thermotolerance. We analyzed heat tolerance mechanisms in Chinese cabbage. In this study, 30 Hsfs were identified from the Chinese cabbage genome database. The classification, phylogenetic reconstruction, chromosome distribution, conserved motifs, expression analysis, and interaction networks of the Hsfs were predicted and analyzed. Thirty BrHsfs were classified into 3 major classes (class A, B, and C) according to their structural characteristics and phylogenetic comparisons, and class A was further subdivided into 8 subclasses. Distribution mapping results showed that Hsf genes were located on 10 Chinese cabbage chromosomes. The expression profile indicated that Hsfs play differential roles in 5 organs in Chinese cabbage, and likely participate in the development of underground parts and regulation of reproductive growth. An orthologous gene interaction network was constructed, and included MBF1C, ROF1, TBP2, CDC2, and HSP70 5 genes, which are closely related to heat stress. Our results contribute to the understanding of the complexity of Hsfs in Chinese cabbage and provide a basis for further functional gene research. PMID:25867366

  13. Genome-Wide Identification and Expression Analysis of the Mitogen-Activated Protein Kinase Gene Family in Cassava.

    PubMed

    Yan, Yan; Wang, Lianzhe; Ding, Zehong; Tie, Weiwei; Ding, Xupo; Zeng, Changying; Wei, Yunxie; Zhao, Hongliang; Peng, Ming; Hu, Wei

    2016-01-01

    Mitogen-activated protein kinases (MAPKs) play central roles in plant developmental processes, hormone signaling transduction, and responses to abiotic stress. However, no data are currently available about the MAPK family in cassava, an important tropical crop. Herein, 21 MeMAPK genes were identified from cassava. Phylogenetic analysis indicated that MeMAPKs could be classified into four subfamilies. Gene structure analysis demonstrated that the number of introns in MeMAPK genes ranged from 1 to 10, suggesting large variation among cassava MAPK genes. Conserved motif analysis indicated that all MeMAPKs had typical protein kinase domains. Transcriptomic analysis suggested that MeMAPK genes showed differential expression patterns in distinct tissues and in response to drought stress between wild subspecies and cultivated varieties. Interaction networks and co-expression analyses revealed that crucial pathways controlled by MeMAPK networks may be involved in the differential response to drought stress in different accessions of cassava. Expression of nine selected MAPK genes showed that these genes could comprehensively respond to osmotic, salt, cold, oxidative stressors, and abscisic acid (ABA) signaling. These findings yield new insights into the transcriptional control of MAPK gene expression, provide an improved understanding of abiotic stress responses and signaling transduction in cassava, and lead to potential applications in the genetic improvement of cassava cultivars. PMID:27625666

  14. Genome-wide identification, evolution and expression analysis of the grape (Vitis vinifera L.) zinc finger-homeodomain gene family.

    PubMed

    Wang, Hao; Yin, Xiangjing; Li, Xiaoqin; Wang, Li; Zheng, Yi; Xu, Xiaozhao; Zhang, Yucheng; Wang, Xiping

    2014-04-03

    Plant zinc finger-homeodomain (ZHD) genes encode a family of transcription factors that have been demonstrated to play an important role in the regulation of plant growth and development. In this study, we identified a total of 13 ZHD genes (VvZHD) in the grape genome that were further classified into at least seven groups. Genome synteny analysis revealed that a number of VvZHD genes were present in the corresponding syntenic blocks of Arabidopsis, indicating that they arose before the divergence of these two species. Gene expression analysis showed that the identified VvZHD genes displayed distinct spatiotemporal expression patterns, and were differentially regulated under various stress conditions and hormone treatments, suggesting that the grape VvZHDs might be also involved in plant response to a variety of biotic and abiotic insults. Our work provides insightful information and knowledge about the ZHD genes in grape, which provides a framework for further characterization of their roles in regulation of stress tolerance as well as other aspects of grape productivity.

  15. Genome-Wide Identification and Evolution Analysis of Trehalose-6-Phosphate Synthase Gene Family in Nelumbo nucifera

    PubMed Central

    Jin, Qijiang; Hu, Xin; Li, Xin; Wang, Bei; Wang, Yanjie; Jiang, Hongwei; Mattson, Neil; Xu, Yingchun

    2016-01-01

    Trehalose-6-phosphate synthase (TPS) plays a key role in plant carbohydrate metabolism and the perception of carbohydrate availability. In the present work, the publicly available Nelumbo nucifera (lotus) genome sequence database was analyzed which led to identification of nine lotus TPS genes (NnTPS). It was found that at least two introns are included in the coding sequences of NnTPS genes. When the motif compositions were analyzed we found that NnTPS generally shared the similar motifs, implying that they have similar functions. The dN/dS ratios were always less than 1 for different domains and regions outside domains, suggesting purifying selection on the lotus TPS gene family. The regions outside TPS domain evolved relatively faster than NnTPS domains. A phylogenetic tree was constructed using all predicted coding sequences of lotus TPS genes, together with those from Arabidopsis, poplar, soybean, and rice. The result indicated that those TPS genes could be clearly divided into two main subfamilies (I-II), where each subfamily could be further divided into 2 (I) and 5 (II) subgroups. Analyses of divergence and adaptive evolution show that purifying selection may have been the main force driving evolution of plant TPS genes. Some of the critical sites that contributed to divergence may have been under positive selection. Transcriptome data analysis revealed that most NnTPS genes were predominantly expressed in sink tissues. Expression pattern of NnTPS genes under copper and submergence stress indicated that NNU_014679 and NNU_022788 might play important roles in lotus energy metabolism and participate in stress response. Our results can facilitate further functional studies of TPS genes in lotus. PMID:27746792

  16. Genome-Wide Function, Evolutionary Characterization and Expression Analysis of Sugar Transporter Family Genes in Pear (Pyrus bretschneideri Rehd).

    PubMed

    Li, Jia-Ming; Zheng, Dan-man; Li, Lei-ting; Qiao, Xin; Wei, Shu-wei; Bai, Bin; Zhang, Shao-ling; Wu, Jun

    2015-09-01

    The sugar transporter (ST) plays an important role in plant growth, development and fruit quality. In this study, a total of 75 ST genes were identified in the pear (Pyrus bretschneideri Rehd) genome based on systematic analysis. Furthermore, all ST genes identified were grouped into eight subfamilies according to conserved domains and phylogenetic analysis. Analysis of cis-regulatory element sequences of all ST genes identified the MYBCOREATCYCB1 promoter in sucrose transporter (SUT) and monosaccharide transporter (MST) genes of pear, while in grape it is exclusively found in SUT subfamily members, indicating divergent transcriptional regulation in different species. Gene duplication event analysis indicated that whole-genome duplication (WGD) and segmental duplication play key roles in ST gene amplification, followed by tandem duplication. Estimation of positive selection at codon sites of ST paralog pairs indicated that all plastidic glucose translocator (pGlcT) subfamily members have evolved under positive selection. In addition, the evolutionary history of ST gene duplications indicated that the ST genes have experienced significant expansion in the whole ST gene family after the second WGD, especially after apple and pear divergence. According to the global RNA sequencing results of pear fruit development, gene expression profiling showed the expression of 53 STs. Combined with quantitative real-time PCR (qRT-PCR) analysis, two polyol/monosaccharide transporter (PLT) and three tonoplast monosaccharide transporter (tMT) members were identified as candidate genes, which may play important roles in sugar accumulation during pear fruit development and ripening. Identification of highly expressed STs in fruit is important for finding novel genes contributing to enhanced levels of sugar content in pear fruit. PMID:26079674

  17. Genome-wide identification and expression analysis of the WRKY gene family in common tobacco (Nicotiana tabacum L.).

    PubMed

    Xiaohua, Xiang; Xinru, Wu; Jiangtao, Chao; Minglei, Yang; Fan, Yang; Guo, Chen; Guanshan, Liu; Yuanying, Wang

    2016-09-01

    The coding products of WRKY gene family plays important roles in plant growth and development as well as in various stress responses. They have been identified in various plants, but only few in common tobacco (Nicotiana tabacum L.). In this study, 164 putative WRKY proteins in the common tobacco genome were identified by using the conserved WRKY sequence (PF03106) from the Pfam database. Phylogenetic trees, functional domain analysis, chromosomal localization, subcellular localization and tissue expression patterns were analyzed with the bioinformatics softwares, including DNAMAN 5.0, Weblogo 3, MEGA 5.1, MG2C and MEME. First of all, phylogenetic trees divided all the candidate genes into three subfamilies: Ⅰ, Ⅱ and Ⅲ, respectively, and subfamily Ⅱ could be further divided into five subgroups: group Ⅱ-a, -b, -c, -d and -e. Secondly, the WRKY regions contained a highly conserved heptapeptide stretch WRKYGQK followed by a zinc-finger motif. Most of the NtWRKY genes contained 2-5 exons and a highly conserved gene structure. Thirdly, 154 out of 164 NtWRKY genes were distributed with different densities on 24 chromosomes, and each subfamily with different patterns and frequency. The largest number of NtWRKY genes was found on chromosome VI, and only one on chromosome X. Fourthly, the majority of NtWRKY members located in the nucleus, with 74 percent of subfamily Ⅲ in the extracellular matrix. Lastly, the members in the same subfamily had different spatial and temporal expression profiles, with 11 NtWRKY genes in roots, stems and leaves expressed at various levels. The expression of genes NtWRKY26, NtWRKY30 and NtWRKY32 can be induced by Phytophthora nicotianae. Our research thus provides valuable information for NtWRKY gene cloning and functional characterization in common tobacco. PMID:27644745

  18. Genome-Wide Survey of the Soybean GATA Transcription Factor Gene Family and Expression Analysis under Low Nitrogen Stress

    PubMed Central

    Zhang, Chanjuan; Hou, Yuqing; Hao, Qingnan; Chen, Haifeng; Chen, Limiao; Yuan, Songli; Shan, Zhihui; Zhang, Xiaojuan; Yang, Zhonglu; Qiu, Dezhen; Zhou, Xinan; Huang, Wenjun

    2015-01-01

    GATA transcription factors are transcriptional regulatory proteins that contain a characteristic type-IV zinc finger DNA-binding domain and recognize the conserved GATA motif in the promoter sequence of target genes. Previous studies demonstrated that plant GATA factors possess critical functions in developmental control and responses to the environment. To date, the GATA factors in soybean (Glycine max) have yet to be characterized. Thus, this study identified 64 putative GATA factors from the entire soybean genomic sequence. The chromosomal distributions, gene structures, duplication patterns, phylogenetic tree, tissue expression patterns, and response to low nitrogen stress of the 64 GATA factors in soybean were analyzed to further investigate the functions of these factors. Results indicated that segmental duplication predominantly contributed to the expansion of the GATA factor gene family in soybean. These GATA proteins were phylogenetically clustered into four distinct subfamilies, wherein their gene structure and motif compositions were considerably conserved. A comparative phylogenetic analysis of the GATA factor zinc finger domain sequences in soybean, Arabidopsis (Arabidopsis thaliana), and rice (Oryza sativa) revealed four major classes. The GATA factors in soybean exhibited expression diversity among different tissues; some of these factors showed tissue-specific expression patterns. Numerous GATA factors displayed upregulation or downregulation in soybean leaf in response to low nitrogen stress, and two GATA factors GATA44 and GATA58 were likely to be involved in the regulation of nitrogen metabolism in soybean. Overexpression of GmGATA44 complemented the reduced chlorophyll phenotype of the Arabidopsis ortholog AtGATA21 mutant, implying that GmGATA44 played an important role in modulating chlorophyll biosynthesis. Overall, our study provides useful information for the further analysis of the biological functions of GATA factors in soybean and other

  19. Genome-Wide Function, Evolutionary Characterization and Expression Analysis of Sugar Transporter Family Genes in Pear (Pyrus bretschneideri Rehd).

    PubMed

    Li, Jia-Ming; Zheng, Dan-man; Li, Lei-ting; Qiao, Xin; Wei, Shu-wei; Bai, Bin; Zhang, Shao-ling; Wu, Jun

    2015-09-01

    The sugar transporter (ST) plays an important role in plant growth, development and fruit quality. In this study, a total of 75 ST genes were identified in the pear (Pyrus bretschneideri Rehd) genome based on systematic analysis. Furthermore, all ST genes identified were grouped into eight subfamilies according to conserved domains and phylogenetic analysis. Analysis of cis-regulatory element sequences of all ST genes identified the MYBCOREATCYCB1 promoter in sucrose transporter (SUT) and monosaccharide transporter (MST) genes of pear, while in grape it is exclusively found in SUT subfamily members, indicating divergent transcriptional regulation in different species. Gene duplication event analysis indicated that whole-genome duplication (WGD) and segmental duplication play key roles in ST gene amplification, followed by tandem duplication. Estimation of positive selection at codon sites of ST paralog pairs indicated that all plastidic glucose translocator (pGlcT) subfamily members have evolved under positive selection. In addition, the evolutionary history of ST gene duplications indicated that the ST genes have experienced significant expansion in the whole ST gene family after the second WGD, especially after apple and pear divergence. According to the global RNA sequencing results of pear fruit development, gene expression profiling showed the expression of 53 STs. Combined with quantitative real-time PCR (qRT-PCR) analysis, two polyol/monosaccharide transporter (PLT) and three tonoplast monosaccharide transporter (tMT) members were identified as candidate genes, which may play important roles in sugar accumulation during pear fruit development and ripening. Identification of highly expressed STs in fruit is important for finding novel genes contributing to enhanced levels of sugar content in pear fruit.

  20. The ARF, AUX/IAA and GH3 gene families in citrus: genome-wide identification and expression analysis during fruitlet drop from abscission zone A.

    PubMed

    Xie, Rangjin; Pang, Shaoping; Ma, Yanyan; Deng, Lie; He, Shaolan; Yi, Shilai; Lv, Qiang; Zheng, Yongqiang

    2015-12-01

    Completion of the whole genome sequencing of citrus enabled us to perform genome-wide identification and functional analysis of the gene families involved in agronomic traits and morphological diversity of citrus. In this study, 22 CitARF, 11 CitGH3 and 26 CitAUX/IAA genes were identified in citrus, respectively. Phylogenetic analysis revealed that all the genes of each gene family could be subdivided into three groups and showed strong evolutionary conservation. The GH3 and AUX/IAA gene families shrank and ARF gene family was highly conserved in the citrus genome after speciation from Arabidopsis thaliana. Tissue-specific expression profiles revealed that 54 genes were expressed in at least one tissue while just 5 genes including CitARF07, CitARF20, CitGH3.04, CitAUX/IAA25 and CitAUX/IAA26 with very low expression level in all tissues tested, suggesting that the CitARF, CitGH3 and CitAUX/IAA gene families played important roles in the development of citrus organs. In addition, our data found that the expression of 2 CitARF, 4 CitGH3 and 4 AUX/IAA genes was affected by IAA treatment, and 7 genes including, CitGH3.04, CitGH3.07, CitAUX/IAA03, CitAUX/IAA04, CitAUX/IAA18, CitAUX/IAA19 and CitAUX/IAA23 were related to fruitlet abscission. This study provides a foundation for future studies on elucidating the precise role of citrus ARF, GH3 and AUX/IAA genes in early steps of auxin signal transduction and open up a new opportunity to uncover the molecular mechanism underlying citrus fruitlet abscission.

  1. The ARF, AUX/IAA and GH3 gene families in citrus: genome-wide identification and expression analysis during fruitlet drop from abscission zone A.

    PubMed

    Xie, Rangjin; Pang, Shaoping; Ma, Yanyan; Deng, Lie; He, Shaolan; Yi, Shilai; Lv, Qiang; Zheng, Yongqiang

    2015-12-01

    Completion of the whole genome sequencing of citrus enabled us to perform genome-wide identification and functional analysis of the gene families involved in agronomic traits and morphological diversity of citrus. In this study, 22 CitARF, 11 CitGH3 and 26 CitAUX/IAA genes were identified in citrus, respectively. Phylogenetic analysis revealed that all the genes of each gene family could be subdivided into three groups and showed strong evolutionary conservation. The GH3 and AUX/IAA gene families shrank and ARF gene family was highly conserved in the citrus genome after speciation from Arabidopsis thaliana. Tissue-specific expression profiles revealed that 54 genes were expressed in at least one tissue while just 5 genes including CitARF07, CitARF20, CitGH3.04, CitAUX/IAA25 and CitAUX/IAA26 with very low expression level in all tissues tested, suggesting that the CitARF, CitGH3 and CitAUX/IAA gene families played important roles in the development of citrus organs. In addition, our data found that the expression of 2 CitARF, 4 CitGH3 and 4 AUX/IAA genes was affected by IAA treatment, and 7 genes including, CitGH3.04, CitGH3.07, CitAUX/IAA03, CitAUX/IAA04, CitAUX/IAA18, CitAUX/IAA19 and CitAUX/IAA23 were related to fruitlet abscission. This study provides a foundation for future studies on elucidating the precise role of citrus ARF, GH3 and AUX/IAA genes in early steps of auxin signal transduction and open up a new opportunity to uncover the molecular mechanism underlying citrus fruitlet abscission. PMID:25982744

  2. Genome-Wide Linkage in a Highly Consanguineous Pedigree Reveals Two Novel Loci on Chromosome 7 for Non-Syndromic Familial Premature Ovarian Failure

    PubMed Central

    Caburet, Sandrine; Zavadakova, Petra; Ben-Neriah, Ziva; Bouhali, Kamal; Dipietromaria, Aurélie; Charon, Céline; Besse, Céline; Laissue, Paul; Chalifa-Caspi, Vered; Christin-Maitre, Sophie; Vaiman, Daniel; Levi, Giovanni; Veitia, Reiner A.; Fellous, Marc

    2012-01-01

    Background The human condition known as Premature Ovarian Failure (POF) is characterized by loss of ovarian function before the age of 40. A majority of POF cases are sporadic, but 10–15% are familial, suggesting a genetic origin of the disease. Although several causal mutations have been identified, the etiology of POF is still unknown for about 90% of the patients. Methodology/Principal Findings We report a genome-wide linkage and homozygosity analysis in one large consanguineous Middle-Eastern POF-affected family presenting an autosomal recessive pattern of inheritance. We identified two regions with a LODmax of 3.26 on chromosome 7p21.1-15.3 and 7q21.3-22.2, which are supported as candidate regions by homozygosity mapping. Sequencing of the coding exons and known regulatory sequences of three candidate genes (DLX5, DLX6 and DSS1) included within the largest region did not reveal any causal mutations. Conclusions/Significance We detect two novel POF-associated loci on human chromosome 7, opening the way to the identification of new genes involved in the control of ovarian development and function. PMID:22428046

  3. Genome-Wide Analysis of the Fasciclin-Like Arabinogalactan Protein Gene Family Reveals Differential Expression Patterns, Localization, and Salt Stress Response in Populus

    PubMed Central

    Zang, Lina; Zheng, Tangchun; Chu, Yanguang; Ding, Changjun; Zhang, Weixi; Huang, Qinjun; Su, Xiaohua

    2015-01-01

    Fasciclin-like arabinogalactan proteins (FLAs) are a subclass of arabinogalactan proteins (AGPs) involved in plant growth, development and response to abiotic stress. Although many studies have been performed to identify molecular functions of individual family members, little information is available on genome-wide identification and characterization of FLAs in the genus Populus. Based on genome-wide analysis, we have identified 35 Populus FLAs which were distributed on 16 chromosomes and phylogenetically clustered into four major groups. Gene structure and motif composition were relatively conserved in each group. All the members contained N-terminal signal peptide, 23 of which included predicted glycosylphosphatidylinositol (GPI) modification sites and were anchored to plasma membranes. Subcellular localization analysis showed that PtrFLA2/20/26 were localized in cell membrane and cytoplasm of protoplasts from Populus stem-differentiating xylem. The Ka/Ks ratios showed that purifying selection has played a leading role in the long-term evolutionary period which greatly maintained the function of this family. The expression profiles showed that 32 PtrFLAs were differentially expressed in four tissues at four seasons based on publicly available microarray data. 18 FLAs were further verified with qRT-PCR in different tissues, which indicated that PtrFLA1/2/3/7/11/12/20/21/22/24/26/30 were significantly expressed in male and female flowers, suggesting close correlations with the reproductive development. In addition, PtrFLA1/9/10/11/17/21/23/24/26/28 were highly expressed in the stems and differentiating xylem, which may be involved in stem development. To determine salt response of FLAs, qRT-PCR was performed to analyze the expression of 18 genes under salinity stress across two time points. Results demonstrated that all the 18 FLAs were expressed in root tissues; especially, PtrFLA2/12/20/21/24/30 were significantly induced at different time points. In summary

  4. A genome-wide association study for milk production traits in Danish Jersey cattle using a 50K single nucleotide polymorphism chip.

    PubMed

    Mai, M D; Sahana, G; Christiansen, F B; Guldbrandtsen, B

    2010-11-01

    Quantitative trait loci for milk production traits in Danish Jersey cattle were mapped by a genome-wide association analysis using a mixed model. The analysis incorporated 1,039 bulls and 33,090 SNP and resulted in 98 detected combinations of QTL and traits on 27 BTA. These QTL comprised 30 for milk index, 50 for fat index, and 18 for protein index. The evidence presents 33 genome-wide QTL on 14 BTA. Of these, 7 had effects on milk index, 21 on fat index, and 5 on protein index. Among the genome-wide QTL, 26 have been previously reported, 2 on BTA4 and BTA5 were new for milk index, and 5 on BTA4, BTA5, BTA13, BTA20, and BTA29 were new QTL for fat index. We found 7 pleiotropic or very closely linked QTL. Most of the QTL were associated with polymorphisms within narrow regions and several may represent the effects of polymorphisms of genes: DGAT1, casein, ARFGAP3, CYP11B1, and CDC-like kinase 4. By a chromosome-wide threshold, 65 additional QTL were detected. Many of them are likely to represent QTL. The results are interesting from a breeding perspective and contribute to the search for the genes causing the polymorphisms important for milk production traits.

  5. Genome-Wide Study of the Tomato SlMLO Gene Family and Its Functional Characterization in Response to the Powdery Mildew Fungus Oidium neolycopersici.

    PubMed

    Zheng, Zheng; Appiano, Michela; Pavan, Stefano; Bracuto, Valentina; Ricciardi, Luigi; Visser, Richard G F; Wolters, Anne-Marie A; Bai, Yuling

    2016-01-01

    The MLO (Mildew Locus O) gene family encodes plant-specific proteins containing seven transmembrane domains and likely acting in signal transduction in a calcium and calmodulin dependent manner. Some members of the MLO family are susceptibility factors toward fungi causing the powdery mildew disease. In tomato, for example, the loss-of-function of the MLO gene SlMLO1 leads to a particular form of powdery mildew resistance, called ol-2, which arrests almost completely fungal penetration. This type of penetration resistance is characterized by the apposition of papillae at the sites of plant-pathogen interaction. Other MLO homologs in Arabidopsis regulate root response to mechanical stimuli (AtMLO4 and AtMLO11) and pollen tube reception by the female gametophyte (AtMLO7). However, the role of most MLO genes remains unknown. In this work, we provide a genome-wide study of the tomato SlMLO gene family. Besides SlMLO1, other 15 SlMLO homologs were identified and characterized with respect to their structure, genomic organization, phylogenetic relationship, and expression profile. In addition, by analysis of transgenic plants, we demonstrated that simultaneous silencing of SlMLO1 and two of its closely related homologs, SlMLO5 and SlMLO8, confer higher level of resistance than the one associated with the ol-2 mutation. The outcome of this study provides evidence for functional redundancy among tomato homolog genes involved in powdery mildew susceptibility. Moreover, we developed a series of transgenic lines silenced for individual SlMLO homologs, which lay the foundation for further investigations aimed at assigning new biological functions to the MLO gene family. PMID:27579028

  6. Genome-wide identification of sweet orange (Citrus sinensis) histone modification gene families and their expression analysis during the fruit development and fruit-blue mold infection process.

    PubMed

    Xu, Jidi; Xu, Haidan; Liu, Yuanlong; Wang, Xia; Xu, Qiang; Deng, Xiuxin

    2015-01-01

    In eukaryotes, histone acetylation and methylation have been known to be involved in regulating diverse developmental processes and plant defense. These histone modification events are controlled by a series of histone modification gene families. To date, there is no study regarding genome-wide characterization of histone modification related genes in citrus species. Based on the two recent sequenced sweet orange genome databases, a total of 136 CsHMs (Citrus sinensis histone modification genes), including 47 CsHMTs (histone methyltransferase genes), 23 CsHDMs (histone demethylase genes), 50 CsHATs (histone acetyltransferase genes), and 16 CsHDACs (histone deacetylase genes) were identified. These genes were categorized to 11 gene families. A comprehensive analysis of these 11 gene families was performed with chromosome locations, phylogenetic comparison, gene structures, and conserved domain compositions of proteins. In order to gain an insight into the potential roles of these genes in citrus fruit development, 42 CsHMs with high mRNA abundance in fruit tissues were selected to further analyze their expression profiles at six stages of fruit development. Interestingly, a numbers of genes were expressed highly in flesh of ripening fruit and some of them showed the increasing expression levels along with the fruit development. Furthermore, we analyzed the expression patterns of all 136 CsHMs response to the infection of blue mold (Penicillium digitatum), which is the most devastating pathogen in citrus post-harvest process. The results indicated that 20 of them showed the strong alterations of their expression levels during the fruit-pathogen infection. In conclusion, this study presents a comprehensive analysis of the histone modification gene families in sweet orange and further elucidates their behaviors during the fruit development and the blue mold infection responses.

  7. Genome-wide association study for colorectal cancer identifies risk polymorphisms in German familial cases and implicates MAPK signalling pathways in disease susceptibility.

    PubMed

    Lascorz, Jesús; Försti, Asta; Chen, Bowang; Buch, Stephan; Steinke, Verena; Rahner, Nils; Holinski-Feder, Elke; Morak, Monika; Schackert, Hans K; Görgens, Heike; Schulmann, Karsten; Goecke, Timm; Kloor, Matthias; Engel, Cristoph; Büttner, Reinhard; Kunkel, Nelli; Weires, Marianne; Hoffmeister, Michael; Pardini, Barbara; Naccarati, Alessio; Vodickova, Ludmila; Novotny, Jan; Schreiber, Stefan; Krawczak, Michael; Bröring, Clemens D; Völzke, Henry; Schafmayer, Clemens; Vodicka, Pavel; Chang-Claude, Jenny; Brenner, Hermann; Burwinkel, Barbara; Propping, Peter; Hampe, Jochen; Hemminki, Kari

    2010-09-01

    Genetic susceptibility accounts for approximately 35% of all colorectal cancer (CRC). Ten common low-risk variants contributing to CRC risk have been identified through genome-wide association studies (GWASs). In our GWAS, 610 664 genotyped single-nucleotide polymorphisms (SNPs) passed the quality control filtering in 371 German familial CRC patients and 1263 controls, and replication studies were conducted in four additional case-control sets (4915 cases and 5607 controls). Known risk loci at 8q24.21 and 11q23 were confirmed, and a previously unreported association, rs12701937, located between the genes GLI3 (GLI family zinc finger 3) and INHBA (inhibin, beta A) [P = 1.1 x 10(-3), odds ratio (OR) 1.14, 95% confidence interval (CI) 1.05-1.23, dominant model in the combined cohort], was identified. The association was stronger in familial cases compared with unselected cases (P = 2.0 x 10(-4), OR 1.36, 95% CI 1.16-1.60, dominant model). Two other unreported SNPs, rs6038071, 40 kb upstream of CSNK2A1 (casein kinase 2, alpha 1 polypeptide) and an intronic marker in MYO3A (myosin IIIA), rs11014993, associated with CRC only in the familial CRC cases (P = 2.5 x 10(-3), recessive model, and P = 2.7 x 10(-4), dominant model). Three software tools successfully pointed to the overrepresentation of genes related to the mitogen-activated protein kinase (MAPK) signalling pathways among the 1340 most strongly associated markers from the GWAS (allelic P value < 10(-3)). The risk of CRC increased significantly with an increasing number of risk alleles in seven genes involved in MAPK signalling events (P(trend) = 2.2 x 10(-16), OR(per allele) = 1.34, 95% CI 1.11-1.61).

  8. Genome-wide study of familial juvenile hyperuricaemic (gouty) nephropathy (FJHN) indicates a new locus, FJHN3, linked to chromosome 2p22.1-p21.

    PubMed

    Piret, Sian E; Danoy, Patrick; Dahan, Karin; Reed, Anita A C; Pryce, Karena; Wong, William; Torres, Rosa J; Puig, Juan G; Müller, Thomas; Kotanko, Peter; Lhotta, Karl; Devuyst, Olivier; Brown, Matthew A; Thakker, Rajesh V

    2011-01-01

    Familial juvenile hyperuricaemic (gouty) nephropathy (FJHN), is an autosomal dominant disease associated with a reduced fractional excretion of urate, and progressive renal failure. FJHN is genetically heterogeneous and due to mutations of three genes: uromodulin (UMOD), renin (REN) and hepatocyte nuclear factor-1beta (HNF-1β) on chromosomes 16p12, 1q32.1, and 17q12, respectively. However, UMOD, REN or HNF-1β mutations are found in only approximately 45% of FJHN probands, indicating the involvement of other genetic loci in approximately 55% of probands. To identify other FJHN loci, we performed a single nucleotide polymorphism (SNP)-based genome-wide linkage analysis, in six FJHN families in whom UMOD, HNF-1β and REN mutations had been excluded. Parametric linkage analysis using a 'rare dominant' model established linkage in five of the six FJHN families, with a LOD score >+3, at 0% recombination, between FJHN and SNPs at chromosome 2p22.1-p21. Analysis of individual recombinants in two unrelated affected individuals defined a approximately 5.5 Mbp interval, flanked telomerically by SNP RS372139 and centromerically by RS896986 that contained the locus, designated FJHN3. The interval contains 28 genes, and DNA sequence analysis of the most likely candidate, solute carrier family 8 member 1 (SLC8A1), did not identify any abnormalities in the FJHN3 probands. FJHN3 is likely located within a approximately 5.5 Mbp interval on chromosome 2p22.1-p21, and identifying the genetic abnormality will help to further elucidate mechanisms predisposing to gout and renal failure.

  9. Genome-wide identification of sweet orange (Citrus sinensis) histone modification gene families and their expression analysis during the fruit development and fruit-blue mold infection process.

    PubMed

    Xu, Jidi; Xu, Haidan; Liu, Yuanlong; Wang, Xia; Xu, Qiang; Deng, Xiuxin

    2015-01-01

    In eukaryotes, histone acetylation and methylation have been known to be involved in regulating diverse developmental processes and plant defense. These histone modification events are controlled by a series of histone modification gene families. To date, there is no study regarding genome-wide characterization of histone modification related genes in citrus species. Based on the two recent sequenced sweet orange genome databases, a total of 136 CsHMs (Citrus sinensis histone modification genes), including 47 CsHMTs (histone methyltransferase genes), 23 CsHDMs (histone demethylase genes), 50 CsHATs (histone acetyltransferase genes), and 16 CsHDACs (histone deacetylase genes) were identified. These genes were categorized to 11 gene families. A comprehensive analysis of these 11 gene families was performed with chromosome locations, phylogenetic comparison, gene structures, and conserved domain compositions of proteins. In order to gain an insight into the potential roles of these genes in citrus fruit development, 42 CsHMs with high mRNA abundance in fruit tissues were selected to further analyze their expression profiles at six stages of fruit development. Interestingly, a numbers of genes were expressed highly in flesh of ripening fruit and some of them showed the increasing expression levels along with the fruit development. Furthermore, we analyzed the expression patterns of all 136 CsHMs response to the infection of blue mold (Penicillium digitatum), which is the most devastating pathogen in citrus post-harvest process. The results indicated that 20 of them showed the strong alterations of their expression levels during the fruit-pathogen infection. In conclusion, this study presents a comprehensive analysis of the histone modification gene families in sweet orange and further elucidates their behaviors during the fruit development and the blue mold infection responses. PMID:26300904

  10. Genome-Wide Study of the Tomato SlMLO Gene Family and Its Functional Characterization in Response to the Powdery Mildew Fungus Oidium neolycopersici

    PubMed Central

    Zheng, Zheng; Appiano, Michela; Pavan, Stefano; Bracuto, Valentina; Ricciardi, Luigi; Visser, Richard G. F.; Wolters, Anne-Marie A.; Bai, Yuling

    2016-01-01

    The MLO (Mildew Locus O) gene family encodes plant-specific proteins containing seven transmembrane domains and likely acting in signal transduction in a calcium and calmodulin dependent manner. Some members of the MLO family are susceptibility factors toward fungi causing the powdery mildew disease. In tomato, for example, the loss-of-function of the MLO gene SlMLO1 leads to a particular form of powdery mildew resistance, called ol-2, which arrests almost completely fungal penetration. This type of penetration resistance is characterized by the apposition of papillae at the sites of plant-pathogen interaction. Other MLO homologs in Arabidopsis regulate root response to mechanical stimuli (AtMLO4 and AtMLO11) and pollen tube reception by the female gametophyte (AtMLO7). However, the role of most MLO genes remains unknown. In this work, we provide a genome-wide study of the tomato SlMLO gene family. Besides SlMLO1, other 15 SlMLO homologs were identified and characterized with respect to their structure, genomic organization, phylogenetic relationship, and expression profile. In addition, by analysis of transgenic plants, we demonstrated that simultaneous silencing of SlMLO1 and two of its closely related homologs, SlMLO5 and SlMLO8, confer higher level of resistance than the one associated with the ol-2 mutation. The outcome of this study provides evidence for functional redundancy among tomato homolog genes involved in powdery mildew susceptibility. Moreover, we developed a series of transgenic lines silenced for individual SlMLO homologs, which lay the foundation for further investigations aimed at assigning new biological functions to the MLO gene family. PMID:27579028

  11. Genome-wide analysis and identification of stress-responsive genes of the NAM-ATAF1,2-CUC2 transcription factor family in apple.

    PubMed

    Su, Hongyan; Zhang, Shizhong; Yuan, Xiaowei; Chen, Changtian; Wang, Xiao-Fei; Hao, Yu-Jin

    2013-10-01

    NAC (NAM, ATAF1,2, and CUC2) proteins constitute one of the largest families of plant-specific transcription factors. To date, little is known about the NAC genes in the apple (Malus domestica). In this study, a total of 180 NAC genes were identified in the apple genome and were phylogenetically clustered into six groups (I-VI) with the NAC genes from Arabidopsis and rice. The predicted apple NAC genes were distributed across all of 17 chromosomes at various densities. Additionally, the gene structure and motif compositions of the apple NAC genes were analyzed. Moreover, the expression of 29 selected apple NAC genes was analyzed in different tissues and under different abiotic stress conditions. All of the selected genes, with the exception of four genes, were expressed in at least one of the tissues tested, which indicates that the NAC genes are involved in various aspects of the physiological and developmental processes of the apple. Encouragingly, 17 of the selected genes were found to respond to one or more of the abiotic stress treatments, and these 17 genes included not only the expected 7 genes that were clustered with the well-known stress-related marker genes in group IV but also 10 genes located in other subgroups, none of which contains members that have been reported to be stress-related. To the best of our knowledge, this report describes the first genome-wide analysis of the apple NAC gene family, and the results should provide valuable information for understanding the classification and putative functions of this family.

  12. Calcium-dependent protein kinase (CDPK) and CDPK-related kinase (CRK) gene families in tomato: genome-wide identification and functional analyses in disease resistance.

    PubMed

    Wang, Ji-Peng; Xu, You-Ping; Munyampundu, Jean-Pierre; Liu, Tian-Yu; Cai, Xin-Zhong

    2016-04-01

    Calcium-dependent protein kinases (CDPKs) and CDPK-related kinases (CRKs) play multiple roles in plant. Nevertheless, genome-wide identification of these two families is limited to several plant species, and role of CRKs in disease resistance remains unclear. In this study, we identified the CDPK and CRK gene families in genome of the economically important crop tomato (Solanum lycopersicum L.) and analyzed their function in resistance to various pathogens. Twenty-nine CDPK and six CRK genes were identified in tomato genome. Both SlCDPK and SlCRK proteins harbored an STKc_CAMK type protein kinase domain, while only SlCDPKs contained EF-hand type Ca(2+) binding domain(s). Phylogenetic analysis revealed that plant CRK family diverged early from CDPKs, and shared a common ancestor gene with subgroup IV CDPKs. Subgroup IV SlCDPK proteins were basic and their genes contained 11 introns, which were distinguished from other subgroups but similar to CRKs. Subgroup I SlCDPKs generally did not carry an N-terminal myristoylation motif while those of the remaining subgroups and SlCRKs universally did. SlCDPK and SlCRK genes were differently responsive to pathogenic stimuli. Furthermore, silencing analyses demonstrated that SlCDPK18 and SlCDPK10 positively regulated nonhost resistance to Xanthomonas oryzae pv. oryzae and host resistance to Pseudomonas syringae pv. tomato (Pst) DC3000, respectively, while SlCRK6 positively regulated resistance to both Pst DC3000 and Sclerotinia sclerotiorum in tomato. In conclusion, CRKs apparently evolved from CDPK lineage, SlCDPK and SlCRK genes regulate a wide range of resistance and SlCRK6 is the first CRK gene proved to function in plant disease resistance. PMID:26520101

  13. Genome-Wide Analysis of the Expression of WRKY Family Genes in Different Developmental Stages of Wild Strawberry (Fragaria vesca) Fruit.

    PubMed

    Zhou, Heying; Li, Yuxuan; Zhang, Qing; Ren, Suyue; Shen, Yuanyue; Qin, Ling; Xing, Yu

    2016-01-01

    WRKY proteins play important regulatory roles in plant developmental processes such as senescence, trichome initiation and embryo morphogenesis. In strawberry, only FaWRKY1 (Fragaria × ananassa) has been characterized, leaving numerous WRKY genes to be identified and their function characterized. The publication of the draft genome sequence of the strawberry genome allowed us to conduct a genome-wide search for WRKY proteins in Fragaria vesca, and to compare the identified proteins with their homologs in model plants. Fifty-nine FvWRKY genes were identified and annotated from the F. vesca genome. Detailed analysis, including gene classification, annotation, phylogenetic evaluation, conserved motif determination and expression profiling, based on RNA-seq data, were performed on all members of the family. Additionally, the expression patterns of the WRKY genes in different fruit developmental stages were further investigated using qRT-PCR, to provide a foundation for further comparative genomics and functional studies of this important class of transcriptional regulators in strawberry. PMID:27138272

  14. Identification of ZNF366 and PTPRD as novel determinants of plasma homocysteine in a family-based genome-wide association study.

    PubMed

    Mälarstig, Anders; Buil, Alfonso; Souto, Juan Carolos; Clarke, Robert; Blanco-Vaca, Francisco; Fontcuberta, Jordi; Peden, John; Andersen, Malin; Silveira, Angela; Barlera, Simona; Seedorf, Udo; Watkins, Hugh; Almasy, Laura; Hamsten, Anders; Soria, José Manuel

    2009-08-13

    Total plasma homocysteine concentration (tHcy) is a biomarker for atherothrombotic disease, but causality remains uncertain. Polymorphisms in the genes involved in methionine metabolism explain only a small fraction of the heritability of tHcy levels. In a genome-wide association study, we examined the genetic determinants of tHcy using a 2-stage design. First, 283 437 single nucleotide polymorphisms (SNPs) were tested for association with tHcy in 387 persons recruited from 21 large Spanish families. Of those, 17 SNPs showed equal or stronger association with tHcy level compared with the MTHFR 677C>T SNP (beta = 0.10, P = .0001). Second, a replication analysis of these 17 SNPs was performed in patients with premature myocardial infarction (n = 1238). Novel associations were found for SNPs near the ZNF366 gene (lead SNP rs7445013; discovery stage: adjusted beta = -0.12, P = 5.30 x 10(-6), replication stage: adjusted beta = -0.13, P = .004) and the PTPRD gene (lead SNP rs973117; discovery stage: adjusted beta = 0.11, P = 5.5 x 10(-6), replication stage: adjusted beta = 0.10, P = .005). These associations were independent of known confounders, including creatinine clearance and plasma fibrinogen concentration. Our findings implicate novel pathways in homocysteine metabolism, and highlight the need for investigation of the associated genes in the etiology of vascular diseases.

  15. Genome-Wide Analysis of the Expression of WRKY Family Genes in Different Developmental Stages of Wild Strawberry (Fragaria vesca) Fruit

    PubMed Central

    Zhang, Qing; Ren, Suyue; Shen, Yuanyue; Qin, Ling; Xing, Yu

    2016-01-01

    WRKY proteins play important regulatory roles in plant developmental processes such as senescence, trichome initiation and embryo morphogenesis. In strawberry, only FaWRKY1 (Fragaria × ananassa) has been characterized, leaving numerous WRKY genes to be identified and their function characterized. The publication of the draft genome sequence of the strawberry genome allowed us to conduct a genome-wide search for WRKY proteins in Fragaria vesca, and to compare the identified proteins with their homologs in model plants. Fifty-nine FvWRKY genes were identified and annotated from the F. vesca genome. Detailed analysis, including gene classification, annotation, phylogenetic evaluation, conserved motif determination and expression profiling, based on RNA-seq data, were performed on all members of the family. Additionally, the expression patterns of the WRKY genes in different fruit developmental stages were further investigated using qRT-PCR, to provide a foundation for further comparative genomics and functional studies of this important class of transcriptional regulators in strawberry. PMID:27138272

  16. Genome-Wide Identification and Analysis of the SBP-Box Family Genes under Phytophthora capsici Stress in Pepper (Capsicum annuum L.)

    PubMed Central

    Zhang, Huai-Xia; Jin, Jing-Hao; He, Yu-Mei; Lu, Bo-Ya; Li, Da-Wei; Chai, Wei-Guo; Khan, Abid; Gong, Zhen-Hui

    2016-01-01

    SQUAMOSA promoter binding protein (SBP)-box genes encode plant-specific transcription factors that are extensively involved in many physiological and biochemical processes, including growth, development, and signal transduction. However, pepper (Capsicum annuum L.) SBP-box family genes have not been well characterized. We investigated SBP-box family genes in the pepper genome and characterized these genes across both compatible and incompatible strain of Phytophthora capsici, and also under different hormone treatments. The results indicated that total 15 members were identified and distributed on seven chromosomes of pepper. Phylogenetic analysis showed that SBP-box genes of pepper can be classified into six groups. In addition, duplication analysis within pepper genome, as well as between pepper and Arabidopsis genomes demonstrated that there are four pairs of homology of SBP-box genes in the pepper genome and 10 pairs between pepper and Arabidopsis genomes. Tissue-specific expression analysis of the CaSBP genes demonstrated their diverse spatiotemporal expression patterns. The expression profiles were similarly analyzed following exposure to P. capsici inoculation and hormone treatments. It was shown that nine of the CaSBP genes (CaSBP01, 02, 03, 04, 05, 06, 11, 12, and 13) exhibited a dramatic up-regulation after compatible HX-9 strain (P. capsici) inoculation, while CaSBP09 and CaSBP15 were down-regulated. In case of PC strain (P. capsici) infection six of the CaSBP genes (CaSBP02, 05, 06, 11, 12, and 13) were arose while CaSBP14 was down regulated. Furthermore, Salicylic acid, Methyl jasmonate and their biosynthesis inhibitors treatment indicated that some of the CaSBP genes are potentially involved in these hormone regulation pathways. This genome-wide identification, as well as characterization of evolutionary relationships and expression profiles of the pepper CaSBP genes, will help to improve pepper stress tolerance in the future. PMID:27148327

  17. Genome-Wide Identification of Jatropha curcas Aquaporin Genes and the Comparative Analysis Provides Insights into the Gene Family Expansion and Evolution in Hevea brasiliensis.

    PubMed

    Zou, Zhi; Yang, Lifu; Gong, Jun; Mo, Yeyong; Wang, Jikun; Cao, Jianhua; An, Feng; Xie, Guishui

    2016-01-01

    Aquaporins (AQPs) are channel-forming integral membrane proteins that transport water and other small solutes across biological membranes. Despite the vital role of AQPs, to date, little is known in physic nut (Jatropha curcas L., Euphorbiaceae), an important non-edible oilseed crop with great potential for the production of biodiesel. In this study, 32 AQP genes were identified from the physic nut genome and the family number is relatively small in comparison to 51 in another Euphorbiaceae plant, rubber tree (Hevea brasiliensis Muell. Arg.). Based on the phylogenetic analysis, the JcAQPs were assigned to five subfamilies, i.e., nine plasma membrane intrinsic proteins (PIPs), nine tonoplast intrinsic proteins (TIPs), eight NOD26-like intrinsic proteins (NIPs), two X intrinsic proteins (XIPs), and four small basic intrinsic proteins (SIPs). Like rubber tree and other plant species, functional prediction based on the aromatic/arginine selectivity filter, Froger's positions, and specificity-determining positions showed a remarkable difference in substrate specificity among subfamilies of JcAQPs. Genome-wide comparative analysis revealed the specific expansion of PIP and TIP subfamilies in rubber tree and the specific gene loss of the XIP subfamily in physic nut. Furthermore, by analyzing deep transcriptome sequencing data, the expression evolution especially the expression divergence of duplicated HbAQP genes was also investigated and discussed. Results obtained from this study not only provide valuable information for future functional analysis and utilization of Jc/HbAQP genes, but also provide a useful reference to survey the gene family expansion and evolution in Euphorbiaceae plants and other plant species. PMID:27066041

  18. Genome-Wide Identification and Analysis of the SBP-Box Family Genes under Phytophthora capsici Stress in Pepper (Capsicum annuum L.).

    PubMed

    Zhang, Huai-Xia; Jin, Jing-Hao; He, Yu-Mei; Lu, Bo-Ya; Li, Da-Wei; Chai, Wei-Guo; Khan, Abid; Gong, Zhen-Hui

    2016-01-01

    SQUAMOSA promoter binding protein (SBP)-box genes encode plant-specific transcription factors that are extensively involved in many physiological and biochemical processes, including growth, development, and signal transduction. However, pepper (Capsicum annuum L.) SBP-box family genes have not been well characterized. We investigated SBP-box family genes in the pepper genome and characterized these genes across both compatible and incompatible strain of Phytophthora capsici, and also under different hormone treatments. The results indicated that total 15 members were identified and distributed on seven chromosomes of pepper. Phylogenetic analysis showed that SBP-box genes of pepper can be classified into six groups. In addition, duplication analysis within pepper genome, as well as between pepper and Arabidopsis genomes demonstrated that there are four pairs of homology of SBP-box genes in the pepper genome and 10 pairs between pepper and Arabidopsis genomes. Tissue-specific expression analysis of the CaSBP genes demonstrated their diverse spatiotemporal expression patterns. The expression profiles were similarly analyzed following exposure to P. capsici inoculation and hormone treatments. It was shown that nine of the CaSBP genes (CaSBP01, 02, 03, 04, 05, 06, 11, 12, and 13) exhibited a dramatic up-regulation after compatible HX-9 strain (P. capsici) inoculation, while CaSBP09 and CaSBP15 were down-regulated. In case of PC strain (P. capsici) infection six of the CaSBP genes (CaSBP02, 05, 06, 11, 12, and 13) were arose while CaSBP14 was down regulated. Furthermore, Salicylic acid, Methyl jasmonate and their biosynthesis inhibitors treatment indicated that some of the CaSBP genes are potentially involved in these hormone regulation pathways. This genome-wide identification, as well as characterization of evolutionary relationships and expression profiles of the pepper CaSBP genes, will help to improve pepper stress tolerance in the future.

  19. Genome-Wide Identification of Jatropha curcas Aquaporin Genes and the Comparative Analysis Provides Insights into the Gene Family Expansion and Evolution in Hevea brasiliensis

    PubMed Central

    Zou, Zhi; Yang, Lifu; Gong, Jun; Mo, Yeyong; Wang, Jikun; Cao, Jianhua; An, Feng; Xie, Guishui

    2016-01-01

    Aquaporins (AQPs) are channel-forming integral membrane proteins that transport water and other small solutes across biological membranes. Despite the vital role of AQPs, to date, little is known in physic nut (Jatropha curcas L., Euphorbiaceae), an important non-edible oilseed crop with great potential for the production of biodiesel. In this study, 32 AQP genes were identified from the physic nut genome and the family number is relatively small in comparison to 51 in another Euphorbiaceae plant, rubber tree (Hevea brasiliensis Muell. Arg.). Based on the phylogenetic analysis, the JcAQPs were assigned to five subfamilies, i.e., nine plasma membrane intrinsic proteins (PIPs), nine tonoplast intrinsic proteins (TIPs), eight NOD26-like intrinsic proteins (NIPs), two X intrinsic proteins (XIPs), and four small basic intrinsic proteins (SIPs). Like rubber tree and other plant species, functional prediction based on the aromatic/arginine selectivity filter, Froger's positions, and specificity-determining positions showed a remarkable difference in substrate specificity among subfamilies of JcAQPs. Genome-wide comparative analysis revealed the specific expansion of PIP and TIP subfamilies in rubber tree and the specific gene loss of the XIP subfamily in physic nut. Furthermore, by analyzing deep transcriptome sequencing data, the expression evolution especially the expression divergence of duplicated HbAQP genes was also investigated and discussed. Results obtained from this study not only provide valuable information for future functional analysis and utilization of Jc/HbAQP genes, but also provide a useful reference to survey the gene family expansion and evolution in Euphorbiaceae plants and other plant species. PMID:27066041

  20. Obsessive–compulsive symptoms in a large population-based twin-family sample are predicted by clinically based polygenic scores and by genome-wide SNPs

    PubMed Central

    den Braber, A; Zilhão, N R; Fedko, I O; Hottenga, J-J; Pool, R; Smit, D J A; Cath, D C; Boomsma, D I

    2016-01-01

    Variation in obsessive–compulsive symptoms (OCS) has a heritable basis, with genetic association studies starting to yield the first suggestive findings. We contribute to insights into the genetic basis of OCS by performing an extensive series of genetic analyses in a homogeneous, population-based sample from the Netherlands. First, phenotypic and genetic longitudinal correlations over a 6-year period were estimated by modeling OCS data from twins and siblings. Second, polygenic risk scores (PRS) for 6931 subjects with genotype and OCS data were calculated based on meta-analysis results from IOCDF-GC, to investigate their predictive value. Third, the contribution of measured single nucleotide polymorphisms (SNPs) to the heritability was estimated using random-effects modeling. Last, we performed an exploratory genome-wide association study (GWAS) of OCS, testing for SNP- and for gene-based associations. Stability in OCS (test–retest correlation 0.63) was mainly explained by genetic stability. The PRS based on clinical samples predicted OCS in our population-based twin-family sample. SNP-based heritability was estimated at 14%. GWAS revealed one SNP (rs8100480), located within the MEF2BNB gene, associated with OCS (P=2.56 × 10−8). Additional gene-based testing resulted in four significantly associated genes, which are located in the same chromosomal region on chromosome 19p13.11: MEF2BNB, RFXANK, MEF2BNB-MEF2B and MEF2B. Thus, common genetic variants explained a significant proportion of OCS trait variation. Genes significantly associated with OCS are expressed in the brain and involved in development and control of immune system functions (RFXANK) and regulation of gene expression of muscle-specific genes (MEF2BNB). MEF2BNB also showed a suggestive association with OCD in an independent case–control study, suggesting a role for this gene in the development of OCS. PMID:26859814

  1. Obsessive-compulsive symptoms in a large population-based twin-family sample are predicted by clinically based polygenic scores and by genome-wide SNPs.

    PubMed

    den Braber, A; Zilhão, N R; Fedko, I O; Hottenga, J-J; Pool, R; Smit, D J A; Cath, D C; Boomsma, D I

    2016-01-01

    Variation in obsessive-compulsive symptoms (OCS) has a heritable basis, with genetic association studies starting to yield the first suggestive findings. We contribute to insights into the genetic basis of OCS by performing an extensive series of genetic analyses in a homogeneous, population-based sample from the Netherlands. First, phenotypic and genetic longitudinal correlations over a 6-year period were estimated by modeling OCS data from twins and siblings. Second, polygenic risk scores (PRS) for 6931 subjects with genotype and OCS data were calculated based on meta-analysis results from IOCDF-GC, to investigate their predictive value. Third, the contribution of measured single nucleotide polymorphisms (SNPs) to the heritability was estimated using random-effects modeling. Last, we performed an exploratory genome-wide association study (GWAS) of OCS, testing for SNP- and for gene-based associations. Stability in OCS (test-retest correlation 0.63) was mainly explained by genetic stability. The PRS based on clinical samples predicted OCS in our population-based twin-family sample. SNP-based heritability was estimated at 14%. GWAS revealed one SNP (rs8100480), located within the MEF2BNB gene, associated with OCS (P=2.56 × 10(-8)). Additional gene-based testing resulted in four significantly associated genes, which are located in the same chromosomal region on chromosome 19p13.11: MEF2BNB, RFXANK, MEF2BNB-MEF2B and MEF2B. Thus, common genetic variants explained a significant proportion of OCS trait variation. Genes significantly associated with OCS are expressed in the brain and involved in development and control of immune system functions (RFXANK) and regulation of gene expression of muscle-specific genes (MEF2BNB). MEF2BNB also showed a suggestive association with OCD in an independent case-control study, suggesting a role for this gene in the development of OCS. PMID:26859814

  2. Principal transcriptional regulation and genome-wide system interactions of the Asp-family and aromatic amino acid networks of amino acid metabolism in plants.

    PubMed

    Less, Hadar; Angelovici, Ruthie; Tzin, Vered; Galili, Gad

    2010-10-01

    Amino acid metabolism is among the most important and best recognized networks within biological systems. In plants, amino acids serve multiple functions associated with growth. Besides their function in protein synthesis, the amino acids are also catabolized into energy-associated metabolites as well we into numerous secondary metabolites, which are essential for plant growth and response to various stresses. Despite the central importance of amino acids in plants growth, elucidation of the regulation of amino acid metabolism within the context of the entire system, particularly transcriptional regulation, is still in its infancy. The different amino acids are synthesized by a number of distinct metabolic networks, which are expected to possess regulatory cross interactions between them for proper coordination of their interactive functions, such as incorporation into proteins. Yet, individual amino acid metabolic networks are also expected to differentially cross interact with various genome-wide gene expression programs and metabolic networks, in respect to their functions as precursors for various metabolites with distinct functions. In the present review, we discuss our recent genomics, metabolic and bioinformatics studies, which were aimed at addressing these questions, focusing mainly on the Asp-family metabolic network as the main example and also comparing it to the aromatic amino acids metabolic network as a second example (Angelovici et al. in Plant Physiol 151:2058-2072, 2009; Less and Galili in BMC Syst Biol 3:14, 2009; Tzin et al. in Plant J 60:156-167, 2009). Our focus on these two networks is because of the followings: (i) both networks are central to plant metabolism and growth and are also precursors for a wide range of primary and secondary metabolites that are indispensable to plant growth; (ii) the amino acids produced by these two networks are also essential to the nutrition and health of human and farm animals; and (iii) both networks contain

  3. Genome-wide transcriptome analysis of two contrasting Brassica rapa doubled haploid lines under cold-stresses using Br135K oligomeric chip.

    PubMed

    Jung, Hee-Jeong; Dong, Xiangshu; Park, Jong-In; Thamilarasan, Senthil Kumar; Lee, Sang Sook; Kim, Yeon-Ki; Lim, Yong-Pyo; Nou, Ill-Sup; Hur, Yoonkang

    2014-01-01

    Genome wide transcription analysis in response to stresses is important to provide a basis of effective engineering strategies to improve stress tolerance in crop plants. We assembled a Brassica rapa oligomeric microarray (Br135K microarray) using sequence information from 41,173 unigenes and analyzed the transcription profiles of two contrasting doubled haploid (DH) lines, Chiifu and Kenshin, under cold-treatments. The two DH lines showed great differences in electrolyte leakage below -4°C, but similar patterns from 4°C to -2°C. Cold-treatments induced 885 and 858 genes in Chiifu and Kenshin, respectively. Overall, 134, and 56 genes showed an intrinsic difference in expression in Chiifu and Kenshin, respectively. Among 5,349 genes that showed no hit found (NHF) in public databases, 61 and 24 were specifically expressed in Chiifu and Kenshin, respectively. Many transcription factor genes (TFs) also showed various characteristics of expression. BrMYB12, BrMYBL2, BrbHLHs, BrbHLH038, a C2H2, a WRKY, BrDREB19 and a integrase-type TF were induced in a Chiifu-specific fashion, while a bHLH (Bra001826/AT3G21330), bHLH, cycling Dof factor and two Dof type TFs were Kenshin specific. Similar to previous studies, a large number of genes were differently induced or regulated among the two genotypes, but many genes, including NHFs, were specifically or intrinsically expressed with genotype specificity. Expression patterns of known-cold responsive genes in plants resulted in discrepancy to membrane leakage in the two DH lines, indicating that timing of gene expression is more important to conferring freezing tolerance rather than expression levels. Otherwise, the tolerance will be related to the levels of transcripts before cold-treatment or regulated by other mechanisms. Overall, these results indicate common signaling pathways and various transcriptional regulatory mechanisms are working together during cold-treatment of B. rapa. Our newly developed Br135K oligomeric

  4. Genome-Wide Transcriptome Analysis of Two Contrasting Brassica rapa Doubled Haploid Lines under Cold-Stresses Using Br135K Oligomeric Chip

    PubMed Central

    Park, Jong-In; Thamilarasan, Senthil Kumar; Lee, Sang Sook; Kim, Yeon-Ki; Lim, Yong-Pyo; Nou, Ill-Sup; Hur, Yoonkang

    2014-01-01

    Genome wide transcription analysis in response to stresses is important to provide a basis of effective engineering strategies to improve stress tolerance in crop plants. We assembled a Brassica rapa oligomeric microarray (Br135K microarray) using sequence information from 41,173 unigenes and analyzed the transcription profiles of two contrasting doubled haploid (DH) lines, Chiifu and Kenshin, under cold-treatments. The two DH lines showed great differences in electrolyte leakage below −4°C, but similar patterns from 4°C to −2°C. Cold-treatments induced 885 and 858 genes in Chiifu and Kenshin, respectively. Overall, 134, and 56 genes showed an intrinsic difference in expression in Chiifu and Kenshin, respectively. Among 5,349 genes that showed no hit found (NHF) in public databases, 61 and 24 were specifically expressed in Chiifu and Kenshin, respectively. Many transcription factor genes (TFs) also showed various characteristics of expression. BrMYB12, BrMYBL2, BrbHLHs, BrbHLH038, a C2H2, a WRKY, BrDREB19 and a integrase-type TF were induced in a Chiifu-specific fashion, while a bHLH (Bra001826/AT3G21330), bHLH, cycling Dof factor and two Dof type TFs were Kenshin specific. Similar to previous studies, a large number of genes were differently induced or regulated among the two genotypes, but many genes, including NHFs, were specifically or intrinsically expressed with genotype specificity. Expression patterns of known-cold responsive genes in plants resulted in discrepancy to membrane leakage in the two DH lines, indicating that timing of gene expression is more important to conferring freezing tolerance rather than expression levels. Otherwise, the tolerance will be related to the levels of transcripts before cold-treatment or regulated by other mechanisms. Overall, these results indicate common signaling pathways and various transcriptional regulatory mechanisms are working together during cold-treatment of B. rapa. Our newly developed Br135K oligomeric

  5. Genome-Wide Analysis of Oleosin Gene Family in 22 Tree Species: An Accelerator for Metabolic Engineering of BioFuel Crops and Agrigenomics Industrial Applications?

    PubMed

    Cao, Heping

    2015-09-01

    Trees contribute to enormous plant oil reserves because many trees contain 50%-80% of oil (triacylglycerols, TAGs) in the fruits and kernels. TAGs accumulate in subcellular structures called oil bodies/droplets, in which TAGs are covered by low-molecular-mass hydrophobic proteins called oleosins (OLEs). The OLEs/TAGs ratio determines the size and shape of intracellular oil bodies. There is a lack of comprehensive sequence analysis and structural information of OLEs among diverse trees. The objectives of this study were to identify OLEs from 22 tree species (e.g., tung tree, tea-oil tree, castor bean), perform genome-wide analysis of OLEs, classify OLEs, identify conserved sequence motifs and amino acid residues, and predict secondary and three-dimensional structures in tree OLEs and OLE subfamilies. Data mining identified 65 OLEs with perfect conservation of the "proline knot" motif (PX5SPX3P) from 19 trees. These OLEs contained >40% hydrophobic amino acid residues. They displayed similar properties and amino acid composition. Genome-wide phylogenetic analysis and multiple sequence alignment demonstrated that these proteins could be classified into five OLE subfamilies. There were distinct patterns of sequence conservation among the OLE subfamilies and within individual tree species. Computational modeling indicated that OLEs were composed of at least three α-helixes connected with short coils without any β-strand and that they exhibited distinct 3D structures and ligand binding sites. These analyses provide fundamental information in the similarity and specificity of diverse OLE isoforms within the same subfamily and among the different species, which should facilitate studying the structure-function relationship and identify critical amino acid residues in OLEs for metabolic engineering of tree TAGs.

  6. Genome-Wide Analysis of Oleosin Gene Family in 22 Tree Species: An Accelerator for Metabolic Engineering of BioFuel Crops and Agrigenomics Industrial Applications?

    PubMed

    Cao, Heping

    2015-09-01

    Trees contribute to enormous plant oil reserves because many trees contain 50%-80% of oil (triacylglycerols, TAGs) in the fruits and kernels. TAGs accumulate in subcellular structures called oil bodies/droplets, in which TAGs are covered by low-molecular-mass hydrophobic proteins called oleosins (OLEs). The OLEs/TAGs ratio determines the size and shape of intracellular oil bodies. There is a lack of comprehensive sequence analysis and structural information of OLEs among diverse trees. The objectives of this study were to identify OLEs from 22 tree species (e.g., tung tree, tea-oil tree, castor bean), perform genome-wide analysis of OLEs, classify OLEs, identify conserved sequence motifs and amino acid residues, and predict secondary and three-dimensional structures in tree OLEs and OLE subfamilies. Data mining identified 65 OLEs with perfect conservation of the "proline knot" motif (PX5SPX3P) from 19 trees. These OLEs contained >40% hydrophobic amino acid residues. They displayed similar properties and amino acid composition. Genome-wide phylogenetic analysis and multiple sequence alignment demonstrated that these proteins could be classified into five OLE subfamilies. There were distinct patterns of sequence conservation among the OLE subfamilies and within individual tree species. Computational modeling indicated that OLEs were composed of at least three α-helixes connected with short coils without any β-strand and that they exhibited distinct 3D structures and ligand binding sites. These analyses provide fundamental information in the similarity and specificity of diverse OLE isoforms within the same subfamily and among the different species, which should facilitate studying the structure-function relationship and identify critical amino acid residues in OLEs for metabolic engineering of tree TAGs. PMID:26258573

  7. Genome-Wide Analysis of Oleosin Gene Family in 22 Tree Species: An Accelerator for Metabolic Engineering of BioFuel Crops and Agrigenomics Industrial Applications?

    PubMed Central

    2015-01-01

    Abstract Trees contribute to enormous plant oil reserves because many trees contain 50%–80% of oil (triacylglycerols, TAGs) in the fruits and kernels. TAGs accumulate in subcellular structures called oil bodies/droplets, in which TAGs are covered by low-molecular-mass hydrophobic proteins called oleosins (OLEs). The OLEs/TAGs ratio determines the size and shape of intracellular oil bodies. There is a lack of comprehensive sequence analysis and structural information of OLEs among diverse trees. The objectives of this study were to identify OLEs from 22 tree species (e.g., tung tree, tea-oil tree, castor bean), perform genome-wide analysis of OLEs, classify OLEs, identify conserved sequence motifs and amino acid residues, and predict secondary and three-dimensional structures in tree OLEs and OLE subfamilies. Data mining identified 65 OLEs with perfect conservation of the “proline knot” motif (PX5SPX3P) from 19 trees. These OLEs contained >40% hydrophobic amino acid residues. They displayed similar properties and amino acid composition. Genome-wide phylogenetic analysis and multiple sequence alignment demonstrated that these proteins could be classified into five OLE subfamilies. There were distinct patterns of sequence conservation among the OLE subfamilies and within individual tree species. Computational modeling indicated that OLEs were composed of at least three α-helixes connected with short coils without any β-strand and that they exhibited distinct 3D structures and ligand binding sites. These analyses provide fundamental information in the similarity and specificity of diverse OLE isoforms within the same subfamily and among the different species, which should facilitate studying the structure-function relationship and identify critical amino acid residues in OLEs for metabolic engineering of tree TAGs. PMID:26258573

  8. Genome-Wide Identification and Analysis of the VQ Motif-Containing Protein Family in Chinese Cabbage (Brassica rapa L. ssp. Pekinensis)

    PubMed Central

    Zhang, Gaoyuan; Wang, Fengde; Li, Jingjuan; Ding, Qian; Zhang, Yihui; Li, Huayin; Zhang, Jiannong; Gao, Jianwei

    2015-01-01

    Previous studies have showed that the VQ motif–containing proteins in Arabidopsis thaliana and Oryza sativa play an important role in plant growth, development, and stress responses. However, little is known about the functions of the VQ genes in Brassica rapa (Chinese cabbage). In this study, we performed genome-wide identification, characterization, and expression analysis of the VQ genes in Chinese cabbage, especially under adverse environment. We identified 57 VQ genes and classified them into seven subgroups (I–VII), which were dispersedly distributed on chromosomes 1 to 10. The expansion of these genes mainly contributed to segmental and tandem duplication. Fifty-four VQ genes contained no introns and 50 VQ proteins were less than 300 amino acids in length. Quantitative real-time PCR showed that the VQ genes were differentially expressed in various tissues and during different abiotic stresses and plant hormone treatments. This study provides a comprehensive overview of Chinese cabbage VQ genes and will benefit the molecular breeding for resistance to stresses and disease, as well as further studies on the biological functions of the VQ proteins. PMID:26633387

  9. Genome-Wide Identification and Analysis of the VQ Motif-Containing Protein Family in Chinese Cabbage (Brassica rapa L. ssp. Pekinensis).

    PubMed

    Zhang, Gaoyuan; Wang, Fengde; Li, Jingjuan; Ding, Qian; Zhang, Yihui; Li, Huayin; Zhang, Jiannong; Gao, Jianwei

    2015-12-02

    Previous studies have showed that the VQ motif-containing proteins in Arabidopsis thaliana and Oryza sativa play an important role in plant growth, development, and stress responses. However, little is known about the functions of the VQ genes in Brassica rapa (Chinese cabbage). In this study, we performed genome-wide identification, characterization, and expression analysis of the VQ genes in Chinese cabbage, especially under adverse environment. We identified 57 VQ genes and classified them into seven subgroups (I-VII), which were dispersedly distributed on chromosomes 1 to 10. The expansion of these genes mainly contributed to segmental and tandem duplication. Fifty-four VQ genes contained no introns and 50 VQ proteins were less than 300 amino acids in length. Quantitative real-time PCR showed that the VQ genes were differentially expressed in various tissues and during different abiotic stresses and plant hormone treatments. This study provides a comprehensive overview of Chinese cabbage VQ genes and will benefit the molecular breeding for resistance to stresses and disease, as well as further studies on the biological functions of the VQ proteins.

  10. Genome-Wide Identification and Analysis of the VQ Motif-Containing Protein Family in Chinese Cabbage (Brassica rapa L. ssp. Pekinensis).

    PubMed

    Zhang, Gaoyuan; Wang, Fengde; Li, Jingjuan; Ding, Qian; Zhang, Yihui; Li, Huayin; Zhang, Jiannong; Gao, Jianwei

    2015-01-01

    Previous studies have showed that the VQ motif-containing proteins in Arabidopsis thaliana and Oryza sativa play an important role in plant growth, development, and stress responses. However, little is known about the functions of the VQ genes in Brassica rapa (Chinese cabbage). In this study, we performed genome-wide identification, characterization, and expression analysis of the VQ genes in Chinese cabbage, especially under adverse environment. We identified 57 VQ genes and classified them into seven subgroups (I-VII), which were dispersedly distributed on chromosomes 1 to 10. The expansion of these genes mainly contributed to segmental and tandem duplication. Fifty-four VQ genes contained no introns and 50 VQ proteins were less than 300 amino acids in length. Quantitative real-time PCR showed that the VQ genes were differentially expressed in various tissues and during different abiotic stresses and plant hormone treatments. This study provides a comprehensive overview of Chinese cabbage VQ genes and will benefit the molecular breeding for resistance to stresses and disease, as well as further studies on the biological functions of the VQ proteins. PMID:26633387

  11. Genome-Wide Copy Number Variation Analysis in Extended Families and Unrelated Individuals Characterized for Musical Aptitude and Creativity in Music

    PubMed Central

    Oikkonen, Jaana; Buck, Gemma; Blancher, Christine; Raijas, Pirre; Karma, Kai; Lähdesmäki, Harri; Järvelä, Irma

    2013-01-01

    Music perception and practice represent complex cognitive functions of the human brain. Recently, evidence for the molecular genetic background of music related phenotypes has been obtained. In order to further elucidate the molecular background of musical phenotypes we analyzed genome wide copy number variations (CNVs) in five extended pedigrees and in 172 unrelated subjects characterized for musical aptitude and creative functions in music. Musical aptitude was defined by combination of the scores of three music tests (COMB scores): auditory structuring ability, Seashores test for pitch and for time. Data on creativity in music (herein composing, improvising and/or arranging music) was surveyed using a web-based questionnaire. Several CNVRs containing genes that affect neurodevelopment, learning and memory were detected. A deletion at 5q31.1 covering the protocadherin-α gene cluster (Pcdha 1-9) was found co-segregating with low music test scores (COMB) in both sample sets. Pcdha is involved in neural migration, differentiation and synaptogenesis. Creativity in music was found to co-segregate with a duplication covering glucose mutarotase gene (GALM) at 2p22. GALM has influence on serotonin release and membrane trafficking of the human serotonin transporter. Interestingly, genes related to serotonergic systems have been shown to associate not only with psychiatric disorders but also with creativity and music perception. Both, Pcdha and GALM, are related to the serotonergic systems influencing cognitive and motor functions, important for music perception and practice. Finally, a 1.3 Mb duplication was identified in a subject with low COMB scores in the region previously linked with absolute pitch (AP) at 8q24. No differences in the CNV burden was detected among the high/low music test scores or creative/non-creative groups. In summary, CNVs and genes found in this study are related to cognitive functions. Our result suggests new candidate genes for music

  12. Genome-wide copy number variation analysis in extended families and unrelated individuals characterized for musical aptitude and creativity in music.

    PubMed

    Ukkola-Vuoti, Liisa; Kanduri, Chakravarthi; Oikkonen, Jaana; Buck, Gemma; Blancher, Christine; Raijas, Pirre; Karma, Kai; Lähdesmäki, Harri; Järvelä, Irma

    2013-01-01

    Music perception and practice represent complex cognitive functions of the human brain. Recently, evidence for the molecular genetic background of music related phenotypes has been obtained. In order to further elucidate the molecular background of musical phenotypes we analyzed genome wide copy number variations (CNVs) in five extended pedigrees and in 172 unrelated subjects characterized for musical aptitude and creative functions in music. Musical aptitude was defined by combination of the scores of three music tests (COMB scores): auditory structuring ability, Seashores test for pitch and for time. Data on creativity in music (herein composing, improvising and/or arranging music) was surveyed using a web-based questionnaire.Several CNVRs containing genes that affect neurodevelopment, learning and memory were detected. A deletion at 5q31.1 covering the protocadherin-α gene cluster (Pcdha 1-9) was found co-segregating with low music test scores (COMB) in both sample sets. Pcdha is involved in neural migration, differentiation and synaptogenesis. Creativity in music was found to co-segregate with a duplication covering glucose mutarotase gene (GALM) at 2p22. GALM has influence on serotonin release and membrane trafficking of the human serotonin transporter. Interestingly, genes related to serotonergic systems have been shown to associate not only with psychiatric disorders but also with creativity and music perception. Both, Pcdha and GALM, are related to the serotonergic systems influencing cognitive and motor functions, important for music perception and practice. Finally, a 1.3 Mb duplication was identified in a subject with low COMB scores in the region previously linked with absolute pitch (AP) at 8q24. No differences in the CNV burden was detected among the high/low music test scores or creative/non-creative groups. In summary, CNVs and genes found in this study are related to cognitive functions. Our result suggests new candidate genes for music perception

  13. Genome-wide identification and characterization of the apple (Malus domestica) HECT ubiquitin-protein ligase family and expression analysis of their responsiveness to abiotic stresses.

    PubMed

    Xu, Jianing; Xing, Shanshan; Cui, Haoran; Chen, Xuesen; Wang, Xiaoyun

    2016-04-01

    The ubiquitin-protein ligases (E3s) directly participate in ubiquitin (Ub) transferring to the target proteins in the ubiquitination pathway. The HECT ubiquitin-protein ligase (UPL), one type of E3s, is characterized as containing a conserved HECT domain of approximately 350 amino acids in the C terminus. Some UPLs were found to be involved in trichome development and leaf senescence in Arabidopsis. However, studies on plant UPLs, such as characteristics of the protein structure, predicted functional motifs of the HECT domain, and the regulatory expression of UPLs have all been limited. Here, we present genome-wide identification of the genes encoding UPLs (HECT gene) in apple. The 13 genes (named as MdUPL1-MdUPL13) from ten different chromosomes were divided into four groups by phylogenetic analysis. Among these groups, the encoding genes in the intron-exon structure and the included additional functional domains were quite different. Notably, the F-box domain was first found in MdUPL7 in plant UPLs. The HECT domain in different MdUPL groups also presented different spatial features and three types of conservative motifs were identified. The promoters of each MdUPL member carried multiple stress-response related elements by cis-acting element analysis. Experimental results demonstrated that the expressions of several MdUPLs were quite sensitive to cold-, drought-, and salt-stresses by qRT-PCR assay. The results of this study helped to elucidate the functions of HECT proteins, especially in Rosaceae plants.

  14. A Genome-Wide Association Study for Culm Cellulose Content in Barley Reveals Candidate Genes Co-Expressed with Members of the CELLULOSE SYNTHASE A Gene Family

    PubMed Central

    Houston, Kelly; Burton, Rachel A.; Sznajder, Beata; Rafalski, Antoni J.; Dhugga, Kanwarpal S.; Mather, Diane E.; Taylor, Jillian; Steffenson, Brian J.; Waugh, Robbie; Fincher, Geoffrey B.

    2015-01-01

    Cellulose is a fundamentally important component of cell walls of higher plants. It provides a scaffold that allows the development and growth of the plant to occur in an ordered fashion. Cellulose also provides mechanical strength, which is crucial for both normal development and to enable the plant to withstand both abiotic and biotic stresses. We quantified the cellulose concentration in the culm of 288 two – rowed and 288 six – rowed spring type barley accessions that were part of the USDA funded barley Coordinated Agricultural Project (CAP) program in the USA. When the population structure of these accessions was analysed we identified six distinct populations, four of which we considered to be comprised of a sufficient number of accessions to be suitable for genome-wide association studies (GWAS). These lines had been genotyped with 3072 SNPs so we combined the trait and genetic data to carry out GWAS. The analysis allowed us to identify regions of the genome containing significant associations between molecular markers and cellulose concentration data, including one region cross-validated in multiple populations. To identify candidate genes we assembled the gene content of these regions and used these to query a comprehensive RNA-seq based gene expression atlas. This provided us with gene annotations and associated expression data across multiple tissues, which allowed us to formulate a supported list of candidate genes that regulate cellulose biosynthesis. Several regions identified by our analysis contain genes that are co-expressed with CELLULOSE SYNTHASE A (HvCesA) across a range of tissues and developmental stages. These genes are involved in both primary and secondary cell wall development. In addition, genes that have been previously linked with cellulose synthesis by biochemical methods, such as HvCOBRA, a gene of unknown function, were also associated with cellulose levels in the association panel. Our analyses provide new insights into the

  15. A Genome-Wide Association Study for Culm Cellulose Content in Barley Reveals Candidate Genes Co-Expressed with Members of the CELLULOSE SYNTHASE A Gene Family.

    PubMed

    Houston, Kelly; Burton, Rachel A; Sznajder, Beata; Rafalski, Antoni J; Dhugga, Kanwarpal S; Mather, Diane E; Taylor, Jillian; Steffenson, Brian J; Waugh, Robbie; Fincher, Geoffrey B

    2015-01-01

    Cellulose is a fundamentally important component of cell walls of higher plants. It provides a scaffold that allows the development and growth of the plant to occur in an ordered fashion. Cellulose also provides mechanical strength, which is crucial for both normal development and to enable the plant to withstand both abiotic and biotic stresses. We quantified the cellulose concentration in the culm of 288 two--rowed and 288 six--rowed spring type barley accessions that were part of the USDA funded barley Coordinated Agricultural Project (CAP) program in the USA. When the population structure of these accessions was analysed we identified six distinct populations, four of which we considered to be comprised of a sufficient number of accessions to be suitable for genome-wide association studies (GWAS). These lines had been genotyped with 3072 SNPs so we combined the trait and genetic data to carry out GWAS. The analysis allowed us to identify regions of the genome containing significant associations between molecular markers and cellulose concentration data, including one region cross-validated in multiple populations. To identify candidate genes we assembled the gene content of these regions and used these to query a comprehensive RNA-seq based gene expression atlas. This provided us with gene annotations and associated expression data across multiple tissues, which allowed us to formulate a supported list of candidate genes that regulate cellulose biosynthesis. Several regions identified by our analysis contain genes that are co-expressed with cellulose synthase A (HvCesA) across a range of tissues and developmental stages. These genes are involved in both primary and secondary cell wall development. In addition, genes that have been previously linked with cellulose synthesis by biochemical methods, such as HvCOBRA, a gene of unknown function, were also associated with cellulose levels in the association panel. Our analyses provide new insights into the genes that

  16. Genome-Wide Association Study of the Child Behavior Checklist Dysregulation Profile

    PubMed Central

    Mick, Eric; McGough, James; Loo, Sandra; Doyle, Alysa E.; Wozniak, Janet; Wilens, Timothy E.; Smalley, Susan; McCracken, James; Biederman, Joseph; Faraone, Stephen V.

    2011-01-01

    Objective A potentially useful tool for understanding the distribution and determinants of emotional dysregulation in children is a Child Behavior Checklist profile comprised of the Attention Problems, Anxious/Depressed, and Aggressive Behavior clinical subscales (CBCL-DP). The CBCL-DP indexes a heritable trait that increases susceptibility for later psychopathology, including severe mood problems and aggressive behavior. We have conducted a genome-wide association study of the CBCL-DP in children with attention-deficit/hyperactivity disorder (ADHD). Method Families were ascertained at Massachusetts General Hospital and University of California, Los Angeles. Genotyping was conducted with the Illumina Human1M or Human1M-Duo BeadChip platforms. Genome-wide association analyses were conducted with the MQFAM multivariate extension of PLINK. Results CBCL data were available for 341 ADHD offspring from 339 ADHD affected trio families from the UCLA (N=128) and the MGH (N=213) sites. We found no genome-wide statistically significant associations but identified several plausible candidate genes among findings at p<5E-05: TMEM132D, LRRC7, SEMA3A, ALK, and STIP1. Conclusions We found suggestive evidence for developmentally expressed genes operant in hippocampal dependent memory and learning with the CBCL-DP. PMID:21784300

  17. Genome-wide analysis and expression profiling under heat and drought treatments of HSP70 gene family in soybean (Glycine max L.).

    PubMed

    Zhang, Ling; Zhao, Hong-Kun; Dong, Qian-Li; Zhang, Yuan-Yu; Wang, Yu-Min; Li, Hai-Yun; Xing, Guo-Jie; Li, Qi-Yun; Dong, Ying-Shan

    2015-01-01

    Heat shock proteins (HSPs) perform a fundamental role in protecting plants against abiotic stresses. Previous studies have made great efforts in the functional analysis of individual family members, but there has not yet been an overall analysis or expression profiling of the HSP70 gene family in soybeans (Glycine max L.). In this study, an investigation of the soybean genome revealed 61 putative HSP70 genes, which were evaluated. These genes were classified into eight sub-families, denoted I-VIII, based on a phylogenetic analysis. In each sub-family, the constituent parts of the gene structure and motif were relatively conserved. These GmHSP70 genes were distributed unequally on 17 of the 20 chromosomes. The analysis of the expression profiles showed that 53 of the 61 GmHSP70 genes were differentially expressed across the 14 tissues. However, most of the GmHSP70s were differentially expressed in a tissue-specific expression pattern. Furthermore, the expression of some of the duplicate genes was partially redundant, while others showed functional diversity. The quantitative real-time PCR (qRT-PCR) analysis of the 61 soybean HSP70 genes confirmed their stress-inducible expression patterns under both drought and heat stress. These findings provide a thorough overview of the evolution and modification of the GmHSP70 gene family, which will help to determine the functional characteristics of the HSP70 genes in soybean growth and development.

  18. Genome-wide identification, structural analysis and new insights into late embryogenesis abundant (LEA) gene family formation pattern in Brassica napus

    PubMed Central

    Liang, Yu; Xiong, Ziyi; Zheng, Jianxiao; Xu, Dongyang; Zhu, Zeyang; Xiang, Jun; Gan, Jianping; Raboanatahiry, Nadia; Yin, Yongtai; Li, Maoteng

    2016-01-01

    Late embryogenesis abundant (LEA) proteins are a diverse and large group of polypeptides that play important roles in desiccation and freezing tolerance in plants. The LEA family has been systematically characterized in some plants but not Brassica napus. In this study, 108 BnLEA genes were identified in the B. napus genome and classified into eight families based on their conserved domains. Protein sequence alignments revealed an abundance of alanine, lysine and glutamic acid residues in BnLEA proteins. The BnLEA gene structure has few introns (<3), and they are distributed unevenly across all 19 chromosomes in B. napus, occurring as gene clusters in chromosomes A9, C2, C4 and C5. More than two-thirds of the BnLEA genes are associated with segmental duplication. Synteny analysis revealed that most LEA genes are conserved, although gene losses or gains were also identified. These results suggest that segmental duplication and whole-genome duplication played a major role in the expansion of the BnLEA gene family. Expression profiles analysis indicated that expression of most BnLEAs was increased in leaves and late stage seeds. This study presents a comprehensive overview of the LEA gene family in B. napus and provides new insights into the formation of this family. PMID:27072743

  19. Genome-wide identification, structural analysis and new insights into late embryogenesis abundant (LEA) gene family formation pattern in Brassica napus.

    PubMed

    Liang, Yu; Xiong, Ziyi; Zheng, Jianxiao; Xu, Dongyang; Zhu, Zeyang; Xiang, Jun; Gan, Jianping; Raboanatahiry, Nadia; Yin, Yongtai; Li, Maoteng

    2016-01-01

    Late embryogenesis abundant (LEA) proteins are a diverse and large group of polypeptides that play important roles in desiccation and freezing tolerance in plants. The LEA family has been systematically characterized in some plants but not Brassica napus. In this study, 108 BnLEA genes were identified in the B. napus genome and classified into eight families based on their conserved domains. Protein sequence alignments revealed an abundance of alanine, lysine and glutamic acid residues in BnLEA proteins. The BnLEA gene structure has few introns (<3), and they are distributed unevenly across all 19 chromosomes in B. napus, occurring as gene clusters in chromosomes A9, C2, C4 and C5. More than two-thirds of the BnLEA genes are associated with segmental duplication. Synteny analysis revealed that most LEA genes are conserved, although gene losses or gains were also identified. These results suggest that segmental duplication and whole-genome duplication played a major role in the expansion of the BnLEA gene family. Expression profiles analysis indicated that expression of most BnLEAs was increased in leaves and late stage seeds. This study presents a comprehensive overview of the LEA gene family in B. napus and provides new insights into the formation of this family. PMID:27072743

  20. Genome-wide identification and expression profiling of the cystatin gene family in apple (Malus × domestica Borkh.).

    PubMed

    Tan, Yanxiao; Wang, Suncai; Liang, Dong; Li, Mingjun; Ma, Fengwang

    2014-06-01

    Cystatins or phytocystatins (PhyCys) comprise a family of plant-specific inhibitors of cysteine proteinases. Such inhibitors are thought to be involved in the regulation of several endogenous processes as well as defense against biotic or abiotic stresses. However, information about this family is limited in apple. We identified 26 PhyCys genes within the entire apple genome. They were clustered into three distinct groups distributed across several chromosomes. All of their putative proteins contained one or two typical cystatin domains, which shared the characteristic motifs of PhyCys. Eight selected genes displayed differential expression patterns in various tissues. Moreover, their transcript levels were also up-regulated significantly in leaves during maturation, senescence or in response to treatment with one or more abiotic stresses. Our results indicated that members of this family may function in tissue development, leaf senescence, and adaptation to adverse environments in apple.

  1. A genome-wide analysis of the auxin/indole-3-acetic acid gene family in hexaploid bread wheat (Triticum aestivum L.)

    PubMed Central

    Qiao, Linyi; Zhang, Xiaojun; Han, Xiao; Zhang, Lei; Li, Xin; Zhan, Haixian; Ma, Jian; Luo, Peigao; Zhang, Wenping; Cui, Lei; Li, Xiaoyan; Chang, Zhijian

    2015-01-01

    The Auxin/indole-3-acetic acid (Aux/IAA) gene family plays key roles in the primary auxin-response process and controls a number of important traits in plants. However, the characteristics of the Aux/IAA gene family in hexaploid bread wheat (Triticum aestivum L.) have long been unknown. In this study, a comprehensive identification of the Aux/IAA gene family was performed using the latest draft genome sequence of the bread wheat “Chinese Spring.” Thirty-four Aux/IAA genes were identified, 30 of which have duplicated genes on the A, B or D sub-genome, with a total of 84 Aux/IAA sequences. These predicted Aux/IAA genes were non-randomly distributed in all the wheat chromosomes except for chromosome 2D. The information of wheat Aux/IAA proteins is also described. Based on an analysis of phylogeny, expression and adaptive evolution, we prove that the Aux/IAA family in wheat has been replicated twice in the two allopolyploidization events of bread wheat, when the tandem duplication also occurred. The duplicated genes have undergone an evolutionary process of purifying selection, resulting in the high conservation of copy genes among sub-genomes and functional redundancy among several members of the TaIAA family. However, functional divergence probably existed in most TaIAA members due to the diversity of the functional domain and expression pattern. Our research provides useful information for further research into the function of Aux/IAA genes in wheat. PMID:26483801

  2. Genome-wide identification, phylogenetic analysis, expression profiling, and protein–protein interaction properties of TOPLESS gene family members in tomato

    PubMed Central

    Hao, Yanwei; Wang, Xinyu; van der Rest, Benoit; Zouine, Mohamed

    2014-01-01

    Members of the TOPLESS gene family emerged recently as key players in gene repression in several mechanisms, especially in auxin perception. The TOPLESS genes constitute, in ‘higher-plant’ genomes, a small multigenic family comprising four to 11 members. In this study, this family was investigated in tomato, a model plant for Solanaceae species and fleshy fruits. Six open reading frames predicted to encode topless-like proteins (SlTPLs) containing the canonical domains (LisH, CTLH, and two WD40 repeats) were identified in the tomato genome. Nuclear localization was confirmed for all members of the SlTPL family with the exception SlTPL6, which localized at the cytoplasm and was excluded from the nucleus. SlTPL genes displayed distinctive expression patterns in different tomato organs, with SlTPL1 showing the highest levels of transcript accumulation in all tissues tested except in ripening fruit where SlTPL3 and SlTPL4 were the most prominently expressed. To gain insight into the specificity of the different TOPLESS paralogues, a protein–protein interaction map between TOPLESS and auxin/indole-3-acetic acid (Aux/IAA) proteins was built using a yeast two-hybrid approach. The PPI map enabled the distinction of two patterns: TOPLESS isoforms interacting with the majority of Aux/IAA, and isoforms with limited capacity for interaction with these protein partners. Interestingly, evolutionary analyses of the TOPLESS gene family revealed that the highly expressed isoforms (SlTPL1, SlTPL3, and SlTPL4) corresponded to the three TPL-related genes undergoing the strongest purifying selection, while the selection was much weaker for SlTPL6, which was expressed at a low level and encoded a protein lacking the capacity to interact with Aux/IAAs. PMID:24399174

  3. Genome-wide analysis of the maize (Zea may L.) CPP-like gene family and expression profiling under abiotic stress.

    PubMed

    Song, X Y; Zhang, Y Y; Wu, F C; Zhang, L

    2016-01-01

    Cysteine-rich polycomb-like (CPP) proteins are members of a small family of transcription factors, which have been identified and characterized in Arabidopsis, rice, and soybean. In this study, we investigated CPP-like genes in the maize genome. The results revealed 13 putative CPP-like genes, which were found to encode 17 distinct transcripts and were distributed unequally on 7 of 10 maize chromosomes. Analysis of phylogenetic relationships showed that Arabidopsis, rice, and maize CPP-like transcription factors can be grouped into two subfamilies. We also used real-time RT-PCR to evaluate changes in the transcript levels of ZmCPP genes in response to abiotic stresses (heat, cold, salt, and drought stresses). These findings provide an overview of the evolution of the ZmCPP gene family, which will aid in the functional characterization of CPP-like genes in maize growth and development. PMID:27525875

  4. Genome-Wide Analysis of the AP2/ERF Family in Eucalyptus grandis: An Intriguing Over-Representation of Stress-Responsive DREB1/CBF Genes

    PubMed Central

    SanClemente, H.; Mounet, F.; Dunand, C.; Marque, G.; Marque, C.; Teulières, C.

    2015-01-01

    Background The AP2/ERF family includes a large number of developmentally and physiologically important transcription factors sharing an AP2 DNA-binding domain. Among them DREB1/CBF and DREB2 factors are known as master regulators respectively of cold and heat/osmotic stress responses. Experimental Approaches The manual annotation of AP2/ERF family from Eucalyptus grandis, Malus, Populus and Vitis genomes allowed a complete phylogenetic study for comparing the structure of this family in woody species and the model Arabidopsis thaliana. Expression profiles of the whole groups of EgrDREB1 and EgrDREB2 were investigated through RNAseq database survey and RT-qPCR analyses. Results The structure and the size of the AP2/ERF family show a global conservation for the plant species under comparison. In addition to an expansion of the ERF subfamily, the tree genomes mainly differ with respect to the group representation within the subfamilies. With regard to the E. grandis DREB subfamily, an obvious feature is the presence of 17 DREB1/CBF genes, the maximum reported to date for dicotyledons. In contrast, only six DREB2 have been identified, which is similar to the other plants species under study, except for Malus. All the DREB1/CBF and DREB2 genes from E. grandis are expressed in at least one condition and all are heat-responsive. Regulation by cold and drought depends on the genes but is not specific of one group; DREB1/CBF group is more cold-inducible than DREB2 which is mainly drought responsive. Conclusion These features suggest that the dramatic expansion of the DREB1/CBF group might be related to the adaptation of this evergreen tree to climate changes when it expanded in Australia. PMID:25849589

  5. A genome-wide analysis of the LBD (LATERAL ORGAN BOUNDARIES domain) gene family in Malus domestica with a functional characterization of MdLBD11.

    PubMed

    Wang, Xiaofei; Zhang, Shizhong; Su, Ling; Liu, Xin; Hao, Yujin

    2013-01-01

    The plant-specific LBD (LATERAL ORGAN BOUNDARIES domain) genes belong to a major family of transcription factor that encode a zinc finger-like domain. It has been shown that LBD genes play crucial roles in the growth and development of Arabidopsis and other plant species. However, no detailed information concerning this family is available for apple. In the present study, we analyzed the apple (Malus domestica) genome and identified 58 LBD genes. This gene family was tested for its phylogenetic relationships with homologous genes in the Arabidopsis genome, as well as its location in the genome, structure and expression. We also transformed one MdLBD gene into Arabidopsis to evaluate its function. Like Arabidopsis, apple LBD genes also have a conserved CX2CX6CX3C zinc finger-like domain in the N terminus and can be divided into two classes. The expression profile indicated that apple LBD genes exhibited a variety of expression patterns, suggesting that they have diverse functions. At the same time, the expression analysis implied that members of this apple gene family were responsive to hormones and stress and that they may participate in hormone-mediated plant organogenesis, which was demonstrated with the overexpression of the apple LBD gene MdLBD11, resulting in an abnormal phenotype. This phenotype included upward curling leaves, delayed flowering, downward-pointing flowers, siliques and other abnormal traits. Based on these data, we concluded that the MdLBD genes may play an important role in apple growth and development as in Arabidopsis and other species.

  6. Genome-Wide Characterization of the MADS-Box Gene Family in Radish (Raphanus sativus L.) and Assessment of Its Roles in Flowering and Floral Organogenesis

    PubMed Central

    Li, Chao; Wang, Yan; Xu, Liang; Nie, Shanshan; Chen, Yinglong; Liang, Dongyi; Sun, Xiaochuan; Karanja, Benard K.; Luo, Xiaobo; Liu, Liwang

    2016-01-01

    The MADS-box gene family is an important transcription factor (TF) family that is involved in various aspects of plant growth and development, especially flowering time and floral organogenesis. Although it has been reported in many plant species, the systematic identification and characterization of MADS-box TF family is still limited in radish (Raphanus sativus L.). In the present study, a comprehensive analysis of MADS-box genes was performed, and a total of 144 MADS-box family members were identified from the whole radish genome. Meanwhile, a detailed list of MADS-box genes from other 28 plant species was also investigated. Through the phylogenetic analysis between radish and Arabidopsis thaliana, all the RsMADS genes were classified into two groups including 68 type I (31 Mα, 12 Mβ and 25Mγ) and 76 type II (70 MIKCC and 6 MIKC∗). Among them, 41 (28.47%) RsMADS genes were located in nine linkage groups of radish from R1 to R9. Moreover, the homologous MADS-box gene pairs were identified among radish, A. thaliana, Chinese cabbage and rice. Additionally, the expression profiles of RsMADS genes were systematically investigated in different tissues and growth stages. Furthermore, quantitative real-time PCR analysis was employed to validate expression patterns of some crucial RsMADS genes. These results could provide a valuable resource to explore the potential functions of RsMADS genes in radish, and facilitate dissecting MADS-box gene-mediated molecular mechanisms underlying flowering and floral organogenesis in root vegetable crops. PMID:27703461

  7. Genome-Wide Identification and Transcriptome-Based Expression Profiling of the Sox Gene Family in the Nile Tilapia (Oreochromis niloticus)

    PubMed Central

    Wei, Ling; Yang, Chao; Tao, Wenjing; Wang, Deshou

    2016-01-01

    The Sox transcription factor family is characterized with the presence of a Sry-related high-mobility group (HMG) box and plays important roles in various biological processes in animals, including sex determination and differentiation, and the development of multiple organs. In this study, 27 Sox genes were identified in the genome of the Nile tilapia (Oreochromis niloticus), and were classified into seven groups. The members of each group of the tilapia Sox genes exhibited a relatively conserved exon-intron structure. Comparative analysis showed that the Sox gene family has undergone an expansion in tilapia and other teleost fishes following their whole genome duplication, and group K only exists in teleosts. Transcriptome-based analysis demonstrated that most of the tilapia Sox genes presented stage-specific and/or sex-dimorphic expressions during gonadal development, and six of the group B Sox genes were specifically expressed in the adult brain. Our results provide a better understanding of gene structure and spatio-temporal expression of the Sox gene family in tilapia, and will be useful for further deciphering the roles of the Sox genes during sex determination and gonadal development in teleosts. PMID:26907269

  8. Genome-Wide Identification and Expression Analysis of MRLK Family Genes Associated with Strawberry (Fragaria vesca) Fruit Ripening and Abiotic Stress Responses

    PubMed Central

    Zhang, Qing; Jia, Meiru; Xing, Yu; Qin, Ling; Li, Bingbing; Jia, Wensuo

    2016-01-01

    Malectin-like domain-containing receptor-like kinases (MRLK) constitute a large and divergent family of proteins in plants; however, little is known about the role of MRLKs in fruit growth and development. In this study, we characterized MRLK family genes in diploid strawberry, Fragaria vesca. Based on an analysis of malectin-like domain and a search in the strawberry genome and NCBI database, we identified 62 FvMRLKs in the strawberry genome, and classified these genes into six subfamilies with distinct malectin domains in the extracellular regions of the encoded proteins. Gene expression analysis indicated that more than 80% of the FvMRLKs were expressed in various tissues, with higher levels in roots than in other organs. Thirty-three FvMRLKs were found to be expressed in fruits during the early stages of development, and over 60% of these exhibited dramatic decreases in expression during fruit growth and development. Moreover, the expression of some FvMRLKs was sensitive to both environmental and internal cues that play critical roles in regulating strawberry fruit development and ripening. Collectively, this study provides valuable insight into the FvMRLKs gene family and its role in regulating strawberry fruit development and ripening. PMID:27685863

  9. Genome-wide identification, expression analysis of auxin-responsive GH3 family genes in maize (Zea mays L.) under abiotic stresses.

    PubMed

    Feng, Shangguo; Yue, Runqing; Tao, Sun; Yang, Yanjun; Zhang, Lei; Xu, Mingfeng; Wang, Huizhong; Shen, Chenjia

    2015-09-01

    Auxin is involved in different aspects of plant growth and development by regulating the expression of auxin-responsive family genes. As one of the three major auxin-responsive families, GH3 (Gretchen Hagen3) genes participate in auxin homeostasis by catalyzing auxin conjugation and bounding free indole-3-acetic acid (IAA) to amino acids. However, how GH3 genes function in responses to abiotic stresses and various hormones in maize is largely unknown. Here, the latest updated maize (Zea mays L.) reference genome sequence was used to characterize and analyze the ZmGH3 family genes from maize. The results showed that 13 ZmGH3 genes were mapped on five maize chromosomes (total 10 chromosomes). Highly diversified gene structures and tissue-specific expression patterns suggested the possibility of function diversification for these genes in response to environmental stresses and hormone stimuli. The expression patterns of ZmGH3 genes are responsive to several abiotic stresses (salt, drought and cadmium) and major stress-related hormones (abscisic acid, salicylic acid and jasmonic acid). Various environmental factors suppress auxin free IAA contents in maize roots suggesting that these abiotic stresses and hormones might alter GH3-mediated auxin levels. The responsiveness of ZmGH3 genes to a wide range of abiotic stresses and stress-related hormones suggested that ZmGH3s are involved in maize tolerance to environmental stresses.

  10. Genome-Wide Identification and Transcriptome-Based Expression Profiling of the Sox Gene Family in the Nile Tilapia (Oreochromis niloticus).

    PubMed

    Wei, Ling; Yang, Chao; Tao, Wenjing; Wang, Deshou

    2016-01-01

    The Sox transcription factor family is characterized with the presence of a Sry-related high-mobility group (HMG) box and plays important roles in various biological processes in animals, including sex determination and differentiation, and the development of multiple organs. In this study, 27 Sox genes were identified in the genome of the Nile tilapia (Oreochromis niloticus), and were classified into seven groups. The members of each group of the tilapia Sox genes exhibited a relatively conserved exon-intron structure. Comparative analysis showed that the Sox gene family has undergone an expansion in tilapia and other teleost fishes following their whole genome duplication, and group K only exists in teleosts. Transcriptome-based analysis demonstrated that most of the tilapia Sox genes presented stage-specific and/or sex-dimorphic expressions during gonadal development, and six of the group B Sox genes were specifically expressed in the adult brain. Our results provide a better understanding of gene structure and spatio-temporal expression of the Sox gene family in tilapia, and will be useful for further deciphering the roles of the Sox genes during sex determination and gonadal development in teleosts. PMID:26907269

  11. Genome-Wide Analysis and Expression Profiling of the SUC and SWEET Gene Families of Sucrose Transporters in Oilseed Rape (Brassica napus L.)

    PubMed Central

    Jian, Hongju; Lu, Kun; Yang, Bo; Wang, Tengyue; Zhang, Li; Zhang, Aoxiang; Wang, Jia; Liu, Liezhao; Qu, Cunmin; Li, Jiana

    2016-01-01

    Sucrose is the principal transported product of photosynthesis from source leaves to sink organs. SUTs/SUCs (sucrose transporters or sucrose carriers) and SWEETs (Sugars Will Eventually be Exported Transporters) play significant central roles in phloem loading and unloading. SUTs/SUCs and SWEETs are key players in sucrose translocation and are associated with crop yields. The SUT/SUC and SWEET genes have been characterized in several plant species, but a comprehensive analysis of these two gene families in oilseed rape has not yet been reported. In our study, 22 and 68 members of the SUT/SUCs and SWEET gene families, respectively, were identified in the oilseed rape (Brassica napus) genome through homology searches. An analysis of the chromosomal distribution, phylogenetic relationships, gene structures, motifs and the cis-acting regulatory elements in the promoters of BnSUC and BnSWEET genes were analyzed. Furthermore, we examined the expression of the 18 BnSUC and 16 BnSWEET genes in different tissues of “ZS11” and the expression of 9 BnSUC and 7 BnSWEET genes in “ZS11” under various conditions, including biotic stress (Sclerotinia sclerotiorum), abiotic stresses (drought, salt and heat), and hormone treatments (abscisic acid, auxin, cytokinin, brassinolide, gibberellin, and salicylic acid). In conclusion, our study provides the first comprehensive analysis of the oilseed rape SUC and SWEET gene families. Information regarding the phylogenetic relationships, gene structure and expression profiles of the SUC and SWEET genes in the different tissues of oilseed rape helps to identify candidates with potential roles in specific developmental processes. Our study advances our understanding of the important roles of sucrose transport in oilseed rape. PMID:27733861

  12. Genome-wide identification and characterization of TIFY family genes in Moso Bamboo (Phyllostachys edulis) and expression profiling analysis under dehydration and cold stresses

    PubMed Central

    Jin, Si-Han; Guo, Han-Du; Zhong, Xiao-Juan; He, Jiao; Li, Xi; Jiang, Ming-Yan; Yu, Xiao-Fang; Ma, Ming-Dong; Chen, Qi-Bing

    2016-01-01

    The proteins containing the TIFY domain belong to a plant-specific family of putative transcription factors and could be divided into four subfamilies: ZML, TIFY, PPD and JAZ. They not only function as key regulators of jasmonate hormonal response, but are also involved in responding to abiotic stress. In this study, we identified 24 TIFY genes (PeTIFYs) in Moso bamboo (Phyllostachys edulis) of Poaceae by analyzing the whole genome sequence. One PeTIFY belongs to TIFY subfamily, 18 and five belong to JAZ and ZML subfamilies, respectively. Two equivocal gene models were re-predicted and a putative retrotransposition event was found in a ZML protein. The distribution and conservation of domain or motif, and gene structure were also analyzed. Phylogenetic analysis with TIFY proteins of Arabidopsis and Oryza sativa indicated that JAZ subfamily could be further divided to four groups. Evolutionary analysis revealed intragenomic duplication and orthologous relationship between P. edulis, O. sativa, and B. distachyon. Calculation of the non-synonymous (Ka) and synonymous (Ks) substitution rates and their ratios indicated that the duplication of PeTIFY may have occurred around 16.7 million years ago (MYA), the divergence time of TIFY family among the P. edulis-O. sativa, P. edulis-B. distachyon, and O. sativa-B. distachyon was approximately 39 MYA, 39 MYA, and 45 MYA, respectively. They appear to have undergone extensive purifying selection during evolution. Transcriptome sequencing revealed that more than 50% of PeTIFY genes could be up-regulated by cold and dehydration stresses, and some PeTIFYs also share homology to know TIFYs involved in abiotic stress tolerance. Our results made insights into TIFY family of Moso bamboo, an economically important non-timber forest resource, and provided candidates for further identification of genes involved in regulating responses to abiotic stress. PMID:27812419

  13. Genome-wide survey of prokaryotic serine proteases: Analysis of distribution and domain architectures of five serine protease families in prokaryotes

    PubMed Central

    Tripathi, Lokesh P; Sowdhamini, R

    2008-01-01

    Background Serine proteases are one of the most abundant groups of proteolytic enzymes found in all the kingdoms of life. While studies have established significant roles for many prokaryotic serine proteases in several physiological processes, such as those associated with metabolism, cell signalling, defense response and development, functional associations for a large number of prokaryotic serine proteases are relatively unknown. Current analysis is aimed at understanding the distribution and probable biological functions of the select serine proteases encoded in representative prokaryotic organisms. Results A total of 966 putative serine proteases, belonging to five families, were identified in the 91 prokaryotic genomes using various sensitive sequence search techniques. Phylogenetic analysis reveals several species-specific clusters of serine proteases suggesting their possible involvement in organism-specific functions. Atypical phylogenetic associations suggest an important role for lateral gene transfer events in facilitating the widespread distribution of the serine proteases in the prokaryotes. Domain organisations of the gene products were analysed, employing sensitive sequence search methods, to infer their probable biological functions. Trypsin, subtilisin and Lon protease families account for a significant proportion of the multi-domain representatives, while the D-Ala-D-Ala carboxypeptidase and the Clp protease families are mostly single-domain polypeptides in prokaryotes. Regulatory domains for protein interaction, signalling, pathogenesis, cell adhesion etc. were found tethered to the serine protease domains. Some domain combinations (such as S1-PDZ; LON-AAA-S16 etc.) were found to be widespread in the prokaryotic lineages suggesting a critical role in prokaryotes. Conclusion Domain architectures of many serine proteases and their homologues identified in prokaryotes are very different from those observed in eukaryotes, suggesting distinct roles

  14. Genome-Wide Analysis of the AP2/ERF Transcription Factors Family and the Expression Patterns of DREB Genes in Moso Bamboo (Phyllostachys edulis).

    PubMed

    Wu, Huili; Lv, Hao; Li, Long; Liu, Jun; Mu, Shaohua; Li, Xueping; Gao, Jian

    2015-01-01

    The AP2/ERF transcription factor family, one of the largest families unique to plants, performs a significant role in terms of regulation of growth and development, and responses to biotic and abiotic stresses. Moso bamboo (Phyllostachys edulis) is a fast-growing non-timber forest species with the highest ecological, economic and social values of all bamboos in Asia. The draft genome of moso bamboo and the available genomes of other plants provide great opportunities to research global information on the AP2/ERF family in moso bamboo. In total, 116 AP2/ERF transcription factors were identified in moso bamboo. The phylogeny analyses indicated that the 116 AP2/ERF genes could be divided into three subfamilies: AP2, RAV and ERF; and the ERF subfamily genes were divided into 11 groups. The gene structures, exons/introns and conserved motifs of the PeAP2/ERF genes were analyzed. Analysis of the evolutionary patterns and divergence showed the PeAP2/ERF genes underwent a large-scale event around 15 million years ago (MYA) and the division time of AP2/ERF family genes between rice and moso bamboo was 15-23 MYA. We surveyed the putative promoter regions of the PeDREBs and showed that largely stress-related cis-elements existed in these genes. Further analysis of expression patterns of PeDREBs revealed that the most were strongly induced by drought, low-temperature and/or high salinity stresses in roots and, in contrast, most PeDREB genes had negative functions in leaves under the same respective stresses. In this study there were two main interesting points: there were fewer members of the PeDREB subfamily in moso bamboo than in other plants and there were differences in DREB gene expression profiles between leaves and roots triggered in response to abiotic stress. The information produced from this study may be valuable in overcoming challenges in cultivating moso bamboo. PMID:25985202

  15. Genome-Wide Analysis of the AP2/ERF Transcription Factors Family and the Expression Patterns of DREB Genes in Moso Bamboo (Phyllostachys edulis)

    PubMed Central

    Li, Long; Liu, Jun; Mu, Shaohua; Li, Xueping; Gao, Jian

    2015-01-01

    The AP2/ERF transcription factor family, one of the largest families unique to plants, performs a significant role in terms of regulation of growth and development, and responses to biotic and abiotic stresses. Moso bamboo (Phyllostachys edulis) is a fast-growing non-timber forest species with the highest ecological, economic and social values of all bamboos in Asia. The draft genome of moso bamboo and the available genomes of other plants provide great opportunities to research global information on the AP2/ERF family in moso bamboo. In total, 116 AP2/ERF transcription factors were identified in moso bamboo. The phylogeny analyses indicated that the 116 AP2/ERF genes could be divided into three subfamilies: AP2, RAV and ERF; and the ERF subfamily genes were divided into 11 groups. The gene structures, exons/introns and conserved motifs of the PeAP2/ERF genes were analyzed. Analysis of the evolutionary patterns and divergence showed the PeAP2/ERF genes underwent a large-scale event around 15 million years ago (MYA) and the division time of AP2/ERF family genes between rice and moso bamboo was 15–23 MYA. We surveyed the putative promoter regions of the PeDREBs and showed that largely stress-related cis-elements existed in these genes. Further analysis of expression patterns of PeDREBs revealed that the most were strongly induced by drought, low-temperature and/or high salinity stresses in roots and, in contrast, most PeDREB genes had negative functions in leaves under the same respective stresses. In this study there were two main interesting points: there were fewer members of the PeDREB subfamily in moso bamboo than in other plants and there were differences in DREB gene expression profiles between leaves and roots triggered in response to abiotic stress. The information produced from this study may be valuable in overcoming challenges in cultivating moso bamboo. PMID:25985202

  16. Genome-wide analysis of SnRK gene family in Brachypodium distachyon and functional characterization of BdSnRK2.9.

    PubMed

    Wang, Lianzhe; Hu, Wei; Sun, Jiutong; Liang, Xiaoyu; Yang, Xiaoyue; Wei, Shuya; Wang, Xiatian; Zhou, Yi; Xiao, Qiang; Yang, Guangxiao; He, Guangyuan

    2015-08-01

    The sucrose non-fermenting 1 (SNF1)-related protein kinases (SnRKs) play key roles in plant signaling pathways including responses to biotic and abiotic stresses. Although SnRKs have been systematically studied in Arabidopsis and rice, there is no information concerning SnRKs in the new Poaceae model plant Brachypodium distachyon. In the present study, a total of 44 BdSnRKs were identified and classified into three subfamilies, including three members of BdSnRK1, 10 of BdSnRK2 and 31 of BdSnRK3 (CIPK) subfamilies. Phylogenetic reconstruction, chromosome distribution and synteny analyses suggested that BdSnRK family had been established before the dicot-monocot lineage parted, and had experienced rapid expansion during the process of plant evolution since then. Expression analysis of the BdSnRK2 subfamily showed that the majority of them could respond to abiotic stress and related signal molecules treatments. Protein-protein interaction and co-expression analyses of BdSnRK2s network showed that SnRK2s might be involved in biological pathway different from that of dicot model plant Arabidopsis. Expression of BdSnRK2.9 in tobacco resulted in increased tolerance to drought and salt stresses through activation of NtABF2. Taken together, comprehensive analyses of BdSnRKs would provide a basis for understanding of evolution and function of BdSnRK family. PMID:26089150

  17. Genome-Wide Identification of Mitogen-Activated Protein Kinase Gene Family across Fungal Lineage Shows Presence of Novel and Diverse Activation Loop Motifs

    PubMed Central

    Mohanta, Tapan Kumar; Mohanta, Nibedita; Parida, Pratap; Panda, Sujogya Kumar; Ponpandian, Lakshmi Narayanan; Bae, Hanhong

    2016-01-01

    The mitogen-activated protein kinase (MAPK) is characterized by the presence of the T-E-Y, T-D-Y, and T-G-Y motifs in its activation loop region and plays a significant role in regulating diverse cellular responses in eukaryotic organisms. Availability of large-scale genome data in the fungal kingdom encouraged us to identify and analyse the fungal MAPK gene family consisting of 173 fungal species. The analysis of the MAPK gene family resulted in the discovery of several novel activation loop motifs (T-T-Y, T-I-Y, T-N-Y, T-H-Y, T-S-Y, K-G-Y, T-Q-Y, S-E-Y and S-D-Y) in fungal MAPKs. The phylogenetic analysis suggests that fungal MAPKs are non-polymorphic, had evolved from their common ancestors around 1500 million years ago, and are distantly related to plant MAPKs. We are the first to report the presence of nine novel activation loop motifs in fungal MAPKs. The specificity of the activation loop motif plays a significant role in controlling different growth and stress related pathways in fungi. Hence, the presences of these nine novel activation loop motifs in fungi are of special interest. PMID:26918378

  18. Genome-wide identification and characterization of WRKY transcriptional factor family in apple and analysis of their responses to waterlogging and drought stress.

    PubMed

    Meng, Dong; Li, Yuanyuan; Bai, Yang; Li, Mingjun; Cheng, Lailiang

    2016-06-01

    As one of the largest transcriptional factor families in plants, WRKY genes play significant roles in various biotic and abiotic stress responses. Although the WRKY gene family has been characterized in a few plant species, the details remain largely unknown in the apple (Malus domestica Borkh.). In this study, we identified a total of 127 MdWRKYs from the apple genome, which were divided into four subgroups according to the WRKY domains and zinc finger motif. Most of them were mapped onto the apple's 17 chromosomes and were expressed in more than one tissue, including shoot tips, mature leaves, fruit and apple calli. We then contrasted WRKY expression patterns between calli grown in solid medium (control) and liquid medium (representing waterlogging stress) and found that 34 WRKY genes were differentially expressed between the two growing conditions. Finally, we determined the expression patterns of 10 selected WRKY genes in an apple rootstock, G41, in response to waterlogging and drought stress, which identified candidate genes involved in responses to water stress for functional analysis. Our data provide interesting candidate MdWRKYs for future functional analysis and demonstrate that apple callus is a useful system for characterizing gene expression and function in apple. PMID:26970718

  19. The heat shock factor gene family in Salix suchowensis: a genome-wide survey and expression profiling during development and abiotic stresses

    PubMed Central

    Zhang, Jin; Li, Yu; Jia, Hui-Xia; Li, Jian-Bo; Huang, Juan; Lu, Meng-Zhu; Hu, Jian-Jun

    2015-01-01

    Heat shock transcription factors (Hsfs), which act as important transcriptional regulatory proteins, play crucial roles in plant developmental processes, and stress responses. Recently, the genome of the shrub willow Salix suchowensis was fully sequenced. In this study, a total of 27 non-redundant Hsf genes were identified from the S. suchowensis genome. Phylogenetic analysis revealed that the members of the SsuHsf family can be divided into three groups (class A, B, and C) based on their structural characteristics. Promoter analysis indicated that the SsuHsfs promoters included various cis-acting elements related to hormone and/or stress responses. Furthermore, the expression profiles of 27 SsuHsfs were analyzed in different tissues and under various stresses (heat, drought, salt, and ABA treatment) using RT-PCR. The results demonstrated that the SsuHsfs were involved in abiotic stress responses. Our results contribute to a better understanding of the complexity of the SsuHsf gene family, and will facilitate functional characterization in future studies. PMID:26442061

  20. Genome-Wide Identification, Evolutionary Analysis and Expression Profiles of LATERAL ORGAN BOUNDARIES DOMAIN Gene Family in Lotus japonicus and Medicago truncatula

    PubMed Central

    Yang, Tianquan; Fang, Genwang yue; He, Hua

    2016-01-01

    The LATERAL ORGAN BOUNDARIES DOMAIN (LBD) gene family has been well-studied in Arabidopsis and play crucial roles in the diverse growth and development processes including establishment and maintenance of boundary of developmental lateral organs. In this study we identified and characterized 38 LBD genes in Lotus japonicus (LjLBD) and 57 LBD genes in Medicago truncatula (MtLBD), both of which are model legume plants that have some specific development features absent in Arabidopsis. The phylogenetic relationships, their locations in the genome, genes structure and conserved motifs were examined. The results revealed that all LjLBD and MtLBD genes could be distinctly divided into two classes: Class I and II. The evolutionary analysis showed that Type I functional divergence with some significantly site-specific shifts may be the main force for the divergence between Class I and Class II. In addition, the expression patterns of LjLBD genes uncovered the diverse functions in plant development. Interestingly, we found that two LjLBD proteins that were highly expressed during compound leaf and pulvinus development, can interact via yeast two-hybrid assays. Taken together, our findings provide an evolutionary and genetic foundation in further understanding the molecular basis of LBD gene family in general, specifically in L. japonicus and M. truncatula. PMID:27560982

  1. Genome-wide microarrray analysis reveals roles for the REF-1 family member HLH-29 in ferritin synthesis and peroxide stress response.

    PubMed

    Quach, Thanh K; Chou, Han Ting; Wang, Kun; Milledge, Gaolin Zheng; Johnson, Casonya M

    2013-01-01

    In Caenorhabditis elegans, the six proteins that make up the REF-1 family have been identified as functional homologs of the Hairy/Enhancer of Split (HES) proteins. These transcription factors act in both Notch dependent and Notch-independent pathways to regulate embryonic events during development; however, their post-embryonic functions are not well defined. As a first step toward understanding how the REF-1 family works together to coordinate post-embryonic events, we used gene expression microarray analysis to identify transcriptional targets of HLH-29 in L4/young adult stage animals. Here we show that HLH-29 targets are genes needed for the regulation of growth and lifespan, including genes required for oxidative stress response and fatty acid metabolism, and the ferritin genes, ftn-1 and ftn-2. We show that HLH-29 regulates ftn-1 expression via promoter sequences upstream of the iron-dependent element that is recognized by the hypoxia inducible factor, HIF-1. Additionally, hlh-29 mutants are more resistant to peroxide stress than wild-type animals and ftn-1(RNAi) animals, even in the presence of excess iron. Finally we show that HLH-29 acts parallel to DAF-16 but upstream of the microphthalmia transcription factor ortholog, HLH-30, to regulate ftn-1 expression under normal growth conditions.

  2. Genome-Wide Identification, Evolutionary Analysis and Expression Profiles of LATERAL ORGAN BOUNDARIES DOMAIN Gene Family in Lotus japonicus and Medicago truncatula.

    PubMed

    Yang, Tianquan; Fang, Genwang Yue; He, Hua; Chen, Jianghua

    2016-01-01

    The LATERAL ORGAN BOUNDARIES DOMAIN (LBD) gene family has been well-studied in Arabidopsis and play crucial roles in the diverse growth and development processes including establishment and maintenance of boundary of developmental lateral organs. In this study we identified and characterized 38 LBD genes in Lotus japonicus (LjLBD) and 57 LBD genes in Medicago truncatula (MtLBD), both of which are model legume plants that have some specific development features absent in Arabidopsis. The phylogenetic relationships, their locations in the genome, genes structure and conserved motifs were examined. The results revealed that all LjLBD and MtLBD genes could be distinctly divided into two classes: Class I and II. The evolutionary analysis showed that Type I functional divergence with some significantly site-specific shifts may be the main force for the divergence between Class I and Class II. In addition, the expression patterns of LjLBD genes uncovered the diverse functions in plant development. Interestingly, we found that two LjLBD proteins that were highly expressed during compound leaf and pulvinus development, can interact via yeast two-hybrid assays. Taken together, our findings provide an evolutionary and genetic foundation in further understanding the molecular basis of LBD gene family in general, specifically in L. japonicus and M. truncatula. PMID:27560982

  3. Genome-Wide Analysis of Drosophila RBf2 Protein Highlights the Diversity of RB Family Targets and Possible Role in Regulation of Ribosome Biosynthesis.

    PubMed

    Wei, Yiliang; Mondal, Shamba S; Mouawad, Rima; Wilczyński, Bartek; Henry, R William; Arnosti, David N

    2015-05-20

    RBf2 is a recently evolved retinoblastoma family member in Drosophila that differs from RBf1, especially in the C-terminus. To investigate whether the unique features of RBf2 contribute to diverse roles in gene regulation, we performed chromatin immunoprecipitation sequencing for both RBf2 and RBf1 in embryos. A previous model for RB-E2F interactions suggested that RBf1 binds dE2F1 or dE2F2, whereas RBf2 is restricted to binding to dE2F2; however, we found that RBf2 targets approximately twice as many genes as RBf1. Highly enriched among the RBf2 targets were ribosomal protein genes. We tested the functional significance of this finding by assessing RBf activity on ribosomal protein promoters and the endogenous genes. RBf1 and RBf2 significantly repressed expression of some ribosomal protein genes, although not all bound genes showed transcriptional effects. Interestingly, many ribosomal protein genes are similarly targeted in human cells, indicating that these interactions may be relevant for control of ribosome biosynthesis and growth. We carried out bioinformatic analysis to investigate the basis for differential targeting by these two proteins and found that RBf2-specific promoters have distinct sequence motifs, suggesting unique targeting mechanisms. Association of RBf2 with these promoters appears to be independent of dE2F2/dDP, although promoters bound by both RBf1 and RBf2 require dE2F2/dDP. The presence of unique RBf2 targets suggest that evolutionary appearance of this corepressor represents the acquisition of potentially novel roles in gene regulation for the RB family.

  4. Genome wide identification of the immunophilin gene family in Leptosphaeria maculans: a causal agent of Blackleg disease in Oilseed Rape (Brassica napus).

    PubMed

    Singh, Khushwant; Zouhar, Miloslav; Mazakova, Jana; Rysanek, Pavel

    2014-10-01

    Abstract Phoma stem canker (blackleg) is a disease of world-wide importance on oilseed rape (Brassica napus) and can cause serious losses for crops globally. The disease is caused by dothideomycetous fungus, Leptosphaeria maculans, which is highly virulent/aggressive. Cyclophilins (CYPs) and FK506-binding proteins (FKBPs) are ubiquitous proteins belonging to the peptidyl-prolyl cis/trans isomerase (PPIase) family. They are collectively referred to as immunophilins (IMMs). In the present study, IMM genes, CYP and FKBP in haploid strain v23.1.3 of L. maculans genome, were identified and classified. Twelve CYPs and five FKBPs were determined in total. Domain architecture analysis revealed the presence of a conserved cyclophilin-like domain (CLD) in the case of CYPs and FKBP_C in the case of FKBPs. Interestingly, IMMs in L. maculans also subgrouped into single domain (SD) and multidomain (MD) proteins. They were primarily found to be localized in cytoplasm, nuclei, and mitochondria. Homologous and orthologous gene pairs were also determined by comparison with the model organism Saccharomyces cerevisiae. Remarkably, IMMs of L. maculans contain shorter introns in comparison to exons. Moreover, CYPs, in contrast with FKBPs, contain few exons. However, two CYPs were determined as being intronless. The expression profile of IMMs in both mycelium and infected primary leaves of B. napus demonstrated their potential role during infection. Secondary structure analysis revealed the presence of atypical eight β strands and two α helices fold architecture. Gene ontology analysis of IMMs predicted their significant role in protein folding and PPIase activity. Taken together, our findings for the first time present new prospects of this highly conserved gene family in phytopathogenic fungus.

  5. Genome wide identification of the immunophilin gene family in Leptosphaeria maculans: a causal agent of Blackleg disease in Oilseed Rape (Brassica napus).

    PubMed

    Singh, Khushwant; Zouhar, Miloslav; Mazakova, Jana; Rysanek, Pavel

    2014-10-01

    Abstract Phoma stem canker (blackleg) is a disease of world-wide importance on oilseed rape (Brassica napus) and can cause serious losses for crops globally. The disease is caused by dothideomycetous fungus, Leptosphaeria maculans, which is highly virulent/aggressive. Cyclophilins (CYPs) and FK506-binding proteins (FKBPs) are ubiquitous proteins belonging to the peptidyl-prolyl cis/trans isomerase (PPIase) family. They are collectively referred to as immunophilins (IMMs). In the present study, IMM genes, CYP and FKBP in haploid strain v23.1.3 of L. maculans genome, were identified and classified. Twelve CYPs and five FKBPs were determined in total. Domain architecture analysis revealed the presence of a conserved cyclophilin-like domain (CLD) in the case of CYPs and FKBP_C in the case of FKBPs. Interestingly, IMMs in L. maculans also subgrouped into single domain (SD) and multidomain (MD) proteins. They were primarily found to be localized in cytoplasm, nuclei, and mitochondria. Homologous and orthologous gene pairs were also determined by comparison with the model organism Saccharomyces cerevisiae. Remarkably, IMMs of L. maculans contain shorter introns in comparison to exons. Moreover, CYPs, in contrast with FKBPs, contain few exons. However, two CYPs were determined as being intronless. The expression profile of IMMs in both mycelium and infected primary leaves of B. napus demonstrated their potential role during infection. Secondary structure analysis revealed the presence of atypical eight β strands and two α helices fold architecture. Gene ontology analysis of IMMs predicted their significant role in protein folding and PPIase activity. Taken together, our findings for the first time present new prospects of this highly conserved gene family in phytopathogenic fungus. PMID:25259854

  6. Genome-Wide Analysis of Drosophila RBf2 Protein Highlights the Diversity of RB Family Targets and Possible Role in Regulation of Ribosome Biosynthesis

    PubMed Central

    Wei, Yiliang; Mondal, Shamba S.; Mouawad, Rima; Wilczyński, Bartek; Henry, R. William; Arnosti, David N.

    2015-01-01

    RBf2 is a recently evolved retinoblastoma family member in Drosophila that differs from RBf1, especially in the C-terminus. To investigate whether the unique features of RBf2 contribute to diverse roles in gene regulation, we performed chromatin immunoprecipitation sequencing for both RBf2 and RBf1 in embryos. A previous model for RB−E2F interactions suggested that RBf1 binds dE2F1 or dE2F2, whereas RBf2 is restricted to binding to dE2F2; however, we found that RBf2 targets approximately twice as many genes as RBf1. Highly enriched among the RBf2 targets were ribosomal protein genes. We tested the functional significance of this finding by assessing RBf activity on ribosomal protein promoters and the endogenous genes. RBf1 and RBf2 significantly repressed expression of some ribosomal protein genes, although not all bound genes showed transcriptional effects. Interestingly, many ribosomal protein genes are similarly targeted in human cells, indicating that these interactions may be relevant for control of ribosome biosynthesis and growth. We carried out bioinformatic analysis to investigate the basis for differential targeting by these two proteins and found that RBf2-specific promoters have distinct sequence motifs, suggesting unique targeting mechanisms. Association of RBf2 with these promoters appears to be independent of dE2F2/dDP, although promoters bound by both RBf1 and RBf2 require dE2F2/dDP. The presence of unique RBf2 targets suggest that evolutionary appearance of this corepressor represents the acquisition of potentially novel roles in gene regulation for the RB family. PMID:25999584

  7. Genome-wide Analyses of the Structural Gene Families Involved in the Legume-specific 5-Deoxyisoflavonoid Biosynthesis of Lotus japonicus

    PubMed Central

    Shimada, Norimoto; Sato, Shusei; Akashi, Tomoyoshi; Nakamura, Yasukazu; Tabata, Satoshi; Ayabe, Shin-ichi; Aoki, Toshio

    2007-01-01

    Abstract A model legume Lotus japonicus (Regel) K. Larsen is one of the subjects of genome sequencing and functional genomics programs. In the course of targeted approaches to the legume genomics, we analyzed the genes encoding enzymes involved in the biosynthesis of the legume-specific 5-deoxyisoflavonoid of L. japonicus, which produces isoflavan phytoalexins on elicitor treatment. The paralogous biosynthetic genes were assigned as comprehensively as possible by biochemical experiments, similarity searches, comparison of the gene structures, and phylogenetic analyses. Among the 10 biosynthetic genes investigated, six comprise multigene families, and in many cases they form gene clusters in the chromosomes. Semi-quantitative reverse transcriptase–PCR analyses showed coordinate up-regulation of most of the genes during phytoalexin induction and complex accumulation patterns of the transcripts in different organs. Some paralogous genes exhibited similar expression specificities, suggesting their genetic redundancy. The molecular evolution of the biosynthetic genes is discussed. The results presented here provide reliable annotations of the genes and genetic markers for comparative and functional genomics of leguminous plants. PMID:17452423

  8. Genome- wide characterization of Nuclear Factor Y (NF-Y) gene family of sorghum [Sorghum bicolor (L.) Moench]: a bioinformatics approach.

    PubMed

    Malviya, Neha; Jaiswal, Parul; Yadav, Dinesh

    2016-01-01

    Nuclear factor Y (NF-Y) is a heterotrimeric transcription factor (TF) complex with preferential binding to CCAAT elements of promoters, regulating gene expression in most of the higher eukaryotes. The availability of plant genome sequences have revealed multiple number of genes coding for the three subunits, namely NF-YA, NF-YB and NF-YC in contrast to single NF-Y gene for each subunit reported in yeast and animals. A total of 33 NF-YTF comprising of 8 NF-YA, 11 NF-YB and 14 NF-YC subunits were accessed from the sorghum genome. The bioinformatic characterization of NF-Y gene family of sorghum for gene structure, chromosome location, protein motif, phylogeny, gene duplication and in-silico expression under abiotic stresses have been attempted in the present study. The identified SbNF-Y genes are distributed on all the 10 chromosomes of sorghum with variability in the frequency and 18 out of 33 SbNF-Ys were found to be intronless. Segmental duplication event was found to be predominant feature based on gene duplication pattern study. Several orthologs and paralogs groups were disclosed through the comprehensive phylogenetic analysis of SbNF-Y proteins along with 36 Arabidopsis and 28 rice NF-Y proteins. In-silico expression analysis under abiotic stresses using rice transcriptome data revealed several of the sorghum NF-Y genes to be associated with salt, drought, cold and heat stresses.

  9. Genome-wide analyses of the bZIP family reveal their involvement in the development, ripening and abiotic stress response in banana

    PubMed Central

    Hu, Wei; Wang, Lianzhe; Tie, Weiwei; Yan, Yan; Ding, Zehong; Liu, Juhua; Li, Meiying; Peng, Ming; Xu, Biyu; Jin, Zhiqiang

    2016-01-01

    The leucine zipper (bZIP) transcription factors play important roles in multiple biological processes. However, less information is available regarding the bZIP family in the important fruit crop banana. In this study, 121 bZIP transcription factor genes were identified in the banana genome. Phylogenetic analysis showed that MabZIPs were classified into 11 subfamilies. The majority of MabZIP genes in the same subfamily shared similar gene structures and conserved motifs. The comprehensive transcriptome analysis of two banana genotypes revealed the differential expression patterns of MabZIP genes in different organs, in various stages of fruit development and ripening, and in responses to abiotic stresses, including drought, cold, and salt. Interaction networks and co-expression assays showed that group A MabZIP-mediated networks participated in various stress signaling, which was strongly activated in Musa ABB Pisang Awak. This study provided new insights into the complicated transcriptional control of MabZIP genes and provided robust tissue-specific, development-dependent, and abiotic stress-responsive candidate MabZIP genes for potential applications in the genetic improvement of banana cultivars. PMID:27445085

  10. The auxin response factor transcription factor family in soybean: genome-wide identification and expression analyses during development and water stress.

    PubMed

    Ha, Chien Van; Le, Dung Tien; Nishiyama, Rie; Watanabe, Yasuko; Sulieman, Saad; Tran, Uyen Thi; Mochida, Keiichi; Dong, Nguyen Van; Yamaguchi-Shinozaki, Kazuko; Shinozaki, Kazuo; Tran, Lam-Son Phan

    2013-10-01

    In plants, the auxin response factor (ARF) transcription factors play important roles in regulating diverse biological processes, including development, growth, cell division and responses to environmental stimuli. An exhaustive search of soybean genome revealed 51 GmARFs, many of which were formed by genome duplications. The typical GmARFs (43 members) contain a DNA-binding domain, an ARF domain and an auxin/indole acetic acid (AUX/IAA) dimerization domain, whereas the remaining eight members lack the dimerization domain. Phylogenetic analysis of the ARFs from soybean and Arabidopsis revealed both similarity and divergence between the two ARF families, as well as enabled us to predict the functions of the GmARFs. Using quantitative real-time polymerase chain reaction (qRT-PCR) and available soybean Affymetrix array and Illumina transcriptome sequence data, a comprehensive expression atlas of GmARF genes was obtained in various organs and tissues, providing useful information about their involvement in defining the precise nature of individual tissues. Furthermore, expression profiling using qRT-PCR and microarray data revealed many water stress-responsive GmARFs in soybean, albeit with different patterns depending on types of tissues and/or developmental stages. Our systematic analysis has identified excellent tissue-specific and/or stress-responsive candidate GmARF genes for in-depth in planta functional analyses, which would lead to potential applications in the development of genetically modified soybean cultivars with enhanced drought tolerance. PMID:23810914

  11. Genome-wide analyses of the bZIP family reveal their involvement in the development, ripening and abiotic stress response in banana.

    PubMed

    Hu, Wei; Wang, Lianzhe; Tie, Weiwei; Yan, Yan; Ding, Zehong; Liu, Juhua; Li, Meiying; Peng, Ming; Xu, Biyu; Jin, Zhiqiang

    2016-01-01

    The leucine zipper (bZIP) transcription factors play important roles in multiple biological processes. However, less information is available regarding the bZIP family in the important fruit crop banana. In this study, 121 bZIP transcription factor genes were identified in the banana genome. Phylogenetic analysis showed that MabZIPs were classified into 11 subfamilies. The majority of MabZIP genes in the same subfamily shared similar gene structures and conserved motifs. The comprehensive transcriptome analysis of two banana genotypes revealed the differential expression patterns of MabZIP genes in different organs, in various stages of fruit development and ripening, and in responses to abiotic stresses, including drought, cold, and salt. Interaction networks and co-expression assays showed that group A MabZIP-mediated networks participated in various stress signaling, which was strongly activated in Musa ABB Pisang Awak. This study provided new insights into the complicated transcriptional control of MabZIP genes and provided robust tissue-specific, development-dependent, and abiotic stress-responsive candidate MabZIP genes for potential applications in the genetic improvement of banana cultivars. PMID:27445085

  12. Genome-wide survey of Aux/IAA gene family members in potato (Solanum tuberosum): Identification, expression analysis, and evaluation of their roles in tuber development.

    PubMed

    Gao, Junpeng; Cao, Xiaoli; Shi, Shandang; Ma, Yuling; Wang, Kai; Liu, Shengjie; Chen, Dan; Chen, Qin; Ma, Haoli

    2016-03-01

    The Auxin/indole-3-acetic acid (Aux/IAA) genes encode short-lived nuclear proteins that are known to be involved in the primary cellular responses to auxin. To date, systematic analysis of the Aux/IAA genes in potato (Solanum tuberosum) has not been conducted. In this study, a total of 26 potato Aux/IAA genes were identified (designated from StIAA1 to StIAA26), and the distribution of four conserved domains shared by the StIAAs were analyzed based on multiple sequence alignment and a motif-based sequence analysis. A phylogenetic analysis of the Aux/IAA gene families of potato and Arabidopsis was also conducted. In order to assess the roles of StIAA genes in tuber development, the results of RNA-seq studies were reformatted to analyze the expression patterns of StIAA genes, and then verified by quantitative real-time PCR. A large number of StIAA genes (12 genes) were highly expressed in stolon organs and in during the tuber initiation and expansion developmental stages, and most of these genes were responsive to indoleacetic acid treatment. Our results suggested that StIAA genes were involved in the process of tuber development and provided insights into functional roles of potato Aux/IAA genes.

  13. Genome-Wide Identification, Phylogeny, and Expression Analyses of the 14-3-3 Family Reveal Their Involvement in the Development, Ripening, and Abiotic Stress Response in Banana

    PubMed Central

    Li, Meiying; Ren, Licheng; Xu, Biyu; Yang, Xiaoliang; Xia, Qiyu; He, Pingping; Xiao, Susheng; Guo, Anping; Hu, Wei; Jin, Zhiqiang

    2016-01-01

    Plant 14-3-3 proteins act as critical components of various cellular signaling processes and play an important role in regulating multiple physiological processes. However, less information is known about the 14-3-3 gene family in banana. In this study, 25 14-3-3 genes were identified from the banana genome. Based on the evolutionary analysis, banana 14-3-3 proteins were clustered into ε and non-ε groups. Conserved motif analysis showed that all identified banana 14-3-3 genes had the typical 14-3-3 motif. The gene structure of banana 14-3-3 genes showed distinct class-specific divergence between the ε group and the non-ε group. Most banana 14-3-3 genes showed strong transcript accumulation changes during fruit development and postharvest ripening in two banana varieties, indicating that they might be involved in regulating fruit development and ripening. Moreover, some 14-3-3 genes also showed great changes after osmotic, cold, and salt treatments in two banana varieties, suggested their potential role in regulating banana response to abiotic stress. Taken together, this systemic analysis reveals the involvement of banana 14-3-3 genes in fruit development, postharvest ripening, and response to abiotic stress and provides useful information for understanding the functions of 14-3-3 genes in banana. PMID:27713761

  14. Genome-Wide Analyses of a Plant-Specific LIM-Domain Gene Family Implicate Its Evolutionary Role in Plant Diversification

    PubMed Central

    Zhao, Man; He, Lingli; Gu, Yongzhe; Wang, Yan; Chen, Qingshan; He, Chaoying

    2014-01-01

    The Arabidopsis DA1 genes appear to have multiple functions in regulating organ size and abiotic stress response, but the biological roles of its closely related genes remain unknown. Evolutionary analyses might provide some clues to aid in an understanding of their functional diversification. In this work, we characterized the molecular evolution and expressional diversification of DA1-like genes. Surveying 354 sequenced genomes revealed 142 DA1-like genes only in plants, indicating plant-specificity of these genes. The DA1-like protein modular structure was composed of two UIMs (ubiquitin interaction motifs), one LIM-domain (from lin-11, isl-1, and mec-3), and a conserved C-terminal, and was distinguishable from the already defined three groups of LIM-domain proteins. We further found that the DA1-like genes diverged into Classes I and II at the ancestor of seed plants and acquired 13 clade-specific residues during their evolutionary history. Moreover, diverse intron size evolution was noted following the transition from size-expandable introns to minimal ones, accompanying the emergence and diversification of angiosperms. Functional diversification as it relates to gene expression was further investigated in soybean. Glycine max DA1 genes showed diverse tissues expression patterns during development and had substantially varied abiotic stress response expression. Thus, variations in the coding regions, intron size, and gene expression contributed to the functional diversification of this gene family. Our data suggest that the evolution of the DA1-like genes facilitated the development of diverse molecular and functional diversification patterns to accompany the successful radiation of plants into diverse environments during evolution. PMID:24723730

  15. Genome-wide analysis of the CaHsp20 gene family in pepper: comprehensive sequence and expression profile analysis under heat stress

    PubMed Central

    Guo, Meng; Liu, Jin-Hong; Lu, Jin-Ping; Zhai, Yu-Fei; Wang, Hu; Gong, Zhen-Hui; Wang, Shu-Bin; Lu, Ming-Hui

    2015-01-01

    The Hsp20 genes are present in all plant species and play important roles in alleviating heat stress and enhancing plant thermotolerance by preventing the irreversible aggregation of denaturing proteins. However, very little is known about the CaHsp20 gene family in pepper (Capsicum annuum L.), an important vegetable crop with character of temperate but thermosensitive. In this study, a total of 35 putative pepper Hsp20 genes (CaHsp20s) were identified and renamed on the basis of their molecular weight, and then their gene structure, genome location, gene duplication, phylogenetic relationship, and interaction network were also analyzed. The expression patterns of CaHsp20 genes in four different tissues (root, stem, leaf, and flower) from the thermotolerant line R9 under heat stress condition were measured using semi-quantitative RT-PCR. The transcripts of most CaHsp20 genes maintained a low level in all of the four tissues under normal temperature condition, but were highly induced by heat stress, while the expression of CaHsp16.6b, 16.7, and 23.8 were only detected in specific tissues and were not so sensitive to heat stress like other CaHsp20 genes. In addition, compared to those in thermotolerant line R9, the expression peak of most CaHsp20 genes in thermosensitive line B6 under heat stress was hysteretic, and several CaHsp20 genes (CaHsp16.4, 18.2a, 18.7, 21.2, 22.0, 25.8, and 25.9) showed higher expression levels in both line B6 and R9. These data suggest that the CaHsp20 genes may be involved in heat stress and defense responses in pepper, which provides the basis for further functional analyses of CaHsp20s in the formation of pepper acquired thermotoleance. PMID:26483820

  16. Genome-wide identification, classification and analysis of HD-ZIP gene family in citrus, and its potential roles in somatic embryogenesis regulation.

    PubMed

    Ge, Xiao-Xia; Liu, Zheng; Wu, Xiao-Meng; Chai, Li-Jun; Guo, Wen-Wu

    2015-12-10

    The homeodomain-leucine zipper (HD-Zip) transcription factors, which belong to a class of Homeobox proteins, has been reported to be involved in different biological processes of plants, including growth and development, photomorphogenesis, flowering, fruit ripening and adaptation responses to environmental stresses. In this study, 27 HD-Zip genes (CsHBs) were identified in Citrus. Based on the phylogenetic analysis and characteristics of individual gene or protein, the HD-Zip gene family in Citrus can be classified into 4 subfamilies, i.e. HD-Zip I, HD-Zip II, HD-Zip III, and HD-Zip IV containing 16, 2, 4, and 5 members respectively. The digital expression patterns of 27 HD-Zip genes were analyzed in the callus, flower, leaf and fruit of Citrus sinensis. The qRT-PCR and RT-PCR analyses of six selected HD-Zip genes were performed in six citrus cultivars with different embryogenic competence and in the embryo induction stages, which revealed that these genes were differentially expressed and might be involved in citrus somatic embryogenesis (SE). The results exhibited that the expression of CsHB1 was up-regulated in somatic embryo induction process, and its expression was higher in citrus cultivars with high embryogenic capacity than in cultivars recalcitrant to form somatic embryos. Moreover, a microsatellite site of three nucleotide repeats was found in CsHB1 gene among eighteen citrus genotypes, indicating the possible association of CsHB1 gene to the capacity of callus induction.

  17. Genome-wide analysis of AP2/ERF family genes from Lotus corniculatus shows LcERF054 enhances salt tolerance.

    PubMed

    Sun, Zhan-Min; Zhou, Mei-Liang; Xiao, Xing-Guo; Tang, Yi-Xiong; Wu, Yan-Min

    2014-09-01

    Lotus corniculatus is used in agriculture as a main forage plant. Members of the Apetala2/ethylene response factor (AP2/ERF) family play important roles in regulating gene expression in response to many forms of stress, including drought and salt. Here, starting from database of the L. corniculatus var. japonicus genome, we identified 127 AP2/ERF genes by insilico cloning method. The phylogeny, gene structures, and putative conserved motifs in L. corniculatus var. japonicus ERF proteins were analyzed. Based on the number of AP2/ERF domains and the function of the genes, 127 AP2/ERF genes from L. corniculatus var. japonicus were classified into five subfamilies named the AP2, dehydration-responsive element binding factor (DREB), ERF, RAV, and a soloist. Outside the AP2/ERF domain, many L. corniculatus var. japonicus-specific conserved motifs were detected. Expression profile analysis of AP2/ERF genes by quantitative real-time PCR revealed that 19 LcERF genes, including LcERF054 (KJ004728), were significantly induced by salt stress. The results showed that the LcERF054 gene encodes a nuclear transcription activator. Overexpression of LcERF054 in Arabidopsis enhanced the tolerances to salt stress, showed higher germination ratio of seeds, and had elevated levels of relative moisture contents, soluble sugars, proline, and lower levels of malondialdehyde under stress conditions compared to wild-type plants. The expression of hyperosmotic salinity response genes COR15A, LEA4-5, P5CS1, and RD29A was found to be elevated in the LcERF054-overexpressing Arabidopsis plants compared to wild type. These results revealed that the LcERF genes play important roles in L. corniculatus cv Leo under salt stress and that LcERFs are attractive engineering targets in applied efforts to improve abiotic stress tolerances in L. corniculatus cv Leo or other crops. PMID:24777608

  18. Genome-wide analysis of the bHLH gene family in planarians identifies factors required for adult neurogenesis and neuronal regeneration.

    PubMed

    Cowles, Martis W; Brown, David D R; Nisperos, Sean V; Stanley, Brianna N; Pearson, Bret J; Zayas, Ricardo M

    2013-12-01

    In contrast to most well-studied model organisms, planarians have a remarkable ability to completely regenerate a functional nervous system from a pluripotent stem cell population. Thus, planarians provide a powerful model to identify genes required for adult neurogenesis in vivo. We analyzed the basic helix-loop-helix (bHLH) family of transcription factors, many of which are crucial for nervous system development and have been implicated in human diseases. However, their potential roles in adult neurogenesis or central nervous system (CNS) function are not well understood. We identified 44 planarian bHLH homologs, determined their patterns of expression in the animal and assessed their functions using RNAi. We found nine bHLHs expressed in stem cells and neurons that are required for CNS regeneration. Our analyses revealed that homologs of coe, hes (hesl-3) and sim label progenitors in intact planarians, and following amputation we observed an enrichment of coe(+) and sim(+) progenitors near the wound site. RNAi knockdown of coe, hesl-3 or sim led to defects in CNS regeneration, including failure of the cephalic ganglia to properly pattern and a loss of expression of distinct neuronal subtype markers. Together, these data indicate that coe, hesl-3 and sim label neural progenitor cells, which serve to generate new neurons in uninjured or regenerating animals. Our study demonstrates that this model will be useful to investigate how stem cells interpret and respond to genetic and environmental cues in the CNS and to examine the role of bHLH transcription factors in adult tissue regeneration.

  19. Genome-Wide Classification and Evolutionary Analysis of the bHLH Family of Transcription Factors in Arabidopsis, Poplar, Rice, Moss, and Algae1[W

    PubMed Central

    Carretero-Paulet, Lorenzo; Galstyan, Anahit; Roig-Villanova, Irma; Martínez-García, Jaime F.; Bilbao-Castro, Jose R.; Robertson, David L.

    2010-01-01

    Basic helix-loop-helix proteins (bHLHs) are found throughout the three eukaryotic kingdoms and constitute one of the largest families of transcription factors. A growing number of bHLH proteins have been functionally characterized in plants. However, some of these have not been previously classified. We present here an updated and comprehensive classification of the bHLHs encoded by the whole sequenced genomes of Arabidopsis (Arabidopsis thaliana), Populus trichocarpa, Oryza sativa, Physcomitrella patens, and five algae species. We define a plant bHLH consensus motif, which allowed the identification of novel highly diverged atypical bHLHs. Using yeast two-hybrid assays, we confirm that (1) a highly diverged bHLH has retained protein interaction activity and (2) the two most conserved positions in the consensus play an essential role in dimerization. Phylogenetic analysis permitted classification of the 638 bHLH genes identified into 32 subfamilies. Evolutionary and functional relationships within subfamilies are supported by intron patterns, predicted DNA-binding motifs, and the architecture of conserved protein motifs. Our analyses reveal the origin and evolutionary diversification of plant bHLHs through differential expansions, domain shuffling, and extensive sequence divergence. At the functional level, this would translate into different subfamilies evolving specific DNA-binding and protein interaction activities as well as differential transcriptional regulatory roles. Our results suggest a role for bHLH proteins in generating plant phenotypic diversity and provide a solid framework for further investigations into the role carried out in the transcriptional regulation of key growth and developmental processes. PMID:20472752

  20. The auxin response factor gene family in banana: genome-wide identification and expression analyses during development, ripening, and abiotic stress.

    PubMed

    Hu, Wei; Zuo, Jiao; Hou, Xiaowan; Yan, Yan; Wei, Yunxie; Liu, Juhua; Li, Meiying; Xu, Biyu; Jin, Zhiqiang

    2015-01-01

    Auxin signaling regulates various auxin-responsive genes via two types of transcriptional regulators, Auxin Response Factors (ARF) and Aux/IAA. ARF transcription factors act as critical components of auxin signaling that play important roles in modulating various biological processes. However, limited information about this gene family in fruit crops is currently available. Herein, 47 ARF genes were identified in banana based on its genome sequence. Phylogenetic analysis of the ARFs from banana, rice, and Arabidopsis suggested that the ARFs could be divided into four subgroups, among which most ARFs from the banana showed a closer relationship with those from rice than those from Arabidopsis. Conserved motif analysis showed that all identified MaARFs had typical DNA-binding and ARF domains, but 12 members lacked the dimerization domain. Gene structure analysis showed that the number of exons in MaARF genes ranged from 5 to 21, suggesting large variation amongst banana ARF genes. The comprehensive expression profiles of MaARF genes yielded useful information about their involvement in diverse tissues, different stages of fruit development and ripening, and responses to abiotic stresses in different varieties. Interaction networks and co-expression assays indicated the strong transcriptional response of banana ARFs and ARF-mediated networks in early fruit development for different varieties. Our systematic analysis of MaARFs revealed robust tissue-specific, development-dependent, and abiotic stress-responsive candidate MaARF genes for further functional assays in planta. These findings could lead to potential applications in the genetic improvement of banana cultivars, and yield new insights into the complexity of the control of MaARF gene expression at the transcriptional level. Finally, they support the hypothesis that ARFs are a crucial component of the auxin signaling pathway, which regulates a wide range of physiological processes. PMID:26442055

  1. Genome-wide analysis of auxin response factor (ARF) gene family from tomato and analysis of their role in flower and fruit development.

    PubMed

    Kumar, Rahul; Tyagi, Akhilesh K; Sharma, Arun K

    2011-03-01

    Auxin response transcription factors have been widely implicated in auxin-mediated responses during various developmental processes ranging from root and shoot development to flower and fruit development in plants. In order to use them for improvement of agronomic traits related to fruit, we need to have better understanding of their role during fruit development. In this study, 17 SlARF genes have been identified from tomato (Solanum lycopersicum), using various publically available tomato EST databases. Phylogenetic analysis of the 23 AtARF and 17 SlARF proteins results in formation of three major classes and a total of 14 sister pairs, including seven SlARF-AtARF, four SlARF-SlARF and three AtARF-AtARF sister pairs, providing insights into various orthologous relationships between AtARFs and SlARFs. Further, search for orthologs of these SlARFs resulted in identification of nine, ten, four and three ARF genes from potato, tobacco, N. benthemiana and pepper, respectively. A phylogenetic analysis of these genes, along with their orthologs from Solanaceae species, suggests the presence of a common set of the ARF genes in this family. Comparison of the expression of these SlARF genes in wild type and rin mutant provides an insight into their role during different stages of flower and fruit development. This study suggests that ARF genes may play diverse role during flower and fruit development. Comprehensive data generated here will provide a platform for identification of ARF genes and elucidation of their function during reproductive development stages in Solanaceae in general and fruit development in tomato, in particular.

  2. Genome-Wide Survey and Expression Profile Analysis of the Mitogen-Activated Protein Kinase (MAPK) Gene Family in Brassica rapa

    PubMed Central

    Yu, Hao; Qu, Cunmin; Tang, Zhanglin; Li, Jiana; Chai, Yourong; Liang, Ying

    2015-01-01

    Mitogen-activated protein kinase (MAPK) cascades are fundamental signal transduction modules in plants, controlling cell division, development, hormone signaling, and biotic and abiotic stress responses. Although MAPKs have been investigated in several plant species, a comprehensive analysis of the MAPK gene family has hitherto not been performed in Brassica rapa. In this study, we identified 32 MAPKs in the B. rapa genome by conducting BLASTP and syntenic block analyses, and screening for the essential signature motif (TDY or TEY) of plant MAPK proteins. Of the 32 BraMAPK genes retrieved from the Brassica Database, 13 exhibited exon splicing errors, excessive splicing of the 5' sequence, excessive retention of the 5' sequence, and sequencing errors of the 3' end. Phylogenetic trees of the 32 corrected MAPKs from B. rapa and of MAPKs from other plants generated by the neighbor-joining and maximum likelihood methods suggested that BraMAPKs could be divided into four groups (groups A, B, C, and D). Gene number expansion was observed for BraMAPK genes in groups A and D, which may have been caused by the tandem duplication and genome triplication of the ancestral genome of the Brassica progenitor. Except for five members of the BraMAPK10 subfamily, the identified BraMAPKs were expressed in most of the tissues examined, including callus, root, stem, leaf, flower, and silique. Quantitative real-time PCR demonstrated that at least six and five BraMAPKs were induced or repressed by various abiotic stresses and hormone treatments, respectively, suggesting their potential roles in the abiotic stress response and various hormone signal transduction pathways in B. rapa. This study provides valuable insight into the putative physiological and biochemical functions of MAPK genes in B. rapa. PMID:26173020

  3. The auxin response factor gene family in banana: genome-wide identification and expression analyses during development, ripening, and abiotic stress

    PubMed Central

    Hu, Wei; Zuo, Jiao; Hou, Xiaowan; Yan, Yan; Wei, Yunxie; Liu, Juhua; Li, Meiying; Xu, Biyu; Jin, Zhiqiang

    2015-01-01

    Auxin signaling regulates various auxin-responsive genes via two types of transcriptional regulators, Auxin Response Factors (ARF) and Aux/IAA. ARF transcription factors act as critical components of auxin signaling that play important roles in modulating various biological processes. However, limited information about this gene family in fruit crops is currently available. Herein, 47 ARF genes were identified in banana based on its genome sequence. Phylogenetic analysis of the ARFs from banana, rice, and Arabidopsis suggested that the ARFs could be divided into four subgroups, among which most ARFs from the banana showed a closer relationship with those from rice than those from Arabidopsis. Conserved motif analysis showed that all identified MaARFs had typical DNA-binding and ARF domains, but 12 members lacked the dimerization domain. Gene structure analysis showed that the number of exons in MaARF genes ranged from 5 to 21, suggesting large variation amongst banana ARF genes. The comprehensive expression profiles of MaARF genes yielded useful information about their involvement in diverse tissues, different stages of fruit development and ripening, and responses to abiotic stresses in different varieties. Interaction networks and co-expression assays indicated the strong transcriptional response of banana ARFs and ARF-mediated networks in early fruit development for different varieties. Our systematic analysis of MaARFs revealed robust tissue-specific, development-dependent, and abiotic stress-responsive candidate MaARF genes for further functional assays in planta. These findings could lead to potential applications in the genetic improvement of banana cultivars, and yield new insights into the complexity of the control of MaARF gene expression at the transcriptional level. Finally, they support the hypothesis that ARFs are a crucial component of the auxin signaling pathway, which regulates a wide range of physiological processes. PMID:26442055

  4. Genome-Wide Survey and Expression Profile Analysis of the Mitogen-Activated Protein Kinase (MAPK) Gene Family in Brassica rapa.

    PubMed

    Lu, Kun; Guo, Wenjin; Lu, Junxing; Yu, Hao; Qu, Cunmin; Tang, Zhanglin; Li, Jiana; Chai, Yourong; Liang, Ying

    2015-01-01

    Mitogen-activated protein kinase (MAPK) cascades are fundamental signal transduction modules in plants, controlling cell division, development, hormone signaling, and biotic and abiotic stress responses. Although MAPKs have been investigated in several plant species, a comprehensive analysis of the MAPK gene family has hitherto not been performed in Brassica rapa. In this study, we identified 32 MAPKs in the B. rapa genome by conducting BLASTP and syntenic block analyses, and screening for the essential signature motif (TDY or TEY) of plant MAPK proteins. Of the 32 BraMAPK genes retrieved from the Brassica Database, 13 exhibited exon splicing errors, excessive splicing of the 5' sequence, excessive retention of the 5' sequence, and sequencing errors of the 3' end. Phylogenetic trees of the 32 corrected MAPKs from B. rapa and of MAPKs from other plants generated by the neighbor-joining and maximum likelihood methods suggested that BraMAPKs could be divided into four groups (groups A, B, C, and D). Gene number expansion was observed for BraMAPK genes in groups A and D, which may have been caused by the tandem duplication and genome triplication of the ancestral genome of the Brassica progenitor. Except for five members of the BraMAPK10 subfamily, the identified BraMAPKs were expressed in most of the tissues examined, including callus, root, stem, leaf, flower, and silique. Quantitative real-time PCR demonstrated that at least six and five BraMAPKs were induced or repressed by various abiotic stresses and hormone treatments, respectively, suggesting their potential roles in the abiotic stress response and various hormone signal transduction pathways in B. rapa. This study provides valuable insight into the putative physiological and biochemical functions of MAPK genes in B. rapa.

  5. Genome-wide analysis of AP2/ERF family genes from Lotus corniculatus shows LcERF054 enhances salt tolerance.

    PubMed

    Sun, Zhan-Min; Zhou, Mei-Liang; Xiao, Xing-Guo; Tang, Yi-Xiong; Wu, Yan-Min

    2014-09-01

    Lotus corniculatus is used in agriculture as a main forage plant. Members of the Apetala2/ethylene response factor (AP2/ERF) family play important roles in regulating gene expression in response to many forms of stress, including drought and salt. Here, starting from database of the L. corniculatus var. japonicus genome, we identified 127 AP2/ERF genes by insilico cloning method. The phylogeny, gene structures, and putative conserved motifs in L. corniculatus var. japonicus ERF proteins were analyzed. Based on the number of AP2/ERF domains and the function of the genes, 127 AP2/ERF genes from L. corniculatus var. japonicus were classified into five subfamilies named the AP2, dehydration-responsive element binding factor (DREB), ERF, RAV, and a soloist. Outside the AP2/ERF domain, many L. corniculatus var. japonicus-specific conserved motifs were detected. Expression profile analysis of AP2/ERF genes by quantitative real-time PCR revealed that 19 LcERF genes, including LcERF054 (KJ004728), were significantly induced by salt stress. The results showed that the LcERF054 gene encodes a nuclear transcription activator. Overexpression of LcERF054 in Arabidopsis enhanced the tolerances to salt stress, showed higher germination ratio of seeds, and had elevated levels of relative moisture contents, soluble sugars, proline, and lower levels of malondialdehyde under stress conditions compared to wild-type plants. The expression of hyperosmotic salinity response genes COR15A, LEA4-5, P5CS1, and RD29A was found to be elevated in the LcERF054-overexpressing Arabidopsis plants compared to wild type. These results revealed that the LcERF genes play important roles in L. corniculatus cv Leo under salt stress and that LcERFs are attractive engineering targets in applied efforts to improve abiotic stress tolerances in L. corniculatus cv Leo or other crops.

  6. Genome-wide identification, classification and analysis of HD-ZIP gene family in citrus, and its potential roles in somatic embryogenesis regulation.

    PubMed

    Ge, Xiao-Xia; Liu, Zheng; Wu, Xiao-Meng; Chai, Li-Jun; Guo, Wen-Wu

    2015-12-10

    The homeodomain-leucine zipper (HD-Zip) transcription factors, which belong to a class of Homeobox proteins, has been reported to be involved in different biological processes of plants, including growth and development, photomorphogenesis, flowering, fruit ripening and adaptation responses to environmental stresses. In this study, 27 HD-Zip genes (CsHBs) were identified in Citrus. Based on the phylogenetic analysis and characteristics of individual gene or protein, the HD-Zip gene family in Citrus can be classified into 4 subfamilies, i.e. HD-Zip I, HD-Zip II, HD-Zip III, and HD-Zip IV containing 16, 2, 4, and 5 members respectively. The digital expression patterns of 27 HD-Zip genes were analyzed in the callus, flower, leaf and fruit of Citrus sinensis. The qRT-PCR and RT-PCR analyses of six selected HD-Zip genes were performed in six citrus cultivars with different embryogenic competence and in the embryo induction stages, which revealed that these genes were differentially expressed and might be involved in citrus somatic embryogenesis (SE). The results exhibited that the expression of CsHB1 was up-regulated in somatic embryo induction process, and its expression was higher in citrus cultivars with high embryogenic capacity than in cultivars recalcitrant to form somatic embryos. Moreover, a microsatellite site of three nucleotide repeats was found in CsHB1 gene among eighteen citrus genotypes, indicating the possible association of CsHB1 gene to the capacity of callus induction. PMID:26232336

  7. Genome-Wide Survey and Expression Profile Analysis of the Mitogen-Activated Protein Kinase (MAPK) Gene Family in Brassica rapa.

    PubMed

    Lu, Kun; Guo, Wenjin; Lu, Junxing; Yu, Hao; Qu, Cunmin; Tang, Zhanglin; Li, Jiana; Chai, Yourong; Liang, Ying

    2015-01-01

    Mitogen-activated protein kinase (MAPK) cascades are fundamental signal transduction modules in plants, controlling cell division, development, hormone signaling, and biotic and abiotic stress responses. Although MAPKs have been investigated in several plant species, a comprehensive analysis of the MAPK gene family has hitherto not been performed in Brassica rapa. In this study, we identified 32 MAPKs in the B. rapa genome by conducting BLASTP and syntenic block analyses, and screening for the essential signature motif (TDY or TEY) of plant MAPK proteins. Of the 32 BraMAPK genes retrieved from the Brassica Database, 13 exhibited exon splicing errors, excessive splicing of the 5' sequence, excessive retention of the 5' sequence, and sequencing errors of the 3' end. Phylogenetic trees of the 32 corrected MAPKs from B. rapa and of MAPKs from other plants generated by the neighbor-joining and maximum likelihood methods suggested that BraMAPKs could be divided into four groups (groups A, B, C, and D). Gene number expansion was observed for BraMAPK genes in groups A and D, which may have been caused by the tandem duplication and genome triplication of the ancestral genome of the Brassica progenitor. Except for five members of the BraMAPK10 subfamily, the identified BraMAPKs were expressed in most of the tissues examined, including callus, root, stem, leaf, flower, and silique. Quantitative real-time PCR demonstrated that at least six and five BraMAPKs were induced or repressed by various abiotic stresses and hormone treatments, respectively, suggesting their potential roles in the abiotic stress response and various hormone signal transduction pathways in B. rapa. This study provides valuable insight into the putative physiological and biochemical functions of MAPK genes in B. rapa. PMID:26173020

  8. Foxtail Millet NF-Y Families: Genome-Wide Survey and Evolution Analyses Identified Two Functional Genes Important in Abiotic Stresses.

    PubMed

    Feng, Zhi-Juan; He, Guan-Hua; Zheng, Wei-Jun; Lu, Pan-Pan; Chen, Ming; Gong, Ya-Ming; Ma, You-Zhi; Xu, Zhao-Shi

    2015-01-01

    It was reported that Nuclear Factor Y (NF-Y) genes were involved in abiotic stress in plants. Foxtail millet (Setaria italica), an elite stress tolerant crop, provided an impetus for the investigation of the NF-Y families in abiotic responses. In the present study, a total of 39 NF-Y genes were identified in foxtail millet. Synteny analyses suggested that foxtail millet NF-Y genes had experienced rapid expansion and strong purifying selection during the process of plant evolution. De novo transcriptome assembly of foxtail millet revealed 11 drought up-regulated NF-Y genes. SiNF-YA1 and SiNF-YB8 were highly activated in leaves and/or roots by drought and salt stresses. Abscisic acid (ABA) and H2O2 played positive roles in the induction of SiNF-YA1 and SiNF-YB8 under stress treatments. Transient luciferase (LUC) expression assays revealed that SiNF-YA1 and SiNF-YB8 could activate the LUC gene driven by the tobacco (Nicotiana tobacam) NtERD10, NtLEA5, NtCAT, NtSOD, or NtPOD promoter under normal or stress conditions. Overexpression of SiNF-YA1 enhanced drought and salt tolerance by activating stress-related genes NtERD10 and NtCAT1 and by maintaining relatively stable relative water content (RWC) and contents of chlorophyll, superoxide dismutase (SOD), peroxidase (POD), catalase (CAT) and malondialdehyde (MDA) in transgenic lines under stresses. SiNF-YB8 regulated expression of NtSOD, NtPOD, NtLEA5, and NtERD10 and conferred relatively high RWC and chlorophyll contents and low MDA content, resulting in drought and osmotic tolerance in transgenic lines under stresses. Therefore, SiNF-YA1 and SiNF-YB8 could activate stress-related genes and improve physiological traits, resulting in tolerance to abiotic stresses in plants. All these results will facilitate functional characterization of foxtail millet NF-Ys in future studies. PMID:26734043

  9. Foxtail Millet NF-Y Families: Genome-Wide Survey and Evolution Analyses Identified Two Functional Genes Important in Abiotic Stresses

    PubMed Central

    Feng, Zhi-Juan; He, Guan-Hua; Zheng, Wei-Jun; Lu, Pan-Pan; Chen, Ming; Gong, Ya-Ming; Ma, You-Zhi; Xu, Zhao-Shi

    2015-01-01

    It was reported that Nuclear Factor Y (NF-Y) genes were involved in abiotic stress in plants. Foxtail millet (Setaria italica), an elite stress tolerant crop, provided an impetus for the investigation of the NF-Y families in abiotic responses. In the present study, a total of 39 NF-Y genes were identified in foxtail millet. Synteny analyses suggested that foxtail millet NF-Y genes had experienced rapid expansion and strong purifying selection during the process of plant evolution. De novo transcriptome assembly of foxtail millet revealed 11 drought up-regulated NF-Y genes. SiNF-YA1 and SiNF-YB8 were highly activated in leaves and/or roots by drought and salt stresses. Abscisic acid (ABA) and H2O2 played positive roles in the induction of SiNF-YA1 and SiNF-YB8 under stress treatments. Transient luciferase (LUC) expression assays revealed that SiNF-YA1 and SiNF-YB8 could activate the LUC gene driven by the tobacco (Nicotiana tobacam) NtERD10, NtLEA5, NtCAT, NtSOD, or NtPOD promoter under normal or stress conditions. Overexpression of SiNF-YA1 enhanced drought and salt tolerance by activating stress-related genes NtERD10 and NtCAT1 and by maintaining relatively stable relative water content (RWC) and contents of chlorophyll, superoxide dismutase (SOD), peroxidase (POD), catalase (CAT) and malondialdehyde (MDA) in transgenic lines under stresses. SiNF-YB8 regulated expression of NtSOD, NtPOD, NtLEA5, and NtERD10 and conferred relatively high RWC and chlorophyll contents and low MDA content, resulting in drought and osmotic tolerance in transgenic lines under stresses. Therefore, SiNF-YA1 and SiNF-YB8 could activate stress-related genes and improve physiological traits, resulting in tolerance to abiotic stresses in plants. All these results will facilitate functional characterization of foxtail millet NF-Ys in future studies. PMID:26734043

  10. Genome-wide identification of members in the YTH domain-containing RNA-binding protein family in apple and expression analysis of their responsiveness to senescence and abiotic stresses.

    PubMed

    Wang, Na; Yue, Zhiyong; Liang, Dong; Ma, Fengwang

    2014-04-01

    YT521-homology (YTH) domain-containing RNA-binding proteins (YTPs) are a small gene family involved in post-transcriptional regulation. We identified 26 putative YTP gene models in the apple genome. Although plant YTPs have been classified into three groups, those in multi-cellular organisms belong only to Groups A and B. The apple genome contains 22 YTP gene models in Group A and four in Group B. Duplication analysis showed that tandem and segmental duplications contributed only partially to an expansion in apple YTP numbers. YTH was the only recognizable domain in apple YTPs; its three-dimensional structure implied possible motifs for RNA-binding. After the assembly of expressed sequence tags (ESTs) and gene-cloning, we were able to identify 14 apple YTPs that were expressed in various tissues, especially senescing leaves. Expression analysis showed that these YTPs also responded to several abiotic stresses. Taken together, our genome-wide evaluation provides new insight for further research on the effects of those stresses.

  11. Case-Control Genome-Wide Association Study of Attention-Deficit/Hyperactivity Disorder

    ERIC Educational Resources Information Center

    Neale, Benjamin M.; Medland, Sarah; Ripke, Stephan; Anney, Richard J. L.; Asherson, Philip; Buitelaar, Jan; Franke, Barbara; Gill, Michael; Kent, Lindsey; Holmans, Peter; Middleton, Frank; Thapar, Anita; Lesch, Klaus-Peter; Faraone, Stephen V.; Daly, Mark; Nguyen, Thuy Trang; Schafer, Helmut; Steinhausen, Hans-Christoph; Reif, Andreas; Renner, Tobias J.; Romanos, Marcel; Romanos, Jasmin; Warnke, Andreas; Walitza, Susanne; Freitag, Christine; Meyer, Jobst; Palmason, Haukur; Rothenberger, Aribert; Hawi, Ziarih; Sergeant, Joseph; Roeyers, Herbert; Mick, Eric; Biederman, Joseph

    2010-01-01

    Objective: Although twin and family studies have shown attention-deficit/hyperactivity disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. Thus additional genome-wide association studies (GWAS) are needed. Method: We used case-control analyses of 896 cases…

  12. Meta-Analysis of Genome-Wide Association Studies of Attention-Deficit/Hyperactivity Disorder

    ERIC Educational Resources Information Center

    Neale, Benjamin M.; Medland, Sarah E.; Ripke, Stephan; Asherson, Philip; Franke, Barbara; Lesch, Klaus-Peter; Faraone, Stephen V.; Nguyen, Thuy Trang; Schafer, Helmut; Holmans, Peter; Daly, Mark; Steinhausen, Hans-Christoph; Freitag, Christine; Reif, Andreas; Renner, Tobias J.; Romanos, Marcel; Romanos, Jasmin; Walitza, Susanne; Warnke, Andreas; Meyer, Jobst; Palmason, Haukur; Buitelaar, Jan; Vasquez, Alejandro Arias; Lambregts-Rommelse, Nanda; Gill, Michael; Anney, Richard J. L.; Langely, Kate; O'Donovan, Michael; Williams, Nigel; Owen, Michael; Thapar, Anita; Kent, Lindsey; Sergeant, Joseph; Roeyers, Herbert; Mick, Eric; Biederman, Joseph; Doyle, Alysa; Smalley, Susan; Loo, Sandra; Hakonarson, Hakon; Elia, Josephine; Todorov, Alexandre; Miranda, Ana; Mulas, Fernando; Ebstein, Richard P.; Rothenberger, Aribert; Banaschewski, Tobias; Oades, Robert D.; Sonuga-Barke, Edmund; McGough, James; Nisenbaum, Laura; Middleton, Frank; Hu, Xiaolan; Nelson, Stan

    2010-01-01

    Objective: Although twin and family studies have shown attention-deficit/hyperactivity disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. As prior genome-wide association studies (GWAS) have not yielded significant results, we conducted a meta-analysis of…

  13. Genome-wide methylation profiling identifies an essential role of reactive oxygen species in pediatric glioblastoma multiforme and validates a methylome specific for H3 histone family 3A with absence of G-CIMP/isocitrate dehydrogenase 1 mutation

    PubMed Central

    Jha, Prerana; Pia Patric, Irene Rosita; Shukla, Sudhanshu; Pathak, Pankaj; Pal, Jagriti; Sharma, Vikas; Thinagararanjan, Sivaarumugam; Santosh, Vani; Suri, Vaishali; Sharma, Mehar Chand; Arivazhagan, Arimappamagan; Suri, Ashish; Gupta, Deepak; Somasundaram, Kumaravel; Sarkar, Chitra

    2014-01-01

    Background Pediatric glioblastoma multiforme (GBM) is rare, and there is a single study, a seminal discovery showing association of histone H3.3 and isocitrate dehydrogenase (IDH)1 mutation with a DNA methylation signature. The present study aims to validate these findings in an independent cohort of pediatric GBM, compare it with adult GBM, and evaluate the involvement of important functionally altered pathways. Methods Genome-wide methylation profiling of 21 pediatric GBM cases was done and compared with adult GBM data (GSE22867). We performed gene mutation analysis of IDH1 and H3 histone family 3A (H3F3A), status evaluation of glioma cytosine–phosphate–guanine island methylator phenotype (G-CIMP), and Gene Ontology analysis. Experimental evaluation of reactive oxygen species (ROS) association was also done. Results Distinct differences were noted between methylomes of pediatric and adult GBM. Pediatric GBM was characterized by 94 hypermethylated and 1206 hypomethylated cytosine–phosphate–guanine (CpG) islands, with 3 distinct clusters, having a trend to prognostic correlation. Interestingly, none of the pediatric GBM cases showed G-CIMP/IDH1 mutation. Gene Ontology analysis identified ROS association in pediatric GBM, which was experimentally validated. H3F3A mutants (36.4%; all K27M) harbored distinct methylomes and showed enrichment of processes related to neuronal development, differentiation, and cell-fate commitment. Conclusions Our study confirms that pediatric GBM has a distinct methylome compared with that of adults. Presence of distinct clusters and an H3F3A mutation–specific methylome indicate existence of epigenetic subgroups within pediatric GBM. Absence of IDH1/G-CIMP status further indicates that findings in adult GBM cannot be simply extrapolated to pediatric GBM and that there is a strong need for identification of separate prognostic markers. A possible role of ROS in pediatric GBM pathogenesis is demonstrated for the first time and

  14. Genome-wide DNA methylation analysis of Haloferax volcanii H26 and identification of DNA methyltransferase related PD-(D/E)XK nuclease family protein HVO_A0006

    PubMed Central

    Ouellette, Matthew; Jackson, Laura; Chimileski, Scott; Papke, R. Thane

    2015-01-01

    Restriction-modification (RM) systems have evolved to protect the cell from invading DNAs and are composed of two enzymes: a DNA methyltransferase and a restriction endonuclease. Although RM systems are present in both archaeal and bacterial genomes, DNA methylation in archaea has not been well defined. In order to characterize the function of RM systems in archaeal species, we have made use of the model haloarchaeon Haloferax volcanii. A genomic DNA methylation analysis of H. volcanii strain H26 was performed using PacBio single molecule real-time (SMRT) sequencing. This analysis was also performed on a strain of H. volcanii in which an annotated DNA methyltransferase gene HVO_A0006 was deleted from the genome. Sequence analysis of H26 revealed two motifs which are modified in the genome: Cm4TAG and GCAm6BN6VTGC. Analysis of the ΔHVO_A0006 strain indicated that it exhibited reduced adenine methylation compared to the parental strain and altered the detected adenine motif. However, protein domain architecture analysis and amino acid alignments revealed that HVO_A0006 is homologous only to the N-terminal endonuclease region of Type IIG RM proteins and contains a PD-(D/E)XK nuclease motif, suggesting that HVO_A0006 is a PD-(D/E)XK nuclease family protein. Further bioinformatic analysis of the HVO_A0006 gene demonstrated that the gene is rare among the Halobacteria. It is surrounded by two transposition genes suggesting that HVO_A0006 is a fragment of a Type IIG RM gene, which has likely been acquired through gene transfer, and affects restriction-modification activity by interacting with another RM system component(s). Here, we present the first genome-wide characterization of DNA methylation in an archaeal species and examine the function of a DNA methyltransferase related gene HVO_A0006. PMID:25904898

  15. Integrative genome-wide approaches in embryonic stem cell research.

    PubMed

    Zhang, Xinyue; Huang, Jing

    2010-10-01

    Embryonic stem (ES) cells are derived from blastocysts. They can differentiate into the three embryonic germ layers and essentially any type of somatic cells. They therefore hold great potential in tissue regeneration therapy. The ethical issues associated with the use of human embryonic stem cells are resolved by the technical break-through of generating induced pluripotent stem (iPS) cells from various types of somatic cells. However, how ES and iPS cells self-renew and maintain their pluripotency is still largely unknown in spite of the great progress that has been made in the last two decades. Integrative genome-wide approaches, such as the gene expression microarray, chromatin immunoprecipitation based microarray (ChIP-chip) and chromatin immunoprecipitation followed by massive parallel sequencing (ChIP-seq) offer unprecedented opportunities to elucidate the mechanism of the pluripotency, reprogramming and DNA damage response of ES and iPS cells. This frontier article summarizes the fundamental biological questions about ES and iPS cells and reviews the recent advances in ES and iPS cell research using genome-wide technologies. To this end, we offer our perspectives on the future of genome-wide studies on stem cells.

  16. Genome-wide association study identifies common loci influencing circulating glycated hemoglobin (HbA1c) levels in non-diabetic subjects: the Long Life Family Study (LLFS)

    PubMed Central

    An, Ping; Miljkovic, Iva; Thyagarajan, Bharat; Kraja, Aldi T.; Daw, E. Warwick; Pankow, James S.; Selvin, Elizabeth; Kao, W. H. Linda; Maruthur, Nisa M.; Nalls, Micahel A.; Liu, Yongmei; Harris, Tamara B.; Lee, Joseph H.; Borecki, Ingrid B.; Christensen, Kaare; Eckfeldt, John H.; Mayeux, Richard; Perls, Thomas T.; Newman, Anne B.; Province, Michael A.

    2014-01-01

    Objective Glycated hemoglobin (HbA1c) is a stable index of chronic glycemic status and hyperglycemia associated with progressive development of insulin resistance and frank diabetes. It is also associated with premature aging and increased mortality. To uncover novel loci for HbA1c that are associated with healthy aging, we conducted a genome-wide association study (GWAS) using non-diabetic participants in the Long Life Family Study (LLFS), a study with familial clustering of exceptional longevity in the US and Denmark. Methods A total of 4,088 non-diabetic subjects from the LLFS were used for GWAS discoveries, and a total of 8,231 non-diabetic subjects from the Atherosclerosis Risk in Communities Study (ARIC, in the MAGIC Consortium) and the Health, Aging, and Body Composition Study (HABC) were used for GWAS replications. HbA1c was adjusted for age, sex, centers, 20 principal components, without and with BMI. A linear mixed effects model was used for association testing. Results Two known loci at GCK rs730497 (or rs2908282) and HK1 rs17476364 were confirmed (p < 5e–8). Of 25 suggestive (5e–8 < p < 1e–5) loci, one known (G6PC2 rs560887, replication p = 5e–5) and one novel (OR10R3P/SPTA1- rs12041363, replication p = 1e–17) loci were replicated (p < 0.0019). Similar findings resulted when HbA1c was further adjusted for BMI. Further validations are crucial for the remaining suggestive loci including the emerged variant near OR10R3P/SPTA1. Conclusions The analysis reconfirmed two known GWAS loci (GCK, HK1) and identified 25 suggestive loci including one reconfirmed variant in G6PC2 and one replicated variant near OR10R3P/SPTA1. Future focused survey of sequence elements containing mainly functional and regulatory variants may yield additional findings. PMID:24405752

  17. Genome-wide analysis of DNA methylation in hepatoblastoma tissues

    PubMed Central

    Cui, Ximao; Liu, Baihui; Zheng, Shan; Dong, Kuiran; Dong, Rui

    2016-01-01

    DNA methylation has a crucial role in cancer biology. In the present study, a genome-wide analysis of DNA methylation in hepatoblastoma (HB) tissues was performed to verify differential methylation levels between HB and normal tissues. As alpha-fetoprotein (AFP) has a critical role in HB, AFP methylation levels were also detected using pyrosequencing. Normal and HB liver tissue samples (frozen tissue) were obtained from patients with HB. Genome-wide analysis of DNA methylation in these tissues was performed using an Infinium HumanMethylation450 BeadChip, and the results were confirmed with reverse transcription-quantitative polymerase chain reaction. The Infinium HumanMethylation450 BeadChip demonstrated distinctively less methylation in HB tissues than in non-tumor tissues. In addition, methylation enrichment was observed in positions near the transcription start site of AFP, which exhibited lower methylation levels in HB tissues than in non-tumor liver tissues. Lastly, a significant negative correlation was observed between AFP messenger RNA expression and DNA methylation percentage, using linear Pearson's R correlation coefficients. The present results demonstrate differential methylation levels between HB and normal tissues, and imply that aberrant methylation of AFP in HB could reflect HB development. Expansion of these findings could provide useful insight into HB biology. PMID:27446465

  18. Genome-wide Association Study of Autism Spectrum Disorder in the East Asian Populations.

    PubMed

    Liu, Xiaoxi; Shimada, Takafumi; Otowa, Takeshi; Wu, Yu-Yu; Kawamura, Yoshiya; Tochigi, Mamoru; Iwata, Yasuhide; Umekage, Tadashi; Toyota, Tomoko; Maekawa, Motoko; Iwayama, Yoshimi; Suzuki, Katsuaki; Kakiuchi, Chihiro; Kuwabara, Hitoshi; Kano, Yukiko; Nishida, Hisami; Sugiyama, Toshiro; Kato, Nobumasa; Chen, Chia-Hsiang; Mori, Norio; Yamada, Kazuo; Yoshikawa, Takeo; Kasai, Kiyoto; Tokunaga, Katsushi; Sasaki, Tsukasa; Gau, Susan Shur-Fen

    2016-03-01

    Autism spectrum disorder is a heterogeneous neurodevelopmental disorder with strong genetic basis. To identify common genetic variations conferring the risk of ASD, we performed a two-stage genome-wide association study using ASD family and healthy control samples obtained from East Asian populations. A total of 166 ASD families (n = 500) and 642 healthy controls from the Japanese population were used as the discovery cohort. Approximately 900,000 single nucleotide polymorphisms (SNPs) were genotyped using Affymetrix Genome-Wide Human SNP array 6.0 chips. In the replication stage, 205 Japanese ASD cases and 184 healthy controls, as well as 418 Chinese Han trios (n = 1,254), were genotyped by TaqMan platform. Case-control analysis, family based association test, and transmission/disequilibrium test (TDT) were then conducted to test the association. In the discovery stage, significant associations were suggested for 14 loci, including 5 known ASD candidate genes: GPC6, JARID2, YTHDC2, CNTN4, and CSMD1. In addition, significant associations were identified for several novel genes with intriguing functions, such as JPH3, PTPRD, CUX1, and RIT2. After a meta-analysis combining the Japanese replication samples, the strongest signal was found at rs16976358 (P = 6.04 × 10(-7)), which is located near the RIT2 gene. In summary, our results provide independent support to known ASD candidate genes and highlight a number of novel genes warranted to be further investigated in a larger sample set in an effort to improve our understanding of the genetic basis of ASD.

  19. A Pooled Genome-Wide Association Study of Asperger Syndrome.

    PubMed

    Warrier, Varun; Chakrabarti, Bhismadev; Murphy, Laura; Chan, Allen; Craig, Ian; Mallya, Uma; Lakatošová, Silvia; Rehnstrom, Karola; Peltonen, Leena; Wheelwright, Sally; Allison, Carrie; Fisher, Simon E; Baron-Cohen, Simon

    2015-01-01

    Asperger Syndrome (AS) is a neurodevelopmental condition characterized by impairments in social interaction and communication, alongside the presence of unusually repetitive, restricted interests and stereotyped behaviour. Individuals with AS have no delay in cognitive and language development. It is a subset of Autism Spectrum Conditions (ASC), which are highly heritable and has a population prevalence of approximately 1%. Few studies have investigated the genetic basis of AS. To address this gap in the literature, we performed a genome-wide pooled DNA association study to identify candidate loci in 612 individuals (294 cases and 318 controls) of Caucasian ancestry, using the Affymetrix GeneChip Human Mapping version 6.0 array. We identified 11 SNPs that had a p-value below 1x10-5. These SNPs were independently genotyped in the same sample. Three of the SNPs (rs1268055, rs7785891 and rs2782448) were nominally significant, though none remained significant after Bonferroni correction. Two of our top three SNPs (rs7785891 and rs2782448) lie in loci previously implicated in ASC. However, investigation of the three SNPs in the ASC genome-wide association dataset from the Psychiatric Genomics Consortium indicated that these three SNPs were not significantly associated with ASC. The effect sizes of the variants were modest, indicating that our study was not sufficiently powered to identify causal variants with precision.

  20. A Pooled Genome-Wide Association Study of Asperger Syndrome.

    PubMed

    Warrier, Varun; Chakrabarti, Bhismadev; Murphy, Laura; Chan, Allen; Craig, Ian; Mallya, Uma; Lakatošová, Silvia; Rehnstrom, Karola; Peltonen, Leena; Wheelwright, Sally; Allison, Carrie; Fisher, Simon E; Baron-Cohen, Simon

    2015-01-01

    Asperger Syndrome (AS) is a neurodevelopmental condition characterized by impairments in social interaction and communication, alongside the presence of unusually repetitive, restricted interests and stereotyped behaviour. Individuals with AS have no delay in cognitive and language development. It is a subset of Autism Spectrum Conditions (ASC), which are highly heritable and has a population prevalence of approximately 1%. Few studies have investigated the genetic basis of AS. To address this gap in the literature, we performed a genome-wide pooled DNA association study to identify candidate loci in 612 individuals (294 cases and 318 controls) of Caucasian ancestry, using the Affymetrix GeneChip Human Mapping version 6.0 array. We identified 11 SNPs that had a p-value below 1x10-5. These SNPs were independently genotyped in the same sample. Three of the SNPs (rs1268055, rs7785891 and rs2782448) were nominally significant, though none remained significant after Bonferroni correction. Two of our top three SNPs (rs7785891 and rs2782448) lie in loci previously implicated in ASC. However, investigation of the three SNPs in the ASC genome-wide association dataset from the Psychiatric Genomics Consortium indicated that these three SNPs were not significantly associated with ASC. The effect sizes of the variants were modest, indicating that our study was not sufficiently powered to identify causal variants with precision. PMID:26176695

  1. A Pooled Genome-Wide Association Study of Asperger Syndrome

    PubMed Central

    Warrier, Varun; Chakrabarti, Bhismadev; Murphy, Laura; Chan, Allen; Craig, Ian; Mallya, Uma; Lakatošová, Silvia; Rehnstrom, Karola; Wheelwright, Sally; Allison, Carrie; Fisher, Simon E.; Baron-Cohen, Simon

    2015-01-01

    Asperger Syndrome (AS) is a neurodevelopmental condition characterized by impairments in social interaction and communication, alongside the presence of unusually repetitive, restricted interests and stereotyped behaviour. Individuals with AS have no delay in cognitive and language development. It is a subset of Autism Spectrum Conditions (ASC), which are highly heritable and has a population prevalence of approximately 1%. Few studies have investigated the genetic basis of AS. To address this gap in the literature, we performed a genome-wide pooled DNA association study to identify candidate loci in 612 individuals (294 cases and 318 controls) of Caucasian ancestry, using the Affymetrix GeneChip Human Mapping version 6.0 array. We identified 11 SNPs that had a p-value below 1x10-5. These SNPs were independently genotyped in the same sample. Three of the SNPs (rs1268055, rs7785891 and rs2782448) were nominally significant, though none remained significant after Bonferroni correction. Two of our top three SNPs (rs7785891 and rs2782448) lie in loci previously implicated in ASC. However, investigation of the three SNPs in the ASC genome-wide association dataset from the Psychiatric Genomics Consortium indicated that these three SNPs were not significantly associated with ASC. The effect sizes of the variants were modest, indicating that our study was not sufficiently powered to identify causal variants with precision. PMID:26176695

  2. Genome-wide association analysis of imputed rare variants: application to seven common complex diseases.

    PubMed

    Mägi, Reedik; Asimit, Jennifer L; Day-Williams, Aaron G; Zeggini, Eleftheria; Morris, Andrew P

    2012-12-01

    Genome-wide association studies have been successful in identifying loci contributing effects to a range of complex human traits. The majority of reproducible associations within these loci are with common variants, each of modest effect, which together explain only a small proportion of heritability. It has been suggested that much of the unexplained genetic component of complex traits can thus be attributed to rare variation. However, genome-wide association study genotyping chips have been designed primarily to capture common variation, and thus are underpowered to detect the effects of rare variants. Nevertheless, we demonstrate here, by simulation, that imputation from an existing scaffold of genome-wide genotype data up to high-density reference panels has the potential to identify rare variant associations with complex traits, without the need for costly re-sequencing experiments. By application of this approach to genome-wide association studies of seven common complex diseases, imputed up to publicly available reference panels, we identify genome-wide significant evidence of rare variant association in PRDM10 with coronary artery disease and multiple genes in the major histocompatibility complex (MHC) with type 1 diabetes. The results of our analyses highlight that genome-wide association studies have the potential to offer an exciting opportunity for gene discovery through association with rare variants, conceivably leading to substantial advancements in our understanding of the genetic architecture underlying complex human traits.

  3. Genome-wide transcription factor binding: beyond direct target regulation.

    PubMed

    MacQuarrie, Kyle L; Fong, Abraham P; Morse, Randall H; Tapscott, Stephen J

    2011-04-01

    The binding of transcription factors to specific DNA target sequences is the fundamental basis of gene regulatory networks. Chromatin immunoprecipitation combined with DNA tiling arrays or high-throughput sequencing (ChIP-chip and ChIP-seq, respectively) has been used in many recent studies that detail the binding sites of various transcription factors. Surprisingly, data from a variety of model organisms and tissues have demonstrated that transcription factors vary greatly in their number of genomic binding sites, and that binding events can significantly exceed the number of known or possible direct gene targets. Thus, current understanding of transcription factor function must expand to encompass what role, if any, binding might have outside of direct transcriptional target regulation. In this review, we discuss the biological significance of genome-wide binding of transcription factors and present models that can account for this phenomenon.

  4. Genome-wide association studies and contribution to cardiovascular physiology

    PubMed Central

    Munroe, Patricia B.

    2015-01-01

    The study of family pedigrees with rare monogenic cardiovascular disorders has revealed new molecular players in physiological processes. Genome-wide association studies of complex traits with a heritable component may afford a similar and potentially intellectually richer opportunity. In this review we focus on the interpretation of genetic associations and the issue of causality in relation to known and potentially new physiology. We mainly discuss cardiometabolic traits as it reflects our personal interests, but the issues pertain broadly in many other disciplines. We also describe some of the resources that are now available that may expedite follow up of genetic association signals into observations on causal mechanisms and pathophysiology. PMID:26106147

  5. Genome-wide association studies and contribution to cardiovascular physiology.

    PubMed

    Munroe, Patricia B; Tinker, Andrew

    2015-09-01

    The study of family pedigrees with rare monogenic cardiovascular disorders has revealed new molecular players in physiological processes. Genome-wide association studies of complex traits with a heritable component may afford a similar and potentially intellectually richer opportunity. In this review we focus on the interpretation of genetic associations and the issue of causality in relation to known and potentially new physiology. We mainly discuss cardiometabolic traits as it reflects our personal interests, but the issues pertain broadly in many other disciplines. We also describe some of the resources that are now available that may expedite follow up of genetic association signals into observations on causal mechanisms and pathophysiology.

  6. Genome-wide association studies in neurology

    PubMed Central

    Tan, Meng-Shan; Jiang, Teng

    2014-01-01

    Genome-wide association studies (GWAS) are a powerful tool for understanding the genetic underpinnings of human disease. In this article, we briefly review the role and findings of GWAS in common neurological diseases, including Stroke, Alzheimer’s disease, Parkinson’s disease, epilepsy, multiple sclerosis, migraine, amyotrophic lateral sclerosis, frontotemporal lobar degeneration, restless legs syndrome, intracranial aneurysm, human prion diseases and moyamoya disease. We then discuss the present and future implications of these findings with regards to disease prediction, uncovering basic biology, and the development of potential therapeutic agents. PMID:25568877

  7. Genome-wide identification of enhancer elements.

    PubMed

    Tulin, Sarah; Barsi, Julius C; Bocconcelli, Carlo; Smith, Joel

    2016-01-01

    We present a prospective genome-wide regulatory element database for the sea urchin embryo and the modified chromosome capture-related methodology used to create it. The method we developed is termed GRIP-seq for genome-wide regulatory element immunoprecipitation and combines features of chromosome conformation capture, chromatin immunoprecipitation, and paired-end next-generation sequencing with molecular steps that enrich for active cis-regulatory elements associated with basal transcriptional machinery. The first GRIP-seq database, available to the community, comes from S. purpuratus 24 hpf embryos and takes advantage of the extremely well-characterized cis-regulatory elements in this system for validation. In addition, using the GRIP-seq database, we identify and experimentally validate a novel, intronic cis-regulatory element at the onecut locus. We find GRIP-seq signal sensitively identifies active cis-regulatory elements with a high signal-to-noise ratio for both distal and intronic elements. This promising GRIP-seq protocol has the potential to address a rate-limiting step in resolving comprehensive, predictive network models in all systems.

  8. Genome-wide identification of enhancer elements.

    PubMed

    Tulin, Sarah; Barsi, Julius C; Bocconcelli, Carlo; Smith, Joel

    2016-01-01

    We present a prospective genome-wide regulatory element database for the sea urchin embryo and the modified chromosome capture-related methodology used to create it. The method we developed is termed GRIP-seq for genome-wide regulatory element immunoprecipitation and combines features of chromosome conformation capture, chromatin immunoprecipitation, and paired-end next-generation sequencing with molecular steps that enrich for active cis-regulatory elements associated with basal transcriptional machinery. The first GRIP-seq database, available to the community, comes from S. purpuratus 24 hpf embryos and takes advantage of the extremely well-characterized cis-regulatory elements in this system for validation. In addition, using the GRIP-seq database, we identify and experimentally validate a novel, intronic cis-regulatory element at the onecut locus. We find GRIP-seq signal sensitively identifies active cis-regulatory elements with a high signal-to-noise ratio for both distal and intronic elements. This promising GRIP-seq protocol has the potential to address a rate-limiting step in resolving comprehensive, predictive network models in all systems. PMID:27389984

  9. Genome-wide approaches to schizophrenia.

    PubMed

    Duan, Jubao; Sanders, Alan R; Gejman, Pablo V

    2010-09-30

    Schizophrenia (SZ) is a common and severe psychiatric disorder with both environmental and genetic risk factors, and a high heritability. After over 20 years of molecular genetics research, new molecular strategies, primarily genome-wide association studies (GWAS), have generated major tangible progress. This new data provides evidence for: (1) a number of chromosomal regions with common polymorphisms showing genome-wide association with SZ (the major histocompatibility complex, MHC, region at 6p22-p21; 18q21.2; and 2q32.1). The associated alleles present small odds ratios (the odds of a risk variant being present in cases vs. controls) and suggest causative involvement of gene regulatory mechanisms in SZ. (2) Polygenic inheritance. (3) Involvement of rare (<1%) and large (>100kb) copy number variants (CNVs). (4) A genetic overlap of SZ with autism and with bipolar disorder (BP) challenging the classical clinical classifications. Most new SZ findings (chromosomal regions and genes) have generated new biological leads. These new findings, however, still need to be translated into a better understanding of the underlying biology and into causal mechanisms. Furthermore, a considerable amount of heritability still remains unexplained (missing heritability). Deep resequencing for rare variants and system biology approaches (e.g., integrating DNA sequence and functional data) are expected to further improve our understanding of the genetic architecture of SZ and its underlying biology. PMID:20433910

  10. Genome-wide association study of antisocial personality disorder.

    PubMed

    Rautiainen, M-R; Paunio, T; Repo-Tiihonen, E; Virkkunen, M; Ollila, H M; Sulkava, S; Jolanki, O; Palotie, A; Tiihonen, J

    2016-01-01

    The pathophysiology of antisocial personality disorder (ASPD) remains unclear. Although the most consistent biological finding is reduced grey matter volume in the frontal cortex, about 50% of the total liability to developing ASPD has been attributed to genetic factors. The contributing genes remain largely unknown. Therefore, we sought to study the genetic background of ASPD. We conducted a genome-wide association study (GWAS) and a replication analysis of Finnish criminal offenders fulfilling DSM-IV criteria for ASPD (N=370, N=5850 for controls, GWAS; N=173, N=3766 for controls and replication sample). The GWAS resulted in suggestive associations of two clusters of single-nucleotide polymorphisms at 6p21.2 and at 6p21.32 at the human leukocyte antigen (HLA) region. Imputation of HLA alleles revealed an independent association with DRB1*01:01 (odds ratio (OR)=2.19 (1.53-3.14), P=1.9 × 10(-5)). Two polymorphisms at 6p21.2 LINC00951-LRFN2 gene region were replicated in a separate data set, and rs4714329 reached genome-wide significance (OR=1.59 (1.37-1.85), P=1.6 × 10(-9)) in the meta-analysis. The risk allele also associated with antisocial features in the general population conditioned for severe problems in childhood family (β=0.68, P=0.012). Functional analysis in brain tissue in open access GTEx and Braineac databases revealed eQTL associations of rs4714329 with LINC00951 and LRFN2 in cerebellum. In humans, LINC00951 and LRFN2 are both expressed in the brain, especially in the frontal cortex, which is intriguing considering the role of the frontal cortex in behavior and the neuroanatomical findings of reduced gray matter volume in ASPD. To our knowledge, this is the first study showing genome-wide significant and replicable findings on genetic variants associated with any personality disorder.

  11. SUSCEPTIBILITY LOCI FOR UMBILICAL HERNIA IN SWINE DETECTED BY GENOME-WIDE ASSOCIATION.

    PubMed

    Liao, X J; Lia, L; Zhang, Z Y; Long, Y; Yang, B; Ruan, G R; Su, Y; Ai, H S; Zhang, W C; Deng, W Y; Xiao, S J; Ren, J; Ding, N S; Huang, L S

    2015-10-01

    Umbilical hernia (UH) is a complex disorder caused by both genetic and environmental factors. UH brings animal welfare problems and severe economic loss to the pig industry. Until now, the genetic basis of UH is poorly understood. The high-density 60K porcine SNP array enables the rapid application of genome-wide association study (GWAS) to identify genetic loci for phenotypic traits at genome wide scale in pigs. The objective of this research was to identify susceptibility loci for swine umbilical hernia using the GWAS approach. We genotyped 478 piglets from 142 families representing three Western commercial breeds with the Illumina PorcineSNP60 BeadChip. Then significant SNPs were detected by GWAS using ROADTRIPS (Robust Association-Detection Test for Related Individuals with Population Substructure) software base on a Bonferroni corrected threshold (P = 1.67E-06) or suggestive threshold (P = 3.34E-05) and false discovery rate (FDR = 0.05). After quality control, 29,924 qualified SNPs and 472 piglets were used for GWAS. Two suggestive loci predisposing to pig UH were identified at 44.25MB on SSC2 (rs81358018, P = 3.34E-06, FDR = 0.049933) and at 45.90MB on SSC17 (rs81479278, P = 3.30E-06, FDR = 0.049933) in Duroc population, respectively. And no SNP was detected to be associated with pig UH at significant level in neither Landrace nor Large White population. Furthermore, we carried out a meta-analysis in the combined pure-breed population containing all the 472 piglets. rs81479278 (P = 1.16E-06, FDR = 0.022475) was identified to associate with pig UH at genome-wide significant level. SRC was characterized as plausible candidate gene for susceptibility to pig UH according to its genomic position and biological functions. To our knowledge, this study gives the first description of GWAS identifying susceptibility loci for umbilical hernia in pigs. Our findings provide deeper insights to the genetic architecture of umbilical hernia in pigs.

  12. Genome-Wide Association Studies of Cancer

    PubMed Central

    Stadler, Zsofia K.; Thom, Peter; Robson, Mark E.; Weitzel, Jeffrey N.; Kauff, Noah D.; Hurley, Karen E.; Devlin, Vincent; Gold, Bert; Klein, Robert J.; Offit, Kenneth

    2010-01-01

    Knowledge of the inherited risk for cancer is an important component of preventive oncology. In addition to well-established syndromes of cancer predisposition, much remains to be discovered about the genetic variation underlying susceptibility to common malignancies. Increased knowledge about the human genome and advances in genotyping technology have made possible genome-wide association studies (GWAS) of human diseases. These studies have identified many important regions of genetic variation associated with an increased risk for human traits and diseases including cancer. Understanding the principles, major findings, and limitations of GWAS is becoming increasingly important for oncologists as dissemination of genomic risk tests directly to consumers is already occurring through commercial companies. GWAS have contributed to our understanding of the genetic basis of cancer and will shed light on biologic pathways and possible new strategies for targeted prevention. To date, however, the clinical utility of GWAS-derived risk markers remains limited. PMID:20585100

  13. Genome-wide Membrane Protein Structure Prediction

    PubMed Central

    Piccoli, Stefano; Suku, Eda; Garonzi, Marianna; Giorgetti, Alejandro

    2013-01-01

    Transmembrane proteins allow cells to extensively communicate with the external world in a very accurate and specific way. They form principal nodes in several signaling pathways and attract large interest in therapeutic intervention, as the majority pharmaceutical compounds target membrane proteins. Thus, according to the current genome annotation methods, a detailed structural/functional characterization at the protein level of each of the elements codified in the genome is also required. The extreme difficulty in obtaining high-resolution three-dimensional structures, calls for computational approaches. Here we review to which extent the efforts made in the last few years, combining the structural characterization of membrane proteins with protein bioinformatics techniques, could help describing membrane proteins at a genome-wide scale. In particular we analyze the use of comparative modeling techniques as a way of overcoming the lack of high-resolution three-dimensional structures in the human membrane proteome. PMID:24403851

  14. Genome-wide association study of Tourette Syndrome

    PubMed Central

    Scharf, Jeremiah M.; Yu, Dongmei; Mathews, Carol A.; Neale, Benjamin M.; Stewart, S. Evelyn; Fagerness, Jesen A; Evans, Patrick; Gamazon, Eric; Edlund, Christopher K.; Service, Susan; Tikhomirov, Anna; Osiecki, Lisa; Illmann, Cornelia; Pluzhnikov, Anna; Konkashbaev, Anuar; Davis, Lea K; Han, Buhm; Crane, Jacquelyn; Moorjani, Priya; Crenshaw, Andrew T.; Parkin, Melissa A.; Reus, Victor I.; Lowe, Thomas L.; Rangel-Lugo, Martha; Chouinard, Sylvain; Dion, Yves; Girard, Simon; Cath, Danielle C; Smit, Jan H; King, Robert A.; Fernandez, Thomas; Leckman, James F.; Kidd, Kenneth K.; Kidd, Judith R.; Pakstis, Andrew J.; State, Matthew; Herrera, Luis Diego; Romero, Roxana; Fournier, Eduardo; Sandor, Paul; Barr, Cathy L; Phan, Nam; Gross-Tsur, Varda; Benarroch, Fortu; Pollak, Yehuda; Budman, Cathy L.; Bruun, Ruth D.; Erenberg, Gerald; Naarden, Allan L; Lee, Paul C; Weiss, Nicholas; Kremeyer, Barbara; Berrío, Gabriel Bedoya; Campbell, Desmond; Silgado, Julio C. Cardona; Ochoa, William Cornejo; Restrepo, Sandra C. Mesa; Muller, Heike; Duarte, Ana V. Valencia; Lyon, Gholson J; Leppert, Mark; Morgan, Jubel; Weiss, Robert; Grados, Marco A.; Anderson, Kelley; Davarya, Sarah; Singer, Harvey; Walkup, John; Jankovic, Joseph; Tischfield, Jay A.; Heiman, Gary A.; Gilbert, Donald L.; Hoekstra, Pieter J.; Robertson, Mary M.; Kurlan, Roger; Liu, Chunyu; Gibbs, J. Raphael; Singleton, Andrew; Hardy, John; Strengman, Eric; Ophoff, Roel; Wagner, Michael; Moessner, Rainald; Mirel, Daniel B.; Posthuma, Danielle; Sabatti, Chiara; Eskin, Eleazar; Conti, David V.; Knowles, James A.; Ruiz-Linares, Andres; Rouleau, Guy A.; Purcell, Shaun; Heutink, Peter; Oostra, Ben A.; McMahon, William; Freimer, Nelson; Cox, Nancy J.; Pauls, David L.

    2012-01-01

    Tourette Syndrome (TS) is a developmental disorder that has one of the highest familial recurrence rates among neuropsychiatric diseases with complex inheritance. However, the identification of definitive TS susceptibility genes remains elusive. Here, we report the first genome-wide association study (GWAS) of TS in 1285 cases and 4964 ancestry-matched controls of European ancestry, including two European-derived population isolates, Ashkenazi Jews from North America and Israel, and French Canadians from Quebec, Canada. In a primary meta-analysis of GWAS data from these European ancestry samples, no markers achieved a genome-wide threshold of significance (p<5 × 10−8); the top signal was found in rs7868992 on chromosome 9q32 within COL27A1 (p=1.85 × 10−6). A secondary analysis including an additional 211 cases and 285 controls from two closely-related Latin-American population isolates from the Central Valley of Costa Rica and Antioquia, Colombia also identified rs7868992 as the top signal (p=3.6 × 10−7 for the combined sample of 1496 cases and 5249 controls following imputation with 1000 Genomes data). This study lays the groundwork for the eventual identification of common TS susceptibility variants in larger cohorts and helps to provide a more complete understanding of the full genetic architecture of this disorder. PMID:22889924

  15. Genome-wide association study of Tourette's syndrome.

    PubMed

    Scharf, J M; Yu, D; Mathews, C A; Neale, B M; Stewart, S E; Fagerness, J A; Evans, P; Gamazon, E; Edlund, C K; Service, S K; Tikhomirov, A; Osiecki, L; Illmann, C; Pluzhnikov, A; Konkashbaev, A; Davis, L K; Han, B; Crane, J; Moorjani, P; Crenshaw, A T; Parkin, M A; Reus, V I; Lowe, T L; Rangel-Lugo, M; Chouinard, S; Dion, Y; Girard, S; Cath, D C; Smit, J H; King, R A; Fernandez, T V; Leckman, J F; Kidd, K K; Kidd, J R; Pakstis, A J; State, M W; Herrera, L D; Romero, R; Fournier, E; Sandor, P; Barr, C L; Phan, N; Gross-Tsur, V; Benarroch, F; Pollak, Y; Budman, C L; Bruun, R D; Erenberg, G; Naarden, A L; Lee, P C; Weiss, N; Kremeyer, B; Berrío, G B; Campbell, D D; Cardona Silgado, J C; Ochoa, W C; Mesa Restrepo, S C; Muller, H; Valencia Duarte, A V; Lyon, G J; Leppert, M; Morgan, J; Weiss, R; Grados, M A; Anderson, K; Davarya, S; Singer, H; Walkup, J; Jankovic, J; Tischfield, J A; Heiman, G A; Gilbert, D L; Hoekstra, P J; Robertson, M M; Kurlan, R; Liu, C; Gibbs, J R; Singleton, A; Hardy, J; Strengman, E; Ophoff, R A; Wagner, M; Moessner, R; Mirel, D B; Posthuma, D; Sabatti, C; Eskin, E; Conti, D V; Knowles, J A; Ruiz-Linares, A; Rouleau, G A; Purcell, S; Heutink, P; Oostra, B A; McMahon, W M; Freimer, N B; Cox, N J; Pauls, D L

    2013-06-01

    Tourette's syndrome (TS) is a developmental disorder that has one of the highest familial recurrence rates among neuropsychiatric diseases with complex inheritance. However, the identification of definitive TS susceptibility genes remains elusive. Here, we report the first genome-wide association study (GWAS) of TS in 1285 cases and 4964 ancestry-matched controls of European ancestry, including two European-derived population isolates, Ashkenazi Jews from North America and Israel and French Canadians from Quebec, Canada. In a primary meta-analysis of GWAS data from these European ancestry samples, no markers achieved a genome-wide threshold of significance (P<5 × 10(-8)); the top signal was found in rs7868992 on chromosome 9q32 within COL27A1 (P=1.85 × 10(-6)). A secondary analysis including an additional 211 cases and 285 controls from two closely related Latin American population isolates from the Central Valley of Costa Rica and Antioquia, Colombia also identified rs7868992 as the top signal (P=3.6 × 10(-7) for the combined sample of 1496 cases and 5249 controls following imputation with 1000 Genomes data). This study lays the groundwork for the eventual identification of common TS susceptibility variants in larger cohorts and helps to provide a more complete understanding of the full genetic architecture of this disorder.

  16. Genome-wide analysis correlates Ayurveda Prakriti

    PubMed Central

    Govindaraj, Periyasamy; Nizamuddin, Sheikh; Sharath, Anugula; Jyothi, Vuskamalla; Rotti, Harish; Raval, Ritu; Nayak, Jayakrishna; Bhat, Balakrishna K.; Prasanna, B. V.; Shintre, Pooja; Sule, Mayura; Joshi, Kalpana S.; Dedge, Amrish P.; Bharadwaj, Ramachandra; Gangadharan, G. G.; Nair, Sreekumaran; Gopinath, Puthiya M.; Patwardhan, Bhushan; Kondaiah, Paturu; Satyamoorthy, Kapaettu; Valiathan, Marthanda Varma Sankaran; Thangaraj, Kumarasamy

    2015-01-01

    The practice of Ayurveda, the traditional medicine of India, is based on the concept of three major constitutional types (Vata, Pitta and Kapha) defined as “Prakriti”. To the best of our knowledge, no study has convincingly correlated genomic variations with the classification of Prakriti. In the present study, we performed genome-wide SNP (single nucleotide polymorphism) analysis (Affymetrix, 6.0) of 262 well-classified male individuals (after screening 3416 subjects) belonging to three Prakritis. We found 52 SNPs (p ≤ 1 × 10−5) were significantly different between Prakritis, without any confounding effect of stratification, after 106 permutations. Principal component analysis (PCA) of these SNPs classified 262 individuals into their respective groups (Vata, Pitta and Kapha) irrespective of their ancestry, which represent its power in categorization. We further validated our finding with 297 Indian population samples with known ancestry. Subsequently, we found that PGM1 correlates with phenotype of Pitta as described in the ancient text of Caraka Samhita, suggesting that the phenotypic classification of India’s traditional medicine has a genetic basis; and its Prakriti-based practice in vogue for many centuries resonates with personalized medicine. PMID:26511157

  17. Genome-wide analysis correlates Ayurveda Prakriti.

    PubMed

    Govindaraj, Periyasamy; Nizamuddin, Sheikh; Sharath, Anugula; Jyothi, Vuskamalla; Rotti, Harish; Raval, Ritu; Nayak, Jayakrishna; Bhat, Balakrishna K; Prasanna, B V; Shintre, Pooja; Sule, Mayura; Joshi, Kalpana S; Dedge, Amrish P; Bharadwaj, Ramachandra; Gangadharan, G G; Nair, Sreekumaran; Gopinath, Puthiya M; Patwardhan, Bhushan; Kondaiah, Paturu; Satyamoorthy, Kapaettu; Valiathan, Marthanda Varma Sankaran; Thangaraj, Kumarasamy

    2015-10-29

    The practice of Ayurveda, the traditional medicine of India, is based on the concept of three major constitutional types (Vata, Pitta and Kapha) defined as "Prakriti". To the best of our knowledge, no study has convincingly correlated genomic variations with the classification of Prakriti. In the present study, we performed genome-wide SNP (single nucleotide polymorphism) analysis (Affymetrix, 6.0) of 262 well-classified male individuals (after screening 3416 subjects) belonging to three Prakritis. We found 52 SNPs (p ≤ 1 × 10(-5)) were significantly different between Prakritis, without any confounding effect of stratification, after 10(6) permutations. Principal component analysis (PCA) of these SNPs classified 262 individuals into their respective groups (Vata, Pitta and Kapha) irrespective of their ancestry, which represent its power in categorization. We further validated our finding with 297 Indian population samples with known ancestry. Subsequently, we found that PGM1 correlates with phenotype of Pitta as described in the ancient text of Caraka Samhita, suggesting that the phenotypic classification of India's traditional medicine has a genetic basis; and its Prakriti-based practice in vogue for many centuries resonates with personalized medicine.

  18. Genome-wide epigenetic modifications in cancer.

    PubMed

    Park, Yoon Jung; Claus, Rainer; Weichenhan, Dieter; Plass, Christoph

    2011-01-01

    Epigenetic alterations in cancer include changes in DNA methylation and associated histone modifications that influence the chromatin states and impact gene expression patterns. Due to recent technological advantages, the scientific community is now obtaining a better picture of the genome-wide epigenetic changes that occur in a cancer genome. These epigenetic alterations are associated with chromosomal instability and changes in transcriptional control which influence the overall gene expression differences seen in many human malignancies. In this review, we will briefly summarize our current knowledge of the epigenetic patterns and mechanisms of gene regulation in healthy tissues and relate this to what is known for cancer genomes. Our focus will be on DNA methylation. We will review the current standing of technologies that have been developed over recent years. This field is experiencing a revolution in the strategies used to measure epigenetic alterations, which includes the incorporation of next generation sequencing tools. We also will review strategies that utilize epigenetic information for translational purposes, with a special emphasis on the potential use of DNA methylation marks for early disease detection and prognosis. The review will close with an outlook on challenges that this field is facing.

  19. [Peach genomics and genome-wide association study: a review].

    PubMed

    Li, Xiong-Wei; Jia, Hui-Juan; Gao, Zhong-Shan

    2013-10-01

    Peach (Prunus persica (L.) Batsch) is one of the most predominant stone fruits in Rosaceae family. The broad climate adaption, diverse cultivation region and good fruit taste make it one of the favorate fruits by consumers. Improving fruit quality and enhancing disease/pest resistance are always a focus for peach genetists and breeders to follow with interests. This paper reviews the main achievements on linkage map and physical map construction, development of various molecular markers, whole genome sequencing and transcriptome sequencing for peach in recent years, and also elaborates the applications of genome wide association study (GWAS) with high density SNP markers in peach and other plant crops. This review also provides a theoretical basis for GWAS analysis in the future study to identify high efficient markers of targeted traits for peach.

  20. Meta-Analysis of Genome-Wide Linkage Studies in Celiac Disease

    PubMed Central

    Forabosco, Paola; Neuhausen, Susan L.; Greco, Luigi; Naluai, Åsa Torinsson; Wijmenga, Cisca; Saavalainen, Päivi; Houlston, Richard S.; Ciclitira, Paul J.; Babron, Marie-Claude; Lewis, Cathryn M.

    2009-01-01

    Objective A meta-analysis of genome-wide linkage studies allows us to summarize the extensive information available from family-based studies, as the field moves into genome-wide association studies. Methods Here we apply the genome scan meta-analysis (GSMA) method, a rank-based, model-free approach, to combine results across eight independent genome-wide linkages performed on celiac disease (CD), including 554 families with over 1,500 affected individuals. We also investigate the agreement between signals we identified from this meta-analysis of linkage studies and those identified from genome-wide association analysis using a hypergeometric distribution. Results Not surprisingly, the most significant result was obtained in the HLA region. Outside the HLA region, suggestive evidence for linkage was obtained at the telomeric region of chromosome 10 (10q26.12-qter; p = 0.00366), and on chromosome 8 (8q22.2-q24.21; p = 0.00491). Testing signals of association and linkage within bins showed no significant evidence for co-localization of results. Conclusion This meta-analysis allowed us to pool the results from available genome-wide linkage studies and to identify novel regions potentially harboring predisposing genetic variation contributing to CD. This study also shows that linkage and association studies may identify different types of disease-predisposing variants. PMID:19622889

  1. Genome Wide Methylome Alterations in Lung Cancer.

    PubMed

    Mullapudi, Nandita; Ye, Bin; Suzuki, Masako; Fazzari, Melissa; Han, Weiguo; Shi, Miao K; Marquardt, Gaby; Lin, Juan; Wang, Tao; Keller, Steven; Zhu, Changcheng; Locker, Joseph D; Spivack, Simon D

    2015-01-01

    Aberrant cytosine 5-methylation underlies many deregulated elements of cancer. Among paired non-small cell lung cancers (NSCLC), we sought to profile DNA 5-methyl-cytosine features which may underlie genome-wide deregulation. In one of the more dense interrogations of the methylome, we sampled 1.2 million CpG sites from twenty-four NSCLC tumor (T)-non-tumor (NT) pairs using a methylation-sensitive restriction enzyme- based HELP-microarray assay. We found 225,350 differentially methylated (DM) sites in adenocarcinomas versus adjacent non-tumor tissue that vary in frequency across genomic compartment, particularly notable in gene bodies (GB; p<2.2E-16). Further, when DM was coupled to differential transcriptome (DE) in the same samples, 37,056 differential loci in adenocarcinoma emerged. Approximately 90% of the DM-DE relationships were non-canonical; for example, promoter DM associated with DE in the same direction. Of the canonical changes noted, promoter (PR) DM loci with reciprocal changes in expression in adenocarcinomas included HBEGF, AGER, PTPRM, DPT, CST1, MELK; DM GB loci with concordant changes in expression included FOXM1, FERMT1, SLC7A5, and FAP genes. IPA analyses showed adenocarcinoma-specific promoter DMxDE overlay identified familiar lung cancer nodes [tP53, Akt] as well as less familiar nodes [HBEGF, NQO1, GRK5, VWF, HPGD, CDH5, CTNNAL1, PTPN13, DACH1, SMAD6, LAMA3, AR]. The unique findings from this study include the discovery of numerous candidate The unique findings from this study include the discovery of numerous candidate methylation sites in both PR and GB regions not previously identified in NSCLC, and many non-canonical relationships to gene expression. These DNA methylation features could potentially be developed as risk or diagnostic biomarkers, or as candidate targets for newer methylation locus-targeted preventive or therapeutic agents. PMID:26683690

  2. Genome Wide Methylome Alterations in Lung Cancer

    PubMed Central

    Suzuki, Masako; Fazzari, Melissa; Han, Weiguo; Shi, Miao K.; Marquardt, Gaby; Lin, Juan; Wang, Tao; Keller, Steven; Zhu, Changcheng; Locker, Joseph D.; Spivack, Simon D.

    2015-01-01

    Aberrant cytosine 5-methylation underlies many deregulated elements of cancer. Among paired non-small cell lung cancers (NSCLC), we sought to profile DNA 5-methyl-cytosine features which may underlie genome-wide deregulation. In one of the more dense interrogations of the methylome, we sampled 1.2 million CpG sites from twenty-four NSCLC tumor (T)–non-tumor (NT) pairs using a methylation-sensitive restriction enzyme- based HELP-microarray assay. We found 225,350 differentially methylated (DM) sites in adenocarcinomas versus adjacent non-tumor tissue that vary in frequency across genomic compartment, particularly notable in gene bodies (GB; p<2.2E-16). Further, when DM was coupled to differential transcriptome (DE) in the same samples, 37,056 differential loci in adenocarcinoma emerged. Approximately 90% of the DM-DE relationships were non-canonical; for example, promoter DM associated with DE in the same direction. Of the canonical changes noted, promoter (PR) DM loci with reciprocal changes in expression in adenocarcinomas included HBEGF, AGER, PTPRM, DPT, CST1, MELK; DM GB loci with concordant changes in expression included FOXM1, FERMT1, SLC7A5, and FAP genes. IPA analyses showed adenocarcinoma-specific promoter DMxDE overlay identified familiar lung cancer nodes [tP53, Akt] as well as less familiar nodes [HBEGF, NQO1, GRK5, VWF, HPGD, CDH5, CTNNAL1, PTPN13, DACH1, SMAD6, LAMA3, AR]. The unique findings from this study include the discovery of numerous candidate The unique findings from this study include the discovery of numerous candidate methylation sites in both PR and GB regions not previously identified in NSCLC, and many non-canonical relationships to gene expression. These DNA methylation features could potentially be developed as risk or diagnostic biomarkers, or as candidate targets for newer methylation locus-targeted preventive or therapeutic agents. PMID:26683690

  3. Genome-Wide Methylation Profiling of Schizophrenia

    PubMed Central

    Rukova, B; Staneva, R; Hadjidekova, S; Stamenov, G; Milanova; Toncheva, D

    2014-01-01

    Schizophrenia is one of the major psychiatric disorders. It is a disorder of complex inheritance, involving both heritable and environmental factors. DNA methylation is an inheritable epigenetic modification that stably alters gene expression. We reasoned that genetic modifications that are a result of environmental stimuli could also make a contribution. We have performed 26 high-resolution genome-wide methylation array analyses to determine the methylation status of 27,627 CpG islands and compared the data between patients and healthy controls. Methylation profiles of DNAs were analyzed in six pools: 220 schizophrenia patients; 220 age-matched healthy controls; 110 female schizophrenia patients; 110 age-matched healthy females; 110 male schizophrenia patients; 110 age-matched healthy males. We also investigated the methylation status of 20 individual patient DNA samples (eight females and 12 males. We found significant differences in the methylation profile between schizophrenia and control DNA pools. We found new candidate genes that principally participate in apoptosis, synaptic transmission and nervous system development (GABRA2, LIN7B, CASP3). Methylation profiles differed between the genders. In females, the most important genes participate in apoptosis and synaptic transmission (XIAP, GABRD, OXT, KRT7), whereas in the males, the implicated genes in the molecular pathology of the disease were DHX37, MAP2K2, FNDC4 and GIPC1. Data from the individual methylation analyses confirmed, the gender-specific pools results. Our data revealed major differences in methylation profiles between schizophrenia patients and controls and between male and female patients. The dysregulated activity of the candidate genes could play a role in schizophrenia pathogenesis. PMID:25937794

  4. Genome-wide association study of antisocial personality disorder

    PubMed Central

    Rautiainen, M-R; Paunio, T; Repo-Tiihonen, E; Virkkunen, M; Ollila, H M; Sulkava, S; Jolanki, O; Palotie, A; Tiihonen, J

    2016-01-01

    The pathophysiology of antisocial personality disorder (ASPD) remains unclear. Although the most consistent biological finding is reduced grey matter volume in the frontal cortex, about 50% of the total liability to developing ASPD has been attributed to genetic factors. The contributing genes remain largely unknown. Therefore, we sought to study the genetic background of ASPD. We conducted a genome-wide association study (GWAS) and a replication analysis of Finnish criminal offenders fulfilling DSM-IV criteria for ASPD (N=370, N=5850 for controls, GWAS; N=173, N=3766 for controls and replication sample). The GWAS resulted in suggestive associations of two clusters of single-nucleotide polymorphisms at 6p21.2 and at 6p21.32 at the human leukocyte antigen (HLA) region. Imputation of HLA alleles revealed an independent association with DRB1*01:01 (odds ratio (OR)=2.19 (1.53–3.14), P=1.9 × 10-5). Two polymorphisms at 6p21.2 LINC00951–LRFN2 gene region were replicated in a separate data set, and rs4714329 reached genome-wide significance (OR=1.59 (1.37–1.85), P=1.6 × 10−9) in the meta-analysis. The risk allele also associated with antisocial features in the general population conditioned for severe problems in childhood family (β=0.68, P=0.012). Functional analysis in brain tissue in open access GTEx and Braineac databases revealed eQTL associations of rs4714329 with LINC00951 and LRFN2 in cerebellum. In humans, LINC00951 and LRFN2 are both expressed in the brain, especially in the frontal cortex, which is intriguing considering the role of the frontal cortex in behavior and the neuroanatomical findings of reduced gray matter volume in ASPD. To our knowledge, this is the first study showing genome-wide significant and replicable findings on genetic variants associated with any personality disorder. PMID:27598967

  5. Genome-wide association study of antisocial personality disorder.

    PubMed

    Rautiainen, M-R; Paunio, T; Repo-Tiihonen, E; Virkkunen, M; Ollila, H M; Sulkava, S; Jolanki, O; Palotie, A; Tiihonen, J

    2016-01-01

    The pathophysiology of antisocial personality disorder (ASPD) remains unclear. Although the most consistent biological finding is reduced grey matter volume in the frontal cortex, about 50% of the total liability to developing ASPD has been attributed to genetic factors. The contributing genes remain largely unknown. Therefore, we sought to study the genetic background of ASPD. We conducted a genome-wide association study (GWAS) and a replication analysis of Finnish criminal offenders fulfilling DSM-IV criteria for ASPD (N=370, N=5850 for controls, GWAS; N=173, N=3766 for controls and replication sample). The GWAS resulted in suggestive associations of two clusters of single-nucleotide polymorphisms at 6p21.2 and at 6p21.32 at the human leukocyte antigen (HLA) region. Imputation of HLA alleles revealed an independent association with DRB1*01:01 (odds ratio (OR)=2.19 (1.53-3.14), P=1.9 × 10(-5)). Two polymorphisms at 6p21.2 LINC00951-LRFN2 gene region were replicated in a separate data set, and rs4714329 reached genome-wide significance (OR=1.59 (1.37-1.85), P=1.6 × 10(-9)) in the meta-analysis. The risk allele also associated with antisocial features in the general population conditioned for severe problems in childhood family (β=0.68, P=0.012). Functional analysis in brain tissue in open access GTEx and Braineac databases revealed eQTL associations of rs4714329 with LINC00951 and LRFN2 in cerebellum. In humans, LINC00951 and LRFN2 are both expressed in the brain, especially in the frontal cortex, which is intriguing considering the role of the frontal cortex in behavior and the neuroanatomical findings of reduced gray matter volume in ASPD. To our knowledge, this is the first study showing genome-wide significant and replicable findings on genetic variants associated with any personality disorder. PMID:27598967

  6. Genome-wide identification and expression analysis of the mitogen-activated protein kinase gene family from banana suggest involvement of specific members in different stages of fruit ripening.

    PubMed

    Asif, Mehar Hasan; Lakhwani, Deepika; Pathak, Sumya; Bhambhani, Sweta; Bag, Sumit K; Trivedi, Prabodh Kumar

    2014-03-01

    Mitogen-activated protein kinases (MAPKs) are important components of the tripartite mitogen-activated protein kinase signaling cascade and play an important role in plant growth and development. Although members of the MAPK gene family have been identified in model plants, little information is available regarding this gene family in fruit crops. In this study, we carried out a computational analysis using the Musa Genome database to identify members of the MAPK gene family in banana, an economically important crop and the most popular fruit worldwide. Our analysis identified 25 members of the MAP kinase (MAPK or MPK) gene family. Phylogenetic analyses of MPKs in Arabidopsis, Oryza, and Populus have classified these MPKs into four subgroups. The presence of conserved domains in the deduced amino acid sequences, phylogeny, and genomic organization strongly support their identity as members of the MPK gene family. Expression analysis during ethylene-induced banana fruit ripening suggests the involvement of several MPKs in the ethylene signal transduction pathway that are necessary for banana fruit ripening. Analysis of the cis-regulatory elements in the promoter regions and the involvement of the identified MPKs in various cellular processes, as analyzed using Pathway Studio, suggest a role for the banana MPK gene family in diverse functions related to growth, development, and the stress response. This report is the first concerning the identification of members of a gene family and the elucidation of their role in various processes using the Musa Genome database. PMID:24275941

  7. Genome-Wide Linkage, Exome Sequencing and Functional Analyses Identify ABCB6 as the Pathogenic Gene of Dyschromatosis Universalis Hereditaria

    PubMed Central

    Wang, Na; Wang, Chuan; Chen, Xuechao; Sheng, Donglai; Fu, Xi’an; See, Kelvin; Foo, Jia Nee; Low, Huiqi; Liany, Herty; Irwan, Ishak Darryl; Liu, Jian; Yang, Baoqi; Chen, Mingfei; Yu, Yongxiang; Yu, Gongqi; Niu, Guiye; You, Jiabao; Zhou, Yan; Ma, Shanshan; Wang, Ting; Yan, Xiaoxiao; Goh, Boon Kee; Common, John E. A.; Lane, Birgitte E.; Sun, Yonghu; Zhou, Guizhi; Lu, Xianmei; Wang, Zhenhua; Tian, Hongqing; Cao, Yuanhua; Chen, Shumin; Liu, Qiji; Liu, Jianjun; Zhang, Furen

    2014-01-01

    Background As a genetic disorder of abnormal pigmentation, the molecular basis of dyschromatosis universalis hereditaria (DUH) had remained unclear until recently when ABCB6 was reported as a causative gene of DUH. Methodology We performed genome-wide linkage scan using Illumina Human 660W-Quad BeadChip and exome sequencing analyses using Agilent SureSelect Human All Exon Kits in a multiplex Chinese DUH family to identify the pathogenic mutations and verified the candidate mutations using Sanger sequencing. Quantitative RT-PCR and Immunohistochemistry was performed to verify the expression of the pathogenic gene, Zebrafish was also used to confirm the functional role of ABCB6 in melanocytes and pigmentation. Results Genome-wide linkage (assuming autosomal dominant inheritance mode) and exome sequencing analyses identified ABCB6 as the disease candidate gene by discovering a coding mutation (c.1358C>T; p.Ala453Val) that co-segregates with the disease phenotype. Further mutation analysis of ABCB6 in four other DUH families and two sporadic cases by Sanger sequencing confirmed the mutation (c.1358C>T; p.Ala453Val) and discovered a second, co-segregating coding mutation (c.964A>C; p.Ser322Lys) in one of the four families. Both mutations were heterozygous in DUH patients and not present in the 1000 Genome Project and dbSNP database as well as 1,516 unrelated Chinese healthy controls. Expression analysis in human skin and mutagenesis interrogation in zebrafish confirmed the functional role of ABCB6 in melanocytes and pigmentation. Given the involvement of ABCB6 mutations in coloboma, we performed ophthalmological examination of the DUH carriers of ABCB6 mutations and found ocular abnormalities in them. Conclusion Our study has advanced our understanding of DUH pathogenesis and revealed the shared pathological mechanism between pigmentary DUH and ocular coloboma. PMID:24498303

  8. Genome-wide association defines more than thirty distinct susceptibility loci for Crohn's disease

    PubMed Central

    Barrett, Jeffrey C.; Hansoul, Sarah; Nicolae, Dan L.; Cho, Judy H.; Duerr, Richard H.; Rioux, John D.; Brant, Steven R.; Silverberg, Mark S.; Taylor, Kent D.; Barmada, M. Michael; Bitton, Alain; Dassopoulos, Themistocles; Datta, Lisa Wu; Green, Todd; Griffiths, Anne M.; Kistner, Emily O.; Murtha, Michael T.; Regueiro, Miguel D.; Rotter, Jerome I.; Schumm, L. Philip; Steinhart, A. Hillary; Targan, Stephan R.; Xavier, Ramnik J.; Libioulle, Cécile; Sandor, Cynthia; Lathrop, Mark; Belaiche, Jacques; Dewit, Olivier; Gut, Ivo; Heath, Simon; Laukens, Debby; Mni, Myriam; Rutgeerts, Paul; Van Gossum, André; Zelenika, Diana; Franchimont, Denis; Hugot, JP; de Vos, Martine; Vermeire, Severine; Louis, Edouard; Cardon, Lon R.; Anderson, Carl A.; Drummond, Hazel; Nimmo, Elaine; Ahmad, Tariq; Prescott, Natalie J; Onnie, Clive M.; Fisher, Sheila A.; Marchini, Jonathan; Ghori, Jilur; Bumpstead, Suzannah; Gwillam, Rhian; Tremelling, Mark; Deloukas, Panos; Mansfield, John; Jewell, Derek; Satsangi, Jack; Mathew, Christopher G.; Parkes, Miles; Georges, Michel; Daly, Mark J.

    2008-01-01

    Several new risk factors for Crohn's disease have been identified in recent genome-wide association studies. To advance gene discovery further we have combined the data from three studies (a total of 3,230 cases and 4,829 controls) and performed replication in 3,664 independent cases with a mixture of population-based and family-based controls. The results strongly confirm 11 previously reported loci and provide genome-wide significant evidence for 21 new loci, including the regions containing STAT3, JAK2, ICOSLG, CDKAL1, and ITLN1. The expanded molecular understanding of the basis of disease offers promise for informed therapeutic development. PMID:18587394

  9. Genome-wide identification and comparative expression analysis reveal a rapid expansion and functional divergence of duplicated genes in the WRKY gene family of cabbage, Brassica oleracea var. capitata.

    PubMed

    Yao, Qiu-Yang; Xia, En-Hua; Liu, Fei-Hu; Gao, Li-Zhi

    2015-02-15

    WRKY transcription factors (TFs), one of the ten largest TF families in higher plants, play important roles in regulating plant development and resistance. To date, little is known about the WRKY TF family in Brassica oleracea. Recently, the completed genome sequence of cabbage (B. oleracea var. capitata) allows us to systematically analyze WRKY genes in this species. A total of 148 WRKY genes were characterized and classified into seven subgroups that belong to three major groups. Phylogenetic and synteny analyses revealed that the repertoire of cabbage WRKY genes was derived from a common ancestor shared with Arabidopsis thaliana. The B. oleracea WRKY genes were found to be preferentially retained after the whole-genome triplication (WGT) event in its recent ancestor, suggesting that the WGT event had largely contributed to a rapid expansion of the WRKY gene family in B. oleracea. The analysis of RNA-Seq data from various tissues (i.e., roots, stems, leaves, buds, flowers and siliques) revealed that most of the identified WRKY genes were positively expressed in cabbage, and a large portion of them exhibited patterns of differential and tissue-specific expression, demonstrating that these gene members might play essential roles in plant developmental processes. Comparative analysis of the expression level among duplicated genes showed that gene expression divergence was evidently presented among cabbage WRKY paralogs, indicating functional divergence of these duplicated WRKY genes.

  10. Genome-wide identification and comparative expression analysis reveal a rapid expansion and functional divergence of duplicated genes in the WRKY gene family of cabbage, Brassica oleracea var. capitata.

    PubMed

    Yao, Qiu-Yang; Xia, En-Hua; Liu, Fei-Hu; Gao, Li-Zhi

    2015-02-15

    WRKY transcription factors (TFs), one of the ten largest TF families in higher plants, play important roles in regulating plant development and resistance. To date, little is known about the WRKY TF family in Brassica oleracea. Recently, the completed genome sequence of cabbage (B. oleracea var. capitata) allows us to systematically analyze WRKY genes in this species. A total of 148 WRKY genes were characterized and classified into seven subgroups that belong to three major groups. Phylogenetic and synteny analyses revealed that the repertoire of cabbage WRKY genes was derived from a common ancestor shared with Arabidopsis thaliana. The B. oleracea WRKY genes were found to be preferentially retained after the whole-genome triplication (WGT) event in its recent ancestor, suggesting that the WGT event had largely contributed to a rapid expansion of the WRKY gene family in B. oleracea. The analysis of RNA-Seq data from various tissues (i.e., roots, stems, leaves, buds, flowers and siliques) revealed that most of the identified WRKY genes were positively expressed in cabbage, and a large portion of them exhibited patterns of differential and tissue-specific expression, demonstrating that these gene members might play essential roles in plant developmental processes. Comparative analysis of the expression level among duplicated genes showed that gene expression divergence was evidently presented among cabbage WRKY paralogs, indicating functional divergence of these duplicated WRKY genes. PMID:25481634

  11. Genome-wide Association Studies of Maximum Number of Drinks

    PubMed Central

    Pan, Yue; Luo, Xingguang; Liu, Xuefeng; Wu, Long-Yang; Zhang, Qunyuan; Wang, Liang; Wang, Weize; Zuo, Lingjun; Wang, Ke-Sheng

    2014-01-01

    Maximum number of drinks (MaxDrinks) defined as “Maximum number of alcoholic drinks consumed in a 24-hour period” is an intermediate phenotype that is closely related to alcohol dependence (AD). Family, twin and adoption studies have shown that the heritability of MaxDrinks is approximately 0.5. We conducted the first genome-wide association (GWA) study and meta-analysis of MaxDrinks as a continuous phenotype. 1059 individuals were from the Collaborative Study on the Genetics of Alcoholism (COGA) sample and 1628 individuals were from the Study of Addiction – Genetics and Environment (SAGE) sample. Family sample with 3137 individuals was from the Australian twin-family study of alcohol use disorder (OZALC). Two population-based Caucasian samples (COGA and SAGE) with 1 million single-nucleotide polymorphisms (SNPs) were used for gene discovery and one family-based Caucasian sample was used for replication. Through meta-analysis we identified 162 SNPs associated with MaxDirnks (p < 10−4). The most significant association with MaxDrinks was observed with SNP rs11128951 (p = 4.27×10−8) near SGOL1 gene at 3p24.3. Furthermore, several SNPs (rs17144687 near DTWD2, rs12108602 near NDST4, and rs2128158 in KCNB2) showed significant associations with MaxDrinks (p < 5×10−7) in the meta-analysis. Especially, 8 SNPs in DDC gene showed significant associations with MaxDrinks (p< 5×10−7) in the SAGE sample. Several flanking SNPs in above genes/regions were confirmed in the OZALC family sample. In conclusions, we identified several genes/regions associated with MaxDrinks. These findings can improve the understanding about the pathogenesis of alcohol consumption phenotypes and alcohol-related disorders. PMID:23953852

  12. Genome-Wide Identification of Calcium Dependent Protein Kinase Gene Family in Plant Lineage Shows Presence of Novel D-x-D and D-E-L Motifs in EF-Hand Domain

    PubMed Central

    Mohanta, Tapan K.; Mohanta, Nibedita; Mohanta, Yugal K.; Bae, Hanhong

    2015-01-01

    Calcium ions are considered ubiquitous second messengers in eukaryotic signal transduction pathways. Intracellular Ca2+ concentration are modulated by various signals such as hormones and biotic and abiotic stresses. Modulation of Ca2+ ion leads to stimulation of calcium dependent protein kinase genes (CPKs), which results in regulation of gene expression and therefore mediates plant growth and development as well as biotic and abiotic stresses. Here, we reported the CPK gene family of 40 different plant species (950 CPK genes) and provided a unified nomenclature system for all of them. In addition, we analyzed their genomic, biochemical and structural conserved features. Multiple sequence alignment revealed that the kinase domain, auto-inhibitory domain and EF-hands regions of regulatory domains are highly conserved in nature. Additionally, the EF-hand domains of higher plants were found to contain four D-x-D and two D-E-L motifs, while lower eukaryotic plants had two D-x-D and one D-x-E motifs in their EF-hands. Phylogenetic analysis showed that CPK genes are clustered into four different groups. By studying the CPK gene family across the plant lineage, we provide the first evidence of the presence of D-x-D motif in the calcium binding EF-hand domain of CPK proteins. PMID:26734045

  13. Genome-Wide Methylation Analyses in Glioblastoma Multiforme

    PubMed Central

    Lai, Rose K.; Chen, Yanwen; Guan, Xiaowei; Nousome, Darryl; Sharma, Charu; Canoll, Peter; Bruce, Jeffrey; Sloan, Andrew E.; Cortes, Etty; Vonsattel, Jean-Paul; Su, Tao; Delgado-Cruzata, Lissette; Gurvich, Irina; Santella, Regina M.; Ostrom, Quinn; Lee, Annette; Gregersen, Peter; Barnholtz-Sloan, Jill

    2014-01-01

    Few studies had investigated genome-wide methylation in glioblastoma multiforme (GBM). Our goals were to study differential methylation across the genome in gene promoters using an array-based method, as well as repetitive elements using surrogate global methylation markers. The discovery sample set for this study consisted of 54 GBM from Columbia University and Case Western Reserve University, and 24 brain controls from the New York Brain Bank. We assembled a validation dataset using methylation data of 162 TCGA GBM and 140 brain controls from dbGAP. HumanMethylation27 Analysis Bead-Chips (Illumina) were used to interrogate 26,486 informative CpG sites in both the discovery and validation datasets. Global methylation levels were assessed by analysis of L1 retrotransposon (LINE1), 5 methyl-deoxycytidine (5m-dC) and 5 hydroxylmethyl-deoxycytidine (5hm-dC) in the discovery dataset. We validated a total of 1548 CpG sites (1307 genes) that were differentially methylated in GBM compared to controls. There were more than twice as many hypomethylated genes as hypermethylated ones. Both the discovery and validation datasets found 5 tumor methylation classes. Pathway analyses showed that the top ten pathways in hypomethylated genes were all related to functions of innate and acquired immunities. Among hypermethylated pathways, transcriptional regulatory network in embryonic stem cells was the most significant. In the study of global methylation markers, 5m-dC level was the best discriminant among methylation classes, whereas in survival analyses, high level of LINE1 methylation was an independent, favorable prognostic factor in the discovery dataset. Based on a pathway approach, hypermethylation in genes that control stem cell differentiation were significant, poor prognostic factors of overall survival in both the discovery and validation datasets. Approaches that targeted these methylated genes may be a future therapeutic goal. PMID:24586730

  14. Genome-Wide Identification and Expression Profiling Analysis of ZmPIN, ZmPILS, ZmLAX and ZmABCB Auxin Transporter Gene Families in Maize (Zea mays L.) under Various Abiotic Stresses

    PubMed Central

    Sun, Tao; Zhang, Lei; Yang, Yanjun; Qi, Jianshuang; Yan, Shufeng; Han, Xiaohua; Wang, Huizhong; Shen, Chenjia

    2015-01-01

    The auxin influx carriers auxin resistant 1/like aux 1 (AUX/LAX), efflux carriers pin-formed (PIN) (together with PIN-like proteins) and efflux/conditional P-glycoprotein (ABCB) are major protein families involved in auxin polar transport. However, how they function in responses to exogenous auxin and abiotic stresses in maize is largely unknown. In this work, the latest updated maize (Zea mays L.) reference genome sequence was used to characterize and analyze the ZmLAX, ZmPIN, ZmPILS and ZmABCB family genes from maize. The results showed that five ZmLAXs, fifteen ZmPINs, nine ZmPILSs and thirty-five ZmABCBs were mapped on all ten maize chromosomes. Highly diversified gene structures, nonconservative transmembrane helices and tissue-specific expression patterns suggested the possibility of function diversification for these genes. Quantitative real-time polymerase chain reaction (qRT-PCR) was used to analyze the expression patterns of ZmLAX, ZmPIN, ZmPILS and ZmABCB genes under exogenous auxin and different environmental stresses. The expression levels of most ZmPIN, ZmPILS, ZmLAX and ZmABCB genes were induced in shoots and were reduced in roots by various abiotic stresses (drought, salt and cold stresses). The opposite expression response patterns indicated the dynamic auxin transport between shoots and roots under abiotic stresses. Analysis of the expression patterns of ZmPIN, ZmPILS, ZmLAX and ZmABCB genes under drought, salt and cold treatment may help us to understand the possible roles of maize auxin transporter genes in responses and tolerance to environmental stresses. PMID:25742625

  15. The National Longitudinal Study of Adolescent to Adult Health (Add Health) Sibling Pairs Genome-Wide Data

    PubMed Central

    McQueen, Matthew B.; Boardman, Jason D.; Domingue, Benjamin W.; Smolen, Andrew; Tabor, Joyce; Killeya-Jones, Ley; Halpern, Carolyn T.; Whitsel, Eric A.; MullanHarris, Kathleen

    2014-01-01

    Here we provide a detailed description of the genome-wide information available on the National Longitudinal Study of Adolescent to Adult Health (Add Health) sibling pair subsample (Harris et al., 2012). A total of 2020 samples were genotyped (including duplicates) arising from 1946 Add Health individuals from the sibling pairs subsample. After various steps for quality control (QC) and quality assurance (QA), we have high quality genome-wide data available on 1,888 individuals. In this report, we first highlight theQC and QA steps that were taken to prune the data of poorly performing samples and genetic markers. We further estimate the pairwise biological relationships using genome-wide data and compare those estimates to the assumed relationships in Add Health. Additionally, using genome-wide data from knownregional reference populations from Europe, West Africa, North and South America, Japan and China, weestimate the relative genetic ancestry of the respondents. Finally, rather than conducting a traditional cross-sectional genome-wide association study (GWAS) of body mass index (BMI), we opted to utilize the extensivepublicly available genome-wide information to conduct a weighted genome-wide association study (GWAS) of longitudinal BMI while accounting for both family and ethnic variation. PMID:25378290

  16. Genome-Wide Scan Reveals Mutation Associated with Melanoma

    MedlinePlus

    ... 1999 Spotlight on Research 2012 July 2012 (historical) Genome-Wide Scan Reveals Mutation Associated with Melanoma A ... out to see if a technology called whole genome sequencing would help them find other genetic risk ...

  17. Genome-wide transcriptome analysis of human epidermal melanocytes

    PubMed Central

    Haltaufderhyde, Kirk D.; Oancea, Elena

    2015-01-01

    Because human epidermal melanocytes (HEMs) provide critical protection against skin cancer, sunburn, and photoaging, a genome-wide perspective of gene expression in these cells is vital to understanding human skin physiology. In this study we performed high throughput sequencing of HEMs to obtain a complete data set of transcript sizes, abundances, and splicing. As expected, we found that melanocyte specific genes that function in pigmentation were among the highest expressed genes. We analyzed receptor, ion channel and transcription factor gene families to get a better understanding of the cell signalling pathways used by melanocytes. We also performed a comparative transcriptomic analysis of lightly versus darkly pigmented HEMs and found 16 genes differentially expressed in the two pigmentation phenotypes; of those, only one putative melanosomal transporter (SLC45A2) has known function in pigmentation. In addition, we found 166 genes with splice isoforms expressed exclusively in one pigmentation phenotype, 17 of which are genes involved in signal transduction. Our melanocyte transcriptome study provides a comprehensive view and may help identify novel pigmentation genes and potential pharmacological targets. PMID:25451175

  18. Genome-Wide Silencing in Drosophila Captures Conserved Apoptotic Effectors

    PubMed Central

    Chew, Su Kit; Chen, Po; Link, Nichole; Galindo, Kathleen A.; Pogue, Kristi; Abrams, John M.

    2009-01-01

    Summary Apoptosis is a conserved form of programmed cell death (PCD) firmly established in the etiology, pathogenesis and treatment of many human diseases. Central to the core machinery of apoptosis are the caspases and their proximal regulators. Current models for caspase control envision a balance of opposing elements, with variable contributions from positive regulators and negative regulators among different cell types and species1. To advance a comprehensive view of components that support caspase-dependent cell death, we conducted a genome-wide silencing screen in the Drosophila model. Our strategy combined a library of dsRNAs together with a chemical antagonist of Inhibitor of Apoptosis Proteins (IAPs) that simulates the action of native regulators in the Reaper/Smac family2. A highly validated set of targets necessary for death provoked by multiple stimuli was identified. Among these, Tango7 is advanced here as a novel effector. Cells depleted for this gene resisted apoptosis at a step prior to induction of effector caspase activity and directed silencing of Tango7 in the animal prevented caspase-dependent PCD. Unlike known apoptosis regulators in this model3, Tango7 activity did not influence stimulus-dependent loss of Drosophila IAP1 (DIAP1) but, instead, regulated levels of the apical caspase Dronc. Likewise, the human Tango7 counterpart, PCID1, similarly impinged on caspase 9, revealing a novel regulatory axis impacting the apoptosome. PMID:19483676

  19. A genome-wide association study to detect genetic variation for postpartum dysgalactia syndrome in five commercial pig breeding lines.

    PubMed

    Preissler, Regine; Tetens, Jens; Reiners, Kerstin; Looft, Holger; Kemper, Nicole

    2013-08-01

    Postpartum dysgalactia syndrome (PDS) in sows is an important disease after parturition with a relevant economic impact, affecting the health and welfare of both sows and piglets. The genetic background of this disease has been discussed and its heritability estimated, but further genetic analyses are lacking in detail. The aim of the current study was to detect loci affecting the susceptibility to PDS through a genome-wide association approach. The study was designed as a family-based association study with matched sampling of affected sows and healthy half- or full-sib control sows on six farms. For the study, 597 sows (322 affected vs. 275 healthy control sows) were genotyped on 62 163 single nucleotide polymorphisms (SNPs) using the Illumina PorcineSNP60 BeadChip. After quality control, 585 sows (314 affected vs. 271 healthy control sows) and 49 740 SNPs remained for further analysis. Statistics were performed mainly with the r package genabel and included a principal component analysis. A statistically significant genome-wide associated SNP was identified on porcine chromosome (SSC) 17. Further promising results with moderate significance were detected on SSC 13 and on an unplaced scaffold with an older annotation on SSC 15. The PRICKLE2 and NRP2 genes were identified as candidate genes near associated SNPs. Several quantitative trait loci (QTL) have been previously described in these genomic regions, including QTL for mammary gland condition, as teat number and non-functional nipples QTL, as well as QTL for body temperature and gestation length.

  20. Genome Wide Association Identifies PPFIA1 as a Candidate Gene for Acute Lung Injury Risk Following Major Trauma

    PubMed Central

    Feng, Rui; O'Keefe, Grant E.; Bradfield, Jonathan; Ware, Lorraine B.; Christiani, David C.; Calfee, Carolyn S.; Cohen, Mitchell J.; Matthay, Michael; Meyer, Nuala J.; Kim, Cecilia; Li, Mingyao; Akey, Joshua; Barnes, Kathleen C.; Sevransky, Jonathan; Lanken, Paul N.; May, Addison K.; Aplenc, Richard; Maloney, James P.; Hakonarson, Hakon

    2012-01-01

    Acute Lung Injury (ALI) is a syndrome with high associated mortality characterized by severe hypoxemia and pulmonary infiltrates in patients with critical illness. We conducted the first investigation to use the genome wide association (GWA) approach to identify putative risk variants for ALI. Genome wide genotyping was performed using the Illumina Human Quad 610 BeadChip. We performed a two-stage GWA study followed by a third stage of functional characterization. In the discovery phase (Phase 1), we compared 600 European American trauma-associated ALI cases with 2266 European American population-based controls. We carried forward the top 1% of single nucleotide polymorphisms (SNPs) at p<0.01 to a replication phase (Phase 2) comprised of a nested case-control design sample of 212 trauma-associated ALI cases and 283 at-risk trauma non-ALI controls from ongoing cohort studies. SNPs that replicated at the 0.05 level in Phase 2 were subject to functional validation (Phase 3) using expression quantitative trait loci (eQTL) analyses in stimulated B-lymphoblastoid cell lines (B-LCL) in family trios. 159 SNPs from the discovery phase replicated in Phase 2, including loci with prior evidence for a role in ALI pathogenesis. Functional evaluation of these replicated SNPs revealed rs471931 on 11q13.3 to exert a cis-regulatory effect on mRNA expression in the PPFIA1 gene (p = 0.0021). PPFIA1 encodes liprin alpha, a protein involved in cell adhesion, integrin expression, and cell-matrix interactions. This study supports the feasibility of future multi-center GWA investigations of ALI risk, and identifies PPFIA1 as a potential functional candidate ALI risk gene for future research. PMID:22295056

  1. Genome-Wide Association for Growth Traits in Canchim Beef Cattle

    PubMed Central

    Buzanskas, Marcos E.; Grossi, Daniela A.; Ventura, Ricardo V.; Schenkel, Flávio S.; Sargolzaei, Mehdi; Meirelles, Sarah L. C.; Mokry, Fabiana B.; Higa, Roberto H.; Mudadu, Maurício A.; da Silva, Marcos V. G. Barbosa.; Niciura, Simone C. M.; Júnior, Roberto A. A. Torres.; Alencar, Maurício M.; Regitano, Luciana C. A.; Munari, Danísio P.

    2014-01-01

    Studies are being conducted on the applicability of genomic data to improve the accuracy of the selection process in livestock, and genome-wide association studies (GWAS) provide valuable information to enhance the understanding on the genetics of complex traits. The aim of this study was to identify genomic regions and genes that play roles in birth weight (BW), weaning weight adjusted for 210 days of age (WW), and long-yearling weight adjusted for 420 days of age (LYW) in Canchim cattle. GWAS were performed by means of the Generalized Quasi-Likelihood Score (GQLS) method using genotypes from the BovineHD BeadChip and estimated breeding values for BW, WW, and LYW. Data consisted of 285 animals from the Canchim breed and 114 from the MA genetic group (derived from crossings between Charolais sires and ½ Canchim + ½ Zebu dams). After applying a false discovery rate correction at a 10% significance level, a total of 4, 12, and 10 SNPs were significantly associated with BW, WW, and LYW, respectively. These SNPs were surveyed to their corresponding genes or to surrounding genes within a distance of 250 kb. The genes DPP6 (dipeptidyl-peptidase 6) and CLEC3B (C-type lectin domain family 3 member B) were highlighted, considering its functions on the development of the brain and skeletal system, respectively. The GQLS method identified regions on chromosome associated with birth weight, weaning weight, and long-yearling weight in Canchim and MA animals. New candidate regions for body weight traits were detected and some of them have interesting biological functions, of which most have not been previously reported. The observation of QTL reports for body weight traits, covering areas surrounding the genes (SNPs) herein identified provides more evidence for these associations. Future studies targeting these areas could provide further knowledge to uncover the genetic architecture underlying growth traits in Canchim cattle. PMID:24733441

  2. Genome-wide errant targeting by Hairy

    PubMed Central

    Kok, Kurtulus; Ay, Ahmet; Li, Li M; Arnosti, David N

    2015-01-01

    Metazoan transcriptional repressors regulate chromatin through diverse histone modifications. Contributions of individual factors to the chromatin landscape in development is difficult to establish, as global surveys reflect multiple changes in regulators. Therefore, we studied the conserved Hairy/Enhancer of Split family repressor Hairy, analyzing histone marks and gene expression in Drosophila embryos. This long-range repressor mediates histone acetylation and methylation in large blocks, with highly context-specific effects on target genes. Most strikingly, Hairy exhibits biochemical activity on many loci that are uncoupled to changes in gene expression. Rather than representing inert binding sites, as suggested for many eukaryotic factors, many regions are targeted errantly by Hairy to modify the chromatin landscape. Our findings emphasize that identification of active cis-regulatory elements must extend beyond the survey of prototypical chromatin marks. We speculate that this errant activity may provide a path for creation of new regulatory elements, facilitating the evolution of novel transcriptional circuits. DOI: http://dx.doi.org/10.7554/eLife.06394.001 PMID:26305409

  3. Genome-wide signatures of male-mediated migration shaping the Indian gene pool.

    PubMed

    ArunKumar, GaneshPrasad; Tatarinova, Tatiana V; Duty, Jeff; Rollo, Debra; Syama, Adhikarla; Arun, Varatharajan Santhakumari; Kavitha, Valampuri John; Triska, Petr; Greenspan, Bennett; Wells, R Spencer; Pitchappan, Ramasamy

    2015-09-01

    Multiple questions relating to contributions of cultural and demographical factors in the process of human geographical dispersal remain largely unanswered. India, a land of early human settlement and the resulting diversity is a good place to look for some of the answers. In this study, we explored the genetic structure of India using a diverse panel of 78 males genotyped using the GenoChip. Their genome-wide single-nucleotide polymorphism (SNP) diversity was examined in the context of various covariates that influence Indian gene pool. Admixture analysis of genome-wide SNP data showed high proportion of the Southwest Asian component in all of the Indian samples. Hierarchical clustering based on admixture proportions revealed seven distinct clusters correlating to geographical and linguistic affiliations. Convex hull overlay of Y-chromosomal haplogroups on the genome-wide SNP principal component analysis brought out distinct non-overlapping polygons of F*-M89, H*-M69, L1-M27, O2a-M95 and O3a3c1-M117, suggesting a male-mediated migration and expansion of the Indian gene pool. Lack of similar correlation with mitochondrial DNA clades indicated a shared genetic ancestry of females. We suggest that ancient male-mediated migratory events and settlement in various regional niches led to the present day scenario and peopling of India.

  4. Genome-wide signatures of male-mediated migration shaping the Indian gene pool.

    PubMed

    ArunKumar, GaneshPrasad; Tatarinova, Tatiana V; Duty, Jeff; Rollo, Debra; Syama, Adhikarla; Arun, Varatharajan Santhakumari; Kavitha, Valampuri John; Triska, Petr; Greenspan, Bennett; Wells, R Spencer; Pitchappan, Ramasamy

    2015-09-01

    Multiple questions relating to contributions of cultural and demographical factors in the process of human geographical dispersal remain largely unanswered. India, a land of early human settlement and the resulting diversity is a good place to look for some of the answers. In this study, we explored the genetic structure of India using a diverse panel of 78 males genotyped using the GenoChip. Their genome-wide single-nucleotide polymorphism (SNP) diversity was examined in the context of various covariates that influence Indian gene pool. Admixture analysis of genome-wide SNP data showed high proportion of the Southwest Asian component in all of the Indian samples. Hierarchical clustering based on admixture proportions revealed seven distinct clusters correlating to geographical and linguistic affiliations. Convex hull overlay of Y-chromosomal haplogroups on the genome-wide SNP principal component analysis brought out distinct non-overlapping polygons of F*-M89, H*-M69, L1-M27, O2a-M95 and O3a3c1-M117, suggesting a male-mediated migration and expansion of the Indian gene pool. Lack of similar correlation with mitochondrial DNA clades indicated a shared genetic ancestry of females. We suggest that ancient male-mediated migratory events and settlement in various regional niches led to the present day scenario and peopling of India. PMID:25994871

  5. Assessing statistical significance in multivariable genome wide association analysis

    PubMed Central

    Buzdugan, Laura; Kalisch, Markus; Navarro, Arcadi; Schunk, Daniel; Fehr, Ernst; Bühlmann, Peter

    2016-01-01

    Motivation: Although Genome Wide Association Studies (GWAS) genotype a very large number of single nucleotide polymorphisms (SNPs), the data are often analyzed one SNP at a time. The low predictive power of single SNPs, coupled with the high significance threshold needed to correct for multiple testing, greatly decreases the power of GWAS. Results: We propose a procedure in which all the SNPs are analyzed in a multiple generalized linear model, and we show its use for extremely high-dimensional datasets. Our method yields P-values for assessing significance of single SNPs or groups of SNPs while controlling for all other SNPs and the family wise error rate (FWER). Thus, our method tests whether or not a SNP carries any additional information about the phenotype beyond that available by all the other SNPs. This rules out spurious correlations between phenotypes and SNPs that can arise from marginal methods because the ‘spuriously correlated’ SNP merely happens to be correlated with the ‘truly causal’ SNP. In addition, the method offers a data driven approach to identifying and refining groups of SNPs that jointly contain informative signals about the phenotype. We demonstrate the value of our method by applying it to the seven diseases analyzed by the Wellcome Trust Case Control Consortium (WTCCC). We show, in particular, that our method is also capable of finding significant SNPs that were not identified in the original WTCCC study, but were replicated in other independent studies. Availability and implementation: Reproducibility of our research is supported by the open-source Bioconductor package hierGWAS. Contact: peter.buehlmann@stat.math.ethz.ch Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27153677

  6. The National Longitudinal Study of Adolescent to Adult Health (Add Health) sibling pairs genome-wide data.

    PubMed

    McQueen, Matthew B; Boardman, Jason D; Domingue, Benjamin W; Smolen, Andrew; Tabor, Joyce; Killeya-Jones, Ley; Halpern, Carolyn T; Whitsel, Eric A; Harris, Kathleen Mullan

    2015-01-01

    Here we provide a detailed description of the genome-wide information available on the National Longitudinal Study of Adolescent to Adult Health (Add Health) sibling pair subsample (Harris et al. in Twin Res Hum Genet 16:391-398, 2013). A total of 2,020 samples were genotyped (including duplicates) arising from 1946 Add Health individuals from the sibling pairs subsample. After various steps for quality control (QC) and quality assurance (QA), we have high quality genome-wide data available on 1,888 individuals. In this report, we first highlight the QC and QA steps that were taken to prune the data of poorly performing samples and genetic markers. We further estimate the pairwise biological relationships using genome-wide data and compare those estimates to the assumed relationships in Add Health. Additionally, using genome-wide data from known regional reference populations from Europe, West Africa, North and South America, Japan and China, we estimate the relative genetic ancestry of the respondents. Finally, rather than conducting a traditional cross-sectional genome-wide association study (GWAS) of body mass index (BMI), we opted to utilize the extensive publicly available genome-wide information to conduct a weighted GWAS of longitudinal BMI while accounting for both family and ethnic variation.

  7. Mosaicism for genome-wide paternal uniparental disomy with features of multiple imprinting disorders: diagnostic and management issues.

    PubMed

    Inbar-Feigenberg, Michal; Choufani, Sanaa; Cytrynbaum, Cheryl; Chen, Yi-An; Steele, Leslie; Shuman, Cheryl; Ray, Peter N; Weksberg, Rosanna

    2013-01-01

    Mosaicism for genome-wide paternal uniparental disomy (UPD) has been reported in only seven live born individuals to date. Clinical presentation includes manifestations of multiple paternal UPD syndromes with high variability, likely due to the variable levels of mosaicism in different somatic tissues. We report an eighth case in a female patient with mosaicism for genome-wide paternal UPD which highlights the complex clinical presentation. Our patient had features of Beckwith-Wiedemann syndrome (BWS), Angelman syndrome, and congenital hyperinsulinism. The clinical findings included prematurity, organomegaly, hemihyperplasia, developmental delay, benign tumors, and cystic lesions. The diagnosis in our patient was established utilizing microarray-based genome-wide DNA methylation analysis performed on leukocyte DNA. Targeted multiplex ligation-dependent probe amplification (MLPA) analysis of chromosome regions 11p15 and 15q13 confirmed mosaicism for paternal UPD at these genomic regions. This case represents the first report of microarray-based genome-wide DNA methylation analysis in the diagnosis of genome-wide paternal UPD. The application of microarray-based genome-wide DNA methylation analysis on selected individuals with complex clinical presentations could be a valuable diagnostic tool to improve the detection rate of mosaic genome-wide paternal UPD. This approach, which screens many loci simultaneously, is more cost-effective and less labor-intensive than performing multiple targeted DNA methylation-based assays. Identification of individuals with mosaicism for genome-wide paternal UPD is an important goal as it confers a low recurrence risk for the family and identifies individuals who require surveillance due to increased tumor risk.

  8. A novel statistic for genome-wide interaction analysis.

    PubMed

    Wu, Xuesen; Dong, Hua; Luo, Li; Zhu, Yun; Peng, Gang; Reveille, John D; Xiong, Momiao

    2010-09-23

    Although great progress in genome-wide association studies (GWAS) has been made, the significant SNP associations identified by GWAS account for only a few percent of the genetic variance, leading many to question where and how we can find the missing heritability. There is increasing interest in genome-wide interaction analysis as a possible source of finding heritability unexplained by current GWAS. However, the existing statistics for testing interaction have low power for genome-wide interaction analysis. To meet challenges raised by genome-wide interactional analysis, we have developed a novel statistic for testing interaction between two loci (either linked or unlinked). The null distribution and the type I error rates of the new statistic for testing interaction are validated using simulations. Extensive power studies show that the developed statistic has much higher power to detect interaction than classical logistic regression. The results identified 44 and 211 pairs of SNPs showing significant evidence of interactions with FDR<0.001 and 0.001genome-wide interaction analysis is a valuable tool for finding remaining missing heritability unexplained by the current GWAS, and the developed novel statistic is able to search significant interaction between SNPs across the genome. Real data analysis showed that the results of genome-wide interaction analysis can be replicated in two independent studies.

  9. A GENOME-WIDE LINKAGE AND ASSOCIATION SCAN REVEALS NOVEL LOCI FOR AUTISM

    PubMed Central

    Weiss, Lauren A.; Arking, Dan E.

    2009-01-01

    Summary Although autism is a highly heritable neurodevelopmental disorder, attempts to identify specific susceptibility genes have thus far met with limited success 1. Genome-wide association studies (GWAS) using half a million or more markers, particularly those with very large sample sizes achieved through meta-analysis, have shown great success in mapping genes for other complex genetic traits (http://www.genome.gov/26525384). Consequently, we initiated a linkage and association mapping study using half a million genome-wide SNPs in a common set of 1,031 multiplex autism families (1,553 affected offspring). We identified regions of suggestive and significant linkage on chromosomes 6q27 and 20p13, respectively. Initial analysis did not yield genome-wide significant associations; however, genotyping of top hits in additional families revealed a SNP on chromosome 5p15 (between SEMA5A and TAS2R1) that was significantly associated with autism (P = 2 × 10−7). We also demonstrated that expression of SEMA5A is reduced in brains from autistic patients, further implicating SEMA5A as an autism susceptibility gene. The linkage regions reported here provide targets for rare variation screening while the discovery of a single novel association demonstrates the action of common variants. PMID:19812673

  10. A genome-wide linkage and association scan reveals novel loci for autism.

    PubMed

    Weiss, Lauren A; Arking, Dan E; Daly, Mark J; Chakravarti, Aravinda

    2009-10-01

    Although autism is a highly heritable neurodevelopmental disorder, attempts to identify specific susceptibility genes have thus far met with limited success. Genome-wide association studies using half a million or more markers, particularly those with very large sample sizes achieved through meta-analysis, have shown great success in mapping genes for other complex genetic traits. Consequently, we initiated a linkage and association mapping study using half a million genome-wide single nucleotide polymorphisms (SNPs) in a common set of 1,031 multiplex autism families (1,553 affected offspring). We identified regions of suggestive and significant linkage on chromosomes 6q27 and 20p13, respectively. Initial analysis did not yield genome-wide significant associations; however, genotyping of top hits in additional families revealed an SNP on chromosome 5p15 (between SEMA5A and TAS2R1) that was significantly associated with autism (P = 2 x 10(-7)). We also demonstrated that expression of SEMA5A is reduced in brains from autistic patients, further implicating SEMA5A as an autism susceptibility gene. The linkage regions reported here provide targets for rare variation screening whereas the discovery of a single novel association demonstrates the action of common variants.

  11. Design and bioinformatics analysis of genome-wide CLIP experiments

    PubMed Central

    Wang, Tao; Xiao, Guanghua; Chu, Yongjun; Zhang, Michael Q.; Corey, David R.; Xie, Yang

    2015-01-01

    The past decades have witnessed a surge of discoveries revealing RNA regulation as a central player in cellular processes. RNAs are regulated by RNA-binding proteins (RBPs) at all post-transcriptional stages, including splicing, transportation, stabilization and translation. Defects in the functions of these RBPs underlie a broad spectrum of human pathologies. Systematic identification of RBP functional targets is among the key biomedical research questions and provides a new direction for drug discovery. The advent of cross-linking immunoprecipitation coupled with high-throughput sequencing (genome-wide CLIP) technology has recently enabled the investigation of genome-wide RBP–RNA binding at single base-pair resolution. This technology has evolved through the development of three distinct versions: HITS-CLIP, PAR-CLIP and iCLIP. Meanwhile, numerous bioinformatics pipelines for handling the genome-wide CLIP data have also been developed. In this review, we discuss the genome-wide CLIP technology and focus on bioinformatics analysis. Specifically, we compare the strengths and weaknesses, as well as the scopes, of various bioinformatics tools. To assist readers in choosing optimal procedures for their analysis, we also review experimental design and procedures that affect bioinformatics analyses. PMID:25958398

  12. Genome-wide association mapping of soybean aphid resistance traits

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Soybean aphid is the most damaging insect pest of soybean in the Upper Midwest and is primarily controlled by insecticides. Soybean aphid resistance (i.e., Rag genes) has been documented in some soybean lines at chromosomes 6, 7, 13, and 16, but more sources of resistance are needed. Genome-wide ass...

  13. A super powerful method for genome wide association study

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genome-Wide Association Studies shed light on the identification of genes underlying human diseases and agriculturally important traits. This potential has been shadowed by false positive findings. The Mixed Linear Model (MLM) method is flexible enough to simultaneously incorporate population struct...

  14. Massively expedited genome-wide heritability analysis (MEGHA).

    PubMed

    Ge, Tian; Nichols, Thomas E; Lee, Phil H; Holmes, Avram J; Roffman, Joshua L; Buckner, Randy L; Sabuncu, Mert R; Smoller, Jordan W

    2015-02-24

    The discovery and prioritization of heritable phenotypes is a computational challenge in a variety of settings, including neuroimaging genetics and analyses of the vast phenotypic repositories in electronic health record systems and population-based biobanks. Classical estimates of heritability require twin or pedigree data, which can be costly and difficult to acquire. Genome-wide complex trait analysis is an alternative tool to compute heritability estimates from unrelated individuals, using genome-wide data that are increasingly ubiquitous, but is computationally demanding and becomes difficult to apply in evaluating very large numbers of phenotypes. Here we present a fast and accurate statistical method for high-dimensional heritability analysis using genome-wide SNP data from unrelated individuals, termed massively expedited genome-wide heritability analysis (MEGHA) and accompanying nonparametric sampling techniques that enable flexible inferences for arbitrary statistics of interest. MEGHA produces estimates and significance measures of heritability with several orders of magnitude less computational time than existing methods, making heritability-based prioritization of millions of phenotypes based on data from unrelated individuals tractable for the first time to our knowledge. As a demonstration of application, we conducted heritability analyses on global and local morphometric measurements derived from brain structural MRI scans, using genome-wide SNP data from 1,320 unrelated young healthy adults of non-Hispanic European ancestry. We also computed surface maps of heritability for cortical thickness measures and empirically localized cortical regions where thickness measures were significantly heritable. Our analyses demonstrate the unique capability of MEGHA for large-scale heritability-based screening and high-dimensional heritability profile construction.

  15. Genome-wide association studies for hematological traits in Chinese Sutai pigs

    PubMed Central

    2014-01-01

    Background It has been shown that hematological traits are strongly associated with the metabolism and the immune system in domestic pig. However, little is known about the genetic architecture of hematological traits. To identify quantitative trait loci (QTL) controlling hematological traits, we performed single marker Genome-wide association studies (GWAS) and haplotype analysis for 15 hematological traits in 495 Chinese Sutai pigs. Results We identified 161 significant SNPs including 44 genome-wide significant SNPs associated with 11 hematological traits by single marker GWAS. Most of them were located on SSC2. Meanwhile, we detected 499 significant SNPs containing 154 genome-wide significant SNPs associated with 9 hematological traits by haplotype analysis. Most of the identified loci were located on SSC7 and SSC9. Conclusions We detected 4 SNPs with pleiotropic effects on SSC2 by single marker GWAS and (or) on SSC7 by haplotype analysis. Furthermore, through checking the gene functional annotations, positions and their expression variation, we finally selected 7 genes as potential candidates. Specially, we found that three genes (TRIM58, TRIM26 and TRIM21) of them originated from the same gene family and executed similar function of innate and adaptive immune. The findings will contribute to dissection the immune gene network, further identification of causative mutations underlying the identified QTLs and providing insights into the molecular basis of hematological trait in domestic pig. PMID:24674592

  16. Replicability and robustness of genome-wide-association studies for behavioral traits.

    PubMed

    Rietveld, Cornelius A; Conley, Dalton; Eriksson, Nicholas; Esko, Tõnu; Medland, Sarah E; Vinkhuyzen, Anna A E; Yang, Jian; Boardman, Jason D; Chabris, Christopher F; Dawes, Christopher T; Domingue, Benjamin W; Hinds, David A; Johannesson, Magnus; Kiefer, Amy K; Laibson, David; Magnusson, Patrik K E; Mountain, Joanna L; Oskarsson, Sven; Rostapshova, Olga; Teumer, Alexander; Tung, Joyce Y; Visscher, Peter M; Benjamin, Daniel J; Cesarini, David; Koellinger, Philipp D

    2014-11-01

    A recent genome-wide-association study of educational attainment identified three single-nucleotide polymorphisms (SNPs) whose associations, despite their small effect sizes (each R (2) ≈ 0.02%), reached genome-wide significance (p < 5 × 10(-8)) in a large discovery sample and were replicated in an independent sample (p < .05). The study also reported associations between educational attainment and indices of SNPs called "polygenic scores." In three studies, we evaluated the robustness of these findings. Study 1 showed that the associations with all three SNPs were replicated in another large (N = 34,428) independent sample. We also found that the scores remained predictive (R (2) ≈ 2%) in regressions with stringent controls for stratification (Study 2) and in new within-family analyses (Study 3). Our results show that large and therefore well-powered genome-wide-association studies can identify replicable genetic associations with behavioral traits. The small effect sizes of individual SNPs are likely to be a major contributing factor explaining the striking contrast between our results and the disappointing replication record of most candidate-gene studies.

  17. Genome-wide patterns of identity-by-descent sharing in the French Canadian founder population

    PubMed Central

    Gauvin, Héloïse; Moreau, Claudia; Lefebvre, Jean-François; Laprise, Catherine; Vézina, Hélène; Labuda, Damian; Roy-Gagnon, Marie-Hélène

    2014-01-01

    In genetics the ability to accurately describe the familial relationships among a group of individuals can be very useful. Recent statistical tools succeeded in assessing the degree of relatedness up to 6–7 generations with good power using dense genome-wide single-nucleotide polymorphism data to estimate the extent of identity-by-descent (IBD) sharing. It is therefore important to describe genome-wide patterns of IBD sharing for more remote and complex relatedness between individuals, such as that observed in a founder population like Quebec, Canada. Taking advantage of the extended genealogical records of the French Canadian founder population, we first compared different tools to identify regions of IBD in order to best describe genome-wide IBD sharing and its correlation with genealogical characteristics. Results showed that the extent of IBD sharing identified with FastIBD correlates best with relatedness measured using genealogical data. Total length of IBD sharing explained 85% of the genealogical kinship's variance. In addition, we observed significantly higher sharing in pairs of individuals with at least one inbred ancestor compared with those without any. Furthermore, patterns of IBD sharing and average sharing were different across regional populations, consistent with the settlement history of Quebec. Our results suggest that, as expected, the complex relatedness present in founder populations is reflected in patterns of IBD sharing. Using these patterns, it is thus possible to gain insight on the types of distant relationships in a sample from a founder population like Quebec. PMID:24129432

  18. Meta-analysis of genome-wide association studies of attention deficit/hyperactivity disorder

    PubMed Central

    Neale, Benjamin M; Medland, Sarah E.; Ripke, Stephan; Asherson, Philip; Franke, Barbara; Lesch, Klaus-Peter; Faraone, Stephen V.; Nguyen, Thuy Trang; Schäfer, Helmut; Holmans, Peter; Daly, Mark; Steinhausen, Hans-Christoph; Freitag, Christine; Reif, Andreas; Renner, Tobias J.; Romanos, Marcel; Romanos, Jasmin; Walitza, Susanne; Warnke, Andreas; Meyer, Jobst; Palmason, Haukur; Buitelaar, Jan; Vasquez, Alejandro Arias; Lambregts-Rommelse, Nanda; Gill, Michael; Anney, Richard J.L.; Langely, Kate; O’Donovan, Michael; Williams, Nigel; Owen, Michael; Thapar, Anita; Kent, Lindsey; Sergeant, Joseph; Roeyers, Herbert; Mick, Eric; Biederman, Joseph; Doyle, Alysa; Smalley, Susan; Loo, Sandra; Hakonarson, Hakon; Elia, Josephine; Todorov, Alexandre; Miranda, Ana; Mulas, Fernando; Ebstein, Richard P.; Rothenberger, Aribert; Banaschewski, Tobias; Oades, Robert D.; Sonuga-Barke, Edmund; McGough, James; Nisenbaum, Laura; Middleton, Frank; Hu, Xiaolan; Nelson, Stan

    2010-01-01

    Objective Although twin and family studies have shown Attention Deficit/Hyperactivity Disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. As prior genome-wide association scans (GWAS) have not yielded significant results, we conducted a meta-analysis of existing studies to boost statistical power. Method We used data from four projects: a) the Children’s Hospital of Philadelphia (CHOP), b) phase I of the International Multicenter ADHD Genetics project (IMAGE), c) phase II of IMAGE (IMAGE II), and d) the Pfizer funded study from the University of California, Los Angeles, Washington University and the Massachusetts General Hospital (PUWMa). The final sample size consisted of 2,064 trios, 896 cases and 2,455 controls. For each study, we imputed HapMap SNPs, computed association test statistics and transformed them to Z-scores, and then combined weighted Z-scores in a meta-analysis. Results No genome-wide significant associations were found, although an analysis of candidate genes suggests they may be involved in the disorder. Conclusions Given that ADHD is a highly heritable disorder, our negative results suggest that the effects of common ADHD risk variants must, individually, be very small or that other types of variants, e.g. rare ones, account for much of the disorder’s heritability. PMID:20732625

  19. Revisiting the classification of curtoviruses based on genome-wide pairwise identity.

    PubMed

    Varsani, Arvind; Martin, Darren P; Navas-Castillo, Jesús; Moriones, Enrique; Hernández-Zepeda, Cecilia; Idris, Ali; Murilo Zerbini, F; Brown, Judith K

    2014-07-01

    Members of the genus Curtovirus (family Geminiviridae) are important pathogens of many wild and cultivated plant species. Until recently, relatively few full curtovirus genomes have been characterised. However, with the 19 full genome sequences now available in public databases, we revisit the proposed curtovirus species and strain classification criteria. Using pairwise identities coupled with phylogenetic evidence, revised species and strain demarcation guidelines have been instituted. Specifically, we have established 77 % genome-wide pairwise identity as a species demarcation threshold and 94 % genome-wide pairwise identity as a strain demarcation threshold. Hence, whereas curtovirus sequences with >77 % genome-wide pairwise identity would be classified as belonging to the same species, those sharing >94 % identity would be classified as belonging to the same strain. We provide step-by-step guidelines to facilitate the classification of newly discovered curtovirus full genome sequences and a set of defined criteria for naming new species and strains. The revision yields three curtovirus species: Beet curly top virus (BCTV), Spinach severe surly top virus (SpSCTV) and Horseradish curly top virus (HrCTV). PMID:24463952

  20. Genome-wide meta-analysis of longitudinal alcohol consumption across youth and early adulthood

    PubMed Central

    Adkins, Daniel E.; Clark, Shaunna L.; Copeland, William E.; Kennedy, Martin; Conway, Kevin; Angold, Adrian; Maes, Hermine; Liu, Youfang; Kumar, Gaurav; Erkanli, Alaattin; Patkar, Ashwin A.; Silberg, Judy; Brown, Tyson H.; Fergusson, David M.; Horwood, L. John; Eaves, Lindon; van den Oord, Edwin J.C.G.; Sullivan, Patrick F.; Costello, E. J.

    2016-01-01

    The public health burden of alcohol is unevenly distributed across the life course, with levels of use, abuse and dependence increasing across adolescence and peaking in early adulthood. Here we leverage this temporal patterning to search for common genetic variants predicting developmental trajectories of alcohol consumption. Comparable psychiatric evaluations measuring alcohol consumption were collected in three, longitudinal community samples (N=2,126, obs=12,166). Consumption repeated measurements spanning adolescence and early adulthood were analyzed using linear mixed models, estimating individual consumption trajectories, which were then tested for association with Illumina 660W-Quad genotype data (866,099 SNPs after imputation and QC). Association results were combined across samples using standard meta-analysis methods. Four meta-analysis associations satisfied our pre-determined genome-wide significance criterion (FDR<0.1) and 6 others met our “suggestive” criterion (FDR<0.2). Genome-wide significant associations were highly biological plausible, including associations within GABA transporter 1, SLC6A1 (solute carrier family 6, member 1), and exonic hits in LOC100129340 (mitofusin-1-like). Pathway analyses elaborated single marker results, indicating significant enriched associations to intuitive biological mechanisms including neurotransmission, xenobiotic pharmacodynamics and nuclear hormone receptors. These findings underscore the value of combining longitudinal behavioral data and genome-wide genotype information in order to study developmental patterns and improve statistical power in genomic studies. PMID:26081443

  1. Identifying Human Genome-Wide CNV, LOH and UPD by Targeted Sequencing of Selected Regions.

    PubMed

    Wang, Yu; Li, Wei; Xia, Yingying; Wang, Chongzhi; Tang, Y Tom; Guo, Wenying; Li, Jinliang; Zhao, Xia; Sun, Yepeng; Hu, Juan; Zhen, Hefu; Zhang, Xiandong; Chen, Chao; Shi, Yujian; Li, Lin; Cao, Hongzhi; Du, Hongli; Li, Jian

    2014-01-01

    Copy-number variations (CNV), loss of heterozygosity (LOH), and uniparental disomy (UPD) are large genomic aberrations leading to many common inherited diseases, cancers, and other complex diseases. An integrated tool to identify these aberrations is essential in understanding diseases and in designing clinical interventions. Previous discovery methods based on whole-genome sequencing (WGS) require very high depth of coverage on the whole genome scale, and are cost-wise inefficient. Another approach, whole exome genome sequencing (WEGS), is limited to discovering variations within exons. Thus, we are lacking efficient methods to detect genomic aberrations on the whole genome scale using next-generation sequencing technology. Here we present a method to identify genome-wide CNV, LOH and UPD for the human genome via selectively sequencing a small portion of genome termed Selected Target Regions (SeTRs). In our experiments, the SeTRs are covered by 99.73%~99.95% with sufficient depth. Our developed bioinformatics pipeline calls genome-wide CNVs with high confidence, revealing 8 credible events of LOH and 3 UPD events larger than 5M from 15 individual samples. We demonstrate that genome-wide CNV, LOH and UPD can be detected using a cost-effective SeTRs sequencing approach, and that LOH and UPD can be identified using just a sample grouping technique, without using a matched sample or familial information. PMID:25919136

  2. Genome-wide patterns of identity-by-descent sharing in the French Canadian founder population.

    PubMed

    Gauvin, Héloïse; Moreau, Claudia; Lefebvre, Jean-François; Laprise, Catherine; Vézina, Hélène; Labuda, Damian; Roy-Gagnon, Marie-Hélène

    2014-06-01

    In genetics the ability to accurately describe the familial relationships among a group of individuals can be very useful. Recent statistical tools succeeded in assessing the degree of relatedness up to 6-7 generations with good power using dense genome-wide single-nucleotide polymorphism data to estimate the extent of identity-by-descent (IBD) sharing. It is therefore important to describe genome-wide patterns of IBD sharing for more remote and complex relatedness between individuals, such as that observed in a founder population like Quebec, Canada. Taking advantage of the extended genealogical records of the French Canadian founder population, we first compared different tools to identify regions of IBD in order to best describe genome-wide IBD sharing and its correlation with genealogical characteristics. Results showed that the extent of IBD sharing identified with FastIBD correlates best with relatedness measured using genealogical data. Total length of IBD sharing explained 85% of the genealogical kinship's variance. In addition, we observed significantly higher sharing in pairs of individuals with at least one inbred ancestor compared with those without any. Furthermore, patterns of IBD sharing and average sharing were different across regional populations, consistent with the settlement history of Quebec. Our results suggest that, as expected, the complex relatedness present in founder populations is reflected in patterns of IBD sharing. Using these patterns, it is thus possible to gain insight on the types of distant relationships in a sample from a founder population like Quebec.

  3. Replicability and robustness of genome-wide-association studies for behavioral traits.

    PubMed

    Rietveld, Cornelius A; Conley, Dalton; Eriksson, Nicholas; Esko, Tõnu; Medland, Sarah E; Vinkhuyzen, Anna A E; Yang, Jian; Boardman, Jason D; Chabris, Christopher F; Dawes, Christopher T; Domingue, Benjamin W; Hinds, David A; Johannesson, Magnus; Kiefer, Amy K; Laibson, David; Magnusson, Patrik K E; Mountain, Joanna L; Oskarsson, Sven; Rostapshova, Olga; Teumer, Alexander; Tung, Joyce Y; Visscher, Peter M; Benjamin, Daniel J; Cesarini, David; Koellinger, Philipp D

    2014-11-01

    A recent genome-wide-association study of educational attainment identified three single-nucleotide polymorphisms (SNPs) whose associations, despite their small effect sizes (each R (2) ≈ 0.02%), reached genome-wide significance (p < 5 × 10(-8)) in a large discovery sample and were replicated in an independent sample (p < .05). The study also reported associations between educational attainment and indices of SNPs called "polygenic scores." In three studies, we evaluated the robustness of these findings. Study 1 showed that the associations with all three SNPs were replicated in another large (N = 34,428) independent sample. We also found that the scores remained predictive (R (2) ≈ 2%) in regressions with stringent controls for stratification (Study 2) and in new within-family analyses (Study 3). Our results show that large and therefore well-powered genome-wide-association studies can identify replicable genetic associations with behavioral traits. The small effect sizes of individual SNPs are likely to be a major contributing factor explaining the striking contrast between our results and the disappointing replication record of most candidate-gene studies. PMID:25287667

  4. Meta-analysis of 32 genome-wide linkage studies of schizophrenia

    PubMed Central

    Ng, MYM; Levinson, DF; Faraone, SV; Suarez, BK; DeLisi, LE; Arinami, T; Riley, B; Paunio, T; Pulver, AE; Irmansyah; Holmans, PA; Escamilla, M; Wildenauer, DB; Williams, NM; Laurent, C; Mowry, BJ; Brzustowicz, LM; Maziade, M; Sklar, P; Garver, DL; Abecasis, GR; Lerer, B; Fallin, MD; Gurling, HMD; Gejman, PV; Lindholm, E; Moises, HW; Byerley, W; Wijsman, EM; Forabosco, P; Tsuang, MT; Hwu, H-G; Okazaki, Y; Kendler, KS; Wormley, B; Fanous, A; Walsh, D; O’Neill, FA; Peltonen, L; Nestadt, G; Lasseter, VK; Liang, KY; Papadimitriou, GM; Dikeos, DG; Schwab, SG; Owen, MJ; O’Donovan, MC; Norton, N; Hare, E; Raventos, H; Nicolini, H; Albus, M; Maier, W; Nimgaonkar, VL; Terenius, L; Mallet, J; Jay, M; Godard, S; Nertney, D; Alexander, M; Crowe, RR; Silverman, JM; Bassett, AS; Roy, M-A; Mérette, C; Pato, CN; Pato, MT; Roos, J Louw; Kohn, Y; Amann-Zalcenstein, D; Kalsi, G; McQuillin, A; Curtis, D; Brynjolfson, J; Sigmundsson, T; Petursson, H; Sanders, AR; Duan, J; Jazin, E; Myles-Worsley, M; Karayiorgou, M; Lewis, CM

    2009-01-01

    A genome scan meta-analysis (GSMA) was carried out on 32 independent genome-wide linkage scan analyses that included 3255 pedigrees with 7413 genotyped cases affected with schizophrenia (SCZ) or related disorders. The primary GSMA divided the autosomes into 120 bins, rank-ordered the bins within each study according to the most positive linkage result in each bin, summed these ranks (weighted for study size) for each bin across studies and determined the empirical probability of a given summed rank (PSR) by simulation. Suggestive evidence for linkage was observed in two single bins, on chromosomes 5q (142-168 Mb) and 2q (103-134 Mb). Genome-wide evidence for linkage was detected on chromosome 2q (119-152 Mb) when bin boundaries were shifted to the middle of the previous bins. The primary analysis met empirical criteria for ‘aggregate’ genome-wide significance, indicating that some or all of 10 bins are likely to contain loci linked to SCZ, including regions of chromosomes 1, 2q, 3q, 4q, 5q, 8p and 10q. In a secondary analysis of 22 studies of European-ancestry samples, suggestive evidence for linkage was observed on chromosome 8p (16-33 Mb). Although the newer genome-wide association methodology has greater power to detect weak associations to single common DNA sequence variants, linkage analysis can detect diverse genetic effects that segregate in families, including multiple rare variants within one locus or several weakly associated loci in the same region. Therefore, the regions supported by this meta-analysis deserve close attention in future studies. PMID:19349958

  5. The genetic architecture of seed composition in soybean is refined by genome-wide association scans across multiple populations

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Soybean oil and meal are major contributors to world-wide food production. Consequently, the genetic basis for soybean seed composition has been intensely studied using family-based mapping. Population-based mapping approaches, in the form of genome-wide association (GWA) scans, have been able to re...

  6. Genome-wide patterns of selection in 230 ancient Eurasians.

    PubMed

    Mathieson, Iain; Lazaridis, Iosif; Rohland, Nadin; Mallick, Swapan; Patterson, Nick; Roodenberg, Songül Alpaslan; Harney, Eadaoin; Stewardson, Kristin; Fernandes, Daniel; Novak, Mario; Sirak, Kendra; Gamba, Cristina; Jones, Eppie R; Llamas, Bastien; Dryomov, Stanislav; Pickrell, Joseph; Arsuaga, Juan Luís; de Castro, José María Bermúdez; Carbonell, Eudald; Gerritsen, Fokke; Khokhlov, Aleksandr; Kuznetsov, Pavel; Lozano, Marina; Meller, Harald; Mochalov, Oleg; Moiseyev, Vyacheslav; Guerra, Manuel A Rojo; Roodenberg, Jacob; Vergès, Josep Maria; Krause, Johannes; Cooper, Alan; Alt, Kurt W; Brown, Dorcas; Anthony, David; Lalueza-Fox, Carles; Haak, Wolfgang; Pinhasi, Ron; Reich, David

    2015-12-24

    Ancient DNA makes it possible to observe natural selection directly by analysing samples from populations before, during and after adaptation events. Here we report a genome-wide scan for selection using ancient DNA, capitalizing on the largest ancient DNA data set yet assembled: 230 West Eurasians who lived between 6500 and 300 bc, including 163 with newly reported data. The new samples include, to our knowledge, the first genome-wide ancient DNA from Anatolian Neolithic farmers, whose genetic material we obtained by extracting from petrous bones, and who we show were members of the population that was the source of Europe's first farmers. We also report a transect of the steppe region in Samara between 5600 and 300 bc, which allows us to identify admixture into the steppe from at least two external sources. We detect selection at loci associated with diet, pigmentation and immunity, and two independent episodes of selection on height. PMID:26595274

  7. Genome wide copy number analysis of single cells

    PubMed Central

    Baslan, Timour; Kendall, Jude; Rodgers, Linda; Cox, Hilary; Riggs, Mike; Stepansky, Asya; Troge, Jennifer; Ravi, Kandasamy; Esposito, Diane; Lakshmi, B.; Wigler, Michael; Navin, Nicholas; Hicks, James

    2016-01-01

    Summary Copy number variation (CNV) is increasingly recognized as an important contributor to phenotypic variation in health and disease. Most methods for determining CNV rely on admixtures of cells, where information regarding genetic heterogeneity is lost. Here, we present a protocol that allows for the genome wide copy number analysis of single nuclei isolated from mixed populations of cells. Single nucleus sequencing (SNS), combines flow sorting of single nuclei based on DNA content, whole genome amplification (WGA), followed by next generation sequencing to quantize genomic intervals in a genome wide manner. Multiplexing of single cells is discussed. Additionally, we outline informatic approaches that correct for biases inherent in the WGA procedure and allow for accurate determination of copy number profiles. All together, the protocol takes ~3 days from flow cytometry to sequence-ready DNA libraries. PMID:22555242

  8. Genome-wide association studies in Alzheimer's disease: a review.

    PubMed

    Tosto, Giuseppe; Reitz, Christiane

    2013-10-01

    Over the past decade, research aiming to disentangle the genetic underpinnings of late-onset Alzheimer's disease has mostly focused on the identification of common variants through genome-wide association studies. The identification of several new susceptibility genes through these efforts has reinforced the importance of amyloid precursor protein and tau metabolism in the cause of the disease and has implicated immune response, inflammation, lipid metabolism, endocytosis/intracellular trafficking, and cell migration in the cause of the disease. Ongoing and future large-scale genome-wide association studies, translational studies, and next-generation whole genome or whole exome sequencing efforts, hold the promise to map the specific causative variants in these genes, to identify several additional risk variants, including rare and structural variants, and to identify novel targets for genetic testing, prevention, and treatment.

  9. Genome-wide patterns of selection in 230 ancient Eurasians

    PubMed Central

    Mathieson, Iain; Lazaridis, Iosif; Rohland, Nadin; Mallick, Swapan; Patterson, Nick; Roodenberg, Songül Alpaslan; Harney, Eadaoin; Stewardson, Kristin; Fernandes, Daniel; Novak, Mario; Sirak, Kendra; Gamba, Cristina; Jones, Eppie R.; Llamas, Bastien; Dryomov, Stanislav; Pickrel, Joseph; Arsuaga, Juan Luís; de Castro, José María Bermúdez; Carbonell, Eudald; Gerritsen, Fokke; Khokhlov, Aleksandr; Kuznetsov, Pavel; Lozano, Marina; Meller, Harald; Mochalov, Oleg; Moiseyev, Vayacheslav; Rojo Guerra, Manuel A.; Roodenberg, Jacob; Vergès, Josep Maria; Krause, Johannes; Cooper, Alan; Alt, Kurt W.; Brown, Dorcas; Anthony, David; Lalueza-Fox, Carles; Haak, Wolfgang; Pinhasi, Ron; Reich, David

    2016-01-01

    Ancient DNA makes it possible to directly witness natural selection by analyzing samples from populations before, during and after adaptation events. Here we report the first scan for selection using ancient DNA, capitalizing on the largest genome-wide dataset yet assembled: 230 West Eurasians dating to between 6500 and 1000 BCE, including 163 with newly reported data. The new samples include the first genome-wide data from the Anatolian Neolithic culture whose genetic material we extracted from the DNA-rich petrous bone and who we show were members of the population that was the source of Europe’s first farmers. We also report a complete transect of the steppe region in Samara between 5500 and 1200 BCE that allows us to recognize admixture from at least two external sources into steppe populations during this period. We detect selection at loci associated with diet, pigmentation and immunity, and two independent episodes of selection on height. PMID:26595274

  10. Genome-Wide Significant Loci: How Important Are They?

    PubMed Central

    Björkegren, Johan L.M.; Kovacic, Jason C.; Dudley, Joel T.; Schadt, Eric E.

    2015-01-01

    Genome-wide association studies (GWAS) have been extensively used to study common complex diseases such as coronary artery disease (CAD), revealing 153 suggestive CAD loci, of which at least 46 have been validated as having genome-wide significance. However, these loci collectively explain <10% of the genetic variance in CAD. Thus, we must address the key question of what factors constitute the remaining 90% of CAD heritability. We review possible limitations of GWAS, and contextually consider some candidate CAD loci identified by this method. Looking ahead, we propose systems genetics as a complementary approach to unlocking the CAD heritability and etiology. Systems genetics builds network models of relevant molecular processes by combining genetic and genomic datasets to ultimately identify key “drivers” of disease. By leveraging systems-based genetic approaches, we can help reveal the full genetic basis of common complex disorders, enabling novel diagnostic and therapeutic opportunities. PMID:25720628

  11. Genome-wide patterns of selection in 230 ancient Eurasians.

    PubMed

    Mathieson, Iain; Lazaridis, Iosif; Rohland, Nadin; Mallick, Swapan; Patterson, Nick; Roodenberg, Songül Alpaslan; Harney, Eadaoin; Stewardson, Kristin; Fernandes, Daniel; Novak, Mario; Sirak, Kendra; Gamba, Cristina; Jones, Eppie R; Llamas, Bastien; Dryomov, Stanislav; Pickrell, Joseph; Arsuaga, Juan Luís; de Castro, José María Bermúdez; Carbonell, Eudald; Gerritsen, Fokke; Khokhlov, Aleksandr; Kuznetsov, Pavel; Lozano, Marina; Meller, Harald; Mochalov, Oleg; Moiseyev, Vyacheslav; Guerra, Manuel A Rojo; Roodenberg, Jacob; Vergès, Josep Maria; Krause, Johannes; Cooper, Alan; Alt, Kurt W; Brown, Dorcas; Anthony, David; Lalueza-Fox, Carles; Haak, Wolfgang; Pinhasi, Ron; Reich, David

    2015-12-24

    Ancient DNA makes it possible to observe natural selection directly by analysing samples from populations before, during and after adaptation events. Here we report a genome-wide scan for selection using ancient DNA, capitalizing on the largest ancient DNA data set yet assembled: 230 West Eurasians who lived between 6500 and 300 bc, including 163 with newly reported data. The new samples include, to our knowledge, the first genome-wide ancient DNA from Anatolian Neolithic farmers, whose genetic material we obtained by extracting from petrous bones, and who we show were members of the population that was the source of Europe's first farmers. We also report a transect of the steppe region in Samara between 5600 and 300 bc, which allows us to identify admixture into the steppe from at least two external sources. We detect selection at loci associated with diet, pigmentation and immunity, and two independent episodes of selection on height.

  12. Genome-wide transcriptional profiling reveals molecular signatures of secondary xylem differentiation in Populus tomentosa.

    PubMed

    Yang, X H; Li, X G; Li, B L; Zhang, D Q

    2014-11-11

    Wood formation occurs via cell division, primary cell wall and secondary wall formation, and programmed cell death in the vascular cambium. Transcriptional profiling of secondary xylem differentiation is essential for understanding the molecular mechanisms underlying wood formation. Differential gene expression in secondary xylem differentiation of Populus has been previously investigated using cDNA microarray analysis. However, little is known about the molecular mechanisms from a genome-wide perspective. In this study, the Affymetrix poplar genome chips containing 61,413 probes were used to investigate the changes in the transcriptome during secondary xylem differentiation in Chinese white poplar (Populus tomentosa). Two xylem tissues (newly formed and lignified) were sampled for genome-wide transcriptional profiling. In total, 6843 genes (~11%) were identified with differential expression in the two xylem tissues. Many genes involved in cell division, primary wall modification, and cellulose synthesis were preferentially expressed in the newly formed xylem. In contrast, many genes, including 4-coumarate:cinnamate-4-hydroxylase (C4H), 4-coumarate:CoA ligase (4CL), cinnamyl alcohol dehydrogenase (CAD), and caffeoyl CoA 3-O-methyltransferase (CCoAOMT), associated with lignin biosynthesis were more transcribed in the lignified xylem. The two xylem tissues also showed differential expression of genes related to various hormones; thus, the secondary xylem differentiation could be regulated by hormone signaling. Furthermore, many transcription factor genes were preferentially expressed in the lignified xylem, suggesting that wood lignification involves extensive transcription regulation. The genome-wide transcriptional profiling of secondary xylem differentiation could provide additional insights into the molecular basis of wood formation in poplar species.

  13. Genome-wide association studies in pediatric endocrinology.

    PubMed

    Dauber, Andrew; Hirschhorn, Joel N

    2011-01-01

    Genome-wide association (GWA) studies are a powerful tool for understanding the genetic underpinnings of human disease. In this article, we briefly review the role and findings of GWA studies in type 1 diabetes, stature, pubertal timing, obesity, and vitamin D deficiency. We then discuss the present and future implications of these findings with regards to disease prediction, uncovering basic biology, and the development of novel therapeutic agents.

  14. Genome-wide association study of relative telomere length.

    PubMed

    Prescott, Jennifer; Kraft, Peter; Chasman, Daniel I; Savage, Sharon A; Mirabello, Lisa; Berndt, Sonja I; Weissfeld, Joel L; Han, Jiali; Hayes, Richard B; Chanock, Stephen J; Hunter, David J; De Vivo, Immaculata

    2011-05-10

    Telomere function is essential to maintaining the physical integrity of linear chromosomes and healthy human aging. The probability of forming proper telomere structures depends on the length of the telomeric DNA tract. We attempted to identify common genetic variants associated with log relative telomere length using genome-wide genotyping data on 3,554 individuals from the Nurses' Health Study and the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial that took part in the National Cancer Institute Cancer Genetic Markers of Susceptibility initiative for breast and prostate cancer. After genotyping 64 independent SNPs selected for replication in additional Nurses' Health Study and Women's Genome Health Study participants, we did not identify genome-wide significant loci; however, we replicated the inverse association of log relative telomere length with the minor allele variant [C] of rs16847897 at the TERC locus (per allele β = -0.03, P = 0.003) identified by a previous genome-wide association study. We did not find evidence for an association with variants at the OBFC1 locus or other loci reported to be associated with telomere length. With this sample size we had >80% power to detect β estimates as small as ±0.10 for SNPs with minor allele frequencies of ≥0.15 at genome-wide significance. However, power is greatly reduced for β estimates smaller than ±0.10, such as those for variants at the TERC locus. In general, common genetic variants associated with telomere length homeostasis have been difficult to detect. Potential biological and technical issues are discussed.

  15. Genome-wide association study of schizophrenia in Ashkenazi Jews.

    PubMed

    Goes, Fernando S; McGrath, John; Avramopoulos, Dimitrios; Wolyniec, Paula; Pirooznia, Mehdi; Ruczinski, Ingo; Nestadt, Gerald; Kenny, Eimear E; Vacic, Vladimir; Peters, Inga; Lencz, Todd; Darvasi, Ariel; Mulle, Jennifer G; Warren, Stephen T; Pulver, Ann E

    2015-12-01

    Schizophrenia is a common, clinically heterogeneous disorder associated with lifelong morbidity and early mortality. Several genetic variants associated with schizophrenia have been identified, but the majority of the heritability remains unknown. In this study, we report on a case-control sample of Ashkenazi Jews (AJ), a founder population that may provide additional insights into genetic etiology of schizophrenia. We performed a genome-wide association analysis (GWAS) of 592 cases and 505 controls of AJ ancestry ascertained in the US. Subsequently, we performed a meta-analysis with an Israeli AJ sample of 913 cases and 1640 controls, followed by a meta-analysis and polygenic risk scoring using summary results from Psychiatric GWAS Consortium 2 schizophrenia study. The U.S. AJ sample showed strong evidence of polygenic inheritance (pseudo-R(2) ∼9.7%) and a SNP-heritability estimate of 0.39 (P = 0.00046). We found no genome-wide significant associations in the U.S. sample or in the combined US/Israeli AJ meta-analysis of 1505 cases and 2145 controls. The strongest AJ specific associations (P-values in 10(-6) -10(-7) range) were in the 22q 11.2 deletion region and included the genes TBX1, GLN1, and COMT. Supportive evidence (meta P < 1 × 10(-4) ) was also found for several previously identified genome-wide significant findings, including the HLA region, CNTN4, IMMP2L, and GRIN2A. The meta-analysis of the U.S. sample with the PGC2 results provided initial genome-wide significant evidence for six new loci. Among the novel potential susceptibility genes is PEPD, a gene involved in proline metabolism, which is associated with a Mendelian disorder characterized by developmental delay and cognitive deficits. PMID:26198764

  16. Genome-wide polymorphisms show unexpected targets of natural selection

    PubMed Central

    Pespeni, Melissa H.; Garfield, David A.; Manier, Mollie K.; Palumbi, Stephen R.

    2012-01-01

    Natural selection can act on all the expressed genes of an individual, leaving signatures of genetic differentiation or diversity at many loci across the genome. New power to assay these genome-wide effects of selection comes from associating multi-locus patterns of polymorphism with gene expression and function. Here, we performed one of the first genome-wide surveys in a marine species, comparing purple sea urchins, Strongylocentrotus purpuratus, from two distant locations along the species' wide latitudinal range. We examined 9112 polymorphic loci from upstream non-coding and coding regions of genes for signatures of selection with respect to gene function and tissue- and ontogenetic gene expression. We found that genetic differentiation (FST) varied significantly across functional gene classes. The strongest enrichment occurred in the upstream regions of E3 ligase genes, enzymes known to regulate protein abundance during development and environmental stress. We found enrichment for high heterozygosity in genes directly involved in immune response, particularly NALP genes, which mediate pro-inflammatory signals during bacterial infection. We also found higher heterozygosity in immune genes in the southern population, where disease incidence and pathogen diversity are greater. Similar to the major histocompatibility complex in mammals, balancing selection may enhance genetic diversity in the innate immune system genes of this invertebrate. Overall, our results show that how genome-wide polymorphism data coupled with growing databases on gene function and expression can combine to detect otherwise hidden signals of selection in natural populations. PMID:21993504

  17. Significance of genome-wide association studies in molecular anthropology.

    PubMed

    Gupta, Vipin; Khadgawat, Rajesh; Sachdeva, Mohinder Pal

    2009-12-01

    The successful advent of a genome-wide approach in association studies raises the hopes of human geneticists for solving a genetic maze of complex traits especially the disorders. This approach, which is replete with the application of cutting-edge technology and supported by big science projects (like Human Genome Project; and even more importantly the International HapMap Project) and various important databases (SNP database, CNV database, etc.), has had unprecedented success in rapidly uncovering many of the genetic determinants of complex disorders. The magnitude of this approach in the genetics of classical anthropological variables like height, skin color, eye color, and other genome diversity projects has certainly expanded the horizons of molecular anthropology. Therefore, in this article we have proposed a genome-wide association approach in molecular anthropological studies by providing lessons from the exemplary study of the Wellcome Trust Case Control Consortium. We have also highlighted the importance and uniqueness of Indian population groups in facilitating the design and finding optimum solutions for other genome-wide association-related challenges.

  18. Voxelwise genome-wide association study (vGWAS).

    PubMed

    Stein, Jason L; Hua, Xue; Lee, Suh; Ho, April J; Leow, Alex D; Toga, Arthur W; Saykin, Andrew J; Shen, Li; Foroud, Tatiana; Pankratz, Nathan; Huentelman, Matthew J; Craig, David W; Gerber, Jill D; Allen, April N; Corneveaux, Jason J; Dechairo, Bryan M; Potkin, Steven G; Weiner, Michael W; Thompson, Paul

    2010-11-15

    The structure of the human brain is highly heritable, and is thought to be influenced by many common genetic variants, many of which are currently unknown. Recent advances in neuroimaging and genetics have allowed collection of both highly detailed structural brain scans and genome-wide genotype information. This wealth of information presents a new opportunity to find the genes influencing brain structure. Here we explore the relation between 448,293 single nucleotide polymorphisms in each of 31,622 voxels of the entire brain across 740 elderly subjects (mean age+/-s.d.: 75.52+/-6.82 years; 438 male) including subjects with Alzheimer's disease, Mild Cognitive Impairment, and healthy elderly controls from the Alzheimer's Disease Neuroimaging Initiative (ADNI). We used tensor-based morphometry to measure individual differences in brain structure at the voxel level relative to a study-specific template based on healthy elderly subjects. We then conducted a genome-wide association at each voxel to identify genetic variants of interest. By studying only the most associated variant at each voxel, we developed a novel method to address the multiple comparisons problem and computational burden associated with the unprecedented amount of data. No variant survived the strict significance criterion, but several genes worthy of further exploration were identified, including CSMD2 and CADPS2. These genes have high relevance to brain structure. This is the first voxelwise genome wide association study to our knowledge, and offers a novel method to discover genetic influences on brain structure.

  19. Genome-wide association and genomic prediction identifies associated loci and predicts the sensitivity of Tobacco ringspot virus in soybean plant introduction

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The genome-wide association study (GWAS) is a useful tool for detecting and characterizing traits of interest including those associated with disease resistance in soybean. The availability of 50,000 single nucleotide polymorphism (SNP) markers (SoySNP50K iSelect BeadChip; www.soybase.org) on 19,652...

  20. A genome-wide analysis of putative functional and exonic variation associated with extremely high intelligence.

    PubMed

    Spain, S L; Pedroso, I; Kadeva, N; Miller, M B; Iacono, W G; McGue, M; Stergiakouli, E; Smith, G D; Putallaz, M; Lubinski, D; Meaburn, E L; Plomin, R; Simpson, M A

    2016-08-01

    Although individual differences in intelligence (general cognitive ability) are highly heritable, molecular genetic analyses to date have had limited success in identifying specific loci responsible for its heritability. This study is the first to investigate exome variation in individuals of extremely high intelligence. Under the quantitative genetic model, sampling from the high extreme of the distribution should provide increased power to detect associations. We therefore performed a case-control association analysis with 1409 individuals drawn from the top 0.0003 (IQ >170) of the population distribution of intelligence and 3253 unselected population-based controls. Our analysis focused on putative functional exonic variants assayed on the Illumina HumanExome BeadChip. We did not observe any individual protein-altering variants that are reproducibly associated with extremely high intelligence and within the entire distribution of intelligence. Moreover, no significant associations were found for multiple rare alleles within individual genes. However, analyses using genome-wide similarity between unrelated individuals (genome-wide complex trait analysis) indicate that the genotyped functional protein-altering variation yields a heritability estimate of 17.4% (s.e. 1.7%) based on a liability model. In addition, investigation of nominally significant associations revealed fewer rare alleles associated with extremely high intelligence than would be expected under the null hypothesis. This observation is consistent with the hypothesis that rare functional alleles are more frequently detrimental than beneficial to intelligence.

  1. Genome-wide association scan suggests basis for microtia in Awassi sheep.

    PubMed

    Jawasreh, K; Boettcher, P J; Stella, A

    2016-08-01

    Hereditary underdevelopment of the ear, a condition also known as microtia, has been observed in several sheep breeds as well as in humans and other species. Its genetic basis in sheep is unknown. The Awassi sheep, a breed native to southwest Asia, carries this phenotype and was targeted for molecular characterization via a genome-wide association study. DNA samples were collected from sheep in Jordan. Eight affected and 12 normal individuals were genotyped with the Illumina OvineSNP50(®) chip. Multilocus analyses failed to identify any genotypic association. In contrast, a single-locus analysis revealed a statistically significant association (P = 0.012, genome-wide) with a SNP at basepair 34 647 499 on OAR23. This marker is adjacent to the gene encoding transcription factor GATA-6, which has been shown to play a role in many developmental processes, including chondrogenesis. The lack of extended homozygosity in this region suggests a fairly ancient mutation, and the time of occurrence was estimated to be approximately 3000 years ago. Many of the earless sheep breeds may thus share the causative mutation, especially within the subgroup of fat-tailed, wool sheep.

  2. Genome-wide analysis distinguishes hyperglycemia regulated epigenetic signatures of primary vascular cells

    PubMed Central

    Pirola, Luciano; Balcerczyk, Aneta; Tothill, Richard W.; Haviv, Izhak; Kaspi, Antony; Lunke, Sebastian; Ziemann, Mark; Karagiannis, Tom; Tonna, Stephen; Kowalczyk, Adam; Beresford-Smith, Bryan; Macintyre, Geoff; Kelong, Ma; Hongyu, Zhang; Zhu, Jingde; El-Osta, Assam

    2011-01-01

    Emerging evidence suggests that poor glycemic control mediates post-translational modifications to the H3 histone tail. We are only beginning to understand the dynamic role of some of the diverse epigenetic changes mediated by hyperglycemia at single loci, yet elevated glucose levels are thought to regulate genome-wide changes, and this still remains poorly understood. In this article we describe genome-wide histone H3K9/K14 hyperacetylation and DNA methylation maps conferred by hyperglycemia in primary human vascular cells. Chromatin immunoprecipitation (ChIP) as well as CpG methylation (CpG) assays, followed by massive parallel sequencing (ChIP-seq and CpG-seq) identified unique hyperacetylation and CpG methylation signatures with proximal and distal patterns of regionalization associative with gene expression. Ingenuity knowledge-based pathway and gene ontology analyses indicate that hyperglycemia significantly affects human vascular chromatin with the transcriptional up-regulation of genes involved in metabolic and cardiovascular disease. We have generated the first installment of a reference collection of hyperglycemia-induced chromatin modifications using robust and reproducible platforms that allow parallel sequencing-by-synthesis of immunopurified content. We uncover that hyperglycemia-mediated induction of genes and pathways associated with endothelial dysfunction occur through modulation of acetylated H3K9/K14 inversely correlated with methyl-CpG content. PMID:21890681

  3. CONAN: copy number variation analysis software for genome-wide association studies

    PubMed Central

    2010-01-01

    Background Genome-wide association studies (GWAS) based on single nucleotide polymorphisms (SNPs) revolutionized our perception of the genetic regulation of complex traits and diseases. Copy number variations (CNVs) promise to shed additional light on the genetic basis of monogenic as well as complex diseases and phenotypes. Indeed, the number of detected associations between CNVs and certain phenotypes are constantly increasing. However, while several software packages support the determination of CNVs from SNP chip data, the downstream statistical inference of CNV-phenotype associations is still subject to complicated and inefficient in-house solutions, thus strongly limiting the performance of GWAS based on CNVs. Results CONAN is a freely available client-server software solution which provides an intuitive graphical user interface for categorizing, analyzing and associating CNVs with phenotypes. Moreover, CONAN assists the evaluation process by visualizing detected associations via Manhattan plots in order to enable a rapid identification of genome-wide significant CNV regions. Various file formats including the information on CNVs in population samples are supported as input data. Conclusions CONAN facilitates the performance of GWAS based on CNVs and the visual analysis of calculated results. CONAN provides a rapid, valid and straightforward software solution to identify genetic variation underlying the 'missing' heritability for complex traits that remains unexplained by recent GWAS. The freely available software can be downloaded at http://genepi-conan.i-med.ac.at. PMID:20546565

  4. Genome-wide Comparative Analysis of Atopic Dermatitis and Psoriasis Gives Insight into Opposing Genetic Mechanisms

    PubMed Central

    Baurecht, Hansjörg; Hotze, Melanie; Brand, Stephan; Büning, Carsten; Cormican, Paul; Corvin, Aiden; Ellinghaus, David; Ellinghaus, Eva; Esparza-Gordillo, Jorge; Fölster-Holst, Regina; Franke, Andre; Gieger, Christian; Hubner, Norbert; Illig, Thomas; Irvine, Alan D.; Kabesch, Michael; Lee, Young A.E.; Lieb, Wolfgang; Marenholz, Ingo; McLean, W.H. Irwin; Morris, Derek W.; Mrowietz, Ulrich; Nair, Rajan; Nöthen, Markus M.; Novak, Natalija; O’Regan, Grainne M.; Schreiber, Stefan; Smith, Catherine; Strauch, Konstantin; Stuart, Philip E.; Trembath, Richard; Tsoi, Lam C.; Weichenthal, Michael; Barker, Jonathan; Elder, James T.; Weidinger, Stephan; Cordell, Heather J.; Brown, Sara J.

    2015-01-01

    Atopic dermatitis and psoriasis are the two most common immune-mediated inflammatory disorders affecting the skin. Genome-wide studies demonstrate a high degree of genetic overlap, but these diseases have mutually exclusive clinical phenotypes and opposing immune mechanisms. Despite their prevalence, atopic dermatitis and psoriasis very rarely co-occur within one individual. By utilizing genome-wide association study and ImmunoChip data from >19,000 individuals and methodologies developed from meta-analysis, we have identified opposing risk alleles at shared loci as well as independent disease-specific loci within the epidermal differentiation complex (chromosome 1q21.3), the Th2 locus control region (chromosome 5q31.1), and the major histocompatibility complex (chromosome 6p21–22). We further identified previously unreported pleiotropic alleles with opposing effects on atopic dermatitis and psoriasis risk in PRKRA and ANXA6/TNIP1. In contrast, there was no evidence for shared loci with effects operating in the same direction on both diseases. Our results show that atopic dermatitis and psoriasis have distinct genetic mechanisms with opposing effects in shared pathways influencing epidermal differentiation and immune response. The statistical analysis methods developed in the conduct of this study have produced additional insight from previously published data sets. The approach is likely to be applicable to the investigation of the genetic basis of other complex traits with overlapping and distinct clinical features. PMID:25574825

  5. A six months exercise intervention influences the genome-wide DNA methylation pattern in human adipose tissue.

    PubMed

    Rönn, Tina; Volkov, Petr; Davegårdh, Cajsa; Dayeh, Tasnim; Hall, Elin; Olsson, Anders H; Nilsson, Emma; Tornberg, Asa; Dekker Nitert, Marloes; Eriksson, Karl-Fredrik; Jones, Helena A; Groop, Leif; Ling, Charlotte

    2013-06-01

    Epigenetic mechanisms are implicated in gene regulation and the development of different diseases. The epigenome differs between cell types and has until now only been characterized for a few human tissues. Environmental factors potentially alter the epigenome. Here we describe the genome-wide pattern of DNA methylation in human adipose tissue from 23 healthy men, with a previous low level of physical activity, before and after a six months exercise intervention. We also investigate the differences in adipose tissue DNA methylation between 31 individuals with or without a family history of type 2 diabetes. DNA methylation was analyzed using Infinium HumanMethylation450 BeadChip, an array containing 485,577 probes covering 99% RefSeq genes. Global DNA methylation changed and 17,975 individual CpG sites in 7,663 unique genes showed altered levels of DNA methylation after the exercise intervention (q<0.05). Differential mRNA expression was present in 1/3 of gene regions with altered DNA methylation, including RALBP1, HDAC4 and NCOR2 (q<0.05). Using a luciferase assay, we could show that increased DNA methylation in vitro of the RALBP1 promoter suppressed the transcriptional activity (p = 0.03). Moreover, 18 obesity and 21 type 2 diabetes candidate genes had CpG sites with differences in adipose tissue DNA methylation in response to exercise (q<0.05), including TCF7L2 (6 CpG sites) and KCNQ1 (10 CpG sites). A simultaneous change in mRNA expression was seen for 6 of those genes. To understand if genes that exhibit differential DNA methylation and mRNA expression in human adipose tissue in vivo affect adipocyte metabolism, we silenced Hdac4 and Ncor2 respectively in 3T3-L1 adipocytes, which resulted in increased lipogenesis both in the basal and insulin stimulated state. In conclusion, exercise induces genome-wide changes in DNA methylation in human adipose tissue, potentially affecting adipocyte metabolism.

  6. Genome Wide Screening of Candidate Genes for Improving Piglet Birth Weight Using High and Low Estimated Breeding Value Populations

    PubMed Central

    Zhang, Lifan; Zhou, Xiang; Michal, Jennifer J.; Ding, Bo; Li, Rui; Jiang, Zhihua

    2014-01-01

    Birth weight is an economically important trait in pig production because it directly impacts piglet growth and survival rate. In the present study, we performed a genome wide survey of candidate genes and pathways associated with individual birth weight (IBW) using the Illumina PorcineSNP60 BeadChip on 24 high (HEBV) and 24 low estimated breeding value (LEBV) animals. These animals were selected from a reference population of 522 individuals produced by three sires and six dam lines, which were crossbreds with multiple breeds. After quality-control, 43,257 SNPs (single nucleotide polymorphisms), including 42,243 autosomal SNPs and 1,014 SNPs on chromosome X, were used in the data analysis. A total of 27 differentially selected regions (DSRs), including 1 on Sus scrofa chromosome 1 (SSC1), 1 on SSC4, 2 on SSC5, 4 on SSC6, 2 on SSC7, 5 on SSC8, 3 on SSC9, 1 on SSC14, 3 on SSC18, and 5 on SSCX, were identified to show the genome wide separations between the HEBV and LEBV groups for IBW in piglets. A DSR with the most number of significant SNPs (including 7 top 0.1% and 31 top 5% SNPs) was located on SSC6, while another DSR with the largest genetic differences in FST was found on SSC18. These regions harbor known functionally important genes involved in growth and development, such as TNFRSF9 (tumor necrosis factor receptor superfamily member 9), CA6 (carbonic anhydrase VI) and MDFIC (MyoD family inhibitor domain containing). A DSR rich in imprinting genes appeared on SSC9, which included PEG10 (paternally expressed 10), SGCE (sarcoglycan, epsilon), PPP1R9A (protein phosphatase 1, regulatory subunit 9A) and ASB4 (ankyrin repeat and SOCS box containing 4). More importantly, our present study provided evidence to support six quantitative trait loci (QTL) regions for pig birth weight, six QTL regions for average birth weight (ABW) and three QTL regions for litter birth weight (LBW) reported previously by other groups. Furthermore, gene ontology analysis with 183 genes

  7. A Six Months Exercise Intervention Influences the Genome-wide DNA Methylation Pattern in Human Adipose Tissue

    PubMed Central

    Rönn, Tina; Volkov, Petr; Davegårdh, Cajsa; Dayeh, Tasnim; Hall, Elin; Olsson, Anders H.; Nilsson, Emma; Tornberg, Åsa; Dekker Nitert, Marloes; Eriksson, Karl-Fredrik; Jones, Helena A.; Groop, Leif; Ling, Charlotte

    2013-01-01

    Epigenetic mechanisms are implicated in gene regulation and the development of different diseases. The epigenome differs between cell types and has until now only been characterized for a few human tissues. Environmental factors potentially alter the epigenome. Here we describe the genome-wide pattern of DNA methylation in human adipose tissue from 23 healthy men, with a previous low level of physical activity, before and after a six months exercise intervention. We also investigate the differences in adipose tissue DNA methylation between 31 individuals with or without a family history of type 2 diabetes. DNA methylation was analyzed using Infinium HumanMethylation450 BeadChip, an array containing 485,577 probes covering 99% RefSeq genes. Global DNA methylation changed and 17,975 individual CpG sites in 7,663 unique genes showed altered levels of DNA methylation after the exercise intervention (q<0.05). Differential mRNA expression was present in 1/3 of gene regions with altered DNA methylation, including RALBP1, HDAC4 and NCOR2 (q<0.05). Using a luciferase assay, we could show that increased DNA methylation in vitro of the RALBP1 promoter suppressed the transcriptional activity (p = 0.03). Moreover, 18 obesity and 21 type 2 diabetes candidate genes had CpG sites with differences in adipose tissue DNA methylation in response to exercise (q<0.05), including TCF7L2 (6 CpG sites) and KCNQ1 (10 CpG sites). A simultaneous change in mRNA expression was seen for 6 of those genes. To understand if genes that exhibit differential DNA methylation and mRNA expression in human adipose tissue in vivo affect adipocyte metabolism, we silenced Hdac4 and Ncor2 respectively in 3T3-L1 adipocytes, which resulted in increased lipogenesis both in the basal and insulin stimulated state. In conclusion, exercise induces genome-wide changes in DNA methylation in human adipose tissue, potentially affecting adipocyte metabolism. PMID:23825961

  8. Siblings with Ischemic Stroke Study (SWISS): Results of a Genome-wide Scan for Stroke Loci

    PubMed Central

    Meschia, James F.; Nalls, Michael; Matarin, Mar; Brott, Thomas G.; Brown, Robert D.; Hardy, John; Kissela, Brett; Rich, Stephen S.; Singleton, Andrew; Hernandez, Dena; Ferrucci, Luigi; Pearce, Kerra; Keller, Margaret; Worrall, Bradford B.

    2011-01-01

    Background and Purpose Ischemic stroke has a strong familial component to risk. The Siblings with Ischemic Stroke Study (SWISS) is a genome-wide family-based analysis that included use of imputed genotypes. SWISS was conducted to examine associations between SNPs and risk of stroke and stroke subtypes within pairs. Methods SWISS enrolled 312 probands with ischemic stroke across 70 US and Canadian centers. Affected siblings were ascertained by centers and confirmed by central record review; unaffected siblings were ascertained by telephone contact. Ischemic stroke was subtyped using TOAST criteria. Genotyping was performed using an Illumina 610 quad array (probands) and an Illumina linkage V array (affected siblings). SNPs were imputed using 1000 Genomes Project data and MACH software. Family-based association analyses were conducted using the sibling-transmission disequilibrium test. Results For all pairs, the correlation of age at stroke within pairs of affected siblings was r = 0.83 (95%CI, 0.78 to 0.86; P < 2.2×10−16). The correlation did not differ substantially by subtype. The concordance of stroke subtypes among affected pairs was 33.8% (kappa = 0.13; P = 5.06×10−4) and did not differ by age at stroke in the proband. Although no SNP achieved genome-wide significance for risk of ischemic stroke, there was clustering of the most associated SNPs on chromosomes 3p (NOS1) and 6p. Conclusions Stroke subtype and age at stroke in affected sibling pairs exhibit significant clustering. No individual SNP reached genome-wide significance. However, two promising candidate loci were identified, including one that contains NOS1, though these risk loci warrant further examination in larger sample collections. PMID:21940970

  9. Genome-wide Mapping Reveals Conservation of Promoter DNA Methylation Following Chicken Domestication

    PubMed Central

    Li, Qinghe; Wang, Yuanyuan; Hu, Xiaoxiang; Zhao, Yaofeng; Li, Ning

    2015-01-01

    It is well-known that environment influences DNA methylation, however, the extent of heritable DNA methylation variation following animal domestication remains largely unknown. Using meDIP-chip we mapped the promoter methylomes for 23,316 genes in muscle tissues of ancestral and domestic chickens. We systematically examined the variation of promoter DNA methylation in terms of different breeds, differentially expressed genes, SNPs and genes undergo genetic selection sweeps. While considerable changes in DNA sequence and gene expression programs were prevalent, we found that the inter-strain DNA methylation patterns were highly conserved in promoter region between the wild and domestic chicken breeds. Our data suggests a global preservation of DNA methylation between the wild and domestic chicken breeds in either a genome-wide or locus-specific scale in chick muscle tissues. PMID:25735894

  10. Genome-wide mapping reveals conservation of promoter DNA methylation following chicken domestication.

    PubMed

    Li, Qinghe; Wang, Yuanyuan; Hu, Xiaoxiang; Zhao, Yaofeng; Li, Ning

    2015-01-01

    It is well-known that environment influences DNA methylation, however, the extent of heritable DNA methylation variation following animal domestication remains largely unknown. Using meDIP-chip we mapped the promoter methylomes for 23,316 genes in muscle tissues of ancestral and domestic chickens. We systematically examined the variation of promoter DNA methylation in terms of different breeds, differentially expressed genes, SNPs and genes undergo genetic selection sweeps. While considerable changes in DNA sequence and gene expression programs were prevalent, we found that the inter-strain DNA methylation patterns were highly conserved in promoter region between the wild and domestic chicken breeds. Our data suggests a global preservation of DNA methylation between the wild and domestic chicken breeds in either a genome-wide or locus-specific scale in chick muscle tissues.

  11. Stories and Challenges of Genome Wide Association Studies in Livestock — A Review

    PubMed Central

    Sharma, Aditi; Lee, Jun Seop; Dang, Chang Gwon; Sudrajad, Pita; Kim, Hyeong Cheol; Yeon, Seong Heum; Kang, Hee Seol; Lee, Seung-Hwan

    2015-01-01

    Undoubtedly livestock is one of the major contributors to the economy of any country. The economic value of livestock includes meat, dairy products, fiber, fertilizer etc. Understanding and identifying the associations of quantitative trait loci (QTL) with the economically important traits is believed to substantially benefit the livestock industry. The past two decades have seen a flurry of interest in mapping the QTL associated with traits of economic importance on the genome. With the availability of single nucleotide polymorphism chip of various densities it is possible to identify regions, QTL and genes on the genome that explain the association and its effect on the phenotype under consideration. Remarkable advancement has been seen in genome wide association studies (GWAS) since its inception till the present day. In this review we describe the progress and challenges of GWAS in various livestock species. PMID:26194229

  12. Association weight matrix: a network-based approach towards functional genome-wide association studies.

    PubMed

    Reverter, Antonio; Fortes, Marina R S

    2013-01-01

    In this chapter we describe the Association Weight Matrix (AWM), a novel procedure to exploit the results from genome-wide association studies (GWAS) and, in combination with network inference algorithms, generate gene networks with regulatory and functional significance. In simple terms, the AWM is a matrix with rows represented by genes and columns represented by phenotypes. Individual {i, j}th elements in the AWM correspond to the association of the SNP in the ith gene to the jth phenotype. While our main objective is to provide a recipe-like tutorial on how to build and use AWM, we also take the opportunity to briefly reason the logic behind each step in the process. To conclude, we discuss the impact on AWM of issues like the number of phenotypes under scrutiny, the density of the SNP chip and the choice of contrast upon which to infer the cause-effect regulatory interactions. PMID:23756904