Science.gov

Sample records for family genome-wide chip

  1. Genome-wide family-based linkage analysis of exome chip variants and cardiometabolic risk.

    PubMed

    Hellwege, Jacklyn N; Palmer, Nicholette D; Raffield, Laura M; Ng, Maggie C Y; Hawkins, Gregory A; Long, Jirong; Lorenzo, Carlos; Norris, Jill M; Ida Chen, Y-D; Speliotes, Elizabeth K; Rotter, Jerome I; Langefeld, Carl D; Wagenknecht, Lynne E; Bowden, Donald W

    2014-05-01

    Linkage analysis of complex traits has had limited success in identifying trait-influencing loci. Recently, coding variants have been implicated as the basis for some biomedical associations. We tested whether coding variants are the basis for linkage peaks of complex traits in 42 African-American (n = 596) and 90 Hispanic (n = 1,414) families in the Insulin Resistance Atherosclerosis Family Study (IRASFS) using Illumina HumanExome Beadchips. A total of 92,157 variants in African Americans (34%) and 81,559 (31%) in Hispanics were polymorphic and tested using two-point linkage and association analyses with 37 cardiometabolic phenotypes. In African Americans 77 LOD scores greater than 3 were observed. The highest LOD score was 4.91 with the APOE SNP rs7412 (MAF = 0.13) with plasma apolipoprotein B (ApoB). This SNP was associated with ApoB (P-value = 4 × 10(-19)) and accounted for 16.2% of the variance in African Americans. In Hispanic families, 104 LOD scores were greater than 3. The strongest evidence of linkage (LOD = 4.29) was with rs5882 (MAF = 0.46) in CETP with HDL. CETP variants were strongly associated with HDL (0.00049 < P-value <4.6 × 10(-12)), accounting for up to 4.5% of the variance. These loci have previously been shown to have effects on the biomedical traits evaluated here. Thus, evidence of strong linkage in this genome wide survey of primarily coding variants was uncommon. Loci with strong evidence of linkage was characterized by large contributions to the variance, and, in these cases, are common variants. Less compelling evidence of linkage and association was observed with additional loci that may require larger family sets to confirm. © 2014 WILEY PERIODICALS, INC.

  2. Genome-wide analysis for protein-DNA interaction: ChIP-chip.

    PubMed

    Tong, Yunguang; Falk, Jeff

    2009-01-01

    Chromatin immunoprecipitation (ChIP) is a well-established procedure for protein-DNA interaction research. ChIP-chip, combining chromatin immunoprecipitation (ChIP) and microarray technology (Chip), enables scientists to survey genome-wide DNA binding sites for a given protein. The ChIP-chip technique has been used to identify transcription factor binding sites, explore epigenomic information and investigate factors in DNA replicate/repairs. Here we describe a protocol for ChIP-chip to study Pituitary Tumor Transforming Gene (PTTG1) in mammalian cells.

  3. Family-based genome-wide association scan of attention-deficit/hyperactivity disorder.

    PubMed

    Mick, Eric; Todorov, Alexandre; Smalley, Susan; Hu, Xiaolan; Loo, Sandra; Todd, Richard D; Biederman, Joseph; Byrne, Deirdre; Dechairo, Bryan; Guiney, Allan; McCracken, James; McGough, James; Nelson, Stanley F; Reiersen, Angela M; Wilens, Timothy E; Wozniak, Janet; Neale, Benjamin M; Faraone, Stephen V

    2010-09-01

    Genes likely play a substantial role in the etiology of attention-deficit/hyperactivity disorder (ADHD). However, the genetic architecture of the disorder is unknown, and prior genome-wide association studies (GWAS) have not identified a genome-wide significant association. We have conducted a third, independent, multisite GWAS of DSM-IV-TR ADHD. Families were ascertained at Massachusetts General Hospital (MGH; N = 309 trios), Washington University at St. Louis (WASH-U; N = 272 trios), and University of California at Los Angeles (UCLA; N = 156 trios). Genotyping was conducted with the Illumina Human1M or Human1M-Duo BeadChip platforms. After applying quality control filters, association with ADHD was tested with 835,136 SNPs in 735 DSM-IV ADHD trios from 732 families. Our smallest p value (6.7E-07) did not reach the threshold for genome-wide statistical significance (5.0E-08), but one of the 20 most significant associations was located in a candidate gene of interest for ADHD (SLC9A9, rs9810857, p = 6.4E-6). We also conducted gene-based tests of candidate genes identified in the literature and found additional evidence of association with SLC9A9. We and our colleagues in the Psychiatric GWAS Consortium are working to pool together GWAS samples to establish the large data sets needed to follow-up on these results and to identify genes for ADHD and other disorders. 2010 American Academy of Child and Adolescent Psychiatry. Published by Elsevier Inc. All rights reserved.

  4. Genome-Wide Association Study of Copy Number Variations in Patients with Familial Neurocardiogenic Syncope.

    PubMed

    Demir, Emre; Hasdemir, Can; Ak, Handan; Atay, Sevcan; Aydin, Hikmet Hakan

    2016-08-01

    Neurocardiogenic syncope (NCS) is the most frequent type of syncope characterized by a self-limited episode of systemic hypotension. In this study, we conducted the first genome-wide association study testing copy number variations for association with NCS. Study population consisted of 107 consecutive patients with recurrent syncope and positive head-up tilt table testing. Four families with NCS were selected for CNV analysis. Affymetrix GeneChip(®) SNP 6.0 array was used for CNV analysis. Data and statistical analysis were performed with Affymetrix genotyping console 4.0 and GraphPad Prism v6. Positive family history of NCS was present in 19.6 % (n = 21) in our study population (n = 107). Twenty-six CNV regions were found to be significantly altered in families with NCS (P < 0.05). Several CNVs were identified in families with NCS. Further studies comprising wider study population are required to determine the effect of these variations on NCS development.

  5. Genome-wide measurement of protein-DNA binding dynamics using competition ChIP.

    PubMed

    Lickwar, Colin R; Mueller, Florian; Lieb, Jason D

    2013-01-01

    Competition chromatin immunoprecipitation (competition ChIP) enables experimenters to measure protein-DNA dynamics at a single locus or across the entire genome, depending on the detection method. Competition ChIP relies on a cell containing two copies of a single DNA-associated factor, with each copy of the factor differentially epitope tagged. One of the copies is expressed constitutively and the second is induced as a competitor. The ratio of isoforms associated with discrete genomic locations is detected by ChIP-on-chip (ChIP-chip) or ChIP-sequencing (ChIP-seq). The rate at which the resident isoform of the protein is replaced by the competitor at each binding location enables the calculation of residence time for that factor at each site of interaction genome wide. Here we provide a detailed protocol for designing and performing competition ChIP experiments in Saccharomyces cerevisiae, which takes ∼5 d to complete (not including strain production and characterizations, which may take as long as 6 months). Included in this protocol are guidelines for downstream bioinformatic analysis to extract residence times throughout the genome.

  6. Genome-wide Target Enrichment-aided Chip Design: a 66 K SNP Chip for Cashmere Goat.

    PubMed

    Qiao, Xian; Su, Rui; Wang, Yang; Wang, Ruijun; Yang, Ting; Li, Xiaokai; Chen, Wei; He, Shiyang; Jiang, Yu; Xu, Qiwu; Wan, Wenting; Zhang, Yaolei; Zhang, Wenguang; Chen, Jiang; Liu, Bin; Liu, Xin; Fan, Yixing; Chen, Duoyuan; Jiang, Huaizhi; Fang, Dongming; Liu, Zhihong; Wang, Xiaowen; Zhang, Yanjun; Mao, Danqing; Wang, Zhiying; Di, Ran; Zhao, Qianjun; Zhong, Tao; Yang, Huanming; Wang, Jian; Wang, Wen; Dong, Yang; Chen, Xiaoli; Xu, Xun; Li, Jinquan

    2017-08-17

    Compared with the commercially available single nucleotide polymorphism (SNP) chip based on the Bead Chip technology, the solution hybrid selection (SHS)-based target enrichment SNP chip is not only design-flexible, but also cost-effective for genotype sequencing. In this study, we propose to design an animal SNP chip using the SHS-based target enrichment strategy for the first time. As an update to the international collaboration on goat research, a 66 K SNP chip for cashmere goat was created from the whole-genome sequencing data of 73 individuals. Verification of this 66 K SNP chip with the whole-genome sequencing data of 436 cashmere goats showed that the SNP call rates was between 95.3% and 99.8%. The average sequencing depth for target SNPs were 40X. The capture regions were shown to be 200 bp that flank target SNPs. This chip was further tested in a genome-wide association analysis of cashmere fineness (fiber diameter). Several top hit loci were found marginally associated with signaling pathways involved in hair growth. These results demonstrate that the 66 K SNP chip is a useful tool in the genomic analyses of cashmere goats. The successful chip design shows that the SHS-based target enrichment strategy could be applied to SNP chip design in other species.

  7. Family-based designs for genome-wide association studies.

    PubMed

    Ott, Jurg; Kamatani, Yoichiro; Lathrop, Mark

    2011-06-01

    Association mapping has successfully identified common SNPs associated with many diseases. However, the inability of this class of variation to account for most of the supposed heritability has led to a renewed interest in methods - primarily linkage analysis - to detect rare variants. Family designs allow for control of population stratification, investigations of questions such as parent-of-origin effects and other applications that are imperfectly or not readily addressed in case-control association studies. This article guides readers through the interface between linkage and association analysis, reviews the new methodologies and provides useful guidelines for applications. Just as effective SNP-genotyping tools helped to realize the potential of association studies, next-generation sequencing tools will benefit genetic studies by improving the power of family-based approaches.

  8. Genome-wide analysis of TCP family in tobacco.

    PubMed

    Chen, L; Chen, Y Q; Ding, A M; Chen, H; Xia, F; Wang, W F; Sun, Y H

    2016-05-23

    The TCP family is a transcription factor family, members of which are extensively involved in plant growth and development as well as in signal transduction in the response against many physiological and biochemical stimuli. In the present study, 61 TCP genes were identified in tobacco (Nicotiana tabacum) genome. Bioinformatic methods were employed for predicting and analyzing the gene structure, gene expression, phylogenetic analysis, and conserved domains of TCP proteins in tobacco. The 61 NtTCP genes were divided into three diverse groups, based on the division of TCP genes in tomato and Arabidopsis, and the results of the conserved domain and sequence analyses further confirmed the classification of the NtTCP genes. The expression pattern of NtTCP also demonstrated that majority of these genes play important roles in all the tissues, while some special genes exercise their functions only in specific tissues. In brief, the comprehensive and thorough study of the TCP family in other plants provides sufficient resources for studying the structure and functions of TCPs in tobacco.

  9. Genetic link between family socioeconomic status and children's educational achievement estimated from genome-wide SNPs.

    PubMed

    Krapohl, E; Plomin, R

    2016-03-01

    One of the best predictors of children's educational achievement is their family's socioeconomic status (SES), but the degree to which this association is genetically mediated remains unclear. For 3000 UK-representative unrelated children we found that genome-wide single-nucleotide polymorphisms could explain a third of the variance of scores on an age-16 UK national examination of educational achievement and half of the correlation between their scores and family SES. Moreover, genome-wide polygenic scores based on a previously published genome-wide association meta-analysis of total number of years in education accounted for ~3.0% variance in educational achievement and ~2.5% in family SES. This study provides the first molecular evidence for substantial genetic influence on differences in children's educational achievement and its association with family SES.

  10. Genetic link between family socioeconomic status and children's educational achievement estimated from genome-wide SNPs

    PubMed Central

    Krapohl, E; Plomin, R

    2016-01-01

    One of the best predictors of children's educational achievement is their family's socioeconomic status (SES), but the degree to which this association is genetically mediated remains unclear. For 3000 UK-representative unrelated children we found that genome-wide single-nucleotide polymorphisms could explain a third of the variance of scores on an age-16 UK national examination of educational achievement and half of the correlation between their scores and family SES. Moreover, genome-wide polygenic scores based on a previously published genome-wide association meta-analysis of total number of years in education accounted for ~3.0% variance in educational achievement and ~2.5% in family SES. This study provides the first molecular evidence for substantial genetic influence on differences in children's educational achievement and its association with family SES. PMID:25754083

  11. Genome-wide analysis of DNA methylation in Arabidopsis using MeDIP-chip.

    PubMed

    Cortijo, Sandra; Wardenaar, René; Colomé-Tatché, Maria; Johannes, Frank; Colot, Vincent

    2014-01-01

    DNA methylation is an epigenetic mark that is essential for preserving genome integrity and normal development in plants and mammals. Although this modification may serve a variety of purposes, it is best known for its role in stable transcriptional silencing of transposable elements and epigenetic regulation of some genes. In addition, it is increasingly recognized that alterations in DNA methylation patterns can sometimes be inherited across multiple generations and thus are a source of heritable phenotypic variation that is independent of any DNA sequence changes. With the advent of genomics, it is now possible to analyze DNA methylation genome-wide with high precision, which is a prerequisite for understanding fully the various functions and phenotypic impact of this modification. Indeed, several so-called epigenomic mapping methods have been developed for the analysis of DNA methylation. Among these, immunoprecipitation of methylated DNA followed by hybridization to genome tiling arrays (MeDIP-chip) arguably offers a reasonable compromise between cost, ease of implementation, and sensitivity to date. Here we describe the application of this method, from DNA extraction to data analysis, to the study of DNA methylation genome-wide in Arabidopsis.

  12. Spotting and validation of a genome wide oligonucleotide chip with duplicate measurement of each gene

    SciTech Connect

    Thomassen, Mads . E-mail: mads.thomassen@ouh.fyns-amt.dk; Skov, Vibe; Eiriksdottir, Freyja; Tan, Qihua; Jochumsen, Kirsten; Fritzner, Niels; Brusgaard, Klaus; Dahlgaard, Jesper; Kruse, Torben A.

    2006-06-16

    The quality of DNA microarray based gene expression data relies on the reproducibility of several steps in a microarray experiment. We have developed a spotted genome wide microarray chip with oligonucleotides printed in duplicate in order to minimise undesirable biases, thereby optimising detection of true differential expression. The validation study design consisted of an assessment of the microarray chip performance using the MessageAmp and FairPlay labelling kits. Intraclass correlation coefficient (ICC) was used to demonstrate that MessageAmp was significantly more reproducible than FairPlay. Further examinations with MessageAmp revealed the applicability of the system. The linear range of the chips was three orders of magnitude, the precision was high, as 95% of measurements deviated less than 1.24-fold from the expected value, and the coefficient of variation for relative expression was 13.6%. Relative quantitation was more reproducible than absolute quantitation and substantial reduction of variance was attained with duplicate spotting. An analysis of variance (ANOVA) demonstrated no significant day-to-day variation.

  13. A Genome-wide Quantitative Linkage Scan of Niacin Skin Flush Response in Families With Schizophrenia

    PubMed Central

    Lien, Yin-Ju; Huang, Sih-Syuan; Liu, Chih-Min; Hwu, Hai-Gwo; Faraone, Stephen V.; Tsuang, Ming T.; Chen, Wei J.

    2013-01-01

    Schizophrenia patients frequently display reduced niacin flush responses, and similar characteristics are also observed in their nonpsychotic relatives. This study aimed to identify loci influencing flush response to niacin in schizophrenia using genome-wide quantitative linkage scan. In a nationwide sample of families with at least 2 siblings affected with schizophrenia in each family, 115 families that had at least 2 affected siblings with information on the niacin skin test were subjected to quantitative trait loci linkage analysis, either involving affected individuals only or the whole family. Nonparametric linkage z (NPL-Z) scores were calculated for each of 386 microsatellite markers spaced at an average of 9-cM intervals. Niacin patches of 3 concentrations (0.001M, 0.01, and 0.1M) were applied to forearm skin, and the flush response was rated at 5, 10, and 15 minutes, respectively, with a 4-point scale. Determination of genome-wide empirical significance was implemented using 1000 simulated genome scans. One linkage peak attaining genome-wide significance was identified at chromosomal region 14q32.12 for 0.01M concentration at 5 minutes (NPL-Z scores = 3.39, genome-wide empirical P = .03) in affected individuals, and the corresponding linkage signal remained strong (NPL-Z scores = 2.87) for the analyses of the whole family. This locus is distinct from the chromosomal region identified in the previous genome-wide scan for the diagnosis of schizophrenia, and the signal was higher than the peak linkage signal in that study. These findings indicate that there might be modifier or susceptibility-modifier genes at 14q32.12 for schizophrenia-related attenuation of flush response to niacin. PMID:21653277

  14. Genome-wide linkage scan of quantitative traits representing symptom dimensions in multiplex schizophrenia families.

    PubMed

    Ryu, Seunghyong; Won, Hong-Hee; Oh, Sohee; Kim, Jong-Won; Park, Taesung; Cho, Eun-Young; Cho, Youngah; Park, Dong Yeon; Lee, Yu-Sang; Kwon, Jun Soo; Hong, Kyung Sue

    2013-12-30

    Symptom dimensions of schizophrenia are likely to be the intermediate phenotypes under the control of disease-susceptibility genes, or separate traits related to disease-modifier genes. This study aimed to identify chromosomal loci linked to symptom dimensions of schizophrenia through genome-wide quantitative trait locus (QTL) linkage analysis. The study subjects consisted of 56 families with 183 members including 123 affected individuals. Symptom evaluations were performed on lifetime basis. Through principal component factor analysis, eight quantitative phenotypes representing symptom dimensions were identified. Genotyping was done for 6008 SNP markers, and genome-wide QTL linkage analysis was performed. No symptom dimension showed a significant linkage attaining genome-wide empirical thresholds. We observed seven regions yielding linkage signals attaining genome-wide empirical thresholds for suggestive linkage (NPL Z score = 2.78-3.49); chromosome 15q26.1 for 'non-paranoid delusion factor', 2p24.3 and 7q31.1 for 'prodromal impairment factor', 1q32.1, 9p21.3, and 9q31.2 for 'negative symptom factor', and 10p13 for 'disorganization factor'. Among these loci, chromosome 2p24.3 and 1q32.1 overlap with susceptibility loci of schizophrenia identified in our previous linkage studies. This study suggests the existence of genetic loci related to various clinical features of schizophrenia. Further genetic analyses for these dimensional phenotypes are warranted. © 2013 Published by Elsevier Ireland Ltd.

  15. Combined genome-wide linkage and targeted association analysis of head circumference in autism spectrum disorder families.

    PubMed

    Woodbury-Smith, M; Bilder, D A; Morgan, J; Jerominski, L; Darlington, T; Dyer, T; Paterson, A D; Coon, H

    2017-01-01

    It has long been recognized that there is an association between enlarged head circumference (HC) and autism spectrum disorder (ASD), but the genetics of HC in ASD is not well understood. In order to investigate the genetic underpinning of HC in ASD, we undertook a genome-wide linkage study of HC followed by linkage signal targeted association among a sample of 67 extended pedigrees with ASD. HC measurements on members of 67 multiplex ASD extended pedigrees were used as a quantitative trait in a genome-wide linkage analysis. The Illumina 6K SNP linkage panel was used, and analyses were carried out using the SOLAR implemented variance components model. Loci identified in this way formed the target for subsequent association analysis using the Illumina OmniExpress chip and imputed genotypes. A modification of the qTDT was used as implemented in SOLAR. We identified a linkage signal spanning 6p21.31 to 6p22.2 (maximum LOD = 3.4). Although targeted association did not find evidence of association with any SNP overall, in one family with the strongest evidence of linkage, there was evidence for association (rs17586672, p = 1.72E-07). Although this region does not overlap with ASD linkage signals in these same samples, it has been associated with other psychiatric risk, including ADHD, developmental dyslexia, schizophrenia, specific language impairment, and juvenile bipolar disorder. The genome-wide significant linkage signal represents the first reported observation of a potential quantitative trait locus for HC in ASD and may be relevant in the context of complex multivariate risk likely leading to ASD.

  16. Genome-wide identification and analysis of the MADS-box gene family in apple.

    PubMed

    Tian, Yi; Dong, Qinglong; Ji, Zhirui; Chi, Fumei; Cong, Peihua; Zhou, Zongshan

    2015-01-25

    The MADS-box gene family is one of the most widely studied families in plants and has diverse developmental roles in flower pattern formation, gametophyte cell division and fruit differentiation. Although the genome-wide analysis of this family has been performed in some species, little is known regarding MADS-box genes in apple (Malus domestica). In this study, 146 MADS-box genes were identified in the apple genome and were phylogenetically clustered into six subgroups (MIKC(c), MIKC*, Mα, Mβ, Mγ and Mδ) with the MADS-box genes from Arabidopsis and rice. The predicted apple MADS-box genes were distributed across all 17 chromosomes at different densities. Additionally, the MADS-box domain, exon length, gene structure and motif compositions of the apple MADS-box genes were analysed. Moreover, the expression of all of the apple MADS-box genes was analysed in the root, stem, leaf, flower tissues and five stages of fruit development. All of the apple MADS-box genes, with the exception of some genes in each group, were expressed in at least one of the tissues tested, which indicates that the MADS-box genes are involved in various aspects of the physiological and developmental processes of the apple. To the best of our knowledge, this report describes the first genome-wide analysis of the apple MADS-box gene family, and the results should provide valuable information for understanding the classification, cloning and putative functions of this family.

  17. Genome-wide analysis of homeobox gene family in legumes: identification, gene duplication and expression profiling.

    PubMed

    Bhattacharjee, Annapurna; Ghangal, Rajesh; Garg, Rohini; Jain, Mukesh

    2015-01-01

    Homeobox genes encode transcription factors that are known to play a major role in different aspects of plant growth and development. In the present study, we identified homeobox genes belonging to 14 different classes in five legume species, including chickpea, soybean, Medicago, Lotus and pigeonpea. The characteristic differences within homeodomain sequences among various classes of homeobox gene family were quite evident. Genome-wide expression analysis using publicly available datasets (RNA-seq and microarray) indicated that homeobox genes are differentially expressed in various tissues/developmental stages and under stress conditions in different legumes. We validated the differential expression of selected chickpea homeobox genes via quantitative reverse transcription polymerase chain reaction. Genome duplication analysis in soybean indicated that segmental duplication has significantly contributed in the expansion of homeobox gene family. The Ka/Ks ratio of duplicated homeobox genes in soybean showed that several members of this family have undergone purifying selection. Moreover, expression profiling indicated that duplicated genes might have been retained due to sub-functionalization. The genome-wide identification and comprehensive gene expression profiling of homeobox gene family members in legumes will provide opportunities for functional analysis to unravel their exact role in plant growth and development.

  18. Genome-Wide Analysis of Homeobox Gene Family in Legumes: Identification, Gene Duplication and Expression Profiling

    PubMed Central

    Garg, Rohini; Jain, Mukesh

    2015-01-01

    Homeobox genes encode transcription factors that are known to play a major role in different aspects of plant growth and development. In the present study, we identified homeobox genes belonging to 14 different classes in five legume species, including chickpea, soybean, Medicago, Lotus and pigeonpea. The characteristic differences within homeodomain sequences among various classes of homeobox gene family were quite evident. Genome-wide expression analysis using publicly available datasets (RNA-seq and microarray) indicated that homeobox genes are differentially expressed in various tissues/developmental stages and under stress conditions in different legumes. We validated the differential expression of selected chickpea homeobox genes via quantitative reverse transcription polymerase chain reaction. Genome duplication analysis in soybean indicated that segmental duplication has significantly contributed in the expansion of homeobox gene family. The Ka/Ks ratio of duplicated homeobox genes in soybean showed that several members of this family have undergone purifying selection. Moreover, expression profiling indicated that duplicated genes might have been retained due to sub-functionalization. The genome-wide identification and comprehensive gene expression profiling of homeobox gene family members in legumes will provide opportunities for functional analysis to unravel their exact role in plant growth and development. PMID:25745864

  19. Genome-wide linkage scan for psoriasis susceptibility loci in multiplex Tunisian families.

    PubMed

    Ammar, M; Bouchlaka-Souissi, C; Helms, C A; Zaraa, I; Jordan, C T; Anbunathan, H; Bouhaha, R; Kouidhi, S; Doss, N; Dhaoui, R; Ben Osman, A; Ben Ammar El Gaied, A; Marrakchi, R; Mokni, M; Bowcock, A M

    2013-03-01

    Psoriasis is a relapsing chronic inflammatory skin disease affecting all population groups, with a peak prevalence of 3% in northern European and Scandinavian caucasians. Epidemiological studies have implicated a genetic component to psoriasis. In the past 12 years multiple genome-wide linkage analyses have identified putative susceptibility loci on several chromosomes, with a major locus in the major histocompatibility complex region. To investigate the genetic basis of familial psoriasis in the Tunisian population using a genome-wide linkage scan in seven ultiplex psoriatic families from Tunisia. Following single nucleotide polymorphism (SNP) genotyping on the Affymetrix 10K SNP array, we performed nonparametric linkage (NPL) multipoint analyses to identify genotypes and obtain evidence for linkage with psoriasis across the genome. No chromosomal region gave consistent evidence for linkage, providing evidence for genetic heterogeneity in Tunisian psoriasis families. Significant evidence for linkage of psoriasis to chromosome 2p12 was seen in one family. We also identified several regions of tentative psoriasis linkage on chromosomes 2q, 4q, 6p, 11q, 12q, 9q and 13q. One family exhibiting suggestive evidence for linkage to 17q25 (PSORS2) was identified and all affected members harboured a p.Gly117Ser mutation in CARD14 (caspase recruitment domain family, member 14), recently reported to lead to psoriasis in a large family from the U.S.A. Our results support the genetic heterogeneity of psoriasis in the Tunisian population, provide confirmatory evidence for a novel psoriasis locus at chromosome 2p12 and reveal a psoriasis family with a mutation at PSORS2. © 2012 The Authors. BJD © 2012 British Association of Dermatologists.

  20. Genome-wide association study of personality traits in the long life family study.

    PubMed

    Bae, Harold T; Sebastiani, Paola; Sun, Jenny X; Andersen, Stacy L; Daw, E Warwick; Terracciano, Antonio; Ferrucci, Luigi; Perls, Thomas T

    2013-01-01

    Personality traits have been shown to be associated with longevity and healthy aging. In order to discover novel genetic modifiers associated with personality traits as related with longevity, we performed a genome-wide association study (GWAS) on personality factors assessed by NEO-five-factor inventory in individuals enrolled in the Long Life Family Study (LLFS), a study of 583 families (N up to 4595) with clustering for longevity in the United States and Denmark. Three SNPs, in almost perfect LD, associated with agreeableness reached genome-wide significance (p < 10(-8)) and replicated in an additional sample of 1279 LLFS subjects, although one (rs9650241) failed to replicate and the other two were not available in two independent replication cohorts, the Baltimore Longitudinal Study of Aging and the New England Centenarian Study. Based on 10,000,000 permutations, the empirical p-value of 2 × 10(-7) was observed for the genome-wide significant SNPs. Seventeen SNPs that reached marginal statistical significance in the two previous GWASs (p-value <10(-4) and 10(-5)), were also marginally significantly associated in this study (p-value <0.05), although none of the associations passed the Bonferroni correction. In addition, we tested age-by-SNP interactions and found some significant associations. Since scores of personality traits in LLFS subjects change in the oldest ages, and genetic factors outweigh environmental factors to achieve extreme ages, these age-by-SNP interactions could be a proxy for complex gene-gene interactions affecting personality traits and longevity.

  1. Genome-wide Linkage Screen in Familial Parkinson Disease Identifies Loci on Chromosomes 3 and 18

    PubMed Central

    Gao, Xiaoyi; Martin, Eden R.; Liu, Yutao; Mayhew, Gregory; Vance, Jeffery M.; Scott, William K.

    2009-01-01

    Parkinson disease (PD) is a complex, multifactorial neurodegenerative disease with substantial evidence for genetic risk factors. We conducted a genome-wide linkage screen of 5824 single-nucleotide polymorphisms in 278 families of European, non-Hispanic descent to localize regions that harbor susceptibility loci for PD. By using parametric and nonparametric linkage analyses and allowing for genetic heterogeneity among families, we found two loci for PD. Significant evidence for linkage was detected on chromosome 18q11 (maximum lod score [MLOD] = 4.1) and suggestive evidence for linkage was obtained on chromosome 3q25 (MLOD = 2.5). These results were strongest in families not previously screened for linkage, and simulation studies suggest that these findings are likely due to locus heterogeneity rather than random statistical error. The finding of two loci (one highly statistically significant) suggests that additional PD susceptibility genes might be identified through targeted candidate gene studies in these regions. PMID:19327735

  2. Scoring overlapping and adjacent signals from genome-wide ChIP and DamID assays.

    PubMed

    Fu, Audrey Qiuyan; Adryan, Boris

    2009-12-01

    Much of the research utilising genome-wide ChIP and DamID assays aims to understand the combinatorial feature of transcription factor binding and the chromatin modification code. With these experimental methods becoming more affordable and widespread, the focus of research is shifting to making sense of the data. Amongst the many challenges arising from data analyses, we are concerned with identifying biologically meaningful co-occurrences of transcription factor binding or chromatin modifications, using genome-wide profiles generated from ChIP and DamID assays. Co-occurrences are reflected in overlapping and adjacent signals in multiple ChIP or DamID profiles. We review existing quantitative methods to score overlaps and to cluster binding events in ChIP and DamID profiles. For pairwise comparison, existing methods either are based on a single score at the genome level or take a genomic, region-specific view. To draw inference from many profiles simultaneously, methods exist to cluster regions by their regulatory importance or to infer cis-regulatory modules for a particular region. We provide a simple guide to some of the statistical tools used by these methods.

  3. Genome-wide classification and expression analysis of MYB transcription factor families in rice and Arabidopsis

    PubMed Central

    2012-01-01

    Background The MYB gene family comprises one of the richest groups of transcription factors in plants. Plant MYB proteins are characterized by a highly conserved MYB DNA-binding domain. MYB proteins are classified into four major groups namely, 1R-MYB, 2R-MYB, 3R-MYB and 4R-MYB based on the number and position of MYB repeats. MYB transcription factors are involved in plant development, secondary metabolism, hormone signal transduction, disease resistance and abiotic stress tolerance. A comparative analysis of MYB family genes in rice and Arabidopsis will help reveal the evolution and function of MYB genes in plants. Results A genome-wide analysis identified at least 155 and 197 MYB genes in rice and Arabidopsis, respectively. Gene structure analysis revealed that MYB family genes possess relatively more number of introns in the middle as compared with C- and N-terminal regions of the predicted genes. Intronless MYB-genes are highly conserved both in rice and Arabidopsis. MYB genes encoding R2R3 repeat MYB proteins retained conserved gene structure with three exons and two introns, whereas genes encoding R1R2R3 repeat containing proteins consist of six exons and five introns. The splicing pattern is similar among R1R2R3 MYB genes in Arabidopsis. In contrast, variation in splicing pattern was observed among R1R2R3 MYB members of rice. Consensus motif analysis of 1kb upstream region (5′ to translation initiation codon) of MYB gene ORFs led to the identification of conserved and over-represented cis-motifs in both rice and Arabidopsis. Real-time quantitative RT-PCR analysis showed that several members of MYBs are up-regulated by various abiotic stresses both in rice and Arabidopsis. Conclusion A comprehensive genome-wide analysis of chromosomal distribution, tandem repeats and phylogenetic relationship of MYB family genes in rice and Arabidopsis suggested their evolution via duplication. Genome-wide comparative analysis of MYB genes and their expression analysis

  4. Genome-wide identification and expression analysis of MAPK and MAPKK gene family in Malus domestica.

    PubMed

    Zhang, Shizhong; Xu, Ruirui; Luo, Xiaocui; Jiang, Zesheng; Shu, Huairui

    2013-12-01

    MAPK signal transduction modules play crucial roles in regulating many biological processes in plants, which are composed of three classes of hierarchically organized protein kinases, namely MAPKKKs, MAPKKs, and MAPKs. Although genome-wide analysis of this family has been carried out in some species, little is known about MAPK and MAPKK genes in apple (Malus domestica). In this study, a total of 26 putative apple MAPK genes (MdMPKs) and 9 putative apple MAPKK genes (MdMKKs) have been identified and located within the apple genome. Phylogenetic analysis revealed that MdMAPKs and MdMAPKKs could be divided into 4 subfamilies (groups A, B, C and D), respectively. The predicted MdMAPKs and MdMAPKKs were distributed across 13 out of 17 chromosomes with different densities. In addition, analysis of exon-intron junctions and of intron phase inside the predicted coding region of each candidate gene has revealed high levels of conservation within and between phylogenetic groups. According to the microarray and expressed sequence tag (EST) analysis, the different expression patterns indicate that they may play different roles during fruit development and rootstock-scion interaction process. Moreover, MAPK and MAPKK genes were performed expression profile analyses in different tissues (root, stem, leaf, flower and fruit), and all of the selected genes were expressed in at least one of the tissues tested, indicating that the MAPKs and MAPKKs are involved in various aspects of physiological and developmental processes of apple. To our knowledge, this is the first report of a genome-wide analysis of the apple MAPK and MAPKK gene family. This study provides valuable information for understanding the classification and putative functions of the MAPK signal in apple.

  5. Genome-wide analysis of plant-specific Dof transcription factor family in tomato.

    PubMed

    Cai, Xiaofeng; Zhang, Yuyang; Zhang, Chanjuan; Zhang, Tingyan; Hu, Tixu; Ye, Jie; Zhang, Junhong; Wang, Taotao; Li, Hanxia; Ye, Zhibiao

    2013-06-01

    The Dof (DNA binding with One Finger) family encoding single zinc finger proteins has been known as a family of plant-specific transcription factors. These transcription factors are involved in a variety of functions of importance for different biological processes in plants. In the current study, we identified 34 Dof family genes in tomato, distributed on 11 chromosomes. A complete overview of SlDof genes in tomato is presented, including the gene structures, chromosome locations, phylogeny, protein motifs and evolution pattern. Phylogenetic analysis of 34 SlDof proteins resulted in four classes constituting six clusters. In addition, a comparative analysis between these genes in tomato, Arabidopsis and rice was also performed. The tomato Dof family expansion has been dated to recent duplication events, and segmental duplication is predominant for the SlDof genes. Furthermore, the SlDof genes displayed differential expression either in their transcript abundance or in their expression patterns under normal growth conditions. This is the first step towards genome-wide analyses of the Dof genes in tomato. Our study provides a very useful reference for cloning and functional analysis of the members of this gene family in tomato and other species.

  6. Conservation, Divergence, and Genome-Wide Distribution of PAL and POX A Gene Families in Plants

    PubMed Central

    Rawal, H. C.; Singh, N. K.; Sharma, T. R.

    2013-01-01

    Genome-wide identification and phylogenetic and syntenic comparison were performed for the genes responsible for phenylalanine ammonia lyase (PAL) and peroxidase A (POX A) enzymes in nine plant species representing very diverse groups like legumes (Glycine max and Medicago truncatula), fruits (Vitis vinifera), cereals (Sorghum bicolor, Zea mays, and Oryza sativa), trees (Populus trichocarpa), and model dicot (Arabidopsis thaliana) and monocot (Brachypodium distachyon) species. A total of 87 and 1045 genes in PAL and POX A gene families, respectively, have been identified in these species. The phylogenetic and syntenic comparison along with motif distributions shows a high degree of conservation of PAL genes, suggesting that these genes may predate monocot/eudicot divergence. The POX A family genes, present in clusters at the subtelomeric regions of chromosomes, might be evolving and expanding with higher rate than the PAL gene family. Our analysis showed that during the expansion of POX A gene family, many groups and subgroups have evolved, resulting in a high level of functional divergence among monocots and dicots. These results will act as a first step toward the understanding of monocot/eudicot evolution and functional characterization of these gene families in the future. PMID:23671845

  7. Genome-wide identification and expression profiling of ankyrin-repeat gene family in maize.

    PubMed

    Jiang, Haiyang; Wu, Qingqing; Jin, Jing; Sheng, Lei; Yan, Hanwei; Cheng, Beijiu; Zhu, Suwen

    2013-09-01

    Members of the ankyrin repeats (ANK) gene family encode ANK domain that are common in diverse organisms and play important roles in cell growth and development, such as cell-cell signal transduction and cell cycle regulation. Recently, genome-wide identification and evolutionary analyses of the ANK gene family have been carried out in Arabidopsis and rice. However, little is known regarding the ANK genes in the entire maize genome. In this study, we described the identification and structural characterization of 71 ANK genes in maize (ZmANK). Then, comprehensive bioinformatics analyses of ZmANK genes family were performed including phylogenetic, domain and motif analysis, chromosomal localization, intron/exon structural patterns, gene duplications and expression profiling. Domain composition analyses showed that ZmANK genes formed ten subfamilies. Five tandem duplications and 14 segmental duplications were identified in ZmANK genes. Furthermore, we took comparative analysis of the total ANK gene family in Arabidopsis, rice and maize, ZmANKs were more closely paired with OsANKs than with AtANKs. At last, expression profile analyses were performed. Forty-one members of ZmANK genes held EST sequences records. Semi-quantitative expression and microarray data analysis of these 41 ZmANK genes demonstrated that ZmANK genes exhibit a various expression pattern, suggesting that functional diversification of ZmANK genes family. The results will present significant insights to explore ANK genes expression and function in future studies in maize.

  8. Genome-wide identification and comparison of legume MLO gene family

    PubMed Central

    Rispail, Nicolas; Rubiales, Diego

    2016-01-01

    MLO proteins are highly conserved proteins with seven trans-membrane domains. Specific MLO genes have been linked to plant disease susceptibility. Others are involved in plant reproduction and in root thigmomorphogenesis. Functions of the remaining MLOs are still unknown. Here we performed a genome-wide survey of the MLO family in eight legume species from different clades of the Papillionoideae sub-family. A total of 118 MLO sequences were identified and characterized. Their deduced protein sequences shared the characteristics of MLO proteins. The total number of MLO genes per legume species varied from 13 to 20 depending on the species. Legume MLOs were evenly distributed over their genomes and tended to localize within syntenic blocks conserved across legume genomes. Phylogenetic analysis indicated that these sequences clustered in seven well-defined clades. Comparison of MLO protein sequences revealed 34 clade-specific motifs in the variable regions of the proteins. Comparative analyses of the MLO family between legume species also uncovered several evolutionary differences between the tropical legume species from the Phaseoloid clades and the other legume species. Altogether, this study provides interesting new features on the evolution of the MLO family. It also provides valuable clues to identify additional MLO genes from non-sequenced species. PMID:27596925

  9. Genome-wide identification and phylogenetic analysis of the ERF gene family in cucumbers

    PubMed Central

    Hu, Lifang; Liu, Shiqiang

    2011-01-01

    Members of the ERF transcription-factor family participate in a number of biological processes, viz., responses to hormones, adaptation to biotic and abiotic stress, metabolism regulation, beneficial symbiotic interactions, cell differentiation and developmental processes. So far, no tissue-expression profile of any cucumber ERF protein has been reported in detail. Recent completion of the cucumber full-genome sequence has come to facilitate, not only genome-wide analysis of ERF family members in cucumbers themselves, but also a comparative analysis with those in Arabidopsis and rice. In this study, 103 hypothetical ERF family genes in the cucumber genome were identified, phylogenetic analysis indicating their classification into 10 groups, designated I to X. Motif analysis further indicated that most of the conserved motifs outside the AP2/ERF domain, are selectively distributed among the specific clades in the phylogenetic tree. From chromosomal localization and genome distribution analysis, it appears that tandem-duplication may have contributed to CsERF gene expansion. Intron/exon structure analysis indicated that a few CsERFs still conserved the former intron-position patterns existent in the common ancestor of monocots and eudicots. Expression analysis revealed the widespread distribution of the cucumber ERF gene family within plant tissues, thereby implying the probability of their performing various roles therein. Furthermore, members of some groups presented mutually similar expression patterns that might be related to their phylogenetic groups. PMID:22215967

  10. Genome-wide identification and phylogenetic analysis of the SBP-box gene family in melons.

    PubMed

    Ma, Y; Guo, J W; Bade, R; Men, Z H; Hasi, A

    2014-10-27

    The SBP-box gene family is specific to plants and encodes a class of zinc finger-containing transcription factors with a broad range of functions. Although SBP-box genes have been identified in numerous plants, including green algae, moss, silver birch, snapdragon, Arabidopsis, rice, and maize, there is little information concerning SBP-box genes, or the corresponding miR156/157, function in melon. Using the highly conserved sequence of the Arabidopsis thaliana SBP-box domain protein as a probe of information sequence, the genome-wide protein database of melon was explored to obtain 13 SBP-box protein sequences, which were further divided into 4 groups, based on phylogenetic analysis. A further analysis centered on the melon SBP-box genetic family's phylogenetic evolution, sequence similarities, gene structure, and miR156 target sequence was also conducted. Analysis of all the expression patterns of melon SBP-box family genes showed that the SBP-box genes were detected in 7 kinds of tissue, and fruit had the highest expression level. CmSBP11 tends to present its specific expression in melon fruit and root. CmSBP09 expression was the highest in flower. Overall, the molecular evolution and expression pattern of the melon SBP-box gene family, revealed by these results, suggest its function differentiation that followed gene duplication.

  11. Genome-wide identification and comparison of legume MLO gene family.

    PubMed

    Rispail, Nicolas; Rubiales, Diego

    2016-09-06

    MLO proteins are highly conserved proteins with seven trans-membrane domains. Specific MLO genes have been linked to plant disease susceptibility. Others are involved in plant reproduction and in root thigmomorphogenesis. Functions of the remaining MLOs are still unknown. Here we performed a genome-wide survey of the MLO family in eight legume species from different clades of the Papillionoideae sub-family. A total of 118 MLO sequences were identified and characterized. Their deduced protein sequences shared the characteristics of MLO proteins. The total number of MLO genes per legume species varied from 13 to 20 depending on the species. Legume MLOs were evenly distributed over their genomes and tended to localize within syntenic blocks conserved across legume genomes. Phylogenetic analysis indicated that these sequences clustered in seven well-defined clades. Comparison of MLO protein sequences revealed 34 clade-specific motifs in the variable regions of the proteins. Comparative analyses of the MLO family between legume species also uncovered several evolutionary differences between the tropical legume species from the Phaseoloid clades and the other legume species. Altogether, this study provides interesting new features on the evolution of the MLO family. It also provides valuable clues to identify additional MLO genes from non-sequenced species.

  12. Genome-wide gene-environment interactions on quantitative traits using family data.

    PubMed

    Sitlani, Colleen M; Dupuis, Josée; Rice, Kenneth M; Sun, Fangui; Pitsillides, Achilleas N; Cupples, L Adrienne; Psaty, Bruce M

    2016-07-01

    Gene-environment interactions may provide a mechanism for targeting interventions to those individuals who would gain the most benefit from them. Searching for interactions agnostically on a genome-wide scale requires large sample sizes, often achieved through collaboration among multiple studies in a consortium. Family studies can contribute to consortia, but to do so they must account for correlation within families by using specialized analytic methods. In this paper, we investigate the performance of methods that account for within-family correlation, in the context of gene-environment interactions with binary exposures and quantitative outcomes. We simulate both cross-sectional and longitudinal measurements, and analyze the simulated data taking family structure into account, via generalized estimating equations (GEE) and linear mixed-effects models. With sufficient exposure prevalence and correct model specification, all methods perform well. However, when models are misspecified, mixed modeling approaches have seriously inflated type I error rates. GEE methods with robust variance estimates are less sensitive to model misspecification; however, when exposures are infrequent, GEE methods require modifications to preserve type I error rate. We illustrate the practical use of these methods by evaluating gene-drug interactions on fasting glucose levels in data from the Framingham Heart Study, a cohort that includes related individuals.

  13. Genome-wide identification and expression profiling of DNA methyltransferase gene family in maize.

    PubMed

    Qian, Yexiong; Xi, Yilong; Cheng, Beijiu; Zhu, Suwen

    2014-10-01

    In this study, we identified eight DNA MTase genes in maize and the diversity of expression patterns of them was presented by EST mining, microarray and semi-quantitative expression profile analyses. DNA methylation plays a pivotal role in promoting genomic stability through diverse biological processes including regulation of gene expression during development and chromatin organization. Although this important biological process is mainly regulated by several conserved Cytosine-5 DNA methyltransferases encoded by a smaller multigene family in plants, investigation of the plant C5-MTase-encoding gene family will serve to elucidate the epigenetic mechanism diversity in plants. Recently, genome-wide identification and evolutionary analyses of the C5-MTase-encoding gene family have been characterized in multiple plant species including Arabidopsis, rice, carrot and wheat. However, little is known regarding the C5-MTase-encoding genes in the entire maize genome. Here, genome-wide identification and expression profile analyses of maize C5-MTase-encoding genes (ZmMETs) were performed from the latest version of the maize (B73) genome. Phylogenetic analysis indicated that the orthologs from the three species (maize, Arabidopsis and rice) were categorized into four classes. Chromosomal location of these genes revealed that they are unevenly distributed on 6 of all 10 chromosomes with three chromosomal/segmental duplication events, suggesting that gene duplication played a key role in expansion of the maize C5-MTase-encoding gene family. Furthermore, EST expression data mining, microarray data and semi-quantitative expression profile analyses detected in the leaves by two different abiotic stress treatments have demonstrated that these genes had temporal and spatial expression pattern and exhibited different expression levels in stress treatments, suggesting that functional diversification of ZmMET genes family. Overall, our study will serve to present signification

  14. Genome-Wide Analysis of the Aquaporin Gene Family in Chickpea (Cicer arietinum L.).

    PubMed

    Deokar, Amit A; Tar'an, Bunyamin

    2016-01-01

    Aquaporins (AQPs) are essential membrane proteins that play critical role in the transport of water and many other solutes across cell membranes. In this study, a comprehensive genome-wide analysis identified 40 AQP genes in chickpea (Cicer arietinum L.). A complete overview of the chickpea AQP (CaAQP) gene family is presented, including their chromosomal locations, gene structure, phylogeny, gene duplication, conserved functional motifs, gene expression, and conserved promoter motifs. To understand AQP's evolution, a comparative analysis of chickpea AQPs with AQP orthologs from soybean, Medicago, common bean, and Arabidopsis was performed. The chickpea AQP genes were found on all of the chickpea chromosomes, except chromosome 7, with a maximum of six genes on chromosome 6, and a minimum of one gene on chromosome 5. Gene duplication analysis indicated that the expansion of chickpea AQP gene family might have been due to segmental and tandem duplications. CaAQPs were grouped into four subfamilies including 15 NOD26-like intrinsic proteins (NIPs), 13 tonoplast intrinsic proteins (TIPs), eight plasma membrane intrinsic proteins (PIPs), and four small basic intrinsic proteins (SIPs) based on sequence similarities and phylogenetic position. Gene structure analysis revealed a highly conserved exon-intron pattern within CaAQP subfamilies supporting the CaAQP family classification. Functional prediction based on conserved Ar/R selectivity filters, Froger's residues, and specificity-determining positions suggested wide differences in substrate specificity among the subfamilies of CaAQPs. Expression analysis of the AQP genes indicated that some of the genes are tissue-specific, whereas few other AQP genes showed differential expression in response to biotic and abiotic stresses. Promoter profiling of CaAQP genes for conserved cis-acting regulatory elements revealed enrichment of cis-elements involved in circadian control, light response, defense and stress responsiveness

  15. Genome-Wide Analysis of the Aquaporin Gene Family in Chickpea (Cicer arietinum L.)

    PubMed Central

    Deokar, Amit A.; Tar'an, Bunyamin

    2016-01-01

    Aquaporins (AQPs) are essential membrane proteins that play critical role in the transport of water and many other solutes across cell membranes. In this study, a comprehensive genome-wide analysis identified 40 AQP genes in chickpea (Cicer arietinum L.). A complete overview of the chickpea AQP (CaAQP) gene family is presented, including their chromosomal locations, gene structure, phylogeny, gene duplication, conserved functional motifs, gene expression, and conserved promoter motifs. To understand AQP's evolution, a comparative analysis of chickpea AQPs with AQP orthologs from soybean, Medicago, common bean, and Arabidopsis was performed. The chickpea AQP genes were found on all of the chickpea chromosomes, except chromosome 7, with a maximum of six genes on chromosome 6, and a minimum of one gene on chromosome 5. Gene duplication analysis indicated that the expansion of chickpea AQP gene family might have been due to segmental and tandem duplications. CaAQPs were grouped into four subfamilies including 15 NOD26-like intrinsic proteins (NIPs), 13 tonoplast intrinsic proteins (TIPs), eight plasma membrane intrinsic proteins (PIPs), and four small basic intrinsic proteins (SIPs) based on sequence similarities and phylogenetic position. Gene structure analysis revealed a highly conserved exon-intron pattern within CaAQP subfamilies supporting the CaAQP family classification. Functional prediction based on conserved Ar/R selectivity filters, Froger's residues, and specificity-determining positions suggested wide differences in substrate specificity among the subfamilies of CaAQPs. Expression analysis of the AQP genes indicated that some of the genes are tissue-specific, whereas few other AQP genes showed differential expression in response to biotic and abiotic stresses. Promoter profiling of CaAQP genes for conserved cis-acting regulatory elements revealed enrichment of cis-elements involved in circadian control, light response, defense and stress responsiveness

  16. Family-Based Genome-Wide Association Scan of Attention-Deficit/Hyperactivity Disorder

    ERIC Educational Resources Information Center

    Mick, Eric; Todorov, Alexandre; Smalley, Susan; Hu, Xiaolan; Loo, Sandra; Todd, Richard D.; Biederman, Joseph; Byrne, Deirdre; Dechairo, Bryan; Guiney, Allan; McCracken, James; McGough, James; Nelson, Stanley F.; Reiersen, Angela M.; Wilens, Timothy E.; Wozniak, Janet; Neale, Benjamin M.; Faraone, Stephen V.

    2010-01-01

    Objective: Genes likely play a substantial role in the etiology of attention-deficit/hyperactivity disorder (ADHD). However, the genetic architecture of the disorder is unknown, and prior genome-wide association studies (GWAS) have not identified a genome-wide significant association. We have conducted a third, independent, multisite GWAS of…

  17. Family-Based Genome-Wide Association Scan of Attention-Deficit/Hyperactivity Disorder

    ERIC Educational Resources Information Center

    Mick, Eric; Todorov, Alexandre; Smalley, Susan; Hu, Xiaolan; Loo, Sandra; Todd, Richard D.; Biederman, Joseph; Byrne, Deirdre; Dechairo, Bryan; Guiney, Allan; McCracken, James; McGough, James; Nelson, Stanley F.; Reiersen, Angela M.; Wilens, Timothy E.; Wozniak, Janet; Neale, Benjamin M.; Faraone, Stephen V.

    2010-01-01

    Objective: Genes likely play a substantial role in the etiology of attention-deficit/hyperactivity disorder (ADHD). However, the genetic architecture of the disorder is unknown, and prior genome-wide association studies (GWAS) have not identified a genome-wide significant association. We have conducted a third, independent, multisite GWAS of…

  18. Genome-Wide Identification and Expression Analysis of WRKY Gene Family in Capsicum annuum L.

    PubMed

    Diao, Wei-Ping; Snyder, John C; Wang, Shu-Bin; Liu, Jin-Bing; Pan, Bao-Gui; Guo, Guang-Jun; Wei, Ge

    2016-01-01

    The WRKY family of transcription factors is one of the most important families of plant transcriptional regulators with members regulating multiple biological processes, especially in regulating defense against biotic and abiotic stresses. However, little information is available about WRKYs in pepper (Capsicum annuum L.). The recent release of completely assembled genome sequences of pepper allowed us to perform a genome-wide investigation for pepper WRKY proteins. In the present study, a total of 71 WRKY genes were identified in the pepper genome. According to structural features of their encoded proteins, the pepper WRKY genes (CaWRKY) were classified into three main groups, with the second group further divided into five subgroups. Genome mapping analysis revealed that CaWRKY were enriched on four chromosomes, especially on chromosome 1, and 15.5% of the family members were tandemly duplicated genes. A phylogenetic tree was constructed depending on WRKY domain' sequences derived from pepper and Arabidopsis. The expression of 21 selected CaWRKY genes in response to seven different biotic and abiotic stresses (salt, heat shock, drought, Phytophtora capsici, SA, MeJA, and ABA) was evaluated by quantitative RT-PCR; Some CaWRKYs were highly expressed and up-regulated by stress treatment. Our results will provide a platform for functional identification and molecular breeding studies of WRKY genes in pepper.

  19. Genome-Wide Identification and Expression Analysis of WRKY Gene Family in Capsicum annuum L.

    PubMed Central

    Diao, Wei-Ping; Snyder, John C.; Wang, Shu-Bin; Liu, Jin-Bing; Pan, Bao-Gui; Guo, Guang-Jun; Wei, Ge

    2016-01-01

    The WRKY family of transcription factors is one of the most important families of plant transcriptional regulators with members regulating multiple biological processes, especially in regulating defense against biotic and abiotic stresses. However, little information is available about WRKYs in pepper (Capsicum annuum L.). The recent release of completely assembled genome sequences of pepper allowed us to perform a genome-wide investigation for pepper WRKY proteins. In the present study, a total of 71 WRKY genes were identified in the pepper genome. According to structural features of their encoded proteins, the pepper WRKY genes (CaWRKY) were classified into three main groups, with the second group further divided into five subgroups. Genome mapping analysis revealed that CaWRKY were enriched on four chromosomes, especially on chromosome 1, and 15.5% of the family members were tandemly duplicated genes. A phylogenetic tree was constructed depending on WRKY domain' sequences derived from pepper and Arabidopsis. The expression of 21 selected CaWRKY genes in response to seven different biotic and abiotic stresses (salt, heat shock, drought, Phytophtora capsici, SA, MeJA, and ABA) was evaluated by quantitative RT-PCR; Some CaWRKYs were highly expressed and up-regulated by stress treatment. Our results will provide a platform for functional identification and molecular breeding studies of WRKY genes in pepper. PMID:26941768

  20. Genome-Wide Identification and Expression of Xenopus F-Box Family of Proteins.

    PubMed

    Saritas-Yildirim, Banu; Pliner, Hannah A; Ochoa, Angelica; Silva, Elena M

    2015-01-01

    Protein degradation via the multistep ubiquitin/26S proteasome pathway is a rapid way to alter the protein profile and drive cell processes and developmental changes. Many key regulators of embryonic development are targeted for degradation by E3 ubiquitin ligases. The most studied family of E3 ubiquitin ligases is the SCF ubiquitin ligases, which use F-box adaptor proteins to recognize and recruit target proteins. Here, we used a bioinformatics screen and phylogenetic analysis to identify and annotate the family of F-box proteins in the Xenopus tropicalis genome. To shed light on the function of the F-box proteins, we analyzed expression of F-box genes during early stages of Xenopus development. Many F-box genes are broadly expressed with expression domains localized to diverse tissues including brain, spinal cord, eye, neural crest derivatives, somites, kidneys, and heart. All together, our genome-wide identification and expression profiling of the Xenopus F-box family of proteins provide a foundation for future research aimed to identify the precise role of F-box dependent E3 ubiquitin ligases and their targets in the regulatory circuits of development.

  1. Genome-wide identification of the expansin gene family in tobacco (Nicotiana tabacum).

    PubMed

    Ding, Anming; Marowa, Prince; Kong, Yingzhen

    2016-10-01

    Expansins are pH-dependent cell wall loosening proteins which form a large family in plants. They have been shown to be involved in various developmental processes and been implicated in enabling plants' ability to absorb nutrients from the soil as well as conferring biotic and abiotic stress resistances. It is therefore clear that they can be potential targets in genetic engineering for crop improvement. Tobacco (Nicotiana tabacum) is a major crop species as well as a model organism. Considering that only a few tobacco expansins have been studied, a genome-wide analysis of the tobacco expansin gene family is necessary. In this study, we identified 52 expansins in tobacco, which were classified into four subfamilies: 36 NtEXPAs, 6 NtEXPBs, 3 NtEXLAs and 7 NtEXLBs. Compared to other species, the NtEXLB subfamily size was relatively larger. Phylogenetic analysis showed that the 52 tobacco expansins were divided into 13 subgroups. Gene structure analysis revealed that genes within subfamilies/subgroups exhibited similar characteristics such as gene structure and protein motif arrangement. Whole-genome duplication and tandem duplication events may have played important roles in the expanding of tobacco expansins. Cis-Acting element analysis revealed that each expansin gene was regulated or several expansin genes were co-regulated by both internal and environmental factors. 35 of these genes were identified as being expressed according to a microarray analysis. In contrast to most NtEXPAs which had higher expression levels in young organs, NtEXLAs and NtEXLBs were preferentially expressed in mature or senescent tissues, suggesting that they might play different roles in different organs or at different developmental stages. As the first step towards genome-wide analysis of the tobacco expansin gene family, our work provides solid background information related to structure, evolution and expression as well as regulatory cis-acting elements of the tobacco expansins. This

  2. Genome-wide analysis of the TPX2 family proteins in Eucalyptus grandis.

    PubMed

    Du, Pingzhou; Kumar, Manoj; Yao, Yuan; Xie, Qiaoli; Wang, Jinyan; Zhang, Baolong; Gan, Siming; Wang, Yuqi; Wu, Ai-Min

    2016-11-24

    The Xklp2 (TPX2) proteins belong to the microtubule-associated (MAP) family of proteins. All members of the family contain the conserved TPX2 motif, which can interact with microtubules, regulate microtubule dynamics or assist with different microtubule functions, for example, maintenance of cell morphology or regulation of cell growth and development. However, the role of members of the TPX family have not been studied in the model tree species Eucalyptus to date. Here, we report the identification of the members of the TPX2 family in Eucalyptus grandis (Eg) and analyse the expression patterns and functions of these genes. In present study, a comprehensive analysis of the plant TPX2 family proteins was performed. Phylogenetic analyses indicated that the genes can be classified into 6 distinct subfamilies. A genome-wide survey identified 12 members of the TPX2 family in the sequenced genome of Eucalyptus grandis. The basic genetic properties of the TPX2 family in Eucalyptus were analysed. Our results suggest that the TPX2 family proteins within different sub-groups are relatively conserved but there are important differences between groups. Quantitative real-time PCR (qRT-PCR) was performed to confirm the expression levels of the genes in different tissues. The results showed that in the whole plant, the levels of EgWDL5 transcript are the highest, followed by those of EgWDL4. Compared with other tissues, the level of the EgMAP20 transcript is the highest in the root. Over-expression of EgMAP20 in Arabidopsis resulted in organ twisting. The cotyledon petioles showed left-handed twisting while the hypocotyl epidermal cells produced right-handed helical twisting. Finally, EgMAP20, EgWDL3 and EgWDL3L were all able to decorate microtubules. Plant TPX2 family proteins were systematically analysed using bioinformatics methods. There are 12 TPX2 family proteins in Eucalyptus. We have performed an initial characterization of the functions of several members of the TPX2

  3. Genome-Wide Identification and Analysis of the TIFY Gene Family in Grape

    PubMed Central

    Zhang, Yucheng; Gao, Min; Singer, Stacy D.; Fei, Zhangjun; Wang, Hua; Wang, Xiping

    2012-01-01

    Background The TIFY gene family constitutes a plant-specific group of genes with a broad range of functions. This family encodes four subfamilies of proteins, including ZML, TIFY, PPD and JASMONATE ZIM-Domain (JAZ) proteins. JAZ proteins are targets of the SCFCOI1 complex, and function as negative regulators in the JA signaling pathway. Recently, it has been reported in both Arabidopsis and rice that TIFY genes, and especially JAZ genes, may be involved in plant defense against insect feeding, wounding, pathogens and abiotic stresses. Nonetheless, knowledge concerning the specific expression patterns and evolutionary history of plant TIFY family members is limited, especially in a woody species such as grape. Methodology/Principal Findings A total of two TIFY, four ZML, two PPD and 11 JAZ genes were identified in the Vitis vinifera genome. Phylogenetic analysis of TIFY protein sequences from grape, Arabidopsis and rice indicated that the grape TIFY proteins are more closely related to those of Arabidopsis than those of rice. Both segmental and tandem duplication events have been major contributors to the expansion of the grape TIFY family. In addition, synteny analysis between grape and Arabidopsis demonstrated that homologues of several grape TIFY genes were found in the corresponding syntenic blocks of Arabidopsis, suggesting that these genes arose before the divergence of lineages that led to grape and Arabidopsis. Analyses of microarray and quantitative real-time RT-PCR expression data revealed that grape TIFY genes are not a major player in the defense against biotrophic pathogens or viruses. However, many of these genes were responsive to JA and ABA, but not SA or ET. Conclusion The genome-wide identification, evolutionary and expression analyses of grape TIFY genes should facilitate further research of this gene family and provide new insights regarding their evolutionary history and regulatory control. PMID:22984514

  4. Genome-wide identification and characterization of the Dof gene family in Medicago truncatula.

    PubMed

    Shu, Y J; Song, L L; Zhang, J; Liu, Y; Guo, C H

    2015-09-09

    The DNA-binding one zinc finger (Dof) family is a classic plant-specific zinc-finger transcription factor family, which is involved in many important processes, including seed maturation and germination, plant growth and development, and light responses. Investigation of the Medicago truncatula genome revealed 42 putative Dof genes, each of which holds one Dof domain. These genes were classified into four groups based on phylogenetic analysis, which are similar to the groups reported for Arabidopsis and rice. Based on genome duplication analysis, it was found that the MtDof genes were distributed on all chromosomes and had expanded through tandem gene duplication and segmental duplication events. Two main duplication regions were identified, one from tandem duplication and another from segmental duplication. By analyzing high-throughput sequencing data from M. truncatula, we found that most of the MtDof genes showed specific expression patterns in different tissues. According to cis-regulatory element analysis, these MtDof genes are regulated by different cis-acting motifs, which are important for the functional divergence of the MtDof genes in different processes. Thus, using genome-wide identification, evolution, and expression pattern analysis of the Dof genes in M. truncatula, our study provides valuable information for understanding the potential function of the Dof genes in regulating the growth and development of M. truncatula.

  5. Statistical challenges for genome-wide association studies of suicidality using family data

    PubMed Central

    Lasky-Su, Jessica; Lange, Christoph

    2010-01-01

    The etiology of suicide is complex in nature with both environmental and genetic causes that are extremely diverse. This extensive heterogeneity weakens the relationship between genotype and phenotype and as a result, we face many challenges when studying the genetic etiology of suicide. We are now in the midst of a genetics revolution, where genotyping costs are decreasing and genotyping speed is increasing at a fast rate, allowing genetic association studies to genotype thousands to millions of SNPs that cover the entire human genome. As such, genome-wide association studies (GWAS) are now the norm. In this article we address several statistical challenges that occur when studying the genetic etiology of suicidality in the age of the genetics revolution. These challenges include 1) the large number of statistical tests; 2) complex phenotypes that are difficult to quantify; and 3) modest genetic effect sizes. We address these statistical issues in the context of family-based study designs. Specifically we discuss several statistical extensions of family-based association tests (FBATs) that work to alleviate these challenges. As our intention it to describe how statistical methodology may work to identify disease variants for suicidality, we avoid the mathematical details of the methodologies presented. PMID:20447807

  6. Genome-wide linkage analysis replicates susceptibility locus for fasting plasma triglycerides: NHLBI Family Heart Study.

    PubMed

    Arnett, Donna K; Miller, Michael B; Coon, Hilary; Ellison, R Curtis; North, Kari E; Province, Michael; Leppert, Mark; Eckfeldt, John H

    2004-11-01

    Recent reports implicate chromosomal regions linked to inter-individual variation in plasma triglycerides. We conducted genome-wide scans to replicate these linkages and/or identify other loci influencing plasma triglycerides in the NHLBI Family Heart Study (FHS). Data were obtained for 501 three-generational families. Genotyping was done by the Utah Molecular Genetics Laboratory and NHLBI Mammalian Genotyping Service; markers from both were placed on one genetic map. Analysis was done using multipoint variance components linkage. Fasting plasma triglycerides were log-transformed and age-, sex-, and field center-adjusted; suggestive linkage evidence was found on chromosome 8 (LOD=2.80 at 89 cM, marker D8S1141). Further adjustment for waist girth, BMI, diabetes, hypertension, and lipid-lowering drugs suggested linkage regions on chromosomes 6 (LOD=2.29 at 79 cM, marker D6S295) and 15 (LOD=1.85 at 43 cM, marker D15S659). Since HDL is correlated with triglycerides and because it was linked to this region on chromosome 15 in FHS, we created a composite triglyceride-HDL phenotype. The combined phenotype LOD score was 3.0 at the same marker on chromosome 15. Chromosome 15 likely harbors a susceptibility locus with an influence on triglycerides and HDL. Regions on chromosomes 6 and 8 may also contain loci contributing to inter-individual variation in plasma triglycerides.

  7. Genome-wide linkage analysis in families with infantile hypertrophic pyloric stenosis indicates novel susceptibility loci.

    PubMed

    Svenningsson, Anna; Söderhäll, Cilla; Persson, Sofia; Lundberg, Fredrik; Luthman, Holger; Chung, Eddie; Gardiner, Mark; Kockum, Ingrid; Nordenskjöld, Agneta

    2012-02-01

    Infantile hypertrophic pyloric stenosis (IHPS) is a common cause of upper gastrointestinal obstruction during infancy. A multifactorial background of the disease is well established. Multiple susceptibility loci including the neuronal nitric oxide synthase (NOS1) gene have previously been linked to IHPS, but contradictory results of linkage studies in different materials indicate genetic heterogeneity. To identify IHPS susceptibility loci, we conducted a genome-wide linkage analysis in 37 Swedish families. In regions where the Swedish material showed most evidence in favor of linkage, 31 additional British IHPS families were analyzed. Evidence in favor of significant linkage was observed in the Swedish material to two loci on chromosome 2q24 (non-parametric linkage (NPL) =3.77) and 7p21 (NPL=4.55). In addition, evidence of suggestive linkage was found to two loci on chromosome 6p21 (NPL=2.97) and 12q24 (NPL=2.63). Extending the material with British samples did not enhance the level of significance. Regions with linkage harbor interesting candidate genes, such as glucagon-like peptide-2 (GLP-2 encoded by the glucagon gene GCG), NOS1, motilin (MLN) and neuropeptide Y (NPY). The coding exons for GLP-2, and NPY were screened for mutations with negative results. In conclusion, we could confirm suggestive linkage to the region harboring the NOS1 gene and detected additional novel susceptibility loci for IHPS.

  8. Genome-wide analysis of the MADS-box gene family in Brachypodium distachyon.

    PubMed

    Wei, Bo; Zhang, Rong-Zhi; Guo, Juan-Juan; Liu, Dan-Mei; Li, Ai-Li; Fan, Ren-Chun; Mao, Long; Zhang, Xiang-Qi

    2014-01-01

    MADS-box genes are important transcription factors for plant development, especially floral organogenesis. Brachypodium distachyon is a model for biofuel plants and temperate grasses such as wheat and barley, but a comprehensive analysis of MADS-box family proteins in Brachypodium is still missing. We report here a genome-wide analysis of the MADS-box gene family in Brachypodium distachyon. We identified 57 MADS-box genes and classified them into 32 MIKC(c)-type, 7 MIKC*-type, 9 Mα, 7 Mβ and 2 Mγ MADS-box genes according to their phylogenetic relationships to the Arabidopsis and rice MADS-box genes. Detailed gene structure and motif distribution were then studied. Investigation of their chromosomal localizations revealed that Brachypodium MADS-box genes distributed evenly across five chromosomes. In addition, five pairs of type II MADS-box genes were found on synteny blocks derived from whole genome duplication blocks. We then performed a systematic expression analysis of Brachypodium MADS-box genes in various tissues, particular floral organs. Further detection under salt, drought, and low-temperature conditions showed that some MADS-box genes may also be involved in abiotic stress responses, including type I genes. Comparative studies of MADS-box genes among Brachypodium, rice and Arabidopsis showed that Brachypodium had fewer gene duplication events. Taken together, this work provides useful data for further functional studies of MADS-box genes in Brachypodium distachyon.

  9. Genome-wide analysis of histone modifications by ChIP-chip to identify silenced genes in gastric cancer.

    PubMed

    Zhu, Xinjiang; Liu, Jian; Xu, Xiaoyang; Zhang, Chundong; Dai, Dongqiu

    2015-05-01

    The present study aimed to identify novel histone modification markers in gastric cancer (GC) by chromatin immunoprecipitation microarray (ChIP-chip) analysis and to determine whether these markers were able to discriminate between normal and GC cells. We also tested for correlations with DNA methylation. We probed a human CpG island microarray with DNA from a GC cell line (MKN45) by chromatin immunoprecipitation (ChIP). ChIP-reverse-transcriptase quantitative polymerase chain reaction PCR (RT-qPCR) was used to validate the microarray results. Additionally, mRNA expression levels and the DNA methylation of potential target genes were evaluated by RT-qPCR and methylation-specific PCR (MSP). The moults showed that 134 genes exhibited the highest signal-to-noise ratio of H3-K9 trimethylation over acetylation and 46 genes exhibited the highest signal-to-noise ratio of H3-K9 trimethylation over H3-K4 trimethylation in MKN45 cells. The ChIP-qPCR results agreed with those obtained from the ChIP-chip analysis. Aberrant DNA methylation status and mRNA expression levels were also identified for selected genes (PSD, SMARCC1 and Vps37A) in the GC cell lines. The results suggest that CpG island microarray coupled with ChIP (ChIP-chip) can identify novel targets of gene silencing in GC. Additionally, ChIP-chip is the best approach for assessing the genome-wide status of epigenetic regulation, which may allow for a broader genomic understanding compared to the knowledge that has been accumulated from single-gene studies.

  10. Genome-wide linkage scan for submaximal exercise heart rate in the HERITAGE family study.

    PubMed

    Spielmann, Nadine; Leon, Arthur S; Rao, D C; Rice, Treva; Skinner, James S; Rankinen, Tuomo; Bouchard, Claude

    2007-12-01

    The purpose of this study was to identify regions of the human genome linked to submaximal exercise heart rates in the sedentary state and in response to a standardized 20-wk endurance training program in blacks and whites of the HERITAGE Family Study. A total of 701 polymorphic markers covering the 22 autosomes were used in the genome-wide linkage scan, with 328 sibling pairs from 99 white nuclear families and 102 pairs from 115 black family units. Steady-state heart rates were measured at the relative intensity of 60% maximal oxygen uptake (HR60) and at the absolute intensity of 50 W (HR50). Baseline phenotypes were adjusted for age, sex, and baseline body mass index (BMI) and training responses (posttraining minus baseline, Delta) were adjusted for age, sex, baseline BMI, and baseline value of the phenotype. Two analytic strategies were used, a multipoint variance components and a regression-based multipoint linkage analysis. In whites, promising linkages (LOD > 1.75) were identified on 18q21-q22 for baseline HR50 (LOD = 2.64; P = 0.0002) and DeltaHR60 (LOD = 2.10; P = 0.0009) and on chromosome 2q33.3 for DeltaHR50 (LOD = 2.13; P = 0.0009). In blacks, evidence of promising linkage for baseline HR50 was detected with several markers within the chromosomal region 10q24-q25.3 (peak LOD = 2.43, P = 0.0004 with D10S597). The most promising regions for fine mapping in the HERITAGE Family Study were found on 2q33 for HR50 training response in whites, on 10q25-26 for baseline HR60 in blacks, and on 18q21-22 for both baseline HR50 and DeltaHR60 in whites.

  11. Genome-wide identification and characterization of aquaporin gene family in common bean (Phaseolus vulgaris L.).

    PubMed

    Ariani, Andrea; Gepts, Paul

    2015-10-01

    Plant aquaporins are a large and diverse family of water channel proteins that are essential for several physiological processes in living organisms. Numerous studies have linked plant aquaporins with a plethora of processes, such as nutrient acquisition, CO2 transport, plant growth and development, and response to abiotic stresses. However, little is known about this protein family in common bean. Here, we present a genome-wide identification of the aquaporin gene family in common bean (Phaseolus vulgaris L.), a legume crop essential for human nutrition. We identified 41 full-length coding aquaporin sequences in the common bean genome, divided by phylogenetic analysis into five sub-families (PIPs, TIPs, NIPs, SIPs and XIPs). Residues determining substrate specificity of aquaporins (i.e., NPA motifs and ar/R selectivity filter) seem conserved between common bean and other plant species, allowing inference of substrate specificity for these proteins. Thanks to the availability of RNA-sequencing datasets, expression levels in different organs and in leaves of wild and domesticated bean accessions were evaluated. Three aquaporins (PvTIP1;1, PvPIP2;4 and PvPIP1;2) have the overall highest mean expressions, with PvTIP1;1 having the highest expression among all aquaporins. We performed an EST database mining to identify drought-responsive aquaporins in common bean. This analysis showed a significant increase in expression for PvTIP1;1 in drought stress conditions compared to well-watered environments. The pivotal role suggested for PvTIP1;1 in regulating water homeostasis and drought stress response in the common bean should be verified by further field experimentation under drought stress.

  12. Genome-Wide Identification and Characterization of WRKY Gene Family in Peanut

    PubMed Central

    Song, Hui; Wang, Pengfei; Lin, Jer-Young; Zhao, Chuanzhi; Bi, Yuping; Wang, Xingjun

    2016-01-01

    WRKY, an important transcription factor family, is widely distributed in the plant kingdom. Many reports focused on analysis of phylogenetic relationship and biological function of WRKY protein at the whole genome level in different plant species. However, little is known about WRKY proteins in the genome of Arachis species and their response to salicylic acid (SA) and jasmonic acid (JA) treatment. In this study, we identified 77 and 75 WRKY proteins from the two wild ancestral diploid genomes of cultivated tetraploid peanut, Arachis duranensis and Arachis ipaënsis, using bioinformatics approaches. Most peanut WRKY coding genes were located on A. duranensis chromosome A6 and A. ipaënsis chromosome B3, while the least number of WRKY genes was found in chromosome 9. The WRKY orthologous gene pairs in A. duranensis and A. ipaënsis chromosomes were highly syntenic. Our analysis indicated that segmental duplication events played a major role in AdWRKY and AiWRKY genes, and strong purifying selection was observed in gene duplication pairs. Furthermore, we translate the knowledge gained from the genome-wide analysis result of wild ancestral peanut to cultivated peanut to reveal that gene activities of specific cultivated peanut WRKY gene were changed due to SA and JA treatment. Peanut WRKY7, 8 and 13 genes were down-regulated, whereas WRKY1 and 12 genes were up-regulated with SA and JA treatment. These results could provide valuable information for peanut improvement. PMID:27200012

  13. Genome-Wide Identification and Characterization of WRKY Gene Family in Peanut.

    PubMed

    Song, Hui; Wang, Pengfei; Lin, Jer-Young; Zhao, Chuanzhi; Bi, Yuping; Wang, Xingjun

    2016-01-01

    WRKY, an important transcription factor family, is widely distributed in the plant kingdom. Many reports focused on analysis of phylogenetic relationship and biological function of WRKY protein at the whole genome level in different plant species. However, little is known about WRKY proteins in the genome of Arachis species and their response to salicylic acid (SA) and jasmonic acid (JA) treatment. In this study, we identified 77 and 75 WRKY proteins from the two wild ancestral diploid genomes of cultivated tetraploid peanut, Arachis duranensis and Arachis ipaënsis, using bioinformatics approaches. Most peanut WRKY coding genes were located on A. duranensis chromosome A6 and A. ipaënsis chromosome B3, while the least number of WRKY genes was found in chromosome 9. The WRKY orthologous gene pairs in A. duranensis and A. ipaënsis chromosomes were highly syntenic. Our analysis indicated that segmental duplication events played a major role in AdWRKY and AiWRKY genes, and strong purifying selection was observed in gene duplication pairs. Furthermore, we translate the knowledge gained from the genome-wide analysis result of wild ancestral peanut to cultivated peanut to reveal that gene activities of specific cultivated peanut WRKY gene were changed due to SA and JA treatment. Peanut WRKY7, 8 and 13 genes were down-regulated, whereas WRKY1 and 12 genes were up-regulated with SA and JA treatment. These results could provide valuable information for peanut improvement.

  14. Genome-wide characterization and comparative analysis of the MLO gene family in cotton.

    PubMed

    Wang, Xiaoyan; Ma, Qifeng; Dou, Lingling; Liu, Zhen; Peng, Renhai; Yu, Shuxun

    2016-06-01

    In plants, MLO (Mildew Locus O) gene encodes a plant-specific seven transmembrane (TM) domain protein involved in several cellular processes, including susceptibility to powdery mildew (PM). In this study, a genome-wide characterization of the MLO gene family in G. raimondii L., G. arboreum L. and G. hirsutum L. was performed. In total, 22, 17 and 38 homologous sequences were identified for each species, respectively. Gene organization, including chromosomal location, gene clustering and gene duplication, was investigated. Homologues related to PM susceptibility in upland cotton were inferred by phylogenetic relationships with functionally characterized MLO proteins. To conduct a comparative analysis between MLO candidate genes from G. raimondii L., G. arboreum L. and G. hirsutum L., orthologous relationships and conserved synteny blocks were constructed. The transcriptional variation of 38 GhMLO genes in response to exogenous application of salt, mannitol (Man), abscisic acid (ABA), ethylene (ETH), jasmonic acid (JA) and salicylic acid (SA) was monitored. Further studies should be conducted to elucidate the functions of MLO genes in PM susceptibility and phytohormone signalling pathways.

  15. Genome-Wide Identification, Characterization and Expression Profiling of ADF Family Genes in Solanum lycopersicum L.

    PubMed

    Khatun, Khadiza; Robin, Arif Hasan Khan; Park, Jong-In; Kim, Chang Kil; Lim, Ki-Byung; Kim, Min-Bae; Lee, Do-Jin; Nou, Ill Sup; Chung, Mi-Young

    2016-09-29

    The actin depolymerizing factor (ADF) proteins have growth, development, defense-related and growth regulatory functions in plants. The present study used genome-wide analysis to investigate ADF family genes in tomato. Eleven tomato ADF genes were identified and differential expression patterns were found in different organs. SlADF6 was preferentially expressed in roots, suggesting its function in root development. SlADF1, SlADF3 and SlADF10 were predominately expressed in the flowers compared to the other organs and specifically in the stamen compared to other flower parts, indicating their potential roles in pollen development. The comparatively higher expression of SlADF3 and SlADF11 at early fruit developmental stages might implicate them in determining final fruit size. SlADF5 and SlADF8 had relatively higher levels of expression five days after the breaker stage of fruit development, suggesting their possible role in fruit ripening. Notably, six genes were induced by cold and heat, seven by drought, five by NaCl, and four each by abscisic acid (ABA), jasmonic acid (JA) and wounding treatments. The differential expression patterns of the SlADF genes under different types of stresses suggested their function in stress tolerance in tomato plants. Our results will be helpful for the functional characterization of ADF genes during organ and fruit development of tomato under different stresses.

  16. Genome-Wide Identification and Characterization of the GmSnRK2 Family in Soybean.

    PubMed

    Zhao, Wei; Cheng, Yi-Hui; Zhang, Chi; Shen, Xin-Jie; You, Qing-Bo; Guo, Wei; Li, Xiang; Song, Xue-Jiao; Zhou, Xin-An; Jiao, Yong-Qing

    2017-08-23

    Sucrose non-fermenting-1 (SNF1)-related protein kinase 2s (SnRK2s) that were reported to be involved in the transduction of abscisic acid (ABA) signaling, play important roles in response to biotic and abiotic stresses in plants. Compared to the systemic investigation of SnRK2s in Arabidopsisthaliana and Oryza sativa, little is known regarding SnRK2s in soybean, which is one of the most important oil and protein crops. In the present study, we performed genome-wide identification and characterization of GmSnRK2s in soybean. In summary, 22 GmSnRK2s were identified and clustered into four groups. Phylogenetic analysis indicated the expansion of SnRK2 gene family during the evolution of soybean. Various cis-acting elements such as ABA Response Elements (ABREs) were identified and analyzed in the promoter regions of GmSnRK2s. The results of RNA sequencing (RNA-Seq) data for different soybean tissues showed that GmSnRK2s exhibited spatio-temporally specific expression patterns during soybean growth and development. Certain GmSnRK2s could respond to the treatments including salinity, ABA and strigolactones. Our results provide a foundation for the further elucidation of the function of GmSnRK2 genes in soybean.

  17. Genome-Wide Identification, Characterization and Expression Profiling of ADF Family Genes in Solanum lycopersicum L.

    PubMed Central

    Khatun, Khadiza; Robin, Arif Hasan Khan; Park, Jong-In; Kim, Chang Kil; Lim, Ki-Byung; Kim, Min-Bae; Lee, Do-Jin; Nou, Ill Sup; Chung, Mi-Young

    2016-01-01

    The actin depolymerizing factor (ADF) proteins have growth, development, defense-related and growth regulatory functions in plants. The present study used genome-wide analysis to investigate ADF family genes in tomato. Eleven tomato ADF genes were identified and differential expression patterns were found in different organs. SlADF6 was preferentially expressed in roots, suggesting its function in root development. SlADF1, SlADF3 and SlADF10 were predominately expressed in the flowers compared to the other organs and specifically in the stamen compared to other flower parts, indicating their potential roles in pollen development. The comparatively higher expression of SlADF3 and SlADF11 at early fruit developmental stages might implicate them in determining final fruit size. SlADF5 and SlADF8 had relatively higher levels of expression five days after the breaker stage of fruit development, suggesting their possible role in fruit ripening. Notably, six genes were induced by cold and heat, seven by drought, five by NaCl, and four each by abscisic acid (ABA), jasmonic acid (JA) and wounding treatments. The differential expression patterns of the SlADF genes under different types of stresses suggested their function in stress tolerance in tomato plants. Our results will be helpful for the functional characterization of ADF genes during organ and fruit development of tomato under different stresses. PMID:27690110

  18. Genome-wide profiling of yeast DNA:RNA hybrid prone sites with DRIP-chip.

    PubMed

    Chan, Yujia A; Aristizabal, Maria J; Lu, Phoebe Y T; Luo, Zongli; Hamza, Akil; Kobor, Michael S; Stirling, Peter C; Hieter, Philip

    2014-04-01

    DNA:RNA hybrid formation is emerging as a significant cause of genome instability in biological systems ranging from bacteria to mammals. Here we describe the genome-wide distribution of DNA:RNA hybrid prone loci in Saccharomyces cerevisiae by DNA:RNA immunoprecipitation (DRIP) followed by hybridization on tiling microarray. These profiles show that DNA:RNA hybrids preferentially accumulated at rDNA, Ty1 and Ty2 transposons, telomeric repeat regions and a subset of open reading frames (ORFs). The latter are generally highly transcribed and have high GC content. Interestingly, significant DNA:RNA hybrid enrichment was also detected at genes associated with antisense transcripts. The expression of antisense-associated genes was also significantly altered upon overexpression of RNase H, which degrades the RNA in hybrids. Finally, we uncover mutant-specific differences in the DRIP profiles of a Sen1 helicase mutant, RNase H deletion mutant and Hpr1 THO complex mutant compared to wild type, suggesting different roles for these proteins in DNA:RNA hybrid biology. Our profiles of DNA:RNA hybrid prone loci provide a resource for understanding the properties of hybrid-forming regions in vivo, extend our knowledge of hybrid-mitigating enzymes, and contribute to models of antisense-mediated gene regulation. A summary of this paper was presented at the 26th International Conference on Yeast Genetics and Molecular Biology, August 2013.

  19. DNA methylation profiling: comparison of genome-wide sequencing methods and the Infinium Human Methylation 450 Bead Chip.

    PubMed

    Walker, Denise L; Bhagwate, Aditya Vijay; Baheti, Saurabh; Smalley, Regenia L; Hilker, Christopher A; Sun, Zhifu; Cunningham, Julie M

    2015-01-01

    To compare the performance of four sequence-based and one microarray methods for DNA methylation profiling. DNA from two cell lines were profiled by reduced representation bisulfite sequencing, methyl capture sequencing (SS-Meth Seq), NimbleGen SeqCapEpi CpGiant(Nimblegen MethSeq), methylated DNA immunoprecipitation (MeDIP) and the Human Methylation 450 Bead Chip (Meth450K). Despite differences in genome-wide coverage, high correlation and concordance were observed between different methods. Significant overlap of differentially methylated regions was identified between sequenced-based platforms. MeDIP provided the best coverage for the whole genome and gene body regions, while RRBS and Nimblegen MethSeq were superior for CpGs in CpG islands and promoters. Methylation analyses can be achieved by any of the five methods but understanding their differences may better address the research question being posed.

  20. Genome-wide analysis of the omega-3 fatty acid desaturase gene family in Gossypium

    DOE PAGES

    Yurchenko, Olga P.; Park, Sunjung; Ilut, Daniel C.; ...

    2014-11-18

    The majority of commercial cotton varieties planted worldwide are derived from Gossypium hirsutum, which is a naturally occurring allotetraploid produced by interspecific hybridization of A- and D-genome diploid progenitor species. While most cotton species are adapted to warm, semi-arid tropical and subtropical regions, and thus perform well in these geographical areas, cotton seedlings are sensitive to cold temperature, which can significantly reduce crop yields. One of the common biochemical responses of plants to cold temperatures is an increase in omega-3 fatty acids, which protects cellular function by maintaining membrane integrity. The purpose of our study was to identify and characterizemore » the omega-3 fatty acid desaturase (FAD) gene family in G. hirsutum, with an emphasis on identifying omega-3 FADs involved in cold temperature adaptation. Results: Eleven omega-3 FAD genes were identified in G. hirsutum, and characterization of the gene family in extant A and D diploid species (G. herbaceum and G. raimondii, respectively) allowed for unambiguous genome assignment of all homoeologs in tetraploid G. hirsutum. The omega-3 FAD family of cotton includes five distinct genes, two of which encode endoplasmic reticulum-type enzymes (FAD3-1 and FAD3-2) and three that encode chloroplast-type enzymes (FAD7/8-1, FAD7/8-2, and FAD7/8-3). The FAD3-2 gene was duplicated in the A genome progenitor species after the evolutionary split from the D progenitor, but before the interspecific hybridization event that gave rise to modern tetraploid cotton. RNA-seq analysis revealed conserved, gene-specific expression patterns in various organs and cell types and semi-quantitative RT-PCR further revealed that FAD7/8-1 was specifically induced during cold temperature treatment of G. hirsutum seedlings. Conclusions: The omega-3 FAD gene family in cotton was characterized at the genome-wide level in three species, showing relatively ancient establishment of the gene family prior

  1. A genome wide association study of fast beta EEG in families of European ancestry.

    PubMed

    Meyers, Jacquelyn L; Zhang, Jian; Manz, Niklas; Rangaswamy, Madhavi; Kamarajan, Chella; Wetherill, Leah; Chorlian, David B; Kang, Sun J; Bauer, Lance; Hesselbrock, Victor; Kramer, John; Kuperman, Samuel; Nurnberger, John I; Tischfield, Jay; Wang, Jen Chyong; Edenberg, Howard J; Goate, Alison; Foroud, Tatiana; Porjesz, Bernice

    2017-05-01

    Differences in fast beta (20-28Hz) electroencephalogram (EEG) oscillatory activity distinguish some individuals with psychiatric and substance use disorders, suggesting that it may be a useful endophenotype for studying the genetics of disorders characterized by neural hyper-excitability. Despite the high heritability estimates provided by twin and family studies, there have been relatively few genetic studies of beta EEG, and to date only one genetic association finding has replicated (i.e., GABRA2). In a sample of 1564 individuals from 117 families of European Ancestry (EA) drawn from the Collaborative Study on the Genetics of Alcoholism (COGA), we performed a Genome-Wide Association Study (GWAS) on resting-state fronto-central fast beta EEG power, adjusting regression models for family relatedness, age, sex, and ancestry. To further characterize genetic findings, we examined the functional and behavioral significance of GWAS findings. Three intronic variants located within DSE (dermatan sulfate epimerase) on 6q22 were associated with fast beta EEG at a genome wide significant level (p<5×10(-8)). The most significant SNP was rs2252790 (p<2.6×10(-8); MAF=0.36; β=0.135). rs2252790 is an eQTL for ROS1 expressed most robustly in the temporal cortex (p=1.2×10(-6)) and for DSE/TSPYL4 expressed most robustly in the hippocampus (p=7.3×10(-4); β=0.29). Previous studies have indicated that DSE is involved in a network of genes integral to membrane organization; gene-based tests indicated that several variants within this network (i.e., DSE, ZEB2, RND3, MCTP1, and CTBP2) were also associated with beta EEG (empirical p<0.05), and of these genes, ZEB2 and CTBP2 were associated with DSM-V Alcohol Use Disorder (AUD; empirical p<0.05).' In this sample of EA families enriched for AUDs, fast beta EEG is associated with variants within DSE on 6q22; the most significant SNP influences the mRNA expression of DSE and ROS1 in hippocampus and temporal cortex, brain regions

  2. Genome-wide characterization of phenylalanine ammonia-lyase gene family in watermelon (Citrullus lanatus).

    PubMed

    Dong, Chun-Juan; Shang, Qing-Mao

    2013-07-01

    Phenylalanine ammonia-lyase (PAL), the first enzyme in the phenylpropanoid pathway, plays a critical role in plant growth, development, and adaptation. PAL enzymes are encoded by a gene family in plants. Here, we report a genome-wide search for PAL genes in watermelon. A total of 12 PAL genes, designated ClPAL1-12, are identified . Nine are arranged in tandem in two duplication blocks located on chromosomes 4 and 7, and the other three ClPAL genes are distributed as single copies on chromosomes 2, 3, and 8. Both the cDNA and protein sequences of ClPALs share an overall high identity with each other. A phylogenetic analysis places 11 of the ClPALs into a separate cucurbit subclade, whereas ClPAL2, which belongs to neither monocots nor dicots, may serve as an ancestral PAL in plants. In the cucurbit subclade, seven ClPALs form homologous pairs with their counterparts from cucumber. Expression profiling reveals that 11 of the ClPAL genes are expressed and show preferential expression in the stems and male and female flowers. Six of the 12 ClPALs are moderately or strongly expressed in the fruits, particularly in the pulp, suggesting the potential roles of PAL in the development of fruit color and flavor. A promoter motif analysis of the ClPAL genes implies redundant but distinctive cis-regulatory structures for stress responsiveness. Finally, duplication events during the evolution and expansion of the ClPAL gene family are discussed, and the relationships between the ClPAL genes and their cucumber orthologs are estimated.

  3. Genome-wide identification, characterization and expression profiling of LIM family genes in Solanum lycopersicum L.

    PubMed

    Khatun, Khadiza; Robin, Arif Hasan Khan; Park, Jong-In; Ahmed, Nasar Uddin; Kim, Chang Kil; Lim, Ki-Byung; Kim, Min-Bae; Lee, Do-Jin; Nou, Ill Sup; Chung, Mi-Young

    2016-11-01

    LIM domain proteins, some of which have been shown to be actin binding proteins, are involved in various developmental activities and cellular processes in plants. To date, the molecular defense-related functions of LIM family genes have not been investigated in any solanaceous vegetable crop species. In this study, we identified 15 LIM family genes in tomato (Solanum lycopersicum L.) through genome-wide analysis and performed expression profiling in different organs of tomato, including fruits at six different developmental stages. We also performed expression profiling of selected tomato LIM genes in plants under ABA, drought, cold, NaCl and heat stress treatment. The encoded proteins of the 15 tomato LIM genes were classified into two main groups, i.e., proteins similar to cysteine-rich proteins and plant-specific DAR proteins, based on differences in functional domains and variability in their C-terminal regions. The DAR proteins contain a so far poorly characterized zinc-finger-like motif that we propose to call DAR-ZF. Six of the 15 LIM genes were expressed only in flowers, indicating that they play flower-specific roles in plants. The other nine genes were expressed in all organs and at various stages of fruit development. SlβLIM1b was expressed relatively highly at the later stage of fruit development, but three other genes, SlWLIM2a, SlDAR2 and SlDAR4, were expressed at the early stage of fruit development. Seven genes were induced by ABA, five by cold, seven by drought, eight by NaCl and seven by heat treatment respectively, indicating their possible roles in abiotic stress tolerance. Our results will be useful for functional analysis of LIM genes during fruit development in tomato plants under different abiotic stresses. Copyright © 2016. Published by Elsevier Masson SAS.

  4. Genome-wide analysis of SAUR gene family in Solanaceae species.

    PubMed

    Wu, Jian; Liu, Songyu; He, Yanjun; Guan, Xiaoyan; Zhu, Xiangfei; Cheng, Lin; Wang, Jie; Lu, Gang

    2012-11-01

    The plant hormone auxin plays a vital role in regulating many aspects of plant growth and development. Small auxin up-regulated RNAs (SAURs) are primary auxin response genes hypothesized to be involved in auxin signaling pathway, but their functions remain unclear. Here, a genome-wide search for SAUR gene homologues in Solanaceae species identified 99 and 134 members of SAUR gene family from tomato and potato, respectively. Phylogenetic analysis indicated that the SAUR proteins from Arabidopsis, rice, sorghum, tomato and potato were divided into four major groups with 16 subgroups. Among them, 25 histidine-rich SAURs genes with metal-binding characteristics were found in Arabidopsis, sorghum and Solanaceae species, but not in rice. Using tomato as a model, a comprehensive overview of SAUR gene family is presented, including the gene structures, phylogeny and chromosome locations. Quantitative real-time PCR analysis indicated that 11 randomly selected SlSAUR genes in tomato could be expressed at least in one of the tomato organs/tissues tested. However, different SlSAUR genes displayed distinctive expression levels. SlSAUR16 and SlSAUR71 exhibited highly tissue-specific expression patterns. Almost all of the detected SlSAURs showed an accumulating pattern of mRNA along tomato flower and fruit development. Some of them displayed differential response to exogenous IAA treatment. The abiotic (cold, salt and drought) stresses significantly modified transcript levels of SlSAURs genes. Most of them were down-regulated in response to abiotic stresses (drought, heat and salinity), but SlSAUR58, as a histidine-rich SAUR gene, was up-regulated after salt treatment, indicating that it may play a specific role in the salt signaling transduction pathway. Our comparative analysis provides some basic genomic information for the SAUR genes in the Solanaceae species and will pave the way for deciphering their function during plant development.

  5. Genome-Wide Identification, Characterization and Expression Analysis of the TCP Gene Family in Prunus mume.

    PubMed

    Zhou, Yuzhen; Xu, Zongda; Zhao, Kai; Yang, Weiru; Cheng, Tangren; Wang, Jia; Zhang, Qixiang

    2016-01-01

    TCP proteins, belonging to a plant-specific transcription factors family, are known to have great functions in plant development, especially flower and leaf development. However, there is little information about this gene family in Prunus mume, which is widely cultivated in China as an ornamental and fruit tree. Here a genome-wide analysis of TCP genes was performed to explore their evolution in P. mume. Nineteen PmTCPs were identified and three of them contained putative miR319 target sites. Phylogenetic and comprehensive bioinformatics analyses of these genes revealed that different types of TCP genes had undergone different evolutionary processes and the genes in the same clade had similar chromosomal location, gene structure, and conserved domains. Expression analysis of these PmTCPs indicated that there were diverse expression patterns among different clades. Most TCP genes were predominantly expressed in flower, leaf, and stem, and showed high expression levels in the different stages of flower bud differentiation, especially in petal formation stage and gametophyte development. Genes in TCP-P subfamily had main roles in both flower development and gametophyte development. The CIN genes in double petal cultivars might have key roles in the formation of petal, while they were correlated with gametophyte development in the single petal cultivar. The CYC/TB1 type genes were highly detected in the formation of petal and pistil. The less-complex flower types of P. mume might result from the fact that there were only two CYC type genes present in P. mume and a lack of CYC2 genes to control the identity of flower types. These results lay the foundation for further study on the functions of TCP genes during flower development.

  6. Genome-wide identification and analysis of the SGR gene family in Cucumis melo L.

    PubMed

    Bade, R G; Bao, M L; Jin, W Y; Ma, Y; Niu, Y D; Hasi, A

    2016-10-17

    Chlorophyll (CHL) is present in many plant organs, and its metabolism is strongly regulated throughout plant development. Understanding the fate of CHL in senescent leaves or during fruit ripening is a complex process. The stay-green (SGR) protein has been shown to affect CHL degradation. In this study, we used the conserved sequences of STAY-GREEN domain protein (NP_567673) in Arabidopsis thaliana as a probe to search SGR family genes in the genome-wide melon protein database. Four candidate SGR family genes were identified in melon (Cucumis melo L. Hetao). The phylogenetic evolution, gene structure, and conserved motifs were subsequently analyzed. In order to verify the function of CmSGR genes in CHL degradation, CmSGR1 and CmSGR2 were transiently overexpressed and silenced using different plasmids in melon. Overexpression of CmSGR1 or CmSGR2 induced leaf yellowing or fruit ripening, while silencing of CmSGR1 or CmSGR2 via RNA interference delayed CHL breakdown during fruit ripening or leaf senescence compared with the wild type. Next, the expression profile was analyzed, and we found that CmSGR genes were expressed ubiquitously. Moreover, CmSGR1 and CmSGR2 were upregulated, and promoted fruit ripening. CmSGR3 and CmSGR4 were more highly expressed in leaves, cotyledon, and stem compared with CmSGR1 or CmSGR2. Thus, we conclude that CmSGR genes are crucial for fruit ripening and leaf senescence. CmSGR protein structure and function were further clarified to provide a theoretical foundation and valuable information for improved performance of melon.

  7. Genome-Wide Identification, Characterization and Expression Analysis of the TCP Gene Family in Prunus mume

    PubMed Central

    Zhou, Yuzhen; Xu, Zongda; Zhao, Kai; Yang, Weiru; Cheng, Tangren; Wang, Jia; Zhang, Qixiang

    2016-01-01

    TCP proteins, belonging to a plant-specific transcription factors family, are known to have great functions in plant development, especially flower and leaf development. However, there is little information about this gene family in Prunus mume, which is widely cultivated in China as an ornamental and fruit tree. Here a genome-wide analysis of TCP genes was performed to explore their evolution in P. mume. Nineteen PmTCPs were identified and three of them contained putative miR319 target sites. Phylogenetic and comprehensive bioinformatics analyses of these genes revealed that different types of TCP genes had undergone different evolutionary processes and the genes in the same clade had similar chromosomal location, gene structure, and conserved domains. Expression analysis of these PmTCPs indicated that there were diverse expression patterns among different clades. Most TCP genes were predominantly expressed in flower, leaf, and stem, and showed high expression levels in the different stages of flower bud differentiation, especially in petal formation stage and gametophyte development. Genes in TCP-P subfamily had main roles in both flower development and gametophyte development. The CIN genes in double petal cultivars might have key roles in the formation of petal, while they were correlated with gametophyte development in the single petal cultivar. The CYC/TB1 type genes were highly detected in the formation of petal and pistil. The less-complex flower types of P. mume might result from the fact that there were only two CYC type genes present in P. mume and a lack of CYC2 genes to control the identity of flower types. These results lay the foundation for further study on the functions of TCP genes during flower development. PMID:27630648

  8. Genome-wide Identification and Expression Analysis of the CDPK Gene Family in Grape, Vitis spp.

    PubMed

    Zhang, Kai; Han, Yong-Tao; Zhao, Feng-Li; Hu, Yang; Gao, Yu-Rong; Ma, Yan-Fei; Zheng, Yi; Wang, Yue-Jin; Wen, Ying-Qiang

    2015-06-30

    Calcium-dependent protein kinases (CDPKs) play vital roles in plant growth and development, biotic and abiotic stress responses, and hormone signaling. Little is known about the CDPK gene family in grapevine. In this study, we performed a genome-wide analysis of the 12X grape genome (Vitis vinifera) and identified nineteen CDPK genes. Comparison of the structures of grape CDPK genes allowed us to examine their functional conservation and differentiation. Segmentally duplicated grape CDPK genes showed high structural conservation and contributed to gene family expansion. Additional comparisons between grape and Arabidopsis thaliana demonstrated that several grape CDPK genes occured in the corresponding syntenic blocks of Arabidopsis, suggesting that these genes arose before the divergence of grapevine and Arabidopsis. Phylogenetic analysis divided the grape CDPK genes into four groups. Furthermore, we examined the expression of the corresponding nineteen homologous CDPK genes in the Chinese wild grape (Vitis pseudoreticulata) under various conditions, including biotic stress, abiotic stress, and hormone treatments. The expression profiles derived from reverse transcription and quantitative PCR suggested that a large number of VpCDPKs responded to various stimuli on the transcriptional level, indicating their versatile roles in the responses to biotic and abiotic stresses. Moreover, we examined the subcellular localization of VpCDPKs by transiently expressing six VpCDPK-GFP fusion proteins in Arabidopsis mesophyll protoplasts; this revealed high variability consistent with potential functional differences. Taken as a whole, our data provide significant insights into the evolution and function of grape CDPKs and a framework for future investigation of grape CDPK genes.

  9. Genome Wide Association Study and Follow-Up Analysis of Adiposity Traits in Hispanic-Americans: the IRAS Family Study

    PubMed Central

    Norris, Jill M.; Langefeld, Carl D.; Talbert, Matthew E.; Wing, Maria R.; Haritunians, Talin; Fingerlin, Tasha E.; Hanley, Anthony J. G.; Ziegler, Julie T.; Taylor, Kent D.; Haffner, Steven M.; Chen, Yii-Der I.; Bowden, Donald W.; Wagenknecht, Lynne E.

    2010-01-01

    We investigated candidate genomic regions associated with computed tomography (CT)-derived measures of adiposity in Hispanic from the IRAS Family Study. In 1190 Hispanic individuals from 92 families from the San Luis Valley, CO and San Antonio, TX, we measured CT-derived visceral adipose tissue (VAT); subcutaneous adipose tissue (SAT); and visceral: subcutaneous ratio (VSR). A genome-wide association study (GWAS) was completed using the Illumina HumanHap 300 BeadChip (~317K single nucleotide polymorphisms (SNPs)) in 229 individuals from the San Antonio site (Stage 1). Two hundred ninety-seven SNPs with evidence for association with VAT, SAT, or VSR, adjusting for age and sex (p<0.001), were genotyped in the remaining 961 Hispanic samples. The entire Hispanic cohort (n = 1190) was then tested for association, adjusting for age, sex, site of recruitment and admixture estimates (Stage 2). In Stage 3, additional SNPs were genotyped in four genic regions showing evidence of association in Stage 2. Several SNPs were associated in the GWAS (p<1×10−5) and were confirmed to be significantly associated in the entire Hispanic cohort (p<0.01), including: rs7543757 for VAT; rs4754373, and rs11212913 for SAT; and rs4541696, and rs4134351 for VSR. Numerous SNPs were associated with multiple adiposity phenotypes. Targeted analysis of four genes whose SNPs were significant in Stage 2 suggest candidate genes for influencing the distribution (RGS6) and amount of adiposity (NGEF). Several candidate loci, including RGS6 and NGEF, are associated with CT-derived adipose fat measures in Hispanic Americans in a three-stage genetic association study. PMID:19461586

  10. Genome-wide association study and follow-up analysis of adiposity traits in Hispanic Americans: the IRAS Family Study.

    PubMed

    Norris, Jill M; Langefeld, Carl D; Talbert, Matthew E; Wing, Maria R; Haritunians, Talin; Fingerlin, Tasha E; Hanley, Anthony J G; Ziegler, Julie T; Taylor, Kent D; Haffner, Steven M; Chen, Yii-Der I; Bowden, Donald W; Wagenknecht, Lynne E

    2009-10-01

    We investigated candidate genomic regions associated with computed tomography (CT)-derived measures of adiposity in Hispanics from the Insulin Resistance Atherosclerosis Study Family Study (IRASFS). In 1,190 Hispanic individuals from 92 families 3 from the San Luis Valley, Colorado and San Antonio, Texas, we measured CT-derived visceral adipose tissue (VAT), subcutaneous adipose tissue (SAT), and visceral:subcutaneous ratio (VSR). A genome-wide association study (GWAS) was completed using the Illumina HumanHap 300 BeadChip (approximately 317K single-nucleotide polymorphisms (SNPs)) in 229 individuals from the San Antonio site (stage 1). In total, 297 SNPs with evidence for association with VAT, SAT, or VSR, adjusting for age and sex (P<0.001), were genotyped in the remaining 961 Hispanic samples. The entire Hispanic cohort (n=1,190) was then tested for association, adjusting for age, sex, site of recruitment, and admixture estimates (stage 2). In stage 3, additional SNPs were genotyped in four genic regions showing evidence of association in stage 2. Several SNPs were associated in the GWAS (P<1x10(-5)) and were confirmed to be significantly associated in the entire Hispanic cohort (P<0.01), including: rs7543757 for VAT, rs4754373 and rs11212913 for SAT, and rs4541696 and rs4134351 for VSR. Numerous SNPs were associated with multiple adiposity phenotypes. Targeted analysis of four genes whose SNPs were significant in stage 2 suggests candidate genes for influencing the distribution (RGS6) and amount of adiposity (NGEF). Several candidate loci, including RGS6 and NGEF, are associated with CT-derived adipose fat measures in Hispanic Americans in a three-stage genetic association study.

  11. Genome-wide comparative analysis of the IQD gene families in Arabidopsis thaliana and Oryza sativa.

    PubMed

    Abel, Steffen; Savchenko, Tatyana; Levy, Maggie

    2005-12-20

    Calcium signaling plays a prominent role in plants for coordinating a wide range of developmental processes and responses to environmental cues. Stimulus-specific generation of intracellular calcium transients, decoding of calcium signatures, and transformation of the signal into cellular responses are integral modules of the transduction process. Several hundred proteins with functions in calcium signaling circuits have been identified, and the number of downstream targets of calcium sensors is expected to increase. We previously identified a novel, calmodulin-binding nuclear protein, IQD1, which stimulates glucosinolate accumulation and plant defense in Arabidopsis thaliana. Here, we present a comparative genome-wide analysis of a new class of putative calmodulin target proteins in Arabidopsis and rice. We identified and analyzed 33 and 29 IQD1-like genes in Arabidopsis thaliana and Oryza sativa, respectively. The encoded IQD proteins contain a plant-specific domain of 67 conserved amino acid residues, referred to as the IQ67 domain, which is characterized by a unique and repetitive arrangement of three different calmodulin recruitment motifs, known as the IQ, 1-5-10, and 1-8-14 motifs. We demonstrated calmodulin binding for IQD20, the smallest IQD protein in Arabidopsis, which consists of a C-terminal IQ67 domain and a short N-terminal extension. A striking feature of IQD proteins is the high isoelectric point (approximately 10.3) and frequency of serine residues (approximately 11%). We compared the Arabidopsis and rice IQD gene families in terms of gene structure, chromosome location, predicted protein properties and motifs, phylogenetic relationships, and evolutionary history. The existence of an IQD-like gene in bryophytes suggests that IQD proteins are an ancient family of calmodulin-binding proteins and arose during the early evolution of land plants. Comparative phylogenetic analyses indicate that the major IQD gene lineages originated before the monocot

  12. Genome-wide comparative analysis of the IQD gene families in Arabidopsis thaliana and Oryza sativa

    PubMed Central

    Abel, Steffen; Savchenko, Tatyana; Levy, Maggie

    2005-01-01

    Background Calcium signaling plays a prominent role in plants for coordinating a wide range of developmental processes and responses to environmental cues. Stimulus-specific generation of intracellular calcium transients, decoding of calcium signatures, and transformation of the signal into cellular responses are integral modules of the transduction process. Several hundred proteins with functions in calcium signaling circuits have been identified, and the number of downstream targets of calcium sensors is expected to increase. We previously identified a novel, calmodulin-binding nuclear protein, IQD1, which stimulates glucosinolate accumulation and plant defense in Arabidopsis thaliana. Here, we present a comparative genome-wide analysis of a new class of putative calmodulin target proteins in Arabidopsis and rice. Results We identified and analyzed 33 and 29 IQD1-like genes in Arabidopsis thaliana and Oryza sativa, respectively. The encoded IQD proteins contain a plant-specific domain of 67 conserved amino acid residues, referred to as the IQ67 domain, which is characterized by a unique and repetitive arrangement of three different calmodulin recruitment motifs, known as the IQ, 1-5-10, and 1-8-14 motifs. We demonstrated calmodulin binding for IQD20, the smallest IQD protein in Arabidopsis, which consists of a C-terminal IQ67 domain and a short N-terminal extension. A striking feature of IQD proteins is the high isoelectric point (~10.3) and frequency of serine residues (~11%). We compared the Arabidopsis and rice IQD gene families in terms of gene structure, chromosome location, predicted protein properties and motifs, phylogenetic relationships, and evolutionary history. The existence of an IQD-like gene in bryophytes suggests that IQD proteins are an ancient family of calmodulin-binding proteins and arose during the early evolution of land plants. Conclusion Comparative phylogenetic analyses indicate that the major IQD gene lineages originated before the

  13. Genome-wide identification and characterization of WRKY gene family in Salix suchowensis

    PubMed Central

    Ye, Qiaolin; Yin, Tongming

    2016-01-01

    WRKY proteins are the zinc finger transcription factors that were first identified in plants. They can specifically interact with the W-box, which can be found in the promoter region of a large number of plant target genes, to regulate the expressions of downstream target genes. They also participate in diverse physiological and growing processes in plants. Prior to this study, a plenty of WRKY genes have been identified and characterized in herbaceous species, but there is no large-scale study of WRKY genes in willow. With the whole genome sequencing of Salix suchowensis, we have the opportunity to conduct the genome-wide research for willow WRKY gene family. In this study, we identified 85 WRKY genes in the willow genome and renamed them from SsWRKY1 to SsWRKY85 on the basis of their specific distributions on chromosomes. Due to their diverse structural features, the 85 willow WRKY genes could be further classified into three main groups (group I–III), with five subgroups (IIa–IIe) in group II. With the multiple sequence alignment and the manual search, we found three variations of the WRKYGQK heptapeptide: WRKYGRK, WKKYGQK and WRKYGKK, and four variations of the normal zinc finger motif, which might execute some new biological functions. In addition, the SsWRKY genes from the same subgroup share the similar exon–intron structures and conserved motif domains. Further studies of SsWRKY genes revealed that segmental duplication events (SDs) played a more prominent role in the expansion of SsWRKY genes. Distinct expression profiles of SsWRKY genes with RNA sequencing data revealed that diverse expression patterns among five tissues, including tender roots, young leaves, vegetative buds, non-lignified stems and barks. With the analyses of WRKY gene family in willow, it is not only beneficial to complete the functional and annotation information of WRKY genes family in woody plants, but also provide important references to investigate the expansion and evolution

  14. Genome-Wide Identification and Characterization of R2R3MYB Family in Cucumis sativus

    PubMed Central

    Li, Qiang; Zhang, Cunjia; Li, Jing; Wang, Lina; Ren, Zhonghai

    2012-01-01

    Background The R2R3MYB proteins comprise one of the largest families of transcription factors in plants. Although genome-wide analysis of this family has been carried out in some species, little is known about R2R3MYB genes in cucumber (Cucumis sativus L.). Principal Findings This study has identified 55 R2R3MYB genes in the latest cucumber genome and the CsR2R3MYB family contained the smallest number of identified genes compared to other species that have been studied due to the absence of recent gene duplication events. These results were also supported by genome distribution and gene duplication analysis. Phylogenetic analysis showed that they could be classified into 11 subgroups. The evolutionary relationships and the intron - exon organizations that showed similarities with Arabidopsis, Vitis and Glycine R2R3MYB proteins were also analyzed and suggested strong gene conservation but also the expansions of particular functional genes during the evolution of the plant species. In addition, we found that 8 out of 55 (∼14.54%) cucumber R2R3MYB genes underwent alternative splicing events, producing a variety of transcripts from a single gene, which illustrated the extremely high complexity of transcriptome regulation. Tissue-specific expression profiles showed that 50 cucumber R2R3MYB genes were expressed in at least one of the tissues and the other 5 genes showed very low expression in all tissues tested, which suggested that cucumber R2R3MYB genes took part in many cellular processes. The transcript abundance level analysis during abiotic conditions (NaCl, ABA and low temperature treatments) identified a group of R2R3MYB genes that responded to one or more treatments. Conclusions This study has produced a comparative genomics analysis of the cucumber R2R3MYB gene family and has provided the first steps towards the selection of CsR2R3MYB genes for cloning and functional dissection that can be used in further studies to uncover their roles in cucumber growth and

  15. Genome-wide identification and characterization of R2R3MYB family in Cucumis sativus.

    PubMed

    Li, Qiang; Zhang, Cunjia; Li, Jing; Wang, Lina; Ren, Zhonghai

    2012-01-01

    The R2R3MYB proteins comprise one of the largest families of transcription factors in plants. Although genome-wide analysis of this family has been carried out in some species, little is known about R2R3MYB genes in cucumber (Cucumis sativus L.). This study has identified 55 R2R3MYB genes in the latest cucumber genome and the CsR2R3MYB family contained the smallest number of identified genes compared to other species that have been studied due to the absence of recent gene duplication events. These results were also supported by genome distribution and gene duplication analysis. Phylogenetic analysis showed that they could be classified into 11 subgroups. The evolutionary relationships and the intron-exon organizations that showed similarities with Arabidopsis, Vitis and Glycine R2R3MYB proteins were also analyzed and suggested strong gene conservation but also the expansions of particular functional genes during the evolution of the plant species. In addition, we found that 8 out of 55 (∼14.54%) cucumber R2R3MYB genes underwent alternative splicing events, producing a variety of transcripts from a single gene, which illustrated the extremely high complexity of transcriptome regulation. Tissue-specific expression profiles showed that 50 cucumber R2R3MYB genes were expressed in at least one of the tissues and the other 5 genes showed very low expression in all tissues tested, which suggested that cucumber R2R3MYB genes took part in many cellular processes. The transcript abundance level analysis during abiotic conditions (NaCl, ABA and low temperature treatments) identified a group of R2R3MYB genes that responded to one or more treatments. This study has produced a comparative genomics analysis of the cucumber R2R3MYB gene family and has provided the first steps towards the selection of CsR2R3MYB genes for cloning and functional dissection that can be used in further studies to uncover their roles in cucumber growth and development.

  16. Genome-wide analysis of WRKY gene family in Cucumis sativus

    PubMed Central

    2011-01-01

    Background WRKY proteins are a large family of transcriptional regulators in higher plant. They are involved in many biological processes, such as plant development, metabolism, and responses to biotic and abiotic stresses. Prior to the present study, only one full-length cucumber WRKY protein had been reported. The recent publication of the draft genome sequence of cucumber allowed us to conduct a genome-wide search for cucumber WRKY proteins, and to compare these positively identified proteins with their homologs in model plants, such as Arabidopsis. Results We identified a total of 55 WRKY genes in the cucumber genome. According to structural features of their encoded proteins, the cucumber WRKY (CsWRKY) genes were classified into three groups (group 1-3). Analysis of expression profiles of CsWRKY genes indicated that 48 WRKY genes display differential expression either in their transcript abundance or in their expression patterns under normal growth conditions, and 23 WRKY genes were differentially expressed in response to at least one abiotic stresses (cold, drought or salinity). The expression profile of stress-inducible CsWRKY genes were correlated with those of their putative Arabidopsis WRKY (AtWRKY) orthologs, except for the group 3 WRKY genes. Interestingly, duplicated group 3 AtWRKY genes appear to have been under positive selection pressure during evolution. In contrast, there was no evidence of recent gene duplication or positive selection pressure among CsWRKY group 3 genes, which may have led to the expressional divergence of group 3 orthologs. Conclusions Fifty-five WRKY genes were identified in cucumber and the structure of their encoded proteins, their expression, and their evolution were examined. Considering that there has been extensive expansion of group 3 WRKY genes in angiosperms, the occurrence of different evolutionary events could explain the functional divergence of these genes. PMID:21955985

  17. Genome-wide analysis of the GH3 family in apple (Malus × domestica)

    PubMed Central

    2013-01-01

    Background Auxin plays important roles in hormone crosstalk and the plant’s stress response. The auxin-responsive Gretchen Hagen3 (GH3) gene family maintains hormonal homeostasis by conjugating excess indole-3-acetic acid (IAA), salicylic acid (SA), and jasmonic acids (JAs) to amino acids during hormone- and stress-related signaling pathways. With the sequencing of the apple (Malus × domestica) genome completed, it is possible to carry out genomic studies on GH3 genes to indentify candidates with roles in abiotic/biotic stress responses. Results Malus sieversii Roem., an apple rootstock with strong drought tolerance and the ancestral species of cultivated apple species, was used as the experimental material. Following genome-wide computational and experimental identification of MdGH3 genes, we showed that MdGH3s were differentially expressed in the leaves and roots of M. sieversii and that some of these genes were significantly induced after various phytohormone and abiotic stress treatments. Given the role of GH3 in the negative feedback regulation of free IAA concentration, we examined whether phytohormones and abiotic stresses could alter the endogenous auxin level. By analyzing the GUS activity of DR5::GUS-transformed Arabidopsis seedlings, we showed that ABA, SA, salt, and cold treatments suppressed the auxin response. These findings suggest that other phytohormones and abiotic stress factors might alter endogenous auxin levels. Conclusion Previous studies showed that GH3 genes regulate hormonal homeostasis. Our study indicated that some GH3 genes were significantly induced in M. sieversii after various phytohormone and abiotic stress treatments, and that ABA, SA, salt, and cold treatments reduce the endogenous level of axuin. Taken together, this study provides evidence that GH3 genes play important roles in the crosstalk between auxin, other phytohormones, and the abiotic stress response by maintaining auxin homeostasis. PMID:23638690

  18. Genome-wide analysis of the GH3 family in apple (Malus × domestica).

    PubMed

    Yuan, Huazhao; Zhao, Kai; Lei, Hengjiu; Shen, Xinjie; Liu, Yun; Liao, Xiong; Li, Tianhong

    2013-05-02

    Auxin plays important roles in hormone crosstalk and the plant's stress response. The auxin-responsive Gretchen Hagen3 (GH3) gene family maintains hormonal homeostasis by conjugating excess indole-3-acetic acid (IAA), salicylic acid (SA), and jasmonic acids (JAs) to amino acids during hormone- and stress-related signaling pathways. With the sequencing of the apple (Malus × domestica) genome completed, it is possible to carry out genomic studies on GH3 genes to indentify candidates with roles in abiotic/biotic stress responses. Malus sieversii Roem., an apple rootstock with strong drought tolerance and the ancestral species of cultivated apple species, was used as the experimental material. Following genome-wide computational and experimental identification of MdGH3 genes, we showed that MdGH3s were differentially expressed in the leaves and roots of M. sieversii and that some of these genes were significantly induced after various phytohormone and abiotic stress treatments. Given the role of GH3 in the negative feedback regulation of free IAA concentration, we examined whether phytohormones and abiotic stresses could alter the endogenous auxin level. By analyzing the GUS activity of DR5::GUS-transformed Arabidopsis seedlings, we showed that ABA, SA, salt, and cold treatments suppressed the auxin response. These findings suggest that other phytohormones and abiotic stress factors might alter endogenous auxin levels. Previous studies showed that GH3 genes regulate hormonal homeostasis. Our study indicated that some GH3 genes were significantly induced in M. sieversii after various phytohormone and abiotic stress treatments, and that ABA, SA, salt, and cold treatments reduce the endogenous level of axuin. Taken together, this study provides evidence that GH3 genes play important roles in the crosstalk between auxin, other phytohormones, and the abiotic stress response by maintaining auxin homeostasis.

  19. Genome-wide variant analysis of simplex autism families with an integrative clinical-bioinformatics pipeline

    PubMed Central

    Jiménez-Barrón, Laura T.; O'Rawe, Jason A.; Wu, Yiyang; Yoon, Margaret; Fang, Han; Iossifov, Ivan; Lyon, Gholson J.

    2015-01-01

    Autism spectrum disorders (ASDs) are a group of developmental disabilities that affect social interaction and communication and are characterized by repetitive behaviors. There is now a large body of evidence that suggests a complex role of genetics in ASDs, in which many different loci are involved. Although many current population-scale genomic studies have been demonstrably fruitful, these studies generally focus on analyzing a limited part of the genome or use a limited set of bioinformatics tools. These limitations preclude the analysis of genome-wide perturbations that may contribute to the development and severity of ASD-related phenotypes. To overcome these limitations, we have developed and utilized an integrative clinical and bioinformatics pipeline for generating a more complete and reliable set of genomic variants for downstream analyses. Our study focuses on the analysis of three simplex autism families consisting of one affected child, unaffected parents, and one unaffected sibling. All members were clinically evaluated and widely phenotyped. Genotyping arrays and whole-genome sequencing were performed on each member, and the resulting sequencing data were analyzed using a variety of available bioinformatics tools. We searched for rare variants of putative functional impact that were found to be segregating according to de novo, autosomal recessive, X-linked, mitochondrial, and compound heterozygote transmission models. The resulting candidate variants included three small heterozygous copy-number variations (CNVs), a rare heterozygous de novo nonsense mutation in MYBBP1A located within exon 1, and a novel de novo missense variant in LAMB3. Our work demonstrates how more comprehensive analyses that include rich clinical data and whole-genome sequencing data can generate reliable results for use in downstream investigations. PMID:27148569

  20. Genome-wide linkage analysis for uric acid in families enriched for hypertension

    PubMed Central

    Rule, Andrew D.; Fridley, Brooke L.; Hunt, Steven C.; Asmann, Yan; Boerwinkle, Eric; Pankow, James S.; Mosley, Thomas H.; Turner, Stephen T.

    2009-01-01

    Background. Uric acid is heritable and associated with hypertension and insulin resistance. We sought to identify genomic regions influencing serum uric acid in families in which two or more siblings had hypertension. Methods. Uric acid levels and microsatellite markers were assayed in the Genetic Epidemiology Network of Arteriopathy (GENOA) cohort (1075 whites and 1333 blacks) and the Hypertension Genetic Epidemiology Network (HyperGEN) cohort (1542 whites and 1627 blacks). Genome-wide linkage analyses of uric acid and bivariate linkage analyses of uric acid with an additional surrogate of insulin resistance were completed. Pathway analysis explored gene sets enriched at loci influencing uric acid. Results. In the GENOA white cohort, loci influencing uric acid were identified on chromosome 8 at 135 cM [multipoint logarithm of odds score (MLS) = 2.4], on chromosome 9 at 113 cM (MLS = 3.7) and on chromosome 16 at 93 cM (MLS = 2.3), but did not replicate in HyperGEN. At these loci, there was evidence of pleiotropy with other surrogates of insulin resistance and genes in the fructose and mannose metabolism pathway were enriched. In the HyperGEN-black cohort, there was some evidence of a locus for uric acid on chromosome 4 at 135 cM (MLS = 2.4) that had modest replication in GENOA (MLS = 1.2). Conclusions. Several novel loci linked to uric acid were identified but none showed clear replication. Widespread diuretic use, a medication that raises uric acid levels, was an important study limitation. Bivariate linkage analyses and pathway analysis were consistent with genes regulating insulin resistance and fructose metabolism contributing to the heritability of uric acid. PMID:19258383

  1. Genome-wide identification and analysis of the MADS-box gene family in sesame.

    PubMed

    Wei, Xin; Wang, Linhai; Yu, Jingyin; Zhang, Yanxin; Li, Donghua; Zhang, Xiurong

    2015-09-10

    MADS-box genes encode transcription factors that play crucial roles in plant growth and development. Sesame (Sesamum indicum L.) is an oil crop that contributes to the daily oil and protein requirements of almost half of the world's population; therefore, a genome-wide analysis of the MADS-box gene family is needed. Fifty-seven MADS-box genes were identified from 14 linkage groups of the sesame genome. Analysis of phylogenetic relationships with Arabidopsis thaliana, Utricularia gibba and Solanum lycopersicum MADS-box genes was performed. Sesame MADS-box genes were clustered into four groups: 28 MIKC(c)-type, 5 MIKC(⁎)-type, 14 Mα-type and 10 Mγ-type. Gene structure analysis revealed from 1 to 22 exons of sesame MADS-box genes. The number of exons in type II MADS-box genes greatly exceeded the number in type I genes. Motif distribution analysis of sesame MADS-box genes also indicated that type II MADS-box genes contained more motifs than type I genes. These results suggested that type II sesame MADS-box genes had more complex structures. By analyzing expression profiles of MADS-box genes in seven sesame transcriptomes, we determined that MIKC(C)-type MADS-box genes played significant roles in sesame flower and seed development. Although most MADS-box genes in the same clade showed similar expression features, some gene functions were diversified from the orthologous Arabidopsis genes. This research will contribute to uncovering the role of MADS-box genes in sesame development.

  2. Motor sequencing deficit as an endophenotype of speech sound disorder: A genome-wide linkage analysis in a multigenerational family

    PubMed Central

    Peter, Beate; Matsushita, Mark; Raskind, Wendy H.

    2012-01-01

    Objectives The purpose of this pilot study was to investigate a measure of motor sequencing deficit as a potential endophenotype of speech sound disorder (SSD) in a multigenerational family with evidence of familial SSD. Methods In a multigenerational family with evidence of a familial motor-based SSD, affectation status and a measure of motor sequencing during oral motor testing were obtained. To further investigate the role of motor sequencing as an endophenotype for genetic studies, parametric and nonparametric linkage analyses were conducted using a genome-wide panel of 404 microsatellites. Results In seven of the ten family members with available data, SSD affectation status and motor sequencing status coincided. Linkage analysis revealed four regions of interest, 6p21, 7q32, 7q36, and 8q24, primarily identified with the measure of motor sequencing ability. The 6p21 region overlaps with a locus implicated in rapid alternating naming in a recent genome-wide dyslexia linkage study. The 7q32 locus contains a locus implicated in dyslexia. The 7q36 locus borders on a gene known to affect component traits of language impairment. Conclusions Results are consistent with a motor-based endophenotype of SSD that would be informative for genetic studies. The linkage results in this first genome-wide study in a multigenerational family with SSD warrant follow-up in additional families and with fine mapping or next-generation approaches to gene identification. PMID:22517379

  3. Genome-Wide Analysis and Characterization of Aux/IAA Family Genes in Brassica rapa

    PubMed Central

    Rameneni, Jana Jeevan; Li, Xiaonan; Sivanandhan, Ganesan; Choi, Su Ryun; Pang, Wenxing; Im, Subin; Lim, Yong Pyo

    2016-01-01

    Auxins are the key players in plant growth development involving leaf formation, phototropism, root, fruit and embryo development. Auxin/Indole-3-Acetic Acid (Aux/IAA) are early auxin response genes noted as transcriptional repressors in plant auxin signaling. However, many studies focus on Aux/ARF gene families and much less is known about the Aux/IAA gene family in Brassica rapa (B. rapa). Here we performed a comprehensive genome-wide analysis and identified 55 Aux/IAA genes in B. rapa using four conserved motifs of Aux/IAA family (PF02309). Chromosomal mapping of the B. rapa Aux/IAA (BrIAA) genes facilitated understanding cluster rearrangement of the crucifer building blocks in the genome. Phylogenetic analysis of BrIAA with Arabidopsis thaliana, Oryza sativa and Zea mays identified 51 sister pairs including 15 same species (BrIAA—BrIAA) and 36 cross species (BrIAA—AtIAA) IAA genes. Among the 55 BrIAA genes, expression of 43 and 45 genes were verified using Genebank B. rapa ESTs and in home developed microarray data from mature leaves of Chiifu and RcBr lines. Despite their huge morphological difference, tissue specific expression analysis of BrIAA genes between the parental lines Chiifu and RcBr showed that the genes followed a similar pattern of expression during leaf development and a different pattern during bud, flower and siliqua development stages. The response of the BrIAA genes to abiotic and auxin stress at different time intervals revealed their involvement in stress response. Single Nucleotide Polymorphisms between IAA genes of reference genome Chiifu and RcBr were focused and identified. Our study examines the scope of conservation and divergence of Aux/IAA genes and their structures in B. rapa. Analyzing the expression and structural variation between two parental lines will significantly contribute to functional genomics of Brassica crops and we belive our study would provide a foundation in understanding the Aux/IAA genes in B. rapa. PMID

  4. Genome-Wide Analysis and Characterization of Aux/IAA Family Genes in Brassica rapa.

    PubMed

    Paul, Parameswari; Dhandapani, Vignesh; Rameneni, Jana Jeevan; Li, Xiaonan; Sivanandhan, Ganesan; Choi, Su Ryun; Pang, Wenxing; Im, Subin; Lim, Yong Pyo

    2016-01-01

    Auxins are the key players in plant growth development involving leaf formation, phototropism, root, fruit and embryo development. Auxin/Indole-3-Acetic Acid (Aux/IAA) are early auxin response genes noted as transcriptional repressors in plant auxin signaling. However, many studies focus on Aux/ARF gene families and much less is known about the Aux/IAA gene family in Brassica rapa (B. rapa). Here we performed a comprehensive genome-wide analysis and identified 55 Aux/IAA genes in B. rapa using four conserved motifs of Aux/IAA family (PF02309). Chromosomal mapping of the B. rapa Aux/IAA (BrIAA) genes facilitated understanding cluster rearrangement of the crucifer building blocks in the genome. Phylogenetic analysis of BrIAA with Arabidopsis thaliana, Oryza sativa and Zea mays identified 51 sister pairs including 15 same species (BrIAA-BrIAA) and 36 cross species (BrIAA-AtIAA) IAA genes. Among the 55 BrIAA genes, expression of 43 and 45 genes were verified using Genebank B. rapa ESTs and in home developed microarray data from mature leaves of Chiifu and RcBr lines. Despite their huge morphological difference, tissue specific expression analysis of BrIAA genes between the parental lines Chiifu and RcBr showed that the genes followed a similar pattern of expression during leaf development and a different pattern during bud, flower and siliqua development stages. The response of the BrIAA genes to abiotic and auxin stress at different time intervals revealed their involvement in stress response. Single Nucleotide Polymorphisms between IAA genes of reference genome Chiifu and RcBr were focused and identified. Our study examines the scope of conservation and divergence of Aux/IAA genes and their structures in B. rapa. Analyzing the expression and structural variation between two parental lines will significantly contribute to functional genomics of Brassica crops and we belive our study would provide a foundation in understanding the Aux/IAA genes in B. rapa.

  5. ChIP on chip and ChIP-Seq assays: genome-wide analysis of transcription factor binding and histone modifications.

    PubMed

    Pillai, Smitha; Chellappan, Srikumar P

    2015-01-01

    Deregulation of transcriptional activity of many genes has been causatively linked to human diseases including cancer. Altered patterns of gene expression in normal and cancer cells are the result of inappropriate expression of transcription factors and chromatin modifying proteins. Chromatin immunoprecipitation assay is a well-established tool for investigating the interactions between regulatory proteins and DNA at distinct stages of gene activation. ChIP coupled with DNA microarrays, known as ChIP on chip, or sequencing of DNA associated with the factors (ChIP-Seq) allow us to determine the entire spectrum of in vivo DNA binding sites for a given protein. This has been of immense value because ChIP on chip assays and ChIP-Seq experiments can provide a snapshot of the transcriptional regulatory mechanisms on a genome-wide scale. This chapter outlines the general strategies used to carry out ChIP-chip assays to study the differential recruitment of regulatory molecules based on the studies conducted in our lab as well as other published protocols; these can be easily modified to a ChIP-Seq analysis.

  6. Genome-wide association study reveals greater polygenic loading for schizophrenia in cases with a family history of illness.

    PubMed

    Bigdeli, Tim B; Ripke, Stephan; Bacanu, Silviu-Alin; Lee, Sang Hong; Wray, Naomi R; Gejman, Pablo V; Rietschel, Marcella; Cichon, Sven; St Clair, David; Corvin, Aiden; Kirov, George; McQuillin, Andrew; Gurling, Hugh; Rujescu, Dan; Andreassen, Ole A; Werge, Thomas; Blackwood, Douglas H R; Pato, Carlos N; Pato, Michele T; Malhotra, Anil K; O'Donovan, Michael C; Kendler, Kenneth S; Fanous, Ayman H

    2016-03-01

    Genome-wide association studies (GWAS) of schizophrenia have yielded more than 100 common susceptibility variants, and strongly support a substantial polygenic contribution of a large number of small allelic effects. It has been hypothesized that familial schizophrenia is largely a consequence of inherited rather than environmental factors. We investigated the extent to which familiality of schizophrenia is associated with enrichment for common risk variants detectable in a large GWAS. We analyzed single nucleotide polymorphism (SNP) data for cases reporting a family history of psychotic illness (N = 978), cases reporting no such family history (N = 4,503), and unscreened controls (N = 8,285) from the Psychiatric Genomics Consortium (PGC1) study of schizophrenia. We used a multinomial logistic regression approach with model-fitting to detect allelic effects specific to either family history subgroup. We also considered a polygenic model, in which we tested whether family history positive subjects carried more schizophrenia risk alleles than family history negative subjects, on average. Several individual SNPs attained suggestive but not genome-wide significant association with either family history subgroup. Comparison of genome-wide polygenic risk scores based on GWAS summary statistics indicated a significant enrichment for SNP effects among family history positive compared to family history negative cases (Nagelkerke's R(2 ) = 0.0021; P = 0.00331; P-value threshold <0.4). Estimates of variability in disease liability attributable to the aggregate effect of genome-wide SNPs were significantly greater for family history positive compared to family history negative cases (0.32 and 0.22, respectively; P = 0.031). We found suggestive evidence of allelic effects detectable in large GWAS of schizophrenia that might be specific to particular family history subgroups. However, consideration of a polygenic risk score indicated a significant

  7. Genome-wide linkage scan identifies two novel genetic loci for coronary artery disease: in GeneQuest families.

    PubMed

    Gao, Hanxiang; Li, Lin; Rao, Shaoqi; Shen, Gongqing; Xi, Quansheng; Chen, Shenghan; Zhang, Zheng; Wang, Kai; Ellis, Stephen G; Chen, Qiuyun; Topol, Eric J; Wang, Qing K

    2014-01-01

    Coronary artery disease (CAD) is the leading cause of death worldwide. Recent genome-wide association studies (GWAS) identified >50 common variants associated with CAD or its complication myocardial infarction (MI), but collectively they account for <20% of heritability, generating a phenomena of "missing heritability". Rare variants with large effects may account for a large portion of missing heritability. Genome-wide linkage studies of large families and follow-up fine mapping and deep sequencing are particularly effective in identifying rare variants with large effects. Here we show results from a genome-wide linkage scan for CAD in multiplex GeneQuest families with early onset CAD and MI. Whole genome genotyping was carried out with 408 markers that span the human genome by every 10 cM and linkage analyses were performed using the affected relative pair analysis implemented in GENEHUNTER. Affected only nonparametric linkage (NPL) analysis identified two novel CAD loci with highly significant evidence of linkage on chromosome 3p25.1 (peak NPL  = 5.49) and 3q29 (NPL  = 6.84). We also identified four loci with suggestive linkage on 9q22.33, 9q34.11, 17p12, and 21q22.3 (NPL  = 3.18-4.07). These results identify novel loci for CAD and provide a framework for fine mapping and deep sequencing to identify new susceptibility genes and novel variants associated with risk of CAD.

  8. Comparison of methods to account for relatedness in genome-wide association studies with family-based data.

    PubMed

    Eu-Ahsunthornwattana, Jakris; Miller, E Nancy; Fakiola, Michaela; Jeronimo, Selma M B; Blackwell, Jenefer M; Cordell, Heather J

    2014-07-01

    Approaches based on linear mixed models (LMMs) have recently gained popularity for modelling population substructure and relatedness in genome-wide association studies. In the last few years, a bewildering variety of different LMM methods/software packages have been developed, but it is not always clear how (or indeed whether) any newly-proposed method differs from previously-proposed implementations. Here we compare the performance of several LMM approaches (and software implementations, including EMMAX, GenABEL, FaST-LMM, Mendel, GEMMA and MMM) via their application to a genome-wide association study of visceral leishmaniasis in 348 Brazilian families comprising 3626 individuals (1972 genotyped). The implementations differ in precise details of methodology implemented and through various user-chosen options such as the method and number of SNPs used to estimate the kinship (relatedness) matrix. We investigate sensitivity to these choices and the success (or otherwise) of the approaches in controlling the overall genome-wide error-rate for both real and simulated phenotypes. We compare the LMM results to those obtained using traditional family-based association tests (based on transmission of alleles within pedigrees) and to alternative approaches implemented in the software packages MQLS, ROADTRIPS and MASTOR. We find strong concordance between the results from different LMM approaches, and all are successful in controlling the genome-wide error rate (except for some approaches when applied naively to longitudinal data with many repeated measures). We also find high correlation between LMMs and alternative approaches (apart from transmission-based approaches when applied to SNPs with small or non-existent effects). We conclude that LMM approaches perform well in comparison to competing approaches. Given their strong concordance, in most applications, the choice of precise LMM implementation cannot be based on power/type I error considerations but must instead be

  9. Comparison of Methods to Account for Relatedness in Genome-Wide Association Studies with Family-Based Data

    PubMed Central

    Eu-ahsunthornwattana, Jakris; Miller, E. Nancy; Fakiola, Michaela; Jeronimo, Selma M. B.; Blackwell, Jenefer M.; Cordell, Heather J.

    2014-01-01

    Approaches based on linear mixed models (LMMs) have recently gained popularity for modelling population substructure and relatedness in genome-wide association studies. In the last few years, a bewildering variety of different LMM methods/software packages have been developed, but it is not always clear how (or indeed whether) any newly-proposed method differs from previously-proposed implementations. Here we compare the performance of several LMM approaches (and software implementations, including EMMAX, GenABEL, FaST-LMM, Mendel, GEMMA and MMM) via their application to a genome-wide association study of visceral leishmaniasis in 348 Brazilian families comprising 3626 individuals (1972 genotyped). The implementations differ in precise details of methodology implemented and through various user-chosen options such as the method and number of SNPs used to estimate the kinship (relatedness) matrix. We investigate sensitivity to these choices and the success (or otherwise) of the approaches in controlling the overall genome-wide error-rate for both real and simulated phenotypes. We compare the LMM results to those obtained using traditional family-based association tests (based on transmission of alleles within pedigrees) and to alternative approaches implemented in the software packages MQLS, ROADTRIPS and MASTOR. We find strong concordance between the results from different LMM approaches, and all are successful in controlling the genome-wide error rate (except for some approaches when applied naively to longitudinal data with many repeated measures). We also find high correlation between LMMs and alternative approaches (apart from transmission-based approaches when applied to SNPs with small or non-existent effects). We conclude that LMM approaches perform well in comparison to competing approaches. Given their strong concordance, in most applications, the choice of precise LMM implementation cannot be based on power/type I error considerations but must instead be

  10. Genome-wide identification, characterization, and expression profile of aquaporin gene family in flax (Linum usitatissimum)

    PubMed Central

    Shivaraj, S. M.; Deshmukh, Rupesh K.; Rai, Rhitu; Bélanger, Richard; Agrawal, Pawan K.; Dash, Prasanta K.

    2017-01-01

    Membrane intrinsic proteins (MIPs) form transmembrane channels and facilitate transport of myriad substrates across the cell membrane in many organisms. Majority of plant MIPs have water transporting ability and are commonly referred as aquaporins (AQPs). In the present study, we identified aquaporin coding genes in flax by genome-wide analysis, their structure, function and expression pattern by pan-genome exploration. Cross-genera phylogenetic analysis with known aquaporins from rice, arabidopsis, and poplar showed five subgroups of flax aquaporins representing 16 plasma membrane intrinsic proteins (PIPs), 17 tonoplast intrinsic proteins (TIPs), 13 NOD26-like intrinsic proteins (NIPs), 2 small basic intrinsic proteins (SIPs), and 3 uncharacterized intrinsic proteins (XIPs). Amongst aquaporins, PIPs contained hydrophilic aromatic arginine (ar/R) selective filter but TIP, NIP, SIP and XIP subfamilies mostly contained hydrophobic ar/R selective filter. Analysis of RNA-seq and microarray data revealed high expression of PIPs in multiple tissues, low expression of NIPs, and seed specific expression of TIP3 in flax. Exploration of aquaporin homologs in three closely related Linum species bienne, grandiflorum and leonii revealed presence of 49, 39 and 19 AQPs, respectively. The genome-wide identification of aquaporins, first in flax, provides insight to elucidate their physiological and developmental roles in flax. PMID:28447607

  11. Genome-wide identification, isolation and expression analysis of auxin response factor (ARF) gene family in sweet orange (Citrus sinensis)

    PubMed Central

    Li, Si-Bei; OuYang, Wei-Zhi; Hou, Xiao-Jin; Xie, Liang-Liang; Hu, Chun-Gen; Zhang, Jin-Zhi

    2015-01-01

    Auxin response factors (ARFs) are an important family of proteins in auxin-mediated response, with key roles in various physiological and biochemical processes. To date, a genome-wide overview of the ARF gene family in citrus was not available. A systematic analysis of this gene family in citrus was begun by carrying out a genome-wide search for the homologs of ARFs. A total of 19 nonredundant ARF genes (CiARF) were found and validated from the sweet orange. A comprehensive overview of the CiARFs was undertaken, including the gene structures, phylogenetic analysis, chromosome locations, conserved motifs of proteins, and cis-elements in promoters of CiARF. Furthermore, expression profiling using real-time PCR revealed many CiARF genes, albeit with different patterns depending on types of tissues and/or developmental stages. Comprehensive expression analysis of these genes was also performed under two hormone treatments using real-time PCR. Indole-3-acetic acid (IAA) and N-1-napthylphthalamic acid (NPA) treatment experiments revealed differential up-regulation and down-regulation, respectively, of the 19 citrus ARF genes in the callus of sweet orange. Our comprehensive analysis of ARF genes further elucidates the roles of CiARF family members during citrus growth and development process. PMID:25870601

  12. A high-density SNP genome-wide linkage search of 206 families identifies susceptibility loci for chronic lymphocytic leukemia

    PubMed Central

    Sellick, Gabrielle S.; Goldin, Lynn R.; Wild, Ruth W.; Slager, Susan L.; Ressenti, Laura; Strom, Sara S.; Dyer, Martin J. S.; Mauro, Francesca R.; Marti, Gerald E.; Fuller, Stephen; Lyttelton, Matthew; Kipps, Thomas J.; Keating, Michael J.; Call, Timothy G.; Catovsky, Daniel

    2007-01-01

    Chronic lymphocytic leukemia (CLL) and other B-cell lymphoproliferative disorders display familial aggregation. To identify a susceptibility gene for CLL, we assembled families from the major European (ICLLC) and American (GEC) consortia to conduct a genome-wide linkage analysis of 101 new CLL pedigrees using a high-density single nucleotide polymorphism (SNP) array and combined the results with data from our previously reported analysis of 105 families. Here, we report on the combined analysis of the 206 families. Multipoint linkage analyses were undertaken using both nonparametric (model-free) and parametric (model-based) methods. After the removal of high linkage disequilibrium SNPs, we obtained a maximum nonparametric linkage (NPL) score of 3.02 (P = .001) on chromosome 2q21.2. The same genomic position also yielded the highest multipoint heterogeneity LOD (HLOD) score under a common recessive model of disease susceptibility (HLOD = 3.11; P = 7.7 × 10−5), which was significant at the genome-wide level. In addition, 2 other chromosomal positions, 6p22.1 (corresponding to the major histocompatibility locus) and 18q21.1, displayed HLOD scores higher than 2.1 (P < .002). None of the regions coincided with areas of common chromosomal abnormalities frequently observed in CLL. These findings provide direct evidence for Mendelian predisposition to CLL and evidence for the location of disease loci. PMID:17687107

  13. Genome-wide identification, isolation and expression analysis of auxin response factor (ARF) gene family in sweet orange (Citrus sinensis).

    PubMed

    Li, Si-Bei; OuYang, Wei-Zhi; Hou, Xiao-Jin; Xie, Liang-Liang; Hu, Chun-Gen; Zhang, Jin-Zhi

    2015-01-01

    Auxin response factors (ARFs) are an important family of proteins in auxin-mediated response, with key roles in various physiological and biochemical processes. To date, a genome-wide overview of the ARF gene family in citrus was not available. A systematic analysis of this gene family in citrus was begun by carrying out a genome-wide search for the homologs of ARFs. A total of 19 nonredundant ARF genes (CiARF) were found and validated from the sweet orange. A comprehensive overview of the CiARFs was undertaken, including the gene structures, phylogenetic analysis, chromosome locations, conserved motifs of proteins, and cis-elements in promoters of CiARF. Furthermore, expression profiling using real-time PCR revealed many CiARF genes, albeit with different patterns depending on types of tissues and/or developmental stages. Comprehensive expression analysis of these genes was also performed under two hormone treatments using real-time PCR. Indole-3-acetic acid (IAA) and N-1-napthylphthalamic acid (NPA) treatment experiments revealed differential up-regulation and down-regulation, respectively, of the 19 citrus ARF genes in the callus of sweet orange. Our comprehensive analysis of ARF genes further elucidates the roles of CiARF family members during citrus growth and development process.

  14. Genome-Wide Analysis of the Musa WRKY Gene Family: Evolution and Differential Expression during Development and Stress

    PubMed Central

    Goel, Ridhi; Pandey, Ashutosh; Trivedi, Prabodh K.; Asif, Mehar H.

    2016-01-01

    The WRKY gene family plays an important role in the development and stress responses in plants. As information is not available on the WRKY gene family in Musa species, genome-wide analysis has been carried out in this study using available genomic information from two species, Musa acuminata and Musa balbisiana. Analysis identified 147 and 132 members of the WRKY gene family in M. acuminata and M. balbisiana, respectively. Evolutionary analysis suggests that the WRKY gene family expanded much before the speciation in both the species. Most of the orthologs retained in two species were from the γ duplication event which occurred prior to α and β genome-wide duplication (GWD) events. Analysis also suggests that subtle changes in nucleotide sequences during the course of evolution have led to the development of new motifs which might be involved in neo-functionalization of different WRKY members in two species. Expression and cis-regulatory motif analysis suggest possible involvement of Group II and Group III WRKY members during various stresses and growth/development including fruit ripening process respectively. PMID:27014321

  15. Genome-wide Study of Families with Absolute Pitch Reveals Linkage to 8q24.21 and Locus Heterogeneity

    PubMed Central

    Theusch, Elizabeth; Basu, Analabha; Gitschier, Jane

    2009-01-01

    Absolute pitch (AP) is the rare ability to instantaneously recognize and label tones with their musical note names without using a reference pitch for comparison. The etiology of AP is complex. Prior studies have implicated both genetic and environmental factors in its genesis, yet the molecular basis for AP remains unknown. To locate regions of the human genome that may harbor AP-predisposing genetic variants, we performed a genome-wide linkage study on 73 multiplex AP families by genotyping them with 6090 SNP markers. Nonparametric multipoint linkage analyses were conducted, and the strongest evidence for linkage was observed on chromosome 8q24.21 in the subset of 45 families with European ancestry (exponential LOD score = 3.464, empirical genome-wide p = 0.03). Other regions with suggestive LOD scores included chromosomes 7q22.3, 8q21.11, and 9p21.3. Of these four regions, only the 7q22.3 linkage peak was also evident when 19 families with East Asian ancestry were analyzed separately. Though only one of these regions has yet reached statistical significance individually, we detected a larger number of independent linkage peaks than expected by chance overall, indicating that AP is genetically heterogeneous. PMID:19576568

  16. Genome-Wide Analyses of the Soybean F-Box Gene Family in Response to Salt Stress.

    PubMed

    Jia, Qi; Xiao, Zhi-Xia; Wong, Fuk-Ling; Sun, Song; Liang, Kang-Jing; Lam, Hon-Ming

    2017-04-12

    The F-box family is one of the largest gene families in plants that regulate diverse life processes, including salt responses. However, the knowledge of the soybean F-box genes and their roles in salt tolerance remains limited. Here, we conducted a genome-wide survey of the soybean F-box family, and their expression analysis in response to salinity via in silico analysis of online RNA-sequencing (RNA-seq) data and quantitative reverse-transcription polymerase chain reaction (qRT-PCR) to predict their potential functions. A total of 725 potential F-box proteins encoded by 509 genes were identified and classified into 9 subfamilies. The gene structures, conserved domains and chromosomal distributions were characterized. There are 76 pairs of duplicate genes identified, including genome-wide segmental and tandem duplication events, which lead to the expansion of the number of F-box genes. The in silico expression analysis showed that these genes would be involved in diverse developmental functions and play an important role in salt response. Our qRT-PCR analysis confirmed 12 salt-responding F-box genes. Overall, our results provide useful information on soybean F-box genes, especially their potential roles in salt tolerance.

  17. Genome-Wide Linkage Scan Identifies Two Novel Genetic Loci for Coronary Artery Disease: In GeneQuest Families

    PubMed Central

    Shen, Gongqing; Xi, Quansheng; Chen, Shenghan; Zhang, Zheng; Wang, Kai; Ellis, Stephen G.; Chen, Qiuyun; Topol, Eric J.; Wang, Qing K.

    2014-01-01

    Coronary artery disease (CAD) is the leading cause of death worldwide. Recent genome-wide association studies (GWAS) identified >50 common variants associated with CAD or its complication myocardial infarction (MI), but collectively they account for <20% of heritability, generating a phenomena of “missing heritability”. Rare variants with large effects may account for a large portion of missing heritability. Genome-wide linkage studies of large families and follow-up fine mapping and deep sequencing are particularly effective in identifying rare variants with large effects. Here we show results from a genome-wide linkage scan for CAD in multiplex GeneQuest families with early onset CAD and MI. Whole genome genotyping was carried out with 408 markers that span the human genome by every 10 cM and linkage analyses were performed using the affected relative pair analysis implemented in GENEHUNTER. Affected only nonparametric linkage (NPL) analysis identified two novel CAD loci with highly significant evidence of linkage on chromosome 3p25.1 (peak NPL  = 5.49) and 3q29 (NPL  = 6.84). We also identified four loci with suggestive linkage on 9q22.33, 9q34.11, 17p12, and 21q22.3 (NPL  = 3.18–4.07). These results identify novel loci for CAD and provide a framework for fine mapping and deep sequencing to identify new susceptibility genes and novel variants associated with risk of CAD. PMID:25485937

  18. Genome-Wide Dissection of the Heat Shock Transcription Factor Family Genes in Arachis

    PubMed Central

    Wang, Pengfei; Song, Hui; Li, Changsheng; Li, Pengcheng; Li, Aiqin; Guan, Hongshan; Hou, Lei; Wang, Xingjun

    2017-01-01

    Heat shock transcription factors (Hsfs) are important transcription factors (TFs) in protecting plants from damages caused by various stresses. The released whole genome sequences of wild peanuts make it possible for genome-wide analysis of Hsfs in peanut. In this study, a total of 16 and 17 Hsf genes were identified from Arachis duranensis and A. ipaensis, respectively. We identified 16 orthologous Hsf gene pairs in both peanut species; however HsfXs was only identified from A. ipaensis. Orthologous pairs between two wild peanut species were highly syntenic. Based on phylogenetic relationship, peanut Hsfs were divided into groups A, B, and C. Selection pressure analysis showed that group B Hsf genes mainly underwent positive selection and group A Hsfs were affected by purifying selection. Small scale segmental and tandem duplication may play important roles in the evolution of these genes. Cis-elements, such as ABRE, DRE, and HSE, were found in the promoters of most Arachis Hsf genes. Five AdHsfs and two AiHsfs contained fungal elicitor responsive elements suggesting their involvement in response to fungi infection. These genes were differentially expressed in cultivated peanut under abiotic stress and Aspergillus flavus infection. AhHsf2 and AhHsf14 were significantly up-regulated after inoculation with A. flavus suggesting their possible role in fungal resistance. PMID:28220134

  19. Genome-wide characterization of the SiDof gene family in foxtail millet (Setaria italica).

    PubMed

    Zhang, Li; Liu, Baoling; Zheng, Gewen; Zhang, Aiying; Li, Runzhi

    2017-01-01

    Dof (DNA binding with one finger) proteins, which constitute a class of transcription factors found exclusively in plants, are involved in numerous physiological and biochemical reactions affecting growth and development. A genome-wide analysis of SiDof genes was performed in this study. Thirty five SiDof genes were identified and those genes were unevenly distributed across nine chromosomes in the Seteria italica genome. Protein lengths, molecular weights, and theoretical isoelectric points of SiDofs all vary greatly. Gene structure analysis demonstrated that most SiDof genes lack introns. Phylogenetic analysis of SiDof proteins and Dof proteins from Arabidopsis thaliana, rice, sorghum, and Setaria viridis revealed six major groups. Analysis of RNA-Seq data indicated that SiDof gene expression levels varied across roots, stems, leaves, and spike. In addition, expression profiling of SiDof genes in response to stress suggested that SiDof 7 and SiDof 15 are involved in drought stress signalling. Overall, this study could provide novel information on SiDofs for further investigation in foxtail millet. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  20. Genome-wide identification and characterization of aquaporin gene family in moso bamboo (Phyllostachys edulis).

    PubMed

    Sun, Huayu; Li, Lichao; Lou, Yongfeng; Zhao, Hansheng; Gao, Zhimin

    2016-05-01

    Aquaporins (AQPs) are known to play a major role in maintaining water and hydraulic conductivity balance in the plant system. Numerous studies have showed AQPs execute multi-function throughout plant growth and development, including water transport, nitrogen, carbon, and micronutrient acquisition etc. However, little information on AQPs is known in bamboo. In this study, we present the first genome-wide identification and characterization of AQP genes in moso bamboo (Phyllostachys edulis) using bioinformatics. In total, 26 AQP genes were identified by homologous analysis, which were divided into four groups (PIPs, TIPs, NIPs, and SIPs) based on the phylogenetic analysis. All the genes were located on 26 different scaffolds respectively on basis of the gene mapped to bamboo genome. Evolutionary analysis indicated that Ph. edulis was more close to Oryza sativa than Zea mays in the genetic relationship. Besides, qRT-PCR was used to analyze gene expression profiles, which revealed that AQP genes were expressed constitutively in all the detected tissues, and were all responsive to the environmental cues such as drought, water, and NaCl stresses. This data suggested that AQPs may play fundamental roles in maintaining normal growth and development of bamboo, which would contribute to better understanding for the complex regulation mechanism involved in the fast-growing process of bamboo. Furthermore, the result could provide valuable information for further research on bamboo functional genomics.

  1. Genome-Wide Analysis of the Cyclin Gene Family in Tomato

    PubMed Central

    Zhang, Tingyan; Wang, Xin; Lu, Yongen; Cai, Xiaofeng; Ye, Zhibiao; Zhang, Junhong

    2014-01-01

    Cyclins play important roles in cell division and cell expansion. They also interact with cyclin-dependent kinases to control cell cycle progression in plants. Our genome-wide analysis identified 52 expressed cyclin genes in tomato. Phylogenetic analysis of the deduced amino sequences of tomato and Arabidopsis cyclin genes divided them into 10 types, A-, B-, C-, D-, H-, L-, T-, U-, SDS- and J18. Pfam analysis indicated that most tomato cyclins contain a cyclin-N domain. C-, H- and J18 types only contain a cyclin-C domain, and U-type cyclins contain another potential cyclin domain. All of the cyclin genes are distributed throughout the tomato genome except for chromosome 8, and 30 of them were found to be segmentally duplicated; they are found on the duplicate segments of chromosome 1, 2, 3, 4, 5, 6, 10, 11 and 12, suggesting that tomato cyclin genes experienced a mass of segmental duplication. Quantitative real-time polymerase chain reaction analysis indicates that the expression patterns of tomato cyclin genes were significantly different in vegetative and reproductive stages. Transcription of most cyclin genes can be enhanced or repressed by exogenous application of gibberellin, which implies that gibberellin maybe a direct regulator of cyclin genes. The study presented here may be useful as a guide for further functional research on tomato cyclins. PMID:24366066

  2. Genome-wide identification and characterization of aquaporin gene family in Beta vulgaris

    PubMed Central

    Kong, Weilong; Yang, Shaozong; Wang, Yulu; Bendahmane, Mohammed

    2017-01-01

    Aquaporins (AQPs) are essential channel proteins that execute multi-functions throughout plant growth and development, including water transport, uncharged solutes uptake, stress response, and so on. Here, we report the first genome-wide identification and characterization AQP (BvAQP) genes in sugar beet (Beta vulgaris), an important crop widely cultivated for feed, for sugar production and for bioethanol production. Twenty-eight sugar beet AQPs (BvAQPs) were identified and assigned into five subfamilies based on phylogenetic analyses: seven of plasma membrane (PIPs), eight of tonoplast (TIPs), nine of NOD26-like (NIPs), three of small basic (SIPs), and one of x-intrinsic proteins (XIPs). BvAQP genes unevenly mapped on all chromosomes, except on chromosome 4. Gene structure and motifs analyses revealed that BvAQP have conserved exon-intron organization and that they exhibit conserved motifs within each subfamily. Prediction of BvAQPs functions, based on key protein domains conservation, showed a remarkable difference in substrate specificity among the five subfamilies. Analyses of BvAQPs expression, by mean of RNA-seq, in different plant organs and in response to various abiotic stresses revealed that they were ubiquitously expressed and that their expression was induced by heat and salt stresses. These results provide a reference base to address further the function of sugar beet aquaporins and to explore future applications for plants growth and development improvements as well as in response to environmental stresses. PMID:28948097

  3. Genome-Wide Dissection of the Heat Shock Transcription Factor Family Genes in Arachis.

    PubMed

    Wang, Pengfei; Song, Hui; Li, Changsheng; Li, Pengcheng; Li, Aiqin; Guan, Hongshan; Hou, Lei; Wang, Xingjun

    2017-01-01

    Heat shock transcription factors (Hsfs) are important transcription factors (TFs) in protecting plants from damages caused by various stresses. The released whole genome sequences of wild peanuts make it possible for genome-wide analysis of Hsfs in peanut. In this study, a total of 16 and 17 Hsf genes were identified from Arachis duranensis and A. ipaensis, respectively. We identified 16 orthologous Hsf gene pairs in both peanut species; however HsfXs was only identified from A. ipaensis. Orthologous pairs between two wild peanut species were highly syntenic. Based on phylogenetic relationship, peanut Hsfs were divided into groups A, B, and C. Selection pressure analysis showed that group B Hsf genes mainly underwent positive selection and group A Hsfs were affected by purifying selection. Small scale segmental and tandem duplication may play important roles in the evolution of these genes. Cis-elements, such as ABRE, DRE, and HSE, were found in the promoters of most Arachis Hsf genes. Five AdHsfs and two AiHsfs contained fungal elicitor responsive elements suggesting their involvement in response to fungi infection. These genes were differentially expressed in cultivated peanut under abiotic stress and Aspergillus flavus infection. AhHsf2 and AhHsf14 were significantly up-regulated after inoculation with A. flavus suggesting their possible role in fungal resistance.

  4. Genome-wide analysis of the DNA-binding with one zinc finger (Dof) transcription factor family in bananas.

    PubMed

    Dong, Chen; Hu, Huigang; Xie, Jianghui

    2016-12-01

    DNA-binding with one finger (Dof) domain proteins are a multigene family of plant-specific transcription factors involved in numerous aspects of plant growth and development. In this study, we report a genome-wide search for Musa acuminata Dof (MaDof) genes and their expression profiles at different developmental stages and in response to various abiotic stresses. In addition, a complete overview of the Dof gene family in bananas is presented, including the gene structures, chromosomal locations, cis-regulatory elements, conserved protein domains, and phylogenetic inferences. Based on the genome-wide analysis, we identified 74 full-length protein-coding MaDof genes unevenly distributed on 11 chromosomes. Phylogenetic analysis with Dof members from diverse plant species showed that MaDof genes can be classified into four subgroups (StDof I, II, III, and IV). The detailed genomic information of the MaDof gene homologs in the present study provides opportunities for functional analyses to unravel the exact role of the genes in plant growth and development.

  5. Family based genome-wide copy number scan identifies complex rearrangements at 17q21.31 in dyslexics.

    PubMed

    Veerappa, Avinash M; Saldanha, Marita; Padakannaya, Prakash; Ramachandra, Nallur B

    2014-10-01

    Developmental dyslexia (DD) is a complex heritable disorder with unexpected difficulty in learning to read and spell despite adequate intelligence, education, environment, and normal senses. We performed genome-wide screening for copy number variations (CNVs) in 10 large Indian dyslexic families using Affymetrix Genome-Wide Human SNP Array 6.0. Results revealed the complex genomic rearrangements due to one non-contiguous deletion and five contiguous micro duplications and micro deletions at 17q21.31 region in three dyslexic families. CNVs in this region harbor the genes KIAA1267, LRRC37A, ARL17A/B, NSFP1, and NSF. The CNVs in case 1 and case 2 at this locus were found to be in homozygous state and case 3 was a de novo CNV. These CNVs were found with at least one CNV having a common break and end points in the parents. This cluster of genes containing NSF is implicated in learning, cognition, and memory, though not formally associated with dyslexia. Molecular network analysis of these and other dyslexia related module genes suggests NSF and other genes to be associated with cellular/vesicular membrane fusion and synaptic transmission. Thus, we suggest that NSF in this cluster would be the nearest gene responsible for the learning disability phenotype.

  6. Replication of the Wellcome Trust genome-wide association study of essential hypertension: the Family Blood Pressure Program.

    PubMed

    Ehret, Georg B; Morrison, Alanna C; O'Connor, Ashley A; Grove, Megan L; Baird, Lisa; Schwander, Karen; Weder, Alan; Cooper, Richard S; Rao, D C; Hunt, Steven C; Boerwinkle, Eric; Chakravarti, Aravinda

    2008-12-01

    Essential hypertension is a principal cardiovascular risk factor whose origin remains unknown. Classical genetic studies have shown that blood pressure is at least partially heritable, opening a window to understanding the pathophysiology of essential hypertension in the human using modern genetic tools. The Wellcome Trust Case Control Consortium has recently published the results of screening the genomes of 2000 essential hypertension cases and 3000 controls using 500 000 genome-wide single nucleotide polymorphisms (SNPs). None of the variants proved to be genome-wide significant after correction for multiple tests but the most significantly associated SNPs (P<10(-5)) constitute a priority list that warrant follow-up in other studies. We describe here replication studies of the top six SNPs in subjects from the US National Heart, Lung, and Blood Institute funded Family Blood Pressure Program comprising 11 433 individuals recruited by hypertensive families. The results suggest that only one of the six SNPs might be associated with essential hypertension in Americans of European origin. This SNP shows a significant but opposite effect in Americans of Hispanic origin and no association in African Americans. The significance of the opposing effect estimates is unclear. No replication could be shown for hypertension status, but there are differences in study design. This attempted replication highlights that essential hypertension studies will require more comprehensive and larger genetic screens.

  7. Genome-wide association study of vitamin D concentrations in Hispanic Americans: The IRAS Family Study

    PubMed Central

    Engelman, Corinne D.; Meyers, Kristin J.; Ziegler, Julie T.; Taylor, Kent D.; Palmer, Nicholette D.; Haffner, Steven M.; Fingerlin, Tasha E.; Wagenknecht, Lynne E.; Rotter, Jerome I.; Bowden, Donald W.; Langefeld, Carl D.; Norris, Jill M.

    2010-01-01

    Vitamin D deficiency is associated with many adverse health outcomes. There are several well established environmental predictors of vitamin D concentrations, yet studies of the genetic determinants of vitamin D concentrations are in their infancy. Our objective was to conduct a pilot genome-wide association (GWA) study of 25-hydroxyvitamin D (25[OH]D) and 1,25-dihydroxyvitamin D (1,25[OH]2D) concentrations in a subset of 229 Hispanic subjects, followed by replication genotyping of 50 single nucleotide polymorphisms (SNPs) in the entire sample of 1,190 Hispanics from San Antonio, Texas and San Luis Valley, Colorado. Of the 309,200 SNPs that met all quality control criteria, three SNPs in high linkage disequilibrium (LD) with each other were significantly associated with 1,25[OH]2D (rs6680429, rs9970802, and rs10889028) at a Bonferroni corrected P-value threshold of 1.62 × 10−7, however none met the threshold for 25[OH]D. Of the 50 SNPs selected for replication genotyping, five for 25[OH]D (rs2806508, rs10141935, rs4778359, rs1507023, and rs9937918) and eight for 1,25[OH]2D (rs6680429, rs1348864, rs4559029, rs12667374, rs7781309, rs10505337, rs2486443, and rs2154175) were replicated in the entire sample of Hispanics (P < 0.01). In conclusion, we identified several SNPs that were associated with vitamin D metabolite concentrations in Hispanics. These candidate polymorphisms merit further investigation in independent populations and other ethnicities. PMID:20600896

  8. Genome-wide association study of vitamin D concentrations in Hispanic Americans: the IRAS family study.

    PubMed

    Engelman, Corinne D; Meyers, Kristin J; Ziegler, Julie T; Taylor, Kent D; Palmer, Nicholette D; Haffner, Steven M; Fingerlin, Tasha E; Wagenknecht, Lynne E; Rotter, Jerome I; Bowden, Donald W; Langefeld, Carl D; Norris, Jill M

    2010-10-01

    Vitamin D deficiency is associated with many adverse health outcomes. There are several well established environmental predictors of vitamin D concentrations, yet studies of the genetic determinants of vitamin D concentrations are in their infancy. Our objective was to conduct a pilot genome-wide association (GWA) study of 25-hydroxyvitamin D (25[OH]D) and 1,25-dihydroxyvitamin D (1,25[OH](2)D) concentrations in a subset of 229 Hispanic subjects, followed by replication genotyping of 50 single nucleotide polymorphisms (SNPs) in the entire sample of 1190 Hispanics from San Antonio, Texas and San Luis Valley, Colorado. Of the 309,200 SNPs that met all quality control criteria, three SNPs in high linkage disequilibrium (LD) with each other were significantly associated with 1,25[OH](2)D (rs6680429, rs9970802, and rs10889028) at a Bonferroni corrected P-value threshold of 1.62 × 10(-7), however none met the threshold for 25[OH]D. Of the 50 SNPs selected for replication genotyping, five for 25[OH]D (rs2806508, rs10141935, rs4778359, rs1507023, and rs9937918) and eight for 1,25[OH](2)D (rs6680429, rs1348864, rs4559029, rs12667374, rs7781309, rs10505337, rs2486443, and rs2154175) were replicated in the entire sample of Hispanics (P<0.01). In conclusion, we identified several SNPs that were associated with vitamin D metabolite concentrations in Hispanics. These candidate polymorphisms merit further investigation in independent populations and other ethnicities. Copyright © 2010 Elsevier Ltd. All rights reserved.

  9. Genome-Wide Identification and Expression Profile of Dof Transcription Factor Gene Family in Pepper (Capsicum annuum L.).

    PubMed

    Wu, Zhiming; Cheng, Jiaowen; Cui, Junjie; Xu, Xiaowan; Liang, Guansheng; Luo, Xirong; Chen, Xiaocui; Tang, Xiangqun; Hu, Kailin; Qin, Cheng

    2016-01-01

    Dof (DNA-binding One Zinc Finger) transcription factor family is unique to plants and has diverse roles associated with plant-specific phenomena, such as light, phytohormone and defense responses as well as seed development and germination. Although, genome-wide analysis of this family has been performed in many species, information regarding Dof genes in the pepper, Capsicum annuum L., is extremely limited. In this study, exhaustive searches of pepper genome revealed 33 potential CaDofs that were phylogenetically clustered into four subgroups. Twenty-nine of the 33 Dof genes could be mapped on 11 chromosomes, except for chromosome 7. The intron/exon organizations and conserved motif compositions of these genes were also analyzed. Additionally, phylogenetic analysis and classification of the Dof transcription factor family in eight plant species revealed that S. lycopersicum and C. annuum as well as O. sativa and S. bicolor Dof proteins may have evolved conservatively. Moreover, comprehensive expression analysis of CaDofs using a RNA-seq atlas and quantitative real-time polymerase chain reaction (qRT-PCR) revealed that these genes exhibit a variety of expression patterns. Most of the CaDofs were expressed in at least one of the tissues tested, whereas several genes were identified as being highly responsive to heat and salt stresses. Overall, this study describes the first genome-wide analysis of the pepper Dof family, whose genes exhibited different expression patterns in all primary fruit developmental stages and tissue types, as in response to abiotic stress. In particular, some Dof genes might be used as biomarkers for heat and salt stress. The results could expand our understanding of the roles of Dof genes in pepper.

  10. The Use of Multiplicity Corrections, Order Statistics and Generalized Family-Wise Statistics with Application to Genome-Wide Studies

    PubMed Central

    Schrodi, Steven J.

    2016-01-01

    The most important decision faced by large-scale studies, such as those presently encountered in human genetics, is to distinguish between those tests that are true positives from those that are not. In the context of genetics, this entails the determination of genetic markers that actually underlie medically-relevant phenotypes from a vast number of makers typically interrogated in genome-wide studies. A critical part of these decisions relies on the appropriate statistical assessment of data obtained from tests across numerous markers. Several methods have been developed to aid with such analyses, with family-wise approaches, such as the Bonferroni and Dunn-Šidàk corrections, being popular. Conditions that motivate the use of family-wise corrections are explored. Although simple to implement, one major limitation of these approaches is that they assume that p-values are i.i.d. uniformly distributed under the null hypothesis. However, several factors may violate this assumption in genome-wide studies including effects from confounding by population stratification, the presence of related individuals, the correlational structure among genetic markers, and the use of limiting distributions for test statistics. Even after adjustment for such effects, the distribution of p-values can substantially depart from a uniform distribution under the null hypothesis. In this work, I present a decision theory for the use of family-wise corrections for multiplicity and a generalization of the Dunn-Šidàk correction that relaxes the assumption of uniformly-distributed null p-values. The independence assumption is also relaxed and handled through calculating the effective number of independent tests. I also explicitly show the relationship between order statistics and family-wise correction procedures. This generalization may be applicable to multiplicity problems outside of genomics. PMID:27128491

  11. Genome-Wide Identification and Expression Profile of Dof Transcription Factor Gene Family in Pepper (Capsicum annuum L.)

    PubMed Central

    Wu, Zhiming; Cheng, Jiaowen; Cui, Junjie; Xu, Xiaowan; Liang, Guansheng; Luo, Xirong; Chen, Xiaocui; Tang, Xiangqun; Hu, Kailin; Qin, Cheng

    2016-01-01

    Dof (DNA-binding One Zinc Finger) transcription factor family is unique to plants and has diverse roles associated with plant-specific phenomena, such as light, phytohormone and defense responses as well as seed development and germination. Although, genome-wide analysis of this family has been performed in many species, information regarding Dof genes in the pepper, Capsicum annuum L., is extremely limited. In this study, exhaustive searches of pepper genome revealed 33 potential CaDofs that were phylogenetically clustered into four subgroups. Twenty-nine of the 33 Dof genes could be mapped on 11 chromosomes, except for chromosome 7. The intron/exon organizations and conserved motif compositions of these genes were also analyzed. Additionally, phylogenetic analysis and classification of the Dof transcription factor family in eight plant species revealed that S. lycopersicum and C. annuum as well as O. sativa and S. bicolor Dof proteins may have evolved conservatively. Moreover, comprehensive expression analysis of CaDofs using a RNA-seq atlas and quantitative real-time polymerase chain reaction (qRT-PCR) revealed that these genes exhibit a variety of expression patterns. Most of the CaDofs were expressed in at least one of the tissues tested, whereas several genes were identified as being highly responsive to heat and salt stresses. Overall, this study describes the first genome-wide analysis of the pepper Dof family, whose genes exhibited different expression patterns in all primary fruit developmental stages and tissue types, as in response to abiotic stress. In particular, some Dof genes might be used as biomarkers for heat and salt stress. The results could expand our understanding of the roles of Dof genes in pepper. PMID:27200047

  12. Efficient generalized least squares method for mixed population and family-based samples in genome-wide association studies.

    PubMed

    Li, Jia; Yang, James; Levin, Albert M; Montgomery, Courtney G; Datta, Indrani; Trudeau, Sheri; Adrianto, Indra; McKeigue, Paul; Iannuzzi, Michael C; Rybicki, Benjamin A

    2014-07-01

    Genome-wide association studies (GWAS) that draw samples from multiple studies with a mixture of relationship structures are becoming more common. Analytical methods exist for using mixed-sample data, but few methods have been proposed for the analysis of genotype-by-environment (G×E) interactions. Using GWAS data from a study of sarcoidosis susceptibility genes in related and unrelated African Americans, we explored the current analytic options for genotype association testing in studies using both unrelated and family-based designs. We propose a novel method-generalized least squares (GLX)-to estimate both SNP and G×E interaction effects for categorical environmental covariates and compared this method to generalized estimating equations (GEE), logistic regression, the Cochran-Armitage trend test, and the WQLS and MQLS methods. We used simulation to demonstrate that the GLX method reduces type I error under a variety of pedigree structures. We also demonstrate its superior power to detect SNP effects while offering computational advantages and comparable power to detect G×E interactions versus GEE. Using this method, we found two novel SNPs that demonstrate a significant genome-wide interaction with insecticide exposure-rs10499003 and rs7745248, located in the intronic and 3' UTR regions of the FUT9 gene on chromosome 6q16.1.

  13. Efficient Generalized Least Squares Method for Mixed Population and Family-based Samples in Genome-wide Association Studies

    PubMed Central

    Li, Jia; Yang, James; Levin, Albert M.; Montgomery, Courtney G.; Datta, Indrani; Trudeau, Sheri; Adrianto, Indra; McKeigue, Paul; Iannuzzi, Michael C.; Rybicki, Benjamin A.

    2014-01-01

    Genome-wide association studies (GWAS) that draw samples from multiple studies with a mixture of relationship structures are becoming more common. Analytical methods exist for using mixed-sample data, but few methods have been proposed for the analysis of genotype-by-environment (G×E) interactions. Using GWAS data from a study of sarcoidosis susceptibility genes in related and unrelated African Americans, we explored the current analytic options for genotype association testing in studies using both unrelated and family-based designs. We propose a novel method—generalized least squares (GLX)—to estimate both SNP and G×E interaction effects for categorical environmental covariates and compared this method to generalized estimating equations (GEE), logistic regression, the Cochran–Armitage trend test, and the WQLS and MQLS methods. We used simulation to demonstrate that the GLX method reduces type I error under a variety of pedigree structures. We also demonstrate its superior power to detect SNP effects while offering computational advantages and comparable power to detect G×E interactions versus GEE. Using this method, we found two novel SNPs that demonstrate a significant genome-wide interaction with insecticide exposure—rs10499003 and rs7745248, located in the intronic and 3′ UTR regions of the FUT9 gene on chromosome 6q16.1. PMID:24845555

  14. Genome-wide identification and characterization of R2R3MYB family in Rosaceae.

    PubMed

    González, Máximo; Carrasco, Basilio; Salazar, Erika

    2016-09-01

    Transcription factors R2R3MYB family have been associated with the control of secondary metabolites, development of structures, cold tolerance and response to biotic and abiotic stress, among others. In recent years, genomes of Rosaceae botanical family are available. Although this information has been used to study the karyotype evolution of these species from an ancestral genome, there are no studies that treat the evolution and diversity of gene families present in these species or in the botanical family. Here we present the first comparative study of the R2R3MYB subfamily of transcription factors in three species of Rosaceae family (Malus domestica, Prunus persica and Fragaria vesca). We described 186, 98 and 86 non-redundant gene models for apple, peach and strawberry, respectively. In this research, we analyzed the intron-exon structure and genomic distribution of R2R3MYB families mentioned above. The phylogenetic comparisons revealed putative functions of some R2R3MYB transcription factors. This analysis found 44 functional subgroups, seven of which were unique for Rosaceae. In addition, our results showed a highly collinearity among some genes revealing the existence of conserved gene models between the three species studied. Although some gene models in these species have been validated under several approaches, more research in the Rosaceae family is necessary to determine gene expression patterns in specific tissues and development stages to facilitate understanding of the regulatory and biochemical mechanism in this botanical family.

  15. Cyclic nucleotide gated channel gene family in tomato: genome-wide identification and functional analyses in disease resistance

    PubMed Central

    Saand, Mumtaz A.; Xu, You-Ping; Li, Wen; Wang, Ji-Peng; Cai, Xin-Zhong

    2015-01-01

    The cyclic nucleotide gated channel (CNGC) is suggested to be one of the important calcium conducting channels. Nevertheless, genome-wide identification and systemic functional analysis of CNGC gene family in crop plant species have not yet been conducted. In this study, we performed genome-wide identification of CNGC gene family in the economically important crop tomato (Solanum lycopersicum L.) and analyzed function of the group IVb SlCNGC genes in disease resistance. Eighteen CNGC genes were identified in tomato genome, and four CNGC loci that were misannotated at database were corrected by cloning and sequencing. Detailed bioinformatics analyses on gene structure, domain composition and phylogenetic relationship of the SlCNGC gene family were conducted and the group-specific feature was revealed. Comprehensive expression analyses demonstrated that SlCNGC genes were highly, widely but differently responsive to diverse stimuli. Pharmacological assays showed that the putative CNGC activators cGMP and cAMP enhanced resistance against Sclerotinia sclerotiorum. Silencing of group IVb SlCNGC genes significantly enhanced resistance to fungal pathogens Pythium aphanidermatum and S. sclerotiorum, strongly reduced resistance to viral pathogen Tobacco rattle virus, while attenuated PAMP- and DAMP-triggered immunity as shown by obvious decrease of the flg22- and AtPep1-elicited hydrogen peroxide accumulation in SlCNGC-silenced plants. Additionally, silencing of these SlCNGC genes significantly altered expression of a set of Ca2+ signaling genes including SlCaMs, SlCDPKs, and SlCAMTA3. Collectively, our results reveal that group IV SlCNGC genes regulate a wide range of resistance in tomato probably by affecting Ca2+ signaling. PMID:25999969

  16. Genome-Wide Identification, Characterization and Expression Analysis of the Chalcone Synthase Family in Maize

    PubMed Central

    Han, Yahui; Ding, Ting; Su, Bo; Jiang, Haiyang

    2016-01-01

    Members of the chalcone synthase (CHS) family participate in the synthesis of a series of secondary metabolites in plants, fungi and bacteria. The metabolites play important roles in protecting land plants against various environmental stresses during the evolutionary process. Our research was conducted on comprehensive investigation of CHS genes in maize (Zea mays L.), including their phylogenetic relationships, gene structures, chromosomal locations and expression analysis. Fourteen CHS genes (ZmCHS01–14) were identified in the genome of maize, representing one of the largest numbers of CHS family members identified in one organism to date. The gene family was classified into four major classes (classes I–IV) based on their phylogenetic relationships. Most of them contained two exons and one intron. The 14 genes were unevenly located on six chromosomes. Two segmental duplication events were identified, which might contribute to the expansion of the maize CHS gene family to some extent. In addition, quantitative real-time PCR and microarray data analyses suggested that ZmCHS genes exhibited various expression patterns, indicating functional diversification of the ZmCHS genes. Our results will contribute to future studies of the complexity of the CHS gene family in maize and provide valuable information for the systematic analysis of the functions of the CHS gene family. PMID:26828478

  17. Genome-Wide Identification and Comparative Analysis of Albumin Family in Vertebrates

    PubMed Central

    Li, Shugang; Cao, Yiping; Geng, Fang

    2017-01-01

    Albumins are the most well-known globular proteins, and the most typical representatives are the serum albumins. However, less attention was paid to the albumin family, except for the human and bovine serum albumin. To characterize the features of albumin family, we have mined all the putative albumin proteins from the available genome sequences. The results showed that albumin is widely distributed in vertebrates, but not present in the bacteria and archaea. The phylogenetic analysis of vertebrate albumin family implied an evolutionary relationship between members of serum albumin, α-fetoprotein, vitamin D–binding protein, and afamin. Meanwhile, a new member from the albumin family was found, namely, extracellular matrix protein 1. The structural analysis revealed that the motifs for forming the internal disulfide bonds are highly conserved in the albumin family, despite the low overall sequence identity across the family. The domain arrangement of albumin proteins indicated that most of vertebrate albumins contain 3 characteristic domains, arising from 2 evolutionary patterns. And a significant trend has been observed that the albumin proteins in higher vertebrate species tend to possess more characteristic domains. This study has provided the fundamental information required for achieving a better understanding of the albumin distribution, phylogenetic relationship, characteristic motif, structure, and new insights into the evolutionary pattern. PMID:28680266

  18. Genome-wide Analysis of WD40 Protein Family in Human

    PubMed Central

    Zou, Xu-Dong; Hu, Xue-Jia; Ma, Jing; Li, Tuan; Ye, Zhi-Qiang; Wu, Yun-Dong

    2016-01-01

    The WD40 proteins, often acting as scaffolds to form functional complexes in fundamental cellular processes, are one of the largest families encoded by the eukaryotic genomes. Systematic studies of this family on genome scale are highly required for understanding their detailed functions, but are currently lacking in the animal lineage. Here we present a comprehensive in silico study of the human WD40 family. We have identified 262 non-redundant WD40 proteins, and grouped them into 21 classes according to their domain architectures. Among them, 11 animal-specific domain architectures have been recognized. Sequence alignment indicates the complicated duplication and recombination events in the evolution of this family. Through further phylogenetic analysis, we have revealed that the WD40 family underwent more expansion than the overall average in the evolutionary early stage, and the early emerged WD40 proteins are prone to domain architectures with fundamental cellular roles and more interactions. While most widely and highly expressed human WD40 genes originated early, the tissue-specific ones often have late origin. These results provide a landscape of the human WD40 family concerning their classification, evolution, and expression, serving as a valuable complement to the previous studies in the plant lineage. PMID:27991561

  19. Genome-wide analysis of Aux/IAA and ARF gene families in Populus trichocarpa

    SciTech Connect

    Kalluri, Udaya C; DiFazio, Stephen P; Brunner, A.; Tuskan, Gerald A

    2007-01-01

    Auxin/Indole-3-Acetic Acid (Aux/IAA) and Auxin Response Factor (ARF) transcription factors are key regulators of auxin responses in plants. A total of 35 Aux/IAA and 39 ARF genes were identified in the Populus genome. Comparative phylogenetic analysis revealed that the subgroups PoptrARF2, 6, 9 and 16 and PoptrIAA3, 16, 27 and 29 have differentially expanded in Populus relative to Arabidopsis. Activator ARFs were found to be two fold-overrepresented in the Populus genome. PoptrIAA and PoptrARF gene families appear to have expanded due to high segmental and low tandem duplication events. Furthermore, expression studies showed that genes in the expanded PoptrIAA3 subgroup display differential expression. The gene-family analysis reported here will be useful in conducting future functional genomics studies to understand how the molecular roles of these large gene families translate into a diversity of biologically meaningful auxin effects.

  20. Variation in genome-wide mutation rates within and between human families.

    PubMed

    Conrad, Donald F; Keebler, Jonathan E M; DePristo, Mark A; Lindsay, Sarah J; Zhang, Yujun; Casals, Ferran; Idaghdour, Youssef; Hartl, Chris L; Torroja, Carlos; Garimella, Kiran V; Zilversmit, Martine; Cartwright, Reed; Rouleau, Guy A; Daly, Mark; Stone, Eric A; Hurles, Matthew E; Awadalla, Philip

    2011-06-12

    J.B.S. Haldane proposed in 1947 that the male germline may be more mutagenic than the female germline. Diverse studies have supported Haldane's contention of a higher average mutation rate in the male germline in a variety of mammals, including humans. Here we present, to our knowledge, the first direct comparative analysis of male and female germline mutation rates from the complete genome sequences of two parent-offspring trios. Through extensive validation, we identified 49 and 35 germline de novo mutations (DNMs) in two trio offspring, as well as 1,586 non-germline DNMs arising either somatically or in the cell lines from which the DNA was derived. Most strikingly, in one family, we observed that 92% of germline DNMs were from the paternal germline, whereas, in contrast, in the other family, 64% of DNMs were from the maternal germline. These observations suggest considerable variation in mutation rates within and between families.

  1. Genome-Wide Identification of the Invertase Gene Family in Populus.

    PubMed

    Chen, Zhong; Gao, Kai; Su, Xiaoxing; Rao, Pian; An, Xinmin

    2015-01-01

    Invertase plays a crucial role in carbohydrate partitioning and plant development as it catalyses the irreversible hydrolysis of sucrose into glucose and fructose. The invertase family in plants is composed of two sub-families: acid invertases, which are targeted to the cell wall and vacuole; and neutral/alkaline invertases, which function in the cytosol. In this study, 5 cell wall invertase genes (PtCWINV1-5), 3 vacuolar invertase genes (PtVINV1-3) and 16 neutral/alkaline invertase genes (PtNINV1-16) were identified in the Populus genome and found to be distributed on 14 chromosomes. A comprehensive analysis of poplar invertase genes was performed, including structures, chromosome location, phylogeny, evolutionary pattern and expression profiles. Phylogenetic analysis indicated that the two sub-families were both divided into two clades. Segmental duplication is contributed to neutral/alkaline sub-family expansion. Furthermore, the Populus invertase genes displayed differential expression in roots, stems, leaves, leaf buds and in response to salt/cold stress and pathogen infection. In addition, the analysis of enzyme activity and sugar content revealed that invertase genes play key roles in the sucrose metabolism of various tissues and organs in poplar. This work lays the foundation for future functional analysis of the invertase genes in Populus and other woody perennials.

  2. Genome-Wide Identification of the Invertase Gene Family in Populus

    PubMed Central

    Su, Xiaoxing; Rao, Pian; An, Xinmin

    2015-01-01

    Invertase plays a crucial role in carbohydrate partitioning and plant development as it catalyses the irreversible hydrolysis of sucrose into glucose and fructose. The invertase family in plants is composed of two sub-families: acid invertases, which are targeted to the cell wall and vacuole; and neutral/alkaline invertases, which function in the cytosol. In this study, 5 cell wall invertase genes (PtCWINV1-5), 3 vacuolar invertase genes (PtVINV1-3) and 16 neutral/alkaline invertase genes (PtNINV1-16) were identified in the Populus genome and found to be distributed on 14 chromosomes. A comprehensive analysis of poplar invertase genes was performed, including structures, chromosome location, phylogeny, evolutionary pattern and expression profiles. Phylogenetic analysis indicated that the two sub-families were both divided into two clades. Segmental duplication is contributed to neutral/alkaline sub-family expansion. Furthermore, the Populus invertase genes displayed differential expression in roots, stems, leaves, leaf buds and in response to salt/cold stress and pathogen infection. In addition, the analysis of enzyme activity and sugar content revealed that invertase genes play key roles in the sucrose metabolism of various tissues and organs in poplar. This work lays the foundation for future functional analysis of the invertase genes in Populus and other woody perennials. PMID:26393355

  3. Genome-wide identification, characterization, and expression analysis of the MLO gene family in Cucumis sativus.

    PubMed

    Zhou, S J; Jing, Z; Shi, J L

    2013-12-11

    Mildew resistance locus o (MLO) is a plant-specific seven-transmembrane (TM) gene family. Several studies have revealed that certain members of the MLO gene family mediate powdery mildew susceptibility in three plant species, namely, Arabidopsis, barley, and tomato. The sequenced cucumber genome provides an opportunity to conduct a comprehensive overview of the MLO gene family. Fourteen genes (designated CsMLO01 through CsMLO14) have been identified within the Cucumis sativus genome by using an in silico cloning method with the MLO amino acid sequences of Arabidopsis thaliana and rice as probes. Sequence alignment revealed that numerous features of the gene family, such as TMs, a calmodulin-binding domain, peptide domains I and II, and 30 important amino acid residues for MLO function, are well conserved. Phylogenetic analysis of the MLO genes from cucumber and other plant species reveals seven different clades (I through VII). Three of these clades comprised MLO genes from A. thaliana, rice, maize, and cucumber, suggesting that these genes may have evolved after the divergence of monocots and dicots. In silico mapping showed that these CsMLOs were located on chromosomes 1, 2, 3, 4, 5, and 6 without any obvious clustering, except CsMLO01. To our knowledge, this paper is the first comprehensive report on MLO genes in C. sativus. These findings will facilitate the functional characterization of the MLOs related to powdery mildew susceptibility and assist in the development of disease resistance in cucumber.

  4. Genome-wide analysis of the MADS-box gene family in Brassica rapa (Chinese cabbage).

    PubMed

    Duan, Weike; Song, Xiaoming; Liu, Tongkun; Huang, Zhinan; Ren, Jun; Hou, Xilin; Li, Ying

    2015-02-01

    The MADS-box gene family is an ancient and well-studied transcription factor family that functions in almost every developmental process in plants. There are a number of reports about the MADS-box family in different plant species, but systematic analysis of the MADS-box transcription factor family in Brassica rapa (Chinese cabbage) is still lacking. In this study, 160 MADS-box transcription factors were identified from the entire Chinese cabbage genome and compared with the MADS-box factors from 21 other representative plant species. A detailed list of MADS proteins from these 22 species was sorted. Phylogenetic analysis of the BrMADS genes, together with their Arabidopsis and rice counterparts, showed that the BrMADS genes were categorised into type I (Mα, Mβ, Mγ) and type II (MIKC(C), MIKC*) groups, and the MIKC(C) proteins were further divided into 13 subfamilies. The Chinese cabbage type II group has 95 members, which is twice as much as the Arabidopsis type II group, indicating that the Chinese cabbage type II genes have been retained more frequently than the type I genes. Finally, RNA-seq transcriptome data and quantitative real-time PCR analysis revealed that BrMADS genes are expressed in a tissue-specific manner similar to Arabidopsis. Interestingly, a number of BrMIKC genes showed responses to different abiotic stress treatments, suggesting a function for some of the genes in these processes as well. Taken together, the characterization of the B. rapa MADS-box family presented here, will certainly help in the selection of appropriate candidate genes and further facilitate functional studies in Chinese cabbage.

  5. Genome-Wide Analysis of the NADK Gene Family in Plants

    PubMed Central

    Li, Wen-Yan; Wang, Xiang; Li, Ri; Li, Wen-Qiang; Chen, Kun-Ming

    2014-01-01

    Background NAD(H) kinase (NADK) is the key enzyme that catalyzes de novo synthesis of NADP(H) from NAD(H) for NADP(H)-based metabolic pathways. In plants, NADKs form functional subfamilies. Studies of these families in Arabidopsis thaliana indicate that they have undergone considerable evolutionary selection; however, the detailed evolutionary history and functions of the various NADKs in plants are not clearly understood. Principal Findings We performed a comparative genomic analysis that identified 74 NADK gene homologs from 24 species representing the eight major plant lineages within the supergroup Plantae: glaucophytes, rhodophytes, chlorophytes, bryophytes, lycophytes, gymnosperms, monocots and eudicots. Phylogenetic and structural analysis classified these NADK genes into four well-conserved subfamilies with considerable variety in the domain organization and gene structure among subfamily members. In addition to the typical NAD_kinase domain, additional domains, such as adenylate kinase, dual-specificity phosphatase, and protein tyrosine phosphatase catalytic domains, were found in subfamily II. Interestingly, NADKs in subfamily III exhibited low sequence similarity (∼30%) in the kinase domain within the subfamily and with the other subfamilies. These observations suggest that gene fusion and exon shuffling may have occurred after gene duplication, leading to specific domain organization seen in subfamilies II and III, respectively. Further analysis of the exon/intron structures showed that single intron loss and gain had occurred, yielding the diversified gene structures, during the process of structural evolution of NADK family genes. Finally, both available global microarray data analysis and qRT-RCR experiments revealed that the NADK genes in Arabidopsis and Oryza sativa show different expression patterns in different developmental stages and under several different abiotic/biotic stresses and hormone treatments, underscoring the functional diversity

  6. Genome-Wide analysis of the AAAP gene family in moso bamboo (Phyllostachys edulis).

    PubMed

    Liu, Huanlong; Wu, Min; Zhu, Dongyue; Pan, Feng; Wang, Yujiao; Wang, Yue; Xiang, Yan

    2017-01-31

    Members of the amino acid/auxin permease (AAAP) gene family play indispensable roles in various plant metabolism and biosynthesis processes. Comprehensive analysis of AAAP genes has been conducted in Arabidopsis, rice, maize and poplar, but has not been reported from moso bamboo. Phylogenetics, evolutionary patterns and further expression profiles analysis of the AAAP gene family in moso bamboo (Phyllostachys edulis) will increase our understanding of this important gene family. In this current study, we conducted phylogenetic, gene structure, promoter region, divergence time, expression patterns and qRT-PCR analysis of the 55 predicted AAAP genes in moso bamboo based on the availability of the moso bamboo genome sequence. We identified 55 putative AAAP (PeAAAP1-55) genes, which were divided into eight distinct subfamilies based on comparative phylogenetic analysis using 184 full-length protein sequences, including 55 sequences from moso bamboo, 58 sequences from rice and 71 sequences from maize. Analysis of evolutionary patterns and divergence showed that the PeAAAP genes have undergone a extensive duplication event approximately 12 million years ago (MYA) and that the split between AAAP family genes in moso bamboo and rice occurred approximately 27 MYA. The microarray analysis suggested that some genes play considerable roles in moso bamboo growth and development. We investigated the expression levels of the 16 AAP subfamily genes under abiotic stress (drought, salt and cold) by qRT-PCR to explore the potential contributions to stress response of individual PeAAAP genes in moso bamboo. The results of this study suggest that PeAAAP genes play crucial roles in moso bamboo growth and development, especially in response to abiotic stress conditions. Our comprehensive, systematic study of the AAAPs gene family in moso bamboo will facilitate further analysis of the functions and evolution of AAAP genes in plants.

  7. A flexible multi-species genome-wide 60K SNP chip developed from pooled resequencing of 240 Eucalyptus tree genomes across 12 species.

    PubMed

    Silva-Junior, Orzenil B; Faria, Danielle A; Grattapaglia, Dario

    2015-06-01

    We used whole genome resequencing of pooled individuals to develop a high-density single-nucleotide polymorphism (SNP) chip for Eucalyptus. Genomes of 240 trees of 12 species were sequenced at 3.5× each, and 46 997 586 raw SNP variants were subject to multivariable filtering metrics toward a multispecies, genome-wide distributed chip content. Of the 60 904 SNPs on the chip, 59 222 were genotyped and 51 204 were polymorphic across 14 Eucalyptus species, providing a 96% genome-wide coverage with 1 SNP/12-20 kb, and 47 069 SNPs at ≤ 10 kb from 30 444 of the 33 917 genes in the Eucalyptus genome. Given the EUChip60K multi-species genotyping flexibility, we show that both the sample size and taxonomic composition of cluster files impact heterozygous call specificity and sensitivity by benchmarking against 'gold standard' genotypes derived from deeply sequenced individual tree genomes. Thousands of SNPs were shared across species, likely representing ancient variants arisen before the split of these taxa, hinting to a recent eucalypt radiation. We show that the variable SNP filtering constraints allowed coverage of the entire site frequency spectrum, mitigating SNP ascertainment bias. The EUChip60K represents an outstanding tool with which to address population genomics questions in Eucalyptus and to empower genomic selection, GWAS and the broader study of complex trait variation in eucalypts. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.

  8. A genome-wide association scan for acute insulin response to glucose in Hispanic Americans: The IRAS Family Study

    PubMed Central

    Rich, S. S.; Goodarzi, M. O.; Palmer, N. D.; Langefeld, C. D.; Ziegler, J.; Haffner, S. M.; Bryer-Ash, M.; Norris, J. M.; Taylor, K. D.; Haritunians, T.; Rotter, J. I.; Chen, Y-D. I.; Wagenknecht, L. E.; Bowden, D. W.; Bergman, R. N.

    2009-01-01

    Aims/Hypothesis The goal of this study was to identify genes and regions in the human genome that are associated with the acute insulin response to glucose (AIRg), an important predictor of type 2 diabetes, in Hispanic-American participants from the Insulin Resistance Atherosclerosis Family Study (IRAS FS). Methods A two-stage genome-wide association scan (GWAS) was performed in IRAS FS Hispanic-American samples. In the first stage, 318K single nucleotide polymorphisms (SNPs) were assessed in 229 Hispanic-American DNA samples (from 34 families) from San Antonio, TX. SNPs with the most significant associations with AIRg were genotyped in the entire set of IRAS FS Hispanic-American samples (n = 1190). In chromosomal regions with evidence of association, additional SNPs were genotyped to capture variation in genes. Results No individual SNP achieved genome-wide levels of significance (P < 5 × 10-7); however, two regions — chromosomes 6p21 and 20p11 — had multiple highly-ranked SNPs that were associated with AIRg. Additional genotyping in these regions supported the initial evidence for variants contributing to variation in AIRg. One region resides in a gene desert between PXT1 and KCTD20 on 6p21 while the region on 20p11 has several viable candidate genes (ENTPD6, PYGB, GINS1 and R4-691N24.1). Conclusions/Interpretation A GWAS in Hispanic-American samples identified several candidate genes and loci that may be associated with AIRg. These associations explain a small component of variation in AIRg. The genes identified are involved in phosphorylation and ion transport and provide preliminary evidence that these processes have importance in beta cell response. PMID:19430760

  9. Genome-wide analysis of the GRAS gene family in Chinese cabbage (Brassica rapa ssp. pekinensis).

    PubMed

    Song, Xiao-Ming; Liu, Tong-Kun; Duan, Wei-Ke; Ma, Qing-Hua; Ren, Jun; Wang, Zhen; Li, Ying; Hou, Xi-Lin

    2014-01-01

    The GRAS gene family is one of the most important families of transcriptional regulators. In this study, 48 GRAS genes are identified from Chinese cabbage, and they are classified into eight groups according to the classification of Arabidopsis. The characterization, classification, gene structure and phylogenetic construction of GRAS proteins are performed. Distribution mapping shows that GRAS proteins are nonrandomly localized in 10 chromosomes. Fifty-five orthologous gene pairs are shared by Chinese cabbage and Arabidopsis, and interaction networks of these orthologous genes are constructed. The expansion of GRAS genes in Chinese cabbage results from genome triplication. Among the 17 species examined, 14 higher plants carry the GRAS genes, whereas two lower plants and one fungi species do not. Furthermore, the expression patterns of GRAS genes exhibit differences in three tissues based on RNA-seq data. Taken together, this comprehensive analysis will provide rich resources for studying GRAS protein functions in Chinese cabbage.

  10. Genome-wide identification and analysis of MAPK and MAPKK gene families in Brachypodium distachyon.

    PubMed

    Chen, Lihong; Hu, Wei; Tan, Shenglong; Wang, Min; Ma, Zhanbing; Zhou, Shiyi; Deng, Xiaomin; Zhang, Yang; Huang, Chao; Yang, Guangxiao; He, Guangyuan

    2012-01-01

    MAPK cascades are universal signal transduction modules and play important roles in plant growth, development and in response to a variety of biotic and abiotic stresses. Although MAPKs and MAPKKs have been systematically investigated in several plant species including Arabidopsis, rice and poplar, no systematic analysis has been conducted in the emerging monocot model plant Brachypodium distachyon. In the present study, a total of 16 MAPK genes and 12 MAPKK genes were identified from B. distachyon. An analysis of the genomic evolution showed that both tandem and segment duplications contributed significantly to the expansion of MAPK and MAPKK families. Evolutionary relationships within subfamilies were supported by exon-intron organizations and the architectures of conserved protein motifs. Synteny analysis between B. distachyon and the other two plant species of rice and Arabidopsis showed that only one homolog of B. distachyon MAPKs was found in the corresponding syntenic blocks of Arabidopsis, while 13 homologs of B. distachyon MAPKs and MAPKKs were found in that of rice, which was consistent with the speciation process of the three species. In addition, several interactive protein pairs between the two families in B. distachyon were found through yeast two hybrid assay, whereas their orthologs of a pair in Arabidopsis and other plant species were not found to interact with each other. Finally, expression studies of closely related family members among B. distachyon, Arabidopsis and rice showed that even recently duplicated representatives may fulfill different functions and be involved in different signal pathways. Taken together, our data would provide a foundation for evolutionary and functional characterization of MAPK and MAPKK gene families in B. distachyon and other plant species to unravel their biological roles.

  11. Genome-Wide Identification of Potential Drug Target in Enterobacteriaceae Family: A Homology-Based Method.

    PubMed

    Hadizadeh, Morteza; Tabatabaiepour, Seyyede Nasim; Tabatabaiepour, Seyyede Zahra; Hosseini Nave, Hossein; Mohammadi, Mohsen; Sohrabi, Seyyed Mohsen

    2017-05-18

    The Enterobacteriaceae is a large family of Gram-negative, facultative anaerobic, non-spore forming rod-shaped bacteria that includes harmless and pathogenic organisms. The emergence and development of drug resistance in Enterobacteriaceae is complicating the treatment of serious infections. The aim of this study is to predict and characterize putative drug targets in Enterobacteriaceae family employing a homology-based computational method. The final putative drug targets were qualitatively characterized via cellular function prediction, subcellular localization prediction, broad-spectrum, and druggability analyses. Of 6,327 analyzed proteins, 35 proteins were selected as final putative drug targets in Enterobacteriaceae family. These putative drug targets were involved in different vital pathways like metabolism, biosynthesis of macromolecule, and cell division. Predicted drug targets were also localized in the cytoplasm and cytoplasmic membrane of the pathogen that acts as antimicrobial or vaccine targets. Of 35 drug targets, 5 targets were druggable and 30 targets were not druggable and were predicted as novel drug targets, which should be further evaluated to develop new antimicrobial. Thirteen drug targets were considered as broad-spectrum targets. It is expected that results of our study could facilitate the production of novel antibacterial for efficient treatment of infections caused by Enterobacteriaceae pathogens.

  12. [Genome-wide identification and bioinformatic analysis of PPR gene family in tomato].

    PubMed

    Ding, Anming; Li, Ling; Qu, Xu; Sun, Tingting; Chen, Yaqiong; Zong, Peng; Li, Zunqiang; Gong, Daping; Sun, Yuhe

    2014-01-01

    Pentatricopeptide repeats (PPRs) genes constitute one of the largest gene families in plants, which play a broad and essential role in plant growth and development. In this study, the protein sequences annotated by the tomato (S. lycopersicum L.) genome project were screened with the Pfam PPR sequences. A total of 471 putative PPR-encoding genes were identified. Based on the motifs defined in A. thaliana L., protein structure and conserved sequences for each tomato motif were analyzed. We also analyzed phylogenetic relationship, subcellular localization, expression and GO analysis of the identified gene sequences. Our results demonstrate that tomato PPR gene family contains two subfamilies, P and PLS, each accounting for half of the family. PLS subfamily can be divided into four subclasses i.e., PLS, E, E+ and DYW. Each subclass of sequences forms a clade in the phylogenetic tree. The PPR motifs were found highly conserved among plants. The tomato PPR genes were distributed over 12 chromosomes and most of them lack introns. The majority of PPR proteins harbor mitochondrial or chloroplast localization sequences, whereas GO analysis showed that most PPR proteins participate in RNA-related biological processes.

  13. Genome wide analysis of Cyclophilin gene family from rice and Arabidopsis and its comparison with yeast.

    PubMed

    Trivedi, Dipesh Kumar; Yadav, Sandep; Vaid, Neha; Tuteja, Narendra

    2012-12-01

    Cyclophilin proteins are the members of immunophillin group of proteins, known for their property of binding to the immune-suppressant drug cyclosporin A, hence named as cyclophilins. These proteins are characterized by the presence of peptidyl prolyl isomerase (PPIase) domain which catalyzes the cis-trans isomerisation process of proline residues. In the present study, an in-silico based approach was followed to identify and characterize the cyclophilin family from rice, Arabidopsis and yeast. We were able to identify 28 rice, 35 Arabidopsis and 8 yeast cyclophilin genes from their respective genomes on the basis of their annotation as well as the presence of highly conserved PPIase domain. The evolutionary relationship of the cyclophilin genes from the three genomes was analyzed using the phylogenetic tree. We have also classified the rice cyclophilin genes on the basis of localization of the protein in cell. The structural similarity of the cyclophilins was also analyzed on the basis of their homology model. The expression analysis performed using Genevestigator revealed a very strong stress responsive behavior of the gene family which was more prominent in later stages of stress. The study indicates the importance of the gene family in stress response as well as several developmental stages thus opening up many avenues for future study on the cyclophilin proteins.

  14. Systematic genome-wide annotation of spliceosomal proteins reveals differential gene family expansion

    PubMed Central

    Barbosa-Morais, Nuno L.; Carmo-Fonseca, Maria; Aparício, Samuel

    2006-01-01

    Although more than 200 human spliceosomal and splicing-associated proteins are known, the evolution of the splicing machinery has not been studied extensively. The recent near-complete sequencing and annotation of distant vertebrate and chordate genomes provides the opportunity for an exhaustive comparative analysis of splicing factors across eukaryotes. We describe here our semiautomated computational pipeline to identify and annotate splicing factors in representative species of eukaryotes. We focused on protein families whose role in splicing is confirmed by experimental evidence. We visually inspected 1894 proteins and manually curated 224 of them. Our analysis shows a general conservation of the core spliceosomal proteins across the eukaryotic lineage, contrasting with selective expansions of protein families known to play a role in the regulation of splicing, most notably of SR proteins in metazoans and of heterogeneous nuclear ribonucleoproteins (hnRNP) in vertebrates. We also observed vertebrate-specific expansion of the CLK and SRPK kinases (which phosphorylate SR proteins), and the CUG-BP/CELF family of splicing regulators. Furthermore, we report several intronless genes amongst splicing proteins in mammals, suggesting that retrotransposition contributed to the complexity of the mammalian splicing apparatus. PMID:16344558

  15. Expression and genome-wide analysis of the xylogen-type gene family.

    PubMed

    Kobayashi, Yuuki; Motose, Hiroyasu; Iwamoto, Kuninori; Fukuda, Hiroo

    2011-06-01

    In higher plants, many extracellular proteins are involved in developmental processes, including cell-cell signaling and cell wall construction. Xylogen is an extracellular arabinogalactan protein (AGP) isolated from Zinnia elegans xylogenic culture medium, which promotes xylem cell differentiation. Xylogen has a unique structure, containing a non-specific lipid transfer protein (nsLTP) domain and AGP domains. We searched for xylogen-type genes in the genomes of land plants, including Arabidopsis thaliana, to further our knowledge of xylogen-type genes as functional extracellular proteins in plants. We found that many xylogen-type genes, including 13 Arabidopsis genes, comprise a gene family in land plants, including Populus trichocarpa, Vitis vinifera, Lotus japonicus, Oryza sativa, Selaginella moellendorffii and Physcomitrella patens. The genes shared an N-terminal signal peptide sequence, a distinct nsLTP domain, one or more AGP domains and a glycosylphosphatidylinositol (GPI)-anchored sequence. We analyzed transgenic plants harboring promoter::GUS (β-glucuronidase) constructs to test expression of the 13 Arabidopsis xylogen-type genes, and detected a diversity of gene family members with related expression patterns. AtXYP2 was the best candidate as the Arabidopsis counterpart of the Zinnia xylogen gene. We observed two distinct expression patterns for several genes, with some anther specific and others preferentially expressed in the endodermis/pericycle. We conclude that xylogen-type genes, which may have diverse functions, form a novel chimeric AGP gene family with a distinct nsLTP domain.

  16. Comparative Genome-Wide Analysis of the Malate Dehydrogenase Gene Families in Cotton

    PubMed Central

    Imran, Muhammad; Tang, Kai; Liu, Jin-Yuan

    2016-01-01

    Malate dehydrogenases (MDHs) play crucial roles in the physiological processes of plant growth and development. In this study, 13 and 25 MDH genes were identified from Gossypium raimondii and Gossypium hirsutum, respectively. Using these and 13 previously reported Gossypium arboretum MDH genes, a comparative molecular analysis between identified MDH genes from G. raimondii, G. hirsutum, and G. arboretum was performed. Based on multiple sequence alignments, cotton MDHs were divided into five subgroups: mitochondrial MDH, peroxisomal MDH, plastidial MDH, chloroplastic MDH and cytoplasmic MDH. Almost all of the MDHs within the same subgroup shared similar gene structure, amino acid sequence, and conserved motifs in their functional domains. An analysis of chromosomal localization suggested that segmental duplication played a major role in the expansion of cotton MDH gene families. Additionally, a selective pressure analysis indicated that purifying selection acted as a vital force in the evolution of MDH gene families in cotton. Meanwhile, an expression analysis showed the distinct expression profiles of GhMDHs in different vegetative tissues and at different fiber developmental stages, suggesting the functional diversification of these genes in cotton growth and fiber development. Finally, a promoter analysis indicated redundant but typical cis-regulatory elements for the potential functions and stress activity of many MDH genes. This study provides fundamental information for a better understanding of cotton MDH gene families and aids in functional analyses of the MDH genes in cotton fiber development. PMID:27829020

  17. Genome-wide identification and expression analysis of WNK kinase gene family in rice.

    PubMed

    Manuka, Rakesh; Saddhe, Ankush Ashok; Kumar, Kundan

    2015-12-01

    Eukaryotic protein kinases represent one of the largest gene families involved in diverse regulatory functions. WNK (With No Lysine) kinases are members of ser/thr protein kinase family, which lack conserved catalytic lysine (K) residue at protein kinase subdomain II and is replaced by either asparagine, serine or glycine residues. They are involved in regulation of flowering time, circadian rhythms and abiotic stresses in Arabidopsis thaliana. In the present study, we have identified 9 members of WNK in rice, showed resemblance to Arabidopsis and human WNK and clustered into five main clades phylogenetically. The predicted genes structure, bonafide conserved signature motif and domains strongly support their identity, as members of WNK kinase family. We have analyzed their chromosomal distribution, physio-chemical properties, subcellular localizations and cis-elements in the promoter regions in silico. Further, transcript analysis of OsWNK by qRT-PCR revealed their differential regulation in tissue specific and abiotic stresses libraries. In conclusion, the identification of nine OsWNK and transcript level expression pattern under abiotic stress using qRT-PCR in rice will significantly contribute towards the understanding of WNK genes in monocots and thus provide a set up for functional genomics studies of WNK protein kinases.

  18. The ankyrin repeat gene family in rice: genome-wide identification, classification and expression profiling.

    PubMed

    Huang, Jianyan; Zhao, Xiaobo; Yu, Huihui; Ouyang, Yidan; Wang, Lei; Zhang, Qifa

    2009-10-01

    Ankyrin repeat (ANK) containing proteins comprise a large protein family. Although many members of this family have been implicated in plant growth, development and signal transduction, only a few ANK genes have been reported in rice. In this study, we analyzed the structures, phylogenetic relationship, genome localizations and expression profiles of 175 ankyrin repeat genes identified in rice (OsANK). Domain composition analysis suggested OsANK proteins can be classified into ten subfamilies. Chromosomal localizations of OsANK genes indicated nine segmental duplication events involving 17 genes and 65 OsANK genes were involved in tandem duplications. The expression profiles of 158 OsANK genes were analyzed in 24 tissues covering the whole life cycle of two rice genotypes, Minghui 63 and Zhenshan 97. Sixteen genes showed preferential expression in given tissues compared to all the other tissues in Minghui 63 and Zhenshan 97. Nine genes were preferentially expressed in stamen of 1 day before flowering, suggesting that these genes may play important roles in pollination and fertilization. Expression data of OsANK genes were also obtained with tissues of seedlings subjected to three phytohormone (NAA, GA3 and KT) and light/dark treatments. Eighteen genes showed differential expression with at least one phytohormone treatment while under light/dark treatments, 13 OsANK genes showed differential expression. Our data provided a very useful reference for cloning and functional analysis of members of this gene family in rice.

  19. Genome wide analysis of Cyclophilin gene family from rice and Arabidopsis and its comparison with yeast

    PubMed Central

    Trivedi, Dipesh Kumar; Yadav, Sandep; Vaid, Neha; Tuteja, Narendra

    2012-01-01

    Cyclophilin proteins are the members of immunophillin group of proteins, known for their property of binding to the immune-suppressant drug cyclosporin A, hence named as cyclophilins. These proteins are characterized by the presence of peptidyl prolyl isomerase (PPIase) domain which catalyzes the cis-trans isomerisation process of proline residues. In the present study, an in-silico based approach was followed to identify and characterize the cyclophilin family from rice, Arabidopsis and yeast. We were able to identify 28 rice, 35 Arabidopsis and 8 yeast cyclophilin genes from their respective genomes on the basis of their annotation as well as the presence of highly conserved PPIase domain. The evolutionary relationship of the cyclophilin genes from the three genomes was analyzed using the phylogenetic tree. We have also classified the rice cyclophilin genes on the basis of localization of the protein in cell. The structural similarity of the cyclophilins was also analyzed on the basis of their homology model. The expression analysis performed using Genevestigator revealed a very strong stress responsive behavior of the gene family which was more prominent in later stages of stress. The study indicates the importance of the gene family in stress response as well as several developmental stages thus opening up many avenues for future study on the cyclophilin proteins. PMID:23073011

  20. Genome-wide evolutionary characterization and expression analyses of WRKY family genes in Brachypodium distachyon.

    PubMed

    Wen, Feng; Zhu, Hong; Li, Peng; Jiang, Min; Mao, Wenqing; Ong, Chermaine; Chu, Zhaoqing

    2014-06-01

    Members of plant WRKY gene family are ancient transcription factors that function in plant growth and development and respond to biotic and abiotic stresses. In our present study, we have investigated WRKY family genes in Brachypodium distachyon, a new model plant of family Poaceae. We identified a total of 86 WRKY genes from B. distachyon and explored their chromosomal distribution and evolution, domain alignment, promoter cis-elements, and expression profiles. Combining the analysis of phylogenetic tree of BdWRKY genes and the result of expression profiling, results showed that most of clustered gene pairs had higher similarities in the WRKY domain, suggesting that they might be functionally redundant. Neighbour-joining analysis of 301 WRKY domains from Oryza sativa, Arabidopsis thaliana, and B. distachyon suggested that BdWRKY domains are evolutionarily more closely related to O. sativa WRKY domains than those of A. thaliana. Moreover, tissue-specific expression profile of BdWRKY genes and their responses to phytohormones and several biotic or abiotic stresses were analysed by quantitative real-time PCR. The results showed that the expression of BdWRKY genes was rapidly regulated by stresses and phytohormones, and there was a strong correlation between promoter cis-elements and the phytohormones-induced BdWRKY gene expression.

  1. Genome-wide comparative analysis of 20 miniature inverted-repeat transposable element families in Brassica rapa and B. oleracea.

    PubMed

    Sampath, Perumal; Murukarthick, Jayakodi; Izzah, Nur Kholilatul; Lee, Jonghoon; Choi, Hong-Il; Shirasawa, Kenta; Choi, Beom-Soon; Liu, Shengyi; Nou, Ill-Sup; Yang, Tae-Jin

    2014-01-01

    Miniature inverted-repeat transposable elements (MITEs) are ubiquitous, non-autonomous class II transposable elements. Here, we conducted genome-wide comparative analysis of 20 MITE families in B. rapa, B. oleracea, and Arabidopsis thaliana. A total of 5894 and 6026 MITE members belonging to the 20 families were found in the whole genome pseudo-chromosome sequences of B. rapa and B. oleracea, respectively. Meanwhile, only four of the 20 families, comprising 573 members, were identified in the Arabidopsis genome, indicating that most of the families were activated in the Brassica genus after divergence from Arabidopsis. Copy numbers varied from 4 to 1459 for each MITE family, and there was up to 6-fold variation between B. rapa and B. oleracea. In particular, analysis of intact members showed that whereas eleven families were present in similar copy numbers in B. rapa and B. oleracea, nine families showed copy number variation ranging from 2- to 16-fold. Four of those families (BraSto-3, BraTo-3, 4, 5) were more abundant in B. rapa, and the other five (BraSto-1, BraSto-4, BraTo-1, 7 and BraHAT-1) were more abundant in B. oleracea. Overall, 54% and 51% of the MITEs resided in or within 2 kb of a gene in the B. rapa and B. oleracea genomes, respectively. Notably, 92 MITEs were found within the CDS of annotated genes, suggesting that MITEs might play roles in diversification of genes in the recently triplicated Brassica genome. MITE insertion polymorphism (MIP) analysis of 289 MITE members showed that 52% and 23% were polymorphic at the inter- and intra-species levels, respectively, indicating that there has been recent MITE activity in the Brassica genome. These recently activated MITE families with abundant MIP will provide useful resources for molecular breeding and identification of novel functional genes arising from MITE insertion.

  2. Genome-Wide Comparative Analysis of 20 Miniature Inverted-Repeat Transposable Element Families in Brassica rapa and B. oleracea

    PubMed Central

    Sampath, Perumal; Murukarthick, Jayakodi; Izzah, Nur Kholilatul; Lee, Jonghoon; Choi, Hong-Il; Shirasawa, Kenta; Choi, Beom-Soon; Liu, Shengyi; Nou, Ill-Sup; Yang, Tae-Jin

    2014-01-01

    Miniature inverted-repeat transposable elements (MITEs) are ubiquitous, non-autonomous class II transposable elements. Here, we conducted genome-wide comparative analysis of 20 MITE families in B. rapa, B. oleracea, and Arabidopsis thaliana. A total of 5894 and 6026 MITE members belonging to the 20 families were found in the whole genome pseudo-chromosome sequences of B. rapa and B. oleracea, respectively. Meanwhile, only four of the 20 families, comprising 573 members, were identified in the Arabidopsis genome, indicating that most of the families were activated in the Brassica genus after divergence from Arabidopsis. Copy numbers varied from 4 to 1459 for each MITE family, and there was up to 6-fold variation between B. rapa and B. oleracea. In particular, analysis of intact members showed that whereas eleven families were present in similar copy numbers in B. rapa and B. oleracea, nine families showed copy number variation ranging from 2- to 16-fold. Four of those families (BraSto-3, BraTo-3, 4, 5) were more abundant in B. rapa, and the other five (BraSto-1, BraSto-4, BraTo-1, 7 and BraHAT-1) were more abundant in B. oleracea. Overall, 54% and 51% of the MITEs resided in or within 2 kb of a gene in the B. rapa and B. oleracea genomes, respectively. Notably, 92 MITEs were found within the CDS of annotated genes, suggesting that MITEs might play roles in diversification of genes in the recently triplicated Brassica genome. MITE insertion polymorphism (MIP) analysis of 289 MITE members showed that 52% and 23% were polymorphic at the inter- and intra-species levels, respectively, indicating that there has been recent MITE activity in the Brassica genome. These recently activated MITE families with abundant MIP will provide useful resources for molecular breeding and identification of novel functional genes arising from MITE insertion. PMID:24747717

  3. Genome-wide systematic characterization of the bZIP transcriptional factor family in tomato (Solanum lycopersicum L.).

    PubMed

    Li, Dayong; Fu, Fuyou; Zhang, Huijuan; Song, Fengming

    2015-10-12

    Transcription factors of the basic leucine zipper (bZIP) family represent exclusively in eukaryotes and have been shown to regulate diverse biological processes in plant growth and development as well as in abiotic and biotic stress responses. However, little is known about the bZIP family in tomato (Solanum lycopersicum L.). The SlbZIP genes were identified using local BLAST and hidden Markov model profile searches. The phylogenetic trees, conserved motifs and gene structures were generated by MEGA6.06, MEME tool and gene Structure Display Server, respectively. The syntenic block diagrams were generated by the Circos software. The transcriptional gene expression profiles were obtained using Genevestigator tool and quantitative RT-PCR. In the present study, we carried out a genome-wide identification and systematic analyses of 69 SlbZIP genes that distributes unevenly on the tomato chromosomes. This family can be divided into 9 groups according to the phylogenetic relationship among the SlbZIP proteins. Six kinds of intron patterns (a-f) within the basic and hinge regions are defined. The additional conserved motifs and their presence of the group specificity were also identified. Further, we predicted the DNA-binding patterns and the dimerization property on the basis of the characteristic features in the basic and hinge regions and the leucine zipper, respectively, which supports our classification greatly and helps to classify 24 distinct subfamilies. Within the SlbZIP family, a total of 40 SlbZIP genes are located in the segmental duplicate regions in the tomato genome, suggesting that the segment chromosomal duplications contribute greatly to the expansion of the tomato SlbZIP family. Expression profiling analyses of 59 SlbZIP genes using quantitative RT-PCR and publicly available microarray data indicate that the tomato SlbZIP genes have distinct and diverse expression patterns in different tissues and developmental stages and many of the tomato bZIP genes

  4. A combined genome-wide linkage and association approach to find susceptibility loci for platelet function phenotypes in European American and African American families with coronary artery disease.

    PubMed

    Mathias, Rasika A; Kim, Yoonhee; Sung, Heejong; Yanek, Lisa R; Mantese, V J; Hererra-Galeano, J Enrique; Ruczinski, Ingo; Wilson, Alexander F; Faraday, Nauder; Becker, Lewis C; Becker, Diane M

    2010-06-07

    The inability of aspirin (ASA) to adequately suppress platelet aggregation is associated with future risk of coronary artery disease (CAD). Heritability studies of agonist-induced platelet function phenotypes suggest that genetic variation may be responsible for ASA responsiveness. In this study, we leverage independent information from genome-wide linkage and association data to determine loci controlling platelet phenotypes before and after treatment with ASA. Clinical data on 37 agonist-induced platelet function phenotypes were evaluated before and after a 2-week trial of ASA (81 mg/day) in 1231 European American and 846 African American healthy subjects with a family history of premature CAD. Principal component analysis was performed to minimize the number of independent factors underlying the covariance of these various phenotypes. Multi-point sib-pair based linkage analysis was performed using a microsatellite marker set, and single-SNP association tests were performed using markers from the Illumina 1 M genotyping chip from deCODE Genetics, Inc. All analyses were performed separately within each ethnic group. Several genomic regions appear to be linked to ASA response factors: a 10 cM region in African Americans on chromosome 5q11.2 had several STRs with suggestive (p-value < 7 x 10-4) and significant (p-value < 2 x 10-5) linkage to post aspirin platelet response to ADP, and ten additional factors had suggestive evidence for linkage (p-value < 7 x 10-4) to thirteen genomic regions. All but one of these factors were aspirin response variables. While the strength of genome-wide SNP association signals for factors showing evidence for linkage is limited, especially at the strict thresholds of genome-wide criteria (N = 9 SNPs for 11 factors), more signals were considered significant when the association signal was weighted by evidence for linkage (N = 30 SNPs). Our study supports the hypothesis that platelet phenotypes in response to ASA likely have genetic

  5. A combined genome-wide linkage and association approach to find susceptibility loci for platelet function phenotypes in European American and African American families with coronary artery disease

    PubMed Central

    2010-01-01

    Background The inability of aspirin (ASA) to adequately suppress platelet aggregation is associated with future risk of coronary artery disease (CAD). Heritability studies of agonist-induced platelet function phenotypes suggest that genetic variation may be responsible for ASA responsiveness. In this study, we leverage independent information from genome-wide linkage and association data to determine loci controlling platelet phenotypes before and after treatment with ASA. Methods Clinical data on 37 agonist-induced platelet function phenotypes were evaluated before and after a 2-week trial of ASA (81 mg/day) in 1231 European American and 846 African American healthy subjects with a family history of premature CAD. Principal component analysis was performed to minimize the number of independent factors underlying the covariance of these various phenotypes. Multi-point sib-pair based linkage analysis was performed using a microsatellite marker set, and single-SNP association tests were performed using markers from the Illumina 1 M genotyping chip from deCODE Genetics, Inc. All analyses were performed separately within each ethnic group. Results Several genomic regions appear to be linked to ASA response factors: a 10 cM region in African Americans on chromosome 5q11.2 had several STRs with suggestive (p-value < 7 × 10-4) and significant (p-value < 2 × 10-5) linkage to post aspirin platelet response to ADP, and ten additional factors had suggestive evidence for linkage (p-value < 7 × 10-4) to thirteen genomic regions. All but one of these factors were aspirin response variables. While the strength of genome-wide SNP association signals for factors showing evidence for linkage is limited, especially at the strict thresholds of genome-wide criteria (N = 9 SNPs for 11 factors), more signals were considered significant when the association signal was weighted by evidence for linkage (N = 30 SNPs). Conclusions Our study supports the hypothesis that platelet phenotypes

  6. Genome-Wide Identification and Expression Analysis of the WRKY Gene Family in Cassava.

    PubMed

    Wei, Yunxie; Shi, Haitao; Xia, Zhiqiang; Tie, Weiwei; Ding, Zehong; Yan, Yan; Wang, Wenquan; Hu, Wei; Li, Kaimian

    2016-01-01

    The WRKY family, a large family of transcription factors (TFs) found in higher plants, plays central roles in many aspects of physiological processes and adaption to environment. However, little information is available regarding the WRKY family in cassava (Manihot esculenta). In the present study, 85 WRKY genes were identified from the cassava genome and classified into three groups according to conserved WRKY domains and zinc-finger structure. Conserved motif analysis showed that all of the identified MeWRKYs had the conserved WRKY domain. Gene structure analysis suggested that the number of introns in MeWRKY genes varied from 1 to 5, with the majority of MeWRKY genes containing three exons. Expression profiles of MeWRKY genes in different tissues and in response to drought stress were analyzed using the RNA-seq technique. The results showed that 72 MeWRKY genes had differential expression in their transcript abundance and 78 MeWRKY genes were differentially expressed in response to drought stresses in different accessions, indicating their contribution to plant developmental processes and drought stress resistance in cassava. Finally, the expression of 9 WRKY genes was analyzed by qRT-PCR under osmotic, salt, ABA, H2O2, and cold treatments, indicating that MeWRKYs may be involved in different signaling pathways. Taken together, this systematic analysis identifies some tissue-specific and abiotic stress-responsive candidate MeWRKY genes for further functional assays in planta, and provides a solid foundation for understanding of abiotic stress responses and signal transduction mediated by WRKYs in cassava.

  7. Genome-Wide Identification and Expression Analysis of the WRKY Gene Family in Cassava

    PubMed Central

    Wei, Yunxie; Shi, Haitao; Xia, Zhiqiang; Tie, Weiwei; Ding, Zehong; Yan, Yan; Wang, Wenquan; Hu, Wei; Li, Kaimian

    2016-01-01

    The WRKY family, a large family of transcription factors (TFs) found in higher plants, plays central roles in many aspects of physiological processes and adaption to environment. However, little information is available regarding the WRKY family in cassava (Manihot esculenta). In the present study, 85 WRKY genes were identified from the cassava genome and classified into three groups according to conserved WRKY domains and zinc-finger structure. Conserved motif analysis showed that all of the identified MeWRKYs had the conserved WRKY domain. Gene structure analysis suggested that the number of introns in MeWRKY genes varied from 1 to 5, with the majority of MeWRKY genes containing three exons. Expression profiles of MeWRKY genes in different tissues and in response to drought stress were analyzed using the RNA-seq technique. The results showed that 72 MeWRKY genes had differential expression in their transcript abundance and 78 MeWRKY genes were differentially expressed in response to drought stresses in different accessions, indicating their contribution to plant developmental processes and drought stress resistance in cassava. Finally, the expression of 9 WRKY genes was analyzed by qRT-PCR under osmotic, salt, ABA, H2O2, and cold treatments, indicating that MeWRKYs may be involved in different signaling pathways. Taken together, this systematic analysis identifies some tissue-specific and abiotic stress-responsive candidate MeWRKY genes for further functional assays in planta, and provides a solid foundation for understanding of abiotic stress responses and signal transduction mediated by WRKYs in cassava. PMID:26904033

  8. Genome-wide identification, evolution of chromobox family genes and their expression in Nile tilapia.

    PubMed

    Liu, Xing-Yong; Zhang, Xian-Bo; Li, Ming-Hui; Zheng, Shu-Qing; Liu, Zhi-Long; Cheng, Yun-Ying; Wang, De-Shou

    2017-01-01

    Chromobox (Cbx) family proteins are transcriptional repressors that involved in epigenetic and developmental processes. In this study, comprehensive analyses of Cbxs were performed using available genome databases from representative animal species. The Cbx family were originated from one Polycomb (Pc) gene like the yeast Pc, which duplicated into two and gave rise to the Pc and the Heterochromatin protein 1 (Hp1) identified in invertebrates from protozoon to lancelet. Rapid expansion of Cbx family members was observed in vertebrates as ~8 (5 Pc and 3 Hp1) were identified in spotted gar, coelacanth and tetrapods. Further expansion of the members to ~14 (9 Pc and 5 Hp1) was observed in teleosts due to the third round genome duplication (3R). Based on transcriptome data from eight adult tilapia tissues, most of the Cbxs were found to be dominantly expressed in the brain, testis, ovary and heart. Analyses of the gonadal transcriptome data from four developmental stages revealed that all Cbxs were expressed in both ovary and testis except Cbx7b, with significant increase of the total and average RPKM from 5 to 90dah (days after hatching). By in situ hybridization, the three most highly and sexual dimorphically expressed Cbx genes in gonads, Cbx1b, Cbx3a and Cbx5, were found to be expressed in phase I and II oocytes of the ovary, and in secondary spermatocytes (Cbx1b and Cbx3a) and spermatids (Cbx5) of the testis. Our results revealed the evolution of Cbx genes and indicated a potential role of Cbxs in epigenetic regulation of gametogenesis.

  9. [Genome-wide identification and expression analysis of the WRKY gene family in peach].

    PubMed

    Yanbing, Gu; Zhirui, Ji; Fumei, Chi; Zhuang, Qiao; Chengnan, Xu; Junxiang, Zhang; Zongshan, Zhou; Qinglong, Dong

    2016-03-01

    The WRKY transcription factors are one of the largest families of transcriptional regulators and play diverse regulatory roles in biotic and abiotic stresses, plant growth and development processes. In this study, the WRKY DNA-binding domain (Pfam Database number: PF03106) downloaded from Pfam protein families database was exploited to identify WRKY genes from the peach (Prunus persica 'Lovell') genome using HMMER 3.0. The obtained amino acid sequences were analyzed with DNAMAN 5.0, WebLogo 3, MEGA 5.1, MapInspect and MEME bioinformatics softwares. Totally 61 peach WRKY genes were found in the peach genome. Our phylogenetic analysis revealed that peach WRKY genes were classified into three Groups: Ⅰ, Ⅱ and Ⅲ. The WRKY N-terminal and C-terminal domains of Group Ⅰ (group I-N and group I-C) were monophyletic. The Group Ⅱ was sub-divided into five distinct clades (groupⅡ-a, Ⅱ-b, Ⅱ-c, Ⅱ-d and Ⅱ-e). Our domain analysis indicated that the WRKY regions contained a highly conserved heptapeptide stretch WRKYGQK at its N-terminus followed by a zinc-finger motif. The chromosome mapping analysis showed that peach WRKY genes were distributed with different densities over 8 chromosomes. The intron-exon structure analysis revealed that structures of the WRKY gene were highly conserved in the peach. The conserved motif analysis showed that the conserved motifs 1, 2 and 3, which specify the WRKY domain, were observed in all peach WRKY proteins, motif 5 as the unknown domain was observed in group Ⅱ-d, two WRKY domains were assigned to GroupⅠ. SqRT-PCR and qRT-PCR results indicated that 16 PpWRKY genes were expressed in roots, stems, leaves, flowers and fruits at various expression levels. Our analysis thus identified the PpWRKY gene families, and future functional studies are needed to reveal its specific roles.

  10. Genome-wide analysis of the AP2/ERF family in Musa species reveals divergence and neofunctionalisation during evolution.

    PubMed

    Lakhwani, Deepika; Pandey, Ashutosh; Dhar, Yogeshwar Vikram; Bag, Sumit Kumar; Trivedi, Prabodh Kumar; Asif, Mehar Hasan

    2016-01-06

    AP2/ERF domain containing transcription factor super family is one of the important regulators in the plant kingdom. The involvement of AP2/ERF family members has been elucidated in various processes associated with plant growth, development as well as in response to hormones, biotic and abiotic stresses. In this study, we carried out genome-wide analysis to identify members of AP2/ERF family in Musa acuminata (A genome) and Musa balbisiana (B genome) and changes leading to neofunctionalisation of genes. Analysis identified 265 and 318 AP2/ERF encoding genes in M. acuminata and M. balbisiana respectively which were further classified into ERF, DREB, AP2, RAV and Soloist groups. Comparative analysis indicated that AP2/ERF family has undergone duplication, loss and divergence during evolution and speciation of the Musa A and B genomes. We identified nine genes which are up-regulated during fruit ripening and might be components of the regulatory machinery operating during ethylene-dependent ripening in banana. Tissue-specific expression analysis of the genes suggests that different regulatory mechanisms might be involved in peel and pulp ripening process through recruiting specific ERFs in these tissues. Analysis also suggests that MaRAV-6 and MaERF026 have structurally diverged from their M. balbisiana counterparts and have attained new functions during ripening.

  11. Genome-wide analysis and expression profiling of the GRF gene family in oilseed rape (Brassica napus L.).

    PubMed

    Ma, Jin-Qi; Jian, Hong-Ju; Yang, Bo; Lu, Kun; Zhang, Ao-Xiang; Liu, Pu; Li, Jia-Na

    2017-07-15

    Growth regulating-factors (GRFs) are plant-specific transcription factors that help regulate plant growth and development. Genome-wide identification and evolutionary analyses of GRF gene families have been performed in Arabidopsis thaliana, Zea mays, Oryza sativa, and Brassica rapa, but a comprehensive analysis of the GRF gene family in oilseed rape (Brassica napus) has not yet been reported. In the current study, we identified 35 members of the BnGRF family in B. napus. We analyzed the chromosomal distribution, phylogenetic relationships (Bayesian Inference and Neighbor Joining method), gene structures, and motifs of the BnGRF family members, as well as the cis-acting regulatory elements in their promoters. We also analyzed the expression patterns of 15 randomly selected BnGRF genes in various tissues and in plant varieties with different harvest indices and gibberellic acid (GA) responses. The expression levels of BnGRFs under GA treatment suggested the presence of possible negative feedback regulation. The evolutionary patterns and expression profiles of BnGRFs uncovered in this study increase our understanding of the important roles played by these genes in oilseed rape. Copyright © 2017. Published by Elsevier B.V.

  12. Genome-wide characterization of soybean P 1B -ATPases gene family provides functional implications in cadmium responses.

    PubMed

    Fang, Xiaolong; Wang, Lei; Deng, Xiaojuan; Wang, Peng; Ma, Qibin; Nian, Hai; Wang, Yingxiang; Yang, Cunyi

    2016-05-20

    The P1B-ATPase subfamily is an important group involved in transporting heavy metals and has been extensively studied in model plants, such as Arabidopsis thaliana and Oryza sativa. Emerging evidence indicates that one homolog in Glycine max is also involved in cadmium (Cd) stress, but the gene family has not been fully investigated in soybean. Here, we identified 20 heavy metal ATPase (HMA) family members in the soybean genome, presented as 10 paralogous pairs, which is significantly greater than the number in Arabidopsis or rice, and was likely caused by the latest whole genome duplication event in soybean. A phylogenetic analysis divided the 20 members into six groups, each having conserved or divergent gene structures and protein motif patterns. The integration of RNA-sequencing and qRT-PCR data from multiple tissues provided an overall expression pattern for the HMA family in soybean. Further comparisons of expression patterns and the single nucleotide polymorphism distribution between paralogous pairs suggested functional conservation and the divergence of HMA genes during soybean evolution. Finally, analyses of the HMAs expressed in response to Cd stress provided evidence on how plants manage Cd tolerance, at least in the two contrasting soybean genotypes examined. The genome-wide identification, chromosomal distribution, gene structures, and evolutionary and expression analyses of the 20 HMA genes in soybean provide an overall insight into their potential involvement in Cd responses. These results will facilitate further research on the HMA gene family, and their conserved and divergent biological functions in soybean.

  13. Genome-wide analysis of the AP2/ERF family in Musa species reveals divergence and neofunctionalisation during evolution

    PubMed Central

    Lakhwani, Deepika; Pandey, Ashutosh; Dhar, Yogeshwar Vikram; Bag, Sumit Kumar; Trivedi, Prabodh Kumar; Asif, Mehar Hasan

    2016-01-01

    AP2/ERF domain containing transcription factor super family is one of the important regulators in the plant kingdom. The involvement of AP2/ERF family members has been elucidated in various processes associated with plant growth, development as well as in response to hormones, biotic and abiotic stresses. In this study, we carried out genome-wide analysis to identify members of AP2/ERF family in Musa acuminata (A genome) and Musa balbisiana (B genome) and changes leading to neofunctionalisation of genes. Analysis identified 265 and 318 AP2/ERF encoding genes in M. acuminata and M. balbisiana respectively which were further classified into ERF, DREB, AP2, RAV and Soloist groups. Comparative analysis indicated that AP2/ERF family has undergone duplication, loss and divergence during evolution and speciation of the Musa A and B genomes. We identified nine genes which are up-regulated during fruit ripening and might be components of the regulatory machinery operating during ethylene-dependent ripening in banana. Tissue-specific expression analysis of the genes suggests that different regulatory mechanisms might be involved in peel and pulp ripening process through recruiting specific ERFs in these tissues. Analysis also suggests that MaRAV-6 and MaERF026 have structurally diverged from their M. balbisiana counterparts and have attained new functions during ripening. PMID:26733055

  14. Genome-wide analysis of the GRAS gene family in physic nut (Jatropha curcas L.).

    PubMed

    Wu, Z Y; Wu, P Z; Chen, Y P; Li, M R; Wu, G J; Jiang, H W

    2015-12-29

    GRAS proteins play vital roles in plant growth and development. Physic nut (Jatropha curcas L.) was found to have a total of 48 GRAS family members (JcGRAS), 15 more than those found in Arabidopsis. The JcGRAS genes were divided into 12 subfamilies or 15 ancient monophyletic lineages based on the phylogenetic analysis of GRAS proteins from both flowering and lower plants. The functions of GRAS genes in 9 subfamilies have been reported previously for several plants, while the genes in the remaining 3 subfamilies were of unknown function; we named the latter families U1 to U3. No member of U3 subfamily is present in Arabidopsis and Poaceae species according to public genome sequence data. In comparison with the number of GRAS genes in Arabidopsis, more were detected in physic nut, resulting from the retention of many ancient GRAS subfamilies and the formation of tandem repeats during evolution. No evidence of recent duplication among JcGRAS genes was observed in physic nut. Based on digital gene expression data, 21 of the 48 genes exhibited differential expression in four tissues analyzed. Two members of subfamily U3 were expressed only in buds and flowers, implying that they may play specific roles. Our results provide valuable resources for future studies on the functions of GRAS proteins in physic nut.

  15. Genome-wide analysis of the WRKY gene family in physic nut (Jatropha curcas L.).

    PubMed

    Xiong, Wangdan; Xu, Xueqin; Zhang, Lin; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Jiang, Huawu; Wu, Guojiang

    2013-07-25

    The WRKY proteins, which contain highly conserved WRKYGQK amino acid sequences and zinc-finger-like motifs, constitute a large family of transcription factors in plants. They participate in diverse physiological and developmental processes. WRKY genes have been identified and characterized in a number of plant species. We identified a total of 58 WRKY genes (JcWRKY) in the genome of the physic nut (Jatropha curcas L.). On the basis of their conserved WRKY domain sequences, all of the JcWRKY proteins could be assigned to one of the previously defined groups, I-III. Phylogenetic analysis of JcWRKY genes with Arabidopsis and rice WRKY genes, and separately with castor bean WRKY genes, revealed no evidence of recent gene duplication in JcWRKY gene family. Analysis of transcript abundance of JcWRKY gene products were tested in different tissues under normal growth condition. In addition, 47 WRKY genes responded to at least one abiotic stress (drought, salinity, phosphate starvation and nitrogen starvation) in individual tissues (leaf, root and/or shoot cortex). Our study provides a useful reference data set as the basis for cloning and functional analysis of physic nut WRKY genes.

  16. Basic leucine zipper family in barley: genome-wide characterization of members and expression analysis.

    PubMed

    Pourabed, Ehsan; Ghane Golmohamadi, Farzan; Soleymani Monfared, Peyman; Razavi, Seyed Morteza; Shobbar, Zahra-Sadat

    2015-01-01

    The basic leucine zipper (bZIP) family is one of the largest and most diverse transcription factors in eukaryotes participating in many essential plant processes. We identified 141 bZIP proteins encoded by 89 genes from the Hordeum vulgare genome. HvbZIPs were classified into 11 groups based on their DNA-binding motif. Amino acid sequence alignment of the HvbZIPs basic-hinge regions revealed some highly conserved residues within each group. The leucine zipper heptads were analyzed predicting their dimerization properties. 34 conserved motifs were identified outside the bZIP domain. Phylogenetic analysis indicated that major diversification within the bZIP family predated the monocot/dicot divergence, although intra-species duplication and parallel evolution seems to be occurred afterward. Localization of HvbZIPs on the barley chromosomes revealed that different groups have been distributed on seven chromosomes of barley. Six types of intron pattern were detected within the basic-hinge regions. Most of the detected cis-elements in the promoter and UTR sequences were involved in seed development or abiotic stress response. Microarray data analysis revealed differential expression pattern of HvbZIPs in response to ABA treatment, drought, and cold stresses and during barley grain development and germination. This information would be helpful for functional characterization of bZIP transcription factors in barley.

  17. Genome-wide investigation and transcriptome analysis of the WRKY gene family in Gossypium.

    PubMed

    Ding, Mingquan; Chen, Jiadong; Jiang, Yurong; Lin, Lifeng; Cao, YueFen; Wang, Minhua; Zhang, Yuting; Rong, Junkang; Ye, Wuwei

    2015-02-01

    WRKY transcription factors play important roles in various stress responses in diverse plant species. In cotton, this family has not been well studied, especially in relation to fiber development. Here, the genomes and transcriptomes of Gossypium raimondii and Gossypium arboreum were investigated to identify fiber development related WRKY genes. This represents the first comprehensive comparative study of WRKY transcription factors in both diploid A and D cotton species. In total, 112 G. raimondii and 109 G. arboreum WRKY genes were identified. No significant gene structure or domain alterations were detected between the two species, but many SNPs distributed unequally in exon and intron regions. Physical mapping revealed that the WRKY genes in G. arboreum were not located in the corresponding chromosomes of G. raimondii, suggesting great chromosome rearrangement in the diploid cotton genomes. The cotton WRKY genes, especially subgroups I and II, have expanded through multiple whole genome duplications and tandem duplications compared with other plant species. Sequence comparison showed many functionally divergent sites between WRKY subgroups, while the genes within each group are under strong purifying selection. Transcriptome analysis suggested that many WRKY genes participate in specific fiber development processes such as fiber initiation, elongation and maturation with different expression patterns between species. Complex WRKY gene expression such as differential Dt and At allelic gene expression in G. hirsutum and alternative splicing events were also observed in both diploid and tetraploid cottons during fiber development process. In conclusion, this study provides important information on the evolution and function of WRKY gene family in cotton species.

  18. A genome-wide search for genes predisposing to familial psoriasis by using a stratification approach.

    PubMed

    Samuelsson, L; Enlund, F; Torinsson, A; Yhr, M; Inerot, A; Enerbäck, C; Wahlström, J; Swanbeck, G; Martinsson, T

    1999-12-01

    We have performed a genome scan, using markers spaced by 10 cM, in the search for psoriasis-susceptibility loci. The family material of 134 affected sibling pairs was ascertained on the basis of a population genetic study in which 65% of the probands had two healthy parents. Genotyping results were analyzed for non-random excessive allele-sharing between sib pairs by using GENEHUNTER ver 1.1. A stratification approach was applied to increase the homogeneity of the material by means of an operational definition of joint complaints among affected individuals. Significant linkage to the human leukocyte antigen region on chromosome 6p in a cohort including 42 families without joint complaints (nonparametric linkage score of 2.83, P=0.002) strongly supported the validity of this operational definition as it replicated results from an earlier linkage report with similar stratification criteria. New candidate regions on chromosomes 3 and 15 were identified. The highest non-parametric linkage values in this study, 2.96 (P=0.0017) and 2.89 (P=0.0020), were reached on chromosome 15 in a subgroup with joint complaints and on chromosome 3 in a subgroup without joint complaints. In addition, confirmation of previously reported loci was established on chromosomes 4q, 6p, and 17q. This study indicates that distinct disease loci might be involved in psoriasis etiology for various phenotypes.

  19. Genome-Wide Identification, Evolution and Expression Analysis of mTERF Gene Family in Maize

    PubMed Central

    Zhao, Yanxin; Cai, Manjun; Zhang, Xiaobo; Li, Yurong; Zhang, Jianhua; Zhao, Hailiang; Kong, Fei; Zheng, Yonglian; Qiu, Fazhan

    2014-01-01

    Plant mitochondrial transcription termination factor (mTERF) genes comprise a large family with important roles in regulating organelle gene expression. In this study, a comprehensive database search yielded 31 potential mTERF genes in maize (Zea mays L.) and most of them were targeted to mitochondria or chloroplasts. Maize mTERF were divided into nine main groups based on phylogenetic analysis, and group IX represented the mitochondria and species-specific clade that diverged from other groups. Tandem and segmental duplication both contributed to the expansion of the mTERF gene family in the maize genome. Comprehensive expression analysis of these genes, using microarray data and RNA-seq data, revealed that these genes exhibit a variety of expression patterns. Environmental stimulus experiments revealed differential up or down-regulation expression of maize mTERF genes in seedlings exposed to light/dark, salts and plant hormones, respectively, suggesting various important roles of maize mTERF genes in light acclimation and stress-related responses. These results will be useful for elucidating the roles of mTERF genes in the growth, development and stress response of maize. PMID:24718683

  20. Genome-wide analysis of ETS-family DNA-binding in vitro and in vivo

    PubMed Central

    Wei, Gong-Hong; Badis, Gwenael; Berger, Michael F; Kivioja, Teemu; Palin, Kimmo; Enge, Martin; Bonke, Martin; Jolma, Arttu; Varjosalo, Markku; Gehrke, Andrew R; Yan, Jian; Talukder, Shaheynoor; Turunen, Mikko; Taipale, Mikko; Stunnenberg, Hendrik G; Ukkonen, Esko; Hughes, Timothy R; Bulyk, Martha L; Taipale, Jussi

    2010-01-01

    Members of the large ETS family of transcription factors (TFs) have highly similar DNA-binding domains (DBDs)—yet they have diverse functions and activities in physiology and oncogenesis. Some differences in DNA-binding preferences within this family have been described, but they have not been analysed systematically, and their contributions to targeting remain largely uncharacterized. We report here the DNA-binding profiles for all human and mouse ETS factors, which we generated using two different methods: a high-throughput microwell-based TF DNA-binding specificity assay, and protein-binding microarrays (PBMs). Both approaches reveal that the ETS-binding profiles cluster into four distinct classes, and that all ETS factors linked to cancer, ERG, ETV1, ETV4 and FLI1, fall into just one of these classes. We identify amino-acid residues that are critical for the differences in specificity between all the classes, and confirm the specificities in vivo using chromatin immunoprecipitation followed by sequencing (ChIP-seq) for a member of each class. The results indicate that even relatively small differences in in vitro binding specificity of a TF contribute to site selectivity in vivo. PMID:20517297

  1. Genome-wide analysis of WOX gene family in rice, sorghum, maize, Arabidopsis and poplar.

    PubMed

    Zhang, Xin; Zong, Jie; Liu, Jianhua; Yin, Jinyuan; Zhang, Dabing

    2010-11-01

    WUSCHEL-related homeobox (WOX) genes form a large gene family specifically expressed in plants. They are known to play important roles in regulating the development of plant tissues and organs by determining cell fate. Recent available whole genome sequences allow us to do more comprehensive phylogenetic analysis of the WOX genes in plants. In the present study, we identified 11 and 21 WOXs from sorghum (Sorghum bicolor) and maize (Zea mays), respectively. The 72 WOX genes from rice (Oryza sativa), sorghum, maize, Arabidopsis (Arabidopsis thaliana) and poplar (Populus trichocarpa) were grouped into three well supported clades with nine subgroups according to the amino acid sequences of their homodomains. Their phylogenetic relationship was also supported by the observation of the motifs outside the homodomain. We observed the variation of duplication events among the nine sub-groups between monocots and eudicots, for instance, more gene duplication events of WOXs within subgroup A for monocots, while, less for dicots in this subgroup. Furthermore, we observed the conserved intron/exon structural patterns of WOX genes in rice, sorghum and Arabidopsis. In addition, WUS (Wuschel)-box and EAR (the ERF-associated amphiphilic repression)-like motif were observed to be conserved among several WOX subgroups in these five plants. Comparative analysis of expression patterns of WOX genes in rice and Arabidopsis suggest that the WOX genes play conserved and various roles in plants. This work provides insights into the evolution of the WOX gene family and is useful for future research.

  2. Genome-wide identification, classification and expression profiling of nicotianamine synthase (NAS) gene family in maize

    PubMed Central

    2013-01-01

    Background Nicotianamine (NA), a ubiquitous molecule in plants, is an important metal ion chelator and the main precursor for phytosiderophores biosynthesis. Considerable progress has been achieved in cloning and characterizing the functions of nicotianamine synthase (NAS) in plants including barley, Arabidopsis and rice. Maize is not only an important cereal crop, but also a model plant for genetics and evolutionary study. The genome sequencing of maize was completed, and many gene families were identified. Although three NAS genes have been characterized in maize, there is still no systematic identification of maize NAS family by genomic mining. Results In this study, nine NAS genes in maize were identified and their expression patterns in different organs including developing seeds were determined. According to the evolutionary relationship and tissue specific expression profiles of ZmNAS genes, they can be subgrouped into two classes. Moreover, the expression patterns of ZmNAS genes in response to fluctuating metal status were analysed. The class I ZmNAS genes were induced under Fe deficiency and were suppressed under Fe excessive conditions, while the expression pattern of class II genes were opposite to class I. The complementary expression patterns of class I and class II ZmNAS genes confirmed the classification of this family. Furthermore, the histochemical localization of ZmNAS1;1/1;2 and ZmNAS3 were determined using in situ hybridization. It was revealed that ZmNAS1;1/1;2, representing the class I genes, mainly expressed in cortex and stele of roots with sufficient Fe, and its expression can expanded in epidermis, as well as shoot apices under Fe deficient conditions. On the contrary, ZmNAS3, one of the class II genes, was accumulated in axillary meristems, leaf primordia and mesophyll cells. These results suggest that the two classes of ZmNAS genes may be regulated on transcriptional level when responds to various demands for iron uptake, translocation

  3. Comparative analysis of genome-wide Mlo gene family in Cajanus cajan and Phaseolus vulgaris.

    PubMed

    Deshmukh, Reena; Singh, V K; Singh, B D

    2016-04-01

    The Mlo gene was discovered in barley because the mutant 'mlo' allele conferred broad-spectrum, non-race-specific resistance to powdery mildew caused by Blumeria graminis f. sp. hordei. The Mlo genes also play important roles in growth and development of plants, and in responses to biotic and abiotic stresses. The Mlo gene family has been characterized in several crop species, but only a single legume species, soybean (Glycine max L.), has been investigated so far. The present report describes in silico identification of 18 CcMlo and 20 PvMlo genes in the important legume crops Cajanus cajan (L.) Millsp. and Phaseolus vulgaris L., respectively. In silico analysis of gene organization, protein properties and conserved domains revealed that the C. cajan and P. vulgaris Mlo gene paralogs are more divergent from each other than from their orthologous pairs. The comparative phylogenetic analysis classified CcMlo and PvMlo genes into three major clades. A comparative analysis of CcMlo and PvMlo proteins with the G. max Mlo proteins indicated close association of one CcMlo, one PvMlo with two GmMlo genes, indicating that there was no further expansion of the Mlo gene family after the separation of these species. Thus, most of the diploid species of eudicots might be expected to contain 15-20 Mlo genes. The genes CcMlo12 and 14, and PvMlo11 and 12 are predicted to participate in powdery mildew resistance. If this prediction were verified, these genes could be targeted by TILLING or CRISPR to isolate powdery mildew resistant mutants.

  4. Genome-Wide Identification and Expression Analysis of the NAC Transcription Factor Family in Cassava.

    PubMed

    Hu, Wei; Wei, Yunxie; Xia, Zhiqiang; Yan, Yan; Hou, Xiaowan; Zou, Meiling; Lu, Cheng; Wang, Wenquan; Peng, Ming

    2015-01-01

    NAC [no apical meristem (NAM), Arabidopsis transcription activation factor [ATAF1/2] and cup-shaped cotyledon (CUC2)] proteins is one of the largest groups of plant specific transcription factors and plays a crucial role in plant growth, development, and adaption to the environment. Currently, no information is known about the NAC family in cassava. In this study, 96 NAC genes (MeNACs) were identified from the cassava genome. Phylogenetic analysis of the NACs from cassava and Arabidopsis showed that MeNAC proteins can be clustered into 16 subgroups. Gene structure analysis found that the number of introns of MeNAC genes varied from 0 to 5, with the majority of MeNAC genes containing two introns, indicating a small gene structure diversity of cassava NAC genes. Conserved motif analysis revealed that all of the identified MeNACs had the conserved NAC domain and/or NAM domain. Global expression analysis suggested that MeNAC genes exhibited different expression profiles in different tissues between wild subspecies and cultivated varieties, indicating their involvement in the functional diversity of different accessions. Transcriptome analysis demonstrated that MeNACs had a widely transcriptional response to drought stress and that they had differential expression profiles in different accessions, implying their contribution to drought stress resistance in cassava. Finally, the expression of twelve MeNAC genes was analyzed under osmotic, salt, cold, ABA, and H2O2 treatments, indicating that cassava NACs may represent convergence points of different signaling pathways. Taken together, this work found some excellent tissue-specific and abiotic stress-responsive candidate MeNAC genes, which would provide a solid foundation for functional investigation of the NAC family, crop improvement and improved understanding of signal transduction in plants. These data bring new insight on the complexity of the transcriptional control of MeNAC genes and support the hypothesis that

  5. Genome-wide identification and expression analysis of the metacaspase gene family in Hevea brasiliensis.

    PubMed

    Liu, Hui; Deng, Zhi; Chen, Jiangshu; Wang, Sen; Hao, Lili; Li, Dejun

    2016-08-01

    Metacaspases, a family of cysteine proteases, have been suggested to play important roles in programmed cell death (PCD) during plant development and stress responses. To date, no systematic characterization of this gene family has been reported in rubber tree (Hevea brasiliensis). In the present study, nine metacaspase genes, designated as HbMC1 to HbMC9, were identified from whole-genome sequence of rubber tree. Multiple sequence alignment and phylogenetic analyses suggested that these genes were divided into two types: type I (HbMC1-HBMC7) and type II (HbMC8 and HbMC9). Gene structure analysis demonstrated that type I and type II HbMCs separately contained four and two introns, indicating the conserved exon-intron organization of HbMCs. Quantitative real-time PCR analysis revealed that HbMCs showed distinct expression patterns in different tissues, suggesting the functional diversity of HbMCs in various tissues during development. Most of the HbMCs were regulated by drought, cold, and salt stress, implying their possible functions in regulating abiotic stress-induced cell death. Of the nine HbMCs, HbMC1, HbMC2, HbMC5, and HbMC8 displayed a significantly higher relative transcript accumulation in barks of tapping panel dryness (TPD) trees compared with healthy trees. In addition, the four genes were up-regulated by ethephon (ET) and methyl jasmonate (MeJA), indicating their potential involvement in TPD resulting from ET- or JA-induced PCD. In summary, this work provides valuable information for further functional characterization of HbMC genes in rubber tree.

  6. Genome-wide identification and characterization of WRKY gene family in Hevea brasiliensis.

    PubMed

    Li, Hui-Liang; Guo, Dong; Yang, Zi-Ping; Tang, Xiao; Peng, Shi-Qing

    2014-07-01

    WRKY proteins constitute a large family of transcription factors. In this study, we identified 81 WRKY genes (named HbWRKY1 to HbWRKY81) in the latest rubber tree genome. Tissue-specific expression profiles showed that 74 HbWRKYs were expressed in at least one of the tissues and the other 7 genes showed very low expression in all tissues tested, which suggested that HbWRKYs took part in many cellular processes. The responses of 20 selected HbWRKYs to jasmonic acid (JA) and ethylene (ET) were analyzed in the latex. 17 HbWRKYs responded to at least one treatment, which included 15 HbWRKYs responding to JA treatment, 15 HbWRKYs to ET, which suggested that these HbWRKYs were regulated by JA and ET. We also observed that HbWRKY3, 14, and 55 bind HbSRPP promoter and activate the transcription in yeast. This study suggests that HbWRKY proteins maybe involved in the transcriptional regulation of nature rubber biosynthesis. Copyright © 2014 Elsevier Inc. All rights reserved.

  7. Genome-wide identification of the potato WRKY transcription factor family.

    PubMed

    Zhang, Chao; Wang, Dongdong; Yang, Chenghui; Kong, Nana; Shi, Zheng; Zhao, Peng; Nan, Yunyou; Nie, Tengkun; Wang, Ruoqiu; Ma, Haoli; Chen, Qin

    2017-01-01

    WRKY transcription factors play pivotal roles in regulation of stress responses. This study identified 79 WRKY genes in potato (Solanum tuberosum). Based on multiple sequence alignment and phylogenetic relationships, WRKY genes were classified into three major groups. The majority of WRKY genes belonged to Group II (52 StWRKYs), Group III had 14 and Group I consisted of 13. The phylogenetic tree further classified Group II into five sub-groups. All StWRKY genes except StWRKY79 were mapped on potato chromosomes, with eight tandem duplication gene pairs and seven segmental duplication gene pairs found from StWRKY family genes. The expression analysis of 22 StWRKYs showed their differential expression levels under various stress conditions. Cis-element prediction showed that a large number of elements related to drought, heat and salicylic acid were present in the promotor regions of StWRKY genes. The expression analysis indicated that seven StWRKYs seemed to respond to stress (heat, drought and salinity) and salicylic acid treatment. These genes are candidates for abiotic stress signaling for further research.

  8. Genome wide in silico characterization of Dof gene families of pigeonpea (Cajanus cajan (L) Millsp.).

    PubMed

    Malviya, N; Gupta, S; Singh, V K; Yadav, M K; Bisht, N C; Sarangi, B K; Yadav, D

    2015-02-01

    The DNA binding with One Finger (Dof) protein is a plant specific transcription factor involved in the regulation of wide range of processes. The analysis of whole genome sequence of pigeonpea has identified 38 putative Dof genes (CcDof) distributed on 8 chromosomes. A total of 17 out of 38 CcDof genes were found to be intronless. A comprehensive in silico characterization of CcDof gene family including the gene structure, chromosome location, protein motif, phylogeny, gene duplication and functional divergence has been attempted. The phylogenetic analysis resulted in 3 major clusters with closely related members in phylogenetic tree revealed common motif distribution. The in silico cis-regulatory element analysis revealed functional diversity with predominance of light responsive and stress responsive elements indicating the possibility of these CcDof genes to be associated with photoperiodic control and biotic and abiotic stress. The duplication pattern showed that tandem duplication is predominant over segmental duplication events. The comparative phylogenetic analysis of these Dof proteins along with 78 soybean, 36 Arabidopsis and 30 rice Dof proteins revealed 7 major clusters. Several groups of orthologs and paralogs were identified based on phylogenetic tree constructed. Our study provides useful information for functional characterization of CcDof genes.

  9. Genome-wide identification and transcriptional profiling analysis of auxin response-related gene families in cucumber

    PubMed Central

    2014-01-01

    Background Auxin signaling has a vital function in the regulation of plant growth and development, both which are known to be mediated by auxin-responsive genes. So far, significant progress has been made toward the identification and characterization of auxin-response genes in several model plants, while no systematic analysis for these families was reported in cucumber (Cucumis sativus L.), a reference species for Cucurbitaceae crops. The comprehensive analyses will help design experiments for functional validation of their precise roles in plant development and stress responses. Results A genome-wide search for auxin-response gene homologues identified 16 auxin-response factors (ARFs), 27 auxin/indole acetic acids (Aux/IAAs), 10 Gretchen Hagen 3 (GH3s), 61 small auxin-up mRNAs (SAURs), and 39 lateral organ boundaries (LBDs) in cucumber. Sequence analysis together with the organization of putative motifs indicated the potential diverse functions of these five auxin-related family members. The distribution and density of auxin response-related genes on chromosomes were not uniform. Evolutionary analysis showed that the chromosomal segment duplications mainly contributed to the expansion of the CsARF, CsIAA, CsGH3, and CsLBD gene families. Quantitative real-time RT-PCR analysis demonstrated that many ARFs, AUX/IAAs, GH3s, SAURs, and LBD genes were expressed in diverse patterns within different organs/tissues and during different development stages. They were also implicated in IAA, methyl jasmonic acid, or salicylic acid response, which is consistent with the finding that a great number of diverse cis-elements are present in their promoter regions involving a variety of signaling transduction pathways. Conclusion Genome-wide comparative analysis of auxin response-related family genes and their expression analysis provide new evidence for the potential role of auxin in development and hormone response of plants. Our data imply that the auxin response genes may be

  10. Genome-Wide Identification of the Target Genes of AP2-O, a Plasmodium AP2-Family Transcription Factor

    PubMed Central

    Kaneko, Izumi; Iwanaga, Shiroh; Kato, Tomomi; Kobayashi, Issei; Yuda, Masao

    2015-01-01

    Stage-specific transcription is a fundamental biological process in the life cycle of the Plasmodium parasite. Proteins containing the AP2 DNA-binding domain are responsible for stage-specific transcriptional regulation and belong to the only known family of transcription factors in Plasmodium parasites. Comprehensive identification of their target genes will advance our understanding of the molecular basis of stage-specific transcriptional regulation and stage-specific parasite development. AP2-O is an AP2 family transcription factor that is expressed in the mosquito midgut-invading stage, called the ookinete, and is essential for normal morphogenesis of this stage. In this study, we identified the genome-wide target genes of AP2-O by chromatin immunoprecipitation-sequencing and elucidate how this AP2 family transcription factor contributes to the formation of this motile stage. The analysis revealed that AP2-O binds specifically to the upstream genomic regions of more than 500 genes, suggesting that approximately 10% of the parasite genome is directly regulated by AP2-O. These genes are involved in distinct biological processes such as morphogenesis, locomotion, midgut penetration, protection against mosquito immunity and preparation for subsequent oocyst development. This direct and global regulation by AP2-O provides a model for gene regulation in Plasmodium parasites and may explain how these parasites manage to control their complex life cycle using a small number of sequence-specific AP2 transcription factors. PMID:26018192

  11. Genome-wide identification, classification, and expression analysis of CDPK and its closely related gene families in poplar (Populus trichocarpa).

    PubMed

    Zuo, Ran; Hu, Ruibo; Chai, Guohua; Xu, Meiling; Qi, Guang; Kong, Yingzhen; Zhou, Gongke

    2013-03-01

    Calcium-dependent protein kinases (CDPKs) are Ca(2+)-binding proteins known to play crucial roles in Ca(2+) signal transduction pathways which have been identified throughout plant kingdom and in certain types of protists. Genome-wide analysis of CDPKs have been carried out in Arabidopsis, rice and wheat, and quite a few of CDPKs were proved to play crucial roles in plant stress responsive signature pathways. In this study, a comprehensive analysis of Populus CDPK and its closely related gene families was performed, including phylogeny, chromosome locations, gene structures, and expression profiles. Thirty Populus CDPK genes and twenty closely related kinase genes were identified, which were phylogenetically clustered into eight distinct subfamilies and predominately distributed across fifteen linkage groups (LG). Genomic organization analyses indicated that purifying selection has played a pivotal role in the retention and maintenance of Populus CDPK gene family. Furthermore, microarray analysis showed that a number of Populus CDPK and its closely related genes differentially expressed across disparate tissues and under various stresses. The expression profiles of paralogous pairs were also investigated to reveal their evolution fates. In addition, quantitative real-time RT-PCR was performed on nine selected CDPK genes to confirm their responses to drought stress treatment. These observations may lay the foundation for future functional analysis of Populus CDPK and its closely related gene families to unravel their biological roles.

  12. Genome-Wide Characterization and Expression Analysis of the Germin-Like Protein Family in Rice and Arabidopsis

    PubMed Central

    Li, Lu; Xu, Xihui; Chen, Chen; Shen, Zhenguo

    2016-01-01

    Previous studies have shown that germin-like proteins (GLPs) are present ubiquitously in rice and Arabidopsis. However, the understanding regarding their role in development and abiotic/biotic stress resistance remains limited. In the present study, we report genome-wide identification, characterisation, subcellular localization, enzyme activity, and expression analysis of the GLP gene family in rice and Arabidopsis to study their functions. In total, 43 and 32 GLPs in the rice and Arabidopsis genome were identified based on a systematic analysis, respectively. The GLP genes were clustered into six clades based on phylogenetic analysis, and many stress and developmental-related cis-elements were detected in promoters of GLP genes. In addition, subcellular location and superoxide dismutase (SOD) analysis demonstrated that the random selected OsGLP genes on chromosomes 8 and 4 of rice were expressed in the cell wall with SOD activity. Overall, our results showed that tandem duplication events, especially the clusters of tandem duplication genes on chromosome 8 in rice, play a major role in expansion of the GLP family and thus increase our understanding of the role of the GLP family in abiotic/biotic stress and development. PMID:27669230

  13. Genome-wide analysis of the SBP-box gene family in Chinese cabbage (Brassica rapa subsp. pekinensis).

    PubMed

    Tan, Hua-Wei; Song, Xiao-Ming; Duan, Wei-Ke; Wang, Yan; Hou, Xi-Lin

    2015-11-01

    The SQUAMOSA PROMOTER BINDING PROTEIN (SBP)-box gene family contains highly conserved plant-specific transcription factors that play an important role in plant development, especially in flowering. Chinese cabbage (Brassica rapa subsp. pekinensis) is a leafy vegetable grown worldwide and is used as a model crop for research in genome duplication. The present study aimed to characterize the SBP-box transcription factor genes in Chinese cabbage. Twenty-nine SBP-box genes were identified in the Chinese cabbage genome and classified into six groups. We identified 23 orthologous and 5 co-orthologous SBP-box gene pairs between Chinese cabbage and Arabidopsis. An interaction network among these genes was constructed. Sixteen SBP-box genes were expressed more abundantly in flowers than in other tissues, suggesting their involvement in flowering. We show that the MiR156/157 family members may regulate the coding regions or 3'-UTR regions of Chinese cabbage SBP-box genes. As SBP-box genes were found to potentially participate in some plant development pathways, quantitative real-time PCR analysis was performed and showed that Chinese cabbage SBP-box genes were also sensitive to the exogenous hormones methyl jasmonic acid and salicylic acid. The SBP-box genes have undergone gene duplication and loss, evolving a more refined regulation for diverse stimulation in plant tissues. Our comprehensive genome-wide analysis provides insights into the SBP-box gene family of Chinese cabbage.

  14. Genome-wide analysis of the MYB gene family in physic nut (Jatropha curcas L.).

    PubMed

    Zhou, Changpin; Chen, Yanbo; Wu, Zhenying; Lu, Wenjia; Han, Jinli; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Jiang, Huawu; Wu, Guojiang

    2015-11-01

    The MYB proteins comprise one of the largest transcription factor families in plants, and play key roles in regulatory networks controlling development, metabolism, and stress responses. A total of 125 MYB genes (JcMYB) have been identified in the physic nut (Jatropha curcas L.) genome, including 120 2R-type MYB, 4 3R-MYB, and 1 4R-MYB genes. Based on exon-intron arrangement of MYBs from both lower (Physcomitrella patens) and higher (physic nut, Arabidopsis, and rice) plants, we can classify plant MYB genes into ten groups (MI-X), except for MIX genes which are nonexistent in higher plants. We also observed that MVIII genes may be one of the most ancient MYB types which consist of both R2R3- and 3R-MYB genes. Most MYB genes (76.8% in physic nut) belong to the MI group which can be divided into 34 subgroups. The JcMYB genes were nonrandomly distributed on its 11 linkage groups (LGs). The expansion of MYB genes across several subgroups was observed and resulted from genome triplication of ancient dicotyledons and from both ancient and recent tandem duplication events in the physic nut genome. The expression patterns of several MYB duplicates in the physic nut showed differences in four tissues (root, stem, leaf, and seed), and 34 MYB genes responded to at least one abiotic stressor (drought, salinity, phosphate starvation, and nitrogen starvation) in leaves and/or roots based on the data analysis of digital gene expression tags. Overexpression of the JcMYB001 gene in Arabidopsis increased its sensitivity to drought and salinity stresses.

  15. Genome-Wide Comparative Analysis of Chemosensory Gene Families in Five Tsetse Fly Species

    PubMed Central

    Macharia, Rosaline; Mireji, Paul; Murungi, Edwin; Murilla, Grace; Christoffels, Alan; Aksoy, Serap; Masiga, Daniel

    2016-01-01

    For decades, odour-baited traps have been used for control of tsetse flies (Diptera; Glossinidae), vectors of African trypanosomes. However, differential responses to known attractants have been reported in different Glossina species, hindering establishment of a universal vector control tool. Availability of full genome sequences of five Glossina species offers an opportunity to compare their chemosensory repertoire and enhance our understanding of their biology in relation to chemosensation. Here, we identified and annotated the major chemosensory gene families in Glossina. We identified a total of 118, 115, 124, and 123 chemosensory genes in Glossina austeni, G. brevipalpis, G. f. fuscipes, G. pallidipes, respectively, relative to 127 reported in G. m. morsitans. Our results show that tsetse fly genomes have fewer chemosensory genes when compared to other dipterans such as Musca domestica (n>393), Drosophila melanogaster (n = 246) and Anopheles gambiae (n>247). We also found that Glossina chemosensory genes are dispersed across distantly located scaffolds in their respective genomes, in contrast to other insects like D. melanogaster whose genes occur in clusters. Further, Glossina appears to be devoid of sugar receptors and to have expanded CO2 associated receptors, potentially reflecting Glossina's obligate hematophagy and the need to detect hosts that may be out of sight. We also identified, in all species, homologs of Ir84a; a Drosophila-specific ionotropic receptor that promotes male courtship suggesting that this is a conserved trait in tsetse flies. Notably, our selection analysis revealed that a total of four gene loci (Gr21a, GluRIIA, Gr28b, and Obp83a) were under positive selection, which confers fitness advantage to species. These findings provide a platform for studies to further define the language of communication of tsetse with their environment, and influence development of novel approaches for control. PMID:26886411

  16. LRR-RLK family from two Citrus species: genome-wide identification and evolutionary aspects.

    PubMed

    Magalhães, Diogo M; Scholte, Larissa L S; Silva, Nicholas V; Oliveira, Guilherme C; Zipfel, Cyril; Takita, Marco A; De Souza, Alessandra A

    2016-08-12

    Leucine-rich repeat receptor-like kinases (LRR-RLKs) represent the largest subfamily of plant RLKs. The functions of most LRR-RLKs have remained undiscovered, and a few that have been experimentally characterized have been shown to have important roles in growth and development as well as in defense responses. Although RLK subfamilies have been previously studied in many plants, no comprehensive study has been performed on this gene family in Citrus species, which have high economic importance and are frequent targets for emerging pathogens. In this study, we performed in silico analysis to identify and classify LRR-RLK homologues in the predicted proteomes of Citrus clementina (clementine) and Citrus sinensis (sweet orange). In addition, we used large-scale phylogenetic approaches to elucidate the evolutionary relationships of the LRR-RLKs and further narrowed the analysis to the LRR-XII group, which contains several previously described cell surface immune receptors. We built integrative protein signature databases for Citrus clementina and Citrus sinensis using all predicted protein sequences obtained from whole genomes. A total of 300 and 297 proteins were identified as LRR-RLKs in C. clementina and C. sinensis, respectively. Maximum-likelihood phylogenetic trees were estimated using Arabidopsis LRR-RLK as a template and they allowed us to classify Citrus LRR-RLKs into 16 groups. The LRR-XII group showed a remarkable expansion, containing approximately 150 paralogs encoded in each Citrus genome. Phylogenetic analysis also demonstrated the existence of two distinct LRR-XII clades, each one constituted mainly by RD and non-RD kinases. We identified 68 orthologous pairs from the C. clementina and C. sinensis LRR-XII genes. In addition, among the paralogs, we identified a subset of 78 and 62 clustered genes probably derived from tandem duplication events in the genomes of C. clementina and C. sinensis, respectively. This work provided the first comprehensive

  17. Genome-wide analysis and expression profile of the bZIP transcription factor gene family in grapevine (Vitis vinifera)

    PubMed Central

    2014-01-01

    Background Basic leucine zipper (bZIP) transcription factor gene family is one of the largest and most diverse families in plants. Current studies have shown that the bZIP proteins regulate numerous growth and developmental processes and biotic and abiotic stress responses. Nonetheless, knowledge concerning the specific expression patterns and evolutionary history of plant bZIP family members remains very limited. Results We identified 55 bZIP transcription factor-encoding genes in the grapevine (Vitis vinifera) genome, and divided them into 10 groups according to the phylogenetic relationship with those in Arabidopsis. The chromosome distribution and the collinearity analyses suggest that expansion of the grapevine bZIP (VvbZIP) transcription factor family was greatly contributed by the segment/chromosomal duplications, which may be associated with the grapevine genome fusion events. Nine intron/exon structural patterns within the bZIP domain and the additional conserved motifs were identified among all VvbZIP proteins, and showed a high group-specificity. The predicted specificities on DNA-binding domains indicated that some highly conserved amino acid residues exist across each major group in the tree of land plant life. The expression patterns of VvbZIP genes across the grapevine gene expression atlas, based on microarray technology, suggest that VvbZIP genes are involved in grapevine organ development, especially seed development. Expression analysis based on qRT-PCR indicated that VvbZIP genes are extensively involved in drought- and heat-responses, with possibly different mechanisms. Conclusions The genome-wide identification, chromosome organization, gene structures, evolutionary and expression analyses of grapevine bZIP genes provide an overall insight of this gene family and their potential involvement in growth, development and stress responses. This will facilitate further research on the bZIP gene family regarding their evolutionary history and

  18. Genome-wide search for breast cancer linkage in large Icelandic non-BRCA1/2 families

    PubMed Central

    2010-01-01

    Introduction: A significant proportion of high-risk breast cancer families are not explained by mutations in known genes. Recent genome-wide searches (GWS) have not revealed any single major locus reminiscent of BRCA1 and BRCA2, indicating that still unidentified genes may explain relatively few families each or interact in a way obscure to linkage analyses. This has drawn attention to possible benefits of studying populations where genetic heterogeneity might be reduced. We thus performed a GWS for linkage on nine Icelandic multiple-case non-BRCA1/2 families of desirable size for mapping highly penetrant loci. To follow up suggestive loci, an additional 13 families from other Nordic countries were genotyped for selected markers. Methods: GWS was performed using 811 microsatellite markers providing about five centiMorgan (cM) resolution. Multipoint logarithm of odds (LOD) scores were calculated using parametric and nonparametric methods. For selected markers and cases, tumour tissue was compared to normal tissue to look for allelic loss indicative of a tumour suppressor gene. Results: The three highest signals were located at chromosomes 6q, 2p and 14q. One family contributed suggestive LOD scores (LOD 2.63 to 3.03, dominant model) at all these regions, without consistent evidence of a tumour suppressor gene. Haplotypes in nine affected family members mapped the loci to 2p23.2 to p21, 6q14.2 to q23.2 and 14q21.3 to q24.3. No evidence of a highly penetrant locus was found among the remaining families. The heterogeneity LOD (HLOD) at the 6q, 2p and 14q loci in all families was 3.27, 1.66 and 1.24, respectively. The subset of 13 Nordic families showed supportive HLODs at chromosome 6q (ranging from 0.34 to 1.37 by country subset). The 2p and 14q loci overlap with regions indicated by large families in previous GWS studies of breast cancer. Conclusions: Chromosomes 2p, 6q and 14q are candidate sites for genes contributing together to high breast cancer risk. A

  19. Genome-Wide Survey and Expression Analysis of Amino Acid Transporter Gene Family in Rice (Oryza sativa L.)

    PubMed Central

    Zhao, Heming; Ma, Haoli; Yu, Li; Wang, Xin; Zhao, Jie

    2012-01-01

    Background Amino acid transporters (AATs) that transport amino acids across cellular membranes are essential for plant growth and development. To date, a genome-wide overview of the AAT gene family in rice is not yet available. Methodology/Principal Findings In this study, a total of 85 AAT genes were identified in rice genome and were classified into eleven distinct subfamilies based upon their sequence composition and phylogenetic relationship. A large number of OsAAT genes were expanded via gene duplication, 23 and 24 OsAAT genes were tandemly and segmentally duplicated, respectively. Comprehensive analyses were performed to investigate the expression profiles of OsAAT genes in various stages of vegetative and reproductive development by using data from EST, Microarrays, MPSS and Real-time PCR. Many OsAAT genes exhibited abundant and tissue-specific expression patterns. Moreover, 21 OsAAT genes were found to be differentially expressed under the treatments of abiotic stresses. Comparative analysis indicates that 26 AAT genes with close evolutionary relationships between rice and Arabidopsis exhibited similar expression patterns. Conclusions/Significance This study will facilitate further studies on OsAAT family and provide useful clues for functional validation of OsAATs. PMID:23166615

  20. Genome-Wide Analysis, Classification, Evolution, and Expression Analysis of the Cytochrome P450 93 Family in Land Plants

    PubMed Central

    Du, Hai; Ran, Feng; Dong, Hong-Li; Wen, Jing; Li, Jia-Na; Liang, Zhe

    2016-01-01

    Cytochrome P450 93 family (CYP93) belonging to the cytochrome P450 superfamily plays important roles in diverse plant processes. However, no previous studies have investigated the evolution and expression of the members of this family. In this study, we performed comprehensive genome-wide analysis to identify CYP93 genes in 60 green plants. In all, 214 CYP93 proteins were identified; they were specifically found in flowering plants and could be classified into ten subfamilies—CYP93A–K, with the last two being identified first. CYP93A is the ancestor that was derived in flowering plants, and the remaining showed lineage-specific distribution—CYP93B and CYP93C are present in dicots; CYP93F is distributed only in Poaceae; CYP93G and CYP93J are monocot-specific; CYP93E is unique to legumes; CYP93H and CYP93K are only found in Aquilegia coerulea, and CYP93D is Brassicaceae-specific. Each subfamily generally has conserved gene numbers, structures, and characteristics, indicating functional conservation during evolution. Synonymous nucleotide substitution (dN/dS) analysis showed that CYP93 genes are under strong negative selection. Comparative expression analyses of CYP93 genes in dicots and monocots revealed that they are preferentially expressed in the roots and tend to be induced by biotic and/or abiotic stresses, in accordance with their well-known functions in plant secondary biosynthesis. PMID:27760179

  1. Genetic epidemiology and genome-wide linkage analysis of carotid artery ultrasound traits in multigenerational African ancestry families.

    PubMed

    Kuipers, Allison L; Kammerer, Candace M; Miljkovic, Iva; Woodard, Genevieve A; Bunker, Clareann H; Patrick, Alan L; Wheeler, Victor W; Newman, Anne B; Zmuda, Joseph M

    2013-11-01

    Intima-media thickness, adventitial diameter and lumen diameter are indicators of cardiovascular disease risk. The influence of genetic factors on these measures in African ancestry populations is not well defined. Therefore, we estimated heritability and performed genome-wide linkage analysis of carotid ultrasound traits in 7 multigenerational families of African ancestry. A total of 395 individuals (7 pedigrees; mean family size = 56; 2392 relative pairs) aged ≥18 years had a common carotid artery ultrasound scan. Statistical analyses were conducted using pedigree-based maximum likelihood methods. Significant covariates included age, sex, body mass index or height and waist, and systolic blood pressure. Residual heritabilities ranged from 0.35 ± 0.10 to 0.64 ± 0.12 (P < 0.0001). We identified a novel quantitative trait locus for adventitial and lumen diameters on chromosome 11 (max LOD = 4.09, 133 cm). Further fine mapping of this region may identify specific mutations predisposing to subclinical vascular disease among African ancestry individuals. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

  2. A genome-wide scan for type 1 diabetes susceptibility genes in nuclear families with multiple affected siblings in Finland.

    PubMed

    Qiao, Qing; Osterholm, Anne-May; He, Bing; Pitkäniemi, Janne; Cordell, Heather J; Sarti, Cinzia; Kinnunen, Leena; Tuomilehto-Wolf, Eva; Tryggvason, Karl; Tuomilehto, Jaakko

    2007-12-19

    A genome-wide search for genes that predispose to type 1 diabetes using linkage analysis was performed using 900 microsatellite markers in 70 nuclear families with affected siblings from Finland, a population expected to be more genetically homogeneous than others, and having the highest incidence of type 1 diabetes in the world and, yet, the highest proportion in Europe of cases (10%) carrying neither of the highest risk HLA haplotypes that include DR3 or DR4 alleles. In addition to the evidence of linkage to the HLA region on 6p21 (nominal p = 4.0 x 10-6), significant evidence of linkage in other chromosome regions was not detected with a single-locus analysis. The two-locus analysis conditional on the HLA gave a maximum lod score (MLS) of 3.1 (nominal p = 2 x 10-4) on chromosome 9p13 under an additive model; MLS of 2.1 (nominal p = 6.1 x 10-3) on chromosome 17p12 and MLS of 2.5 (nominal p = 2.9 x 10-3) on chromosome 18p11 under a general model. Our genome scan data confirmed the primary contribution of the HLA genes also in the high-risk Finnish population, and suggest that non-HLA genes also contribute to the familial clustering of type 1 diabetes in Finland.

  3. Genome-Wide Analysis, Classification, Evolution, and Expression Analysis of the Cytochrome P450 93 Family in Land Plants.

    PubMed

    Du, Hai; Ran, Feng; Dong, Hong-Li; Wen, Jing; Li, Jia-Na; Liang, Zhe

    2016-01-01

    Cytochrome P450 93 family (CYP93) belonging to the cytochrome P450 superfamily plays important roles in diverse plant processes. However, no previous studies have investigated the evolution and expression of the members of this family. In this study, we performed comprehensive genome-wide analysis to identify CYP93 genes in 60 green plants. In all, 214 CYP93 proteins were identified; they were specifically found in flowering plants and could be classified into ten subfamilies-CYP93A-K, with the last two being identified first. CYP93A is the ancestor that was derived in flowering plants, and the remaining showed lineage-specific distribution-CYP93B and CYP93C are present in dicots; CYP93F is distributed only in Poaceae; CYP93G and CYP93J are monocot-specific; CYP93E is unique to legumes; CYP93H and CYP93K are only found in Aquilegia coerulea, and CYP93D is Brassicaceae-specific. Each subfamily generally has conserved gene numbers, structures, and characteristics, indicating functional conservation during evolution. Synonymous nucleotide substitution (dN/dS) analysis showed that CYP93 genes are under strong negative selection. Comparative expression analyses of CYP93 genes in dicots and monocots revealed that they are preferentially expressed in the roots and tend to be induced by biotic and/or abiotic stresses, in accordance with their well-known functions in plant secondary biosynthesis.

  4. Genome-wide Identification of TCP Family Transcription Factors from Populus euphratica and Their Involvement in Leaf Shape Regulation.

    PubMed

    Ma, Xiaodong; Ma, Jianchao; Fan, Di; Li, Chaofeng; Jiang, Yuanzhong; Luo, Keming

    2016-09-08

    Higher plants have been shown to experience a juvenile vegetative phase, an adult vegetative phase, and a reproductive phase during its postembryonic development and distinct lateral organ morphologies have been observed at the different development stages. Populus euphratica, commonly known as a desert poplar, has developed heteromorphic leaves during its development. The TCP family genes encode a group of plant-specific transcription factors involved in several aspects of plant development. In particular, TCPs have been shown to influence leaf size and shape in many herbaceous plants. However, whether these functions are conserved in woody plants remains unknown. In the present study, we carried out genome-wide identification of TCP genes in P. euphratica and P. trichocarpa, and 33 and 36 genes encoding putative TCP proteins were found, respectively. Phylogenetic analysis of the poplar TCPs together with Arabidopsis TCPs indicated a biased expansion of the TCP gene family via segmental duplications. In addition, our results have also shown a correlation between different expression patterns of several P. euphratica TCP genes and leaf shape variations, indicating their involvement in the regulation of leaf shape development.

  5. Genome-wide identification, classification and expression analysis of the PHD-finger protein family in Populus trichocarpa.

    PubMed

    Wu, Shengnan; Wu, Min; Dong, Qing; Jiang, Haiyang; Cai, Ronghao; Xiang, Yan

    2016-01-01

    The plant homeobox domain (PHD) proteins are widespread in eukaryotes, and play important roles in regulating chromatin and transcription. Comprehensive analyses of PHD-finger proteins have been performed in animals, but few plant PHD-finger proteins involved in growth and development have been characterized functionally. In this study, we conducted a genome-wide survey of PHD-finger proteins in Populus trichocarpa by describing the phylogenetic relationship, gene structure, and chromosomal location and microarray analyses of each predicted PHD-finger family member. We identified 73 PHD-finger genes (PtPHD1-73) and classified them into eleven subfamilies (A-K) by phylogenetic analysis. Seventy-two of the 73 genes were unevenly distributed on all 19 chromosomes, with seven segmental duplication events. Analysis of the Ka (non-synonymous substitution rate)/Ks (synonymous substitution rate) ratios suggested that the duplicated genes of the PHD-finger family mainly underwent purifying selection with restrictive functional divergence after the duplication events. Expression profiles analysis indicated that 67 PHD-finger genes were differentially expressed in various tissues. Quantitative real-time RT-PCR (qRT-PCR) analyses of nine selected PtPHD genes under high salinity, drought and cold stresses were also performed to explore their stress-related expression patterns. The results of this study provide a thorough overview of poplar PHD-finger proteins and will be valuable for further functional research of poplar PHD-finger genes to unravel their biological roles.

  6. Genome-wide and molecular evolution analyses of the phospholipase D gene family in Poplar and Grape.

    PubMed

    Liu, Qi; Zhang, Chiyu; Yang, Yongping; Hu, Xiangyang

    2010-06-18

    The Phospholipase D (PLD) family plays an important role in the regulation of cellular processes in plants, including abscisic acid signaling, programmed cell death, root hair patterning, root growth, freezing tolerance and other stress responses. PLD genes constitute an important gene family in higher plants. However, until now our knowledge concerning the PLD gene family members and their evolutionary relationship in woody plants such as Poplar and Grape has been limited. In this study, we have provided a genome-wide analysis of the PLD gene family in Poplar and Grape. Eighteen and eleven members of the PLD gene family were identified in Poplar and Grape respectively. Phylogenetic and gene structure analyses showed that the PLD gene family can be divided into 6 subgroups: alpha, beta/gamma, delta, epsilon, zeta, and phi, and that the 6 PLD subgroups originated from 4 original ancestors through a series of gene duplications. Interestingly, the majority of the PLD genes from both Poplar (76.5%, 13/17) and Grape (90.9%, 10/11) clustered closely together in the phylogenetic tree to the extent that their evolutionary relationship appears more tightly linked to each other, at least in terms of the PLD gene family, than it does to either Arabidopsis or rice. Five pairs of duplicated PLD genes were identified in Poplar, more than those in Grape, suggesting that frequent gene duplications occurred after these species diverged, resulting in a rapid expansion of the PLD gene family in Poplar. The majority of the gene duplications in Poplar were caused by segmental duplication and were distinct from those in Arabidopsis, rice and Grape. Additionally, the gene duplications in Poplar were estimated to have occurred from 11.31 to 13.76 million years ago, which are later than those that occurred in the other three plant species. Adaptive evolution analysis showed that positive selection contributed to the evolution of the PXPH- and SP-PLDs, whereas purifying selection has driven

  7. Genome-wide linkage scans for loci affecting total cholesterol, HDL-C, and triglycerides: the Family Blood Pressure Program.

    PubMed

    Bielinski, Suzette J; Tang, Weihong; Pankow, James S; Miller, Michael B; Mosley, Thomas H; Boerwinkle, Eric; Olshen, Richard A; Curb, J David; Jaquish, Cashell E; Rao, D C; Weder, Alan; Arnett, Donna K

    2006-10-01

    Atherosclerosis accounts for 75% of all deaths from cardiovascular disease and includes coronary heart disease (CHD), stroke, and other diseases of the arteries. More than half of all CHD is attributable to abnormalities in levels and metabolism of lipids. To locate genes that affect total cholesterol, high density lipoprotein cholesterol (HDL-C), and triglycerides, genome-wide linkage scans for quantitative trait loci were performed using variance components methods as implemented in SOLAR on a large diverse sample recruited as part of the Family Blood Pressure Program. Phenotype and genetic marker data were available for 9,299 subjects in 2,953 families for total cholesterol, 8,668 subjects in 2,736 families for HDL, and 7,760 subjects in 2,499 families for triglycerides. Mean lipid levels were adjusted for the effects of sex, age, age2, age-by-sex interaction, body mass index, smoking status, and field center. HDL-C and triglycerides were further adjusted for average total alcoholic drinks per week and estrogen use. Significant linkage was found for total cholesterol on chromosome 2 (LOD=3.1 at 43 cM) in Hispanics and for HDL-C on chromosome 3 (LOD=3.0 at 182 cM) and 12 (LOD=3.5 at 124 cM) in Asians. In addition, there were 13 regions that showed suggestive linkage (LOD >or= 2.0); 7 for total cholesterol, 4 for HDL, and 2 for triglycerides. The identification of these loci affecting lipid phenotypes and the apparent congruence with previous linkage results provides increased support that these regions contain genes influencing lipid levels.

  8. Establishing an adjusted p-value threshold to control the family-wide type 1 error in genome wide association studies.

    PubMed

    Duggal, Priya; Gillanders, Elizabeth M; Holmes, Taura N; Bailey-Wilson, Joan E

    2008-10-31

    By assaying hundreds of thousands of single nucleotide polymorphisms, genome wide association studies (GWAS) allow for a powerful, unbiased review of the entire genome to localize common genetic variants that influence health and disease. Although it is widely recognized that some correction for multiple testing is necessary, in order to control the family-wide Type 1 Error in genetic association studies, it is not clear which method to utilize. One simple approach is to perform a Bonferroni correction using all n single nucleotide polymorphisms (SNPs) across the genome; however this approach is highly conservative and would "overcorrect" for SNPs that are not truly independent. Many SNPs fall within regions of strong linkage disequilibrium (LD) ("blocks") and should not be considered "independent". We proposed to approximate the number of "independent" SNPs by counting 1 SNP per LD block, plus all SNPs outside of blocks (interblock SNPs). We examined the effective number of independent SNPs for Genome Wide Association Study (GWAS) panels. In the CEPH Utah (CEU) population, by considering the interdependence of SNPs, we could reduce the total number of effective tests within the Affymetrix and Illumina SNP panels from 500,000 and 317,000 to 67,000 and 82,000 "independent" SNPs, respectively. For the Affymetrix 500 K and Illumina 317 K GWAS SNP panels we recommend using 10(-5), 10(-7) and 10(-8) and for the Phase II HapMap CEPH Utah and Yoruba populations we recommend using 10(-6), 10(-7) and 10(-9) as "suggestive", "significant" and "highly significant" p-value thresholds to properly control the family-wide Type 1 error. By approximating the effective number of independent SNPs across the genome we are able to 'correct' for a more accurate number of tests and therefore develop 'LD adjusted' Bonferroni corrected p-value thresholds that account for the interdepdendence of SNPs on well-utilized commercially available SNP "chips". These thresholds will serve as guides

  9. Automatic identification of highly conserved family regions and relationships in genome wide datasets including remote protein sequences.

    PubMed

    Doğan, Tunca; Karaçalı, Bilge

    2013-01-01

    Identifying shared sequence segments along amino acid sequences generally requires a collection of closely related proteins, most often curated manually from the sequence datasets to suit the purpose at hand. Currently developed statistical methods are strained, however, when the collection contains remote sequences with poor alignment to the rest, or sequences containing multiple domains. In this paper, we propose a completely unsupervised and automated method to identify the shared sequence segments observed in a diverse collection of protein sequences including those present in a smaller fraction of the sequences in the collection, using a combination of sequence alignment, residue conservation scoring and graph-theoretical approaches. Since shared sequence fragments often imply conserved functional or structural attributes, the method produces a table of associations between the sequences and the identified conserved regions that can reveal previously unknown protein families as well as new members to existing ones. We evaluated the biological relevance of the method by clustering the proteins in gold standard datasets and assessing the clustering performance in comparison with previous methods from the literature. We have then applied the proposed method to a genome wide dataset of 17793 human proteins and generated a global association map to each of the 4753 identified conserved regions. Investigations on the major conserved regions revealed that they corresponded strongly to annotated structural domains. This suggests that the method can be useful in predicting novel domains on protein sequences.

  10. Deciphering the genetic control of fruit texture in apple by multiple family-based analysis and genome-wide association

    PubMed Central

    Di Guardo, Mario; Bink, Marco C.A.M.; Guerra, Walter; Letschka, Thomas; Lozano, Lidia; Busatto, Nicola; Poles, Lara; Tadiello, Alice; Bianco, Luca; Visser, Richard G.F.; van de Weg, Eric

    2017-01-01

    Abstract Fruit texture is a complex feature composed of mechanical and acoustic properties relying on the modifications occurring in the cell wall throughout fruit development and ripening. Apple is characterized by a large variation in fruit texture behavior that directly impacts both the consumer’s appreciation and post-harvest performance. To decipher the genetic control of fruit texture comprehensively, two complementing quantitative trait locus (QTL) mapping approaches were employed. The first was represented by a pedigree-based analysis (PBA) carried out on six full-sib pedigreed families, while the second was a genome-wide association study (GWAS) performed on a collection of 233 apple accessions. Both plant materials were genotyped with a 20K single nucleotide polymorphism (SNP) array and phenotyped with a sophisticated high-resolution texture analyzer. The overall QTL results indicated the fundamental role of chromosome 10 in controlling the mechanical properties, while chromosomes 2 and 14 were more associated with the acoustic response. The latter QTL, moreover, showed a consistent relationship between the QTL-estimated genotypes and the acoustic performance assessed among seedlings. The in silico annotation of these intervals revealed interesting candidate genes potentially involved in fruit texture regulation, as suggested by the gene expression profile. The joint integration of these approaches sheds light on the specific control of fruit texture, enabling important genetic information to assist in the selection of valuable fruit quality apple varieties. PMID:28338805

  11. Genome-wide analysis of familial dysautonomia and kinetin target genes with patient olfactory ecto-mesenchymal stem cells.

    PubMed

    Boone, Nathalie; Bergon, Aurélie; Loriod, Béatrice; Devèze, Arnaud; Nguyen, Catherine; Axelrod, Felicia B; Ibrahim, El Chérif

    2012-03-01

    Familial dysautonomia (FD) is a rare inherited neurodegenerative disorder. The most common mutation is a c.2204+6T>C transition in the 5' splice site (5'ss) of IKBKAP intron 20, which causes a tissue-specific skipping of exon 20, resulting in lower synthesis of IKAP/hELP1 protein. To better understand the specificity of neuron loss in FD, we modeled the molecular mechanisms of IKBKAP mRNA splicing by studying human olfactory ecto-mesenchymal stem cells (hOE-MSCs) derived from FD patient nasal biopsies. We explored how the modulation of IKBKAP mRNA alternative splicing impacts the transcriptome at the genome-wide level. We found that the FD transcriptional signature was highly associated with biological functions related to the development of the nervous system. In addition, we identified target genes of kinetin, a plant cytokinin that corrects IKBKAP mRNA splicing and increases the expression of IKAP/hELP1. We identified this compound as a putative regulator of splicing factors and added new evidence for a sequence-specific correction of splicing. In conclusion, hOE-MSCs isolated from FD patients represent a promising avenue for modeling the altered genetic expression of FD, demonstrating a methodology that can be applied to a host of other genetic disorders to test the therapeutic potential of candidate molecules. © 2011 Wiley-Liss, Inc.

  12. Genome-wide Linkage Scan of Antisocial Behavior, Depression and Impulsive Substance Use in the UCSF Family Alcoholism Study

    PubMed Central

    Gizer, Ian R.; Ehlers, Cindy L.; Vieten, Cassandra; Feiler, Heidi S.; Gilder, David A.; Wilhelmsen, Kirk C.

    2012-01-01

    OBJECTIVE Epidemiological and clinical studies suggest that rates of antisocial behavior, depression, and impulsive substance use are increased among individuals diagnosed with alcohol dependence relative to those who are not. Thus, the present study conducted genome-wide linkage scans of antisocial behavior, depression, and impulsive substance use in the University of California at San Francisco Family Alcoholism Study. METHODS Antisocial behavior, depressive symptoms, and impulsive substance use were assessed using three scales from the MMPI-2, the Antisocial Practices content scale (ASP), the Depression content scale (DEP), and the revised MacAndrew Alcoholism scale (MAC-R). Linkage analyses were conducted using a variance components approach. RESULTS Suggestive evidence of linkage to three genomic regions independent of alcohol and cannabis dependence diagnostic status was observed: the ASP scale showed evidence of linkage to chromosome 13 at 11 cM, the MAC-R scale showed evidence of linkage to chromosome 15 at 47 cM, and all 3 scales showed evidence of linkage to chromosome 17 at 57–58 cM. CONCLUSIONS Each of these regions has shown prior evidence of linkage and association to substance dependence as well as other psychiatric disorders such as mood and anxiety disorders, ADHD, and schizophrenia thus suggesting potentially broad relations between these regions and psychopathology. PMID:22517380

  13. Genome-wide analysis of auxin response factor gene family members in medicinal model plant Salvia miltiorrhiza

    PubMed Central

    Xu, Zhichao; Ji, Aijia; Chen, Shilin

    2016-01-01

    ABSTRACT Auxin response factors (ARFs) can function as transcriptional activators or repressors to regulate the expression of auxin response genes by specifically binding to auxin response elements (AuxREs) during plant development. Based on a genome-wide strategy using the medicinal model plant Salvia miltiorrhiza, 25 S. miltiorrhiza ARF (SmARF) gene family members in four classes (class Ia, IIa, IIb and III) were comprehensively analyzed to identify characteristics including gene structures, conserved domains, phylogenetic relationships and expression patterns. In a hybrid analysis of the phylogenetic tree, microRNA targets, and expression patterns of SmARFs in different organs, root tissues, and methyl jasmonate or indole-3-acetic acid treatment conditions, we screened for candidate SmARFs involved in various developmental processes of S. miltiorrhiza. Based on this analysis, we predicted that SmARF25, SmARF7, SmARF16 and SmARF20 are involved in flower, leaf, stem and root development, respectively. With the further insight into the targets of miR160 and miR167, specific SmARF genes in S. miltiorrhiza might encode products that participate in biological processes as described for ARF genes in Arabidopsis. Our results provide a foundation for understanding the molecular basis and regulatory mechanisms of SmARFs in S. miltiorrhiza. PMID:27230647

  14. Genome-wide linkage scan of antisocial behavior, depression, and impulsive substance use in the UCSF family alcoholism study.

    PubMed

    Gizer, Ian R; Ehlers, Cindy L; Vieten, Cassandra; Feiler, Heidi S; Gilder, David A; Wilhelmsen, Kirk C

    2012-10-01

    Epidemiological and clinical studies suggest that the rates of antisocial behavior, depression, and impulsive substance use are increased among individuals diagnosed with alcohol dependence relative to those who are not. Thus, the present study conducted genome-wide linkage scans of antisocial behavior, depression, and impulsive substance use in the University of California at San Francisco Family Alcoholism Study. Antisocial behavior, depressive symptoms, and impulsive substance use were assessed using three scales from the Minnesota Multiphasic Personality Inventory - 2nd ed.: the Antisocial Practices content scale, the Depression content scale, and the revised MacAndrew Alcoholism scale. Linkage analyses were carried out using a variance components approach. Suggestive evidence of linkage to three genomic regions independent of alcohol and cannabis dependence diagnostic status was observed: the Antisocial Practices content scale showed evidence of linkage to chromosome 13 at 11 cM, the MacAndrew Alcoholism scale showed evidence of linkage to chromosome 15 at 47 cM, and all three scales showed evidence of linkage to chromosome 17 at 57-58 cM. Each of these regions has shown previous evidence of linkage and association to substance dependence as well as other psychiatric disorders such as mood and anxiety disorders, attention-deficit hyperactivity disorder, and schizophrenia, thus suggesting potentially broad relations between these regions and psychopathology.

  15. Comprehensive analysis of CCCH-type zinc finger gene family in citrus (Clementine mandarin) by genome-wide characterization.

    PubMed

    Liu, Shengrui; Khan, Muhammad Rehman Gul; Li, Yongping; Zhang, Jinzhi; Hu, Chungen

    2014-10-01

    The CCCH-type zinc finger proteins comprise a large gene family of regulatory proteins and are widely distributed in eukaryotic organisms. The CCCH proteins have been implicated in multiple biological processes and environmental responses in plants. Little information is available, however, about CCCH genes in plants, especially in woody plants such as citrus. The release of the whole-genome sequence of citrus allowed us to perform a genome-wide analysis of CCCH genes and to compare the identified proteins with their orthologs in model plants. In this study, 62 CCCH genes and a total of 132 CCCH motifs were identified, and a comprehensive analysis including the chromosomal locations, phylogenetic relationships, functional annotations, gene structures and conserved motifs was performed. Distribution mapping revealed that 54 of the 62 CCCH genes are unevenly dispersed on the nine citrus chromosomes. Based on phylogenetic analysis and gene structural features, we constructed 5 subfamilies of 62 CCCH members and integrative subfamilies from citrus, Arabidopsis, and rice, respectively. Importantly, large numbers of SNPs and InDels in 26 CCCH genes were identified from Poncirus trifoliata and Fortunella japonica using whole-genome deep re-sequencing. Furthermore, citrus CCCH genes showed distinct temporal and spatial expression patterns in different developmental processes and in response to various stress conditions. Our comprehensive analysis of CleC3Hs is a valuable resource that further elucidates the roles of CCCH family members in plant growth and development. In addition, variants and comparative genomics analyses deepen our understanding of the evolution of the CCCH gene family and will contribute to further genetics and genomics studies of citrus and other plant species.

  16. Genome-wide identification and functional analysis of the TIFY gene family in response to drought in cotton.

    PubMed

    Zhao, Ge; Song, Yun; Wang, Caixiang; Butt, Hamama Islam; Wang, Qianhua; Zhang, Chaojun; Yang, Zuoren; Liu, Zhao; Chen, Eryong; Zhang, Xueyan; Li, Fuguang

    2016-12-01

    Jasmonates control many aspects of plant biological processes. They are important for regulating plant responses to various biotic and abiotic stresses, including drought, which is one of the most serious threats to sustainable agricultural production. However, little is known regarding how jasmonate ZIM-domain (JAZ) proteins mediate jasmonic acid signals to improve stress tolerance in cotton. This represents the first comprehensive comparative study of TIFY transcription factors in both diploid A, D and tetraploid AD cotton species. In this study, we identified 21 TIFY family members in the genome of Gossypium arboretum, 28 members from Gossypium raimondii and 50 TIFY genes in Gossypium hirsutum. The phylogenetic analyses indicated the TIFY gene family could be divided into the following four subfamilies: TIFY, PPD, ZML, and JAZ subfamilies. The cotton TIFY genes have expanded through tandem duplications and segmental duplications compared with other plant species. Gene expression profile revealed temporal and tissue specificities for TIFY genes under simulated drought conditions in Gossypium arboretum. The JAZ subfamily members were the most highly expressed genes, suggesting that they have a vital role in responses to drought stress. Over-expression of GaJAZ5 gene decreased water loss, stomatal openings, and the accumulation of H2O2 in Arabidopsis thaliana. Additionally, the results of drought tolerance assays suggested that this subfamily might be involved in increasing drought tolerance. Our study provides new data regarding the genome-wide analysis of TIFY gene families and their important roles in drought tolerance in cotton species. These data may form the basis of future studies regarding the relationship between drought and jasmonic acid.

  17. Genome-Wide Identification, Evolution, and Expression Analysis of the ATP-Binding Cassette Transporter Gene Family in Brassica rapa

    PubMed Central

    Yan, Chao; Duan, Weike; Lyu, Shanwu; Li, Ying; Hou, Xilin

    2017-01-01

    ATP-binding cassette (ABC) proteins can act as transporters of different substrates across biological membranes by hydrolyzing ATP. However, little information is available about ABC transporters in Brassica rapa, an important leafy vegetable. In the present study, we carried out genome-wide identification, characterization and molecular evolution analyses of ABC gene family in B. rapa and 9 other plant species. A total of 179 B. rapa ABC genes (BraABCs) were identified. Among them, 173 BraABCs were identified on 10 chromosomes. Based on phylogenetic analysis and domain organization, the BraABC family could be grouped into eight subfamilies. BraABCs in the same subfamily showed similar motif composition and exon-intron organization. Common and unique cis-elements involved in the transcriptional regulation were also identified in the promoter regions of BraABCs. Tissue-expression analysis of BraABCs demonstrated their diverse spatiotemporal expression profiles. Influences of the whole genome triplication (WGT) on the evolution of BraABCs were studied in detail. BraABCs were preferentially retained compared with their neighboring genes during diploidization after WGT. Synteny analysis identified 76 pairs of syntenic BraABC paralogs among the three subgenomes of B. rapa, and 10 paralog pairs underwent positive selection with ω (= Ka/Ks) ratios greater than 1. Analyses of the expression patterns of syntenic BraABC paralogs pairs across five tissues and under stress treatments revealed their functional conservation, sub-functionalization, neo-functionalization and pseudogenization during evolution. Our study presents a comprehensive overview of the ABC gene family in B. rapa and will be helpful for the further functional study of BraABCs in plant growth, development, and stress responses. PMID:28367152

  18. Genome-wide analysis of the omega-3 fatty acid desaturase gene family in Gossypium

    SciTech Connect

    Yurchenko, Olga P.; Park, Sunjung; Ilut, Daniel C.; Inmon, Jay J.; Millhollon, Jon C.; Liechty, Zach; Page, Justin T.; Jenks, Matthew A.; Chapman, Kent D.; Udall, Joshua A.; Gore, Michael A.; Dyer, John M.

    2014-11-18

    The majority of commercial cotton varieties planted worldwide are derived from Gossypium hirsutum, which is a naturally occurring allotetraploid produced by interspecific hybridization of A- and D-genome diploid progenitor species. While most cotton species are adapted to warm, semi-arid tropical and subtropical regions, and thus perform well in these geographical areas, cotton seedlings are sensitive to cold temperature, which can significantly reduce crop yields. One of the common biochemical responses of plants to cold temperatures is an increase in omega-3 fatty acids, which protects cellular function by maintaining membrane integrity. The purpose of our study was to identify and characterize the omega-3 fatty acid desaturase (FAD) gene family in G. hirsutum, with an emphasis on identifying omega-3 FADs involved in cold temperature adaptation. Results: Eleven omega-3 FAD genes were identified in G. hirsutum, and characterization of the gene family in extant A and D diploid species (G. herbaceum and G. raimondii, respectively) allowed for unambiguous genome assignment of all homoeologs in tetraploid G. hirsutum. The omega-3 FAD family of cotton includes five distinct genes, two of which encode endoplasmic reticulum-type enzymes (FAD3-1 and FAD3-2) and three that encode chloroplast-type enzymes (FAD7/8-1, FAD7/8-2, and FAD7/8-3). The FAD3-2 gene was duplicated in the A genome progenitor species after the evolutionary split from the D progenitor, but before the interspecific hybridization event that gave rise to modern tetraploid cotton. RNA-seq analysis revealed conserved, gene-specific expression patterns in various organs and cell types and semi-quantitative RT-PCR further revealed that FAD7/8-1 was specifically induced during cold temperature treatment of G. hirsutum seedlings. Conclusions: The omega-3 FAD gene family in cotton was characterized at the genome-wide level

  19. Genome-wide analysis of Aux/IAA gene family in Solanaceae species using tomato as a model.

    PubMed

    Wu, Jian; Peng, Zhen; Liu, Songyu; He, Yanjun; Cheng, Lin; Kong, Fuling; Wang, Jie; Lu, Gang

    2012-04-01

    Auxin plays key roles in a wide variety of plant activities, including embryo development, leaf formation, phototropism, fruit development and root initiation and development. Auxin/indoleacetic acid (Aux/IAA) genes, encoding short-lived nuclear proteins, are key regulators in the auxin transduction pathway. But how they work is still unknown. In order to conduct a systematic analysis of this gene family in Solanaceae species, a genome-wide search for the homologues of auxin response genes was carried out. Here, 26 and 27 non redundant AUX/IAAs were identified in tomato and potato, respectively. Using tomato as a model, a comprehensive overview of SlIAA gene family is presented, including the gene structures, phylogeny, chromosome locations, conserved motifs and cis-elements in promoter sequences. A phylogenetic tree generated from alignments of the predicted protein sequences of 31 OsIAAs, 29 AtIAAs, 31 ZmIAAs, and 26 SlIAAs revealed that these IAAs were clustered into three major groups and ten subgroups. Among them, seven subgroups were present in both monocot and dicot species, which indicated that the major functional diversification within the IAA family predated the monocot/dicot divergence. In contrast, group C and some other subgroups seemed to be species-specific. Quantitative real-time PCR (qRT-PCR) analysis showed that 19 of the 26 SlIAA genes could be detected in all tomato organs/tissues, however, seven of them were specifically expressed in some of tomato tissues. The transcript abundance of 17 SlIAA genes were increased within a few hours when the seedlings were treated with exogenous IAA. However, those of other six SlIAAs were decreased. The results of stress treatments showed that most SIIAA family genes responded to at least one of the three stress treatments, however, they exhibited diverse expression levels under different abiotic stress conditions in tomato seedlings. SlIAA20, SlIAA21 and SlIAA22 were not significantly influenced by stress

  20. Genome-Wide Organization and Expression Profiling of the NAC Transcription Factor Family in Potato (Solanum tuberosum L.)

    PubMed Central

    Singh, Anil Kumar; Sharma, Vishal; Pal, Awadhesh Kumar; Acharya, Vishal; Ahuja, Paramvir Singh

    2013-01-01

    NAC [no apical meristem (NAM), Arabidopsis thaliana transcription activation factor [ATAF1/2] and cup-shaped cotyledon (CUC2)] proteins belong to one of the largest plant-specific transcription factor (TF) families and play important roles in plant development processes, response to biotic and abiotic cues and hormone signalling. Our genome-wide analysis identified 110 StNAC genes in potato encoding for 136 proteins, including 14 membrane-bound TFs. The physical map positions of StNAC genes on 12 potato chromosomes were non-random, and 40 genes were found to be distributed in 16 clusters. The StNAC proteins were phylogenetically clustered into 12 subgroups. Phylogenetic analysis of StNACs along with their Arabidopsis and rice counterparts divided these proteins into 18 subgroups. Our comparative analysis has also identified 36 putative TNAC proteins, which appear to be restricted to Solanaceae family. In silico expression analysis, using Illumina RNA-seq transcriptome data, revealed tissue-specific, biotic, abiotic stress and hormone-responsive expression profile of StNAC genes. Several StNAC genes, including StNAC072 and StNAC101that are orthologs of known stress-responsive Arabidopsis RESPONSIVE TO DEHYDRATION 26 (RD26) were identified as highly abiotic stress responsive. Quantitative real-time polymerase chain reaction analysis largely corroborated the expression profile of StNAC genes as revealed by the RNA-seq data. Taken together, this analysis indicates towards putative functions of several StNAC TFs, which will provide blue-print for their functional characterization and utilization in potato improvement. PMID:23649897

  1. Genome-wide organization and expression profiling of the NAC transcription factor family in potato (Solanum tuberosum L.).

    PubMed

    Singh, Anil Kumar; Sharma, Vishal; Pal, Awadhesh Kumar; Acharya, Vishal; Ahuja, Paramvir Singh

    2013-08-01

    NAC [no apical meristem (NAM), Arabidopsis thaliana transcription activation factor [ATAF1/2] and cup-shaped cotyledon (CUC2)] proteins belong to one of the largest plant-specific transcription factor (TF) families and play important roles in plant development processes, response to biotic and abiotic cues and hormone signalling. Our genome-wide analysis identified 110 StNAC genes in potato encoding for 136 proteins, including 14 membrane-bound TFs. The physical map positions of StNAC genes on 12 potato chromosomes were non-random, and 40 genes were found to be distributed in 16 clusters. The StNAC proteins were phylogenetically clustered into 12 subgroups. Phylogenetic analysis of StNACs along with their Arabidopsis and rice counterparts divided these proteins into 18 subgroups. Our comparative analysis has also identified 36 putative TNAC proteins, which appear to be restricted to Solanaceae family. In silico expression analysis, using Illumina RNA-seq transcriptome data, revealed tissue-specific, biotic, abiotic stress and hormone-responsive expression profile of StNAC genes. Several StNAC genes, including StNAC072 and StNAC101that are orthologs of known stress-responsive Arabidopsis RESPONSIVE TO DEHYDRATION 26 (RD26) were identified as highly abiotic stress responsive. Quantitative real-time polymerase chain reaction analysis largely corroborated the expression profile of StNAC genes as revealed by the RNA-seq data. Taken together, this analysis indicates towards putative functions of several StNAC TFs, which will provide blue-print for their functional characterization and utilization in potato improvement.

  2. Deciphering Transcription Factor Binding Patterns from Genome-Wide High Density ChIP-chip Tiling Array Data

    NASA Astrophysics Data System (ADS)

    Li, Juntao; Zhu, Lei; Eshaghi, Majid; Liu, Jianhua; Karuturi, Radha Krishna Murthy

    The binding events of DNA-binding proteins can be extensively characterized by high density ChIP-chip tiling array data. The binding sites and binding occupancy patterns are all very useful to understand the DNA-protein interaction. We propose a statistical procedure which focuses on identifying the interaction signal regions and the patterns of interaction using peakedness and skewness tests. Its utility to annotate the binding signals by analyzing the Tbp1 and Rpb1 ChIP-chip datasets in fission yeast is demonstrated.

  3. Genome-wide identification and characterization of SnRK2 gene family in cotton (Gossypium hirsutum L.).

    PubMed

    Liu, Zhao; Ge, Xiaoyang; Yang, Zuoren; Zhang, Chaojun; Zhao, Ge; Chen, Eryong; Liu, Ji; Zhang, Xueyan; Li, Fuguang

    2017-06-12

    Sucrose non-fermenting-1-related protein kinase 2 (SnRK2) is a plant-specific serine/threonine kinase family involved in the abscisic acid (ABA) signaling pathway and responds to osmotic stress. A genome-wide analysis of this protein family has been conducted previously in some plant species, but little is known about SnRK2 genes in upland cotton (Gossypium hirsutum L.). The recent release of the G. hirsutum genome sequence provides an opportunity to identify and characterize the SnRK2 kinase family in upland cotton. We identified 20 putative SnRK2 sequences in the G. hirsutum genome, designated as GhSnRK2.1 to GhSnRK2.20. All of the sequences encoded hydrophilic proteins. Phylogenetic analysis showed that the GhSnRK2 genes were classifiable into three groups. The chromosomal location and phylogenetic analysis of the cotton SnRK2 genes indicated that segmental duplication likely contributed to the diversification and evolution of the genes. The gene structure and motif composition of the cotton SnRK2 genes were analyzed. Nine exons were conserved in length among all members of the GhSnRK2 family. Although the C-terminus was divergent, seven conserved motifs were present. All GhSnRK2s genes showed expression patterns under abiotic stress based on transcriptome data. The expression profiles of five selected genes were verified in various tissues by quantitative real-time RT-PCR (qRT-PCR). Transcript levels of some family members were up-regulated in response to drought, salinity or ABA treatments, consistent with potential roles in response to abiotic stress. This study is the first comprehensive analysis of SnRK2 genes in upland cotton. Our results provide the fundamental information for the functional dissection of GhSnRK2s and vital availability for the improvement of plant stress tolerance using GhSnRK2s.

  4. First all-in-one diagnostic tool for DNA intelligence: genome-wide inference of biogeographic ancestry, appearance, relatedness, and sex with the Identitas v1 Forensic Chip.

    PubMed

    Keating, Brendan; Bansal, Aruna T; Walsh, Susan; Millman, Jonathan; Newman, Jonathan; Kidd, Kenneth; Budowle, Bruce; Eisenberg, Arthur; Donfack, Joseph; Gasparini, Paolo; Budimlija, Zoran; Henders, Anjali K; Chandrupatla, Hareesh; Duffy, David L; Gordon, Scott D; Hysi, Pirro; Liu, Fan; Medland, Sarah E; Rubin, Laurence; Martin, Nicholas G; Spector, Timothy D; Kayser, Manfred

    2013-05-01

    When a forensic DNA sample cannot be associated directly with a previously genotyped reference sample by standard short tandem repeat profiling, the investigation required for identifying perpetrators, victims, or missing persons can be both costly and time consuming. Here, we describe the outcome of a collaborative study using the Identitas Version 1 (v1) Forensic Chip, the first commercially available all-in-one tool dedicated to the concept of developing intelligence leads based on DNA. The chip allows parallel interrogation of 201,173 genome-wide autosomal, X-chromosomal, Y-chromosomal, and mitochondrial single nucleotide polymorphisms for inference of biogeographic ancestry, appearance, relatedness, and sex. The first assessment of the chip's performance was carried out on 3,196 blinded DNA samples of varying quantities and qualities, covering a wide range of biogeographic origin and eye/hair coloration as well as variation in relatedness and sex. Overall, 95 % of the samples (N = 3,034) passed quality checks with an overall genotype call rate >90 % on variable numbers of available recorded trait information. Predictions of sex, direct match, and first to third degree relatedness were highly accurate. Chip-based predictions of biparental continental ancestry were on average ~94 % correct (further support provided by separately inferred patrilineal and matrilineal ancestry). Predictions of eye color were 85 % correct for brown and 70 % correct for blue eyes, and predictions of hair color were 72 % for brown, 63 % for blond, 58 % for black, and 48 % for red hair. From the 5 % of samples (N = 162) with <90 % call rate, 56 % yielded correct continental ancestry predictions while 7 % yielded sufficient genotypes to allow hair and eye color prediction. Our results demonstrate that the Identitas v1 Forensic Chip holds great promise for a wide range of applications including criminal investigations, missing person investigations, and for national security

  5. Genome-Wide Identification and Expression Profiling of Tomato Hsp20 Gene Family in Response to Biotic and Abiotic Stresses

    PubMed Central

    Yu, Jiahong; Cheng, Yuan; Feng, Kun; Ruan, Meiying; Ye, Qingjing; Wang, Rongqing; Li, Zhimiao; Zhou, Guozhi; Yao, Zhuping; Yang, Yuejian; Wan, Hongjian

    2016-01-01

    The Hsp20 genes are involved in the response of plants to environment stresses including heat shock and also play a vital role in plant growth and development. They represent the most abundant small heat shock proteins (sHsps) in plants, but little is known about this family in tomato (Solanum lycopersicum), an important vegetable crop in the world. Here, we characterized heat shock protein 20 (SlHsp20) gene family in tomato through integration of gene structure, chromosome location, phylogenetic relationship, and expression profile. Using bioinformatics-based methods, we identified at least 42 putative SlHsp20 genes in tomato. Sequence analysis revealed that most of SlHsp20 genes possessed no intron or a relatively short intron in length. Chromosome mapping indicated that inter-arm and intra-chromosome duplication events contributed remarkably to the expansion of SlHsp20 genes. Phylogentic tree of Hsp20 genes from tomato and other plant species revealed that SlHsp20 genes were grouped into 13 subfamilies, indicating that these genes may have a common ancestor that generated diverse subfamilies prior to the mono-dicot split. In addition, expression analysis using RNA-seq in various tissues and developmental stages of cultivated tomato and the wild relative Solanum pimpinellifolium revealed that most of these genes (83%) were expressed in at least one stage from at least one genotype. Out of 42 genes, 4 genes were expressed constitutively in almost all the tissues analyzed, implying that these genes might have specific housekeeping function in tomato cell under normal growth conditions. Two SlHsp20 genes displayed differential expression levels between cultivated tomato and S. pimpinellifolium in vegetative (leaf and root) and reproductive organs (floral bud and flower), suggesting inter-species diversification for functional specialization during the process of domestication. Based on genome-wide microarray analysis, we showed that the transcript levels of SlHsp20

  6. Genome-Wide Identification and Expression Profiling of Tomato Hsp20 Gene Family in Response to Biotic and Abiotic Stresses.

    PubMed

    Yu, Jiahong; Cheng, Yuan; Feng, Kun; Ruan, Meiying; Ye, Qingjing; Wang, Rongqing; Li, Zhimiao; Zhou, Guozhi; Yao, Zhuping; Yang, Yuejian; Wan, Hongjian

    2016-01-01

    The Hsp20 genes are involved in the response of plants to environment stresses including heat shock and also play a vital role in plant growth and development. They represent the most abundant small heat shock proteins (sHsps) in plants, but little is known about this family in tomato (Solanum lycopersicum), an important vegetable crop in the world. Here, we characterized heat shock protein 20 (SlHsp20) gene family in tomato through integration of gene structure, chromosome location, phylogenetic relationship, and expression profile. Using bioinformatics-based methods, we identified at least 42 putative SlHsp20 genes in tomato. Sequence analysis revealed that most of SlHsp20 genes possessed no intron or a relatively short intron in length. Chromosome mapping indicated that inter-arm and intra-chromosome duplication events contributed remarkably to the expansion of SlHsp20 genes. Phylogentic tree of Hsp20 genes from tomato and other plant species revealed that SlHsp20 genes were grouped into 13 subfamilies, indicating that these genes may have a common ancestor that generated diverse subfamilies prior to the mono-dicot split. In addition, expression analysis using RNA-seq in various tissues and developmental stages of cultivated tomato and the wild relative Solanum pimpinellifolium revealed that most of these genes (83%) were expressed in at least one stage from at least one genotype. Out of 42 genes, 4 genes were expressed constitutively in almost all the tissues analyzed, implying that these genes might have specific housekeeping function in tomato cell under normal growth conditions. Two SlHsp20 genes displayed differential expression levels between cultivated tomato and S. pimpinellifolium in vegetative (leaf and root) and reproductive organs (floral bud and flower), suggesting inter-species diversification for functional specialization during the process of domestication. Based on genome-wide microarray analysis, we showed that the transcript levels of SlHsp20

  7. Genome-Wide Survey and Characterization of Fatty Acid Desaturase Gene Family in Brassica napus and Its Parental Species.

    PubMed

    Xue, Yufei; Chen, Baojun; Wang, Rui; Win, Aung Naing; Li, Jiana; Chai, Yourong

    2017-08-10

    Rapeseed (Brassica napus) is an important oilseed crop worldwide, and fatty acid (FA) compositions determine the nutritional and economic value of its seed oil. Fatty acid desaturases (FADs) play a pivotal role in regulating FA compositions, but to date, no comprehensive genome-wide analysis of FAD gene family in rapeseed and its parent species has been reported. In this study, using homology searches, 84, 45, and 44 FAD genes were identified in rapeseed, Brassica rapa, and Brassica oleracea genomes, respectively. These FAD genes were unevenly located in 17 chromosomes and 2 scaffolds of rapeseed, 9 chromosomes and 1 scaffold of B. rapa, and all the chromosomes of B. oleracea. Phylogenetic analysis showed that the soluble and membrane-bound FADs in the three Brassica species were divided into four and six subfamilies, respectively. Generally, the soluble FADs contained two conserved histidine boxes, while three highly conserved histidine boxes were harbored in membrane-bound FADs. Exon-intron structure, intron phase, and motif composition and position were highly conserved in each FAD subfamily. Putative subcellular locations of FAD proteins in three Brassica species were consistent with those of corresponding known FADs. In total, 25 of simple sequence repeat (SSR) loci were found in FAD genes of the three Brassica species. Transcripts of selected FAD genes in the three species were examined in various organs/tissues or stress treatments from NCBI expressed sequence tag (EST) database. This study provides a critical molecular basis for quality improvement of rapeseed oil and facilitates our understanding of key roles of FAD genes in plant growth and development and stress response.

  8. Why breeding values estimated using familial data should not be used for genome-wide association studies.

    PubMed

    Ekine, Chinyere C; Rowe, Suzanne J; Bishop, Stephen C; de Koning, Dirk-Jan

    2014-02-19

    In animal breeding, the genetic potential of an animal is summarized as its estimated breeding value, which is derived from its own performance as well as the performance of related individuals. Here, we illustrate why estimated breeding values are not suitable as a phenotype for genome-wide association studies. We simulated human-type and pig-type pedigrees with a range of quantitative trait loci (QTL) effects (0.5-3% of phenotypic variance) and heritabilities (0.3-0.8). We analyzed 1000 replicates of each scenario with four models: (a) a full mixed model including a polygenic effect, (b) a regression analysis using the residual of a mixed model as a trait score (so called GRAMMAR approach), (c) a regression analysis using the estimated breeding value as a trait score, and (d) a regression analysis that uses the raw phenotype as a trait score. We show that using breeding values as a trait score gives very high false-positive rates (up 14% in human pedigrees and >60% in pig pedigrees). Simulations based on a real pedigree show that additional generations of pedigree increase the type I error. Including the family relationship as a random effect provides the greatest power to detect QTL while controlling for type I error at the desired level and providing the most accurate estimates of the QTL effect. Both the use of residuals and the use of breeding values result in deflated estimates of the QTL effect. We derive the contributions of QTL effects to the breeding value and residual and show how this affects the estimates.

  9. Genome-Wide Analysis and Evolution of the Pto-Like Protein Kinase (PLPK) Gene Family in Pepper

    PubMed Central

    Venkatesh, Jelli; Jahn, Molly; Kang, Byoung-Cheorl

    2016-01-01

    The tomato Pto gene, which encodes a serine/threonine kinase (STK) domain-containing protein, confers resistance to bacterial speck disease caused by Pseudomonas syringae pv. tomato (Pst). In this study, in vivo recognition assays using PVX constructs showed that AvrPto was specifically recognized in the pepper genotypes. This AvrPto recognition caused a nonhost hypersensitive response (HR) and localization of the PVX::AvrPto fusion protein to inoculated pepper leaf tissues, which indicates the presence of a similar Pto recognition mechanism in pepper as in tomato. However, genome-wide analysis in pepper revealed no Pto clade corresponding to that in tomato, suggesting an alternative system for Pto recognition in pepper. Nevertheless, 25 Pto-like protein kinases (PLPKs) with a highly conserved STK domain have been identified in the pepper genome. For the majority of the amino acid sites in the STK domain of Ptos and PLPKs, nonsynonymous (dN) to synonymous (dS) nucleotide substitution ratios (ω) were less than one, suggesting that purifying selection played a predominant role in the evolutionary process. However, some amino acid sites were found to be subjected to episodic positive selection in the course of evolution of Pto homologs, and, thus, different evolutionary processes might have shaped the Pto gene family in plants. Based on RNA-seq data, PLPK genes and other Pto pathway genes, such as Prf, Pti1, Pti5, and Pti6 were expressed in all tested pepper genotypes. Therefore, the nonhost HR against Pst in pepper may be due to the recognition of the AvrPto effector by a PLPK homolog, and subsequent action of downstream components of the Pto signaling pathway. However, the possibility remains that the recognition of AvrPto in pepper plants may involve activities of other receptor like kinases (RLKs). The identification of the PLPKs in this study will serve as a foundation for further efforts to understand the roles of PLPKs in nonhost resistance. PMID:27536870

  10. Genome-wide detection of allelic gene expression in hepatocellular carcinoma cells using a human exome SNP chip.

    PubMed

    Park, Yon Mi; Cheong, Hyun Sub; Lee, Jong-Keuk

    2014-11-10

    Allelic variations in gene expression influence many biological responses and cause phenotypic variations in humans. In this study, Illumina Human Exome BeadChips containing more than 240,000 single nucleotide polymorphisms (SNPs) were used to identify changes in allelic gene expression in hepatocellular carcinoma cells following lipopolysaccharide (LPS) stimulation. We found 17 monoallelically expressed genes, 58 allelic imbalanced genes, and 7 genes showing allele substitution. In addition, we also detected 33 differentially expressed genes following LPS treatment in vitro using these human exome SNP chips. However, alterations in allelic gene expression following LPS treatment were detected in only three genes (MLXIPL, TNC, and MX2), which were observed in one cell line sample only, indicating that changes in allelic gene expression following LPS stimulation of liver cells are rare events. Among a total of 75 genes showing allelic expression in hepatocellular carcinoma cells, either monoallelic or imbalanced, 43 genes (57.33%) had expression quantitative trait loci (eQTL) data, indicating that high-density exome SNP chips are useful and reliable for studying allelic gene expression. Furthermore, most genes showing allelic expression were regulated by cis-acting mechanisms and were also significantly associated with several human diseases. Overall, our study provides a better understanding of allele-specific gene expression in hepatocellular carcinoma cells with and without LPS stimulation and potential clues for the cause of human disease due to alterations in allelic gene expression.

  11. Genome-wide analysis of the ATP-binding cassette (ABC) transporter gene family in sea lamprey and Japanese lamprey.

    PubMed

    Ren, Jianfeng; Chung-Davidson, Yu-Wen; Yeh, Chu-Yin; Scott, Camille; Brown, Titus; Li, Weiming

    2015-06-06

    Lampreys are extant representatives of the jawless vertebrate lineage that diverged from jawed vertebrates around 500 million years ago. Lamprey genomes contain information crucial for understanding the evolution of gene families in vertebrates. The ATP-binding cassette (ABC) gene family is found from prokaryotes to eukaryotes. The recent availability of two lamprey draft genomes from sea lamprey Petromyzon marinus and Japanese lamprey Lethenteron japonicum presents an opportunity to infer early evolutionary events of ABC genes in vertebrates. We conducted a genome-wide survey of the ABC gene family in two lamprey draft genomes. A total of 37 ABC transporters were identified and classified into seven subfamilies; namely seven ABCA genes, 10 ABCB genes, 10 ABCC genes, three ABCD genes, one ABCE gene, three ABCF genes, and three ABCG genes. The ABCA subfamily has expanded from three genes in sea squirts, seven and nine in lampreys and zebrafish, to 13 and 16 in human and mouse. Conversely, the multiple copies of ABCB1-, ABCG1-, and ABCG2-like genes found in sea squirts have contracted in the other species examined. ABCB2 and ABCB3 seem to be new additions in gnathostomes (not in sea squirts or lampreys), which coincides with the emergence of the gnathostome-specific adaptive immune system. All the genes in the ABCD, ABCE and ABCF subfamilies were conserved and had undergone limited duplication and loss events. In the sea lamprey transcriptomes, the ABCE and ABCF gene subfamilies were ubiquitously and highly expressed in all tissues while the members in other gene subfamilies were differentially expressed. Thirteen more lamprey ABC transporter genes were identified in this study compared with a previous study. By concatenating the same gene sequences from the two lampreys, more full length sequences were obtained, which significantly improved both the assignment of gene names and the phylogenetic trees compared with a previous analysis using partial sequences. The ABC

  12. Genome-wide identification and characterization of the superoxide dismutase gene family in Musa acuminata cv. Tianbaojiao (AAA group).

    PubMed

    Feng, Xin; Lai, Zhongxiong; Lin, Yuling; Lai, Gongti; Lian, Conglong

    2015-10-20

    Superoxide dismutase (SOD) is an essential enzyme of the plant antioxidant system that responds to oxidative stresses caused by adverse conditions. Banana is an important staple and economic crop in tropical and subtropical regions. However, its growth and yield are constantly affected by various abiotic stresses. To analyze the roles of distinct SOD genes under various stresses, a detailed characterization and analysis of the SOD gene family in Cavendish banana is indispensable. The presence and structure of the SOD family genes were experimentally verified using 5'/3' RACE-PCR, reverse transcription PCR and PCR. Then, their syntenic relationships, conserved motifs and phylogenetic relationships were analyzed using software. Cis-elements present in the promoters were predicted via PlantCARE. And the expression levels under abiotic and hormonal stresses were determined using real-time quantitative polymerase chain reaction. In total, 25 'Tianbaojiao' SOD cDNAs (MaSODs), which encoded six Cu/ZnSODs, four MnSODs and two FeSODs, were cloned. The 12 MaSOD genes were divided into four groups based on their conserved motifs, which corroborated their classifications based on gene-structure patterns and subcellular localizations. Eleven MaSOD promoters were isolated and found to contain many cis-acting elements involved in stress responses. Gene expression analysis showed that 11 out of the 12 MaSODs were expressed in all tested tissues (leaf, pseudostem and root), whereas MaCSD2B was expressed only in leaves and roots. Specific MaSOD members exhibited different expression patterns under abiotic and hormonal treatments. Among the 12 MaSOD genes, MaCSD1D was the only one that responded to all eight treatments, suggesting that this gene plays a predominant role in reactive oxygen species scavenging caused by various stresses in banana. A genome-wide analysis showed that the 'Tianbaojiao' banana harbored an expanded SOD gene family. Whole genome duplication, segmental

  13. Genome-wide linkage scan for plasma high density lipoprotein cholesterol, apolipoprotein A-1 and triglyceride variation among American Indian populations: the Strong Heart Family Study

    PubMed Central

    Li, X; Monda, KL.; Göring, HHH; Haack, K; Cole, SA; Diego, VP; Almasy, L; Laston, S; Howard, BV; Shara, NM; Lee, ET; Best, LG; Fabsitz, RR; MacCluer, JW; North, KE

    2011-01-01

    Background Recent studies have identified chromosomal regions linked to variation in high density lipoprotein cholesterol (HDL-C), apolipoprotein A-1 (Apo A-1) and triglyceride (TG), although results have been inconsistent and previous studies of American Indian populations are limited Objective In an attempt to localize quantitative trait loci (QTLs) influencing HDL-C, Apo A-1 and TG, we conducted genome-wide linkage scans of subjects of the Strong Heart Family Study. Methods We implemented analyses in 3484 men and women aged 18 years or older, at three study centers. Results With adjustment for age, sex and center, we detected a QTL influencing both HDL-C (LOD = 4.4, genome-wide P = 0.001) and Apo A-1 (LOD = 3.2, genome-wide P = 0.020) nearest marker D6S289 at 6p23 in the Arizona sample. Another QTL influencing Apo A-1 was found nearest marker D9S287 at 9q22.2 (LOD = 3.0, genome-wide P = 0.033) in the North and South Dakotas. We detected a QTL influencing TG nearest marker D15S153 at 15q22.31 (LOD = 4.5 in the overall sample and LOD = 3.8 in the Dakotas sample, genome-wide P = 0.0044) and when additionally adjusted for waist, current smoking, current alcohol, current estrogen, lipid treatment, impaired fasting glucose, and diabetes, nearest marker D10S217 at 10q26.2 (LOD = 3.7, genome-wide P = 0.0058) in the Arizona population. Conclusions The replication of QTLs in regions of the genome that harbor well-known candidate genes suggest that chromosomes 6p, 9q and 15q warrant further investigation with fine mapping for causative polymorphisms in American Indians. PMID:19429595

  14. Family-based association analyses of imputed genotypes reveal genome-wide significant association of Alzheimer's disease with OSBPL6, PTPRG, and PDCL3.

    PubMed

    Herold, C; Hooli, B V; Mullin, K; Liu, T; Roehr, J T; Mattheisen, M; Parrado, A R; Bertram, L; Lange, C; Tanzi, R E

    2016-11-01

    The genetic basis of Alzheimer's disease (AD) is complex and heterogeneous. Over 200 highly penetrant pathogenic variants in the genes APP, PSEN1, and PSEN2 cause a subset of early-onset familial AD. On the other hand, susceptibility to late-onset forms of AD (LOAD) is indisputably associated to the ɛ4 allele in the gene APOE, and more recently to variants in more than two-dozen additional genes identified in the large-scale genome-wide association studies (GWAS) and meta-analyses reports. Taken together however, although the heritability in AD is estimated to be as high as 80%, a large proportion of the underlying genetic factors still remain to be elucidated. In this study, we performed a systematic family-based genome-wide association and meta-analysis on close to 15 million imputed variants from three large collections of AD families (~3500 subjects from 1070 families). Using a multivariate phenotype combining affection status and onset age, meta-analysis of the association results revealed three single nucleotide polymorphisms (SNPs) that achieved genome-wide significance for association with AD risk: rs7609954 in the gene PTPRG (P-value=3.98 × 10(-8)), rs1347297 in the gene OSBPL6 (P-value=4.53 × 10(-8)), and rs1513625 near PDCL3 (P-value=4.28 × 10(-8)). In addition, rs72953347 in OSBPL6 (P-value=6.36 × 10(-7)) and two SNPs in the gene CDKAL1 showed marginally significant association with LOAD (rs10456232, P-value=4.76 × 10(-7); rs62400067, P-value=3.54 × 10(-7)). In summary, family-based GWAS meta-analysis of imputed SNPs revealed novel genomic variants in (or near) PTPRG, OSBPL6, and PDCL3 that influence risk for AD with genome-wide significance.

  15. Heritability and Genome-Wide Association Studies for Hair Color in a Dutch Twin Family Based Sample

    PubMed Central

    Lin, Bochao Danae; Mbarek, Hamdi; Willemsen, Gonneke; Dolan, Conor V.; Fedko, Iryna O.; Abdellaoui, Abdel; de Geus, Eco J.; Boomsma, Dorret I.; Hottenga, Jouke-Jan

    2015-01-01

    Hair color is one of the most visible and heritable traits in humans. Here, we estimated heritability by structural equation modeling (N = 20,142), and performed a genome wide association (GWA) analysis (N = 7091) and a GCTA study (N = 3340) on hair color within a large cohort of twins, their parents and siblings from the Netherlands Twin Register (NTR). Self-reported hair color was analyzed as five binary phenotypes, namely “blond versus non-blond”, “red versus non-red”, “brown versus non-brown”, “black versus non-black”, and “light versus dark”. The broad-sense heritability of hair color was estimated between 73% and 99% and the genetic component included non-additive genetic variance. Assortative mating for hair color was significant, except for red and black hair color. From GCTA analyses, at most 24.6% of the additive genetic variance in hair color was explained by 1000G well-imputed SNPs. Genome-wide association analysis for each hair color showed that SNPs in the MC1R region were significantly associated with red, brown and black hair, and also with light versus dark hair color. Five other known genes (HERC2, TPCN2, SLC24A4, IRF4, and KITLG) gave genome-wide significant hits for blond, brown and light versus dark hair color. We did not find and replicate any new loci for hair color. PMID:26184321

  16. Heritability and Genome-Wide Association Studies for Hair Color in a Dutch Twin Family Based Sample.

    PubMed

    Lin, Bochao Danae; Mbarek, Hamdi; Willemsen, Gonneke; Dolan, Conor V; Fedko, Iryna O; Abdellaoui, Abdel; de Geus, Eco J; Boomsma, Dorret I; Hottenga, Jouke-Jan

    2015-07-13

    Hair color is one of the most visible and heritable traits in humans. Here, we estimated heritability by structural equation modeling (N = 20,142), and performed a genome wide association (GWA) analysis (N = 7091) and a GCTA study (N = 3340) on hair color within a large cohort of twins, their parents and siblings from the Netherlands Twin Register (NTR). Self-reported hair color was analyzed as five binary phenotypes, namely "blond versus non-blond", "red versus non-red", "brown versus non-brown", "black versus non-black", and "light versus dark". The broad-sense heritability of hair color was estimated between 73% and 99% and the genetic component included non-additive genetic variance. Assortative mating for hair color was significant, except for red and black hair color. From GCTA analyses, at most 24.6% of the additive genetic variance in hair color was explained by 1000G well-imputed SNPs. Genome-wide association analysis for each hair color showed that SNPs in the MC1R region were significantly associated with red, brown and black hair, and also with light versus dark hair color. Five other known genes (HERC2, TPCN2, SLC24A4, IRF4, and KITLG) gave genome-wide significant hits for blond, brown and light versus dark hair color. We did not find and replicate any new loci for hair color.

  17. Genome-Wide Linkage Analysis of Quantitative Biomarker Traits of Osteoarthritis in a Large Multigenerational Extended Family

    PubMed Central

    Chen, Hsiang-Cheng; Kraus, Virginia Byers; Li, Yi-Ju; Nelson, Sarah; Haynes, Carol; Johnson, Jessica; Stabler, Thomas; Hauser, Elizabeth R.; Gregory, Simon G.; Kraus, William E.; Shah, Svati H.

    2013-01-01

    Objective The genetic contributions to the multifactorial disorder osteoarthritis (OA) have been increasingly recognized. Our goal was to use OA-related biomarkers of severity and disease burden as quantitative traits to identify genetic susceptibility loci for OA. Methods In a large multigenerational extended family (CARRIAGE family, n=350), we measured five OA-related biomarkers: HA (hyaluronan), COMP (cartilage oligomeric matrix protein), PIIANP (type IIA collagen N-propeptide), CPII (type II procollagen carboxy-propeptide), and C2C (type II collagen cleavage neoepitope). SNP markers (6,090) covering the whole genome were genotyped using the Illumina HumanLinkage-12 BeadChip. Variance components analysis as implemented in SOLAR was used to estimate heritabilities of the quantitative traits, and to calculate two-point and multi-point LOD scores using a polygenic model. Results Four of the five biomarkers showed significant heritability (p<0.01 age and sex adjusted h2r: PIIANP 0.57, HA 0.49, COMP 0.43, C2C 0.30). Fourteen of the 19 loci with multi-point LOD scores >1.5 were near or overlapped previously reported OA susceptibility loci. Four of these loci were identified by more than one biomarker. The maximum multi-point LOD scores for the heritable quantitative biomarker traits were LOD 4.3 for PIIANP (chromosome 8p23.2); LOD 3.2 for COMP (chromosome 8q11.1); LOD 2.0 for HA (chromosome 6q16.3); LOD 2.0 for C2C (chromosome 5q31.2). Conclusions We report the first evidence of genetic susceptibility loci identified by OA-related biomarkers in an extended family. Serum concentrations of PIIANP, HA, COMP and C2C have substantial heritable components and identified several genetic loci potentially contributing to the genetic diversity of OA. PMID:20187133

  18. Genome-wide identification and analysis of FK506-binding protein family gene family in strawberry (Fragaria × ananassa).

    PubMed

    Leng, Xiangpeng; Liu, Dan; Zhao, Mizhen; Sun, Xin; Li, Yu; Mu, Qian; Zhu, Xudong; Li, Pengyu; Fang, Jinggui

    2014-01-25

    The FK506 binding proteins (FKBPs) are abundant and ubiquitous proteins belonging to the large peptidyl-prolylcis-trans isomerase superfamily. FKBPs are known to be involved in many biological processes including hormone signaling, plant growth, and stress responses through a chaperone or an isomerization of proline residues during protein folding. The availability of complete strawberry genome sequences allowed the identification of 23 FKBP genes by HMMER and blast analysis. Chromosome scaffold locations of these FKBP genes in the strawberry genome were determined and the protein domain and motif organization of FaFKBPs analyzed. The phylogenetic relationships between strawberry FKBPs were also assessed. The expression profiles of FaFKBPs genes results revealed that most FaFKBPs were expressed in all tissues, while a few FaFKBPs were specifically expressed in some of the tissues. These data not only contribute to some better understanding of the complex regulation of the strawberry FKBP gene family, but also provide valuable information for further research in strawberry functional genomics.

  19. Investigation of transferability of BovineSNP50 BeadChip from cattle to water buffalo for genome wide association study.

    PubMed

    Wu, Jun Jing; Song, Li Jun; Wu, Fang Jie; Liang, Xian Wei; Yang, Bing Zhuang; Wathes, D Claire; Pollott, Geoff E; Cheng, Zhangrui; Shi, De Shun; Liu, Qing You; Yang, Li Guo; Zhang, Shu Jun

    2013-02-01

    Cattle and water buffalo belong to the same subfamily Bovinae and share chromosome banding and gene order homology. In this study, we used genome-wide Illumina BovineSNP50 BeadChip to analyze 91 DNA samples from three breeds of water buffalo (Nili-Ravi, Murrah and their crossbred with local GuangXi buffalos in China), to demonstrate the genetic divergence between cattle and water buffalo through a large single nucleotide polymorphism (SNP) transferability study at the whole genome level, and performed association analysis of functional traits in water buffalo as well. A total of 40,766 (75.5 %) bovine SNPs were found in the water buffalo genome, but 49,936 (92.5 %) were with only one allele, and finally 935 were identified to be polymorphic and useful for association analysis in water buffalo. Therefore, the genome sequences of water buffalo and cattle shared a high level of homology but the polymorphic status of the bovine SNPs varied between these two species. The different patterns of mutations between species may associate with their phenotypic divergence due to genome evolution. Among 935 bovine SNPs, we identified a total of 9 and 7 SNPs significantly associated to fertility and milk production traits in water buffalo, respectively. However, more works in larger sample size are needed in future to verify these candidate SNPs for water buffalo.

  20. Whole genome-wide association study using affymetrix SNP chip: a two-stage sequential selection method to identify genes that increase the risk of developing complex diseases.

    PubMed

    Yang, Howard H; Hu, Nan; Taylor, Philip R; Lee, Maxwell P

    2008-01-01

    Whole-genome association studies of complex diseases hold great promise to identify systematically genetic loci that influence one's risk of developing these diseases. However, the polygenic nature of the complex diseases and genetic interactions among the genes pose significant challenge in both experimental design and data analysis. High-density genotype data make it possible to identify most of the genetic loci that may be involved in the etiology. On the other hand, utilizing large number of statistic tests could lead to false positives if the tests are not adequately adjusted. In this paper, we discuss a two-stage method that sequentially applies a generalized linear model (GLM) and principal components analysis (PCA) to identify genetic loci that jointly determine the likelihood of developing disease. The method was applied to a pilot case-control study of esophageal squamous cell carcinoma (ESCC) that included 50 ESCC patients and 50 neighborhood-matched controls. Genotype data were determined by using the Affymetrix 10K SNP chip. We will discuss some of the special considerations that are important to the proper interpretation of whole genome-wide association studies, which include multiple comparisons, epistatic interaction among multiple genetic loci, and generalization of predictive models.

  1. Genome-wide linkage analysis of QTL for growth and body composition employing the PorcineSNP60 BeadChip

    PubMed Central

    2012-01-01

    Background The traditional strategy to map QTL is to use linkage analysis employing a limited number of markers. These analyses report wide QTL confidence intervals, making very difficult to identify the gene and polymorphisms underlying the QTL effects. The arrival of genome-wide panels of SNPs makes available thousands of markers increasing the information content and therefore the likelihood of detecting and fine mapping QTL regions. The aims of the current study are to confirm previous QTL regions for growth and body composition traits in different generations of an Iberian x Landrace intercross (IBMAP) and especially identify new ones with narrow confidence intervals by employing the PorcineSNP60 BeadChip in linkage analyses. Results Three generations (F3, Backcross 1 and Backcross 2) of the IBMAP and their related animals were genotyped with PorcineSNP60 BeadChip. A total of 8,417 SNPs equidistantly distributed across autosomes were selected after filtering by quality, position and frequency to perform the QTL scan. The joint and separate analyses of the different IBMAP generations allowed confirming QTL regions previously identified in chromosomes 4 and 6 as well as new ones mainly for backfat thickness in chromosomes 4, 5, 11, 14 and 17 and shoulder weight in chromosomes 1, 2, 9 and 13; and many other to the chromosome-wide signification level. In addition, most of the detected QTLs displayed narrow confidence intervals, making easier the selection of positional candidate genes. Conclusions The use of higher density of markers has allowed to confirm results obtained in previous QTL scans carried out with microsatellites. Moreover several new QTL regions have been now identified in regions probably not covered by markers in previous scans, most of these QTLs displayed narrow confidence intervals. Finally, prominent putative biological and positional candidate genes underlying those QTL effects are listed based on recent porcine genome annotation. PMID

  2. Five blood pressure loci identified by an updated genome-wide linkage scan: meta-analysis of the Family Blood Pressure Program.

    PubMed

    Simino, Jeannette; Shi, Gang; Kume, Rezart; Schwander, Karen; Province, Michael A; Gu, C Charles; Kardia, Sharon; Chakravarti, Aravinda; Ehret, Georg; Olshen, Richard A; Turner, Stephen T; Ho, Low-Tone; Zhu, Xiaofeng; Jaquish, Cashell; Paltoo, Dina; Cooper, Richard S; Weder, Alan; Curb, J David; Boerwinkle, Eric; Hunt, Steven C; Rao, Dabeeru C

    2011-03-01

    A preliminary genome-wide linkage analysis of blood pressure in the Family Blood Pressure Program (FBPP) was reported previously. We harnessed the power and ethnic diversity of the final pooled FBPP dataset to identify novel loci for blood pressure thereby enhancing localization of genes containing less common variants with large effects on blood pressure levels and hypertension. We performed one overall and 4 race-specific meta-analyses of genome-wide blood pressure linkage scans using data on 4,226 African-American, 2,154 Asian, 4,229 Caucasian, and 2,435 Mexican-American participants (total N = 13,044). Variance components models were fit to measured (raw) blood pressure levels and two types of antihypertensive medication adjusted blood pressure phenotypes within each of 10 subgroups defined by race and network. A modified Fisher's method was used to combine the P values for each linkage marker across the 10 subgroups. Five quantitative trait loci (QTLs) were detected on chromosomes 6p22.3, 8q23.1, 20q13.12, 21q21.1, and 21q21.3 based on significant linkage evidence (defined by logarithm of odds (lod) score ≥3) in at least one meta-analysis and lod scores ≥1 in at least 2 subgroups defined by network and race. The chromosome 8q23.1 locus was supported by Asian-, Caucasian-, and Mexican-American-specific meta-analyses. The new QTLs reported justify new candidate gene studies. They may help support results from genome-wide association studies (GWAS) that fall in these QTL regions but fail to achieve the genome-wide significance.

  3. Integration of Genome-Wide Computation DRE Search, AhR ChIP-chip and Gene Expression Analyses of TCDD-Elicited Responses in the Mouse Liver

    PubMed Central

    2011-01-01

    Background The aryl hydrocarbon receptor (AhR) is a ligand-activated transcription factor (TF) that mediates responses to 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD). Integration of TCDD-induced genome-wide AhR enrichment, differential gene expression and computational dioxin response element (DRE) analyses further elucidate the hepatic AhR regulatory network. Results Global ChIP-chip and gene expression analyses were performed on hepatic tissue from immature ovariectomized mice orally gavaged with 30 μg/kg TCDD. ChIP-chip analysis identified 14,446 and 974 AhR enriched regions (1% false discovery rate) at 2 and 24 hrs, respectively. Enrichment density was greatest in the proximal promoter, and more specifically, within ± 1.5 kb of a transcriptional start site (TSS). AhR enrichment also occurred distal to a TSS (e.g. intergenic DNA and 3' UTR), extending the potential gene expression regulatory roles of the AhR. Although TF binding site analyses identified over-represented DRE sequences within enriched regions, approximately 50% of all AhR enriched regions lacked a DRE core (5'-GCGTG-3'). Microarray analysis identified 1,896 number of TCDD-responsive genes (|fold change| ≥ 1.5, P1(t) > 0.999). Integrating this gene expression data with our ChIP-chip and DRE analyses only identified 625 differentially expressed genes that involved an AhR interaction at a DRE. Functional annotation analysis of differentially regulated genes associated with AhR enrichment identified overrepresented processes related to fatty acid and lipid metabolism and transport, and xenobiotic metabolism, which are consistent with TCDD-elicited steatosis in the mouse liver. Conclusions Details of the AhR regulatory network have been expanded to include AhR-DNA interactions within intragenic and intergenic genomic regions. Moreover, the AhR can interact with DNA independent of a DRE core suggesting there are alternative mechanisms of AhR-mediated gene regulation. PMID:21762485

  4. Genome-wide analysis of tandem repeats in Tribolium castaneum genome reveals abundant and highly dynamic tandem repeat families with satellite DNA features in euchromatic chromosomal arms.

    PubMed

    Pavlek, Martina; Gelfand, Yevgeniy; Plohl, Miroslav; Meštrović, Nevenka

    2015-12-01

    Although satellite DNAs are well-explored components of heterochromatin and centromeres, little is known about emergence, dispersal and possible impact of comparably structured tandem repeats (TRs) on the genome-wide scale. Our bioinformatics analysis of assembled Tribolium castaneum genome disclosed significant contribution of TRs in euchromatic chromosomal arms and clear predominance of satellite DNA-typical 170 bp monomers in arrays of ≥5 repeats. By applying different experimental approaches, we revealed that the nine most prominent TR families Cast1-Cast9 extracted from the assembly comprise ∼4.3% of the entire genome and reside almost exclusively in euchromatic regions. Among them, seven families that build ∼3.9% of the genome are based on ∼170 and ∼340 bp long monomers. Results of phylogenetic analyses of 2500 monomers originating from these families show high-sequence dynamics, evident by extensive exchanges between arrays on non-homologous chromosomes. In addition, our analysis shows that concerted evolution acts more efficiently on longer than on shorter arrays. Efficient genome-wide distribution of nine TR families implies the role of transposition only in expansion of the most dispersed family, and involvement of other mechanisms is anticipated. Despite similarities in sequence features, FISH experiments indicate high-level compartmentalization of centromeric and euchromatic tandem repeats.

  5. Genome-wide linkage analysis of quantitative biomarker traits of osteoarthritis in a large, multigenerational extended family.

    PubMed

    Chen, Hsiang-Cheng; Kraus, Virginia Byers; Li, Yi-Ju; Nelson, Sarah; Haynes, Carol; Johnson, Jessica; Stabler, Thomas; Hauser, Elizabeth R; Gregory, Simon G; Kraus, William E; Shah, Svati H

    2010-03-01

    The genetic contributions to the multifactorial disorder osteoarthritis (OA) have been increasingly recognized. The goal of the current study was to use OA-related biomarkers of severity and disease burden as quantitative traits to identify genetic susceptibility loci for OA. In a large multigenerational extended family (n = 350), we measured 5 OA-related biomarkers: hyaluronan (HA), cartilage oligomeric matrix protein (COMP), N-propeptide of type IIA collagen (PIIANP), C-propeptide of type II procollagen (CPII), and type II collagen neoepitope (C2C). Single-nucleotide polymorphism markers (n = 6,090) covering the whole genome were genotyped using the Illumina HumanLinkage-12 BeadChip. Variance components analysis, as implemented in the Sequential Oligogenic Linkage Analysis Routines, was used to estimate heritabilities of the quantitative traits and to calculate 2-point and multipoint logarithm of odds (LOD) scores using a polygenic model. After adjusting for age and sex, we found that 4 of the 5 biomarkers exhibited significant heritability (PIIANP 0.57, HA 0.49, COMP 0.43, C2C 0.30; P < or = 0.01 for all). Fourteen of the 19 loci that had multipoint LOD scores of >1.5 were near to or overlapped with previously reported OA susceptibility loci. Four of these loci were identified by more than 1 biomarker. The maximum multipoint LOD scores for the heritable quantitative biomarker traits were 4.3 for PIIANP (chromosome 8p23.2), 3.2 for COMP (chromosome 8q11.1), 2.0 for HA (chromosome 6q16.3), and 2.0 for C2C (chromosome 5q31.2). Herein, we report the first evidence of genetic susceptibility loci identified by OA-related biomarkers in an extended family. Our results demonstrate that serum concentrations of PIIANP, HA, COMP, and C2C have substantial heritable components, and using these biomarkers, several genetic loci potentially contributing to the genetic diversity of OA were identified.

  6. Genome-wide identification and characterization of Glyceraldehyde-3-phosphate dehydrogenase genes family in wheat (Triticum aestivum).

    PubMed

    Zeng, Lingfeng; Deng, Rong; Guo, Ziping; Yang, Shushen; Deng, Xiping

    2016-03-16

    Glyceraldehyde-3-phosphate dehydrogenase (GAPDH) is a central enzyme in glycolysi, we performed genome-wide identification of GAPDH genes in wheat and analyzed their structural characteristics and expression patterns under abiotic stress in wheat. A total of 22 GAPDH genes were identified in wheat cv. Chinese spring; the phylogenetic and structure analysis showed that these GAPDH genes could be divided into four distinct subfamilies. The expression profiles of GAPDH genes showed tissue specificity all over plant development stages. The qRT-PCR results revealed that wheat GAPDHs were involved in several abiotic stress response. Wheat carried 22 GAPDH genes, representing four types of plant GAPDHs (gapA/B, gapC, gapCp and gapN). Whole genome duplication and segmental duplication might account for the expansion of wheat GAPDHs. Expression analysis implied that GAPDHs play roles in plants abiotic stress tolerance.

  7. Family-based Association Analyses of Imputed Genotypes Reveal Genome-Wide Significant Association of Alzheimer’s disease with OSBPL6, PTPRG and PDCL3

    PubMed Central

    Herold, Christine; Hooli, Basavaraj V.; Mullin, Kristina; Liu, Tian; Roehr, Johannes T; Mattheisen, Manuel; Parrado, Antonio R.; Bertram, Lars; Lange, Christoph; Tanzi, Rudolph E.

    2015-01-01

    The genetic basis of Alzheimer's disease (AD) is complex and heterogeneous. Over 200 highly penetrant pathogenic variants in the genes APP, PSEN1 and PSEN2 cause a subset of early-onset familial Alzheimer's disease (EOFAD). On the other hand, susceptibility to late-onset forms of AD (LOAD) is indisputably associated to the ε4 allele in the gene APOE, and more recently to variants in more than two-dozen additional genes identified in the large-scale genome-wide association studies (GWAS) and meta-analyses reports. Taken together however, although the heritability in AD is estimated to be as high as 80%, a large proportion of the underlying genetic factors still remain to be elucidated. In this study we performed a systematic family-based genome-wide association and meta-analysis on close to 15 million imputed variants from three large collections of AD families (~3,500 subjects from 1,070 families). Using a multivariate phenotype combining affection status and onset age, meta-analysis of the association results revealed three single nucleotide polymorphisms (SNPs) that achieved genome-wide significance for association with AD risk: rs7609954 in the gene PTPRG (P-value = 3.98·10−08), rs1347297 in the gene OSBPL6 (P-value = 4.53·10−08), and rs1513625 near PDCL3 (P-value = 4.28·10−08). In addition, rs72953347 in OSBPL6 (P-value = 6.36·10−07) and two SNPs in the gene CDKAL1 showed marginally significant association with LOAD (rs10456232, P-value: 4.76·10−07; rs62400067, P-value: 3.54·10−07). In summary, family-based GWAS meta-analysis of imputed SNPs revealed novel genomic variants in (or near) PTPRG, OSBPL6, and PDCL3 that influence risk for AD with genome-wide significance. PMID:26830138

  8. A genome-wide analysis of the ASYMMETRIC LEAVES2/LATERAL ORGAN BOUNDARIES (AS2/LOB) gene family in barley (Hordeum vulgare L.)* #

    PubMed Central

    Guo, Bao-jian; Wang, Jun; Lin, Shen; Tian, Zheng; Zhou, Kai; Luan, Hai-ye; Lyu, Chao; Zhang, Xin-zhong; Xu, Ru-gen

    2016-01-01

    ASYMMETRIC LEAVES2/LATERAL ORGAN BOUNDARIES (AS2/LOB) genes are a family of plant specific transcription factors, which play an important role in the regulation of plant lateral organ development and metabolism. However, a genome-wide analysis of the AS2/LOB gene family is still not available for barley. In the present study, 24 AS2-like (ASL)/LOB domain (LBD) genes were identified based on the barley (Hordeum vulgare L.) genome sequence. A phylogenetic tree of ASL/LBD proteins from barley, Arabidopsis, maize, and rice was constructed. The ASL/LBD genes were classified into two classes, class I and class II, which were divided into five and two subgroups, respectively. Genes homologous in barley and Arabidopsis were analyzed. In addition, the structure and chromosomal locations of the genes were analyzed. Expression profiles indicated that barley HvASL/LBD genes exhibit a variety of expression patterns, suggesting that they are involved in various aspects of physiological and developmental processes. This genome-wide analysis of the barley AS2/LOB gene family contributes to our understanding of the functions of the AS2/LOB gene family. PMID:27704746

  9. A genome-wide analysis of the ASYMMETRIC LEAVES2/LATERAL ORGAN BOUNDARIES (AS2/LOB) gene family in barley (Hordeum vulgare L.).

    PubMed

    Guo, Bao-Jian; Wang, Jun; Lin, Shen; Tian, Zheng; Zhou, Kai; Luan, Hai-Ye; Lyu, Chao; Zhang, Xin-Zhong; Xu, Ru-Gen

    ASYMMETRIC LEAVES2/LATERAL ORGAN BOUNDARIES (AS2/LOB) genes are a family of plant specific transcription factors, which play an important role in the regulation of plant lateral organ development and metabolism. However, a genome-wide analysis of the AS2/LOB gene family is still not available for barley. In the present study, 24 AS2-like (ASL)/LOB domain (LBD) genes were identified based on the barley (Hordeum vulgare L.) genome sequence. A phylogenetic tree of ASL/LBD proteins from barley, Arabidopsis, maize, and rice was constructed. The ASL/LBD genes were classified into two classes, class I and class II, which were divided into five and two subgroups, respectively. Genes homologous in barley and Arabidopsis were analyzed. In addition, the structure and chromosomal locations of the genes were analyzed. Expression profiles indicated that barley HvASL/LBD genes exhibit a variety of expression patterns, suggesting that they are involved in various aspects of physiological and developmental processes. This genome-wide analysis of the barley AS2/LOB gene family contributes to our understanding of the functions of the AS2/LOB gene family.

  10. Genome-wide scan in Portuguese Island families implicates multiple loci in bipolar disorder: fine mapping adds support on chromosomes 6 and 11.

    PubMed

    Pato, Carlos N; Pato, M T; Kirby, A; Petryshen, T L; Medeiros, H; Carvalho, C; Macedo, A; Dourado, A; Coelho, I; Valente, J; Soares, M J; Ferreira, C P; Lei, M; Verner, A; Hudson, T J; Morley, C P; Kennedy, J L; Azevedo, M H; Daly, M J; Sklar, P

    2004-05-15

    As part of an extensive study in the Portuguese Island population of families with multiple patients suffering from bipolar disorder and schizophrenia, we performed an initial genome-wide scan of 16 extended families with bipolar disorder that identified three regions on chromosomes 2, 11, and 19 with genome-wide suggestive linkage and several other regions, including chromosome 6q, also approached suggestive levels of significance. Dick et al. [2003: Am J Hum Genet 73:107-114] recently reported in a study of 250 families with bipolar disorder a maxLOD score of 3.61 near marker D6S1021 on chromosome 6q. This study replicates this finding having detected a peak NPL = 2.02 (P = 0.025) with the same marker D6S1021(104.7 Mb). Higher-density mapping provided additional support for loci on chromosome 6 including marker D6S1021 with an NPL = 2.59 (P = 0.0068) and peaking at marker D6S1639 (125 Mb) with an NPL = 3.06 (P = 0.0019). A similar pattern was detected with higher-density mapping of chromosome 11 with an NPL = 3.15 (P = 0.0014) at marker D11S1883 (63.1 Mb). Simulations at the density of our fine mapping data indicate that less than 1 scan out of 10 would find two such scores genome-wide in the same scan by chance. Our findings provide additional support for a susceptibility locus for bipolar disorder on 6q, as well as, suggesting the importance of denser scans. Published 2004 Wiley-Liss, Inc.

  11. Genome-Wide Association of Familial Late-Onset Alzheimer's Disease Replicates BIN1 and CLU and Nominates CUGBP2 in Interaction with APOE

    PubMed Central

    Wijsman, Ellen M.; Pankratz, Nathan D.; Choi, Yoonha; Rothstein, Joseph H.; Faber, Kelley M.; Cheng, Rong; Lee, Joseph H.; Bird, Thomas D.; Bennett, David A.; Diaz-Arrastia, Ramon; Goate, Alison M.; Farlow, Martin; Ghetti, Bernardino; Sweet, Robert A.; Foroud, Tatiana M.; Mayeux, Richard

    2011-01-01

    Late-onset Alzheimer's disease (LOAD) is the most common form of dementia in the elderly. The National Institute of Aging-Late Onset Alzheimer's Disease Family Study and the National Cell Repository for Alzheimer's Disease conducted a joint genome-wide association study (GWAS) of multiplex LOAD families (3,839 affected and unaffected individuals from 992 families plus additional unrelated neurologically evaluated normal subjects) using the 610 IlluminaQuad panel. This cohort represents the largest family-based GWAS of LOAD to date, with analyses limited here to the European-American subjects. SNPs near APOE gave highly significant results (e.g., rs2075650, p = 3.2×10−81), but no other genome-wide significant evidence for association was obtained in the full sample. Analyses that stratified on APOE genotypes identified SNPs on chromosome 10p14 in CUGBP2 with genome-wide significant evidence for association within APOE ε4 homozygotes (e.g., rs201119, p = 1.5×10−8). Association in this gene was replicated in an independent sample consisting of three cohorts. There was evidence of association for recently-reported LOAD risk loci, including BIN1 (rs7561528, p = 0.009 with, and p = 0.03 without, APOE adjustment) and CLU (rs11136000, p = 0.023 with, and p = 0.008 without, APOE adjustment), with weaker support for CR1. However, our results provide strong evidence that association with PICALM (rs3851179, p = 0.69 with, and p = 0.039 without, APOE adjustment) and EXOC3L2 is affected by correlation with APOE, and thus may represent spurious association. Our results indicate that genetic structure coupled with ascertainment bias resulting from the strong APOE association affect genome-wide results and interpretation of some recently reported associations. We show that a locus such as APOE, with large effects and strong association with disease, can lead to samples that require appropriate adjustment for this locus to avoid both false

  12. Genome-wide analysis of CrRLK1L gene family in Gossypium and identification of candidate CrRLK1L genes related to fiber development.

    PubMed

    Niu, Erli; Cai, Caiping; Zheng, Yongjie; Shang, Xiaoguang; Fang, Lei; Guo, Wangzhen

    2016-06-01

    Members of the CrRLK1L family, a subgroup of the receptor-like kinase (RLK) gene family, are thought to act as sensors for the integrity of the cell wall and regulators of polar elongation. To better understand the various functions in fiber development, we conducted genome-wide identification and characterization analyses of CrRLK1L family in cotton. Here 44, 40, and 79 CrRLK1L genes were identified from three cotton species: diploid G. raimondii (D5), diploid G. arboreum (A2), and tetraploid G. hirsutum TM-1 (AD1), respectively. The 44 CrRLK1Ls in G. raimondii were anchored to the 12 chromosomes unevenly and were classified into six groups (I-VI), with group II and group IV being further divided into two subgroups (groups IIa and IIb, and IVa and IVb, respectively). These CrRLK1Ls displayed a highly regular pattern of developmental and spatial regulation in cotton. Using the transcriptome data of five chromosomal segment introgression lines (CSILs) and the physical integration of CrRLK1Ls with the quantitative trait loci (QTLs) related to fiber quality traits, we revealed that six CrRLK1L genes were highly associated with fiber development. This study brings new insights into the integrated genome-wide identification of CrRLK1Ls in cotton and provides references for the genetic improvement of cotton fiber.

  13. Genome-wide characterisation and expression profile of the grapevine ATL ubiquitin ligase family reveal biotic and abiotic stress-responsive and development-related members

    PubMed Central

    Ariani, Pietro; Regaiolo, Alice; Lovato, Arianna; Giorgetti, Alejandro; Porceddu, Andrea; Camiolo, Salvatore; Wong, Darren; Castellarin, Simone; Vandelle, Elodie; Polverari, Annalisa

    2016-01-01

    The Arabidopsis Tóxicos en Levadura (ATL) protein family is a class of E3 ubiquitin ligases with a characteristic RING-H2 Zn-finger structure that mediates diverse physiological processes and stress responses in plants. We carried out a genome-wide survey of grapevine (Vitis vinifera L.) ATL genes and retrieved 96 sequences containing the canonical ATL RING-H2 domain. We analysed their genomic organisation, gene structure and evolution, protein domains and phylogenetic relationships. Clustering revealed several clades, as already reported in Arabidopsis thaliana and rice (Oryza sativa), with an expanded subgroup of grapevine-specific genes. Most of the grapevine ATL genes lacked introns and were scattered among the 19 chromosomes, with a high level of duplication retention. Expression profiling revealed that some ATL genes are expressed specifically during early or late development and may participate in the juvenile to mature plant transition, whereas others may play a role in pathogen and/or abiotic stress responses, making them key candidates for further functional analysis. Our data offer the first genome-wide overview and annotation of the grapevine ATL family, and provide a basis for investigating the roles of specific family members in grapevine physiology and stress responses, as well as potential biotechnological applications. PMID:27910910

  14. Genome-wide characterisation and expression profile of the grapevine ATL ubiquitin ligase family reveal biotic and abiotic stress-responsive and development-related members.

    PubMed

    Ariani, Pietro; Regaiolo, Alice; Lovato, Arianna; Giorgetti, Alejandro; Porceddu, Andrea; Camiolo, Salvatore; Wong, Darren; Castellarin, Simone; Vandelle, Elodie; Polverari, Annalisa

    2016-12-02

    The Arabidopsis Tóxicos en Levadura (ATL) protein family is a class of E3 ubiquitin ligases with a characteristic RING-H2 Zn-finger structure that mediates diverse physiological processes and stress responses in plants. We carried out a genome-wide survey of grapevine (Vitis vinifera L.) ATL genes and retrieved 96 sequences containing the canonical ATL RING-H2 domain. We analysed their genomic organisation, gene structure and evolution, protein domains and phylogenetic relationships. Clustering revealed several clades, as already reported in Arabidopsis thaliana and rice (Oryza sativa), with an expanded subgroup of grapevine-specific genes. Most of the grapevine ATL genes lacked introns and were scattered among the 19 chromosomes, with a high level of duplication retention. Expression profiling revealed that some ATL genes are expressed specifically during early or late development and may participate in the juvenile to mature plant transition, whereas others may play a role in pathogen and/or abiotic stress responses, making them key candidates for further functional analysis. Our data offer the first genome-wide overview and annotation of the grapevine ATL family, and provide a basis for investigating the roles of specific family members in grapevine physiology and stress responses, as well as potential biotechnological applications.

  15. Genome-wide identification of multifunctional laccase gene family in cotton (Gossypium spp.); expression and biochemical analysis during fiber development.

    PubMed

    Balasubramanian, Vimal Kumar; Rai, Krishan Mohan; Thu, Sandi Win; Hii, Mei Mei; Mendu, Venugopal

    2016-09-29

    The single-celled cotton fibers, produced from seed coat epidermal cells are the largest natural source of textile fibers. The economic value of cotton fiber lies in its length and quality. The multifunctional laccase enzymes play important roles in cell elongation, lignification and pigmentation in plants and could play crucial role in cotton fiber quality. Genome-wide analysis of cultivated allotetraploid (G. hirsutum) and its progenitor diploid (G. arboreum and G. raimondii) cotton species identified 84, 44 and 46 laccase genes, respectively. Analysis of chromosomal location, phylogeny, conserved domain and physical properties showed highly conserved nature of laccases across three cotton species. Gene expression, enzymatic activity and biochemical analysis of developing cotton fibers was performed using G. arboreum species. Of the total 44, 40 laccases showed expression during different stages of fiber development. The higher enzymatic activity of laccases correlated with higher lignin content at 25 DPA (Days Post Anthesis). Further, analysis of cotton fiber phenolic compounds showed an overall decrease at 25 DPA indicating possible incorporation of these substrates into lignin polymer during secondary cell wall biosynthesis. Overall data indicate significant roles of laccases in cotton fiber development, and presents an excellent opportunity for manipulation of fiber development and quality.

  16. Genome-wide identification of multifunctional laccase gene family in cotton (Gossypium spp.); expression and biochemical analysis during fiber development

    PubMed Central

    Balasubramanian, Vimal Kumar; Rai, Krishan Mohan; Thu, Sandi Win; Hii, Mei Mei; Mendu, Venugopal

    2016-01-01

    The single-celled cotton fibers, produced from seed coat epidermal cells are the largest natural source of textile fibers. The economic value of cotton fiber lies in its length and quality. The multifunctional laccase enzymes play important roles in cell elongation, lignification and pigmentation in plants and could play crucial role in cotton fiber quality. Genome-wide analysis of cultivated allotetraploid (G. hirsutum) and its progenitor diploid (G. arboreum and G. raimondii) cotton species identified 84, 44 and 46 laccase genes, respectively. Analysis of chromosomal location, phylogeny, conserved domain and physical properties showed highly conserved nature of laccases across three cotton species. Gene expression, enzymatic activity and biochemical analysis of developing cotton fibers was performed using G. arboreum species. Of the total 44, 40 laccases showed expression during different stages of fiber development. The higher enzymatic activity of laccases correlated with higher lignin content at 25 DPA (Days Post Anthesis). Further, analysis of cotton fiber phenolic compounds showed an overall decrease at 25 DPA indicating possible incorporation of these substrates into lignin polymer during secondary cell wall biosynthesis. Overall data indicate significant roles of laccases in cotton fiber development, and presents an excellent opportunity for manipulation of fiber development and quality. PMID:27679939

  17. Genome-Wide Comparative Analyses Reveal the Dynamic Evolution of Nucleotide-Binding Leucine-Rich Repeat Gene Family among Solanaceae Plants.

    PubMed

    Seo, Eunyoung; Kim, Seungill; Yeom, Seon-In; Choi, Doil

    2016-01-01

    Plants have evolved an elaborate innate immune system against invading pathogens. Within this system, intracellular nucleotide-binding leucine-rich repeat (NLR) immune receptors are known play critical roles in effector-triggered immunity (ETI) plant defense. We performed genome-wide identification and classification of NLR-coding sequences from the genomes of pepper, tomato, and potato using fixed criteria. We then compared genomic duplication and evolution features. We identified intact 267, 443, and 755 NLR-encoding genes in tomato, potato, and pepper genomes, respectively. Phylogenetic analysis and classification of Solanaceae NLRs revealed that the majority of NLR super family members fell into 14 subgroups, including a TIR-NLR (TNL) subgroup and 13 non-TNL subgroups. Specific subgroups have expanded in each genome, with the expansion in pepper showing subgroup-specific physical clusters. Comparative analysis of duplications showed distinct duplication patterns within pepper and among Solanaceae plants suggesting subgroup- or species-specific gene duplication events after speciation, resulting in divergent evolution. Taken together, genome-wide analysis of NLR family members provide insights into their evolutionary history in Solanaceae. These findings also provide important foundational knowledge for understanding NLR evolution and will empower broader characterization of disease resistance genes to be used for crop breeding.

  18. Genome-Wide Comparative Analyses Reveal the Dynamic Evolution of Nucleotide-Binding Leucine-Rich Repeat Gene Family among Solanaceae Plants

    PubMed Central

    Seo, Eunyoung; Kim, Seungill; Yeom, Seon-In; Choi, Doil

    2016-01-01

    Plants have evolved an elaborate innate immune system against invading pathogens. Within this system, intracellular nucleotide-binding leucine-rich repeat (NLR) immune receptors are known play critical roles in effector-triggered immunity (ETI) plant defense. We performed genome-wide identification and classification of NLR-coding sequences from the genomes of pepper, tomato, and potato using fixed criteria. We then compared genomic duplication and evolution features. We identified intact 267, 443, and 755 NLR-encoding genes in tomato, potato, and pepper genomes, respectively. Phylogenetic analysis and classification of Solanaceae NLRs revealed that the majority of NLR super family members fell into 14 subgroups, including a TIR-NLR (TNL) subgroup and 13 non-TNL subgroups. Specific subgroups have expanded in each genome, with the expansion in pepper showing subgroup-specific physical clusters. Comparative analysis of duplications showed distinct duplication patterns within pepper and among Solanaceae plants suggesting subgroup- or species-specific gene duplication events after speciation, resulting in divergent evolution. Taken together, genome-wide analysis of NLR family members provide insights into their evolutionary history in Solanaceae. These findings also provide important foundational knowledge for understanding NLR evolution and will empower broader characterization of disease resistance genes to be used for crop breeding. PMID:27559340

  19. Genome-wide investigation and expression analysis of AP2-ERF gene family in salt tolerant common bean

    PubMed Central

    Kavas, Musa; Kizildogan, Aslihan; Gökdemir, Gökhan; Baloglu, Mehmet Cengiz

    2015-01-01

    Apetala2-ethylene-responsive element binding factor (AP2-ERF) superfamily with common AP2-DNA binding domain have developmentally and physiologically important roles in plants. Since common bean genome project has been completed recently, it is possible to identify all of the AP2-ERF genes in the common bean genome. In this study, a comprehensive genome-wide in silico analysis identified 180 AP2-ERF superfamily genes in common bean (Phaseolus vulgaris). Based on the amino acid alignment and phylogenetic analyses, superfamily members were classified into four subfamilies: DREB (54), ERF (95), AP2 (27) and RAV (3), as well as one soloist. The physical and chemical characteristics of amino acids, interaction between AP2-ERF proteins, cis elements of promoter region of AP2-ERF genes and phylogenetic trees were predicted and analyzed. Additionally, expression levels of AP2-ERF genes were evaluated by in silico and qRT-PCR analyses. In silico micro-RNA target transcript analyses identified nearly all PvAP2-ERF genes as targets of by 44 different plant species' miRNAs were identified in this study. The most abundant target genes were PvAP2/ERF-20-25-62-78-113-173. miR156, miR172 and miR838 were the most important miRNAs found in targeting and BLAST analyses. Interactome analysis revealed that the transcription factor PvAP2-ERF78, an ortholog of Arabidopsis At2G28550, was potentially interacted with at least 15 proteins, indicating that it was very important in transcriptional regulation. Here we present the first study to identify and characterize the AP2-ERF transcription factors in common bean using whole-genome analysis, and the findings may serve as a references for future functional research on the transcription factors in common bean. PMID:27152109

  20. Genome-wide identification of CBL family and expression analysis of CBLs in response to potassium deficiency in cotton

    PubMed Central

    Sun, Lirong; Wang, Ji

    2017-01-01

    Calcineurin B-like (CBL) proteins, as calcium sensors, play pivotal roles in plant responses to diverse abiotic stresses and in growth and development through interaction with CBL-interacting protein kinases (CIPKs). However, knowledge about functions and evolution of CBLs in Gossypium plants is scarce. Here, we conducted a genome-wide survey and identified 13, 13 and 22 CBL genes in the progenitor diploid Gossypium arboreum and Gossypium raimondii, and the cultivated allotetraploid Gossypium hirsutum, respectively. Analysis of physical properties, chromosomal locations, conserved domains and phylogeny indicated rather conserved nature of CBLs among the three Gossypium species. Moreover, these CBLs have closer genetic evolutionary relationship with the CBLs from cocoa than with those from other plants. Most CBL genes underwent evolution under purifying selection in the three Gossypium plants. Additionally, nearly all G. hirsutum CBL (GhCBL) genes were expressed in the root, stem, leaf, flower and fiber. Many GhCBLs were preferentially expressed in the flower while several GhCBLs were mainly expressed in roots. Expression patterns of GhCBL genes in response to potassium deficiency were also studied. The expression of most GhCBLs were moderately induced in roots after treatments with low-potassium stress. Yeast two-hybrid experiments indicated that GhCBL1-2, GhCBL1-3, GhCBL4-4, GhCBL8, GhCBL9 and GhCBL10-3 interacted with GhCIPK23, respectively. Our results provided a comprehensive view of the CBLs and valuable information for researchers to further investigate the roles and functional mechanisms of the CBLs in Gossypium. PMID:28828254

  1. A genome-wide linkage scan of bipolar disorder in Latino families identifies susceptibility loci at 8q24 and 14q32.

    PubMed

    Gonzalez, Suzanne; Camarillo, Cynthia; Rodriguez, Marco; Ramirez, Mercedes; Zavala, Juan; Armas, Regina; Contreras, Salvador A; Contreras, Javier; Dassori, Albana; Almasy, Laura; Flores, Deborah; Jerez, Alvaro; Raventós, Henriette; Ontiveros, Alfonso; Nicolini, Humberto; Escamilla, Michael

    2014-09-01

    A genome-wide nonparametric linkage screen was performed to localize Bipolar Disorder (BP) susceptibility loci in a sample of 3757 individuals of Latino ancestry. The sample included 963 individuals with BP phenotype (704 relative pairs) from 686 families recruited from the US, Mexico, Costa Rica, and Guatemala. Non-parametric analyses were performed over a 5 cM grid with an average genetic coverage of 0.67 cM. Multipoint analyses were conducted across the genome using non-parametric Kong & Cox LOD scores along with Sall statistics for all relative pairs. Suggestive and significant genome-wide thresholds were calculated based on 1000 simulations. Single-marker association tests in the presence of linkage were performed assuming a multiplicative model with a population prevalence of 2%. We identified two genome-wide significant susceptibly loci for BP at 8q24 and 14q32, and a third suggestive locus at 2q13-q14. Within these three linkage regions, the top associated single marker (rs1847694, P = 2.40 × 10(-5)) is located 195 Kb upstream of DPP10 in Chromosome 2. DPP10 is prominently expressed in brain neuronal populations, where it has been shown to bind and regulate Kv4-mediated A-type potassium channels. Taken together, these results provide additional evidence that 8q24, 14q32, and 2q13-q14 are susceptibly loci for BP and these regions may be involved in the pathogenesis of BP in the Latino population. © 2014 Wiley Periodicals, Inc.

  2. Genome-wide analysis reveals diverged patterns of codon bias, gene expression, and rates of sequence evolution in picea gene families.

    PubMed

    De La Torre, Amanda R; Lin, Yao-Cheng; Van de Peer, Yves; Ingvarsson, Pär K

    2015-03-05

    The recent sequencing of several gymnosperm genomes has greatly facilitated studying the evolution of their genes and gene families. In this study, we examine the evidence for expression-mediated selection in the first two fully sequenced representatives of the gymnosperm plant clade (Picea abies and Picea glauca). We use genome-wide estimates of gene expression (>50,000 expressed genes) to study the relationship between gene expression, codon bias, rates of sequence divergence, protein length, and gene duplication. We found that gene expression is correlated with rates of sequence divergence and codon bias, suggesting that natural selection is acting on Picea protein-coding genes for translational efficiency. Gene expression, rates of sequence divergence, and codon bias are correlated with the size of gene families, with large multicopy gene families having, on average, a lower expression level and breadth, lower codon bias, and higher rates of sequence divergence than single-copy gene families. Tissue-specific patterns of gene expression were more common in large gene families with large gene expression divergence than in single-copy families. Recent family expansions combined with large gene expression variation in paralogs and increased rates of sequence evolution suggest that some Picea gene families are rapidly evolving to cope with biotic and abiotic stress. Our study highlights the importance of gene expression and natural selection in shaping the evolution of protein-coding genes in Picea species, and sets the ground for further studies investigating the evolution of individual gene families in gymnosperms.

  3. Genome Wide Association Studies

    NASA Astrophysics Data System (ADS)

    Sebastiani, Paola; Solovieff, Nadia

    The availability of high throughput technology for parallel genotyping has opened the field of genetics to genome-wide association studies (GWAS). These studies generate massive amount of genetic data that challenge investigators with issues related to data management, statistical analysis of large data sets, visualization, and annotation of results. We will review the common approach to analysis of GWAS data and then discuss options to learn more from these data.

  4. Exome Sequencing and Genome-Wide Linkage Analysis in 17 Families Illustrates the Complex Contribution of TTN Truncating Variants to Dilated Cardiomyopathy

    PubMed Central

    Norton, Nadine; Li, Duanxiang; Rampersaud, Evadnie; Morales, Ana; Martin, Eden R.; Zuchner, Stephan; Guo, Shengru; Gonzalez, Michael; Hedges, Dale J.; Robertson, Peggy D.; Krumm, Niklas; Nickerson, Deborah A.; Hershberger, Ray E.

    2013-01-01

    Background Familial dilated cardiomyopathy is a genetically heterogeneous disease with >30 known genes. TTN truncating variants were recently implicated in a candidate gene study to cause 25% of familial and 18% of sporadic dilated cardiomyopathy (DCM) cases. Methods and Results We used an unbiased genome-wide approach employing both linkage analysis and variant filtering across the exome sequences of 48 individuals affected with DCM from 17 families to identify genetic cause. Linkage analysis ranked the TTN region as falling under the second highest genome-wide multipoint linkage peak, MLOD 1.59. We identified six TTN truncating variants carried by affected with DCM in 7 of 17 DCM families (LOD 2.99); 2 of these 7 families also had novel missense variants segregated with disease. Two additional novel truncating TTN variants did not segregate with DCM. Nucleotide diversity at the TTN locus, including missense variants, was comparable to five other known DCM genes. The average number of missense variants in the exome sequences from the DCM cases or the ~5,400 cases from the Exome Sequencing Project was ~23 per individual. The average number of TTN truncating variants in the Exome Sequencing Project was 0.014 per individual. We also identified a region (chr9q21.11-q22.31) with no known DCM genes with a maximum heterogeneity LOD score of 1.74. Conclusions These data suggest that TTN truncating variants contribute to DCM cause. However, the lack of segregation of all identified TTN truncating variants illustrates the challenge of determining variant pathogenicity even with full exome sequencing. PMID:23418287

  5. Exome sequencing and genome-wide linkage analysis in 17 families illustrate the complex contribution of TTN truncating variants to dilated cardiomyopathy.

    PubMed

    Norton, Nadine; Li, Duanxiang; Rampersaud, Evadnie; Morales, Ana; Martin, Eden R; Zuchner, Stephan; Guo, Shengru; Gonzalez, Michael; Hedges, Dale J; Robertson, Peggy D; Krumm, Niklas; Nickerson, Deborah A; Hershberger, Ray E

    2013-04-01

    BACKGROUND- Familial dilated cardiomyopathy (DCM) is a genetically heterogeneous disease with >30 known genes. TTN truncating variants were recently implicated in a candidate gene study to cause 25% of familial and 18% of sporadic DCM cases. METHODS AND RESULTS- We used an unbiased genome-wide approach using both linkage analysis and variant filtering across the exome sequences of 48 individuals affected with DCM from 17 families to identify genetic cause. Linkage analysis ranked the TTN region as falling under the second highest genome-wide multipoint linkage peak, multipoint logarithm of odds, 1.59. We identified 6 TTN truncating variants carried by individuals affected with DCM in 7 of 17 DCM families (logarithm of odds, 2.99); 2 of these 7 families also had novel missense variants that segregated with disease. Two additional novel truncating TTN variants did not segregate with DCM. Nucleotide diversity at the TTN locus, including missense variants, was comparable with 5 other known DCM genes. The average number of missense variants in the exome sequences from the DCM cases or the ≈5400 cases from the Exome Sequencing Project was ≈23 per individual. The average number of TTN truncating variants in the Exome Sequencing Project was 0.014 per individual. We also identified a region (chr9q21.11-q22.31) with no known DCM genes with a maximum heterogeneity logarithm of odds score of 1.74. CONCLUSIONS- These data suggest that TTN truncating variants contribute to DCM cause. However, the lack of segregation of all identified TTN truncating variants illustrates the challenge of determining variant pathogenicity even with full exome sequencing.

  6. Genome-wide analysis and identification of KT/HAK/KUP potassium transporter gene family in peach (Prunus persica).

    PubMed

    Song, Z Z; Ma, R J; Yu, M L

    2015-01-30

    The KT/HAK/KUP family members encoding high-affinity potassium (K(+)) transporters mediate K(+) transport across the plasma membranes of plant cells to maintain plant normal growth and metabolic activities. In this paper, we identified 16 potassium transporter genes in the peach (Prunus persica) using the Hidden Markov model scanning strategy and searching the peach genome database. Utilizing the Arabidopsis KT/HAK/KUP family as a reference, phylogenetic analysis indicates that the KT/HAK/KUP family in the peach can be classified into 3 groups. Genomic localization indicated that 16 KT/HAK/KUP family genes were well distributed on 7 scaffolds. Gene structure analysis showed that the KT/HAK/KUP family genes have 6-9 introns. In addition, all of the KT/HAK/KUP family members were hydrophobic proteins; they exhibited similar secondary structure patterns and homologous tertiary structures. Putative cis-elements involved in abiotic stress adaption, Ca(2+) response, light and circadian rhythm regulation, and seed development were observed in the promoters of the KT/HAK/KUP family genes. Subcellular localization prediction indicated that the KT/HAK/KUP members were mainly located in the plasma membrane. Expression levels of the KT/HAK/ KUP family genes were much higher in the fruit and flower than those in the other 7 tissues examined, indicating that the KT/HAK/KUP family genes may have important roles in K(+) uptake and transport, which mainly contribute to flower formation and fruit development in the peach.

  7. Genome-wide identification and analysis of the basic leucine zipper (bZIP) transcription factor gene family in Ustilaginoidea virens.

    PubMed

    Yin, Weixiao; Cui, Peng; Wei, Wei; Lin, Yang; Luo, Chaoxi

    2017-08-25

    The basic leucine zipper (bZIP) transcription factor (TF) family is one of the largest and most diverse TF families widely distributed across the eukaryotes. The bZIP TF family play important roles in growth, development, and response to abiotic or biotic stresses, which have been well characterized in plants, but not in plant pathogenic fungi. In this study, we performed genome-wide and systematic bioinformatics analysis of bZIP genes in the fungus Ustilaginoidea virens, the causal agent of rice false smut disease. We identified 28 bZIP family members in the U. virens genome by searching for the bZIP domain in predicted genes. The gene structures, motifs and phylogenetic relationships were analyzed for bZIP genes in U. virens (UvbZIPs). Together with bZIPs from other two fungi, the bZIP genes can be divided into eight groups according to the phylogenetic relationship. Based on RNA-Seq data, the expression profiles of UvbZIPs at different infection stages were evaluated. Results showed that 17 of the UvbZIPs were up-regulated expression during the infection period. Furthermore, 11 infection-related UvbZIPs was investigated under H2O2 stress and the expression level of eight genes have changed, which confirmed their roles in stress tolerance and pathogenicity. In summary, our genome wide systematic characterization and expression analysis of UvbZIPs provided insights into the molecular functions of these genes in U. virens and provides a reference for other pathogens.

  8. Genome-wide identification and analysis of the growth-regulating factor family in Chinese cabbage (Brassica rapa L. ssp. pekinensis).

    PubMed

    Wang, Fengde; Qiu, Nianwei; Ding, Qian; Li, Jingjuan; Zhang, Yihui; Li, Huayin; Gao, Jianwei

    2014-09-22

    Growth regulating factors (GRFs) have been shown to play important roles in plant growth and development. GRF genes represent a large multigene family in plants. Recently, genome-wide structural and evolutionary analyses of the GRF gene families in Arabidopsis, rice, and maize have been reported. Chinese cabbage (Brassica rapa L. ssp. pekinensis) is one of the most important vegetables for agricultural production, and a full genome assembly for this plant has recently been released. However, to our knowledge, the GRF gene family from Chinese cabbage has not been characterized in detail. In this study, genome-wide analysis was carried out to identify all the GRF genes in Chinese cabbage. Based on the complete Chinese cabbage genome sequence, 17 nonredundant GRF genes, named BrGRFs, were identified and classified into six groups. Phylogenetic analysis of the translated GRF protein sequences from Chinese cabbage, Arabidopsis, and rice indicated that the Chinese cabbage GRF proteins were more closely related to the GRF proteins of Arabidopsis than to those of rice. Expression profile analysis showed that the BrGRF genes had uneven transcript levels in different organs or tissues, and the transcription of most BrGRF genes was induced by gibberellic acid (GA3) treatment. Additionally, over-expression of BrGRF8 in transgenic Arabidopsis plants increased the sizes of the leaves and other organs by regulation of cell proliferation. The data obtained from this investigation will contribute to a better understanding of the characteristics of the GRF gene family in Chinese cabbage, and provide a basis for further studies to investigate GRF protein function during development as well as for Chinese cabbage-breeding programs to improve yield and/or head size.

  9. Genome-wide identification, phylogeny and expression analysis of SUN, OFP and YABBY gene family in tomato.

    PubMed

    Huang, Zejun; Van Houten, Jason; Gonzalez, Geoffrey; Xiao, Han; van der Knaap, Esther

    2013-04-01

    Members of the plant-specific gene families IQD/SUN, OFP and YABBY are thought to play important roles in plant growth and development. YABBY family members are involved in lateral organ polarity and growth; OFP members encode transcriptional repressors, whereas the role of IQD/SUN members is less clear. The tomato fruit shape genes SUN, OVATE, and FASCIATED belong to IQD/SUN, OFP and the YABBY gene family, respectively. A gene duplication resulting in high expression of SUN leads to elongated fruit, whereas a premature stop codon in OVATE and a large inversion within FASCIATED control fruit elongation and a flat fruit shape, respectively. In this study, we identified 34 SlSUN, 31 SlOFP and 9 SlYABBY genes in tomato and identified their position on 12 chromosomes. Genome mapping analysis showed that the SlSUN, SlOFP, and SlYABBY genes were enriched on the top and bottom segments of several chromosomes. In particular, on chromosome 10, a cluster of SlOFPs were found to originate from tandem duplication events. We also constructed three phylogenetic trees based on the protein sequences of the IQ67, OVATE and YABBY domains, respectively, from members of these families in Arabidopsis and tomato. The closest putative orthologs of the Arabidopsis and tomato genes were determined by the position on the phylogenetic tree and sequence similarity. Furthermore, expression analysis showed that some family members exhibited tissue-specific expression, whereas others were more ubiquitously expressed. Also, certain family members overlapped with known QTLs controlling fruit shape in Solanaceous plants. Combined, these results may help elucidate the roles of SUN, OFP and YABBY family members in plant growth and development.

  10. [Genome-wide identification, phylogenetic analysis and expression profiling of the WOX family genes in Solanum lycopersicum].

    PubMed

    Li, Xiaoxu; Liu, Cheng; Li, Wei; Zhang, Zenglin; Gao, Xiaoming; Zhou, Hui; Guo, Yongfeng

    2016-05-01

    Members of the plant-specific WOX transcription factor family have been reported to play important roles in cell to cell communication as well as other physiological and developmental processes. In this study, ten members of the WOX transcription factor family were identified in Solanum lycopersicum with HMMER. Neighbor-joining phylogenetic tree, maximum-likelihood tree and Bayesian-inference tree were constructed and similar topologies were shown using the protein sequences of the homeodomain. Phylogenetic study revealed that the 25 WOX family members from Arabidopsis and tomato fall into three clades and nine subfamilies. The patterns of exon-intron structures and organization of conserved domains in Arabidopsis and tomato were consistent based on the phylogenetic results. Transcriptome analysis showed that the expression patterns of SlWOXs were different in different tissue types. Gene Ontology (GO) analysis suggested that, as transcription factors, the SlWOX family members could be involved in a number of biological processes including cell to cell communication and tissue development. Our results are useful for future studies on WOX family members in tomato and other plant species.

  11. Genome-Wide Identification and Expression Analysis of Homeodomain Leucine Zipper Subfamily IV (HDZ IV) Gene Family from Musa accuminata

    PubMed Central

    Pandey, Ashutosh; Misra, Prashant; Alok, Anshu; Kaur, Navneet; Sharma, Shivani; Lakhwani, Deepika; Asif, Mehar H.; Tiwari, Siddharth; Trivedi, Prabodh K.

    2016-01-01

    The homeodomain zipper family (HD-ZIP) of transcription factors is present only in plants and plays important role in the regulation of plant-specific processes. The subfamily IV of HDZ transcription factors (HD-ZIP IV) has primarily been implicated in the regulation of epidermal structure development. Though this gene family is present in all lineages of land plants, members of this gene family have not been identified in banana, which is one of the major staple fruit crops. In the present work, we identified 21 HDZIV encoding genes in banana by the computational analysis of banana genome resource. Our analysis suggested that these genes putatively encode proteins having all the characteristic domains of HDZIV transcription factors. The phylogenetic analysis of the banana HDZIV family genes further confirmed that after separation from a common ancestor, the banana, and poales lineages might have followed distinct evolutionary paths. Further, we conclude that segmental duplication played a major role in the evolution of banana HDZIV encoding genes. All the identified banana HDZIV genes expresses in different banana tissue, however at varying levels. The transcript levels of some of the banana HDZIV genes were also detected in banana fruit pulp, suggesting their putative role in fruit attributes. A large number of genes of this family showed modulated expression under drought and salinity stress. Taken together, the present work lays a foundation for elucidation of functional aspects of the banana HDZIV encoding genes and for their possible use in the banana improvement programs. PMID:26870050

  12. Genome-wide analysis of the bHLH transcription factor family in Chinese cabbage (Brassica rapa ssp. pekinensis).

    PubMed

    Song, Xiao-Ming; Huang, Zhi-Nan; Duan, Wei-Ke; Ren, Jun; Liu, Tong-Kun; Li, Ying; Hou, Xi-Lin

    2014-02-01

    Basic helix-loop-helix (bHLH) transcription factors are widely distributed in eukaryotic organisms and are thought to be one of the largest families of regulatory proteins. This important family of transcriptional regulators plays crucial roles in plant development. However, a systematic analysis of the bHLH transcription factor family has not been reported in Chinese cabbage. In this study, 230 bHLH transcription factors were identified from the whole Chinese cabbage genome and compared with proteins from other representative plants, fungi and metazoans. The Chinese cabbage bHLH (BrabHLH) gene family could be classified into 24 subfamilies. Phylogenetic analysis of BrabHLHs along with bHLHs from Arabidopsis and rice indicated 26 subfamilies. The identification, classification, phylogenetic reconstruction, conserved motifs, chromosome distribution, functional annotation, expression patterns and interaction networks of BrabHLHs were analyzed. Distribution mapping showed that BrabHLHs were non-randomly located on the ten Chinese cabbage chromosomes. One hundred and twenty-four orthologous bHLH genes were identified between Chinese cabbage and Arabidopsis, and the interaction networks of the orthologous genes were constructed in Chinese cabbage. Quantitative RT-PCR analysis showed that expressions of BrabHLH genes varied widely under different abiotic stress treatments for different times. Thus, this comprehensive analysis of BrabHLHs represents a rich resource, aiding the elucidation of the roles of bHLH family members in plant growth and development. Furthermore, the comparative genomics analysis deepened our understanding of the evolution of this gene family after a polyploidy event.

  13. Genome-Wide Analysis of the Glutathione S-Transferase Gene Family in Capsella rubella: Identification, Expression, and Biochemical Functions

    PubMed Central

    He, Gang; Guan, Chao-Nan; Chen, Qiang-Xin; Gou, Xiao-Jun; Liu, Wei; Zeng, Qing-Yin; Lan, Ting

    2016-01-01

    Extensive subfunctionalization might explain why so many genes have been maintained after gene duplication, which provides the engine for gene family expansion. However, it is still a particular challenge to trace the evolutionary dynamics and features of functional divergences in a supergene family over the course of evolution. In this study, we identified 49 Glutathione S-transferase (GST) genes from the Capsella rubella, a close relative of Arabidopsis thaliana and a member of the mustard family. Capsella GSTs can be categorized into eight classes, with tau and phi GSTs being the most numerous. The expansion of the two classes mainly occurs through tandem gene duplication, which results in tandem-arrayed gene clusters on chromosomes. By integrating phylogenetic analysis, expression patterns, and biochemical functions of Capsella and Arabidopsis GSTs, functional divergence, both in gene expression and enzymatic properties, were clearly observed in paralogous gene pairs in Capsella (even the most recent duplicates), and orthologous GSTs in Arabidopsis/Capsella. This study provides functional evidence for the expansion and organization of a large gene family in closely related species. PMID:27630652

  14. Genome-Wide Identification, Characterization and Expression Analysis of the Solute Carrier 6 Gene Family in Silkworm (Bombyx mori)

    PubMed Central

    Tang, Xin; Liu, Huawei; Chen, Quanmei; Wang, Xin; Xiong, Ying; Zhao, Ping

    2016-01-01

    The solute carrier 6 (SLC6) gene family, initially known as the neurotransmitter transporters, plays vital roles in the regulation of neurotransmitter signaling, nutrient absorption and motor behavior. In this study, a total of 16 candidate genes were identified as SLC6 family gene homologs in the silkworm (Bombyx mori) genome. Spatio-temporal expression patterns of silkworm SLC6 gene transcripts indicated that these genes were highly and specifically expressed in midgut, brain and gonads; moreover, these genes were expressed primarily at the feeding stage or adult stage. Levels of expression for most midgut-specific and midgut-enriched gene transcripts were down-regulated after starvation but up-regulated after re-feeding. In addition, we observed that expression levels of these genes except for BmSLC6-15 and BmGT1 were markedly up-regulated by a juvenile hormone analog. Moreover, brain-enriched genes showed differential expression patterns during wandering and mating processes, suggesting that these genes may be involved in modulating wandering and mating behaviors. Our results improve our understanding of the expression patterns and potential physiological functions of the SLC6 gene family, and provide valuable information for the comprehensive functional analysis of the SLC6 gene family. PMID:27706106

  15. Genome-Wide Identification, Characterization and Expression Analysis of the Solute Carrier 6 Gene Family in Silkworm (Bombyx mori).

    PubMed

    Tang, Xin; Liu, Huawei; Chen, Quanmei; Wang, Xin; Xiong, Ying; Zhao, Ping

    2016-10-03

    The solute carrier 6 (SLC6) gene family, initially known as the neurotransmitter transporters, plays vital roles in the regulation of neurotransmitter signaling, nutrient absorption and motor behavior. In this study, a total of 16 candidate genes were identified as SLC6 family gene homologs in the silkworm (Bombyx mori) genome. Spatio-temporal expression patterns of silkworm SLC6 gene transcripts indicated that these genes were highly and specifically expressed in midgut, brain and gonads; moreover, these genes were expressed primarily at the feeding stage or adult stage. Levels of expression for most midgut-specific and midgut-enriched gene transcripts were down-regulated after starvation but up-regulated after re-feeding. In addition, we observed that expression levels of these genes except for BmSLC6-15 and BmGT1 were markedly up-regulated by a juvenile hormone analog. Moreover, brain-enriched genes showed differential expression patterns during wandering and mating processes, suggesting that these genes may be involved in modulating wandering and mating behaviors. Our results improve our understanding of the expression patterns and potential physiological functions of the SLC6 gene family, and provide valuable information for the comprehensive functional analysis of the SLC6 gene family.

  16. Genome-wide analysis of the R2R3-MYB transcription factor gene family in sweet orange (Citrus sinensis).

    PubMed

    Liu, Chaoyang; Wang, Xia; Xu, Yuantao; Deng, Xiuxin; Xu, Qiang

    2014-10-01

    MYB transcription factor represents one of the largest gene families in plant genomes. Sweet orange (Citrus sinensis) is one of the most important fruit crops worldwide, and recently the genome has been sequenced. This provides an opportunity to investigate the organization and evolutionary characteristics of sweet orange MYB genes from whole genome view. In the present study, we identified 100 R2R3-MYB genes in the sweet orange genome. A comprehensive analysis of this gene family was performed, including the phylogeny, gene structure, chromosomal localization and expression pattern analyses. The 100 genes were divided into 29 subfamilies based on the sequence similarity and phylogeny, and the classification was also well supported by the highly conserved exon/intron structures and motif composition. The phylogenomic comparison of MYB gene family among sweet orange and related plant species, Arabidopsis, cacao and papaya suggested the existence of functional divergence during evolution. Expression profiling indicated that sweet orange R2R3-MYB genes exhibited distinct temporal and spatial expression patterns. Our analysis suggested that the sweet orange MYB genes may play important roles in different plant biological processes, some of which may be potentially involved in citrus fruit quality. These results will be useful for future functional analysis of the MYB gene family in sweet orange.

  17. Genome-wide characterization and expression profiling of HD-Zip gene family related to abiotic stress in cassava

    PubMed Central

    Yan, Yan; Tie, Weiwei; Xia, Zhiqiang; Wang, Wenquan; Peng, Ming; Hu, Wei; Zhang, Jiaming

    2017-01-01

    Homeodomain-leucine zipper (HD-Zip) gene family plays important roles in various abiotic stresses and hormone signaling in plants. However, no information is currently available regarding this family in cassava (Manihot esculenta), an important drought-tolerant crop in tropical and sub-tropical areas. Here, 57 HD-Zip genes (MeHDZ01-57) were identified in the cassava genome, and they were classified into four subfamilies based on phylogenetic analysis, which was further supported by their gene structure and conserved motif characteristics. Of which five gene pairs were involved in segmental duplication but none for tandem duplication, suggesting that segmental duplication was the main cause for the expansion of MeHDZ gene family in cassava. Global expression profiles revealed that MeHDZ genes were constitutively expressed, or not expressed, or tissue-specific expressed in examined tissues in both cultivated and wild subspecies. Transcriptomic analysis of three genotypes showed that most of MeHDZ genes responded differently to drought and polyethylene glycol treatments. Subsequently, quantitative RT-PCR analysis revealed comprehensive responses of twelve selected MeHDZ genes to various stimuli including cold, salt, and ABA treatments. These findings will increase our understanding of HD-Zip gene family involved in abiotic stresses and signaling transduction, and will provide a solid base for further functional characterization of MeHDZ genes in cassava. PMID:28249019

  18. Genome-wide identification of WRKY family genes and their response to cold stress in Vitis vinifera

    USDA-ARS?s Scientific Manuscript database

    WRKY transcription factors are one of the largest families of transcriptional regulators in plants. WRKY genes are not only found to play significant roles in biotic and abiotic stress response, but also regulate growth and development. Grapevine (Vitis vinifera) production is largely limited by str...

  19. Genome-wide analysis and expression profiling of DNA-binding with one zinc finger (Dof) transcription factor family in potato.

    PubMed

    Venkatesh, Jelli; Park, Se Won

    2015-09-01

    DNA-binding with one finger (Dof) domain proteins are a multigene family of plant-specific transcription factors involved in numerous aspects of plant growth and development. Here, we report a genome-wide search for Solanum tuberosum Dof (StDof) genes and their expression profiles at various developmental stages and in response to various abiotic stresses. In addition, a complete overview of Dof gene family in potato is presented, including the gene structures, chromosomal locations, cis-regulatory elements, conserved protein domains, and phylogenetic inferences. Based on the genome-wide analysis, we identified 35 full-length protein-coding StDof genes, unevenly distributed on 10 chromosomes. Phylogenetic analysis with Dof members from diverse plant species showed that StDof genes can be classified into four subgroups (StDofI, II, III, and IV). qPCR expression analysis of StDof gene transcripts showed the distinct expression patterns of StDof genes in various potato organs, and tuber developmental stages analyzed. Many StDof genes were upregulated in response to drought, salinity, and ABA treatments. Overall, the StDof gene expression pattern and the number of over-represented cis-acting elements in the promoter regions of the StDof genes indicate that most of the StDof genes have redundant functions. The detailed genomic information and expression profiles of the StDof gene homologs in the present study provide opportunities for functional analyses to unravel the genes' exact role in plant growth and development as well as in abiotic stress tolerance.

  20. Genome-Wide Association and Trans-ethnic Meta-Analysis for Advanced Diabetic Kidney Disease: Family Investigation of Nephropathy and Diabetes (FIND).

    PubMed

    Iyengar, Sudha K; Sedor, John R; Freedman, Barry I; Kao, W H Linda; Kretzler, Matthias; Keller, Benjamin J; Abboud, Hanna E; Adler, Sharon G; Best, Lyle G; Bowden, Donald W; Burlock, Allison; Chen, Yii-Der Ida; Cole, Shelley A; Comeau, Mary E; Curtis, Jeffrey M; Divers, Jasmin; Drechsler, Christiane; Duggirala, Ravi; Elston, Robert C; Guo, Xiuqing; Huang, Huateng; Hoffmann, Michael Marcus; Howard, Barbara V; Ipp, Eli; Kimmel, Paul L; Klag, Michael J; Knowler, William C; Kohn, Orly F; Leak, Tennille S; Leehey, David J; Li, Man; Malhotra, Alka; März, Winfried; Nair, Viji; Nelson, Robert G; Nicholas, Susanne B; O'Brien, Stephen J; Pahl, Madeleine V; Parekh, Rulan S; Pezzolesi, Marcus G; Rasooly, Rebekah S; Rotimi, Charles N; Rotter, Jerome I; Schelling, Jeffrey R; Seldin, Michael F; Shah, Vallabh O; Smiles, Adam M; Smith, Michael W; Taylor, Kent D; Thameem, Farook; Thornley-Brown, Denyse P; Truitt, Barbara J; Wanner, Christoph; Weil, E Jennifer; Winkler, Cheryl A; Zager, Philip G; Igo, Robert P; Hanson, Robert L; Langefeld, Carl D

    2015-08-01

    Diabetic kidney disease (DKD) is the most common etiology of chronic kidney disease (CKD) in the industrialized world and accounts for much of the excess mortality in patients with diabetes mellitus. Approximately 45% of U.S. patients with incident end-stage kidney disease (ESKD) have DKD. Independent of glycemic control, DKD aggregates in families and has higher incidence rates in African, Mexican, and American Indian ancestral groups relative to European populations. The Family Investigation of Nephropathy and Diabetes (FIND) performed a genome-wide association study (GWAS) contrasting 6,197 unrelated individuals with advanced DKD with healthy and diabetic individuals lacking nephropathy of European American, African American, Mexican American, or American Indian ancestry. A large-scale replication and trans-ethnic meta-analysis included 7,539 additional European American, African American and American Indian DKD cases and non-nephropathy controls. Within ethnic group meta-analysis of discovery GWAS and replication set results identified genome-wide significant evidence for association between DKD and rs12523822 on chromosome 6q25.2 in American Indians (P = 5.74x10-9). The strongest signal of association in the trans-ethnic meta-analysis was with a SNP in strong linkage disequilibrium with rs12523822 (rs955333; P = 1.31x10-8), with directionally consistent results across ethnic groups. These 6q25.2 SNPs are located between the SCAF8 and CNKSR3 genes, a region with DKD relevant changes in gene expression and an eQTL with IPCEF1, a gene co-translated with CNKSR3. Several other SNPs demonstrated suggestive evidence of association with DKD, within and across populations. These data identify a novel DKD susceptibility locus with consistent directions of effect across diverse ancestral groups and provide insight into the genetic architecture of DKD.

  1. A genome-wide association scan for acute insulin response to glucose in Hispanic-Americans: the Insulin Resistance Atherosclerosis Family Study (IRAS FS).

    PubMed

    Rich, S S; Goodarzi, M O; Palmer, N D; Langefeld, C D; Ziegler, J; Haffner, S M; Bryer-Ash, M; Norris, J M; Taylor, K D; Haritunians, T; Rotter, J I; Chen, Y-D I; Wagenknecht, L E; Bowden, D W; Bergman, R N

    2009-07-01

    This study sought to identify genes and regions in the human genome that are associated with the acute insulin response to glucose (AIRg), an important predictor of type 2 diabetes, in Hispanic-American participants from the Insulin Resistance Atherosclerosis Family Study (IRAS FS). A two-stage genome-wide association scan (GWAS) was performed in IRAS FS Hispanic-American samples. In the first stage, 317K single nucleotide polymorphisms (SNPs) were assessed in 229 Hispanic-American DNA samples from 34 families from San Antonio, TX, USA. SNPs with the most significant associations with AIRg were genotyped in the entire set of IRAS FS Hispanic-American samples (n = 1,190). In chromosomal regions with evidence of association, additional SNPs were genotyped to capture variation in genes. No individual SNP achieved genome-wide levels of significance (p < 5 x 10(-7)); however, two regions (chromosomes 6p21 and 20p11) had multiple highly ranked SNPs that were associated with AIRg. Additional genotyping in these regions supported the initial evidence of variants contributing to variation in AIRg. One region resides in a gene desert between PXT1 and KCTD20 on 6p21, while the region on 20p11 has several viable candidate genes (ENTPD6, PYGB, GINS1 and RP4-691N24.1). A GWAS in Hispanic-American samples identified several candidate genes and loci that may be associated with AIRg. These associations explain a small component of variation in AIRg. The genes identified are involved in phosphorylation and ion transport, and provide preliminary evidence that these processes are important in beta cell response.

  2. Genome-wide identification of MAPK, MAPKK, and MAPKKK gene families and transcriptional profiling analysis during development and stress response in cucumber.

    PubMed

    Wang, Jie; Pan, Changtian; Wang, Yan; Ye, Lei; Wu, Jian; Chen, Lifei; Zou, Tao; Lu, Gang

    2015-05-15

    The mitogen-activated protein kinase (MAPK) cascade consists of three types of reversibly phosphorylated kinases, namely, MAPK, MAPK kinase (MAPKK/MEK), and MAPK kinase kinase (MAPKKK/MEKK), playing important roles in plant growth, development, and defense response. The MAPK cascade genes have been investigated in detail in model plants, including Arabidopsis, rice, and tomato, but poorly characterized in cucumber (Cucumis sativus L.), a major popular vegetable in Cucurbitaceae crops, which is highly susceptible to environmental stress and pathogen attack. A genome-wide analysis revealed the presence of at least 14 MAPKs, 6 MAPKKs, and 59 MAPKKKs in the cucumber genome. Phylogenetic analyses classified all the CsMAPK and CsMAPKK genes into four groups, whereas the CsMAPKKK genes were grouped into the MEKK, RAF, and ZIK subfamilies. The expansion of these three gene families was mainly contributed by segmental duplication events. Furthermore, the ratios of non-synonymous substitution rates (Ka) and synonymous substitution rates (Ks) implied that the duplicated gene pairs had experienced strong purifying selection. Real-time PCR analysis demonstrated that some MAPK, MAPKK and MAPKKK genes are preferentially expressed in specific organs or tissues. Moreover, the expression levels of most of these genes significantly changed under heat, cold, drought, and Pseudoperonospora cubensis treatments. Exposure to abscisic acid and jasmonic acid markedly affected the expression levels of these genes, thereby implying that they may play important roles in the plant hormone network. A comprehensive genome-wide analysis of gene structure, chromosomal distribution, and evolutionary relationship of MAPK cascade genes in cucumber are present here. Further expression analysis revealed that these genes were involved in important signaling pathways for biotic and abiotic stress responses in cucumber, as well as the response to plant hormones. Our first systematic description of the MAPK

  3. Genome-Wide Comparative In Silico Analysis of the RNA Helicase Gene Family in Zea mays and Glycine max: A Comparison with Arabidopsis and Oryza sativa

    PubMed Central

    Huang, Jinguang; Zheng, Chengchao

    2013-01-01

    RNA helicases are enzymes that are thought to unwind double-stranded RNA molecules in an energy-dependent fashion through the hydrolysis of NTP. RNA helicases are associated with all processes involving RNA molecules, including nuclear transcription, editing, splicing, ribosome biogenesis, RNA export, and organelle gene expression. The involvement of RNA helicase in response to stress and in plant growth and development has been reported previously. While their importance in Arabidopsis and Oryza sativa has been partially studied, the function of RNA helicase proteins is poorly understood in Zea mays and Glycine max. In this study, we identified a total of RNA helicase genes in Arabidopsis and other crop species genome by genome-wide comparative in silico analysis. We classified the RNA helicase genes into three subfamilies according to the structural features of the motif II region, such as DEAD-box, DEAH-box and DExD/H-box, and different species showed different patterns of alternative splicing. Secondly, chromosome location analysis showed that the RNA helicase protein genes were distributed across all chromosomes with different densities in the four species. Thirdly, phylogenetic tree analyses identified the relevant homologs of DEAD-box, DEAH-box and DExD/H-box RNA helicase proteins in each of the four species. Fourthly, microarray expression data showed that many of these predicted RNA helicase genes were expressed in different developmental stages and different tissues under normal growth conditions. Finally, real-time quantitative PCR analysis showed that the expression levels of 10 genes in Arabidopsis and 13 genes in Zea mays were in close agreement with the microarray expression data. To our knowledge, this is the first report of a comparative genome-wide analysis of the RNA helicase gene family in Arabidopsis, Oryza sativa, Zea mays and Glycine max. This study provides valuable information for understanding the classification and putative functions of

  4. Genome-wide comparative in silico analysis of the RNA helicase gene family in Zea mays and Glycine max: a comparison with Arabidopsis and Oryza sativa.

    PubMed

    Xu, Ruirui; Zhang, Shizhong; Huang, Jinguang; Zheng, Chengchao

    2013-01-01

    RNA helicases are enzymes that are thought to unwind double-stranded RNA molecules in an energy-dependent fashion through the hydrolysis of NTP. RNA helicases are associated with all processes involving RNA molecules, including nuclear transcription, editing, splicing, ribosome biogenesis, RNA export, and organelle gene expression. The involvement of RNA helicase in response to stress and in plant growth and development has been reported previously. While their importance in Arabidopsis and Oryza sativa has been partially studied, the function of RNA helicase proteins is poorly understood in Zea mays and Glycine max. In this study, we identified a total of RNA helicase genes in Arabidopsis and other crop species genome by genome-wide comparative in silico analysis. We classified the RNA helicase genes into three subfamilies according to the structural features of the motif II region, such as DEAD-box, DEAH-box and DExD/H-box, and different species showed different patterns of alternative splicing. Secondly, chromosome location analysis showed that the RNA helicase protein genes were distributed across all chromosomes with different densities in the four species. Thirdly, phylogenetic tree analyses identified the relevant homologs of DEAD-box, DEAH-box and DExD/H-box RNA helicase proteins in each of the four species. Fourthly, microarray expression data showed that many of these predicted RNA helicase genes were expressed in different developmental stages and different tissues under normal growth conditions. Finally, real-time quantitative PCR analysis showed that the expression levels of 10 genes in Arabidopsis and 13 genes in Zea mays were in close agreement with the microarray expression data. To our knowledge, this is the first report of a comparative genome-wide analysis of the RNA helicase gene family in Arabidopsis, Oryza sativa, Zea mays and Glycine max. This study provides valuable information for understanding the classification and putative functions of

  5. Genome-Wide Association and Trans-ethnic Meta-Analysis for Advanced Diabetic Kidney Disease: Family Investigation of Nephropathy and Diabetes (FIND)

    PubMed Central

    Kretzler, Matthias; Keller, Benjamin J.; Adler, Sharon G.; Best, Lyle G.; Bowden, Donald W.; Burlock, Allison; Chen, Yii-Der Ida; Cole, Shelley A.; Comeau, Mary E.; Curtis, Jeffrey M.; Divers, Jasmin; Drechsler, Christiane; Duggirala, Ravi; Elston, Robert C.; Guo, Xiuqing; Huang, Huateng; Hoffmann, Michael Marcus; Howard, Barbara V.; Ipp, Eli; Kimmel, Paul L.; Klag, Michael J.; Knowler, William C.; Kohn, Orly F.; Leak, Tennille S.; Leehey, David J.; Li, Man; Malhotra, Alka; März, Winfried; Nair, Viji; Nelson, Robert G.; Nicholas, Susanne B.; O’Brien, Stephen J.; Pahl, Madeleine V.; Parekh, Rulan S.; Pezzolesi, Marcus G.; Rasooly, Rebekah S.; Rotimi, Charles N.; Rotter, Jerome I.; Schelling, Jeffrey R.; Seldin, Michael F.; Shah, Vallabh O.; Smiles, Adam M.; Smith, Michael W.; Taylor, Kent D.; Thameem, Farook; Thornley-Brown, Denyse P.; Truitt, Barbara J.; Wanner, Christoph; Weil, E. Jennifer; Winkler, Cheryl A.; Zager, Philip G.; Igo, Robert P.; Hanson, Robert L.; Langefeld, Carl D.

    2015-01-01

    Diabetic kidney disease (DKD) is the most common etiology of chronic kidney disease (CKD) in the industrialized world and accounts for much of the excess mortality in patients with diabetes mellitus. Approximately 45% of U.S. patients with incident end-stage kidney disease (ESKD) have DKD. Independent of glycemic control, DKD aggregates in families and has higher incidence rates in African, Mexican, and American Indian ancestral groups relative to European populations. The Family Investigation of Nephropathy and Diabetes (FIND) performed a genome-wide association study (GWAS) contrasting 6,197 unrelated individuals with advanced DKD with healthy and diabetic individuals lacking nephropathy of European American, African American, Mexican American, or American Indian ancestry. A large-scale replication and trans-ethnic meta-analysis included 7,539 additional European American, African American and American Indian DKD cases and non-nephropathy controls. Within ethnic group meta-analysis of discovery GWAS and replication set results identified genome-wide significant evidence for association between DKD and rs12523822 on chromosome 6q25.2 in American Indians (P = 5.74x10-9). The strongest signal of association in the trans-ethnic meta-analysis was with a SNP in strong linkage disequilibrium with rs12523822 (rs955333; P = 1.31x10-8), with directionally consistent results across ethnic groups. These 6q25.2 SNPs are located between the SCAF8 and CNKSR3 genes, a region with DKD relevant changes in gene expression and an eQTL with IPCEF1, a gene co-translated with CNKSR3. Several other SNPs demonstrated suggestive evidence of association with DKD, within and across populations. These data identify a novel DKD susceptibility locus with consistent directions of effect across diverse ancestral groups and provide insight into the genetic architecture of DKD. PMID:26305897

  6. Genome-wide linkage analysis of hemodynamic parameters under mental and physical stress in extended Omani Arab pedigrees: the Oman Family Study.

    PubMed

    Hassan, Mohammed O; Jaju, Deepali; Voruganti, V Saroja; Bayoumi, Riad A; Albarwani, Sulayma; Al-Yahyaee, Saeed; Aslani, Afshin; Snieder, Harold; Lopez-Alvarenga, Juan C; Al-Anqoudi, Zahir M; Alizadeh, Behrooz Z; Comuzzie, Anthony G

    2011-06-01

    We performed a genome-wide scan in a homogeneous Arab population to identify genomic regions linked to blood pressure (BP) and its intermediate phenotypes during mental and physical stress tests. The Oman Family Study subjects (N = 1277) were recruited from five extended families of ~10 generations. Hemodynamic phenotypes were computed from beat-to-beat BP, electrocardiography and impedance cardiography. Multi-point linkage was performed for resting, mental (word conflict test, WCT) and cold pressor (CPT) stress and their reactivity scores (s), using variance components decomposition-based methods implemented in SOLAR. Genome-wide scans for BP phenotypes identified quantitative trait loci (QTLs) with significant evidence of linkage on chromosomes 1 and 12 for WCT-linked cardiac output (LOD = 3.1) and systolic BP (LOD = 3.5). Evidence for suggestive linkage for WCT was found on chromosomes 3, 17 and 1 for heart rate (LOD = 2.3), DBP (LOD = 2.4) and left ventricular ejection time (LVET), respectively. For △WCT, suggestive QTLs were detected for CO on chr11 (LOD = 2.5), LVET on chr3 (LOD = 2.0) and EDI on chr9 (LOD = 2.1). For CPT, suggestive QTLs for HR and LVET shared the same region on chr22 (LOD 2.3 and 2.8, respectively) and on chr9 (LOD = 2.3) for SBP, chr7 (LOD = 2.4) for SV and chr19 (LOD = 2.6) for CO. For △CPT, CO and TPR top signals were detected on chr15 and 10 (LOD; 2.40, 2.08) respectively. Mental stress revealed the largest number of significant and suggestive loci for normal BP reported to date. The study of BP and its intermediate phenotypes under mental and physical stress may help reveal the genes involved in the pathogenesis of essential hypertension.

  7. Genome-Wide Analysis of APETALA2/Ethylene-Responsive Factor (AP2/ERF) Gene Family in Barley (Hordeum vulgare L.)

    PubMed Central

    Lin, Shen; Luan, Haiye; Lv, Chao; Zhang, Xinzhong; Song, Xiyun; Xu, Rugen

    2016-01-01

    APETALA2/Ethylene-Responsive Factor (AP2/ERF) gene family is plant specific transcription factor. It plays critical roles in development process, tolerance to biotic and abiotic stresses, and responses to plant hormones. However, limited data are available on the contributions of AP2/ERF gene family in barley (Hordeum vulgare L.). In the present study, 121 HvAP2/ERF genes in barley were identified by using bioinformatics methods. A total of 118 HvAP2/ERF (97.5%) genes were located on seven chromosomes. According to phylogenetic classification of AP2/ERF family in Arabidopsis, HvAP2/ERF proteins were divided into AP2 (APETALA2), RAV (Related to ABI3/VP), DREB (dehydration responsive element binding), ERF (ethylene responsive factors) and soloist sub families. The analysis of duplication events indicated that tandem repeat and segmental duplication contributed to the expansion of the AP2/ERF family in barley. HvDREB1s/2s genes displayed various expression patterns under abiotic stress and phytohormone. Taken together, the data generated in this study will be useful for genome-wide analysis to determine the precise role of the HvAP2/ERF gene during barley development, abiotic stress and phytohormone responses with the ultimate goal of improving crop production. PMID:27598245

  8. Genome-Wide Analysis of APETALA2/Ethylene-Responsive Factor (AP2/ERF) Gene Family in Barley (Hordeum vulgare L.).

    PubMed

    Guo, Baojian; Wei, Yafeng; Xu, Ruibin; Lin, Shen; Luan, Haiye; Lv, Chao; Zhang, Xinzhong; Song, Xiyun; Xu, Rugen

    2016-01-01

    APETALA2/Ethylene-Responsive Factor (AP2/ERF) gene family is plant specific transcription factor. It plays critical roles in development process, tolerance to biotic and abiotic stresses, and responses to plant hormones. However, limited data are available on the contributions of AP2/ERF gene family in barley (Hordeum vulgare L.). In the present study, 121 HvAP2/ERF genes in barley were identified by using bioinformatics methods. A total of 118 HvAP2/ERF (97.5%) genes were located on seven chromosomes. According to phylogenetic classification of AP2/ERF family in Arabidopsis, HvAP2/ERF proteins were divided into AP2 (APETALA2), RAV (Related to ABI3/VP), DREB (dehydration responsive element binding), ERF (ethylene responsive factors) and soloist sub families. The analysis of duplication events indicated that tandem repeat and segmental duplication contributed to the expansion of the AP2/ERF family in barley. HvDREB1s/2s genes displayed various expression patterns under abiotic stress and phytohormone. Taken together, the data generated in this study will be useful for genome-wide analysis to determine the precise role of the HvAP2/ERF gene during barley development, abiotic stress and phytohormone responses with the ultimate goal of improving crop production.

  9. A genome-wide quantitative trait locus (QTL) linkage scan of NEO personality factors in Latino families segregating bipolar disorder.

    PubMed

    Lee, Byung Dae; Gonzalez, Suzanne; Villa, Erika; Camarillo, Cynthia; Rodriguez, Marco; Yao, Yin; Guo, Wei; Flores, Deborah; Jerez, Alvaro; Raventos, Henriette; Ontiveros, Alfonso; Nicolini, Humberto; Escamilla, Michael

    2017-10-01

    Personality traits have been suggested as potential endophenotypes for Bipolar Disorder (BP), as they can be quantitatively measured and show correlations with BP. The present study utilized data from 2,745 individuals from 686 extended pedigrees originally ascertained for having multiplex cases of BP (963 cases of BPI or schizoaffective BP). Subjects were assessed with the NEO Personality Inventory, Revised (NEO PI-R) and genotyped using the Illumina HumanLinkage-24 Bead Chip, with an average genetic coverage of 0.67 cM. Two point linkage scores were calculated for each trait as a quantitative variable using SOLAR (Sequential Oligogenic Linkage Analysis Routines). Suggestive evidence for linkage was found for neuroticism at 1q32.1 (LOD = 2.52), 6q23.3 (2.32), 16p12 (2.79), extraversion at 4p15.3 (2.33), agreeableness at 4q31.1 (2.37), 5q34 (2.80), 7q31.1 (2.56), 16q22 (2.52), and conscientiousness at 4q31.1 (2.50). Each of the above traits have been shown to be correlated with the broad BP phenotype in this same sample. In addition, for the trait of openness, we found significant evidence of linkage to chromosome 3p24.3 (rs336610, LOD = 4.75) and suggestive evidence at 1q43 (2.74), 5q35.1 (3.03), 11q14.3 (2.61), 11q21 (2.30), and 19q13.1 (2.52). These findings support previous linkage findings of the openness trait to chromosome 19q13 and the agreeableness trait to 4q31 and identify a number of new loci for personality endophenotypes related to bipolar disorder. © 2017 Wiley Periodicals, Inc.

  10. The Glutathione Peroxidase Gene Family in Gossypium hirsutum: Genome-Wide Identification, Classification, Gene Expression and Functional Analysis

    PubMed Central

    Chen, Mingyang; Li, Kun; Li, Haipeng; Song, Chun-Peng; Miao, Yuchen

    2017-01-01

    The plant glutathione peroxidase (GPX) family consists of multiple isoenzymes with distinct subcellular locations, tissue-specific expression patterns and environmental stress responses. In this study, 13 putative GPXs from the genome of Gossypium hirsutum (GhGPXs) were identified and a conserved pattern among plant GPXs were exhibited, besides this they also responded to multiple environmental stresses and we predicted that they had hormone responsive cis-elements in their promoter regions. Most of the GhGPXs on expression in yeast can scavenge H2O2. Our results showed that different members of the GhGPX gene family were co-ordinately regulated under specific environmental stress conditions, and suggested the importance of GhGPXs in hormone treatments and abiotic stress responses. PMID:28300195

  11. Genome-wide analysis of transposable elements in the coffee berry borer Hypothenemus hampei (Coleoptera: Curculionidae): description of novel families.

    PubMed

    Hernandez-Hernandez, Eric M; Fernández-Medina, Rita Daniela; Navarro-Escalante, Lucio; Nuñez, Jonathan; Benavides-Machado, Pablo; Carareto, Claudia M A

    2017-02-15

    The coffee berry borer (CBB) Hypothenemus hampei is the most limiting pest of coffee production worldwide. The CBB genome has been recently sequenced; however, information regarding the presence and characteristics of transposable elements (TEs) was not provided. Using systematic searching strategies based on both de novo and homology-based approaches, we present a library of TEs from the draft genome of CBB sequenced by the Colombian Coffee Growers Federation. The library consists of 880 sequences classified as 66% Class I (LTRs: 46%, non-LTRs: 20%) and 34% Class II (DNA transposons: 8%, Helitrons: 16% and MITEs: 10%) elements, including families of the three main LTR (Gypsy, Bel-Pao and Copia) and non-LTR (CR1, Daphne, I/Nimb, Jockey, Kiri, R1, R2 and R4) clades and DNA superfamilies (Tc1-mariner, hAT, Merlin, P, PIF-Harbinger, PiggyBac and Helitron). We propose the existence of novel families: Hypo, belonging to the LTR Gypsy superfamily; Hamp, belonging to non-LTRs; and rosa, belonging to Class II or DNA transposons. Although the rosa clade has been previously described, it was considered to be a basal subfamily of the mariner family. Based on our phylogenetic analysis, including Tc1, mariner, pogo, rosa and Lsra elements from other insects, we propose that rosa and Lsra elements are subfamilies of an independent family of Class II elements termed rosa. The annotations obtained indicate that a low percentage of the assembled CBB genome (approximately 8.2%) consists of TEs. Although these TEs display high diversity, most sequences are degenerate, with few full-length copies of LTR and DNA transposons and several complete and putatively active copies of non-LTR elements. MITEs constitute approximately 50% of the total TEs content, with a high proportion associated with DNA transposons in the Tc1-mariner superfamily.

  12. Genome-wide analysis of the Dof transcription factor gene family reveals soybean-specific duplicable and functional characteristics.

    PubMed

    Guo, Yong; Qiu, Li-Juan

    2013-01-01

    The Dof domain protein family is a classic plant-specific zinc-finger transcription factor family involved in a variety of biological processes. There is great diversity in the number of Dof genes in different plants. However, there are only very limited reports on the characterization of Dof transcription factors in soybean (Glycine max). In the present study, 78 putative Dof genes were identified from the whole-genome sequence of soybean. The predicted GmDof genes were non-randomly distributed within and across 19 out of 20 chromosomes and 97.4% (38 pairs) were preferentially retained duplicate paralogous genes located in duplicated regions of the genome. Soybean-specific segmental duplications contributed significantly to the expansion of the soybean Dof gene family. These Dof proteins were phylogenetically clustered into nine distinct subgroups among which the gene structure and motif compositions were considerably conserved. Comparative phylogenetic analysis of these Dof proteins revealed four major groups, similar to those reported for Arabidopsis and rice. Most of the GmDofs showed specific expression patterns based on RNA-seq data analyses. The expression patterns of some duplicate genes were partially redundant while others showed functional diversity, suggesting the occurrence of sub-functionalization during subsequent evolution. Comprehensive expression profile analysis also provided insights into the soybean-specific functional divergence among members of the Dof gene family. Cis-regulatory element analysis of these GmDof genes suggested diverse functions associated with different processes. Taken together, our results provide useful information for the functional characterization of soybean Dof genes by combining phylogenetic analysis with global gene-expression profiling.

  13. Genome-wide identification and analysis of membrane-bound O-acyltransferase (MBOAT) gene family in plants.

    PubMed

    Wang, Peng; Wang, Zhunian; Dou, Yongchao; Zhang, Xiaoxiao; Wang, Maoyuan; Tian, Xinmin

    2013-11-01

    Membrane bound O-acyl transferase (MBOAT) family is composed of gene members encoding a variety of acyltransferase enzymes, which play important roles in plant acyl lipid metabolism. Here, we present the first genome-enabled identification and analysis of MBOAT gene models in plants. In total, we identified 136 plant MBOAT sequences from 14 plant species with complete genomes. Phylogenetic relationship analyses suggested the plant MBOAT gene models fell into four major groups, two of which likely encode enzymes of diacylglycerol acyltransferase 1 (DGAT1) and lysophospholipid acyltransferase (LPLAT), respectively, with one-three copies of paralogs present in each of the most plant species. A group of gene sequences, which are homologous to Saccharomyces cerevisiae glycerol uptake proteins (GUP), was identified in plants; copy numbers were conserved, with only one copy represented in each of the most plant species; analyses showed that residues essential for acyltransferases were more prone to be conserved than vertebrate orthologs. Among four groups, one was inferred to emerge in land plants and experience a rapid expansion in genomes of angiosperms, which suggested their important roles in adaptation of plants in lands. Sequence and phylogeny analyses indicated that genes in all four groups encode enzymes with acyltransferases. Comprehensive sequence identification of MBOAT family members and investigation into classification provide a complete picture of the MBOAT gene family in plants, and could shed light into enzymatic functions of different MBOAT genes in plants.

  14. Genome-Wide Characterization and Expression Profiling of the AUXIN RESPONSE FACTOR (ARF) Gene Family in Eucalyptus grandis

    PubMed Central

    Yu, Hong; Soler, Marçal; Mila, Isabelle; San Clemente, Hélène; Savelli, Bruno; Dunand, Christophe; Paiva, Jorge A. P.; Myburg, Alexander A.; Bouzayen, Mondher; Grima-Pettenati, Jacqueline; Cassan-Wang, Hua

    2014-01-01

    Auxin is a central hormone involved in a wide range of developmental processes including the specification of vascular stem cells. Auxin Response Factors (ARF) are important actors of the auxin signalling pathway, regulating the transcription of auxin-responsive genes through direct binding to their promoters. The recent availability of the Eucalyptus grandis genome sequence allowed us to examine the characteristics and evolutionary history of this gene family in a woody plant of high economic importance. With 17 members, the E. grandis ARF gene family is slightly contracted, as compared to those of most angiosperms studied hitherto, lacking traces of duplication events. In silico analysis of alternative transcripts and gene truncation suggested that these two mechanisms were preeminent in shaping the functional diversity of the ARF family in Eucalyptus. Comparative phylogenetic analyses with genomes of other taxonomic lineages revealed the presence of a new ARF clade found preferentially in woody and/or perennial plants. High-throughput expression profiling among different organs and tissues and in response to environmental cues highlighted genes expressed in vascular cambium and/or developing xylem, responding dynamically to various environmental stimuli. Finally, this study allowed identification of three ARF candidates potentially involved in the auxin-regulated transcriptional program underlying wood formation. PMID:25269088

  15. Genome-Wide Analysis and Expression Profiling of the Phospholipase C Gene Family in Soybean (Glycine max)

    PubMed Central

    Zhou, Yonggang; Dong, Jinye; Chen, Huan; Dong, Yuanyuan; Wang, Nan; Li, Xiaowei; Li, Haiyan

    2015-01-01

    Phosphatidylinositol-specific phospholipase C (PI-PLC) hydrolyses phosphatidylinositol-4,5-bisphosphate to produce diacylglycerol and inositol 1,4,5-trisphosphate. It plays an important role in plant development and abiotic stress responses. However, systematic analysis and expression profiling of the phospholipase C (PLC) gene family in soybean have not been reported. In this study, 12 putative PLC genes were identified in the soybean genome. Soybean PLCs were found on chromosomes 2, 11, 14 and 18 and encoded 58.8–70.06 kD proteins. Expression pattern analysis by RT-PCR demonstrated that expression of the GmPLCs was induced by PEG, NaCl and saline-alkali treatments in roots and leaves. GmPLC transcripts accumulated specifically in roots after ABA treatment. Furthermore, GmPLC transcripts were analyzed in various tissues. The results showed that GmPLC7 was highly expressed in most tissues, whereas GmPLC12 was expressed in early pods specifically. In addition, subcellular localization analysis was carried out and confirmed that GmPLC10 was localized in the plasma membrane in Nicotiana benthamiana. Our genomic analysis of the soybean PLC family provides an insight into the regulation of abiotic stress responses and development. It also provides a solid foundation for the functional characterization of the soybean PLC gene family. PMID:26421918

  16. Genome-wide identification and expression analysis of SBP-box gene family in Sorghum bicolor L.

    PubMed

    Jianzhong, Chang; Fengxia, Yan; Linyi, Qiao; Jun, Zheng; Fuyao, Zhang; Qingshan, Liu

    2016-06-20

    SQUAMOSA PROMOTER BINDING PROTEIN-box (SBP-box) family genes encoding plant-specific transcription factors are involved in many aspects of crop genetic improvement such as yield, plant-type and stress-resistance. The SBP-box gene family have important practical applications. In this study, 18 SBP-box genes were identified from the reference genome of sorghum (Sorghum bicolor L.) using bioinformatics. These genes distributed on nine chromosomes while eight of them located in the segmental duplication region. Phylogenetic reconstruction resulted in six subfamilies of SBP-box genes, among which SbSBP12, SbSBP3 and SbSBP15 are orthologous to ZmLG1, ZmTGA1 and ZmUB2/3 in corn, respectively. RNA-seq data analysis indicated that SbSBP-box genes show the highest expression level in primordial inflorescences. Moreover, SbSBP9 and SbSBP17 exhibited a tissue specific expression in primordial inflorescences. The expression levels of SbSBP5, SbSBP8 and SbSBP18 were increased in response to exogenous ABA and PEG,indicating that SbSBP-box genes are involved in the defense response against abiotic stresses in sorghum. This research provides references for cloning important genes in SbSBP-box gene family. Genes identified in this study could be considered as candidate genes for genetic improvement of sorghum.

  17. Genome-wide characterization and expression profiling of the AUXIN RESPONSE FACTOR (ARF) gene family in Eucalyptus grandis.

    PubMed

    Yu, Hong; Soler, Marçal; Mila, Isabelle; San Clemente, Hélène; Savelli, Bruno; Dunand, Christophe; Paiva, Jorge A P; Myburg, Alexander A; Bouzayen, Mondher; Grima-Pettenati, Jacqueline; Cassan-Wang, Hua

    2014-01-01

    Auxin is a central hormone involved in a wide range of developmental processes including the specification of vascular stem cells. Auxin Response Factors (ARF) are important actors of the auxin signalling pathway, regulating the transcription of auxin-responsive genes through direct binding to their promoters. The recent availability of the Eucalyptus grandis genome sequence allowed us to examine the characteristics and evolutionary history of this gene family in a woody plant of high economic importance. With 17 members, the E. grandis ARF gene family is slightly contracted, as compared to those of most angiosperms studied hitherto, lacking traces of duplication events. In silico analysis of alternative transcripts and gene truncation suggested that these two mechanisms were preeminent in shaping the functional diversity of the ARF family in Eucalyptus. Comparative phylogenetic analyses with genomes of other taxonomic lineages revealed the presence of a new ARF clade found preferentially in woody and/or perennial plants. High-throughput expression profiling among different organs and tissues and in response to environmental cues highlighted genes expressed in vascular cambium and/or developing xylem, responding dynamically to various environmental stimuli. Finally, this study allowed identification of three ARF candidates potentially involved in the auxin-regulated transcriptional program underlying wood formation.

  18. Chromosomes 4 and 8 Implicated in a Genome Wide SNP Linkage Scan of 762 Prostate Cancer Families Collected by the ICPCG

    PubMed Central

    Lu, Lingyi; Cancel-Tassin, Geraldine; Valeri, Antoine; Cussenot, Olivier; Lange, Ethan M.; Cooney, Kathleen A.; Farnham, James M.; Camp, Nicola J.; Cannon-Albright, Lisa A.; Tammela, Teuvo L.J.; Schleutker, Johanna; Hoegel, Josef; Herkommer, Kathleen; Maier, Christiane; Vogel, Walther; Wiklund, Fredrik; Emanuelsson, Monica; Grönberg, Henrik; Wiley, Kathleen E.; Isaacs, Sarah D.; Walsh, Patrick C.; Helfand, Brian T.; Kan, Donghui; Catalona, William J.; Stanford, Janet L.; FitzGerald, Liesel M.; Johanneson, Bo; Deutsch, Kerry; McIntosh, Laura; Ostrander, Elaine A.; Thibodeau, Stephen N.; McDonnell, Shannon K.; Hebbring, Scott; Schaid, Daniel J.; Whittemore, Alice S.; Oakley-Girvan, Ingrid; Hsieh, Chih-Lin; Powell, Isaac; Bailey-Wilson, Joan E.; Carpten, John D.; Seminara, Daniela; Zheng, S. Lilly; Xu, Jianfeng; Giles, Graham G.; Severi, Gianluca; Hopper, John L.; English, Dallas R.; Foulkes, William D.; Maehle, Lovise; Moller, Pal; Badzioch, Michael D.; Edwards, Steve; Guy, Michelle; Eeles, Ros; Easton, Douglas; Isaacs, William B.

    2012-01-01

    Background In spite of intensive efforts, understanding of the genetic aspects of familial prostate cancer remains largely incomplete. In a previous microsatellite-based linkage scan of 1233 prostate cancer (PC) families, we identified suggestive evidence for linkage (i.e. LOD≥1.86) at 5q12, 15q11, 17q21, 22q12, and two loci on 8p, with additional regions implicated in subsets of families defined by age at diagnosis, disease aggressiveness, or number of affected members. Methods In an attempt to replicate these findings and increase linkage resolution, we used the Illumina 6000 SNP linkage panel to perform a genome-wide linkage scan of an independent set of 762 multiplex PC families, collected by 11 ICPCG groups. Results Of the regions identified previously, modest evidence of replication was observed only on the short arm of chromosome 8, where HLOD scores of 1.63 and 3.60 were observed in the complete set of families and families with young average age at diagnosis, respectively. The most significant linkage signals found in the complete set of families were observed across a broad, 37 cM interval on 4q13-25, with LOD scores ranging from 2.02 to 2.62, increasing to 4.50 in families with older average age at diagnosis. In families with multiple cases presenting with more aggressive disease, LOD scores over 3.0 were observed at 8q24 in the vicinity of previously identified common PC risk variants, as well as MYC, an important gene in PC biology. Conclusions These results will be useful in prioritizing future susceptibility gene discovery efforts in this common cancer. PMID:21748754

  19. Genome-wide identification and analysis of the SBP-box family genes in apple (Malus × domestica Borkh.).

    PubMed

    Li, Jun; Hou, Hongmin; Li, Xiaoqin; Xiang, Jiang; Yin, Xiangjing; Gao, Hua; Zheng, Yi; Bassett, Carole L; Wang, Xiping

    2013-09-01

    SQUAMOSA promoter binding protein (SBP)-box genes encode a family of plant-specific transcription factors and play many crucial roles in plant development. In this study, 27 SBP-box gene family members were identified in the apple (Malus × domestica Borkh.) genome, 15 of which were suggested to be putative targets of MdmiR156. Plant SBPs were classified into eight groups according to the phylogenetic analysis of SBP-domain proteins. Gene structure, gene chromosomal location and synteny analyses of MdSBP genes within the apple genome demonstrated that tandem and segmental duplications, as well as whole genome duplications, have likely contributed to the expansion and evolution of the SBP-box gene family in apple. Additionally, synteny analysis between apple and Arabidopsis indicated that several paired homologs of MdSBP and AtSPL genes were located in syntenic genomic regions. Tissue-specific expression analysis of MdSBP genes in apple demonstrated their diversified spatiotemporal expression patterns. Most MdmiR156-targeted MdSBP genes, which had relatively high transcript levels in stems, leaves, apical buds and some floral organs, exhibited a more differential expression pattern than most MdmiR156-nontargeted MdSBP genes. Finally, expression analysis of MdSBP genes in leaves upon various plant hormone treatments showed that many MdSBP genes were responsive to different plant hormones, indicating that MdSBP genes may be involved in responses to hormone signaling during stress or in apple development.

  20. Genome-wide classification and evolutionary and expression analyses of citrus MYB transcription factor families in sweet orange.

    PubMed

    Hou, Xiao-Jin; Li, Si-Bei; Liu, Sheng-Rui; Hu, Chun-Gen; Zhang, Jin-Zhi

    2014-01-01

    MYB family genes are widely distributed in plants and comprise one of the largest transcription factors involved in various developmental processes and defense responses of plants. To date, few MYB genes and little expression profiling have been reported for citrus. Here, we describe and classify 177 members of the sweet orange MYB gene (CsMYB) family in terms of their genomic gene structures and similarity to their putative Arabidopsis orthologs. According to these analyses, these CsMYBs were categorized into four groups (4R-MYB, 3R-MYB, 2R-MYB and 1R-MYB). Gene structure analysis revealed that 1R-MYB genes possess relatively more introns as compared with 2R-MYB genes. Investigation of their chromosomal localizations revealed that these CsMYBs are distributed across nine chromosomes. Sweet orange includes a relatively small number of MYB genes compared with the 198 members in Arabidopsis, presumably due to a paralog reduction related to repetitive sequence insertion into promoter and non-coding transcribed region of the genes. Comparative studies of CsMYBs and Arabidopsis showed that CsMYBs had fewer gene duplication events. Expression analysis revealed that the MYB gene family has a wide expression profile in sweet orange development and plays important roles in development and stress responses. In addition, 337 new putative microsatellites with flanking sequences sufficient for primer design were also identified from the 177 CsMYBs. These results provide a useful reference for the selection of candidate MYB genes for cloning and further functional analysis forcitrus.

  1. Genome-Wide Analysis of the Sucrose Synthase Gene Family in Grape (Vitis vinifera): Structure, Evolution, and Expression Profiles.

    PubMed

    Zhu, Xudong; Wang, Mengqi; Li, Xiaopeng; Jiu, Songtao; Wang, Chen; Fang, Jinggui

    2017-03-28

    Sucrose synthase (SS) is widely considered as the key enzyme involved in the plant sugar metabolism that is critical to plant growth and development, especially quality of the fruit. The members of SS gene family have been identified and characterized in multiple plant genomes. However, detailed information about this gene family is lacking in grapevine (Vitis vinifera L.). In this study, we performed a systematic analysis of the grape (V. vinifera) genome and reported that there are five SS genes (VvSS1-5) in the grape genome. Comparison of the structures of grape SS genes showed high structural conservation of grape SS genes, resulting from the selection pressures during the evolutionary process. The segmental duplication of grape SS genes contributed to this gene family expansion. The syntenic analyses between grape and soybean (Glycine max) demonstrated that these genes located in corresponding syntenic blocks arose before the divergence of grape and soybean. Phylogenetic analysis revealed distinct evolutionary paths for the grape SS genes. VvSS1/VvSS5, VvSS2/VvSS3 and VvSS4 originated from three ancient SS genes, which were generated by duplication events before the split of monocots and eudicots. Bioinformatics analysis of publicly available microarray data, which was validated by quantitative real-time reverse transcription PCR (qRT-PCR), revealed distinct temporal and spatial expression patterns of VvSS genes in various tissues, organs and developmental stages, as well as in response to biotic and abiotic stresses. Taken together, our results will be beneficial for further investigations into the functions of SS gene in the processes of grape resistance to environmental stresses.

  2. Genome-Wide Analysis of the Sucrose Synthase Gene Family in Grape (Vitis vinifera): Structure, Evolution, and Expression Profiles

    PubMed Central

    Zhu, Xudong; Wang, Mengqi; Li, Xiaopeng; Jiu, Songtao; Wang, Chen; Fang, Jinggui

    2017-01-01

    Sucrose synthase (SS) is widely considered as the key enzyme involved in the plant sugar metabolism that is critical to plant growth and development, especially quality of the fruit. The members of SS gene family have been identified and characterized in multiple plant genomes. However, detailed information about this gene family is lacking in grapevine (Vitis vinifera L.). In this study, we performed a systematic analysis of the grape (V. vinifera) genome and reported that there are five SS genes (VvSS1–5) in the grape genome. Comparison of the structures of grape SS genes showed high structural conservation of grape SS genes, resulting from the selection pressures during the evolutionary process. The segmental duplication of grape SS genes contributed to this gene family expansion. The syntenic analyses between grape and soybean (Glycine max) demonstrated that these genes located in corresponding syntenic blocks arose before the divergence of grape and soybean. Phylogenetic analysis revealed distinct evolutionary paths for the grape SS genes. VvSS1/VvSS5, VvSS2/VvSS3 and VvSS4 originated from three ancient SS genes, which were generated by duplication events before the split of monocots and eudicots. Bioinformatics analysis of publicly available microarray data, which was validated by quantitative real-time reverse transcription PCR (qRT-PCR), revealed distinct temporal and spatial expression patterns of VvSS genes in various tissues, organs and developmental stages, as well as in response to biotic and abiotic stresses. Taken together, our results will be beneficial for further investigations into the functions of SS gene in the processes of grape resistance to environmental stresses. PMID:28350372

  3. A genome-wide survey of HD-Zip genes in rice and analysis of drought-responsive family members.

    PubMed

    Agalou, Adamantia; Purwantomo, Sigit; Overnäs, Elin; Johannesson, Henrik; Zhu, Xiaoyi; Estiati, Amy; de Kam, Rolf J; Engström, Peter; Slamet-Loedin, Inez H; Zhu, Zhen; Wang, Mei; Xiong, Lizhong; Meijer, Annemarie H; Ouwerkerk, Pieter B F

    2008-01-01

    The homeodomain leucine zipper (HD-Zip) genes encode transcription factors that have diverse functions in plant development and have often been implicated in stress adaptation. The HD-Zip genes are the most abundant group of homeobox (HB) genes in plants and do not occur in other eukaryotes. This paper describes the complete annotation of the HD-Zip families I, II and III from rice and compares these gene families with Arabidopsis in a phylogeny reconstruction. Orthologous pairs of rice and Arabidopsis HD-Zip genes were predicted based on neighbour joining and maximum parsimony (MP) trees with support of conserved intron-exon organization. Additionally, a number of HD-Zip genes appeared to be unique to rice. Searching of EST and cDNA databases and expression analysis using RT-PCR showed that 30 out of 31 predicted rice HD-Zip genes are expressed. Most HD-Zip genes were broadly expressed in mature plants and seedlings, but others showed more organ specific patterns. Like in Arabidopsis and other dicots, a subset of the rice HD-Zip I and II genes was found to be regulated by drought stress. We identified both drought-induced and drought-repressed HD-Zip genes and demonstrate that these genes are differentially regulated in drought-sensitive versus drought-tolerant rice cultivars. The drought-repressed HD-Zip family I gene, Oshox4, was selected for promoter-GUS analysis, showing that drought-responsiveness of Oshox4 is controlled by the promoter and that Oshox4 expression is predominantly vascular-specific. Loss-of-function analysis of Oshox4 revealed no specific phenotype, but overexpression analysis suggested a role for Oshox4 in elongation and maturation processes.

  4. Rare CNVs in Suicide Attempt include Schizophrenia-Associated Loci and Neurodevelopmental Genes: A Pilot Genome-Wide and Family-Based Study

    PubMed Central

    Wasserman, Jerzy; Wasserman, Danuta

    2016-01-01

    Suicidal behavior (SB) has a complex etiology involving genes and environment. One of the genetic components in SB could be copy number variations (CNVs), as CNVs are implicated in neurodevelopmental disorders. However, a recently published genome-wide and case-control study did not observe any significant role of CNVs in SB. Here we complemented these initial observations by instead using a family-based trio-sample that is robust to control biases, having severe suicide attempt (SA) in offspring as main outcome (n = 660 trios). We first tested for CNV associations on the genome-wide Illumina 1M SNP-array by using FBAT-CNV methodology, which allows for evaluating CNVs without reliance on CNV calling algorithms, analogous to a common SNP-based GWAS. We observed association of certain T-cell receptor markers, but this likely reflected inter-individual variation in somatic rearrangements rather than association with SA outcome. Next, we used the PennCNV software to call 385 putative rare (<1%) and large (>100 kb) CNVs, observed in n = 225 SA offspring. Nine SA offspring had rare CNV calls in a set of previously schizophrenia-associated loci, indicating the importance of such CNVs in certain SA subjects. Several additional, very large (>1MB) sized CNV calls in 15 other SA offspring also spanned pathogenic regions or other neural genes of interest. Overall, 45 SA had CNVs enriched for 65 medically relevant genes previously shown to be affected by CNVs, which were characterized by a neurodevelopmental biology. A neurodevelopmental implication was partly congruent with our previous SNP-based GWAS, but follow-up analysis here indicated that carriers of rare CNVs had a decreased burden of common SNP risk-alleles compared to non-carriers. In conclusion, while CNVs did not show genome-wide association by the FBAT-CNV methodology, our preliminary observations indicate rare pathogenic CNVs affecting neurodevelopmental functions in a subset of SA, who were distinct from SA

  5. Genome-wide linkage survey for genetic loci that affect the risk of suicide attempts in families with recurrent, early-onset, major depression.

    PubMed

    Zubenko, George S; Maher, Brion S; Hughes, Hugh B; Zubenko, Wendy N; Scott Stiffler, J; Marazita, Mary L

    2004-08-15

    We previously described the results of a genome-wide linkage survey for genetic loci that influenced the development of unipolar mood disorders in 81 families identified by individuals with Recurrent, Early-Onset, Major Depressive Disorder (RE-MDD) [Zubenko et al. 2003b; Am J Med Genet (Neuropsychiatr Genet) 123B:1-18]. In the current study, we extended this linkage analysis by including the history of a suicide attempt as a covariate to identify chromosomal regions that harbor genes that influence the risk of this behavior in the context of mood disorders. This approach identified six linkage peaks with maximum multipoint DeltaLOD scores that reached genome-wide adjusted levels of significance (2p, 5q, 6q, 8p, 11q, and Xq). Four of these (2p, 6q, 8p, and Xq) exceeded the criterion for "highly-significant linkage" (genome-wide adjusted P < 0.001) recommended by Lander and Kruglyak [1995; Nat Genet 11:241-246]. The strongest evidence for linkage was observed in analyses employing affected relative pairs (ARPs) with the most severe and disabling Mood Disorders: Depression Spectrum Disorder and RE-MDD. The highest DeltaLOD score that emerged from this linkage analysis, 5.08, occurred for ARPs with Depression Spectrum Disorder at D8S1145 (37.0 cM, 18.2 Mbps, P < 0.0001) at cytogenetic location 8p22-p21. Significant linkage results on Xq arose from analyses of ARPs with RE-MDD at DXS1047 (143 cM, 127.8 Mbps, DeltaLOD = 3.87, P < 0.0001), a finding that may contribute to the higher rate of suicide attempts among women than men. These findings provide evidence for suicide risk loci that are independent of susceptibility loci for Mood Disorders, and suggest that the capacity for suicide risk loci to affect the development of suicidal behavior depends on the psychiatric disorder or subtype with which they interact. Copyright 2004 Wiley-Liss, Inc.

  6. Genome-wide comparison of ferritin family from Archaea, Bacteria, Eukarya, and Viruses: its distribution, characteristic motif, and phylogenetic relationship.

    PubMed

    Bai, Lina; Xie, Ting; Hu, Qingqing; Deng, Changyan; Zheng, Rong; Chen, Wanping

    2015-10-01

    Ferritins are highly conserved proteins that are widely distributed in various species from archaea to humans. The ubiquitous characteristic of these proteins reflects the pivotal contribution of ferritins to the safe storage and timely delivery of iron to achieve iron homeostasis. This study investigated the ferritin genes in 248 genomes from various species, including viruses, archaea, bacteria, and eukarya. The distribution comparison suggests that mammals and eudicots possess abundant ferritin genes, whereas fungi contain very few ferritin genes. Archaea and bacteria show considerable numbers of ferritin genes. Generally, prokaryotes possess three types of ferritin (the typical ferritin, bacterioferritin, and DNA-binding protein from starved cell), whereas eukaryotes have various subunit types of ferritin, thereby indicating the individuation of the ferritin family during evolution. The characteristic motif analysis of ferritins suggested that all key residues specifying the unique structural motifs of ferritin are highly conserved across three domains of life. Meanwhile, the characteristic motifs were also distinguishable between ferritin groups, especially phytoferritins, which show a plant-specific motif. The phylogenetic analyses show that ferritins within the same subfamily or subunits are generally clustered together. The phylogenetic relationships among ferritin members suggest that both gene duplication and horizontal transfer contribute to the wide variety of ferritins, and their possible evolutionary scenario was also proposed. The results contribute to a better understanding of the distribution, characteristic motif, and evolutionary relationship of the ferritin family.

  7. Genome-wide identification of the TIFY gene family in three cultivated Gossypium species and the expression of JAZ genes

    PubMed Central

    Sun, Quan; Wang, Guanghao; Zhang, Xiao; Zhang, Xiangrui; Qiao, Peng; Long, Lu; Yuan, Youlu; Cai, Yingfan

    2017-01-01

    TIFY proteins are plant-specific proteins containing TIFY, JAZ, PPD and ZML subfamilies. A total of 50, 54 and 28 members of the TIFY gene family in three cultivated cotton species—Gossypium hirsutum, Gossypium barbadense and Gossypium arboretum—were identified, respectively. The results of phylogenetic analysis showed that these TIFY genes were divided into eight clusters. The different clusters of gene family members often have similar gene structures, including the number of exons. The results of quantitative reverse transcription polymerase chain reaction (qRT-PCR) showed that different JAZ genes displayed distinct expression patterns in the leaves of upland cotton under treatment with Gibberellin (GA), methyl jasmonate (MeJA), Jasmonic acid (JA) and abscisic acid (ABA). Different groups of JAZ genes exhibited different expression patterns in cotton leaves infected with Verticillium dahliae. The results of the comparative analysis of TIFY genes in the three cultivated species will be useful for understanding the involvement of these genes in development and stress resistance in cotton. PMID:28186193

  8. Genome-wide identification and expression profile of homeodomain-leucine zipper Class I gene family in Cucumis sativus.

    PubMed

    Liu, Wei; Fu, Rao; Li, Qiang; Li, Jing; Wang, Lina; Ren, Zhonghai

    2013-12-01

    The HD-Zip proteins comprise one of the largest families of transcription factors in plants. HD-Zip genes have been grouped into four different classes: HD-Zip I to IV. In this study, we described the identification and structural characterization of Class I HD-Zip genes in cucumber. A complete set of 13 HD-Zip I genes were identified in the cucumber genome using Blast search tools and phylogeny. The cucumber HD-Zip I family contained a smaller number of identified genes compared to other higher plants such as Arabidopsis and maize due to the absence of recent gene duplication events. Chromosomal location of these genes revealed that they are distributed unevenly across 5 of 7 chromosomes. Tissue-specific expression profiles showed that 13 cucumber HD-Zip I genes were expressed in at least one of the tissues, which suggested that cucumber HD-Zip I genes took part in many cellular processes. The transcript abundance level analysis during abiotic stress conditions (NaCl, ABA and low temperature treatments) identified a group of HD-Zip I genes that responded to one or more treatments.

  9. Genome-wide comparison of ferritin family from Archaea, Bacteria, Eukarya, and Viruses: its distribution, characteristic motif, and phylogenetic relationship

    NASA Astrophysics Data System (ADS)

    Bai, Lina; Xie, Ting; Hu, Qingqing; Deng, Changyan; Zheng, Rong; Chen, Wanping

    2015-10-01

    Ferritins are highly conserved proteins that are widely distributed in various species from archaea to humans. The ubiquitous characteristic of these proteins reflects the pivotal contribution of ferritins to the safe storage and timely delivery of iron to achieve iron homeostasis. This study investigated the ferritin genes in 248 genomes from various species, including viruses, archaea, bacteria, and eukarya. The distribution comparison suggests that mammals and eudicots possess abundant ferritin genes, whereas fungi contain very few ferritin genes. Archaea and bacteria show considerable numbers of ferritin genes. Generally, prokaryotes possess three types of ferritin (the typical ferritin, bacterioferritin, and DNA-binding protein from starved cell), whereas eukaryotes have various subunit types of ferritin, thereby indicating the individuation of the ferritin family during evolution. The characteristic motif analysis of ferritins suggested that all key residues specifying the unique structural motifs of ferritin are highly conserved across three domains of life. Meanwhile, the characteristic motifs were also distinguishable between ferritin groups, especially phytoferritins, which show a plant-specific motif. The phylogenetic analyses show that ferritins within the same subfamily or subunits are generally clustered together. The phylogenetic relationships among ferritin members suggest that both gene duplication and horizontal transfer contribute to the wide variety of ferritins, and their possible evolutionary scenario was also proposed. The results contribute to a better understanding of the distribution, characteristic motif, and evolutionary relationship of the ferritin family.

  10. Genome-Wide Identification, Phylogenetic and Co-Expression Analysis of OsSET Gene Family in Rice

    PubMed Central

    Lu, Zhanhua; Huang, Xiaolong; Ouyang, Yidan; Yao, Jialing

    2013-01-01

    Background SET domain is responsible for the catalytic activity of histone lysine methyltransferases (HKMTs) during developmental process. Histone lysine methylation plays a crucial and diverse regulatory function in chromatin organization and genome function. Although several SET genes have been identified and characterized in plants, the understanding of OsSET gene family in rice is still very limited. Methodology/Principal Findings In this study, a systematic analysis was performed and revealed the presence of at least 43 SET genes in rice genome. Phylogenetic and structural analysis grouped SET proteins into five classes, and supposed that the domains out of SET domain were significant for the specific of histone lysine methylation, as well as the recognition of methylated histone lysine. Based on the global microarray, gene expression profile revealed that the transcripts of OsSET genes were accumulated differentially during vegetative and reproductive developmental stages and preferentially up or down-regulated in different tissues. Cis-elements identification, co-expression analysis and GO analysis of expression correlation of 12 OsSET genes suggested that OsSET genes might be involved in cell cycle regulation and feedback. Conclusions/Significance This study will facilitate further studies on OsSET family and provide useful clues for functional validation of OsSETs. PMID:23762371

  11. Genome-wide characterization and analysis of bZIP transcription factor gene family related to abiotic stress in cassava

    PubMed Central

    Hu, Wei; Yang, Hubiao; Yan, Yan; Wei, Yunxie; Tie, Weiwei; Ding, Zehong; Zuo, Jiao; Peng, Ming; Li, Kaimian

    2016-01-01

    The basic leucine zipper (bZIP) transcription factor family plays crucial roles in various aspects of biological processes. Currently, no information is available regarding the bZIP family in the important tropical crop cassava. Herein, 77 bZIP genes were identified from cassava. Evolutionary analysis indicated that MebZIPs could be divided into 10 subfamilies, which was further supported by conserved motif and gene structure analyses. Global expression analysis suggested that MebZIPs showed similar or distinct expression patterns in different tissues between cultivated variety and wild subspecies. Transcriptome analysis of three cassava genotypes revealed that many MebZIP genes were activated by drought in the root of W14 subspecies, indicating the involvement of these genes in the strong resistance of cassava to drought. Expression analysis of selected MebZIP genes in response to osmotic, salt, cold, ABA, and H2O2 suggested that they might participate in distinct signaling pathways. Our systematic analysis of MebZIPs reveals constitutive, tissue-specific and abiotic stress-responsive candidate MebZIP genes for further functional characterization in planta, yields new insights into transcriptional regulation of MebZIP genes, and lays a foundation for understanding of bZIP-mediated abiotic stress response. PMID:26947924

  12. Genome-wide analysis, expression dynamics and varietal comparison of NAC gene family at various developmental stages in Morus notabilis.

    PubMed

    Baranwal, Vinay Kumar; Khurana, Paramjit

    2016-06-01

    NAC genes are important transcription factors and forms a large family in plants. They have shown to play an important role in growth and development and have also been shown to involve in regulation of stress-responsive genes. In the present study, a repertoire of NAC genes in recently published mulberry genome has been identified which consists of a total of 79 members. Structural analysis revealed that most of the NAC genes in mulberry contain two introns. The proteins encoded by them show a wide range of isoelectric points suggestive of their varied roles in varying microcellular environment. Phylogenetic and conserved motif analysis elucidate the presence of 15 sub-groups of these genes along with two novel sub-groups having distinct conserved motifs which are not present in Arabidopsis. Gene ontology term enrichment analysis and cis-element identification from their putative 1 K upstream regulatory region indicates their possible role in important biological processes like organ formation, meristem establishment, senescence, and various biotic and abiotic stresses. Expression analysis across various developmental stages led to identification of their preferential expression in diverse tissues. Taken together, this work provides a solid background information related to structure, function, expression and evolution of NAC gene family in mulberry.

  13. Genome-wide identification and analysis of JHBP-domain family members in the silkworm Bombyx mori.

    PubMed

    Li, Wei; Cheng, Tingcai; Hu, Wenbo; Peng, Zhangchuan; Liu, Chun; Xia, Qingyou

    2016-12-01

    Juvenile hormone (JH) regulates the insect growth and development. JH appears in the hemolymph bound by a specific glycoprotein, juvenile hormone-binding protein (JHBP), which serves as a carrier to release the hormone to target tissues and cells. However, JHBP family candidates, expression patterns, and functional implications are still unclear. In this study, we identified 41 genes-containing conserved JHBP domains distributed across eight chromosomes of the silkworm Bombyx mori. A phylogenetic tree showed that the silkworm JHBP (BmJHBP) genes could be classified into two major branches and four subfamilies. Microarray data revealed that BmJHBP genes exhibit various expression patterns and are expressed in different tissues, periods, and sexes. The expression of BmJHBP genes was generally higher in the head, integument, midgut, fat body, testis, and ovary than in the anterior of the silk gland (ASG), median of the silk gland (MSG), posterior of the silk gland (PSG), hemocyte, and Malpighian tubule. BmJHBPd2, in particular, was investigated by Western Blotting, and immunofluorescent assay and was found to be highly expressed in the PSG cytoplasm on day 3 of the fifth instar, coinciding with silk production. Taken together, our findings will be useful in improving understanding the complexity of the JHBP family, and will lay the foundation of explaining functional characterization for further research.

  14. Genome-wide analysis and expression profiling of the ERF transcription factor family in potato (Solanum tuberosum L.).

    PubMed

    Charfeddine, Mariam; Saïdi, Mohamed Najib; Charfeddine, Safa; Hammami, Asma; Gargouri Bouzid, Radhia

    2015-04-01

    The ERF transcription factors belong to the AP2/ERF superfamily, one of the largest transcription factor families in plants. They play important roles in plant development processes, as well as in the response to biotic, abiotic, and hormone signaling. In the present study, 155 putative ERF transcription factor genes were identified from the potato (Solanum tuberosum) genome database, and compared with those from Arabidopsis thaliana. The StERF proteins are divided into ten phylogenetic groups. Expression analyses of five StERFs were carried out by semi-quantitative RT-PCR and compared with published RNA-seq data. These latter analyses were used to distinguish tissue-specific, biotic, and abiotic stress genes as well as hormone-responsive StERF genes. The results are of interest to better understand the role of the AP2/ERF genes in response to diverse types of stress in potatoes. A comprehensive analysis of the physiological functions and biological roles of the ERF family genes in S. tuberosum is required to understand crop stress tolerance mechanisms.

  15. Genome-wide characterization of GRAS family genes in Medicago truncatula reveals their evolutionary dynamics and functional diversification.

    PubMed

    Zhang, Hailing; Cao, Yingping; Shang, Chen; Li, Jikai; Wang, Jianli; Wu, Zhenying; Ma, Lichao; Qi, Tianxiong; Fu, Chunxiang; Bai, Zetao; Hu, Baozhong

    2017-01-01

    The GRAS gene family is a large plant-specific family of transcription factors that are involved in diverse processes during plant development. Medicago truncatula is an ideal model plant for genetic research in legumes, and specifically for studying nodulation, which is crucial for nitrogen fixation. In this study, 59 MtGRAS genes were identified and classified into eight distinct subgroups based on phylogenetic relationships. Motifs located in the C-termini were conserved across the subgroups, while motifs in the N-termini were subfamily specific. Gene duplication was the main evolutionary force for MtGRAS expansion, especially proliferation of the LISCL subgroup. Seventeen duplicated genes showed strong effects of purifying selection and diverse expression patterns, highlighting their functional importance and diversification after duplication. Thirty MtGRAS genes, including NSP1 and NSP2, were preferentially expressed in nodules, indicating possible roles in the process of nodulation. A transcriptome study, combined with gene expression analysis under different stress conditions, suggested potential functions of MtGRAS genes in various biological pathways and stress responses. Taken together, these comprehensive analyses provide basic information for understanding the potential functions of GRAS genes, and will facilitate further discovery of MtGRAS gene functions.

  16. The ERF transcription factor family in cassava: genome-wide characterization and expression analyses against drought stress

    PubMed Central

    Fan, Wei; Hai, Meirong; Guo, Yunling; Ding, Zehong; Tie, Weiwei; Ding, Xupo; Yan, Yan; Wei, Yunxie; Liu, Yang; Wu, Chunlai; Shi, Haitao; Li, Kaimian; Hu, Wei

    2016-01-01

    Cassava (Manihot esculenta) shows strong tolerance to drought stress; however, the mechanisms underlying this tolerance are poorly understood. Ethylene response factor (ERF) family genes play a crucial role in plants responding to abiotic stress. Currently, less information is known regarding the ERF family in cassava. Herein, 147 ERF genes were characterized from cassava based on the complete genome data, which was further supported by phylogenetic relationship, gene structure, and conserved motif analyses. Transcriptome analysis suggested that most of the MeERF genes have similar expression profiles between W14 and Arg7 during organ development. Comparative expression profiles revealed that the function of MeERFs in drought tolerance may be differentiated in roots and leaves of different genotypes. W14 maintained strong tolerance by activating more MeERF genes in roots compared to Arg7 and SC124, whereas Arg7 and SC124 maintained drought tolerance by inducing more MeERF genes in leaves relative to W14. Expression analyses of the selected MeERF genes showed that most of them are significantly upregulated by osmotic and salt stresses, whereas slightly induced by cold stress. Taken together, this study identified candidate MeERF genes for genetic improvement of abiotic stress tolerance and provided new insights into ERF-mediated cassava tolerance to drought stress. PMID:27869212

  17. Genome-wide Analysis of Phosphoenolpyruvate Carboxylase Gene Family and Their Response to Abiotic Stresses in Soybean

    PubMed Central

    Wang, Ning; Zhong, Xiujuan; Cong, Yahui; Wang, Tingting; Yang, Songnan; Li, Yan; Gai, Junyi

    2016-01-01

    Phosphoenolpyruvate carboxylase (PEPC) plays an important role in assimilating atmospheric CO2 during C4 and crassulacean acid metabolism photosynthesis, and also participates in various non-photosynthetic processes, including fruit ripening, stomatal opening, supporting carbon–nitrogen interactions, seed formation and germination, and regulation of plant tolerance to stresses. However, a comprehensive analysis of PEPC family in Glycine max has not been reported. Here, a total of ten PEPC genes were identified in soybean and denominated as GmPEPC1-GmPEPC10. Based on the phylogenetic analysis of the PEPC proteins from 13 higher plant species including soybean, PEPC family could be classified into two subfamilies, which was further supported by analyses of their conserved motifs and gene structures. Nineteen cis-regulatory elements related to phytohormones, abiotic and biotic stresses were identified in the promoter regions of GmPEPC genes, indicating their roles in soybean development and stress responses. GmPEPC genes were expressed in various soybean tissues and most of them responded to the exogenously applied phytohormones. GmPEPC6, GmPEPC8 and GmPEPC9 were significantly induced by aluminum toxicity, cold, osmotic and salt stresses. In addition, the enzyme activities of soybean PEPCs were also up-regulated by these treatments, suggesting their potential roles in soybean response to abiotic stresses. PMID:27924923

  18. Genome-wide identification and comparative analysis of the cation proton antiporters family in pear and four other Rosaceae species.

    PubMed

    Zhou, Hongsheng; Qi, Kaijie; Liu, Xing; Yin, Hao; Wang, Peng; Chen, Jianqing; Wu, Juyou; Zhang, Shaoling

    2016-08-01

    The monovalent cation proton antiporters (CPAs) play essential roles in plant nutrition, development, and signal transduction by regulating ion and pH homeostasis of the cell. The CPAs of plants include the Na(+)/H(+) exchanger, K(+) efflux antiporter, and cation/H(+) exchanger families. However, currently, little is known about the CPA genes in Rosaceae species. In this study, 220 CPA genes were identified from five Rosaceae species (Pyrus bretschneideri, Malus domestica, Prunus persica, Fragaria vesca, and Prunus mume), and 53 of which came from P. bretschneideri. Phylogenetic, structure, collinearity, and gene expression analyses were conducted on the entire CPA genes of pear. Gene expression data showed that 35 and 37 CPA genes were expressed in pear fruit and pollen tubes, respectively. The transcript analysis of some CPA genes under abiotic stress conditions revealed that CPAs may play an important role in pollen tubes growth. The results presented here will be useful in improving understanding of the complexity of the CPA gene family and will promote functional characterization in future studies.

  19. Genome-wide identification and analysis of FK506-binding protein gene family in peach (Prunus persica).

    PubMed

    Zhang, Yanping; Han, Jan; Liu, Dan; Wen, Xicheng; Li, Yu; Tao, Ran; Peng, Yongbin; Fang, Jinggui; Wang, Chen

    2014-02-25

    The FKBP protein family has prolyl isomerase activity and is related in function to cyclophilins. FKBPs are known to be involved in many biological processes including hormone signaling, plant growth, and stress responses through a chaperone or an isomerization of proline residues during protein folding. The availability of complete peach genome sequences allowed the identification of 21 FKBP genes by HMMER and BLAST analyses. Scaffold locations of these FKBP genes in the peach genome were determined and the protein domain and motif organization of peach FKBPs were analyzed. The phylogenetic relationships between peach FKBPs were also assessed. The expression profiles of peach FKBP gene results revealed that most peach FKBPs were expressed in all tissues, while a few peach FKBPs were specifically expressed in some of the tissues. This data could contribute to better understanding of the complex regulation of the peach FKBP gene family, and also provide valuable information for further research in peach functional genomics. Copyright © 2013 Elsevier B.V. All rights reserved.

  20. Genome-wide analysis of the homeodomain-leucine zipper (HD-ZIP) gene family in peach (Prunus persica).

    PubMed

    Zhang, C H; Ma, R J; Shen, Z J; Sun, X; Korir, N K; Yu, M L

    2014-04-08

    In this study, 33 homeodomain-leucine zipper (HD-ZIP) genes were identified in peach using the HD-ZIP amino acid sequences of Arabidopsis thaliana as a probe. Based on the phylogenetic analysis and the individual gene or protein characteristics, the HD-ZIP gene family in peach can be classified into 4 subfamilies, HD-ZIP I, II, III, and IV, containing 14, 7, 4, and 8 members, respectively. The most closely related peach HD-ZIP members within the same subfamilies shared very similar gene structure in terms of either intron/exon numbers or lengths. Almost all members of the same subfamily shared common motif compositions, thereby implying that the HD-ZIP proteins within the same subfamily may have functional similarity. The 33 peach HD-ZIP genes were distributed across scaffolds 1 to 7. Although the primary structure varied among HD-ZIP family proteins, their tertiary structures were similar. The results from this study will be useful in selecting candidate genes from specific subfamilies for functional analysis.

  1. Genome-Wide Identification and Analysis of the Type-B Authentic Response Regulator Gene Family in Peach (Prunus persica).

    PubMed

    Zeng, Jingjue; Zhu, Xudong; Haider, Muhammad S; Wang, Xicheng; Zhang, Cheng; Wang, Chen

    2017-01-01

    The type-B authentic response regulator (ARR-B) family members serve as DNA-binding transcriptional regulators, whose activities are probably regulated by phosphorylation/dephosphorylation, resulting in the rapid induction of type-A ARR genes. Type-B ARRs are believed to be involved in many biological processes, including cytokinin signaling, plant growth, and stress responses through a chaperone or by isomerization of proline residues during protein folding. The public availability of complete peach genome sequences allows the identification of 23 ARR-B genes by HMMER and blast analysis. Scaffold locations of these genes in the peach genome were determined, and the protein domain and motif organization of peach type-B ARRs were analyzed. The phylogenetic relationships between peach type-B ARRs were also assessed. The expression profiles of peach ARR-B genes revealed that most of the type-B ARRs showed high expression levels in tissues undergoing rapid cell division and may engage more cytokinins, like half-opened flowers, fruits at expansion stages, and young leaves. These findings not only contribute to a better understanding of the complex regulation of the peach ARR-B gene family, but also provide valuable information for future research in peach functional genomics. © 2017 S. Karger AG, Basel.

  2. Comprehensive analysis of multi-tissue transcriptome data and the genome-wide investigation of GRAS family in Phyllostachys edulis

    PubMed Central

    Zhao, Hansheng; Dong, Lili; Sun, Huayu; Li, Lichao; Lou, Yongfeng; Wang, Lili; Li, Zuyao; Gao, Zhimin

    2016-01-01

    GRAS family is one of plant specific transcription factors and plays diverse roles in the regulation of plant growth and development as well as in the plant disease resistance and abiotic stress responses. However, the investigation of GRAS family and multi-tissue gene expression profiles still remains unavailable in bamboo (Phyllostachys edulis). Here, we applied RNA-Seq analysis to monitor global transcriptional changes and investigate expression patterns in the five tissues of Ph. edulis, and analyzed a large-scale transcriptional events and patterns. Moreover, the tissue-specific genes and DEGs in different tissues were detected. For example, DEGs in panicle and leaf tissues were abundant in photosynthesis, glutathione, porphyrin and chlorophyll metabolism, whereas those in shoot and rhizome were majority in glycerophospholipid metabolism. In the portion of Ph. edulis GRAS (PeGRAS) analyses, we performed the analysis of phylogenetic, gene structure, conserved motifs, and analyzed the expression profiles of PeGRASs in response to high light and made a co-expression analysis. Additionally, the expression profiles of PeGRASs were validated using quantitative real-time PCR. Thus, PeGRASs based on dynamics profiles of gene expression is helpful in uncovering the specific biological functions which might be of critical values for bioengineering to improve bamboo breeding in future. PMID:27325361

  3. Using ancestry matching to combine family-based and unrelated samples for genome-wide association studies.

    PubMed

    Crossett, Andrew; Kent, Brian P; Klei, Lambertus; Ringquist, Steven; Trucco, Massimo; Roeder, Kathryn; Devlin, Bernie

    2010-12-10

    We propose a method to analyze family-based samples together with unrelated cases and controls. The method builds on the idea of matched case-control analysis using conditional logistic regression (CLR). For each trio within the family, a case (the proband) and matched pseudo-controls are constructed, based upon the transmitted and untransmitted alleles. Unrelated controls, matched by genetic ancestry, supplement the sample of pseudo-controls; likewise unrelated cases are also paired with genetically matched controls. Within each matched stratum, the case genotype is contrasted with control/pseudo-control genotypes via CLR, using a method we call matched-CLR (mCLR). Eigenanalysis of numerous SNP genotypes provides a tool for mapping genetic ancestry. The result of such an analysis can be thought of as a multidimensional map, or eigenmap, in which the relative genetic similarities and differences amongst individuals is encoded in the map. Once constructed, new individuals can be projected onto the ancestry map based on their genotypes. Successful differentiation of individuals of distinct ancestry depends on having a diverse, yet representative sample from which to construct the ancestry map. Once samples are well-matched, mCLR yields comparable power to competing methods while ensuring excellent control over Type I error.

  4. Using ancestry matching to combine family-based and unrelated samples for genome-wide association studies‡

    PubMed Central

    Crossett, Andrew; Kent, Brian P.; Klei, Lambertus; Ringquist, Steven; Trucco, Massimo; Roeder, Kathryn; Devlin, Bernie

    2015-01-01

    We propose a method to analyze family-based samples together with unrelated cases and controls. The method builds on the idea of matched case–control analysis using conditional logistic regression (CLR). For each trio within the family, a case (the proband) and matched pseudo-controls are constructed, based upon the transmitted and untransmitted alleles. Unrelated controls, matched by genetic ancestry, supplement the sample of pseudo-controls; likewise unrelated cases are also paired with genetically matched controls. Within each matched stratum, the case genotype is contrasted with control pseudo-control genotypes via CLR, using a method we call matched-CLR (mCLR). Eigenanalysis of numerous SNP genotypes provides a tool for mapping genetic ancestry. The result of such an analysis can be thought of as a multidimensional map, or eigenmap, in which the relative genetic similarities and differences amongst individuals is encoded in the map. Once constructed, new individuals can be projected onto the ancestry map based on their genotypes. Successful differentiation of individuals of distinct ancestry depends on having a diverse, yet representative sample from which to construct the ancestry map. Once samples are well-matched, mCLR yields comparable power to competing methods while ensuring excellent control over Type I error. PMID:20862653

  5. A Genome-Wide Analysis Reveals Stress and Hormone Responsive Patterns of TIFY Family Genes in Brassica rapa.

    PubMed

    Saha, Gopal; Park, Jong-In; Kayum, Md Abdul; Nou, Ill-Sup

    2016-01-01

    The TIFY family is a plant-specific group of proteins with a diversity of functions and includes four subfamilies, viz. ZML, TIFY, PPD, and JASMONATE ZIM-domain (JAZ) proteins. TIFY family members, particularly JAZ subfamily proteins, play roles in biological processes such as development and stress and hormone responses in Arabidopsis, rice, chickpea, and grape. However, there is no information about this family in any Brassica crop. This study identifies 36 TIFY genes in Brassica rapa, an economically important crop species in the Brassicaceae. An extensive in silico analysis of phylogenetic grouping, protein motif organization and intron-exon distribution confirmed that there are four subfamilies of BrTIFY proteins. Out of 36 BrTIFY genes, we identified 21 in the JAZ subfamily, seven in the TIFY subfamily, six in ZML and two in PPD. Extensive expression profiling of 21 BrTIFY JAZs in various tissues, especially in floral organs and at different flower growth stages revealed constitutive expression patterns, which suggest that BrTIFY JAZ genes are important during growth and development of B. rapa flowers. A protein interaction network analysis also pointed to association of these proteins with fertility and defense processes of B. rapa. Using a low temperature-treated whole-genome microarray data set, most of the JAZ genes were found to have variable transcript abundance between the contrasting inbred lines Chiifu and Kenshin of B. rapa. Subsequently, the expression of all 21 BrTIFY JAZs in response to cold stress was characterized in the same two lines via qPCR, demonstrating that nine genes were up-regulated. Importantly, the BrTIFY JAZs showed strong and differential expression upon JA treatment, pointing to their probable involvement in JA-mediated growth regulatory functions, especially during flower development and stress responses. Additionally, BrTIFY JAZs were induced in response to salt, drought, Fusarium, ABA, and SA treatments, and six genes (BrTIFY3

  6. A Genome-Wide Analysis Reveals Stress and Hormone Responsive Patterns of TIFY Family Genes in Brassica rapa

    PubMed Central

    Saha, Gopal; Park, Jong-In; Kayum, Md. Abdul; Nou, Ill-Sup

    2016-01-01

    The TIFY family is a plant-specific group of proteins with a diversity of functions and includes four subfamilies, viz. ZML, TIFY, PPD, and JASMONATE ZIM-domain (JAZ) proteins. TIFY family members, particularly JAZ subfamily proteins, play roles in biological processes such as development and stress and hormone responses in Arabidopsis, rice, chickpea, and grape. However, there is no information about this family in any Brassica crop. This study identifies 36 TIFY genes in Brassica rapa, an economically important crop species in the Brassicaceae. An extensive in silico analysis of phylogenetic grouping, protein motif organization and intron-exon distribution confirmed that there are four subfamilies of BrTIFY proteins. Out of 36 BrTIFY genes, we identified 21 in the JAZ subfamily, seven in the TIFY subfamily, six in ZML and two in PPD. Extensive expression profiling of 21 BrTIFY JAZs in various tissues, especially in floral organs and at different flower growth stages revealed constitutive expression patterns, which suggest that BrTIFY JAZ genes are important during growth and development of B. rapa flowers. A protein interaction network analysis also pointed to association of these proteins with fertility and defense processes of B. rapa. Using a low temperature-treated whole-genome microarray data set, most of the JAZ genes were found to have variable transcript abundance between the contrasting inbred lines Chiifu and Kenshin of B. rapa. Subsequently, the expression of all 21 BrTIFY JAZs in response to cold stress was characterized in the same two lines via qPCR, demonstrating that nine genes were up-regulated. Importantly, the BrTIFY JAZs showed strong and differential expression upon JA treatment, pointing to their probable involvement in JA-mediated growth regulatory functions, especially during flower development and stress responses. Additionally, BrTIFY JAZs were induced in response to salt, drought, Fusarium, ABA, and SA treatments, and six genes (BrTIFY3

  7. Genome-Wide Identification and Expression Analysis of the Tubby-Like Protein Family in the Malus domestica Genome

    PubMed Central

    Xu, Jia-Ning; Xing, Shan-Shan; Zhang, Zheng-Rong; Chen, Xue-Sen; Wang, Xiao-Yun

    2016-01-01

    Tubby-like proteins (TLPs), which have a highly conserved β barrel tubby domain, have been found to be associated with some animal-specific characteristics. In the plant kingdom, more than 10 TLP family members were identified in Arabidopsis, rice and maize, and they were found to be involved in responses to stress. The publication of the apple genome makes it feasible to systematically study the TLP family in apple. In this investigation, nine TLP encoding genes (TLPs for short) were identified. When combined with the TLPs from other plant species, the TLPs were divided into three groups (group A, B, and C). Most plant TLP members in group A contained an additional F-box domain at the N-terminus. However, no common domain was identified other than tubby domain either in group B or in group C. An analysis of the tubby domains of MdTLPs identified three types of conserved motifs. Motif 1 and 2, the signature motifs in the confirmed TLPs, were always present in MdTLPs, while motif 3 was absent from group B. Homology modeling indicated that the tubby domain of most MdTLPs had a closed β barrel, as in animal tubby domains. Expression profiling revealed that the MdTLP genes were expressed in multiple organs and were abundant in roots, stems, and leaves but low in flowers. An analysis of cis-acting elements showed that elements related to the stress response were prevalent in the promoter sequences of MdTLPs. Expression profiling by qRT-PCR indicated that almost all MdTLPs were up-regulated at some extent under abiotic stress, exogenous ABA and H2O2 treatments in leaves and roots, though different MdTLP members exhibited differently in leaves and roots. The results and information above may provide a basis for further investigation of TLP function in plants. PMID:27895653

  8. Genome-Wide Identification and Comparative Analysis of Cytosine-5 DNA Methyltransferase and Demethylase Families in Wild and Cultivated Peanut

    PubMed Central

    Wang, Pengfei; Gao, Chao; Bian, Xiaotong; Zhao, Shuzhen; Zhao, Chuanzhi; Xia, Han; Song, Hui; Hou, Lei; Wan, Shubo; Wang, Xingjun

    2016-01-01

    DNA methylation plays important roles in genome protection, regulation of gene expression and is associated with plants development. Plant DNA methylation pattern was mediated by cytosine-5 DNA methyltransferase and demethylase. Although the genomes of AA and BB wild peanuts have been fully sequenced, these two gene families have not been studied. In this study we report the identification and analysis of putative cytosine-5 DNA methyltransferases (C5-MTases) and demethylases in AA and BB wild peanuts. Cytosine-5 DNA methyltransferases in AA and BB wild peanuts could be classified in MET, CMT, and DRM2 groups based on their domain organization. This result was supported by the gene and protein structural characteristics and phylogenetic analysis. We found that some wild peanut DRM2 members didn't contain UBA domain which was different from other plants such as Arabidopsis, maize and soybean. Five DNA demethylase encoding genes were found in AA genome and five in BB genome. The selective pressure analysis showed that wild peanut C5-MTase genes mainly underwent purifying selection but many positive selection sites can be detected. Conversely, DNA demethylase genes mainly underwent positive selection during evolution. Additionally, the expression dynamic of cytosine-5 DNA methyltransferase and demethylase genes in different cultivated peanut tissues were analyzed. Expression result showed that cold, heat or PEG stress could influence the expression level of C5-MTase and DNA demethylase genes in cultivated peanut. These results are useful for better understanding the complexity of these two gene families, and will facilitate epigenetic studies in peanut in the future. PMID:26870046

  9. Genome-wide identification and characterization of the NF-Y gene family in grape (vitis vinifera L.).

    PubMed

    Ren, Chong; Zhang, Zhan; Wang, Yi; Li, Shaohua; Liang, Zhenchang

    2016-08-11

    Nuclear factor Y (NF-Y) transcription factor is composed of three distinct subunits: NF-YA, NF-YB and NF-YC. Many members of NF-Y family have been reported to be key regulators in plant development, phytohormone signaling and drought tolerance. However, the function of the NF-Y family is less known in grape (Vitis vinifera L.). A total of 34 grape NF-Y genes that distributed unevenly on grape (V. vinifera) chromosomes were identified in this study. Phylogenetic analysis was performed to predict functional similarities between Arabidopsis thaliana and grape NF-Y genes. Comparison of the structures of grape NF-Y genes (VvNF-Ys) revealed their functional conservation and alteration. Furthermore, we investigated the expression profiles of VvNF-Ys in response to various stresses, phytohormone treatments, and in leaves and grape berries with various sugar contents at different developmental stages. The relationship between VvNF-Y transcript levels and sugar content was examined to select candidates for exogenous sugar treatments. Quantitative real-time PCR (qPCR) indicated that many VvNF-Ys responded to different sugar stimuli with variations in transcript abundance. qPCR and publicly available microarray data suggest that VvNF-Ys exhibit distinct expression patterns in different grape organs and developmental stages, and a number of VvNF-Ys may participate in responses to multiple abiotic and biotic stresses, phytohormone treatments and sugar accumulation or metabolism. In this study, we characterized 34 VvNF-Ys based on their distributions on chromosomes, gene structures, phylogenetic relationship with Arabidopsis NF-Y genes, and their expression patterns. The potential roles of VvNF-Ys in sugar accumulation or metabolism were also investigated. Altogether, the data provide significant insights on VvNF-Ys, and lay foundations for further functional studies of NF-Y genes in grape.

  10. Genome-wide characterization of the WRKY gene family in radish (Raphanus sativus L.) reveals its critical functions under different abiotic stresses.

    PubMed

    Karanja, Bernard Kinuthia; Fan, Lianxue; Xu, Liang; Wang, Yan; Zhu, Xianwen; Tang, Mingjia; Wang, Ronghua; Zhang, Fei; Muleke, Everlyne M'mbone; Liu, Liwang

    2017-08-17

    The radish WRKY gene family was genome-widely identified and played critical roles in response to multiple abiotic stresses. The WRKY is among the largest transcription factors (TFs) associated with multiple biological activities for plant survival, including control response mechanisms against abiotic stresses such as heat, salinity, and heavy metals. Radish is an important root vegetable crop and therefore characterization and expression pattern investigation of WRKY transcription factors in radish is imperative. In the present study, 126 putative WRKY genes were retrieved from radish genome database. Protein sequence and annotation scrutiny confirmed that RsWRKY proteins possessed highly conserved domains and zinc finger motif. Based on phylogenetic analysis results, RsWRKYs candidate genes were divided into three groups (Group I, II and III) with the number 31, 74, and 20, respectively. Additionally, gene structure analysis revealed that intron-exon patterns of the WRKY genes are highly conserved in radish. Linkage map analysis indicated that RsWRKY genes were distributed with varying densities over nine linkage groups. Further, RT-qPCR analysis illustrated the significant variation of 36 RsWRKY genes under one or more abiotic stress treatments, implicating that they might be stress-responsive genes. In total, 126 WRKY TFs were identified from the R. sativus genome wherein, 35 of them showed abiotic stress-induced expression patterns. These results provide a genome-wide characterization of RsWRKY TFs and baseline for further functional dissection and molecular evolution investigation, specifically for improving abiotic stress resistances with an ultimate goal of increasing yield and quality of radish.

  11. Genome-Wide Investigation and Expression Analyses of WD40 Protein Family in the Model Plant Foxtail Millet (Setaria italica L.)

    PubMed Central

    Mishra, Awdhesh Kumar; Muthamilarasan, Mehanathan; Khan, Yusuf; Parida, Swarup Kumar; Prasad, Manoj

    2014-01-01

    WD40 proteins play a crucial role in diverse protein-protein interactions by acting as scaffolding molecules and thus assisting in the proper activity of proteins. Hence, systematic characterization and expression profiling of these WD40 genes in foxtail millet would enable us to understand the networks of WD40 proteins and their biological processes and gene functions. In the present study, a genome-wide survey was conducted and 225 potential WD40 genes were identified. Phylogenetic analysis categorized the WD40 proteins into 5 distinct sub-families (I–V). Gene Ontology annotation revealed the biological roles of the WD40 proteins along with its cellular components and molecular functions. In silico comparative mapping with sorghum, maize and rice demonstrated the orthologous relationships and chromosomal rearrangements including duplication, inversion and deletion of WD40 genes. Estimation of synonymous and non-synonymous substitution rates revealed its evolutionary significance in terms of gene-duplication and divergence. Expression profiling against abiotic stresses provided novel insights into specific and/or overlapping expression patterns of SiWD40 genes. Homology modeling enabled three-dimensional structure prediction was performed to understand the molecular functions of WD40 proteins. Although, recent findings had shown the importance of WD40 domains in acting as hubs for cellular networks during many biological processes, it has invited a lesser research attention unlike other common domains. Being a most promiscuous interactors, WD40 domains are versatile in mediating critical cellular functions and hence this genome-wide study especially in the model crop foxtail millet would serve as a blue-print for functional characterization of WD40s in millets and bioenergy grass species. In addition, the present analyses would also assist the research community in choosing the candidate WD40s for comprehensive studies towards crop improvement of millets and

  12. Genome-wide analysis and expression profiling of glyoxalase gene families in soybean (Glycine max) indicate their development and abiotic stress specific response.

    PubMed

    Ghosh, Ajit; Islam, Tahmina

    2016-04-16

    Glyoxalase pathway consists of two enzymes, glyoxalase I (GLYI) and glyoxalase II (GLYII) which detoxifies a highly cytotoxic metabolite methylglyoxal (MG) to its non-toxic form. MG may form advanced glycation end products with various cellular macro-molecules such as proteins, DNA and RNA; that ultimately lead to their inactivation. Role of glyoxalase enzymes has been extensively investigated in various plant species which showed their crucial role in salinity, drought and heavy metal stress tolerance. Previously genome-wide analysis of glyoxalase genes has been conducted in model plants Arabidopsis and rice, but no such study was performed in any legume species. In the present study, a comprehensive genome database analysis of soybean was performed and identified a total of putative 41 GLYI and 23 GLYII proteins encoded by 24 and 12 genes, respectively. Detailed analysis of these identified members was conducted including their nomenclature and classification, chromosomal distribution and duplication, exon-intron organization, and protein domain(s) and motifs identification. Expression profiling of these genes has been performed in different tissues and developmental stages as well as under salinity and drought stresses using publicly available RNAseq and microarray data. The study revealed that GmGLYI-7 and GmGLYII-8 have been expressed intensively in all the developmental stages and tissues; while GmGLYI-6, GmGLYI-9, GmGLYI-20, GmGLYII-5 and GmGLYII-10 were highly abiotic stress responsive members. The present study identifies the largest family of glyoxalase proteins to date with 41 GmGLYI and 23 GmGLYII members in soybean. Detailed analysis of GmGLYI and GmGLYII genes strongly indicates the genome-wide segmental and tandem duplication of the glyoxalase members. Moreover, this study provides a strong basis about the biological role and function of GmGLYI and GmGLYII members in soybean growth, development and stress physiology.

  13. Clustering patterns of LOD scores for asthma-related phenotypes revealed by a genome-wide screen in 295 French EGEA families.

    PubMed

    Bouzigon, Emmanuelle; Dizier, Marie-Hélène; Krähenbühl, Christine; Lemainque, Arnaud; Annesi-Maesano, Isabella; Betard, Christine; Bousquet, Jean; Charpin, Denis; Gormand, Frédéric; Guilloud-Bataille, Michel; Just, Jocelyne; Le Moual, Nicole; Maccario, Jean; Matran, Régis; Neukirch, Françoise; Oryszczyn, Marie-Pierre; Paty, Evelyne; Pin, Isabelle; Rosenberg-Bourgin, Myriam; Vervloet, Daniel; Kauffmann, Francine; Lathrop, Mark; Demenais, Florence

    2004-12-15

    A genome-wide scan for asthma phenotypes was conducted in the whole sample of 295 EGEA families selected through at least one asthmatic subject. In addition to asthma, seven phenotypes involved in the main asthma physiopathological pathways were considered: SPT (positive skin prick test response to at least one of 11 allergens), SPTQ score being the number of positive skin test responses to 11 allergens, Phadiatop (positive specific IgE response to a mixture of allergens), total IgE levels, eosinophils, bronchial responsiveness (BR) to methacholine challenge and %predicted FEV(1). Four regions showed evidence for linkage (Pgenome-wide LOD scores. This analysis revealed clustering of LODs for asthma, SPT and Phadiatop on one axis and clustering of LODs for %FEV(1), BR and SPTQ on the other, while LODs for IgE and eosinophils appeared to be independent from all other LODs. These results provide new insights into the potential sharing of genetic determinants by asthma-related phenotypes.

  14. Genome-wide analysis of the ATP-binding cassette (ABC) transporter gene family in the silkworm, Bombyx mori.

    PubMed

    Xie, Xiaodong; Cheng, Tingcai; Wang, Genhong; Duan, Jun; Niu, Weihuan; Xia, Qingyou

    2012-07-01

    The ATP-binding cassette (ABC) superfamily is a larger protein family with diverse physiological functions in all kingdoms of life. We identified 53 ABC transporters in the silkworm genome, and classified them into eight subfamilies (A-H). Comparative genome analysis revealed that the silkworm has an expanded ABCC subfamily with more members than Drosophila melanogaster, Caenorhabditis elegans, or Homo sapiens. Phylogenetic analysis showed that the ABCE and ABCF genes were highly conserved in the silkworm, indicating possible involvement in fundamental biological processes. Five multidrug resistance-related genes in the ABCB subfamily and two multidrug resistance-associated-related genes in the ABCC subfamily indicated involvement in biochemical defense. Genetic variation analysis revealed four ABC genes that might be evolving under positive selection. Moreover, the silkworm ABCC4 gene might be important for silkworm domestication. Microarray analysis showed that the silkworm ABC genes had distinct expression patterns in different tissues on day 3 of the fifth instar. These results might provide new insights for further functional studies on the ABC genes in the silkworm genome.

  15. Genome-wide survey and expression analysis of the PUB family in Chinese cabbage (Brassica rapa ssp. pekinesis).

    PubMed

    Wang, Cheng; Duan, Weike; Riquicho, Ali RamuliMaquina; Jing, Zange; Liu, Tongkun; Hou, Xilin; Li, Ying

    2015-12-01

    U-box proteins are widely distributed among eukaryotic organisms and show a higher prevalence in plants than in other organisms. Plant U-box (PUB) proteins play crucial regulatory roles in various developmental and physiological processes. Previously, 64 and 77 PUB genes have been identified in Arabidopsis (Arabidopsis thaliana) and rice (Oryza sativa), respectively. In this study, 101 putative PUB genes were identified in the Chinese cabbage (Brassica rapa ssp. pekinensis line Chiifu-401-42) genome and compared with other 15 representative plants. By specific protein domains and a phylogenetic analysis, the B. rapa PUB (BrPUB) gene family was subdivided into 10 groups. Localization of BrPUB genes showed an uneven distribution on the ten chromosomes of B. rapa. The orthologous and co-orthologous PUB gene pairs were identified between B. rapa and A. thaliana. RNA-seq transcriptome data of different tissues revealed tissue-specific and differential expression profiles of the BrPUBs, and quantitative real-time PCR analysis showed inverse gene expression patterns of the BrPUB-ARMs in response to cold and heat stresses. Altogether, the identification, classification, phylogenetic analysis, chromosome distribution, conserved motifs, and expression patterns of BrPUBs were predicted and analysed. Importantly, this study of BrPUBs provides a rich resource that will aid in the determination of PUB functions in plant development.

  16. Genome-wide identification, classification, and analysis of heat shock transcription factor family in Chinese cabbage (Brassica rapa pekinensis).

    PubMed

    Huang, X Y; Tao, P; Li, B Y; Wang, W H; Yue, Z C; Lei, J L; Zhong, X M

    2015-03-27

    Chinese cabbage (Brassica rapa ssp. pekinensis) is one of the most important vegetable crops grown worldwide, and various methods exist for selection, propagation, and cultivation. The entire Chinese cabbage genome has been sequenced, and the heat shock transcription factor family (Hsfs) has been found to play a central role in plant growth and development and in the response to biotic and abiotic stress conditions, particularly in acquired thermotolerance. We analyzed heat tolerance mechanisms in Chinese cabbage. In this study, 30 Hsfs were identified from the Chinese cabbage genome database. The classification, phylogenetic reconstruction, chromosome distribution, conserved motifs, expression analysis, and interaction networks of the Hsfs were predicted and analyzed. Thirty BrHsfs were classified into 3 major classes (class A, B, and C) according to their structural characteristics and phylogenetic comparisons, and class A was further subdivided into 8 subclasses. Distribution mapping results showed that Hsf genes were located on 10 Chinese cabbage chromosomes. The expression profile indicated that Hsfs play differential roles in 5 organs in Chinese cabbage, and likely participate in the development of underground parts and regulation of reproductive growth. An orthologous gene interaction network was constructed, and included MBF1C, ROF1, TBP2, CDC2, and HSP70 5 genes, which are closely related to heat stress. Our results contribute to the understanding of the complexity of Hsfs in Chinese cabbage and provide a basis for further functional gene research.

  17. Genome wide identification of Dof transcription factor gene family in sorghum and its comparative phylogenetic analysis with rice and Arabidopsis.

    PubMed

    Kushwaha, Hariom; Gupta, Shubhra; Singh, Vinay Kumar; Rastogi, Smita; Yadav, Dinesh

    2011-11-01

    The Dof (DNA binding with One Finger) family represents a classic zinc-finger transcription factors involved with multifarious roles exclusively in plants. There exists great diversity in terms of number of Dof genes observed in different crops. In current study, a total of 28 putative Dof genes have been predicted in silico from the recently available whole genome shotgun sequence of Sorghum bicolor (L.) Moench (with assigned accession numbers TPA:BK006983-BK007006 and TPA:BK007079-BK007082). The predicted SbDof genes are distributed on nine out of ten chromosomes of sorghum and most of these genes lack introns based on canonical intron/exon structure. Phylogenetic analysis of 28 SbDof proteins resulted in four subgroups constituting six clusters. The comparative phylogenetic analysis of these Dof proteins along with 30 rice and 36 Arabidopsis Dof proteins revealed six major groups similar to what has been observed earlier for rice and Arabidopsis. Motif analysis revealed the presence of conserved 50-52 amino acids Dof domain uniformly distributed across all the 28 Dof proteins of sorghum. The in silico cis-regulatory elements analysis of these SbDof genes suggested its diverse functions associated with light responsiveness, endosperm specific gene expression, hormone responsiveness, meristem specific expression and stress responsiveness.

  18. Genome-Wide Identification and Expression Analysis of the Mitogen-Activated Protein Kinase Gene Family in Cassava

    PubMed Central

    Yan, Yan; Wang, Lianzhe; Ding, Zehong; Tie, Weiwei; Ding, Xupo; Zeng, Changying; Wei, Yunxie; Zhao, Hongliang; Peng, Ming; Hu, Wei

    2016-01-01

    Mitogen-activated protein kinases (MAPKs) play central roles in plant developmental processes, hormone signaling transduction, and responses to abiotic stress. However, no data are currently available about the MAPK family in cassava, an important tropical crop. Herein, 21 MeMAPK genes were identified from cassava. Phylogenetic analysis indicated that MeMAPKs could be classified into four subfamilies. Gene structure analysis demonstrated that the number of introns in MeMAPK genes ranged from 1 to 10, suggesting large variation among cassava MAPK genes. Conserved motif analysis indicated that all MeMAPKs had typical protein kinase domains. Transcriptomic analysis suggested that MeMAPK genes showed differential expression patterns in distinct tissues and in response to drought stress between wild subspecies and cultivated varieties. Interaction networks and co-expression analyses revealed that crucial pathways controlled by MeMAPK networks may be involved in the differential response to drought stress in different accessions of cassava. Expression of nine selected MAPK genes showed that these genes could comprehensively respond to osmotic, salt, cold, oxidative stressors, and abscisic acid (ABA) signaling. These findings yield new insights into the transcriptional control of MAPK gene expression, provide an improved understanding of abiotic stress responses and signaling transduction in cassava, and lead to potential applications in the genetic improvement of cassava cultivars. PMID:27625666

  19. Genome-Wide Identification and Expression Analyses of Aquaporin Gene Family during Development and Abiotic Stress in Banana.

    PubMed

    Hu, Wei; Hou, Xiaowan; Huang, Chao; Yan, Yan; Tie, Weiwei; Ding, Zehong; Wei, Yunxie; Liu, Juhua; Miao, Hongxia; Lu, Zhiwei; Li, Meiying; Xu, Biyu; Jin, Zhiqiang

    2015-08-20

    Aquaporins (AQPs) function to selectively control the flow of water and other small molecules through biological membranes, playing crucial roles in various biological processes. However, little information is available on the AQP gene family in bananas. In this study, we identified 47 banana AQP genes based on the banana genome sequence. Evolutionary analysis of AQPs from banana, Arabidopsis, poplar, and rice indicated that banana AQPs (MaAQPs) were clustered into four subfamilies. Conserved motif analysis showed that all banana AQPs contained the typical AQP-like or major intrinsic protein (MIP) domain. Gene structure analysis suggested the majority of MaAQPs had two to four introns with a highly specific number and length for each subfamily. Expression analysis of MaAQP genes during fruit development and postharvest ripening showed that some MaAQP genes exhibited high expression levels during these stages, indicating the involvement of MaAQP genes in banana fruit development and ripening. Additionally, some MaAQP genes showed strong induction after stress treatment and therefore, may represent potential candidates for improving banana resistance to abiotic stress. Taken together, this study identified some excellent tissue-specific, fruit development- and ripening-dependent, and abiotic stress-responsive candidate MaAQP genes, which could lay a solid foundation for genetic improvement of banana cultivars.

  20. Genome-wide analysis of primary auxin-responsive Aux/IAA gene family in maize (Zea mays. L.).

    PubMed

    Wang, Yijun; Deng, Dexiang; Bian, Yunlong; Lv, Yanping; Xie, Qin

    2010-12-01

    The phytohormone auxin is important in various aspects of organism growth and development. Aux/IAA genes encoding short-lived nuclear proteins are responsive primarily to auxin induction. Despite their physiological importance, systematic analysis of Aux/IAA genes in maize have not yet been reported. In this paper, we presented the isolation and characterization of maize Aux/IAA genes in whole-genome scale. A total of 31 maize Aux/IAA genes (ZmIAA1 to ZmIAA31) were identified. ZmIAA genes are distributed in all the maize chromosomes except chromosome 2. Aux/IAA genes expand in the maize genome partly due to tandem and segmental duplication events. Multiple alignment and motif display results revealed major maize Aux/IAA proteins share all the four conserved domains. Phylogenetic analysis indicated Aux/IAA family can be divided into seven subfamilies. Putative cis-acting regulatory DNA elements involved in auxin response, light signaling transduction and abiotic stress adaption were observed in the promoters of ZmIAA genes. Expression data mining suggested maize Aux/IAA genes have temporal and spatial expression pattern. Collectively, these results will provide molecular insights into the auxin metabolism, transport and signaling research.

  1. Genome-wide characterization of the β-1,3-glucanase gene family in Gossypium by comparative analysis

    PubMed Central

    Xu, Xiaoyang; Feng, Yue; Fang, Shuai; Xu, Jun; Wang, Xinyu; Guo, Wangzhen

    2016-01-01

    The β-1,3-glucanase gene family is involved in a wide range of plant developmental processes as well as pathogen defense mechanisms. Comprehensive analyses of β-1,3-glucanase genes (GLUs) have not been reported in cotton. Here, we identified 67, 68, 130 and 158 GLUs in four sequenced cotton species, G. raimondii (D5), G. arboreum (A2), G. hirsutum acc. TM-1 (AD1), and G. barbadense acc. 3–79 (AD2), respectively. Cotton GLUs can be classified into the eight subfamilies (A–H), and their protein domain architecture and intron/exon structure are relatively conserved within each subfamily. Sixty-seven GLUs in G. raimondii were anchored onto 13 chromosomes, with 27 genes involved in segmental duplications, and 13 in tandem duplications. Expression patterns showed highly developmental and spatial regulation of GLUs in TM-1. In particular, the expression of individual member of GLUs in subfamily E was limited to roots, leaves, floral organs or fibers. Members of subfamily E also showed more protein evolution and subgenome expression bias compared with members of other subfamilies. We clarified that GLU42 and GLU43 in subfamily E were preferentially expressed in root and leaf tissues and significantly upregulated after Verticillium dahliae inoculation. Silencing of GLU42 and GLU43 significantly increased the susceptibility of cotton to V. dahliae. PMID:27353015

  2. Genome-wide identification and evolution of the PIN-FORMED (PIN) gene family in Glycine max.

    PubMed

    Liu, Yuan; Wei, Haichao

    2017-03-17

    Soybean is one of the most important crop plants, wild and cultivated soybean varieties have significant differences worth further investigation, such as plant morphology, seed size, seed coat development, these characters may be related to auxin biology. The PIN family is an essential auxin polar transport gene, but little research on soybean PINs (GmPINs) has been done, especially with respect to evolution and the differences between wild and cultivated soybean. In this study, we retrieved 23 GmPINs from the latest updated G. max genome database; six GmPINs protein sequences were changed compared with the previous database. Based on the Plant Genome Duplication Database, 18 GmPINs have been involved in segment duplication. Three pairs of GmPINs arose after the second soybean genome duplication, and six occurred after the first genome duplication. The duplicated GmPINs retained similar expression patterns. All the duplicated GmPINs experienced purifying selection (Ka/Ks<1) to prevent accumulation of non-synonymous mutations and so became more similar. In addition, we also focused on the artificial selection of the soybean PIN genes. Five artificially selected GmPINs were identified by comparing the genome sequence of 17 wild and 14 cultivated soybean varieties. Our research provides useful and comprehensive basic information for understanding GmPINs.

  3. Genome-Wide Identification, Classification, and Expression Analysis of Amino Acid Transporter Gene Family in Glycine Max.

    PubMed

    Cheng, Lin; Yuan, Hong-Yu; Ren, Ren; Zhao, Shi-Qi; Han, Ya-Peng; Zhou, Qi-Ying; Ke, Dan-Xia; Wang, Ying-Xiang; Wang, Lei

    2016-01-01

    Amino acid transporters (AATs) play important roles in transporting amino acid across cellular membranes and are essential for plant growth and development. To date, the AAT gene family in soybean (Glycine max L.) has not been characterized. In this study, we identified 189 AAT genes from the entire soybean genomic sequence, and classified them into 12 distinct subfamilies based upon their sequence composition and phylogenetic positions. To further investigate the functions of these genes, we analyzed the chromosome distributions, gene structures, duplication patterns, phylogenetic tree, tissue expression patterns of the 189 AAT genes in soybean. We found that a large number of AAT genes in soybean were expanded via gene duplication, 46 and 36 GmAAT genes were WGD/segmental and tandemly duplicated, respectively. Further comprehensive analyses of the expression profiles of GmAAT genes in various stages of vegetative and reproductive development showed that soybean AAT genes exhibited preferential or distinct expression patterns among different tissues. Overall, our study provides a framework for further analysis of the biological functions of AAT genes in either soybean or other crops.

  4. Genome-Wide Identification, Evolution and Expression Analysis of the Grape (Vitis vinifera L.) Zinc Finger-Homeodomain Gene Family

    PubMed Central

    Wang, Hao; Yin, Xiangjing; Li, Xiaoqin; Wang, Li; Zheng, Yi; Xu, Xiaozhao; Zhang, Yucheng; Wang, Xiping

    2014-01-01

    Plant zinc finger-homeodomain (ZHD) genes encode a family of transcription factors that have been demonstrated to play an important role in the regulation of plant growth and development. In this study, we identified a total of 13 ZHD genes (VvZHD) in the grape genome that were further classified into at least seven groups. Genome synteny analysis revealed that a number of VvZHD genes were present in the corresponding syntenic blocks of Arabidopsis, indicating that they arose before the divergence of these two species. Gene expression analysis showed that the identified VvZHD genes displayed distinct spatiotemporal expression patterns, and were differentially regulated under various stress conditions and hormone treatments, suggesting that the grape VvZHDs might be also involved in plant response to a variety of biotic and abiotic insults. Our work provides insightful information and knowledge about the ZHD genes in grape, which provides a framework for further characterization of their roles in regulation of stress tolerance as well as other aspects of grape productivity. PMID:24705465

  5. Genome-Wide Identification of bZIP Family Genes Involved in Drought and Heat Stresses in Strawberry (Fragaria vesca).

    PubMed

    Wang, Xiao-Long; Chen, Xinlu; Yang, Tian-Bao; Cheng, Qunkang; Cheng, Zong-Ming

    2017-01-01

    Basic leucine zipper (bZIP) genes are known to play a crucial role in response to various processes in plant as well as abiotic or biotic stress challenges. We have performed an identification and characterization of 50 bZIP genes across the woodland strawberry (Fragaria vesca) genome, which were divided into 10 clades according to the phylogenetic relationship of the strawberry bZIP proteins with those in Arabidopsis and rice. Five categories of intron patterns were observed within basic and hinge regions of the bZIP domains. Some additional conserved motifs have been found with the group specificity. Further, we predicted DNA-binding specificity of the basic and hinge regions as well as dimerization properties of leucine zipper regions, which was consistent with our phylogenetic clade and classified into 20 subfamilies. Across the different developmental stages of 15 organs and two types of fruits, the clade A bZIP members showed different tissue-specific expression patterns and the duplicated genes were differentially regulated, indicating a functional diversification coupled with the expansion of this gene family in strawberry. Under normal growth conditions, mrna11837 and mrna30280 of clade A showed very weak expression levels in organs and fruits, respectively; but higher expression was observed with different set of genes following drought and heat treatment, which may be caused by the separate response pathway between drought and heat treatments.

  6. Genome-Wide Identification and Expression Analyses of Aquaporin Gene Family during Development and Abiotic Stress in Banana

    PubMed Central

    Hu, Wei; Hou, Xiaowan; Huang, Chao; Yan, Yan; Tie, Weiwei; Ding, Zehong; Wei, Yunxie; Liu, Juhua; Miao, Hongxia; Lu, Zhiwei; Li, Meiying; Xu, Biyu; Jin, Zhiqiang

    2015-01-01

    Aquaporins (AQPs) function to selectively control the flow of water and other small molecules through biological membranes, playing crucial roles in various biological processes. However, little information is available on the AQP gene family in bananas. In this study, we identified 47 banana AQP genes based on the banana genome sequence. Evolutionary analysis of AQPs from banana, Arabidopsis, poplar, and rice indicated that banana AQPs (MaAQPs) were clustered into four subfamilies. Conserved motif analysis showed that all banana AQPs contained the typical AQP-like or major intrinsic protein (MIP) domain. Gene structure analysis suggested the majority of MaAQPs had two to four introns with a highly specific number and length for each subfamily. Expression analysis of MaAQP genes during fruit development and postharvest ripening showed that some MaAQP genes exhibited high expression levels during these stages, indicating the involvement of MaAQP genes in banana fruit development and ripening. Additionally, some MaAQP genes showed strong induction after stress treatment and therefore, may represent potential candidates for improving banana resistance to abiotic stress. Taken together, this study identified some excellent tissue-specific, fruit development- and ripening-dependent, and abiotic stress-responsive candidate MaAQP genes, which could lay a solid foundation for genetic improvement of banana cultivars. PMID:26307965

  7. Genome-wide identification, evolution and expression analysis of the grape (Vitis vinifera L.) zinc finger-homeodomain gene family.

    PubMed

    Wang, Hao; Yin, Xiangjing; Li, Xiaoqin; Wang, Li; Zheng, Yi; Xu, Xiaozhao; Zhang, Yucheng; Wang, Xiping

    2014-04-03

    Plant zinc finger-homeodomain (ZHD) genes encode a family of transcription factors that have been demonstrated to play an important role in the regulation of plant growth and development. In this study, we identified a total of 13 ZHD genes (VvZHD) in the grape genome that were further classified into at least seven groups. Genome synteny analysis revealed that a number of VvZHD genes were present in the corresponding syntenic blocks of Arabidopsis, indicating that they arose before the divergence of these two species. Gene expression analysis showed that the identified VvZHD genes displayed distinct spatiotemporal expression patterns, and were differentially regulated under various stress conditions and hormone treatments, suggesting that the grape VvZHDs might be also involved in plant response to a variety of biotic and abiotic insults. Our work provides insightful information and knowledge about the ZHD genes in grape, which provides a framework for further characterization of their roles in regulation of stress tolerance as well as other aspects of grape productivity.

  8. Genome Wide Analysis of the Apple MYB Transcription Factor Family Allows the Identification of MdoMYB121 Gene Confering Abiotic Stress Tolerance in Plants

    PubMed Central

    Wang, Rong-Kai; Zhang, Rui-Fen; Hao, Yu-Jin

    2013-01-01

    The MYB proteins comprise one of the largest families of transcription factors (TFs) in plants. Although several MYB genes have been characterized to play roles in secondary metabolism, the MYB family has not yet been identified in apple. In this study, 229 apple MYB genes were identified through a genome-wide analysis and divided into 45 subgroups. A computational analysis was conducted using the apple genomic database to yield a complete overview of the MYB family, including the intron-exon organizations, the sequence features of the MYB DNA-binding domains, the carboxy-terminal motifs, and the chromosomal locations. Subsequently, the expression of 18 MYB genes, including 12 were chosen from stress-related subgroups, while another 6 ones from other subgroups, in response to various abiotic stresses was examined. It was found that several of these MYB genes, particularly MdoMYB121, were induced by multiple stresses. The MdoMYB121 was then further functionally characterized. Its predicted protein was found to be localized in the nucleus. A transgenic analysis indicated that the overexpression of the MdoMYB121 gene remarkably enhanced the tolerance to high salinity, drought, and cold stresses in transgenic tomato and apple plants. Our results indicate that the MYB genes are highly conserved in plant species and that MdoMYB121 can be used as a target gene in genetic engineering approaches to improve the tolerance of plants to multiple abiotic stresses. PMID:23950843

  9. Genome-wide analysis of DUF221 domain-containing gene family in Oryza species and identification of its salinity stress-responsive members in rice.

    PubMed

    Ganie, Showkat Ahmad; Pani, Dipti Ranjan; Mondal, Tapan Kumar

    2017-01-01

    DUF221 domain-containing genes (DDP genes) play important roles in developmental biology, hormone signalling transduction, and responses to abiotic stress. Therefore to understand their structural and evolutionary relationship, we did a genome-wide analysis of this important gene family in rice. Further, through comparative genomics, DDP genes from Oryza sativa subsp. (indica), nine different wild species of rice and Arabidopsis were also identified. We also found an expansion of the DDP gene families in rice and Arabidopsis which is due to the segmental duplication events in some of the gene family members. In general, a highly purifying selection was found acting on all the deduced paralogous and orthologous DDP gene pairs. The data from microarray and subsequent qRT-PCR analysis revealed that although several OsDDPs were differentially regulated under salinity stress, yet OsDDP6 was upregulated at all the developmental stages in salt tolerant rice genotype, FL478. Interestingly, OsDDP6 was found to be involved in proline metabolism pathway as indicated by protein network analysis. The diverse gene structures, varied transmembrane topologies and the differential expression patterns implied the functional diversity in DDP genes. Therefore, the comprehensive evolutionary analysis of DDP genes from different Oryza species and Arabidopsis performed in this study will provide the basis for further functional validation studies vis-à-vis DDP genes of rice and other plant species.

  10. Genome-wide analysis of DUF221 domain-containing gene family in Oryza species and identification of its salinity stress-responsive members in rice

    PubMed Central

    Ganie, Showkat Ahmad; Pani, Dipti Ranjan

    2017-01-01

    DUF221 domain-containing genes (DDP genes) play important roles in developmental biology, hormone signalling transduction, and responses to abiotic stress. Therefore to understand their structural and evolutionary relationship, we did a genome-wide analysis of this important gene family in rice. Further, through comparative genomics, DDP genes from Oryza sativa subsp. (indica), nine different wild species of rice and Arabidopsis were also identified. We also found an expansion of the DDP gene families in rice and Arabidopsis which is due to the segmental duplication events in some of the gene family members. In general, a highly purifying selection was found acting on all the deduced paralogous and orthologous DDP gene pairs. The data from microarray and subsequent qRT-PCR analysis revealed that although several OsDDPs were differentially regulated under salinity stress, yet OsDDP6 was upregulated at all the developmental stages in salt tolerant rice genotype, FL478. Interestingly, OsDDP6 was found to be involved in proline metabolism pathway as indicated by protein network analysis. The diverse gene structures, varied transmembrane topologies and the differential expression patterns implied the functional diversity in DDP genes. Therefore, the comprehensive evolutionary analysis of DDP genes from different Oryza species and Arabidopsis performed in this study will provide the basis for further functional validation studies vis-à-vis DDP genes of rice and other plant species. PMID:28846681

  11. Genome-Wide Function, Evolutionary Characterization and Expression Analysis of Sugar Transporter Family Genes in Pear (Pyrus bretschneideri Rehd).

    PubMed

    Li, Jia-Ming; Zheng, Dan-man; Li, Lei-ting; Qiao, Xin; Wei, Shu-wei; Bai, Bin; Zhang, Shao-ling; Wu, Jun

    2015-09-01

    The sugar transporter (ST) plays an important role in plant growth, development and fruit quality. In this study, a total of 75 ST genes were identified in the pear (Pyrus bretschneideri Rehd) genome based on systematic analysis. Furthermore, all ST genes identified were grouped into eight subfamilies according to conserved domains and phylogenetic analysis. Analysis of cis-regulatory element sequences of all ST genes identified the MYBCOREATCYCB1 promoter in sucrose transporter (SUT) and monosaccharide transporter (MST) genes of pear, while in grape it is exclusively found in SUT subfamily members, indicating divergent transcriptional regulation in different species. Gene duplication event analysis indicated that whole-genome duplication (WGD) and segmental duplication play key roles in ST gene amplification, followed by tandem duplication. Estimation of positive selection at codon sites of ST paralog pairs indicated that all plastidic glucose translocator (pGlcT) subfamily members have evolved under positive selection. In addition, the evolutionary history of ST gene duplications indicated that the ST genes have experienced significant expansion in the whole ST gene family after the second WGD, especially after apple and pear divergence. According to the global RNA sequencing results of pear fruit development, gene expression profiling showed the expression of 53 STs. Combined with quantitative real-time PCR (qRT-PCR) analysis, two polyol/monosaccharide transporter (PLT) and three tonoplast monosaccharide transporter (tMT) members were identified as candidate genes, which may play important roles in sugar accumulation during pear fruit development and ripening. Identification of highly expressed STs in fruit is important for finding novel genes contributing to enhanced levels of sugar content in pear fruit.

  12. Genome-Wide Identification and Evolution Analysis of Trehalose-6-Phosphate Synthase Gene Family in Nelumbo nucifera

    PubMed Central

    Jin, Qijiang; Hu, Xin; Li, Xin; Wang, Bei; Wang, Yanjie; Jiang, Hongwei; Mattson, Neil; Xu, Yingchun

    2016-01-01

    Trehalose-6-phosphate synthase (TPS) plays a key role in plant carbohydrate metabolism and the perception of carbohydrate availability. In the present work, the publicly available Nelumbo nucifera (lotus) genome sequence database was analyzed which led to identification of nine lotus TPS genes (NnTPS). It was found that at least two introns are included in the coding sequences of NnTPS genes. When the motif compositions were analyzed we found that NnTPS generally shared the similar motifs, implying that they have similar functions. The dN/dS ratios were always less than 1 for different domains and regions outside domains, suggesting purifying selection on the lotus TPS gene family. The regions outside TPS domain evolved relatively faster than NnTPS domains. A phylogenetic tree was constructed using all predicted coding sequences of lotus TPS genes, together with those from Arabidopsis, poplar, soybean, and rice. The result indicated that those TPS genes could be clearly divided into two main subfamilies (I-II), where each subfamily could be further divided into 2 (I) and 5 (II) subgroups. Analyses of divergence and adaptive evolution show that purifying selection may have been the main force driving evolution of plant TPS genes. Some of the critical sites that contributed to divergence may have been under positive selection. Transcriptome data analysis revealed that most NnTPS genes were predominantly expressed in sink tissues. Expression pattern of NnTPS genes under copper and submergence stress indicated that NNU_014679 and NNU_022788 might play important roles in lotus energy metabolism and participate in stress response. Our results can facilitate further functional studies of TPS genes in lotus. PMID:27746792

  13. Genome-wide identification and expression analysis of the WRKY gene family in common tobacco (Nicotiana tabacum L.).

    PubMed

    Xiaohua, Xiang; Xinru, Wu; Jiangtao, Chao; Minglei, Yang; Fan, Yang; Guo, Chen; Guanshan, Liu; Yuanying, Wang

    2016-09-01

    The coding products of WRKY gene family plays important roles in plant growth and development as well as in various stress responses. They have been identified in various plants, but only few in common tobacco (Nicotiana tabacum L.). In this study, 164 putative WRKY proteins in the common tobacco genome were identified by using the conserved WRKY sequence (PF03106) from the Pfam database. Phylogenetic trees, functional domain analysis, chromosomal localization, subcellular localization and tissue expression patterns were analyzed with the bioinformatics softwares, including DNAMAN 5.0, Weblogo 3, MEGA 5.1, MG2C and MEME. First of all, phylogenetic trees divided all the candidate genes into three subfamilies: Ⅰ, Ⅱ and Ⅲ, respectively, and subfamily Ⅱ could be further divided into five subgroups: group Ⅱ-a, -b, -c, -d and -e. Secondly, the WRKY regions contained a highly conserved heptapeptide stretch WRKYGQK followed by a zinc-finger motif. Most of the NtWRKY genes contained 2-5 exons and a highly conserved gene structure. Thirdly, 154 out of 164 NtWRKY genes were distributed with different densities on 24 chromosomes, and each subfamily with different patterns and frequency. The largest number of NtWRKY genes was found on chromosome VI, and only one on chromosome X. Fourthly, the majority of NtWRKY members located in the nucleus, with 74 percent of subfamily Ⅲ in the extracellular matrix. Lastly, the members in the same subfamily had different spatial and temporal expression profiles, with 11 NtWRKY genes in roots, stems and leaves expressed at various levels. The expression of genes NtWRKY26, NtWRKY30 and NtWRKY32 can be induced by Phytophthora nicotianae. Our research thus provides valuable information for NtWRKY gene cloning and functional characterization in common tobacco.

  14. Genome-wide identification of WRKY family genes and their response to cold stress in Vitis vinifera

    PubMed Central

    2014-01-01

    Background WRKY transcription factors are one of the largest families of transcriptional regulators in plants. WRKY genes are not only found to play significant roles in biotic and abiotic stress response, but also regulate growth and development. Grapevine (Vitis vinifera) production is largely limited by stressful climate conditions such as cold stress and the role of WRKY genes in the survival of grapevine under these conditions remains unknown. Results We identified a total of 59 VvWRKYs from the V. vinifera genome, belonging to four subgroups according to conserved WRKY domains and zinc-finger structure. The majority of VvWRKYs were expressed in more than one tissue among the 7 tissues examined which included young leaves, mature leaves, tendril, stem apex, root, young fruits and ripe fruits. Publicly available microarray data suggested that a subset of VvWRKYs was activated in response to diverse stresses. Quantitative real-time PCR (qRT-PCR) results demonstrated that the expression levels of 36 VvWRKYs are changed following cold exposure. Comparative analysis was performed on data from publicly available microarray experiments, previous global transcriptome analysis studies, and qRT-PCR. We identified 15 VvWRKYs in at least two of these databases which may relate to cold stress. Among them, the transcription of three genes can be induced by exogenous ABA application, suggesting that they can be involved in an ABA-dependent signaling pathway in response to cold stress. Conclusions We identified 59 VvWRKYs from the V. vinifera genome and 15 of them showed cold stress-induced expression patterns. These genes represented candidate genes for future functional analysis of VvWRKYs involved in the low temperature-related signal pathways in grape. PMID:24755338

  15. CCCH-type zinc finger family in maize: genome-wide identification, classification and expression profiling under abscisic acid and drought treatments.

    PubMed

    Peng, Xiaojian; Zhao, Yang; Cao, Jiangang; Zhang, Wei; Jiang, Haiyang; Li, Xiaoyu; Ma, Qing; Zhu, Suwen; Cheng, Beijiu

    2012-01-01

    CCCH-type zinc finger proteins comprise a large protein family. Increasing evidence suggests that members of this family are RNA-binding proteins with regulatory functions in mRNA processing. Compared with those in animals, functions of CCCH-type zinc finger proteins involved in plant growth and development are poorly understood. Here, we performed a genome-wide survey of CCCH-type zinc finger genes in maize (Zea mays L.) by describing the gene structure, phylogenetic relationships and chromosomal location of each family member. Promoter sequences and expression profiles of putative stress-responsive members were also investigated. A total of 68 CCCH genes (ZmC3H1-68) were identified in maize and divided into seven groups by phylogenetic analysis. These 68 genes were found to be unevenly distributed on 10 chromosomes with 15 segmental duplication events, suggesting that segmental duplication played a major role in expansion of the maize CCCH family. The Ka/Ks ratios suggested that the duplicated genes of the CCCH family mainly experienced purifying selection with limited functional divergence after duplication events. Twelve maize CCCH genes grouped with other known stress-responsive genes from Arabidopsis were found to contain putative stress-responsive cis-elements in their promoter regions. Seven of these genes chosen for further quantitative real-time PCR analysis showed differential expression patterns among five representative maize tissues and over time in response to abscisic acid and drought treatments. The results presented in this study provide basic information on maize CCCH proteins and form the foundation for future functional studies of these proteins, especially for those members of which may play important roles in response to abiotic stresses.

  16. The ARF, AUX/IAA and GH3 gene families in citrus: genome-wide identification and expression analysis during fruitlet drop from abscission zone A.

    PubMed

    Xie, Rangjin; Pang, Shaoping; Ma, Yanyan; Deng, Lie; He, Shaolan; Yi, Shilai; Lv, Qiang; Zheng, Yongqiang

    2015-12-01

    Completion of the whole genome sequencing of citrus enabled us to perform genome-wide identification and functional analysis of the gene families involved in agronomic traits and morphological diversity of citrus. In this study, 22 CitARF, 11 CitGH3 and 26 CitAUX/IAA genes were identified in citrus, respectively. Phylogenetic analysis revealed that all the genes of each gene family could be subdivided into three groups and showed strong evolutionary conservation. The GH3 and AUX/IAA gene families shrank and ARF gene family was highly conserved in the citrus genome after speciation from Arabidopsis thaliana. Tissue-specific expression profiles revealed that 54 genes were expressed in at least one tissue while just 5 genes including CitARF07, CitARF20, CitGH3.04, CitAUX/IAA25 and CitAUX/IAA26 with very low expression level in all tissues tested, suggesting that the CitARF, CitGH3 and CitAUX/IAA gene families played important roles in the development of citrus organs. In addition, our data found that the expression of 2 CitARF, 4 CitGH3 and 4 AUX/IAA genes was affected by IAA treatment, and 7 genes including, CitGH3.04, CitGH3.07, CitAUX/IAA03, CitAUX/IAA04, CitAUX/IAA18, CitAUX/IAA19 and CitAUX/IAA23 were related to fruitlet abscission. This study provides a foundation for future studies on elucidating the precise role of citrus ARF, GH3 and AUX/IAA genes in early steps of auxin signal transduction and open up a new opportunity to uncover the molecular mechanism underlying citrus fruitlet abscission.

  17. Multipoint genome-wide linkage scan for nonword repetition in a multigenerational family further supports chromosome 13q as a locus for verbal trait disorders.

    PubMed

    Truong, D T; Shriberg, L D; Smith, S D; Chapman, K L; Scheer-Cohen, A R; DeMille, M M C; Adams, A K; Nato, A Q; Wijsman, E M; Eicher, J D; Gruen, J R

    2016-12-01

    Verbal trait disorders encompass a wide range of conditions and are marked by deficits in five domains that impair a person's ability to communicate: speech, language, reading, spelling, and writing. Nonword repetition is a robust endophenotype for verbal trait disorders that is sensitive to cognitive processes critical to verbal development, including auditory processing, phonological working memory, and motor planning and programming. In the present study, we present a six-generation extended pedigree with a history of verbal trait disorders. Using genome-wide multipoint variance component linkage analysis of nonword repetition, we identified a region spanning chromosome 13q14-q21 with LOD = 4.45 between 52 and 55 cM, spanning approximately 5.5 Mb on chromosome 13. This region overlaps with SLI3, a locus implicated in reading disability in families with a history of specific language impairment. Our study of a large multigenerational family with verbal trait disorders further implicates the SLI3 region in verbal trait disorders. Future studies will further refine the specific causal genetic factors in this locus on chromosome 13q that contribute to language traits.

  18. Genome-wide analysis of UDP-glycosyltransferase super family in Brassica rapa and Brassica oleracea reveals its evolutionary history and functional characterization.

    PubMed

    Yu, Jingyin; Hu, Fan; Dossa, Komivi; Wang, Zhaokai; Ke, Tao

    2017-06-23

    Glycosyltransferases comprise a highly divergent and polyphyletic multigene family that is involved in widespread modification of plant secondary metabolites in a process called glycosylation. According to conserved domains identified in their amino acid sequences, these glycosyltransferases can be classified into a single UDP-glycosyltransferase (UGT) 1 superfamily. We performed genome-wide comparative analysis of UGT genes to trace evolutionary history in algae, bryophytes, pteridophytes, and angiosperms; then, we further investigated the expansion mechanisms and function characterization of UGT gene families in Brassica rapa and Brassica oleracea. Using Hidden Markov Model search, we identified 3, 21, 140, 200, 115, 147, and 147 UGTs in Chlamydomonas reinhardtii, Physcomitrella patens, Selaginella moellendorffii, Oryza sativa, Arabidopsis thaliana, B. rapa, and B. oleracea, respectively. Phylogenetic analysis revealed that UGT80 gene family is an ancient gene family, which is shared by all plants and UGT74 gene family is shared by ferns and angiosperms, but the remaining UGT gene families were shared by angiosperms. In dicot lineage, UGTs among three species were classified into three subgroups containing 3, 6, and 12 UGT gene families. Analysis of chromosomal distribution indicates that 98.6 and 71.4% of UGTs were located on B. rapa and B. oleracea pseudo-molecules, respectively. Expansion mechanism analyses uncovered that whole genome duplication event exerted larger influence than tandem duplication on expansion of UGT gene families in B. rapa, and B. oleracea. Analysis of selection forces of UGT orthologous gene pairs in B. rapa, and B. oleracea compared to A. thaliana suggested that orthologous genes in B. rapa, and B. oleracea have undergone negative selection, but there were no significant differences between A. thaliana -B. rapa and A. thaliana -B. oleracea lineages. Our comparisons of expression profiling illustrated that UGTs in B. rapa performed more

  19. Genome-Wide Analysis of the Fasciclin-Like Arabinogalactan Protein Gene Family Reveals Differential Expression Patterns, Localization, and Salt Stress Response in Populus

    PubMed Central

    Zang, Lina; Zheng, Tangchun; Chu, Yanguang; Ding, Changjun; Zhang, Weixi; Huang, Qinjun; Su, Xiaohua

    2015-01-01

    Fasciclin-like arabinogalactan proteins (FLAs) are a subclass of arabinogalactan proteins (AGPs) involved in plant growth, development and response to abiotic stress. Although many studies have been performed to identify molecular functions of individual family members, little information is available on genome-wide identification and characterization of FLAs in the genus Populus. Based on genome-wide analysis, we have identified 35 Populus FLAs which were distributed on 16 chromosomes and phylogenetically clustered into four major groups. Gene structure and motif composition were relatively conserved in each group. All the members contained N-terminal signal peptide, 23 of which included predicted glycosylphosphatidylinositol (GPI) modification sites and were anchored to plasma membranes. Subcellular localization analysis showed that PtrFLA2/20/26 were localized in cell membrane and cytoplasm of protoplasts from Populus stem-differentiating xylem. The Ka/Ks ratios showed that purifying selection has played a leading role in the long-term evolutionary period which greatly maintained the function of this family. The expression profiles showed that 32 PtrFLAs were differentially expressed in four tissues at four seasons based on publicly available microarray data. 18 FLAs were further verified with qRT-PCR in different tissues, which indicated that PtrFLA1/2/3/7/11/12/20/21/22/24/26/30 were significantly expressed in male and female flowers, suggesting close correlations with the reproductive development. In addition, PtrFLA1/9/10/11/17/21/23/24/26/28 were highly expressed in the stems and differentiating xylem, which may be involved in stem development. To determine salt response of FLAs, qRT-PCR was performed to analyze the expression of 18 genes under salinity stress across two time points. Results demonstrated that all the 18 FLAs were expressed in root tissues; especially, PtrFLA2/12/20/21/24/30 were significantly induced at different time points. In summary

  20. Genome-wide identification of sweet orange (Citrus sinensis) histone modification gene families and their expression analysis during the fruit development and fruit-blue mold infection process

    PubMed Central

    Xu, Jidi; Xu, Haidan; Liu, Yuanlong; Wang, Xia; Xu, Qiang; Deng, Xiuxin

    2015-01-01

    In eukaryotes, histone acetylation and methylation have been known to be involved in regulating diverse developmental processes and plant defense. These histone modification events are controlled by a series of histone modification gene families. To date, there is no study regarding genome-wide characterization of histone modification related genes in citrus species. Based on the two recent sequenced sweet orange genome databases, a total of 136 CsHMs (Citrus sinensis histone modification genes), including 47 CsHMTs (histone methyltransferase genes), 23 CsHDMs (histone demethylase genes), 50 CsHATs (histone acetyltransferase genes), and 16 CsHDACs (histone deacetylase genes) were identified. These genes were categorized to 11 gene families. A comprehensive analysis of these 11 gene families was performed with chromosome locations, phylogenetic comparison, gene structures, and conserved domain compositions of proteins. In order to gain an insight into the potential roles of these genes in citrus fruit development, 42 CsHMs with high mRNA abundance in fruit tissues were selected to further analyze their expression profiles at six stages of fruit development. Interestingly, a numbers of genes were expressed highly in flesh of ripening fruit and some of them showed the increasing expression levels along with the fruit development. Furthermore, we analyzed the expression patterns of all 136 CsHMs response to the infection of blue mold (Penicillium digitatum), which is the most devastating pathogen in citrus post-harvest process. The results indicated that 20 of them showed the strong alterations of their expression levels during the fruit-pathogen infection. In conclusion, this study presents a comprehensive analysis of the histone modification gene families in sweet orange and further elucidates their behaviors during the fruit development and the blue mold infection responses. PMID:26300904

  1. Genome-wide analysis of the CCCH zinc finger family identifies tissue specific and stress responsive candidates in chickpea (Cicer arietinum L.).

    PubMed

    Pradhan, Seema; Kant, Chandra; Verma, Subodh; Bhatia, Sabhyata

    2017-01-01

    The CCCH zinc finger is a group of proteins characterised by a typical motif consisting of three cysteine residues and one histidine residue. These proteins have been reported to play important roles in regulation of plant growth, developmental processes and environmental responses. In the present study, genome wide analysis of the CCCH zinc finger gene family was carried out in the available chickpea genome. Various bioinformatics tools were employed to predict 58 CCCH zinc finger genes in chickpea (designated CarC3H1-58), which were analysed for their physio-chemical properties. Phylogenetic analysis classified the proteins into 12 groups in which members of a particular group had similar structural organization. Further, the numbers as well as the types of CCCH motifs present in the CarC3H proteins were compared with those from Arabidopsis and Medicago truncatula. Synteny analysis revealed valuable information regarding the evolution of this gene family. Tandem and segmental duplication events were identified and their Ka/Ks values revealed that the CarC3H gene family in chickpea had undergone purifying selection. Digital, as well as real time qRT-PCR expression analysis was performed which helped in identification of several CarC3H members that expressed preferentially in specific chickpea tissues as well as during abiotic stresses (desiccation, cold, salinity). Moreover, molecular characterization of an important member CarC3H45 was carried out. This study provides comprehensive genomic information about the important CCCH zinc finger gene family in chickpea. The identified tissue specific and abiotic stress specific CCCH genes could be potential candidates for further characterization to delineate their functional roles in development and stress.

  2. Genome-wide identification of sweet orange (Citrus sinensis) histone modification gene families and their expression analysis during the fruit development and fruit-blue mold infection process.

    PubMed

    Xu, Jidi; Xu, Haidan; Liu, Yuanlong; Wang, Xia; Xu, Qiang; Deng, Xiuxin

    2015-01-01

    In eukaryotes, histone acetylation and methylation have been known to be involved in regulating diverse developmental processes and plant defense. These histone modification events are controlled by a series of histone modification gene families. To date, there is no study regarding genome-wide characterization of histone modification related genes in citrus species. Based on the two recent sequenced sweet orange genome databases, a total of 136 CsHMs (Citrus sinensis histone modification genes), including 47 CsHMTs (histone methyltransferase genes), 23 CsHDMs (histone demethylase genes), 50 CsHATs (histone acetyltransferase genes), and 16 CsHDACs (histone deacetylase genes) were identified. These genes were categorized to 11 gene families. A comprehensive analysis of these 11 gene families was performed with chromosome locations, phylogenetic comparison, gene structures, and conserved domain compositions of proteins. In order to gain an insight into the potential roles of these genes in citrus fruit development, 42 CsHMs with high mRNA abundance in fruit tissues were selected to further analyze their expression profiles at six stages of fruit development. Interestingly, a numbers of genes were expressed highly in flesh of ripening fruit and some of them showed the increasing expression levels along with the fruit development. Furthermore, we analyzed the expression patterns of all 136 CsHMs response to the infection of blue mold (Penicillium digitatum), which is the most devastating pathogen in citrus post-harvest process. The results indicated that 20 of them showed the strong alterations of their expression levels during the fruit-pathogen infection. In conclusion, this study presents a comprehensive analysis of the histone modification gene families in sweet orange and further elucidates their behaviors during the fruit development and the blue mold infection responses.

  3. Genome-Wide Study of the Tomato SlMLO Gene Family and Its Functional Characterization in Response to the Powdery Mildew Fungus Oidium neolycopersici.

    PubMed

    Zheng, Zheng; Appiano, Michela; Pavan, Stefano; Bracuto, Valentina; Ricciardi, Luigi; Visser, Richard G F; Wolters, Anne-Marie A; Bai, Yuling

    2016-01-01

    The MLO (Mildew Locus O) gene family encodes plant-specific proteins containing seven transmembrane domains and likely acting in signal transduction in a calcium and calmodulin dependent manner. Some members of the MLO family are susceptibility factors toward fungi causing the powdery mildew disease. In tomato, for example, the loss-of-function of the MLO gene SlMLO1 leads to a particular form of powdery mildew resistance, called ol-2, which arrests almost completely fungal penetration. This type of penetration resistance is characterized by the apposition of papillae at the sites of plant-pathogen interaction. Other MLO homologs in Arabidopsis regulate root response to mechanical stimuli (AtMLO4 and AtMLO11) and pollen tube reception by the female gametophyte (AtMLO7). However, the role of most MLO genes remains unknown. In this work, we provide a genome-wide study of the tomato SlMLO gene family. Besides SlMLO1, other 15 SlMLO homologs were identified and characterized with respect to their structure, genomic organization, phylogenetic relationship, and expression profile. In addition, by analysis of transgenic plants, we demonstrated that simultaneous silencing of SlMLO1 and two of its closely related homologs, SlMLO5 and SlMLO8, confer higher level of resistance than the one associated with the ol-2 mutation. The outcome of this study provides evidence for functional redundancy among tomato homolog genes involved in powdery mildew susceptibility. Moreover, we developed a series of transgenic lines silenced for individual SlMLO homologs, which lay the foundation for further investigations aimed at assigning new biological functions to the MLO gene family.

  4. Genome-wide analysis of ATP-binding cassette (ABC) proteins in a model legume plant, Lotus japonicus: comparison with Arabidopsis ABC protein family.

    PubMed

    Sugiyama, Akifumi; Shitan, Nobukazu; Sato, Shusei; Nakamura, Yasukazu; Tabata, Satoshi; Yazaki, Kazufumi

    2006-10-31

    ATP-binding cassette (ABC) proteins constitute a large family in plants with more than 120 members each in Arabidopsis and rice, and have various functions including the transport of auxin and alkaloid, as well as the regulation of stomata movement. In this report, we carried out genome-wide analysis of ABC protein genes in a model legume plant, Lotus japonicus. For analysis of the Lotus genome sequence, we devised a new method 'domain-based clustering analysis', where domain structures like the nucleotide-binding domain (NBD) and transmembrane domain (TMD), instead of full-length amino acid sequences, are used to compare phylogenetically each other. This method enabled us to characterize fragments of ABC proteins, which frequently appear in a draft sequence of the Lotus genome. We identified 91 putative ABC proteins in L. japonicus, i.e. 43 'full-size', 40 'half-size' and 18 'soluble' putative ABC proteins. The characteristic feature of the composition is that Lotus has extraordinarily many paralogs similar to AtMRP14 and AtPDR12, which are at least six and five members, respectively. Expression analysis of the latter genes performed with real-time quantitative reverse transcription-PCR revealed their putative involvement in the nodulation process.

  5. Genome-Wide Analysis of the Expression of WRKY Family Genes in Different Developmental Stages of Wild Strawberry (Fragaria vesca) Fruit.

    PubMed

    Zhou, Heying; Li, Yuxuan; Zhang, Qing; Ren, Suyue; Shen, Yuanyue; Qin, Ling; Xing, Yu

    2016-01-01

    WRKY proteins play important regulatory roles in plant developmental processes such as senescence, trichome initiation and embryo morphogenesis. In strawberry, only FaWRKY1 (Fragaria × ananassa) has been characterized, leaving numerous WRKY genes to be identified and their function characterized. The publication of the draft genome sequence of the strawberry genome allowed us to conduct a genome-wide search for WRKY proteins in Fragaria vesca, and to compare the identified proteins with their homologs in model plants. Fifty-nine FvWRKY genes were identified and annotated from the F. vesca genome. Detailed analysis, including gene classification, annotation, phylogenetic evaluation, conserved motif determination and expression profiling, based on RNA-seq data, were performed on all members of the family. Additionally, the expression patterns of the WRKY genes in different fruit developmental stages were further investigated using qRT-PCR, to provide a foundation for further comparative genomics and functional studies of this important class of transcriptional regulators in strawberry.

  6. Genome-Wide Analysis of the Expression of WRKY Family Genes in Different Developmental Stages of Wild Strawberry (Fragaria vesca) Fruit

    PubMed Central

    Zhang, Qing; Ren, Suyue; Shen, Yuanyue; Qin, Ling; Xing, Yu

    2016-01-01

    WRKY proteins play important regulatory roles in plant developmental processes such as senescence, trichome initiation and embryo morphogenesis. In strawberry, only FaWRKY1 (Fragaria × ananassa) has been characterized, leaving numerous WRKY genes to be identified and their function characterized. The publication of the draft genome sequence of the strawberry genome allowed us to conduct a genome-wide search for WRKY proteins in Fragaria vesca, and to compare the identified proteins with their homologs in model plants. Fifty-nine FvWRKY genes were identified and annotated from the F. vesca genome. Detailed analysis, including gene classification, annotation, phylogenetic evaluation, conserved motif determination and expression profiling, based on RNA-seq data, were performed on all members of the family. Additionally, the expression patterns of the WRKY genes in different fruit developmental stages were further investigated using qRT-PCR, to provide a foundation for further comparative genomics and functional studies of this important class of transcriptional regulators in strawberry. PMID:27138272

  7. Genome-Wide Identification and Expression Profiling Analysis of the Aux/IAA Gene Family in Medicago truncatula during the Early Phase of Sinorhizobium meliloti Infection

    PubMed Central

    Zhang, Lei; Sun, Tao; Xu, Luqin; Tie, Shuanggui; Wang, Huizhong

    2014-01-01

    Background Auxin/indoleacetic acid (Aux/IAA) genes, coding a family of short-lived nuclear proteins, play key roles in wide variety of plant developmental processes, including root system regulation and responses to environmental stimulus. However, how they function in auxin signaling pathway and symbiosis with rhizobial in Medicago truncatula are largely unknown. The present study aims at gaining deeper insight on distinctive expression and function features of Aux/IAA family genes in Medicago truncatula during nodule formation. Principal Findings Using the latest updated draft of the full Medicago truncatula genome, a comprehensive identification and analysis of IAA genes were performed. The data indicated that MtIAA family genes are distributed in all the M. truncatula chromosomes except chromosome 6. Most of MtIAA genes are responsive to exogenous auxin and express in tissues-specific manner. To understand the biological functions of MtIAA genes involved in nodule formation, quantitative real-time polymerase chain reaction (qRT-PCR) was used to test the expression profiling of MtIAA genes during the early phase of Sinorhizobium meliloti (S. meliloti) infection. The expression patterns of most MtIAA genes were down-regulated in roots and up-regulated in shoots by S. meliloti infection. The differences in expression responses between roots and shoots caused by S. meliloti infection were alleviated by 1-NOA application. Conclusion The genome-wide identification, evolution and expression pattern analysis of MtIAA genes were performed in this study. The data helps us to understand the roles of MtIAA-mediated auxin signaling in nodule formation during the early phase of S. meliloti infection. PMID:25226164

  8. Genome-wide organization and expression profiling of the R2R3-MYB transcription factor family in pineapple (Ananas comosus).

    PubMed

    Liu, Chaoyang; Xie, Tao; Chen, Chenjie; Luan, Aiping; Long, Jianmei; Li, Chuhao; Ding, Yaqi; He, Yehua

    2017-07-01

    The MYB proteins comprise one of the largest families of plant transcription factors, which are involved in various plant physiological and biochemical processes. Pineapple (Ananas comosus) is one of three most important tropical fruits worldwide. The completion of pineapple genome sequencing provides a great opportunity to investigate the organization and evolutionary traits of pineapple MYB genes at the genome-wide level. In the present study, a total of 94 pineapple R2R3-MYB genes were identified and further phylogenetically classified into 26 subfamilies, as supported by the conserved gene structures and motif composition. Collinearity analysis indicated that the segmental duplication events played a crucial role in the expansion of pineapple MYB gene family. Further comparative phylogenetic analysis suggested that there have been functional divergences of MYB gene family during plant evolution. RNA-seq data from different tissues and developmental stages revealed distinct temporal and spatial expression profiles of the AcMYB genes. Further quantitative expression analysis showed the specific expression patterns of the selected putative stress-related AcMYB genes in response to distinct abiotic stress and hormonal treatments. The comprehensive expression analysis of the pineapple MYB genes, especially the tissue-preferential and stress-responsive genes, could provide valuable clues for further function characterization. In this work, we systematically identified AcMYB genes by analyzing the pineapple genome sequence using a set of bioinformatics approaches. Our findings provide a global insight into the organization, phylogeny and expression patterns of the pineapple R2R3-MYB genes, and hence contribute to the greater understanding of their biological roles in pineapple.

  9. Genome-Wide Transcription Profiles Reveal Genotype-Dependent Responses of Biological Pathways and Gene-Families in Daphnia Exposed to Single and Mixed Stressors

    PubMed Central

    2015-01-01

    The present study investigated the possibilities and limitations of implementing a genome-wide transcription-based approach that takes into account genetic and environmental variation to better understand the response of natural populations to stressors. When exposing two different Daphnia pulex genotypes (a cadmium-sensitive and a cadmium-tolerant one) to cadmium, the toxic cyanobacteria Microcystis aeruginosa, and their mixture, we found that observations at the transcriptomic level do not always explain observations at a higher level (growth, reproduction). For example, although cadmium elicited an adverse effect at the organismal level, almost no genes were differentially expressed after cadmium exposure. In addition, we identified oxidative stress and polyunsaturated fatty acid metabolism-related pathways, as well as trypsin and neurexin IV gene-families as candidates for the underlying causes of genotypic differences in tolerance to Microcystis. Furthermore, the whole-genome transcriptomic data of a stressor mixture allowed a better understanding of mixture responses by evaluating interactions between two stressors at the gene-expression level against the independent action baseline model. This approach has indicated that ubiquinone pathway and the MAPK serine-threonine protein kinase and collagens gene-families were enriched with genes showing an interactive effect in expression response to exposure to the mixture of the stressors, while transcription and translation-related pathways and gene-families were mostly related with genotypic differences in interactive responses to this mixture. Collectively, our results indicate that the methods we employed may improve further characterization of the possibilities and limitations of transcriptomics approaches in the adverse outcome pathway framework and in predictions of multistressor effects on natural populations. PMID:24552364

  10. Genome-wide identification, phylogeny and expressional profiles of mitogen activated protein kinase kinase kinase (MAPKKK) gene family in bread wheat (Triticum aestivum L.).

    PubMed

    Wang, Meng; Yue, Hong; Feng, Kewei; Deng, Pingchuan; Song, Weining; Nie, Xiaojun

    2016-08-22

    Mitogen-activated protein kinase kinase kinases (MAPKKKs) are the important components of MAPK cascades, which play the crucial role in plant growth and development as well as in response to diverse stresses. Although this family has been systematically studied in many plant species, little is known about MAPKKK genes in wheat (Triticum aestivum L.), especially those involved in the regulatory network of stress processes. In this study, we identified 155 wheat MAPKKK genes through a genome-wide search method based on the latest available wheat genome information, of which 29 belonged to MEKK, 11 to ZIK and 115 to Raf subfamily, respectively. Then, chromosome localization, gene structure and conserved protein motifs and phylogenetic relationship as well as regulatory network of these TaMAPKKKs were systematically investigated and results supported the prediction. Furthermore, a total of 11 homologous groups between A, B and D sub-genome and 24 duplication pairs among them were detected, which contributed to the expansion of wheat MAPKKK gene family. Finally, the expression profiles of these MAPKKKs during development and under different abiotic stresses were investigated using the RNA-seq data. Additionally, 10 tissue-specific and 4 salt-responsive TaMAPKKK genes were selected to validate their expression level through qRT-PCR analysis. This study for the first time reported the genome organization, evolutionary features and expression profiles of the wheat MAPKKK gene family, which laid the foundation for further functional analysis of wheat MAPKKK genes, and contributed to better understanding the roles and regulatory mechanism of MAPKKKs in wheat.

  11. Genome-wide linkage scan for loci of musical aptitude in Finnish families: evidence for a major locus at 4q22

    PubMed Central

    Pulli, K; Karma, K; Norio, R; Sistonen, P; Göring, H H H; Järvelä, I

    2008-01-01

    Background: Music perception and performance are comprehensive human cognitive functions and thus provide an excellent model system for studying human behaviour and brain function. However, the molecules involved in mediating music perception and performance are so far uncharacterised. Objective: To unravel the biological background of music perception, using molecular and statistical genetic approaches. Methods: 15 Finnish multigenerational families (with a total of 234 family members) were recruited via a nationwide search. The phenotype of all family members was determined using three tests used in defining musical aptitude: a test for auditory structuring ability (Karma Music test; KMT) commonly used in Finland, and the Seashore pitch and time discrimination subtests (SP and ST respectively) used internationally. We calculated heritabilities and performed a genome-wide variance components-based linkage scan using genotype data for 1113 microsatellite markers. Results: The heritability estimates were 42% for KMT, 57% for SP, 21% for ST and 48% for the combined music test scores. Significant evidence of linkage was obtained on chromosome 4q22 (LOD 3.33) and suggestive evidence of linkage at 8q13-21 (LOD 2.29) with the combined music test scores, using variance component linkage analyses. The major contribution of the 4q22 locus was obtained for the KMT (LOD 2.91). Interestingly, a positive LOD score of 1.69 was shown at 18q, a region previously linked to dyslexia (DYX6) using combined music test scores. Conclusion: Our results show that there is a genetic contribution to musical aptitude that is likely to be regulated by several predisposing genes or variants. PMID:18424507

  12. Genome-wide linkage scan for loci of musical aptitude in Finnish families: evidence for a major locus at 4q22.

    PubMed

    Pulli, K; Karma, K; Norio, R; Sistonen, P; Göring, H H H; Järvelä, I

    2008-07-01

    Music perception and performance are comprehensive human cognitive functions and thus provide an excellent model system for studying human behaviour and brain function. However, the molecules involved in mediating music perception and performance are so far uncharacterised. To unravel the biological background of music perception, using molecular and statistical genetic approaches. 15 Finnish multigenerational families (with a total of 234 family members) were recruited via a nationwide search. The phenotype of all family members was determined using three tests used in defining musical aptitude: a test for auditory structuring ability (Karma Music test; KMT) commonly used in Finland, and the Seashore pitch and time discrimination subtests (SP and ST respectively) used internationally. We calculated heritabilities and performed a genome-wide variance components-based linkage scan using genotype data for 1113 microsatellite markers. The heritability estimates were 42% for KMT, 57% for SP, 21% for ST and 48% for the combined music test scores. Significant evidence of linkage was obtained on chromosome 4q22 (LOD 3.33) and suggestive evidence of linkage at 8q13-21 (LOD 2.29) with the combined music test scores, using variance component linkage analyses. The major contribution of the 4q22 locus was obtained for the KMT (LOD 2.91). Interestingly, a positive LOD score of 1.69 was shown at 18q, a region previously linked to dyslexia (DYX6) using combined music test scores. Our results show that there is a genetic contribution to musical aptitude that is likely to be regulated by several predisposing genes or variants.

  13. Genome-wide analysis and identification of stress-responsive genes of the NAM-ATAF1,2-CUC2 transcription factor family in apple.

    PubMed

    Su, Hongyan; Zhang, Shizhong; Yuan, Xiaowei; Chen, Changtian; Wang, Xiao-Fei; Hao, Yu-Jin

    2013-10-01

    NAC (NAM, ATAF1,2, and CUC2) proteins constitute one of the largest families of plant-specific transcription factors. To date, little is known about the NAC genes in the apple (Malus domestica). In this study, a total of 180 NAC genes were identified in the apple genome and were phylogenetically clustered into six groups (I-VI) with the NAC genes from Arabidopsis and rice. The predicted apple NAC genes were distributed across all of 17 chromosomes at various densities. Additionally, the gene structure and motif compositions of the apple NAC genes were analyzed. Moreover, the expression of 29 selected apple NAC genes was analyzed in different tissues and under different abiotic stress conditions. All of the selected genes, with the exception of four genes, were expressed in at least one of the tissues tested, which indicates that the NAC genes are involved in various aspects of the physiological and developmental processes of the apple. Encouragingly, 17 of the selected genes were found to respond to one or more of the abiotic stress treatments, and these 17 genes included not only the expected 7 genes that were clustered with the well-known stress-related marker genes in group IV but also 10 genes located in other subgroups, none of which contains members that have been reported to be stress-related. To the best of our knowledge, this report describes the first genome-wide analysis of the apple NAC gene family, and the results should provide valuable information for understanding the classification and putative functions of this family.

  14. Calcium-dependent protein kinase (CDPK) and CDPK-related kinase (CRK) gene families in tomato: genome-wide identification and functional analyses in disease resistance.

    PubMed

    Wang, Ji-Peng; Xu, You-Ping; Munyampundu, Jean-Pierre; Liu, Tian-Yu; Cai, Xin-Zhong

    2016-04-01

    Calcium-dependent protein kinases (CDPKs) and CDPK-related kinases (CRKs) play multiple roles in plant. Nevertheless, genome-wide identification of these two families is limited to several plant species, and role of CRKs in disease resistance remains unclear. In this study, we identified the CDPK and CRK gene families in genome of the economically important crop tomato (Solanum lycopersicum L.) and analyzed their function in resistance to various pathogens. Twenty-nine CDPK and six CRK genes were identified in tomato genome. Both SlCDPK and SlCRK proteins harbored an STKc_CAMK type protein kinase domain, while only SlCDPKs contained EF-hand type Ca(2+) binding domain(s). Phylogenetic analysis revealed that plant CRK family diverged early from CDPKs, and shared a common ancestor gene with subgroup IV CDPKs. Subgroup IV SlCDPK proteins were basic and their genes contained 11 introns, which were distinguished from other subgroups but similar to CRKs. Subgroup I SlCDPKs generally did not carry an N-terminal myristoylation motif while those of the remaining subgroups and SlCRKs universally did. SlCDPK and SlCRK genes were differently responsive to pathogenic stimuli. Furthermore, silencing analyses demonstrated that SlCDPK18 and SlCDPK10 positively regulated nonhost resistance to Xanthomonas oryzae pv. oryzae and host resistance to Pseudomonas syringae pv. tomato (Pst) DC3000, respectively, while SlCRK6 positively regulated resistance to both Pst DC3000 and Sclerotinia sclerotiorum in tomato. In conclusion, CRKs apparently evolved from CDPK lineage, SlCDPK and SlCRK genes regulate a wide range of resistance and SlCRK6 is the first CRK gene proved to function in plant disease resistance.

  15. Genome-wide scan for hypertension linkage to chromosome 12q23.1 - q23.3 in a Chinese family.

    PubMed

    Dong, Yan; Yang, Lin; Fang, Fang; Zhang, Jing; Jiang, Gengru; Xu, Shijie

    2013-05-01

    Essential hypertension is a multifactorial disorder with a complex phenotype. Here we report a susceptibility locus for the hypertension mapped by a genome-wide microsatellite scanning in an affected Chinese family, in which 11 members had hypertension before the age of 40. A total of 22 individuals from a single family from Shanghai, PR China, were genotyped on more than 400 microsatellite markers with a spacing gap of less than 10 cm for nearly the entire scanned genome. Linkage analysis suggested that an affected disorder is linked to a locus in the chromosome interval 12q23.1 to 12q23.3; two-point parametric analysis showed a logarithm of odds (LOD) score of 2.97 for the marker D12S346 (12q23.1) and 1.40 for the marker D12S78 (12q23.3). Fine mapping and haplotype analysis subsequently confirmed that eight continuous markers (D12Sac023161, D12S1706, D12S346, D12S1588, D12S1607, D12Sac010202, D12S78, D12Sac084356) had positive LOD with a maximum two-point LOD score of 3.34 for the marker D12S1706 and a maximum multi-point LOD score was 2.4002 for D12Sac010202, their NPL scores were 10.9091 for D12S1706 and 10.9114 for D12Sac010202. A novel locus for essential hypertension was identified on chromosome 12q23.1 - q23.3. This finding implies that the region 12q23.1 to 12q23.3 might encompass a susceptible gene that caused hypertension in this Chinese family.

  16. Genome-wide analyses of two families of snoRNA genes from Drosophila melanogaster, demonstrating the extensive utilization of introns for coding of snoRNAs

    PubMed Central

    HUANG, ZHAN-PENG; ZHOU, HUI; HE, HUA-LIANG; CHEN, CHUN-LONG; LIANG, DAN; QU, LIANG-HU

    2005-01-01

    Small nucleolar RNAs (snoRNAs) are an abundant group of noncoding RNAs mainly involved in the post-transcriptional modifications of rRNAs in eukaryotes. In this study, a large-scale genome-wide analysis of the two major families of snoRNA genes in the fruit fly Drosophila melanogaster has been performed using experimental and computational RNomics methods. Two hundred and twelve gene variants, encoding 56 box H/ACA and 63 box C/D snoRNAs, were identified, of which 57 novel snoRNAs have been reported for the first time. These snoRNAs were predicted to guide a total of 147 methylations and pseudouridylations on rRNAs and snRNAs, showing a more comprehensive pattern of rRNA modification in the fruit fly. With the exception of nine, all the snoRNAs identified to date in D. melanogaster are intron encoded. Remarkably, the genomic organization of the snoRNAs is characteristic of 8 dUhg genes and 17 intronic gene clusters, demonstrating that distinct organizations dominate the expression of the two families of snoRNAs in the fruit fly. Of the 267 introns in the host genes, more than half have been identified as host introns for coding of snoRNAs. In contrast to mammals, the variation in size of the host introns is mainly due to differences in the number of snoRNAs they contain. These results demonstrate the extensive utilization of introns for coding of snoRNAs in the host genes and shed light on further research of other noncoding RNA genes in the large introns of the Drosophila genome. PMID:15987805

  17. Genome-wide linkage of obstructive sleep apnoea and high-density lipoprotein cholesterol in a Filipino family: bivariate linkage analysis of obstructive sleep apnoea.

    PubMed

    Relf, Bronwyn L; Larkin, Emma K; De Torres, Carina; Baur, Louise A; Christodoulou, John; Waters, Karen A

    2010-06-01

    Increasing evidence supports an association between obstructive sleep apnoea (OSA) and metabolic syndrome (MeS) in both children and adults, suggesting a genetic component. However, the genetic relationship between the diseases remains unclear. We performed a bivariate linkage scan on a single Filipino family with a high prevalence of OSA and MeS to explore the genetic pathways underlying these diseases. A large rural family (n = 50, 50% adults) underwent a 10-cM genome-wide scan. Fasting blood was used to measure insulin, triglycerides, total cholesterol and high density lipoprotein (HDL) cholesterol. Attended overnight polysomnography was used to quantify the respiratory disturbance index (RDI), a measure of sleep apnoea. Body mass index z-scores and insulin resistance scores were calculated. Bivariate multipoint linkage analyses were performed on RDI and MeS components. OSA prevalence was 46% (n = 23; nine adults, 14 children) in our participants. MeS phenotype was present in 40% of adults (n = 10) and 48% of children (n = 12). Linkage peaks with a logarithm of odds (LOD) score >3 were demonstrated on chromosome 19q13.4 (LOD = 3.04) for the trait pair RDI and HDL cholesterol. Candidate genes identified in this region include the killer cell immunoglobulin-like receptor genes. These genes are associated with modulating inflammatory responses in reaction to cellular stress and initiation of atherosclerotic plaque formation. We have identified a novel locus for genetic links between RDI and lipid factors associated with MeS in a chromosomal region containing genes associated with inflammatory responses.

  18. Genome-Wide Identification and Analysis of the SBP-Box Family Genes under Phytophthora capsici Stress in Pepper (Capsicum annuum L.)

    PubMed Central

    Zhang, Huai-Xia; Jin, Jing-Hao; He, Yu-Mei; Lu, Bo-Ya; Li, Da-Wei; Chai, Wei-Guo; Khan, Abid; Gong, Zhen-Hui

    2016-01-01

    SQUAMOSA promoter binding protein (SBP)-box genes encode plant-specific transcription factors that are extensively involved in many physiological and biochemical processes, including growth, development, and signal transduction. However, pepper (Capsicum annuum L.) SBP-box family genes have not been well characterized. We investigated SBP-box family genes in the pepper genome and characterized these genes across both compatible and incompatible strain of Phytophthora capsici, and also under different hormone treatments. The results indicated that total 15 members were identified and distributed on seven chromosomes of pepper. Phylogenetic analysis showed that SBP-box genes of pepper can be classified into six groups. In addition, duplication analysis within pepper genome, as well as between pepper and Arabidopsis genomes demonstrated that there are four pairs of homology of SBP-box genes in the pepper genome and 10 pairs between pepper and Arabidopsis genomes. Tissue-specific expression analysis of the CaSBP genes demonstrated their diverse spatiotemporal expression patterns. The expression profiles were similarly analyzed following exposure to P. capsici inoculation and hormone treatments. It was shown that nine of the CaSBP genes (CaSBP01, 02, 03, 04, 05, 06, 11, 12, and 13) exhibited a dramatic up-regulation after compatible HX-9 strain (P. capsici) inoculation, while CaSBP09 and CaSBP15 were down-regulated. In case of PC strain (P. capsici) infection six of the CaSBP genes (CaSBP02, 05, 06, 11, 12, and 13) were arose while CaSBP14 was down regulated. Furthermore, Salicylic acid, Methyl jasmonate and their biosynthesis inhibitors treatment indicated that some of the CaSBP genes are potentially involved in these hormone regulation pathways. This genome-wide identification, as well as characterization of evolutionary relationships and expression profiles of the pepper CaSBP genes, will help to improve pepper stress tolerance in the future. PMID:27148327

  19. Genome-Wide Identification of Jatropha curcas Aquaporin Genes and the Comparative Analysis Provides Insights into the Gene Family Expansion and Evolution in Hevea brasiliensis

    PubMed Central

    Zou, Zhi; Yang, Lifu; Gong, Jun; Mo, Yeyong; Wang, Jikun; Cao, Jianhua; An, Feng; Xie, Guishui

    2016-01-01

    Aquaporins (AQPs) are channel-forming integral membrane proteins that transport water and other small solutes across biological membranes. Despite the vital role of AQPs, to date, little is known in physic nut (Jatropha curcas L., Euphorbiaceae), an important non-edible oilseed crop with great potential for the production of biodiesel. In this study, 32 AQP genes were identified from the physic nut genome and the family number is relatively small in comparison to 51 in another Euphorbiaceae plant, rubber tree (Hevea brasiliensis Muell. Arg.). Based on the phylogenetic analysis, the JcAQPs were assigned to five subfamilies, i.e., nine plasma membrane intrinsic proteins (PIPs), nine tonoplast intrinsic proteins (TIPs), eight NOD26-like intrinsic proteins (NIPs), two X intrinsic proteins (XIPs), and four small basic intrinsic proteins (SIPs). Like rubber tree and other plant species, functional prediction based on the aromatic/arginine selectivity filter, Froger's positions, and specificity-determining positions showed a remarkable difference in substrate specificity among subfamilies of JcAQPs. Genome-wide comparative analysis revealed the specific expansion of PIP and TIP subfamilies in rubber tree and the specific gene loss of the XIP subfamily in physic nut. Furthermore, by analyzing deep transcriptome sequencing data, the expression evolution especially the expression divergence of duplicated HbAQP genes was also investigated and discussed. Results obtained from this study not only provide valuable information for future functional analysis and utilization of Jc/HbAQP genes, but also provide a useful reference to survey the gene family expansion and evolution in Euphorbiaceae plants and other plant species. PMID:27066041

  20. Genome-Wide Analysis of Nucleosome Positions, Occupancy, and Accessibility in Yeast: Nucleosome Mapping, High-Resolution Histone ChIP, and NCAM.

    PubMed

    Rodriguez, Jairo; McKnight, Jeffrey N; Tsukiyama, Toshio

    2014-10-01

    Because histones bind DNA very tightly, the location on DNA and the level of occupancy of a given DNA sequence by nucleosomes can profoundly affect accessibility of non-histone proteins to chromatin, affecting virtually all DNA-dependent processes, such as transcription, DNA repair, DNA replication and recombination. Therefore, it is often necessary to determine positions and occupancy of nucleosomes to understand how DNA-dependent processes are regulated. Recent technological advances made such analyses feasible on a genome-wide scale at high resolution. In addition, we have recently developed a method to measure nuclease accessibility of nucleosomes on a global scale. This unit describes methods to map nucleosome positions, to determine nucleosome density, and to determine nuclease accessibility of nucleosomes using deep sequencing.

  1. Sex-limited genome-wide linkage scan for body mass index in an unselected sample of 933 Australian twin families.

    PubMed

    Cornes, Belinda K; Medland, Sarah E; Ferreira, Manuel A R; Morley, Katherine I; Duffy, David L; Heijmans, Bastiaan T; Montgomery, Grant W; Martin, Nicholas G

    2005-12-01

    Genes involved in pathways regulating body weight may operate differently in men and women. To determine whether sex-limited genes influence the obesity-related phenotype body mass index (BMI), we have conducted a general nonscalar sex-limited genome-wide linkage scan using variance components analysis in Mx (Neale, 2002). BMI measurements and genotypic data were available for 2053 Australian female and male adult twins and their siblings from 933 families. Clinical measures of BMI were available for 64.4% of these individuals, while only self-reported measures were available for the remaining participants. The mean age of participants was 39.0 years of age (SD 12.1 years). The use of a sex-limited linkage model identified areas on the genome where quantitative trait loci (QTL) effects differ between the sexes, particularly on chromosome 8 and 20, providing us with evidence that some of the genes responsible for BMI may have different effects in men and women. Our highest linkage peak was observed at 12q24 (-log10p = 3.02), which was near the recommended threshold for suggestive linkage (-log10p = 3.13). Previous studies have found evidence for a quantitative trait locus on 12q24 affecting BMI in a wide range of populations, and candidate genes for noninsulin-dependent diabetes mellitus, a consequence of obesity, have also been mapped to this region. We also identified many peaks near a -log10p of 2 (threshold for replicating an existing finding) in many areas across the genome that are within regions previously identified by other studies, as well as in locations that harbor genes known to influence weight regulation.

  2. Genome-wide identification and characterization of MADS-box family genes related to organ development and stress resistance in Brassica rapa.

    PubMed

    Saha, Gopal; Park, Jong-In; Jung, Hee-Jeong; Ahmed, Nasar Uddin; Kayum, Md Abdul; Chung, Mi-Young; Hur, Yoonkang; Cho, Yong-Gu; Watanabe, Masao; Nou, Ill-Sup

    2015-03-14

    MADS-box transcription factors (TFs) are important in floral organ specification as well as several other aspects of plant growth and development. Studies on stress resistance-related functions of MADS-box genes are very limited and no such functional studies in Brassica rapa have been reported. To gain insight into this gene family and to elucidate their roles in organ development and stress resistance, we performed genome-wide identification, characterization and expression analysis of MADS-box genes in B. rapa. Whole-genome survey of B. rapa revealed 167 MADS-box genes, which were categorized into type I (Mα, Mβ and Mγ) and type II (MIKC(c) and MIKC*) based on phylogeny, protein motif structure and exon-intron organization. Expression analysis of 89 MIKC(c) and 11 MIKC* genes was then carried out. In addition to those with floral and vegetative tissue expression, we identified MADS-box genes with constitutive expression patterns at different stages of flower development. More importantly, from a low temperature-treated whole-genome microarray data set, 19 BrMADS genes were found to show variable transcript abundance in two contrasting inbred lines of B. rapa. Among these, 13 BrMADS genes were further validated and their differential expression was monitored in response to cold stress in the same two lines via qPCR expression analysis. Additionally, the set of 19 BrMADS genes was analyzed under drought and salt stress, and 8 and 6 genes were found to be induced by drought and salt, respectively. The extensive annotation and transcriptome profiling reported in this study will be useful for understanding the involvement of MADS-box genes in stress resistance in addition to their growth and developmental functions, which ultimately provides the basis for functional characterization and exploitation of the candidate genes for genetic engineering of B. rapa.

  3. Genome-wide investigation and expression analysis suggest diverse roles and genetic redundancy of Pht1 family genes in response to Pi deficiency in tomato

    PubMed Central

    2014-01-01

    Background Phosphorus (P) deficiency is one of the major nutrient stresses limiting plant growth. The uptake of P by plants is well considered to be mediated by a number of high-affinity phosphate (Pi) transporters belonging to the Pht1 family. Although the Pht1 genes have been extensively identified in several plant species, there is a lack of systematic analysis of the Pht1 gene family in any solanaceous species thus far. Results Here, we report the genome-wide analysis, phylogenetic evolution and expression patterns of the Pht1 genes in tomato (Solanum lycopersicum). A total of eight putative Pht1 genes (LePT1 to 8), distributed on three chromosomes (3, 6 and 9), were identified through extensive searches of the released tomato genome sequence database. Chromosomal organization and phylogenetic tree analysis suggested that the six Pht1 paralogues, LePT1/3, LePT2/6 and LePT4/5, which were assigned into three pairs with very close physical distance, were produced from recent tandem duplication events that occurred after Solanaceae splitting with other dicot families. Expression analysis of these Pht1 members revealed that except LePT8, of which the transcript was undetectable in all tissues, the other seven paralogues showed differential but partial-overlapping expression patterns. LePT1 and LePT7 were ubiquitously expressed in all tissues examined, and their transcripts were induced abundantly in response to Pi starvation; LePT2 and LePT6, the two paralogues harboring identical coding sequence, were predominantly expressed in Pi-deficient roots; LePT3, LePT4 and LePT5 were strongly activated in the roots colonized by arbuscular mycorrhizal fungi under low-P, but not high-P condition. Histochemical analysis revealed that a 1250-bp LePT3 promoter fragment and a 471-bp LePT5 promoter fragment containing the two elements, MYCS and P1BS, were sufficient to direct the GUS reporter expression in mycorrhizal roots and were limited to distinct cells harboring AM fungal

  4. Improved genome-wide localization by ChIP-chip using double-round T7 RNA polymerase-based amplification.

    PubMed

    van Bakel, Harm; van Werven, Folkert J; Radonjic, Marijana; Brok, Mariel O; van Leenen, Dik; Holstege, Frank C P; Timmers, H T Marc

    2008-03-01

    Chromatin immunoprecipitation combined with DNA microarrays (ChIP-chip) is a powerful technique to detect in vivo protein-DNA interactions. Due to low yields, ChIP assays of transcription factors generally require amplification of immunoprecipitated genomic DNA. Here, we present an adapted linear amplification method that involves two rounds of T7 RNA polymerase amplification (double-T7). Using this we could successfully amplify as little as 0.4 ng of ChIP DNA to sufficient amounts for microarray analysis. In addition, we compared the double-T7 method to the ligation-mediated polymerase chain reaction (LM-PCR) method in a ChIP-chip of the yeast transcription factor Gsm1p. The double-T7 protocol showed lower noise levels and stronger binding signals compared to LM-PCR. Both LM-PCR and double-T7 identified strongly bound genomic regions, but the double-T7 method increased sensitivity and specificity to allow detection of weaker binding sites.

  5. Principal transcriptional regulation and genome-wide system interactions of the Asp-family and aromatic amino acid networks of amino acid metabolism in plants.

    PubMed

    Less, Hadar; Angelovici, Ruthie; Tzin, Vered; Galili, Gad

    2010-10-01

    Amino acid metabolism is among the most important and best recognized networks within biological systems. In plants, amino acids serve multiple functions associated with growth. Besides their function in protein synthesis, the amino acids are also catabolized into energy-associated metabolites as well we into numerous secondary metabolites, which are essential for plant growth and response to various stresses. Despite the central importance of amino acids in plants growth, elucidation of the regulation of amino acid metabolism within the context of the entire system, particularly transcriptional regulation, is still in its infancy. The different amino acids are synthesized by a number of distinct metabolic networks, which are expected to possess regulatory cross interactions between them for proper coordination of their interactive functions, such as incorporation into proteins. Yet, individual amino acid metabolic networks are also expected to differentially cross interact with various genome-wide gene expression programs and metabolic networks, in respect to their functions as precursors for various metabolites with distinct functions. In the present review, we discuss our recent genomics, metabolic and bioinformatics studies, which were aimed at addressing these questions, focusing mainly on the Asp-family metabolic network as the main example and also comparing it to the aromatic amino acids metabolic network as a second example (Angelovici et al. in Plant Physiol 151:2058-2072, 2009; Less and Galili in BMC Syst Biol 3:14, 2009; Tzin et al. in Plant J 60:156-167, 2009). Our focus on these two networks is because of the followings: (i) both networks are central to plant metabolism and growth and are also precursors for a wide range of primary and secondary metabolites that are indispensable to plant growth; (ii) the amino acids produced by these two networks are also essential to the nutrition and health of human and farm animals; and (iii) both networks contain

  6. Genome-Wide Transcriptome Analysis of Two Contrasting Brassica rapa Doubled Haploid Lines under Cold-Stresses Using Br135K Oligomeric Chip

    PubMed Central

    Park, Jong-In; Thamilarasan, Senthil Kumar; Lee, Sang Sook; Kim, Yeon-Ki; Lim, Yong-Pyo; Nou, Ill-Sup; Hur, Yoonkang

    2014-01-01

    Genome wide transcription analysis in response to stresses is important to provide a basis of effective engineering strategies to improve stress tolerance in crop plants. We assembled a Brassica rapa oligomeric microarray (Br135K microarray) using sequence information from 41,173 unigenes and analyzed the transcription profiles of two contrasting doubled haploid (DH) lines, Chiifu and Kenshin, under cold-treatments. The two DH lines showed great differences in electrolyte leakage below −4°C, but similar patterns from 4°C to −2°C. Cold-treatments induced 885 and 858 genes in Chiifu and Kenshin, respectively. Overall, 134, and 56 genes showed an intrinsic difference in expression in Chiifu and Kenshin, respectively. Among 5,349 genes that showed no hit found (NHF) in public databases, 61 and 24 were specifically expressed in Chiifu and Kenshin, respectively. Many transcription factor genes (TFs) also showed various characteristics of expression. BrMYB12, BrMYBL2, BrbHLHs, BrbHLH038, a C2H2, a WRKY, BrDREB19 and a integrase-type TF were induced in a Chiifu-specific fashion, while a bHLH (Bra001826/AT3G21330), bHLH, cycling Dof factor and two Dof type TFs were Kenshin specific. Similar to previous studies, a large number of genes were differently induced or regulated among the two genotypes, but many genes, including NHFs, were specifically or intrinsically expressed with genotype specificity. Expression patterns of known-cold responsive genes in plants resulted in discrepancy to membrane leakage in the two DH lines, indicating that timing of gene expression is more important to conferring freezing tolerance rather than expression levels. Otherwise, the tolerance will be related to the levels of transcripts before cold-treatment or regulated by other mechanisms. Overall, these results indicate common signaling pathways and various transcriptional regulatory mechanisms are working together during cold-treatment of B. rapa. Our newly developed Br135K oligomeric

  7. A Discovery Genome-Wide Association Study of Entrepreneurship

    ERIC Educational Resources Information Center

    Quaye, Lydia; Nicolaou, Nicos; Shane, Scott; Mangino, Massimo

    2012-01-01

    To identify specific genetic variants influencing the phenotype of entrepreneurship, we conducted a genome-wide association study (GWAS) with 3,933 Caucasian females from the TwinsUK Adult Twin Registry. Following stringent genotype quality control, GWAF (genome-wide association analyses for family data) software was used to assess the association…

  8. Genetic variation at the CELF1 (CUGBP, elav-like family member 1 gene) locus is genome-wide associated with Alzheimer's disease and obesity.

    PubMed

    Hinney, Anke; Albayrak, Ozgür; Antel, Jochen; Volckmar, Anna-Lena; Sims, Rebecca; Chapman, Jade; Harold, Denise; Gerrish, Amy; Heid, Iris M; Winkler, Thomas W; Scherag, André; Wiltfang, Jens; Williams, Julie; Hebebrand, Johannes

    2014-06-01

    Deviations from normal body weight are observed prior to and after the onset of Alzheimer's disease (AD). Midlife obesity confers increased AD risk in later life, whereas late-life obesity is associated with decreased AD risk. The role of underweight and weight loss for AD risk is controversial. Based on the hypothesis of shared genetic variants for both obesity and AD, we analyzed the variants identified for AD or obesity from genome-wide association meta-analyses of the GERAD (AD, cases = 6,688, controls = 13,685) and GIANT (body mass index [BMI] as measure of obesity, n = 123,865) consortia. Our cross-disorder analysis of genome-wide significant 39 obesity SNPs and 23 AD SNPs in these two large data sets revealed that: (1) The AD SNP rs10838725 (pAD  = 1.1 × 10(-08)) at the locus CELF1 is also genome-wide significant for obesity (pBMI  = 7.35 × 10(-09) ). (2) Four additional AD risk SNPs were nominally associated with obesity (rs17125944 at FERMT2, pBMI  = 4.03 × 10(-05), pBMI corr  = 2.50 × 10(-03) ; rs3851179 at PICALM; pBMI  = 0.002, rs2075650 at TOMM40/APOE, pBMI  = 0.024, rs3865444 at CD33, pBMI  = 0.024). (3) SNPs at two of the obesity risk loci (rs4836133 downstream of ZNF608; pAD  = 0.002 and at rs713586 downstream of RBJ/DNAJC27; pAD  = 0.018) were nominally associated with AD risk. Additionally, among the SNPs used for confirmation in both studies the AD risk allele of rs1858973, with an AD association just below genome-wide significance (pAD  = 7.20 × 10(-07)), was also associated with obesity (SNP at IQCK/GPRC5B; pBMI  = 5.21 × 10(-06) ; pcorr  = 3.24 × 10(-04)). Our first GWAS based cross-disorder analysis for AD and obesity suggests that rs10838725 at the locus CELF1 might be relevant for both disorders.

  9. Genome-Wide Analysis of Oleosin Gene Family in 22 Tree Species: An Accelerator for Metabolic Engineering of BioFuel Crops and Agrigenomics Industrial Applications?

    PubMed Central

    2015-01-01

    Abstract Trees contribute to enormous plant oil reserves because many trees contain 50%–80% of oil (triacylglycerols, TAGs) in the fruits and kernels. TAGs accumulate in subcellular structures called oil bodies/droplets, in which TAGs are covered by low-molecular-mass hydrophobic proteins called oleosins (OLEs). The OLEs/TAGs ratio determines the size and shape of intracellular oil bodies. There is a lack of comprehensive sequence analysis and structural information of OLEs among diverse trees. The objectives of this study were to identify OLEs from 22 tree species (e.g., tung tree, tea-oil tree, castor bean), perform genome-wide analysis of OLEs, classify OLEs, identify conserved sequence motifs and amino acid residues, and predict secondary and three-dimensional structures in tree OLEs and OLE subfamilies. Data mining identified 65 OLEs with perfect conservation of the “proline knot” motif (PX5SPX3P) from 19 trees. These OLEs contained >40% hydrophobic amino acid residues. They displayed similar properties and amino acid composition. Genome-wide phylogenetic analysis and multiple sequence alignment demonstrated that these proteins could be classified into five OLE subfamilies. There were distinct patterns of sequence conservation among the OLE subfamilies and within individual tree species. Computational modeling indicated that OLEs were composed of at least three α-helixes connected with short coils without any β-strand and that they exhibited distinct 3D structures and ligand binding sites. These analyses provide fundamental information in the similarity and specificity of diverse OLE isoforms within the same subfamily and among the different species, which should facilitate studying the structure-function relationship and identify critical amino acid residues in OLEs for metabolic engineering of tree TAGs. PMID:26258573

  10. Genome-Wide Analysis of Oleosin Gene Family in 22 Tree Species: An Accelerator for Metabolic Engineering of BioFuel Crops and Agrigenomics Industrial Applications?

    PubMed

    Cao, Heping

    2015-09-01

    Trees contribute to enormous plant oil reserves because many trees contain 50%-80% of oil (triacylglycerols, TAGs) in the fruits and kernels. TAGs accumulate in subcellular structures called oil bodies/droplets, in which TAGs are covered by low-molecular-mass hydrophobic proteins called oleosins (OLEs). The OLEs/TAGs ratio determines the size and shape of intracellular oil bodies. There is a lack of comprehensive sequence analysis and structural information of OLEs among diverse trees. The objectives of this study were to identify OLEs from 22 tree species (e.g., tung tree, tea-oil tree, castor bean), perform genome-wide analysis of OLEs, classify OLEs, identify conserved sequence motifs and amino acid residues, and predict secondary and three-dimensional structures in tree OLEs and OLE subfamilies. Data mining identified 65 OLEs with perfect conservation of the "proline knot" motif (PX5SPX3P) from 19 trees. These OLEs contained >40% hydrophobic amino acid residues. They displayed similar properties and amino acid composition. Genome-wide phylogenetic analysis and multiple sequence alignment demonstrated that these proteins could be classified into five OLE subfamilies. There were distinct patterns of sequence conservation among the OLE subfamilies and within individual tree species. Computational modeling indicated that OLEs were composed of at least three α-helixes connected with short coils without any β-strand and that they exhibited distinct 3D structures and ligand binding sites. These analyses provide fundamental information in the similarity and specificity of diverse OLE isoforms within the same subfamily and among the different species, which should facilitate studying the structure-function relationship and identify critical amino acid residues in OLEs for metabolic engineering of tree TAGs.

  11. Genome-Wide Identification and Analysis of the VQ Motif-Containing Protein Family in Chinese Cabbage (Brassica rapa L. ssp. Pekinensis).

    PubMed

    Zhang, Gaoyuan; Wang, Fengde; Li, Jingjuan; Ding, Qian; Zhang, Yihui; Li, Huayin; Zhang, Jiannong; Gao, Jianwei

    2015-12-02

    Previous studies have showed that the VQ motif-containing proteins in Arabidopsis thaliana and Oryza sativa play an important role in plant growth, development, and stress responses. However, little is known about the functions of the VQ genes in Brassica rapa (Chinese cabbage). In this study, we performed genome-wide identification, characterization, and expression analysis of the VQ genes in Chinese cabbage, especially under adverse environment. We identified 57 VQ genes and classified them into seven subgroups (I-VII), which were dispersedly distributed on chromosomes 1 to 10. The expansion of these genes mainly contributed to segmental and tandem duplication. Fifty-four VQ genes contained no introns and 50 VQ proteins were less than 300 amino acids in length. Quantitative real-time PCR showed that the VQ genes were differentially expressed in various tissues and during different abiotic stresses and plant hormone treatments. This study provides a comprehensive overview of Chinese cabbage VQ genes and will benefit the molecular breeding for resistance to stresses and disease, as well as further studies on the biological functions of the VQ proteins.

  12. The AGPase Family Proteins in Banana: Genome-Wide Identification, Phylogeny, and Expression Analyses Reveal Their Involvement in the Development, Ripening, and Abiotic/Biotic Stress Responses

    PubMed Central

    Miao, Hongxia; Sun, Peiguang; Liu, Qing; Liu, Juhua; Xu, Biyu; Jin, Zhiqiang

    2017-01-01

    ADP-glucose pyrophosphorylase (AGPase) is the first rate-limiting enzyme in starch biosynthesis and plays crucial roles in multiple biological processes. Despite its importance, AGPase is poorly studied in starchy fruit crop banana (Musa acuminata L.). In this study, eight MaAGPase genes have been identified genome-wide in M. acuminata, which could be clustered into the large (APL) and small (APS) subunits. Comprehensive transcriptomic analysis revealed temporal and spatial expression variations of MaAPLs and MaAPSs and their differential responses to abiotic/biotic stresses in two banana genotypes, Fen Jiao (FJ) and BaXi Jiao (BX). MaAPS1 showed generally high expression at various developmental and ripening stages and in response to abiotic/biotic stresses in both genotypes. MaAPL-3 and -2a were specifically induced by abiotic stresses including cold, salt, and drought, as well as by fungal infection in FJ, but not in BX. The presence of hormone-related and stress-relevant cis-acting elements in the promoters of MaAGPase genes suggests that MaAGPases may play an important role in multiple biological processes. Taken together, this study provides new insights into the complex transcriptional regulation of AGPases, underlying their key roles in promoting starch biosynthesis and enhancing stress tolerance in banana. PMID:28757545

  13. The AGPase Family Proteins in Banana: Genome-Wide Identification, Phylogeny, and Expression Analyses Reveal Their Involvement in the Development, Ripening, and Abiotic/Biotic Stress Responses.

    PubMed

    Miao, Hongxia; Sun, Peiguang; Liu, Qing; Liu, Juhua; Xu, Biyu; Jin, Zhiqiang

    2017-07-25

    ADP-glucose pyrophosphorylase (AGPase) is the first rate-limiting enzyme in starch biosynthesis and plays crucial roles in multiple biological processes. Despite its importance, AGPase is poorly studied in starchy fruit crop banana (Musa acuminata L.). In this study, eight MaAGPase genes have been identified genome-wide in M. acuminata, which could be clustered into the large (APL) and small (APS) subunits. Comprehensive transcriptomic analysis revealed temporal and spatial expression variations of MaAPLs and MaAPSs and their differential responses to abiotic/biotic stresses in two banana genotypes, Fen Jiao (FJ) and BaXi Jiao (BX). MaAPS1 showed generally high expression at various developmental and ripening stages and in response to abiotic/biotic stresses in both genotypes. MaAPL-3 and -2a were specifically induced by abiotic stresses including cold, salt, and drought, as well as by fungal infection in FJ, but not in BX. The presence of hormone-related and stress-relevant cis-acting elements in the promoters of MaAGPase genes suggests that MaAGPases may play an important role in multiple biological processes. Taken together, this study provides new insights into the complex transcriptional regulation of AGPases, underlying their key roles in promoting starch biosynthesis and enhancing stress tolerance in banana.

  14. Genome-Wide Identification and Analysis of the VQ Motif-Containing Protein Family in Chinese Cabbage (Brassica rapa L. ssp. Pekinensis)

    PubMed Central

    Zhang, Gaoyuan; Wang, Fengde; Li, Jingjuan; Ding, Qian; Zhang, Yihui; Li, Huayin; Zhang, Jiannong; Gao, Jianwei

    2015-01-01

    Previous studies have showed that the VQ motif–containing proteins in Arabidopsis thaliana and Oryza sativa play an important role in plant growth, development, and stress responses. However, little is known about the functions of the VQ genes in Brassica rapa (Chinese cabbage). In this study, we performed genome-wide identification, characterization, and expression analysis of the VQ genes in Chinese cabbage, especially under adverse environment. We identified 57 VQ genes and classified them into seven subgroups (I–VII), which were dispersedly distributed on chromosomes 1 to 10. The expansion of these genes mainly contributed to segmental and tandem duplication. Fifty-four VQ genes contained no introns and 50 VQ proteins were less than 300 amino acids in length. Quantitative real-time PCR showed that the VQ genes were differentially expressed in various tissues and during different abiotic stresses and plant hormone treatments. This study provides a comprehensive overview of Chinese cabbage VQ genes and will benefit the molecular breeding for resistance to stresses and disease, as well as further studies on the biological functions of the VQ proteins. PMID:26633387

  15. Review of statistical methodologies for the detection of parent-of-origin effects in family trio genome-wide association data with binary disease traits.

    PubMed

    Connolly, Siobhan; Heron, Elizabeth A

    2015-05-01

    The detection of parent-of-origin effects aims to identify whether the functionality of alleles, and in turn associated phenotypic traits, depends on the parental origin of the alleles. Different parent-of-origin effects have been identified through a variety of mechanisms and a number of statistical methodologies for their detection have been proposed, in particular for genome-wide association studies (GWAS). GWAS have had limited success in explaining the heritability of many complex disorders and traits, but successful identification of parent-of-origin effects using trio (mother, father and offspring) GWAS may help shed light on this missing heritability. However, it is important to choose the most appropriate parent-of-origin test or methodology, given knowledge of the phenotype, amount of available data and the type of parent-of-origin effect(s) being considered. This review brings together the parent-of-origin detection methodologies available, comparing them in terms of power and type I error for a number of different simulated data scenarios, and finally offering guidance as to the most appropriate choice for the different scenarios.

  16. Genome-Wide Copy Number Variation Analysis in Extended Families and Unrelated Individuals Characterized for Musical Aptitude and Creativity in Music

    PubMed Central

    Oikkonen, Jaana; Buck, Gemma; Blancher, Christine; Raijas, Pirre; Karma, Kai; Lähdesmäki, Harri; Järvelä, Irma

    2013-01-01

    Music perception and practice represent complex cognitive functions of the human brain. Recently, evidence for the molecular genetic background of music related phenotypes has been obtained. In order to further elucidate the molecular background of musical phenotypes we analyzed genome wide copy number variations (CNVs) in five extended pedigrees and in 172 unrelated subjects characterized for musical aptitude and creative functions in music. Musical aptitude was defined by combination of the scores of three music tests (COMB scores): auditory structuring ability, Seashores test for pitch and for time. Data on creativity in music (herein composing, improvising and/or arranging music) was surveyed using a web-based questionnaire. Several CNVRs containing genes that affect neurodevelopment, learning and memory were detected. A deletion at 5q31.1 covering the protocadherin-α gene cluster (Pcdha 1-9) was found co-segregating with low music test scores (COMB) in both sample sets. Pcdha is involved in neural migration, differentiation and synaptogenesis. Creativity in music was found to co-segregate with a duplication covering glucose mutarotase gene (GALM) at 2p22. GALM has influence on serotonin release and membrane trafficking of the human serotonin transporter. Interestingly, genes related to serotonergic systems have been shown to associate not only with psychiatric disorders but also with creativity and music perception. Both, Pcdha and GALM, are related to the serotonergic systems influencing cognitive and motor functions, important for music perception and practice. Finally, a 1.3 Mb duplication was identified in a subject with low COMB scores in the region previously linked with absolute pitch (AP) at 8q24. No differences in the CNV burden was detected among the high/low music test scores or creative/non-creative groups. In summary, CNVs and genes found in this study are related to cognitive functions. Our result suggests new candidate genes for music

  17. Genome-wide identification and characterization of the apple (Malus domestica) HECT ubiquitin-protein ligase family and expression analysis of their responsiveness to abiotic stresses.

    PubMed

    Xu, Jianing; Xing, Shanshan; Cui, Haoran; Chen, Xuesen; Wang, Xiaoyun

    2016-04-01

    The ubiquitin-protein ligases (E3s) directly participate in ubiquitin (Ub) transferring to the target proteins in the ubiquitination pathway. The HECT ubiquitin-protein ligase (UPL), one type of E3s, is characterized as containing a conserved HECT domain of approximately 350 amino acids in the C terminus. Some UPLs were found to be involved in trichome development and leaf senescence in Arabidopsis. However, studies on plant UPLs, such as characteristics of the protein structure, predicted functional motifs of the HECT domain, and the regulatory expression of UPLs have all been limited. Here, we present genome-wide identification of the genes encoding UPLs (HECT gene) in apple. The 13 genes (named as MdUPL1-MdUPL13) from ten different chromosomes were divided into four groups by phylogenetic analysis. Among these groups, the encoding genes in the intron-exon structure and the included additional functional domains were quite different. Notably, the F-box domain was first found in MdUPL7 in plant UPLs. The HECT domain in different MdUPL groups also presented different spatial features and three types of conservative motifs were identified. The promoters of each MdUPL member carried multiple stress-response related elements by cis-acting element analysis. Experimental results demonstrated that the expressions of several MdUPLs were quite sensitive to cold-, drought-, and salt-stresses by qRT-PCR assay. The results of this study helped to elucidate the functions of HECT proteins, especially in Rosaceae plants.

  18. Genome-wide copy number variation analysis in extended families and unrelated individuals characterized for musical aptitude and creativity in music.

    PubMed

    Ukkola-Vuoti, Liisa; Kanduri, Chakravarthi; Oikkonen, Jaana; Buck, Gemma; Blancher, Christine; Raijas, Pirre; Karma, Kai; Lähdesmäki, Harri; Järvelä, Irma

    2013-01-01

    Music perception and practice represent complex cognitive functions of the human brain. Recently, evidence for the molecular genetic background of music related phenotypes has been obtained. In order to further elucidate the molecular background of musical phenotypes we analyzed genome wide copy number variations (CNVs) in five extended pedigrees and in 172 unrelated subjects characterized for musical aptitude and creative functions in music. Musical aptitude was defined by combination of the scores of three music tests (COMB scores): auditory structuring ability, Seashores test for pitch and for time. Data on creativity in music (herein composing, improvising and/or arranging music) was surveyed using a web-based questionnaire.Several CNVRs containing genes that affect neurodevelopment, learning and memory were detected. A deletion at 5q31.1 covering the protocadherin-α gene cluster (Pcdha 1-9) was found co-segregating with low music test scores (COMB) in both sample sets. Pcdha is involved in neural migration, differentiation and synaptogenesis. Creativity in music was found to co-segregate with a duplication covering glucose mutarotase gene (GALM) at 2p22. GALM has influence on serotonin release and membrane trafficking of the human serotonin transporter. Interestingly, genes related to serotonergic systems have been shown to associate not only with psychiatric disorders but also with creativity and music perception. Both, Pcdha and GALM, are related to the serotonergic systems influencing cognitive and motor functions, important for music perception and practice. Finally, a 1.3 Mb duplication was identified in a subject with low COMB scores in the region previously linked with absolute pitch (AP) at 8q24. No differences in the CNV burden was detected among the high/low music test scores or creative/non-creative groups. In summary, CNVs and genes found in this study are related to cognitive functions. Our result suggests new candidate genes for music perception

  19. A Genome-Wide Association Study for Culm Cellulose Content in Barley Reveals Candidate Genes Co-Expressed with Members of the CELLULOSE SYNTHASE A Gene Family

    PubMed Central

    Houston, Kelly; Burton, Rachel A.; Sznajder, Beata; Rafalski, Antoni J.; Dhugga, Kanwarpal S.; Mather, Diane E.; Taylor, Jillian; Steffenson, Brian J.; Waugh, Robbie; Fincher, Geoffrey B.

    2015-01-01

    Cellulose is a fundamentally important component of cell walls of higher plants. It provides a scaffold that allows the development and growth of the plant to occur in an ordered fashion. Cellulose also provides mechanical strength, which is crucial for both normal development and to enable the plant to withstand both abiotic and biotic stresses. We quantified the cellulose concentration in the culm of 288 two – rowed and 288 six – rowed spring type barley accessions that were part of the USDA funded barley Coordinated Agricultural Project (CAP) program in the USA. When the population structure of these accessions was analysed we identified six distinct populations, four of which we considered to be comprised of a sufficient number of accessions to be suitable for genome-wide association studies (GWAS). These lines had been genotyped with 3072 SNPs so we combined the trait and genetic data to carry out GWAS. The analysis allowed us to identify regions of the genome containing significant associations between molecular markers and cellulose concentration data, including one region cross-validated in multiple populations. To identify candidate genes we assembled the gene content of these regions and used these to query a comprehensive RNA-seq based gene expression atlas. This provided us with gene annotations and associated expression data across multiple tissues, which allowed us to formulate a supported list of candidate genes that regulate cellulose biosynthesis. Several regions identified by our analysis contain genes that are co-expressed with CELLULOSE SYNTHASE A (HvCesA) across a range of tissues and developmental stages. These genes are involved in both primary and secondary cell wall development. In addition, genes that have been previously linked with cellulose synthesis by biochemical methods, such as HvCOBRA, a gene of unknown function, were also associated with cellulose levels in the association panel. Our analyses provide new insights into the

  20. Genome-wide analysis and expression profiling under heat and drought treatments of HSP70 gene family in soybean (Glycine max L.).

    PubMed

    Zhang, Ling; Zhao, Hong-Kun; Dong, Qian-Li; Zhang, Yuan-Yu; Wang, Yu-Min; Li, Hai-Yun; Xing, Guo-Jie; Li, Qi-Yun; Dong, Ying-Shan

    2015-01-01

    Heat shock proteins (HSPs) perform a fundamental role in protecting plants against abiotic stresses. Previous studies have made great efforts in the functional analysis of individual family members, but there has not yet been an overall analysis or expression profiling of the HSP70 gene family in soybeans (Glycine max L.). In this study, an investigation of the soybean genome revealed 61 putative HSP70 genes, which were evaluated. These genes were classified into eight sub-families, denoted I-VIII, based on a phylogenetic analysis. In each sub-family, the constituent parts of the gene structure and motif were relatively conserved. These GmHSP70 genes were distributed unequally on 17 of the 20 chromosomes. The analysis of the expression profiles showed that 53 of the 61 GmHSP70 genes were differentially expressed across the 14 tissues. However, most of the GmHSP70s were differentially expressed in a tissue-specific expression pattern. Furthermore, the expression of some of the duplicate genes was partially redundant, while others showed functional diversity. The quantitative real-time PCR (qRT-PCR) analysis of the 61 soybean HSP70 genes confirmed their stress-inducible expression patterns under both drought and heat stress. These findings provide a thorough overview of the evolution and modification of the GmHSP70 gene family, which will help to determine the functional characteristics of the HSP70 genes in soybean growth and development.

  1. Genome-wide analysis and expression profiling under heat and drought treatments of HSP70 gene family in soybean (Glycine max L.)

    PubMed Central

    Zhang, Ling; Zhao, Hong-Kun; Dong, Qian-Li; Zhang, Yuan-Yu; Wang, Yu-Min; Li, Hai-Yun; Xing, Guo-Jie; Li, Qi-Yun; Dong, Ying-Shan

    2015-01-01

    Heat shock proteins (HSPs) perform a fundamental role in protecting plants against abiotic stresses. Previous studies have made great efforts in the functional analysis of individual family members, but there has not yet been an overall analysis or expression profiling of the HSP70 gene family in soybeans (Glycine max L.). In this study, an investigation of the soybean genome revealed 61 putative HSP70 genes, which were evaluated. These genes were classified into eight sub-families, denoted I–VIII, based on a phylogenetic analysis. In each sub-family, the constituent parts of the gene structure and motif were relatively conserved. These GmHSP70 genes were distributed unequally on 17 of the 20 chromosomes. The analysis of the expression profiles showed that 53 of the 61 GmHSP70 genes were differentially expressed across the 14 tissues. However, most of the GmHSP70s were differentially expressed in a tissue-specific expression pattern. Furthermore, the expression of some of the duplicate genes was partially redundant, while others showed functional diversity. The quantitative real-time PCR (qRT-PCR) analysis of the 61 soybean HSP70 genes confirmed their stress-inducible expression patterns under both drought and heat stress. These findings provide a thorough overview of the evolution and modification of the GmHSP70 gene family, which will help to determine the functional characteristics of the HSP70 genes in soybean growth and development. PMID:26442082

  2. Genome-wide identification and expression profiling of the cystatin gene family in apple (Malus × domestica Borkh.).

    PubMed

    Tan, Yanxiao; Wang, Suncai; Liang, Dong; Li, Mingjun; Ma, Fengwang

    2014-06-01

    Cystatins or phytocystatins (PhyCys) comprise a family of plant-specific inhibitors of cysteine proteinases. Such inhibitors are thought to be involved in the regulation of several endogenous processes as well as defense against biotic or abiotic stresses. However, information about this family is limited in apple. We identified 26 PhyCys genes within the entire apple genome. They were clustered into three distinct groups distributed across several chromosomes. All of their putative proteins contained one or two typical cystatin domains, which shared the characteristic motifs of PhyCys. Eight selected genes displayed differential expression patterns in various tissues. Moreover, their transcript levels were also up-regulated significantly in leaves during maturation, senescence or in response to treatment with one or more abiotic stresses. Our results indicated that members of this family may function in tissue development, leaf senescence, and adaptation to adverse environments in apple. Copyright © 2014 Elsevier Masson SAS. All rights reserved.

  3. Genome-wide analysis and evolutionary study of sucrose non-fermenting 1-related protein kinase 2 (SnRK2) gene family members in Arabidopsis and Oryza.

    PubMed

    Saha, Jayita; Chatterjee, Chitrita; Sengupta, Atreyee; Gupta, Kamala; Gupta, Bhaskar

    2014-04-01

    The over-expression of plant specific SnRK2 gene family members by hyperosmotic stress and some by abscisic acid is well established. In this report, we have analyzed the evolution of SnRK2 gene family in different plant lineages including green algae, moss, lycophyte, dicot and monocot. Our results provide some evidences to indicate that the natural selection pressure had considerable influence on cis-regulatory promoter region and coding region of SnRK2 members in Arabidopsis and Oryza independently through time. Observed degree of sequence/motif conservation amongst SnRK2 homolog in all the analyzed plant lineages strongly supported their inclusion as members of this family. The chromosomal distributions of duplicated SnRK2 members have also been analyzed in Arabidopsis and Oryza. Massively Parallel Signature Sequencing (MPSS) database derived expression data and the presence of abiotic stress related promoter elements within the 1 kb upstream promoter region of these SnRK2 family members further strengthen the observations of previous workers. Additionally, the phylogenetic relationships of SnRK2 have been studied in all plant lineages along with their respective exon-intron structural patterns. Our results indicate that the ancestral SnRK2 gene of land plants gradually evolved by duplication and diversification and modified itself through exon-intron loss events to survive under environmental stress conditions. Copyright © 2013 Elsevier Ltd. All rights reserved.

  4. A genome-wide analysis of the auxin/indole-3-acetic acid gene family in hexaploid bread wheat (Triticum aestivum L.).

    PubMed

    Qiao, Linyi; Zhang, Xiaojun; Han, Xiao; Zhang, Lei; Li, Xin; Zhan, Haixian; Ma, Jian; Luo, Peigao; Zhang, Wenping; Cui, Lei; Li, Xiaoyan; Chang, Zhijian

    2015-01-01

    The Auxin/indole-3-acetic acid (Aux/IAA) gene family plays key roles in the primary auxin-response process and controls a number of important traits in plants. However, the characteristics of the Aux/IAA gene family in hexaploid bread wheat (Triticum aestivum L.) have long been unknown. In this study, a comprehensive identification of the Aux/IAA gene family was performed using the latest draft genome sequence of the bread wheat "Chinese Spring." Thirty-four Aux/IAA genes were identified, 30 of which have duplicated genes on the A, B or D sub-genome, with a total of 84 Aux/IAA sequences. These predicted Aux/IAA genes were non-randomly distributed in all the wheat chromosomes except for chromosome 2D. The information of wheat Aux/IAA proteins is also described. Based on an analysis of phylogeny, expression and adaptive evolution, we prove that the Aux/IAA family in wheat has been replicated twice in the two allopolyploidization events of bread wheat, when the tandem duplication also occurred. The duplicated genes have undergone an evolutionary process of purifying selection, resulting in the high conservation of copy genes among sub-genomes and functional redundancy among several members of the TaIAA family. However, functional divergence probably existed in most TaIAA members due to the diversity of the functional domain and expression pattern. Our research provides useful information for further research into the function of Aux/IAA genes in wheat.

  5. Genome-wide expression analysis of rice ABC transporter family across spatio-temporal samples and in response to abiotic stresses.

    PubMed

    Nguyen, Van Ngoc Tuyet; Moon, Sunok; Jung, Ki-Hong

    2014-09-01

    Although the super family of ATP-binding cassette (ABC) proteins plays key roles in the physiology and development of plants, the functions of members of this interesting family mostly remain to be clarified, especially in crop plants. Thus, systematic analysis of this family in rice (Oryza sativa), a major model crop plant, will be helpful in the design of effective strategies for functional analysis. Phylogenomic analysis that integrates anatomy and stress meta-profiling data based on a large collection of rice Affymetrix array data into the phylogenic context provides useful clues into the functions for each of the ABC transporter family members in rice. Using anatomy data, we identified 17 root-preferred and 16-shoot preferred genes at the vegetative stage, and 3 pollen, 2 embryo, 2 ovary, 2 endosperm, and 1 anther-preferred gene at the reproductive stage. The stress data revealed significant up-regulation or down-regulation of 47 genes under heavy metal treatment, 16 genes under nutrient deficient conditions, and 51 genes under abiotic stress conditions. Of these, we confirmed the differential expression patterns of 14 genes in root samples exposed to drought stress using quantitative real-time PCR. Network analysis using RiceNet suggests a functional gene network involving nine rice ABC transporters that are differentially regulated by drought stress in root, further enhancing the prediction of biological function. Our analysis provides a molecular basis for the study of diverse biological phenomena mediated by the ABC family in rice and will contribute to the enhancement of crop yield and stress tolerance. Copyright © 2014 Elsevier GmbH. All rights reserved.

  6. Genome-wide analysis of the grapevine stilbene synthase multigenic family: genomic organization and expression profiles upon biotic and abiotic stresses

    PubMed Central

    2012-01-01

    Background Plant stilbenes are a small group of phenylpropanoids, which have been detected in at least 72 unrelated plant species and accumulate in response to biotic and abiotic stresses such as infection, wounding, UV-C exposure and treatment with chemicals. Stilbenes are formed via the phenylalanine/polymalonate-route, the last step of which is catalyzed by the enzyme stilbene synthase (STS), a type III polyketide synthase (PKS). Stilbene synthases are closely related to chalcone synthases (CHS), the key enzymes of the flavonoid pathway, as illustrated by the fact that both enzymes share the same substrates. To date, STSs have been cloned from peanut, pine, sorghum and grapevine, the only stilbene-producing fruiting-plant for which the entire genome has been sequenced. Apart from sorghum, STS genes appear to exist as a family of closely related genes in these other plant species. Results In this study a complete characterization of the STS multigenic family in grapevine has been performed, commencing with the identification, annotation and phylogenetic analysis of all members and integration of this information with a comprehensive set of gene expression analyses including healthy tissues at differential developmental stages and in leaves exposed to both biotic (downy mildew infection) and abiotic (wounding and UV-C exposure) stresses. At least thirty-three full length sequences encoding VvSTS genes were identified, which, based on predicted amino acid sequences, cluster in 3 principal groups designated A, B and C. The majority of VvSTS genes cluster in groups B and C and are located on chr16 whereas the few gene family members in group A are found on chr10. Microarray and mRNA-seq expression analyses revealed different patterns of transcript accumulation between the different groups of VvSTS family members and between VvSTSs and VvCHSs. Indeed, under certain conditions the transcriptional response of VvSTS and VvCHS genes appears to be diametrically opposed

  7. Genome-Wide Analysis of DNA Methylation in Human Amnion

    PubMed Central

    Kim, Jinsil; Pitlick, Mitchell M.; Christine, Paul J.; Schaefer, Amanda R.; Saleme, Cesar; Comas, Belén; Cosentino, Viviana; Gadow, Enrique; Murray, Jeffrey C.

    2013-01-01

    The amnion is a specialized tissue in contact with the amniotic fluid, which is in a constantly changing state. To investigate the importance of epigenetic events in this tissue in the physiology and pathophysiology of pregnancy, we performed genome-wide DNA methylation profiling of human amnion from term (with and without labor) and preterm deliveries. Using the Illumina Infinium HumanMethylation27 BeadChip, we identified genes exhibiting differential methylation associated with normal labor and preterm birth. Functional analysis of the differentially methylated genes revealed biologically relevant enriched gene sets. Bisulfite sequencing analysis of the promoter region of the oxytocin receptor (OXTR) gene detected two CpG dinucleotides showing significant methylation differences among the three groups of samples. Hypermethylation of the CpG island of the solute carrier family 30 member 3 (SLC30A3) gene in preterm amnion was confirmed by methylation-specific PCR. This work provides preliminary evidence that DNA methylation changes in the amnion may be at least partially involved in the physiological process of labor and the etiology of preterm birth and suggests that DNA methylation profiles, in combination with other biological data, may provide valuable insight into the mechanisms underlying normal and pathological pregnancies. PMID:23533356

  8. A Genome-Wide Analysis of the LBD (LATERAL ORGAN BOUNDARIES Domain) Gene Family in Malus domestica with a Functional Characterization of MdLBD11

    PubMed Central

    Su, Ling; Liu, Xin; Hao, Yujin

    2013-01-01

    The plant-specific LBD (LATERAL ORGAN BOUNDARIES domain) genes belong to a major family of transcription factor that encode a zinc finger-like domain. It has been shown that LBD genes play crucial roles in the growth and development of Arabidopsis and other plant species. However, no detailed information concerning this family is available for apple. In the present study, we analyzed the apple (Malus domestica) genome and identified 58 LBD genes. This gene family was tested for its phylogenetic relationships with homologous genes in the Arabidopsis genome, as well as its location in the genome, structure and expression. We also transformed one MdLBD gene into Arabidopsis to evaluate its function. Like Arabidopsis, apple LBD genes also have a conserved CX2CX6CX3C zinc finger-like domain in the N terminus and can be divided into two classes. The expression profile indicated that apple LBD genes exhibited a variety of expression patterns, suggesting that they have diverse functions. At the same time, the expression analysis implied that members of this apple gene family were responsive to hormones and stress and that they may participate in hormone-mediated plant organogenesis, which was demonstrated with the overexpression of the apple LBD gene MdLBD11, resulting in an abnormal phenotype. This phenotype included upward curling leaves, delayed flowering, downward-pointing flowers, siliques and other abnormal traits. Based on these data, we concluded that the MdLBD genes may play an important role in apple growth and development as in Arabidopsis and other species. PMID:23468909

  9. Genome-Wide Analysis of the AP2/ERF Family in Eucalyptus grandis: An Intriguing Over-Representation of Stress-Responsive DREB1/CBF Genes

    PubMed Central

    SanClemente, H.; Mounet, F.; Dunand, C.; Marque, G.; Marque, C.; Teulières, C.

    2015-01-01

    Background The AP2/ERF family includes a large number of developmentally and physiologically important transcription factors sharing an AP2 DNA-binding domain. Among them DREB1/CBF and DREB2 factors are known as master regulators respectively of cold and heat/osmotic stress responses. Experimental Approaches The manual annotation of AP2/ERF family from Eucalyptus grandis, Malus, Populus and Vitis genomes allowed a complete phylogenetic study for comparing the structure of this family in woody species and the model Arabidopsis thaliana. Expression profiles of the whole groups of EgrDREB1 and EgrDREB2 were investigated through RNAseq database survey and RT-qPCR analyses. Results The structure and the size of the AP2/ERF family show a global conservation for the plant species under comparison. In addition to an expansion of the ERF subfamily, the tree genomes mainly differ with respect to the group representation within the subfamilies. With regard to the E. grandis DREB subfamily, an obvious feature is the presence of 17 DREB1/CBF genes, the maximum reported to date for dicotyledons. In contrast, only six DREB2 have been identified, which is similar to the other plants species under study, except for Malus. All the DREB1/CBF and DREB2 genes from E. grandis are expressed in at least one condition and all are heat-responsive. Regulation by cold and drought depends on the genes but is not specific of one group; DREB1/CBF group is more cold-inducible than DREB2 which is mainly drought responsive. Conclusion These features suggest that the dramatic expansion of the DREB1/CBF group might be related to the adaptation of this evergreen tree to climate changes when it expanded in Australia. PMID:25849589

  10. Genome-wide analysis of the AP2/ERF family in Eucalyptus grandis: an intriguing over-representation of stress-responsive DREB1/CBF genes.

    PubMed

    Cao, P B; Azar, S; SanClemente, H; Mounet, F; Dunand, C; Marque, G; Marque, C; Teulières, C

    2015-01-01

    The AP2/ERF family includes a large number of developmentally and physiologically important transcription factors sharing an AP2 DNA-binding domain. Among them DREB1/CBF and DREB2 factors are known as master regulators respectively of cold and heat/osmotic stress responses. The manual annotation of AP2/ERF family from Eucalyptus grandis, Malus, Populus and Vitis genomes allowed a complete phylogenetic study for comparing the structure of this family in woody species and the model Arabidopsis thaliana. Expression profiles of the whole groups of EgrDREB1 and EgrDREB2 were investigated through RNAseq database survey and RT-qPCR analyses. The structure and the size of the AP2/ERF family show a global conservation for the plant species under comparison. In addition to an expansion of the ERF subfamily, the tree genomes mainly differ with respect to the group representation within the subfamilies. With regard to the E. grandis DREB subfamily, an obvious feature is the presence of 17 DREB1/CBF genes, the maximum reported to date for dicotyledons. In contrast, only six DREB2 have been identified, which is similar to the other plants species under study, except for Malus. All the DREB1/CBF and DREB2 genes from E. grandis are expressed in at least one condition and all are heat-responsive. Regulation by cold and drought depends on the genes but is not specific of one group; DREB1/CBF group is more cold-inducible than DREB2 which is mainly drought responsive. These features suggest that the dramatic expansion of the DREB1/CBF group might be related to the adaptation of this evergreen tree to climate changes when it expanded in Australia.

  11. Genome-Wide Analysis of the AP2/ERF Transcription Factors Family and the Expression Patterns of DREB Genes in Moso Bamboo (Phyllostachys edulis).

    PubMed

    Wu, Huili; Lv, Hao; Li, Long; Liu, Jun; Mu, Shaohua; Li, Xueping; Gao, Jian

    2015-01-01

    The AP2/ERF transcription factor family, one of the largest families unique to plants, performs a significant role in terms of regulation of growth and development, and responses to biotic and abiotic stresses. Moso bamboo (Phyllostachys edulis) is a fast-growing non-timber forest species with the highest ecological, economic and social values of all bamboos in Asia. The draft genome of moso bamboo and the available genomes of other plants provide great opportunities to research global information on the AP2/ERF family in moso bamboo. In total, 116 AP2/ERF transcription factors were identified in moso bamboo. The phylogeny analyses indicated that the 116 AP2/ERF genes could be divided into three subfamilies: AP2, RAV and ERF; and the ERF subfamily genes were divided into 11 groups. The gene structures, exons/introns and conserved motifs of the PeAP2/ERF genes were analyzed. Analysis of the evolutionary patterns and divergence showed the PeAP2/ERF genes underwent a large-scale event around 15 million years ago (MYA) and the division time of AP2/ERF family genes between rice and moso bamboo was 15-23 MYA. We surveyed the putative promoter regions of the PeDREBs and showed that largely stress-related cis-elements existed in these genes. Further analysis of expression patterns of PeDREBs revealed that the most were strongly induced by drought, low-temperature and/or high salinity stresses in roots and, in contrast, most PeDREB genes had negative functions in leaves under the same respective stresses. In this study there were two main interesting points: there were fewer members of the PeDREB subfamily in moso bamboo than in other plants and there were differences in DREB gene expression profiles between leaves and roots triggered in response to abiotic stress. The information produced from this study may be valuable in overcoming challenges in cultivating moso bamboo.

  12. Genome-Wide Characterization of the MADS-Box Gene Family in Radish (Raphanus sativus L.) and Assessment of Its Roles in Flowering and Floral Organogenesis.

    PubMed

    Li, Chao; Wang, Yan; Xu, Liang; Nie, Shanshan; Chen, Yinglong; Liang, Dongyi; Sun, Xiaochuan; Karanja, Benard K; Luo, Xiaobo; Liu, Liwang

    2016-01-01

    The MADS-box gene family is an important transcription factor (TF) family that is involved in various aspects of plant growth and development, especially flowering time and floral organogenesis. Although it has been reported in many plant species, the systematic identification and characterization of MADS-box TF family is still limited in radish (Raphanus sativus L.). In the present study, a comprehensive analysis of MADS-box genes was performed, and a total of 144 MADS-box family members were identified from the whole radish genome. Meanwhile, a detailed list of MADS-box genes from other 28 plant species was also investigated. Through the phylogenetic analysis between radish and Arabidopsis thaliana, all the RsMADS genes were classified into two groups including 68 type I (31 Mα, 12 Mβ and 25Mγ) and 76 type II (70 MIKC(C) and 6 MIKC(∗)). Among them, 41 (28.47%) RsMADS genes were located in nine linkage groups of radish from R1 to R9. Moreover, the homologous MADS-box gene pairs were identified among radish, A. thaliana, Chinese cabbage and rice. Additionally, the expression profiles of RsMADS genes were systematically investigated in different tissues and growth stages. Furthermore, quantitative real-time PCR analysis was employed to validate expression patterns of some crucial RsMADS genes. These results could provide a valuable resource to explore the potential functions of RsMADS genes in radish, and facilitate dissecting MADS-box gene-mediated molecular mechanisms underlying flowering and floral organogenesis in root vegetable crops.

  13. Genome-Wide Characterization of the MADS-Box Gene Family in Radish (Raphanus sativus L.) and Assessment of Its Roles in Flowering and Floral Organogenesis

    PubMed Central

    Li, Chao; Wang, Yan; Xu, Liang; Nie, Shanshan; Chen, Yinglong; Liang, Dongyi; Sun, Xiaochuan; Karanja, Benard K.; Luo, Xiaobo; Liu, Liwang

    2016-01-01

    The MADS-box gene family is an important transcription factor (TF) family that is involved in various aspects of plant growth and development, especially flowering time and floral organogenesis. Although it has been reported in many plant species, the systematic identification and characterization of MADS-box TF family is still limited in radish (Raphanus sativus L.). In the present study, a comprehensive analysis of MADS-box genes was performed, and a total of 144 MADS-box family members were identified from the whole radish genome. Meanwhile, a detailed list of MADS-box genes from other 28 plant species was also investigated. Through the phylogenetic analysis between radish and Arabidopsis thaliana, all the RsMADS genes were classified into two groups including 68 type I (31 Mα, 12 Mβ and 25Mγ) and 76 type II (70 MIKCC and 6 MIKC∗). Among them, 41 (28.47%) RsMADS genes were located in nine linkage groups of radish from R1 to R9. Moreover, the homologous MADS-box gene pairs were identified among radish, A. thaliana, Chinese cabbage and rice. Additionally, the expression profiles of RsMADS genes were systematically investigated in different tissues and growth stages. Furthermore, quantitative real-time PCR analysis was employed to validate expression patterns of some crucial RsMADS genes. These results could provide a valuable resource to explore the potential functions of RsMADS genes in radish, and facilitate dissecting MADS-box gene-mediated molecular mechanisms underlying flowering and floral organogenesis in root vegetable crops. PMID:27703461

  14. Identification and genome-wide prediction of DNA binding specificities for the ApiAP2 family of regulators from the malaria parasite.

    PubMed

    Campbell, Tracey L; De Silva, Erandi K; Olszewski, Kellen L; Elemento, Olivier; Llinás, Manuel

    2010-10-28

    The molecular mechanisms underlying transcriptional regulation in apicomplexan parasites remain poorly understood. Recently, the Apicomplexan AP2 (ApiAP2) family of DNA binding proteins was identified as a major class of transcriptional regulators that are found across all Apicomplexa. To gain insight into the regulatory role of these proteins in the malaria parasite, we have comprehensively surveyed the DNA-binding specificities of all 27 members of the ApiAP2 protein family from Plasmodium falciparum revealing unique binding preferences for the majority of these DNA binding proteins. In addition to high affinity primary motif interactions, we also observe interactions with secondary motifs. The ability of a number of ApiAP2 proteins to bind multiple, distinct motifs significantly increases the potential complexity of the transcriptional regulatory networks governed by the ApiAP2 family. Using these newly identified sequence motifs, we infer the trans-factors associated with previously reported plasmodial cis-elements and provide evidence that ApiAP2 proteins modulate key regulatory decisions at all stages of parasite development. Our results offer a detailed view of ApiAP2 DNA binding specificity and take the first step toward inferring comprehensive gene regulatory networks for P. falciparum.

  15. Genome-Wide Identification and Expression Analysis of MRLK Family Genes Associated with Strawberry (Fragaria vesca) Fruit Ripening and Abiotic Stress Responses

    PubMed Central

    Zhang, Qing; Jia, Meiru; Xing, Yu; Qin, Ling; Li, Bingbing; Jia, Wensuo

    2016-01-01

    Malectin-like domain-containing receptor-like kinases (MRLK) constitute a large and divergent family of proteins in plants; however, little is known about the role of MRLKs in fruit growth and development. In this study, we characterized MRLK family genes in diploid strawberry, Fragaria vesca. Based on an analysis of malectin-like domain and a search in the strawberry genome and NCBI database, we identified 62 FvMRLKs in the strawberry genome, and classified these genes into six subfamilies with distinct malectin domains in the extracellular regions of the encoded proteins. Gene expression analysis indicated that more than 80% of the FvMRLKs were expressed in various tissues, with higher levels in roots than in other organs. Thirty-three FvMRLKs were found to be expressed in fruits during the early stages of development, and over 60% of these exhibited dramatic decreases in expression during fruit growth and development. Moreover, the expression of some FvMRLKs was sensitive to both environmental and internal cues that play critical roles in regulating strawberry fruit development and ripening. Collectively, this study provides valuable insight into the FvMRLKs gene family and its role in regulating strawberry fruit development and ripening. PMID:27685863

  16. Genome-Wide Identification and Expression Analysis of MRLK Family Genes Associated with Strawberry (Fragaria vesca) Fruit Ripening and Abiotic Stress Responses.

    PubMed

    Zhang, Qing; Jia, Meiru; Xing, Yu; Qin, Ling; Li, Bingbing; Jia, Wensuo

    Malectin-like domain-containing receptor-like kinases (MRLK) constitute a large and divergent family of proteins in plants; however, little is known about the role of MRLKs in fruit growth and development. In this study, we characterized MRLK family genes in diploid strawberry, Fragaria vesca. Based on an analysis of malectin-like domain and a search in the strawberry genome and NCBI database, we identified 62 FvMRLKs in the strawberry genome, and classified these genes into six subfamilies with distinct malectin domains in the extracellular regions of the encoded proteins. Gene expression analysis indicated that more than 80% of the FvMRLKs were expressed in various tissues, with higher levels in roots than in other organs. Thirty-three FvMRLKs were found to be expressed in fruits during the early stages of development, and over 60% of these exhibited dramatic decreases in expression during fruit growth and development. Moreover, the expression of some FvMRLKs was sensitive to both environmental and internal cues that play critical roles in regulating strawberry fruit development and ripening. Collectively, this study provides valuable insight into the FvMRLKs gene family and its role in regulating strawberry fruit development and ripening.

  17. Genome-Wide Identification and Transcriptome-Based Expression Profiling of the Sox Gene Family in the Nile Tilapia (Oreochromis niloticus)

    PubMed Central

    Wei, Ling; Yang, Chao; Tao, Wenjing; Wang, Deshou

    2016-01-01

    The Sox transcription factor family is characterized with the presence of a Sry-related high-mobility group (HMG) box and plays important roles in various biological processes in animals, including sex determination and differentiation, and the development of multiple organs. In this study, 27 Sox genes were identified in the genome of the Nile tilapia (Oreochromis niloticus), and were classified into seven groups. The members of each group of the tilapia Sox genes exhibited a relatively conserved exon-intron structure. Comparative analysis showed that the Sox gene family has undergone an expansion in tilapia and other teleost fishes following their whole genome duplication, and group K only exists in teleosts. Transcriptome-based analysis demonstrated that most of the tilapia Sox genes presented stage-specific and/or sex-dimorphic expressions during gonadal development, and six of the group B Sox genes were specifically expressed in the adult brain. Our results provide a better understanding of gene structure and spatio-temporal expression of the Sox gene family in tilapia, and will be useful for further deciphering the roles of the Sox genes during sex determination and gonadal development in teleosts. PMID:26907269

  18. Genome-Wide Identification and Transcriptome-Based Expression Profiling of the Sox Gene Family in the Nile Tilapia (Oreochromis niloticus).

    PubMed

    Wei, Ling; Yang, Chao; Tao, Wenjing; Wang, Deshou

    2016-02-23

    The Sox transcription factor family is characterized with the presence of a Sry-related high-mobility group (HMG) box and plays important roles in various biological processes in animals, including sex determination and differentiation, and the development of multiple organs. In this study, 27 Sox genes were identified in the genome of the Nile tilapia (Oreochromis niloticus), and were classified into seven groups. The members of each group of the tilapia Sox genes exhibited a relatively conserved exon-intron structure. Comparative analysis showed that the Sox gene family has undergone an expansion in tilapia and other teleost fishes following their whole genome duplication, and group K only exists in teleosts. Transcriptome-based analysis demonstrated that most of the tilapia Sox genes presented stage-specific and/or sex-dimorphic expressions during gonadal development, and six of the group B Sox genes were specifically expressed in the adult brain. Our results provide a better understanding of gene structure and spatio-temporal expression of the Sox gene family in tilapia, and will be useful for further deciphering the roles of the Sox genes during sex determination and gonadal development in teleosts.

  19. Genome-wide identification, expression analysis of auxin-responsive GH3 family genes in maize (Zea mays L.) under abiotic stresses.

    PubMed

    Feng, Shangguo; Yue, Runqing; Tao, Sun; Yang, Yanjun; Zhang, Lei; Xu, Mingfeng; Wang, Huizhong; Shen, Chenjia

    2015-09-01

    Auxin is involved in different aspects of plant growth and development by regulating the expression of auxin-responsive family genes. As one of the three major auxin-responsive families, GH3 (Gretchen Hagen3) genes participate in auxin homeostasis by catalyzing auxin conjugation and bounding free indole-3-acetic acid (IAA) to amino acids. However, how GH3 genes function in responses to abiotic stresses and various hormones in maize is largely unknown. Here, the latest updated maize (Zea mays L.) reference genome sequence was used to characterize and analyze the ZmGH3 family genes from maize. The results showed that 13 ZmGH3 genes were mapped on five maize chromosomes (total 10 chromosomes). Highly diversified gene structures and tissue-specific expression patterns suggested the possibility of function diversification for these genes in response to environmental stresses and hormone stimuli. The expression patterns of ZmGH3 genes are responsive to several abiotic stresses (salt, drought and cadmium) and major stress-related hormones (abscisic acid, salicylic acid and jasmonic acid). Various environmental factors suppress auxin free IAA contents in maize roots suggesting that these abiotic stresses and hormones might alter GH3-mediated auxin levels. The responsiveness of ZmGH3 genes to a wide range of abiotic stresses and stress-related hormones suggested that ZmGH3s are involved in maize tolerance to environmental stresses. © 2014 Institute of Botany, Chinese Academy of Sciences.

  20. Genome-Wide Identification and Expression Analysis of NBS-Encoding Genes in Malus x domestica and Expansion of NBS Genes Family in Rosaceae

    PubMed Central

    Arya, Preeti; Kumar, Gulshan; Acharya, Vishal; Singh, Anil K.

    2014-01-01

    Nucleotide binding site leucine-rich repeats (NBS-LRR) disease resistance proteins play an important role in plant defense against pathogen attack. A number of recent studies have been carried out to identify and characterize NBS-LRR gene families in many important plant species. In this study, we identified NBS-LRR gene family comprising of 1015 NBS-LRRs using highly stringent computational methods. These NBS-LRRs were characterized on the basis of conserved protein motifs, gene duplication events, chromosomal locations, phylogenetic relationships and digital gene expression analysis. Surprisingly, equal distribution of Toll/interleukin-1 receptor (TIR) and coiled coil (CC) (1∶1) was detected in apple while the unequal distribution was reported in majority of all other known plant genome studies. Prediction of gene duplication events intriguingly revealed that not only tandem duplication but also segmental duplication may equally be responsible for the expansion of the apple NBS-LRR gene family. Gene expression profiling using expressed sequence tags database of apple and quantitative real-time PCR (qRT-PCR) revealed the expression of these genes in wide range of tissues and disease conditions, respectively. Taken together, this study will provide a blueprint for future efforts towards improvement of disease resistance in apple. PMID:25232838

  1. Genome-wide analysis and characterization of Aux/IAA family genes related to fruit ripening in papaya (Carica papaya L.).

    PubMed

    Liu, Kaidong; Yuan, Changchun; Feng, Shaoxian; Zhong, Shuting; Li, Haili; Zhong, Jundi; Shen, Chenjia; Liu, Jinxiang

    2017-05-05

    Auxin/indole-3-acetic acid (Aux/IAA) family genes encode short-lived nuclear proteins that mediate the responses of auxin-related genes and are involved in several plant developmental and growth processes. However, how Aux/IAA genes function in the fruit development and ripening of papaya (Carica papaya L.) is largely unknown. In this study, a comprehensive identification and a distinctive expression analysis of 18 C. papaya Aux/IAA (CpIAA) genes were performed using newly updated papaya reference genome data. The Aux/IAA gene family in papaya is slightly smaller than that in Arabidopsis, but all of the phylogenetic subfamilies are represented. Most of the CpIAA genes are responsive to various phytohormones and expressed in a tissues-specific manner. To understand the putative biological functions of the CpIAA genes involved in fruit development and ripening, quantitative real-time PCR was used to test the expression profiling of CpIAA genes at different stages. Furthermore, an IAA treatment significantly delayed the ripening process in papaya fruit at the early stages. The expression changes of CpIAA genes in ACC and 1-MCP treatments suggested a crosstalk between auxin and ethylene during the fruit ripening process of papaya. Our study provided comprehensive information on the Aux/IAA family in papaya, including gene structures, phylogenetic relationships and expression profiles. The involvement of CpIAA gene expression changes in fruit development and ripening gives us an opportunity to understand the roles of auxin signaling in the maturation of papaya reproductive organs.

  2. Genome-Wide Analysis and Expression Profiling of the SUC and SWEET Gene Families of Sucrose Transporters in Oilseed Rape (Brassica napus L.).

    PubMed

    Jian, Hongju; Lu, Kun; Yang, Bo; Wang, Tengyue; Zhang, Li; Zhang, Aoxiang; Wang, Jia; Liu, Liezhao; Qu, Cunmin; Li, Jiana

    2016-01-01

    Sucrose is the principal transported product of photosynthesis from source leaves to sink organs. SUTs/SUCs (sucrose transporters or sucrose carriers) and SWEETs (Sugars Will Eventually be Exported Transporters) play significant central roles in phloem loading and unloading. SUTs/SUCs and SWEETs are key players in sucrose translocation and are associated with crop yields. The SUT/SUC and SWEET genes have been characterized in several plant species, but a comprehensive analysis of these two gene families in oilseed rape has not yet been reported. In our study, 22 and 68 members of the SUT/SUCs and SWEET gene families, respectively, were identified in the oilseed rape (Brassica napus) genome through homology searches. An analysis of the chromosomal distribution, phylogenetic relationships, gene structures, motifs and the cis-acting regulatory elements in the promoters of BnSUC and BnSWEET genes were analyzed. Furthermore, we examined the expression of the 18 BnSUC and 16 BnSWEET genes in different tissues of "ZS11" and the expression of 9 BnSUC and 7 BnSWEET genes in "ZS11" under various conditions, including biotic stress (Sclerotinia sclerotiorum), abiotic stresses (drought, salt and heat), and hormone treatments (abscisic acid, auxin, cytokinin, brassinolide, gibberellin, and salicylic acid). In conclusion, our study provides the first comprehensive analysis of the oilseed rape SUC and SWEET gene families. Information regarding the phylogenetic relationships, gene structure and expression profiles of the SUC and SWEET genes in the different tissues of oilseed rape helps to identify candidates with potential roles in specific developmental processes. Our study advances our understanding of the important roles of sucrose transport in oilseed rape.

  3. Genome-wide identification and characterization of TIFY family genes in Moso Bamboo (Phyllostachys edulis) and expression profiling analysis under dehydration and cold stresses

    PubMed Central

    Jin, Si-Han; Guo, Han-Du; Zhong, Xiao-Juan; He, Jiao; Li, Xi; Jiang, Ming-Yan; Yu, Xiao-Fang; Ma, Ming-Dong; Chen, Qi-Bing

    2016-01-01

    The proteins containing the TIFY domain belong to a plant-specific family of putative transcription factors and could be divided into four subfamilies: ZML, TIFY, PPD and JAZ. They not only function as key regulators of jasmonate hormonal response, but are also involved in responding to abiotic stress. In this study, we identified 24 TIFY genes (PeTIFYs) in Moso bamboo (Phyllostachys edulis) of Poaceae by analyzing the whole genome sequence. One PeTIFY belongs to TIFY subfamily, 18 and five belong to JAZ and ZML subfamilies, respectively. Two equivocal gene models were re-predicted and a putative retrotransposition event was found in a ZML protein. The distribution and conservation of domain or motif, and gene structure were also analyzed. Phylogenetic analysis with TIFY proteins of Arabidopsis and Oryza sativa indicated that JAZ subfamily could be further divided to four groups. Evolutionary analysis revealed intragenomic duplication and orthologous relationship between P. edulis, O. sativa, and B. distachyon. Calculation of the non-synonymous (Ka) and synonymous (Ks) substitution rates and their ratios indicated that the duplication of PeTIFY may have occurred around 16.7 million years ago (MYA), the divergence time of TIFY family among the P. edulis-O. sativa, P. edulis-B. distachyon, and O. sativa-B. distachyon was approximately 39 MYA, 39 MYA, and 45 MYA, respectively. They appear to have undergone extensive purifying selection during evolution. Transcriptome sequencing revealed that more than 50% of PeTIFY genes could be up-regulated by cold and dehydration stresses, and some PeTIFYs also share homology to know TIFYs involved in abiotic stress tolerance. Our results made insights into TIFY family of Moso bamboo, an economically important non-timber forest resource, and provided candidates for further identification of genes involved in regulating responses to abiotic stress. PMID:27812419

  4. Genome-Wide Analysis and Expression Profiling of the SUC and SWEET Gene Families of Sucrose Transporters in Oilseed Rape (Brassica napus L.)

    PubMed Central

    Jian, Hongju; Lu, Kun; Yang, Bo; Wang, Tengyue; Zhang, Li; Zhang, Aoxiang; Wang, Jia; Liu, Liezhao; Qu, Cunmin; Li, Jiana

    2016-01-01

    Sucrose is the principal transported product of photosynthesis from source leaves to sink organs. SUTs/SUCs (sucrose transporters or sucrose carriers) and SWEETs (Sugars Will Eventually be Exported Transporters) play significant central roles in phloem loading and unloading. SUTs/SUCs and SWEETs are key players in sucrose translocation and are associated with crop yields. The SUT/SUC and SWEET genes have been characterized in several plant species, but a comprehensive analysis of these two gene families in oilseed rape has not yet been reported. In our study, 22 and 68 members of the SUT/SUCs and SWEET gene families, respectively, were identified in the oilseed rape (Brassica napus) genome through homology searches. An analysis of the chromosomal distribution, phylogenetic relationships, gene structures, motifs and the cis-acting regulatory elements in the promoters of BnSUC and BnSWEET genes were analyzed. Furthermore, we examined the expression of the 18 BnSUC and 16 BnSWEET genes in different tissues of “ZS11” and the expression of 9 BnSUC and 7 BnSWEET genes in “ZS11” under various conditions, including biotic stress (Sclerotinia sclerotiorum), abiotic stresses (drought, salt and heat), and hormone treatments (abscisic acid, auxin, cytokinin, brassinolide, gibberellin, and salicylic acid). In conclusion, our study provides the first comprehensive analysis of the oilseed rape SUC and SWEET gene families. Information regarding the phylogenetic relationships, gene structure and expression profiles of the SUC and SWEET genes in the different tissues of oilseed rape helps to identify candidates with potential roles in specific developmental processes. Our study advances our understanding of the important roles of sucrose transport in oilseed rape. PMID:27733861

  5. Genome-wide analysis of the Hsp20 gene family in soybean: comprehensive sequence, genomic organization and expression profile analysis under abiotic and biotic stresses

    PubMed Central

    2013-01-01

    Background The Hsp20 genes are associated with stress caused by HS and other abiotic factors, but have recently been found to be associated with the response to biotic stresses. These genes represent the most abundant class among the HSPs in plants, but little is known about this gene family in soybean. Because of their apparent multifunctionality, these proteins are promising targets for developing crop varieties that are better adapted to biotic and abiotic stresses. Thus, in the present study an in silico identification of GmHsp20 gene family members was performed, and the genes were characterized and subjected to in vivo expression analysis under biotic and abiotic stresses. Results A search of the available soybean genome databases revealed 51 gene models as potential GmHsp20 candidates. The 51 GmHsp20 genes were distributed across a total of 15 subfamilies where a specific predicted secondary structure was identified. Based on in vivo analysis, only 47 soybean Hsp20 genes were responsive to heat shock stress. Among the GmHsp20 genes that were potentials HSR, five were also cold-induced, and another five, in addition to one GmAcd gene, were responsive to Meloidogyne javanica infection. Furthermore, one predicted GmHsp20 was shown to be responsive only to nematode infection; no expression change was detected under other stress conditions. Some of the biotic stress-responsive GmHsp20 genes exhibited a divergent expression pattern between resistant and susceptible soybean genotypes under M. javanica infection. The putative regulatory elements presenting some conservation level in the GmHsp20 promoters included HSE, W-box, CAAT box, and TA-rich elements. Some of these putative elements showed a unique occurrence pattern among genes responsive to nematode infection. Conclusions The evolution of Hsp20 family in soybean genome has most likely involved a total of 23 gene duplications. The obtained expression profiles revealed that the majority of the 51 GmHsp20

  6. Analyses of phylogeny, evolution, conserved sequences and genome-wide expression of the ICK/KRP family of plant CDK inhibitors

    PubMed Central

    Torres Acosta, Juan Antonio; Fowke, Larry C.; Wang, Hong

    2011-01-01

    Background and Aims The cell cycle is controlled by cyclin-dependent kinases (CDKs), and CDK inhibitors are major regulators of their activities. The ICK/KRP family of CDK inhibitors has been reported in several plants, with seven members in arabidopsis; however, the phylogenetic relationship among members in different species is unknown. Also, there is a need to understand how these genes and proteins are regulated. Furthermore, little information is available on the functional differences among ICK/KRP family members. Methods We searched publicly available databases and identified over 120 unique ICK/KRP protein sequences from more than 60 plant species. Phylogenetic analysis was performed using 101 full-length sequences from 40 species and intron–exon organization of ICK/KRP genes in model species. Conserved sequences and motifs were analysed using ICK/KRP protein sequences from arabidopsis (Arabidopsis thaliana), rice (Orysa sativa) and poplar (Populus trichocarpa). In addition, gene expression was examined using microarray data from arabidopsis, rice and poplar, and further analysed by RT-PCR for arabidopsis. Key Results and Conclusions Phylogenetic analysis showed that plant ICK/KRP proteins can be grouped into three major classes. Whereas the C-class contains sequences from dicotyledons, monocotyledons and gymnosperms, the A- and B-classes contain only sequences from dicotyledons or monocotyledons, respectively, suggesting that the A- and B-classes might have evolved from the C-class. This classification is also supported by exon–intron organization. Genes in the A- and B- classes have four exons, whereas genes in the C-class have only three exons. Analysis of sequences from arabidopsis, rice and poplar identified conserved sequence motifs, some of which had not been described previously, and putative functional sites. The presence of conserved motifs in different family members is consistent with the classification. In addition, gene expression analysis

  7. Genome-wide analysis identifies gain and loss/change of function within the small multigenic insecticidal Albumin 1 family of Medicago truncatula.

    PubMed

    Karaki, L; Da Silva, P; Rizk, F; Chouabe, C; Chantret, N; Eyraud, V; Gressent, F; Sivignon, C; Rahioui, I; Kahn, D; Brochier-Armanet, C; Rahbé, Y; Royer, C

    2016-03-10

    Albumin 1b peptides (A1b) are small disulfide-knotted insecticidal peptides produced by Fabaceae (also called Leguminosae). To date, their diversity among this plant family has been essentially investigated through biochemical and PCR-based approaches. The availability of high-quality genomic resources for several fabaceae species, among which the model species Medicago truncatula (Mtr), allowed for a genomic analysis of this protein family aimed at i) deciphering the evolutionary history of A1b proteins and their links with A1b-nodulins that are short non-insecticidal disulfide-bonded peptides involved in root nodule signaling and ii) exploring the functional diversity of A1b for novel bioactive molecules. Investigating the Mtr genome revealed a remarkable expansion, mainly through tandem duplications, of albumin1 (A1) genes, retaining nearly all of the same canonical structure at both gene and protein levels. Phylogenetic analysis revealed that the ancestral molecule was most probably insecticidal giving rise to, among others, A1b-nodulins. Expression meta-analysis revealed that many A1b coding genes are silent and a wide tissue distribution of the A1 transcripts/peptides within plant organs. Evolutionary rate analyses highlighted branches and sites with positive selection signatures, including two sites shown to be critical for insecticidal activity. Seven peptides were chemically synthesized and folded in vitro, then assayed for their biological activity. Among these, AG41 (aka MtrA1013 isoform, encoded by the orphan TA24778 contig.), showed an unexpectedly high insecticidal activity. The study highlights the unique burst of diversity of A1 peptides within the Medicago genus compared to the other taxa for which full-genomes are available: no A1 member in Lotus, or in red clover to date, while only a few are present in chick pea, soybean or pigeon pea genomes. The expansion of the A1 family in the Medicago genus is reminiscent of the situation described for

  8. Genome-Wide Analysis of the AP2/ERF Transcription Factors Family and the Expression Patterns of DREB Genes in Moso Bamboo (Phyllostachys edulis)

    PubMed Central

    Li, Long; Liu, Jun; Mu, Shaohua; Li, Xueping; Gao, Jian

    2015-01-01

    The AP2/ERF transcription factor family, one of the largest families unique to plants, performs a significant role in terms of regulation of growth and development, and responses to biotic and abiotic stresses. Moso bamboo (Phyllostachys edulis) is a fast-growing non-timber forest species with the highest ecological, economic and social values of all bamboos in Asia. The draft genome of moso bamboo and the available genomes of other plants provide great opportunities to research global information on the AP2/ERF family in moso bamboo. In total, 116 AP2/ERF transcription factors were identified in moso bamboo. The phylogeny analyses indicated that the 116 AP2/ERF genes could be divided into three subfamilies: AP2, RAV and ERF; and the ERF subfamily genes were divided into 11 groups. The gene structures, exons/introns and conserved motifs of the PeAP2/ERF genes were analyzed. Analysis of the evolutionary patterns and divergence showed the PeAP2/ERF genes underwent a large-scale event around 15 million years ago (MYA) and the division time of AP2/ERF family genes between rice and moso bamboo was 15–23 MYA. We surveyed the putative promoter regions of the PeDREBs and showed that largely stress-related cis-elements existed in these genes. Further analysis of expression patterns of PeDREBs revealed that the most were strongly induced by drought, low-temperature and/or high salinity stresses in roots and, in contrast, most PeDREB genes had negative functions in leaves under the same respective stresses. In this study there were two main interesting points: there were fewer members of the PeDREB subfamily in moso bamboo than in other plants and there were differences in DREB gene expression profiles between leaves and roots triggered in response to abiotic stress. The information produced from this study may be valuable in overcoming challenges in cultivating moso bamboo. PMID:25985202

  9. Genome-wide analysis of the Hsp20 gene family in soybean: comprehensive sequence, genomic organization and expression profile analysis under abiotic and biotic stresses.

    PubMed

    Lopes-Caitar, Valéria S; de Carvalho, Mayra C C G; Darben, Luana M; Kuwahara, Marcia K; Nepomuceno, Alexandre L; Dias, Waldir P; Abdelnoor, Ricardo V; Marcelino-Guimarães, Francismar C

    2013-08-28

    The Hsp20 genes are associated with stress caused by HS and other abiotic factors, but have recently been found to be associated with the response to biotic stresses. These genes represent the most abundant class among the HSPs in plants, but little is known about this gene family in soybean. Because of their apparent multifunctionality, these proteins are promising targets for developing crop varieties that are better adapted to biotic and abiotic stresses. Thus, in the present study an in silico identification of GmHsp20 gene family members was performed, and the genes were characterized and subjected to in vivo expression analysis under biotic and abiotic stresses. A search of the available soybean genome databases revealed 51 gene models as potential GmHsp20 candidates. The 51 GmHsp20 genes were distributed across a total of 15 subfamilies where a specific predicted secondary structure was identified. Based on in vivo analysis, only 47 soybean Hsp20 genes were responsive to heat shock stress. Among the GmHsp20 genes that were potentials HSR, five were also cold-induced, and another five, in addition to one GmAcd gene, were responsive to Meloidogyne javanica infection. Furthermore, one predicted GmHsp20 was shown to be responsive only to nematode infection; no expression change was detected under other stress conditions. Some of the biotic stress-responsive GmHsp20 genes exhibited a divergent expression pattern between resistant and susceptible soybean genotypes under M. javanica infection. The putative regulatory elements presenting some conservation level in the GmHsp20 promoters included HSE, W-box, CAAT box, and TA-rich elements. Some of these putative elements showed a unique occurrence pattern among genes responsive to nematode infection. The evolution of Hsp20 family in soybean genome has most likely involved a total of 23 gene duplications. The obtained expression profiles revealed that the majority of the 51 GmHsp20 candidates are induced under HT, but

  10. Genome-wide identification and characterization of WRKY transcriptional factor family in apple and analysis of their responses to waterlogging and drought stress.

    PubMed

    Meng, Dong; Li, Yuanyuan; Bai, Yang; Li, Mingjun; Cheng, Lailiang

    2016-06-01

    As one of the largest transcriptional factor families in plants, WRKY genes play significant roles in various biotic and abiotic stress responses. Although the WRKY gene family has been characterized in a few plant species, the details remain largely unknown in the apple (Malus domestica Borkh.). In this study, we identified a total of 127 MdWRKYs from the apple genome, which were divided into four subgroups according to the WRKY domains and zinc finger motif. Most of them were mapped onto the apple's 17 chromosomes and were expressed in more than one tissue, including shoot tips, mature leaves, fruit and apple calli. We then contrasted WRKY expression patterns between calli grown in solid medium (control) and liquid medium (representing waterlogging stress) and found that 34 WRKY genes were differentially expressed between the two growing conditions. Finally, we determined the expression patterns of 10 selected WRKY genes in an apple rootstock, G41, in response to waterlogging and drought stress, which identified candidate genes involved in responses to water stress for functional analysis. Our data provide interesting candidate MdWRKYs for future functional analysis and demonstrate that apple callus is a useful system for characterizing gene expression and function in apple. Copyright © 2016 Elsevier Masson SAS. All rights reserved.

  11. Genome-wide identification, classification and functional analyses of the bHLH transcription factor family in the pig, Sus scrofa.

    PubMed

    Liu, Wuyi

    2015-08-01

    The basic helix-loop-helix (bHLH) transcription factors are one of the largest families of gene regulatory proteins and play crucial roles in genetic, developmental and physiological processes in eukaryotes. Here, we conducted a survey of the Sus scrofa genome and identified 109 putative bHLH transcription factor members belonging to super-groups A, B, C, D, E, and F, respectively, while four members were orphan genes. We identified 6 most significantly enriched KEGG pathways and 116 most significant GO annotation categories. Further comprehensive surveys in human genome and other 12 medical databases identified 72 significantly enriched biological pathways with these 113 pig bHLH transcription factors. From the functional protein association network analysis 93 hub proteins were identified and 55 hub proteins created a tight network or a functional module within their protein families. Especially, there were 20 hub proteins found highly connected in the functional interaction network. The present study deepens our understanding and provided insights into the evolution and functional aspects of animal bHLH proteins and should serve as a solid foundation for further for analyses of specific bHLH transcription factors in the pig and other mammals.

  12. Genome-wide analysis of the fructose 1,6-bisphosphate aldolase (FBA) gene family and functional characterization of FBA7 in tomato.

    PubMed

    Cai, Bingbing; Li, Qiang; Xu, Yongchao; Yang, Long; Bi, Huangai; Ai, Xizhen

    2016-11-01

    Fructose 1,6-bisphosphate aldolase (FBA) is a key enzyme in plants that is involved in glycolysis, gluconeogenesis, and the Calvin cycle. FBA genes play significant roles in biotic and abiotic stress responses and also regulate growth and development. Despite the importance of FBA genes, little is known about it in tomato. In this study, we identified 8 FBA genes in tomato and classified them into 2 subgroups based on a phylogenetic tree, gene structures, and conserved motifs. Five (SlFBA1, 2, 3, 4 and 5) and three (SlFBA6, 7, and 8) SlFBA proteins were predicted to be localized in chloroplasts and cytoplasm, respectively. The phylogenetic analysis of FBAs from tomato, Arabidopsis, rice, and other organisms suggested that SlFBA shared the highest protein homology with FBAs from other plants. Synteny analysis indicated that segmental duplication events contributed to the expansion of the tomato FBA family. The expression profiles revealed that all SlFBAs were involved in the response to low and high temperature stresses. SlFBA7 overexpression increased the expression and activities of other main enzymes in Calvin cycle, net photosynthetic rate (Pn), seed size and stem diameter. SlFBA7 overexpression enhanced tolerances in seed germination under suboptimal temperature stresses. Taken together, comprehensive analyses of SlFBAs would provide a basis for understanding of evolution and function of SlFBA family.

  13. The EF-hand Ca(2+)-binding protein super-family: a genome-wide analysis of gene expression patterns in the adult mouse brain.

    PubMed

    Girard, F; Venail, J; Schwaller, B; Celio, M R

    2015-05-21

    In mice, 249 putative members of the superfamily of EF-hand domain Ca(2+)-binding proteins, manifesting great diversity in structure, cellular localization and functions have been identified. Three members in particular, namely, calbindin-D28K, calretinin and parvalbumin, are widely used as markers for specific neuronal subpopulations in different regions of the brain. The aim of the present study was to compile a comprehensive atlas of the gene-expression profiles of the entire EF-hand gene superfamily in the murine brain. This was achieved by a meticulous examination of the in-situ hybridization images in the Allen Brain Atlas database. Topographically, our analysis focused on the olfactory bulb, cerebral cortex (barrel cortex in the primary somatosensory area), basal ganglia, hippocampus, amygdala, thalamus, hypothalamus, cerebellum, midbrain, pons and medulla, and on clearly identifiable sub-structures within each of these areas. The expression profiles of four family-members, namely hippocalcin-like 4, neurocalcin-δ, plastin 3 and tescalcin, that have not been hitherto reported, at either the mRNA (in-situ-hybridization) or the protein (immunohistochemical) levels, are now presented for the first time. The fruit of our analysis is a document in which the gene-expression profiles of all members of the EF-hand family genes are compared, and in which future possible neuronal markers for specific cells/brain areas are identified. The assembled information could afford functional clues to investigators, conducive to further experimental pursuit.

  14. Genome-Wide Identification, Evolutionary Analysis and Expression Profiles of LATERAL ORGAN BOUNDARIES DOMAIN Gene Family in Lotus japonicus and Medicago truncatula

    PubMed Central

    Yang, Tianquan; Fang, Genwang yue; He, Hua

    2016-01-01

    The LATERAL ORGAN BOUNDARIES DOMAIN (LBD) gene family has been well-studied in Arabidopsis and play crucial roles in the diverse growth and development processes including establishment and maintenance of boundary of developmental lateral organs. In this study we identified and characterized 38 LBD genes in Lotus japonicus (LjLBD) and 57 LBD genes in Medicago truncatula (MtLBD), both of which are model legume plants that have some specific development features absent in Arabidopsis. The phylogenetic relationships, their locations in the genome, genes structure and conserved motifs were examined. The results revealed that all LjLBD and MtLBD genes could be distinctly divided into two classes: Class I and II. The evolutionary analysis showed that Type I functional divergence with some significantly site-specific shifts may be the main force for the divergence between Class I and Class II. In addition, the expression patterns of LjLBD genes uncovered the diverse functions in plant development. Interestingly, we found that two LjLBD proteins that were highly expressed during compound leaf and pulvinus development, can interact via yeast two-hybrid assays. Taken together, our findings provide an evolutionary and genetic foundation in further understanding the molecular basis of LBD gene family in general, specifically in L. japonicus and M. truncatula. PMID:27560982

  15. The heat shock factor gene family in Salix suchowensis: a genome-wide survey and expression profiling during development and abiotic stresses

    PubMed Central

    Zhang, Jin; Li, Yu; Jia, Hui-Xia; Li, Jian-Bo; Huang, Juan; Lu, Meng-Zhu; Hu, Jian-Jun

    2015-01-01

    Heat shock transcription factors (Hsfs), which act as important transcriptional regulatory proteins, play crucial roles in plant developmental processes, and stress responses. Recently, the genome of the shrub willow Salix suchowensis was fully sequenced. In this study, a total of 27 non-redundant Hsf genes were identified from the S. suchowensis genome. Phylogenetic analysis revealed that the members of the SsuHsf family can be divided into three groups (class A, B, and C) based on their structural characteristics. Promoter analysis indicated that the SsuHsfs promoters included various cis-acting elements related to hormone and/or stress responses. Furthermore, the expression profiles of 27 SsuHsfs were analyzed in different tissues and under various stresses (heat, drought, salt, and ABA treatment) using RT-PCR. The results demonstrated that the SsuHsfs were involved in abiotic stress responses. Our results contribute to a better understanding of the complexity of the SsuHsf gene family, and will facilitate functional characterization in futur