Deep conservation of cis-regulatory elements in metazoans
Maeso, Ignacio; Irimia, Manuel; Tena, Juan J.; Casares, Fernando; Gómez-Skarmeta, José Luis
2013-01-01
Despite the vast morphological variation observed across phyla, animals share multiple basic developmental processes orchestrated by a common ancestral gene toolkit. These genes interact with each other building complex gene regulatory networks (GRNs), which are encoded in the genome by cis-regulatory elements (CREs) that serve as computational units of the network. Although GRN subcircuits involved in ancient developmental processes are expected to be at least partially conserved, identification of CREs that are conserved across phyla has remained elusive. Here, we review recent studies that revealed such deeply conserved CREs do exist, discuss the difficulties associated with their identification and describe new approaches that will facilitate this search. PMID:24218633
Ancient genomic architecture for mammalian olfactory receptor clusters
Aloni, Ronny; Olender, Tsviya; Lancet, Doron
2006-01-01
Background: Mammalian olfactory receptor (OR) genes reside in numerous genomic clusters of up to several dozen genes. Whole-genome sequence alignment nets of five mammals allow their comprehensive comparison, aimed at reconstructing the ancestral olfactory subgenome. Results: We developed a new and general tool for genome-wide definition of genomic gene clusters conserved in multiple species. Syntenic orthologs, defined as gene pairs showing conservation of both genomic location and coding sequence, were subjected to a graph theory algorithm for discovering CLICs (clusters in conservation). When applied to ORs in five mammals, including the marsupial opossum, more than 90% of the OR genes were found within a framework of 48 multi-species CLICs, invoking a general conservation of gene order and composition. A detailed analysis of individual CLICs revealed multiple differences among species, interpretable through species-specific genomic rearrangements and reflecting complex mammalian evolutionary dynamics. One significant instance involves CLIC #1, which lacks a human member, implying the human-specific deletion of an OR cluster, whose mouse counterpart has been tentatively associated with isovaleric acid odorant detection. Conclusion: The identified multi-species CLICs demonstrate that most of the mammalian OR clusters have a common ancestry, preceding the split between marsupials and placental mammals. However, only two of these CLICs were capable of incorporating chicken OR genes, parsimoniously implying that all other CLICs emerged subsequent to the avian-mammalian divergence. PMID:17010214
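The abstract does not say which graph algorithm is used to discover CLICs. As a minimal sketch of one plausible reading, the code below treats each species-specific OR cluster as a node, links two clusters whenever they share a syntenic ortholog pair, and reports connected components as candidate multi-species groups. The function name, the cluster labels, and the choice of connected components are all assumptions made for illustration, not the authors' method.

```python
from collections import defaultdict


def connected_components(edges):
    """Group nodes into connected components of an undirected graph.

    `edges` is an iterable of (node_a, node_b) pairs; here each pair stands
    for two species-specific clusters linked by a syntenic ortholog
    (hypothetical input format).
    """
    adjacency = defaultdict(set)
    for a, b in edges:
        adjacency[a].add(b)
        adjacency[b].add(a)

    seen = set()
    components = []
    for start in adjacency:
        if start in seen:
            continue
        # Depth-first traversal collecting everything reachable from `start`.
        stack = [start]
        seen.add(start)
        component = set()
        while stack:
            node = stack.pop()
            component.add(node)
            for neighbor in adjacency[node]:
                if neighbor not in seen:
                    seen.add(neighbor)
                    stack.append(neighbor)
        components.append(component)
    return components


# Hypothetical cluster labels of the form "species:cluster_id".
ortholog_links = [
    ("human:c3", "mouse:c7"),
    ("mouse:c7", "dog:c2"),
    ("mouse:c1", "rat:c1"),
]
groups = connected_components(ortholog_links)
```

In this toy input the second group contains mouse and rat clusters but no human cluster, which is the kind of pattern the abstract describes for CLIC #1.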
Unraveling transcriptional control and cis-regulatory codes using the software suite GeneACT
Cheung, Tom Hiu; Kwan, Yin Lam; Hamady, Micah; Liu, Xuedong
2006-01-01
Deciphering gene regulatory networks requires the systematic identification of functional cis-acting regulatory elements. We present a suite of web-based bioinformatics tools, called GeneACT, that can rapidly detect evolutionarily conserved transcription factor binding sites or microRNA target sites that are either unique or over-represented in differentially expressed genes from DNA microarray data. GeneACT provides graphic visualization and extraction of common regulatory sequence elements in the promoters and 3'-untranslated regions that are conserved across multiple mammalian species. PMID:17064417
González, Carolina; Tabernero, David; Cortese, Maria Francesca; Gregori, Josep; Casillas, Rosario; Riveiro-Barciela, Mar; Godoy, Cristina; Sopena, Sara; Rando, Ariadna; Yll, Marçal; Lopez-Martinez, Rosa; Quer, Josep; Esteban, Rafael; Buti, Maria; Rodríguez-Frías, Francisco
2018-05-21
To detect hyper-conserved regions in the hepatitis B virus (HBV) X gene (HBX) 5' region that could be candidates for gene therapy. The study included 27 chronic hepatitis B treatment-naive patients in various clinical stages (from chronic infection to cirrhosis and hepatocellular carcinoma, both HBeAg-negative and HBeAg-positive), and infected with HBV genotypes A-F and H. In a serum sample from each patient with viremia > 3.5 log IU/mL, the HBX 5' end region [nucleotide (nt) 1255-1611] was PCR-amplified and submitted to next-generation sequencing (NGS). We assessed genotype variants by phylogenetic analysis, and evaluated conservation of this region by calculating the information content of each nucleotide position in a multiple alignment of all unique sequences (haplotypes) obtained by NGS. Conservation at the HBx protein amino acid (aa) level was also analyzed. NGS yielded 1333069 sequences from the 27 samples, with a median of 4578 sequences/sample (2487-9279, IQR 2817). In 14/27 patients (51.8%), phylogenetic analysis of viral nucleotide haplotypes showed a complex mixture of genotypic variants. Analysis of the information content in the haplotype multiple alignments detected 2 hyper-conserved nucleotide regions, one in the HBX upstream non-coding region (nt 1255-1286) and the other in the 5' end coding region (nt 1519-1603). This last region coded for a conserved amino acid region (aa 63-76) that partially overlaps a Kunitz-like domain. Two hyper-conserved regions detected in the HBX 5' end may be of value for targeted gene therapy, regardless of the patients' clinical stage or HBV genotype.
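The per-position information content mentioned above can be illustrated with the standard Shannon formulation for nucleotides, IC = log2(4) - H, where H is the entropy of the base frequencies in one alignment column. The sketch below assumes unweighted haplotypes, ignores gap and ambiguity characters, and applies no small-sample correction; the abstract does not state which of these choices the study made, and the function name is hypothetical.

```python
import math
from collections import Counter


def column_information(alignment):
    """Return the information content (bits) of each alignment column.

    `alignment` is a list of equal-length strings over A, C, G, T;
    any other character (for example '-') is skipped. A fully conserved
    column scores 2.0 bits, a column with four equally frequent bases 0.0.
    """
    length = len(alignment[0])
    scores = []
    for i in range(length):
        counts = Counter(seq[i] for seq in alignment if seq[i] in "ACGT")
        total = sum(counts.values())
        if total == 0:
            # Column holds only gaps or ambiguity codes: no information.
            scores.append(0.0)
            continue
        entropy = -sum(
            (n / total) * math.log2(n / total) for n in counts.values()
        )
        scores.append(2.0 - entropy)
    return scores


# Toy alignment of four haplotypes (made-up sequences).
haplotypes = ["ACGT", "ACGA", "ACCT", "ACTC"]
scores = column_information(haplotypes)
```

Runs of consecutive positions scoring close to 2 bits would correspond to what the abstract calls hyper-conserved regions; any threshold for "close to" is a further assumption.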
Uchiyama, Ikuo
2008-10-31
Identifying the set of intrinsically conserved genes, or the genomic core, among related genomes is crucial for understanding prokaryotic genomes where horizontal gene transfers are common. Although core genome identification appears to be obvious among very closely related genomes, it becomes more difficult when more distantly related genomes are compared. Here, we consider the core structure as a set of sufficiently long segments in which gene orders are conserved so that they are likely to have been inherited mainly through vertical transfer, and developed a method for identifying the core structure by finding the order of pre-identified orthologous groups (OGs) that maximally retains the conserved gene orders. The method was applied to genome comparisons of two well-characterized families, Bacillaceae and Enterobacteriaceae, and identified their core structures comprising 1438 and 2125 OGs, respectively. The core sets contained most of the essential genes and their related genes, which were primarily included in the intersection of the two core sets comprising around 700 OGs. The definition of the genomic core based on gene order conservation was demonstrated to be more robust than the simpler approach based only on gene conservation. We also investigated the core structures in terms of G+C content homogeneity and phylogenetic congruence, and found that the core genes primarily exhibited the expected characteristic, i.e., being indigenous and sharing the same history, more than the non-core genes. The results demonstrate that our strategy of genome alignment based on gene order conservation can provide an effective approach to identify the genomic core among moderately related microbial genomes.
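To make the idea of gene-order conservation concrete, here is a deliberately simplified pairwise sketch: each genome is a list of orthologous group (OG) identifiers in chromosomal order, and a conserved segment is a maximal run of OGs that are adjacent in both genomes, in either orientation. This is not the multi-genome optimization described in the abstract; the function name, the `min_length` parameter, and the assumption that each OG occurs at most once per genome are all illustrative.

```python
def conserved_segments(genome_a, genome_b, min_length=2):
    """Find runs of OGs in genome_a that are also contiguous in genome_b.

    Assumes every OG identifier appears at most once in each genome.
    Runs shorter than `min_length` are discarded.
    """
    position_b = {og: i for i, og in enumerate(genome_b)}
    segments = []
    current = []
    for og in genome_a:
        if og not in position_b:
            # OG absent from genome_b: it breaks any running segment.
            if len(current) >= min_length:
                segments.append(current)
            current = []
            continue
        if current and abs(position_b[og] - position_b[current[-1]]) == 1:
            # Neighbors in genome_a are also neighbors in genome_b.
            current.append(og)
        else:
            if len(current) >= min_length:
                segments.append(current)
            current = [og]
    if len(current) >= min_length:
        segments.append(current)
    return segments


# Made-up gene orders: og4-og5 is inverted in genome_b, og9 and og7 are unique.
genome_a = ["og1", "og2", "og3", "og9", "og4", "og5"]
genome_b = ["og5", "og4", "og7", "og1", "og2", "og3"]
segments = conserved_segments(genome_a, genome_b)
```

Under this simplification, OGs that fall inside sufficiently long segments would be core candidates, while isolated OGs such as og9 would be treated as possibly transferred.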
Evolution and Expression of Tissue Globins in Ray-Finned Fishes.
Gallagher, Michael D; Macqueen, Daniel J
2017-01-01
The globin gene family encodes oxygen-binding hemeproteins conserved across the major branches of multicellular life. The origins and evolutionary histories of complete globin repertoires have been established for many vertebrates, but there remain major knowledge gaps for ray-finned fish. Therefore, we used phylogenetic, comparative genomic and gene expression analyses to discover and characterize canonical “non-blood” globin family members (i.e., myoglobin, cytoglobin, neuroglobin, globin-X, and globin-Y) across multiple ray-finned fish lineages, revealing novel gene duplicates (paralogs) conserved from whole genome duplication (WGD) and small-scale duplication events. Our key findings were that: (1) globin-X paralogs in teleosts have been retained from the teleost-specific WGD, (2) functional paralogs of cytoglobin, neuroglobin, and globin-X, but not myoglobin, have been conserved from the salmonid-specific WGD, (3) triplicate lineage-specific myoglobin paralogs are conserved in arowanas (Osteoglossiformes), which arose by tandem duplication and diverged under positive selection, (4) globin-Y is retained in multiple early branching fish lineages that diverged before teleosts, and (5) marked variation in tissue-specific expression of globin gene repertoires exists across ray-finned fish evolution, including several previously uncharacterized sites of expression. In this respect, our data provide an interesting link between myoglobin expression and the evolution of air breathing in teleosts. Together, our findings demonstrate great, unrecognized diversity in the repertoire and expression of non-blood globins that has arisen during ray-finned fish evolution.
Blazier, J Chris; Ruhlman, Tracey A; Weng, Mao-Lun; Rehman, Sumaiyah K; Sabir, Jamal S M; Jansen, Robert K
2016-04-18
Genes for the plastid-encoded RNA polymerase (PEP) persist in the plastid genomes of all photosynthetic angiosperms. However, three unrelated lineages (Annonaceae, Passifloraceae and Geraniaceae) have been identified with unusually divergent open reading frames (ORFs) in the conserved region of rpoA, the gene encoding the PEP α subunit. We used sequence-based approaches to evaluate whether these genes retain function. Both gene sequences and complete plastid genome sequences were assembled and analyzed from each of the three angiosperm families. Multiple lines of evidence indicated that the rpoA sequences are likely functional despite retaining as low as 30% nucleotide sequence identity with rpoA genes from outgroups in the same angiosperm order. The ratio of non-synonymous to synonymous substitutions indicated that these genes are under purifying selection, and bioinformatic prediction of conserved domains indicated that functional domains are preserved. One of the lineages (Pelargonium, Geraniaceae) contains species with multiple rpoA-like ORFs that show evidence of ongoing inter-paralog gene conversion. The plastid genomes containing these divergent rpoA genes have experienced extensive structural rearrangement, including large expansions of the inverted repeat. We propose that illegitimate recombination, not positive selection, has driven the divergence of rpoA.
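The selection argument above rests on the ratio of non-synonymous to synonymous substitution rates, often written omega = dN/dS, with omega well below 1 read as purifying selection. The sketch below only shows that interpretation step for values that are already estimated; the function name and the `tolerance` band around 1 are hypothetical, and real analyses typically rely on likelihood-based tests rather than a fixed cutoff.

```python
def classify_selection(dn, ds, tolerance=0.1):
    """Return (omega, regime) for given dN and dS estimates.

    `tolerance` is an illustrative band around omega = 1 treated as
    "neutral"; it is not a statistical test.
    """
    if ds <= 0:
        raise ValueError("dS must be positive to form the ratio dN/dS")
    omega = dn / ds
    if omega < 1 - tolerance:
        regime = "purifying"
    elif omega > 1 + tolerance:
        regime = "positive"
    else:
        regime = "neutral"
    return omega, regime


# Made-up rate estimates for a single gene comparison.
omega, regime = classify_selection(dn=0.05, ds=0.40)
```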
Constraints on genes shape long-term conservation of macro-synteny in metazoan genomes.
Lv, Jie; Havlak, Paul; Putnam, Nicholas H
2011-10-05
Many metazoan genomes conserve chromosome-scale gene linkage relationships ("macro-synteny") from the common ancestor of multicellular animal life [1, 2, 3, 4], but the biological explanation for this conservation is still unknown. Double cut and join (DCJ) is a simple, well-studied model of neutral genome evolution amenable to both simulation and mathematical analysis [5], but as we show here, it is not sufficient to explain long-term macro-synteny conservation. We examine a family of simple (one-parameter) extensions of DCJ to identify models and choices of parameters consistent with the levels of macro- and micro-synteny conservation observed among animal genomes. Our software implements a flexible strategy for incorporating various types of genomic context into the DCJ model ("DCJ-[C]"), and is available as open source software from http://github.com/putnamlab/dcj-c. A simple model of genome evolution, in which DCJ moves are allowed only if they maintain chromosomal linkage among a set of constrained genes, can simultaneously account for the level of macro-synteny conservation and for correlated conservation among multiple pairs of species. Simulations under this model indicate that a constraint on approximately 7% of metazoan genes is sufficient to constrain genome rearrangement to an average rate of 25 inversions and 1.7 translocations per million years.
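The constrained model can be pictured with a toy accept/reject loop. The sketch below is not the DCJ-[C] software and does not implement general DCJ operations: it proposes only reciprocal translocations between two chromosomes (lists of gene names) and accepts a proposal only if the constrained genes are partitioned among chromosomes exactly as before. That reading of "maintain chromosomal linkage", along with every function name here, is an assumption for illustration. Inversions are omitted because they never move a gene to another chromosome.

```python
import random


def linkage_groups(genome, constrained):
    """Partition of the constrained genes by the chromosome they sit on."""
    groups = frozenset(
        frozenset(g for g in chrom if g in constrained) for chrom in genome
    )
    return groups - {frozenset()}


def translocate(genome, i, j, cut_i, cut_j):
    """Swap the tails of chromosomes i and j at the given cut points."""
    a, b = genome[i], genome[j]
    new_genome = list(genome)
    new_genome[i] = a[:cut_i] + b[cut_j:]
    new_genome[j] = b[:cut_j] + a[cut_i:]
    return new_genome


def constrained_step(genome, constrained, rng):
    """Propose one random translocation; keep it only if linkage survives."""
    i, j = rng.sample(range(len(genome)), 2)
    cut_i = rng.randint(0, len(genome[i]))
    cut_j = rng.randint(0, len(genome[j]))
    proposal = translocate(genome, i, j, cut_i, cut_j)
    if linkage_groups(proposal, constrained) == linkage_groups(genome, constrained):
        return proposal
    return genome  # rejected: the move would split or merge a constrained group


# Toy genome with two chromosomes; genes a, b and d are constrained.
start = [["a", "b", "c"], ["d", "e", "f"]]
constrained = {"a", "b", "d"}
rng = random.Random(0)
genome = start
for _ in range(200):
    genome = constrained_step(genome, constrained, rng)
```

After any number of steps, unconstrained genes may have moved freely while a and b still share a chromosome that d is not on, which is the macro-synteny signal the abstract attributes to constraint.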
Kikuta, Hiroshi; Laplante, Mary; Navratilova, Pavla; Komisarczuk, Anna Z.; Engström, Pär G.; Fredman, David; Akalin, Altuna; Caccamo, Mario; Sealy, Ian; Howe, Kerstin; Ghislain, Julien; Pezeron, Guillaume; Mourrain, Philippe; Ellingsen, Staale; Oates, Andrew C.; Thisse, Christine; Thisse, Bernard; Foucher, Isabelle; Adolf, Birgit; Geling, Andrea; Lenhard, Boris; Becker, Thomas S.
2007-01-01
We report evidence for a mechanism for the maintenance of long-range conserved synteny across vertebrate genomes. We found the largest mammal-teleost conserved chromosomal segments to be spanned by highly conserved noncoding elements (HCNEs), their developmental regulatory target genes, and phylogenetically and functionally unrelated “bystander” genes. Bystander genes are not specifically under the control of the regulatory elements that drive the target genes and are expressed in patterns that are different from those of the target genes. Reporter insertions distal to zebrafish developmental regulatory genes pax6.1/2, rx3, id1, and fgf8 and miRNA genes mirn9-1 and mirn9-5 recapitulate the expression patterns of these genes even if located inside or beyond bystander genes, suggesting that the regulatory domain of a developmental regulatory gene can extend into and beyond adjacent transcriptional units. We termed these chromosomal segments genomic regulatory blocks (GRBs). After whole genome duplication in teleosts, GRBs, including HCNEs and target genes, were often maintained in both copies, while bystander genes were typically lost from one GRB, strongly suggesting that evolutionary pressure acts to keep the single-copy GRBs of higher vertebrates intact. We show that loss of bystander genes and other mutational events suffered by duplicated GRBs in teleost genomes permits target gene identification and HCNE/target gene assignment. These findings explain the absence of evolutionary breakpoints from large vertebrate chromosomal segments and will aid in the recognition of position effect mutations within human GRBs. PMID:17387144
Tsuchiya, Karen D.; Greally, John M.; Yi, Yajun; Noel, Kevin P.; Truong, Jean-Pierre; Disteche, Christine M.
2004-01-01
We have performed X-inactivation and sequence analyses on 350 kb of sequence from human Xp11.2, a region shown previously to contain a cluster of genes that escape X inactivation, and we compared this region with the region of conserved synteny in mouse. We identified several new transcripts from this region in human and in mouse, which defined the full extent of the domain escaping X inactivation in both species. In human, escape from X inactivation involves an uninterrupted 235-kb domain of multiple genes. Despite highly conserved gene content and order between the two species, Smcx is the only mouse gene from the conserved segment that escapes inactivation. As repetitive sequences are believed to facilitate spreading of X inactivation along the chromosome, we compared the repetitive sequence composition of this region between the two species. We found that long terminal repeats (LTRs) were decreased in the human domain of escape, but not in the majority of the conserved mouse region adjacent to Smcx in which genes were subject to X inactivation, suggesting that these repeats might be excluded from escape domains to prevent spreading of silencing. Our findings indicate that genomic context, as well as gene-specific regulatory elements, interact to determine expression of a gene from the inactive X-chromosome. PMID:15197169
Evolution and Expression of Tissue Globins in Ray-Finned Fishes
Gallagher, Michael D.
2017-01-01
The globin gene family encodes oxygen-binding hemeproteins conserved across the major branches of multicellular life. The origins and evolutionary histories of complete globin repertoires have been established for many vertebrates, but there remain major knowledge gaps for ray-finned fish. Therefore, we used phylogenetic, comparative genomic and gene expression analyses to discover and characterize canonical “non-blood” globin family members (i.e., myoglobin, cytoglobin, neuroglobin, globin-X, and globin-Y) across multiple ray-finned fish lineages, revealing novel gene duplicates (paralogs) conserved from whole genome duplication (WGD) and small-scale duplication events. Our key findings were that: (1) globin-X paralogs in teleosts have been retained from the teleost-specific WGD, (2) functional paralogs of cytoglobin, neuroglobin, and globin-X, but not myoglobin, have been conserved from the salmonid-specific WGD, (3) triplicate lineage-specific myoglobin paralogs are conserved in arowanas (Osteoglossiformes), which arose by tandem duplication and diverged under positive selection, (4) globin-Y is retained in multiple early branching fish lineages that diverged before teleosts, and (5) marked variation in tissue-specific expression of globin gene repertoires exists across ray-finned fish evolution, including several previously uncharacterized sites of expression. In this respect, our data provide an interesting link between myoglobin expression and the evolution of air breathing in teleosts. Together, our findings demonstrate great, unrecognized diversity in the repertoire and expression of non-blood globins that has arisen during ray-finned fish evolution. PMID:28173090
Characterisation of ATRX, DMRT1, DMRT7 and WT1 in the platypus (Ornithorhynchus anatinus).
Tsend-Ayush, Enkhjargal; Lim, Shu Ly; Pask, Andrew J; Hamdan, Diana Demiyah Mohd; Renfree, Marilyn B; Grützner, Frank
2009-01-01
One of the most puzzling aspects of monotreme reproductive biology is how they determine sex in the absence of the SRY gene that triggers testis development in most other mammals. Although monotremes share a XX female/XY male sex chromosome system with other mammals, their sex chromosomes show homology to the chicken Z chromosome, including the DMRT1 gene, which is a dosage-dependent sex determination gene in birds. In addition, monotremes feature an extraordinary multiple sex chromosome system. However, no sex determination gene has been identified as yet on any of the five X or five Y chromosomes and there is very little knowledge about the conservation and function of other known genes in the monotreme sex determination and differentiation pathway. We have analysed the expression pattern of four evolutionarily conserved genes that are important at different stages of sexual development in therian mammals. DMRT1 is a conserved sex-determination gene that is upregulated in the male developing gonad in vertebrates, while DMRT7 is a mammal-specific spermatogenesis gene. ATRX, a chromatin remodelling protein, lies on the therian X but there is a testis-expressed Y-copy in marsupials. However, in monotremes, the ATRX orthologue is autosomal. WT1 is an evolutionarily conserved gene essential for early gonadal formation in both sexes and later in testis development. We show that these four genes in the adult platypus have the same expression pattern as in other mammals, suggesting that they have a conserved role in sexual development independent of genomic location.
Chen, Jun; Gao, He; Zheng, Xiao-Ming; Jin, Mingna; Weng, Jian-Feng; Ma, Jin; Ren, Yulong; Zhou, Kunneng; Wang, Qi; Wang, Jie; Wang, Jiu-Lin; Zhang, Xin; Cheng, Zhijun; Wu, Chuanyin; Wang, Haiyang; Wan, Jian-Min
2015-08-01
Plant breeding relies on creation of novel allelic combinations for desired traits. Identification and utilization of beneficial alleles, rare alleles and evolutionarily conserved genes in the germplasm (referred to as 'hidden' genes) provide an effective approach to achieve this goal. Here we show that a chemically induced null mutation in an evolutionarily conserved gene, FUWA, alters multiple important agronomic traits in rice, including panicle architecture, grain shape and grain weight. FUWA encodes an NHL domain-containing protein, with preferential expression in the root meristem, shoot apical meristem and inflorescences, where it restricts excessive cell division. Sequence analysis revealed that FUWA has undergone a bottleneck effect, and become fixed in landraces and modern cultivars during domestication and breeding. We further confirm a highly conserved role of FUWA homologs in determining panicle architecture and grain development in rice, maize and sorghum through genetic transformation. Strikingly, knockdown of the FUWA transcription level by RNA interference results in an erect panicle and increased grain size in both indica and japonica genetic backgrounds. This study illustrates an approach to create new germplasm with improved agronomic traits for crop breeding by tapping into evolutionarily conserved genes. © 2015 The Authors The Plant Journal © 2015 John Wiley & Sons Ltd.
Patterns of conservation and change in honey bee developmental genes
Dearden, Peter K.; Wilson, Megan J.; Sablan, Lisha; Osborne, Peter W.; Havler, Melanie; McNaughton, Euan; Kimura, Kiyoshi; Milshina, Natalia V.; Hasselmann, Martin; Gempe, Tanja; Schioett, Morten; Brown, Susan J.; Elsik, Christine G.; Holland, Peter W.H.; Kadowaki, Tatsuhiko; Beye, Martin
2006-01-01
The current insect genome sequencing projects provide an opportunity to extend studies of the evolution of developmental genes and pathways in insects. In this paper we examine the conservation and divergence of genes and developmental processes between Drosophila and the honey bee; two holometabolous insects whose lineages separated ∼300 million years ago, by comparing the presence or absence of 308 Drosophila developmental genes in the honey bee. Through examination of the presence or absence of genes involved in conserved pathways (cell signaling, axis formation, segmentation and homeobox transcription factors), we find that the vast majority of genes are conserved. Some genes involved in these processes are, however, missing in the honey bee. We have also examined the orthology of Drosophila genes involved in processes that differ between the honey bee and Drosophila. Many of these genes are preserved in the honey bee despite the process in which they act in Drosophila being different or absent in the honey bee. Many of the missing genes in both situations appear to have arisen recently in the Drosophila lineage, have single known functions in Drosophila, and act early in developmental pathways, while those that are preserved have pleiotropic functions. An evolutionary interpretation of these data is that either genes with multiple functions in a common ancestor are more likely to be preserved in both insect lineages, or genes that are preserved throughout evolution are more likely to co-opt additional functions. PMID:17065607
Blazier, J. Chris; Ruhlman, Tracey A.; Weng, Mao-Lun; Rehman, Sumaiyah K.; Sabir, Jamal S. M.; Jansen, Robert K.
2016-01-01
Genes for the plastid-encoded RNA polymerase (PEP) persist in the plastid genomes of all photosynthetic angiosperms. However, three unrelated lineages (Annonaceae, Passifloraceae and Geraniaceae) have been identified with unusually divergent open reading frames (ORFs) in the conserved region of rpoA, the gene encoding the PEP α subunit. We used sequence-based approaches to evaluate whether these genes retain function. Both gene sequences and complete plastid genome sequences were assembled and analyzed from each of the three angiosperm families. Multiple lines of evidence indicated that the rpoA sequences are likely functional despite retaining as low as 30% nucleotide sequence identity with rpoA genes from outgroups in the same angiosperm order. The ratio of non-synonymous to synonymous substitutions indicated that these genes are under purifying selection, and bioinformatic prediction of conserved domains indicated that functional domains are preserved. One of the lineages (Pelargonium, Geraniaceae) contains species with multiple rpoA-like ORFs that show evidence of ongoing inter-paralog gene conversion. The plastid genomes containing these divergent rpoA genes have experienced extensive structural rearrangement, including large expansions of the inverted repeat. We propose that illegitimate recombination, not positive selection, has driven the divergence of rpoA. PMID:27087667
Watanabe, Mutsumi; Mochida, Keiichi; Kato, Tomohiko; Tabata, Satoshi; Yoshimoto, Naoko; Noji, Masaaki; Saito, Kazuki
2008-01-01
Ser acetyltransferase (SERAT), which catalyzes O-acetyl-Ser (OAS) formation, plays a key role in sulfur assimilation and Cys synthesis. Despite several studies on SERATs from various plant species, the in vivo function of multiple SERAT genes in plant cells remains unaddressed. Comparative genomics studies with the five genes of the SERAT gene family in Arabidopsis thaliana indicated that all three Arabidopsis SERAT subfamilies are conserved across five plant species with available genome sequences. Single and multiple knockout mutants of all Arabidopsis SERAT gene family members were analyzed. All five quadruple mutants with a single gene survived, with three mutants showing dwarfism. However, the quintuple mutant lacking all SERAT genes was embryo-lethal. Thus, all five isoforms show functional redundancy in vivo. The developmental and compartment-specific roles of each SERAT isoform were also demonstrated. Mitochondrial SERAT2;2 plays a predominant role in cellular OAS formation, while plastidic SERAT2;1 contributes less to OAS formation and subsequent Cys synthesis. Three cytosolic isoforms, SERAT1;1, SERAT3;1, and SERAT3;2, may play a major role during seed development. Thus, the evolutionally conserved SERAT gene family is essential in cellular processes, and the substrates and products of SERAT must be exchangeable between the cytosol and organelles. PMID:18776059
From genes to landscapes: conserving biodiversity at multiple scales.
Sally Duncan
2000-01-01
Biodiversity has at last become a familiar term outside of scientific circles. Ways of measuring it and mapping it are advancing and becoming more complex, but ways of deciding how to conserve it remain mixed at best, and the resources available to manage diminishing biodiversity are themselves scarce. One significant problem is that policy decisions are frequently at...
Dasgupta, Ujjaini; Dixit, Bharat L; Rusch, Melissa; Selleck, Scott; The, Inge
2007-08-01
Heparan sulfate proteoglycans play a vital role in signaling of various growth factors in both Drosophila and vertebrates. In Drosophila, mutations in the tout velu (ttv) gene, a homolog of the mammalian EXT1 tumor suppressor gene, lead to abrogation of glycosaminoglycan (GAG) biosynthesis. This impairs distribution and signaling activities of various morphogens such as Hedgehog (Hh), Wingless (Wg), and Decapentaplegic (Dpp). Mutations in members of the exostosin (EXT) gene family cause hereditary multiple exostosis in humans, resulting in bone outgrowths and tumors. In this study, we provide genetic and biochemical evidence that the human EXT1 (hEXT1) gene is conserved across species and can functionally complement the ttv mutation in Drosophila. The hEXT1 gene was able to rescue a ttv null mutant to adulthood and restore GAG biosynthesis.
Barik, Suvakanta; SarkarDas, Shabari; Singh, Archita; Gautam, Vibhav; Kumar, Pramod; Majee, Manoj; Sarkar, Ananda K
2014-01-01
Similar to the majority of the microRNAs, mature miR166s are derived from multiple members of MIR166 genes (precursors) and regulate various aspects of plant development by negatively regulating their target genes (Class III HD-ZIP). The evolutionary conservation or functional diversification of miRNA166 family members remains elusive. Here, we show the phylogenetic relationships among MIR166 precursor and mature sequences from three diverse model plant species. Despite strong conservation, some mature miR166 sequences, such as ppt-miR166m, have undergone sequence variation. Critical sequence variation in ppt-miR166m has led to functional diversification, as it targets non-HD-ZIPIII gene transcript(s). MIR166 precursor sequences have diverged in a lineage-specific manner, and both precursors and mature osa-miR166i/j are highly conserved. Interestingly, polycistronic MIR166s were present in Physcomitrella and Oryza but not in Arabidopsis. The nature of cis-regulatory motifs on the upstream promoter sequences of MIR166 genes indicates their possible contribution to the functional variation observed among miR166 species. Copyright © 2013 Elsevier Inc. All rights reserved.
Kappen, Claudia
2016-01-01
The process of patterning along the anterior-posterior axis in vertebrates is highly conserved. The function of Hox genes in the axis patterning process is particularly well documented for bone development in the vertebral column and the limbs. We here show that Hoxb6, in skeletal elements at the cervico-thoracic junction, controls multiple independent aspects of skeletal pattern, implicating discrete developmental pathways as substrates for this transcription factor. In addition, we demonstrate that Hoxb6 function is subject to modulation by genetic factors. These results establish Hox-controlled skeletal pattern as a quantitative trait modulated by gene-gene interactions, and provide evidence that distinct modifiers influence the function of conserved developmental genes in fundamental patterning processes. PMID:26800342
Bernick, David L.; Dennis, Patrick P.; Lui, Lauren M.; Lowe, Todd M.
2012-01-01
A great diversity of small, non-coding RNA (ncRNA) molecules with roles in gene regulation and RNA processing have been intensely studied in eukaryotic and bacterial model organisms, yet our knowledge of possible parallel roles for small RNAs (sRNA) in archaea is limited. We employed RNA-seq to identify novel sRNA across multiple species of the hyperthermophilic genus Pyrobaculum, known for unusual RNA gene characteristics. By comparing transcriptional data collected in parallel among four species, we were able to identify conserved RNA genes fitting into known and novel families. Among our findings, we highlight three novel cis-antisense sRNAs encoded opposite to key regulatory (ferric uptake regulator), metabolic (triose-phosphate isomerase), and core transcriptional apparatus genes (transcription factor B). We also found a large increase in the number of conserved C/D box sRNA genes over what had been previously recognized; many of these genes are encoded antisense to protein coding genes. The conserved opposition to orthologous genes across the Pyrobaculum genus suggests similarities to other cis-antisense regulatory systems. Furthermore, the genus-specific nature of these sRNAs indicates they are relatively recent, stable adaptations. PMID:22783241
Gaponova, Anna V.; Deneka, Alexander Y.; Beck, Tim N.; Liu, Hanqing; Andrianov, Gregory; Nikonova, Anna S.; Nicolas, Emmanuelle; Einarson, Margret B.; Golemis, Erica A.; Serebriiskii, Ilya G.
2017-01-01
Ovarian, head and neck, and other cancers are commonly treated with cisplatin and other DNA damaging cytotoxic agents. Altered DNA damage response (DDR) contributes to resistance of these tumors to chemotherapies, some targeted therapies, and radiation. DDR involves multiple protein complexes and signaling pathways, some of which are evolutionarily ancient and involve protein orthologs conserved from yeast to humans. To identify new regulators of cisplatin-resistance in human tumors, we integrated high throughput and curated datasets describing yeast genes that regulate sensitivity to cisplatin and/or ionizing radiation. Next, we clustered highly validated genes based on chemogenomic profiling, and then mapped orthologs of these genes in expanded genomic networks for multiple metazoans, including humans. This approach identified an enriched candidate set of genes involved in the regulation of resistance to radiation and/or cisplatin in humans. Direct functional assessment of selected candidate genes using RNA interference confirmed their activity in influencing cisplatin resistance, degree of γH2AX focus formation and ATR phosphorylation, in ovarian and head and neck cancer cell lines, suggesting impaired DDR signaling as the driving mechanism. This work enlarges the set of genes that may contribute to chemotherapy resistance and provides a new contextual resource for interpreting next generation sequencing (NGS) genomic profiling of tumors. PMID:27863405
Blakely, Collin M; Stoddard, Alexander J; Belka, George K; Dugan, Katherine D; Notarfrancesco, Kathleen L; Moody, Susan E; D'Cruz, Celina M; Chodosh, Lewis A
2006-06-15
Women who have their first child early in life have a substantially lower lifetime risk of breast cancer. The mechanism for this is unknown. Similar to humans, rats exhibit parity-induced protection against mammary tumorigenesis. To explore the basis for this phenomenon, we identified persistent pregnancy-induced changes in mammary gene expression that are tightly associated with protection against tumorigenesis in multiple inbred rat strains. Four inbred rat strains that exhibit marked differences in their intrinsic susceptibilities to carcinogen-induced mammary tumorigenesis were each shown to display significant protection against methylnitrosourea-induced mammary tumorigenesis following treatment with pregnancy levels of estradiol and progesterone. Microarray expression profiling of parous and nulliparous mammary tissue from these four strains yielded a common 70-gene signature. Examination of the genes constituting this signature implicated alterations in transforming growth factor-beta signaling, the extracellular matrix, amphiregulin expression, and the growth hormone/insulin-like growth factor I axis in pregnancy-induced alterations in breast cancer risk. Notably, related molecular changes have been associated with decreased mammographic density, which itself is strongly associated with decreased breast cancer risk. Our findings show that hormone-induced protection against mammary tumorigenesis is widely conserved among divergent rat strains and define a gene expression signature that is tightly correlated with reduced mammary tumor susceptibility as a consequence of a normal developmental event. Given the conservation of this signature, these pathways may contribute to pregnancy-induced protection against breast cancer.
2014-01-01
Background Pectins are acidic sugar-containing polysaccharides that are universally conserved components of the primary cell walls of plants and modulate both tip and diffuse cell growth. However, many of their specific functions and the evolution of the genes responsible for producing and modifying them are incompletely understood. The moss Physcomitrella patens is emerging as a powerful model system for the study of plant cell walls. To identify deeply conserved pectin-related genes in Physcomitrella, we generated phylogenetic trees for 16 pectin-related gene families using sequences from ten plant genomes and analyzed the evolutionary relationships within these families. Results Contrary to our initial hypothesis that a single ancestral gene was present for each pectin-related gene family in the common ancestor of land plants, five of the 16 gene families, including homogalacturonan galacturonosyltransferases, polygalacturonases, pectin methylesterases, homogalacturonan methyltransferases, and pectate lyase-like proteins, show evidence of multiple members in the early land plant that gave rise to the mosses and vascular plants. Seven of the gene families, the UDP-rhamnose synthases, UDP-glucuronic acid epimerases, homogalacturonan galacturonosyltransferase-like proteins, β-1,4-galactan β-1,4-galactosyltransferases, rhamnogalacturonan II xylosyltransferases, and pectin acetylesterases appear to have had a single member in the common ancestor of land plants. We detected no Physcomitrella members in the xylogalacturonan xylosyltransferase, rhamnogalacturonan I arabinosyltransferase, pectin methylesterase inhibitor, or polygalacturonase inhibitor protein families. Conclusions Several gene families related to the production and modification of pectins in plants appear to have multiple members that are conserved as far back as the common ancestor of mosses and vascular plants. 
The presence of multiple members of these families even before the divergence of other important cell wall-related genes, such as cellulose synthases, suggests a more complex role than previously suspected for pectins in the evolution of land plants. The presence of relatively small pectin-related gene families in Physcomitrella as compared to Arabidopsis makes it an attractive target for analysis of the functions of pectins in cell walls. In contrast, the absence of genes in Physcomitrella for some families suggests that certain pectin modifications, such as homogalacturonan xylosylation, arose later during land plant evolution. PMID:24666997
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yang, Shihui; Pelletier, Dale A; Lu, Tse-Yuan
Zymomonas mobilis produces near theoretical yields of ethanol and recombinant strains are candidate industrial microorganisms. To date, few studies have examined its responses to various stresses at the gene level. Hfq is a conserved bacterial member of the Sm-like family of RNA-binding proteins, coordinating a broad array of responses including multiple stress responses. In a previous study, we observed Z. mobilis ZM4 gene ZMO0347 showed higher expression under anaerobic, stationary phase compared to that of aerobic, stationary conditions. We have shown the utility of the pKNOCK suicide plasmid for mutant construction in Z. mobilis, and constructed a Gateway compatible expression plasmid for use in Z. mobilis for the first time. We have also used genetics to show Z. mobilis Hfq and S. cerevisiae Lsm proteins play important roles in resisting multiple, important industrially relevant inhibitors. The conserved nature of this global regulator offers the potential to apply insights from these fundamental studies for further industrial strain development.
Finding approximate gene clusters with Gecko 3.
Winter, Sascha; Jahn, Katharina; Wehner, Stefanie; Kuchenbecker, Leon; Marz, Manja; Stoye, Jens; Böcker, Sebastian
2016-11-16
Gene-order-based comparison of multiple genomes provides signals for functional analysis of genes and the evolutionary process of genome organization. Gene clusters are regions of co-localized genes on genomes of different species. The rapid increase in sequenced genomes necessitates bioinformatics tools for finding gene clusters in hundreds of genomes. Existing tools are often restricted to few (in many cases, only two) genomes, and often make restrictive assumptions such as short perfect conservation, conserved gene order or monophyletic gene clusters. We present Gecko 3, an open-source software for finding gene clusters in hundreds of bacterial genomes, that comes with an easy-to-use graphical user interface. The underlying gene cluster model is intuitive, can cope with low degrees of conservation as well as misannotations and is complemented by a sound statistical evaluation. To evaluate the biological benefit of Gecko 3 and to exemplify our method, we search for gene clusters in a dataset of 678 bacterial genomes using Synechocystis sp. PCC 6803 as a reference. We confirm detected gene clusters reviewing the literature and comparing them to a database of operons; we detect two novel clusters, which were confirmed by publicly available experimental RNA-Seq data. The computational analysis is carried out on a laptop computer in <40 min. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
The sequence, structure and evolutionary features of HOTAIR in mammals
2011-01-01
Background An increasing number of long noncoding RNAs (lncRNAs) have been identified recently. Different from all the others that function in cis to regulate local gene expression, the newly identified HOTAIR is located between HoxC11 and HoxC12 in the human genome and regulates HoxD expression in multiple tissues. Like the well-characterised lncRNA Xist, HOTAIR binds to polycomb proteins to methylate histones at multiple HoxD loci, but unlike Xist, many details of its structure and function, as well as the trans regulation, remain unclear. Moreover, HOTAIR is involved in the aberrant regulation of gene expression in cancer. Results To identify conserved domains in HOTAIR and study the phylogenetic distribution of this lncRNA, we searched the genomes of 10 mammalian and 3 non-mammalian vertebrates for matches to its 6 exons and the two conserved domains within the 1800 bp exon6 using Infernal. There was just one high-scoring hit for each mammal, but many low-scoring hits were found in both mammals and non-mammalian vertebrates. These hits and their flanking genes in four placental mammals and platypus were examined to determine whether HOTAIR contained elements shared by other lncRNAs. Several of the hits were within unknown transcripts or ncRNAs, many were within introns of, or antisense to, protein-coding genes, and conservation of the flanking genes was observed only between human and chimpanzee. Phylogenetic analysis revealed discrete evolutionary dynamics for orthologous sequences of HOTAIR exons. Exon1 at the 5' end and a domain in exon6 near the 3' end, which contain domains that bind to multiple proteins, have evolved faster in primates than in other mammals. Structures were predicted for exon1, two domains of exon6 and the full HOTAIR sequence. The sequence and structure of two fragments, in exon1 and the domain B of exon6 respectively, were identified to robustly occur in predicted structures of exon1, domain B of exon6 and the full HOTAIR in mammals. 
Conclusions HOTAIR exists in mammals, has poorly conserved sequences and considerably conserved structures, and has evolved faster than nearby HoxC genes. Exons of HOTAIR show distinct evolutionary features, and a 239 bp domain in the 1804 bp exon6 is especially conserved. These features, together with the absence of some exons and sequences in mouse, rat and kangaroo, suggest ab initio generation of HOTAIR in marsupials. Structure prediction identifies two fragments in the 5' end exon1 and the 3' end domain B of exon6, with sequence and structure invariably occurring in various predicted structures of exon1, the domain B of exon6 and the full HOTAIR. PMID:21496275
Tsubota, Takuya; Tomita, Shuichiro; Uchino, Keiro; Kimoto, Mai; Takiya, Shigeharu; Kajiwara, Hideyuki; Yamazaki, Toshimasa; Sezutsu, Hideki
2016-01-01
Hox genes play a pivotal role in the determination of anteroposterior axis specificity during bilaterian animal development. They do so by acting as a master control and regulating the expression of genes important for development. Recently, however, we showed that Hox genes can also function in terminally differentiated tissue of the lepidopteran Bombyx mori. In this species, Antennapedia (Antp) regulates expression of sericin-1, a major silk protein gene, in the silk gland. Here, we investigated whether Antp can regulate expression of multiple genes in this tissue. By means of proteomic, RT-PCR, and in situ hybridization analyses, we demonstrate that misexpression of Antp in the posterior silk gland induced ectopic expression of major silk protein genes such as sericin-3, fhxh4, and fhxh5. These genes are normally expressed specifically in the middle silk gland as is Antp. Therefore, the evidence strongly suggests that Antp activates these silk protein genes in the middle silk gland. The putative sericin-1 activator complex (middle silk gland-intermolt-specific complex) can bind to the upstream regions of these genes, suggesting that Antp directly activates their expression. We also found that the pattern of gene expression was well conserved between B. mori and the wild species Bombyx mandarina, indicating that the gene regulation mechanism identified here is an evolutionarily conserved mechanism and not an artifact of the domestication of B. mori. We suggest that Hox genes have a role as a master control in terminally differentiated tissues, possibly acting as a primary regulator for a range of physiological processes. PMID:26814126
Gerencsér, Ákos; Barta, Endre; Boa, Simon; Kastanis, Petros; Bösze, Zsuzsanna; Whitelaw, C Bruce A
2002-01-01
κ-casein plays an essential role in the formation, stabilisation and aggregation of milk micelles. Control of κ-casein expression reflects this essential role, although an understanding of the mechanisms involved lags behind that of the other milk protein genes. We determined the 5'-flanking sequences for the murine, rabbit and human κ-casein genes and compared them to the published ruminant sequences. The most conserved region was not the proximal promoter region but an approximately 400 bp long region centred 800 bp upstream of the TATA box. This region contained two highly conserved MGF/STAT5 sites with common spacing relative to each other. In this region, six conserved short stretches of similarity were also found which did not correspond to known transcription factor consensus sites. In contrast to the ruminant and human 5' regulatory sequences, the rabbit and murine 5'-flanking regions did not harbour any kind of repetitive elements. We generated a phylogenetic tree of the six species based on multiple alignment of the κ-casein sequences. This study identified conserved candidate transcriptional regulatory elements within the κ-casein gene promoter. PMID:11929628
A Scalable Approach for Discovering Conserved Active Subnetworks across Species
Verfaillie, Catherine M.; Hu, Wei-Shou; Myers, Chad L.
2010-01-01
Overlaying differential changes in gene expression on protein interaction networks has proven to be a useful approach to interpreting the cell's dynamic response to a changing environment. Despite successes in finding active subnetworks in the context of a single species, the idea of overlaying lists of differentially expressed genes on networks has not yet been extended to support the analysis of multiple species' interaction networks. To address this problem, we designed a scalable, cross-species network search algorithm, neXus (Network - cross(X)-species - Search), that discovers conserved, active subnetworks based on parallel differential expression studies in multiple species. Our approach leverages functional linkage networks, which provide more comprehensive coverage of functional relationships than physical interaction networks by combining heterogeneous types of genomic data. We applied our cross-species approach to identify conserved modules that are differentially active in stem cells relative to differentiated cells based on parallel gene expression studies and functional linkage networks from mouse and human. We find hundreds of conserved active subnetworks enriched for stem cell-associated functions such as cell cycle, DNA repair, and chromatin modification processes. Using a variation of this approach, we also find a number of species-specific networks, which likely reflect mechanisms of stem cell function that have diverged between mouse and human. We assess the statistical significance of the subnetworks by comparing them with subnetworks discovered on random permutations of the differential expression data. We also describe several case examples that illustrate the utility of comparative analysis of active subnetworks. PMID:21170309
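The permutation-based significance assessment mentioned in this abstract can be illustrated with a minimal sketch. It is not the neXus implementation: the abstract describes rediscovering subnetworks on permuted data, whereas this simplified version only re-scores one fixed candidate module after shuffling the expression scores across genes. The function name, the use of a mean score as the test statistic, and the toy inputs are all assumptions made for illustration.

```python
import random


def permutation_p_value(scores, module, n_perm=1000, seed=0):
    """Empirical p-value for the mean activity score of a fixed module.

    scores: dict mapping gene -> differential-expression score (hypothetical input)
    module: iterable of genes forming the candidate subnetwork
    """
    rng = random.Random(seed)
    genes = list(scores)
    values = [scores[g] for g in genes]
    members = [g for g in module if g in scores]
    observed = sum(scores[g] for g in members) / len(members)

    hits = 0
    for _ in range(n_perm):
        # Shuffle the scores across genes, keeping the module membership fixed.
        shuffled = values[:]
        rng.shuffle(shuffled)
        permuted = dict(zip(genes, shuffled))
        null_stat = sum(permuted[g] for g in members) / len(members)
        if null_stat >= observed:
            hits += 1
    # Add-one correction so the estimate is never exactly zero.
    return (hits + 1) / (n_perm + 1)
```

A small p-value here only says that the module's mean score is unusually high compared with randomly reassigned scores; a faithful reproduction of the published procedure would rerun the subnetwork search on each permuted dataset.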
Jung, Sook; Main, Dorrie; Staton, Margaret; Cho, Ilhyung; Zhebentyayeva, Tatyana; Arús, Pere; Abbott, Albert
2006-01-01
Background Due to the lack of availability of large genomic sequences for peach or other Prunus species, the degree of synteny conservation between the Prunus species and Arabidopsis has not been systematically assessed. Using the recently available peach EST sequences that are anchored to Prunus genetic maps and to the peach physical map, we analyzed the extent of conserved synteny between the Prunus and the Arabidopsis genomes. The reconstructed pseudo-ancestral Arabidopsis genome, which existed prior to the proposed recent polyploidy event, was also utilized in our analysis to further elucidate the evolutionary relationship. Results We analyzed the synteny conservation between the Prunus and the Arabidopsis genomes by comparing 475 peach ESTs that are anchored to Prunus genetic maps and their Arabidopsis homologs detected by sequence similarity. Microsyntenic regions were detected between all five Arabidopsis chromosomes and seven of the eight linkage groups of the Prunus reference map. An additional 1097 peach ESTs that are anchored to 431 BAC contigs of the peach physical map and their Arabidopsis homologs were also analyzed. Microsyntenic regions were detected in 77 BAC contigs. The syntenic regions from both data sets were short and contained only a couple of conserved gene pairs. The synteny between peach and Arabidopsis was fragmentary; all the Prunus linkage groups containing syntenic regions matched to more than two different Arabidopsis chromosomes, and most BAC contigs with multiple conserved syntenic regions corresponded to multiple Arabidopsis chromosomes. Using the same peach EST datasets and their Arabidopsis homologs, we also detected conserved syntenic regions in the pseudo-ancestral Arabidopsis genome. In many cases, the gene order and content of peach regions was more conserved in the ancestral genome than in the present Arabidopsis region. Statistical significance of each syntenic group was calculated using a simulated Arabidopsis genome.
Conclusion We report here the result of the first extensive analysis of the conserved microsynteny using DNA sequences across the Prunus genome and their Arabidopsis homologs. Our study also illustrates that both the ancestral and present Arabidopsis genomes can provide a useful resource for marker saturation and candidate gene search, as well as elucidating evolutionary relationships between species. PMID:16615871
Liu, Junli; Liu, Jianjian; Chen, Aiqun; Ji, Minjie; Chen, Jiadong; Yang, Xiaofeng; Gu, Mian; Qu, Hongye; Xu, Guohua
2016-10-01
In plants, the plasma membrane H(+)-ATPase (HA) is considered to play a crucial role in regulating plant growth and responding to environmental stresses. Multiple paralogous genes encoding different isozymes of HA have been identified and characterized in several model plants, while limited information on the HA gene family is available to date for tomato. Here, we describe the molecular and expression features of eight HA-encoding genes (SlHA1-8) from tomato. All these genes are interrupted by multiple introns with conserved positions. SlHA1, 2, and 4 were widely expressed in all tissues, while SlHA5, 6, and 7 were almost only expressed in flowers. SlHA8, the transcripts of which were barely detectable under normal or nutrient-/salt-stress growth conditions, was strongly activated in arbuscular mycorrhizal (AM) fungal-colonized roots. Extreme lack of SlHA8 expression in M161, a mutant defective in AM fungal colonization, provided genetic evidence towards the dependence of its expression on AM symbiosis. A 1521-bp SlHA8 promoter could direct the GUS reporter expression specifically in colonized cells of transgenic tobacco, soybean, and rice mycorrhizal roots. Promoter deletion assay revealed a 223-bp promoter fragment of SlHA8 containing a variant of AM-specific cis-element MYCS (vMYCS) sufficient to confer the AM-induced activity. Targeted deletion of this motif in the corresponding promoter region causes complete abolishment of GUS staining in mycorrhizal roots. Together, these results lend cogent evidence towards the evolutionary conservation of a potential regulatory mechanism mediating the activation of AM-responsive HA genes in diverse mycorrhizal plant species.
Tsubota, Takuya; Tomita, Shuichiro; Uchino, Keiro; Kimoto, Mai; Takiya, Shigeharu; Kajiwara, Hideyuki; Yamazaki, Toshimasa; Sezutsu, Hideki
2016-03-25
Hox genes play a pivotal role in the determination of anteroposterior axis specificity during bilaterian animal development. They do so by acting as a master control and regulating the expression of genes important for development. Recently, however, we showed that Hox genes can also function in terminally differentiated tissue of the lepidopteran Bombyx mori. In this species, Antennapedia (Antp) regulates expression of sericin-1, a major silk protein gene, in the silk gland. Here, we investigated whether Antp can regulate expression of multiple genes in this tissue. By means of proteomic, RT-PCR, and in situ hybridization analyses, we demonstrate that misexpression of Antp in the posterior silk gland induced ectopic expression of major silk protein genes such as sericin-3, fhxh4, and fhxh5. These genes are normally expressed specifically in the middle silk gland as is Antp. Therefore, the evidence strongly suggests that Antp activates these silk protein genes in the middle silk gland. The putative sericin-1 activator complex (middle silk gland-intermolt-specific complex) can bind to the upstream regions of these genes, suggesting that Antp directly activates their expression. We also found that the pattern of gene expression was well conserved between B. mori and the wild species Bombyx mandarina, indicating that the gene regulation mechanism identified here is an evolutionarily conserved mechanism and not an artifact of the domestication of B. mori. We suggest that Hox genes have a role as a master control in terminally differentiated tissues, possibly acting as a primary regulator for a range of physiological processes. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Wuttke, Daniel; Connor, Richard; Vora, Chintan; Craig, Thomas; Li, Yang; Wood, Shona; Vasieva, Olga; Shmookler Reis, Robert; Tang, Fusheng; de Magalhães, João Pedro
2012-01-01
Dietary restriction (DR), limiting nutrient intake from diet without causing malnutrition, delays the aging process and extends lifespan in multiple organisms. The conserved life-extending effect of DR suggests the involvement of fundamental mechanisms, although these remain a subject of debate. To help decipher the life-extending mechanisms of DR, we first compiled a list of genes that if genetically altered disrupt or prevent the life-extending effects of DR. We called these DR–essential genes and identified more than 100 in model organisms such as yeast, worms, flies, and mice. In order for other researchers to benefit from this first curated list of genes essential for DR, we established an online database called GenDR (http://genomics.senescence.info/diet/). To dissect the interactions of DR–essential genes and discover the underlying lifespan-extending mechanisms, we then used a variety of network and systems biology approaches to analyze the gene network of DR. We show that DR–essential genes are more conserved at the molecular level and have more molecular interactions than expected by chance. Furthermore, we employed a guilt-by-association method to predict novel DR–essential genes. In budding yeast, we predicted nine genes related to vacuolar functions; we show experimentally that mutations deleting eight of those genes prevent the life-extending effects of DR. Three of these mutants (OPT2, FRE6, and RCR2) had extended lifespan under ad libitum, indicating that the lack of further longevity under DR is not caused by a general compromise of fitness. These results demonstrate how network analyses of DR using GenDR can be used to make phenotypically relevant predictions. Moreover, gene-regulatory circuits reveal that the DR–induced transcriptional signature in yeast involves nutrient-sensing, stress responses and meiotic transcription factors. 
Finally, comparing the influence of gene expression changes during DR on the interactomes of multiple organisms led us to suggest that DR commonly suppresses translation, while stimulating an ancient reproduction-related process. PMID:22912585
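The guilt-by-association idea used in this abstract to predict novel DR-essential genes can be sketched as a simple neighbor-voting score on an interaction network. This is a generic illustration, not the authors' method: the scoring rule (fraction of a gene's interaction partners that are already known seed genes), the function name, and the toy gene labels are assumptions.

```python
def guilt_by_association(network, known, top_n=None):
    """Rank genes outside the seed set by how many of their neighbors are seeds.

    network: dict mapping gene -> list of interacting genes (hypothetical input)
    known:   iterable of seed genes, e.g. a curated list of essential genes
    """
    known = set(known)
    scores = {}
    for gene, neighbors in network.items():
        # Skip seeds themselves and genes with no recorded interactions.
        if gene in known or not neighbors:
            continue
        scores[gene] = len(known & set(neighbors)) / len(neighbors)
    # Highest score first; ties are broken alphabetically for reproducibility.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return ranked[:top_n]
```

Candidates with a high score interact mostly with known seed genes and would be natural targets for experimental follow-up; real analyses typically add edge weights and a significance estimate on top of this ranking.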
Akkuratov, Evgeny E; Walters, Lorraine; Saha-Mandal, Arnab; Khandekar, Sushant; Crawford, Erin; Zirbel, Craig L; Leisner, Scott; Prakash, Ashwin; Fedorova, Larisa; Fedorov, Alexei
2014-09-10
Orthologous introns have identical positions relative to the coding sequence in orthologous genes of different species. By analyzing the complete genomes of five plants we generated a database of 40,512 orthologous intron groups of dicotyledonous plants, 28,519 orthologous intron groups of angiosperms, and 15,726 of land plants (moss and angiosperms). Multiple sequence alignments of each orthologous intron group were obtained using the Mafft algorithm. The number of conserved regions in plant introns appeared to be hundreds of times fewer than that in mammals or vertebrates. Approximately three quarters of conserved intronic regions among angiosperms and dicots, in particular, correspond to alternatively-spliced exonic sequences. We registered only a handful of conserved intronic ncRNAs of flowering plants. However, the most evolutionarily conserved intronic region, which is ubiquitous for all plants examined in this study, including moss, possessed multiple structural features of tRNAs, which caused us to classify it as a putative tRNA-like ncRNA. Intronic sequences encoding tRNA-like structures are not unique to plants. Bioinformatics examination of the presence of tRNA inside introns revealed an unusually long-term association of four glycine tRNAs inside the Vac14 gene of fish, amniotes, and mammals. Copyright © 2014 Elsevier B.V. All rights reserved.
Geuverink, E; Beukeboom, L W
2014-01-01
Sex determination in insects is characterized by a gene cascade that is conserved at the bottom but contains diverse primary signals at the top. The bottom master switch gene doublesex is found in all insects. Its upstream regulator transformer is present in the orders Hymenoptera, Coleoptera and Diptera, but has thus far not been found in Lepidoptera and in the basal lineages of Diptera. transformer is presumed to be ancestral to the holometabolous insects based on its shared domains and conserved features of autoregulation and sex-specific splicing. We interpret that its absence in basal lineages of Diptera and its order-specific conserved domains indicate multiple independent losses or recruitments into the sex determination cascade. Duplications of transformer are found in derived families within the Hymenoptera, characterized by their complementary sex determination mechanism. As duplications are not found in any other insect order, they appear linked to the haplodiploid reproduction of the Hymenoptera. Further phylogenetic analyses combined with functional studies are needed to understand the evolutionary history of the transformer gene among insects. © 2013 S. Karger AG, Basel.
Fauteux, François; Strömvik, Martina V
2009-01-01
Background Accurate computational identification of cis-regulatory motifs is difficult, particularly in eukaryotic promoters, which typically contain multiple short and degenerate DNA sequences bound by several interacting factors. Enrichment in combinations of rare motifs in the promoter sequence of functionally or evolutionarily related genes among several species is an indicator of conserved transcriptional regulatory mechanisms. This provides a basis for the computational identification of cis-regulatory motifs. Results We have used a discriminative seeding DNA motif discovery algorithm for an in-depth analysis of 54 seed storage protein (SSP) gene promoters from three plant families, namely Brassicaceae (mustards), Fabaceae (legumes) and Poaceae (grasses) using backgrounds based on complete sets of promoters from a representative species in each family, namely Arabidopsis (Arabidopsis thaliana (L.) Heynh.), soybean (Glycine max (L.) Merr.) and rice (Oryza sativa L.) respectively. We have identified three conserved motifs (two RY-like and one ACGT-like) in Brassicaceae and Fabaceae SSP gene promoters that are similar to experimentally characterized seed-specific cis-regulatory elements. Fabaceae SSP gene promoter sequences are also enriched in a novel, seed-specific E2Fb-like motif. Conserved motifs identified in Poaceae SSP gene promoters include a GCN4-like motif, two prolamin-box-like motifs and an Skn-1-like motif. Evidence of the presence of a variant of the TATA-box is found in the SSP gene promoters from the three plant families. Motifs discovered in SSP gene promoters were used to score whole-genome sets of promoters from Arabidopsis, soybean and rice. The highest-scoring promoters are associated with genes coding for different subunits or precursors of seed storage proteins. Conclusion Seed storage protein gene promoter motifs are conserved in diverse species, and different plant families are characterized by a distinct combination of conserved motifs. 
The majority of discovered motifs match experimentally characterized cis-regulatory elements. These results provide a good starting point for further experimental analysis of plant seed-specific promoters and our methodology can be used to unravel more transcriptional regulatory mechanisms in plants and other eukaryotes. PMID:19843335
Luo, Ya; Zhao, Santao; Li, Jiahui; Li, Peizheng
2017-01-01
transformer (tra) is a switch gene of sex determination in many insects, particularly in Dipterans. However, the sex determination pathway in Bactrocera cucurbitae (Coquillett), a very destructive pest on earth, remains largely uncharacterized. In this study, we have isolated and characterized one female-specific and two male-specific transcripts of the tra gene (Bcutra) of B. cucurbitae. The genomic structure of Bcutra has been determined and the presence of multiple conserved Transformer (TRA)/TRA-2 binding sites in Bcutra has been found. BcuTRA is highly conserved with respect to its homologues in other tephritid fruit flies. Gene expression analysis of Bcutra at different developmental stages demonstrates that the female transcript of Bcutra appears earlier than the male counterparts, indicating that the maternal TRA is inherited in eggs and might play a role in the regulation of TRA expression. The conservation of protein sequence and sex-specific splicing of Bcutra and its expression patterns during development suggest that Bcutra is probably the master gene of sex determination of B. cucurbitae. Isolation of Bcutra will facilitate the development of a genetic sexing strain for its biological control. PMID:28931159
Luo, Ya; Zhao, Santao; Li, Jiahui; Li, Peizheng; Yan, Rihui
2017-01-01
transformer (tra) is a switch gene of sex determination in many insects, particularly in Dipterans. However, the sex determination pathway in Bactrocera cucurbitae (Coquillett), a very destructive pest on earth, remains largely uncharacterized. In this study, we have isolated and characterized one female-specific and two male-specific transcripts of the tra gene (Bcutra) of B. cucurbitae. The genomic structure of Bcutra has been determined and the presence of multiple conserved Transformer (TRA)/TRA-2 binding sites in Bcutra has been found. BcuTRA is highly conserved with respect to its homologues in other tephritid fruit flies. Gene expression analysis of Bcutra at different developmental stages demonstrates that the female transcript of Bcutra appears earlier than the male counterparts, indicating that the maternal TRA is inherited in eggs and might play a role in the regulation of TRA expression. The conservation of protein sequence and sex-specific splicing of Bcutra and its expression patterns during development suggest that Bcutra is probably the master gene of sex determination of B. cucurbitae. Isolation of Bcutra will facilitate the development of a genetic sexing strain for its biological control. © The Authors 2017. Published by Oxford University Press on behalf of Entomological Society of America.
Westholm, Jakub O.; Miura, Pedro; Olson, Sara; Shenker, Sol; Joseph, Brian; Sanfilippo, Piero; Celniker, Susan E.; Graveley, Brenton R.; Lai, Eric C.
2014-01-01
Circularization was recently recognized to broadly expand transcriptome complexity. Here, we exploit massive Drosophila total RNA-sequencing data, >5 billion paired-end reads from >100 libraries covering diverse developmental stages, tissues and cultured cells, to rigorously annotate >2500 fruitfly circular RNAs. These mostly derive from back-splicing of protein-coding genes and lack poly(A) tails, and circularization of hundreds of genes is conserved across multiple Drosophila species. We elucidate structural and sequence properties of Drosophila circular RNAs, which exhibit commonalities and distinctions from mammalian circles. Notably, Drosophila circular RNAs harbor >1000 well-conserved canonical miRNA seed matches, especially within coding regions, and coding conserved miRNA sites reside preferentially within circularized exons. Finally, we analyze the developmental and tissue specificity of circular RNAs, and note their preferred derivation from neural genes and enhanced accumulation in neural tissues. Interestingly, circular isoforms increase dramatically relative to linear isoforms during CNS aging, and constitute a novel aging biomarker. PMID:25544350
Westholm, Jakub O.; Miura, Pedro; Olson, Sara; ...
2014-11-26
Circularization was recently recognized to broadly expand transcriptome complexity. Here, we exploit massive Drosophila total RNA-sequencing data, >5 billion paired-end reads from >100 libraries covering diverse developmental stages, tissues, and cultured cells, to rigorously annotate >2,500 fruit fly circular RNAs. These mostly derive from back-splicing of protein-coding genes and lack poly(A) tails, and the circularization of hundreds of genes is conserved across multiple Drosophila species. We elucidate structural and sequence properties of Drosophila circular RNAs, which exhibit commonalities and distinctions from mammalian circles. Notably, Drosophila circular RNAs harbor >1,000 well-conserved canonical miRNA seed matches, especially within coding regions, and coding conserved miRNA sites reside preferentially within circularized exons. Finally, we analyze the developmental and tissue specificity of circular RNAs and note their preferred derivation from neural genes and enhanced accumulation in neural tissues. Interestingly, circular isoforms increase substantially relative to linear isoforms during CNS aging and constitute an aging biomarker.
Sancho, Ana; Duran, Jordi; García-España, Antonio; Mauvezin, Caroline; Alemu, Endalkachew A; Lamark, Trond; Macias, Maria J; DeSalle, Rob; Royo, Miriam; Sala, David; Chicote, Javier U; Palacín, Manuel; Johansen, Terje; Zorzano, Antonio
2012-01-01
Human DOR/TP53INP2 displays a unique bifunctional role as a modulator of autophagy and gene transcription. However, the domains or regions of DOR that participate in those functions have not been identified. Here we have performed structure/function analyses of DOR guided by identification of conserved regions in the DOR gene family by phylogenetic reconstructions. We show that DOR is present in metazoan species. Invertebrates harbor only one gene, DOR/Tp53inp2, and in the common ancestor of vertebrates Tp53inp1 may have arisen by gene duplication. In keeping with these data, we show that human TP53INP1 regulates autophagy and that different DOR/TP53INP2 and TP53INP1 proteins display transcriptional activity. The use of molecular evolutionary information has been instrumental to determine the regions that participate in DOR functions. DOR and TP53INP1 proteins share two highly conserved regions (region 1, aa residues 28-42; region 2, 66-112 in human DOR). Mutation of conserved hydrophobic residues in region 1 of DOR (that are part of a nuclear export signal, NES) reduces transcriptional activity, and blocks nuclear exit and autophagic activity under autophagy-activated conditions. We also identify a functional and conserved LC3-interacting motif (LIR) in region 1 of DOR and TP53INP1 proteins. Mutation of conserved acidic residues in region 2 of DOR reduces transcriptional activity, impairs nuclear exit in response to autophagy activation, and disrupts autophagy. Taken together, our data reveal DOR and TP53INP1 as dual regulators of transcription and autophagy, and identify two conserved regions in the DOR family that concentrate multiple functions crucial for autophagy and transcription.
Sancho, Ana; Duran, Jordi; García-España, Antonio; Mauvezin, Caroline; Alemu, Endalkachew A.; Lamark, Trond; Macias, Maria J.; DeSalle, Rob; Royo, Miriam; Sala, David; Chicote, Javier U.; Palacín, Manuel; Johansen, Terje; Zorzano, Antonio
2012-01-01
Human DOR/TP53INP2 displays a unique bifunctional role as a modulator of autophagy and gene transcription. However, the domains or regions of DOR that participate in those functions have not been identified. Here we have performed structure/function analyses of DOR guided by identification of conserved regions in the DOR gene family by phylogenetic reconstructions. We show that DOR is present in metazoan species. Invertebrates harbor only one gene, DOR/Tp53inp2, and in the common ancestor of vertebrates Tp53inp1 may have arisen by gene duplication. In keeping with these data, we show that human TP53INP1 regulates autophagy and that different DOR/TP53INP2 and TP53INP1 proteins display transcriptional activity. The use of molecular evolutionary information has been instrumental to determine the regions that participate in DOR functions. DOR and TP53INP1 proteins share two highly conserved regions (region 1, aa residues 28–42; region 2, 66–112 in human DOR). Mutation of conserved hydrophobic residues in region 1 of DOR (that are part of a nuclear export signal, NES) reduces transcriptional activity, and blocks nuclear exit and autophagic activity under autophagy-activated conditions. We also identify a functional and conserved LC3-interacting motif (LIR) in region 1 of DOR and TP53INP1 proteins. Mutation of conserved acidic residues in region 2 of DOR reduces transcriptional activity, impairs nuclear exit in response to autophagy activation, and disrupts autophagy. Taken together, our data reveal DOR and TP53INP1 as dual regulators of transcription and autophagy, and identify two conserved regions in the DOR family that concentrate multiple functions crucial for autophagy and transcription. PMID:22470510
Díaz-Castillo, Carlos; Xia, Xiao-Qin; Ranz, José M.
2012-01-01
Why gene order is conserved over long evolutionary timespans remains elusive. A common interpretation is that gene order conservation might reflect the existence of functional constraints that are important for organismal performance. Alteration of the integrity of genomic regions, and therefore of those constraints, would result in detrimental effects. This notion seems especially plausible in those genomes that can easily accommodate gene reshuffling via chromosomal inversions since genomic regions free of constraints are likely to have been disrupted in one or more lineages. Nevertheless, no empirical test of this notion has been performed. Here, we disrupt one of the largest conserved genomic regions of the Drosophila genome by chromosome engineering and examine the phenotypic consequences derived from such disruption. The targeted region exhibits multiple patterns of functional enrichment suggestive of the presence of constraints. The carriers of the disrupted collinear block show no defects in their viability, fertility, and parameters of general homeostasis, although their odorant perception is altered. This change in odorant perception does not correlate with modifications of the level of expression and sex bias of the genes within the genomic region disrupted. Our results indicate that even in highly rearranged genomes, like those of Diptera, unusually high levels of gene order conservation cannot be systematically attributed to functional constraints, which raises the possibility that other mechanisms can be in place and therefore the underpinnings of the maintenance of gene organization might be more diverse than previously thought. PMID:22319453
Mapping cis-Regulatory Domains in the Human Genome Using Multi-Species Conservation of Synteny
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ahituv, Nadav; Prabhakar, Shyam; Poulin, Francis
2005-06-13
Our inability to associate distant regulatory elements with the genes that they regulate has largely precluded their examination for sequence alterations contributing to human disease. One major obstacle is the large genomic space surrounding targeted genes in which such elements could potentially reside. In order to delineate gene regulatory boundaries we used whole-genome human-mouse-chicken (HMC) and human-mouse-frog (HMF) multiple alignments to compile conserved blocks of synteny (CBS), under the hypothesis that these blocks have been kept intact throughout evolution at least in part by the requirement of regulatory elements to stay linked to the genes that they regulate. A total of 2,116 and 1,942 CBS >200 kb were assembled for HMC and HMF, respectively, encompassing 1.53 and 0.86 Gb of human sequence. To support the existence of complex long-range regulatory domains within these CBS we analyzed the prevalence and distribution of chromosomal aberrations leading to position effects (disruption of a gene's regulatory environment), observing a clear bias not only for mapping onto CBS but also for longer CBS size. Our results provide a genome-wide data set characterizing the regulatory domains of genes and the conserved regulatory elements within them.
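The idea of compiling conserved blocks of synteny can be sketched in a few lines. The example below is a simplified, gene-anchor version under stated assumptions: it takes ortholog anchors on one human chromosome, already sorted by human position, and chains neighbours that stay on the same chromosome and remain adjacent in rank in the other species. The function name, the anchor tuple layout, and the `max_step` parameter are hypothetical; the study itself worked from whole-genome multiple alignments, which this sketch does not reproduce. Only the 200 kb size cutoff is taken from the abstract.

```python
def synteny_blocks(anchors, max_step=1, min_span=200_000):
    """Chain collinear anchors into blocks and keep the long ones.

    anchors:  list of (human_pos, other_chrom, other_rank) tuples for one
              human chromosome, sorted by human_pos (assumed input format).
    max_step: largest allowed change in other_rank between neighbours;
              the absolute value is used, so inverted runs still chain.
    min_span: minimum block length in human coordinates (bp).
    Returns a list of (start, end) human coordinates.
    """
    blocks = []
    start = prev = None
    for anchor in anchors:
        pos, chrom, rank = anchor
        if (prev is not None and chrom == prev[1]
                and abs(rank - prev[2]) <= max_step):
            prev = anchor  # still collinear: extend the current block
            continue
        if prev is not None:
            blocks.append((start, prev[0]))  # close the previous block
        start, prev = pos, anchor  # open a new block at this anchor
    if prev is not None:
        blocks.append((start, prev[0]))
    return [(s, e) for s, e in blocks if e - s >= min_span]


# Made-up anchors: two long blocks and one short block that is filtered out.
example = [
    (100_000, "chr2", 10), (250_000, "chr2", 11), (400_000, "chr2", 12),
    (500_000, "chr7", 3), (550_000, "chr7", 4), (900_000, "chr7", 5),
    (1_000_000, "chr1", 50), (1_050_000, "chr1", 49),
]
print(synteny_blocks(example))
```

Requiring the block to hold in a third species, as in the HMC and HMF comparisons, would amount to intersecting the blocks obtained from each pairwise comparison.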
Of mice and genes: evolution of vertebrate brain development
NASA Technical Reports Server (NTRS)
Fritzsch, B.
1998-01-01
In this review the current understanding of genetic and molecular evolution of development, in particular the formation of the major axis of bilateral animals, is critically evaluated, and the early pattern formation in the hindbrain is related as much as possible to these processes. On the genetic level it is proposed that the exuberant multiplication of regulatory genes compared to that of structural genes relates to the increased flexibility of early vertebrate development. In comparisons to fruit flies, many conserved genes are found to be expressed very differently, while many others seem to reflect a comparable pattern and thus suggest a conservation of function. Even genes with a largely conserved pattern of expression may change the level at which they are expressed and the mechanisms by which they are regulated in their expression. Evolution and development of hindbrain motoneurons is reviewed, and it is concluded that both comparative data as well as more recent experimental data suggest a limited importance for the rhombomeres. Clearly, many cell fate-specifying processes work below the level of rhombomeres or in the absence of rhombomeres. It is suggested that more comparative developmental data are needed to establish firmly the relationship between homeobox genes and rhombomere specification in vertebrates other than a few model species.
PSAT: A web tool to compare genomic neighborhoods of multiple prokaryotic genomes
Fong, Christine; Rohmer, Laurence; Radey, Matthew; Wasnick, Michael; Brittnacher, Mitchell J
2008-01-01
Background The conservation of gene order among prokaryotic genomes can provide valuable insight into gene function, protein interactions, or events by which genomes have evolved. Although some tools are available for visualizing and comparing the order of genes between genomes of study, few support an efficient and organized analysis between large numbers of genomes. The Prokaryotic Sequence homology Analysis Tool (PSAT) is a web tool for comparing gene neighborhoods among multiple prokaryotic genomes. Results PSAT utilizes a database that is preloaded with gene annotation, BLAST hit results, and gene-clustering scores designed to help identify regions of conserved gene order. Researchers use the PSAT web interface to find a gene of interest in a reference genome and efficiently retrieve the sequence homologs found in other bacterial genomes. The tool generates a graphic of the genomic neighborhood surrounding the selected gene and the corresponding regions for its homologs in each comparison genome. Homologs in each region are color coded to assist users with analyzing gene order among various genomes. In contrast to common comparative analysis methods that filter sequence homolog data based on alignment score cutoffs, PSAT leverages gene context information for homologs, including those with weak alignment scores, enabling a more sensitive analysis. Features for constraining or ordering results are designed to help researchers browse results from large numbers of comparison genomes in an organized manner. PSAT has been demonstrated to be useful for helping to identify gene orthologs and potential functional gene clusters, and detecting genome modifications that may result in loss of function. Conclusion PSAT allows researchers to investigate the order of genes within local genomic neighborhoods of multiple genomes. 
A PSAT web server for public use is available for performing analyses on a growing set of reference genomes through any web browser with no client side software setup or installation required. Source code is freely available to researchers interested in setting up a local version of PSAT for analysis of genomes not available through the public server. Access to the public web server and instructions for obtaining source code can be found at . PMID:18366802
Karanja, Bernard Kinuthia; Fan, Lianxue; Xu, Liang; Wang, Yan; Zhu, Xianwen; Tang, Mingjia; Wang, Ronghua; Zhang, Fei; Muleke, Everlyne M'mbone; Liu, Liwang
2017-11-01
The radish WRKY gene family was identified genome-wide and found to play critical roles in response to multiple abiotic stresses. WRKY proteins form one of the largest transcription factor (TF) families and are associated with multiple biological activities for plant survival, including the control of response mechanisms against abiotic stresses such as heat, salinity, and heavy metals. Radish is an important root vegetable crop and therefore characterization and expression pattern investigation of WRKY transcription factors in radish is imperative. In the present study, 126 putative WRKY genes were retrieved from the radish genome database. Protein sequence and annotation scrutiny confirmed that RsWRKY proteins possessed highly conserved domains and a zinc finger motif. Based on phylogenetic analysis results, RsWRKY candidate genes were divided into three groups (Group I, II and III) with 31, 74, and 20 members, respectively. Additionally, gene structure analysis revealed that intron-exon patterns of the WRKY genes are highly conserved in radish. Linkage map analysis indicated that RsWRKY genes were distributed with varying densities over nine linkage groups. Further, RT-qPCR analysis illustrated the significant variation of 36 RsWRKY genes under one or more abiotic stress treatments, implying that they might be stress-responsive genes. In total, 126 WRKY TFs were identified from the R. sativus genome, of which 35 showed abiotic stress-induced expression patterns. These results provide a genome-wide characterization of RsWRKY TFs and a baseline for further functional dissection and molecular evolution investigation, specifically for improving abiotic stress resistance with the ultimate goal of increasing yield and quality of radish.
Evolutionarily conserved ELOVL4 gene expression in the vertebrate retina.
Lagali, Pamela S; Liu, Jiafan; Ambasudhan, Rajesh; Kakuk, Laura E; Bernstein, Steven L; Seigel, Gail M; Wong, Paul W; Ayyagari, Radha
2003-07-01
The gene elongation of very long chain fatty acids-4 (ELOVL4) has been shown to underlie phenotypically heterogeneous forms of autosomal dominant macular degeneration. In this study, the extent of evolutionary conservation and the existence and localization of retinal expression of this gene was investigated across a wide variety of species. Southern blot analysis of genomic DNA and bioinformatic analysis using the human ELOVL4 cDNA and protein sequences, respectively, were performed to identify species in which ELOVL4 orthologues and/or homologues are present. Retinal RNA and protein extracts derived from different species were assessed by Northern hybridization and immunoblot techniques to assess evolutionary conservation of gene expression. Immunohistochemical analysis of tissue sections prepared from various mammalian retinas was performed to determine the distribution of ELOVL4 and homologous proteins within specific retinal cell layers. The existence of ELOVL4 sequence orthologues and homologues was confirmed by both Southern blot analysis and in silico searches of protein sequence databases. Phylogenetic analysis places ELOVL4 among a large family of known and putative fatty acid elongase proteins. Northern blot analysis revealed the presence of multiple transcripts corresponding to ELOVL4 homologues expressed in the retina of several different mammalian species. Conserved proteins were also detected among retinal extracts of different mammals and were found to localize predominantly to the photoreceptor cell layer within retinal tissue preparations. The ELOVL4 gene is highly conserved throughout evolution and is expressed in the photoreceptor cells of the retina in a variety of different species, which suggests that it plays a critical role in retinal cell biology.
Lee, I-M; Bottner-Parker, K D; Zhao, Y; Bertaccini, A; Davis, R E
2012-09-01
The pigeon pea witches'-broom phytoplasma group (16SrIX) comprises diverse strains that cause numerous diseases in leguminous trees and herbaceous crops, vegetables, a fruit, a nut tree and a forest tree. At least 14 strains have been reported worldwide. Comparative phylogenetic analyses of the highly conserved 16S rRNA gene and the moderately conserved rplV (rpl22)-rpsC (rps3) and secY genes indicated that the 16SrIX group consists of at least six distinct genetic lineages. Some of these lineages cannot be readily differentiated based on analysis of 16S rRNA gene sequences alone. The relative genetic distances among these closely related lineages were better assessed by including more variable genes [e.g. ribosomal protein (rp) and secY genes]. The present study demonstrated that virtual RFLP analyses using rp and secY gene sequences allowed unambiguous identification of such lineages. A coding system is proposed to designate each distinct rp and secY subgroup in the 16SrIX group.
Comparative and evolutionary analysis of the 14-3-3 family genes in eleven fishes.
Cao, Jun; Tan, Xiaona
2018-07-01
14-3-3 proteins are highly conserved acidic proteins that are distributed over a wide variety of organisms and are involved in multiple cellular processes. However, a comparative and evolutionary analysis of this gene family has been unavailable for fish species. In this study, we identified 101 putative 14-3-3 genes in 11 fish species and divided them into 5 groups via phylogenetic analysis. Synteny analysis implied conserved and dynamic evolution characteristics near the 14-3-3 gene loci in some vertebrates. We also found that some recombination events have accelerated the evolution of this gene family. Moreover, a positive selection site was also identified, and mutation of this site could reduce 14-3-3 stability. Divergent expression profiles of the zebrafish 14-3-3 genes were further investigated under organophosphorus stress, suggesting that they may be involved in different osmoregulation and immune responses. The results will serve as a foundation for further functional investigation of the 14-3-3 genes in fishes. Copyright © 2018 Elsevier B.V. All rights reserved.
Yan, Xin; Gu, Tao; Yi, Zhongquan; Huang, Junwei; Liu, Xiaowei; Zhang, Ji; Xu, Xihui; Xin, Zhihong; Hong, Qing; He, Jian; Spain, Jim C; Li, Shunpeng; Jiang, Jiandong
2016-12-01
The worldwide use of the phenylurea herbicide, isoproturon (IPU), has resulted in considerable concern about its environmental fate. Although many microbial metabolites of IPU are known and IPU-mineralizing bacteria have been isolated, the molecular mechanism of IPU catabolism has not been elucidated yet. In this study, complete genes that encode the conserved IPU catabolic pathway were revealed, based on comparative analysis of the genomes of three IPU-mineralizing sphingomonads and subsequent experimental validation. The complete genes included a novel hydrolase gene ddhA, which is responsible for the cleavage of the urea side chain of the IPU demethylated products; a distinct aniline dioxygenase gene cluster adoQTA1A2BR, which has a broad substrate range; and an inducible catechol meta-cleavage pathway gene cluster adoXEGKLIJC. Furthermore, the initial mono-N-demethylation genes pdmAB were further confirmed to be involved in the successive N-demethylation of the IPU mono-N-demethylated product. These IPU-catabolic genes were organized into four transcription units and distributed on three plasmids. They were flanked by multiple mobile genetic elements and highly conserved among IPU-mineralizing sphingomonads. The elucidation of the molecular mechanism of IPU catabolism will enhance our understanding of the microbial mineralization of IPU and provide insights into the evolutionary scenario of the conserved IPU-catabolic pathway. © 2016 The Authors. Environmental Microbiology published by Society for Applied Microbiology and John Wiley & Sons Ltd.
Kavakiotis, Ioannis; Xochelli, Aliki; Agathangelidis, Andreas; Tsoumakas, Grigorios; Maglaveras, Nicos; Stamatopoulos, Kostas; Hadzidimitriou, Anastasia; Vlahavas, Ioannis; Chouvarda, Ioanna
2016-06-06
Somatic Hypermutation (SHM) refers to the introduction of mutations within rearranged V(D)J genes, a process that increases the diversity of Immunoglobulins (IGs). The analysis of SHM has offered critical insight into the physiology and pathology of B cells, leading to strong prognostication markers for clinical outcome in chronic lymphocytic leukaemia (CLL), the most frequent adult B-cell malignancy. In this paper we present a methodology for integrating multiple immunogenetic and clinicobiological data sources in order to extract features and create high quality datasets for SHM analysis in IG receptors of CLL patients. This dataset is used as the basis for a higher level integration procedure, inspired by social choice theory. This is applied in the Towards analysis, our attempt to investigate the potential ontogenetic transformation of genes belonging to specific stereotyped CLL subsets towards other genes or gene families, through SHM. The data integration process, followed by feature extraction, resulted in the generation of a dataset containing information about mutations occurring through SHM. The Towards analysis, performed on the integrated dataset by applying voting techniques, revealed the distinct behaviour of subset #201 compared to other subsets, as regards SHM-related movements among gene clans, both in allele-conserved and non-conserved gene areas. With respect to movement between genes, a high percentage of movement towards pseudogenes was found in all CLL subsets. This data integration and feature extraction process can set the basis for exploratory analysis or a fully automated computational data mining approach on many as yet unanswered, clinically relevant biological questions.
Friedberg, Felix
2009-05-01
In this paper we examine (restricted to Homo sapiens) the products resulting from gene duplication and the subsequent alternative splicing for the members of a multidomain group of proteins which possess the evolutionarily conserved calponin homology (CH) domain, i.e. an "actin binding domain", as a singlet and which, in addition, contain the conserved cysteine-rich double Zn finger-possessing LIM domain, also as a singlet. Seven genes, resulting from gene duplications, were identified that code for seven group members for which pre-mRNAs appear to have undergone multiple alternative splicing: Mical 1, 2 and 3 are located on chromosomes 6q21, 11p15 and 22q11, respectively. The LMO7 gene is present on chromosome 13q22 and the LIMCH1 gene on chromosome 4p13. Micall1 is mapped to chromosome 22q13 and Micall2 to chromosome 7p22. Translated GenBank ESTs suggest the existence of multiple products alternatively spliced from the pre-mRNAs encoded by these genes. Characteristic indicators of such splicing among the proteins derived from one gene must include containment of some common extensive 100% identical regions. In some instances only one exon might be partly or completely eliminated. Sometimes alternative splicing is also associated with an increased frequency of creation of an exon or part of an exon from an intron. Not only coding regions for the body of the protein but also for its N- or C-ends could be affected by the splicing. If created forms are merely beginning at different starting points but remain identical in sequence thereafter, their existence as products of alternative splicing must be questioned. In the splicings described in this paper, multiple isoforms rather than a single isoform appear as products during the gene expression.
A cross-species bi-clustering approach to identifying conserved co-regulated genes.
Sun, Jiangwen; Jiang, Zongliang; Tian, Xiuchun; Bi, Jinbo
2016-06-15
A growing number of studies have explored the process of pre-implantation embryonic development of multiple mammalian species. However, the conservation and variation among different species in their developmental programming are poorly defined due to the lack of effective computational methods for detecting co-regulated genes that are conserved across species. The most sophisticated method to date for identifying conserved co-regulated genes is a two-step approach. This approach first identifies gene clusters for each species by a cluster analysis of gene expression data, and subsequently computes the overlaps of clusters identified from different species to reveal common subgroups. This approach is ineffective in dealing with the noise in the expression data introduced by the complicated procedures in quantifying gene expression. Furthermore, due to the sequential nature of the approach, the gene clusters identified in the first step may have little overlap among different species in the second step, making it difficult to detect conserved co-regulated genes. We propose a cross-species bi-clustering approach which first denoises the gene expression data of each species into a data matrix. The rows of the data matrices of different species represent the same set of genes that are characterized by their expression patterns over the developmental stages of each species as columns. A novel bi-clustering method is then developed to cluster genes into subgroups by a joint sparse rank-one factorization of all the data matrices. This method decomposes a data matrix into a product of a column vector and a row vector where the column vector is a consistent indicator across the matrices (species) to identify the same gene cluster and the row vector specifies for each species the developmental stages that the clustered genes co-regulate. An efficient optimization algorithm has been developed with convergence analysis.
This approach was first validated on synthetic data and compared to the two-step method and several recent joint clustering methods. We then applied this approach to two real world datasets of gene expression during the pre-implantation embryonic development of the human and mouse. Co-regulated genes consistent between the human and mouse were identified, offering insights into conserved functions, as well as similarities and differences in genome activation timing between the human and mouse embryos. The R package containing the implementation of the proposed method in C++ is available at: https://github.com/JavonSun/mvbc.git and also at the R platform https://www.r-project.org/. Contact: jinbo@engr.uconn.edu. © The Author 2016. Published by Oxford University Press.
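The joint sparse rank-one factorization described in this abstract can be illustrated with a small toy version. The sketch below is an assumption-laden simplification and not the authors' algorithm or the mvbc package: it alternates between a least-squares update of each species' stage vector and a soft-thresholded, renormalized update of a gene vector shared across species, in the style of penalized rank-one decompositions. The function names, the `lam` penalty, and the fixed iteration count are hypothetical choices, and the denoising step and convergence analysis mentioned in the abstract are not modeled.

```python
import math


def soft_threshold(x, lam):
    """Shrink x toward zero by lam (proximal operator of an L1 penalty)."""
    if x > lam:
        return x - lam
    if x < -lam:
        return x + lam
    return 0.0


def stage_profiles(matrices, u):
    """Per-species stage vectors v_s = X_s^T u for a unit-norm gene vector u."""
    return [
        [sum(u[g] * X[g][t] for g in range(len(u))) for t in range(len(X[0]))]
        for X in matrices
    ]


def joint_rank_one(matrices, lam=1.0, n_iter=25):
    """Fit one shared sparse gene vector u and one stage vector per species.

    matrices: list of genes x stages matrices (lists of rows), one per
              species, with the same genes in the same row order.
    Returns (u, vs); nonzero entries of u mark genes in the shared cluster.
    """
    n_genes = len(matrices[0])
    u = [1.0 / math.sqrt(n_genes)] * n_genes  # start with all genes included
    for _ in range(n_iter):
        vs = stage_profiles(matrices, u)
        # Pool the evidence for each gene across species, then shrink it.
        w = [
            soft_threshold(
                sum(sum(X[g][t] * v[t] for t in range(len(v)))
                    for X, v in zip(matrices, vs)),
                lam,
            )
            for g in range(n_genes)
        ]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:
            return [0.0] * n_genes, vs  # penalty removed every gene
        u = [x / norm for x in w]
    return u, stage_profiles(matrices, u)


# Toy data: genes 0 and 1 are co-regulated in both species, gene 2 is weak
# noise, gene 3 is silent. The two species have different numbers of stages.
human = [[2.0, 2.0, 0.0], [2.0, 2.0, 0.0], [0.1, 0.0, 0.0], [0.0, 0.0, 0.0]]
mouse = [[0.0, 3.0, 3.0, 0.0], [0.0, 3.0, 3.0, 0.0],
         [0.0, 0.0, 0.1, 0.0], [0.0, 0.0, 0.0, 0.0]]
u, vs = joint_rank_one([human, mouse])
print(u)   # shared gene indicator
print(vs)  # stage profile per species
```

Sharing the gene vector across species is what distinguishes this from the two-step approach criticized in the abstract: the cluster is chosen using all species at once, so it cannot fail to overlap afterwards.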
Kito, Keiji; Ito, Haruka; Nohara, Takehiro; Ohnishi, Mihoko; Ishibashi, Yuko; Takeda, Daisuke
2016-01-01
Omics analysis is a versatile approach for understanding the conservation and diversity of molecular systems across multiple taxa. In this study, we compared the proteome expression profiles of four yeast species (Saccharomyces cerevisiae, Saccharomyces mikatae, Kluyveromyces waltii, and Kluyveromyces lactis) grown on glucose- or glycerol-containing media. Conserved expression changes across all species were observed only for a small proportion of all proteins differentially expressed between the two growth conditions. Two Kluyveromyces species, both of which exhibited a high growth rate on glycerol, a nonfermentative carbon source, showed distinct species-specific expression profiles. In K. waltii grown on glycerol, proteins involved in the glyoxylate cycle and gluconeogenesis were expressed in high abundance. In K. lactis grown on glycerol, the expression of glycolytic and ethanol metabolic enzymes was unexpectedly low, whereas proteins involved in cytoplasmic translation, including ribosomal proteins and elongation factors, were highly expressed. These marked differences in the types of predominantly expressed proteins suggest that K. lactis optimizes the balance of proteome resource allocation between metabolism and protein synthesis giving priority to cellular growth. In S. cerevisiae, about 450 duplicate gene pairs were retained after whole-genome duplication. Intriguingly, we found that in the case of duplicates with conserved sequences, the total abundance of proteins encoded by a duplicate pair in S. cerevisiae was similar to that of the protein encoded by the nonduplicated ortholog in Kluyveromyces yeast. Given the frequency of haploinsufficiency, this observation suggests that conserved duplicate genes, even though minor cases of retained duplicates, do not exhibit a dosage effect in yeast, except for ribosomal proteins.
Thus, comparative proteomic analyses across multiple species may reveal not only species-specific characteristics of metabolic processes under nonoptimal culture conditions but also provide valuable insights into intriguing biological principles, including the balance of proteome resource allocation and the role of gene duplication in evolutionary history. PMID:26560065
Kito, Keiji; Ito, Haruka; Nohara, Takehiro; Ohnishi, Mihoko; Ishibashi, Yuko; Takeda, Daisuke
2016-01-01
Omics analysis is a versatile approach for understanding the conservation and diversity of molecular systems across multiple taxa. In this study, we compared the proteome expression profiles of four yeast species (Saccharomyces cerevisiae, Saccharomyces mikatae, Kluyveromyces waltii, and Kluyveromyces lactis) grown on glucose- or glycerol-containing media. Conserved expression changes across all species were observed only for a small proportion of all proteins differentially expressed between the two growth conditions. Two Kluyveromyces species, both of which exhibited a high growth rate on glycerol, a nonfermentative carbon source, showed distinct species-specific expression profiles. In K. waltii grown on glycerol, proteins involved in the glyoxylate cycle and gluconeogenesis were expressed in high abundance. In K. lactis grown on glycerol, the expression of glycolytic and ethanol metabolic enzymes was unexpectedly low, whereas proteins involved in cytoplasmic translation, including ribosomal proteins and elongation factors, were highly expressed. These marked differences in the types of predominantly expressed proteins suggest that K. lactis optimizes the balance of proteome resource allocation between metabolism and protein synthesis giving priority to cellular growth. In S. cerevisiae, about 450 duplicate gene pairs were retained after whole-genome duplication. Intriguingly, we found that in the case of duplicates with conserved sequences, the total abundance of proteins encoded by a duplicate pair in S. cerevisiae was similar to that of the protein encoded by the nonduplicated ortholog in Kluyveromyces yeast. Given the frequency of haploinsufficiency, this observation suggests that conserved duplicate genes, even though minor cases of retained duplicates, do not exhibit a dosage effect in yeast, except for ribosomal proteins.
Thus, comparative proteomic analyses across multiple species may reveal not only species-specific characteristics of metabolic processes under nonoptimal culture conditions but also provide valuable insights into intriguing biological principles, including the balance of proteome resource allocation and the role of gene duplication in evolutionary history. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Toxicogenomic effects common to triazole antifungals and conserved between rats and humans.
Goetz, Amber K; Dix, David J
2009-07-01
The triazole antifungals myclobutanil, propiconazole and triadimefon cause varying degrees of hepatic toxicity and disrupt steroid hormone homeostasis in rodent in vivo models. To identify biological pathways consistently modulated across multiple timepoints and various study designs, gene expression profiling was conducted on rat livers from three separate studies with triazole treatment groups ranging from 6 h after a single oral gavage exposure, to prenatal to adult exposures via feed. To explore conservation of responses across species, gene expression data from the rat liver studies were compared to in vitro data from rat and human primary hepatocytes exposed to the triazoles. Toxicogenomic data on triazoles from 33 different treatment groups and 135 samples (microarrays) identified thousands of probe sets and dozens of pathways differentially expressed across time, dose, and species; many of these were common to all three triazoles, or conserved between rodents and humans. Common and conserved pathways included androgen and estrogen metabolism, xenobiotic metabolism signaling through CAR and PXR, and CYP mediated metabolism. Differentially expressed genes included the Phase I xenobiotic, fatty acid, sterol and steroid metabolism genes Cyp2b2 and CYP2B6, Cyp3a1 and CYP3A4, and Cyp4a22 and CYP4A11; Phase II conjugation enzyme genes Ugt1a1 and UGT1A1; and Phase III ABC transporter genes Abcb1 and ABCB1. Gene expression changes caused by all three triazoles in liver and hepatocytes were concentrated in biological pathways regulating lipid, sterol and steroid homeostasis, identifying a potential common mode of action conserved between rodents and humans. Modulation of hepatic sterol and steroid metabolism is a plausible mode of action for changes in serum testosterone and adverse reproductive outcomes observed in rat studies, and may be relevant to human risk assessment.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Devor, E.J.; Dill-Devor, R.M.
1994-09-01
We have obtained a number of unique sequences via PCR amplification of human genomic DNA using degenerate primers under low stringency (42°C). One of these, an 853 bp product, has been identified as a partial genomic sequence of the human homolog of the S. cerevisiae CDC27 gene, CDC27Hs (GenBank No. U00001). This gene, reported by Turgendreich et al., is also designated EST00556 from Adams et al. We have undertaken a more detailed examination of our sequence, MCP34N, and have found that: 1. the genomic sequence is nearly identical to CDC27Hs over its entire 853 bp length; 2. an MCP34N-specific PCR assay of several non-human primate species reveals amplification products in chimpanzee and gorilla genomes having greater than 90% sequence identity with CDC27Hs; and 3. an MCP34N-specific PCR assay of the BIOS hybrid cell line panel gives a discordancy pattern suggesting multiple loci. Based upon these data, we present the following initial characterization: 1. the complete MCP34N sequence identity with CDC27Hs indicates that the latter is encoded by an intronless gene; 2. CDC27Hs is highly conserved among higher primates; and 3. CDC27Hs is present in multiple copies in the human genome. These characteristics, taken together with those initially reported for CDC27Hs, suggest that this is an old gene that carries out an important but, as yet, unknown function in the human brain.
Many nonuniversal archaeal ribosomal proteins are found in conserved gene clusters
WANG, JIACHEN; DASGUPTA, INDRANI; FOX, GEORGE E.
2009-01-01
The genomic associations of the archaeal ribosomal proteins (r-proteins) were examined in detail. The archaeal versions of the universal r-protein genes are typically in clusters similar or identical to those found in bacteria. Of the 35 nonuniversal archaeal r-protein genes examined, the gene encoding L18e was found to be associated with the conserved L13 cluster, whereas the genes for S4e, L32e and L19e were found in the archaeal version of the spc operon. Eleven nonuniversal protein genes were not associated with any common genomic context. Of the remaining 19 protein genes, 17 were convincingly assigned to one of 10 previously unrecognized gene clusters. Examination of the gene content of these clusters revealed multiple associations with genes involved in the initiation of protein synthesis, transcription or other cellular processes. The lack of such associations in the universal clusters suggests that initially the ribosome evolved largely independently of other processes. More recently it likely has evolved in concert with other cellular systems. It was also verified that a second copy of the gene encoding L7ae found in some bacteria is actually a homolog of the gene encoding L30e and should be annotated as such. PMID:19478915
Plant polyadenylation factors: conservation and variety in the polyadenylation complex in plants.
Hunt, Arthur G; Xing, Denghui; Li, Qingshun Q
2012-11-20
Polyadenylation, an essential step in eukaryotic gene expression, requires both cis-elements and a plethora of trans-acting polyadenylation factors. The polyadenylation factors are largely conserved across mammals and fungi. The conservation also seems to extend to plants, based on analyses of Arabidopsis polyadenylation factors. To extend this observation, we systematically identified the orthologs of yeast and human polyadenylation factors from 10 plant species chosen based on both the availability of their genome sequences and their positions in the evolutionary tree, which render them representatives of different plant lineages. The evolutionary trajectories revealed several interesting features of plant polyadenylation factors. First, the number of genes encoding plant polyadenylation factors clearly increased from "lower" to "higher" plants. Second, the gene expansion in higher plants was biased toward some polyadenylation factors, particularly those involved in RNA binding. Finally, while there are clear commonalities, the differences in the polyadenylation apparatus were obvious across different species, suggesting an ongoing process of evolutionary change. These features lead to a model in which the plant polyadenylation complex consists of a conserved core, which is rather rigid in terms of evolutionary conservation, and a panoply of peripheral subunits, which are less conserved and associated with the core in various combinations, forming a collection of somewhat distinct complex assemblies. The multiple forms of the plant polyadenylation complex, together with the diversified polyA signals, may explain the intensive alternative polyadenylation (APA) and its regulatory role in biological functions of higher plants.
Boj, Sylvia F.; Servitja, Joan Marc; Martin, David; Rios, Martin; Talianidis, Iannis; Guigo, Roderic; Ferrer, Jorge
2009-01-01
OBJECTIVE The evolutionary conservation of transcriptional mechanisms has been widely exploited to understand human biology and disease. Recent findings, however, unexpectedly showed that the transcriptional regulators hepatocyte nuclear factor (HNF)-1α and -4α rarely bind to the same genes in mice and humans, leading to the proposal that tissue-specific transcriptional regulation has undergone extensive divergence in the two species. Such observations have major implications for the use of mouse models to understand HNF-1α– and HNF-4α–deficient diabetes. However, the significance of studies that assess binding without considering regulatory function is poorly understood. RESEARCH DESIGN AND METHODS We compared previously reported mouse and human HNF-1α and HNF-4α binding studies with independent binding experiments. We also integrated binding studies with mouse and human loss-of-function gene expression datasets. RESULTS First, we confirmed the existence of species-specific HNF-1α and -4α binding, yet observed incomplete detection of binding in the different datasets, causing an underestimation of binding conservation. Second, only a minor fraction of HNF-1α– and HNF-4α–bound genes were downregulated in the absence of these regulators. This subset of functional targets did not show evidence for evolutionary divergence of binding or binding sequence motifs. Finally, we observed differences between conserved and species-specific binding properties. For example, conserved binding was more frequently located near transcriptional start sites and was more likely to involve multiple binding events in the same gene. CONCLUSIONS Despite evolutionary changes in binding, essential direct transcriptional functions of HNF-1α and -4α are largely conserved between mice and humans. PMID:19188435
Grienenberger, Etienne; Douglas, Carl J.
2014-01-01
Despite a strict conservation of the vascular tissues in vascular plants (tracheophytes), our understanding of the genetic basis underlying the differentiation of secondary cell wall-containing cells in the xylem of tracheophytes is still far from complete. Using coexpression analysis and phylogenetic conservation across sequenced tracheophyte genomes, we identified a number of Arabidopsis (Arabidopsis thaliana) genes of unknown function whose expression is correlated with secondary cell wall deposition. Among these, the Arabidopsis VASCULAR-RELATED UNKNOWN PROTEIN1 (VUP1) gene encodes a predicted protein of 24 kD with no annotated functional domains but containing domains that are highly conserved in tracheophytes. Here, we show that the VUP1 expression pattern, determined by promoter-β-glucuronidase reporter gene expression, is associated with vascular tissues, while vup1 loss-of-function mutants exhibit collapsed morphology of xylem vessel cells. Constitutive overexpression of VUP1 caused dramatic and pleiotropic developmental defects, including severe dwarfism, dark green leaves, reduced apical dominance, and altered photomorphogenesis, resembling brassinosteroid-deficient mutants. Constitutive overexpression of VUP homologs from multiple tracheophyte species induced similar defects. Whole-genome transcriptome analysis revealed that overexpression of VUP1 represses the expression of many brassinosteroid- and auxin-responsive genes. Additionally, deletion constructs and site-directed mutagenesis were used to identify critical domains and amino acids required for VUP1 function. Altogether, our data suggest a conserved role for VUP1 in regulating secondary wall formation during vascular development by tissue- or cell-specific modulation of hormone signaling pathways. PMID:24567189
Comparative Genomics of Syntrophic Branched-Chain Fatty Acid Degrading Bacteria
Narihiro, Takashi; Nobu, Masaru K.; Tamaki, Hideyuki; Kamagata, Yoichi; Sekiguchi, Yuji; Liu, Wen-Tso
2016-01-01
The syntrophic degradation of branched-chain fatty acids (BCFAs) such as 2-methylbutyrate and isobutyrate is an essential step in the production of methane from proteins/amino acids in anaerobic ecosystems. While a few syntrophic BCFA-degrading bacteria have been isolated, their metabolic pathways in BCFA and short-chain fatty acid (SCFA) degradation as well as energy conservation systems remain unclear. In an attempt to identify these pathways, we herein performed comparative genomics of three syntrophic bacteria: 2-methylbutyrate-degrading “Syntrophomonas wolfei subsp. methylbutyratica” strain JCM 14075T (=4J5T), isobutyrate-degrading Syntrophothermus lipocalidus strain TGB-C1T, and non-BCFA-metabolizing S. wolfei subsp. wolfei strain GöttingenT. We demonstrated that 4J5 and TGB-C1 both encode multiple genes/gene clusters involved in β-oxidation, as observed in the Göttingen genome, which has multiple copies of genes associated with butyrate degradation. The 4J5 genome possesses phylogenetically distinct β-oxidation genes, which may be involved in 2-methylbutyrate degradation. In addition, these Syntrophomonadaceae strains harbor various hydrogen/formate generation systems (i.e., electron-bifurcating hydrogenase, formate dehydrogenase, and membrane-bound hydrogenase) and energy-conserving electron transport systems, including electron transfer flavoprotein (ETF)-linked acyl-CoA dehydrogenase, ETF-linked iron-sulfur binding reductase, ETF dehydrogenase (FixABCX), and flavin oxidoreductase-heterodisulfide reductase (Flox-Hdr). Unexpectedly, the TGB-C1 genome encodes a nitrogenase complex, which may function as an alternative H2 generation mechanism. These results suggest that the BCFA-degrading syntrophic strains 4J5 and TGB-C1 possess specific β-oxidation-related enzymes for BCFA oxidation as well as appropriate energy conservation systems to perform thermodynamically unfavorable syntrophic metabolism. PMID:27431485
McCormick, Mark A.; Delaney, Joe R.; Tsuchiya, Mitsuhiro; Tsuchiyama, Scott; Shemorry, Anna; Sim, Sylvia; Chou, Annie Chia-Zong; Ahmed, Umema; Carr, Daniel; Murakami, Christopher J.; Schleit, Jennifer; Sutphin, George L.; Wasko, Brian M.; Bennett, Christopher F.; Wang, Adrienne M.; Olsen, Brady; Beyer, Richard P.; Bammler, Theodor K.; Prunkard, Donna; Johnson, Simon C.; Pennypacker, Juniper K.; An, Elroy; Anies, Arieanna; Castanza, Anthony S.; Choi, Eunice; Dang, Nick; Enerio, Shiena; Fletcher, Marissa; Fox, Lindsay; Goswami, Sarani; Higgins, Sean A.; Holmberg, Molly A.; Hu, Di; Hui, Jessica; Jelic, Monika; Jeong, Ki-Soo; Johnston, Elijah; Kerr, Emily O.; Kim, Jin; Kim, Diana; Kirkland, Katie; Klum, Shannon; Kotireddy, Soumya; Liao, Eric; Lim, Michael; Lin, Michael S.; Lo, Winston C.; Lockshon, Dan; Miller, Hillary A.; Moller, Richard M.; Muller, Brian; Oakes, Jonathan; Pak, Diana N.; Peng, Zhao Jun; Pham, Kim M.; Pollard, Tom G.; Pradeep, Prarthana; Pruett, Dillon; Rai, Dilreet; Robison, Brett; Rodriguez, Ariana A.; Ros, Bopharoth; Sage, Michael; Singh, Manpreet K.; Smith, Erica D.; Snead, Katie; Solanky, Amrita; Spector, Benjamin L.; Steffen, Kristan K.; Tchao, Bie Nga; Ting, Marc K.; Wende, Helen Vander; Wang, Dennis; Welton, K. Linnea; Westman, Eric A.; Brem, Rachel B.; Liu, Xin-guang; Suh, Yousin; Zhou, Zhongjun; Kaeberlein, Matt; Kennedy, Brian K.
2015-01-01
SUMMARY Many genes that affect replicative lifespan (RLS) in the budding yeast Saccharomyces cerevisiae also affect aging in other organisms such as C. elegans and M. musculus. We performed a systematic analysis of yeast RLS in a set of 4,698 viable single-gene deletion strains. Multiple functional gene clusters were identified, and full genome-to-genome comparison demonstrated a significant conservation in longevity pathways between yeast and C. elegans. Among the mechanisms of aging identified, deletion of tRNA exporter LOS1 robustly extended lifespan. Dietary restriction (DR) and inhibition of mechanistic Target of Rapamycin (mTOR) exclude Los1 from the nucleus in a Rad53-dependent manner. Moreover, lifespan extension from deletion of LOS1 is non-additive with DR or mTOR inhibition, and results in Gcn4 transcription factor activation. Thus, the DNA damage response and mTOR converge on Los1-mediated nuclear tRNA export to regulate Gcn4 activity and aging. PMID:26456335
Identification and expression analysis of zebrafish glypicans during embryonic development.
Gupta, Mansi; Brand, Michael
2013-01-01
Heparan sulfate proteoglycans (HSPGs) are ubiquitous molecules with indispensable functions in various biological processes. Glypicans are a family of HSPGs, characterized by a GPI anchor which directs them to the cell surface and/or extracellular matrix, where they regulate growth factor signaling during development and disease. We report the identification and expression pattern of glypican genes from zebrafish. The zebrafish genome contains 10 glypican homologs, as opposed to six in mammals, which are highly conserved and are phylogenetically related to the mammalian genes. Some of the fish glypicans, such as Gpc1a, Gpc3, Gpc4, Gpc6a and Gpc6b, show conserved synteny with their mammalian cognate genes. Many glypicans are expressed during the gastrulation stage, but their expression becomes more tissue specific and defined during somitogenesis stages, particularly in the developing central nervous system. The existence of multiple glypican orthologs in fish with diverse expression patterns suggests highly specialized and/or redundant functions of these genes during embryonic development.
Insights into the innate immunome of actiniarians using a comparative genomic approach.
van der Burg, Chloé A; Prentis, Peter J; Surm, Joachim M; Pavasovic, Ana
2016-11-02
Innate immune genes tend to be highly conserved in metazoans, even in early divergent lineages such as Cnidaria (jellyfish, corals, hydroids and sea anemones) and Porifera (sponges). However, constant and diverse selection pressures on the immune system have driven the expansion and diversification of different immune gene families in a lineage-specific manner. To investigate how the innate immune system has evolved in a subset of sea anemone species (Order: Actiniaria), we performed a comprehensive and comparative study using 10 newly sequenced transcriptomes, as well as three publicly available transcriptomes, to identify the origins, expansions and contractions of candidate and novel immune gene families. We characterised five conserved genes and gene families, as well as multiple novel innate immune genes, including the newly recognised putative pattern recognition receptor CniFL. Single copies of TLR, MyD88 and NF-κB were found in most species, and several copies of IL-1R-like, NLR and CniFL were found in almost all species. Multiple novel immune genes were identified with domain architectures including the Toll/interleukin-1 receptor (TIR) homology domain, which is well documented as functioning in protein-protein interactions and signal transduction in immune pathways. We hypothesise that these genes may interact as novel proteins in immune pathways of cnidarian species. Novelty in the actiniarian immunome is not restricted to only TIR-domain-containing proteins, as we identify a subset of NLRs which have undergone neofunctionalisation and contain 3-5 N-terminal transmembrane domains, which have so far only been identified in two anthozoan species. This research has significance in understanding the evolution and origin of the core eumetazoan gene set, including how novel innate immune genes evolve.
For example, the evolution of transmembrane domain containing NLRs indicates that these NLRs may be membrane-bound, while all other metazoan and plant NLRs are exclusively cytosolic receptors. This is one example of how species without an adaptive immune system may evolve innovative solutions to detect pathogens or interact with native microbiota. Overall, these results provide an insight into the evolution of the innate immune system, and show that early divergent lineages, such as actiniarians, have a diverse repertoire of conserved and novel innate immune genes.
Azevedo, Gabriel C; Cheavegatti-Gianotto, Adriana; Negri, Bárbara F; Hufnagel, Bárbara; E Silva, Luciano da Costa; Magalhaes, Jurandir V; Garcia, Antonio Augusto F; Lana, Ubiraci G P; de Sousa, Sylvia M; Guimaraes, Claudia T
2015-07-07
Modifications in root morphology are important strategies to maximize soil exploitation under phosphorus starvation in plants. Here, we used two multiple interval models to map QTLs related to root traits, biomass accumulation and P content in a maize RIL population cultivated in nutrient solution. In addition, we searched for putative maize homologs to PSTOL1, a gene responsible for enhancing early root growth, P uptake and grain yield in rice and sorghum. Based on path analysis, root surface area was the root morphology component that most strongly contributed to total dry weight and to P content in maize seedlings under low-P availability. Multiple interval mapping models for single (MIM) and multiple traits (MT-MIM) were combined and revealed 13 genomic regions significantly associated with the target traits in a complementary way. The phenotypic variances explained by all QTLs and their epistatic interactions using MT-MIM (23.4 to 35.5 %) were higher than in previous studies, and presented superior statistical power. Some of these QTLs were coincident with QTLs for root morphology traits and grain yield previously mapped, whereas others harbored ZmPSTOL candidate genes, which shared more than 55 % of amino acid sequence identity and a conserved serine/threonine kinase domain with OsPSTOL1. Additionally, four ZmPSTOL candidate genes co-localized with QTLs for root morphology, biomass accumulation and/or P content were preferentially expressed in roots of the parental lines that contributed the alleles enhancing the respective phenotypes. QTL mapping strategies adopted in this study revealed complementary results for single and multiple traits with high accuracy. Some QTLs, mainly the ones that were also associated with yield performance in other studies, can be good targets for marker-assisted selection to improve P-use efficiency in maize.
Based on the co-localization with QTLs, the protein domain conservation and the coincidence of gene expression, we selected novel maize genes as putative homologs to PSTOL1 that will require further validation studies.
Olson, Nathan D.; Lund, Steven P.; Zook, Justin M.; Rojas-Cornejo, Fabiola; Beck, Brian; Foy, Carole; Huggett, Jim; Whale, Alexandra S.; Sui, Zhiwei; Baoutina, Anna; Dobeson, Michael; Partis, Lina; Morrow, Jayne B.
2015-01-01
This study presents the results from an interlaboratory sequencing study for which we developed a novel high-resolution method for comparing data from different sequencing platforms for a multi-copy, paralogous gene. The combination of PCR amplification and 16S ribosomal RNA gene (16S rRNA) sequencing has revolutionized bacteriology by enabling rapid identification, frequently without the need for culture. To assess variability between laboratories in sequencing 16S rRNA, six laboratories sequenced the gene encoding the 16S rRNA from Escherichia coli O157:H7 strain EDL933 and Listeria monocytogenes serovar 4b strain NCTC11994. Participants performed sequencing methods and protocols available in their laboratories: Sanger sequencing, Roche 454 pyrosequencing®, or Ion Torrent PGM®. The sequencing data were evaluated on three levels: (1) identity of biologically conserved positions, (2) ratio of 16S rRNA gene copies featuring identified variants, and (3) the collection of variant combinations in a set of 16S rRNA gene copies. The same set of biologically conserved positions was identified for each sequencing method. Analytical methods using Bayesian and maximum likelihood statistics were developed to estimate variant copy ratios, which describe the ratio of nucleotides at each identified biologically variable position, as well as the likely set of variant combinations present in 16S rRNA gene copies. Our results indicate that estimated variant copy ratios at biologically variable positions were only reproducible for high throughput sequencing methods. Furthermore, the likely variant combination set was only reproducible with increased sequencing depth and longer read lengths. We also demonstrate novel methods for evaluating variable positions when comparing multi-copy gene sequence data from multiple laboratories generated using multiple sequencing technologies. PMID:27077030
Kang, Hye-Min; Lee, Jin-Sol; Kim, Min-Sub; Lee, Young Hwan; Jung, Jee-Hyun; Hagiwara, Atsushi; Zhou, Bingsheng; Lee, Jae-Seong; Jeong, Chang-Bum
2018-05-30
Autophagy originated from the common ancestor of all life forms, and its function is highly conserved from yeast to humans. Autophagy plays a key role in various fundamental biological processes including defense, and has developed through serial interactions of multiple gene sets referred to as autophagy-related (Atg) genes. Despite their significance in metazoan life and evolution, few studies have been conducted to identify these genes in aquatic invertebrates. In this study, we identified whole Atg genes in four Brachionus rotifer spp., namely B. calyciflorus, B. koreanus, B. plicatilis, and B. rotundiformis, through searches of their entire genomes; and we annotated them according to the yeast nomenclature. Twenty-four genes orthologous to yeast genes were present in all of the Brachionus spp. while three additional gene duplicates were identified in the genome of B. koreanus, indicating that these genes had diversified during the speciation. Also, their transcriptional responses to cadmium exposure indicated regulation by cadmium-induced oxidative-stress-related signaling pathways. This study provides valuable information on 99 conserved Atg genes involved in autophagosome formation in Brachionus spp., with transcriptional modulation in response to cadmium, in the context of the role of autophagy in the damage response. Copyright © 2018 Elsevier B.V. All rights reserved.
Nagy, Vanja; Cole, Tiffany; Van Campenhout, Claude; Khoung, Thang M; Leung, Calvin; Vermeiren, Simon; Novatchkova, Maria; Wenzel, Daniel; Cikes, Domagoj; Polyansky, Anton A; Kozieradzki, Ivona; Meixner, Arabella; Bellefroid, Eric J; Neely, G Gregory; Penninger, Josef M
2015-01-01
PR homology domain-containing member 12 (PRDM12) belongs to a family of conserved transcription factors implicated in cell fate decisions. Here we show that PRDM12 is a key regulator of sensory neuronal specification in Xenopus. Modeling of human PRDM12 mutations that cause hereditary sensory and autonomic neuropathy (HSAN) revealed remarkable conservation of the mutated residues in evolution. Expression of wild-type human PRDM12 in Xenopus induced the expression of sensory neuronal markers, which was reduced using various human PRDM12 mutants. In Drosophila, we identified Hamlet as the functional PRDM12 homolog that controls nociceptive behavior in sensory neurons. Furthermore, expression analysis of human patient fibroblasts with PRDM12 mutations uncovered possible downstream target genes. Knockdown of several of these target genes including thyrotropin-releasing hormone degrading enzyme (TRHDE) in Drosophila sensory neurons resulted in altered cellular morphology and impaired nociception. These data show that PRDM12 and its functional fly homolog Hamlet are evolutionarily conserved master regulators of sensory neuronal specification and play a critical role in pain perception. Our data also uncover novel pathways in multiple species that regulate evolutionarily conserved nociception.
Law, Sheran Hiu Wan; Redelings, Benjamin David; Kullman, Seth William
2012-01-15
The availability of multiple teleost (bony fish) genomes is providing unprecedented opportunities to understand the diversity and function of gene duplication events using comparative genomics. Here we examine multiple paralogous genes of γ-glutamyl transferase (GGT) in several distantly related teleost species including medaka, stickleback, green spotted pufferfish, fugu, and zebrafish. Through mining genome databases, we have identified multiple GGT orthologs. Duplicate (paralogous) GGT sequences for GGT1 (GGT1 a and b), GGTL1 (GGTL1 a and b), and GGTL3 (GGTL3 a and b) were identified for each species. Phylogenetic analysis suggests that GGTs are ancient proteins conserved across most metazoan phyla and that paralogous GGTs in teleosts likely arose from the serial 3R genome duplication events. A third GGTL1 gene (GGTL1c) was found in green spotted pufferfish; however, this gene is not present in medaka, stickleback, or fugu. Similarly, one or both paralogs of GGTL3 appear to have been lost in green spotted pufferfish, fugu, and zebrafish. Syntenic relationships were highly maintained between duplicated teleost chromosomes, among teleosts and across ray-finned (Actinopterygii) and lobe-finned (Sarcopterygii) species. To assess subfunction partitioning, six medaka GGT genes were cloned and assessed for developmental and tissue-specific expression. On the basis of these data, we propose a modification of the "duplication-degeneration-complementation" model of subfunction partitioning where quantitative differences rather than absolute differences in gene expression are observed between gene paralogs. Our results demonstrate that multiple GGT genes have been retained within teleost genomes. Questions remain, however, regarding the functional roles of multiple GGTs in these species. Copyright © 2011 Wiley Periodicals, Inc., A Wiley Company.
Miao, L X; Jiang, M; Zhang, Y C; Yang, X F; Zhang, H Q; Zhang, Z F; Wang, Y Z; Jiang, G H
2016-08-05
The MLO (powdery mildew locus O) gene family is important in resistance to powdery mildew (PM). In this study, all of the members of the MLO family were identified and analyzed in the strawberry (Fragaria vesca) genome. The strawberry contains at least 20 members of the MLO family, and the protein sequence contained between 171 and 1485 amino acids, with 0-34 introns. Chromosomal localization showed that the MLOs were unevenly distributed on each of the chromosomes, except for chromosome 4. The greatest number of MLOs (seven) was found on chromosome 3. A phylogenetic tree showed that the MLOs were divided into seven groups (I-VII), four of which consisted of MLOs from strawberry, Arabidopsis thaliana, rice, and maize, suggesting that these genes may have evolved after the divergence of monocots and dicots. Multiple sequence alignment showed that strawberry MLO candidates related to powdery mildew resistance possessed seven highly conserved transmembrane domains, a calmodulin-binding domain, and two conserved regions, all of which are important domains for powdery mildew resistance genes. Expressed sequence tag analysis revealed that the MLOs were induced by multiple abiotic stressors, including low and high temperature, drought, and high salinity. These findings will contribute to the functional characterization of MLOs related to PM susceptibility, and will assist in the development of disease resistance in strawberries.
Cavalcante, Manoella Gemaque; Bastos, Carlos Eduardo Matos Carvalho; Nagamachi, Cleusa Yoshiko; Pieczarka, Julio Cesar; Vicari, Marcelo Ricardo; Noronha, Renata Coelho Rodrigues
2018-01-01
Cytogenetic studies show that there is great karyotypic diversity in order Testudines (2n = 26–68), and that this may be mainly attributed to the presence/absence of microchromosomes. Members of the Podocnemididae family have the smallest diploid numbers of this order (2n = 26–28), which may be a derived condition of the group. Diverse studies suggest that repetitive-DNA-rich sites generally act as hotspots for double-strand breaks and chromosomal reorganization. In this context, we used fluorescent in situ hybridization (FISH) to map telomeric sequences (TTAGGG)n, 45S rDNA, and the genes encoding histones H1 and H3 in two species of genus Podocnemis. We also observed conservation of the 45S rDNA and H1 histone sequences (probable case of conserved synteny), but multiple conserved and non-conserved clusters of H3 genes, which colocalized with the interstitial telomeric sequences in the Podocnemis genome. Our results suggest that fusions have occurred between macro and microchromosomes or between microchromosomes, leading to the observed reduction in diploid number in the family Podocnemididae. PMID:29813087
Gene family size conservation is a good indicator of evolutionary rates.
Chen, Feng-Chi; Chen, Chiuan-Jung; Li, Wen-Hsiung; Chuang, Trees-Juen
2010-08-01
The evolution of duplicate genes has been a topic of broad interest. Here, we propose that the conservation of gene family size is a good indicator of the rate of sequence evolution and some other biological properties. By comparing the human-chimpanzee-macaque orthologous gene families with and without family size conservation, we demonstrate that genes with family size conservation evolve more slowly than those without family size conservation. Our results further demonstrate that both family expansion and contraction events may accelerate gene evolution, resulting in elevated evolutionary rates in the genes without family size conservation. In addition, we show that the duplicate genes with family size conservation evolve significantly more slowly than those without family size conservation. Interestingly, the median evolutionary rate of singletons falls in between those of the above two types of duplicate gene families. Our results thus suggest that the controversy on whether duplicate genes evolve more slowly than singletons can be resolved when family size conservation is taken into consideration. Furthermore, we also observe that duplicate genes with family size conservation have the highest level of gene expression/expression breadth, the highest proportion of essential genes, and the lowest gene compactness, followed by singletons and then by duplicate genes without family size conservation. Such a trend accords well with our observations of evolutionary rates. Our results thus point to the importance of family size conservation in the evolution of duplicate genes.
Comparative functional characterization of the CSR-1 22G-RNA pathway in Caenorhabditis nematodes
Tu, Shikui; Wu, Monica Z.; Wang, Jie; Cutter, Asher D.; Weng, Zhiping; Claycomb, Julie M.
2015-01-01
As a champion of small RNA research for two decades, Caenorhabditis elegans has revealed the essential Argonaute CSR-1 to play key nuclear roles in modulating chromatin, chromosome segregation and germline gene expression via 22G-small RNAs. Despite CSR-1 being preserved among diverse nematodes, the conservation and divergence in function of the targets of small RNA pathways remains poorly resolved. Here we apply comparative functional genomic analysis between C. elegans and Caenorhabditis briggsae to characterize the CSR-1 pathway, its targets and their evolution. C. briggsae CSR-1-associated small RNAs that we identified by immunoprecipitation-small RNA sequencing overlap with 22G-RNAs depleted in cbr-csr-1 RNAi-treated worms. By comparing 22G-RNAs and target genes between species, we defined a set of CSR-1 target genes with conserved germline expression, enrichment in operons and more slowly evolving coding sequences than other genes, along with a small group of evolutionarily labile targets. We demonstrate that the association of CSR-1 with chromatin is preserved, and show that depletion of cbr-csr-1 leads to chromosome segregation defects and embryonic lethality. This first comparative characterization of a small RNA pathway in Caenorhabditis establishes a conserved nuclear role for CSR-1 and highlights its key role in germline gene regulation across multiple animal species. PMID:25510497
Nevil, Markus; Bondra, Eliana R.; Schulz, Katharine N.; Kaplan, Tommy; Harrison, Melissa M.
2017-01-01
It has been suggested that transcription factor binding is temporally dynamic, and that changes in binding determine transcriptional output. Nonetheless, this model is based on relatively few examples in which transcription factor binding has been assayed at multiple developmental stages. The essential transcription factor Grainy head (Grh) is conserved from fungi to humans, and controls epithelial development and barrier formation in numerous tissues. Drosophila melanogaster, which possess a single grainy head (grh) gene, provide an excellent system to study this conserved factor. To determine whether temporally distinct binding events allow Grh to control cell fate specification in different tissue types, we used a combination of ChIP-seq and RNA-seq to elucidate the gene regulatory network controlled by Grh during four stages of embryonic development (spanning stages 5–17) and in larval tissue. Contrary to expectations, we discovered that Grh remains bound to at least 1146 genomic loci over days of development. In contrast to this stable DNA occupancy, the subset of genes whose expression is regulated by Grh varies. Grh transitions from functioning primarily as a transcriptional repressor early in development to functioning predominantly as an activator later. Our data reveal that Grh binds to target genes well before the Grh-dependent transcriptional program commences, suggesting it sets the stage for subsequent recruitment of additional factors that execute stage-specific Grh functions. PMID:28007888
The metazoan Mediator co-activator complex as an integrative hub for transcriptional regulation.
Malik, Sohail; Roeder, Robert G
2010-11-01
The Mediator is an evolutionarily conserved, multiprotein complex that is a key regulator of protein-coding genes. In metazoan cells, multiple pathways that are responsible for homeostasis, cell growth and differentiation converge on the Mediator through transcriptional activators and repressors that target one or more of the almost 30 subunits of this complex. Besides interacting directly with RNA polymerase II, Mediator has multiple functions and can interact with and coordinate the action of numerous other co-activators and co-repressors, including those acting at the level of chromatin. These interactions ultimately allow the Mediator to deliver outputs that range from maximal activation of genes to modulation of basal transcription to long-term epigenetic silencing.
Insights into bilaterian evolution from three spiralian genomes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Simakov, Oleg; Marletaz, Ferdinand; Cho, Sung-Jin
2012-01-07
Current genomic perspectives on animal diversity neglect two prominent phyla, the molluscs and annelids, that together account for nearly one-third of known marine species and are important both ecologically and as experimental systems in classical embryology. Here we describe the draft genomes of the owl limpet (Lottia gigantea), a marine polychaete (Capitella teleta) and a freshwater leech (Helobdella robusta), and compare them with other animal genomes to investigate the origin and diversification of bilaterians from a genomic perspective. We find that the genome organization, gene structure and functional content of these species are more similar to those of some invertebrate deuterostome genomes (for example, amphioxus and sea urchin) than those of other protostomes that have been sequenced to date (flies, nematodes and flatworms). The conservation of these genomic features enables us to expand the inventory of genes present in the last common bilaterian ancestor, establish the tripartite diversification of bilaterians using multiple genomic characteristics and identify ancient conserved long- and short-range genetic linkages across metazoans. Superimposed on this broadly conserved pan-bilaterian background we find examples of lineage-specific genome evolution, including varying rates of rearrangement, intron gain and loss, expansions and contractions of gene families, and the evolution of clade-specific genes that produce the unique content of each genome.
Anderson, Olin D; Coleman-Derr, Devin; Gu, Yong Q; Heath, Sekou
2010-06-16
Among the dietary essential amino acids, the most severely limiting in the cereals is lysine. Since cereals make up half of the human diet, lysine limitation has quality/nutritional consequences. The breakdown of lysine is controlled mainly by the catabolic bifunctional enzyme lysine ketoglutarate reductase - saccharopine dehydrogenase (LKR/SDH). The LKR/SDH gene has been reported to produce transcripts for the bifunctional enzyme and separate monofunctional transcripts. In addition to lysine metabolism, this gene has been implicated in a number of metabolic and developmental pathways, which along with its production of multiple transcript types and complex exon/intron structure suggest an important node in plant metabolism. Understanding more about the LKR/SDH gene is thus interesting both from applied standpoint and for basic plant metabolism. The current report describes a wheat genomic fragment containing an LKR/SDH gene and adjacent genes. The wheat LKR/SDH genomic segment was found to originate from the A-genome of wheat, and EST analysis indicates all three LKR/SDH genes in hexaploid wheat are transcriptionally active. A comparison of a set of plant LKR/SDH genes suggests regions of greater sequence conservation likely related to critical enzymatic functions and metabolic controls. Although most plants contain only a single LKR/SDH gene per genome, poplar contains at least two functional bifunctional genes in addition to a monofunctional LKR gene. Analysis of ESTs finds evidence for monofunctional LKR transcripts in switchgrass, and monofunctional SDH transcripts in wheat, Brachypodium, and poplar. The analysis of a wheat LKR/SDH gene and comparative structural and functional analyses among available plant genes provides new information on this important gene. 
Both the structure of the LKR/SDH gene and the immediately adjacent genes show lineage-specific differences between monocots and dicots, and findings suggest variation in activity of LKR/SDH genes among plants. Although most plant genomes seem to contain a single conserved LKR/SDH gene per genome, poplar possesses multiple contiguous genes. A preponderance of SDH transcripts suggests the LKR region may be more rate-limiting. Only switchgrass has EST evidence for LKR monofunctional transcripts. Evidence for monofunctional SDH transcripts shows a novel intron in wheat, Brachypodium, and poplar.
Andersen, Øivind; De Rosa, Maria Cristina; Pirolli, Davide; Tooming-Klunderud, Ave; Petersen, Petra E; André, Carl
2011-05-25
Background The two homologous iron-binding lobes of transferrins are thought to have evolved by gene duplication of an ancestral monolobal form, but any conserved synteny between bilobal and monolobal transferrin loci remains unexplored. The important role played by transferrin in the resistance to invading pathogens makes this polymorphic gene a highly valuable candidate for studying adaptive divergence among local populations. Results The Atlantic cod genome was shown to harbour two tandem duplicated serum transferrin genes (Tf1, Tf2), a melanotransferrin gene (MTf), and a monolobal transferrin gene (Omp). Whereas Tf1 and Tf2 were differentially expressed in liver and brain, the Omp transcript was restricted to the otoliths. Fish, chicken and mammals showed highly conserved syntenic regions in which monolobal and bilobal transferrins reside, but contrasting with tetrapods, the fish transferrin genes are positioned on three different linkage groups. Sequence alignment of cod Tf1 cDNAs from Northeast (NE) and Northwest (NW) Atlantic populations revealed 22 single nucleotide polymorphisms (SNP) causing the replacement of 16 amino acids, including eight surface residues revealed by the modelled 3D-structures, that might influence the binding of pathogens for removal of iron. SNP analysis of a total of 375 individuals from 14 trans-Atlantic populations showed that the Tf1-NE variant was almost fixed in the Baltic cod and predominated in the other NE Atlantic populations, whereas the NW Atlantic populations were more heterozygous and showed high frequencies of the Tf-NW SNP alleles. Conclusions The highly conserved synteny between fish and tetrapod transferrin loci infers that the fusion of tandem duplicated Omp-like genes gave rise to the modern transferrins. The multiple nonsynonymous substitutions in cod Tf1 with putative structural effects, together with highly divergent allele frequencies among different cod populations, strongly suggest evidence for positive selection and local adaptation in trans-Atlantic cod populations. PMID:21612617
Zhang, Bo; Peng, Yu; Zheng, Jincheng; Liang, Lina; Hoffmann, Ary A; Ma, Chun-Sen
2016-07-01
Heat shock protein gene (Hsp) families are thought to be important in thermal adaptation, but their expression patterns under various thermal stresses have still been poorly characterized outside of model systems. We have therefore characterized Hsp genes and their stress responses in the oriental fruit moth (OFM), Grapholita molesta, a widespread global orchard pest, and compared patterns of expression in this species to that of other insects. Genes from four Hsp families showed variable expression levels among tissues and developmental stages. Members of the Hsp40, 70, and 90 families were highly expressed under short exposures to heat and cold. Expression of Hsp40, 70, and Hsc70 family members increased in OFM undergoing diapause, while Hsp90 was downregulated. We found that there was strong sequence conservation of members of large Hsp families (Hsp40, Hsp60, Hsp70, Hsc70) across taxa, but this was not always matched by conservation of expression patterns. When the large Hsps as well as small Hsps from OFM were compared under acute and ramping heat stress, two groups of sHsps expression patterns were apparent, depending on whether expression increased or decreased immediately after stress exposure. These results highlight potential differences in conservation of function as opposed to sequence in this gene family and also point to Hsp genes potentially useful as bioindicators of diapause and thermal stress in OFM.
Second chance for the plains bison
Freese, Curtis H.; Aune, K.; Boyd, D.; Derr, James N.; Forrest, Steven C.; Gates, C. Cormack; Gogan, Peter J.; Grassel, Shaun M.; Halbert, Natalie D.; Kunkel, Kyran; Redford, Kent
2007-01-01
Before European settlement the plains bison (Bison bison bison) numbered in the tens of millions across most of the temperate region of North America. Within the span of a few decades during the mid- to late-1800s its numbers were reduced by hunting and other factors to a few hundred. The plight of the plains bison led to one of the first major movements in North America to save an endangered species. A few individuals and the American Bison Society rescued the remaining animals. Attempts to hybridize cattle and bison when bison numbers were low resulted in extensive cattle gene introgression in bison. Today, though approximately 500,000 plains bison exist in North America, few are free of cattle gene introgression, 96% are subject to anthropogenic selection for commodity production, and only 4% are in herds managed primarily for conservation purposes. Small herd size, artificial selection, cattle-gene introgression, and other factors threaten the diversity and integrity of the bison genome. In addition, the bison is for all practical purposes ecologically extinct across its former range, with multiple consequences for grassland biodiversity. Urgent measures are needed to conserve the wild bison genome and to restore the ecological role of bison in grassland ecosystems. Socioeconomic trends in the Great Plains, combined with new information about bison conservation needs and new conservation initiatives by both the public and private sectors, have set the stage for significant progress in bison conservation over the next few years.
Prohibitin-2 gene reveals sex-related differences in the salmon louse Caligus rogercresseyi.
Farlora, Rodolfo; Nuñez-Acuña, Gustavo; Gallardo-Escárate, Cristian
2015-06-10
Prohibitins are evolutionarily conserved proteins present in multiple cellular compartments, and are involved in diverse cellular processes, including steroid hormone transcription and gametogenesis. In the present study, we report for the first time the characterization of the prohibitin-2 (Phb2) gene in the sea louse Caligus rogercresseyi. The CrPhb2 cDNA had a total length of 1406 base pairs (bp) and contained a predicted open reading frame (ORF) of 894 bp encoding 298 amino acids. Multiple sequence alignments of prohibitin proteins from other arthropods revealed a high degree of amino acid sequence conservation. In silico Illumina read counts and RT-qPCR analyses showed a sex-dependent differential expression, with mRNA levels exhibiting a 1.7-fold (RT-qPCR) increase in adult females compared with adult males. A total of nine single nucleotide polymorphisms (SNPs) were identified: three were located in the 5' UTR of the Phb2 messenger and six in the ORF, but no mutations associated with sex were found. These results expand the present knowledge of the reproduction-related genes in C. rogercresseyi, and may be useful in future experiments aimed at controlling the impacts of sea lice in fish farming.
Evolutionary conservation and expression of miR-10a-3p in olive flounder and rock bream.
Jo, Ara; Im, Jennifer; Lee, Hee-Eun; Jang, Dongmin; Nam, Gyu-Hwi; Mishra, Anshuman; Kim, Woo-Jin; Kim, Won; Cha, Hee-Jae; Kim, Heui-Soo
2017-09-10
MicroRNAs (miRNAs) are small non-coding RNAs (ncRNAs) that mainly bind to the seed sequences located within the 3' untranslated region (3' UTR) of target genes. They perform an important biological function as regulators of gene expression. Different genes can be regulated by the same miRNA, whilst different miRNAs can be regulated by the same genes. Here, the evolutionary conservation and expression pattern of miR-10a-3p in olive flounder and rock bream were examined. Binding sites (AAAUUC) to the seed region in the 3' UTR of target genes were highly conserved in various species. The expression pattern of miR-10a-3p was ubiquitous in the examined tissues, whilst its expression level was decreased in gill tissues infected by viral hemorrhagic septicemia virus (VHSV) compared to the normal control. In the case of rock bream, the spleen, kidney, and liver tissues showed dominant expression levels of miR-10a-3p. Only the liver tissues in the rock bream samples infected by the iridovirus indicated a dominant miR-10a-3p expression. The gene ontology (GO) analysis of predicted target genes for miR-10a-3p revealed that multiple genes are related to binding activity, catalytic activity, cell components as well as cellular and metabolic process. Overall the results imply that miR-10a-3p could be used as a biomarker to detect VHSV infection in olive flounder and iridovirus infection in rock bream. In addition, the data provide fundamental information for further study of the complex interaction between miR-10a-3p and gene expression.
Evolution of the vertebrate insulin receptor substrate (Irs) gene family.
Al-Salam, Ahmad; Irwin, David M
2017-06-23
Insulin receptor substrate (Irs) proteins are essential for insulin signaling as they allow downstream effectors to dock with, and be activated by, the insulin receptor. A family of four Irs proteins has been identified in mice; however, the gene for one of these, IRS3, has been pseudogenized in humans. While it is known that the Irs gene family originated in vertebrates, it is not known when it originated and which members are most closely related to each other. A better understanding of the evolution of Irs genes and proteins should provide insight into the regulation of metabolism by insulin. Multiple genes for Irs proteins were identified in a wide variety of vertebrate species. Phylogenetic and genomic neighborhood analyses indicate that this gene family originated very early in vertebrate evolution. Most Irs genes were duplicated and retained in fish after the fish-specific genome duplication. Irs genes have been lost in various lineages, including Irs3 in primates and birds and Irs1 in most fish. Irs3 and Irs4 experienced an episode of more rapid protein sequence evolution on the ancestral mammalian lineage. Comparisons of the conservation of the protein sequences among Irs paralogs show that domains involved in binding to the plasma membrane and insulin receptors are most strongly conserved, while divergence has occurred in sequences involved in interacting with downstream effector proteins. The Irs gene family originated very early in vertebrate evolution, likely through genome duplications, and in parallel with duplications of other components of the insulin signaling pathway, including insulin and the insulin receptor. While the N-terminal sequences of these proteins are conserved among the paralogs, changes in the C-terminal sequences likely allowed changes in biological function.
Schnable, James C.; Pedersen, Brent S.; Subramaniam, Sabarinath; Freeling, Michael
2011-01-01
Whole genome duplications, or tetraploidies, are an important source of increased gene content. Following whole genome duplication, duplicate copies of many genes are lost from the genome. This loss of genes is biased both in the classes of genes deleted and the subgenome from which they are lost. Many or all classes of genes preferentially retained as duplicate copies are engaged in dose-sensitive protein–protein interactions, such that deletion of any one duplicate upsets the status quo of subunit concentrations, and presumably lowers fitness as a result. Transcription factors are also preferentially retained following every whole genome duplication studied. This has been explained as a consequence of protein–protein interactions, just as for other highly retained classes of genes. We show that the quantity of conserved noncoding sequences (CNSs) associated with genes predicts the likelihood of their retention as duplicate pairs following whole genome duplication. As many CNSs likely represent binding sites for transcriptional regulators, we propose that the likelihood of gene retention following tetraploidy may also be influenced by dose-sensitive protein–DNA interactions between the regulatory regions of CNS-rich genes – nicknamed bigfoot genes – and the proteins that bind to them. Using grass genomes, we show that differential loss of CNSs from one member of a pair following the pre-grass tetraploidy reduces its chance of retention in the subsequent maize lineage tetraploidy. PMID:22645525
Evolutionary genetics of insect innate immunity.
Viljakainen, Lumi
2015-11-01
Patterns of evolution in immune defense genes help to understand the evolutionary dynamics between hosts and pathogens. Multiple insect genomes have been sequenced, with many of them having annotated immune genes, which paves the way for a comparative genomic analysis of insect immunity. In this review, I summarize the current state of comparative and evolutionary genomics of insect innate immune defense. The focus is on the conserved and divergent components of immunity with an emphasis on gene family evolution and evolution at the sequence level; both population genetics and molecular evolution frameworks are considered.
Mitochondrial DNA Mutation Associated with Leber's Hereditary Optic Neuropathy
NASA Astrophysics Data System (ADS)
Wallace, Douglas C.; Singh, Gurparkash; Lott, Marie T.; Hodge, Judy A.; Schurr, Theodore G.; Lezza, Angela M. S.; Elsas, Louis J.; Nikoskelainen, Eeva K.
1988-12-01
Leber's hereditary optic neuropathy is a maternally inherited disease resulting in optic nerve degeneration and cardiac dysrhythmia. A mitochondrial DNA replacement mutation was identified that correlated with this disease in multiple families. This mutation converted a highly conserved arginine to a histidine at codon 340 in the NADH dehydrogenase subunit 4 gene and eliminated an Sfa NI site, thus providing a simple diagnostic test. This finding demonstrated that a nucleotide change in a mitochondrial DNA energy production gene can result in a neurological disease.
NASA Technical Reports Server (NTRS)
Fritzsch, B.; Beisel, K. W.; Bermingham, N. A.
2000-01-01
This brief overview shows that a start has been made to molecularly dissect vertebrate ear development and its evolutionary conservation to the development of the insect hearing organ. However, neither the patterning process of the ear nor the patterning process of insect sensory organs is sufficiently known at the moment to provide more than a first glimpse. Moreover, hardly anything is known about otocyst development of the cephalopod molluscs, another triploblast lineage that evolved complex 'ears'. We hope that the apparent conserved functional and cellular components present in the ciliated sensory neurons/hair cells will also be found in the genes required for vertebrate ear and insect sensory organ morphogenesis (Fig. 3). Likewise, we expect that homologous pre-patterning genes will soon be identified for the non-sensory cell development, which is more than a blocking of neuronal development through the Delta/Notch signaling system. Generation of the apparently unique ear could thus represent a multiplication of non-sensory cells by asymmetric and symmetric divisions as well as modification of existing patterning process by implementing novel developmental modules. In the final analysis, the vertebrate ear may come about by increasing the level of gene interactions in an already existing and highly conserved interactive cascade of bHLH genes. Since this was apparently achieved in all three lineages of triploblasts independently (Fig. 3), we now need to understand how much of the morphogenetic cascades are equally conserved across phyla to generate complex ears. The existing mutations in humans and mice may be able to point the direction of future research to understand the development of specific cell types and morphologies in the formation of complex arthropod, cephalopod, and vertebrate 'ears'.
Micropropagation and in vitro conservation of vanilla (Vanilla planifolia Andrews).
Divakaran, Minoo; Babu, K Nirmal
2009-01-01
Vanilla (Vanilla planifolia Andrews, syn. V. fragrans Salisb.), a source of natural vanillin, plays a major positive role in the economy of several countries. Native to Central America, its primary gene pool is threatened by deforestation and overcollection, which have resulted in the disappearance of natural habitats and wild species. Therefore, multiplication and conservation of vanilla diversity is of paramount importance because of its narrow genetic base. It plays an important role in the production of disease-free planting material for commercial cultivation. Simple protocols for micropropagation, in vitro conservation and synthetic seed production are described in this chapter, which could further be applied to other related vanilla species as well.
Furihata, Hazuka Y; Suenaga, Kazuya; Kawanabe, Takahiro; Yoshida, Takanori; Kawabe, Akira
2016-10-13
PRC2 genes were analyzed for their number of gene duplications, dN/dS ratios and expression patterns among Brassicaceae and Gramineae species. Although both amino acid sequences and copy number of the PRC2 genes were generally well conserved in both Brassicaceae and Gramineae species, we observed that some rapidly evolving genes experienced duplications and expression pattern changes. After multiple duplication events, all but one or two of the duplicated copies tend to be silenced. Silenced copies were reactivated in the endosperm and showed ectopic expression in developing seeds. The results indicated that rapid evolution of some PRC2 genes is initially caused by a relaxation of selective constraint following the gene duplication events. Several loci could become maternally expressed imprinted genes and acquire functional roles in the endosperm.
Heraty, Joanne M; Ellstrand, Norman C
Contemporary germplasm conservation studies largely focus on ex situ and in situ management of diversity within centers of genetic diversity. Transnational migrants who transport and introduce landraces to new locations may catalyze a third type of conservation that combines both approaches. Resulting populations may support reduced diversity as a result of evolutionary forces such as genetic drift, selection, and gene flow, yet they may also be more diverse as a result of multiple introductions, selective breeding and cross pollination among multiple introduced varietals. In this study, we measured the amount and structure of maize molecular genetic diversity in samples collected from home gardens and community gardens maintained by immigrant farmers in Southern California. We used the same markers to measure the genetic diversity and structure of commercially available maize varieties and compared our data to previously reported genetic diversity statistics of Mesoamerican landraces. Our results reveal that transnational dispersal creates an opportunity for the maintenance of maize genetic diversity beyond its recognized centers of diversity.
Urzica, Eugen I.; Casero, David; Yamasaki, Hiroaki; Hsieh, Scott I.; Adler, Lital N.; Karpowicz, Steven J.; Blaby-Haas, Crysten E.; Clarke, Steven G.; Loo, Joseph A.; Pellegrini, Matteo; Merchant, Sabeeha S.
2012-01-01
We surveyed the iron nutrition-responsive transcriptome of Chlamydomonas reinhardtii using RNA-Seq methodology. Presumed primary targets were identified in comparisons between visually asymptomatic iron-deficient versus iron-replete cells. This includes the known components of high-affinity iron uptake as well as candidates for distributive iron transport in C. reinhardtii. Comparison of growth-inhibited iron-limited versus iron-replete cells revealed changes in the expression of genes in chloroplastic oxidative stress response pathways, among hundreds of other genes. The output from the transcriptome was validated at multiple levels: by quantitative RT-PCR for assessing the data analysis pipeline, by quantitative proteomics for assessing the impact of changes in RNA abundance on the proteome, and by cross-species comparison for identifying conserved or universal response pathways. In addition, we assessed the functional importance of three target genes, VITAMIN C 2 (VTC2), MONODEHYDROASCORBATE REDUCTASE 1 (MDAR1), and CONSERVED IN THE GREEN LINEAGE AND DIATOMS 27 (CGLD27), by biochemistry or reverse genetics. VTC2 and MDAR1, which are key enzymes in de novo ascorbate synthesis and ascorbate recycling, respectively, are likely responsible for the 10-fold increase in ascorbate content of iron-limited cells. CGLD27/At5g67370 is a highly conserved, presumed chloroplast-localized pioneer protein and is important for growth of Arabidopsis thaliana in low iron. PMID:23043051
Strickland, Michelle; Tudorica, Victor; Řezáč, Milan; Thomas, Neil R; Goodacre, Sara L
2018-06-01
Spiders produce multiple silks with different physical properties that allow them to occupy a diverse range of ecological niches, including the underwater environment. Despite this functional diversity, past molecular analyses show a high degree of amino acid sequence similarity between C-terminal regions of silk genes that appear to be independent of the physical properties of the resulting silks; instead, this domain is crucial to the formation of silk fibers. Here, we present an analysis of the C-terminal domain of all known types of spider silk and include silk sequences from the spider Argyroneta aquatica, which spins the majority of its silk underwater. Our work indicates that spiders have retained a highly conserved mechanism of silk assembly, despite the extraordinary diversification of species, silk types and applications of silk over 350 million years. Sequence analysis of the silk C-terminal domain across the entire gene family shows the conservation of two uncommon amino acids that are implicated in the formation of a salt bridge, a functional bond essential to protein assembly. This conservation extends to the novel sequences isolated from A. aquatica. This finding is relevant to research regarding the artificial synthesis of spider silk, suggesting that synthesis of all silk types will be possible using a single process.
Lu, Hong; Patil, Prabhu; Van Sluys, Marie-Anne; White, Frank F; Ryan, Robert P; Dow, J Maxwell; Rabinowicz, Pablo; Salzberg, Steven L; Leach, Jan E; Sonti, Ramesh; Brendel, Volker; Bogdanove, Adam J
2008-01-01
Xanthomonas is a large genus of plant-associated and plant-pathogenic bacteria. Collectively, members cause diseases on over 392 plant species. Individually, they exhibit marked host- and tissue-specificity. The determinants of this specificity are unknown. To assess potential contributions to host- and tissue-specificity, pathogenesis-associated gene clusters were compared across genomes of eight Xanthomonas strains representing vascular or non-vascular pathogens of rice, brassicas, pepper and tomato, and citrus. The gum cluster for extracellular polysaccharide is conserved except for gumN and sequences downstream. The xcs and xps clusters for type II secretion are conserved, except in the rice pathogens, in which xcs is missing. In the otherwise conserved hrp cluster, sequences flanking the core genes for type III secretion vary with respect to insertion sequence element and putative effector gene content. Variation at the rpf (regulation of pathogenicity factors) cluster is more pronounced, though genes with established functional relevance are conserved. A cluster for synthesis of lipopolysaccharide varies highly, suggesting multiple horizontal gene transfers and reassortments, but this variation does not correlate with host- or tissue-specificity. Phylogenetic trees based on amino acid alignments of gum, xps, xcs, hrp, and rpf cluster products generally reflect strain phylogeny. However, amino acid residues at four positions correlate with tissue specificity, revealing hpaA and xpsD as candidate determinants. Examination of genome sequences of xanthomonads Xylella fastidiosa and Stenotrophomonas maltophilia revealed that the hrp, gum, and xcs clusters are recent acquisitions in the Xanthomonas lineage. 
Our results provide insight into the ancestral Xanthomonas genome and indicate that differentiation with respect to host- and tissue-specificity involved not major modifications or wholesale exchange of clusters, but subtle changes in a small number of genes or in non-coding sequences, and/or differences outside the clusters, potentially among regulatory targets or secretory substrates.
Chaiboonchoe, Amphun; Ghamsari, Lila; Dohai, Bushra; Ng, Patrick; Khraiwesh, Basel; Jaiswal, Ashish; Jijakli, Kenan; Koussa, Joseph; Nelson, David R; Cai, Hong; Yang, Xinping; Chang, Roger L; Papin, Jason; Yu, Haiyuan; Balaji, Santhanam; Salehi-Ashtiani, Kourosh
2016-07-19
Metabolic networks, which are mathematical representations of organismal metabolism, are reconstructed to provide computational platforms to guide metabolic engineering experiments and explore fundamental questions on metabolism. Systems level analyses, such as interrogation of phylogenetic relationships within the network, can provide further guidance on the modification of metabolic circuitries. Chlamydomonas reinhardtii, a biofuel relevant green alga that has retained key genes with plant, animal, and protist affinities, serves as an ideal model organism to investigate the interplay between gene function and phylogenetic affinities at multiple organizational levels. Here, using detailed topological and functional analyses, coupled with transcriptomics studies on a metabolic network that we have reconstructed for C. reinhardtii, we show that network connectivity has a significant concordance with the co-conservation of genes; however, a distinction between topological and functional relationships is observable within the network. Dynamic and static modes of co-conservation were defined and observed in a subset of gene-pairs across the network topologically. In contrast, genes with predicted synthetic interactions, or genes involved in coupled reactions, show significant enrichment for both shorter and longer phylogenetic distances. Based on our results, we propose that the metabolic network of C. reinhardtii is assembled with an architecture to minimize phylogenetic profile distances topologically, while it includes an expansion of such distances for functionally interacting genes. This arrangement may increase the robustness of C. reinhardtii's network in dealing with varied environmental challenges that the species may face. The defined evolutionary constraints within the network, which identify important pairings of genes in metabolism, may offer guidance on synthetic biology approaches to optimize the production of desirable metabolites.
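The notion of a phylogenetic profile distance used above can be illustrated with a minimal sketch. It assumes each gene is summarised as a presence/absence vector over a fixed list of reference genomes and that distance is the Hamming distance between two vectors; the abstract does not state which metric the authors used, and the gene names and profiles below are hypothetical.

```python
from itertools import combinations

def profile_distance(a, b):
    """Hamming distance between two presence/absence profiles of equal length."""
    if len(a) != len(b):
        raise ValueError("profiles must cover the same set of genomes")
    return sum(x != y for x, y in zip(a, b))

# Hypothetical profiles: 1 = ortholog present in a reference genome, 0 = absent.
profiles = {
    "geneA": (1, 1, 0, 1, 0),
    "geneB": (1, 1, 0, 1, 1),
    "geneC": (0, 0, 1, 0, 1),
}

# Distance for every unordered gene pair; small values indicate co-conservation.
pairwise = {
    (g1, g2): profile_distance(profiles[g1], profiles[g2])
    for g1, g2 in combinations(sorted(profiles), 2)
}

for pair, dist in sorted(pairwise.items()):
    print(pair, dist)
```

In a network analysis of this kind, such pairwise distances would then be compared between gene pairs that are adjacent in the network and pairs that are not.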
Lu, Shun-Wen; Chen, Shiyan; Wang, Jianying; Yu, Hang; Chronis, Demosthenis; Mitchum, Melissa G; Wang, Xiaohong
2009-09-01
Plant CLAVATA3/ESR-related (CLE) peptides have diverse roles in plant growth and development. Here, we report the isolation and functional characterization of five new CLE genes from the potato cyst nematode Globodera rostochiensis. Unlike typical plant CLE peptides that contain a single CLE motif, four of the five Gr-CLE genes encode CLE proteins with multiple CLE motifs. These Gr-CLE genes were found to be specifically expressed within the dorsal esophageal gland cell of nematode parasitic stages, suggesting a role for their encoded proteins in plant parasitism. Overexpression phenotypes of Gr-CLE genes in Arabidopsis mimicked those of plant CLE genes, and Gr-CLE proteins could rescue the Arabidopsis clv3-2 mutant phenotype when expressed within meristems. A short root phenotype was observed when synthetic GrCLE peptides were exogenously applied to roots of Arabidopsis or potato similar to the overexpression of Gr-CLE genes in Arabidopsis and potato hairy roots. These results reveal that G. rostochiensis CLE proteins with either single or multiple CLE motifs function similarly to plant CLE proteins and that CLE signaling components are conserved in both Arabidopsis and potato roots. Furthermore, our results provide evidence to suggest that the evolution of multiple CLE motifs may be an important mechanism for generating functional diversity in nematode CLE proteins to facilitate parasitism.
Ye, Fei; Lan, Xu-e; Zhu, Wen-bo; You, Ping
2016-01-01
Insect mitochondrial genomes (mitogenomes) contain a conserved set of 37 genes for an extensive diversity of lineages. Previously reported dictyopteran mitogenomes share this conserved mitochondrial gene arrangement, although surprisingly little is known about the mitogenome of Mantodea. We sequenced eight mantodean mitogenomes including the first representatives of two families: Hymenopodidae and Liturgusidae. Only two of these genomes retain the typical insect gene arrangement. In three Liturgusidae species, the trnM genes have translocated. Four species of mantis (Creobroter gemmata, Mantis religiosa, Statilia sp., and Theopompa sp.-HN) have multiple identical tandem duplication of trnR, and Statilia sp. additionally includes five extra duplicate trnW. These extra trnR and trnW in Statilia sp. are erratically arranged and form another novel gene order. Interestingly, the extra trnW is converted from trnR by the process of point mutation at anticodon, which is the first case of tRNA reassignment for an insect. Furthermore, no significant differences were observed amongst mantodean mitogenomes with variable copies of tRNA according to comparative analysis of codon usage. Combined with phylogenetic analysis, the characteristics of tRNA only possess limited phylogenetic information in this research. Nevertheless, these features of gene rearrangement, duplication, and reassignment provide valuable information toward understanding mitogenome evolution in insects. PMID:27157299
Ferguson, Laura C; Maroja, Luana; Jiggins, Chris D
2011-12-01
The evolution of pigmentation in vertebrates and flies has involved repeated divergence at a small number of genes related to melanin synthesis. Here, we study insect melanin synthesis genes in Heliconius butterflies, a group characterised by its diversity of wing patterns consisting of black (melanin), and yellow and red (ommochrome) pigmented scales. Consistent with their respective biochemical roles in Drosophila melanogaster, ebony is upregulated in non-melanic wing regions destined to be pigmented red whilst tan is upregulated in melanic regions. Wing regions destined to be pigmented yellow, however, are downregulated for both genes. This pattern is conserved across multiple divergent and convergent phenotypes within the Heliconii, suggesting a conserved mechanism for the development of black, red and yellow pattern elements across the genus. Linkage mapping of five melanin biosynthesis genes showed that, in contrast to other organisms, these genes do not control pattern polymorphism. Thus, the pigmentation genes themselves are not the locus of evolutionary change but lie downstream of a wing pattern regulatory factor. The results suggest a modular system in which particular combinations of genes are switched on whenever red, yellow or black pattern elements are favoured by natural selection for diverse and mimetic wing patterns. © Springer-Verlag 2011
WormQTLHD--a web database for linking human disease to natural variation data in C. elegans.
van der Velde, K Joeri; de Haan, Mark; Zych, Konrad; Arends, Danny; Snoek, L Basten; Kammenga, Jan E; Jansen, Ritsert C; Swertz, Morris A; Li, Yang
2014-01-01
Interactions between proteins are highly conserved across species. As a result, the molecular basis of multiple diseases affecting humans can be studied in model organisms that offer many alternative experimental opportunities. One such organism, Caenorhabditis elegans, has been used to produce much molecular quantitative genetics and systems biology data over the past decade. We present WormQTL(HD) (Human Disease), a database that quantitatively and systematically links expression Quantitative Trait Loci (eQTL) findings in C. elegans to gene-disease associations in humans. WormQTL(HD), available online at http://www.wormqtl-hd.org, is a user-friendly set of tools to reveal functionally coherent, evolutionarily conserved gene networks. These can be used to predict novel gene-to-gene associations and the functions of genes underlying the disease of interest. We created a new database that links C. elegans eQTL data sets to human diseases (34 337 gene-disease associations from OMIM, DGA, GWAS Central and NHGRI GWAS Catalogue) based on overlapping sets of orthologous genes associated with phenotypes in these two species. We utilized QTL results, high-throughput molecular phenotypes, classical phenotypes and genotype data covering different developmental stages and environments from the WormQTL database. All software is available as open source, built on MOLGENIS and xQTL workbench.
Ren, He-Lin; Hu, Yuan; Guo, Ya-Jun; Li, Lu-Lin
2016-06-01
Within Baculoviridae, little is known about the molecular mechanisms of replication in betabaculoviruses, despite extensive studies in alphabaculoviruses. In this study, the promoters of nine late genes of the betabaculovirus Plutella xylostella granulovirus (PlxyGV) were cloned into a transient expression vector and the alphabaculovirus Autographa californica multiple nucleopolyhedrovirus (AcMNPV) genome, and compared with homologous late gene promoters of AcMNPV in Sf9 cells. In transient expression assays, all PlxyGV late promoters were activated in cells transfected with the individual reporter plasmids together with an AcMNPV bacmid. In infected cells, reporter gene expression levels with the promoters of PlxyGV e18 and AcMNPV vp39 and gp41 were significantly higher than those of the corresponding AcMNPV or PlxyGV promoters, which had fewer late promoter motifs. Observed expression levels were lower for the PlxyGV p6.9, pk1, gran, p10a, and p10b promoters than for the corresponding AcMNPV promoters, despite equal numbers of late promoter motifs, indicating that species-specific elements contained in some late promoters were favored by the native viral RNA polymerases for optimal transcription. The 8-nt sequence TAAATAAG encompassing the ATAAG motif was conserved in the AcMNPV polh, p10, and pk1 promoters. The 5-nt sequence CAATT located 4 or 5 nt upstream of the T/ATAAG motif was conserved in the promoters of PlxyGV gran, p10c, and pk1. The results of this study demonstrated that PlxyGV late gene promoters could be effectively activated by the RNA polymerase from AcMNPV, implying that late gene expression systems are regulated by similar mechanisms in alphabaculoviruses and betabaculoviruses.
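The motif relationships described above can be checked on a promoter sequence with a short scan. This is a sketch only: it reads "T/ATAAG" as TTAAG or ATAAG, and it interprets "4 or 5 nt upstream" as the number of bases between the end of CAATT and the start of the late motif. Both readings are assumptions, and the function names and test sequence are hypothetical.

```python
import re

# Lookahead so that overlapping motif occurrences are all reported.
LATE_MOTIF = re.compile(r"(?=([AT]TAAG))")

def find_late_motifs(seq):
    """Return 0-based start positions of TTAAG/ATAAG in the sequence."""
    return [m.start() for m in LATE_MOTIF.finditer(seq.upper())]

def has_upstream_caatt(seq, motif_start, gaps=(4, 5)):
    """True if CAATT ends exactly `gap` nt before the motif for an allowed gap."""
    seq = seq.upper()
    for gap in gaps:
        start = motif_start - gap - len("CAATT")
        if start >= 0 and seq[start:start + 5] == "CAATT":
            return True
    return False

# Hypothetical promoter fragment: CAATT, a 4-nt spacer, then ATAAG.
promoter = "GGCAATTACGTATAAGCC"
for pos in find_late_motifs(promoter):
    print(pos, has_upstream_caatt(promoter, pos))
```

If the intended spacing is measured differently (for example, from the first base of CAATT), only the `start` calculation needs to change.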
Human Variation in Short Regions Predisposed to Deep Evolutionary Conservation
Loots, Gabriela G.; Ovcharenko, Ivan
2010-01-01
The landscape of the human genome consists of millions of short islands of conservation that are 100% conserved across multiple vertebrate genomes (termed “bricks”), the majority of which are located in noncoding regions. Several hundred thousand bricks are deeply conserved, reaching the genomes of amphibians and fish. Deep phylogenetic conservation of noncoding DNA has been reported to be strongly associated with the presence of gene regulatory elements, introducing bricks as a proxy for the functional noncoding landscape of the human genome. Here, we report a significant overrepresentation of bricks in the promoters of transcription factors and developmental genes, where the high level of phylogenetic conservation correlates with an increase in brick overrepresentation. We also found that the presence of a brick dictates a predisposition to evolutionary constraint, with only 0.7% of the amniota brick central nucleotides being diverged within the primate lineage, an 11-fold reduction in the divergence rate compared with random expectation. Human single-nucleotide polymorphism (SNP) data explain only 3% of primate-specific variation in amniota bricks, thus arguing for a widespread fixation of brick mutations within the primate lineage and prior to human radiation. This variation, in turn, might have been utilized as a driving force for primate- and hominoid-specific adaptation. We also discovered a pronounced deviation from the evolutionary predisposition in the human lineage, with an over 20-fold increase in the substitution rate at brick SNP sites over expected values. In addition, contrary to typical brick mutations, brick variation commonly encountered in the human population displays limited, if any, signatures of negative selection as measured by minor allele frequency and population differentiation (F-statistics).
These observations argue for the plasticity of gene regulatory mechanisms in vertebrates—with evidence of strong purifying selection acting on the gene regulatory landscape of the human genome, where widespread advantageous mutations in putative regulatory elements are likely utilized in functional diversification and adaptation of species. PMID:20093432
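The definition of a brick as a short stretch that is 100% conserved across multiple genomes can be sketched as a scan over the columns of a multiple sequence alignment. The minimum run length, the treatment of gaps as non-conserved, and the toy alignment are assumptions made for illustration; the abstract does not give the authors' thresholds or pipeline.

```python
def find_bricks(aligned, min_len=3):
    """Return half-open (start, end) column ranges in which every aligned
    sequence carries the same ungapped base for at least `min_len` columns."""
    if len({len(s) for s in aligned}) != 1:
        raise ValueError("aligned sequences must be non-empty and of equal length")
    bricks, run_start = [], None
    n = len(aligned[0])
    # Iterate one column past the end so a run touching the last column is closed.
    for col in range(n + 1):
        conserved = (
            col < n
            and aligned[0][col] != "-"
            and all(s[col] == aligned[0][col] for s in aligned)
        )
        if conserved and run_start is None:
            run_start = col
        elif not conserved and run_start is not None:
            if col - run_start >= min_len:
                bricks.append((run_start, col))
            run_start = None
    return bricks

# Hypothetical three-species alignment ("-" marks a gap).
alignment = [
    "ACGTACGTAA",
    "ACGTTCGTAA",
    "ACGTACG-AA",
]
print(find_bricks(alignment))
print(find_bricks(alignment, min_len=2))
```

Coordinates are alignment columns; mapping them back to positions in the human genome would require tracking gaps in the human row.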
McCormick, Mark A; Delaney, Joe R; Tsuchiya, Mitsuhiro; Tsuchiyama, Scott; Shemorry, Anna; Sim, Sylvia; Chou, Annie Chia-Zong; Ahmed, Umema; Carr, Daniel; Murakami, Christopher J; Schleit, Jennifer; Sutphin, George L; Wasko, Brian M; Bennett, Christopher F; Wang, Adrienne M; Olsen, Brady; Beyer, Richard P; Bammler, Theodor K; Prunkard, Donna; Johnson, Simon C; Pennypacker, Juniper K; An, Elroy; Anies, Arieanna; Castanza, Anthony S; Choi, Eunice; Dang, Nick; Enerio, Shiena; Fletcher, Marissa; Fox, Lindsay; Goswami, Sarani; Higgins, Sean A; Holmberg, Molly A; Hu, Di; Hui, Jessica; Jelic, Monika; Jeong, Ki-Soo; Johnston, Elijah; Kerr, Emily O; Kim, Jin; Kim, Diana; Kirkland, Katie; Klum, Shannon; Kotireddy, Soumya; Liao, Eric; Lim, Michael; Lin, Michael S; Lo, Winston C; Lockshon, Dan; Miller, Hillary A; Moller, Richard M; Muller, Brian; Oakes, Jonathan; Pak, Diana N; Peng, Zhao Jun; Pham, Kim M; Pollard, Tom G; Pradeep, Prarthana; Pruett, Dillon; Rai, Dilreet; Robison, Brett; Rodriguez, Ariana A; Ros, Bopharoth; Sage, Michael; Singh, Manpreet K; Smith, Erica D; Snead, Katie; Solanky, Amrita; Spector, Benjamin L; Steffen, Kristan K; Tchao, Bie Nga; Ting, Marc K; Vander Wende, Helen; Wang, Dennis; Welton, K Linnea; Westman, Eric A; Brem, Rachel B; Liu, Xin-Guang; Suh, Yousin; Zhou, Zhongjun; Kaeberlein, Matt; Kennedy, Brian K
2015-11-03
Many genes that affect replicative lifespan (RLS) in the budding yeast Saccharomyces cerevisiae also affect aging in other organisms such as C. elegans and M. musculus. We performed a systematic analysis of yeast RLS in a set of 4,698 viable single-gene deletion strains. Multiple functional gene clusters were identified, and full genome-to-genome comparison demonstrated a significant conservation in longevity pathways between yeast and C. elegans. Among the mechanisms of aging identified, deletion of tRNA exporter LOS1 robustly extended lifespan. Dietary restriction (DR) and inhibition of mechanistic Target of Rapamycin (mTOR) exclude Los1 from the nucleus in a Rad53-dependent manner. Moreover, lifespan extension from deletion of LOS1 is nonadditive with DR or mTOR inhibition, and results in Gcn4 transcription factor activation. Thus, the DNA damage response and mTOR converge on Los1-mediated nuclear tRNA export to regulate Gcn4 activity and aging. Copyright © 2015 Elsevier Inc. All rights reserved.
Tong, Pin; Monahan, Jack; Prendergast, James G D
2017-03-01
Large-scale gene expression datasets are providing an increasing understanding of the location of cis-eQTLs in the human genome and their role in disease. However, little is currently known regarding the extent of regulatory site-sharing between genes. This is despite it having potentially wide-ranging implications, from the determination of the way in which genetic variants may shape multiple phenotypes to the understanding of the evolution of human gene order. By first identifying the location of non-redundant cis-eQTLs, we show that regulatory site-sharing is a relatively common phenomenon in the human genome, with over 10% of non-redundant regulatory variants linked to the expression of multiple nearby genes. We show that these shared, local regulatory sites are linked to high levels of chromatin looping between the regulatory sites and their associated genes. In addition, these co-regulated gene modules are found to be strongly conserved across mammalian species, suggesting that shared regulatory sites have played an important role in shaping human gene order. The association of these shared cis-eQTLs with multiple genes means they also appear to be unusually important in understanding the genetics of human phenotypes and pleiotropy, with shared regulatory sites more often linked to multiple human phenotypes than other regulatory variants. This study shows that regulatory site-sharing is likely an underappreciated aspect of gene regulation and has important implications for the understanding of various biological phenomena, including how the two and three dimensional structures of the genome have been shaped and the potential causes of disease pleiotropy outside coding regions.
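The headline statistic, the share of regulatory variants linked to more than one gene, reduces to a simple count once cis-eQTL results are expressed as (variant, gene) pairs. The sketch below assumes that representation and uses hypothetical identifiers; it does not reproduce the step of identifying non-redundant cis-eQTLs.

```python
from collections import defaultdict

def shared_site_fraction(pairs):
    """Fraction of variants associated with the expression of two or more genes."""
    genes_by_variant = defaultdict(set)
    for variant, gene in pairs:
        genes_by_variant[variant].add(gene)
    if not genes_by_variant:
        return 0.0
    shared = sum(1 for genes in genes_by_variant.values() if len(genes) > 1)
    return shared / len(genes_by_variant)

# Hypothetical associations; duplicated rows are collapsed by the set.
eqtl_pairs = [
    ("var1", "GENE1"),
    ("var1", "GENE2"),
    ("var2", "GENE3"),
    ("var3", "GENE3"),
    ("var4", "GENE4"),
    ("var4", "GENE4"),
]
print(shared_site_fraction(eqtl_pairs))
```

Here only var1 is shared between genes, so one of four variants counts as a shared regulatory site.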
Ren, Ren; Sun, Yazhou; Zhao, Yue; Geiser, David
2016-01-01
A comprehensive and reliable eukaryotic tree of life is important for many aspects of biological studies from comparative developmental and physiological analyses to translational medicine and agriculture. Both gene-rich and taxon-rich approaches are effective strategies to improve phylogenetic accuracy and are greatly facilitated by marker genes that are universally distributed, well conserved, and orthologous among divergent eukaryotes. In this article, we report the identification of 943 low-copy eukaryotic genes and we show that many of these genes are promising tools in resolving eukaryotic phylogenies, despite the challenges of determining deep eukaryotic relationships. As a case study, we demonstrate that smaller subsets of ∼20 and 52 genes could resolve controversial relationships among widely divergent taxa and provide strong support for deep relationships such as the monophyly and branching order of several eukaryotic supergroups. In addition, the use of these genes resulted in fungal phylogenies that are congruent with previous phylogenomic studies that used much larger datasets, and successfully resolved several difficult relationships (e.g., forming a highly supported clade with Microsporidia, Mitosporidium and Rozella sister to other fungi). We propose that these genes are excellent for both gene-rich and taxon-rich analyses and can be applied at multiple taxonomic levels and facilitate a more complete understanding of the eukaryotic tree of life. PMID:27604879
Alternative Splicing of Barley Clock Genes in Response to Low Temperature
Calixto, Cristiane P. G.; Simpson, Craig G.; Waugh, Robbie; Brown, John W. S.
2016-01-01
Alternative splicing (AS) is a regulated mechanism that generates multiple transcripts from individual genes. It is widespread in eukaryotic genomes and provides an effective way to control gene expression. At low temperatures, AS regulates Arabidopsis clock genes through dynamic changes in the levels of productive mRNAs. We examined AS in barley clock genes to assess whether temperature-dependent AS responses also occur in a monocotyledonous crop species. We identify changes in AS of various barley core clock genes including the barley orthologues of Arabidopsis AtLHY and AtPRR7 which showed the most pronounced AS changes in response to low temperature. The AS events modulate the levels of functional and translatable mRNAs, and potentially protein levels, upon transition to cold. There is some conservation of AS events and/or splicing behaviour of clock genes between Arabidopsis and barley. In addition, novel temperature-dependent AS of the core clock gene HvPPD-H1 (a major determinant of photoperiod response and AtPRR7 orthologue) is conserved in monocots. HvPPD-H1 showed a rapid, temperature-sensitive isoform switch which resulted in changes in abundance of AS variants encoding different protein isoforms. This novel layer of low temperature control of clock gene expression, observed in two very different species, will help our understanding of plant adaptation to different environments and ultimately offer a new range of targets for plant improvement. PMID:27959947
The Silkworm (Bombyx mori) microRNAs and Their Expressions in Multiple Developmental Stages
Luo, Qibin; Cai, Yimei; Lin, Wen-chang; Chen, Huan; Yang, Yue; Hu, Songnian; Yu, Jun
2008-01-01
Background MicroRNAs (miRNAs) play crucial roles in various physiological processes through post-transcriptional regulation of gene expression and are involved in development, metabolism, and many other important molecular mechanisms and cellular processes. The Bombyx mori genome sequence provides opportunities for a thorough survey for miRNAs as well as comparative analyses with other sequenced insect species. Methodology/Principal Findings We identified 114 non-redundant conserved miRNAs and 148 novel putative miRNAs from the B. mori genome with an elaborate computational protocol. We also sequenced 6,720 clones from 14 developmental stage-specific small RNA libraries in which we identified 35 unique miRNAs containing 21 conserved miRNAs (including 17 predicted miRNAs) and 14 novel miRNAs (including 11 predicted novel miRNAs). Among the 114 conserved miRNAs, we found six pairs of clusters evolutionarily conserved across insect lineages. Our observations on length heterogeneity at 5′ and/or 3′ ends of nine miRNAs between cloned and predicted sequences, and three mature forms deriving from the same arm of putative pre-miRNAs suggest a mechanism by which miRNAs gain new functions. Analyzing development-related miRNA expression at 14 developmental stages based on clone-sampling and stem-loop RT PCR, we discovered an unusual abundance of 33 sequences representing 12 different miRNAs and sharply fluctuating expression of miRNAs at the larva-molting stage. The potential functions of several stage-biased miRNAs were also analyzed in combination with predicted target genes and silkworm's phenotypic traits; our results indicated that miRNAs may play key regulatory roles in specific developmental stages in the silkworm, such as ecdysis. Conclusions/Significance Taking a combined approach, we identified 118 conserved miRNAs and 151 novel miRNA candidates from the B. mori genome sequence. Our expression analyses by sampling miRNAs and real-time PCR over multiple developmental stages allowed us to pinpoint molting stages as hotspots of miRNA expression in both variety and quantity. Based on the analysis of target genes, we hypothesized that miRNAs regulate development through a particular emphasis on complex stages rather than general regulatory mechanisms. PMID:18714353
Conservation in the face of diversity: multistrain analysis of an intracellular bacterium
USDA-ARS's Scientific Manuscript database
Comparisons of multiple strains revealed that A. marginale has a closed-core genome with few highly plastic regions, which include the msp2 and msp3 genes, as well as the aaap locus. Comparison of the Florida and St. Maries genome sequences found that SNPs comprise 0.8% of the longer Florida genome,...
Comparative functional characterization of the CSR-1 22G-RNA pathway in Caenorhabditis nematodes.
Tu, Shikui; Wu, Monica Z; Wang, Jie; Cutter, Asher D; Weng, Zhiping; Claycomb, Julie M
2015-01-01
As a champion of small RNA research for two decades, Caenorhabditis elegans has revealed the essential Argonaute CSR-1 to play key nuclear roles in modulating chromatin, chromosome segregation and germline gene expression via 22G-small RNAs. Despite CSR-1 being preserved among diverse nematodes, the conservation and divergence in function of the targets of small RNA pathways remains poorly resolved. Here we apply comparative functional genomic analysis between C. elegans and Caenorhabditis briggsae to characterize the CSR-1 pathway, its targets and their evolution. C. briggsae CSR-1-associated small RNAs that we identified by immunoprecipitation-small RNA sequencing overlap with 22G-RNAs depleted in cbr-csr-1 RNAi-treated worms. By comparing 22G-RNAs and target genes between species, we defined a set of CSR-1 target genes with conserved germline expression, enrichment in operons and more slowly evolving coding sequences than other genes, along with a small group of evolutionarily labile targets. We demonstrate that the association of CSR-1 with chromatin is preserved, and show that depletion of cbr-csr-1 leads to chromosome segregation defects and embryonic lethality. This first comparative characterization of a small RNA pathway in Caenorhabditis establishes a conserved nuclear role for CSR-1 and highlights its key role in germline gene regulation across multiple animal species. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Criticality Is an Emergent Property of Genetic Networks that Exhibit Evolvability
Torres-Sosa, Christian; Huang, Sui; Aldana, Maximino
2012-01-01
Accumulating experimental evidence suggests that the gene regulatory networks of living organisms operate in the critical phase, namely, at the transition between ordered and chaotic dynamics. Such critical dynamics of the network permits the coexistence of robustness and flexibility which are necessary to ensure homeostatic stability (of a given phenotype) while allowing for switching between multiple phenotypes (network states) as occurs in development and in response to environmental change. However, the mechanisms through which genetic networks evolve such critical behavior have remained elusive. Here we present an evolutionary model in which criticality naturally emerges from the need to balance between the two essential components of evolvability: phenotype conservation and phenotype innovation under mutations. We simulated the Darwinian evolution of random Boolean networks that mutate gene regulatory interactions and grow by gene duplication. The mutating networks were subjected to selection for networks that both (i) preserve all the already acquired phenotypes (dynamical attractor states) and (ii) generate new ones. Our results show that this interplay between extending the phenotypic landscape (innovation) while conserving the existing phenotypes (conservation) suffices to cause the evolution of all the networks in a population towards criticality. Furthermore, the networks produced by this evolutionary process exhibit structures with hubs (global regulators) similar to the observed topology of real gene regulatory networks. Thus, dynamical criticality and certain elementary topological properties of gene regulatory networks can emerge as a byproduct of the evolvability of the phenotypic landscape. PMID:22969419
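The selection rule described in this abstract can be illustrated with a minimal sketch of a random Boolean network and its attractors. This is not the authors' model: gene duplication is left out, the mutation operator (flipping one truth-table entry) is an assumption made for illustration, and all function names (`make_network`, `step`, `find_attractor`, `attractors`, `mutate`, `accepted`) are hypothetical.

```python
import random
from itertools import product


def make_network(n_genes, k_inputs, rng):
    """Random Boolean network: each gene gets k regulators and a random truth table."""
    inputs = [rng.sample(range(n_genes), k_inputs) for _ in range(n_genes)]
    tables = [[rng.randint(0, 1) for _ in range(2 ** k_inputs)]
              for _ in range(n_genes)]
    return inputs, tables


def step(state, inputs, tables):
    """Synchronously update every gene from the current values of its regulators."""
    new_state = []
    for regulators, table in zip(inputs, tables):
        index = 0
        for r in regulators:
            index = (index << 1) | state[r]
        new_state.append(table[index])
    return tuple(new_state)


def find_attractor(state, inputs, tables):
    """Iterate until a state repeats and return the resulting cycle."""
    seen = {}
    trajectory = []
    while state not in seen:
        seen[state] = len(trajectory)
        trajectory.append(state)
        state = step(state, inputs, tables)
    cycle = trajectory[seen[state]:]
    # Rotate so the smallest state comes first; the same cycle then always compares equal.
    start = cycle.index(min(cycle))
    return tuple(cycle[start:] + cycle[:start])


def attractors(inputs, tables, n_genes):
    """Enumerate all attractors by exhaustive search (only feasible for small networks)."""
    return {find_attractor(s, inputs, tables)
            for s in product((0, 1), repeat=n_genes)}


def mutate(inputs, tables, rng):
    """Assumed mutation operator: flip one entry of one gene's truth table."""
    new_tables = [list(t) for t in tables]
    gene = rng.randrange(len(new_tables))
    entry = rng.randrange(len(new_tables[gene]))
    new_tables[gene][entry] ^= 1
    return inputs, new_tables


def accepted(old_attractors, new_attractors):
    """Keep a mutant only if it preserves every old attractor and adds at least one new one."""
    return old_attractors < new_attractors  # strict subset
```

In this sketch a "phenotype" is an attractor of the synchronous dynamics, so conservation corresponds to the old attractor set being contained in the new one and innovation to the containment being strict.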
Molecular characterization and expression analysis of WRKY family genes in Dendrobium officinale.
Wang, Tao; Song, Zheng; Wei, Li; Li, Lubin
2018-03-01
The WRKY family of transcription factors is one of the most important families of plant transcriptional regulators, and the members regulate multiple biological processes. However, there is limited information on WRKYs in Dendrobium officinale. In this study, 52 WRKY family genes of D. officinale were surveyed for the first time. Conserved domain, phylogenetic, exon-intron construction, and expression analyses were performed for the DoWRKY genes. Two major types of intron splicing (PR and VQR introns) were found, and the intron insertion position was observed to be relatively conserved in the conserved DoWRKY domains. The expression profiles of nine DoWRKYs were analyzed in cold- and methyl jasmonate (MeJA)-treated D. officinale seedlings; the DoWRKYs showed significant expression changes at different levels, which suggested their vital roles in stress tolerance. Moreover, the expression trends of most of the DoWRKYs after the simultaneous cold stress and MeJA treatment were the opposite of those of DoWRKYs after the individual cold stress and MeJA treatments, suggesting that the two stresses might have antagonistic effects and affect the adaptive capacity of the plants to stresses. Twelve DoWRKY genes were differentially expressed between symbiotic and asymbiotic germinated seeds; all were upregulated in the symbiotic germinated seeds except DoWRKY16. These differences in expression of DoWRKYs might be involved in promoting in vitro symbiotic germination of seeds with Tulasnella-like fungi. Our findings will be useful for further studies on the WRKY family genes in orchids.
Herbig, Eric; Warfield, Linda; Fish, Lisa; Fishburn, James; Knutson, Bruce A; Moorefield, Beth; Pacheco, Derek; Hahn, Steven
2010-05-01
Targets of the tandem Gcn4 acidic activation domains in transcription preinitiation complexes were identified by site-specific cross-linking. The individual Gcn4 activation domains cross-link to three common targets, Gal11/Med15, Taf12, and Tra1, which are subunits of four conserved coactivator complexes, Mediator, SAGA, TFIID, and NuA4. The Gcn4 N-terminal activation domain also cross-links to the Mediator subunit Sin4/Med16. The contribution of the two Gcn4 activation domains to transcription was gene specific and varied from synergistic to less than additive. Gcn4-dependent genes had a requirement for Gal11 ranging from 10-fold dependence to complete Gal11 independence, while the Gcn4-Taf12 interaction did not significantly contribute to the expression of any gene studied. Complementary methods identified three conserved Gal11 activator-binding domains that bind each Gcn4 activation domain with micromolar affinity. These Gal11 activator-binding domains contribute additively to transcription activation and Mediator recruitment at Gcn4- and Gal11-dependent genes. Although we found that the conserved Gal11 KIX domain contributes to Gal11 function, we found no evidence of specific Gcn4-KIX interaction and conclude that the Gal11 KIX domain does not function by specific interaction with Gcn4. Our combined results show gene-specific coactivator requirements, a surprising redundancy in activator-target interactions, and an activator-coactivator interaction mediated by multiple low-affinity protein-protein interactions.
Lef1-dependent hypothalamic neurogenesis inhibits anxiety
Xie, Yuanyuan; Panahi, Samin; Gaynes, John A.; Watters, Harrison N.; Zhou, Dingxi; Xue, Hai-Hui; Fung, Camille M.; Levine, Edward M.; Letsou, Anthea; Brennan, K. C.
2017-01-01
While innate behaviors are conserved throughout the animal kingdom, it is unknown whether common signaling pathways regulate the development of neuronal populations mediating these behaviors in diverse organisms. Here, we demonstrate that the Wnt/β-catenin effector Lef1 is required for the differentiation of anxiolytic hypothalamic neurons in zebrafish and mice, although the identity of Lef1-dependent genes and neurons differs between these 2 species. We further show that zebrafish and Drosophila have common Lef1-dependent gene expression in their respective neuroendocrine organs, consistent with a conserved pathway that has diverged in the mouse. Finally, orthologs of Lef1-dependent genes from both zebrafish and mouse show highly correlated hypothalamic expression in marmosets and humans, suggesting co-regulation of 2 parallel anxiolytic pathways in primates. These findings demonstrate that during evolution, a transcription factor can act through multiple mechanisms to generate a common behavioral output, and that Lef1 regulates circuit development that is fundamentally important for mediating anxiety in a wide variety of animal species. PMID:28837622
Dobson, Adam J.; Chaston, John M.; Newell, Peter D.; Donahue, Leanne; Hermann, Sara L.; Sannino, David R.; Westmiller, Stephanie; Wong, Adam C.-N.; Clark, Andrew G.; Lazzaro, Brian P.; Douglas, Angela E.
2015-01-01
Animals bear communities of gut microorganisms with substantial effects on animal nutrition, but the host genetic basis of these effects is unknown. Here, we use Drosophila to demonstrate substantial among-genotype variation in the effects of eliminating the gut microbiota on five host nutritional indices (weight, and protein, lipid, glucose and glycogen contents); this includes variation in both the magnitude and direction of microbiota-dependent effects. Genome-wide associations to identify the genetic basis of the microbiota-dependent variation reveal polymorphisms in largely non-overlapping sets of genes associated with variation in the nutritional traits, including strong representation of conserved genes functioning in signaling. Key genes identified by the GWA study are validated by loss-of-function mutations that altered microbiota-dependent nutritional effects. We conclude that the microbiota interacts with the animal at multiple points in the signaling and regulatory networks that determine animal nutrition. These interactions with the microbiota are likely conserved across animals, including humans. PMID:25692519
Genome-Wide Detection and Analysis of Multifunctional Genes
Pritykin, Yuri; Ghersi, Dario; Singh, Mona
2015-01-01
Many genes can play a role in multiple biological processes or molecular functions. Identifying multifunctional genes at the genome-wide level and studying their properties can shed light upon the complexity of molecular events that underpin cellular functioning, thereby leading to a better understanding of the functional landscape of the cell. However, to date, genome-wide analysis of multifunctional genes (and the proteins they encode) has been limited. Here we introduce a computational approach that uses known functional annotations to extract genes playing a role in at least two distinct biological processes. We leverage functional genomics data sets for three organisms—H. sapiens, D. melanogaster, and S. cerevisiae—and show that, as compared to other annotated genes, genes involved in multiple biological processes possess distinct physicochemical properties, are more broadly expressed, tend to be more central in protein interaction networks, tend to be more evolutionarily conserved, and are more likely to be essential. We also find that multifunctional genes are significantly more likely to be involved in human disorders. These same features also hold when multifunctionality is defined with respect to molecular functions instead of biological processes. Our analysis uncovers key features about multifunctional genes, and is a step towards a better genome-wide understanding of gene multifunctionality. PMID:26436655
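The idea of extracting genes annotated to at least two distinct biological processes can be sketched as follows. The abstract does not say how "distinct" is decided, so the criterion used here (neither term is an ancestor of the other in the annotation hierarchy) is an assumption, as are the function names and the toy term identifiers in the usage note.

```python
from itertools import combinations


def ancestors(term, parents):
    """All ancestors of a term in a hierarchy given as {term: set_of_parent_terms}."""
    found = set()
    stack = [term]
    while stack:
        for parent in parents.get(stack.pop(), ()):
            if parent not in found:
                found.add(parent)
                stack.append(parent)
    return found


def distinct(a, b, parents):
    """Assumed criterion: two terms are distinct if neither is an ancestor of the other."""
    return (a != b
            and a not in ancestors(b, parents)
            and b not in ancestors(a, parents))


def multifunctional(annotations, parents):
    """Genes with at least one pair of distinct terms among their annotations."""
    return {gene for gene, terms in annotations.items()
            if any(distinct(a, b, parents)
                   for a, b in combinations(sorted(terms), 2))}
```

With a toy hierarchy in which `dna_repair` is a child of `dna_metabolism`, a gene annotated only to those two terms would not count as multifunctional, whereas a gene annotated to `dna_repair` and an unrelated `transport` term would.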
Pandey, Ravi S; Saxena, Garima; Bhattacharya, Debashish; Qiu, Huan; Azad, Rajeev K
2017-02-01
Identification of horizontal gene transfers (HGTs) has primarily relied on phylogenetic tree based methods, which require a rich sampling of sequenced genomes to ensure a reliable inference. Because the success of phylogenetic approaches depends on the breadth and depth of the database, researchers usually apply stringent filters to detect only the most likely gene transfers in the genomes of interest. One such study focused on a highly conservative estimate of trans-domain gene transfers in the extremophile eukaryote, Galdieria sulphuraria (Galdieri) Merola (Rhodophyta), by applying multiple filters in their phylogenetic pipeline. This led to the identification of 75 inter-domain acquisitions from Bacteria or Archaea. Because of the evolutionary, ecological, and potential biotechnological significance of foreign genes in algae, alternative approaches and pipelines complementing phylogenetics are needed for a more comprehensive assessment of HGT. We present here a novel pipeline that uncovered 17 novel foreign genes of prokaryotic origin in G. sulphuraria, results that are supported by multiple lines of evidence including composition-based, comparative data, and phylogenetics. These genes encode a variety of potentially adaptive functions, from metabolite transport to DNA repair. © 2016 Phycological Society of America.
Elucidating the composition and conservation of the autophagy pathway in photosynthetic eukaryotes
Shemi, Adva; Ben-Dor, Shifra; Vardi, Assaf
2015-01-01
Aquatic photosynthetic eukaryotes represent highly diverse groups (green, red, and chromalveolate algae) derived from multiple endosymbiosis events, covering a wide spectrum of the tree of life. They are responsible for about 50% of the global photosynthesis and serve as the foundation for oceanic and fresh water food webs. Although the ecophysiology and molecular ecology of some algal species are extensively studied, some basic aspects of algal cell biology are still underexplored. The recent wealth of genomic resources from algae has opened new frontiers to decipher the role of cell signaling pathways and their function in an ecological and biotechnological context. Here, we took a bioinformatic approach to explore the distribution and conservation of TOR and autophagy-related (ATG) proteins (Atg in yeast) in diverse algal groups. Our genomic analysis demonstrates conservation of TOR and ATG proteins in green algae. In contrast, in all 5 available red algal genomes, we could not detect the sequences that encode for any of the 17 core ATG proteins examined, albeit TOR and its interacting proteins are conserved. This intriguing data suggests that the autophagy pathway is not conserved in red algae as it is in the entire eukaryote domain. In contrast, chromalveolates, despite being derived from the red-plastid lineage, retain and express ATG genes, which raises a fundamental question regarding the acquisition of ATG genes during algal evolution. Among chromalveolates, Emiliania huxleyi (Haptophyta), a bloom-forming coccolithophore, possesses the most complete set of ATG genes, and may serve as a model organism to study autophagy in marine protists with great ecological significance. PMID:25915714
Dehydration stress memory genes of Zea mays; comparison with Arabidopsis thaliana
2014-01-01
Background Pre-exposing plants to diverse abiotic stresses may alter their physiological and transcriptional responses to a subsequent stress, suggesting a form of “stress memory”. Arabidopsis thaliana plants that have experienced multiple exposures to dehydration stress display transcriptional behavior suggesting “memory” from an earlier stress. Genes that respond to a first stress by up-regulating or down-regulating their transcription but in a subsequent stress provide a significantly different response define the ‘memory genes’ category. Genes responding similarly to each stress form the ‘non-memory’ category. It is unknown whether such memory responses exist in other angiosperm lineages and whether memory is an evolutionarily conserved response to repeated dehydration stresses. Results Here, we determine the transcriptional responses of maize (Zea mays L.) plants that have experienced repeated exposures to dehydration stress in comparison with plants encountering the stress for the first time. Four distinct transcription memory response patterns similar to those displayed by A. thaliana were revealed. The most important contribution is the evidence that monocot and eudicot plants, two lineages that diverged 140 to 200 million years ago, display similar abilities to ‘remember’ a dehydration stress and to modify their transcriptional responses accordingly. The highly sensitive RNA-Seq analyses allowed the identification of genes that function similarly in the two lineages, as well as genes that function in species-specific ways. Memory transcription patterns indicate that the transcriptional behavior of responding genes under repeated stresses is different from the behavior during an initial dehydration stress, suggesting that stress memory is a complex phenotype resulting from coordinated responses of multiple signaling pathways. Conclusions Structurally related genes displaying the same memory responses in the two species would suggest conservation of the genes’ memory during the evolution of plants’ dehydration stress response systems. On the other hand, divergent transcription memory responses by genes encoding similar functions would suggest occurrence of species-specific memory responses. The results provide novel insights into our current knowledge of how plants respond to multiple dehydration stresses, as compared to a single exposure, and may serve as a reference platform to study the functions of memory genes in adaptive responses to water deficit in monocot and eudicot plants. PMID:24885787
Ujino-Ihara, Tokuko; Kanamori, Hiroyuki; Yamane, Hiroko; Taguchi, Yuriko; Namiki, Nobukazu; Mukai, Yuzuru; Yoshimura, Kensuke; Tsumura, Yoshihiko
2005-12-01
To identify and characterize lineage-specific genes of conifers, two sets of ESTs (with 12791 and 5902 ESTs, representing 5373 and 3018 gene transcripts, respectively) were generated from the Cupressaceae species Cryptomeria japonica and Chamaecyparis obtusa. These transcripts were compared with non-redundant sets of genes generated from Pinaceae species, other gymnosperms and angiosperms. About 6% of tentative unique genes (Unigenes) of C. japonica and C. obtusa had homologs in other conifers but not angiosperms, and about 70% had apparent homologs in angiosperms. The calculated GC contents of orthologous genes showed that GC contents of coniferous genes are likely to be lower than those of angiosperms. Comparisons of the numbers of homologous genes in each species suggest that copy numbers of genes may be correlated between diverse seed plants. This correlation suggests that the multiplicity of such genes may have arisen before the divergence of gymnosperms and angiosperms.
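The GC-content comparison between orthologous genes mentioned above reduces to a simple per-sequence calculation. The sketch below is illustrative only: the function names are hypothetical, ambiguous bases are simply ignored (an assumption), and any sequences passed in would be the reader's own data, not values from the study.

```python
def gc_content(seq):
    """Fraction of G and C among unambiguous bases (A, C, G, T); other symbols are ignored."""
    called = [base for base in seq.upper() if base in "ACGT"]
    if not called:
        raise ValueError("sequence contains no unambiguous bases")
    return sum(base in "GC" for base in called) / len(called)


def mean_gc_gap(ortholog_pairs):
    """Mean of gc_content(first) - gc_content(second) over pairs of orthologous sequences."""
    gaps = [gc_content(a) - gc_content(b) for a, b in ortholog_pairs]
    return sum(gaps) / len(gaps)
```

A negative value from `mean_gc_gap` would indicate that the first member of each pair (for example, the conifer ortholog) tends to have lower GC content than the second.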
Parkin, Derek B; Archer, Linda L; Childress, April L; Wellehan, James F X
2009-07-01
Bearded dragons (Pogona vitticeps) are popular pets in the United States. Agamid Adenovirus 1 (AgAdV1) is an important infectious agent of bearded dragons. The only AgAdV1 sequences available to date are from a highly conserved region of the DNA polymerase gene. Degenerate primers were designed to amplify a variable region of the AgAdV1 hexon gene for sequencing. Genetic differences were identified within the hexon gene of 17 bearded dragons from 4 collections. Much less diversity was present in the polymerase gene. Bayesian analysis of the hexon nucleotide alignment identified two larger groups and two isolates that did not tightly cluster with these two groups. Multiple genotypes were identified within collections, and individual genotypes were seen in different collections. Three bearded dragons appeared to be infected by multiple strains. These findings show that this hexon region is useful for AgAdV1 genotyping, which can be used epidemiologically as well as in future investigations of AgAdV1 evolution and clinical implications of strain differences.
Diversity in Copy Number and Structure of a Silkworm Morphogenetic Gene as a Result of Domestication
Sakudoh, Takashi; Nakashima, Takeharu; Kuroki, Yoko; Fujiyama, Asao; Kohara, Yuji; Honda, Naoko; Fujimoto, Hirofumi; Shimada, Toru; Nakagaki, Masao; Banno, Yutaka; Tsuchida, Kozo
2011-01-01
The carotenoid-binding protein (CBP) of the domesticated silkworm, Bombyx mori, a major determinant of cocoon color, is likely to have been substantially influenced by domestication of this species. We analyzed the structure of the CBP gene in multiple strains of B. mori, in multiple individuals of the wild silkworm, B. mandarina (the putative wild ancestor of B. mori), and in a number of other lepidopterans. We found the CBP gene copy number in genomic DNA to vary widely among B. mori strains, ranging from 1 to 20. The copies of CBP are of several types, based on the presence of a retrotransposon or partial deletion of the coding sequence. In contrast to B. mori, B. mandarina was found to possess a single copy of CBP without the retrotransposon insertion, regardless of habitat. Several other lepidopterans were found to contain sequences homologous to CBP, revealing that this gene is evolutionarily conserved in the lepidopteran lineage. Thus, domestication can generate significant diversity of gene copy number and structure over a relatively short evolutionary time. PMID:21242537
Pan, Junsong; Tan, Junyi; Wang, Yuhui; Zheng, Xiangyang; Owens, Ken; Li, Dawei; Li, Yuhong; Weng, Yiqun
2018-04-21
Map-based cloning identified a candidate gene for resistance to the anthracnose fungal pathogen Colletotrichum orbiculare in cucumber, which reveals a novel function for the highly conserved STAYGREEN family genes for host disease resistance in plants. Colletotrichum orbiculare is a hemibiotrophic fungal pathogen that causes anthracnose disease in cucumber and other cucurbit crops. No host resistance genes against the anthracnose pathogens have been cloned in crop plants. Here, we report fine mapping and cloning of a resistance gene to the race 1 anthracnose pathogen in cucumber inbred lines Gy14 and WI 2757. Phenotypic and QTL analysis in multiple populations revealed that a single recessive gene, cla, underlies anthracnose resistance in both lines, but WI 2757 carried an additional minor-effect QTL. Fine mapping using 150 Gy14 × 9930 recombinant inbred lines and 1043 F2 individuals delimited the cla locus to a 32 kb region on cucumber chromosome 5 with three predicted genes. Multiple lines of evidence suggested that the cucumber STAYGREEN (CsSGR) gene is a candidate for the anthracnose resistance locus. A single nucleotide mutation in the third exon of CsSGR resulted in the substitution of glutamine in 9930 to arginine in Gy14 in the CsSGR protein, which seems responsible for the differential anthracnose inoculation responses between Gy14 and 9930. Quantitative real-time PCR analysis indicated that CsSGR was significantly upregulated upon anthracnose pathogen inoculation in the susceptible 9930, while its expression was much lower in the resistant Gy14. Investigation of allelic diversities in natural cucumber populations revealed that the resistance allele in almost all improved cultivars or breeding lines of U.S. origin was derived from PI 197087. This work reveals an unknown function for the highly conserved STAYGREEN (SGR) family genes for host disease resistance in plants.
Subramoni, Sujatha; Florez Salcedo, Diana Vanessa; Suarez-Moreno, Zulma R.
2015-01-01
LuxR solo transcriptional regulators contain both an autoinducer binding domain (ABD; N-terminal) and a DNA binding Helix-Turn-Helix domain (HTH; C-terminal), but are not associated with a cognate N-acyl homoserine lactone (AHL) synthase coding gene in the same genome. Although a few LuxR solos have been characterized, their distributions as well as their role in bacterial signal perception and other processes are poorly understood. In this study we have carried out a systematic survey of distribution of all ABD containing LuxR transcriptional regulators (QS domain LuxRs) available in the InterPro database (IPR005143), and identified those lacking a cognate AHL synthase. These LuxR solos were then analyzed regarding their taxonomical distribution, predicted functions of neighboring genes and the presence of complete AHL-QS systems in the genomes that carry them. Our analyses reveal the presence of one or multiple predicted LuxR solos in many proteobacterial genomes carrying QS domain LuxRs, some of them harboring genes for one or more AHL-QS circuits. The presence of LuxR solos in bacteria occupying diverse environments suggests potential ecological functions for these proteins beyond AHL and interkingdom signaling. Based on gene context and the conservation levels of invariant amino acids of ABD, we have classified LuxR solos into functionally meaningful groups or putative orthologs. Surprisingly, putative LuxR solos were also found in a few non-proteobacterial genomes which are not known to carry AHL-QS systems. Multiple predicted LuxR solos in the same genome appeared to have different levels of conservation of invariant amino acid residues of ABD questioning their binding to AHLs. In summary, this study provides a detailed overview of distribution of LuxR solos and their probable roles in bacteria with genome sequence information. PMID:25759807
Ventura, Marco; Jankovic, Ivana; Walker, D. Carey; Pridmore, R. David; Zink, Ralf
2002-01-01
We have identified and sequenced the genes encoding the aggregation-promoting factor (APF) protein from six different strains of Lactobacillus johnsonii and Lactobacillus gasseri. Both species harbor two apf genes, apf1 and apf2, which are in the same orientation and encode proteins of 257 to 326 amino acids. Multiple alignments of the deduced amino acid sequences of these apf genes demonstrate a very strong sequence conservation of all of the genes with the exception of their central regions. Northern blot analysis showed that both genes are transcribed, reaching their maximum expression during the exponential phase. Primer extension analysis revealed that apf1 and apf2 harbor a putative promoter sequence that is conserved in all of the genes. Western blot analysis of the LiCl cell extracts showed that APF proteins are located on the cell surface. Intact cells of L. johnsonii revealed the typical cell wall architecture of S-layer-carrying gram-positive eubacteria, which could be selectively removed with LiCl treatment. In addition, the amino acid composition, physical properties, and genetic organization were found to be quite similar to those of S-layer proteins. These results suggest that APF is a novel surface protein of the Lactobacillus acidophilus B-homology group which might belong to an S-layer-like family. PMID:12450842
Transcription co-activator SAYP mediates the action of STAT activator.
Panov, Vladislav V; Kuzmina, Julia L; Doronin, Semen A; Kopantseva, Marina R; Nabirochkina, Elena N; Georgieva, Sofia G; Vorobyeva, Nadezhda E; Shidlovskii, Yulii V
2012-03-01
Jak/STAT is an important signaling pathway mediating multiple events in development. We describe participation of metazoan co-activator SAYP/PHF10 in this pathway downstream of STAT. The latter, via its activation domain, interacts with the conserved core of SAYP. STAT is associated with the SAYP-containing co-activator complex BTFly and recruits BTFly onto genes. SAYP is necessary for stimulating STAT-driven transcription of numerous genes. Mutation of SAYP leads to maldevelopments similar to those observed in STAT mutants. Thus, SAYP is a novel co-activator mediating the action of STAT.
Genome-wide identification and characterization of the SBP-box gene family in Petunia.
Zhou, Qin; Zhang, Sisi; Chen, Feng; Liu, Baojun; Wu, Lan; Li, Fei; Zhang, Jiaqi; Bao, Manzhu; Liu, Guofeng
2018-03-12
SQUAMOSA PROMOTER BINDING PROTEIN (SBP)-box genes encode a family of plant-specific transcription factors (TFs) that play important roles in many growth and development processes including phase transition, leaf initiation, shoot and inflorescence branching, fruit development and ripening etc. The SBP-box gene family has been identified and characterized in many species, but has not been well studied in Petunia, an important ornamental genus. We identified 21 putative SPL genes of Petunia axillaris and P. inflata from the reference genome of P. axillaris N and P. inflata S6, respectively, which were supported by the transcriptome data. For further confirmation, all the 21 genes were also cloned from P. hybrida line W115 (Mitchell diploid). Phylogenetic analysis based on the highly conserved SBP domains arranged PhSPLs in eight groups, analogous to those from Arabidopsis and tomato. Furthermore, the Petunia SPL genes had similar exon-intron structure and the deduced proteins contained very similar conserved motifs within the same subgroup. Out of 21 PhSPL genes, fourteen were predicted to be potential targets of PhmiR156/157, and the putative miR156/157 response elements (MREs) were located in the coding region of group IV, V, VII and VIII genes, but in the 3'-UTR regions of group VI genes. SPL genes were also identified from another two wild Petunia species, P. integrifolia and P. exserta, based on their transcriptome databases to investigate the origin of PhSPLs. Phylogenetic analysis and multiple alignments of the coding sequences of PhSPLs and their orthologs from wild species indicated that PhSPLs originated mainly from P. axillaris. qRT-PCR analysis demonstrated differential spatiotemporal expression patterns of PhSPL genes in petunia and many were expressed predominantly in the axillary buds and/or inflorescences. 
In addition, overexpression of PhSPL9a and PhSPL9b in Arabidopsis suggested that these genes play a conserved role in promoting the vegetative-to-reproductive phase transition. Petunia genome contains at least 21 SPL genes, and most of the genes are expressed in different tissues. The PhSPL genes may play conserved and diverse roles in plant growth and development, including flowering regulation, leaf initiation, axillary bud and inflorescence development. This work provides a comprehensive understanding of the SBP-box gene family in Petunia and lays a significant foundation for future studies on the function and evolution of SPL genes in petunia.
Shahinyan, Grigor; Margaryan, Armine; Panosyan, Hovik; Trchounian, Armen
2017-05-02
Among the huge diversity of thermophilic bacteria, mainly bacilli have been reported as active thermostable lipase producers. Geothermal springs serve as the main source for isolation of thermostable lipase producing bacilli. Thermostable lipolytic enzymes, functioning in the harsh conditions, have promising applications in processing of organic chemicals, detergent formulation, synthesis of biosurfactants, pharmaceutical processing etc. In order to study the distribution of lipase-producing thermophilic bacilli and their specific lipase protein primary structures, three lipase producers from different genera were isolated from mesothermal (27.5-70 °C) springs distributed on the territory of Armenia and Nagorno Karabakh. Based on phenotypic characteristics and 16S rRNA gene sequencing the isolates were identified as Geobacillus sp., Bacillus licheniformis and Anoxibacillus flavithermus strains. The lipase genes of isolates were sequenced by using initially designed primer sets. Multiple alignments generated from primary structures of the lipase proteins and annotated lipase protein sequences, conserved regions analysis and amino acid composition have illustrated the similarity (98-99%) of the lipases with true lipases (family I) and GDSL esterase family (family II). A conserved sequence block that determines the thermostability has been identified in the multiple alignments of the lipase proteins. The results shed light on the distribution of lipase-producing bacilli in geothermal springs in Armenia and Nagorno Karabakh. Newly isolated bacilli strains could be a prospective source of thermostable lipases and their genes.
Genomic dissection of conserved transcriptional regulation in intestinal epithelial cells
Camp, J. Gray; Weiser, Matthew; Cocchiaro, Jordan L.; Kingsley, David M.; Furey, Terrence S.; Sheikh, Shehzad Z.; Rawls, John F.
2017-01-01
The intestinal epithelium serves critical physiologic functions that are shared among all vertebrates. However, it is unknown how the transcriptional regulatory mechanisms underlying these functions have changed over the course of vertebrate evolution. We generated genome-wide mRNA and accessible chromatin data from adult intestinal epithelial cells (IECs) in zebrafish, stickleback, mouse, and human species to determine if conserved IEC functions are achieved through common transcriptional regulation. We found evidence for substantial common regulation and conservation of gene expression regionally along the length of the intestine from fish to mammals and identified a core set of genes comprising a vertebrate IEC signature. We also identified transcriptional start sites and other putative regulatory regions that are differentially accessible in IECs in all 4 species. Although these sites rarely showed sequence conservation from fish to mammals, surprisingly, they drove highly conserved IEC expression in a zebrafish reporter assay. Common putative transcription factor binding sites (TFBS) found at these sites in multiple species indicate that sequence conservation alone is insufficient to identify much of the functionally conserved IEC regulatory information. Among the rare, highly sequence-conserved, IEC-specific regulatory regions, we discovered an ancient enhancer upstream from her6/HES1 that is active in a distinct population of Notch-positive cells in the intestinal epithelium. Together, these results show how combining accessible chromatin and mRNA datasets with TFBS prediction and in vivo reporter assays can reveal tissue-specific regulatory information conserved across 420 million years of vertebrate evolution. We define an IEC transcriptional regulatory network that is shared between fish and mammals and establish an experimental platform for studying how evolutionarily distilled regulatory information commonly controls IEC development and physiology. 
PMID:28850571
Evolutionary conservation of sequence and secondary structures in CRISPR repeats
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kunin, Victor; Sorek, Rotem; Hugenholtz, Philip
Clustered Regularly Interspaced Palindromic Repeats (CRISPRs) are a novel class of direct repeats, separated by unique spacer sequences of similar length, that are present in approximately 40% of bacterial and all archaeal genomes analyzed to date. More than 40 gene families, called CRISPR-associated sequences (CAS), appear in conjunction with these repeats and are thought to be involved in the propagation and functioning of CRISPRs. It has been proposed that the CRISPR/CAS system samples, maintains a record of, and inactivates invasive DNA that the cell has encountered, and therefore constitutes a prokaryotic analog of an immune system. Here we analyze CRISPR repeats identified in 195 microbial genomes and show that they can be organized into multiple clusters based on sequence similarity. All individual repeats in any given cluster were inferred to form characteristic RNA secondary structure, ranging from non-existent to pronounced. Stable secondary structures included G:U base pairs and exhibited multiple compensatory base changes in the stem region, indicating evolutionary conservation and functional importance. We also show that the repeat-based classification corresponds to, and expands upon, a previously reported CAS gene-based classification including specific relationships between CRISPR and CAS subtypes.
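The stem features mentioned in this abstract (G:U base pairs, compensatory base changes) can be illustrated with a small sketch. The helper names `stem_pairs` and `is_compensatory` are hypothetical and are not taken from the cited study; the sketch only assumes that the two arms of a candidate stem are already known and pair antiparallel.

```python
# Illustrative sketch only; function names are hypothetical, not from the study.
WATSON_CRICK = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}
WOBBLE = {("G", "U"), ("U", "G")}

def stem_pairs(left_arm, right_arm):
    """Pair left_arm (5'->3') against right_arm read 3'->5' and count pair types."""
    left = left_arm.upper().replace("T", "U")
    right = right_arm.upper().replace("T", "U")[::-1]  # antiparallel partner
    if len(left) != len(right):
        raise ValueError("stem arms must have equal length")
    wc = sum((a, b) in WATSON_CRICK for a, b in zip(left, right))
    gu = sum((a, b) in WOBBLE for a, b in zip(left, right))
    return {"watson_crick": wc, "wobble": gu, "unpaired": len(left) - wc - gu}

def is_compensatory(pair_a, pair_b):
    """True if both bases differ between two homologous stem positions,
    yet each position still forms a valid (Watson-Crick or G:U) pair."""
    valid = WATSON_CRICK | WOBBLE
    both_changed = pair_a[0] != pair_b[0] and pair_a[1] != pair_b[1]
    return both_changed and pair_a in valid and pair_b in valid

print(stem_pairs("GUUC", "GAGC"))
print(is_compensatory(("G", "C"), ("A", "U")))
```

A change such as G:C to A:U keeps the stem intact while altering both bases, which is the kind of substitution usually read as evidence that the structure, rather than the sequence, is under selection.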
Takeda, Kojiro; Mori, Ayaka; Yanagida, Mitsuhiro
2011-01-01
Bortezomib/PS-341/Velcade, a proteasome inhibitor, is widely used to treat multiple myeloma. While several mechanisms of the cytotoxicity of the drug were proposed, the actual mechanism remains elusive. We aimed to identify genes affecting the cytotoxicity of Bortezomib in the fission yeast S.pombe as the drug inhibits this organism's cell division cycle like proteasome mutants. Among the 2815 genes screened (covering 56% of total ORFs), 19 genes, whose deletions induce strong synthetic lethality with Bortezomib, were identified. The products of the 19 genes included four ubiquitin enzymes and one nuclear proteasome factor, and 13 of them are conserved in humans. Our results will provide useful information for understanding the actions of Bortezomib within cells. PMID:21760946
Barta, Endre; Sebestyén, Endre; Pálfy, Tamás B.; Tóth, Gábor; Ortutay, Csaba P.; Patthy, László
2005-01-01
DoOP (http://doop.abc.hu/) is a database of eukaryotic promoter sequences (upstream regions) aiming to facilitate the recognition of regulatory sites conserved between species. The annotated first exons of human and Arabidopsis thaliana genes were used as queries in BLAST searches to collect the most closely related orthologous first exon sequences from Chordata and Viridiplantae species. Up to 3000 bp DNA segments upstream from these first exons constitute the clusters in the chordate and plant sections of the Database of Orthologous Promoters. Release 1.0 of DoOP contains 21 061 chordate clusters from 284 different species and 7548 plant clusters from 269 different species. The database can be used to find and retrieve promoter sequences of a given gene from various species and it is also suitable to see the most trivial conserved sequence blocks in the orthologous upstream regions. Users can search DoOP with either sequence or text (annotation) to find promoter clusters of various genes. In addition to the sequence data, the positions of the conserved sequence blocks derived from multiple alignments, the positions of repetitive elements and the positions of transcription start sites known from the Eukaryotic Promoter Database (EPD) can be viewed graphically. PMID:15608291
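As a rough illustration of how an upstream segment of up to 3000 bp can be cut out once the first exon is located, the sketch below assumes 0-based, half-open coordinates and a chromosome held as a plain string. The function `upstream_region` and its parameters are hypothetical and do not describe DoOP's actual pipeline.

```python
# Illustrative sketch; names and coordinate convention are assumptions.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]

def upstream_region(chrom, exon_start, exon_end, strand, max_len=3000):
    """Return up to max_len bases upstream of the first exon, in gene orientation.

    Coordinates are 0-based, half-open: the exon is chrom[exon_start:exon_end].
    """
    if strand == "+":
        # Upstream lies before the exon start; clip at the chromosome start.
        return chrom[max(0, exon_start - max_len):exon_start]
    if strand == "-":
        # Upstream lies after the exon end on the forward strand.
        return reverse_complement(chrom[exon_end:exon_end + max_len])
    raise ValueError("strand must be '+' or '-'")

chrom = "AAACCCGGGTTT"
print(upstream_region(chrom, 6, 9, "+", max_len=4))
print(upstream_region(chrom, 3, 6, "-", max_len=4))
```

Returning the minus-strand segment as a reverse complement keeps all sequences in a cluster in the same orientation relative to the gene, which is what a later multiple alignment would need.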
Alternative splicing of anciently exonized 5S rRNA regulates plant transcription factor TFIIIA
Fu, Yan; Bannach, Oliver; Chen, Hao; Teune, Jan-Hendrik; Schmitz, Axel; Steger, Gerhard; Xiong, Liming; Barbazuk, W. Brad
2009-01-01
Identifying conserved alternative splicing (AS) events among evolutionarily distant species can prioritize AS events for functional characterization and help uncover relevant cis- and trans-regulatory factors. A genome-wide search for conserved cassette exon AS events in higher plants revealed the exonization of 5S ribosomal RNA (5S rRNA) within the gene of its own transcription regulator, TFIIIA (transcription factor for polymerase III A). The 5S rRNA-derived exon in TFIIIA gene exists in all representative land plant species but not in green algae and nonplant species, suggesting it is specific to land plants. TFIIIA is essential for RNA polymerase III-based transcription of 5S rRNA in eukaryotes. Integrating comparative genomics and molecular biology revealed that the conserved cassette exon derived from 5S rRNA is coupled with nonsense-mediated mRNA decay. Utilizing multiple independent Arabidopsis overexpressing TFIIIA transgenic lines under osmotic and salt stress, strong accordance between phenotypic and molecular evidence reveals the biological relevance of AS of the exonized 5S rRNA in quantitative autoregulation of TFIIIA homeostasis. Most significantly, this study provides the first evidence of ancient exaptation of 5S rRNA in plants, suggesting a novel gene regulation model mediated by the AS of an anciently exonized noncoding element. PMID:19211543
Short and long-term genome stability analysis of prokaryotic genomes.
Brilli, Matteo; Liò, Pietro; Lacroix, Vincent; Sagot, Marie-France
2013-05-08
Gene organization dynamics is actively studied because it provides useful evolutionary information, makes functional annotation easier and often enables to characterize pathogens. There is therefore a strong interest in understanding the variability of this trait and the possible correlations with life-style. Two kinds of events affect genome organization: on one hand translocations and recombinations change the relative position of genes shared by two genomes (i.e. the backbone gene order); on the other, insertions and deletions leave the backbone gene order unchanged but they alter the gene neighborhoods by breaking the syntenic regions. A complete picture about genome organization evolution therefore requires to account for both kinds of events. We developed an approach where we model chromosomes as graphs on which we compute different stability estimators; we consider genome rearrangements as well as the effect of gene insertions and deletions. In a first part of the paper, we fit a measure of backbone gene order conservation (hereinafter called backbone stability) against phylogenetic distance for over 3000 genome comparisons, improving existing models for the divergence in time of backbone stability. Intra- and inter-specific comparisons were treated separately to focus on different time-scales. The use of multiple genomes of a same species allowed to identify genomes with diverging gene order with respect to their conspecific. The inter-species analysis indicates that pathogens are more often unstable with respect to non-pathogens. In a second part of the text, we show that in pathogens, gene content dynamics (insertions and deletions) have a much more dramatic effect on genome organization stability than backbone rearrangements. In this work, we studied genome organization divergence taking into account the contribution of both genome order rearrangements and genome content dynamics. 
By studying species with multiple sequenced genomes available, we were able to explore genome organization stability at different time-scales and to find significant differences for pathogen and non-pathogen species. The output of our framework also allows to identify the conserved gene clusters and/or partial occurrences thereof, making possible to explore how gene clusters assembled during evolution.
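The distinction drawn in this abstract between backbone rearrangements and gene insertions or deletions can be made concrete with a toy measure. The sketch below is not the estimator used in the paper; it assumes each gene occurs once per genome, treats chromosomes as circular by default, and scores the fraction of neighbouring pairs among shared genes that are preserved. All function names are hypothetical.

```python
# Toy measure under stated assumptions; not the estimator from the paper.
def backbone(order, shared):
    """Keep only genes present in both genomes, preserving their order."""
    return [gene for gene in order if gene in shared]

def adjacencies(order, circular=True):
    """Unordered neighbouring gene pairs along a chromosome."""
    pairs = {frozenset(pair) for pair in zip(order, order[1:])}
    if circular and len(order) > 2:
        pairs.add(frozenset((order[-1], order[0])))  # close the circle
    return pairs

def backbone_stability(order_a, order_b, circular=True):
    """Fraction of backbone adjacencies of genome A also found in genome B."""
    shared = set(order_a) & set(order_b)
    adj_a = adjacencies(backbone(order_a, shared), circular)
    adj_b = adjacencies(backbone(order_b, shared), circular)
    if not adj_a:
        return 0.0
    return len(adj_a & adj_b) / len(adj_a)

genome_a = ["g1", "g2", "x", "g3", "g4", "g5"]   # "x" is an insertion
genome_b = ["g1", "g2", "g3", "g5", "g4", "y"]   # g4/g5 swapped, "y" inserted
print(backbone_stability(genome_a, genome_b))
```

In this example the inserted genes `x` and `y` drop out when the backbone is built, so only the swap of `g4` and `g5` lowers the score; capturing the effect of insertions would need a separate neighbourhood-based measure, as the abstract describes.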
Genome Wide Search for Biomarkers to Diagnose Yersinia Infections.
Kalia, Vipin Chandra; Kumar, Prasun
2015-12-01
Bacterial identification on the basis of the highly conserved 16S rRNA (rrs) gene is limited by its presence in multiple copies and a very high level of similarity among them. The need is to look for other genes with unique characteristics to be used as biomarkers. Fifty-one sequenced genomes belonging to 10 different Yersinia species were used for searching genes common to all the genomes. Out of 304 common genes, 34 genes of sizes varying from 0.11 to 4.42 kb, were selected and subjected to in silico digestion with 10 different Restriction endonucleases (RE) (4-6 base cutters). Yersinia species have 6-7 copies of rrs per genome, which are difficult to distinguish by multiple sequence alignments or their RE digestion patterns. However, certain unique combinations of other common gene sequences-carB, fadJ, gluM, gltX, ileS, malE, nusA, ribD, and rlmL and their RE digestion patterns can be used as markers for identifying 21 strains belonging to 10 Yersinia species: Y. aldovae, Y. enterocolitica, Y. frederiksenii, Y. intermedia, Y. kristensenii, Y. pestis, Y. pseudotuberculosis, Y. rohdei, Y. ruckeri, and Y. similis. This approach can be applied for rapid diagnostic applications.
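In silico digestion of the kind described here amounts to locating recognition sites and cutting at a fixed offset. The following is a minimal sketch for a linear sequence that searches the given strand only, which is sufficient for palindromic sites; `digest` and `fragment_pattern` are hypothetical names, and the example uses EcoRI (G^AATTC) purely for illustration rather than as one of the enzymes from the study.

```python
# Minimal sketch; function names are hypothetical.
def digest(seq, site, cut_offset):
    """Cut a linear sequence at every occurrence of site.

    cut_offset is the number of bases from the start of the site to the cut
    (for EcoRI, G^AATTC, the offset is 1).
    """
    seq, site = seq.upper(), site.upper()
    cuts = []
    pos = seq.find(site)
    while pos != -1:
        cuts.append(pos + cut_offset)
        pos = seq.find(site, pos + 1)  # step by 1 to allow overlapping sites
    bounds = [0] + cuts + [len(seq)]
    return [seq[start:end] for start, end in zip(bounds, bounds[1:])]

def fragment_pattern(seq, site, cut_offset):
    """Fragment lengths sorted from longest to shortest, as on a gel."""
    return sorted((len(frag) for frag in digest(seq, site, cut_offset)), reverse=True)

gene = "AAGAATTCCCCGAATTCT"
print(digest(gene, "GAATTC", 1))
print(fragment_pattern(gene, "GAATTC", 1))
```

Comparing the `fragment_pattern` output of the same gene across genomes, for several enzymes, gives the kind of digestion signature that the abstract proposes as a marker.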
Evolution of the acyl-CoA binding protein (ACBP)
Burton, Mark; Rose, Timothy M.; Færgeman, Nils J.; Knudsen, Jens
2005-01-01
Acyl-CoA-binding protein (ACBP) is a 10 kDa protein that binds C12–C22 acyl-CoA esters with high affinity. In vitro and in vivo experiments suggest that it is involved in multiple cellular tasks including modulation of fatty acid biosynthesis, enzyme regulation, regulation of the intracellular acyl-CoA pool size, donation of acyl-CoA esters for β-oxidation, vesicular trafficking, complex lipid synthesis and gene regulation. In the present study, we delineate the evolutionary history of ACBP to get a complete picture of its evolution and distribution among species. ACBP homologues were identified in all four eukaryotic kingdoms, Animalia, Plantae, Fungi and Protista, and eleven eubacterial species. ACBP homologues were not detected in any other known bacterial species, or in archaea. Nearly all of the ACBP-containing bacteria are pathogenic to plants or animals, suggesting that an ACBP gene could have been acquired from a eukaryotic host by horizontal gene transfer. Many bacterial, fungal and higher eukaryotic species only harbour a single ACBP homologue. However, a number of species, ranging from protozoa to vertebrates, have evolved two to six lineage-specific paralogues through gene duplication and/or retrotransposition events. The ACBP protein is highly conserved across phyla, and the majority of ACBP genes are subjected to strong purifying selection. Experimental evidence indicates that the function of ACBP has been conserved from yeast to humans and that the multiple lineage-specific paralogues have evolved altered functions. The appearance of ACBP very early on in evolution points towards a fundamental role of ACBP in acyl-CoA metabolism, including ceramide synthesis and in signalling. PMID:16018771
Sanchez, Diego H; Pieckenstain, Fernando L; Szymanski, Jedrzey; Erban, Alexander; Bromke, Mariusz; Hannah, Matthew A; Kraemer, Ute; Kopka, Joachim; Udvardi, Michael K
2011-02-14
One of the objectives of plant translational genomics is to use knowledge and genes discovered in model species to improve crops. However, the value of translational genomics to plant breeding, especially for complex traits like abiotic stress tolerance, remains uncertain. Using comparative genomics (ionomics, transcriptomics and metabolomics) we analyzed the responses to salinity of three model and three cultivated species of the legume genus Lotus. At physiological and ionomic levels, models responded to salinity in a similar way to crop species, and changes in the concentration of shoot Cl(-) correlated well with tolerance. Metabolic changes were partially conserved, but divergence was observed amongst the genotypes. Transcriptome analysis showed that about 60% of expressed genes were responsive to salt treatment in one or more species, but less than 1% was responsive in all. Therefore, genotype-specific transcriptional and metabolic changes overshadowed conserved responses to salinity and represent an impediment to simple translational genomics. However, 'triangulation' from multiple genotypes enabled the identification of conserved and tolerant-specific responses that may provide durable tolerance across species.
CODEHOP (COnsensus-DEgenerate Hybrid Oligonucleotide Primer) PCR primer design
Rose, Timothy M.; Henikoff, Jorja G.; Henikoff, Steven
2003-01-01
We have developed a new primer design strategy for PCR amplification of distantly related gene sequences based on consensus-degenerate hybrid oligonucleotide primers (CODEHOPs). An interactive program has been written to design CODEHOP PCR primers from conserved blocks of amino acids within multiply-aligned protein sequences. Each CODEHOP consists of a pool of related primers containing all possible nucleotide sequences encoding 3–4 highly conserved amino acids within a 3′ degenerate core. A longer 5′ non-degenerate clamp region contains the most probable nucleotide predicted for each flanking codon. CODEHOPs are used in PCR amplification to isolate distantly related sequences encoding the conserved amino acid sequence. The primer design software and the CODEHOP PCR strategy have been utilized for the identification and characterization of new gene orthologs and paralogs in different plant, animal and bacterial species. In addition, this approach has been successful in identifying new pathogen species. The CODEHOP designer (http://blocks.fhcrc.org/codehop.html) is linked to BlockMaker and the Multiple Alignment Processor within the Blocks Database World Wide Web (http://blocks.fhcrc.org). PMID:12824413
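The degenerate core described in this abstract is a pool of every nucleotide sequence encoding a short run of conserved amino acids. The sketch below enumerates that pool with the standard genetic code and reports its size. It is a simplification: it uses whole codons, does not model the non-degenerate 5′ clamp, and its function names (`core_pool`, `degeneracy`) are hypothetical rather than part of the CODEHOP software.

```python
from itertools import product

# Standard genetic code in TCAG order; "*" marks stop codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODONS = {}
for (b1, b2, b3), aa in zip(product(BASES, repeat=3), AMINO):
    CODONS.setdefault(aa, []).append(b1 + b2 + b3)

def core_pool(peptide):
    """All nucleotide sequences that encode the given amino acid run."""
    choices = (CODONS[aa] for aa in peptide.upper())
    return ["".join(codons) for codons in product(*choices)]

def degeneracy(peptide):
    """Number of distinct sequences in the pool, without enumerating them."""
    total = 1
    for aa in peptide.upper():
        total *= len(CODONS[aa])
    return total

print(degeneracy("WDMK"), core_pool("WDMK"))
print(degeneracy("LSR"))
```

The contrast between a block such as WDMK and one such as LSR shows why conserved positions rich in Trp, Met, or two-codon residues make better core sites: the pool stays small, so each individual primer is present at a useful concentration.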
Zhang, Wensheng; Edwards, Andrea; Fan, Wei; Zhu, Dongxiao; Zhang, Kun
2010-06-22
Comparative analysis of gene expression profiling of multiple biological categories, such as different species of organisms or different kinds of tissue, promises to enhance the fundamental understanding of the universality as well as the specialization of mechanisms and related biological themes. Grouping genes with a similar expression pattern or exhibiting co-expression together is a starting point in understanding and analyzing gene expression data. In recent literature, gene module level analysis is advocated in order to understand biological network design and system behaviors in disease and life processes; however, practical difficulties often lie in the implementation of existing methods. Using the singular value decomposition (SVD) technique, we developed a new computational tool, named svdPPCS (SVD-based Pattern Pairing and Chart Splitting), to identify conserved and divergent co-expression modules of two sets of microarray experiments. In the proposed methods, gene modules are identified by splitting the two-way chart coordinated with a pair of left singular vectors factorized from the gene expression matrices of the two biological categories. Importantly, the cutoffs are determined by a data-driven algorithm using the well-defined statistic, SVD-p. The implementation was illustrated on two time series microarray data sets generated from the samples of accessory gland (ACG) and malpighian tubule (MT) tissues of the line W118 of M. drosophila. Two conserved modules and six divergent modules, each of which has a unique characteristic profile across tissue kinds and aging processes, were identified. The number of genes contained in these models ranged from five to a few hundred. Three to over a hundred GO terms were over-represented in individual modules with FDR < 0.1. One divergent module suggested the tissue-specific relationship between the expressions of mitochondrion-related genes and the aging process. 
This finding, together with others, may be of biological significance. The validity of the proposed SVD-based method was further verified by a simulation study, as well as the comparisons with regression analysis and cubic spline regression analysis plus PAM based clustering. svdPPCS is a novel computational tool for the comparative analysis of transcriptional profiling. It especially fits the comparison of time series data of related organisms or different tissues of the same organism under equivalent or similar experimental conditions. The general scheme can be directly extended to the comparisons of multiple data sets. It also can be applied to the integration of data sets from different platforms and of different sources.
Bagley, Justin C.; Alda, Fernando; Breitman, M. Florencia; Bermingham, Eldredge; van den Berghe, Eric P.; Johnson, Jerald B.
2015-01-01
Accurately delimiting species is fundamentally important for understanding species diversity and distributions and devising effective strategies to conserve biodiversity. However, species delimitation is problematic in many taxa, including ‘non-adaptive radiations’ containing morphologically cryptic lineages. Fortunately, coalescent-based species delimitation methods hold promise for objectively estimating species limits in such radiations, using multilocus genetic data. Using coalescent-based approaches, we delimit species and infer evolutionary relationships in a morphologically conserved group of Central American freshwater fishes, the Poecilia sphenops species complex. Phylogenetic analyses of multiple genetic markers (sequences of two mitochondrial DNA genes and five nuclear loci) from 10/15 species and genetic lineages recognized in the group support the P. sphenops species complex as monophyletic with respect to outgroups, with eight mitochondrial ‘major-lineages’ diverged by ≥2% pairwise genetic distances. From general mixed Yule-coalescent models, we discovered (conservatively) 10 species within our concatenated mitochondrial DNA dataset, 9 of which were strongly supported by subsequent multilocus Bayesian species delimitation and species tree analyses. Results suggested species-level diversity is underestimated or overestimated by at least ~15% in different lineages in the complex. Nonparametric statistics and coalescent simulations indicate genealogical discordance among our gene tree results has mainly derived from interspecific hybridization in the nuclear genome. However, mitochondrial DNA show little evidence for introgression, and our species delimitation results appear robust to effects of this process. Overall, our findings support the utility of combining multiple lines of genetic evidence and broad phylogeographical sampling to discover and validate species using coalescent-based methods. 
Our study also highlights the importance of testing for hybridization versus incomplete lineage sorting, which aids inference of not only species limits but also evolutionary processes influencing genetic diversity. PMID:25849959
Rare Variant Association Test with Multiple Phenotypes
Lee, Selyeong; Won, Sungho; Kim, Young Jin; Kim, Yongkang; Kim, Bong-Jo; Park, Taesung
2016-01-01
Although genome-wide association studies (GWAS) have now discovered thousands of genetic variants associated with common traits, such variants cannot explain the large degree of “missing heritability,” likely due to rare variants. The advent of next generation sequencing technology has allowed rare variant detection and association with common traits, often by investigating specific genomic regions for rare variant effects on a trait. Although multiply correlated phenotypes are often concurrently observed in GWAS, most studies analyze only single phenotypes, which may lessen statistical power. To increase power, multivariate analyses, which consider correlations between multiple phenotypes, can be used. However, few existing multi-variant analyses can identify rare variants for assessing multiple phenotypes. Here, we propose Multivariate Association Analysis using Score Statistics (MAAUSS), to identify rare variants associated with multiple phenotypes, based on the widely used Sequence Kernel Association Test (SKAT) for a single phenotype. We applied MAAUSS to Whole Exome Sequencing (WES) data from a Korean population of 1,058 subjects, to discover genes associated with multiple traits of liver function. We then assessed validation of those genes by a replication study, using an independent dataset of 3,445 individuals. Notably, we detected the gene ZNF620 among five significant genes. We then performed a simulation study to compare MAAUSS's performance with existing methods. Overall, MAAUSS successfully conserved type 1 error rates and in many cases, had a higher power than the existing methods. This study illustrates a feasible and straightforward approach for identifying rare variants correlated with multiple phenotypes, with likely relevance to missing heritability. PMID:28039885
SET1A/COMPASS and shadow enhancers in the regulation of homeotic gene expression
Cao, Kaixiang; Collings, Clayton K.; Marshall, Stacy A.; Morgan, Marc A.; Rendleman, Emily J.; Wang, Lu; Sze, Christie C.; Sun, Tianjiao; Bartom, Elizabeth T.; Shilatifard, Ali
2017-01-01
The homeotic (Hox) genes are highly conserved in metazoans, where they are required for various processes in development, and misregulation of their expression is associated with human cancer. In the developing embryo, Hox genes are activated sequentially in time and space according to their genomic position within Hox gene clusters. Accumulating evidence implicates both enhancer elements and noncoding RNAs in controlling this spatiotemporal expression of Hox genes, but disentangling their relative contributions is challenging. Here, we identify two cis-regulatory elements (E1 and E2) functioning as shadow enhancers to regulate the early expression of the HoxA genes. Simultaneous deletion of these shadow enhancers in embryonic stem cells leads to impaired activation of HoxA genes upon differentiation, while knockdown of a long noncoding RNA overlapping E1 has no detectable effect on their expression. Although MLL/COMPASS (complex of proteins associated with Set1) family of histone methyltransferases is known to activate transcription of Hox genes in other contexts, we found that individual inactivation of the MLL1-4/COMPASS family members has little effect on early Hox gene activation. Instead, we demonstrate that SET1A/COMPASS is required for full transcriptional activation of multiple Hox genes but functions independently of the E1 and E2 cis-regulatory elements. Our results reveal multiple regulatory layers for Hox genes to fine-tune transcriptional programs essential for development. PMID:28487406
Li, Qi; Zhang, Ning; Zhang, Liangsheng; Ma, Hong
2015-04-01
Rhomboid proteins are intramembrane serine proteases that are involved in a plethora of biological functions, but the evolutionary history of the rhomboid gene family is not clear. We performed a comprehensive molecular evolutionary analysis of the rhomboid gene family and also investigated the organization and sequence features of plant rhomboids in different subfamilies. Our results showed that eukaryotic rhomboids could be divided into five subfamilies (RhoA-RhoD and PARL). Most orthology groups appeared to be conserved only as single or low-copy genes in all lineages in RhoB-RhoD and PARL, whereas RhoA genes underwent several duplication events, resulting in multiple gene copies. These duplication events were due to whole genome duplications in plants and animals and the duplicates might have experienced functional divergence. We also identified a novel group of plant rhomboid (RhoB1) that might have lost their enzymatic activity; their existence suggests that they might have evolved new mechanisms. Plant and animal rhomboids have similar evolutionary patterns. In addition, there are mutations affecting key active sites in RBL8, RBL9 and one of the Brassicaceae PARL duplicates. This study delineates a possible evolutionary scheme for intramembrane proteins and illustrates distinct fates and a mechanism of evolution of gene duplicates. © 2014 The Authors. New Phytologist © 2014 New Phytologist Trust.
Developmental Pathways Are Blueprints for Designing Successful Crops.
Trevaskis, Ben
2018-01-01
Genes controlling plant development have been studied in multiple plant systems. This has provided deep insights into conserved genetic pathways controlling core developmental processes including meristem identity, phase transitions, determinacy, stem elongation, and branching. These pathways control plant growth patterns and are fundamentally important to crop biology and agriculture. This review describes the conserved pathways that control plant development, using Arabidopsis as a model. Historical examples of how plant development has been altered through selection to improve crop performance are then presented. These examples, drawn from diverse crops, show how the genetic pathways controlling development have been modified to increase yield or tailor growth patterns to suit local growing environments or specialized crop management practices. Strategies to apply current progress in genomics and developmental biology to future crop improvement are then discussed within the broader context of emerging trends in plant breeding. The ways that knowledge of developmental processes and understanding of gene function can contribute to crop improvement, beyond what can be achieved by selection alone, are emphasized. These include using genome re-sequencing, mutagenesis, and gene editing to identify or generate novel variation in developmental genes. The expanding scope for comparative genomics, the possibility to engineer new developmental traits and new approaches to resolve gene-gene or gene-environment interactions are also discussed. Finally, opportunities to integrate fundamental research and crop breeding are highlighted.
Ma, Wenjun; Lager, Kelly M; Lekcharoensuk, Porntippa; Ulery, Eva S; Janke, Bruce H; Solórzano, Alicia; Webby, Richard J; García-Sastre, Adolfo; Richt, Jürgen A
2010-09-01
Triple-reassortant swine influenza viruses circulating in North American pigs contain the internal genes derived from swine (matrix, non-structural and nucleoprotein), human [polymerase basic 1 (PB1)] and avian (polymerase acidic and PB2) influenza viruses forming a constellation of genes that is well conserved and is called the triple-reassortant internal gene (TRIG) cassette. In contrast, the external genes [haemagglutinin (HA) and neuraminidase (NA)] are less conserved, reflecting multiple reassortant events that have produced viruses with different combinations of HA and NA genes. This study hypothesized that maintenance of the TRIG cassette confers a selective advantage to the virus. To test this hypothesis, pigs were co-infected with the triple-reassortant H3N2 A/Swine/Texas/4199-2/98 (Tx/98) and the classical H1N1 A/Swine/Iowa/15/1930 viruses and co-housed with a group of sentinel animals. This direct contact group was subsequently moved into contact with a second group of naïve animals. Four different subtypes (H1N1, H1N2, H3N1 and H3N2) of influenza virus were identified in bronchoalveolar lavage fluid collected from the lungs of the experimentally infected pigs, with most of the viruses containing TRIG from the Tx/98 virus. Interestingly, only the intact H3N2 Tx/98 virus was transmitted from the infected pigs to the direct-contact animals and from them to the second contact group of pigs. These results demonstrated that multiple reassortments can occur within a host; however, only specific gene constellations are readily transmissible. It was concluded that certain HA and NA gene pairs, in conjunction with the TRIG cassette, may have a competitive advantage over other combinations for transmission and maintenance in swine.
Evolution of two Rh blood group-related genes of the amphioxus species Branchiostoma floridae.
Kitano, Takashi; Satou, Masahiro; Saitou, Naruya
2010-04-01
We determined cDNAs of two genes that belong to the Rhesus (Rh) blood group gene family in an amphioxus species (Branchiostoma floridae) and designated them Rh-related-1 (RhR-1) and Rh-related-2 (RhR-2). RhR-1 and RhR-2 consisted of 10 and 11 exons, respectively. 3' UTR sequences of RhR-1 were shorter (220-272 bp) than those of RhR-2 (1,505-1,650 bp). CDS lengths were 1,344 and 1,476 bp for RhR-1 and RhR-2, respectively, and the average nucleotide difference between their CDS regions was 0.33. The corresponding regions of Rh genes from exons 2 to 7 were relatively conserved among the chordate species examined in this study. Length difference numbers were in multiples of three, which implies that codon frames were conserved among them, and the same exon/intron boundary phases were observed in those regions. This region was used for the phylogenetic analyses. RhR-1 and RhR-2 formed a cluster on the phylogenetic tree of the Rh gene family. Gene duplication time of RhR-1 and RhR-2 was estimated to be ca. 500 million years ago. It is likely that the four Rh family genes in vertebrates emerged by gene duplications in the common ancestor of vertebrates, and functional differentiation has occurred after the first gene duplication.
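The frame-conservation reasoning in this abstract (length differences that are multiples of three leave the codon frame intact) can be illustrated with a short check. This is a minimal sketch, not the authors' procedure: the function name and the region lengths below are hypothetical values chosen for illustration and do not come from the study.

```python
from itertools import combinations


def frame_preserving(len_a: int, len_b: int) -> bool:
    """Return True if two coding-region lengths differ by a multiple of three."""
    return (len_a - len_b) % 3 == 0


# Hypothetical lengths (bp) of the same aligned coding region in three species.
region_lengths = {"species_A": 822, "species_B": 828, "species_C": 819}

# Compare every pair of species and report whether the codon frame is kept.
for (name_a, len_a), (name_b, len_b) in combinations(region_lengths.items(), 2):
    diff = abs(len_a - len_b)
    print(f"{name_a} vs {name_b}: difference {diff} bp, "
          f"frame preserved: {frame_preserving(len_a, len_b)}")
```

A difference that is not divisible by three (for example 822 vs 826 bp) would indicate a frameshift somewhere in the compared region.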
CoSMoS: Conserved Sequence Motif Search in the proteome
Liu, Xiao I; Korde, Neeraj; Jakob, Ursula; Leichert, Lars I
2006-01-01
Background With the ever-increasing number of gene sequences in the public databases, generating and analyzing multiple sequence alignments becomes increasingly time consuming. Nevertheless it is a task performed on a regular basis by researchers in many labs. Results We have now created a database called CoSMoS to find the occurrences and at the same time evaluate the significance of sequence motifs and amino acids encoded in the whole genome of the model organism Escherichia coli K12. We provide a precomputed set of multiple sequence alignments for each individual E. coli protein with all of its homologues in the RefSeq database. The alignments themselves, information about the occurrence of sequence motifs together with information on the conservation of each of the more than 1.3 million amino acids encoded in the E. coli genome can be accessed via the web interface of CoSMoS. Conclusion CoSMoS is a valuable tool to identify highly conserved sequence motifs, to find regions suitable for mutational studies in functional analyses and to predict important structural features in E. coli proteins. PMID:16433915
Conserved structure and expression of hsp70 paralogs in teleost fishes.
Metzger, David C H; Hemmer-Hansen, Jakob; Schulte, Patricia M
2016-06-01
The cytosolic 70 kDa heat shock proteins (Hsp70s) are widely used as biomarkers of environmental stress in ecological and toxicological studies in fish. Here we analyze teleost genome sequences to show that two genes encoding inducible hsp70s (hsp70-1 and hsp70-2) are likely present in all teleost fish. Phylogenetic and synteny analyses indicate that hsp70-1 and hsp70-2 are distinct paralogs that originated prior to the diversification of the teleosts. The promoters of both genes contain a TATA box and conserved heat shock elements (HSEs), but unlike mammalian HSP70s, both genes contain an intron in the 5' UTR. The hsp70-2 gene has undergone tandem duplication in several species. In addition, many other teleost genome assemblies have multiple copies of hsp70-2 present on separate, small, genomic scaffolds. To verify that these represent poorly assembled tandem duplicates, we cloned the genomic region surrounding hsp70-2 in Fundulus heteroclitus and showed that the hsp70-2 gene copies that are on separate scaffolds in the genome assembly are arranged as tandem duplicates. Real-time quantitative PCR of F. heteroclitus genomic DNA indicates that four copies of the hsp70-2 gene are likely present in the F. heteroclitus genome. Comparison of expression patterns in F. heteroclitus and Gasterosteus aculeatus demonstrates that hsp70-2 has a higher fold increase than hsp70-1 following heat shock in gill but not in muscle tissue, revealing a conserved difference in expression patterns between isoforms and tissues. These data indicate that ecological and toxicological studies using hsp70 as a biomarker in teleosts should take this complexity into account. Copyright © 2016 Elsevier Inc. All rights reserved.
2013-01-01
Background MicroRNAs (miRNAs) are an abundant class of endogenous small RNA molecules that downregulate gene expression at the posttranscriptional level. They play important roles in multiple biological processes by regulating genes that control developmental timing, growth, stem cell division and apoptosis by binding to the mRNA of target genes. Despite the position Atlantic salmon (Salmo salar) has as an economically important domesticated animal, there has been little research on miRNAs in this species. Knowledge about miRNAs and their target genes may be used to control health and to improve performance of economically important traits. However, before their biological function can be unravelled they must be identified and annotated. The aims of this study were to identify and characterize miRNA genes in Atlantic salmon by deep sequencing analysis of small RNA libraries from nine different tissues. Results A total of 180 distinct mature miRNAs belonging to 106 families of evolutionarily conserved miRNAs, and 13 distinct novel mature miRNAs were discovered and characterized. The mature miRNAs corresponded to 521 putative precursor sequences located at unique genome locations. About 40% of these precursors were part of gene clusters, and the majority of the Salmo salar gene clusters discovered were conserved across species. Comparison of expression levels in samples from different tissues applying DESeq indicated that there were tissue specific expression differences in three conserved and one novel miRNA. Ssa-miR 736 was detected in heart tissue only, while two other clustered miRNAs (ssa-miR 212 and 132) seem to be at a higher expression level in brain tissue. These observations correlate well with their expected functions as regulators of signal pathways in cardiac and neuronal cells, respectively. Ssa-miR 8163 is one of the novel miRNAs discovered and its function remains unknown. However, differential expression analysis using DESeq suggests that this miRNA is enriched in liver tissue and the precursor was mapped to intron 7 of the transferrin gene. Conclusions The identification and annotation of evolutionarily conserved and novel Salmo salar miRNAs as well as the characterization of miRNA gene clusters provide biological knowledge that will greatly facilitate further functional studies on miRNAs in this species. PMID:23865519
Core Promoter Functions in the Regulation of Gene Expression of Drosophila Dorsal Target Genes*
Zehavi, Yonathan; Kuznetsov, Olga; Ovadia-Shochat, Avital; Juven-Gershon, Tamar
2014-01-01
Developmental processes are highly dependent on transcriptional regulation by RNA polymerase II. The RNA polymerase II core promoter is the ultimate target of a multitude of transcription factors that control transcription initiation. Core promoters consist of core promoter motifs, e.g. the initiator, TATA box, and the downstream core promoter element (DPE), which confer specific properties to the core promoter. Here, we explored the importance of core promoter functions in the dorsal-ventral developmental gene regulatory network. This network includes multiple genes that are activated by different nuclear concentrations of Dorsal, an NFκB homolog transcription factor, along the dorsal-ventral axis. We show that over two-thirds of Dorsal target genes contain DPE sequence motifs, which is significantly higher than the proportion of DPE-containing promoters in Drosophila genes. We demonstrate that multiple Dorsal target genes are evolutionarily conserved and functionally dependent on the DPE. Furthermore, we have analyzed the activation of key Dorsal target genes by Dorsal, as well as by another Rel family transcription factor, Relish, and the dependence of their activation on the DPE motif. Using hybrid enhancer-promoter constructs in Drosophila cells and embryo extracts, we have demonstrated that the core promoter composition is an important determinant of transcriptional activity of Dorsal target genes. Taken together, our results provide evidence for the importance of core promoter composition in the regulation of Dorsal target genes. PMID:24634215
Highly Conserved Mitochondrial Genomes among Multicellular Red Algae of the Florideophyceae
Yang, Eun Chan; Kim, Kyeong Mi; Kim, Su Yeon; Lee, JunMo; Boo, Ga Hun; Lee, Jung-Hyun; Nelson, Wendy A.; Yi, Gangman; Schmidt, William E.; Fredericq, Suzanne; Boo, Sung Min; Bhattacharya, Debashish; Yoon, Hwan Su
2015-01-01
Two red algal classes, the Florideophyceae (approximately 7,100 spp.) and Bangiophyceae (approximately 193 spp.), comprise 98% of red algal diversity in marine and freshwater habitats. These two classes form well-supported monophyletic groups in most phylogenetic analyses. Nonetheless, the interordinal relationships remain largely unresolved, in particular in the largest subclass Rhodymeniophycidae that includes 70% of all species. To elucidate red algal phylogenetic relationships and study organelle evolution, we determined the sequence of 11 mitochondrial genomes (mtDNA) from 5 florideophycean subclasses. These mtDNAs were combined with existing data, resulting in a database of 25 florideophytes and 12 bangiophytes (including cyanidiophycean species). A concatenated alignment of mt proteins was used to resolve ordinal relationships in the Rhodymeniophycidae. Red algal mtDNA genome comparisons showed 47 instances of gene rearrangement including 12 that distinguish Bangiophyceae from Hildenbrandiophycidae, and 5 that distinguish Hildenbrandiophycidae from Nemaliophycidae. These organelle data support a rapid radiation and surprisingly high conservation of mtDNA gene synteny among the morphologically divergent multicellular lineages of Rhodymeniophycidae. In contrast, we find extensive mitochondrial gene rearrangements when comparing Bangiophyceae and Florideophyceae and multiple examples of gene loss among the different red algal lineages. PMID:26245677
Roller, Richard J; Fetters, Rachel
2015-03-01
The alphaherpesvirus UL51 protein is a tegument component that interacts with the viral glycoprotein E and functions at multiple steps in virus assembly and spread in epithelial cells. We show here that pUL51 forms a complex in infected cells with another conserved tegument protein, pUL7. This complex can form in the absence of other viral proteins and is largely responsible for recruitment of pUL7 to cytoplasmic membranes and into the virion tegument. Incomplete colocalization of pUL51 and pUL7 in infected cells, however, suggests that a significant fraction of the population of each protein is not complexed with the other and that they may accomplish independent functions. The ability of herpesviruses to spread from cell to cell in the face of an immune response is critical for disease and shedding following reactivation from latency. Cell-to-cell spread is a conserved ability of herpesviruses, and the identification of conserved viral genes that mediate this process will aid in the design of attenuated vaccines and of novel therapeutics. The conserved UL51 gene of herpes simplex virus 1 plays important roles in cell-to-cell spread and in virus assembly in the cytoplasm, both of which likely depend on specific interactions with other viral and cellular proteins. Here we identify one of those interactions with the product of another conserved herpesvirus gene, UL7, and show that formation of this complex mediates recruitment of UL7 to membranes and to the virion. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Sidorenko, Lyudmila; Dorweiler, Jane E; Cigan, A Mark; Arteaga-Vazquez, Mario; Vyas, Meenal; Kermicle, Jerry; Jurcin, Diane; Brzeski, Jan; Cai, Yu; Chandler, Vicki L
2009-11-01
Paramutation involves homologous sequence communication that leads to meiotically heritable transcriptional silencing. We demonstrate that mop2 (mediator of paramutation2), which alters paramutation at multiple loci, encodes a gene similar to Arabidopsis NRPD2/E2, the second-largest subunit of plant-specific RNA polymerases IV and V. In Arabidopsis, Pol-IV and Pol-V play major roles in RNA-mediated silencing and a single second-largest subunit is shared between Pol-IV and Pol-V. Maize encodes three second-largest subunit genes: all three genes potentially encode full length proteins with highly conserved polymerase domains, and each is expressed in multiple overlapping tissues. The isolation of a recessive paramutation mutation in mop2 from a forward genetic screen suggests limited or no functional redundancy of these three genes. Potential alternative Pol-IV/Pol-V-like complexes could provide maize with a greater diversification of RNA-mediated transcriptional silencing machinery relative to Arabidopsis. Mop2-1 disrupts paramutation at multiple loci when heterozygous, whereas previously silenced alleles are only up-regulated when Mop2-1 is homozygous. The dramatic reduction in b1 tandem repeat siRNAs, but no disruption of silencing in Mop2-1 heterozygotes, suggests the major role for tandem repeat siRNAs is not to maintain silencing. Instead, we hypothesize the tandem repeat siRNAs mediate the establishment of the heritable silent state, a process fully disrupted in Mop2-1 heterozygotes. The dominant Mop2-1 mutation, which has a single nucleotide change in a domain highly conserved among all polymerases (E. coli to eukaryotes), disrupts both siRNA biogenesis (Pol-IV-like) and potentially processes downstream (Pol-V-like). These results suggest either the wild-type protein is a subunit in both complexes or the dominant mutant protein disrupts both complexes. Dominant mutations in the same domain in E. coli RNA polymerase suggest a model for Mop2-1 dominance: complexes containing Mop2-1 subunits are non-functional and compete with wild-type complexes.
In vitro validation of self designed "universal human Influenza A siRNA".
Jain, Bhawana; Jain, Amita; Prakash, Om; Singh, Ajay Kr; Dangi, Tanushree; Singh, Mastan; Singh, K P
2015-08-01
The genomic variability of Influenza A virus (IAV) makes it difficult for existing vaccines or anti-influenza drugs to control. An siRNA targeting a viral gene induces the RNAi mechanism in the host and silences the gene by cleaving its mRNA. In this study, we developed a universal siRNA and validated its efficiency in vitro. The siRNA was designed rationally, targeting the most conserved region (delineated with the help of multiple sequence alignment) of the M gene of IAV strains. A three-level screening method was adopted, and the most efficient siRNA was selected on the basis of its unique position in the conserved region. The siRNA efficacy was confirmed in vitro with the Madin Darby Canine Kidney (MDCK) cell line for IAV propagation using two clinical isolates, i.e., Influenza A/H3N2 and Influenza A/pdmH1N1. From a total of 168 strains worldwide and 33 strains from India, a 97 bp long (position 137-233) conserved region was identified. The longest ORF of the matrix gene was targeted by the selected siRNA, which showed 73.6% inhibition of replication of Influenza A/pdmH1N1 and 62.1% inhibition of replication of Influenza A/H3N2 at 48 h post infection in the MDCK cell line. This study provides a basis for the development of siRNA that can be used as a universal anti-IAV therapeutic agent.
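The idea of delineating a conserved target region from a multiple sequence alignment can be sketched as follows. This is an illustrative toy, not the authors' screening pipeline: the function name and the three short aligned sequences are hypothetical, and "conserved" is assumed here to mean that every sequence carries the same non-gap character in a column.

```python
def longest_conserved_run(alignment):
    """Return (start, length) of the longest run of identical, gap-free columns.

    `alignment` is a list of equal-length aligned sequences; `start` is 0-based.
    """
    width = len(alignment[0])
    if any(len(seq) != width for seq in alignment):
        raise ValueError("all aligned sequences must have the same length")

    best_start, best_len = 0, 0
    run_start = None
    # Iterate one position past the end so a run reaching the last column is closed.
    for i in range(width + 1):
        conserved = (
            i < width
            and alignment[0][i] != "-"
            and len({seq[i] for seq in alignment}) == 1
        )
        if conserved:
            if run_start is None:
                run_start = i
        elif run_start is not None:
            if i - run_start > best_len:
                best_start, best_len = run_start, i - run_start
            run_start = None
    return best_start, best_len


# Hypothetical toy alignment (not real influenza M gene sequence).
toy_alignment = [
    "AUGGCUAACGU",
    "AUGGCUAACGA",
    "ACGGCUAACGU",
]
start, length = longest_conserved_run(toy_alignment)
print(f"conserved region: positions {start + 1}-{start + length} ({length} nt)")
```

With 1-based inclusive coordinates the region length is end - start + 1, which is consistent with the 97 bp reported for positions 137-233.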
Zhou, Qingxiang; Zhang, Tianyi; Xu, Weihua; Yu, Linlin; Yi, Yongzhu; Zhang, Zhifang
2008-01-01
Background achaete-scute complexe (AS-C) has been widely studied at genetic, developmental and evolutionary levels. Genes of this family encode proteins containing a highly conserved bHLH domain, which take part in regulating the development of the central nervous system and peripheral nervous system. Many AS-C homologs have been isolated from various vertebrates and invertebrates. Also, AS-C genes were duplicated during the evolution of Diptera. Functions besides the control of neural development have also been found for Drosophila AS-C genes. Results We cloned four achaete-scute homologs (ASH) from the lepidopteran model organism Bombyx mori, including three proneural genes and one neural precursor gene. Proteins encoded by them contained the characteristic bHLH domain and the three proneural ones were also found to have the C-terminal conserved motif. These genes regulated promoter activity through the Class A E-boxes in vitro. Though both Bm-ASH and Drosophila AS-C have four members, they do not correspond one-to-one. Results of RT-PCR and real-time PCR showed that Bm-ASH genes were expressed in different larval tissues, and had well-regulated expression profiles during the development of embryo and wing/wing disc. Conclusion There are four achaete-scute homologs in Bombyx mori, the second insect found to have four AS-C genes so far, and these genes have multiple functions in the silkworm life cycle. AS-C gene duplication in insects occurred after or in parallel with, but not before, the formation of the taxonomic orders during evolution. PMID:18321391
Unraveling flp-11/flp-32 dichotomy in nematodes.
Atkinson, Louise E; Miskelly, Iain R; Moffett, Christy L; McCoy, Ciaran J; Maule, Aaron G; Marks, Nikki J; Mousley, Angela
2016-10-01
FMRFamide-like peptide (FLP) signalling systems are core to nematode neuromuscular function. Novel drug discovery efforts associated with nematode FLP/FLP receptor biology are advanced through the accumulation of basic biological data that can reveal subtle complexities within the neuropeptidergic system. This study reports the characterisation of FMRFamide-like peptide encoding gene-11 (flp-11) and FMRFamide-like peptide encoding gene-32 (flp-32), two distinct flp genes which encode the analogous peptide, AMRN(A/S)LVRFamide, in multiple nematode species - the only known example of this phenomenon within the FLPergic system of nematodes. Using bioinformatics, in situ hybridisation, immunocytochemistry and behavioural assays we show that: (i) flp-11 and -32 are distinct flp genes expressed individually or in tandem across multiple nematode species, where they encode a highly similar peptide; (ii) flp-11 does not appear to be the most widely expressed flp in Caenorhabditis elegans; (iii) in species expressing both flp-11 and flp-32, flp-11 displays a conserved, restricted expression pattern across nematode clades and lifestyles; (iv) in species expressing both flp-11 and flp-32, flp-32 expression is more widespread and less conserved than flp-11; (v) in species expressing only flp-11, the flp-11 expression profile is more similar to the flp-32 profile observed in species expressing both; and (vi) FLP-11 peptides inhibit motor function in multiple nematode species. The biological significance and evolutionary origin of flp-11 and -32 peptide duplication remain unclear despite attempts to identify a common ancestor; this may become clearer as the availability of genomic data improves. This work provides insight into the complexity of the neuropeptidergic system in nematodes, and begins to examine how nematodes may compensate for structural neuronal simplicity. From a parasite control standpoint, this work underscores the importance of basic biological data, and has wider implications for the utility of C. elegans as a model for parasite neurobiology. Copyright © 2016 The Authors. Published by Elsevier Ltd. All rights reserved.
Chen, Chao; Chen, Ranran; Wu, Shengyang; Zhu, Dan; Sun, Xiaoli; Liu, Beidong; Li, Qiang; Zhu, Yanming
2018-03-26
Ubiquitin is a highly conserved protein with multiple essential regulatory functions through the ubiquitin-proteasome system. Even though its functions in the ubiquitin-mediated protein degradation pathway are well characterized, the functions of ubiquitin genes in regulating the alkaline stress response are not fully established. In this study, we identified 12 potential UBQ genes in the Glycine soja genome, and analyzed their evolutionary relationships, conserved domains and promoter cis-elements. We also explored the expression profiles of G. soja UBQ genes under alkaline stress, based on transcriptome sequencing. We found that the expression of GsUBQ10 was significantly induced by alkaline stress, and the function of GsUBQ10 was characterized using overexpression in transgenic alfalfa (Medicago sativa). Our results suggested that GsUBQ10 overexpression significantly improved alkaline tolerance in alfalfa. The GsUBQ10 transgenic lines showed lower relative membrane permeability, lower malondialdehyde content and higher catalase activity than the wild-type plants. This indicates that GsUBQ10 is involved in regulating reactive oxygen species accumulation under alkaline stress. Taken together, we identified a ubiquitin gene, GsUBQ10, from G. soja, which plays a positive role in the response to alkaline stress in alfalfa.
WormQTLHD—a web database for linking human disease to natural variation data in C. elegans
van der Velde, K. Joeri; de Haan, Mark; Zych, Konrad; Arends, Danny; Snoek, L. Basten; Kammenga, Jan E.; Jansen, Ritsert C.; Swertz, Morris A.; Li, Yang
2014-01-01
Interactions between proteins are highly conserved across species. As a result, the molecular basis of multiple diseases affecting humans can be studied in model organisms that offer many alternative experimental opportunities. One such organism—Caenorhabditis elegans—has been used to produce much molecular quantitative genetics and systems biology data over the past decade. We present WormQTLHD (Human Disease), a database that quantitatively and systematically links expression Quantitative Trait Loci (eQTL) findings in C. elegans to gene–disease associations in man. WormQTLHD, available online at http://www.wormqtl-hd.org, is a user-friendly set of tools to reveal functionally coherent, evolutionary conserved gene networks. These can be used to predict novel gene-to-gene associations and the functions of genes underlying the disease of interest. We created a new database that links C. elegans eQTL data sets to human diseases (34 337 gene–disease associations from OMIM, DGA, GWAS Central and NHGRI GWAS Catalogue) based on overlapping sets of orthologous genes associated to phenotypes in these two species. We utilized QTL results, high-throughput molecular phenotypes, classical phenotypes and genotype data covering different developmental stages and environments from WormQTL database. All software is available as open source, built on MOLGENIS and xQTL workbench. PMID:24217915
2016-01-01
Color variation provides the opportunity to investigate the genetic basis of evolution and selection. Reptiles are less studied than mammals. Comparative genomics approaches allow for knowledge gained in one species to be leveraged for use in another species. We describe a comparative vertebrate analysis of conserved regulatory modules in pythons aimed at assessing bioinformatics evidence that transcription factors important in mammalian pigmentation phenotypes may also be important in python pigmentation phenotypes. We identified 23 python orthologs of mammalian genes associated with variation in coat color phenotypes for which we assessed the extent of pairwise protein sequence identity between pythons and mouse, dog, horse, cow, chicken, anole lizard, and garter snake. We next identified a set of melanocyte/pigment associated transcription factors (CREB, FOXD3, LEF-1, MITF, POU3F2, and USF-1) that exhibit relatively conserved sequence similarity within their DNA binding regions across species based on orthologous alignments across multiple species. Finally, we identified 27 evolutionarily conserved clusters of transcription factor binding sites within ~200-nucleotide intervals of the 1500-nucleotide upstream regions of AIM1, DCT, MC1R, MITF, MLANA, OA1, PMEL, RAB27A, and TYR from Python bivittatus. Our results provide insight into pigment phenotypes in pythons. PMID:27698666
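The notion of a cluster of transcription factor binding sites within a ~200-nucleotide interval of a 1500-nucleotide upstream region can be made concrete with a small grouping routine. This is a hedged sketch under stated assumptions, not the method used in the study: the function name, the minimum of three sites per cluster, and the site coordinates are all hypothetical.

```python
def site_clusters(positions, window=200, min_sites=3):
    """Greedily group site positions into clusters spanning less than `window` nt.

    `positions` are coordinates relative to the transcription start site
    (negative values are upstream). Only groups with at least `min_sites`
    sites are reported; `min_sites` is an assumption made for this example.
    """
    ordered = sorted(positions)
    clusters = []
    i = 0
    while i < len(ordered):
        j = i
        # Extend the group while the next site is still inside the window.
        while j + 1 < len(ordered) and ordered[j + 1] - ordered[i] < window:
            j += 1
        if j - i + 1 >= min_sites:
            clusters.append(ordered[i:j + 1])
            i = j + 1  # continue after the reported cluster
        else:
            i += 1     # too few sites; try starting from the next site
    return clusters


# Hypothetical predicted binding-site positions in a 1500-nt upstream region.
predicted_sites = [-1450, -1400, -1320, -900, -300, -250, -120, -40]
for cluster in site_clusters(predicted_sites):
    print(f"cluster from {cluster[0]} to {cluster[-1]}: {len(cluster)} sites")
```

A conservation step, such as requiring that the same cluster appears in the orthologous upstream region of other species, would have to be layered on top of this grouping and is not shown here.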
NASA Astrophysics Data System (ADS)
Zhao, Yanbin; Zhang, Kun; Giesy, John P.; Hu, Jianying
2015-02-01
Various synthetic chemicals are ligands for nuclear receptors (NRs) and can cause adverse effects in vertebrates mediated by NRs. While several model vertebrates, such as mouse, chicken, western clawed frog and zebrafish, are widely used in toxicity testing, few NRs have been well described for most of these classes. In this report, NRs in genomes of 12 vertebrates are characterized via bioinformatics approaches. Although numbers of NRs varied among species, from 40-42 genes in birds to 66-74 genes in teleost fishes, all NRs had clear homologs in human and could be categorized into seven subfamilies defined as NR0B-NR6A. Phylogenetic analysis revealed conservative evolutionary relationships for most NRs, which were consistent with traditional morphology-based systematics, with some exceptions in the dolphin (Tursiops truncatus). Evolution of PXR and CAR exhibited unexpected multiple patterns, and the existence of CAR can possibly be traced back to ancient lobe-finned fishes and tetrapods (Sarcopterygii). Compared to the more conservative DBD of NRs, sequences of LBD were less conserved: sequences of THRs, RARs and RXRs were ≥90% similar to those of the human; ERs, AR, GR, ERRs and PPARs were more variable, with similarities of 60%-100%; and PXR, CAR, DAX1 and SHP were least conserved among species.
Irizarry, Kristopher J L; Bryden, Randall L
2016-01-01
Color variation provides the opportunity to investigate the genetic basis of evolution and selection. Reptiles are less studied than mammals. Comparative genomics approaches allow for knowledge gained in one species to be leveraged for use in another species. We describe a comparative vertebrate analysis of conserved regulatory modules in pythons aimed at assessing bioinformatics evidence that transcription factors important in mammalian pigmentation phenotypes may also be important in python pigmentation phenotypes. We identified 23 python orthologs of mammalian genes associated with variation in coat color phenotypes for which we assessed the extent of pairwise protein sequence identity between pythons and mouse, dog, horse, cow, chicken, anole lizard, and garter snake. We next identified a set of melanocyte/pigment associated transcription factors (CREB, FOXD3, LEF-1, MITF, POU3F2, and USF-1) that exhibit relatively conserved sequence similarity within their DNA binding regions across species based on orthologous alignments across multiple species. Finally, we identified 27 evolutionarily conserved clusters of transcription factor binding sites within ~200-nucleotide intervals of the 1500-nucleotide upstream regions of AIM1, DCT, MC1R, MITF, MLANA, OA1, PMEL, RAB27A, and TYR from Python bivittatus. Our results provide insight into pigment phenotypes in pythons.
Gangadhar, Baniekal H.; Sajeesh, Kappachery; Venkatesh, Jelli; Baskar, Venkidasamy; Abhinandan, Kumar; Yu, Jae W.; Prasad, Ram; Mishra, Raghvendra K.
2016-01-01
Abiotic stresses such as heat, drought, and salinity are major environmental constraints that limit potato (Solanum tuberosum L.) production worldwide. Previously, we found a potential thermo-tolerance gene, named StnsLTP1, from potato using yeast functional screening. Here, we report the functional characterization of StnsLTP1 and its role in multiple abiotic stresses in potato plants. Computational analysis of StnsLTP1 with other plant LTPs showed eight conserved cysteine residues, and four α-helices stabilized by four disulfide bridges. Expression analysis of the StnsLTP1 gene showed differential expression under heat, water-deficit and salt stresses. Transgenic potato lines over-expressing the StnsLTP1 gene displayed enhanced cell membrane integrity under stress conditions, as indicated by reduced membrane lipid peroxidation and hydrogen peroxide content relative to untransformed (UT) control plants. In addition, transgenic lines over-expressing StnsLTP1 also exhibited increased antioxidant enzyme activity with enhanced accumulation of ascorbates, and up-regulation of stress-related genes including StAPX, StCAT, StSOD, StHsfA3, StHSP70, and StsHSP20 compared with the UT plants. These results suggest that StnsLTP1 transgenic plants acquired improved tolerance to multiple abiotic stresses through enhanced activation of antioxidative defense mechanisms via cyclic scavenging of reactive oxygen species and regulated expression of stress-related genes. PMID:27597854
Stiebens, Victor A; Merino, Sonia E; Chain, Frédéric J J; Eizaguirre, Christophe
2013-04-30
In evolutionary and conservation biology, parasitism is often highlighted as a major selective pressure. To fight against parasites and pathogens, genetic diversity of the immune genes of the major histocompatibility complex (MHC) is particularly important. However, the extensive degree of polymorphism observed in these genes makes it difficult to conduct thorough population screenings. We utilized a genotyping protocol that uses 454 amplicon sequencing to characterize the MHC class I in the endangered loggerhead sea turtle (Caretta caretta) and to investigate their evolution at multiple relevant levels of organization. MHC class I genes revealed signatures of trans-species polymorphism across several reptile species. In the studied loggerhead turtle individuals, this results in the maintenance of two ancient allelic lineages. We also found that individuals carrying an intermediate number of MHC class I alleles are larger than those with either a low or high number of alleles. Multiple modes of evolution seem to maintain MHC diversity in the loggerhead turtles, with relatively high polymorphism for an endangered species.
Modise, David M.; Gemeildien, Junaid; Ndimba, Bongani K.; Christoffels, Alan
2018-01-01
Background Crop response to the changing climate and unpredictable effects of global warming with adverse conditions such as drought stress has brought concerns about food security to the fore; crop yield loss is a major cause of concern in this regard. Identification of genes with multiple responses across environmental stresses is the genetic foundation that leads to crop adaptation to environmental perturbations. Methods In this paper, we introduce an integrated approach to assess candidate genes for multiple stress responses across species. The approach combines ontology-based semantic data integration with expression profiling, comparative genomics, phylogenomics, functional gene enrichment and gene enrichment network analysis to identify genes associated with plant stress phenotypes. Five different ontologies, viz., Gene Ontology (GO), Trait Ontology (TO), Plant Ontology (PO), Growth Ontology (GRO) and Environment Ontology (EO) were used to semantically integrate drought-related information. Results Target genes linked to Quantitative Trait Loci (QTLs) controlling yield and stress tolerance in sorghum (Sorghum bicolor (L.) Moench) and closely related species were identified. Based on the enriched GO terms of the biological processes, 1116 sorghum genes with potential responses to 5 different stresses, such as drought (18%), salt (32%), cold (20%), heat (8%) and oxidative stress (25%) were identified to be over-expressed. Out of 169 sorghum drought-responsive QTL-associated genes that were identified based on expression datasets, 56% were shown to have multiple stress responses. On the other hand, out of 168 additional genes that have been evaluated for orthologous pairs, 90% were conserved across species for drought tolerance. Over 50% of identified maize and rice genes were responsive to drought and salt stresses and were co-located within multifunctional QTLs.
Among the total identified multi-stress responsive genes, 272 targets were shown to be co-localized within QTLs associated with different traits that are responsive to multiple stresses. Ontology mapping was used to validate the identified genes, while reconstruction of the phylogenetic tree was instrumental to infer the evolutionary relationship of the sorghum orthologs. The results also show specific genes responsible for various interrelated components of drought response mechanism such as drought tolerance, drought avoidance and drought escape. Conclusions We submit that this approach is novel and to our knowledge, has not been used previously in any other research; it enables us to perform cross-species queries for genes that are likely to be associated with multiple stress tolerance, as a means to identify novel targets for engineering stress resistance in sorghum and possibly, in other crop species. PMID:29590108
Repeat-Associated Plasticity in the Helicobacter pylori RD Gene Family
Shak, Joshua R.; Dick, Jonathan J.; Meinersmann, Richard J.; Perez-Perez, Guillermo I.; Blaser, Martin J.
2009-01-01
The bacterium Helicobacter pylori is remarkable for its ability to persist in the human stomach for decades without provoking sterilizing immunity. Since repetitive DNA can facilitate adaptive genomic flexibility via increased recombination, insertion, and deletion, we searched the genomes of two H. pylori strains for nucleotide repeats. We discovered a family of genes with extensive repetitive DNA that we have termed the H. pylori RD gene family. Each gene of this family is composed of a conserved 3′ region, a variable mid-region encoding 7 and 11 amino acid repeats, and a 5′ region containing one of two possible alleles. Analysis of five complete genome sequences and PCR genotyping of 42 H. pylori strains revealed extensive variation between strains in the number, location, and arrangement of RD genes. Furthermore, examination of multiple strains isolated from a single subject's stomach revealed intrahost variation in repeat number and composition. Despite prior evidence that the protein products of this gene family are expressed at the bacterial cell surface, enzyme-linked immunosorbent assay and immunoblot studies revealed no consistent seroreactivity to a recombinant RD protein by H. pylori-positive hosts. The pattern of repeats uncovered in the RD gene family appears to reflect slipped-strand mispairing or domain duplication, allowing for redundancy and subsequent diversity in genotype and phenotype. This novel family of hypervariable genes with conserved, repetitive, and allelic domains may represent an important locus for understanding H. pylori persistence in its natural host. PMID:19749042
Identification of differentially expressed genes and false discovery rate in microarray studies.
Gusnanto, Arief; Calza, Stefano; Pawitan, Yudi
2007-04-01
This review highlights developments in microarray data analysis for the identification of differentially expressed genes, particularly via control of the false discovery rate. The emergence of high-throughput technology such as microarrays raises two fundamental statistical issues: multiplicity and sensitivity. We focus on the biological problem of identifying differentially expressed genes. First, multiplicity arises due to testing tens of thousands of hypotheses, rendering the standard P value meaningless. Second, known optimal single-test procedures such as the t-test perform poorly in the context of highly multiple tests. The standard approach to dealing with multiplicity is too conservative in the microarray context. The false discovery rate concept is fast becoming the key statistical assessment tool replacing the P value. We review the false discovery rate approach and argue that it is more sensible for microarray data. We also discuss some methods to take into account additional information from the microarrays to improve the false discovery rate. There is growing consensus on how to analyse microarray data using the false discovery rate framework in place of the classical P value. Further research is needed on the preprocessing of the raw data, such as the normalization step and filtering, and on finding the most sensitive test procedure.
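The false discovery rate idea summarized above can be illustrated with a small sketch. It assumes the Benjamini-Hochberg step-up procedure, which the abstract does not name, and the function names `benjamini_hochberg` and `discoveries` are hypothetical helpers introduced only for this illustration.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values (q-values) in input order.

    Assumption: BH is used here as one common FDR-controlling procedure;
    the abstract itself does not specify a particular method.
    """
    m = len(p_values)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def discoveries(p_values, fdr=0.05):
    """Indices of hypotheses rejected when controlling the FDR at `fdr`."""
    q_values = benjamini_hochberg(p_values)
    return [i for i, q in enumerate(q_values) if q <= fdr]
```

For the four p-values 0.01, 0.04, 0.03 and 0.005, this sketch rejects all four hypotheses at an FDR of 0.05, whereas a Bonferroni cutoff of 0.05/4 = 0.0125 would keep only two, which is the sense in which the classical correction is more conservative.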
Arm-specific dynamics of chromosome evolution in malaria mosquitoes
2011-01-01
Background The malaria mosquito species of subgenus Cellia have rich inversion polymorphisms that correlate with environmental variables. Polymorphic inversions tend to cluster on the chromosomal arms 2R and 2L but not on X, 3R and 3L in Anopheles gambiae and homologous arms in other species. However, it is unknown whether polymorphic inversions on homologous chromosomal arms of distantly related species from subgenus Cellia nonrandomly share similar sets of genes. It is also unclear if the evolutionary breakage of inversion-poor chromosomal arms is under constraints. Results To gain a better understanding of the arm-specific differences in the rates of genome rearrangements, we compared gene orders and established syntenic relationships among Anopheles gambiae, Anopheles funestus, and Anopheles stephensi. We provided evidence that polymorphic inversions on the 2R arms in these three species nonrandomly captured similar sets of genes. This nonrandom distribution of genes was not only a result of preservation of ancestral gene order but also an outcome of extensive reshuffling of gene orders that created new combinations of homologous genes within independently originated polymorphic inversions. The statistical analysis of distribution of conserved gene orders demonstrated that the autosomal arms differ in their tolerance to generating evolutionary breakpoints. The fastest evolving 2R autosomal arm was enriched with gene blocks conserved between only a pair of species. In contrast, all identified syntenic blocks were preserved on the slowly evolving 3R arm of An. gambiae and on the homologous arms of An. funestus and An. stephensi. Conclusions Our results suggest that natural selection favors specific gene combinations within polymorphic inversions when distant species are exposed to similar environmental pressures. 
This knowledge could be useful for the discovery of genes responsible for an association of inversion polymorphisms with phenotypic variations in multiple species. Our data support the chromosomal arm specificity in rates of gene order disruption during mosquito evolution. We conclude that the distribution of breakpoint regions is evolutionarily conserved on slowly evolving arms and tends to be lineage-specific on rapidly evolving arms. PMID:21473772
The Evolution of the Secreted Regulatory Protein Progranulin
Palfree, Roger G. E.; Bennett, Hugh P. J.; Bateman, Andrew
2015-01-01
Progranulin is a secreted growth factor that is active in tumorigenesis, wound repair, and inflammation. Haploinsufficiency of the human progranulin gene, GRN, causes frontotemporal dementia. Progranulins are composed of chains of cysteine-rich granulin modules. Modules may be released from progranulin by proteolysis as 6 kDa granulin polypeptides. Both intact progranulin and some of the granulin polypeptides are biologically active. The granulin module occurs in certain plant proteases and progranulins are present in early diverging metazoan clades such as the sponges, indicating their ancient evolutionary origin. There is only one Grn gene in mammalian genomes. More gene-rich Grn families occur in teleost fish with between 3 and 6 members per species including short-form Grns that have no tetrapod counterparts. Our goals are to elucidate progranulin and granulin module evolution by investigating (i) the origins of metazoan progranulins; (ii) the evolutionary relationships between the single Grn of tetrapods and the multiple Grn genes of fish; (iii) the evolution of granulin module architectures of vertebrate progranulins; and (iv) the conservation of mammalian granulin polypeptide sequences and how the conserved granulin amino acid sequences map to the known three-dimensional structures of granulin modules. We report that progranulin-like proteins are present in unicellular eukaryotes that are closely related to metazoa, suggesting that progranulin is among the earliest extracellular regulatory proteins still employed by multicellular animals. From the genomes of the elephant shark and coelacanth we identified contemporary representatives of a precursor for short-form Grn genes of ray-finned fish that is lost in tetrapods. In vertebrate Grns, pathways of exon duplication resulted in a conserved module architecture at the amino-terminus that is frequently accompanied by an unusual pattern of tandem nearly identical module repeats near the carboxyl-terminus.
Polypeptide sequence conservation of mammalian granulin modules identified potential structure-activity relationships that may be informative in designing progranulin based therapeutics. PMID:26248158
Sinha, Amit; Langnick, Claudia; Sommer, Ralf J; Dieterich, Christoph
2014-09-01
Discovery of trans-splicing in multiple metazoan lineages led to the identification of operon-like gene organization in diverse organisms, including trypanosomes, tunicates, and nematodes, but the functional significance of such operons is not completely understood. To see whether the content or organization of operons serves similar roles across species, we experimentally defined operons in the nematode model Pristionchus pacificus. We performed affinity capture experiments on mRNA pools to specifically enrich for transcripts that are trans-spliced to either the SL1- or SL2-spliced leader, using spliced leader-specific probes. We obtained distinct trans-splicing patterns from the analysis of three mRNA pools (total mRNA, SL1 and SL2 fraction) by RNA-seq. This information was combined with a genome-wide analysis of gene orientation and spacing. We could confirm 2219 operons by RNA-seq data out of 6709 candidate operons, which were predicted by sequence information alone. Our gene order comparison of the Caenorhabditis elegans and P. pacificus genomes shows major changes in operon organization in the two species. Notably, only 128 out of 1288 operons in C. elegans are conserved in P. pacificus. However, analysis of gene-expression profiles identified conserved functions such as an enrichment of germline-expressed genes and higher expression levels of operonic genes during recovery from dauer arrest in both species. These results provide support for the model that a necessity for increased transcriptional efficiency in the context of certain developmental processes could be a selective constraint for operon evolution in metazoans. Our method is generally applicable to other metazoans to see if similar functional constraints regulate gene organization into operons. © 2014 Sinha et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
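The sequence-only prediction step mentioned above (gene orientation and spacing) can be sketched roughly as follows. This is an illustration under stated assumptions, not the authors' pipeline: the function name `candidate_operons`, the tuple layout, and the 1000-nucleotide `max_gap` threshold are all hypothetical choices made for the example.

```python
def candidate_operons(genes, max_gap=1000):
    """Group adjacent same-strand genes separated by short intergenic gaps.

    `genes` is a list of (name, strand, start, end) tuples for a single
    chromosome, sorted by start coordinate. `max_gap` is an assumed
    threshold, not a value taken from the study.
    """
    operons, current = [], []
    for gene in genes:
        if current:
            prev = current[-1]
            same_strand = gene[1] == prev[1]
            gap = gene[2] - prev[3]  # distance from previous end to this start
            if same_strand and gap <= max_gap:
                current.append(gene)
                continue
            # The run is broken: keep it only if it holds at least two genes.
            if len(current) >= 2:
                operons.append([g[0] for g in current])
        current = [gene]
    if len(current) >= 2:
        operons.append([g[0] for g in current])
    return operons
```

Candidates produced this way would still need experimental support, which in the study came from the SL1- and SL2-enriched RNA-seq fractions.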
Conservation of transcription factor binding events predicts gene expression across species
Hemberg, Martin; Kreiman, Gabriel
2011-01-01
Recent technological advances have made it possible to determine the genome-wide binding sites of transcription factors (TFs). Comparisons across species have suggested a relatively low degree of evolutionary conservation of experimentally defined TF binding events (TFBEs). Using binding data for six different TFs in hepatocytes and embryonic stem cells from human and mouse, we demonstrate that evolutionary conservation of TFBEs within orthologous proximal promoters is closely linked to function, defined as expression of the target genes. We show that (i) there is a significantly higher degree of conservation of TFBEs when the target gene is expressed in both species; (ii) there is increased conservation of binding events for groups of TFs compared to individual TFs; and (iii) conserved TFBEs have a greater impact on the expression of their target genes than non-conserved ones. These results link conservation of structural elements (TFBEs) to conservation of function (gene expression) and suggest a higher degree of functional conservation than implied by previous studies. PMID:21622661
Unity in defence: honeybee workers exhibit conserved molecular responses to diverse pathogens.
Doublet, Vincent; Poeschl, Yvonne; Gogol-Döring, Andreas; Alaux, Cédric; Annoscia, Desiderato; Aurori, Christian; Barribeau, Seth M; Bedoya-Reina, Oscar C; Brown, Mark J F; Bull, James C; Flenniken, Michelle L; Galbraith, David A; Genersch, Elke; Gisder, Sebastian; Grosse, Ivo; Holt, Holly L; Hultmark, Dan; Lattorff, H Michael G; Le Conte, Yves; Manfredini, Fabio; McMahon, Dino P; Moritz, Robin F A; Nazzi, Francesco; Niño, Elina L; Nowick, Katja; van Rij, Ronald P; Paxton, Robert J; Grozinger, Christina M
2017-03-02
Organisms typically face infection by diverse pathogens, and hosts are thought to have developed specific responses to each type of pathogen they encounter. The advent of transcriptomics now makes it possible to test this hypothesis and compare host gene expression responses to multiple pathogens at a genome-wide scale. Here, we performed a meta-analysis of multiple published and new transcriptomes using a newly developed bioinformatics approach that filters genes based on their expression profile across datasets. Thereby, we identified common and unique molecular responses of a model host species, the honey bee (Apis mellifera), to its major pathogens and parasites: the Microsporidia Nosema apis and Nosema ceranae, RNA viruses, and the ectoparasitic mite Varroa destructor, which transmits viruses. We identified a common suite of genes and conserved molecular pathways that respond to all investigated pathogens, a result that suggests a commonality in response mechanisms to diverse pathogens. We found that genes differentially expressed after infection exhibit a higher evolutionary rate than non-differentially expressed genes. Using our new bioinformatics approach, we unveiled additional pathogen-specific responses of honey bees; we found that apoptosis appeared to be an important response following microsporidian infection, while genes from the immune signalling pathways, Toll and Imd, were differentially expressed after Varroa/virus infection. Finally, we applied our bioinformatics approach and generated a gene co-expression network to identify highly connected (hub) genes that may represent important mediators and regulators of anti-pathogen responses. Our meta-analysis generated a comprehensive overview of the host metabolic and other biological processes that mediate interactions between insects and their pathogens. 
We identified key host genes and pathways that respond to phylogenetically diverse pathogens, representing an important source for future functional studies as well as offering new routes to identify or generate pathogen resilient honey bee stocks. The statistical and bioinformatics approaches that were developed for this study are broadly applicable to synthesize information across transcriptomic datasets. These approaches will likely have utility in addressing a variety of biological questions.
Transcription co-activator SAYP mediates the action of STAT activator
Panov, Vladislav V.; Kuzmina, Julia L.; Doronin, Semen A.; Kopantseva, Marina R.; Nabirochkina, Elena N.; Georgieva, Sofia G.; Vorobyeva, Nadezhda E.; Shidlovskii, Yulii V.
2012-01-01
Jak/STAT is an important signaling pathway mediating multiple events in development. We describe participation of metazoan co-activator SAYP/PHF10 in this pathway downstream of STAT. The latter, via its activation domain, interacts with the conserved core of SAYP. STAT is associated with the SAYP-containing co-activator complex BTFly and recruits BTFly onto genes. SAYP is necessary for stimulating STAT-driven transcription of numerous genes. Mutation of SAYP leads to maldevelopments similar to those observed in STAT mutants. Thus, SAYP is a novel co-activator mediating the action of STAT. PMID:22123744
Natural allelic variation of the AZI1 gene controls root growth under zinc-limiting condition
Bouain, Nadia; Saenchai, Chorpet
2018-01-01
Zinc is an essential micronutrient for all living organisms and is involved in a plethora of processes including growth and development, and immunity. However, it is unknown if there is a common genetic and molecular basis underlying multiple facets of zinc function. Here we used natural variation in Arabidopsis thaliana to study the role of zinc in regulating growth. We identify allelic variation of the systemic immunity gene AZI1 as a key for determining root growth responses to low zinc conditions. We further demonstrate that this gene is important for modulating primary root length depending on the zinc and defence status. Finally, we show that the interaction of the immunity signal azelaic acid and zinc level to regulate root growth is conserved in rice. This work demonstrates that there is a common genetic and molecular basis for multiple zinc dependent processes and that nutrient cues can determine the balance of growth and immune responses in plants. PMID:29608565
Lan, Hong; Chen, Hui; Chen, Li-Cheng; Wang, Bei-Bing; Sun, Li; Ma, Mei-Ying; Fang, Sheng-Guo; Wan, Qiu-Hong
2014-01-01
Defensins play a key role in the innate immunity of various organisms. Detailed genomic studies of the defensin cluster have only been reported in a limited number of birds. Herein, we present the first characterization of defensins in a Pelecaniformes species, the crested ibis (Nipponia nippon), which is one of the most endangered birds in the world. We constructed bacterial artificial chromosome libraries, including a 4D-PCR library and a reverse-4D library, which provide at least 40 equivalents of this rare bird's genome. A cluster including 14 β-defensin loci within 129 kb was assigned to chromosome 3 by FISH, and one gene duplication of AvBD1 was found. The ibis defensin genes are characterized by multiform gene organization ranging from two to four exons through extensive exon fusion. Splicing signal variations and alternative splice variants were also found. Comparative analysis of four bird species identified one common and multiple species-specific duplications, which might be associated with high GC content. Evolutionary analysis revealed birth-and-death mode and purifying selection for avian defensin evolution, resulting in different defensin gene numbers among bird species and functional conservation within orthologous genes, respectively. Additionally, we propose various directions for further research on genetic conservation in the crested ibis. PMID:25372018
2010-01-01
Background Multiple sequence alignments are used to study gene or protein function, phylogenetic relations, genome evolution hypotheses and even gene polymorphisms. Virtually without exception, all available tools focus on conserved segments or residues. Small divergent regions, however, are biologically important for specific quantitative polymerase chain reaction, genotyping, molecular markers and preparation of specific antibodies, and yet have received little attention. As a consequence, they must be selected empirically by the researcher. AlignMiner has been developed to fill this gap in bioinformatic analyses. Results AlignMiner is a Web-based application for detection of conserved and divergent regions in alignments of conserved sequences, focusing particularly on divergence. It accepts alignments (protein or nucleic acid) obtained using any of a variety of algorithms, which does not appear to have a significant impact on the final results. AlignMiner uses different scoring methods for assessing conserved/divergent regions, Entropy being the method that provides the highest number of regions with the greatest length, and Weighted being the most restrictive. Conserved/divergent regions can be generated either with respect to the consensus sequence or to one master sequence. The resulting data are presented in a graphical interface developed in AJAX, which provides remarkable user interaction capabilities. Users do not need to wait until execution is complete and can even inspect their results on a different computer. Data can be downloaded onto a user disk, in standard formats. In silico and experimental proof-of-concept cases have shown that AlignMiner can be successfully used to design specific polymerase chain reaction primers as well as potential epitopes for antibodies. Primer design is assisted by a module that deploys several oligonucleotide parameters for designing primers "on the fly".
Conclusions AlignMiner can be used to reliably detect divergent regions via several scoring methods that provide different levels of selectivity. Its predictions have been verified by experimental means. Hence, it is expected that its usage will save researchers' time and ensure an objective selection of the best-possible divergent region when closely related sequences are analysed. AlignMiner is freely available at http://www.scbi.uma.es/alignminer. PMID:20525162
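The abstract above names an entropy-based score for locating divergent regions in an alignment. The following is a minimal sketch of that general idea, not AlignMiner's actual implementation: it computes the Shannon entropy of each alignment column and reports runs of high-entropy columns. The function names, the threshold of 0.5 bits, the minimum run length and the toy alignment are all assumptions made for illustration.

```python
import math
from collections import Counter


def column_entropy(column):
    """Shannon entropy (in bits) of one alignment column.

    Gap characters, if present, are simply counted as another symbol
    (an assumption of this sketch).
    """
    counts = Counter(column)
    total = len(column)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def divergent_regions(alignment, threshold=0.5, min_length=2):
    """Return half-open (start, end) column ranges whose entropy exceeds threshold."""
    length = len(alignment[0])
    scores = [column_entropy([seq[i] for seq in alignment]) for i in range(length)]
    regions, start = [], None
    for i, score in enumerate(scores):
        if score > threshold:
            if start is None:
                start = i  # open a new divergent run
        else:
            if start is not None and i - start >= min_length:
                regions.append((start, i))
            start = None
    # Close a run that extends to the end of the alignment.
    if start is not None and length - start >= min_length:
        regions.append((start, length))
    return regions


# Hypothetical toy alignment: four equal-length, pre-aligned sequences.
alignment = ["ACGTACGT", "ACGTTCGT", "ACGAGCGT", "ACGCCCGT"]
regions = divergent_regions(alignment)
print(regions)
```

In this toy alignment only columns 3 and 4 vary, so a single divergent region covering those two columns is reported. A stricter scoring scheme would correspond to a higher threshold or a longer minimum run.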
The drug target genes show higher evolutionary conservation than non-target genes.
Lv, Wenhua; Xu, Yongdeng; Guo, Yiying; Yu, Ziqi; Feng, Guanglong; Liu, Panpan; Luan, Meiwei; Zhu, Hongjie; Liu, Guiyou; Zhang, Mingming; Lv, Hongchao; Duan, Lian; Shang, Zhenwei; Li, Jin; Jiang, Yongshuai; Zhang, Ruijie
2016-01-26
Although evidence indicates that drug target genes share some common evolutionary features, few studies have analyzed the evolutionary features of drug targets as a whole. Therefore, we conducted an analysis which aimed to investigate the evolutionary characteristics of drug target genes. We compared the evolutionary conservation between human drug target genes and non-target genes by combining both the evolutionary features and network topological properties in the human protein-protein interaction network. The evolution rate, conservation score and the percentage of orthologous genes of 21 species were included in our study. Meanwhile, four topological features including the average shortest path length, betweenness centrality, clustering coefficient and degree were considered for comparison analysis. We obtained four results. Compared with non-drug target genes, 1) drug target genes had lower evolutionary rates; 2) drug target genes had higher conservation scores; 3) drug target genes had higher percentages of orthologous genes and 4) drug target genes had a tighter network structure including higher degrees, betweenness centrality, clustering coefficients and lower average shortest path lengths. These results demonstrate that drug target genes are more evolutionarily conserved than non-drug target genes. We hope that our study will provide valuable information for other researchers who are interested in evolutionary conservation of drug targets.
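Three of the four topological features listed in this abstract (degree, clustering coefficient and average shortest path length) can be illustrated on a small undirected graph. The sketch below uses only the Python standard library. The five-node network and its node names are hypothetical, the functions are not taken from the study, and betweenness centrality is omitted for brevity.

```python
from collections import deque


def degree(graph, node):
    """Number of direct interaction partners of a node."""
    return len(graph[node])


def clustering_coefficient(graph, node):
    """Fraction of pairs of neighbours that are themselves connected."""
    neighbours = list(graph[node])
    k = len(neighbours)
    if k < 2:
        return 0.0
    links = sum(
        1
        for i in range(k)
        for j in range(i + 1, k)
        if neighbours[j] in graph[neighbours[i]]
    )
    return 2.0 * links / (k * (k - 1))


def average_shortest_path_length(graph, source):
    """Mean breadth-first-search distance from source to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in graph[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    others = [d for n, d in dist.items() if n != source]
    return sum(others) / len(others) if others else 0.0


# Hypothetical undirected interaction network, stored as adjacency sets.
graph = {
    "A": {"B", "C", "D"},
    "B": {"A", "C"},
    "C": {"A", "B"},
    "D": {"A", "E"},
    "E": {"D"},
}

for node in sorted(graph):
    print(node, degree(graph, node),
          round(clustering_coefficient(graph, node), 3),
          average_shortest_path_length(graph, node))
```

Here the hub node A has the highest degree and the lowest average shortest path length, which is the pattern the abstract describes as a "tighter network structure" for drug target genes.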
DLGP: A database for lineage-conserved and lineage-specific gene pairs in animal and plant genomes.
Wang, Dapeng
2016-01-15
The conservation of gene organization in the genome with lineage-specificity is an invaluable resource to decipher their potential functionality with diverse selective constraints, especially in higher animals and plants. Gene pairs appear to be the minimal structure for such gene clusters that tend to reside in their preferred locations, representing the distinctive genomic characteristics in single species or a given lineage. Although gene families have been widely investigated, the definition of gene pair families in various taxa still lacks adequate attention. To address this issue, we report DLGP (http://lcgbase.big.ac.cn/DLGP/), which stores the pre-calculated lineage-based gene pairs in 134 currently available animal and plant genomes and inspects them under the same analytical framework, bringing out a set of innovative features. First, the taxonomy or lineage has been classified into four levels: Kingdom, Phylum, Class and Order. It adopts an all-to-all comparison strategy to identify the possible conserved gene pairs in all species for each gene pair in a certain species and reckons those that are conserved in over a significant proportion of species in a given lineage (e.g. Primates, Diptera or Poales) as the lineage-conserved gene pairs. Furthermore, it predicts the lineage-specific gene pairs by retaining the above-mentioned lineage-conserved gene pairs that are not conserved in any other lineages. Second, it carries out pairwise comparison for the gene pairs between two compared species and creates a table including all the conserved gene pairs and an image showing the degree of conservation of gene pairs at the chromosomal level. Third, it supplies a gene order browser to extend gene pairs to gene clusters, allowing users to view the evolutionary dynamics in the gene context in an intuitive manner.
This database will be able to facilitate the particular comparison between animals and plants, between vertebrates and arthropods, and between monocots and eudicots, accounting for the significant contribution of gene pairs to speciation and diversification in specific lineages. Copyright © 2015 Elsevier Inc. All rights reserved.
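The two-step classification described in this abstract (lineage-conserved pairs first, then lineage-specific pairs as those not conserved in any other lineage) can be sketched as set operations. This is an illustrative reading of the abstract, not DLGP's code. The abstract does not state what "a significant proportion of species" is, so the `min_fraction` cutoff of 0.5 is an assumption, as are the function names, species and gene pair identifiers.

```python
def lineage_conserved(pair_presence, lineage_species, min_fraction=0.5):
    """Gene pairs found in more than min_fraction of the species of a lineage."""
    conserved = set()
    for pair, species in pair_presence.items():
        hits = len(species & lineage_species)
        if hits / len(lineage_species) > min_fraction:
            conserved.add(pair)
    return conserved


def lineage_specific(pair_presence, lineages, target, min_fraction=0.5):
    """Lineage-conserved pairs of target that are not conserved in any other lineage."""
    specific = lineage_conserved(pair_presence, lineages[target], min_fraction)
    for name, members in lineages.items():
        if name != target:
            specific -= lineage_conserved(pair_presence, members, min_fraction)
    return specific


# Hypothetical lineages and the species in which each adjacent gene pair occurs.
lineages = {
    "Primates": {"human", "chimp", "macaque"},
    "Diptera": {"fly", "mosquito"},
}
pair_presence = {
    ("geneA", "geneB"): {"human", "chimp", "macaque", "fly", "mosquito"},
    ("geneC", "geneD"): {"human", "chimp"},
    ("geneE", "geneF"): {"fly"},
}

primate_conserved = lineage_conserved(pair_presence, lineages["Primates"])
primate_specific = lineage_specific(pair_presence, lineages, "Primates")
diptera_specific = lineage_specific(pair_presence, lineages, "Diptera")
print(primate_conserved, primate_specific, diptera_specific)
```

In this toy data the geneA/geneB pair is conserved in both lineages and therefore specific to neither, while geneC/geneD is conserved only in Primates and is retained as Primates-specific.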
Gene context conservation of a higher order than operons.
Lathe, W C; Snel, B; Bork, P
2000-10-01
Operons, co-transcribed and co-regulated contiguous sets of genes, are poorly conserved over short periods of evolutionary time. The gene order, gene content and regulatory mechanisms of operons can be very different, even in closely related species. Here, we present several lines of evidence which suggest that, although an operon and its individual genes and regulatory structures are rearranged when comparing the genomes of different species, this rearrangement is a conservative process. Genomic rearrangements invariably maintain individual genes in very specific functional and regulatory contexts. We call this conserved context an uber-operon.
2012-01-01
Background Coffee trees (Rubiaceae) and tomato (Solanaceae) belong to the Asterid clade, while grapevine (Vitaceae) belongs to the Rosid clade. Coffee and tomato separated from grapevine 125 million years ago, while coffee and tomato diverged 83-89 million years ago. These long periods of divergent evolution should have permitted the genomes to reorganize significantly. So far, very few comparative mappings have been performed between very distantly related species belonging to different clades. We report the first multiple comparison between species from Asterid and Rosid clades, to examine both macro- and microsynteny relationships. Results Thanks to a set of 867 COSII markers, macrosynteny was detected between coffee, tomato and grapevine. While coffee and tomato genomes share 318 orthologous markers and 27 conserved syntenic segments (CSSs), coffee and grapevine also share a similar number of syntenic markers and CSSs: 299 and 29 respectively. Despite large genome macrostructure reorganization, several large chromosome segments showed outstanding macrosynteny, shedding new light on chromosome evolution between Asterids and Rosids. We also analyzed a sequence of 174 kb containing the ovate gene, conserved in a syntenic block between coffee, tomato and grapevine that showed a high level of microstructure conservation. A higher level of conservation was observed between coffee and grapevine, both woody and long life-cycle plants, than between coffee and tomato. Out of 16 coffee genes of this syntenic segment, 7 and 14 showed complete synteny between coffee and tomato or grapevine, respectively. Conclusions These results show that significant conservation is found between distantly related species from the Asterid (Coffea canephora and Solanum sp.) and Rosid (Vitis vinifera) clades, at the genome macrostructure and microstructure levels.
At the ovate locus, conservation did not decline in relation to increasing phylogenetic distance, suggesting that the time factor alone does not explain divergences. Our results are considerably useful for syntenic studies between supposedly remote species for the isolation of important genes for agronomy. PMID:22433423
Oliva, Carlos; Molina-Fernandez, Claudia; Maureira, Miguel; Candia, Noemi; López, Estefanía; Hassan, Bassem; Aerts, Stein; Cánovas, José; Olguín, Patricio; Sierralta, Jimena
2015-09-01
During axon targeting, a stereotyped pattern of connectivity is achieved by the integration of intrinsic genetic programs and the response to extrinsic long and short-range directional cues. How this coordination occurs is the subject of intense study. Transcription factors play a central role due to their ability to regulate the expression of multiple genes required to sense and respond to these cues during development. Here we show that the transcription factor HNT regulates layer-specific photoreceptor axon targeting in Drosophila through transcriptional control of jbug/Filamin and multiple genes involved in axon guidance and cytoskeleton organization. Using a microarray analysis we identified 235 genes whose expression levels were changed by HNT overexpression in the eye primordia. We analyzed nine candidate genes involved in cytoskeleton regulation and axon guidance, six of which displayed significantly altered gene expression levels in hnt mutant retinas. Functional analysis confirmed the role of OTK/PTK7 in photoreceptor axon targeting and uncovered Tiggrin, an integrin ligand, and Jbug/Filamin, a conserved actin-binding protein, as new factors that participate in photoreceptor axon targeting. Moreover, we provided in silico and molecular evidence that supports jbug/Filamin as a direct transcriptional target of HNT and that HNT acts partially through Jbug/Filamin in vivo to regulate axon guidance. Our work broadens the understanding of how HNT regulates the coordinated expression of a group of genes to achieve the correct connectivity pattern in the Drosophila visual system. © 2015 Wiley Periodicals, Inc. Develop Neurobiol 75: 1018-1032, 2015.
Comparative Microbial Modules Resource: Generation and Visualization of Multi-species Biclusters
Bate, Ashley; Eichenberger, Patrick; Bonneau, Richard
2011-01-01
The increasing abundance of large-scale, high-throughput datasets for many closely related organisms provides opportunities for comparative analysis via the simultaneous biclustering of datasets from multiple species. These analyses require a reformulation of how to organize multi-species datasets and visualize comparative genomics data analyses results. Recently, we developed a method, multi-species cMonkey, which integrates heterogeneous high-throughput datatypes from multiple species to identify conserved regulatory modules. Here we present an integrated data visualization system, built upon the Gaggle, enabling exploration of our method's results (available at http://meatwad.bio.nyu.edu/cmmr.html). The system can also be used to explore other comparative genomics datasets and outputs from other data analysis procedures – results from other multiple-species clustering programs or from independent clustering of different single-species datasets. We provide an example use of our system for two bacteria, Escherichia coli and Salmonella Typhimurium. We illustrate the use of our system by exploring conserved biclusters involved in nitrogen metabolism, uncovering a putative function for yjjI, a currently uncharacterized gene that we predict to be involved in nitrogen assimilation. PMID:22144874
Comparative microbial modules resource: generation and visualization of multi-species biclusters.
Kacmarczyk, Thadeous; Waltman, Peter; Bate, Ashley; Eichenberger, Patrick; Bonneau, Richard
2011-12-01
The increasing abundance of large-scale, high-throughput datasets for many closely related organisms provides opportunities for comparative analysis via the simultaneous biclustering of datasets from multiple species. These analyses require a reformulation of how to organize multi-species datasets and visualize comparative genomics data analyses results. Recently, we developed a method, multi-species cMonkey, which integrates heterogeneous high-throughput datatypes from multiple species to identify conserved regulatory modules. Here we present an integrated data visualization system, built upon the Gaggle, enabling exploration of our method's results (available at http://meatwad.bio.nyu.edu/cmmr.html). The system can also be used to explore other comparative genomics datasets and outputs from other data analysis procedures - results from other multiple-species clustering programs or from independent clustering of different single-species datasets. We provide an example use of our system for two bacteria, Escherichia coli and Salmonella Typhimurium. We illustrate the use of our system by exploring conserved biclusters involved in nitrogen metabolism, uncovering a putative function for yjjI, a currently uncharacterized gene that we predict to be involved in nitrogen assimilation. © 2011 Kacmarczyk et al.
Jenkins, Adam M; Waterhouse, Robert M; Muskavitch, Marc A T
2015-04-23
Long non-coding RNAs (lncRNAs) have been defined as mRNA-like transcripts longer than 200 nucleotides that lack significant protein-coding potential, and many of them constitute scaffolds for ribonucleoprotein complexes with critical roles in epigenetic regulation. Various lncRNAs have been implicated in the modulation of chromatin structure, transcriptional and post-transcriptional gene regulation, and regulation of genomic stability in mammals, Caenorhabditis elegans, and Drosophila melanogaster. The purpose of this study is to identify the lncRNA landscape in the malaria vector An. gambiae and assess the evolutionary conservation of lncRNAs and their secondary structures across the Anopheles genus. Using deep RNA sequencing of multiple Anopheles gambiae life stages, we have identified 2,949 lncRNAs and more than 300 previously unannotated putative protein-coding genes. The lncRNAs exhibit differential expression profiles across life stages and adult genders. We find that across the genus Anopheles, lncRNAs display much lower sequence conservation than protein-coding genes. Additionally, we find that lncRNA secondary structure is highly conserved within the Gambiae complex, but diverges rapidly across the rest of the genus Anopheles. This study offers one of the first lncRNA secondary structure analyses in vector insects. Our description of lncRNAs in An. gambiae offers the most comprehensive genome-wide insights to date into lncRNAs in this vector mosquito, and defines a set of potential targets for the development of vector-based interventions that may further curb the human malaria burden in disease-endemic countries.
Comparative and evolutionary studies of vertebrate ALDH1A-like genes and proteins.
Holmes, Roger S
2015-06-05
Vertebrate ALDH1A-like genes encode cytosolic enzymes capable of metabolizing all-trans-retinaldehyde to retinoic acid which is a molecular 'signal' guiding vertebrate development and adipogenesis. Bioinformatic analyses of vertebrate and invertebrate genomes were undertaken using known ALDH1A1, ALDH1A2 and ALDH1A3 amino acid sequences. Comparative analyses of the corresponding human genes provided evidence for distinct modes of gene regulation and expression with putative transcription factor binding sites (TFBS), CpG islands and micro-RNA binding sites identified for the human genes. ALDH1A-like sequences were identified for all mammalian, bird, lizard and frog genomes examined, whereas fish genomes displayed a more restricted distribution pattern for ALDH1A1 and ALDH1A3 genes. The ALDH1A1 gene was absent in many bony fish genomes examined, with the ALDH1A3 gene also absent in the medaka and tilapia genomes. Multiple ALDH1A1-like genes were identified in mouse, rat and marsupial genomes. Vertebrate ALDH1A1, ALDH1A2 and ALDH1A3 subunit sequences were highly conserved throughout vertebrate evolution. Comparative amino acid substitution rates showed that mammalian ALDH1A2 sequences were more highly conserved than for the ALDH1A1 and ALDH1A3 sequences. Phylogenetic studies supported an hypothesis for ALDH1A2 as a likely primordial gene originating in invertebrate genomes and undergoing sequential gene duplication to generate two additional genes, ALDH1A1 and ALDH1A3, in most vertebrate genomes. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Multi-functional regulation of 4E-BP gene expression by the Ccr4-Not complex.
Okada, Hirokazu; Schittenhelm, Ralf B; Straessle, Anna; Hafen, Ernst
2015-01-01
The mechanistic target of rapamycin (mTOR) signaling pathway is highly conserved from yeast to humans. It senses various environmental cues to regulate cellular growth and homeostasis. Deregulation of the pathway has been implicated in many pathological conditions including cancer. Phosphorylation cascades through the pathway have been extensively studied but not much is known about the regulation of gene expression of the pathway components. Here, we report that the mRNA level of eukaryotic translation initiation factor (eIF) subunit 4E-binding protein (4E-BP) gene, one of the key mTOR signaling components, is regulated by the highly conserved Ccr4-Not complex. RNAi knockdown of Not1, a putative scaffold protein of this protein complex, increases the mRNA level of 4E-BP in Drosophila Kc cells. Examination of the gene expression mechanism using reporter swap constructs reveals that Not1 depletion increases reporter mRNAs with the 3'UTR of 4E-BP gene, but decreases the ones with the 4E-BP promoter region, suggesting that Ccr4-Not complex regulates both degradation and transcription of 4E-BP mRNA. These results indicate that the Ccr4-Not complex controls expression of a single gene at multiple levels and adjusts the magnitude of the total effect. Thus, our study reveals a novel regulatory mechanism of a key component of the mTOR signaling pathway at the level of gene expression.
Multiple levels of redundant processes inhibit Caenorhabditis elegans vulval cell fates.
Andersen, Erik C; Saffer, Adam M; Horvitz, H Robert
2008-08-01
Many mutations cause obvious abnormalities only when combined with other mutations. Such synthetic interactions can be the result of redundant gene functions. In Caenorhabditis elegans, the synthetic multivulva (synMuv) genes have been grouped into multiple classes that redundantly inhibit vulval cell fates. Animals with one or more mutations of the same class undergo wild-type vulval development, whereas animals with mutations of any two classes have a multivulva phenotype. By varying temperature and genetic background, we determined that mutations in most synMuv genes within a single synMuv class enhance each other. However, in a few cases no enhancement was observed. For example, mutations that affect an Mi2 homolog and a histone methyltransferase are of the same class and do not show enhancement. We suggest that such sets of genes function together in vivo and in at least some cases encode proteins that interact physically. The approach of genetic enhancement can be applied more broadly to identify potential protein complexes as well as redundant processes or pathways. Many synMuv genes are evolutionarily conserved, and the genetic relationships we have identified might define the functions not only of synMuv genes in C. elegans but also of their homologs in other organisms.
Multiple Levels of Redundant Processes Inhibit Caenorhabditis elegans Vulval Cell Fates
Andersen, Erik C.; Saffer, Adam M.; Horvitz, H. Robert
2008-01-01
Many mutations cause obvious abnormalities only when combined with other mutations. Such synthetic interactions can be the result of redundant gene functions. In Caenorhabditis elegans, the synthetic multivulva (synMuv) genes have been grouped into multiple classes that redundantly inhibit vulval cell fates. Animals with one or more mutations of the same class undergo wild-type vulval development, whereas animals with mutations of any two classes have a multivulva phenotype. By varying temperature and genetic background, we determined that mutations in most synMuv genes within a single synMuv class enhance each other. However, in a few cases no enhancement was observed. For example, mutations that affect an Mi2 homolog and a histone methyltransferase are of the same class and do not show enhancement. We suggest that such sets of genes function together in vivo and in at least some cases encode proteins that interact physically. The approach of genetic enhancement can be applied more broadly to identify potential protein complexes as well as redundant processes or pathways. Many synMuv genes are evolutionarily conserved, and the genetic relationships we have identified might define the functions not only of synMuv genes in C. elegans but also of their homologs in other organisms. PMID:18689876
Circular DNA Intermediate in the Duplication of Nile Tilapia vasa Genes
Fujimura, Koji; Conte, Matthew A.; Kocher, Thomas D.
2011-01-01
vasa is a highly conserved RNA helicase involved in animal germ cell development. Among vertebrate species, it is typically present as a single copy per genome. Here we report the isolation and sequencing of BAC clones for Nile tilapia vasa genes. Contrary to a previous report that Nile tilapia have a single copy of the vasa gene, we find evidence for at least three vasa gene loci. The vasa gene locus was duplicated from the original site and integrated into two distant novel sites. For one of these insertions we find evidence that the duplication was mediated by a circular DNA intermediate. This mechanism of gene duplication may explain the origin of isolated gene duplicates during the evolution of fish genomes. These data provide a foundation for studying the role of multiple vasa genes in the development of tilapia gonads, and will contribute to investigations of the molecular mechanisms of sex determination and evolution in cichlid fishes. PMID:22216289
Kashuk, Carl S.; Stone, Eric A.; Grice, Elizabeth A.; Portnoy, Matthew E.; Green, Eric D.; Sidow, Arend; Chakravarti, Aravinda; McCallion, Andrew S.
2005-01-01
The ability to discriminate between deleterious and neutral amino acid substitutions in the genes of patients remains a significant challenge in human genetics. The increasing availability of genomic sequence data from multiple vertebrate species allows inclusion of sequence conservation and physicochemical properties of residues to be used for functional prediction. In this study, the RET receptor tyrosine kinase serves as a model disease gene in which a broad spectrum (≥116) of disease-associated mutations has been identified among patients with Hirschsprung disease and multiple endocrine neoplasia type 2. We report the alignment of the human RET protein sequence with the orthologous sequences of 12 non-human vertebrates (eight mammalian, one avian, and three teleost species), their comparative analysis, the evolutionary topology of the RET protein, and predicted tolerance for all published missense mutations. We show that, although evolutionary conservation alone provides significant information to predict the effect of a RET mutation, a model that combines comparative sequence data with analysis of physiochemical properties in a quantitative framework provides far greater accuracy. Although the ability to discern the impact of a mutation is imperfect, our analyses permit substantial discrimination between predicted functional classes of RET mutations and disease severity even for a multigenic disease such as Hirschsprung disease. PMID:15956201
Han, Wei; Zou, Jianmin; Wang, Kehua; Su, Yijun; Zhu, Yunfen; Song, Chi; Li, Guohui; Qu, Liang; Zhang, Huiyong; Liu, Honglin
2015-01-01
Onset of rapid gonad growth is a milestone in sexual development that involves many genes and regulatory factors. Observations in model organisms and mammals including humans have shown a potential link between miRNAs and developmental timing. To determine whether miRNAs play roles in this process in the chicken (Gallus gallus), Solexa deep sequencing was performed to analyze the profiles of miRNA expression in the hypothalamus of hens from two different pubertal stages, before onset of the rapid gonad development (BO) and after onset of the rapid gonad development (AO). 374 conserved and 46 novel miRNAs were identified as hypothalamus-expressed miRNAs in the chicken. 144 conserved miRNAs were shown to be differentially expressed (reads > 10, P < 0.05) during the transition from BO to AO. Five differentially expressed miRNAs were validated by the real-time quantitative RT-PCR (qRT-PCR) method. 2013 putative genes were predicted as the targets of the 15 most differentially expressed miRNAs (fold-change > 4.0, P < 0.01). Of these genes, 7 putative circadian clock genes, Per2, Bmal1/2, Clock, Cry1/2, and Star were found to be targeted multiple times by the miRNAs. qRT-PCR revealed that the basal transcription levels of these clock genes were much higher (P < 0.01) in AO than in BO. Further functional analysis suggested that these 15 miRNAs play important roles in transcriptional regulation and signal transduction pathways. The results provide new insights into miRNA functions in timing the rapid development of chicken gonads. Considering the functional conservation of miRNAs, the results will contribute to research on puberty onset in humans.
Gushchina, Liubov V; Kwiatkowski, Thomas A; Bhattacharya, Sayak; Weisleder, Noah L
2018-05-01
The tripartite motif (TRIM) gene family is a highly conserved group of E3 ubiquitin ligase proteins that can establish substrate specificity for the ubiquitin-proteasome complex and also have proteasome-independent functions. While several family members were studied previously, it is relatively recent that over 80 genes, based on sequence homology, were grouped to establish the TRIM gene family. Functional studies of various TRIM genes linked these proteins to modulation of inflammatory responses showing that they can contribute to a wide variety of disease states including cardiovascular, neurological and musculoskeletal diseases, as well as various forms of cancer. Given the fundamental role of the ubiquitin-proteasome complex in protein turnover and the importance of this regulation in most aspects of cellular physiology, it is not surprising that TRIM proteins display a wide spectrum of functions in a variety of cellular processes. This broad range of function and the highly conserved primary amino acid sequence of family members, particularly in the canonical TRIM E3 ubiquitin ligase domain, complicates the development of therapeutics that specifically target these proteins. A more comprehensive understanding of the structure and function of TRIM proteins will help guide therapeutic development for a number of different diseases. This review summarizes the structural organization of TRIM proteins, their domain architecture, common and unique post-translational modifications within the family, and potential binding partners and targets. Further discussion is provided on efforts to target TRIM proteins as therapeutic agents and how our increasing understanding of the nature of TRIM proteins can guide discovery of other therapeutics in the future. Copyright © 2017 Elsevier Inc. All rights reserved.
Molecular cloning and expression analysis of WRKY transcription factor genes in Salvia miltiorrhiza.
Li, Caili; Li, Dongqiao; Shao, Fenjuan; Lu, Shanfa
2015-03-17
WRKY proteins comprise a large family of transcription factors and play important regulatory roles in plant development and defense response. The WRKY gene family in Salvia miltiorrhiza has not been characterized. A total of 61 SmWRKYs were cloned from S. miltiorrhiza. Multiple sequence alignment showed that SmWRKYs could be classified into 3 groups and 8 subgroups. Sequence features, the WRKY domain and other motifs of SmWRKYs are largely conserved with Arabidopsis AtWRKYs. Each group of WRKY domains contains characteristic conserved sequences, and group-specific motifs might contribute to functional divergence of WRKYs. A total of 17 pairs of orthologous SmWRKY and AtWRKY genes and 21 pairs of paralogous SmWRKY genes were identified. Maximum likelihood analysis showed that SmWRKYs had undergone strong selective pressure for adaptive evolution. Functional divergence analysis suggested that the SmWRKY subgroup genes and many paralogous SmWRKY gene pairs were divergent in functions. Various critical amino acids contributing to functional divergence among subgroups were detected. Of the 61 SmWRKYs, 22, 13, 4 and 1 were predominantly expressed in roots, stems, leaves, and flowers, respectively. The other 21 were mainly expressed in at least two tissues analyzed. In S. miltiorrhiza roots treated with MeJA, significant changes of gene expression were observed for 49 SmWRKYs, of which 26 were up-regulated, 18 were down-regulated, while the other 5 were either up-regulated or down-regulated at different time-points of treatment. Analysis of published RNA-seq data showed that 42 of the 61 identified SmWRKYs were yeast extract and Ag(+)-responsive. Through a systematic analysis, SmWRKYs potentially involved in tanshinone biosynthesis were predicted. These results provide insights into functional conservation and diversification of SmWRKYs and are useful information for further elucidating SmWRKY functions.
Genomic Sequence around Butterfly Wing Development Genes: Annotation and Comparative Analysis
Conceição, Inês C.; Long, Anthony D.; Gruber, Jonathan D.; Beldade, Patrícia
2011-01-01
Background Analysis of genomic sequence allows characterization of genome content and organization, and access beyond gene-coding regions for identification of functional elements. BAC libraries, where relatively large genomic regions are made readily available, are especially useful for species without a fully sequenced genome and can increase genomic coverage of phylogenetic and biological diversity. For example, no butterfly genome is yet available despite the unique genetic and biological properties of this group, such as diversified wing color patterns. The evolution and development of these patterns is being studied in a few target species, including Bicyclus anynana, where a whole-genome BAC library allows targeted access to large genomic regions. Methodology/Principal Findings We characterize ∼1.3 Mb of genomic sequence around 11 selected genes expressed in B. anynana developing wings. Extensive manual curation of in silico predictions, also making use of a large dataset of expressed genes for this species, identified repetitive elements and protein coding sequence, and highlighted an expansion of Alcohol dehydrogenase genes. Comparative analysis with orthologous regions of the lepidopteran reference genome allowed assessment of conservation of fine-scale synteny (with detection of new inversions and translocations) and of DNA sequence (with detection of high levels of conservation of non-coding regions around some, but not all, developmental genes). Conclusions The general properties and organization of the available B. anynana genomic sequence are similar to the lepidopteran reference, despite the more than 140 MY divergence. Our results lay the groundwork for further studies of new interesting findings in relation to both coding and non-coding sequence: 1) the Alcohol dehydrogenase expansion with higher similarity between the five tandemly-repeated B. anynana paralogs than with the corresponding B. mori orthologs, and 2) the high conservation of non-coding sequence around the genes wingless and Ecdysone receptor, both involved in multiple developmental processes including wing pattern formation. PMID:21909358
Chandramore, Kalpana; Ito, Yuzuro; Takahashi, Shuji; Asashima, Makoto; Ghaskadbi, Surendra
2010-01-01
Hydra, a member of phylum Cnidaria that arose early in evolution, is endowed with a defined axis, organized nervous system, and active behavior. It is a powerful model system for the elucidation of evolution of developmental mechanisms in animals. Here, we describe the identification and cloning of a noggin-like gene from hydra. Noggin is a secreted protein involved at multiple stages of vertebrate embryonic development including neural induction and is known to exert its effects by inhibiting the bone morphogenetic protein (BMP)-signaling pathway. Sequence analysis revealed that hydra Noggin shows considerable similarity with its orthologs at the amino acid level. When microinjected into early Xenopus embryos, hydra noggin mRNA induced a secondary axis in 100% of the injected embryos, demonstrating functional conservation of hydra noggin in vertebrates. This was further confirmed by the partial rescue of Xenopus embryos by hydra noggin mRNA from UV-induced ventralization. By using the animal cap assay in Xenopus embryos, we demonstrate that these effects of hydra noggin in Xenopus embryos are due to inhibition of BMP signaling by Noggin. Our data indicate that BMP/Noggin antagonism predates the bilaterian divergence and has been conserved during evolution.
Association of TLR1, TLR2, TLR4, TLR6, and TIRAP polymorphisms with disease susceptibility.
Noreen, Mamoona; Arshad, Muhammad
2015-06-01
Toll-like receptors (TLRs) play a crucial role in regulation of innate as well as adaptive immunity. TLRs recognize a distinct but limited repertoire of conserved microbial products. Ligand binding to TLRs activates the signaling cascade and results in activation of multiple inflammatory genes. Variation in this immune response is under genetic control. Polymorphisms in genes associated with the inflammatory pathway especially influence the outcome of diseases. TLR2 forms heterodimers with TLR1 or TLR6 and recognizes a wide variety of microbial ligands. In this review, we summarize studies of polymorphisms in genes encoding TLR1, TLR2, TLR4, TLR6, and the most polymorphic adaptor protein, Mal/TIRAP, revealing their effect on susceptibility to diseases.
Thanki, Anil S; Soranzo, Nicola; Haerty, Wilfried; Davey, Robert P
2018-03-01
Gene duplication is a major factor contributing to evolutionary novelty, and the contraction or expansion of gene families has often been associated with morphological, physiological, and environmental adaptations. The study of homologous genes helps us to understand the evolution of gene families. It plays a vital role in finding ancestral gene duplication events as well as identifying genes that have diverged from a common ancestor under positive selection. There are various tools available, such as MSOAR, OrthoMCL, and HomoloGene, to identify gene families and visualize syntenic information between species, providing an overview of the evolution of syntenic regions at the family level. Unfortunately, none of them provide information about structural changes within genes, such as the conservation of ancestral exon boundaries among multiple genomes. The Ensembl GeneTrees computational pipeline generates gene trees based on coding sequences, provides details about exon conservation, and is used in the Ensembl Compara project to discover gene families. A certain amount of expertise is required to configure and run the Ensembl Compara GeneTrees pipeline via command line. Therefore, we converted this pipeline into a Galaxy workflow, called GeneSeqToFamily, and provided additional functionality. This workflow uses existing tools from the Galaxy ToolShed, as well as providing additional wrappers and tools that are required to run the workflow. GeneSeqToFamily represents the Ensembl GeneTrees pipeline as a set of interconnected Galaxy tools, so they can be run interactively within Galaxy's user-friendly workflow environment while still providing the flexibility to tailor the analysis by changing configurations and tools if necessary. Additional tools allow users to subsequently visualize the gene families produced by the workflow, using the Aequatus.js interactive tool, which has been developed as part of the Aequatus software project.
Essentiality, conservation, evolutionary pressure and codon bias in bacterial genomes.
Dilucca, Maddalena; Cimini, Giulio; Giansanti, Andrea
2018-07-15
Essential genes constitute the core of genes which cannot be mutated too much nor lost along the evolutionary history of a species. Natural selection is expected to be stricter on essential genes and on conserved (highly shared) genes, than on genes that are either nonessential or peculiar to a single or a few species. In order to further assess this expectation, we study here how essentiality of a gene is connected with its degree of conservation among several unrelated bacterial species, each one characterised by its own codon usage bias. Confirming previous results on E. coli, we show the existence of a universal exponential relation between gene essentiality and conservation in bacteria. Moreover, we show that, within each bacterial genome, there are at least two groups of functionally distinct genes, characterised by different levels of conservation and codon bias: i) a core of essential genes, mainly related to cellular information processing; ii) a set of less conserved nonessential genes with prevalent functions related to metabolism. In particular, the genes in the first group are more retained among species, are subject to a stronger purifying conservative selection and display a more limited repertoire of synonymous codons. The core of essential genes is close to the minimal bacterial genome, which is the focus of recent studies in synthetic biology, though we confirm that orthologs of genes that are essential in one species are not necessarily essential in other species. We also list a set of highly shared genes which, reasonably, could constitute a reservoir of targets for new anti-microbial drugs.
Linkage analysis of schizophrenia with five dopamine receptor genes in nine pedigrees
DOE Office of Scientific and Technical Information (OSTI.GOV)
Coon, H.; Byerley, W.; Holik, J.
Alterations in dopamine neurotransmission have been strongly implicated in the pathogenesis of schizophrenia for nearly 2 decades. Recently, the genes for five dopamine receptors have been cloned and characterized, and genetic and physical map information has become available. Using these five loci as candidate genes, the authors have tested for genetic linkage to schizophrenia in nine multigenerational families which include multiple affected individuals. In addition to testing conservative disease models, they have used a neurophysiological indicator variable, the P50 auditory evoked response. Deficits in gating of the P50 response have been shown to segregate with schizophrenia in this sample and may identify carriers of gene(s) predisposing for schizophrenia. Linkage results were consistently negative, indicating that a defect at any of the actual receptor sites is unlikely to be a major contributor to schizophrenia in the nine families studied. 47 refs., 1 fig., 4 tabs.
Butterfly genome reveals promiscuous exchange of mimicry adaptations among species
Dasmahapatra, Kanchon K; Walters, James R.; Briscoe, Adriana D.; Davey, John W.; Whibley, Annabel; Nadeau, Nicola J.; Zimin, Aleksey V.; Hughes, Daniel S. T.; Ferguson, Laura C.; Martin, Simon H.; Salazar, Camilo; Lewis, James J.; Adler, Sebastian; Ahn, Seung-Joon; Baker, Dean A.; Baxter, Simon W.; Chamberlain, Nicola L.; Chauhan, Ritika; Counterman, Brian A.; Dalmay, Tamas; Gilbert, Lawrence E.; Gordon, Karl; Heckel, David G.; Hines, Heather M.; Hoff, Katharina J.; Holland, Peter W.H.; Jacquin-Joly, Emmanuelle; Jiggins, Francis M.; Jones, Robert T.; Kapan, Durrell D.; Kersey, Paul; Lamas, Gerardo; Lawson, Daniel; Mapleson, Daniel; Maroja, Luana S.; Martin, Arnaud; Moxon, Simon; Palmer, William J.; Papa, Riccardo; Papanicolaou, Alexie; Pauchet, Yannick; Ray, David A.; Rosser, Neil; Salzberg, Steven L.; Supple, Megan A.; Surridge, Alison; Tenger-Trolander, Ayse; Vogel, Heiko; Wilkinson, Paul A.; Wilson, Derek; Yorke, James A.; Yuan, Furong; Balmuth, Alexi L.; Eland, Cathlene; Gharbi, Karim; Thomson, Marian; Gibbs, Richard A.; Han, Yi; Jayaseelan, Joy C.; Kovar, Christie; Mathew, Tittu; Muzny, Donna M.; Ongeri, Fiona; Pu, Ling-Ling; Qu, Jiaxin; Thornton, Rebecca L.; Worley, Kim C.; Wu, Yuan-Qing; Linares, Mauricio; Blaxter, Mark L.; Constant, Richard H. ffrench; Joron, Mathieu; Kronforst, Marcus R.; Mullen, Sean P.; Reed, Robert D.; Scherer, Steven E.; Richards, Stephen; Mallet, James; McMillan, W. Owen; Jiggins, Chris D.
2012-01-01
The evolutionary importance of hybridization and introgression has long been debated [1]. We used genomic tools to investigate introgression in Heliconius, a rapidly radiating genus of neotropical butterflies widely used in studies of ecology, behaviour, mimicry and speciation [2-5]. We sequenced the genome of Heliconius melpomene and compared it with other taxa to investigate chromosomal evolution in Lepidoptera and gene flow among multiple Heliconius species and races. Among 12,657 predicted genes for Heliconius, biologically important expansions of families of chemosensory and Hox genes are particularly noteworthy. Chromosomal organisation has remained broadly conserved since the Cretaceous, when butterflies split from the silkmoth lineage. Using genomic resequencing, we show hybrid exchange of genes between three co-mimics, H. melpomene, H. timareta, and H. elevatus, especially at two genomic regions that control mimicry pattern. Closely related Heliconius species clearly exchange protective colour pattern genes promiscuously, implying a major role for hybridization in adaptive radiation. PMID:22722851
Li, Xuyan; Xie, Xin; Li, Ji; Cui, Yuhai; Hou, Yanming; Zhai, Lulu; Wang, Xiao; Fu, Yanli; Liu, Ranran; Bian, Shaomin
2017-02-01
microRNA166 (miR166) is a highly conserved family of miRNAs implicated in a wide range of cellular and physiological processes in plants. miR166 family generally comprises multiple miR166 members in plants, which might exhibit functional redundancy and specificity. The soybean miR166 family consists of 21 members according to the miRBase database. However, the evolutionary conservation and functional diversification of miR166 family members in soybean remain poorly understood. We identified five novel miR166s in soybean by data mining approach, thus enlarging the size of miR166 family from 21 to 26 members. Phylogenetic analyses of the 26 miR166s and their precursors indicated that soybean miR166 family exhibited both evolutionary conservation and diversification, and ten pairs of miR166 precursors with high sequence identity were individually grouped into a discrete clade in the phylogenetic tree. The analysis of genomic organization and evolution of MIR166 gene family revealed that eight segmental duplications and four tandem duplications might occur during evolution of the miR166 family in soybean. The cis-elements in promoters of MIR166 family genes and their putative targets pointed to their possible contributions to the functional conservation and diversification. The targets of soybean miR166s were predicted, and the cleavage of ATHB14-LIKE transcript was experimentally validated by RACE PCR. Further, the expression patterns of the five newly identified MIR166s and 12 target genes were examined during seed development and in response to abiotic stresses, which provided important clues for dissecting their functions and isoform specificity. This study enlarged the size of soybean miR166 family from 21 to 26 members, and the 26 soybean miR166s exhibited evolutionary conservation and diversification. 
These findings have laid a foundation for elucidating functional conservation and diversification of miR166 family members, especially during seed development or under abiotic stresses.
Staiger, Dorothee; Allenbach, Laure; Salathia, Neeraj; Fiechter, Vincent; Davis, Seth J.; Millar, Andrew J.; Chory, Joanne; Fankhauser, Christian
2003-01-01
Plants possess several photoreceptors to sense the light environment. In Arabidopsis, cryptochromes and phytochromes play roles in photomorphogenesis and in the light input pathways that synchronize the circadian clock with the external world. We have identified SRR1 (sensitivity to red light reduced), a gene that plays an important role in phytochrome B (phyB)-mediated light signaling. The recessive srr1 null allele and phyB mutants display a number of similar phenotypes indicating that SRR1 is required for normal phyB signaling. Genetic analysis suggests that SRR1 works both in the phyB pathway and independently of phyB. srr1 mutants are affected in multiple outputs of the circadian clock in continuous light conditions, including leaf movement and expression of the clock components, CCA1 and TOC1. Clock-regulated gene expression is also impaired during day–night cycles and in constant darkness. The circadian phenotypes of srr1 mutants in all three conditions suggest that SRR1 activity is required for normal oscillator function. The SRR1 gene was identified and shown to code for a protein conserved in numerous eukaryotes including mammals and flies, implicating a conserved role for this protein in both the animal and plant kingdoms. PMID:12533513
Developmental Pathways Are Blueprints for Designing Successful Crops
Trevaskis, Ben
2018-01-01
Genes controlling plant development have been studied in multiple plant systems. This has provided deep insights into conserved genetic pathways controlling core developmental processes including meristem identity, phase transitions, determinacy, stem elongation, and branching. These pathways control plant growth patterns and are fundamentally important to crop biology and agriculture. This review describes the conserved pathways that control plant development, using Arabidopsis as a model. Historical examples of how plant development has been altered through selection to improve crop performance are then presented. These examples, drawn from diverse crops, show how the genetic pathways controlling development have been modified to increase yield or tailor growth patterns to suit local growing environments or specialized crop management practices. Strategies to apply current progress in genomics and developmental biology to future crop improvement are then discussed within the broader context of emerging trends in plant breeding. The ways that knowledge of developmental processes and understanding of gene function can contribute to crop improvement, beyond what can be achieved by selection alone, are emphasized. These include using genome re-sequencing, mutagenesis, and gene editing to identify or generate novel variation in developmental genes. The expanding scope for comparative genomics, the possibility to engineer new developmental traits and new approaches to resolve gene–gene or gene–environment interactions are also discussed. Finally, opportunities to integrate fundamental research and crop breeding are highlighted. PMID:29922318
NASA Technical Reports Server (NTRS)
Yang, Tianbao; Poovaiah, B. W.
2002-01-01
We reported earlier that the tobacco early ethylene-responsive gene NtER1 encodes a calmodulin-binding protein (Yang, T., and Poovaiah, B. W. (2000) J. Biol. Chem. 275, 38467-38473). Here we demonstrate that there is one NtER1 homolog as well as five related genes in Arabidopsis. These six genes are rapidly and differentially induced by environmental signals such as temperature extremes, UVB, salt, and wounding; hormones such as ethylene and abscisic acid; and signal molecules such as methyl jasmonate, H2O2, and salicylic acid. Hence, they were designated as AtSR1-6 (Arabidopsis thaliana signal-responsive genes). Ca2+/calmodulin binds to all AtSRs, and their calmodulin-binding regions are located on a conserved basic amphiphilic alpha-helical motif in the C terminus. AtSR1 targets the nucleus and specifically recognizes a novel 6-bp CGCG box (A/C/G)CGCG(G/T/C). The multiple CGCG cis-elements are found in promoters of genes such as those involved in ethylene signaling, abscisic acid signaling, and light signal perception. The DNA-binding domain in AtSR1 is located on the N-terminal 146 bp where all AtSR1-related proteins share high similarity but have no similarity to other known DNA-binding proteins. The calmodulin-binding nuclear proteins isolated from wounded leaves exhibit specific CGCG box DNA binding activities. These results suggest that the AtSR gene family encodes a family of calmodulin-binding/DNA-binding proteins involved in multiple signal transduction pathways in plants.
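The CGCG box consensus quoted above, (A/C/G)CGCG(G/T/C), maps directly onto a regular expression. The following is a minimal sketch of how a promoter sequence could be scanned for it; the function name and the example sequence are illustrative assumptions, not taken from the study. Because the consensus is its own reverse complement, scanning one strand is enough to locate sites on both.

```python
import re

# 6-bp CGCG box consensus from the abstract: (A/C/G)CGCG(G/T/C).
# A lookahead is used so that overlapping sites are all reported.
CGCG_BOX = re.compile(r"(?=([ACG]CGCG[GTC]))")

def find_cgcg_boxes(seq):
    """Return (0-based start, matched 6-mer) for every CGCG box in seq."""
    seq = seq.upper()
    return [(m.start(), m.group(1)) for m in CGCG_BOX.finditer(seq)]

# Made-up promoter fragment, for illustration only.
example = "TTACGCGGAACCGCGTTT"
print(find_cgcg_boxes(example))
```

Exact consensus matching such as this says nothing about binding affinity or context; it only lists candidate sites.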
Evolution of Daily Gene Co-expression Patterns from Algae to Plants
de los Reyes, Pedro; Romero-Campero, Francisco J.; Ruiz, M. Teresa; Romero, José M.; Valverde, Federico
2017-01-01
Daily rhythms play a key role in transcriptome regulation in plants and microalgae orchestrating responses that, among other processes, anticipate light transitions that are essential for their metabolism and development. The recent accumulation of genome-wide transcriptomic data generated under alternating light:dark periods from plants and microalgae has made possible integrative and comparative analysis that could help shed light on the evolution of daily rhythms in the green lineage. In this work, RNA-seq and microarray data generated over 24 h periods in different light regimes from the eudicot Arabidopsis thaliana and the microalgae Chlamydomonas reinhardtii and Ostreococcus tauri have been integrated and analyzed using gene co-expression networks. This analysis revealed a reduction in the size of the daily rhythmic transcriptome from around 90% in Ostreococcus, being heavily influenced by light transitions, to around 40% in Arabidopsis, where a certain independence from light transitions can be observed. A novel Multiple Bidirectional Best Hit (MBBH) algorithm was applied to associate single genes with a family of potential orthologues from evolutionarily distant species. Gene duplication, amplification and divergence of rhythmic expression profiles seem to have played a central role in the evolution of gene families in the green lineage such as Pseudo Response Regulators (PRRs), CONSTANS-Likes (COLs), and DNA-binding with One Finger (DOFs). Gene clustering and functional enrichment have been used to identify groups of genes with similar rhythmic gene expression patterns. The comparison of gene clusters between species based on potential orthologous relationships has unveiled a low to moderate level of conservation of daily rhythmic expression patterns. However, a strikingly high conservation was found for the gene clusters exhibiting their highest and/or lowest expression value during the light transitions. PMID:28751903
Modularity and evolutionary constraints in a baculovirus gene regulatory network
2013-01-01
Background The structure of regulatory networks remains an open question in our understanding of complex biological systems. Interactions during complete viral life cycles present unique opportunities to understand how host-parasite networks take shape and behave. The Anticarsia gemmatalis multiple nucleopolyhedrovirus (AgMNPV) is a large double-stranded DNA virus, whose genome may encode 152 open reading frames (ORFs). Here we present the analysis of the ordered cascade of the AgMNPV gene expression. Results We observed an earlier onset of the expression than previously reported for other baculoviruses, especially for genes involved in DNA replication. Most ORFs were expressed at higher levels in a more permissive host cell line. Genes with more than one copy in the genome had distinct expression profiles, which could indicate the acquisition of new functionalities. The transcription gene regulatory network (GRN) for 149 ORFs had a modular topology comprising five communities of highly interconnected nodes that separated key genes that are functionally related into different communities, possibly maximizing redundancy and GRN robustness by compartmentalization of important functions. Core conserved functions showed expression synchronicity, distinct GRN features and significantly less genetic diversity, consistent with evolutionary constraints imposed on key elements of biological systems. This reduced genetic diversity also had a positive correlation with the importance of the gene in our estimated GRN, supporting a relationship between phylogenetic data of baculovirus genes and network features inferred from expression data. We also observed that gene arrangement in overlapping transcripts was conserved among related baculoviruses, suggesting a principle of genome organization.
Conclusions Albeit with a reduced number of nodes (149), the AgMNPV GRN had a topology and key characteristics similar to those observed in complex cellular organisms, which indicates that modularity may be a general feature of biological gene regulatory networks. PMID:24006890
Ventura, Marco; Canchaya, Carlos; Meylan, Valèrie; Klaenhammer, Todd R.; Zink, Ralf
2003-01-01
We analyzed the tuf gene, encoding elongation factor Tu, from 33 strains representing 17 Lactobacillus species and 8 Bifidobacterium species. The tuf sequences were aligned and used to infer the phylogeny of species of lactobacilli and bifidobacteria. We demonstrated that the synonymous substitution affecting this gene renders elongation factor Tu a reliable molecular clock for investigating evolutionary distances of lactobacilli and bifidobacteria. In fact, the phylogeny generated by these tuf sequences is consistent with that derived from 16S rRNA analysis. The investigation of a multiple alignment of tuf sequences revealed regions conserved among strains belonging to the same species but distinct from those of other species. PCR primers complementary to these regions allowed species-specific identification of closely related species, such as Lactobacillus casei group members. The tuf gene-based assays developed in this study provide an alternative to present methods for the identification of lactic acid bacterial species. Since a variable number of tuf genes have been described for bacteria, the presence of multiple genes was examined. Southern analysis revealed one tuf gene in the genomes of lactobacilli and bifidobacteria, but the tuf gene was arranged differently in the genomes of these two taxa. Our results revealed that the tuf gene in bifidobacteria is flanked by the same gene constellation as the str operon, as originally reported for Escherichia coli. In contrast, bioinformatic and transcriptional analyses of the DNA region flanking the tuf gene in four Lactobacillus species indicated the same four-gene unit and suggested a novel tuf operon specific for the genus Lactobacillus. PMID:14602655
Wang, Zhihui; Cheng, Ke; Wan, Liyun; Yan, Liying; Jiang, Huifang; Liu, Shengyi; Lei, Yong; Liao, Boshou
2015-12-10
Plant bZIP proteins characteristically harbor a highly conserved bZIP domain with two structural features: a DNA-binding basic region and a leucine (Leu) zipper dimerization region. They have been shown to be diverse transcriptional regulators, playing crucial roles in plant development, physiological processes, and biotic/abiotic stress responses. Despite the availability of six completely sequenced legume genomes, a comprehensive investigation of bZIP family members in legumes has yet to be presented. In this study, we identified 428 bZIP genes encoding 585 distinct proteins in six legumes, Glycine max, Medicago truncatula, Phaseolus vulgaris, Cicer arietinum, Cajanus cajan, and Lotus japonicus. The legume bZIP genes were categorized into 11 groups according to their phylogenetic relationships with genes from Arabidopsis. Four kinds of intron patterns (a-d) within the basic and hinge regions were defined and additional conserved motifs were identified, both presenting high group specificity and supporting the group classification. We predicted the DNA-binding patterns and the dimerization properties, based on the characteristic features in the basic and hinge regions and the Leu zipper, respectively, which indicated that some highly conserved amino acid residues existed across each major group. The chromosome distribution and analysis for WGD-derived duplicated blocks revealed that the legume bZIP genes have expanded mainly by segmental duplication rather than tandem duplication. Expression data further revealed that the legume bZIP genes were expressed constitutively or in an organ-specific, development-dependent manner playing roles in multiple seed developmental stages and tissues. We also detected several key legume bZIP genes involved in drought- and salt-responses by comparing fold changes of expression values in drought-stressed or salt-stressed roots and leaves. 
In summary, this genome-wide identification, characterization and expression analysis of legume bZIP genes provides valuable information for understanding the molecular functions and evolution of the legume bZIP transcription factor family, and highlights potential legume bZIP genes involved in regulating tissue development and abiotic stress responses.
A quantitative framework to evaluate modeling of cortical development by neural stem cells
Stein, Jason L.; de la Torre-Ubieta, Luis; Tian, Yuan; Parikshak, Neelroop N.; Hernandez, Israel A.; Marchetto, Maria C.; Baker, Dylan K.; Lu, Daning; Hinman, Cassidy R.; Lowe, Jennifer K.; Wexler, Eric M.; Muotri, Alysson R.; Gage, Fred H.; Kosik, Kenneth S.; Geschwind, Daniel H.
2014-01-01
Summary Neural stem cells have been adopted to model a wide range of neuropsychiatric conditions in vitro. However, how well such models correspond to in vivo brain has not been evaluated in an unbiased, comprehensive manner. We used transcriptomic analyses to compare in vitro systems to developing human fetal brain and observed strong conservation of in vivo gene expression and network architecture in differentiating primary human neural progenitor cells (phNPCs). Conserved modules are enriched in genes associated with ASD, supporting the utility of phNPCs for studying neuropsychiatric disease. We also developed and validated a machine learning approach called CoNTExT that identifies the developmental maturity and regional identity of in vitro models. We observed strong differences between in vitro models, including hiPSC-derived neural progenitors from multiple laboratories. This work provides a systems biology framework for evaluating in vitro systems and supports their value in studying the molecular mechanisms of human neurodevelopmental disease. PMID:24991955
Wood, David L. A.; Nones, Katia; Steptoe, Anita; Christ, Angelika; Harliwong, Ivon; Newell, Felicity; Bruxner, Timothy J. C.; Miller, David; Cloonan, Nicole; Grimmond, Sean M.
2015-01-01
Genetic variation modulates gene expression transcriptionally or post-transcriptionally, and can profoundly alter an individual’s phenotype. Measuring allelic differential expression at heterozygous loci within an individual, a phenomenon called allele-specific expression (ASE), can assist in identifying such factors. Massively parallel DNA and RNA sequencing and advances in bioinformatic methodologies provide an outstanding opportunity to measure ASE genome-wide. In this study, matched DNA and RNA sequencing, genotyping arrays and computationally phased haplotypes were integrated to comprehensively and conservatively quantify ASE in a single human brain and liver tissue sample. We describe a methodological evaluation and assessment of common bioinformatic steps for ASE quantification, and recommend a robust approach to accurately measure SNP, gene and isoform ASE through the use of personalized haplotype genome alignment, strict alignment quality control and intragenic SNP aggregation. Our results indicate that accurate ASE quantification requires careful bioinformatic analyses and is adversely affected by sample specific alignment confounders and random sampling even at moderate sequence depths. We identified multiple known and several novel ASE genes in liver, including WDR72, DSP and UBD, as well as genes that contained ASE SNPs with imbalance direction discordant with haplotype phase, explainable by annotated transcript structure, suggesting isoform derived ASE. The methods evaluated in this study will be of use to researchers performing highly conservative quantification of ASE, and the genes and isoforms identified as ASE of interest to researchers studying those loci. PMID:25965996
In Vitro Propagation and Conservation of Bacopa monnieri L.
Sharma, Neelam; Singh, Rakesh; Pandey, Ruchira
2016-01-01
Bacopa monnieri L. (common name brahmi) is a traditional and renowned Indian medicinal plant with high commercial value for its memory-revitalizing potential. Demand for this herb has further escalated due to popularization of various brahmi-based drugs coupled with its reported anticancer property. Insufficient seed availability and problems associated with seed propagation, including short seed viability, are the major constraints on seed conservation in gene banks. In vitro clonal propagation by enhanced axillary branching, a prerequisite for in vitro conservation, was standardized. We have developed a simple, single-step protocol for in vitro establishment, propagation and medium-term conservation of B. monnieri. Single node explants, cultured on Murashige and Skoog's medium supplemented with BA (0.2 mg/L), exhibited shoot proliferation without callus formation. Rooting was achieved on the same medium. The in vitro raised plants were successfully transferred to soil with ~80% survival. On the same medium, shoots could also be conserved for 12 months with high survival, and genetic stability was maintained as revealed by molecular markers. The protocol optimized in the present study has been applied for culture establishment, shoot multiplication and medium-term conservation of several Bacopa germplasm, procured from different agro-ecological regions of India.
Reneker, Jeff; Shyu, Chi-Ren; Zeng, Peiyu; Polacco, Joseph C.; Gassmann, Walter
2004-01-01
We have developed a web server for the life sciences community to use to search for short repeats of DNA sequence of length between 3 and 10 000 bases within multiple species. This search employs a unique and fast hash function approach. Our system also applies information retrieval algorithms to discover knowledge of cross-species conservation of repeat sequences. Furthermore, we have incorporated a part of the Gene Ontology database into our information retrieval algorithms to broaden the coverage of the search. Our web server and tutorial can be found at http://acmes.rnet.missouri.edu. PMID:15215469
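The abstract does not describe the server's hash function, so the following is only a generic sketch of the idea of hash-based repeat search: every substring of length k is used as a key in a hash table (here a Python dictionary), and any key seen at more than one position is a repeat. The function name and the toy sequence are assumptions for illustration.

```python
from collections import defaultdict

def find_repeats(seq, k):
    """Map each k-mer that occurs more than once in seq to its 0-based start positions."""
    if k <= 0:
        raise ValueError("k must be positive")
    positions = defaultdict(list)
    # One pass over the sequence; dictionary lookups are hash-based.
    for i in range(len(seq) - k + 1):
        positions[seq[i:i + k]].append(i)
    return {kmer: pos for kmer, pos in positions.items() if len(pos) > 1}

# Toy sequence, for illustration only.
print(find_repeats("ACGTACGTTT", 4))
```

For cross-species comparison, one simple extension under the same assumptions would be to intersect the sets of repeated k-mers found in each species.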
Two rapidly evolving genes contribute to male fitness in Drosophila
Reinhardt, Josephine A; Jones, Corbin D
2013-01-01
Purifying selection often results in conservation of gene sequence and function. The most functionally conserved genes are also thought to be among the most biologically essential. These observations have led to the use of sequence conservation as a proxy for functional conservation. Here we describe two genes that are exceptions to this pattern. We show that lack of sequence conservation among orthologs of CG15460 and CG15323 – herein named jean-baptiste (jb) and karr respectively – does not necessarily predict lack of functional conservation. These two Drosophila melanogaster genes are among the most rapidly evolving protein-coding genes in this species, being nearly as diverged from their D. yakuba orthologs as random sequences are. jb and karr are both expressed at an elevated level in larval males and adult testes, but they are not accessory gland proteins and their loss does not affect male fertility. Instead, knockdown of these genes in D. melanogaster via RNA interference caused male-biased viability defects. These viability effects occur prior to the third instar for jb and during late pupation for karr. We show that putative orthologs to jb and karr are also expressed strongly in the testes of other Drosophila species and have similar gene structure across species despite low levels of sequence conservation. While standard molecular evolution tests could not reject neutrality, other data hint at a role for natural selection. Together these data provide a clear case where a lack of sequence conservation does not imply a lack of conservation of expression or function. PMID:24221639
Ezkurdia, Iakes; del Pozo, Angela; Frankish, Adam; Rodriguez, Jose Manuel; Harrow, Jennifer; Ashman, Keith; Valencia, Alfonso; Tress, Michael L.
2012-01-01
Advances in high-throughput mass spectrometry are making proteomics an increasingly important tool in genome annotation projects. Peptides detected in mass spectrometry experiments can be used to validate gene models and verify the translation of putative coding sequences (CDSs). Here, we have identified peptides that cover 35% of the genes annotated by the GENCODE consortium for the human genome as part of a comprehensive analysis of experimental spectra from two large publicly available mass spectrometry databases. We detected the translation to protein of “novel” and “putative” protein-coding transcripts as well as transcripts annotated as pseudogenes and nonsense-mediated decay targets. We provide a detailed overview of the population of alternatively spliced protein isoforms that are detectable by peptide identification methods. We found that 150 genes expressed multiple alternative protein isoforms. This constitutes the largest set of reliably confirmed alternatively spliced proteins yet discovered. Three groups of genes were highly overrepresented. We detected alternative isoforms for 10 of the 25 possible heterogeneous nuclear ribonucleoproteins, proteins with a key role in the splicing process. Alternative isoforms generated from interchangeable homologous exons and from short indels were also significantly enriched, both in human experiments and in parallel analyses of mouse and Drosophila proteomics experiments. Our results show that a surprisingly high proportion (almost 25%) of the detected alternative isoforms are only subtly different from their constitutive counterparts. Many of the alternative splicing events that give rise to these alternative isoforms are conserved in mouse. It was striking that very few of these conserved splicing events broke Pfam functional domains or would damage globular protein structures. 
This evidence of a strong bias toward subtle differences in CDS and likely conserved cellular function and structure is remarkable and strongly suggests that the translation of alternative transcripts may be subject to selective constraints. PMID:22446687
Furihata, Takashi; Maruyama, Kyonoshin; Fujita, Yasunari; Umezawa, Taishi; Yoshida, Riichiro; Shinozaki, Kazuo; Yamaguchi-Shinozaki, Kazuko
2006-02-07
bZIP-type transcription factors AREBs/ABFs bind an abscisic acid (ABA)-responsive cis-acting element named ABRE and transactivate downstream gene expression in Arabidopsis. Because AREB1 overexpression could not induce downstream gene expression, activation of AREB1 requires ABA-dependent posttranscriptional modification. We confirmed that ABA activated 42-kDa kinase activity, which, in turn, phosphorylated Ser/Thr residues of R-X-X-S/T sites in the conserved regions of AREB1. Amino acid substitutions of R-X-X-S/T sites to Ala suppressed transactivation activity, and multiple substitution of these sites resulted in almost complete suppression of transactivation activity in transient assays. In contrast, substitution of the Ser/Thr residues to Asp resulted in high transactivation activity without exogenous ABA application. A phosphorylated, transcriptionally active form was achieved by substitution of Ser/Thr in all conserved R-X-X-S/T sites to Asp. Transgenic plants overexpressing the phosphorylated active form of AREB1 expressed many ABA-inducible genes, such as RD29B, without ABA treatment. These results indicate that the ABA-dependent multisite phosphorylation of AREB1 regulates its own activation in plants.
Druml, Thomas; Salajpal, Kresimir; Dikic, Maria; Urosevic, Miroslav; Grilz-Seger, Gertrud; Baumung, Roswitha
2012-03-01
At present the Croatian Turopolje pig population comprises about 157 breeding animals. In Austria, 324 Turopolje pigs originating from six Croatian founder animals are registered. Multiple bottlenecks have occurred in this population: one major bottleneck rather recently and several older, more moderate ones. In addition, it has been subdivided into three subpopulations, one in Austria and two in Croatia, with restricted gene flow. These specificities explain the delicate situation of this endangered Croatian lard-type pig breed. In order to identify candidate breeding animals or gene pools for future conservation breeding programs, we studied the genetic diversity and population structure of this breed using microsatellite data from 197 individuals belonging to five different breeds. The genetic diversity of the Turopolje pig is dramatically low, with observed heterozygosity values ranging from 0.38 to 0.57. Although the breed has been split into three populations since 1994, two genetic clusters could be identified: one highly conserved Croatian gene pool in Turopoljski Lug and the "Posavina" gene pool mainly present in the Austrian population. The second Croatian subpopulation in Lonjsko Polje in the Posavina region shows a constant gene flow from the Turopoljski Lug animals. One practical conclusion is that it is necessary to develop a "Posavina" boar line to preserve the "Posavina" gene pool and constitute a corresponding population in Croatia. Animals of the highly inbred herd in Turopoljski Lug should not be crossed with animals of other populations since they represent a specific phenotype-genotype combination. However, to increase the genetic diversity of this herd, a program to optimize its sex ratio should be carried out, as was done in the Austrian population where the level of heterozygosity has remained moderate despite its heavy bottleneck in 1994. © 2012 Druml et al; licensee BioMed Central Ltd.
KAP1 promotes proliferation and metastatic progression of breast cancer cells.
Addison, Joseph B; Koontz, Colton; Fugett, James H; Creighton, Chad J; Chen, Dongquan; Farrugia, Mark K; Padon, Renata R; Voronkova, Maria A; McLaughlin, Sarah L; Livengood, Ryan H; Lin, Chen-Chung; Ruppert, J Michael; Pugacheva, Elena N; Ivanov, Alexey V
2015-01-15
KAP1 (TRIM28) is a transcriptional regulator in embryonic development that controls stem cell self-renewal, chromatin organization, and the DNA damage response, acting as an essential corepressor for KRAB family zinc finger proteins (KRAB-ZNF). To gain insight into the function of this large gene family, we developed an antibody that recognizes the conserved zinc fingers linker region (ZnFL) in multiple KRAB-ZNF. Here, we report that the expression of many KRAB-ZNF along with active SUMOylated KAP1 is elevated widely in human breast cancers. KAP1 silencing in breast cancer cells reduced proliferation and inhibited the growth and metastasis of tumor xenografts. Conversely, KAP1 overexpression stimulated cell proliferation and tumor growth. In cells where KAP1 was silenced, we identified multiple downregulated genes linked to tumor progression and metastasis, including EREG/epiregulin, PTGS2/COX2, MMP1, MMP2, and CD44, along with downregulation of multiple KRAB-ZNF proteins. KAP1-dependent stabilization of KRAB-ZNF required direct interactions with KAP1. Together, our results show that KAP1-mediated stimulation of multiple KRAB-ZNF contributes to the growth and metastasis of breast cancer. ©2014 American Association for Cancer Research.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Maeder, Dennis L.; Anderson, Iain; Brettin, Thomas S.
2006-05-19
We report here a comparative analysis of the genome sequence of Methanosarcina barkeri with those of Methanosarcina acetivorans and Methanosarcina mazei. All three genomes share a conserved double origin of replication and many gene clusters. M. barkeri is distinguished by having an organization that is well conserved with respect to the other Methanosarcinae in the region proximal to the origin of replication, with interspecies gene similarities as high as 95%. However, it is disordered and marked by increased transposase frequency and decreased gene synteny and gene density in the distal semi-genome. Of the 3680 open reading frames in M. barkeri, 678 had paralogs with better than 80% similarity to both M. acetivorans and M. mazei while 128 nonhypothetical orfs were unique (non-paralogous) amongst these species including a complete formate dehydrogenase operon, two genes required for N-acetylmuramic acid synthesis, a 14 gene gas vesicle cluster and a bacterial P450-specific ferredoxin reductase cluster not previously observed or characterized in this genus. A cryptic 36 kbp plasmid sequence was detected in M. barkeri that contains an orc1 gene flanked by a presumptive origin of replication consisting of 38 tandem repeats of a 143 nt motif. Three-way comparison of these genomes reveals differing mechanisms for the accrual of changes. Elongation of the large M. acetivorans genome is the result of multiple gene-scale insertions and duplications uniformly distributed in that genome, while M. barkeri is characterized by localized inversions associated with the loss of gene content. In contrast, the relatively short M. mazei genome most closely approximates the ancestral organizational state.
Wang, Rong-Kai; Zhang, Rui-Fen; Hao, Yu-Jin
2013-01-01
The MYB proteins comprise one of the largest families of transcription factors (TFs) in plants. Although several MYB genes have been characterized to play roles in secondary metabolism, the MYB family has not yet been identified in apple. In this study, 229 apple MYB genes were identified through a genome-wide analysis and divided into 45 subgroups. A computational analysis was conducted using the apple genomic database to yield a complete overview of the MYB family, including the intron-exon organizations, the sequence features of the MYB DNA-binding domains, the carboxy-terminal motifs, and the chromosomal locations. Subsequently, the expression of 18 MYB genes (12 chosen from stress-related subgroups and 6 from other subgroups) was examined in response to various abiotic stresses. It was found that several of these MYB genes, particularly MdoMYB121, were induced by multiple stresses. MdoMYB121 was then further functionally characterized. Its predicted protein was found to be localized in the nucleus. A transgenic analysis indicated that the overexpression of the MdoMYB121 gene remarkably enhanced the tolerance to high salinity, drought, and cold stresses in transgenic tomato and apple plants. Our results indicate that the MYB genes are highly conserved in plant species and that MdoMYB121 can be used as a target gene in genetic engineering approaches to improve the tolerance of plants to multiple abiotic stresses. PMID:23950843
Lagares, Antonio; Borella, Germán Ceizel; Linne, Uwe; Becker, Anke
2017-01-01
ABSTRACT Riboregulation has a major role in the fine-tuning of multiple bacterial processes. Among the RNA players, trans-encoded untranslated small RNAs (sRNAs) regulate complex metabolic networks by tuning expression from multiple target genes in response to numerous signals. In Sinorhizobium meliloti, over 400 sRNAs are expressed under different stimuli. The sRNA MmgR (standing for Makes more granules Regulator) has been of particular interest to us since its sequence and structure are highly conserved among the alphaproteobacteria and its expression is regulated by the amount and quality of the bacterium's available nitrogen source. In this work, we explored the biological role of MmgR in S. meliloti 2011 by characterizing the effect of a deletion of the internal conserved core of mmgR (mmgRΔ33–51). This mutation resulted in larger amounts of polyhydroxybutyrate (PHB) distributed into more intracellular granules than are found in the wild-type strain. This phenotype was expressed upon cessation of balanced growth owing to nitrogen depletion in the presence of surplus carbon (i.e., at a carbon/nitrogen molar ratio greater than 10). The normal PHB accumulation was complemented with a wild-type mmgR copy but not with unrelated sRNA genes. Furthermore, the expression of mmgR limited PHB accumulation in the wild type, regardless of the magnitude of the C surplus. Quantitative proteomic profiling and quantitative reverse transcription-PCR (qRT-PCR) revealed that the absence of MmgR results in a posttranscriptional overexpression of both PHB phasin proteins (PhaP1 and PhaP2). Together, our results indicate that the widely conserved alphaproteobacterial MmgR sRNA fine-tunes the regulation of PHB storage in S. meliloti. IMPORTANCE High-throughput RNA sequencing has recently uncovered an overwhelming number of trans-encoded small RNAs (sRNAs) in diverse prokaryotes. 
In the nitrogen-fixing alphaproteobacterial symbiont of alfalfa root nodules Sinorhizobium meliloti, only four out of hundreds of identified sRNA genes have been functionally characterized. Thus, uncovering the biological role of sRNAs currently represents a major issue and one that is particularly challenging because of the usually subtle quantitative regulation contributed by most characterized sRNAs. Here, we have characterized the function of the broadly conserved alphaproteobacterial sRNA gene mmgR in S. meliloti. Our results strongly suggest that mmgR encodes a negative regulator of the accumulation of polyhydroxybutyrate, the major carbon and reducing power storage polymer in S. meliloti cells growing under conditions of C/N overbalance. PMID:28167519
Gene expression allelic imbalance in ovine brown adipose tissue impacts energy homeostasis
Ghazanfar, Shila; Vuocolo, Tony; Morrison, Janna L.; Nicholas, Lisa M.; McMillen, Isabella C.; Yang, Jean Y. H.; Buckley, Michael J.
2017-01-01
Heritable trait variation within a population of organisms is largely governed by DNA variations that impact gene transcription and protein function. Identifying genetic variants that affect complex functional traits is a primary aim of population genetics studies, especially in the context of human disease and agricultural production traits. The identification of alleles directly altering mRNA expression and thereby biological function is challenging due to difficulty in isolating direct effects of cis-acting genetic variations from indirect trans-acting genetic effects. Allele specific gene expression or allelic imbalance in gene expression (AI) occurring at heterozygous loci provides an opportunity to identify genes directly impacted by cis-acting genetic variants as indirect trans-acting effects equally impact the expression of both alleles. However, the identification of genes showing AI in the context of the expression of all genes remains a challenge due to a variety of technical and statistical issues. The current study focuses on the discovery of genes showing AI using single nucleotide polymorphisms as allelic reporters. By developing a computational and statistical process that addressed multiple analytical challenges, we ranked 5,809 genes for evidence of AI using RNA-Seq data derived from brown adipose tissue samples from a cohort of late gestation fetal lambs and then identified a conservative subgroup of 1,293 genes. Thus, AI was extensive, representing approximately 25% of the tested genes. Genes associated with AI were enriched for multiple Gene Ontology (GO) terms relating to lipid metabolism, mitochondrial function and the extracellular matrix. These functions suggest that cis-acting genetic variations causing AI in the population are preferentially impacting genes involved in energy homeostasis and tissue remodelling. These functions may contribute to production traits likely to be under genetic selection in the population. PMID:28665992
Ma, Jun; Wang, Qinglian; Sun, Runrun; Xie, Fuliang; Jones, Don C.; Zhang, Baohong
2014-01-01
Plant-specific TEOSINTE-BRANCHED1/CYCLOIDEA/PCF (TCP) transcription factors play versatile roles in multiple aspects of plant growth and development. However, no systematic study has been performed in cotton. In this study, we performed for the first time the genome-wide identification and expression analysis of the TCP transcription factor family in Gossypium raimondii. A total of 38 non-redundant cotton TCP encoding genes were identified. The TCP transcription factors were divided into eleven subgroups based on phylogenetic analysis. Most TCP genes within the same subfamily demonstrated similar exon and intron organization and the motif structures were highly conserved among the subfamilies. Additionally, the chromosomal distribution pattern revealed that TCP genes were unevenly distributed across 11 out of the 13 chromosomes; segmental duplication is a predominant duplication event for TCP genes and the major contributor to the expansion of TCP gene family in G. raimondii. Moreover, the expression profiles of TCP genes shed light on their functional divergence. PMID:25322260
The evolutionary landscape of intergenic trans-splicing events in insects
Kong, Yimeng; Zhou, Hongxia; Yu, Yao; Chen, Longxian; Hao, Pei; Li, Xuan
2015-01-01
To explore the landscape of intergenic trans-splicing events and characterize their functions and evolutionary dynamics, we conduct a mega-data study of a phylogeny containing eight species across five orders of class Insecta, a model system spanning 400 million years of evolution. A total of 1,627 trans-splicing events involving 2,199 genes are identified, accounting for 1.58% of the total genes. Homology analysis reveals that mod(mdg4)-like trans-splicing is the only conserved event that is consistently observed in multiple species across two orders, which represents a unique case of functional diversification involving trans-splicing. Thus, evolutionarily its potential for generating proteins with novel function is not broadly utilized by insects. Furthermore, 146 non-mod trans-spliced transcripts are found to resemble canonical genes from different species. Trans-splicing preserving the function of ‘breakup' genes may serve as a general mechanism for relaxing the constraints on gene structure, with profound implications for the evolution of genes and genomes. PMID:26521696
The genetic structure of the A mating-type locus of Lentinula edodes.
Au, Chun Hang; Wong, Man Chun; Bao, Dapeng; Zhang, Meiyan; Song, Chunyan; Song, Wenhua; Law, Patrick Tik Wan; Kües, Ursula; Kwan, Hoi Shan
2014-02-10
The Shiitake mushroom, Lentinula edodes (Berk.) Pegler is a tetrapolar basidiomycete with two unlinked mating-type loci, commonly called the A and B loci. Identifying the mating-types in shiitake is important for enhancing the breeding and cultivation of this economically-important edible mushroom. Here, we identified the A mating-type locus from the first draft genome sequence of L. edodes and characterized multiple alleles from different monokaryotic strains. Two intron-length polymorphism markers were developed to facilitate rapid molecular determination of A mating-type. L. edodes sequences were compared with those of known tetrapolar and bipolar basidiomycete species. The A mating-type genes are conserved at the homeodomain region across the order Agaricales. However, we observed unique genomic organization of the locus in L. edodes which exhibits atypical gene order and multiple repetitive elements around its A locus. To our knowledge, this is the first known exception among Homobasidiomycetes, in which the mitochondrial intermediate peptidase (mip) gene is not closely linked to A locus. Copyright © 2013 Elsevier B.V. All rights reserved.
MEDIATOR18 and MEDIATOR20 confer susceptibility to Fusarium oxysporum in Arabidopsis thaliana
Stiller, Jiri; Davoine, Celine; Björklund, Stefan; Manners, John M.; Kazan, Kemal; Schenk, Peer M.
2017-01-01
The conserved protein complex known as Mediator conveys transcriptional signals by acting as an intermediary between transcription factors and RNA polymerase II. As a result, Mediator subunits play multiple roles in regulating developmental as well as abiotic and biotic stress pathways. In this report we identify the head domain subunits MEDIATOR18 and MEDIATOR20 as important susceptibility factors for Fusarium oxysporum infection in Arabidopsis thaliana. Mutants of MED18 and MED20 display down-regulation of genes associated with jasmonate signaling and biosynthesis, together with up-regulation of salicylic acid-associated pathogenesis-related genes and of reactive oxygen producing and scavenging genes. We propose that MED18 and MED20 form a sub-domain within Mediator that controls the balance of salicylic acid and jasmonate associated defense pathways. PMID:28441405
SEA: a super-enhancer archive.
Wei, Yanjun; Zhang, Shumei; Shang, Shipeng; Zhang, Bin; Li, Song; Wang, Xinyu; Wang, Fang; Su, Jianzhong; Wu, Qiong; Liu, Hongbo; Zhang, Yan
2016-01-04
Super-enhancers are large clusters of transcriptional enhancers regarded as having essential roles in driving the expression of genes that control cell identity during development and tumorigenesis. The construction of a genome-wide super-enhancer database is urgently needed to better understand super-enhancer-directed gene expression regulation for a given biological process. Here, we present a specifically designed web-accessible database, Super-Enhancer Archive (SEA, http://sea.edbc.org). SEA focuses on integrating super-enhancers in multiple species and annotating their potential roles in the regulation of cell identity gene expression. The current release of SEA incorporates 83 996 super-enhancers computationally or experimentally identified in 134 cell types/tissues/diseases, including human (75 439, three of which were experimentally identified), mouse (5879, five of which were experimentally identified), Drosophila melanogaster (1774) and Caenorhabditis elegans (904). To facilitate data extraction, SEA supports multiple search options, including species, genome location, gene name, cell type/tissue and super-enhancer name. The response provides detailed (epi)genetic information, incorporating cell type specificity, nearby genes, transcriptional factor binding sites, CRISPR/Cas9 target sites, evolutionary conservation, SNPs, H3K27ac, DNA methylation, gene expression and TF ChIP-seq data. Moreover, analytical tools and a genome browser were developed for users to explore super-enhancers and their roles in defining cell identity and disease processes in depth. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
2010-01-01
Background Osteopoikilosis is a rare autosomal dominant genetic disorder, characterised by the occurrence of the hyperostotic spots preferentially localized in the epiphyses and metaphyses of the long bones, and in the carpal and tarsal bones [1]. Heterozygous LEMD3 gene mutations were shown to be the primary cause of the disease [2]. Association of the primarily asymptomatic osteopoikilosis with connective tissue nevi of the skin is categorized as Buschke-Ollendorff syndrome (BOS) [3]. Additionally, osteopoikilosis can coincide with melorheostosis (MRO), a more severe bone disease characterised by the ectopic bone formation on the periosteal and endosteal surface of the long bones [4-6]. However, not all MRO affected individuals carry germ-line LEMD3 mutations [7]. Thus, the genetic cause of MRO remains unknown. Here we describe a familial case of osteopoikilosis in which a novel heterozygous LEMD3 mutation coincides with a novel mutation in EXT1, a gene involved in aetiology of multiple exostosis syndrome. The patients affected with both LEMD3 and EXT1 gene mutations displayed typical features of the osteopoikilosis. No additional skeletal manifestations were detected; however, various non-skeletal pathologies coincided in this group. Methods We investigated LEMD3 and EXT1 in the three-generation family from Poland, with 5 patients affected with osteopoikilosis and one child affected with multiple exostoses. Results We found a novel c.2203C > T (p.R735X) mutation in exon 9 of LEMD3, resulting in a premature stop codon at amino acid position 735. The mutation co-segregates with the osteopoikilosis phenotype and was not found in 200 ethnically matched controls. Another new substitution G > A was found in EXT1 gene at position 1732 (cDNA) in Exon 9 (p.A578T) in three out of five osteopoikilosis affected family members. 
Evolutionary conservation of the affected amino acid suggested possible functional relevance; however, no additional skeletal manifestations were observed other than those specific for osteopoikilosis. Finally, in one member of the family we found a splice site mutation in the EXT1 gene intron 5 (IVS5-2 A > G) resulting in the deletion of 9 bp of cDNA encoding three evolutionarily conserved amino acid residues. This child patient suffered from a severe form of exostoses, thus a causal relationship can be postulated. Conclusions We identified a new mutation in LEMD3 gene, accounting for the familial case of osteopoikilosis. In the same family we identified two novel EXT1 gene mutations. One of them, A578T, co-incided with the LEMD3 mutation. Co-incidence of LEMD3 and EXT1 gene mutations was not associated with a more severe skeletal phenotype in those patients. PMID:20618940
Baranova, Ancha; Hammarsund, Marianne; Ivanov, Dmitry; Skoblov, Mikhail; Sangfelt, Olle; Corcoran, Martin; Borodina, Tatiana; Makeeva, Natalia; Pestova, Anna; Tyazhelova, Tatiana; Nazarenko, Svetlana; Gorreta, Francesco; Alsheddi, Tariq; Schlauch, Karen; Nikitin, Eugene; Kapanadze, Bagrat; Shagin, Dmitry; Poltaraus, Andrey; Ivanovich Vorobiev, Andrey; Zabarovsky, Eugene; Lukianov, Sergey; Chandhoke, Vikas; Ibbotson, Rachel; Oscier, David; Einhorn, Stefan; Grander, Dan; Yankovsky, Nick
2003-12-04
In the present study, we describe the human and mouse RFP2 gene structure, multiple RFP2 mRNA isoforms in the two species that have different 5' UTRs and a human-specific antisense transcript RFP2OS. Since the human RFP2 5' UTR is not conserved in mouse, these findings might indicate a different regulation of RFP2 in the two species. The predicted human and mouse RFP2 proteins are shown to contain a tripartite RING finger-B-box-coiled-coil domain (RBCC), also known as a TRIM domain, and therefore belong to a subgroup of RING finger proteins that are often involved in developmental and tumorigenic processes. Because homozygous deletions of chromosomal region 13q14.3 are found in a number of malignancies, including chronic lymphocytic leukemia (CLL) and multiple myeloma (MM), we suggest that RFP2 might be involved in tumor development. This study provides necessary information for evaluation of the role of RFP2 in malignant transformation and other biological processes.
2013-01-01
Background A co-ordinated tissue-independent gene expression profile associated with growth is present in rodent models and this is hypothesised to extend to all mammals. Growth in humans has similarities to other mammals but the return to active long bone growth in the pubertal growth spurt is a distinctly human growth event. The aim of this study was to describe gene expression and biological pathways associated with stages of growth in children and to assess tissue-independent expression patterns in relation to human growth. Results We conducted gene expression analysis on a library of datasets from normal children with age annotation, collated from the NCBI Gene Expression Omnibus (GEO) and EBI Arrayexpress databases. A primary data set was generated using cells of lymphoid origin from normal children; the expression of 688 genes (ANOVA false discovery rate modified p-value, q < 0.1) was associated with age, and subsets of these genes formed clusters that correlated with the phases of growth – infancy, childhood, puberty and final height. Network analysis on these clusters identified evolutionarily conserved growth pathways (NOTCH, VEGF, TGFB, WNT and glucocorticoid receptor – Hyper-geometric test, q < 0.05). The greatest degree of network ‘connectivity’ and hence functional significance was present in infancy (Wilcoxon test, p < 0.05), which then decreased through to adulthood. These observations were confirmed in a separate validation data set from lymphoid tissue. Similar biological pathways were observed to be associated with development-related gene expression in other tissues (conjunctival epithelia, temporal lobe brain tissue and bone marrow) suggesting the existence of a tissue-independent genetic program for human growth and maturation. Conclusions Similar evolutionarily conserved pathways have been associated with gene expression and child growth in multiple tissues. 
These expression profiles associate with the developmental phases of growth including the return to active long bone growth in puberty, a distinctly human event. These observations also have direct medical relevance to pathological changes that induce disease in children. Taking into account development-dependent gene expression profiles for normal children will be key to the appropriate selection of genes and pathways as potential biomarkers of disease or as drug targets. PMID:23941278
Pridans, Clare; Lillico, Simon; Whitelaw, Bruce; Hume, David A
2014-01-01
The development of macrophages requires signaling through the lineage-restricted receptor Csf1r. Macrophage-restricted expression of transgenic reporters based upon Csf1r requires the highly conserved Fms-intronic regulatory element (FIRE). We have created a lentiviral construct containing mouse FIRE and promoter. The lentivirus is capable of directing macrophage-restricted reporter gene expression in mouse, rat, human, pig, cow, sheep, and even chicken. Rat bone marrow cells transduced with the lentivirus were capable of differentiating into macrophages expressing the reporter gene in vitro. Macrophage-restricted expression may be desirable for immunization or immune response modulation, and for gene therapy for lysosomal storage diseases and some immunodeficiencies. The small size of the Csf1r transcription control elements will allow the insertion of large “cargo” for applications in gene therapy and vaccine delivery. PMID:26015955
Westbye, Alexander B; Beatty, J Thomas; Lang, Andrew S
2017-08-01
Gene transfer agents (GTAs) are bacteriophage-like particles produced by many prokaryotes. Several members of the Alphaproteobacteria produce a class of genetically-related GTAs that is best studied in Rhodobacter capsulatus. DNA transfer by the R. capsulatus GTA (RcGTA) combines aspects of both transduction and natural transformation, as recipient cells require a natural transformation-like system to incorporate donated DNA. The genes involved in RcGTA production and recipient capability are located at multiple loci in the bacterial genome; however, a conserved phosphorelay containing the response regulator CtrA and a quorum sensing system regulate both RcGTA production and recipient capability. This review highlights recent discoveries in RcGTA biology, and focuses on the co-regulation of genes involved in RcGTA production and recipient capability. Copyright © 2017 Elsevier Ltd. All rights reserved.
Conserved Genes Act as Modifiers of Invertebrate SMN Loss of Function Defects
Chang, Howard C.; Sen, Anindya; Kalloo, Geetika; Harris, Jevede; Barsby, Tom; Walsh, Melissa B.; Satterlee, John S.; Li, Chris; Van Vactor, David; Artavanis-Tsakonas, Spyros; Hart, Anne C.
2010-01-01
Spinal Muscular Atrophy (SMA) is caused by diminished function of the Survival of Motor Neuron (SMN) protein, but the molecular pathways critical for SMA pathology remain elusive. We have used genetic approaches in invertebrate models to identify conserved SMN loss of function modifier genes. Drosophila melanogaster and Caenorhabditis elegans each have a single gene encoding a protein orthologous to human SMN; diminished function of these invertebrate genes causes lethality and neuromuscular defects. To find genes that modulate SMN function defects across species, two approaches were used. First, a genome-wide RNAi screen for C. elegans SMN modifier genes was undertaken, yielding four genes. Second, we tested the conservation of modifier gene function across species; genes identified in one invertebrate model were tested for function in the other invertebrate model. Drosophila orthologs of two genes, which were identified originally in C. elegans, modified Drosophila SMN loss of function defects. C. elegans orthologs of twelve genes, which were originally identified in a previous Drosophila screen, modified C. elegans SMN loss of function defects. Bioinformatic analysis of the conserved, cross-species, modifier genes suggests that conserved cellular pathways, specifically endocytosis and mRNA regulation, act as critical genetic modifiers of SMN loss of function defects across species. PMID:21124729
A functional analysis of the spacer of V(D)J recombination signal sequences.
Lee, Alfred Ian; Fugmann, Sebastian D; Cowell, Lindsay G; Ptaszek, Leon M; Kelsoe, Garnett; Schatz, David G
2003-10-01
During lymphocyte development, V(D)J recombination assembles antigen receptor genes from component V, D, and J gene segments. These gene segments are flanked by a recombination signal sequence (RSS), which serves as the binding site for the recombination machinery. The murine Jbeta2.6 gene segment is a recombinationally inactive pseudogene, but examination of its RSS reveals no obvious reason for its failure to recombine. Mutagenesis of the Jbeta2.6 RSS demonstrates that the sequences of the heptamer, nonamer, and spacer are all important. Strikingly, changes solely in the spacer sequence can result in dramatic differences in the level of recombination. The subsequent analysis of a library of more than 4,000 spacer variants revealed that spacer residues of particular functional importance are correlated with their degree of conservation. Biochemical assays indicate distinct cooperation between the spacer and heptamer/nonamer along each step of the reaction pathway. The results suggest that the spacer serves not only to ensure the appropriate distance between the heptamer and nonamer but also regulates RSS activity by providing additional RAG:RSS interaction surfaces. We conclude that while RSSs are defined by a "digital" requirement for absolutely conserved nucleotides, the quality of RSS function is determined in an "analog" manner by numerous complex interactions between the RAG proteins and the less-well conserved nucleotides in the heptamer, the nonamer, and, importantly, the spacer. Those modulatory effects are accurately predicted by a new computational algorithm for "RSS information content." The interplay between such binary and multiplicative modes of interactions provides a general model for analyzing protein-DNA interactions in various biological systems.
Canella, Donatella; Bernasconi, David; Gilardi, Federica; LeMartelot, Gwendal; Migliavacca, Eugenia; Praz, Viviane; Cousin, Pascal; Delorenzi, Mauro; Hernandez, Nouria; Deplancke, Bart; Desvergne, Béatrice; Guex, Nicolas; Herr, Winship; Naef, Felix; Rougemont, Jacques; Schibler, Ueli; Andersin, Teemu; Gos, Pascal; Lammers, Fabienne; Raghav, Sunil; Fabbretti, Roberto; Fortier, Arnaud; Long, Li; Vlegel, Volker; Xenarios, Ioannis; David, Fabrice; Jarosz, Yohan; Kuznetsov, Dmitry; Liechti, Robin; Martin, Olivier; Ross, Frederick; Sinclair, Lucas; Cajan, Julia; Krier, Irina; Leleu, Marion; Molina, Nacho; Naldi, Aurélien; Rey, Guillaume; Symul, Laura
2012-01-01
The genomic loci occupied by RNA polymerase (RNAP) III have been characterized in human culture cells by genome-wide chromatin immunoprecipitations, followed by deep sequencing (ChIP-seq). These studies have shown that only ∼40% of the annotated 622 human tRNA genes and pseudogenes are occupied by RNAP-III, and that these genes are often in open chromatin regions rich in active RNAP-II transcription units. We have used ChIP-seq to characterize RNAP-III-occupied loci in a differentiated tissue, the mouse liver. Our studies define the mouse liver RNAP-III-occupied loci including a conserved mammalian interspersed repeat (MIR) as a potential regulator of an RNAP-III subunit-encoding gene. They reveal that synteny relationships can be established between a number of human and mouse RNAP-III genes, and that the expression levels of these genes are significantly linked. They establish that variations within the A and B promoter boxes, as well as the strength of the terminator sequence, can strongly affect RNAP-III occupancy of tRNA genes. They reveal correlations with various genomic features that explain the observed variation of 81% of tRNA scores. In mouse liver, loci represented in the NCBI37/mm9 genome assembly that are clearly occupied by RNAP-III comprise 50 Rn5s (5S RNA) genes, 14 known non-tRNA RNAP-III genes, nine Rn4.5s (4.5S RNA) genes, and 29 SINEs. Moreover, out of the 433 annotated tRNA genes, half are occupied by RNAP-III. Transfer RNA gene expression levels reflect both an underlying genomic organization conserved in dividing human culture cells and resting mouse liver cells, and the particular promoter and terminator strengths of individual genes. PMID:22287103
Liu, Hui; Robinson, Gene E.; Jakobsson, Eric
2016-06-01
The emerging field of sociogenomics explores the relations between social behavior and genome structure and function. An important question is the extent to which associations between social behavior and gene expression are conserved among the Metazoa. Prior experimental work in an invertebrate model of social behavior, the honey bee, revealed distinct brain gene expression patterns in African and European honey bees, and within European honey bees with different behavioral phenotypes. The present work is a computational study of these previous findings in which we analyze, by orthology determination, the extent to which genes that are socially regulated in honey bees are conserved across the Metazoa. We found that the differentially expressed gene sets associated with alarm pheromone response, the difference between old and young bees, and the colony influence on soldier bees, are enriched in widely conserved genes, indicating that these differences have genomic bases shared with many other metazoans. By contrast, the sets of differentially expressed genes associated with the differences between African and European forager and guard bees are depleted in widely conserved genes, indicating that the genomic basis for this social behavior is relatively specific to honey bees. For the alarm pheromone response gene set, we found a particularly high degree of conservation with mammals, even though the alarm pheromone itself is bee-specific. Gene Ontology identification of human orthologs to the strongly conserved honey bee genes associated with the alarm pheromone response shows overrepresentation of protein metabolism, regulation of protein complex formation, and protein folding, perhaps associated with remodeling of critical neural circuits in response to alarm pheromone. We hypothesize that such remodeling may be an adaptation of social animals to process and respond appropriately to the complex patterns of conspecific communication essential for social organization. PMID:27359102
Phylogenomic Analysis and Dynamic Evolution of Chloroplast Genomes in Salicaceae
Huang, Yuan; Wang, Jun; Yang, Yongping; Fan, Chuanzhu; Chen, Jiahui
2017-01-01
Chloroplast genomes of plants are highly conserved in both gene order and gene content. Analysis of the whole chloroplast genome is known to provide many more informative DNA sites and thus yields high resolution for plant phylogenies. Here, we report the complete chloroplast genomes of three Salix species in the family Salicaceae. The phylogeny of Salicaceae inferred from complete chloroplast genomes is generally consistent with previous studies but is resolved with higher statistical support. Phylogenetic incongruences, however, are observed in the genus Populus, and most likely result from homoplasy. By comparing the three Salix chloroplast genomes with the published chloroplast genomes of other Salicaceae species, we demonstrate that the synteny and length of chloroplast genomes in Salicaceae are highly conserved but have nonetheless experienced dynamic evolution among species. We identify seven positively selected chloroplast genes in Salicaceae, which might be related to the adaptive evolution of Salicaceae species. Comparative chloroplast genome analysis within the family also indicates that some chloroplast genes have been lost or have become pseudogenes, from which we infer that these chloroplast genes were horizontally transferred to the nuclear genome. Based on the complete nuclear genome sequences of two Salicaceae species, we find, remarkably, that the entire chloroplast genome has indeed been transferred to and integrated into the nuclear genome at least once in the individual used for the P. trichocarpa reference genome. This observation, along with the presence of large nuclear plastid DNA segments (NUPTs), including NUPTs that contain multiple chloroplast genes in their original chloroplast-genome order, favors the DNA-mediated hypothesis of organelle-to-nucleus DNA transfer. Overall, the phylogenomic analysis using complete chloroplast genomes clearly elucidates the phylogeny of Salicaceae. 
The identification of positively selected chloroplast genes and dynamic chloroplast-to-nucleus gene transfers in Salicaceae provide resources to better understand the successful adaptation of Salicaceae species. PMID:28676809
Targeted gene flow for conservation.
Kelly, Ella; Phillips, Ben L
2016-04-01
Anthropogenic threats often impose strong selection on affected populations, causing rapid evolutionary responses. Unfortunately, these adaptive responses are rarely harnessed for conservation. We suggest that conservation managers pay close attention to adaptive processes and geographic variation, with an eye to using them for conservation goals. Translocating pre-adapted individuals into recipient populations is currently considered a potentially important management tool in the face of climate change. Targeted gene flow, which involves moving individuals with favorable traits to areas where these traits would have a conservation benefit, could have a much broader application in conservation. Across a species' range there may be long-standing geographic variation in traits or variation may have rapidly developed in response to a threatening process. Targeted gene flow could be used to promote natural resistance to threats to increase species resilience. We suggest that targeted gene flow is a currently underappreciated strategy in conservation that has applications ranging from the management of invasive species and their impacts to controlling the impact and virulence of pathogens. © 2015 Society for Conservation Biology.
Okada, Kazuma; Moriya, Shigeki; Haji, Takashi; Abe, Kazuyuki
2013-06-01
Using 11 consensus primer pairs designed from S-linked F-box genes of apple and Japanese pear, 10 new F-box genes (MdFBX21 to 30) were isolated from the apple cultivar 'Spartan' (S(9)S(10)). MdFBX21 to 23 and MdFBX24 to 30 were completely linked to the S(9)-RNase and S(10)-RNase, respectively, and showed pollen-specific expression and S-haplotype-specific polymorphisms. Therefore, these 10 F-box genes are good candidates for the pollen determinant of self-incompatibility in apple. Phylogenetic analysis and comparison of deduced amino acid sequences of MdFBX21 to 30 with those of 25 S-linked F-box genes previously isolated from apple showed that a deduced amino acid identity of greater than 88.0% can be used as the tentative criterion to classify F-box genes into one type. Using this criterion, 31 of 35 F-box genes of apple were classified into 11 types (SFBB1-11). All types included F-box genes derived from S(3)- and S(9)-haplotypes, and seven types included F-box genes derived from S(3)-, S(9)-, and S(10)-haplotypes. Moreover, comparison of nucleotide sequences of S-RNases and multiple F-box genes among S(3)-, S(9)-, and S(10)-haplotypes suggested that F-box genes within each type showed high nucleotide identity regardless of the identity of the S-RNase. The large number of F-box genes as candidates for the pollen determinant and the high degree of conservation within each type are consistent with the collaborative non-self-recognition model reported for Petunia. These findings support the view that the collaborative non-self-recognition system also exists in apple.
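The identity criterion described above (more than 88.0% deduced amino acid identity placing two F-box genes in one type) can be illustrated with a small sketch. This is not the authors' procedure: the abstract does not say how identities were computed or how genes were grouped, so the sketch assumes pre-aligned, equal-length sequences and uses single-linkage grouping as a stand-in. The function names and the toy sequences are hypothetical.

```python
from itertools import combinations


def percent_identity(a, b):
    """Percent identity between two pre-aligned sequences of equal length.

    Columns where both sequences have a gap are ignored; a gap opposite a
    residue counts as a mismatch. (Assumed convention, not from the source.)
    """
    if len(a) != len(b):
        raise ValueError("sequences must be pre-aligned to equal length")
    columns = [(x, y) for x, y in zip(a, b) if not (x == "-" and y == "-")]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y and x != "-")
    return 100.0 * matches / len(columns)


def group_by_identity(seqs, threshold=88.0):
    """Single-linkage grouping: join two genes when identity > threshold."""
    parent = {name: name for name in seqs}

    def find(name):
        # Follow parent links to the group representative (path halving).
        while parent[name] != name:
            parent[name] = parent[parent[name]]
            name = parent[name]
        return name

    for a, b in combinations(seqs, 2):
        if percent_identity(seqs[a], seqs[b]) > threshold:
            parent[find(a)] = find(b)

    groups = {}
    for name in seqs:
        groups.setdefault(find(name), []).append(name)
    return sorted(sorted(members) for members in groups.values())


# Hypothetical toy sequences, far shorter than real F-box proteins.
toy = {
    "geneA": "MKTLLVAGGA",
    "geneB": "MKTLLVAGGA",
    "geneC": "MKTQQQQQQQ",
}
print(group_by_identity(toy))
```

Single linkage is only one possible choice; a stricter scheme (for example, requiring every pair within a type to exceed the threshold) could split groups that this sketch merges.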
NASA Astrophysics Data System (ADS)
Novianti, T.; Sadikin, M.; Widia, S.; Juniantito, V.; Arida, E. A.
2018-03-01
Identifying previously uncharacterized genes is essential for analyzing their role in biological processes. Identifying the still-unknown DNA sequence of the HIF 1α gene is important for analyzing its contribution to tissue regeneration in the tail of the lizard Hemidactylus platyurus. Bioinformatics and PCR techniques offer a relatively easy way to identify such a gene. The most widely used method is BLAST (Basic Local Alignment Search Tool), which aligns a query against sequences from other organisms. BLAST is available online at https://blast.ncbi.nlm.nih.gov/Blast.cgi and can retrieve similar sequences from both closely and distantly related species. Gekko japonicus is the species most closely related to H. platyurus. The HIF 1α gene sequence of G. japonicus was compared with those of other species by multiple alignment in Mega7 software. Conserved base regions were identified using the Clustal IX method. Primers for the HIF 1α gene were designed with Primer3 software. The HIF 1α gene of the lizard (H. platyurus) was successfully amplified on a real-time PCR machine using the primers designed from Gekko japonicus. The previously unidentified lizard HIF 1α gene was thus successfully identified by the multiple alignment method. Expression was analyzed during tail growth on days 1, 3, 5, 7, 10, 13 and 17 after autotomy. Amplification of the HIF 1α gene was described by the CT value from the real-time PCR machine, and HIF 1α gene expression was quantified with the Livak formula. The chi-square test gave a significance value of 0.000, which means that HIF 1α gene expression differs among the days of tail growth.
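The Livak formula mentioned above is commonly written as the 2^(-ΔΔCT) method, where ΔCT is the CT of the target gene minus the CT of a reference gene, and ΔΔCT is the ΔCT of the sample minus the ΔCT of a calibrator. The sketch below shows that arithmetic only. The abstract does not name the reference gene or the calibrator condition, so both are assumptions here, and all CT values are hypothetical.

```python
def livak_fold_change(ct_target_sample, ct_ref_sample,
                      ct_target_calibrator, ct_ref_calibrator):
    """Relative expression by the 2^(-ddCT) (Livak) method.

    Arguments are CT values for the target gene and a reference gene,
    measured in the sample of interest and in a calibrator sample.
    Which reference gene and calibrator to use is an assumption left
    to the caller; the source abstract does not specify them.
    """
    delta_ct_sample = ct_target_sample - ct_ref_sample
    delta_ct_calibrator = ct_target_calibrator - ct_ref_calibrator
    delta_delta_ct = delta_ct_sample - delta_ct_calibrator
    return 2.0 ** (-delta_delta_ct)


# Hypothetical CT values: the target crosses threshold two cycles earlier
# (relative to the reference gene) in the sample than in the calibrator.
fold = livak_fold_change(ct_target_sample=24.0, ct_ref_sample=18.0,
                         ct_target_calibrator=26.0, ct_ref_calibrator=18.0)
print(fold)  # 4.0
```

The method assumes roughly equal and near-100% amplification efficiency for target and reference; when that does not hold, an efficiency-corrected model would be a better fit.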
Forest gene conservation programs in Alberta, Canada
Jodie Krakowski
2017-01-01
Provincial tree improvement programs in Alberta began in 1976. Early gene conservation focused on ex situ measures such as seed and clone banking, and research trials of commercial species with tree improvement programs. The gene conservation program now encompasses representative and unique populations of all native tree species in situ. The ex situ program aims to...
Bass, Lydia; Liebert, Cynthia A.; Lee, Margie D.; Summers, Anne O.; White, David G.; Thayer, Stephan G.; Maurer, John J.
1999-01-01
Antibiotic resistance among avian bacterial isolates is common and is of great concern to the poultry industry. Approximately 36% (n = 100) of avian, pathogenic Escherichia coli isolates obtained from diseased poultry exhibited multiple-antibiotic resistance to tetracycline, oxytetracycline, streptomycin, sulfonamides, and gentamicin. Clinical avian E. coli isolates were further screened for the presence of markers for class 1 integrons, the integron recombinase intI1 and the quaternary ammonium resistance gene qacEΔ1, in order to determine the contribution of integrons to the observed multiple-antibiotic resistance phenotypes. Sixty-three percent of the clinical isolates were positive for the class 1 integron markers intI1 and qacEΔ1. PCR analysis with the conserved class 1 integron primers yielded amplicons of approximately 1 kb from E. coli isolates positive for intI1 and qacEΔ1. These PCR amplicons contained the spectinomycin-streptomycin resistance gene aadA1. Further characterization of the identified integrons revealed that many were part of the transposon Tn21, a genetic element that encodes both antibiotic resistance and heavy-metal resistance to mercuric compounds. Fifty percent of the clinical isolates positive for the integron marker gene intI1 as well as for the qacEΔ1 and aadA1 cassettes also contained the mercury reductase gene merA. The correlation between the presence of the merA gene with that of the integrase and antibiotic resistance genes suggests that these integrons are located in Tn21. The presence of these elements among avian E. coli isolates of diverse genetic makeup as well as in Salmonella suggests the mobility of Tn21 among pathogens in humans as well as poultry. PMID:10582884
Haake, David A.; Suchard, Marc A.; Kelley, Melissa M.; Dundoo, Manjula; Alt, David P.; Zuerner, Richard L.
2004-01-01
Leptospires belong to a genus of parasitic bacterial spirochetes that have adapted to a broad range of mammalian hosts. Mechanisms of leptospiral molecular evolution were explored by sequence analysis of four genes shared by 38 strains belonging to the core group of pathogenic Leptospira species: L. interrogans, L. kirschneri, L. noguchii, L. borgpetersenii, L. santarosai, and L. weilii. The 16S rRNA and lipL32 genes were highly conserved, and the lipL41 and ompL1 genes were significantly more variable. Synonymous substitutions are distributed throughout the ompL1 gene, whereas nonsynonymous substitutions are clustered in four variable regions encoding surface loops. While phylogenetic trees for the 16S, lipL32, and lipL41 genes were relatively stable, 8 of 38 (20%) ompL1 sequences had mosaic compositions consistent with horizontal transfer of DNA between related bacterial species. A novel Bayesian multiple change point model was used to identify the most likely sites of recombination and to determine the phylogenetic relatedness of the segments of the mosaic ompL1 genes. Segments of the mosaic ompL1 genes encoding two of the surface-exposed loops were likely acquired by horizontal transfer from a peregrine allele of unknown ancestry. Identification of the most likely sites of recombination with the Bayesian multiple change point model, an approach which has not previously been applied to prokaryotic gene sequence analysis, serves as a model for future studies of recombination in molecular evolution of genes. PMID:15090524
Sebestyén, Endre; Nagy, Tibor; Suhai, Sándor; Barta, Endre
2009-01-01
Background The comparative genomic analysis of a large number of orthologous promoter regions of the chordate and plant genes from the DoOP databases shows thousands of conserved motifs. Most of these motifs differ from any known transcription factor binding site (TFBS). To identify common conserved motifs, we need a specific tool to be able to search amongst them. Since conserved motifs from the DoOP databases are linked to genes, the result of such a search can give a list of genes that are potentially regulated by the same transcription factor(s). Results We have developed a new tool called DoOPSearch for the analysis of the conserved motifs in the promoter regions of chordate or plant genes. We used the orthologous promoters of the DoOP database to extract thousands of conserved motifs from different taxonomic groups. The advantage of this approach is that different sets of conserved motifs might be found depending on how broad the taxonomic coverage of the underlying orthologous promoter sequence collection is (consider e.g. primates vs. mammals or Brassicaceae vs. Viridiplantae). The DoOPSearch tool allows the users to search these motif collections or the promoter regions of DoOP with user supplied query sequences or any of the conserved motifs from the DoOP database. To find overrepresented gene ontologies, the gene lists obtained can be analysed further using a modified version of the GeneMerge program. Conclusion We present here a comparative genomics based promoter analysis tool. Our system is based on a unique collection of conserved promoter motifs characteristic of different taxonomic groups. We offer both a command line and a web-based tool for searching in these motif collections using user specified queries. These can be either short promoter sequences or consensus sequences of known transcription factor binding sites. 
The GeneMerge analysis of the search results allows the user to identify statistically overrepresented Gene Ontology terms that might provide a clue on the function of the motifs and genes. PMID:19534755
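Searching a promoter sequence with the consensus sequence of a transcription factor binding site, as described above, can be sketched as follows. This is not DoOPSearch's implementation, which the abstract does not describe; it is a minimal illustration that expands standard IUPAC nucleotide ambiguity codes into a regular expression and reports every (possibly overlapping) match. The function name and the example promoter are hypothetical.

```python
import re

# Standard IUPAC nucleotide ambiguity codes expanded to regex classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]",
    "B": "[CGT]", "D": "[AGT]", "H": "[ACT]", "V": "[ACG]",
    "N": "[ACGT]",
}


def find_consensus(consensus, promoter):
    """Return (start, matched_text) for each hit of an IUPAC consensus.

    A lookahead is used so that overlapping occurrences are all reported.
    Only the given strand is scanned; the reverse complement is not.
    """
    body = "".join(IUPAC[base] for base in consensus.upper())
    pattern = re.compile("(?=(" + body + "))")
    return [(m.start(), m.group(1)) for m in pattern.finditer(promoter.upper())]


# Hypothetical promoter fragment scanned with a TATA-like consensus.
hits = find_consensus("TATAWA", "GGTATAAATATATAGG")
print(hits)  # [(2, 'TATAAA'), (8, 'TATATA')]
```

A consensus match is a much cruder model than a position weight matrix, so hits from a sketch like this would only be candidates for further analysis.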
The genomes of two key bumblebee species with primitive eusocial organization.
Sadd, Ben M; Barribeau, Seth M; Bloch, Guy; de Graaf, Dirk C; Dearden, Peter; Elsik, Christine G; Gadau, Jürgen; Grimmelikhuijzen, Cornelis J P; Hasselmann, Martin; Lozier, Jeffrey D; Robertson, Hugh M; Smagghe, Guy; Stolle, Eckart; Van Vaerenbergh, Matthias; Waterhouse, Robert M; Bornberg-Bauer, Erich; Klasberg, Steffen; Bennett, Anna K; Câmara, Francisco; Guigó, Roderic; Hoff, Katharina; Mariotti, Marco; Munoz-Torres, Monica; Murphy, Terence; Santesmasses, Didac; Amdam, Gro V; Beckers, Matthew; Beye, Martin; Biewer, Matthias; Bitondi, Márcia M G; Blaxter, Mark L; Bourke, Andrew F G; Brown, Mark J F; Buechel, Severine D; Cameron, Rossanah; Cappelle, Kaat; Carolan, James C; Christiaens, Olivier; Ciborowski, Kate L; Clarke, David F; Colgan, Thomas J; Collins, David H; Cridge, Andrew G; Dalmay, Tamas; Dreier, Stephanie; du Plessis, Louis; Duncan, Elizabeth; Erler, Silvio; Evans, Jay; Falcon, Tiago; Flores, Kevin; Freitas, Flávia C P; Fuchikawa, Taro; Gempe, Tanja; Hartfelder, Klaus; Hauser, Frank; Helbing, Sophie; Humann, Fernanda C; Irvine, Frano; Jermiin, Lars S; Johnson, Claire E; Johnson, Reed M; Jones, Andrew K; Kadowaki, Tatsuhiko; Kidner, Jonathan H; Koch, Vasco; Köhler, Arian; Kraus, F Bernhard; Lattorff, H Michael G; Leask, Megan; Lockett, Gabrielle A; Mallon, Eamonn B; Antonio, David S Marco; Marxer, Monika; Meeus, Ivan; Moritz, Robin F A; Nair, Ajay; Näpflin, Kathrin; Nissen, Inga; Niu, Jinzhi; Nunes, Francis M F; Oakeshott, John G; Osborne, Amy; Otte, Marianne; Pinheiro, Daniel G; Rossié, Nina; Rueppell, Olav; Santos, Carolina G; Schmid-Hempel, Regula; Schmitt, Björn D; Schulte, Christina; Simões, Zilá L P; Soares, Michelle P M; Swevers, Luc; Winnebeck, Eva C; Wolschin, Florian; Yu, Na; Zdobnov, Evgeny M; Aqrawi, Peshtewani K; Blankenburg, Kerstin P; Coyle, Marcus; Francisco, Liezl; Hernandez, Alvaro G; Holder, Michael; Hudson, Matthew E; Jackson, LaRonda; Jayaseelan, Joy; Joshi, Vandita; Kovar, Christie; Lee, Sandra L; Mata, Robert; Mathew, Tittu; 
Newsham, Irene F; Ngo, Robin; Okwuonu, Geoffrey; Pham, Christopher; Pu, Ling-Ling; Saada, Nehad; Santibanez, Jireh; Simmons, DeNard; Thornton, Rebecca; Venkat, Aarti; Walden, Kimberly K O; Wu, Yuan-Qing; Debyser, Griet; Devreese, Bart; Asher, Claire; Blommaert, Julie; Chipman, Ariel D; Chittka, Lars; Fouks, Bertrand; Liu, Jisheng; O'Neill, Meaghan P; Sumner, Seirian; Puiu, Daniela; Qu, Jiaxin; Salzberg, Steven L; Scherer, Steven E; Muzny, Donna M; Richards, Stephen; Robinson, Gene E; Gibbs, Richard A; Schmid-Hempel, Paul; Worley, Kim C
2015-04-24
The shift from solitary to social behavior is one of the major evolutionary transitions. Primitively eusocial bumblebees are uniquely placed to illuminate the evolution of highly eusocial insect societies. Bumblebees are also invaluable natural and agricultural pollinators, and there is widespread concern over recent population declines in some species. High-quality genomic data will inform key aspects of bumblebee biology, including susceptibility to implicated population viability threats. We report the high quality draft genome sequences of Bombus terrestris and Bombus impatiens, two ecologically dominant bumblebees and widely utilized study species. Comparing these new genomes to those of the highly eusocial honeybee Apis mellifera and other Hymenoptera, we identify deeply conserved similarities, as well as novelties key to the biology of these organisms. Some honeybee genome features thought to underpin advanced eusociality are also present in bumblebees, indicating an earlier evolution in the bee lineage. Xenobiotic detoxification and immune genes are similarly depauperate in bumblebees and honeybees, and multiple categories of genes linked to social organization, including development and behavior, show high conservation. Key differences identified include a bias in bumblebee chemoreception towards gustation from olfaction, and striking differences in microRNAs, potentially responsible for gene regulation underlying social and other traits. These two bumblebee genomes provide a foundation for post-genomic research on these key pollinators and insect societies. Overall, gene repertoires suggest that the route to advanced eusociality in bees was mediated by many small changes in many genes and processes, and not by notable expansion or depauperation.
Functional and Genomic Features of Human Genes Mutated in Neuropsychiatric Disorders
Forero, Diego A.; Prada, Carlos F.; Perry, George
2016-01-01
Background: In recent years, a large number of studies around the world have led to the identification of causal genes for hereditary types of common and rare neurological and psychiatric disorders. Objective: To explore the functional and genomic features of known human genes mutated in neuropsychiatric disorders. Methods: A systematic search was used to develop a comprehensive catalog of genes mutated in neuropsychiatric disorders (NPD). Functional enrichment and protein-protein interaction analyses were carried out. A false discovery rate approach was used for correction for multiple testing. Results: We found several functional categories that are enriched among NPD genes, such as gene ontologies, protein domains, tissue expression, signaling pathways and regulation by brain-expressed miRNAs and transcription factors. Sixty six of those NPD genes are known to be druggable. Several topographic parameters of protein-protein interaction networks and the degree of conservation between orthologous genes were identified as significant among NPD genes. Conclusion: These results represent one of the first analyses of enrichment of functional categories of genes known to harbor mutations for NPD. These findings could be useful for a future creation of computational tools for prioritization of novel candidate genes for NPD. PMID:27990183
Natural Killer Cell Receptor Genes in the Family Equidae: Not only Ly49
Futas, Jan; Horin, Petr
2013-01-01
Natural killer (NK) cells have important functions in immunity. NK recognition in mammals can be mediated through killer cell immunoglobulin-like receptors (KIR) and/or killer cell lectin-like Ly49 receptors. Genes encoding highly variable NK cell receptors (NKR) represent rapidly evolving genomic regions. No single conservative model of NKR genes was observed in mammals. Single-copy low polymorphic NKR genes present in one mammalian species may expand into highly polymorphic multigene families in other species. In contrast to other non-rodent mammals, multiple Ly49-like genes appear to exist in the horse, while no functional KIR genes were observed in this species. In this study, Ly49 and KIR were sought and their evolution was characterized in the entire family Equidae. Genomic sequences retrieved showed the presence of at least five highly conserved polymorphic Ly49 genes in horses, asses and zebras. These findings confirmed that the expansion of Ly49 occurred in the entire family. Several KIR-like sequences were also identified in the genome of Equids. Besides a previously identified non-functional KIR-Immunoglobulin-like transcript fusion gene (KIR-ILTA) and two putative pseudogenes, a KIR3DL-like sequence was analyzed. In contrast to previous observations made in the horse, the KIR3DL sequence, genomic organization and mRNA expression suggest that all Equids might produce a functional KIR receptor protein molecule with a single non-mutated immune tyrosine-based inhibition motif (ITIM) domain. No evidence for positive selection in the KIR3DL gene was found. Phylogenetic analysis including rhinoceros and tapir genomic DNA and deduced amino acid KIR-related sequences showed differences between families and even between species within the order Perissodactyla. 
The results suggest that the order Perissodactyla and its family Equidae with expanded Ly49 genes and with a potentially functional KIR gene may represent an interesting model for evolutionary biology of NKR genes. PMID:23724088
Conservation and diversification of Msx protein in metazoan evolution.
Takahashi, Hirokazu; Kamiya, Akiko; Ishiguro, Akira; Suzuki, Atsushi C; Saitou, Naruya; Toyoda, Atsushi; Aruga, Jun
2008-01-01
Msx (/msh) family genes encode homeodomain (HD) proteins that control ontogeny in many animal species. We compared the structures of Msx genes from a wide range of Metazoa (Porifera, Cnidaria, Nematoda, Arthropoda, Tardigrada, Platyhelminthes, Mollusca, Brachiopoda, Annelida, Echiura, Echinodermata, Hemichordata, and Chordata) to gain an understanding of the role of these genes in phylogeny. Exon-intron boundary analysis suggested that the position of the intron located N-terminally to the HDs was widely conserved in all the genes examined, including those of cnidarians. Amino acid (aa) sequence comparison revealed 3 new evolutionarily conserved domains, as well as very strong conservation of the HDs. Two of the three domains were associated with Groucho-like protein binding in both a vertebrate and a cnidarian Msx homolog, suggesting that the interaction between Groucho-like proteins and Msx proteins was established in eumetazoan ancestors. Pairwise comparison among the collected HDs and their C-flanking aa sequences revealed that the degree of sequence conservation varied depending on the animal taxa from which the sequences were derived. Highly conserved Msx genes were identified in the Vertebrata, Cephalochordata, Hemichordata, Echinodermata, Mollusca, Brachiopoda, and Anthozoa. The wide distribution of the conserved sequences in the animal phylogenetic tree suggested that metazoan ancestors had already acquired a set of conserved domains of the current Msx family genes. Interestingly, although strongly conserved sequences were recovered from the Vertebrata, Cephalochordata, and Anthozoa, the sequences from the Urochordata and Hydrozoa showed weak conservation. Because the Vertebrata-Cephalochordata-Urochordata and Anthozoa-Hydrozoa represent sister groups in the Chordata and Cnidaria, respectively, Msx sequence diversification may have occurred differentially in the course of evolution. 
We speculate that selective loss of the conserved domains in Msx family proteins contributed to the diversification of animal body organization.
Variability and repertoire size of T-cell receptor V alpha gene segments.
Becker, D M; Pattern, P; Chien, Y; Yokota, T; Eshhar, Z; Giedlin, M; Gascoigne, N R; Goodnow, C; Wolf, R; Arai, K
The immune system of higher organisms is composed largely of two distinct cell types, B lymphocytes and T lymphocytes, each of which is independently capable of recognizing an enormous number of distinct entities through their antigen receptors; surface immunoglobulin in the case of the former, and the T-cell receptor (TCR) in the case of the latter. In both cell types, the genes encoding the antigen receptors consist of multiple gene segments which recombine during maturation to produce many possible peptides. One striking difference between B- and T-cell recognition that has not yet been resolved by the structural data is the fact that T cells generally require a major histocompatibility determinant together with an antigen whereas, in most cases, antibodies recognize antigen alone. Recently, we and others have found that a series of TCR V beta gene sequences show conservation of many of the same residues that are conserved between heavy- and light-chain immunoglobulin V regions, and these V beta sequences are predicted to have an immunoglobulin-like secondary structure. To extend these studies, we have isolated and sequenced eight additional alpha-chain complementary cDNA clones and compared them with published sequences. Analyses of these sequences, reported here, indicate that V alpha regions have many of the characteristics of V beta gene segments but differ in that they almost always occur as cross-hybridizing gene families. We conclude that there may be very different selective pressures operating on V alpha and V beta sequences and that the V alpha repertoire may be considerably larger than that of V beta.
Subaran, Ryan L.; Odgerel, Zagaa; Swaminathan, Rajeswari; Glatt, Charles E.; Weissman, Myrna M.
2018-01-01
There are no known genetic variants with large effects on susceptibility to major depressive disorder (MDD). Although one proposed study approach is to increase sensitivity by increasing sample sizes, another is to focus on families with multiple affected individuals to identify genes with rare or novel variants with strong effects. Choosing the family-based approach, we performed whole-exome analysis on affected individuals (n = 12) across five MDD families, each with at least five affected individuals, early onset, and prepubertal diagnoses. We identified 67 genes where novel deleterious variants were shared among affected relatives. Gene ontology analysis shows that of these 67 genes, 18 encode transcriptional regulators, eight of which are expressed in the human brain, including four KRAB-A box-containing Zn2+ finger repressors. One of these, ZNF34, has been reported as being associated with bipolar disorder and as differentially expressed in bipolar disorder patients compared to healthy controls. We found a novel variant—encoding a non-conservative P17R substitution in the conserved repressor domain of ZNF34 protein—segregating completely with MDD in all available individuals in the family in which it was discovered. Further analysis showed a common ZNF34 coding indel segregating with MDD in a separate family, possibly indicating the presence of an unobserved, linked, rare variant in that particular family. Our results indicate that genes encoding transcription factors expressed in the brain might be an important group of MDD candidate genes and that rare variants in ZNF34 might contribute to susceptibility to MDD and perhaps other affective disorders. PMID:26823146
Kikhno, Irina
2014-01-01
Highly homologous sequences 154–157 bp in length grouped under the name of “conserved non-protein-coding element” (CNE) were revealed in all of the sequenced genomes of baculoviruses belonging to the genus Alphabaculovirus. A CNE alignment led to the detection of a set of highly conserved nucleotide clusters that occupy strictly conserved positions in the CNE sequence. The significant length of the CNE and conservation of both its length and cluster architecture were identified as a combination of characteristics that make this CNE different from known viral non-coding functional sequences. The essential role of the CNE in the Alphabaculovirus life cycle was demonstrated through the use of a CNE-knockout Autographa californica multiple nucleopolyhedrovirus (AcMNPV) bacmid. It was shown that the essential function of the CNE was not mediated by the presumed expression activities of the protein- and non-protein-coding genes that overlap the AcMNPV CNE. On the basis of the presented data, the AcMNPV CNE was categorized as a complex-structured, polyfunctional genomic element involved in an essential DNA transaction that is associated with an undefined function of the baculovirus genome. PMID:24740153
Functional analysis and transcriptional output of the Göttingen minipig genome.
Heckel, Tobias; Schmucki, Roland; Berrera, Marco; Ringshandl, Stephan; Badi, Laura; Steiner, Guido; Ravon, Morgane; Küng, Erich; Kuhn, Bernd; Kratochwil, Nicole A; Schmitt, Georg; Kiialainen, Anna; Nowaczyk, Corinne; Daff, Hamina; Khan, Azinwi Phina; Lekolool, Isaac; Pelle, Roger; Okoth, Edward; Bishop, Richard; Daubenberger, Claudia; Ebeling, Martin; Certa, Ulrich
2015-11-14
In the past decade the Göttingen minipig has gained increasing recognition as an animal model in pharmaceutical and safety research because it recapitulates many aspects of human physiology and metabolism. Genome-based comparison of drug targets together with quantitative tissue expression analysis allows rational prediction of pharmacology and cross-reactivity of human drugs in animal models, thereby improving drug attrition, which is an important challenge in the process of drug development. Here we present a new chromosome level based version of the Göttingen minipig genome together with a comparative transcriptional analysis of tissues with pharmaceutical relevance as basis for translational research. We relied on mapping and assembly of WGS (whole-genome-shotgun sequencing) derived reads to the reference genome of the Duroc pig and predict 19,228 human orthologous protein-coding genes. Genome-based prediction of the sequence of human drug targets enables the prediction of drug cross-reactivity based on conservation of binding sites. We further support the finding that the genome of Sus scrofa contains about ten times fewer pseudogenized genes than other vertebrates. Among the functional human orthologs of these minipig pseudogenes we found HEPN1, a putative tumor suppressor gene. The genomes of Sus scrofa, the Tibetan boar, the African Bushpig, and the Warthog show sequence conservation of all inactivating HEPN1 mutations, suggesting disruption before the evolutionary split of these pig species. We identify 133 Sus scrofa specific, conserved long non-coding RNAs (lncRNAs) in the minipig genome and show that these transcripts are highly conserved in the African pigs and the Tibetan boar, suggesting functional significance. Using a new minipig specific microarray we show high conservation of gene expression signatures in 13 tissues with biomedical relevance between humans and adult minipigs. 
We underline this relationship for minipig and human liver where we could demonstrate similar expression levels for most phase I drug-metabolizing enzymes. Higher expression levels and metabolic activities were found for FMO1, AKR/CRs and for phase II drug metabolizing enzymes in minipig as compared to human. The variability of gene expression in equivalent human and minipig tissues is considerably higher in minipig organs, which is important for study design in case a human target belongs to this variable category in the minipig. The first analysis of gene expression in multiple tissues during development from young to adult shows that the majority of transcriptional programs are concluded four weeks after birth. This finding is in line with the advanced state of human postnatal organ development at comparative age categories and further supports the minipig as model for pediatric drug safety studies. Genome based assessment of sequence conservation combined with gene expression data in several tissues improves the translational value of the minipig for human drug development. The genome and gene expression data presented here are important resources for researchers using the minipig as model for biomedical research or commercial breeding. Potential impact of our data for comparative genomics, translational research, and experimental medicine are discussed.
Kirby, Ralph; Herron, Paul; Hoskisson, Paul
2011-02-01
Based on available genome sequences, Actinomycetales show significant gene synteny across a wide range of species and genera. In addition, many genera show varying degrees of complex morphological development. Using the presence of gene synteny as a basis, it is clear that an analysis of gene conservation across the Streptomyces and various other Actinomycetales will provide information on both the importance of genes and gene clusters and the evolution of morphogenesis in these bacteria. Genome sequencing, although becoming cheaper, is still relatively expensive for comparing large numbers of strains. Thus, a heterologous DNA/DNA microarray hybridization dataset based on a Streptomyces coelicolor microarray allows a cheaper and greater depth of analysis of gene conservation. This study, using both bioinformatical and microarray approaches, was able to classify genes previously identified as involved in morphogenesis in Streptomyces into various subgroups in terms of conservation across species and genera. This will allow the targeting of genes for further study based on their importance at the species level and at higher evolutionary levels.
Evolutionary conservation of regulatory elements in vertebrate HOX gene clusters
DOE Office of Scientific and Technical Information (OSTI.GOV)
Santini, Simona; Boore, Jeffrey L.; Meyer, Axel
2003-12-31
Due to their high degree of conservation, comparisons of DNA sequences among evolutionarily distantly related genomes make it possible to identify functional regions in noncoding DNA. Hox genes are optimal candidate sequences for comparative genome analyses, because they are extremely conserved in vertebrates and occur in clusters. We aligned (Pipmaker) the nucleotide sequences of HoxA clusters of tilapia, pufferfish, striped bass, zebrafish, horn shark, human and mouse (over 500 million years of evolutionary distance). We identified several highly conserved intergenic sequences, likely to be important in gene regulation. Only a few of these putative regulatory elements have been previously described as being involved in the regulation of Hox genes, while several others are new elements that might have regulatory functions. The majority of these newly identified putative regulatory elements contain short fragments that are almost completely conserved and are identical to known binding sites for regulatory proteins (Transfac). The conserved intergenic regions located between the most rostrally expressed genes in the developing embryo are longer and better retained through evolution. We document that presumed regulatory sequences are retained differentially in either Aα or Aβ clusters resulting from a genome duplication in the fish lineage. This observation supports both the hypothesis that the conserved elements are involved in gene regulation and the Duplication-Deletion-Complementation model.
Iskar, Murat; Zeller, Georg; Blattmann, Peter; Campillos, Monica; Kuhn, Michael; Kaminska, Katarzyna H; Runz, Heiko; Gavin, Anne-Claude; Pepperkok, Rainer; van Noort, Vera; Bork, Peer
2013-01-01
In pharmacology, it is crucial to understand the complex biological responses that drugs elicit in the human organism and how well they can be inferred from model organisms. We therefore identified a large set of drug-induced transcriptional modules from genome-wide microarray data of drug-treated human cell lines and rat liver, and first characterized their conservation. Over 70% of these modules were common for multiple cell lines and 15% were conserved between the human in vitro and the rat in vivo system. We then illustrate the utility of conserved and cell-type-specific drug-induced modules by predicting and experimentally validating (i) gene functions, e.g., 10 novel regulators of cellular cholesterol homeostasis and (ii) new mechanisms of action for existing drugs, thereby providing a starting point for drug repositioning, e.g., novel cell cycle inhibitors and new modulators of α-adrenergic receptor, peroxisome proliferator-activated receptor and estrogen receptor. Taken together, the identified modules reveal the conservation of transcriptional responses towards drugs across cell types and organisms, and improve our understanding of both the molecular basis of drug action and human biology. PMID:23632384
Pirrò, Stefano; Zanella, Letizia; Kenzo, Maurice; Montesano, Carla; Minutolo, Antonella; Potestà, Marina; Sobze, Martin Sanou; Canini, Antonella; Cirilli, Marco; Muleo, Rosario; Colizzi, Vittorio; Galgani, Andrea
2016-01-01
Moringa oleifera is a widespread plant with substantial nutritional and medicinal value. We postulated that microRNAs (miRNAs), which are endogenous, noncoding small RNAs regulating gene expression at the post-transcriptional level, might contribute to the medicinal properties of plants of this species after ingestion into the human body, regulating human gene expression. However, knowledge about miRNAs in Moringa is scarce. To test the hypothesis on the potential pharmacological properties of miRNAs, we conducted a high-throughput sequencing analysis using the Illumina platform. A total of 31,290,964 raw reads were produced from a library of small RNA isolated from M. oleifera seeds. We identified 94 conserved and two novel miRNAs that were validated by qRT-PCR assays. Results from qRT-PCR trials conducted on the expression of 20 Moringa miRNAs showed that they are conserved across multiple plant species, as determined by their detection in tissue of other common crop plants. In silico analyses predicted target genes for the conserved miRNAs, which in turn made it possible to relate the miRNAs to the regulation of physiological processes. Some of the predicted plant miRNAs have functional homology to their mammalian counterparts and regulated human genes when they were transfected into cell lines. To our knowledge, this is the first report of discovering M. oleifera miRNAs based on high-throughput sequencing and bioinformatics analysis, and we provide new insight into a potential cross-species control of human gene expression. The widespread cultivation and consumption of M. oleifera, for nutritional and medicinal purposes, brings humans into close contact with products and extracts of this plant species. The potential for miRNA transfer should be evaluated as one possible mechanism of action to account for beneficial properties of this valuable species.
Replication and meiotic transmission of yeast ribosomal RNA genes.
Brewer, B J; Zakian, V A; Fangman, W L
1980-11-01
The yeast Saccharomyces cerevisiae has approximately 120 genes for the ribosomal RNAs (rDNA) which are organized in tandem within chromosomal DNA. These multiple-copy genes are homogeneous in sequence but can undergo changes in copy number and topology. To determine if these changes reflect unusual features of rDNA metabolism, we have examined both the replication of rDNA in the mitotic cell cycle and the inheritance of rDNA during meiosis. The results indicate that rDNA behaves identically to chromosomal DNA: each rDNA unit is replicated once during the S phase of each cell cycle and each unit is conserved through meiosis. Therefore, the flexibility in copy number and topology of rDNA does not arise from the selective replication of units in each S phase nor by the selective inheritance of units in meiosis.
Alternate approaches to repress endogenous microRNA activity in Arabidopsis thaliana
Wang, Ming-Bo
2011-01-01
MicroRNAs (miRNAs) are an endogenous class of regulatory small RNA (sRNA). In plants, miRNAs are processed from short non-protein-coding messenger RNAs (mRNAs) transcribed from small miRNA genes (MIR genes). Traditionally in the model plant Arabidopsis thaliana (Arabidopsis), the functional analysis of a gene product has relied on the identification of a corresponding T-DNA insertion knockout mutant from a large, randomly-mutagenized population. However, because of the small size of MIR genes and presence of multiple, highly conserved members in most plant miRNA families, it has been extremely laborious and time consuming to obtain a corresponding single or multiple, null mutant plant line. Our recent study published in Molecular Plant1 outlines an alternate method for the functional characterization of miRNA action in Arabidopsis, termed anti-miRNA technology. Using this approach we demonstrated that the expression of individual miRNAs or entire miRNA families, can be readily and efficiently knocked-down. Our approach is in addition to two previously reported methodologies that also allow for the targeted suppression of either individual miRNAs, or all members of a MIR gene family; these include miRNA target mimicry2,3 and transcriptional gene silencing (TGS) of MIR gene promoters.4 All three methodologies rely on endogenous gene regulatory machinery and in this article we provide an overview of these technologies and discuss their strengths and weaknesses in inhibiting the activity of their targeted miRNA(s). PMID:21358288
Conserved Curvature of RNA Polymerase I Core Promoter Beyond rRNA Genes: The Case of the Tritryps
Smircich, Pablo; Duhagon, María Ana; Garat, Beatriz
2015-01-01
In trypanosomatids, the RNA polymerase I (RNAPI)-dependent promoters controlling the ribosomal RNA (rRNA) genes have been well identified. Although the RNAPI transcription machinery recognizes the DNA conformation instead of the DNA sequence of promoters, no conformational study has been reported for these promoters. Here we present the in silico analysis of the intrinsic DNA curvature of the rRNA gene core promoters in Trypanosoma brucei, Trypanosoma cruzi, and Leishmania major. We found that, in spite of the absence of sequence conservation, these promoters hold conformational properties similar to other eukaryotic rRNA promoters. Our results also indicated that the intrinsic DNA curvature pattern is conserved within the Leishmania genus and also among strains of T. cruzi and T. brucei. Furthermore, we analyzed the impact of point mutations on the intrinsic curvature and their impact on the promoter activity. Furthermore, we found that the core promoters of protein-coding genes transcribed by RNAPI in T. brucei show the same conserved conformational characteristics. Overall, our results indicate that DNA intrinsic curvature of the rRNA gene core promoters is conserved in these ancient eukaryotes and such conserved curvature might be a requirement of RNAPI machinery for transcription of not only rRNA genes but also protein-coding genes. PMID:26718450
Gene essentiality, conservation index and co-evolution of genes in cyanobacteria.
Tiruveedula, Gopi Siva Sai; Wangikar, Pramod P
2017-01-01
Cyanobacteria, a group of photosynthetic prokaryotes, dominate the earth with ~10^15 g wet biomass. Despite diversity in habitats and an ancient origin, the cyanobacterial phylum has retained a significant core genome. Cyanobacteria are being explored for direct conversion of solar energy and carbon dioxide into biofuels. For this, efficient cyanobacterial strains will need to be designed via metabolic engineering. This will require identification of target knockouts to channelize the flow of carbon toward the product of interest while minimizing deletions of essential genes. We propose "Gene Conservation Index" (GCI) as a quick measure to predict gene essentiality in cyanobacteria. GCI is based on the phylogenetic profile of a gene constructed with a reduced dataset of cyanobacterial genomes. GCI is the percentage of organism clusters in which the query gene is present in the reduced dataset. Of the 750 genes deemed to be essential in the experimental study on S. elongatus PCC 7942, we found 494 to be conserved across the phylum, which largely comprise the essential metabolic pathways. On the contrary, the conserved but non-essential genes broadly comprise genes required under stress conditions. Exceptions to this rule include genes such as the glycogen synthesis and degradation enzymes, deoxyribose-phosphate aldolase (DERA), glucose-6-phosphate 1-dehydrogenase (zwf) and fructose-1,6-bisphosphatase class 1, which are conserved but non-essential. While the essential genes are to be avoided during gene knockout studies as potentially lethal deletions, the non-essential but conserved set of genes could be interesting targets for metabolic engineering. Further, we identify clusters of co-evolving genes (CCG), which provide insights that may be useful in annotation. Principal component analysis (PCA) plots of the CCGs are demonstrated as data visualization tools that are complementary to the conventional heatmaps. 
Our dataset consists of phylogenetic profiles for 23,643 non-redundant cyanobacterial genes. We believe that the data and the analysis presented here will be a great resource to the scientific community interested in cyanobacteria.
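The GCI definition given in this abstract (the percentage of organism clusters in which the query gene is present) can be illustrated with a minimal Python sketch. The function name, the dictionary layout of the phylogenetic profile, and the cluster labels are assumptions made for illustration only; the abstract does not describe how the authors built the clusters or stored the profiles.

```python
def gene_conservation_index(profile):
    """Return the percentage of organism clusters in which a gene is present.

    `profile` is a hypothetical phylogenetic profile: a mapping from an
    organism-cluster label to True (gene present) or False (gene absent).
    """
    if not profile:
        raise ValueError("profile must contain at least one organism cluster")
    present = sum(1 for found in profile.values() if found)
    return 100.0 * present / len(profile)


# Illustrative profile with made-up cluster labels (not real data).
example_profile = {
    "cluster_A": True,
    "cluster_B": True,
    "cluster_C": False,
    "cluster_D": True,
}

print(gene_conservation_index(example_profile))  # 75.0
```

Under this reading, a gene found in every cluster scores 100 and a gene found in none scores 0; any threshold for calling a gene "conserved" would be a separate choice that the abstract does not specify.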
Role of acyl carrier protein isoforms in plant lipid metabolism: Progress report
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ohlrogge, J.B.
1989-01-01
Previous research from my lab has revealed that several higher plant species have multiple isoforms of acyl carrier protein (ACP) and therefore this trait appears highly conserved among higher plants. This level of conservation suggests that the existence of ACP isoforms is not merely the result of neutral gene duplications. We have developed techniques to examine a wider range of species. Acyl carrier proteins can be labelled very specifically and to high specific activity using H-palmitate and the E. coli enzyme acyl-ACP synthetase. Isoforms were then resolved by western blotting and native PAGE of H-palmitate labelled ACPs. Multiple isoforms of ACP were observed in the leaf tissue of the monocots Avena sativa and Hordeum vulgare and dicots including Arabidopsis thaliana, Cuphea wrightii, and Brassica napus. Lower vascular plants including the cycad, Dioon edule, Ginkgo biloba, the gymnosperm Pinus, the fern Anemia phyllitidis and Psilotum nudum, the most primitive known extant vascular plant, were also found to have multiple ACP isoforms as were the nonvascular liverwort, Marchantia and moss, Polytrichum. Therefore, the development of ACP isoforms occurred early in evolution. However, the unicellular algae Chlamydomonas and Dunaliella and the photosynthetic cyanobacteria Synechocystis and Agmenellum have only a single electrophoretic form of ACP. Thus, multiple forms of ACP do not occur in all photosynthetic organisms but may be associated with multicellular plants.
Zhang, Ruijie; Lv, Wenhua; Luan, Meiwei; Zheng, Jiajia; Shi, Miao; Zhu, Hongjie; Li, Jin; Lv, Hongchao; Zhang, Mingming; Shang, Zhenwei; Duan, Lian; Jiang, Yongshuai
2015-11-24
Different human genes often exhibit different degrees of stability in their DNA methylation levels between tissues, samples or cell types. This may be related to the evolution of human genome. Thus, we compared the evolutionary conservation between two types of genes: genes with stable DNA methylation levels (SM genes) and genes with fluctuant DNA methylation levels (FM genes). For long-term evolutionary characteristics between species, we compared the percentage of the orthologous genes, evolutionary rate dn/ds and protein sequence identity. We found that the SM genes had greater percentages of the orthologous genes, lower dn/ds, and higher protein sequence identities in all the 21 species. These results indicated that the SM genes were more evolutionarily conserved than the FM genes. For short-term evolutionary characteristics among human populations, we compared the single nucleotide polymorphism (SNP) density, and the linkage disequilibrium (LD) degree in HapMap populations and 1000 genomes project populations. We observed that the SM genes had lower SNP densities, and higher degrees of LD in all the 11 HapMap populations and 13 1000 genomes project populations. These results mean that the SM genes had more stable chromosome genetic structures, and were more conserved than the FM genes.
Conservation of NLR-triggered immunity across plant lineages.
Maekawa, Takaki; Kracher, Barbara; Vernaldi, Saskia; Ver Loren van Themaat, Emiel; Schulze-Lefert, Paul
2012-12-04
The nucleotide-binding domain and leucine-rich repeat (NLR) family of plant receptors detects pathogen-derived molecules, designated effectors, inside host cells and mediates innate immune responses to pathogenic invaders. Genetic evidence revealed species-specific coevolution of many NLRs with effectors from host-adapted pathogens, suggesting that the specificity of these NLRs is restricted to the host or closely related plant species. However, we report that an NLR immune receptor (MLA1) from monocotyledonous barley is fully functional in partially immunocompromised dicotyledonous Arabidopsis thaliana against the barley powdery mildew fungus, Blumeria graminis f. sp. hordei. This implies ~200 million years of evolutionary conservation of the underlying immune mechanism. A time-course RNA-seq analysis in transgenic Arabidopsis lines detected sustained expression of a large MLA1-dependent gene cluster. This cluster is greatly enriched in genes known to respond to the fungal cell wall-derived microbe-associated molecular pattern chitin. The MLA1-dependent sustained transcript accumulation could define a conserved function of the nuclear pool of MLA1 detected in barley and Arabidopsis. We also found that MLA1-triggered immunity was fully retained in mutant plants that are simultaneously depleted of ethylene, jasmonic acid, and salicylic acid signaling. This points to the existence of an evolutionarily conserved and phytohormone-independent MLA1-mediated resistance mechanism. This also suggests a conserved mechanism for internalization of B. graminis f. sp. hordei effectors into host cells of flowering plants. Furthermore, the deduced connectivity of the NLR to multiple branches of immune signaling pathways likely confers increased robustness against pathogen effector-mediated interception of host immune signaling and could have contributed to the evolutionary preservation of the immune mechanism.
Wultsch, Claudia; Waits, Lisette P.; Kelly, Marcella J.
2016-01-01
With increasing anthropogenic impact and landscape change, terrestrial carnivore populations are becoming more fragmented. Thus, it is crucial to genetically monitor wild carnivores and quantify changes in genetic diversity and gene flow in response to these threats. This study combined the use of scat detector dogs and molecular scatology to conduct the first genetic study on wild populations of multiple Neotropical felids coexisting across a fragmented landscape in Belize, Central America. We analyzed data from 14 polymorphic microsatellite loci in 1053 scat samples collected from wild jaguars (Panthera onca), pumas (Puma concolor), and ocelots (Leopardus pardalis). We assessed levels of genetic diversity, defined potential genetic clusters, and examined gene flow for the three target species on a countrywide scale using a combination of individual- and population-based analyses. Wild felids in Belize showed moderate levels of genetic variation, with jaguars having the lowest diversity estimates (HE = 0.57 ± 0.02; AR = 3.36 ± 0.09), followed by pumas (HE = 0.57 ± 0.08; AR = 4.20 ± 0.16), and ocelots (HE = 0.63 ± 0.03; AR = 4.16 ± 0.08). We observed low to moderate levels of genetic differentiation for all three target species, with jaguars showing the lowest degree of genetic subdivision across the country, followed by ocelots and pumas. Although levels of genetic diversity and gene flow were still fairly high, we detected evidence of fine-scale genetic subdivision, indicating that levels of genetic connectivity for wild felids in Belize are likely to decrease if habitat loss and fragmentation continue at the current rate. Our study demonstrates the value of understanding fine-scale patterns of gene flow in multiple co-occurring felid species of conservation concern, which is vital for wildlife movement corridor planning and prioritizing future conservation and management efforts within human-impacted landscapes. PMID:26974968
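The HE values reported in this abstract are expected heterozygosity estimates. As a rough illustration, the textbook per-locus definition is HE = 1 - sum(p_i^2), where p_i is the frequency of allele i. The Python sketch below implements only that basic formula; the function name and the allele sizes are invented for illustration, and the abstract does not say which estimator or software the authors used (for example, whether a small-sample correction was applied).

```python
from collections import Counter


def expected_heterozygosity(alleles):
    """Textbook expected heterozygosity at one locus: 1 - sum(p_i ** 2).

    `alleles` is a flat list of allele calls at a single locus (two per
    diploid individual). No small-sample correction is applied here.
    """
    n = len(alleles)
    if n == 0:
        raise ValueError("at least one allele call is required")
    counts = Counter(alleles)
    return 1.0 - sum((count / n) ** 2 for count in counts.values())


# Made-up microsatellite allele sizes for four diploid individuals.
locus_calls = [150, 150, 152, 152, 154, 154, 156, 156]

print(expected_heterozygosity(locus_calls))  # 0.75
```

A multilocus value such as those quoted for jaguars, pumas, and ocelots would typically be an average of per-locus estimates across the genotyped loci, which is again an assumption about the workflow rather than something stated here.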
Role and convergent evolution of competing RNA secondary structures in mutually exclusive splicing
Yue, Yuan; Hou, Shouqing; Wang, Xiu; Zhan, Leilei; Cao, Guozheng; Li, Guoli; Shi, Yang; Zhang, Peng; Hong, Weiling; Lin, Hao; Liu, Baoping; Shi, Feng; Yang, Yun; Jin, Yongfeng
2017-01-01
Exon or cassette duplication is an important means of expanding protein and functional diversity through mutually exclusive splicing. However, the mechanistic basis of this process in non-arthropod species remains poorly understood. Here, we demonstrate that MRP1 genes underwent tandem exon duplication in Nematoda, Platyhelminthes, Annelida, Mollusca, Arthropoda, Echinodermata, and early-diverging Chordata but not in late-diverging vertebrates. Interestingly, these events were of independent origin in different phyla, suggesting convergent evolution of alternative splicing. Furthermore, we showed that multiple sets of clade-conserved RNA pairings evolved to guide species-specific mutually exclusive splicing in Arthropoda. Importantly, we also identified a similar structural code in MRP exon clusters of the annelid, Capitella teleta, and chordate, Branchiostoma belcheri, suggesting an evolutionarily conserved competing pairing-guided mechanism in bilaterians. Taken together, these data reveal the molecular determinants and RNA pairing-guided evolution of species-specific mutually exclusive splicing spanning more than 600 million years of bilaterian evolution. These findings have a significant impact on our understanding of the evolution of and mechanism underpinning isoform diversity and complex gene structure. PMID:28277933
Role and convergent evolution of competing RNA secondary structures in mutually exclusive splicing.
Yue, Yuan; Hou, Shouqing; Wang, Xiu; Zhan, Leilei; Cao, Guozheng; Li, Guoli; Shi, Yang; Zhang, Peng; Hong, Weiling; Lin, Hao; Liu, Baoping; Shi, Feng; Yang, Yun; Jin, Yongfeng
2017-10-03
Exon or cassette duplication is an important means of expanding protein and functional diversity through mutually exclusive splicing. However, the mechanistic basis of this process in non-arthropod species remains poorly understood. Here, we demonstrate that MRP1 genes underwent tandem exon duplication in Nematoda, Platyhelminthes, Annelida, Mollusca, Arthropoda, Echinodermata, and early-diverging Chordata but not in late-diverging vertebrates. Interestingly, these events were of independent origin in different phyla, suggesting convergent evolution of alternative splicing. Furthermore, we showed that multiple sets of clade-conserved RNA pairings evolved to guide species-specific mutually exclusive splicing in Arthropoda. Importantly, we also identified a similar structural code in MRP exon clusters of the annelid, Capitella teleta, and chordate, Branchiostoma belcheri, suggesting an evolutionarily conserved competing pairing-guided mechanism in bilaterians. Taken together, these data reveal the molecular determinants and RNA pairing-guided evolution of species-specific mutually exclusive splicing spanning more than 600 million years of bilaterian evolution. These findings have a significant impact on our understanding of the evolution of and mechanism underpinning isoform diversity and complex gene structure.
Xin, Chengqi; Liu, Wanfei; Lin, Qiang; Zhang, Xiaowei; Cui, Peng; Li, Fusen; Zhang, Guangyu; Pan, Linlin; Al-Amer, Ali; Mei, Hailiang; Al-Mssallem, Ibrahim S; Hu, Songnian; Al-Johi, Hasan Awad; Yu, Jun
2015-04-01
MicroRNAs (miRNAs) play crucial roles in multiple stages of plant development and regulate gene expression at posttranscriptional and translational levels. In this study, we first identified 238 conserved miRNAs in date palm (Phoenix dactylifera) based on a high-quality genome assembly and defined 78 fruit-development-associated (FDA) miRNAs, whose expression profiles are variable at different fruit development stages. Using experimental data, we subsequently detected 276 novel P. dactylifera-specific FDA miRNAs and predicted their targets. We also revealed that FDA miRNAs function mainly in regulating genes involved in starch/sucrose metabolisms and other carbon metabolic pathways; among them, 221 FDA miRNAs exhibit negative correlation with their corresponding targets, which suggests their direct regulatory roles on mRNA targets. Our data define a comprehensive set of conserved and novel FDA miRNAs along with their expression profiles, which provide a basis for further experimentation in assigning discrete functions of these miRNAs in P. dactylifera fruit development. Copyright © 2015. Published by Elsevier Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mitchell, Hugh D.; Eisfeld, Amie J.; Sims, Amy
Respiratory infections stemming from influenza viruses and the Severe Acute Respiratory Syndrome coronavirus (SARS-CoV) represent a serious public health threat as emerging pandemics. Despite efforts to identify the critical interactions of these viruses with host machinery, the key regulatory events that lead to disease pathology remain poorly targeted with therapeutics. Here we implement an integrated network interrogation approach, in which proteome and transcriptome datasets from infection of both viruses in human lung epithelial cells are utilized to predict regulatory genes involved in the host response. We take advantage of a novel “crowd-based” approach to identify and combine ranking metrics that isolate genes/proteins likely related to the pathogenicity of SARS-CoV and influenza virus. Subsequently, a multivariate regression model is used to compare predicted lung epithelial regulatory influences with data derived from other respiratory virus infection models. We predicted a small set of regulatory factors with conserved behavior for consideration as important components of viral pathogenesis that might also serve as therapeutic targets for intervention. Our results demonstrate the utility of integrating diverse ‘omic datasets to predict and prioritize regulatory features conserved across multiple pathogen infection models.
He, Xiaocui; Zhang, Yang; Yu, Ziniu
2010-10-01
The Rieske protein gene in the Pacific oyster Crassostrea gigas was obtained by in silico cloning for the first time, and its expression profiles and subcellular localization were determined. The full-length cDNA of Cgisp is 985 bp in length and contains 5'- and 3'-untranslated regions of 35 and 161 bp, respectively, with an open reading frame of 786 bp encoding a protein of 262 amino acids. The predicted molecular weight of 30 kDa of Cgisp protein was verified by prokaryotic expression. Conserved Rieske [2Fe-2S] cluster binding sites and highly matched-pair tertiary structure with 3CWB_E (Gallus gallus) were revealed by homologous analysis and molecular modeling. Eleven putative SNP sites and two conserved hexapeptide sequences, box I (THLGC) and II (PCHGS), were detected by multiple alignments. Real-time PCR analysis showed that Cgisp is expressed in a wide range of tissues, with adductor muscle exhibiting the top expression level, suggesting its biological function of energy transduction. GFP tagging of Cgisp indicated a mitochondrial localization, further confirming its physiological function.
Cunningham, Christopher B; Ji, Lexiang; Wiberg, R Axel W; Shelton, Jennifer; McKinney, Elizabeth C; Parker, Darren J; Meagher, Richard B; Benowitz, Kyle M; Roy-Zokan, Eileen M; Ritchie, Michael G; Brown, Susan J; Schmitz, Robert J; Moore, Allen J
2015-10-09
Testing for conserved and novel mechanisms underlying phenotypic evolution requires a diversity of genomes available for comparison spanning multiple independent lineages. For example, complex social behavior in insects has been investigated primarily with eusocial lineages, nearly all of which are Hymenoptera. If conserved genomic influences on sociality do exist, we need data from a wider range of taxa that also vary in their levels of sociality. Here, we present the assembled and annotated genome of the subsocial beetle Nicrophorus vespilloides, a species long used to investigate evolutionary questions of complex social behavior. We used this genome to address two questions. First, do aspects of life history, such as using a carcass to breed, predict overlap in gene models more strongly than phylogeny? We found that the overlap in gene models was similar between N. vespilloides and all other insect groups regardless of life history. Second, like other insects with highly developed social behavior but unlike other beetles, does N. vespilloides have DNA methylation? We found strong evidence for an active DNA methylation system. The distribution of methylation was similar to other insects with exons having the most methylated CpGs. Methylation status appears highly conserved; 85% of the methylated genes in N. vespilloides are also methylated in the hymenopteran Nasonia vitripennis. The addition of this genome adds a coleopteran resource to answer questions about the evolution and mechanistic basis of sociality and to address questions about the potential role of methylation in social behavior. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Kuan, Lisa; Schaffer, Jessica N.; Zouzias, Christos D.
2014-01-01
Proteus mirabilis is a Gram-negative enteric bacterium that causes complicated urinary tract infections, particularly in patients with indwelling catheters. Sequencing of clinical isolate P. mirabilis HI4320 revealed the presence of 17 predicted chaperone-usher fimbrial operons. We classified these fimbriae into three groups by their genetic relationship to other chaperone-usher fimbriae. Sixteen of these fimbriae are encoded by all seven currently sequenced P. mirabilis genomes. The predicted protein sequence of the major structural subunit for 14 of these fimbriae was highly conserved (≥95 % identity), whereas three other structural subunits (Fim3A, UcaA and Fim6A) were variable. Further examination of 58 clinical isolates showed that 14 of the 17 predicted major structural subunit genes of the fimbriae were present in most strains (>85 %). Transcription of the predicted major structural subunit genes for all 17 fimbriae was measured under different culture conditions designed to mimic conditions in the urinary tract. The majority of the fimbrial genes were induced during stationary phase, static culture or colony growth when compared to exponential-phase aerated culture. Major structural subunit proteins for six of these fimbriae were detected using MS of proteins sheared from the surface of broth-cultured P. mirabilis, demonstrating that this organism may produce multiple fimbriae within a single culture. The high degree of conservation of P. mirabilis fimbriae stands in contrast to uropathogenic Escherichia coli and Salmonella enterica, which exhibit greater variability in their fimbrial repertoires. These findings suggest there may be evolutionary pressure for P. mirabilis to maintain a large fimbrial arsenal. PMID:24809384
Scholthof, Karen-Beth G.
2015-01-01
In eukaryotes, alternative splicing (AS) promotes transcriptome and proteome diversity. The extent of genome-wide AS changes occurring during a plant-microbe interaction is largely unknown. Here, using high-throughput, paired-end RNA sequencing, we generated an isoform-level spliceome map of Brachypodium distachyon infected with Panicum mosaic virus and its satellite virus. Overall, we detected ∼44,443 transcripts in B. distachyon, ∼30% more than those annotated in the reference genome. Expression of ∼28,900 transcripts was ≥2 fragments per kilobase of transcript per million mapped fragments, and ∼42% of multi-exonic genes were alternatively spliced. Comparative analysis of AS patterns in B. distachyon, rice (Oryza sativa), maize (Zea mays), sorghum (Sorghum bicolor), Arabidopsis thaliana, potato (Solanum tuberosum), Medicago truncatula, and poplar (Populus trichocarpa) revealed conserved ratios of the AS types between monocots and dicots. Virus infection quantitatively altered AS events in Brachypodium with little effect on the AS ratios. We discovered AS events for >100 immune-related genes encoding receptor-like kinases, NB-LRR resistance proteins, transcription factors, RNA silencing, and splicing-associated proteins. Cloning and molecular characterization of SCL33, a serine/arginine-rich splicing factor, identified multiple novel intron-retaining splice variants that are developmentally regulated and modulated during virus infection. B. distachyon SCL33 splicing patterns are also strikingly conserved compared with a distant Arabidopsis SCL33 ortholog. This analysis provides new insights into AS landscapes conserved among monocots and dicots and uncovered AS events in plant defense-related genes. PMID:25634987
Conservation of Transcription Start Sites within Genes across a Bacterial Genus
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shao, Wenjun; Price, Morgan N.; Deutschbauer, Adam M.
Transcription start sites (TSSs) lying inside annotated genes, on the same or opposite strand, have been observed in diverse bacteria, but the function of these unexpected transcripts is unclear. Here, we use the metal-reducing bacterium Shewanella oneidensis MR-1 and its relatives to study the evolutionary conservation of unexpected TSSs. Using high-resolution tiling microarrays and 5'-end RNA sequencing, we identified 2,531 TSSs in S. oneidensis MR-1, of which 18% were located inside coding sequences (CDSs). Comparative transcriptome analysis with seven additional Shewanella species revealed that the majority (76%) of the TSSs within the upstream regions of annotated genes (gTSSs) were conserved. Thirty percent of the TSSs that were inside genes and on the sense strand (iTSSs) were also conserved. Sequence analysis around these iTSSs showed conserved promoter motifs, suggesting that many iTSS are under purifying selection. Furthermore, conserved iTSSs are enriched for regulatory motifs, suggesting that they are regulated, and they tend to eliminate polar effects, which confirms that they are functional. In contrast, the transcription of antisense TSSs located inside CDSs (aTSSs) was significantly less likely to be conserved (22%). However, aTSSs whose transcription was conserved often have conserved promoter motifs and drive the expression of nearby genes. Overall, our findings demonstrate that some internal TSSs are conserved and drive protein expression despite their unusual locations, but the majority are not conserved and may reflect noisy initiation of transcription rather than a biological function.
Balasuriya, U B R; Nadler, S A; Wilson, W C; Pritchard, L I; Smythe, A B; Savini, G; Monaco, F; De Santis, P; Zhang, N; Tabachnick, W J; Maclachlan, N J
2008-01-01
Comparison of the deduced amino acid sequences of the genes (S10) encoding the NS3 protein of 137 strains of bluetongue virus (BTV) from Africa, the Americas, Asia, Australia and the Mediterranean Basin showed limited variation. Common to all NS3 sequences were potential glycosylation sites at amino acid residues 63 and 150 and a cysteine at residue 137, whereas a cysteine at residue 181 was not conserved. The PPXY and PS/TAP late-domain motifs were conserved in all but three of the viruses. Phylogenetic analyses of these same sequences yielded two principal clades that grouped the viruses irrespective of their serotype or year of isolation (1900-2003). All viruses from Asia and Australia were grouped in one clade, whereas those from the other regions were present in both clades. Each clade segregated into distinct subclades that included viruses from single or multiple regions, and the S10 genes of some field viruses were identical to those of live-attenuated BTV vaccines. There was no evidence of positive selection on the S10 gene as assessed by reconstruction of ancestral codon states on the phylogeny, rather the functional constraints of the NS3 protein are expressed through substantial negative (purifying) selection.
Temperton, Ben; Gilbert, Jack A.; Quinn, John P.; McGrath, John W.
2011-01-01
Polyphosphate is a ubiquitous linear homopolymer of phosphate residues linked by high-energy bonds similar to those found in ATP. It has been associated with many processes including pathogenicity, DNA uptake and multiple stress responses across all domains. Bacteria have also been shown to use polyphosphate as a way to store phosphate when transferred from phosphate-limited to phosphate-rich media – a process exploited in wastewater treatment and other environmental contaminant remediation. Despite this, there has, to date, been little research into the role of polyphosphate in the survival of marine bacterioplankton in oligotrophic environments. The three main proteins involved in polyphosphate metabolism, Ppk1, Ppk2 and Ppx are multi-domain and have differential inter-domain and inter-gene conservation, making unbiased analysis of relative abundance in metagenomic datasets difficult. This paper describes the development of a novel Isofunctional Homolog Annotation Tool (IHAT) to detect homologs of genes with a broad range of conservation without bias of traditional expect-value cutoffs. IHAT analysis of the Global Ocean Sampling (GOS) dataset revealed that genes associated with polyphosphate metabolism are more abundant in environments where available phosphate is limited, suggesting an important role for polyphosphate metabolism in marine oligotrophs. PMID:21305044
Diversification of Root Hair Development Genes in Vascular Plants.
Huang, Ling; Shi, Xinhui; Wang, Wenjia; Ryu, Kook Hui; Schiefelbein, John
2017-07-01
The molecular genetic program for root hair development has been studied intensively in Arabidopsis (Arabidopsis thaliana). To understand the extent to which this program might operate in other plants, we conducted a large-scale comparative analysis of root hair development genes from diverse vascular plants, including eudicots, monocots, and a lycophyte. Combining phylogenetics and transcriptomics, we discovered conservation of a core set of root hair genes across all vascular plants, which may derive from an ancient program for unidirectional cell growth coopted for root hair development during vascular plant evolution. Interestingly, we also discovered preferential diversification in the structure and expression of root hair development genes, relative to other root hair- and root-expressed genes, among these species. These differences enabled the definition of sets of genes and gene functions that were acquired or lost in specific lineages during vascular plant evolution. In particular, we found substantial divergence in the structure and expression of genes used for root hair patterning, suggesting that the Arabidopsis transcriptional regulatory mechanism is not shared by other species. To our knowledge, this study provides the first comprehensive view of gene expression in a single plant cell type across multiple species. © 2017 American Society of Plant Biologists. All Rights Reserved.
Diversification of Root Hair Development Genes in Vascular Plants
Shi, Xinhui; Wang, Wenjia; Ryu, Kook Hui
2017-01-01
The molecular genetic program for root hair development has been studied intensively in Arabidopsis (Arabidopsis thaliana). To understand the extent to which this program might operate in other plants, we conducted a large-scale comparative analysis of root hair development genes from diverse vascular plants, including eudicots, monocots, and a lycophyte. Combining phylogenetics and transcriptomics, we discovered conservation of a core set of root hair genes across all vascular plants, which may derive from an ancient program for unidirectional cell growth coopted for root hair development during vascular plant evolution. Interestingly, we also discovered preferential diversification in the structure and expression of root hair development genes, relative to other root hair- and root-expressed genes, among these species. These differences enabled the definition of sets of genes and gene functions that were acquired or lost in specific lineages during vascular plant evolution. In particular, we found substantial divergence in the structure and expression of genes used for root hair patterning, suggesting that the Arabidopsis transcriptional regulatory mechanism is not shared by other species. To our knowledge, this study provides the first comprehensive view of gene expression in a single plant cell type across multiple species. PMID:28487476
Functional and evolutionary insights from the Ciona notochord transcriptome.
Reeves, Wendy M; Wu, Yuye; Harder, Matthew J; Veeman, Michael T
2017-09-15
The notochord of the ascidian Ciona consists of only 40 cells, and is a longstanding model for studying organogenesis in a small, simple embryo. Here, we perform RNAseq on flow-sorted notochord cells from multiple stages to define a comprehensive Ciona notochord transcriptome. We identify 1364 genes with enriched expression and extensively validate the results by in situ hybridization. These genes are highly enriched for Gene Ontology terms related to the extracellular matrix, cell adhesion and cytoskeleton. Orthologs of 112 of the Ciona notochord genes have known notochord expression in vertebrates, more than twice as many as predicted by chance alone. This set of putative effector genes with notochord expression conserved from tunicates to vertebrates will be invaluable for testing hypotheses about notochord evolution. The full set of Ciona notochord genes provides a foundation for systems-level studies of notochord gene regulation and morphogenesis. We find only modest overlap between this set of notochord-enriched transcripts and the genes upregulated by ectopic expression of the key notochord transcription factor Brachyury, indicating that Brachyury is not a notochord master regulator gene as strictly defined. © 2017. Published by The Company of Biologists Ltd.
Wei, Ling; Yang, Chao; Tao, Wenjing; Wang, Deshou
2016-01-01
The Sox transcription factor family is characterized by the presence of a Sry-related high-mobility group (HMG) box and plays important roles in various biological processes in animals, including sex determination and differentiation, and the development of multiple organs. In this study, 27 Sox genes were identified in the genome of the Nile tilapia (Oreochromis niloticus), and were classified into seven groups. The members of each group of the tilapia Sox genes exhibited a relatively conserved exon-intron structure. Comparative analysis showed that the Sox gene family has undergone an expansion in tilapia and other teleost fishes following their whole genome duplication, and group K only exists in teleosts. Transcriptome-based analysis demonstrated that most of the tilapia Sox genes presented stage-specific and/or sex-dimorphic expressions during gonadal development, and six of the group B Sox genes were specifically expressed in the adult brain. Our results provide a better understanding of gene structure and spatio-temporal expression of the Sox gene family in tilapia, and will be useful for further deciphering the roles of the Sox genes during sex determination and gonadal development in teleosts. PMID:26907269
Wei, Ling; Yang, Chao; Tao, Wenjing; Wang, Deshou
2016-02-23
The Sox transcription factor family is characterized by the presence of a Sry-related high-mobility group (HMG) box and plays important roles in various biological processes in animals, including sex determination and differentiation, and the development of multiple organs. In this study, 27 Sox genes were identified in the genome of the Nile tilapia (Oreochromis niloticus), and were classified into seven groups. The members of each group of the tilapia Sox genes exhibited a relatively conserved exon-intron structure. Comparative analysis showed that the Sox gene family has undergone an expansion in tilapia and other teleost fishes following their whole genome duplication, and group K only exists in teleosts. Transcriptome-based analysis demonstrated that most of the tilapia Sox genes presented stage-specific and/or sex-dimorphic expressions during gonadal development, and six of the group B Sox genes were specifically expressed in the adult brain. Our results provide a better understanding of gene structure and spatio-temporal expression of the Sox gene family in tilapia, and will be useful for further deciphering the roles of the Sox genes during sex determination and gonadal development in teleosts.
Predicting Protein Function by Genomic Context: Quantitative Evaluation and Qualitative Inferences
Huynen, Martijn; Snel, Berend; Lathe, Warren; Bork, Peer
2000-01-01
Various new methods have been proposed to predict functional interactions between proteins based on the genomic context of their genes. The types of genomic context that they use are Type I: the fusion of genes; Type II: the conservation of gene-order or co-occurrence of genes in potential operons; and Type III: the co-occurrence of genes across genomes (phylogenetic profiles). Here we compare these types for their coverage, their correlations with various types of functional interaction, and their overlap with homology-based function assignment. We apply the methods to Mycoplasma genitalium, the standard benchmarking genome in computational and experimental genomics. Quantitatively, conservation of gene order is the technique with the highest coverage, applying to 37% of the genes. By combining gene order conservation with gene fusion (6%), the co-occurrence of genes in operons in absence of gene order conservation (8%), and the co-occurrence of genes across genomes (11%), significant context information can be obtained for 50% of the genes (the categories overlap). Qualitatively, we observe that the functional interactions between genes are stronger as the requirements for physical neighborhood on the genome are more stringent, while the fraction of potential false positives decreases. Moreover, only in cases in which gene order is conserved in a substantial fraction of the genomes, in this case six out of twenty-five, does a single type of functional interaction (physical interaction) clearly dominate (>80%). In other cases, complementary function information from homology searches, which is available for most of the genes with significant genomic context, is essential to predict the type of interaction. Using a combination of genomic context and homology searches, new functional features can be predicted for 10% of M. genitalium genes. PMID:10958638
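The Type III context described above (co-occurrence of genes across genomes) can be illustrated with presence/absence profiles. This is a minimal sketch only: the gene names, genome identifiers, Jaccard similarity measure and 0.8 threshold are all assumptions chosen for illustration, not the scoring used in the study.

```python
def profile_similarity(profile_a, profile_b):
    """Jaccard similarity between two sets of genome ids in which a gene occurs."""
    union = profile_a | profile_b
    if not union:
        return 0.0
    return len(profile_a & profile_b) / len(union)


def co_occurring_pairs(profiles, threshold=0.8):
    """Return gene pairs whose phylogenetic profiles are at least `threshold` similar."""
    names = sorted(profiles)
    pairs = []
    for i, gene_a in enumerate(names):
        for gene_b in names[i + 1:]:
            if profile_similarity(profiles[gene_a], profiles[gene_b]) >= threshold:
                pairs.append((gene_a, gene_b))
    return pairs


# Hypothetical profiles: each gene maps to the genomes in which it is present.
profiles = {
    "geneA": {"g1", "g2", "g3", "g5"},
    "geneB": {"g1", "g2", "g3", "g5"},
    "geneC": {"g2", "g4"},
}
linked = co_occurring_pairs(profiles)
print(linked)
```

Pairs returned this way would only be candidates for a functional link. As the abstract notes, the type of interaction usually has to be inferred with additional homology information.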
SynFind: Compiling Syntenic Regions across Any Set of Genomes on Demand.
Tang, Haibao; Bomhoff, Matthew D; Briones, Evan; Zhang, Liangsheng; Schnable, James C; Lyons, Eric
2015-11-11
The identification of conserved syntenic regions enables discovery of predicted locations for orthologous and homeologous genes, even when no such gene is present. This capability means that synteny-based methods are far more effective than sequence similarity-based methods in identifying true-negatives, a necessity for studying gene loss and gene transposition. However, the identification of syntenic regions requires complex analyses which must be repeated for pairwise comparisons between any two species. Therefore, as the number of published genomes increases, there is a growing demand for scalable, simple-to-use applications to perform comparative genomic analyses that cater to both gene family studies and genome-scale studies. We implemented SynFind, a web-based tool that addresses this need. Given one query genome, SynFind is capable of identifying conserved syntenic regions in any set of target genomes. SynFind is capable of reporting per-gene information, useful for researchers studying specific gene families, as well as genome-wide data sets of syntenic gene and predicted gene locations, critical for researchers focused on large-scale genomic analyses. Inference of syntenic homologs provides the basis for correlation of functional changes around genes of interest between related organisms. Deployed on the CoGe online platform, SynFind is connected to the genomic data from over 15,000 organisms from all domains of life as well as supporting multiple releases of the same organism. SynFind makes use of a powerful job execution framework that promises scalability and reproducibility. SynFind can be accessed at http://genomevolution.org/CoGe/SynFind.pl. A video tutorial of SynFind using Phytophthora as an example is available at http://www.youtube.com/watch?v=2Agczny9Nyc. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Programmed Cell Death During Caenorhabditis elegans Development
Conradt, Barbara; Wu, Yi-Chun; Xue, Ding
2016-01-01
Programmed cell death is an integral component of Caenorhabditis elegans development. Genetic and reverse genetic studies in C. elegans have led to the identification of many genes and conserved cell death pathways that are important for the specification of which cells should live or die, the activation of the suicide program, and the dismantling and removal of dying cells. Molecular, cell biological, and biochemical studies have revealed the underlying mechanisms that control these three phases of programmed cell death. In particular, the interplay of transcriptional regulatory cascades and networks involving multiple transcriptional regulators is crucial in activating the expression of the key death-inducing gene egl-1 and, in some cases, the ced-3 gene in cells destined to die. A protein interaction cascade involving EGL-1, CED-9, CED-4, and CED-3 results in the activation of the key cell death protease CED-3, which is tightly controlled by multiple positive and negative regulators. The activation of the CED-3 caspase then initiates the cell disassembly process by cleaving and activating or inactivating crucial CED-3 substrates; leading to activation of multiple cell death execution events, including nuclear DNA fragmentation, mitochondrial elimination, phosphatidylserine externalization, inactivation of survival signals, and clearance of apoptotic cells. Further studies of programmed cell death in C. elegans will continue to advance our understanding of how programmed cell death is regulated, activated, and executed in general. PMID:27516615
Parks, Sean A; McKelvey, Kevin S; Schwartz, Michael K
2013-02-01
The importance of movement corridors for maintaining connectivity within metapopulations of wild animals is a cornerstone of conservation. One common approach for determining corridor locations is least-cost corridor (LCC) modeling, which uses algorithms within a geographic information system to search for routes with the lowest cumulative resistance between target locations on a landscape. However, the presentation of multiple LCCs that connect multiple locations generally assumes all corridors contribute equally to connectivity, regardless of the likelihood that animals will use them. Thus, LCCs may overemphasize seldom-used longer routes and underemphasize more frequently used shorter routes. We hypothesize that, depending on conservation objectives and available biological information, weighting individual corridors on the basis of species-specific movement, dispersal, or gene flow data may better identify effective corridors. We tested whether locations of key connectivity areas, defined as the highest 75th and 90th percentile cumulative weighted value of approximately 155,000 corridors, shift under different weighting scenarios. In addition, we quantified the amount and location of private land that intersect key connectivity areas under each weighting scheme. Some areas that appeared well connected when analyzed with unweighted corridors exhibited much less connectivity compared with weighting schemes that discount corridors with large effective distances. Furthermore, the amount and location of key connectivity areas that intersected private land varied among weighting schemes. We believe biological assumptions and conservation objectives should be explicitly incorporated to weight corridors when assessing landscape connectivity. 
These results are highly relevant to conservation planning because on the basis of recent interest by government agencies and nongovernmental organizations in maintaining and enhancing wildlife corridors, connectivity will likely be an important criterion for prioritization of land purchases and swaps. ©2012 Society for Conservation Biology.
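The corridor search and weighting described in this abstract can be illustrated with a minimal sketch. It assumes a small raster of resistance values, a 4-connected Dijkstra search, and an exponential discount on effective distance; the abstract does not specify the algorithm or the weighting function, so `least_cost`, `corridor_weight`, the `scale` parameter, and the example grid are all hypothetical.

```python
import heapq
import math


def least_cost(resistance, start, goal):
    """Lowest cumulative resistance of a 4-connected path from start to goal.

    Assumed convention: a path's cost is the sum of the resistance of every
    cell it visits, including the start and goal cells.
    """
    rows, cols = len(resistance), len(resistance[0])
    best = {start: resistance[start[0]][start[1]]}
    queue = [(best[start], start)]
    while queue:
        cost, (r, c) = heapq.heappop(queue)
        if (r, c) == goal:
            return cost
        if cost > best[(r, c)]:
            continue  # stale queue entry, a cheaper route was already found
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            if 0 <= nr < rows and 0 <= nc < cols:
                new_cost = cost + resistance[nr][nc]
                if new_cost < best.get((nr, nc), float("inf")):
                    best[(nr, nc)] = new_cost
                    heapq.heappush(queue, (new_cost, (nr, nc)))
    return float("inf")


def corridor_weight(effective_distance, scale):
    """Hypothetical weighting: discount corridors with large effective distance."""
    return math.exp(-effective_distance / scale)


# Hypothetical landscape: the middle row has a high-resistance barrier.
grid = [
    [1, 1, 1],
    [9, 9, 1],
    [1, 1, 1],
]
cost = least_cost(grid, (0, 0), (2, 0))
weight = corridor_weight(cost, scale=10.0)
```

Summing `weight` over every corridor that crosses a cell, instead of counting each corridor equally, is one way the weighted connectivity surfaces compared in the study could be approximated.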
Proudhon, D; Wei, J; Briat, J; Theil, E C
1996-03-01
Ferritin, a protein widespread in nature, concentrates iron approximately 10^11- to 10^12-fold above the solubility within a spherical shell of 24 subunits; it derives in plants and animals from a common ancestor (based on sequence) but displays a cytoplasmic location in animals compared to the plastid in contemporary plants. Ferritin gene regulation in plants and animals is altered by development, hormones, and excess iron; iron signals target DNA in plants but mRNA in animals. Evolution has thus conserved the two end points of ferritin gene expression, the physiological signals and the protein structure, while allowing some divergence of the genetic mechanisms. Comparison of ferritin gene organization in plants and animals, made possible by the cloning of a dicot (soybean) ferritin gene presented here and the recent cloning of two monocot (maize) ferritin genes, shows evolutionary divergence in ferritin gene organization between plants and animals but conservation among plants or among animals; divergence in the genetic mechanism for iron regulation is reflected by the absence in all three plant genes of the IRE, a highly conserved, noncoding sequence in vertebrate animal ferritin mRNA. First, in plant ferritin genes, the number of introns (n = 7) is higher than in animals (n = 3). Second, no intron positions are conserved when ferritin genes of plants and animals are compared, although all ferritin gene introns are in the coding region; within kingdoms, the intron positions in ferritin genes are conserved. Finally, secondary protein structure has no apparent relationship to intron/exon boundaries in plant ferritin genes, whereas in animal ferritin genes the correspondence is high. 
The structural differences in introns/exons among phylogenetically related ferritin coding sequences and the high conservation of the gene structure within plant or animal kingdoms suggest that kingdom-specific functional constraints may exist to maintain a particular intron/exon pattern within ferritin genes. In the case of plants, where ferritin gene intron placement is unrelated to triplet codons or protein structure, and where ferritin is targeted to the plastid, the selection pressure on gene organization may relate to RNA function and plastid/nuclear signaling.
Conserved noncoding sequences (CNSs) in higher plants.
Freeling, Michael; Subramaniam, Shabarinath
2009-04-01
Plant conserved noncoding sequences (CNSs)--a specific category of phylogenetic footprint--have been shown experimentally to function. No plant CNS is conserved to the extent that ultraconserved noncoding sequences are conserved in vertebrates. Plant CNSs are enriched in known transcription factor or other cis-acting binding sites, and are usually clustered around genes. Genes that encode transcription factors and/or those that respond to stimuli are particularly CNS-rich. Only rarely could this function involve small RNA binding. Some transcribed CNSs encode short translation products as a form of negative control. Approximately 4% of Arabidopsis gene content is estimated to be both CNS-rich and to occupy a relatively long stretch of chromosome: Bigfoot genes (long phylogenetic footprints). We discuss a 'DNA-templated protein assembly' idea that might help explain Bigfoot gene CNSs.
Gaji, Rajshekhar Y; Howe, Daniel K
2009-07-01
The apicomplexan parasite Sarcocystis neurona undergoes a complex process of intracellular development, during which many genes are temporally regulated. The described study was undertaken to begin identifying the basic promoter elements that control gene expression in S. neurona. Sequence analysis of the 5'-flanking region of five S. neurona genes revealed a conserved heptanucleotide motif GAGACGC that is similar to the WGAGACG motif described upstream of multiple genes in Toxoplasma gondii. The promoter region for the major surface antigen gene SnSAG1, which contains three heptanucleotide motifs within 135 bases of the transcription start site, was dissected by functional analysis using a dual luciferase reporter assay. These analyses revealed that a minimal promoter fragment containing all three motifs was sufficient to drive reporter molecule expression, with the presence and orientation of the 5'-most heptanucleotide motif being absolutely critical for promoter function. Further studies should help to identify additional sequence elements important for promoter function and for controlling gene expression during intracellular development by this apicomplexan pathogen.
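Because the abstract stresses both the presence and the orientation of the heptanucleotide motif, a strand-aware scan is a natural first step. The following is a minimal sketch that reports GAGACGC on either strand; `find_motif` and the example sequence are hypothetical and are not taken from the study.

```python
def reverse_complement(seq):
    """Reverse complement of an uppercase DNA string."""
    pairs = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(pairs[base] for base in reversed(seq))


def find_motif(sequence, motif="GAGACGC"):
    """Return sorted (position, strand) pairs for every motif occurrence.

    Positions are 0-based offsets on the forward strand; '-' marks a match
    to the reverse complement of the motif.
    """
    seq = sequence.upper()
    hits = []
    for strand, pattern in (("+", motif), ("-", reverse_complement(motif))):
        pos = seq.find(pattern)
        while pos != -1:
            hits.append((pos, strand))
            pos = seq.find(pattern, pos + 1)
    return sorted(hits)


# Hypothetical 5'-flanking fragment with one motif in each orientation.
flank = "TTGAGACGCAAGCGTCTCTT"
hits = find_motif(flank)
```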
Zheng, Xiaomei; Zheng, Ping; Zhang, Kun; Cairns, Timothy C; Meyer, Vera; Sun, Jibin; Ma, Yanhe
2018-04-30
The CRISPR/Cas9 system is a revolutionary genome editing tool. However, in eukaryotes, search and optimization of a suitable promoter for guide RNA expression is a significant technical challenge. Here we used the industrially important fungus, Aspergillus niger, to demonstrate that the 5S rRNA gene, which is both highly conserved and efficiently expressed in eukaryotes, can be used as a guide RNA promoter. The gene editing system was established with 100% rates of precision gene modifications among dozens of transformants using short (40-bp) homologous donor DNA. This system was also applicable for generation of designer chromosomes, as evidenced by deletion of a 48 kb gene cluster required for biosynthesis of the mycotoxin fumonisin B1. Moreover, this system also facilitated simultaneous mutagenesis of multiple genes in A. niger. We anticipate that the use of the 5S rRNA gene as guide RNA promoter can broadly be applied for engineering highly efficient eukaryotic CRISPR/Cas9 toolkits. Additionally, the system reported here will enable development of designer chromosomes in model and industrially important fungi.
APPRIS 2017: principal isoforms for multiple gene sets
Rodriguez-Rivas, Juan; Di Domenico, Tomás; Vázquez, Jesús; Valencia, Alfonso
2018-01-01
The APPRIS database (http://appris-tools.org) uses protein structural and functional features and information from cross-species conservation to annotate splice isoforms in protein-coding genes. APPRIS selects a single protein isoform, the ‘principal’ isoform, as the reference for each gene based on these annotations. A single main splice isoform reflects the biological reality for most protein-coding genes and APPRIS principal isoforms are the best predictors of these main protein isoforms. Here, we present the updates to the database, new developments that include the addition of three new species (chimpanzee, Drosophila melanogaster and Caenorhabditis elegans), the expansion of APPRIS to cover the RefSeq gene set and the UniProtKB proteome for six species and refinements in the core methods that make up the annotation pipeline. In addition, APPRIS now provides a measure of reliability for individual principal isoforms and updates with each release of the GENCODE/Ensembl and RefSeq reference sets. The individual GENCODE/Ensembl, RefSeq and UniProtKB reference gene sets for six organisms have been merged to produce common sets of splice variants. PMID:29069475
Genetic analysis of Ikaros target genes and tumor suppressor function in BCR-ABL1+ pre–B ALL
Aghajanirefah, Ali; McLaughlin, Jami; Cheng, Donghui; Geng, Huimin; Eggesbø, Linn M.; Smale, Stephen T.; Müschen, Markus
2017-01-01
Inactivation of the tumor suppressor gene encoding the transcriptional regulator Ikaros (IKZF1) is a hallmark of BCR-ABL1+ precursor B cell acute lymphoblastic leukemia (pre–B ALL). However, the mechanisms by which Ikaros functions as a tumor suppressor in pre–B ALL remain poorly understood. Here, we analyzed a mouse model of BCR-ABL1+ pre–B ALL together with a new model of inducible expression of wild-type Ikaros in IKZF1 mutant human BCR-ABL1+ pre–B ALL. We performed integrated genome-wide chromatin and expression analyses and identified Ikaros target genes in mouse and human BCR-ABL1+ pre–B ALL, revealing novel conserved gene pathways associated with Ikaros tumor suppressor function. Notably, genetic depletion of different Ikaros targets, including CTNND1 and the early hematopoietic cell surface marker CD34, resulted in reduced leukemic growth. Our results suggest that Ikaros mediates tumor suppressor function by enforcing proper developmental stage–specific expression of multiple genes through chromatin compaction at its target genes. PMID:28190001
Patarca, R; Dorta, B; Ramirez, J L
1982-01-01
As part of a project pertaining to the organization of ribosomal genes in Kinetoplastidae, we have created a data base for published sequences of ribosomal nucleic acids, with information in Spanish. As a first step in their processing, we have written a computer program which introduces the new feature of determining the length of the fragments produced after single or multiple digestion with any of the known restriction enzymes. With this information we have detected conserved SAU 3A sites: (i) at the 5' end of the 5.8S rRNA and at the 3' end of the small subunit rRNA, both included in similar larger sequences; (ii) in the 5.8S rRNA of vertebrates (a second one), which is not present in lower eukaryotes, showing a clear evolutionary divergence; and, (iii) at the 5' terminus of the small subunit rRNA, included in a larger conserved sequence. The possible biological importance of these sequences is discussed. PMID:6278402
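The fragment-length calculation described above can be sketched in a few lines. This is not the authors' program: `fragment_lengths`, the enzyme table, and the test sequence are hypothetical, and the sketch assumes a linear molecule and top-strand cut positions (Sau3AI cuts before the G of GATC, EcoRI after the G of GAATTC).

```python
# Recognition sequence and cut offset (bases from the start of the site).
ENZYMES = {
    "Sau3AI": ("GATC", 0),
    "EcoRI": ("GAATTC", 1),
}


def fragment_lengths(sequence, enzyme_names):
    """Fragment lengths after digesting a linear sequence with the given enzymes.

    Passing one name models a single digestion; passing several models a
    multiple digestion, because all cut positions are pooled before splitting.
    """
    seq = sequence.upper()
    cuts = set()
    for name in enzyme_names:
        site, offset = ENZYMES[name]
        pos = seq.find(site)
        while pos != -1:
            cuts.add(pos + offset)
            pos = seq.find(site, pos + 1)
    # Cuts at the very ends of the molecule do not create a new fragment.
    bounds = [0] + sorted(c for c in cuts if 0 < c < len(seq)) + [len(seq)]
    return [end - start for start, end in zip(bounds, bounds[1:])]


# Hypothetical 20-base sequence with two Sau3AI sites and one EcoRI site.
dna = "AAGATCTTGAATTCGATCCC"
single = fragment_lengths(dna, ["Sau3AI"])
double = fragment_lengths(dna, ["Sau3AI", "EcoRI"])
```

Comparing the cut positions found in aligned rRNA sequences from different species is then a matter of checking which positions recur, which is how conserved sites of the kind reported here could be flagged.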
A Network of Genes Antagonistic to the LIN-35 Retinoblastoma Protein of Caenorhabditis elegans
Polley, Stanley R. G.; Fay, David S.
2012-01-01
The Caenorhabditis elegans pRb ortholog, LIN-35, functions in a wide range of cellular and developmental processes. This includes a role of LIN-35 in nutrient utilization by the intestine, which it carries out redundantly with SLR-2, a zinc-finger protein. This and other redundant functions of LIN-35 were identified in genetic screens for mutations that display synthetic phenotypes in conjunction with loss of lin-35. To explore the intestinal role of LIN-35, we conducted a genome-wide RNA-interference-feeding screen for suppressors of lin-35; slr-2 early larval arrest. Of the 26 suppressors identified, 17 fall into three functional classes: (1) ribosome biogenesis genes, (2) mitochondrial prohibitins, and (3) chromatin regulators. Further characterization indicates that different categories of suppressors act through distinct molecular mechanisms. We also tested lin-35; slr-2 suppressors, as well as suppressors of the synthetic multivulval phenotype, to determine the spectrum of lin-35-synthetic phenotypes that could be suppressed following inhibition of these genes. We identified 19 genes, most of which are evolutionarily conserved, that can suppress multiple unrelated lin-35-synthetic phenotypes. Our study reveals a network of genes broadly antagonistic to LIN-35 as well as genes specific to the role of LIN-35 in intestinal and vulval development. Suppressors of multiple lin-35 phenotypes may be candidate targets for anticancer therapies. Moreover, screening for suppressors of phenotypically distinct synthetic interactions, which share a common altered gene, may prove to be a novel and effective approach for identifying genes whose activities are most directly relevant to the core functions of the shared gene. PMID:22542970
Hu, Wei; Xia, Zhiqiang; Yan, Yan; Ding, Zehong; Tie, Weiwei; Wang, Lianzhe; Zou, Meiling; Wei, Yunxie; Lu, Cheng; Hou, Xiaowan; Wang, Wenquan; Peng, Ming
2015-01-01
Cassava is an important food and potential biofuel crop that is tolerant to multiple abiotic stressors. The mechanisms underlying these tolerances are currently poorly understood. CBL-interacting protein kinases (CIPKs) have been shown to play crucial roles in plant developmental processes, hormone signal transduction, and in the response to abiotic stress. However, no data is currently available about the CIPK family in cassava. In this study, a total of 25 CIPK genes were identified from the cassava genome based on our previous genome sequencing data. Phylogenetic analysis suggested that 25 MeCIPKs could be classified into four subfamilies, which was supported by exon-intron organizations and the architectures of conserved protein motifs. Transcriptomic analysis of a wild subspecies and two cultivated varieties showed that most MeCIPKs had different expression patterns between wild subspecies and cultivars in different tissues or in response to drought stress. Some orthologous genes involved in CIPK interaction networks were identified between Arabidopsis and cassava. The interaction networks and co-expression patterns of these orthologous genes revealed that the crucial pathways controlled by CIPK networks may be involved in the differential response to drought stress in different accessions of cassava. Nine MeCIPK genes were selected to investigate their transcriptional response to various stimuli and the results showed the comprehensive response of the tested MeCIPK genes to osmotic, salt, cold, oxidative stressors, and ABA signaling. The identification and expression analysis of the CIPK family suggested that CIPK genes are important components of development and multiple signal transduction pathways in cassava. The findings of this study will help lay a foundation for the functional characterization of the CIPK gene family and provide an improved understanding of abiotic stress responses and signal transduction in cassava. PMID:26579161
Luo, Xingguang; Zuo, Lingjun; Kranzler, Henry; Zhang, Huiping; Wang, Shuang; Gelernter, Joel
2011-01-01
Background Personality traits are among the most complex quantitative traits. Certain personality traits are associated with substance dependence (SD); genetic factors may influence both. Associations between opioid receptor (OPR) genes and SD have been reported. This study investigated the relationship between OPR genes and personality traits in a case-control sample. Methods We assessed dimensions of the five-factor model of personality in 556 subjects: 250 with SD [181 European-Americans (EAs) and 69 African-Americans (AAs)] and 306 healthy subjects (266 EAs and 40 AAs). We genotyped 20 OPRM1 markers, 8 OPRD1 markers, and 7 OPRK1 markers, and 38 unlinked ancestry-informative markers in these subjects. The relationships between OPR genes and personality traits were examined using MANCOVA, controlling for gene-gene interaction effects and potential confounders. Associations were decomposed by Roy-Bargmann Stepdown ANCOVA. Results Personality traits were associated as main or interaction effects with the haplotypes, diplotypes, alleles and genotypes at the three OPR genes (0.002
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tempel, W.; Wu, H.; Dombrovsky, L.
2010-08-17
A recent survey of protein expression patterns in patients with Alzheimer's disease (AD) has identified ece2 (chromosome: 3; Locations: 3q27.1) as the most significantly downregulated gene within the tested group. ece2 encodes endothelin-converting enzyme ECE2, a metalloprotease with a role in neuropeptide processing. Deficiency in the highly homologous ECE1 has earlier been linked to increased levels of AD-related β-amyloid peptide in mice, consistent with a role for ECE in the degradation of that peptide. Initially, ECE2 was presumed to resemble ECE1, in that it comprises a single transmembrane region of ~20 residues flanked by a small amino-terminal cytosolic segment and a carboxy-terminal lumenar peptidase domain. The carboxy-terminal domain has significant sequence similarity to both neutral endopeptidase, for which an X-ray structure has been determined, and Kell blood group protein. After their initial discovery, multiple isoforms of ECE1 and ECE2 were discovered, generated by alternative splicing of multiple exons. The originally described ece2 transcript, RefSeq NM_174046, contains the amino-terminal cytosolic portion followed by the transmembrane region and peptidase domain (Fig. 1, isoform B). Another ece2 transcript, available from the Mammalian Gene Collection under MGC2408 (Fig. 1, isoform C), RefSeq accession NM_032331, is predicted to be translated into a 255 residue peptide with low but detectable sequence similarity to known S-adenosyl-L-methionine (SAM)-dependent methyltransferases (SAM-MTs), such as the hypothetical protein TT1324 from Thermus thermophilus, PDB code 2GS9, which shares 30% amino acid sequence identity with ECE2 over 138 residues of the sequence. Intriguingly, another 'elongated' ece2 transcript (Fig. 1, isoform A) (RefSeq NM_014693) contains an amino-terminal portion of the putative SAM-MT domain, the transmembrane domain, and the protease domain. 
This suggests the possibility for coexistence of the putative SAM-MT and protease domains in a single polypeptide and their transmembrane interplay. Although sequence conservation across the SAM-MT family is weak, the structural fold is highly conserved. The most conserved part of this fold is the SAM-binding subdomain, which is shared between MGC2408 and hypothetical protein TT1324. Typically, the SAM-binding subdomain is flanked by a variable N-terminal extension and, at the C-terminus, by a substrate-binding subdomain, which varies enormously in size but preserves a conserved topology with three antiparallel β-strands. The 'elongated' transcript of ece2 lacks this substrate-binding subdomain. To test the hypothesis that the 255 residue ece2 gene product MGC2408 represents a complete SAM-MT fold, we have determined a crystal structure of this protein in the presence of SAH.
Lagares, Antonio; Ceizel Borella, Germán; Linne, Uwe; Becker, Anke; Valverde, Claudio
2017-04-15
Riboregulation has a major role in the fine-tuning of multiple bacterial processes. Among the RNA players, trans-encoded untranslated small RNAs (sRNAs) regulate complex metabolic networks by tuning expression from multiple target genes in response to numerous signals. In Sinorhizobium meliloti, over 400 sRNAs are expressed under different stimuli. The sRNA MmgR (standing for Makes more granules Regulator) has been of particular interest to us since its sequence and structure are highly conserved among the alphaproteobacteria and its expression is regulated by the amount and quality of the bacterium's available nitrogen source. In this work, we explored the biological role of MmgR in S. meliloti 2011 by characterizing the effect of a deletion of the internal conserved core of mmgR (mmgRΔ33-51). This mutation resulted in larger amounts of polyhydroxybutyrate (PHB) distributed into more intracellular granules than are found in the wild-type strain. This phenotype was expressed upon cessation of balanced growth owing to nitrogen depletion in the presence of surplus carbon (i.e., at a carbon/nitrogen molar ratio greater than 10). The normal PHB accumulation was complemented with a wild-type mmgR copy but not with unrelated sRNA genes. Furthermore, the expression of mmgR limited PHB accumulation in the wild type, regardless of the magnitude of the C surplus. Quantitative proteomic profiling and quantitative reverse transcription-PCR (qRT-PCR) revealed that the absence of MmgR results in a posttranscriptional overexpression of both PHB phasin proteins (PhaP1 and PhaP2). Together, our results indicate that the widely conserved alphaproteobacterial MmgR sRNA fine-tunes the regulation of PHB storage in S. meliloti. IMPORTANCE High-throughput RNA sequencing has recently uncovered an overwhelming number of trans-encoded small RNAs (sRNAs) in diverse prokaryotes. 
In the nitrogen-fixing alphaproteobacterial symbiont of alfalfa root nodules Sinorhizobium meliloti, only four out of hundreds of identified sRNA genes have been functionally characterized. Thus, uncovering the biological role of sRNAs currently represents a major issue and one that is particularly challenging because of the usually subtle quantitative regulation contributed by most characterized sRNAs. Here, we have characterized the function of the broadly conserved alphaproteobacterial sRNA gene mmgR in S. meliloti. Our results strongly suggest that mmgR encodes a negative regulator of the accumulation of polyhydroxybutyrate, the major carbon and reducing power storage polymer in S. meliloti cells growing under conditions of C/N overbalance. Copyright © 2017 American Society for Microbiology.
Chaillou, Thomas; Jackson, Janna R; England, Jonathan H; Kirby, Tyler J; Richards-White, Jena; Esser, Karyn A; Dupont-Versteegden, Esther E; McCarthy, John J
2015-01-01
The purpose of this study was to compare the gene expression profile of mouse skeletal muscle undergoing two forms of growth (hypertrophy and regrowth) with the goal of identifying a conserved set of differentially expressed genes. Expression profiling by microarray was performed on the plantaris muscle subjected to 1, 3, 5, 7, 10, and 14 days of hypertrophy or regrowth following 2 wk of hind-limb suspension. We identified 97 differentially expressed genes (≥2-fold increase or ≥50% decrease compared with control muscle) that were conserved during the two forms of muscle growth. The vast majority (∼90%) of the differentially expressed genes was upregulated and occurred at a single time point (64 out of 86 genes), which most often was on the first day of the time course. Microarray analysis from the conserved upregulated genes showed a set of genes related to contractile apparatus and stress response at day 1, including three genes involved in mechanotransduction and four genes encoding heat shock proteins. Our analysis further identified three cell cycle-related genes at day and several genes associated with extracellular matrix (ECM) at both days 3 and 10. In conclusion, we have identified a core set of genes commonly upregulated in two forms of muscle growth that could play a role in the maintenance of sarcomere stability, ECM remodeling, cell proliferation, fast-to-slow fiber type transition, and the regulation of skeletal muscle growth. These findings suggest conserved regulatory mechanisms involved in the adaptation of skeletal muscle to increased mechanical loading. Copyright © 2015 the American Physiological Society.
Chaillou, Thomas; Jackson, Janna R.; England, Jonathan H.; Kirby, Tyler J.; Richards-White, Jena; Esser, Karyn A.; Dupont-Versteegden, Esther E.
2014-01-01
The purpose of this study was to compare the gene expression profile of mouse skeletal muscle undergoing two forms of growth (hypertrophy and regrowth) with the goal of identifying a conserved set of differentially expressed genes. Expression profiling by microarray was performed on the plantaris muscle subjected to 1, 3, 5, 7, 10, and 14 days of hypertrophy or regrowth following 2 wk of hind-limb suspension. We identified 97 differentially expressed genes (≥2-fold increase or ≥50% decrease compared with control muscle) that were conserved during the two forms of muscle growth. The vast majority (∼90%) of the differentially expressed genes was upregulated and occurred at a single time point (64 out of 86 genes), which most often was on the first day of the time course. Microarray analysis from the conserved upregulated genes showed a set of genes related to contractile apparatus and stress response at day 1, including three genes involved in mechanotransduction and four genes encoding heat shock proteins. Our analysis further identified three cell cycle-related genes at day and several genes associated with extracellular matrix (ECM) at both days 3 and 10. In conclusion, we have identified a core set of genes commonly upregulated in two forms of muscle growth that could play a role in the maintenance of sarcomere stability, ECM remodeling, cell proliferation, fast-to-slow fiber type transition, and the regulation of skeletal muscle growth. These findings suggest conserved regulatory mechanisms involved in the adaptation of skeletal muscle to increased mechanical loading. PMID:25554798
2012-01-01
Background Metallothioneins (MT) are low molecular weight, cysteine rich metal binding proteins, found across genera and species, but their function(s) in abiotic stress tolerance are not well documented. Results We have characterized a rice MT gene, OsMT1e-P, isolated from a subtractive library generated from a stressed salinity tolerant rice genotype, Pokkali. Bioinformatics analysis of the rice genome sequence revealed that this gene belongs to a multigenic family, which consists of 13 genes with 15 protein products. OsMT1e-P is located on chromosome XI, away from the majority of other type I genes that are clustered on chromosome XII. Various members of this MT gene cluster showed a tight co-regulation pattern under several abiotic stresses. Sequence analysis revealed the presence of conserved cysteine residues in OsMT1e-P protein. Salinity stress was found to regulate the transcript abundance of OsMT1e-P in a developmental and organ specific manner. Using a transgenic approach, we found a positive correlation between ectopic expression of OsMT1e-P and stress tolerance. Our experiments further suggest ROS scavenging to be the possible mechanism for multiple stress tolerance conferred by OsMT1e-P. Conclusion We present an overview of MTs, describing their gene structure, genome localization and expression patterns under salinity and development in rice. We have found that ectopic expression of OsMT1e-P enhances tolerance towards multiple abiotic stresses in transgenic tobacco and the resultant plants could survive and set viable seeds under saline conditions. Taken together, the experiments presented here have indicated that ectopic expression of OsMT1e-P protects against oxidative stress primarily through efficient scavenging of reactive oxygen species. PMID:22780875
Kumar, Gautam; Kushwaha, Hemant Ritturaj; Panjabi-Sabharwal, Vaishali; Kumari, Sumita; Joshi, Rohit; Karan, Ratna; Mittal, Shweta; Pareek, Sneh L Singla; Pareek, Ashwani
2012-07-10
Metallothioneins (MT) are low molecular weight, cysteine rich metal binding proteins, found across genera and species, but their function(s) in abiotic stress tolerance are not well documented. We have characterized a rice MT gene, OsMT1e-P, isolated from a subtractive library generated from a stressed salinity tolerant rice genotype, Pokkali. Bioinformatics analysis of the rice genome sequence revealed that this gene belongs to a multigenic family, which consists of 13 genes with 15 protein products. OsMT1e-P is located on chromosome XI, away from the majority of other type I genes that are clustered on chromosome XII. Various members of this MT gene cluster showed a tight co-regulation pattern under several abiotic stresses. Sequence analysis revealed the presence of conserved cysteine residues in OsMT1e-P protein. Salinity stress was found to regulate the transcript abundance of OsMT1e-P in a developmental and organ specific manner. Using a transgenic approach, we found a positive correlation between ectopic expression of OsMT1e-P and stress tolerance. Our experiments further suggest ROS scavenging to be the possible mechanism for multiple stress tolerance conferred by OsMT1e-P. We present an overview of MTs, describing their gene structure, genome localization and expression patterns under salinity and development in rice. We have found that ectopic expression of OsMT1e-P enhances tolerance towards multiple abiotic stresses in transgenic tobacco and the resultant plants could survive and set viable seeds under saline conditions. Taken together, the experiments presented here have indicated that ectopic expression of OsMT1e-P protects against oxidative stress primarily through efficient scavenging of reactive oxygen species.
Haga, Nozomi; Kobayashi, Kosuke; Suzuki, Takamasa; Maeo, Kenichiro; Kubo, Minoru; Ohtani, Misato; Mitsuda, Nobutaka; Demura, Taku; Nakamura, Kenzo; Jürgens, Gerd; Ito, Masaki
2011-01-01
R1R2R3-Myb proteins represent an evolutionarily conserved class of Myb family proteins important for cell cycle regulation and differentiation in eukaryotic cells. In plants, this class of Myb proteins is believed to regulate the transcription of G2/M phase-specific genes by binding to common cis-elements, called mitosis-specific activator (MSA) elements. In Arabidopsis (Arabidopsis thaliana), MYB3R1 and MYB3R4 act as transcriptional activators and positively regulate cytokinesis by activating the transcription of KNOLLE, which encodes a cytokinesis-specific syntaxin. Here, we show that the double mutation myb3r1 myb3r4 causes pleiotropic developmental defects, some of which are due to deficiency of KNOLLE whereas others are not, suggesting that multiple target genes are involved. Consistently, microarray analysis of the double mutant revealed altered expression of many genes, among which G2/M-specific genes showed significant overrepresentation of the MSA motif and a strong tendency to be down-regulated by the double mutation. Our results demonstrate, on a genome-wide level, the importance of the MYB3R-MSA pathway for regulating G2/M-specific transcription. In addition, MYB3R1 and MYB3R4 may have diverse roles during plant development by regulating G2/M-specific genes with various functions as well as genes possibly unrelated to the cell cycle. PMID:21862669
Methylation and microRNA-mediated epigenetic regulation of SOCS3
Boosani, Chandra S.; Agrawal, Devendra K.
2017-01-01
Epigenetic gene silencing of several genes causes different pathological conditions in humans, and DNA methylation has been identified as one of the key mechanisms that underlie this evolutionarily conserved phenomenon associated with developmental and pathological gene regulation. Recent advances in miRNA technology, with high-throughput analysis of gene regulation, have further increased our understanding of the role of miRNAs in regulating the expression of multiple genes. There is increasing evidence that miRNAs not only regulate gene expression but are also involved in the hypermethylation of promoter sequences, which cumulatively contributes to epigenetic gene silencing. Here, we critically evaluated the recent progress on the transcriptional regulation of an important suppressor protein that inhibits cytokine-mediated signaling, SOCS3, whose expression is directly regulated both by promoter methylation and by microRNAs, affecting its vital cell regulating functions. SOCS3 was identified as a potent inhibitor of the Jak/STAT signaling pathway, which is frequently upregulated in several pathologies, including cardiovascular disease, cancer, diabetes, and viral infections; the expression of SOCS3 was inhibited or greatly reduced due to hypermethylation of the CpG islands in its promoter region or suppression of its expression by different microRNAs. Additionally, we discuss key intracellular signaling pathways regulated by SOCS3 involving cellular events, including cell proliferation, cell growth, cell migration and apoptosis. Identification of the pathway intermediates as specific targets would not only aid in the development of novel therapeutic drugs but would also assist in developing new treatment strategies that could successfully be employed in combination therapy to target multiple signaling pathways. PMID:25682267
Flexible CRISPR library construction using parallel oligonucleotide retrieval
Read, Abigail; Gao, Shaojian; Batchelor, Eric
2017-01-01
CRISPR/Cas9-based gene knockout libraries have emerged as a powerful tool for functional screens. We present here a set of pre-designed human and mouse sgRNA sequences that are optimized for both high on-target potency and low off-target effects. To maximize the chance of target gene inactivation, sgRNAs were curated to target both 5′ constitutive exons and exons that encode conserved protein domains. We describe here a robust and cost-effective method to construct multiple small CRISPR libraries from a single oligo pool generated by array synthesis using parallel oligonucleotide retrieval. Together, these resources provide a convenient means for individual labs to generate customized CRISPR libraries of variable size and coverage depth for functional genomics applications. PMID:28334828
Gene conservation of tree species—banking on the future. Proceedings of a workshop.
Richard A. Sniezko; Gary Man; Valerie Hipkins; Keith Woeste; David Gwaze; John T. Kliejunas; Brianna A. McTeague
2017-01-01
The “Gene Conservation of Tree Species—Banking on the Future Workshop” provided a forum for presenting and discussing issues and accomplishments in genetic conservation of trees, and notably those of North America. The meeting gathered scientists, specialists, administrators and conservation practitioners from federal, university, non-governmental and public garden...
Thapa, Kanchan; Manandhar, Sulochana; Bista, Manisha; Shakya, Jivan; Sah, Govind; Dhakal, Maheshwar; Sharma, Netra; Llewellyn, Bronwyn; Wultsch, Claudia; Waits, Lisette P; Kelly, Marcella J; Hero, Jean-Marc; Hughes, Jane; Karmacharya, Dibesh
2018-01-01
With fewer than 200 tigers (Panthera tigris tigris) left in Nepal, that are generally confined to five protected areas across the Terai Arc Landscape, genetic studies are needed to provide crucial information on diversity and connectivity for devising an effective country-wide tiger conservation strategy. As part of the Nepal Tiger Genome Project, we studied landscape change, genetic variation, population structure, and gene flow of tigers across the Terai Arc Landscape by conducting Nepal's first comprehensive and systematic scat-based, non-invasive genetic survey. Of the 770 scat samples collected opportunistically from five protected areas and six presumed corridors, 412 were tiger (57%). Out of ten microsatellite loci, we retain eight markers that were used in identifying 78 individual tigers. We used this dataset to examine population structure, genetic variation, contemporary gene flow, and potential population bottlenecks of tigers in Nepal. We detected three genetic clusters consistent with three demographic sub-populations and found moderate levels of genetic variation (He = 0.61, AR = 3.51) and genetic differentiation (FST = 0.14) across the landscape. We detected 3-7 migrants, confirming the potential for dispersal-mediated gene flow across the landscape. We found evidence of a bottleneck signature likely caused by large-scale land-use change documented in the last two centuries in the Terai forest. Securing tiger habitat including functional forest corridors is essential to enhance gene flow across the landscape and ensure long-term tiger survival. This requires cooperation among multiple stakeholders and careful conservation planning to prevent detrimental effects of anthropogenic activities on tigers. PMID:29561865
Milani, Liliana; Ghiselli, Fabrizio; Guerra, Davide; Breton, Sophie; Passamonti, Marco
2013-01-01
Despite numerous comparative mitochondrial genomics studies revealing that animal mitochondrial genomes are highly conserved in terms of gene content, supplementary genes are sometimes found, often arising from gene duplication. Mitochondrial ORFans (ORFs having no detectable homology and unknown function) were found in bivalve molluscs with Doubly Uniparental Inheritance (DUI) of mitochondria. In DUI animals, two mitochondrial lineages are present: one transmitted through females (F-type) and the other through males (M-type), each showing a specific and conserved ORF. The analysis of 34 mitochondrial major Unassigned Regions of Musculista senhousia F- and M-mtDNA allowed us to verify the presence of novel mitochondrial ORFs in this species and to compare them with ORFs from other species with ascertained DUI, with other bivalves and with animals showing new mitochondrial elements. Overall, 17 ORFans from nine species were analyzed for structure and function. Many clues suggest that the analyzed ORFans arose from endogenization of viral genes. The co-option of such novel genes by viral hosts may have determined some evolutionary aspects of host life cycle, possibly involving mitochondria. The structure similarity of DUI ORFans within evolutionary lineages may also indicate that they originated from independent events. If these novel ORFs are in some way linked to DUI establishment, a multiple origin of DUI has to be considered. These putative proteins may have a role in the maintenance of sperm mitochondria during embryo development, possibly masking them from the degradation processes that normally affect sperm mitochondria in species with strictly maternal inheritance. PMID:23824218
Zygote arrest 1 (Zar1) is an evolutionarily conserved gene expressed in vertebrate ovaries.
Wu, Xuemei; Wang, Pei; Brown, Christopher A; Zilinski, Carolyn A; Matzuk, Martin M
2003-09-01
Zygote arrest 1 (ZAR1) is an ovary-specific maternal factor that plays essential roles during the oocyte-to-embryo transition. In mice, the Zar1 mRNA is detected as a 1.4-kilobase (kb) transcript that is synthesized exclusively in growing oocytes. To further understand the functions of ZAR1, we have cloned the orthologous Zar1 cDNA and/or genes for mouse, rat, human, frog, zebrafish, and pufferfish. The entire mouse Zar1 gene and a related pseudogene span approximately 4.0 kb, contain four exons, and map to adjacent loci on mouse chromosome 5. The human ZAR1 orthologous gene similarly consists of four exons and resides on human chromosome 4p12, which is syntenic with the mouse Zar1 chromosomal locus. Rat (Rattus norvegicus) and pufferfish (Fugu rubripes) Zar1 genes were recognized by database mining and deduced protein alignment analysis. The rat Zar1 gene also maps to a region that is syntenic with the mouse Zar1 gene locus on rat chromosome 14. Frog (Xenopus laevis) and zebrafish (Danio rerio) Zar1 orthologs were cloned by reverse transcription-polymerase chain reaction and rapid amplification of cDNA ends analysis of ovarian mRNA. Unlike mouse and human, the frog Zar1 is detected in multiple tissues, including lung, muscle, and ovary. The Zar1 mRNA appears in the cytoplasm of oocytes and persists until the tailbud stage during frog embryogenesis. Mouse, rat, human, frog, zebrafish, and pufferfish Zar1 genes encode proteins of 361, 361, 424, 295, 329, and 320 amino acids, respectively, and share 50.8%-88.1% amino acid identity. Regions of the N-termini of these ZAR1 orthologs show high sequence identity among these various proteins. However, the C-terminal 103 amino acids of these proteins, encoded by exons 2-4, contain an atypical eight-cysteine Plant Homeo Domain motif and are highly conserved, sharing 80.6%-98.1% identity among these species. 
These findings suggest that the carboxyl-termini of these ZAR1 proteins contain an important functional domain that is conserved through vertebrate evolution and that may be necessary for normal female reproduction in the transition from oocyte to embryonic life.
Izzi, Stephanie A; Colantuono, Bonnie J; Sullivan, Kelly; Khare, Parul; Meedel, Thomas H
2013-04-15
Ci-MRF is the sole myogenic regulatory factor (MRF) of the ascidian Ciona intestinalis, an invertebrate chordate. In order to investigate its properties we developed a simple in vivo assay based on misexpressing Ci-MRF in the notochord of Ciona embryos. We used this assay to examine the roles of three structural motifs that are conserved among MRFs: an alanine-threonine (Ala-Thr) dipeptide of the basic domain that is known in vertebrates as the myogenic code, a cysteine/histidine-rich (C/H) domain found just N-terminal to the basic domain, and a carboxy-terminal amphipathic α-helix referred to as Helix III. We show that the Ala-Thr dipeptide is necessary for normal Ci-MRF function, and that while eliminating the C/H domain or Helix III individually has no demonstrable effect on Ci-MRF, simultaneous loss of both motifs significantly reduces its activity. Our studies also indicate that direct interaction between CiMRF and an essential E-box of Ciona Troponin I is required for the expression of this muscle-specific gene and that multiple classes of MRF-regulated genes exist in Ciona. These findings are consistent with substantial conservation of MRF-directed myogenesis in chordates and demonstrate for the first time that the Ala/Thr dipeptide of the basic domain of an invertebrate MRF behaves as a myogenic code. Copyright © 2013 Elsevier Inc. All rights reserved.
Robertson, Laura S.; Cornman, Robert S.
2014-01-01
We developed genetic resources for two North American frogs, Lithobates clamitans and Pseudacris regilla, widespread native amphibians that are potential indicator species of environmental health. For both species, mRNA from multiple tissues was sequenced using 454 technology. De novo assemblies with Mira3 resulted in 50 238 contigs (N50 = 687 bp) and 48 213 contigs (N50 = 686 bp) for L. clamitans and P. regilla, respectively, after clustering with CD-Hit-EST and purging contigs below 200 bp. We performed BLASTX similarity searches against the Xenopus tropicalis proteome and, for predicted ORFs, HMMER similarity searches against the Pfam-A database. Because there is broad interest in amphibian immune factors, we manually annotated putative antimicrobial peptides. To identify conserved regions suitable for amplicon resequencing across a broad taxonomic range, we performed an additional assembly of public short-read transcriptome data derived from two species of the genus Rana and identified reciprocal best TBLASTX matches among all assemblies. Although P. regilla, a hylid frog, is substantially more diverged from the ranid species, we identified 56 genes that were sufficiently conserved to allow nondegenerate primer design with Primer3. In addition to providing a foundation for comparative genomics and quantitative gene expression analysis, our results enable quick development of nuclear sequence-based markers for phylogenetics or population genetics.
Synteny of Prunus and other model plant species
Jung, Sook; Jiwan, Derick; Cho, Ilhyung; Lee, Taein; Abbott, Albert; Sosinski, Bryon; Main, Dorrie
2009-01-01
Background Fragmentary conservation of synteny has been reported between map-anchored Prunus sequences and Arabidopsis. With the availability of genome sequence for fellow rosid I members Populus and Medicago, we analyzed the synteny between Prunus and the three model genomes. Eight Prunus BAC sequences and map-anchored Prunus sequences were used in the comparison. Results We found a well conserved synteny across the Prunus species – peach, plum, and apricot – and Populus using a set of homologous Prunus BACs. Conversely, we could not detect any synteny with Arabidopsis in this region. Other peach BACs also showed extensive synteny with Populus. The syntenic regions detected were up to 477 kb in Populus. Two syntenic regions between Arabidopsis and these BACs were much shorter, around 10 kb. We also found syntenic regions that are conserved between the Prunus BACs and Medicago. The array of synteny corresponded with the proposed whole genome duplication events in Populus and Medicago. Using map-anchored Prunus sequences, we detected many syntenic blocks with several gene pairs between Prunus and Populus or Arabidopsis. We observed a more complex network of synteny between Prunus and Arabidopsis, indicative of multiple genome duplication and subsequent gene loss in Arabidopsis. Conclusion Our results show the striking microsynteny between the Prunus BACs and the genomes of Populus and Medicago. In macrosynteny analysis, more distinct Prunus regions were syntenic to Populus than to Arabidopsis. PMID:19208249
Ancient Origin of the Tryptophan Operon and the Dynamics of Evolutionary Change
Xie, Gary; Keyhani, Nemat O.; Bonner; Jensen, Roy A.
2003-01-01
The seven conserved enzymatic domains required for tryptophan (Trp) biosynthesis are encoded in seven genetic regions that are organized differently (whole-pathway operons, multiple partial-pathway operons, and dispersed genes) in prokaryotes. A comparative bioinformatics evaluation of the conservation and organization of the genes of Trp biosynthesis in prokaryotic operons should serve as an excellent model for assessing the feasibility of predicting the evolutionary histories of genes and operons associated with other biochemical pathways. These comparisons should provide a better understanding of possible explanations for differences in operon organization in different organisms at a genomics level. These analyses may also permit identification of some of the prevailing forces that dictated specific gene rearrangements during the course of evolution. Operons concerned with Trp biosynthesis in prokaryotes have been in a dynamic state of flux. Analysis of closely related organisms among the Bacteria at various phylogenetic nodes reveals many examples of operon scission, gene dispersal, gene fusion, gene scrambling, and gene loss from which the direction of evolutionary events can be deduced. Two milestone evolutionary events have been mapped to the 16S rRNA tree of Bacteria, one splitting the operon in two, and the other rejoining it by gene fusion. The Archaea, though less resolved due to a lesser genome representation, appear to exhibit more gene scrambling than the Bacteria. The trp operon appears to have been an ancient innovation; it was already present in the common ancestor of Bacteria and Archaea. Although the operon has been subjected, even in recent times, to dynamic changes in gene rearrangement, the ancestral gene order can be deduced with confidence. 
The evolutionary history of the genes of the pathway is discernible in rough outline as a vertical line of descent, with events of lateral gene transfer or paralogy enriching the analysis as interesting features that can be distinguished. As additional genomes are thoroughly analyzed, an increasingly refined resolution of the sequential evolutionary steps is clearly possible. These comparisons suggest that present-day trp operons that possess finely tuned regulatory features are under strong positive selection and are able to resist the disruptive evolutionary events that may be experienced by simpler, poorly regulated operons. PMID:12966138
A case study of assigning conservation value to dispersed habitat units for conservation planning
Rohweder, Jason J.; Vacek, Sara C.; Crimmins, Shawn M.; Thogmartin, Wayne E.
2015-01-01
Resource managers are increasingly tasked with developing habitat conservation plans in the face of numerous, sometimes competing, objectives. These plans must often be implemented across dispersed habitat conservation units that may contribute unequally to overall conservation objectives. Using U.S. Fish and Wildlife Service waterfowl production areas (WPA) in western Minnesota as our conservation landscape, we develop a landscape-scale approach for evaluating the conservation value of dispersed habitat conservation units with multiple conservation priorities. We evaluated conservation value based on a suite of variables directly applicable to conservation management practices, thus providing a direct link between conservation actions and outcomes. We developed spatial models specific to each of these conservation objectives and also developed two freely available prioritization tools to implement these analyses. We found that some WPAs provided high conservation value across a range of conservation objectives, suggesting that managing these specific areas would achieve multiple conservation goals. Conversely, other WPAs provided low conservation value for some objectives, suggesting they would be most effectively managed for a distinct set of specific conservation goals. Approaches such as ours provide a direct means of assessing the conservation value of dispersed habitat conservation units and could be useful in the development of habitat management plans, particularly when faced with multiple conservation objectives.
Li, Fupeng; Hao, Chaoyun; Yan, Lin; Wu, Baoduo; Qin, Xiaowei; Lai, Jianxiong; Song, Yinghui
2015-09-01
In higher plants, sucrose synthase (Sus, EC 2.4.1.13) is widely considered a key enzyme in sucrose metabolism. Although several paralogous genes encoding different isozymes of Sus have been identified and characterized in multiple plant genomes, detailed information about the Sus genes has so far been lacking for cacao. This study reports the identification of six novel Sus genes from the economically important cacao tree. Analyses of the gene structure and phylogeny of the Sus genes demonstrated evolutionary conservation in the Sus family across cacao and other plant species. The expression of cacao Sus genes was investigated via real-time PCR in various tissues and in different developmental phases of leaf, flower bud and pod. The Sus genes exhibited distinct but partially redundant expression profiles in cacao, with TcSus1, TcSus5 and TcSus6 being the predominant genes in the bark with phloem and TcSus2 being predominantly expressed in the seed during the stereotype stage. TcSus3 and TcSus4 were detected at significantly higher levels in the pod husk and seed coat during pod development, and showed development-dependent expression profiles in the cacao pod. These results provide new insights into the evolution of the cacao Sus gene family and basic information that will assist in elucidating its functions.
Lind, Abigail L.; Wisecaver, Jennifer H.; Smith, Timothy D.; Feng, Xuehuan; Calvo, Ana M.; Rokas, Antonis
2015-01-01
Filamentous fungi produce diverse secondary metabolites (SMs) essential to their ecology and adaptation. Although each SM is typically produced by only a handful of species, global SM production is governed by widely conserved transcriptional regulators in conjunction with other cellular processes, such as development. We examined the interplay between the taxonomic narrowness of SM distribution and the broad conservation of global regulation of SM and development in Aspergillus, a diverse fungal genus whose members produce well-known SMs such as penicillin and gliotoxin. Evolutionary analysis of the 2,124 genes comprising the 262 SM pathways in four Aspergillus species showed that most SM pathways were species-specific, that the number of SM gene orthologs was significantly lower than that of orthologs in primary metabolism, and that the few conserved SM orthologs typically belonged to non-homologous SM pathways. RNA sequencing of two master transcriptional regulators of SM and development, veA and mtfA, showed that the effects of deletion of each gene, especially veA, on SM pathway regulation were similar in A. fumigatus and A. nidulans, even though the underlying genes and pathways regulated in each species differed. In contrast, examination of the role of these two regulators in development, where 94% of the underlying genes are conserved in both species showed that whereas the role of veA is conserved, mtfA regulates development in the homothallic A. nidulans but not in the heterothallic A. fumigatus. Thus, the regulation of these highly conserved developmental genes is divergent, whereas–despite minimal conservation of target genes and pathways–the global regulation of SM production is largely conserved. 
We suggest that the evolution of the transcriptional regulation of secondary metabolism in Aspergillus represents a novel type of regulatory circuit rewiring and hypothesize that it has been largely driven by the dramatic turnover of the target genes involved in the process. PMID:25786130
Kukita, Yoji; Okami, Jiro; Yoneda-Kato, Noriko; Nakamae, Ikuko; Kawabata, Takeshi; Higashiyama, Masahiko; Kato, Junya; Kodama, Ken; Kato, Kikuya
2016-01-01
In clinical practice, there are a number of cancer patients with clear family histories, but the patients lack mutations in known familial cancer syndrome genes. Recent advances in genomic technologies have enhanced the possibility of identifying causative genes in such cases. Two siblings, an elder sister and a younger brother, were found to have multiple primary lung cancers at the age of 60. The former subsequently developed breast cancer and had a history of uterine myoma. The latter had initially developed prostate cancer at the age of 59 and had a history of colon cancer. Single-nucleotide polymorphism (SNP) genotyping revealed that ∼10% of the genomes were homozygous in both patients. Exome sequencing revealed nonsynonymous mutations in five genes in the runs of homozygosity: CHEK2, FCGRT, INPP5J, MYO18B, and SFI1. Evolutionary conservation of primary protein structures suggested the functional importance of the CHEK2 mutation, p.R474C. This mutation altered the tertiary structure of CHK2 by disrupting the salt bridge between p.R474 and p.E394. No such structural changes were observed with the other mutated genes. Subsequent cell-based transfection analysis revealed that CHK2 p.R474C was unstable and scarcely activated. We concluded that the homozygous CHEK2 variant was contributory in this case of familial cancer. Although homozygous inactivation of CHEK2 in mice led to cancers in multiple organs, accumulation of additional human cases is needed to establish its pathogenic role in humans. PMID:27900359
Goldstone, Jared V; Sundaramoorthy, Munirathinam; Zhao, Bin; Waterman, Michael R; Stegeman, John J; Lamb, David C
2016-01-01
Biosynthesis of steroid hormones in vertebrates involves three cytochrome P450 hydroxylases, CYP11A1, CYP17A1 and CYP19A1, which catalyze sequential steps in steroidogenesis. These enzymes are conserved in the vertebrates, but their origin and existence in other chordate subphyla (Tunicata and Cephalochordata) have not been clearly established. In this study, selected protein sequences of CYP11A1, CYP17A1 and CYP19A1 were compiled and analyzed using multiple sequence alignment and phylogenetic analysis. Our analyses show that cephalochordates have sequences orthologous to vertebrate CYP11A1, CYP17A1 or CYP19A1, and that echinoderms and hemichordates possess CYP11-like but not CYP19 genes. While the cephalochordate sequences have low identity with the vertebrate sequences, reflecting evolutionary distance, the data show apparent origin of CYP11 prior to the evolution of CYP19 and possibly CYP17, thus indicating a sequential origin of these functionally related steroidogenic CYPs. Co-occurrence of the three CYPs in early chordates suggests that the three genes may have coevolved thereafter, and that functional conservation should be reflected in functionally important residues in the proteins. CYP19A1 has the largest number of conserved residues while CYP11A1 sequences are less conserved. Structural analyses of human CYP11A1, CYP17A1 and CYP19A1 show that critical substrate binding site residues are highly conserved in each enzyme family. The results emphasize that the steroidogenic pathways producing glucocorticoids and reproductive steroids are several hundred million years old and that the catalytic structural elements of the enzymes have been conserved over the same period of time. Analysis of these elements may help to identify when precursor functions linked to these enzymes first arose. Copyright © 2015 Elsevier Inc. All rights reserved.
Pinzón-Latorre, David; Deyholos, Michael K
2013-10-30
Background Pectin methylesterases (PMEs) catalyze the demethylesterification of homogalacturonans in the cell wall; their activity is regulated in part by pectin methylesterase inhibitors (PMEIs). PME activity may result in either rigidification or loosening of the cell wall, depending on the mode of demethylesterification. The activity of PMEs in the middle lamella is expected to affect intrusive elongation of phloem fibers, and their adhesion to adjacent cells. Length and extractability of phloem fibers are qualities important for their industrial uses in textiles and composites. As only three flax PMEs had been previously described, we were motivated to characterize the PME and PMEI gene families of flax. Results We identified 105 putative flax PMEs (LuPMEs) and 95 putative PMEIs (LuPMEIs) within the whole-genome assembly. We found experimental evidence for the transcription of 77/105 LuPMEs and 83/95 LuPMEIs, and surveyed the transcript abundance of these in 12 different tissues and stages of development. Six major monophyletic groups of LuPMEs could be defined based on the inferred relationships of flax genes and their presumed orthologs from other species. We searched the LuPMEs and LuPMEIs for conserved residues previously reported to be important for their tertiary structure and function. In the LuPMEs, the most highly conserved residues were catalytic residues while in the LuPMEIs, cysteines forming disulfide bridges between helices α2 and α3 were most highly conserved. In general, the conservation of critical residues was higher in the genes with evidence of transcript expression than in those for which no expression was detected. Conclusions The LuPMEs and LuPMEIs comprise large families with complex patterns of transcript expression and a wide range of physical characteristics. We observed that multiple PMEs and PMEIs are expressed in partially overlapping domains, indicative of several genes acting redundantly during most processes. 
The potential for functional redundancy was highlighted also by the phylogenetic analyses. We were able to identify a subset of PME and PMEIs that appeared particularly relevant to fiber development, which may provide a basis for the improvement of key traits in industrial feedstocks and a better understanding of the physiological roles of PMEs and PMEIs in general. PMID:24168262
Syring, John V; Tennessen, Jacob A; Jennings, Tara N; Wegrzyn, Jill; Scelfo-Dalbey, Camille; Cronn, Richard
2016-01-01
Whitebark pine (Pinus albicaulis) inhabits an expansive range in western North America, and it is a keystone species of subalpine environments. Whitebark is susceptible to multiple threats - climate change, white pine blister rust, mountain pine beetle, and fire exclusion - and it is suffering significant mortality range-wide, prompting the tree to be listed as 'globally endangered' by the International Union for Conservation of Nature and 'endangered' by the Canadian government. Conservation collections (in situ and ex situ) are being initiated to preserve the genetic legacy of the species. Reliable, transferrable, and highly variable genetic markers are essential for quantifying the genetic profiles of seed collections relative to natural stands, and ensuring the completeness of conservation collections. We evaluated the use of hybridization-based target capture to enrich specific genomic regions from the 27 GB genome of whitebark pine, and to evaluate genetic variation across loci, trees, and geography. Probes were designed to capture 7,849 distinct genes, and screening was performed on 48 trees. Despite the inclusion of repetitive elements in the probe pool, the resulting dataset provided information on 4,452 genes and 32% of targeted positions (528,873 bp), and we were able to identify 12,390 segregating sites from 47 trees. Variations reveal strong geographic trends in heterozygosity and allelic richness, with trees from the southern Cascade and Sierra Range showing the greatest distinctiveness and differentiation. Our results show that even under non-optimal conditions (low enrichment efficiency; inclusion of repetitive elements in baits), targeted enrichment produces high quality, codominant genotypes from large genomes. The resulting data can be readily integrated into management and gene conservation activities for whitebark pine, and have the potential to be applied to other members of 5-needle pine group (Pinus subsect. Quinquefolia) due to their limited genetic divergence.
Di, Chao; Xu, Wenying; Su, Zhen; Yuan, Joshua S
2010-10-07
The PHB (prohibitin) gene family is involved in a variety of functions important for different biological processes. PHB genes are ubiquitously present in divergent species from prokaryotes to eukaryotes. Human PHB genes have been found to be associated with various diseases. Recent studies by our group and others have shown diverse functions of PHB genes in plants, including roles in development, senescence, and defence. Despite the importance of the PHB gene family, no comprehensive gene family analysis has been carried out to evaluate the relatedness of PHB genes across different species. In order to better guide gene function analysis and understand the evolution of the PHB gene family, we therefore carried out a comparative genome analysis of the PHB genes across different kingdoms. The relatedness, motif distribution, and intron/exon distribution all indicated that the PHB genes form a relatively conserved gene family. The PHB genes can be classified into 5 classes, and each class has a very deep evolutionary origin. The PHB genes within each class maintained the same motif patterns during evolution. With Arabidopsis as the model species, we found that PHB gene intron/exon structure and domains are also conserved during evolution. Despite being a conserved gene family, various gene duplication events led to the expansion of the PHB genes. Both segmental and tandem gene duplication were involved in Arabidopsis PHB gene family expansion. However, segmental duplication is predominant in Arabidopsis. Moreover, most of the duplicated genes experienced neofunctionalization. The results highlighted that PHB genes might be involved in important functions so that the duplicated genes are under evolutionary pressure to derive new functions. The PHB gene family is conserved and accounts for diverse but important biological functions based on similar molecular mechanisms. The highly diverse biological functions indicate that more research is needed to dissect PHB gene function. The conserved gene evolution indicates that studies in model species can be translated to human and mammalian studies.
Delaney, Shannon M.; Mavrodi, Dmitri V.; Bonsall, Robert F.; Thomashow, Linda S.
2001-01-01
Certain strains of root-colonizing fluorescent Pseudomonas spp. produce phenazines, a class of antifungal metabolites that can provide protection against various soilborne root pathogens. Despite the fact that the phenazine biosynthetic locus is highly conserved among fluorescent Pseudomonas spp., individual strains differ in the range of phenazine compounds they produce. This study focuses on the ability of Pseudomonas aureofaciens 30-84 to produce 2-hydroxyphenazine-1-carboxylic acid (2-OH-PCA) and 2-hydroxyphenazine from the common phenazine metabolite phenazine-1-carboxylic acid (PCA). P. aureofaciens 30-84 contains a novel gene located downstream from the core phenazine operon that encodes a 55-kDa aromatic monooxygenase responsible for the hydroxylation of PCA to produce 2-OH-PCA. Knowledge of the genes responsible for phenazine product specificity could ultimately reveal ways to manipulate organisms to produce multiple phenazines or novel phenazines not previously described. PMID:11114932
Wang, Ling-Yan; Li, Shi-Tao; Guo, Lian-Hong; Jiang, Rong; Li, Yuan
2003-08-01
Streptomyces sp. 139 was recently identified in our laboratory as the producer of a new exopolysaccharide, designated EPS 139A, that shows anti-rheumatoid arthritis activity. Our strategy for studying EPS 139A biosynthesis was to clone the key gene in the EPS biosynthesis pathway, i.e. the priming glycosyltransferase gene, whose product catalyzes the first step of nucleotide sugar transfer. A degenerate primer-based PCR approach was adopted to isolate the putative priming glycosyltransferase gene in Streptomyces sp. 139. A multiple alignment of the amino acid sequences of priming glycosyltransferases identified in several microorganisms was used to find regions conserved among all of them. Degenerate primers were designed from these conserved regions, taking Streptomyces codon usage into account, to amplify an internal DNA fragment of the gene. A distinctive PCR product with the expected size of 0.3 kb was amplified from Streptomyces sp. 139 total genomic DNA. Sequence analysis showed that it is part of a putative priming glycosyltransferase gene and contains the predicted conserved domain B. To isolate the complete priming glycosyltransferase gene, a Streptomyces sp. 139 genomic library was constructed in the E. coli-Streptomyces shuttle vector pOJ446. Using the 0.3 kb PCR product as a probe, 17 positive colonies were isolated by colony hybridization. A 4.0 kb BamHI fragment, present in all positive cosmids and hybridizing to this probe, was sequenced and revealed the complete priming glycosyltransferase gene. The priming glycosyltransferase gene ste5 (GenBank accession number AY131229) most likely begins with GTG, preceded by a probable ribosome binding site (RBS), GGGGA. It encodes a 492-amino-acid protein with a molecular weight of 54 kDa and an isoelectric point of 10.6. The G + C content of ste5 is 73%, close to the average G + C content (74%) for Streptomyces. Moreover, ste5 shows a preference for G or C as the third base of codons, in accordance with Streptomyces codon usage. A BlastP search showed that the C-terminal region of Ste5 shows high homology with priming glycosyltransferases from many different organisms. Ste5 contains two putative catalytic residues, Glu and Asp (residues 423 and 474), with a spacing of approximately 50 amino acids that is conserved in various beta-glycosyltransferases. Moreover, the C-terminal one third of Ste5 contains three domains, A, B and C, that are reported to be common to glycosyltransferases. A hydrophilicity plot predicts 5 putative transmembrane domains in the N-terminal two thirds of Ste5. To investigate the involvement of the identified polysaccharide gene cluster in EPS 139A biosynthesis, the gene ste5 encoding priming glycosyltransferase was insertionally disrupted by a single-crossover homologous recombination event. A 0.85 kb internal fragment of ste5 was cloned into vector pKC1139 to yield pLY5015, which was transduced into Streptomyces sp. 139. Correct integration in the ste5- mutant strain Streptomyces LY1001 was confirmed by Southern hybridization. After fermentation, no EPS 139A could be detected in cultures of the ste5- mutant strain Streptomyces LY1001. Therefore, the gene ste5 identified in this work is involved in the synthesis of the Streptomyces sp. 139 EPS.
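The G + C figures quoted in this abstract (overall content and the G/C bias at third codon positions) are simple counts over a coding sequence. The following is a minimal sketch of how such values could be computed; the function names and the 15-nucleotide toy sequence are hypothetical and are not the ste5 sequence or the authors' method.

```python
def gc_content(seq: str) -> float:
    """Fraction of G or C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def gc3_content(cds: str) -> float:
    """Fraction of G or C at third codon positions of an in-frame CDS."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3   # ignore a trailing partial codon
    third_bases = cds[2:usable:3]      # every third base, starting at codon position 3
    if not third_bases:
        raise ValueError("sequence shorter than one codon")
    return gc_content(third_bases)


# Hypothetical toy sequence (five codons starting with GTG), for illustration only.
toy_cds = "GTGGCCGACCTGACC"
print(f"G+C: {gc_content(toy_cds):.1%}, GC3: {gc3_content(toy_cds):.1%}")
```

A high GC3 value relative to overall G + C is the kind of third-position bias the abstract describes for Streptomyces.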
Angiosperm phylogeny inferred from multiple genes as a tool for comparative biology.
Soltis, P S; Soltis, D E; Chase, M W
1999-11-25
Comparative biology requires a firm phylogenetic foundation to uncover and understand patterns of diversification and evaluate hypotheses of the processes responsible for these patterns. In the angiosperms, studies of diversification in floral form, stamen organization, reproductive biology, photosynthetic pathway, nitrogen-fixing symbioses and life histories have relied on either explicit or implied phylogenetic trees. Furthermore, to understand the evolution of specific genes and gene families, evaluate the extent of conservation of plant genomes and make proper sense of the huge volume of molecular genetic data available for model organisms such as Arabidopsis, Antirrhinum, maize, rice and wheat, a phylogenetic perspective is necessary. Here we report the results of parsimony analyses of DNA sequences of the plastid genes rbcL and atpB and the nuclear 18S rDNA for 560 species of angiosperms and seven non-flowering seed plants and show a well-resolved and well-supported phylogenetic tree for the angiosperms for use in comparative biology.
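The parsimony criterion behind such analyses can be illustrated on a single alignment column. The sketch below is a minimal Fitch-style count of the fewest state changes a given rooted binary tree requires; the four taxon labels, the toy column and the name `fitch_changes` are hypothetical, and nothing here reproduces the software or data of the study.

```python
def fitch_changes(tree, states):
    """Return (possible_states, min_changes) for one alignment column.

    tree   -- a leaf name (str) or a pair (left_subtree, right_subtree)
    states -- dict mapping each leaf name to its observed character state
    """
    if isinstance(tree, str):                 # leaf: state is observed, no change needed
        return {states[tree]}, 0
    left, right = tree
    left_set, left_cost = fitch_changes(left, states)
    right_set, right_cost = fitch_changes(right, states)
    shared = left_set & right_set
    if shared:                                # children agree: no extra change
        return shared, left_cost + right_cost
    return left_set | right_set, left_cost + right_cost + 1


# Hypothetical column: taxa A and B carry G, taxa C and D carry T.
column = {"A": "G", "B": "G", "C": "T", "D": "T"}
tree_1 = (("A", "B"), ("C", "D"))
tree_2 = (("A", "C"), ("B", "D"))
for name, tree in (("tree_1", tree_1), ("tree_2", tree_2)):
    print(name, fitch_changes(tree, column)[1])
```

Summing this count over all columns gives a tree's parsimony score; the tree with the lowest total is preferred, which is why `tree_1` (one change) beats `tree_2` (two changes) for this column.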
Biological Insights From 108 Schizophrenia-Associated Genetic Loci
Ripke, Stephan; Neale, Benjamin M; Corvin, Aiden; Walters, James TR; Farh, Kai-How; Holmans, Peter A; Lee, Phil; Bulik-Sullivan, Brendan; Collier, David A; Huang, Hailiang; Pers, Tune H; Agartz, Ingrid; Agerbo, Esben; Albus, Margot; Alexander, Madeline; Amin, Farooq; Bacanu, Silviu A; Begemann, Martin; Belliveau, Richard A; Bene, Judit; Bergen, Sarah E; Bevilacqua, Elizabeth; Bigdeli, Tim B; Black, Donald W; Bruggeman, Richard; Buccola, Nancy G; Buckner, Randy L; Byerley, William; Cahn, Wiepke; Cai, Guiqing; Campion, Dominique; Cantor, Rita M; Carr, Vaughan J; Carrera, Noa; Catts, Stanley V; Chambert, Kimberley D; Chan, Raymond CK; Chan, Ronald YL; Chen, Eric YH; Cheng, Wei; Cheung, Eric FC; Chong, Siow Ann; Cloninger, C Robert; Cohen, David; Cohen, Nadine; Cormican, Paul; Craddock, Nick; Crowley, James J; Curtis, David; Davidson, Michael; Davis, Kenneth L; Degenhardt, Franziska; Del Favero, Jurgen; Demontis, Ditte; Dikeos, Dimitris; Dinan, Timothy; Djurovic, Srdjan; Donohoe, Gary; Drapeau, Elodie; Duan, Jubao; Dudbridge, Frank; Durmishi, Naser; Eichhammer, Peter; Eriksson, Johan; Escott-Price, Valentina; Essioux, Laurent; Fanous, Ayman H; Farrell, Martilias S; Frank, Josef; Franke, Lude; Freedman, Robert; Freimer, Nelson B; Friedl, Marion; Friedman, Joseph I; Fromer, Menachem; Genovese, Giulio; Georgieva, Lyudmila; Giegling, Ina; Giusti-Rodríguez, Paola; Godard, Stephanie; Goldstein, Jacqueline I; Golimbet, Vera; Gopal, Srihari; Gratten, Jacob; de Haan, Lieuwe; Hammer, Christian; Hamshere, Marian L; Hansen, Mark; Hansen, Thomas; Haroutunian, Vahram; Hartmann, Annette M; Henskens, Frans A; Herms, Stefan; Hirschhorn, Joel N; Hoffmann, Per; Hofman, Andrea; Hollegaard, Mads V; Hougaard, David M; Ikeda, Masashi; Joa, Inge; Julià, Antonio; Kahn, René S; Kalaydjieva, Luba; Karachanak-Yankova, Sena; Karjalainen, Juha; Kavanagh, David; Keller, Matthew C; Kennedy, James L; Khrunin, Andrey; Kim, Yunjung; Klovins, Janis; Knowles, James A; Konte, Bettina; Kucinskas, Vaidutis; 
Kucinskiene, Zita Ausrele; Kuzelova-Ptackova, Hana; Kähler, Anna K; Laurent, Claudine; Lee, Jimmy; Lee, S Hong; Legge, Sophie E; Lerer, Bernard; Li, Miaoxin; Li, Tao; Liang, Kung-Yee; Lieberman, Jeffrey; Limborska, Svetlana; Loughland, Carmel M; Lubinski, Jan; Lönnqvist, Jouko; Macek, Milan; Magnusson, Patrik KE; Maher, Brion S; Maier, Wolfgang; Mallet, Jacques; Marsal, Sara; Mattheisen, Manuel; Mattingsdal, Morten; McCarley, Robert W; McDonald, Colm; McIntosh, Andrew M; Meier, Sandra; Meijer, Carin J; Melegh, Bela; Melle, Ingrid; Mesholam-Gately, Raquelle I; Metspalu, Andres; Michie, Patricia T; Milani, Lili; Milanova, Vihra; Mokrab, Younes; Morris, Derek W; Mors, Ole; Murphy, Kieran C; Murray, Robin M; Myin-Germeys, Inez; Müller-Myhsok, Bertram; Nelis, Mari; Nenadic, Igor; Nertney, Deborah A; Nestadt, Gerald; Nicodemus, Kristin K; Nikitina-Zake, Liene; Nisenbaum, Laura; Nordin, Annelie; O’Callaghan, Eadbhard; O’Dushlaine, Colm; O’Neill, F Anthony; Oh, Sang-Yun; Olincy, Ann; Olsen, Line; Van Os, Jim; Pantelis, Christos; Papadimitriou, George N; Papiol, Sergi; Parkhomenko, Elena; Pato, Michele T; Paunio, Tiina; Pejovic-Milovancevic, Milica; Perkins, Diana O; Pietiläinen, Olli; Pimm, Jonathan; Pocklington, Andrew J; Powell, John; Price, Alkes; Pulver, Ann E; Purcell, Shaun M; Quested, Digby; Rasmussen, Henrik B; Reichenberg, Abraham; Reimers, Mark A; Richards, Alexander L; Roffman, Joshua L; Roussos, Panos; Ruderfer, Douglas M; Salomaa, Veikko; Sanders, Alan R; Schall, Ulrich; Schubert, Christian R; Schulze, Thomas G; Schwab, Sibylle G; Scolnick, Edward M; Scott, Rodney J; Seidman, Larry J; Shi, Jianxin; Sigurdsson, Engilbert; Silagadze, Teimuraz; Silverman, Jeremy M; Sim, Kang; Slominsky, Petr; Smoller, Jordan W; So, Hon-Cheong; Spencer, Chris C A; Stahl, Eli A; Stefansson, Hreinn; Steinberg, Stacy; Stogmann, Elisabeth; Straub, Richard E; Strengman, Eric; Strohmaier, Jana; Stroup, T Scott; Subramaniam, Mythily; Suvisaari, Jaana; Svrakic, Dragan M; Szatkiewicz, Jin P; Söderman, Erik; Thirumalai, Srinivas; Toncheva, Draga; Tosato, Sarah; Veijola, Juha; Waddington, John; Walsh, Dermot; Wang, Dai; Wang, Qiang; Webb, Bradley T; Weiser, Mark; Wildenauer, Dieter B; Williams, Nigel M; Williams, Stephanie; Witt, Stephanie H; Wolen, Aaron R; Wong, Emily HM; Wormley, Brandon K; Xi, Hualin Simon; Zai, Clement C; Zheng, Xuebin; Zimprich, Fritz; Wray, Naomi R; Stefansson, Kari; Visscher, Peter M; Adolfsson, Rolf; Andreassen, Ole A; Blackwood, Douglas HR; Bramon, Elvira; Buxbaum, Joseph D; Børglum, Anders D; Cichon, Sven; Darvasi, Ariel; Domenici, Enrico; Ehrenreich, Hannelore; Esko, Tõnu; Gejman, Pablo V; Gill, Michael; Gurling, Hugh; Hultman, Christina M; Iwata, Nakao; Jablensky, Assen V; Jönsson, Erik G; Kendler, Kenneth S; Kirov, George; Knight, Jo; Lencz, Todd; Levinson, Douglas F; Li, Qingqin S; Liu, Jianjun; Malhotra, Anil K; McCarroll, Steven A; McQuillin, Andrew; Moran, Jennifer L; Mortensen, Preben B; Mowry, Bryan J; Nöthen, Markus M; Ophoff, Roel A; Owen, Michael J; Palotie, Aarno; Pato, Carlos N; Petryshen, Tracey L; Posthuma, Danielle; Rietschel, Marcella; Riley, Brien P; Rujescu, Dan; Sham, Pak C; Sklar, Pamela; St Clair, David; Weinberger, Daniel R; Wendland, Jens R; Werge, Thomas; Daly, Mark J; Sullivan, Patrick F; O’Donovan, Michael C
2014-01-01
Summary Schizophrenia is a highly heritable disorder. Genetic risk is conferred by a large number of alleles, including common alleles of small effect that might be detected by genome-wide association studies. Here, we report a multi-stage schizophrenia genome-wide association study of up to 36,989 cases and 113,075 controls. We identify 128 independent associations spanning 108 conservatively defined loci that meet genome-wide significance, 83 of which have not been previously reported. Associations were enriched among genes expressed in brain providing biological plausibility for the findings. Many findings have the potential to provide entirely novel insights into aetiology, but associations at DRD2 and multiple genes involved in glutamatergic neurotransmission highlight molecules of known and potential therapeutic relevance to schizophrenia, and are consistent with leading pathophysiological hypotheses. Independent of genes expressed in brain, associations were enriched among genes expressed in tissues that play important roles in immunity, providing support for the hypothesized link between the immune system and schizophrenia. PMID:25056061
oPOSSUM: integrated tools for analysis of regulatory motif over-representation
Ho Sui, Shannan J.; Fulton, Debra L.; Arenillas, David J.; Kwon, Andrew T.; Wasserman, Wyeth W.
2007-01-01
The identification of over-represented transcription factor binding sites from sets of co-expressed genes provides insights into the mechanisms of regulation for diverse biological contexts. oPOSSUM, an internet-based system for such studies of regulation, has been improved and expanded in this new release. New features include a worm-specific version for investigating binding sites conserved between Caenorhabditis elegans and C. briggsae, as well as a yeast-specific version for the analysis of co-expressed sets of Saccharomyces cerevisiae genes. The human and mouse applications feature improvements in ortholog mapping, sequence alignments and the delineation of multiple alternative promoters. oPOSSUM2, introduced for the analysis of over-represented combinations of motifs in human and mouse genes, has been integrated with the original oPOSSUM system. Analysis using user-defined background gene sets is now supported. The transcription factor binding site models have been updated to include new profiles from the JASPAR database. oPOSSUM is available at http://www.cisreg.ca/oPOSSUM/ PMID:17576675
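Over-representation of a binding-site motif in a co-expressed gene set, compared with a background set, is commonly scored with a one-sided hypergeometric (Fisher-type) test. The sketch below shows that calculation under stated assumptions: the function name and the gene counts are hypothetical, and the abstract does not say which statistic oPOSSUM itself applies, so this is not a description of the tool's implementation.

```python
from math import comb


def enrichment_p_value(hits_in_set: int, set_size: int,
                       hits_in_background: int, background_size: int) -> float:
    """One-sided hypergeometric P(X >= hits_in_set).

    X is the number of motif-containing genes expected when set_size genes are
    drawn at random, without replacement, from a background of background_size
    genes of which hits_in_background contain the motif.
    """
    misses_in_background = background_size - hits_in_background
    max_hits = min(set_size, hits_in_background)
    tail = sum(
        comb(hits_in_background, k) * comb(misses_in_background, set_size - k)
        for k in range(hits_in_set, max_hits + 1)
    )
    return tail / comb(background_size, set_size)


# Hypothetical counts: 4 of 5 co-expressed genes carry the motif,
# versus 5 of 20 genes in the background.
print(enrichment_p_value(4, 5, 5, 20))
```

The background counts must include the genes of the co-expressed set; a small p-value suggests the motif occurs in the set more often than random sampling from that background would explain.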
What is a gene? From molecules to metaphysics.
Rolston, Holmes
2006-01-01
Mendelian genes have become molecular genes, with increasing puzzlement about locating them, due to increasing complexity in genomic webworks. Genome science finds modular and conserved units of inheritance, identified as homologous genes. Such genes are cybernetic, transmitting information over generations; this too requires multi-leveled analysis, from DNA transcription to development and reproduction of the whole organism. Genes are conserved; genes are also dynamic and creative in evolutionary speciation, most remarkably producing humans capable of wondering about what genes are.
The Metarhizium anisopliae trp1 gene: cloning and regulatory analysis.
Staats, Charley Christian; Silva, Marcia Suzana Nunes; Pinto, Paulo Marcos; Vainstein, Marilene Henning; Schrank, Augusto
2004-07-01
The trp1 gene from the entomopathogenic fungus Metarhizium anisopliae, cloned by heterologous hybridization with the plasmid carrying the trpC gene from Aspergillus nidulans, was sequence characterized. The predicted translation product has the conserved catalytic domains of glutamine amidotransferase (G domain), indoleglycerolphosphate synthase (C domain), and phosphoribosyl anthranilate isomerase (F domain) organized as NH2-G-C-F-COOH. The ORF is interrupted by a single intron of 60 nt that is position conserved in relation to trp genes from Ascomycetes and length conserved in relation to Basidiomycetes species. RT-PCR analysis suggests constitutive expression of trp1 gene in M. anisopliae.
Seim, Inge; Carter, Shea L; Herington, Adrian C; Chopin, Lisa K
2008-01-01
Background The peptide hormone ghrelin has many important physiological and pathophysiological roles, including the stimulation of growth hormone (GH) release, appetite regulation, gut motility and proliferation of cancer cells. We previously identified a gene on the opposite strand of the ghrelin gene, ghrelinOS (GHRLOS), which spans the promoter and untranslated regions of the ghrelin gene (GHRL). Here we further characterise GHRLOS. Results We have described GHRLOS mRNA isoforms that extend over 1.4 kb of the promoter region and 106 nucleotides of exon 4 of the ghrelin gene, GHRL. These GHRLOS transcripts initiate 4.8 kb downstream of the terminal exon 4 of GHRL and are present in the 3' untranslated exon of the adjacent gene TATDN2 (TatD DNase domain containing 2). Interestingly, we have also identified a putative non-coding TATDN2-GHRLOS chimaeric transcript, indicating that GHRLOS RNA biogenesis is extremely complex. Moreover, we have discovered that the 3' region of GHRLOS is also antisense, in a tail-to-tail fashion to a novel terminal exon of the neighbouring SEC13 gene, which is important in protein transport. Sequence analyses revealed that GHRLOS is riddled with stop codons, and that there is little nucleotide and amino-acid sequence conservation of the GHRLOS gene between vertebrates. The gene spans 44 kb on 3p25.3, is extensively spliced and harbours multiple variable exons. We have also investigated the expression of GHRLOS and found evidence of differential tissue expression. It is highly expressed in tissues which are emerging as major sites of non-coding RNA expression (the thymus, brain, and testis), as well as in the ovary and uterus. In contrast, very low levels were found in the stomach where sense, GHRL derived RNAs are highly expressed. Conclusion GHRLOS RNA transcripts display several distinctive features of non-coding (ncRNA) genes, including 5' capping, polyadenylation, extensive splicing and short open reading frames. 
The gene is also non-conserved, with differential and tissue-restricted expression. The overlapping genomic arrangement of GHRLOS with the ghrelin gene indicates that it is likely to have interesting regulatory and functional roles in the ghrelin axis. PMID:18954468
Seim, Inge; Carter, Shea L; Herington, Adrian C; Chopin, Lisa K
2008-10-28
The peptide hormone ghrelin has many important physiological and pathophysiological roles, including the stimulation of growth hormone (GH) release, appetite regulation, gut motility and proliferation of cancer cells. We previously identified a gene on the opposite strand of the ghrelin gene, ghrelinOS (GHRLOS), which spans the promoter and untranslated regions of the ghrelin gene (GHRL). Here we further characterise GHRLOS. We have described GHRLOS mRNA isoforms that extend over 1.4 kb of the promoter region and 106 nucleotides of exon 4 of the ghrelin gene, GHRL. These GHRLOS transcripts initiate 4.8 kb downstream of the terminal exon 4 of GHRL and are present in the 3' untranslated exon of the adjacent gene TATDN2 (TatD DNase domain containing 2). Interestingly, we have also identified a putative non-coding TATDN2-GHRLOS chimaeric transcript, indicating that GHRLOS RNA biogenesis is extremely complex. Moreover, we have discovered that the 3' region of GHRLOS is also antisense, in a tail-to-tail fashion to a novel terminal exon of the neighbouring SEC13 gene, which is important in protein transport. Sequence analyses revealed that GHRLOS is riddled with stop codons, and that there is little nucleotide and amino-acid sequence conservation of the GHRLOS gene between vertebrates. The gene spans 44 kb on 3p25.3, is extensively spliced and harbours multiple variable exons. We have also investigated the expression of GHRLOS and found evidence of differential tissue expression. It is highly expressed in tissues which are emerging as major sites of non-coding RNA expression (the thymus, brain, and testis), as well as in the ovary and uterus. In contrast, very low levels were found in the stomach where sense, GHRL derived RNAs are highly expressed. GHRLOS RNA transcripts display several distinctive features of non-coding (ncRNA) genes, including 5' capping, polyadenylation, extensive splicing and short open reading frames. 
The gene is also non-conserved, with differential and tissue-restricted expression. The overlapping genomic arrangement of GHRLOS with the ghrelin gene indicates that it is likely to have interesting regulatory and functional roles in the ghrelin axis.
Gene conservation in California's forests
Constance I. Millar
1986-01-01
The University of California's Wildland Resources Center has established a new program of forest gene conservation to ensure that California's rich and diverse forests maintain their vigor and productivity in the face of human activities. At an international level, conservation biologists recognize the importance not only of protecting rare species from...
Comparative genomics and evolution of the HSP90 family of genes across all kingdoms of organisms.
Chen, Bin; Zhong, Daibin; Monteiro, Antónia
2006-06-17
HSP90 proteins are essential molecular chaperones involved in signal transduction, cell cycle control, stress management, and folding, degradation, and transport of proteins. HSP90 proteins have been found in a variety of organisms suggesting that they are ancient and conserved. In this study we investigate the nuclear genomes of 32 species across all kingdoms of organisms, and all sequences available in GenBank, and address the diversity, evolution, gene structure, conservation and nomenclature of the HSP90 family of genes across all organisms. Twelve new genes and a new type HSP90C2 were identified. The chromosomal location, exon splicing, and prediction of whether they are functional copies were documented, as well as the amino acid length and molecular mass of their polypeptides. The conserved regions across all protein sequences, and signature sequences in each subfamily were determined, and a standardized nomenclature system for this gene family is presented. The proeukaryote HSP90 homologue, HTPG, exists in most Bacteria species but not in Archaea, and it evolved into three lineages (Groups A, B and C) via two gene duplication events. None of the organellar-localized HSP90s were derived from endosymbionts of early eukaryotes. Mitochondrial TRAP and endoplasmic reticulum HSP90B separately originated from the ancestors of HTPG Group A in Firmicutes-like organisms very early in the formation of the eukaryotic cell. TRAP is monophyletic and present in all Animalia and some Protista species, while HSP90B is paraphyletic and present in all eukaryotes with the exception of some Fungi species, which appear to have lost it. Both HSP90C (chloroplast HSP90C1 and location-undetermined HSP90C2) and cytosolic HSP90A are monophyletic, and originated from HSP90B by independent gene duplications. HSP90C exists only in Plantae, and was duplicated into HSP90C1 and HSP90C2 isoforms in higher plants.
HSP90A occurs across all eukaryotes, and duplicated into HSP90AA and HSP90AB in vertebrates. Diplomonadida was identified as the most basal organism in the eukaryote lineage. The present study presents the first comparative genomic study and evolutionary analysis of the HSP90 family of genes across all kingdoms of organisms. HSP90 family members underwent multiple duplications and also subsequent losses during their evolution. This study established an overall framework of information for the family of genes, which may facilitate and stimulate the study of this gene family across all organisms.
Comparative genomics and evolution of the HSP90 family of genes across all kingdoms of organisms
Chen, Bin; Zhong, Daibin; Monteiro, Antónia
2006-01-01
Background HSP90 proteins are essential molecular chaperones involved in signal transduction, cell cycle control, stress management, and folding, degradation, and transport of proteins. HSP90 proteins have been found in a variety of organisms suggesting that they are ancient and conserved. In this study we investigate the nuclear genomes of 32 species across all kingdoms of organisms, and all sequences available in GenBank, and address the diversity, evolution, gene structure, conservation and nomenclature of the HSP90 family of genes across all organisms. Results Twelve new genes and a new type HSP90C2 were identified. The chromosomal location, exon splicing, and prediction of whether they are functional copies were documented, as well as the amino acid length and molecular mass of their polypeptides. The conserved regions across all protein sequences, and signature sequences in each subfamily were determined, and a standardized nomenclature system for this gene family is presented. The proeukaryote HSP90 homologue, HTPG, exists in most Bacteria species but not in Archaea, and it evolved into three lineages (Groups A, B and C) via two gene duplication events. None of the organellar-localized HSP90s were derived from endosymbionts of early eukaryotes. Mitochondrial TRAP and endoplasmic reticulum HSP90B separately originated from the ancestors of HTPG Group A in Firmicutes-like organisms very early in the formation of the eukaryotic cell. TRAP is monophyletic and present in all Animalia and some Protista species, while HSP90B is paraphyletic and present in all eukaryotes with the exception of some Fungi species, which appear to have lost it. Both HSP90C (chloroplast HSP90C1 and location-undetermined HSP90C2) and cytosolic HSP90A are monophyletic, and originated from HSP90B by independent gene duplications. HSP90C exists only in Plantae, and was duplicated into HSP90C1 and HSP90C2 isoforms in higher plants.
HSP90A occurs across all eukaryotes, and duplicated into HSP90AA and HSP90AB in vertebrates. Diplomonadida was identified as the most basal organism in the eukaryote lineage. Conclusion The present study presents the first comparative genomic study and evolutionary analysis of the HSP90 family of genes across all kingdoms of organisms. HSP90 family members underwent multiple duplications and also subsequent losses during their evolution. This study established an overall framework of information for the family of genes, which may facilitate and stimulate the study of this gene family across all organisms. PMID:16780600
Interfamily Transfer of Dual NB-LRR Genes Confers Resistance to Multiple Pathogens
Narusaka, Mari; Kubo, Yasuyuki; Hatakeyama, Katsunori; Imamura, Jun; Ezura, Hiroshi; Nanasato, Yoshihiko; Tabei, Yutaka; Takano, Yoshitaka; Shirasu, Ken; Narusaka, Yoshihiro
2013-01-01
A major class of disease resistance (R) genes which encode nucleotide binding and leucine rich repeat (NB-LRR) proteins have been used in traditional breeding programs for crop protection. However, it has been difficult to functionally transfer NB-LRR-type R genes in taxonomically distinct families. Here we demonstrate that a pair of Arabidopsis (Brassicaceae) NB-LRR-type R genes, RPS4 and RRS1, properly function in two other Brassicaceae, Brassica rapa and Brassica napus, but also in two Solanaceae, Nicotiana benthamiana and tomato (Solanum lycopersicum). The solanaceous plants transformed with RPS4/RRS1 confer bacterial effector-specific immunity responses. Furthermore, RPS4 and RRS1, which confer resistance to a fungal pathogen Colletotrichum higginsianum in Brassicaceae, also protect against Colletotrichum orbiculare in cucumber (Cucurbitaceae). Importantly, RPS4/RRS1 transgenic plants show no autoimmune phenotypes, indicating that the NB-LRR proteins are tightly regulated. The successful transfer of two R genes at the family level implies that the downstream components of R genes are highly conserved. The functional interfamily transfer of R genes can be a powerful strategy for providing resistance to a broad range of pathogens. PMID:23437080
Gene organization and alternative splicing of human prohormone convertase PC8.
Goodge, K A; Thomas, R J; Martin, T J; Gillespie, M T
1998-01-01
The mammalian Ca2+-dependent serine protease prohormone convertase PC8 is expressed ubiquitously, being transcribed as 3.5, 4.3 and 6.0 kb mRNA isoforms in various tissues. To determine the origin of these various mRNA isoforms we report the characterization of the human PC8 gene, which has been previously localized to chromosome 11q23-24. Consisting of 16 exons, the human PC8 gene spans approx. 27 kb. A comparison of the position of intron-exon junctions of the human PC8 gene with the gene structures of previously reported prohormone convertase genes demonstrated a divergence of the human PC8 from the highly conserved nature of the gene organization of this enzyme family. The nucleotide sequence of the 5'-flanking region of the human PC8 is reported and possesses putative promoter elements characteristic of a GC-rich promoter. Further supporting the potential role of a GC-rich promoter element, multiple transcriptional initiation sites within a 200 bp region were demonstrated. We propose that the various mRNA isoforms of PC8 result from the inclusion of intronic sequences within transcripts. PMID:9820811
A curated catalog of canine and equine keratin genes
Pujar, Shashikant; McGarvey, Kelly M.; Welle, Monika; Galichet, Arnaud; Müller, Eliane J.; Pruitt, Kim D.; Leeb, Tosso
2017-01-01
Keratins represent a large protein family with essential structural and functional roles in epithelial cells of skin, hair follicles, and other organs. During evolution the genes encoding keratins have undergone multiple rounds of duplication and humans have two clusters with a total of 55 functional keratin genes in their genomes. Due to the high similarity between different keratin paralogs and species-specific differences in gene content, the currently available keratin gene annotation in species with draft genome assemblies such as dog and horse is still imperfect. We compared the National Center for Biotechnology Information (NCBI) (dog annotation release 103, horse annotation release 101) and Ensembl (release 87) gene predictions for the canine and equine keratin gene clusters to RNA-seq data that were generated from adult skin of five dogs and two horses and from adult hair follicle tissue of one dog. Taking into consideration the knowledge on the conserved exon/intron structure of keratin genes, we annotated 61 putatively functional keratin genes in both the dog and horse, respectively. Subsequently, curators in the RefSeq group at NCBI reviewed their annotation of keratin genes in the dog and horse genomes (Annotation Release 104 and Annotation Release 102, respectively) and updated annotation and gene nomenclature of several keratin genes. The updates are now available in the NCBI Gene database (https://www.ncbi.nlm.nih.gov/gene). PMID:28846680
Awan, Ali R; Manfredo, Amanda; Pleiss, Jeffrey A
2013-07-30
Alternative splicing is a potent regulator of gene expression that vastly increases proteomic diversity in multicellular eukaryotes and is associated with organismal complexity. Although alternative splicing is widespread in vertebrates, little is known about the evolutionary origins of this process, in part because of the absence of phylogenetically conserved events that cross major eukaryotic clades. Here we describe a lariat-sequencing approach, which offers high sensitivity for detecting splicing events, and its application to the unicellular fungus, Schizosaccharomyces pombe, an organism that shares many of the hallmarks of alternative splicing in mammalian systems but for which no previous examples of exon-skipping had been demonstrated. Over 200 previously unannotated splicing events were identified, including examples of regulated alternative splicing. Remarkably, an evolutionary analysis of four of the exons identified here as subject to skipping in S. pombe reveals high sequence conservation and perfect length conservation with their homologs in scores of plants, animals, and fungi. Moreover, alternative splicing of two of these exons has been documented in multiple vertebrate organisms, making these the first demonstrations of identical alternative-splicing patterns in species that are separated by over 1 billion years of evolution.
Genome-wide transcriptomics of aging in the rotifer Brachionus manjavacas, an emerging model system.
Gribble, Kristin E; Mark Welch, David B
2017-03-01
Understanding gene expression changes over lifespan in diverse animal species will lead to insights into conserved processes in the biology of aging and allow development of interventions to improve health. Rotifers are small aquatic invertebrates that have been used in aging studies for nearly 100 years and are now re-emerging as a modern model system. To provide a baseline to evaluate genetic responses to interventions that change health throughout lifespan and a framework for new hypotheses about the molecular genetic mechanisms of aging, we examined the transcriptome of an asexual female lineage of the rotifer Brachionus manjavacas at five life stages: eggs, neonates, and early-, late-, and post-reproductive adults. There are widespread shifts in gene expression over the lifespan of B. manjavacas; the largest change occurs between neonates and early reproductive adults and is characterized by down-regulation of developmental genes and up-regulation of genes involved in reproduction. The expression profile of post-reproductive adults was distinct from that of other life stages. While few genes were significantly differentially expressed in the late- to post-reproductive transition, gene set enrichment analysis revealed multiple down-regulated pathways in metabolism, maintenance and repair, and proteostasis, united by genes involved in mitochondrial function and oxidative phosphorylation. This study provides the first examination of changes in gene expression over lifespan in rotifers. We detected differential expression of many genes with human orthologs that are absent in Drosophila and C. elegans, highlighting the potential of the rotifer model in aging studies. Our findings suggest that small but coordinated changes in expression of many genes in pathways that integrate diverse functions drive the aging process.
The observation of simultaneous declines in expression of genes in multiple pathways may have consequences for health and longevity not detected by single- or multi-gene knockdown in otherwise healthy animals. Investigation of subtle but genome-wide change in these pathways during aging is an important area for future study.
The Regulatory Small RNA MarS Supports Virulence of Streptococcus pyogenes.
Pappesch, Roberto; Warnke, Philipp; Mikkat, Stefan; Normann, Jana; Wisniewska-Kucper, Aleksandra; Huschka, Franziska; Wittmann, Maja; Khani, Afsaneh; Schwengers, Oliver; Oehmcke-Hecht, Sonja; Hain, Torsten; Kreikemeyer, Bernd; Patenge, Nadja
2017-09-25
Small regulatory RNAs (sRNAs) play a role in the control of bacterial virulence gene expression. In this study, we investigated an sRNA that was identified in Streptococcus pyogenes (group A Streptococcus, GAS) but is conserved throughout various streptococci. In a deletion strain, expression of mga, the gene encoding the multiple virulence gene regulator, was reduced. Accordingly, transcript and proteome analyses revealed decreased expression of several Mga-activated genes. Therefore, and because the sRNA was shown to interact with the 5' UTR of the mga transcript in a gel-shift assay, we designated it MarS for mga-activating regulatory sRNA. Down-regulation of important virulence factors, including the antiphagocytic M-protein, led to increased susceptibility of the deletion strain to phagocytosis and reduced adherence to human keratinocytes. In a mouse infection model, the marS deletion mutant showed reduced dissemination to the liver, kidney, and spleen. Additionally, deletion of marS led to increased tolerance towards oxidative stress. Our in vitro and in vivo results indicate a modulating effect of MarS on virulence gene expression and on the pathogenic potential of GAS.
Transcriptional Dysregulation of MYC Reveals Common Enhancer-Docking Mechanism.
Schuijers, Jurian; Manteiga, John Colonnese; Weintraub, Abraham Selby; Day, Daniel Sindt; Zamudio, Alicia Viridiana; Hnisz, Denes; Lee, Tong Ihn; Young, Richard Allen
2018-04-10
Transcriptional dysregulation of the MYC oncogene is among the most frequent events in aggressive tumor cells, and this is generally accomplished by acquisition of a super-enhancer somewhere within the 2.8 Mb TAD where MYC resides. We find that these diverse cancer-specific super-enhancers, differing in size and location, interact with the MYC gene through a common and conserved CTCF binding site located 2 kb upstream of the MYC promoter. Genetic perturbation of this enhancer-docking site in tumor cells reduces CTCF binding, super-enhancer interaction, MYC gene expression, and cell proliferation. CTCF binding is highly sensitive to DNA methylation, and this enhancer-docking site, which is hypomethylated in diverse cancers, can be inactivated through epigenetic editing with dCas9-DNMT. Similar enhancer-docking sites occur at other genes, including genes with prominent roles in multiple cancers, suggesting a mechanism by which tumor cell oncogenes can generally hijack enhancers. These results provide insights into mechanisms that allow a single target gene to be regulated by diverse enhancer elements in different cell types. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.
Jeong, Chang-Bum; Kim, Hui-Su; Kang, Hye-Min; Lee, Jae-Seong
2017-04-01
The ATP-binding cassette (ABC) protein superfamily is known to play a fundamental role in biological processes and is highly conserved across animal taxa. The ABC proteins function as active transporters for multiple substrates across the cellular membrane by ATP hydrolysis. As this superfamily is derived from a common ancestor, ABC genes have evolved via lineage-specific duplications through the process of adaptation. In this review, we summarized information about the ABC gene families in aquatic invertebrates, considering their evolution and putative functions in defense mechanisms. Phylogenetic analysis was conducted to examine the evolutionary significance of ABC gene families in aquatic invertebrates. Particularly, a massive expansion of multixenobiotic resistance (MXR)-mediated efflux transporters was identified in the absence of the ABCG2 (BCRP) gene in Ecdysozoa and Platyzoa, suggesting that a loss of Abcg2 gene occurred sporadically in these species during divergence of Protostome to Lophotrochozoa. Furthermore, in aquatic invertebrates, the ecotoxicological significance of MXR is discussed while considering the role of MXR-mediated efflux transporters in response to various environmental pollutants. Copyright © 2017 Elsevier B.V. All rights reserved.
Lundqvist, Mats L; Kohlberg, Kathleen E; Gefroh, Holly A; Arnaud, Philippe; Middleton, Darlene L; Romano, Tracy A; Warr, Gregory W
2002-07-01
Clones encoding the dolphin IgM heavy (μ) chain gene were isolated from a cDNA library of peripheral blood leukocytes. Genomic Southern blot analyses showed that the dolphin IGHM gene is most likely present in a single copy, and its sequence shows greatest similarity to those of the IGHM gene of the sheep, pig and cow, evolutionarily related artiodactyls. The transmembrane (TM) form of the IGHM chain was isolated by 3' RACE. While showing similarities to the TM regions of other mammalian IGHM chains, the highly conserved Ser residue of the CART motif is substituted with a Gly in the dolphin. In contrast to the pig and cow, which utilize only a single VH family, the dolphin expresses at least two distinct VH families, belonging to the mammalian VH clans I and III. At least two JH genes were identified in the dolphin. Some CDR3 regions of the dolphin VH are long (up to 21 amino acids), and contain multiple Cys residues, hypothesized to stabilize the CDR3 structure through disulfide bond formation.
Cerebroretinal microangiopathy with calcifications and cysts associated with CTC1 and NDP mutations.
Romaniello, Romina; Arrigoni, Filippo; Citterio, Andrea; Tonelli, Alessandra; Sforzini, Cinzia; Rizzari, Carmelo; Pessina, Marco; Triulzi, Fabio; Bassi, Maria Teresa; Borgatti, Renato
2013-12-01
Mutations in the conserved telomere maintenance component 1 (CTC1) gene were recently described in Coats plus syndrome and in cerebroretinal microangiopathy with calcifications and cysts. Norrie disease protein (NDP) gene was found mutated in Norrie disease, in Familial Exudative Vitreoretinopathy, and in Coats syndrome. Here we describe a boy affected by Norrie disease who developed typical features of cerebroretinal microangiopathy with calcifications and cysts. Direct sequencing of the CTC1 and NDP genes in this patient shows the presence of compound heterozygosity for 2 mutations in CTC1 (c.775G>A, p.V259M and a novel microdeletion c.1213delG) and a missense mutation in the NDP gene (c.182T>C, p.L61P). Based on these genetic findings and on the expression of both genes in endothelial cells, we postulate that microangiopathy might be a primary underlying pathologic abnormality in cerebroretinal microangiopathy with calcifications and cysts. This hypothesis is further supported by magnetic resonance imaging (MRI) data showing multiple minute calcifications in the deep gray nuclei and in terminal arteriolar zones.
Zhao, G; Hortsch, M
1998-07-17
Members of the L1 family of neural cell adhesion molecules consist of multiple extracellular immunoglobulin and fibronectin type III domains that mediate the adhesive properties of this group of transmembrane proteins. In vertebrate genomes, these protein domains are separated by introns, and it has been suggested that L1-type genes might have been subject to exon-shuffling events during evolution. However, comparison of the human L1-CAM and the chicken neurofascin gene with the genomic structure of their Drosophila homologue, neuroglian, indicates that no major rearrangement of protein domains has taken place subsequent to the split of the arthropod and chordate phyla. The Drosophila neuroglian gene appears to have lost most of the introns that have been conserved in the human L1-CAM and the chicken neurofascin gene. Nevertheless, exon shuffling or the generation of new exons by mutational changes might have been responsible for the generation of additional, alternatively spliced exons in L1-type genes.
Burns, Terry C; Li, Matthew D; Mehta, Swapnil; Awad, Ahmed J; Morgan, Alexander A
2015-07-15
Translational research for neurodegenerative disease depends intimately upon animal models. Unfortunately, promising therapies developed using mouse models mostly fail in clinical trials, highlighting uncertainty about how well mouse models mimic human neurodegenerative disease at the molecular level. We compared the transcriptional signature of neurodegeneration in mouse models of Alzheimer's disease (AD), Parkinson's disease (PD), Huntington's disease (HD) and amyotrophic lateral sclerosis (ALS) to human disease. In contrast to aging, which demonstrated a conserved transcriptome between humans and mice, only 3 of 19 animal models showed significant enrichment for gene sets comprising the most dysregulated up- and down-regulated human genes. Spearman's correlation analysis revealed even healthy human aging to be more closely related to human neurodegeneration than any mouse model of AD, PD, ALS or HD. Remarkably, mouse models frequently upregulated stress response genes that were consistently downregulated in human diseases. Among potential alternate models of neurodegeneration, mouse prion disease outperformed all other disease-specific models. Even among the best available animal models, conserved differences between mouse and human transcriptomes were found across multiple animal model versus human disease comparisons, surprisingly, even including aging. Relative to mouse models, mouse disease signatures demonstrated consistent trends toward preserved mitochondrial function, protein catabolism, DNA repair responses, and chromatin maintenance. These findings suggest a more complex and multifactorial pathophysiology in human neurodegeneration than is captured through standard animal models, and suggest that even among conserved physiological processes such as aging, mice are less prone to exhibit neurodegeneration-like changes.
This work may help explain the poor track record of mouse-based translational therapies for neurodegeneration and provides a path forward to critically evaluate and improve animal models of human disease. Copyright © 2015 Elsevier B.V. All rights reserved.
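The Spearman comparison mentioned in the abstract above can be illustrated with a minimal sketch. This is not the authors' pipeline: the function names are hypothetical, and the inputs are assumed to be two equal-length lists of per-gene expression changes for matched orthologs. Spearman's coefficient is computed here as the Pearson correlation of average ranks, which handles ties.

```python
import math

def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over a run of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson(x, y):
    """Plain Pearson correlation of two equal-length numeric lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def spearman(x, y):
    """Spearman rank correlation between two signatures (hypothetical helper)."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("signatures must have the same length (>= 2)")
    return pearson(average_ranks(x), average_ranks(y))
```

Because only ranks enter the calculation, a monotonic but nonlinear relationship between two signatures still yields a coefficient of 1, which is one reason rank correlation is a common choice for cross-species expression comparisons.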
Uittenbogaard, Martine; Martinka, Debra L.; Johnson, Peter F.; Vinson, Charles; Chiaramello, Anne
2009-01-01
Expression of the bHLH transcription factor Nex1/MATH-2/NeuroD6, a member of the NeuroD subfamily, parallels overt neuronal differentiation and synaptogenesis during brain development. Our previous studies have shown that Nex1 is a critical effector of the NGF pathway and promotes neuronal differentiation and survival of PC12 cells in the absence of growth factors. In this study, we investigated the transcriptional regulation of the Nex1 gene during NGF-induced neuronal differentiation. We found that Nex1 expression is under the control of two conserved promoters, Nex1-P1 and Nex1-P2, located in two distinct non-coding exons. Both promoters are TATA-less with multiple transcription start sites, and are activated on NGF or cAMP exposure. Luciferase-reporter assays showed that the Nex1-P2 promoter activity is stronger than the Nex1-P1 promoter activity, which supports the previously reported differential expression levels of Nex1 transcripts throughout brain development. Using a combination of DNaseI footprinting, EMSA assays, and site-directed mutagenesis, we identified the essential regulatory elements within the first 2 kb of the Nex1 5′UTR. The Nex1-P1 promoter is mainly regulated by a conserved CRE element, whereas the Nex1-P2 promoter is under the control of a conserved C/EBP binding site. Overexpression of wild-type C/EBPβ resulted in increased Nex1-P2 promoter activity in NGF-differentiated PC12 cells. The fact that Nex1 is a target gene of C/EBPβ provides new insight into the C/EBP transcriptional cascade known to promote neurogenesis, while repressing gliogenesis. PMID:17075921
Nawaz, Zarqa; Kakar, Kaleem Ullah; Saand, Mumtaz A; Shu, Qing-Yao
2014-10-04
Cyclic nucleotide-gated channels (CNGCs) are Ca2+-permeable cation transport channels, which are present in both animal and plant systems. They have been implicated in the uptake of both essential and toxic cations, Ca2+ signaling, pathogen defense, and thermotolerance in plants. To date there has not been a genome-wide overview of the CNGC gene family in any economically important crop, including rice (Oryza sativa L.). There is an urgent need for a thorough genome-wide analysis and experimental verification of this gene family in rice. In this study, a total of 16 full length rice CNGC genes distributed on chromosomes 1-6, 9 and 12, were identified by employing comprehensive bioinformatics analyses. Based on phylogeny, the family of OsCNGCs was classified into four major groups (I-IV) and two sub-groups (IV-A and IV-B). Likewise, the CNGCs from all plant lineages clustered into four groups (I-IV), where group II was conserved in all land plants. Gene duplication analysis revealed that both chromosomal segmentation (OsCNGC1 and 2, 10 and 11, 15 and 16) and tandem duplications (OsCNGC1 and 2) significantly contributed to the expansion of this gene family. Motif composition and protein sequence analysis revealed that the CNGC specific domain "cyclic nucleotide-binding domain (CNBD)" comprises a "phosphate binding cassette" (PBC) and a "hinge" region that is highly conserved among the OsCNGCs. In addition, OsCNGC proteins also contain various other functional motifs and post-translational modification sites. We successively built a stringent motif: (LI-X(2)-[GS]-X-[FV]-X-G-[1]-ELL-X-W-X(12,22)-SA-X(2)-T-X(7)-[EQ]-AF-X-L) that recognizes the rice CNGCs specifically.
Prediction of cis-acting regulatory elements in 5' upstream sequences and expression analyses through quantitative PCR (qPCR) demonstrated that OsCNGC genes were highly responsive to multiple stimuli including hormonal (abscisic acid, indoleacetic acid, kinetin and ethylene), biotic (Pseudomonas fuscovaginae and Xanthomonas oryzae pv. oryzae) and abiotic (cold) stress. There are 16 CNGC genes in rice, which were probably expanded through chromosomal segmentation and tandem duplications and comprise a PBC and a "hinge" region in the CNBD domain, featured by a stringent motif. The various cis-acting regulatory elements in the upstream sequences may be responsible for responding to multiple stimuli, including hormonal, biotic and abiotic stresses.
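A motif written in this PROSITE-like notation can be turned into a regular expression for scanning protein sequences. The sketch below is an assumption-laden illustration, not the authors' tool: the function name `motif_to_regex` is hypothetical, runs of letters such as `SA` or `ELL` are read literally as they appear in the abstract, and the `[1]` token is deliberately rejected because its meaning cannot be recovered from the text.

```python
import re

def motif_to_regex(motif):
    """Translate a hyphen-separated PROSITE-like motif into a Python regex.

    Supported tokens (an assumed subset of the notation):
      X        -> any one residue
      X(n)     -> exactly n residues
      X(a,b)   -> between a and b residues
      [GS]     -> one residue from the listed set
      ELL      -> literal residues
    Anything else (for example "[1]") raises ValueError.
    """
    parts = []
    for token in motif.split("-"):
        gap = re.fullmatch(r"X(?:\((\d+)(?:,(\d+))?\))?", token)
        if gap:
            low, high = gap.group(1), gap.group(2)
            if low is None:
                parts.append(".")
            elif high is None:
                parts.append(".{%s}" % low)
            else:
                parts.append(".{%s,%s}" % (low, high))
        elif re.fullmatch(r"\[[A-Z]+\]", token):
            parts.append(token)          # residue set, same syntax in regex
        elif re.fullmatch(r"[A-Z]+", token):
            parts.append(token)          # literal residues
        else:
            raise ValueError("unsupported token: %r" % token)
    return "".join(parts)

# C-terminal part of the motif quoted in the abstract (no ambiguous tokens).
tail = motif_to_regex("SA-X(2)-T-X(7)-[EQ]-AF-X-L")

# Made-up peptide used only to exercise the pattern; not a real CNGC sequence.
hit = re.search(tail, "MKSAGGTAAAAAAAEAFKLRR")
```

Only the unambiguous tail of the motif is compiled here; applying the full pattern would first require settling what the `[1]` position stands for in the original publication.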
Meyer, Miriah; Wunderlich, Zeba; Simirenko, Lisa; Luengo Hendriks, Cris L.; Keränen, Soile V. E.; Henriquez, Clara; Knowles, David W.; Biggin, Mark D.; Eisen, Michael B.; DePace, Angela H.
2011-01-01
Differences in the level, timing, or location of gene expression can contribute to alternative phenotypes at the molecular and organismal level. Understanding the origins of expression differences is complicated by the fact that organismal morphology and gene regulatory networks could potentially vary even between closely related species. To assess the scope of such changes, we used high-resolution imaging methods to measure mRNA expression in blastoderm embryos of Drosophila yakuba and Drosophila pseudoobscura and assembled these data into cellular resolution atlases, where expression levels for 13 genes in the segmentation network are averaged into species-specific, cellular resolution morphological frameworks. We demonstrate that the blastoderm embryos of these species differ in their morphology in terms of size, shape, and number of nuclei. We present an approach to compare cellular gene expression patterns between species, while accounting for varying embryo morphology, and apply it to our data and an equivalent dataset for Drosophila melanogaster. Our analysis reveals that all individual genes differ quantitatively in their spatio-temporal expression patterns between these species, primarily in terms of their relative position and dynamics. Despite many small quantitative differences, cellular gene expression profiles for the whole set of genes examined are largely similar. This suggests that cell types at this stage of development are conserved, though they can differ in their relative position by up to 3–4 cell widths and in their relative proportion between species by as much as 5-fold. Quantitative differences in the dynamics and relative level of a subset of genes between corresponding cell types may reflect altered regulatory functions between species. 
Our results emphasize that transcriptional networks can diverge over short evolutionary timescales and that even small changes can lead to distinct output in terms of the placement and number of equivalent cells. PMID:22046143
Dai, Mengyao; Wang, Yao; Fang, Lu; Irwin, David M.; Zhu, Tengteng; Zhang, Junpeng; Zhang, Shuyi; Wang, Zhe
2014-01-01
Bats are the only mammals capable of self-powered flight using wings. Differing from mouse or human limbs, four elongated digits within a broad wing membrane support the bat wing, and the foot of the bat has evolved a long calcar that spreads the interfemoral membrane. Our recent mRNA sequencing (mRNA-Seq) study found unique expression patterns for genes at the 5′ end of the Hoxd gene cluster and for Tbx3 that are associated with digit elongation and wing membrane growth in bats. In this study, we focused on two additional genes, Meis2 and Mab21l2, identified from the mRNA-Seq data. Using whole-mount in situ hybridization (WISH) we validated the mRNA-Seq results for differences in the expression patterns of Meis2 and Mab21l2 between bat and mouse limbs, and further characterized the timing and location of the expression of these two genes. These analyses suggest that Meis2 may function in wing membrane growth and Mab21l2 may have a role in AP and DV axial patterning. In addition, we found that Tbx3 is uniquely expressed in the calcar structure found in the bat hindlimb, suggesting a role for this gene in calcar growth and elongation. Moreover, analysis of the coding sequences for Meis2, Mab21l2 and Tbx3 showed that Meis2 and Mab21l2 have high sequence identity, consistent with the functions of genes being conserved, but that Tbx3 showed accelerated evolution in bats. However, evidence for positive selection in Tbx3 was not found, which would suggest that the function of this gene has not been changed. Together, our findings support the hypothesis that the modulation of the spatiotemporal expression patterns of multiple functionally conserved genes controls limb morphology and drives morphological change in the diversification of mammalian limbs. PMID:25166052
Xu, Dong-Bei; Gao, Shi-Qing; Ma, You-Zhi; Xu, Zhao-Shi; Zhao, Chang-Ping; Tang, Yi-Miao; Li, Xue-Yin; Li, Lian-Cheng; Chen, Yao-Feng; Chen, Ming
2014-12-01
The phytohormone abscisic acid (ABA) plays crucial roles in adaptive responses of plants to abiotic stresses. ABA-responsive element binding proteins (AREBs) are basic leucine zipper transcription factors that regulate the expression of downstream genes containing ABA-responsive elements (ABREs) in promoter regions. A novel ABI-like (ABA-insensitive) transcription factor gene, named TaABL1, containing a conserved basic leucine zipper (bZIP) domain was cloned from wheat. Southern blotting showed that three copies were present in the wheat genome. Phylogenetic analyses indicated that TaABL1 belonged to the AREB subfamily of the bZIP transcription factor family and was most closely related to ZmABI5 in maize and OsAREB2 in rice. Expression of TaABL1 was highly induced in wheat roots, stems, and leaves by ABA, drought, high salt, and low temperature stresses. TaABL1 was localized inside the nuclei of transformed wheat mesophyll protoplast. Overexpression of TaABL1 enhanced responses of transgenic plants to ABA and hastened stomatal closure under stress, thereby improving tolerance to multiple abiotic stresses. Furthermore, overexpression of TaABL1 upregulated or downregulated the expression of some stress-related genes controlling stomatal closure in transgenic plants under ABA and drought stress conditions, suggesting that TaABL1 might be a valuable genetic resource for transgenic molecular breeding.
Jia, Mingrui; Shi, Ranran; Zhao, Xuli; Fu, Zhijian; Bai, Zhijing; Sun, Tao; Zhao, Xuejun; Wang, Wenbo; Xu, Chao; Yan, Fang
2017-01-01
Mutation analysis, as the gold standard, is particularly important in the diagnosis of osteogenesis imperfecta (OI), which may be preventable upon early diagnosis. In this study, we aimed to analyze the clinical and genetic data of an OI pedigree and to confirm the deleterious nature of the mutation. A pedigree with OI was identified. All family members received careful clinical examinations and blood was drawn for genetic analyses. Genes implicated in OI were screened for mutation. The function and structure of the mutant protein were predicted using bioinformatics analysis. The proband showed abnormal sonographic images as a 9-month fetus. Disproportionately short stature and a triangular face with blue sclera were noticed at birth. She could barely walk and suffered multiple fractures up to 2 years of age. Her mother had small stature, frequent fractures, blue sclera, and deformity of the extremities. A heterozygous missense mutation c.1009G>T (p.G337C) in the COL1A2 gene was identified in the proband and her mother. Bioinformatics analysis showed p.G337 was well-conserved among multiple species and the mutation probably changed the structure and damaged the function of collagen. We suggest that the mutation p.G337C in the COL1A2 gene is pathogenic for OI by affecting the protein structure and the function of collagen. PMID:28953610
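The correspondence between the coding-DNA change c.1009G>T and the protein change p.G337C can be checked with simple arithmetic. The sketch below assumes the usual convention that coding positions are 1-based and start at the A of the ATG start codon; the helper names are hypothetical, and the code does not use the actual COL1A2 reference sequence, so it only shows which glycine codons are compatible with the reported substitution.

```python
def codon_of(cdna_pos):
    """Map a 1-based coding-DNA position to (codon number, position in codon)."""
    if cdna_pos < 1:
        raise ValueError("coding positions are 1-based")
    return (cdna_pos - 1) // 3 + 1, (cdna_pos - 1) % 3 + 1

# Standard genetic code entries needed for this check.
GLY_CODONS = {"GGT", "GGC", "GGA", "GGG"}
CYS_CODONS = {"TGT", "TGC"}

def gly_codons_giving_cys(pos_in_codon, new_base):
    """List glycine codons that become cysteine after one substitution."""
    result = []
    for codon in sorted(GLY_CODONS):
        mutated = codon[:pos_in_codon - 1] + new_base + codon[pos_in_codon:]
        if mutated in CYS_CODONS:
            result.append(codon)
    return result

codon_number, pos_in_codon = codon_of(1009)   # (337, 1)
compatible = gly_codons_giving_cys(pos_in_codon, "T")
```

Position 1009 falls on the first base of codon 337, and a G-to-T change there converts GGT or GGC into TGT or TGC, which is consistent with a glycine-to-cysteine substitution at residue 337.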
Burgess, Diane; Freeling, Michael
2014-01-01
In vertebrates, conserved noncoding elements (CNEs) are functionally constrained sequences that can show striking conservation over >400 million years of evolutionary distance and frequently are located megabases away from target developmental genes. Conserved noncoding sequences (CNSs) in plants are much shorter, and it has been difficult to detect conservation among distantly related genomes. In this article, we show not only that CNS sequences can be detected throughout the eudicot clade of flowering plants, but also that a subset of 37 CNSs can be found in all flowering plants (diverging ∼170 million years ago). These CNSs are functionally similar to vertebrate CNEs, being highly associated with transcription factor and development genes and enriched in transcription factor binding sites. Some of the most highly conserved sequences occur in genes encoding RNA binding proteins, particularly the RNA splicing–associated SR genes. Differences in sequence conservation between plants and animals are likely to reflect differences in the biology of the organisms, with plants being much more able to tolerate genomic deletions and whole-genome duplication events due, in part, to their far greater fecundity compared with vertebrates. PMID:24681619
Fahlgren, Noah; Howell, Miya D.; Kasschau, Kristin D.; Chapman, Elisabeth J.; Sullivan, Christopher M.; Cumbie, Jason S.; Givan, Scott A.; Law, Theresa F.; Grant, Sarah R.; Dangl, Jeffery L.; Carrington, James C.
2007-01-01
In plants, microRNAs (miRNAs) comprise one of two classes of small RNAs that function primarily as negative regulators at the posttranscriptional level. Several MIRNA genes in the plant kingdom are ancient, with conservation extending between angiosperms and the mosses, whereas many others are more recently evolved. Here, we use deep sequencing and computational methods to identify, profile and analyze non-conserved MIRNA genes in Arabidopsis thaliana. 48 non-conserved MIRNA families, nearly all of which were represented by single genes, were identified. Sequence similarity analyses of miRNA precursor foldback arms revealed evidence for recent evolutionary origin of 16 MIRNA loci through inverted duplication events from protein-coding gene sequences. Interestingly, these recently evolved MIRNA genes have taken distinct paths. Whereas some non-conserved miRNAs interact with and regulate target transcripts from gene families that donated parental sequences, others have drifted to the point of non-interaction with parental gene family transcripts. Some young MIRNA loci clearly originated from one gene family but form miRNAs that target transcripts in another family. We suggest that MIRNA genes are undergoing relatively frequent birth and death, with only a subset being stabilized by integration into regulatory networks. PMID:17299599
Characterization of Conserved and Non-conserved Imprinted Genes in Swine
USDA-ARS's Scientific Manuscript database
In order to increase our understanding of the role of imprinted genes in swine reproduction we used two complementary approaches, analysis of imprinting by pyrosequencing, and expression profiling of parthenogenetic fetuses, to carry out a comprehensive analysis of this gene family in swine. Using A...
Conservation of Animal Genetic Resources (AnGR): the Next Decade
USDA-ARS's Scientific Manuscript database
After 20 years, progress has been made in conserving AnGR, but how will it look in ten years? Viewing gene banks and in situ conservation in the context of food security, climate change, and product demand suggests a more efficient use of these practices to support sustainable production. Gene banks sh...
No3CoGP: non-conserved and conserved coexpressed gene pairs.
Mal, Chittabrata; Aftabuddin, Md; Kundu, Sudip
2014-12-08
By analyzing microarray data from different conditions, one can identify conserved and condition-specific genes and gene modules, and thus infer the underlying cellular activities. All the available tools based on Bioconductor and R packages differ in how they extract differential coexpression and at what level they study it. There is a need for a user-friendly, flexible tool that can start the analysis from raw or preprocessed microarray data and report different levels of useful information. We present GUI software, No3CoGP: Non-Conserved and Conserved Coexpressed Gene Pairs, which takes Affymetrix microarray data (.CEL files or log2-normalized .txt files) along with an annotation file (.csv file), a Chip Definition File (CDF file) and a probe file as inputs, and uses the concept of a network density cut-off and Fisher's z-test to extract biologically relevant information. It can identify four possible types of gene pairs based on their coexpression relationships: (i) gene pairs coexpressed in one condition but not in the other, (ii) gene pairs positively coexpressed in one condition but negatively coexpressed in the other, (iii) gene pairs positively coexpressed in both conditions, and (iv) gene pairs negatively coexpressed in both conditions. Further, it can generate modules of coexpressed genes. The easy-to-use GUI enables researchers without knowledge of the R language to use No3CoGP. Using one or more CPU cores, depending on availability, speeds up the program. The output files, stored in the respective directories under the user-defined project, allow researchers to unravel condition-specific functionalities of genes, gene sets or modules.
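The Fisher z-test and the four pair types named in the abstract above can be illustrated with a minimal sketch. This is not No3CoGP's code: the function names, the labels, and the use of a fixed cut-off on the absolute correlation (in place of the network density cut-off the abstract mentions) are assumptions made only for illustration.

```python
import math


def fisher_z_test(r1, n1, r2, n2):
    """Two-sided test for a difference between two Pearson correlations.

    r1, r2 are correlations of the same gene pair in two conditions;
    n1, n2 are the numbers of samples in each condition (each must exceed 3).
    """
    z1 = math.atanh(r1)  # Fisher z-transform of each correlation
    z2 = math.atanh(r2)
    se = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))
    z = (z1 - z2) / se
    p = math.erfc(abs(z) / math.sqrt(2.0))  # two-sided normal p-value
    return z, p


def classify_pair(r1, r2, cutoff=0.8):
    """Assign a gene pair to one of the four types (hypothetical labels).

    The fixed |r| cutoff is an assumption standing in for a
    density-derived threshold.
    """
    co1 = abs(r1) >= cutoff
    co2 = abs(r2) >= cutoff
    if co1 != co2:
        return "coexpressed_in_one_condition"      # type (i)
    if co1 and co2:
        if r1 * r2 < 0:
            return "sign_switched"                  # type (ii)
        return "positive_in_both" if r1 > 0 else "negative_in_both"  # (iii)/(iv)
    return "not_coexpressed"
```

In a sketch like this, the z-test would typically decide whether a change in correlation between conditions is larger than sampling noise, while the cut-off decides whether a pair counts as coexpressed at all.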
El Zoeiby, A; Sanschagrin, F; Lamoureux, J; Darveau, A; Levesque, R C
2000-02-15
We cloned and sequenced the murC gene from Pseudomonas aeruginosa encoding a protein of 53 kDa. Multiple alignments with 20 MurC peptide sequences from different bacteria confirmed the presence of highly conserved regions having sequence identities ranging from 22-97% including conserved motifs for ATP-binding and the active site of the enzyme. Genetic complementation was done in Escherichia coli (murCts) suppressing the lethal phenotype. The murC gene was subcloned into the expression vector pET30a and overexpressed in E. coli BL21(lambdaDE3). Three PCR cloning strategies were used to obtain the three recombinant plasmids for expression of the native MurC, MurC His-tagged at N-terminal and at C-terminal, respectively. MurC His-tagged at C-terminal was chosen for large scale production and protein purification in the soluble form. The purification was done in a single chromatographic step on an affinity nickel column and obtained in mg quantities at 95% homogeneity. MurC protein was used to produce monoclonal antibodies for epitope mapping and for assay development in high throughput screenings. Detailed studies of MurC and other genes of the bacterial cell cycle will provide the reagents and strain constructs for high throughput screening and for design of novel antibacterials.
Genetic control and comparative genomic analysis of flowering time in Setaria (Poaceae).
Mauro-Herrera, Margarita; Wang, Xuewen; Barbier, Hugues; Brutnell, Thomas P; Devos, Katrien M; Doust, Andrew N
2013-02-01
We report the first study on the genetic control of flowering in Setaria, a panicoid grass closely related to switchgrass, and in the same subfamily as maize and sorghum. A recombinant inbred line mapping population derived from a cross between domesticated Setaria italica (foxtail millet) and its wild relative Setaria viridis (green millet), was grown in eight trials with varying environmental conditions to identify a small number of quantitative trait loci (QTL) that control differences in flowering time. Many of the QTL across trials colocalize, suggesting that the genetic control of flowering in Setaria is robust across a range of photoperiod and other environmental factors. A detailed comparison of QTL for flowering in Setaria, sorghum, and maize indicates that several of the major QTL regions identified in maize and sorghum are syntenic orthologs with Setaria QTL, although the maize large effect QTL on chromosome 10 is not. Several Setaria QTL intervals had multiple LOD peaks and were composed of multiple syntenic blocks, suggesting that observed QTL represent multiple tightly linked loci. Candidate genes from flowering time pathways identified in rice and Arabidopsis were identified in Setaria QTL intervals, including those involved in the CONSTANS photoperiod pathway. However, only three of the approximately seven genes cloned for flowering time in maize colocalized with Setaria QTL. This suggests that variation in flowering time in separate grass lineages is controlled by a combination of conserved and lineage specific genes. PMID:23390604
Singh, Komudi; Ju, Jennifer Y.; Walsh, Melissa B.; DiIorio, Michael A.; Hart, Anne C.
2014-01-01
Objectives: Cross-species conservation of sleep-like behaviors predicts the presence of conserved molecular mechanisms underlying sleep. However, limited experimental evidence of conservation exists. Here, this prediction is tested directly. Measurements and Results: During lethargus, Caenorhabditis elegans spontaneously sleep in short bouts that are interspersed with bouts of spontaneous locomotion. We identified 26 genes required for Drosophila melanogaster sleep. Twenty orthologous C. elegans genes were selected based on similarity. Their effect on C. elegans sleep and arousal during the last larval lethargus was assessed. The 20 most similar genes altered both the quantity of sleep and arousal thresholds. In 18 cases, the direction of change was concordant with Drosophila studies published previously. Additionally, we delineated a conserved genetic pathway by which dopamine regulates sleep and arousal. In C. elegans neurons, G-alpha S, adenylyl cyclase, and protein kinase A act downstream of D1 dopamine receptors to regulate these behaviors. Finally, a quantitative analysis of genes examined herein revealed that C. elegans arousal thresholds were directly correlated with amount of sleep during lethargus. However, bout duration varied little and was not correlated with arousal thresholds. Conclusions: The comprehensive analysis presented here suggests that conserved genes and pathways are required for sleep in invertebrates and, likely, across the entire animal kingdom. The genetic pathway delineated in this study implicates G-alpha S and previously known genes downstream of dopamine signaling in sleep. Quantitative analysis of various components of quiescence suggests that interdependent or identical cellular and molecular mechanisms are likely to regulate both arousal and sleep entry. Citation: Singh K, Ju JY, Walsh MB, DiIorio MA, Hart AC. Deep conservation of genes required for both Drosophila melanogaster and Caenorhabditis elegans sleep includes a role for dopaminergic signaling. SLEEP 2014;37(9):1439-1451. PMID:25142568
USDA-ARS's Scientific Manuscript database
Polymerase chain reaction amplification of conserved genes and sequence analysis provides a very powerful tool for the identification of toxigenic as well as non-toxigenic Penicillium species. Sequences are obtained by amplification of the gene fragment, sequencing via capillary electrophoresis of d...
Książkiewicz, Michał; Rychel, Sandra; Nelson, Matthew N; Wyrwa, Katarzyna; Naganowska, Barbara; Wolko, Bogdan
2016-10-21
The Arabidopsis FLOWERING LOCUS T (FT) gene, a member of the phosphatidylethanolamine binding protein (PEBP) family, is a major controller of flowering in response to photoperiod, vernalization and light quality. In legumes, FT evolved into three, functionally diversified clades, FTa, FTb and FTc. A milestone achievement in narrow-leafed lupin (Lupinus angustifolius L.) domestication was the loss of vernalization responsiveness at the Ku locus. Recently, one of two existing L. angustifolius homologs of FTc, LanFTc1, was revealed to be the gene underlying Ku. It is the first recorded involvement of an FTc homologue in vernalization. The evolutionary basis of this phenomenon in lupin has not yet been deciphered. Bacterial artificial chromosome (BAC) clones carrying LanFTc1 and LanFTc2 genes were localized in different mitotic chromosomes and constituted sequence-specific landmarks for linkage groups NLL-10 and NLL-17. BAC-derived superscaffolds containing LanFTc genes revealed clear microsyntenic patterns to genome sequences of nine legume species. Superscaffold-1 carrying LanFTc1 aligned to regions encoding one or more FT-like genes whereas superscaffold-2 mapped to a region lacking such a homolog. Comparative mapping of the L. angustifolius genome assembly anchored to linkage map localized superscaffold-1 in the middle of a 15 cM conserved, collinear region. In contrast, superscaffold-2 was found at the edge of a 20 cM syntenic block containing highly disrupted collinearity at the LanFTc2 locus. 118 PEBP-family full-length homologs were identified in 10 legume genomes. Bayesian phylogenetic inference provided novel evidence supporting the hypothesis that whole-genome and tandem duplications contributed to expansion of PEBP-family genes in legumes. Duplicated genes were subjected to strong purifying selection. 
Promoter analysis of FT genes revealed no statistically significant sequence similarity between duplicated copies; only RE-alpha and CCAAT-box motifs were found at conserved positions and orientations. Numerous lineage-specific duplications occurred during the evolution of legume PEBP-family genes. Whole-genome duplications resulted in the origin of subclades FTa, FTb and FTc and in the multiplication of FTa and FTb copy number. LanFTc1 is located in the region conserved among all main lineages of Papilionoideae. LanFTc1 is a direct descendant of ancestral FTc, whereas LanFTc2 appeared by subsequent duplication.
Sela, D. A.; Chapman, J.; Adeuya, A.; Kim, J. H.; Chen, F.; Whitehead, T. R.; Lapidus, A.; Rokhsar, D. S.; Lebrilla, C. B.; German, J. B.; Price, N. P.; Richardson, P. M.; Mills, D. A.
2008-01-01
Following birth, the breast-fed infant gastrointestinal tract is rapidly colonized by a microbial consortium often dominated by bifidobacteria. Accordingly, the complete genome sequence of Bifidobacterium longum subsp. infantis ATCC15697 reflects a competitive nutrient-utilization strategy targeting milk-borne molecules which lack a nutritive value to the neonate. Several chromosomal loci reflect potential adaptation to the infant host including a 43 kbp cluster encoding catabolic genes, extracellular solute binding proteins and permeases predicted to be active on milk oligosaccharides. An examination of in vivo metabolism has detected the hallmarks of milk oligosaccharide utilization via the central fermentative pathway using metabolomic and proteomic approaches. Finally, conservation of gene clusters in multiple isolates corroborates the genomic mechanism underlying milk utilization for this infant-associated phylotype. PMID:19033196
Gao, Junpeng; Cao, Xiaoli; Shi, Shandang; Ma, Yuling; Wang, Kai; Liu, Shengjie; Chen, Dan; Chen, Qin; Ma, Haoli
2016-03-04
The Auxin/indole-3-acetic acid (Aux/IAA) genes encode short-lived nuclear proteins that are known to be involved in the primary cellular responses to auxin. To date, systematic analysis of the Aux/IAA genes in potato (Solanum tuberosum) has not been conducted. In this study, a total of 26 potato Aux/IAA genes were identified (designated from StIAA1 to StIAA26), and the distribution of four conserved domains shared by the StIAAs was analyzed based on multiple sequence alignment and a motif-based sequence analysis. A phylogenetic analysis of the Aux/IAA gene families of potato and Arabidopsis was also conducted. In order to assess the roles of StIAA genes in tuber development, the results of RNA-seq studies were reformatted to analyze the expression patterns of StIAA genes, and then verified by quantitative real-time PCR. A large number of StIAA genes (12 genes) were highly expressed in stolon organs and during the tuber initiation and expansion developmental stages, and most of these genes were responsive to indoleacetic acid treatment. Our results suggested that StIAA genes were involved in the process of tuber development and provided insights into functional roles of potato Aux/IAA genes.
Dumas, Kathleen J; Delaney, Colin E; Flibotte, Stephane; Moerman, Donald G; Csankovszki, Gyorgyi; Hu, Patrick J
2013-07-01
During embryogenesis, an essential process known as dosage compensation is initiated to equalize gene expression from sex chromosomes. Although much is known about how dosage compensation is established, the consequences of modulating the stability of dosage compensation postembryonically are not known. Here we define a role for the Caenorhabditis elegans dosage compensation complex (DCC) in the regulation of DAF-2 insulin-like signaling. In a screen for dauer regulatory genes that control the activity of the FoxO transcription factor DAF-16, we isolated three mutant alleles of dpy-21, which encodes a conserved DCC component. Knockdown of multiple DCC components in hermaphrodite and male animals indicates that the dauer suppression phenotype of dpy-21 mutants is due to a defect in dosage compensation per se. In dpy-21 mutants, expression of several X-linked genes that promote dauer bypass is elevated, including four genes encoding components of the DAF-2 insulin-like pathway that antagonize DAF-16/FoxO activity. Accordingly, dpy-21 mutation reduced the expression of DAF-16/FoxO target genes by promoting the exclusion of DAF-16/FoxO from nuclei. Thus, dosage compensation enhances dauer arrest by repressing X-linked genes that promote reproductive development through the inhibition of DAF-16/FoxO nuclear translocation. This work is the first to establish a specific postembryonic function for dosage compensation in any organism. The influence of dosage compensation on dauer arrest, a larval developmental fate governed by the integration of multiple environmental inputs and signaling outputs, suggests that the dosage compensation machinery may respond to external cues by modulating signaling pathways through chromosome-wide regulation of gene expression.
Brenner, Eric D; Katari, Manpreet S; Stevenson, Dennis W; Rudd, Stephen A; Douglas, Andrew W; Moss, Walter N; Twigg, Richard W; Runko, Suzan J; Stellari, Giulia M; McCombie, WR; Coruzzi, Gloria M
2005-01-01
Background Ginkgo biloba L. is the only surviving member of one of the oldest living seed plant groups with medicinal, spiritual and horticultural importance worldwide. As an evolutionary relic, it displays many characters found in the early, extinct seed plants and extant cycads. To establish a molecular base to understand the evolution of seeds and pollen, we created a cDNA library and EST dataset from the reproductive structures of male (microsporangiate), female (megasporangiate), and vegetative organs (leaves) of Ginkgo biloba. Results RNA from newly emerged male and female reproductive organs and immature leaves was used to create three distinct cDNA libraries from which 6,434 ESTs were generated. These 6,434 ESTs from Ginkgo biloba were clustered into 3,830 unigenes. A comparison of our Ginkgo unigene set against the fully annotated genomes of rice and Arabidopsis, and all available ESTs in GenBank revealed that 256 Ginkgo unigenes match only genes among the gymnosperms and non-seed plants – many with multiple matches to genes in non-angiosperm plants. Conversely, another group of unigenes in Ginkgo had highly significant homology to transcription factors in angiosperms involved in development, including MADS box genes as well as post-transcriptional regulators. Several of the conserved developmental genes found in Ginkgo had top BLAST homology to cycad genes. We also note here the presence of ESTs in G. biloba similar to genes that to date have only been found in gymnosperms and an additional 22 Ginkgo genes common only to genes from cycads. Conclusion Our analysis of an EST dataset from G. biloba revealed genes potentially unique to gymnosperms. Many of these genes showed homology to fully sequenced clones from our cycad EST dataset found in common only with gymnosperms. Other Ginkgo ESTs are similar to developmental regulators in higher plants. 
This work sets the stage for future studies on Ginkgo to better understand seed and pollen evolution, and to resolve the ambiguous phylogenetic relationship of G. biloba among the gymnosperms. PMID:16225698
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lan, Yemin; Rosen, Gail; Hershberg, Ruth
2016-05-03
The 16s rRNA gene is so far the most widely used marker for taxonomical classification and separation of prokaryotes. Since it is universally conserved among prokaryotes, it is possible to use this gene to classify a broad range of prokaryotic organisms. At the same time, it has often been noted that the 16s rRNA gene is too conserved to separate between prokaryotes at finer taxonomic levels. In this paper, we examine how well levels of similarity of 16s rRNA and 73 additional universal or nearly universal marker genes correlate with genome-wide levels of gene sequence similarity. We demonstrate that the percent identity of 16s rRNA predicts genome-wide levels of similarity very well for distantly related prokaryotes, but not for closely related ones. In closely related prokaryotes, we find that there are many other marker genes for which levels of similarity are much more predictive of genome-wide levels of gene sequence similarity. Finally, we show that the identities of the markers that are most useful for predicting genome-wide levels of similarity within closely related prokaryotic lineages vary greatly between lineages. However, the most useful markers are always those that are least conserved in their sequences within each lineage. In conclusion, our results show that by choosing markers that are less conserved in their sequences within a lineage of interest, it is possible to better predict genome-wide gene sequence similarity between closely related prokaryotes than is possible using the 16s rRNA gene. We point readers towards a database we have created (POGO-DB) that can be used to easily establish which markers show lowest levels of sequence conservation within different prokaryotic lineages.
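The comparison described in the abstract above rests on two simple quantities: the percent identity of a marker gene between two genomes, and how well that value tracks genome-wide similarity across many genome pairs. The sketch below shows one way to compute both. It assumes the sequences are already aligned to equal length with "-" as the gap character, and the function names are hypothetical; it is not the authors' pipeline and does not use POGO-DB.

```python
import math


def percent_identity(aligned_a, aligned_b):
    """Percent identity over aligned columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == "-" or y == "-":
            continue  # skip gapped columns
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


def pearson(xs, ys):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)
```

Given marker identities and genome-wide identities for the same set of genome pairs, `pearson(marker_ids, genome_ids)` would give a rough measure of how predictive a marker is within a lineage.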
Pathogen evolution and disease emergence in carnivores.
McCarthy, Alex J; Shaw, Marie-Anne; Goodman, Simon J
2007-12-22
Emerging infectious diseases constitute some of the most pressing problems for both human and domestic animal health, and biodiversity conservation. Currently it is not clear whether the removal of past constraints on geographical distribution and transmission possibilities for pathogens alone are sufficient to give rise to novel host-pathogen combinations, or whether pathogen evolution is also generally required for establishment in novel hosts. Canine distemper virus (CDV) is a morbillivirus that is prevalent in the world dog population and poses an important conservation threat to a diverse range of carnivores. We performed an extensive phylogenetic and molecular evolution analysis on complete sequences of all CDV genes to assess the role of selection and recombination in shaping viral genetic diversity and driving the emergence of CDV in non-dog hosts. We tested the specific hypothesis that molecular adaptation at known receptor-binding sites of the haemagglutinin gene is associated with independent instances of the spread of CDV to novel non-dog hosts in the wild. This hypothesis was upheld, providing compelling evidence that repeated evolution at known functional sites (in this case residues 530 and 549 of the haemagglutinin molecule) is associated with multiple independent occurrences of disease emergence in a range of novel host species.
Tse, Longping Victor; Klinc, Kelli A; Madigan, Victoria J; Castellanos Rivera, Ruth M; Wells, Lindsey F; Havlik, L Patrick; Smith, J Kennon; Agbandje-McKenna, Mavis; Asokan, Aravind
2017-06-13
Preexisting neutralizing antibodies (NAbs) against adeno-associated viruses (AAVs) pose a major, unresolved challenge that restricts patient enrollment in gene therapy clinical trials using recombinant AAV vectors. Structural studies suggest that despite a high degree of sequence variability, antibody recognition sites or antigenic hotspots on AAVs and other related parvoviruses might be evolutionarily conserved. To test this hypothesis, we developed a structure-guided evolution approach that does not require selective pressure exerted by NAbs. This strategy yielded highly divergent antigenic footprints that do not exist in natural AAV isolates. Specifically, synthetic variants obtained by evolving murine antigenic epitopes on an AAV serotype 1 capsid template can evade NAbs without compromising titer, transduction efficiency, or tissue tropism. One lead AAV variant generated by combining multiple evolved antigenic sites effectively evades polyclonal anti-AAV1 neutralizing sera from immunized mice and rhesus macaques. Furthermore, this variant displays robust immune evasion in nonhuman primate and human serum samples at dilution factors as high as 1:5, currently mandated by several clinical trials. Our results provide evidence that antibody recognition of AAV capsids is conserved across species. This approach can be applied to any AAV strain to evade NAbs in prospective patients for human gene therapy.
Global Alignment of Pairwise Protein Interaction Networks for Maximal Common Conserved Patterns
Tian, Wenhong; Samatova, Nagiza F.
2013-01-01
A number of tools for the alignment of protein-protein interaction (PPI) networks have laid the foundation for PPI network analysis. Most alignment tools focus on finding conserved interaction regions across the PPI networks through either local or global mapping of similar sequences. Researchers are still trying to improve the speed, scalability, and accuracy of network alignment. In view of this, we introduce a connected-components based fast algorithm, HopeMap, for network alignment. Observing that the size of true orthologs across species is small compared to the total number of proteins in all species, we take a different approach based on a precompiled list of homologs identified by KO terms. Applying this approach to S. cerevisiae (yeast) and D. melanogaster (fly), E. coli K12 and S. typhimurium, and E. coli K12 and C. crescentus, we analyze all clusters identified in the alignment. The results are evaluated through up-to-date known gene annotations, gene ontology (GO), and KEGG ortholog groups (KO). Compared to existing tools, our approach is fast with linear computational cost, highly accurate in terms of KO and GO terms specificity and sensitivity, and can be extended to multiple alignments easily.
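The general idea of a connected-components alignment over a precompiled homolog list can be sketched as follows. This is not the HopeMap algorithm, whose details are not given in the abstract. It assumes one common formulation: each homolog pair is a node, two nodes are linked when both members interact in their respective networks, and the connected components are reported as conserved clusters. All names are hypothetical, and this naive version compares every pair of nodes, so it is quadratic in the number of homolog pairs rather than linear.

```python
from collections import defaultdict


def conserved_components(edges_a, edges_b, homologs):
    """Group homolog pairs into clusters conserved in both networks.

    edges_a, edges_b: iterables of (u, v) undirected interactions.
    homologs: iterable of (protein_in_a, protein_in_b) pairs.
    Returns a list of clusters (sorted lists of pairs), largest first.
    """
    set_a = {frozenset(e) for e in edges_a}
    set_b = {frozenset(e) for e in edges_b}
    nodes = list(dict.fromkeys(homologs))  # de-duplicate, keep order

    # Union-find over homolog pairs
    parent = {n: n for n in nodes}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for i, (a1, b1) in enumerate(nodes):
        for a2, b2 in nodes[i + 1:]:
            # Link two pairs only if the interaction is present in both networks
            if frozenset((a1, a2)) in set_a and frozenset((b1, b2)) in set_b:
                parent[find((a1, b1))] = find((a2, b2))

    groups = defaultdict(list)
    for n in nodes:
        groups[find(n)].append(n)
    return sorted((sorted(g) for g in groups.values()),
                  key=lambda g: (-len(g), g))
```

For example, with interactions a1-a2 and a2-a3 in one network, b1-b2 in the other, and homologs (a1, b1), (a2, b2), (a3, b3), the first two pairs form one conserved cluster and the third stays on its own.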
Chakravorty, S; Sarkar, S; Gachhui, R
2015-01-01
The Acetobacteraceae family of the class Alpha Proteobacteria is comprised of high sugar and acid tolerant bacteria. The Acetic Acid Bacteria are the economically most significant group of this family because of their association with food products like vinegar, wine etc. Acetobacteraceae are often hard to culture in laboratory conditions and they also maintain very low abundances in their natural habitats. Thus identification of the organisms in such environments is greatly dependent on modern tools of molecular biology which require a thorough knowledge of specific conserved gene sequences that may act as primers and/or probes. Moreover, unconserved domains in genes also become markers for differentiating closely related genera. In bacteria, the 16S rRNA gene is an ideal candidate for such conserved and variable domains. In order to study the conserved and variable domains of the 16S rRNA gene of Acetic Acid Bacteria and the Acetobacteraceae family, sequences from publicly available databases were aligned and compared. Near complete sequences of the gene were also obtained from Kombucha tea biofilm, a known Acetobacteraceae family habitat, in order to corroborate the domains obtained from the alignment studies. The study indicated that the degree of conservation in the gene is significantly higher among the Acetic Acid Bacteria than the whole Acetobacteraceae family. Moreover, it was also observed that the previously described hypervariable regions V1, V3, V5, V6 and V7 were more or less conserved in the family and the spans of the variable regions are quite distinct as well.
Evolutionary conservation of regulated longevity assurance mechanisms
McElwee, Joshua J; Schuster, Eugene; Blanc, Eric; Piper, Matthew D; Thomas, James H; Patel, Dhaval S; Selman, Colin; Withers, Dominic J; Thornton, Janet M; Partridge, Linda; Gems, David
2007-01-01
Background To what extent are the determinants of aging in animal species universal? Insulin/insulin-like growth factor (IGF)-1 signaling (IIS) is an evolutionarily conserved (public) regulator of longevity; yet it remains unclear whether the genes and biochemical processes through which IIS acts on aging are public or private (that is, lineage specific). To address this, we have applied a novel, multi-level cross-species comparative analysis to compare gene expression changes accompanying increased longevity in mutant nematodes, fruitflies and mice with reduced IIS. Results Surprisingly, there is little evolutionary conservation at the level of individual, orthologous genes or paralogous genes under IIS regulation. However, a number of gene categories are significantly enriched for genes whose expression changes in long-lived animals of all three species. Down-regulated categories include protein biosynthesis-associated genes. Up-regulated categories include sugar catabolism, energy generation, glutathione-S-transferases (GSTs) and several other categories linked to cellular detoxification (that is, phase 1 and phase 2 metabolism of xenobiotic and endobiotic toxins). Protein biosynthesis and GST activity have recently been linked to aging and longevity assurance, respectively. Conclusion These processes represent candidate, regulated mechanisms of longevity-control that are conserved across animal species. The longevity assurance mechanisms via which IIS acts appear to be lineage-specific at the gene level (private), but conserved at the process level (or semi-public). In the case of GSTs, and cellular detoxification generally, this suggests that the mechanisms of aging against which longevity assurance mechanisms act are, to some extent, lineage specific. PMID:17612391
From wild wolf to domestic dog: gene expression changes in the brain.
Saetre, Peter; Lindberg, Julia; Leonard, Jennifer A; Olsson, Kerstin; Pettersson, Ulf; Ellegren, Hans; Bergström, Tomas F; Vilà, Carles; Jazin, Elena
2004-07-26
Despite the relatively recent divergence time between domestic dogs (Canis familiaris) and gray wolves (Canis lupus), the two species show remarkable behavioral differences. Since dogs and wolves are nearly identical at the level of DNA sequence, we hypothesize that the two species may differ in patterns of gene expression. We compare gene expression patterns in dogs, wolves and a close relative, the coyote (Canis latrans), in three parts of the brain: hypothalamus, amygdala and frontal cortex, with microarray technology. Additionally, we identify genes with region-specific expression patterns in all three species. Among the wild canids, the hypothalamus has a highly conserved expression profile. This contrasts with a marked divergence in domestic dogs. Real-time PCR experiments confirm the altered expression of two neuropeptides, CALCB and NPY. Our results suggest that strong selection on dogs for behavior during domestication may have resulted in modifications of mRNA expression patterns in a few hypothalamic genes with multiple functions. This study indicates that rapid changes in brain gene expression may not be exclusive to the development of human brains. Instead, they may provide a common mechanism for rapid adaptive changes during speciation, particularly in cases that present strong selective pressures on behavioral characters.
Hu, Wei; Wang, Lianzhe; Tie, Weiwei; Yan, Yan; Ding, Zehong; Liu, Juhua; Li, Meiying; Peng, Ming; Xu, Biyu; Jin, Zhiqiang
2016-01-01
The basic leucine zipper (bZIP) transcription factors play important roles in multiple biological processes. However, less information is available regarding the bZIP family in the important fruit crop banana. In this study, 121 bZIP transcription factor genes were identified in the banana genome. Phylogenetic analysis showed that MabZIPs were classified into 11 subfamilies. The majority of MabZIP genes in the same subfamily shared similar gene structures and conserved motifs. The comprehensive transcriptome analysis of two banana genotypes revealed the differential expression patterns of MabZIP genes in different organs, in various stages of fruit development and ripening, and in responses to abiotic stresses, including drought, cold, and salt. Interaction networks and co-expression assays showed that group A MabZIP-mediated networks participated in various stress signaling, which was strongly activated in Musa ABB Pisang Awak. This study provided new insights into the complicated transcriptional control of MabZIP genes and provided robust tissue-specific, development-dependent, and abiotic stress-responsive candidate MabZIP genes for potential applications in the genetic improvement of banana cultivars. PMID:27445085
Genetic adaptations of the plateau zokor in high-elevation burrows.
Shao, Yong; Li, Jin-Xiu; Ge, Ri-Li; Zhong, Li; Irwin, David M; Murphy, Robert W; Zhang, Ya-Ping
2015-11-25
The plateau zokor (Myospalax baileyi) spends its entire life underground in sealed burrows. Confronting limited oxygen, high carbon dioxide concentrations, and complete darkness, it epitomizes a successful physiological adaptation. Here, we employ transcriptome sequencing to explore the genetic underpinnings of its adaptations to this unique habitat. Compared to Rattus norvegicus, genes belonging to GO categories related to energy metabolism (e.g. mitochondrion and fatty acid beta-oxidation) underwent accelerated evolution in the plateau zokor. Furthermore, positively selected genes were significantly enriched in the gene categories involved in ATPase activity, blood vessel development and respiratory gaseous exchange, functional categories that are relevant to adaptation to high altitudes. Among the 787 genes with evidence of parallel evolution, and thus identified as candidate genes, several GO categories (e.g. response to hypoxia, oxygen homeostasis and erythrocyte homeostasis) are significantly enriched. These candidates include two genes, EPAS1 and AJUBA, involved in the response to hypoxia, in which the parallel evolved sites are at positions that are highly conserved in sequence alignments from multiple species. Thus, accelerated evolution of GO categories, positive selection and parallel evolution at the molecular level provide evidence with which to parse the genetic adaptations of the plateau zokor for living in high-elevation burrows.
Drosophila nemo is an essential gene involved in the regulation of programmed cell death.
Mirkovic, Ivana; Charish, Kristi; Gorski, Sharon M; McKnight, Kristen; Verheyen, Esther M
2002-11-01
Nemo-like kinases define a novel family of serine/threonine kinases that are involved in integrating multiple signaling pathways. They are conserved regulators of Wnt/Wingless pathways, which may coordinate Wnt with TGFbeta-mediated signaling. Drosophila nemo was identified through its involvement in epithelial planar polarity, a process regulated by a non-canonical Wnt pathway. We have previously found that ectopic expression of Nemo using the Gal4-UAS system resulted in embryonic lethality associated with defects in patterning and head development. In this study we present our analyses of the phenotypes of germline clone-derived embryos. We observe lethality associated with head defects and reduction of programmed cell death and conclude that nmo is an essential gene. We also present data showing that nmo is involved in regulating apoptosis during eye development, based on both loss of function phenotypes and on genetic interactions with the pro-apoptotic gene reaper. Finally, we present genetic data from the adult wing that suggest the activity of ectopically expressed Nemo can be modulated by Jun N-terminal kinase (JNK) signaling. Such an observation supports the model that there is cross-talk between Wnt, TGFbeta and JNK signaling at multiple stages of development. Copyright 2002 Elsevier Science Ireland Ltd.
Xu, Yue; Li, Song Feng; Parish, Roger W
2017-07-01
Targeted gene manipulation is a central strategy for studying gene function and identifying related biological processes. However, a methodology for manipulating the regulatory motifs of transcription factors is lacking as these factors commonly possess multiple motifs (e.g. repression and activation motifs) which collaborate with each other to regulate multiple biological processes. We describe a novel approach designated conserved sequence-guided repressor inhibition (CoSRI) that can specifically reduce or abolish the repressive activities of transcription factors in vivo. The technology was evaluated using the chimeric MYB80-EAR transcription factor and subsequently the endogenous WUS transcription factor. The technology was employed to develop a reversible male sterility system applicable to hybrid seed production. In order to determine the capacity of the technology to regulate the activity of endogenous transcription factors, the WUS repressor was chosen. The WUS repression motif could be inhibited in vivo and the transformed plants exhibited the wus-1 phenotype. Consequently, the technology can be used to manipulate the activities of transcriptional repressor motifs regulating beneficial traits in crop plants and other eukaryotic organisms. © 2016 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Dynamic and Widespread lncRNA Expression in a Sponge and the Origin of Animal Complexity
Gaiti, Federico; Fernandez-Valverde, Selene L.; Nakanishi, Nagayasu; Calcino, Andrew D.; Yanai, Itai; Tanurdzic, Milos; Degnan, Bernard M.
2015-01-01
Long noncoding RNAs (lncRNAs) are important developmental regulators in bilaterian animals. A correlation has been claimed between the lncRNA repertoire expansion and morphological complexity in vertebrate evolution. However, this claim has not been tested by examining morphologically simple animals. Here, we undertake a systematic investigation of lncRNAs in the demosponge Amphimedon queenslandica, a morphologically simple, early-branching metazoan. We combine RNA-Seq data across multiple developmental stages of Amphimedon with a filtering pipeline to conservatively predict 2,935 lncRNAs. These include intronic overlapping lncRNAs, exonic antisense overlapping lncRNAs, long intergenic nonprotein coding RNAs, and precursors for small RNAs. Sponge lncRNAs are remarkably similar to their bilaterian counterparts in being relatively short with few exons and having low primary sequence conservation relative to protein-coding genes. As in bilaterians, a majority of sponge lncRNAs exhibit typical hallmarks of regulatory molecules, including high temporal specificity and dynamic developmental expression. Specific lncRNA expression profiles correlate tightly with conserved protein-coding genes likely involved in a range of developmental and physiological processes, such as the Wnt signaling pathway. Although the majority of Amphimedon lncRNAs appears to be taxonomically restricted with no identifiable orthologs, we find a few cases of conservation between demosponges in lncRNAs that are antisense to coding sequences. Based on the high similarity in the structure, organization, and dynamic expression of sponge lncRNAs to their bilaterian counterparts, we propose that these noncoding RNAs are an ancient feature of the metazoan genome. These results are consistent with lncRNAs regulating the development of animals, regardless of their level of morphological complexity. PMID:25976353
Bowling, Bethany V.; Schultheis, Patrick J.
2015-01-01
Saccharomyces cerevisiae was the first eukaryotic organism to be sequenced; however, little progress has been made in recent years in furthering our understanding of all open reading frames (ORFs). From October 2012 to May 2015 the number of verified ORFs has only risen from 75.31% to 78%, while the number of uncharacterized ORFs has decreased from 12.8% to 11% (representing more than 700 genes still left in this category) [http://www.yeastgenome.org/genomesnapshot]. Course-based research has been shown to increase student learning while providing experience with real scientific investigation; however, implementation in large, multi-section courses presents many challenges. This study sought to test the feasibility and effectiveness of incorporating authentic research into a core genetics course with multiple instructors to increase student learning and progress our understanding of uncharacterized ORFs. We generated a module-based annotation toolkit and utilized easily accessible bioinformatics tools to predict gene function for uncharacterized ORFs within the Saccharomyces Genome Database (SGD). Students were each assigned an uncharacterized ORF, which they annotated using contemporary comparative genomics methodologies including multiple sequence alignment, conserved domain identification, signal peptide prediction and cellular localization algorithms. Student learning outcomes were measured by quizzes, project reports and presentations, as well as a post-project questionnaire. Our results indicate that the authentic research experience had positive impacts on students' perception of their learning and their confidence to conduct future research. Furthermore, we believe that creation of an online repository and adoption and/or adaptation of this project across multiple researchers and institutions could speed the process of gene function prediction. PMID:26460164
Bowling, Bethany V; Schultheis, Patrick J; Strome, Erin D
2016-02-01
Saccharomyces cerevisiae was the first eukaryotic organism to be sequenced; however, little progress has been made in recent years in furthering our understanding of all open reading frames (ORFs). From October 2012 to May 2015 the number of verified ORFs had only risen from 75.31% to 78%, while the number of uncharacterized ORFs had decreased from 12.8% to 11% (representing > 700 genes still left in this category; http://www.yeastgenome.org/genomesnapshot). Course-based research has been shown to increase student learning while providing experience with real scientific investigation; however, implementation in large, multi-section courses presents many challenges. This study sought to test the feasibility and effectiveness of incorporating authentic research into a core genetics course, with multiple instructors, to increase student learning and progress our understanding of uncharacterized ORFs. We generated a module-based annotation toolkit and utilized easily accessible bioinformatics tools to predict gene function for uncharacterized ORFs within the Saccharomyces Genome Database (SGD). Students were each assigned an uncharacterized ORF, which they annotated using contemporary comparative genomics methodologies, including multiple sequence alignment, conserved domain identification, signal peptide prediction and cellular localization algorithms. Student learning outcomes were measured by quizzes, project reports and presentations, as well as a post-project questionnaire. Our results indicate that the authentic research experience had positive impacts on students' perception of their learning and their confidence to conduct future research. Furthermore, we believe that creation of an online repository and adoption and/or adaptation of this project across multiple researchers and institutions could speed the process of gene function prediction. Copyright © 2015 John Wiley & Sons, Ltd.
Charlesworth, Jac C; Peralta, Juan M; Drigalenko, Eugene; Göring, Harald Hh; Almasy, Laura; Dyer, Thomas D; Blangero, John
2009-12-15
Gene identification using linkage, association, or genome-wide expression is often underpowered. We propose that formal combination of information from multiple gene-identification approaches may lead to the identification of novel loci that are missed when only one form of information is available. Firstly, we analyze the Genetic Analysis Workshop 16 Framingham Heart Study Problem 2 genome-wide association data for HDL-cholesterol using a "gene-centric" approach. Then we formally combine the association test results with genome-wide transcriptional profiling data for high-density lipoprotein cholesterol (HDL-C), from the San Antonio Family Heart Study, using a Z-transform test (Stouffer's method). We identified 39 genes by the joint test at a conservative 1% false-discovery rate, including 9 from the significant gene-based association test and 23 whose expression was significantly correlated with HDL-C. Seven genes identified as significant in the joint test were not independently identified by either the association or expression tests. This combined approach has increased power and leads to the direct nomination of novel candidate genes likely to be involved in the determination of HDL-C levels. Such information can then be used as justification for a more exhaustive search for functional sequence variation within the nominated genes. We anticipate that this type of analysis will improve our speed of identification of regulatory genes causally involved in disease risk.
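The joint test described above combines per-gene evidence from two sources with a Z-transform test (Stouffer's method). A minimal Python sketch of that combination step follows. It assumes one-sided p-values and equal weights by default; the function name `stouffer_combine` and the optional `weights` argument are illustrative and are not taken from the study.

```python
from math import sqrt
from statistics import NormalDist

_STD_NORMAL = NormalDist()  # standard normal: mean 0, standard deviation 1


def stouffer_combine(p_values, weights=None):
    """Combine one-sided p-values with Stouffer's Z-transform method.

    Each p-value must lie strictly between 0 and 1; otherwise
    NormalDist.inv_cdf raises statistics.StatisticsError.
    Returns (combined_z, combined_p).
    """
    if not p_values:
        raise ValueError("at least one p-value is required")
    if weights is None:
        weights = [1.0] * len(p_values)  # assumption: equal weights
    if len(weights) != len(p_values):
        raise ValueError("weights and p_values must have the same length")

    # Transform each p-value to a standard normal deviate.
    z_scores = [_STD_NORMAL.inv_cdf(1.0 - p) for p in p_values]

    # A weighted sum of independent standard normals, rescaled to unit variance.
    combined_z = sum(w * z for w, z in zip(weights, z_scores)) / sqrt(
        sum(w * w for w in weights)
    )
    combined_p = 1.0 - _STD_NORMAL.cdf(combined_z)
    return combined_z, combined_p


if __name__ == "__main__":
    # Hypothetical inputs: association p = 0.05, expression p = 0.05.
    z, p = stouffer_combine([0.05, 0.05])
    print(f"combined z = {z:.3f}, combined p = {p:.4f}")
```

In this sketch, two modest signals of p = 0.05 each combine to a joint p-value close to 0.01. This illustrates how a gene can reach significance in the joint test without doing so in either test alone. The method assumes the two tests are independent.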
Dinucleotide controlled null models for comparative RNA gene prediction.
Gesell, Tanja; Washietl, Stefan
2008-05-27
Comparative prediction of RNA structures can be used to identify functional noncoding RNAs in genomic screens. It was shown recently by Babak et al. [BMC Bioinformatics. 8:33] that RNA gene prediction programs can be biased by the genomic dinucleotide content, in particular those programs using a thermodynamic folding model including stacking energies. As a consequence, there is need for dinucleotide-preserving control strategies to assess the significance of such predictions. While there have been randomization algorithms for single sequences for many years, the problem has remained challenging for multiple alignments and there is currently no algorithm available. We present a program called SISSIz that simulates multiple alignments of a given average dinucleotide content. Meeting additional requirements of an accurate null model, the randomized alignments are on average of the same sequence diversity and preserve local conservation and gap patterns. We make use of a phylogenetic substitution model that includes overlapping dependencies and site-specific rates. Using fast heuristics and a distance based approach, a tree is estimated under this model which is used to guide the simulations. The new algorithm is tested on vertebrate genomic alignments and the effect on RNA structure predictions is studied. In addition, we directly combined the new null model with the RNAalifold consensus folding algorithm giving a new variant of a thermodynamic structure based RNA gene finding program that is not biased by the dinucleotide content. SISSIz implements an efficient algorithm to randomize multiple alignments preserving dinucleotide content. It can be used to get more accurate estimates of false positive rates of existing programs, to produce negative controls for the training of machine learning based programs, or as standalone RNA gene finding program. Other applications in comparative genomics that require randomization of multiple alignments can be considered. 
SISSIz is available as open source C code that can be compiled for every major platform and downloaded here: http://sourceforge.net/projects/sissiz.
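The statistic that a dinucleotide-controlled null model holds fixed can be illustrated with a short Python sketch. This is not the SISSIz algorithm, which simulates alignments along a tree. The sketch only computes dinucleotide frequencies for a single sequence and, under the assumption that "average" content means counts pooled over ungapped rows, for an alignment. Both function names are hypothetical.

```python
from collections import Counter


def _dinucleotide_counts(seq):
    """Count overlapping dinucleotides in an ungapped sequence."""
    seq = seq.upper()
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def dinucleotide_frequencies(seq):
    """Return relative frequencies of overlapping dinucleotides in seq."""
    counts = _dinucleotide_counts(seq)
    total = sum(counts.values())
    return {pair: n / total for pair, n in counts.items()} if total else {}


def alignment_dinucleotide_frequencies(rows, gap="-"):
    """Pool dinucleotide counts over all rows of an alignment.

    Assumption: gap characters are removed from each row before counting,
    so dinucleotides are counted between residues adjacent in the sequence.
    """
    pooled = Counter()
    for row in rows:
        pooled.update(_dinucleotide_counts(row.replace(gap, "")))
    total = sum(pooled.values())
    return {pair: n / total for pair, n in pooled.items()} if total else {}


if __name__ == "__main__":
    print(dinucleotide_frequencies("ACGT"))
    print(alignment_dinucleotide_frequencies(["AC-GT", "ACGGT"]))
```

Comparing these frequencies between a native alignment and its randomized controls is one simple way to check that a randomization procedure has preserved dinucleotide content.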
Conservation strategies for forest gene resources
F. Thomas Ledig
1986-01-01
Gene conservation has three facets: (1) the maintenance of diversity in production plantations to buffer against vulnerability to pests and climatic extremes; (2) the preservation of genes for their future value in breeding; (3) the protection of species to promote ecosystem stability. Maintaining diversity as a hedge against damaging agents is a simple strategy in...
Functional Conservation of MIKC*-Type MADS Box Genes in Arabidopsis and Rice Pollen Maturation
Liu, Yuan; Cui, Shaojie; Wu, Feng; Yan, Shuo; Lin, Xuelei; Du, Xiaoqiu; Chong, Kang; Schilling, Susanne; Theißen, Günter; Meng, Zheng
2013-01-01
There are two groups of MADS intervening keratin-like and C-terminal (MIKC)-type MADS box genes, MIKC^C type and MIKC* type. In seed plants, the MIKC^C type shows considerable diversity, but the MIKC* type has only two subgroups, P- and S-clade, which show conserved expression in the gametophyte. To examine the functional conservation of MIKC*-type genes, we characterized all three rice (Oryza sativa) MIKC*-type genes. All three genes are specifically expressed late in pollen development. The single knockdown or knockout lines, respectively, of the S-clade MADS62 and MADS63 did not show a mutant phenotype, but lines in which both S-clade genes were affected showed severe defects in pollen maturation and germination, as did knockdown lines of MADS68, the only P-clade gene in rice. The rice MIKC*-type proteins form strong heterodimeric complexes solely with partners from the other subclade; these complexes specifically bind to N10-type C-A-rich-G-boxes in vitro and regulate downstream gene expression by binding to N10-type promoter motifs. The rice MIKC* genes have a much lower degree of functional redundancy than the Arabidopsis thaliana MIKC* genes. Nevertheless, our data indicate that the function of heterodimeric MIKC*-type protein complexes in pollen development has been conserved since the divergence of monocots and eudicots, roughly 150 million years ago. PMID:23613199
Conserved gene regulatory module specifies lateral neural borders across bilaterians
Li, Yongbin; Zhao, Di; Horie, Takeo; Chen, Geng; Bao, Hongcun; Chen, Siyu; Liu, Weihong; Horie, Ryoko; Liang, Tao; Dong, Biyu; Feng, Qianqian; Tao, Qinghua
2017-01-01
The lateral neural plate border (NPB), the neural part of the vertebrate neural border, is composed of central nervous system (CNS) progenitors and peripheral nervous system (PNS) progenitors. In invertebrates, PNS progenitors are also juxtaposed to the lateral boundary of the CNS. Whether there are conserved molecular mechanisms determining vertebrate and invertebrate lateral neural borders remains unclear. Using single-cell-resolution gene-expression profiling and genetic analysis, we present evidence that orthologs of the NPB specification module specify the invertebrate lateral neural border, which is composed of CNS and PNS progenitors. First, like in vertebrates, the conserved neuroectoderm lateral border specifier Msx/vab-15 specifies lateral neuroblasts in Caenorhabditis elegans. Second, orthologs of the vertebrate NPB specification module (Msx/vab-15, Pax3/7/pax-3, and Zic/ref-2) are significantly enriched in worm lateral neuroblasts. In addition, like in other bilaterians, the expression domain of Msx/vab-15 is more lateral than those of Pax3/7/pax-3 and Zic/ref-2 in C. elegans. Third, we show that Msx/vab-15 regulates the development of mechanosensory neurons derived from lateral neural progenitors in multiple invertebrate species, including C. elegans, Drosophila melanogaster, and Ciona intestinalis. We also identify a novel lateral neural border specifier, ZNF703/tlp-1, which functions synergistically with Msx/vab-15 in both C. elegans and Xenopus laevis. These data suggest a common origin of the molecular mechanism specifying lateral neural borders across bilaterians. PMID:28716930
Conserved gene regulatory module specifies lateral neural borders across bilaterians.
Li, Yongbin; Zhao, Di; Horie, Takeo; Chen, Geng; Bao, Hongcun; Chen, Siyu; Liu, Weihong; Horie, Ryoko; Liang, Tao; Dong, Biyu; Feng, Qianqian; Tao, Qinghua; Liu, Xiao
2017-08-01
The lateral neural plate border (NPB), the neural part of the vertebrate neural border, is composed of central nervous system (CNS) progenitors and peripheral nervous system (PNS) progenitors. In invertebrates, PNS progenitors are also juxtaposed to the lateral boundary of the CNS. Whether there are conserved molecular mechanisms determining vertebrate and invertebrate lateral neural borders remains unclear. Using single-cell-resolution gene-expression profiling and genetic analysis, we present evidence that orthologs of the NPB specification module specify the invertebrate lateral neural border, which is composed of CNS and PNS progenitors. First, like in vertebrates, the conserved neuroectoderm lateral border specifier Msx/vab-15 specifies lateral neuroblasts in Caenorhabditis elegans. Second, orthologs of the vertebrate NPB specification module (Msx/vab-15, Pax3/7/pax-3, and Zic/ref-2) are significantly enriched in worm lateral neuroblasts. In addition, like in other bilaterians, the expression domain of Msx/vab-15 is more lateral than those of Pax3/7/pax-3 and Zic/ref-2 in C. elegans. Third, we show that Msx/vab-15 regulates the development of mechanosensory neurons derived from lateral neural progenitors in multiple invertebrate species, including C. elegans, Drosophila melanogaster, and Ciona intestinalis. We also identify a novel lateral neural border specifier, ZNF703/tlp-1, which functions synergistically with Msx/vab-15 in both C. elegans and Xenopus laevis. These data suggest a common origin of the molecular mechanism specifying lateral neural borders across bilaterians.
Conservation of the behavioral and transcriptional response to social experience among Drosophilids.
Shultzaberger, Ryan K; Johnson, Sarah J; Wagner, Jenee; Ha, Kim; Markow, Therese A; Greenspan, Ralph J
2018-05-24
While social experience has been shown to significantly alter behaviors in a wide range of species, comparative studies that uniformly measure the impact of a single experience across multiple species have been lacking, limiting our understanding of how plastic traits evolve. To address this, we quantified variations in social feeding behaviors across 10 species of Drosophilids, tested the effect of altering rearing context on these behaviors (reared in groups or in isolation), and correlated observed behavioral shifts to accompanying transcriptional changes in the heads of these flies. We observed significant variability in the extent of aggressiveness, the utilization of social cues during food search, and social space preferences across species. The sensitivity of these behaviors to rearing experience also varied: socially naive flies were more aggressive than their socialized con-specifics in some species, and more reserved or identical in others. Despite these differences, the mechanism of socialization appeared to be conserved within the melanogaster sub-group as species could cross-socialize each other, and the transcriptional response to social exposure was significantly conserved. The expression levels of chemosensory-perception genes often varied between species and rearing conditions, supporting a growing body of evidence that behavioral evolution is driven by the differential regulation of this class of genes. The clear differences in behavioral responses to socialization observed in Drosophilids make this an ideal system for continued studies on the genetic basis and evolution of socialization and behavioral plasticity. This article is protected by copyright. All rights reserved.
Chen, Wen; Zhang, Xuan; Li, Jing; Huang, Shulan; Xiang, Shuanglin; Hu, Xiang; Liu, Changning
2018-05-09
Zebrafish is a fully developed model system for studying developmental processes and human disease. Recent deep-sequencing studies have discovered a large number of long non-coding RNAs (lncRNAs) in zebrafish. However, only a few of them have been functionally characterized. How to take advantage of the mature zebrafish system to investigate lncRNA function and conservation in depth is therefore an intriguing question. We systematically collected and analyzed a series of zebrafish RNA-seq data sets, then combined them with resources from known databases and the literature. As a result, we obtained what is by far the most complete dataset of zebrafish lncRNAs, containing 13,604 lncRNA genes (21,128 transcripts) in total. Based on this dataset, a co-expression network of zebrafish coding and lncRNA genes was constructed and analyzed, and used to predict the Gene Ontology (GO) and KEGG annotations of lncRNAs. We also performed a conservation analysis of zebrafish lncRNAs, identifying 1828 conserved zebrafish lncRNA genes (1890 transcripts) that have putative mammalian orthologs. We found that zebrafish lncRNAs play important roles in regulating the development and function of the nervous system, and that the conserved lncRNAs show significant sequence and functional conservation with their mammalian counterparts. By integrative data analysis and construction of a coding-lncRNA gene co-expression network, we obtained the most comprehensive dataset of zebrafish lncRNAs to date, together with systematic annotations and comprehensive analyses of their function and conservation. Our study provides a reliable zebrafish-based platform for exploring lncRNA function and mechanism in depth, as well as the lncRNA commonality between zebrafish and human.
Godec, Jernej; Tan, Yan; Liberzon, Arthur; Tamayo, Pablo; Bhattacharya, Sanchita; Butte, Atul J; Mesirov, Jill P; Haining, W Nicholas
2016-01-19
Gene-expression profiling has become a mainstay in immunology, but subtle changes in gene networks related to biological processes are hard to discern when comparing various datasets. For instance, conservation of the transcriptional response to sepsis in mouse models and human disease remains controversial. To improve transcriptional analysis in immunology, we created ImmuneSigDB: a manually annotated compendium of ∼5,000 gene-sets from diverse cell states, experimental manipulations, and genetic perturbations in immunology. Analysis using ImmuneSigDB identified signatures induced in activated myeloid cells and differentiating lymphocytes that were highly conserved between humans and mice. Sepsis triggered conserved patterns of gene expression in humans and mouse models. However, we also identified species-specific biological processes in the sepsis transcriptional response: although both species upregulated phagocytosis-related genes, a mitosis signature was specific to humans. ImmuneSigDB enables granular analysis of transcriptomic data to improve biological understanding of immune processes of the human and mouse immune systems. Copyright © 2016 Elsevier Inc. All rights reserved.
Yousfi, Fatma-Ezzahra; Makhloufi, Emna; Marande, William; Ghorbel, Abdel W; Bouzayen, Mondher; Bergès, Hélène
2016-01-01
WRKY transcription factors are involved in multiple aspects of plant growth, development and responses to biotic stresses. Although they have been found to play roles in regulating plant responses to environmental stresses, these roles still need to be explored, especially those pertaining to crops. Durum wheat is the second most widely produced cereal in the world. Complex, large and unsequenced genomes, in addition to a lack of genomic resources, hinder the molecular characterization of tolerance mechanisms. This paper describes the isolation and characterization of five TdWRKY genes from durum wheat (Triticum turgidum L. ssp. durum). A PCR-based screening of a T. turgidum BAC genomic library using primers within the conserved region of WRKY genes resulted in the isolation of five BAC clones. Following full sequencing of the five BACs, fine annotation through the Triannot pipeline revealed 74.6% of the entire sequences as transposable elements and a 3.2% gene content, with genes organized as islands within oceans of TEs. Each BAC clone harbored a TdWRKY gene. The study showed a very extensive conservation of genomic structure between TdWRKYs and their orthologs from Brachypodium, barley, and T. aestivum. The structural features of TdWRKY proteins suggested that they are novel members of the WRKY family in durum wheat. TdWRKY1/2/4, TdWRKY3, and TdWRKY5 belong to the group Ia, IIa, and IIc, respectively. Enrichment of cis-regulatory elements related to stress responses in the promoters of some TdWRKY genes indicated their potential roles in mediating plant responses to a wide variety of environmental stresses. TdWRKY genes displayed different expression patterns in response to salt stress that distinguish two durum wheat genotypes with contrasting salt stress tolerance phenotypes. TdWRKY genes tended to react earlier, with a down-regulation in sensitive genotype leaves and with an up-regulation in tolerant genotype leaves.
The TdWRKY transcript levels in roots increased in the tolerant genotype compared to the sensitive genotype. The present results indicate that these genes might play some functional role in salt tolerance in durum wheat.
Gunasekera, Thusitha S.; Bowen, Loryn L.; Zhou, Carol E.; Howard-Byerly, Susan C.; Foley, William S.; Striebich, Richard C.; Dugan, Larry C.
2017-01-01
ABSTRACT Pseudomonas aeruginosa can utilize hydrocarbons, but different strains have various degrees of adaptation despite their highly conserved genome. P. aeruginosa ATCC 33988 is highly adapted to hydrocarbons, while P. aeruginosa strain PAO1, a human pathogen, is less adapted and degrades jet fuel at a lower rate than does ATCC 33988. We investigated fuel-specific transcriptomic differences between these strains in order to ascertain the underlying mechanisms utilized by the adapted strain to proliferate in fuel. During growth in fuel, the genes related to alkane degradation, heat shock response, membrane proteins, efflux pumps, and several novel genes were upregulated in ATCC 33988. Overexpression of alk genes in PAO1 provided some improvement in growth, but it was not as robust as that of ATCC 33988, suggesting the role of other genes in adaptation. Expression of the function unknown gene PA5359 from ATCC 33988 in PAO1 increased the growth in fuel. Bioinformatic analysis revealed that PA5359 is a predicted lipoprotein with a conserved Yx(FWY)xxD motif, which is shared among bacterial adhesins. Overexpression of the putative resistance-nodulation-division (RND) efflux pump PA3521 to PA3523 increased the growth of the ATCC 33988 strain, suggesting a possible role in fuel tolerance. Interestingly, the PAO1 strain cannot utilize n-C8 and n-C10. The expression of green fluorescent protein (GFP) under the control of alkB promoters confirmed that alk gene promoter polymorphism affects the expression of alk genes. Promoter fusion assays further confirmed that the regulation of alk genes was different in the two strains. Protein sequence analysis showed low amino acid differences for many of the upregulated genes, further supporting transcriptional control as the main mechanism for enhanced adaptation. 
IMPORTANCE These results support that specific signal transduction, gene regulation, and coordination of multiple biological responses are required to improve the survival, growth, and metabolism of fuel in adapted strains. This study provides new insight into the mechanistic differences between strains and helpful information that may be applied in the improvement of bacterial strains for resistance to biotic and abiotic factors encountered during bioremediation and industrial biotechnological processes. PMID:28314727
Xu, Jianing; Xing, Shanshan; Cui, Haoran; Chen, Xuesen; Wang, Xiaoyun
2016-04-01
The ubiquitin-protein ligases (E3s) directly participate in transferring ubiquitin (Ub) to target proteins in the ubiquitination pathway. The HECT ubiquitin-protein ligase (UPL), one type of E3, is characterized by a conserved HECT domain of approximately 350 amino acids at the C terminus. Some UPLs were found to be involved in trichome development and leaf senescence in Arabidopsis. However, studies on plant UPLs, such as characteristics of the protein structure, predicted functional motifs of the HECT domain, and the regulatory expression of UPLs have all been limited. Here, we present genome-wide identification of the genes encoding UPLs (HECT genes) in apple. The 13 genes (named MdUPL1-MdUPL13), located on ten different chromosomes, were divided into four groups by phylogenetic analysis. The intron-exon structures of the genes and the additional functional domains they encode differed considerably among these groups. Notably, MdUPL7 is the first plant UPL in which an F-box domain has been found. The HECT domains of different MdUPL groups also presented different spatial features, and three types of conserved motifs were identified. Cis-acting element analysis showed that the promoter of each MdUPL member carried multiple stress-response-related elements. qRT-PCR assays demonstrated that the expression of several MdUPLs was quite sensitive to cold, drought, and salt stresses. The results of this study help to elucidate the functions of HECT proteins, especially in Rosaceae plants.
Primate-specific evolution of noncoding element insertion into PLA2G4C and human preterm birth
2010-01-01
Background The onset of birth in humans, like other apes, differs from non-primate mammals in its endocrine physiology. We hypothesize that higher primate-specific gene evolution may lead to these differences and target genes involved in human preterm birth, an area of global health significance. Methods We performed a comparative genomics screen of highly conserved noncoding elements and identified PLA2G4C, a phospholipase A isoform involved in prostaglandin biosynthesis as human accelerated. To examine whether this gene demonstrating primate-specific evolution was associated with birth timing, we genotyped and analyzed 8 common single nucleotide polymorphisms (SNPs) in PLA2G4C in US Hispanic (n = 73 preterm, 292 control), US White (n = 147 preterm, 157 control) and US Black (n = 79 preterm, 166 control) mothers. Results Detailed structural and phylogenic analysis of PLA2G4C suggested a short genomic element within the gene duplicated from a paralogous highly conserved element on chromosome 1 specifically in primates. SNPs rs8110925 and rs2307276 in US Hispanics and rs11564620 in US Whites were significant after correcting for multiple tests (p < 0.006). Additionally, rs11564620 (Thr360Pro) was associated with increased metabolite levels of the prostaglandin thromboxane in healthy individuals (p = 0.02), suggesting this variant may affect PLA2G4C activity. Conclusions Our findings suggest that variation in PLA2G4C may influence preterm birth risk by increasing levels of prostaglandins, which are known to regulate labor. PMID:21184677
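The reported significance cutoff (p < 0.006) is numerically consistent with a Bonferroni-style correction for the 8 genotyped SNPs, although the abstract does not name the correction method. The sketch below shows that arithmetic only; the family-wise alpha of 0.05 and the function name are assumptions made for illustration.

```python
# Hypothetical illustration: per-test threshold under a Bonferroni correction.
# The abstract does not state that Bonferroni was used; alpha = 0.05 is assumed.

def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Return the per-test significance threshold for a family-wise alpha."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


# 8 common SNPs in PLA2G4C were genotyped according to the abstract.
threshold = bonferroni_threshold(0.05, 8)
print(f"per-SNP threshold: {threshold:.5f}")
```

Under these assumptions the per-SNP threshold is 0.00625, which rounds to the 0.006 quoted in the abstract.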
Chilton, Scott S; Falbel, Tanya G; Hromada, Susan; Burton, Briana M
2017-08-01
Genetic competence is a process in which cells are able to take up DNA from their environment, resulting in horizontal gene transfer, a major mechanism for generating diversity in bacteria. Many bacteria carry homologs of the central DNA uptake machinery that has been well characterized in Bacillus subtilis. It has been postulated that the B. subtilis competence helicase ComFA belongs to the DEAD box family of helicases/translocases. Here, we made a series of mutants to analyze conserved amino acid motifs in several regions of B. subtilis ComFA. First, we confirmed that ComFA activity requires amino acid residues conserved among the DEAD box helicases, and second, we show that a zinc finger-like motif consisting of four cysteines is required for efficient transformation. Each cysteine in the motif is important, and mutation of at least two of the cysteines dramatically reduces transformation efficiency. Further, combining multiple cysteine mutations with the helicase mutations shows an additive phenotype. Our results suggest that the helicase and metal binding functions are two distinct activities important for ComFA function during transformation. IMPORTANCE ComFA is a highly conserved protein that has a role in DNA uptake during natural competence, a mechanism for horizontal gene transfer observed in many bacteria. Investigation of the details of the DNA uptake mechanism is important for understanding the ways in which bacteria gain new traits from their environment, such as drug resistance. To dissect the role of ComFA in the DNA uptake machinery, we introduced point mutations into several motifs in the protein sequence. We demonstrate that several amino acid motifs conserved among ComFA proteins are important for efficient transformation. This report is the first to demonstrate the functional requirement of an amino-terminal cysteine motif in ComFA. Copyright © 2017 American Society for Microbiology.
The effects of exogenous cortisol on myostatin transcription in rainbow trout, Oncorhynchus mykiss
Galt, Nicholas J.; Froehlich, Jacob Michael; Remily, Ethan A.; Romero, Sinibaldo R.; Biga, Peggy R.
2014-01-01
Glucocorticoids (GCs) strongly regulate myostatin transcript levels in mammals via glucocorticoid response elements (GREs) in the myostatin promoter, and bioinformatics methods suggest that this regulatory mechanism is conserved among many vertebrates. However, the multiple myostatin genes found in some fishes may be an exception. In rainbow trout (Oncorhynchus mykiss), two genome duplication events have produced three putatively functional myostatin genes, myostatin-1a, -1b and -2a, which are ubiquitously and differentially expressed. In addition, in silico promoter analyses of the rainbow trout myostatin promoters have failed to identify putative GREs, suggesting a divergence in myostatin function. Therefore, we hypothesized that myostatin mRNA expression is not regulated by glucocorticoids in rainbow trout. In this study, both juvenile rainbow trout and primary trout myoblasts were treated with cortisol to examine the relationship between this glucocorticoid and myostatin mRNA expression. Results suggest that exogenous cortisol does not regulate myostatin-1a and -1b expression in vivo, as myostatin mRNA levels were not significantly affected by cortisol treatment in either red or white muscle tissue. In red muscle, myostatin-2a levels were significantly elevated in the cortisol treatment group relative to the control, but not the vehicle control, at both 12 h and 24 h post-injection. As such, it is unclear if cortisol was acting alone or in combination with the vehicle. Cortisol increased myostatin-1b expression in a dose-dependent manner in vitro. Further work is needed to determine if this response is the direct result of cortisol acting on the myostatin-1b promoter or through an alternative mechanism. These results suggest that regulation of myostatin by cortisol may not be as highly conserved as previously thought and support previous work that describes potential functional divergence of the multiple myostatin genes in fishes. PMID:24875565
Riley, D E; Wagner, B; Polley, L; Krieger, J N
1995-01-01
The protozoan parasite Tritrichomonas foetus causes infertility and spontaneous abortion in cattle. In Saskatchewan, Canada, the culture prevalence of trichomonads was 65 of 1,048 bulls (6%) tested within a 1-year period ending in April 1994. Saskatchewan was previously thought to be free of the parasite. To confirm the culture results, possible T. foetus DNA presence was determined by the PCR. All of the 16 culture-positive isolates tested were PCR positive by a single-band test, but one PCR product was weak. DNA fingerprinting by both T17 PCR and randomly amplified polymorphic DNA PCR revealed genetic variation or polymorphism among the T. foetus isolates. T17 PCR also revealed conserved loci that distinguished these T. foetus isolates from Trichomonas vaginalis, from a variety of other protozoa, and from prokaryotes. TCO-1 PCR, a PCR test designed to sample DNA sequence homologous to the 5' flank of a highly conserved cell division control gene, detected genetic polymorphism at low stringency and a conserved, single locus at higher stringency. These findings suggested that T. foetus isolates exhibit both conserved genetic loci and polymorphic loci detectable by independent PCR methods. Both conserved and polymorphic genetic loci may prove useful for improved clinical diagnosis of T. foetus. The polymorphic loci detected by PCR suggested either a long history of infection or multiple lines of T. foetus infection in Saskatchewan. Polymorphic loci detected by PCR may provide data for epidemiologic studies of T. foetus. PMID:7615746
Regions of extreme synonymous codon selection in mammalian genes
Schattner, Peter; Diekhans, Mark
2006-01-01
Recently there has been increasing evidence that purifying selection occurs among synonymous codons in mammalian genes. This selection appears to be a consequence of either cis-regulatory motifs, such as exonic splicing enhancers (ESEs), or mRNA secondary structures, being superimposed on the coding sequence of the gene. We have developed a program to identify regions likely to be enriched for such motifs by searching for extended regions of extreme codon conservation between homologous genes of related species. Here we present the results of applying this approach to five mammalian species (human, chimpanzee, mouse, rat and dog). Even with very conservative selection criteria, we find over 200 regions of extreme codon conservation, ranging in length from 60 to 178 codons. The regions are often found within genes involved in DNA-binding, RNA-binding or zinc-ion-binding. They are highly depleted for synonymous single nucleotide polymorphisms (SNPs) but not for non-synonymous SNPs, further indicating that the observed codon conservation is being driven by negative selection. Forty-three percent of the regions overlap conserved alternative transcript isoforms and are enriched for known ESEs. Other regions are enriched for TpA dinucleotides and may contain conserved motifs/structures relating to mRNA stability and/or degradation. We anticipate that this tool will be useful for detecting regions enriched in other classes of coding-sequence motifs and structures as well. PMID:16556911
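The search described above, for extended stretches in which homologous genes keep the same codon, can be sketched as a simple run scan. This is not the authors' program: it assumes in-frame, ungapped, pre-aligned coding sequences, treats "conserved" as "identical codon in every species", and uses hypothetical function names and toy sequences.

```python
from typing import List, Tuple


def split_codons(cds: str) -> List[str]:
    """Split an in-frame coding sequence into codons, dropping any trailing partial codon."""
    usable = len(cds) - len(cds) % 3
    return [cds[i:i + 3] for i in range(0, usable, 3)]


def conserved_codon_runs(aligned_cds: List[str], min_len: int) -> List[Tuple[int, int]]:
    """Return (start, end) codon indices, end exclusive, of runs in which every
    sequence carries the identical codon and the run spans at least min_len codons."""
    codon_lists = [split_codons(seq.upper()) for seq in aligned_cds]
    n = min(len(codons) for codons in codon_lists)
    runs: List[Tuple[int, int]] = []
    start = None
    for i in range(n):
        identical = len({codons[i] for codons in codon_lists}) == 1
        if identical and start is None:
            start = i  # a new conserved run begins here
        elif not identical and start is not None:
            if i - start >= min_len:
                runs.append((start, i))
            start = None
    # Close a run that extends to the end of the alignment.
    if start is not None and n - start >= min_len:
        runs.append((start, n))
    return runs


# Toy sequences (hypothetical); the fourth codon differs synonymously (GGT vs GGC).
seq_a = "ATGGCCAAAGGTTTT"
seq_b = "ATGGCCAAAGGCTTT"
print(conserved_codon_runs([seq_a, seq_b], min_len=2))
```

On real data a much larger `min_len` would be appropriate; the abstract reports regions of 60 to 178 codons but does not give the exact selection criteria.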
Conservation and loss of ribosomal RNA gene sites in diploid and polyploid Fragaria (Rosaceae)
2011-01-01
Background The genus Fragaria comprises species at ploidy levels ranging from diploid (2n = 2x = 14) to decaploid (2n = 10x = 70). Fluorescence in situ hybridization with 5S and 25S rDNA probes was performed to gather cytogenetic information that illuminates genomic divergence among different taxa at multiple ploidy levels, as well as to explore the evolution of ribosomal RNA genes during polyploidization in Fragaria. Results Root tip cells of diploid taxa were typified by two 5S and six 25S rDNA hybridization signals of varying intensities, providing a baseline for comparisons within the genus. In three exceptional diploid genotypes, F. nilgerrensis (CFRA 1358 and CFRA 1825) and F. vesca 'Yellow Wonder', two 5S but only four 25S rDNA sites were found but with differing site losses. The numbers of 5S and 25S rDNA signals, respectively were three and nine in a triploid F. ×bifera accession, and were four and twelve in three tetraploids, thus occurring in proportional 1.5× and 2× multiples of the typical diploid pattern. In hexaploid F. moschata, a proportional multiple of six 5S rDNA sites was observed, but the number of 25S rDNA sites was one or two less than the proportionate prediction of eighteen. This apparent tendency toward rDNA site loss at higher ploidy was markedly expanded in octoploids, which displayed only two 5S and ten 25S rDNA sites. In the two decaploids examined, the numbers of 5S and 25S rDNA signals, respectively, were four and fifteen in F. virginiana subsp. platypetala, and six and twelve in F. iturupensis. Conclusions Among diploid Fragaria species, a general consistency of rDNA site numbers implies conserved genomic organization, but highly variable 25S signal sizes and intensities and two instances of site loss suggest concurrent high dynamics of rDNA copy numbers among both homologs and non-homologs. 
General conservation of rDNA site numbers in lower ploidy, but marked site number reductions at higher ploidy levels, suggest complex evolution of rDNA sites during polyploidization and/or independent evolutionary pathways for 6x versus higher ploidy strawberries. Site number comparisons suggest common genomic composition among natural octoploids, and independent origins of the two divergent decaploid accessions. PMID:22074487
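The "proportional multiple" reasoning in this abstract can be made explicit with a small calculation: scale the typical diploid counts (two 5S and six 25S sites) by ploidy and compare with the observed counts. The sketch below uses only numbers quoted in the abstract; the function and variable names are hypothetical.

```python
# Typical diploid (2x) rDNA site counts reported in the abstract.
DIPLOID_BASELINE = {"5S": 2, "25S": 6}


def expected_sites(ploidy: int) -> dict:
    """Scale the diploid baseline in proportion to ploidy (factor = ploidy / 2)."""
    return {probe: count * ploidy // 2 for probe, count in DIPLOID_BASELINE.items()}


# Observed counts quoted in the abstract for triploid, tetraploid and octoploid material.
OBSERVED = {
    3: {"5S": 3, "25S": 9},
    4: {"5S": 4, "25S": 12},
    8: {"5S": 2, "25S": 10},
}

for ploidy, observed in OBSERVED.items():
    expected = expected_sites(ploidy)
    shortfall = {probe: expected[probe] - observed[probe] for probe in expected}
    print(f"{ploidy}x expected={expected} observed={observed} shortfall={shortfall}")
```

The triploid and tetraploid counts match the proportional expectation, whereas the octoploids fall short by six 5S and fourteen 25S sites, which is the site loss the abstract describes.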
The Role of DNA Barcodes in Understanding and Conservation of Mammal Diversity in Southeast Asia
Francis, Charles M.; Borisenko, Alex V.; Ivanova, Natalia V.; Eger, Judith L.; Lim, Burton K.; Guillén-Servent, Antonio; Kruskop, Sergei V.; Mackie, Iain; Hebert, Paul D. N.
2010-01-01
Background Southeast Asia is recognized as a region of very high biodiversity, much of which is currently at risk due to habitat loss and other threats. However, many aspects of this diversity, even for relatively well-known groups such as mammals, are poorly known, limiting ability to develop conservation plans. This study examines the value of DNA barcodes, sequences of the mitochondrial COI gene, to enhance understanding of mammalian diversity in the region and hence to aid conservation planning. Methodology and Principal Findings DNA barcodes were obtained from nearly 1900 specimens representing 165 recognized species of bats. All morphologically or acoustically distinct species, based on classical taxonomy, could be discriminated with DNA barcodes except four closely allied species pairs. Many currently recognized species contained multiple barcode lineages, often with deep divergence suggesting unrecognized species. In addition, most widespread species showed substantial genetic differentiation across their distributions. Our results suggest that mammal species richness within the region may be underestimated by at least 50%, and there are higher levels of endemism and greater intra-specific population structure than previously recognized. Conclusions DNA barcodes can aid conservation and research by assisting field workers in identifying species, by helping taxonomists determine species groups needing more detailed analysis, and by facilitating the recognition of the appropriate units and scales for conservation planning. PMID:20838635
Unger, Shem D.; Rhodes, Olin E.; Sutton, Trent M.; Williams, Rod N.
2013-01-01
Conservation genetics is a powerful tool to assess the population structure of species and provides a framework for informing management of freshwater ecosystems. As lotic habitats become fragmented, the need to assess gene flow for species of conservation management becomes a priority. The eastern hellbender (Cryptobranchus alleganiensis alleganiensis) is a large, fully aquatic paedamorphic salamander. Many populations are experiencing declines throughout their geographic range, yet the genetic ramifications of these declines are currently unknown. To this end, we examined levels of genetic variation and genetic structure at both range-wide and drainage (hierarchical) scales. We collected 1,203 individuals from 77 rivers throughout nine states from June 2007 to August 2011. Levels of genetic diversity were relatively high among all sampling locations. We detected significant genetic structure across populations (Fst values ranged from 0.001 between rivers within a single watershed to 0.218 between states). We identified two genetically differentiated groups at the range-wide scale: 1) the Ohio River drainage and 2) the Tennessee River drainage. An analysis of molecular variance (AMOVA) based on landscape-scale sampling of basins within the Tennessee River drainage revealed the majority of genetic variation (∼94–98%) occurs within rivers. Eastern hellbenders show a strong pattern of isolation by stream distance (IBSD) at the drainage level. Understanding levels of genetic variation and differentiation at multiple spatial and biological scales will enable natural resource managers to make more informed decisions and plan effective conservation strategies for cryptic, lotic species. PMID:24204565
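For readers unfamiliar with Fst, the following minimal sketch computes Wright's Fst for a single biallelic locus from subpopulation allele frequencies, assuming equally sized subpopulations. The abstract does not state which estimator or marker type the study used, so this is an illustration of the quantity rather than the authors' method; the function name and the toy frequencies are hypothetical.

```python
from typing import Sequence


def wright_fst(allele_freqs: Sequence[float]) -> float:
    """Fst = (Ht - Hs) / Ht for one biallelic locus.

    allele_freqs holds the frequency of one allele in each subpopulation;
    subpopulations are assumed to be of equal size.
    """
    if not allele_freqs:
        raise ValueError("need at least one subpopulation")
    k = len(allele_freqs)
    # Mean expected heterozygosity within subpopulations.
    hs = sum(2 * p * (1 - p) for p in allele_freqs) / k
    # Expected heterozygosity of the pooled population.
    p_bar = sum(allele_freqs) / k
    ht = 2 * p_bar * (1 - p_bar)
    if ht == 0:
        raise ValueError("locus is fixed across all subpopulations")
    return (ht - hs) / ht


# Toy frequencies: identical subpopulations versus strongly differentiated ones.
print(wright_fst([0.5, 0.5]))
print(wright_fst([0.2, 0.8]))
```

Values near zero indicate little differentiation (as reported between rivers within a watershed), and larger values indicate stronger structure (as reported between states).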
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gao, Junpeng; Innovation Experimental College, Northwest A&F University, Yangling, Shaanxi 712100; Cao, Xiaoli
The Auxin/indole-3-acetic acid (Aux/IAA) genes encode short-lived nuclear proteins that are known to be involved in the primary cellular responses to auxin. To date, systematic analysis of the Aux/IAA genes in potato (Solanum tuberosum) has not been conducted. In this study, a total of 26 potato Aux/IAA genes were identified (designated from StIAA1 to StIAA26), and the distribution of four conserved domains shared by the StIAAs was analyzed based on multiple sequence alignment and a motif-based sequence analysis. A phylogenetic analysis of the Aux/IAA gene families of potato and Arabidopsis was also conducted. In order to assess the roles of StIAA genes in tuber development, the results of RNA-seq studies were reformatted to analyze the expression patterns of StIAA genes, and then verified by quantitative real-time PCR. A large number of StIAA genes (12 genes) were highly expressed in stolon organs and during the tuber initiation and expansion developmental stages, and most of these genes were responsive to indoleacetic acid treatment. Our results suggested that StIAA genes were involved in the process of tuber development and provided insights into functional roles of potato Aux/IAA genes. - Highlights: • A systematic analysis of the potato AUX/IAA gene family was performed. • StIAA genes were related to auxin perception and signal transduction. • Candidate StIAA genes likely related to tuber initiation and expansion were screened.
Negre, Bárbara; Casillas, Sònia; Suzanne, Magali; Sánchez-Herrero, Ernesto; Akam, Michael; Nefedov, Michael; Barbadilla, Antonio; de Jong, Pieter; Ruiz, Alfredo
2005-01-01
Homeotic (Hox) genes are usually clustered and arranged in the same order as they are expressed along the anteroposterior body axis of metazoans. The mechanistic explanation for this colinearity has been elusive, and it may well be that a single and universal cause does not exist. The Hox-gene complex (HOM-C) has been rearranged differently in several Drosophila species, producing a striking diversity of Hox gene organizations. We investigated the genomic and functional consequences of the two HOM-C splits present in Drosophila buzzatii. Firstly, we sequenced two regions of the D. buzzatii genome, one containing the genes labial and abdominal A, and another one including proboscipedia, and compared their organization with that of D. melanogaster and D. pseudoobscura in order to map precisely the two splits. Then, a plethora of conserved noncoding sequences, which are putative enhancers, were identified around the three Hox genes closer to the splits. The position and order of these enhancers are conserved, with minor exceptions, between the three Drosophila species. Finally, we analyzed the expression patterns of the same three genes in embryos and imaginal discs of four Drosophila species with different Hox-gene organizations. The results show that their expression patterns are conserved despite the HOM-C splits. We conclude that, in Drosophila, Hox-gene clustering is not an absolute requirement for proper function. Rather, the organization of Hox genes is modular, and their clustering seems the result of phylogenetic inertia more than functional necessity. PMID:15867430
Response variables for evaluation of the effectiveness of conservation corridors.
Gregory, Andrew J; Beier, Paul
2014-06-01
Many studies have evaluated effectiveness of corridors by measuring species presence in and movement through small structural corridors. However, few studies have assessed whether these response variables are adequate for assessing whether the conservation goals of the corridors have been achieved or considered the costs or lag times involved in measuring the response variables. We examined 4 response variables (presence of the focal species in the corridor, interpatch movement via the corridor, gene flow, and patch occupancy) with respect to 3 criteria: relevance to conservation goals, lag time (fewest generations at which a positive response to the corridor might be evident with a particular variable), and the cost of a study when applying a particular variable. The presence variable had the least relevance to conservation goals, no lag time advantage compared with interpatch movement, and only a moderate cost advantage over interpatch movement or gene flow. Movement of individual animals between patches was the most appropriate response variable for a corridor intended to provide seasonal migration, but it was not an appropriate response variable for corridor dwellers, and for passage species it was only moderately relevant to the goals of gene flow, demographic rescue, and recolonization. Response variables related to gene flow provided a good trade-off among cost, relevance to conservation goals, and lag time. Nonetheless, the lag time of 10-20 generations means that evaluation of conservation corridors cannot occur until a few decades after a corridor has been established. Response variables related to occupancy were most relevant to conservation goals, but the lag time and costs to detect corridor effects on occupancy were much greater than the lag time and costs to detect corridor effects on gene flow. © 2014 Society for Conservation Biology.
Gillespie, J J; Johnston, J S; Cannone, J J; Gutell, R R
2006-01-01
As an accompanying manuscript to the release of the honey bee genome, we report the entire sequence of the nuclear (18S, 5.8S, 28S and 5S) and mitochondrial (12S and 16S) ribosomal RNA (rRNA)-encoding gene sequences (rDNA) and related internally and externally transcribed spacer regions of Apis mellifera (Insecta: Hymenoptera: Apocrita). Additionally, we predict secondary structures for the mature rRNA molecules based on comparative sequence analyses with other arthropod taxa and reference to recently published crystal structures of the ribosome. In general, the structures of honey bee rRNAs are in agreement with previously predicted rRNA models from other arthropods in core regions of the rRNA, with little additional expansion in non-conserved regions. Our multiple sequence alignments are made available on several public databases and provide a preliminary establishment of a global structural model of all rRNAs from the insects. Additionally, we provide conserved stretches of sequences flanking the rDNA cistrons that comprise the externally transcribed spacer regions (ETS) and part of the intergenic spacer region (IGS), including several repetitive motifs. Finally, we report the occurrence of retrotransposition in the nuclear large subunit rDNA, as R2 elements are present in the usual insertion points found in other arthropods. Interestingly, functional R1 elements usually present in the genomes of insects were not detected in the honey bee rRNA genes. The reverse transcriptase products of the R2 elements are deduced from their putative open reading frames and structurally aligned with those from another hymenopteran insect, the jewel wasp Nasonia (Pteromalidae). Stretches of conserved amino acids shared between Apis and Nasonia are illustrated and serve as potential sites for primer design, as target amplicons within these R2 elements may serve as novel phylogenetic markers for Hymenoptera. 
Given the impending completion of the sequencing of the Nasonia genome, we expect our report eventually to shed light on the evolution of the hymenopteran genome within higher insects, particularly regarding the relative maintenance of conserved rDNA genes, related variable spacer regions and retrotransposable elements. PMID:17069639
2013-01-01
Background The development of new therapies for orphan genetic diseases represents an extremely important medical and social challenge. Drug repositioning, i.e. finding new indications for approved drugs, could be one of the most cost- and time-effective strategies to cope with this problem, at least in a subset of cases. Therefore, many computational approaches based on the analysis of high throughput gene expression data have so far been proposed to reposition available drugs. However, most of these methods require gene expression profiles directly relevant to the pathologic conditions under study, such as those obtained from patient cells and/or from suitable experimental models. In this work we have developed a new approach for drug repositioning, based on identifying known drug targets showing conserved anti-correlated expression profiles with human disease genes, which is completely independent from the availability of ‘ad hoc’ gene expression data-sets. Results By analyzing available data, we provide evidence that the genes displaying conserved anti-correlation with drug targets are antagonistically modulated in their expression by treatment with the relevant drugs. We then identified clusters of genes associated to similar phenotypes and showing conserved anticorrelation with drug targets. On this basis, we generated a list of potential candidate drug-disease associations. Importantly, we show that some of the proposed associations are already supported by independent experimental evidence. Conclusions Our results support the hypothesis that the identification of gene clusters showing conserved anticorrelation with drug targets can be an effective method for drug repositioning and provide a wide list of new potential drug-disease associations for experimental validation. PMID:24088245
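The core idea above, a drug target whose expression is consistently anti-correlated with a disease gene, can be sketched with a plain Pearson correlation. The abstract does not specify the datasets, the correlation measure or any cutoff, so the choices below (Pearson's r, a cutoff of -0.5, and reading "conserved" as "seen in two independent datasets") are assumptions made purely for illustration, as are the function names and toy profiles.

```python
from math import sqrt
from typing import Sequence, Tuple

Profile = Sequence[float]


def pearson(x: Profile, y: Profile) -> float:
    """Pearson correlation coefficient of two equally long expression profiles."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("profiles must have the same length (at least 2)")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("profiles must not be constant")
    return cov / sqrt(var_x * var_y)


def conserved_anticorrelation(
    dataset_a: Tuple[Profile, Profile],
    dataset_b: Tuple[Profile, Profile],
    cutoff: float = -0.5,  # assumed threshold, not taken from the abstract
) -> bool:
    """True if the (target, disease gene) pair is anti-correlated in both datasets."""
    return all(pearson(target, disease) <= cutoff for target, disease in (dataset_a, dataset_b))


# Toy profiles (hypothetical): each tuple is (drug target, disease gene).
first = ([1.0, 2.0, 3.0, 4.0, 5.0], [5.0, 4.0, 3.0, 2.0, 1.0])
second = ([2.0, 4.0, 6.0, 8.0], [9.0, 7.0, 4.0, 2.0])
print(conserved_anticorrelation(first, second))
```

Requiring the relationship to hold in more than one dataset is what distinguishes a conserved anti-correlation from one that appears in a single experiment.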
Next generation sequencing and analysis of a conserved transcriptome of New Zealand's kiwi.
Subramanian, Sankar; Huynen, Leon; Millar, Craig D; Lambert, David M
2010-12-15
Kiwi is a highly distinctive, flightless and endangered ratite bird endemic to New Zealand. To understand the patterns of molecular evolution of the nuclear protein-coding genes in brown kiwi (Apteryx australis mantelli) and to determine the timescale of avian history we sequenced a transcriptome obtained from a kiwi embryo using next generation sequencing methods. We then assembled the conserved protein-coding regions using the chicken proteome as a scaffold. Using 1,543 conserved protein coding genes we estimated the neutral evolutionary divergence between the kiwi and chicken to be ~45%, which is approximately equal to the divergence computed for the human-mouse pair using the same set of genes. A large fraction of genes was found to be under high selective constraint, as most of the expressed genes appeared to be involved in developmental gene regulation. Our study suggests a significant relationship between gene expression levels and protein evolution. Using sequences from over 700 nuclear genes we estimated the divergence between the two basal avian groups, Palaeognathae and Neognathae to be 132 million years, which is consistent with previous studies using mitochondrial genes. The results of this investigation revealed patterns of mutation and purifying selection in conserved protein coding regions in birds. Furthermore this study suggests a relatively cost-effective way of obtaining a glimpse into the fundamental molecular evolutionary attributes of a genome, particularly when no closely related genomic sequence is available.
Structure of CARB-4 and AER-1 Carbenicillin-Hydrolyzing β-Lactamases
Sanschagrin, François; Bejaoui, Noureddine; Levesque, Roger C.
1998-01-01
We determined the nucleotide sequences of blaCARB-4 encoding CARB-4 and deduced a polypeptide of 288 amino acids. The gene was characterized as a variant of group 2c carbenicillin-hydrolyzing β-lactamases such as PSE-4, PSE-1, and CARB-3. The level of DNA homology between the bla genes for these β-lactamases varied from 98.7 to 99.9%, while that between these genes and blaCARB-4 encoding CARB-4 was 86.3%. The blaCARB-4 gene was acquired from some other source because it has a G+C content of 39.1%, compared to a G+C content of 67% for typical Pseudomonas aeruginosa genes. DNA sequencing revealed that blaAER-1 shared 60.8% DNA identity with blaPSE-3 encoding PSE-3. The deduced AER-1 β-lactamase peptide was compared to class A, B, C, and D enzymes and had 57.6% identity with PSE-3, including an STHK tetrad at the active site. For CARB-4 and AER-1, conserved canonical amino acid boxes typical of class A β-lactamases were identified in a multiple alignment. Analysis of the DNA sequences flanking blaCARB-4 and blaAER-1 confirmed the importance of gene cassettes acquired via integrons in bla gene distribution. PMID:9687391
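The argument that blaCARB-4 was acquired from another source rests on a G+C content comparison (39.1% for the gene versus about 67% for typical Pseudomonas aeruginosa genes). A minimal sketch of how such a percentage is computed follows; the function name and the toy sequence are hypothetical, and ambiguous bases are simply excluded from the denominator.

```python
def gc_content(seq: str) -> float:
    """Return the G+C percentage of a DNA sequence, counting only A, C, G and T."""
    seq = seq.upper()
    counted = sum(seq.count(base) for base in "ACGT")
    if counted == 0:
        raise ValueError("sequence contains no unambiguous bases")
    return 100.0 * (seq.count("G") + seq.count("C")) / counted


# Toy sequence (hypothetical), not taken from blaCARB-4.
toy_gene = "ATGAAATTTGCATTAACGTATAGC"
print(f"G+C content: {gc_content(toy_gene):.1f}%")
```

A gene whose G+C content departs strongly from the host genome average is commonly read as a sign of horizontal acquisition, which is the inference drawn in the abstract.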
Erdelyi, Peter; Wang, Xing; Suleski, Marina; Wicky, Chantal
2016-01-01
Mi2 proteins are evolutionarily conserved, ATP-dependent chromatin remodelers of the CHD family that play key roles in stem cell differentiation and reprogramming. In Caenorhabditis elegans, the let-418 gene encodes one of the two Mi2 homologs, which is part of at least two chromatin complexes, namely the Nucleosome Remodeling and histone Deacetylase (NuRD) complex and the MEC complex, and functions in larval development, vulval morphogenesis, lifespan regulation, and cell fate determination. To explore the mechanisms involved in the action of LET-418/Mi2, we performed a genome-wide RNA interference (RNAi) screen for suppressors of early larval arrest associated with let-418 mutations. We identified 29 suppressor genes, of which 24 encode chromatin regulators, mostly orthologs of proteins present in transcriptional activator complexes. The remaining five genes vary broadly in their predicted functions. All suppressor genes could suppress multiple aspects of the let-418 phenotype, including developmental arrest and ectopic expression of germline genes in the soma. Analysis of available transcriptomic data and quantitative PCR revealed that LET-418 and the suppressors of early larval arrest are regulating common target genes. These suppressors might represent direct competitors of LET-418 complexes for chromatin regulation of crucial genes involved in the transition to postembryonic development. PMID:28007841
Erdelyi, Peter; Wang, Xing; Suleski, Marina; Wicky, Chantal
2017-02-09
Mi2 proteins are evolutionarily conserved, ATP-dependent chromatin remodelers of the CHD family that play key roles in stem cell differentiation and reprogramming. In Caenorhabditis elegans, the let-418 gene encodes one of the two Mi2 homologs, which is part of at least two chromatin complexes, namely the Nucleosome Remodeling and histone Deacetylase (NuRD) complex and the MEC complex, and functions in larval development, vulval morphogenesis, lifespan regulation, and cell fate determination. To explore the mechanisms involved in the action of LET-418/Mi2, we performed a genome-wide RNA interference (RNAi) screen for suppressors of early larval arrest associated with let-418 mutations. We identified 29 suppressor genes, of which 24 encode chromatin regulators, mostly orthologs of proteins present in transcriptional activator complexes. The remaining five genes vary broadly in their predicted functions. All suppressor genes could suppress multiple aspects of the let-418 phenotype, including developmental arrest and ectopic expression of germline genes in the soma. Analysis of available transcriptomic data and quantitative PCR revealed that LET-418 and the suppressors of early larval arrest are regulating common target genes. These suppressors might represent direct competitors of LET-418 complexes for chromatin regulation of crucial genes involved in the transition to postembryonic development. Copyright © 2017 Erdelyi et al.
Morioka, Kelsie; Yockteng, Roxana; Almeida, Ana M. R.; Specht, Chelsea D.
2015-01-01
The Zingiberales is an order of tropical monocots that exhibits diverse floral morphologies. The evolution of petaloid, laminar stamens, staminodes, and styles contributes to this diversity. The laminar style is a derived trait in the family Cannaceae and plays an important role in pollination as its surface is used for secondary pollen presentation. Previous work in the Zingiberales has implicated YABBY2-like genes, which function in promoting laminar outgrowth, in the evolution of stamen morphology. Here, we investigate the evolution and expression of Zingiberales YABBY2-like genes in order to understand the evolution of the laminar style in Canna. Phylogenetic analyses show that multiple duplication events have occurred in this gene lineage prior to the diversification of the Zingiberales. Reverse transcription-PCR in Canna, Costus, and Musa reveals differential expression across floral organs, taxa, and gene copies, and a role for YABBY2-like genes in the evolution of the laminar style is proposed. Selection tests indicate that almost all sites in conserved domains are under purifying selection, consistent with their functional relevance, and a motif unique to monocot YABBY2-like genes is identified. These results contribute to our understanding of the molecular mechanisms underlying the evolution of floral morphologies. PMID:26734021
Human-specific features of spatial gene expression and regulation in eight brain regions.
Xu, Chuan; Li, Qian; Efimova, Olga; He, Liu; Tatsumoto, Shoji; Stepanova, Vita; Oishi, Takao; Udono, Toshifumi; Yamaguchi, Katsushi; Shigenobu, Shuji; Kakita, Akiyoshi; Nawa, Hiroyuki; Khaitovich, Philipp; Go, Yasuhiro
2018-06-13
Molecular maps of the human brain alone do not inform us of the features unique to humans. Yet, the identification of these features is important for understanding both the evolution and nature of human cognition. Here, we approached this question by analyzing gene expression and H3K27ac chromatin modification data collected in eight brain regions of humans, chimpanzees, gorillas, a gibbon and macaques. An analysis of spatial transcriptome trajectories across eight brain regions in four primate species revealed 1,851 genes showing human-specific transcriptome differences in one or multiple brain regions, in contrast to 240 chimpanzee-specific ones. More than half of these human-specific differences represented elevated expression of genes enriched in neuronal and astrocytic markers in the human hippocampus, while the rest were enriched in microglial markers and displayed human-specific expression in several frontal cortical regions and the cerebellum. An analysis of the predicted regulatory interactions driving these differences revealed the role of transcription factors in species-specific transcriptome changes, while epigenetic modifications were linked to spatial expression differences conserved across species. Published by Cold Spring Harbor Laboratory Press.
Louis, Alexandra; Nguyen, Nga Thi Thuy; Muffato, Matthieu; Roest Crollius, Hugues
2015-01-01
The Genomicus web server (http://www.genomicus.biologie.ens.fr/genomicus) is a visualization tool allowing comparative genomics in four different phyla (Vertebrate, Fungi, Metazoan and Plants). It provides access to genomic information from extant species, as well as ancestral gene content and gene order for vertebrates and flowering plants. Here we present the new features available for vertebrate genomes, with a focus on new graphical tools. The interface to enter the database has been improved, two pairwise genome comparison tools are now available (KaryoView and MatrixView) and the multiple genome comparison tools (PhyloView and AlignView) offer three new kinds of representation and a more intuitive menu. These new developments have been implemented for the Genomicus portal dedicated to vertebrates. This allows the analysis of 68 extant animal genomes, as well as 58 ancestral reconstructed genomes. The Genomicus server also provides access to ancestral gene orders, to facilitate evolutionary and comparative genomics studies, as well as computationally predicted regulatory interactions, thanks to the representation of conserved non-coding elements with their putative gene targets. PMID:25378326
Hendrickson, Peter G; Doráis, Jessie A; Grow, Edward J; Whiddon, Jennifer L; Lim, Jong-Won; Wike, Candice L; Weaver, Bradley D; Pflueger, Christian; Emery, Benjamin R; Wilcox, Aaron L; Nix, David A; Peterson, C Matthew; Tapscott, Stephen J; Carrell, Douglas T; Cairns, Bradley R
2017-06-01
To better understand transcriptional regulation during human oogenesis and preimplantation development, we defined stage-specific transcription, which highlighted the cleavage stage as being highly distinctive. Here, we present multiple lines of evidence that a eutherian-specific multicopy retrogene, DUX4, encodes a transcription factor that activates hundreds of endogenous genes (for example, ZSCAN4, KDM4E and PRAMEF-family genes) and retroviral elements (MERVL/HERVL family) that define the cleavage-specific transcriptional programs in humans and mice. Remarkably, mouse Dux expression is both necessary and sufficient to convert mouse embryonic stem cells (mESCs) into 2-cell-embryo-like ('2C-like') cells, measured here by the reactivation of '2C' genes and repeat elements, the loss of POU5F1 (also known as OCT4) protein and chromocenters, and the conversion of the chromatin landscape (as assessed by transposase-accessible chromatin using sequencing (ATAC-seq)) to a state strongly resembling that of mouse 2C embryos. Thus, we propose mouse DUX and human DUX4 as major drivers of the cleavage or 2C state.
Neuman, Sarah D.; Bashirullah, Arash; Kumar, Justin P.
2016-01-01
The eyes absent (eya) gene of the fruit fly, Drosophila melanogaster, is a member of an evolutionarily conserved gene regulatory network that controls eye formation in all seeing animals. The loss of eya leads to the complete elimination of the compound eye while forced expression of eya in non-retinal tissues is sufficient to induce ectopic eye formation. Within the developing retina eya is expressed in a dynamic pattern and is involved in tissue specification/determination, cell proliferation, apoptosis, and cell fate choice. In this report we explore the mechanisms by which eya expression is spatially and temporally governed in the developing eye. We demonstrate that multiple cis-regulatory elements function cooperatively to control eya transcription and that spacing between a pair of enhancer elements is important for maintaining correct gene expression. Lastly, we show that the loss of eya expression in sine oculis (so) mutants is the result of massive cell death and a progressive homeotic transformation of retinal progenitor cells into head epidermis. PMID:27930646
Cardiac muscle regeneration: lessons from development
Mercola, Mark; Ruiz-Lozano, Pilar; Schneider, Michael D.
2011-01-01
The adult human heart is an ideal target for regenerative intervention since it does not functionally restore itself after injury yet has a modest regenerative capacity that could be enhanced by innovative therapies. Adult cardiac cells with regenerative potential share gene expression signatures with early fetal progenitors that give rise to multiple cardiac cell types, suggesting that the evolutionarily conserved regulatory networks that drive embryonic heart development might also control aspects of regeneration. Here we discuss commonalities of development and regeneration, and the application of the rich developmental biology heritage to achieve therapeutic regeneration of the human heart. PMID:21325131
Venturini, Carola; Hassan, Karl A; Roy Chowdhury, Piklu; Paulsen, Ian T; Walker, Mark J; Djordjevic, Steven P
2013-01-01
Enterohemorrhagic Escherichia coli (EHEC) and atypical enteropathogenic E. coli (aEPEC) are important zoonotic pathogens that increasingly are becoming resistant to multiple antibiotics. Here we describe two plasmids, pO26-CRL125 (125 kb) from a human O26:H- EHEC, and pO111-CRL115 (115 kb) from a bovine O111 aEPEC, that impart resistance to ampicillin, kanamycin, neomycin, streptomycin, sulfathiazole, trimethoprim and tetracycline and both contain atypical class 1 integrons with an identical IS26-mediated deletion in their 3´-conserved segment. Complete sequence analysis showed that pO26-CRL125 and pO111-CRL115 are essentially identical except for a 9.7 kb fragment, present in the backbone of pO26-CRL125 but absent in pO111-CRL115, and several indels. The 9.7 kb fragment encodes IncI-associated genes involved in plasmid stability during conjugation, a putative transposase gene and three imperfect repeats. Contiguous sequence identical to regions within these pO26-CRL125 imperfect repeats was identified in pO111-CRL115 precisely where the 9.7 kb fragment is missing, suggesting it may be mobile. Sequences shared between the plasmids include a complete IncZ replicon, a unique toxin/antitoxin system, IncI stability and maintenance genes, a novel putative serine protease autotransporter, and an IncI1 transfer system including a unique shufflon. Both plasmids carry a Tn21-derivative transposon with an atypical class 1 integron comprising a dfrA5 gene cassette encoding resistance to trimethoprim, and 24 bp of the 3´-conserved segment followed by Tn6026, which encodes resistance to ampicillin, kanamycin, neomycin, streptomycin and sulfathiazole. The Tn21-derivative transposon is linked to a truncated Tn1721, encoding resistance to tetracycline, via a region containing the IncP-1α oriV. Absence of the 5 bp direct repeats flanking Tn3-family transposons indicates that homologous recombination events played a key role in the formation of this complex antibiotic resistance gene locus. Comparative sequence analysis of these closely related plasmids reveals aspects of plasmid evolution in pathogenic E. coli from different hosts.
Apple miRNAs and tasiRNAs with novel regulatory networks
2012-01-01
Background MicroRNAs (miRNAs) and their regulatory functions have been extensively characterized in model species but whether apple has evolved similar or unique regulatory features remains unknown. Results We performed deep small RNA-seq and identified 23 conserved, 10 less-conserved and 42 apple-specific miRNAs or families with distinct expression patterns. The identified miRNAs target 118 genes representing a wide range of enzymatic and regulatory activities. Apple also conserves two TAS gene families with similar but unique trans-acting small interfering RNA (tasiRNA) biogenesis profiles and target specificities. Importantly, we found that miR159, miR828 and miR858 can collectively target up to 81 MYB genes potentially involved in diverse aspects of plant growth and development. These miRNA target sites are differentially conserved among MYBs, which is largely influenced by the location and conservation of the encoded amino acid residues in MYB factors. Finally, we found that 10 of the 19 miR828-targeted MYBs undergo small interfering RNA (siRNA) biogenesis at the 3' cleaved, highly divergent transcript regions, generating over 100 sequence-distinct siRNAs that potentially target over 70 diverse genes as confirmed by degradome analysis. Conclusions Our work identified and characterized apple miRNAs, their expression patterns, targets and regulatory functions. We also discovered that three miRNAs and the ensuing siRNAs exploit both conserved and divergent sequence features of MYB genes to initiate distinct regulatory networks targeting a multitude of genes inside and outside the MYB family. PMID:22704043
Evolutionary analysis of the jacalin-related lectin family genes in 11 fishes.
Cao, Jun; Lv, Yueqing
2016-09-01
Jacalin-related lectins are a type of carbohydrate-binding protein distributed across a wide variety of organisms and involved in some important biological processes. The evolution of this gene family in fishes is unknown. Here, 47 putative jacalin genes in 11 fish species were identified and divided into 4 groups through phylogenetic analysis. Conserved gene organization and motif distribution existed in each group, suggesting their functional conservation. Some fishes have eleven jacalin genes, while others have only one or none in their genomes, suggesting dynamic changes in the number of jacalin genes during the evolution of fishes. Intragenic recombination played a key role in the evolution of jacalin genes. Synteny analyses of jacalin genes in some fishes implied conserved and dynamic evolution characteristics of this gene family and related genome segments. Moreover, a few functional divergence sites were identified within each group pair. Divergent expression profiles of the zebrafish jacalin genes were further investigated under different stresses. The results provide a foundation for exploring the characterization of the jacalin genes in fishes and will offer insights for additional functional studies. Copyright © 2016 Elsevier Ltd. All rights reserved.
Conservation of the structure and organization of lupin mitochondrial nad3 and rps12 genes.
Rurek, M; Oczkowski, M; Augustyniak, H
1998-01-01
A high level of nucleotide sequence conservation of the mitochondrial nad3 and rps12 genes was found in four lupin species. The only differences concern three nucleotides in the Lupinus albus rps12 gene and a three-nucleotide insertion in the L. mutabilis spacer. Northern blot analysis as well as RT-PCR confirmed cotranscription of the L. luteus genes because the transcripts detected were long enough.
Massive Gene Transfer and Extensive RNA Editing of a Symbiotic Dinoflagellate Plastid Genome
Mungpakdee, Sutada; Shinzato, Chuya; Takeuchi, Takeshi; Kawashima, Takeshi; Koyanagi, Ryo; Hisata, Kanako; Tanaka, Makiko; Goto, Hiroki; Fujie, Manabu; Lin, Senjie; Satoh, Nori; Shoguchi, Eiichi
2014-01-01
Genome sequencing of Symbiodinium minutum revealed that 95 of 109 plastid-associated genes have been transferred to the nuclear genome and subsequently expanded by gene duplication. Only 14 genes remain in plastids and occur as DNA minicircles. Each minicircle (1.8–3.3 kb) contains one gene and a conserved noncoding region containing putative promoters and RNA-binding sites. Nine types of RNA editing, including a novel G/U type, were discovered in minicircle transcripts but not in genes transferred to the nucleus. In contrast to DNA editing sites in dinoflagellate mitochondria, which tend to be highly conserved across all taxa, editing sites employed in DNA minicircles are highly variable from species to species. Editing is crucial for core photosystem protein function. It restores evolutionarily conserved amino acids and increases peptidyl hydropathy. It also increases protein plasticity necessary to initiate photosystem complex assembly. PMID:24881086
PanACEA: a bioinformatics tool for the exploration and visualization of bacterial pan-chromosomes.
Clarke, Thomas H; Brinkac, Lauren M; Inman, Jason M; Sutton, Granger; Fouts, Derrick E
2018-06-27
Bacterial pan-genomes, comprised of conserved and variable genes across multiple sequenced bacterial genomes, allow for identification of genomic regions that are phylogenetically discriminating or functionally important. Pan-genomes consist of large amounts of data, which can restrict researchers' ability to locate and analyze these regions. Multiple software packages are available to visualize pan-genomes, but currently their ability to address these concerns is limited by using only pre-computed data sets, prioritizing core over variable gene clusters, or by not accounting for pan-chromosome positioning in the viewer. We introduce PanACEA (Pan-genome Atlas with Chromosome Explorer and Analyzer), which utilizes locally-computed interactive web-pages to view ordered pan-genome data. It consists of multi-tiered, hierarchical display pages that extend from pan-chromosomes to both core and variable regions to single genes. Regions and genes are functionally annotated to allow for rapid searching and visual identification of regions of interest with the option that user-supplied genomic phylogenies and metadata can be incorporated. PanACEA's memory and time requirements are within the capacities of standard laptops. The capability of PanACEA as a research tool is demonstrated by highlighting a variable region important in differentiating strains of Enterobacter hormaechei. PanACEA can rapidly translate the results of pan-chromosome programs into an intuitive and interactive visual representation. It will empower researchers to visually explore and identify regions of the pan-chromosome that are most biologically interesting, and to obtain publication quality images of these regions.
A major gene controls mimicry and crypsis in butterflies and moths
Nadeau, Nicola J.; Pardo-Diaz, Carolina; Whibley, Annabel; Supple, Megan; Saenko, Suzanne V.; Wallbank, Richard W. R.; Wu, Grace C.; Maroja, Luana; Ferguson, Laura; Hanly, Joseph J.; Hines, Heather; Salazar, Camilo; Merrill, Richard; Dowling, Andrea; ffrench-Constant, Richard; Llaurens, Violaine; Joron, Mathieu; McMillan, W. Owen; Jiggins, Chris D.
2016-01-01
The wing patterns of butterflies and moths (Lepidoptera) are diverse and striking examples of evolutionary diversification by natural selection. Lepidopteran wing colour patterns are a key innovation, consisting of arrays of coloured scales. We still lack a general understanding of how these patterns are controlled and if there is any commonality across the 160,000 moth and 17,000 butterfly species. Here, we identify a gene, cortex, through fine-scale mapping using population genomics and gene expression analyses, which regulates pattern switches in multiple species across the mimetic radiation in Heliconius butterflies. cortex belongs to a fast evolving subfamily of the otherwise highly conserved fizzy family of cell cycle regulators, suggesting that it most likely regulates pigmentation patterning through regulation of scale cell development. In parallel with findings in the peppered moth (Biston betularia), our results suggest that this mechanism is common within Lepidoptera and that cortex has become a major target for natural selection acting on colour and pattern variation in this group of insects. PMID:27251285
Perspectives on the mechanism of transcriptional regulation by long non-coding RNAs.
Roberts, Thomas C; Morris, Kevin V; Weinberg, Marc S
2014-01-01
Long non-coding RNAs (lncRNAs) are increasingly being recognized as epigenetic regulators of gene transcription. The diversity and complexity of lncRNA genes means that they exert their regulatory effects by a variety of mechanisms. Although there is still much to be learned about the mechanism of lncRNA function, general principles are starting to emerge. In particular, the application of high throughput (deep) sequencing methodologies has greatly advanced our understanding of lncRNA gene function. lncRNAs function as adaptors that link specific chromatin loci with chromatin-remodeling complexes and transcription factors. lncRNAs can act in cis or trans to guide epigenetic-modifier complexes to distinct genomic sites, or act as scaffolds which recruit multiple proteins simultaneously, thereby coordinating their activities. In this review we discuss the genomic organization of lncRNAs, the importance of RNA secondary structure to lncRNA functionality, the multitude of ways in which they interact with the genome, and what evolutionary conservation tells us about their function.
DCODE.ORG Anthology of Comparative Genomic Tools
DOE Office of Scientific and Technical Information (OSTI.GOV)
Loots, G G; Ovcharenko, I
2005-01-11
Comparative genomics provides the means to demarcate functional regions in anonymous DNA sequences. The successful application of this method to identifying novel genes is currently shifting to deciphering the noncoding encryption of gene regulation across genomes. To facilitate the use of comparative genomics to practical applications in genetics and genomics we have developed several analytical and visualization tools for the analysis of arbitrary sequences and whole genomes. These tools include two alignment tools: zPicture and Mulan; a phylogenetic shadowing tool: eShadow for identifying lineage- and species-specific functional elements; two evolutionary conserved transcription factor analysis tools: rVista and multiTF; a tool for extracting cis-regulatory modules governing the expression of co-regulated genes, CREME; and a dynamic portal to multiple vertebrate and invertebrate genome alignments, the ECR Browser. Here we briefly describe each one of these tools and provide specific examples on their practical applications. All the tools are publicly available at the http://www.dcode.org/ web site.
Dcode.org anthology of comparative genomic tools.
Loots, Gabriela G; Ovcharenko, Ivan
2005-07-01
Comparative genomics provides the means to demarcate functional regions in anonymous DNA sequences. The successful application of this method to identifying novel genes is currently shifting to deciphering the non-coding encryption of gene regulation across genomes. To facilitate the practical application of comparative sequence analysis to genetics and genomics, we have developed several analytical and visualization tools for the analysis of arbitrary sequences and whole genomes. These tools include two alignment tools, zPicture and Mulan; a phylogenetic shadowing tool, eShadow for identifying lineage- and species-specific functional elements; two evolutionary conserved transcription factor analysis tools, rVista and multiTF; a tool for extracting cis-regulatory modules governing the expression of co-regulated genes, Creme 2.0; and a dynamic portal to multiple vertebrate and invertebrate genome alignments, the ECR Browser. Here, we briefly describe each one of these tools and provide specific examples on their practical applications. All the tools are publicly available at the http://www.dcode.org/ website.
Posttranslational modification of autophagy-related proteins in macroautophagy
Xie, Yangchun; Kang, Rui; Sun, Xiaofang; Zhong, Meizuo; Huang, Jin; Klionsky, Daniel J.; Tang, Daolin
2014-01-01
Macroautophagy is an intracellular catabolic process involved in the formation of multiple membrane structures ranging from phagophores to autophagosomes and autolysosomes. Dysfunction of macroautophagy is implicated in both physiological and pathological conditions. To date, 38 autophagy-related (ATG) genes have been identified as controlling these complicated membrane dynamics during macroautophagy in yeast; approximately half of these genes are clearly conserved up to human, and there are additional genes whose products function in autophagy in higher eukaryotes that are not found in yeast. The function of the ATG proteins, in particular their ability to interact with a number of macroautophagic regulators, is modulated by posttranslational modifications (PTMs) such as phosphorylation, glycosylation, ubiquitination, acetylation, lipidation, and proteolysis. In this review, we summarize our current knowledge of the role of ATG protein PTMs and their functional relevance in macroautophagy. Unraveling how these PTMs regulate ATG protein function during macroautophagy will not only reveal fundamental mechanistic insights into the regulatory process, but also provide new therapeutic targets for the treatment of autophagy-associated diseases. PMID:25484070
The gene cortex controls mimicry and crypsis in butterflies and moths.
Nadeau, Nicola J; Pardo-Diaz, Carolina; Whibley, Annabel; Supple, Megan A; Saenko, Suzanne V; Wallbank, Richard W R; Wu, Grace C; Maroja, Luana; Ferguson, Laura; Hanly, Joseph J; Hines, Heather; Salazar, Camilo; Merrill, Richard M; Dowling, Andrea J; ffrench-Constant, Richard H; Llaurens, Violaine; Joron, Mathieu; McMillan, W Owen; Jiggins, Chris D
2016-06-02
The wing patterns of butterflies and moths (Lepidoptera) are diverse and striking examples of evolutionary diversification by natural selection. Lepidopteran wing colour patterns are a key innovation, consisting of arrays of coloured scales. We still lack a general understanding of how these patterns are controlled and whether this control shows any commonality across the 160,000 moth and 17,000 butterfly species. Here, we use fine-scale mapping with population genomics and gene expression analyses to identify a gene, cortex, that regulates pattern switches in multiple species across the mimetic radiation in Heliconius butterflies. cortex belongs to a fast-evolving subfamily of the otherwise highly conserved fizzy family of cell-cycle regulators, suggesting that it probably regulates pigmentation patterning by regulating scale cell development. In parallel with findings in the peppered moth (Biston betularia), our results suggest that this mechanism is common within Lepidoptera and that cortex has become a major target for natural selection acting on colour and pattern variation in this group of insects.
Li, Meiying; Ren, Licheng; Xu, Biyu; Yang, Xiaoliang; Xia, Qiyu; He, Pingping; Xiao, Susheng; Guo, Anping; Hu, Wei; Jin, Zhiqiang
2016-01-01
Plant 14-3-3 proteins act as critical components of various cellular signaling processes and play an important role in regulating multiple physiological processes. However, little is known about the 14-3-3 gene family in banana. In this study, 25 14-3-3 genes were identified from the banana genome. Based on the evolutionary analysis, banana 14-3-3 proteins were clustered into ε and non-ε groups. Conserved motif analysis showed that all identified banana 14-3-3 genes had the typical 14-3-3 motif. The gene structure of banana 14-3-3 genes showed distinct class-specific divergence between the ε group and the non-ε group. Most banana 14-3-3 genes showed strong transcript accumulation changes during fruit development and postharvest ripening in two banana varieties, indicating that they might be involved in regulating fruit development and ripening. Moreover, some 14-3-3 genes also showed great changes after osmotic, cold, and salt treatments in two banana varieties, suggesting their potential role in regulating banana response to abiotic stress. Taken together, this systemic analysis reveals the involvement of banana 14-3-3 genes in fruit development, postharvest ripening, and response to abiotic stress and provides useful information for understanding the functions of 14-3-3 genes in banana. PMID:27713761
Andrews, T Daniel; Gojobori, Takashi
2004-01-01
The PilE protein is the major component of the Neisseria meningitidis pilus, which is encoded by the pilE/pilS locus that includes an expressed gene and eight homologous silent fragments. The silent gene fragments have been shown to recombine through gene conversion with the expressed gene and thereby provide a means by which novel antigenic variants of the PilE protein can be generated. We have analyzed the evolutionary rate of the pilE gene using the nucleotide sequence of two complete pilE/pilS loci. The very high rate of evolution displayed by the PilE protein appears driven by both recombination and positive selection. Within the semivariable region of the pilE and pilS genes, recombination appears to occur within multiple small sequence blocks that lie between conserved sequence elements. Within the hypervariable region, positive selection was identified from comparison of the silent and expressed genes. The unusual gene conversion mechanism that operates at the pilE/pilS locus is a strategy employed by N. meningitidis to enhance mutation of certain regions of the PilE protein. The silent copies of the gene effectively allow "parallelized" evolution of pilE, thus enabling the encoded protein to rapidly explore a large area of sequence space in an effort to find novel antigenic variants.
Conservation and divergence of ADAM family proteins in the Xenopus genome
2010-01-01
Background Members of the disintegrin metalloproteinase (ADAM) family play important roles in cellular and developmental processes through their functions as proteases and/or binding partners for other proteins. The amphibian Xenopus has long been used as a model for early vertebrate development, but genome-wide analyses for large gene families were not possible until the recent completion of the X. tropicalis genome sequence and the availability of large scale expressed sequence tag (EST) databases. In this study we carried out a systematic analysis of the X. tropicalis genome and uncovered several interesting features of ADAM genes in this species. Results Based on the X. tropicalis genome sequence and EST databases, we identified Xenopus orthologues of mammalian ADAMs and obtained full-length cDNA clones for these genes. The deduced protein sequences, synteny and exon-intron boundaries are conserved between most human and X. tropicalis orthologues. The alternative splicing patterns of certain Xenopus ADAM genes, such as adams 22 and 28, are similar to those of their mammalian orthologues. However, we were unable to identify an orthologue for ADAM7 or 8. The Xenopus orthologue of ADAM15, an active metalloproteinase in mammals, does not contain the conserved zinc-binding motif and is hence considered proteolytically inactive. We also found evidence for gain of ADAM genes in Xenopus as compared to other species. There is a homologue of ADAM10 in Xenopus that is missing in most mammals. Furthermore, a single scaffold of X. tropicalis genome contains four genes encoding ADAM28 homologues, suggesting genome duplication in this region. Conclusions Our genome-wide analysis of ADAM genes in X. tropicalis revealed both conservation and evolutionary divergence of these genes in this amphibian species. On the one hand, all ADAMs implicated in normal development and health in other species are conserved in X. tropicalis. On the other hand, some ADAM genes and ADAM protease activities are absent, while other novel ADAM proteins in this species are predicted by this study. The conservation and unique divergence of ADAM genes in Xenopus probably reflect the particular selective pressures these amphibian species faced during evolution. PMID:20630080
O’Brien, Conor S.; Bourdo, Ryan; Bradshaw, William E.; Holzapfel, Christina M.; Cresko, William A.
2012-01-01
Photoperiod, or length of day, has a predictable annual cycle, making it an important cue for the timing of seasonal behavior and development in many organisms. Photoperiod is widely used among temperate and polar animals to regulate the timing of sexual maturation. The proper sensing and interpretation of photoperiod can be tightly tied to an organism’s overall fitness. In photoperiodic mammals and birds the thyroid hormone pathway initiates sexual maturation, but the degree to which this pathway is conserved across other vertebrates is not well known. We use the threespine stickleback Gasterosteus aculeatus, as a representative teleost to quantify the photoperiodic response of key genes in the thyroid hormone pathway under controlled laboratory conditions. We find that the photoperiodic responses of the hormones are largely consistent amongst multiple populations, although differences suggest physiological adaptation to various climates. We conclude that the thyroid hormone pathway initiates sexual maturation in response to photoperiod in G. aculeatus, and our results show that more components of this pathway are conserved among mammals, birds, and teleost fish than was previously known. However, additional endocrinology, cell biology and molecular research will be required to define precisely which aspects of the pathway are conserved across vertebrates. PMID:22504272
O'Brien, Conor S; Bourdo, Ryan; Bradshaw, William E; Holzapfel, Christina M; Cresko, William A
2012-08-01
Photoperiod, or length of day, has a predictable annual cycle, making it an important cue for the timing of seasonal behavior and development in many organisms. Photoperiod is widely used among temperate and polar animals to regulate the timing of sexual maturation. The proper sensing and interpretation of photoperiod can be tightly tied to an organism's overall fitness. In photoperiodic mammals and birds the thyroid hormone pathway initiates sexual maturation, but the degree to which this pathway is conserved across other vertebrates is not well known. We use the threespine stickleback, Gasterosteus aculeatus, as a representative teleost to quantify the photoperiodic response of key genes in the thyroid hormone pathway under controlled laboratory conditions. We find that the photoperiodic responses of the hormones are largely consistent amongst multiple populations, although differences suggest physiological adaptation to various climates. We conclude that the thyroid hormone pathway initiates sexual maturation in response to photoperiod in G. aculeatus, and our results show that more components of this pathway are conserved among mammals, birds, and teleost fish than was previously known. However, additional endocrinology, cell biology and molecular research will be required to define precisely which aspects of the pathway are conserved across vertebrates. Copyright © 2012 Elsevier Inc. All rights reserved.
Ankyrin-repeat containing proteins of microbes: a conserved structure with functional diversity
Al-Khodor, Souhaila; Price, Christopher T.; Kalia, Awdhesh; Kwaik, Yousef Abu
2009-01-01
Summary The ankyrin repeat (ANK) is the most common protein-protein interaction motif in nature and predominantly found in eukaryotic proteins. The genome sequencing of various pathogenic or symbiotic bacteria and eukaryotic viruses identified numerous genes encoding ANK-containing proteins that were proposed to have been acquired from eukaryotes by horizontal gene transfer. However, the recent discovery of additional ANK-containing proteins encoded in the genomes of archaea and free-living bacteria suggests either a more ancient origin of the ANK motif or multiple convergent evolution events. Many bacterial pathogens employ various types of secretion systems to deliver ANK-containing proteins into eukaryotic cells where they mimic or manipulate various host functions. Understanding the molecular and biochemical functions of this family of proteins will enhance our understanding of important host-microbe interactions. PMID:19962898
Ape parasite origins of human malaria virulence genes
Larremore, Daniel B.; Sundararaman, Sesh A.; Liu, Weimin; Proto, William R.; Clauset, Aaron; Loy, Dorothy E.; Speede, Sheri; Plenderleith, Lindsey J.; Sharp, Paul M.; Hahn, Beatrice H.; Rayner, Julian C.; Buckee, Caroline O.
2015-01-01
Antigens encoded by the var gene family are major virulence factors of the human malaria parasite Plasmodium falciparum, exhibiting enormous intra- and interstrain diversity. Here we use network analysis to show that var architecture and mosaicism are conserved at multiple levels across the Laverania subgenus, based on var-like sequences from eight single-species and three multi-species Plasmodium infections of wild-living or sanctuary African apes. Using select whole-genome amplification, we also find evidence of multi-domain var structure and synteny in Plasmodium gaboni, one of the ape Laverania species most distantly related to P. falciparum, as well as a new class of Duffy-binding-like domains. These findings indicate that the modular genetic architecture and sequence diversity underlying var-mediated host-parasite interactions evolved before the radiation of the Laverania subgenus, long before the emergence of P. falciparum. PMID:26456841
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dyer, K.D.; Handen, J.S.; Rosenberg, H.F.
The Charcot-Leyden crystal (CLC) protein, or eosinophil lysophospholipase, is a characteristic protein of human eosinophils and basophils; recent work has demonstrated that the CLC protein is both structurally and functionally related to the galectin family of β-galactoside binding proteins. The galectins as a group share a number of features in common, including a linear ligand binding site encoded on a single exon. In this work, we demonstrate that the intron-exon structure of the gene encoding CLC is analogous to those encoding the galectins. The coding sequence of the CLC gene is divided into four exons, with the entire β-galactoside binding site encoded by exon III. We have isolated CLC β-galactoside binding sites from both orangutan (Pongo pygmaeus) and murine (Mus musculus) genomic DNAs, both encoded on single exons, and noted conservation of the amino acids shown to interact directly with the β-galactoside ligand. The most likely interpretation of these results suggests the occurrence of one or more exon duplication and insertion events, resulting in the distribution of this lectin domain to CLC as well as to the multiple galectin genes. 35 refs., 3 figs.
Analysis of the cytochrome c oxidase subunit II (COX2) gene in giant panda, Ailuropoda melanoleuca.
Ling, S S; Zhu, Y; Lan, D; Li, D S; Pang, H Z; Wang, Y; Li, D Y; Wei, R P; Zhang, H M; Wang, C D; Hu, Y D
2017-01-23
The giant panda, Ailuropoda melanoleuca (Ursidae), has a unique bamboo-based diet; however, this low-energy intake has been sufficient to maintain the metabolic processes of this species since the fourth ice age. As mitochondria are the main sites for energy metabolism in animals, the protein-coding genes involved in mitochondrial respiratory chains, particularly cytochrome c oxidase subunit II (COX2), which is the rate-limiting enzyme in electron transfer, could play an important role in giant panda metabolism. Therefore, the present study aimed to isolate, sequence, and analyze the COX2 DNA from individuals kept at the Giant Panda Protection and Research Center, China, and compare these sequences with those of the other Ursidae family members. Multiple sequence alignment showed that the COX2 gene had three point mutations that defined three haplotypes, with 60% of the sequences corresponding to haplotype I. The neutrality tests revealed that the COX2 gene was conserved throughout evolution, and the maximum likelihood phylogenetic analysis, using homologous sequences from other Ursidae species, showed clustering of the COX2 sequences of giant pandas, suggesting that this gene evolved differently in them.
Should genes with missing data be excluded from phylogenetic analyses?
Jiang, Wei; Chen, Si-Yun; Wang, Hong; Li, De-Zhu; Wiens, John J
2014-11-01
Phylogeneticists often design their studies to maximize the number of genes included but minimize the overall amount of missing data. However, few studies have addressed the costs and benefits of adding characters with missing data, especially for likelihood analyses of multiple loci. In this paper, we address this topic using two empirical data sets (in yeast and plants) with well-resolved phylogenies. We introduce varying amounts of missing data into varying numbers of genes and test whether the benefits of excluding genes with missing data outweigh the costs of excluding the non-missing data that are associated with them. We also test if there is a proportion of missing data in the incomplete genes at which they cease to be beneficial or harmful, and whether missing data consistently bias branch length estimates. Our results indicate that adding incomplete genes generally increases the accuracy of phylogenetic analyses relative to excluding them, especially when there is a high proportion of incomplete genes in the overall dataset (and thus few complete genes). Detailed analyses suggest that adding incomplete genes is especially helpful for resolving poorly supported nodes. Given that we find that excluding genes with missing data often decreases accuracy relative to including these genes (and that decreases are generally of greater magnitude than increases), there is little basis for assuming that excluding these genes is necessarily the safer or more conservative approach. We also find no evidence that missing data consistently bias branch length estimates. Copyright © 2014 Elsevier Inc. All rights reserved.
Nozaki, T; Arase, T; Shigeta, Y; Asai, T; Leustek, T; Takeuchi, T
1998-12-08
A gene encoding adenosine-5'-triphosphate sulfurylase (AS) was cloned from the enteric protozoan parasite Entamoeba histolytica by polymerase chain reaction using degenerate oligonucleotide primers corresponding to conserved regions of the protein from a variety of organisms. The deduced amino acid sequence of E. histolytica AS revealed a calculated molecular mass of 47925 Da and an unusual basic pI of 9.38. The amebic protein sequence showed 23-48% identities with AS from bacteria, yeasts, fungi, plants, and animals with the highest identities being to Synechocystis sp. and Bacillus subtilis (48 and 44%, respectively). Four conserved blocks including putative sulfate-binding and phosphate-binding regions were highly conserved in the E. histolytica AS. The upstream region of the AS gene contained three conserved elements reported for other E. histolytica genes. A recombinant E. histolytica AS revealed enzymatic activity, measured in both the forward and reverse directions. Expression of the E. histolytica AS complemented cysteine auxotrophy of the AS-deficient Escherichia coli strains. Genomic hybridization revealed that the AS gene exists as a single copy gene. In the literature, this is the first description of an AS gene in Protozoa.
Long, Hannah K; Sims, David; Heger, Andreas; Blackledge, Neil P; Kutter, Claudia; Wright, Megan L; Grützner, Frank; Odom, Duncan T; Patient, Roger; Ponting, Chris P; Klose, Robert J
2013-01-01
Two-thirds of gene promoters in mammals are associated with regions of non-methylated DNA, called CpG islands (CGIs), which counteract the repressive effects of DNA methylation on chromatin. In cold-blooded vertebrates, computational CGI predictions often reside away from gene promoters, suggesting a major divergence in gene promoter architecture across vertebrates. By experimentally identifying non-methylated DNA in the genomes of seven diverse vertebrates, we instead reveal that non-methylated islands (NMIs) of DNA are a central feature of vertebrate gene promoters. Furthermore, NMIs are present at orthologous genes across vast evolutionary distances, revealing a surprising level of conservation in this epigenetic feature. By profiling NMIs in different tissues and developmental stages we uncover a unifying set of features that are central to the function of NMIs in vertebrates. Together these findings demonstrate an ancient logic for NMI usage at gene promoters and reveal an unprecedented level of epigenetic conservation across vertebrate evolution. DOI: http://dx.doi.org/10.7554/eLife.00348.001 PMID:23467541
Evans, Tyler G.; Hofmann, Gretchen E.
2012-01-01
Anthropogenic stressors, such as climate change, are driving fundamental shifts in the abiotic characteristics of marine ecosystems. As the environmental aspects of our world's oceans deviate from evolved norms, of major concern is whether extant marine species possess the capacity to cope with such rapid change. In what many scientists consider the post-genomic era, tools that exploit the availability of DNA sequence information are being increasingly recognized as relevant to questions surrounding ocean change and marine conservation. In this review, we highlight the application of high-throughput gene-expression profiling, primarily transcriptomics, to the field of marine conservation physiology. Through the use of case studies, we illustrate how gene expression can be used to standardize metrics of sub-lethal stress, track organism condition in natural environments and bypass phylogenetic barriers that hinder the application of other physiological techniques to conservation. When coupled with fine-scale monitoring of environmental variables, gene-expression profiling provides a powerful approach to conservation capable of informing diverse issues related to ocean change, from coral bleaching to the spread of invasive species. Integrating novel approaches capable of improving existing conservation strategies, including gene-expression profiling, will be critical to ensuring the ecological and economic health of the global ocean. PMID:22566679
Masson, Emmanuelle; Chen, Jian-Min; Audrézet, Marie-Pierre; Cooper, David N; Férec, Claude
2013-01-01
Idiopathic chronic pancreatitis (ICP) has traditionally been defined as chronic pancreatitis in the absence of any obvious precipitating factors (e.g. alcohol abuse) and family history of the disease. Studies over the past 15 years have revealed that ICP has a highly complex genetic architecture involving multiple gene loci. Here, we have attempted to provide a conservative assessment of the major genetic causes of ICP in a sample of 253 young French ICP patients. For the first time, conventional types of mutation (comprising coding sequence variants and variants at intron/exon boundaries) and gross genomic rearrangements were screened for in all four major pancreatitis genes, PRSS1, SPINK1, CTRC and CFTR. For the purposes of the study, synonymous, intronic and 5'- or 3'-untranslated region variants were excluded from the analysis except where there was persuasive evidence of functional consequences. The remaining sequence variants/genotypes were classified into causative, contributory or neutral categories by consideration of (i) their allele frequencies in patient and normal control populations, (ii) their presumed or experimentally confirmed functional effects, (iii) the relative importance of their associated genes in the pathogenesis of chronic pancreatitis and (iv) gene-gene interactions wherever applicable. Adoption of this strategy allowed us to assess the pathogenic relevance of specific variants/genotypes to their respective carriers to an unprecedented degree. The genetic cause of ICP could be assigned in 23.7% of individuals in the study group. A strong genetic susceptibility factor was also present in an additional 24.5% of cases. Taken together, up to 48.2% of the studied ICP patients were found to display evidence of a genetic basis for their pancreatitis. 
Whereas these particular proportions may not be extrapolable to all ICP patients, the approach employed should serve as a useful framework for acquiring a better understanding of the role of genetic factors in causing this oligogenic disease.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhang, Yanfeng; Zheng, Yi; Qin, Ling
Beta-hydroxyacid dehydrogenase (β-HAD) genes have been identified in all sequenced genomes of eukaryotes and prokaryotes. Their gene products catalyze the NAD+- or NADP+-dependent oxidation of various β-hydroxy acid substrates into their corresponding semialdehyde. In many fungal and bacterial genomes, multiple β-HAD genes are observed, leading to the hypothesis that these gene products may have unique, uncharacterized metabolic roles specific to their species. The genomes of Geobacter sulfurreducens and Geobacter metallireducens each contain two potential β-HAD genes. The protein sequences of one pair of these genes, Gs-βHAD (Q74DE4) and Gm-βHAD (Q39R98), have 65% sequence identity and 77% sequence similarity with each other. Both proteins reduce succinic semialdehyde, a metabolite of the GABA shunt. To further explore the structural and functional characteristics of these two β-HADs with a potentially unique substrate specificity, crystal structures for Gs-βHAD and Gm-βHAD in complex with NADP+ were determined to a resolution of 1.89 Å and 2.07 Å, respectively. The structures of both proteins are similar, composed of 14 α-helices and nine β-strands organized into two domains. Domain One (1-165) adopts a typical Rossmann fold composed of two α/β units: a six-strand parallel β-sheet surrounded by six α-helices (α1 – α6) followed by a mixed three-strand β-sheet surrounded by two α-helices (α7 and α8). Domain Two (166-287) is composed of a bundle of seven α-helices (α9 – α14). Four functional regions conserved in all β-HADs are spatially located near each other at the interdomain cleft in both Gs-βHAD and Gm-βHAD with a buried molecule of NADP+. The structural features of Gs-βHAD and Gm-βHAD are described in relation to the four conserved consensus sequences characteristic of β-HADs and the potential biochemical importance of these enzymes as an alternative pathway for the degradation of succinic semialdehyde.
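As an illustration of how a pairwise figure such as the 65% sequence identity quoted above is typically computed, the following is a minimal sketch, not the authors' method. It assumes the two sequences are already aligned to equal length with "-" as the gap character, and it divides by the number of gap-free columns; other conventions (for example, dividing by the full alignment length) give different values. The function name and the toy sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity over gap-free columns of two pre-aligned sequences.

    Assumption: both strings come from the same alignment and use '-' for gaps.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    # Keep only columns where neither sequence has a gap.
    pairs = [(x, y) for x, y in zip(aligned_a, aligned_b) if x != "-" and y != "-"]
    if not pairs:
        return 0.0
    matches = sum(1 for x, y in pairs if x == y)
    return 100.0 * matches / len(pairs)


# Toy example (made-up sequences, not Gs-βHAD or Gm-βHAD):
# five gap-free columns, four of which match.
print(percent_identity("MKV-LA", "MRVALA"))  # 80.0
```

Sequence similarity (the 77% figure) would additionally require a substitution matrix to decide which non-identical residue pairs count as similar, which this sketch does not attempt.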
Jiang, Ke; Zhang, Peng
2011-01-01
TRPA1 is a calcium ion channel protein recently identified as the infrared receptor in pit organ-containing snakes. Therefore, understanding the molecular evolution of TRPA1 may help to illuminate the origin of “heat vision” in snakes and reveal the molecular mechanism of infrared sensitivity for TRPA1. To this end, we sequenced the infrared sensory gene TRPA1 in 24 snake species, representing nine snake families and multiple non-snake outgroups. We found that TRPA1 is under strong positive selection in the pit-bearing snakes studied, but not in other non-pit snakes and non-snake vertebrates. As a comparison, TRPV1, a gene closely related to TRPA1, was found to be under strong purifying selection in all the species studied, with no difference in the strength of selection between pit-bearing snakes and non-pit snakes. This finding demonstrates that the adaptive evolution of TRPA1 specifically occurred within the pit-bearing snakes and may be related to the functional modification for detecting infrared radiation. In addition, by comparing the TRPA1 protein sequences, we identified 11 amino acid sites that were diverged in pit-bearing snakes but conserved in non-pit snakes and other vertebrates, and 21 sites that were diverged only within pit-vipers but conserved in the remaining snakes. These specific amino acid substitutions may be functionally important for infrared sensing. PMID:22163322
Robledo, Marta; Peregrina, Alexandra; Millán, Vicenta; García-Tomsig, Natalia I; Torres-Quesada, Omar; Mateos, Pedro F; Becker, Anke; Jiménez-Zurdo, José I
2017-07-01
Small non-coding RNAs (sRNAs) are expected to have pivotal roles in the adaptive responses underlying symbiosis of nitrogen-fixing rhizobia with legumes. Here, we provide primary insights into the function and activity mechanism of the Sinorhizobium meliloti trans-sRNA NfeR1 (Nodule Formation Efficiency RNA). Northern blot probing and transcription tracking with fluorescent promoter-reporter fusions unveiled high nfeR1 expression in response to salt stress and throughout the symbiotic interaction. The strength and differential regulation of nfeR1 transcription are conferred by a motif, which is conserved in nfeR1 promoter regions in α-proteobacteria. NfeR1 loss-of-function compromised osmoadaptation of free-living bacteria, whilst causing misregulation of salt-responsive genes related to stress adaptation, osmolytes catabolism and membrane trafficking. Nodulation tests revealed that lack of NfeR1 affected competitiveness, infectivity, nodule development and symbiotic efficiency of S. meliloti on alfalfa roots. Comparative computer predictions and a genetic reporter assay evidenced a redundant role of three identical NfeR1 unpaired anti Shine-Dalgarno motifs for targeting and downregulation of translation of multiple mRNAs from transporter genes. Our data provide genetic evidence of the hyperosmotic conditions of the endosymbiotic compartments. NfeR1-mediated gene regulation in response to this cue could contribute to coordinate nutrient uptake with the metabolic reprogramming concomitant to symbiotic transitions. © 2017 Society for Applied Microbiology and John Wiley & Sons Ltd.
Amino acid sequence analysis of the annexin super-gene family of proteins.
Barton, G J; Newman, R H; Freemont, P S; Crumpton, M J
1991-06-15
The annexins are a widespread family of calcium-dependent membrane-binding proteins. No common function has been identified for the family and, until recently, no crystallographic data existed for an annexin. In this paper we draw together 22 available annexin sequences consisting of 88 similar repeat units, and apply the techniques of multiple sequence alignment, pattern matching, secondary structure prediction and conservation analysis to the characterisation of the molecules. The analysis clearly shows that the repeats cluster into four distinct families and that greatest variation occurs within the repeat 3 units. Multiple alignment of the 88 repeats shows amino acids with conserved physicochemical properties at 22 positions, with only Gly at position 23 being absolutely conserved in all repeats. Secondary structure prediction techniques identify five conserved helices in each repeat unit and patterns of conserved hydrophobic amino acids are consistent with one face of a helix packing against the protein core in predicted helices a, c, d, e. Helix b is generally hydrophobic in all repeats, but contains a striking pattern of repeat-specific residue conservation at position 31, with Arg in repeats 4 and Glu in repeats 2, but unconserved amino acids in repeats 1 and 3. This suggests repeats 2 and 4 may interact via a buried saltbridge. The loop between predicted helices a and b of repeat 3 shows features distinct from the equivalent loop in repeats 1, 2 and 4, suggesting an important structural and/or functional role for this region. No compelling evidence emerges from this study for uteroglobin and the annexins sharing similar tertiary structures, or for uteroglobin representing a derivative of a primordial one-repeat structure that underwent duplication to give the present day annexins. The analyses performed in this paper are re-evaluated in the Appendix, in the light of the recently published X-ray structure for human annexin V. 
The structure confirms most of the predictions and shows the power of techniques for the determination of tertiary structural information from the amino acid sequences of an aligned protein family.
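The conservation analysis described in the abstract above (finding alignment positions where a residue is absolutely conserved across all repeats) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' procedure: the input is a list of equal-length aligned sequences with "-" for gaps, conservation is scored simply as the frequency of the most common residue per column, and the function names and toy sequences are hypothetical. Grouping residues by physicochemical property, as the paper does, is not attempted here.

```python
from collections import Counter


def column_conservation(alignment):
    """For each column, return (most common residue, its frequency among all sequences)."""
    if not alignment:
        return []
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all aligned sequences must have the same length")
    result = []
    for i in range(length):
        # Ignore gaps when picking the most common residue.
        column = [seq[i] for seq in alignment if seq[i] != "-"]
        if not column:
            result.append(("-", 0.0))
            continue
        residue, count = Counter(column).most_common(1)[0]
        # Dividing by the number of sequences means gaps count against conservation.
        result.append((residue, count / len(alignment)))
    return result


def absolutely_conserved(alignment):
    """Zero-based indices of columns where every sequence carries the same residue."""
    return [i for i, (_, freq) in enumerate(column_conservation(alignment)) if freq == 1.0]


# Toy alignment (made-up sequences, not annexin repeats).
toy = ["KGLGTDE", "RGAGTNE", "KGFGTDD"]
print(absolutely_conserved(toy))  # [1, 3, 4]
```

Note that the indices are zero-based, whereas alignment positions in the abstract (such as position 23) are presumably one-based.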
Paquet, Nicolas; Bernadet, Marie; Morin, Halima; Traas, Jan; Dron, Michel; Charon, Celine
2005-06-01
Poaceae species present a conserved distichous phyllotaxy (leaf position along the stem) and share common properties with respect to leaf initiation. The goal of this work was to determine if these common traits imply common genes. Therefore, homologues of the maize TERMINAL EAR1 gene in Poaceae were studied. This gene encodes an RNA-binding motif (RRM) protein that is suggested to regulate leaf initiation. Using degenerate primers, one unique tel (terminal ear1-like) gene from seven Poaceae members, covering almost all the phylogenetic tree of the family, was identified by PCR. These genes present a very high degree of similarity, a highly conserved exon-intron structure, and the three RRMs and TEL characteristic motifs. The evolution of tel sequences in Poaceae strongly correlates with the known phylogenetic tree of this family. RT-PCR gene expression analyses show conserved tel expression in the shoot apex in all species, suggesting functional orthology between these genes. In addition, in situ hybridization experiments with specific antisense probes show tel transcript accumulation in all differentiating cells of the leaf, from the recruitment of leaf founder cells to leaf margin cells. Tel expression is not restricted to initiating leaves as it is also found in pro-vascular tissues, root meristems, and immature inflorescences. Therefore, these results suggest that TEL is not only associated with leaf initiation but more generally with cell differentiation in Poaceae.
2011-01-01
Background Understanding polyphenism, the ability of a single genome to express multiple morphologically and behaviourally distinct phenotypes, is an important goal for evolutionary and developmental biology. Polyphenism has been key to the evolution of the Hymenoptera, and particularly the social Hymenoptera where the genome of a single species regulates distinct larval stages, sexual dimorphism and physical castes within the female sex. Transcriptomic analyses of social Hymenoptera will therefore provide unique insights into how changes in gene expression underlie such complexity. Here we describe gene expression in individual specimens of the pre-adult stages, sexes and castes of the key pollinator, the buff-tailed bumblebee Bombus terrestris. Results cDNA was prepared from mRNA from five life cycle stages (one larva, one pupa, one male, one gyne and two workers) and a total of 1,610,742 expressed sequence tags (ESTs) were generated using Roche 454 technology, substantially increasing the sequence data available for this important species. Overlapping ESTs were assembled into 36,354 B. terrestris putative transcripts, and functionally annotated. A preliminary assessment of differences in gene expression across non-replicated specimens from the pre-adult stages, castes and sexes was performed using R-STAT analysis. Individual samples from the life cycle stages of the bumblebee differed in the expression of a wide array of genes, including genes involved in amino acid storage, metabolism, immunity and olfaction. Conclusions Detailed analyses of immune and olfaction gene expression across phenotypes demonstrated how transcriptomic analyses can inform our understanding of processes central to the biology of B. terrestris and the social Hymenoptera in general. 
For example, examination of immunity-related genes identified high conservation of important immunity pathway components across individual specimens from the life cycle stages while olfactory-related genes exhibited differential expression with a wider repertoire of gene expression within adults, especially sexuals, in comparison to immature stages. As there is an absence of replication across the samples, the results of this study are preliminary but provide a number of candidate genes which may be related to distinct phenotypic stage expression. This comprehensive transcriptome catalogue will provide an important gene discovery resource for directed programmes in ecology, evolution and conservation of a key pollinator. PMID:22185240
Federal Register 2010, 2011, 2012, 2013, 2014
2013-04-18
... Basic Impulse Level 4. Dual/Multiple-Voltage Primary Windings 5. Dual/Multiple-Voltage Secondary Windings 6. Loading B. Technological Feasibility 1. General 2. Maximum Technologically Feasible Levels C...
Genome Sequence of the Pea Aphid Acyrthosiphon pisum
2010-01-01
Aphids are important agricultural pests and also biological models for studies of insect-plant interactions, symbiosis, virus vectoring, and the developmental causes of extreme phenotypic plasticity. Here we present the 464 Mb draft genome assembly of the pea aphid Acyrthosiphon pisum. This first published whole genome sequence of a basal hemimetabolous insect provides an outgroup to the multiple published genomes of holometabolous insects. Pea aphids are host-plant specialists, they can reproduce both sexually and asexually, and they have coevolved with an obligate bacterial symbiont. Here we highlight findings from whole genome analysis that may be related to these unusual biological features. These findings include discovery of extensive gene duplication in more than 2000 gene families as well as loss of evolutionarily conserved genes. Gene family expansions relative to other published genomes include genes involved in chromatin modification, miRNA synthesis, and sugar transport. Gene losses include genes central to the IMD immune pathway, selenoprotein utilization, purine salvage, and the entire urea cycle. The pea aphid genome reveals that only a limited number of genes have been acquired from bacteria; thus the reduced gene count of Buchnera does not reflect gene transfer to the host genome. The inventory of metabolic genes in the pea aphid genome suggests that there is extensive metabolite exchange between the aphid and Buchnera, including sharing of amino acid biosynthesis between the aphid and Buchnera. The pea aphid genome provides a foundation for post-genomic studies of fundamental biological questions and applied agricultural problems. PMID:20186266
Horizontal transfer of a large and highly toxic secondary metabolic gene cluster between fungi.
Slot, Jason C; Rokas, Antonis
2011-01-25
Genes involved in intermediary and secondary metabolism in fungi are frequently physically linked or clustered. For example, in Aspergillus nidulans the entire pathway for the production of sterigmatocystin (ST), a highly toxic secondary metabolite and a precursor to the aflatoxins (AF), is located in a ∼54 kb, 23 gene cluster. We discovered that a complete ST gene cluster in Podospora anserina was horizontally transferred from Aspergillus. Phylogenetic analysis shows that most Podospora cluster genes are adjacent to or nested within Aspergillus cluster genes, although the two genera belong to different taxonomic classes. Furthermore, the Podospora cluster is highly conserved in content, sequence, and microsynteny with the Aspergillus ST/AF clusters and its intergenic regions contain 14 putative binding sites for AflR, the transcription factor required for activation of the ST/AF biosynthetic genes. Examination of ∼52,000 Podospora expressed sequence tags identified transcripts for 14 genes in the cluster, with several expressed at multiple life cycle stages. The presence of putative AflR-binding sites and the expression evidence for several cluster genes, coupled with the recent independent discovery of ST production in Podospora [1], suggest that this HGT event probably resulted in a functional cluster. Given the abundance of metabolic gene clusters in fungi, our finding that one of the largest known metabolic gene clusters moved intact between species suggests that such transfers might have significantly contributed to fungal metabolic diversity. Copyright © 2011 Elsevier Ltd. All rights reserved.
The effects of exogenous cortisol on myostatin transcription in rainbow trout, Oncorhynchus mykiss.
Galt, Nicholas J; Froehlich, Jacob Michael; Remily, Ethan A; Romero, Sinibaldo R; Biga, Peggy R
2014-09-01
Glucocorticoids (GCs) strongly regulate myostatin expression in mammals via glucocorticoid response elements (GREs), and bioinformatics methods suggest that this regulatory mechanism is conserved among many vertebrates. However, the multiple myostatin genes found in some fishes may be an exception. In silico promoter analyses of the three putative rainbow trout (Oncorhynchus mykiss) myostatin promoters have failed to identify putative GREs, suggesting a divergence in myostatin function. Therefore, we hypothesized that myostatin mRNA expression is not regulated by glucocorticoids in rainbow trout. In this study, both juvenile rainbow trout and primary trout myoblasts were treated with cortisol to examine the effects on myostatin mRNA expression. Results suggest that exogenous cortisol does not regulate myostatin-1a and -1b expression in vivo, as myostatin mRNA levels were not significantly affected by cortisol treatment in either red or white muscle tissue. In red muscle, myostatin-2a levels were significantly elevated in the cortisol treatment group relative to the control, but not the vehicle control, at both 12 h and 24 h post-injection. As such, it is unclear if cortisol was acting alone or in combination with the vehicle. Cortisol increased myostatin-1b expression in a dose-dependent manner in vitro. Further work is needed to determine if this response is the direct result of cortisol acting on the myostatin-1b promoter or through an alternative mechanism. These results suggest that regulation of myostatin by cortisol may not be as highly conserved as previously thought and support previous work that describes potential functional divergence of the multiple myostatin genes in fishes. Copyright © 2014 Elsevier Inc. All rights reserved.
Peterson, Daniel A.; Planer, Joseph D.; Guruge, Janaki L.; Xue, Lai; Downey-Virgin, Whitt; Goodman, Andrew L.; Seedorf, Henning; Gordon, Jeffrey I.
2015-01-01
The adaptive immune response to the human gut microbiota consists of a complex repertoire of antibodies interacting with a broad range of taxa. Fusing intestinal lamina propria lymphocytes from mice monocolonized with Bacteroides thetaiotaomicron to a myeloma fusion partner allowed us to recover hybridomas that captured naturally primed, antigen-specific antibody responses representing multiple isotypes, including IgA. One of these hybridomas, 260.8, produced a monoclonal antibody that recognizes an epitope specific for B. thetaiotaomicron isolates in a large panel of hospital- and community-acquired Bacteroides. Whole genome transposon mutagenesis revealed a 19-gene locus, involved in LPS O-antigen polysaccharide synthesis and conserved among multiple B. thetaiotaomicron isolates, that is required for 260.8 epitope expression. Mutants in this locus exhibited marked fitness defects in vitro during growth in rich medium and in gnotobiotic mice colonized with defined communities of human gut symbionts. Expression of the 260.8 epitope was sustained during 10 months of daily passage in vitro and during 14 months of monocolonization of gnotobiotic wild-type, Rag1−/−, or Myd88−/− mice. Comparison of gnotobiotic Rag1−/− mice with and without subcutaneous 260.8 hybridomas disclosed that this IgA did not affect B. thetaiotaomicron population density or suppress 260.8 epitope production but did affect bacterial gene expression in ways emblematic of a diminished host innate immune response. Our study illustrates an approach for (i) generating diagnostic antibodies, (ii) characterizing IgA responses along a continuum of specificity/degeneracy that defines the IgA repertoire to gut symbionts, and (iii) identifying immunogenic epitopes that affect competitiveness and help maintain host-microbe mutualism. PMID:25795776
Tbx2/3 is an essential mediator within the Brachyury gene network during Ciona notochord development
José-Edwards, Diana S.; Oda-Ishii, Izumi; Nibu, Yutaka; Di Gregorio, Anna
2013-01-01
T-box genes are potent regulators of mesoderm development in many metazoans. In chordate embryos, the T-box transcription factor Brachyury (Bra) is required for specification and differentiation of the notochord. In some chordates, including the ascidian Ciona, members of the Tbx2 subfamily of T-box genes are also expressed in this tissue; however, their regulatory relationships with Bra and their contributions to the development of the notochord remain uncharacterized. We determined that the notochord expression of Ciona Tbx2/3 (Ci-Tbx2/3) requires Ci-Bra, and identified a Ci-Tbx2/3 notochord CRM that necessitates multiple Ci-Bra binding sites for its activity. Expression of mutant forms of Ci-Tbx2/3 in the developing notochord revealed a role for this transcription factor primarily in convergent extension. Through microarray screens, we uncovered numerous Ci-Tbx2/3 targets, some of which overlap with known Ci-Bra-downstream notochord genes. Among the Ci-Tbx2/3 notochord targets are evolutionarily conserved genes, including caspases, lineage-specific genes, such as Noto4, and newly identified genes, such as MLKL. This work sheds light on a large section of the notochord regulatory circuitry controlled by T-box factors, and reveals new components of the complement of genes required for the proper formation of this structure. PMID:23674602
Targeting Conserved Genes in Penicillium Species.
Peterson, Stephen W
2017-01-01
Polymerase chain reaction amplification of conserved genes followed by sequence analysis provides a very powerful tool for the identification of toxigenic as well as non-toxigenic Penicillium species. Sequences are obtained by amplifying the gene fragment and then sequencing it, either via capillary electrophoresis of dideoxynucleotide-labeled fragments or by NGS. The sequences are compared to a database of validated isolates. Identification of the species indicates the potential of the fungus to make particular mycotoxins.
Cox, Murray P; Dong, Ting; Shen, Genggeng; Dalvi, Yogesh; Scott, D Barry; Ganley, Austen R D
2014-03-01
Polyploidy, a state in which the chromosome complement has undergone an increase, is a major force in evolution. Understanding the consequences of polyploidy has received much attention, and allopolyploids, which result from the union of two different parental genomes, are of particular interest because they must overcome a suite of biological responses to this merger, known as "genome shock." A key question is what happens to gene expression of the two gene copies following allopolyploidization, but until recently the tools to answer this question on a genome-wide basis were lacking. Here we utilize high throughput transcriptome sequencing to produce the first genome-wide picture of gene expression response to allopolyploidy in fungi. A novel pipeline for assigning sequence reads to the gene copies was used to quantify their expression in a fungal allopolyploid. We find that the transcriptional response to allopolyploidy is predominantly conservative: both copies of most genes are retained; over half the genes inherit parental gene expression patterns; and parental differential expression is often lost in the allopolyploid. Strikingly, the patterns of gene expression change are highly concordant with the genome-wide expression results of a cotton allopolyploid. The very different nature of these two allopolyploids implies a conserved, eukaryote-wide transcriptional response to genome merger. We provide evidence that the transcriptional responses we observe are mostly driven by intrinsic differences between the regulatory systems in the parent species, and from this propose a mechanistic model in which the cross-kingdom conservation in transcriptional response reflects conservation of the mutational processes underlying eukaryotic gene regulatory evolution. 
This work provides a platform to develop a universal understanding of gene expression response to allopolyploidy and suggests that allopolyploids are an exceptional system to investigate gene regulatory changes that have evolved in the parental species prior to allopolyploidization.
Conservation and divergence of microRNAs in Populus
Barakat, Abdelali; Wall, Phillip K; DiLoreto, Scott; dePamphilis, Claude W; Carlson, John E
2007-01-01
Background MicroRNAs (miRNAs) are small RNAs (sRNA) ~21 nucleotides in length that negatively control gene expression by cleaving or inhibiting the translation of target gene transcripts. miRNAs have been extensively analyzed in Arabidopsis and rice and partially investigated in other non-model plant species. To date, 109 and 62 miRNA families have been identified in Arabidopsis and rice respectively. However, only 33 miRNAs have been identified from the genome of the model tree species (Populus trichocarpa), of which 11 are Populus specific. The low number of miRNA families previously identified in Populus, compared with the number of families identified in Arabidopsis and rice, suggests that many miRNAs still remain to be discovered in Populus. In this study, we analyzed expressed small RNAs from leaves and vegetative buds of Populus using high throughput pyrosequencing. Results Analysis of almost eighty thousand small RNA reads allowed us to identify 123 new sequences belonging to previously identified miRNA families as well as 48 new miRNA families that could be Populus-specific. Comparison of the organization of miRNA families in Populus, Arabidopsis and rice showed that miRNA family sizes were generally expanded in Populus. The putative targets of non-conserved miRNA include both previously identified targets as well as several new putative target genes involved in development, resistance to stress, and other cellular processes. Moreover, almost half of the genes predicted to be targeted by non-conserved miRNAs appear to be Populus-specific. Comparative analyses showed that genes targeted by conserved and non-conserved miRNAs are biased mainly towards development, electron transport and signal transduction processes. Similar results were found for non-conserved miRNAs from Arabidopsis. Conclusion Our results suggest that while there is a conserved set of miRNAs among plant species, a large fraction of miRNAs vary among species. 
The non-conserved miRNAs may regulate cellular, physiological or developmental processes specific to the taxa that produce them, as appears likely to be the case for those miRNAs that have only been observed in Populus. Non-conserved and conserved miRNAs seem to target genes with similar biological functions indicating that similar selection pressures are acting on both types of miRNAs. The expansion in the number of most conserved miRNAs in Populus relative to Arabidopsis, may be linked to the recent genome duplication in Populus, the slow evolution of the Populus genome, or to differences in the selection pressure on duplicated miRNAs in these species. PMID:18166134
de Cambiaire, Jean-Charles; Otis, Christian; Turmel, Monique; Lemieux, Claude
2007-01-01
Background In the Chlorophyta – the green algal phylum comprising the classes Prasinophyceae, Ulvophyceae, Trebouxiophyceae and Chlorophyceae – the chloroplast genome displays a highly variable architecture. While chlorophycean chloroplast DNAs (cpDNAs) deviate considerably from the ancestral pattern described for the prasinophyte Nephroselmis olivacea, the degree of remodelling sustained by the two ulvophyte cpDNAs completely sequenced to date is intermediate relative to those observed for chlorophycean and trebouxiophyte cpDNAs. Chlorella vulgaris (Chlorellales) is currently the only photosynthetic trebouxiophyte whose complete cpDNA sequence has been reported. To gain insights into the evolutionary trends of the chloroplast genome in the Trebouxiophyceae, we sequenced cpDNA from the filamentous alga Leptosira terrestris (Ctenocladales). Results The 195,081-bp Leptosira chloroplast genome resembles the 150,613-bp Chlorella genome in lacking a large inverted repeat (IR) but differs greatly in gene order. Six of the conserved genes present in Chlorella cpDNA are missing from the Leptosira gene repertoire. The 106 conserved genes, four introns and 11 free standing open reading frames (ORFs) account for 48.3% of the genome sequence. This is the lowest gene density yet observed among chlorophyte cpDNAs. Contrary to the situation in Chlorella but similar to that in the chlorophycean Scenedesmus obliquus, the gene distribution is highly biased over the two DNA strands in Leptosira. Nine genes, compared to only three in Chlorella, have significantly expanded coding regions relative to their homologues in ancestral-type green algal cpDNAs. As observed in chlorophycean genomes, the rpoB gene is fragmented into two ORFs. Short repeats account for 5.1% of the Leptosira genome sequence and are present mainly in intergenic regions. 
Conclusion Our results highlight the great plasticity of the chloroplast genome in the Trebouxiophyceae and indicate that the IR was lost on at least two separate occasions. The intriguing similarities of the derived features exhibited by Leptosira cpDNA and its chlorophycean counterparts suggest that the same evolutionary forces shaped the IR-lacking chloroplast genomes in these two algal lineages. PMID:17610731
RNAi for functional genomics in plants.
McGinnis, Karen M
2010-03-01
RNAi refers to several different types of gene silencing mediated by small, dsRNA molecules. Over the course of 20 years, the scientific understanding of RNAi has developed from the initial observation of unexpected expression patterns to a sophisticated understanding of a multi-faceted, evolutionarily conserved network of mechanisms that regulate gene expression in many organisms. It has also been developed as a genetic tool that can be exploited in a wide range of species. Because transgene-induced RNAi has been effective at silencing one or more genes in a wide range of plants, this technology also bears potential as a powerful functional genomics tool across the plant kingdom. Transgene-induced RNAi has indeed been shown to be an effective mechanism for silencing many genes in many organisms, but the results from multiple projects which attempted to exploit RNAi on a genome-wide scale suggest that there is a great deal of variation in the silencing efficacy between transgenic events, silencing targets and silencing-induced phenotype. The results from these projects indicate several important variables that should be considered in experimental design prior to the initiation of functional genomics efforts based on RNAi silencing. In recent years, alternative strategies have been developed for targeted gene silencing, and a combination of approaches may also enhance the use of targeted gene silencing for functional genomics.
Samson, Marie-Laure
2008-01-01
Background The Drosophila gene embryonic lethal abnormal visual system (elav) is the prototype of a gene family present in all metazoans. Its members encode structurally conserved neuronal proteins with three RNA Recognition Motifs (RRM) but they paradoxically act at diverse levels of post-transcriptional regulation. In an attempt to understand the history of this family, we searched for orthologs in eleven completely sequenced genomes, including those of humans, D. melanogaster and C. elegans, for which cDNAs are available. Results We analyzed 23 orthologs/paralogs of elav, and found evidence of gain/loss of gene copy number. For one set of genes, including elav itself, the coding sequences are free of introns and their products most resemble ELAV. The remaining genes show remarkable conservation of their exon organization, and their products most resemble FNE and RBP9, proteins encoded by the two elav paralogs of Drosophila. Remarkably, three of the conserved exon junctions are both close to structural elements, involved respectively in protein-RNA interactions and in the regulation of sub-cellular localization, and in the vicinity of diverse sequence variations. Conclusion The data indicate that the essential elav gene of Drosophila is newly emerged, restricted to dipterans and of retrotransposed origin. We propose that the conserved exon junctions constitute potential sites for sequence/function modifications, and that RRM binding proteins, whose function relies upon plastic RNA-protein interactions, may have played an important role in brain evolution. PMID:18715504
Skuse, David H.; Lori, Adriana; Cubells, Joseph F.; Lee, Irene; Conneely, Karen N.; Puura, Kaija; Lehtimäki, Terho; Binder, Elisabeth B.; Young, Larry J.
2014-01-01
The neuropeptides oxytocin and vasopressin are evolutionarily conserved regulators of social perception and behavior. Evidence is building that they are critically involved in the development of social recognition skills within rodent species, primates, and humans. We investigated whether common polymorphisms in the genes encoding the oxytocin and vasopressin 1a receptors influence social memory for faces. Our sample comprised 198 families, from the United Kingdom and Finland, in whom a single child had been diagnosed with high-functioning autism. Previous research has shown that impaired social perception, characteristic of autism, extends to the first-degree relatives of autistic individuals, implying heritable risk. Assessments of face recognition memory, discrimination of facial emotions, and direction of gaze detection were standardized for age (7–60 y) and sex. A common SNP in the oxytocin receptor (rs237887) was strongly associated with recognition memory in combined probands, parents, and siblings after correction for multiple comparisons. Homozygotes for the ancestral A allele had impairments in the range −0.6 to −1.15 SD scores, irrespective of their diagnostic status. Our findings imply that a critical role for the oxytocin system in social recognition has been conserved across perceptual boundaries through evolution, from olfaction in rodents to visual memory in humans. PMID:24367110
Priyadarshini, P; Tiwari, K; Das, A; Kumar, D; Mishra, M N; Desikan, P; Nath, G
2017-02-01
The aim was to evaluate the sensitivity and specificity of a new nested set of primers designed for the detection of Mycobacterium tuberculosis complex targeting a highly conserved heat shock protein gene (hsp65). The nested primers were designed from a multiple sequence alignment, using the hsp65 nucleotide sequence of M. tuberculosis H37Rv as the reference. Multidrug-resistant Mycobacterium species along with other non-mycobacterial and fungal species were included to evaluate the specificity of the M. tuberculosis hsp65 gene-specific primers. The sensitivity of the primers was determined using serial 10-fold dilutions, and was 100% for M. tuberculosis complex, as shown by the bands. None of the non-M. tuberculosis complex bacterial and fungal species yielded any band on nested polymerase chain reaction (PCR). The first round of amplification could amplify 0.3 ng of the template DNA, while nested PCR could detect 0.3 pg. The present hsp65-specific primers have been observed to be sensitive, specific and cost-effective, without requiring interpretation of biochemical tests, real-time PCR, sequencing or high-performance liquid chromatography. These primer sets do not have the drawbacks associated with protocols that target insertion sequence 6110, 16S rDNA, rpoB, recA and MPT 64.
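The two detection limits quoted above (0.3 ng for the first round, 0.3 pg for nested PCR) can be compared with a short calculation. The sketch below is only arithmetic on those two figures; the helper name and the choice of femtograms as the unit are assumptions made for illustration, not part of the reported protocol.

```python
# Detection limits quoted in the abstract, expressed in femtograms
# so that all arithmetic stays in integers.
FIRST_ROUND_LIMIT_FG = 300_000  # 0.3 ng
NESTED_LIMIT_FG = 300           # 0.3 pg


def tenfold_series(start_fg, steps):
    """Template amounts for a serial 10-fold dilution (hypothetical helper)."""
    return [start_fg // 10 ** i for i in range(steps + 1)]


# How many times less template the nested reaction needs.
fold_gain = FIRST_ROUND_LIMIT_FG // NESTED_LIMIT_FG

# Three 10-fold steps take the first-round limit down to the nested limit.
series = tenfold_series(FIRST_ROUND_LIMIT_FG, 3)

print(fold_gain)
print(series)
```

Under these figures the nested reaction detects roughly a thousand times less template, which corresponds to three steps of the 10-fold dilution series.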
Elevated Rate of Genome Rearrangements in Radiation-Resistant Bacteria.
Repar, Jelena; Supek, Fran; Klanjscek, Tin; Warnecke, Tobias; Zahradka, Ksenija; Zahradka, Davor
2017-04-01
A number of bacterial, archaeal, and eukaryotic species are known for their resistance to ionizing radiation. One of the challenges these species face is a potent environmental source of DNA double-strand breaks, potential drivers of genome structure evolution. Efficient and accurate DNA double-strand break repair systems have been demonstrated in several unrelated radiation-resistant species and are putative adaptations to the DNA damaging environment. Such adaptations are expected to compensate for the genome-destabilizing effect of environmental DNA damage and may be expected to result in a more conserved gene order in radiation-resistant species. However, here we show that rates of genome rearrangements, measured as loss of gene order conservation with time, are higher in radiation-resistant species in multiple, phylogenetically independent groups of bacteria. Comparison of indicators of selection for genome organization between radiation-resistant and phylogenetically matched, nonresistant species argues against tolerance to disruption of genome structure as a strategy for radiation resistance. Interestingly, an important mechanism affecting genome rearrangements in prokaryotes, the symmetrical inversions around the origin of DNA replication, shapes genome structure of both radiation-resistant and nonresistant species. In conclusion, the opposing effects of environmental DNA damage and DNA repair result in elevated rates of genome rearrangements in radiation-resistant bacteria.
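One simple way to put a number on "gene order conservation" is the fraction of neighbouring gene pairs in one genome that are still neighbours in another. The sketch below uses that definition as an assumption; the abstract does not specify the measure the authors used, the gene names are invented, and the genomes are treated as linear rather than circular for brevity.

```python
def adjacent_pairs(order):
    """Unordered pairs of genes that sit next to each other in a gene list."""
    return {frozenset(pair) for pair in zip(order, order[1:])}


def gene_order_conservation(order_a, order_b):
    """Fraction of adjacent gene pairs in A that are also adjacent in B.

    Only genes present in both genomes are compared (an assumed simplification).
    """
    shared = set(order_a) & set(order_b)
    a = [gene for gene in order_a if gene in shared]
    b = [gene for gene in order_b if gene in shared]
    pairs_a = adjacent_pairs(a)
    if not pairs_a:
        return 0.0
    return len(pairs_a & adjacent_pairs(b)) / len(pairs_a)


# Hypothetical genomes: genes g3 and g4 have swapped places in the second one.
genome_a = ["g1", "g2", "g3", "g4", "g5"]
genome_b = ["g1", "g2", "g4", "g3", "g5"]
goc = gene_order_conservation(genome_a, genome_b)
print(goc)
```

Tracking how such a score falls with increasing divergence time between genome pairs would give a rearrangement rate in the sense described above.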
Tempo and Mode of Gene Duplication in Mammalian Ribosomal Protein Evolution
Gajdosik, Matthew D.; Simon, Amanda; Nelson, Craig E.
2014-01-01
Gene duplication has been widely recognized as a major driver of evolutionary change and organismal complexity through the generation of multi-gene families. Therefore, understanding the forces that govern the evolution of gene families through the retention or loss of duplicated genes is fundamentally important in our efforts to study genome evolution. Previous work from our lab has shown that ribosomal protein (RP) genes constitute one of the largest classes of conserved duplicated genes in mammals. This result was surprising due to the fact that ribosomal protein genes evolve slowly and transcript levels are very tightly regulated. In our present study, we identified and characterized all RP duplicates in eight mammalian genomes in order to investigate the tempo and mode of ribosomal protein family evolution. We show that a sizable number of duplicates are transcriptionally active and are very highly conserved. Furthermore, we conclude that existing gene duplication models do not readily account for the preservation of a very large number of intact retroduplicated ribosomal protein (RT-RP) genes observed in mammalian genomes. We suggest that selection against dominant-negative mutations may underlie the unexpected retention and conservation of duplicated RP genes, and may shape the fate of newly duplicated genes, regardless of duplication mechanism. PMID:25369106
CORECLUST: identification of the conserved CRM grammar together with prediction of gene regulation.
Nikulova, Anna A; Favorov, Alexander V; Sutormin, Roman A; Makeev, Vsevolod J; Mironov, Andrey A
2012-07-01
Identification of transcriptional regulatory regions and tracing their internal organization are important for understanding the eukaryotic cell machinery. Cis-regulatory modules (CRMs) of higher eukaryotes are believed to possess a regulatory 'grammar', or preferred arrangement of binding sites, that is crucial for proper regulation and thus tends to be evolutionarily conserved. Here, we present a method, CORECLUST (COnservative REgulatory CLUster STructure), that predicts CRMs based on a set of positional weight matrices. Given regulatory regions of orthologous and/or co-regulated genes, CORECLUST constructs a CRM model by revealing the conserved rules that describe the relative location of binding sites. The constructed model can subsequently be used for the genome-wide prediction of similar CRMs, and thus detection of co-regulated genes, and for the investigation of the regulatory grammar of the system. Compared with related methods, CORECLUST shows better performance at identification of CRMs conferring muscle-specific gene expression in vertebrates and early-developmental CRMs in Drosophila.
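Positional weight matrices are the input that methods of this kind start from. The sketch below shows only the generic scoring step, turning base counts into log-odds scores and sliding them along a sequence; it is not CORECLUST's algorithm, and the count matrix, the pseudocount and the test sequence are all invented for illustration.

```python
import math

BASES = "ACGT"


def log_odds_matrix(counts, pseudocount=1.0, background=0.25):
    """Convert per-position base counts into log2-odds scores."""
    matrix = []
    for column in counts:
        total = sum(column[b] for b in BASES) + pseudocount * len(BASES)
        matrix.append(
            {b: math.log2((column[b] + pseudocount) / total / background) for b in BASES}
        )
    return matrix


def scan(sequence, matrix):
    """Score every window of the sequence; return (start, score) pairs."""
    width = len(matrix)
    hits = []
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        score = sum(matrix[i][base] for i, base in enumerate(window))
        hits.append((start, score))
    return hits


# Hypothetical counts for a 4-bp motif whose consensus is "TGAC".
counts = [
    {"A": 0, "C": 1, "G": 1, "T": 8},
    {"A": 1, "C": 0, "G": 9, "T": 0},
    {"A": 9, "C": 0, "G": 1, "T": 0},
    {"A": 0, "C": 8, "G": 1, "T": 1},
]
pwm = log_odds_matrix(counts)
best_start, best_score = max(scan("AATGACGT", pwm), key=lambda hit: hit[1])
print(best_start, round(best_score, 2))
```

A grammar-based method would go further and model the spacing and order of several such hits within a region, which this sketch does not attempt.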
Kwantes, Michiel; Liebsch, Daniela; Verelst, Wim
2012-01-01
Land plants have a remarkable life cycle that alternates between a diploid sporophytic and a haploid gametophytic generation, both of which are multicellular and changed drastically during evolution. Classical MIKC MADS-domain (MIKC^C) transcription factors are famous for their role in sporophytic development and are considered crucial for its evolution. In contrast, little is known about the regulation of gametophyte development. Recent evidence indicated that the closely related MIKC* MADS-domain proteins are important for the functioning of the Arabidopsis thaliana male gametophyte (pollen). Furthermore, several MIKC* genes are also expressed in the haploid generation of bryophytes. It is therefore tempting to hypothesize that MIKC* genes have a role in the evolution of the gametophytic phase similar to that of MIKC^C genes in the sporophyte. To get a comprehensive view of the involvement of MIKC* genes in gametophyte evolution, we isolated them from a broad variety of vascular plants, including the lycophyte Selaginella moellendorffii, the fern Ceratopteris richardii, and representatives of several flowering plant lineages. Phylogenetic analysis revealed an extraordinary conservation not found in MIKC^C genes. Moreover, expression and interaction studies suggest that a conserved and characteristic network operates in the gametophytes of all tested model organisms. Additionally, we found that MIKC* genes probably evolved from an ancestral MIKC^C-like gene by a duplication in the Keratin-like region. We propose that this event facilitated the independent evolution of MIKC* and MIKC^C protein networks and argue that whereas MIKC^C genes diversified and attained new functions, MIKC* genes retained a conserved role in the gametophyte during land plant evolution.
Allen, Alexandra M; Lexer, Christian; Hiscock, Simon J
2010-11-01
Fertilization in angiosperms depends on a complex cellular "courtship" between haploid pollen and diploid pistil. These pollen-pistil interactions are regulated by a diversity of molecules, many of which remain to be identified and characterized. Thus, it is unclear to what extent these processes are conserved among angiosperms, a fact confounded by limited sampling across taxa. Here, we report the analysis of pistil-expressed genes in Senecio squalidus (Asteraceae), a species from euasterid II, a major clade for which there are currently no data on pistil-expressed genes. Species from the Asteraceae characteristically have a "semidry stigma," intermediate between the "wet" and "dry" stigmas typical of the majority of angiosperms. Construction of pistil-enriched cDNA libraries for S. squalidus allowed us to address two hypotheses: (1) stigmas of S. squalidus will express genes common to wet and dry stigmas and genes specific to the semidry stigma characteristic of the Asteraceae; and (2) genes potentially essential for pistil function will be conserved between diverse angiosperm groups and therefore common to all currently available pistil transcriptome data sets, including S. squalidus. Our data support both these hypotheses. The S. squalidus pistil transcriptome contains novel genes and genes previously identified in pistils of species with dry stigmas and wet stigmas. Comparative analysis of the five pistil transcriptomes currently available (Oryza sativa, Crocus sativus, Arabidopsis thaliana, Nicotiana tabacum, and S. squalidus), representing four major angiosperm clades and the three stigma states, identified novel genes and conserved genes potentially regulating pollen-pistil interaction pathways common to monocots and eudicots.
Complex modulation of the Aedes aegypti transcriptome in response to dengue virus infection.
Bonizzoni, Mariangela; Dunn, W Augustine; Campbell, Corey L; Olson, Ken E; Marinotti, Osvaldo; James, Anthony A
2012-01-01
Dengue fever is the most important arboviral disease world-wide, with Aedes aegypti being the major vector. Interactions between the mosquito host and dengue viruses (DENV) are complex and vector competence varies among geographically-distinct Ae. aegypti populations. Additionally, dengue is caused by four antigenically-distinct viral serotypes (DENV1-4), each with multiple genotypes. Each virus genotype interacts differently with vertebrate and invertebrate hosts. Analyses of alterations in mosquito transcriptional profiles during DENV infection are expected to provide the basis for identifying networks of genes involved in responses to viruses and contribute to the molecular-genetic understanding of vector competence. In addition, this knowledge is anticipated to support the development of novel disease-control strategies. RNA-seq technology was used to assess genome-wide changes in transcript abundance at 1, 4 and 14 days following DENV2 infection in carcasses, midguts and salivary glands of the Ae. aegypti Chetumal strain. DENV2 affected the expression of 397 Ae. aegypti genes, most of which were down-regulated by viral infection. Differential accumulation of transcripts was mainly tissue- and time-specific. Comparisons of our data with other published reports reveal conservation of functional classes, but limited concordance of specific mosquito genes responsive to DENV2 infection. These results indicate the necessity of additional studies of mosquito-DENV interactions, specifically those focused on recently-derived mosquito strains with multiple dengue virus serotypes and genotypes.
RGAugury: a pipeline for genome-wide prediction of resistance gene analogs (RGAs) in plants.
Li, Pingchuan; Quan, Xiande; Jia, Gaofeng; Xiao, Jin; Cloutier, Sylvie; You, Frank M
2016-11-02
Resistance gene analogs (RGAs), such as NBS-encoding proteins, receptor-like protein kinases (RLKs) and receptor-like proteins (RLPs), are potential R-genes that contain specific conserved domains and motifs. Thus, RGAs can be predicted based on their conserved structural features using bioinformatics tools. Computer programs have been developed for the identification of individual domains and motifs from the protein sequences of RGAs, but none offers a systematic assessment of the different types of RGAs. A user-friendly and efficient pipeline is needed for large-scale genome-wide RGA predictions of the growing number of sequenced plant genomes. An integrative pipeline, named RGAugury, was developed to automate RGA prediction. The pipeline first identifies RGA-related protein domains and motifs, namely nucleotide binding site (NB-ARC), leucine rich repeat (LRR), transmembrane (TM), serine/threonine and tyrosine kinase (STTK), lysin motif (LysM), coiled-coil (CC) and Toll/Interleukin-1 receptor (TIR). RGA candidates are identified and classified into four major families based on the presence of combinations of these RGA domains and motifs: NBS-encoding, TM-CC, and membrane associated RLP and RLK. All time-consuming analyses of the pipeline are parallelized to improve performance. The pipeline was evaluated using the well-annotated Arabidopsis genome. A total of 98.5, 85.2, and 100% of the reported NBS-encoding genes, membrane associated RLPs and RLKs were validated, respectively. The pipeline was also successfully applied to predict RGAs for 50 sequenced plant genomes. A user-friendly web interface was implemented to ease command line operations, facilitate visualization and simplify result management for multiple datasets. RGAugury is an efficient, integrative bioinformatics tool for large-scale genome-wide identification of RGAs. It is freely available at Bitbucket: https://bitbucket.org/yaanlpc/rgaugury .
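The classification step described in this abstract, assigning a candidate to one of four families from the combination of domains it carries, can be sketched roughly as follows. This is a minimal illustration, not RGAugury's actual code: the function name `classify_rga`, the domain labels as plain strings, and in particular the specific decision rules are assumptions made here for the example; the abstract states only that families are assigned from combinations of these domains and motifs.

```python
from typing import Optional, Set

# Extracellular receptor motifs named in the abstract.
EXTRACELLULAR = {"LRR", "LysM"}


def classify_rga(domains: Set[str]) -> Optional[str]:
    """Assign a candidate protein to an RGA family from its domain set.

    The rules below are simplifying assumptions for illustration only;
    they are not taken from the RGAugury source or documentation.
    """
    if "NB-ARC" in domains:
        return "NBS-encoding"
    has_tm = "TM" in domains
    has_ecto = bool(domains & EXTRACELLULAR)
    if has_tm and has_ecto and "STTK" in domains:
        return "RLK"    # membrane receptor with a kinase domain
    if has_tm and has_ecto:
        return "RLP"    # membrane receptor without a kinase domain
    if has_tm and "CC" in domains:
        return "TM-CC"
    return None         # not an RGA candidate under these assumed rules


# Hypothetical domain annotations for four made-up proteins.
examples = {
    "protA": {"TIR", "NB-ARC", "LRR"},
    "protB": {"LRR", "TM", "STTK"},
    "protC": {"LysM", "TM"},
    "protD": {"TM", "CC"},
}
families = {name: classify_rga(doms) for name, doms in examples.items()}
```

Checking the rules in a fixed order makes the families mutually exclusive, which is one simple design choice; a real pipeline may resolve overlapping domain combinations differently.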
SoxB2 in sea urchin development: implications in neurogenesis, ciliogenesis and skeletal patterning.
Anishchenko, Evgeniya; Arnone, Maria Ina; D'Aniello, Salvatore
2018-01-01
Current studies in evolutionary developmental biology are focused on the reconstruction of gene regulatory networks in target animal species. For decades, scientific interest in the genetic mechanisms orchestrating embryonic development has been increasing as common features shared by evolutionarily distant phyla have been clarified. In 2011, a study across eumetazoan species showed for the first time the existence of a highly conserved non-coding element controlling the SoxB2 gene, which is involved in the early specification of the nervous system. This discovery raised several questions about SoxB2 function and regulation in deuterostomes from an evolutionary point of view. Owing to its relevant phylogenetic position within deuterostomes, the sea urchin Strongylocentrotus purpuratus represents an advantageous animal model in the field of evolutionary developmental biology. Herein, we present a comprehensive study of SoxB2 functions in sea urchins, in particular its expression pattern across a wide range of developmental stages and its co-localization with other neurogenic markers, such as SoxB1, SoxC and Elav. Moreover, this work provides a detailed description of the phenotype of sea urchin SoxB2 knockdown embryos, confirming its key function in neurogenesis and revealing, for the first time, its additional roles in oral and aboral ectoderm cilia and skeletal rod morphology. We conclude that SoxB2 in sea urchins has a neurogenic function; however, this gene could have multiple roles in sea urchin embryogenesis, as its expression extends to non-neurogenic cells. We show that SoxB2 is functionally conserved among deuterostomes and suggest that in S. purpuratus this gene acquired additional functions, being involved in ciliogenesis and skeletal patterning.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gersuk, V.H.; Rose, T.M.; Todaro, G.J.
The acyl-CoA binding protein (ACBP) and the diazepam binding inhibitor (DBI) or endozepine are independent isolates of a single 86-amino-acid, 10-kDa protein. ACBP/DBI is highly conserved between species and has been identified in several diverse organisms, including human, cow, rat, frog, duck, insects, plants, and yeast. Although the genomic locus has not yet been cloned in humans, complementary DNA clones with different 5′ ends have been isolated and characterized. These cDNA clones appear to be encoded by a single gene. However, Southern blot analyses, in situ hybridizations, and somatic cell hybrid chromosomal mapping all suggest that there are multiple ACBP/DBI-related sequences in the genome. To identify potential members of this gene family, degenerate oligonucleotides corresponding to highly conserved regions of ACBP/DBI were used to screen a human genomic DNA library using the polymerase chain reaction. A novel gene, DBIP1, that is closely related to ACBP/DBI but is clearly distinct was identified. DBIP1 bears extensive sequence homology to ACBP/DBI but lacks the introns predicted by rat and duck genomic sequence studies. A 1-base deletion in the coding region results in a frameshift and, along with the absence of introns and the lack of a detectable transcript, suggests that DBIP1 is a pseudogene. ACBP/DBI has previously been mapped to chromosome 2, although this was recently disputed, and a chromosome 6 location was suggested. We show that ACBP/DBI is correctly placed on chromosome 2 and that the gene identified on chromosome 6 is DBIP1. 33 refs., 3 figs., 1 tab.
Li, Shengbin; Li, Bo; Cheng, Cheng; Xiong, Zijun; Liu, Qingbo; Lai, Jianghua; Carey, Hannah V; Zhang, Qiong; Zheng, Haibo; Wei, Shuguang; Zhang, Hongbo; Chang, Liao; Liu, Shiping; Zhang, Shanxin; Yu, Bing; Zeng, Xiaofan; Hou, Yong; Nie, Wenhui; Guo, Youmin; Chen, Teng; Han, Jiuqiang; Wang, Jian; Wang, Jun; Chen, Chen; Liu, Jiankang; Stambrook, Peter J; Xu, Ming; Zhang, Guojie; Gilbert, M Thomas P; Yang, Huanming; Jarvis, Erich D; Yu, Jun; Yan, Jianqun
2014-01-01
Nearly one-quarter of all avian species are either threatened or near threatened. Of these, 73 species are currently being rescued from extinction in wildlife sanctuaries. One of the formerly most critically endangered is the crested ibis, Nipponia nippon. Once widespread across North-East Asia, by 1981 only seven individuals from two breeding pairs remained in the wild. The recovering crested ibis populations thus provide an excellent example for conservation genomics, since every individual bird has been recruited for genomic and demographic studies. Using high-quality genome sequences of multiple crested ibis individuals, its thriving co-habitant, the little egret, Egretta garzetta, and the recently sequenced genomes of 41 other avian species that are under various degrees of survival threat, including the bald eagle, we carry out comparative analyses for genomic signatures of near-extinction events in association with environmental and behavioral attributes of species. We confirm that both loss of genetic diversity and enrichment of deleterious mutations in protein-coding genes contribute to the major genetic defects of the endangered species. We further find that inbreeding and loss-of-function genes in the crested ibis may together constitute genetic susceptibility to other factors, including long-term climate change, over-hunting, and agrochemical overuse. We also establish a genome-wide DNA identification platform for molecular breeding and conservation practices, to facilitate sustainable recovery of endangered species. These findings demonstrate common genomic signatures of population decline across avian species and pave the way for further efforts in saving endangered species and advancing conservation genomics.
Ahi, Ehsan Pashay; Kapralova, Kalina Hristova; Pálsson, Arnar; Maier, Valerie Helene; Gudbrandsson, Jóhannes; Snorrason, Sigurdur S; Jónsson, Zophonías O; Franzdóttir, Sigrídur Rut
2014-01-01
Understanding the molecular basis of craniofacial variation can provide insights into key developmental mechanisms of adaptive changes and their role in trophic divergence and speciation. Arctic charr (Salvelinus alpinus) is a polymorphic fish species, and, in Lake Thingvallavatn in Iceland, four sympatric morphs have evolved distinct craniofacial structures. We conducted a gene expression study on candidates from a conserved gene coexpression network, focusing on the development of craniofacial elements in embryos of two contrasting Arctic charr morphotypes (benthic and limnetic). Four Arctic charr morphs were studied: one limnetic and two benthic morphs from Lake Thingvallavatn and a limnetic reference aquaculture morph. The presence of morphological differences at developmental stages before the onset of feeding was verified by morphometric analysis. Following up on our previous findings that Mmp2 and Sparc were differentially expressed between morphotypes, we identified a network of genes with conserved coexpression across diverse vertebrate species. A comparative expression study of candidates from this network in developing heads of the four Arctic charr morphs verified the coexpression relationship of these genes and revealed distinct transcriptional dynamics strongly correlated with contrasting craniofacial morphologies (benthic versus limnetic). A literature review and Gene Ontology analysis indicated that a significant proportion of the network genes play a role in extracellular matrix organization and skeletogenesis, and motif enrichment analysis of conserved noncoding regions of network candidates predicted a handful of transcription factors, including Ap1 and Ets2, as potential regulators of the gene network. The expression of Ets2 itself was also found to associate with network gene expression. Genes linked to glucocorticoid signalling were also studied, as both Mmp2 and Sparc are responsive to this pathway. 
Among those, several transcriptional targets and upstream regulators showed differential expression between the contrasting morphotypes. Interestingly, although selected network genes showed overlapping expression patterns in situ and no morph differences, Timp2 expression patterns differed between morphs. Our comparative study of transcriptional dynamics in divergent craniofacial morphologies of Arctic charr revealed a conserved network of coexpressed genes sharing functional roles in structural morphogenesis. We also implicate transcriptional regulators of the network as targets for future functional studies.
Vu Manh, Thien-Phong; Elhmouzi-Younes, Jamila; Urien, Céline; Ruscanu, Suzana; Jouneau, Luc; Bourge, Mickaël; Moroldo, Marco; Foucras, Gilles; Salmon, Henri; Marty, Hélène; Quéré, Pascale; Bertho, Nicolas; Boudinot, Pierre; Dalod, Marc; Schwartz-Cornil, Isabelle
2015-01-01
Mononuclear phagocytes are organized in a complex system of ontogenetically and functionally distinct subsets, which has been best described in mouse and to some extent in human. Identification of homologous mononuclear phagocyte subsets in other vertebrate species of biomedical, economic, and environmental interest is needed to improve our knowledge of physiologic and physio-pathologic processes, and to design intervention strategies against a variety of diseases, including zoonotic infections. We developed a streamlined approach combining refined cell sorting and integrated comparative transcriptomics analyses, which revealed conservation of the mononuclear phagocyte organization across human, mouse, sheep, pigs and, in some respects, chicken. This strategy should help democratize the use of omics analyses for the identification and study of cell types across tissues and species. Moreover, we identified conserved gene signatures that enable robust identification and universal definition of these cell types. We identified new evolutionarily conserved gene candidates and gene interaction networks for the molecular regulation of the development or functions of these cell types, as well as conserved surface candidates for refined subset phenotyping across species. A phylogenetic analysis revealed that orthologous genes of the conserved signatures exist in teleost fishes and apparently not in lamprey. PMID:26150816
The crucial contribution of veterinarians to conservation biology.
Reading, Richard P; Kenny, David E; Fitzgerald, Kevin T
2013-11-01
Conservation biology is a relatively new (it began in the 1980s), value-based discipline predicated on the belief that biological diversity, from genes to populations to species to communities to ecosystems, is good and extinction is bad. Conservation biology grew from the recognition that the Earth has entered its sixth great extinction event, one that differs from previous great extinctions in that a single species, Homo sapiens, has caused this biodiversity crisis. A diverse, interacting set of variables drives current extinctions. As such, to succeed, conservation efforts usually require broad-based, interdisciplinary approaches. Conservationists increasingly recognize the importance of contributions by veterinary science, among many other disciplines, to collaborative efforts aimed at stemming the loss of biodiversity. We argue that, to improve success rates, many wildlife conservation programs must incorporate veterinarians as part of an interdisciplinary team to assess and address problems. Ideally, veterinarians who participate in conservation would receive specialized training and be willing to work as partners within a larger team of experts who effectively integrate their work rather than work independently (i.e., work as interdisciplinary, as opposed to multidisciplinary, teams, respectively). In our opinion, the most successful and productive projects involve interdisciplinary teams that include both biological and nonbiological specialists. Some researchers hold multiple degrees in biology and veterinary medicine or the biological and social sciences. These experts can often offer unique insight. 
We see at least 3 major areas in which veterinarians can immediately offer great assistance to conservation efforts: (1) participation in wildlife capture and immobilization, (2) leadership or assistance in addressing wildlife health issues, and (3) leadership or assistance in addressing wildlife disease issues, including using wildlife as sentinels to identify new and emerging diseases or epidemics of old diseases. We cover each of these main topics in detail. © 2013 Published by Elsevier Inc.
Draft Genome of the Scarab Beetle Oryctes borbonicus on La Réunion Island
Meyer, Jan M.; Markov, Gabriel V.; Baskaran, Praveen; Herrmann, Matthias; Sommer, Ralf J.; Rödelsperger, Christian
2016-01-01
Beetles represent the largest insect order and display extreme morphological, ecological and behavioral diversity, which makes them ideal models for evolutionary studies. Here, we present the draft genome of the scarab beetle Oryctes borbonicus, which has a more basal phylogenetic position than the two previously sequenced pest species Tribolium castaneum and Dendroctonus ponderosae, providing the potential for sequence polarization. Oryctes borbonicus is endemic to La Réunion, an island located in the Indian Ocean, and is the host of the nematode Pristionchus pacificus, a well-established model organism for integrative evolutionary biology. At 518 Mb, the O. borbonicus genome is substantially larger and encodes more genes than those of T. castaneum and D. ponderosae. We found that only 25% of the predicted genes of O. borbonicus are conserved as single-copy genes across the nine investigated insect genomes, suggesting substantial gene turnover within insects. Even within beetles, up to 21% of genes are restricted to only one species, whereas most other genes have undergone lineage-specific duplications and losses. We illustrate lineage-specific duplications using detailed phylogenetic analysis of two gene families. This study serves as a reference point for insect/coleopteran genomics, although its original motivation was to find evidence for potential horizontal gene transfer (HGT) between O. borbonicus and P. pacificus. The latter was previously shown to be the recipient of multiple horizontally transferred genes, including some genes from insect donors. However, our study failed to provide any clear evidence for additional HGTs between the two species. PMID:27289092
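A statistic such as "the share of a species' predicted genes conserved as single-copy genes across all compared genomes" can be computed from an orthogroup table along the following lines. This is only a sketch under stated assumptions: the function name `single_copy_fraction`, the nested-dictionary input layout, and the toy data are hypothetical and do not reflect the authors' actual pipeline or data; genes without any ortholog are assumed to appear as one-member orthogroups.

```python
from typing import Dict, List


def single_copy_fraction(
    orthogroups: Dict[str, Dict[str, List[str]]],
    focal: str,
    species: List[str],
) -> float:
    """Fraction of the focal species' genes that lie in orthogroups
    containing exactly one gene in every species of `species`."""
    if focal not in species:
        raise ValueError("focal species must be among the compared species")
    # All focal genes, counted over every orthogroup (including singletons).
    total = sum(len(members.get(focal, [])) for members in orthogroups.values())
    if total == 0:
        return 0.0
    # Each strictly single-copy orthogroup contributes one focal gene.
    single = sum(
        1
        for members in orthogroups.values()
        if all(len(members.get(sp, [])) == 1 for sp in species)
    )
    return single / total


# Toy, made-up orthogroups for three hypothetical species A, B and C.
toy = {
    "OG1": {"A": ["a1"], "B": ["b1"], "C": ["c1"]},        # single copy everywhere
    "OG2": {"A": ["a2", "a3"], "B": ["b2"], "C": ["c2"]},  # duplicated in A
    "OG3": {"A": ["a4"]},                                  # restricted to A
}
fraction = single_copy_fraction(toy, "A", ["A", "B", "C"])
```

In the toy data, species A has four genes and only one of them sits in a strictly single-copy orthogroup, so the fraction is 0.25.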