SeMPI: a genome-based secondary metabolite prediction and identification web server.
Zierep, Paul F; Padilla, Natàlia; Yonchev, Dimitar G; Telukunta, Kiran K; Klementz, Dennis; Günther, Stefan
2017-07-03
The secondary metabolism of bacteria, fungi and plants yields a vast number of bioactive substances. The constantly increasing amount of published genomic data provides the opportunity for an efficient identification of gene clusters by genome mining. Conversely, for many natural products with resolved structures, the encoding gene clusters have not been identified yet. Even though genome mining tools have become significantly more efficient in the identification of biosynthetic gene clusters, structural elucidation of the actual secondary metabolite is still challenging, especially due to as yet unpredictable post-modifications. Here, we introduce SeMPI, a web server providing a prediction and identification pipeline for natural products synthesized by modular type I polyketide synthases (PKS). In order to limit the possible structures of PKS products and to include putative tailoring reactions, a structural comparison with annotated natural products was introduced. Furthermore, a benchmark was designed based on 40 gene clusters with annotated PKS products. The web server of the pipeline (SeMPI) is freely available at: http://www.pharmaceutical-bioinformatics.de/sempi. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Causal gene identification using combinatorial V-structure search.
Cai, Ruichu; Zhang, Zhenjie; Hao, Zhifeng
2013-07-01
With the advances of biomedical techniques in the last decade, the costs of human genomic sequencing and genomic activity monitoring are coming down rapidly. To support the huge genome-based business in the near future, researchers are eager to find killer applications based on human genome information. Causal gene identification is one of the most promising applications, which may help potential patients estimate the risk of certain genetic diseases and locate the target gene for further genetic therapy. Unfortunately, existing pattern recognition techniques, such as Bayesian networks, cannot be directly applied to find the accurate causal relationship between genes and diseases. This is mainly due to the insufficient number of samples and the extremely high dimensionality of the gene space. In this paper, we present the first practical solution to causal gene identification, utilizing a new combinatorial formulation over V-Structures commonly used in conventional Bayesian networks, by exploring the combinations of significant V-Structures. We prove the NP-hardness of the combinatorial search problem under a general setting of the significance measure on V-Structures, and present a greedy algorithm to find sub-optimal results. Extensive experiments show that our proposal is both scalable and effective, particularly with interesting findings on the causal genes over real human genome data. Copyright © 2013 Elsevier Ltd. All rights reserved.
In silico identification and analysis of phytoene synthase genes in plants.
Han, Y; Zheng, Q S; Wei, Y P; Chen, J; Liu, R; Wan, H J
2015-08-14
In this study, we examined phytoene synthetase (PSY), the first key limiting enzyme in the synthesis of carotenoids and catalyzing the formation of geranylgeranyl pyrophosphate in terpenoid biosynthesis. We used known amino acid sequences of the PSY gene in tomato plants to conduct a genome-wide search and identify putative candidates in 34 sequenced plants. A total of 101 homologous genes were identified. Phylogenetic analysis revealed that PSY evolved independently in algae as well as monocotyledonous and dicotyledonous plants. Our results showed that the amino acid structures exhibited 5 motifs (motifs 1 to 5) in algae and those in higher plants were highly conserved. The PSY gene structures showed that the number of introns in algae varied widely, while the number of introns in higher plants was 4 to 5. Identification of PSY genes in plants and the analysis of the gene structure may provide a theoretical basis for studying evolutionary relationships in future analyses.
Data on the genome-wide identification of CNL R-genes in Setaria italica (L.) P. Beauv.
Andersen, Ethan J; Nepal, Madhav P
2017-08-01
We report data associated with the identification of 242 disease resistance genes (R-genes) in the genome of Setaria italica as presented in "Genetic diversity of disease resistance genes in foxtail millet (Setaria italica L.)" (Andersen and Nepal, 2017) [1]. Our data describe the structure and evolution of the Coiled-coil, Nucleotide-binding site, Leucine-rich repeat (CNL) R-genes in foxtail millet. The CNL genes were identified through rigorous extraction and analysis of recently available plant genome sequences using cutting-edge analytical software. Data visualization includes gene structure diagrams, chromosomal syntenic maps, a chromosomal density plot, and a maximum-likelihood phylogenetic tree comparing Sorghum bicolor, Panicum virgatum, Setaria italica, and Arabidopsis thaliana. Compilation of InterProScan annotations, Gene Ontology (GO) annotations, and Basic Local Alignment Search Tool (BLAST) results for the 242 R-genes identified in the foxtail millet genome are also included in tabular format.
The P450alk gene, which is inducible by the assimilation of alkane in Candida tropicalis, was sequenced and characterized. Structural features described in promoter and terminator regions of Saccharomyces yeast genes are present in the P450alk gene and some particular structures ...
Zeng, Lingfeng; Deng, Rong; Guo, Ziping; Yang, Shushen; Deng, Xiping
2016-03-16
Glyceraldehyde-3-phosphate dehydrogenase (GAPDH) is a central enzyme in glycolysis. We performed genome-wide identification of GAPDH genes in wheat and analyzed their structural characteristics and expression patterns under abiotic stress. A total of 22 GAPDH genes were identified in wheat cv. Chinese Spring; the phylogenetic and structure analysis showed that these GAPDH genes could be divided into four distinct subfamilies. The expression profiles of GAPDH genes showed tissue specificity across plant development stages. The qRT-PCR results revealed that wheat GAPDHs were involved in several abiotic stress responses. Wheat carried 22 GAPDH genes, representing four types of plant GAPDHs (gapA/B, gapC, gapCp and gapN). Whole genome duplication and segmental duplication might account for the expansion of wheat GAPDHs. Expression analysis implied that GAPDHs play roles in plant abiotic stress tolerance.
Zhao, Yan; Gentekaki, Eleni; Yi, Zhenzhen; Lin, Xiaofeng
2013-01-01
Background: The mitochondrial cytochrome c oxidase subunit I (COI) gene is being used increasingly for evaluating inter- and intra-specific genetic diversity of ciliated protists. However, very few studies focus on assessing genetic divergence of the COI gene within individuals and how its presence might affect species identification and population structure analyses. Methodology/Principal findings: We evaluated the genetic variation of the COI gene in five Paramecium species for a total of 147 clones derived from 21 individuals and 7 populations. We identified a total of 90 haplotypes with several individuals carrying more than one haplotype. Parsimony network and phylogenetic tree analyses revealed that intra-individual diversity had no effect in species identification and only a minor effect on population structure. Conclusions: Our results suggest that the COI gene is a suitable marker for resolving inter- and intra-specific relationships of Paramecium spp. PMID:24204730
Bie, Luyao; Wu, Hao; Wang, Xin-Hua; Wang, Mingyu; Xu, Hai
2017-08-01
Integrative and conjugative elements (ICEs) are self-transmissible chromosomal mobile elements that play significant roles in the dissemination of antimicrobial resistance genes. Identification of the structures and functions of ICEs, particularly those in pathogens, improves understanding of the dissemination of antimicrobial resistance. This study identified new members of the sulfamethoxazole-trimethoprim (SXT)/R391 family of ICEs that could confer multi-drug resistance in the opportunistic pathogen Proteus mirabilis, characterized their genetic structures, and explored their evolutionary connection with other members of this family of ICEs. Three new members of the SXT/R391 family of ICEs were detected in six of 77 P. mirabilis strains isolated in China: ICEPmiChn2 (one strain), ICEPmiChn3 (one strain) and ICEPmiChn4 (three strains). All three new ICEs harbour antimicrobial resistance genes from diverse origins, suggesting their capability in acquiring foreign genes and serving as important carriers for antimicrobial resistance genes. Structural analysis showed that ICEPmiChn3 is a particularly interesting and unique ICE that has lost core genes involved in conjugation, and could not transfer to other cells via conjugation. This finding confirmed the key roles of these missing genes in conjugation. Further phylogenetic analysis suggested that ICEs in geographically close strains are also connected evolutionarily, and ICEPmiChn3 lost its conjugation cassette from a former mobile ICE. The identification and characterization of the three new members of the SXT/R391 family of ICEs in this work leads to suggestions of core ICE genes essential for conjugation, and extends understanding on the structures of ICEs, evolutionary relationships between ICEs, and the antimicrobial resistance mechanisms of P. mirabilis. Copyright © 2017 Elsevier B.V. and International Society of Chemotherapy. All rights reserved.
Identification of Enzyme Genes Using Chemical Structure Alignments of Substrate-Product Pairs.
Moriya, Yuki; Yamada, Takuji; Okuda, Shujiro; Nakagawa, Zenichi; Kotera, Masaaki; Tokimatsu, Toshiaki; Kanehisa, Minoru; Goto, Susumu
2016-03-28
Although there are several databases that contain data on many metabolites and reactions in biochemical pathways, there is still a big gap in the numbers between experimentally identified enzymes and metabolites. It is supposed that many catalytic enzyme genes are still unknown. Although there are previous studies that estimate the number of candidate enzyme genes, these studies required some additional information aside from the structures of metabolites such as gene expression and order in the genome. In this study, we developed a novel method to identify a candidate enzyme gene of a reaction using the chemical structures of the substrate-product pair (reactant pair). The proposed method is based on a search for similar reactant pairs in a reference database and offers ortholog groups that possibly mediate the given reaction. We applied the proposed method to two experimentally validated reactions. As a result, we confirmed that the histidine transaminase was correctly identified. Although our method could not directly identify the asparagine oxo-acid transaminase, we successfully found the paralog gene most similar to the correct enzyme gene. We also applied our method to infer candidate enzyme genes in the mesaconate pathway. The advantage of our method lies in the prediction of possible genes for orphan enzyme reactions where any associated gene sequences are not determined yet. We believe that this approach will facilitate experimental identification of genes for orphan enzymes.
Prom-On, Santitham; Chanthaphan, Atthawut; Chan, Jonathan Hoyin; Meechai, Asawin
2011-02-01
Relationships among gene expression levels may be associated with the mechanisms of the disease. While identifying a direct association such as a difference in expression levels between case and control groups links genes to disease mechanisms, uncovering an indirect association in the form of a network structure may help reveal the underlying functional module associated with the disease under scrutiny. This paper presents a method to improve the biological relevance in functional module identification from the gene expression microarray data by enhancing the structure of a weighted gene co-expression network using minimum spanning tree. The enhanced network, which is called a backbone network, contains only the essential structural information to represent the gene co-expression network. The entire backbone network is decoupled into a number of coherent sub-networks, and then the functional modules are reconstructed from these sub-networks to ensure minimum redundancy. The method was tested with a simulated gene expression dataset and case-control expression datasets of autism spectrum disorder and colorectal cancer studies. The results indicate that the proposed method can accurately identify clusters in the simulated dataset, and the functional modules of the backbone network are more biologically relevant than those obtained from the original approach.
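The backbone idea in the abstract above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that edge weights are absolute Pearson correlations, that the distance between two genes is 1 - |r|, and that the minimum spanning tree is built with Kruskal's algorithm. The function names (`pearson`, `backbone_mst`) and the toy expression values are hypothetical, and the later steps (decoupling into sub-networks, module reconstruction) are not shown.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length, non-constant profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def backbone_mst(expr):
    """Return MST edges (gene_a, gene_b, distance) of the co-expression graph.

    expr maps a gene name to its expression profile. The distance
    1 - |r| is an assumption made for this sketch.
    """
    genes = sorted(expr)
    edges = sorted(
        (1.0 - abs(pearson(expr[a], expr[b])), a, b)
        for a, b in combinations(genes, 2)
    )
    parent = {g: g for g in genes}  # union-find forest for Kruskal

    def find(g):
        while parent[g] != g:
            parent[g] = parent[parent[g]]  # path halving
            g = parent[g]
        return g

    tree = []
    for dist, a, b in edges:
        ra, rb = find(a), find(b)
        if ra != rb:  # keep the edge only if it joins two components
            parent[ra] = rb
            tree.append((a, b, dist))
    return tree


# Toy data (hypothetical): four genes measured in four samples.
expr = {
    "g1": [1.0, 2.0, 3.0, 4.0],
    "g2": [2.0, 4.1, 6.2, 7.9],
    "g3": [4.0, 3.0, 2.2, 1.0],
    "g4": [1.0, 3.0, 1.0, 3.0],
}
backbone = backbone_mst(expr)
for a, b, dist in backbone:
    print(f"{a} -- {b}  distance={dist:.3f}")
```

A spanning tree over n genes keeps n - 1 edges, which is why the backbone retains only a small, connected subset of the full co-expression network.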
A network-based method for the identification of putative genes related to infertility.
Wang, ShaoPeng; Huang, GuoHua; Hu, Qinghua; Zou, Quan
2016-11-01
Infertility has become one of the major health problems worldwide, with its incidence having risen markedly in recent decades. There is an urgent need to investigate the pathological mechanisms behind infertility and to design effective treatments. However, this is made difficult by the fact that various biological factors have been identified to be related to infertility, including genetic factors. A network-based method was established to identify new genes potentially related to infertility. A network constructed using human protein-protein interactions based on previously validated infertility-related genes enabled the identification of some novel candidate genes. These genes were then filtered by a permutation test and their functional and structural associations with infertility-related genes. Our method identified 23 novel genes, which have strong functional and structural associations with previously validated infertility-related genes. Substantial evidence indicates that the identified genes are strongly related to dysfunction of the four main biological processes of fertility: reproductive development and physiology, gametogenesis, meiosis and recombination, and hormone regulation. The newly discovered genes may provide new directions for investigating infertility. This article is part of a Special Issue entitled "System Genetics" Guest Editor: Dr. Yudong Cai and Dr. Tao Huang. Copyright © 2016 Elsevier B.V. All rights reserved.
Wu, Cen; Jiang, Yu; Ren, Jie; Cui, Yuehua; Ma, Shuangge
2018-02-10
Identification of gene-environment (G × E) interactions associated with disease phenotypes has posed a great challenge in high-throughput cancer studies. The existing marginal identification methods have suffered from not being able to accommodate the joint effects of a large number of genetic variants, while some of the joint-effect methods have been limited by failing to respect the "main effects, interactions" hierarchy, by ignoring data contamination, and by using inefficient selection techniques under complex structural sparsity. In this article, we develop an effective penalization approach to identify important G × E interactions and main effects, which can account for the hierarchical structures of the 2 types of effects. Possible data contamination is accommodated by adopting the least absolute deviation loss function. The advantage of the proposed approach over the alternatives is convincingly demonstrated in both simulation and a case study on lung cancer prognosis with gene expression measurements and clinical covariates under the accelerated failure time model. Copyright © 2017 John Wiley & Sons, Ltd.
Homogeneous versus heterogeneous probes for microbial ecological microarrays.
Bae, Jin-Woo; Park, Yong-Ha
2006-07-01
Microbial ecological microarrays have been developed for investigating the composition and functions of microorganism communities in environmental niches. These arrays include microbial identification microarrays, which use oligonucleotides, gene fragments or microbial genomes as probes. In this article, the advantages and disadvantages of each type of probe are reviewed. Oligonucleotide probes are currently useful for probing uncultivated bacteria that are not amenable to gene fragment probing, whereas the functional gene fragments amplified randomly from microbial genomes require phylogenetic and hierarchical categorization before use as microbial identification probes, despite their high resolution for both specificity and sensitivity. Until more bacteria are sequenced and gene fragment probes are thoroughly validated, heterogeneous bacterial genome probes will provide a simple, sensitive and quantitative tool for exploring the ecosystem structure.
Singh, Amarjeet; Kanwar, Poonam; Pandey, Amita; Tyagi, Akhilesh K.; Sopory, Sudhir K.; Kapoor, Sanjay; Pandey, Girdhar K.
2013-01-01
Background: Phospholipase C (PLC) is one of the major lipid hydrolysing enzymes, implicated in lipid mediated signaling. PLCs have been found to play a significant role in abiotic stress triggered signaling and developmental processes in various plant species. Genome wide identification and expression analysis have been carried out for this gene family in Arabidopsis, yet not much has been accomplished in crop plant rice. Methodology/Principal Findings: An exhaustive in-silico exploration of rice genome using various online databases and tools resulted in the identification of nine PLC encoding genes. Based on sequence, motif and phylogenetic analysis rice PLC gene family could be divided into phosphatidylinositol-specific PLCs (PI-PLCs) and phosphatidylcholine-PLCs (PC-PLC or NPC) classes with four and five members, respectively. A comparative analysis revealed that PLCs are conserved in Arabidopsis (dicots) and rice (monocot) at gene structure and protein level but they might have evolved through a separate evolutionary path. Transcript profiling using gene chip microarray and quantitative RT-PCR showed that most of the PLC members expressed significantly and differentially under abiotic stresses (salt, cold and drought) and during various developmental stages with condition/stage specific and overlapping expression. This finding suggested an important role of different rice PLC members in abiotic stress triggered signaling and plant development, which was also supported by the presence of relevant cis-regulatory elements in their promoters. Sub-cellular localization of few selected PLC members in Nicotiana benthamiana and onion epidermal cells has provided a clue about their site of action and functional behaviour. Conclusion/Significance: The genome wide identification, structural and expression analysis and knowledge of sub-cellular localization of PLC gene family envisage the functional characterization of these genes in crop plants in near future. PMID:23638098
CORECLUST: identification of the conserved CRM grammar together with prediction of gene regulation.
Nikulova, Anna A; Favorov, Alexander V; Sutormin, Roman A; Makeev, Vsevolod J; Mironov, Andrey A
2012-07-01
Identification of transcriptional regulatory regions and tracing their internal organization are important for understanding the eukaryotic cell machinery. Cis-regulatory modules (CRMs) of higher eukaryotes are believed to possess a regulatory 'grammar', or preferred arrangement of binding sites, that is crucial for proper regulation and thus tends to be evolutionarily conserved. Here, we present a method CORECLUST (COnservative REgulatory CLUster STructure) that predicts CRMs based on a set of positional weight matrices. Given regulatory regions of orthologous and/or co-regulated genes, CORECLUST constructs a CRM model by revealing the conserved rules that describe the relative location of binding sites. The constructed model may be consequently used for the genome-wide prediction of similar CRMs, and thus detection of co-regulated genes, and for the investigation of the regulatory grammar of the system. Compared with related methods, CORECLUST shows better performance at identification of CRMs conferring muscle-specific gene expression in vertebrates and early-developmental CRMs in Drosophila.
Genomic platform for efficient identification of fungal secondary metabolism genes
USDA-ARS's Scientific Manuscript database
Fungal secondary metabolites (SMs) are structurally diverse natural compounds, which are thought to have great potential not only for medical industry but also for chemical and environmental industries. Since expansion of sequencing microbial genomes in 1990’s, it has been known that SM genes are ex...
Verma, Jitendra Kumar; Wardhan, Vijay; Singh, Deepali; Chakraborty, Subhra; Chakraborty, Niranjan
2018-03-28
Architectural proteins play key roles in genome construction and regulate the expression of many genes, albeit the modulation of genome plasticity by these proteins is largely unknown. A critical screening of the architectural proteins in five crop species, viz., Oryza sativa, Zea mays, Sorghum bicolor, Cicer arietinum, and Vitis vinifera, and in the model plant Arabidopsis thaliana along with evolutionary relevant species such as Chlamydomonas reinhardtii, Physcomitrella patens, and Amborella trichopoda, revealed 9, 20, 10, 7, 7, 6, 1, 4, and 4 Alba (acetylation lowers binding affinity) genes, respectively. A phylogenetic analysis of the genes and of their counterparts in other plant species indicated evolutionary conservation and diversification. In each group, the structural components of the genes and motifs showed significant conservation. The chromosomal location of the Alba genes of rice (OsAlba) showed an unequal distribution on 8 of its 12 chromosomes. The expression profiles of the OsAlba genes indicated a distinct tissue-specific expression in the seedling, vegetative, and reproductive stages. The quantitative real-time PCR (qRT-PCR) analysis of the OsAlba genes confirmed their stress-inducible expression under multivariate environmental conditions and phytohormone treatments. The evaluation of the regulatory elements in 68 Alba genes from the 9 species studied led to the identification of conserved motifs and overlapping microRNA (miRNA) target sites, suggesting the conservation of their function in related proteins and a divergence in their biological roles across species. The 3D structure and the prediction of putative ligands and their binding sites for OsAlba proteins offered a key insight into the structure-function relationship. These results provide a comprehensive overview of the subtle genetic diversification of the OsAlba genes, which will help in elucidating their functional role in plants.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ranjan, Priya; Yin, Tongming; Zhang, Xinye
2009-11-01
Quantitative trait locus (QTL) studies are an integral part of plant research and are used to characterize the genetic basis of phenotypic variation observed in structured populations and inform marker-assisted breeding efforts. These QTL intervals can span large physical regions on a chromosome comprising hundreds of genes, thereby hampering candidate gene identification. Genome history, evolution, and expression evidence can be used to narrow the genes in the interval to a smaller list that is manageable for detailed downstream functional genomics characterization. Our primary motivation for the present study was to address the need for a research methodology that identifies candidate genes within a broad QTL interval. Here we present a bioinformatics-based approach for subdividing candidate genes within QTL intervals into alternate groups of high probability candidates. Application of this approach in the context of studying cell wall traits, specifically lignin content and S/G ratios of stem and root in Populus plants, resulted in manageable sets of genes of both known and putative cell wall biosynthetic function. These results provide a roadmap for future experimental work leading to identification of new genes controlling cell wall recalcitrance and, ultimately, in the utility of plant biomass as an energy feedstock.
RNA-Seq Based Transcriptional Map of Bovine Respiratory Disease Pathogen “Histophilus somni 2336”
Kumar, Ranjit; Lawrence, Mark L.; Watt, James; Cooksey, Amanda M.; Burgess, Shane C.; Nanduri, Bindu
2012-01-01
Genome structural annotation, i.e., identification and demarcation of the boundaries for all the functional elements in a genome (e.g., genes, non-coding RNAs, proteins and regulatory elements), is a prerequisite for systems level analysis. Current genome annotation programs do not identify all of the functional elements of the genome, especially small non-coding RNAs (sRNAs). Whole genome transcriptome analysis is a complementary method to identify “novel” genes, small RNAs, regulatory regions, and operon structures, thus improving the structural annotation in bacteria. In particular, the identification of non-coding RNAs has revealed their widespread occurrence and functional importance in gene regulation, stress and virulence. However, very little is known about non-coding transcripts in Histophilus somni, one of the causative agents of Bovine Respiratory Disease (BRD) as well as bovine infertility, abortion, septicemia, arthritis, myocarditis, and thrombotic meningoencephalitis. In this study, we report a single nucleotide resolution transcriptome map of H. somni strain 2336 using RNA-Seq method. The RNA-Seq based transcriptome map identified 94 sRNAs in the H. somni genome of which 82 sRNAs were never predicted or reported in earlier studies. We also identified 38 novel potential protein coding open reading frames that were absent in the current genome annotation. The transcriptome map allowed the identification of 278 operon (total 730 genes) structures in the genome. When compared with the genome sequence of a non-virulent strain 129Pt, a disproportionate number of sRNAs (∼30%) were located in genomic region unique to strain 2336 (∼18% of the total genome). This observation suggests that a number of the newly identified sRNAs in strain 2336 may be involved in strain-specific adaptations. PMID:22276113
Singh, Kh Dhanachandra; Karthikeyan, Muthusamy
2014-12-01
The renin-angiotensin-aldosterone system (RAAS) plays a key role in the regulation of blood pressure (BP). Mutations in the genes that encode components of the RAAS have played a significant role in genetic susceptibility to hypertension and have been intensively scrutinized. Identifying such probably causal mutations not only provides insight into the RAAS, but the mutations may also serve as antihypertensive therapeutic targets and diagnostic markers. Determining the biological significance of every SNP experimentally is challenging because the available SNP dataset is huge and contains both functional and neutral SNPs. To explore the functional significance of these genetic mutations (SNPs), we adopted a combined sequence-based and sequence-structure-based SNP analysis algorithm. Of the 3864 SNPs reported in dbSNP, 108 were missense SNPs in the coding region and the remainder were in the non-coding region. In this study, we report a coding-region SNP as deleterious only when three or more tools predict it to be deleterious and it shows a high RMSD from the native structure. Based on these analyses, two SNPs of the REN gene, eight SNPs of the AGT gene, three SNPs of the ACE gene, two SNPs of the AT1R gene, three SNPs of the CYP11B2 gene and three SNPs of the CMA1 gene in the coding region were found to be deleterious. This type of study should help reduce the cost and time needed to identify potential SNPs and to select candidates from the SNP pool for experimental study.
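The selection rule stated in the abstract above (three or more tools agree, and the RMSD from the native structure is high) can be sketched as a simple consensus filter. This is only an illustration under stated assumptions: the abstract names neither the prediction tools nor an RMSD threshold, so the tool labels, the `rmsd_cutoff` value of 2.0, the function name `is_deleterious` and the example records are all hypothetical.

```python
def is_deleterious(tool_calls, rmsd, min_tools=3, rmsd_cutoff=2.0):
    """Consensus rule: enough tools agree AND the structural deviation is large.

    tool_calls maps a tool label to True when that tool predicts the SNP
    to be deleterious. rmsd_cutoff is an assumed threshold, not a value
    taken from the study.
    """
    votes = sum(1 for called in tool_calls.values() if called)
    return votes >= min_tools and rmsd >= rmsd_cutoff


# Hypothetical records: (per-tool predictions, RMSD from the native structure).
snps = {
    "snp_1": ({"tool_a": True, "tool_b": True, "tool_c": True, "tool_d": False}, 2.6),
    "snp_2": ({"tool_a": True, "tool_b": True, "tool_c": False, "tool_d": False}, 3.1),
    "snp_3": ({"tool_a": True, "tool_b": True, "tool_c": True, "tool_d": True}, 0.4),
}

flagged = [name for name, (calls, rmsd) in snps.items() if is_deleterious(calls, rmsd)]
print(flagged)
```

Requiring both conditions is conservative: `snp_2` fails on tool agreement and `snp_3` fails on structural deviation, so only `snp_1` is flagged in this toy example.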
Alagarasan, Ganesh; Dubey, Mahima; Aswathy, Kumar S; Chandel, Girish
2017-01-01
Genes in the ZIP family encode transcripts to store and transport bivalent metal micronutrients, particularly iron (Fe) and/or zinc (Zn). These transcripts are important for a variety of functions involved in the developmental and physiological processes in many plant species, including most, if not all, Poaceae plant species and the model species Arabidopsis. Here, we present the report of a genome wide investigation of orthologous ZIP genes in Setaria italica and the identification of 7 single copy genes. RT-PCR shows 4 of them could be used to increase the bio-availability of zinc and iron content in grains. Of 36 ZIP members, 25 genes have traces of signal peptide based sub-cellular localization, as compared to those of plant species studied previously, yet translocation of ions remains unclear. In silico analysis of gene structure and protein nature suggests that these two were preeminent in shaping the functional diversity of the ZIP gene family in S. italica. NAC, bZIP and bHLH are the predominant Fe and Zn responsive transcription factors present in SiZIP genes. Together, our results provide new insights into the signal peptide based/independent iron and zinc translocation in the plant system and allowed identification of ZIP genes that may be involved in zinc and iron absorption from the soil and in their transport to the cereal grain, underlying high micronutrient accumulation.
Integrative Annotation of 21,037 Human Genes Validated by Full-Length cDNA Clones
Imanishi, Tadashi; Itoh, Takeshi; Suzuki, Yutaka; O'Donovan, Claire; Fukuchi, Satoshi; Koyanagi, Kanako O; Barrero, Roberto A; Tamura, Takuro; Yamaguchi-Kabata, Yumi; Tanino, Motohiko; Yura, Kei; Miyazaki, Satoru; Ikeo, Kazuho; Homma, Keiichi; Kasprzyk, Arek; Nishikawa, Tetsuo; Hirakawa, Mika; Thierry-Mieg, Jean; Thierry-Mieg, Danielle; Ashurst, Jennifer; Jia, Libin; Nakao, Mitsuteru; Thomas, Michael A; Mulder, Nicola; Karavidopoulou, Youla; Jin, Lihua; Kim, Sangsoo; Yasuda, Tomohiro; Lenhard, Boris; Eveno, Eric; Suzuki, Yoshiyuki; Yamasaki, Chisato; Takeda, Jun-ichi; Gough, Craig; Hilton, Phillip; Fujii, Yasuyuki; Sakai, Hiroaki; Tanaka, Susumu; Amid, Clara; Bellgard, Matthew; Bonaldo, Maria de Fatima; Bono, Hidemasa; Bromberg, Susan K; Brookes, Anthony J; Bruford, Elspeth; Carninci, Piero; Chelala, Claude; Couillault, Christine; de Souza, Sandro J.; Debily, Marie-Anne; Devignes, Marie-Dominique; Dubchak, Inna; Endo, Toshinori; Estreicher, Anne; Eyras, Eduardo; Fukami-Kobayashi, Kaoru; R. Gopinath, Gopal; Graudens, Esther; Hahn, Yoonsoo; Han, Michael; Han, Ze-Guang; Hanada, Kousuke; Hanaoka, Hideki; Harada, Erimi; Hashimoto, Katsuyuki; Hinz, Ursula; Hirai, Momoki; Hishiki, Teruyoshi; Hopkinson, Ian; Imbeaud, Sandrine; Inoko, Hidetoshi; Kanapin, Alexander; Kaneko, Yayoi; Kasukawa, Takeya; Kelso, Janet; Kersey, Paul; Kikuno, Reiko; Kimura, Kouichi; Korn, Bernhard; Kuryshev, Vladimir; Makalowska, Izabela; Makino, Takashi; Mano, Shuhei; Mariage-Samson, Regine; Mashima, Jun; Matsuda, Hideo; Mewes, Hans-Werner; Minoshima, Shinsei; Nagai, Keiichi; Nagasaki, Hideki; Nagata, Naoki; Nigam, Rajni; Ogasawara, Osamu; Ohara, Osamu; Ohtsubo, Masafumi; Okada, Norihiro; Okido, Toshihisa; Oota, Satoshi; Ota, Motonori; Ota, Toshio; Otsuki, Tetsuji; Piatier-Tonneau, Dominique; Poustka, Annemarie; Ren, Shuang-Xi; Saitou, Naruya; Sakai, Katsunaga; Sakamoto, Shigetaka; Sakate, Ryuichi; Schupp, Ingo; Servant, Florence; Sherry, Stephen; Shiba, Rie; Shimizu, Nobuyoshi; Shimoyama, Mary; Simpson, Andrew J; Soares, Bento; Steward, Charles; Suwa, Makiko; Suzuki, Mami; Takahashi, Aiko; Tamiya, Gen; Tanaka, Hiroshi; Taylor, Todd; Terwilliger, Joseph D; Unneberg, Per; Veeramachaneni, Vamsi; Watanabe, Shinya; Wilming, Laurens; Yasuda, Norikazu; Yoo, Hyang-Sook; Stodolsky, Marvin; Makalowski, Wojciech; Go, Mitiko; Nakai, Kenta; Takagi, Toshihisa; Kanehisa, Minoru; Sakaki, Yoshiyuki; Quackenbush, John; Okazaki, Yasushi; Hayashizaki, Yoshihide; Hide, Winston; Chakraborty, Ranajit; Nishikawa, Ken; Sugawara, Hideaki; Tateno, Yoshio; Chen, Zhu; Oishi, Michio; Tonellato, Peter; Apweiler, Rolf; Okubo, Kousaku; Wagner, Lukas; Wiemann, Stefan; Strausberg, Robert L; Isogai, Takao; Auffray, Charles; Nomura, Nobuo; Sugano, Sumio
2004-01-01
The human genome sequence defines our inherent biological potential; the realization of the biology encoded therein requires knowledge of the function of each gene. Currently, our knowledge in this area is still limited. Several lines of investigation have been used to elucidate the structure and function of the genes in the human genome. Even so, gene prediction remains a difficult task, as the varieties of transcripts of a gene may vary to a great extent. We thus performed an exhaustive integrative characterization of 41,118 full-length cDNAs that capture the gene transcripts as complete functional cassettes, providing an unequivocal report of structural and functional diversity at the gene level. Our international collaboration has validated 21,037 human gene candidates by analysis of high-quality full-length cDNA clones through curation using unified criteria. This led to the identification of 5,155 new gene candidates. It also manifested the most reliable way to control the quality of the cDNA clones. We have developed a human gene database, called the H-Invitational Database (H-InvDB; http://www.h-invitational.jp/). It provides the following: integrative annotation of human genes, description of gene structures, details of novel alternative splicing isoforms, non-protein-coding RNAs, functional domains, subcellular localizations, metabolic pathways, predictions of protein three-dimensional structure, mapping of known single nucleotide polymorphisms (SNPs), identification of polymorphic microsatellite repeats within human genes, and comparative results with mouse full-length cDNAs. The H-InvDB analysis has shown that up to 4% of the human genome sequence (National Center for Biotechnology Information build 34 assembly) may contain misassembled or missing regions. We found that 6.5% of the human gene candidates (1,377 loci) did not have a good protein-coding open reading frame, of which 296 loci are strong candidates for non-protein-coding RNA genes. 
In addition, among 72,027 uniquely mapped SNPs and insertions/deletions localized within human genes, 13,215 nonsynonymous SNPs, 315 nonsense SNPs, and 452 indels occurred in coding regions. Together with 25 polymorphic microsatellite repeats present in coding regions, they may alter protein structure, causing phenotypic effects or resulting in disease. The H-InvDB platform represents a substantial contribution to resources needed for the exploration of human biology and pathology. PMID:15103394
Identification and characterization of the grape WRKY family.
Zhang, Ying; Feng, Jian Can
2014-01-01
WRKY transcription factors have functions in plant growth and development and in response to biotic and abiotic stresses. Many studies have focused on functional identification of WRKY transcription factors, but little is known about the molecular phylogeny or global expression patterns of the complete WRKY family. In this study, we identified 80 WRKY proteins encoded in the grape genome. Based on the structural features of these proteins, the grape WRKY genes were classified into three groups (groups 1-3). Analysis of WRKY gene expression profiles indicated that 28 WRKY genes were differentially expressed in response to biotic stress caused by grape white rot and/or salicylic acid (SA). Of these, 16 WRKY genes were upregulated by both the white rot pathogenic bacteria and SA, indicating that these 16 WRKY proteins participate in the SA-dependent defense signaling pathway. This study provides a basis for cloning genes with specific functions from grape.
Fujimoto, Akihiro; Okada, Yukinori; Boroevich, Keith A; Tsunoda, Tatsuhiko; Taniguchi, Hiroaki; Nakagawa, Hidewaki
2016-05-26
Protein tertiary structure determines molecular function, interaction, and stability of the protein, therefore distribution of mutation in the tertiary structure can facilitate the identification of new driver genes in cancer. To analyze mutation distribution in protein tertiary structures, we applied a novel three dimensional permutation test to the mutation positions. We analyzed somatic mutation datasets of 21 types of cancers obtained from exome sequencing conducted by the TCGA project. Of the 3,622 genes that had ≥3 mutations in the regions with tertiary structure data, 106 genes showed significant skew in mutation distribution. Known tumor suppressors and oncogenes were significantly enriched in these identified cancer gene sets. Physical distances between mutations in known oncogenes were significantly smaller than those of tumor suppressors. Twenty-three genes were detected in multiple cancers. Candidate genes with significant skew of the 3D mutation distribution included kinases (MAPK1, EPHA5, ERBB3, and ERBB4), an apoptosis related gene (APP), an RNA splicing factor (SF1), a miRNA processing factor (DICER1), an E3 ubiquitin ligase (CUL1) and transcription factors (KLF5 and EEF1B2). Our study suggests that systematic analysis of mutation distribution in the tertiary protein structure can help identify cancer driver genes. PMID:27225414
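The idea of testing whether mutations are spatially skewed in a tertiary structure can be illustrated with a generic permutation test. The abstract does not give the authors' exact statistic, so the sketch below makes its own assumptions: the statistic is the mean pairwise distance between mutated residues, and the null distribution comes from drawing the same number of residues uniformly at random from the structure. The coordinates are a toy example, not real protein data.

```python
import itertools
import math
import random
from typing import List, Sequence, Tuple

Coord = Tuple[float, float, float]


def mean_pairwise_distance(points: Sequence[Coord]) -> float:
    """Average Euclidean distance over all pairs of points (needs >= 2 points)."""
    pairs = list(itertools.combinations(points, 2))
    return sum(math.dist(a, b) for a, b in pairs) / len(pairs)


def clustering_p_value(
    coords: Sequence[Coord], mutated: Sequence[int], n_perm: int = 2000, seed: int = 0
) -> float:
    """One-sided permutation p-value: are mutated residues closer than chance?"""
    rng = random.Random(seed)
    observed = mean_pairwise_distance([coords[i] for i in mutated])
    hits = 0
    for _ in range(n_perm):
        # Null model (assumption): mutations fall on random residues.
        sample = rng.sample(range(len(coords)), len(mutated))
        if mean_pairwise_distance([coords[i] for i in sample]) <= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return (hits + 1) / (n_perm + 1)


# Toy "protein": 30 residues on a straight line, 3.8 units apart.
toy_coords: List[Coord] = [(3.8 * i, 0.0, 0.0) for i in range(30)]
p_clustered = clustering_p_value(toy_coords, [10, 11, 12])
p_spread = clustering_p_value(toy_coords, [0, 15, 29])
print(p_clustered, p_spread)
```

Three adjacent residues give a small p-value, while three residues spread across the whole chain do not. A real analysis would also need to account for per-residue mutability and multiple testing across genes.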
DOE Office of Scientific and Technical Information (OSTI.GOV)
Corbin, Cyrielle; Drouet, Samantha; Markulin, Lucija; Auguin, Daniel; Lainé, Éric; Davin, Laurence B; Cort, John R; Lewis, Norman G; Hano, Christophe
2018-05-01
Identification of DIR encoding genes in flax genome. Analysis of phylogeny, gene/protein structures and evolution. Identification of new conserved motifs linked to biochemical functions. Investigation of spatio-temporal gene expression and response to stress. Dirigent proteins (DIRs) were discovered during 8-8' lignan biosynthesis studies, through identification of stereoselective coupling to afford either (+)- or (-)-pinoresinols from E-coniferyl alcohol. DIRs are also involved or potentially involved in terpenoid, allyl/propenyl phenol lignan, pterocarpan and lignin biosynthesis. DIRs have very large multigene families in different vascular plants including flax, with most still of unknown function. DIR studies typically focus on a small subset of genes and identification of biochemical/physiological functions. Herein, a genome-wide analysis and characterization of the predicted flax DIR 44-membered multigene family was performed, this species being a rich natural grain source of 8-8' linked secoisolariciresinol-derived lignan oligomers. All predicted DIR sequences, including their promoters, were analyzed together with their public gene expression datasets. Expression patterns of selected DIRs were examined using qPCR, as well as through clustering analysis of DIR gene expression. These analyses further implicated roles for specific DIRs in (-)-pinoresinol formation in seed-coats, as well as (+)-pinoresinol in vegetative organs and/or specific responses to stress. Phylogeny and gene expression analysis segregated flax DIRs into six distinct clusters with new cluster-specific motifs identified. We propose that these findings can serve as a foundation to further systematically determine functions of DIRs, i.e. other than those already known in lignan biosynthesis in flax and other species. 
Given the differential expression profiles and inducibility of the flax DIR family, we provisionally propose that some DIR genes of unknown function could be involved in different aspects of secondary cell wall biosynthesis and plant defense.
Identification of Novel Prognostic Genetic Markers in Prostate Cancer
2000-02-01
alterations in two normal- and three malignant-derived prostate epithelial cell lines immortalized with the E6 and E7 transforming genes of human papilloma virus (HPV...malignant-derived prostate epithelial cell lines immortalized with the E6 and E7 transforming genes of human papilloma virus (HPV) 16. These studies...transforming genes of human papilloma virus (HPV) 16 (13). The cell lines demonstrated several numerical and structural chromosomal alterations
Genome-wide identification, phylogeny, and expression analysis of the SWEET gene family in tomato.
Feng, Chao-Yang; Han, Jia-Xuan; Han, Xiao-Xue; Jiang, Jing
2015-12-01
The SWEET (Sugars Will Eventually Be Exported Transporters) gene family encodes membrane-embedded sugar transporters containing seven transmembrane helices that harbor two MtN3/saliva domains. SWEETs play important roles in diverse biological processes, including plant growth, development, and response to environmental stimuli. Here, we conducted an exhaustive search of the tomato genome, leading to the identification of 29 SWEET genes. We analyzed the structures, conserved domains, and phylogenetic relationships of these protein-coding genes in detail. We also analyzed the transcript levels of SWEET genes in various tissues, organs, and developmental stages to obtain information about their functions. Furthermore, we investigated the expression patterns of the SWEET genes in response to exogenous sugar and adverse environmental stress (high and low temperatures). Some family members exhibited tissue-specific expression, whereas others were more ubiquitously expressed. Numerous stress-responsive candidate genes were obtained. The results of this study provide insights into the characteristics of the SWEET genes in tomato and may serve as a basis for further functional studies of such genes. Copyright © 2015 Elsevier B.V. All rights reserved.
Data identification for improving gene network inference using computational algebra.
Dimitrova, Elena; Stigler, Brandilyn
2014-11-01
Identification of models of gene regulatory networks is sensitive to the amount of data used as input. Considering the substantial costs in conducting experiments, it is of value to have an estimate of the amount of data required to infer the network structure. To minimize wasted resources, it is also beneficial to know which data are necessary to identify the network. Knowledge of the data and knowledge of the terms in polynomial models are often required a priori in model identification. In applications, it is unlikely that the structure of a polynomial model will be known, which may force data sets to be unnecessarily large in order to identify a model. Furthermore, none of the known results provides any strategy for constructing data sets to uniquely identify a model. We provide a specialization of an existing criterion for deciding when a set of data points identifies a minimal polynomial model when its monomial terms have been specified. Then, we relax the requirement of the knowledge of the monomials and present results for model identification given only the data. Finally, we present a method for constructing data sets that identify minimal polynomial models.
Plant nucleolar DNA: Green light shed on the role of Nucleolin in genome organization
Picart, Claire
2017-01-01
ABSTRACT The nucleolus forms as a consequence of ribosome biogenesis, but it is also implicated in other cell functions. The identification of nucleolus-associated chromatin domains (NADs) in animal and plant cells revealed the presence of DNA sequences other than rRNA genes in and around the nucleolus. NADs display repressive chromatin signatures and harbour repetitive DNA, but also tRNA genes and RNA polymerase II-transcribed genes. Furthermore, the identification of NADs revealed a specific function of the nucleolus and the protein Nucleolin 1 (NUC1) in telomere biology. Here, we discuss the significance of these data with regard to nucleolar structure and to the role of the nucleolus and NUC1 in global genome organization and stability. PMID:27644794
Secondary structural entropy in RNA switch (Riboswitch) identification.
Manzourolajdad, Amirhossein; Arnold, Jonathan
2015-04-28
RNA regulatory elements play a significant role in gene regulation. Riboswitches, a widespread group of regulatory RNAs, are vital components of many bacterial genomes. These regulatory elements generally function by forming a ligand-induced alternative fold that controls access to ribosome binding sites or other regulatory sites in RNA. Riboswitch-mediated mechanisms are ubiquitous across bacterial genomes. A typical class of riboswitch has its own unique structural and biological complexity, making de novo riboswitch identification a formidable task. Traditionally, riboswitches have been identified through comparative genomics based on sequence and structural homology. The limitations of structural-homology-based approaches, coupled with the assumption that there is a great diversity of undiscovered riboswitches, suggests the need for alternative methods for riboswitch identification, possibly based on features intrinsic to their structure. As of yet, no such reliable method has been proposed. We used structural entropy of riboswitch sequences as a measure of their secondary structural dynamics. Entropy values of a diverse set of riboswitches were compared to that of their mutants, their dinucleotide shuffles, and their reverse complement sequences under different stochastic context-free grammar folding models. Significance of our results was evaluated by comparison to other approaches, such as the base-pairing entropy and energy landscapes dynamics. Classifiers based on structural entropy optimized via sequence and structural features were devised as riboswitch identifiers and tested on Bacillus subtilis, Escherichia coli, and Synechococcus elongatus as an exploration of structural entropy based approaches. The unusually long untranslated region of the cotH in Bacillus subtilis, as well as upstream regions of certain genes, such as the sucC genes were associated with significant structural entropy values in genome-wide examinations. 
Various tests show that there is in fact a relationship between higher structural entropy and the potential for the RNA sequence to have alternative structures, within the limitations of our methodology. This relationship, though modest, is consistent across various tests. Understanding the behavior of structural entropy as a fairly new feature for RNA conformational dynamics, however, may require extensive exploratory investigation both across RNA sequences and folding models.
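As a rough illustration of why entropy can flag sequences with alternative folds, the sketch below computes the Shannon entropy of a probability distribution over candidate secondary structures. It is a simplified stand-in: in the study the distribution comes from stochastic context-free grammar folding models, whereas here the probabilities are hypothetical inputs supplied by hand.

```python
import math
from typing import Sequence


def structural_entropy_bits(weights: Sequence[float]) -> float:
    """Shannon entropy (in bits) of a distribution over alternative structures.

    `weights` are non-negative scores for each candidate structure; they are
    normalized to probabilities before the entropy is computed.
    """
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    entropy = 0.0
    for w in weights:
        p = w / total
        if p > 0:
            entropy -= p * math.log2(p)
    return entropy


# Hypothetical ensembles: one dominant fold vs. two equally likely folds.
single_fold = [0.98, 0.01, 0.01]
two_state_switch = [0.49, 0.49, 0.02]
print(structural_entropy_bits(single_fold))
print(structural_entropy_bits(two_state_switch))
```

A sequence whose probability mass is split between two folds scores higher than one dominated by a single fold, which matches the modest relationship between entropy and alternative structures reported above.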
Vivero, Rafael José; Contreras-Gutiérrez, Maria Angélica; Bejarano, Eduar Elías
2007-09-01
Lutzomyia sand flies are involved in the transmission of the parasite Leishmania spp. in America. The taxonomy of these vectors is traditionally based on morphological features of the adult stage, particularly the paired structures of the head and genitalia. Although these characters are useful to distinguish most species of Lutzomyia, morphological identification may be complicated by the similarities within subgenera and species groups. The objective was to evaluate the utility of the mitochondrial serine transfer RNA, tRNA(Ser), for taxonomic identification of Lutzomyia. Seven sand fly species, each representing one of the 27 taxonomic subdivisions in genus Lutzomyia, were analyzed, including L. trinidadensis (Oswaldoi group), L. (Psychodopygus) panamensis, L. (Micropygomyia) cayennensis cayennensis, L. dubitans (Migonei group), L. (Lutzomyia) gomezi, L. rangeliana (ungrouped) and L. evansi (Verrucarum group). The mitochondrial tRNA(Ser) gene, flanked by the cytochrome b and NADH dehydrogenase subunit 1 genes, was extracted, amplified and sequenced from each specimen. The secondary structure of the tRNA(Ser) was predicted by comparison with previously described homologous structures from other dipteran species. The tRNA(Ser) gene ranged in size from 66 base pairs in L. gomezi to 69 base pairs in L. trinidadensis. Fourteen polymorphic sites, including four insertion-deletion events, were observed in the aligned 70 nucleotide positions. The majority of the substitutions were located in the dihydrouridine, ribothymidine-pseudouridine-cytosine and variable loops, as well as in the basal extreme of the anticodon arm. Changes in the primary sequence of the tRNA(Ser) provided useful molecular characters for taxonomic identification of the sand fly species under consideration.
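Counting polymorphic sites in an alignment, as reported above, amounts to scanning alignment columns for more than one character state. The sketch below shows one simple way to do that; the sequences are invented toy data, and it counts gap-containing columns rather than insertion-deletion events, which can span several adjacent columns.

```python
from typing import List, Sequence


def polymorphic_sites(alignment: Sequence[str]) -> List[int]:
    """Return 0-based column indices where aligned sequences differ."""
    if len({len(seq) for seq in alignment}) != 1:
        raise ValueError("all aligned sequences must have the same length")
    return [
        pos for pos, column in enumerate(zip(*alignment)) if len(set(column)) > 1
    ]


def gap_sites(alignment: Sequence[str], gap: str = "-") -> List[int]:
    """Return polymorphic columns in which at least one sequence has a gap."""
    return [
        pos
        for pos, column in enumerate(zip(*alignment))
        if gap in column and len(set(column)) > 1
    ]


# Toy alignment of three hypothetical sequences (not real tRNA data).
toy_alignment = [
    "ACGT-A",
    "ACGTTA",
    "ATGT-A",
]
print(polymorphic_sites(toy_alignment))
print(gap_sites(toy_alignment))
```

Here column 1 is a substitution and column 4 is a gap-containing site, so the two functions separate the kinds of variation that the abstract reports together.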
Molecular mechanisms of floral organ specification by MADS domain proteins.
Yan, Wenhao; Chen, Dijun; Kaufmann, Kerstin
2016-02-01
Flower development is a model system to understand organ specification in plants. The identities of different types of floral organs are specified by homeotic MADS transcription factors that interact in a combinatorial fashion. Systematic identification of DNA-binding sites and target genes of these key regulators show that they have shared and unique sets of target genes. DNA binding by MADS proteins is not based on 'simple' recognition of a specific DNA sequence, but depends on DNA structure and combinatorial interactions. Homeotic MADS proteins regulate gene expression via alternative mechanisms, one of which may be to modulate chromatin structure and accessibility in their target gene promoters. Copyright © 2015 Elsevier Ltd. All rights reserved.
Ma, Jun; Wang, Qinglian; Sun, Runrun; Xie, Fuliang; Jones, Don C; Zhang, Baohong
2014-10-16
Plant-specific TEOSINTE-BRANCHED1/CYCLOIDEA/PCF (TCP) transcription factors play versatile roles in multiple aspects of plant growth and development. However, no systematic study has been performed in cotton. In this study, we performed for the first time the genome-wide identification and expression analysis of the TCP transcription factor family in Gossypium raimondii. A total of 38 non-redundant cotton TCP encoding genes were identified. The TCP transcription factors were divided into eleven subgroups based on phylogenetic analysis. Most TCP genes within the same subfamily demonstrated similar exon and intron organization and the motif structures were highly conserved among the subfamilies. Additionally, the chromosomal distribution pattern revealed that TCP genes were unevenly distributed across 11 out of the 13 chromosomes; segmental duplication is a predominant duplication event for TCP genes and the major contributor to the expansion of the TCP gene family in G. raimondii. Moreover, the expression profiles of TCP genes shed light on their functional divergence. PMID:25322260
Mu, Min; Lu, Xu-Ke; Wang, Jun-Juan; Wang, De-Long; Yin, Zu-Jun; Wang, Shuai; Fan, Wei-Li; Ye, Wu-Wei
2016-03-18
Trehalose (α-D-glucopyranosyl α-D-glucopyranoside) is a nonreducing disaccharide and is widely distributed in bacteria, fungi, algae, plants and invertebrates. In this study, stress-related trehalose-6-phosphate synthase (TPS) genes in cotton were identified, and the genetic structure and molecular evolution of the TPSs were analyzed with bioinformatics methods, which could lay a foundation for further research on TPS functions in cotton. The genome information of Gossypium raimondii (group D), G. arboreum L. (group A), and G. hirsutum L. (group AD) was used in the study. Fifty-three TPSs were identified, comprising 15 genes in group D, 14 in group A, and 24 in group AD. Bioinformatics methods were used to analyze the genetic structure and molecular evolution of TPSs. Real-time PCR analysis was performed to investigate the expression patterns of gene family members. All TPS family members in cotton can be divided into two subfamilies: Class I and Class II. TPS sequence similarity is high within the same species and among close family relatives. The genetic structures of the two TPS subfamilies differ, with more introns and a more complicated gene structure in Class I. There is a TPS domain (Glyco transf_20) at the N-terminal in all TPS family members and a TPP domain (Trehalose_PPase) at the C-terminal in all except GrTPS6, GhTPS4, and GhTPS9. All Class II members contain a UDP-forming domain. The responses to environmental stresses showed that stresses could induce the expression of TPSs, but the expression patterns vary with different stresses. The distribution of TPSs varies with different species but is relatively uniform on chromosomes. Genetic structure varies among gene members, and expression levels vary with different stresses and exhibit tissue specificity. Significantly more genes are upregulated in upland cotton TM-1 than in G. raimondii and G. arboreum L. Shixiya 1.
Cao, Yi-zhan; Hao, Chun-qiu; Feng, Zhi-hua; Zhou, Yong-xing; Li, Jin-ge; Jia, Zhan-sheng; Wang, Ping-zhong
2003-02-01
The aim was to construct three recombinant adenovirus shuttle plasmids that express different hepatitis C virus (HCV) structural genes (C, C+E1, C+E1+E2), as a step toward packaging adenovirus expression vectors that express these genes effectively. The HCV structural genes were amplified by polymerase chain reaction (PCR) from the plasmid pBRTM/HCV1-3011 and inserted downstream of the cytomegalovirus (CMV) immediate early promoter element of the adenovirus shuttle plasmid pAd.CMV-Link.1, yielding three recombinant plasmids (pAd.HCV-C, pAd.HCV-CE1, pAd.HCV-S). The recombinant plasmids were verified by endonuclease digestion, PCR and sequencing. The HCV structural genes were expressed transiently in HepG2 cells transfected using Lipofectamine 2000, and expression was assessed by immunofluorescence and Western blot. The insert DNAs of the three recombinant plasmids were confirmed to be the respective HCV structural genes by endonuclease digestion, PCR and sequencing. All three recombinant plasmids expressed the HCV structural genes (C, C+E1, C+E1+E2) transiently in HepG2 cells, as confirmed by immunofluorescence and Western blot. The three recombinant adenovirus shuttle plasmids can therefore express the HCV structural genes (C, C+E1, C+E1+E2) transiently. This should be useful for packaging adenovirus expression vectors that express HCV structural genes.
Xu, Xiaodan; Li, Yingcong; Zhao, Heng; Wen, Si-yuan; Wang, Sheng-qi; Huang, Jian; Huang, Kun-lun; Luo, Yun-bo
2005-05-18
To devise a rapid and reliable method for the detection and identification of genetically modified (GM) events, we developed a multiplex polymerase chain reaction (PCR) coupled with a DNA microarray system simultaneously aiming at many targets in a single reaction. The system included probes for screening gene, species reference gene, specific gene, construct-specific gene, event-specific gene, and internal and negative control genes. 18S rRNA was combined with species reference genes as internal controls to assess the efficiency of all reactions and to eliminate false negatives. Two sets of the multiplex PCR system were used to amplify four and five targets, respectively. Eight different structure genes could be detected and identified simultaneously for Roundup Ready soybean in a single microarray. The microarray specificity was validated by its ability to discriminate two GM maizes Bt176 and Bt11. The advantages of this method are its high specificity and greatly reduced false-positives and -negatives. The multiplex PCR coupled with microarray technology presented here is a rapid and reliable tool for the simultaneous detection of GM organism ingredients.
Gene expression complex networks: synthesis, identification, and analysis.
Lopes, Fabrício M; Cesar, Roberto M; Costa, Luciano Da F
2011-10-01
Thanks to recent advances in molecular biology, allied to an ever increasing amount of experimental data, the functional state of thousands of genes can now be extracted simultaneously by using methods such as cDNA microarrays and RNA-Seq. Particularly important related investigations are the modeling and identification of gene regulatory networks from expression data sets. Such knowledge is fundamental for many applications, such as disease treatment, therapeutic intervention strategies and drug design, as well as for planning new high-throughput experiments. Methods have been developed for gene network modeling and identification from expression profiles. However, an important open problem is how to validate such approaches and their results. This work presents an objective approach for validation of gene network modeling and identification which comprises the following three main aspects: (1) Artificial Gene Networks (AGNs) model generation through theoretical models of complex networks, which is used to simulate temporal expression data; (2) a computational method for gene network identification from the simulated data, which is founded on a feature selection approach where a target gene is fixed and the expression profile is observed for all other genes in order to identify a relevant subset of predictors; and (3) validation of the identified AGN-based network through comparison with the original network. The proposed framework allows several types of AGNs to be generated and used in order to simulate temporal expression data. The results of the network identification method can then be compared to the original network in order to estimate its properties and accuracy. Some of the most important theoretical models of complex networks have been assessed: the uniformly-random Erdős-Rényi (ER), the small-world Watts-Strogatz (WS), the scale-free Barabási-Albert (BA), and geographical networks (GG).
The experimental results indicate that the inference method was sensitive to average degree
Flynn, Christopher M; Schmidt-Dannert, Claudia
2018-06-01
The wood-rotting mushroom Stereum hirsutum is a known producer of a large number of namesake hirsutenoids, many with important bioactivities. Hirsutenoids form a structurally diverse and distinct class of sesquiterpenoids. No genes involved in hirsutenoid biosynthesis have yet been identified or their enzymes characterized. Here, we describe the cloning and functional characterization of a hirsutene synthase as an unexpected fusion protein of a sesquiterpene synthase (STS) with a C-terminal 3-hydroxy-3-methylglutaryl-coenzyme A (3-hydroxy-3-methylglutaryl-CoA) synthase (HMGS) domain. Both the full-length fusion protein and truncated STS domain are highly product-specific 1,11-cyclizing STS enzymes with kinetic properties typical of STSs. Complementation studies in Saccharomyces cerevisiae confirmed that the HMGS domain is also functional in vivo. Phylogenetic analysis shows that the hirsutene synthase domain does not form a clade with other previously characterized sesquiterpene synthases from Basidiomycota. Comparative gene structure analysis of this hirsutene synthase with characterized fungal enzymes reveals a significantly higher intron density, suggesting that this enzyme may have been acquired by horizontal gene transfer. In contrast, the HMGS domain is clearly related to other fungal homologs. This STS-HMGS fusion protein is part of a biosynthetic gene cluster that includes P450s and oxidases that are expressed and could be cloned from cDNA. Finally, this unusual fusion of a terpene synthase to an HMGS domain, which is not generally recognized as a key regulatory enzyme of the mevalonate isoprenoid precursor pathway, led to the identification of additional HMGS duplications in many fungal genomes, including the localization of HMGSs in other predicted sesquiterpenoid biosynthetic gene clusters. IMPORTANCE Hirsutenoids represent a structurally diverse class of bioactive sesquiterpenoids isolated from fungi.
Identification of their biosynthetic pathways will provide access to this chemodiversity for the discovery and synthesis of molecules with new bioactivities. The identification and successful cloning of the previously elusive hirsutene synthase from S. hirsutum provide important insights and strategies for biosynthetic gene discovery in Basidiomycota. The finding of a terpene synthase-HMGS fusion, the discovery of other sesquiterpenoid biosynthetic gene clusters with dedicated HMGS genes, and HMGS gene duplications in fungal genomes give new importance to the role of HMGS as a key regulatory enzyme in isoprenoid and sterol biosynthesis that should be exploited for metabolic engineering. Copyright © 2018 American Society for Microbiology.
Anaerobic biosynthesis of the lower ligand of vitamin B12
Hazra, Amrita B.; Han, Andrew W.; Mehta, Angad P.; Mok, Kenny C.; Osadchiy, Vadim; Begley, Tadhg P.; Taga, Michiko E.
2015-01-01
Vitamin B12 (cobalamin) is required by humans and other organisms for diverse metabolic processes, although only a subset of prokaryotes is capable of synthesizing B12 and other cobamide cofactors. The complete aerobic and anaerobic pathways for the de novo biosynthesis of B12 are known, with the exception of the steps leading to the anaerobic biosynthesis of the lower ligand, 5,6-dimethylbenzimidazole (DMB). Here, we report the identification and characterization of the complete pathway for anaerobic DMB biosynthesis. This pathway, identified in the obligate anaerobic bacterium Eubacterium limosum, is composed of five previously uncharacterized genes, bzaABCDE, that together direct DMB production when expressed in anaerobically cultured Escherichia coli. Expression of different combinations of the bza genes revealed that 5-hydroxybenzimidazole, 5-methoxybenzimidazole, and 5-methoxy-6-methylbenzimidazole, all of which are lower ligands of cobamides produced by other organisms, are intermediates in the pathway. The bza gene content of several bacterial and archaeal genomes is consistent with experimentally determined structures of the benzimidazoles produced by these organisms, indicating that these genes can be used to predict cobamide structure. The identification of the bza genes thus represents the last remaining unknown component of the biosynthetic pathway for not only B12 itself, but also for three other cobamide lower ligands whose biosynthesis was previously unknown. Given the importance of cobamides in environmental, industrial, and human-associated microbial metabolism, the ability to predict cobamide structure may lead to an improved ability to understand and manipulate microbial metabolism. PMID:26246619
Differentially Coexpressed Disease Gene Identification Based on Gene Coexpression Network.
Jiang, Xue; Zhang, Han; Quan, Xiongwen
2016-01-01
Screening disease-related genes by analyzing gene expression data has become a popular theme. Traditional disease-related gene selection methods focus on identifying differentially expressed genes between case samples and a control group. These traditional methods may not fully consider the changes of interactions between genes at different cell states and the dynamic processes of gene expression levels during disease progression. However, in order to understand the mechanism of disease, it is important to explore the dynamic changes of interactions between genes in biological networks at different cell states. In this study, we designed a novel framework to identify disease-related genes and developed a differentially coexpressed disease-related gene identification method based on gene coexpression network (DCGN) to screen differentially coexpressed genes. We first constructed phase-specific gene coexpression networks using time-series gene expression data and defined the concept of differential coexpression of genes in a coexpression network. Then, we designed two metrics to measure the value of gene differential coexpression according to the change of local topological structures between different phase-specific networks. Finally, we conducted a meta-analysis of gene differential coexpression based on the rank-product method. Experimental results on real-world gene expression data sets demonstrated the feasibility and effectiveness of DCGN and its superior performance over other popular disease-related gene selection methods.
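The rank-product meta-analysis step mentioned in this abstract can be illustrated with a minimal Python sketch. It is not the DCGN implementation: the function name, the input layout (one dictionary of scores per metric) and the example gene names and scores are all assumptions made for illustration. The sketch ranks genes within each metric (rank 1 = highest score) and combines the ranks by their geometric mean, so genes that rank near the top under every metric get the smallest rank product.

```python
from math import prod


def rank_product(scores_by_metric):
    """Combine several score lists into one rank product per gene.

    scores_by_metric: list of dicts mapping gene -> score, where a higher
    score means stronger differential coexpression (hypothetical layout).
    Returns a dict mapping gene -> geometric mean of its ranks (1 = best).
    Ties are broken by gene name, which is an arbitrary illustrative choice.
    """
    genes = sorted(scores_by_metric[0])
    n_metrics = len(scores_by_metric)
    ranks = {gene: [] for gene in genes}
    for scores in scores_by_metric:
        # sorted() is stable, so equal scores keep alphabetical gene order
        ordered = sorted(genes, key=lambda g: scores[g], reverse=True)
        for position, gene in enumerate(ordered, start=1):
            ranks[gene].append(position)
    return {g: prod(r) ** (1.0 / n_metrics) for g, r in ranks.items()}


# Made-up scores for three hypothetical genes under two hypothetical metrics
metric_a = {"g1": 0.9, "g2": 0.5, "g3": 0.1}
metric_b = {"g1": 0.7, "g2": 0.8, "g3": 0.2}
combined = rank_product([metric_a, metric_b])
for gene in sorted(combined, key=combined.get):
    print(gene, round(combined[gene], 3))
```

In a full rank-product analysis the significance of each value is usually assessed by permutation; that step is omitted here.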
Identification of causal genes for complex traits
Hormozdiari, Farhad; Kichaev, Gleb; Yang, Wen-Yun; Pasaniuc, Bogdan; Eskin, Eleazar
2015-01-01
Motivation: Although genome-wide association studies (GWAS) have identified thousands of variants associated with common diseases and complex traits, only a handful of these variants are validated to be causal. We consider ‘causal variants’ as variants which are responsible for the association signal at a locus. As opposed to association studies that benefit from linkage disequilibrium (LD), the main challenge in identifying causal variants at associated loci lies in distinguishing among the many closely correlated variants due to LD. This is particularly important for model organisms such as inbred mice, where LD extends much further than in human populations, resulting in large stretches of the genome with significantly associated variants. Furthermore, these model organisms are highly structured and require correction for population structure to remove potential spurious associations. Results: In this work, we propose CAVIAR-Gene (CAusal Variants Identification in Associated Regions), a novel method that is able to operate across large LD regions of the genome while also correcting for population structure. A key feature of our approach is that it provides as output a minimally sized set of genes that captures the genes which harbor causal variants with probability ρ. Through extensive simulations, we demonstrate that our method not only speeds up computation, but also has an average of 10% higher recall rate compared with the existing approaches. We validate our method using real mouse high-density lipoprotein (HDL) data and show that CAVIAR-Gene is able to identify Apoa2 (a gene known to harbor causal variants for HDL), while reducing the number of genes that need to be tested for functionality by a factor of 2. Availability and implementation: Software is freely available for download at genetics.cs.ucla.edu/caviar. Contact: eeskin@cs.ucla.edu PMID:26072484
Identification of causal genes for complex traits.
Hormozdiari, Farhad; Kichaev, Gleb; Yang, Wen-Yun; Pasaniuc, Bogdan; Eskin, Eleazar
2015-06-15
Although genome-wide association studies (GWAS) have identified thousands of variants associated with common diseases and complex traits, only a handful of these variants are validated to be causal. We consider 'causal variants' as variants which are responsible for the association signal at a locus. As opposed to association studies that benefit from linkage disequilibrium (LD), the main challenge in identifying causal variants at associated loci lies in distinguishing among the many closely correlated variants due to LD. This is particularly important for model organisms such as inbred mice, where LD extends much further than in human populations, resulting in large stretches of the genome with significantly associated variants. Furthermore, these model organisms are highly structured and require correction for population structure to remove potential spurious associations. In this work, we propose CAVIAR-Gene (CAusal Variants Identification in Associated Regions), a novel method that is able to operate across large LD regions of the genome while also correcting for population structure. A key feature of our approach is that it provides as output a minimally sized set of genes that captures the genes which harbor causal variants with probability ρ. Through extensive simulations, we demonstrate that our method not only speeds up computation, but also has an average of 10% higher recall rate compared with the existing approaches. We validate our method using real mouse high-density lipoprotein (HDL) data and show that CAVIAR-Gene is able to identify Apoa2 (a gene known to harbor causal variants for HDL), while reducing the number of genes that need to be tested for functionality by a factor of 2. Software is freely available for download at genetics.cs.ucla.edu/caviar. © The Author 2015. Published by Oxford University Press.
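The idea of a minimally sized gene set that captures the causal genes with probability ρ can be illustrated with a deliberately simplified Python sketch. It is not the CAVIAR-Gene algorithm, which this abstract does not specify in detail: the sketch assumes that each gene already has a posterior probability of being the single causal gene, that these probabilities are mutually exclusive and sum to at most 1, and that the function name, gene names and numbers are hypothetical. Under those assumptions, adding genes in order of decreasing posterior until the total reaches ρ gives a smallest possible set.

```python
def rho_confident_gene_set(posteriors, rho):
    """Return a smallest gene set whose total posterior reaches rho.

    posteriors: dict mapping gene -> posterior probability of being the
    single causal gene (a simplifying assumption of this sketch).
    rho: required total probability, with 0 < rho <= 1.
    """
    if not 0.0 < rho <= 1.0:
        raise ValueError("rho must lie in (0, 1]")
    selected = []
    covered = 0.0
    # Taking the most probable genes first minimises the set size
    for gene, probability in sorted(
        posteriors.items(), key=lambda item: item[1], reverse=True
    ):
        if covered >= rho:
            break
        selected.append(gene)
        covered += probability
    if covered < rho:
        raise ValueError("posteriors do not reach the requested rho")
    return selected, covered


# Made-up posteriors for four hypothetical genes at one locus
example = {"geneA": 0.6, "geneB": 0.25, "geneC": 0.1, "geneD": 0.05}
genes, total = rho_confident_gene_set(example, rho=0.8)
print(genes, total)
```

When several genes can be causal at once, the per-gene sum no longer applies and the probability has to be computed over joint causal configurations, which this sketch does not attempt.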
Identification of the gene for Nance-Horan syndrome (NHS).
Brooks, S P; Ebenezer, N D; Poopalasundaram, S; Lehmann, O J; Moore, A T; Hardcastle, A J
2004-10-01
The disease intervals for Nance-Horan syndrome (NHS [MIM 302350]) and X linked congenital cataract (CXN) overlap on Xp22. To identify the gene or genes responsible for these diseases. Families with NHS were ascertained. The refined locus for CXN was used to focus the search for candidate genes, which were screened by polymerase chain reaction and direct sequencing of potential exons and intron-exon splice sites. Genomic structures and homologies were determined using bioinformatics. Expression studies were undertaken using specific exonic primers to amplify human fetal cDNA and mouse RNA. A novel gene NHS, with no known function, was identified as causative for NHS. Protein truncating mutations were detected in all three NHS pedigrees, but no mutation was identified in a CXN family, raising the possibility that NHS and CXN may not be allelic. The NHS gene forms a new gene family with a closely related novel gene NHS-Like1 (NHSL1). NHS and NHSL1 lie in paralogous duplicated chromosomal intervals on Xp22 and 6q24, and NHSL1 is more broadly expressed than NHS in human fetal tissues. This study reports the independent identification of the gene causative for Nance-Horan syndrome and extends the number of mutations identified.
Massonneau, Agnes; Coronado, Maria-José; Audran, Arthur; Bagniewska, Agnieszka; Mòl, Rafal; Testillano, Pilar S; Goralski, Grzegorz; Dumas, Christian; Risueño, Maria-Carmen; Matthys-Rochon, Elisabeth
2005-07-01
During maize pollen embryogenesis, a range of multicellular structures are formed. Using different approaches, the "nature" of these structures has been determined in terms of their embryogenic potential. In situ molecular identification techniques for gene transcripts and products, and a novel cell tracking system indicated the presence of embryogenic (embryo-like structures, ELS) and non-embryogenic (callus-like structures, CLS) structures that occurred for short periods within the cultures. Some multicellular structures with a compact appearance generated embryos. RT-PCR and fluorescence in situ hybridization (FISH) with confocal microscopy techniques using specific gene markers of the endosperm (ZmESR2, ZmAE3) and embryo (LTP2 and ZmOCL1, ZmOCL3) revealed "embryo" and "endosperm" potentialities in these various multicellular structures present in the cultures. The results presented here showed distinct and specific patterns of gene expression. Altogether, the results demonstrate the presence of different molecules on both embryonic and non-embryonic structures. Their possible roles are discussed in the context of a parallel between embryo/endosperm interactions in planta and embryonic and non-embryonic structure interrelations under in vitro conditions.
Wang, Jiang; Yu, Yi; Tang, Kexuan; Liu, Wen; He, Xinyi; Huang, Xi; Deng, Zixin
2010-01-01
Thiopeptide antibiotics are an important class of natural products resulting from posttranslational modifications of ribosomally synthesized peptides. Cyclothiazomycin is a typical thiopeptide antibiotic that has a unique bridged macrocyclic structure derived from an 18-amino-acid structural peptide. Here we report the cloning, sequencing, and heterologous expression of the cyclothiazomycin biosynthetic gene cluster from Streptomyces hygroscopicus 10-22. Remarkably, successful heterologous expression of a 22.7-kb gene cluster in Streptomyces lividans 1326 suggested that there is a minimum set of 15 open reading frames that includes all of the functional genes required for cyclothiazomycin production. Six of these genes, cltBCDEFG flanking the structural gene cltA, were predicted to encode the enzymes required for the main framework of cyclothiazomycin, and two enzymes encoded by a putative operon, cltMN, were hypothesized to participate in the tailoring step to generate the tertiary thioether, leading to the final cyclization of the bridged macrocyclic structure. This rigorous bioinformatics analysis based on heterologous expression of cyclothiazomycin resulted in an ideal biosynthetic model for us to understand the biosynthesis of thiopeptides. PMID:20154110
Liu, Hongyun; Qin, Jiajia; Fan, Hui; Cheng, Jinjin; Li, Lin; Liu, Zheng
2017-07-01
As members of the GRAS gene family, SCARECROW-LIKE (SCL) genes encode transcriptional regulators that are involved in plant information transmission and signal transduction. In this study, 44 SCL genes, including two SCARECROW genes, were identified in millet, distributed on eight chromosomes (all except chromosome 6). All the millet genes contain motifs 6-8, indicating that these motifs were conserved during evolution. SCL genes of millet were divided into eight groups based on the phylogenetic relationship and classification of Arabidopsis SCL genes. Several putative orthologs of the millet genes in Arabidopsis, maize and rice were identified. High-throughput RNA sequencing revealed that the expression of millet SCL genes in root, stem, leaf, spica, and along the leaf gradient varied greatly. Analyses combining the gene expression patterns, gene structures, motif compositions, promoter cis-element identification, alternative splicing of transcripts and phylogenetic relationships of SCL genes indicate that these genes may have diverse functions. Functionally characterized SCL genes in maize, rice and Arabidopsis would provide us some clues for future characterization of their homologues in millet. To the best of our knowledge, this is the first study of millet SCL genes at the genome-wide level. Our work provides a useful platform for functional analysis of SCL genes in millet, a model crop for C4 photosynthesis and bioenergy studies.
Genetics Home Reference: GM2-gangliosidosis, AB variant
... link) National Institute of Neurological Disorders and Stroke: Lipid Storage Diseases Fact Sheet Educational Resources (3 links) ... Chen B, Rigat B, Curry C, Mahuran DJ. Structure of the GM2A gene: identification of an exon ...
Blood Type Biochemistry and Human Disease
Ewald, D Rose; Sumner, Susan CJ
2016-01-01
Associations between blood type and disease have been studied since the early 1900s when researchers determined that antibodies and antigens are inherited. In the 1950s, the chemical identification of the carbohydrate structure of surface antigens led to the understanding of biosynthetic pathways. The blood type is defined by oligosaccharide structures, which are specific to the antigens, thus, blood group antigens are secondary gene products, while the primary gene products are various glycosyltransferase enzymes that attach the sugar molecules to the oligosaccharide chain. Blood group antigens are found on red blood cells, platelets, leukocytes, plasma proteins, certain tissues, and various cell surface enzymes, and also exist in soluble form in body secretions such as breast milk, seminal fluid, saliva, sweat, gastric secretions, urine, and amniotic fluid. Recent advances in technology, biochemistry, and genetics have clarified the functional classifications of human blood group antigens, the structure of the A, B, H, and Lewis determinants and the enzymes that produce them, and the association of blood group antigens with disease risks. Further research to identify differences in the biochemical composition of blood group antigens, and the relationship to risks for disease, can be important for the identification of targets for the development of nutritional intervention strategies, or the identification of druggable targets. PMID:27599872
Recent advances in primary ciliary dyskinesia genetics
Kurkowiak, Małgorzata; Ziętkiewicz, Ewa; Witt, Michał
2015-01-01
Primary ciliary dyskinesia (PCD) is a rare genetically heterogeneous disorder caused by the abnormal structure and/or function of motile cilia. The PCD diagnosis is challenging and requires a well-described clinical phenotype combined with the identification of abnormalities in ciliary ultrastructure and/or beating pattern as well as the recognition of genetic cause of the disease. Regarding the pace of identification of PCD-related genes, a rapid acceleration during the last 2–3 years is notable. This is the result of new technologies, such as whole-exome sequencing, that have been recently applied in genetic research. To date, PCD-causative mutations in 29 genes are known and the number of causative genes is bound to rise. Even though the genetic causes of approximately one-third of PCD cases still remain to be found, the current knowledge can already be used to create new, accurate genetic tests for PCD that can accelerate the correct diagnosis and reduce the proportion of unexplained cases. This review aims to present the latest data on the relations between ciliary structure aberrations and their genetic basis. PMID:25351953
Woldesemayat, Adugna Abdi; Van Heusden, Peter; Ndimba, Bongani K; Christoffels, Alan
2017-12-22
Drought is the most disastrous abiotic stress that severely affects agricultural productivity worldwide. Understanding the biological basis of drought-regulated traits requires identification and in-depth characterization of genetic determinants using model organisms and high-throughput technologies. However, studies on drought tolerance have generally been limited to the traditional candidate gene approach, which targets only a single gene in a pathway that is related to a trait. In this study, we used sorghum, one of the model crops that is well adapted to arid regions, to mine genes and define determinants for drought tolerance using drought expression libraries and RNA-seq data. We provide an integrated and comparative in silico candidate gene identification, characterization and annotation approach, with an emphasis on genes playing a prominent role in conferring drought tolerance in sorghum. A total of 470 non-redundant functionally annotated drought responsive genes (DRGs) were identified using experimental data from drought responses by employing pairwise sequence similarity searches, pathway and InterPro domain analysis, expression profiling and orthology relation. Comparison of the genomic locations between these genes and sorghum quantitative trait loci (QTLs) showed that 40% of these genes were co-localized with QTLs known for drought tolerance. The genome reannotation conducted using the Program to Assemble Spliced Alignment (PASA) resulted in 9.6% of existing single gene models being updated. In addition, 210 putative novel genes were identified using AUGUSTUS- and PASA-based analysis of the expression dataset. Among these, 50% were single exonic, 69.5% represented drought responsive and 5.7% were complete gene structure models. Analysis of biochemical metabolism revealed 14 metabolic pathways that are related to drought tolerance and also had a strong biological network among the categories of genes involved.
Identification of these pathways signifies the interplay of biochemical reactions that make up the metabolic network, constituting a fundamental interface for the sorghum defence mechanism against drought stress. This study suggests untapped natural variability in sorghum that could be used for developing drought tolerance. The data presented here may be regarded as an initial reference point in functional and comparative genomics in the Gramineae family.
Milanesi, Luciano; Petrillo, Mauro; Sepe, Leandra; Boccia, Angelo; D'Agostino, Nunzio; Passamano, Myriam; Di Nardo, Salvatore; Tasco, Gianluca; Casadio, Rita; Paolella, Giovanni
2005-01-01
Background: Protein kinases are a well defined family of proteins, characterized by the presence of a common kinase catalytic domain and playing a significant role in many important cellular processes, such as proliferation, maintenance of cell shape and apoptosis. In many members of the family, additional non-kinase domains contribute further specialization, resulting in subcellular localization, protein binding and regulation of activity, among others. About 500 genes encode members of the kinase family in the human genome, and although many of them represent well known genes, a larger number of genes code for proteins of more recent identification, or for unknown proteins identified as kinases only after computational studies. Results: A systematic in silico study performed on the human genome led to the identification of 5 genes, on chromosomes 1, 11, 13, 15 and 16 respectively, and 1 pseudogene on chromosome X; some of these genes are reported as kinases by NCBI but are absent in other databases, such as KinBase. Comparative analysis of 483 gene regions and subsequent computational analysis, aimed at identifying unannotated exons, indicates that a large number of kinases may code for alternatively spliced forms or be incorrectly annotated. An InterProScan automated analysis was performed to study domain distribution and combination in the various families. At the same time, other structural features were also added to the annotation process, including the putative presence of transmembrane alpha helices and the propensity of cysteines to participate in a disulfide bridge. Conclusion: The predicted human kinome was extended by identifying both additional genes and potential splice variants, resulting in a varied panorama where functionality may be searched at the gene and protein level.
Structural analysis of kinase protein domains as defined in multiple sources, together with transmembrane alpha helix and signal peptide prediction, provides hints for function assignment. The results of the human kinome analysis are collected in the KinWeb database, available for browsing and searching over the internet, where all results from the comparative analysis and the gene structure annotation are made available, alongside the domain information. Kinases may be searched by domain combinations and the relative genes may be viewed in a graphic browser at various levels of magnification up to gene organization on the full chromosome set. PMID:16351747
2013-01-01
protein conserved in Actinobacteria M206‡ AoriK_010100005764 ZP_08125978 Hypothetical protein AoriK_010100005769 ZP_08125979 TransRDD family protein M155...conserved in Actinobacteria . In mutant 4 (designated strain M206), we found that EZ-Tn5 was integrated into an intergenic region between 2 genes in divergent
2010-04-01
equipped with a spinning-disc confocal system ( Yokogawa ) was used. The statistical significance of changes to OPC cell numbers and migration upon nf1...that they are expressed in overlapping tissues. We examined the expression of both genes by whole mount in situ hybridization between the 4- cell stage...sorted cells confirmed expression, particularly in the vascular endothelium (Figure 4E-G), while RNA from 1- cell embryos indicate that both genes are
Eckert, Andrew J; van Heerwaarden, Joost; Wegrzyn, Jill L; Nelson, C Dana; Ross-Ibarra, Jeffrey; González-Martínez, Santíago C; Neale, David B
2010-07-01
Natural populations of forest trees exhibit striking phenotypic adaptations to diverse environmental gradients, thereby making them appealing subjects for the study of genes underlying ecologically relevant phenotypes. Here, we use a genome-wide data set of single nucleotide polymorphisms genotyped across 3059 functional genes to study patterns of population structure and identify loci associated with aridity across the natural range of loblolly pine (Pinus taeda L.). Overall patterns of population structure, as inferred using principal components and Bayesian cluster analyses, were consistent with three genetic clusters likely resulting from expansions out of Pleistocene refugia located in Mexico and Florida. A novel application of association analysis, which removes the confounding effects of shared ancestry on correlations between genetic and environmental variation, identified five loci correlated with aridity. These loci were primarily involved with abiotic stress response to temperature and drought. A unique set of 24 loci was identified as F(ST) outliers on the basis of the genetic clusters identified previously and after accounting for expansions out of Pleistocene refugia. These loci were involved with a diversity of physiological processes. Identification of nonoverlapping sets of loci highlights the fundamental differences implicit in the use of either method and suggests a pluralistic, yet complementary, approach to the identification of genes underlying ecologically relevant phenotypes.
Zhou, Yong; Hu, Lifang; Jiang, Lunwei; Liu, Shiqiang
2018-06-01
YTH domain-containing RNA-binding proteins are involved in post-transcriptional regulation and play important roles in the growth and development as well as abiotic stress responses of plants. However, YTH genes have not been previously studied in cucumber (Cucumis sativus). In this study, a total of five YTH genes (CsYTH1-CsYTH5) were identified in cucumber, which could be mapped on three out of the seven cucumber chromosomes. All CsYTH proteins had highly conserved C-terminal YTH domains, and two of them (CsYTH1 and CsYTH4) harbored extra CCCH and P/Q/N-rich domains. The phylogenesis, conserved motifs and exon-intron structure of YTH genes from cucumber, Arabidopsis and rice were also analyzed. The phylogenetically closely clustered YTHs shared similar gene structures and conserved motifs. An analysis of the cis-acting regulatory elements in the upstream region of these genes resulted in the identification of many cis-elements related to stress, hormone and development. Expression analysis based on the transcriptome data showed that some CsYTHs had development- or tissue-specific expression. In addition, their expression levels were altered under various stresses such as salt, drought, cold, and abscisic acid (ABA) treatments. These findings lay the foundation for the functional analysis of CsYTHs in the future.
Remali, Juwairiah; Sarmin, Nurul ‘Izzah Mohd; Ng, Chyan Leong; Tiong, John J.L.; Aizat, Wan M.; Keong, Loke Kok
2017-01-01
Background: Streptomyces are well known for their capability to produce many bioactive secondary metabolites with medical and industrial importance. Here we report a novel bioactive phenazine compound, 6-((2-hydroxy-4-methoxyphenoxy) carbonyl) phenazine-1-carboxylic acid (HCPCA) extracted from Streptomyces kebangsaanensis, an endophyte isolated from the ethnomedicinal Portulaca oleracea. Methods: The HCPCA chemical structure was determined using nuclear magnetic resonance spectroscopy. We conducted whole genome sequencing for the identification of the gene cluster(s) believed to be responsible for phenazine biosynthesis in order to map its corresponding pathway, in addition to bioinformatics analysis to assess the potential of S. kebangsaanensis in producing other useful secondary metabolites. Results: The S. kebangsaanensis genome comprises an 8,328,719 bp linear chromosome with high GC content (71.35%) consisting of 12 rRNA operons, 81 tRNA, and 7,558 protein coding genes. We identified 24 gene clusters involved in polyketide, nonribosomal peptide, terpene, bacteriocin, and siderophore biosynthesis, as well as a gene cluster predicted to be responsible for phenazine biosynthesis. Discussion: The HCPCA phenazine structure was hypothesized to derive from the combination of two biosynthetic pathways, phenazine-1,6-dicarboxylic acid and 4-methoxybenzene-1,2-diol, originated from the shikimic acid pathway. The identification of a biosynthesis pathway gene cluster for phenazine antibiotics might facilitate future genetic engineering design of new synthetic phenazine antibiotics. Additionally, these findings confirm the potential of S. kebangsaanensis for producing various antibiotics and secondary metabolites. PMID:29201559
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sternberg, E.A.; Spizz, G.; Perry, W.M.
1988-07-01
Terminal differentiation of skeletal myoblasts is accompanied by induction of a series of tissue-specific gene products, which includes the muscle isoenzyme of creatine kinase (MCK). To begin to define the sequences and signals involved in MCK regulation in developing muscle cells, the mouse MCK gene has been isolated. Sequence analysis of 4,147 bases of DNA surrounding the transcription initiation site revealed several interesting structural features, some of which are common to other muscle-specific genes and to cellular and viral enhancers.
Identification of the gene for Nance-Horan syndrome (NHS)
Brooks, S; Ebenezer, N; Poopalasundaram, S; Lehmann, O; Moore, A; Hardcastle, A
2004-01-01
Background: The disease intervals for Nance-Horan syndrome (NHS [MIM 302350]) and X linked congenital cataract (CXN) overlap on Xp22. Objective: To identify the gene or genes responsible for these diseases. Methods: Families with NHS were ascertained. The refined locus for CXN was used to focus the search for candidate genes, which were screened by polymerase chain reaction and direct sequencing of potential exons and intron-exon splice sites. Genomic structures and homologies were determined using bioinformatics. Expression studies were undertaken using specific exonic primers to amplify human fetal cDNA and mouse RNA. Results: A novel gene NHS, with no known function, was identified as causative for NHS. Protein truncating mutations were detected in all three NHS pedigrees, but no mutation was identified in a CXN family, raising the possibility that NHS and CXN may not be allelic. The NHS gene forms a new gene family with a closely related novel gene NHS-Like1 (NHSL1). NHS and NHSL1 lie in paralogous duplicated chromosomal intervals on Xp22 and 6q24, and NHSL1 is more broadly expressed than NHS in human fetal tissues. Conclusions: This study reports the independent identification of the gene causative for Nance-Horan syndrome and extends the number of mutations identified. PMID:15466011
Structural Overview of the Nuclear Receptor Superfamily: Insights into Physiology and Therapeutics
Huang, Pengxiang; Chandra, Vikas; Rastinejad, Fraydoon
2013-01-01
As ligand-regulated transcription factors, the nuclear hormone receptors are nearly ideal drug targets, with internal pockets that bind to hydrophobic, drug-like molecules and well-characterized ligand-induced conformational changes that recruit transcriptional coregulators to promoter elements. Yet, due to the multitude of genes under the control of a single receptor, the major challenge has been the identification of ligands with gene-selective actions, impacting disease outcomes through a narrow subset of target genes and not across their entire gene-regulatory repertoire. Here, we summarize the concepts and work to date underlying the development of steroidal and nonsteroidal receptor ligands, including the use of crystal structures, high-throughput screens, and rational design approaches for finding useful therapeutic molecules. Difficulties in finding selective receptor modulators require a more complete understanding of receptor interdomain communications, posttranslational modifications, and receptor-protein interactions that could be exploited for target gene selectivity. PMID:20148675
Jeffrey R. Row; Kevin E. Doherty; Todd B. Cross; Michael K. Schwartz; Sara Oyler-McCance; Dave E. Naugle; Steven T. Knick; Bradley C. Fedy
2018-01-01
Functional connectivity, quantified using landscape genetics, can inform conservation through the identification of factors linking genetic structure to landscape mechanisms. We used breeding habitat metrics, landscape attributes and indices of grouse abundance, to compare fit between structural connectivity and genetic differentiation within five long-established Sage...
A dominant variant in the PDE1C gene is associated with nonsyndromic hearing loss.
Wang, Li; Feng, Yong; Yan, Denise; Qin, Litao; Grati, M'hamed; Mittal, Rahul; Li, Tao; Sundhari, Abhiraami Kannan; Liu, Yalan; Chapagain, Prem; Blanton, Susan H; Liao, Shixiu; Liu, Xuezhong
2018-06-02
Identification of genes with variants causing non-syndromic hearing loss (NSHL) is challenging due to genetic heterogeneity. The difficulty is compounded by technical limitations that in the past prevented comprehensive gene identification. Recent advances in technology, using targeted capture and next-generation sequencing (NGS), are changing the face of gene identification and making it possible to rapidly and cost-effectively sequence the whole human exome. Here, we characterize a five-generation Chinese family with progressive, postlingual autosomal dominant nonsyndromic hearing loss (ADNSHL). By combining population-specific mutation arrays, a targeted deafness gene panel, and whole exome sequencing (WES), we identified PDE1C (Phosphodiesterase 1C) c.958G>T (p.A320S) as the disease-associated variant. Structural modeling insights into p.A320S strongly suggest that the sequence alteration will likely affect the substrate-binding pocket of PDE1C. By whole-mount immunofluorescence on postnatal day 3 mouse cochlea, we show its expression in outer (OHC) and inner (IHC) hair cells cytosol co-localizing with Lamp-1 in lysosomes. Furthermore, we provide evidence that the variant alters the PDE1C hydrolytic activity for both cyclic adenosine monophosphate (cAMP) and cyclic guanosine monophosphate (cGMP). Collectively, our findings indicate that the c.958G>T variant in PDE1C may disrupt the cross talk between cGMP-signaling and cAMP pathways in Ca2+ homeostasis.
Uchiyama, Ikuo
2008-10-31
Identifying the set of intrinsically conserved genes, or the genomic core, among related genomes is crucial for understanding prokaryotic genomes where horizontal gene transfers are common. Although core genome identification appears to be obvious among very closely related genomes, it becomes more difficult when more distantly related genomes are compared. Here, we consider the core structure as a set of sufficiently long segments in which gene orders are conserved so that they are likely to have been inherited mainly through vertical transfer, and developed a method for identifying the core structure by finding the order of pre-identified orthologous groups (OGs) that maximally retains the conserved gene orders. The method was applied to genome comparisons of two well-characterized families, Bacillaceae and Enterobacteriaceae, and identified their core structures comprising 1438 and 2125 OGs, respectively. The core sets contained most of the essential genes and their related genes, which were primarily included in the intersection of the two core sets comprising around 700 OGs. The definition of the genomic core based on gene order conservation was demonstrated to be more robust than the simpler approach based only on gene conservation. We also investigated the core structures in terms of G+C content homogeneity and phylogenetic congruence, and found that the core genes primarily exhibited the expected characteristic, i.e., being indigenous and sharing the same history, more than the non-core genes. The results demonstrate that our strategy of genome alignment based on gene order conservation can provide an effective approach to identify the genomic core among moderately related microbial genomes.
Zhao, Jie
2010-01-01
Arabinogalactan proteins (AGPs) comprise a family of hydroxyproline-rich glycoproteins that are implicated in plant growth and development. In this study, 69 AGPs are identified from the rice genome, including 13 classical AGPs, 15 arabinogalactan (AG) peptides, three non-classical AGPs, three early nodulin-like AGPs (eNod-like AGPs), eight non-specific lipid transfer protein-like AGPs (nsLTP-like AGPs), and 27 fasciclin-like AGPs (FLAs). The results from expressed sequence tags, microarrays, and massively parallel signature sequencing tags are used to analyse the expression of AGP-encoding genes, which is confirmed by real-time PCR. The results reveal that several rice AGP-encoding genes are predominantly expressed in anthers and display differential expression patterns in response to abscisic acid, gibberellic acid, and abiotic stresses. Based on the results obtained from this analysis, an attempt has been made to link the protein structures and expression patterns of rice AGP-encoding genes to their functions. Taken together, the genome-wide identification and expression analysis of the rice AGP gene family might facilitate further functional studies of rice AGPs. PMID:20423940
Mihali, Troco K; Kellmann, Ralf; Neilan, Brett A
2009-03-30
Saxitoxin and its analogues collectively known as the paralytic shellfish toxins (PSTs) are neurotoxic alkaloids and are the cause of the syndrome named paralytic shellfish poisoning. PSTs are produced by a unique biosynthetic pathway, which involves reactions that are rare in microbial metabolic pathways. Nevertheless, distantly related organisms such as dinoflagellates and cyanobacteria appear to produce these toxins using the same pathway. Hypothesised explanations for such an unusual phylogenetic distribution of this shared uncommon metabolic pathway, include a polyphyletic origin, an involvement of symbiotic bacteria, and horizontal gene transfer. We describe the identification, annotation and bioinformatic characterisation of the putative paralytic shellfish toxin biosynthesis clusters in an Australian isolate of Anabaena circinalis and an American isolate of Aphanizomenon sp., both members of the Nostocales. These putative PST gene clusters span approximately 28 kb and contain genes coding for the biosynthesis and export of the toxin. A putative insertion/excision site in the Australian Anabaena circinalis AWQC131C was identified, and the organization and evolution of the gene clusters are discussed. A biosynthetic pathway leading to the formation of saxitoxin and its analogues in these organisms is proposed. The PST biosynthesis gene cluster presents a mosaic structure, whereby genes have apparently transposed in segments of varying size, resulting in different gene arrangements in all three sxt clusters sequenced so far. The gene cluster organizational structure and sequence similarity seems to reflect the phylogeny of the producer organisms, indicating that the gene clusters have an ancient origin, or that their lateral transfer was also an ancient event. 
The knowledge we gain from the characterisation of the PST biosynthesis gene clusters, including the identity and sequence of the genes involved in the biosynthesis, may also afford the identification of these gene clusters in dinoflagellates, the cause of human mortalities and significant financial loss to the tourism and shellfish industries.
Mihali, Troco K; Kellmann, Ralf; Neilan, Brett A
2009-01-01
Background Saxitoxin and its analogues collectively known as the paralytic shellfish toxins (PSTs) are neurotoxic alkaloids and are the cause of the syndrome named paralytic shellfish poisoning. PSTs are produced by a unique biosynthetic pathway, which involves reactions that are rare in microbial metabolic pathways. Nevertheless, distantly related organisms such as dinoflagellates and cyanobacteria appear to produce these toxins using the same pathway. Hypothesised explanations for such an unusual phylogenetic distribution of this shared uncommon metabolic pathway, include a polyphyletic origin, an involvement of symbiotic bacteria, and horizontal gene transfer. Results We describe the identification, annotation and bioinformatic characterisation of the putative paralytic shellfish toxin biosynthesis clusters in an Australian isolate of Anabaena circinalis and an American isolate of Aphanizomenon sp., both members of the Nostocales. These putative PST gene clusters span approximately 28 kb and contain genes coding for the biosynthesis and export of the toxin. A putative insertion/excision site in the Australian Anabaena circinalis AWQC131C was identified, and the organization and evolution of the gene clusters are discussed. A biosynthetic pathway leading to the formation of saxitoxin and its analogues in these organisms is proposed. Conclusion The PST biosynthesis gene cluster presents a mosaic structure, whereby genes have apparently transposed in segments of varying size, resulting in different gene arrangements in all three sxt clusters sequenced so far. The gene cluster organizational structure and sequence similarity seems to reflect the phylogeny of the producer organisms, indicating that the gene clusters have an ancient origin, or that their lateral transfer was also an ancient event. 
The knowledge we gain from the characterisation of the PST biosynthesis gene clusters, including the identity and sequence of the genes involved in the biosynthesis, may also afford the identification of these gene clusters in dinoflagellates, the cause of human mortalities and significant financial loss to the tourism and shellfish industries. PMID:19331657
[Genome-wide identification and analysis of WRKY transcription factors in Medicago truncatula].
Song, Hui; Nan, Zhibiao
2014-02-01
The WRKY gene family plays important roles in plants through its involvement in transcriptional regulation during various physiological processes such as development, metabolism and responses to biotic and abiotic stresses. WRKY genes have been identified in various plants. However, only a few WRKY genes in Medicago truncatula have been identified with systematic analysis and comparison. In this study, we identified 93 WRKY genes through analyses of the M. truncatula genome. These genes include 19 type-I genes, 49 type-II genes, 13 type-III genes, and 12 non-regular type genes. All of these genes were characterized through analyses of gene duplication, chromosomal locations, structural diversity, conserved protein motifs and phylogenetic relations. The results showed that 11 gene duplication events occurred in the WRKY gene family, involving 24 genes. WRKY genes, containing 6 gene clusters, are unevenly distributed across chromosomes 1 to 6, and WRKY group III genes are under purifying selection pressure.
Evidence for a large expansion and subfunctionalisation of globin genes in sea anemones.
Smith, Hayden L; Pavasovic, Ana; Surm, Joachim M; Phillips, Matthew J; Prentis, Peter J
2018-06-27
The globin gene superfamily has been well-characterised in vertebrates, however, there has been limited research in early-diverging lineages, such as phylum Cnidaria. This study aimed to identify globin genes in multiple cnidarian lineages, and use bioinformatic approaches to characterise the evolution, structure and expression of these genes. Phylogenetic analyses and in silico protein predictions showed that all cnidarians have undergone an expansion of globin genes, which likely have a hexacoordinate protein structure. Our protein modelling has also revealed the possibility of a single pentacoordinate globin lineage in anthozoan species. Some cnidarian globin genes displayed tissue and development specific expression with very few orthologous genes similarly expressed across species. Our phylogenetic analyses also revealed that eumetazoan globin genes form a polyphyletic relationship with vertebrate globin genes. Overall, our analyses suggest that a Ngb-like and GbX-like gene were most likely present in the globin gene repertoire for the last common ancestor of eumetazoans. The identification of a large-scale expansion and subfunctionalisation of globin genes in actiniarians provides an excellent starting point to further our understanding of the evolution and function of the globin gene superfamily in early-diverging lineages.
Arias, Carlos Roberto; Yeh, Hsiang-Yuan; Soo, Von-Wun
2012-01-01
Finding a gene related to a genetic disease is not a trivial task. Therefore, computational methods are needed to give the biomedical community clues about which genes are more likely to be related to a specific disease as biomarkers. We address the biomarker identification problem using a gene prioritization method called gene prioritization from microarray data based on shortest paths, extended with structural and biological properties and edge flux using voting scheme (GP-MIDAS-VXEF). The method is based on finding relevant interactions on protein interaction networks, then scoring the genes using shortest paths and topological analysis, integrating the results using a voting scheme and a biological boosting. We ran two experiments, one on prostate primary and normal samples and the other on prostate primary tumors with and without lymph node metastasis. We used 137 true prostate cancer genes as a benchmark. In the first experiment, GP-MIDAS-VXEF outperformed all the other state-of-the-art methods in the benchmark by retrieving the most truly related genes from the candidate set in the top 50 scores found. We applied the same technique to infer the significant biomarkers in prostate cancer with lymph node metastasis, which is not well established. PMID:22654636
Lyubetsky, Vassily; Gershgorin, Roman; Gorbunov, Konstantin
2017-12-06
Chromosome structure is a very limited model of the genome including the information about its chromosomes such as their linear or circular organization, the order of genes on them, and the DNA strand encoding a gene. Gene lengths, nucleotide composition, and intergenic regions are ignored. Although highly incomplete, such structure can be used in many cases, e.g., to reconstruct phylogeny and evolutionary events, to identify gene synteny, regulatory elements and promoters (considering highly conserved elements), etc. Three problems are considered; all assume unequal gene content and the presence of gene paralogs. The distance problem is to determine the minimum number of operations required to transform one chromosome structure into another and the corresponding transformation itself including the identification of paralogs in two structures. We use the DCJ model which is one of the most studied combinatorial rearrangement models. Double-, sesqui-, and single-operations as well as deletion and insertion of a chromosome region are considered in the model; the single ones comprise cut and join. In the reconstruction problem, a phylogenetic tree with chromosome structures in the leaves is given. It is necessary to assign the structures to inner nodes of the tree to minimize the sum of distances between terminal structures of each edge and to identify the mutual paralogs in a fairly large set of structures. A linear algorithm is known for the distance problem without paralogs, while the presence of paralogs makes it NP-hard. If paralogs are allowed but the insertion and deletion operations are missing (and special constraints are imposed), the reduction of the distance problem to integer linear programming is known. Apparently, the reconstruction problem is NP-hard even in the absence of paralogs. The problem of contigs is to find the optimal arrangements for each given set of contigs, which also includes the mutual identification of paralogs. 
We proved that these problems can be reduced to integer linear programming formulations, which allows an algorithm to redefine the problems to implement a very special case of the integer linear programming tool. The results were tested on synthetic and biological samples. Three well-known problems were reduced to a very special case of integer linear programming, which is a new method of their solutions. Integer linear programming is clearly among the main computational methods and, as generally accepted, is fast on average; in particular, computation systems specifically targeted at it are available. The challenges are to reduce the size of the corresponding integer linear programming formulations and to incorporate a more detailed biological concept in our model of the reconstruction.
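The simplest case mentioned in the abstract above, the distance problem without paralogs, can be illustrated with a short sketch. It assumes equal gene content and circular chromosomes only, and uses the standard adjacency-graph result that the DCJ distance then equals N - C (number of genes minus number of cycles). The function names `adjacencies` and `dcj_distance` are hypothetical, and this is not the authors' integer linear programming formulation, which also handles unequal gene content, paralogs, insertions and deletions.

```python
def adjacencies(genome):
    """Map each gene extremity to its neighbour in a genome of circular chromosomes.

    A genome is a list of chromosomes; a chromosome is a list of signed
    integers (the sign gives the strand). Extremities are (gene, 't') for
    the tail and (gene, 'h') for the head.
    """
    partner = {}
    for chrom in genome:
        n = len(chrom)
        for i in range(n):
            a, b = chrom[i], chrom[(i + 1) % n]
            # Reading left to right, +g is tail->head and -g is head->tail.
            right = (abs(a), 'h') if a > 0 else (abs(a), 't')
            left = (abs(b), 't') if b > 0 else (abs(b), 'h')
            partner[right] = left
            partner[left] = right
    return partner


def dcj_distance(genome_a, genome_b):
    """DCJ distance N - C for circular genomes with identical gene sets (assumed)."""
    pa, pb = adjacencies(genome_a), adjacencies(genome_b)
    if set(pa) != set(pb):
        raise ValueError("sketch assumes equal gene content and no paralogs")
    seen = set()
    cycles = 0
    for start in pa:
        if start in seen:
            continue
        cycles += 1
        x = start
        # Alternate between an adjacency of A and an adjacency of B
        # until the walk closes on the starting extremity.
        while x not in seen:
            seen.add(x)
            y = pa[x]
            seen.add(y)
            x = pb[y]
    n_genes = len(pa) // 2
    return n_genes - cycles


if __name__ == "__main__":
    print(dcj_distance([[1, 2, 3]], [[1, -2, 3]]))  # one inversion
    print(dcj_distance([[1, 2]], [[1], [2]]))       # one fission
```

The walk visits each extremity once, so the running time is linear in the number of genes, in line with the linear algorithm the abstract refers to for this restricted case.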
Wu, Zhi-Jun; Li, Xing-Hui; Liu, Zhi-Wei; Li, Hui; Wang, Yong-Xin; Zhuang, Jing
2016-02-01
Tea plant [Camellia sinensis (L.) O. Kuntze] is a leaf-type healthy non-alcoholic beverage crop, which has been widely introduced worldwide. Tea is rich in various secondary metabolites, which are important for human health. However, varied climate and complex geography have posed challenges for tea plant survival. The WRKY gene family in plants is a large transcription factor family that is involved in biological processes related to stress defenses, development, and metabolite synthesis. Therefore, identification and analysis of WRKY family transcription factors in tea plant have profound significance. In the present study, 50 putative C. sinensis WRKY proteins (CsWRKYs) with complete WRKY domain were identified and divided into three Groups (Group I-III) on the basis of phylogenetic analysis results. The distribution of WRKY family transcription factors among plantae, fungi, and protozoa showed that the number of WRKY genes increased in higher plants, whereas the number of these genes did not correspond to the evolutionary relationships of different species. Structural feature and annotation analysis results showed that CsWRKY proteins contained WRKYGQK/WRKYGKK domains and C2H2/C2HC-type zinc-finger structure: D-X18-R-X1-Y-X2-C-X4-7-C-X23-H motif; CsWRKY proteins may be associated with the biological processes of abiotic and biotic stresses, tissue development, and hormone and secondary metabolite biosynthesis. Analysis under temperature stresses suggested that the candidate CsWRKY genes were involved in responses to extreme temperatures. The current study established an extensive overview of the WRKY family transcription factors in tea plant. This study also provided a global survey of CsWRKY transcription factors and a foundation for future functional identification and molecular breeding.
Rajesh, P S; Rai, V Ravishankar
2014-01-03
The aiiA homologous gene is known to encode the AHL-lactonase enzyme, which hydrolyzes the N-acylhomoserine lactone (AHL) quorum sensing signaling molecules produced by Gram negative bacteria. In this study, the degradation of AHL molecules by cell-free lysate of endophytic Enterobacter species was determined. The percentage of quorum quenching was confirmed and quantified by HPLC method (p<0.0001). Amplification and sequence BLAST analysis showed the presence of an aiiA homologous gene in endophytic Enterobacter asburiae VT65, Enterobacter aerogenes VT66 and Enterobacter ludwigii VT70 strains. Sequence alignment analysis revealed the presence of two zinc binding sites, the "HXHXDH" motif, as well as a tyrosine residue at position 194. Based on a known template available at Swiss-Model, a putative tertiary structure of AHL-lactonase was constructed. The results showed that novel endophytic strains of the genus Enterobacter encode a novel aiiA homologous gene and point to its structural importance for future study. Copyright © 2013 Elsevier Inc. All rights reserved.
Gu, Ganyu; Smith, Leif; Liu, Aixin; Lu, Shi-En
2011-01-01
A striking feature of Burkholderia contaminans strain MS14 is the production of a glycolipopeptide named occidiofungin. Occidiofungin has a broad range of antifungal activities against plant and animal pathogens. In this study, a complete covalent structure characterization and identification of the whole genomic DNA region for the occidiofungin gene (ocf) cluster are described. Discovery of the presence of 2,4-diaminobutyric acid and 3-chloro-β-hydroxytyrosine and elucidation of the structure of a novel C18 fatty amino acid residue have been achieved. In addition, seven additional putative open reading frames (the genes from ocfI to ocfN [ocfI-N] and ORF16) were identified. Transcription of all the putative genes ocfI-N identified in the region except ORF16 was regulated by both ambR1 and ambR2. Elucidation of the structure and the ocf gene cluster provides insight into the biosynthesis of occidiofungin and promotes future aims at understanding the biosynthetic machinery. This work provides new avenues for optimizing the production and synthesis of structural analogs of occidiofungin. PMID:21742901
Lessons learned from gene identification studies in Mendelian epilepsy disorders
Hardies, Katia; Weckhuysen, Sarah; De Jonghe, Peter; Suls, Arvid
2016-01-01
Next-generation sequencing (NGS) technologies are now routinely used for gene identification in Mendelian disorders. Setting up cost-efficient NGS projects and managing the large number of variants remain, however, a challenging job. Here we provide insights into the decision-making processes before and after the use of NGS in gene identification studies. Genetic factors are thought to have a role in ~70% of all epilepsies, and a variety of inheritance patterns have been described for seizure-associated gene defects. We therefore chose epilepsy as disease model and selected 35 NGS studies that focused on patients with a Mendelian epilepsy disorder. The strategies used for gene identification and their respective outcomes were reviewed. High-throughput NGS strategies have led to the identification of several new epilepsy-causing genes, enlarging our knowledge on both known and novel pathomechanisms. NGS findings have furthermore extended the awareness of phenotypical and genetic heterogeneity. By discussing recent studies we illustrate: (I) the power of NGS for gene identification in Mendelian disorders, (II) the accelerating pace at which this field evolves, and (III) the considerations that have to be made when performing NGS studies. Despite the enormous rise in gene discovery over the last decade, many patients and families included in gene identification studies still remain without a molecular diagnosis; hence, further genetic research is warranted. On the basis of successful NGS studies in epilepsy, we discuss general approaches to guide human geneticists and clinicians in setting up cost-efficient gene identification NGS studies. PMID:26603999
Himeno, Kohei; Rosengren, K Johan; Inoue, Tomoko; Perez, Rodney H; Colgrave, Michelle L; Lee, Han Siean; Chan, Lai Y; Henriques, Sónia Troeira; Fujita, Koji; Ishibashi, Naoki; Zendo, Takeshi; Wilaipun, Pongtep; Nakayama, Jiro; Leelawatcharamas, Vichien; Jikuya, Hiroyuki; Craik, David J; Sonomoto, Kenji
2015-08-11
Enterocin NKR-5-3B, one of the multiple bacteriocins produced by Enterococcus faecium NKR-5-3, is a 64-amino acid novel circular bacteriocin that displays broad-spectrum antimicrobial activity. Here we report the identification, characterization, and three-dimensional nuclear magnetic resonance solution structure determination of enterocin NKR-5-3B. Enterocin NKR-5-3B is characterized by four helical segments that enclose a compact hydrophobic core, which together with its circular backbone impart high stability and structural integrity. We also report the corresponding structural gene, enkB, that encodes an 87-amino acid precursor peptide that undergoes a yet to be described enzymatic processing that involves adjacent cleavage and ligation of Leu(24) and Trp(87) to yield the mature (circular) enterocin NKR-5-3B.
Verma, Jitendra Kumar; Wardhan, Vijay; Singh, Deepali; Chakraborty, Subhra; Chakraborty, Niranjan
2018-01-01
Architectural proteins play key roles in genome construction and regulate the expression of many genes, albeit the modulation of genome plasticity by these proteins is largely unknown. A critical screening of the architectural proteins in five crop species, viz., Oryza sativa, Zea mays, Sorghum bicolor, Cicer arietinum, and Vitis vinifera, and in the model plant Arabidopsis thaliana along with evolutionary relevant species such as Chlamydomonas reinhardtii, Physcomitrella patens, and Amborella trichopoda, revealed 9, 20, 10, 7, 7, 6, 1, 4, and 4 Alba (acetylation lowers binding affinity) genes, respectively. A phylogenetic analysis of the genes and of their counterparts in other plant species indicated evolutionary conservation and diversification. In each group, the structural components of the genes and motifs showed significant conservation. The chromosomal location of the Alba genes of rice (OsAlba), showed an unequal distribution on 8 of its 12 chromosomes. The expression profiles of the OsAlba genes indicated a distinct tissue-specific expression in the seedling, vegetative, and reproductive stages. The quantitative real-time PCR (qRT-PCR) analysis of the OsAlba genes confirmed their stress-inducible expression under multivariate environmental conditions and phytohormone treatments. The evaluation of the regulatory elements in 68 Alba genes from the 9 species studied led to the identification of conserved motifs and overlapping microRNA (miRNA) target sites, suggesting the conservation of their function in related proteins and a divergence in their biological roles across species. The 3D structure and the prediction of putative ligands and their binding sites for OsAlba proteins offered a key insight into the structure–function relationship. These results provide a comprehensive overview of the subtle genetic diversification of the OsAlba genes, which will help in elucidating their functional role in plants. PMID:29597290
Eckert, Andrew J.; van Heerwaarden, Joost; Wegrzyn, Jill L.; Nelson, C. Dana; Ross-Ibarra, Jeffrey; González-Martínez, Santíago C.; Neale, David. B.
2010-01-01
Natural populations of forest trees exhibit striking phenotypic adaptations to diverse environmental gradients, thereby making them appealing subjects for the study of genes underlying ecologically relevant phenotypes. Here, we use a genome-wide data set of single nucleotide polymorphisms genotyped across 3059 functional genes to study patterns of population structure and identify loci associated with aridity across the natural range of loblolly pine (Pinus taeda L.). Overall patterns of population structure, as inferred using principal components and Bayesian cluster analyses, were consistent with three genetic clusters likely resulting from expansions out of Pleistocene refugia located in Mexico and Florida. A novel application of association analysis, which removes the confounding effects of shared ancestry on correlations between genetic and environmental variation, identified five loci correlated with aridity. These loci were primarily involved with abiotic stress response to temperature and drought. A unique set of 24 loci was identified as FST outliers on the basis of the genetic clusters identified previously and after accounting for expansions out of Pleistocene refugia. These loci were involved with a diversity of physiological processes. Identification of nonoverlapping sets of loci highlights the fundamental differences implicit in the use of either method and suggests a pluralistic, yet complementary, approach to the identification of genes underlying ecologically relevant phenotypes. PMID:20439779
Chen, Tianbao; Gagliardo, Ron; Walker, Brian; Zhou, Mei; Shaw, Chris
2005-12-01
Phylloxin is a novel prototype antimicrobial peptide from the skin of Phyllomedusa bicolor. Here, we describe parallel identification and sequencing of phylloxin precursor transcript (mRNA) and partial gene structure (genomic DNA) from the same sample of lyophilized skin secretion using our recently-described cloning technique. The open-reading frame of the phylloxin precursor was identical in nucleotide sequence to that previously reported and alignment with the nucleotide sequence derived from genomic DNA indicated the presence of a 175 bp intron located in a near identical position to that found in the dermaseptins. The highly-conserved structural organization of skin secretion peptide genes in P. bicolor can thus be extended to include that encoding phylloxin (plx). These data further reinforce our assertion that application of the described methodology can provide robust genomic/transcriptomic/peptidomic data without the need for specimen sacrifice.
Identification of essential genes and synthetic lethal gene combinations in Escherichia coli K-12.
Mori, Hirotada; Baba, Tomoya; Yokoyama, Katsushi; Takeuchi, Rikiya; Nomura, Wataru; Makishi, Kazuichi; Otsuka, Yuta; Dose, Hitomi; Wanner, Barry L
2015-01-01
Here we describe the systematic identification of single genes and gene pairs whose knockout causes lethality in Escherichia coli K-12. During construction of a precise single-gene knockout library of E. coli K-12, we identified 328 essential gene candidates for growth in complex (LB) medium. Upon establishment of the Keio single-gene deletion library, we undertook the development of the ASKA single-gene deletion library carrying a different antibiotic resistance. In addition, we developed tools for identification of synthetic lethal gene combinations by systematic construction of double-gene knockout mutants. We introduce these methods herein.
Theis, Torsten; Skurray, Ronald A; Brown, Melissa H
2007-08-01
Quantitative real-time PCR (qRT-PCR) has become a routine technique for gene expression analysis. Housekeeping genes are customarily used as endogenous references for the relative quantification of genes of interest. The aim of this study was to develop a quantitative real-time PCR assay to analyze gene expression in multidrug resistant Staphylococcus aureus in the presence of cationic lipophilic substrates of multidrug transport proteins. Eleven different housekeeping genes were analyzed for their expression stability in the presence of a range of concentrations of four structurally different antimicrobial compounds. This analysis demonstrated that the genes rho, pyk and proC were least affected by rhodamine 6G and crystal violet, whereas fabD, tpiA and gyrA or fabD, proC and pyk were stably expressed in cultures grown in the presence of ethidium or berberine, respectively. Subsequently, these housekeeping genes were used as internal controls to analyze expression of the multidrug transport protein QacA and its transcriptional regulator QacR in the presence of the aforementioned compounds. Expression of qacA was induced by all four compounds, whereas qacR expression was found to be unaffected, reduced or enhanced. This study demonstrates that staphylococcal gene expression, including housekeeping genes previously used to normalize qRT-PCR data, is affected by growth in the presence of different antimicrobial compounds. Thus, identification of suitable genes usable as a control set requires rigorous testing. Identification of such a set enabled its use as internal standards for accurate quantification of transcripts of the qac multidrug resistance system from S. aureus grown under different inducing conditions. Moreover, the qRT-PCR assay presented in this study may also be applied to gene expression studies of other multidrug transporters from S. aureus.
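As a rough illustration of relative quantification against a set of reference genes, the sketch below applies the widely used 2^-ddCt calculation, normalizing the target gene to the mean Ct of several housekeeping genes. The abstract does not state which quantification formula the authors used, so this method, the function name and all Ct values are assumptions chosen for illustration; the calculation also assumes roughly 100% amplification efficiency.

```python
from statistics import mean


def relative_expression(ct_target_treated, ct_refs_treated,
                        ct_target_control, ct_refs_control):
    """Fold change of a target gene by the 2^-ddCt method.

    The normalizer is the arithmetic mean Ct of the reference genes
    measured in the same sample (hypothetical helper, not from the source).
    """
    d_ct_treated = ct_target_treated - mean(ct_refs_treated)
    d_ct_control = ct_target_control - mean(ct_refs_control)
    dd_ct = d_ct_treated - d_ct_control
    return 2 ** (-dd_ct)


# Made-up Ct values: the target crosses threshold 3 cycles earlier in the
# treated culture while the three reference genes stay unchanged.
fold = relative_expression(
    ct_target_treated=22.0, ct_refs_treated=[18.0, 19.0, 20.0],
    ct_target_control=25.0, ct_refs_control=[18.0, 19.0, 20.0],
)
print(fold)
```

If the reference genes themselves shift under treatment, the normalizer moves and the fold change is distorted, which is why the stability of the control set has to be tested per compound.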
Identification, Classification, and Expression Analysis of GRAS Gene Family in Malus domestica
Fan, Sheng; Zhang, Dong; Gao, Cai; Zhao, Ming; Wu, Haiqin; Li, Youmei; Shen, Yawen; Han, Mingyu
2017-01-01
GRAS genes encode plant-specific transcription factors that play important roles in plant growth and development. However, little is known about the GRAS gene family in apple. In this study, 127 GRAS genes were identified in the apple (Malus domestica Borkh.) genome and named MdGRAS1 to MdGRAS127 according to their chromosomal locations. The chemical characteristics, gene structures and evolutionary relationships of the MdGRAS genes were investigated. The 127 MdGRAS genes could be grouped into eight subfamilies based on their structural features and phylogenetic relationships. Further analysis of gene structures, segmental and tandem duplication, gene phylogeny and tissue-specific expression using the ArrayExpress database indicated their diversification in quantity, structure and function. We further examined the expression pattern of MdGRAS genes during apple flower induction with transcriptome sequencing. Eight MdGRAS genes with higher expression (MdGRAS6, 26, 28, 44, 53, 64, 107, and 122) emerged. Further quantitative reverse transcription PCR indicated that the eight candidate genes showed distinct expression patterns among different tissues (leaves, stems, flowers, buds, and fruits). The transcription levels of the eight genes were also investigated under various flowering-related treatments (GA3, 6-BA, and sucrose) and in different flowering varieties (Yanfu No. 6 and Nagafu No. 2). All were affected by flowering-related conditions and showed different expression levels. Changes in response to these hormone- or sugar-related treatments indicated their potential involvement during apple flower induction. Taken together, our results provide rich resources for studying GRAS genes and their potential clues in genetic improvement of apple flowering, which enriches biological theories of GRAS genes in apple and their involvement in flower induction of fruit trees. PMID:28503152
Xu, Zongda; Zhang, Qixiang; Sun, Lidan; Du, Dongliang; Cheng, Tangren; Pan, Huitang; Yang, Weiru; Wang, Jia
2014-10-01
MADS-box genes encode transcription factors that play crucial roles in plant development, especially in flower and fruit development. To gain insight into this gene family in Prunus mume, an important ornamental and fruit plant in East Asia, and to elucidate their roles in flower organ determination and fruit development, we performed a genome-wide identification, characterisation and expression analysis of MADS-box genes in this Rosaceae tree. In this study, 80 MADS-box genes were identified in P. mume and categorised into MIKC, Mα, Mβ, Mγ and Mδ groups based on gene structures and phylogenetic relationships. The MIKC group could be further classified into 12 subfamilies. The FLC subfamily was absent in P. mume and the six tandemly arranged DAM genes might experience a species-specific evolution process in P. mume. The MADS-box gene family might experience an evolution process from MIKC genes to Mδ genes to Mα, Mβ and Mγ genes. The expression analysis suggests that P. mume MADS-box genes have diverse functions in P. mume development and the functions of duplicated genes diverged after the duplication events. In addition to its involvement in the development of female gametophytes, type I genes also play roles in male gametophytes development. In conclusion, this study adds to our understanding of the roles that the MADS-box genes played in flower and fruit development and lays a foundation for selecting candidate genes for functional studies in P. mume and other species. Furthermore, this study also provides a basis to study the evolution of the MADS-box family.
2013-01-01
predicted amino acid sequences of the three encoded BmAChEs were no more closely related to one another than AChEs from different organisms and their...solely on nucleotide and amino acid sequence similarity; however, the cholinesterase gene family contains a number of related enzymes and structural...acetylcholinesterase of P. papatasi was cloned, sequenced, and expressed in the baculovirus system to generate a recombinant enzyme for biochemical
Arvind, Akanksha; Jain, Vaibhav; Saravanan, Parameswaran; Mohan, C Gopi
2013-12-01
Mycobacterium tuberculosis (Mtb) is a causative agent of tuberculosis (TB) disease, which has affected approximately 2 billion people worldwide. Due to the emergence of resistance towards the existing drugs, discovery of new anti-TB drugs is an important global healthcare challenge. To address this problem, there is an urgent need to identify new drug targets in Mtb. In the present study, the subtractive genomics approach has been employed for the identification of new drug targets against TB. Screening the Mtb proteome against the Database of Essential Genes (DEG) and the human proteome resulted in the identification of 60 key proteins which have no eukaryotic counterparts. Critical analysis of these proteins using the Kyoto Encyclopedia of Genes and Genomes (KEGG) metabolic pathways database revealed the uridine monophosphate kinase (UMPK) enzyme as a potential drug target for developing novel anti-TB drugs. A homology model of Mtb-UMPK was constructed for the first time on the basis of the crystal structure of E. coli-UMPK, in order to understand its structure-function relationships, which would in turn facilitate structure-based inhibitor design. Furthermore, a structural similarity search was carried out using UTP, the physiological inhibitor of Mtb-UMPK, to virtually screen the ZINC database. Retrieved hits were further screened by implementing several filters such as ADME and toxicity, followed by molecular docking. Finally, on the basis of the Glide docking score and the mode of binding, 6 putative leads were identified as inhibitors of this enzyme which can potentially emerge as future drugs for the treatment of TB.
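The subtractive step described in this abstract can be pictured as a simple set operation: keep pathogen proteins that match an essential-gene database and discard any with a host homolog. The sketch below is only a schematic under that assumption; the protein identifiers are invented, and in practice the two hit sets would come from sequence similarity searches whose thresholds the abstract does not give.

```python
def subtractive_filter(pathogen_proteins, essential_hits, host_hits):
    """Return proteins that are essential and have no host homolog.

    pathogen_proteins: iterable of protein identifiers
    essential_hits:    identifiers with a match in an essential-gene database
    host_hits:         identifiers with a significant match in the host proteome
    (Hypothetical helper; the inputs stand in for similarity-search results.)
    """
    return sorted(
        p for p in pathogen_proteins
        if p in essential_hits and p not in host_hits
    )


# Invented identifiers for illustration only.
proteome = ["protA", "protB", "protC", "protD"]
essential = {"protA", "protB", "protD"}
human_like = {"protB"}

candidates = subtractive_filter(proteome, essential, human_like)
print(candidates)
```

Here protC is dropped because it is not essential and protB because it resembles a host protein, leaving protA and protD as candidate targets for pathway analysis.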
The multifunctional nuclear pore complex: a platform for controlling gene expression
Ptak, Christopher; Aitchison, John D.; Wozniak, Richard W.
2014-01-01
In addition to their established roles in nucleocytoplasmic transport, the intimate association of nuclear pore complexes (NPCs) with chromatin has long led to speculation that these structures influence peripheral chromatin structure and regulate gene expression. These ideas have their roots in morphological observations; however, recent years have seen the identification of physical interactions between NPCs, chromatin, and the transcriptional machinery. Key insights into the molecular functions of specific NPC proteins have uncovered roles for these proteins in transcriptional activation and elongation, mRNA processing, as well as chromatin structure and localization. Here, we review recent studies that provide further molecular detail on the role of specific NPC components as distinct platforms for these chromatin-dependent processes. PMID:24657998
Transcriptome profile of a bovine respiratory disease pathogen: Mannheimia haemolytica PHL213
2012-01-01
Background: Computational methods for structural gene annotation have propelled gene discovery but face certain drawbacks with regards to prokaryotic genome annotation. Identification of transcriptional start sites, demarcating overlapping gene boundaries, and identifying regulatory elements such as small RNA are not accurate using these approaches. In this study, we re-visit the structural annotation of Mannheimia haemolytica PHL213, a bovine respiratory disease pathogen. M. haemolytica is one of the causative agents of bovine respiratory disease that results in about $3 billion annual losses to the cattle industry. We used RNA-Seq and analyzed the data using freely-available computational methods and resources. The aim was to identify previously unannotated regions of the genome using RNA-Seq based expression profile to complement the existing annotation of this pathogen. Results: Using the Illumina Genome Analyzer, we generated 9,055,826 reads (average length ~76 bp) and aligned them to the reference genome using Bowtie. The transcribed regions were analyzed using SAMTOOLS and custom Perl scripts in conjunction with BLAST searches and available gene annotation information. The single nucleotide resolution map enabled the identification of 14 novel protein coding regions as well as 44 potential novel sRNA. The basal transcription profile revealed that 2,506 of the 2,837 annotated regions were expressed in vitro, at 95.25% coverage, representing all broad functional gene categories in the genome. The expression profile also helped identify 518 potential operon structures involving 1,086 co-expressed pairs. We also identified 11 proteins with mutated/alternate start codons. Conclusions: The application of RNA-Seq based transcriptome profiling to structural gene annotation helped correct existing annotation errors and identify potential novel protein coding regions and sRNA. We used computational tools to predict regulatory elements such as promoters and terminators associated with the novel expressed regions for further characterization of these novel functional elements. Our study complements the existing structural annotation of Mannheimia haemolytica PHL213 based on experimental evidence. Given the role of sRNA in virulence gene regulation and stress response, potential novel sRNA described in this study can form the framework for future studies to determine the role of sRNA, if any, in M. haemolytica pathogenesis. PMID:23046475
Structural and functional partitioning of bread wheat chromosome 3B.
Choulet, Frédéric; Alberti, Adriana; Theil, Sébastien; Glover, Natasha; Barbe, Valérie; Daron, Josquin; Pingault, Lise; Sourdille, Pierre; Couloux, Arnaud; Paux, Etienne; Leroy, Philippe; Mangenot, Sophie; Guilhot, Nicolas; Le Gouis, Jacques; Balfourier, Francois; Alaux, Michael; Jamilloux, Véronique; Poulain, Julie; Durand, Céline; Bellec, Arnaud; Gaspin, Christine; Safar, Jan; Dolezel, Jaroslav; Rogers, Jane; Vandepoele, Klaas; Aury, Jean-Marc; Mayer, Klaus; Berges, Hélène; Quesneville, Hadi; Wincker, Patrick; Feuillet, Catherine
2014-07-18
We produced a reference sequence of the 1-gigabase chromosome 3B of hexaploid bread wheat. By sequencing 8452 bacterial artificial chromosomes in pools, we assembled a sequence of 774 megabases carrying 5326 protein-coding genes, 1938 pseudogenes, and 85% of transposable elements. The distribution of structural and functional features along the chromosome revealed partitioning correlated with meiotic recombination. Comparative analyses indicated high wheat-specific inter- and intrachromosomal gene duplication activities that are potential sources of variability for adaption. In addition to providing a better understanding of the organization, function, and evolution of a large and polyploid genome, the availability of a high-quality sequence anchored to genetic maps will accelerate the identification of genes underlying important agronomic traits. Copyright © 2014, American Association for the Advancement of Science.
2013-01-01
Background: MicroRNAs (miRNAs) are small non-coding RNAs that play critical roles in regulating post-transcriptional gene expression. Gall midges encompass a large group of insects that are of economic importance and also possess fascinating biological traits. The gall midge Mayetiola destructor, commonly known as the Hessian fly, is a destructive pest of wheat and model organism for studying gall midge biology and insect – host plant interactions. Results: In this study, we systematically analyzed miRNAs from the Hessian fly. Deep-sequencing a Hessian fly larval transcriptome led to the identification of 89 miRNA species that are either identical or very similar to known miRNAs from other insects, and 184 novel miRNAs that have not been reported from other species. A genome-wide search through a draft Hessian fly genome sequence identified a total of 611 putative miRNA-encoding genes based on sequence similarity and the existence of a stem-loop structure for miRNA precursors. Analysis of the 611 putative genes revealed a striking feature: the dramatic expansion of several miRNA gene families. The largest family contained 91 genes that encoded 20 different miRNAs. Microarray analyses revealed the expression of miRNA genes was strictly regulated during Hessian fly larval development and abundance of many miRNA genes were affected by host genotypes. Conclusion: The identification of a large number of miRNAs for the first time from a gall midge provides a foundation for further studies of miRNA functions in gall midge biology and behavior. The dramatic expansion of identical or similar miRNAs provides a unique system to study functional relations among miRNA iso-genes as well as changes in sequence specificity due to small changes in miRNAs and in their mRNA targets. These results may also facilitate the identification of miRNA genes for potential pest control through transgenic approaches. PMID:23496979
USDA-ARS's Scientific Manuscript database
Fumonisins are polyketide mycotoxins produced by the maize pathogen Fusarium verticillioides and are associated with multiple human and animal diseases. A fumonisin biosynthetic pathway has been proposed, but structures of early pathway intermediates have not been demonstrated. The F. verticillioide...
The opportunities and challenges of large-scale molecular approaches to songbird neurobiology
Mello, C.V.; Clayton, D.F.
2014-01-01
High-throughput methods for analyzing genome structure and function are having a large impact in songbird neurobiology. Methods include genome sequencing and annotation, comparative genomics, DNA microarrays and transcriptomics, and the development of a brain atlas of gene expression. Key emerging findings include the identification of complex transcriptional programs active during singing, the robust brain expression of non-coding RNAs, evidence of profound variations in gene expression across brain regions, and the identification of molecular specializations within song production and learning circuits. Current challenges include the statistical analysis of large datasets, effective genome curations, the efficient localization of gene expression changes to specific neuronal circuits and cells, and the dissection of behavioral and environmental factors that influence brain gene expression. The field requires efficient methods for comparisons with organisms like chicken, which offer important anatomical, functional and behavioral contrasts. As sequencing costs plummet, opportunities emerge for comparative approaches that may help reveal evolutionary transitions contributing to vocal learning, social behavior and other properties that make songbirds such compelling research subjects. PMID:25280907
Oh, Sunghee; Song, Seongho
2017-01-01
In gene expression profiling, the data analysis pipeline comprises four major downstream tasks, namely (1) identification of differential expression; (2) clustering of co-expression patterns; (3) classification of sample subtypes; and (4) detection of genetic regulatory networks, which are performed after preprocessing procedures such as normalization. More specifically, temporal dynamic gene expression data have an inherent feature: two neighboring time points (previous and current state) are highly correlated with each other, in contrast to static expression data, in which samples are assumed to be independent individuals. In this chapter, we demonstrate how HMMs and hierarchical Bayesian modeling methods capture the horizontal time dependency structures in time series expression profiles by focusing on the identification of differential expression. In addition, the differentially expressed genes and transcript variant isoforms detected over time in these core prerequisite steps can be further applied to the detection of genetic regulatory networks, to comprehensively uncover dynamic repertoires from a systems biology perspective within a coupled framework.
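To make the idea of time dependency between neighboring time points concrete, the following sketch computes the likelihood of a short discretized expression series under a two-state hidden Markov model using the standard forward algorithm. It is not the chapter's model: the two hidden states (not differentially expressed, differentially expressed), the low/high discretization and every probability are assumptions made up for illustration.

```python
def forward_likelihood(obs, start, trans, emit):
    """Likelihood of an observation sequence under a discrete HMM.

    obs:   list of observation indices, one per time point
    start: start[s]     = P(first hidden state is s)
    trans: trans[p][s]  = P(state s at time t | state p at time t-1)
    emit:  emit[s][o]   = P(observation o | hidden state s)
    """
    n_states = len(start)
    # Initialise with the first time point.
    alpha = [start[s] * emit[s][obs[0]] for s in range(n_states)]
    # Propagate through the remaining time points.
    for o in obs[1:]:
        alpha = [
            sum(alpha[p] * trans[p][s] for p in range(n_states)) * emit[s][o]
            for s in range(n_states)
        ]
    return sum(alpha)


# Hypothetical parameters: state 0 = not DE, state 1 = DE;
# observation 0 = low expression, 1 = high expression.
start = [0.5, 0.5]
trans = [[0.9, 0.1], [0.1, 0.9]]   # states tend to persist over time
emit = [[0.8, 0.2], [0.2, 0.8]]

steady = forward_likelihood([1, 1, 1], start, trans, emit)
flicker = forward_likelihood([1, 0, 1], start, trans, emit)
print(steady, flicker)
```

Because the transition matrix favors staying in the same state, a consistently high series is more likely than one that flips between time points, which is the kind of temporal structure a model assuming independent samples cannot express.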
Noor Uddin, Gazi Md; Larsen, Marianne Halberg; Christensen, Henrik; Aarestrup, Frank M; Phu, Tran Minh; Dalsgaard, Anders
2015-01-01
Probiotics are increasingly used in aquaculture to control diseases and improve feed digestion and pond water quality; however, little is known about the antimicrobial resistance properties of such probiotic bacteria and to what extent they may contribute to the development of bacterial resistance in aquaculture ponds. Concerns have been raised that the declared information on probiotic product labels is incorrect and that information on bacterial composition is often missing. We therefore evaluated seven probiotics commonly used in Vietnamese shrimp culture for their bacterial species content, phenotypic antimicrobial resistance and associated transferable resistance genes. Bacterial species were established by 16S rRNA sequence analysis of 125 representative bacterial isolates. MIC testing was done for a range of antimicrobials, and whole genome sequencing of six multiple antimicrobial resistant Bacillus spp. was used to identify resistance genes and genetic elements associated with horizontal gene transfer. Thirteen bacterial species declared on the probiotic products could not be identified and 11 non-declared Bacillus spp. were identified. Although our culture-based isolation and identification may have missed a few bacterial species present in the tested products, this would represent only a minor bias; future studies may apply culture-independent identification methods like pyrosequencing. Only 6/60 isolates were resistant to more than four antimicrobials and whole genome sequencing showed that they contained macrolide (ermD), tetracycline (tetL), phenicol (fexA) and trimethoprim (dfrD, dfrG and dfrK) resistance genes, but not known structures associated with horizontal gene transfer. Probiotic bacterial strains used in Vietnamese shrimp culture seem to contribute very limited types and numbers of resistance genes compared to the naturally occurring bacterial species in aquaculture environments. Approval procedures of probiotic products must be strengthened through scientifically based efficacy trials, and product labels should allow identification of individual bacterial strains and inform the farmer on specific purpose, dosage and correct application measures.
New Genes and New Insights from Old Genes: Update on Alzheimer Disease
Ringman, John M.; Coppola, Giovanni
2013-01-01
Purpose of Review: This article discusses the current status of knowledge regarding the genetic basis of Alzheimer disease (AD) with a focus on clinically relevant aspects. Recent Findings: The genetic architecture of AD is complex, as it includes multiple susceptibility genes and likely nongenetic factors. Rare but highly penetrant autosomal dominant mutations explain a small minority of the cases but have allowed tremendous advances in understanding disease pathogenesis. The identification of a strong genetic risk factor, APOE, reshaped the field and introduced the notion of genetic risk for AD. More recently, large-scale genome-wide association studies are adding to the picture a number of common variants with very small effect sizes. Large-scale resequencing studies are expected to identify additional risk factors, including rare susceptibility variants and structural variation. Summary: Genetic assessment is currently of limited utility in clinical practice because of the low frequency (Mendelian mutations) or small effect size (common risk factors) of the currently known susceptibility genes. However, genetic studies are identifying with confidence a number of novel risk genes, and this will further our understanding of disease biology and possibly the identification of therapeutic targets. PMID:23558482
Oladnabi, Morteza; Musante, Luciana; Larti, Farzaneh; Hu, Hao; Abedini, Seyedeh Sedigheh; Wienker, Thomas; Ropers, Hans Hilger; Kahrizi, Kimia; Najmabadi, Hossein
2015-03-01
Knowledge of the genes responsible for intellectual disability, particularly autosomal recessive forms, is rapidly expanding. The increasing number of such genes shows great heterogeneity and supports the hypothesis that the human genome may contain over 2000 causative genes with a critical role in brain development. Since 2004, we have applied genome-wide SNP genotyping and next-generation sequencing in large consanguineous Iranian families with intellectual disability, to identify the genes harboring disease-causing mutations. The current study paved the way for identification of responsible genes in two unrelated Iranian families. We found two novel nonsense mutations, p.C77* and p.Q115*, in the calpain catalytic domain of CAPN10, which is a cysteine protease known to be involved in pathogenesis of noninsulin-dependent diabetes mellitus. Another different mutation in this gene (p.S138_R139ins5) has previously been reported in an Iranian family. All of these patients have common clinical features in spite of specific brain structural abnormalities on MRI. Different mutations in CAPN10 have thus been found in three independent Iranian families. These results strongly support the possible role of CAPN10 in human brain development. Altogether, we propose CAPN10 as a promising candidate gene for intellectual disability, which should be considered in diagnostic gene panels.
Wei, Ling; Yang, Chao; Tao, Wenjing; Wang, Deshou
2016-01-01
The Sox transcription factor family is characterized by the presence of a Sry-related high-mobility group (HMG) box and plays important roles in various biological processes in animals, including sex determination and differentiation, and the development of multiple organs. In this study, 27 Sox genes were identified in the genome of the Nile tilapia (Oreochromis niloticus), and were classified into seven groups. The members of each group of the tilapia Sox genes exhibited a relatively conserved exon-intron structure. Comparative analysis showed that the Sox gene family has undergone an expansion in tilapia and other teleost fishes following their whole genome duplication, and group K only exists in teleosts. Transcriptome-based analysis demonstrated that most of the tilapia Sox genes presented stage-specific and/or sex-dimorphic expression during gonadal development, and six of the group B Sox genes were specifically expressed in the adult brain. Our results provide a better understanding of gene structure and spatio-temporal expression of the Sox gene family in tilapia, and will be useful for further deciphering the roles of the Sox genes during sex determination and gonadal development in teleosts. PMID:26907269
Wei, Ling; Yang, Chao; Tao, Wenjing; Wang, Deshou
2016-02-23
The Sox transcription factor family is characterized by the presence of a Sry-related high-mobility group (HMG) box and plays important roles in various biological processes in animals, including sex determination and differentiation, and the development of multiple organs. In this study, 27 Sox genes were identified in the genome of the Nile tilapia (Oreochromis niloticus), and were classified into seven groups. The members of each group of the tilapia Sox genes exhibited a relatively conserved exon-intron structure. Comparative analysis showed that the Sox gene family has undergone an expansion in tilapia and other teleost fishes following their whole genome duplication, and group K only exists in teleosts. Transcriptome-based analysis demonstrated that most of the tilapia Sox genes presented stage-specific and/or sex-dimorphic expression during gonadal development, and six of the group B Sox genes were specifically expressed in the adult brain. Our results provide a better understanding of gene structure and spatio-temporal expression of the Sox gene family in tilapia, and will be useful for further deciphering the roles of the Sox genes during sex determination and gonadal development in teleosts.
Zhang, Xiaoni; Wang, Qijian; Yang, Shaozong; Lin, Shengnan; Bao, Manzhu; Wu, Quanshu; Wang, Caiyun; Fu, Xiaopeng
2018-01-01
Dianthus is a large genus containing many species with high ornamental economic value. Extensive breeding strategies permitted an exploration of an improvement in the quality of cultivated carnation, particularly in flowers. However, little is known on the molecular mechanisms of flower development in carnation. Here, we report the identification and description of MADS-box genes in carnation (DcaMADS) with a focus on those involved in flower development and organ identity determination. In this study, 39 MADS-box genes were identified from the carnation genome and transcriptome by the phylogenetic analysis. These genes were categorized into four subgroups (30 MIKCc, two MIKC*, two Mα, and five Mγ). The MADS-box domain, gene structure, and conserved motif compositions of the carnation MADS genes were analysed. Meanwhile, the expression of DcaMADS genes were significantly different in stems, leaves, and flower buds. Further studies were carried out for exploring the expression of DcaMADS genes in individual flower organs, and some crucial DcaMADS genes correlated with their putative function were validated. Finally, a new expression pattern of DcaMADS genes in flower organs of carnation was provided: sepal (three class E genes and two class A genes), petal (two class B genes, two class E genes, and one SHORT VEGETATIVE PHASE (SVP)), stamen (two class B genes, two class E genes, and two class C), styles (two class E genes and two class C), and ovary (two class E genes, two class C, one AGAMOUS-LIKE 6 (AGL6), one SEEDSTICK (STK), one B sister, one SVP, and one Mα). This result proposes a model in floral organ identity of carnation and it may be helpful to further explore the molecular mechanism of flower organ identity in carnation. PMID:29617274
Zhang, Xiaoni; Wang, Qijian; Yang, Shaozong; Lin, Shengnan; Bao, Manzhu; Bendahmane, Mohammed; Wu, Quanshu; Wang, Caiyun; Fu, Xiaopeng
2018-04-04
Dianthus is a large genus containing many species with high ornamental economic value. Extensive breeding strategies permitted an exploration of an improvement in the quality of cultivated carnation, particularly in flowers. However, little is known on the molecular mechanisms of flower development in carnation. Here, we report the identification and description of MADS-box genes in carnation (DcaMADS) with a focus on those involved in flower development and organ identity determination. In this study, 39 MADS-box genes were identified from the carnation genome and transcriptome by the phylogenetic analysis. These genes were categorized into four subgroups (30 MIKCc, two MIKC*, two Mα, and five Mγ). The MADS-box domain, gene structure, and conserved motif compositions of the carnation MADS genes were analysed. Meanwhile, the expression of DcaMADS genes were significantly different in stems, leaves, and flower buds. Further studies were carried out for exploring the expression of DcaMADS genes in individual flower organs, and some crucial DcaMADS genes correlated with their putative function were validated. Finally, a new expression pattern of DcaMADS genes in flower organs of carnation was provided: sepal (three class E genes and two class A genes), petal (two class B genes, two class E genes, and one SHORT VEGETATIVE PHASE (SVP)), stamen (two class B genes, two class E genes, and two class C), styles (two class E genes and two class C), and ovary (two class E genes, two class C, one AGAMOUS-LIKE 6 (AGL6), one SEEDSTICK (STK), one B sister, one SVP, and one Mα). This result proposes a model in floral organ identity of carnation and it may be helpful to further explore the molecular mechanism of flower organ identity in carnation.
[Hydrophidae identification through analysis on Cyt b gene barcode].
Liao, Li-xi; Zeng, Ke-wu; Tu, Peng-fei
2015-08-01
Hydrophidae, one of the precious traditional Chinese medicines, is generally preserved dry to prevent spoilage, but the drying process alters its appearance and makes the species hard to identify by morphology. Identification by gene barcode analysis, a newer technique for species identification, avoids this problem. The gene barcodes of 6 species of Hydrophidae, such as Lapemis hardwickii, were acquired through DNA extraction and gene sequencing. The barcodes were then aligned, and their identification efficiency was tested by BLAST. Our results revealed that the barcode sequences achieved high identification efficiency and showed an obvious difference between intra- and inter-species variation. Together, these results indicate that Cyt b DNA barcoding can confirm the identification of Hydrophidae.
Wang, Dan; Zhao, Jietang; Hu, Bing; Li, Jiaqi; Qin, Yaqi; Chen, Linhuan; Qin, Yonghua
2018-01-01
Sucrose phosphate synthase (SPS, EC 2.4.1.14) is a key enzyme that regulates sucrose biosynthesis in plants. SPS is encoded by different gene families which display differential expression patterns and functional divergence. Genome-wide identification and expression analyses of SPS gene families have been performed in Arabidopsis, rice, and sugarcane, but a comprehensive analysis of the SPS gene family in Litchi chinensis Sonn. has not yet been reported. In the current study, four SPS genes (LcSPS1, LcSPS2, LcSPS3, and LcSPS4) were isolated from litchi. The genomic organization analysis indicated that the four litchi SPS genes have very similar exon-intron structures. The phylogenetic tree showed that LcSPS1-4 were grouped into different SPS families (LcSPS1 and LcSPS2 in the A family, LcSPS3 in the B family, and LcSPS4 in the C family). LcSPS1 and LcSPS4 were strongly expressed in the flowers, while LcSPS3 was most highly expressed in mature leaves. RT-qPCR results showed that LcSPS genes were expressed differentially during aril development between cultivars with different hexose/sucrose ratios. A higher level of expression of LcSPS genes was detected in Wuheli, which accumulates more sucrose in the aril at maturity. The tissue- and developmental stage-specific expression of LcSPS1-4 genes uncovered in this study increases our understanding of the important roles played by these genes in litchi fruits. PMID:29473005
Singh, Vinay Kumar; Ambwani, Sonu; Marla, Soma; Kumar, Anil
2009-10-23
We describe the development of a user friendly tool that would assist in the retrieval of information relating to Cry genes in transgenic crops. The tool also helps in detection of transformed Cry genes from Bacillus thuringiensis present in transgenic plants by providing suitable designed primers for PCR identification of these genes. The tool designed based on relational database model enables easy retrieval of information from the database with simple user queries. The tool also enables users to access related information about Cry genes present in various databases by interacting with different sources (nucleotide sequences, protein sequence, sequence comparison tools, published literature, conserved domains, evolutionary and structural data). http://insilicogenomics.in/Cry-btIdentifier/welcome.html.
Wittenberger, T; Schaller, H C; Hellebrand, S
2001-03-30
We have developed a comprehensive expressed sequence tag database search method and used it for the identification of new members of the G-protein coupled receptor superfamily. Our approach proved to be especially useful for the detection of expressed sequence tag sequences that do not encode conserved parts of a protein, making it an ideal tool for the identification of members of divergent protein families or of protein parts without conserved domain structures in the expressed sequence tag database. At least 14 of the expressed sequence tags found with this strategy are promising candidates for new putative G-protein coupled receptors. Here, we describe the sequence and expression analysis of five new members of this receptor superfamily, namely GPR84, GPR86, GPR87, GPR90 and GPR91. We also studied the genomic structure and chromosomal localization of the respective genes applying in silico methods. A cluster of six closely related G-protein coupled receptors was found on the human chromosome 3q24-3q25. It consists of four orphan receptors (GPR86, GPR87, GPR91, and H963), the purinergic receptor P2Y1, and the uridine 5'-diphosphoglucose receptor KIAA0001. It seems likely that these receptors evolved from a common ancestor and therefore might have related ligands. In conclusion, we describe a data mining procedure that proved to be useful for the identification and first characterization of new genes and is well applicable for other gene families. Copyright 2001 Academic Press.
Genome-Wide Identification and Expression Analysis of WRKY Gene Family in Capsicum annuum L.
Diao, Wei-Ping; Snyder, John C; Wang, Shu-Bin; Liu, Jin-Bing; Pan, Bao-Gui; Guo, Guang-Jun; Wei, Ge
2016-01-01
The WRKY family of transcription factors is one of the most important families of plant transcriptional regulators, with members regulating multiple biological processes, especially defense against biotic and abiotic stresses. However, little information is available about WRKYs in pepper (Capsicum annuum L.). The recent release of completely assembled genome sequences of pepper allowed us to perform a genome-wide investigation of pepper WRKY proteins. In the present study, a total of 71 WRKY genes were identified in the pepper genome. According to structural features of their encoded proteins, the pepper WRKY genes (CaWRKY) were classified into three main groups, with the second group further divided into five subgroups. Genome mapping analysis revealed that CaWRKY genes were enriched on four chromosomes, especially on chromosome 1, and 15.5% of the family members were tandemly duplicated genes. A phylogenetic tree was constructed based on WRKY domain sequences derived from pepper and Arabidopsis. The expression of 21 selected CaWRKY genes in response to seven different biotic and abiotic stresses (salt, heat shock, drought, Phytophthora capsici, SA, MeJA, and ABA) was evaluated by quantitative RT-PCR. Some CaWRKYs were highly expressed and up-regulated by stress treatment. Our results will provide a platform for functional identification and molecular breeding studies of WRKY genes in pepper.
Yang, Yongchao; Wang, Yongqi; Mo, Yanling; Zhang, Ruimin; Zhang, Yong; Ma, Jianxiang; Wei, Chunhua
2018-01-01
Despite identification of WRKY family genes in numerous plant species, little is known about WRKY genes in watermelon, one of the most economically important fruit crops around the world. Here, we identified a total of 63 putative WRKY genes in watermelon and classified them into three major groups (I-III) and five subgroups (IIa-IIe) in group II. The structure analysis indicated that ClWRKYs with different WRKY domains or motifs may play different roles by regulating respective target genes. The expression of ClWRKYs in different tissues indicates that they are involved in the growth and development of various tissues. Furthermore, the diverse responses of ClWRKYs to drought, salt, or cold stress suggest that they positively or negatively affect plant tolerance to various abiotic stresses. In addition, the altered expression patterns of ClWRKYs in response to phytohormones such as ABA, SA, MeJA, and ETH imply the occurrence of complex cross-talk between ClWRKYs and plant hormone signals in regulating plant physiological and biological processes. Taken together, our findings provide valuable clues to further explore the function and regulatory mechanisms of ClWRKY genes in watermelon growth, development, and adaptation to environmental stresses. PMID:29338040
Yang, Xiaozhen; Li, Hao; Yang, Yongchao; Wang, Yongqi; Mo, Yanling; Zhang, Ruimin; Zhang, Yong; Ma, Jianxiang; Wei, Chunhua; Zhang, Xian
2018-01-01
Despite identification of WRKY family genes in numerous plant species, little is known about WRKY genes in watermelon, one of the most economically important fruit crops around the world. Here, we identified a total of 63 putative WRKY genes in watermelon and classified them into three major groups (I-III) and five subgroups (IIa-IIe) in group II. The structure analysis indicated that ClWRKYs with different WRKY domains or motifs may play different roles by regulating respective target genes. The expression of ClWRKYs in different tissues indicates that they are involved in the growth and development of various tissues. Furthermore, the diverse responses of ClWRKYs to drought, salt, or cold stress suggest that they positively or negatively affect plant tolerance to various abiotic stresses. In addition, the altered expression patterns of ClWRKYs in response to phytohormones such as ABA, SA, MeJA, and ETH imply the occurrence of complex cross-talk between ClWRKYs and plant hormone signals in regulating plant physiological and biological processes. Taken together, our findings provide valuable clues to further explore the function and regulatory mechanisms of ClWRKY genes in watermelon growth, development, and adaptation to environmental stresses.
Identification of Surprisingly Diverse Type IV Pili, across a Broad Range of Gram-Positive Bacteria
Roos, David S.; Pohlschröder, Mechthild
2011-01-01
Background In Gram-negative bacteria, type IV pili (TFP) have long been known to play important roles in such diverse biological phenomena as surface adhesion, motility, and DNA transfer, with significant consequences for pathogenicity. More recently it became apparent that Gram-positive bacteria also express type IV pili; however, little is known about the diversity and abundance of these structures in Gram-positives. Computational tools for automated identification of type IV pilins are not currently available. Results To assess TFP diversity in Gram-positive bacteria and facilitate pilin identification, we compiled a comprehensive list of putative Gram-positive pilins encoded by operons containing highly conserved pilus biosynthetic genes (pilB, pilC). A surprisingly large number of species were found to contain multiple TFP operons (pil, com and/or tad). The N-terminal sequences of predicted pilins were exploited to develop PilFind, a rule-based algorithm for genome-wide identification of otherwise poorly conserved type IV pilins in any species, regardless of their association with TFP biosynthetic operons (http://signalfind.org). Using PilFind to scan 53 Gram-positive genomes (encoding >187,000 proteins), we identified 286 candidate pilins, including 214 in operons containing TFP biosynthetic genes (TBG+ operons). Although trained on Gram-positive pilins, PilFind identified 55 of 58 manually curated Gram-negative pilins in TBG+ operons, as well as 53 additional pilin candidates in operons lacking biosynthetic genes in ten species (>38,000 proteins), including 27 of 29 experimentally verified pilins. False positive rates appear to be low, as PilFind predicted only four pilin candidates in eleven bacterial species (>13,000 proteins) lacking TFP biosynthetic genes. Conclusions We have shown that Gram-positive bacteria contain a highly diverse set of type IV pili. 
PilFind can be an invaluable tool to study bacterial cellular processes known to involve type IV pilus-like structures. Its use in combination with other currently available computational tools should improve the accuracy of predicting the subcellular localization of bacterial proteins. PMID:22216142
The MB2 gene family of Plasmodium species has a unique combination of S1 and GTP-binding domains
Romero, Lisa C; Nguyen, Thanh V; Deville, Benoit; Ogunjumo, Oluwasanmi; James, Anthony A
2004-01-01
Background Identification and characterization of novel Plasmodium gene families is necessary for developing new anti-malarial therapeutics. The products of the Plasmodium falciparum gene, MB2, were shown previously to have a stage-specific pattern of subcellular localization and proteolytic processing. Results Genes homologous to MB2 were identified in five additional parasite species, P. knowlesi, P. gallinaceum, P. berghei, P. yoelii, and P. chabaudi. Sequence comparisons among the MB2 gene products reveal amino acid conservation of structural features, including putative S1 and GTP-binding domains, and putative signal peptides and nuclear localization signals. Conclusions The combination of domains is unique to this gene family and indicates that MB2 genes comprise a novel family and therefore may be a good target for drug development. PMID:15222903
Identification of the centromere-specific histone H3 variant in Lotus japonicus.
Tek, Ahmet L; Kashihara, Kazunari; Murata, Minoru; Nagaki, Kiyotaka
2014-03-15
The centromere is a structurally and functionally specialized region present on every eukaryotic chromosome. Lotus japonicus is a model legume species for which there is very limited information on the centromere structure. Here we cloned and characterized the L. japonicus homolog of the centromere-specific histone H3 gene (LjCenH3) encoding a 159-amino acid protein. Using an Agrobacterium-based transformation system, LjCenH3 tagged with a green fluorescent protein was transferred into L. japonicus cells. The centromeric position of LjCENH3 protein was revealed on L. japonicus metaphase chromosomes by an immunofluorescence assay. The identification of LjCenH3 as a critical centromere landmark could pave the way for a better understanding of centromere structure in this model and other agriculturally important legume species. Published by Elsevier B.V.
Li, Fupeng; Hao, Chaoyun; Yan, Lin; Wu, Baoduo; Qin, Xiaowei; Lai, Jianxiong; Song, Yinghui
2015-09-01
In higher plants, sucrose synthase (Sus, EC 2.4.1.13) is widely considered a key enzyme involved in sucrose metabolism. Although several paralogous genes encoding different isozymes of Sus have been identified and characterized in multiple plant genomes, to date detailed information about the Sus genes is lacking for cacao. This study reports the identification of six novel Sus genes from the economically important cacao tree. Analyses of the gene structure and phylogeny of the Sus genes demonstrated evolutionary conservation in the Sus family across cacao and other plant species. The expression of cacao Sus genes was investigated via real-time PCR in various tissues and in different developmental phases of leaf, flower bud and pod. The Sus genes exhibited distinct but partially redundant expression profiles in cacao, with TcSus1, TcSus5 and TcSus6 being the predominant genes in the bark with phloem, and TcSus2 predominantly expressed in the seed during the stereotype stage. TcSus3 and TcSus4 were detected at significantly higher levels in the pod husk and seed coat during pod development, and showed development-dependent expression profiles in the cacao pod. These results provide new insights into the evolution of the cacao Sus gene family, and basic information that will assist in elucidating its functions.
Molla, Mijanur R; Böser, Alexander; Rana, Akshita; Schwarz, Karina; Levkin, Pavel A
2018-04-18
Efficient delivery of nucleic acids into cells is of great interest in the field of cell biology and gene therapy. Despite a lot of research, transfection efficiency and structural diversity of gene-delivery vectors are still limited. A better understanding of the structure-function relationship of gene delivery vectors is also essential for the design of novel and intelligent delivery vectors, efficient in "difficult-to-transfect" cells and in vivo clinical applications. Most of the existing strategies for the synthesis of gene-delivery vectors require multiple steps and lengthy procedures. Here, we demonstrate a facile, three-component one-pot synthesis of a combinatorial library of 288 structurally diverse lipid-like molecules termed "lipidoids" via a thiolactone ring opening reaction. This strategy introduces the possibility to synthesize lipidoids with hydrophobic tails containing both unsaturated bonds and reducible disulfide groups. The whole synthesis and purification are convenient, extremely fast, and can be accomplished within a few hours. Screening of the produced lipidoids using HEK293T cells without addition of helper lipids resulted in identification of highly stable liposomes demonstrating ∼95% transfection efficiency with low toxicity.
Wu, Lin; van Peer, Arend; Song, Wenhua; Wang, Hong; Chen, Mingjie; Tan, Qi; Song, Chunyan; Zhang, Meiyan; Bao, Dapeng
2013-12-01
During the life cycle of heterothallic tetrapolar Agaricomycetes such as Lentinula edodes (Berk.) Pegler, the mating type system, composed of unlinked A and B loci, plays a vital role in controlling sexual development and resulting formation of the fruit body. L. edodes is produced worldwide for consumption and medicinal purposes, and understanding its sexual development is therefore of great importance. A considerable amount of mating type factors has been indicated over the past decades but few genes have actually been identified, and no complete genetic structures of L. edodes B mating-type loci are available. In this study, we cloned the matB regions from two mating compatible L. edodes strains, 939P26 and 939P42. Four pheromone receptors were identified on each new matB region, together with three and four pheromone precursor genes in the respective strains. Gene polymorphism, phylogenetic analysis and distribution of pheromone receptors and pheromone precursors clearly indicate a bipartite matB locus, each sublocus containing a pheromone receptor and one or two pheromone precursors. Detailed sequence comparisons of genetic structures between the matB regions of strains 939P42, 939P26 and a previously reported strain SUP2 further supported this model and allowed identification of the B mating type subloci borders. Mating studies confirmed the control of B mating by the identified pheromone receptors and pheromones in L. edodes. © 2013 Elsevier B.V. All rights reserved.
Exploiting proteomic data for genome annotation and gene model validation in Aspergillus niger.
Wright, James C; Sugden, Deana; Francis-McIntyre, Sue; Riba-Garcia, Isabel; Gaskell, Simon J; Grigoriev, Igor V; Baker, Scott E; Beynon, Robert J; Hubbard, Simon J
2009-02-04
Proteomic data is a potentially rich, but arguably unexploited, data source for genome annotation. Peptide identifications from tandem mass spectrometry provide prima facie evidence for gene predictions and can discriminate over a set of candidate gene models. Here we apply this to the recently sequenced Aspergillus niger fungal genome from the Joint Genome Institute (JGI) and another predicted protein set from another A. niger sequence. Tandem mass spectra (MS/MS) were acquired from 1D gel electrophoresis bands and searched against all available gene models using Average Peptide Scoring (APS) and reverse database searching to produce confident identifications at an acceptable false discovery rate (FDR). 405 identified peptide sequences were mapped to 214 different A. niger genomic loci to which 4093 predicted gene models clustered, 2872 of which contained the mapped peptides. Interestingly, 13 (6%) of these loci either had no preferred predicted gene model or the genome annotators' chosen "best" model for that genomic locus was not found to be the most parsimonious match to the identified peptides. The peptides identified also boosted confidence in predicted gene structures spanning 54 introns from different gene models. This work highlights the potential of integrating experimental proteomics data into genomic annotation pipelines much as expressed sequence tag (EST) data has been. A comparison with the published genome from another strain of A. niger sequenced by DSM showed that a number of the gene models or proteins with proteomics evidence did not occur in both genomes, further highlighting the utility of the method.
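The reverse-database (target-decoy) filtering step mentioned in this abstract can be illustrated with a short sketch. This is not the authors' Average Peptide Scoring pipeline: the function name `accepted_at_fdr`, the `(score, is_decoy)` input format, and the decoys/targets FDR estimate are assumptions chosen for illustration only.

```python
def accepted_at_fdr(psms, max_fdr=0.01):
    """Return target scores accepted at the requested FDR.

    psms: list of (score, is_decoy) tuples (hypothetical input format);
    higher scores are better. FDR at a cutoff is estimated as
    decoys / targets among all matches at or above that cutoff.
    """
    ranked = sorted(psms, key=lambda x: x[0], reverse=True)
    targets = decoys = 0
    best_cut = 0  # number of top-ranked matches that can be accepted
    for i, (_, is_decoy) in enumerate(ranked, start=1):
        if is_decoy:
            decoys += 1
        else:
            targets += 1
        fdr = decoys / targets if targets else 1.0
        if fdr <= max_fdr:
            best_cut = i  # keep the deepest cutoff that still meets the FDR
    return [score for score, is_decoy in ranked[:best_cut] if not is_decoy]


if __name__ == "__main__":
    matches = [(10, False), (9, False), (8, True), (7, False), (6, False)]
    print(accepted_at_fdr(matches, max_fdr=0.25))
```

Choosing the deepest cutoff that still satisfies the threshold, rather than stopping at the first decoy, lets a single high-scoring decoy be tolerated when enough target matches follow it.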
Fields, Randall R.; Zhou, Guimei; Huang, Dali; Davis, Jack R.; Möller, Claes; Jacobson, Samuel G.; Kimberling, William J.; Sumegi, Janos
2002-01-01
Usher syndrome type III is an autosomal recessive disorder characterized by progressive sensorineural hearing loss, vestibular dysfunction, and retinitis pigmentosa. The disease gene was localized to 3q25 and recently was identified by positional cloning. In the present study, we have revised the structure of the USH3 gene, including a new translation start site, 5′ untranslated region, and a transcript encoding a 232–amino acid protein. The mature form of the protein is predicted to contain three transmembrane domains and 204 residues. We have found four new disease-causing mutations, including one that appears to be relatively common in the Ashkenazi Jewish population. We have also identified mouse (chromosome 3) and rat (chromosome 2) orthologues, as well as two human paralogues on chromosomes 4 and 10. PMID:12145752
Identification, distribution and molecular evolution of the pacifastin gene family in Metazoa
Breugelmans, Bert; Simonet, Gert; van Hoef, Vincent; Van Soest, Sofie; Broeck, Jozef Vanden
2009-01-01
Background Members of the pacifastin family are serine peptidase inhibitors, most of which are produced as multi domain precursor proteins. Structural and biochemical characteristics of insect pacifastin-like peptides have been studied intensively, but only one inhibitor has been functionally characterised. Recent sequencing projects of metazoan genomes have created an unprecedented opportunity to explore the distribution, evolution and functional diversification of pacifastin genes in the animal kingdom. Results A large scale in silico data mining search led to the identification of 83 pacifastin members with 284 inhibitor domains, distributed over 55 species from three metazoan phyla. In contrast to previous assumptions, members of this family were also found in other phyla than Arthropoda, including the sister phylum Onychophora and the 'primitive', non-bilaterian Placozoa. In Arthropoda, pacifastin members were found to be distributed among insect families of nearly all insect orders and for the first time also among crustacean species other than crayfish and the Chinese mitten crab. Contrary to precursors from Crustacea, the majority of insect pacifastin members contain dibasic cleavage sites, indicative for posttranslational processing into numerous inhibitor peptides. Whereas some insect species have lost the pacifastin gene, others were found to have several (often clustered) paralogous genes. Amino acids corresponding to the reactive site or involved in the folding of the inhibitor domain were analysed as a basis for the biochemical properties. Conclusion The absence of the pacifastin gene in some insect genomes and the extensive gene expansion in other insects are indicative for the rapid (adaptive) evolution of this gene family. 
In addition, differential processing mechanisms and a high variability in the reactive site residues and the inner core interactions contribute to a broad functional diversification of inhibitor peptides, indicating wide ranging roles in different physiological processes. Based on the observation of a pacifastin gene in Placozoa, it can be hypothesized that the ancestral pacifastin gene has occurred before the divergence of bilaterian animals. However, considering differences in gene structure between the placozoan and other pacifastin genes and the existence of a 'pacifastin gene gap' between Placozoa and Onychophora/Arthropoda, it cannot be excluded that the pacifastin signature originated twice by convergent evolution. PMID:19435517
Jing, Zhaobin; Liu, Zhande
2018-04-01
As one of the largest transcription factor families in plants, WRKY transcription factors play important roles in various biotic and abiotic stress responses. To date, WRKY genes in kiwifruit (Actinidia spp.) remain poorly understood. In our study, a total of 97 AcWRKY genes have been identified in the kiwifruit genome. An overview of these AcWRKY genes is presented, including the phylogenetic relationships, exon-intron structures, synteny and expression profiles. The 97 AcWRKY genes were divided into three groups based on the conserved WRKY domain. Synteny analysis indicated that segmental duplication events contributed to the expansion of the kiwifruit AcWRKY family. In addition, the synteny analysis between kiwifruit and Arabidopsis suggested that some of the AcWRKY genes were derived from common ancestors before the divergence of these two species. Conserved motifs outside the AcWRKY domain may reflect their functional conservation. Genome-wide segmental and tandem duplications were found, which may contribute to the expansion of AcWRKY genes. Furthermore, the analysis of selected AcWRKY genes showed a variety of expression patterns in five different organs as well as during biotic and abiotic stresses. The genome-wide identification and characterization of kiwifruit WRKY transcription factors provides insight into the evolutionary history and is a useful resource for further functional analyses of kiwifruit.
Pydiura, Nikolay; Pirko, Yaroslav; Galinousky, Dmitry; Postovoitova, Anastasiia; Yemets, Alla; Kilchevsky, Aleksandr; Blume, Yaroslav
2018-06-08
Flax (Linum usitatissimum L.) is a valuable food and fiber crop cultivated for its quality fiber and seed oil. α-, β-, γ-tubulins and actins are the main structural proteins of the cytoskeleton. α- and γ-tubulin and actin genes have not been characterized yet in the flax genome. In this study, we have identified 6 α-tubulin genes, 13 β-tubulin genes, 2 γ-tubulin genes, and 15 actin genes in the flax genome and analysed the phylogenetic relationships between flax and A. thaliana tubulin and actin genes. Six α-tubulin genes are represented by 3 paralogous pairs; among 13 β-tubulin genes, 7 different isotypes can be distinguished, 6 of which are encoded by two paralogous genes each. γ-tubulin is represented by a paralogous pair of genes, one of which may not be functional. Fifteen actin genes represent 7 paralogous pairs - 7 actin isotypes and a sequentially duplicated copy of one of the genes of one of the isotypes. Exon-intron structure analysis has shown intron length polymorphism within the β-tubulin genes and intron number variation among the α-tubulin genes: 3 or 4 introns are found in two or four genes, respectively. Intron positioning occurs at conserved sites, as observed in numerous other plant species. Flax actin genes show both intron length polymorphisms and variation in the number of introns, which may be 2 or 3. These data will be useful to support further studies on the specificity, functioning, regulation and evolution of the flax cytoskeleton proteins. This article is protected by copyright. All rights reserved.
USDA-ARS's Scientific Manuscript database
Triacylglycerols (TAG) are the major molecules of energy storage in eukaryotes. TAG are packed in subcellular structures called oil bodies or lipid droplets. Oleosins (OLE) are the major proteins in plant oil bodies. Multiple isoforms of OLE are present in plants such as tung tree (Vernicia fordii),...
Conazoles are triazole- or imidazole-containing fungicides used in agriculture and medicine. Using transcriptomic analysis of rat thyroid tissues exposed to either tumorigenic or non-tumorigenic structurally related conazoles, we identified new findings on thyroid gene expressio...
Wang, Jian; Xie, Dong; Lin, Hongfei; Yang, Zhihao; Zhang, Yijia
2012-06-21
Many biological processes recognize in particular the importance of protein complexes, and various computational approaches have been developed to identify complexes from protein-protein interaction (PPI) networks. However, high false-positive rate of PPIs leads to challenging identification. A protein semantic similarity measure is proposed in this study, based on the ontology structure of Gene Ontology (GO) terms and GO annotations to estimate the reliability of interactions in PPI networks. Interaction pairs with low GO semantic similarity are removed from the network as unreliable interactions. Then, a cluster-expanding algorithm is used to detect complexes with core-attachment structure on filtered network. Our method is applied to three different yeast PPI networks. The effectiveness of our method is examined on two benchmark complex datasets. Experimental results show that our method performed better than other state-of-the-art approaches in most evaluation metrics. The method detects protein complexes from large scale PPI networks by filtering GO semantic similarity. Removing interactions with low GO similarity significantly improves the performance of complex identification. The expanding strategy is also effective to identify attachment proteins of complexes.
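The filtering step described in this abstract, removing interaction pairs with low GO semantic similarity before complex detection, can be sketched as follows. The paper's measure is based on the GO ontology structure, which the abstract does not specify in detail; the sketch substitutes a plain Jaccard overlap of GO annotation sets as a stand-in. The names `jaccard` and `filter_interactions`, the protein labels, and the 0.3 threshold are all illustrative assumptions.

```python
def jaccard(a, b):
    """Jaccard overlap of two annotation sets (stand-in similarity, not the paper's measure)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def filter_interactions(edges, annotations, threshold=0.3):
    """Keep only interactions whose endpoints have similarity >= threshold.

    edges: iterable of (protein_a, protein_b) pairs.
    annotations: dict mapping protein -> set of GO term identifiers.
    Proteins without annotations get an empty set, so their edges are dropped.
    """
    kept = []
    for p, q in edges:
        sim = jaccard(annotations.get(p, set()), annotations.get(q, set()))
        if sim >= threshold:
            kept.append((p, q))
    return kept


if __name__ == "__main__":
    # Hypothetical proteins and GO terms, for illustration only.
    go = {
        "P1": {"GO:0000001", "GO:0000002", "GO:0000003"},
        "P2": {"GO:0000002", "GO:0000003"},
        "P3": {"GO:0000009"},
    }
    ppi = [("P1", "P2"), ("P1", "P3"), ("P2", "P4")]
    print(filter_interactions(ppi, go))
```

A cluster-expanding step would then run on the filtered edge list; that step is not sketched here because the abstract gives no detail on how cores and attachments are chosen.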
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kou, Qiang; Wu, Si; Tolić, Nikola
Motivation: Although proteomics has rapidly developed in the past decade, researchers are still in the early stage of exploring the world of complex proteoforms, which are protein products with various primary structure alterations resulting from gene mutations, alternative splicing, post-translational modifications, and other biological processes. Proteoform identification is essential to mapping proteoforms to their biological functions as well as discovering novel proteoforms and new protein functions. Top-down mass spectrometry is the method of choice for identifying complex proteoforms because it provides a “bird’s eye view” of intact proteoforms. The combinatorial explosion of various alterations on a protein may result in billions of possible proteoforms, making proteoform identification a challenging computational problem. Results: We propose a new data structure, called the mass graph, for efficient representation of proteoforms and design mass graph alignment algorithms. We developed TopMG, a mass graph-based software tool for proteoform identification by top-down mass spectrometry. Experiments on top-down mass spectrometry data sets showed that TopMG outperformed existing methods in identifying complex proteoforms.
Yan, Bo; Neilson, Karen M.; Ranganathan, Ramya; Maynard, Thomas; Streit, Andrea; Moody, Sally A.
2014-01-01
Background Six1 plays an important role in the development of several vertebrate organs, including cranial sensory placodes, somites and kidney. Although Six1 mutations cause one form of Branchio-Otic Syndrome (BOS), the responsible gene in many patients has not been identified; genes that act downstream of Six1 are potential BOS candidates. Results We sought to identify novel genes expressed during placode, somite and kidney development by comparing gene expression between control and Six1-expressing ectodermal explants. The expression patterns of 19 of the significantly up-regulated and 11 of the significantly down-regulated genes were assayed from cleavage to larval stages. 28/30 genes are expressed in the otocyst, a structure that is functionally disrupted in BOS, and 26/30 genes are expressed in the nephric mesoderm, a structure that is functionally disrupted in the related Branchio-Otic-Renal (BOR) syndrome. We also identified the chick homologues of 5 genes and show that they have conserved expression patterns. Conclusions Of the 30 genes selected for expression analyses, all are expressed at many of the developmental times and appropriate tissues to be regulated by Six1. Many have the potential to play a role in the disruption of hearing and kidney function seen in BOS/BOR patients. PMID:25403746
Clustering Algorithms: Their Application to Gene Expression Data
Oyelade, Jelili; Isewon, Itunuoluwa; Oladipupo, Funke; Aromolaran, Olufemi; Uwoghiren, Efosa; Ameh, Faridah; Achas, Moses; Adebiyi, Ezekiel
2016-01-01
Gene expression data hide vital information required to understand the biological process that takes place in a particular organism in relation to its environment. Deciphering the hidden patterns in gene expression data offers a tremendous opportunity to strengthen the understanding of functional genomics. The complexity of biological networks and the volume of genes present increase the challenges of comprehending and interpreting the resulting mass of data, which consists of millions of measurements; these data also exhibit vagueness, imprecision, and noise. Therefore, the use of clustering techniques is a first step toward addressing these challenges, and it is essential in the data mining process to reveal natural structures and identify interesting patterns in the underlying data. The clustering of gene expression data has proven useful in revealing the natural structure inherent in gene expression data, understanding gene functions, cellular processes, and subtypes of cells, mining useful information from noisy data, and understanding gene regulation. The other benefit of clustering gene expression data is the identification of homology, which is very important in vaccine design. This review examines the various clustering algorithms applicable to gene expression data in order to discover and provide useful knowledge of the appropriate clustering technique that will guarantee stability and a high degree of accuracy in its analysis procedure. PMID:27932867
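As a concrete instance of the kind of technique this review surveys, the sketch below runs a minimal k-means on toy expression profiles. The abstract does not single out any one algorithm, so k-means is chosen here purely for illustration; the function names, the fixed starting centroids, and the toy data are assumptions.

```python
def nearest(profile, centroids):
    """Index of the centroid closest to a profile (squared Euclidean distance)."""
    return min(
        range(len(centroids)),
        key=lambda k: sum((a - b) ** 2 for a, b in zip(profile, centroids[k])),
    )


def kmeans(profiles, centroids, iterations=10):
    """Minimal k-means: alternate assignment and centroid update.

    profiles: list of equal-length tuples (one expression profile per gene).
    centroids: starting centroids, supplied by the caller so the run is deterministic.
    Returns (labels, centroids).
    """
    centroids = [tuple(c) for c in centroids]
    for _ in range(iterations):
        labels = [nearest(p, centroids) for p in profiles]
        for k in range(len(centroids)):
            members = [p for p, lab in zip(profiles, labels) if lab == k]
            if members:  # leave an empty cluster's centroid unchanged
                centroids[k] = tuple(sum(col) / len(members) for col in zip(*members))
    return [nearest(p, centroids) for p in profiles], centroids


if __name__ == "__main__":
    # Toy profiles: two genes with low expression, two with high expression.
    genes = [(0.0, 0.0), (0.0, 1.0), (10.0, 10.0), (10.0, 11.0)]
    labels, centers = kmeans(genes, centroids=[(0.0, 0.0), (10.0, 10.0)])
    print(labels, centers)
```

Real expression matrices are usually normalized first, and the number of clusters and the starting centroids strongly affect the result, which is part of why comparing algorithms for stability matters.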
Salcedo, Raúl García; Olano, Carlos; Gómez, Cristina; Fernández, Rogelio; Braña, Alfredo F; Méndez, Carmen; de la Calle, Fernando; Salas, José A
2016-02-22
PM100117 and PM100118 are glycosylated polyketides with remarkable antitumor activity, which derive from the marine symbiotic actinobacterium Streptomyces caniferus GUA-06-05-006A. Structurally, PM100117 and PM100118 are composed of a macrocyclic lactone, three deoxysugar units and a naphthoquinone (NQ) chromophore that shows a clear structural similarity to menaquinone. Whole-genome sequencing of S. caniferus GUA-06-05-006A has enabled the identification of the PM100117 and PM100118 biosynthesis gene cluster, which has been characterized on the basis of bioinformatics and genetic engineering data. The products of four genes show high identity to proteins involved in the biosynthesis of menaquinone via futalosine. Deletion of one of these genes led to a decrease in PM100117 and PM100118 production, and to the accumulation of several derivatives lacking NQ. Likewise, five additional genes have been genetically characterized as being involved in the biosynthesis of this moiety. Moreover, the generation of a mutant in a gene coding for a putative cytochrome P450 has led to the production of PM100117 and PM100118 structural analogues showing an enhanced in vitro cytotoxic activity relative to the parental products. Although a number of compounds structurally related to PM100117 and PM100118 have been discovered, this is, to our knowledge, the first insight reported into their biosynthesis. The structural resemblance of the NQ moiety to menaquinone, and the presence in the cluster of four putative menaquinone biosynthetic genes, suggest a connection between the biosynthesis pathways of both compounds. The availability of the PM100117 and PM100118 biosynthetic gene cluster will pave the way to the combinatorial engineering of more derivatives.
Tang, Huiwu; Zheng, Xingmei; Li, Chuliang; Xie, Xianrong; Chen, Yuanling; Chen, Letian; Zhao, Xiucai; Zheng, Huiqi; Zhou, Jiajian; Ye, Shan; Guo, Jingxin; Liu, Yao-Guang
2017-01-01
New gene origination is a major source of genomic innovations that confer phenotypic changes and biological diversity. Generation of new mitochondrial genes in plants may cause cytoplasmic male sterility (CMS), which can promote outcrossing and increase fitness. However, how mitochondrial genes originate and evolve in structure and function remains unclear. The rice Wild Abortive type of CMS is conferred by the mitochondrial gene WA352c (previously named WA352) and has been widely exploited in hybrid rice breeding. Here, we reconstruct the evolutionary trajectory of WA352c by the identification and analyses of 11 mitochondrial genomic recombinant structures related to WA352c in wild and cultivated rice. We deduce that these structures arose through multiple rearrangements among conserved mitochondrial sequences in the mitochondrial genome of the wild rice Oryza rufipogon, coupled with substoichiometric shifting and sequence variation. We identify two expressed but nonfunctional protogenes among these structures, and show that they could evolve into functional CMS genes via sequence variations that could relieve the self-inhibitory potential of the proteins. These sequence changes would endow the proteins with the ability to interact with the nucleus-encoded mitochondrial protein COX11, resulting in premature programmed cell death in the anther tapetum and male sterility. Furthermore, we show that the sequences that encode the COX11-interaction domains in these WA352c-related genes have experienced purifying selection during evolution. We propose a model for the formation and evolution of new CMS genes via a “multi-recombination/protogene formation/functionalization” mechanism involving gradual variations in the structure, sequence, copy number, and function. PMID:27725674
Diehn, Till A.; Pommerrenig, Benjamin; Bernhardt, Nadine; Hartmann, Anja; Bienert, Gerd P.
2015-01-01
Aquaporins (AQPs) are essential channel proteins that regulate plant water homeostasis and the uptake and distribution of uncharged solutes such as metalloids, urea, ammonia, and carbon dioxide. Despite their importance as crop plants, little is known about AQP gene and protein function in cabbage (Brassica oleracea) and other Brassica species. The recent releases of the genome sequences of B. oleracea and Brassica rapa allow comparative genomic studies in these species to investigate the evolution and features of Brassica genes and proteins. In this study, we identified all AQP genes in B. oleracea by a genome-wide survey. In total, 67 genes of four plant AQP subfamilies were identified. Their full-length gene sequences and locations on chromosomes and scaffolds were manually curated. The identification of six additional full-length AQP sequences in the B. rapa genome added to the recently published AQP protein family of this species. A phylogenetic analysis of AQPs of Arabidopsis thaliana, B. oleracea, B. rapa allowed us to follow AQP evolution in closely related species and to systematically classify and (re-) name these isoforms. Thirty-three groups of AQP-orthologous genes were identified between B. oleracea and Arabidopsis and their expression was analyzed in different organs. The two selectivity filters, gene structure and coding sequences were highly conserved within each AQP subfamily while sequence variations in some introns and untranslated regions were frequent. These data suggest a similar substrate selectivity and function of Brassica AQPs compared to Arabidopsis orthologs. The comparative analyses of all AQP subfamilies in three Brassicaceae species give initial insights into AQP evolution in these taxa. Based on the genome-wide AQP identification in B. oleracea and the sequence analysis and reprocessing of Brassica AQP information, our dataset provides a sequence resource for further investigations of the physiological and molecular functions of Brassica crop AQPs. PMID:25904922
Sharma, Akanksha; Sharma, Niharika; Bhalla, Prem; Singh, Mohan
2017-01-01
Comparative genomics has facilitated the mining of biological information from a genome sequence, through the detection of similarities and differences with genomes of closely or more distantly related species. By using such comparative approaches, knowledge can be transferred from model to non-model organisms and insights can be gained into the structural and evolutionary patterns of specific genes. In the absence of sequenced genomes for allergenic grasses, this study was aimed at understanding the structure, organisation and expression profiles of grass pollen allergens using the genomic data from Brachypodium distachyon, as it is phylogenetically related to the allergenic grasses. Combining genomic data with the anther RNA-Seq dataset revealed 24 pollen allergen genes belonging to eight allergen groups mapping on the five chromosomes in B. distachyon. High levels of anther-specific expression were observed for the 24 identified putative allergen-encoding genes in Brachypodium. The genomic evidence suggests that the gene encoding the group 5 allergen, the most potent trigger of hay fever and allergic asthma, originated as a pollen-specific orphan gene in a common grass ancestor of the Brachypodium and Triticeae clades. Gene structure analysis showed that the putative allergen-encoding genes in Brachypodium either lack introns or contain a reduced number of them. Promoter analysis of the identified Brachypodium genes revealed the presence of specific cis-regulatory sequences likely responsible for high anther/pollen-specific expression. With the identification of putative allergen-encoding genes in Brachypodium, this study has also described some important plant gene families (e.g. expansin superfamily, EF-Hand family, profilins etc.) for the first time in the model plant Brachypodium. Altogether, the present study provides new insights into structural characterization and evolution of pollen allergens and will further serve as a base for their functional characterization in related grass species.
oPOSSUM: identification of over-represented transcription factor binding sites in co-expressed genes
Ho Sui, Shannan J.; Mortimer, James R.; Arenillas, David J.; Brumm, Jochen; Walsh, Christopher J.; Kennedy, Brian P.; Wasserman, Wyeth W.
2005-01-01
Targeted transcript profiling studies can identify sets of co-expressed genes; however, identification of the underlying functional mechanism(s) is a significant challenge. Established methods for the analysis of gene annotations, particularly those based on the Gene Ontology, can identify functional linkages between genes. Similar methods for the identification of over-represented transcription factor binding sites (TFBSs) have been successful in yeast, but extension to human genomics has largely proved ineffective. Creation of a system for the efficient identification of common regulatory mechanisms in a subset of co-expressed human genes promises to break a roadblock in functional genomics research. We have developed an integrated system that searches for evidence of co-regulation by one or more transcription factors (TFs). oPOSSUM combines a pre-computed database of conserved TFBSs in human and mouse promoters with statistical methods for identification of sites over-represented in a set of co-expressed genes. The algorithm successfully identified mediating TFs in control sets of tissue-specific genes and in sets of co-expressed genes from three transcript profiling studies. Simulation studies indicate that oPOSSUM produces few false positives using empirically defined thresholds and can tolerate up to 50% noise in a set of co-expressed genes. PMID:15933209
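The abstract above does not spell out which statistical methods oPOSSUM applies, so the following is only a minimal sketch of one common way to test for over-representation: a one-sided hypergeometric tail probability. The function name and the counts in the usage note are hypothetical and are not taken from oPOSSUM itself.

```python
from math import comb


def hypergeom_enrichment_p(hits_in_set, set_size, hits_in_background, background_size):
    """Return P(X >= hits_in_set) under a hypergeometric null.

    Assumption (not from the source): the co-expressed gene set is treated
    as a random draw of `set_size` genes from `background_size` genes, of
    which `hits_in_background` carry a predicted site for the factor.
    """
    if not 0 <= hits_in_set <= set_size <= background_size:
        raise ValueError("counts are inconsistent")
    if not 0 <= hits_in_background <= background_size:
        raise ValueError("hits_in_background out of range")

    total = comb(background_size, set_size)
    upper = min(set_size, hits_in_background)
    # Sum the probability mass for every outcome at least as extreme
    # as the observed number of genes carrying the site.
    tail = sum(
        comb(hits_in_background, k)
        * comb(background_size - hits_in_background, set_size - k)
        for k in range(hits_in_set, upper + 1)
    )
    return tail / total
```

For instance, `hypergeom_enrichment_p(2, 2, 5, 10)` asks how likely it is that both genes of a two-gene set carry a site that is present in 5 of 10 background genes. A small value would suggest over-representation, subject to whatever threshold is chosen empirically.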
Misra, Namrata; Panda, Prasanna Kumar; Parida, Bikram Kumar
2014-12-01
Lysophosphatidyl acyltransferase (LPAT) is one of the major triacylglycerol synthesis enzymes, controlling the metabolic flow of lysophosphatidic acid to phosphatidic acid. Experimental studies in Arabidopsis have shown that LPAT activity is exhibited primarily by three distinct isoforms, namely the plastid-located LPAT1, the endoplasmic reticulum-located LPAT2, and the soluble isoform of LPAT (solLPAT). In this study, 24 putative genes representing all LPAT isoforms were identified from the analysis of 11 complete genomes including green algae, red algae, diatoms and higher plants. We observed LPAT1 and solLPAT genes to be ubiquitously present in nearly all genomes examined, whereas LPAT2 genes to have evolved more recently in the plant lineage. Phylogenetic analysis indicated that LPAT1, LPAT2 and solLPAT have convergently evolved through separate evolutionary paths and belong to three different gene families, which was further evidenced by their wide divergence at gene structure and sequence level. The genome distribution supports the hypothesis that each gene encoding a LPAT is not duplicated. Mapping of exon-intron structure of LPAT genes to the domain structure of proteins across different algal and plant species indicates that exon shuffling plays no role in the evolution of LPAT genes. Besides the previously defined motifs, several conserved consensus sequences were discovered which could be useful to distinguish different LPAT isoforms. Taken together, this study will enable the generation of experimental approximations to better understand the functional role of algal LPAT in lipid accumulation.
Han, Junwei; Shang, Desi; Zhang, Yunpeng; Zhang, Wei; Yao, Qianlan; Han, Lei; Xu, Yanjun; Yan, Wei; Bao, Zhaoshi; You, Gan; Jiang, Tao; Kang, Chunsheng; Li, Xia
2014-01-01
The prognosis of glioma patients is usually poor, especially in patients with glioblastoma (World Health Organization (WHO) grade IV). The regulatory functions of microRNA (miRNA) on genes have important implications in glioma cell survival. However, few studies have investigated glioma survival by integrating miRNAs and genes while also considering pathway structure. In this study, we performed sample-matched miRNA and mRNA expression profiling to systematically analyze glioma patient survival. During this analytical process, we developed a pathway-based random walk to identify a glioma core miRNA-gene module, simultaneously considering pathway structure information and multi-level involvement of miRNAs and genes. The core miRNA-gene module we identified comprised four apparent sub-modules; all four sub-modules displayed a significant correlation with patient survival in the testing set (P-values ≤ 0.001). Notably, one sub-module that consisted of 6 miRNAs and 26 genes also correlated with survival time in the high-grade subgroup (WHO grade III and IV), P-value = 0.0062. Furthermore, the 26-gene expression signature from this sub-module had robust predictive power in four independent, publicly available glioma datasets. Our findings suggest that the expression signatures, which were identified by integration at the miRNA and gene level, were closely associated with overall survival among glioma patients of various grades. PMID:24809850
Structure of HsaD, a steroid-degrading hydrolase, from Mycobacterium tuberculosis
Lack, Nathan; Lowe, Edward D.; Liu, Jie; Eltis, Lindsay D.; Noble, Martin E. M.; Sim, Edith; Westwood, Isaac M.
2008-01-01
Tuberculosis is a major cause of death worldwide. Understanding of the pathogenicity of Mycobacterium tuberculosis has been advanced by gene analysis and has led to the identification of genes that are important for intracellular survival in macrophages. One of these genes encodes HsaD, a meta-cleavage product (MCP) hydrolase that catalyzes the hydrolytic cleavage of a carbon–carbon bond in cholesterol metabolism. This paper describes the production of HsaD as a recombinant protein and, following crystallization, the determination of its three-dimensional structure to 2.35 Å resolution by X-ray crystallography at the Diamond Light Source in Oxfordshire, England. To the authors’ knowledge, this study constitutes the first report of a structure determined at the new synchrotron facility. The volume of the active-site cleft of the HsaD enzyme is more than double the corresponding active-site volumes of related MCP hydrolases involved in the catabolism of aromatic compounds, consistent with the specificity of HsaD for steroids such as cholesterol. Knowledge of the structure of the enzyme facilitates the design of inhibitors. PMID:18097091
Singh, Himanshu Narayan; Rajeswari, Moganty R
2016-01-01
Purine repeat sequences present in a gene are unique as they have a high propensity to form unusual DNA triple-helix structures. Friedreich's ataxia is the only human disease that is well known to be associated with DNA triplexes formed by purine repeats. The purpose of this study was to identify the expanded purine repeats (EPRs) in the human genome and find their correlation with cancer pathogenesis. We developed the "PuRepeatFinder.pl" algorithm to identify non-overlapping EPRs without pyrimidine interruptions in the human genome and customized it to search for repeat lengths n ≥ 200. A total of 1158 EPRs were identified in the genome, which followed a Wakeby distribution. Two hundred and ninety-six EPRs were found in genic regions of 282 genes (EPR-genes). Gene clustering of EPR-genes was done based on their cellular function, and a large number of EPR-genes were found to be enzymes/enzyme modulators. Meta-analysis of the 282 EPR-genes identified only 63 EPR-genes in association with cancer, mostly in breast, lung, and blood cancers. Protein-protein interaction network analysis of all 282 EPR-genes identified proteins including those in cadherins and VEGF. The two observations, that EPRs can induce mutations under malignant conditions and that some EPR-gene products were identified in vital cell signaling-mediated pathways, together suggest a crucial role of EPRs in carcinogenesis. The new link between EPR-genes and their functionally interacting proteins adds a new dimension to the present understanding of cancer pathogenesis and can help in planning therapeutic strategies. Validation of the present results using techniques like NGS is required to establish the role of the EPR genes in cancer pathology.
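The search described above (non-overlapping purine runs with no pyrimidine interruption, above a length cutoff) can be illustrated with a short sketch. This is not the authors' PuRepeatFinder.pl; it is a hypothetical Python equivalent that assumes the cutoff n counts bases in an uninterrupted run of A/G, which the abstract does not state explicitly.

```python
import re


def find_purine_repeats(sequence, min_length=200):
    """Return (start, end, length) for each uninterrupted purine run.

    Assumptions (not from the source): purines are the letters A and G,
    `min_length` is measured in bases, and coordinates are 0-based with
    an exclusive end. Any other character, including N, ends a run.
    """
    pattern = re.compile(r"[AG]{%d,}" % min_length)
    # finditer yields non-overlapping matches scanned left to right, and
    # the greedy quantifier extends each run to its maximal length.
    return [
        (m.start(), m.end(), m.end() - m.start())
        for m in pattern.finditer(sequence.upper())
    ]
```

On a toy sequence such as `"CT" + "AG" * 5 + "C" + "GGA" + "T"` with `min_length=5`, only the ten-base run is reported, because the single C breaks the purine tract and the remaining three-base run falls below the cutoff.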
Ai, Ye; Zhang, Chunling; Sun, Yalin; Wang, Weining; He, Yanhong; Bao, Manzhu
2017-01-01
According to the ABC model of floral organ development, B class genes specify petal and stamen identity. In order to study the function of B class genes in flower development of Tagetes erecta, five MADS-box B class genes were identified and their expression and putative functions were studied. Sequence comparisons and phylogenetic analyses indicated that there were one PI-like gene (TePI), two euAP3-like genes (TeAP3-1 and TeAP3-2), and two TM6-like genes (TeTM6-1 and TeTM6-2) in T. erecta. Strong expression levels of these genes were detected in stamens of the disk florets, but little or no expression was detected in bracts, receptacles or vegetative organs. Yeast hybrid experiments of the B class proteins showed that TePI protein could form a homodimer and heterodimers with all the other four B class proteins TeAP3-1, TeAP3-2, TeTM6-1 and TeTM6-2. No homodimer or interaction was observed between the euAP3 and TM6 clade members. Over-expression of the five B class genes of T. erecta in Nicotiana rotundifolia showed that only the transgenic plants of 35S::TePI had altered floral morphology compared with the non-transgenic line. This study could contribute to the understanding of the function of B class genes in flower development of T. erecta, and provide a theoretical basis for further research to change floral organ structures and create new materials for plant breeding.
Molecular identification of the chitinase genes in Plasmodium relictum.
Garcia-Longoria, Luz; Hellgren, Olof; Bensch, Staffan
2014-06-18
Malaria parasites need to synthesize chitinase in order to go through the peritrophic membrane, which is created around the mosquito midgut, to complete their life cycle. In mammalian malaria species, the chitinase gene comprises either a large or a short copy. In the avian malaria parasite Plasmodium gallinaceum both copies are present, suggesting that a gene duplication in the ancestor to these extant species preceded the loss of either the long or the short copy in Plasmodium parasites of mammals. Plasmodium gallinaceum is not the most widespread and harmful parasite of birds. This study is the first to search for and identify the chitinase gene in one of the most prevalent avian malaria parasites, Plasmodium relictum. Both copies of P. gallinaceum chitinase were used as reference sequences for primer design. Different sequences of Plasmodium spp. were used to build the phylogenetic tree of the chitinase gene. The gene encoding chitinase was identified in isolates of two mitochondrial lineages of P. relictum (SGS1 and GRW4). The chitinase found in these two lineages consists of both the long (PrCHT1) and the short (PrCHT2) copy. The genetic differences found in the long copy of the chitinase gene between SGS1 and GRW4 were higher than the difference observed for the cytochrome b gene. The identification of both copies in P. relictum sheds light on the phylogenetic relationship of the chitinase gene in the genus Plasmodium. Due to its high variability, the chitinase gene could be used to study the genetic population structure in isolates from different host species and geographic regions.
Charlesworth, Jac C; Peralta, Juan M; Drigalenko, Eugene; Göring, Harald Hh; Almasy, Laura; Dyer, Thomas D; Blangero, John
2009-12-15
Gene identification using linkage, association, or genome-wide expression is often underpowered. We propose that formal combination of information from multiple gene-identification approaches may lead to the identification of novel loci that are missed when only one form of information is available. Firstly, we analyze the Genetic Analysis Workshop 16 Framingham Heart Study Problem 2 genome-wide association data for HDL-cholesterol using a "gene-centric" approach. Then we formally combine the association test results with genome-wide transcriptional profiling data for high-density lipoprotein cholesterol (HDL-C), from the San Antonio Family Heart Study, using a Z-transform test (Stouffer's method). We identified 39 genes by the joint test at a conservative 1% false-discovery rate, including 9 from the significant gene-based association test and 23 whose expression was significantly correlated with HDL-C. Seven genes identified as significant in the joint test were not independently identified by either the association or expression tests. This combined approach has increased power and leads to the direct nomination of novel candidate genes likely to be involved in the determination of HDL-C levels. Such information can then be used as justification for a more exhaustive search for functional sequence variation within the nominated genes. We anticipate that this type of analysis will improve our speed of identification of regulatory genes causally involved in disease risk.
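The Z-transform (Stouffer's) combination named above can be sketched in a few lines. The code below is a generic illustration of the method, not the authors' pipeline: it assumes independent, one-sided p-values strictly between 0 and 1, and the optional weights are an addition for illustration only.

```python
from math import sqrt
from statistics import NormalDist


def stouffer_combine(p_values, weights=None):
    """Combine one-sided p-values with Stouffer's Z-transform method.

    Returns (combined_z, combined_p). Assumes the input tests are
    independent; equal weights are used when none are supplied.
    """
    if not p_values:
        raise ValueError("at least one p-value is required")
    if weights is None:
        weights = [1.0] * len(p_values)
    if len(weights) != len(p_values):
        raise ValueError("weights and p_values must have the same length")

    std_normal = NormalDist()
    # Map each p-value to a z-score: a small p gives a large positive z.
    z_scores = [std_normal.inv_cdf(1.0 - p) for p in p_values]
    # The weighted sum of z-scores is rescaled to unit variance.
    combined_z = sum(w * z for w, z in zip(weights, z_scores)) / sqrt(
        sum(w * w for w in weights)
    )
    combined_p = 1.0 - std_normal.cdf(combined_z)
    return combined_z, combined_p
```

As a usage example, a gene with p = 0.05 from an association test and p = 0.05 from an expression test gives a combined p of about 0.01, which shows how two individually modest signals can reinforce each other in a joint test.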
José-Edwards, Diana S.; Kerner, Pierre; Kugler, Jamie E.; Deng, Wei; Jiang, Di; Di Gregorio, Anna
2013-01-01
The notochord is the distinctive characteristic of chordates; however, the knowledge of the complement of transcription factors governing the development of this structure is still incomplete. Here we present the expression patterns of seven transcription factor genes detected in the notochord of the ascidian Ciona intestinalis at various stages of embryonic development. Four of these transcription factors, Fos-a, NFAT5, AFF and Klf15, have not been directly associated with the notochord in previous studies, while the others, including Spalt-like-a, Lmx-like and STAT5/6-b, display evolutionarily conserved expression in this structure as well as in other domains. We examined the hierarchical relationships between these genes and the transcription factor Brachyury, which is necessary for notochord development in all chordates. We found that Ciona Brachyury regulates the expression of most, although not all, of these genes. These results shed light on the genetic regulatory program underlying notochord formation in Ciona and possibly other chordates. PMID:21594950
Genome-wide identification of soybean WRKY transcription factors in response to salt stress.
Yu, Yanchong; Wang, Nan; Hu, Ruibo; Xiang, Fengning
2016-01-01
Members of the large family of WRKY transcription factors are involved in a wide range of developmental and physiological processes, most particularly in the plant response to biotic and abiotic stress. Here, an analysis of the soybean genome sequence allowed the identification of the full complement of 188 soybean WRKY genes. Phylogenetic analysis revealed that soybean WRKY genes were classified into three major groups (I, II, III), with the second group further categorized into five subgroups (IIa-IIe). The soybean WRKYs from each group shared similar gene structures and motif compositions. The GmWRKYs were dispersed over all 20 soybean chromosomes. The whole genome duplication appeared to have contributed significantly to the expansion of the family. Expression analysis by RNA-seq indicated that in soybean root, 66 of the genes responded rapidly and transiently to the imposition of salt stress, all but one being up-regulated, whereas in the aerial part, 49 GmWRKYs responded, all but two being down-regulated. RT-qPCR analysis showed that in the whole soybean plant, 66 GmWRKYs exhibited distinct expression patterns in response to salt stress, of which 12 showed no significant change, 35 were decreased, while 19 were induced. The data presented here provide critical clues for further functional studies of WRKY genes in soybean salt tolerance.
Okuwa, Takako; Katayama, Takahiro; Takano, Akinori; Yasukawa, Hiroo
2002-10-01
Genes for the cell-counting factors in Dictyostelium discoideum, countin and countin2, are considered to control the size of the multicellular structure of this organism. A novel gene, countin3, that is homologous to countin and countin2 genes (49 and 39% identity in amino acid sequence, respectively) was identified in the D. discoideum genome. The expression of countin3 was observed in the vegetatively growing cells, decreased in the aggregating stage, increased in the mid-developmental stage and decreased again in subsequent stages. This expression pattern is different from that of countin and countin2. The distinct expression kinetics of three genes suggests that they would have unique roles in size control of D. discoideum.
Schönmann, Susan; Loy, Alexander; Wimmersberger, Céline; Sobek, Jens; Aquino, Catharine; Vandamme, Peter; Frey, Beat; Rehrauer, Hubert; Eberl, Leo
2009-04-01
For cultivation-independent and highly parallel analysis of members of the genus Burkholderia, an oligonucleotide microarray (phylochip) consisting of 131 hierarchically nested 16S rRNA gene-targeted oligonucleotide probes was developed. A novel primer pair was designed for selective amplification of a 1.3 kb 16S rRNA gene fragment of Burkholderia species prior to microarray analysis. The diagnostic performance of the microarray for identification and differentiation of Burkholderia species was tested with 44 reference strains of the genera Burkholderia, Pandoraea, Ralstonia and Limnobacter. Hybridization patterns based on presence/absence of probe signals were interpreted semi-automatically using the novel likelihood-based strategy of the web-tool Phylo-Detect. Eighty-eight per cent of the reference strains were correctly identified at the species level. The evaluated microarray was applied to investigate shifts in the Burkholderia community structure in acidic forest soil upon addition of cadmium, a condition that selected for Burkholderia species. The microarray results were in agreement with those obtained from phylogenetic analysis of Burkholderia 16S rRNA gene sequences recovered from the same cadmium-contaminated soil, demonstrating the value of the Burkholderia phylochip for determinative and environmental studies.
Global Identification and Characterization of Transcriptionally Active Regions in the Rice Genome
Stolc, Viktor; Deng, Wei; He, Hang; Korbel, Jan; Chen, Xuewei; Tongprasit, Waraporn; Ronald, Pamela; Chen, Runsheng; Gerstein, Mark; Wang Deng, Xing
2007-01-01
Genome tiling microarray studies have consistently documented rich transcriptional activity beyond the annotated genes. However, systematic characterization and transcriptional profiling of the putative novel transcripts on the genome scale are still lacking. We report here the identification of 25,352 and 27,744 transcriptionally active regions (TARs) not encoded by annotated exons in the rice (Oryza sativa) subspecies japonica and indica, respectively. The non-exonic TARs account for approximately two thirds of the total TARs detected by tiling arrays and represent transcripts likely conserved between japonica and indica. Transcription of 21,018 (83%) japonica non-exonic TARs was verified through expression profiling in 10 tissue types using a re-array in which annotated genes and TARs were each represented by five independent probes. Subsequent analyses indicate that about 80% of the japonica TARs that were not assigned to annotated exons can be assigned to various putatively functional or structural elements of the rice genome, including splice variants, uncharacterized portions of incompletely annotated genes, antisense transcripts, duplicated gene fragments, and potential non-coding RNAs. These results provide a systematic characterization of non-exonic transcripts in rice and thus expand the current view of the complexity and dynamics of the rice transcriptome. PMID:17372628
Comprehensive assessment of cancer missense mutation clustering in protein structures.
Kamburov, Atanas; Lawrence, Michael S; Polak, Paz; Leshchiner, Ignaty; Lage, Kasper; Golub, Todd R; Lander, Eric S; Getz, Gad
2015-10-06
Large-scale tumor sequencing projects enabled the identification of many new cancer gene candidates through computational approaches. Here, we describe a general method to detect cancer genes based on significant 3D clustering of mutations relative to the structure of the encoded protein products. The approach can also be used to search for proteins with an enrichment of mutations at binding interfaces with a protein, nucleic acid, or small molecule partner. We applied this approach to systematically analyze the PanCancer compendium of somatic mutations from 4,742 tumors relative to all known 3D structures of human proteins in the Protein Data Bank. We detected significant 3D clustering of missense mutations in several previously known oncoproteins including HRAS, EGFR, and PIK3CA. Although clustering of missense mutations is often regarded as a hallmark of oncoproteins, we observed that a number of tumor suppressors, including FBXW7, VHL, and STK11, also showed such clustering. Beside these known cases, we also identified significant 3D clustering of missense mutations in NUF2, which encodes a component of the kinetochore, that could affect chromosome segregation and lead to aneuploidy. Analysis of interaction interfaces revealed enrichment of mutations in the interfaces between FBXW7-CCNE1, HRAS-RASA1, CUL4B-CAND1, OGT-HCFC1, PPP2R1A-PPP2R5C/PPP2R2A, DICER1-Mg2+, MAX-DNA, SRSF2-RNA, and others. Together, our results indicate that systematic consideration of 3D structure can assist in the identification of cancer genes and in the understanding of the functional role of their mutations. PMID:26392535
Resistance genes in barley (Hordeum vulgare L.) and their identification with molecular markers.
Chełkowski, Jerzy; Tyrka, Mirosław; Sobkiewicz, Andrzej
2003-01-01
Current information on barley resistance genes available from scientific papers and on-line databases is summarised. The recent literature contains information on 107 major resistance genes (R genes) against fungal pathogens (excluding powdery mildew), pathogenic viruses and aphids identified in Hordeum vulgare accessions. The highest number of resistance genes was identified against Puccinia hordei, Rhynchosporium secalis, and the viruses BaYMV and BaMMV, with 17, 14 and 13 genes respectively. There is still a lot of confusion regarding symbols for R genes against powdery mildew. Among the 23 loci described to date, two regions Mla and Mlo comprise approximately 31 and 25 alleles. Over 50 R genes have already been localised and over 30 mapped on 7 barley chromosomes. Four barley R genes have been cloned recently: Mlo, Rpg1, Mla1 and Mla6, and their structures (sequences) are available. The paper presents a catalogue of barley resistance gene symbols, their chromosomal location and the list of available DNA markers useful in characterising cultivars and breeding accessions.
2001-07-01
USA 86: 29. Mattheakis, L. C., Sor, F., and Collier, R. J. Diphthamide synthesis in Saccharomyces 5136-5140, 1989. cerevisiae: structure of the...Cell. Biochem. 138: 131-133. Vargas, M. P., Zhuang, Z., Wang, C., Vortmeyer, A., Linehan, W. M., Mattheakis, H. C., Sor, F., and Collier, R. J. (1993...W. H., and Collier, R. J. (1992). DPH5, a Wilson, R., Ainscough, R., Andersen, K., Baynes, C., Berks, M., methyltransferase gene required for
Kaneko, Jun; Narita-Yamada, Sachiko; Wakabayashi, Yukari; Kamio, Yoshiyuki
2009-07-01
The temperate phage phiSLT of Staphylococcus aureus carries genes for Panton-Valentine leukocidin. Here, we identify ORF636, a constituent of the phage tail tip structure, as a recognition/adhesion protein for a poly(glycerophosphate) chain of lipoteichoic acid on the cell surface of S. aureus. ORF636 bound specifically to S. aureus; it did not bind to any other staphylococcal species or to several gram-positive bacteria.
Sahoo, Satya S.; Bodenreider, Olivier; Rutter, Joni L.; Skinner, Karen J.; Sheth, Amit P.
2008-01-01
Objectives This paper illustrates how Semantic Web technologies (especially RDF, OWL, and SPARQL) can support information integration and make it easy to create semantic mashups (semantically integrated resources). In the context of understanding the genetic basis of nicotine dependence, we integrate gene and pathway information and show how three complex biological queries can be answered by the integrated knowledge base. Methods We use an ontology-driven approach to integrate two gene resources (Entrez Gene and HomoloGene) and three pathway resources (KEGG, Reactome and BioCyc), for five organisms, including humans. We created the Entrez Knowledge Model (EKoM), an information model in OWL for the gene resources, and integrated it with the extant BioPAX ontology designed for pathway resources. The integrated schema is populated with data from the pathway resources, publicly available in BioPAX-compatible format, and gene resources for which a population procedure was created. The SPARQL query language is used to formulate queries over the integrated knowledge base to answer the three biological queries. Results Simple SPARQL queries could easily identify hub genes, i.e., those genes whose gene products participate in many pathways or interact with many other gene products. The identification of the genes expressed in the brain turned out to be more difficult, due to the lack of a common identification scheme for proteins. Conclusion Semantic Web technologies provide a valid framework for information integration in the life sciences. Ontology-driven integration represents a flexible, sustainable and extensible solution to the integration of large volumes of information. Additional resources, which enable the creation of mappings between information sources, are required to compensate for heterogeneity across namespaces. Resource page http://knoesis.wright.edu/research/lifesci/integration/structured_data/JBI-2008/ PMID:18395495
Sahoo, Satya S; Bodenreider, Olivier; Rutter, Joni L; Skinner, Karen J; Sheth, Amit P
2008-10-01
This paper illustrates how Semantic Web technologies (especially RDF, OWL, and SPARQL) can support information integration and make it easy to create semantic mashups (semantically integrated resources). In the context of understanding the genetic basis of nicotine dependence, we integrate gene and pathway information and show how three complex biological queries can be answered by the integrated knowledge base. We use an ontology-driven approach to integrate two gene resources (Entrez Gene and HomoloGene) and three pathway resources (KEGG, Reactome and BioCyc), for five organisms, including humans. We created the Entrez Knowledge Model (EKoM), an information model in OWL for the gene resources, and integrated it with the extant BioPAX ontology designed for pathway resources. The integrated schema is populated with data from the pathway resources, publicly available in BioPAX-compatible format, and gene resources for which a population procedure was created. The SPARQL query language is used to formulate queries over the integrated knowledge base to answer the three biological queries. Simple SPARQL queries could easily identify hub genes, i.e., those genes whose gene products participate in many pathways or interact with many other gene products. The identification of the genes expressed in the brain turned out to be more difficult, due to the lack of a common identification scheme for proteins. Semantic Web technologies provide a valid framework for information integration in the life sciences. Ontology-driven integration represents a flexible, sustainable and extensible solution to the integration of large volumes of information. Additional resources, which enable the creation of mappings between information sources, are required to compensate for heterogeneity across namespaces. RESOURCE PAGE: http://knoesis.wright.edu/research/lifesci/integration/structured_data/JBI-2008/
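The notion of a hub gene used above (a gene whose products participate in many pathways) can be sketched outside SPARQL as a simple count. This is a minimal illustration in plain Python, not the authors' queries: the gene and pathway identifiers are placeholders, and the `min_pathways` cut-off is an assumed parameter that the abstract does not specify.

```python
from collections import defaultdict

def hub_genes(participations, min_pathways=3):
    """Map each gene to its number of distinct pathways, keeping only genes
    that appear in at least `min_pathways` pathways."""
    pathways = defaultdict(set)
    for gene, pathway in participations:
        pathways[gene].add(pathway)  # a set ignores repeated (gene, pathway) pairs
    return {g: len(p) for g, p in pathways.items() if len(p) >= min_pathways}

# Hypothetical (gene, pathway) pairs, as might be exported from an integrated store.
pairs = [
    ("geneA", "pathway1"),
    ("geneA", "pathway2"),
    ("geneA", "pathway3"),
    ("geneA", "pathway1"),  # duplicate record, counted once
    ("geneB", "pathway1"),
    ("geneC", "pathway2"),
    ("geneC", "pathway3"),
]
print(hub_genes(pairs))
```

In the integrated knowledge base the same grouping would be expressed as an aggregate query over gene-product and pathway resources; the counting logic is the same.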
Gene identification in the congenital disorders of glycosylation type I by whole-exome sequencing.
Timal, Sharita; Hoischen, Alexander; Lehle, Ludwig; Adamowicz, Maciej; Huijben, Karin; Sykut-Cegielska, Jolanta; Paprocka, Justyna; Jamroz, Ewa; van Spronsen, Francjan J; Körner, Christian; Gilissen, Christian; Rodenburg, Richard J; Eidhof, Ilse; Van den Heuvel, Lambert; Thiel, Christian; Wevers, Ron A; Morava, Eva; Veltman, Joris; Lefeber, Dirk J
2012-10-01
Congenital disorders of glycosylation type I (CDG-I) form a growing group of recessive neurometabolic diseases. Identification of disease genes is compromised by the enormous heterogeneity in clinical symptoms and the large number of potential genes involved. Until now, gene identification included the sequential application of biochemical methods in blood samples and fibroblasts. In genetically unsolved cases, homozygosity mapping has been applied in consanguineous families. Altogether, this time-consuming diagnostic strategy led to the identification of defects in 17 different CDG-I genes. Here, we applied whole-exome sequencing (WES) in combination with the knowledge of the protein N-glycosylation pathway for gene identification in our remaining group of six unsolved CDG-I patients from unrelated non-consanguineous families. Exome variants were prioritized based on a list of 76 potential CDG-I candidate genes, leading to the rapid identification of one known and two novel CDG-I gene defects. These included the first X-linked CDG-I due to a de novo mutation in ALG13, and compound heterozygous mutations in DPAGT1, together the first two steps in dolichol-PP-glycan assembly, and mutations in PGM1 in two cases, involved in nucleotide sugar biosynthesis. The pathogenicity of the mutations was confirmed by showing the deficient activity of the corresponding enzymes in patient fibroblasts. Combined with these results, the gene defect has been identified in 98% of our CDG-I patients. Our results implicate the potential of WES to unravel disease genes in the CDG-I in newly diagnosed singleton families.
Genome-Wide Identification and Expression Analysis of WRKY Gene Family in Capsicum annuum L.
Diao, Wei-Ping; Snyder, John C.; Wang, Shu-Bin; Liu, Jin-Bing; Pan, Bao-Gui; Guo, Guang-Jun; Wei, Ge
2016-01-01
The WRKY family of transcription factors is one of the most important families of plant transcriptional regulators with members regulating multiple biological processes, especially in regulating defense against biotic and abiotic stresses. However, little information is available about WRKYs in pepper (Capsicum annuum L.). The recent release of completely assembled genome sequences of pepper allowed us to perform a genome-wide investigation for pepper WRKY proteins. In the present study, a total of 71 WRKY genes were identified in the pepper genome. According to structural features of their encoded proteins, the pepper WRKY genes (CaWRKY) were classified into three main groups, with the second group further divided into five subgroups. Genome mapping analysis revealed that CaWRKY were enriched on four chromosomes, especially on chromosome 1, and 15.5% of the family members were tandemly duplicated genes. A phylogenetic tree was constructed based on WRKY domain sequences derived from pepper and Arabidopsis. The expression of 21 selected CaWRKY genes in response to seven different biotic and abiotic stresses (salt, heat shock, drought, Phytophthora capsici, SA, MeJA, and ABA) was evaluated by quantitative RT-PCR; some CaWRKYs were highly expressed and up-regulated by stress treatment. Our results will provide a platform for functional identification and molecular breeding studies of WRKY genes in pepper. PMID:26941768
Identification of the Pr1 Gene Product Completes the Anthocyanin Biosynthesis Pathway of Maize
Sharma, Mandeep; Cortes-Cruz, Moises; Ahern, Kevin R.; McMullen, Michael; Brutnell, Thomas P.; Chopra, Surinder
2011-01-01
In maize, mutations in the pr1 locus lead to the accumulation of pelargonidin (red) rather than cyanidin (purple) pigments in aleurone cells where the anthocyanin biosynthetic pathway is active. We characterized pr1 mutation and isolated a putative F3′H encoding gene (Zmf3′h1) and showed by segregation analysis that the red kernel phenotype is linked to this gene. Genetic mapping using SNP markers confirms its position on chromosome 5L. Furthermore, genetic complementation experiments using a CaMV 35S::ZmF3′H1 promoter–gene construct established that the encoded protein product was sufficient to perform a 3′-hydroxylation reaction. The Zmf3′h1-specific transcripts were detected in floral and vegetative tissues of Pr1 plants and were absent in pr1. Four pr1 alleles were characterized: two carry a 24 TA dinucleotide repeat insertion in the 5′-upstream promoter region, a third has a 17-bp deletion near the TATA box, and a fourth contains a Ds insertion in exon1. Genetic and transcription assays demonstrated that the pr1 gene is under the regulatory control of anthocyanin transcription factors red1 and colorless1. The cloning and characterization of pr1 completes the molecular identification of all genes encoding structural enzymes of the anthocyanin pathway of maize. PMID:21385724
Structural evolution of the 4/1 genes and proteins in non-vascular and lower vascular plants.
Morozov, Sergey Y; Milyutina, Irina A; Bobrova, Vera K; Ryazantsev, Dmitry Y; Erokhina, Tatiana N; Zavriev, Sergey K; Agranovsky, Alexey A; Solovyev, Andrey G; Troitsky, Alexey V
2015-12-01
The 4/1 protein of unknown function is encoded by a single-copy gene in most higher plants. The 4/1 protein of Nicotiana tabacum (Nt-4/1 protein) has been shown to be alpha-helical and predominantly expressed in conductive tissues. Here, we report the analysis of 4/1 genes and the encoded proteins of lower land plants. Sequences of a number of 4/1 genes from liverworts, lycophytes, ferns and gymnosperms were determined and analyzed together with sequences available in databases. Most of the vascular plants were found to encode Magnoliophyta-like 4/1 proteins exhibiting previously described gene structure and protein properties. Identification of the 4/1-like proteins in hornworts, liverworts and charophyte algae (sister lineage to all land plants) but not in mosses suggests that 4/1 proteins are likely important for plant development but not required for a primary metabolic function of plant cell. Copyright © 2015 Elsevier B.V. and Société Française de Biochimie et Biologie Moléculaire (SFBBM). All rights reserved.
Dai, Zhimin; Guo, Xue; Yin, Huaqun; Liang, Yili; Cong, Jing; Liu, Xueduan
2014-01-01
Biological nitrogen fixation is an essential function of acid mine drainage (AMD) microbial communities. However, most acidophiles in AMD environments are uncultured microorganisms and little is known about the diversity of nitrogen-fixing genes and structure of nif gene cluster in AMD microbial communities. In this study, we used metagenomic sequencing to isolate nif genes in the AMD microbial community from Dexing Copper Mine, China. Meanwhile, a metagenome microarray containing 7,776 large-insertion fosmids was constructed to screen novel nif gene clusters. Metagenomic analyses revealed that 742 sequences were identified as nif genes including structural subunit genes nifH, nifD, nifK and various additional genes. The AMD community is massively dominated by the genus Acidithiobacillus. However, the phylogenetic diversity of nitrogen-fixing microorganisms is much higher than previously thought in the AMD community. Furthermore, a 32.5-kb genomic sequence harboring nif, fix and associated genes was screened by metagenome microarray. Comparative genome analysis indicated that most nif genes in this cluster are most similar to those of Herbaspirillum seropedicae, but the organization of the nif gene cluster had significant differences from H. seropedicae. Sequence analysis and reverse transcription PCR also suggested that distinct transcription units of nif genes exist in this gene cluster. nifQ gene falls into the same transcription unit with fixABCX genes, which have not been reported in other diazotrophs before. All of these results indicated that more novel diazotrophs survive in the AMD community.
Yin, Huaqun; Liang, Yili; Cong, Jing; Liu, Xueduan
2014-01-01
Biological nitrogen fixation is an essential function of acid mine drainage (AMD) microbial communities. However, most acidophiles in AMD environments are uncultured microorganisms and little is known about the diversity of nitrogen-fixing genes and structure of nif gene cluster in AMD microbial communities. In this study, we used metagenomic sequencing to isolate nif genes in the AMD microbial community from Dexing Copper Mine, China. Meanwhile, a metagenome microarray containing 7,776 large-insertion fosmids was constructed to screen novel nif gene clusters. Metagenomic analyses revealed that 742 sequences were identified as nif genes including structural subunit genes nifH, nifD, nifK and various additional genes. The AMD community is massively dominated by the genus Acidithiobacillus. However, the phylogenetic diversity of nitrogen-fixing microorganisms is much higher than previously thought in the AMD community. Furthermore, a 32.5-kb genomic sequence harboring nif, fix and associated genes was screened by metagenome microarray. Comparative genome analysis indicated that most nif genes in this cluster are most similar to those of Herbaspirillum seropedicae, but the organization of the nif gene cluster had significant differences from H. seropedicae. Sequence analysis and reverse transcription PCR also suggested that distinct transcription units of nif genes exist in this gene cluster. nifQ gene falls into the same transcription unit with fixABCX genes, which have not been reported in other diazotrophs before. All of these results indicated that more novel diazotrophs survive in the AMD community. PMID:24498417
Cancel all Hollidays for SLX4 mutations: identification of a new Fanconi anemia subtype, FANCP.
Kang, M H
2011-07-01
SLX4, a coordinator of structure-specific endo-nucleases, is mutated in a new Fanconi anemia subtype Stoepker et al. (2011) Nature Genetics 43:138-141. Mutations of the SLX4 gene in Fanconi anemia Kim et al. (2011) Nature Genetics 43:142-146. © 2011 John Wiley & Sons A/S.
Gifford, Lida K.; Opalinska, Joanna B.; Jordan, David; Pattanayak, Vikram; Greenham, Paul; Kalota, Anna; Robbins, Michelle; Vernovsky, Kathy; Rodriguez, Lesbeth C.; Do, Bao T.; Lu, Ponzy; Gewirtz, Alan M.
2005-01-01
We describe a physical mRNA mapping strategy employing fluorescent self-quenching reporter molecules (SQRMs) that facilitates the identification of mRNA sequence accessible for hybridization with antisense nucleic acids in vitro and in vivo, in real time. SQRMs are 20–30 base oligodeoxynucleotides with 5–6 bp complementary ends to which a 5′ fluorophore and 3′ quenching group are attached. Alone, the SQRM complementary ends form a stem that holds the fluorophore and quencher in contact. When the SQRM forms base pairs with its target, the structure separates the fluorophore from the quencher. This event can be reported by fluorescence emission when the fluorophore is excited. The stem–loop of the SQRM suggests that SQRMs be made to target natural stem–loop structures formed during mRNA synthesis. The general utility of this method is demonstrated by SQRM identification of targetable sequence within c-myb and bcl-6 mRNA. Corresponding antisense oligonucleotides reduce these gene products in cells. PMID:15718294
Schwizer, Sarah; Tasara, Taurai; Zurfluh, Katrin; Stephan, Roger; Lehner, Angelika
2013-02-15
Cronobacter spp. are opportunistic pathogens that can cause septicemia and infections of the central nervous system primarily in premature, low-birth weight and/or immune-compromised neonates. Serum resistance is a crucial virulence factor for the development of systemic infections, including bacteremia. It was the aim of the current study to identify genes involved in serum tolerance in a selected Cronobacter sakazakii strain of clinical origin. Screening of 2749 random transposon knock out mutants of a C. sakazakii ES 5 library for modified serum tolerance (compared to wild type) revealed 10 mutants showing significantly increased/reduced resistance to serum killing. Identification of the affected sites in mutants displaying reduced serum resistance revealed genes encoding surface and membrane proteins as well as regulatory elements or chaperones. By this approach, the involvement of the yet undescribed Wzy_C superfamily domain containing coding region in serum tolerance was observed and experimentally confirmed. Additionally, knock out mutants with enhanced serum tolerance were observed. Examination of respective transposon insertion loci revealed regulatory (repressor) elements, coding regions for chaperones and efflux systems as well as the coding region for the protein YbaJ. Real time expression analysis experiments revealed that knock out of the gene for this protein negatively affects the expression of the fimA gene, which is a key structural component of the formation of fimbriae. Fimbriae are structures of high immunogenic potential and it is likely that absence/truncation of the ybaJ gene resulted in a non-fimbriated phenotype accounting for the enhanced survival of this mutant in human serum. By using a transposon knock out approach we were able to identify genes involved in both increased and reduced serum tolerance in Cronobacter sakazakii ES5. This study reveals first insights into the complex nature of serum tolerance of Cronobacter spp.
Filiz, Ertugrul; Ozyigit, Ibrahim Ilker; Vatansever, Recep
2015-10-01
GolS genes stand as potential candidate genes for molecular breeding and/or engineering programs aimed at improving abiotic stress tolerance in plant species. In this study, a total of six galactinol synthase (GolS) genes/proteins were retrieved for Solanum lycopersicum and Brachypodium distachyon. GolS protein sequences were found to include the glyco_transf_8 (PF01501) domain structure, and to have similar molecular weights (36.40-39.59 kDa) and amino acid lengths (318-347 aa) with a slightly acidic pI (5.35-6.40). The sub-cellular location was mainly predicted as cytoplasmic. S. lycopersicum genes were located on chr 1 and 2, and included one segmental duplication, while genes of B. distachyon were only on chr 1 with one tandem duplication. GolS sequences were found to have well conserved motif structures. Cis-acting analysis was performed for three abiotic stress responsive elements, including ABA responsive element (ABRE), dehydration and cold responsive elements (DRE/CRT) and low-temperature responsive element (LTRE). ABRE elements were found in all GolS genes, except for SlGolS4; DRE/CRT was not detected in any GolS genes and the LTRE element was found in the SlGolS1 and BdGolS1 genes. AU analysis in UTR and ORF regions indicated that SlGolS and BdGolS mRNAs may have a short half-life. SlGolS3 and SlGolS4 genes may generate more stable transcripts since they included the AATTAAA motif for polyadenylation signal POLASIG2. Secondary structures of SlGolS proteins were more conserved than those of BdGolS. Some structural divergences were detected in 3D structures and predicted binding sites exhibited various patterns in GolS proteins. Copyright © 2015 Elsevier Ltd. All rights reserved.
Roncaglia, Paola; Howe, Douglas G.; Laulederkind, Stanley J.F.; Khodiyar, Varsha K.; Berardini, Tanya Z.; Tweedie, Susan; Foulger, Rebecca E.; Osumi-Sutherland, David; Campbell, Nancy H.; Huntley, Rachael P.; Talmud, Philippa J.; Blake, Judith A.; Breckenridge, Ross; Riley, Paul R.; Lambiase, Pier D.; Elliott, Perry M.; Clapp, Lucie; Tinker, Andrew; Hill, David P.
2018-01-01
Background: A systems biology approach to cardiac physiology requires a comprehensive representation of how coordinated processes operate in the heart, as well as the ability to interpret relevant transcriptomic and proteomic experiments. The Gene Ontology (GO) Consortium provides structured, controlled vocabularies of biological terms that can be used to summarize and analyze functional knowledge for gene products. Methods and Results: In this study, we created a computational resource to facilitate genetic studies of cardiac physiology by integrating literature curation with attention to an improved and expanded ontological representation of heart processes in the Gene Ontology. As a result, the Gene Ontology now contains terms that comprehensively describe the roles of proteins in cardiac muscle cell action potential, electrical coupling, and the transmission of the electrical impulse from the sinoatrial node to the ventricles. Evaluating the effectiveness of this approach to inform data analysis demonstrated that Gene Ontology annotations, analyzed within an expanded ontological context of heart processes, can help to identify candidate genes associated with arrhythmic disease risk loci. Conclusions: We determined that a combination of curation and ontology development for heart-specific genes and processes supports the identification and downstream analysis of genes responsible for the spread of the cardiac action potential through the heart. Annotating these genes and processes in a structured format facilitates data analysis and supports effective retrieval of gene-centric information about cardiac defects. PMID:29440116
Lovering, Ruth C; Roncaglia, Paola; Howe, Douglas G; Laulederkind, Stanley J F; Khodiyar, Varsha K; Berardini, Tanya Z; Tweedie, Susan; Foulger, Rebecca E; Osumi-Sutherland, David; Campbell, Nancy H; Huntley, Rachael P; Talmud, Philippa J; Blake, Judith A; Breckenridge, Ross; Riley, Paul R; Lambiase, Pier D; Elliott, Perry M; Clapp, Lucie; Tinker, Andrew; Hill, David P
2018-02-01
A systems biology approach to cardiac physiology requires a comprehensive representation of how coordinated processes operate in the heart, as well as the ability to interpret relevant transcriptomic and proteomic experiments. The Gene Ontology (GO) Consortium provides structured, controlled vocabularies of biological terms that can be used to summarize and analyze functional knowledge for gene products. In this study, we created a computational resource to facilitate genetic studies of cardiac physiology by integrating literature curation with attention to an improved and expanded ontological representation of heart processes in the Gene Ontology. As a result, the Gene Ontology now contains terms that comprehensively describe the roles of proteins in cardiac muscle cell action potential, electrical coupling, and the transmission of the electrical impulse from the sinoatrial node to the ventricles. Evaluating the effectiveness of this approach to inform data analysis demonstrated that Gene Ontology annotations, analyzed within an expanded ontological context of heart processes, can help to identify candidate genes associated with arrhythmic disease risk loci. We determined that a combination of curation and ontology development for heart-specific genes and processes supports the identification and downstream analysis of genes responsible for the spread of the cardiac action potential through the heart. Annotating these genes and processes in a structured format facilitates data analysis and supports effective retrieval of gene-centric information about cardiac defects. © 2018 The Authors.
Exploiting proteomic data for genome annotation and gene model validation in Aspergillus niger
Wright, James C; Sugden, Deana; Francis-McIntyre, Sue; Riba-Garcia, Isabel; Gaskell, Simon J; Grigoriev, Igor V; Baker, Scott E; Beynon, Robert J; Hubbard, Simon J
2009-01-01
Background Proteomic data is a potentially rich, but arguably unexploited, data source for genome annotation. Peptide identifications from tandem mass spectrometry provide prima facie evidence for gene predictions and can discriminate over a set of candidate gene models. Here we apply this to the recently sequenced Aspergillus niger fungal genome from the Joint Genome Institute (JGI) and another predicted protein set from another A. niger sequence. Tandem mass spectra (MS/MS) were acquired from 1D gel electrophoresis bands and searched against all available gene models using Average Peptide Scoring (APS) and reverse database searching to produce confident identifications at an acceptable false discovery rate (FDR). Results 405 identified peptide sequences were mapped to 214 different A. niger genomic loci to which 4093 predicted gene models clustered, 2872 of which contained the mapped peptides. Interestingly, 13 (6%) of these loci either had no preferred predicted gene model or the genome annotators' chosen "best" model for that genomic locus was not found to be the most parsimonious match to the identified peptides. The peptides identified also boosted confidence in predicted gene structures spanning 54 introns from different gene models. Conclusion This work highlights the potential of integrating experimental proteomics data into genomic annotation pipelines much as expressed sequence tag (EST) data has been. A comparison of the published genome from another strain of A. niger sequenced by DSM showed that a number of the gene models or proteins with proteomics evidence did not occur in both genomes, further highlighting the utility of the method. PMID:19193216
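Reverse (decoy) database searching is commonly used to estimate a false discovery rate by counting how many matches to reversed sequences pass the same score threshold as matches to real sequences. The sketch below shows one widely used estimator, decoys divided by targets; the abstract does not state which estimator was applied, and Average Peptide Scoring itself is not reproduced here. The scores and the function name `decoy_fdr` are hypothetical.

```python
def decoy_fdr(psms, threshold):
    """Estimate FDR at a score threshold as (#decoy hits) / (#target hits).

    `psms` is an iterable of (score, is_decoy) peptide-spectrum matches.
    Returns 0.0 when no target hit passes the threshold."""
    targets = sum(1 for score, is_decoy in psms if score >= threshold and not is_decoy)
    decoys = sum(1 for score, is_decoy in psms if score >= threshold and is_decoy)
    return decoys / targets if targets else 0.0

# Hypothetical matches: (search score, matched a reversed "decoy" sequence?)
example_psms = [
    (50, False), (45, False), (40, True), (38, False),
    (30, False), (25, True), (20, False),
]
for cutoff in (45, 30):
    print(cutoff, decoy_fdr(example_psms, cutoff))
```

Raising the threshold lowers the estimated FDR at the cost of fewer accepted identifications, which is the trade-off behind an "acceptable" FDR.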
Huang, Jianyan; Zhao, Xiaobo; Weng, Xiaoyu; Wang, Lei; Xie, Weibo
2012-01-01
Background The B-box (BBX) -containing proteins are a class of zinc finger proteins that contain one or two B-box domains and play important roles in plant growth and development. The Arabidopsis BBX gene family has recently been re-identified and renamed. However, there has not been a genome-wide survey of the rice BBX (OsBBX) gene family until now. Methodology/Principal Findings In this study, we identified 30 rice BBX genes through a comprehensive bioinformatics analysis. Each gene was assigned a uniform nomenclature. We described the chromosome localizations, gene structures, protein domains, phylogenetic relationship, whole life-cycle expression profile and diurnal expression patterns of the OsBBX family members. Based on the phylogeny and domain constitution, the OsBBX gene family was classified into five subfamilies. The gene duplication analysis revealed that only chromosomal segmental duplication contributed to the expansion of the OsBBX gene family. The expression profile of the OsBBX genes was analyzed by Affymetrix GeneChip microarrays throughout the entire life-cycle of rice cultivar Zhenshan 97 (ZS97). In addition, microarray analysis was performed to obtain the expression patterns of these genes under light/dark conditions and after three phytohormone treatments. This analysis revealed that the expression patterns of the OsBBX genes could be classified into eight groups. Eight genes were regulated under the light/dark treatments, and eleven genes showed differential expression under at least one phytohormone treatment. Moreover, we verified the diurnal expression of the OsBBX genes using the data obtained from the Diurnal Project and qPCR analysis, and the results indicated that many of these genes had a diurnal expression pattern. Conclusions/Significance The combination of the genome-wide identification and the expression and diurnal analysis of the OsBBX gene family should facilitate additional functional studies of the OsBBX genes. PMID:23118960
Applications of emerging transmission electron microscopy technology in PCD research and diagnosis.
Shoemark, Amelia
2017-01-01
Primary Ciliary Dyskinesia (PCD) is a heterogeneous genetic condition characterized by dysfunction of motile cilia. Patients suffer from chronic infection and inflammation of the upper and lower respiratory tract. Diagnosis of PCD is confirmed by identification of a hallmark defect of ciliary ultrastructure or by identification of biallelic pathogenic mutations in a known PCD gene. Since the first description of PCD in 1976, assessment of ciliary ultrastructure by transmission electron microscopy (TEM) has been central to diagnosis and research. Electron tomography is a technique whereby a series of transmission electron micrographs are collected at different angles and reconstructed into a single 3D model of a specimen. Electron tomography provides improved spatial information and resolution compared to a single micrograph. Research by electron tomography has revealed new insight into ciliary ultrastructure and consequently ciliary function at a molecular and cellular level. Gene discovery studies in PCD have utilized electron tomography to define the structural consequences of variants in cilia genes. Modern transmission electron microscopes capable of electron tomography are increasingly being installed in clinical laboratories. This presents the possibility for the use of tomography technique in a diagnostic setting. This review describes the electron tomography technique, the contribution tomography has made to the understanding of basic cilia structure and function and finally the potential of the technique for use in PCD diagnosis.
Using secondary structure to identify ribosomal numts: cautionary examples from the human genome.
Olson, Link E; Yoder, Anne D
2002-01-01
The identification of inadvertently sequenced mitochondrial pseudogenes (numts) is critical to any study employing mitochondrial DNA sequence data. Failure to discriminate numts correctly can confound phylogenetic reconstruction and studies of molecular evolution. This is especially problematic for ribosomal mtDNA genes. Unlike protein-coding loci, whose pseudogenes tend to accumulate diagnostic frameshift or premature stop mutations, functional ribosomal genes are not constrained to maintain a reading frame and can accumulate insertion-deletion events of varying length, particularly in nonpairing regions. Several authors have advocated using structural features of the transcribed rRNA molecule to differentiate functional mitochondrial rRNA genes from their nuclear paralogs. We explored this approach using the mitochondrial 12S rRNA gene and three known 12S numts from the human genome in the context of anthropoid phylogeny and the inferred secondary structure of primate 12S rRNA. Contrary to expectation, each of the three human numts exhibits striking concordance with secondary structure models, with little, if any, indication of their pseudogene status, and would likely escape detection based on structural criteria alone. Furthermore, we show that the unwitting inclusion of a particularly ancient (18-25 Myr old) and surprisingly cryptic human numt in a phylogenetic analysis would yield a well-supported but dramatically incorrect conclusion regarding anthropoid relationships. Though we endorse the use of secondary structure models for inferring positional homology wholeheartedly, we caution against reliance on structural criteria for the discrimination of rRNA numts, given the potential fallibility of this approach.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stephens, R.L.; Haygood, M.G.; Lidstrom, M.E.
An open-reading-frame fragment of a Methylobacterium sp. strain AM1 gene (moxF) encoding a portion of the methanol dehydrogenase structural protein has been used as a hybridization probe to detect similar sequences in a variety of methylotrophic bacteria. This hybridization was used to isolate clones containing putative moxF genes from two obligate methanotrophic bacteria, Methylococcus capsulatus Bath and Methylomonas albus BG8. The identity of these genes was confirmed in two ways. A T7 expression vector was used to produce methanol dehydrogenase protein in Escherichia coli from the cloned genes, and in each case the protein was identified by immunoblotting with antiserum against the Methylomonas albus methanol dehydrogenase. In addition, a moxF mutant of Methylobacterium strain AM1 was complemented to a methanol-positive phenotype that partially restored methanol dehydrogenase activity, using broad-host-range plasmids containing the moxF genes from each methanotroph. The partial complementation of a moxF mutant in a facultative serine pathway methanol utilizer by moxF genes from type I and type X obligate methane utilizers suggests broad functional conservation of the methanol oxidation system among gram-negative methylotrophs.
Guo, Xingyi; Shi, Jiajun; Cai, Qiuyin; Shu, Xiao-Ou; He, Jing; Wen, Wanqing; Allen, Jamie; Pharoah, Paul; Dunning, Alison; Hunter, David J; Kraft, Peter; Easton, Douglas F; Zheng, Wei; Long, Jirong
2018-03-01
Functional disruptions of susceptibility genes by large germline structural variant (SV) deletions are known to be associated with cancer risk. However, few studies have been conducted to systematically search for SV deletions in breast cancer susceptibility genes. We analysed deep (>30x) whole-genome sequencing (WGS) data generated from blood samples of 128 breast cancer patients of Asian and European descent with either a strong family history of breast cancer or early-onset disease. To identify SV deletions in known or suspected breast cancer susceptibility genes, we used multiple SV calling tools including Genome STRiP, Delly, Manta, BreakDancer and Pindel. SV deletions were detected by at least three of these bioinformatics tools in five genes. Specifically, we identified heterozygous deletions covering a fraction of the coding regions of BRCA1 (approximately 80 kb, in two patients) and TP53 (∼1.6 kb, in two patients), and of intronic regions (∼1 kb) of PALB2 (one patient), PTEN (three patients) and RAD51C (one patient). We confirmed the presence of these deletions using real-time quantitative PCR (qPCR). Our study identified novel SV deletions in breast cancer susceptibility genes, and the identification of such SV deletions may improve clinical testing.
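The "detected by at least three tools" criterion in the abstract above can be illustrated with a small consensus filter. This is only a sketch: the abstract does not say how calls from different tools were matched, so the 50% reciprocal-overlap rule, the function names and the toy coordinates below are all assumptions made for illustration.

```python
def reciprocal_overlap(a, b):
    """Return the reciprocal overlap fraction of two (start, end) intervals."""
    overlap = min(a[1], b[1]) - max(a[0], b[0])
    if overlap <= 0:
        return 0.0
    # Fraction of the longer interval that is covered by the overlap.
    return min(overlap / (a[1] - a[0]), overlap / (b[1] - b[0]))


def consensus_deletions(calls, min_callers=3, min_ro=0.5):
    """Keep each call that is supported by at least `min_callers` distinct tools.

    `calls` is a list of (caller, chrom, start, end) tuples. Two calls are
    treated as the same deletion when they lie on the same chromosome and
    their reciprocal overlap is at least `min_ro` (an assumed matching rule).
    """
    supported = []
    for caller, chrom, start, end in calls:
        callers = {caller}
        for o_caller, o_chrom, o_start, o_end in calls:
            if o_chrom != chrom:
                continue
            if reciprocal_overlap((start, end), (o_start, o_end)) >= min_ro:
                callers.add(o_caller)
        if len(callers) >= min_callers:
            supported.append((chrom, start, end, sorted(callers)))
    return supported


# Toy input with hypothetical coordinates (not data from the study).
toy_calls = [
    ("Delly", "chr17", 1000, 81000),
    ("Manta", "chr17", 1200, 80500),
    ("Pindel", "chr17", 900, 81500),
    ("Delly", "chr10", 500, 1500),
    ("Manta", "chr10", 5000, 6000),
]
consensus = consensus_deletions(toy_calls)
```

With the toy input, the three overlapping chr17 calls support each other and are kept, while the two unrelated chr10 calls are each seen by a single tool and are dropped.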
Pim-1: A Molecular Target to Modulate Cellular Resistance to Therapy in Prostate Cancer
2005-10-01
Reiter RE, Lilly MB: Gene expression profiling in R-flurbiprofen-treated prostate cancer: Identification of prostate stem cell antigen as a... flurbiprofen-regulated gene. (submitted, 2006). 51. Holder SL, Zemskova M, Bremner R, Neidigh J, Lilly MB: Identification of specific, cell-permeable...profiling in R-flurbiprofen-treated prostate cancer: Identification of prostate stem cell antigen as a flurbiprofen-regulated gene. (poster
Jue, Dengwei; Sang, Xuelian; Lu, Shengqiao; Dong, Chen; Zhao, Qiufang; Chen, Hongliang; Jia, Liqiang
2015-01-01
Ubiquitination is a post-translational modification in which ubiquitin is attached to a substrate. Ubiquitin-conjugating enzymes (E2s) play a major role in the ubiquitin transfer pathway, as well as a variety of functions in plant biological processes. To date, no genome-wide characterization of this gene family has been conducted in maize (Zea mays). In the present study, a total of 75 putative ZmUBC genes have been identified and located in the maize genome. Phylogenetic analysis revealed that ZmUBC proteins could be divided into 15 subfamilies, which include 13 ubiquitin-conjugating enzymes (ZmE2s) and two independent ubiquitin-conjugating enzyme variant (UEV) groups. The predicted ZmUBC genes were distributed across 10 chromosomes at different densities. In addition, analysis of exon-intron junctions and sequence motifs in each candidate gene has revealed high levels of conservation within and between phylogenetic groups. Tissue expression analysis indicated that most ZmUBC genes were expressed in at least one of the tissues, indicating that these are involved in various physiological and developmental processes in maize. Moreover, expression profile analyses of ZmUBC genes under different stress treatments (4°C, 20% PEG6000, and 200 mM NaCl) and various expression patterns indicated that these may play crucial roles in the response of plants to stress. Genome-wide identification, chromosome organization, gene structure, evolutionary and expression analyses of ZmUBC genes have facilitated the characterization of this gene family and indicated its potential involvement in growth, development, and stress responses. This study provides valuable information for better understanding the classification and putative functions of the UBC-encoding genes of maize.
Jue, Dengwei; Sang, Xuelian; Lu, Shengqiao; Dong, Chen; Zhao, Qiufang; Chen, Hongliang; Jia, Liqiang
2015-01-01
Background: Ubiquitination is a post-translational modification in which ubiquitin is attached to a substrate. Ubiquitin-conjugating enzymes (E2s) play a major role in the ubiquitin transfer pathway, as well as a variety of functions in plant biological processes. To date, no genome-wide characterization of this gene family has been conducted in maize (Zea mays). Methodology/Principal Findings: In the present study, a total of 75 putative ZmUBC genes have been identified and located in the maize genome. Phylogenetic analysis revealed that ZmUBC proteins could be divided into 15 subfamilies, which include 13 ubiquitin-conjugating enzymes (ZmE2s) and two independent ubiquitin-conjugating enzyme variant (UEV) groups. The predicted ZmUBC genes were distributed across 10 chromosomes at different densities. In addition, analysis of exon-intron junctions and sequence motifs in each candidate gene has revealed high levels of conservation within and between phylogenetic groups. Tissue expression analysis indicated that most ZmUBC genes were expressed in at least one of the tissues, indicating that these are involved in various physiological and developmental processes in maize. Moreover, expression profile analyses of ZmUBC genes under different stress treatments (4°C, 20% PEG6000, and 200 mM NaCl) and various expression patterns indicated that these may play crucial roles in the response of plants to stress. Conclusions: Genome-wide identification, chromosome organization, gene structure, evolutionary and expression analyses of ZmUBC genes have facilitated the characterization of this gene family and indicated its potential involvement in growth, development, and stress responses. This study provides valuable information for better understanding the classification and putative functions of the UBC-encoding genes of maize. PMID:26606743
Abrouk, Michael; Balcárková, Barbora; Šimková, Hana; Komínkova, Eva; Martis, Mihaela M; Jakobson, Irena; Timofejeva, Ljudmilla; Rey, Elodie; Vrána, Jan; Kilian, Andrzej; Järve, Kadri; Doležel, Jaroslav; Valárik, Miroslav
2017-02-01
The capacity of the bread wheat (Triticum aestivum) genome to tolerate introgression from related genomes can be exploited for wheat improvement. A resistance to powdery mildew expressed by a derivative of the cross bread wheat cv. Tähti × T. militinae (Tm) is known to be due to the incorporation of a Tm segment into the long arm of chromosome 4A. Here, a newly developed in silico method termed rearrangement identification and characterization (RICh) has been applied to characterize the introgression. A virtual gene order, assembled using the GenomeZipper approach, was obtained for the native copy of chromosome 4A; it incorporated 570 4A DArTseq markers to produce a zipper comprising 2132 loci. A comparison between the native and introgressed forms of the 4AL chromosome arm showed that the introgressed region is located at the distal part of the arm. The Tm segment, derived from chromosome 7G, harbours 131 homoeologs of the 357 genes present on the corresponding region of Chinese Spring 4AL. The estimated number of Tm genes transferred along with the disease resistance gene was 169. Characterizing the introgression's position, gene content and internal gene order should not only facilitate gene isolation, but may also be informative with respect to chromatin structure and behaviour studies. © 2016 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Complete genome sequence of Paenibacillus sp. strain JDR-2
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chow, Virginia; Nong, Guang; St. John, Franz J.
2012-01-01
Paenibacillus sp. strain JDR-2, an aggressively xylanolytic bacterium isolated from sweetgum (Liquidambar styraciflua) wood, is able to efficiently depolymerize, assimilate and metabolize 4-O-methylglucuronoxylan, the predominant structural component of hardwood hemicelluloses. A basis for this capability was first supported by the identification of genes and characterization of encoded enzymes and has been further defined by the sequencing and annotation of the complete genome, which we describe. In addition to genes implicated in the utilization of β-1,4-xylan, genes have also been identified for the utilization of other hemicellulosic polysaccharides. The genome of Paenibacillus sp. JDR-2 contains 7,184,930 bp in a single replicon with 6,288 protein-coding and 122 RNA genes. Uniquely prominent are 874 genes encoding proteins involved in carbohydrate transport and metabolism. The prevalence and organization of these genes support a metabolic potential for bioprocessing of hemicellulose fractions derived from lignocellulosic resources.
Röder, Christoph; König, Helmut; Fröhlich, Jürgen
2007-09-01
Sequencing of the complete 26S rRNA genes of all Dekkera/Brettanomyces species colonizing different beverages revealed the potential for a specific primer and probe design to support diagnostic PCR approaches and FISH. By analysis of the complete 26S rRNA genes of all five currently known Dekkera/Brettanomyces species (Dekkera bruxellensis, D. anomala, Brettanomyces custersianus, B. nanus and B. naardenensis), several regions with high nucleotide sequence variability yet distinct from the D1/D2 domains were identified. FISH species-specific probes targeting the 26S rRNA gene's most variable regions were designed. Accessibility of probe targets for hybridization was facilitated by the construction of partially complementary 'side'-labeled probes, based on secondary structure models of the rRNA sequences. The specificity and routine applicability of the FISH-based method for yeast identification were tested by analyzing different wine isolates. Investigation of the prevalence of Dekkera/Brettanomyces yeasts in the German viticultural regions Wonnegau, Nierstein and Bingen (Rhinehesse, Rhineland-Palatinate) resulted in the isolation of 37 D. bruxellensis strains from 291 wine samples.
Accurate population genetic measurements require cryptic species identification in corals
NASA Astrophysics Data System (ADS)
Sheets, Elizabeth A.; Warner, Patricia A.; Palumbi, Stephen R.
2018-06-01
Correct identification of closely related species is important for reliable measures of gene flow. Incorrectly lumping individuals of different species together has been shown to over- or underestimate population differentiation, but examples highlighting when these different results are observed in empirical datasets are rare. Using 199 single nucleotide polymorphisms, we assigned 768 individuals in the Acropora hyacinthus and A. cytherea morphospecies complexes to each of eight previously identified cryptic genetic species and measured intraspecific genetic differentiation across three geographic scales (within reefs, among reefs within an archipelago, and among Pacific archipelagos). We then compared these calculations to estimated genetic differentiation at each scale with all cryptic genetic species mixed as if we could not tell them apart. At the reef scale, correct genetic species identification yielded lower F_ST estimates and fewer significant comparisons than when species were mixed, raising estimates of short-scale gene flow. In contrast, correct genetic species identification at large spatial scales yielded higher F_ST measurements than mixed-species comparisons, lowering estimates of long-term gene flow among archipelagos. A meta-analysis of published population genetic studies in corals found similar results: F_ST estimates at small spatial scales were lower and significance was found less often in studies that controlled for cryptic species. Our results and these prior datasets controlling for cryptic species suggest that genetic differentiation among local reefs may be lower than what has generally been reported in the literature. Not properly controlling for cryptic species structure can bias population genetic analyses in different directions across spatial scales, and this has important implications for conservation strategies that rely on these estimates.
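For readers unfamiliar with the differentiation statistic discussed above, the following sketch computes a textbook Wright-style F_ST for one biallelic SNP as (H_T - H_S) / H_T, with equal weight given to each population. The abstract does not state which estimator the authors used, so this formula, the function name and the example allele frequencies are assumptions chosen for illustration only.

```python
def fst_biallelic(freqs):
    """Wright-style F_ST for one biallelic locus.

    `freqs` holds the frequency of one allele in each population.
    Populations are weighted equally (a simplifying assumption).
    """
    n = len(freqs)
    p_bar = sum(freqs) / n
    # Expected heterozygosity of the pooled population.
    h_t = 2 * p_bar * (1 - p_bar)
    # Mean expected heterozygosity within populations.
    h_s = sum(2 * p * (1 - p) for p in freqs) / n
    if h_t == 0:
        # The locus is fixed for the same allele everywhere: no differentiation.
        return 0.0
    return (h_t - h_s) / h_t


# Hypothetical allele frequencies, not values from the study.
identical = fst_biallelic([0.5, 0.5])
fixed_difference = fst_biallelic([0.0, 1.0])
intermediate = fst_biallelic([0.2, 0.8])
```

Identical frequencies give 0, a fixed difference gives 1, and the intermediate case falls between the two, which is the scale on which the "lower" and "higher" estimates in the abstract are compared.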
Maintaining the Brain: Insight into Human Neurodegeneration From Drosophila Mutants
Lessing, Derek; Bonini, Nancy M.
2009-01-01
The fruit fly Drosophila melanogaster has brought significant advances to research in neurodegenerative disease, notably in the identification of genes that are required to maintain the structural integrity of the brain, defined by recessive mutations that cause adult-onset neurodegeneration. Here, we survey these genes in the fly and classify them according to five key cell biological processes. Over half of these genes have counterparts in mouse or human that are also associated with neurodegeneration. Fly genetics continues to be instrumental in the analysis of degenerative disease, with notable recent advances in our understanding of several inherited disorders, as well as Parkinson’s Disease and the central role of mitochondria in neuronal maintenance. PMID:19434080
Xu, Jianing; Xing, Shanshan; Cui, Haoran; Chen, Xuesen; Wang, Xiaoyun
2016-04-01
The ubiquitin-protein ligases (E3s) directly participate in transferring ubiquitin (Ub) to target proteins in the ubiquitination pathway. The HECT ubiquitin-protein ligase (UPL), one type of E3, is characterized by a conserved HECT domain of approximately 350 amino acids at the C terminus. Some UPLs were found to be involved in trichome development and leaf senescence in Arabidopsis. However, studies on plant UPLs, such as characteristics of the protein structure, predicted functional motifs of the HECT domain, and the regulatory expression of UPLs, have all been limited. Here, we present genome-wide identification of the genes encoding UPLs (HECT genes) in apple. The 13 genes (named MdUPL1-MdUPL13) from ten different chromosomes were divided into four groups by phylogenetic analysis. Among these groups, the intron-exon structures of the encoding genes and the additional functional domains they contain were quite different. Notably, the F-box domain was first found in MdUPL7 in plant UPLs. The HECT domain in different MdUPL groups also presented different spatial features, and three types of conserved motifs were identified. Cis-acting element analysis showed that the promoter of each MdUPL member carried multiple stress-response related elements. qRT-PCR assays demonstrated that the expression of several MdUPLs was quite sensitive to cold, drought, and salt stresses. The results of this study help to elucidate the functions of HECT proteins, especially in Rosaceae plants.
Khamis, Atieh; Raoult, Didier; La Scola, Bernard
2005-01-01
Higher proportions (91%) of 168 corynebacterial isolates were positively identified by partial rpoB gene determination than by that based on 16S rRNA gene sequences. This method is thus a simple, molecular-analysis-based method for identification of corynebacteria, but it should be used in conjunction with other tests for definitive identification. PMID:15815024
Ruppitsch, W; Stöger, A; Indra, A; Grif, K; Schabereiter-Gurtner, C; Hirschl, A; Allerberger, F
2007-03-01
In a bioterrorism event a rapid tool is needed to identify relevant dangerous bacteria. The aim of the study was to assess the usefulness of partial 16S rRNA gene sequence analysis and the suitability of diverse databases for identifying dangerous bacterial pathogens. For rapid identification purposes a 500-bp fragment of the 16S rRNA gene of 28 isolates comprising Bacillus anthracis, Brucella melitensis, Burkholderia mallei, Burkholderia pseudomallei, Francisella tularensis, Yersinia pestis, and eight genus-related and unrelated control strains was amplified and sequenced. The obtained sequence data were submitted to three public and two commercial sequence databases for species identification. The most frequent reason for incorrect identification was the lack of the respective 16S rRNA gene sequences in the database. Sequence analysis of a 500-bp 16S rDNA fragment allows the rapid identification of dangerous bacterial species. However, for discrimination of closely related species sequencing of the entire 16S rRNA gene, additional sequencing of the 23S rRNA gene or sequencing of the 16S-23S rRNA intergenic spacer is essential. This work provides comprehensive information on the suitability of partial 16S rDNA analysis and diverse databases for rapid and accurate identification of dangerous bacterial pathogens.
PanGEA: identification of allele specific gene expression using the 454 technology.
Kofler, Robert; Teixeira Torres, Tatiana; Lelley, Tamas; Schlötterer, Christian
2009-05-14
Next generation sequencing technologies hold great potential for many biological questions. While mainly used for genomic sequencing, they are also very promising for gene expression profiling. Sequencing of cDNA does not only provide an estimate of the absolute expression level, it can also be used for the identification of allele specific gene expression. We developed PanGEA, a tool which enables a fast and user-friendly analysis of allele specific gene expression using the 454 technology. PanGEA allows mapping of 454-ESTs to genes or whole genomes, displaying gene expression profiles, identification of SNPs and the quantification of allele specific gene expression. The intuitive GUI of PanGEA facilitates a flexible and interactive analysis of the data. PanGEA additionally implements a modification of the Smith-Waterman algorithm which deals with incorrect estimates of homopolymer length as occurring in the 454 technology. To our knowledge, PanGEA is the first tool which facilitates the identification of allele specific gene expression. PanGEA is distributed under the Mozilla Public License and available at: http://www.kofler.or.at/bioinformatics/PanGEA
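As background to the alignment step mentioned above, here is a minimal sketch of the standard Smith-Waterman local alignment score with a linear gap penalty. It is not PanGEA's homopolymer-aware modification, which the abstract does not describe; the scoring values and the example sequences are arbitrary assumptions.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Return the best local alignment score between sequences a and b.

    Standard Smith-Waterman recurrence with a linear gap penalty. Only the
    previous row of the score matrix is kept, since no traceback is done.
    """
    cols = len(b) + 1
    prev = [0] * cols
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * cols
        for j in range(1, cols):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            up = prev[j] + gap        # gap in b
            left = curr[j - 1] + gap  # gap in a
            # Local alignment: scores never drop below zero.
            curr[j] = max(0, diag, up, left)
            best = max(best, curr[j])
        prev = curr
    return best


# Hypothetical read with one extra T in a homopolymer run versus a reference.
read = "ACGTTTGA"
reference = "ACGTTGA"
score = smith_waterman_score(read, reference)
```

In this toy case the extra base in the TTT run costs one ordinary gap penalty. A homopolymer-aware variant would presumably penalize such gaps less, but how PanGEA does this is not stated in the abstract.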
PanGEA: Identification of allele specific gene expression using the 454 technology
Kofler, Robert; Teixeira Torres, Tatiana; Lelley, Tamas; Schlötterer, Christian
2009-01-01
Background: Next generation sequencing technologies hold great potential for many biological questions. While mainly used for genomic sequencing, they are also very promising for gene expression profiling. Sequencing of cDNA does not only provide an estimate of the absolute expression level, it can also be used for the identification of allele specific gene expression. Results: We developed PanGEA, a tool which enables a fast and user-friendly analysis of allele specific gene expression using the 454 technology. PanGEA allows mapping of 454-ESTs to genes or whole genomes, displaying gene expression profiles, identification of SNPs and the quantification of allele specific gene expression. The intuitive GUI of PanGEA facilitates a flexible and interactive analysis of the data. PanGEA additionally implements a modification of the Smith-Waterman algorithm which deals with incorrect estimates of homopolymer length as occurring in the 454 technology. Conclusion: To our knowledge, PanGEA is the first tool which facilitates the identification of allele specific gene expression. PanGEA is distributed under the Mozilla Public License and available at: PMID:19442283
Jauregui, Andrew R; Savalia, Dhruti; Lowry, Virginia K; Farrell, Cara M; Wathelet, Marc G
2013-01-01
An epidemic of Severe Acute Respiratory Syndrome (SARS) led to the identification of an associated coronavirus, SARS-CoV. This virus evades the host innate immune response in part through the expression of its non-structural protein (nsp) 1, which inhibits both host gene expression and virus- and interferon (IFN)-dependent signaling. Thus, nsp1 is a promising target for drugs, as inhibition of nsp1 would make SARS-CoV more susceptible to the host antiviral defenses. To gain a better understanding of nsp1 mode of action, we generated and analyzed 38 mutants of the SARS-CoV nsp1, targeting 62 solvent exposed residues out of the 180 amino acid protein. From this work, we identified six classes of mutants that abolished, attenuated or increased nsp1 inhibition of host gene expression and/or antiviral signaling. Each class of mutants clustered on SARS-CoV nsp1 surface and suggested nsp1 interacts with distinct host factors to exert its inhibitory activities. Identification of the nsp1 residues critical for its activities and the pathways involved in these activities should help in the design of drugs targeting nsp1. Significantly, several point mutants increased the inhibitory activity of nsp1, suggesting that coronaviruses could evolve a greater ability to evade the host response through mutations of such residues.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fields, C.A.
1994-09-01
This Report concludes the DOE Human Genome Program project, "Identification of Genes in Anonymous DNA Sequence." The central goals of this project have been (1) understanding the problem of identifying genes in anonymous sequences, and (2) development of tools, primarily the automated identification system gm, for identifying genes. The activities supported under the previous award are summarized here to provide a single complete report on the activities supported as part of the project from its inception to its completion.
Li, Xiaoqin; Guo, Rongrong; Li, Jun; Singer, Stacy D; Zhang, Yucheng; Yin, Xiangjing; Zheng, Yi; Fan, Chonghui; Wang, Xiping
2013-10-01
Aldehyde dehydrogenases (ALDHs) represent a protein superfamily encoding NAD(P)(+)-dependent enzymes that oxidize a wide range of endogenous and exogenous aliphatic and aromatic aldehydes. In plants, they are involved in many biological processes and play a role in the response to environmental stress. In this study, a total of 39 ALDH genes from ten families were identified in the apple (Malus × domestica Borkh.) genome. Synteny analysis of the apple ALDH (MdALDH) genes indicated that segmental and tandem duplications, as well as whole genome duplications, have likely contributed to the expansion and evolution of these gene families in apple. Moreover, synteny analysis between apple and Arabidopsis demonstrated that several MdALDH genes were found in the corresponding syntenic blocks of Arabidopsis, suggesting that these genes appeared before the divergence of lineages that led to apple and Arabidopsis. In addition, phylogenetic analysis, as well as comparisons of exon-intron and protein structures, provided further insight into both their evolutionary relationships and their putative functions. Tissue-specific expression analysis of the MdALDH genes demonstrated diverse spatiotemporal expression patterns, while their expression profiles under abiotic stress and various hormone treatments indicated that many MdALDH genes were responsive to high salinity and drought, as well as different plant hormones. This genome-wide identification, as well as characterization of evolutionary relationships and expression profiles, of the apple MdALDH genes will not only be useful for the further analysis of ALDH genes and their roles in stress response, but may also aid in the future improvement of apple stress tolerance. Copyright © 2013 Elsevier Masson SAS. All rights reserved.
He, Yi; Ahmad, Dawood; Zhang, Xu; Zhang, Yu; Wu, Lei; Jiang, Peng; Ma, Hongxiang
2018-04-19
Fusarium head blight (FHB), a devastating disease of wheat worldwide, results in yield losses and in the accumulation of mycotoxins, such as deoxynivalenol (DON), in infected grains. DON also facilitates pathogen colonization and the spread of FHB symptoms during disease development. UDP-glycosyltransferase enzymes (UGTs) are known to contribute to detoxification and enhance FHB resistance by glycosylating DON into DON-3-glucoside (D3G) in wheat. However, a comprehensive investigation of wheat (Triticum aestivum) UGT genes is still lacking. In this study, we carried out a genome-wide analysis of family-1 UDP glycosyltransferases in wheat based on the PSPG conserved box that resulted in the identification of 179 putative UGT genes. The identified genes were clustered into 16 major phylogenetic groups with a lack of phylogenetic group K. The UGT genes were invariably distributed among all the chromosomes of the 3 genomes. At least 10 intron insertion events were found in the UGT sequences, where intron 4 was observed as the most conserved intron. The expression analysis of the wheat UGT genes using both online microarray data and quantitative real-time PCR verification suggested the distinct role of UGT genes in different tissues and developmental stages. The expression of many UGT genes was up-regulated after Fusarium graminearum inoculation, and six of the genes were further verified by RT-qPCR. We identified 179 UGT genes from wheat using the available sequenced wheat genome. This study provides useful insight into the phylogenetic structure, distribution, and expression patterns of family-1 UDP glycosyltransferases in wheat. The results also offer a foundation for future work aimed at elucidating the molecular mechanisms underlying the resistance to FHB and DON accumulation.
Karpe, Snehal D.; Jain, Rikesh; Brockmann, Axel; Sowdhamini, Ramanathan
2016-01-01
We developed a computational pipeline for homology based identification of the complete repertoire of olfactory receptor (OR) genes in the Asian honey bee species, Apis florea. Apis florea is phylogenetically the most basal honey bee species and also the most distant sister species to the Western honey bee Apis mellifera, for which all OR genes had been identified before. Using our pipeline, we identified 180 OR genes in A. florea, which is very similar to the number of ORs identified in A. mellifera (177 ORs). Many characteristics of the ORs including gene structure, synteny of tandemly repeated ORs and basic phylogenetic clustering are highly conserved. The composite phylogenetic tree of A. florea and A. mellifera ORs could be divided into 21 clades which are in harmony with the existing Hymenopteran tree. However, we found a few nonorthologous OR relationships between both species as well as independent pseudogenization of ORs suggesting separate evolutionary changes. Particularly, a subgroup of the OR gene clade XI, which had been hypothesized to code cuticular hydrocarbon receptors showed a high number of species-specific ORs. RNAseq analysis detected a total number of 145 OR transcripts in male and 162 in female antennae. Most of the OR genes were highly expressed on the female antennae. However, we detected five distinct male-biased OR genes, out of which three genes (AfOr11, AfOr18, AfOr170P) were shown to be male-biased in A. mellifera, too, thus corroborating a behavioral function in sex-pheromone communication. PMID:27540087
USDA-ARS's Scientific Manuscript database
Matrix metalloproteinase-13 (MMP-13), referred to as collagenase-3, is a proteolytic enzyme that plays a key role in degradation and remodelling of host extracellular matrix proteins. The objective of this study was to characterize the MMP-13 gene in channel catfish, and to determine its pattern of e...
Preston R. Aldrich; George R. Parker; Charles H. Michler; Jeanne Romero-Severson
2003-01-01
The red oaks (Quercus section Lobatae) include important timber species, but we know little about their gene pools. Red oak species can be difficult to identify, possibly because of extensive interspecific hybridization, although most evidence of this is morphological. We used 15 microsatellite loci to examine the genetic...
Mattiucci, S; Cimmaruta, R; Cipriani, P; Abaunza, P; Bellisario, B; Nascetti, G
2015-01-01
The unique environment of the Mediterranean Sea makes fish stock assessment a major challenge. Stock identification of Mediterranean fisheries has been based mostly on data on biology, morphometrics, artificial tags, otolith shape and fish genetics, with less effort on the use of parasites as biomarkers. Here we use some case studies comparing Mediterranean vs Atlantic fish stocks in a multidisciplinary framework. The generalized Procrustes Rotation (PR) was used to assess the association between host genetics and larval Anisakis spp. datasets on demersal (hake) and pelagic (horse mackerel, swordfish) species. When discordant results emerged, they were due to the different features of the data. While fish population genetics can detect changes over an evolutionary timescale, providing indications on the cohesive action of gene flow, parasites are more suitable biomarkers when considering fish stocks over smaller temporal and spatial scales, hence giving information on fish movements over their lifespan. Future studies on the phylogeographic analysis of parasites suitable as biomarkers, and that of their fish host, performed on the same genes, will represent a further tool to be included in multidisciplinary studies on fish stock structure.
Isolation and identification of a bovine viral diarrhea virus from sika deer in China.
Gao, Yugang; Wang, Shijie; Du, Rui; Wang, Quankai; Sun, Changjiang; Wang, Nan; Zhang, Pengju; Zhang, Lianxue
2011-02-25
Bovine viral diarrhea virus (BVDV) infections continue to cause significant losses in the deer population. Better isolation and identification of BVDV from sika deer may contribute significantly to the development of prophylactic, therapeutic, and diagnostic reagents, as well as help in the prevention and control of BVDV. However, the isolation and identification of BVDV from sika deer is seldom reported in the literature. In this study, we collected samples on the basis of clinical signs of BVDV infection in order to isolate and identify BVDV from sika deer. We isolated a suspected BVDV strain, named CCSYD, from the liver of an aborted sika deer fetus in Changchun (China) using MDBK cell lines, and identified it by cytopathic effect (CPE), indirect immunoperoxidase test (IPX) and electron microscopy (EM). These tests indicated that the virus was BVDV. The structural protein E0 gene was cloned and sequenced. The obtained E0 gene sequence has been submitted to GenBank under accession number FJ555203. Alignment with 9 other strains of BVDV, 7 strains of classical swine fever virus (CSFV) and 3 strains of border disease virus (BDV) from around the world showed nucleotide sequence homologies of 98.6%-84.8%, 76.0%-74.7% and 76.6%-77.0%, respectively. Phylogenetic analysis indicated that the newly isolated CCSYD strain belongs to BVDV1b. To the best of our knowledge, this is the first report of BVDV being isolated and identified in sika deer. This research contributes to the development of a new BVDV vaccine to prevent and control BVD in sika deer.
Mu, Chuang; Wang, Ruijia; Li, Tianqi; Li, Yuqiang; Tian, Meilin; Jiao, Wenqian; Huang, Xiaoting; Zhang, Lingling; Hu, Xiaoli; Wang, Shi; Bao, Zhenmin
2016-08-01
Long non-coding RNA (lncRNA) structurally resembles mRNA but cannot be translated into protein. Although the systematic identification and characterization of lncRNAs have been increasingly reported in model species, information concerning non-model species is still lacking. Here, we report the first systematic identification and characterization of lncRNAs in two sea cucumber species: (1) Apostichopus japonicus during lipopolysaccharide (LPS) challenge and in healthy tissues and (2) Holothuria glaberrima during radial organ complex regeneration, using RNA-seq datasets and bioinformatics analysis. We identified A. japonicus and H. glaberrima lncRNAs that were differentially expressed during LPS challenge and radial organ complex regeneration, respectively. Notably, the predicted lncRNA-microRNA-gene trinities revealed that, in addition to targeting protein-coding transcripts, miRNAs might also target lncRNAs, thereby participating in a potential novel layer of regulatory interactions among non-coding RNA classes in echinoderms. Furthermore, the constructed coding-non-coding network implied the potential involvement of lncRNA-gene interactions during the regulation of several important genes (e.g., Toll-like receptor 1 [TLR1] and transglutaminase-1 [TGM1]) in response to LPS challenge and radial organ complex regeneration in sea cucumbers. Overall, this pioneering systematic identification, annotation, and characterization of lncRNAs in echinoderms paves the way for similar studies and future genetic, genomic, and evolutionary research in non-model species.
Ganie, Showkat Ahmad; Pani, Dipti Ranjan; Mondal, Tapan Kumar
2017-01-01
DUF221 domain-containing genes (DDP genes) play important roles in developmental biology, hormone signalling transduction, and responses to abiotic stress. Therefore to understand their structural and evolutionary relationship, we did a genome-wide analysis of this important gene family in rice. Further, through comparative genomics, DDP genes from Oryza sativa subsp. (indica), nine different wild species of rice and Arabidopsis were also identified. We also found an expansion of the DDP gene families in rice and Arabidopsis which is due to the segmental duplication events in some of the gene family members. In general, a highly purifying selection was found acting on all the deduced paralogous and orthologous DDP gene pairs. The data from microarray and subsequent qRT-PCR analysis revealed that although several OsDDPs were differentially regulated under salinity stress, yet OsDDP6 was upregulated at all the developmental stages in salt tolerant rice genotype, FL478. Interestingly, OsDDP6 was found to be involved in proline metabolism pathway as indicated by protein network analysis. The diverse gene structures, varied transmembrane topologies and the differential expression patterns implied the functional diversity in DDP genes. Therefore, the comprehensive evolutionary analysis of DDP genes from different Oryza species and Arabidopsis performed in this study will provide the basis for further functional validation studies vis-à-vis DDP genes of rice and other plant species. PMID:28846681
Rivera-Posada, J A; Pratchett, M; Cano-Gomez, A; Arango-Gomez, J D; Owens, L
2011-09-09
We used a polyphasic approach for precise identification of bacterial flora (Vibrionaceae) isolated from crown-of-thorns starfish (COTS) from Lizard Island (Great Barrier Reef, Australia) and Guam (U.S.A., Western Pacific Ocean). Previous 16S rRNA gene phylogenetic analysis was useful to allocate and identify isolates within the Photobacterium, Splendidus and Harveyi clades but failed in the identification of Vibrio harveyi-like isolates. Species of the V. harveyi group have almost indistinguishable phenotypes and genotypes, and thus, identification by standard biochemical tests and 16S rRNA gene analysis is commonly inaccurate. Biochemical profiling and sequence analysis of additional topA and mreB housekeeping genes were carried out for definitive identification of 19 bacterial isolates recovered from sick and wild COTS. For 8 isolates, biochemical profiles and topA and mreB gene sequence alignments with the closest relatives (GenBank) confirmed previous 16S rRNA-based identification: V. fortis and Photobacterium eurosenbergii species (from wild COTS), and V. natriegens (from diseased COTS). Further phylogenetic analysis based on topA and mreB concatenated sequences served to identify the remaining 11 V. harveyi-like isolates: V. owensii and V. rotiferianus (from wild COTS), and V. owensii, V. rotiferianus, and V. harveyi (from diseased COTS). This study further confirms the reliability of topA-mreB gene sequence analysis for identification of these close species, and it reveals a wider distribution range of the potentially pathogenic V. harveyi group.
Petti, C. A.; Polage, C. R.; Schreckenberger, P.
2005-01-01
Traditional methods for microbial identification require the recognition of differences in morphology, growth, enzymatic activity, and metabolism to define genera and species. Full and partial 16S rRNA gene sequencing methods have emerged as useful tools for identifying phenotypically aberrant microorganisms. We report on three bacterial blood isolates from three different College of American Pathologists-certified laboratories that were referred to ARUP Laboratories for definitive identification. Because phenotypic identification suggested unusual organisms not typically associated with the submitted clinical diagnosis, consultation with the Medical Director was sought and further testing was performed including partial 16S rRNA gene sequencing. All three patients had endocarditis, and conventional methods identified isolates from patients A, B, and C as a Facklamia sp., Eubacterium tenue, and a Bifidobacterium sp. 16S rRNA gene sequencing identified the isolates as Enterococcus faecalis, Cardiobacterium valvarum, and Streptococcus mutans, respectively. We conclude that the initial identifications of these three isolates were erroneous, may have misled clinicians, and potentially impacted patient care. 16S rRNA gene sequencing is a more objective identification tool, unaffected by phenotypic variation or technologist bias, and has the potential to reduce laboratory errors. PMID:16333109
James M. Slavicek; Nancy Hayes-Plazolles
1991-01-01
Viral immediate early gene products are usually regulatory proteins that control expression of other viral genes at the transcriptional level or are proteins that are part of the viral DNA replication complex. The identification and functional characterization of the immediate early gene products of Lymantria dispar nuclear polyhedrosis virus (LdNPV...
Jahandideh, Samad; Srinivasasainagendra, Vinodh; Zhi, Degui
2012-11-07
RNA-protein interaction plays an important role in various cellular processes, such as protein synthesis, gene regulation, post-transcriptional gene regulation, alternative splicing, and infections by RNA viruses. In this study, using the Gene Ontology Annotated (GOA) and Structural Classification of Proteins (SCOP) databases, an automatic procedure was designed to capture structurally solved RNA-binding protein domains in different subclasses. Subsequently, we applied tuned multi-class SVM (TMCSVM), Random Forest (RF), and multi-class ℓ1/ℓq-regularized logistic regression (MCRLR) for analyzing and classifying RNA-binding protein domains based on a comprehensive set of sequence and structural features, and compared the prediction accuracy of these three state-of-the-art predictor methods. In our results, TMCSVM outperforms the other methods, which suggests its potential as a useful tool for facilitating the multi-class prediction of RNA-binding protein domains. MCRLR, on the other hand, elucidates the importance of individual features for the predictive accuracy of RNA-binding protein domain subclasses and thereby helps provide some biological insights into the roles of sequences and structures in protein-RNA interactions.
Grindberg, Rashel V.; Ishoey, Thomas; Brinza, Dumitru; Esquenazi, Eduardo; Coates, R. Cameron; Liu, Wei-ting; Gerwick, Lena; Dorrestein, Pieter C.; Pevzner, Pavel; Lasken, Roger; Gerwick, William H.
2011-01-01
Filamentous marine cyanobacteria are extraordinarily rich sources of structurally novel, biomedically relevant natural products. To understand their biosynthetic origins as well as produce increased supplies and analog molecules, access to the clustered biosynthetic genes that encode for the assembly enzymes is necessary. Complicating these efforts is the universal presence of heterotrophic bacteria in the cell wall and sheath material of cyanobacteria obtained from the environment and those grown in uni-cyanobacterial culture. Moreover, the high similarity in genetic elements across disparate secondary metabolite biosynthetic pathways renders imprecise current gene cluster targeting strategies and contributes sequence complexity resulting in partial genome coverage. Thus, it was necessary to use a dual-method approach of single-cell genomic sequencing based on multiple displacement amplification (MDA) and metagenomic library screening. Here, we report the identification of the putative biosynthetic gene cluster for apratoxin A, a potent cancer cell cytotoxin with promise for medicinal applications. The roughly 58 kb biosynthetic gene cluster is composed of 12 open reading frames and has a type I modular mixed polyketide synthase/nonribosomal peptide synthetase (PKS/NRPS) organization and features loading and off-loading domain architecture never previously described. Moreover, this work represents the first successful isolation of a complete biosynthetic gene cluster from Lyngbya bouillonii, a tropical marine cyanobacterium renowned for its production of diverse bioactive secondary metabolites. PMID:21533272
Kaneko, Jun; Narita-Yamada, Sachiko; Wakabayashi, Yukari; Kamio, Yoshiyuki
2009-01-01
The temperate phage φSLT of Staphylococcus aureus carries genes for Panton-Valentine leukocidin. Here, we identify ORF636, a constituent of the phage tail tip structure, as a recognition/adhesion protein for a poly(glycerophosphate) chain of lipoteichoic acid on the cell surface of S. aureus. ORF636 bound specifically to S. aureus; it did not bind to any other staphylococcal species or to several gram-positive bacteria. PMID:19429614
Structural and Biochemical Characterization of a Novel Aminopeptidase from Human Intestine
Tykvart, Jan; Bařinka, Cyril; Svoboda, Michal; ...
2015-03-09
N-acetylated α-linked acidic dipeptidase-like protein (NAALADase L), encoded by the NAALADL1 gene, is a close homolog of glutamate carboxypeptidase II, a metallopeptidase that has been intensively studied as a target for imaging and therapy of solid malignancies and neuropathologies. However, neither the physiological functions nor structural features of NAALADase L are known at present. In this paper, we report a thorough characterization of the protein product of the human NAALADL1 gene, including heterologous overexpression and purification, structural and biochemical characterization, and analysis of its expression profile. By solving the NAALADase L x-ray structure, we provide the first experimental evidence that it is a zinc-dependent metallopeptidase with a catalytic mechanism similar to that of glutamate carboxypeptidase II yet distinct substrate specificity. A proteome-based assay revealed that the NAALADL1 gene product possesses previously unrecognized aminopeptidase activity but no carboxy- or endopeptidase activity. These findings were corroborated by site-directed mutagenesis and identification of bestatin as a potent inhibitor of the enzyme. Analysis of NAALADL1 gene expression at both the mRNA and protein levels revealed the small intestine as the major site of protein expression and points toward extensive alternative splicing of the NAALADL1 gene transcript. Taken together, our data imply that the NAALADL1 gene product's primary physiological function is associated with the final stages of protein/peptide digestion and absorption in the human digestive system. Finally, based on these results, we suggest a new name for this enzyme: human ileal aminopeptidase (HILAP).
2010-01-01
Background Expansins form a large multi-gene family found in wheat and other cereal genomes that are involved in the expansion of cell walls as a tissue grows. The expansin family can be divided up into two main groups, namely, alpha-expansin (EXPA) and beta-expansin proteins (EXPB), with the EXPB group being of particular interest as group 1-pollen allergens. Results In this study, three beta-expansin genes were identified and characterized from a newly sequenced region of the Triticum aestivum cv. Chinese Spring chromosome 3B physical map at the Sr2 locus (FPC contig ctg11). The analysis of a 357 kb sub-sequence of FPC contig ctg11 identified one of the beta-expansin genes as TaEXPB11, originally identified as a cDNA from the wheat cv. Wyuna. Through the analysis of intron sequences of the three wheat cv. Chinese Spring genes, we propose that two of these beta-expansin genes are duplications of the TaEXPB11 gene. Comparative sequence analysis with two other wheat cultivars (cv. Westonia and cv. Hope) and a Triticum aestivum var. spelta line validated the identification of the Chinese Spring variant of TaEXPB11. The expression in maternal and grain tissues was confirmed by examining EST databases and carrying out RT-PCR experiments. Detailed examination of the position of TaEXPB11 relative to the locus encoding Sr2 disease resistance ruled out the possibility of this gene directly contributing to the resistance phenotype. Conclusions Through 3-D structural protein comparisons with Zea mays EXPB1, we proposed that variations within the coding sequence of TaEXPB11 in wheats may produce a functional change within features such as domain 1 related to possible involvement in cell wall structure and domain 2 defining the pollen allergen domain and binding to IgE protein.
The variation established in this gene suggests it is a clearly identifiable member of a gene family and reflects the dynamic features of the wheat genome as it adapted to a range of different environments and uses. Accession Numbers: ctg11 = FN564426. Survey sequences of TaEXPB11ws and TsEXPB11 are provided on request. PMID:20507562
Mathupala, S P; Lowe, S E; Podkovyrov, S M; Zeikus, J G
1993-08-05
The complete nucleotide sequence of the gene encoding the dual active amylopullulanase of Thermoanaerobacter ethanolicus 39E (formerly Clostridium thermohydrosulfuricum) was determined. The structural gene (apu) contained a single open reading frame 4443 base pairs in length, corresponding to 1481 amino acids, with an estimated molecular weight of 162,780. Analysis of the deduced sequence of apu with sequences of alpha-amylases and alpha-1,6 debranching enzymes enabled the identification of four conserved regions putatively involved in substrate binding and in catalysis. The conserved regions were localized within a 2.9-kilobase pair gene fragment, which encoded a M(r) 100,000 protein that maintained the dual activities and thermostability of the native enzyme. The catalytic residues of amylopullulanase were tentatively identified by using hydrophobic cluster analysis for comparison of amino acid sequences of amylopullulanase and other amylolytic enzymes. Asp597, Glu626, and Asp703 were individually modified to their respective amide form, or the alternate acid form, and in all cases both alpha-amylase and pullulanase activities were lost, suggesting the possible involvement of 3 residues in a catalytic triad, and the presence of a putative single catalytic site within the enzyme. These findings substantiate amylopullulanase as a new type of amylosaccharidase.
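The figures quoted in this abstract can be cross-checked with a little arithmetic. The sketch below converts the reported open reading frame length into a codon count and derives the mean residue mass implied by the reported molecular weight. The helper name `codon_count` is hypothetical, and the abstract does not say whether the stop codon is counted among the 1481 residues, so that question is left open here.

```python
def codon_count(orf_length_bp):
    """Return the number of codons in an ORF of the given length (bp)."""
    if orf_length_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    return orf_length_bp // 3


# Values reported in the abstract.
ORF_LENGTH_BP = 4443
REPORTED_RESIDUES = 1481
REPORTED_MOLECULAR_WEIGHT = 162_780

codons = codon_count(ORF_LENGTH_BP)
# Mean mass per residue implied by the reported weight and length.
mean_residue_mass = REPORTED_MOLECULAR_WEIGHT / REPORTED_RESIDUES

print(codons)                       # 1481
print(round(mean_residue_mass, 1))  # 109.9
```

The codon count matches the reported residue count, and the implied mean residue mass of about 110 Da is in the range commonly used as a rule of thumb for proteins.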
Pacheco-Arjona, Jose Ramon; Ramirez-Prado, Jorge Humberto
2014-01-01
The cell wall is a protective and versatile structure distributed in all fungi. The component responsible for its rigidity is chitin, a product of chitin synthase (Chsp) enzymes. There are seven classes of chitin synthase genes (CHS), and the number and type encoded in fungal genomes vary considerably from one species to another. Previous Chsp sequence analyses focused on their study as individual units, regardless of genomic context. The identification of blocks of conserved genes between genomes can provide important clues about the interactions and localization of chitin synthases. In the present study, we carried out an in silico search of all putative Chsp encoded in 54 full fungal genomes, encompassing 21 orders from five phyla. Phylogenetic studies of these Chsp were able to confidently classify 347 out of the 369 Chsp identified (94%). Patterns in the distribution of Chsp related to taxonomy were identified, the most prominent being related to the type of fungal growth. More importantly, a synteny analysis for genomic blocks centered on class IV Chsp (the most abundant and widely distributed Chsp class) identified a putative cell wall metabolism gene cluster in members of the genus Aspergillus, the first such association reported for any fungal genome. PMID:25148134
Freytag, Saskia; Manitz, Juliane; Schlather, Martin; Kneib, Thomas; Amos, Christopher I.; Risch, Angela; Chang-Claude, Jenny; Heinrich, Joachim; Bickeböller, Heike
2014-01-01
Biological pathways provide rich information and biological context on the genetic causes of complex diseases. The logistic kernel machine test integrates prior knowledge on pathways in order to analyze data from genome-wide association studies (GWAS). Here, the kernel converts genomic information of two individuals to a quantitative value reflecting their genetic similarity. With the selection of the kernel one implicitly chooses a genetic effect model. Like many other pathway methods, none of the available kernels accounts for topological structure of the pathway or gene-gene interaction types. However, evidence indicates that connectivity and neighborhood of genes are crucial in the context of GWAS, because genes associated with a disease often interact. Thus, we propose a novel kernel that incorporates the topology of pathways and information on interactions. Using simulation studies, we demonstrate that the proposed method maintains the type I error correctly and can be more effective in the identification of pathways associated with a disease than non-network-based methods. We apply our approach to genome-wide association case control data on lung cancer and rheumatoid arthritis. We identify some promising new pathways associated with these diseases, which may improve our current understanding of the genetic mechanisms. PMID:24434848
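The kernel idea described in this abstract can be illustrated with a minimal sketch, which is not the authors' implementation. It assumes each individual is summarized by one numeric score per gene in the pathway (for example a minor-allele count) and that the pathway topology is given as a symmetric matrix with ones on the diagonal and interaction weights elsewhere. The names `linear_kernel`, `network_kernel` and the example data are all hypothetical.

```python
def linear_kernel(z_i, z_j):
    """Similarity of two individuals ignoring pathway topology: z_i . z_j"""
    return sum(a * b for a, b in zip(z_i, z_j))


def network_kernel(z_i, z_j, network):
    """Similarity weighted by gene-gene connections: z_i^T N z_j"""
    n = len(z_i)
    return sum(
        z_i[p] * network[p][q] * z_j[q]
        for p in range(n)
        for q in range(n)
    )


# Hypothetical gene-level scores for two individuals over three genes.
person_1 = [2, 0, 1]
person_2 = [1, 1, 0]

# Hypothetical pathway: genes 0 and 1 interact (weight 0.5), gene 2 is isolated.
pathway = [
    [1.0, 0.5, 0.0],
    [0.5, 1.0, 0.0],
    [0.0, 0.0, 1.0],
]

print(linear_kernel(person_1, person_2))            # 2
print(network_kernel(person_1, person_2, pathway))  # 3.0
```

In this toy example the network kernel scores the pair as more similar than the linear kernel does, because variation in interacting genes 0 and 1 is allowed to contribute jointly. With an identity matrix as the network, the two kernels coincide. For use in a kernel machine the network matrix would need to be positive semi-definite, which this sketch does not check.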
New support vector machine-based method for microRNA target prediction.
Li, L; Gao, Q; Mao, X; Cao, Y
2014-06-09
MicroRNA (miRNA) plays important roles in cell differentiation, proliferation, growth, mobility, and apoptosis. An accurate list of precise target genes is necessary in order to fully understand the importance of miRNAs in animal development and disease. Several computational methods have been proposed for miRNA target-gene identification. However, these methods still have limitations with respect to their sensitivity and accuracy. Thus, we developed a new miRNA target-prediction method based on the support vector machine (SVM) model. The model supplies information of two binding sites (primary and secondary) for a radial basis function kernel as a similarity measure for SVM features. The information is categorized based on structural, thermodynamic, and sequence conservation. Using high-confidence datasets selected from public miRNA target databases, we obtained a human miRNA target SVM classifier model with high performance and provided an efficient tool for human miRNA target gene identification. Experiments have shown that our method is a reliable tool for miRNA target-gene prediction, and a successful application of an SVM classifier. Compared with other methods, the method proposed here improves the sensitivity and accuracy of miRNA prediction. Its performance can be further improved by providing more training examples.
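For readers unfamiliar with the radial basis function kernel mentioned in this abstract, the following is a minimal sketch of how such a kernel scores the similarity of two feature vectors. It is a generic illustration, not the authors' model: the function name `rbf_kernel`, the `gamma` value, and the three-element feature vectors (imagined here as scaled structural, thermodynamic, and conservation scores) are all assumptions.

```python
import math


def rbf_kernel(x, y, gamma=0.5):
    """Radial basis function kernel: exp(-gamma * ||x - y||^2)."""
    if len(x) != len(y):
        raise ValueError("feature vectors must have the same length")
    squared_distance = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * squared_distance)


# Hypothetical feature vectors for three candidate binding sites,
# each scaled to the range [0, 1].
site_a = [0.9, 0.7, 0.8]
site_b = [0.9, 0.7, 0.8]
site_c = [0.1, 0.2, 0.3]

print(rbf_kernel(site_a, site_b))  # identical vectors give 1.0
print(rbf_kernel(site_a, site_c))  # dissimilar vectors give a value below 1
```

The kernel value decays from 1 toward 0 as two sites become less alike, and `gamma` controls how quickly that decay happens. An SVM classifier would use these pairwise similarities rather than the raw features when separating targets from non-targets.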
Zhang, Jin; Liu, Bobin; Li, Jianbo; Zhang, Li; Wang, Yan; Zheng, Huanquan; Lu, Mengzhu; Chen, Jun
2015-03-14
Heat shock proteins (Hsps) are molecular chaperones that are involved in many normal cellular processes and stress responses, and heat shock factors (Hsfs) are the transcriptional activators of Hsps. Hsfs and Hsps are widely coordinated in various biological processes. Although the roles of Hsfs and Hsps in stress responses have been well characterized in Arabidopsis, their roles in perennial woody species undergoing various environmental stresses remain unclear. Here, a comprehensive identification and analysis of Hsf and Hsp families in poplars is presented. In Populus trichocarpa, we identified 42 paralogous pairs, 66.7% resulting from a whole genome duplication. The gene structure and motif composition are relatively conserved in each subfamily. Microarray and quantitative real-time RT-PCR analyses showed that most of the Populus Hsf and Hsp genes are differentially expressed upon exposure to various stresses. A coexpression network between Populus Hsf and Hsp genes was generated based on their expression. Coordinated relationships were validated by transient overexpression and subsequent qPCR analyses. The comprehensive analysis indicates that different sets of PtHsps are downstream of particular PtHsfs and provides a basis for functional studies aimed at revealing the roles of these families in poplar development and stress responses.
The Amaryllidaceae alkaloids: biosynthesis and methods for enzyme discovery
Kilgore, Matthew B.; Kutchan, Toni M.
2015-01-01
Amaryllidaceae alkaloids are an example of the vast diversity of secondary metabolites with great therapeutic promise. The identification of novel compounds in this group with over 300 known structures continues to be an area of active study. The recent identification of norbelladine 4′-O-methyltransferase (N4OMT), an Amaryllidaceae alkaloid biosynthetic enzyme, and the assembly of transcriptomes for Narcissus sp. aff. pseudonarcissus and Lycoris aurea highlight the potential for discovery of Amaryllidaceae alkaloid biosynthetic genes with new technologies. Recent technical advances of interest include those in enzymology, next generation sequencing, genetic modification, nuclear magnetic resonance spectroscopy (NMR), and mass spectrometry (MS). PMID:27340382
NASA Astrophysics Data System (ADS)
Messina, Jane L.; Fenstermacher, David A.; Eschrich, Steven; Qu, Xiaotao; Berglund, Anders E.; Lloyd, Mark C.; Schell, Michael J.; Sondak, Vernon K.; Weber, Jeffrey S.; Mulé, James J.
2012-10-01
We have interrogated a 12-chemokine gene expression signature (GES) on genomic arrays of 14,492 distinct solid tumors and show broad distribution across different histologies. We hypothesized that this 12-chemokine GES might accurately predict a unique intratumoral immune reaction in stage IV (non-locoregional) melanoma metastases. The 12-chemokine GES predicted the presence of unique, lymph node-like structures, containing CD20+ B cell follicles with prominent areas of CD3+ T cells (both CD4+ and CD8+ subsets). CD86+, but not FoxP3+, cells were present within these unique structures as well. The direct correlation between the 12-chemokine GES score and the presence of unique, lymph nodal structures was also associated with better overall survival of the subset of melanoma patients. The use of this novel 12-chemokine GES may reveal basic information on in situ mechanisms of the anti-tumor immune response, potentially leading to improvements in the identification and selection of melanoma patients most suitable for immunotherapy.
Sharma, Akanksha; Sharma, Niharika; Bhalla, Prem; Singh, Mohan
2017-01-01
Comparative genomics has facilitated the mining of biological information from a genome sequence, through the detection of similarities and differences with genomes of closely or more distantly related species. By using such comparative approaches, knowledge can be transferred from the model to non-model organisms and insights can be gained in the structural and evolutionary patterns of specific genes. In the absence of sequenced genomes for allergenic grasses, this study was aimed at understanding the structure, organisation and expression profiles of grass pollen allergens using the genomic data from Brachypodium distachyon as it is phylogenetically related to the allergenic grasses. Combining genomic data with the anther RNA-Seq dataset revealed 24 pollen allergen genes belonging to eight allergen groups mapping on the five chromosomes in B. distachyon. High levels of anther-specific expression profiles were observed for the 24 identified putative allergen-encoding genes in Brachypodium. The genomic evidence suggests that the gene encoding the group 5 allergen, the most potent trigger of hay fever and allergic asthma, originated as a pollen-specific orphan gene in a common grass ancestor of the Brachypodium and Triticeae clades. Gene structure analysis showed that the putative allergen-encoding genes in Brachypodium either lack introns or contain a reduced number of them. Promoter analysis of the identified Brachypodium genes revealed the presence of specific cis-regulatory sequences likely responsible for high anther/pollen-specific expression. With the identification of putative allergen-encoding genes in Brachypodium, this study has also described some important plant gene families (e.g. expansin superfamily, EF-Hand family, profilins, etc.) for the first time in the model plant Brachypodium.
Altogether, the present study provides new insights into structural characterization and evolution of pollen allergens and will further serve as a base for their functional characterization in related grass species. PMID:28103252
Cloud, Joann L; Harmsen, Dag; Iwen, Peter C; Dunn, James J; Hall, Gerri; Lasala, Paul Rocco; Hoggan, Karen; Wilson, Deborah; Woods, Gail L; Mellmann, Alexander
2010-04-01
Correct identification of nonfermenting Gram-negative bacilli (NFB) is crucial for patient management. We compared phenotypic identifications of 96 clinical NFB isolates with identifications obtained by 5' 16S rRNA gene sequencing. Sequencing identified 88 isolates (91.7%) with >99% similarity to a sequence from the assigned species; 61.5% of sequencing results were concordant with phenotypic results, indicating the usability of sequencing to identify NFB.
Characterization of Proteoforms with Unknown Post-translational Modifications Using the MIScore
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kou, Qiang; Zhu, Binhai; Wu, Si
Various proteoforms may be generated from a single gene due to primary structure alterations (PSAs) such as genetic variations, alternative splicing, and post-translational modifications (PTMs). Top-down mass spectrometry is capable of analyzing intact proteins and identifying patterns of multiple PSAs, making it the method of choice for studying complex proteoforms. In top-down proteomics, proteoform identification is often performed by searching tandem mass spectra against a protein sequence database that contains only one reference protein sequence for each gene or transcript variant in a proteome. Because of the incompleteness of the protein database, an identified proteoform may contain unknown PSAs compared with the reference sequence. Proteoform characterization is to identify and localize PSAs in a proteoform. Although many software tools have been proposed for proteoform identification by top-down mass spectrometry, the characterization of proteoforms in identified proteoform-spectrum matches still relies mainly on manual annotation. We propose to use the Modification Identification Score (MIScore), which is based on Bayesian models, to automatically identify and localize PTMs in proteoforms. Experiments showed that the MIScore is accurate in identifying and localizing one or two modifications.
Ma, Hong-Zhen; Liu, Guo-Qin; Li, Cheng-Wei; Kang, Guo-Zhang; Guo, Tian-Cai
2012-10-05
The full-length cDNA (882 bp) and DNA (1742 bp) sequences encoding a basic transcription factor 3, designated as TaBTF3, were first isolated from common wheat (Triticum aestivum L.). Subcellular localization studies revealed that the TaBTF3 protein was mainly located in the cytoplasm and nucleus. In TaBTF3-silenced transgenic wheat seedlings obtained using the virus-induced gene silencing (VIGS) method, the chlorophyll pigment content was markedly reduced. However, the malonaldehyde (MDA) and H2O2 contents were enhanced, and the structure of the wheat mesophyll cell was seriously damaged. Furthermore, transcripts of the chloroplast- and mitochondrial-encoded genes were significantly reduced in TaBTF3-silenced transgenic wheat plants. These results suggest that the TaBTF3 gene might function in the development of the wheat chloroplast, mitochondria and mesophyll cell. This paper is the first report to describe the involvement of TaBTF3 in maintaining the normal plant mesophyll cell structure. Copyright © 2012 Elsevier Inc. All rights reserved.
José-Edwards, Diana S; Kerner, Pierre; Kugler, Jamie E; Deng, Wei; Jiang, Di; Di Gregorio, Anna
2011-07-01
The notochord is the distinctive characteristic of chordates; however, the knowledge of the complement of transcription factors governing the development of this structure is still incomplete. Here we present the expression patterns of seven transcription factor genes detected in the notochord of the ascidian Ciona intestinalis at various stages of embryonic development. Four of these transcription factors, Fos-a, NFAT5, AFF and Klf15, have not been directly associated with the notochord in previous studies, while the others, including Spalt-like-a, Lmx-like, and STAT5/6-b, display evolutionarily conserved expression in this structure as well as in other domains. We examined the hierarchical relationships between these genes and the transcription factor Brachyury, which is necessary for notochord development in all chordates. We found that Ciona Brachyury regulates the expression of most, although not all, of these genes. These results shed light on the genetic regulatory program underlying notochord formation in Ciona and possibly other chordates. Copyright © 2011 Wiley-Liss, Inc.
Mikshis, N I; Kashtanova, T N; Kutyrev, V V
2015-01-01
Nucleotide sequence analysis of several genes responsible for the distinguishing properties of the anthrax pathogen (motility and penicillinase activity) identified a chromosomal locus that is promising for interspecies differentiation. We demonstrated that the gene fliC, which encodes flagellin synthesis, contains an extended region that distinguishes B. anthracis strains from the majority of non-pathogenic and opportunistic bacilli. A novel method for the indication and identification of the anthrax pathogen, based on differences in the structure of the chromosomal genes fliC and hom2, is proposed. A total of 60 strains of different Bacillus spp. (B. anthracis, B. cereus, B. thuringiensis, B. mycoides, B. megaterium, B. subtilis, etc.) were tested using the two chromosomal DNA targets. The algorithm developed in this work makes it possible to detect the pathogenic microorganism and reliably differentiate it from other Bacillus spp. Adding primers complementary to specific sequences of the pXO1 and pXO2 plasmids to the multiplex PCR provides additional information on the presumed virulence of the isolate.
Rodríguez-García, María Juliana; García-Reina, Andrés; Machado, Vilmar; Galián, José
2016-09-01
In this study, a defensin gene (Clit-Def) has been characterised in the tiger beetle Calomera littoralis for the first time. Bioinformatic analysis showed that the gene has an open reading frame of 246bp that contains a 46 amino acid mature peptide. The phylogenetic analysis showed high variability among the coleopteran defensins analysed. The Clit-Def mature peptide has features consistent with an antimicrobial function: a predicted cationic isoelectric point of 8.94, six cysteine residues that form three disulfide bonds, and the typical cysteine-stabilized α-helix β-sheet (CSαβ) structural fold. Real time quantitative PCR analysis showed that Clit-Def was upregulated in the different body parts analysed after infection with lipopolysaccharides of Escherichia coli, and also indicated that it has an expression peak at 12h post infection. The expression patterns of Clit-Def suggest that this gene plays important roles in the humoral system in the adephagan beetle Calomera littoralis. Copyright © 2016 Elsevier B.V. All rights reserved.
Application of hidden Markov models to biological data mining: a case study
NASA Astrophysics Data System (ADS)
Yin, Michael M.; Wang, Jason T.
2000-04-01
In this paper we present an example of biological data mining: the detection of splicing junction acceptors in eukaryotic genes. Identification or prediction of transcribed sequences from within genomic DNA has been a major rate-limiting step in the pursuit of genes. Programs currently available are far from being powerful enough to elucidate the gene structure completely. Here we develop a hidden Markov model (HMM) to represent the degeneracy features of splicing junction acceptor sites in eukaryotic genes. The HMM system is fully trained using an expectation maximization (EM) algorithm and the system performance is evaluated using the 10-way cross-validation method. Experimental results show that our HMM system can correctly classify more than 94% of the candidate sequences (including true and false acceptor sites) into the right categories. About 90% of the true acceptor sites and 96% of the false acceptor sites in the test data are classified correctly. These results are very promising considering that only the local information in DNA is used. The proposed model will be a very important component of an effective and accurate gene structure detection system currently being developed in our lab.
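The 10-way cross-validation mentioned in the abstract above can be illustrated with a minimal sketch. It shows only how candidate sequences might be partitioned so that each one is held out exactly once; the function name `k_fold_indices` and the round-robin fold assignment are assumptions made for illustration, and the abstract does not say how the authors built their folds or trained the HMM on each split.

```python
def k_fold_indices(n_items, k=10):
    """Yield (train_idx, test_idx) pairs so each item is tested exactly once.

    Hypothetical helper: folds are assigned round-robin, which is only one
    of several reasonable conventions (shuffled or stratified folds are
    common alternatives).
    """
    if not 2 <= k <= n_items:
        raise ValueError("k must be between 2 and n_items")
    # Fold f holds items f, f + k, f + 2k, ...
    folds = [list(range(start, n_items, k)) for start in range(k)]
    for held_out, test_idx in enumerate(folds):
        # Training set is every item that is not in the held-out fold.
        train_idx = [i for f, fold in enumerate(folds) if f != held_out for i in fold]
        yield train_idx, test_idx


# Example: 25 candidate sequences split into 10 folds.
for train_idx, test_idx in k_fold_indices(25, k=10):
    print(len(train_idx), len(test_idx))
```

In a real evaluation a model would be trained on `train_idx` and scored on `test_idx` in each round, and the per-fold accuracies averaged.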
Liu, Pu; Zhang, Chao; Ma, Jin-Qi; Zhang, Li-Yuan; Yang, Bo; Tang, Xin-Yu; Huang, Ling; Zhou, Xin-Tong; Lu, Kun; Li, Jia-Na
2018-03-16
Cytokinin oxidase/dehydrogenases (CKXs) play a critical role in the irreversible degradation of cytokinins, thereby regulating plant growth and development. Brassica napus is one of the most widely cultivated oilseed crops worldwide. With the completion of whole-genome sequencing of B. napus, genome-wide identification and expression analysis of the BnCKX gene family has become technically feasible. In this study, we identified 23 BnCKX genes and analyzed their phylogenetic relationships, gene structures, conserved motifs, protein subcellular localizations, and other properties. We also analyzed the expression of the 23 BnCKX genes in the B. napus cultivar Zhong Shuang 11 ('ZS11') by quantitative reverse-transcription polymerase chain reaction (qRT-PCR), revealing their diverse expression patterns. We selected four BnCKX genes based on the results of RNA-sequencing and qRT-PCR and compared their expression in cultivated varieties with extremely long versus short siliques. The expression levels of BnCKX5-1, 5-2, 6-1, and 7-1 significantly differed between the two lines and changed during pod development, suggesting they might play roles in determining silique length and in pod development. Finally, we investigated the effects of treatment with the synthetic cytokinin 6-benzylaminopurine (6-BA) and the auxin indole-3-acetic acid (IAA) on the expression of the four selected BnCKX genes. Our results suggest that regulating BnCKX expression is a promising way to enhance the harvest index and stress resistance in plants.
Genome-wide identification of the SWEET gene family in wheat.
Gao, Yue; Wang, Zi Yuan; Kumar, Vikranth; Xu, Xiao Feng; Yuan, De Peng; Zhu, Xiao Feng; Li, Tian Ya; Jia, Baolei; Xuan, Yuan Hu
2018-02-05
The SWEET (sugars will eventually be exported transporter) family is a newly characterized group of sugar transporters. In plants, the key roles of SWEETs in phloem transport, nectar secretion, pollen nutrition, stress tolerance, and plant-pathogen interactions have been identified. SWEET family genes have been characterized in many plant species, but a comprehensive analysis of SWEET members has not yet been performed in wheat. Here, 59 wheat SWEETs (hereafter TaSWEETs) were identified through homology searches. Analyses of phylogenetic relationships, numbers of transmembrane helices (TMHs), gene structures, and motifs showed that TaSWEETs carrying 3-7 TMHs could be classified into four clades with 10 different types of motifs. Examination of the expression patterns of 18 SWEET genes revealed that a few are tissue-specific while most are ubiquitously expressed. In addition, the stem rust-mediated expression patterns of SWEET genes were monitored using a stem rust-susceptible cultivar, 'Little Club' (LC). The resulting data showed that the expression of five out of the 18 SWEETs tested was induced following inoculation. In conclusion, we provide the first comprehensive analysis of the wheat SWEET gene family. Information regarding the phylogenetic relationships, gene structures, and expression profiles of SWEET genes in different tissues and following stem rust disease inoculation will be useful in identifying the potential roles of SWEETs in specific developmental and pathogenic processes. Copyright © 2017 Elsevier B.V. All rights reserved.
Cherian, Milu T.; Yang, Lei; Chai, Sergio C.; Lin, Wenwei
2016-01-01
The constitutive androstane receptor (CAR) regulates the expression of genes involved in drug metabolism and other processes. A specific inhibitor of CAR is critical for modulating constitutive CAR activity. We recently described a specific small-molecule inhibitor of CAR, CINPA1 (ethyl (5-(diethylglycyl)-10,11-dihydro-5H-dibenzo[b,f]azepin-3-yl)carbamate), which is capable of reducing CAR-mediated transcription by changing the coregulator recruitment pattern and reducing CAR occupancy at the promoter regions of its target genes. In this study, we showed that CINPA1 is converted to two main metabolites in human liver microsomes. By using cell-based reporter gene and biochemical coregulator recruitment assays, we showed that although metabolite 1 was very weak in inhibiting CAR function and disrupting CAR-coactivator interaction, metabolite 2 was inactive in this regard. Docking studies using the CAR ligand-binding domain structure showed that although CINPA1 and metabolite 1 can bind in the CAR ligand-binding pocket, metabolite 2 may be incapable of the molecular interactions required for binding. These results indicate that the metabolites of CINPA1 may not interfere with the action of CINPA1. We also used in vitro enzyme assays to identify the cytochrome P450 enzymes responsible for metabolizing CINPA1 in human liver microsomes and showed that CINPA1 was first converted to metabolite 1 by CYP3A4 and then further metabolized by CYP2D6 to metabolite 2. Identification and characterization of the metabolites of CINPA1 enabled structure-activity relationship studies of this family of small molecules and provided information to guide in vivo pharmacological studies. PMID:27519550
Salmond, G P; Lutkenhaus, J F; Donachie, W D
1980-01-01
We report the identification, cloning, and mapping of a new cell envelope gene, murG. This lies in a group of five genes of similar phenotype (in the order murE murF murG murC ddl) all concerned with peptidoglycan biosynthesis. This group is in a larger cluster of at least 10 genes, all of which are involved in some way with cell envelope growth. PMID:6998962
Furlong, Michael; Seong, Jae Young
2017-01-01
Seven transmembrane receptors (7TMRs), also known as G protein-coupled receptors, are popular targets of drug development, particularly 7TMR systems that are activated by peptide ligands. Although many pharmaceutical drugs have been discovered via conventional bulk analysis techniques, the increasing availability of structural and evolutionary data is facilitating a change to rational, targeted drug design. This article discusses the appeal of neuropeptide-7TMR systems as drug targets and provides an overview of concepts in the evolution of vertebrate genomes and gene families. Subsequently, methods that use evolutionary concepts and comparative analysis techniques to aid in gene discovery, gene function identification, and novel drug design are provided along with case study examples. PMID:28035082
USDA-ARS's Scientific Manuscript database
Polymerase chain reaction amplification of conserved genes and sequence analysis provides a very powerful tool for the identification of toxigenic as well as non-toxigenic Penicillium species. Sequences are obtained by amplification of the gene fragment, sequencing via capillary electrophoresis of d...
Perceptron ensemble of graph-based positive-unlabeled learning for disease gene identification.
Jowkar, Gholam-Hossein; Mansoori, Eghbal G
2016-10-01
Identification of disease genes using computational methods is an important issue in biomedical and bioinformatics research. Based on the observation that diseases with the same or similar phenotype share biological characteristics, researchers have tried to identify genes by using machine learning tools. In recent attempts, semi-supervised learning methods called positive-unlabeled learning have been used for disease gene identification. In this paper, we present a Perceptron ensemble of graph-based positive-unlabeled learning (PEGPUL) on three types of biological attributes: gene ontologies, protein domains and protein-protein interaction networks. In our method, a reliable set of positive and negative genes is extracted using a co-training schema. Then, the similarity graph of genes is built using metric learning, concentrating on the multi-rank-walk method to perform inference from labeled genes. Finally, a Perceptron ensemble is learned from three weighted classifiers: multilevel support vector machine, k-nearest neighbor and decision tree. The main contributions of this paper are: (i) incorporating the statistical properties of gene data through choosing proper metrics, (ii) statistical evaluation of biological features, and (iii) the noise robustness of PEGPUL achieved by using a multilevel schema. In order to assess PEGPUL, we have applied it to 12950 disease genes, with 949 positive genes from six classes of diseases and 12001 unlabeled genes. Compared with some popular disease gene identification methods, the experimental results show that PEGPUL has reasonable performance. Copyright © 2016 Elsevier Ltd. All rights reserved.
Premraj, Avinash; Nautiyal, Binita; Aleyas, Abi G; Rasool, Thaha Jamal
2015-10-01
Interleukin-26 (IL-26) is a member of the IL-10 family of cytokines. Though conserved across vertebrates, the IL-26 gene is functionally inactivated in a few mammals such as rat, mouse and horse. We report here the identification, isolation and cloning of the cDNA of IL-26 from the dromedary camel. The camel cDNA contains a 516 bp open reading frame encoding a 171 amino acid precursor protein, including a 21 amino acid signal peptide. Sequence analysis revealed high similarity with other mammalian IL-26 homologs and conservation of the IL-10 cytokine family domain structure, including key amino acid residues. We also report the identification and cloning of four novel transcript variants produced by alternative splicing at the Exon 3-Exon 4 regions of the gene. Three of the alternative splice variants had premature termination codons and are predicted to code for truncated proteins. Transcript variant 4 (Tv4), which has an insertion of an extra 120 nucleotides in the ORF, was predicted to encode a full-length protein product with 40 extra amino acid residues. The mRNA transcripts of all the variants were identified in lymph node, whereas fewer variants were observed in other tissues such as blood, liver and kidney. The expression of Tv2 and Tv3 was found to be upregulated in mitogen-induced camel peripheral blood mononuclear cells. IL-26-Tv2 expression was also induced in camel fibroblast cells infected with camelpox virus in vitro. The identification of the transcript variants of IL-26 from the dromedary camel is the first report of alternative splicing for IL-26 in a species in which the gene has not been inactivated. Copyright © 2015 Elsevier Ltd. All rights reserved.
Le Bail, Aude; Scholz, Sebastian; Kost, Benedikt
2013-01-01
The use of the moss Physcomitrella patens as a model system to study plant development and physiology is rapidly expanding. The strategic position of P. patens within the green lineage between algae and vascular plants, the high efficiency with which transgenes are incorporated by homologous recombination, advantages associated with the haploid gametophyte representing the dominant phase of the P. patens life cycle, the simple structure of protonemata, leafy shoots and rhizoids that constitute the haploid gametophyte, as well as a readily accessible high-quality genome sequence make this moss a very attractive experimental system. The investigation of the genetic and hormonal control of P. patens development heavily depends on the analysis of gene expression patterns by real time quantitative PCR (RT qPCR). This technique requires well characterized sets of reference genes, which display minimal expression level variations under all analyzed conditions, for data normalization. Sets of suitable reference genes have been described for most widely used model systems including e.g. Arabidopsis thaliana, but not for P. patens. Here, we present a RT qPCR based comparison of transcript levels of 12 selected candidate reference genes in a range of gametophytic P. patens structures at different developmental stages, and in P. patens protonemata treated with hormones or hormone transport inhibitors. Analysis of these RT qPCR data using GeNorm and NormFinder software resulted in the identification of sets of P. patens reference genes suitable for gene expression analysis under all tested conditions, and suggested that the two best reference genes are sufficient for effective data normalization under each of these conditions. PMID:23951063
Zhang, Shi-tao; Zuo, Chao; Li, Wan-nan; Fu, Xue-qi; Xing, Shu; Zhang, Xiao-ping
2016-02-01
This study aimed to identify key genes related to the effect of estrogen on ovarian cancer. Microarray data (GSE22600) were downloaded from Gene Expression Omnibus. Eight estrogen and seven placebo treatment samples were obtained using a 2 × 2 factorial design, which comprised 2 cell lines (PEO4 and 2008) and 2 treatments (estrogen and placebo). Differentially expressed genes (DEGs) were identified by Bayesian methods, with P < 0.05 and |log2FC (fold change)| ≥ 0.5 as the cut-off criteria. Differentially co-expressed genes (DCGs) and differentially regulated genes (DRGs) were identified by the DCe function and the DRsort function, respectively, in the DCGL package. Topological structure analysis was performed on the important transcription factors (TFs) and genes in the transcriptional regulatory network using tYNA. Functional enrichment analysis was performed for the DEGs and for the important genes using the Gene Ontology and KEGG databases. In total, 465 DEGs were identified. Functional enrichment analysis of the DEGs indicated that ACVR2B, LTBP1, BMP7 and MYC were involved in the TGF-beta signaling pathway. A total of 2285 DCG pairs and 357 DRGs were identified. Topological structure analysis identified 52 important TFs and 65 important genes. Functional enrichment analysis of the important genes showed that TP53 and MLH1 participated in the DNA damage response and that ACVR2B, LTBP1, BMP7 and MYC were involved in the TGF-beta signaling pathway. TP53, MLH1, ACVR2B, LTBP1 and BMP7 might participate in the pathogenesis of ovarian cancer.
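The cut-off stated in the abstract above (P < 0.05 and |log2FC| ≥ 0.5) can be expressed as a small filter. This is a sketch only: the function names `is_deg` and `select_degs`, the tuple layout, and the example values are hypothetical, and the study itself used Bayesian methods in other software to obtain the P values and fold changes.

```python
def is_deg(p_value, log2_fc, p_cutoff=0.05, lfc_cutoff=0.5):
    """Return True when a gene passes both thresholds quoted in the abstract."""
    return p_value < p_cutoff and abs(log2_fc) >= lfc_cutoff


def select_degs(results):
    """Filter an iterable of (gene, p_value, log2_fc) tuples down to DEG names."""
    return [gene for gene, p_value, log2_fc in results if is_deg(p_value, log2_fc)]


# Hypothetical values for illustration only; not data from the study.
example = [
    ("GENE_A", 0.001, 1.20),   # passes both thresholds
    ("GENE_B", 0.200, 2.00),   # fails on P
    ("GENE_C", 0.010, -0.75),  # passes: down-regulation counts via abs()
    ("GENE_D", 0.030, 0.30),   # fails on |log2FC|
]
print(select_degs(example))
```

Taking the absolute value of the log2 fold change is what lets a single threshold capture both up- and down-regulated genes.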
Botulinum neurotoxin homologs in non-Clostridium species.
Mansfield, Michael J; Adams, Jeremy B; Doxey, Andrew C
2015-01-30
Clostridial neurotoxins (CNTs) are the deadliest toxins known and the causative agents of botulism and tetanus. Despite their structural and functional complexity, no CNT homologs are currently known outside Clostridium. Here, we report the first homologs of Clostridium CNTs within the genome of the rice fermentation organism Weissella oryzae SG25. One gene in W. oryzae SG25 encodes a protein with a four-domain architecture and HExxH protease motif common to botulinum neurotoxins (BoNTs). An adjacent gene with partial similarity to CNTs is also present, and both genes seem to have been laterally transferred into the W. oryzae genome from an unknown source. Identification of mobile, CNT-related genes outside of Clostridium has implications for our understanding of the evolution of this important toxin family. Copyright © 2015 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
Schwartz, N B; Pirok, E W; Mensch, J R; Domowicz, M S
1999-01-01
Proteoglycans are complex macromolecules, consisting of a polypeptide backbone to which are covalently attached one or more glycosaminoglycan chains. Molecular cloning has allowed identification of the genes encoding the core proteins of various proteoglycans, leading to a better understanding of the diversity of proteoglycan structure and function, as well as to the evolution of a classification of proteoglycans on the basis of emerging gene families that encode the different core proteins. One such family includes several proteoglycans that have been grouped with aggrecan, the large aggregating chondroitin sulfate proteoglycan of cartilage, based on a high number of sequence similarities within the N- and C-terminal domains. Thus far these proteoglycans include versican, neurocan, and brevican. It is now apparent that these proteins, as a group, are truly a gene family with shared structural motifs on the protein and nucleotide (mRNA) levels, and with nearly identical genomic organizations. Clearly a common ancestral origin is indicated for the members of the aggrecan family of proteoglycans. However, differing patterns of amplification and divergence have also occurred within certain exons across species and family members, leading to the class-characteristic protein motifs in the central carbohydrate-rich region exclusively. Thus the overall domain organization strongly suggests that sequence conservation in the terminal globular domains underlies common functions, whereas differences in the central portions of the genes account for functional specialization among the members of this gene family.
Merlino, Giuseppe; Marzorati, Massimo; Rizzi, Aurora; Lavazza, Davide; de Ferra, Francesca; Carpani, Giovanna
2015-01-01
Successful biostimulation of active microbiomes for the cleanup of a polluted site depends strictly on knowledge of the key microorganisms equipped with the relevant catabolic genes responsible for the degradation process. In this work, we present the characterization of the bacterial community developed in anaerobic microcosms after biostimulation with the electron donor lactate of groundwater polluted with 1,2-dichloroethane (1,2-DCA). Through a multilevel analysis, we have carried out (i) a structural analysis of the bacterial community; (ii) the identification of putative dehalorespiring bacteria; (iii) the characterization of functional genes encoding putative 1,2-DCA reductive dehalogenases (RDs). Following the biostimulation treatment, the structure of the bacterial community underwent a notable change in the main phylotypes, with the enrichment of representatives of the order Clostridiales. Through PCR targeting conserved regions within known RD genes, four novel variants of RDs previously associated with the reductive dechlorination of 1,2-DCA were identified in the metagenome of the Clostridiales-dominated bacterial community. PMID:26273600
Pérez-Doria, Alveiro; Bejarano, Eduar Elías; Sierra, Diana; Vélez, Iván Darío
2008-07-01
The phlebotomine sand flies Lutzomyia pia (Fairchild & Hertig 1961) and Lutzomyia tihuiliensis Le Pont, Torrez-Espejo & Dujardin 1997 (Diptera: Psychodidae) belong to the pia series of the Lu. verrucarum species group, which includes several species that bite humans in Andean foci of leishmaniasis. The females of these two species exhibit isometry and isomorphism in anatomical structures of the head and terminalia commonly used in taxonomic identification of sand flies. They can only be differentiated based on subtle differences in the pigmentation of the pleura. In Lu. tihuiliensis, this is restricted to the basal portions of the katepimeron and katepisternum, whereas in Lu. pia both structures are totally pigmented. Taking into account the subtle morphological differences between these species, the objective of the current study was to evaluate the specific taxonomic status of Lu. tihuiliensis with respect to Lu. pia. A 475-bp portion of the mitochondrial genome was sequenced, composed of the 3' end of the cytochrome b gene, intergenic spacer 1, the transfer RNA gene for serine, intergenic spacer 2, and the 3' end of the gene NAD dehydrogenase 1. Genetic analysis confirms that Lu. tihuiliensis and Lu. pia constitute two distinct species and this is supported by four strong lines of evidence, i.e., the paired genetic distances, size differences and amino acid composition of the cytochrome b protein, presence and absence of intergenic spacer one and divergence observed in the sequence of the transfer RNA gene for serine. It also confirms the validity of the pleural pigmentation pattern as a species diagnostic character and the importance of performing a detailed examination of this character during morphological determination of phlebotomine sand flies in the series pia.
Santos, André S; Ramos, Rommel T; Silva, Artur; Hirata, Raphael; Mattos-Guaraldi, Ana L; Meyer, Roberto; Azevedo, Vasco; Felicori, Liza; Pacheco, Luis G C
2018-05-11
Biochemical tests are traditionally used for bacterial identification at the species level in clinical microbiology laboratories. While biochemical profiles are generally efficient for the identification of the most important corynebacterial pathogen Corynebacterium diphtheriae, their ability to differentiate between biovars of this bacterium is still controversial. Besides, the unambiguous identification of emerging human pathogenic species of the genus Corynebacterium may be hampered by highly variable biochemical profiles commonly reported for these species, including Corynebacterium striatum, Corynebacterium amycolatum, Corynebacterium minutissimum, and Corynebacterium xerosis. In order to identify the genomic basis contributing to the biochemical variabilities observed in phenotypic identification methods of these bacteria, we combined a comprehensive literature review with a bioinformatics approach based on reconstruction of six specific biochemical reactions/pathways in 33 recently released whole genome sequences. We used data retrieved from curated databases (MetaCyc, PathoSystems Resource Integration Center (PATRIC), The SEED, TransportDB, UniProtKB) associated with homology searches by BLAST and profile Hidden Markov Models (HMMs) to detect enzymes participating in the various pathways and performed ab initio protein structure modeling and molecular docking to confirm specific results. We found a differential distribution among the various strains of genes that code for some important enzymes, such as beta-phosphoglucomutase and fructokinase, and also for individual components of carbohydrate transport systems, including the fructose-specific phosphoenolpyruvate-dependent sugar phosphotransferase (PTS) and the ribose-specific ATP-binding cassette (ABC) transporter. Horizontal gene transfer plays a role in the biochemical variability of the isolates, as some genes needed for sucrose fermentation were seen to be present in genomic islands. Notably, using profile HMMs, we identified an enzyme with putative alpha-1,6-glycosidase activity only in some specific strains of C. diphtheriae, and this may aid in understanding the differential abilities of the biovars to utilize glycogen and starch.
Analysis of informational redundancy in the protein-assembling machinery
NASA Astrophysics Data System (ADS)
Berkovich, Simon
2004-03-01
Entropy analysis of the DNA structure does not reveal a significant departure from randomness, indicating a lack of informational redundancy. This signifies the absence of a hidden meaning in the genome text and supports the 'barcode' interpretation of DNA given in [1]. Lack of informational redundancy is a characteristic property of an identification label rather than of a message of instructions. Yet randomness of DNA has to induce non-random structures of the proteins. Protein synthesis is a two-step process: transcription into RNA with gene splicing, and formation of a structure of amino acids. Entropy estimations, performed by A. Djebbari, show typical values of redundancy of the biomolecules along these pathways: DNA gene 4%, proteins 15-40%. In gene expression, the RNA copy carries the same information as the original DNA template. Randomness is essentially eliminated only at the step of the protein creation by a degenerate code. According to [1], the significance of the substitution of U for T with a subsequent gene splicing is that these transformations result in a different pattern of RNA oscillations, so the vital DNA communications are protected against extraneous noise coming from the protein making activities. 1. S. Berkovich, "On the 'barcode' functionality of DNA, or the Phenomenon of Life in the Physical Universe", Dorrance Publishing Co., Pittsburgh, 2003
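As a rough illustration of the kind of redundancy figure discussed in the abstract above, the sketch below computes a single-symbol (zeroth-order) Shannon entropy and the redundancy 1 − H/H_max. This is an assumption about the measure: the abstract does not state which entropy estimator was used, and the function names and example strings are hypothetical.

```python
import math
from collections import Counter


def shannon_entropy(seq):
    """Single-symbol Shannon entropy of a sequence, in bits per symbol."""
    counts = Counter(seq)
    n = len(seq)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def redundancy(seq, alphabet_size):
    """Redundancy 1 - H / H_max, where H_max = log2(alphabet_size).

    0.0 means symbol usage is indistinguishable from uniform random;
    1.0 means the sequence uses a single symbol throughout.
    """
    h_max = math.log2(alphabet_size)
    return 1.0 - shannon_entropy(seq) / h_max


# Hypothetical toy sequences, not real genomic data.
print(redundancy("ACGT" * 10, alphabet_size=4))      # uniform base usage
print(redundancy("AAAAAAAAAACG", alphabet_size=4))   # strongly biased usage
```

For DNA the alphabet size is 4 and for proteins it is 20; estimators based on longer words (k-mers) would also capture dependencies between neighbouring symbols, which this sketch ignores.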
Sander, Adam F.; Lavstsen, Thomas; Rask, Thomas S.; Lisby, Michael; Salanti, Ali; Fordyce, Sarah L.; Jespersen, Jakob S.; Carter, Richard; Deitsch, Kirk W.; Theander, Thor G.; Pedersen, Anders Gorm; Arnot, David E.
2014-01-01
Many bacterial, viral and parasitic pathogens undergo antigenic variation to counter host immune defense mechanisms. In Plasmodium falciparum, the most lethal of human malaria parasites, switching of var gene expression results in alternating expression of the adhesion proteins of the Plasmodium falciparum-erythrocyte membrane protein 1 class on the infected erythrocyte surface. Recombination clearly generates var diversity, but the nature and control of the genetic exchanges involved remain unclear. By experimental and bioinformatic identification of recombination events and genome-wide recombination hotspots in var genes, we show that during the parasite’s sexual stages, ectopic recombination between isogenous var paralogs occurs near low folding free energy DNA 50-mers and that these sequences are heavily concentrated at the boundaries of regions encoding individual Plasmodium falciparum-erythrocyte membrane protein 1 structural domains. The recombinogenic potential of these 50-mers is not parasite-specific because these sequences also induce recombination when transferred to the yeast Saccharomyces cerevisiae. Genetic cross data suggest that DNA secondary structures (DSS) act as inducers of recombination during DNA replication in P. falciparum sexual stages, and that these DSS-regulated genetic exchanges generate functional and diverse P. falciparum adhesion antigens. DSS-induced recombination may represent a common mechanism for optimizing the evolvability of virulence gene families in pathogens. PMID:24253306
Genetic Structure of Avian Influenza Viruses from Ducks of the Atlantic Flyway of North America
Huang, Yanyan; Wille, Michelle; Dobbin, Ashley; Walzthöni, Natasha M.; Robertson, Gregory J.; Ojkic, Davor; Whitney, Hugh; Lang, Andrew S.
2014-01-01
Wild birds, including waterfowl such as ducks, are reservoir hosts of influenza A viruses. Despite the increased number of avian influenza virus (AIV) genome sequences available, our understanding of AIV genetic structure and transmission through space and time in waterfowl in North America is still limited. In particular, AIVs in ducks of the Atlantic flyway of North America have not been thoroughly investigated. To begin to address this gap, we analyzed 109 AIV genome sequences from ducks in the Atlantic flyway to determine their genetic structure and to document the extent of gene flow in the context of sequences from other locations and other avian and mammalian host groups. The analyses included 25 AIVs from ducks from Newfoundland, Canada, from 2008–2011 and 84 available reference duck AIVs from the Atlantic flyway from 2006–2011. A vast diversity of viral genes and genomes was identified in the 109 viruses. The genetic structure differed amongst the 8 viral segments with predominant single lineages found for the PB2, PB1 and M segments, increased diversity found for the PA, NP and NS segments (2, 3 and 3 lineages, respectively), and the highest diversity found for the HA and NA segments (12 and 9 lineages, respectively). Identification of inter-hemispheric transmissions was rare with only 2% of the genes of Eurasian origin. Virus transmission between ducks and other bird groups was investigated, with 57.3% of the genes having highly similar (≥99% nucleotide identity) genes detected in birds other than ducks. Transmission between North American flyways has been frequent and 75.8% of the genes were highly similar to genes found in other North American flyways. However, the duck AIV genes did display spatial distribution bias, which was demonstrated by the different population sizes of specific viral genes in one or two neighbouring flyways compared to more distant flyways. PMID:24498009
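The "highly similar (≥99% nucleotide identity)" criterion in the abstract above can be sketched as a comparison of two pre-aligned sequences. The abstract does not say how identity was computed, so the gap handling here (columns containing a gap are skipped) and the function names are assumptions made for illustration.

```python
def percent_identity(a, b):
    """Percent identity between two aligned sequences of equal length.

    Assumption: alignment columns where either sequence has a gap ('-')
    are excluded from both the match count and the denominator.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    pairs = [(x, y) for x, y in zip(a.upper(), b.upper()) if x != "-" and y != "-"]
    if not pairs:
        return 0.0
    matches = sum(1 for x, y in pairs if x == y)
    return 100.0 * matches / len(pairs)


def highly_similar(a, b, threshold=99.0):
    """Apply the >= 99% identity threshold quoted in the abstract."""
    return percent_identity(a, b) >= threshold


# Hypothetical toy sequences: one mismatch in 200 aligned positions.
ref = "ACGT" * 50
alt = "T" + ref[1:]
print(percent_identity(ref, alt), highly_similar(ref, alt))
```

Other conventions, such as counting gaps as mismatches, would give slightly different percentages near the threshold.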
Isolation and identification of a bovine viral diarrhea virus from sika deer in China
2011-01-01
Background Bovine viral diarrhea virus (BVDV) infections continue to cause significant losses in the deer population. Better isolation and identification of BVDV from sika deer may contribute significantly to the development of prophylactic, therapeutic, and diagnostic reagents as well as help in the prevention and control of BVDV. However, isolation and identification of BVDV from sika deer are seldom reported in the literature. In this study, we collected samples on the basis of clinical signs of BVDV infection in order to isolate and identify BVDV from sika deer. Results We isolated a suspected BVDV strain, named CCSYD, from the liver of an aborted sika deer fetus in Changchun (China) using MDBK cell lines, and identified it by cytopathic effect (CPE), indirect immunoperoxidase test (IPX) and electron microscopy (EM). These tests indicated that the virus was BVDV. The structural protein E0 gene was cloned and sequenced, and the sequence has been submitted to GenBank under accession number FJ555203. Alignment with 9 other strains of BVDV, 7 strains of classical swine fever virus (CSFV) and 3 strains of border disease virus (BDV) showed nucleotide sequence homologies of 98.6%-84.8%, 76.0%-74.7% and 76.6%-77.0%, respectively. Phylogenetic analysis indicated that the newly isolated CCSYD strain belongs to BVDV1b. Conclusion To the best of our knowledge, this is the first report of BVDV being isolated and identified in sika deer. This research contributes to the development of a new BVDV vaccine to prevent and control BVD in sika deer. PMID:21352530
Koda, Satoru; Onda, Yoshihiko; Matsui, Hidetoshi; Takahagi, Kotaro; Yamaguchi-Uehara, Yukiko; Shimizu, Minami; Inoue, Komaki; Yoshida, Takuhiro; Sakurai, Tetsuya; Honda, Hiroshi; Eguchi, Shinto; Nishii, Ryuei; Mochida, Keiichi
2017-01-01
We report the comprehensive identification of periodic genes and their network inference, based on a gene co-expression analysis and an Auto-Regressive eXogenous (ARX) model with a group smoothly clipped absolute deviation (SCAD) method using a time-series transcriptome dataset in a model grass, Brachypodium distachyon. To reveal the diurnal changes in the transcriptome in B. distachyon, we performed RNA-seq analysis of its leaves sampled through a diurnal cycle of over 48 h at 4 h intervals using three biological replications, and identified 3,621 periodic genes through our wavelet analysis. The expression data are feasible to infer network sparsity based on ARX models. We found that genes involved in biological processes such as transcriptional regulation, protein degradation, and post-transcriptional modification and photosynthesis are significantly enriched in the periodic genes, suggesting that these processes might be regulated by circadian rhythm in B. distachyon. On the basis of the time-series expression patterns of the periodic genes, we constructed a chronological gene co-expression network and identified putative transcription factors encoding genes that might be involved in the time-specific regulatory transcriptional network. Moreover, we inferred a transcriptional network composed of the periodic genes in B. distachyon, aiming to identify genes associated with other genes through variable selection by grouping time points for each gene. Based on the ARX model with the group SCAD regularization using our time-series expression datasets of the periodic genes, we constructed gene networks and found that the networks represent typical scale-free structure. Our findings demonstrate that the diurnal changes in the transcriptome in B. distachyon leaves have a sparse network structure, demonstrating the spatiotemporal gene regulatory network over the cyclic phase transitions in B. distachyon diurnal growth.
NASA Astrophysics Data System (ADS)
Chen, Ye; Wolanyk, Nathaniel; Ilker, Tunc; Gao, Shouguo; Wang, Xujing
Methods developed based on bifurcation theory have demonstrated their potential in driving network identification for complex human diseases, including the work by Chen, et al. Recently, bifurcation theory has been successfully applied to model cellular differentiation. However, driving network prediction in that setting often faces a technical challenge: time course cellular differentiation studies often contain only one sample at each time point, while driving network prediction typically requires multiple samples at each time point to infer the variation and interaction structures of candidate genes for the driving network. In this study, we investigate several methods to identify both the critical time point and the driving network through examination of how each time point affects the autocorrelation and phase locking. We apply these methods to a high-throughput sequencing (RNA-Seq) dataset of 42 subsets of thymocytes and mature peripheral T cells at multiple time points during their differentiation (GSE48138 from GEO). We compare the predicted driving genes with known transcription regulators of cellular differentiation. We will discuss the advantages and limitations of our proposed methods, as well as potential further improvements of our methods.
Structure-Based Annotation of a Novel Sugar Isomerase from the Pathogenic E. coli O157:H7
DOE Office of Scientific and Technical Information (OSTI.GOV)
van Staalduinen, L.; Park, C; Yeom, S
2010-01-01
Prokaryotes can use a variety of sugars as carbon sources in order to provide a selective survival advantage. The gene z5688 found in the pathogenic Escherichia coli O157:H7 encodes a 'hypothetical' protein of unknown function. Sequence analysis identified the gene product as a putative member of the cupin superfamily of proteins, but no other functional information was known. We have determined the crystal structure of the Z5688 protein at 1.6 Å resolution and identified the protein as a novel E. coli sugar isomerase (EcSI) through overall fold analysis and secondary-structure matching. Extensive substrate screening revealed that EcSI is capable of acting on D-lyxose and D-mannose. The complex structure of EcSI with fructose allowed the identification of key active-site residues, and mutagenesis confirmed their importance. The structure of EcSI also suggested a novel mechanism for substrate binding and product release in a cupin sugar isomerase. Supplementation of a nonpathogenic E. coli strain with EcSI enabled cell growth on the rare pentose D-lyxose.
USDA-ARS's Scientific Manuscript database
The hypersensitive response (HR) is the most visible and arguably the most important defense response in plants, although the details of how it is controlled and executed remain patchy. In this paper a novel genetic technique called MAGIC (Mutant-Assisted Gene Identification and Characterization) i...
Cuykendall, Tawny N.; Houston, Douglas W.
2011-01-01
RNA localization is a common mechanism for regulating cell structure and function. Localized RNAs in Xenopus oocytes are critical for early development, including germline specification by the germ plasm. Despite the importance of these localized RNAs, only approximately 25 have been identified and fewer are functionally characterized. Using microarrays, we identified a large set of localized RNAs from the vegetal cortex. Overall, our results indicate a minimum of 275 localized RNAs in oocytes, or 2–3% of maternal transcripts, which are in general agreement with previous findings. We further validated vegetal localization for 24 candidates and further characterized three genes expressed in the germ plasm. We identified novel germ plasm expression for reticulon 3.1, exd2 (a novel exonuclease-domain encoding gene), and a putative noncoding RNA. Further analysis of these and other localized RNAs will likely identify new functions of germ plasm and facilitate the identification of cis-acting RNA localization elements. PMID:20503379
Identification and functional analysis of secreted effectors from phytoparasitic nematodes.
Rehman, Sajid; Gupta, Vijai K; Goyal, Aakash K
2016-03-21
Plant parasitic nematodes develop an intimate and long-term feeding relationship with their host plants. They induce a multi-nucleate feeding site close to the vascular bundle in the roots of their host plant and remain sessile for the rest of their life. Nematode secretions, produced in the oesophageal glands and secreted through a hollow stylet into the host plant cytoplasm, are believed to play a key role in pathogenesis. To combat these persistent pathogens, the identification and functional analysis of secreted effectors can serve as a key to devising durable control measures. In this review, we recapitulate current knowledge on the identification and functional characterization of the secreted effector repertoire of phytoparasitic nematodes. Despite considerable efforts, the identification of genes encoding nematode secreted proteins has long been severely hampered by the nematodes' microscopic size, long generation time and obligate biotrophic nature. Methodologies such as bioinformatics, protein structure modeling, in situ hybridization microscopy, and protein-protein interaction have been used to identify effectors and to attribute functions to them. In addition, RNA interference (RNAi) has been instrumental in deciphering the role of the genes encoding secreted effectors necessary for parasitism and of genes attributed to normal development. Recent comparative and functional genomic approaches have accelerated the identification of effectors from phytoparasitic nematodes and offer opportunities to control these pathogens. Plant parasitic nematodes pose a serious threat to global food security for various economically important crops. There is a wealth of genomic and transcriptomic information available on plant parasitic nematodes, and comparative genomics has identified many effectors.
Bioengineering crops with dsRNA of phytonematode genes can disrupt the life cycle of parasitic nematodes and therefore holds great promise for developing crops resistant to plant-parasitic nematodes.
Endophenotypes in the personality disorders
Siever, Larry J.
2005-01-01
The identification of endophenotypes in the personality disorders may provide a basis for the identification of underlying genotypes that influence the traits and dimensions of the personality disorders, as well as susceptibility to major psychiatric illnesses. Clinical dimensions of personality disorders that lend themselves to the study of corresponding endophenotypes include affective instability, impulsivity, aggression, emotional information processing, cognitive disorganization, social deficits, and psychosis. For example, the propensity to aggression can be evaluated by psychometric measures, interview, laboratory paradigms, neurochemical imaging, and pharmacological studies. These suggest that aggression is a measurable trait that may be related to reduced serotonergic activity. Hyperresponsiveness of amygdala and other limbic structures may be related to affective instability, while structural and functional brain alterations underlie the cognitive disorganization in psychotic-like symptoms of schizotypal personality disorder. Thus, an endophenotypic approach not only provides clues to underlying candidate genes contributing to these behavioral dimensions, but may also point the way to a better understanding of pathophysiological mechanisms. PMID:16262209
DOE Office of Scientific and Technical Information (OSTI.GOV)
Moretti, Rocco; Chang, Aram; Peltier-Pain, Pauline
2012-03-15
Directed evolution is a valuable technique to improve enzyme activity in the absence of a priori structural knowledge, which can be typically enhanced via structure-guided strategies. In this study, a combination of both whole-gene error-prone polymerase chain reaction and site-saturation mutagenesis enabled the rapid identification of mutations that improved RmlA activity toward non-native substrates. These mutations have been shown to improve activities over 10-fold for several targeted substrates, including non-native pyrimidine- and purine-based NTPs as well as non-native d- and l-sugars (both a- and b-isomers). This study highlights the first broadly applicable high throughput sugar-1-phosphate nucleotidyltransferase screen and the first proof of concept for the directed evolution of this enzyme class toward the identification of uniquely permissive RmlA variants.
Bhatia, Chitra; Oerum, Stephanie; Bray, James; Kavanagh, Kathryn L; Shafqat, Naeem; Yue, Wyatt; Oppermann, Udo
2015-06-05
Short-chain dehydrogenases/reductases (SDRs) constitute a large, functionally diverse branch of enzymes within the class of NAD(P)(H) dependent oxidoreductases. In humans, over 80 genes have been identified with distinct metabolic roles in carbohydrate, amino acid, lipid, retinoid and steroid hormone metabolism, frequently associated with inherited genetic defects. Besides metabolic functions, a subset of atypical SDR proteins appears to play critical roles in adapting to redox status or RNA processing, and thereby controlling metabolic pathways. Here we present an update on the human SDR superfamily and a ligand identification strategy using differential scanning fluorimetry (DSF) with a focused library of oxidoreductase and metabolic ligands to identify substrate classes and inhibitor chemotypes. This method is applicable to investigate structure-activity relationships of oxidoreductases and ultimately to better understand their physiological roles. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Cortés-Romero, Celso; Martínez-Hernández, Aída; Mellado-Mojica, Erika; López, Mercedes G.; Simpson, June
2012-01-01
Fructans are the main storage polysaccharides found in Agave species. The synthesis of these complex carbohydrates relies on the activities of specific fructosyltransferase enzymes closely related to the hydrolytic invertases. Analysis of Agave tequilana transcriptome data led to the identification of ESTs encoding putative fructosyltransferases and invertases. Based on sequence alignments and structure/function relationships, two different genes were predicted to encode 1-SST and 6G-FFT type fructosyltransferases; in addition, 4 genes encoding putative cell wall invertases and 4 genes encoding putative vacuolar invertases were also identified. Probable functions for each gene were assigned based on conserved amino acid sequences and confirmed for 2 fructosyltransferases and one invertase by analyzing the enzymatic activity of recombinant Agave proteins expressed and purified from Pichia pastoris. The genome organization of the fructosyltransferase/invertase genes, for which the corresponding cDNA contained the complete open reading frame, was found to be well conserved, since all genes were shown to carry a 9 bp mini-exon and all showed a similar structure of 8 exons/7 introns, with the exception of a cell wall invertase gene which has 7 exons and 6 introns. Fructosyltransferase genes were strongly expressed in the storage organs of the plants, especially in vegetative stages of development, and at lower levels in photosynthetic tissues, in contrast to the invertase genes, where higher levels of expression were observed in leaf tissues and in mature plants. PMID:22558253
An open-source framework for large-scale, flexible evaluation of biomedical text mining systems.
Baumgartner, William A; Cohen, K Bretonnel; Hunter, Lawrence
2008-01-29
Improved evaluation methodologies have been identified as a necessary prerequisite to the improvement of text mining theory and practice. This paper presents a publicly available framework that facilitates thorough, structured, and large-scale evaluations of text mining technologies. The extensibility of this framework and its ability to uncover system-wide characteristics by analyzing component parts as well as its usefulness for facilitating third-party application integration are demonstrated through examples in the biomedical domain. Our evaluation framework was assembled using the Unstructured Information Management Architecture. It was used to analyze a set of gene mention identification systems involving 225 combinations of system, evaluation corpus, and correctness measure. Interactions between all three were found to affect the relative rankings of the systems. A second experiment evaluated gene normalization system performance using as input 4,097 combinations of gene mention systems and gene mention system-combining strategies. Gene mention system recall is shown to affect gene normalization system performance much more than does gene mention system precision, and high gene normalization performance is shown to be achievable with remarkably low levels of gene mention system precision. The software presented in this paper demonstrates the potential for novel discovery resulting from the structured evaluation of biomedical language processing systems, as well as the usefulness of such an evaluation framework for promoting collaboration between developers of biomedical language processing technologies. The code base is available as part of the BioNLP UIMA Component Repository on SourceForge.net.
An open-source framework for large-scale, flexible evaluation of biomedical text mining systems
Baumgartner, William A; Cohen, K Bretonnel; Hunter, Lawrence
2008-01-01
Background Improved evaluation methodologies have been identified as a necessary prerequisite to the improvement of text mining theory and practice. This paper presents a publicly available framework that facilitates thorough, structured, and large-scale evaluations of text mining technologies. The extensibility of this framework and its ability to uncover system-wide characteristics by analyzing component parts as well as its usefulness for facilitating third-party application integration are demonstrated through examples in the biomedical domain. Results Our evaluation framework was assembled using the Unstructured Information Management Architecture. It was used to analyze a set of gene mention identification systems involving 225 combinations of system, evaluation corpus, and correctness measure. Interactions between all three were found to affect the relative rankings of the systems. A second experiment evaluated gene normalization system performance using as input 4,097 combinations of gene mention systems and gene mention system-combining strategies. Gene mention system recall is shown to affect gene normalization system performance much more than does gene mention system precision, and high gene normalization performance is shown to be achievable with remarkably low levels of gene mention system precision. Conclusion The software presented in this paper demonstrates the potential for novel discovery resulting from the structured evaluation of biomedical language processing systems, as well as the usefulness of such an evaluation framework for promoting collaboration between developers of biomedical language processing technologies. The code base is available as part of the BioNLP UIMA Component Repository on SourceForge.net. PMID:18230184
Kou, Qiang; Wu, Si; Tolic, Nikola; Paša-Tolic, Ljiljana; Liu, Yunlong; Liu, Xiaowen
2017-05-01
Although proteomics has rapidly developed in the past decade, researchers are still in the early stage of exploring the world of complex proteoforms, which are protein products with various primary structure alterations resulting from gene mutations, alternative splicing, post-translational modifications, and other biological processes. Proteoform identification is essential to mapping proteoforms to their biological functions as well as discovering novel proteoforms and new protein functions. Top-down mass spectrometry is the method of choice for identifying complex proteoforms because it provides a 'bird's eye view' of intact proteoforms. The combinatorial explosion of various alterations on a protein may result in billions of possible proteoforms, making proteoform identification a challenging computational problem. We propose a new data structure, called the mass graph, for efficient representation of proteoforms and design mass graph alignment algorithms. We developed TopMG, a mass graph-based software tool for proteoform identification by top-down mass spectrometry. Experiments on top-down mass spectrometry datasets showed that TopMG outperformed existing methods in identifying complex proteoforms. Availability: http://proteomics.informatics.iupui.edu/software/topmg/. Contact: xwliu@iupui.edu. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Antunes, Deborah; Jorge, Natasha A. N.; Caffarena, Ernesto R.; Passetti, Fabio
2018-01-01
RNA molecules are essential players in many fundamental biological processes. Prokaryotes and eukaryotes have distinct RNA classes with specific structural features and functional roles. Computational prediction of protein structures is a research field in which high confidence three-dimensional protein models can be proposed based on the sequence alignment between target and templates. However, to date, only a few approaches have been developed for the computational prediction of RNA structures. Similar to proteins, RNA structures may be altered due to the interaction with various ligands, including proteins, other RNAs, and metabolites. A riboswitch is a molecular mechanism, found in the three kingdoms of life, in which the RNA structure is modified by the binding of a metabolite. It can regulate multiple gene expression mechanisms, such as transcription, translation initiation, and mRNA splicing and processing. Due to their nature, these entities also act on the regulation of gene expression and detection of small metabolites and have the potential to help in the discovery of new classes of antimicrobial agents. In this review, we describe software and web servers currently available for riboswitch aptamer identification and secondary and tertiary structure prediction, including applications. PMID:29403526
Identification and characterization of amelogenin genes in monotremes, reptiles, and amphibians
Toyosawa, Satoru; O’hUigin, Colm; Figueroa, Felipe; Tichy, Herbert; Klein, Jan
1998-01-01
Two features make the tooth an excellent model in the study of evolutionary innovations: the relative simplicity of its structure and the fact that the major tooth-forming genes have been identified in eutherian mammals. To understand the nature of the innovation at the molecular level, it is necessary to identify the homologs of tooth-forming genes in other vertebrates. As a first step toward this goal, homologs of the eutherian amelogenin gene have been cloned and characterized in selected species of monotremes (platypus and echidna), reptiles (caiman), and amphibians (African clawed toad). Comparisons of the homologs reveal that the amelogenin gene evolves quickly in the repeat region, in which numerous insertions and deletions have obliterated any similarity among the genes, and slowly in other regions. The gene organization, the distribution of hydrophobic and hydrophilic segments in the encoded protein, and several other features have been conserved throughout the evolution of the tetrapod amelogenin gene. Clones corresponding to one locus only were found in caiman, whereas the clawed toad possesses at least two amelogenin-encoding loci. PMID:9789040
Sönksen, Ute Wolff; Christensen, Jens Jørgen; Nielsen, Lisbeth; Hesselbjerg, Annemarie; Hansen, Dennis Schrøder; Bruun, Brita
2010-12-31
Taxonomy and identification of fastidious Gram negatives are evolving and challenging. We compared identifications achieved with the Vitek 2 Neisseria-Haemophilus (NH) card and partial 16S rRNA gene sequence (526 bp stretch) analysis with identifications obtained with extensive phenotypic characterization using 100 fastidious Gram negative bacteria. Seventy-five strains represented 21 of the 26 taxa included in the Vitek 2 NH database and 25 strains represented related species not included in the database. Of the 100 strains, 31 were the type strains of the species. Vitek 2 NH identification results: 48 of 75 database strains were correctly identified, 11 strains gave 'low discrimination', seven strains were unidentified, and nine strains were misidentified. Identification of 25 non-database strains resulted in 14 strains incorrectly identified as belonging to species in the database. Partial 16S rRNA gene sequence analysis results: For 76 strains phenotypic and sequencing identifications were identical, for 23 strains the sequencing identifications were either probable or possible, and for one strain only the genus was confirmed. Thus, the Vitek 2 NH system identifies most of the commonly occurring species included in the database. Some strains of rarely occurring species and strains of non-database species closely related to database species cause problems. Partial 16S rRNA gene sequence analysis performs well, but does not always suffice, additional phenotypical characterization being useful for final identification.
Sönksen, Ute Wolff; Christensen, Jens Jørgen; Nielsen, Lisbeth; Hesselbjerg, Annemarie; Hansen, Dennis Schrøder; Bruun, Brita
2010-01-01
Taxonomy and identification of fastidious Gram negatives are evolving and challenging. We compared identifications achieved with the Vitek 2 Neisseria-Haemophilus (NH) card and partial 16S rRNA gene sequence (526 bp stretch) analysis with identifications obtained with extensive phenotypic characterization using 100 fastidious Gram negative bacteria. Seventy-five strains represented 21 of the 26 taxa included in the Vitek 2 NH database and 25 strains represented related species not included in the database. Of the 100 strains, 31 were the type strains of the species. Vitek 2 NH identification results: 48 of 75 database strains were correctly identified, 11 strains gave 'low discrimination', seven strains were unidentified, and nine strains were misidentified. Identification of 25 non-database strains resulted in 14 strains incorrectly identified as belonging to species in the database. Partial 16S rRNA gene sequence analysis results: For 76 strains phenotypic and sequencing identifications were identical, for 23 strains the sequencing identifications were either probable or possible, and for one strain only the genus was confirmed. Thus, the Vitek 2 NH system identifies most of the commonly occurring species included in the database. Some strains of rarely occurring species and strains of non-database species closely related to database species cause problems. Partial 16S rRNA gene sequence analysis performs well, but does not always suffice, additional phenotypical characterization being useful for final identification. PMID:21347215
The versatile DNA nucleotide excision repair (NER) and its medical significance.
Falik-Zaccai, Tzipora C; Keren, Zohar; Slor, Hanoch
2009-12-01
Two of DNA's worst enemies, ultraviolet light and chemical carcinogens, can cause damage to the molecule by mutating individual nucleotides or changing its physical structure. In most cases, genomic integrity is restored by specialized suites of proteins dedicated to repairing specific types of injuries. One restoration mechanism, called nucleotide excision repair (NER), recruits and coordinates the services of 20-30 proteins to recognize and remove structure-impairing lesions, including those induced by ultraviolet (UV) light. Mutations in a gene that encodes a protein from the NER machinery might cause a wide variety of rare inherited human disorders. Sun sensitivity, cancer, developmental retardation, neurodegeneration and premature aging characterize these syndromes. Identification of the causative genes and proteins in affected families in Israel allowed us to establish accurate molecular diagnosis of couples at risk, and provide them with better genetic counseling.
Unexpected detection of porcine rotavirus C strains carrying human origin VP6 gene.
Kattoor, Jobin Jose; Saurabh, Sharad; Malik, Yashpal Singh; Sircar, Shubhankar; Dhama, Kuldeep; Ghosh, Souvik; Bányai, Krisztián; Kobayashi, Nobumichi; Singh, Raj Kumar
2017-12-01
Rotavirus C (RVC), a known etiological agent of diarrheal outbreaks, mainly afflicts swine populations globally, with sporadic incidence in humans, cattle, ferrets, mink and dogs. This study aimed to demonstrate the presence of RVC in the Indian swine population and to characterize its selected structural (VP6) and non-structural (NSP4 and NSP5) genes. A total of 108 diarrheic samples from different regions of India were used. Isolated RNA was loaded onto polyacrylamide gel to screen for the presence of RVs through the identification of specific electrophoretic genomic migration patterns. To characterize the RVC strains, the VP6, NSP4 and NSP5 genes were amplified, sequenced and analyzed. Based on VP6 gene-specific diagnostic RT-PCR, the presence of RVC was confirmed in 12.0% (13/108) of piglet fecal specimens. The nucleotide sequence analysis of the VP6 gene, encoding the inner capsid protein, from selected porcine RVC (PoRVC) strains revealed more than 93% homology to human RVC strains (HuRVC) of Eurasian origin. These strains were distant from hitherto reported PoRVCs and clustered with HuRVCs carrying the I2 genotype. However, the two non-structural genes, i.e. NSP4 and NSP5, of these strains were found to be of swine type, signifying a re-assortment event that has occurred in the Indian swine population. The findings indicate the presence of human-like RVC in Indian pigs and division of the RVC clade with I2 genotype into further sub-clades. To the best of our knowledge, this appears to be the first report of RVC in the Indian swine population. The incidence of a human-like RVC VP6 gene in swine supports its zoonotic potential.
2012-01-01
Background Single nucleotide polymorphism (SNP) validation and large-scale genotyping are required to maximize the use of DNA sequence variation and determine the functional relevance of candidate genes for complex stress tolerance traits through genetic association in rice. We used the bead array platform-based Illumina GoldenGate assay to validate and genotype SNPs in a select set of stress-responsive genes to understand their functional relevance and study the population structure in rice. Results Of the 384 putative SNPs assayed, we successfully validated and genotyped 362 (94.3%). Of these 325 (84.6%) showed polymorphism among the 91 rice genotypes examined. Physical distribution, degree of allele sharing, admixtures and introgression, and amino acid replacement of SNPs in 263 abiotic and 62 biotic stress-responsive genes provided clues for identification and targeted mapping of trait-associated genomic regions. We assessed the functional and adaptive significance of validated SNPs in a set of contrasting drought tolerant upland and sensitive lowland rice genotypes by correlating their allelic variation with amino acid sequence alterations in catalytic domains and three-dimensional secondary protein structure encoded by stress-responsive genes. We found a strong genetic association among SNPs in the nine stress-responsive genes with upland and lowland ecological adaptation. Higher nucleotide diversity was observed in indica accessions compared with other rice sub-populations based on different population genetic parameters. The inferred ancestry of 16% among rice genotypes was derived from admixed populations with the maximum between upland aus and wild Oryza species. Conclusions SNPs validated in biotic and abiotic stress-responsive rice genes can be used in association analyses to identify candidate genes and develop functional markers for stress tolerance in rice. PMID:22921105
IMPACT_S: integrated multiprogram platform to analyze and combine tests of selection.
Maldonado, Emanuel; Sunagar, Kartik; Almeida, Daniela; Vasconcelos, Vitor; Antunes, Agostinho
2014-01-01
Among the major goals of research in evolutionary biology are the identification of genes targeted by natural selection and understanding how various regimes of evolution affect the fitness of an organism. In particular, adaptive evolution enables organisms to adapt to changing ecological factors such as diet, temperature, habitat, predatory pressures and prey abundance. An integrative approach is crucial for the identification of non-synonymous mutations that introduce radical changes in protein biochemistry and thus in turn influence the structure and function of proteins. Performing such analyses manually is often a time-consuming process, due to the large number of statistical files generated from multiple approaches, especially when assessing numerous taxa and/or large datasets. We present IMPACT_S, an easy-to-use Graphical User Interface (GUI) software, which rapidly and effectively integrates, filters and combines results from three widely used programs for assessing the influence of selection: Codeml (PAML package), Datamonkey and TreeSAAP. It enables the identification and tabulation of sites detected by these programs as evolving under the influence of positive, neutral and/or negative selection in protein-coding genes. IMPACT_S further facilitates the automatic mapping of these sites onto the three-dimensional structures of proteins. Other useful tools incorporated in IMPACT_S include Jmol, Archaeopteryx, Gnuplot, PhyML, a built-in Swiss-Model interface and a PDB downloader. The relevance and functionality of IMPACT_S is shown through a case study on the toxicoferan-reptilian Cysteine-rich Secretory Proteins (CRiSPs). IMPACT_S is a platform-independent software released under GPLv3 license, freely available online from http://impact-s.sourceforge.net.
Functions and Mechanisms of Sleep in Flies and Mammals
2007-02-01
serotonin receptor likely to mediate the known interaction between the serotonergic Raphe nucleus and the LC (Htr1d). We have also confirmed the prior... Chemistry. His research focuses on mass spectrometry, a technique that will augment research on the mechanisms of sleep and complement microarray gene...labeling (ICAT, ITRAQ, etc); 8) MALDI and electrospray FTMS for the identification of small molecule structure; 9) Gas phase reactions within the FTMS
Genome-wide Identification and Expression Analysis of the CDPK Gene Family in Grape, Vitis spp.
Zhang, Kai; Han, Yong-Tao; Zhao, Feng-Li; Hu, Yang; Gao, Yu-Rong; Ma, Yan-Fei; Zheng, Yi; Wang, Yue-Jin; Wen, Ying-Qiang
2015-06-30
Calcium-dependent protein kinases (CDPKs) play vital roles in plant growth and development, biotic and abiotic stress responses, and hormone signaling. Little is known about the CDPK gene family in grapevine. In this study, we performed a genome-wide analysis of the 12X grape genome (Vitis vinifera) and identified nineteen CDPK genes. Comparison of the structures of grape CDPK genes allowed us to examine their functional conservation and differentiation. Segmentally duplicated grape CDPK genes showed high structural conservation and contributed to gene family expansion. Additional comparisons between grape and Arabidopsis thaliana demonstrated that several grape CDPK genes occurred in the corresponding syntenic blocks of Arabidopsis, suggesting that these genes arose before the divergence of grapevine and Arabidopsis. Phylogenetic analysis divided the grape CDPK genes into four groups. Furthermore, we examined the expression of the corresponding nineteen homologous CDPK genes in the Chinese wild grape (Vitis pseudoreticulata) under various conditions, including biotic stress, abiotic stress, and hormone treatments. The expression profiles derived from reverse transcription and quantitative PCR suggested that a large number of VpCDPKs responded to various stimuli on the transcriptional level, indicating their versatile roles in the responses to biotic and abiotic stresses. Moreover, we examined the subcellular localization of VpCDPKs by transiently expressing six VpCDPK-GFP fusion proteins in Arabidopsis mesophyll protoplasts; this revealed high variability consistent with potential functional differences. Taken as a whole, our data provide significant insights into the evolution and function of grape CDPKs and a framework for future investigation of grape CDPK genes.
Morimoto, Tomomi; Arii, Jun; Akashi, Hiroomi; Kawaguchi, Yasushi
2009-03-01
Information on sites in HSV genomes at which foreign gene(s) can be inserted without disrupting viral genes or affecting properties of the parental virus are important for basic research on HSV and development of HSV-based vectors for human therapy. The intergenic region between HSV-1 UL3 and UL4 genes has been reported to satisfy the requirements for such an insertion site. The UL3 and UL4 genes are oriented toward the intergenic region and, therefore, insertion of a foreign gene(s) into the region between the UL3 and UL4 polyadenylation signals should not disrupt any viral genes or transcriptional units. HSV-1 and HSV-2 each have more than 10 additional regions structurally similar to the intergenic region between UL3 and UL4. In the studies reported here, it has been demonstrated that insertion of a reporter gene expression cassette into several of the HSV-1 and HSV-2 intergenic regions has no effect on viral growth in cell culture or virulence in mice, suggesting that these multiple intergenic regions may be suitable HSV sites for insertion of foreign genes.
Song, Zhijiao; Zhang, Miaomiao; Li, Fagen; Weng, Qijie; Zhou, Chanpin; Li, Mei; Li, Jie; Huang, Huanhua; Mo, Xiaoyong; Gan, Siming
2016-01-01
Identification of loci or genes under natural selection is important for both understanding the genetic basis of local adaptation and practical applications, and genome scans provide a powerful means for such identification purposes. In this study, genome-wide simple sequence repeat (SSR) markers were used to scan for molecular footprints of divergent selection in Eucalyptus grandis, a hardwood species occurring widely in coastal areas from 32° S to 16° S in Australia. High population diversity levels and weak population structure were detected with putatively neutral genomic SSRs. Using three FST outlier detection methods, a total of 58 outlying SSRs were collectively identified as loci under divergent selection against three non-correlated climatic variables, namely, mean annual temperature, isothermality and annual precipitation. Using a spatial analysis method, nine significant associations were revealed between FST outlier allele frequencies and climatic variables, involving seven alleles from five SSR loci. Of the five significant SSRs, two (EUCeSSR1044 and Embra394) contained alleles of putative genes with known functional importance for response to climatic factors. Our study presents critical information on the population diversity and structure of the important woody species E. grandis and provides insight into the adaptive responses of perennial trees to climatic variations. PMID:27748400
A last stand in the Po valley: genetic structure and gene flow patterns in Ulmus minor and U. pumila
Bertolasi, B.; Leonarduzzi, C.; Piotti, A.; Leonardi, S.; Zago, L.; Gui, L.; Gorian, F.; Vanetti, I.; Binelli, G.
2015-01-01
Background and Aims Ulmus minor has been severely affected by Dutch elm disease (DED). The introduction into Europe of the exotic Ulmus pumila, highly tolerant to DED, has resulted in it widely replacing native U. minor populations. Morphological and genetic evidence of hybridization has been reported, and thus there is a need for assessment of interspecific gene flow patterns in natural populations. This work therefore aimed at studying pollen gene flow in a remnant U. minor stand surrounded by trees of both species scattered across an agricultural landscape. Methods All trees from a small natural stand (350 in number) and the surrounding agricultural area within a 5-km radius (89) were genotyped at six microsatellite loci. Trees were morphologically characterized as U. minor, U. pumila or intermediate phenotypes, and morphological identification was compared with Bayesian clustering of genotypes. For paternity analysis, seeds were collected in two consecutive years from 20 and 28 mother trees. Maximum likelihood paternity assignment was used to elucidate intra- and interspecific gene flow patterns. Key Results Genetic structure analyses indicated the presence of two genetic clusters only partially matching the morphological identification. The paternity analysis results were consistent between the two consecutive years of sampling and showed high pollen immigration rates (∼0·80) and mean pollination distances (∼3 km), and a skewed distribution of reproductive success. Few intercluster pollinations and putative hybrid individuals were found. Conclusions Pollen gene flow is not impeded in the fragmented agricultural landscape investigated. High pollen immigration and extensive pollen dispersal distances are probably counteracting the potential loss of genetic variation caused by isolation. Some evidence was also found that U. minor and U. pumila can hybridize when in sympatry. Although hybridization might have beneficial effects on both species, remnant U. minor populations represent a valuable source of genetic diversity that needs to be preserved. PMID:25725008
The Silver locus product Pmel17/gp100/Silv/ME20: controversial in name and in function
Theos, Alexander C.; Truschel, Steven T.; Raposo, Graça; Marks, Michael S.
2009-01-01
Summary Mouse coat color mutants have led to the identification of more than 120 genes that encode proteins involved in all aspects of pigmentation, from the regulation of melanocyte development and differentiation to the transcriptional activation of pigment genes, from the enzymatic formation of pigment to the control of melanosome biogenesis and movement [Bennett and Lamoreux (2003) Pigment Cell Res. 16, 333]. One of the more perplexing of the identified mouse pigment genes is encoded at the Silver locus, first identified by Dunn and Thigpen [(1930) J. Heredity 21, 495] as responsible for a recessive coat color dilution that worsened with age on black backgrounds. The product of the Silver gene has since been discovered numerous times in different contexts, including the initial search for the tyrosinase gene, the characterization of major melanosome constituents in various species, and the identification of tumor-associated antigens from melanoma patients. Each discoverer provided a distinct name: Pmel17, gp100, gp95, gp85, ME20, RPE1, SILV and MMP115 among others. Although all its functions are unlikely to have yet been fully described, the protein clearly plays a central role in the biogenesis of the early stages of the pigment organelle, the melanosome, in birds, and mammals. As such, we will refer to the protein in this review simply as pre-melanosomal protein (Pmel). This review will summarize the structural and functional aspects of Pmel and its role in melanosome biogenesis. PMID:16162173
Nuclear Receptors, RXR, and the Big Bang.
Evans, Ronald M; Mangelsdorf, David J
2014-03-27
Isolation of genes encoding the receptors for steroids, retinoids, vitamin D, and thyroid hormone and their structural and functional analysis revealed an evolutionarily conserved template for nuclear hormone receptors. This discovery sparked identification of numerous genes encoding related proteins, termed orphan receptors. Characterization of these orphan receptors and, in particular, of the retinoid X receptor (RXR) positioned nuclear receptors at the epicenter of the "Big Bang" of molecular endocrinology. This Review provides a personal perspective on nuclear receptors and explores their integrated and coordinated signaling networks that are essential for multicellular life, highlighting the RXR heterodimer and its associated ligands and transcriptional mechanism. Copyright © 2014 Elsevier Inc. All rights reserved.
Wang, Guifeng; Zhong, Mingyu; Wang, Gang; Song, Rentao
2014-01-01
The actin-based myosin system is essential for the organization and dynamics of the endomembrane system and transport network in plant cells. Plants harbour two unique myosin groups, class VIII and class XI, and the latter is structurally and functionally analogous to the animal and fungal class V myosin. Little is known about myosins in grass, even though grass includes several agronomically important cereal crops. Here, we identified 14 myosin genes from the genome of maize (Zea mays). The relatively larger sizes of maize myosin genes are due to their much longer introns, which are abundant in transposable elements. Phylogenetic analysis indicated that maize myosin genes could be classified into class VIII and class XI, with three and 11 members, respectively. Apart from subgroup XI-F, the remaining subgroups were duplicated at least in one analysed lineage, and the duplication events occurred more extensively in Arabidopsis than in maize. Only two pairs of maize myosins were generated from segmental duplication. Expression analysis revealed that most maize myosin genes were expressed universally, whereas a few members (XI-1, -6, and -11) showed an anther-specific pattern, and many underwent extensive alternative splicing. We also found a short transcript at the O1 locus, which conceptually encoded a headless myosin that most likely functions at the transcriptional level rather than via a dominant-negative mechanism at the translational level. Together, these data provide significant insights into the evolutionary and functional characterization of maize myosin genes that could transfer to the identification and application of homologous myosins of other grasses. PMID:24363426
Żak, Mariusz; Zaborowski, Piotr; Baczewska-Rej, Milena; Zasada, Aleksandra A; Matuszewska, Renata; Krogulska, Bożena
2011-12-20
For the last five years, Legionella sp. infections and Legionnaires' disease in Poland have been receiving a lot of attention because of the new regulations concerning the microbiological quality of drinking water. This was the inspiration to search for and develop a new assay to identify many virulence genes of Legionella pneumophila to better understand their distribution in environmental and clinical strains. The method might be an invaluable help in infection risk assessment and in epidemiological investigations. The microarray is based on Array Tube technology. It contains 3 positive controls and 1 negative control. Target genes encode structural elements of T4SS, effector proteins and factors not related to T4SS. Probes were designed using OligoWiz software and data were analyzed using IconoClust software. To isolate environmental and clinical strains, BAL samples and samples of hot water from different and independent hot water distribution systems of public utility buildings were collected. We have developed a miniaturized DNA microarray for identification of 66 virulence genes of L. pneumophila. The assay is specific to L. pneumophila sg 1, with sensitivity sufficient to perform the assay using DNA isolated from a single L. pneumophila colony. Seven environmental strains were analyzed. Two exhibited a hybridization pattern distinct from the reference strain. The method is time- and cost-effective. Initial studies have shown that genes encoding effector proteins may vary among environmental strains. Further studies might help to identify a set of genes increasing the risk of clinical disease and to determine the pathogenic potential of environmental strains.
The Burmese python genome reveals the molecular basis for extreme adaptation in snakes
Castoe, Todd A.; de Koning, A. P. Jason; Hall, Kathryn T.; Card, Daren C.; Schield, Drew R.; Fujita, Matthew K.; Ruggiero, Robert P.; Degner, Jack F.; Daza, Juan M.; Gu, Wanjun; Reyes-Velasco, Jacobo; Shaney, Kyle J.; Castoe, Jill M.; Fox, Samuel E.; Poole, Alex W.; Polanco, Daniel; Dobry, Jason; Vandewege, Michael W.; Li, Qing; Schott, Ryan K.; Kapusta, Aurélie; Minx, Patrick; Feschotte, Cédric; Uetz, Peter; Ray, David A.; Hoffmann, Federico G.; Bogden, Robert; Smith, Eric N.; Chang, Belinda S. W.; Vonk, Freek J.; Casewell, Nicholas R.; Henkel, Christiaan V.; Richardson, Michael K.; Mackessy, Stephen P.; Bronikowski, Anne M.; Yandell, Mark; Warren, Wesley C.; Secor, Stephen M.; Pollock, David D.
2013-01-01
Snakes possess many extreme morphological and physiological adaptations. Identification of the molecular basis of these traits can provide novel understanding for vertebrate biology and medicine. Here, we study snake biology using the genome sequence of the Burmese python (Python molurus bivittatus), a model of extreme physiological and metabolic adaptation. We compare the python and king cobra genomes along with genomic samples from other snakes and perform transcriptome analysis to gain insights into the extreme phenotypes of the python. We discovered rapid and massive transcriptional responses in multiple organ systems that occur on feeding and coordinate major changes in organ size and function. Intriguingly, the homologs of these genes in humans are associated with metabolism, development, and pathology. We also found that many snake metabolic genes have undergone positive selection, which together with the rapid evolution of mitochondrial proteins, provides evidence for extensive adaptive redesign of snake metabolic pathways. Additional evidence for molecular adaptation and gene family expansions and contractions is associated with major physiological and phenotypic adaptations in snakes; genes involved are related to cell cycle, development, lungs, eyes, heart, intestine, and skeletal structure, including GRB2-associated binding protein 1, SSH, WNT16, and bone morphogenetic protein 7. Finally, changes in repetitive DNA content, guanine-cytosine isochore structure, and nucleotide substitution rates indicate major shifts in the structure and evolution of snake genomes compared with other amniotes. Phenotypic and physiological novelty in snakes seems to be driven by system-wide coordination of protein adaptation, gene expression, and changes in the structure of the genome. PMID:24297902
NASA Astrophysics Data System (ADS)
Song, Xiaoming; Duan, Weike; Huang, Zhinan; Liu, Gaofeng; Wu, Peng; Liu, Tongkun; Li, Ying; Hou, Xilin
2015-09-01
In plants, flowering is the most important transition from vegetative to reproductive growth. The flowering patterns of monocots and eudicots are distinctly different, but few studies have described the evolutionary patterns of the flowering genes in them. In this study, we analysed the evolutionary pattern, duplication and expression level of these genes. The main results were as follows: (i) characterization of flowering genes in monocots and eudicots, including the identification of family-specific, orthologous and collinear genes; (ii) full characterization of CONSTANS-like genes in Brassica rapa (BraCOL genes), the key flowering genes; (iii) exploration of the evolution of COL genes in plant kingdom and construction of the evolutionary pattern of COL genes; (iv) comparative analysis of CO and FT genes between Brassicaceae and Grass, which identified several family-specific amino acids, and revealed that CO and FT protein structures were similar in B. rapa and Arabidopsis but different in rice; and (v) expression analysis of photoperiod pathway-related genes in B. rapa under different photoperiod treatments by RT-qPCR. This analysis will provide resources for understanding the flowering mechanisms and evolutionary pattern of COL genes. In addition, this genome-wide comparative study of COL genes may also provide clues for evolution of other flowering genes.
Mishra, Apurva; Pandey, Ramesh K; Manickam, Natesan
2015-01-01
Rapid phylogenetic and functional gene (gtfB) identification of S. mutans from the dental plaque derived from children. Dental plaque collected from fifteen patients aged 7-12 years underwent centrifugation followed by genomic DNA extraction for S. mutans. Genomic DNA was processed with S. mutans specific primers under suitable PCR conditions for phylogenetic and functional gene (gtfB) identification. The yield and results were confirmed by agarose gel electrophoresis. 1% agarose gel electrophoresis showed positive PCR amplification at 1,485 bp when compared with a 1 kbp standard, indicating the presence of S. mutans in the test sample. Another PCR reaction was set up using gtfB primers specific for S. mutans for functional gene identification. 1.2% agarose gel electrophoresis was done and a positive amplification was observed at 192 bp when compared to 100 bp standards. With the advancement in molecular biology techniques, PCR based identification and quantification of the bacterial load can be done within hours using species-specific primers and DNA probes. Thus, this technique may reduce the laboratory time spent in conventional culture methods, reduces the possibility of colony identification errors and is more sensitive than culture techniques.
ERIC Educational Resources Information Center
Castermans, Dries; Wilquet, Valerie; Steyaert, Jean; van de Ven, Wim; Fryns, Jean-Pierre; Devriendt, Koen
2004-01-01
We review the different strategies currently used to try to identify susceptibility genes for idiopathic autism. Although identification of genes is usually straightforward in Mendelian disorders, it has proved to be much more difficult to establish in polygenic disorders like autism. Neither genome screens of affected siblings nor the large…
Cui, Peng; Zhong, Tingyan; Wang, Zhuo; Wang, Tao; Zhao, Hongyu; Liu, Chenglin; Lu, Hui
2018-06-01
Circadian genes are expressed periodically with a period of approximately 24 h, and the identification and study of these genes can provide a deep understanding of circadian control, which plays significant roles in human health. Although many circadian gene identification algorithms have been developed, large numbers of false positives and low coverage are still major problems in this field. In this study we constructed a novel computational framework for circadian gene identification using deep neural networks (DNN), a deep learning algorithm which can represent the raw form of data patterns without imposing assumptions on the expression distribution. Firstly, we transformed time-course gene expression data into categorical-state data to denote the changing trend of gene expression. Two distinct expression patterns emerged after clustering of the state data for circadian genes from our manually created learning dataset. DNN was then applied to discriminate the aperiodic genes and the two subtypes of periodic genes. In order to assess the performance of DNN, four commonly used machine learning methods including k-nearest neighbors, logistic regression, naïve Bayes, and support vector machines were used for comparison. The results show that the DNN model achieves the best balanced precision and recall. Next, we conducted large scale circadian gene detection using the trained DNN model for the remaining transcription profiles. Compared with JTK_CYCLE and a study performed by Möller-Levet et al. (doi: https://doi.org/10.1073/pnas.1217154110), we identified 1132 novel periodic genes. Through the functional analysis of these novel circadian genes, we found that the GTPase superfamily exhibits distinct circadian expression patterns and may provide a molecular switch of circadian control of the functioning of the immune system in human blood.
Our study provides novel insights into both the circadian gene identification field and the study of complex circadian-driven biological control. This article is part of a Special Issue entitled: Accelerating Precision Medicine through Genetic and Genomic Big Data Analysis edited by Yudong Cai & Tao Huang. Copyright © 2017. Published by Elsevier B.V.
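The abstract above mentions transforming time-course expression data into categorical-state data that denote the changing trend of expression. The encoding actually used is not given here, so the following is only a minimal sketch under stated assumptions: a three-state scheme (up, down, flat) applied to consecutive time points, with a hypothetical `tolerance` parameter and an invented toy profile.

```python
from typing import List


def to_states(expression: List[float], tolerance: float = 0.0) -> List[str]:
    """Encode the trend between consecutive time points.

    'U' = up, 'D' = down, 'F' = flat (change within +/- tolerance).
    The three-state alphabet and the tolerance are illustrative
    assumptions, not the scheme described in the abstract.
    """
    states = []
    for previous, current in zip(expression, expression[1:]):
        change = current - previous
        if change > tolerance:
            states.append("U")
        elif change < -tolerance:
            states.append("D")
        else:
            states.append("F")
    return states


# Invented toy profile with six time points (not data from the study).
profile = [1.0, 2.5, 2.4, 0.8, 0.8, 1.9]
print("".join(to_states(profile, tolerance=0.2)))  # prints "UFDFU"
```

A state string of this kind could then be clustered or passed to a classifier; which downstream model to use is a separate choice.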
Active bacterial community structure along vertical redox gradients in Baltic Sea sediment
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jansson, Janet; Edlund, Anna; Hardeman, Fredrik
Community structures of active bacterial populations were investigated along a vertical redox profile in coastal Baltic Sea sediments by terminal-restriction fragment length polymorphism (T-RFLP) and clone library analysis. According to correspondence analysis of T-RFLP results and sequencing of cloned 16S rRNA genes, the microbial community structures at three redox depths (179 mV, -64 mV and -337 mV) differed significantly. The bacterial communities in the community DNA differed from those in bromodeoxyuridine (BrdU)-labeled DNA, indicating that the growing members of the community that incorporated BrdU were not necessarily the most dominant members. The structures of the actively growing bacterial communities were most strongly correlated to organic carbon followed by total nitrogen and redox potentials. Bacterial identification by sequencing of 16S rRNA genes from clones of BrdU-labeled DNA and DNA from reverse transcription PCR (rt-PCR) showed that bacterial taxa involved in nitrogen and sulfur cycling were metabolically active along the redox profiles. Several sequences had low similarities to previously detected sequences indicating that novel lineages of bacteria are present in Baltic Sea sediments. Also, a high number of different 16S rRNA gene sequences representing different phyla were detected at all sampling depths.
High Connectivity among Blue Crab (Callinectes sapidus) Populations in the Western South Atlantic
Kersanach, Ralf; Cortinhas, Maria Cristina Silva; Prata, Pedro Fernandes Sanmartin; Dumont, Luiz Felipe Cestari; Proietti, Maíra Carneiro; Maggioni, Rodrigo; D’Incao, Fernando
2016-01-01
Population connectivity in the blue crab Callinectes sapidus was evaluated along 740 km of the Western South Atlantic coast. Blue crabs are the most exploited portunid in Brazil. Despite their economic importance, few studies report their ecology or population structure. Here we sampled four estuarine areas in southern Brazil during winter 2013 and summer 2014 in order to evaluate diversity, gene flow and structure of these populations. Nine microsatellite markers were evaluated for 213 adult crabs, with identification of seven polymorphic loci and 183 alleles. Pairwise FST values indicated low population structure, ranging from -0.00023 to 0.01755. A Mantel test revealed that geographic distance does not influence genetic differentiation (r = -0.48), and structure/migration rates confirmed this, showing that even the populations located at the opposite extremities of our covered region presented low FST and exchanged migrants. These findings show that there is a significant amount of gene flow between blue crab populations in South Brazil, likely influenced by local current dynamics that allow the transport of a high number of larvae between estuaries. Considering the elevated gene flow, the populations can be considered a single genetic stock. However, further information on population size and dynamics, as well as fishery demands and impacts at different regions, is necessary for harvest management purposes. PMID:27064977
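The blue crab abstract relies on a Mantel test to relate geographic and genetic distances. As a rough illustration of how such a test works, the sketch below correlates the upper triangles of two distance matrices and estimates a two-sided permutation p-value by jointly reordering the rows and columns of one matrix. The function names and both toy matrices are hypothetical; this is not the software or the data used in the study.

```python
import random
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def upper_triangle(matrix, order):
    """Upper-triangle entries after reordering rows and columns by `order`."""
    return [matrix[order[i]][order[j]]
            for i, j in combinations(range(len(order)), 2)]


def mantel(geo, gen, permutations=999, seed=0):
    """Return (r, p): matrix correlation and two-sided permutation p-value."""
    rng = random.Random(seed)
    identity = list(range(len(geo)))
    geo_vec = upper_triangle(geo, identity)
    observed = pearson(geo_vec, upper_triangle(gen, identity))
    hits = 0
    for _ in range(permutations):
        order = identity[:]
        rng.shuffle(order)  # relabel populations in the genetic matrix
        if abs(pearson(geo_vec, upper_triangle(gen, order))) >= abs(observed):
            hits += 1
    return observed, (hits + 1) / (permutations + 1)


# Invented distances for four sites (km) and invented pairwise FST values.
geo = [[0, 150, 400, 740], [150, 0, 250, 590], [400, 250, 0, 340], [740, 590, 340, 0]]
gen = [[0, 0.012, 0.004, 0.001], [0.012, 0, 0.009, 0.006], [0.004, 0.009, 0, 0.015], [0.001, 0.006, 0.015, 0]]
r, p = mantel(geo, gen)
print(f"r = {r:.2f}, p = {p:.3f}")
```

With only four populations there are just 24 distinct relabelings, so the permutation p-value is coarse; that limitation applies to any small design, not only to this sketch.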
iSS-PseDNC: identifying splicing sites using pseudo dinucleotide composition.
Chen, Wei; Feng, Peng-Mian; Lin, Hao; Chou, Kuo-Chen
2014-01-01
In eukaryotic genes, exons are generally interrupted by introns. Accurately removing introns and joining exons together are essential processes in eukaryotic gene expression. With the avalanche of genome sequences generated in the postgenomic age, it is highly desired to develop automated methods for rapid and effective detection of splice sites that play important roles in gene structure annotation and even in RNA splicing. Although a series of computational methods were proposed for splice site identification, most of them neglected the intrinsic local structural properties. In the present study, a predictor called "iSS-PseDNC" was developed for identifying splice sites. In the new predictor, the sequences were formulated by a novel feature-vector called "pseudo dinucleotide composition" (PseDNC) into which six DNA local structural properties were incorporated. It was observed by the rigorous cross-validation tests on two benchmark datasets that the overall success rates achieved by iSS-PseDNC in identifying splice donor site and splice acceptor site were 85.45% and 87.73%, respectively. It is anticipated that iSS-PseDNC may become a useful tool for identifying splice sites and that the six DNA local structural properties described in this paper may provide novel insights for in-depth investigations into the mechanism of RNA splicing.
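To make the idea of a pseudo dinucleotide composition vector more concrete, here is a minimal sketch assuming the general pseudo-composition form: 16 normalized dinucleotide frequencies followed by `lam` weighted sequence-order correlation terms computed from per-dinucleotide property values. The function name, the parameters `lam` and `weight`, and in particular `PLACEHOLDER_PROPERTIES` are assumptions for illustration; the placeholder table does not contain the six DNA local structural properties used by iSS-PseDNC.

```python
from itertools import product
from typing import Dict, List

DINUCLEOTIDES = ["".join(pair) for pair in product("ACGT", repeat=2)]

# Placeholder property table (GC count, purine count per dinucleotide).
# These are NOT the structural properties from the paper; substitute a
# real, normalized property table before using this for anything serious.
PLACEHOLDER_PROPERTIES: Dict[str, List[float]] = {
    d: [float(sum(b in "GC" for b in d)), float(sum(b in "AG" for b in d))]
    for d in DINUCLEOTIDES
}


def pse_dnc(seq: str, properties: Dict[str, List[float]],
            lam: int = 2, weight: float = 0.5) -> List[float]:
    """Return a (16 + lam)-dimensional pseudo dinucleotide composition vector.

    Assumes the sequence contains only A, C, G and T.
    """
    seq = seq.upper()
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    if len(pairs) <= lam:
        raise ValueError("sequence too short for the requested lam")
    n_props = len(next(iter(properties.values())))

    # Local composition: normalized dinucleotide frequencies.
    freqs = [pairs.count(d) / len(pairs) for d in DINUCLEOTIDES]

    # Sequence-order terms: mean squared property difference between
    # dinucleotides that are j positions apart, for j = 1..lam.
    thetas = []
    for j in range(1, lam + 1):
        terms = [
            sum((properties[a][k] - properties[b][k]) ** 2
                for k in range(n_props)) / n_props
            for a, b in zip(pairs, pairs[j:])
        ]
        thetas.append(sum(terms) / len(terms))

    denom = sum(freqs) + weight * sum(thetas)
    return [f / denom for f in freqs] + [weight * t / denom for t in thetas]


vector = pse_dnc("ATGGTAAGTCCA", PLACEHOLDER_PROPERTIES, lam=2, weight=0.5)
print(len(vector))  # prints 18
```

The resulting fixed-length vector can be fed to any standard classifier; by construction its components sum to 1, so `weight` controls how much of the vector is given to the sequence-order terms relative to the plain frequencies.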
A roadmap for the genetic analysis of renal aging
Noordmans, Gerda A; Hillebrands, Jan-Luuk; van Goor, Harry; Korstanje, Ron
2015-01-01
Several studies show evidence for the genetic basis of renal disease, which renders some individuals more prone than others to accelerated renal aging. Studying the genetics of renal aging can help us to identify genes involved in this process and to unravel the underlying pathways. First, this opinion article will give an overview of the phenotypes that can be observed in age-related kidney disease. Accurate phenotyping is essential in performing genetic analysis. For kidney aging, this could include both functional and structural changes. Subsequently, this article reviews the studies that report on candidate genes associated with renal aging in humans and mice. Several loci or candidate genes have been found associated with kidney disease, but identification of the specific genetic variants involved has proven to be difficult. CUBN, UMOD, and SHROOM3 were identified by human GWAS as being associated with albuminuria, kidney function, and chronic kidney disease (CKD). These are promising examples of genes that could be involved in renal aging, and were further mechanistically evaluated in animal models. Eventually, we will provide approaches for performing genetic analysis. We should leverage the power of mouse models, as testing in humans is limited. Mouse and other animal models can be used to explain the underlying biological mechanisms of genes and loci identified by human GWAS. Furthermore, mouse models can be used to identify genetic variants associated with age-associated histological changes, of which Far2, Wisp2, and Esrrg are examples. A new outbred mouse population with high genetic diversity will facilitate the identification of genes associated with renal aging by enabling high-resolution genetic mapping while also allowing the control of environmental factors, and by enabling access to renal tissues at specific time points for histology, proteomics, and gene expression. PMID:26219736
Zhang, Wei; Li, Yaoyao; Qian, Guoliang; Wang, Yan; Chen, Haotong; Li, Yue-Zhong; Liu, Fengquan; Shen, Yuemao; Du, Liangcheng
2011-01-01
Lysobacter enzymogenes strain OH11 is an emerging biological control agent of fungal and bacterial diseases. We recently completed its genome sequence and found it contains a large number of gene clusters putatively responsible for the biosynthesis of nonribosomal peptides and polyketides, including the previously identified antifungal dihydromaltophilin (HSAF). One of the gene clusters contains two huge open reading frames, together encoding 12 modules of nonribosomal peptide synthetases (NRPS). Gene disruption of one of the NRPS led to the disappearance of a metabolite produced in the wild type and the elimination of its antibacterial activity. The metabolite and antibacterial activity were also affected by the disruption of some of the flanking genes. We subsequently isolated this metabolite and subjected it to spectroscopic analysis. The mass spectrometry and nuclear magnetic resonance data showed that its chemical structure is identical to WAP-8294A2, a cyclic lipodepsipeptide with potent anti-methicillin-resistant Staphylococcus aureus (MRSA) activity and currently in phase I/II clinical trials. The WAP-8294A2 biosynthetic genes had not been described previously. So far, the Gram-positive Streptomyces have been the primary source of anti-infectives. Lysobacter are Gram-negative soil/water bacteria that are genetically amenable and have not been well exploited. The WAP-8294A2 synthetase represents one of the largest NRPS complexes, consisting of 45 functional domains. The identification of these genes sets the foundation for the study of the WAP-8294A2 biosynthetic mechanism and opens the door for producing new anti-MRSA antibiotics through biosynthetic engineering in this new source of Lysobacter. PMID:21930890
Deng, Youjin; Zhang, Qihui; Ming, Ray; Lin, Longji; Lin, Xiangzhi; Lin, Yiying; Li, Xiao; Xie, Baogui; Wen, Zhiqiang
2016-06-30
Hypomyces aurantius is a mycoparasite that causes cobweb disease, a most serious disease of cultivated mushrooms. Intra-species identification is vital for disease control; however, the lack of genomic data makes development of molecular markers challenging. The small size, high copy number, and high mutation rate of the fungal mitochondrial genome make it a good candidate for intra- and inter-species differentiation. In this study, the mitochondrial genome of H. aurantius H.a0001 was determined from genomic DNA using Illumina sequencing. The roughly 72 kb genome shows all major features found in other Hypocreales: 14 common protein genes, large and small subunit rRNA genes and 27 tRNA genes. Gene arrangement comparison showed that gene orders in Hypocreales mitochondria are relatively conserved, with the exception of Acremonium chrysogenum and Acremonium implicatum. Mitochondrial genome comparison also revealed that intron length primarily contributes to mitogenome size variation. Seventeen introns were detected in six conserved genes: five in cox1, four in rnl, three in cob, two each in atp6 and cox3, and one in cox2. Four introns were found to contain two introns or open reading frames: cox3-i2 is a twintron containing two group IA type introns; cox2-i1 is a group IB intron encoding two homing endonucleases; and cox1-i4 and cox1-i3 both contain two open reading frames (ORFs). Analyses combining secondary intronic structures, insertion sites, and similarities of homing endonuclease genes reveal two group IA introns arranged side by side within cox3-i2. Mitochondrial data for H. aurantius provide the basis for further studies relating to population genetics and species identification.
Deng, Youjin; Zhang, Qihui; Ming, Ray; Lin, Longji; Lin, Xiangzhi; Lin, Yiying; Li, Xiao; Xie, Baogui; Wen, Zhiqiang
2016-01-01
Hypomyces aurantius is a mycoparasite that causes cobweb disease, a most serious disease of cultivated mushrooms. Intra-species identification is vital for disease control; however, the lack of genomic data makes development of molecular markers challenging. The small size, high copy number, and high mutation rate of the fungal mitochondrial genome make it a good candidate for intra- and inter-species differentiation. In this study, the mitochondrial genome of H. aurantius H.a0001 was determined from genomic DNA using Illumina sequencing. The roughly 72 kb genome shows all major features found in other Hypocreales: 14 common protein genes, large and small subunit rRNA genes and 27 tRNA genes. Gene arrangement comparison showed that gene orders in Hypocreales mitochondria are relatively conserved, with the exception of Acremonium chrysogenum and Acremonium implicatum. Mitochondrial genome comparison also revealed that intron length primarily contributes to mitogenome size variation. Seventeen introns were detected in six conserved genes: five in cox1, four in rnl, three in cob, two each in atp6 and cox3, and one in cox2. Four introns were found to contain two introns or open reading frames: cox3-i2 is a twintron containing two group IA type introns; cox2-i1 is a group IB intron encoding two homing endonucleases; and cox1-i4 and cox1-i3 both contain two open reading frames (ORFs). Analyses combining secondary intronic structures, insertion sites, and similarities of homing endonuclease genes reveal two group IA introns arranged side by side within cox3-i2. Mitochondrial data for H. aurantius provide the basis for further studies relating to population genetics and species identification. PMID:27376282
An approach to large scale identification of non-obvious structural similarities between proteins
Cherkasov, Artem; Jones, Steven JM
2004-01-01
Background A new sequence independent bioinformatics approach allowing genome-wide search for proteins with similar three dimensional structures has been developed. By utilizing the numerical output of the sequence threading it establishes putative non-obvious structural similarities between proteins. When applied to the testing set of proteins with known three dimensional structures the developed approach was able to recognize structurally similar proteins with high accuracy. Results The method has been developed to identify pathogenic proteins with low sequence identity and high structural similarity to host analogues. Such protein structure relationships would be hypothesized to arise through convergent evolution or through ancient horizontal gene transfer events, now undetectable using current sequence alignment techniques. The pathogen proteins, which could mimic or interfere with host activities, would represent candidate virulence factors. The developed approach utilizes the numerical outputs from the sequence-structure threading. It identifies the potential structural similarity between a pair of proteins by correlating the threading scores of the corresponding two primary sequences against the library of the standard folds. This approach allowed up to 64% sensitivity and 99.9% specificity in distinguishing protein pairs with high structural similarity. Conclusion Preliminary results obtained by comparison of the genomes of Homo sapiens and several strains of Chlamydia trachomatis have demonstrated the potential usefulness of the method in the identification of bacterial proteins with known or potential roles in virulence. PMID:15147578
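The score-correlation idea in the abstract above can be illustrated with a minimal sketch. It assumes each protein has already been threaded against the same ordered fold library, so that each protein is represented by a list of numerical scores. The function names, the example scores, and the 0.8 threshold are hypothetical and are not taken from the paper; Pearson correlation is used here as one plausible choice of correlation measure.

```python
import math


def pearson(x, y):
    """Pearson correlation between two equal-length score vectors."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("score vectors must have equal length >= 2")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("a constant score vector has no defined correlation")
    return cov / math.sqrt(var_x * var_y)


def similar_pairs(scores, threshold=0.8):
    """Return (name_a, name_b, r) for protein pairs whose threading-score
    profiles correlate at or above the (hypothetical) threshold.

    scores: dict mapping protein name -> list of threading scores,
            one score per fold, in the same fold order for every protein.
    """
    names = sorted(scores)
    hits = []
    for i, a in enumerate(names):
        for b in names[i + 1:]:
            r = pearson(scores[a], scores[b])
            if r >= threshold:
                hits.append((a, b, r))
    return hits


# Made-up scores against a four-fold library, for illustration only.
example = {
    "host_protein": [1.0, 2.0, 3.0, 4.0],
    "pathogen_protein": [2.0, 4.0, 6.0, 8.0],
    "unrelated_protein": [4.0, 3.0, 2.0, 1.0],
}
print(similar_pairs(example))
```

Comparing score profiles rather than sequences is what makes such an approach sequence independent: two proteins with low sequence identity can still rank the library folds in a similar way.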
Liu, Shikai; Zhang, Jiaren; Yao, Jun; Liu, Zhanjiang
2016-05-01
The complete mitochondrial genome of the armored catfish, Hypostomus plecostomus, was determined by next generation sequencing of genomic DNA without prior sample processing or primer design. Bioinformatics analysis resulted in the entire mitochondrial genome sequence with a length of 16,523 bp. The H. plecostomus mitochondrial genome consists of 13 protein-coding genes, 22 tRNA genes, 2 rRNA genes, and 1 control region, showing the typical circular molecule structure of the mitochondrial genome seen in other vertebrates. The whole genome base composition was estimated to be 31.8% A, 27.0% T, 14.6% G, and 26.6% C, with an A/T bias of 58.8%. This work provided the H. plecostomus mitochondrial genome sequence, which should be valuable for species identification, phylogenetic analysis and conservation genetics studies in catfishes.
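The reported A/T bias is simply the sum of the A and T percentages (31.8% + 27.0% = 58.8%). A minimal sketch of how such figures could be computed from a sequence string follows; the function names and the toy sequence are illustrative assumptions, and only the four reported percentages come from the abstract.

```python
from collections import Counter


def base_composition(seq):
    """Percentage of A, C, G and T in a nucleotide sequence.

    Characters other than A/C/G/T (for example N or gaps) are ignored.
    """
    counts = Counter(base for base in seq.upper() if base in "ACGT")
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no A/C/G/T characters")
    return {base: 100.0 * counts[base] / total for base in "ACGT"}


def at_content(composition):
    """A+T percentage from a composition dictionary."""
    return composition["A"] + composition["T"]


# Percentages reported in the abstract.
reported = {"A": 31.8, "T": 27.0, "G": 14.6, "C": 26.6}
print(round(at_content(reported), 1))    # A/T bias from the reported values
print(round(sum(reported.values()), 1))  # the four percentages should total 100

# Toy sequence (not real data) to show the per-base calculation.
print(base_composition("AATTGC"))
```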
Ciok, Anna; Adamczuk, Marcin; Bartosik, Dariusz; Dziewit, Lukasz
2016-11-28
Pseudomonas strains isolated from the heavily contaminated Lubin copper mine and Zelazny Most post-flotation waste reservoir in Poland were screened for the presence of integrons. This analysis revealed that two strains carried homologous DNA regions composed of a gene encoding a DNA_BRE_C domain-containing tyrosine recombinase (with no significant sequence similarity to other integrases of integrons) plus a three-component array of putative integron gene cassettes. The predicted gene cassettes encode three putative polypeptides with homology to (i) transmembrane proteins, (ii) GCN5 family acetyltransferases, and (iii) hypothetical proteins of unknown function (homologous proteins are encoded by the gene cassettes of several class 1 integrons). Comparative sequence analyses identified three structural variants of these novel integron-like elements within the sequenced bacterial genomes. Analysis of their distribution revealed that they are found exclusively in strains of the genus Pseudomonas.
Isolation and identification of new pollen-specific SFB genes in Japanese apricot (Prunus mume).
Wang, P P; Gao, Z H; Ni, Z J; Zhuang, W B; Zhang, Z
2013-09-03
SFB, a candidate gene for the pollen S gene, has been identified in several species of Prunus (Rosaceae). We isolated 5 new SFB alleles from 6 Japanese apricot (Prunus mume) lines using a specific Prunus SFB primer pair (SFB-C1F and Pm-Vb), which was designed from conserved regions of Prunus SFB. The nucleotide sequences of these SFB genes were submitted to the GenBank database. The 5 new SFB alleles share typical structural features with SFB alleles from other Prunus species and were found to be polymorphic, with 67.08 to 96.91% amino acid identity. These new SFB alleles were specifically expressed in the pollen. We conclude that the PmSFB alleles that we identified are the pollen S determinants of Japanese apricot; they have potential as a tool for studies of the mechanisms of pollen self-incompatibility.
Hu, Jianhua; Wright, Fred A
2007-03-01
The identification of the genes that are differentially expressed in two-sample microarray experiments remains a difficult problem when the number of arrays is very small. We discuss the implications of using ordinary t-statistics and examine other commonly used variants. For oligonucleotide arrays with multiple probes per gene, we introduce a simple model relating the mean and variance of expression, possibly with gene-specific random effects. Parameter estimates from the model have natural shrinkage properties that guard against inappropriately small variance estimates, and the model is used to obtain a differential expression statistic. A limiting value to the positive false discovery rate (pFDR) for ordinary t-tests provides motivation for our use of the data structure to improve variance estimates. Our approach performs well compared to other proposed approaches in terms of the false discovery rate.
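To illustrate why very small sample sizes are a problem for ordinary t-statistics, the sketch below compares an unequal-variance t-statistic with a version whose variance estimates are shrunk toward a prior value. This is a generic shrinkage scheme chosen for illustration and is not the authors' mean-variance model; the function names, the `prior_var` and `weight` parameters, and the example values are all hypothetical.

```python
import math
from statistics import mean, variance


def ordinary_t(x, y):
    """Two-sample t-statistic with separate (unpooled) variance estimates."""
    se = math.sqrt(variance(x) / len(x) + variance(y) / len(y))
    return (mean(x) - mean(y)) / se


def shrunken_t(x, y, prior_var, weight):
    """t-like statistic whose variances are shrunk toward prior_var.

    weight = 0 reproduces the ordinary statistic; weight = 1 uses only
    the prior variance. Both parameters are illustrative assumptions.
    """
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")
    var_x = (1.0 - weight) * variance(x) + weight * prior_var
    var_y = (1.0 - weight) * variance(y) + weight * prior_var
    se = math.sqrt(var_x / len(x) + var_y / len(y))
    return (mean(x) - mean(y)) / se


# Two arrays per condition: the sample variances happen to be tiny,
# which inflates the ordinary statistic.
control = [1.0, 1.1]
treated = [2.0, 2.1]
print(ordinary_t(control, treated))
print(shrunken_t(control, treated, prior_var=0.5, weight=0.5))
```

With only two arrays per group, a gene can show an extreme ordinary t-statistic purely because its sample variance is small by chance; pulling the variance toward a value informed by other genes dampens that effect.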
Huang, Tingting; Wang, Yemin; Yin, Jun; Du, Yanhua; Tao, Meifeng; Xu, Jing; Chen, Wenqing; Lin, Shuangjun; Deng, Zixin
2011-01-01
Pyridomycin is a structurally unique antimycobacterial cyclodepsipeptide containing rare 3-(3-pyridyl)-l-alanine and 2-hydroxy-3-methylpent-2-enoic acid moieties. The biosynthetic gene cluster for pyridomycin has been cloned and identified from Streptomyces pyridomyceticus NRRL B-2517. Sequence analysis of a 42.5-kb DNA region revealed 26 putative open reading frames, including two nonribosomal peptide synthetase (NRPS) genes and a polyketide synthase gene. A special feature is the presence of a polyketide synthase-type ketoreductase domain embedded in an NRPS. Furthermore, we showed that PyrA functioned as an NRPS adenylation domain that activates 3-hydroxypicolinic acid and transfers it to a discrete peptidyl carrier protein, PyrU, which functions as a loading module that initiates pyridomycin biosynthesis in vivo and in vitro. PyrA could also activate other aromatic acids, generating three pyridomycin analogues in vivo. PMID:21454714
Kowata, Kinue; Nakaoka, Minori; Nishio, Kaori; Fukao, Ayaka; Satoh, Akira; Ogoshi, Maho; Takahashi, Sumio; Tsudzuki, Masaoki; Takeuchi, Sakae
2014-05-25
Feathers are elaborate skin appendages shared by birds and theropod dinosaurs that have hierarchical branching of the rachis, barbs, and barbules. Feather filaments consist of β-keratins encoded by multiple genes, most of which are located in tandem arrays on chromosomes 2, 25, and 27 in chicken. The expansion of the genes is thought to have contributed to feather evolution; however, it is unclear how the individual genes are involved in feather formation. The aim of the present study was to identify feather keratin genes involved in the formation of barbules. Using a combination of microarray analysis, reverse-transcription polymerase chain reaction, and in situ hybridization, we found an uncharacterized keratin gene on chromosome 7 that was expressed specifically in barbule cells in regenerating chicken feathers. We have named the gene barbule specific keratin 1 (BlSK1). The BlSK1 gene structure was similar to the gene structure of previously characterized feather keratin genes, and consisted of a non-coding leader exon, an intron, and an exon with an open reading frame (ORF). The ORF was predicted to encode a 98 aa long protein, which shared 59% identity with feather keratin B. Orthologs of BlSK1 were found in the genomes of other avian species, including turkey, duck, zebra finch, and flycatcher, in regions that shared synteny with chromosome 7 of chicken. Interestingly, BlSK1 was expressed in feather follicles that generated pennaceous barbules but not in follicles that generated plumulaceous barbules. These results suggested that the composition of feather keratins probably varies depending on the structure of the feather filaments, and that individual feather keratin genes may be involved in building different portions and/or types of feathers in chicken. Copyright © 2014 Elsevier B.V. All rights reserved.
Hanin, Aurelie; Sava, Irina; Bao, YinYin; Huebner, Johannes; Hartke, Axel; Auffray, Yanick; Sauvageot, Nicolas
2010-01-01
Enterococcus faecalis is part of the commensal microbiota of humans and its main habitat is the gastrointestinal tract. Although harmless in healthy individuals, E. faecalis has emerged as a major cause of nosocomial infections. In order to better understand the transformation of a harmless commensal into a life-threatening pathogen, we developed a Recombination-based In Vivo Expression Technology (R-IVET) for E. faecalis. Two R-IVET systems with different levels of sensitivity have been constructed in an E. faecalis V583 derivative strain and tested in the insect model Galleria mellonella, during growth in urine, in a mouse bacteremia model and in a mouse peritonitis model. Our combined results led to the identification of 81 in vivo activated genes. Among them, the ef_3196/7 operon was shown to be strongly induced in the insect host model. Deletion of this operonic structure demonstrated that this two-component system was essential to the E. faecalis pathogenic potential in Galleria. Gene ef_0377, induced in insect and mammalian models, has also been further analyzed and it has been demonstrated that this ankyrin-encoding gene was also involved in E. faecalis virulence. Thus these R-IVET screenings led to the identification of new E. faecalis factors implicated in the in vivo persistence and pathogenic potential of this opportunistic pathogen. PMID:20686694
Genome-wide identification and characterization of aquaporin gene family in Beta vulgaris
Kong, Weilong; Yang, Shaozong; Wang, Yulu; Bendahmane, Mohammed
2017-01-01
Aquaporins (AQPs) are essential channel proteins that perform multiple functions throughout plant growth and development, including water transport, uptake of uncharged solutes, and stress response. Here, we report the first genome-wide identification and characterization of AQP (BvAQP) genes in sugar beet (Beta vulgaris), an important crop widely cultivated for feed, for sugar production and for bioethanol production. Twenty-eight sugar beet AQPs (BvAQPs) were identified and assigned to five subfamilies based on phylogenetic analyses: seven plasma membrane (PIPs), eight tonoplast (TIPs), nine NOD26-like (NIPs), three small basic (SIPs), and one x-intrinsic protein (XIPs). BvAQP genes mapped unevenly on all chromosomes, except chromosome 4. Gene structure and motif analyses revealed that BvAQPs have a conserved exon-intron organization and that they exhibit conserved motifs within each subfamily. Prediction of BvAQP functions, based on conservation of key protein domains, showed a remarkable difference in substrate specificity among the five subfamilies. Analyses of BvAQP expression, by means of RNA-seq, in different plant organs and in response to various abiotic stresses revealed that they were ubiquitously expressed and that their expression was induced by heat and salt stresses. These results provide a reference base to address further the function of sugar beet aquaporins and to explore future applications for improving plant growth and development as well as the response to environmental stresses. PMID:28948097
Gao, Liangliang; Turner, M Kathryn; Chao, Shiaoman; Kolmer, James; Anderson, James A
2016-01-01
Leaf rust is an important disease, threatening wheat production annually. Identification of resistance genes or QTLs for effective field resistance could greatly enhance our ability to breed durably resistant varieties. We applied a genome wide association study (GWAS) approach to identify resistance genes or QTLs in 338 spring wheat breeding lines from public and private sectors that were predominately developed in the Americas. A total of 46 QTLs were identified for field and seedling traits and approximately 20-30 confer field resistance in varying degrees. The 10 QTLs accounting for the most variation in field resistance explained 26-30% of the total variation (depending on traits: percent severity, coefficient of infection or response type). Similarly, the 10 QTLs accounting for most of the variation in seedling resistance to different races explained 24-34% of the variation, after correcting for population structure. Two potentially novel QTLs (QLr.umn-1AL, QLr.umn-4AS) were identified. Identification of novel genes or QTLs and validation of previously identified genes or QTLs for seedling and especially adult plant resistance will enhance understanding of leaf rust resistance and assist breeding for resistant wheat varieties. We also developed computer programs to automate field and seedling rust phenotype data conversions. This is the first GWAS study of leaf rust resistance in elite wheat breeding lines genotyped with high density 90K SNP arrays.
Cao, Yunpeng; Han, Yahui; Meng, Dandan; Li, Dahui; Jiao, Chunyan; Jin, Qing; Lin, Yi; Cai, Yongping
2017-09-19
The B-BOX (BBX) proteins have important functions in regulating plant growth and development. The BBX gene family has been identified in several plants, such as rice, Arabidopsis and tomato. However, a genome-wide survey of BBX genes in pear is still lacking. In the present study, a total of 25 BBX genes were identified in pear (Pyrus bretschneideri Rehd.). Subsequently, analyses of phylogenetic relationships, gene structure, gene duplication, transcriptome data and qRT-PCR were conducted on these BBX gene members. The transcript analysis revealed that twelve PbBBX genes (48%) were specifically expressed in pear pollen tubes. Furthermore, qRT-PCR analysis indicated that both PbBBX4 and PbBBX13 have potential roles in pear fruit development, while PbBBX5 is likely involved in the senescence of the pear pollen tube. This study provided a genome-wide survey of the BBX gene family in pear, and highlighted its roles in both pear fruits and pollen tubes. The results will be useful in improving our understanding of the complexity of the BBX gene family and the functional characteristics of its members in future studies.
Zhou, Yan; Xu, Daixiang; Jia, Ledong; Huang, Xiaohu; Ma, Guoqiang; Wang, Shuxian; Zhu, Meichen; Zhang, Aoxiang; Guan, Mingwei; Lu, Kun; Xu, Xinfu; Wang, Rui; Li, Jiana; Qu, Cunmin
2017-10-24
The basic region/leucine zipper motif (bZIP) transcription factor family is one of the largest families of transcriptional regulators in plants. bZIP genes have been systematically characterized in some plants, but not in rapeseed (Brassica napus). In this study, we identified 247 BnbZIP genes in the rapeseed genome, which we classified into 10 subfamilies based on phylogenetic analysis of their deduced protein sequences. The BnbZIP genes were grouped into functional clades with Arabidopsis genes with similar putative functions, indicating functional conservation. Genome mapping analysis revealed that the BnbZIPs are distributed unevenly across all 19 chromosomes, and that some of these genes arose through whole-genome duplication and dispersed duplication events. All expression profiles of 247 bZIP genes were extracted from RNA-sequencing data obtained from 17 different B. napus ZS11 tissues with 42 various developmental stages. These genes exhibited different expression patterns in various tissues, revealing that these genes are differentially regulated. Our results provide a valuable foundation for functional dissection of the different BnbZIP homologs in B. napus and its parental lines and for molecular breeding studies of bZIP genes in B. napus.
Zhou, Yan; Xu, Daixiang; Jia, Ledong; Huang, Xiaohu; Ma, Guoqiang; Wang, Shuxian; Zhu, Meichen; Zhang, Aoxiang; Guan, Mingwei; Xu, Xinfu; Wang, Rui; Li, Jiana
2017-01-01
The basic region/leucine zipper motif (bZIP) transcription factor family is one of the largest families of transcriptional regulators in plants. bZIP genes have been systematically characterized in some plants, but not in rapeseed (Brassica napus). In this study, we identified 247 BnbZIP genes in the rapeseed genome, which we classified into 10 subfamilies based on phylogenetic analysis of their deduced protein sequences. The BnbZIP genes were grouped into functional clades with Arabidopsis genes with similar putative functions, indicating functional conservation. Genome mapping analysis revealed that the BnbZIPs are distributed unevenly across all 19 chromosomes, and that some of these genes arose through whole-genome duplication and dispersed duplication events. All expression profiles of 247 bZIP genes were extracted from RNA-sequencing data obtained from 17 different B. napus ZS11 tissues with 42 various developmental stages. These genes exhibited different expression patterns in various tissues, revealing that these genes are differentially regulated. Our results provide a valuable foundation for functional dissection of the different BnbZIP homologs in B. napus and its parental lines and for molecular breeding studies of bZIP genes in B. napus. PMID:29064393
Porcelli, Damiano; Barsanti, Paolo; Pesole, Graziano; Caggese, Corrado
2007-01-01
Background When orthologous sequences from species distributed throughout an optimal range of divergence times are available, comparative genomics is a powerful tool to address problems such as the identification of the forces that shape gene structure during evolution, although the functional constraints involved may vary in different genes and lineages. Results We identified and annotated in the MitoComp2 dataset the orthologs of 68 nuclear genes controlling oxidative phosphorylation in 11 Drosophilidae species and in five non-Drosophilidae insects, and compared them with each other and with their counterparts in three vertebrates (Fugu rubripes, Danio rerio and Homo sapiens) and in the cnidarian Nematostella vectensis, taking into account conservation of gene structure and regulatory motifs, and preservation of gene paralogs in the genome. Comparative analysis indicates that the ancestral insect OXPHOS genes were intron rich and that extensive intron loss and lineage-specific intron gain occurred during evolution. Comparison with vertebrates and cnidarians also shows that many OXPHOS gene introns predate the cnidarian/Bilateria evolutionary split. The nuclear respiratory gene element (NRG) has played a key role in the evolution of the insect OXPHOS genes; it is consistently conserved in the OXPHOS orthologs of all the insect species examined, while their duplicates either completely lack the element or possess only relics of the motif. Conclusion Our observations reinforce the notion that the common ancestor of most animal phyla had intron-rich genes, and suggest that changes in the pattern of expression of a gene facilitate the fixation of duplications in the genome and the development of novel genetic functions. PMID:18315839
NASA Astrophysics Data System (ADS)
Agung, Muhammad Budi; Budiarsa, I. Made; Suwastika, I. Nengah
2017-02-01
Cocoa beans are one of Indonesia's main commodities for the world market, but production still suffers from yield degradation due to pathogen and disease attack. Developing robust cacao plants that are genetically resistant to pathogen and disease attack is an ideal solution to this problem. The aim of this study was to identify, through in silico analysis, Theobroma cacao genes in the cacao genome database that are homologous to genes responding to pathogen and disease attack in other plants. A basic information survey and gene identification were performed in GenBank and The Arabidopsis Information Resource database. The in silico analysis comprised protein BLAST, a homology test of each gene's protein candidates, and identification of homologous genes in the Cacao Genome Database using the data source "Theobroma cacao cv. Matina 1-6 v1.1" genome. The identification found that the Thecc1EG011959t1 (EDS1), Thecc1EG006803t1 (EDS5), Thecc1EG013842t1 (ICS1), and Thecc1EG015614t1 (BG_PPAP) genes of the Cacao Genome Database are Theobroma cacao genes homologous to plant resistance genes and are highly likely to have functions similar to those of their respective homologues.
Genome-Wide Identification and Expression Analysis of the WRKY Gene Family in Cassava
Wei, Yunxie; Shi, Haitao; Xia, Zhiqiang; Tie, Weiwei; Ding, Zehong; Yan, Yan; Wang, Wenquan; Hu, Wei; Li, Kaimian
2016-01-01
The WRKY family, a large family of transcription factors (TFs) found in higher plants, plays central roles in many aspects of physiological processes and adaption to environment. However, little information is available regarding the WRKY family in cassava (Manihot esculenta). In the present study, 85 WRKY genes were identified from the cassava genome and classified into three groups according to conserved WRKY domains and zinc-finger structure. Conserved motif analysis showed that all of the identified MeWRKYs had the conserved WRKY domain. Gene structure analysis suggested that the number of introns in MeWRKY genes varied from 1 to 5, with the majority of MeWRKY genes containing three exons. Expression profiles of MeWRKY genes in different tissues and in response to drought stress were analyzed using the RNA-seq technique. The results showed that 72 MeWRKY genes had differential expression in their transcript abundance and 78 MeWRKY genes were differentially expressed in response to drought stresses in different accessions, indicating their contribution to plant developmental processes and drought stress resistance in cassava. Finally, the expression of 9 WRKY genes was analyzed by qRT-PCR under osmotic, salt, ABA, H2O2, and cold treatments, indicating that MeWRKYs may be involved in different signaling pathways. Taken together, this systematic analysis identifies some tissue-specific and abiotic stress-responsive candidate MeWRKY genes for further functional assays in planta, and provides a solid foundation for understanding of abiotic stress responses and signal transduction mediated by WRKYs in cassava. PMID:26904033
CNL Disease Resistance Genes in Soybean and Their Evolutionary Divergence
Nepal, Madhav P; Benson, Benjamin V
2015-01-01
Disease resistance genes (R-genes) encode proteins involved in detecting pathogen attack and activating downstream defense molecules. Recent availability of soybean genome sequences makes it possible to examine the diversity of gene families including disease-resistance genes. The objectives of this study were to identify coiled-coil NBS-LRR (= CNL) R-genes in soybean, infer their evolutionary relationships, and assess structural as well as functional divergence of the R-genes. Profile hidden Markov models were used for sequence identification and model-based maximum likelihood was used for phylogenetic analysis, and variation in chromosomal positioning, gene clustering, and functional divergence were assessed. We identified 188 soybean CNL genes nested into four clades consistent with their orthologs in Arabidopsis. Gene clustering analysis revealed the presence of 41 gene clusters located on 13 different chromosomes. Analyses of the Ks-values and chromosomal positioning suggest duplication events occurring at varying timescales, and an extrapericentromeric positioning may have facilitated their rapid evolution. Each of the four CNL clades exhibited distinct patterns of gene expression. Phylogenetic analysis further supported the extrapericentromeric positioning effect on the divergence and retention of the CNL genes. The results are important for understanding the diversity and divergence of CNL genes in soybean, which has implications for future soybean crop improvement. PMID:25922568
Ren, Lipin; Chen, Wei; Shang, Yanjie; Meng, Fanming; Zha, Lagabaiyila; Wang, Yong; Guo, Yadong
2018-05-17
Muscid flies (Diptera: Muscidae) are of great forensic importance due to their wide distribution and their ubiquitous and synanthropic nature. They are frequently neglected because they tend to arrive at corpses later than flesh flies and blow flies. Moreover, the lack of species-level identification hinders their use in medicolegal investigations. To overcome the difficulty of morphological identification, molecular methods have gained relevance, and the cytochrome c oxidase subunit I (COI) gene has been widely utilized. Nonetheless, to achieve correct identification of an unknown sample, it is important to survey muscid taxa across their geographic distribution range. Accordingly, the aim of this study is to contribute more geographically specific data. We sequenced the COI gene of 51 muscid specimens of 12 species, and added all correct sequences available in GenBank to yield a total data set of 125 COI sequences from 33 muscid species to evaluate the COI gene as a molecular diagnostic tool. The interspecific distances were extremely high (4.7-19.8%) in both the standard barcoding fragment (658 bp) and the long COI sequence (1,019-1,535 bp), demonstrating that these two genetic markers performed nearly identically in species identification. However, the intraspecific distances of the long COI sequences were significantly higher than those of the barcoding region for conspecific specimens from widely separated geographical locations. Therefore, the genetic diversity presented in this study provides a reference for species identification of muscid flies. Nevertheless, further investigation and data from more muscid species are required to enhance the efficacy of species-level identification using the COI gene as a genetic marker.
Babu, Peram Ravindra; Rao, Khareedu Venkateswara; Reddy, Vudem Dashavantha
2013-01-15
Flax CYPome analysis resulted in the identification of 334 putative cytochrome P450 (CYP450) genes in the cultivated flax genome. Classification of flax CYP450 genes based on the sequence similarity with Arabidopsis orthologs and CYP450 nomenclature, revealed 10 clans representing 44 families and 98 subfamilies. CYP80, CYP83, CYP92, CYP702, CYP705, CYP708, CYP728, CYP729, CYP733 and CYP736 families are absent in the flax genome. The subfamily members exhibited conserved sequences, length of exons and phasing of introns. Similarity search of the genomic resources of wild flax species Linum bienne with CYP450 coding sequences of the cultivated flax, revealed the presence of 127 CYP450 gene orthologs, indicating amplification of novel CYP450 genes in the cultivated flax. Seven families CYP73, 74, 75, 76, 77, 84 and 709, coding for enzymes associated with phenylpropanoid/fatty acid metabolism, showed extensive gene amplification in the flax. About 59% of the flax CYP450 genes were present in the EST libraries. Copyright © 2012 Elsevier B.V. All rights reserved.
Goh, Swee Han; Driedger, David; Gillett, Sandra; Low, Donald E.; Hemmingsen, Sean M.; Amos, Mayben; Chan, David; Lovgren, Marguerite; Willey, Barbara M.; Shaw, Carol; Smith, John A.
1998-01-01
It was recently reported that Streptococcus iniae, a bacterial pathogen of aquatic animals, can cause serious disease in humans. Using the chaperonin 60 (Cpn60) gene identification method with reverse checkerboard hybridization and chemiluminescent detection, we identified correctly each of 12 S. iniae samples among 34 aerobic gram-positive isolates from animal and clinical human sources. PMID:9650992
Lesnyak, Dmitry V.; Osipiuk, Jerzy; Skarina, Tatiana; Sergiev, Petr V.; Bogdanov, Alexey A.; Edwards, Aled; Savchenko, Alexei; Joachimiak, Andrzej; Dontsova, Olga A.
2010-01-01
N2-Methylguanine 966 is located in the loop of Escherichia coli 16 S rRNA helix 31, forming a part of the P-site tRNA-binding pocket. We found yhhF to be a gene encoding the m2G966-specific 16 S rRNA methyltransferase. Disruption of the yhhF gene by a kanamycin resistance marker leads to a loss of modification at G966. The modification could be rescued by expression of recombinant protein from a plasmid carrying the yhhF gene. Moreover, purified m2G966 methyltransferase, in the presence of S-adenosylmethionine (AdoMet), is able to methylate in vitro 30 S ribosomal subunits that were purified from the yhhF knock-out strain. The methylation is specific for the G966 base of the 16 S rRNA. The m2G966 methyltransferase was crystallized, and its structure has been determined and refined to 2.05 Å. The structure closely resembles RsmC rRNA methyltransferase, specific for m2G1207 of the 16 S rRNA. Structural comparisons and analysis of the enzyme active site suggest modes for binding AdoMet and rRNA to m2G966 methyltransferase. Based on the experimental data and current nomenclature, the protein expressed from the yhhF gene was renamed RsmD. A model for interaction of RsmD with the ribosome has been proposed. PMID:17189261
Lesnyak, Dmitry V; Osipiuk, Jerzy; Skarina, Tatiana; Sergiev, Petr V; Bogdanov, Alexey A; Edwards, Aled; Savchenko, Alexei; Joachimiak, Andrzej; Dontsova, Olga A
2007-02-23
N(2)-Methylguanine 966 is located in the loop of Escherichia coli 16 S rRNA helix 31, forming a part of the P-site tRNA-binding pocket. We found yhhF to be a gene encoding the m(2)G966-specific 16 S rRNA methyltransferase. Disruption of the yhhF gene by a kanamycin resistance marker leads to a loss of modification at G966. The modification could be rescued by expression of recombinant protein from a plasmid carrying the yhhF gene. Moreover, purified m(2)G966 methyltransferase, in the presence of S-adenosylmethionine (AdoMet), is able to methylate in vitro 30 S ribosomal subunits that were purified from the yhhF knock-out strain. The methylation is specific for the G966 base of the 16 S rRNA. The m(2)G966 methyltransferase was crystallized, and its structure has been determined and refined to 2.05 Å. The structure closely resembles RsmC rRNA methyltransferase, specific for m(2)G1207 of the 16 S rRNA. Structural comparisons and analysis of the enzyme active site suggest modes for binding AdoMet and rRNA to m(2)G966 methyltransferase. Based on the experimental data and current nomenclature, the protein expressed from the yhhF gene was renamed RsmD. A model for interaction of RsmD with the ribosome has been proposed.
Kakeshpour, Tayebeh; Nayebi, Shadi; Rashidi Monfared, Sajad; Moieni, Ahmad; Karimzadeh, Ghasem
2015-10-01
Papaver somniferum L. is a herbaceous, annual, diploid plant that is important from a pharmacological and strategic point of view. The cDNA clones of two putative MYB and WRKY genes were isolated from this plant (GenBank accession numbers KP411870 and KP203854, respectively) via the nested-PCR method and characterized. The MYB transcription factor (TF) comprises 342 amino acids and exhibits the structural features of the R2R3MYB protein family. The WRKY TF, a 326 amino acid-long polypeptide, falls structurally into group II of the WRKY protein family. Quantitative real-time PCR (qRT-PCR) analyses indicate the presence of these TFs in all organs of P. somniferum L. and Papaver bracteatum L. The highest expression levels of these two TFs were observed in the leaf tissues of P. somniferum L., while in P. bracteatum L. the expression levels were highest in the root tissues. Promoter analysis of the 10 co-expressed, clustered genes involved in the noscapine biosynthesis pathway in P. somniferum L. suggested that these 10 genes are not only co-expressed but also share common regulatory motifs and TFs, including MYB and WRKY TFs, which may explain their common regulation.
Positional cloning of a gene responsible for the cts mutation of the silkworm, Bombyx mori.
Ito, Katsuhiko; Kidokoro, Kurako; Katsuma, Susumu; Shimada, Toru; Yamamoto, Kimiko; Mita, Kazuei; Kadono-Okuda, Keiko
2012-07-01
The larval head cuticle and anal plates of the silkworm mutant cheek and tail spot (cts) have chocolate-colored spots, unlike the entirely white appearance of the wild-type (WT) strain. We report the identification and characterization of the gene responsible for the cts mutation. Positional cloning revealed a cts candidate on chromosome 16, designated BmMFS, based on the high similarity of the deduced amino acid sequence between the candidate gene from the WT strain and the major facilitator superfamily (MFS) protein. BmMFS likely encodes a membrane protein with 11 putative transmembrane domains, while the putative structure deduced from the cts-type allele possesses only 10-pass transmembrane domains owing to a deletion in its coding region. Quantitative RT-PCR analysis showed that BmMFS mRNA was strongly expressed in the integument of the head and tail, where the cts phenotype is observed; expression markedly increased at the molting and newly ecdysed stages. These results indicate that the novel BmMFS gene is cts and the membrane structure of its protein accounts for the cts phenotype. These expression profiles and the cts phenotype are quite similar to those of melanin-related genes, such as Bmyellow-e and Bm-iAANAT, suggesting that BmMFS is involved in the melanin synthesis pathway.
Mapping the polysaccharide degradation potential of Aspergillus niger
2012-01-01
Background The degradation of plant materials by enzymes is an industry of increasing importance. For sustainable production of second generation biofuels and other products of industrial biotechnology, efficient degradation of non-edible plant polysaccharides such as hemicellulose is required. For each type of hemicellulose, a complex mixture of enzymes is required for complete conversion to fermentable monosaccharides. In plant-biomass degrading fungi, these enzymes are regulated and released by complex regulatory structures. In this study, we present a methodology for evaluating the potential of a given fungus for polysaccharide degradation. Results Through the compilation of information from 203 articles, we have systematized knowledge on the structure and degradation of 16 major types of plant polysaccharides to form a graphical overview. As a case example, we have combined this with a list of 188 genes coding for carbohydrate-active enzymes from Aspergillus niger, thus forming an analysis framework, which can be queried. Combination of this information network with gene expression analysis on mono- and polysaccharide substrates has allowed elucidation of concerted gene expression from this organism. One such example is the identification of a full set of extracellular polysaccharide-acting genes for the degradation of oat spelt xylan. Conclusions The mapping of plant polysaccharide structures along with the corresponding enzymatic activities is a powerful framework for expression analysis of carbohydrate-active enzymes. Applying this network-based approach, we provide the first genome-scale characterization of all genes coding for carbohydrate-active enzymes identified in A. niger. PMID:22799883
Mapping the polysaccharide degradation potential of Aspergillus niger.
Andersen, Mikael R; Giese, Malene; de Vries, Ronald P; Nielsen, Jens
2012-07-16
The degradation of plant materials by enzymes is an industry of increasing importance. For sustainable production of second generation biofuels and other products of industrial biotechnology, efficient degradation of non-edible plant polysaccharides such as hemicellulose is required. For each type of hemicellulose, a complex mixture of enzymes is required for complete conversion to fermentable monosaccharides. In plant-biomass degrading fungi, these enzymes are regulated and released by complex regulatory structures. In this study, we present a methodology for evaluating the potential of a given fungus for polysaccharide degradation. Through the compilation of information from 203 articles, we have systematized knowledge on the structure and degradation of 16 major types of plant polysaccharides to form a graphical overview. As a case example, we have combined this with a list of 188 genes coding for carbohydrate-active enzymes from Aspergillus niger, thus forming an analysis framework, which can be queried. Combination of this information network with gene expression analysis on mono- and polysaccharide substrates has allowed elucidation of concerted gene expression from this organism. One such example is the identification of a full set of extracellular polysaccharide-acting genes for the degradation of oat spelt xylan. The mapping of plant polysaccharide structures along with the corresponding enzymatic activities is a powerful framework for expression analysis of carbohydrate-active enzymes. Applying this network-based approach, we provide the first genome-scale characterization of all genes coding for carbohydrate-active enzymes identified in A. niger.
Cell type-selective disease-association of genes under high regulatory load
Galhardo, Mafalda; Berninger, Philipp; Nguyen, Thanh-Phuong; Sauter, Thomas; Sinkkonen, Lasse
2015-01-01
We previously showed that disease-linked metabolic genes are often under combinatorial regulation. Using the genome-wide ChIP-Seq binding profiles for 93 transcription factors in nine different cell lines, we show that genes under high regulatory load are significantly enriched for disease-association across cell types. We find that transcription factor load correlates with the enhancer load of the genes and thereby allows the identification of genes under high regulatory load by epigenomic mapping of active enhancers. Identification of the high enhancer load genes across 139 samples from 96 different cell and tissue types reveals a consistent enrichment for disease-associated genes in a cell type-selective manner. The underlying genes are not limited to super-enhancer genes and show several types of disease-association evidence beyond genetic variation (such as biomarkers). Interestingly, the high regulatory load genes are involved in more KEGG pathways than expected by chance, exhibit increased betweenness centrality in the interaction network of liver disease genes, and carry longer 3′ UTRs with more microRNA (miRNA) binding sites than genes on average, suggesting a role as hubs integrating signals within regulatory networks. In summary, epigenetic mapping of active enhancers presents a promising and unbiased approach for identification of novel disease genes in a cell type-selective manner. PMID:26338775
Chen, Frank; Spano, Anthony; Goodman, Benjamin E.; Blasier, Kiev R.; Sabat, Agnes; Jeffery, Erin; Norris, Andrew; Shabanowitz, Jeffrey; Hunt, Donald F.; Lebedev, Nikolai
2010-01-01
The gene transfer agent of Rhodobacter capsulatus (GTA) is a unique phage-like particle that exchanges genetic information between members of this same species of bacterium. Besides being an excellent tool for genetic mapping, the GTA has a number of advantages for biotechnological and nanoengineering purposes. To facilitate the GTA purification and identify the proteins involved in GTA expression, assembly and regulation, in the present work we construct and transform into R. capsulatus Y262 a gene coding for a C-terminally His-tagged capsid protein. The constructed protein was expressed in the cells, assembled into chimeric GTA particles inside the cells and excreted from the cells into surrounding medium. Transmission electron micrographs of phosphotungstate-stained, NiNTA-purified chimeric GTA confirm that its structure is similar to normal GTA particles, with many particles composed both of a head and a tail. The mass spectrometric proteomic analysis of polypeptides present in the GTA recovered outside the cells shows that GTA is composed of at least 9 proteins represented in the GTA gene cluster including proteins coded for by Orf’s 3, 5, 6–9, 11, 13, and 15. PMID:19105630
Chen, Frank; Spano, Anthony; Goodman, Benjamin E; Blasier, Kiev R; Sabat, Agnes; Jeffery, Erin; Norris, Andrew; Shabanowitz, Jeffrey; Hunt, Donald F; Lebedev, Nikolai
2009-02-01
The gene transfer agent of Rhodobacter capsulatus (GTA) is a unique phage-like particle that exchanges genetic information between members of this same species of bacterium. Besides being an excellent tool for genetic mapping, the GTA has a number of advantages for biotechnological and nanoengineering purposes. To facilitate the GTA purification and identify the proteins involved in GTA expression, assembly and regulation, in the present work we construct and transform into R. capsulatus Y262 a gene coding for a C-terminally His-tagged capsid protein. The constructed protein was expressed in the cells, assembled into chimeric GTA particles inside the cells and excreted from the cells into surrounding medium. Transmission electron micrographs of phosphotungstate-stained, NiNTA-purified chimeric GTA confirm that its structure is similar to normal GTA particles, with many particles composed both of a head and a tail. The mass spectrometric proteomic analysis of polypeptides present in the GTA recovered outside the cells shows that GTA is composed of at least 9 proteins represented in the GTA gene cluster including proteins coded for by Orf's 3, 5, 6-9, 11, 13, and 15.
The standard operating procedure of the DOE-JGI Microbial Genome Annotation Pipeline (MGAP v.4).
Huntemann, Marcel; Ivanova, Natalia N; Mavromatis, Konstantinos; Tripp, H James; Paez-Espino, David; Palaniappan, Krishnaveni; Szeto, Ernest; Pillay, Manoj; Chen, I-Min A; Pati, Amrita; Nielsen, Torben; Markowitz, Victor M; Kyrpides, Nikos C
2015-01-01
The DOE-JGI Microbial Genome Annotation Pipeline performs structural and functional annotation of microbial genomes that are further included into the Integrated Microbial Genome comparative analysis system. MGAP is applied to assembled nucleotide sequence datasets that are provided via the IMG submission site. Dataset submission for annotation first requires project and associated metadata description in GOLD. The MGAP sequence data processing consists of feature prediction including identification of protein-coding genes, non-coding RNAs and regulatory RNA features, as well as CRISPR elements. Structural annotation is followed by assignment of protein product names and functions.
Uhlik, Ondrej; Strejcek, Michal; Junkova, Petra; Sanda, Miloslav; Hroudova, Miluse; Vlcek, Cestmir; Mackova, Martina; Macek, Tomas
2011-01-01
Bacteria that are able to utilize biphenyl as a sole source of carbon were extracted and isolated from polychlorinated biphenyl (PCB)-contaminated soil vegetated by horseradish. Isolates were identified using matrix-assisted laser desorption ionization-time of flight mass spectrometry (MALDI-TOF MS). The usage of MALDI Biotyper for the classification of isolates was evaluated and compared to 16S rRNA gene sequence analysis. A wide spectrum of bacteria was isolated, with Arthrobacter, Serratia, Rhodococcus, and Rhizobium being predominant. Arthrobacter isolates also represented the most diverse group. The use of MALDI Biotyper in many cases permitted the identification at the level of species, which was not achieved by 16S rRNA gene sequence analyses. However, some isolates had to be identified by 16S rRNA gene analyses if MALDI Biotyper-based identification was at the level of probable or not reliable identification, usually due to a lack of reference spectra included in the database. Overall, this study shows the possibility of using MALDI-TOF MS and MALDI Biotyper for the fast and relatively nonlaborious identification/classification of soil isolates. At the same time, it demonstrates the dominant role of employing 16S rRNA gene analyses for the identification of recently isolated strains that can later fill the gaps in the protein-based identification databases. PMID:21821747
DOE Office of Scientific and Technical Information (OSTI.GOV)
Duncan, Katherine R.; Crüsemann, Max; Lechner, Anna; ...
2015-04-09
Genome sequencing has revealed that bacteria contain many more biosynthetic gene clusters than predicted based on the number of secondary metabolites discovered to date. While this biosynthetic reservoir has fostered interest in new tools for natural product discovery, there remains a gap between gene cluster detection and compound discovery. In this paper, we apply molecular networking and the new concept of pattern-based genome mining to 35 Salinispora strains, including 30 for which draft genome sequences were either available or obtained for this study. The results provide a method to simultaneously compare large numbers of complex microbial extracts, which facilitated the identification of media components, known compounds and their derivatives, and new compounds that could be prioritized for structure elucidation. Finally, these efforts revealed considerable metabolite diversity and led to several molecular family-gene cluster pairings, of which the quinomycin-type depsipeptide retimycin A was characterized and linked to gene cluster NRPS40 using pattern-based bioinformatic approaches.
The Reconstruction and Analysis of Gene Regulatory Networks.
Zheng, Guangyong; Huang, Tao
2018-01-01
In the post-genomic era, an important task is to explore the function of individual biological molecules (i.e., genes, noncoding RNAs, proteins, metabolites) and their organization in living cells. To this end, gene regulatory networks (GRNs) are constructed to show the relationships between biological molecules: the vertices of the network denote biological molecules and the edges represent connections between them (Strogatz, Nature 410:268-276, 2001; Bray, Science 301:1864-1865, 2003). By interpreting GRNs, biologists can understand not only the function of biological molecules but also the organization of the components of living cells, since a gene regulatory network is a comprehensive physiological map of living cells and reflects the influence of genetic and epigenetic factors (Strogatz, Nature 410:268-276, 2001; Bray, Science 301:1864-1865, 2003). In this paper, we review inference methods for GRN reconstruction and approaches for analyzing network structure. Because the network method is a powerful tool for studying complex diseases and biological processes, its applications in pathway analysis and disease gene identification are also introduced.
Identification of the gene for disaggregatase from Methanosarcina mazei.
Osumi, Naoki; Kakehashi, Yoshihiro; Matsumoto, Shiho; Nagaoka, Kazunari; Sakai, Junichi; Miyashita, Kiyotaka; Kimura, Makoto; Asakawa, Susumu
2008-12-01
The gene sequences encoding disaggregatase (Dag), the enzyme responsible for dispersion of cell aggregates of Methanosarcina mazei to single cells, were determined for three strains of M. mazei (S-6(T), LYC and TMA). The dag genes of the three strains were 3234 bp in length and had almost the same sequences with 97% amino acid sequence identities. Dag was predicted to comprise 1077 amino acid residues and to have a molecular mass of 120 kDa containing three repeats of the DNRLRE domain in the C terminus, which is specific to the genus Methanosarcina and may be responsible for structural organization and cell wall function. Recombinant Dag was overexpressed in Escherichia coli and preparations of the expressed protein exhibited enzymatic activity. The RT-PCR analysis showed that dag was transcribed to mRNA in M. mazei LYC and indicated that the gene was expressed in vivo. This is the first time the gene involved in the morphological change of Methanosarcina spp. from aggregate to single cells has been identified.
Ilaslan, Erkut; Calvel, Pierre; Nowak, Dominika; Szarras-Czapnik, Maria; Slowikowska-Hilczer, Jolanta; Spik, Anna; Sararols, Pauline; Nef, Serge; Jaruzelska, Jadwiga; Kusz-Zamelczyk, Kamila
2018-06-08
Identification of novel genes involved in sexual development is crucial for understanding disorders of sex development (DSD). Here, we propose a member of the START domain family, the X chromosome STARD8, as a DSD candidate gene. We have identified a missense mutation of this gene in 2 sisters with 46,XY gonadal dysgenesis, inherited from their heterozygous mother. Gonadal tissue of one of the sisters contained Leydig cells overloaded with cholesterol droplets, i.e., structures previously identified in 46,XY DSD patients carrying mutations in the STAR gene encoding another START domain family member, which is crucial for steroidogenesis. Based on the phenotypes of our patients, we propose a dual role of STARD8 in sexual development, namely in testes determination and testosterone synthesis. However, further studies are needed to confirm the involvement of STARD8 in sexual development. © 2018 S. Karger AG, Basel.
Duncan, Katherine R.; Crüsemann, Max; Lechner, Anna; Sarkar, Anindita; Li, Jie; Ziemert, Nadine; Wang, Mingxun; Bandeira, Nuno; Moore, Bradley S.; Dorrestein, Pieter C.; Jensen, Paul R.
2015-01-01
Summary Genome sequencing has revealed that bacteria contain many more biosynthetic gene clusters than predicted based on the number of secondary metabolites discovered to date. While this biosynthetic reservoir has fostered interest in new tools for natural product discovery, there remains a gap between gene cluster detection and compound discovery. Here we apply molecular networking and the new concept of pattern-based genome mining to 35 Salinispora strains including 30 for which draft genome sequences were either available or obtained for this study. The results provide a method to simultaneously compare large numbers of complex microbial extracts, which facilitated the identification of media components, known compounds and their derivatives, and new compounds that could be prioritized for structure elucidation. These efforts revealed considerable metabolite diversity and led to several molecular family-gene cluster pairings, of which the quinomycin-type depsipeptide retimycin A was characterized and linked to gene cluster NRPS40 using pattern-based bioinformatic approaches. PMID:25865308
Gene Expression Dynamics Inspector (GEDI): for integrative analysis of expression profiles
NASA Technical Reports Server (NTRS)
Eichler, Gabriel S.; Huang, Sui; Ingber, Donald E.
2003-01-01
Genome-wide expression profiles contain global patterns that evade visual detection in current gene clustering analysis. Here, a Gene Expression Dynamics Inspector (GEDI) is described that uses self-organizing maps to translate high-dimensional expression profiles of time courses or sample classes into animated, coherent and robust mosaic images. GEDI facilitates identification of interesting patterns of molecular activity simultaneously across gene, time and sample space without prior assumption of any structure in the data, and then permits the user to retrieve genes of interest. Important changes in genome-wide activities may be quickly identified based on 'Gestalt' recognition; hence, GEDI may be especially useful for non-specialist end users, such as physicians. AVAILABILITY: GEDI v1.0 is written in Matlab, and binary Matlab .dll files, which require Matlab to run, can be downloaded for free by academic institutions at http://www.chip.org/ge/gedihome.html Supplementary information: http://www.chip.org/ge/gedihome.html.
Iverson, Eric A.; Goodman, David A.; Gorchels, Madeline E.
2017-01-01
ABSTRACT Viruses infecting the Archaea harbor a tremendous amount of genetic diversity. This is especially true for the spindle-shaped viruses of the family Fuselloviridae, where >90% of the viral genes do not have detectable homologs in public databases. This significantly limits our ability to elucidate the role of viral proteins in the infection cycle. To address this, we have developed genetic techniques to study the well-characterized fusellovirus Sulfolobus spindle-shaped virus 1 (SSV1), which infects Sulfolobus solfataricus in volcanic hot springs at 80°C and pH 3. Here, we present a new comparative genome analysis and a thorough genetic analysis of SSV1 using both specific and random mutagenesis and thereby generate mutations in all open reading frames. We demonstrate that almost half of the SSV1 genes are not essential for infectivity, and the requirement for a particular gene correlates well with its degree of conservation within the Fuselloviridae. The major capsid gene vp1 is essential for SSV1 infectivity. However, the universally conserved minor capsid gene vp3 could be deleted without a loss in infectivity and results in virions with abnormal morphology. IMPORTANCE Most of the putative genes in the spindle-shaped archaeal hyperthermophile fuselloviruses have no sequences that are clearly similar to characterized genes. In order to determine which of these SSV genes are important for function, we disrupted all of the putative genes in the prototypical fusellovirus, SSV1. Surprisingly, about half of the genes could be disrupted without destroying virus function. Even deletions of one of the known structural protein genes that is present in all known fuselloviruses, vp3, allows the production of infectious viruses. However, viruses lacking vp3 have abnormal shapes, indicating that the vp3 gene is important for virus structure. 
Identification of essential genes will allow focused research on minimal SSV genomes and further understanding of the structure of these unique, ubiquitous, and extremely stable archaeal viruses. PMID:28148789
Iverson, Eric A; Goodman, David A; Gorchels, Madeline E; Stedman, Kenneth M
2017-05-15
Viruses infecting the Archaea harbor a tremendous amount of genetic diversity. This is especially true for the spindle-shaped viruses of the family Fuselloviridae, where >90% of the viral genes do not have detectable homologs in public databases. This significantly limits our ability to elucidate the role of viral proteins in the infection cycle. To address this, we have developed genetic techniques to study the well-characterized fusellovirus Sulfolobus spindle-shaped virus 1 (SSV1), which infects Sulfolobus solfataricus in volcanic hot springs at 80°C and pH 3. Here, we present a new comparative genome analysis and a thorough genetic analysis of SSV1 using both specific and random mutagenesis and thereby generate mutations in all open reading frames. We demonstrate that almost half of the SSV1 genes are not essential for infectivity, and the requirement for a particular gene correlates well with its degree of conservation within the Fuselloviridae. The major capsid gene vp1 is essential for SSV1 infectivity. However, the universally conserved minor capsid gene vp3 could be deleted without a loss in infectivity and results in virions with abnormal morphology. IMPORTANCE Most of the putative genes in the spindle-shaped archaeal hyperthermophile fuselloviruses have no sequences that are clearly similar to characterized genes. In order to determine which of these SSV genes are important for function, we disrupted all of the putative genes in the prototypical fusellovirus, SSV1. Surprisingly, about half of the genes could be disrupted without destroying virus function. Even deletions of one of the known structural protein genes that is present in all known fuselloviruses, vp3, allows the production of infectious viruses. However, viruses lacking vp3 have abnormal shapes, indicating that the vp3 gene is important for virus structure. 
Identification of essential genes will allow focused research on minimal SSV genomes and further understanding of the structure of these unique, ubiquitous, and extremely stable archaeal viruses. Copyright © 2017 American Society for Microbiology.
Xu, Yuantao; Wu, Guizhi; Hao, Baohai; Chen, Lingling; Deng, Xiuxin; Xu, Qiang
2015-11-23
With the availability of a rapidly increasing number of genome and transcriptome sequences, lineage-specific genes (LSGs) can be identified and characterized. Like other conserved functional genes, LSGs play important roles in biological evolution and functions. Two sets of citrus LSGs, 296 citrus-specific genes (CSGs) and 1039 orphan genes specific to sweet orange, were identified by comparative analysis between the sweet orange genome sequences and 41 genomes and 273 transcriptomes. With the two sets of genes, gene structure and gene expression pattern were investigated. On average, both the CSGs and orphan genes have fewer exons, shorter gene length and higher GC content than evolutionarily conserved genes (ECs). Expression profiling indicated that most of the LSGs were expressed in various tissues of sweet orange and some of them exhibited distinct temporal and spatial expression patterns. In particular, the orphan genes were preferentially expressed in callus, which is an important pluripotent tissue of citrus. In addition, some of the CSGs and orphan genes were expressed in response to abiotic stress, indicating their potential functions during interaction with the environment. This study identified and characterized two sets of LSGs in citrus, dissected their sequence features and expression patterns, and provided valuable clues for future functional analysis of the LSGs in sweet orange.
Kabat, Susan M; Dick, Christopher W; Hunter, Mark D
2010-05-01
Microsatellite primers were developed for the common milkweed, Asclepias syriaca L., to assist in genet identification and the analysis of spatial genetic structure. Using an enrichment cloning protocol, eight microsatellite loci were isolated and characterized in a Michigan population of A. syriaca. The primers amplified di- and trinucleotide repeats with 4-13 alleles per locus. The primers will be useful for studies of clonality and gene flow in natural populations.
Xie, Xin-Ping; Xie, Yu-Feng; Wang, Hong-Qiang
2017-08-23
Large-scale accumulation of omics data poses a pressing challenge of integrative analysis of multiple data sets in bioinformatics. An open question of such integrative analysis is how to pinpoint consistent but subtle gene activity patterns across studies. Study heterogeneity needs to be addressed carefully for this goal. This paper proposes a regulation probability model-based meta-analysis, jGRP, for identifying differentially expressed genes (DEGs). The method integrates multiple transcriptomics data sets in a gene regulatory space instead of in a gene expression space, which makes it easy to capture and manage data heterogeneity across studies from different laboratories or platforms. Specifically, we transform gene expression profiles into a united gene regulation profile across studies by mathematically defining two gene regulation events between two conditions and estimating their occurring probabilities in a sample. Finally, a novel differential expression statistic is established based on the gene regulation profiles, realizing accurate and flexible identification of DEGs in gene regulation space. We evaluated the proposed method on simulation data and real-world cancer datasets and showed the effectiveness and efficiency of jGRP in identifying DEGs in the context of meta-analysis. Data heterogeneity largely influences the performance of meta-analysis for DEG identification. Existing meta-analysis methods were revealed to exhibit very different degrees of sensitivity to study heterogeneity. The proposed method, jGRP, can be a standalone tool due to its united framework and controllable way to deal with study heterogeneity.
New insights on hereditary erythrocyte membrane defects.
Andolfo, Immacolata; Russo, Roberta; Gambale, Antonella; Iolascon, Achille
2016-11-01
After the first proposed model of the red blood cell membrane skeleton 36 years ago, several additional proteins have been discovered during the intervening years, and their relationship with the pathogenesis of the related disorders has been somewhat defined. The knowledge of erythrocyte membrane structure is important because it represents the model for spectrin-based membrane skeletons in all cells and because defects in its structure underlie multiple hemolytic anemias. This review summarizes the main features of erythrocyte membrane disorders, dividing them into structural and altered permeability defects, focusing particularly on the most recent advances. New proteins involved in alterations of the red blood cell membrane permeability were recently described. The mechanoreceptor PIEZO1 is the largest ion channel identified to date and the fundamental regulator of erythrocyte volume homeostasis. Missense, gain-of-function mutations in the PIEZO1 gene have been identified in several families as causative of dehydrated hereditary stomatocytosis or xerocytosis. Similarly, the KCNN4 gene, encoding the so-called Gardos channel, has been recently identified as a second causative gene of hereditary xerocytosis. Finally, ABCB6 missense mutations were identified in different pedigrees of familial pseudohyperkalemia. New genomic technologies have improved the quality and reduced the time of diagnosis of these diseases. Moreover, they are essential for the identification of the new causative genes. However, many questions remain to be solved and are currently the subject of intensive study. Copyright© Ferrata Storti Foundation.
New insights on hereditary erythrocyte membrane defects
Andolfo, Immacolata; Russo, Roberta; Gambale, Antonella; Iolascon, Achille
2016-01-01
After the first proposed model of the red blood cell membrane skeleton 36 years ago, several additional proteins have been discovered during the intervening years, and their relationship with the pathogenesis of the related disorders has been somewhat defined. The knowledge of erythrocyte membrane structure is important because it represents the model for spectrin-based membrane skeletons in all cells and because defects in its structure underlie multiple hemolytic anemias. This review summarizes the main features of erythrocyte membrane disorders, dividing them into structural and altered permeability defects, focusing particularly on the most recent advances. New proteins involved in alterations of the red blood cell membrane permeability were recently described. The mechanoreceptor PIEZO1 is the largest ion channel identified to date and the fundamental regulator of erythrocyte volume homeostasis. Missense, gain-of-function mutations in the PIEZO1 gene have been identified in several families as causative of dehydrated hereditary stomatocytosis or xerocytosis. Similarly, the KCNN4 gene, encoding the so-called Gardos channel, has been recently identified as a second causative gene of hereditary xerocytosis. Finally, ABCB6 missense mutations were identified in different pedigrees of familial pseudohyperkalemia. New genomic technologies have improved the quality and reduced the time of diagnosis of these diseases. Moreover, they are essential for the identification of the new causative genes. However, many questions remain to be solved and are currently the subject of intensive study. PMID:27756835
Structural and Phylogenetic Analysis of Laccases from Trichoderma: A Bioinformatic Approach
Cázares-García, Saila Viridiana; Vázquez-Garcidueñas, Ma. Soledad; Vázquez-Marrufo, Gerardo
2013-01-01
The genus Trichoderma includes species of great biotechnological value, both for their mycoparasitic activities and for their ability to produce extracellular hydrolytic enzymes. Although activity of extracellular laccase has previously been reported in Trichoderma spp., the possible number of isoenzymes is still unknown, as are the structural and functional characteristics of both the genes and the putative proteins. In this study, the system of laccases sensu stricto in the Trichoderma species, the genomes of which are publicly available, were analyzed using bioinformatic tools. The intron/exon structure of the genes and the identification of specific motifs in the sequence of amino acids of the proteins generated in silico allow for clear differentiation between extracellular and intracellular enzymes. Phylogenetic analysis suggests that the common ancestor of the genus possessed a functional gene for each one of these enzymes, which is a characteristic preserved in T. atroviride and T. virens. This analysis also reveals that T. harzianum and T. reesei only retained the intracellular activity, whereas T. asperellum added an extracellular isoenzyme acquired through horizontal gene transfer during the mycoparasitic process. The evolutionary analysis shows that in general, extracellular laccases are subjected to purifying selection, and intracellular laccases show neutral evolution. The data provided by the present study will enable the generation of experimental approximations to better understand the physiological role of laccases in the genus Trichoderma and to increase their biotechnological potential. PMID:23383142
NASA Astrophysics Data System (ADS)
Xue, Zhuang; Li, Hui; Liu, Yang; Zhou, Wei; Sun, Jing; Wang, Xiuli
2017-12-01
As a 'living fossil' of species origin and a 'rich treasure' of food and nutrition development, sea cucumber has received a lot of attention from researchers. The cDNA library construction and EST sequencing of blood had been conducted previously in our lab. The bioinformatic analysis provided a gene fragment which is highly homologous with the genes of the lectin family, named AjL (Apostichopus japonicus lectin). To characterize and determine the phylogeny of AjL genes in early evolution, we isolated a full-length cDNA of the lectin gene from the body wall of A. japonicus. The open reading frame of this gene contained 489 bp and encoded a 163-amino-acid secretory protein homologous to lectins of mammals and aquatic organisms. The deduced protein included a lectin-like domain. SDS-PAGE analysis showed that AjL migrated as a specific band (about 36.09 kDa under reducing conditions), and agglutinated rabbit red blood cells. AjL was similar to chain A of CEL-IV in spatial structure. We predicted that AjL may play the same role as CEL-IV. Our results suggested that more than one lectin gene functioned in sea cucumber and most other species, which was fused by uncertain sequences during evolution and encoded different proteins with diverse functions. Our findings provided insights into the function and characteristics of lectin genes in invertebrates. The results will also be helpful for the identification and structural, functional, and evolutionary analyses of lectin genes.
Jensen, Anders
2012-01-01
The taxonomic status and structure of Streptococcus dysgalactiae have been the object of much confusion. Bacteria belonging to this species are usually referred to as Lancefield group C or group G streptococci in clinical settings in spite of the fact that these terms lack precision and prevent recognition of the exact clinical relevance of these bacteria. The purpose of this study was to develop an improved basis for delineation and identification of the individual species of the pyogenic group of streptococci in the clinical microbiology laboratory, with a special focus on S. dysgalactiae. We critically reexamined the genetic relationships of the species S. dysgalactiae, Streptococcus pyogenes, Streptococcus canis, and Streptococcus equi, which may share Lancefield group antigens, by phylogenetic reconstruction based on multilocus sequence analysis (MLSA) and 16S rRNA gene sequences and by emm typing combined with phenotypic characterization. Analysis of concatenated sequences of seven genes previously used for examination of viridans streptococci distinguished robust and coherent clusters. S. dysgalactiae consists of two separate clusters consistent with the two recognized subspecies dysgalactiae and equisimilis. Both taxa share alleles with S. pyogenes in several housekeeping genes, which invalidates identification based on single-locus sequencing. S. dysgalactiae, S. canis, and S. pyogenes constitute a closely related branch within the genus Streptococcus indicative of recent descent from a common ancestor, while S. equi is highly divergent from other species of the pyogenic group streptococci. The results provide an improved basis for identification of clinically important pyogenic group streptococci and explain the overlapping spectrum of infections caused by the species associated with humans. PMID:22075580
DOE Office of Scientific and Technical Information (OSTI.GOV)
Galeazzi, Luca; Bocci, Paolo; Amici, Adolfo
2011-09-27
The pyridine nucleotide cycle (PNC) is a network of salvage and recycling routes maintaining homeostasis of the NAD(P) cofactor pool in the cell. Nicotinamide mononucleotide (NMN) deamidase (EC 3.5.1.42), one of the key enzymes of the bacterial PNC, was originally described in Enterobacteria, but the corresponding gene eluded identification for over 30 years. A genomics-based reconstruction of NAD metabolism across hundreds of bacterial species suggested that the NMN deamidase reaction is the only possible way of nicotinamide salvage in the marine bacterium Shewanella oneidensis. This prediction was verified via purification of native NMN deamidase from S. oneidensis followed by the identification of the respective gene, termed pncC. Enzymatic characterization of the PncC protein, as well as phenotype analysis of deletion mutants, confirmed its proposed biochemical and physiological function in S. oneidensis. Of the three PncC homologs present in E. coli, NMN deamidase activity was confirmed only for the recombinant purified product of the ygaD gene. A comparative analysis at the level of sequence and three-dimensional structure, which is available for one of the PncC family members, shows no homology with any previously described amidohydrolases. Multiple alignment analysis of functional and nonfunctional PncC homologs, together with NMN docking experiments, allowed us to tentatively identify the active site area and conserved residues therein. An observed broad phylogenomic distribution of predicted functional PncCs in the bacterial kingdom is consistent with a possible role in detoxification of NMN, resulting from NAD utilization by DNA ligase.
Smith, Desmond J.; Rubin, Edward M.
2000-01-01
A diagnostic test useful for prenatal identification of Down syndrome and mental retardation. A method for gene therapy for correction and treatment of Down syndrome. DYRK gene involved in the ability to learn. A method for diagnosing Down's syndrome and mental retardation and an assay therefor. A pharmaceutical composition for treatment of Down's syndrome mental retardation.
Targeting Conserved Genes in Penicillium Species.
Peterson, Stephen W
2017-01-01
Polymerase chain reaction amplification of conserved genes and sequence analysis provides a very powerful tool for the identification of toxigenic as well as non-toxigenic Penicillium species. Sequences are obtained by amplification of the gene fragment, sequencing via capillary electrophoresis of dideoxynucleotide-labeled fragments or NGS. The sequences are compared to a database of validated isolates. Identification of species indicates the potential of the fungus to make particular mycotoxins.
Zhou, Changqing; Kandemir, Irfan; Walsh, Douglas B; Zalom, Frank G; Lavine, Laura Corley
2012-01-01
The western tarnished plant bug Lygus hesperus is an economically important pest that belongs to a complex of morphologically similar species that makes identification problematic. The present study provides evidence for the use of DNA barcodes from populations of L. hesperus from the western United States of America for accurate identification. This study reports DNA barcodes for 134 individuals of the western tarnished plant bug from alfalfa and strawberry agricultural fields in the western United States of America. Sequence divergence estimates of <3% reveal that morphologically variable individuals presumed to be L. hesperus were accurately identified. Paired estimates of F(st) and subsequent estimates of gene flow show that geographically distinct populations of L. hesperus are genetically similar. Therefore, our results support and reinforce the relatively recent (<100 years) migration of the western tarnished plant bug into agricultural habitats across the western United States. This study reveals that despite wide host plant usage and phenotypically plastic morphological traits, the commonly recognized western tarnished plant bug belongs to a single species, Lygus hesperus. In addition, no significant genetic structure was found for the geographically diverse populations of western tarnished plant bug used in this study.
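To illustrate the <3% sequence divergence threshold mentioned in the preceding abstract, the following Python sketch computes an uncorrected pairwise distance (p-distance) between two aligned barcode sequences. The abstract does not state which distance model the authors used, so the p-distance, the function name `p_distance`, and the toy sequences are assumptions made purely for illustration.

```python
def p_distance(seq_a: str, seq_b: str) -> float:
    """Uncorrected pairwise distance between two aligned sequences.

    Positions where either sequence has a gap or an ambiguous base
    (anything other than A, C, G, T) are skipped.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = [
        (x, y)
        for x, y in zip(seq_a.upper(), seq_b.upper())
        if x in "ACGT" and y in "ACGT"
    ]
    if not compared:
        raise ValueError("no comparable positions")
    mismatches = sum(x != y for x, y in compared)
    return mismatches / len(compared)


# Hypothetical 100-bp toy alignment with two substitutions (positions 0 and 50).
seq_a = "ACGTACGTAC" * 10
seq_b = "T" + seq_a[1:50] + "G" + seq_a[51:]

divergence = p_distance(seq_a, seq_b)
within_threshold = divergence < 0.03  # the <3% cutoff quoted in the abstract
```

Real barcode analyses often apply a substitution model (for example, Kimura 2-parameter) rather than the raw proportion of differing sites; the sketch only shows how a percentage threshold is applied to a pairwise comparison.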
Capturing novel mouse genes encoding chromosomal and other nuclear proteins.
Tate, P; Lee, M; Tweedie, S; Skarnes, W C; Bickmore, W A
1998-09-01
The burgeoning wealth of gene sequences contrasts with our ignorance of gene function. One route to assigning function is by determining the sub-cellular location of proteins. We describe the identification of mouse genes encoding proteins that are confined to nuclear compartments by splicing endogenous gene sequences to a promoterless betageo reporter, using a gene trap approach. Mouse ES (embryonic stem) cell lines were identified that express betageo fusions located within sub-nuclear compartments, including chromosomes, the nucleolus and foci containing splicing factors. The sequences of 11 trapped genes were ascertained, and characterisation of endogenous protein distribution in two cases confirmed the validity of the approach. Three novel proteins concentrated within distinct chromosomal domains were identified, one of which appears to be a serine/threonine kinase. The sequence of a gene whose product co-localises with spliceosome components suggests that this protein may be an E3 ubiquitin-protein ligase. The majority of the other genes isolated represent novel genes. This approach is shown to be a powerful tool for identifying genes encoding novel proteins with specific sub-nuclear localisations and exposes our ignorance of the protein composition of the nucleus. Motifs in two of the isolated genes suggest new links between cellular regulatory mechanisms (ubiquitination and phosphorylation) and mRNA splicing and chromosome structure/function.
Chen, Min; Tan, Qiuping; Sun, Mingyue; Li, Dongmei; Fu, Xiling; Chen, Xiude; Xiao, Wei; Li, Ling; Gao, Dongsheng
2016-06-01
Bud dormancy in deciduous fruit trees is an important adaptive mechanism for their survival in cold climates. The WRKY genes participate in several developmental and physiological processes, including dormancy. However, the dormancy mechanisms of WRKY genes have not been studied in detail. We conducted a genome-wide analysis and identified 58 WRKY genes in peach. These putative genes were located on all eight chromosomes. In bioinformatics analyses, we compared the sequences of WRKY genes from peach, rice, and Arabidopsis. In a cluster analysis, the gene sequences formed three groups, of which group II was further divided into five subgroups. Gene structure was highly conserved within each group, especially in groups IId and III. Gene expression analyses by qRT-PCR showed that WRKY genes showed different expression patterns in peach buds during dormancy. The mean expression levels of six WRKY genes (Prupe.6G286000, Prupe.1G393000, Prupe.1G114800, Prupe.1G071400, Prupe.2G185100, and Prupe.2G307400) increased during endodormancy and decreased during ecodormancy, indicating that these six WRKY genes may play a role in dormancy in a perennial fruit tree. This information will be useful for selecting fruit trees with desirable dormancy characteristics or for manipulating dormancy in genetic engineering programs.
Glycomic and glycoproteomic analysis of glycoproteins—a tutorial
Shajahan, Asif; Heiss, Christian; Ishihara, Mayumi; ...
2017-06-06
The structural analysis of glycoproteins is a challenging endeavor and is under steadily increasing demand, but only a very limited number of labs have the expertise required to accomplish this task. This tutorial is aimed at researchers from the fields of molecular biology and biochemistry who have discovered that glycoproteins are important in their biological research and are looking for the tools to elucidate their structure. It provides brief descriptions of the major and most common analytical techniques used in glycomics and glycoproteomics analysis, including explanations of the rationales for individual steps and references to published literature containing the experimental details necessary to carry out the analyses. Glycomics includes the comprehensive study of the structure and function of the glycans expressed in a given cell or organism along with identification of all the genes that encode glycoproteins and glycosyltransferases. Glycoproteomics, which is a subset of both glycomics and proteomics, is the identification and characterization of proteins bearing carbohydrates as posttranslational modification. This tutorial is designed to ease entry into the glycomics and glycoproteomics field for those without prior carbohydrate analysis experience.
Wendler, Sergej; Hürtgen, Daniel; Kalinowski, Jörn; Klein, Andreas; Niehaus, Karsten; Schulte, Fabian; Schwientek, Patrick; Wehlmann, Hermann; Wehmeier, Udo F; Pühler, Alfred
2013-08-20
The pseudotetrasaccharide acarbose is a medically relevant secondary metabolite produced by strains of the genera Actinoplanes and Streptomyces. In this study gene products involved in acarbose metabolism were identified by analyzing the cytosolic and extracellular proteome of Actinoplanes sp. SE50/110 cultures grown in a high-maltose minimal medium. The analysis by 2D protein gel electrophoresis of cytosolic proteins of Actinoplanes sp. SE50/110 resulted in 318 protein spots and 162 identified proteins. Nine of those were acarbose cluster proteins (Acb-proteins), namely AcbB, AcbD, AcbE, AcbK, AcbL, AcbN, AcbR, AcbV and AcbZ. The analysis of proteins in the extracellular space of Actinoplanes sp. SE50/110 cultures resulted in about 100 protein spots and 22 identified proteins. The identifications included the three acarbose gene cluster proteins AcbD, AcbE and AcbZ. After their identification, proteins were classified into functional groups. The dominant functional groups were the carbohydrate binding, carbohydrate cleavage and carbohydrate transport proteins. The other functional groups included protein cleavage, amino acid degradation, nucleic acid cleavage and a number of functionally uncharacterized proteins. In addition, signal peptide structures of extracellularly found proteins were analyzed. Of the 22 detected proteins 19 contained signal peptides, while 2 had N-terminal transmembrane helices explaining their localization. The only protein having neither of them was enolase. Under the conditions applied, the secretome of Actinoplanes sp. SE50/110 was dominated by seven proteins involved in carbohydrate metabolism (PulA, AcbE, AcbD, MalE, AglE, CbpA and Cgt). Of special interest were the identified extracellular pullulanase PulA and the two solute-binding proteins MalE and AglE. The identifications suggest that Actinoplanes sp. SE50/110 has two maltose/maltodextrin import systems. We postulate the identified MalEFG transport system of Actinoplanes sp. 
SE50/110 as the missing acarbose-metabolite importer and present a model of acarbose metabolism that is extended by the newly identified gene products. Copyright © 2012 Elsevier B.V. All rights reserved.
Du, Jiancan; Hu, Simin; Yu, Qin; Wang, Chongde; Yang, Yunqiang; Sun, Hang; Yang, Yongping; Sun, Xudong
2017-01-01
The teosinte branched1/cycloidea/proliferating cell factor (TCP) gene family is a plant-specific transcription factor family that participates in the control of plant development by regulating cell proliferation. However, no report is currently available about this gene family in turnips (Brassica rapa ssp. rapa). In this study, a genome-wide analysis of TCP genes was performed in turnips. Thirty-nine TCP genes were identified in the turnip genome and were distributed on 10 chromosomes. Phylogenetic analysis clearly showed that the family was classified into two clades: class I and class II. Gene structure and conserved motif analysis showed that genes of the same clade have similar gene structures and conserved motifs. The expression profiles of 39 TCP genes were determined through quantitative real-time PCR. Most CIN-type BrrTCP genes were highly expressed in leaf. The members of the CYC/TB1 subclade are highly expressed in flower bud and weakly expressed in root. By contrast, the class I clade showed more widespread but less tissue-specific expression patterns. Yeast two-hybrid data show that BrrTCP proteins preferentially formed heterodimers. The function of BrrTCP2 was confirmed through ectopic expression of BrrTCP2 in wild-type and loss-of-function ortholog mutant of Arabidopsis. Overexpression of BrrTCP2 in wild-type Arabidopsis resulted in diminished leaf size. Overexpression of BrrTCP2 in triple mutants of tcp2/4/10 restored the leaf phenotype of tcp2/4/10 to the phenotype of wild type. The comprehensive analysis of the turnip TCP gene family provided the foundation to further study the roles of TCP genes in turnips.
Damberg, M; Garpenstrand, H; Alfredsson, J; Ekblom, J; Forslund, K; Rylander, G; Oreland, L
2000-03-01
Transcription factor AP-2beta is implicated in playing an important role during embryonic development of different parts of the brain, eg, midbrain, hindbrain, spinal cord, dorsal and cranial root ganglia.1,2 The gene encoding AP-2beta contains a polymorphic region which includes a tetranucleotide repeat of [CAAA] four or five times, located in intron 2 between nucleotides 12593 and 12612.3 Since the midbrain contains structures important for variables such as mood and personality, we have investigated if the AP-2beta genotype is associated with personality traits estimated by the Karolinska Scales of Personality (KSP). Identification of transcription factor genes as candidate genes in psychiatric disorders is a novel approach to further elucidate the genetic factors that, together with environmental factors, are involved in the expression of specific psychiatric phenotypes. The AP-2beta genotype and KSP scores were determined for 137 Caucasian volunteers (73 females and 64 males). The personality traits muscular tension, guilt, somatic anxiety, psychastenia and indirect aggression were significantly associated with the specific AP-2beta genotype, albeit with significant difference between genders. Based on this result the human AP-2beta gene seems to be an important candidate gene for personality disorders. Moreover, the present results suggest that the structure of the intron 2 region of the AP-2beta gene is one factor that contributes to development of the constitutional component of specific personality traits.
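The polymorphism described in the preceding abstract is a [CAAA] tetranucleotide repeat present four or five times. As a rough sketch of how such an allele could be called from a sequence read, the Python snippet below finds the longest uninterrupted run of the repeat unit. The function name and the flanking bases in the toy alleles are hypothetical; the abstract gives only the repeat unit and copy numbers, not the flanking sequence or the genotyping method actually used.

```python
import re


def longest_repeat_run(sequence: str, unit: str = "CAAA") -> int:
    """Return the copy number of the longest uninterrupted run of `unit`."""
    pattern = f"(?:{re.escape(unit.upper())})+"
    runs = re.findall(pattern, sequence.upper())
    return max((len(run) // len(unit) for run in runs), default=0)


# Hypothetical alleles: invented flanking bases around 4 or 5 CAAA copies.
allele_four = "GGT" + "CAAA" * 4 + "TTG"
allele_five = "GGT" + "CAAA" * 5 + "TTG"

copies_four = longest_repeat_run(allele_four)
copies_five = longest_repeat_run(allele_five)
```

A genotype for one individual would then be the pair of copy numbers observed across the two alleles.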
Seon, A A; Pierre, T N; Redeker, V; Lacombe, C; Delfour, A; Nicolas, P; Amiche, M
2000-02-25
Calcitonin gene-related peptide has been extracted from the skin exudate of a single living specimen of the frog Phyllomedusa bicolor and purified to homogeneity by a two-step protocol. A total volume of 250 microl of exudate yielded 380 microg of purified peptide. Mass spectrometric analysis and gas phase sequencing of the purified peptide as well as chemical synthesis and cDNA analysis were consistent with the structure SCDTSTCATQRLADFLSRSGGIGSPDFVPTDVSANSF amide and the presence of a disulfide bridge linking Cys(2) and Cys(7). The skin peptide, named skin calcitonin gene-related peptide, differs significantly from all other members of the calcitonin gene-related peptide family of peptides at nine positions but binds with high affinity to calcitonin gene-related peptide receptors in the rat brain and acts as an agonist in the rat vas deferens bioassay with potencies equal to those of human CGRP. Reverse transcriptase-polymerase chain reaction coupled with cDNA cloning and sequencing demonstrated that skin calcitonin gene-related peptide isolated in the skin is identical to that present in the frog's central and enteric nervous systems. These data, which indicate for the first time the existence of calcitonin gene-related peptide in the frog skin, add further support to the brain-skin-gut triangle hypothesis as a useful tool in the identification and/or isolation of mammalian peptides that are present in the brain and other tissues in only minute quantities.
Wheat CBF gene family: identification of polymorphisms in the CBF coding sequence.
Mohseni, Sara; Che, Hua; Djillali, Zakia; Dumont, Estelle; Nankeu, Joseph; Danyluk, Jean
2012-12-01
Expression of cold-regulated genes needed for protection against freezing stress is mediated, in part, by the CBF transcription factor family. Previous studies with temperate cereals suggested that the CBF gene family in wheat was large, and that CBF genes were at the base of an important low temperature tolerance trait. Therefore, the goal of our study was to identify the CBF repertoire in the freezing-tolerant hexaploid wheat cultivar Norstar, and then to examine if the coding region of CBF genes in two spring cultivars contain polymorphisms that could affect the protein sequence and structure. Our analyses reveal that hexaploid wheat contains a complex CBF family consisting of at least 65 CBF genes of which 60 are known to be expressed in the cultivar Norstar. They represent 27 paralogous genes with 1-3 homeologous copies for the A, B, and D genomes. The cultivar Norstar contains two pseudogenes and at least 24 additional proteins having sequences and (or) structures that deviate from the consensus in the conserved AP2 DNA-binding and (or) C-terminal activation-domains. This suggests that in cultivars such as Norstar, low temperature tolerance may be increased through breeding of additional optimal alleles. The examination of the CBF repertoire present in the two spring cultivars, Chinese Spring and Manitou, reveals that they have additional polymorphisms affecting conserved positions in these domains. Understanding the effects of these polymorphisms will provide additional information for the selection of optimum CBF alleles in Triticeae breeding programs.
Dwivedi, Ankit; Khim, Nimol; Reynes, Christelle; Ravel, Patrice; Ma, Laurence; Tichit, Magali; Bourchier, Christiane; Kim, Saorin; Dourng, Dany; Khean, Chanra; Chim, Pheaktra; Siv, Sovannaroth; Frutos, Roger; Lek, Dysoley; Mercereau-Puijalon, Odile; Ariey, Frédéric; Menard, Didier; Cornillot, Emmanuel
2016-06-14
Western Cambodia is recognized as the epicentre of emergence of Plasmodium falciparum multi-drug resistance. The emergence of artemisinin resistance has been observed in this area since 2008-2009 and molecular signatures associated with artemisinin resistance have been characterized in the k13 gene. At present, one of the major threats is the possible spread of Asian artemisinin-resistant parasites across the world, threatening millions of people and jeopardizing malaria elimination programme efforts. To anticipate the diffusion of artemisinin resistance, the identification of the P. falciparum population structure and the gene flow among the parasite population in Cambodia are essential. To this end, a mid-throughput PCR-LDR-FMA approach based on LUMINEX technology was developed to screen for a genetic barcode in 533 blood samples collected in 2010-2011 from 16 health centres in malaria-endemic areas in Cambodia. Based on successful typing of 282 samples, subpopulations were characterized along the borders of the country. Each 11-loci barcode provides evidence supporting an allele distribution gradient related to subpopulations and gene flow. The 11-loci barcode successfully identifies recently emerging parasite subpopulations in western Cambodia that are associated with the C580Y dominant allele for artemisinin resistance in the k13 gene. A subpopulation was identified in northern Cambodia that was associated with artemisinin resistance (R539T resistant allele of the k13 gene) and mefloquine resistance. The gene flow between these subpopulations might have driven the spread of artemisinin resistance over Cambodia.
Lynch, T; Gregson, D; Church, D L
2016-03-01
Actinomyces species are uncommon but important causes of invasive infections. The ability of our regional clinical microbiology laboratory to report species-level identification of Actinomyces relied on molecular identification by partial sequencing of the 16S ribosomal gene prior to the implementation of the Vitek MS (matrix-assisted laser desorption ionization-time of flight mass spectrometry [MALDI-TOF MS]) system. We compared the use of the Vitek MS to that of 16S rRNA gene sequencing for reliable species-level identification of invasive infections caused by Actinomyces spp. because limited data had been published for this important genera. A total of 115 cases of Actinomyces spp., either alone or as part of a polymicrobial infection, were diagnosed between 2011 and 2014. Actinomyces spp. were considered the principal pathogen in bloodstream infections (n = 17, 15%), in skin and soft tissue abscesses (n = 25, 22%), and in pulmonary (n = 26, 23%), bone (n = 27, 23%), intraabdominal (n = 16, 14%), and central nervous system (n = 4, 3%) infections. Compared to sequencing and identification from the SmartGene Integrated Database Network System (IDNS), Vitek MS identified 47/115 (41%) isolates to the correct species and 10 (9%) isolates to the correct genus. However, the Vitek MS was unable to provide identification for 43 (37%) isolates while 15 (13%) had discordant results. Phylogenetic analyses of the 16S rRNA sequences demonstrate high diversity in recovered Actinomyces spp. and provide additional information to compare/confirm discordant identifications between MALDI-TOF and 16S rRNA gene sequences. This study highlights the diversity of clinically relevant Actinomyces spp. and provides an important typing comparison. Based on our analysis, 16S rRNA gene sequencing should be used to rapidly identify Actinomyces spp. until MALDI-TOF databases are optimized. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
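The Vitek MS outcome percentages reported above can be reproduced from the raw counts. This is a small arithmetic sketch using only the numbers in the abstract; the category labels are paraphrased for illustration.

```python
TOTAL_ISOLATES = 115

# Counts as reported in the abstract for Vitek MS versus 16S rRNA sequencing.
vitek_outcomes = {
    "correct species": 47,
    "correct genus only": 10,
    "no identification": 43,
    "discordant": 15,
}


def percentages(counts, total):
    """Convert raw counts to whole-number percentages of the total."""
    return {label: round(100 * n / total) for label, n in counts.items()}


# The four categories account for every isolate.
assert sum(vitek_outcomes.values()) == TOTAL_ISOLATES

for label, pct in percentages(vitek_outcomes, TOTAL_ISOLATES).items():
    print(f"{label}: {pct}%")
```

The four categories sum to 115, so the reported 41%, 9%, 37% and 13% cover all isolates.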
Gregson, D.; Church, D. L.
2016-01-01
Actinomyces species are uncommon but important causes of invasive infections. The ability of our regional clinical microbiology laboratory to report species-level identification of Actinomyces relied on molecular identification by partial sequencing of the 16S ribosomal gene prior to the implementation of the Vitek MS (matrix-assisted laser desorption ionization–time of flight mass spectrometry [MALDI-TOF MS]) system. We compared the use of the Vitek MS to that of 16S rRNA gene sequencing for reliable species-level identification of invasive infections caused by Actinomyces spp. because limited data had been published for this important genera. A total of 115 cases of Actinomyces spp., either alone or as part of a polymicrobial infection, were diagnosed between 2011 and 2014. Actinomyces spp. were considered the principal pathogen in bloodstream infections (n = 17, 15%), in skin and soft tissue abscesses (n = 25, 22%), and in pulmonary (n = 26, 23%), bone (n = 27, 23%), intraabdominal (n = 16, 14%), and central nervous system (n = 4, 3%) infections. Compared to sequencing and identification from the SmartGene Integrated Database Network System (IDNS), Vitek MS identified 47/115 (41%) isolates to the correct species and 10 (9%) isolates to the correct genus. However, the Vitek MS was unable to provide identification for 43 (37%) isolates while 15 (13%) had discordant results. Phylogenetic analyses of the 16S rRNA sequences demonstrate high diversity in recovered Actinomyces spp. and provide additional information to compare/confirm discordant identifications between MALDI-TOF and 16S rRNA gene sequences. This study highlights the diversity of clinically relevant Actinomyces spp. and provides an important typing comparison. Based on our analysis, 16S rRNA gene sequencing should be used to rapidly identify Actinomyces spp. until MALDI-TOF databases are optimized. PMID:26739153
Chen, Rui; Jiang, Li-Yun; Qiao, Ge-Xia
2012-01-01
The mitochondrial gene COI has been widely used by taxonomists as a standard DNA barcode sequence for the identification of many animal species. However, the COI region is of limited use for identifying certain species and is not efficiently amplified by PCR in all animal taxa. To evaluate the utility of COI as a DNA barcode and to identify other barcode genes, we chose the aphid subfamily Lachninae (Hemiptera: Aphididae) as the focus of our study. We compared the results obtained using COI with two other mitochondrial genes, COII and Cytb. In addition, we propose a new method to improve the efficiency of species identification using DNA barcoding. Three mitochondrial genes (COI, COII and Cytb) were sequenced and were used in the identification of over 80 species of Lachninae. The COI and COII genes demonstrated a greater PCR amplification efficiency than Cytb. Species identification using COII sequences had a higher frequency of success (96.9% in "best match" and 90.8% in "best close match") and yielded lower intra- and higher interspecific genetic divergence values than the other two markers. The use of "tag barcodes" is a new approach that involves attaching a species-specific tag to the standard DNA barcode. With this method, the "barcoding overlap" can be nearly eliminated. As a result, we were able to increase the identification success rate from 83.9% to 95.2% by using COI and the "best close match" technique. A COII-based identification system should be more effective in identifying lachnine species than COI or Cytb. However, the Cytb gene is an effective marker for the study of aphid population genetics due to its high sequence diversity. Furthermore, the use of "tag barcodes" can improve the accuracy of DNA barcoding identification by reducing or removing the overlap between intra- and inter-specific genetic divergence values.
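The "best close match" criterion mentioned above can be illustrated with a toy implementation. This is a sketch under stated assumptions, not the authors' pipeline: it uses an uncorrected p-distance on pre-aligned sequences, and the reference sequences, species labels and thresholds are all hypothetical. The "tag barcode" extension is not modelled here.

```python
def p_distance(a, b):
    """Proportion of differing sites between two aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def best_close_match(query, references, threshold):
    """Assign a query to the species of its nearest reference barcode.

    references: list of (species, sequence) pairs (hypothetical toy data).
    Returns the species name, "ambiguous" when equally close references
    belong to different species, or "no match" when the nearest reference
    is further away than the threshold.
    """
    distances = [(p_distance(query, seq), species) for species, seq in references]
    best = min(d for d, _ in distances)
    if best > threshold:
        return "no match"
    nearest_species = {s for d, s in distances if d == best}
    return nearest_species.pop() if len(nearest_species) == 1 else "ambiguous"


# Hypothetical 10-bp "barcodes" for illustration only.
refs = [
    ("sp_A", "ACGTACGTAC"),
    ("sp_A", "ACGTACGTAA"),
    ("sp_B", "TTGTACGGAC"),
]
print(best_close_match("ACGTACGTAT", refs, threshold=0.10))
print(best_close_match("ACGTACGTAT", refs, threshold=0.03))
```

With the looser threshold the query is assigned to sp_A; with the stricter one it is left unidentified, which is the difference between "best match" and "best close match".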
Identification of three duplicated Spin genes in medaka (Oryzias latipes).
Wang, Xiao-Lei; Mei, Jie; Sun, Min; Hong, Yun-Han; Gui, Jian-Fang
2005-05-09
Gene and genomic duplications are very important and frequent events in fish evolution, and the divergence of duplicated genes in sequences and functions is a focus of research on gene evolution. Here, we report the identification and characterization of three duplicated Spindlin (Spin) genes from medaka (Oryzias latipes): OlSpinA, OlSpinB, and OlSpinC. Molecular cloning, genomic DNA Blast analysis and phylogenetic relationship analysis demonstrated that the three OlSpin genes arose by gene duplication. Furthermore, Western blot analysis revealed significant expression differences of the three OlSpins among different tissues and during embryogenesis in medaka, and suggested that sequence and functional divergence might have occurred among them during evolution.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nunn, D.N.; Lidstrom, M.E.
A method has been developed for the direct selection of methanol oxidation mutants of the facultative methylotroph Methylobacterium sp. strain AM1 (formerly Pseudomonas sp. strain AM1). Using this direct selection technique, we have isolated mutants of Methylobacterium sp. strain AM1 that are no longer capable of growth on methanol but retain the ability to grow on methylamine. These methanol oxidation (Mox) mutants were complemented with a genomic clone bank of this organism constructed in the broad-host-range cosmid pVK100, and subcloning and Tn5 mutagenesis experiments have assigned the Mox mutants to 10 distinct complementation groups. Using an open reading frame beta-galactosidase fusion vector and antibodies specific for Methylobacterium sp. strain AM1 methanol dehydrogenase, we have identified the methanol dehydrogenase structural gene and determined the direction of transcription. The results suggest that the synthesis and utilization of an active methanol dehydrogenase in this organism requires at least 10 different gene functions.
Identification and characterization of the autophagy-related genes Atg12 and Atg5 in hydra.
Dixit, Nishikant S; Shravage, Bhupendra V; Ghaskadbi, Surendra
2017-01-01
Autophagy is an evolutionarily conserved process in eukaryotic cells that is involved in the degradation of cytoplasmic contents including organelles via the lysosome. Hydra is an early metazoan which exhibits simple tissue grade organization, a primitive nervous system, and is one of the classical non-bilaterian models extensively used in evo-devo research. Here, we describe the characterization of two core autophagy genes, Atg12 and Atg5, from hydra. In silico analyses including sequence similarity, domain analysis, and phylogenetic analysis demonstrate the conservation of these genes across eukaryotes. The predicted 3D structure of hydra Atg12 showed very little variance when compared to human Atg12 and yeast Atg12, whereas the hydra Atg5 predicted 3D structure was found to be variable, when compared with its human and yeast homologs. Strikingly, whole mount in situ hybridization showed high expression of Atg12 transcripts specifically in nematoblasts, whereas Atg5 transcripts were found to be expressed strongly in budding region and growing buds. This study may provide a framework to understand the evolution of autophagy networks in higher eukaryotes.
Schuurs-Hoeijmakers, Janneke H M; Vulto-van Silfhout, Anneke T; Vissers, Lisenka E L M; van de Vondervoort, Ilse I G M; van Bon, Bregje W M; de Ligt, Joep; Gilissen, Christian; Hehir-Kwa, Jayne Y; Neveling, Kornelia; del Rosario, Marisol; Hira, Gausiya; Reitano, Santina; Vitello, Aurelio; Failla, Pinella; Greco, Donatella; Fichera, Marco; Galesi, Ornella; Kleefstra, Tjitske; Greally, Marie T; Ockeloen, Charlotte W; Willemsen, Marjolein H; Bongers, Ernie M H F; Janssen, Irene M; Pfundt, Rolph; Veltman, Joris A; Romano, Corrado; Willemsen, Michèl A; van Bokhoven, Hans; Brunner, Han G; de Vries, Bert B A; de Brouwer, Arjan P M
2013-12-01
Intellectual disability (ID) is a common neurodevelopmental disorder affecting 1-3% of the general population. Mutations in more than 10% of all human genes are considered to be involved in this disorder, although the majority of these genes are still unknown. We investigated 19 small non-consanguineous families with two to five affected siblings in order to identify pathogenic gene variants in known, novel and potential ID candidate genes. Non-consanguineous families have been largely ignored in gene identification studies as small family size precludes prior mapping of the genetic defect. Using exome sequencing, we identified pathogenic mutations in three genes, DDHD2, SLC6A8, and SLC9A6, of which the latter two have previously been implicated in X-linked ID phenotypes. In addition, we identified potentially pathogenic mutations in BCORL1 on the X-chromosome and in MCM3AP, PTPRT, SYNE1, and ZNF528 on autosomes. We show that potentially pathogenic gene variants can be identified in small, non-consanguineous families with as few as two affected siblings, thus emphasising their value in the identification of syndromic and non-syndromic ID genes.
Liu, Lei; Ang, Keng Pee; Elliott, J A K; Kent, Matthew Peter; Lien, Sigbjørn; MacDonald, Danielle; Boulding, Elizabeth Grace
2017-03-01
Comparative genome scans can be used to identify chromosome regions, but not traits, that are putatively under selection. Identification of targeted traits may be more likely in recently domesticated populations under strong artificial selection for increased production. We used a North American Atlantic salmon 6K SNP dataset to locate genome regions of an aquaculture strain (Saint John River) that were highly diverged from that of its putative wild founder population (Tobique River). First, admixed individuals with partial European ancestry were detected using STRUCTURE and removed from the dataset. Outlier loci were then identified as those showing extreme differentiation between the aquaculture population and the founder population. All Arlequin methods identified an overlapping subset of 17 outlier loci, three of which were also identified by BayeScan. Many outlier loci were near candidate genes and some were near published quantitative trait loci (QTLs) for growth, appetite, maturity, or disease resistance. Parallel comparisons using a wild, nonfounder population (Stewiacke River) yielded only one overlapping outlier locus as well as a known maturity QTL. We conclude that genome scans comparing a recently domesticated strain with its wild founder population can facilitate identification of candidate genes for traits known to have been under strong artificial selection.
Eves-van den Akker, Sebastian; Lilley, Catherine J.; Jones, John T.; Urwin, Peter E.
2014-01-01
Sedentary endoparasitic nematodes are obligate biotrophs that modify host root tissues, using a suite of effector proteins to create and maintain a feeding site that is their sole source of nutrition. Using assumptions about the characteristics of genes involved in plant-nematode biotrophic interactions to inform the identification strategy, we provide a description and characterisation of a novel group of hyper-variable extracellular effectors termed HYP, from the potato cyst nematode Globodera pallida. HYP effectors comprise a large gene family, with a modular structure, and have unparalleled diversity between individuals of the same population: no two nematodes tested had the same genetic complement of HYP effectors. Individuals vary in the number, size, and type of effector subfamilies. HYP effectors are expressed throughout the biotrophic stages in large secretory cells associated with the amphids of parasitic stage nematodes as confirmed by in situ hybridisation. The encoded proteins are secreted into the host roots where they are detectable by immunochemistry in the apoplasm, between the anterior end of the nematode and the feeding site. We have identified HYP effectors in three genera of plant parasitic nematodes capable of infecting a broad range of mono- and dicotyledon crop species. In planta RNAi targeted to all members of the effector family causes a reduction in successful parasitism. PMID:25255291
Srivastava, A; Singh, V K; Patnaik, S; Tripathi, J; Singh, P; Nath, G; Asthana, R K
2017-04-01
Explorations of freshwater Cyanobacteria as antimicrobial (bacteria, fungi and methicillin-resistant Staphylococcus aureus (MRSA) strains) drug resource using bioassay, NRPS (non-ribosomal polypeptide synthetase) and PKS (polyketide synthase) genes, as well as in silico approach. We have bioassayed the extracts of Phormidium CCC727, Geitlerinema CCC728, Arthrospira CCC729, Leptolyngbya CCC732, Phormidium CCC730, Phormidium CCC731 against six pathogenic bacteria comprising Gram (+ve): S. aureus including seven clinical MRSA and Enterococcus faecalis, Gram (-ve): Escherichia coli, Salmonella Typhimurium, Klebsiella pneumoniae and Shigella boydii along with non-pathogenic Enterobacter aerogenes as well as fungal strains (Cryptococcus neoformans and Candida albicans, C. krusei, C. tropicalis and Aspergillus niger) exhibiting antimicrobial potential. The NRPS and PKS genes of the target strains were also amplified and sequenced. The putative protein structures were predicted using bioinformatics approach. PKS gene expression indicated β keto-acyl synthase as one of the important active domains in the biomolecules related to antitumour and antifungal group. The simultaneous identification of the biomolecule (dihydro-2H-pyran-2-one derivative) was also inferred spectroscopically. Freshwater Cyanobacteria are prolific producers of secondary metabolite(s) that may act as the antimicrobial drug resource in addition to their much explored marine counterpart. © 2016 The Society for Applied Microbiology.
Fingerprinting Soybean Germplasm and Its Utility in Genomic Research
Song, Qijian; Hyten, David L.; Jia, Gaofeng; Quigley, Charles V.; Fickus, Edward W.; Nelson, Randall L.; Cregan, Perry B.
2015-01-01
The United States Department of Agriculture, Soybean Germplasm Collection includes 18,480 domesticated soybean and 1168 wild soybean accessions introduced from 84 countries or developed in the United States. This collection was genotyped with the SoySNP50K BeadChip containing greater than 50K single-nucleotide polymorphisms. Redundant accessions were identified in the collection, and distinct genetic backgrounds of soybean from different geographic origins were observed that could be a unique resource for soybean genetic improvement. We detected a dramatic reduction of genetic diversity based on linkage disequilibrium and haplotype structure analyses of the wild, landrace, and North American cultivar populations and identified candidate regions associated with domestication and selection imposed by North American breeding. We constructed the first soybean haplotype block maps in the wild, landrace, and North American cultivar populations and observed that most recombination events occurred in the regions between haplotype blocks. These haplotype maps are crucial for association mapping aimed at the identification of genes controlling traits of economic importance. A case-control association test delimited potential genomic regions along seven chromosomes that most likely contain genes controlling seed weight in domesticated soybean. The resulting dataset will facilitate germplasm utilization, identification of genes controlling important traits, and will accelerate the creation of soybean varieties with improved seed yield and quality. PMID:26224783
Identification of pathogen avirulence genes in the fusiform rust pathosystem
John M. Davis; Katherine E. Smith; Amanda Pendleton; Jason A. Smith; C. Dana Nelson
2012-01-01
The Cronartium quercuum f.sp. fusiforme (Cqf) whole genome sequencing project will enable identification of avirulence genes in the most devastating pine fungal pathogen in the southeastern United States. Amerson and colleagues (unpublished) have mapped nine fusiform rust resistance genes in loblolly pine,...
20 years since the introduction of DNA barcoding: from theory to application.
Fišer Pečnikar, Živa; Buzan, Elena V
2014-02-01
Traditionally, taxonomic identification has relied upon morphological characters. In the last two decades, molecular tools based on DNA sequences of short standardised gene fragments, termed DNA barcodes, have been developed for species discrimination. The most common DNA barcode used in animals is a fragment of the cytochrome c oxidase (COI) mitochondrial gene, while for plants, two chloroplast gene fragments from the RuBisCo large subunit (rbcL) and maturase K (matK) genes are widely used. Information gathered from DNA barcodes can be used beyond taxonomic studies and will have far-reaching implications across many fields of biology, including ecology (rapid biodiversity assessment and food chain analysis), conservation biology (monitoring of protected species), biosecurity (early identification of invasive pest species), medicine (identification of medically important pathogens and their vectors) and pharmacology (identification of active compounds). However, it is important that the limitations of DNA barcoding are understood and techniques continually adapted and improved as this young science matures.
Chen, Yunjia; Qiu, Shihong; Luan, Chi-Hao; Luo, Ming
2007-01-01
Background: Expression of higher eukaryotic genes as soluble, stable recombinant proteins is still a bottleneck step in biochemical and structural studies of novel proteins today. Correct identification of stable domains/fragments within the open reading frame (ORF), combined with proper cloning strategies, can greatly enhance the success rate when higher eukaryotic proteins are expressed as these domains/fragments. Furthermore, a high throughput (HTP) cloning pipeline incorporated with bioinformatics domain/fragment selection methods will be beneficial to studies of structure and function genomics/proteomics. Results: With bioinformatics tools, we developed a domain/domain boundary prediction (DDBP) method, which was trained by available experimental data. Combined with an improved cloning strategy, DDBP had been applied to 57 proteins from C. elegans. Expression and purification results showed there was a 10-fold increase in terms of obtaining purified proteins. Based on the DDBP method, the improved GATEWAY cloning strategy and a robotic platform, we constructed an HTP cloning pipeline, including PCR primer design, PCR, BP reaction, transformation, plating, colony picking and entry clones extraction, which has been successfully applied to 90 C. elegans genes, 88 Brucella genes, and 188 human genes. More than 97% of the targeted genes were obtained as entry clones. This pipeline has a modular design and can adopt different operations for a variety of cloning/expression strategies. Conclusion: The DDBP method and improved cloning strategy were satisfactory. The cloning pipeline, combined with our recombinant protein HTP expression pipeline and the crystal screening robots, constitutes a complete platform for structure genomics/proteomics. This platform will increase the success rate of purification and crystallization dramatically and promote the further advancement of structure genomics/proteomics. PMID:17663785
Liu, Jiangang; Wang, Dapeng; Li, Yanyan; Yao, Hui; Zhang, Nan; Zhang, Xuewen; Zhong, Fangping; Huang, Yulun
2018-06-01
The human pituitary tumor-transforming gene is an oncogenic protein which serves as a central hub in the cellular signaling network of medulloblastoma. The protein contains two vicinal PxxP motifs at its C terminus that are potential binding sites of peptide-recognition SH3 domains. Here, a synthetic protocol that integrated in silico analysis and in vitro assay was described to identify the SH3-binding partners of pituitary tumor-transforming gene in the gene expression profile of medulloblastoma. In the procedure, a variety of structurally diverse, non-redundant SH3 domains with high gene expression in medulloblastoma were compiled, and their three-dimensional structures were either manually retrieved from the Protein Data Bank or computationally modeled through bioinformatics techniques. The binding capability of these domains towards the two PxxP-containing peptides m1p: 161 LGPPSPVK 168 and m2p: 168 KMPSPPWE 175 of pituitary tumor-transforming gene was ranked by structure-based scoring and fluorescence-based assay. Consequently, a number of SH3 domains, including MAP3K and PI3K, were found to have moderate or high affinity for m1p and/or m2p. Interestingly, the two overlapping peptides exhibit a distinct binding profile to these identified domain partners, suggesting that the binding selectivity of m1p and m2p is optimized across the medulloblastoma expression spectrum by competing for domain candidates. In addition, two redesigned versions of the m1p peptide were obtained via a structure-based rational mutation approach, which exhibited an increased affinity for the domain as compared to the native peptide.
Identification and characterization of a class of MALAT1-like genomic loci
Zhang, Bin; Mao, Yuntao S.; Diermeier, Sarah D.; ...
2017-05-23
The MALAT1 (Metastasis-Associated Lung Adenocarcinoma Transcript 1) gene encodes a noncoding RNA that is processed into a long nuclear retained transcript (MALAT1) and a small cytoplasmic tRNA-like transcript (mascRNA). Using an RNA sequence- and structure-based covariance model, we identified more than 130 genomic loci in vertebrate genomes containing the MALAT1 3' end triple-helix structure and its immediate downstream tRNA-like structure, including 44 in the green lizard Anolis carolinensis. Structural and computational analyses revealed a co-occurrence of components of the 3' end module. MALAT1-like genes in Anolis carolinensis are highly expressed in adult testis, thus we named them testis-abundant long noncoding RNAs (tancRNAs). MALAT1-like loci also produce multiple small RNA species, including PIWI-interacting RNAs (piRNAs), from the antisense strand. The 3' ends of tancRNAs serve as potential targets for the PIWI-piRNA complex. Furthermore, we have identified an evolutionarily conserved class of long noncoding RNAs (lncRNAs) with similar structural constraints, post-transcriptional processing, and subcellular localization and a distinct function in spermatocytes.
Parker, Brian J; Moltke, Ida; Roth, Adam; Washietl, Stefan; Wen, Jiayu; Kellis, Manolis; Breaker, Ronald; Pedersen, Jakob Skou
2011-11-01
Regulatory RNA structures are often members of families with multiple paralogous instances across the genome. Family members share functional and structural properties, which allow them to be studied as a whole, facilitating both bioinformatic and experimental characterization. We have developed a comparative method, EvoFam, for genome-wide identification of families of regulatory RNA structures, based on primary sequence and secondary structure similarity. We apply EvoFam to a 41-way genomic vertebrate alignment. Genome-wide, we identify 220 human, high-confidence families outside protein-coding regions comprising 725 individual structures, including 48 families with known structural RNA elements. Known families identified include both noncoding RNAs, e.g., miRNAs and the recently identified MALAT1/MEN β lincRNA family; and cis-regulatory structures, e.g., iron-responsive elements. We also identify tens of new families supported by strong evolutionary evidence and other statistical evidence, such as GO term enrichments. For some of these, detailed analysis has led to the formulation of specific functional hypotheses. Examples include two hypothesized auto-regulatory feedback mechanisms: one involving six long hairpins in the 3'-UTR of MAT2A, a key metabolic gene that produces the primary human methyl donor S-adenosylmethionine; the other involving a tRNA-like structure in the intron of the tRNA maturation gene POP1. We experimentally validate the predicted MAT2A structures. Finally, we identify potential new regulatory networks, including large families of short hairpins enriched in immunity-related genes, e.g., TNF, FOS, and CTLA4, which include known transcript destabilizing elements. Our findings exemplify the diversity of post-transcriptional regulation and provide a resource for further characterization of new regulatory mechanisms and families of noncoding RNAs.
Vadigepalli, Rajanikanth; Chakravarthula, Praveen; Zak, Daniel E; Schwaber, James S; Gonye, Gregory E
2003-01-01
We have developed a bioinformatics tool named PAINT that automates the promoter analysis of a given set of genes for the presence of transcription factor binding sites. Based on coincidence of regulatory sites, this tool produces an interaction matrix that represents a candidate transcriptional regulatory network. This tool currently consists of (1) a database of promoter sequences of known or predicted genes in the Ensembl annotated mouse genome database, (2) various modules that can retrieve and process the promoter sequences for binding sites of known transcription factors, and (3) modules for visualization and analysis of the resulting set of candidate network connections. This information provides a substantially pruned list of genes and transcription factors that can be examined in detail in further experimental studies on gene regulation. Also, the candidate network can be incorporated into network identification methods in the form of constraints on feasible structures in order to render the algorithms tractable for large-scale systems. The tool can also produce output in various formats suitable for use in external visualization and analysis software. In this manuscript, PAINT is demonstrated in two case studies involving analysis of differentially regulated genes chosen from two microarray data sets. The first set is from a neuroblastoma N1E-115 cell differentiation experiment, and the second set is from neuroblastoma N1E-115 cells at different time intervals following exposure to neuropeptide angiotensin II. PAINT is available for use as an agent in BioSPICE simulation and analysis framework (www.biospice.org), and can also be accessed via a WWW interface at www.dbi.tju.edu/dbi/tools/paint/.
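The idea of an interaction matrix built from coincident regulatory sites can be sketched in a few lines. This is a toy illustration, not PAINT's implementation: binding sites are found here by exact substring matching, whereas the abstract does not specify how PAINT scores sites, and all gene names, promoter sequences and motifs below are hypothetical.

```python
def interaction_matrix(promoters, motifs):
    """Build a binary gene x transcription-factor matrix.

    promoters: dict mapping gene name -> promoter sequence (toy data).
    motifs: dict mapping TF name -> consensus site (exact match is an
            assumption made for this sketch).
    Returns (genes, tfs, matrix) where matrix[i][j] is 1 if the site of
    tfs[j] occurs in the promoter of genes[i], else 0.
    """
    genes = sorted(promoters)
    tfs = sorted(motifs)
    matrix = [
        [1 if motifs[tf] in promoters[gene] else 0 for tf in tfs]
        for gene in genes
    ]
    return genes, tfs, matrix


# Hypothetical promoters and motifs for illustration only.
promoters = {
    "geneA": "TTGACGTCAGG",
    "geneB": "CCTATAAAGG",
    "geneC": "GGTGACGTCATATAAA",
}
motifs = {"TF_CRE": "TGACGTCA", "TF_TATA": "TATAAA"}

genes, tfs, matrix = interaction_matrix(promoters, motifs)
for gene, row in zip(genes, matrix):
    print(gene, dict(zip(tfs, row)))
```

Each row of the matrix is a gene and each column a transcription factor, so genes sharing a column entry are candidates for co-regulation, which is the kind of pruned candidate network the abstract describes.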
Han, Qian; Fang, Jianmin; Ding, Haizhen; Johnson, Jody K; Christensen, Bruce M; Li, Jianyong
2002-01-01
This study describes the identification of Drosophila yellow-f and yellow-f2 as dopachrome-conversion enzymes responsible for catalysing the conversion of dopachrome into 5,6-dihydroxyindole in the melanization pathway. The Drosophila yellow-y gene and the yellow-b, -c, -f and -f2 genes were expressed in an insect cell/baculovirus expression system and their corresponding recombinant proteins were screened for dopachrome-conversion enzyme activity. Among the yellow and yellow-related genes, the yellow-f and yellow-f2 genes were identified as the genes coding for Drosophila dopachrome-conversion enzyme based on the high activity of their recombinant proteins in catalysing the production of 5,6-dihydroxyindole from dopachrome. Both yellow-f and yellow-f2 are capable of mediating a decarboxylative structural rearrangement of dopachrome, as well as an isomerization/tautomerization of dopamine chrome and dopa methyl ester chrome. Northern hybridization revealed the transcription of yellow-f in larvae and pupae, but a high abundance of mRNA was observed in later larval and early pupal stages. In contrast, yellow-f2 transcripts were present at all stages, but a high abundance of its mRNA was observed in later-stage pupae and adults. These data indicate that yellow-f and yellow-f2 complement each other during Drosophila development, that yellow-f is involved in larval and pupal melanization, and that yellow-f2 plays a major role in melanization reactions in Drosophila during later pupal and adult development. Results from this study provide the groundwork towards a better understanding of the physiological roles of the Drosophila yellow gene family. PMID:12164780
Ryan, P R; Tyerman, S D; Sasaki, T; Furuichi, T; Yamamoto, Y; Zhang, W H; Delhaize, E
2011-01-01
Acid soils restrict plant production around the world. One of the major limitations to plant growth on acid soils is the prevalence of soluble aluminium (Al(3+)) ions which can inhibit root growth at micromolar concentrations. Species that show a natural resistance to Al(3+) toxicity perform better on acid soils. Our understanding of the physiology of Al(3+) resistance in important crop plants has increased greatly over the past 20 years, largely due to the application of genetics and molecular biology. Fourteen genes from seven different species are known to contribute to Al(3+) tolerance and resistance and several additional candidates have been identified. Some of these genes account for genotypic variation within species and others do not. One mechanism of resistance which has now been identified in a range of species relies on the efflux of organic anions such as malate and citrate from roots. The genes controlling this trait are members of the ALMT and MATE families which encode membrane proteins that facilitate organic anion efflux across the plasma membrane. Identification of these and other resistance genes provides opportunities for enhancing the Al(3+) resistance of plants by marker-assisted breeding and through biotechnology. Most attempts to enhance Al(3+) resistance in plants with genetic engineering have targeted genes that are induced by Al(3+) stress or that are likely to increase organic anion efflux. In the latter case, studies have either enhanced organic anion synthesis or increased organic anion transport across the plasma membrane. Recent developments in this area are summarized and the structure-function of the TaALMT1 protein from wheat is discussed.
Hussain, Tajammul; Plunkett, Blue; Ejaz, Mahwish; Espley, Richard V.; Kayser, Oliver
2018-01-01
The liverwort Radula marginata belongs to the bryophyte division of land plants and is a prospective alternate source of cannabinoid-like compounds. However, mechanistic insights into the molecular pathways directing the synthesis of these cannabinoid-like compounds have been hindered due to the lack of genetic information. This prompted us to do deep sequencing, de novo assembly and annotation of R. marginata transcriptome, which resulted in the identification and validation of the genes for cannabinoid biosynthetic pathway. In total, we have identified 11,421 putative genes encoding 1,554 enzymes from 145 biosynthetic pathways. Interestingly, we have identified all the upstream genes of the central precursor of cannabinoid biosynthesis, cannabigerolic acid (CBGA), including its two first intermediates, stilbene acid (SA) and geranyl diphosphate (GPP). Expression of all these genes was validated using quantitative real-time PCR. We have characterized the protein structure of stilbene synthase (STS), which is considered as a homolog of olivetolic acid in R. marginata. Moreover, the metabolomics approach enabled us to identify CBGA-analogous compounds using electrospray ionization mass spectrometry (ESI-MS/MS) and gas chromatography mass spectrometry (GC-MS). Transcriptomic analysis revealed 1085 transcription factors (TF) from 39 families. Comparative analysis showed that six TF families have been uniquely predicted in R. marginata. In addition, the bioinformatics analysis predicted a large number of simple sequence repeats (SSRs) and non-coding RNAs (ncRNAs). Our results collectively provide mechanistic insights into the putative precursor genes for the biosynthesis of cannabinoid-like compounds and a novel transcriptomic resource for R. marginata. The large-scale transcriptomic resource generated in this study would further serve as a reference transcriptome to explore the Radulaceae family.
Goulin, Eduardo Henrique; Savi, Daiani Cristina; Petters, Desirrê Alexia Lourenço; Kava, Vanessa; Galli-Terasawa, Lygia; Silva, Geraldo José; Glienke, Chirlei
2016-11-01
Phyllosticta citricarpa is the epidemiological agent of Citrus Black Spot (CBS) disease, which is responsible for large economic losses worldwide. CBS is characterized by the presence of spores (pycnidiospores) in dark lesions of fruit, which are also responsible for short distance dispersal of the disease. The identification of genes involved in asexual reproduction of P. citricarpa can be an alternative for directional disease control. We analyzed a library of mutants obtained through Agrobacterium tumefaciens transformation system, looking for alterations in growth and reproductive structure formation. Two mutant strains were found to have lost the ability to form pycnidia. The flanking T-DNA insertion regions were identified on P. citricarpa genome by using blast analysis and further gene prediction. The predicted genes containing the T-DNA insertions were identified as Spindle Poison Sensitivity Scp3, Ion Transport protein, and Cullin Binding proteins. The Ion Transport and Cullin Binding proteins are known to be correlated with sexual and asexual reproduction in fungi; however, the exact mechanism by which these proteins act on spore formation in P. citricarpa needs to be better characterized. The Scp3 proteins are suggested here for the first time as being associated with asexual reproduction in fungus. This protein is associated with microtubule formation, and as microtubules play an essential role as spindle machinery for chromosome segregation and cytokinesis, insertions in this gene can lead to abnormal formations, such as that observed here in P. citricarpa. We suggest these genes as new targets for fungicide development and CBS disease control, by iRNA. Copyright © 2016 Elsevier GmbH. All rights reserved.
Schneider, Lizette M; Adamski, Nikolai M; Christensen, Caspar Elo; Stuart, David B; Vautrin, Sonia; Hansson, Mats; Uauy, Cristobal; von Wettstein-Knowles, Penny
2016-03-09
Aliphatic compounds on plant surfaces, called epicuticular waxes, are the first line of defense against pathogens and pests, contribute to reducing water loss and determine other important phenotypes. Aliphatics can form crystals affecting light refraction, resulting in a color change and allowing identification of mutants in their synthesis or transport. The present study discloses three such Eceriferum (cer) genes in barley - Cer-c, Cer-q and Cer-u - known to be tightly linked and functioning in a biochemical pathway forming dominating amounts of β-diketone and hydroxy-β-diketones plus some esterified alkan-2-ols. These aliphatics are present in many Triticeae as well as dicotyledons such as Eucalyptus and Dianthus. Recently developed genomic resources and mapping populations in barley defined these genes to a small region on chromosome arm 2HS. Exploiting Cer-c and -u potential functions pinpointed five candidates, of which three were missing in apparent cer-cqu triple mutants. Sequencing more than 50 independent mutants for each gene confirmed their identification. Cer-c is a chalcone synthase-like polyketide synthase, designated diketone synthase (DKS), Cer-q is a lipase/carboxyl transferase and Cer-u is a P450 enzyme. All were highly expressed in pertinent leaf sheath tissue of wild type. A physical map revealed the order Cer-c, Cer-u, Cer-q with the flanking genes 101kb apart, confirming they are a gene cluster, Cer-cqu. Homology-based modeling suggests that many of the mutant alleles affect overall protein structure or specific active site residues. The rich diversity of identified mutations will facilitate future studies of three key enzymes involved in synthesis of plant apoplast waxes. © The Author 2016. Published by Oxford University Press on behalf of the Society for Experimental Biology.
Schneider, Lizette M; Adamski, Nikolai M; Christensen, Caspar Elo; Stuart, David B; Vautrin, Sonia; Hansson, Mats; Uauy, Cristobal; von Wettstein-Knowles, Penny
2016-01-01
Aliphatic compounds on plant surfaces, called epicuticular waxes, are the first line of defense against pathogens and pests, contribute to reducing water loss and determine other important phenotypes. Aliphatics can form crystals affecting light refraction, resulting in a color change and allowing identification of mutants in their synthesis or transport. The present study discloses three such Eceriferum (cer) genes in barley – Cer-c, Cer-q and Cer-u – known to be tightly linked and functioning in a biochemical pathway forming dominating amounts of β-diketone and hydroxy-β-diketones plus some esterified alkan-2-ols. These aliphatics are present in many Triticeae as well as dicotyledons such as Eucalyptus and Dianthus. Recently developed genomic resources and mapping populations in barley defined these genes to a small region on chromosome arm 2HS. Exploiting Cer-c and -u potential functions pinpointed five candidates, of which three were missing in apparent cer-cqu triple mutants. Sequencing more than 50 independent mutants for each gene confirmed their identification. Cer-c is a chalcone synthase-like polyketide synthase, designated diketone synthase (DKS), Cer-q is a lipase/carboxyl transferase and Cer-u is a P450 enzyme. All were highly expressed in pertinent leaf sheath tissue of wild type. A physical map revealed the order Cer-c, Cer-u, Cer-q with the flanking genes 101kb apart, confirming they are a gene cluster, Cer-cqu. Homology-based modeling suggests that many of the mutant alleles affect overall protein structure or specific active site residues. The rich diversity of identified mutations will facilitate future studies of three key enzymes involved in synthesis of plant apoplast waxes. PMID:26962211
Chandra, Saket; Kazmi, Andaleeb Z; Ahmed, Zainab; Roychowdhury, Gargi; Kumari, Veena; Kumar, Manish; Mukhopadhyay, Kunal
2017-07-01
NB-ARC domain-containing resistance genes from the wheat genome were identified, characterized and localized on chromosome arms that displayed differential yet positive response during incompatible and compatible leaf rust interactions. Wheat (Triticum aestivum L.) is an important cereal crop; however, its production is affected severely by numerous diseases including rusts. An efficient, cost-effective and ecologically viable approach to control pathogens is through host resistance. In wheat, high numbers of resistance loci are present but only a few have been identified and cloned. A comprehensive analysis of the NB-ARC-containing genes in the complete wheat genome was accomplished in this study. Complete NB-ARC encoding genes were mined from the Ensembl Plants database to predict 604 NB-ARC containing sequences using the HMM approach. Genome-wide analysis of orthologous clusters in the NB-ARC-containing sequences of wheat and other members of the Poaceae family revealed maximum homology with Oryza sativa indica and Brachypodium distachyon. The identification of overlap between orthologous clusters enabled the elucidation of the function and evolution of resistance proteins. The distributions of the NB-ARC domain-containing sequences were found to be balanced among the three wheat sub-genomes. Wheat chromosome arms 4AL and 7BL had the most NB-ARC domain-containing contigs. The spatio-temporal expression profiling studies exemplified the positive role of these genes in resistant and susceptible wheat plants during incompatible and compatible interaction in response to the leaf rust pathogen Puccinia triticina. Two NB-ARC domain-containing sequences were modelled in silico, cloned and sequenced to analyze their fine structures. The data obtained in this study will augment isolation, characterization and application of NB-ARC resistance genes in marker-assisted selection-based breeding programs for improving rust resistance in wheat.
Shi, Pibiao; Guy, Kateta Malangisha; Wu, Weifang; Fang, Bingsheng; Yang, Jinghua; Zhang, Mingfang; Hu, Zhongyuan
2016-04-12
The plant-specific TCP transcription factor family, which is involved in the regulation of cell growth and proliferation, performs diverse functions in multiple aspects of plant growth and development. However, no comprehensive analysis of the TCP family in watermelon (Citrullus lanatus) has been undertaken previously. A total of 27 watermelon TCP encoding genes distributed on nine chromosomes were identified. Phylogenetic analysis clustered the genes into 11 distinct subgroups. Furthermore, phylogenetic and structural analyses distinguished two homology classes within the ClTCP family, designated Class I and Class II. The Class II genes were differentiated into two subclasses, the CIN subclass and the CYC/TB1 subclass. The expression patterns of all members were determined by semi-quantitative PCR. The functions of two ClTCP genes, ClTCP14a and ClTCP15, in regulating plant height were confirmed by ectopic expression in Arabidopsis wild-type and ortholog mutants. This study represents the first genome-wide analysis of the watermelon TCP gene family, which provides valuable information for understanding the classification and functions of the TCP genes in watermelon.
Chai, Wenbo; Jiang, Pengfei; Huang, Guoyu; Jiang, Haiyang; Li, Xiaoyu
2017-10-01
The TCP family is a group of plant-specific transcription factors. TCP genes encode proteins harboring a bHLH structure, known as the TCP domain, which is implicated in DNA binding and protein-protein interactions. TCP genes play important roles in plant development and have been evolutionarily and functionally elaborated in various plants; however, no overall phylogenetic analysis or expression profiling of TCP genes in Zea mays has been reported. In the present study, a systematic analysis of molecular evolution and functional prediction of TCP family genes in maize (Z. mays L.) has been conducted. We performed a genome-wide survey of TCP genes in maize, revealing the gene structure, chromosomal location and phylogenetic relationship of family members. Microsynteny between grass species and tissue-specific expression profiles were also investigated. In total, 29 TCP genes were identified in the maize genome, unevenly distributed on the 10 maize chromosomes. Additionally, ZmTCP genes were categorized into nine classes based on phylogeny, and purifying selection may largely be responsible for maintaining the functions of maize TCP genes. Moreover, microsynteny analysis suggested that TCP genes have been conserved during evolution. Finally, expression analysis revealed that most TCP genes are expressed in the stem and ear, which suggests that ZmTCP genes influence stem and ear growth. This result is consistent with the previous finding that maize TCP genes repress the growth of axillary organs and enable the formation of female inflorescences. Altogether, this study presents a thorough overview of the TCP family in maize and provides a new perspective on the evolution of this gene family. The results also indicate that TCP family genes may be involved in developmental stages under plant growing conditions. Additionally, our results will be useful for further functional analysis of the TCP gene family in maize.
A Third Approach to Gene Prediction Suggests Thousands of Additional Human Transcribed Regions
Glusman, Gustavo; Qin, Shizhen; El-Gewely, M. Raafat; Siegel, Andrew F; Roach, Jared C; Hood, Leroy; Smit, Arian F. A
2006-01-01
The identification and characterization of the complete ensemble of genes is a main goal of deciphering the digital information stored in the human genome. Many algorithms for computational gene prediction have been described, ultimately derived from two basic concepts: (1) modeling gene structure and (2) recognizing sequence similarity. Successful hybrid methods combining these two concepts have also been developed. We present a third orthogonal approach to gene prediction, based on detecting the genomic signatures of transcription, accumulated over evolutionary time. We discuss four algorithms based on this third concept: Greens and CHOWDER, which quantify mutational strand biases caused by transcription-coupled DNA repair, and ROAST and PASTA, which are based on strand-specific selection against polyadenylation signals. We combined these algorithms into an integrated method called FEAST, which we used to predict the location and orientation of thousands of putative transcription units not overlapping known genes. Many of the newly predicted transcriptional units do not appear to code for proteins. The new algorithms are particularly apt at detecting genes with long introns and lacking sequence conservation. They therefore complement existing gene prediction methods and will help identify functional transcripts within many apparent “genomic deserts.” PMID:16543943
Cell type-selective disease-association of genes under high regulatory load.
Galhardo, Mafalda; Berninger, Philipp; Nguyen, Thanh-Phuong; Sauter, Thomas; Sinkkonen, Lasse
2015-10-15
We previously showed that disease-linked metabolic genes are often under combinatorial regulation. Using the genome-wide ChIP-Seq binding profiles for 93 transcription factors in nine different cell lines, we show that genes under high regulatory load are significantly enriched for disease-association across cell types. We find that transcription factor load correlates with the enhancer load of the genes and thereby allows the identification of genes under high regulatory load by epigenomic mapping of active enhancers. Identification of the high enhancer load genes across 139 samples from 96 different cell and tissue types reveals a consistent enrichment for disease-associated genes in a cell type-selective manner. The underlying genes are not limited to super-enhancer genes and show several types of disease-association evidence beyond genetic variation (such as biomarkers). Interestingly, the high regulatory load genes are involved in more KEGG pathways than expected by chance, exhibit increased betweenness centrality in the interaction network of liver disease genes, and carry longer 3' UTRs with more microRNA (miRNA) binding sites than genes on average, suggesting a role as hubs integrating signals within regulatory networks. In summary, epigenetic mapping of active enhancers presents a promising and unbiased approach for identification of novel disease genes in a cell type-selective manner. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
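As a rough illustration of the idea of regulatory load, the sketch below counts how many distinct transcription factors bind each gene and ranks genes by that count. It is a simplification under stated assumptions, not the authors' pipeline: the input is assumed to be a list of (factor, gene) pairs already derived from ChIP-Seq peaks, and the names `regulatory_load`, `high_load_genes` and the toy data are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple


def regulatory_load(bindings: Iterable[Tuple[str, str]]) -> Dict[str, int]:
    """Count the distinct transcription factors bound at each gene."""
    tfs_by_gene = defaultdict(set)
    for tf, gene in bindings:
        tfs_by_gene[gene].add(tf)  # a set ignores repeated peaks of one factor
    return {gene: len(tfs) for gene, tfs in tfs_by_gene.items()}


def high_load_genes(load: Dict[str, int], top_n: int) -> List[str]:
    """Return the top_n genes by load, ties broken alphabetically."""
    return sorted(load, key=lambda gene: (-load[gene], gene))[:top_n]


# Toy (factor, gene) pairs, invented for illustration only.
bindings = [
    ("TF1", "g1"), ("TF2", "g1"), ("TF1", "g2"), ("TF1", "g1"),
    ("TF3", "g1"), ("TF2", "g3"), ("TF3", "g3"),
]
load = regulatory_load(bindings)
top = high_load_genes(load, top_n=2)
print(load, top)
```

A fixed `top_n` cutoff is used here only for simplicity; the abstract does not state how the high-load threshold was chosen.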
Moody, Michael L; Rieseberg, Loren H
2012-07-01
The annual sunflowers (Helianthus sect. Helianthus) present a formidable challenge for phylogenetic inference because of ancient hybrid speciation, recent introgression, and suspected issues with deep coalescence. Here we analyze sequence data from 11 nuclear DNA (nDNA) genes for multiple genotypes of species within the section to (1) reconstruct the phylogeny of this group, (2) explore the utility of nDNA gene trees for detecting hybrid speciation and introgression; and (3) test an empirical method of hybrid identification based on the phylogenetic congruence of nDNA gene trees from tightly linked genes. We uncovered considerable topological heterogeneity among gene trees with or without three previously identified hybrid species included in the analyses, as well as a general lack of reciprocal monophyly of species. Nonetheless, partitioned Bayesian analyses provided strong support for the reciprocal monophyly of all species except H. annuus (0.89 PP), the most widespread and abundant annual sunflower. Previous hypotheses of relationships among taxa were generally strongly supported (1.0 PP), except among taxa typically associated with H. annuus, apparently due to the paraphyly of the latter in all gene trees. While the individual nDNA gene trees provided a useful means for detecting recent hybridization, identification of ancient hybridization was problematic for all ancient hybrid species, even when linkage was considered. We discuss biological factors that affect the efficacy of phylogenetic methods for hybrid identification.
Sakai, Kanae; Komaki, Hisayuki; Gonoi, Tohru
2015-01-01
Nocardithiocin is a thiopeptide compound isolated from the opportunistic pathogen Nocardia pseudobrasiliensis. It shows a strong activity against acid-fast bacteria and is also active against rifampicin-resistant Mycobacterium tuberculosis. Here, we report the identification of the nocardithiocin gene cluster in N. pseudobrasiliensis IFM 0761 based on conserved thiopeptide biosynthesis gene sequence and the whole genome sequence. The predicted gene cluster was confirmed by gene disruption and complementation. As expected, strains containing the disrupted gene did not produce nocardithiocin while gene complementation restored nocardithiocin production in these strains. The predicted cluster was further analyzed using RNA-seq which showed that the nocardithiocin gene cluster contains 12 genes within a 15.2-kb region. This finding will promote the improvement of nocardithiocin productivity and its derivatives production. PMID:26588225
rpoB Gene Sequencing for Identification of Corynebacterium Species
Khamis, Atieh; Raoult, Didier; La Scola, Bernard
2004-01-01
The genus Corynebacterium is a heterogeneous group of species comprising human and animal pathogens and environmental bacteria. It is defined on the basis of several phenotypic characters and the results of DNA-DNA relatedness and, more recently, 16S rRNA gene sequencing. However, the 16S rRNA gene is not polymorphic enough to ensure reliable phylogenetic studies and needs to be completely sequenced for accurate identification. The almost complete rpoB sequences of 56 Corynebacterium species were determined by both PCR and genome walking methods. In all cases the percent similarities between different species were lower than those observed by 16S rRNA gene sequencing, even for those species with degrees of high similarity. Several clusters supported by high bootstrap values were identified. In order to propose a method for strain identification which does not require sequencing of the complete rpoB sequence (approximately 3,500 bp), we identified an area with a high degree of polymorphism, bordered by conserved sequences that can be used as universal primers for PCR amplification and sequencing. The sequence of this fragment (434 to 452 bp) allows accurate species identification and may be used in the future for routine sequence-based identification of Corynebacterium species. PMID:15364970
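The percent similarity between two sequences, as compared above for rpoB and 16S rRNA, can be sketched as follows. The abstract does not say how similarity was computed, so this is one common convention rather than the authors' method: the two sequences are assumed to be already aligned to equal length, and columns containing a gap in either sequence are skipped. The function name `percent_identity` and the example sequences are hypothetical.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent identity over aligned, gap-free columns of two sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == "-" or y == "-":
            continue  # assumption: gapped columns are excluded
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


# Toy aligned fragments, invented for illustration only.
identity = percent_identity("ACGT-A", "ACCTTA")
print(identity)
```

Other conventions, such as counting gaps as mismatches, give lower values for the same alignment, so similarity figures are only comparable when the same rule is used throughout.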
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dichgans, M.; Mayer, M.; Straube, A.
1996-02-15
This article reports on new information regarding the genetic mapping of the human CADASIL gene region. Previously, the gene had been mapped to human chromosome 19q12. Using the identification of a chromosomal crossover, the region has been refined to an 8-cM interval. 11 refs., 2 figs., 1 tab.
Large Scale Single Nucleotide Polymorphism Study of PD Susceptibility
2005-03-01
identification of eight genetic loci in the familial PD, the results of intensive investigations of polymorphisms in dozens of genes related to sporadic, late...1) investigate the association between classical, sporadic PD and 2386 SNPs in 23 genes implicated in the pathogenesis of PD; (2) construct...addition, experiences derived from this study may be applied in other complex disorders for the identification of susceptibility genes, as well as in genome
Zou, Shanmei; Fei, Cong; Wang, Chun; Gao, Zhan; Bao, Yachao; He, Meilin; Wang, Changhai
2016-01-01
Microalgae identification is extremely difficult. The efficiency of DNA barcoding in microalgae identification depends on ideal gene markers and the approaches employed, which are still being established. Although Scenedesmus has been studied extensively for lipid production, its identification is difficult. Here we present a comprehensive coalescent, distance and character-based DNA barcoding for 118 Scenedesmus strains based on rbcL, tufA, ITS and 16S. The four genes, and their combined data rbcL + tufA + ITS + 16S, rbcL + tufA and ITS + 16S, were each analyzed by GMYC, P ID, PTP, ABGD, and character-based barcoding. It was apparent that the three combined gene data sets showed a higher proportion of resolution success than the single genes. In comparison, the GMYC and PTP analyses produced more taxonomic lineages. ABGD generated varying resolution in discrimination among the single and combined data. Character-based barcoding proved to be the most effective approach for species discrimination in both single and combined data, producing consistent species identification. The integrated results recovered 11 species, five of which were revealed as potential cryptic species. We suggest that character-based DNA barcoding together with other approaches based on multiple genes and their combined data could be more effective in revealing microalgae diversity. PMID:27827440
Identification and characterization of NF-YB family genes in tung tree.
Yang, Susu; Wang, Yangdong; Yin, Hengfu; Guo, Haobo; Gao, Ming; Zhu, Huiping; Chen, Yicun
2015-12-01
The NF-YB transcription factor gene family encodes a subunit of the CCAAT box-binding factor (CBF), a highly conserved trimeric activator that strongly binds to the CCAAT box promoter element. Studies on model plants have shown that NF-YB proteins participate in important developmental and physiological processes, but little is known about NF-YB proteins in trees. Here, we identified seven NF-YB transcription factor-encoding genes in Vernicia fordii, an important oilseed tree in China. A phylogenetic analysis separated the genes into two groups: non-LEC1 type (VfNF-YB1, 5, 7, 9, 11, 13) and LEC1-type (VfNF-YB14). A gene structure analysis showed that VfNF-YB5 has three introns and the other genes have no introns. The seven VfNF-YB sequences contain highly conserved domains, a disordered region at the N terminus, and two long helix structures at the C terminus. Phylogenetic analyses showed that VfNF-YB family genes are highly homologous to GmNF-YB genes, and many of them are closely related to functionally characterized NF-YBs. In expression analyses of various tissues (root, stem, leaf, and kernel) and the root during pathogen infection, VfNF-YB1, 5, and 11 were dominantly expressed in kernels, and VfNF-YB7 and 9 were expressed only in the root. Different VfNF-YB family genes showed different responses to pathogen infection, suggesting that they play different roles in the pathogen response. Together, these findings represent the first extensive evaluation of the NF-YB family in tung tree and provide a foundation for dissecting the functions of VfNF-YB genes in seed development, stress adaptation, fatty acid synthesis, and pathogen response.
Sotirova, V N; Rezaie, T M; Khoshsorour, M M; Sarfarazi, M
2000-03-01
Waardenburg syndrome Type I (WS1) is an autosomal dominant disorder that has previously been associated with mutations in the PAX3 gene on the 2q35 region. In this study, we used an Iranian WS1 family with seven affected individuals in three generations. The phenotypic characteristics of the family include sensorineural deafness, dystopia canthorum, hypopigmented skin patches of the upper limbs, congenital white forelock, confluent white eyebrows, nonpigmented iris, poliosis, and hypopigmentation of the retina. Herein, we report a previously unidentified single-base substitution in exon II (C→T at position 218) that results in a change of serine to leucine (S73L) in this family. This change was not observed in 100 chromosomes of healthy unrelated individuals. This mutation is within the PAX3 paired domain region, a structure that is highly conserved and implicated in DNA binding. This is the first identification of a PAX3 mutation for this phenotype in the Iranian population. This also provides additional confirmation for the involvement of this gene in the etiology of WS1.
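The correspondence between nucleotide position 218 and residue 73 can be checked with a short sketch. It assumes that position 218 is counted from the first base of the start codon in the coding sequence, which the abstract does not state explicitly. The abstract also does not give the affected codon, so the code only lists which serine codons in the standard genetic code become leucine when their second base changes from C to T. The function names are hypothetical.

```python
from typing import Tuple

# Subset of the standard genetic code needed for this check.
CODON_TABLE = {
    "TCT": "S", "TCC": "S", "TCA": "S", "TCG": "S",
    "TTT": "F", "TTC": "F", "TTA": "L", "TTG": "L",
}


def codon_of(nt_pos: int) -> Tuple[int, int]:
    """Map a 1-based coding-sequence position to (codon number, base in codon)."""
    return (nt_pos - 1) // 3 + 1, (nt_pos - 1) % 3 + 1


def mutate(codon: str, offset: int, new_base: str) -> str:
    """Replace the base at the 1-based offset within a codon."""
    return codon[: offset - 1] + new_base + codon[offset:]


codon_number, base_in_codon = codon_of(218)

# Serine codons whose C-to-T change at that base yields leucine.
ser_to_leu = sorted(
    codon
    for codon, aa in CODON_TABLE.items()
    if aa == "S"
    and codon[base_in_codon - 1] == "C"
    and CODON_TABLE[mutate(codon, base_in_codon, "T")] == "L"
)
print(codon_number, base_in_codon, ser_to_leu)
```

Under that numbering assumption, position 218 is the second base of codon 73, which is consistent with the reported S73L change.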
Tippett, Lynette J; Waldvogel, Henry J; Snell, Russell G; Vonsattel, Jean-Paul; Young, Anne B; Faull, Richard L M
2017-01-01
Huntington's disease (HD) is an autosomal dominant neurodegenerative disorder characterised by extensive neuronal loss in the striatum and cerebral cortex, and a triad of clinical symptoms affecting motor, cognitive/behavioural and mood functioning. The mutation causing HD is an expansion of a CAG tract in exon 1 of the HTT gene. This chapter provides a multifaceted overview of the clinical complexity of HD. We explore recent directions in molecular genetics including the identification of loci that are genetic modifiers of HD that could potentially reveal therapeutic targets beyond the HTT gene transcript and protein. The variability of clinical symptomatology in HD is considered alongside recent findings of variability in cellular and neurochemical changes in the striatum and cerebral cortex in human brain. We review evidence from structural neuroimaging methods of progressive changes of striatum, cerebral cortex and white matter in pre-symptomatic and symptomatic HD, with a particular focus on the potential identification of neuroimaging biomarkers that could be used to test promising disease-specific and modifying treatments. Finally we provide an overview of completed clinical trials in HD and future therapeutic developments.
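To make the notion of a CAG tract concrete, the sketch below measures the longest uninterrupted run of CAG repeats in a DNA string. It is a toy illustration, not a genotyping method: the sequence is invented, the function name `longest_cag_tract` is hypothetical, and real repeat sizing has to handle interruptions and sequencing error.

```python
import re


def longest_cag_tract(seq: str) -> int:
    """Return the number of repeats in the longest uninterrupted CAG run."""
    runs = re.findall(r"(?:CAG)+", seq.upper())
    # Each run is a whole number of CAG units, so length // 3 is the repeat count.
    return max((len(run) // 3 for run in runs), default=0)


# Toy sequence, invented for illustration only.
toy = "ATG" + "CAG" * 5 + "CCG" + "CAG" * 2
repeats = longest_cag_tract(toy)
print(repeats)
```

The regular expression scans left to right for non-overlapping runs, so a run interrupted by any other triplet is counted as two separate tracts.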
Cheng, Y; Yao, Z P; Ruan, M Y; Ye, Q J; Wang, R Q; Zhou, G Z; Luo, J
2016-09-23
The WRKY family is one of the most important transcription factor families in plants, involved in the regulation of a broad range of biological roles. The recent releases of whole-genome sequences of pepper (Capsicum annuum L.) allow us to perform a genome-wide identification and characterization of the WRKY family. In this study, 61 CaWRKY proteins were identified in the pepper genome. Based on protein structural and phylogenetic analyses, these proteins were classified into four main groups (I, II, III, and NG), and Group II was further divided into five subgroups (IIa to IIe). Chromosome mapping analysis indicated that CaWRKY genes are distributed across all 12 chromosomes, although the location of four CaWRKYs (CaWRKY58-CaWRKY61) could not be identified. Two pairs of CaWRKYs located on chromosome 01 appear to be tandem duplications. Furthermore, the phylogenetic tree showed a close evolutionary relationship of WRKYs in three species from Solanaceae. In conclusion, this comprehensive analysis of CaWRKYs will provide rich resources for further functional studies in pepper.
Comprehensive cellular‐resolution atlas of the adult human brain
Royall, Joshua J.; Sunkin, Susan M.; Ng, Lydia; Facer, Benjamin A.C.; Lesnar, Phil; Guillozet‐Bongaarts, Angie; McMurray, Bergen; Szafer, Aaron; Dolbeare, Tim A.; Stevens, Allison; Tirrell, Lee; Benner, Thomas; Caldejon, Shiella; Dalley, Rachel A.; Dee, Nick; Lau, Christopher; Nyhus, Julie; Reding, Melissa; Riley, Zackery L.; Sandman, David; Shen, Elaine; van der Kouwe, Andre; Varjabedian, Ani; Write, Michelle; Zollei, Lilla; Dang, Chinh; Knowles, James A.; Koch, Christof; Phillips, John W.; Sestan, Nenad; Wohnoutka, Paul; Zielke, H. Ronald; Hohmann, John G.; Jones, Allan R.; Bernard, Amy; Hawrylycz, Michael J.; Hof, Patrick R.; Fischl, Bruce
2016-01-01
ABSTRACT Detailed anatomical understanding of the human brain is essential for unraveling its functional architecture, yet current reference atlases have major limitations such as lack of whole‐brain coverage, relatively low image resolution, and sparse structural annotation. We present the first digital human brain atlas to incorporate neuroimaging, high‐resolution histology, and chemoarchitecture across a complete adult female brain, consisting of magnetic resonance imaging (MRI), diffusion‐weighted imaging (DWI), and 1,356 large‐format cellular resolution (1 µm/pixel) Nissl and immunohistochemistry anatomical plates. The atlas is comprehensively annotated for 862 structures, including 117 white matter tracts and several novel cyto‐ and chemoarchitecturally defined structures, and these annotations were transferred onto the matching MRI dataset. Neocortical delineations were done for sulci, gyri, and modified Brodmann areas to link macroscopic anatomical and microscopic cytoarchitectural parcellations. Correlated neuroimaging and histological structural delineation allowed fine feature identification in MRI data and subsequent structural identification in MRI data from other brains. This interactive online digital atlas is integrated with existing Allen Institute for Brain Science gene expression atlases and is publicly accessible as a resource for the neuroscience community. J. Comp. Neurol. 524:3127–3481, 2016. © 2016 The Authors The Journal of Comparative Neurology Published by Wiley Periodicals, Inc. PMID:27418273
Król, Jaroslaw; Bania, Jacek; Florek, Magdalena; Pliszczak-Król, Aleksandra; Staroniewicz, Zdzislaw
2011-05-01
A set of polymerase chain reaction (PCR) assays for identification of the most important Pasteurellaceae species encountered in cats and dogs was developed. Primers for Pasteurella multocida were designed to detect a fragment of kmt, a gene encoding the outer-membrane protein. Primers specific to Pasteurella canis, Pasteurella dagmatis, and Pasteurella stomatis were based on the manganese-dependent superoxide dismutase gene (sodA) and those specific to [Haemophilus] haemoglobinophilus on species-specific sequences of the 16S ribosomal RNA gene. All the primers were tested on respective reference and control strains and applied to the identification of 47 canine and feline field isolates of Pasteurellaceae. The PCR assays were shown to be species specific, providing a valuable supplement to phenotypic identification of species within this group of bacteria. © 2011 The Author(s)
Ríos, Gabino; Naranjo, Miguel A; Iglesias, Domingo J; Ruiz-Rivero, Omar; Geraud, Marion; Usach, Antonio; Talón, Manuel
2008-01-01
Background Many fruit-tree species, including relevant Citrus spp. varieties, exhibit a reproductive biology that impairs breeding and strongly constrains genetic improvements. In citrus, juvenility increases the generation time while sexual sterility, inbreeding depression and self-incompatibility prevent the production of homozygous cultivars. Genomic technology may provide citrus researchers with a new set of tools to address these various restrictions. In this work, we report a valuable genomics-based protocol for the structural analysis of deletion mutations on a heterozygous background. Results Two independent fast neutron mutants of self-incompatible clementine (Citrus clementina Hort. Ex Tan. cv. Clemenules) were the subject of the study. Both mutants, named 39B3 and 39E7, were expected to carry DNA deletions in hemizygous dosage. Array-based Comparative Genomic Hybridization (array-CGH) using a Citrus cDNA microarray allowed the identification of underrepresented genes in these two mutants. Subsequent comparison of citrus deleted genes with annotated plant genomes, especially poplar, made it possible to predict the presence of a large deletion in 39B3 of about 700 kb and at least two deletions of approximately 100 and 500 kb in 39E7. The deletion in 39B3 was further characterized by PCR on available Citrus BACs, which helped us to build a partial physical map of the deletion. Among the deleted genes, a ClpC-like gene coding for a putative subunit of a multifunctional chloroplastic protease involved in the regulation of chlorophyll b synthesis was directly related to the mutated phenotype, since the mutant showed a reduced chlorophyll a/b ratio in green tissues. Conclusion In this work, we report the use of array-CGH for the successful identification of genes included in a hemizygous deletion induced by fast neutron irradiation on Citrus clementina.
The study of gene content and order within the 39B3 deletion also led to the unexpected conclusion that microsynteny and local gene colinearity in this species were higher with Populus trichocarpa than with the phylogenetically closer Arabidopsis thaliana. This work corroborates the potential of Citrus genomic resources to assist mutagenesis-based approaches for functional genetics, structural studies and comparative genomics, and hence to facilitate citrus variety improvement. PMID:18691431
Gao, Liangliang; Turner, M. Kathryn; Chao, Shiaoman; Kolmer, James; Anderson, James A.
2016-01-01
Leaf rust is an important disease, threatening wheat production annually. Identification of resistance genes or QTLs for effective field resistance could greatly enhance our ability to breed durably resistant varieties. We applied a genome wide association study (GWAS) approach to identify resistance genes or QTLs in 338 spring wheat breeding lines from public and private sectors that were predominately developed in the Americas. A total of 46 QTLs were identified for field and seedling traits and approximately 20–30 confer field resistance in varying degrees. The 10 QTLs accounting for the most variation in field resistance explained 26–30% of the total variation (depending on traits: percent severity, coefficient of infection or response type). Similarly, the 10 QTLs accounting for most of the variation in seedling resistance to different races explained 24–34% of the variation, after correcting for population structure. Two potentially novel QTLs (QLr.umn-1AL, QLr.umn-4AS) were identified. Identification of novel genes or QTLs and validation of previously identified genes or QTLs for seedling and especially adult plant resistance will enhance understanding of leaf rust resistance and assist breeding for resistant wheat varieties. We also developed computer programs to automate field and seedling rust phenotype data conversions. This is the first GWAS study of leaf rust resistance in elite wheat breeding lines genotyped with high density 90K SNP arrays. PMID:26849364
LIM-domain proteins, LIMD1, Ajuba, and WTIP are required for microRNA-mediated gene silencing
James, Victoria; Zhang, Yining; Foxler, Daniel E.; de Moor, Cornelia H.; Kong, Yi Wen; Webb, Thomas M.; Self, Tim J.; Feng, Yungfeng; Lagos, Dimitrios; Chu, Chia-Ying; Rana, Tariq M.; Morley, Simon J.; Longmore, Gregory D.; Bushell, Martin; Sharp, Tyson V.
2010-01-01
In recent years there have been major advances with respect to the identification of the protein components and mechanisms of microRNA (miRNA) mediated silencing. However, the complete and precise repertoire of components and mechanism(s) of action remain to be fully elucidated. Herein we reveal the identification of a family of three LIM domain-containing proteins, LIMD1, Ajuba and WTIP (Ajuba LIM proteins) as novel mammalian processing body (P-body) components, which highlight a novel mechanism of miRNA-mediated gene silencing. Furthermore, we reveal that LIMD1, Ajuba, and WTIP bind to Ago1/2, RCK, Dcp2, and eIF4E in vivo, that they are required for miRNA-mediated, but not siRNA-mediated gene silencing and that all three proteins bind to the mRNA 5′ m7GTP cap–protein complex. Mechanistically, we propose the Ajuba LIM proteins interact with the m7GTP cap structure via a specific interaction with eIF4E that prevents 4EBP1 and eIF4G interaction. In addition, these LIM-domain proteins facilitate miRNA-mediated gene silencing by acting as an essential molecular link between the translationally inhibited eIF4E-m7GTP-5′cap and Ago1/2 within the miRISC complex attached to the 3′-UTR of mRNA, creating an inhibitory closed-loop complex. PMID:20616046
Dubovenko, Alexey; Nikolsky, Yuri; Rakhmatulin, Eugene; Nikolskaya, Tatiana
2017-01-01
Analysis of NGS and other sequencing data, gene variants, gene expression, proteomics, and other high-throughput (OMICs) data is challenging because of its biological complexity and high level of technical and biological noise. One way to deal with both problems is to perform analysis with a high fidelity annotated knowledgebase of protein interactions, pathways, and functional ontologies. This knowledgebase has to be structured in a computer-readable format and must include software tools for managing experimental data, analysis, and reporting. Here, we present MetaCore™ and Key Pathway Advisor (KPA), an integrated platform for functional data analysis. On the content side, MetaCore and KPA encompass a comprehensive database of molecular interactions of different types, pathways, network models, and ten functional ontologies covering human, mouse, and rat genes. The analytical toolkit includes tools for gene/protein list enrichment analysis, statistical "interactome" tool for the identification of over- and under-connected proteins in the dataset, and a biological network analysis module made up of network generation algorithms and filters. The suite also features Advanced Search, an application for combinatorial search of the database content, as well as a Java-based tool called Pathway Map Creator for drawing and editing custom pathway maps. Applications of MetaCore and KPA include molecular mode of action of disease research, identification of potential biomarkers and drug targets, pathway hypothesis generation, analysis of biological effects for novel small molecule compounds and clinical applications (analysis of large cohorts of patients, and translational and personalized medicine).
Zhou, Yong; Hu, Lifang; Wu, Hao; Jiang, Lunwei
2017-01-01
Superoxide dismutase (SOD) proteins are widely present in the plant kingdom and play important roles in different biological processes. However, little is known about the SOD genes in cucumber. In this study, nine SOD genes were identified from cucumber (Cucumis sativus) using bioinformatics-based methods, including 5 Cu/ZnSODs, 3 FeSODs, and 1 MnSOD. Gene structure and motif analysis indicated that most of the SOD genes have relatively conserved exon/intron arrangement and motif composition. Phylogenetic analyses with SODs from cucumber and several other species revealed that these SOD proteins can be traced back to two ancestral SODs before the divergence of monocot and dicot plants. Many cis-elements related to stress responses and plant hormones were found in the promoter sequence of each CsSOD gene. Gene expression analysis revealed that most of the CsSOD genes are expressed in almost all the tested tissues. qRT-PCR analysis of 8 selected CsSOD genes showed that these genes could respond to heat, cold, osmotic, and salt stresses. Our results provide a basis for further functional research on the SOD gene family in cucumber and facilitate their potential applications in the genetic improvement of cucumber. PMID:28808654
Han, Yahui; Ding, Ting; Su, Bo; Jiang, Haiyang
2016-01-01
Members of the chalcone synthase (CHS) family participate in the synthesis of a series of secondary metabolites in plants, fungi and bacteria. The metabolites play important roles in protecting land plants against various environmental stresses during the evolutionary process. We conducted a comprehensive investigation of CHS genes in maize (Zea mays L.), including their phylogenetic relationships, gene structures, chromosomal locations and expression patterns. Fourteen CHS genes (ZmCHS01–14) were identified in the genome of maize, representing one of the largest numbers of CHS family members identified in one organism to date. The gene family was classified into four major classes (classes I–IV) based on their phylogenetic relationships. Most of them contained two exons and one intron. The 14 genes were unevenly located on six chromosomes. Two segmental duplication events were identified, which might contribute to the expansion of the maize CHS gene family to some extent. In addition, quantitative real-time PCR and microarray data analyses suggested that ZmCHS genes exhibited various expression patterns, indicating functional diversification of the ZmCHS genes. Our results will contribute to future studies of the complexity of the CHS gene family in maize and provide valuable information for the systematic analysis of the functions of the CHS gene family. PMID:26828478
Benítez-Burraco, A
FOXP2 is the first gene linked to a hereditary variant of specific language impairment and seems to code for a transcriptional repressor that intervenes in the regulation of the development and the functioning of certain thalamic-cortical-striatal circuits. In the last three years, significant progress has been made in the determination of the structural and functional properties of the gene. These advances essentially have to do with the precise analysis of the most important structural motifs of the protein that it codes for and the main parameters that determine its interaction with DNA. They also concern the determination of the functional and behavioural properties in vivo of the main isoforms of the FOXP2 protein, the exact determination of the pattern of expression of new orthologues of the gene, and the identification of the different target genes of the FOXP2 factor. This new evidence suggests that the FOXP2 protein has a high degree of versatility in vivo when it comes to binding to DNA; that its different isoforms are biologically functional; and that the FOXP2 gene is functional during embryonic development and during the adult phase. It also suggests that the gene is involved in the development and/or functioning of the thalamic-cortical-striatal circuits associated with motor planning, sequential behaviour and procedural learning (a significant saving in developmental terms of the regulatory mechanism in which the gene is involved), and it supports the accuracy of the models of linguistic processing that consider language to be, to a large extent, the result of an interaction between certain cortical and subcortical structures.
Przybylski, Cédric; Benito, Juan M; Bonnet, Véronique; Mellet, Carmen Ortiz; García Fernández, José M
2016-12-15
Polycationic carbohydrates represent an attractive class of biomolecules for several applications, particularly as non-viral gene delivery vectors. In this case, establishing structure-biological activity relationships requires sensitive and accurate characterization tools to both control and achieve fine structural deciphering. Electrospray-tandem mass spectrometry (ESI-MS/MS) appears to be a suitable approach to address these questions. In this study, we investigated the usefulness of electron transfer dissociation (ETD) for obtaining structural data on five polycationic carbohydrates shown to be promising gene delivery agents. Particular attention was paid to determining the influence of charge state, fluoranthene reaction time and supplementary activation (SA) on the production of charge-reduced species and on the fragmentation yield, which varied from 2 to 62%, and to obtaining the highest diversity and intensity of fragments for each charge state and targeted compound. ETD fragmentation appeared to be mainly directed toward the pendant groups rather than the carbohydrate cyclic scaffold, leading to partial sequencing of the building blocks when amino groups are close to the carbohydrate core, but allowing complete structural deciphering of some of them, such as those including a dithioureidocysteaminyl group, which was not possible with CID only. Such findings clearly highlight the potential to help the rational choice of suitable analytical conditions, according to the nature of the gene delivery molecules exhibiting polycationic features. Moreover, our ETD-MS/MS approach opens the way to fine sequencing/identification of grafted groups carried on various sets of oligo-/polysaccharides in various fields such as glycobiology or nanomaterials, even with unknown or questionable extraction, synthesis or modification steps. Copyright © 2016 Elsevier B.V. All rights reserved.
Ivaskevicius, Vytautas; Biswas, Arijit; Bevans, Carville; Schroeder, Verena; Kohler, Hans Peter; Rott, Hannelore; Halimeh, Susan; Petrides, Petro E.; Lenk, Harald; Krause, Manuele; Miterski, Bruno; Harbrecht, Ursula; Oldenburg, Johannes
2010-01-01
Background Severe hereditary coagulation factor XIII deficiency is a rare homozygous bleeding disorder affecting one person in every two million individuals. In contrast, heterozygous factor XIII deficiency is more common, but usually not associated with severe hemorrhage such as intracranial bleeding or hemarthrosis. In most cases, the disease is caused by F13A gene mutations. Causative mutations associated with the F13B gene are rarer. Design and Methods We analyzed ten index patients and three relatives for factor XIII activity using a photometric assay and sequenced their F13A and F13B genes. Additionally, structural analysis of the wild-type protein structure from a previously reported X-ray crystallographic model identified potential structural and functional effects of the missense mutations. Results All individuals except one were heterozygous for factor XIIIA mutations (average factor XIII activity 51%), while the remaining homozygous individual was found to have severe factor XIII deficiency (<5% of normal factor XIII activity). Eight of the 12 heterozygous patients exhibited a bleeding tendency upon provocation. Conclusions The identified missense (Pro289Arg, Arg611His, Asp668Gly) and nonsense (Gly390X, Trp664X) mutations are causative for factor XIII deficiency. A Gly592Ser variant identified in three unrelated index patients, as well as in 200 healthy controls (minor allele frequency 0.005), and two further Tyr167Cys and Arg540Gln variants, represent possible candidates for rare F13A gene polymorphisms since they apparently do not have a significant influence on the structure of the factor XIIIA protein. Future in vitro expression studies of the factor XIII mutations are required to confirm their pathological mechanisms. PMID:20179087
Ivaskevicius, Vytautas; Biswas, Arijit; Bevans, Carville; Schroeder, Verena; Kohler, Hans Peter; Rott, Hannelore; Halimeh, Susan; Petrides, Petro E; Lenk, Harald; Krause, Manuele; Miterski, Bruno; Harbrecht, Ursula; Oldenburg, Johannes
2010-06-01
Severe hereditary coagulation factor XIII deficiency is a rare homozygous bleeding disorder affecting one person in every two million individuals. In contrast, heterozygous factor XIII deficiency is more common, but usually not associated with severe hemorrhage such as intracranial bleeding or hemarthrosis. In most cases, the disease is caused by F13A gene mutations. Causative mutations associated with the F13B gene are rarer. We analyzed ten index patients and three relatives for factor XIII activity using a photometric assay and sequenced their F13A and F13B genes. Additionally, structural analysis of the wild-type protein structure from a previously reported X-ray crystallographic model identified potential structural and functional effects of the missense mutations. All individuals except one were heterozygous for factor XIIIA mutations (average factor XIII activity 51%), while the remaining homozygous individual was found to have severe factor XIII deficiency (<5% of normal factor XIII activity). Eight of the 12 heterozygous patients exhibited a bleeding tendency upon provocation. The identified missense (Pro289Arg, Arg611His, Asp668Gly) and nonsense (Gly390X, Trp664X) mutations are causative for factor XIII deficiency. A Gly592Ser variant identified in three unrelated index patients, as well as in 200 healthy controls (minor allele frequency 0.005), and two further Tyr167Cys and Arg540Gln variants, represent possible candidates for rare F13A gene polymorphisms since they apparently do not have a significant influence on the structure of the factor XIIIA protein. Future in vitro expression studies of the factor XIII mutations are required to confirm their pathological mechanisms.
Genome-wide identification and characterization of WRKY gene family in Salix suchowensis.
Bi, Changwei; Xu, Yiqing; Ye, Qiaolin; Yin, Tongming; Ye, Ning
2016-01-01
WRKY proteins are zinc finger transcription factors that were first identified in plants. They can specifically interact with the W-box, which can be found in the promoter region of a large number of plant target genes, to regulate the expression of downstream target genes. They also participate in diverse physiological and growth processes in plants. Prior to this study, many WRKY genes had been identified and characterized in herbaceous species, but there was no large-scale study of WRKY genes in willow. With the whole genome sequencing of Salix suchowensis, we have the opportunity to conduct genome-wide research on the willow WRKY gene family. In this study, we identified 85 WRKY genes in the willow genome and renamed them from SsWRKY1 to SsWRKY85 on the basis of their specific distributions on chromosomes. Due to their diverse structural features, the 85 willow WRKY genes could be further classified into three main groups (group I-III), with five subgroups (IIa-IIe) in group II. With multiple sequence alignment and manual search, we found three variations of the WRKYGQK heptapeptide: WRKYGRK, WKKYGQK and WRKYGKK, and four variations of the normal zinc finger motif, which might execute some new biological functions. In addition, the SsWRKY genes from the same subgroup share similar exon-intron structures and conserved motif domains. Further studies of SsWRKY genes revealed that segmental duplication events (SDs) played a more prominent role in the expansion of SsWRKY genes. Expression profiles of SsWRKY genes from RNA sequencing data revealed diverse expression patterns among five tissues, including tender roots, young leaves, vegetative buds, non-lignified stems and barks.
These analyses of the WRKY gene family in willow not only help to complete the functional and annotation information of the WRKY gene family in woody plants, but also provide important references for investigating the expansion and evolution of this gene family in flowering plants.
Genome-wide identification and characterization of WRKY gene family in Salix suchowensis
Ye, Qiaolin; Yin, Tongming
2016-01-01
WRKY proteins are zinc finger transcription factors that were first identified in plants. They can specifically interact with the W-box, which can be found in the promoter region of a large number of plant target genes, to regulate the expression of downstream target genes. They also participate in diverse physiological and growth processes in plants. Prior to this study, many WRKY genes had been identified and characterized in herbaceous species, but there was no large-scale study of WRKY genes in willow. With the whole genome sequencing of Salix suchowensis, we have the opportunity to conduct genome-wide research on the willow WRKY gene family. In this study, we identified 85 WRKY genes in the willow genome and renamed them from SsWRKY1 to SsWRKY85 on the basis of their specific distributions on chromosomes. Due to their diverse structural features, the 85 willow WRKY genes could be further classified into three main groups (group I–III), with five subgroups (IIa–IIe) in group II. With multiple sequence alignment and manual search, we found three variations of the WRKYGQK heptapeptide: WRKYGRK, WKKYGQK and WRKYGKK, and four variations of the normal zinc finger motif, which might execute some new biological functions. In addition, the SsWRKY genes from the same subgroup share similar exon–intron structures and conserved motif domains. Further studies of SsWRKY genes revealed that segmental duplication events (SDs) played a more prominent role in the expansion of SsWRKY genes. Expression profiles of SsWRKY genes from RNA sequencing data revealed diverse expression patterns among five tissues, including tender roots, young leaves, vegetative buds, non-lignified stems and barks.
These analyses of the WRKY gene family in willow not only help to complete the functional and annotation information of the WRKY gene family in woody plants, but also provide important references for investigating the expansion and evolution of this gene family in flowering plants. PMID:27651997
2013-01-01
Background Cytokinins (CKs) have significant roles in various aspects of plant growth and development, and they are also involved in plant stress adaptations. The fine-tuning of the controlled CK levels in individual tissues, cells, and organelles is properly maintained by isopentenyl transferases (IPTs) and cytokinin oxidase/dehydrogenases (CKXs). Chinese cabbage is one of the most economically important vegetable crops worldwide. The whole genome sequencing of Brassica rapa enables us to perform the genome-wide identification and functional analysis of the IPT and CKX gene families. Results In this study, a total of 13 BrIPT genes and 12 BrCKX genes were identified. The gene structures, conserved domains and phylogenetic relationships were analyzed. The isoelectric point, subcellular localization and glycosylation sites of the proteins were predicted. Segmental duplicates were found in both BrIPT and BrCKX gene families. We also analyzed evolutionary patterns and divergence of the IPT and CKX genes in the Cruciferae family. The transcription levels of BrIPT and BrCKX genes were analyzed to obtain an initial picture of the functions of these genes. Abiotic stress elements related to adverse environmental stimuli were found in the promoter regions of BrIPT and BrCKX genes and they were confirmed to respond to drought and high salinity conditions. The effects of 6-BA and ABA on the expressions of BrIPT and BrCKX genes were also investigated. Conclusions The expansion of BrIPT and BrCKX genes after speciation from Arabidopsis thaliana is mainly attributed to segmental duplication events during the whole genome triplication (WGT), and a substantial number of duplicated genes were lost during the long evolutionary history. Genes produced by segmental duplication events have changed their expression patterns or may have adopted new functions, and have thus been retained.
BrIPT and BrCKX genes respond well to drought and high salinity stresses, and their transcripts are affected by exogenous hormones, such as 6-BA and ABA, suggesting their potential roles in abiotic stress conditions and regulatory mechanisms of plant hormone homeostasis. The appropriate modulation of endogenous CKs levels by IPT and CKX genes is a promising approach for developing economically important high-yielding and high-quality stress-tolerant crops in agriculture. PMID:24001366
Construction and analysis of gene-gene dynamics influence networks based on a Boolean model.
Mazaya, Maulida; Trinh, Hung-Cuong; Kwon, Yung-Keun
2017-12-21
Identification of novel gene-gene relations is a crucial issue to understand system-level biological phenomena. To this end, many methods based on a correlation analysis of gene expressions or structural analysis of molecular interaction networks have been proposed. They have a limitation in identifying more complicated gene-gene dynamical relations, though. To overcome this limitation, we proposed a measure to quantify a gene-gene dynamical influence (GDI) using a Boolean network model and constructed a GDI network to indicate existence of a dynamical influence for every ordered pair of genes. It represents how much a state trajectory of a target gene is changed by a knockout mutation subject to a source gene in a gene-gene molecular interaction (GMI) network. Through a topological comparison between GDI and GMI networks, we observed that the former network is denser than the latter network, which implies that there exist many gene pairs of dynamically influencing but molecularly non-interacting relations. In addition, a larger number of hub genes were generated in the GDI network. On the other hand, there was a correlation between these networks such that the degree value of a node was positively correlated to each other. We further investigated the relationships of the GDI value with structural properties and found that there are negative and positive correlations with the length of a shortest path and the number of paths, respectively. In addition, a GDI network could predict a set of genes whose steady-state expression is affected in E. coli gene-knockout experiments. More interestingly, we found that the drug-targets with side-effects have a larger number of outgoing links than the other genes in the GDI network, which implies that they are more likely to influence the dynamics of other genes. 
Finally, we found biological evidence showing that gene pairs which are not molecularly interacting but are dynamically influential can be considered as novel gene-gene relationships. Taken together, construction and analysis of the GDI network can be a useful approach to identify novel gene-gene relationships in terms of the dynamical influence.
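The idea of measuring how a knockout of a source gene changes the state trajectory of a target gene can be sketched on a toy Boolean network. The code below is a simplified illustration, not the GDI measure defined by the authors: the three-gene network, the synchronous update scheme, the trajectory length, and the function names `simulate` and `influence` are all assumptions made for this example. Here the influence is taken as the fraction of (initial state, time step) pairs at which the target gene differs between the wild-type and knockout trajectories.

```python
from itertools import product

# Hypothetical toy network (not from the paper): each rule maps the current
# state (dict of gene -> bool) to that gene's next value.
RULES = {
    "A": lambda s: s["A"],                 # A keeps its own value
    "B": lambda s: s["A"],                 # B copies A
    "C": lambda s: s["B"] and not s["C"],  # C is driven by B, inhibited by itself
}

def simulate(rules, init, steps, knockout=None):
    """Synchronous update; a knocked-out gene is held at False throughout."""
    state = dict(init)
    if knockout is not None:
        state[knockout] = False
    trajectory = [state]
    for _ in range(steps):
        nxt = {gene: bool(rule(state)) for gene, rule in rules.items()}
        if knockout is not None:
            nxt[knockout] = False
        state = nxt
        trajectory.append(state)
    return trajectory

def influence(rules, source, target, steps=5):
    """Fraction of (initial state, time step) pairs where the target gene
    differs between the wild-type and source-knockout trajectories."""
    genes = sorted(rules)
    differing = total = 0
    for bits in product([False, True], repeat=len(genes)):
        init = dict(zip(genes, bits))
        wild_type = simulate(rules, init, steps)
        mutant = simulate(rules, init, steps, knockout=source)
        for wt_state, ko_state in zip(wild_type, mutant):
            total += 1
            differing += wt_state[target] != ko_state[target]
    return differing / total

# Influence for every ordered pair of distinct genes in the toy network.
for src in RULES:
    for tgt in RULES:
        if src != tgt:
            print(f"{src} -> {tgt}: {influence(RULES, src, tgt):.3f}")
```

In this toy network, A has no direct link to C, yet knocking out A still changes C's trajectory through B. This mirrors the abstract's observation that gene pairs can be dynamically influential without interacting molecularly.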
Ayeni, Funmilola A; Andersen, Camilla; Nørskov-Lauritsen, Niels
2017-04-01
Mannitol salt agar (MSA) is often used in resource-limited laboratories for identification of S. aureus; however, coagulase-negative staphylococci (CoNS) also grow and ferment mannitol on MSA. A total of 171 CoNS strains that had previously been misidentified as S. aureus due to growth on MSA were collected from different locations in Nigeria, and two methods for identification of CoNS, ViTEK 2 and MALDI-TOF MS, were compared, with partial 16S rRNA gene sequencing as the gold standard. Partial tuf gene sequencing was used to resolve contradicting identifications. All 171 strains (13 species) grew on MSA and fermented mannitol. All tested strains of S. epidermidis, S. haemolyticus, S. nepalensis, S. pasteuri, S. sciuri, S. warneri, S. xylosus and S. capitis were correctly identified by MALDI-TOF, while variable identification was observed in S. saprophyticus and S. cohnii (90%, 81%). There was low identification of S. arlettae (14%), while all strains of S. kloosii and S. gallinarum were misidentified. S. gallinarum was absent from the MALDI-TOF database at the time of this study. All tested strains of S. epidermidis, S. gallinarum, S. haemolyticus, S. sciuri, S. warneri, S. xylosus and S. capitis were correctly identified by ViTEK, while variable identification was observed in S. saprophyticus, S. arlettae, S. cohnii and S. kloosii (84%, 86%, 75%, 60%), and S. nepalensis and S. pasteuri were misidentified. Partial sequencing of the 16S rRNA gene was used as the gold standard for most strains except S. capitis and S. xylosus, which were misidentified by partial 16S rRNA sequencing, contrary to the MALDI-TOF and ViTEK identifications; tuf gene sequencing was used for their correct identification. The characteristic growth of CoNS on MSA is identical to that of S. aureus, and therefore MSA could not differentiate between S. aureus and CoNS. The percentage accuracy of ViTEK was better than that of MALDI-TOF in identification of CoNS.
Although partial sequencing of 16S rRNA gene was used as gold standard in this study, it could not correctly identify S. capitis and S. xylosus. Copyright © 2017 Elsevier Ltd. All rights reserved.
The standard operating procedure of the DOE-JGI Microbial Genome Annotation Pipeline (MGAP v.4)
Huntemann, Marcel; Ivanova, Natalia N.; Mavromatis, Konstantinos; ...
2015-10-26
The DOE-JGI Microbial Genome Annotation Pipeline performs structural and functional annotation of microbial genomes that are further included into the Integrated Microbial Genome comparative analysis system. MGAP is applied to assembled nucleotide sequence datasets that are provided via the IMG submission site. Dataset submission for annotation first requires project and associated metadata description in GOLD. The MGAP sequence data processing consists of feature prediction including identification of protein-coding genes, non-coding RNAs and regulatory RNA features, as well as CRISPR elements. In conclusion, structural annotation is followed by assignment of protein product names and functions.
Petillo, David; Westphal, Michael; Koelzer, Katherine; Metcalf, Julie L.; Zhang, Zhongfa; Matsuda, Daisuke; Dykema, Karl J.; Houseman, Heather L.; Kort, Eric J.; Furge, Laura L.; Kahnoski, Richard J.; Richard, Stéphane; Vieillefond, Annick; Swiatek, Pamela J.; Teh, Bin Tean; Ohh, Michael; Furge, Kyle A.
2008-01-01
Chromosomal abnormalities, such as structural and numerical abnormalities, are a common occurrence in cancer. The close association of homologous chromosomes during interphase, a phenomenon termed somatic chromosome pairing, has been observed in cancerous cells, but the functional consequences of somatic pairing have not been established. Gene expression profiling studies revealed that somatic pairing of chromosome 19 is a recurrent chromosomal abnormality in renal oncocytoma, a neoplasia of the adult kidney. Somatic pairing was associated with significant disruption of gene expression within the paired regions and resulted in the deregulation of the prolyl-hydroxylase EGLN2, a key protein that regulates the oxygen-dependent degradation of hypoxia-inducible factor (HIF). Overexpression of EGLN2 in renal oncocytoma increased ubiquitin-mediated destruction of HIF and concomitantly suppressed the expression of several HIF-target genes, including the pro-death BNIP3L gene. The transcriptional changes that are associated with somatic pairing of chromosome 19 mimic the transcriptional changes that occur following DNA amplification. Therefore, in addition to numerical and structural chromosomal abnormalities, alterations in chromosomal spatial dynamics should be considered as genomic events that are associated with tumorigenesis. The identification of EGLN2 as a significantly deregulated gene that maps within the paired chromosome region directly implicates defects in the oxygen-sensing network in the biology of renal oncocytoma. PMID:18773095
Miao, Wenwen; Sun, Lirong; Tian, Mi; Wang, Ji
2017-01-01
Abscisic acid (ABA) receptor pyrabactin resistance1/PYR1-like/regulatory components of ABA receptor (PYR1/PYL/RCAR) (named PYLs for simplicity) are core regulators of ABA signaling, and have been well studied in Arabidopsis and rice. However, knowledge is limited about the PYL family regarding genome organization, gene structure, phylogenesis, gene expression and protein interaction with downstream targets in Gossypium. A comprehensive analysis of the Gossypium PYL family was carried out, and 21, 20, 40 and 39 PYL genes were identified in the genomes of the diploid progenitors G. arboreum and G. raimondii and the tetraploids G. hirsutum and G. barbadense, respectively. Characterization of the physical properties, chromosomal locations, structures and phylogeny of these family members revealed that Gossypium PYLs were highly conserved among the surveyed cotton species. Segmental duplication might be the main force promoting the expansion of PYLs, and the majority of the PYLs evolved under purifying selection in Gossypium. Additionally, the expression profiles of GhPYL genes were tissue-specific. Transcription of many GhPYL genes was inhibited by ABA treatments and induced by osmotic stress. In yeast two-hybrid assays, a number of GhPYLs interacted with GhABI1A or GhABID in the presence and/or absence of ABA. PMID:29230363
Chen, Meili; Hu, Yibo; Liu, Jingxing; Wu, Qi; Zhang, Chenglin; Yu, Jun; Xiao, Jingfa; Wei, Fuwen; Wu, Jiayan
2015-12-11
High-quality and complete gene models are the basis of whole genome analyses. The giant panda (Ailuropoda melanoleuca) genome was the first genome sequenced on the basis of solely short reads, but the genome annotation had lacked the support of transcriptomic evidence. In this study, we applied RNA-seq to globally improve the genome assembly completeness and to detect novel expressed transcripts in 12 tissues from giant pandas, by using a transcriptome reconstruction strategy that combined reference-based and de novo methods. Several aspects of genome assembly completeness in the transcribed regions were effectively improved by the de novo assembled transcripts, including genome scaffolding, the detection of small-size assembly errors, the extension of scaffold/contig boundaries, and gap closure. Through expression and homology validation, we detected three groups of novel full-length protein-coding genes. A total of 12.62% of the novel protein-coding genes were validated by proteomic data. GO annotation analysis showed that some of the novel protein-coding genes were involved in pigmentation, anatomical structure formation and reproduction, which might be related to the development and evolution of the black-white pelage, pseudo-thumb and delayed embryonic implantation of giant pandas. The updated genome annotation will help further giant panda studies from both structural and functional perspectives.
Zhang, Gaofeng; Lu, Tingting; Miao, Wenwen; Sun, Lirong; Tian, Mi; Wang, Ji; Hao, Fushun
2017-01-01
Abscisic acid (ABA) receptor pyrabactin resistance1/PYR1-like/regulatory components of ABA receptor (PYR1/PYL/RCAR) (named PYLs for simplicity) are core regulators of ABA signaling, and have been well studied in Arabidopsis and rice. However, knowledge is limited about the PYL family regarding genome organization, gene structure, phylogenesis, gene expression and protein interaction with downstream targets in Gossypium. A comprehensive analysis of the Gossypium PYL family was carried out, and 21, 20, 40 and 39 PYL genes were identified in the genomes of the diploid progenitors G. arboreum and G. raimondii and the tetraploids G. hirsutum and G. barbadense, respectively. Characterization of the physical properties, chromosomal locations, structures and phylogeny of these family members revealed that Gossypium PYLs were highly conserved among the surveyed cotton species. Segmental duplication might be the main force promoting the expansion of PYLs, and the majority of the PYLs evolved under purifying selection in Gossypium. Additionally, the expression profiles of GhPYL genes were tissue-specific. Transcription of many GhPYL genes was inhibited by ABA treatments and induced by osmotic stress. In yeast two-hybrid assays, a number of GhPYLs interacted with GhABI1A or GhABID in the presence and/or absence of ABA.
Alamgir, A S M; Owens, Nick; Lavignon, Marc; Malik, Frank; Evans, Leonard H
2005-04-01
Polytropic murine leukemia viruses (MuLVs) are generated by recombination of ecotropic MuLVs with env genes of a family of endogenous proviruses in mice, resulting in viruses with an expanded host range and greater virulence. Inbred mouse strains contain numerous endogenous proviruses that are potential donors of the env gene sequences of polytropic MuLVs; however, the precise identification of those proviruses that participate in recombination has been elusive. Three different structural groups of proviruses in NFS/N mice have been described and different ecotropic MuLVs preferentially recombine with different groups of proviruses. In contrast to other ecotropic MuLVs such as Friend MuLV or Akv that recombine predominantly with a single group of proviruses, Moloney MuLV (M-MuLV) recombines with at least two distinct groups. In this study, we determined that only three endogenous proviruses, two of one group and one of another group, are major participants in recombination with M-MuLV. Furthermore, the distinction between the polytropic MuLVs generated by M-MuLV and other ecotropic MuLVs is the result of recombination with a single endogenous provirus. This provirus exhibits a frameshift mutation in the 3' region of the surface glycoprotein-encoding sequences that is excluded in recombinants with M-MuLV. The sites of recombination between the env genes of M-MuLV and endogenous proviruses were confined to a short region exhibiting maximum homology between the ecotropic and polytropic env sequences and maximum stability of predicted RNA secondary structure. These observations suggest a possible mechanism for the specificity of recombination observed for different ecotropic MuLVs.
Yu, Xiang-Qin; Drew, Bryan T; Yang, Jun-Bo; Gao, Lian-Ming; Li, De-Zhu
2017-01-01
Schima is an ecologically and economically important woody genus in the tea family (Theaceae). Unresolved species delimitations and phylogenetic relationships within Schima limit our understanding of the genus and hinder its utilization for economic purposes. In the present study, we conducted a comparative analysis of the complete chloroplast (cp) genomes of 11 Schima species. Our results indicate that Schima cp genomes possess a typical quadripartite structure, with conserved genomic structure and gene order. The size of the Schima cp genome is about 157 kilobase pairs (kb). The genomes consistently encode 114 unique genes, including 80 protein-coding genes, 30 tRNAs, and 4 rRNAs, with 17 duplicated in the inverted repeat (IR). These cp genomes are highly conserved and do not show obvious expansion or contraction of the IR region. The percent variability of the 68 coding and 93 noncoding (>150 bp) fragments is consistently less than 3%. The seven most widely touted DNA barcode regions as well as one promising barcode candidate showed low sequence divergence. Eight mutational hotspots were identified from the 11 cp genomes. These hotspots may potentially be useful as specific DNA barcodes for species identification of Schima. The 58 cpSSR loci reported here are complementary to the microsatellite markers identified from the nuclear genome, and will be leveraged for further population-level studies. Phylogenetic relationships among the 11 Schima species were resolved with strong support based on the cp genome data set, and they correspond well with the species distribution pattern. The data presented here will serve as a foundation to facilitate species identification, DNA barcoding and phylogenetic reconstructions for future exploration of Schima.
Zhou, Ying; Zhou, Yu; Yang, Jie
2016-01-01
The GRAS gene family is one of the most important plant-specific gene families; it encodes transcriptional regulators and plays an essential role in plant development and physiological processes. The GRAS gene family has been well characterized in many higher plants such as Arabidopsis, rice, Chinese cabbage, tomato and tobacco. In this study, we identified 38 GRAS genes in sacred lotus (Nelumbo nucifera), analyzed their physical and chemical characteristics and performed phylogenetic analysis using the GRAS genes from eight representative plant species to show the evolution of GRAS genes in plants. In addition, the gene structures and motifs of the sacred lotus GRAS proteins were characterized in detail. Comparative analysis identified 42 orthologous and 9 co-orthologous gene pairs between sacred lotus and Arabidopsis, and 35 orthologous and 22 co-orthologous gene pairs between sacred lotus and rice. Based on publicly available RNA-seq data generated from leaf, petiole, rhizome and root, we found that most of the sacred lotus GRAS genes exhibited a tissue-specific expression pattern. Eight of the ten PAT1-clade GRAS genes, particularly NnuGRAS-05, NnuGRAS-10 and NnuGRAS-25, were preferentially expressed in rhizome and root. In summary, this is the first in silico analysis of the GRAS gene family in sacred lotus, which will provide valuable information for further molecular and biological analyses of this important gene family. PMID:27635351
Smita, Shuchi; Katiyar, Amit; Pandey, Dev Mani; Chinnusamy, Viswanathan; Archak, Sunil; Bansal, Kailash Chander
2013-01-01
Identification of genes that are coexpressed across various tissues and environmental stresses is biologically interesting, since they may play a coordinated role in similar biological processes. Genes with correlated expression patterns can best be identified by coexpression network analysis of transcriptome data. In the present study, we analyzed the temporal-spatial coordination of gene expression in root, leaf and panicle of rice under drought stress and constructed a network using WGCNA and Cytoscape. A total of 2199 differentially expressed genes (DEGs) were identified in three or more tissues, of which 88 genes had a coordinated expression profile among all six tissues under drought stress. These 88 highly coordinated genes were further subjected to module identification in the coexpression network. Based on the chief topological properties, we identified 18 hub genes, such as those encoding an ABC transporter, ATP-binding protein, dehydrin, protein phosphatase 2C, LTPL153 - Protease inhibitor, phosphatidylethanolamine-binding protein, lactose permease-related protein and NADP-dependent malic enzyme. Motif enrichment analysis showed the presence of ABRE cis-elements in the promoters of > 62% of the coordinately expressed genes. Our results suggest that drought-stress-mediated upregulation of gene expression was coordinated through an ABA-dependent signaling pathway across tissues, at least for the subset of genes identified in this study, while downregulation appears to be regulated by tissue-specific pathways in rice.
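The hub-gene step in the abstract above can be illustrated with a small sketch. This is not the WGCNA R package and not the authors' pipeline: it assumes an unsigned, soft-thresholded Pearson correlation as the adjacency and uses summed adjacency (connectivity) as the only topological property. The function name `hub_genes`, the power `beta=6` and the toy data are all hypothetical.

```python
import numpy as np

def hub_genes(expr, gene_names, beta=6, top_n=3):
    """Rank genes by connectivity in a soft-thresholded coexpression network.

    expr: 2-D array, one row per gene and one column per sample.
    beta: soft-threshold power (an assumed value, not taken from the abstract).
    """
    corr = np.corrcoef(expr)              # gene-by-gene Pearson correlation
    adjacency = np.abs(corr) ** beta      # unsigned soft-threshold adjacency
    np.fill_diagonal(adjacency, 0.0)      # ignore self-connections
    connectivity = adjacency.sum(axis=1)  # summed edge weight per gene
    order = np.argsort(connectivity)[::-1][:top_n]
    return [(gene_names[i], float(connectivity[i])) for i in order]

# Toy data: g1, g2 and g3 co-vary (g3 inversely); g4 is uncorrelated with them.
toy_expr = np.array([
    [1.0, 2.0, 3.0, 4.0, 5.0],
    [2.0, 4.0, 6.0, 8.0, 10.0],
    [5.0, 4.0, 3.0, 2.0, 1.0],
    [1.0, -1.0, 1.0, -1.0, 1.0],
])
toy_names = ["g1", "g2", "g3", "g4"]
toy_hubs = hub_genes(toy_expr, toy_names, top_n=3)
```

In a real analysis the power would be chosen from a scale-free topology fit and hubs would normally be ranked within modules, so this should be read only as an outline of the idea.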
Ye, Jianqiu; Yang, Hai; Shi, Haitao; Wei, Yunxie; Tie, Weiwei; Ding, Zehong; Yan, Yan; Luo, Ying; Xia, Zhiqiang; Wang, Wenquan; Peng, Ming; Li, Kaimian; Zhang, He; Hu, Wei
2017-11-02
Mitogen-activated protein kinase kinase kinases (MAPKKKs), an important unit of the MAPK cascade, play crucial roles in plant development and response to various stresses. However, little is known concerning the MAPKKK family in the important subtropical and tropical crop cassava. In this study, 62 MAPKKK genes were identified in the cassava genome and were classified into 3 subfamilies based on phylogenetic analysis. Most MAPKKKs in the same subfamily shared similar gene structures and conserved motifs. Comprehensive transcriptome analysis showed that MAPKKK genes participated in tissue development and response to drought stress. Comparative expression profiles revealed that many MAPKKK genes were activated in the cultivated varieties SC124 and Arg7, and that the function of MeMAPKKKs in drought resistance may differ between SC124/Arg7 and W14. Expression analyses of the 7 selected MeMAPKKK genes showed that most of them were significantly upregulated by osmotic, salt and ABA treatments, but only slightly induced by H2O2 and cold stresses. Taken together, this study identified candidate MeMAPKKK genes for genetic improvement of abiotic stress resistance and provided new insights into MAPKKK-mediated cassava resistance to drought stress.
Identification of positive selection in disease response genes within members of the Poaceae.
Rech, Gabriel E; Vargas, Walter A; Sukno, Serenella A; Thon, Michael R
2012-12-01
Millions of years of coevolution between plants and pathogens can leave footprints on their genomes, and genes involved in this interaction are expected to show patterns of positive selection in which novel, beneficial alleles are rapidly fixed within the population. Using information about upregulated genes in maize during Colletotrichum graminicola infection and resources available in the Phytozome database, we looked for evidence of positive selection in the Poaceae lineage acting on protein-coding sequences related to plant defense. We found six genes with evidence of positive selection and another eight with sites showing episodic selection. Some of them have already been described as evolving under positive selection, but others are reported here for the first time, including genes encoding isocitrate lyase, dehydrogenases, a multidrug transporter, a protein containing a putative leucine-rich repeat and other proteins with unknown functions. Mapping positively selected residues onto the predicted 3-D structure of proteins showed that most of them are located on the surface, where proteins are in contact with other molecules. We present here a set of Poaceae genes that are likely to be involved in plant defense mechanisms and have evidence of positive selection. These genes are excellent candidates for future functional validation.
NASA Astrophysics Data System (ADS)
Chen, Zhihao; Zhao, Fan; Qi, Yiduo; Hu, Lifang; Li, Dijie; Yin, Chong; Su, Peihong; Zhang, Yan; Ma, Jianhua; Qian, Jing; Zhou, Hongpo; Zou, Yiwei; Qian, Airong
2016-12-01
Bone undergoes dynamic modelling and remodelling processes, and it requires gravity-mediated mechanical stimulation for the maintenance of mineral content and structure. Osteocytes are the most commonly found cells in the mature bone, and they are sensitive to mechanical changes. The purpose of this study was to investigate the effects of microgravity simulated with a random position machine (RPM) on the gene expression profile of osteocytes. Genes sensitive to RPM treatment were sorted on the basis of biological processes, interactions and signalling pathways. Overall, 504 differentially expressed genes (DEGs) in osteocytes cultured under RPM conditions were found. The DEGs were further analysed using bioinformatics tools such as DAVID and iReport. A total of 15 ATP-binding and cytoskeleton-related genes were further confirmed by quantitative real-time PCR (qRT-PCR). Our findings demonstrate that the RPM affected the expression of genes involved in cytoskeleton remodelling and the energy-transfer process in osteocytes. The identification of mechanosensitive genes may enhance our understanding of the roles of osteocytes in mechanosensation and may provide some potential targets for preventing and treating bone-related diseases.
Izquierdo, Javier A; Sizova, Maria V; Lynd, Lee R
2010-06-01
The enrichment from nature of novel microbial communities with high cellulolytic activity is useful in the identification of novel organisms and novel functions that enhance the fundamental understanding of microbial cellulose degradation. In this work we identify predominant organisms in three cellulolytic enrichment cultures with thermophilic compost as an inoculum. Community structure based on 16S rRNA gene clone libraries featured extensive representation of clostridia from cluster III, with minor representation of clostridial clusters I and XIV and a novel Lutispora species cluster. Our studies reveal different levels of 16S rRNA gene diversity, ranging from 3 to 18 operational taxonomic units (OTUs), as well as variability in community membership across the three enrichment cultures. By comparison, glycosyl hydrolase family 48 (GHF48) diversity analyses revealed a narrower breadth of novel clostridial genes associated with cultured and uncultured cellulose degraders. The novel GHF48 genes identified in this study were related to the novel clostridia Clostridium straminisolvens and Clostridium clariflavum, with one cluster sharing as little as 73% sequence similarity with the closest known relative. In all, 14 new GHF48 gene sequences were added to the known diversity of 35 genes from cultured species.
Fraisier, V; Dorbe, M F; Daniel-Vedele, F
2001-01-01
Higher plants have both high- and low-affinity nitrate uptake systems (HATS and LATS respectively). Here we report the isolation and characterization of two genes, NpNRT1.1 and NpNRT1.2, from Nicotiana plumbaginifolia whose structural features suggest that they both belong to the NRT1 gene family, which is involved in the LATS. Amino acid sequence alignment showed that the N. plumbaginifolia proteins have greater similarity to their corresponding tomato homologues than to each other. Genomic Southern blot analysis indicates that there are probably more than two members of this family in N. plumbaginifolia. Northern blot analysis shows that NpNRT1.2 expression is restricted strictly to roots, whereas NpNRT1.1, in addition to roots, is expressed at a basal level in all other plant organs. Likewise, differential expression in response to external treatments with various N sources was observed for these two genes: NpNRT1.1 can be considered as a constitutively expressed gene whereas NpNRT1.2 expression is dependent strictly on high nitrate concentrations. Finally, over-expression of a gene involved in the HATS does not lead to any modification of LATS gene expression.
Discovery of novel bacterial toxins by genomics and computational biology.
Doxey, Andrew C; Mansfield, Michael J; Montecucco, Cesare
2018-06-01
Hundreds of bacterial protein toxins are presently known. Traditionally, toxin identification begins with pathological studies of bacterial infectious disease. Following identification and cultivation of a bacterial pathogen, the protein toxin is purified from the culture medium and its pathogenic activity is studied using the methods of biochemistry and structural biology, cell biology, tissue and organ biology, and appropriate animal models, supplemented by bioimaging techniques. The ongoing and explosive development of high-throughput DNA sequencing and bioinformatic approaches has set in motion a revolution in many fields of biology, including microbiology. One consequence is that genes encoding novel bacterial toxins can be identified by bioinformatic and computational methods based on previous knowledge accumulated from studies of the biology and pathology of thousands of known bacterial protein toxins. Starting from the paradigmatic cases of diphtheria toxin, tetanus and botulinum neurotoxins, this review discusses traditional experimental approaches as well as bioinformatics and genomics-driven approaches that facilitate the discovery of novel bacterial toxins. We discuss recent work on the identification of novel botulinum-like toxins from genera such as Weissella, Chryseobacterium, and Enterococcus, and the implications of these computationally identified toxins in the field. Finally, we discuss the promise of metagenomics in the discovery of novel toxins and their ecological niches, and present data suggesting the existence of uncharacterized, botulinum-like toxin genes in insect gut metagenomes. Copyright © 2018. Published by Elsevier Ltd.
Manananggal - a novel viewer for alternative splicing events.
Barann, Matthias; Zimmer, Ralf; Birzele, Fabian
2017-02-21
Alternative splicing is an important cellular mechanism that can be analyzed by RNA sequencing. However, identification of splicing events in an automated fashion is error-prone. Thus, further validation is required to select reliable instances of alternative splicing events (ASEs). There are only a few tools specifically designed for interactive inspection of ASEs, and available visualization approaches can be significantly improved. Here, we present Manananggal, an application specifically designed for the identification of splicing events in next generation sequencing data. Manananggal includes a web application for visual inspection and a command line tool that allows for ASE detection. We compare the sashimi plots available in the IGV Viewer, the DEXSeq splicing plots and SpliceSeq to the Manananggal interface and discuss the advantages and drawbacks of these tools. We show that sashimi plots (such as those used by the IGV Viewer and SpliceSeq) offer a practical solution for simple ASEs, but also have shortcomings for highly complex genes. Manananggal is an interactive web application that offers functions specifically tailored to the identification of alternative splicing events that other tools are lacking. The ability to select a subset of isoforms allows an easier interpretation of complex alternative splicing events. In contrast to SpliceSeq and the DEXSeq splicing plot, Manananggal does not obscure the gene structure by showing full transcript models, which makes it easier to determine which isoforms are expressed and which are not.
Lynch, Caitlin; Pan, Yongmei; Li, Linhao; Ferguson, Stephen S.; Xia, Menghang; Swaan, Peter W.; Wang, Hongbing
2012-01-01
Purpose: The constitutive androstane receptor (CAR, NR1I3) is a xenobiotic sensor governing the transcription of numerous hepatic genes associated with drug metabolism and clearance. Recent evidence suggests that CAR also modulates energy homeostasis and cancer development. Thus, identification of novel human (h) CAR activators is of both clinical importance and scientific interest. Methods: Docking and ligand-based structure-activity models were used for virtual screening of a database containing over 2000 FDA-approved drugs. Identified lead compounds were evaluated in cell-based reporter assays to determine hCAR activation. Potential activators were further tested in human primary hepatocytes (HPHs) for the expression of the prototypical hCAR target gene CYP2B6. Results: Nineteen lead compounds with optimal modeling parameters were selected for biological evaluation. Seven of the 19 leads exhibited moderate to potent activation of hCAR. Five out of the seven compounds translocated hCAR from the cytoplasm to the nucleus of HPHs in a concentration-dependent manner. These compounds also induced the expression of CYP2B6 in HPHs with a rank order of efficacies closely resembling that of hCAR activation. Conclusion: These results indicate that our strategically integrated approaches are effective in the identification of novel hCAR modulators, which may function as valuable research tools or potential therapeutic molecules. PMID:23090669
Li, Donghua; Liu, Pan; Yu, Jingyin; Wang, Linhai; Dossa, Komivi; Zhang, Yanxin; Zhou, Rong; Wei, Xin; Zhang, Xiurong
2017-09-11
Sesame (Sesamum indicum L.) is one of the world's most important oil crops. However, it is susceptible to abiotic stresses in general, and to waterlogging and drought stresses in particular. The molecular mechanisms of abiotic stress tolerance in sesame have not yet been elucidated. The WRKY domain transcription factors play significant roles in plant growth, development, and responses to stresses. However, little is known about the number, location, structure, molecular phylogenetics, and expression of the WRKY genes in sesame. We performed a comprehensive study of the WRKY gene family in sesame and identified 71 SiWRKYs. In total, 65 of these genes were mapped to 15 linkage groups within the sesame genome. A phylogenetic analysis was performed using a related species (Arabidopsis thaliana) to investigate the evolution of the sesame WRKY genes. Tissue expression profiles of the WRKY genes demonstrated that six SiWRKY genes were highly expressed in all organs, suggesting that these genes may be important for plant growth and organ development in sesame. Analysis of the SiWRKY gene expression patterns revealed that 33 and 26 SiWRKYs respond strongly to waterlogging and drought stresses, respectively. Changes in the expression of 12 SiWRKY genes were observed at different times after the waterlogging and drought treatments had begun, demonstrating that sesame gene expression patterns vary in response to abiotic stresses. In this study, we analyzed the WRKY family of transcription factors encoded by the sesame genome. Insight was gained into the classification, evolution, and function of the SiWRKY genes, revealing their putative roles in a variety of tissues. Responses to abiotic stresses in different sesame cultivars were also investigated. The results of our study provide a better understanding of the structures and functions of sesame WRKY genes and suggest that manipulating these WRKYs could enhance resistance to waterlogging and drought.
Osorio-Guarín, Jaime A; Enciso-Rodríguez, Felix E; González, Carolina; Fernández-Pozo, Noé; Mueller, Lukas A; Barrero, Luz Stella
2016-03-18
Vascular wilt caused by Fusarium oxysporum is the most important disease in cape gooseberry (Physalis peruviana L.) in Colombia. The development of resistant cultivars is considered one of the most cost-effective means to reduce the impact of this disease. To do so, it is necessary to provide breeders with molecular markers and promising germplasm for introgression of different resistance loci as part of breeding schemes. Here we describe an association mapping study in cape gooseberry with the goals to: (i) select promising materials for use in plant breeding and (ii) identify SNPs associated with the cape gooseberry resistance response to the F. oxysporum pathogen under greenhouse conditions, as potential markers for cape gooseberry breeding. We found a total of 21 accessions with different resistance responses within a diversity panel of 100 cape gooseberry accessions. A total of 60,663 SNPs were also identified within the same panel by means of GBS (genotyping by sequencing). Model-based population structure and neighbor-joining analyses showed three populations comprising the cape gooseberry panel. After correction for population structure and kinship, we identified SNP markers associated with the resistance response against F. oxysporum. The identification of markers was based on common tags using the reference genomes of tomato and potato as well as the root/stem transcriptome of cape gooseberry. Comparison of marker locations with the tomato genome placed 16 SNPs in genes involved in defense/resistance response to pathogens; comparison with the potato genome placed 12 markers in such genes. The work presented herein provides the first association mapping study in cape gooseberry, showing both the identification of promising accessions with resistance response phenotypes and the identification of a set of SNP markers mapped to defense/resistance response genes of reference genomes. Thus, the work also provides new knowledge on candidate genes involved in the P. peruviana - F. oxysporum pathosystem as a foundation for further validation in marker-assisted selection. The results have important implications for conservation and breeding strategies in cape gooseberry.
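The phrase "correction for population structure" in the abstract above can be made concrete with a small sketch. The abstract does not say which statistical model or software was used, so the following is only an assumed, simplified stand-in: an ordinary least-squares fit of phenotype on SNP genotype with structure covariates (for example, principal components) added as extra columns. It does not model kinship, and the function name `snp_effect` and all data values are hypothetical.

```python
import numpy as np

def snp_effect(genotype, phenotype, covariates):
    """Estimate a SNP effect while adjusting for population-structure covariates.

    genotype:   allele counts per accession (0, 1 or 2).
    phenotype:  trait value per accession (e.g. a disease score).
    covariates: one or more structure covariates per accession (assumed to be
                principal components or ancestry proportions).
    """
    n = len(phenotype)
    # Design matrix: intercept, SNP genotype, then the structure covariates.
    design = np.column_stack([np.ones(n), genotype, covariates])
    coef, *_ = np.linalg.lstsq(design, phenotype, rcond=None)
    return float(coef[1])  # coefficient on the SNP column

# Toy data in which the trait depends on both the SNP and one covariate.
toy_genotype = np.array([0.0, 1.0, 2.0, 0.0, 1.0, 2.0])
toy_pc1 = np.array([0.1, -0.2, 0.3, 0.5, -0.4, 0.0])
toy_trait = 1.0 + 0.5 * toy_genotype + 2.0 * toy_pc1
toy_beta = snp_effect(toy_genotype, toy_trait, toy_pc1)
```

A mixed model with a kinship matrix, as the abstract's wording implies, would add a random effect on top of these fixed covariates; that part is deliberately left out here.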
Li, Xiong; Wu, Yuansheng; Li, Boqun; He, Wenqi; Yang, Yonghong; Yang, Yongping
2018-01-01
The cation diffusion facilitator (CDF) family is one of the gene families involved in metal ion uptake and transport in plants, but the understanding of the definite roles and mechanisms of most CDF genes remains limited. In the present study, we identified 18 candidate CDF genes from the turnip genome and named them BrrMTP1.1 to BrrMTP12. Then, we performed a comparative genomic analysis of the phylogenetic relationships, gene structures and chromosome distributions, conserved domains, and motifs of turnip CDFs. The constructed phylogenetic tree indicated that the BrrMTPs were divided into seven groups (groups 1, 5, 6, 7, 8, 9, and 12) and formed three major clusters (Zn-CDFs, Fe/Zn-CDFs, and Mn-CDFs). Moreover, the structural characteristics of the BrrMTP members in the same group were similar but varied among groups. To investigate the potential roles of BrrMTPs in turnip, we conducted an expression analysis of all BrrMTP genes under Mg, Zn, Cu, Mn, Fe, Co, Na, and Cd stresses. Results showed that the expression of every BrrMTP member was induced by at least one metal ion, indicating that these genes may be related to the tolerance or transport of those metal ions. Based on the roles of different metal ions in plants, we hypothesized that BrrMTP genes are possibly involved in heavy metal accumulation and tolerance to salt stress apart from their roles in the maintenance of mineral nutrient homeostasis in turnip. These findings are helpful for understanding the roles of MTPs in plants and provide preliminary information for the study of the functions of BrrMTP genes.
Park, Chihyun; Yun, So Jeong; Ryu, Sung Jin; Lee, Soyoung; Lee, Young-Sam; Yoon, Youngmi; Park, Sang Chul
2017-03-15
Cellular senescence irreversibly arrests growth of human diploid cells. In addition, recent studies have indicated that senescence is a multi-step evolving process related to important complex biological processes. Most studies analyzed only the genes and their functions representing each senescence phase without considering gene-level interactions and continuously perturbed genes. It is necessary to reveal the genotypic mechanism inferred from affected genes and their interactions underlying the senescence process. We propose a novel computational approach to identify an integrative network that profiles an underlying genotypic signature from time-series gene expression data. The relatively perturbed genes were selected for each time point based on a proposed scoring measure, termed the perturbation score. Then, the selected genes were integrated with protein-protein interactions to construct a time-point-specific network. From these constructed networks, the edges conserved across time points were extracted to form the common network, and a statistical test was performed to demonstrate that the network could explain the phenotypic alteration. As a result, it was confirmed that the difference in average perturbation scores of common networks between two time points could explain the phenotypic alteration. We also performed functional enrichment on the common network and identified a high association with phenotypic alteration. Remarkably, we observed that the identified cell cycle specific common network played an important role in replicative senescence as a key regulator. Heretofore, network analysis of time-series gene expression data has focused on how topological structure changes across time points. In contrast, we focused on structure that is conserved while its context changes over time, and showed that it can explain the phenotypic changes. 
We expect that the proposed method will help to elucidate the biological mechanism unrevealed by the existing approaches.
Assessing the effects of common variation in the FOXP2 gene on human brain structure.
Hoogman, Martine; Guadalupe, Tulio; Zwiers, Marcel P; Klarenbeek, Patricia; Francks, Clyde; Fisher, Simon E
2014-01-01
The FOXP2 transcription factor is one of the most well-known genes to have been implicated in developmental speech and language disorders. Rare mutations disrupting the function of this gene have been described in different families and cases. In a large three-generation family carrying a missense mutation, neuroimaging studies revealed significant effects on brain structure and function, most notably in the inferior frontal gyrus, caudate nucleus, and cerebellum. After the identification of rare disruptive FOXP2 variants impacting on brain structure, several reports proposed that common variants at this locus may also have detectable effects on the brain, extending beyond disorder into normal phenotypic variation. These neuroimaging genetics studies used groups of between 14 and 96 participants. The current study assessed effects of common FOXP2 variants on neuroanatomy using voxel-based morphometry (VBM) and volumetric techniques in a sample of >1300 people from the general population. In a first targeted stage we analyzed single nucleotide polymorphisms (SNPs) claimed to have effects in prior smaller studies (rs2253478, rs12533005, rs2396753, rs6980093, rs7784315, rs17137124, rs10230558, rs7782412, rs1456031), beginning with regions proposed in the relevant papers, then assessing impact across the entire brain. In the second gene-wide stage, we tested all common FOXP2 variation, focusing on volumetry of those regions most strongly implicated from analyses of rare disruptive mutations. Despite using a sample that is more than 10 times that used for prior studies of common FOXP2 variation, we found no evidence for effects of SNPs on variability in neuroanatomy in the general population. Thus, the impact of this gene on brain structure may be largely limited to extreme cases of rare disruptive alleles. Alternatively, effects of common variants at this gene exist but are too subtle to be detected with standard volumetric techniques.
A New Algorithm for Identifying Cis-Regulatory Modules Based on Hidden Markov Model
2017-01-01
The discovery of cis-regulatory modules (CRMs) is the key to understanding mechanisms of transcription regulation. Since CRMs have specific regulatory structures that are the basis for the regulation of gene expression, how to model the regulatory structure of CRMs has a considerable impact on the performance of CRM identification. The paper proposes a CRM discovery algorithm called ComSPS. ComSPS builds a regulatory structure model of CRMs based on HMM by exploring the rules of CRM transcriptional grammar that governs the internal motif site arrangement of CRMs. We test ComSPS on three benchmark datasets and compare it with five existing methods. Experimental results show that ComSPS performs better than them. PMID:28497059
Functional clustering of time series gene expression data by Granger causality
2012-01-01
Background: A common approach for time series gene expression data analysis includes the clustering of genes with similar expression patterns throughout time. Clustered gene expression profiles point to the joint contribution of groups of genes to a particular cellular process. However, since genes belong to intricate networks, other features, besides comparable expression patterns, should provide additional information for the identification of functionally similar genes. Results: In this study we perform gene clustering through the identification of Granger causality between and within sets of time series gene expression data. Granger causality is based on the idea that the cause of an event cannot come after its consequence. Conclusions: This kind of analysis can be used as a complementary approach for functional clustering, wherein genes would be clustered not solely based on their expression similarity but on their topological proximity built according to the intensity of Granger causality among them. PMID:23107425
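The Granger idea mentioned in this abstract can be illustrated with a small sketch. It is not the authors' method (the abstract does not give their formulation); it is the textbook pairwise test, assuming a linear autoregressive model: past values of x are said to Granger-cause y if adding them to a regression of y on its own past reduces the residual error. The helper names `ols_rss` and `granger_f` are hypothetical.

```python
def ols_rss(rows, target):
    """Residual sum of squares of a least-squares fit, via the normal equations."""
    p = len(rows[0])
    # Augmented matrix [X'X | X'y]
    a = [
        [sum(r[i] * r[j] for r in rows) for j in range(p)]
        + [sum(r[i] * t for r, t in zip(rows, target))]
        for i in range(p)
    ]
    # Gauss-Jordan elimination with partial pivoting
    for c in range(p):
        pivot = max(range(c, p), key=lambda r: abs(a[r][c]))
        a[c], a[pivot] = a[pivot], a[c]
        for r in range(p):
            if r != c:
                f = a[r][c] / a[c][c]
                a[r] = [u - f * v for u, v in zip(a[r], a[c])]
    beta = [a[i][p] / a[i][i] for i in range(p)]
    return sum(
        (t - sum(b * v for b, v in zip(beta, r))) ** 2
        for r, t in zip(rows, target)
    )


def granger_f(x, y, lag=1):
    """F statistic for 'x Granger-causes y' with `lag` past values of each series."""
    n = len(y)
    target = y[lag:]
    # Restricted model: y predicted from an intercept and its own past only
    restricted = [[1.0] + [y[t - k] for k in range(1, lag + 1)]
                  for t in range(lag, n)]
    # Full model: the same columns plus the past of x
    full = [r + [x[t - k] for k in range(1, lag + 1)]
            for r, t in zip(restricted, range(lag, n))]
    rss_r = ols_rss(restricted, target)
    rss_f = ols_rss(full, target)
    dof = len(target) - len(full[0])
    return ((rss_r - rss_f) / lag) / (rss_f / dof)
```

A large F value for `granger_f(x, y)` together with a small one for `granger_f(y, x)` points to a directed edge from x to y; a matrix of such statistics over all gene pairs is one plausible input for the kind of causality-based clustering the abstract describes.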
Ponting, C P; Mott, R; Bork, P; Copley, R R
2001-12-01
Sequence database searching methods, such as BLAST, are invaluable for predicting molecular function on the basis of sequence similarities among single regions of proteins. Searches of whole databases, however, are not optimized to detect multiple homologous regions within a single polypeptide. Here we have used the prospero algorithm to perform self-comparisons of all predicted Drosophila melanogaster gene products. Predicted repeats, and their homologs from all species, were analyzed further to detect hitherto unappreciated evolutionary relationships. Results included the identification of novel tandem repeats in the human X-linked retinitis pigmentosa type-2 gene product, repeated segments in cystinosin, associated with a defect in cystine transport, and 'nested' homologous domains in dysferlin, whose gene is mutated in limb girdle muscular dystrophy. Novel signaling domain families were found that may regulate the microtubule-based cytoskeleton and ubiquitin-mediated proteolysis, respectively. Two families of glycosyl hydrolases were shown to contain internal repetitions that hint at their evolution via a piecemeal, modular approach. In addition, three examples of fruit fly genes were detected with tandem exons that appear to have arisen via internal duplication. These findings demonstrate how completely sequenced genomes can be exploited to further understand the relationships between molecular structure, function, and evolution.
Krol, Kamil; Jendrysek, Justyna; Debski, Janusz; Skoneczny, Marek; Kurlandzka, Anna; Kaminska, Joanna; Dadlez, Michal; Skoneczna, Adrianna
2017-04-11
Ribosomal RNA-encoding genes (rDNA) are the most abundant genes in eukaryotic genomes. To meet the high demand for rRNA, rDNA genes are present in multiple tandem repeats clustered on a single or several chromosomes and are vastly transcribed. To facilitate intensive transcription and prevent rDNA destabilization, the rDNA-encoding portion of the chromosome is confined in the nucleolus. However, the rDNA region is susceptible to recombination and DNA damage, accumulating mutations, rearrangements and atypical DNA structures. Various sophisticated techniques have been applied to detect these abnormalities. Here, we present a simple method for the evaluation of the activity and integrity of an rDNA region called a "DNA cloud assay". We verified the efficacy of this method using yeast mutants lacking genes important for nucleolus function and maintenance (RAD52, SGS1, RRM3, PIF1, FOB1 and RPA12). The DNA cloud assay permits the evaluation of nucleolus status and is compatible with downstream analyses, such as the chromosome comet assay to identify DNA structures present in the cloud and mass spectrometry of agarose squeezed proteins (ASPIC-MS) to detect nucleolar DNA-bound proteins, including Las17, the homolog of human Wiskott-Aldrich Syndrome Protein (WASP).
Krol, Kamil; Jendrysek, Justyna; Debski, Janusz; Skoneczny, Marek; Kurlandzka, Anna; Kaminska, Joanna; Dadlez, Michal; Skoneczna, Adrianna
2017-01-01
Ribosomal RNA-encoding genes (rDNA) are the most abundant genes in eukaryotic genomes. To meet the high demand for rRNA, rDNA genes are present in multiple tandem repeats clustered on a single or several chromosomes and are vastly transcribed. To facilitate intensive transcription and prevent rDNA destabilization, the rDNA-encoding portion of the chromosome is confined in the nucleolus. However, the rDNA region is susceptible to recombination and DNA damage, accumulating mutations, rearrangements and atypical DNA structures. Various sophisticated techniques have been applied to detect these abnormalities. Here, we present a simple method for the evaluation of the activity and integrity of an rDNA region called a “DNA cloud assay”. We verified the efficacy of this method using yeast mutants lacking genes important for nucleolus function and maintenance (RAD52, SGS1, RRM3, PIF1, FOB1 and RPA12). The DNA cloud assay permits the evaluation of nucleolus status and is compatible with downstream analyses, such as the chromosome comet assay to identify DNA structures present in the cloud and mass spectrometry of agarose squeezed proteins (ASPIC-MS) to detect nucleolar DNA-bound proteins, including Las17, the homolog of human Wiskott-Aldrich Syndrome Protein (WASP). PMID:28212567
Technical and biological variance structure in mRNA-Seq data: life in the real world
2012-01-01
Background: mRNA expression data from next generation sequencing platforms is obtained in the form of counts per gene or exon. Counts have classically been assumed to follow a Poisson distribution in which the variance is equal to the mean. The Negative Binomial distribution, which allows for over-dispersion, i.e., for the variance to be greater than the mean, is commonly used to model count data as well. Results: In mRNA-Seq data from 25 subjects, we found technical variation to generally follow a Poisson distribution as has been reported previously and biological variability was over-dispersed relative to the Poisson model. The mean-variance relationship across all genes was quadratic, in keeping with a Negative Binomial (NB) distribution. Over-dispersed Poisson and NB distributional assumptions demonstrated marked improvements in goodness-of-fit (GOF) over the standard Poisson model assumptions, but with evidence of over-fitting in some genes. Modeling of experimental effects improved GOF for high variance genes but increased the over-fitting problem. Conclusions: These conclusions will guide development of analytical strategies for accurate modeling of variance structure in these data and sample size determination, which in turn will aid in the identification of true biological signals that inform our understanding of biological systems. PMID:22769017
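The mean-variance relationships named in this abstract can be made concrete with a short sketch. Under a Poisson model the variance equals the mean; under the common NB parameterization the variance is mu + alpha * mu^2, which is the quadratic relationship the abstract refers to. The function below is a simple method-of-moments estimate of alpha, not the fitting procedure used in the study, and the counts are made-up illustrative values rather than data from the paper.

```python
from statistics import mean, variance


def dispersion(counts):
    """Method-of-moments NB dispersion alpha, from var = mu + alpha * mu**2.

    Returns 0.0 when the sample variance does not exceed the mean, i.e. when
    the counts show no over-dispersion relative to a Poisson model.
    """
    mu = mean(counts)
    if mu == 0:
        return 0.0
    var = variance(counts)  # sample variance (n - 1 denominator)
    return max(0.0, (var - mu) / mu ** 2)


# Hypothetical counts for one gene (illustrative only)
technical_replicates = [98, 102, 95, 105, 100]   # same sample, re-sequenced
biological_replicates = [60, 150, 90, 130, 70]   # different subjects

print(dispersion(technical_replicates))
print(dispersion(biological_replicates))
```

In this toy example the technical replicates vary less than a Poisson model would allow, so the estimate is clamped to zero, while the biological replicates give a clearly positive alpha.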
Pontvianne, Frédéric; Carpentier, Marie-Christine; Durut, Nathalie; Pavlištová, Veronika; Jaške, Karin; Schořová, Šárka; Parrinello, Hugues; Rohmer, Marine; Pikaard, Craig S; Fojtová, Miloslava; Fajkus, Jiří; Saez-Vasquez, Julio
2017-01-01
The nucleolus is the site of ribosomal RNA (rRNA) gene transcription, rRNA processing and ribosome biogenesis. However, the nucleolus also plays additional roles in the cell. We isolated nucleoli by Fluorescence Activated Cell Sorting (FACS) and identified Nucleolus-Associated Chromatin Domains (NADs) by deep sequencing, comparing wild-type plants and null mutants for the nucleolar protein, NUCLEOLIN 1 (NUC1). NADs are primarily genomic regions with heterochromatic signatures and include transposable elements (TEs), sub-telomeric regions and mostly inactive protein-coding genes. However, NADs also include active ribosomal RNA genes, and the entire short arm of chromosome 4 adjacent to them. In nuc1 null mutants, which alter rRNA gene expression and overall nucleolar structure, NADs are altered, telomere association with the nucleolus is decreased and telomeres become shorter. Collectively, our studies reveal roles for NUC1 and the nucleolus in the spatial organization of chromosomes as well as telomere maintenance. PMID:27477271
Choi, Ye-Na; Oh, Bong-Kyeong; Kawasaki, Ichiro; Oh, Wan-Suk; Lee, Yi; Paik, Young-Ki; Shim, Yhong-Hee
2010-02-28
The cdc25 gene, which is highly conserved in many eukaryotes, encodes a phosphatase that plays essential roles in cell cycle regulation. We identified a cdc25 ortholog in the pinewood nematode, Bursaphelenchus xylophilus. The B. xylophilus ortholog (Bx-cdc25) was found to be highly similar to Caenorhabditis elegans cdc-25.2 in sequence as well as in gene structure, both having a long intron 1. The Bx-cdc25 gene was determined to be composed of seven exons and six introns in a 2,580 bp region, and was shown to encode a 360-amino-acid protein containing a highly conserved phosphatase domain. Bx-cdc25 mRNA was hardly detectable throughout the juvenile stages but was highly expressed in eggs and in both female and male adults. Functional conservation during germline development between C. elegans cdc25 and Bx-cdc25 was revealed by Bx-cdc25 RNA interference in C. elegans.
Pontigo, Juan Pablo; Agüero, María José; Sánchez, Patricio; Oyarzún, Ricardo; Vargas-Lagos, Carolina; Mancilla, Jorge; Kossmann, Hans; Morera, Francisco J; Yáñez, Alejandro J; Vargas-Chacoff, Luis
2016-11-01
The NOD-like receptors (NLRs) were recently identified as an intracellular pathogen recognition receptor family in vertebrates. While the immune system participation of NLRs has been characterized and analyzed in various mammalian models, few studies have considered NLRs in teleost species. Therefore, this study analyzed the Atlantic salmon (Salmo salar) NLRC5. Structurally, Atlantic salmon NLRC5 presented leucine-rich repeat subfamily genes. Phylogenetically, NLRC5 was moderately conserved between S. salar and other species. Real-time quantitative PCR revealed NLRC5 expression in almost all analyzed organs, with the greatest expression in the head kidney, spleen, and hindgut. Furthermore, NLRC5 gene expression decreased during the smolt stage. These data suggest that NLRC5 participates in the Atlantic salmon immune response and is regulated, at least partly, by the smoltification process, pointing to a depression of the immune system from the parr to the smolt stage. This is the first report on the NLRC5 gene in salmonid smolts. Copyright © 2016 Elsevier Ltd. All rights reserved.
Clayton, William; Eaton, Carla Jane; Dupont, Pierre-Yves; Gillanders, Tim; Cameron, Nick; Saikia, Sanjay; Scott, Barry
2017-01-01
Epichloë grass endophytes comprise a group of filamentous fungi of both sexual and asexual species. Because of the beneficial characteristics they endow upon their grass hosts, the identification of these endophyte species has been of great interest agronomically and scientifically. Simple sequence repeat loci and the variation in their repeat elements have been used to rapidly identify endophyte species and strains; however, little is known of how the structure of repeat elements changes between species and strains, and where these repeat elements are located in the fungal genome. We report on an in-depth analysis of the structure and genomic location of the simple sequence repeat locus B10, commonly used for Epichloë endophyte species identification. The B10 repeat was found to be located within an exon of a putative bZIP transcription factor, suggesting possible impacts on polypeptide sequence and thus protein function. Analysis of this repeat in the asexual endophyte hybrid Epichloë uncinata revealed that the structure of B10 alleles reflects the ancestral species that hybridized to give rise to this species. Understanding the structure and sequence of these simple sequence repeats provides a useful set of tools for readily distinguishing strains and for gaining insights into the ancestral species that have undergone hybridization events.
Li, Si-Bei; OuYang, Wei-Zhi; Hou, Xiao-Jin; Xie, Liang-Liang; Hu, Chun-Gen; Zhang, Jin-Zhi
2015-01-01
Auxin response factors (ARFs) are an important family of proteins in auxin-mediated response, with key roles in various physiological and biochemical processes. To date, a genome-wide overview of the ARF gene family in citrus has not been available. A systematic analysis of this gene family in citrus was begun by carrying out a genome-wide search for the homologs of ARFs. A total of 19 nonredundant ARF genes (CiARF) were found and validated from the sweet orange. A comprehensive overview of the CiARFs was undertaken, including the gene structures, phylogenetic analysis, chromosome locations, conserved motifs of proteins, and cis-elements in promoters of CiARF. Furthermore, expression profiling using real-time PCR revealed expression of many CiARF genes, albeit with different patterns depending on types of tissues and/or developmental stages. Comprehensive expression analysis of these genes was also performed under two hormone treatments using real-time PCR. Indole-3-acetic acid (IAA) and N-1-naphthylphthalamic acid (NPA) treatment experiments revealed differential up-regulation and down-regulation, respectively, of the 19 citrus ARF genes in the callus of sweet orange. Our comprehensive analysis of ARF genes further elucidates the roles of CiARF family members during citrus growth and development. PMID:25870601
Hu, Wei; Hou, Xiaowan; Huang, Chao; Yan, Yan; Tie, Weiwei; Ding, Zehong; Wei, Yunxie; Liu, Juhua; Miao, Hongxia; Lu, Zhiwei; Li, Meiying; Xu, Biyu; Jin, Zhiqiang
2015-01-01
Aquaporins (AQPs) function to selectively control the flow of water and other small molecules through biological membranes, playing crucial roles in various biological processes. However, little information is available on the AQP gene family in bananas. In this study, we identified 47 banana AQP genes based on the banana genome sequence. Evolutionary analysis of AQPs from banana, Arabidopsis, poplar, and rice indicated that banana AQPs (MaAQPs) were clustered into four subfamilies. Conserved motif analysis showed that all banana AQPs contained the typical AQP-like or major intrinsic protein (MIP) domain. Gene structure analysis suggested the majority of MaAQPs had two to four introns with a highly specific number and length for each subfamily. Expression analysis of MaAQP genes during fruit development and postharvest ripening showed that some MaAQP genes exhibited high expression levels during these stages, indicating the involvement of MaAQP genes in banana fruit development and ripening. Additionally, some MaAQP genes showed strong induction after stress treatment and therefore, may represent potential candidates for improving banana resistance to abiotic stress. Taken together, this study identified some excellent tissue-specific, fruit development- and ripening-dependent, and abiotic stress-responsive candidate MaAQP genes, which could lay a solid foundation for genetic improvement of banana cultivars. PMID:26307965
Sun, Yang; Huang, Shuijin; Wang, Shuping; Guo, Dianhao; Ge, Chang; Xiao, Huamei; Jie, Wencai; Yang, Qiupu; Teng, Xiaolu; Li, Fei
2017-04-01
Insects undergo metamorphosis, involving an abrupt change in body structure through cell growth and differentiation. The rice striped stem borer (SSB), Chilo suppressalis, is one of the most destructive rice pests. However, little is known about the regulatory mechanism of metamorphic development in this notorious insect pest. Here, we studied the expression of 22,197 SSB genes at seven time points during pupal development with a customized microarray, identifying 622 differentially expressed genes (DEGs) during pupal development. Gene ontology (GO) analysis of these DEGs indicated that the genes related to substance metabolism were highly expressed in the early pupa and participate in the physiological processes of larval tissue disintegration at these stages. In comparison, highly expressed genes in the late pupal stages were mainly associated with substance biosynthesis, consistent with adult organ formation at these stages. There were 27 solute carrier (SLC) genes that were highly expressed during pupal development. We knocked down SLC22A3 at the prepupal stage, demonstrating that silencing SLC22A3 induced a deficiency in pupal stiffness and pigmentation. The RNAi-treated individuals formed white, soft pupae, suggesting that this gene has an essential role in pupal development. Copyright © 2016 Elsevier Ltd. All rights reserved.
Yan, Yan; Wang, Lianzhe; Ding, Zehong; Tie, Weiwei; Ding, Xupo; Zeng, Changying; Wei, Yunxie; Zhao, Hongliang; Peng, Ming; Hu, Wei
2016-01-01
Mitogen-activated protein kinases (MAPKs) play central roles in plant developmental processes, hormone signaling transduction, and responses to abiotic stress. However, no data are currently available about the MAPK family in cassava, an important tropical crop. Herein, 21 MeMAPK genes were identified from cassava. Phylogenetic analysis indicated that MeMAPKs could be classified into four subfamilies. Gene structure analysis demonstrated that the number of introns in MeMAPK genes ranged from 1 to 10, suggesting large variation among cassava MAPK genes. Conserved motif analysis indicated that all MeMAPKs had typical protein kinase domains. Transcriptomic analysis suggested that MeMAPK genes showed differential expression patterns in distinct tissues and in response to drought stress between wild subspecies and cultivated varieties. Interaction networks and co-expression analyses revealed that crucial pathways controlled by MeMAPK networks may be involved in the differential response to drought stress in different accessions of cassava. Expression of nine selected MAPK genes showed that these genes could comprehensively respond to osmotic, salt, cold, oxidative stressors, and abscisic acid (ABA) signaling. These findings yield new insights into the transcriptional control of MAPK gene expression, provide an improved understanding of abiotic stress responses and signaling transduction in cassava, and lead to potential applications in the genetic improvement of cassava cultivars. PMID:27625666
Freytag, Virginie; Probst, Sabine; Hadziselimovic, Nils; Boglari, Csaba; Hauser, Yannick; Peter, Fabian; Gabor Fenyves, Bank; Milnik, Annette; Demougin, Philippe; Vukojevic, Vanja; de Quervain, Dominique J-F; Papassotiropoulos, Andreas; Stetak, Attila
2017-07-12
The identification of genes related to encoding, storage, and retrieval of memories is a major interest in neuroscience. In the current study, we analyzed the temporal gene expression changes in a neuronal mRNA pool during an olfactory long-term associative memory (LTAM) in Caenorhabditis elegans hermaphrodites. Here, we identified a core set of 712 (538 upregulated and 174 downregulated) genes that follows three distinct temporal peaks demonstrating multiple gene regulation waves in LTAM. Compared with the previously published positive LTAM gene set (Lakhina et al., 2015), 50% of the identified upregulated genes here overlap with the previous dataset, possibly representing stimulus-independent memory-related genes. On the other hand, the remaining genes were not previously identified in positive associative memory and may specifically regulate aversive LTAM. Our results suggest a multistep gene activation process during the formation and retrieval of long-term memory and define general memory-implicated genes as well as conditioning-type-dependent gene sets. SIGNIFICANCE STATEMENT The identification of genes regulating different steps of memory is of major interest in neuroscience. Identification of common memory genes across different learning paradigms and the temporal activation of the genes are poorly studied. Here, we investigated the temporal aspects of Caenorhabditis elegans gene expression changes using aversive olfactory associative long-term memory (LTAM) and identified three major gene activation waves. Like in previous studies, aversive LTAM is also CREB dependent, and CREB activity is necessary immediately after training. Finally, we define a list of memory paradigm-independent core gene sets as well as conditioning-dependent genes. Copyright © 2017 the authors 0270-6474/17/376661-12$15.00/0.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Howe, John A.; Xiao, Li; Fischmann, Thierry O.
2016-08-02
Bacterial riboswitches are non-coding RNA structural elements that direct gene expression in numerous metabolic pathways. The key regulatory roles of riboswitches, and the urgent need for new classes of antibiotics to treat multi-drug resistant bacteria, have led to efforts to develop small molecules that mimic natural riboswitch ligands to inhibit metabolic pathways and bacterial growth. Recently, we reported the results of a phenotypic screen targeting the riboflavin biosynthesis pathway in the Gram-negative bacterium Escherichia coli that led to the identification of ribocil, a small molecule inhibitor of the flavin mononucleotide (FMN) riboswitch controlling expression of this biosynthetic pathway. Although ribocil is structurally distinct from FMN, ribocil functions as a potent and highly selective synthetic mimic of the natural ligand to repress riboswitch-mediated ribB gene expression and inhibit bacterial growth both in vitro and in vivo. Herein, we expand our analysis of ribocil, including mode of binding in the FMN binding pocket of the riboswitch, mechanisms of resistance and structure-activity relationship guided efforts to generate more potent analogs.
USDA-ARS's Scientific Manuscript database
Repetitive sequence analysis has become an integral part of genome sequencing projects in addition to gene identification and annotation. Identification of repeats is important not only because it improves gene prediction, but also because of the role that repetitive sequences play in determining th...
2010-03-01
amino acid substitution in this gene has been associated with uric acid nephrolithiasis (32). Recent GWAS have identified another variant within this...Identification of a novel gene and a common variant associated with uric acid nephrolithiasis in a Sardinian genetic isolate. Am J Hum Genet 72
Barcoding of fresh water fishes from Pakistan.
Karim, Asma; Iqbal, Asad; Akhtar, Rehan; Rizwan, Muhammad; Amar, Ali; Qamar, Usman; Jahan, Shah
2016-07-01
DNA barcoding is a taxonomic method that uses small genetic markers in organisms' mitochondrial DNA (mtDNA) for identification of particular species. It uses sequence diversity in a 658-base pair fragment near the 5' end of the mitochondrial cytochrome c oxidase subunit 1 (CO1) gene as a tool for species identification. DNA barcoding is a more accurate and reliable method than morphological identification. It is equally useful in juvenile and adult stages of fishes. The present study was conducted to genetically identify three farm fish species of Pakistan (Cyprinus carpio, Cirrhinus mrigala, and Ctenopharyngodon idella), all belonging to the family Cyprinidae. The CO1 gene was amplified, and the PCR products were sequenced and analyzed with bioinformatics software. Conspecific, congeneric, and confamilial K2P nucleotide divergence was estimated. From these findings, it was concluded that the CO1 gene sequence may serve as a milestone for the identification of related species at the molecular level.
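The K2P divergence mentioned in this abstract is the Kimura 2-parameter distance, d = -1/2 ln[(1 - 2P - Q) * sqrt(1 - 2Q)], where P and Q are the proportions of sites that differ by a transition and by a transversion, respectively. The sketch below computes it for two aligned sequences; the function name, the handling of gaps and the toy sequences are assumptions made for illustration, not details taken from the study.

```python
import math

PURINES = {"A", "G"}


def k2p_distance(seq1, seq2):
    """Kimura 2-parameter distance between two aligned DNA sequences.

    Sites where either sequence has a gap or an ambiguous base are skipped.
    """
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    transitions = transversions = compared = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue
        compared += 1
        if a == b:
            continue
        # Purine<->purine or pyrimidine<->pyrimidine changes are transitions
        if (a in PURINES) == (b in PURINES):
            transitions += 1
        else:
            transversions += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))


# Toy 20-bp alignment: one transition (A->G) and one transversion (C->A)
ref = "ACGTACGTACGTACGTACGT"
alt = "GAGTACGTACGTACGTACGT"
print(round(k2p_distance(ref, alt), 4))
```

The distance comes out slightly above the raw proportion of differing sites (0.1 here), because the model corrects for multiple substitutions at the same site.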
Possibilities in identification of genomic species of Burkholderia cepacia complex by PCR and RFLP.
Navrátilová, Lucie; Chromá, Magdalena; Hanulík, Vojtech; Raclavský, Vladislav
2013-01-01
The strains belonging to the Burkholderia cepacia complex are important opportunistic pathogens in immunocompromised patients and cause serious diseases. Isolates can be obtained from soil, water, plants and human samples. Taxonomy of this group is difficult. The Burkholderia cepacia complex consists of seventeen genomic species, and its genetic scheme is based on the recA gene. The first five genomovars commonly occur in humans, mostly genomovars II and III (subdivision IIIA). In this study we tested identification of the first five genomovars by PCR followed by melting analysis and RFLP. The experiments targeted eubacterial 16S rDNA and the specific gene recA, which allowed identification of all five genomovars. The recA gene proved more suitable than 16S rDNA, which enabled direct identification of only genomovars II and V; genomovars I, III and IV had similar 16S rDNA sequences.
Syed, Khajamohiddin; Shale, Karabo; Pagadala, Nataraj Sekhar; Tuszynski, Jack
2014-01-01
Genome sequencing of basidiomycetes, a group of fungi capable of degrading/mineralizing plant material, revealed the presence of numerous cytochrome P450 monooxygenases (P450s) in their genomes, with some exceptions. Considering the large repertoire of P450s found in fungi, it is difficult to identify P450s that play an important role in fungal metabolism and the adaptation of fungi to diverse ecological niches. In this study, we followed Charles Darwin’s theory of natural selection to identify such P450s in model basidiomycete fungi showing a preference for degrading different types of plant components. Any P450 family comprising a large number of member P450s compared to other P450 families indicates its natural selection over other P450 families owing to its important role in fungal physiology. Genome-wide comparative P450 analysis in the basidiomycete species Phanerochaete chrysosporium, Phanerochaete carnosa, Agaricus bisporus, Postia placenta, Ganoderma sp. and Serpula lacrymans revealed enrichment of 11 P450 families (out of 68 P450 families): CYP63, CYP512, CYP5035, CYP5037, CYP5136, CYP5141, CYP5144, CYP5146, CYP5150, CYP5348 and CYP5359. Phylogenetic analysis of the P450 family showed species-specific alignment of P450s across the P450 families with the exception of P450s of Phanerochaete chrysosporium and Phanerochaete carnosa, suggesting paralogous evolution of P450s in model basidiomycetes. P450 gene-structure analysis revealed high conservation in the size of exons and the location of introns. P450s with the same gene structure were found tandemly arranged in the genomes of selected fungi. This clearly suggests that extensive gene duplications, particularly tandem gene duplications, led to the enrichment of selective P450 families in basidiomycetes. Functional analysis and gene expression profiling data suggest that members of the P450 families are catalytically versatile and possibly involved in fungal colonization of plant material. 
To our knowledge, this is the first report on the identification and comparative-evolutionary analysis of P450 families enriched in model basidiomycetes. PMID:24466198
Paul, Catherine J; Twine, Susan M; Tam, Kevin J; Mullen, James A; Kelly, John F; Austin, John W; Logan, Susan M
2007-05-01
Strains of Clostridium botulinum are traditionally identified by botulinum neurotoxin type; however, identification of an additional target for typing would improve differentiation. Isolation of flagellar filaments and analysis by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) showed that C. botulinum produced multiple flagellin proteins. Nano-liquid chromatography-tandem mass spectrometry (nLC-MS/MS) analysis of in-gel tryptic digests identified peptides in all flagellin bands that matched two homologous tandem flagellin genes identified in the C. botulinum Hall A genome. Designated flaA1 and flaA2, these open reading frames encode the major structural flagellins of C. botulinum. Colony PCR and sequencing of flaA1/A2 variable regions classified 80 environmental and clinical strains into group I or group II and clustered isolates into 12 flagellar types. Flagellar type was distinct from neurotoxin type, and epidemiologically related isolates clustered together. Sequencing a larger PCR product, obtained during amplification of flaA1/A2 from type E strain Bennett identified a second flagellin gene, flaB. LC-MS analysis confirmed that flaB encoded a large type E-specific flagellin protein, and the predicted molecular mass for FlaB matched that observed by SDS-PAGE. In contrast, the molecular mass of FlaA was 2 to 12 kDa larger than the mass predicted by the flaA1/A2 sequence of a given strain, suggesting that FlaA is posttranslationally modified. While identification of FlaB, and the observation by SDS-PAGE of different masses of the FlaA proteins, showed the flagellin proteins of C. botulinum to be diverse, the presence of the flaA1/A2 gene in all strains examined facilitates single locus sequence typing of C. botulinum using the flagellin variable region.
LCGbase: A Comprehensive Database for Lineage-Based Co-regulated Genes.
Wang, Dapeng; Zhang, Yubin; Fan, Zhonghua; Liu, Guiming; Yu, Jun
2012-01-01
Animal genes of different lineages, such as vertebrates and arthropods, are well-organized and blended into dynamic chromosomal structures that represent a primary regulatory mechanism for body development and cellular differentiation. The majority of genes in a genome are actually clustered, and these clusters are evolutionarily stable to different extents and biologically meaningful when evaluated among genomes within and across lineages. Until now, many questions concerning gene organization, such as what is the minimal number of genes in a cluster and what is the driving force leading to gene co-regulation, remain to be addressed. Here, we provide a user-friendly database, LCGbase (a comprehensive database for lineage-based co-regulated genes), hosting information on evolutionary dynamics of gene clustering and ordering within animal kingdoms in two different lineages: vertebrates and arthropods. The database is constructed on a web-based Linux-Apache-MySQL-PHP framework and provides an effective interactive user-inquiry service. Compared to other gene annotation databases with similar purposes, our database has three clear advantages. First, our database is inclusive, including all high-quality genome assemblies of vertebrates and representative arthropod species. Second, it is human-centric since we map all gene clusters from other genomes in an order of lineage-ranks (such as primates, mammals, warm-blooded, and reptiles) onto the human genome and start the database from well-defined gene pairs (a minimal cluster where the two adjacent genes are oriented as co-directional, convergent, and divergent pairs) to large gene clusters. Furthermore, users can search for any adjacent genes and their detailed annotations. Third, the database provides flexible parameter definitions, such as the distance of transcription start sites between two adjacent genes, which is extendable to genes flanking the cluster across species.
We also provide useful tools for sequence alignment, gene ontology (GO) annotation, promoter identification, gene expression (co-expression), and evolutionary analysis. This database not only provides a way to define lineage-specific and species-specific gene clusters but also facilitates future studies on gene co-regulation, epigenetic control of gene expression (DNA methylation and histone marks), and chromosomal structures in a context of gene clusters and species evolution. LCGbase is freely available at http://lcgbase.big.ac.cn/LCGbase.
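The gene-pair definitions in this abstract (co-directional, convergent and divergent adjacent genes, and the distance between their transcription start sites) can be illustrated with a minimal Python sketch. The `Gene` record, the function names and the coordinate convention are assumptions made for illustration; they are not taken from LCGbase itself.

```python
from typing import NamedTuple


class Gene(NamedTuple):
    """Hypothetical gene record; start < end, strand is '+' or '-'."""
    name: str
    start: int
    end: int
    strand: str


def tss(gene: Gene) -> int:
    """Transcription start site: the 5' end of the gene on its own strand."""
    return gene.start if gene.strand == "+" else gene.end


def pair_orientation(left: Gene, right: Gene) -> str:
    """Classify two adjacent genes, with `left` lying upstream on the chromosome."""
    if left.strand == right.strand:
        return "co-directional"   # -> ->  or  <- <-
    if left.strand == "+" and right.strand == "-":
        return "convergent"       # -> <-
    return "divergent"            # <- ->


def tss_distance(a: Gene, b: Gene) -> int:
    """Absolute distance between the two transcription start sites."""
    return abs(tss(a) - tss(b))
```

Under these assumptions, a pair such as `Gene("a", 100, 200, "-")` and `Gene("b", 300, 400, "+")` would be classified as divergent, with the two start sites 100 bp apart.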
2012-09-30
computational tools provide the ability to display, browse, select, filter and summarize spatio-temporal relationships of these individual-based...her research assistant at Esri, Shaun Walbridge, and members of the Marine Mammal Institute (MMI), including Tomas Follet and Debbie Steel. This...Genomics Laboratory, MMI, OSU. 4 As part of the geneGIS initiative, these SPLASH photo-identification records and the geneSPLASH DNA profiles
Oligonucleotide microarray for the identification of potential mycotoxigenic fungi
2010-01-01
Background Mycotoxins are secondary metabolites which are produced by numerous fungi and pose a continuous challenge to the safety and quality of food commodities in South Africa. These toxins have toxicologically relevant effects on humans and animals that eat contaminated foods. In this study, a diagnostic DNA microarray was developed for the identification of the most common food-borne fungi, as well as the genes leading to toxin production. Results A total of 40 potentially mycotoxigenic fungi isolated from different food commodities, as well as the genes that are involved in the mycotoxin synthetic pathways, were analyzed. For fungal identification, oligonucleotide probes were designed by exploiting the sequence variations of the elongation factor 1-alpha (EF-1 α) coding regions and the internal transcribed spacer (ITS) regions of the rRNA gene cassette. For the detection of fungi able to produce mycotoxins, oligonucleotide probes directed towards genes leading to toxin production from different fungal strains were identified in data available in the public domain. The probes selected for fungal identification and the probes specific for toxin producing genes were spotted onto microarray slides. Conclusions The diagnostic microarray developed can be used to identify single pure strains or cultures of potentially mycotoxigenic fungi as well as genes leading to toxin production in both laboratory samples and maize-derived foods offering an interesting potential for microbiological laboratories. PMID:20307326
Bénit, Paule; Steffann, Julie; Lebon, Sophie; Chretien, Dominique; Kadhom, Noman; de Lonlay, Pascale; Goldenberg, Alice; Dumez, Yves; Dommergues, Marc; Rustin, Pierre; Munnich, Arnold; Rötig, Agnès
2003-05-01
Complex I deficiency, the most common cause of mitochondrial disorders, accounts for a variety of clinical symptoms and its genetic heterogeneity makes identification of the disease genes particularly tedious. Indeed, most of the 43 complex I subunits are encoded by nuclear genes, only seven of them being mitochondrially encoded. In order to offer urgent prenatal diagnosis, we have studied an inbred/multiplex family with complex I deficiency by using microsatellite DNA markers flanking the putative disease loci. Microsatellite DNA markers have allowed us to exclude the NDUFS7, NDUFS8, NDUFV1 and NDUFS1 genes and to find homozygosity at the NDUFS4 locus. Direct sequencing has led to identification of a homozygous splice acceptor site mutation in intron 1 of the NDUFS4 gene (IVS1nt -1, G-->A); this was not found in chorion villi of the ongoing pregnancy. We suggest that genotyping microsatellite DNA markers at putative disease loci in inbred/multiplex families helps to identify the disease-causing mutation. More generally, we suggest giving consideration to a more systematic microsatellite analysis of putative disease loci for identification of disease genes in inbred/multiplex families affected with genetically heterogeneous conditions.
Uronic polysaccharide degrading enzymes.
Garron, Marie-Line; Cygler, Miroslaw
2014-10-01
In the past several years progress has been made in the field of structure and function of polysaccharide lyases (PLs). The number of classified polysaccharide lyase families has increased to 23 and more detailed analysis has allowed the identification of more closely related subfamilies, leading to stronger correlation between each subfamily and a unique substrate. The number of as yet unclassified polysaccharide lyases has also increased and we expect that sequencing projects will allow many of these unclassified sequences to emerge as new families. The progress in structural analysis of PLs has led to having at least one representative structure for each of the families and for two unclassified enzymes. The newly determined structures have folds observed previously in other PL families and their catalytic mechanisms follow either metal-assisted or Tyr/His mechanisms characteristic for other PL enzymes. Comparison of PLs with glycoside hydrolases (GHs) shows several folds common to both classes but only for the β-helix fold is there strong indication of divergent evolution from a common ancestor. Analysis of bacterial genomes identified gene clusters containing multiple polysaccharide cleaving enzymes, the Polysaccharides Utilization Loci (PULs), and their gene complement suggests that they are organized to process completely a specific polysaccharide. Copyright © 2014 Elsevier Ltd. All rights reserved.
Characterizing visible and invisible cell wall mutant phenotypes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Carpita, Nicholas C.; McCann, Maureen C.
2015-04-06
About 10% of a plant's genome is devoted to generating the protein machinery to synthesize, remodel, and deconstruct the cell wall. High-throughput genome sequencing technologies have enabled a reasonably complete inventory of wall-related genes that can be assembled into families of common evolutionary origin. Assigning function to each gene family member has been aided immensely by identification of mutants with visible phenotypes or by chemical and spectroscopic analysis of mutants with ‘invisible’ phenotypes of modified cell wall composition and architecture that do not otherwise affect plant growth or development. This review connects the inference of gene function on the basis of deviation from the wild type in genetic functional analyses to insights provided by modern analytical techniques that have brought us ever closer to elucidating the sequence structures of the major polysaccharide components of the plant cell wall.
In silico identification of novel ligands for G-quadruplex in the c-MYC promoter
NASA Astrophysics Data System (ADS)
Kang, Hyun-Jin; Park, Hyun-Ju
2015-04-01
G-quadruplex DNA formed in NHEIII1 region of oncogene promoter inhibits transcription of the genes. In this study, virtual screening combining pharmacophore-based search and structure-based docking screening was conducted to discover ligands binding to G-quadruplex in promoter region of c-MYC. Several hit ligands showed the selective PCR-arresting effects for oligonucleotide containing c-MYC G-quadruplex forming sequence. Among them, three hits selectively inhibited cell proliferation and decreased c-MYC mRNA level in Ramos cells, where NHEIII1 is included in translocated c-MYC gene for overexpression. Promoter assay using two kinds of constructs with wild-type and mutant sequences showed that interaction of these ligands with the G-quadruplex resulted in turning-off of the reporter gene. In conclusion, combined virtual screening methods were successfully used for discovery of selective c-MYC promoter G-quadruplex binders with anticancer activity.
Sumner, Lloyd W.; Lei, Zhentian; Nikolau, Basil J.; ...
2014-10-24
Plant metabolomics has matured and modern plant metabolomics has accelerated gene discoveries and the elucidation of a variety of plant natural product biosynthetic pathways. This study highlights specific examples of the discovery and characterization of novel genes and enzymes associated with the biosynthesis of natural products such as flavonoids, glucosinolates, terpenoids, and alkaloids. Additional examples of the integration of metabolomics with genome-based functional characterizations of plant natural products that are important to modern pharmaceutical technology are also reviewed. This article also provides a substantial review of recent technical advances in mass spectrometry imaging, nuclear magnetic resonance imaging, integrated LC-MS-SPE-NMR for metabolite identifications, and x-ray crystallography of microgram quantities for structural determinations. The review closes with a discussion on the future prospects of metabolomics related to crop species and herbal medicine.
Identification of the Viridicatumtoxin and Griseofulvin Gene Clusters from Penicillium aethiopicum
Chooi, Yit-Heng; Cacho, Ralph; Tang, Yi
2010-01-01
SUMMARY Penicillium aethiopicum produces two structurally interesting and biologically active polyketides: the tetracycline-like viridicatumtoxin 1 and the classic antifungal agent griseofulvin 2. Here, we report the concurrent discovery of the two corresponding biosynthetic gene clusters (vrt and gsf) by 454 shotgun sequencing. Gene deletions confirmed two nonreducing PKSs (NRPKS), vrtA and gsfA, are required for the biosynthesis of 1 and 2, respectively. Both PKSs share similar domain architectures and lack a C-terminal thioesterase domain. We identified gsfI as the chlorinase involved in the biosynthesis of 2, as deletion of gsfI resulted in the accumulation of dechlorogriseofulvin 3. Comparative analysis with the P. chrysogenum genome revealed that both clusters are embedded within conserved syntenic regions of P. aethiopicum chromosomes. Discovery of the vrt and gsf clusters provided the basis for genetic and biochemical studies of the pathways. PMID:20534346
Liu, S; Liu, L; Tang, Y; Xiong, S; Long, J; Liu, Z; Tian, N
2017-07-01
The regulatory mechanism of flavonoids, which synergise anti-malarial and anti-cancer compounds in Artemisia annua, is still unclear. In this study, an anthocyanidin-accumulating mutant callus was induced from A. annua and comparative transcriptomic analysis of wild-type and mutant calli performed, based on the next-generation Illumina/Solexa sequencing platform and de novo assembly. A total of 82,393 unigenes were obtained and 34,764 unigenes were annotated in the public database. Among these, 87 unigenes were assigned to 14 structural genes involved in the flavonoid biosynthetic pathway and 37 unigenes were assigned to 17 structural genes related to metabolism of flavonoids. More than 30 unigenes were assigned to regulatory genes, including R2R3-MYB, bHLH and WD40, which might regulate flavonoid biosynthesis. A further 29 unigenes encoding flavonoid biosynthetic enzymes or transcription factors were up-regulated in the mutant, while 19 unigenes were down-regulated, compared with the wild type. Expression levels of nine genes involved in the flavonoid pathway were compared using semi-quantitative RT-PCR, and results were consistent with comparative transcriptomic analysis. Finally, a putative flavonol synthase gene (AaFLS1) was identified by enzyme assays in vitro and in vivo through heterologous expression, which confirmed the comparative transcriptomic analysis of wild-type and mutant callus. The present work has provided important target genes for the regulation of flavonoid biosynthesis in A. annua. © 2017 German Botanical Society and The Royal Botanical Society of the Netherlands.
SASD: the Synthetic Alternative Splicing Database for identifying novel isoform from proteomics
2013-01-01
Background Alternative splicing is an important and widespread mechanism for generating protein diversity and regulating protein expression. High-throughput identification and analysis of alternative splicing at the protein level has more advantages than at the mRNA level. The combination of an alternative splicing database and tandem mass spectrometry provides a powerful technique for identification, analysis and characterization of potential novel alternative splicing protein isoforms from proteomics. Therefore, based on the peptidomic database of human protein isoforms for proteomics experiments, our objective is to design a new alternative splicing database to 1) provide more coverage of genes, transcripts and alternative splicing, 2) exclusively focus on the alternative splicing, and 3) perform context-specific alternative splicing analysis. Results We used a three-step pipeline to create a synthetic alternative splicing database (SASD) to identify novel alternative splicing isoforms and interpret them in the context of pathway, disease, drug and organ specificity or custom gene set with maximum coverage and exclusive focus on alternative splicing. First, we extracted information on gene structures of all genes in the Ensembl Genes 71 database and incorporated the Integrated Pathway Analysis Database. Then, we compiled artificial splicing transcripts. Lastly, we translated the artificial transcripts into alternative splicing peptides. The SASD is a comprehensive database containing 56,630 genes (Ensembl gene IDs), 95,260 transcripts (Ensembl transcript IDs), and 11,919,779 Alternative Splicing peptides, and also covering about 1,956 pathways, 6,704 diseases, 5,615 drugs, and 52 organs. The database has a web-based user interface that allows users to search, display and download a single gene/transcript/protein, custom gene set, pathway, disease, drug, organ related alternative splicing.
Moreover, the quality of the database was validated with comparison to other known databases and two case studies: 1) in liver cancer and 2) in breast cancer. Conclusions The SASD provides the scientific community with an efficient means to identify, analyze, and characterize novel Exon Skipping and Intron Retention protein isoforms from mass spectrometry and interpret them in the context of pathway, disease, drug and organ specificity or custom gene set with maximum coverage and exclusive focus on alternative splicing. PMID:24267658
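The second and third steps of the pipeline described above (compiling artificial splicing transcripts, then translating them) can be sketched in Python for the exon-skipping case. This is only an illustration under simple assumptions: exons are given as uppercase DNA strings in transcript order, translation starts at the first base, and one internal exon is skipped at a time. The function names are hypothetical and the abstract does not specify the rules SASD actually applies.

```python
from itertools import product

# Standard genetic code, built in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def exon_skipping_transcripts(exons):
    """Return artificial transcripts, each omitting one internal exon."""
    exons = list(exons)
    transcripts = {}
    for i in range(1, len(exons) - 1):  # keep the first and last exons
        transcripts[f"skip_exon_{i + 1}"] = "".join(exons[:i] + exons[i + 1:])
    return transcripts


def translate(seq):
    """Translate uppercase DNA from its first base until a stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)
```

For a toy gene with exons `["ATGGCC", "AAA", "TTTTAA"]`, the sketch yields one artificial transcript lacking the second exon, which can then be passed to `translate` to obtain the corresponding peptide.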
Singh, Vikas K; Khan, Aamir W; Saxena, Rachit K; Sinha, Pallavi; Kale, Sandip M; Parupalli, Swathi; Kumar, Vinay; Chitikineni, Annapurna; Vechalapu, Suryanarayana; Sameer Kumar, Chanda Venkata; Sharma, Mamta; Ghanta, Anuradha; Yamini, Kalinati Narasimhan; Muniswamy, Sonnappa; Varshney, Rajeev K
2017-07-01
Identification of candidate genomic regions associated with target traits using conventional mapping methods is challenging and time-consuming. In recent years, a number of single nucleotide polymorphism (SNP)-based mapping approaches have been developed and used for identification of candidate/putative genomic regions. However, in the majority of these studies, insertion-deletions (Indels) were largely ignored. For efficient use of Indels in mapping target traits, we propose the Indel-seq approach, which is a combination of whole-genome resequencing (WGRS) and bulked segregant analysis (BSA) and relies on the Indel frequencies in extreme bulks. Deployment of the Indel-seq approach for identification of candidate genomic regions associated with fusarium wilt (FW) and sterility mosaic disease (SMD) resistance in pigeonpea has identified 16 Indels affecting 26 putative candidate genes. Of these 26 affected putative candidate genes, 24 genes showed effect in the upstream/downstream of the genic region and two genes showed effect in the genes. Validation of these 16 candidate Indels in other FW- and SMD-resistant and FW- and SMD-susceptible genotypes revealed a significant association of five Indels (three for FW and two for SMD resistance). Comparative analysis of Indel-seq with other genetic mapping approaches highlighted the importance of the approach in identification of significant genomic regions associated with target traits. Therefore, the Indel-seq approach can be used for quick and precise identification of candidate genomic regions for any target traits in any crop species. © 2016 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
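The abstract states only that the approach relies on Indel frequencies in extreme bulks. One way such a comparison might look is sketched below in Python: the Indel allele frequency is computed from read counts in each bulk, and Indels with a large frequency difference are kept. The `IndelCounts` record, the function names and the 0.8 cut-off are all assumptions for illustration, not the published Indel-seq statistic.

```python
from dataclasses import dataclass


@dataclass
class IndelCounts:
    """Hypothetical read counts for one Indel in the two extreme bulks."""
    indel_id: str
    resistant_alt: int      # reads carrying the Indel, resistant bulk
    resistant_total: int    # all reads at the site, resistant bulk
    susceptible_alt: int    # reads carrying the Indel, susceptible bulk
    susceptible_total: int  # all reads at the site, susceptible bulk


def allele_frequency(alt: int, total: int) -> float:
    """Fraction of reads at a site that carry the Indel allele."""
    if total <= 0:
        raise ValueError("read depth must be positive")
    return alt / total


def frequency_difference(c: IndelCounts) -> float:
    """Indel frequency in the resistant bulk minus that in the susceptible bulk."""
    return (allele_frequency(c.resistant_alt, c.resistant_total)
            - allele_frequency(c.susceptible_alt, c.susceptible_total))


def candidate_indels(counts, min_abs_diff=0.8):
    """Keep Indels whose frequency differs strongly between bulks (assumed cut-off)."""
    return [c.indel_id for c in counts
            if abs(frequency_difference(c)) >= min_abs_diff]
```

In practice a depth filter and a statistical test would be needed before calling a region a candidate; the fixed threshold here only shows the shape of the comparison.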
Improving substructure identification accuracy of shear structures using virtual control system
NASA Astrophysics Data System (ADS)
Zhang, Dongyu; Yang, Yang; Wang, Tingqiang; Li, Hui
2018-02-01
Substructure identification is a powerful tool to identify the parameters of a complex structure. Previously, the authors developed an inductive substructure identification method for shear structures. The identification error analysis showed that the identification accuracy of this method is significantly influenced by the magnitudes of two key structural responses near a certain frequency; if these responses are unfavorable, the method cannot provide accurate estimation results. In this paper, a novel method is proposed to improve the substructure identification accuracy by introducing a virtual control system (VCS) into the structure. A virtual control system is a self-balanced system, which consists of some control devices and a set of self-balanced forces. The self-balanced forces counterbalance the forces that the control devices apply on the structure. The control devices are combined with the structure to form a controlled structure used to replace the original structure in the substructure identification; and the self-balance forces are treated as known external excitations to the controlled structure. By optimally tuning the VCS’s parameters, the dynamic characteristics of the controlled structure can be changed such that the original structural responses become more favorable for the substructure identification and, thus, the identification accuracy is improved. A numerical example of 6-story shear structure is utilized to verify the effectiveness of the VCS based controlled substructure identification method. Finally, shake table tests are conducted on a 3-story structural model to verify the efficacy of the VCS to enhance the identification accuracy of the structural parameters.
Hayashi, Yukiko
2013-01-01
Myofibrillar myopathy (MFM) is a group of hereditary disorders pathologically characterized by focal disorganization of myofibril structures with cytoplasmic inclusions. Most of the diseases known as desmin-related or storage myopathy, cytoplasmic body myopathy, spheroid body myopathy, reducing body myopathy, and hyaline body myopathy are included in MFM. Several causative genes have been identified, such as DES, CRYAB, MYOT, ZASP, BAG3, FLNC, DNAJB6, FHL1, TTN, and VCP. Most of these genes encode Z-line related proteins or proteins associated with protein quality control. Since MFM is named for its pathological characteristics, the clinical features of the patients, including the age at disease onset, affected muscles, disease course, and complications, are quite variable. In this paper, characteristic clinical and pathological features of each causative gene are summarized. Unexpectedly, hereditary myopathy with early respiratory failure (HMERF) caused by mutation in the A-band region of TTN is the most common cause of MFM in our cohort. Despite intensive mutation screening, the causative gene of more than 60% of MFM patients is still unknown. Further identification of novel causative genes and elucidation of the pathomechanisms of protein aggregation are necessary.
Wang, Meng; Xu, Zongchang; Ding, Anming; Kong, Yingzhen
2018-05-24
Xyloglucan endotransglucosylase/hydrolase genes (XTHs) encode enzymes required for the reconstruction and modification of xyloglucan backbones, which result in changes of cell wall extensibility during growth. A total of 56 NtXTH genes were identified from common tobacco, and 50 cDNA fragments were verified by PCR amplification. The 56 NtXTH genes could be classified into two subfamilies, Group I/II and Group III, according to their phylogenetic relationships. The gene structure, chromosomal localization, conserved protein domain prediction, sub-cellular localization of NtXTH proteins and evolutionary relationships among Nicotiana tabacum, Nicotiana sylvestris, Nicotiana tomentosiformis, Arabidopsis, and rice were also analyzed. The NtXTH expression profiles analyzed with the TobEA database and qRT-PCR revealed that NtXTHs display different expression patterns in different tissues. Notably, the expression patterns of 12 NtXTHs responding to environmental stresses, including salinity, alkali, heat, and chilling, and to plant hormones, including IAA and brassinolide, were characterized. These results will be useful for studying the function of NtXTHs during different growth cycles and stresses.
Addressing the Challenges of Pathogen Evolution on the World's Arable Crops.
Burdon, Jeremy J; Zhan, Jiasui; Barrett, Luke G; Papaïx, Julien; Thrall, Peter H
2016-10-01
Advances in genomic and molecular technologies, coupled with an increasing understanding of the fine structure of many resistance and infectivity genes, have opened up a new era of hope in controlling the many plant pathogens that continue to be a major source of loss in arable crops. Some new approaches are under consideration including the use of nonhost resistance and the targeting of critical developmental constraints. However, the major thrust of these genomic and molecular approaches is to enhance the identification of resistance genes, to increase their ease of manipulation through marker and gene editing technologies and to lock a range of resistance genes together in simply manipulable resistance gene cassettes. All these approaches essentially continue a strategy that assumes the ability to construct genetic-based resistance barriers that are insurmountable to target pathogens. Here we show how the recent advances in knowledge and marker technologies can be used to generate more durable disease resistance strategies that are based on broad evolutionary principles aimed at presenting pathogens with a shifting landscape of fluctuating directional selection.
Zhang, Zhenzhu; Chen, Xiuling; Guan, Xin; Liu, Yang; Chen, Hongyu; Wang, Tingting; Mouekouba, Liana Dalcantara Ongouya; Li, Jingfu; Wang, Aoxue
2014-01-01
Homeodomain-leucine zipper (HD-Zip) proteins are transcription factors that play a vital role in plant growth and development. However, no detailed information on the HD-Zip family in tomato has been reported until now. In this study, 51 HD-Zip genes (SlHZ01-51) in this family were identified in the tomato (Solanum lycopersicum) genome and categorized into 4 classes by exon-intron and protein structure. A combined phylogenetic tree of tomato, Arabidopsis and rice HD-Zip genes was established for insight into their evolutionary relationships and putative functions. The results showed that the contribution of segmental duplication was larger than that of tandem duplication for expansion and evolution of genes in this family of tomato. The expression profile results under abiotic stress suggested that all SlHZ I genes were responsive to cold stress. This study will provide a clue for the further investigation of functional identification and the role of the tomato HD-Zip I subfamily in plant cold stress responses and developmental events.
Li, Chunquan; Han, Junwei; Yao, Qianlan; Zou, Chendan; Xu, Yanjun; Zhang, Chunlong; Shang, Desi; Zhou, Lingyun; Zou, Chaoxia; Sun, Zeguo; Li, Jing; Zhang, Yunpeng; Yang, Haixiu; Gao, Xu; Li, Xia
2013-05-01
Various 'omics' technologies, including microarrays and gas chromatography mass spectrometry, can be used to identify hundreds of interesting genes, proteins and metabolites, such as differential genes, proteins and metabolites associated with diseases. Identifying metabolic pathways has become an invaluable aid to understanding the genes and metabolites associated with the conditions under study. However, the classical methods used to identify pathways fail to accurately consider the joint power of interesting genes/metabolites and the key regions impacted by them within metabolic pathways. In this study, we propose a powerful analytical method referred to as Subpathway-GM for the identification of metabolic subpathways. This provides a more accurate level of pathway analysis by integrating information from genes and metabolites, and their positions and cascade regions within the given pathway. We analyzed two colorectal cancer and one metastatic prostate cancer data sets and demonstrated that Subpathway-GM was able to identify disease-relevant subpathways whose corresponding entire pathways might be ignored using classical entire pathway identification methods. Further analysis indicated that the power of a joint genes/metabolites and subpathway strategy based on their topologies may play a key role in reliably recalling disease-relevant subpathways and finding novel subpathways.
Recognizing the enemy within: licensing RNA-guided genome defense
Dumesic, Phillip A.; Madhani, Hiten D.
2014-01-01
How do cells distinguish normal genes from transposons? Although much has been learned about RNAi-related RNA silencing pathways responsible for genome defense, this fundamental question remains. The literature points to several classes of mechanisms. In some cases, double-stranded RNA structures produced by transposon inverted repeats or antisense integration trigger endo-siRNA biogenesis. In other instances, DNA features associated with transposons—such as their unusual copy number, chromosomal arrangement, and/or chromatin environment—license RNA silencing. Finally, recent studies have identified improper transcript processing events, such as stalled pre-mRNA splicing, as signals for siRNA production. Thus, the suboptimal gene expression properties of selfish elements can enable their identification by RNA silencing pathways. PMID:24280023
The hppA gene of Helicobacter pylori encodes the class C acid phosphatase precursor.
Godlewska, Renata; Bujnicki, Janusz M; Ostrowski, Jerzy; Jagusztyn-Krynicka, Elzbieta K
2002-08-14
Screening of the Helicobacter pylori genomic library with sera from infected humans and from immunized rabbits resulted in identification of the 25 kDa cell envelope protein (HppA), which exhibits acid phosphatase activity. Enzyme activity was demonstrated by specific enzymatic assays with whole-cell protein preparations of H. pylori strain N6 and of Escherichia coli carrying the hppA gene (pUWM192). HppA showed optimum activity at pH 5.6 and was resistant to inhibition by EDTA. Bioinformatics analysis and site-directed mutagenesis of two putative active site residues (D73 and D192) provide further insight into the sequence-structure-function relationships of HppA as a member of the DDDD phosphohydrolase superfamily.
Demirci, Berna; Lee, Yoosook; Lanzaro, Gregory C; Alten, Bulent
2012-05-01
Culex theileri Theobald (Diptera: Culicidae) is one of the most common mosquito species in northeastern Turkey and serves as a vector for various zoonotic diseases including West Nile virus. Although there have been some studies on the ecology of Cx. theileri, very little genetic data has been made available. We successfully sequenced 11 gene fragments from Cx. theileri specimens collected from the northeastern part of Turkey. On average, we found a single nucleotide polymorphism (SNP) every 45 bp. Transitions outnumbered transversions at a ratio of 2:1. This is the first report of genetic polymorphisms in Cx. theileri, and the SNPs discovered in this study can be used to investigate population structure and gene-environment interactions.
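The two summary statistics reported here (SNP density and the transition-to-transversion ratio) follow from simple counting, as the Python sketch below illustrates. A transition exchanges two purines (A, G) or two pyrimidines (C, T); any other substitution is a transversion. The function names and the toy input are hypothetical, not the study's data.

```python
from collections import Counter

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}
NUCLEOTIDES = PURINES | PYRIMIDINES


def classify_substitution(ref: str, alt: str) -> str:
    """Label a single-base substitution as a transition or a transversion."""
    ref, alt = ref.upper(), alt.upper()
    if ref == alt or ref not in NUCLEOTIDES or alt not in NUCLEOTIDES:
        raise ValueError(f"not a valid substitution: {ref}>{alt}")
    same_class = (ref in PURINES) == (alt in PURINES)
    return "transition" if same_class else "transversion"


def ts_tv_ratio(snps) -> float:
    """Ratio of transitions to transversions over (ref, alt) pairs."""
    counts = Counter(classify_substitution(r, a) for r, a in snps)
    if counts["transversion"] == 0:
        raise ValueError("no transversions observed")
    return counts["transition"] / counts["transversion"]


def bp_per_snp(n_snps: int, total_bp: int) -> float:
    """Average number of sequenced base pairs per SNP."""
    return total_bp / n_snps
```

With two transitions and one transversion, `ts_tv_ratio` returns 2.0, the same ratio the abstract reports; 10 SNPs over 450 bp would correspond to one SNP every 45 bp.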
Winchester, L; Newbury, D F; Monaco, A P; Ragoussis, J
2008-01-01
Copy Number Variants (CNV) and other submicroscopic structural changes are now recognised to be widespread across the human genome. We show that SNP data generated for association study can be utilised for the identification of deletion CNVs. During analysis of data for an SNP association study for Specific Language Impairment (SLI) a deletion was identified. SLI adversely affects the language development of children in the absence of any obvious cause. Previous studies have found linkage to a region on chromosome 16. The deletion was located in a known fragile site FRA16D in intron 5-6 of the WWOX gene (also known as FOR). Changes in the FRA16D site have been previously linked to cancer and are often characterised in cell lines. A long-range PCR assay was used to confirm the existence of the deletion. We also show the breakpoint identification and large-scale characterisation of this CNV in a normal human sample set. Copyright 2009 S. Karger AG, Basel.
QTLomics in Soybean: A Way Forward for Translational Genomics and Breeding
Kumawat, Giriraj; Gupta, Sanjay; Ratnaparkhe, Milind B.; Maranna, Shivakumar; Satpute, Gyanesh K.
2016-01-01
Food legumes play an important role in attaining both food and nutritional security along with sustainable agricultural production for the well-being of humans globally. The various traits of economic importance in legume crops are complex and quantitative in nature and are governed by quantitative trait loci (QTLs). Mapping of quantitative traits is a tedious and costly process. Nevertheless, a large number of QTLs have been mapped in soybean for various traits, although their utilization in breeding programmes is poorly reported. For their effective use in breeding programmes it is imperative to narrow down the confidence interval of QTLs, to identify the underlying genes, and, most importantly, to characterize the alleles of these genes in order to identify superior variants. In the field of functional genomics, especially in the identification and characterization of genes responsible for quantitative traits, soybean is far ahead of other legume crops. The availability of genic information about quantitative traits is all the more significant because identifying homologs is easier and more effective than identifying shared syntenic regions in other crop species. In soybean, genes underlying QTLs have been identified and functionally characterized for phosphorus efficiency, flowering and maturity, pod dehiscence, hard-seededness, α-tocopherol content, soybean cyst nematode, sudden death syndrome, and salt tolerance. Candidate genes have also been identified for many other quantitative traits for which functional validation is required. Using the sequence information of identified genes from soybean, comparative genomic analysis of homologs in other legume crops could discover novel structural variants and useful alleles for functional marker development. The functional markers may be very useful for molecular breeding in soybean and for harnessing the benefit of translational research from soybean to other leguminous crops.
Thus, soybean crop can act as a model crop for translational genomics and breeding of quantitative traits in legume crops. In this review, we summarize current status of identification and characterization of genes underlying QTLs for various quantitative traits in soybean and their significance in translational genomics and breeding of other legume crops. PMID:28066449
Yadav, Manoj Kumar; S, Aravindan; Ngangkham, Umakanta; Shubudhi, H N; Bag, Manas Kumar; Adak, Totan; Munda, Sushmita; Samantaray, Sanghamitra; Jena, Mayabini
2017-01-01
Rice blast disease caused by Magnaporthe oryzae is one of the most destructive diseases, causing huge losses to rice yield in different parts of the world. Therefore, an attempt was made to identify resistance by screening and studying the genetic diversity of eighty rice varieties released by the National Rice Research Institute, Cuttack (NRVs) using molecular markers linked to twelve major blast resistance (R) genes, viz. Pib, Piz, Piz-t, Pik, Pik-p, Pikm, Pik-h, Pita/Pita-2, Pi2, Pi9, Pi1 and Pi5. Of these, nineteen varieties (23.75%) were resistant, twenty-one (26.25%) were moderately resistant, and the remaining forty (50%) were susceptible in the uniform blast nursery. The number of blast resistance genes carried by the rice varieties varied from four to twelve, and the frequencies of the resistance genes ranged from 0 to 100%. The cluster analysis grouped the eighty NRVs into two major clusters at the 63% level of the genetic similarity coefficient. The PIC value for the seventeen markers varied from 0 to 0.37, with an average of 0.20. Of the seventeen markers, only five (195R-1, Pi9-i, Pita3, YL155/YL87 and 40N23r), corresponding to three broad-spectrum R genes (Pi9, Pita/Pita2 and Pi5), were significantly associated with blast disease, explaining 3.5% to 7.7% of the phenotypic variance. The population structure analysis and PCoA divided the 80 NRVs into two sub-groups. The outcome of this study would help to formulate strategies for improving rice blast resistance in India and worldwide through genetic studies, plant-pathogen interaction studies, identification of novel R genes, and development of new resistant varieties through marker-assisted breeding.
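The abstract reports polymorphism information content (PIC) values without stating the formula used. As a hedged illustration, the sketch below assumes the widely used Botstein-style definition, PIC = 1 - Σ p_i² - Σ_{i<j} 2 p_i² p_j², where p_i are allele frequencies at a marker. Under that assumption a two-allele marker cannot exceed 0.375, which would be consistent with the reported range of 0 to 0.37, but the study may have used a different definition. The function name is hypothetical.

```python
from itertools import combinations


def polymorphism_information_content(allele_freqs) -> float:
    """PIC for one marker, assuming the Botstein-style formula.

    allele_freqs: allele frequencies at the marker; they must sum to 1.
    """
    freqs = list(allele_freqs)
    if any(p < 0 for p in freqs) or abs(sum(freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must be non-negative and sum to 1")
    # Expected homozygosity term.
    homozygosity = sum(p * p for p in freqs)
    # Correction for partially informative matings, summed over allele pairs.
    correction = sum(2 * (p * p) * (q * q) for p, q in combinations(freqs, 2))
    return 1.0 - homozygosity - correction


if __name__ == "__main__":
    print(polymorphism_information_content([0.5, 0.5]))  # two equally frequent alleles
    print(polymorphism_information_content([1.0]))       # monomorphic marker
```

A monomorphic marker (present or absent in all 80 varieties) gives a PIC of 0 under this formula, matching the lower end of the reported range.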
Busarcevic, Milos; Dalgalarrondo, Michèle
2012-08-01
The aim of this study was to investigate the antimicrobial potential of Lactobacillus salivarius BGHO1, a human oral strain with probiotic characteristics and a broad inhibitory spectrum both against Gram-positive and Gram-negative pathogens. Here we present the bacteriocin LS2, an extremely pH- and heat-stable peptide with antilisterial activity. LS2 is a novel member of the class IId bacteriocins, unique among all currently characterised bacteriocins. It is somewhat similar to putative bacteriocins from several oral streptococci, including the cariogenic Streptococcus mutans. LS2 is a 41-amino-acid, highly hydrophobic cationic peptide of 4115.1Da that is sensitive to proteolytic enzymes. LS2 was purified from cells of strain BGHO1 by solvent extraction and reverse-phase chromatography. Mass spectrometry was used to determine the molecular mass of the purified peptide. N-terminal amino acid sequencing enabled identification of the LS2 structural gene bacls2 by a reverse genetics approach. Downstream of the bacls2 gene, two bacteriocin-like genes were found, named blp1a and blp1b, and one putative bacteriocin immunity gene named bimlp. We also present the identification of the 242-kb megaplasmid pMPHO1 by pulsed-field gel electrophoresis, which harbours the genes bacls2, blp1a, blp1b and bimlp. Two peptides with antimicrobial activity, whose approximate sizes corresponded to those of blp1a and blp1b, were identified only after culturing strain BGHO1 in a chemically defined medium. This study demonstrated the capacity of Lactobacillus salivarius BGHO1 to produce multiple bacteriocins and further established this strain as a promising probiotic candidate. Copyright © 2012 Elsevier B.V. and the International Society of Chemotherapy. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Adamopoulos, Panagiotis G.; Kontos, Christos K.; Scorilas, Andreas
Tissue kallikrein and kallikrein-related peptidases (KLKs) form the largest group of serine proteases in the human genome, sharing many structural and functional characteristics. Multiple alternative transcripts have been reported for most human KLK genes, and many of them are aberrantly expressed in various malignancies, thus possessing significant prognostic and/or diagnostic value. Alternative splicing of cancer-related genes is a common cellular mechanism accounting for cancer cell transcriptome complexity, as it affects cell cycle control, proliferation, apoptosis, invasion, and metastasis. In this study, we describe the identification and molecular cloning of eight novel transcripts of the human KLK10 gene using 3′ rapid amplification of cDNA ends (3′ RACE) and next-generation sequencing (NGS), as well as their expression analysis in a wide panel of cell lines originating from several distinct cancerous and normal tissues. Bioinformatic analysis revealed that the novel KLK10 transcripts contain new alternative splicing events between already annotated exons as well as novel exons. In addition, investigation of their expression profile in a wide panel of cell lines was performed with nested RT-PCR using variant-specific pairs of primers. Since many KLK mRNA transcripts possess clinical value, these newly discovered alternatively spliced KLK10 transcripts appear as new potential biomarkers for diagnostic and/or prognostic purposes or as targets for therapeutic strategies. - Highlights: • NGS was used to identify novel transcripts of the human KLK10 gene. • 8 novel KLK10 transcripts were identified. • A novel 3′UTR was detected and characterized. • The expression profiles of all 8 novel KLK10 transcripts were identified.
Tan, Wei; Dean, Michael; Law, Amanda J.
2010-01-01
ErbB4 is a growth factor receptor tyrosine kinase essential for neurodevelopment. Genetic variation in ErbB4 is associated with schizophrenia and risk-associated polymorphisms predict overexpression of ErbB4 CYT-1 isoforms in the brain in the disorder. The molecular mechanism of association is unclear because the polymorphisms flank exon 3 of the gene and reside 700 kb distal to the CYT-1 defining exon. We hypothesized that the polymorphisms are indirectly associated with ErbB4 CYT-1 via splicing of exon 3 on the CYT-1 background. We report via cloning and sequencing of adult and fetal human brain cDNA libraries the identification of novel splice isoforms of ErbB4, whereby exon 3 is skipped (del.3). ErbB4 del.3 transcripts exist as CYT-2 isoforms and are predicted to produce truncated proteins. Furthermore, our data refine the structure of the human ErbB4 gene, clarify that juxtamembrane (JM) splice variants of ErbB4, JM-a and JM-b respectively, are characterized by the replacement of a 75 nucleotide (nt) sequence with a 45-nt insertion, and demonstrate that there are four alternative exons in the gene. Our analyses reveal that novel splice variants of ErbB4 exist in the developing and adult human brain and, given the failure to identify ErbB4 del.3 CYT-1 transcripts, suggest that the association of risk polymorphisms in the ErbB4 gene with CYT-1 transcript levels is not mediated via an exon 3 splicing event. PMID:20886074
Genome-based identification and analysis of ionotropic receptors in Spodoptera litura.
Zhu, Jia-Ying; Xu, Zhi-Wen; Zhang, Xin-Min; Liu, Nai-Yong
2018-05-22
The ability to sense and recognize various classes of compounds is of particular importance for survival and reproduction of insects. The ionotropic receptor (IR) family, a sub-family of the ionotropic glutamate receptor family, has been identified as one of the crucial chemoreceptor super-families; it mediates the sensing of odors and/or tastants and also serves non-chemosensory functions. Yet, little is known about IR characteristics, evolution, and functions in Lepidoptera. Here, we identify the IR gene repertoire from a destructive polyphagous pest, Spodoptera litura. Exhaustive analyses of genome and transcriptome data lead to the identification of 45 IR genes, comprising 17 antennal IRs (A-IRs), 8 Lepidoptera-specific IRs (LS-IRs), and 20 divergent IRs (D-IRs). Phylogenetic analysis reveals that S. litura A-IRs generally retain a strict single copy within each orthologous group, and two lineage expansions are observed in the D-IR sub-family, including IR100d-h and 100i-o, likely attributable to gene duplications. Results of gene structure analysis classify the SlitIRs into four types: I (intronless), II (1-3 introns), III (5-9 introns), and IV (10-18 introns). Extensive expression profiles demonstrate that the majority of SlitIRs (28/43) are enriched in adult antennae, and some are detected in gustatory-associated tissues like proboscises and legs as well as non-chemosensory organs like abdomens and reproductive tissues of both sexes. These results indicate that SlitIRs have diverse functional roles in olfaction, taste, and reproduction. Together, our study complements the information on chemoreceptor genes in S. litura and allows for targeted experiments to identify potential IR candidates for the control of this pest.
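The four gene-structure types described above are defined purely by intron count, so the classification can be written as a small lookup. The sketch below is illustrative only: the function name is hypothetical, and because the abstract gives no class for genes with exactly 4 introns or with more than 18, those counts are returned as unclassified rather than guessed.

```python
from collections import Counter
from typing import Optional


def slit_ir_structure_type(n_introns: int) -> Optional[str]:
    """Map an intron count to the structure type named in the abstract.

    Returns None for counts the abstract does not assign to any type
    (4 introns, or more than 18).
    """
    if n_introns < 0:
        raise ValueError("intron count cannot be negative")
    if n_introns == 0:
        return "I"    # intronless
    if 1 <= n_introns <= 3:
        return "II"
    if 5 <= n_introns <= 9:
        return "III"
    if 10 <= n_introns <= 18:
        return "IV"
    return None


if __name__ == "__main__":
    # Hypothetical intron counts, not data from the study.
    toy_counts = [0, 2, 7, 12, 4]
    print(Counter(slit_ir_structure_type(n) for n in toy_counts))
```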
Babben, Steve; Perovic, Dragan; Koch, Michael; Ordon, Frank
2015-01-01
Recent declines in costs have accelerated sequencing of many species with large genomes, including hexaploid wheat (Triticum aestivum L.). Although the draft sequence of bread wheat is known, developing locus-specific primers suitable for marker-assisted selection remains a major challenge because of the high homology of the three genomes. In this study we describe an efficient approach for the development of locus-specific primers comprising four steps, i.e. (i) identification of genomic and coding sequences (CDS) of candidate genes, (ii) intron- and exon-structure reconstruction, (iii) identification of wheat A, B and D sub-genome sequences and primer development based on sequence differences between the three sub-genomes, and (iv) testing of primers for functionality, correct size and localisation. This approach was applied to single, low and high copy genes involved in frost tolerance in wheat. In summary, for 27 of these genes, for which sequences were derived from Triticum aestivum, Triticum monococcum and Hordeum vulgare, a set of 119 primer pairs was developed, and after testing on nulli-tetrasomic (NT) lines, a set of 65 primer pairs (54.6%), corresponding to 19 candidate genes, turned out to be specific. Of these, a set of 35 fragments was selected for validation via Sanger amplicon re-sequencing. All fragments, with the exception of one, could be assigned to the original reference sequence. The approach presented here showed a much higher specificity in primer development than techniques used so far in bread wheat and can be applied to other polyploid species with a known draft sequence. PMID:26565976
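Step (iii) of the approach relies on sequence differences between the A, B and D sub-genomes. One simple way to picture this, offered here only as a hedged sketch and not as the authors' pipeline, is to scan a pre-computed alignment of the three homoeologous copies and list the columns where the target copy differs from both of the others; such columns are natural candidates for anchoring the 3′ end of a sub-genome-specific primer. The function name and the toy alignment are hypothetical.

```python
def subgenome_specific_positions(alignment: dict, target: str) -> list:
    """Return 0-based alignment columns where `target` differs from every other copy.

    alignment: maps a sub-genome label (e.g. 'A', 'B', 'D') to its aligned
               sequence; all sequences must have the same length, with '-' for gaps.
    """
    lengths = {len(seq) for seq in alignment.values()}
    if len(lengths) != 1:
        raise ValueError("aligned sequences must all have the same length")
    target_seq = alignment[target]
    others = [seq for name, seq in alignment.items() if name != target]
    return [
        i
        for i, base in enumerate(target_seq)
        # Skip gap columns in the target; require a mismatch against every other copy.
        if base != "-" and all(other[i] != base for other in others)
    ]


if __name__ == "__main__":
    # Toy alignment (hypothetical sequences).
    toy = {"A": "ACGTAC", "B": "ACGAAC", "D": "ACGAAT"}
    for genome in toy:
        print(genome, subgenome_specific_positions(toy, genome))
```

In practice the alignment would come from a multiple-sequence aligner, and primer design would add constraints (melting temperature, amplicon size) that this sketch does not attempt to model.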
Polat, İlknur; Baysal, Ömür; Mercati, Francesco; Gümrükcü, Emine; Sülü, Görkem; Kitapcı, Aytül; Araniti, Fabrizio; Carimi, Francesco
2018-06-01
Botrytis cinerea is a polyphagous fungal pathogen causing gray mold disease. Moreover, it is one of the most destructive infections of small fruit crops such as pepper (Capsicum annuum L.). C. annuum is a species belonging to the Solanaceae family, and Turkey is one of the main producers in the world. In the present work, aiming to obtain information useful for pest management, fifty B. cinerea isolates collected from Turkey and a reference isolate (B05.10) were characterized using molecular markers and fungicide resistance genes. Morphological and molecular (ITS1-ITS4) identification of B. cinerea isolates was performed, and the degree of virulence and mating types were determined. Since one or several allelic mutations in the histidine kinase (Bos1) and β-tubulin genes generally confer resistance to fungicides, the sequences of these target genes were investigated in the selected isolates, which allowed the identification of two different haplotypes. Mating types were also determined by PCR assays using primers specific for the MAT1-1 alpha gene (MAT1-1-1) and the MAT1-2 HMG gene (MAT1-2-1) of B. cinerea. Twenty-two out of 50 isolates (44%) were MAT1-2, while 38% were MAT1-1. Interestingly, 9 of the studied isolates (18%) were heterokaryotic or mixed colonies. In addition, cluster and population structure analyses identified five main groups and two genetic pools, respectively, underlining a good level of variability in the analysed panel. The results highlighted the presence of remarkable genetic diversity in B. cinerea isolates collected in an economically crucial area for pepper cultivation in Turkey, and the data will be beneficial for future gray mold disease management. Copyright © 2018 Elsevier B.V. All rights reserved.
Dietzel, Lars; Gläßer, Christine; Liebers, Monique; Hiekel, Stefan; Courtois, Florence; Czarnecki, Olaf; Schlicke, Hagen; Zubo, Yan; Börner, Thomas; Mayer, Klaus; Grimm, Bernhard; Pfannschmidt, Thomas
2015-08-01
Natural illumination conditions are highly variable and because of their sessile life style, plants are forced to acclimate to them at the cellular and molecular level. Changes in light intensity or quality induce changes in the reduction/oxidation (redox) state of the photosynthetic electron chain that acts as a trigger for compensatory acclimation responses comprising functional and structural adjustments of photosynthesis and metabolism. Such responses include redox-controlled changes in plant gene expression in the nucleus and organelles. Here we describe a strategy for the identification of early redox-regulated genes (ERGs) in the nucleus of the model organism Arabidopsis thaliana that respond significantly 30 or 60 min after the generation of a reduction signal in the photosynthetic electron transport chain. By comparing the response of wild-type plants with that of the acclimation mutant stn7, we could specifically identify ERGs. The results reveal a significant impact of chloroplast redox signals on distinct nuclear gene groups including genes for the mitochondrial electron transport chain, tetrapyrrole biosynthesis, carbohydrate metabolism, and signaling lipid synthesis. These expression profiles are clearly different from those observed in response to the reduction of photosynthetic electron transport by high light treatments. Thus, the ERGs identified are unique to redox imbalances in photosynthetic electron transport and were then used for analyzing potential redox-responsive cis-elements, trans-factors, and chromosomal regulatory hot spots. The data identify a novel redox-responsive element and indicate extensive redox control at transcriptional and chromosomal levels that point to an unprecedented impact of redox signals on epigenetic processes. Copyright © 2015 The Author. Published by Elsevier Inc. All rights reserved.
Mouse forward genetics in the study of the peripheral nervous system and human peripheral neuropathy
Douglas, Darlene S.; Popko, Brian
2009-01-01
Forward genetics, the phenotype-driven approach to investigating gene identity and function, has a long history in mouse genetics. Random mutations in the mouse transcend bias about gene function and provide avenues towards unique discoveries. The study of the peripheral nervous system is no exception; from historical strains such as the trembler mouse, which led to the identification of PMP22 as a human disease gene causing multiple forms of peripheral neuropathy, to the more recent identification of the claw paw and sprawling mutations, forward genetics has long been a tool for probing the physiology, pathogenesis, and genetics of the PNS. Even as spontaneous and mutagenized mice continue to enable the identification of novel genes, provide allelic series for detailed functional studies, and generate models useful for clinical research, new methods, such as the piggyBac transposon, are being developed to further harness the power of forward genetics. PMID:18481175
Liu, Jun-Jun; Xiang, Yu
2011-01-01
WRKY transcription factors are key regulators of numerous biological processes in plant growth and development, as well as plant responses to abiotic and biotic stresses. Research on biological functions of plant WRKY genes has focused in the past on model plant species or species with largely characterized transcriptomes. However, a variety of non-model plants, such as forest conifers, are essential as feed, biofuel, and wood or for sustainable ecosystems. Identification of WRKY genes in these non-model plants is equally important for understanding the evolutionary and function-adaptive processes of this transcription factor family. Because of limited genomic information, the rarity of regulatory gene mRNAs in transcriptomes, and the sequence divergence to model organism genes, identification of transcription factors in non-model plants using methods similar to those generally used for model plants is difficult. This chapter describes a gene family discovery strategy for identification of WRKY transcription factors in conifers by a combination of in silico-based prediction and PCR-based experimental approaches. Compared to traditional cDNA library screening or EST sequencing at transcriptome scales, this integrated gene discovery strategy provides fast, simple, reliable, and specific methods to unveil the WRKY gene family at both genome and transcriptome levels in non-model plants.
Yang, Shuzhi; Cai, Qunfeng; Bard, Jonathan; Jamison, Jennifer; Wang, Jianmin; Yang, Weiping; Hu, Bo Hua
2015-12-01
Individual variation in the susceptibility of the auditory system to acoustic overstimulation has been well-documented at both the functional and structural levels. However, the molecular mechanism responsible for this variation is unclear. The current investigation was designed to examine the variation patterns of cochlear gene expression using RNA-seq data and to identify the genes with expression variation that increased following acoustic trauma. This study revealed that the constitutive expressions of cochlear genes displayed diverse levels of gene-specific variation. These variation patterns were altered by acoustic trauma; approximately one-third of the examined genes displayed marked increases in their expression variation. Bioinformatics analyses revealed that the genes that exhibited increased variation were functionally related to cell death, biomolecule metabolism, and membrane function. In contrast, the stable genes were primarily related to basic cellular processes, including protein and macromolecular syntheses and transport. There was no functional overlap between the stable and variable genes. Importantly, we demonstrated that glutamate metabolism is related to the variation in the functional response of the cochlea to acoustic overstimulation. Taken together, the results indicate that our analyses of the individual variations in transcriptome changes of cochlear genes provide important information for the identification of genes that potentially contribute to the generation of individual variation in cochlear responses to acoustic overstimulation. Copyright © 2015 Elsevier B.V. All rights reserved.
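The abstract does not say which statistic was used to quantify expression variation between individuals. As one plausible illustration, and explicitly an assumption rather than the study's method, the sketch below uses the coefficient of variation (standard deviation divided by mean) per gene and flags genes whose variation grows by at least a chosen fold after trauma. The function names, the fold threshold, and the toy expression values are all hypothetical.

```python
from statistics import mean, stdev


def coefficient_of_variation(values) -> float:
    """Sample standard deviation divided by the mean (requires >= 2 values)."""
    m = mean(values)
    if m == 0:
        raise ValueError("mean expression is zero; CV is undefined")
    return stdev(values) / m


def genes_with_increased_variation(control: dict, trauma: dict, fold: float = 2.0) -> list:
    """Genes whose CV after trauma is at least `fold` times the control CV.

    control, trauma: map gene name -> list of per-animal expression values.
    Only genes present in both conditions are compared.
    """
    shared = control.keys() & trauma.keys()
    return sorted(
        gene
        for gene in shared
        if coefficient_of_variation(trauma[gene])
        >= fold * coefficient_of_variation(control[gene])
    )


if __name__ == "__main__":
    # Toy values only (hypothetical): g1 becomes much more variable, g2 stays stable.
    control = {"g1": [10, 11, 9, 10], "g2": [5, 5.5, 4.5, 5]}
    trauma = {"g1": [10, 20, 2, 8], "g2": [5, 5.4, 4.6, 5]}
    print(genes_with_increased_variation(control, trauma))
```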
Identification and Characterization of Genes That Interact with Lin-12 in Caenorhabditis Elegans
Tax, F. E.; Thomas, J. H.; Ferguson, E. L.; Horvitz, H. R.
1997-01-01
We identified and characterized 14 extragenic mutations that suppressed the dominant egg-laying defect of certain lin-12 gain-of-function mutations. These suppressors defined seven genes: sup-17, lag-2, sel-4, sel-5, sel-6, sel-7 and sel-8. Mutations in six of the genes are recessive suppressors, whereas the two mutations that define the seventh gene, lag-2, are semi-dominant suppressors. These suppressor mutations were able to suppress other lin-12 gain-of-function mutations. The suppressor mutations arose at a very low frequency per gene, 10-50 times below the typical loss-of-function mutation frequency. The suppressor mutations in sup-17 and lag-2 were shown to be rare non-null alleles, and we present evidence that null mutations in these two genes cause lethality. Temperature-shift studies for two suppressor genes, sup-17 and lag-2, suggest that both genes act at approximately the same time as lin-12 in specifying a cell fate. Suppressor alleles of six of these genes enhanced a temperature-sensitive loss-of-function allele of glp-1, a gene related to lin-12 in structure and function. Our analysis of these suppressors suggests that the majority of these genes are part of a shared lin-12/glp-1 signal transduction pathway, or act to regulate the expression or stability of lin-12 and glp-1. PMID:9409830
Diversity and evolution of myxozoan minicollagens and nematogalectins.
Shpirer, Erez; Chang, E Sally; Diamant, Arik; Rubinstein, Nimrod; Cartwright, Paulyn; Huchon, Dorothée
2014-09-29
Myxozoa are a diverse group of metazoan parasites with a very simple organization, whose evolutionary origin has remained elusive for decades. Their most prominent and characteristic feature is the polar capsule: a complex intracellular structure of the myxozoan spore, which plays a role in host infection. Striking morphological similarities have been found between myxozoan polar capsules and nematocysts, the stinging structures of cnidarians (corals, sea anemones and jellyfish), leading to the suggestion that Myxozoa and Cnidaria share a more recent common ancestry. This hypothesis has recently been supported by phylogenomic evidence and by the identification of a nematocyst-specific minicollagen gene in the myxozoan Tetracapsuloides bryosalmonae. Here we searched genomes and transcriptomes of several myxozoan taxa for the presence of additional cnidarian-specific genes and characterized these genes within a phylogenetic context. Illumina assemblies of transcriptome or genome data of three myxozoan species (Enteromyxum leei, Kudoa iwatai, and Sphaeromyxa zaharoni) and of the enigmatic cnidarian parasite Polypodium hydriforme (Polypodiozoa) were mined using tBlastn searches with nematocyst-specific proteins as queries. Several orthologs of nematogalectins and minicollagens were identified. Our phylogenetic analyses indicate that myxozoans possess three distinct minicollagens. We found that the cnidarian repertoire of nematogalectins is more complex than previously thought and we identified additional members of the nematogalectin family. Cnidarians were found to possess four nematogalectin/nematogalectin-related genes, while in myxozoans only three genes could be identified. Our results demonstrate that myxozoans possess a diverse array of genes that are taxonomically restricted to Cnidaria. Characterization of these genes provides compelling evidence that polar capsules and nematocysts are homologous structures and that myxozoans are highly degenerate cnidarians.
The diversity of minicollagens was higher than previously thought, with the presence of three minicollagen genes in myxozoans. Our phylogenetic results suggest that the different myxozoan sequences are the results of ancient divergences within Cnidaria and not of recent specializations of the polar capsule. For both minicollagen and nematogalectin, our results show that myxozoans possess fewer gene copies than their cnidarian counterparts, suggesting that the polar capsule gene repertoire was simplified along with their reduced body plan.
1999-09-01
I., Zbar, B., and role for the VHL gene in the development of hyperplasia in a number Lerman, I. I. Identification of the von Hippel-Lindau disease...of heterozygosity of chromosome 3p markers in small-cell lung cancer. Nature (Lond.), 329: elegans produced hyperplasia in all tissues (26...central fibrovascular core lined by cuboidal tumor cells. Tumor weights were determined (Fig. 2d). At the end of 47 days after cells were
Zhu, X L; Yang, F; Li, H X; Dou, Y X; Meng, X L; Li, H; Luo, X N; Cai, X P
2013-05-14
An outbreak of sheep pox was investigated in the Ningxia Hui Autonomous Region in China. Through immunofluorescence testing, virus isolation, polymerase chain reaction identification, and electron microscopic examination, the isolated strain was identified as a sheep pox virus. The virus was further characterized through sequence and phylogenetic analysis of the P32 gene, open reading frame (ORF) 095, and ORF 103. This study is the first to use the ORF 095 and ORF 103 genes as candidate genes for the analysis of sheep pox. The results showed that the ORF 095 and ORF 103 genes could be used for the genotyping of the sheep pox virus.
Vandelle, Elodie; Vannozzi, Alessandro; Wong, Darren; Danzi, Davide; Digby, Anne-Marie; Dal Santo, Silvia; Astegno, Alessandra
2018-06-04
Calcium (Ca2+) is a ubiquitous key second messenger in plants, where it modulates many developmental and adaptive processes in response to various stimuli. Several proteins containing a Ca2+-binding domain have been identified in plants, including calmodulin (CaM) and calmodulin-like (CML) proteins, which play critical roles in translating Ca2+ signals into proper cellular responses. In this work, a genome-wide analysis conducted in Vitis vinifera identified three CaM- and 62 CML-encoding genes. We assigned gene family nomenclature and analyzed gene structure, chromosomal location and gene duplication, as well as protein motif organization. The phylogenetic clustering revealed a total of eight subgroups, including one unique clade of VviCaMs distinct from VviCMLs. VviCaMs were found to contain four EF-hand motifs, whereas VviCML proteins have one to five. Most grapevine CML genes were intronless, while VviCaMs were intron-rich. All the genes were well spread among the 19 grapevine chromosomes and displayed a high level of duplication. The expression profiling of VviCaM/VviCML genes revealed a broad expression pattern across all grape organs and tissues at various developmental stages, and a significant modulation in biotic stress-related responses. Our results highlight the complexity of the CaM/CML protein family in grapevine as well, supporting the versatile role of its different members in modulating cellular responses to various stimuli, in particular to biotic stresses. This work lays the foundation for further functional and structural studies on specific grapevine CaMs/CMLs in order to better understand the role of Ca2+-binding proteins in grapevine and to explore their potential for further biotechnological applications. Copyright © 2018 Elsevier Masson SAS. All rights reserved.
Gupta, Deepti; Bijarnia-Mahay, Sunita; Saxena, Renu; Kohli, Sudha; Dua-Puri, Ratna; Verma, Jyotsna; Thomas, E; Shigematsu, Yosuke; Yamaguchi, Seiji; Deb, Roumi; Verma, Ishwar Chander
2015-09-01
Maple syrup urine disease (MSUD) is caused by mutations in the genes BCKDHA, BCKDHB and DBT, which encode the E1α, E1β, and E2 subunits of the branched-chain alpha-ketoacid dehydrogenase (BCKDH) enzyme complex. BCKDH participates in the catabolism of the branched-chain amino acids (BCAAs) leucine, isoleucine and valine in the energy production pathway. Deficiency or defect in the enzyme complex causes accumulation of BCAAs and keto-acids, leading to toxicity. Twenty-four patients with MSUD were enrolled in the study for molecular characterization and genotype-phenotype correlation. Molecular studies were carried out by sequencing the 3 genes by the Sanger method. Bioinformatics tools were employed to classify novel variations as pathogenic or benign. The predicted effects of novel changes on protein structure were elucidated by 3D modeling. Mutations were detected in 22 of 24 patients (11, 7 and 4 in the BCKDHB, BCKDHA and DBT genes, respectively). Twenty mutations, including 11 novel mutations, were identified. Protein modeling of the novel mutations showed alteration of the structure and function of these subunits. The mutations c.1065 delT (BCKDHB gene) and c.939G > C (DBT gene) were noted to be recurrent, identified in 6 of 22 alleles and 5 of 8 alleles, respectively. Two-thirds of the patients (16 of 24) had the classical neonatal phenotype. BCKDHB gene mutations were present in 10 of these 16 patients. Prenatal diagnoses were performed in 4 families. Consanguinity was noted in 37.5% of families. Although no obvious genotype-phenotype correlation could be found in our study, most cases with a mutation in the BCKDHB gene presented in the neonatal period. The large number of novel mutations underlines the heterogeneity and distinctness of the gene pool from India. Copyright © 2015 Elsevier Masson SAS. All rights reserved.
Veenstra, Jan A; Khammassi, Hela
2017-04-01
RYamides are arthropod neuropeptides with unknown function. In 2011 two RYamides were isolated from D. melanogaster as the ligands for the G-protein coupled receptor CG5811. The D. melanogaster gene encoding these neuropeptides is highly unusual, as there are four RYamide encoding exons in the current genome assembly, but an exon encoding a signal peptide is absent. Comparing the D. melanogaster gene structure with those from other species, including D. virilis, suggests that the gene is degenerating. RNAseq data from 1634 short sequence read archives at NCBI containing more than 34 billion spots yielded numerous individual spots that correspond to the RYamide encoding exons, of which a large number include the intron-exon boundary at the start of this exon. Although 72 different sequences have been spliced onto this RYamide encoding exon, none codes for the signal peptide of this gene. Thus, the RNAseq data for this gene reveal only noise and no signal. The very small quantities of peptide recovered during isolation and the absence of credible RNAseq data indicate that the gene is expressed at very low levels, while the RYamide gene structure in D. melanogaster suggests that it might be evolving into a pseudogene. Yet, the identification of the peptides it encodes clearly shows it is still functional. Using region-specific antisera, we could localize numerous neurons and enteroendocrine cells in D. willistoni, D. virilis and D. pseudoobscura, but only two adult abdominal neurons in D. melanogaster. Those two neurons project to and innervate the rectal papillae, suggesting that RYamides may be involved in the regulation of water homeostasis. Copyright © 2017 Elsevier Ltd. All rights reserved.
Willkomm, Dagmar K.; Minnerup, Jens; Hüttenhofer, Alexander; Hartmann, Roland K.
2005-01-01
By an experimental RNomics approach, we have generated a cDNA library from small RNAs expressed from the genome of the hyperthermophilic bacterium Aquifex aeolicus. The library included RNAs that were antisense to mRNAs and tRNAs as well as RNAs encoded in intergenic regions. Substantial steady-state levels in A. aeolicus cells were confirmed for several of the cloned RNAs by northern blot analysis. The most abundant intergenic RNA of the library was identified as the 6S RNA homolog of A. aeolicus. Although shorter in size (150 nt) than its γ-proteobacterial homologs (∼185 nt), it is predicted to have the most stable structure among known 6S RNAs. As in the γ-proteobacteria, the A. aeolicus 6S RNA gene (ssrS) is located immediately upstream of the ygfA gene encoding a widely conserved 5-formyltetrahydrofolate cyclo-ligase. We identified novel 6S RNA candidates within the γ-proteobacteria but were unable to identify reasonable 6S RNA candidates in other bacterial branches, utilizing mfold analyses of the region immediately upstream of ygfA combined with 6S RNA blastn searches. By RACE experiments, we mapped the major transcription initiation site of A. aeolicus 6S RNA primary transcripts, located within the pheT gene preceding ygfA, as well as three processing sites. PMID:15814812
Izquierdo, Esther; Cai, Yimin; Marchioni, Eric; Ennahar, Saïd
2009-05-01
Enterococcus faecium IT62, a strain isolated from ryegrass in Japan, produces three bacteriocins (enterocins L50A, L50B, and IT) that have been previously purified and the primary structures of which have been determined by amino acid sequencing (E. Izquierdo, A. Bednarczyk, C. Schaeffer, Y. Cai, E. Marchioni, A. Van Dorsselaer, and S. Ennahar, Antimicrob. Agents Chemother., 52:1917-1923, 2008). Genetic analysis showed that the bacteriocins of E. faecium IT62 are plasmid encoded, but with the structural genes specifying enterocin L50A and enterocin L50B being carried by a plasmid (pTAB1) that is separate from the one (pTIT1) carrying the structural gene of enterocin IT. Sequencing analysis of a 1,475-bp region from pTAB1 identified two consecutive open reading frames corresponding, with the exception of 2 bp, to the genes entL50A and entL50B, encoding EntL50A and EntL50B, respectively. Both bacteriocins are synthesized without N-terminal leader sequences. Genetic analysis of a sequenced 1,380-bp pTIT1 fragment showed that the genes entIT and entIM, encoding enterocin IT and its immunity protein, respectively, were both found in E. faecium VRE200 for bacteriocin 32. Enterocin IT, a 6,390-Da peptide made up of 54 amino acids, has been previously shown to be identical to the C-terminal part of bacteriocin 32, a 7,998-Da bacteriocin produced by E. faecium VRE200 whose structure was deduced from its structural gene (T. Inoue, H. Tomita, and Y. Ike, Antimicrob. Agents Chemother., 50:1202-1212, 2006). By combining the biochemical and genetic data on enterocin IT, it was concluded that bacteriocin 32 is in fact identical to enterocin IT, both being encoded by the same plasmid-borne gene, and that the N-terminal leader peptide for this bacteriocin is 35 amino acids long and not 19 amino acids long as previously reported.
USDA-ARS's Scientific Manuscript database
The comprehensive identification of genes underlying phenotypic variation of complex traits remains a major challenge. Most genome-wide screens lack sufficient resolving power as they typically depend on linkage. An alternate method is to screen for allele-specific expression (ASE), a simple yet pow...
Genome-Wide Identification of the Invertase Gene Family in Populus.
Chen, Zhong; Gao, Kai; Su, Xiaoxing; Rao, Pian; An, Xinmin
2015-01-01
Invertase plays a crucial role in carbohydrate partitioning and plant development as it catalyses the irreversible hydrolysis of sucrose into glucose and fructose. The invertase family in plants is composed of two sub-families: acid invertases, which are targeted to the cell wall and vacuole; and neutral/alkaline invertases, which function in the cytosol. In this study, 5 cell wall invertase genes (PtCWINV1-5), 3 vacuolar invertase genes (PtVINV1-3) and 16 neutral/alkaline invertase genes (PtNINV1-16) were identified in the Populus genome and found to be distributed on 14 chromosomes. A comprehensive analysis of poplar invertase genes was performed, including structures, chromosome location, phylogeny, evolutionary pattern and expression profiles. Phylogenetic analysis indicated that the two sub-families were both divided into two clades. Segmental duplication contributed to the expansion of the neutral/alkaline sub-family. Furthermore, the Populus invertase genes displayed differential expression in roots, stems, leaves and leaf buds, and in response to salt/cold stress and pathogen infection. In addition, the analysis of enzyme activity and sugar content revealed that invertase genes play key roles in the sucrose metabolism of various tissues and organs in poplar. This work lays the foundation for future functional analysis of the invertase genes in Populus and other woody perennials. PMID:26393355
Common genetic variants influence human subcortical brain structures.
Hibar, Derrek P; Stein, Jason L; Renteria, Miguel E; Arias-Vasquez, Alejandro; Desrivières, Sylvane; Jahanshad, Neda; Toro, Roberto; Wittfeld, Katharina; Abramovic, Lucija; Andersson, Micael; Aribisala, Benjamin S; Armstrong, Nicola J; Bernard, Manon; Bohlken, Marc M; Boks, Marco P; Bralten, Janita; Brown, Andrew A; Chakravarty, M Mallar; Chen, Qiang; Ching, Christopher R K; Cuellar-Partida, Gabriel; den Braber, Anouk; Giddaluru, Sudheer; Goldman, Aaron L; Grimm, Oliver; Guadalupe, Tulio; Hass, Johanna; Woldehawariat, Girma; Holmes, Avram J; Hoogman, Martine; Janowitz, Deborah; Jia, Tianye; Kim, Sungeun; Klein, Marieke; Kraemer, Bernd; Lee, Phil H; Olde Loohuis, Loes M; Luciano, Michelle; Macare, Christine; Mather, Karen A; Mattheisen, Manuel; Milaneschi, Yuri; Nho, Kwangsik; Papmeyer, Martina; Ramasamy, Adaikalavan; Risacher, Shannon L; Roiz-Santiañez, Roberto; Rose, Emma J; Salami, Alireza; Sämann, Philipp G; Schmaal, Lianne; Schork, Andrew J; Shin, Jean; Strike, Lachlan T; Teumer, Alexander; van Donkelaar, Marjolein M J; van Eijk, Kristel R; Walters, Raymond K; Westlye, Lars T; Whelan, Christopher D; Winkler, Anderson M; Zwiers, Marcel P; Alhusaini, Saud; Athanasiu, Lavinia; Ehrlich, Stefan; Hakobjan, Marina M H; Hartberg, Cecilie B; Haukvik, Unn K; Heister, Angelien J G A M; Hoehn, David; Kasperaviciute, Dalia; Liewald, David C M; Lopez, Lorna M; Makkinje, Remco R R; Matarin, Mar; Naber, Marlies A M; McKay, D Reese; Needham, Margaret; Nugent, Allison C; Pütz, Benno; Royle, Natalie A; Shen, Li; Sprooten, Emma; Trabzuni, Daniah; van der Marel, Saskia S L; van Hulzen, Kimm J E; Walton, Esther; Wolf, Christiane; Almasy, Laura; Ames, David; Arepalli, Sampath; Assareh, Amelia A; Bastin, Mark E; Brodaty, Henry; Bulayeva, Kazima B; Carless, Melanie A; Cichon, Sven; Corvin, Aiden; Curran, Joanne E; Czisch, Michael; de Zubicaray, Greig I; Dillman, Allissa; Duggirala, Ravi; Dyer, Thomas D; Erk, Susanne; Fedko, Iryna O; Ferrucci, Luigi; Foroud, Tatiana M; Fox, Peter T; 
Fukunaga, Masaki; Gibbs, J Raphael; Göring, Harald H H; Green, Robert C; Guelfi, Sebastian; Hansell, Narelle K; Hartman, Catharina A; Hegenscheid, Katrin; Heinz, Andreas; Hernandez, Dena G; Heslenfeld, Dirk J; Hoekstra, Pieter J; Holsboer, Florian; Homuth, Georg; Hottenga, Jouke-Jan; Ikeda, Masashi; Jack, Clifford R; Jenkinson, Mark; Johnson, Robert; Kanai, Ryota; Keil, Maria; Kent, Jack W; Kochunov, Peter; Kwok, John B; Lawrie, Stephen M; Liu, Xinmin; Longo, Dan L; McMahon, Katie L; Meisenzahl, Eva; Melle, Ingrid; Mohnke, Sebastian; Montgomery, Grant W; Mostert, Jeanette C; Mühleisen, Thomas W; Nalls, Michael A; Nichols, Thomas E; Nilsson, Lars G; Nöthen, Markus M; Ohi, Kazutaka; Olvera, Rene L; Perez-Iglesias, Rocio; Pike, G Bruce; Potkin, Steven G; Reinvang, Ivar; Reppermund, Simone; Rietschel, Marcella; Romanczuk-Seiferth, Nina; Rosen, Glenn D; Rujescu, Dan; Schnell, Knut; Schofield, Peter R; Smith, Colin; Steen, Vidar M; Sussmann, Jessika E; Thalamuthu, Anbupalam; Toga, Arthur W; Traynor, Bryan J; Troncoso, Juan; Turner, Jessica A; Valdés Hernández, Maria C; van 't Ent, Dennis; van der Brug, Marcel; van der Wee, Nic J A; van Tol, Marie-Jose; Veltman, Dick J; Wassink, Thomas H; Westman, Eric; Zielke, Ronald H; Zonderman, Alan B; Ashbrook, David G; Hager, Reinmar; Lu, Lu; McMahon, Francis J; Morris, Derek W; Williams, Robert W; Brunner, Han G; Buckner, Randy L; Buitelaar, Jan K; Cahn, Wiepke; Calhoun, Vince D; Cavalleri, Gianpiero L; Crespo-Facorro, Benedicto; Dale, Anders M; Davies, Gareth E; Delanty, Norman; Depondt, Chantal; Djurovic, Srdjan; Drevets, Wayne C; Espeseth, Thomas; Gollub, Randy L; Ho, Beng-Choon; Hoffmann, Wolfgang; Hosten, Norbert; Kahn, René S; Le Hellard, Stephanie; Meyer-Lindenberg, Andreas; Müller-Myhsok, Bertram; Nauck, Matthias; Nyberg, Lars; Pandolfo, Massimo; Penninx, Brenda W J H; Roffman, Joshua L; Sisodiya, Sanjay M; Smoller, Jordan W; van Bokhoven, Hans; van Haren, Neeltje E M; Völzke, Henry; Walter, Henrik; Weiner, Michael W; Wen, 
Wei; White, Tonya; Agartz, Ingrid; Andreassen, Ole A; Blangero, John; Boomsma, Dorret I; Brouwer, Rachel M; Cannon, Dara M; Cookson, Mark R; de Geus, Eco J C; Deary, Ian J; Donohoe, Gary; Fernández, Guillén; Fisher, Simon E; Francks, Clyde; Glahn, David C; Grabe, Hans J; Gruber, Oliver; Hardy, John; Hashimoto, Ryota; Hulshoff Pol, Hilleke E; Jönsson, Erik G; Kloszewska, Iwona; Lovestone, Simon; Mattay, Venkata S; Mecocci, Patrizia; McDonald, Colm; McIntosh, Andrew M; Ophoff, Roel A; Paus, Tomas; Pausova, Zdenka; Ryten, Mina; Sachdev, Perminder S; Saykin, Andrew J; Simmons, Andy; Singleton, Andrew; Soininen, Hilkka; Wardlaw, Joanna M; Weale, Michael E; Weinberger, Daniel R; Adams, Hieab H H; Launer, Lenore J; Seiler, Stephan; Schmidt, Reinhold; Chauhan, Ganesh; Satizabal, Claudia L; Becker, James T; Yanek, Lisa; van der Lee, Sven J; Ebling, Maritza; Fischl, Bruce; Longstreth, W T; Greve, Douglas; Schmidt, Helena; Nyquist, Paul; Vinke, Louis N; van Duijn, Cornelia M; Xue, Luting; Mazoyer, Bernard; Bis, Joshua C; Gudnason, Vilmundur; Seshadri, Sudha; Ikram, M Arfan; Martin, Nicholas G; Wright, Margaret J; Schumann, Gunter; Franke, Barbara; Thompson, Paul M; Medland, Sarah E
2015-04-09
The highly complex structure of the human brain is strongly shaped by genetic influences. Subcortical brain regions form circuits with cortical areas to coordinate movement, learning, memory and motivation, and altered circuits can lead to abnormal behaviour and disease. To investigate how common genetic variants affect the structure of these brain regions, here we conduct genome-wide association studies of the volumes of seven subcortical regions and the intracranial volume derived from magnetic resonance images of 30,717 individuals from 50 cohorts. We identify five novel genetic variants influencing the volumes of the putamen and caudate nucleus. We also find stronger evidence for three loci with previously established influences on hippocampal volume and intracranial volume. These variants show specific volumetric effects on brain structures rather than global effects across structures. The strongest effects were found for the putamen, where a novel intergenic locus with replicable influence on volume (rs945270; P = 1.08 × 10⁻³³; 0.52% variance explained) showed evidence of altering the expression of the KTN1 gene in both brain and blood tissue. Variants influencing putamen volume clustered near developmental genes that regulate apoptosis, axon guidance and vesicle transport. Identification of these genetic variants provides insight into the causes of variability in human brain development, and may help to determine mechanisms of neuropsychiatric dysfunction.
Sykes, Timothy; Yates, Steven; Nagy, Istvan; Asp, Torben; Small, Ian
2017-01-01
Perennial ryegrass (Lolium perenne L.) is widely used for forage production in both permanent and temporary grassland systems. To increase yields in perennial ryegrass, recent breeding efforts have been focused on strategies to more efficiently exploit heterosis by hybrid breeding. Cytoplasmic male sterility (CMS) is a widely applied mechanism to control pollination for commercial hybrid seed production and although CMS systems have been identified in perennial ryegrass, they are yet to be fully characterized. Here, we present a bioinformatics pipeline for efficient identification of candidate restorer of fertility (Rf) genes for CMS. From a high-quality draft of the perennial ryegrass genome, 373 pentatricopeptide repeat (PPR) genes were identified and classified, further identifying 25 restorer of fertility-like PPR (RFL) genes through a combination of DNA sequence clustering and comparison to known Rf genes. This extensive gene family was targeted as the majority of Rf genes in higher plants are RFL genes. These RFL genes were further investigated by phylogenetic analyses, identifying three groups of perennial ryegrass RFLs. These three groups likely represent genomic regions of active RFL generation and identify the probable location of perennial ryegrass PPR-Rf genes. This pipeline allows for the identification of candidate PPR-Rf genes from genomic sequence data and can be used in any plant species. Functional markers for PPR-Rf genes will facilitate map-based cloning of Rf genes and enable the use of CMS as an efficient tool to control pollination for hybrid crop production. PMID:26951780