Cloud, Joann L; Harmsen, Dag; Iwen, Peter C; Dunn, James J; Hall, Gerri; Lasala, Paul Rocco; Hoggan, Karen; Wilson, Deborah; Woods, Gail L; Mellmann, Alexander
2010-04-01
Correct identification of nonfermenting Gram-negative bacilli (NFB) is crucial for patient management. We compared phenotypic identifications of 96 clinical NFB isolates with identifications obtained by 5' 16S rRNA gene sequencing. Sequencing identified 88 isolates (91.7%) with >99% similarity to a sequence from the assigned species; 61.5% of sequencing results were concordant with phenotypic results, indicating the usability of sequencing to identify NFB.
Mishra, Apurva; Pandey, Ramesh K; Manickam, Natesan
2015-01-01
Rapid phylogenetic and functional gene (gtfB) identification of S. mutans from dental plaque derived from children. Dental plaque collected from fifteen patients aged 7-12 years underwent centrifugation followed by genomic DNA extraction for S. mutans. Genomic DNA was processed with S. mutans-specific primers under suitable PCR conditions for phylogenetic and functional gene (gtfB) identification. The yield and results were confirmed by agarose gel electrophoresis. 1% agarose gel electrophoresis showed positive PCR amplification at 1,485 bp when compared with a 1 kbp standard, indicating the presence of S. mutans in the test sample. Another PCR was set up using gtfB primers specific for S. mutans for functional gene identification. 1.2% agarose gel electrophoresis was performed and a positive amplification was observed at 192 bp when compared with 100 bp standards. With the advancement in molecular biology techniques, PCR-based identification and quantification of the bacterial load can be done within hours using species-specific primers and DNA probes. Thus, this technique may reduce the laboratory time spent on conventional culture methods, reduce the possibility of colony identification errors, and is more sensitive than culture techniques.
Lynch, T; Gregson, D; Church, D L
2016-03-01
Actinomyces species are uncommon but important causes of invasive infections. The ability of our regional clinical microbiology laboratory to report species-level identification of Actinomyces relied on molecular identification by partial sequencing of the 16S ribosomal gene prior to the implementation of the Vitek MS (matrix-assisted laser desorption ionization-time of flight mass spectrometry [MALDI-TOF MS]) system. We compared the use of the Vitek MS to that of 16S rRNA gene sequencing for reliable species-level identification of invasive infections caused by Actinomyces spp. because limited data had been published for this important genus. A total of 115 cases of Actinomyces spp., either alone or as part of a polymicrobial infection, were diagnosed between 2011 and 2014. Actinomyces spp. were considered the principal pathogen in bloodstream infections (n = 17, 15%), in skin and soft tissue abscesses (n = 25, 22%), and in pulmonary (n = 26, 23%), bone (n = 27, 23%), intraabdominal (n = 16, 14%), and central nervous system (n = 4, 3%) infections. Compared to sequencing and identification from the SmartGene Integrated Database Network System (IDNS), Vitek MS identified 47/115 (41%) isolates to the correct species and 10 (9%) isolates to the correct genus. However, the Vitek MS was unable to provide identification for 43 (37%) isolates while 15 (13%) had discordant results. Phylogenetic analyses of the 16S rRNA sequences demonstrate high diversity in recovered Actinomyces spp. and provide additional information to compare/confirm discordant identifications between MALDI-TOF and 16S rRNA gene sequences. This study highlights the diversity of clinically relevant Actinomyces spp. and provides an important typing comparison. Based on our analysis, 16S rRNA gene sequencing should be used to rapidly identify Actinomyces spp. until MALDI-TOF databases are optimized. Copyright © 2016, American Society for Microbiology. All Rights Reserved. PMID:26739153
Targeting Conserved Genes in Penicillium Species.
Peterson, Stephen W
2017-01-01
Polymerase chain reaction amplification of conserved genes and sequence analysis provides a very powerful tool for the identification of toxigenic as well as non-toxigenic Penicillium species. Sequences are obtained by amplification of the gene fragment, sequencing via capillary electrophoresis of dideoxynucleotide-labeled fragments or NGS. The sequences are compared to a database of validated isolates. Identification of species indicates the potential of the fungus to make particular mycotoxins.
Woldesemayat, Adugna Abdi; Van Heusden, Peter; Ndimba, Bongani K; Christoffels, Alan
2017-12-22
Drought is the most disastrous abiotic stress that severely affects agricultural productivity worldwide. Understanding the biological basis of drought-regulated traits requires identification and in-depth characterization of genetic determinants using model organisms and high-throughput technologies. However, studies on drought tolerance have generally been limited to the traditional candidate gene approach, which targets only a single gene in a pathway that is related to a trait. In this study, we used sorghum, one of the model crops that is well adapted to arid regions, to mine genes and define determinants for drought tolerance using drought expression libraries and RNA-seq data. We provide an integrated and comparative in silico candidate gene identification, characterization and annotation approach, with an emphasis on genes playing a prominent role in conferring drought tolerance in sorghum. A total of 470 non-redundant functionally annotated drought responsive genes (DRGs) were identified using experimental data from drought responses by employing pairwise sequence similarity searches, pathway and InterPro domain analysis, expression profiling and orthology relation. Comparison of the genomic locations between these genes and sorghum quantitative trait loci (QTLs) showed that 40% of these genes were co-localized with QTLs known for drought tolerance. The genome reannotation conducted using the Program to Assemble Spliced Alignment (PASA) resulted in 9.6% of existing single gene models being updated. In addition, 210 putative novel genes were identified using AUGUSTUS- and PASA-based analysis of the expression dataset. Among these, 50% were single exonic, 69.5% were drought responsive and 5.7% were complete gene structure models. Analysis of biochemical metabolism revealed 14 metabolic pathways that are related to drought tolerance and also had a strong biological network among the categories of genes involved.
Identification of these pathways signifies the interplay of biochemical reactions that make up the metabolic network, constituting a fundamental interface for the sorghum defence mechanism against drought stress. This study suggests untapped natural variability in sorghum that could be used for developing drought tolerance. The data presented here may be regarded as an initial reference point in functional and comparative genomics in the Gramineae family.
Microarray data from independent labs and studies can be compared to potentially identify toxicologically and biologically relevant genes. The Baseline Animal Database working group of HESI was formed to assess baseline gene expression from microarray data derived from control or...
Arbefeville, S; Harris, A; Ferrieri, P
2017-09-01
Fungal infections cause considerable morbidity and mortality in immunocompromised patients. Rapid and accurate identification of fungi is essential to guide accurately targeted antifungal therapy. With the advent of molecular methods, clinical laboratories can use new technologies to supplement traditional phenotypic identification of fungi. The aims of the study were to evaluate the sole commercially available MicroSEQ® D2 LSU rDNA Fungal Identification Kit compared to the in-house developed internal transcribed spacer (ITS) regions assay in identifying moulds, using two well-known online public databases to analyze sequenced data. 85 common and uncommon clinically relevant fungi isolated from clinical specimens were sequenced for the D2 region of the large subunit (LSU) of ribosomal RNA (rRNA) gene with the MicroSEQ® Kit and the ITS regions with the in house developed assay. The generated sequenced data were analyzed with the online GenBank and MycoBank public databases. The D2 region of the LSU rRNA gene identified 89.4% or 92.9% of the 85 isolates to the genus level and the full ITS region (f-ITS) 96.5% or 100%, using GenBank or MycoBank, respectively, when compared to the consensus ID. When comparing species-level designations to the consensus ID, D2 region of the LSU rRNA gene aligned with 44.7% (38/85) or 52.9% (45/85) of these isolates in GenBank or MycoBank, respectively. By comparison, f-ITS possessed greater specificity, followed by ITS1, then ITS2 regions using GenBank or MycoBank. Using GenBank or MycoBank, D2 region of the LSU rRNA gene outperformed phenotypic based ID at the genus level. Comparing rates of ID between D2 region of the LSU rRNA gene and the ITS regions in GenBank or MycoBank at the species level against the consensus ID, f-ITS and ITS2 exceeded performance of the D2 region of the LSU rRNA gene, but ITS1 had similar performance to the D2 region of the LSU rRNA gene using MycoBank. 
Our results indicated that the MicroSEQ® D2 LSU rDNA Fungal Identification Kit was equivalent to the in-house developed ITS regions assay to identify fungi at the genus level. The MycoBank database gave a better curated database and thus allowed a better genus and species identification for both D2 region of the LSU rRNA gene and ITS regions. Copyright © 2017 Elsevier B.V. All rights reserved.
Perceptron ensemble of graph-based positive-unlabeled learning for disease gene identification.
Jowkar, Gholam-Hossein; Mansoori, Eghbal G
2016-10-01
Identification of disease genes using computational methods is an important issue in biomedical and bioinformatics research. Based on the observation that diseases with the same or similar phenotype share the same biological characteristics, researchers have tried to identify genes by using machine learning tools. In recent attempts, semi-supervised learning methods called positive-unlabeled learning have been used for disease gene identification. In this paper, we present a Perceptron ensemble of graph-based positive-unlabeled learning (PEGPUL) on three types of biological attributes: gene ontologies, protein domains and protein-protein interaction networks. In our method, a reliable set of positive and negative genes is extracted using a co-training schema. Then, the similarity graph of genes is built using metric learning by concentrating on the multi-rank-walk method to perform inference from labeled genes. Finally, a Perceptron ensemble is learned from three weighted classifiers: multilevel support vector machine, k-nearest neighbor and decision tree. The main contributions of this paper are: (i) incorporating the statistical properties of gene data through choosing proper metrics, (ii) statistical evaluation of biological features, and (iii) the noise robustness of PEGPUL achieved by using a multilevel schema. In order to assess PEGPUL, we applied it to 12950 disease genes, comprising 949 positive genes from six classes of diseases and 12001 unlabeled genes. Compared with some popular disease gene identification methods, the experimental results show that PEGPUL has reasonable performance. Copyright © 2016 Elsevier Ltd. All rights reserved.
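The final step of the abstract above, learning a Perceptron over the outputs of three base classifiers, can be illustrated with a minimal sketch. This is not the PEGPUL implementation: the base classifiers are not trained here, their votes (+1 for disease gene, -1 otherwise) are hypothetical toy inputs, and `train_perceptron` and `ensemble_predict` are names invented for illustration.

```python
def train_perceptron(votes, labels, epochs=20, lr=1.0):
    """Learn one weight per base classifier with the classic perceptron rule.

    votes  : list of tuples, one vote in {-1, +1} per base classifier
    labels : list of true labels in {-1, +1}
    """
    weights = [0.0] * len(votes[0])
    bias = 0.0
    for _ in range(epochs):
        errors = 0
        for vote, label in zip(votes, labels):
            score = sum(w * v for w, v in zip(weights, vote)) + bias
            predicted = 1 if score > 0 else -1
            if predicted != label:
                # Move the weights towards the misclassified example.
                weights = [w + lr * label * v for w, v in zip(weights, vote)]
                bias += lr * label
                errors += 1
        if errors == 0:  # converged on the training data
            break
    return weights, bias


def ensemble_predict(weights, bias, vote):
    """Weighted vote of the base classifiers."""
    score = sum(w * v for w, v in zip(weights, vote)) + bias
    return 1 if score > 0 else -1


# Hypothetical toy data: the first classifier is always right, the others are noisy.
toy_votes = [(1, 1, -1), (1, -1, 1), (-1, 1, 1), (-1, -1, -1), (1, -1, -1), (-1, 1, -1)]
toy_labels = [v[0] for v in toy_votes]
w, b = train_perceptron(toy_votes, toy_labels)
print(w, b)
```

On this toy data the learned weights favour the reliable classifier, which is the behaviour a weighted ensemble is meant to capture.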
Sönksen, Ute Wolff; Christensen, Jens Jørgen; Nielsen, Lisbeth; Hesselbjerg, Annemarie; Hansen, Dennis Schrøder; Bruun, Brita
2010-12-31
Taxonomy and identification of fastidious Gram negatives are evolving and challenging. We compared identifications achieved with the Vitek 2 Neisseria-Haemophilus (NH) card and partial 16S rRNA gene sequence (526 bp stretch) analysis with identifications obtained with extensive phenotypic characterization using 100 fastidious Gram negative bacteria. Seventy-five strains represented 21 of the 26 taxa included in the Vitek 2 NH database and 25 strains represented related species not included in the database. Of the 100 strains, 31 were the type strains of the species. Vitek 2 NH identification results: 48 of 75 database strains were correctly identified, 11 strains gave "low discrimination", seven strains were unidentified, and nine strains were misidentified. Identification of 25 non-database strains resulted in 14 strains incorrectly identified as belonging to species in the database. Partial 16S rRNA gene sequence analysis results: For 76 strains phenotypic and sequencing identifications were identical, for 23 strains the sequencing identifications were either probable or possible, and for one strain only the genus was confirmed. Thus, the Vitek 2 NH system identifies most of the commonly occurring species included in the database. Some strains of rarely occurring species and strains of non-database species closely related to database species cause problems. Partial 16S rRNA gene sequence analysis performs well, but does not always suffice, additional phenotypical characterization being useful for final identification. PMID:21347215
Functional clustering of time series gene expression data by Granger causality
2012-01-01
Background A common approach for time series gene expression data analysis includes the clustering of genes with similar expression patterns throughout time. Clustered gene expression profiles point to the joint contribution of groups of genes to a particular cellular process. However, since genes belong to intricate networks, other features, besides comparable expression patterns, should provide additional information for the identification of functionally similar genes. Results In this study we perform gene clustering through the identification of Granger causality between and within sets of time series gene expression data. Granger causality is based on the idea that the cause of an event cannot come after its consequence. Conclusions This kind of analysis can be used as a complementary approach for functional clustering, wherein genes would be clustered not solely based on their expression similarity but on their topological proximity built according to the intensity of Granger causality among them. PMID:23107425
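The idea behind Granger causality can be made concrete with a small pairwise test: series x is said to Granger-cause series y if past values of x improve the prediction of y beyond what past values of y already achieve. The sketch below is an assumption-laden illustration, not the set-based method of the paper: it uses a single lag, ordinary least squares written in plain Python, and synthetic data; `granger_f` and its helpers are hypothetical names.

```python
import random


def _solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x


def _rss(rows, target):
    """Residual sum of squares of an ordinary least-squares fit of target on rows."""
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * t for r, t in zip(rows, target)) for i in range(k)]
    beta = _solve(xtx, xty)
    return sum((t - sum(b * v for b, v in zip(beta, r))) ** 2 for r, t in zip(rows, target))


def granger_f(x, y):
    """F statistic for 'x Granger-causes y' using one lag."""
    target = y[1:]
    restricted = [[1.0, y[t - 1]] for t in range(1, len(y))]        # past of y only
    full = [[1.0, y[t - 1], x[t - 1]] for t in range(1, len(y))]    # past of y and x
    rss_restricted = _rss(restricted, target)
    rss_full = _rss(full, target)
    dof = len(target) - 3  # observations minus parameters of the full model
    return (rss_restricted - rss_full) / (rss_full / dof)


# Synthetic example: y follows x with a delay of one time point.
rng = random.Random(0)
x = [rng.gauss(0, 1) for _ in range(200)]
y = [0.0] + [0.8 * x[t - 1] + 0.1 * rng.gauss(0, 1) for t in range(1, 200)]
f_xy = granger_f(x, y)  # expected to be large
f_yx = granger_f(y, x)  # expected to be small
print(f_xy, f_yx)
```

A larger F statistic indicates stronger evidence that the lagged series carries predictive information. A perfectly fitted full model (zero residual) would cause a division by zero, which this sketch does not guard against.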
The Essential Genome of Escherichia coli K-12
2018-01-01
ABSTRACT Transposon-directed insertion site sequencing (TraDIS) is a high-throughput method coupling transposon mutagenesis with short-fragment DNA sequencing. It is commonly used to identify essential genes. Single gene deletion libraries are considered the gold standard for identifying essential genes. Currently, the TraDIS method has not been benchmarked against such libraries, and therefore, it remains unclear whether the two methodologies are comparable. To address this, a high-density transposon library was constructed in Escherichia coli K-12. Essential genes predicted from sequencing of this library were compared to existing essential gene databases. To decrease false-positive identification of essential genes, statistical data analysis included corrections for both gene length and genome length. Through this analysis, new essential genes and genes previously incorrectly designated essential were identified. We show that manual analysis of TraDIS data reveals novel features that would not have been detected by statistical analysis alone. Examples include short essential regions within genes, orientation-dependent effects, and fine-resolution identification of genome and protein features. Recognition of these insertion profiles in transposon mutagenesis data sets will assist genome annotation of less well characterized genomes and provides new insights into bacterial physiology and biochemistry. PMID:29463657
Barcoding of fresh water fishes from Pakistan.
Karim, Asma; Iqbal, Asad; Akhtar, Rehan; Rizwan, Muhammad; Amar, Ali; Qamar, Usman; Jahan, Shah
2016-07-01
DNA barcoding is a taxonomic method that uses small genetic markers in an organism's mitochondrial DNA (mtDNA) to identify particular species. It uses sequence diversity in a 658-base pair fragment near the 5' end of the mitochondrial cytochrome c oxidase subunit 1 (CO1) gene as a tool for species identification. DNA barcoding is a more accurate and reliable method than morphological identification. It is equally useful in juvenile and adult stages of fishes. The present study was conducted to genetically identify three farm fish species of Pakistan (Cyprinus carpio, Cirrhinus mrigala, and Ctenopharyngodon idella). All of them belonged to the family Cyprinidae. The CO1 gene was amplified. PCR products were sequenced and analyzed with bioinformatics software. Conspecific, congeneric, and confamilial K2P nucleotide divergence was estimated. From these findings, it was concluded that the CO1 gene sequence may serve as a milestone for the identification of related species at the molecular level.
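The K2P (Kimura 2-parameter) divergence mentioned above can be computed from two aligned sequences with the standard formula d = -1/2 ln[(1 - 2P - Q) sqrt(1 - 2Q)], where P is the proportion of transitions and Q the proportion of transversions. The following is a minimal sketch with toy sequences, not the software used in the study; `k2p_distance` is a hypothetical helper name.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(seq_a, seq_b):
    """Kimura 2-parameter distance between two aligned DNA sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    transitions = transversions = compared = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue  # skip gaps and ambiguous bases
        compared += 1
        if a == b:
            continue
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1    # A<->G or C<->T
        else:
            transversions += 1  # purine <-> pyrimidine
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    # math.log raises ValueError for saturated (too divergent) pairs.
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))


# Toy 10-bp alignments (hypothetical, not real CO1 data).
one_transition = k2p_distance("AAAAAAAAAA", "GAAAAAAAAA")
one_transversion = k2p_distance("AAAAAAAAAA", "CAAAAAAAAA")
print(one_transition, one_transversion)
```

Conspecific, congeneric, and confamilial divergence would then be summaries of these pairwise distances within a species, within a genus, and within a family, respectively.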
Data on the genome-wide identification of CNL R-genes in Setaria italica (L.) P. Beauv.
Andersen, Ethan J; Nepal, Madhav P
2017-08-01
We report data associated with the identification of 242 disease resistance genes (R-genes) in the genome of Setaria italica as presented in "Genetic diversity of disease resistance genes in foxtail millet (Setaria italica L.)" (Andersen and Nepal, 2017) [1]. Our data describe the structure and evolution of the Coiled-coil, Nucleotide-binding site, Leucine-rich repeat (CNL) R-genes in foxtail millet. The CNL genes were identified through rigorous extraction and analysis of recently available plant genome sequences using cutting-edge analytical software. Data visualization includes gene structure diagrams, chromosomal syntenic maps, a chromosomal density plot, and a maximum-likelihood phylogenetic tree comparing Sorghum bicolor, Panicum virgatum, Setaria italica, and Arabidopsis thaliana. Compilation of InterProScan annotations, Gene Ontology (GO) annotations, and Basic Local Alignment Search Tool (BLAST) results for the 242 R-genes identified in the foxtail millet genome are also included in tabular format.
Uhlik, Ondrej; Strejcek, Michal; Junkova, Petra; Sanda, Miloslav; Hroudova, Miluse; Vlcek, Cestmir; Mackova, Martina; Macek, Tomas
2011-01-01
Bacteria that are able to utilize biphenyl as a sole source of carbon were extracted and isolated from polychlorinated biphenyl (PCB)-contaminated soil vegetated by horseradish. Isolates were identified using matrix-assisted laser desorption ionization-time of flight mass spectrometry (MALDI-TOF MS). The usage of MALDI Biotyper for the classification of isolates was evaluated and compared to 16S rRNA gene sequence analysis. A wide spectrum of bacteria was isolated, with Arthrobacter, Serratia, Rhodococcus, and Rhizobium being predominant. Arthrobacter isolates also represented the most diverse group. The use of MALDI Biotyper in many cases permitted the identification at the level of species, which was not achieved by 16S rRNA gene sequence analyses. However, some isolates had to be identified by 16S rRNA gene analyses if MALDI Biotyper-based identification was at the level of probable or not reliable identification, usually due to a lack of reference spectra included in the database. Overall, this study shows the possibility of using MALDI-TOF MS and MALDI Biotyper for the fast and relatively nonlaborious identification/classification of soil isolates. At the same time, it demonstrates the dominant role of employing 16S rRNA gene analyses for the identification of recently isolated strains that can later fill the gaps in the protein-based identification databases. PMID:21821747
Comparative Genomics and Host Resistance against Infectious Diseases
Qureshi, Salman T.; Skamene, Emil
1999-01-01
The large size and complexity of the human genome have limited the identification and functional characterization of components of the innate immune system that play a critical role in front-line defense against invading microorganisms. However, advances in genome analysis (including the development of comprehensive sets of informative genetic markers, improved physical mapping methods, and novel techniques for transcript identification) have reduced the obstacles to discovery of novel host resistance genes. Study of the genomic organization and content of widely divergent vertebrate species has shown a remarkable degree of evolutionary conservation and enables meaningful cross-species comparison and analysis of newly discovered genes. Application of comparative genomics to host resistance will rapidly expand our understanding of human immune defense by facilitating the translation of knowledge acquired through the study of model organisms. We review the rationale and resources for comparative genomic analysis and describe three examples of host resistance genes successfully identified by this approach. PMID:10081670
The Essential Genome of Escherichia coli K-12.
Goodall, Emily C A; Robinson, Ashley; Johnston, Iain G; Jabbari, Sara; Turner, Keith A; Cunningham, Adam F; Lund, Peter A; Cole, Jeffrey A; Henderson, Ian R
2018-02-20
Transposon-directed insertion site sequencing (TraDIS) is a high-throughput method coupling transposon mutagenesis with short-fragment DNA sequencing. It is commonly used to identify essential genes. Single gene deletion libraries are considered the gold standard for identifying essential genes. Currently, the TraDIS method has not been benchmarked against such libraries, and therefore, it remains unclear whether the two methodologies are comparable. To address this, a high-density transposon library was constructed in Escherichia coli K-12. Essential genes predicted from sequencing of this library were compared to existing essential gene databases. To decrease false-positive identification of essential genes, statistical data analysis included corrections for both gene length and genome length. Through this analysis, new essential genes and genes previously incorrectly designated essential were identified. We show that manual analysis of TraDIS data reveals novel features that would not have been detected by statistical analysis alone. Examples include short essential regions within genes, orientation-dependent effects, and fine-resolution identification of genome and protein features. Recognition of these insertion profiles in transposon mutagenesis data sets will assist genome annotation of less well characterized genomes and provides new insights into bacterial physiology and biochemistry. IMPORTANCE Incentives to define lists of genes that are essential for bacterial survival include the identification of potential targets for antibacterial drug development, genes required for rapid growth for exploitation in biotechnology, and discovery of new biochemical pathways. To identify essential genes in Escherichia coli, we constructed a transposon mutant library of unprecedented density. Initial automated analysis of the resulting data revealed many discrepancies compared to the literature.
We now report more extensive statistical analysis supported by both literature searches and detailed inspection of high-density TraDIS sequencing data for each putative essential gene for the E. coli model laboratory organism. This paper is important because it provides a better understanding of the essential genes of E. coli, reveals the limitations of relying on automated analysis alone, and provides a new standard for the analysis of TraDIS data. Copyright © 2018 Goodall et al.
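The gene-length correction mentioned in this abstract can be illustrated with an insertion index, that is, the number of unique transposon insertion sites in a gene divided by the gene's length. The sketch below is only an assumption about how such a normalisation might look, not the authors' pipeline: the fixed cutoff, the gene coordinates, and the function names are all hypothetical, and the abstract itself stresses that automated calls of this kind need manual inspection.

```python
def insertion_index(insertion_sites, start, end):
    """Unique insertion sites inside [start, end] divided by gene length.

    Coordinates are 1-based and inclusive.
    """
    length = end - start + 1
    unique_hits = {pos for pos in insertion_sites if start <= pos <= end}
    return len(unique_hits) / length


def candidate_essential_genes(genes, insertion_sites, cutoff):
    """Names of genes whose insertion index falls below a hypothetical cutoff."""
    return sorted(
        name
        for name, (start, end) in genes.items()
        if insertion_index(insertion_sites, start, end) < cutoff
    )


# Toy data: geneC carries no insertions and is flagged as a candidate.
toy_genes = {"geneA": (1, 20), "geneB": (40, 49), "geneC": (60, 79)}
toy_sites = [5, 5, 12, 40, 41, 42, 43, 90]
candidates = candidate_essential_genes(toy_genes, toy_sites, cutoff=0.05)
print(candidates)
```

Dividing by gene length keeps short genes, which receive few insertions by chance, from being compared directly with long genes on raw counts.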
Schmid, Jonas; Zehe, Anja; Vogel, Rudi F.
2016-01-01
As the number of bacterial genomes increases dramatically, the demand for easy to use tools with transparent functionality and comprehensible output for applied comparative genomics grows as well. We present BlAst Diagnostic Gene findEr (BADGE), a tool for the rapid prediction of diagnostic marker genes (DMGs) for the differentiation of bacterial groups (e.g. pathogenic / nonpathogenic). DMG identification settings can be modified easily and installing and running BADGE does not require specific bioinformatics skills. During the BADGE run the user is informed step by step about the DMG finding process, thus making it easy to evaluate the impact of chosen settings and options. On the basis of an example with relevance for beer brewing, being one of the oldest biotechnological processes known, we show a straightforward procedure, from phenotyping, genome sequencing, assembly and annotation, up to a discriminant marker gene PCR assay, making comparative genomics a means to an end. The value and the functionality of BADGE were thoroughly examined, resulting in the successful identification and validation of an outstanding novel DMG (fabZ) for the discrimination of harmless and harmful contaminations of Pediococcus damnosus, which can be applied for spoilage risk determination in breweries. Concomitantly, we present and compare five complete P. damnosus genomes sequenced in this study, finding that the ability to produce the unwanted, spoilage associated off-flavor diacetyl is a plasmid encoded trait in this important beer spoiling species. PMID:27028007
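The core idea of a diagnostic marker gene, a gene present in the target group and absent from the non-target group, can be sketched with simple set logic. This is not BADGE itself, which per its name works on BLAST comparisons; here gene presence is assumed to be known already, and the function name, thresholds, and genome and gene names are hypothetical.

```python
def diagnostic_marker_genes(presence, target, non_target,
                            min_target=1.0, max_non_target=0.0):
    """Genes found in at least min_target of the target genomes and in at most
    max_non_target of the non-target genomes (both given as fractions).

    presence : dict mapping genome name -> set of gene names
    """
    genes = set().union(*(presence[g] for g in target))
    markers = []
    for gene in sorted(genes):
        in_target = sum(gene in presence[g] for g in target) / len(target)
        in_other = sum(gene in presence[g] for g in non_target) / len(non_target)
        if in_target >= min_target and in_other <= max_non_target:
            markers.append(gene)
    return markers


# Toy presence/absence data for two spoiling and two harmless strains.
toy_presence = {
    "spoiler1": {"geneA", "geneB", "marker1"},
    "spoiler2": {"geneA", "marker1"},
    "harmless1": {"geneA", "geneB"},
    "harmless2": {"geneA"},
}
strict = diagnostic_marker_genes(
    toy_presence, ["spoiler1", "spoiler2"], ["harmless1", "harmless2"])
relaxed = diagnostic_marker_genes(
    toy_presence, ["spoiler1", "spoiler2"], ["harmless1", "harmless2"],
    min_target=0.5, max_non_target=0.5)
print(strict, relaxed)
```

Relaxing the two thresholds shows how adjustable settings change the candidate list, which is the kind of effect the abstract says users can evaluate step by step.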
Vu Manh, Thien-Phong; Elhmouzi-Younes, Jamila; Urien, Céline; Ruscanu, Suzana; Jouneau, Luc; Bourge, Mickaël; Moroldo, Marco; Foucras, Gilles; Salmon, Henri; Marty, Hélène; Quéré, Pascale; Bertho, Nicolas; Boudinot, Pierre; Dalod, Marc; Schwartz-Cornil, Isabelle
2015-01-01
Mononuclear phagocytes are organized in a complex system of ontogenetically and functionally distinct subsets, that has been best described in mouse and to some extent in human. Identification of homologous mononuclear phagocyte subsets in other vertebrate species of biomedical, economic, and environmental interest is needed to improve our knowledge in physiologic and physio-pathologic processes, and to design intervention strategies against a variety of diseases, including zoonotic infections. We developed a streamlined approach combining refined cell sorting and integrated comparative transcriptomics analyses which revealed conservation of the mononuclear phagocyte organization across human, mouse, sheep, pigs and, in some respect, chicken. This strategy should help democratizing the use of omics analyses for the identification and study of cell types across tissues and species. Moreover, we identified conserved gene signatures that enable robust identification and universal definition of these cell types. We identified new evolutionarily conserved gene candidates and gene interaction networks for the molecular regulation of the development or functions of these cell types, as well as conserved surface candidates for refined subset phenotyping throughout species. A phylogenetic analysis revealed that orthologous genes of the conserved signatures exist in teleost fishes and apparently not in Lamprey. PMID:26150816
GSNFS: Gene subnetwork biomarker identification of lung cancer expression data.
Doungpan, Narumol; Engchuan, Worrawat; Chan, Jonathan H; Meechai, Asawin
2016-12-05
Gene expression has been used to identify disease gene biomarkers, but there are ongoing challenges. Single gene or gene-set biomarkers are inadequate to provide sufficient understanding of complex disease mechanisms and the relationship among those genes. Network-based methods have thus been considered for inferring the interaction within a group of genes to further study the disease mechanism. Recently, the Gene-Network-based Feature Set (GNFS), which is capable of handling case-control and multiclass expression for gene biomarker identification, has been proposed, partly taking network topology into account. However, its performance relies on a greedy search for building subnetworks and thus requires further improvement. In this work, we establish a new approach named Gene Sub-Network-based Feature Selection (GSNFS) by implementing the GNFS framework with two proposed searching and scoring algorithms, namely gene-set-based (GS) search and parent-node-based (PN) search, to identify subnetworks. An additional dataset is used to validate the results. The two proposed searching algorithms of the GSNFS method for subnetwork expansion are concerned with the degree of connectivity and the scoring scheme for building subnetworks and their topology. For each iteration of expansion, the neighbour genes of a current subnetwork whose expression data improve the overall subnetwork score are recruited. While the GS search calculates the subnetwork score using an activity score of a current subnetwork and the gene expression values of its neighbours, the PN search uses the expression value of the corresponding parent of each neighbour gene. Four lung cancer expression datasets were used for subnetwork identification. In addition, the use of pathway data and protein-protein interaction as network data, in order to consider the interaction among significant genes, was discussed. 
Classification was performed to compare the performance of the identified gene subnetworks with three subnetwork identification algorithms. The two searching algorithms resulted in better classification and gene/gene-set agreement compared to the original greedy search of the GNFS method. The identified lung cancer subnetwork using the proposed searching algorithm resulted in an improvement of the cross-dataset validation and an increase in the consistency of findings between two independent datasets. The homogeneity measurement of the datasets was conducted to assess dataset compatibility in cross-dataset validation. The lung cancer dataset with higher homogeneity showed a better result when using the GS search while the dataset with low homogeneity showed a better result when using the PN search. The 10-fold cross-dataset validation on the independent lung cancer datasets showed higher classification performance of the proposed algorithms when compared with the greedy search in the original GNFS method. The proposed searching algorithms provide a higher number of genes in the subnetwork expansion step than the greedy algorithm. As a result, the performance of the subnetworks identified from the GSNFS method was improved in terms of classification performance and gene/gene-set level agreement depending on the homogeneity of the datasets used in the analysis. Some common genes obtained from the four datasets using different searching algorithms are genes known to play a role in lung cancer. The improvement of classification performance and the gene/gene-set level agreement, and the biological relevance indicated the effectiveness of the GSNFS method for gene subnetwork identification using expression data.
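The expansion step described in this abstract (recruit neighbour genes whose expression improves the subnetwork score) can be illustrated with a small sketch. This is not the GSNFS implementation: the scoring function (absolute difference of mean activity between two classes), the toy network, and all names such as `expand` and `score` are assumptions made for illustration only.

```python
from statistics import mean

def activity(genes, expr):
    """Per-sample mean expression over the genes of a subnetwork."""
    n_samples = len(next(iter(expr.values())))
    return [mean(expr[g][i] for g in genes) for i in range(n_samples)]

def score(genes, expr, labels):
    """Toy score (assumption): |mean activity of cases - mean activity of controls|."""
    act = activity(genes, expr)
    cases = [a for a, y in zip(act, labels) if y == 1]
    controls = [a for a, y in zip(act, labels) if y == 0]
    return abs(mean(cases) - mean(controls))

def expand(seed, network, expr, labels):
    """Grow a subnetwork from a seed gene, recruiting improving neighbours."""
    sub = {seed}
    best = score(sub, expr, labels)
    changed = True
    while changed:
        changed = False
        neighbours = {n for g in sub for n in network.get(g, ())} - sub
        for n in sorted(neighbours):  # sorted for a deterministic order
            s = score(sub | {n}, expr, labels)
            if s > best:  # recruit only if the subnetwork score improves
                sub.add(n)
                best = s
                changed = True
    return sub

# Hypothetical toy data: four genes, two controls (0) and two cases (1).
network = {"A": ["B", "C"], "B": ["A", "D"], "C": ["A"], "D": ["B"]}
expr = {"A": [1, 1, 3, 3], "B": [0, 0, 4, 4], "C": [2, 2, 2, 2], "D": [5, 5, 1, 1]}
labels = [0, 0, 1, 1]
subnetwork = expand("A", network, expr, labels)
```

On this toy input the seed A recruits B, while C (uninformative) and D (opposite direction) are rejected because adding them lowers the score.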
Gene prioritization and clustering by multi-view text mining
2010-01-01
Background Text mining has become a useful tool for biologists trying to understand the genetics of diseases. In particular, it can help identify the most interesting candidate genes for a disease for further experimental analysis. Many text mining approaches have been introduced, but the effect of disease-gene identification varies in different text mining models. Thus, the idea of incorporating more text mining models may be beneficial to obtain more refined and accurate knowledge. However, how to effectively combine these models still remains a challenging question in machine learning. In particular, it is a non-trivial issue to guarantee that the integrated model performs better than the best individual model. Results We present a multi-view approach to retrieve biomedical knowledge using different controlled vocabularies. These controlled vocabularies are selected on the basis of nine well-known bio-ontologies and are applied to index the vast amounts of gene-based free-text information available in the MEDLINE repository. The text mining result specified by a vocabulary is considered as a view and the obtained multiple views are integrated by multi-source learning algorithms. We investigate the effect of integration in two fundamental computational disease gene identification tasks: gene prioritization and gene clustering. The performance of the proposed approach is systematically evaluated and compared on real benchmark data sets. In both tasks, the multi-view approach demonstrates significantly better performance than other comparing methods. Conclusions In practical research, the relevance of specific vocabulary pertaining to the task is usually unknown. In such case, multi-view text mining is a superior and promising strategy for text-based disease gene identification. PMID:20074336
Bork, Peer
2018-02-14
The U.S. Department of Energy Joint Genome Institute (JGI) invited scientists interested in the application of genomics to bioenergy and environmental issues, as well as all current and prospective users and collaborators, to attend the annual DOE JGI Genomics of Energy & Environment Meeting held March 22-24, 2011 in Walnut Creek, Calif. The emphasis of this meeting was on the genomics of renewable energy strategies, carbon cycling, environmental gene discovery, and engineering of fuel-producing organisms. The meeting featured presentations by leading scientists advancing these topics. Peer Bork of the European Molecular Biology Laboratory spoke on "Comparative Metagenomics of Gut and Ocean: Identification of Microbial Marker Genes for Complex Environmental Properties" at the 6th annual Genomics of Energy & Environment Meeting on March 23, 2011.
Cui, Peng; Zhong, Tingyan; Wang, Zhuo; Wang, Tao; Zhao, Hongyu; Liu, Chenglin; Lu, Hui
2018-06-01
Circadian genes are expressed periodically with an approximately 24-h period, and the identification and study of these genes can provide deep understanding of the circadian control which plays significant roles in human health. Although many circadian gene identification algorithms have been developed, large numbers of false positives and low coverage are still major problems in this field. In this study we constructed a novel computational framework for circadian gene identification using deep neural networks (DNN) - a deep learning algorithm which can represent the raw form of data patterns without imposing assumptions on the expression distribution. Firstly, we transformed time-course gene expression data into categorical-state data to denote the changing trend of gene expression. Two distinct expression patterns emerged after clustering of the state data for circadian genes from our manually created learning dataset. DNN was then applied to discriminate the aperiodic genes and the two subtypes of periodic genes. In order to assess the performance of DNN, four commonly used machine learning methods including k-nearest neighbors, logistic regression, naïve Bayes, and support vector machines were used for comparison. The results show that the DNN model achieves the best balanced precision and recall. Next, we conducted large scale circadian gene detection using the trained DNN model for the remaining transcription profiles. Compared with JTK_CYCLE and a study performed by Möller-Levet et al. (doi: https://doi.org/10.1073/pnas.1217154110), we identified 1132 novel periodic genes. Through the functional analysis of these novel circadian genes, we found that the GTPase superfamily exhibits distinct circadian expression patterns and may provide a molecular switch of circadian control of the functioning of the immune system in human blood.
Our study provides novel insights into both the circadian gene identification field and the study of complex circadian-driven biological control. This article is part of a Special Issue entitled: Accelerating Precision Medicine through Genetic and Genomic Big Data Analysis edited by Yudong Cai & Tao Huang. Copyright © 2017. Published by Elsevier B.V.
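The abstract mentions transforming time-course expression into categorical-state data that denote the changing trend. The exact encoding is not given, so the following is only a plausible sketch: each consecutive pair of time points is mapped to up, down, or flat. The function name `to_states`, the letters used, and the tolerance `tol` are assumptions.

```python
def to_states(series, tol=0.0):
    """Encode the trend between consecutive time points.

    'U' = expression rises by more than tol,
    'D' = expression falls by more than tol,
    'F' = change within +/- tol (flat).
    """
    states = []
    for prev, cur in zip(series, series[1:]):
        diff = cur - prev
        if diff > tol:
            states.append("U")
        elif diff < -tol:
            states.append("D")
        else:
            states.append("F")
    return "".join(states)

# Hypothetical expression profile sampled at six time points.
profile = [1.0, 2.0, 3.0, 2.0, 1.0, 1.0]
state_string = to_states(profile)
```

A state string such as this discards absolute expression levels and keeps only the shape of the curve, which is one way a classifier could be fed trend information without distributional assumptions.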
Pacheco, Luis G C; Mattos-Guaraldi, Ana L; Santos, Carolina S; Veras, Adonney A O; Guimarães, Luis C; Abreu, Vinícius; Pereira, Felipe L; Soares, Siomar C; Dorella, Fernanda A; Carvalho, Alex F; Leal, Carlos G; Figueiredo, Henrique C P; Ramos, Juliana N; Vieira, Veronica V; Farfour, Eric; Guiso, Nicole; Hirata, Raphael; Azevedo, Vasco; Silva, Artur; Ramos, Rommel T J
2015-01-01
Non-diphtheriae Corynebacterium species have been increasingly recognized as the causative agents of infections in humans. Differential identification of these bacteria in the clinical microbiology laboratory by the most commonly used biochemical tests is challenging, and normally requires additional molecular methods. Herein, we present the annotated draft genome sequences of two isolates of "difficult-to-identify" human-pathogenic corynebacterial species: C. xerosis and C. minutissimum. The genome sequences of ca. 2.7 Mbp, with a mean number of 2,580 protein encoding genes, were also compared with the publicly available genome sequences of strains of C. amycolatum and C. striatum. These results will aid the exploration of novel biochemical reactions to improve existing identification tests as well as the development of more accurate molecular identification methods through detection of species-specific target genes for isolate's identification or drug susceptibility profiling.
Liu, Jun-Jun; Xiang, Yu
2011-01-01
WRKY transcription factors are key regulators of numerous biological processes in plant growth and development, as well as plant responses to abiotic and biotic stresses. Research on biological functions of plant WRKY genes has focused in the past on model plant species or species with largely characterized transcriptomes. However, a variety of non-model plants, such as forest conifers, are essential as feed, biofuel, and wood or for sustainable ecosystems. Identification of WRKY genes in these non-model plants is equally important for understanding the evolutionary and function-adaptive processes of this transcription factor family. Because of limited genomic information, the rarity of regulatory gene mRNAs in transcriptomes, and the sequence divergence to model organism genes, identification of transcription factors in non-model plants using methods similar to those generally used for model plants is difficult. This chapter describes a gene family discovery strategy for identification of WRKY transcription factors in conifers by a combination of in silico-based prediction and PCR-based experimental approaches. Compared to traditional cDNA library screening or EST sequencing at transcriptome scales, this integrated gene discovery strategy provides fast, simple, reliable, and specific methods to unveil the WRKY gene family at both genome and transcriptome levels in non-model plants.
CORECLUST: identification of the conserved CRM grammar together with prediction of gene regulation.
Nikulova, Anna A; Favorov, Alexander V; Sutormin, Roman A; Makeev, Vsevolod J; Mironov, Andrey A
2012-07-01
Identification of transcriptional regulatory regions and tracing their internal organization are important for understanding the eukaryotic cell machinery. Cis-regulatory modules (CRMs) of higher eukaryotes are believed to possess a regulatory 'grammar', or preferred arrangement of binding sites, that is crucial for proper regulation and thus tends to be evolutionarily conserved. Here, we present a method CORECLUST (COnservative REgulatory CLUster STructure) that predicts CRMs based on a set of positional weight matrices. Given regulatory regions of orthologous and/or co-regulated genes, CORECLUST constructs a CRM model by revealing the conserved rules that describe the relative location of binding sites. The constructed model may be consequently used for the genome-wide prediction of similar CRMs, and thus detection of co-regulated genes, and for the investigation of the regulatory grammar of the system. Compared with related methods, CORECLUST shows better performance at identification of CRMs conferring muscle-specific gene expression in vertebrates and early-developmental CRMs in Drosophila.
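CORECLUST takes positional weight matrices as its input description of binding sites. As background only, the sketch below shows how a sequence can be scanned with a single matrix to locate candidate sites; it does not reproduce the CORECLUST model of site arrangement. The matrix values, the cutoff, and the name `pwm_hits` are hypothetical.

```python
def pwm_hits(seq, pwm, cutoff):
    """Return (position, score) for every window scoring at least cutoff.

    pwm is a list with one dict per motif position, mapping base -> score.
    """
    width = len(pwm)
    hits = []
    for i in range(len(seq) - width + 1):
        s = sum(pwm[j][seq[i + j]] for j in range(width))
        if s >= cutoff:
            hits.append((i, s))
    return hits

# Hypothetical 3-position matrix favouring the motif "TAT".
toy_pwm = [
    {"A": -1, "C": -1, "G": -1, "T": 2},
    {"A": 2, "C": -1, "G": -1, "T": -1},
    {"A": -1, "C": -1, "G": -1, "T": 2},
]
hits = pwm_hits("GGTATCC", toy_pwm, cutoff=5)
```

The relative positions of hits from several matrices are the kind of raw material from which a conserved arrangement of sites could then be inferred.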
Diehn, Till A.; Pommerrenig, Benjamin; Bernhardt, Nadine; Hartmann, Anja; Bienert, Gerd P.
2015-01-01
Aquaporins (AQPs) are essential channel proteins that regulate plant water homeostasis and the uptake and distribution of uncharged solutes such as metalloids, urea, ammonia, and carbon dioxide. Despite their importance as crop plants, little is known about AQP gene and protein function in cabbage (Brassica oleracea) and other Brassica species. The recent releases of the genome sequences of B. oleracea and Brassica rapa allow comparative genomic studies in these species to investigate the evolution and features of Brassica genes and proteins. In this study, we identified all AQP genes in B. oleracea by a genome-wide survey. In total, 67 genes of four plant AQP subfamilies were identified. Their full-length gene sequences and locations on chromosomes and scaffolds were manually curated. The identification of six additional full-length AQP sequences in the B. rapa genome added to the recently published AQP protein family of this species. A phylogenetic analysis of AQPs of Arabidopsis thaliana, B. oleracea, and B. rapa allowed us to follow AQP evolution in closely related species and to systematically classify and (re-) name these isoforms. Thirty-three groups of AQP-orthologous genes were identified between B. oleracea and Arabidopsis and their expression was analyzed in different organs. The two selectivity filters, gene structure and coding sequences were highly conserved within each AQP subfamily while sequence variations in some introns and untranslated regions were frequent. These data suggest a similar substrate selectivity and function of Brassica AQPs compared to Arabidopsis orthologs. The comparative analyses of all AQP subfamilies in three Brassicaceae species give initial insights into AQP evolution in these taxa. Based on the genome-wide AQP identification in B. oleracea and the sequence analysis and reprocessing of Brassica AQP information, our dataset provides a sequence resource for further investigations of the physiological and molecular functions of Brassica crop AQPs. PMID:25904922
Taxonomic Resolutions Based on 18S rRNA Genes: A Case Study of Subclass Copepoda
Wu, Shu; Xiong, Jie; Yu, Yuhe
2015-01-01
Biodiversity studies are commonly conducted using 18S rRNA genes. In this study, we compared the inter-species divergence of variable regions (V1–9) within the copepod 18S rRNA gene, and tested their taxonomic resolutions at different taxonomic levels. Our results indicate that the 18S rRNA gene is a good molecular marker for the study of copepod biodiversity, and our conclusions are as follows: 1) 18S rRNA genes are highly conserved intra-species (intra-species similarities are close to 100%); and could aid in species-level analyses, but with some limitations; 2) nearly-whole-length sequences and some partial regions (around V2, V4, and V9) of the 18S rRNA gene can be used to discriminate between samples at both the family and order levels (with a success rate of about 80%); 3) compared with other regions, V9 has a higher resolution at the genus level (with an identification success rate of about 80%); and 4) V7 is most divergent in length, and would be a good candidate marker for the phylogenetic study of Acartia species. This study also evaluated the correlation between similarity thresholds and the accuracy of using nuclear 18S rRNA genes for the classification of organisms in the subclass Copepoda. We suggest that sample identification accuracy should be considered when a molecular sequence divergence threshold is used for taxonomic identification, and that the lowest similarity threshold should be determined based on a pre-designated level of acceptable accuracy. PMID:26107258
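The recommendation to tie the similarity threshold to an acceptable accuracy level presupposes a threshold-based assignment step. A minimal sketch of such a step follows, assuming pre-aligned sequences of equal length; the reference names, sequences, and thresholds are hypothetical and are not taken from the study.

```python
def percent_identity(a, b):
    """Percent identity of two aligned sequences, ignoring gapped columns."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    columns = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not columns:
        return 0.0
    return 100.0 * sum(x == y for x, y in columns) / len(columns)

def classify(query, references, threshold):
    """Assign the query to the most similar reference if it passes threshold.

    Returns (name or None, best percent identity).
    """
    best_name, best_pid = None, -1.0
    for name, seq in references.items():
        pid = percent_identity(query, seq)
        if pid > best_pid:
            best_name, best_pid = name, pid
    return (best_name if best_pid >= threshold else None), best_pid

# Hypothetical aligned fragments.
references = {"taxonA": "ACGTACGTAC", "taxonB": "ACGTTTGTAC"}
query = "ACGTACGTAA"
loose = classify(query, references, threshold=85.0)
strict = classify(query, references, threshold=95.0)
```

Raising the threshold turns a confident assignment into "unidentified", which is the trade-off between identification rate and accuracy that the abstract asks users to set deliberately.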
Identification of genes regulated during mechanical load-induced cardiac hypertrophy
NASA Technical Reports Server (NTRS)
Johnatty, S. E.; Dyck, J. R.; Michael, L. H.; Olson, E. N.; Abdellatif, M.; Schneider, M. (Principal Investigator)
2000-01-01
Cardiac hypertrophy is associated with both adaptive and adverse changes in gene expression. To identify genes regulated by pressure overload, we performed suppressive subtractive hybridization between cDNA from the hearts of aortic-banded (7-day) and sham-operated mice. In parallel, we performed a subtraction between an adult and a neonatal heart, for the purpose of comparing different forms of cardiac hypertrophy. Sequencing more than 100 clones led to the identification of an array of functionally known (70%) and unknown genes (30%) that are upregulated during cardiac growth. At least nine of those genes were preferentially expressed in both the neonatal and pressure-overload hearts. Northern blot analysis of whether some of the identified genes were upregulated in the load-independent calcineurin-induced cardiac hypertrophy mouse model revealed its incomplete similarity with the former models of cardiac growth. Copyright 2000 Academic Press.
Chen, Rui; Jiang, Li-Yun; Qiao, Ge-Xia
2012-01-01
The mitochondrial gene COI has been widely used by taxonomists as a standard DNA barcode sequence for the identification of many animal species. However, the COI region is of limited use for identifying certain species and is not efficiently amplified by PCR in all animal taxa. To evaluate the utility of COI as a DNA barcode and to identify other barcode genes, we chose the aphid subfamily Lachninae (Hemiptera: Aphididae) as the focus of our study. We compared the results obtained using COI with two other mitochondrial genes, COII and Cytb. In addition, we propose a new method to improve the efficiency of species identification using DNA barcoding. Three mitochondrial genes (COI, COII and Cytb) were sequenced and were used in the identification of over 80 species of Lachninae. The COI and COII genes demonstrated a greater PCR amplification efficiency than Cytb. Species identification using COII sequences had a higher frequency of success (96.9% in "best match" and 90.8% in "best close match") and yielded lower intra- and higher interspecific genetic divergence values than the other two markers. The use of "tag barcodes" is a new approach that involves attaching a species-specific tag to the standard DNA barcode. With this method, the "barcoding overlap" can be nearly eliminated. As a result, we were able to increase the identification success rate from 83.9% to 95.2% by using COI and the "best close match" technique. A COII-based identification system should be more effective in identifying lachnine species than COI or Cytb. However, the Cytb gene is an effective marker for the study of aphid population genetics due to its high sequence diversity. Furthermore, the use of "tag barcodes" can improve the accuracy of DNA barcoding identification by reducing or removing the overlap between intra- and inter-specific genetic divergence values.
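The "best match" and "best close match" criteria named in this abstract can be sketched as follows. The sketch assumes the usual reading of these terms (nearest reference barcode, optionally required to fall within a distance threshold) and uses an uncorrected p-distance on equal-length sequences; the reference library, sequences, and thresholds are invented for illustration.

```python
def p_distance(a, b):
    """Proportion of differing sites between two equal-length sequences."""
    return sum(x != y for x, y in zip(a, b)) / len(a)

def best_match(query, library):
    """Species of the nearest reference barcode, or 'ambiguous' on ties
    between different species. Returns (species, distance)."""
    dists = [(p_distance(query, seq), species) for species, seq in library]
    dmin = min(d for d, _ in dists)
    nearest = {species for d, species in dists if d == dmin}
    name = nearest.pop() if len(nearest) == 1 else "ambiguous"
    return name, dmin

def best_close_match(query, library, threshold):
    """As best_match, but rejects matches farther away than threshold."""
    species, dist = best_match(query, library)
    return species if dist <= threshold else "no match"

# Hypothetical reference library of (species, barcode) pairs.
library = [
    ("sp1", "AAAAAAAAAA"),
    ("sp1", "AAAAAAAAAT"),
    ("sp2", "AAAATTTTTT"),
]
query = "AAAAAAAATT"
bm = best_match(query, library)
bcm_strict = best_close_match(query, library, threshold=0.05)
bcm_loose = best_close_match(query, library, threshold=0.15)
```

The stricter criterion explains why success rates under "best close match" are lower than under "best match" for the same marker: queries without a sufficiently close reference are left unidentified rather than assigned.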
Lin, Michael F.; Deoras, Ameya N.; Rasmussen, Matthew D.; Kellis, Manolis
2008-01-01
Comparative genomics of multiple related species is a powerful methodology for the discovery of functional genomic elements, and its power should increase with the number of species compared. Here, we use 12 Drosophila genomes to study the power of comparative genomics metrics to distinguish between protein-coding and non-coding regions. First, we study the relative power of different comparative metrics and their relationship to single-species metrics. We find that even relatively simple multi-species metrics robustly outperform advanced single-species metrics, especially for shorter exons (≤240 nt), which are common in animal genomes. Moreover, the two capture largely independent features of protein-coding genes, with different sensitivity/specificity trade-offs, such that their combinations lead to even greater discriminatory power. In addition, we study how discovery power scales with the number and phylogenetic distance of the genomes compared. We find that species at a broad range of distances are comparably effective informants for pairwise comparative gene identification, but that these are surpassed by multi-species comparisons at similar evolutionary divergence. In particular, while pairwise discovery power plateaued at larger distances and never outperformed the most advanced single-species metrics, multi-species comparisons continued to benefit even from the most distant species with no apparent saturation. Last, we find that genes in functional categories typically considered fast-evolving can nonetheless be recovered at very high rates using comparative methods. Our results have implications for comparative genomics analyses in any species, including the human. PMID:18421375
Hypertension and cancer are prevalent diseases. Epidemiological studies suggest that hypertension may increase the long term risk of cancer. Identification of resistance and/or susceptibility genes using rodent models could provide important insights into the management and treat...
Alagarasan, Ganesh; Dubey, Mahima; Aswathy, Kumar S; Chandel, Girish
2017-01-01
Genes in the ZIP family encode transcripts to store and transport bivalent metal micronutrients, particularly iron (Fe) and/or zinc (Zn). These transcripts are important for a variety of functions involved in the developmental and physiological processes in many plant species, including most, if not all, Poaceae plant species and the model species Arabidopsis. Here, we present the report of a genome-wide investigation of orthologous ZIP genes in Setaria italica and the identification of 7 single-copy genes. RT-PCR shows 4 of them could be used to increase the bio-availability of zinc and iron content in grains. Of 36 ZIP members, 25 genes have traces of signal peptide based sub-cellular localization, as compared to those of plant species studied previously, yet translocation of ions remains unclear. In silico analysis of gene structure and protein nature suggests that these two were preeminent in shaping the functional diversity of the ZIP gene family in S. italica. NAC, bZIP and bHLH are the predominant Fe and Zn responsive transcription factors present in SiZIP genes. Together, our results provide new insights into the signal peptide based/independent iron and zinc translocation in the plant system and allowed identification of ZIP genes that may be involved in zinc and iron absorption from the soil and in their transport to the cereal grain, underlying high micronutrient accumulation.
USDA-ARS's Scientific Manuscript database
Focusing on the identification of pathogenicity gene content, we leveraged the reference genomes of Fusarium pathogens F. oxysporum f. sp. lycopersici (tomato-infecting) and F. solani (pea-infecting) and their well-characterised core and dispensable chromosomes to predict genomic organisation in the...
Freytag, Virginie; Probst, Sabine; Hadziselimovic, Nils; Boglari, Csaba; Hauser, Yannick; Peter, Fabian; Gabor Fenyves, Bank; Milnik, Annette; Demougin, Philippe; Vukojevic, Vanja; de Quervain, Dominique J-F; Papassotiropoulos, Andreas; Stetak, Attila
2017-07-12
The identification of genes related to encoding, storage, and retrieval of memories is a major interest in neuroscience. In the current study, we analyzed the temporal gene expression changes in a neuronal mRNA pool during an olfactory long-term associative memory (LTAM) in Caenorhabditis elegans hermaphrodites. Here, we identified a core set of 712 (538 upregulated and 174 downregulated) genes that follows three distinct temporal peaks demonstrating multiple gene regulation waves in LTAM. Compared with the previously published positive LTAM gene set (Lakhina et al., 2015), 50% of the identified upregulated genes here overlap with the previous dataset, possibly representing stimulus-independent memory-related genes. On the other hand, the remaining genes were not previously identified in positive associative memory and may specifically regulate aversive LTAM. Our results suggest a multistep gene activation process during the formation and retrieval of long-term memory and define general memory-implicated genes as well as conditioning-type-dependent gene sets. SIGNIFICANCE STATEMENT The identification of genes regulating different steps of memory is of major interest in neuroscience. Identification of common memory genes across different learning paradigms and the temporal activation of the genes are poorly studied. Here, we investigated the temporal aspects of Caenorhabditis elegans gene expression changes using aversive olfactory associative long-term memory (LTAM) and identified three major gene activation waves. Like in previous studies, aversive LTAM is also CREB dependent, and CREB activity is necessary immediately after training. Finally, we define a list of memory paradigm-independent core gene sets as well as conditioning-dependent genes. Copyright © 2017 the authors 0270-6474/17/376661-12$15.00/0.
Comparative Genome Sequence Analysis of the Bpa/Str Region in Mouse and Man
Mallon, A.-M.; Platzer, M.; Bate, R.; Gloeckner, G.; Botcherby, M.R.M.; Nordsiek, G.; Strivens, M.A.; Kioschis, P.; Dangel, A.; Cunningham, D.; Straw, R.N.A.; Weston, P.; Gilbert, M.; Fernando, S.; Goodall, K.; Hunter, G.; Greystrong, J.S.; Clarke, D.; Kimberley, C.; Goerdes, M.; Blechschmidt, K.; Rump, A.; Hinzmann, B.; Mundy, C.R.; Miller, W.; Poustka, A.; Herman, G.E.; Rhodes, M.; Denny, P.; Rosenthal, A.; Brown, S.D.M.
2000-01-01
The progress of human and mouse genome sequencing programs presages the possibility of systematic cross-species comparison of the two genomes as a powerful tool for gene and regulatory element identification. As the opportunities to perform comparative sequence analysis emerge, it is important to develop parameters for such analyses and to examine the outcomes of cross-species comparison. Our analysis used gene prediction and a database search of 430 kb of genomic sequence covering the Bpa/Str region of the mouse X chromosome, and 745 kb of genomic sequence from the homologous human X chromosome region. We identified 11 genes in mouse and 13 genes and two pseudogenes in human. In addition, we compared the mouse and human sequences using pairwise alignment and searches for evolutionary conserved regions (ECRs) exceeding a defined threshold of sequence identity. This approach aided the identification of at least four further putative conserved genes in the region. Comparative sequencing revealed that this region is a mosaic in evolutionary terms, with considerably more rearrangement between the two species than realized previously from comparative mapping studies. Surprisingly, this region showed an extremely high LINE and low SINE content, low G+C content, and yet a relatively high gene density, in contrast to the low gene density usually associated with such regions. [The sequence data described in this paper have been submitted to EMBL under the following accession nos.: Mouse Genomic Sequence: Mouse contig A (AL021127), Mouse contig B (AL049866), BAC41M10 (AL136328), PAC303O11(AL136329). Human Genomic Sequence: Human contig 1 (U82671, U82670), Human contig 2 (U82695).] PMID:10854409
Singh, Vikas K; Khan, Aamir W; Saxena, Rachit K; Sinha, Pallavi; Kale, Sandip M; Parupalli, Swathi; Kumar, Vinay; Chitikineni, Annapurna; Vechalapu, Suryanarayana; Sameer Kumar, Chanda Venkata; Sharma, Mamta; Ghanta, Anuradha; Yamini, Kalinati Narasimhan; Muniswamy, Sonnappa; Varshney, Rajeev K
2017-07-01
Identification of candidate genomic regions associated with target traits using conventional mapping methods is challenging and time-consuming. In recent years, a number of single nucleotide polymorphism (SNP)-based mapping approaches have been developed and used for identification of candidate/putative genomic regions. However, in the majority of these studies, insertion-deletions (Indels) were largely ignored. For efficient use of Indels in mapping target traits, we propose the Indel-seq approach, which is a combination of whole-genome resequencing (WGRS) and bulked segregant analysis (BSA) and relies on the Indel frequencies in extreme bulks. Deployment of the Indel-seq approach for identification of candidate genomic regions associated with fusarium wilt (FW) and sterility mosaic disease (SMD) resistance in pigeonpea has identified 16 Indels affecting 26 putative candidate genes. Of these 26 affected putative candidate genes, 24 genes showed effect in the upstream/downstream of the genic region and two genes showed effect in the genes. Validation of these 16 candidate Indels in other FW- and SMD-resistant and FW- and SMD-susceptible genotypes revealed a significant association of five Indels (three for FW and two for SMD resistance). Comparative analysis of Indel-seq with other genetic mapping approaches highlighted the importance of the approach in identification of significant genomic regions associated with target traits. Therefore, the Indel-seq approach can be used for quick and precise identification of candidate genomic regions for any target traits in any crop species. © 2016 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
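The abstract states only that the approach relies on Indel frequencies in the extreme bulks. One simple way to use such frequencies, shown here purely as an assumption and not as the published Indel-seq statistic, is to compare the fraction of Indel-supporting reads between the resistant and susceptible bulks. The field names, read counts, and the `min_delta` cutoff are hypothetical.

```python
def indel_frequency(alt_reads, depth):
    """Fraction of reads at a site that support the Indel allele."""
    return alt_reads / depth if depth else 0.0

def candidate_indels(records, min_delta=0.8):
    """Keep Indels whose frequency differs strongly between the two bulks.

    Each record holds Indel-supporting reads and total depth per bulk.
    Returns a list of (indel id, frequency difference).
    """
    candidates = []
    for r in records:
        f_res = indel_frequency(r["res_alt"], r["res_depth"])
        f_sus = indel_frequency(r["sus_alt"], r["sus_depth"])
        delta = f_res - f_sus
        if abs(delta) >= min_delta:
            candidates.append((r["id"], round(delta, 3)))
    return candidates

# Hypothetical read counts for two Indels in the resistant/susceptible bulks.
records = [
    {"id": "indel_1", "res_alt": 19, "res_depth": 20, "sus_alt": 1, "sus_depth": 20},
    {"id": "indel_2", "res_alt": 10, "res_depth": 20, "sus_alt": 9, "sus_depth": 20},
]
selected = candidate_indels(records)
```

An Indel near a causal locus would be expected to be close to fixed in one bulk and rare in the other, whereas unlinked Indels segregate at similar frequencies in both.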
Exploiting induced variation to dissect quantitative traits in barley.
Druka, Arnis; Franckowiak, Jerome; Lundqvist, Udda; Bonar, Nicola; Alexander, Jill; Guzy-Wrobelska, Justyna; Ramsay, Luke; Druka, Ilze; Grant, Iain; Macaulay, Malcolm; Vendramin, Vera; Shahinnia, Fahimeh; Radovic, Slobodanka; Houston, Kelly; Harrap, David; Cardle, Linda; Marshall, David; Morgante, Michele; Stein, Nils; Waugh, Robbie
2010-04-01
The identification of genes underlying complex quantitative traits such as grain yield by means of conventional genetic analysis (positional cloning) requires the development of several large mapping populations. However, it is possible that phenotypically related, but more extreme, allelic variants generated by mutational studies could provide a means for more efficient cloning of QTLs (quantitative trait loci). In barley (Hordeum vulgare), with the development of high-throughput genome analysis tools, efficient genome-wide identification of genetic loci harbouring mutant alleles has recently become possible. Genotypic data from NILs (near-isogenic lines) that carry induced or natural variants of genes that control aspects of plant development can be compared with the location of QTLs to potentially identify candidate genes for development-related traits such as grain yield. As yield itself can be divided into a number of allometric component traits such as tillers per plant, kernels per spike and kernel size, mutant alleles that both affect these traits and are located within the confidence intervals for major yield QTLs may represent extreme variants of the underlying genes. In addition, the development of detailed comparative genomic models based on the alignment of a high-density barley gene map with the rice and sorghum physical maps, has enabled an informed prioritization of 'known function' genes as candidates for both QTLs and induced mutant genes.
Microarray-based identification of differentially expressed genes in extramammary Paget’s disease
Lin, Jin-Ran; Liang, Jun; Zhang, Qiao-An; Huang, Qiong; Wang, Shang-Shang; Qin, Hai-Hong; Chen, Lian-Jun; Xu, Jin-Hua
2015-01-01
Extramammary Paget’s disease (EMPD) is a rare cutaneous malignancy accounting for approximately 1-2% of vulvar cancers. The rarity of this disease has caused difficulties in characterization and the molecular mechanism underlying EMPD development remains largely unclear. Here we used microarray analysis to identify differentially expressed genes in EMPD of the scrotum compared with normal epithelium from healthy donors. Agilent single-channel microarray was used to compare the gene expression between 6 EMPD specimens and 6 normal scrotum epithelium samples. A total of 799 up-regulated genes and 723 down-regulated genes were identified in EMPD tissues. Real-time PCR was conducted to verify the differential expression of some representative genes, including ERBB4, TCF3, PAPSS2, PIK3R3, PRLR, SULT1A1, TCF7L1, and CREB3L4. Generally, the real-time PCR results were consistent with microarray data, and ERBB4, PRLR, TCF3, PIK3R3, SULT1A1, and TCF7L1 were significantly overexpressed in EMPD (P<0.05). Moreover, the overexpression in EMPD of PRLR, a receptor for the anterior pituitary hormone prolactin (PRL), was confirmed by immunohistochemistry. These data demonstrate that the differentially expressed genes from the microarray-based identification are tightly associated with EMPD occurrence. PMID:26221264
Schuemie, Martijn J; Mons, Barend; Weeber, Marc; Kors, Jan A
2007-06-01
Gene and protein name identification in text requires a dictionary approach to relate synonyms to the same gene or protein, and to link names to external databases. However, existing dictionaries are incomplete. We investigate two complementary methods for automatic generation of a comprehensive dictionary: combination of information from existing gene and protein databases and rule-based generation of spelling variations. Both methods have been reported in literature before, but have hitherto not been combined and evaluated systematically. We combined gene and protein names from several existing databases of four different organisms. The combined dictionaries showed a substantial increase in recall on three different test sets, as compared to any single database. Application of 23 spelling variation rules to the combined dictionaries further increased recall. However, many rules appeared to have no effect and some appear to have a detrimental effect on precision.
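The two methods described above, merging names from several databases and generating spelling variations by rule, can be sketched as follows. This is a minimal illustration only: the three rules (hyphen/space interchange, a separator at a letter/digit boundary, case folding) and the function names are assumptions made for this example, not the 23 rules evaluated in the study.

```python
import re


def spelling_variants(name):
    """Return a set of surface forms for one gene/protein name.

    The rules are illustrative assumptions, not the rules from the study.
    """
    variants = {name}
    # Rule 1: hyphens and spaces are interchangeable, or may be dropped.
    for v in list(variants):
        variants.add(v.replace("-", " "))
        variants.add(v.replace(" ", "-"))
        variants.add(v.replace("-", "").replace(" ", ""))
    # Rule 2: allow a space or hyphen where a letter is followed by a digit.
    for v in list(variants):
        variants.add(re.sub(r"(?<=[A-Za-z])(?=\d)", " ", v))
        variants.add(re.sub(r"(?<=[A-Za-z])(?=\d)", "-", v))
    # Rule 3: ignore case.
    return {v.lower() for v in variants}


def build_dictionary(databases):
    """Merge {identifier: [names]} mappings and expand every name."""
    lookup = {}
    for database in databases:
        for gene_id, names in database.items():
            for name in names:
                for variant in spelling_variants(name):
                    lookup.setdefault(variant, set()).add(gene_id)
    return lookup


# Hypothetical toy databases sharing one identifier.
combined = build_dictionary([{"G1": ["IL-2"]}, {"G1": ["interleukin 2"]}])
print(sorted(combined))
```

Each added rule enlarges the set of matched strings, which is why recall tends to rise while precision can fall, as the abstract reports for some rules.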
Identification of Cell Cycle-Regulated Genes by Convolutional Neural Network.
Liu, Chenglin; Cui, Peng; Huang, Tao
2017-01-01
Cell cycle-regulated genes are expressed periodically across the cell cycle stages, and the identification and study of these genes can provide a deep understanding of the cell cycle process. Large numbers of false positives and low overlap are major problems in cell cycle-regulated gene detection. Here, a computational framework called DLGene was proposed for cell cycle-regulated gene detection. It is based on the convolutional neural network, a deep learning algorithm that represents data patterns in raw form without assumptions about their distribution. First, the expression data were transformed to categorical state data to denote the changing state of gene expression, and four different expression patterns were revealed for the reported cell cycle-regulated genes. Then, DLGene was applied to discriminate the non-cell cycle genes and the four subtypes of cell cycle genes. Its performance was compared with six traditional machine learning methods. Finally, the biological functions of representative cell cycle genes for each subtype were analyzed. Our method showed better and more balanced sensitivity and specificity than other machine learning algorithms. The cell cycle genes had expression patterns very different from those of non-cell cycle genes, and among the cell cycle genes there were four subtypes. Our method not only detects the cell cycle genes, but also describes their expression patterns, such as when the highest expression level is reached and how expression changes with time. For each type, we analyzed the biological functions of the representative genes and such results provided novel insight into the cell cycle mechanisms. Copyright © Bentham Science Publishers.
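The abstract does not say how expression values were converted to categorical states, so the following is only one plausible sketch of such a transformation. The three-state encoding ("up", "down", "flat"), the `tolerance` parameter and the function name are all assumptions made for illustration.

```python
def to_states(expression, tolerance=0.1):
    """Encode a time series as the direction of change between time points.

    Hypothetical encoding: the study's actual state definition is not given.
    """
    states = []
    for previous, current in zip(expression, expression[1:]):
        change = current - previous
        if change > tolerance:
            states.append("up")
        elif change < -tolerance:
            states.append("down")
        else:
            states.append("flat")
    return states


# Toy profile with one rise, one plateau and one fall.
print(to_states([1.0, 2.0, 2.05, 1.0]))
```

A categorical sequence of this kind discards absolute expression levels and keeps only the shape of the profile, which is the property a pattern classifier would need.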
Rue-Albrecht, Kévin; McGettigan, Paul A; Hernández, Belinda; Nalpas, Nicolas C; Magee, David A; Parnell, Andrew C; Gordon, Stephen V; MacHugh, David E
2016-03-11
Identification of gene expression profiles that differentiate experimental groups is critical for discovery and analysis of key molecular pathways and also for selection of robust diagnostic or prognostic biomarkers. While integration of differential expression statistics has been used to refine gene set enrichment analyses, such approaches are typically limited to single gene lists resulting from simple two-group comparisons or time-series analyses. In contrast, functional class scoring and machine learning approaches provide powerful alternative methods to leverage molecular measurements for pathway analyses, and to compare continuous and multi-level categorical factors. We introduce GOexpress, a software package for scoring and summarising the capacity of gene ontology features to simultaneously classify samples from multiple experimental groups. GOexpress integrates normalised gene expression data (e.g., from microarray and RNA-seq experiments) and phenotypic information of individual samples with gene ontology annotations to derive a ranking of genes and gene ontology terms using a supervised learning approach. The default random forest algorithm allows interactions between all experimental factors, and competitive scoring of expressed genes to evaluate their relative importance in classifying predefined groups of samples. GOexpress enables rapid identification and visualisation of ontology-related gene panels that robustly classify groups of samples and supports both categorical (e.g., infection status, treatment) and continuous (e.g., time-series, drug concentrations) experimental factors. The use of standard Bioconductor extension packages and publicly available gene ontology annotations facilitates straightforward integration of GOexpress within existing computational biology pipelines.
Kong, Wei; Mou, Xiaoyang; Di, Benteng; Deng, Jin; Zhong, Ruxing; Wang, Shuaiqun
2017-11-20
Dysregulated pathway identification is an important task that can provide insight into the underlying biological processes of disease. Current pathway-identification methods focus on a set of co-expressed genes and single pathways and ignore the correlation between genes and pathways. The method proposed in this study takes into account the internal correlations not only between genes but also between pathways to identify dysregulated pathways related to Alzheimer's disease (AD), the most common form of dementia. In order to find the significantly differential genes for AD, mutual information (MI) is used to measure interdependencies between genes rather than expression values. Then, by integrating the topology information from KEGG, the significant pathways involved in the feature genes are identified. Next, the distance correlation (DC) is applied to measure the pairwise pathway crosstalk, since DC has the advantage of detecting nonlinear correlations when compared to Pearson correlation. Finally, the pathway pairs with significantly different correlations between normal and AD samples are identified as dysregulated pathways. The molecular biology analysis demonstrated that many dysregulated pathways related to AD pathogenesis were discovered successfully by the internal correlation detection. Furthermore, insights into the dysregulated pathways in the development and deterioration of AD will help to find new effective target genes and provide important theoretical guidance for drug design. Copyright © Bentham Science Publishers.
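The stated advantage of distance correlation over Pearson correlation can be checked on a small example. The sketch below implements the standard sample distance correlation for two one-dimensional series in plain Python; the toy data and function names are assumptions for illustration and do not reproduce the study's pathway-level computation.

```python
import math


def _centered_distances(values):
    """Pairwise absolute distances, double-centered by row and grand means."""
    n = len(values)
    dist = [[abs(a - b) for b in values] for a in values]
    row_means = [sum(row) / n for row in dist]
    grand_mean = sum(row_means) / n
    # The distance matrix is symmetric, so row means equal column means.
    return [[dist[i][j] - row_means[i] - row_means[j] + grand_mean
             for j in range(n)] for i in range(n)]


def distance_correlation(x, y):
    """Sample distance correlation of two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    n = len(x)
    a, b = _centered_distances(x), _centered_distances(y)
    pairs = [(i, j) for i in range(n) for j in range(n)]
    dcov2 = sum(a[i][j] * b[i][j] for i, j in pairs) / n ** 2
    dvar_x = sum(a[i][j] ** 2 for i, j in pairs) / n ** 2
    dvar_y = sum(b[i][j] ** 2 for i, j in pairs) / n ** 2
    denominator = math.sqrt(dvar_x * dvar_y)
    if denominator == 0:
        return 0.0
    return math.sqrt(max(dcov2, 0.0) / denominator)


def pearson(x, y):
    """Pearson correlation coefficient."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((p - mx) * (q - my) for p, q in zip(x, y))
    sx = math.sqrt(sum((p - mx) ** 2 for p in x))
    sy = math.sqrt(sum((q - my) ** 2 for q in y))
    return cov / (sx * sy)


# A purely quadratic relationship: linearly uncorrelated, yet dependent.
x = [-2, -1, 0, 1, 2]
y = [v ** 2 for v in x]
print(pearson(x, y), distance_correlation(x, y))
```

On this toy series the Pearson coefficient is zero while the distance correlation is clearly positive, which is the kind of nonlinear dependence the abstract refers to.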
Dehury, Budheswar; Panda, Debashis; Sahu, Jagajjit; Sahu, Mousumi; Sarma, Kishore; Barooah, Madhumita; Sen, Priyabrata; Modi, Mahendra Kumar
2013-01-01
The endogenous small non-coding micro RNAs (miRNAs), which are typically ~21–24 nucleotides (nt) long, play a crucial role in regulating the intrinsic normal growth of cells and development of plants as well as in maintaining the integrity of genomes. These small non-coding RNAs function as the universal specificity factors in post-transcriptional gene silencing. Discovering miRNAs, identifying their targets, and further inferring miRNA functions is a routine process to understand normal biological processes of miRNAs and their roles in the development of plants. A comparative genomics-based approach using expressed sequence tags (ESTs) and genome survey sequences (GSSs) offers a cost-effective platform for identification and characterization of miRNAs and their target genes in plants. Despite the fact that sweet potato (Ipomoea batatas L.) is an important staple food source for poor small farmers throughout the world, the role of miRNA in various developmental processes remains largely unknown. In this paper, we report the computational identification of miRNAs and their target genes in sweet potato from their ESTs. Using a comparative genomics-based approach, 8 potential miRNA candidates belonging to the miR168, miR2911, and miR156 families were identified from 23 406 ESTs in sweet potato. A total of 42 target genes were predicted and their probable functions were illustrated. Most of the newly identified miRNAs target transcription factors as well as genes involved in plant growth and development, signal transduction, metabolism, defense, and stress response. The identification of miRNAs and their targets is expected to accelerate the pace of miRNA discovery, leading to an improved understanding of the role of miRNA in development and physiology of sweet potato, as well as stress response. PMID:24067297
ITEP: an integrated toolkit for exploration of microbial pan-genomes.
Benedict, Matthew N; Henriksen, James R; Metcalf, William W; Whitaker, Rachel J; Price, Nathan D
2014-01-03
Comparative genomics is a powerful approach for studying variation in physiological traits as well as the evolution and ecology of microorganisms. Recent technological advances have enabled sequencing large numbers of related genomes in a single project, requiring computational tools for their integrated analysis. In particular, accurate annotations and identification of gene presence and absence are critical for understanding and modeling the cellular physiology of newly sequenced genomes. Although many tools are available to compare the gene contents of related genomes, new tools are necessary to enable close examination and curation of protein families from large numbers of closely related organisms, to integrate curation with the analysis of gain and loss, and to generate metabolic networks linking the annotations to observed phenotypes. We have developed ITEP, an Integrated Toolkit for Exploration of microbial Pan-genomes, to curate protein families, compute similarities to externally-defined domains, analyze gene gain and loss, and generate draft metabolic networks from one or more curated reference network reconstructions in groups of related microbial species among which the combination of core and variable genes constitutes their "pan-genomes". The ITEP toolkit consists of: (1) a series of modular command-line scripts for identification, comparison, curation, and analysis of protein families and their distribution across many genomes; (2) a set of Python libraries for programmatic access to the same data; and (3) pre-packaged scripts to perform common analysis workflows on a collection of genomes. ITEP's capabilities include de novo protein family prediction, ortholog detection, analysis of functional domains, identification of core and variable genes and gene regions, sequence alignments and tree generation, annotation curation, and the integration of cross-genome analysis and metabolic networks for study of metabolic network evolution.
ITEP is a powerful, flexible toolkit for generation and curation of protein families. ITEP's modular design allows for straightforward extension as analysis methods and tools evolve. By integrating comparative genomics with the development of draft metabolic networks, ITEP harnesses the power of comparative genomics to build confidence in links between genotype and phenotype and helps disambiguate gene annotations when they are evaluated in both evolutionary and metabolic network contexts.
Gene expression complex networks: synthesis, identification, and analysis.
Lopes, Fabrício M; Cesar, Roberto M; Costa, Luciano Da F
2011-10-01
Thanks to recent advances in molecular biology, allied to an ever increasing amount of experimental data, the functional state of thousands of genes can now be extracted simultaneously by using methods such as cDNA microarrays and RNA-Seq. Particularly important related investigations are the modeling and identification of gene regulatory networks from expression data sets. Such knowledge is fundamental for many applications, such as disease treatment, therapeutic intervention strategies and drug design, as well as for planning new high-throughput experiments. Methods have been developed for gene network modeling and identification from expression profiles. However, an important open problem is how to validate such approaches and their results. This work presents an objective approach for validation of gene network modeling and identification which comprises the following three main aspects: (1) Artificial Gene Networks (AGNs) model generation through theoretical models of complex networks, which is used to simulate temporal expression data; (2) a computational method for gene network identification from the simulated data, which is founded on a feature selection approach where a target gene is fixed and the expression profile is observed for all other genes in order to identify a relevant subset of predictors; and (3) validation of the identified AGN-based network through comparison with the original network. The proposed framework allows several types of AGNs to be generated and used in order to simulate temporal expression data. The results of the network identification method can then be compared to the original network in order to estimate its properties and accuracy. Some of the most important theoretical models of complex networks have been assessed: the uniformly-random Erdős-Rényi (ER), the small-world Watts-Strogatz (WS), the scale-free Barabási-Albert (BA), and geographical networks (GG).
The experimental results indicate that the inference method was sensitive to average degree
Schmitt, Bryan H; Cunningham, Scott A; Dailey, Aaron L; Gustafson, Daniel R; Patel, Robin
2013-03-01
Identification of anaerobic bacteria using phenotypic methods is often time-consuming; methods such as 16S rRNA gene sequencing are costly and may not be readily available. We evaluated 253 clinical isolates of anaerobic bacteria using the Bruker MALDI Biotyper (Bruker Daltonics, Billerica, MA) matrix-assisted laser desorption ionization-time of flight mass spectrometry (MALDI-TOF MS) system with a user-supplemented database and an on-plate formic acid-based preparation method and compared results to those of conventional identification using biochemical testing or 16S rRNA gene sequencing. A total of 179 (70.8%) and 232 (91.7%) isolates were correctly identified to the species and genus levels, respectively, using manufacturer-recommended score cutoffs. MALDI-TOF MS offers a rapid, inexpensive method for identification of anaerobic bacteria.
OrthoMCL: Identification of Ortholog Groups for Eukaryotic Genomes
Li, Li; Stoeckert, Christian J.; Roos, David S.
2003-01-01
The identification of orthologous groups is useful for genome annotation, studies on gene/protein evolution, comparative genomics, and the identification of taxonomically restricted sequences. Methods successfully exploited for prokaryotic genome analysis have proved difficult to apply to eukaryotes, however, as larger genomes may contain multiple paralogous genes, and sequence information is often incomplete. OrthoMCL provides a scalable method for constructing orthologous groups across multiple eukaryotic taxa, using a Markov Cluster algorithm to group (putative) orthologs and paralogs. This method performs similarly to the INPARANOID algorithm when applied to two genomes, but can be extended to cluster orthologs from multiple species. OrthoMCL clusters are coherent with groups identified by EGO, but improved recognition of “recent” paralogs permits overlapping EGO groups representing the same gene to be merged. Comparison with previously assigned EC annotations suggests a high degree of reliability, implying utility for automated eukaryotic genome annotation. OrthoMCL has been applied to the proteome data set from seven publicly available genomes (human, fly, worm, yeast, Arabidopsis, the malaria parasite Plasmodium falciparum, and Escherichia coli). A Web interface allows queries based on individual genes or user-defined phylogenetic patterns (http://www.cbil.upenn.edu/gene-family). Analysis of clusters incorporating P. falciparum genes identifies numerous enzymes that were incompletely annotated in first-pass annotation of the parasite genome. PMID:12952885
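The Markov Cluster step mentioned above can be illustrated with a simplified, dense-matrix sketch. It alternates expansion (matrix squaring) and inflation (element-wise powering followed by column normalization) until the matrix stops changing. It is not OrthoMCL's implementation: the similarity matrix, the parameter defaults and the function names are assumptions, and the real pipeline derives its edge weights from sequence comparisons that are not modeled here.

```python
def normalize_columns(matrix):
    """Scale every column so that it sums to 1 (column-stochastic matrix)."""
    n = len(matrix)
    sums = [sum(matrix[i][j] for i in range(n)) for j in range(n)]
    return [[matrix[i][j] / sums[j] if sums[j] else 0.0 for j in range(n)]
            for i in range(n)]


def matmul(a, b):
    """Plain dense matrix product for square matrices."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def markov_cluster(similarity, inflation=2.0, max_iterations=100,
                   tolerance=1e-9):
    """Simplified Markov clustering of a symmetric similarity matrix."""
    n = len(similarity)
    matrix = [[float(similarity[i][j]) for j in range(n)] for i in range(n)]
    for i in range(n):
        matrix[i][i] = max(matrix[i][i], 1.0)  # add self-loops
    matrix = normalize_columns(matrix)
    for _ in range(max_iterations):
        expanded = matmul(matrix, matrix)                      # expansion
        inflated = normalize_columns(
            [[value ** inflation for value in row] for row in expanded])
        delta = max(abs(inflated[i][j] - matrix[i][j])
                    for i in range(n) for j in range(n))
        matrix = inflated
        if delta < tolerance:
            break
    # Rows that keep non-zero mass are attractors; their non-zero columns
    # are the members of the corresponding cluster.
    clusters = {frozenset(j for j in range(n) if matrix[i][j] > 1e-6)
                for i in range(n)}
    return sorted(sorted(cluster) for cluster in clusters if cluster)


# Toy input: proteins 0-2 resemble each other, as do proteins 3-5.
toy = [[1 if (i < 3) == (j < 3) and i != j else 0 for j in range(6)]
       for i in range(6)]
print(markov_cluster(toy))
```

A higher `inflation` value tends to produce more, smaller clusters, which is the usual tuning knob for this family of algorithms.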
Rapid identification of acetic acid bacteria using MALDI-TOF mass spectrometry fingerprinting.
Andrés-Barrao, Cristina; Benagli, Cinzia; Chappuis, Malou; Ortega Pérez, Ruben; Tonolla, Mauro; Barja, François
2013-03-01
Acetic acid bacteria (AAB) are widespread microorganisms characterized by their ability to transform alcohols and sugar-alcohols into their corresponding organic acids. The suitability of matrix-assisted laser desorption ionization-time of flight mass spectrometry (MALDI-TOF MS) for the identification of cultured AAB involved in the industrial production of vinegar was evaluated on 64 reference strains from the genera Acetobacter, Gluconacetobacter and Gluconobacter. Analysis of MS spectra obtained from single colonies of these strains confirmed their basic classification based on comparative 16S rRNA gene sequence analysis. MALDI-TOF analyses of isolates from vinegar cross-checked by comparative sequence analysis of 16S rRNA gene fragments allowed AAB to be identified, and it was possible to differentiate them from mixed cultures and non-AAB. The results showed that MALDI-TOF MS analysis was a rapid and reliable method for the clustering and identification of AAB species. Copyright © 2012 Elsevier GmbH. All rights reserved.
The opportunities and challenges of large-scale molecular approaches to songbird neurobiology
Mello, C.V.; Clayton, D.F.
2014-01-01
High-throughput methods for analyzing genome structure and function are having a large impact in songbird neurobiology. Methods include genome sequencing and annotation, comparative genomics, DNA microarrays and transcriptomics, and the development of a brain atlas of gene expression. Key emerging findings include the identification of complex transcriptional programs active during singing, the robust brain expression of non-coding RNAs, evidence of profound variations in gene expression across brain regions, and the identification of molecular specializations within song production and learning circuits. Current challenges include the statistical analysis of large datasets, effective genome curation, the efficient localization of gene expression changes to specific neuronal circuits and cells, and the dissection of behavioral and environmental factors that influence brain gene expression. The field requires efficient methods for comparisons with organisms like chicken, which offer important anatomical, functional and behavioral contrasts. As sequencing costs plummet, opportunities emerge for comparative approaches that may help reveal evolutionary transitions contributing to vocal learning, social behavior and other properties that make songbirds such compelling research subjects. PMID:25280907
Identification of essential genes in Streptococcus pneumoniae by allelic replacement mutagenesis.
Song, Jae-Hoon; Ko, Kwan Soo; Lee, Ji-Young; Baek, Jin Yang; Oh, Won Sup; Yoon, Ha Sik; Jeong, Jin-Yong; Chun, Jongsik
2005-06-30
To find potential targets of novel antimicrobial agents, we identified essential genes of Streptococcus pneumoniae using comparative genomics and allelic replacement mutagenesis. We compared the genome of S. pneumoniae R6 with those of Bacillus subtilis, Enterococcus faecalis, Escherichia coli, and Staphylococcus aureus, and selected 693 candidate target genes with > 40% amino acid sequence identity to the corresponding genes in at least two of the other species. The 693 genes were disrupted and 133 were found to be essential for growth. Of these, 32 encoded proteins of unknown function, and we were able to identify orthologues of 22 of these genes by genomic comparisons. The experimental method used in this study is easy to perform, rapid and efficient for identifying essential genes of bacterial pathogens.
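The candidate-selection criterion described above (more than 40% amino acid identity to the corresponding gene in at least two of the other species) amounts to a simple filter. The sketch below assumes the identities have already been computed; the gene names and identity values are hypothetical placeholders, not data from the study.

```python
def select_candidates(identities, threshold=40.0, min_species=2):
    """Keep genes whose identity exceeds `threshold` in enough species.

    `identities` maps gene -> {species: percent amino acid identity}.
    """
    selected = []
    for gene, hits in identities.items():
        conserved_in = sum(1 for percent in hits.values()
                           if percent > threshold)
        if conserved_in >= min_species:
            selected.append(gene)
    return sorted(selected)


# Hypothetical identity table for three made-up genes.
example = {
    "geneA": {"B. subtilis": 62.0, "E. faecalis": 71.5, "E. coli": 38.0},
    "geneB": {"B. subtilis": 45.0, "E. coli": 22.0, "S. aureus": 40.0},
    "geneC": {"E. faecalis": 41.0, "S. aureus": 55.3},
}
print(select_candidates(example))
```

The comparison is strict (`>`), matching the "> 40%" wording, so a gene at exactly 40% identity in a species does not count toward the two-species requirement.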
The skeletal and heart muscle triacylglycerol lipolysis revisited.
Knapp, M; Gorski, J
2017-02-01
For 40 years, the enzyme hormone sensitive lipase was considered to hydrolyze the first ester bond of the triacylglycerol moiety and thus initiate hydrolysis. However, 12 years ago a new lipolytic enzyme, termed adipose triglyceride lipase, was discovered. It was further shown that the process of lipolysis of triacylglycerol to diacylglycerol and fatty acid is initiated by adipose triglyceride lipase and not by hormone sensitive lipase, which is responsible for hydrolysis of diacylglycerol to monoacylglycerol and fatty acid. Adipose triglyceride lipase is present in all types of cells containing neutral fat. The enzyme is activated by a protein called comparative gene identification-58 and inhibited by a protein called G0/G1 switch protein 2. It has also been discovered that perilipins, the main proteins coating lipid droplets in the cells, are involved in the process of triacylglycerol lipolysis. Five perilipins (1-5) have been identified; however, up to now their role has been poorly assessed. In skeletal muscles, exercise and training affect the mRNA expression and protein content of adipose triglyceride lipase, comparative gene identification-58, G0/G1 switch protein 2, and perilipins 2 and 5. The effect of exercise/training depends on exercise intensity and type of muscle fiber. An interaction between comparative gene identification-58 and adipose triglyceride lipase seems to be responsible for the enzyme activation during contractile activity. Adipose triglyceride lipase is also responsible for the activation of the first step of triacylglycerol lipolysis in the heart. There is substantial evidence that cardiac triacylglycerol metabolism affects the function of the heart. ATGL gene mutations lead to the development of neutral lipid storage diseases.
Comparative prion disease gene expression profiling using the prion disease mimetic, cuprizone
Moody, Laura R; Herbst, Allen J; Yoo, Han Sang; Vanderloo, Joshua P
2009-01-01
Identification of genes expressed in response to prion infection may elucidate biomarkers for disease, identify factors involved in agent replication, mechanisms of neuropathology and therapeutic targets. Although several groups have sought to identify gene expression changes specific to prion disease, expression profiles rife with cell population changes have consistently been identified. Cuprizone, a neurotoxicant, qualitatively mimics the cell population changes observed in prion disease, resulting in both spongiform change and astrocytosis. The use of cuprizone-treated animals as an experimental control during comparative expression profiling allows for the identification of transcripts whose expression increases during prion disease and remains unchanged during cuprizone-triggered neuropathology. In this study, expression profiles from the brains of mice preclinically and clinically infected with Rocky Mountain Laboratory (RML) mouse-adapted scrapie agent and age-matched controls were profiled using Affymetrix gene arrays. In total, 164 genes were differentially regulated during prion infection. Eighty-three of these transcripts have been previously undescribed as differentially regulated during prion disease. A 0.4% cuprizone diet was utilized as a control for comparative expression profiling. Cuprizone treatment induced spongiosis and astrocyte proliferation as indicated by glial fibrillary acidic protein (Gfap) transcriptional activation and immunohistochemistry. Gene expression profiles from brain tissue obtained from cuprizone-treated mice identified 307 differentially regulated transcript changes. After comparative analysis, 17 transcripts unaffected by cuprizone treatment but increasing in expression from preclinical to clinical prion infection were identified. Here we describe the novel use of the prion disease mimetic, cuprizone, to control for cell population changes in the brain during prion infection. PMID:19535908
Shortening tobacco life cycle accelerates functional gene identification in genomic research.
Ning, G; Xiao, X; Lv, H; Li, X; Zuo, Y; Bao, M
2012-11-01
Definitive allocation of function requires the introduction of genetic mutations and analysis of their phenotypic consequences. Novel, rapid and convenient techniques or materials are very important and useful to accelerate gene identification in functional genomics research. Here, over-expression of PmFT (Prunus mume), a novel FT orthologue, and PtFT (Populus tremula) led to shortening of the tobacco life cycle. A series of novel, stable short life cycle tobacco lines (30-50 days) were developed through repeated self-crossing selection breeding. Based on a second transformation via a gusA reporter gene, the promoter from BpFULL1 in silver birch (Betula pendula) and the gene (CPC) from Arabidopsis thaliana were effectively tested using short life cycle tobacco lines. Comparative analysis among wild type, short life cycle tobacco and the Arabidopsis transformation system verified that it is feasible to accelerate functional gene studies by shortening the life cycle of the host plant material, at least in these short life cycle tobacco lines. The results verified that the novel short life cycle transgenic tobacco lines not only combine the advantages of economic nursery requirements and a simple transformation system, but also provide a robust, effective and stable host system to accelerate gene analysis. Thus, shortening the tobacco life cycle is a feasible strategy to accelerate heterologous or homologous functional gene identification in genomic research. © 2012 German Botanical Society and The Royal Botanical Society of the Netherlands.
Four Linked Genes Participate in Controlling Sporulation Efficiency in Budding Yeast
Ben-Ari, Giora; Zenvirth, Drora; Sherman, Amir; David, Lior; Klutstein, Michael; Lavi, Uri; Hillel, Jossi; Simchen, Giora
2006-01-01
Quantitative traits are conditioned by several genetic determinants. Since such genes influence many important complex traits in various organisms, the identification of quantitative trait loci (QTLs) is of major interest, but still encounters serious difficulties. We detected four linked genes within one QTL, which participate in controlling sporulation efficiency in Saccharomyces cerevisiae. Following the identification of single nucleotide polymorphisms by comparing the sequences of 145 genes between the parental strains SK1 and S288c, we analyzed the segregating progeny of the cross between them. Through reciprocal hemizygosity analysis, four genes, RAS2, PMS1, SWS2, and FKH2, located in a region of 60 kilobases on Chromosome 14, were found to be associated with sporulation efficiency. Three of the four “high” sporulation alleles are derived from the “low” sporulating strain. Two of these sporulation-related genes were verified through allele replacements. For RAS2, the causative variation was suggested to be a single nucleotide difference in the upstream region of the gene. This quantitative trait nucleotide accounts for sporulation variability among a set of ten closely related winery yeast strains. Our results provide a detailed view of genetic complexity in one “QTL region” that controls a quantitative trait and reports a single nucleotide polymorphism-trait association in wild strains. Moreover, these findings have implications on QTL identification in higher eukaryotes. PMID:17112318
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yang, Xiaohan; Ye, Chuyu; Bisaria, Anjali
2011-01-01
Populus is an important bioenergy crop for bioethanol production. A greater understanding of cell wall biosynthesis processes is critical in reducing biomass recalcitrance, a major hindrance in efficient generation of ethanol from lignocellulosic biomass. Here, we report the identification of candidate cell wall biosynthesis genes through the development and application of a novel bioinformatics pipeline. As a first step, via text-mining of PubMed publications, we obtained 121 Arabidopsis genes that had experimental evidence supporting their involvement in cell wall biosynthesis or remodeling. The 121 genes were then used as bait genes to query an Arabidopsis co-expression database and additional genes were identified as neighbors of the bait genes in the network, increasing the number of genes to 548. The 548 Arabidopsis genes were then used to re-query the Arabidopsis co-expression database and re-construct a network that captured additional network neighbors, expanding to a total of 694 genes. The 694 Arabidopsis genes were computationally divided into 22 clusters. Queries of the Populus genome using the Arabidopsis genes revealed 817 Populus orthologs. Functional analysis of gene ontology and tissue-specific gene expression indicated that these Arabidopsis and Populus genes are high-likelihood candidates for functional genomics in relation to cell wall biosynthesis.
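The two rounds of bait-gene expansion described above (121 to 548 to 694 genes) can be sketched as repeated neighbor lookups in a co-expression network. The adjacency dictionary, the gene labels and the function name below are hypothetical; the abstract does not describe the database interface or any edge-weight cutoff that may have been applied.

```python
def expand_with_neighbors(network, seeds):
    """Return the seed genes plus their direct co-expression neighbors.

    `network` maps each gene to an iterable of co-expressed genes.
    """
    expanded = set(seeds)
    for gene in seeds:
        expanded.update(network.get(gene, ()))
    return expanded


# Hypothetical toy network; "bait1" stands in for a text-mined bait gene.
coexpression = {
    "bait1": ["g2", "g3"],
    "g2": ["bait1", "g4"],
    "g3": ["bait1"],
    "g4": ["g2", "g5"],
    "g5": ["g4"],
}
first_round = expand_with_neighbors(coexpression, {"bait1"})
second_round = expand_with_neighbors(coexpression, first_round)
print(sorted(first_round), sorted(second_round))
```

Each round can only add genes one step further from the original baits, so the second query captures neighbors of neighbors, mirroring the growth reported in the abstract.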
9. international mouse genome conference
DOE Office of Scientific and Technical Information (OSTI.GOV)
NONE
This conference was held November 12-16, 1995 in Ann Arbor, Michigan. The purpose of this conference was to provide a multidisciplinary forum for exchange of state-of-the-art information on genetic mapping in mice. This report contains abstracts of presentations, focusing on the following areas: mutation identification; comparative mapping; informatics and complex traits; mutagenesis; gene identification and new technology; and genetic and physical mapping.
Huang, Hung-Chung; Jupiter, Daniel; VanBuren, Vincent
2010-01-01
Background: Identification of genes with switch-like properties will facilitate discovery of regulatory mechanisms that underlie these properties, and will provide knowledge for the appropriate application of Boolean networks in gene regulatory models. As switch-like behavior is likely associated with tissue-specific expression, these gene products are expected to be plausible candidates as tissue-specific biomarkers. Methodology/Principal Findings: In a systematic classification of genes and search for biomarkers, gene expression profiles (GEPs) of more than 16,000 genes from 2,145 mouse array samples were analyzed. Four distribution metrics (mean, standard deviation, kurtosis and skewness) were used to classify GEPs into four categories: predominantly-off, predominantly-on, graded (rheostatic), and switch-like genes. The arrays under study were also grouped and examined by tissue type. For example, arrays were categorized as ‘brain group’ and ‘non-brain group’; the Kolmogorov-Smirnov distance and Pearson correlation coefficient were then used to compare GEPs between brain and non-brain for each gene. We were thus able to identify tissue-specific biomarker candidate genes. Conclusions/Significance: The methodology employed here may be used to facilitate disease-specific biomarker discovery. PMID:20140228
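The four distribution metrics and the Kolmogorov-Smirnov distance named above can be computed as in the sketch below. It uses population moments and excess kurtosis, which is an assumption, since the abstract does not state which estimators were used, and it does not attempt the four-way classification because no thresholds are given. The expression values are invented for illustration.

```python
import math
from bisect import bisect_right


def distribution_metrics(values):
    """Return (mean, standard deviation, skewness, excess kurtosis)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    if m2 == 0:
        return mean, 0.0, 0.0, 0.0
    return mean, math.sqrt(m2), m3 / m2 ** 1.5, m4 / m2 ** 2 - 3.0


def ks_distance(sample_a, sample_b):
    """Two-sample KS statistic: largest gap between the empirical CDFs."""
    a, b = sorted(sample_a), sorted(sample_b)
    points = sorted(set(a) | set(b))
    return max(abs(bisect_right(a, p) / len(a) - bisect_right(b, p) / len(b))
               for p in points)


# Invented profile that is either fully off or fully on across arrays.
switch_like = [0, 0, 0, 0, 10, 10, 10, 10]
print(distribution_metrics(switch_like))

# Invented brain vs non-brain values for one gene.
print(ks_distance([8.1, 9.0, 8.7], [1.2, 0.9, 1.5]))
```

A two-state profile like the first example has strongly negative excess kurtosis, which is one reason kurtosis is a natural metric for separating bimodal, switch-like genes from graded ones.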
Ayeni, Funmilola A; Andersen, Camilla; Nørskov-Lauritsen, Niels
2017-04-01
Mannitol salt agar (MSA) is often used in resource-limited laboratories for identification of S. aureus; however, coagulase-negative staphylococci (CoNS) also grow and ferment mannitol on MSA. A total of 171 strains of CoNS that had previously been misidentified as S. aureus due to growth on MSA were collected from different locations in Nigeria, and two methods for identification of CoNS, ViTEK 2 and MALDI-TOF MS, were compared, with partial 16S rRNA gene sequencing as the gold standard. Partial tuf gene sequencing was used for contradicting identifications. All 171 strains (13 species) grew on MSA and fermented mannitol. All tested strains of S. epidermidis, S. haemolyticus, S. nepalensis, S. pasteuri, S. sciuri, S. warneri, S. xylosus, and S. capitis were correctly identified by MALDI-TOF, while variable identification was observed for S. saprophyticus and S. cohnii (90%, 81%). There was low identification of S. arlettae (14%), while all strains of S. kloosii and S. gallinarum were misidentified. S. gallinarum was absent from the MALDI-TOF database at the time of this study. All tested strains of S. epidermidis, S. gallinarum, S. haemolyticus, S. sciuri, S. warneri, S. xylosus and S. capitis were correctly identified by ViTEK, while variable identification was observed for S. saprophyticus, S. arlettae, S. cohnii, and S. kloosii (84%, 86%, 75%, 60%), and S. nepalensis and S. pasteuri were misidentified. Partial sequencing of the 16S rRNA gene was used as the gold standard for most strains except S. capitis and S. xylosus, which were misidentified by partial 16S rRNA sequencing, contrary to the MALDI-TOF and ViTEK identifications. Tuf gene sequencing was used for correct identification. Characteristic growth of CoNS on MSA is identical to that of S. aureus, and therefore MSA could not differentiate between S. aureus and CoNS. The percentage accuracy of ViTEK was better than that of MALDI-TOF in identification of CoNS.
Although partial sequencing of 16S rRNA gene was used as gold standard in this study, it could not correctly identify S. capitis and S. xylosus. Copyright © 2017 Elsevier Ltd. All rights reserved.
Neupane, Achal; Nepal, Madhav P; Benson, Benjamin V; MacArthur, Kenton J; Piya, Sarbottam
2013-01-01
Mitogen-Activated Protein Kinase (MAPK) genes encode proteins that mediate various signaling pathways associated with biotic and abiotic stress responses in eukaryotes. The MAPK genes form a 3-tier signal transduction cascade between cellular stimuli and physiological responses. Recent identification of soybean MAPKs and availability of genome sequences from other legume species allowed us to identify their MAPK genes. The main objectives of this study were to identify MAPKs in 3 legume species, Lotus japonicus, Medicago truncatula, and Phaseolus vulgaris, and to assess their phylogenetic relationships. We used comparative genomics approaches for MAPK gene identification and named the newly identified genes following the Arabidopsis MAPK nomenclature model. We identified 19, 18, and 15 MAPKs and 7, 4, and 9 MAPKKs in the genomes of Lotus japonicus, Medicago truncatula, and Phaseolus vulgaris, respectively. Within-clade placement of MAPKs and MAPKKs in the 3 legume species was consistent with that in soybean and Arabidopsis. Among 5 clades of MAPKs, 4 founder clades were consistent with MAPKs of other plant species, and orthologs of MAPK genes in the fifth clade, "Clade E", were consistent with those in soybean. Our results also indicated that some gene duplication events might have occurred prior to eudicot-monocot divergence. Highly diversified MAPKs in soybean relative to those in 3 other legume species are attributable to the polyploidization events in soybean. The identification of the MAPK genes in the legume species is important for legume crop improvement, and the evolutionary relationships and functional divergence of these gene members provide insights into plant genome evolution. PMID:24317362
Sánchez-Herrera, K; Sandoval, H; Mouniee, D; Ramírez-Durán, N; Bergeron, E; Boiron, P; Sánchez-Saucedo, N; Rodríguez-Nava, V
2017-09-01
Currently, the rrs gene encoding 16S rRNA is used as a reference method for bacterial identification and classification in the analysis of strains of the genus Nocardia. However, it does not have enough polymorphism to differentiate them at the species level, which makes it necessary to search for molecular targets that can provide better identification. The sod A gene (encoding the enzyme superoxide dismutase) has had good results in identifying species of other Actinomycetes. In this study the sod A gene is proposed for the identification and differentiation at the species level of the genus Nocardia. We used 41 type species of various collections; a 386 bp fragment of the sod A gene was amplified and sequenced, and a phylogenetic analysis was performed comparing the genes rrs (1171 bp), hsp 65 (401 bp), sec A1 (494 bp), gyr B (1195 bp) and rpo B (401 bp). The sequences were aligned using the Clustal X program. Evolutionary trees according to the neighbour-joining method were created with the programs Phylo_win and MEGA 6. The specific variability of the sod A gene of the genus Nocardia was analysed. A high phylogenetic resolution, significant genetic variability, and specificity and reliability were observed for the differentiation of the isolates at the species level. The sod A gene sequence contains polymorphic variable regions that allow the discrimination of closely related Nocardia species. Its clear specificity, despite its small size, proves to be of great advantage for use in taxonomic studies and clinical diagnosis of the genus Nocardia.
Optimal Scaling of Digital Transcriptomes
Glusman, Gustavo; Caballero, Juan; Robinson, Max; Kutlu, Burak; Hood, Leroy
2013-01-01
Deep sequencing of transcriptomes has become an indispensable tool for biology, enabling expression levels for thousands of genes to be compared across multiple samples. Since transcript counts scale with sequencing depth, counts from different samples must be normalized to a common scale prior to comparison. We analyzed fifteen existing and novel algorithms for normalizing transcript counts, and evaluated the effectiveness of the resulting normalizations. For this purpose we defined two novel and mutually independent metrics: (1) the number of “uniform” genes (genes whose normalized expression levels have a sufficiently low coefficient of variation), and (2) low Spearman correlation between normalized expression profiles of gene pairs. We also define four novel algorithms, one of which explicitly maximizes the number of uniform genes, and compared the performance of all fifteen algorithms. The two most commonly used methods (scaling to a fixed total value, or equalizing the expression of certain ‘housekeeping’ genes) yielded particularly poor results, surpassed even by normalization based on randomly selected gene sets. Conversely, seven of the algorithms approached what appears to be optimal normalization. Three of these algorithms rely on the identification of “ubiquitous” genes: genes expressed in all the samples studied, but never at very high or very low levels. We demonstrate that these include a “core” of genes expressed in many tissues in a mutually consistent pattern, which is suitable for use as an internal normalization guide. The new methods yield robustly normalized expression values, which is a prerequisite for the identification of differentially expressed and tissue-specific genes as potential biomarkers. PMID:24223126
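The first metric in the abstract above (the number of "uniform" genes, i.e. genes whose normalized levels have a low coefficient of variation) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy counts, and the CV threshold of 0.1 are assumptions chosen for illustration, and the normalization shown is simple scaling to a fixed total, which the abstract reports as performing poorly.

```python
import statistics

def scale_to_total(counts, target=1_000_000):
    """Scale one sample's gene counts so that they sum to `target`."""
    total = sum(counts.values())
    return {gene: c * target / total for gene, c in counts.items()}

def count_uniform_genes(samples, cv_threshold=0.1):
    """Count genes whose normalized levels have CV <= cv_threshold across samples.

    The threshold is an illustrative assumption, not a value from the paper.
    """
    genes = set.intersection(*(set(s) for s in samples))
    uniform = 0
    for gene in genes:
        values = [s[gene] for s in samples]
        mean = statistics.mean(values)
        # Coefficient of variation = population standard deviation / mean.
        if mean > 0 and statistics.pstdev(values) / mean <= cv_threshold:
            uniform += 1
    return uniform

# Hypothetical raw counts for three samples sequenced at different depths.
samples = [
    {"geneA": 100, "geneB": 50, "geneC": 850},
    {"geneA": 210, "geneB": 90, "geneC": 1700},
    {"geneA": 95, "geneB": 400, "geneC": 505},
]
normalized = [scale_to_total(s) for s in samples]
print(count_uniform_genes(normalized))
```

A normalization that maximizes this count directly, as one of the paper's novel algorithms is said to do, would search over per-sample scaling factors rather than fixing them from the totals.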
Jin, Feng-Jie; Katayama, Takuya; Maruyama, Jun-Ichi; Kitamoto, Katsuhiko
2016-11-01
Genomic mapping of mutations using next-generation sequencing technologies has facilitated the identification of genes contributing to fundamental biological processes, including human diseases. However, few studies have used this approach to identify mutations contributing to heterologous protein production in industrial strains of filamentous fungi, such as Aspergillus oryzae. In a screening of A. oryzae strains that hyper-produce human lysozyme (HLY), we previously isolated an AUT1 mutant that showed higher production of various heterologous proteins; however, the underlying factors contributing to the increased heterologous protein production remained unclear. Here, using a comparative genomic approach performed with whole-genome sequences, we attempted to identify the genes responsible for the high-level production of heterologous proteins in the AUT1 mutant. The comparative sequence analysis led to the detection of a gene (AO090120000003), designated autA, which was predicted to encode an unknown cytoplasmic protein containing an alpha/beta-hydrolase fold domain. Mutation or deletion of autA was associated with higher production levels of HLY. Specifically, the HLY yields of the autA mutant and deletion strains were twofold higher than that of the control strain during the early stages of cultivation. Taken together, these results indicate that combining classical mutagenesis approaches with comparative genomic analysis facilitates the identification of novel genes involved in heterologous protein production in filamentous fungi.
Evidence for the importance of personalized molecular profiling in pancreatic cancer.
Lili, Loukia N; Matyunina, Lilya V; Walker, L DeEtte; Daneker, George W; McDonald, John F
2014-03-01
There is a growing body of evidence that targeted gene therapy holds great promise for the future treatment of cancer. A crucial step in this therapy is the accurate identification of appropriate candidate genes/pathways for targeted treatment. One approach is to identify variant genes/pathways that are significantly enriched in groups of afflicted individuals relative to control subjects. However, if there are multiple molecular pathways to the same cancer, the molecular determinants of the disease may be heterogeneous among individuals and possibly go undetected by group analyses. In an effort to explore this question in pancreatic cancer, we compared the most significantly differentially expressed genes/pathways between cancer and control patient samples as determined by group versus personalized analyses. We found little to no overlap between genes/pathways identified by gene expression profiling using group analyses relative to those identified by personalized analyses. Our results indicate that personalized and not group molecular profiling is the most appropriate approach for the identification of putative candidates for targeted gene therapy of pancreatic and perhaps other cancers with heterogeneous molecular etiology.
Theodorus H. de Koker; Philip J. Kersten
2002-01-01
The recent sequencing of the Phanerochaete chrysosporium genome presents many opportunities, including the possibility of rapidly correlating specific wood decay proteins of the fungus with the corresponding gene sequences. Here we compare mass fragments of trypsin digests, determined by MALDI-MS (Matrix Assisted Laser Desorption Ionization-Mass Spectrometry), with...
Lessons learned from gene identification studies in Mendelian epilepsy disorders
Hardies, Katia; Weckhuysen, Sarah; De Jonghe, Peter; Suls, Arvid
2016-01-01
Next-generation sequencing (NGS) technologies are now routinely used for gene identification in Mendelian disorders. Setting up cost-efficient NGS projects and managing the large number of variants remains, however, a challenging job. Here we provide insights into the decision-making processes before and after the use of NGS in gene identification studies. Genetic factors are thought to have a role in ~70% of all epilepsies, and a variety of inheritance patterns have been described for seizure-associated gene defects. We therefore chose epilepsy as a disease model and selected 35 NGS studies that focused on patients with a Mendelian epilepsy disorder. The strategies used for gene identification and their respective outcomes were reviewed. High-throughput NGS strategies have led to the identification of several new epilepsy-causing genes, enlarging our knowledge of both known and novel pathomechanisms. NGS findings have furthermore extended the awareness of phenotypical and genetic heterogeneity. By discussing recent studies we illustrate: (I) the power of NGS for gene identification in Mendelian disorders, (II) the accelerating pace at which this field evolves, and (III) the considerations that have to be made when performing NGS studies. Despite the enormous rise in gene discovery over the last decade, many patients and families included in gene identification studies still remain without a molecular diagnosis; hence, further genetic research is warranted. On the basis of successful NGS studies in epilepsy, we discuss general approaches to guide human geneticists and clinicians in setting up cost-efficient gene identification NGS studies. PMID:26603999
Hinić, V; Straub, C; Schultheiss, E; Kaempfer, P; Frei, R; Goldenberger, D
2013-07-01
Little is known about the clinical significance and laboratory diagnosis of Actinomyces funkei. In this report we describe six clinical cases where A. funkei was isolated from purulent, polymicrobial infections. Conventional identification procedures were compared with molecular methods including matrix-assisted laser desorption/ionization time-of-flight mass spectrometry technique. Analysis of the full 16S rRNA gene sequence of the six investigated strains revealed differences from the A. funkei type strain. DNA-DNA hybridization showed that the clinical strains represent a novel 16S rRNA gene variant within the species of A. funkei. © 2013 The Authors Clinical Microbiology and Infection © 2013 European Society of Clinical Microbiology and Infectious Diseases.
Asif, Siddiqui M; Asad, Amir; Faizan, Ahmad; Anjali, Malik S; Arvind, Arya; Neelesh, Kapoor; Hirdesh, Kumar; Sanjay, Kumar
2009-12-31
Mycobacterium tuberculosis is the causative agent of tuberculosis, and H37Rv is its most studied clinical strain. We used comparative genome analysis of Mycobacterium tuberculosis H37Rv and human to identify a dataset of potential targets. We used DEG (Database of Essential Genes) to identify essential genes in the H37Rv strain. The analysis shows that 628 of the 3989 genes in Mycobacterium tuberculosis H37Rv are essential, of which 324 lack similarity to the human genome. Hypothetical proteins were subsequently removed through manual curation, resulting in a dataset of 135 proteins with essential function and no homology to human.
Nasr Esfahani, Bahram; Rezaei Yazdi, Hadi; Moghim, Sharareh; Ghasemian Safaei, Hajieh; Zarkesh Esfahani, Hamid
2012-11-01
Rapid and accurate identification of mycobacteria isolates from primary culture is important for timely and appropriate antibiotic therapy. Conventional methods for identification of Mycobacterium species based on biochemical tests need several weeks and may remain inconclusive. In this study, a novel multiplex real-time PCR was developed for rapid identification of the Mycobacterium genus, Mycobacterium tuberculosis complex (MTC) and the most common non-tuberculosis mycobacteria species, including M. abscessus, M. fortuitum, M. avium complex, M. kansasii, and M. gordonae, in three reaction tubes but under the same PCR conditions. Genetic targets for primer design included the 16S rDNA gene, the dnaJ gene, the gyrB gene and the internal transcribed spacer (ITS). Multiplex real-time PCR was set up with reference Mycobacterium strains and was subsequently tested with 66 clinical isolates. Results of multiplex real-time PCR were analyzed with melting curves, and the melting temperatures (Tm) of the Mycobacterium genus, MTC, and each of the non-tuberculosis Mycobacterium species were determined. Multiplex real-time PCR results were compared with amplification and sequencing of the 16S-23S rDNA ITS for identification of Mycobacterium species. Sensitivity and specificity of the designed primers were each 100% for MTC, M. abscessus, M. fortuitum, M. avium complex, M. kansasii, and M. gordonae. Sensitivity and specificity of the designed primers for the genus Mycobacterium were 96% and 100%, respectively. According to the obtained results, we conclude that this multiplex real-time PCR with melting curve analysis and these novel primers can be used for rapid and accurate identification of the genus Mycobacterium, MTC, and the most common non-tuberculosis Mycobacterium species.
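For readers unfamiliar with how sensitivity and specificity figures such as those above are derived, the following minimal sketch computes both from a confusion table against a reference method. The counts are hypothetical and are not taken from the study; they were chosen only so that the arithmetic yields 96% and 100%.

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) from confusion-table counts.

    tp/fn: reference-positive isolates called positive/negative by the assay.
    tn/fp: reference-negative isolates called negative/positive by the assay.
    """
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one reference-positive and one reference-negative isolate")
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return sensitivity, specificity

# Hypothetical counts (not study data): 25 reference-positive and 10 reference-negative isolates.
sens, spec = sensitivity_specificity(tp=24, fn=1, tn=10, fp=0)
print(f"sensitivity={sens:.0%}, specificity={spec:.0%}")
```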
Impact of sequencing depth and read length on single cell RNA sequencing data of T cells.
Rizzetto, Simone; Eltahla, Auda A; Lin, Peijie; Bull, Rowena; Lloyd, Andrew R; Ho, Joshua W K; Venturi, Vanessa; Luciani, Fabio
2017-10-06
Single cell RNA sequencing (scRNA-seq) provides great potential in measuring the gene expression profiles of heterogeneous cell populations. In immunology, scRNA-seq allowed the characterisation of transcript sequence diversity of functionally relevant T cell subsets, and the identification of the full length T cell receptor (TCRαβ), which defines the specificity against cognate antigens. Several factors, e.g. RNA library capture, cell quality, and sequencing output, affect the quality of scRNA-seq data. We studied the effects of read length and sequencing depth on the quality of gene expression profiles, cell type identification, and TCRαβ reconstruction, utilising 1,305 single cells from 8 publicly available scRNA-seq datasets, and simulation-based analyses. Gene expression was characterised by an increased number of unique genes identified with short read lengths (<50 bp), but these featured higher technical variability compared to profiles from longer reads. Successful TCRαβ reconstruction was achieved for 6 datasets (81% - 100%) with at least 0.25 million paired-end (PE) reads of length >50 bp, while it failed for datasets with <30 bp reads. Sufficient read length and sequencing depth can control technical noise to enable accurate identification of TCRαβ and gene expression profiles from scRNA-seq data of T cells.
Falade, Mofolusho O.; Opene, Anthony J.; Benson, Otarigho
2016-01-01
DNA barcoding has been adopted as a gold standard rapid, precise and unifying identification system for animal species and provides a database of genetic sequences that can be used as a tool for universal species identification. In this study, we employed mitochondrial genes 16S rRNA (16S) and cytochrome oxidase subunit I (COI) for the identification of some Nigerian freshwater catfish and Tilapia species. Approximately 655 bp were amplified from the 5′ region of the mitochondrial cytochrome C oxidase subunit I (COI) gene whereas 570 bp were amplified for the 16S rRNA gene. Nucleotide divergences among sequences were estimated based on Kimura 2-parameter distances and the genetic relationships were assessed by constructing phylogenetic trees using the neighbour-joining (NJ) and maximum likelihood (ML) methods. Analyses of consensus barcode sequences for each species, and alignment of individual sequences from within a given species revealed highly consistent barcodes (99% similarity on average), which could be compared with deposited sequences in public databases. The nucleotide distance between species belonging to different genera based on COI ranged from 0.17% between Sarotherodon melanotheron and Coptodon zillii to 0.49% between Clarias gariepinus and C. zillii, indicating that S. melanotheron and C. zillii are closely related. Based on the data obtained, the utility of COI gene was confirmed in accurate identification of three fish species from Southwest Nigeria. PMID:27990256
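The Kimura 2-parameter (K2P) distance mentioned above has a closed form: with P the proportion of transitions and Q the proportion of transversions between two aligned sequences, d = -1/2 ln((1 - 2P - Q) sqrt(1 - 2Q)). A minimal sketch follows; the function name and toy sequences are illustrative assumptions, and the sketch simply skips alignment columns containing gaps or ambiguous bases, which may differ from how the study's software treated them.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}

def k2p_distance(seq1, seq2):
    """Kimura 2-parameter distance between two aligned DNA sequences."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    # Keep only columns where both sequences have an unambiguous base.
    pairs = [(a, b) for a, b in zip(seq1.upper(), seq2.upper())
             if a in "ACGT" and b in "ACGT"]
    if not pairs:
        raise ValueError("no comparable sites")
    # Transitions are purine<->purine or pyrimidine<->pyrimidine changes.
    transitions = sum(1 for a, b in pairs
                      if a != b and ({a, b} <= PURINES or {a, b} <= PYRIMIDINES))
    transversions = sum(1 for a, b in pairs if a != b) - transitions
    p = transitions / len(pairs)
    q = transversions / len(pairs)
    # math.log raises ValueError for saturated comparisons (non-positive argument).
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))

# Toy example: one transition (A<->G) in ten sites.
print(round(k2p_distance("AAAAAAAAAA", "GAAAAAAAAA"), 4))
```

Because the correction grows with the observed difference, K2P distances are always at least as large as the raw proportion of differing sites.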
Identification of essential genes and synthetic lethal gene combinations in Escherichia coli K-12.
Mori, Hirotada; Baba, Tomoya; Yokoyama, Katsushi; Takeuchi, Rikiya; Nomura, Wataru; Makishi, Kazuichi; Otsuka, Yuta; Dose, Hitomi; Wanner, Barry L
2015-01-01
Here we describe the systematic identification of single genes and gene pairs whose knockout causes lethality in Escherichia coli K-12. During construction of a precise single-gene knockout library of E. coli K-12, we identified 328 essential gene candidates for growth in complex (LB) medium. Upon establishment of the Keio single-gene deletion library, we undertook the development of the ASKA single-gene deletion library carrying a different antibiotic resistance. In addition, we developed tools for identification of synthetic lethal gene combinations by systematic construction of double-gene knockout mutants. We introduce these methods herein.
Huis, Rudy; Hawkins, Simon; Neutelings, Godfrey
2010-04-19
Quantitative real-time PCR (qRT-PCR) is currently the most accurate method for detecting differential gene expression. Such an approach depends on the identification of uniformly expressed 'housekeeping genes' (HKGs). Extensive transcriptomic data mining and experimental validation in different model plants have shown that the reliability of these endogenous controls can be influenced by the plant species, growth conditions and organs/tissues examined. It is therefore important to identify the best reference genes to use in each biological system before using qRT-PCR to investigate differential gene expression. In this paper we evaluate different candidate HKGs for developmental transcriptomic studies in the economically-important flax fiber- and oil-crop (Linum usitatissimum L.). Specific primers were designed in order to quantify the expression levels of 20 different potential housekeeping genes in flax roots, internal- and external-stem tissues, leaves and flowers at different developmental stages. After calculations of PCR efficiencies, 13 HKGs were retained and their expression stabilities evaluated by the computer algorithms geNorm and NormFinder. According to geNorm, 2 Transcriptional Elongation Factors (TEFs) and 1 Ubiquitin gene are necessary for normalizing gene expression when all studied samples are considered. However, only 2 TEFs are required for normalizing expression in stem tissues. In contrast, NormFinder identified glyceraldehyde-3-phosphate dehydrogenase (GAPDH) as the most stably expressed gene when all samples were grouped together, as well as when samples were classed into different sub-groups. qRT-PCR was then used to investigate the relative expression levels of two splice variants of the flax LuMYB1 gene (homologue of AtMYB59). LuMYB1-1 and LuMYB1-2 were highly expressed in the internal stem tissues as compared to outer stem tissues and other samples.
This result was confirmed with both geNorm-designated- and NormFinder-designated-reference genes. The use of 2 different statistical algorithms results in the identification of different combinations of flax HKGs for expression data normalization. Despite such differences, the use of geNorm-designated- and NormFinder-designated-reference genes enabled us to accurately compare the expression levels of a flax MYB gene in different organs and tissues. Our identification and validation of suitable flax HKGs will facilitate future developmental transcriptomic studies in this economically-important plant.
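As background for the geNorm ranking used above, the sketch below computes the geNorm-style stability measure M: for each candidate gene, the mean over all other candidates of the standard deviation, across samples, of the log2 expression ratio. Lower M indicates more stable expression. This is an illustrative reimplementation under stated assumptions, not the geNorm software itself; the gene names and relative quantities are hypothetical, and the stepwise exclusion of the least stable gene that geNorm also performs is omitted.

```python
import math
import statistics

def genorm_m(expression):
    """Return {gene: M} for a dict mapping gene -> list of relative quantities.

    All lists must have the same length (one value per sample) and hold positive values.
    """
    genes = list(expression)
    m_values = {}
    for j in genes:
        pairwise_sd = []
        for k in genes:
            if k == j:
                continue
            # Pairwise variation: SD of log2 ratios between gene j and gene k across samples.
            log_ratios = [math.log2(a / b)
                          for a, b in zip(expression[j], expression[k])]
            pairwise_sd.append(statistics.stdev(log_ratios))
        m_values[j] = statistics.mean(pairwise_sd)
    return m_values

# Hypothetical relative quantities in three samples (not data from the study).
expression = {
    "refA": [1.0, 2.0, 4.0],
    "refB": [2.0, 4.0, 8.0],      # constant ratio to refA, so the pair co-varies perfectly
    "variable": [1.0, 8.0, 1.0],  # fluctuates independently of the other two
}
m = genorm_m(expression)
for gene, value in sorted(m.items(), key=lambda item: item[1]):
    print(gene, round(value, 3))
```

Because M is built from pairwise ratios, two co-regulated genes can appear stable together, which is one reason geNorm and NormFinder may rank candidates differently, as the abstract reports.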
Dolan, Liam; Langdale, Jane A.
2015-01-01
Real-time quantitative polymerase chain reaction (qPCR) has become widely used as a method to compare gene transcript levels across different conditions. However, selection of suitable reference genes to normalize qPCR data is required for accurate transcript level analysis. Recently, Marchantia polymorpha has been adopted as a model for the study of liverwort development and land plant evolution. Identification of appropriate reference genes has therefore become a necessity for gene expression studies. In this study, transcript levels of eleven candidate reference genes have been analyzed across a range of biological contexts that encompass abiotic stress, hormone treatment and different developmental stages. The consistency of transcript levels was assessed using both geNorm and NormFinder algorithms, and a consensus ranking of the different candidate genes was then obtained. MpAPT and MpACT showed relatively constant transcript levels across all conditions tested whereas the transcript levels of other candidate genes were clearly influenced by experimental conditions. By analyzing transcript levels of phosphate and nitrate starvation reporter genes, we confirmed that MpAPT and MpACT are suitable reference genes in M. polymorpha and also demonstrated that normalization with an inappropriate gene can lead to erroneous analysis of qPCR data. PMID:25798897
Computational Identification of Novel Genes: Current and Future Perspectives.
Klasberg, Steffen; Bitard-Feildel, Tristan; Mallet, Ludovic
2016-01-01
While it has long been thought that all genomic novelties are derived from existing material, many genes lacking homology to known genes were found in recent genome projects. Some of these novel genes were proposed to have evolved de novo, i.e., out of noncoding sequences, whereas some have been shown to follow a duplication and divergence process. Their discovery called for an extension of the historical hypotheses about gene origination. Besides the theoretical breakthrough, increasing evidence accumulated that novel genes play important roles in evolutionary processes, including adaptation and speciation events. Different techniques are available to identify genes and classify them as novel. Their classification as novel is usually based on their similarity to known genes, or lack thereof, detected by comparative genomics or against databases. Computational approaches are further prime methods that can be based on existing models or can leverage biological evidence from experiments. Identification of novel genes remains, however, a challenging task. With constant software and technology updates, no gold standard, and no available benchmark, evaluation and characterization of genomic novelty is a vibrant field. In this review, the classical and state-of-the-art tools for gene prediction are introduced. The current methods for novel gene detection are presented; the methodological strategies and their limits are discussed along with perspective approaches for further studies.
Grijseels, Sietske; Nielsen, Jens Christian; Randelovic, Milica; Nielsen, Jens; Nielsen, Kristian Fog; Workman, Mhairi; Frisvad, Jens Christian
2016-10-14
A new soil-borne species belonging to the Penicillium section Canescentia is described, Penicillium arizonense sp. nov. (type strain CBS 141311T = IBT 12289T). The genome was sequenced and assembled into 33.7 Mb containing 12,502 predicted genes. A phylogenetic assessment based on marker genes confirmed the grouping of P. arizonense within section Canescentia. Compared to related species, P. arizonense proved to encode a high number of proteins involved in carbohydrate metabolism, in particular hemicellulases. Mining the genome for genes involved in secondary metabolite biosynthesis resulted in the identification of 62 putative biosynthetic gene clusters. Extracts of P. arizonense were analysed for secondary metabolites and austalides, pyripyropenes, tryptoquivalines, fumagillin, pseurotin A, curvulinic acid and xanthoepocin were detected. A comparative analysis against known pathways enabled the proposal of biosynthetic gene clusters in P. arizonense responsible for the synthesis of all detected compounds except curvulinic acid. The capacity to produce biomass degrading enzymes and the identification of a high chemical diversity in secreted bioactive secondary metabolites, offers a broad range of potential industrial applications for the new species P. arizonense. The description and availability of the genome sequence of P. arizonense, further provides the basis for biotechnological exploitation of this species. PMID:27739446
Validation of MIMGO: a method to identify differentially expressed GO terms in a microarray dataset
2012-01-01
Background We previously proposed an algorithm for the identification of GO terms that commonly annotate genes whose expression is upregulated or downregulated in some microarray data compared with in other microarray data. We call these “differentially expressed GO terms” and have named the algorithm “matrix-assisted identification method of differentially expressed GO terms” (MIMGO). MIMGO can also identify microarray data in which genes annotated with a differentially expressed GO term are upregulated or downregulated. However, MIMGO has not yet been validated on a real microarray dataset using all available GO terms. Findings We combined Gene Set Enrichment Analysis (GSEA) with MIMGO to identify differentially expressed GO terms in a yeast cell cycle microarray dataset. GSEA followed by MIMGO (GSEA + MIMGO) correctly identified (p < 0.05) microarray data in which genes annotated to differentially expressed GO terms are upregulated. We found that GSEA + MIMGO was slightly less effective than, or comparable to, GSEA (Pearson), a method that uses Pearson’s correlation as a metric, at detecting true differentially expressed GO terms. However, unlike other methods including GSEA (Pearson), GSEA + MIMGO can comprehensively identify the microarray data in which genes annotated with a differentially expressed GO term are upregulated or downregulated. Conclusions MIMGO is a reliable method to identify differentially expressed GO terms comprehensively. PMID:23232071
Sibout, Richard; Proost, Sebastian; Hansen, Bjoern Oest; Vaid, Neha; Giorgi, Federico M; Ho-Yue-Kuang, Severine; Legée, Frédéric; Cézart, Laurent; Bouchabké-Coussa, Oumaya; Soulhat, Camille; Provart, Nicholas; Pasha, Asher; Le Bris, Philippe; Roujol, David; Hofte, Herman; Jamet, Elisabeth; Lapierre, Catherine; Persson, Staffan; Mutwil, Marek
2017-08-01
While Brachypodium distachyon (Brachypodium) is an emerging model for grasses, no expression atlas or gene coexpression network is available. Such tools are of high importance to provide insights into the function of Brachypodium genes. We present a detailed Brachypodium expression atlas, capturing gene expression in its major organs at different developmental stages. The data were integrated into a large-scale coexpression database (www.gene2function.de), enabling identification of duplicated pathways and conserved processes across 10 plant species, thus allowing genome-wide inference of gene function. We highlight the importance of the atlas and the platform through the identification of duplicated cell wall modules, and show that a lignin biosynthesis module is conserved across angiosperms. We identified and functionally characterised a putative ferulate 5-hydroxylase gene through overexpression of it in Brachypodium, which resulted in an increase in lignin syringyl units and reduced lignin content of mature stems, and led to improved saccharification of the stem biomass. Our Brachypodium expression atlas thus provides a powerful resource to reveal functionally related genes, which may advance our understanding of important biological processes in grasses. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.
USDA-ARS's Scientific Manuscript database
To better understand the olfactory mechanism in the rice leaf folder, Cnaphalocrocis medinalis (Guenée), one of the most serious insect pests of rice in Asia, we have established six partial transcriptomes from antennae, tarsus, and reproductive organs of male and female adults. A total of 102 genes...
Identification of Reference Genes for RT-qPCR Data Normalization in Cannabis sativa Stem Tissues.
Mangeot-Peter, Lauralie; Legay, Sylvain; Hausman, Jean-Francois; Esposito, Sergio; Guerriero, Gea
2016-09-15
Gene expression profiling via quantitative real-time PCR is a robust technique widely used in the life sciences to compare gene expression patterns in, e.g., different tissues, growth conditions, or after specific treatments. In the field of plant science, real-time PCR is the gold standard to study the dynamics of gene expression and is used to validate the results generated with high throughput techniques, e.g., RNA-Seq. An accurate relative quantification of gene expression relies on the identification of appropriate reference genes, which need to be determined for each experimental set-up used and plant tissue studied. Here, we identify suitable reference genes for expression profiling in stems of textile hemp (Cannabis sativa L.), whose tissues (isolated bast fibres and core) are characterized by remarkable differences in cell wall composition. We additionally validate the reference genes by analysing the expression of putative candidates involved in the non-oxidative phase of the pentose phosphate pathway and in the first step of the shikimate pathway. The goal is to describe the possible regulation pattern of some genes involved in the provision of the precursors needed for lignin biosynthesis in the different hemp stem tissues. The results shown here are useful to design future studies focused on gene expression analyses in hemp.
Nam, Seungyoon
2017-04-01
Cancer transcriptome analysis is one of the leading areas of Big Data science, biomarker, and pharmaceutical discovery, not to forget personalized medicine. Yet, cancer transcriptomics and postgenomic medicine require innovation in bioinformatics as well as comparison of the performance of available algorithms. In this data analytics context, the value of network generation and algorithms has been widely underscored for addressing the salient questions in cancer pathogenesis. Analysis of the cancer transcriptome often results in complicated networks where identification of network modularity remains critical, for example, in delineating the "druggable" molecular targets. Network clustering is useful, but depends on the network topology in and of itself. Notably, the performance of different network-generating tools for network cluster (NC) identification has been little investigated to date. Hence, using gastric cancer (GC) transcriptomic datasets, we compared two algorithms for generating pathway versus gene regulatory network-based NCs, showing that the pathway-based approach better agrees with a reference set of cancer-functional contexts. Finally, by applying pathway-based NC identification to GC transcriptome datasets, we describe cancer NCs that associate with candidate therapeutic targets and biomarkers in GC. These observations collectively inform future research on cancer transcriptomics, drug discovery, and rational development of new analysis tools for optimal harnessing of omics data.
Pandey, Ravi S; Saxena, Garima; Bhattacharya, Debashish; Qiu, Huan; Azad, Rajeev K
2017-02-01
Identification of horizontal gene transfers (HGTs) has primarily relied on phylogenetic tree based methods, which require a rich sampling of sequenced genomes to ensure a reliable inference. Because the success of phylogenetic approaches depends on the breadth and depth of the database, researchers usually apply stringent filters to detect only the most likely gene transfers in the genomes of interest. One such study focused on a highly conservative estimate of trans-domain gene transfers in the extremophile eukaryote, Galdieria sulphuraria (Galdieri) Merola (Rhodophyta), by applying multiple filters in their phylogenetic pipeline. This led to the identification of 75 inter-domain acquisitions from Bacteria or Archaea. Because of the evolutionary, ecological, and potential biotechnological significance of foreign genes in algae, alternative approaches and pipelines complementing phylogenetics are needed for a more comprehensive assessment of HGT. We present here a novel pipeline that uncovered 17 novel foreign genes of prokaryotic origin in G. sulphuraria, results that are supported by multiple lines of evidence including composition-based, comparative data, and phylogenetics. These genes encode a variety of potentially adaptive functions, from metabolite transport to DNA repair. © 2016 Phycological Society of America.
The dog genome map and its use in mammalian comparative genomics.
Switonski, Marek; Szczerbal, Izabela; Nowacka, Joanna
2004-01-01
The dog genome organization was extensively studied in the last ten years. The most important achievements are the well-developed marker genome maps, including over 3200 marker loci, and a survey of the DNA genome sequence. This knowledge, along with the most advanced map of the human genome, turned out to be very useful in comparative genomic studies. On the one hand, it has promoted the development of marker genome maps of other species of the family Canidae (red fox, arctic fox, Chinese raccoon dog) as well as studies on the evolution of their karyotype. But the most important approach is the comparative analysis of human and canine hereditary diseases. At present, causative gene mutations are known for 30 canine hereditary diseases. A majority of them have human counterparts with similar clinical and molecular features. Studies on identification of genes having a major impact on some multifactorial diseases (hip dysplasia, epilepsy) and cancers (multifocal renal cystadenocarcinoma and nodular dermatofibrosis) are advanced. Very promising are the results of gene therapy for certain canine monogenic diseases (haemophilia, hereditary retinal dystrophy, mucopolysaccharidosis), which have human equivalents. The above-mentioned examples prove a very important model role of the dog in studies of human genetic diseases. On the other hand, the identification of gene mutations responsible for hereditary diseases has a substantial impact on breeding strategy in the dog.
[Hydrophidae identification through analysis on Cyt b gene barcode].
Liao, Li-xi; Zeng, Ke-wu; Tu, Peng-fei
2015-08-01
Hydrophidae, a valued traditional Chinese medicine, is generally preserved dry to prevent spoilage, but the drying process alters its appearance, making the species hard to identify by morphology. Identification through gene barcode analysis, a new technique in species identification, avoids this problem. The gene barcodes of 6 species of Hydrophidae, such as Lapemis hardwickii, were acquired through DNA extraction and gene sequencing. These barcodes were then aligned, and their identification efficiency was tested by BLAST. Our results revealed that the barcode sequences showed high identification efficiency, with an obvious difference between intra- and inter-species comparisons. Together, these results indicate that Cyt b DNA barcoding can confirm the identification of Hydrophidae.
2013-01-01
Background. The species of T. harzianum are well known for their biocontrol activity against many plant pathogens. However, there is a lack of studies concerning its use as a biological control agent against F. solani, a pathogen involved in several crop diseases. In this study, we have used suppression subtractive hybridization (SSH) and quantitative real-time PCR (RT-qPCR) techniques in order to explore changes in T. harzianum gene expression during growth on cell wall of F. solani (FSCW) or glucose. RT-qPCR was also used to examine the regulation of 18 genes, potentially involved in biocontrol, during confrontation between T. harzianum and F. solani. Results. Data obtained from two subtractive libraries were compared after annotation using the Blast2GO suite. A total of 417 and 78 readable EST sequences were annotated in the FSCW and glucose libraries, respectively. Functional annotation of these genes identified diverse biological processes and molecular functions required during T. harzianum growth on FSCW or glucose. We identified various genes of biotechnological value encoding proteins with functions such as transport, hydrolytic activity, adherence, appressorium development and pathogenesis. Fifteen genes were up-regulated and sixteen were down-regulated at at least one time point during growth of T. harzianum on FSCW. During the confrontation assay most of the genes were up-regulated, mainly after contact, when the interaction had been established. Conclusions. This study demonstrates that T. harzianum expressed different genes when grown on FSCW compared to glucose. It provides insights into the mechanisms of gene expression involved in mycoparasitism of T. harzianum against F. solani. The identification and evaluation of these genes may contribute to the development of an efficient biological control agent. PMID:23497274
Vieira, Pabline Marinho; Coelho, Alexandre Siqueira Guedes; Steindorff, Andrei Stecca; de Siqueira, Saulo José Linhares; Silva, Roberto do Nascimento; Ulhoa, Cirano José
2013-03-15
The species of T. harzianum are well known for their biocontrol activity against many plant pathogens. However, there is a lack of studies concerning its use as a biological control agent against F. solani, a pathogen involved in several crop diseases. In this study, we have used suppression subtractive hybridization (SSH) and quantitative real-time PCR (RT-qPCR) techniques in order to explore changes in T. harzianum gene expression during growth on cell wall of F. solani (FSCW) or glucose. RT-qPCR was also used to examine the regulation of 18 genes, potentially involved in biocontrol, during confrontation between T. harzianum and F. solani. Data obtained from two subtractive libraries were compared after annotation using the Blast2GO suite. A total of 417 and 78 readable EST sequences were annotated in the FSCW and glucose libraries, respectively. Functional annotation of these genes identified diverse biological processes and molecular functions required during T. harzianum growth on FSCW or glucose. We identified various genes of biotechnological value encoding proteins with functions such as transport, hydrolytic activity, adherence, appressorium development and pathogenesis. Fifteen genes were up-regulated and sixteen were down-regulated at at least one time point during growth of T. harzianum on FSCW. During the confrontation assay most of the genes were up-regulated, mainly after contact, when the interaction had been established. This study demonstrates that T. harzianum expressed different genes when grown on FSCW compared to glucose. It provides insights into the mechanisms of gene expression involved in mycoparasitism of T. harzianum against F. solani. The identification and evaluation of these genes may contribute to the development of an efficient biological control agent.
Blaschke, Anne J.; Heyrend, Caroline; Byington, Carrie L.; Fisher, Mark A.; Barker, Elizabeth; Garrone, Nicholas F.; Thatcher, Stephanie A.; Pavia, Andrew T.; Barney, Trenda; Alger, Garrison D.; Daly, Judy A.; Ririe, Kirk M.; Ota, Irene; Poritz, Mark A.
2012-01-01
Sepsis is a leading cause of death. Rapid and accurate identification of pathogens and antimicrobial resistance directly from blood culture could improve patient outcomes. The FilmArray® (FA; Idaho Technology, Inc., Salt Lake City, UT) Blood Culture (BC) panel can identify > 25 pathogens and 4 antibiotic resistance genes from positive blood cultures in 1 hour. We compared a development version of the panel to conventional culture and susceptibility testing on 102 archived blood cultures from adults and children with bacteremia. Of 109 pathogens identified by culture, 95% were identified by FA. Among 111 prospectively collected blood cultures, the FA identified 84 of 92 pathogens (91%) covered by the panel. Among 25 Staphylococcus aureus and 21 Enterococcus species detected, FA identified all culture-proven MRSA and VRE. The FA BC panel is an accurate method for the rapid identification of pathogens and resistance genes from blood culture. PMID:22999332
Matsuda, Mari; Iguchi, Shigekazu; Mizutani, Tomonori; Hiramatsu, Keiichi; Tega-Ishii, Michiru; Sansaka, Kaori; Negishi, Kenta; Shimada, Kimie; Umemura, Jun; Notake, Shigeyuki; Yanagisawa, Hideji; Yabusaki, Reiko; Araoka, Hideki; Yoneyama, Akiko
2017-01-01
Background. Early detection of Gram-positive bacteremia and timely appropriate antimicrobial therapy are required for decreasing patient mortality. The purpose of our study was to evaluate the performance of the Verigene Gram-positive blood culture assay (BC-GP) in two special healthcare settings and determine the potential impact of rapid blood culture testing for Gram-positive bacteremia within the Japanese healthcare delivery system. Furthermore, the study included simulated blood cultures, which included a library of well-characterized methicillin-resistant Staphylococcus aureus (MRSA) and vancomycin-resistant enterococci (VRE) isolates reflecting different geographical regions in Japan. Methods. A total of 347 BC-GP assays were performed on clinical and simulated blood cultures. BC-GP results were compared to results obtained by reference methods for genus/species identification and detection of resistance genes using molecular and MALDI-TOF MS methodologies. Results. For identification and detection of resistance genes at two clinical sites and simulated blood cultures, overall concordance of BC-GP with reference methods was 327/347 (94%). The time for identification and antimicrobial resistance detection by BC-GP was significantly shorter compared to routine testing, especially at the cardiology hospital, which does not offer clinical microbiology services on weekends and holidays. Conclusion. BC-GP generated accurate identification and detection of resistance markers compared with routine laboratory methods for Gram-positive organisms in specialized clinical settings, providing more rapid results than current routine testing. PMID:28316631
Singh, Sangeeta; Chand, Suresh; Singh, N. K.; Sharma, Tilak Raj
2015-01-01
The resistance (R) genes and defense response (DR) genes have become very important resources for the development of disease resistant cultivars. In the present investigation, genome-wide identification, expression, phylogenetic and synteny analysis was done for R and DR-genes across three species of rice viz: Oryza sativa ssp indica cv 93-11, Oryza sativa ssp japonica and wild rice species, Oryza brachyantha. We used the in silico approach to identify and map 786 R-genes and 167 DR-genes, 672 R-genes and 142 DR-genes, 251 R-genes and 86 DR-genes in the japonica, indica and O. brachyantha genomes, respectively. Our analysis showed that 60.5% and 55.6% of the R-genes are tandemly repeated within clusters and distributed over all the rice chromosomes in indica and japonica genomes, respectively. The phylogenetic analysis along with motif distribution shows a high degree of conservation of R- and DR-genes in clusters. In silico expression analysis of R-genes and DR-genes showed that more than 85% were expressed genes with corresponding EST matches in the databases. This study gave special emphasis to mechanisms of gene evolution and duplication for R and DR genes across species. Analysis of paralogs across rice species indicated 17% and 4.38% R-genes, 29% and 11.63% DR-genes duplication in indica and Oryza brachyantha, as compared to 20% and 26% duplication of R-genes and DR-genes in japonica respectively. We found that during the course of duplication only 9.5% of R- and DR-genes changed their function and the rest of the genes have maintained their identity. Syntenic relationships across the three genomes indicated that more orthology is shared between the indica and japonica genomes than with the brachyantha genome. Genome wide identification of R-genes and DR-genes in the rice genome will help in allele mining and functional validation of these genes, and in understanding the molecular mechanism of disease resistance and its evolution in rice and related species. PMID:25902056
Epilepsy genetics: the ongoing revolution.
Lesca, G; Depienne, C
2015-01-01
Epilepsies have long remained refractory to gene identification due to several obstacles, including a highly variable inter- and intrafamilial expressivity of the phenotypes, a high frequency of phenocopies, and a huge genetic heterogeneity. Recent technological breakthroughs, such as array comparative genomic hybridization and next generation sequencing, have led, in the past few years, to the identification of an increasing number of genomic regions and genes in which mutations or copy-number variations cause various epileptic disorders, revealing an enormous diversity of pathophysiological mechanisms. The field that has undergone the most striking revolution is that of epileptic encephalopathies, for which most of the causative genes have been discovered since the year 2012. Some examples are the continuous spike-and-waves during slow-wave sleep and Landau-Kleffner syndromes, for which the recent discovery of the role of GRIN2A mutations has finally confirmed the genetic bases. These new technologies are beginning to be used for diagnostic applications, and the main challenge now resides in the interpretation of the huge mass of variants detected by these methods. The identification of causative mutations in epilepsies provides definitive confirmation of the clinical diagnosis, allows accurate genetic counselling, and sometimes permits the development of new appropriate and specific antiepileptic therapies. Future challenges include the identification of the genetic or environmental factors that modify the epileptic phenotypes caused by mutations in a given gene and the understanding of the role of somatic mutations in sporadic epilepsies. Copyright © 2015 Elsevier Masson SAS. All rights reserved.
Costa, Fabrizio; Alba, Rob; Schouten, Henk; Soglio, Valeria; Gianfranceschi, Luca; Serra, Sara; Musacchi, Stefano; Sansavini, Silviero; Costa, Guglielmo; Fei, Zhangjun; Giovannoni, James
2010-10-25
Fruit development, maturation and ripening consist of a complex series of biochemical and physiological changes that in climacteric fruits, including apple and tomato, are coordinated by the gaseous hormone ethylene. These changes lead to final fruit quality, and understanding of the functional machinery underlying these processes is of both biological and practical importance. To date many reports have been made on the analysis of gene expression in apple. In this study we focused our investigation on the role of ethylene during apple maturation, specifically comparing transcriptomics of normal ripening with changes resulting from application of the hormone receptor competitor 1-methylcyclopropene. To gain insight into the molecular process regulating ripening in apple, and to compare to tomato (model species for ripening studies), we utilized both homologous and heterologous (tomato) microarrays to profile transcriptome dynamics of genes involved in fruit development and ripening, emphasizing those which are ethylene regulated. The use of both types of microarrays facilitated transcriptome comparison between apple and tomato (for the latter using data previously published and available at the TED: tomato expression database) and highlighted genes conserved during ripening of both species, which in turn represent a foundation for further comparative genomic studies. The cross-species analysis had the secondary aim of examining the efficiency of heterologous (specifically tomato) microarray hybridization for candidate gene identification as related to the ripening process. The resulting transcriptomics data revealed coordinated gene expression during fruit ripening of a subset of ripening-related and ethylene responsive genes, further facilitating the analysis of ethylene response during fruit maturation and ripening.
Our combined strategy based on microarray hybridization enabled transcriptome characterization during normal climacteric apple ripening, as well as definition of ethylene-dependent transcriptome changes. Comparison with tomato fruit maturation and ethylene responsive transcriptome activity facilitated identification of putative conserved orthologous ripening-related genes, which serve as an initial set of candidates for assessing conservation of gene activity across genomes of fruit bearing plant species.
Abdul-Redha, Rawaa Jalil; Kemp, Michael; Bangsborg, Jette M; Arpi, Magnus; Christensen, Jens Jørgen
2010-01-01
Streptococci, enterococci and Streptococcus-like bacteria are frequent etiologic agents of infective endocarditis, and correct species identification can be a laboratory challenge. Viridans streptococci (VS) not infrequently cause contamination of blood cultures. Vitek 2 and partial sequencing of the 16S rRNA gene were applied in order to compare the results of both methods. Strains originated from two groups of patients: 149 strains from patients with infective endocarditis and 181 strains assessed as blood culture contaminants. Of the 330 strains, based on partial 16S rRNA gene sequencing results, 251 (76%) were VS strains, 10 (3%) were pyogenic streptococcal strains, 54 (16%) were E. faecalis strains and 15 (5%) strains belonged to a group of miscellaneous catalase-negative, Gram-positive cocci. Among VS strains, 220 (87.6%) and 31 (12.3%) obtained agreeing and non-agreeing identifications, respectively, with the two methods with respect to allocation to the same VS group. Non-agreeing species identification mostly occurred among strains in the contaminant group, while for endocarditis strains notably fewer disagreeing results were observed. Only 67 of 150 mitis group strains obtained identical species identifications by the two methods. Most VS strains belonging to the salivarius, anginosus, and mutans groups obtained agreeing species identifications with the two methods, while this was the case for only 13 of the 21 bovis strains. Pyogenic strains (n=10), Enterococcus faecalis strains (n=54) and a miscellaneous group of catalase-negative, Gram-positive cocci (n=15) seemed well identified by both methods, except that disagreements in identifications in the miscellaneous group occurred for 6 of 15 strains.
High-throughput gene mapping in Caenorhabditis elegans.
Swan, Kathryn A; Curtis, Damian E; McKusick, Kathleen B; Voinov, Alexander V; Mapa, Felipa A; Cancilla, Michael R
2002-07-01
Positional cloning of mutations in model genetic systems is a powerful method for the identification of targets of medical and agricultural importance. To facilitate the high-throughput mapping of mutations in Caenorhabditis elegans, we have identified a further 9602 putative new single nucleotide polymorphisms (SNPs) between two C. elegans strains, Bristol N2 and the Hawaiian mapping strain CB4856, by sequencing inserts from a CB4856 genomic DNA library and using an informatics pipeline to compare sequences with the canonical N2 genomic sequence. When combined with data from other laboratories, our marker set of 17,189 SNPs provides even coverage of the complete worm genome. To date, we have confirmed >1099 evenly spaced SNPs (one every 91 +/- 56 kb) across the six chromosomes and validated the utility of our SNP marker set and new fluorescence polarization-based genotyping methods for systematic and high-throughput identification of genes in C. elegans by cloning several proprietary genes. We illustrate our approach by recombination mapping and confirmation of the mutation in the cloned gene, dpy-18.
Rusiniak, Michael E.; Kunnev, Dimiter; Freeland, Amy; Cady, Gillian K.; Pruitt, Steven C.
2011-01-01
Mini-chromosome maintenance (Mcm) proteins are part of the replication licensing complex that is loaded onto chromatin during the G1-phase of the cell cycle and required for initiation of DNA replication in the subsequent S-phase. Mcm proteins are typically loaded in excess of the number of locations that are utilized during S-phase. Nonetheless, partial depletion of Mcm proteins leads to cancers and stem cell deficiencies. Mcm2 deficient mice, on a 129Sv genetic background, display a high rate of thymic lymphoblastic lymphoma. Here array comparative genomic hybridization (aCGH) is utilized to characterize the genetic damage accruing in these tumors. The predominant events are deletions averaging less than 0.5 Mb, considerably shorter than observed in prior studies using alternative mouse lymphoma models or human tumors. Such deletions facilitate identification of specific genes and pathways responsible for the tumors. Mutations in many genes that have been implicated in human lymphomas are recapitulated in this mouse model. These features, and the fact that the mutation underlying the accelerated genetic damage does not target a specific gene or pathway a priori, are valuable features of this mouse model for identification of tumor suppressor genes. Genes affected in all tumors include Pten, Tcfe2a, Mbd3 and Setd1b. Notch1 and additional genes are affected in subsets of tumors. The high frequency of relatively short deletions is consistent with elevated recombination between nearby stalled replication forks in Mcm2 deficient mice. PMID:22158038
Livny, Jonathan; Zhou, Xiaohui; Mandlik, Anjali; Hubbard, Troy; Davis, Brigid M.; Waldor, Matthew K.
2014-01-01
Vibrio parahaemolyticus is the leading worldwide cause of seafood-associated gastroenteritis, yet little is known regarding its intraintestinal gene expression or physiology. To date, in vivo analyses have focused on identification and characterization of virulence factors—e.g. a crucial Type III secretion system (T3SS2)—rather than genome-wide analyses of in vivo biology. Here, we used RNA-Seq to profile V. parahaemolyticus gene expression in infected infant rabbits, which mimic human infection. Comparative transcriptomic analysis of V. parahaemolyticus isolated from rabbit intestines and from several laboratory conditions enabled identification of mRNAs and sRNAs induced during infection and of regulatory factors that likely control them. More than 12% of annotated V. parahaemolyticus genes are differentially expressed in the intestine, including the genes of T3SS2, which are likely induced by bile-mediated activation of the transcription factor VtrB. Our analyses also suggest that V. parahaemolyticus has access to glucose or other preferred carbon sources in vivo, but that iron is inconsistently available. The V. parahaemolyticus transcriptional response to in vivo growth is far more widespread than and largely distinct from that of V. cholerae, likely due to the distinct ways in which these diarrheal pathogens interact with and modulate the environment in the small intestine. PMID:25262354
Kothari, Ankita; Charrier, Marimikel; Wu, Yu -Wei; ...
2016-09-22
The hydrocarbonoclastic bacterium Acinetobacter venetianus RAG-1 has attracted substantial attention due to its powerful oil-degrading capabilities and its potential to play an important ecological role in the cleanup of alkanes. In this study, we compare the transcriptome of the strain RAG-1 grown in dodecane, the corresponding alkanol (dodecanol), and sodium acetate for the characterization of genes involved in dodecane uptake and utilization. Comparison of the transcriptional responses of RAG-1 grown on dodecane led to the identification of 1074 genes that were differentially expressed relative to sodium acetate. Of these, 622 genes were upregulated when grown in dodecane. The highly upregulated genes were involved in alkane catabolism, along with stress response. Our data suggest AlkMb to be primarily involved in dodecane oxidation. Transcriptional response of RAG-1 grown on dodecane relative to dodecanol also led to the identification of permease, outer membrane protein and thin fimbriae coding genes potentially involved in dodecane uptake. As a result, this study provides the first model for key genes involved in alkane uptake and metabolism in A. venetianus RAG-1.
Singh, Amarjeet; Kanwar, Poonam; Pandey, Amita; Tyagi, Akhilesh K.; Sopory, Sudhir K.; Kapoor, Sanjay; Pandey, Girdhar K.
2013-01-01
Background. Phospholipase C (PLC) is one of the major lipid hydrolysing enzymes, implicated in lipid mediated signaling. PLCs have been found to play a significant role in abiotic stress triggered signaling and developmental processes in various plant species. Genome wide identification and expression analysis have been carried out for this gene family in Arabidopsis, yet not much has been accomplished in the crop plant rice. Methodology/Principal Findings. An exhaustive in-silico exploration of the rice genome using various online databases and tools resulted in the identification of nine PLC encoding genes. Based on sequence, motif and phylogenetic analysis, the rice PLC gene family could be divided into phosphatidylinositol-specific PLCs (PI-PLCs) and phosphatidylcholine-PLCs (PC-PLC or NPC) classes with four and five members, respectively. A comparative analysis revealed that PLCs are conserved in Arabidopsis (dicots) and rice (monocot) at gene structure and protein level but they might have evolved through a separate evolutionary path. Transcript profiling using gene chip microarray and quantitative RT-PCR showed that most of the PLC members were expressed significantly and differentially under abiotic stresses (salt, cold and drought) and during various developmental stages with condition/stage specific and overlapping expression. This finding suggested an important role of different rice PLC members in abiotic stress triggered signaling and plant development, which was also supported by the presence of relevant cis-regulatory elements in their promoters. Sub-cellular localization of a few selected PLC members in Nicotiana benthamiana and onion epidermal cells has provided a clue about their site of action and functional behaviour. Conclusion/Significance. The genome wide identification, structural and expression analysis and knowledge of sub-cellular localization of the PLC gene family pave the way for the functional characterization of these genes in crop plants in the near future. PMID:23638098
Di Palma, Tina; Conti, Anna; de Cristofaro, Tiziana; Scala, Serena; Nitsch, Lucio; Zannini, Mariastella
2011-01-01
Background. The differentiation program of thyroid follicular cells (TFCs), by far the most abundant cell population of the thyroid gland, relies on the interplay between sequence-specific transcription factors and transcriptional coregulators with the basal transcriptional machinery of the cell. However, the molecular mechanisms leading to the fully differentiated thyrocyte are still the object of intense study. The transcription factor Pax8, a member of the Paired-box gene family, has been demonstrated to be a critical regulator required for proper development and differentiation of thyroid follicular cells. Although Pax8 is well characterized with respect to its role in regulating genes involved in thyroid differentiation, genomics approaches aiming at the identification of additional Pax8 targets are lacking and the biological pathways controlled by this transcription factor are largely unknown. Methodology/Principal Findings. To identify unique downstream targets of Pax8, we investigated the genome-wide effect of Pax8 silencing by comparing the transcriptome of silenced versus normal differentiated FRTL-5 thyroid cells. In total, 2815 genes were found to be modulated (induced or repressed) 72 h after Pax8 RNAi. Genes previously reported to be regulated by Pax8 in FRTL-5 cells were confirmed. In addition, novel target genes involved in functional processes such as DNA replication, anion transport, kinase activity, apoptosis and cellular processes were newly identified. Transcriptome analysis highlighted that Pax8 is a key molecule for thyroid morphogenesis and differentiation. Conclusions/Significance. This is the first large-scale study aimed at the identification of new genes regulated by Pax8, a master regulator of thyroid development and differentiation. The biological pathways and target genes controlled by Pax8 will have considerable importance for understanding thyroid disease progression as well as for setting up novel therapeutic strategies. PMID:21966443
Singh, Amarjeet; Kanwar, Poonam; Pandey, Amita; Tyagi, Akhilesh K; Sopory, Sudhir K; Kapoor, Sanjay; Pandey, Girdhar K
2013-01-01
Phospholipase C (PLC) is one of the major lipid hydrolysing enzymes, implicated in lipid mediated signaling. PLCs have been found to play a significant role in abiotic stress triggered signaling and developmental processes in various plant species. Genome wide identification and expression analysis have been carried out for this gene family in Arabidopsis, yet not much has been accomplished in the crop plant rice. An exhaustive in-silico exploration of the rice genome using various online databases and tools resulted in the identification of nine PLC encoding genes. Based on sequence, motif and phylogenetic analysis, the rice PLC gene family could be divided into phosphatidylinositol-specific PLC (PI-PLC) and phosphatidylcholine-PLC (PC-PLC or NPC) classes with four and five members, respectively. A comparative analysis revealed that PLCs are conserved in Arabidopsis (dicot) and rice (monocot) at the gene structure and protein level but that they might have evolved through separate evolutionary paths. Transcript profiling using gene chip microarray and quantitative RT-PCR showed that most of the PLC members were expressed significantly and differentially under abiotic stresses (salt, cold and drought) and during various developmental stages, with condition/stage specific and overlapping expression. This finding suggested an important role of different rice PLC members in abiotic stress triggered signaling and plant development, which was also supported by the presence of relevant cis-regulatory elements in their promoters. Sub-cellular localization of a few selected PLC members in Nicotiana benthamiana and onion epidermal cells has provided a clue about their site of action and functional behaviour. The genome wide identification, structural and expression analysis and knowledge of sub-cellular localization of the PLC gene family pave the way for the functional characterization of these genes in crop plants in the near future.
Yang, Chia-Chun; Andrews, Erik H; Chen, Min-Hsuan; Wang, Wan-Yu; Chen, Jeremy J W; Gerstein, Mark; Liu, Chun-Chi; Cheng, Chao
2016-08-12
Chromatin immunoprecipitation followed by massively parallel DNA sequencing (ChIP-seq) or microarray hybridization (ChIP-chip) has been widely used to determine the genomic occupation of transcription factors (TFs). We have previously developed a probabilistic method, called TIP (Target Identification from Profiles), to identify TF target genes using ChIP-seq/ChIP-chip data. To achieve high specificity, TIP applies a conservative method to estimate significance of target genes, with the trade-off being a relatively low sensitivity of target gene identification compared to other methods. Additionally, TIP's output does not render binding-peak locations or intensity, information highly useful for visualization and general experimental biological use, while the variability of ChIP-seq/ChIP-chip file formats has made input into TIP more difficult than desired. To improve upon these facets, here we present a refined TIP with key extensions. First, it implements a Gaussian mixture model for p-value estimation, increasing target gene identification sensitivity and more accurately capturing the shape of TF binding profile distributions. Second, it enables the incorporation of TF binding-peak data by identifying their locations in significant target gene promoter regions and quantifying their strengths. Finally, for full ease of implementation we have incorporated it into a web server (http://syslab3.nchu.edu.tw/iTAR/) that enables flexibility of input file format, can be used across multiple species and genome assembly versions, and is freely available for public use. The web server additionally performs GO enrichment analysis for the identified target genes to reveal the potential function of the corresponding TF. The iTAR web server provides a user-friendly interface and supports target gene identification in seven species, ranging from yeast to human.
To facilitate investigating the quality of ChIP-seq/ChIP-chip data, the web server generates the chart of the characteristic binding profiles and the density plot of normalized regulatory scores. The iTAR web server is a useful tool in identifying TF target genes from ChIP-seq/ChIP-chip data and discovering biological insights.
Jado, Isabel; Fenoll, Asunción; Casal, Julio; Pérez, Amalia
2001-01-01
The gene encoding the pneumococcal surface adhesin A (PsaA) protein has been identified in three different viridans group streptococcal species. Comparative studies of the psaA gene identified in different pneumococcal isolates by sequencing PCR products showed a high degree of conservation among these strains. PsaA is encoded by an open reading frame of 930 bp. The analysis of this fragment in Streptococcus mitis, Streptococcus oralis, and Streptococcus anginosus strains revealed a sequence identity of 95, 94, and 90%, respectively, to the corresponding open reading frame of the previously reported Streptococcus pneumoniae serotype 6B strain. Our results confirm that psaA is present and detectable in heterologous bacterial species. The possible implications of these results for the suitability and potential use of PsaA in the identification and diagnosis of pneumococcal diseases are discussed. PMID:11527799
Soetens, Oriane; De Bel, Annelies; Echahidi, Fedoua; Vancutsem, Ellen; Vandoorslaer, Kristof; Piérard, Denis
2012-01-01
The performance of matrix-assisted laser desorption–ionization time of flight mass spectrometry (MALDI-TOF MS) for species identification of Prevotella was evaluated and compared with 16S rRNA gene sequencing. Using a Bruker database, 62.7% of the 102 clinical isolates were identified to the species level and 73.5% to the genus level. Extension of the commercial database improved these figures to, respectively, 83.3% and 89.2%. MALDI-TOF MS identification of Prevotella is reliable but needs a more extensive database. PMID:22301022
Sun, Wenyue; Zhang, Kaitai; Zhang, Xinyu; Lei, Wendong; Xiao, Ting; Ma, Jinfang; Guo, Suping; Shao, Shujuan; Zhang, Husheng; Liu, Yan; Yuan, Jinsong; Hu, Zhi; Ma, Ying; Feng, Xiaoli; Hu, Songnian; Zhou, Jun; Cheng, Shujun; Gao, Yanning
2004-08-20
Lung cancer is one of the major causes of cancer-related deaths. Over the past decade, much has been learned about the molecular changes associated with lung carcinogenesis; however, our understanding of lung tumorigenesis is still incomplete. To identify genes that are differentially expressed in squamous cell carcinoma (SCC) of the lung, we compared the expression profiles of primarily cultured SCC tumor cells and bronchial epithelial cells derived from morphologically normal bronchial epithelium of the same patient. Using suppression subtractive hybridization (SSH), two cDNA libraries containing up- and down-regulated genes in the tumor cells were constructed, named LCTP and LCBP. The two libraries comprise 258 known genes and 133 unknown genes in total. The known up-regulated genes in the library LCTP represented a variety of functional groups, including metabolism-, cell adhesion and migration-, signal transduction-, and anti-apoptosis-related genes. Using semi-quantitative reverse transcription-polymerase chain reaction, seven genes chosen randomly from the LCTP were analyzed in the tumor tissue paired with its corresponding adjacent normal lung tissue derived from 16 cases of SCC. Among them, the IQGAP1, RAP1GDS1, PAICS, MLF1, and MARK1 genes showed an expression pattern consistent with that of the SSH analysis. Identification and further characterization of these genes may allow a better understanding of lung carcinogenesis.
oPOSSUM: identification of over-represented transcription factor binding sites in co-expressed genes
Ho Sui, Shannan J.; Mortimer, James R.; Arenillas, David J.; Brumm, Jochen; Walsh, Christopher J.; Kennedy, Brian P.; Wasserman, Wyeth W.
2005-01-01
Targeted transcript profiling studies can identify sets of co-expressed genes; however, identification of the underlying functional mechanism(s) is a significant challenge. Established methods for the analysis of gene annotations, particularly those based on the Gene Ontology, can identify functional linkages between genes. Similar methods for the identification of over-represented transcription factor binding sites (TFBSs) have been successful in yeast, but extension to human genomics has largely proved ineffective. Creation of a system for the efficient identification of common regulatory mechanisms in a subset of co-expressed human genes promises to break a roadblock in functional genomics research. We have developed an integrated system that searches for evidence of co-regulation by one or more transcription factors (TFs). oPOSSUM combines a pre-computed database of conserved TFBSs in human and mouse promoters with statistical methods for identification of sites over-represented in a set of co-expressed genes. The algorithm successfully identified mediating TFs in control sets of tissue-specific genes and in sets of co-expressed genes from three transcript profiling studies. Simulation studies indicate that oPOSSUM produces few false positives using empirically defined thresholds and can tolerate up to 50% noise in a set of co-expressed genes. PMID:15933209
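The over-representation idea in the abstract above can be illustrated with a small sketch. The abstract does not state which statistics oPOSSUM uses, so the one-tailed hypergeometric test below is an assumption chosen for illustration, and the function name and arguments are hypothetical rather than part of oPOSSUM.

```python
from math import comb


def overrepresentation_p(hits_in_set, set_size, hits_in_background, background_size):
    """One-tailed hypergeometric p-value, P(X >= hits_in_set).

    Illustrative only: asks how surprising it is to see at least
    `hits_in_set` genes carrying a given binding site among `set_size`
    co-expressed genes, when `hits_in_background` of the
    `background_size` background genes carry that site.
    """
    total = comb(background_size, set_size)
    upper = min(set_size, hits_in_background)
    tail = sum(
        comb(hits_in_background, k)
        * comb(background_size - hits_in_background, set_size - k)
        for k in range(hits_in_set, upper + 1)
    )
    return tail / total


# Toy numbers (not from the paper): 2 of 2 co-expressed genes carry the
# site, against 5 of 10 genes in the background.
p = overrepresentation_p(2, 2, 5, 10)
```

A small p-value suggests the site occurs more often in the co-expressed set than chance alone would explain; a real analysis would also correct for testing many binding-site models.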
NASA Technical Reports Server (NTRS)
Hedenstierna, K. O.; Lee, Y. H.; Yang, Y.; Fox, G. E.
1993-01-01
A prototype stable RNA identification cassette for monitoring genetically engineered plasmids carried by strains of Escherichia coli has been developed. The cassette consists of a Vibrio proteolyticus 5S ribosomal RNA (rRNA) gene surrounded by promoters and terminators from the rrnB operon of Escherichia coli. The identifier RNA is expressed and successfully processed so that approximately 30% of the 5S rRNA isolated from either whole cells or 70S ribosomes is of the V. proteolyticus type. Cells carrying the identifier are readily detectable by hybridization. Accurate measurements show that the identification cassette has little effect on fitness compared to a strain containing an analogous plasmid carrying wild type E. coli 5S rRNA, and the V. proteolyticus 5S rRNA gene is not inactivated after prolonged growth. These results demonstrate the feasibility of developing small standardized identification cassettes that can utilize already existing highly sensitive rRNA detection methods. Cassettes of this type could in principle be incorporated into either the engineered regions of recombinant plasmids or their hosts.
Barbosa, Catarina; García-Martínez, José; Pérez-Ortín, José E.; Mendes-Ferreira, Ana
2015-01-01
Nitrogen levels in grape juices are of major importance in winemaking, ensuring adequate yeast growth and fermentation performance. Here we used a comparative transcriptome analysis to uncover wine yeast responses to nitrogen availability during fermentation. Gene expression was assessed in three genetically and phenotypically divergent commercial wine strains (CEG, VL1 and QA23), under low (67 mg/L) and high (670 mg/L) nitrogen regimes, at three time points during fermentation (12h, 24h and 96h). Two-way ANOVA of each fermentation condition led to the identification of genes whose expression was dependent on strain, fermentation stage and the interaction of both factors. The high fermenter yeast strain QA23 was more clearly distinct from the other two strains, by differential expression of genes involved in flocculation, mitochondrial functions, energy generation and protein folding and stabilization. For all strains, higher transcriptional variability due to fermentation stage was seen in the high nitrogen fermentations. A positive correlation between maximum fermentation rate and the expression of genes involved in stress response was observed. The finding of common genes correlated with both fermentation activity and nitrogen uptake underlines the role of nitrogen in yeast fermentative fitness. The comparative analysis of genes differentially expressed between both fermentation conditions at 12h, where the main difference was the level of nitrogen available, showed the highest variability amongst strains, revealing strain-specific responses. Nevertheless, we were able to identify a small set of genes whose expression profiles can quantitatively assess the common response of the yeast strains to varying nitrogen conditions.
The use of three contrasting yeast strains in gene expression analysis prompts the identification of more reliable, accurate and reproducible biomarkers that will facilitate the diagnosis of deficiency of this nutrient in the grape-musts and the development of strategies to optimize yeast performance in industrial fermentations. PMID:25884705
Liu, Lei; Ang, Keng Pee; Elliott, J A K; Kent, Matthew Peter; Lien, Sigbjørn; MacDonald, Danielle; Boulding, Elizabeth Grace
2017-03-01
Comparative genome scans can be used to identify chromosome regions, but not traits, that are putatively under selection. Identification of targeted traits may be more likely in recently domesticated populations under strong artificial selection for increased production. We used a North American Atlantic salmon 6K SNP dataset to locate genome regions of an aquaculture strain (Saint John River) that were highly diverged from that of its putative wild founder population (Tobique River). First, admixed individuals with partial European ancestry were detected using STRUCTURE and removed from the dataset. Outlier loci were then identified as those showing extreme differentiation between the aquaculture population and the founder population. All Arlequin methods identified an overlapping subset of 17 outlier loci, three of which were also identified by BayeScan. Many outlier loci were near candidate genes and some were near published quantitative trait loci (QTLs) for growth, appetite, maturity, or disease resistance. Parallel comparisons using a wild, nonfounder population (Stewiacke River) yielded only one overlapping outlier locus as well as a known maturity QTL. We conclude that genome scans comparing a recently domesticated strain with its wild founder population can facilitate identification of candidate genes for traits known to have been under strong artificial selection.
Van Vooren, Steven; Coessens, Bert; De Moor, Bart; Moreau, Yves; Vermeesch, Joris R
2007-09-01
Genome-wide array comparative genomic hybridization screening is uncovering pathogenic submicroscopic chromosomal imbalances in patients with developmental disorders. In those patients, imbalances appear now to be scattered across the whole genome, and most patients carry different chromosomal anomalies. Screening patients with developmental disorders can be considered a forward functional genome screen. The imbalances pinpoint the location of genes that are involved in human development. Because most imbalances encompass regions harboring multiple genes, the challenge is to (1) identify those genes responsible for the specific phenotype and (2) disentangle the role of the different genes located in an imbalanced region. In this review, we discuss novel tools and relevant databases that have recently been developed to aid this gene discovery process. Identification of the functional relevance of genes will not only deepen our understanding of human development but will, in addition, aid in the data interpretation and improve genetic counseling.
Mistry, Divya; Wise, Roger P; Dickerson, Julie A
2017-01-01
Identification of central genes and proteins in biomolecular networks provides credible candidates for pathway analysis, functional analysis, and essentiality prediction. The DiffSLC centrality measure predicts central and essential genes and proteins using a protein-protein interaction network. Network centrality measures prioritize nodes and edges based on their importance to the network topology. These measures helped identify critical genes and proteins in biomolecular networks. The proposed centrality measure, DiffSLC, combines the number of interactions of a protein and the gene coexpression values of genes from which those proteins were translated, as a weighting factor to bias the identification of essential proteins in a protein interaction network. Potentially essential proteins with low node degree are promoted through eigenvector centrality. Thus, the gene coexpression values are used in conjunction with the eigenvector of the network's adjacency matrix and edge clustering coefficient to improve essentiality prediction. The outcome of this prediction is shown using three variations: (1) inclusion or exclusion of gene co-expression data, (2) impact of different coexpression measures, and (3) impact of different gene expression data sets. For a total of seven networks, DiffSLC is compared to other centrality measures using Saccharomyces cerevisiae protein interaction networks and gene expression data. Comparisons are also performed for the top ranked proteins against the known essential genes from the Saccharomyces Gene Deletion Project, which show that DiffSLC detects more essential proteins and has a higher area under the ROC curve than other compared methods. This makes DiffSLC a stronger alternative to other centrality methods for detecting essential genes using a protein-protein interaction network that obeys centrality-lethality principle. DiffSLC is implemented using the igraph package in R, and networkx package in Python. 
The Python package can be obtained from git.io/diffslcpy. The R implementation and code to reproduce the analysis are available via git.io/diffslc.
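As a rough illustration of how degree and eigenvector centrality can be blended into one ranking, the sketch below uses only the Python standard library. It is not the DiffSLC formula: the abstract also describes gene coexpression weights and the edge clustering coefficient, which are left out here, and the function names and the `alpha` mixing parameter are hypothetical.

```python
def eigenvector_centrality(adj, iterations=200, tol=1e-10):
    """Power iteration on an undirected graph given as {node: set_of_neighbours}.

    Each step adds the node's own score (iterating on A + I), which keeps
    the same leading eigenvector but avoids oscillation on bipartite graphs.
    Scores are scaled so the largest value is 1.
    """
    score = {n: 1.0 for n in adj}
    for _ in range(iterations):
        new = {n: score[n] + sum(score[m] for m in adj[n]) for n in adj}
        norm = max(new.values())
        new = {n: v / norm for n, v in new.items()}
        if max(abs(new[n] - score[n]) for n in adj) < tol:
            return new
        score = new
    return score


def blended_centrality(adj, alpha=0.5):
    """Mix normalised degree and eigenvector centrality (illustrative only)."""
    eig = eigenvector_centrality(adj)
    max_deg = max(len(nbrs) for nbrs in adj.values()) or 1  # guard: no edges
    return {
        n: alpha * len(adj[n]) / max_deg + (1 - alpha) * eig[n]
        for n in adj
    }


# Toy star-shaped interaction network: hub "a" with three partners.
network = {"a": {"b", "c", "d"}, "b": {"a"}, "c": {"a"}, "d": {"a"}}
ranking = blended_centrality(network, alpha=0.5)
top_node = max(ranking, key=ranking.get)
```

Lowering `alpha` gives more weight to the eigenvector term, which is the component the abstract credits with promoting potentially essential proteins of low degree.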
Fatania, Nita; Fraser, Mark; Savage, Mike; Hart, Jason; Abdolrasouli, Alireza
2015-12-01
Performance of matrix-assisted laser desorption ionisation-time of flight mass spectrometry (MALDI-TOF MS) was compared in a side-by-side analysis with conventional phenotypic methods currently in use in our laboratory for identification of yeasts in a routine diagnostic setting. A diverse collection of 200 clinically important yeasts (19 species, five genera) were identified by both methods using standard protocols. Discordant or unreliable identifications were resolved by sequencing of the internal transcribed spacer region of the rRNA gene. MALDI-TOF and conventional methods were in agreement for 182 isolates (91%) with correct identification to species level. Eighteen discordant results (9%) were due to rarely encountered species, hence the difficulty in their identification using traditional phenotypic methods. MALDI-TOF MS enabled rapid, reliable and accurate identification of clinically important yeasts in a routine diagnostic microbiology laboratory. Isolates with rare, unusual or low probability identifications should be confirmed using robust molecular methods. Published by the BMJ Publishing Group Limited.
Intragenome Diversity of Gene Families Encoding Toxin-like Proteins in Venomous Animals.
Rodríguez de la Vega, Ricardo C; Giraud, Tatiana
2016-11-01
The evolution of venoms is the story of how toxins arise and of the processes that generate and maintain their diversity. For animal venoms these processes include recruitment for expression in the venom gland, neofunctionalization, paralogous expansions, and functional divergence. The systematic study of these processes requires the reliable identification of the venom components involved in antagonistic interactions. High-throughput sequencing has the potential to uncover the entire set of toxins in a given organism, yet the existence of non-venom toxin paralogs and the misleading effects of a partial census of the molecular diversity of toxins make it necessary to collect complementary evidence to distinguish true toxins from their non-venom paralogs. Here, we analyzed the whole genomes of two scorpions, one spider and one snake, aiming at the identification of the full repertoires of genes encoding toxin-like proteins. We classified the entire set of protein-coding genes into paralogous groups and monotypic genes, identified genes encoding toxin-like proteins based on known toxin families, and quantified their expression in both venom glands and pooled tissues. Our results confirm that genes encoding toxin-like proteins are part of multigene families, and that these families arise by recruitment events from non-toxin genes followed by limited expansions of the toxin-like protein coding genes. We also show that failing to account for sequence similarity with non-toxin proteins has a considerable misleading effect that can be greatly reduced by comparative transcriptomics. Our study overall contributes to the understanding of the evolutionary dynamics of proteins involved in antagonistic interactions. © The Author 2016. Published by Oxford University Press on behalf of the Society for Integrative and Comparative Biology.
Charlesworth, Jac C; Peralta, Juan M; Drigalenko, Eugene; Göring, Harald Hh; Almasy, Laura; Dyer, Thomas D; Blangero, John
2009-12-15
Gene identification using linkage, association, or genome-wide expression is often underpowered. We propose that formal combination of information from multiple gene-identification approaches may lead to the identification of novel loci that are missed when only one form of information is available. Firstly, we analyze the Genetic Analysis Workshop 16 Framingham Heart Study Problem 2 genome-wide association data for HDL-cholesterol using a "gene-centric" approach. Then we formally combine the association test results with genome-wide transcriptional profiling data for high-density lipoprotein cholesterol (HDL-C), from the San Antonio Family Heart Study, using a Z-transform test (Stouffer's method). We identified 39 genes by the joint test at a conservative 1% false-discovery rate, including 9 from the significant gene-based association test and 23 whose expression was significantly correlated with HDL-C. Seven genes identified as significant in the joint test were not independently identified by either the association or expression tests. This combined approach has increased power and leads to the direct nomination of novel candidate genes likely to be involved in the determination of HDL-C levels. Such information can then be used as justification for a more exhaustive search for functional sequence variation within the nominated genes. We anticipate that this type of analysis will improve our speed of identification of regulatory genes causally involved in disease risk.
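The Z-transform (Stouffer's) combination named in the abstract above can be sketched in a few lines. This is a generic, minimal version assuming independent one-sided p-values strictly between 0 and 1; the function name, the optional weights, and the example numbers are illustrative and are not taken from the study.

```python
from math import sqrt
from statistics import NormalDist


def stouffer_combine(p_values, weights=None):
    """Combine independent one-sided p-values with Stouffer's Z-transform test.

    Each p-value (must lie strictly between 0 and 1) is mapped to a
    z-score, the z-scores are summed with optional weights, and the sum
    is rescaled so that it is standard normal under the null hypothesis.
    Returns (combined_z, combined_p).
    """
    nd = NormalDist()
    if weights is None:
        weights = [1.0] * len(p_values)
    # Small p-values become large positive z-scores.
    z_scores = [nd.inv_cdf(1.0 - p) for p in p_values]
    combined_z = sum(w * z for w, z in zip(weights, z_scores)) / sqrt(
        sum(w * w for w in weights)
    )
    return combined_z, 1.0 - nd.cdf(combined_z)


# Hypothetical gene: association p = 0.05 and expression-correlation p = 0.05.
z, p = stouffer_combine([0.05, 0.05])
```

Two moderately significant results reinforce each other here, which is how a joint test can nominate genes that neither data type identifies on its own.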
Sowpati, Divya Tej; Srivastava, Surabhi; Dhawan, Jyotsna; Mishra, Rakesh K
2017-09-13
Comparative epigenomic analysis across multiple genes presents a bottleneck for bench biologists working with NGS data. Despite the development of standardized peak analysis algorithms, the identification of novel epigenetic patterns and their visualization across gene subsets remains a challenge. We developed a fast and interactive web app, C-State (Chromatin-State), to query and plot chromatin landscapes across multiple loci and cell types. C-State has an interactive, JavaScript-based graphical user interface and runs locally in modern web browsers that are pre-installed on all computers, thus eliminating the need for cumbersome data transfer, pre-processing and prior programming knowledge. C-State is unique in its ability to extract and analyze multi-gene epigenetic information. It allows for powerful GUI-based pattern searching and visualization. We include a case study to demonstrate its potential for identifying user-defined epigenetic trends in the context of gene expression profiles.
Applications and Limitations of Mouse Models for Understanding Human Atherosclerosis
von Scheidt, Moritz; Zhao, Yuqi; Kurt, Zeyneb; Pan, Calvin; Zeng, Lingyao; Yang, Xia; Schunkert, Heribert; Lusis, Aldons J.
2017-01-01
Most of the biological understanding of mechanisms underlying coronary artery disease (CAD) derives from studies of mouse models. The identification of multiple CAD loci and strong candidate genes in large human genome-wide association studies (GWAS) presented an opportunity to examine the relevance of mouse models for the human disease. We comprehensively reviewed the mouse literature, including 827 literature-derived genes, and compared it to human data. First, we observed striking concordance of risk factors for atherosclerosis in mice and humans. Second, there was highly significant overlap of mouse genes with human genes identified by GWAS. In particular, of the 46 genes with strong association signals in CAD-GWAS that were studied in mouse models, all but one exhibited consistent effects on atherosclerosis-related phenotypes. Third, we compared 178 CAD-associated pathways derived from human GWAS with 263 from mouse studies and observed that over 50% were consistent between both species. PMID:27916529
Camps, Jordi; Nguyen, Quang Tri; Padilla-Nash, Hesed M.; Knutsen, Turid; McNeil, Nicole E.; Wangsa, Danny; Hummon, Amanda B.; Grade, Marian; Ried, Thomas; Difilippantonio, Michael J.
2016-01-01
To evaluate the mechanisms and consequences of chromosomal aberrations in colorectal cancer (CRC), we used a combination of spectral karyotyping, array comparative genomic hybridization (aCGH), and array-based global gene expression profiling on 31 primary carcinomas and 15 established cell lines. Importantly, aCGH showed that the genomic profiles of primary tumors are recapitulated in the cell lines. We revealed a preponderance of chromosome breakpoints at sites of copy number variants (CNVs) in the CRC cell lines, a novel mechanism of DNA breakage in cancer. The integration of gene expression and aCGH led to the identification of 157 genes localized within high-level copy number changes whose transcriptional deregulation was significantly affected across all of the samples, thereby suggesting that these genes play a functional role in CRC. Genomic amplification at 8q24 was the most recurrent event and led to the overexpression of MYC and FAM84B. Copy number dependent gene expression resulted in deregulation of known cancer genes such as APC, FGFR2, and ERBB2. The identification of only 36 genes whose localization near a breakpoint could account for their observed deregulated expression demonstrates that the major mechanism for transcriptional deregulation in CRC is genomic copy number changes resulting from chromosomal aberrations. PMID:19691111
Trower, M K; Orton, S M; Purvis, I J; Sanseau, P; Riley, J; Christodoulou, C; Burt, D; See, C G; Elgar, G; Sherrington, R; Rogaev, E I; St George-Hyslop, P; Brenner, S; Dykes, C W
1996-02-20
The genome of the pufferfish (Fugu rubripes) (400 Mb) is approximately 7.5 times smaller than the human genome, but it has a similar gene repertoire to that of man. If regions of the two genomes exhibited conservation of gene order (i.e., were syntenic), it should be possible to reduce dramatically the effort required for identification of candidate genes in human disease loci by sequencing syntenic regions of the compact Fugu genome. We have demonstrated that three genes (dihydrolipoamide succinyltransferase, S31iii125, and S20i15), which are linked to FOS in the familial Alzheimer disease locus (AD3) on human chromosome 14, have homologues in the Fugu genome adjacent to Fugu cFOS. The relative gene order of cFOS, S31iii125, and S20i15 was the same in both genomes, but in Fugu these three genes lay within a 12.4-kb region, compared to >600 kb in the human AD3 locus. These results demonstrate the conservation of synteny between the genomes of Fugu and man and highlight the utility of this approach for sequence-based identification of genes in human disease loci.
Williams, Angela H; Sharma, Mamta; Thatcher, Louise F; Azam, Sarwar; Hane, James K; Sperschneider, Jana; Kidd, Brendan N; Anderson, Jonathan P; Ghosh, Raju; Garg, Gagan; Lichtenzveig, Judith; Kistler, H Corby; Shea, Terrance; Young, Sarah; Buck, Sally-Anne G; Kamphuis, Lars G; Saxena, Rachit; Pande, Suresh; Ma, Li-Jun; Varshney, Rajeev K; Singh, Karam B
2016-03-05
Soil-borne fungi of the Fusarium oxysporum species complex cause devastating wilt disease on many crops including legumes that supply human dietary protein needs across many parts of the globe. We present and compare draft genome assemblies for three legume-infecting formae speciales (ff. spp.): F. oxysporum f. sp. ciceris (Foc-38-1) and f. sp. pisi (Fop-37622), significant pathogens of chickpea and pea respectively, the world's second and third most important grain legumes, and lastly f. sp. medicaginis (Fom-5190a) for which we developed a model legume pathosystem utilising Medicago truncatula. Focusing on the identification of pathogenicity gene content, we leveraged the reference genomes of Fusarium pathogens F. oxysporum f. sp. lycopersici (tomato-infecting) and F. solani (pea-infecting) and their well-characterised core and dispensable chromosomes to predict genomic organisation in the newly sequenced legume-infecting isolates. Dispensable chromosomes are not essential for growth and in Fusarium species are known to be enriched in host-specificity and pathogenicity-associated genes. Comparative genomics of the publicly available Fusarium species revealed differential patterns of sequence conservation across F. oxysporum formae speciales, with legume-pathogenic formae speciales not exhibiting greater sequence conservation between them relative to non-legume-infecting formae speciales, possibly indicating the lack of a common ancestral source for legume pathogenicity. Combining predicted dispensable gene content with in planta expression in the model legume-infecting isolate, we identified small conserved regions and candidate effectors, four of which shared greatest similarity to proteins from another legume-infecting ff. spp. We demonstrate that distinction of core and potential dispensable genomic regions of novel F. oxysporum genomes is an effective tool to facilitate effector discovery and the identification of gene content possibly linked to host specificity. While the legume-infecting isolates did not share large genomic regions of pathogenicity-related content, smaller regions and candidate effector proteins were highly conserved, suggesting that they may play specific roles in inducing disease on legume hosts.
Miyagawa, Maiko; Nishio, Shin-Ya; Usami, Shin-Ichi
2016-01-01
Objective: Cochlear implantation is the most important treatment currently available for profound sensorineural hearing loss. The aim of this study was to investigate the etiology of hearing loss in patients with cochlear implantation, and to compare outcomes. Methods: Japanese hearing loss patients who received cochlear implants (CIs) or electric acoustic stimulation (EAS) at Shinshu University Hospital (n = 173, prelingual onset: 92, postlingual onset: 81) participated in this study. Invader assay followed by targeted exon sequencing of 63 deafness genes using massively parallel DNA sequencing (MPS) was applied. For prelingual patients, additional imaging examination, cCMV screening, and pediatric examination were performed for precise diagnosis. Results: Genetic screening successfully identified the causative mutation in 60% of patients with prelingual onset hearing loss and in 36% of those with postlingual hearing loss. Differences in the kinds of genes identified were observed between the two groups. Although there were marked variations in the outcome of cochlear implantation, patients with specific deafness gene mutations showed relatively good results. Conclusion: The present study showed that genetic etiology is a major cause of hearing loss in CI/EAS patients. Patients possessing mutations in a number of deafness genes known to be expressed within the inner ear have achieved satisfactory auditory performance, suggesting that the identification of the genetic background facilitates the prediction of post-CI performance. MPS is a powerful tool for the identification of causative deafness genes in patients receiving cochlear implantation. Therefore, determination of the involved region inside/outside of the cochlea by identification of the responsible gene is essential. PMID:26756145
Identification and functional analysis of secreted effectors from phytoparasitic nematodes.
Rehman, Sajid; Gupta, Vijai K; Goyal, Aakash K
2016-03-21
Plant parasitic nematodes develop an intimate and long-term feeding relationship with their host plants. They induce a multi-nucleate feeding site close to the vascular bundle in the roots of their host plant and remain sessile for the rest of their life. Nematode secretions, produced in the oesophageal glands and secreted through a hollow stylet into the host plant cytoplasm, are believed to play a key role in pathogenesis. To combat these persistent pathogens, the identification and functional analysis of secreted effectors can serve as a key to devising durable control measures. In this review, we recapitulate current knowledge on the identification and functional characterization of the secreted effector repertoire of phytoparasitic nematodes. Despite considerable efforts, the identification of genes encoding nematode secreted proteins has long been severely hampered by the nematodes' microscopic size, long generation time and obligate biotrophic nature. Methodologies such as bioinformatics, protein structure modeling, in situ hybridization microscopy, and protein-protein interaction have been used to identify the effectors and to attribute functions to them. In addition, RNA interference (RNAi) has been instrumental in deciphering the role of the genes encoding secreted effectors necessary for parasitism and of genes attributed to normal development. Recent comparative and functional genomic approaches have accelerated the identification of effectors from phytoparasitic nematodes and offer opportunities to control these pathogens. Plant parasitic nematodes pose a serious threat to global food security of various economically important crops. There is a wealth of genomic and transcriptomic information available on plant parasitic nematodes and comparative genomics has identified many effectors.
Bioengineering crops with dsRNA of phytonematode genes can disrupt the life cycle of parasitic nematodes and therefore holds great promise to develop resistant crops against plant-parasitic nematodes.
Gene identification in the congenital disorders of glycosylation type I by whole-exome sequencing.
Timal, Sharita; Hoischen, Alexander; Lehle, Ludwig; Adamowicz, Maciej; Huijben, Karin; Sykut-Cegielska, Jolanta; Paprocka, Justyna; Jamroz, Ewa; van Spronsen, Francjan J; Körner, Christian; Gilissen, Christian; Rodenburg, Richard J; Eidhof, Ilse; Van den Heuvel, Lambert; Thiel, Christian; Wevers, Ron A; Morava, Eva; Veltman, Joris; Lefeber, Dirk J
2012-10-01
Congenital disorders of glycosylation type I (CDG-I) form a growing group of recessive neurometabolic diseases. Identification of disease genes is compromised by the enormous heterogeneity in clinical symptoms and the large number of potential genes involved. Until now, gene identification included the sequential application of biochemical methods in blood samples and fibroblasts. In genetically unsolved cases, homozygosity mapping has been applied in consanguineous families. Altogether, this time-consuming diagnostic strategy led to the identification of defects in 17 different CDG-I genes. Here, we applied whole-exome sequencing (WES) in combination with the knowledge of the protein N-glycosylation pathway for gene identification in our remaining group of six unsolved CDG-I patients from unrelated non-consanguineous families. Exome variants were prioritized based on a list of 76 potential CDG-I candidate genes, leading to the rapid identification of one known and two novel CDG-I gene defects. These included the first X-linked CDG-I due to a de novo mutation in ALG13, and compound heterozygous mutations in DPAGT1, together the first two steps in dolichol-PP-glycan assembly, and mutations in PGM1 in two cases, involved in nucleotide sugar biosynthesis. The pathogenicity of the mutations was confirmed by showing the deficient activity of the corresponding enzymes in patient fibroblasts. Combined with these results, the gene defect has been identified in 98% of our CDG-I patients. Our results implicate the potential of WES to unravel disease genes in the CDG-I in newly diagnosed singleton families.
An integrative approach to ortholog prediction for disease-focused and other functional studies.
Hu, Yanhui; Flockhart, Ian; Vinayagam, Arunachalam; Bergwitz, Clemens; Berger, Bonnie; Perrimon, Norbert; Mohr, Stephanie E
2011-08-31
Mapping of orthologous genes among species serves an important role in functional genomics by allowing researchers to develop hypotheses about gene function in one species based on what is known about the functions of orthologs in other species. Several tools for predicting orthologous gene relationships are available. However, these tools can give different results and identification of predicted orthologs is not always straightforward. We report a simple but effective tool, the Drosophila RNAi Screening Center Integrative Ortholog Prediction Tool (DIOPT; http://www.flyrnai.org/diopt), for rapid identification of orthologs. DIOPT integrates existing approaches, facilitating rapid identification of orthologs among human, mouse, zebrafish, C. elegans, Drosophila, and S. cerevisiae. As compared to individual tools, DIOPT shows increased sensitivity with only a modest decrease in specificity. Moreover, the flexibility built into the DIOPT graphical user interface allows researchers with different goals to appropriately 'cast a wide net' or limit results to highest confidence predictions. DIOPT also displays protein and domain alignments, including percent amino acid identity, for predicted ortholog pairs. This helps users identify the most appropriate matches among multiple possible orthologs. To facilitate using model organisms for functional analysis of human disease-associated genes, we used DIOPT to predict high-confidence orthologs of disease genes in Online Mendelian Inheritance in Man (OMIM) and genes in genome-wide association study (GWAS) data sets. The results are accessible through the DIOPT diseases and traits query tool (DIOPT-DIST; http://www.flyrnai.org/diopt-dist). DIOPT and DIOPT-DIST are useful resources for researchers working with model organisms, especially those who are interested in exploiting model organisms such as Drosophila to study the functions of human disease genes.
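Note: the integration idea described above (combining the output of several ortholog prediction tools and letting the user either "cast a wide net" or keep only high-confidence calls) can be sketched as a simple support count. This is a minimal illustration and not DIOPT's actual scoring scheme, which the abstract does not specify; the tool names, gene names, and the `min_support` parameter are all hypothetical.

```python
from collections import Counter


def integrate_predictions(predictions_by_tool, min_support=1):
    """Count how many tools predict each ortholog pair and rank the pairs.

    predictions_by_tool maps a tool name to a set of (gene_a, gene_b) pairs.
    Pairs supported by fewer than min_support tools are dropped.
    """
    support = Counter()
    for pairs in predictions_by_tool.values():
        # set() guards against a tool listing the same pair twice
        support.update(set(pairs))
    kept = [(pair, n) for pair, n in support.items() if n >= min_support]
    # Highest support first; ties broken alphabetically for stable output
    return sorted(kept, key=lambda item: (-item[1], item[0]))


# Hypothetical predictions from three hypothetical tools
predictions = {
    "tool_a": {("GENE1", "geneA"), ("GENE1", "geneB")},
    "tool_b": {("GENE1", "geneA")},
    "tool_c": {("GENE1", "geneA"), ("GENE2", "geneC")},
}

wide_net = integrate_predictions(predictions, min_support=1)
high_confidence = integrate_predictions(predictions, min_support=2)
```

With `min_support=1` every predicted pair is returned, ranked by the number of supporting tools; raising the threshold trades sensitivity for specificity, which mirrors the trade-off the abstract reports.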
Noor Uddin, Gazi Md; Larsen, Marianne Halberg; Christensen, Henrik; Aarestrup, Frank M; Phu, Tran Minh; Dalsgaard, Anders
2015-01-01
Probiotics are increasingly used in aquaculture to control diseases and improve feed digestion and pond water quality; however, little is known about the antimicrobial resistance properties of such probiotic bacteria and to what extent they may contribute to the development of bacterial resistance in aquaculture ponds. Concerns have been raised that the declared information on probiotic product labels is incorrect and that information on bacterial composition is often missing. We therefore evaluated seven probiotics commonly used in Vietnamese shrimp culture for their bacterial species content, phenotypic antimicrobial resistance and associated transferable resistance genes. Bacterial species were established by 16S rRNA sequence analysis of 125 representative bacterial isolates. MIC testing was done for a range of antimicrobials, and whole genome sequencing of six multiple antimicrobial resistant Bacillus spp. was used to identify resistance genes and genetic elements associated with horizontal gene transfer. Thirteen bacterial species declared on the probiotic products could not be identified and 11 non-declared Bacillus spp. were identified. Although our culture-based isolation and identification may have missed a few bacterial species present in the tested products, this would represent a minor bias; future studies may apply culture-independent identification methods such as pyrosequencing. Only 6/60 isolates were resistant to more than four antimicrobials and whole genome sequencing showed that they contained macrolide (ermD), tetracycline (tetL), phenicol (fexA) and trimethoprim (dfrD, dfrG and dfrK) resistance genes, but no known structures associated with horizontal gene transfer. Probiotic bacterial strains used in Vietnamese shrimp culture seem to contribute very limited types and numbers of resistance genes compared to the naturally occurring bacterial species in aquaculture environments.
Approval procedures of probiotic products must be strengthened through scientific-based efficacy trials and product labels should allow identification of individual bacterial strains and inform the farmer on specific purpose, dosage and correct application measures.
Wang, Kehua; Liu, Yanrong; Tian, Jinli; Huang, Kunyong; Shi, Tianran; Dai, Xiaoxia; Zhang, Wanjun
2017-01-01
Perennial ryegrass (Lolium perenne) is one of the most widely used forage and turf grasses in the world due to its desirable agronomic qualities. However, as a cool-season perennial grass species, high temperature is a major factor limiting its performance in warmer and transition regions. In this study, a de novo transcriptome was generated using a cDNA library constructed from perennial ryegrass leaves subjected to short-term heat stress treatment. Expression profiling and identification of perennial ryegrass heat response genes were then performed by digital gene expression analyses. The goal of this work was to produce expression profiles of high temperature stress responsive genes in perennial ryegrass leaves and further identify the potentially important candidate genes with altered levels of transcript, such as those genes involved in transcriptional regulation, antioxidant responses, plant hormones and signal transduction, and cellular metabolism. The de novo assembly of the perennial ryegrass transcriptome in this study obtained more total and annotated unigenes compared to previously published ones. Many of the differentially expressed genes (DEGs) identified were genes that are known to respond to heat stress in plants, including HSFs, HSPs, and antioxidant related genes. We also identified four gene candidates mainly involved in C4 carbon fixation, and one TOR gene. Their exact roles in plant heat stress response need to be dissected further. This study provides gene resources for improving heat stress tolerance in both perennial ryegrass and other cool-season perennial grass plants. PMID:28680431
Salerno, Paola; Persson, Jessica; Bucca, Giselda; Laing, Emma; Ausmees, Nora; Smith, Colin P; Flärdh, Klas
2013-12-05
The sporulation of aerial hyphae of Streptomyces coelicolor is a complex developmental process. Only a limited number of the genes involved in this intriguing morphological differentiation programme are known, including some key regulatory genes. The aim of this study was to expand our knowledge of the gene repertoire involved in S. coelicolor sporulation. We report a DNA microarray-based investigation of developmentally controlled gene expression in S. coelicolor. By comparing global transcription patterns of the wild-type parent and two mutants lacking key regulators of aerial hyphal sporulation, we found a total of 114 genes that had significantly different expression in at least one of the two mutants compared to the wild-type during sporulation. A whiA mutant showed the largest effects on gene expression, while only a few genes were specifically affected by whiH mutation. Seven new sporulation loci were investigated in more detail with respect to expression patterns and mutant phenotypes. These included SCO7449-7451 that affect spore pigment biogenesis; SCO1773-1774 that encode an L-alanine dehydrogenase and a regulator-like protein and are required for maturation of spores; SCO3857 that encodes a protein highly similar to a nosiheptide resistance regulator and affects spore maturation; and four additional loci (SCO4421, SCO4157, SCO0934, SCO1195) that show developmental regulation but no overt mutant phenotype. Furthermore, we describe a new promoter-probe vector that takes advantage of the red fluorescent protein mCherry as a reporter of cell type-specific promoter activity. Aerial hyphal sporulation in S. coelicolor is a technically challenging process for global transcriptomic investigations since it occurs only as a small fraction of the colony biomass and is not highly synchronized. 
Here we show that by comparing a wild-type to mutants lacking regulators that are specifically affecting processes in aerial hypha, it is possible to identify previously unknown genes with important roles in sporulation. The transcriptomic data reported here should also serve as a basis for identification of further developmentally important genes in future functional studies.
Bautista-Trujillo, G U; Solorio-Rivera, J L; Rentería-Solórzano, I; Carranza-Germán, S I; Bustos-Martínez, J A; Arteaga-Garibay, R I; Baizabal-Aguirre, V M; Cajero-Juárez, M; Bravo-Patiño, A; Valdez-Alarcón, J J
2013-03-01
Rapid isolation and identification of pathogens is a major goal of diagnostic microbiology. In order to isolate and identify Staphylococcus aureus, a number of authors have used a variety of selective and/or differential culture media. However, to date, there are no reports comparing the efficacy of selective and differential culture media for S. aureus isolation from bovine mastitis cases using the 16S rRNA (rrs) gene sequence as a gold standard test. In the present study, we evaluated the efficacy of four selective and/or differential culture media for the isolation of S. aureus from milk samples collected from cows suffering from bovine mastitis. Four hundred and forty isolates were obtained using salt-mannitol agar (SMA, Bioxon), Staphylococcus-110 agar (S110, Bioxon), CHROMAgar Staph aureus (CSA, BD-BBL) and sheep's blood agar (SBA, BD-BBL). All bacterial isolates were identified by their typical colony morphology in the respective media, by secondary tests (for coagulase and β-haemolysis) and by partial 16S rRNA (rrs) gene sequencing as a gold standard test. Sensitivity, positive predictive and negative predictive values were higher for SMA (86.96, 52.63 and 95.95%, respectively) compared with S110 (70.00, 23.73 and 90.91%, respectively), CSA (69.23, 28.13 and 95.74%, respectively) and SBA (68.75, 37.93 and 89.58%, respectively) while specificity values were similar for all media. Data indicated that the use of culture media for S. aureus isolation combined with determination of coagulase activity and haemolysis as secondary tests improved accuracy of the identification and was in accordance with rrs gene sequence-analysis compared with the use of the culture media alone.
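Note: the sensitivity, specificity, and predictive values quoted above follow from a 2x2 table of culture result versus the gold standard (here, rrs gene sequencing). The sketch below shows the standard formulas. The counts are hypothetical: they were chosen so that the output matches the percentages reported for SMA, but the abstract does not give the study's actual counts.

```python
from dataclasses import dataclass


@dataclass
class ConfusionCounts:
    tp: int  # test positive, gold standard positive
    fp: int  # test positive, gold standard negative
    fn: int  # test negative, gold standard positive
    tn: int  # test negative, gold standard negative


def diagnostic_metrics(c):
    """Return the four standard diagnostic accuracy measures as fractions."""
    return {
        "sensitivity": c.tp / (c.tp + c.fn),
        "specificity": c.tn / (c.tn + c.fp),
        "ppv": c.tp / (c.tp + c.fp),
        "npv": c.tn / (c.tn + c.fn),
    }


# Hypothetical counts (assumption, not taken from the study)
example = ConfusionCounts(tp=20, fp=18, fn=3, tn=71)
metrics = diagnostic_metrics(example)
percentages = {name: round(value * 100, 2) for name, value in metrics.items()}
```

Note that the predictive values depend on how common S. aureus is among the isolates tested, so a low positive predictive value can coexist with a high sensitivity, as in the figures reported above.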
Ecology and genomics of Bacillus subtilis.
Earl, Ashlee M; Losick, Richard; Kolter, Roberto
2008-06-01
Bacillus subtilis is a remarkably diverse bacterial species that is capable of growth within many environments. Recent microarray-based comparative genomic analyses have revealed that members of this species also exhibit considerable genomic diversity. The identification of strain-specific genes might explain how B. subtilis has become so broadly adapted. The goal of identifying ecologically adaptive genes could soon be realized with the imminent release of several new B. subtilis genome sequences. As we embark upon this exciting new era of B. subtilis comparative genomics we review what is currently known about the ecology and evolution of this species.
Everts-van der Wind, Annelie; Kata, Srinivas R.; Band, Mark R.; Rebeiz, Mark; Larkin, Denis M.; Everts, Robin E.; Green, Cheryl A.; Liu, Lei; Natarajan, Shreedhar; Goldammer, Tom; Lee, Jun Heon; McKay, Stephanie; Womack, James E.; Lewin, Harris A.
2004-01-01
A second-generation 5000 rad radiation hybrid (RH) map of the cattle genome was constructed primarily using cattle ESTs that were targeted to gaps in the existing cattle–human comparative map, as well as to sparsely populated map intervals. A total of 870 targeted markers were added, bringing the number of markers mapped on the RH5000 panel to 1913. Of these, 1463 have significant BLASTN hits (E < e–5) against the human genome sequence. A cattle–human comparative map was created using human genome sequence coordinates of the paired orthologs. One-hundred and ninety-five conserved segments (defined by two or more genes) were identified between the cattle and human genomes, of which 31 are newly discovered and 34 were extended singletons on the first-generation map. The new map represents an improvement of 20% genome-wide comparative coverage compared with the first-generation map. Analysis of gene content within human genome regions where there are gaps in the comparative map revealed gaps with both significantly greater and significantly lower gene content. The new, more detailed cattle–human comparative map provides an improved resource for the analysis of mammalian chromosome evolution, the identification of candidate genes for economically important traits, and for proper alignment of sequence contigs on cattle chromosomes. PMID:15231756
Gene Expression Profiling of Gastric Cancer
Marimuthu, Arivusudar; Jacob, Harrys K.C.; Jakharia, Aniruddha; Subbannayya, Yashwanth; Keerthikumar, Shivakumar; Kashyap, Manoj Kumar; Goel, Renu; Balakrishnan, Lavanya; Dwivedi, Sutopa; Pathare, Swapnali; Dikshit, Jyoti Bajpai; Maharudraiah, Jagadeesha; Singh, Sujay; Sameer Kumar, Ghantasala S; Vijayakumar, M.; Veerendra Kumar, Kariyanakatte Veeraiah; Premalatha, Chennagiri Shrinivasamurthy; Tata, Pramila; Hariharan, Ramesh; Roa, Juan Carlos; Prasad, T.S.K; Chaerkady, Raghothama; Kumar, Rekha Vijay; Pandey, Akhilesh
2015-01-01
Gastric cancer is the second leading cause of cancer death worldwide, both in men and women. A genomewide gene expression analysis was carried out to identify differentially expressed genes in gastric adenocarcinoma tissues as compared to adjacent normal tissues. We used Agilent’s whole human genome oligonucleotide microarray platform representing ~41,000 genes to carry out gene expression analysis. Two-color microarray analysis was employed to directly compare the expression of genes between tumor and normal tissues. Through this approach, we identified several previously known candidate genes along with a number of novel candidate genes in gastric cancer. Testican-1 (SPOCK1) was one of the novel molecules that was 10-fold upregulated in tumors. Using tissue microarrays, we validated the expression of testican-1 by immunohistochemical staining. It was overexpressed in 56% (160/282) of the cases tested. Pathway analysis led to the identification of several networks of interacting genes, with SPOCK1 appearing in the topmost networks. By gene enrichment analysis, we identified several genes involved in cell adhesion and cell proliferation to be significantly upregulated while those corresponding to metabolic pathways were significantly downregulated. The differentially expressed genes identified in this study are candidate biomarkers for gastric adenocarcinoma. PMID:27030788
Pim-1: A Molecular Target to Modulate Cellular Resistance to Therapy in Prostate Cancer
2005-10-01
Reiter RE, Lilly MB: Gene expression profiling in R-flurbiprofen-treated prostate cancer: Identification of prostate stem cell antigen as a... flurbiprofen-regulated gene. (submitted, 2006). 51. Holder SL, Zemskova M, Bremner R, Neidigh J, Lilly MB: Identification of specific, cell-permeable...profiling in R-flurbiprofen-treated prostate cancer: Identification of prostate stem cell antigen as a flurbiprofen-regulated gene. (poster
Furlong, Michael; Seong, Jae Young
2017-01-01
Seven transmembrane receptors (7TMRs), also known as G protein-coupled receptors, are popular targets of drug development, particularly 7TMR systems that are activated by peptide ligands. Although many pharmaceutical drugs have been discovered via conventional bulk analysis techniques, the increasing availability of structural and evolutionary data is facilitating a change to rational, targeted drug design. This article discusses the appeal of neuropeptide-7TMR systems as drug targets and provides an overview of concepts in the evolution of vertebrate genomes and gene families. Subsequently, methods that use evolutionary concepts and comparative analysis techniques to aid in gene discovery, gene function identification, and novel drug design are provided along with case study examples. PMID:28035082
Abbott, Kenneth L; Nyre, Erik T; Abrahante, Juan; Ho, Yen-Yi; Isaksson Vogel, Rachel; Starr, Timothy K
2015-01-01
Identification of cancer driver gene mutations is crucial for advancing cancer therapeutics. Due to the overwhelming number of passenger mutations in the human tumor genome, it is difficult to pinpoint causative driver genes. Using transposon mutagenesis in mice many laboratories have conducted forward genetic screens and identified thousands of candidate driver genes that are highly relevant to human cancer. Unfortunately, this information is difficult to access and utilize because it is scattered across multiple publications using different mouse genome builds and strength metrics. To improve access to these findings and facilitate meta-analyses, we developed the Candidate Cancer Gene Database (CCGD, http://ccgd-starrlab.oit.umn.edu/). The CCGD is a manually curated database containing a unified description of all identified candidate driver genes and the genomic location of transposon common insertion sites (CISs) from all currently published transposon-based screens. To demonstrate relevance to human cancer, we performed a modified gene set enrichment analysis using KEGG pathways and show that human cancer pathways are highly enriched in the database. We also used hierarchical clustering to identify pathways enriched in blood cancers compared to solid cancers. The CCGD is a novel resource available to scientists interested in the identification of genetic drivers of cancer. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Rodrigues, Thais B; Duan, Jian J; Palli, Subba R; Rieske, Lynne K
2018-03-22
A recent study has shown that RNA interference (RNAi) is efficient in emerald ash borer (EAB), Agrilus planipennis, and that ingestion of double-stranded RNA (dsRNA) targeting specific genes causes gene silencing and mortality in neonates. Here, we report on the identification of highly effective target genes for RNAi-mediated control of EAB. We screened 13 candidate genes in neonate larvae and selected the most effective target genes for further investigation, including their effect on EAB adults and on a non-target organism, Tribolium castaneum. The two most efficient target genes selected, hsp (heat shock 70-kDa protein cognate 3) and shi (shibire), caused up to 90% mortality of larvae and adults. In EAB eggs, larvae, and adults, hsp is expressed at higher levels than shi. Ingestion of dsHSP and dsSHI caused mortality in both neonate larvae and adults. Administration of a mixture of both dsRNAs worked better than either dsRNA by itself. In contrast, injection of EAB.dsHSP and EAB.dsSHI did not cause mortality in T. castaneum. Thus, the two genes identified cause high mortality in the EAB with no apparent phenotypic effects in a non-target organism, the red flour beetle, and could be used in RNAi-mediated control of this invasive pest.
Livny, Jonathan; Zhou, Xiaohui; Mandlik, Anjali; Hubbard, Troy; Davis, Brigid M; Waldor, Matthew K
2014-10-29
Vibrio parahaemolyticus is the leading worldwide cause of seafood-associated gastroenteritis, yet little is known regarding its intraintestinal gene expression or physiology. To date, in vivo analyses have focused on identification and characterization of virulence factors--e.g. a crucial Type III secretion system (T3SS2)--rather than genome-wide analyses of in vivo biology. Here, we used RNA-Seq to profile V. parahaemolyticus gene expression in infected infant rabbits, which mimic human infection. Comparative transcriptomic analysis of V. parahaemolyticus isolated from rabbit intestines and from several laboratory conditions enabled identification of mRNAs and sRNAs induced during infection and of regulatory factors that likely control them. More than 12% of annotated V. parahaemolyticus genes are differentially expressed in the intestine, including the genes of T3SS2, which are likely induced by bile-mediated activation of the transcription factor VtrB. Our analyses also suggest that V. parahaemolyticus has access to glucose or other preferred carbon sources in vivo, but that iron is inconsistently available. The V. parahaemolyticus transcriptional response to in vivo growth is far more widespread than and largely distinct from that of V. cholerae, likely due to the distinct ways in which these diarrheal pathogens interact with and modulate the environment in the small intestine. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Isaksson, Jenny; Rasmussen, Magnus; Nilson, Bo; Stadler, Liselott Svensson; Kurland, Siri; Olaison, Lars; Ek, Elisabeth; Herrmann, Björn
2015-04-01
Streptococcus spp. are important causes of infective endocarditis but challenging in species identification. This study compared identification based on sequence determination of the rnpB gene with 2 systems of matrix-assisted laser desorption ionization-time of flight mass spectrometry, MALDI Biotyper (Bruker) and VITEK MS IVD (bioMérieux). Blood culture isolates of viridans streptococci from 63 patients with infective endocarditis were tested. The 3 methods showed full agreement for all 36 isolates identified in the Anginosus, Bovis, and Mutans groups or identified as Streptococcus cristatus, Streptococcus gordonii, or Streptococcus sanguinis. None of the methods could reliably identify the 23 isolates to the species level when designated as Streptococcus mitis, Streptococcus oralis, or Streptococcus tigurinus. In 7 isolates classified to the Mitis group, the rnpB sequences deviated strikingly from all reference sequences, and additional analysis of sodA and groEL genes indicated the occurrence of yet unidentified Streptococcus spp. Copyright © 2015 Elsevier Inc. All rights reserved.
Zhu, Jie; Qin, Yufang; Liu, Taigang; Wang, Jun; Zheng, Xiaoqi
2013-01-01
Identification of gene-phenotype relationships is a fundamental challenge in human health and clinical research. Based on the observation that genes causing the same or similar phenotypes tend to correlate with each other in the protein-protein interaction network, many network-based approaches have been proposed based on different underlying models. A recent comparative study showed that diffusion-based methods achieve state-of-the-art predictive performance. In this paper, a new diffusion-based method is proposed to prioritize candidate disease genes. The diffusion profile of a disease is defined as the stationary distribution of candidate genes given a random walk with restart in which similarities between phenotypes are incorporated. Candidate disease genes are then prioritized by comparing their diffusion profiles with that of the disease. Finally, the effectiveness of our method was demonstrated through leave-one-out cross-validation against control genes from artificial linkage intervals and randomly chosen genes. A comparative study showed that our method achieves improved performance compared to some classical diffusion-based methods. To further illustrate our method, we used our algorithm to predict new causative genes of 16 multifactorial diseases including prostate cancer and Alzheimer's disease, and the top predictions were in good agreement with literature reports. Our study indicates that integration of multiple information sources, especially the phenotype similarity profile data, and introduction of a global similarity measure between disease and gene diffusion profiles are helpful for prioritizing candidate disease genes. Programs and data are available upon request.
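Note: the random walk with restart mentioned above can be sketched as a fixed-point iteration on a column-normalised adjacency matrix. This is a generic textbook version under stated assumptions, not the authors' method: it omits the phenotype-similarity weighting and the comparison of diffusion profiles, and the toy network, seed choice, and restart probability are hypothetical.

```python
def random_walk_with_restart(adj, seeds, restart=0.3, tol=1e-10, max_iter=10000):
    """Return the stationary visiting probabilities of a walk restarting at seeds.

    adj is a symmetric adjacency matrix (list of lists); seeds is a list of
    node indices among which the restart probability is shared equally.
    """
    n = len(adj)
    col_sums = [sum(adj[i][j] for i in range(n)) for j in range(n)]
    # Column-normalise so each column holds the transition probabilities out of node j
    w = [[adj[i][j] / col_sums[j] if col_sums[j] else 0.0 for j in range(n)]
         for i in range(n)]
    p0 = [0.0] * n
    for s in seeds:
        p0[s] = 1.0 / len(seeds)
    p = p0[:]
    for _ in range(max_iter):
        nxt = [(1 - restart) * sum(w[i][j] * p[j] for j in range(n)) + restart * p0[i]
               for i in range(n)]
        # Stop once the L1 change between successive iterates is negligible
        if sum(abs(a - b) for a, b in zip(nxt, p)) < tol:
            return nxt
        p = nxt
    return p


# Toy path network 0 - 1 - 2 - 3 with node 0 as the only known disease gene
toy_network = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]
scores = random_walk_with_restart(toy_network, seeds=[0])
# Candidate genes (non-seed nodes) ranked by decreasing score
ranking = sorted(range(1, 4), key=lambda node: -scores[node])
```

In this toy network the candidates closest to the seed receive the highest scores, which is the intuition behind prioritising genes by network proximity to known disease genes.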
Neupane, Achal; Nepal, Madhav P.; Piya, Sarbottam; Subramanian, Senthil; Rohila, Jai S.; Reese, R. Neil; Benson, Benjamin V.
2013-01-01
Mitogen-activated protein kinase (MAPK) genes in eukaryotes regulate various developmental and physiological processes including those associated with biotic and abiotic stresses. Although MAPKs in some plant species including Arabidopsis have been identified, they are yet to be identified in soybean. Major objectives of this study were to identify GmMAPKs, assess their evolutionary relationships, and analyze their functional divergence. We identified a total of 38 MAPKs, eleven MAPKKs, and 150 MAPKKKs in soybean. Within the GmMAPK family, we also identified a new clade of six genes: four genes with TEY and two genes with TQY motifs requiring further investigation into possible legume-specific functions. The results indicated the expansion of the GmMAPK families attributable to the ancestral polyploidy events followed by chromosomal rearrangements. The GmMAPK and GmMAPKKK families were substantially larger than those in other plant species. The duplicated GmMAPK members presented complex evolutionary relationships and functional divergence when compared to their counterparts in Arabidopsis. We also highlighted existing nomenclatural issues, stressing the need for nomenclatural consistency. GmMAPK identification is vital to soybean crop improvement, and novel insights into the evolutionary relationships will enhance our understanding about plant genome evolution. PMID:24137047
GTA: a game theoretic approach to identifying cancer subnetwork markers.
Farahmand, S; Goliaei, S; Ansari-Pour, N; Razaghi-Moghadam, Z
2016-03-01
The identification of genetic markers (e.g. genes, pathways and subnetworks) for cancer has been one of the most challenging research areas in recent years. A subset of these studies attempts to analyze genome-wide expression profiles to identify markers with high reliability and reusability across independent whole-transcriptome microarray datasets. Therefore, the functional relationships of genes are integrated with their expression data. However, for a more accurate representation of the functional relationships among genes, utilization of the protein-protein interaction network (PPIN) seems to be necessary. Herein, a novel game theoretic approach (GTA) is proposed for the identification of cancer subnetwork markers by integrating genome-wide expression profiles and PPIN. The GTA method was applied to three distinct whole-transcriptome breast cancer datasets to identify the subnetwork markers associated with metastasis. To evaluate the performance of our approach, the identified subnetwork markers were compared with gene-based, pathway-based and network-based markers. We show that GTA is not only capable of identifying robust metastatic markers but also provides a higher classification performance. In addition, based on these GTA-based subnetworks, we identified a new bona fide candidate gene for breast cancer susceptibility.
Mello, I C T; Ribeiro, A S D; Dias, V H G; Silva, R; Sabino, B D; Garrido, R G; Seldin, L; de Moura Neto, Rodrigo Soares
2016-03-01
Cannabis sativa, known by the common name marijuana, is the most widely distributed psychoactive drug in the world. Identification of Cannabis cultivars may be useful for linking seized material to illegal crops, which may reveal trafficking routes and related criminal groups. This study provides evidence for the performance of a segment of the rbcL gene, used as a genetic signature, as a tool for identifying C. sativa samples apprehended by the Rio de Janeiro Police, Brazil. A fragment of approximately 561 bp of the rbcL gene was PCR-amplified and sequenced from 24 samples of C. sativa; all samples showed the same nucleotide sequence, suggesting genetic similarity or identical varieties. In a comparison with sequences from other members of the Cannabaceae family, we found 99% similarity between the Rio de Janeiro sequence and three other C. sativa rbcL genes. These findings suggest that the fragment used in this study is efficient in identifying C. sativa samples and is therefore useful in genetic discrimination of samples seized in forensic cases.
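A similarity figure such as the 99% reported here is typically a percent identity over aligned positions. The following is a minimal sketch of that calculation, assuming the two sequences are already aligned to equal length; the function name and the 20-base fragments are hypothetical illustrations, not rbcL data from the study.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent of aligned positions carrying the same base.

    Assumes both sequences are already aligned to the same length.
    Gap characters ('-') are never counted as matches.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(
        1
        for a, b in zip(seq_a.upper(), seq_b.upper())
        if a == b and a != "-"
    )
    return 100.0 * matches / len(seq_a)


# Hypothetical aligned fragments that differ only at the last position.
query = "ATGTCACCACAAACAGAGAC"
reference = "ATGTCACCACAAACAGAGAT"
print(f"{percent_identity(query, reference):.1f}% identity")
```

Database search tools may define the denominator differently (for example, alignment length versus query length), so values from different tools are not always directly comparable.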
Oh, Sunghee; Song, Seongho
2017-01-01
In gene expression profiling, the data analysis pipeline is categorized into four major downstream tasks, i.e., (1) identification of differential expression; (2) clustering of co-expression patterns; (3) classification of sample subtypes; and (4) detection of genetic regulatory networks, which are performed after preprocessing procedures such as normalization. More specifically, temporal dynamic gene expression data have an inherent feature: two neighboring time points (the previous and current states) are highly correlated with each other, in contrast to static expression data, in which samples are assumed to be independent individuals. In this chapter, we demonstrate how HMMs and hierarchical Bayesian modeling methods capture the horizontal time dependency structures in time series expression profiles by focusing on the identification of differential expression. In addition, the differentially expressed genes and transcript variant isoforms detected over time in these core prerequisite steps can be further applied to the detection of genetic regulatory networks, to comprehensively uncover dynamic repertoires from a systems biology perspective within a coupled framework.
Johnston, Jennifer J; Walker, Robert L; Davis, Sean; Facio, Flavia; Turner, Joyce T; Bick, David P; Daentl, Donna L; Ellison, Jay W; Meltzer, Paul S; Biesecker, Leslie G
2007-01-01
Contiguous gene syndromes cause disorders via haploinsufficiency for adjacent genes. Some contiguous gene syndromes (CGS) have stereotypical breakpoints, but others have variable breakpoints. In CGS that have variable breakpoints, the extent of the deletions may be correlated with severity. The Greig cephalopolysyndactyly contiguous gene syndrome (GCPS‐CGS) is a multiple malformation syndrome caused by haploinsufficiency of GLI3 and adjacent genes. In addition, non‐CGS GCPS can be caused by deletions or duplications in GLI3. Although fluorescence in situ hybridisation (FISH) can identify large deletion mutations in patients with GCPS or GCPS‐CGS, it is not practical for identification of small intragenic deletions or insertions, and it is difficult to accurately characterise the extent of the large deletions using this technique. We have designed a custom comparative genomic hybridisation (CGH) array that allows identification of deletions and duplications at kilobase resolution in the vicinity of GLI3. The array averages one probe every 730 bp for a total of about 14 000 probes over 10 Mb. We have analysed 16 individuals with known or suspected deletions or duplications. In 15 of 16 individuals (14 deletions and 1 duplication), the array confirmed the prior results. In the remaining patient, the normal CGH array result was correct, and the prior assessment was a false positive quantitative polymerase chain reaction result. We conclude that high‐density CGH array analysis is more sensitive than FISH analysis for detecting deletions and provides clinically useful results on the extent of the deletion. We suggest that high‐density CGH array analysis should replace FISH analysis for assessment of deletions and duplications in patients with contiguous gene syndromes caused by variable deletions. PMID:17098889
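The stated probe density can be checked with simple arithmetic. The sketch below uses only the two figures given in the abstract (a 10 Mb region and an average spacing of 730 bp); the function name is a hypothetical helper.

```python
def expected_probe_count(region_bp: int, spacing_bp: int) -> float:
    """Number of probes needed to tile a region at a given average spacing."""
    return region_bp / spacing_bp


region_bp = 10_000_000  # 10 Mb, as stated in the abstract
spacing_bp = 730        # one probe every 730 bp on average

print(round(expected_probe_count(region_bp, spacing_bp)))
```

The result, roughly 13,700 probes, agrees with the "about 14 000 probes" quoted in the abstract.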
Mofatto, Luciana Souto; Carneiro, Fernanda de Araújo; Vieira, Natalia Gomes; Duarte, Karoline Estefani; Vidal, Ramon Oliveira; Alekcevetch, Jean Carlos; Cotta, Michelle Guitton; Verdeil, Jean-Luc; Lapeyre-Montes, Fabienne; Lartaud, Marc; Leroy, Thierry; De Bellis, Fabien; Pot, David; Rodrigues, Gustavo Costa; Carazzolle, Marcelo Falsarella; Pereira, Gonçalo Amarante Guimarães; Andrade, Alan Carvalho; Marraccini, Pierre
2016-04-19
Drought is a widespread limiting factor in coffee plants. It affects plant development, fruit production, bean development and consequently beverage quality. Genetic diversity for drought tolerance exists within the coffee genus. However, the molecular mechanisms underlying the adaptation of coffee plants to drought are largely unknown. In this study, we compared the molecular responses to drought in two commercial cultivars (IAPAR59, drought-tolerant and Rubi, drought-susceptible) of Coffea arabica grown in the field under control (irrigation) and drought conditions using the pyrosequencing of RNA extracted from shoot apices and analysing the expression of 38 candidate genes. Pyrosequencing from shoot apices generated a total of 34.7 Mbp and 535,544 reads enabling the identification of 43,087 clusters (41,512 contigs and 1,575 singletons). These data included 17,719 clusters (16,144 contigs and 1,575 singletons) exclusively from 454 sequencing reads, along with 25,368 hybrid clusters assembled with 454 sequences. The comparison of DNA libraries identified new candidate genes (n = 20) presenting differential expression between IAPAR59 and Rubi and/or drought conditions. Their expression was monitored in plagiotropic buds, together with those of other (n = 18) candidate genes. Under drought conditions, up-regulated expression was observed in IAPAR59 but not in Rubi for CaSTK1 (protein kinase), CaSAMT1 (SAM-dependent methyltransferase), CaSLP1 (plant development) and CaMAS1 (ABA biosynthesis). Interestingly, the expression of lipid-transfer protein (nsLTP) genes was also highly up-regulated under drought conditions in IAPAR59. This may have been related to the thicker cuticle observed on the abaxial leaf surface in IAPAR59 compared to Rubi. The full transcriptome assembly of C. arabica, followed by functional annotation, enabled us to identify differentially expressed genes related to drought conditions.
Using these data, candidate genes were selected and their differential expression profiles were confirmed by qPCR experiments in plagiotropic buds of IAPAR59 and Rubi under drought conditions. As regards the genes up-regulated under drought conditions, specifically in the drought-tolerant IAPAR59, several corresponded to orphan genes but also to genes coding proteins involved in signal transduction pathways, as well as ABA and lipid metabolism, for example. The identification of these genes should help advance our understanding of the genetic determinism of drought tolerance in coffee.
NASA Astrophysics Data System (ADS)
Song, Xiaoming; Duan, Weike; Huang, Zhinan; Liu, Gaofeng; Wu, Peng; Liu, Tongkun; Li, Ying; Hou, Xilin
2015-09-01
In plants, flowering is the most important transition from vegetative to reproductive growth. The flowering patterns of monocots and eudicots are distinctly different, but few studies have described the evolutionary patterns of the flowering genes in them. In this study, we analysed the evolutionary pattern, duplication and expression level of these genes. The main results were as follows: (i) characterization of flowering genes in monocots and eudicots, including the identification of family-specific, orthologous and collinear genes; (ii) full characterization of CONSTANS-like genes in Brassica rapa (BraCOL genes), the key flowering genes; (iii) exploration of the evolution of COL genes in the plant kingdom and construction of the evolutionary pattern of COL genes; (iv) comparative analysis of CO and FT genes between Brassicaceae and Grass, which identified several family-specific amino acids, and revealed that CO and FT protein structures were similar in B. rapa and Arabidopsis but different in rice; and (v) expression analysis of photoperiod pathway-related genes in B. rapa under different photoperiod treatments by RT-qPCR. This analysis will provide resources for understanding the flowering mechanisms and evolutionary pattern of COL genes. In addition, this genome-wide comparative study of COL genes may also provide clues for the evolution of other flowering genes.
Khamis, Atieh; Raoult, Didier; La Scola, Bernard
2005-01-01
A higher proportion (91%) of 168 corynebacterial isolates was positively identified by partial rpoB gene sequence determination than by identification based on 16S rRNA gene sequences. This is thus a simple, molecular-analysis-based method for identification of corynebacteria, but it should be used in conjunction with other tests for definitive identification. PMID:15815024
Ruppitsch, W; Stöger, A; Indra, A; Grif, K; Schabereiter-Gurtner, C; Hirschl, A; Allerberger, F
2007-03-01
In a bioterrorism event a rapid tool is needed to identify relevant dangerous bacteria. The aim of the study was to assess the usefulness of partial 16S rRNA gene sequence analysis and the suitability of diverse databases for identifying dangerous bacterial pathogens. For rapid identification purposes a 500-bp fragment of the 16S rRNA gene of 28 isolates comprising Bacillus anthracis, Brucella melitensis, Burkholderia mallei, Burkholderia pseudomallei, Francisella tularensis, Yersinia pestis, and eight genus-related and unrelated control strains was amplified and sequenced. The obtained sequence data were submitted to three public and two commercial sequence databases for species identification. The most frequent reason for incorrect identification was the lack of the respective 16S rRNA gene sequences in the database. Sequence analysis of a 500-bp 16S rDNA fragment allows the rapid identification of dangerous bacterial species. However, for discrimination of closely related species sequencing of the entire 16S rRNA gene, additional sequencing of the 23S rRNA gene or sequencing of the 16S-23S rRNA intergenic spacer is essential. This work provides comprehensive information on the suitability of partial 16S rDNA analysis and diverse databases for rapid and accurate identification of dangerous bacterial pathogens.
2010-01-01
Background Fruit development, maturation and ripening consist of a complex series of biochemical and physiological changes that in climacteric fruits, including apple and tomato, are coordinated by the gaseous hormone ethylene. These changes lead to final fruit quality and understanding of the functional machinery underlying these processes is of both biological and practical importance. To date many reports have been made on the analysis of gene expression in apple. In this study we focused our investigation on the role of ethylene during apple maturation, specifically comparing transcriptomics of normal ripening with changes resulting from application of the hormone receptor competitor 1-Methylcyclopropene. Results To gain insight into the molecular process regulating ripening in apple, and to compare to tomato (model species for ripening studies), we utilized both homologous and heterologous (tomato) microarray to profile transcriptome dynamics of genes involved in fruit development and ripening, emphasizing those which are ethylene regulated. The use of both types of microarrays facilitated transcriptome comparison between apple and tomato (for the latter using data previously published and available at the TED: tomato expression database) and highlighted genes conserved during ripening of both species, which in turn represent a foundation for further comparative genomic studies. The cross-species analysis had the secondary aim of examining the efficiency of heterologous (specifically tomato) microarray hybridization for candidate gene identification as related to the ripening process. The resulting transcriptomics data revealed coordinated gene expression during fruit ripening of a subset of ripening-related and ethylene responsive genes, further facilitating the analysis of ethylene response during fruit maturation and ripening.
Conclusion Our combined strategy based on microarray hybridization enabled transcriptome characterization during normal climacteric apple ripening, as well as definition of ethylene-dependent transcriptome changes. Comparison with tomato fruit maturation and ethylene responsive transcriptome activity facilitated identification of putative conserved orthologous ripening-related genes, which serve as an initial set of candidates for assessing conservation of gene activity across genomes of fruit bearing plant species. PMID:20973957
Mechergui, Arij; Achour, Wafa; Ben Hassen, Assia
2014-08-01
We aimed to compare the accuracy of genus- and species-level identification of Neisseria spp. using biochemical testing and 16S rRNA sequence analysis. These methods were evaluated using 85 Neisseria spp. clinical isolates initially identified to the genus level by conventional biochemical tests and the API NH system (Bio-Mérieux®). In 34 % (29/85), more than one possibility was given by 16S rRNA sequence analysis. In 6 % (5/85), one of the possibilities offered by 16S rRNA gene sequencing agreed with the result given by biochemical testing. In 4 % (3/85), the same species was given by both methods. 16S rRNA gene sequencing results did not correlate well with biochemical tests.
Park, Ju Heon; Shin, Jong Hee; Choi, Min Ji; Choi, Jin Un; Park, Yeon-Joon; Jang, Sook Jin; Won, Eun Jeong; Kim, Soo Hyun; Kee, Seung Jung; Shin, Myung Geun; Suh, Soon Pal
2017-01-01
We evaluated the ability of the Filamentous Fungi Library 1.0 of the MALDI-TOF MS Biotyper system to identify 345 clinical Aspergillus isolates from 11 Korean hospitals. Compared with results of the internal transcribed spacer region sequencing, the frequencies of correct identification at the species-complex level were 94.5% and 98.8% with cutoff values of 2.0 and 1.7, respectively. Compared with results of β-tubulin gene sequencing, the frequencies of correct identification at the species level were 96.0% (cutoff 2.0) and 100% (cutoff 1.7) for 303 Aspergillus isolates of five common, non-cryptic species, but only 4.8% (cutoff 1.7) and 0% (cutoff 2.0) for 42 Aspergillus isolates of six cryptic species (identifiable by β-tubulin or calmodulin sequencing). These results show that the MALDI Biotyper using the Filamentous Fungi Library version 1.0 enables reliable identification of the majority of common clinical Aspergillus isolates, although the database should be expanded to facilitate identification of cryptic species. Copyright © 2016 Elsevier Inc. All rights reserved.
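The dependence of the identification rate on the score cutoff can be illustrated with a short sketch. It assumes that an isolate counts as correctly identified only when its best-match score reaches the cutoff and the match agrees with the sequencing reference; the function name and the score list are hypothetical and are not data from the study.

```python
def correct_id_rate(results, cutoff):
    """Percent of isolates correctly identified at a given score cutoff.

    results: list of (score, matches_reference) pairs, one per isolate.
    An isolate counts only if score >= cutoff and the match is correct.
    """
    accepted = sum(1 for score, ok in results if score >= cutoff and ok)
    return 100.0 * accepted / len(results)


# Hypothetical best-match scores for eight isolates.
results = [
    (2.31, True), (2.05, True), (1.92, True), (1.78, True),
    (1.65, True), (1.40, False), (2.10, True), (1.74, True),
]

for cutoff in (2.0, 1.7):
    print(f"cutoff {cutoff}: {correct_id_rate(results, cutoff):.1f}% correct")
```

Lowering the cutoff admits more isolates, which raises the rate whenever the additional matches are correct, the same direction of effect reported for the 2.0 and 1.7 cutoffs.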
da Mota, F F; Gomes, E A; Paiva, E; Rosado, A S; Seldin, L
2004-01-01
To avoid the limitations of 16S rRNA-based phylogenetic analysis for Paenibacillus species, the usefulness of the RNA polymerase beta-subunit encoding gene (rpoB) was investigated as an alternative to the 16S rRNA gene for taxonomic studies. Partial rpoB sequences were generated for the type strains of eight nitrogen-fixing Paenibacillus species. The presence of only one copy of rpoB in the genome of P. graminis strain RSA19(T) was demonstrated by denaturing gradient gel electrophoresis and hybridization assays. A comparative analysis of the sequences of the 16S rRNA and rpoB genes was performed and the eight species showed between 91.6-99.1% (16S rRNA) and 77.9-97.3% (rpoB) similarity, allowing a more accurate discrimination between the different species using the rpoB gene. Finally, 24 isolates from the rhizosphere of different cultivars of maize previously identified as Paenibacillus spp. were assigned correctly to one of the nitrogen-fixing species. The data obtained in this study indicate that rpoB is a powerful identification tool, which can be used for the correct discrimination of the nitrogen-fixing species of agricultural and industrial importance within the genus Paenibacillus.
PanGEA: identification of allele specific gene expression using the 454 technology.
Kofler, Robert; Teixeira Torres, Tatiana; Lelley, Tamas; Schlötterer, Christian
2009-05-14
Next generation sequencing technologies hold great potential for many biological questions. While mainly used for genomic sequencing, they are also very promising for gene expression profiling. Sequencing of cDNA does not only provide an estimate of the absolute expression level, it can also be used for the identification of allele specific gene expression. We developed PanGEA, a tool which enables a fast and user-friendly analysis of allele specific gene expression using the 454 technology. PanGEA allows mapping of 454-ESTs to genes or whole genomes, displaying gene expression profiles, identification of SNPs and the quantification of allele specific gene expression. The intuitive GUI of PanGEA facilitates a flexible and interactive analysis of the data. PanGEA additionally implements a modification of the Smith-Waterman algorithm which deals with incorrect estimates of homopolymer length as occurring in the 454 technology. To our knowledge, PanGEA is the first tool which facilitates the identification of allele specific gene expression. PanGEA is distributed under the Mozilla Public License and available at: http://www.kofler.or.at/bioinformatics/PanGEA
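For orientation, the textbook Smith-Waterman local alignment can be sketched as follows. This is the standard algorithm with a linear gap penalty, not PanGEA's homopolymer-aware modification, whose details are not given in the abstract; the scoring values are arbitrary assumptions.

```python
def smith_waterman(a, b, match=2, mismatch=-1, gap=-2):
    """Standard Smith-Waterman local alignment with a linear gap penalty.

    Returns (best_score, aligned_a, aligned_b).
    """
    rows, cols = len(a) + 1, len(b) + 1
    score = [[0] * cols for _ in range(rows)]
    best, best_pos = 0, (0, 0)

    # Fill the matrix; scores are floored at zero so alignments stay local.
    for i in range(1, rows):
        for j in range(1, cols):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                0,
                score[i - 1][j - 1] + sub,  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,      # gap in b
                score[i][j - 1] + gap,      # gap in a
            )
            if score[i][j] > best:
                best, best_pos = score[i][j], (i, j)

    # Trace back from the best cell until a zero score is reached.
    i, j = best_pos
    out_a, out_b = [], []
    while i > 0 and j > 0 and score[i][j] > 0:
        sub = match if a[i - 1] == b[j - 1] else mismatch
        if score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return best, "".join(reversed(out_a)), "".join(reversed(out_b))


print(smith_waterman("TTACGTTT", "GGACGGG"))
```

A homopolymer-aware variant would presumably treat gaps inside runs of identical bases more leniently, but that is an assumption rather than a description of PanGEA's implementation.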
PanGEA: Identification of allele specific gene expression using the 454 technology
Kofler, Robert; Teixeira Torres, Tatiana; Lelley, Tamas; Schlötterer, Christian
2009-01-01
Background Next generation sequencing technologies hold great potential for many biological questions. While mainly used for genomic sequencing, they are also very promising for gene expression profiling. Sequencing of cDNA does not only provide an estimate of the absolute expression level, it can also be used for the identification of allele specific gene expression. Results We developed PanGEA, a tool which enables a fast and user-friendly analysis of allele specific gene expression using the 454 technology. PanGEA allows mapping of 454-ESTs to genes or whole genomes, displaying gene expression profiles, identification of SNPs and the quantification of allele specific gene expression. The intuitive GUI of PanGEA facilitates a flexible and interactive analysis of the data. PanGEA additionally implements a modification of the Smith-Waterman algorithm which deals with incorrect estimates of homopolymer length as occurring in the 454 technology. Conclusion To our knowledge, PanGEA is the first tool which facilitates the identification of allele specific gene expression. PanGEA is distributed under the Mozilla Public License and available at: PMID:19442283
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fields, C.A.
1994-09-01
This Report concludes the DOE Human Genome Program project, ``Identification of Genes in Anonymous DNA Sequence.`` The central goals of this project have been (1) understanding the problem of identifying genes in anonymous sequences, and (2) development of tools, primarily the automated identification system gm, for identifying genes. The activities supported under the previous award are summarized here to provide a single complete report on the activities supported as part of the project from its inception to its completion.
Deficiency of liver Comparative Gene Identification-58 causes steatohepatitis and fibrosis in mice
Guo, Feng; Ma, Yinyan; Kadegowda, Anil K. G.; Betters, Jenna L.; Xie, Ping; Liu, George; Liu, Xiuli; Miao, Hongming; Ou, Juanjuan; Su, Xiong; Zheng, Zhenlin; Xue, Bingzhong; Shi, Hang; Yu, Liqing
2013-01-01
Triglyceride (TG) accumulation in hepatocytes (hepatic steatosis) preludes the development of advanced nonalcoholic fatty liver diseases (NAFLDs) such as steatohepatitis, fibrosis, and cirrhosis. Mutations in human Comparative Gene Identification-58 (CGI-58) cause cytosolic TG-rich lipid droplets to accumulate in almost all cell types including hepatocytes. However, it is unclear if CGI-58 mutation causes hepatic steatosis locally or via altering lipid metabolism in other tissues. To directly address this question, we created liver-specific CGI-58 knockout (LivKO) mice. LivKO mice on standard chow diet displayed microvesicular and macrovesicular panlobular steatosis, and progressed to advanced NAFLD stages over time, including lobular inflammation and centrilobular fibrosis. Compared with CGI-58 floxed control littermates, LivKO mice showed 8-fold and 52-fold increases in hepatic TG content, which was associated with 40% and 58% decreases in hepatic TG hydrolase activity at 16 and 42 weeks, respectively. Hepatic cholesterol also increased significantly in LivKO mice. At 42 weeks, LivKO mice showed increased hepatic oxidative stress, plasma aminotransferases, and hepatic mRNAs for genes involved in fibrosis and inflammation, such as α-smooth muscle actin, collagen type 1 α1, tumor necrosis factor α, and interleukin-1β. In conclusion, CGI-58 deficiency in the liver directly causes not only hepatic steatosis but also steatohepatitis and fibrosis. PMID:23733885
A large-scale benchmark of gene prioritization methods.
Guala, Dimitri; Sonnhammer, Erik L L
2017-04-21
In order to maximize the use of results from high-throughput experimental studies, e.g. GWAS, for identification and diagnostics of new disease-associated genes, it is important to have properly analyzed and benchmarked gene prioritization tools. While prospective benchmarks are underpowered to provide statistically significant results in their attempt to differentiate the performance of gene prioritization tools, a strategy for retrospective benchmarking has been missing, and new tools usually only provide internal validations. The Gene Ontology(GO) contains genes clustered around annotation terms. This intrinsic property of GO can be utilized in construction of robust benchmarks, objective to the problem domain. We demonstrate how this can be achieved for network-based gene prioritization tools, utilizing the FunCoup network. We use cross-validation and a set of appropriate performance measures to compare state-of-the-art gene prioritization algorithms: three based on network diffusion, NetRank and two implementations of Random Walk with Restart, and MaxLink that utilizes network neighborhood. Our benchmark suite provides a systematic and objective way to compare the multitude of available and future gene prioritization tools, enabling researchers to select the best gene prioritization tool for the task at hand, and helping to guide the development of more accurate methods.
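Random Walk with Restart, one of the network-diffusion methods compared here, can be sketched in a few lines. The version below is a generic iteration on an unweighted, undirected toy graph; it is not the benchmarked implementation, and the gene names, restart probability, and function name are hypothetical.

```python
def random_walk_with_restart(adjacency, seeds, restart=0.3,
                             tol=1e-10, max_iter=1000):
    """Score nodes by their steady-state visit probability.

    adjacency: dict mapping each node to a set of neighbours (undirected).
    seeds: set of known disease genes where the walker restarts.
    Assumes every node has at least one neighbour.
    """
    nodes = sorted(adjacency)
    p0 = {n: (1.0 / len(seeds) if n in seeds else 0.0) for n in nodes}
    p = dict(p0)
    for _ in range(max_iter):
        # Restart term: jump back to the seed set with probability `restart`.
        nxt = {n: restart * p0[n] for n in nodes}
        # Walk term: spread the remaining mass evenly over neighbours.
        for n in nodes:
            share = (1.0 - restart) * p[n] / len(adjacency[n])
            for m in adjacency[n]:
                nxt[m] += share
        delta = sum(abs(nxt[n] - p[n]) for n in nodes)
        p = nxt
        if delta < tol:
            break
    return p


# Hypothetical four-gene path network: A - B - C - D, with A as the seed.
network = {"A": {"B"}, "B": {"A", "C"}, "C": {"B", "D"}, "D": {"C"}}
scores = random_walk_with_restart(network, seeds={"A"})
for gene in sorted(scores, key=scores.get, reverse=True):
    print(gene, round(scores[gene], 4))
```

Candidate genes are then ranked by score, so genes closer to the seed set in the network rank higher; a cross-validated benchmark would hide known genes from the seed set and check how highly they are recovered.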
Yu, C; Jin, J; Meng, L-Q; Xia, H-H; Yuan, H-F; Wang, J; Yu, D-S; Zhao, X-Y; Sha, C-Q
2017-05-20
Given the close genetic relationship between Bacillus amyloliquefaciens and B. subtilis, distinguishing the two solely based on their physiological and biochemical characteristics and 16S rRNA sequences is difficult. Because molecular identification has received increasing attention, it was used here to find suitable genes for distinguishing the two bacteria and to identify the biocontrol strain B29. The similarity of four genes, cheA, gyrB, groEL and phoR, between the two species was compared using BLASTN and MEGA, and a phylogenetic tree was constructed. The B29 strain was re-identified using the screened genes. The similarities of the four genes, gyrB, groEL, cheA and phoR, between the two species were 93-95%, 82-84%, 76-78% and 76-77%, respectively. The homologies of the four genes between strain B29 and B. amyloliquefaciens strains were more than 95%. We determined how well the phoR and cheA genes could be used to differentiate B. amyloliquefaciens and B. subtilis. The previously isolated biological control strain B29, initially classified as B. subtilis, was re-classified as B. amyloliquefaciens. Our data indicate that, in addition to the phoR gene, the cheA gene might be a useful phylogenetic marker for differentiating B. subtilis and B. amyloliquefaciens.
Listorti, Valeria; Laconi, Andrea; Catelli, Elena; Cecchinato, Mattia; Lupini, Caterina; Naylor, Clive J
2017-10-09
IBV genotype QX causes sufficient disease in Europe for several commercial companies to have started developing live attenuated vaccines. Here, one of those vaccines (L1148) was fully consensus sequenced alongside its progenitor field strain (1148-A) to determine vaccine markers, thereby enabling detection on farms. Twenty-eight single nucleotide substitutions were associated with the 1148-A attenuation, of which any combination can identify vaccine L1148 in the field. Sixteen substitutions resulted in amino acid coding changes of which half were in spike. One change in the 1b gene altered the normally highly conserved final 5 nucleotides of the transcription regulatory sequence of the S gene, common to all IBV QX genes. No mutations can currently be associated with the attenuation process. Field vaccination strategies would greatly benefit by such comparative sequence data being mandatorily submitted to regulators prior to vaccine release following a successful registration process. Copyright © 2017. Published by Elsevier Ltd.
Rivera-Posada, J A; Pratchett, M; Cano-Gomez, A; Arango-Gomez, J D; Owens, L
2011-09-09
We used a polyphasic approach for precise identification of bacterial flora (Vibrionaceae) isolated from crown-of-thorns starfish (COTS) from Lizard Island (Great Barrier Reef, Australia) and Guam (U.S.A., Western Pacific Ocean). Previous 16S rRNA gene phylogenetic analysis was useful to allocate and identify isolates within the Photobacterium, Splendidus and Harveyi clades but failed in the identification of Vibrio harveyi-like isolates. Species of the V. harveyi group have almost indistinguishable phenotypes and genotypes, and thus, identification by standard biochemical tests and 16S rRNA gene analysis is commonly inaccurate. Biochemical profiling and sequence analysis of additional topA and mreB housekeeping genes were carried out for definitive identification of 19 bacterial isolates recovered from sick and wild COTS. For 8 isolates, biochemical profiles and topA and mreB gene sequence alignments with the closest relatives (GenBank) confirmed previous 16S rRNA-based identification: V. fortis and Photobacterium eurosenbergii species (from wild COTS), and V. natriegens (from diseased COTS). Further phylogenetic analysis based on topA and mreB concatenated sequences served to identify the remaining 11 V. harveyi-like isolates: V. owensii and V. rotiferianus (from wild COTS), and V. owensii, V. rotiferianus, and V. harveyi (from diseased COTS). This study further confirms the reliability of topA-mreB gene sequence analysis for identification of these close species, and it reveals a wider distribution range of the potentially pathogenic V. harveyi group.
Petti, C. A.; Polage, C. R.; Schreckenberger, P.
2005-01-01
Traditional methods for microbial identification require the recognition of differences in morphology, growth, enzymatic activity, and metabolism to define genera and species. Full and partial 16S rRNA gene sequencing methods have emerged as useful tools for identifying phenotypically aberrant microorganisms. We report on three bacterial blood isolates from three different College of American Pathologists-certified laboratories that were referred to ARUP Laboratories for definitive identification. Because phenotypic identification suggested unusual organisms not typically associated with the submitted clinical diagnosis, consultation with the Medical Director was sought and further testing was performed including partial 16S rRNA gene sequencing. All three patients had endocarditis, and conventional methods identified isolates from patients A, B, and C as a Facklamia sp., Eubacterium tenue, and a Bifidobacterium sp. 16S rRNA gene sequencing identified the isolates as Enterococcus faecalis, Cardiobacterium valvarum, and Streptococcus mutans, respectively. We conclude that the initial identifications of these three isolates were erroneous, may have misled clinicians, and potentially impacted patient care. 16S rRNA gene sequencing is a more objective identification tool, unaffected by phenotypic variation or technologist bias, and has the potential to reduce laboratory errors. PMID:16333109
Forest, David; Nishikawa, Ryuhei; Kobayashi, Hiroshi; Parton, Angela; Bayne, Christopher J.; Barnes, David W.
2007-01-01
We have established a cartilaginous fish cell line [Squalus acanthias embryo cell line (SAE)], a mesenchymal stem cell line derived from the embryo of an elasmobranch, the spiny dogfish shark S. acanthias. Elasmobranchs (sharks and rays) first appeared >400 million years ago, and existing species provide useful models for comparative vertebrate cell biology, physiology, and genomics. Comparative vertebrate genomics among evolutionarily distant organisms can provide sequence conservation information that facilitates identification of critical coding and noncoding regions. Although these genomic analyses are informative, experimental verification of functions of genomic sequences depends heavily on cell culture approaches. Using ESTs defining mRNAs derived from the SAE cell line, we identified lengthy and highly conserved gene-specific nucleotide sequences in the noncoding 3′ UTRs of eight genes involved in the regulation of cell growth and proliferation. Conserved noncoding 3′ mRNA regions detected by using the shark nucleotide sequences as a starting point were found in a range of other vertebrate orders, including bony fish, birds, amphibians, and mammals. Nucleotide identity of shark and human in these regions was remarkably well conserved. Our results indicate that highly conserved gene sequences dating from the appearance of jawed vertebrates and representing potential cis-regulatory elements can be identified through the use of cartilaginous fish as a baseline. Because the expression of genes in the SAE cell line was prerequisite for their identification, this cartilaginous fish culture system also provides a physiologically valid tool to test functional hypotheses on the role of these ancient conserved sequences in comparative cell biology. PMID:17227856
Ehrlich, Kenneth C; Mack, Brian M
2014-06-23
Fifty-six secondary metabolite biosynthesis gene clusters are predicted to be in the Aspergillus flavus genome. In spite of this, the biosyntheses of only seven metabolites, including the aflatoxins, kojic acid, cyclopiazonic acid and aflatrem, have been assigned to a particular gene cluster. We used RNA-seq to compare expression of secondary metabolite genes in gene clusters for the closely related fungi A. parasiticus, A. oryzae, and A. flavus S and L sclerotial morphotypes. The data help to refine the identification of probable functional gene clusters within these species. Our results suggest that A. flavus, a prevalent contaminant of maize, cottonseed, peanuts and tree nuts, is capable of producing metabolites which, besides aflatoxin, could be an underappreciated contributor to its toxicity.
Ehrlich, Kenneth C.; Mack, Brian M.
2014-01-01
Fifty-six secondary metabolite biosynthesis gene clusters are predicted to be in the Aspergillus flavus genome. In spite of this, the biosyntheses of only seven metabolites, including the aflatoxins, kojic acid, cyclopiazonic acid and aflatrem, have been assigned to a particular gene cluster. We used RNA-seq to compare expression of secondary metabolite genes in gene clusters for the closely related fungi A. parasiticus, A. oryzae, and A. flavus S and L sclerotial morphotypes. The data help to refine the identification of probable functional gene clusters within these species. Our results suggest that A. flavus, a prevalent contaminant of maize, cottonseed, peanuts and tree nuts, is capable of producing metabolites which, besides aflatoxin, could be an underappreciated contributor to its toxicity. PMID:24960201
James M. Slavicek; Nancy Hayes-Plazolles
1991-01-01
Viral immediate early gene products are usually regulatory proteins that control expression of other viral genes at the transcriptional level or are proteins that are part of the viral DNA replication complex. The identification and functional characterization of the immediate early gene products of Lymantria dispar nuclear polyhedrosis virus (LdNPV...
De Zoysa, Aruni; Efstratiou, Androulla; Mann, Ginder; Harrison, Timothy G; Fry, Norman K
2016-12-01
Toxigenic corynebacteria are uncommon in the UK; however, laboratory confirmation by the national reference laboratory can inform public health action according to national guidelines. Standard phenotypic tests for identification and toxin expression of isolates can take from ≥24 to ≥48 h from receipt. To decrease the time to result, a real-time PCR (qPCR) assay was developed for confirmation of both identification of Corynebacterium diphtheriae and Corynebacterium ulcerans/Corynebacterium pseudotuberculosis and detection of the diphtheria toxin gene. Target genes were the RNA polymerase β-subunit-encoding gene (rpoB) and A-subunit of the diphtheria toxin gene (tox). Green fluorescent protein DNA (gfp) was used as an internal process control. qPCR results were obtained within 3 to 4 h after receipt of isolate. The assay was validated according to published guidelines and demonstrated high diagnostic sensitivity (100 %), high specificity (98-100 %) and positive and negative predictive values of 91 to 100 % and 100 %, respectively, compared to both block-based PCR and the Elek test, together with a greatly reduced time from isolate receipt to reporting. Limitations of the qPCR assay were the inability to distinguish between C. ulcerans and C. pseudotuberculosis and that the presence of the toxin gene as demonstrated by qPCR may not always predict toxin expression. Thus, confirmation of expression of diphtheria toxin is always sought using the phenotypic Elek test. The new qPCR assay was formally introduced as the front-line test for putative toxigenic corynebacteria to inform public health action in England and Wales on 1 April 2014.
Butsch Kovacic, Melinda; Biagini Myers, Jocelyn M.; Wang, Ning; Martin, Lisa J.; Lindsey, Mark; Ericksen, Mark B.; He, Hua; Patterson, Tia L.; Baye, Tesfaye M.; Torgerson, Dara; Roth, Lindsey A.; Gupta, Jayanta; Sivaprasad, Umasundari; Gibson, Aaron M.; Tsoras, Anna M.; Hu, Donglei; Eng, Celeste; Chapela, Rocío; Rodríguez-Santana, José R.; Rodríguez-Cintrón, William; Avila, Pedro C.; Beckman, Kenneth; Seibold, Max A.; Gignoux, Chris; Musaad, Salma M.; Chen, Weiguo; Burchard, Esteban González; Khurana Hershey, Gurjit K.
2011-01-01
Background: Asthma is a chronic inflammatory disease with a strong genetic predisposition. A major challenge for candidate gene association studies in asthma is the selection of biologically relevant genes. Methodology/Principal Findings: Using epithelial RNA expression arrays, HapMap allele frequency variation, and the literature, we identified six possible candidate susceptibility genes for childhood asthma including ADCY2, DNAH5, KIF3A, PDE4B, PLAU, SPRR2B. To evaluate these genes, we compared the genotypes of 194 predominantly tagging SNPs in 790 asthmatic, allergic and non-allergic children. We found that SNPs in all six genes were nominally associated with asthma (p<0.05) in our discovery cohort and in three independent cohorts at either the SNP or gene level (p<0.05). Further, we determined that our selection approach was superior to random selection of genes either differentially expressed in asthmatics compared to controls (p = 0.0049) or selected based on the literature alone (p = 0.0049), substantiating the validity of our gene selection approach. Importantly, we observed that 7 of 9 SNPs in the KIF3A gene more than doubled the odds of asthma (OR = 2.3, p<0.0001) and increased the odds of allergic disease (OR = 1.8, p<0.008). Our data indicate that KIF3A rs7737031 (T-allele) has an asthma population attributable risk of 18.5%. The association between KIF3A rs7737031 and asthma was validated in 3 independent populations, further substantiating the validity of our gene selection approach. Conclusions/Significance: Our study demonstrates that KIF3A, a member of the kinesin superfamily of microtubule associated motors that are important in the transport of protein complexes within cilia, is a novel candidate gene for childhood asthma. Polymorphisms in KIF3A may in part be responsible for poor mucus and/or allergen clearance from the airways.
Furthermore, our study provides a promising framework for the identification and evaluation of novel candidate susceptibility genes. PMID:21912604
Genome-wide analysis of starch metabolism genes in potato (Solanum tuberosum L.).
Van Harsselaar, Jessica K; Lorenz, Julia; Senning, Melanie; Sonnewald, Uwe; Sonnewald, Sophia
2017-01-05
Starch is the principal constituent of potato tubers and is of considerable importance for food and non-food applications. Its metabolism has been the subject of extensive research over the past decades. Despite its importance, a description of the complete inventory of genes involved in starch metabolism and their genome organization in potato plants is still missing. Moreover, mechanisms regulating the expression of starch genes in leaves and tubers remain elusive with regard to differences between transitory and storage starch metabolism, respectively. This study aimed to identify and map the complete set of potato starch genes, and to study their expression pattern in leaves and tubers using different sets of transcriptome data. Moreover, we wanted to uncover transcription factors co-regulated with starch accumulation in tubers in order to get insight into the regulation of starch metabolism. We identified 77 genomic loci encoding enzymes involved in starch metabolism. Novel isoforms of many enzymes were found. Their analysis will help to elucidate mechanisms of starch biosynthesis and degradation. Expression analysis of starch genes led to the identification of tissue-specific isoenzymes suggesting differences in the transcriptional regulation of starch metabolism between potato leaf and tuber tissues. Selection of genes predominantly expressed in developing potato tubers and exhibiting an expression pattern indicative of a role in starch biosynthesis enabled the identification of possible transcriptional regulators of tuber starch biosynthesis by co-expression analysis. This study provides the annotation of the complete set of starch metabolic genes in potato plants and their genomic localizations. Novel, so far undescribed, enzyme isoforms were revealed. Comparative transcriptome analysis enabled the identification of tuber- and leaf-specific isoforms of starch genes.
This finding suggests distinct regulatory mechanisms in transitory and storage starch metabolism. Putative regulatory proteins of starch biosynthesis in potato tubers have been identified by co-expression and their expression was verified by quantitative RT-PCR.
Xu, Yuantao; Wu, Guizhi; Hao, Baohai; Chen, Lingling; Deng, Xiuxin; Xu, Qiang
2015-11-23
With the availability of a rapidly increasing number of genome and transcriptome sequences, lineage-specific genes (LSGs) can be identified and characterized. Like other conserved functional genes, LSGs play important roles in biological evolution and functions. Two sets of citrus LSGs, 296 citrus-specific genes (CSGs) and 1039 orphan genes specific to sweet orange, were identified by comparative analysis between the sweet orange genome sequences and 41 genomes and 273 transcriptomes. With the two sets of genes, gene structure and gene expression pattern were investigated. On average, both the CSGs and orphan genes have fewer exons, shorter gene length and higher GC content when compared with evolutionarily conserved genes (ECs). Expression profiling indicated that most of the LSGs were expressed in various tissues of sweet orange and some of them exhibited distinct temporal and spatial expression patterns. In particular, the orphan genes were preferentially expressed in callus, which is an important pluripotent tissue of citrus. In addition, some of the CSGs and orphan genes were expressed in response to abiotic stress, indicating their potential functions during interaction with the environment. This study identified and characterized two sets of LSGs in citrus, dissected their sequence features and expression patterns, and provided valuable clues for future functional analysis of the LSGs in sweet orange.
A simplified protocol for molecular identification of Eimeria species in field samples.
Haug, Anita; Thebo, Per; Mattsson, Jens G
2007-05-15
This study aimed to find a fast, sensitive and efficient protocol for molecular identification of chicken Eimeria spp. in field samples. Various methods for each of the three steps of the protocol were evaluated: oocyst wall rupturing methods, DNA extraction methods, and identification of species-specific DNA sequences by PCR. We then compared and evaluated five complete protocols. Three series of oocyst suspensions of known number of oocysts from Eimeria mitis, Eimeria praecox, Eimeria maxima and Eimeria tenella were prepared and ground using glass beads or mini-pestle. DNA was extracted from ruptured oocysts using commercial systems (GeneReleaser, Qiagen Stoolkit and Prepman) or phenol-chloroform DNA extraction, followed by identification of species-specific ITS-1 sequences by optimised single species PCR assays. The Stoolkit and Prepman protocols showed insufficient repeatability, and the former was also expensive and relatively time-consuming. In contrast, both the GeneReleaser protocol and phenol-chloroform protocols were robust and sensitive, detecting fewer than 0.4 oocysts of each species per PCR. Finally, we evaluated our new protocol on 68 coccidia positive field samples. Our data suggest that rupturing the oocysts by mini-pestle grinding, preparing the DNA with GeneReleaser, followed by optimised single species PCR assays, makes a robust and sensitive procedure for identifying chicken Eimeria species in field samples. Importantly, it also provides minimal hands-on-time in the pre-PCR process, lower contamination risk and no handling of toxic chemicals.
Martini, Paolo; Risso, Davide; Sales, Gabriele; Romualdi, Chiara; Lanfranchi, Gerolamo; Cagnin, Stefano
2011-04-11
In the last decades, microarray technology has spread, leading to a dramatic increase in publicly available datasets. The first statistical tools developed were focused on the identification of significant differentially expressed genes. Later, researchers moved toward the systematic integration of gene expression profiles with additional biological information, such as chromosomal location, ontological annotations or sequence features. The analysis of gene expression linked to physical location of genes on chromosomes allows the identification of transcriptionally imbalanced regions, while Gene Set Analysis focuses on the detection of coordinated changes in transcriptional levels among sets of biologically related genes. In this field, meta-analysis offers the possibility to compare different studies, addressing the same biological question to fully exploit public gene expression datasets. We describe STEPath, a method that starts from gene expression profiles and integrates the analysis of imbalanced regions as an a priori step before performing gene set analysis. The application of STEPath in individual studies produced gene set scores weighted by chromosomal activation. As a final step, we propose a way to compare these scores across different studies (meta-analysis) on related biological issues. One complication with meta-analysis is batch effects, which occur because molecular measurements are affected by laboratory conditions, reagent lots and personnel differences. Major problems occur when batch effects are correlated with an outcome of interest and lead to incorrect conclusions. We evaluated the power of combining chromosome mapping and gene set enrichment analysis, performing the analysis on a dataset of leukaemia (example of individual study) and on a dataset of skeletal muscle diseases (meta-analysis approach).
In leukaemia, we identified the Hox gene set, a gene set closely related to the pathology that other algorithms of gene set analysis do not identify, while the meta-analysis approach on muscular disease discriminates between related pathologies and correlates similar ones from different studies. STEPath is a new method that integrates gene expression profiles, genomic co-expressed regions and the information about the biological function of genes. The usage of the STEPath-computed gene set scores overcomes batch effects in the meta-analysis approaches allowing the direct comparison of different pathologies and different studies on a gene set activation level.
Makarova, Kira S; Sorokin, Alexander V; Novichkov, Pavel S; Wolf, Yuri I; Koonin, Eugene V
2007-11-27
An evolutionary classification of genes from sequenced genomes that distinguishes between orthologs and paralogs is indispensable for genome annotation and evolutionary reconstruction. Shortly after multiple genome sequences of bacteria, archaea, and unicellular eukaryotes became available, an attempt at such a classification was implemented in Clusters of Orthologous Groups of proteins (COGs). Rapid accumulation of genome sequences creates opportunities for refining COGs but also represents a challenge because of error amplification. One of the practical strategies involves construction of refined COGs for phylogenetically compact subsets of genomes. New Archaeal Clusters of Orthologous Genes (arCOGs) were constructed for 41 archaeal genomes (13 Crenarchaeota, 27 Euryarchaeota and one Nanoarchaeon) using an improved procedure that employs a similarity tree between smaller, group-specific clusters, semi-automatically partitions orthology domains in multidomain proteins, and uses profile searches for identification of remote orthologs. The annotation of arCOGs is a consensus between three assignments based on the COGs, the CDD database, and the annotations of homologs in the NR database. The 7538 arCOGs, on average, cover approximately 88% of the genes in a genome compared to approximately 76% coverage in COGs. The finer granularity of ortholog identification in the arCOGs is apparent from the fact that 4538 arCOGs correspond to 2362 COGs; approximately 40% of the arCOGs are new. The archaeal gene core (protein-coding genes found in all 41 genomes) consists of 166 arCOGs. The arCOGs were used to reconstruct gene loss and gene gain events during archaeal evolution and gene sets of ancestral forms. The Last Archaeal Common Ancestor (LACA) is conservatively estimated to possess 996 genes compared to 1245 and 1335 genes for the last common ancestors of Crenarchaeota and Euryarchaeota, respectively.
It is inferred that LACA was a chemoautotrophic hyperthermophile that, in addition to the core archaeal functions, encoded more idiosyncratic systems, e.g., the CASS systems of antivirus defense and some toxin-antitoxin systems. The arCOGs provide a convenient, flexible framework for functional annotation of archaeal genomes, comparative genomics and evolutionary reconstructions. Genomic reconstructions suggest that the last common ancestor of archaea might have been (nearly) as advanced as the modern archaeal hyperthermophiles. ArCOGs and related information are available at: ftp://ftp.ncbi.nih.gov/pub/koonin/arCOGs/.
USDA-ARS's Scientific Manuscript database
Identification of genes with differential transcript abundance (GDTA) in seedless mutants may enhance understanding of seedless citrus development. Transcriptome analysis was conducted at three time points during early fruit development (Phase 1) of three seedy citrus genotypes: Fallglo [Bower citru...
An integrated workflow for analysis of ChIP-chip data.
Weigelt, Karin; Moehle, Christoph; Stempfl, Thomas; Weber, Bernhard; Langmann, Thomas
2008-08-01
Although ChIP-chip is a powerful tool for genome-wide discovery of transcription factor target genes, the steps involving raw data analysis, identification of promoters, and correlation with binding sites are still laborious processes. Therefore, we report an integrated workflow for the analysis of promoter tiling arrays with the Genomatix ChipInspector system. We compare this tool with open-source software packages to identify PU.1 regulated genes in mouse macrophages. Our results suggest that ChipInspector data analysis, comparative genomics for binding site prediction, and pathway/network modeling significantly facilitate and enhance whole-genome promoter profiling to reveal in vivo sites of transcription factor-DNA interactions.
Ohshiro, Takeya; Miyagi, Chihiro; Tamaki, Yoshikazu; Mizuno, Takuya; Ezaki, Takayuki
2016-06-01
Blood culturing and the rapid reporting of results are essential for infectious disease clinics to obtain bacterial information that can affect patient prognosis. When gram-positive coccoid cells are observed in blood culture bottles, it is important to determine whether the strain is Staphylococcus aureus and whether the strain has resistance genes, such as mecA and blaZ, for proper antibiotic selection. Previous work led to the development of a PCR method that is useful for rapid identification of bacterial species and antimicrobial susceptibility. However, that method has not yet been adopted in community hospitals due to the high cost and methodological complexity. We report here the development of a quick PCR and DNA-chromatography test, based on single-tag hybridization chromatography, that permits detection of S. aureus and the mecA and blaZ genes; results can be obtained within 1 h for positive blood culture bottles. We evaluated this method using 42 clinical isolates. Detection of S. aureus and the resistance genes by the PCR-DNA-chromatography method was compared with that obtained via the conventional identification method and actual antimicrobial susceptibility testing. Our method had a sensitivity of 97.0% and a specificity of 100% for the identification of the bacterial species. For the detection of the mecA gene of S. aureus, the sensitivity was 100% and the specificity was 95.2%. For the detection of the blaZ gene of S. aureus, the sensitivity was 100% and the specificity was 88.9%. The speed and simplicity of this PCR-DNA-chromatography method suggest that our method will facilitate rapid diagnoses. Copyright © 2016 Japanese Society of Chemotherapy and The Japanese Association for Infectious Diseases. Published by Elsevier Ltd. All rights reserved.
funRNA: a fungi-centered genomics platform for genes encoding key components of RNAi.
Choi, Jaeyoung; Kim, Ki-Tae; Jeon, Jongbum; Wu, Jiayao; Song, Hyeunjeong; Asiegbu, Fred O; Lee, Yong-Hwan
2014-01-01
RNA interference (RNAi) is involved in genome defense as well as diverse cellular, developmental, and physiological processes. Key components of RNAi are Argonaute, Dicer, and RNA-dependent RNA polymerase (RdRP), which have been functionally characterized mainly in model organisms. The key components are believed to exist throughout eukaryotes; however, there is no systematic platform for archiving and dissecting these important gene families. In addition, few fungi have been studied to date, limiting our understanding of RNAi in fungi. Here we present funRNA (http://funrna.riceblast.snu.ac.kr/), a fungal kingdom-wide comparative genomics platform for putative genes encoding Argonaute, Dicer, and RdRP. To identify and archive genes encoding the abovementioned key components, protein domain profiles were determined from reference sequences obtained from UniProtKB/SwissProt. The domain profiles were searched using fungal, metazoan, and plant genomes, as well as bacterial and archaeal genomes. 1,163, 442, and 678 genes encoding Argonaute, Dicer, and RdRP, respectively, were predicted. Based on the identification results, active site variation of Argonaute, diversification of Dicer, and sequence analysis of RdRP were discussed in a fungus-oriented manner. funRNA provides results from diverse bioinformatics programs and job submission forms for BLAST, BLASTMatrix, and ClustalW. Furthermore, sequence collections created in funRNA are synced with several gene family analysis portals and databases, offering further analysis opportunities. funRNA provides identification results from a broad taxonomic range and diverse analysis functions, and could be used in diverse comparative and evolutionary studies. It could serve as a versatile genomics workbench for key components of RNAi.
McNeill, Brian; Perez-Iratxeta, Carol; Mazerolle, Chantal; Furimsky, Marosh; Mishina, Yuji; Andrade-Navarro, Miguel A; Wallace, Valerie A
2012-03-01
The hedgehog (Hh) signaling pathway is involved in numerous developmental and adult processes with many links to cancer. In vertebrates, the activity of the Hh pathway is mediated primarily through three Gli transcription factors (Gli1, 2 and 3) that can serve as transcriptional activators or repressors. The identification of Gli target genes is essential for the understanding of the Hh-mediated processes. We used a comparative genomics approach using the mouse and human genomes to identify 390 genes that contained conserved Gli binding sites. RT-qPCR validation of 46 target genes in E14.5 and P0.5 retinal explants revealed that Hh pathway activation resulted in the modulation of 30 of these targets, 25 of which demonstrated a temporal regulation. Further validation revealed that the expression of Bok, FoxA1, Sox8 and Wnt7a was dependent upon Sonic Hh (Shh) signaling in the retina and their regulation is under positive and negative controls by Gli2 and Gli3, respectively. We also show using chromatin immunoprecipitation that Gli2 binds to the Sox8 promoter, suggesting that Sox8 is an Hh-dependent direct target of Gli2. Finally, we demonstrate that the Hh pathway also modulates the expression of Sox9 and Sox10, which together with Sox8 make up the SoxE group. Previously, it has been shown that Hh and SoxE group genes promote Müller glial cell development in the retina. Our data are consistent with the possibility for a role of SoxE group genes downstream of Hh signaling on Müller cell development. Crown Copyright © 2012. Published by Elsevier Inc. All rights reserved.
Alam, Mohammad J.; Tisdel, Naradah L.; Shah, Dhara N.; Yapar, Mehmet; Lasco, Todd M.; Garey, Kevin W.
2015-01-01
Background: The aim of this study was to develop and validate a multiplex real-time PCR assay for simultaneous identification and toxigenic type characterization of Clostridium difficile. Methods: The multiplex real-time PCR assay targeted and simultaneously detected triose phosphate isomerase (tpi) and binary toxin (cdtA) genes, and toxin A (tcdA) and B (tcdB) genes in the first and second tubes, respectively. The results of multiplex real-time PCR were compared to those of the BD GeneOhm Cdiff assay, targeting the tcdB gene alone. The toxigenic culture was used as the reference, where toxin genes were detected by multiplex real-time PCR. Results: A total of 351 stool samples from consecutive patients were included in the study. Fifty-five stool samples (15.6%) were determined to be positive for the presence of C. difficile by using multiplex real-time PCR. Of these, 48 (87.2%) were toxigenic (46 tcdA and tcdB-positive, two positive for only tcdB) and 11 (22.9%) were cdtA-positive. The sensitivity, specificity, negative predictive value (NPV), and positive predictive value (PPV) of the multiplex real-time PCR compared with the toxigenic culture were 95.6%, 98.6%, 91.6%, and 99.3%, respectively. The analytical sensitivity of the multiplex real-time PCR assay was determined to be 10^3 colony-forming units (CFU)/g spiked stool sample and 0.0625 pg genomic DNA from culture. Analytical specificity determined by using 15 enteric and non-clostridial reference strains was 100%. Conclusions: The multiplex real-time PCR assay accurately detected C. difficile isolates from diarrheal stool samples and characterized its toxin genes in a single PCR run. PMID:25932438
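The four diagnostic measures reported above (sensitivity, specificity, PPV, NPV) all derive from a 2x2 table of the index test against the reference method. The following is a minimal sketch of that arithmetic, not the authors' analysis; the class name, function name, and counts are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class ConfusionCounts:
    """2x2 table of an index test scored against a reference method."""
    tp: int  # index test positive, reference positive
    fp: int  # index test positive, reference negative
    fn: int  # index test negative, reference positive
    tn: int  # index test negative, reference negative


def diagnostic_metrics(c: ConfusionCounts) -> dict[str, float]:
    """Return sensitivity, specificity, PPV and NPV as fractions (0 to 1)."""
    return {
        "sensitivity": c.tp / (c.tp + c.fn),
        "specificity": c.tn / (c.tn + c.fp),
        "ppv": c.tp / (c.tp + c.fp),
        "npv": c.tn / (c.tn + c.fn),
    }


# Hypothetical counts for illustration only; these are NOT the study's data.
example = ConfusionCounts(tp=45, fp=5, fn=5, tn=245)
metrics = diagnostic_metrics(example)
for name, value in metrics.items():
    print(f"{name}: {value:.1%}")
```

The sketch does not guard against empty rows or columns (division by zero), which a real evaluation would need to handle.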
Akram, Pakeeza; Liao, Li
2017-12-06
Identification of common genes associated with comorbid diseases can be critical in understanding their pathobiological mechanism. This work presents a novel method to predict missing common genes associated with a disease pair. Searching for missing common genes is formulated as an optimization problem to minimize network-based module separation from two subgraphs produced by mapping genes associated with each disease onto the interactome. Using cross validation on more than 600 disease pairs, our method achieves a significantly higher average receiver operating characteristic (ROC) score of 0.95 compared to a baseline ROC score of 0.60 using randomized data. Missing common gene prediction aims to complete the gene set associated with comorbid diseases for better understanding of biological intervention. It will also be useful for gene targeted therapeutics related to comorbid diseases. This method can be further considered for prediction of missing edges to complete the subgraph associated with a disease pair.
Esmaeili Rastaghi, Ahmad Reza; Spotin, Adel; Khataminezhad, Mohammad Reza; Jafarpour, Mostafa; Alaeenovin, Elnaz; Najafzadeh, Narmin; Samei, Neda; Taleshi, Neda; Mohammadi, Somayeh; Parvizi, Parviz
2017-10-01
Leishmaniasis, an emerging and reemerging disease, is increasing worldwide, with high prevalence and new incidence in recent years. For epidemiological investigation and accurate identification of Leishmania species, three nuclear and mitochondrial genes (ITS-rDNA, Hsp70, and Cyt b) were employed and analyzed from clinical samples in three important Zoonotic Cutaneous Leishmaniasis (ZCL) foci of Iran. In this cross-sectional/descriptive study conducted in 2014-15, serous smears of lesions were directly prepared from suspected patients of ZCL in Turkmen in the northeast, Abarkouh in the center and Shush district in the southwest of Iran, and DNA was extracted. Two nuclear genes (ITS-rDNA and Hsp70) and one mitochondrial gene (Cyt b) of Leishmania parasites were amplified. RFLP was performed on PCR-positive samples. PCR products were sequenced, aligned and edited with Sequencher 4.1.4, and phylogenetic analyses were performed using MEGA 5.05 software. Overall, 203 out of 360 clinical samples from suspected patients were Leishmania positive using routine laboratory methods and 231 samples were positive by molecular techniques. L. major, L. tropica, and L. turanica were firmly identified by employing different molecular genes and phylogenetic analyses. By combining different molecular genes, Leishmania parasites were identified accurately. The sensitivity and specificity of the three genes were evaluated and showed advantages over routine laboratory methods. The ITS-rDNA gene is more appropriate for firm identification of Leishmania species.
Lawton, Samantha J; Weis, Allison M; Byrne, Barbara A; Fritz, Heather; Taff, Conor C; Townsend, Andrea K; Weimer, Bart C; Mete, Aslı; Wheeler, Sarah; Boyce, Walter M
2018-05-01
Matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) was compared to conventional biochemical testing methods and nucleic acid analyses (16S rDNA sequencing, hippurate hydrolysis gene testing, whole genome sequencing [WGS]) for species identification of Campylobacter isolates obtained from chickens (Gallus gallus domesticus, n = 8), American crows (Corvus brachyrhynchos, n = 17), a mallard duck (Anas platyrhynchos, n = 1), and a western scrub-jay (Aphelocoma californica, n = 1). The test results for all 27 isolates were in 100% agreement between MALDI-TOF MS, the combined results of 16S rDNA sequencing, and the hippurate hydrolysis gene PCR (p = 0.0027, kappa = 1). Likewise, the identifications derived from WGS from a subset of 14 isolates were in 100% agreement with the MALDI-TOF MS identification. In contrast, biochemical testing misclassified 5 isolates of C. jejuni as C. coli, and 16S rDNA sequencing alone was not able to differentiate between C. coli and C. jejuni for 11 sequences (p = 0.1573, kappa = 0.0857) when compared to MALDI-TOF MS and WGS. No agreement was observed between MALDI-TOF MS dendrograms and the phylogenetic relationships revealed by rDNA sequencing or WGS. Our results confirm that MALDI-TOF MS is a fast and reliable method for identifying Campylobacter isolates to the species level from wild birds and chickens, but not for elucidating phylogenetic relationships among Campylobacter isolates.
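The kappa values quoted above measure agreement between two identification methods beyond what chance would produce. A minimal sketch of Cohen's kappa for two label lists follows; it assumes the unweighted two-rater form of the statistic (the abstract does not state which variant was used), and the function name and species calls are hypothetical.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Unweighted Cohen's kappa for two equally long lists of category labels."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label lists must be non-empty and of equal length")
    n = len(labels_a)
    # Observed agreement: fraction of isolates given the same label by both methods.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: product of marginal label frequencies, summed over labels.
    count_a = Counter(labels_a)
    count_b = Counter(labels_b)
    expected = sum(count_a[k] * count_b[k] for k in count_a) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when both methods use a single label")
    return (observed - expected) / (1 - expected)


# Hypothetical species calls for five isolates; NOT data from the study.
method_a = ["jejuni", "jejuni", "jejuni", "coli", "coli"]
method_b = ["jejuni", "jejuni", "jejuni", "coli", "coli"]
method_c = ["jejuni", "jejuni", "coli", "coli", "coli"]

kappa_perfect = cohen_kappa(method_a, method_b)  # complete agreement
kappa_partial = cohen_kappa(method_a, method_c)  # one discordant isolate
```

With complete agreement the statistic equals 1, matching the "kappa = 1" reported for the concordant methods; a single discordant call among five isolates already lowers it noticeably.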
Ai, Ye; Zhang, Chunling; Sun, Yalin; Wang, Weining; He, Yanhong; Bao, Manzhu
2017-01-01
According to the floral organ development ABC model, B class genes specify petal and stamen identification. In order to study the function of B class genes in flower development of Tagetes erecta, five MADS-box B class genes were identified and their expression and putative functions were studied. Sequence comparisons and phylogenetic analyses indicated that there were one PI-like gene-TePI, two euAP3-like genes-TeAP3-1 and TeAP3-2, and two TM6-like genes-TeTM6-1 and TeTM6-2 in T. erecta. Strong expression levels of these genes were detected in stamens of the disk florets, but little or no expression was detected in bracts, receptacles or vegetative organs. Yeast hybrid experiments of the B class proteins showed that TePI protein could form a homodimer and heterodimers with all the other four B class proteins TeAP3-1, TeAP3-2, TeTM6-1 and TeTM6-2. No homodimer or interaction was observed between the euAP3 and TM6 clade members. Over-expression of five B class genes of T. erecta in Nicotiana rotundifolia showed that only the transgenic plants of 35S::TePI showed altered floral morphology compared with the non-transgenic line. This study could contribute to the understanding of the function of B class genes in flower development of T. erecta, and provide a theoretical basis for further research to change floral organ structures and create new materials for plant breeding.
Kampmann, Martin; Bassik, Michael C.; Weissman, Jonathan S.
2013-01-01
A major challenge of the postgenomic era is to understand how human genes function together in normal and disease states. In microorganisms, high-density genetic interaction (GI) maps are a powerful tool to elucidate gene functions and pathways. We have developed an integrated methodology based on pooled shRNA screening in mammalian cells for genome-wide identification of genes with relevant phenotypes and systematic mapping of all GIs among them. We recently demonstrated the potential of this approach in an application to pathways controlling the susceptibility of human cells to the toxin ricin. Here we present the complete quantitative framework underlying our strategy, including experimental design, derivation of quantitative phenotypes from pooled screens, robust identification of hit genes using ultra-complex shRNA libraries, parallel measurement of tens of thousands of GIs from a single double-shRNA experiment, and construction of GI maps. We describe the general applicability of our strategy. Our pooled approach enables rapid screening of the same shRNA library in different cell lines and under different conditions to determine a range of different phenotypes. We illustrate this strategy here for single- and double-shRNA libraries. We compare the roles of genes for susceptibility to ricin and Shiga toxin in different human cell lines and reveal both toxin-specific and cell line-specific pathways. We also present GI maps based on growth and ricin-resistance phenotypes, and we demonstrate how such a comparative GI mapping strategy enables functional dissection of physical complexes and context-dependent pathways. PMID:23739767
DOE Office of Scientific and Technical Information (OSTI.GOV)
Berman, Benjamin P.; Pfeiffer, Barret D.; Laverty, Todd R.
2004-08-06
The identification of sequences that control transcription in metazoans is a major goal of genome analysis. In a previous study, we demonstrated that searching for clusters of predicted transcription factor binding sites could discover active regulatory sequences, and identified 37 regions of the Drosophila melanogaster genome with high densities of predicted binding sites for five transcription factors involved in anterior-posterior embryonic patterning. Nine of these clusters overlapped known enhancers. Here, we report the results of in vivo functional analysis of 27 remaining clusters. We generated transgenic flies carrying each cluster attached to a basal promoter and reporter gene, and assayed embryos for reporter gene expression. Six clusters are enhancers of adjacent genes: giant, fushi tarazu, odd-skipped, nubbin, squeeze and pdm2; three drive expression in patterns unrelated to those of neighboring genes; the remaining 18 do not appear to have enhancer activity. We used the Drosophila pseudoobscura genome to compare patterns of evolution in and around the 15 positive and 18 false-positive predictions. Although conservation of primary sequence cannot distinguish true from false positives, conservation of binding-site clustering accurately discriminates functional binding-site clusters from those with no function. We incorporated conservation of binding-site clustering into a new genome-wide enhancer screen, and predict several hundred new regulatory sequences, including 85 adjacent to genes with embryonic patterns. Measuring conservation of sequence features closely linked to function, such as binding-site clustering, makes better use of comparative sequence data than commonly used methods that examine only sequence identity.
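The idea of searching for regions with a high density of predicted binding sites can be illustrated with a simple sliding-window count. The sketch below is only an illustration under stated assumptions: the function name `dense_windows`, the site coordinates, the window length and the minimum site count are all hypothetical and are not the parameters or the software used in the study.

```python
def dense_windows(positions, window, min_sites):
    """Return (start, end, n_sites) for each window that begins at a
    predicted site and holds at least `min_sites` sites within `window` bp.
    All thresholds are illustrative, not taken from the study."""
    sites = sorted(positions)
    hits = []
    right = 0
    for left, start in enumerate(sites):
        # Move the right pointer past every site inside [start, start + window).
        while right < len(sites) and sites[right] < start + window:
            right += 1
        count = right - left
        if count >= min_sites:
            hits.append((start, sites[right - 1], count))
    return hits


# Hypothetical coordinates of predicted binding sites on one chromosome arm.
toy_sites = [100, 150, 220, 300, 5000, 5100, 9000]
clusters = dense_windows(toy_sites, window=700, min_sites=4)
print(clusters)
```

With these toy coordinates only the first four sites fall close enough together to pass the threshold; a real screen would also have to merge overlapping windows and, as the abstract argues, check whether the clustering is conserved in a second genome.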
Oduru, Sreedhar; Campbell, Janee L; Karri, SriTulasi; Hendry, William J; Khan, Shafiq A; Williams, Simon C
2003-01-01
Background Complete genome annotation will likely be achieved through a combination of computer-based analysis of available genome sequences combined with direct experimental characterization of expressed regions of individual genomes. We have utilized a comparative genomics approach involving the sequencing of randomly selected hamster testis cDNAs to begin to identify genes not previously annotated on the human, mouse, rat and Fugu (pufferfish) genomes. Results 735 distinct sequences were analyzed for their relatedness to known sequences in public databases. Eight of these sequences were derived from previously unidentified genes and expression of these genes in testis was confirmed by Northern blotting. The genomic locations of each sequence were mapped in human, mouse, rat and pufferfish, where applicable, and the structure of their cognate genes was derived using computer-based predictions, genomic comparisons and analysis of uncharacterized cDNA sequences from human and macaque. Conclusion The use of a comparative genomics approach resulted in the identification of eight cDNAs that correspond to previously uncharacterized genes in the human genome. The proteins encoded by these genes included a new member of the kinesin superfamily, a SET/MYND-domain protein, and six proteins for which no specific function could be predicted. Each gene was expressed primarily in testis, suggesting that they may play roles in the development and/or function of testicular cells. PMID:12783626
Salmond, G P; Lutkenhaus, J F; Donachie, W D
1980-01-01
We report the identification, cloning, and mapping of a new cell envelope gene, murG. This lies in a group of five genes of similar phenotype (in the order murE murF murG murC ddl) all concerned with peptidoglycan biosynthesis. This group is in a larger cluster of at least 10 genes, all of which are involved in some way with cell envelope growth. PMID:6998962
USDA-ARS's Scientific Manuscript database
Polymerase chain reaction amplification of conserved genes and sequence analysis provides a very powerful tool for the identification of toxigenic as well as non-toxigenic Penicillium species. Sequences are obtained by amplification of the gene fragment, sequencing via capillary electrophoresis of d...
NASA Astrophysics Data System (ADS)
Novianti, T.; Sadikin, M.; Widia, S.; Juniantito, V.; Arida, E. A.
2018-03-01
Characterizing previously unidentified genes is essential for analyzing their role in biological processes. Identifying the as-yet uncharacterized DNA sequence of the HIF 1α gene is important for analyzing its contribution to the tissue regeneration process in the lizard tail (Hemidactylus platyurus). Bioinformatics and PCR techniques offer a relatively easy way to identify an unidentified gene. The most widely used method is BLAST (Basic Local Alignment Search Tool), which aligns a query against sequences from other organisms. BLAST is available online at https://blast.ncbi.nlm.nih.gov/Blast.cgi and can retrieve similar sequences from the closest to the most distant relatives. Gecko japonicus is a species closely related to H. platyurus. The HIF 1α gene sequence of G. japonicus was compared with those of other species by multiple alignment in Mega7 software. Conserved base regions were identified using the Clustal IX method. Primers for the HIF 1α gene were designed with Primer3 software. The HIF 1α gene of the lizard (H. platyurus) was successfully amplified on a real-time PCR machine using the primers designed from Gecko japonicus. The previously unidentified HIF 1α gene of the lizard was thus successfully identified using the multiple alignment method. Expression was analyzed during tail regrowth on days 1, 3, 5, 7, 10, 13 and 17 after autotomy. Amplification of the HIF 1α gene was described by the CT value from the real-time PCR machine. HIF 1α gene expression was quantified with the Livak formula. The chi-square test gave a value of 0.000, which means that HIF 1α expression differed among the growth days.
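The Livak formula mentioned above is commonly written as fold change = 2^(-ΔΔCT), where ΔCT is the target-gene CT minus a reference-gene CT and ΔΔCT is the difference between a sample and a calibrator. A minimal sketch follows; the abstract does not name the reference gene or the calibrator time point, so the CT values and argument names here are purely hypothetical, and the method itself assumes roughly 100% amplification efficiency for both genes.

```python
def livak_fold_change(ct_target_sample, ct_ref_sample,
                      ct_target_calibrator, ct_ref_calibrator):
    """Relative expression by the 2^(-ddCT) (Livak) method.

    Assumes near-100% amplification efficiency for target and reference.
    """
    # Normalise the target gene to the reference gene in each condition.
    delta_sample = ct_target_sample - ct_ref_sample
    delta_calibrator = ct_target_calibrator - ct_ref_calibrator
    # Compare the sample with the calibrator condition.
    delta_delta = delta_sample - delta_calibrator
    return 2.0 ** (-delta_delta)


# Hypothetical CT values: the target crosses threshold two cycles earlier
# (relative to the reference) in the sample than in the calibrator.
fold = livak_fold_change(24.0, 18.0, 26.0, 18.0)
print(fold)  # 4.0
```

A fold change above 1 indicates higher expression in the sample than in the calibrator; below 1, lower expression.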
Li, Yongsheng; Sahni, Nidhi; Yi, Song
2016-11-29
Comprehensive understanding of human cancer mechanisms requires the identification of a thorough list of cancer-associated genes, which could serve as biomarkers for diagnoses and therapies in various types of cancer. Although substantial progress has been made in functional studies to uncover genes involved in cancer, these efforts are often time-consuming and costly. Therefore, it remains challenging to comprehensively identify cancer candidate genes. Network-based methods have accelerated this process through the analysis of complex molecular interactions in the cell. However, the extent to which various interactome networks can contribute to prediction of candidate genes responsible for cancer is still enigmatic. In this study, we evaluated different human protein-protein interactome networks and compared their application to cancer gene prioritization. Our results indicate that network analyses can increase the power to identify novel cancer genes. In particular, such predictive power can be enhanced with the use of unbiased systematic protein interaction maps for cancer gene prioritization. Functional analysis reveals that the top ranked genes from network predictions co-occur often with cancer-related terms in literature, and further, these candidate genes are indeed frequently mutated across cancers. Finally, our study suggests that integrating interactome networks with other omics datasets could provide novel insights into cancer-associated genes and underlying molecular mechanisms.
Werblow, A; Flechl, E; Klimpel, S; Zittra, C; Lebl, K; Kieser, K; Laciny, A; Silbermayr, K; Melaun, C; Fuehrer, H-P
2016-03-01
Millions of people die each year as a result of pathogens transmitted by mosquitoes. However, the morphological identification of mosquito species can be difficult even for experts. The identification of morphologically indistinguishable species, such as members of the Anopheles maculipennis complex (Diptera: Culicidae), and possible hybrids, such as Culex pipiens pipiens/Culex pipiens molestus (Diptera: Culicidae), presents a major problem. In addition, the detection and discrimination of newly introduced species can be challenging, particularly to researchers without previous experience. Because of their medical importance, the clear identification of all relevant mosquito species is essential. Using the direct polymerase chain reaction (PCR) method described here, DNA amplification without prior DNA extraction is possible and thus species identification after sequencing can be achieved. Different amounts of tissue (leg, head; larvae or adult) as well as different storage conditions (dry, ethanol, -20 and -80 °C) and storage times were successfully applied and showed positive results after amplification and gel electrophoresis. Overall, 28 different indigenous and non-indigenous mosquito species were analysed using a gene fragment of the COX1 gene for species differentiation and identification by sequencing this 658-bp fragment. Compared with standard PCR, this method is time- and cost-effective and could thus improve existing surveillance and control programmes. © 2015 The Authors. Medical and Veterinary Entomology published by John Wiley & Sons Ltd on behalf of Royal Entomological Society.
Comparative genomics of Toll-like receptor signalling in five species
Jann, Oliver C; King, Annemarie; Corrales, Nestor Lopez; Anderson, Susan I; Jensen, Kirsty; Ait-ali, Tahar; Tang, Haizhou; Wu, Chunhua; Cockett, Noelle E; Archibald, Alan L; Glass, Elizabeth J
2009-01-01
Background Over the last decade, several studies have identified quantitative trait loci (QTL) affecting variation of immune related traits in mammals. Recent studies in humans and mice suggest that part of this variation may be caused by polymorphisms in genes involved in Toll-like receptor (TLR) signalling. In this project, we used a comparative approach to investigate the importance of TLR-related genes in comparison with other immunologically relevant genes for resistance traits in five species by associating their genomic location with previously published immune-related QTL regions. Results We report the genomic localisation of TLR1-10 and ten associated signalling molecules in sheep and pig using in-silico and/or radiation hybrid (RH) mapping techniques and compare their positions with their annotated homologues in the human, cattle and mouse whole genome sequences. We also report medium-density RH maps for porcine chromosomes 8 and 13. A comparative analysis of the positions of previously published relevant QTLs allowed the identification of homologous regions that are associated with similar health traits in several species and which contain TLR related and other immunologically relevant genes. Additional evidence was gathered by examining relevant gene expression and association studies. Conclusion This comparative genomic approach identified eight genes as potentially causative genes for variations of health related traits. These include susceptibility to clinical mastitis in dairy cattle, general disease resistance in sheep, cattle, humans and mice, and tolerance to protozoan infection in cattle and mice. Four TLR-related genes (TLR1, 6, MyD88, IRF3) appear to be the most likely candidate genes underlying QTL regions which control the resistance to the same or similar pathogens in several species. Further studies are required to investigate the potential role of polymorphisms within these genes. PMID:19432955
Han, Junwei; Li, Chunquan; Yang, Haixiu; Xu, Yanjun; Zhang, Chunlong; Ma, Jiquan; Shi, Xinrui; Liu, Wei; Shang, Desi; Yao, Qianlan; Zhang, Yunpeng; Su, Fei; Feng, Li; Li, Xia
2015-01-01
Identifying dysregulated pathways from high-throughput experimental data in order to infer underlying biological insights is an important task. Current pathway-identification methods focus on single pathways in isolation; however, consideration of crosstalk between pathways could improve our understanding of alterations in biological states. We propose a novel method of pathway analysis based on global influence (PAGI) to identify dysregulated pathways, by considering both within-pathway effects and crosstalk between pathways. We constructed a global gene–gene network based on the relationships among genes extracted from a pathway database. We then evaluated the extent of differential expression for each gene, and mapped them to the global network. The random walk with restart algorithm was used to calculate the extent of genes affected by global influence. Finally, we used cumulative distribution functions to determine the significance values of the dysregulated pathways. We applied the PAGI method to five cancer microarray datasets, and compared our results with gene set enrichment analysis and five other methods. Based on these analyses, we demonstrated that PAGI can effectively identify dysregulated pathways associated with cancer, with strong reproducibility and robustness. We implemented PAGI using the freely available R-based and Web-based tools (http://bioinfo.hrbmu.edu.cn/PAGI). PMID:25551156
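The random walk with restart step can be sketched generically as the iteration p(t+1) = (1 - r) W p(t) + r p(0), where W spreads each node's probability evenly over its neighbours and p(0) is the normalised seed vector. The code below is a plain illustration of that iteration, not the PAGI implementation: the toy network, the seed scores (standing in for differential-expression scores) and the restart probability of 0.7 are all assumptions.

```python
def random_walk_with_restart(adjacency, seeds, restart=0.7,
                             tol=1e-10, max_iter=1000):
    """Steady-state visiting probabilities on an undirected graph.

    adjacency: dict mapping node -> list of neighbours (symmetric).
    seeds: dict mapping node -> non-negative starting weight.
    Nodes without neighbours would leak probability in this simple sketch.
    """
    nodes = sorted(adjacency)
    total = sum(seeds.get(n, 0.0) for n in nodes)
    p0 = {n: seeds.get(n, 0.0) / total for n in nodes}
    p = dict(p0)
    for _ in range(max_iter):
        # Restart term: jump back to the seed distribution.
        nxt = {n: restart * p0[n] for n in nodes}
        # Walk term: share the remaining probability among neighbours.
        for n in nodes:
            neighbours = adjacency[n]
            if not neighbours:
                continue
            share = (1.0 - restart) * p[n] / len(neighbours)
            for m in neighbours:
                nxt[m] += share
        change = sum(abs(nxt[n] - p[n]) for n in nodes)
        p = nxt
        if change < tol:
            break
    return p


# Hypothetical four-gene chain: gene_a - gene_b - gene_c - gene_d.
toy_network = {
    "gene_a": ["gene_b"],
    "gene_b": ["gene_a", "gene_c"],
    "gene_c": ["gene_b", "gene_d"],
    "gene_d": ["gene_c"],
}
scores = random_walk_with_restart(toy_network, {"gene_a": 1.0})
for gene in sorted(scores, key=scores.get, reverse=True):
    print(gene, round(scores[gene], 4))
```

In this toy chain the score falls off with distance from the seed gene, which is the sense in which the walk propagates a gene's influence through the network.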
Yu, Ron X.; Liu, Jie; True, Nick; Wang, Wei
2008-01-01
A major challenge in the post-genome era is to reconstruct regulatory networks from the biological knowledge accumulated up to date. The development of tools for identifying direct target genes of transcription factors (TFs) is critical to this endeavor. Given a set of microarray experiments, a probabilistic model called TRANSMODIS has been developed which can infer the direct targets of a TF by integrating sequence motif, gene expression and ChIP-chip data. The performance of TRANSMODIS was first validated on a set of transcription factor perturbation experiments (TFPEs) involving Pho4p, a well studied TF in Saccharomyces cerevisiae. TRANSMODIS removed elements of arbitrariness in manual target gene selection process and produced results that concur with one's intuition. TRANSMODIS was further validated on a genome-wide scale by comparing it with two other methods in Saccharomyces cerevisiae. The usefulness of TRANSMODIS was then demonstrated by applying it to the identification of direct targets of DAF-16, a critical TF regulating ageing in Caenorhabditis elegans. We found that 189 genes were tightly regulated by DAF-16. In addition, DAF-16 has differential preference for motifs when acting as an activator or repressor, which awaits experimental verification. TRANSMODIS is computationally efficient and robust, making it a useful probabilistic framework for finding immediate targets. PMID:18350157
Fast gene ontology based clustering for microarray experiments.
Ovaska, Kristian; Laakso, Marko; Hautaniemi, Sampsa
2008-11-21
Analysis of a microarray experiment often results in a list of hundreds of disease-associated genes. In order to suggest common biological processes and functions for these genes, Gene Ontology annotations with statistical testing are widely used. However, these analyses can produce a very large number of significantly altered biological processes. Thus, it is often challenging to interpret GO results and identify novel testable biological hypotheses. We present fast software for advanced gene annotation using semantic similarity for Gene Ontology terms combined with clustering and heat map visualisation. The methodology allows rapid identification of genes sharing the same Gene Ontology cluster. Our R based semantic similarity open-source package has a speed advantage of over 2000-fold compared to existing implementations. From the resulting hierarchical clustering dendrogram genes sharing a GO term can be identified, and their differences in the gene expression patterns can be seen from the heat map. These methods facilitate advanced annotation of genes resulting from data analysis.
Xi, Jianing; Wang, Minghui; Li, Ao
2017-09-26
The accumulating availability of next-generation sequencing data offers an opportunity to pinpoint driver genes that are causally implicated in oncogenesis through computational models. Despite previous efforts made regarding this challenging problem, there is still room for improvement in the driver gene identification accuracy. In this paper, we propose a novel integrated approach called IntDriver for prioritizing driver genes. Based on a matrix factorization framework, IntDriver can effectively incorporate functional information from both the interaction network and Gene Ontology similarity, and detect driver genes mutated in different sets of patients at the same time. When evaluated through known benchmarking driver genes, the top ranked genes of our result show highly significant enrichment for the known genes. Meanwhile, IntDriver also detects some known driver genes that are not found by the other competing approaches. When measured by precision, recall and F1 score, the performances of our approach are comparable or increased in comparison to the competing approaches.
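Precision, recall and F1 score for a ranked gene list against a benchmark set can be computed as in the short sketch below. The gene identifiers are placeholders and the numbers are not results from the paper; the function name is likewise hypothetical.

```python
def precision_recall_f1(predicted, benchmark):
    """Precision, recall and F1 of a predicted gene set versus a benchmark set."""
    predicted, benchmark = set(predicted), set(benchmark)
    true_positives = len(predicted & benchmark)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(benchmark) if benchmark else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Placeholder identifiers: four top-ranked predictions, five benchmark genes.
top_ranked = ["G1", "G2", "G7", "G8"]
known_drivers = ["G1", "G2", "G3", "G4", "G5"]
precision, recall, f1 = precision_recall_f1(top_ranked, known_drivers)
print(precision, recall, round(f1, 3))
```

Here two of the four predictions are in the benchmark, so precision is 0.5 and recall is 0.4.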
Garner, O; Mochon, A; Branda, J; Burnham, C-A; Bythrow, M; Ferraro, M; Ginocchio, C; Jennemann, R; Manji, R; Procop, G W; Richter, S; Rychert, J; Sercia, L; Westblade, L; Lewinski, M
2014-04-01
Accurate and timely identification of anaerobic bacteria is critical to successful treatment. Classic phenotypic methods for identification require long turnaround times and can exhibit poor species level identification. Matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOF MS) is an identification method that can provide rapid identification of anaerobes. We present a multi-centre study assessing the clinical performance of the VITEK® MS in the identification of anaerobic bacteria. Five different test sites analysed a collection of 651 unique anaerobic isolates comprising 11 different genera. Multiple species were included for several of the genera. Briefly, anaerobic isolates were applied directly to a well of a target plate. Matrix solution (α-cyano-4-hydroxycinnamic acid) was added and allowed to dry. Mass spectra results were generated with the VITEK® MS, and the comparative spectral analysis and organism identification were determined using the VITEK® MS database 2.0. Results were confirmed by 16S rRNA gene sequencing. Of the 651 isolates analysed, 91.2% (594/651) exhibited the correct species identification. An additional eight isolates were correctly identified to genus level, raising the rate of identification to 92.5%. Genus-level identification consisted of Actinomyces, Bacteroides and Prevotella species. Fusobacterium nucleatum, Actinomyces neuii and Bacteroides uniformis were notable for an increased percentage of no-identification results compared with the other anaerobes tested. VITEK® MS identification of clinically relevant anaerobes is highly accurate and represents a dramatic improvement over other phenotypic methods in accuracy and turnaround time. © 2013 The Authors Clinical Microbiology and Infection © 2013 European Society of Clinical Microbiology and Infectious Diseases.
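The two identification rates quoted above follow directly from the isolate counts in the abstract, as this short check shows (only the helper name `percent` is introduced here):

```python
def percent(part, whole):
    """Percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


total_isolates = 651
correct_species = 594      # correct species-level identifications
genus_only = 8             # additional isolates correct to genus level only

species_rate = percent(correct_species, total_isolates)
overall_rate = percent(correct_species + genus_only, total_isolates)
print(species_rate, overall_rate)  # 91.2 92.5
```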
Identifying differentially expressed genes in cancer patients using a non-parameter Ising model.
Li, Xumeng; Feltus, Frank A; Sun, Xiaoqian; Wang, James Z; Luo, Feng
2011-10-01
Identification of genes and pathways involved in diseases and physiological conditions is a major task in systems biology. In this study, we developed a novel non-parameter Ising model to integrate protein-protein interaction network and microarray data for identifying differentially expressed (DE) genes. We also proposed a simulated annealing algorithm to find the optimal configuration of the Ising model. The Ising model was applied to two breast cancer microarray data sets. The results showed that more cancer-related DE sub-networks and genes were identified by the Ising model than those by the Markov random field model. Furthermore, cross-validation experiments showed that DE genes identified by Ising model can improve classification performance compared with DE genes identified by Markov random field model. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Nakamura, Sayaka; Sato, Hiroaki; Tanaka, Reiko; Kusuya, Yoko; Takahashi, Hiroki; Yaguchi, Takashi
2017-04-26
Accurate identification of Aspergillus species is a very important subject. Mass spectral fingerprinting using matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOF MS) is generally employed for the rapid identification of fungal isolates. However, the results are based on simple mass spectral pattern-matching, with no peak assignment and no taxonomic input. We propose here a ribosomal subunit protein (RSP) typing technique using MALDI-TOF MS for the identification and discrimination of Aspergillus species. The results are concluded to be phylogenetic in that they reflect the molecular evolution of housekeeping RSPs. The amino acid sequences of RSPs of genome-sequenced strains of Aspergillus species were first verified and compared to compile a reliable biomarker list for the identification of Aspergillus species. In this process, we revealed that many amino acid sequences of RSPs (about 10-60%, depending on strain) registered in the public protein databases needed to be corrected or newly added. The verified RSPs were allocated to RSP types based on their mass. Peak assignments of RSPs of each sample strain as observed by MALDI-TOF MS were then performed to set RSP type profiles, which were then further processed by means of cluster analysis. The resulting dendrogram based on RSP types showed a relatively good concordance with the tree based on β-tubulin gene sequences. RSP typing was able to further discriminate the strains belonging to Aspergillus section Fumigati. The RSP typing method could be applied to identify Aspergillus species, even for species within section Fumigati. The discrimination power of RSP typing appears to be comparable to conventional β-tubulin gene analysis. This method would therefore be suitable for species identification and discrimination at the strain to species level. 
Because RSP typing can characterize the strains within section Fumigati, this method has potential as a powerful and reliable tool in the field of clinical microbiology.
Phylogenomics of plant genomes: a methodology for genome-wide searches for orthologs in plants
Conte, Matthieu G; Gaillard, Sylvain; Droc, Gaetan; Perin, Christophe
2008-01-01
Background Gene ortholog identification is now a major objective for mining the increasing amount of sequence data generated by complete or partial genome sequencing projects. Comparative and functional genomics urgently need a method for ortholog detection to reduce gene function inference and to aid in the identification of conserved or divergent genetic pathways between several species. As gene functions change during evolution, reconstructing the evolutionary history of genes should be a more accurate way to differentiate orthologs from paralogs. Phylogenomics takes into account phylogenetic information from high-throughput genome annotation and is the most straightforward way to infer orthologs. However, procedures for automatic detection of orthologs are still scarce and suffer from several limitations. Results We developed a procedure for ortholog prediction between Oryza sativa and Arabidopsis thaliana. Firstly, we established an efficient method to cluster A. thaliana and O. sativa full proteomes into gene families. Then, we developed an optimized phylogenomics pipeline for ortholog inference. We validated the full procedure using test sets of orthologs and paralogs to demonstrate that our method outperforms pairwise methods for ortholog predictions. Conclusion Our procedure achieved a high level of accuracy in predicting ortholog and paralog relationships. Phylogenomic predictions for all validated gene families in both species were easily achieved and we can conclude that our methodology outperforms similarly based methods. PMID:18426584
Nong, Guang; Chow, Virginia; Schmidt, Liesbeth M; Dickson, Don W; Preston, James F
2007-08-01
Pasteuria species are endospore-forming obligate bacterial parasites of soil-inhabiting nematodes and water-inhabiting cladocerans, e.g. water fleas, and are closely related to Bacillus spp. by 16S rRNA gene sequence. As naturally occurring bacteria, biotypes of Pasteuria penetrans are attractive candidates for the biocontrol of various Meloidogyne spp. (root-knot nematodes). Failure to culture these bacteria outside their hosts has prevented isolation of genomic DNA in quantities sufficient for identification of genes associated with host recognition and virulence. We have applied multiple-strand displacement amplification (MDA) to generate DNA for comparative genomics of biotypes exhibiting different host preferences. Using the genome of Bacillus subtilis as a paradigm, MDA allowed quantitative detection and sequencing of 12 marker genes from 2000 cells. Meloidogyne spp. infected with P. penetrans P20 or B4 contained single nucleotide polymorphisms (SNPs) in the spoIIAB gene that did not change the amino acid sequence, or that substituted amino acids with similar chemical properties. Individual nematodes infected with P. penetrans P20 or B4 contained SNPs in the spoIIAB gene sequenced in MDA-generated products. Detection of SNPs in the spoIIAB gene in a nematode indicates infection by more than one genotype, supporting the need to sequence genomes of Pasteuria spp. derived from single spore isolates.
Liu, S; Liu, L; Tang, Y; Xiong, S; Long, J; Liu, Z; Tian, N
2017-07-01
The regulatory mechanism of flavonoids, which synergise anti-malarial and anti-cancer compounds in Artemisia annua, is still unclear. In this study, an anthocyanidin-accumulating mutant callus was induced from A. annua and comparative transcriptomic analysis of wild-type and mutant calli performed, based on the next-generation Illumina/Solexa sequencing platform and de novo assembly. A total of 82,393 unigenes were obtained and 34,764 unigenes were annotated in the public database. Among these, 87 unigenes were assigned to 14 structural genes involved in the flavonoid biosynthetic pathway and 37 unigenes were assigned to 17 structural genes related to metabolism of flavonoids. More than 30 unigenes were assigned to regulatory genes, including R2R3-MYB, bHLH and WD40, which might regulate flavonoid biosynthesis. A further 29 unigenes encoding flavonoid biosynthetic enzymes or transcription factors were up-regulated in the mutant, while 19 unigenes were down-regulated, compared with the wild type. Expression levels of nine genes involved in the flavonoid pathway were compared using semi-quantitative RT-PCR, and results were consistent with comparative transcriptomic analysis. Finally, a putative flavonol synthase gene (AaFLS1) was identified from enzyme assay in vitro and in vivo through heterogeneous expression, and confirmed comparative transcriptomic analysis of wild-type and mutant callus. The present work has provided important target genes for the regulation of flavonoid biosynthesis in A. annua. © 2017 German Botanical Society and The Royal Botanical Society of the Netherlands.
Aldehyde dehydrogenase (ALDH) superfamily in plants: gene nomenclature and comparative genomics
Brocker, Chad; Vasiliou, Melpomene; Carpenter, Sarah; Carpenter, Christopher; Zhang, Yucheng; Wang, Xiping; Kotchoni, Simeon O.; Wood, Andrew J.; Kirch, Hans-Hubert; Kopečný, David; Nebert, Daniel W.
2012-01-01
In recent years, there has been a significant increase in the number of completely sequenced plant genomes. The comparison of fully sequenced genomes allows for identification of new gene family members, as well as comprehensive analysis of gene family evolution. The aldehyde dehydrogenase (ALDH) gene superfamily comprises a group of enzymes involved in the NAD+- or NADP+-dependent conversion of various aldehydes to their corresponding carboxylic acids. ALDH enzymes are involved in processing many aldehydes that serve as biogenic intermediates in a wide range of metabolic pathways. In addition, many of these enzymes function as ‘aldehyde scavengers’ by removing reactive aldehydes generated during the oxidative degradation of lipid membranes, also known as lipid peroxidation. Plants and animals share many ALDH families, and many genes are highly conserved between these two evolutionarily distinct groups. Conversely, both plants and animals also contain unique ALDH genes and families. Herein we carried out genome-wide identification of ALDH genes in a number of plant species, including Arabidopsis thaliana (thale cress), Chlamydomonas reinhardtii (unicellular algae), Oryza sativa (rice), Physcomitrella patens (moss), Vitis vinifera (grapevine) and Zea mays (maize). These data were then combined with previous analysis of Populus trichocarpa (poplar tree), Selaginella moellendorffii (gemmiferous spikemoss), Sorghum bicolor (sorghum) and Volvox carteri (colonial algae) for a comprehensive evolutionary comparison of the plant ALDH superfamily. As a result, newly identified genes can be more easily analyzed and gene names can be assigned according to current nomenclature guidelines; our goal is to clarify previously confusing and conflicting names and classifications that might confound results and prevent accurate comparisons between studies. PMID:23007552
Aldehyde dehydrogenase (ALDH) superfamily in plants: gene nomenclature and comparative genomics.
Brocker, Chad; Vasiliou, Melpomene; Carpenter, Sarah; Carpenter, Christopher; Zhang, Yucheng; Wang, Xiping; Kotchoni, Simeon O; Wood, Andrew J; Kirch, Hans-Hubert; Kopečný, David; Nebert, Daniel W; Vasiliou, Vasilis
2013-01-01
In recent years, there has been a significant increase in the number of completely sequenced plant genomes. The comparison of fully sequenced genomes allows for identification of new gene family members, as well as comprehensive analysis of gene family evolution. The aldehyde dehydrogenase (ALDH) gene superfamily comprises a group of enzymes involved in the NAD+- or NADP+-dependent conversion of various aldehydes to their corresponding carboxylic acids. ALDH enzymes are involved in processing many aldehydes that serve as biogenic intermediates in a wide range of metabolic pathways. In addition, many of these enzymes function as 'aldehyde scavengers' by removing reactive aldehydes generated during the oxidative degradation of lipid membranes, also known as lipid peroxidation. Plants and animals share many ALDH families, and many genes are highly conserved between these two evolutionarily distinct groups. Conversely, both plants and animals also contain unique ALDH genes and families. Herein we carried out genome-wide identification of ALDH genes in a number of plant species, including Arabidopsis thaliana (thale cress), Chlamydomonas reinhardtii (unicellular algae), Oryza sativa (rice), Physcomitrella patens (moss), Vitis vinifera (grapevine) and Zea mays (maize). These data were then combined with previous analysis of Populus trichocarpa (poplar tree), Selaginella moellendorffii (gemmiferous spikemoss), Sorghum bicolor (sorghum) and Volvox carteri (colonial algae) for a comprehensive evolutionary comparison of the plant ALDH superfamily. As a result, newly identified genes can be more easily analyzed and gene names can be assigned according to current nomenclature guidelines; our goal is to clarify previously confusing and conflicting names and classifications that might confound results and prevent accurate comparisons between studies.
The Importance of Barley Genetics and Domestication in a Global Perspective
Pourkheirandish, Mohammad; Komatsuda, Takao
2007-01-01
Background Archaeological evidence has revealed that barley (Hordeum vulgare) is one of the oldest crops used by ancient farmers. Studies of the time and place of barley domestication may help in understanding ancient human civilization. Scope Studies of domestication genes in crops have uncovered the mechanisms that converted unpromising wild species into the most important foods for humans. In addition to archaeological studies, molecular studies are providing new insights into the process of domestication. Throughout the process of barley domestication, human selection on wild species resulted in plants with more harvestable seeds. One of the remarkable changes during barley domestication was the appearance of six-rowed barley. The gene associated with this trait results in three times more seed per spike compared with ancestral wild barley. This increase in seed number resulted in a major dichotomy in the evolution of barley. The identification of the six-rowed spike gene provided a framework for understanding how this character evolved. Some important barley domestication genes have been discovered and many are currently being investigated. Conclusions Identification of domestication genes in crops revealed that most of the drastic changes during domestication are the result of functional impairments in transcription factor genes, and creation of new functions is rare. Isolation of the six-rowed spike gene revealed that this trait was domesticated more than once in the domestication history of barley. Six-rowed barley is derived from two-rowed ancestral forms. Isolation of photoperiod-response genes in barley and rice revealed that different genes belonging to similar genetic networks partially control this trait. PMID:17761690
Identification of genetic elements in metabolism by high-throughput mouse phenotyping.
Rozman, Jan; Rathkolb, Birgit; Oestereicher, Manuela A; Schütt, Christine; Ravindranath, Aakash Chavan; Leuchtenberger, Stefanie; Sharma, Sapna; Kistler, Martin; Willershäuser, Monja; Brommage, Robert; Meehan, Terrence F; Mason, Jeremy; Haselimashhadi, Hamed; Hough, Tertius; Mallon, Ann-Marie; Wells, Sara; Santos, Luis; Lelliott, Christopher J; White, Jacqueline K; Sorg, Tania; Champy, Marie-France; Bower, Lynette R; Reynolds, Corey L; Flenniken, Ann M; Murray, Stephen A; Nutter, Lauryl M J; Svenson, Karen L; West, David; Tocchini-Valentini, Glauco P; Beaudet, Arthur L; Bosch, Fatima; Braun, Robert B; Dobbie, Michael S; Gao, Xiang; Herault, Yann; Moshiri, Ala; Moore, Bret A; Kent Lloyd, K C; McKerlie, Colin; Masuya, Hiroshi; Tanaka, Nobuhiko; Flicek, Paul; Parkinson, Helen E; Sedlacek, Radislav; Seong, Je Kyung; Wang, Chi-Kuang Leo; Moore, Mark; Brown, Steve D; Tschöp, Matthias H; Wurst, Wolfgang; Klingenspor, Martin; Wolf, Eckhard; Beckers, Johannes; Machicao, Fausto; Peter, Andreas; Staiger, Harald; Häring, Hans-Ulrich; Grallert, Harald; Campillos, Monica; Maier, Holger; Fuchs, Helmut; Gailus-Durner, Valerie; Werner, Thomas; Hrabe de Angelis, Martin
2018-01-18
Metabolic diseases are a worldwide problem but the underlying genetic factors and their relevance to metabolic disease remain incompletely understood. Genome-wide research is needed to characterize so-far unannotated mammalian metabolic genes. Here, we generate and analyze metabolic phenotypic data of 2016 knockout mouse strains under the aegis of the International Mouse Phenotyping Consortium (IMPC) and find 974 gene knockouts with strong metabolic phenotypes. 429 of those had no previous link to metabolism and 51 genes remain functionally completely unannotated. We compared human orthologues of these uncharacterized genes in five GWAS consortia and indeed 23 candidate genes are associated with metabolic disease. We further identify common regulatory elements in promoters of candidate genes. As each regulatory element is composed of several transcription factor binding sites, our data reveal an extensive metabolic phenotype-associated network of co-regulated genes. Our systematic mouse phenotype analysis thus paves the way for full functional annotation of the genome.
USDA-ARS's Scientific Manuscript database
The hypersensitive response (HR) is the most visible and arguably the most important defense response in plants, although the details of how it is controlled and executed remain patchy. In this paper a novel genetic technique called MAGIC (Mutant-Assisted Gene Identification and Characterization) i...
PCR-TRFLP methodology targeting rRNA genes has effectively been used to discriminate between microbial communities but to date has not been used specifically for the analysis of ectomycorrhizal communities colonizing plant roots. We describe here results of a study conducted to a...
USDA-ARS's Scientific Manuscript database
Comparative Gene Identification-58 (CGI-58) is an alpha/beta hydrolase-type protein that regulates lipid homeostasis and signaling in eukaryotes by interacting with and stimulating the activity of several different types of proteins, including a lipase in mammalian cells and a peroxisomal ABC transp...
USDA-ARS's Scientific Manuscript database
Nine hundred twenty two differentially expressed transcripts of cotton in non-inoculated pericarp (NIP) and seed (NIS), pericarp (NTP) and seed (NTS) of cotton inoculated with atoxigenic strain (AF13), and pericarp (TP) and seed (TS) inoculated with toxigenic strain (AF36) of Aspergillus flavus were...
Lin, Yuli; Zou, Weikun; Lin, Shiqiang; Onofua, Dennis; Yang, Zhijian; Chen, Haizhou; Wang, Songliang; Chen, Xuanyang
2017-01-01
Sweet potato production is constrained by Fusarium wilt, which is caused by Fusarium oxysporum f. sp. batatas (Fob). The identification of genes related to disease resistance and the underlying mechanisms will contribute to improving disease resistance via sweet potato breeding programs. In the present study, we performed de novo transcriptome assembly and digital gene expression (DGE) profiling of sweet potato challenged with Fob using Illumina HiSeq technology. In total, 89,944,188 clean reads were generated from 12 samples and assembled into 101,988 unigenes with an average length of 666 bp; of these unigenes, 62,605 (61.38%) were functionally annotated in the NCBI non-redundant protein database by BLASTX with a cutoff E-value of 10^-5. Clusters of Orthologous Groups (COG), Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) annotations were examined to explore the unigenes' functions. We constructed four DGE libraries for the sweet potato cultivars JinShan57 (JS57, highly resistant) and XinZhongHua (XZH, highly susceptible), which were challenged with pathogenic Fob. Genes that were differentially expressed in the four libraries were identified by comparing the transcriptomes. Various genes that were differentially expressed during defense, including chitin elicitor receptor kinase 1 (CERK), mitogen-activated protein kinase (MAPK), WRKY, NAC, MYB, and ethylene-responsive transcription factor (ERF), as well as resistance genes, pathogenesis-related genes, and genes involved in salicylic acid (SA) and jasmonic acid (JA) signaling pathways, were identified. These data represent a sequence resource for genetic and genomic studies of sweet potato that will enhance the understanding of the mechanism of disease resistance.
Araripe, Luciana O; Montenegro, Horácio; Lemos, Bernardo; Hartl, Daniel L
2010-12-14
Hybrid male sterility (HMS) is a common outcome of hybridization between closely related animal species. It arises because interactions between alleles that are functional within one species may be disrupted in hybrids. The identification of genes leading to hybrid sterility is of great interest for understanding the evolutionary process of speciation. In the current work we used marked P-element insertions as dominant markers to efficiently locate one genetic factor causing a severe reduction in fertility in hybrid males of Drosophila simulans and D. mauritiana. Our mapping effort identified a region of 9 kb on chromosome 3, containing three complete and one partial coding sequences. Within this region, two annotated genes are suggested as candidates for the HMS factor, based on the comparative molecular characterization and public-source information. Gene Taf1 is only partially contained in the region, yet shows high polymorphism with four fixed non-synonymous substitutions between the two species. Its molecular functions involve sequence-specific DNA binding and transcription factor activity. Gene agt is a small, intronless gene, whose molecular function is annotated as methylated-DNA-protein-cysteine S-methyltransferase activity. High polymorphism and one fixed non-synonymous substitution suggest this is a fast evolving gene. The gene trees of both genes perfectly separate D. simulans and D. mauritiana into monophyletic groups. Analysis of gene expression using microarray revealed trends that were similar to those previously found in comparisons between whole-genome hybrids and parental species. Identification and subsequent confirmation of the HMS candidate gene will add another case study leading to understanding the evolutionary process of hybrid incompatibility.
Ensemble positive unlabeled learning for disease gene identification.
Yang, Peng; Li, Xiaoli; Chua, Hon-Nian; Kwoh, Chee-Keong; Ng, See-Kiong
2014-01-01
An increasing number of genes have been experimentally confirmed in recent years as causative genes for various human diseases. The newly available knowledge can be exploited by machine learning methods to discover additional unknown genes that are likely to be associated with diseases. In particular, positive unlabeled learning (PU learning) methods, which require only a positive training set P (confirmed disease genes) and an unlabeled set U (the unknown candidate genes) instead of a negative training set N, have been shown to be effective in uncovering new disease genes in the current scenario. Using only a single source of data for prediction can be susceptible to bias due to incompleteness and noise in the genomic data, and a single machine learning predictor is prone to bias caused by inherent limitations of individual methods. In this paper, we propose an effective PU learning framework that integrates multiple biological data sources and an ensemble of powerful machine learning classifiers for disease gene identification. Our proposed method integrates data from multiple biological sources for training PU learning classifiers. A novel ensemble-based PU learning method EPU is then used to integrate multiple PU learning classifiers to achieve accurate and robust disease gene predictions. Our evaluation experiments across six disease groups showed that EPU achieved significantly better results compared with various state-of-the-art prediction methods as well as ensemble learning classifiers. Through integrating multiple biological data sources for training and the outputs of an ensemble of PU learning classifiers for prediction, we are able to minimize the potential bias and errors in individual data sources and machine learning algorithms to achieve more accurate and robust disease gene predictions. In the future, our EPU method can provide an effective framework to integrate additional biological and computational resources for better disease gene predictions.
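The PU learning setting described above (a positive set P, an unlabeled set U, no negative set N) can be illustrated with a small sketch. This is not the authors' EPU method: it swaps in a generic bagging-style scheme with a nearest-centroid scorer, and the function names, feature vectors and parameters are all assumptions made for illustration.

```python
import random

def centroid(rows):
    """Component-wise mean of a list of equal-length feature vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]

def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def bagging_pu_scores(positives, unlabeled, rounds=50, seed=0):
    """Score each unlabeled item; higher means 'looks more like P'.

    Each round draws a random subset of U as pseudo-negatives, builds a
    nearest-centroid scorer (P centroid vs. pseudo-negative centroid), and
    scores only the unlabeled items left out of that draw. Scores are then
    averaged over the rounds in which an item was left out.
    """
    rng = random.Random(seed)
    pos_center = centroid(positives)
    k = min(len(positives), len(unlabeled) - 1)  # always leave some items out
    totals = [0.0] * len(unlabeled)
    counts = [0] * len(unlabeled)
    for _ in range(rounds):
        drawn = set(rng.sample(range(len(unlabeled)), k))
        neg_center = centroid([unlabeled[i] for i in drawn])
        for i, x in enumerate(unlabeled):
            if i in drawn:
                continue
            # Positive margin: closer to the P centroid than to the pseudo-negatives.
            totals[i] += sq_dist(x, neg_center) - sq_dist(x, pos_center)
            counts[i] += 1
    return [t / c if c else 0.0 for t, c in zip(totals, counts)]

# Hypothetical two-feature vectors; index 0 of U is a "hidden" positive.
P = [[1.0, 1.0], [0.9, 1.1], [1.1, 0.9]]
U = [[1.0, 0.95], [0.0, 0.1], [0.1, 0.0], [-0.1, 0.05], [0.05, -0.1]]
scores = bagging_pu_scores(P, U)
best = scores.index(max(scores))
```

Averaging over many random pseudo-negative draws is one simple way to reduce the dependence on any single noisy split, which is the general motivation the abstract gives for using an ensemble.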
Xu, Hai-Ming; Kong, Xiang-Dong; Chen, Fei; Huang, Ji-Xiang; Lou, Xiang-Yang; Zhao, Jian-Yi
2015-10-24
Brassica napus is an important oilseed crop. Dissection of the genetic architecture underlying oil-related biological processes will greatly facilitate the genetic improvement of rapeseed. The differential gene expression during pod development offers a snapshot of the genes responsible for oil accumulation in. To identify candidate genes in the linkage peaks reported previously, we used RNA sequencing (RNA-Seq) technology to analyze the pod transcriptomes of German cultivar Sollux and Chinese inbred line Gaoyou. The RNA samples were collected for RNA-Seq at 5-7, 15-17 and 25-27 days after flowering (DAF). Bioinformatics analysis was performed to investigate differentially expressed genes (DEGs). Gene annotation analysis was integrated with QTL mapping and Brassica napus pod transcriptome profiling to detect potential candidate genes in oilseed. In total, 465 and 2,114 candidate DEGs were identified, respectively, between the two varieties at the same stages and across different periods of each variety. Then, 33 DEGs between Sollux and Gaoyou were identified as the candidate genes affecting seed oil content by combining those DEGs with the quantitative trait locus (QTL) mapping results, of which one was found to be homologous to Arabidopsis thaliana lipid-related genes. Intervarietal DEGs of lipid pathways in QTL regions represent important candidate genes for oil-related traits. Integrated analysis of transcriptome profiling, QTL mapping and comparative genomics with related species leads to efficient identification of the most plausible functional genes underlying oil-content related characters, offering valuable resources for improving breeding programs of Brassica napus. This study provided a comprehensive overview of the pod transcriptomes of two varieties with different oil contents at the three developmental stages.
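The step of combining DEGs with QTL mapping results amounts to keeping the DEGs whose genomic position overlaps a QTL interval. A minimal sketch follows, assuming each gene and QTL is described by a chromosome and a start/end coordinate; the gene names, chromosome labels and coordinates are hypothetical and do not come from the study.

```python
from typing import NamedTuple

class Gene(NamedTuple):
    name: str
    chrom: str
    start: int
    end: int

class Interval(NamedTuple):
    chrom: str
    start: int
    end: int

def genes_in_qtl(degs, qtls):
    """Return names of DEGs that overlap at least one QTL interval."""
    hits = []
    for g in degs:
        for q in qtls:
            # Two closed intervals overlap when each starts before the other ends.
            if g.chrom == q.chrom and g.start <= q.end and g.end >= q.start:
                hits.append(g.name)
                break  # count each gene once
    return hits

# Hypothetical inputs for illustration only.
degs = [
    Gene("degA", "A08", 1_200_000, 1_203_500),
    Gene("degB", "A08", 9_000_000, 9_002_000),
    Gene("degC", "C03", 450_000, 452_000),
]
qtls = [Interval("A08", 1_000_000, 2_000_000), Interval("C03", 400_000, 451_000)]
candidates = genes_in_qtl(degs, qtls)
```

For genome-scale inputs an interval tree or sorted sweep would be faster than this nested loop; the quadratic version is kept here for readability.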
Norling, A; Hirschberg, A L; Rodriguez-Wallberg, K A; Iwarsson, E; Wedell, A; Barbaro, M
2014-08-01
Can high-resolution array comparative genomic hybridization (CGH) analysis of DNA samples from women with primary ovarian insufficiency (POI) improve the diagnosis of the condition and identify novel candidate genes for POI? A mutation affecting the regulatory region of growth differentiation factor 9 (GDF9) was identified for the first time together with several novel candidate genes for POI. Most patients with POI do not receive a molecular diagnosis despite a significant genetic component in the pathogenesis. We performed a case-control study. Twenty-six patients were analyzed by array CGH for identification of copy number variants. Novel changes were investigated in 95 controls and in a separate population of 28 additional patients with POI. The experimental procedures were performed during a 1-year period. DNA samples from 26 patients with POI were analyzed by a customized 1M array-CGH platform with whole genome coverage and probe enrichment targeting 78 genes in sex development. By PCR amplification and sequencing, the breakpoint of an identified partial GDF9 gene duplication was characterized. A multiplex ligation-dependent probe amplification (MLPA) probe set for specific identification of deletions/duplications affecting GDF9 was developed. An MLPA probe set for the identification of additional cases or controls carrying novel candidate regions identified by array-CGH was developed. Sequencing of three candidate genes was performed. Eleven unique copy number changes were identified in a total of 11 patients, including a tandem duplication of 475 bp, containing part of the GDF9 gene promoter region. The duplicated region contains three NOBOX-binding elements and an E-box, important for GDF9 gene regulation. This aberration is likely causative of POI. Fifty-four patients were investigated for copy number changes within GDF9, but no additional cases were found. 
Ten aberrations constituting novel candidate regions were detected, including a second DNAH6 deletion in a patient with POI. Other identified candidate genes were TSPYL6, SMARCC1, CSPG5 and ZFR2. This is a descriptive study and no functional experiments were performed. The study illustrates the importance of analyzing small copy number changes in addition to sequence alterations in the genetic investigation of patients with POI. Also, promoter regions should be included in the investigation. The study was supported by grants from the Swedish Research council (project no 12198 to A.W. and project no 20324 to A.L.H.), Stockholm County Council (E.I., A.W. and K.R.W.), Foundation Frimurare Barnhuset (A.N., A.W. and M.B.), Karolinska Institutet (A.N., A.L.H., E.I., A.W. and M.B.), Novo Nordic Foundation (A.W.) and Svenska Läkaresällskapet (M.B.). The funding sources had no involvement in the design or analysis of the study. The authors have no competing interests to declare. Not applicable. © The Author 2014. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology.
Comparative analysis of genome-wide Mlo gene family in Cajanus cajan and Phaseolus vulgaris.
Deshmukh, Reena; Singh, V K; Singh, B D
2016-04-01
The Mlo gene was discovered in barley because the mutant 'mlo' allele conferred broad-spectrum, non-race-specific resistance to powdery mildew caused by Blumeria graminis f. sp. hordei. The Mlo genes also play important roles in growth and development of plants, and in responses to biotic and abiotic stresses. The Mlo gene family has been characterized in several crop species, but only a single legume species, soybean (Glycine max L.), has been investigated so far. The present report describes in silico identification of 18 CcMlo and 20 PvMlo genes in the important legume crops Cajanus cajan (L.) Millsp. and Phaseolus vulgaris L., respectively. In silico analysis of gene organization, protein properties and conserved domains revealed that the C. cajan and P. vulgaris Mlo gene paralogs are more divergent from each other than from their orthologous pairs. The comparative phylogenetic analysis classified CcMlo and PvMlo genes into three major clades. A comparative analysis of CcMlo and PvMlo proteins with the G. max Mlo proteins indicated close association of one CcMlo and one PvMlo with two GmMlo genes, indicating that there was no further expansion of the Mlo gene family after the separation of these species. Thus, most of the diploid species of eudicots might be expected to contain 15-20 Mlo genes. The genes CcMlo12 and 14, and PvMlo11 and 12 are predicted to participate in powdery mildew resistance. If this prediction were verified, these genes could be targeted by TILLING or CRISPR to isolate powdery mildew resistant mutants.
Dellett, Margaret; O’Hagan, Kathleen Ann; Colyer, Hilary Ann Alexandra; Mills, Ken I.
2010-01-01
Around 80% of acute myeloid leukemia (AML) patients achieve a complete remission; however, many will relapse and ultimately die of their disease. The association between karyotype and prognosis has been studied extensively and identified patient cohorts as having favourable [e.g. t(8; 21), inv (16)/t(16; 16), t(15; 17)], intermediate [e.g. cytogenetically normal (NK-AML)] or adverse risk [e.g. complex karyotypes]. Previous studies have shown that gene expression profiling signatures can classify the sub-types of AML, although few reports have shown a similar feature by using methylation markers. The global methylation patterns in 19 diagnostic AML samples were investigated using the Methylated CpG Island Amplification Microarray (MCAM) method and CpG island microarrays containing 12,000 CpG sites. The first analysis, comparing favourable and intermediate cytogenetic risk groups, revealed significantly differentially methylated CpG sites (594 CpG islands) between the two subgroups. Mutations in the NPM1 gene occur at a high frequency (40%) within the NK-AML subgroup and are associated with a more favourable prognosis in these patients. A second analysis comparing the NPM1 mutant and wild-type research study subjects again identified distinct methylation profiles between these two subgroups. Network and pathway analysis revealed possible molecular mechanisms associated with the different risk and/or mutation sub-groups. This may result in a better classification of the risk groups, improved monitoring targets, or the identification of novel molecular therapies. PMID:24179384
Bahramnejad, Bahman
2014-01-01
P. atlantica subsp. Kurdica, with the local name of Baneh, is a wild medicinal plant which grows in Kurdistan, Iran. The identification of resistance gene analogs holds great promise for the development of resistant cultivars. A PCR approach with degenerate primers designed according to conserved NBS-LRR (nucleotide binding site-leucine rich repeat) regions of known disease-resistance (R) genes was used to amplify and clone homologous sequences from P. atlantica subsp. Kurdica. A DNA fragment of the expected 500-bp size was amplified. The nucleotide sequence of this amplicon was obtained through sequencing, and comparison of the predicted amino acid sequence with the amino acid sequences of known R-genes revealed significant sequence similarity. Alignment of the deduced amino acid sequence of the P. atlantica subsp. Kurdica resistance gene analog (RGA) showed strong identity, ranging from 68% to 77%, to the non-toll interleukin receptor (non-TIR) R-gene subfamily from other plants. A P-loop motif (GMMGGEGKTT), a conserved and hydrophobic motif GLPLAL, a kinase-2a motif (LLVLDDV, replaced by IAVFDDI in PAKRGA1) and a kinase-3a motif (FGPGSRIII) were present in all RGAs. A phylogenetic tree based on the deduced amino-acid sequences of PAKRGA1 and RGAs from different species indicated that they were separated into two clusters, with PAKRGA1 in cluster II. The isolated NBS analogs can eventually be used as guidelines to isolate numerous R-genes in Pistachio. PMID:27843981
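Checking a deduced amino-acid sequence for the conserved NBS motifs named above can be sketched as a simple exact-match scan. The motif strings are those quoted in the abstract; the function name and the toy sequence are assumptions for illustration, and the toy sequence is not the real PAKRGA1 sequence.

```python
# Motifs as quoted in the abstract; kinase-2a lists the canonical form first,
# followed by the variant reported for PAKRGA1.
NBS_MOTIFS = {
    "P-loop": ["GMMGGEGKTT"],
    "kinase-2a": ["LLVLDDV", "IAVFDDI"],
    "kinase-3a": ["FGPGSRIII"],
    "GLPLAL": ["GLPLAL"],
}

def find_motifs(protein, motifs=NBS_MOTIFS):
    """Map each motif name to (matched variant, 0-based position) if present."""
    found = {}
    for name, variants in motifs.items():
        for variant in variants:
            pos = protein.find(variant)
            if pos != -1:
                found[name] = (variant, pos)
                break  # report the first variant that matches
    return found

# Toy sequence built from the motifs plus filler residues (hypothetical).
toy = "MA" + "GMMGGEGKTT" + "AAAA" + "IAVFDDI" + "SSSS" + "FGPGSRIII" + "TT" + "GLPLAL" + "K"
found = find_motifs(toy)
```

Exact matching is the simplest possible check. Real RGA surveys usually tolerate substitutions, for example with profile-based or pattern-based searches, so this sketch would miss diverged motifs.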
Comparison of algorithms for the detection of cancer-drivers at sub-gene resolution
Porta-Pardo, Eduard; Kamburov, Atanas; Tamborero, David; Pons, Tirso; Grases, Daniela; Valencia, Alfonso; Lopez-Bigas, Nuria; Getz, Gad; Godzik, Adam
2018-01-01
Understanding genetic events that lead to cancer initiation and progression remains one of the biggest challenges in cancer biology. Traditionally most algorithms for cancer driver identification look for genes that have more mutations than expected from the average background mutation rate. However, there is now a wide variety of methods that look for non-random distribution of mutations within proteins as a signal they have a driving role in cancer. Here we classify and review the progress of such sub-gene resolution algorithms, compare their findings on four distinct cancer datasets from The Cancer Genome Atlas and discuss how predictions from these algorithms can be interpreted in the emerging paradigms that challenge the simple dichotomy between driver and passenger genes. PMID:28714987
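The idea of a non-random distribution of mutations within a protein can be illustrated with a one-sided binomial test: if mutations fell uniformly along the protein, the number landing in a region would follow a binomial distribution with success probability equal to the region's share of the protein length. This is a deliberately simplified sketch, not any specific algorithm covered by the review; the function names and the example numbers are hypothetical.

```python
from math import comb

def binomial_tail(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))

def region_enrichment_p(region_len, protein_len, region_mut, total_mut):
    """One-sided p-value that a region holds at least `region_mut` of
    `total_mut` mutations under a uniform-per-residue null model."""
    p = region_len / protein_len
    return binomial_tail(region_mut, total_mut, p)

# Hypothetical example: a 50-residue domain in a 500-residue protein
# carrying 8 of the protein's 10 observed mutations.
p_clustered = region_enrichment_p(50, 500, 8, 10)
# Same domain carrying only 1 of 10 mutations: no evidence of clustering.
p_background = region_enrichment_p(50, 500, 1, 10)
```

A uniform null ignores sequence context, per-residue mutability and multiple testing across regions, all of which a practical method would need to handle.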
Identification and consequences of miRNA-target interactions--beyond repression of gene expression.
Hausser, Jean; Zavolan, Mihaela
2014-09-01
Comparative genomics analyses and high-throughput experimental studies indicate that a microRNA (miRNA) binds to hundreds of sites across the transcriptome. Although the knockout of components of the miRNA biogenesis pathway has profound phenotypic consequences, most predicted miRNA targets undergo small changes at the mRNA and protein levels when the expression of the miRNA is perturbed. Alternatively, miRNAs can establish thresholds in and increase the coherence of the expression of their target genes, as well as reduce the cell-to-cell variability in target gene expression. Here, we review the recent progress in identifying miRNA targets and the emerging paradigms of how miRNAs shape the dynamics of target gene expression.
MALDI-TOF MS versus VITEK 2 ANC card for identification of anaerobic bacteria.
Li, Yang; Gu, Bing; Liu, Genyan; Xia, Wenying; Fan, Kun; Mei, Yaning; Huang, Peijun; Pan, Shiyang
2014-05-01
Matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOF MS) is an accurate, rapid and inexpensive technique that has initiated a revolution in the clinical microbiology laboratory for identification of pathogens. The Vitek 2 anaerobe and Corynebacterium (ANC) identification card is a newly developed method for identification of corynebacteria and anaerobic species. The aim of this study was to evaluate the effectiveness of the ANC card and MALDI-TOF MS techniques for identification of clinical anaerobic isolates. Five reference strains and a total of 50 anaerobic bacteria clinical isolates comprising ten different genera and 14 species were identified and analyzed by the ANC card together with the Vitek 2 identification system and by Vitek MS together with the version 2.0 database, respectively. 16S rRNA gene sequencing was used as the reference method for accuracy in the identification. Vitek 2 ANC card and Vitek MS provided comparable results at species level for the five reference strains. Of 50 clinical strains, the Vitek MS provided identification for 46 strains (92%) to the species level, 47 (94%) to genus level, one (2%) low discrimination, two (4%) no identification and one (2%) misidentification. The Vitek 2 ANC card provided identification for 43 strains (86%) correct to the species level, 47 (94%) correct to the genus level, three (6%) low discrimination, three (6%) no identification and one (2%) misidentification. Both Vitek MS and Vitek 2 ANC card can be used for accurate routine clinical anaerobe identification. Compared with the Vitek 2 ANC card, Vitek MS is easier, faster and more economical for each test. The databases currently available for both systems should be updated and further developed to enhance performance.
Tsai, Yu-Shuen; Aguan, Kripamoy; Pal, Nikhil R.; Chung, I-Fang
2011-01-01
Informative genes from microarray data can be used to construct prediction models and investigate biological mechanisms. Differentially expressed genes, the main targets of most gene selection methods, can be classified as single- and multiple-class specific signature genes. Here, we present a novel gene selection algorithm based on a Group Marker Index (GMI), which is intuitive, of low computational complexity, and efficient in identification of both types of genes. Most gene selection methods identify only single-class specific signature genes and cannot identify multiple-class specific signature genes easily. Our algorithm can detect de novo certain conditions of multiple-class specificity of a gene and makes use of a novel non-parametric indicator to assess the discrimination ability between classes. Our method is effective even when the sample size is small as well as when the class sizes are significantly different. To compare the effectiveness and robustness we formulate an intuitive template-based method and use four well-known datasets. We demonstrate that our algorithm outperforms the template-based method in difficult cases with unbalanced distribution. Moreover, the multiple-class specific genes are good biomarkers and play important roles in biological pathways. Our literature survey supports that the proposed method identifies unique multiple-class specific marker genes (not reported earlier to be related to cancer) in the Central Nervous System data. It also discovers unique biomarkers indicating the intrinsic difference between subtypes of lung cancer. We also associate the pathway information with the multiple-class specific signature genes and cross-reference to published studies. We find that the identified genes participate in the pathways directly involved in cancer development in leukemia data.
Our method gives a promising way to find genes that may be involved in pathways of multiple diseases and hence opens up the possibility of using an existing drug on other diseases as well as designing a single drug for multiple diseases. PMID:21909426
Meta-Analysis of Tumor Stem-Like Breast Cancer Cells Using Gene Set and Network Analysis
Lee, Won Jun; Kim, Sang Cheol; Yoon, Jung-Ho; Yoon, Sang Jun; Lim, Johan; Kim, You-Sun; Kwon, Sung Won; Park, Jeong Hill
2016-01-01
Generally, cancer stem cells have epithelial-to-mesenchymal-transition characteristics and other aggressive properties that cause metastasis. However, there have been no reliable markers for the identification of cancer stem cells, and comparative methods examining adherent and sphere cells are widely used to investigate the mechanisms underlying cancer stem cells, because sphere cells are known to maintain cancer stem cell characteristics. In this study, we conducted a meta-analysis that combined gene expression profiles from several studies that utilized tumorsphere technology to investigate tumor stem-like breast cancer cells. We used our own gene expression profiles along with three different gene expression profiles from the Gene Expression Omnibus, which we combined using the ComBat method, and obtained significant gene sets using gene set analysis of our datasets and the combined dataset. This experiment focused on four gene sets, such as cytokine-cytokine receptor interaction, that demonstrated significance in both datasets. Our observations demonstrated that among the genes of the four significant gene sets, six genes were consistently up-regulated and satisfied the p-value of < 0.05, and our network analysis showed high connectivity in five genes. From these results, we established CXCR4, CXCL1 and HMGCS1, the intersecting genes of the datasets with high connectivity and p-value of < 0.05, as significant genes in the identification of cancer stem cells. An additional experiment using quantitative reverse transcription-polymerase chain reaction showed significant up-regulation in MCF-7 derived sphere cells and confirmed the importance of these three genes. Taken together, using meta-analysis that combines gene set and network analysis, we suggested CXCR4, CXCL1 and HMGCS1 as candidates involved in tumor stem-like breast cancer cells.
Distinct from other meta-analyses, by using gene set analysis, we selected possible markers which can explain the biological mechanisms and suggested network analysis as an additional criterion for selecting candidates. PMID:26870956
Shahdoust, Maryam; Hajizadeh, Ebrahim; Mozdarani, Hossein; Chehrei, Ali
2013-01-01
Cigarette smoking is the major risk factor for development of lung cancer. Identification of effects of tobacco on airway gene expression may provide insight into the causes. This research aimed to compare gene expression of large airway epithelium cells in normal smokers (n=13) and non-smokers (n=9) in order to find genes which discriminate the two groups and assess cigarette smoking effects on large airway epithelium cells. Genes discriminating smokers from non-smokers were identified by applying a neural network clustering method, growing self-organizing maps (GSOM), to microarray data according to class discrimination scores. An index was computed based on the difference between the mean expression of each gene in the two groups. This clustering approach provided the possibility of comparing thousands of genes simultaneously. The applied approach compared the mean of 7,129 genes in smokers and non-smokers simultaneously and classified the genes of large airway epithelium cells which were differentially expressed in smokers compared with non-smokers. Seven genes were identified which had the greatest differential expression in smokers compared with the non-smokers group: NQO1, H19, ALDH3A1, AKR1C1, ABHD2, GPX2 and ADH7. Most (NQO1, ALDH3A1, AKR1C1, H19 and GPX2) are known to be clinically notable in lung cancer studies. Furthermore, statistical discriminant analysis showed that these genes could classify samples as smokers or non-smokers correctly with 100% accuracy. With the performed GSOM map, other nodes with high average discrimination scores included genes with alterations strongly related to lung cancer such as AKR1C3, CYP1B1, UCHL1 and AKR1B10. This clustering by comparing expression of thousands of genes at the same time revealed alterations in normal smokers. Most of the identified genes were strongly relevant to lung cancer in the existing literature. The genes may be utilized to identify smokers with increased risk for lung cancer.
A large sample study is now recommended to determine relations between the genes ABHD2 and ADH7 and smoking.
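The abstract does not give the formula for its class discrimination index, so the following is only a stand-in: a signal-to-noise style score, the absolute difference of group means divided by the sum of group standard deviations, used to rank genes between two groups. The formula choice, function names, gene labels and expression values are all assumptions for illustration and are not the study's method or data.

```python
from statistics import mean, stdev

def discrimination_score(group_a, group_b):
    """Signal-to-noise style score: |mean_a - mean_b| / (sd_a + sd_b).

    Assumed stand-in for the study's index, which the abstract does not define.
    Each group needs at least two samples for the standard deviation.
    """
    diff = abs(mean(group_a) - mean(group_b))
    spread = stdev(group_a) + stdev(group_b)
    if spread > 0:
        return diff / spread
    # No within-group variation: perfectly separated or identical groups.
    return float("inf") if diff > 0 else 0.0

def rank_genes(expr_a, expr_b):
    """Return gene names sorted from most to least discriminating."""
    scores = {g: discrimination_score(expr_a[g], expr_b[g]) for g in expr_a}
    return sorted(scores, key=scores.get, reverse=True)

# Hypothetical expression values for two placeholder genes.
smokers = {"geneX": [10.0, 11.0, 12.0], "geneY": [5.0, 6.0, 7.0]}
non_smokers = {"geneX": [1.0, 2.0, 3.0], "geneY": [5.5, 6.5, 7.5]}
ranking = rank_genes(smokers, non_smokers)
```

In the study these per-gene scores feed a GSOM clustering step, which is not reproduced here.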
o-p′-DDT-mediated uterotrophy and gene expression in immature C57BL/6 mice and Sprague–Dawley rats
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kwekel, Joshua C.; Forgacs, Agnes L.; Center for Integrative Toxicology, Michigan State University, East Lansing, MI
1,1,1-Trichloro-2,2-bis(2-chlorophenyl-4-chlorophenyl)ethane (o,p′-DDT) is an organochlorine pesticide and endocrine disruptor known to activate the estrogen receptor. Comprehensive ligand- and species-comparative dose- and time-dependent studies were conducted to systematically assess the uterine physiological, morphological and gene expression responses elicited by o,p′-DDT and ethynyl estradiol (EE) in immature ovariectomized C57BL/6 mice and Sprague–Dawley rats. Custom cDNA microarrays were used to identify conserved and divergent differential gene expression responses. A total of 1256 genes were differentially expressed by both ligands in both species, 559 of which exhibited similar temporal expression profiles suggesting that o,p′-DDT elicits estrogenic effects at high doses when compared to EE. However, 51 genes exhibited species-specific uterine expression elicited by o,p′-DDT. For example, carbonic anhydrase 2 exhibited species- and ligand-divergent expression as confirmed by quantitative real-time PCR. The identification of comparable temporal phenotypic responses linked to gene expression demonstrates that systematic comparative gene expression assessments are valuable for elucidating conserved and divergent estrogen signaling mechanisms in rodent uterotrophy. - Highlights: • o,p′-DDT and ethynyl estradiol (EE) both elicit uterotrophy in mice and rats. • o,p′-DDT and EE have different kinetics in uterine wet weight induction. • o,p′-DDT elicited stromal hypertrophy in rats but myometrial hypertrophy in mice. • 1256 genes were differentially expressed by both ligands in both species. • Only 51 genes had species-specific uterine expression.
Accurate population genetic measurements require cryptic species identification in corals
NASA Astrophysics Data System (ADS)
Sheets, Elizabeth A.; Warner, Patricia A.; Palumbi, Stephen R.
2018-06-01
Correct identification of closely related species is important for reliable measures of gene flow. Incorrectly lumping individuals of different species together has been shown to over- or underestimate population differentiation, but examples highlighting when these different results are observed in empirical datasets are rare. Using 199 single nucleotide polymorphisms, we assigned 768 individuals in the Acropora hyacinthus and A. cytherea morphospecies complexes to each of eight previously identified cryptic genetic species and measured intraspecific genetic differentiation across three geographic scales (within reefs, among reefs within an archipelago, and among Pacific archipelagos). We then compared these calculations to estimated genetic differentiation at each scale with all cryptic genetic species mixed as if we could not tell them apart. At the reef scale, correct genetic species identification yielded lower F_ST estimates and fewer significant comparisons than when species were mixed, raising estimates of short-scale gene flow. In contrast, correct genetic species identification at large spatial scales yielded higher F_ST measurements than mixed-species comparisons, lowering estimates of long-term gene flow among archipelagos. A meta-analysis of published population genetic studies in corals found similar results: F_ST estimates at small spatial scales were lower and significance was found less often in studies that controlled for cryptic species. Our results and these prior datasets controlling for cryptic species suggest that genetic differentiation among local reefs may be lower than what has generally been reported in the literature. Not properly controlling for cryptic species structure can bias population genetic analyses in different directions across spatial scales, and this has important implications for conservation strategies that rely on these estimates.
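The reef-scale effect described above (mixing cryptic species inflates the fixation index, F_ST) can be illustrated with a minimal sketch. It assumes Nei's single-locus estimator with equal-sized subpopulations, which is not necessarily the estimator used in the study, and every allele frequency and species share below is hypothetical. It covers only the small-scale direction of the bias, not the reversal reported among archipelagos.

```python
def nei_fst(freqs):
    """Nei's F_ST for one biallelic locus, assuming equal-sized subpopulations.

    freqs: allele frequency of the same allele in each subpopulation.
    """
    p_bar = sum(freqs) / len(freqs)
    h_total = 2 * p_bar * (1 - p_bar)                             # expected heterozygosity, pooled
    h_within = sum(2 * p * (1 - p) for p in freqs) / len(freqs)   # mean within-subpopulation value
    if h_total == 0:
        return 0.0  # locus is fixed everywhere: no differentiation to measure
    return (h_total - h_within) / h_total


def mixed_freq(p_a, p_b, share_a):
    """Allele frequency in a sample that mixes species A and B."""
    return share_a * p_a + (1 - share_a) * p_b


# Hypothetical values: each species has the same allele frequency on both reefs,
# so the true within-species differentiation between reefs is zero.
P_A, P_B = 0.9, 0.1
within_species = nei_fst([P_A, P_A])

# If reef 1 is sampled as 80% species A and reef 2 as 20% species A,
# lumping the species creates apparent differentiation between reefs.
reef1 = mixed_freq(P_A, P_B, 0.8)
reef2 = mixed_freq(P_A, P_B, 0.2)
mixed = nei_fst([reef1, reef2])

print(f"within species: {within_species:.4f}, species mixed: {mixed:.4f}")
```

In this toy case the only source of "differentiation" between reefs is the difference in species composition of the two samples.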
Fang, H; Ohlsson, A-K; Ullberg, M; Ozenci, V
2012-11-01
The purpose of this investigation was to compare the performance of species-specific polymerase chain reaction (PCR), matrix-assisted laser desorption/ionisation time-of-flight mass spectrometry (MALDI-TOF MS) and phenotypic identification systems for the identification of Enterococcus species. A total of 132 clinical isolates were investigated by the following: (1) a multiplex real-time PCR assay targeting ddl Enterococcus faecium, ddl Enterococcus faecalis, vanC1 and vanC2/C3 genes, and a high-resolution melting (HRM) analysis of the groESL gene for the differentiation of Enterococcus casseliflavus and Enterococcus gallinarum; (2) Bruker MS; (3) VITEK MS; and (4) the VITEK 2 system. 16S rRNA gene sequencing was used as a reference method in the study. The 132 isolates were identified as 32 E. faecalis, 63 E. faecium, 16 E. casseliflavus and 21 E. gallinarum. The multiplex PCR, Bruker MS and VITEK MS were able to identify all the isolates correctly at the species level. The VITEK 2 system could identify 131/132 (99.2 %) and 121/132 (91.7 %) of the isolates at the genus and species levels, respectively. The HRM-groESL assay identified all (21/21) E. gallinarum isolates and 81.3 % (13/16) of the E. casseliflavus isolates. The PCR methods described in the present study are effective in identifying the enterococcal species. MALDI-TOF MS is a rapid, reliable and cost-effective identification technique for enterococci. The VITEK 2 system is less efficient at detecting non-faecalis and non-faecium Enterococcus species.
NASA Astrophysics Data System (ADS)
Ng, Siuk-Mun; Lee, Xin-Wei; Wan, Kiew-Lian; Firdaus-Raih, Mohd
2015-09-01
Regulation of functional nucleus-encoded proteins targeting the plastidial functions was comparatively studied for a plant parasite, Rafflesia cantleyi versus a photosynthetic plant, Arabidopsis thaliana. This study involved two species of different feeding modes and different developmental stages. A total of 30 nucleus-encoded proteins were found to be differentially-regulated during two stages in the parasite; whereas 17 nucleus-encoded proteins were differentially-expressed during two developmental stages in Arabidopsis thaliana. One notable finding observed for the two plants was the identification of genes involved in the regulation of photosynthesis-related processes where these processes, as expected, seem to be present only in the autotroph.
Jaggi, Preeti; Mejias, Asuncion; Xu, Zhaohui; Yin, Han; Moore-Clingenpeel, Melissa; Smith, Bennett; Burns, Jane C; Tremoulet, Adriana H; Jordan-Villegas, Alejandro; Chaussabel, Damien; Texter, Karen; Pascual, Virginia; Ramilo, Octavio
2018-01-01
Early identification of children with Kawasaki Disease (KD) is key for timely initiation of intravenous immunoglobulin (IVIG) therapy. However, the diagnosis of the disease remains challenging, especially in children with an incomplete presentation (inKD). Moreover, we currently lack objective tools for identification of non-response (NR) to IVIG. Children with KD were enrolled and samples obtained before IVIG treatment and sequentially at 24 h and 4-6 weeks post-IVIG in a subset of patients. We also enrolled children with other febrile illnesses [adenovirus (AdV); group A streptococcus (GAS)] and healthy controls (HC) for comparative analyses. Blood transcriptional profiles were analyzed to define: a) the cKD and inKD biosignature, b) compare the KD signature with other febrile illnesses and, c) identify biomarkers predictive of clinical outcomes. We identified a cKD biosignature (n = 39; HC, n = 16) that was validated in two additional cohorts of children with cKD (n = 37; HC, n = 20) and inKD (n = 13; HC, n = 8) and was characterized by overexpression of inflammation, platelets, apoptosis and neutrophil genes, and underexpression of T and NK cell genes. Classifier genes discriminated KD from adenovirus with higher sensitivity and specificity (92% and 100%, respectively) than for GAS (75% and 87%, respectively). We identified a genomic score (MDTH) that was higher at baseline in IVIG-NR [median 12,290 vs. 5,572 in responders, p = 0.009] and independently predicted IVIG-NR. A reproducible biosignature from KD patients was identified, and was similar in children with cKD and inKD. A genomic score allowed early identification of children at higher risk for non-response to IVIG.
He, Hongjuan; Xiu, Youcheng; Guo, Jing; Liu, Hui; Liu, Qi; Zeng, Tiebo; Chen, Yan; Zhang, Yan; Wu, Qiong
2013-01-01
Long non-coding RNAs (lncRNAs), a key group of non-coding RNAs, have gained wide attention. Although lncRNAs have been functionally annotated and systematically explored in higher mammals, few have undergone systematic identification and annotation. Owing to their expression specificity, the known lncRNAs expressed in embryonic brain tissues remain limited. Because a large number of lncRNAs are transcribed only in brain tissues, studies of lncRNAs in the developing brain are of special interest. Here, publicly available RNA-sequencing (RNA-seq) data from embryonic brain are integrated to identify thousands of embryonic brain lncRNAs using a customized pipeline. A significant proportion of novel transcripts have not been annotated by available genomic resources. The putative embryonic brain lncRNAs are shorter, less spliced and less conserved than known genes. On average, putative lncRNAs are expressed at about one tenth the level of known coding genes, which is comparable with known lncRNAs. Chromatin data show that putative embryonic brain lncRNAs are associated with active chromatin marks, comparable with known lncRNAs. Embryonic brain expressed lncRNAs are also indicated to have expression though not evident in adult brain. Gene Ontology analysis of putative embryonic brain lncRNAs suggests that they are associated with brain development. The putative lncRNAs are shown to be related to possible cis-regulatory roles in imprinting even themselves are deemed to be imprinted lncRNAs. Re-analysis of one knockdown dataset suggests that four regulators are associated with lncRNAs. Taken together, the identification and systematic analysis of putative lncRNAs would provide novel insights into uncharacterized mouse non-coding regions and their relationships with mammalian embryonic brain development. PMID:23967161
Xu, Zhaohui; Yin, Han; Moore-Clingenpeel, Melissa; Smith, Bennett; Burns, Jane C.; Tremoulet, Adriana H.; Jordan-Villegas, Alejandro; Chaussabel, Damien; Texter, Karen; Pascual, Virginia; Ramilo, Octavio
2018-01-01
Background Early identification of children with Kawasaki Disease (KD) is key for timely initiation of intravenous immunoglobulin (IVIG) therapy. However, the diagnosis of the disease remains challenging, especially in children with an incomplete presentation (inKD). Moreover, we currently lack objective tools for identification of non-response (NR) to IVIG. Methods Children with KD were enrolled and samples obtained before IVIG treatment and sequentially at 24 h and 4–6 weeks post-IVIG in a subset of patients. We also enrolled children with other febrile illnesses [adenovirus (AdV); group A streptococcus (GAS)] and healthy controls (HC) for comparative analyses. Blood transcriptional profiles were analyzed to define: a) the cKD and inKD biosignature, b) compare the KD signature with other febrile illnesses and, c) identify biomarkers predictive of clinical outcomes. Results We identified a cKD biosignature (n = 39; HC, n = 16) that was validated in two additional cohorts of children with cKD (n = 37; HC, n = 20) and inKD (n = 13; HC, n = 8) and was characterized by overexpression of inflammation, platelets, apoptosis and neutrophil genes, and underexpression of T and NK cell genes. Classifier genes discriminated KD from adenovirus with higher sensitivity and specificity (92% and 100%, respectively) than for GAS (75% and 87%, respectively). We identified a genomic score (MDTH) that was higher at baseline in IVIG-NR [median 12,290 vs. 5,572 in responders, p = 0.009] and independently predicted IVIG-NR. Conclusion A reproducible biosignature from KD patients was identified, and was similar in children with cKD and inKD. A genomic score allowed early identification of children at higher risk for non-response to IVIG. PMID:29813106
Identification of trans-acting factors regulating SamDC expression in Oryza sativa
DOE Office of Scientific and Technical Information (OSTI.GOV)
Basu, Supratim, E-mail: supratim_genetics@yahoo.co.in; Division of Plant Biology, Bose Institute, Kolkata; Roychoudhury, Aryadeep
2014-03-07
Highlights: • Identification of cis elements responsible for SamDC expression by in silico analysis. • qPCR analysis of SamDC expression to abiotic and biotic stress treatments. • Detection of SamDC regulators using identified cis-elements as probe by EMSA. • Southwestern Blot analysis to predict the size of the trans-acting factors. - Abstract: Abiotic stress affects the growth and productivity of crop plants; to cope with the adverse environmental conditions, plants have developed efficient defense machinery comprising antioxidants like phenolics and flavonoids, and osmolytes like polyamines. SamDC is a key enzyme in the polyamine biosynthesis pathway in plants. In our present communication we have done in silico analysis of the promoter region of SamDC to look for the presence of different cis-regulatory elements contributing to its expression. Based on the presence of different cis-regulatory elements we completed comparative analysis of SamDC gene expression in rice lamina of IR-29 and Nonabokra by qPCR in response to the abiotic stress treatments of salinity, drought, cold and the biotic stress treatments of ABA and light. Additionally, to explore the role of the cis-regulatory elements in regulating the expression of SamDC gene in plants we comparatively analyzed the binding of rice nuclear proteins prepared from IR-29 and Nonabokra undergoing various stress treatments. The intensity of the complex formed was low and inducible in IR-29 in contrast to Nonabokra. Southwestern blot analysis helped in predicting the size of the trans-acting factors binding to these cis-elements. To our knowledge this is the first report on the comprehensive analysis of SamDC gene expression in rice and identification of the trans-acting factors regulating its expression.
Kong, Ling-An; Wu, Du-Qing; Huang, Wen-Kun; Peng, Huan; Wang, Gao-Feng; Cui, Jiang-Kuan; Liu, Shi-Ming; Li, Zhi-Gang; Yang, Jun; Peng, De-Liang
2015-10-16
Cereal cyst nematode Heterodera avenae, an important soil-borne pathogen in wheat, causes numerous annual yield losses worldwide, and use of resistant cultivars is the best strategy for control. However, target genes are not readily available for breeding resistant cultivars. Therefore, comparative transcriptomic analyses were performed to identify more applicable resistance genes for cultivar breeding. The developing nematodes within roots were stained with acid fuchsin solution. Transcriptome assemblies and redundancy filtration were obtained by Trinity, TGI Clustering Tool and BLASTN, respectively. Gene Ontology annotation was yielded by Blast2GO program, and metabolic pathways of transcripts were analyzed by Path_finder. The ROS levels were determined by luminol-chemiluminescence assay. The transcriptional gene expression profiles were obtained by quantitative RT-PCR. The RNA-sequencing was performed using an incompatible wheat cultivar VP1620 and a compatible control cultivar WEN19 infected with H. avenae at 24 h, 3 d and 8 d. Infection assays showed that VP1620 failed to block penetration of H. avenae but disturbed the transition of developmental stages, leading to a significant reduction in cyst formation. Two types of expression profiles were established to predict candidate resistance genes after developing a novel strategy to generate clean RNA-seq data by removing the transcripts of H. avenae within the raw data before assembly. Using the uncoordinated expression profiles with transcript abundance as a standard, 424 candidate resistance genes were identified, including 302 overlapping genes and 122 VP1620-specific genes. Genes with similar expression patterns were further classified according to the scales of changed transcript abundances, and 182 genes were rescued as supplementary candidate resistance genes. Functional characterizations revealed that diverse defense-related pathways were responsible for wheat resistance against H. avenae. Moreover, phospholipase was involved in many defense-related pathways and localized in the connection position. Furthermore, strong bursts of reactive oxygen species (ROS) within VP1620 roots infected with H. avenae were induced at 24 h and 3 d, and eight ROS-producing genes were significantly upregulated, including three class III peroxidase and five lipoxygenase genes. Large-scale identification of wheat resistance genes was performed by comparative transcriptomic analysis. Functional characterization showed that phospholipases associated with ROS production played vital roles in early defense responses to H. avenae via involvement in diverse defense-related pathways as a hub switch. This study, the first to investigate the early defense responses of wheat against H. avenae, not only provides applicable candidate resistance genes for breeding novel wheat cultivars, but also enables a better understanding of the defense mechanisms of wheat against H. avenae.
Ortholog Identification and Comparative Analysis of Microbial Genomes Using MBGD and RECOG.
Uchiyama, Ikuo
2017-01-01
Comparative genomics is becoming an essential approach for identification of genes associated with a specific function or phenotype. Here, we introduce the microbial genome database for comparative analysis (MBGD), which is a comprehensive ortholog database among the microbial genomes available so far. MBGD contains several precomputed ortholog tables including the standard ortholog table covering the entire taxonomic range and taxon-specific ortholog tables for various major taxa. In addition, MBGD allows the users to create an ortholog table within any specified set of genomes through dynamic calculations. In particular, MBGD has a "My MBGD" mode where users can upload their original genome sequences and incorporate them into orthology analysis. The created ortholog table can serve as the basis for various comparative analyses. Here, we describe the use of MBGD and briefly explain how to utilize the orthology information during comparative genome analysis in combination with the stand-alone comparative genomics software RECOG, focusing on the application to comparison of closely related microbial genomes.
ERIC Educational Resources Information Center
Castermans, Dries; Wilquet, Valerie; Steyaert, Jean; van de Ven, Wim; Fryns, Jean-Pierre; Devriendt, Koen
2004-01-01
We review the different strategies currently used to try to identify susceptibility genes for idiopathic autism. Although identification of genes is usually straightforward in Mendelian disorders, it has proved to be much more difficult to establish in polygenic disorders like autism. Neither genome screens of affected siblings nor the large…
Spittel, Susanne; Hoedemaker, Martina
2012-01-01
In the following field study, the commercial PathoProof Mastitis PCR Assay, a real-time PCR for identifying eleven mastitis pathogens and the staphylococcal beta-lactamase gene, was compared with conventional bacterial culture. For this purpose, 681 udder quarter samples from 173 clinically healthy cows with varying somatic cell count from four dairy herds in the region of Osnabrück, Lower Saxony, Germany, were collected between July 2010 and February 2011 and subjected to PCR and bacterial culture. The frequency of positive pathogen signals was markedly higher with PCR compared with culture (70.6% vs. 32.2%). This was accompanied by a substantially higher percentage of multiple pathogen identifications and a lower percentage of single identifications in the PCR compared with bacterial culture. Using bacterial culture as gold standard, moderate to high sensitivities (76.9-100%) and specificities (63.3-98.7%) were calculated for six out of seven pathogens with sufficient detection numbers. For Enterococcus spp, the sensitivity was only 9.1%. When the PCR results of pooled udder quarter samples of the 173 cows were compared with the single udder quarter samples, in 72% of the cases, major pathogen DNA was either not found in both types of samples, or in the case of a positive pool sample, the respective pathogens were found in at least one udder quarter sample. With both methods, the most frequently detected mastitis pathogens were coryneform bacteria (PCR: Corynebacterium bovis), coagulase-negative staphylococci (CNS) and Staphylococcus (S.) aureus, followed by Arcanobacterium pyogenes/Peptoniphilus indolicus with PCR, and then with both methods, Streptococcus uberis. The staphylococcal beta-lactamase gene was found in 27.7% of the S. aureus and in 37.0% of the CNS identifications.
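As a brief illustration of how sensitivity and specificity are obtained when bacterial culture is treated as the gold standard, the sketch below computes both from a 2x2 table of PCR versus culture results. The function name and all counts are hypothetical; the abstract reports only the resulting percentages, not the underlying tables.

```python
def sensitivity_specificity(tp, fp, fn, tn):
    """Sensitivity and specificity of a test against a reference method.

    tp: test positive, reference positive    fp: test positive, reference negative
    fn: test negative, reference positive    tn: test negative, reference negative
    """
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one reference-positive and one reference-negative sample")
    sensitivity = tp / (tp + fn)   # share of culture-positive samples that PCR detects
    specificity = tn / (tn + fp)   # share of culture-negative samples that PCR calls negative
    return sensitivity, specificity


# Hypothetical counts for a single pathogen (not taken from the study).
sens, spec = sensitivity_specificity(tp=30, fp=20, fn=10, tn=140)
print(f"sensitivity = {sens:.1%}, specificity = {spec:.1%}")
```

Note that when the test is more sensitive than the reference, as PCR may be here, PCR-positive and culture-negative samples count against specificity even if the pathogen DNA is truly present.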
Genome-wide identification of lineage-specific genes in Arabidopsis, Oryza and Populus
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yang, Xiaohan; Jawdy, Sara; Tschaplinski, Timothy J
2009-01-01
Protein sequences were compared among Arabidopsis, Oryza and Populus to identify differential gene (DG) sets that are in one but not the other two genomes. The DG sets were screened against a plant transcript database, the NR protein database and six newly-sequenced genomes (Carica, Glycine, Medicago, Sorghum, Vitis and Zea) to identify a set of species-specific genes (SS). Gene expression, protein motif and intron number were examined. 192, 641 and 109 SS genes were identified in Arabidopsis, Oryza and Populus, respectively. Some SS genes were preferentially expressed in flowers, roots, xylem and cambium or up-regulated by stress. Six conserved motifs in Arabidopsis and Oryza SS proteins were found in other distant lineages. The SS gene sets were enriched with intronless genes. The results reflect functional and/or anatomical differences between monocots and eudicots or between herbaceous and woody plants. The Populus-specific genes are candidates for carbon sequestration and biofuel research.
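The idea of a differential gene (DG) set, genes present in one genome but not in the other two, reduces to a set difference once proteins have been grouped into families. The sketch below assumes that grouping has already been done by a similarity search that is not shown; the function name and family identifiers are hypothetical, and the later screening steps against other databases are omitted.

```python
def lineage_specific(families):
    """Return, for each genome, the family IDs found in that genome only.

    families: dict mapping genome name -> set of protein-family identifiers.
    """
    result = {}
    for name, own in families.items():
        # Union of the families seen in every other genome.
        others = set().union(*(fams for other, fams in families.items() if other != name))
        result[name] = own - others
    return result


# Hypothetical family assignments (illustration only).
families = {
    "Arabidopsis": {"fam1", "fam2", "fam3"},
    "Oryza":       {"fam1", "fam4"},
    "Populus":     {"fam1", "fam2", "fam5", "fam6"},
}
dg_sets = lineage_specific(families)
for genome, genes in dg_sets.items():
    print(genome, sorted(genes))
```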
The FMRP regulon: from targets to disease convergence
Fernández, Esperanza; Rajan, Nicholas; Bagni, Claudia
2013-01-01
The fragile X mental retardation protein (FMRP) is an RNA-binding protein that regulates mRNA metabolism. FMRP has been largely studied in the brain, where the absence of this protein leads to fragile X syndrome, the most frequent form of inherited intellectual disability. Since the identification of the FMRP gene in 1991, many studies have primarily focused on understanding the function/s of this protein. Hundreds of potential FMRP mRNA targets and several interacting proteins have been identified. Here, we report the identification of FMRP mRNA targets in the mammalian brain that support the key role of this protein during brain development and in regulating synaptic plasticity. We compared the genes from databases and genome-wide association studies with the brain FMRP transcriptome, and identified several FMRP mRNA targets associated with autism spectrum disorders, mood disorders and schizophrenia, showing a potential common pathway/s for these apparently different disorders. PMID:24167470
Microarray expression profiling identifies genes with altered expression in HDL-deficient mice
DOE Office of Scientific and Technical Information (OSTI.GOV)
Callow, Matthew J.; Dudoit, Sandrine; Gong, Elaine L.
2000-05-05
Based on the assumption that severe alterations in the expression of genes known to be involved in HDL metabolism may affect the expression of other genes we screened an array of over 5000 mouse expressed sequence tags (ESTs) for altered gene expression in the livers of two lines of mice with dramatic decreases in HDL plasma concentrations. Labeled cDNA from livers of apolipoprotein AI (apo AI) knockout mice, Scavenger Receptor BI (SR-BI) transgenic mice and control mice were co-hybridized to microarrays. Two-sample t-statistics were used to identify genes with altered expression levels in the knockout or transgenic mice compared with the control mice. In the SR-BI group we found 9 array elements representing at least 5 genes to be significantly altered on the basis of an adjusted p value of less than 0.05. In the apo AI knockout group 8 array elements representing 4 genes were altered compared with the control group (p < 0.05). Several of the genes identified in the SR-BI transgenic mice suggest altered sterol metabolism and oxidative processes. These studies illustrate the use of multiple-testing methods for the identification of genes with altered expression in replicated microarray experiments of apo AI knockout and SR-BI transgenic mice.
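Because thousands of array elements are tested at once, raw p values from the per-gene t-statistics have to be adjusted before applying the 0.05 cutoff. The abstract does not say which adjustment was used, so the sketch below swaps in the Holm step-down procedure as a simple, widely used stand-in; the function name and the raw p values are hypothetical.

```python
def holm_adjust(pvalues):
    """Holm step-down adjusted p values, returned in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])  # indices from smallest to largest p
    adjusted = [0.0] * m
    running_max = 0.0
    for rank, idx in enumerate(order):
        candidate = min(1.0, (m - rank) * pvalues[idx])
        running_max = max(running_max, candidate)       # enforce monotone adjusted values
        adjusted[idx] = running_max
    return adjusted


# Hypothetical raw p values for four array elements.
raw = [0.001, 0.04, 0.03, 0.2]
adjusted = holm_adjust(raw)
significant = [p < 0.05 for p in adjusted]
print(adjusted, significant)
```

In this toy example three elements have raw p values below 0.05, but only one remains below the cutoff after adjustment, which is the kind of filtering the adjusted p value criterion in the abstract implies.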
Genome-wide identification and evolution of the PIN-FORMED (PIN) gene family in Glycine max.
Liu, Yuan; Wei, Haichao
2017-07-01
Soybean (Glycine max) is one of the most important crop plants. Wild and cultivated soybean varieties have significant differences worth further investigation, such as plant morphology, seed size, and seed coat development; these characters may be related to auxin biology. The PIN gene family encodes essential transport proteins in cell-to-cell auxin transport, but little research on soybean PIN genes (GmPIN genes) has been done, especially with respect to the evolution and differences between wild and cultivated soybean. In this study, we retrieved 23 GmPIN genes from the latest updated G. max genome database; six GmPIN protein sequences were changed compared with the previous database. Based on the Plant Genome Duplication Database, 18 GmPIN genes have been involved in segment duplication. Three pairs of GmPIN genes arose after the second soybean genome duplication, and six occurred after the first genome duplication. The duplicated GmPIN genes retained similar expression patterns. All the duplicated GmPIN genes experienced purifying selection (Ka/Ks < 1) to prevent accumulation of non-synonymous mutations and thus remained more similar. In addition, we also focused on the artificial selection of the soybean PIN genes. Five artificially selected GmPIN genes were identified by comparing the genome sequence of 17 wild and 14 cultivated soybean varieties. Our research provides useful and comprehensive basic information for understanding GmPIN genes.
Identification of genomic islands in six plant pathogens.
Chen, Ling-Ling
2006-06-07
Genomic islands (GIs), which are acquired by horizontal gene transfer, play important roles in microbial evolution. In this paper, the GIs of six completely sequenced plant pathogens are identified using a windowless method based on Z curve representation of DNA sequences. Consequently, four, eight, four, one, two and four GIs longer than 20 kb are recognized in the plant pathogens Agrobacterium tumefaciens str. C58, Ralstonia solanacearum GMI1000, Xanthomonas axonopodis pv. citri str. 306 (Xac), Xanthomonas campestris pv. campestris str. ATCC33913 (Xcc), Xylella fastidiosa 9a5c and Pseudomonas syringae pv. tomato str. DC3000, respectively. Most of these regions share a set of conserved features of GIs, including an abrupt change in GC content compared with that of the rest of the genome, the existence of integrase genes at the junction, the use of tRNA as the integration sites, the presence of genetic mobility genes, the difference of codon usage, codon preference and amino acid usage, etc. The identification of these GIs will benefit the research for the six important phytopathogens.
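For readers unfamiliar with the Z curve, the sketch below computes its three cumulative components for a DNA sequence using the standard definition: x separates purines from pyrimidines, y separates amino from keto bases, and z separates weak (A/T) from strong (G/C) hydrogen-bonding bases. It is only the underlying transform, not the windowless GI detection method of the paper; the function name and the toy sequence are assumptions for illustration.

```python
# Per-base steps (dx, dy, dz) of the Z curve.
Z_STEPS = {
    "A": (1, 1, 1),     # purine, amino, weak
    "G": (1, -1, -1),   # purine, keto, strong
    "C": (-1, 1, -1),   # pyrimidine, amino, strong
    "T": (-1, -1, 1),   # pyrimidine, keto, weak
}


def z_curve(seq):
    """Return the cumulative (x, y, z) coordinates after each unambiguous base."""
    x = y = z = 0
    points = []
    for base in seq.upper():
        if base not in Z_STEPS:
            continue  # skip ambiguous symbols such as N
        dx, dy, dz = Z_STEPS[base]
        x, y, z = x + dx, y + dy, z + dz
        points.append((x, y, z))
    return points


# Toy sequence: an AT-rich stretch followed by a GC-rich stretch.
points = z_curve("AATTGGCC")
z_values = [p[2] for p in points]
print(z_values)  # z rises through the AT-rich part and falls through the GC-rich part
```

The z component is the one relevant to the GC content feature mentioned above: a sudden change in its slope marks a region whose GC content differs from the surrounding sequence.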
Ponce-Alonso, M; Rodríguez-Rojas, L; Del Campo, R; Cantón, R; Morosini, M-I
2016-03-01
The genus Raoultella was excised from Klebsiella in 2001, but difficulties in its identification may have led to an underestimation of its incidence and uncertainty on its pathogenic role. Recently, clinical reports involving Raoultella have increased, probably through the introduction of mass-spectrometry in clinical microbiology laboratories and the development of accurate molecular techniques. We performed a retrospective analysis using our blood culture collection (2011-14) to identify Raoultella isolates that could have been erroneously reported as Klebsiella. PCR and gene sequencing of highly specific chromosomal class A β-lactamase genes was established as the reference method, and compared with 16S rRNA and rpoβ sequencing, as well as matrix-assisted laser desorption/ionization time-of-flight mass spectroscopy (MALDI-TOF MS), MicroScan Walkaway system and API20E biochemical identification. MALDI-TOF and rpoβ correctly identified all Raoultella isolates, whereas 16S rRNA provided inconclusive results, and MicroScan and API20E failed to detect this genus. The analysis of the clinical characteristics of all Raoultella bacteraemia cases reported in the literature supports the role of Raoultella as an opportunistic pathogen that causes biliary tract infections in elderly patients who suffer from some kind of malignancy or have undergone an invasive procedure. Two salient conclusions are that Raoultella shows tropism for the biliary tract and so its identification could help clinicians to suspect underlying biliary tract disease when bacteraemia occurs. Concomitantly, as most phenotypic identification systems are not optimized for the identification of Raoultella, the use of MALDI-TOF or additional phenotypic tests is recommended for the reliable identification of this genus. Copyright © 2015 European Society of Clinical Microbiology and Infectious Diseases. Published by Elsevier Ltd. All rights reserved.
Rosli, Rozana; Amiruddin, Nadzirah; Ab Halim, Mohd Amin; Chan, Pek-Lan; Chan, Kuang-Lim; Azizi, Norazah; Morris, Priscilla E; Leslie Low, Eng-Ti; Ong-Abdullah, Meilina; Sambanthamurthi, Ravigadevi; Singh, Rajinder; Murphy, Denis J
2018-01-01
Comparative genomics and transcriptomic analyses were performed on two agronomically important groups of genes from oil palm versus other major crop species and the model organism, Arabidopsis thaliana. The first analysis was of two gene families with key roles in regulation of oil quality and in particular the accumulation of oleic acid, namely stearoyl ACP desaturases (SAD) and acyl-acyl carrier protein (ACP) thioesterases (FAT). In both cases, these were found to be large gene families with complex expression profiles across a wide range of tissue types and developmental stages. The detailed classification of the oil palm SAD and FAT genes has enabled the updating of the latest version of the oil palm gene model. The second analysis focused on disease resistance (R) genes in order to elucidate possible candidates for breeding of pathogen tolerance/resistance. Ortholog analysis showed that 141 out of the 210 putative oil palm R genes had homologs in banana and rice. These genes formed 37 clusters with 634 orthologous genes. Classification of the 141 oil palm R genes showed that the genes belong to the Kinase (7), CNL (95), MLO-like (8), RLK (3) and Others (28) categories. The CNL R genes formed eight clusters. Expression data for selected R genes also identified potential candidates for breeding of disease resistance traits. Furthermore, these findings can provide information about the species evolution as well as the identification of agronomically important genes in oil palm and other major crops.
Rosli, Rozana; Amiruddin, Nadzirah; Ab Halim, Mohd Amin; Chan, Pek-Lan; Chan, Kuang-Lim; Azizi, Norazah; Morris, Priscilla E.; Leslie Low, Eng-Ti; Ong-Abdullah, Meilina; Sambanthamurthi, Ravigadevi; Singh, Rajinder
2018-01-01
Comparative genomics and transcriptomic analyses were performed on two agronomically important groups of genes from oil palm versus other major crop species and the model organism, Arabidopsis thaliana. The first analysis was of two gene families with key roles in regulation of oil quality and in particular the accumulation of oleic acid, namely stearoyl ACP desaturases (SAD) and acyl-acyl carrier protein (ACP) thioesterases (FAT). In both cases, these were found to be large gene families with complex expression profiles across a wide range of tissue types and developmental stages. The detailed classification of the oil palm SAD and FAT genes has enabled the updating of the latest version of the oil palm gene model. The second analysis focused on disease resistance (R) genes in order to elucidate possible candidates for breeding of pathogen tolerance/resistance. Ortholog analysis showed that 141 out of the 210 putative oil palm R genes had homologs in banana and rice. These genes formed 37 clusters with 634 orthologous genes. Classification of the 141 oil palm R genes showed that the genes belong to the Kinase (7), CNL (95), MLO-like (8), RLK (3) and Others (28) categories. The CNL R genes formed eight clusters. Expression data for selected R genes also identified potential candidates for breeding of disease resistance traits. Furthermore, these findings can provide information about the species evolution as well as the identification of agronomically important genes in oil palm and other major crops. PMID:29672525
2013-01-01
Background Transcription factors (TFs) are vital elements that regulate transcription and the spatio-temporal expression of genes, thereby ensuring the accurate development and functioning of an organism. The identification of TF-encoding genes in a liverwort, Marchantia polymorpha, offers insights into TF organization in the members of the most basal lineages of land plants (embryophytes). Therefore, a comparison of Marchantia TF genes with other land plants (monocots, dicots, bryophytes) and algae (chlorophytes, rhodophytes) provides the most comprehensive view of the rates of expansion or contraction of TF genes in plant evolution. Results In this study, we report the identification of TF-encoding transcripts in M. polymorpha for the first time, as evidenced by deep RNA sequencing data. In total, 3,471 putative TF encoding transcripts, distributed in 80 families, were identified, representing 7.4% of the generated Marchantia gametophytic transcriptome dataset. Overall, TF basic functions and distribution across families appear to be conserved when compared to other plant species. However, it is of interest to observe the genesis of novel sequences in 24 TF families and the apparent termination of 2 TF families with the emergence of Marchantia. Out of 24 TF families, 6 are known to be associated with plant reproductive development processes. We also examined the expression pattern of these TF-encoding transcripts in six male and female developmental stages in vegetative and reproductive gametophytic tissues of Marchantia. Conclusions The analysis highlighted the importance of Marchantia, a model plant system, in an evolutionary context. The dataset generated here provides a scientific resource for TF gene discovery and other comparative evolutionary studies of land plants. PMID:24365221
Quantitative Proteomic Analysis of the Hfq-Regulon in Sinorhizobium meliloti 2011
Sobrero, Patricio; Schlüter, Jan-Philip; Lanner, Ulrike; Schlosser, Andreas; Becker, Anke; Valverde, Claudio
2012-01-01
Riboregulation stands for RNA-based control of gene expression. In bacteria, small non-coding RNAs (sRNAs) are a major class of riboregulatory elements, most of which act at the post-transcriptional level by base-pairing with target mRNAs. The RNA chaperone Hfq facilitates antisense interactions between target mRNAs and regulatory sRNAs, thus influencing mRNA stability and/or translation rate. In the α-proteobacterium Sinorhizobium meliloti strain 2011, the identification and detection of multiple sRNA genes and the broadly pleiotropic phenotype associated with the absence of a functional Hfq protein both support the existence of riboregulatory circuits controlling gene expression to ensure the fitness of this bacterium in both free-living and symbiotic conditions. In order to identify target mRNAs subject to Hfq-dependent riboregulation, we have compared the proteome of an hfq mutant and the wild type S. meliloti by quantitative proteomics following protein labelling with 15N. Among 2139 univocally identified proteins, a total of 195 proteins showed a differential abundance between the Hfq mutant and the wild type strain; 65 proteins accumulated ≥2-fold whereas 130 were downregulated (≤0.5-fold) in the absence of Hfq. This profound proteomic impact implies a major role for Hfq in the regulation of diverse physiological processes in S. meliloti, from transport of small molecules to homeostasis of iron and nitrogen. Changes in the cellular levels of proteins involved in transport of nucleotides, peptides and amino acids, and in iron homeostasis, were confirmed with phenotypic assays. These results represent the first quantitative proteomic analysis in S. meliloti. The comparative analysis of the hfq mutant proteome allowed identification of novel strongly Hfq-regulated genes in S. meliloti. PMID:23119037
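The ≥2-fold and ≤0.5-fold cut-offs quoted in the abstract above can be illustrated with a small classifier. This is only a minimal sketch: the function name, the protein labels and the ratios are hypothetical and are not taken from the study, and the abstract does not describe how the ratios were normalised or statistically filtered.

```python
def classify_fold_change(ratio, up=2.0, down=0.5):
    """Classify a mutant/wild-type abundance ratio.

    The default thresholds mirror the >=2-fold and <=0.5-fold cut-offs
    mentioned in the abstract; everything else here is an assumption.
    """
    if ratio <= 0:
        raise ValueError("abundance ratio must be positive")
    if ratio >= up:
        return "accumulated"
    if ratio <= down:
        return "downregulated"
    return "unchanged"


# Hypothetical ratios, for illustration only (not values from the study).
example_ratios = {"protA": 3.1, "protB": 0.4, "protC": 1.2}
calls = {name: classify_fold_change(r) for name, r in example_ratios.items()}
```

Both thresholds are treated as inclusive, which matches the "≥" and "≤" signs used in the abstract.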
DOE Office of Scientific and Technical Information (OSTI.GOV)
Berman, Benjamin P.; Pfeiffer, Barret D.; Laverty, Todd R.
2004-08-06
Background The identification of sequences that control transcription in metazoans is a major goal of genome analysis. In a previous study, we demonstrated that searching for clusters of predicted transcription factor binding sites could discover active regulatory sequences, and identified 37 regions of the Drosophila melanogaster genome with high densities of predicted binding sites for five transcription factors involved in anterior-posterior embryonic patterning. Nine of these clusters overlapped known enhancers. Here, we report the results of in vivo functional analysis of 27 remaining clusters. Results We generated transgenic flies carrying each cluster attached to a basal promoter and reporter gene, and assayed embryos for reporter gene expression. Six clusters are enhancers of adjacent genes: giant, fushi tarazu, odd-skipped, nubbin, squeeze and pdm2; three drive expression in patterns unrelated to those of neighboring genes; the remaining 18 do not appear to have enhancer activity. We used the Drosophila pseudoobscura genome to compare patterns of evolution in and around the 15 positive and 18 false-positive predictions. Although conservation of primary sequence cannot distinguish true from false positives, conservation of binding-site clustering accurately discriminates functional binding-site clusters from those with no function. We incorporated conservation of binding-site clustering into a new genome-wide enhancer screen, and predict several hundred new regulatory sequences, including 85 adjacent to genes with embryonic patterns. Conclusions Measuring conservation of sequence features closely linked to function - such as binding-site clustering - makes better use of comparative sequence data than commonly used methods that examine only sequence identity.
Characterization of human septic sera induced gene expression modulation in human myocytes
Hussein, Shaimaa; Michael, Paul; Brabant, Danielle; Omri, Abdelwahab; Narain, Ravin; Passi, Kalpdrum; Ramana, Chilakamarti V.; Parrillo, Joseph E.; Kumar, Anand; Parissenti, Amadeo; Kumar, Aseem
2009-01-01
To gain a better understanding of the gene expression changes that occur during sepsis, we have performed a cDNA microarray study utilizing a tissue culture model that mimics human sepsis. This study utilized an in vitro model of cultured human fetal cardiac myocytes treated with 10% sera from septic patients or 10% sera from healthy volunteers. A 1700 cDNA expression microarray was used to compare the transcription profile from human cardiac myocytes treated with septic sera vs normal sera. Septic sera treatment of myocytes resulted in the down-regulation of 178 genes and the up-regulation of 4 genes. Our data indicate that septic sera induced cell cycle, metabolic, transcription factor and apoptotic gene expression changes in human myocytes. Identification and characterization of gene expression changes that occur during sepsis may lead to the development of novel therapeutics and diagnostics. PMID:19684886
Genetics of intellectual disability in consanguineous families.
Hu, Hao; Kahrizi, Kimia; Musante, Luciana; Fattahi, Zohreh; Herwig, Ralf; Hosseini, Masoumeh; Oppitz, Cornelia; Abedini, Seyedeh Sedigheh; Suckow, Vanessa; Larti, Farzaneh; Beheshtian, Maryam; Lipkowitz, Bettina; Akhtarkhavari, Tara; Mehvari, Sepideh; Otto, Sabine; Mohseni, Marzieh; Arzhangi, Sanaz; Jamali, Payman; Mojahedi, Faezeh; Taghdiri, Maryam; Papari, Elaheh; Soltani Banavandi, Mohammad Javad; Akbari, Saeide; Tonekaboni, Seyed Hassan; Dehghani, Hossein; Ebrahimpour, Mohammad Reza; Bader, Ingrid; Davarnia, Behzad; Cohen, Monika; Khodaei, Hossein; Albrecht, Beate; Azimi, Sarah; Zirn, Birgit; Bastami, Milad; Wieczorek, Dagmar; Bahrami, Gholamreza; Keleman, Krystyna; Vahid, Leila Nouri; Tzschach, Andreas; Gärtner, Jutta; Gillessen-Kaesbach, Gabriele; Varaghchi, Jamileh Rezazadeh; Timmermann, Bernd; Pourfatemi, Fatemeh; Jankhah, Aria; Chen, Wei; Nikuei, Pooneh; Kalscheuer, Vera M; Oladnabi, Morteza; Wienker, Thomas F; Ropers, Hans-Hilger; Najmabadi, Hossein
2018-01-04
Autosomal recessive (AR) gene defects are the leading genetic cause of intellectual disability (ID) in countries with frequent parental consanguinity, which account for about 1/7th of the world population. Yet, compared to autosomal dominant de novo mutations, which are the predominant cause of ID in Western countries, the identification of AR-ID genes has lagged behind. Here, we report on whole exome and whole genome sequencing in 404 consanguineous predominantly Iranian families with two or more affected offspring. In 219 of these, we found likely causative variants, involving 77 known and 77 novel AR-ID (candidate) genes, 21 X-linked genes, as well as 9 genes previously implicated in diseases other than ID. This study, the largest of its kind published to date, illustrates that high-throughput DNA sequencing in consanguineous families is a superior strategy for elucidating the thousands of hitherto unknown gene defects underlying AR-ID, and it sheds light on their prevalence.
Differentially regulated gene expression associated with hepatitis C virus clearance.
Grimes, Carolyn Z; Hwang, Lu-Yu; Wei, Peng; Shah, Dimpy P; Volcik, Kelly A; Brown, Eric L
2013-03-01
Human chronic hepatitis C virus (HCV) infections pose a significant public health threat, necessitating the development of novel treatments and vaccines. HCV infections range from spontaneous resolution to end-stage liver disease. Approximately 10-30% of HCV infections undergo spontaneous resolution independent of treatment by yet-to-be-defined mechanisms. These individuals test positive for anti-HCV antibodies in the absence of detectable viral serum RNA. To identify genes associated with HCV clearance, this study compared gene expression profiles between current drug users chronically infected with HCV and drug users who cleared their HCV infection. This analysis identified 91 differentially regulated (up- or downregulated by twofold or more) genes potentially associated with HCV clearance. The majority of genes identified were associated with immune function, with the remaining genes categorized either as cancer related or 'other'. Identification of factors and pathways that may influence virus clearance will be essential to the development of novel treatment strategies.
Gene and translation initiation site prediction in metagenomic sequences
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hyatt, Philip Douglas; LoCascio, Philip F; Hauser, Loren John
2012-01-01
Gene prediction in metagenomic sequences remains a difficult problem. Current sequencing technologies do not achieve sufficient coverage to assemble the individual genomes in a typical sample; consequently, sequencing runs produce a large number of short sequences whose exact origin is unknown. Since these sequences are usually smaller than the average length of a gene, algorithms must make predictions based on very little data. We present MetaProdigal, a metagenomic version of the gene prediction program Prodigal, that can identify genes in short, anonymous coding sequences with a high degree of accuracy. The novel value of the method consists of enhanced translation initiation site identification, ability to identify sequences that use alternate genetic codes and confidence values for each gene call. We compare the results of MetaProdigal with other methods and conclude with a discussion of future improvements.
Milanesi, Luciano; Petrillo, Mauro; Sepe, Leandra; Boccia, Angelo; D'Agostino, Nunzio; Passamano, Myriam; Di Nardo, Salvatore; Tasco, Gianluca; Casadio, Rita; Paolella, Giovanni
2005-01-01
Background Protein kinases are a well defined family of proteins, characterized by the presence of a common kinase catalytic domain and playing a significant role in many important cellular processes, such as proliferation, maintenance of cell shape and apoptosis. In many members of the family, additional non-kinase domains contribute further specialization, resulting in subcellular localization, protein binding and regulation of activity, among others. About 500 genes encode members of the kinase family in the human genome, and although many of them represent well known genes, a larger number of genes code for proteins of more recent identification, or for unknown proteins identified as kinases only after computational studies. Results A systematic in silico study performed on the human genome led to the identification of 5 genes, on chromosomes 1, 11, 13, 15 and 16 respectively, and 1 pseudogene on chromosome X; some of these genes are reported as kinases by NCBI but are absent in other databases, such as KinBase. Comparative analysis of 483 gene regions and subsequent computational analysis, aimed at identifying unannotated exons, indicates that a large number of kinases may code for alternately spliced forms or be incorrectly annotated. An InterProScan automated analysis was performed to study domain distribution and combination in the various families. At the same time, other structural features were also added to the annotation process, including the putative presence of transmembrane alpha helices, and the propensity of cysteines to participate in a disulfide bridge. Conclusion The predicted human kinome was extended by identifying both additional genes and potential splice variants, resulting in a varied panorama where functionality may be searched at the gene and protein level.
Structural analysis of kinase protein domains as defined in multiple sources, together with transmembrane alpha helix and signal peptide prediction, provides hints to function assignment. The results of the human kinome analysis are collected in the KinWeb database, available for browsing and searching over the internet, where all results from the comparative analysis and the gene structure annotation are made available, alongside the domain information. Kinases may be searched by domain combinations and the relative genes may be viewed in a graphic browser at various levels of magnification up to gene organization on the full chromosome set. PMID:16351747
Tian, Qian; Zhao, Wenjun; Lu, Songyu; Zhu, Shuifang; Li, Shidong
2016-01-01
Genus Xanthomonas comprises many economically important plant pathogens that affect a wide range of hosts. Indeed, fourteen Xanthomonas species/pathovars have been regarded as official quarantine bacteria for imports in China. To date, however, a rapid and accurate method capable of identifying all of the quarantine species/pathovars has yet to be developed. In this study, we therefore evaluated the capacity of DNA barcoding as a digital identification method for discriminating quarantine species/pathovars of Xanthomonas. For these analyses, 327 isolates, representing 45 Xanthomonas species/pathovars, as well as five additional species/pathovars from GenBank (50 species/pathovars total), were utilized to test the efficacy of four DNA barcode candidate genes (16S rRNA gene, cpn60, gyrB, and avrBs2). Of these candidate genes, cpn60 displayed the highest rate of PCR amplification and sequencing success. The tree-building (Neighbor-joining), ‘best close match’, and barcode gap methods were subsequently employed to assess the species- and pathovar-level resolution of each gene. Notably, all isolates of each quarantine species/pathovars formed a monophyletic group in the neighbor-joining tree constructed using the cpn60 sequences. Moreover, cpn60 also demonstrated the most satisfactory results in both barcoding gap analysis and the ‘best close match’ test. Thus, compared with the other markers tested, cpn60 proved to be a powerful DNA barcode, providing a reliable and effective means for the species- and pathovar-level identification of the quarantine plant pathogen Xanthomonas. PMID:27861494
NASA Astrophysics Data System (ADS)
Agung, Muhammad Budi; Budiarsa, I. Made; Suwastika, I. Nengah
2017-02-01
Cocoa bean is one of Indonesia's main export commodities, but production still suffers from yield losses due to pathogen and disease attack. Developing robust cacao plants that are genetically resistant to pathogen and disease attack is an ideal solution to this problem. The aim of this study was to identify, through in silico analysis, Theobroma cacao genes in the cacao genome database that are homologous to pathogen- and disease-response genes in other plants. Basic information survey and gene identification were performed in GenBank and The Arabidopsis Information Resource database. The in silico analysis comprised protein BLAST, homology testing of each gene's protein candidates, and identification of homologous genes in the Cacao Genome Database using the "Theobroma cacao cv. Matina 1-6 v1.1" genome as the data source. The analysis found that Thecc1EG011959t1 (EDS1), Thecc1EG006803t1 (EDS5), Thecc1EG013842t1 (ICS1), and Thecc1EG015614t1 (BG_PPAP) in the Cacao Genome Database are Theobroma cacao genes homologous to plant resistance genes and are highly likely to have functions similar to those of their respective homologues.
Comparative transcriptional profiling-based identification of raphanusanin-inducible genes
2010-01-01
Background Raphanusanin (Ra) is a light-induced growth inhibitor involved in the inhibition of hypocotyl growth in response to unilateral blue-light illumination in radish seedlings. Knowledge of the roles of Ra still remains elusive. To understand the roles of Ra and its functional coupling to light signalling, we constructed the Ra-induced gene library using the Suppression Subtractive Hybridisation (SSH) technique and present a comparative investigation of gene regulation in radish seedlings in response to short-term Ra and blue-light exposure. Results The predicted gene ontology (GO) term revealed that 55% of the clones in the Ra-induced gene library were associated with genes involved in common defence mechanisms, including thirty four genes homologous to Arabidopsis genes implicated in R-gene-triggered resistance in the programmed cell death (PCD) pathway. Overall, the library was enriched with transporters, hydrolases, protein kinases, and signal transducers. The transcriptome analysis revealed that, among the fifty genes from various functional categories selected from 88 independent genes of the Ra-induced library, 44 genes were up-regulated and 4 were down-regulated. The comparative analysis showed that, among the transcriptional profiles of 33 highly Ra-inducible genes, 25 ESTs were commonly regulated by different intensities and duration of blue-light irradiation. The transcriptional profiles, coupled with the transcriptional regulation of early blue light, have provided the functional roles of many genes expected to be involved in the light-mediated defence mechanism. Conclusions This study is the first comprehensive survey of transcriptional regulation in response to Ra. The results described herein suggest a link between Ra and cellular defence and light signalling, and thereby contribute to further our understanding of how Ra is involved in light-mediated mechanisms of plant defence. PMID:20553608
Ren, Lipin; Chen, Wei; Shang, Yanjie; Meng, Fanming; Zha, Lagabaiyila; Wang, Yong; Guo, Yadong
2018-05-17
Muscid flies (Diptera: Muscidae) are of great forensic importance due to their wide distribution and their ubiquitous, synanthropic nature. They are frequently neglected as they tend to arrive at corpses later than flesh flies and blow flies. Moreover, the lack of species-level identification also hinders medicolegal investigation. To overcome the difficulty of morphological identification, molecular methods have gained relevance, and the cytochrome c oxidase subunit I (COI) gene has been widely utilized. Nonetheless, to achieve correct identification of an unknown sample, it is important to survey muscid taxa across their geographic distribution range. Accordingly, the aim of this study is to contribute more geographically specific data. We sequenced the COI gene of 51 muscid specimens of 12 species, and added all correct sequences available in GenBank to yield a total data set of 125 COI sequences from 33 muscid species to evaluate the COI gene as a molecular diagnostic tool. The interspecific distances were extremely high (4.7-19.8%) in both the standard barcoding fragment (658 bp) and the long COI sequence (1,019-1,535 bp), demonstrating that these two genetic markers performed nearly identically in species identification. However, the intraspecific distances of the long COI sequences were significantly higher than those of the barcoding region for conspecific specimens whose geographical locations vary greatly. Therefore, the genetic diversity presented in this study provides a reference for species identification of muscid flies. Nevertheless, further investigation and data from more muscid species are required to enhance the efficacy of species-level identification using the COI gene as a genetic marker.
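The inter- and intraspecific distances reported above are pairwise distances between aligned COI sequences. The abstract does not say which distance model was used, so the following is only a sketch of the simplest option, the uncorrected p-distance (proportion of differing sites), applied to made-up toy sequences; the function name and the choice to skip gaps and ambiguous bases are assumptions.

```python
def p_distance(seq_a, seq_b):
    """Uncorrected p-distance between two aligned DNA sequences.

    Only sites where both sequences carry an unambiguous base (A, C, G, T)
    are compared; gaps and ambiguity codes are skipped (an assumption).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differences = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in "ACGT" and b in "ACGT":
            compared += 1
            if a != b:
                differences += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differences / compared


# Toy aligned fragments, for illustration only (not real COI sequences).
toy_distance = p_distance("ACGTACGTAC", "ACGTTCGTAC")
gapped_distance = p_distance("ACGT-CGTAC", "ACGTTCGTAA")
```

Multiplying the result by 100 gives a percentage comparable in form to the 4.7-19.8% range quoted above, although a corrected model would give somewhat different values.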
Abraham, Tintu; Sistla, Sujatha
2016-07-01
Traditionally, Group A Streptococcus pyogenes (GAS) is differentiated from other beta haemolytic streptococci (BHS) by certain presumptive tests such as bacitracin sensitivity and production of pyrrolidonyl arylamidase (PYR). The phenotypic and genotypic confirmatory tests are Lancefield grouping for cell wall carbohydrate antigen and PCR for the spy1258 gene, respectively. Reliance on presumptive tests alone may lead to misidentification of isolates. The aim of this study was to compare the predictive values of routine phenotypic tests with spy1258 PCR for the identification of Streptococcus pyogenes. This comparative analytical study was carried out in the Department of Microbiology, JIPMER, Puducherry, over a period of 18 months (1st November 2013 to 30th April 2015). Two hundred and six consecutive BHS isolates from various clinical samples were subjected to phenotypic tests such as bacitracin sensitivity, PYR test and Lancefield grouping. The results were compared with spy1258 PCR, which was considered the confirmatory test for identification. The sensitivity and specificity of the phenotypic tests were as follows: susceptibility to bacitracin, 95.42% and 70.96%; PYR test, 95.42% and 77.41%; Lancefield grouping, 97.71% and 80.64%. Clinical laboratories should not depend on bacitracin sensitivity as a single presumptive test for the routine identification of GAS but should use supplemental tests such as PYR test or latex agglutination test and, for best results, use spy1258 PCR.
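Sensitivity, specificity and predictive values such as those quoted above are all derived from a 2x2 table comparing a phenotypic test with the reference (here, spy1258 PCR). The abstract reports only percentages, so the sketch below uses hypothetical counts chosen purely to make the arithmetic concrete; the function name and the counts are assumptions, not data from the study.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Return sensitivity, specificity, PPV and NPV from 2x2 table counts.

    tp/fn: reference-positive isolates called positive/negative by the test.
    tn/fp: reference-negative isolates called negative/positive by the test.
    """
    if min(tp, fp, fn, tn) < 0:
        raise ValueError("counts must be non-negative")
    if tp + fn == 0 or tn + fp == 0 or tp + fp == 0 or tn + fn == 0:
        raise ValueError("each margin of the table must be non-zero")
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
    }


# Hypothetical counts, for illustration only (not from the study).
metrics = diagnostic_metrics(tp=90, fp=10, fn=10, tn=40)
```

Unlike sensitivity and specificity, the predictive values depend on how common the target organism is among the isolates tested, so they do not transfer directly between laboratories.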
Goh, Swee Han; Driedger, David; Gillett, Sandra; Low, Donald E.; Hemmingsen, Sean M.; Amos, Mayben; Chan, David; Lovgren, Marguerite; Willey, Barbara M.; Shaw, Carol; Smith, John A.
1998-01-01
It was recently reported that Streptococcus iniae, a bacterial pathogen of aquatic animals, can cause serious disease in humans. Using the chaperonin 60 (Cpn60) gene identification method with reverse checkerboard hybridization and chemiluminescent detection, we identified correctly each of 12 S. iniae samples among 34 aerobic gram-positive isolates from animal and clinical human sources. PMID:9650992
USDA-ARS's Scientific Manuscript database
Soybean is the second largest crop in the US. Its yield directly impacts US agricultural economics. Drought and flooding are two major causes for soybean yield loss. To better understand their underlying molecular regulatory mechanisms, we sequenced the transcriptomes of soybean grown in drought a...
USDA-ARS's Scientific Manuscript database
Silencing phytochrome A1 gene (PHYA1) by RNA interference in Upland cotton (Gossypium hirsutum L. cv. Coker 312) had generated PHYA1 RNAi lines with simultaneously improved fiber quality (longer, stronger and finer fiber) and other key agronomic traits. Comparative analyses of altered molecular proc...
USDA-ARS's Scientific Manuscript database
COMPARATIVE GENE IDENTIFICATION-58 (CGI-58) is a key regulator of lipid metabolism and signaling in mammals, but its underlying mechanisms are unclear. Disruption of CGI-58 in either mammals or plants results in a significant increase in triacylglycerol (TAG), suggesting that CGI-58 activity is evol...
Alanio, A; Garcia-Hermoso, D; Mercier-Delarue, S; Lanternier, F; Gits-Muselli, M; Menotti, J; Denis, B; Bergeron, A; Legrand, M; Lortholary, O; Bretagne, S
2015-06-01
Molecular methods are crucial for mucormycosis diagnosis because cultures are frequently negative, even if microscopy suggests the presence of hyphae in tissues. We assessed PCR/electrospray-ionization mass spectrometry (PCR/ESI-MS) for Mucorales identification in 19 unfixed tissue samples from 13 patients with proven or probable mucormycosis and compared the results with culture, quantitative real-time PCR, 16S-23S rRNA gene internal transcribed spacer region (ITS PCR) and 18S PCR sequencing. Concordance with culture identification to both genus and species levels was higher for PCR/ESI-MS than for the other techniques. Thus, PCR/ESI-MS is suitable for Mucorales identification, within 6 hours, for tissue samples for which microscopy results suggest the presence of hyphae. Copyright © 2015 European Society of Clinical Microbiology and Infectious Diseases. Published by Elsevier Ltd. All rights reserved.
Cell type-selective disease-association of genes under high regulatory load
Galhardo, Mafalda; Berninger, Philipp; Nguyen, Thanh-Phuong; Sauter, Thomas; Sinkkonen, Lasse
2015-01-01
We previously showed that disease-linked metabolic genes are often under combinatorial regulation. Using the genome-wide ChIP-Seq binding profiles for 93 transcription factors in nine different cell lines, we show that genes under high regulatory load are significantly enriched for disease-association across cell types. We find that transcription factor load correlates with the enhancer load of the genes and thereby allows the identification of genes under high regulatory load by epigenomic mapping of active enhancers. Identification of the high enhancer load genes across 139 samples from 96 different cell and tissue types reveals a consistent enrichment for disease-associated genes in a cell type-selective manner. The underlying genes are not limited to super-enhancer genes and show several types of disease-association evidence beyond genetic variation (such as biomarkers). Interestingly, the high regulatory load genes are involved in more KEGG pathways than expected by chance, exhibit increased betweenness centrality in the interaction network of liver disease genes, and carry longer 3′ UTRs with more microRNA (miRNA) binding sites than genes on average, suggesting a role as hubs integrating signals within regulatory networks. In summary, epigenetic mapping of active enhancers presents a promising and unbiased approach for identification of novel disease genes in a cell type-selective manner. PMID:26338775
Identification of bacteria isolated from veterinary clinical specimens using MALDI-TOF MS.
Pavlovic, Melanie; Wudy, Corinna; Zeller-Peronnet, Veronique; Maggipinto, Marzena; Zimmermann, Pia; Straubinger, Alix; Iwobi, Azuka; Märtlbauer, Erwin; Busch, Ulrich; Huber, Ingrid
2015-01-01
Matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) has recently emerged as a rapid and accurate identification method for bacterial species. Although it has been successfully applied for the identification of human pathogens, it has so far not been well evaluated for routine identification of veterinary bacterial isolates. This study was performed to compare and evaluate the performance of MALDI-TOF MS based identification of veterinary bacterial isolates with commercially available conventional test systems. Discrepancies of both methods were resolved by sequencing 16S rDNA and, if necessary, the infB gene for Actinobacillus isolates. A total of 375 consecutively isolated veterinary samples were collected. Among the 357 isolates (95.2%) correctly identified at the genus level by MALDI-TOF MS, 338 of them (90.1% of the total isolates) were also correctly identified at the species level. Conventional methods offered correct species identification for 319 isolates (85.1%). MALDI-TOF identification therefore offered more accurate identification of veterinary bacterial isolates. An update of the in-house mass spectra database with additional reference spectra clearly improved the identification results. In conclusion, the presented data suggest that MALDI-TOF MS is an appropriate platform for classification and identification of veterinary bacterial isolates.
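The percentages in the abstract above follow directly from the stated counts out of 375 isolates, and the arithmetic can be rechecked in a few lines. This is a minimal sketch; the helper name and the dictionary labels are illustrative, while the counts and reported percentages are the ones given in the abstract.

```python
def percent(part, total, ndigits=1):
    """Return part/total as a percentage rounded to ndigits decimals."""
    if total <= 0:
        raise ValueError("total must be positive")
    return round(100.0 * part / total, ndigits)


TOTAL_ISOLATES = 375

# (count, percentage as reported in the abstract)
reported = {
    "MALDI-TOF MS, genus level": (357, 95.2),
    "MALDI-TOF MS, species level": (338, 90.1),
    "conventional, species level": (319, 85.1),
}

recomputed = {
    label: percent(count, TOTAL_ISOLATES)
    for label, (count, _) in reported.items()
}
```

The recomputed values agree with the reported ones to one decimal place.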
Yamada, Takahisa; Muramatsu, Youji; Taniguchi, Yukio; Sasaki, Yoshiyuki
Our previous study detected 291 and 77 genes showing early embryonic death-associated elevation and reduction of expression, respectively, in the fetal placenta of cows carrying somatic nuclear transfer-derived cloned embryos. In this study, we mapped the 10 genes showing the most significant elevation and the 10 genes showing the most significant reduction, using somatic cell hybrids and the bovine draft genome sequence. We then compared the mapped positions for these genes with the genomic locations of bovine quantitative trait loci for still-birth and/or abortion. Among the mapped genes, peptidylglycine alpha-amidating monooxygenase (PAM), spectrin, beta, nonerythrocytic 1 (SPTBN1), and an unknown novel gene containing the AU277832 expressed sequence tag were intriguing, in that their mapped positions were consistent with the genomic locations of bovine still-birth and/or abortion quantitative trait loci. They were thus identified as positional candidates for bovine placental genes responsible for early embryonic death during pregnancy attempted by somatic nuclear transfer-derived cloning.
Terabayashi, Yasunobu; Sano, Motoaki; Yamane, Noriko; Marui, Junichiro; Tamano, Koichi; Sagara, Junichi; Dohmoto, Mitsuko; Oda, Ken; Ohshima, Eiji; Tachibana, Kuniharu; Higa, Yoshitaka; Ohashi, Shinichi; Koike, Hideaki; Machida, Masayuki
2010-12-01
Kojic acid is produced in large amounts by Aspergillus oryzae as a secondary metabolite and is widely used in the cosmetic industry. Glucose can be converted to kojic acid, perhaps by only a few steps, but no genes for the conversion have thus far been revealed. Using a DNA microarray, gene expression profiles under three pairs of conditions significantly affecting kojic acid production were compared. All genes were ranked using an index parameter reflecting both high amounts of transcription and a high induction ratio under producing conditions. After disruption of nine candidate genes selected from the top of the list, two genes of unknown function were found to be responsible for kojic acid biosynthesis, one having an oxidoreductase motif and the other a transporter motif. These two genes are closely associated in the genome, showing typical characteristics of genes involved in secondary metabolism. Copyright © 2010 Elsevier Inc. All rights reserved.
Turyagyenda, Laban F.; Kizito, Elizabeth B.; Ferguson, Morag; Baguma, Yona; Agaba, Morris; Harvey, Jagger J. W.; Osiru, David S. O.
2013-01-01
Cassava is an important root crop to resource-poor farmers in marginal areas, where its production faces drought stress constraints. Given the difficulties associated with cassava breeding, a molecular understanding of drought tolerance in cassava will help in the identification of markers for use in marker-assisted selection and genes for transgenic improvement of drought tolerance. This study was carried out to identify candidate drought-tolerance genes and expression-based markers of drought stress in cassava. One drought-tolerant (improved variety) and one drought-susceptible (farmer-preferred) cassava landrace were grown in the glasshouse under well-watered and water-stressed conditions. Their morphological, physiological and molecular responses to drought were characterized. Morphological and physiological measurements indicate that the tolerance of the improved variety is based on drought avoidance, through reduction of water loss via partial stomatal closure. Ten genes that have previously been biologically validated as conferring or being associated with drought tolerance in other plant species were confirmed as being drought responsive in cassava. Four genes (MeALDH, MeZFP, MeMSD and MeRD28) were identified as candidate cassava drought-tolerance genes, as they were exclusively up-regulated in the drought-tolerant genotype to comparable levels known to confer drought tolerance in other species. Based on these genes, we hypothesize that the basis of the tolerance at the cellular level is probably through mitigation of the oxidative burst and osmotic adjustment. This study provides an initial characterization of the molecular response of cassava to drought stress resembling field conditions. The drought-responsive genes can now be used as expression-based markers of drought stress tolerance in cassava, and the candidate tolerance genes tested in the context of breeding (as possible quantitative trait loci) and engineering drought tolerance in transgenics. 
PMID:23519782
Shimada, Nao; Maeda, Mineko; Urushihara, Hideko; Kawata, Takefumi
2004-09-01
Signal Transducers and Activators of Transcription (STATs) are transcription factors which lie at the end of cytokine and growth signal transduction pathways. Dictyostelium Dd-STATa is a functional homologue of metazoan STATs. It is activated by cAMP and, at the slug stage, it translocates into the nuclei of the tip cells, which are a subset of the anterior, prestalk A (pstA) cells. Here we searched for novel Dd-STATa regulated genes by in situ hybridisation. A set of 54 cDNA clones whose gene expression patterns are known to be prestalk-specific (Maeda et al., 2003) were chosen as probes, and we compared their expression patterns in parental and Dd-STATa-null strains. We identified 13 genes which are candidates for direct induction by Dd-STATa. In the parental strain, most of these genes are expressed in the cone-shaped mass of pstAB cells which is located within the prestalk region. These cDNAs show little or no expression in the Dd-STATa-null strain. This contrasts markedly with the paradigmatic ecmB gene, which is expressed in pstAB cells in parental cells but throughout the prestalk zone in the Dd-STATa-null strain. We also identified several genes which are normally expressed in pstA cells, or throughout the prestalk region, but whose expression is markedly down-regulated in the null mutant. Again, this contrasts with markers derived from the paradigmatic ecmA gene, which are expressed normally in the Dd-STATa-null strain. The identification of these novel genes provides valuable tools to investigate the role of Dd-STATa.
Xiao, Lin-Fan; Zhang, Wei; Jing, Tian-Xing; Zhang, Meng-Yi; Miao, Ze-Qing; Wei, Dan-Dan; Yuan, Guo-Rui; Wang, Jin-Jun
2018-03-01
The ATP-binding cassette (ABC) family is the largest transporter gene family, and its genes play key roles in xenobiotic resistance, metabolism, and development in all phyla. However, the specific functions of ABC gene families in insects are unclear. We report a genome-wide identification, phylogenetic, and transcriptional analysis of the ABC genes in the oriental fruit fly, Bactrocera dorsalis (Hendel). We identified a total of 47 ABC genes (BdABCs) from the transcriptomic and genomic databases of B. dorsalis and classified these genes into eight subfamilies (A-H), including 7 ABCAs, 7 ABCBs, 9 ABCCs, 2 ABCDs, 1 ABCE, 3 ABCFs, 15 ABCGs, and 3 ABCHs. Comparative phylogenetic analysis of the ABCs suggests an orthologous relationship between B. dorsalis and other insect species in which these genes have been related to pesticide resistance and essential biological processes. Comparison of transcriptome and relative expression patterns of BdABCs indicated diverse multifunctions within different B. dorsalis tissues. The expression of 4, 10, and 14 BdABCs from 18 BdABCs was significantly upregulated after exposure to LD50s of malathion, avermectin, and beta-cypermethrin, respectively. The maximum expression level of most BdABCs (including BdABCFs, BdABCGs, and BdABCHs) occurred at 48 h post exposure, whereas BdABCEs peaked at 24 h after treatment. Furthermore, RNA interference-mediated suppression of BdABCB7 resulted in increased toxicity of malathion against B. dorsalis. These data suggest that ABC transporter genes might play key roles in xenobiotic metabolism and biosynthesis in B. dorsalis. Copyright © 2017 Elsevier Inc. All rights reserved.
Ashtiani, Nafiseh Mohebbi; Kachuei, Reza; Yalfani, Roozbeh; Harchegani, Asghar Beigi; Nosratabadi, Mohsen
2017-06-01
Aspergillus species are important in medicine, agriculture and various industries. The sections Fumigati, Flavi, and Nigri are the most important members of the Aspergillus genus. This study intended to identify and separate these three Aspergillus sections and to differentiate among them using specific primers. A bioinformatics study was initially performed to analyse the sequences of five genes, namely, beta-tubulin, calmodulin, the pre-rRNA processing protein Tsr1, the DNA-replication licensing factor Mcm7, and RNA polymerase II second largest subunit (RPB2) in the three Aspergillus sections using MEGA6 software and the NCBI database. Primers were designed to select genes for each of the Aspergillus sections being analysed. A total of 134 environmental and clinical Aspergillus isolates were collected, purified and initially identified by colony morphology. Subsequently, DNA was extracted using the phenol-chloroform method, specific primers were synthesized, PCR was performed for DNA from all isolates, and the results were compared to morphological characteristics. Of the 134 isolates tested, 56 were Nigri, 32 were Fumigati, 32 were Flavi, and the rest (14 isolates) belonged to other sections. The beta-tubulin and calmodulin genes were found to be the most suitable for differentiating among these three groups; the beta-tubulin gene was used for molecular identification of Aspergillus section Fumigati, and the calmodulin gene for identifying sections Flavi and Nigri.
Combining Genotype, Phenotype, and Environment to Infer Potential Candidate Genes.
Talbot, Benoit; Chen, Ting-Wen; Zimmerman, Shawna; Joost, Stéphane; Eckert, Andrew J; Crow, Taylor M; Semizer-Cuming, Devrim; Seshadri, Chitra; Manel, Stéphanie
2017-03-01
Population genomic analysis can be an important tool in understanding local adaptation. Identification of potential adaptive loci in such analyses is usually based on the survey of a large genomic dataset in combination with environmental variables. Phenotypic data are less commonly incorporated into such studies, although combining a genome scan analysis with a phenotypic trait analysis can greatly improve the insights obtained from each analysis individually. Here, we aimed to identify loci potentially involved in adaptation to climate in 283 Loblolly pine (Pinus taeda) samples from throughout the species' range in the southeastern United States. We analyzed associations between phenotypic, molecular, and environmental variables from datasets of 3082 single nucleotide polymorphism (SNP) loci and 3 categories of phenotypic traits (gene expression, metabolites, and whole-plant traits). We found only 6 SNP loci that displayed potential signals of local adaptation. Five of the 6 identified SNPs are linked to gene expression traits for lignin development, and 1 is linked with whole-plant traits. We subsequently compared the 6 candidate genes with environmental variables and found a high correlation in only 3 of them (R2 > 0.2). Our study highlights the need for a combination of genotypes, phenotypes, and environmental variables, and for an appropriate sampling scheme and study design, to improve confidence in the identification of potential candidate genes. © The American Genetic Association 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Scoring clustering solutions by their biological relevance.
Gat-Viks, I; Sharan, R; Shamir, R
2003-12-12
A central step in the analysis of gene expression data is the identification of groups of genes that exhibit similar expression patterns. Clustering gene expression data into homogeneous groups was shown to be instrumental in functional annotation, tissue classification, regulatory motif identification, and other applications. Although there is a rich literature on clustering algorithms for gene expression analysis, very few works addressed the systematic comparison and evaluation of clustering results. Typically, different clustering algorithms yield different clustering solutions on the same data, and there is no agreed upon guideline for choosing among them. We developed a novel statistically based method for assessing a clustering solution according to prior biological knowledge. Our method can be used to compare different clustering solutions or to optimize the parameters of a clustering algorithm. The method is based on projecting vectors of biological attributes of the clustered elements onto the real line, such that the ratio of between-groups and within-group variance estimators is maximized. The projected data are then scored using a non-parametric analysis of variance test, and the score's confidence is evaluated. We validate our approach using simulated data and show that our scoring method outperforms several extant methods, including the separation to homogeneity ratio and the silhouette measure. We apply our method to evaluate results of several clustering methods on yeast cell-cycle gene expression data. The software is available from the authors upon request.
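The variance-ratio criterion mentioned in this abstract can be illustrated with a minimal sketch. It assumes the attribute vectors have already been projected onto the real line; it does not find the optimal projection or run the non-parametric test, and it is not the authors' software. The function name `variance_ratio` and the toy values are hypothetical.

```python
from collections import defaultdict


def variance_ratio(values, labels):
    """Ratio of between-group to within-group variance estimators
    for 1-D values grouped by cluster label (one-way ANOVA F form)."""
    groups = defaultdict(list)
    for value, label in zip(values, labels):
        groups[label].append(value)

    n = len(values)
    k = len(groups)
    grand_mean = sum(values) / n

    # Sum of squares of group means around the grand mean, weighted by size.
    between = sum(
        len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups.values()
    )
    # Sum of squares of each value around its own group mean.
    within = sum(
        (v - sum(g) / len(g)) ** 2 for g in groups.values() for v in g
    )
    return (between / (k - 1)) / (within / (n - k))


# Hypothetical projected attribute values for six genes.
projected = [1.0, 1.2, 0.8, 5.0, 5.2, 4.8]
coherent = [0, 0, 0, 1, 1, 1]   # clusters that follow the attribute
shuffled = [0, 1, 0, 1, 0, 1]   # clusters that ignore the attribute

print(variance_ratio(projected, coherent))
print(variance_ratio(projected, shuffled))
```

A clustering solution that groups elements with similar attribute values yields a much larger ratio than one that mixes them, which is the intuition behind scoring solutions by prior biological knowledge.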
Tsai, Pei-Chien; Breen, Matthew
2012-09-01
To identify suitable reference genes for normalization of real-time quantitative PCR (RT-qPCR) assay data for common tumors of dogs. Malignant lymph node (n = 8), appendicular osteosarcoma (9), and histiocytic sarcoma (12) samples and control samples of various nonneoplastic canine tissues. Array-based comparative genomic hybridization (aCGH) data were used to guide selection of 9 candidate reference genes. Expression stability of candidate reference genes and 4 commonly used reference genes was determined for tumor samples with RT-qPCR assays and 3 software programs. LOC611555 was the candidate reference gene with the highest expression stability among the 3 tumor types. Of the commonly used reference genes, expression stability of HPRT was high in histiocytic sarcoma samples, and expression stability of Ubi and RPL32 was high in osteosarcoma samples. Some of the candidate reference genes had higher expression stability than did the commonly used reference genes. Data for constitutively expressed genes with high expression stability are required for normalization of RT-qPCR assay results. Without such data, accurate quantification of gene expression in tumor tissue samples is difficult. Results of the present study indicated LOC611555 may be a useful RT-qPCR assay reference gene for multiple tissue types. Some commonly used reference genes may be suitable for normalization of gene expression data for tumors of dogs, such as lymphomas, osteosarcomas, or histiocytic sarcomas.
Comparing the landscapes of common retroviral insertion sites across tumor models
NASA Astrophysics Data System (ADS)
Weishaupt, Holger; Čančer, Matko; Engström, Cristopher; Silvestrov, Sergei; Swartling, Fredrik J.
2017-01-01
Retroviral tagging represents an important technique, which allows researchers to screen for candidate cancer genes. The technique is based on the integration of retroviral sequences into the genome of a host organism, which might then lead to the artificial inhibition or expression of proximal genetic elements. The identification of potential cancer genes in this framework involves the detection of genomic regions (common insertion sites; CIS) which contain a number of such viral integration sites that is greater than expected by chance. During the last two decades, a number of different methods have been discussed for the identification of such loci and the respective techniques have been applied to a variety of different retroviruses and/or tumor models. We have previously established a retrovirus driven brain tumor model and reported the CISs which were found based on a Monte Carlo statistics derived detection paradigm. In this study, we consider a recently proposed alternative graph theory based method for identifying CISs and compare the resulting CIS landscape in our brain tumor dataset to those obtained when using the Monte Carlo approach. Finally, we also employ the graph-based method to compare the CIS landscape in our brain tumor model with those of other published retroviral tumor models.
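The idea of testing whether a region holds more insertions than expected by chance can be sketched with a simple Monte Carlo simulation. This is an illustrative simplification under stated assumptions (uniform random placement as the null model, a fixed window width, a single genome-wide statistic); it is not the detection paradigm used in the study, and all names and numbers below are hypothetical.

```python
import random


def max_window_count(positions, window):
    """Largest number of insertions that fit in any interval of width `window`."""
    pts = sorted(positions)
    best, left = 0, 0
    for right in range(len(pts)):
        # Shrink the interval from the left until it is at most `window` wide.
        while pts[right] - pts[left] > window:
            left += 1
        best = max(best, right - left + 1)
    return best


def cis_p_value(positions, genome_length, window, n_sim=200, seed=0):
    """Empirical p-value for the densest window under uniform random insertion."""
    rng = random.Random(seed)
    observed = max_window_count(positions, window)
    hits = 0
    for _ in range(n_sim):
        simulated = [rng.randrange(genome_length) for _ in positions]
        if max_window_count(simulated, window) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly positive.
    return (hits + 1) / (n_sim + 1)


GENOME = 1_000_000
WINDOW = 10_000

# Hypothetical data: 40 evenly spread insertions plus a tight cluster of 10.
clustered = [i * 25_000 for i in range(40)] + [500_000 + 100 * i for i in range(10)]
# Hypothetical data: 50 evenly spread insertions, no cluster.
spread = [i * 20_000 for i in range(50)]

print(cis_p_value(clustered, GENOME, WINDOW))
print(cis_p_value(spread, GENOME, WINDOW))
```

The clustered set gives a small empirical p-value, while the evenly spread set does not, since every simulated data set is at least as dense as it is.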
Porcelli, Damiano; Barsanti, Paolo; Pesole, Graziano; Caggese, Corrado
2007-01-01
Background: When orthologous sequences from species distributed throughout an optimal range of divergence times are available, comparative genomics is a powerful tool to address problems such as the identification of the forces that shape gene structure during evolution, although the functional constraints involved may vary in different genes and lineages. Results: We identified and annotated in the MitoComp2 dataset the orthologs of 68 nuclear genes controlling oxidative phosphorylation in 11 Drosophilidae species and in five non-Drosophilidae insects, and compared them with each other and with their counterparts in three vertebrates (Fugu rubripes, Danio rerio and Homo sapiens) and in the cnidarian Nematostella vectensis, taking into account conservation of gene structure and regulatory motifs, and preservation of gene paralogs in the genome. Comparative analysis indicates that the ancestral insect OXPHOS genes were intron rich and that extensive intron loss and lineage-specific intron gain occurred during evolution. Comparison with vertebrates and cnidarians also shows that many OXPHOS gene introns predate the cnidarian/Bilateria evolutionary split. The nuclear respiratory gene element (NRG) has played a key role in the evolution of the insect OXPHOS genes; it is constantly conserved in the OXPHOS orthologs of all the insect species examined, while their duplicates either completely lack the element or possess only relics of the motif. Conclusion: Our observations reinforce the notion that the common ancestor of most animal phyla had intron-rich genes, and suggest that changes in the pattern of expression of the gene facilitate the fixation of duplications in the genome and the development of novel genetic functions. PMID:18315839
Chondrocyte channel transcriptomics
Lewis, Rebecca; May, Hannah; Mobasheri, Ali; Barrett-Jolley, Richard
2013-01-01
To date, a range of ion channels have been identified in chondrocytes using a number of different techniques, predominantly electrophysiological and/or biomolecular; each of these has its advantages and disadvantages. Here we aim to compare and contrast the data available from biophysical and microarray experiments. This letter analyses recent transcriptomics datasets from chondrocytes, accessible from the European Bioinformatics Institute (EBI). We discuss whether such bioinformatic analysis of microarray datasets can potentially accelerate identification and discovery of ion channels in chondrocytes. The ion channels which appear most frequently across these microarray datasets are discussed, along with their possible functions. We discuss whether functional or protein data exist which support the microarray data. A microarray experiment comparing gene expression in osteoarthritis and healthy cartilage is also discussed and we verify the differential expression of 2 of these genes, namely the genes encoding large calcium-activated potassium (BK) and aquaporin channels. PMID:23995703
Thomas, Carissa M; Saulnier, Delphine M A; Spinler, Jennifer K; Hemarajata, Peera; Gao, Chunxu; Jones, Sara E; Grimm, Ashley; Balderas, Miriam A; Burstein, Matthew D; Morra, Christina; Roeth, Daniel; Kalkum, Markus; Versalovic, James
2016-10-01
Bacterial-derived compounds from the intestinal microbiome modulate host mucosal immunity. Identification and mechanistic studies of these compounds provide insights into host-microbial mutualism. Specific Lactobacillus reuteri strains suppress production of the proinflammatory cytokine, tumor necrosis factor (TNF), and are protective in a mouse model of colitis. Human-derived L. reuteri strain ATCC PTA 6475 suppresses intestinal inflammation and produces 5,10-methenyltetrahydrofolic acid polyglutamates. Insertional mutagenesis identified the bifunctional dihydrofolate synthase/folylpolyglutamate synthase type 2 (folC2) gene as essential for 5,10-methenyltetrahydrofolic acid polyglutamate biosynthesis, as well as for suppression of TNF production by activated human monocytes, and for the anti-inflammatory effect of L. reuteri 6475 in a trinitrobenzene sulfonic acid-induced mouse model of acute colitis. In contrast, folC encodes the enzyme responsible for folate polyglutamylation but does not impact TNF suppression by L. reuteri. Comparative transcriptomics between wild-type and mutant L. reuteri strains revealed additional genes involved in immunomodulation, including previously identified hdc genes involved in histidine to histamine conversion. The folC2 mutant yielded diminished hdc gene cluster expression and diminished histamine production, suggesting a link between folate and histidine/histamine metabolism. The identification of genes and gene networks regulating production of bacterial-derived immunoregulatory molecules may lead to improved anti-inflammatory strategies for digestive diseases. © 2016 The Authors. MicrobiologyOpen published by John Wiley & Sons Ltd.
Sokhi, Upneet K.; Bacolod, Manny D.; Dasgupta, Santanu; Emdad, Luni; Das, Swadesh K.; Dumur, Catherine I.; Miles, Michael F.; Sarkar, Devanand; Fisher, Paul B.
2013-01-01
Human Polynucleotide Phosphorylase (hPNPaseold-35 or PNPT1) is an evolutionarily conserved 3′→5′ exoribonuclease implicated in the regulation of numerous physiological processes including maintenance of mitochondrial homeostasis, mtRNA import and aging-associated inflammation. From an RNase perspective, little is known about the RNA or miRNA species it targets for degradation or whose expression it regulates, except for c-myc and miR-221. To further elucidate the functional implications of hPNPaseold-35 in cellular physiology, we knocked down and overexpressed hPNPaseold-35 in human melanoma cells and performed gene expression analyses to identify differentially expressed transcripts. Ingenuity Pathway Analysis indicated that knockdown of hPNPaseold-35 resulted in significant gene expression changes associated with mitochondrial dysfunction and cholesterol biosynthesis, whereas overexpression of hPNPaseold-35 caused global changes in cell-cycle related functions. Additionally, comparative gene expression analyses between our hPNPaseold-35 knockdown and overexpression datasets allowed us to identify 77 potential “direct” and 61 potential “indirect” targets of hPNPaseold-35 which formed correlated networks enriched for cell-cycle and wound healing functional association, respectively. These results provide a comprehensive database of genes responsive to hPNPaseold-35 expression levels, along with the identification of new potential candidate genes offering fresh insight into cellular pathways regulated by PNPT1 and which may be used in the future for possible therapeutic intervention in mitochondrial- or inflammation-associated disease phenotypes. PMID:24143183
Fotouhi-Ardakani, Reza; Dabiri, Shahriar; Ajdari, Soheila; Alimohammadian, Mohammad Hossein; AlaeeNovin, Elnaz; Taleshi, Neda; Parvizi, Parviz
2016-12-01
The polymorphism and genetic diversity of the Leishmania genus remain under discussion and depend on many factors, such as nuclear and/or mitochondrial genes, molecular tools, Leishmania species, geographical origin, condition of the micro-environment of Leishmania parasites and isolation of Leishmania from clinical samples, reservoir hosts and vectors. The genetic variation of Leishmania species (L. major, L. tropica, L. tarentolae, L. mexicana, L. infantum) was analyzed and compared using mitochondrial (COII and Cyt b) and nuclear (nagt, ITS-rDNA and HSP70) genes. Each enzymatic (COII, Cyt b and nagt) or housekeeping (ITS-rDNA, HSP70) gene was employed for accurate identification of Leishmania parasites. After DNA extraction and amplification of native, natural and reference strains of Leishmania parasites, polymerase chain reaction (PCR) products were sequenced, and evaluation of genetic proximity and phylogenetic analysis were performed using MEGA6 and DnaSP5 software. Among the 72 sequences of the five genes, the number of polymorphic sites was significantly lower than the number of monomorphic sites. Of the 72 sequences, 54 new haplotypes (five genes) of Leishmania species were submitted to GenBank (accession numbers: KU680818 - KU680871). Four genes had a remarkable number of informative sites (P=0.00), except HSP70, perhaps because of its microsatellite regions. The non-synonymous (dN) variants of the nagt gene were more numerous than those of the other expression genes (47.4%). The synonymous (dS)/dN ratio in three expression genes showed a significant variation between the five Leishmania species (P=0.001). The highest and lowest levels of haplotype diversity were observed in L. tropica (81.35%) and L. major (28.38%) populations, respectively. Tajima's D index analyses showed that the Cyt b gene in L. tropica was significantly negative (Tajima's D=-2.2, P<0.01), while the COII and nagt genes were produced through evolutionary processes for both L. tropica and L. major (Tajima's D=2.85 & 2.91, P<0.01). More different clinical lesions with extensive phylogenetic and evolutionary analyses should be employed to avoid confusion in the diagnosis of leishmaniasis and development of vaccines for eradicating Leishmania parasites. Copyright © 2016 Elsevier B.V. All rights reserved.
Xie, Xin-Ping; Xie, Yu-Feng; Wang, Hong-Qiang
2017-08-23
Large-scale accumulation of omics data poses a pressing challenge of integrative analysis of multiple data sets in bioinformatics. An open question of such integrative analysis is how to pinpoint consistent but subtle gene activity patterns across studies. Study heterogeneity needs to be addressed carefully for this goal. This paper proposes a regulation probability model-based meta-analysis, jGRP, for identifying differentially expressed genes (DEGs). The method integrates multiple transcriptomics data sets in a gene regulatory space instead of in a gene expression space, which makes it easy to capture and manage data heterogeneity across studies from different laboratories or platforms. Specifically, we transform gene expression profiles into a united gene regulation profile across studies by mathematically defining two gene regulation events between two conditions and estimating their occurring probabilities in a sample. Finally, a novel differential expression statistic is established based on the gene regulation profiles, realizing accurate and flexible identification of DEGs in gene regulation space. We evaluated the proposed method on simulation data and real-world cancer datasets and showed the effectiveness and efficiency of jGRP in identifying DEGs in the context of meta-analysis. Data heterogeneity largely influences the performance of meta-analysis for DEG identification. Existing meta-analysis methods were revealed to exhibit very different degrees of sensitivity to study heterogeneity. The proposed method, jGRP, can be a standalone tool due to its united framework and controllable way to deal with study heterogeneity.
Hollenbach, Jill A.; Saperstein, Aliya; Albrecht, Mark; Vierra-Green, Cynthia; Parham, Peter; Norman, Paul J.; Maiers, Martin
2015-01-01
We conducted a nationwide study comparing self-identification to genetic ancestry classifications in a large cohort (n = 1752) from the National Marrow Donor Program. We sought to determine how various measures of self-identification intersect with genetic ancestry, with the aim of improving matching algorithms for unrelated bone marrow transplant. Multiple dimensions of self-identification, including race/ethnicity and geographic ancestry were compared to classifications based on ancestry informative markers (AIMs), and the human leukocyte antigen (HLA) genes, which are required for transplant matching. Nearly 20% of responses were inconsistent between reporting race/ethnicity versus geographic ancestry. Despite strong concordance between AIMs and HLA, no measure of self-identification shows complete correspondence with genetic ancestry. In certain cases geographic ancestry reporting matches genetic ancestry not reflected in race/ethnicity identification, but in other cases geographic ancestries show little correspondence to genetic measures, with important differences by gender. However, when respondents assign ancestry to grandparents, we observe sub-groups of individuals with well-defined genetic ancestries, including important differences in HLA frequencies, with implications for transplant matching. While we advocate for tailored questioning to improve accuracy of ancestry ascertainment, collection of donor grandparents’ information will improve the chances of finding matches for many patients, particularly for mixed-ancestry individuals. PMID:26287376
Kakrana, Atul; Kumar, Anil; Satheesh, Viswanathan; Abdin, M. Z.; Subramaniam, Kuppuswamy; Bhattacharya, R. C.; Srinivasan, Ramamurthy; Sirohi, Anil; Jain, Pradeep K.
2017-01-01
The root-knot nematode (RKN), Meloidogyne incognita, is an obligate, sedentary endoparasite that infects a large number of crops and severely affects productivity. The commonly used nematode control strategies have their own limitations. Of late, RNA interference (RNAi) has become a popular approach for the development of nematode resistance in plants. Transgenic crops capable of expressing dsRNAs, specifically in roots for disrupting the parasitic process, offer an effective and efficient means of producing resistant crops. We identified nematode-responsive and root-specific (NRRS) promoters by using microarray data from the public domain and known conserved cis-elements. A set of 51 NRRS genes was identified which was narrowed down further on the basis of presence of cis-elements combined with minimal expression in the absence of nematode infection. The comparative analysis of promoters from the enriched NRRS set, along with earlier reported nematode-responsive genes, led to the identification of specific cis-elements. The promoters of two candidate genes were used to generate transgenic plants harboring promoter GUS constructs and tested in planta against nematodes. Both promoters showed preferential expression upon nematode infection, exclusively in the root in one and galls in the other. One of these NRRS promoters was used to drive the expression of splicing factor, a nematode-specific gene, for generating host-delivered RNAi-mediated nematode-resistant plants. Transgenic lines expressing dsRNA of splicing factor under the NRRS promoter exhibited up to a 32% reduction in number of galls compared to control plants. PMID:29312363
George, Ellen M.; Hare, Matthew P.; Crabtree, Darran L.; Lantry, Brian F.; Rudstam, Lars G.
2017-01-01
Cisco Coregonus artedi are an important component of native food webs in the Great Lakes, and their restoration is instrumental to the recovery of lake trout Salvelinus namaycush and Atlantic salmon Salmo salar. Difficulties with visual identification of larvae can confound early life history surveys, as cisco are often difficult to distinguish from lake whitefish C. clupeaformis. We compared traditional visual species identification methods to genetic identifications based on barcoding of the mitochondrial cytochrome C oxidase I gene for 726 coregonine larvae caught in Chaumont Bay, Lake Ontario. We found little agreement between the visual characteristics of cisco identified by genetic barcoding and the most widely used dichotomous key, and the considerable overlap in ranges of traditionally utilized metrics suggest that visual identification of coregonine larvae from Chaumont Bay is impractical. Coregonines are highly variable and plastic species, and often display wide variations in morphometric characteristics across their broad range. This study highlights the importance of developing accurate, geographically appropriate larval identification methods in order to best inform cisco restoration and management efforts.
Cherkaoui, Abdessalam; Hibbs, Jonathan; Emonet, Stéphane; Tangomo, Manuela; Girard, Myriam; Francois, Patrice; Schrenzel, Jacques
2010-04-01
Bacterial identification relies primarily on culture-based methodologies requiring 24 h for isolation and an additional 24 to 48 h for species identification. Matrix-assisted laser desorption ionization-time of flight mass spectrometry (MALDI-TOF MS) is an emerging technology newly applied to the problem of bacterial species identification. We evaluated two MALDI-TOF MS systems with 720 consecutively isolated bacterial colonies under routine clinical laboratory conditions. Isolates were analyzed in parallel on both devices, using the manufacturers' default recommendations. We compared MS with conventional biochemical test system identifications. Discordant results were resolved with "gold standard" 16S rRNA gene sequencing. The first MS system (Bruker) gave high-confidence identifications for 680 isolates, of which 674 (99.1%) were correct; the second MS system (Shimadzu) gave high-confidence identifications for 639 isolates, of which 635 (99.4%) were correct. Had MS been used for initial testing and biochemical identification used only in the absence of high-confidence MS identifications, the laboratory would have saved approximately US$5 per isolate in marginal costs and reduced average turnaround time by more than an 8-h shift, with no loss in accuracy. Our data suggest that implementation of MS as a first test strategy for one-step species identification would improve timeliness and reduce isolate identification costs in clinical bacteriology laboratories now.
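The accuracy figures quoted in this abstract follow directly from the reported counts. The short check below recomputes them; the helper name `percent` is hypothetical and the counts are taken from the abstract.

```python
def percent(part, whole):
    """Percentage of `part` in `whole`, rounded to one decimal place."""
    return round(100 * part / whole, 1)


# Correct identifications among high-confidence identifications.
bruker_accuracy = percent(674, 680)
shimadzu_accuracy = percent(635, 639)

print(bruker_accuracy, shimadzu_accuracy)
```

Note that these percentages are computed over high-confidence identifications only (680 and 639 isolates), not over all 720 isolates tested.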
Genome analysis of medicinal Ganoderma spp. with plant-pathogenic and saprotrophic life-styles.
Kües, Ursula; Nelson, David R; Liu, Chang; Yu, Guo-Jun; Zhang, Jianhui; Li, Jianqin; Wang, Xin-Cun; Sun, Hui
2015-06-01
Ganoderma is a fungal genus belonging to the Ganodermataceae family and Polyporales order. Plant-pathogenic species in this genus can cause severe diseases (stem, butt, and root rot) in economically important trees and perennial crops, especially in tropical countries. Ganoderma species are white rot fungi and have ecological importance in the breakdown of woody plants for nutrient mobilization. They possess effective machineries of lignocellulose-decomposing enzymes useful for bioenergy production and bioremediation. In addition, the genus contains many important species that produce pharmacologically active compounds used in health food and medicine. With the rapid adoption of next-generation DNA sequencing technologies, whole genome sequencing and systematic transcriptome analyses become affordable approaches to identify an organism's genes. In the last few years, numerous projects have been initiated to identify the genetic contents of several Ganoderma species, particularly in different strains of Ganoderma lucidum. In November 2013, eleven whole genome sequencing projects for Ganoderma species were registered in international databases, three of which were already completed with genomes being assembled to high quality. In addition to the nuclear genome, two mitochondrial genomes for Ganoderma species have also been reported. Complementing genome analysis, four transcriptome studies on various developmental stages of Ganoderma species have been performed. Information obtained from these studies has laid the foundation for the identification of genes involved in biological pathways that are critical for understanding the biology of Ganoderma, such as the mechanism of pathogenesis, the biosynthesis of active components, life cycle and cellular development, etc. 
With abundant genetic information becoming available, a few centralized resources have been established to disseminate the knowledge and integrate relevant data to support comparative genomic analyses of Ganoderma species. The current review carries out a detailed comparison of the nuclear genomes, mitochondrial genomes and transcriptomes from several Ganoderma species. Genes involved in biosynthetic pathways such as CYP450 genes and in cellular development such as matA and matB genes are characterized and compared in detail, as examples to demonstrate the usefulness of comparative genomic analyses for the identification of critical genes. Resources needed for future data integration and exploitation are also discussed. Copyright © 2014 Elsevier Ltd. All rights reserved.
Identification of genes differentially expressed during ripening of banana.
Manrique-Trujillo, Sandra Mabel; Ramírez-López, Ana Cecilia; Ibarra-Laclette, Enrique; Gómez-Lim, Miguel Angel
2007-08-01
The banana (Musa acuminata, subgroup Cavendish 'Grand Nain') is a climacteric fruit of economic importance. A better understanding of the banana ripening process is needed to improve fruit quality and to extend shelf life. Eighty-four up-regulated unigenes were identified by differential screening of a banana fruit cDNA subtraction library at a late ripening stage. The ripening stages in this study were defined according to the peel color index (PCI). Unigene sequences were analyzed with different databases to assign a putative identification. The expression patterns of 36 transcripts confirmed as positive by differential screening were analyzed comparing the PCI 1, PCI 5 and PCI 7 ripening stages. Expression profiles were obtained for unigenes annotated as orcinol O-methyltransferase, putative alcohol dehydrogenase, ubiquitin-protein ligase, chorismate mutase and two unigenes with non-significant matches with any reported sequence. Similar expression profiles were observed in banana pulp and peel. Our results show differential expression of a group of genes involved in processes associated with fruit ripening, such as stress, detoxification, cytoskeleton and biosynthesis of volatile compounds. Some of the identified genes had not been characterized in banana fruit. Besides providing an overview of gene expression programs and metabolic pathways at late stages of banana fruit ripening, this study contributes to increasing the information available on banana fruit ESTs.
New support vector machine-based method for microRNA target prediction.
Li, L; Gao, Q; Mao, X; Cao, Y
2014-06-09
MicroRNA (miRNA) plays important roles in cell differentiation, proliferation, growth, mobility, and apoptosis. An accurate list of precise target genes is necessary in order to fully understand the importance of miRNAs in animal development and disease. Several computational methods have been proposed for miRNA target-gene identification. However, these methods still have limitations with respect to their sensitivity and accuracy. Thus, we developed a new miRNA target-prediction method based on the support vector machine (SVM) model. The model supplies information of two binding sites (primary and secondary) for a radial basis function kernel as a similarity measure for SVM features. The information is categorized based on structural, thermodynamic, and sequence conservation. Using high-confidence datasets selected from public miRNA target databases, we obtained a human miRNA target SVM classifier model with high performance and provided an efficient tool for human miRNA target gene identification. Experiments have shown that our method is a reliable tool for miRNA target-gene prediction, and a successful application of an SVM classifier. Compared with other methods, the method proposed here improves the sensitivity and accuracy of miRNA prediction. Its performance can be further improved by providing more training examples.
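The radial basis function kernel mentioned in the abstract above can be illustrated with a minimal sketch. The feature vectors, their three components, and the gamma value below are hypothetical and chosen only for illustration; the abstract does not give the authors' actual features or parameters.

```python
import math

def rbf_kernel(x, y, gamma=0.5):
    """Radial basis function kernel: exp(-gamma * ||x - y||^2)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)

# Hypothetical feature vectors for three candidate binding sites:
# [structural score, thermodynamic score, conservation score]
site_a = [0.8, -0.6, 0.9]
site_b = [0.7, -0.5, 0.8]
site_c = [0.1, 0.4, 0.2]

# Similar sites give a kernel value near 1; dissimilar sites give a smaller value.
print(rbf_kernel(site_a, site_b))
print(rbf_kernel(site_a, site_c))
```

An SVM with this kernel treats the kernel value as the similarity between two candidate sites, so the choice of gamma controls how quickly similarity decays with distance in feature space.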
Identification of susceptibility genes and genetic modifiers of human diseases
NASA Astrophysics Data System (ADS)
Abel, Kenneth; Kammerer, Stefan; Hoyal, Carolyn; Reneland, Rikard; Marnellos, George; Nelson, Matthew R.; Braun, Andreas
2005-03-01
The completion of the human genome sequence enables the discovery of genes involved in common human disorders. The successful identification of these genes is dependent on the availability of informative sample sets, validated marker panels, a high-throughput scoring technology, and a strategy for combining these resources. We have developed a universal platform technology based on mass spectrometry (MassARRAY) for analyzing nucleic acids with high precision and accuracy. To fuel this technology, we generated more than 100,000 validated assays for single nucleotide polymorphisms (SNPs) covering virtually all known and predicted human genes. We also established a large DNA sample bank comprising more than 50,000 consented healthy and diseased individuals. This combination of reagents and technology allows the execution of large-scale genome-wide association studies. Taking advantage of MassARRAY's capability for quantitative analysis of nucleic acids, allele frequencies are estimated in sample pools containing large numbers of individual DNAs. Comparing pools as a first-pass "filtering" step offers a tremendous advantage in throughput and cost over individual genotyping. We employed this approach in numerous genome-wide, hypothesis-free searches to identify genes associated with common complex diseases, such as breast cancer, osteoporosis, and osteoarthritis, and genes involved in quantitative traits like high-density lipoprotein cholesterol (HDL-c) levels and central fat. Access to additional well-characterized patient samples through collaborations allows us to conduct replication studies that validate true disease genes. These discoveries will expand our understanding of genetic disease predisposition, and our ability to diagnose disease early and to determine specific disease subtype or progression stage.
Chakravorty, S; Sarkar, S; Gachhui, R
2015-01-01
The Acetobacteraceae family of the class Alphaproteobacteria comprises bacteria that tolerate high sugar and acid concentrations. The Acetic Acid Bacteria are the economically most significant group of this family because of their association with food products such as vinegar and wine. Acetobacteraceae are often hard to culture in laboratory conditions and they also maintain very low abundances in their natural habitats. Thus identification of the organisms in such environments is greatly dependent on modern tools of molecular biology, which require a thorough knowledge of specific conserved gene sequences that may act as primers and/or probes. Moreover, unconserved domains in genes also become markers for differentiating closely related genera. In bacteria, the 16S rRNA gene is an ideal candidate for such conserved and variable domains. In order to study the conserved and variable domains of the 16S rRNA gene of Acetic Acid Bacteria and the Acetobacteraceae family, sequences from publicly available databases were aligned and compared. Near complete sequences of the gene were also obtained from Kombucha tea biofilm, a known Acetobacteraceae family habitat, in order to corroborate the domains obtained from the alignment studies. The study indicated that the degree of conservation in the gene is significantly higher among the Acetic Acid Bacteria than in the whole Acetobacteraceae family. Moreover, it was also observed that the previously described hypervariable regions V1, V3, V5, V6 and V7 were more or less conserved in the family and the spans of the variable regions are quite distinct as well.
Liu, Pu; Zhang, Chao; Ma, Jin-Qi; Zhang, Li-Yuan; Yang, Bo; Tang, Xin-Yu; Huang, Ling; Zhou, Xin-Tong; Lu, Kun; Li, Jia-Na
2018-03-16
Cytokinin oxidase/dehydrogenases (CKXs) play a critical role in the irreversible degradation of cytokinins, thereby regulating plant growth and development. Brassica napus is one of the most widely cultivated oilseed crops worldwide. With the completion of whole-genome sequencing of B. napus, genome-wide identification and expression analysis of the BnCKX gene family has become technically feasible. In this study, we identified 23 BnCKX genes and analyzed their phylogenetic relationships, gene structures, conserved motifs, protein subcellular localizations, and other properties. We also analyzed the expression of the 23 BnCKX genes in the B. napus cultivar Zhong Shuang 11 ('ZS11') by quantitative reverse-transcription polymerase chain reaction (qRT-PCR), revealing their diverse expression patterns. We selected four BnCKX genes based on the results of RNA-sequencing and qRT-PCR and compared their expression in cultivated varieties with extremely long versus short siliques. The expression levels of BnCKX5-1, 5-2, 6-1, and 7-1 significantly differed between the two lines and changed during pod development, suggesting they might play roles in determining silique length and in pod development. Finally, we investigated the effects of treatment with the synthetic cytokinin 6-benzylaminopurine (6-BA) and the auxin indole-3-acetic acid (IAA) on the expression of the four selected BnCKX genes. Our results suggest that regulating BnCKX expression is a promising way to enhance the harvest index and stress resistance in plants.
Muthamilarasan, Mehanathan; Khan, Yusuf; Jaishankar, Jananee; Shweta, Shweta; Lata, Charu; Prasad, Manoj
2015-01-01
Several underutilized grasses have excellent potential for use as bioenergy feedstock due to their lignocellulosic biomass. Genomic tools have enabled identification of lignocellulose biosynthesis genes in several sequenced plants. However, the lack of whole genome sequences for bioenergy grasses hinders the study of bioenergy genomics and genomics-assisted crop improvement. Foxtail millet (Setaria italica L.; Si) is a model crop for studying systems biology of bioenergy grasses. In the present study, a systematic approach has been used for identification of gene families involved in cellulose (CesA/Csl), callose (Gsl) and monolignol biosynthesis (PAL, C4H, 4CL, HCT, C3H, CCoAOMT, F5H, COMT, CCR, CAD) and construction of a physical map of foxtail millet. Sequence alignment and phylogenetic analysis of identified proteins showed that monolignol biosynthesis proteins were highly diverse, whereas CesA/Csl and Gsl proteins were homologous to rice and Arabidopsis. Comparative mapping of foxtail millet lignocellulose biosynthesis genes with other C4 panicoid genomes revealed maximum homology with switchgrass, followed by sorghum and maize. Expression profiling of candidate lignocellulose genes in response to different abiotic stresses and hormone treatments showed their differential expression pattern, with significantly higher expression of SiGsl12, SiPAL2, SiHCT1, SiF5H2, and SiCAD6 genes. Further, due to the evolutionary conservation of grass genomes, the insights gained from the present study could be extrapolated for identifying genes involved in lignocellulose biosynthesis in other biofuel species for further characterization. PMID:26583030
Narayanan, M P; Menon, Krishnakumar N; Vasudevan, D M
2013-10-01
Maple syrup urine disease (MSUD) is predominantly caused by mutations in the BCKDHA, BCKDHB and DBT genes, which encode the E1alpha, E1beta and E2 subunits of the branched-chain alpha-keto acid dehydrogenase complex, respectively. Because disease-causing mutations play a major role in the development of the disease, prenatal diagnosis may be significant for parental decision-making. Thus, this study aimed to screen South Indian MSUD patients for mutations and assess the genotype-phenotype correlation. Thirteen patients diagnosed with MSUD by conventional biochemical screening such as urine analysis by DNPH test, thin layer chromatography for amino acids and blood amino acid quantification by HPLC were selected for mutation analysis. The entire coding regions of the BCKDHA, BCKDHB and DBT genes were analyzed for mutations by PCR-based direct DNA sequencing. BCKDHA and BCKDHB mutations were seen in 43% of the total ten patients, while disease-causing DBT gene mutation was observed only in 14%. Three patients displayed no mutations. Novel mutations were c.130C>T in the BCKDHA gene, c.599C>T and c.121_122delAC in the BCKDHB gene and c.190G>A in the DBT gene. Notably, patients harbouring these mutations were non-responsive to thiamine supplementation and other treatment regimens and might have a worse prognosis as compared to the patients not having such mutations. Thus, identification of these mutations may have a crucial role in the treatment as well as understanding the molecular mechanisms in MSUD.
Smith, Desmond J.; Rubin, Edward M.
2000-01-01
A diagnostic test useful for prenatal identification of Down syndrome and mental retardation. A method for gene therapy for correction and treatment of Down syndrome. DYRK gene involved in the ability to learn. A method for diagnosing Down syndrome and mental retardation and an assay therefor. A pharmaceutical composition for treatment of Down syndrome mental retardation.
Seng, Piseth; Abat, Cedric; Rolain, Jean Marc; Colson, Philippe; Lagier, Jean-Christophe; Gouriet, Frédérique; Fournier, Pierre Edouard; Drancourt, Michel; La Scola, Bernard; Raoult, Didier
2013-07-01
During the past 5 years, matrix-assisted laser desorption ionization-time of flight (MALDI-TOF) mass spectrometry (MS) has become a powerful tool for routine identification in many clinical laboratories. We analyzed our 11-year experience in routine identification of clinical isolates (40 months using MALDI-TOF MS and 91 months using conventional phenotypic identification [CPI]). Among the 286,842 clonal isolates, 284,899 isolates of 459 species were identified. The remaining 1,951 isolates were misidentified and required confirmation using a second phenotypic identification for 670 isolates and using a molecular technique for 1,273 isolates of 339 species. MALDI-TOF MS annually identified 112 species, i.e., 36 species/10,000 isolates, compared to 44 species, i.e., 19 species/10,000 isolates, for CPI. Only 50 isolates required second phenotypic identifications during the MALDI-TOF MS period (i.e., 4.5 reidentifications/10,000 isolates) compared with 620 isolates during the CPI period (i.e., 35.2/10,000 isolates). We identified 128 bacterial species rarely reported as human pathogens, including 48 using phenotypic techniques (22 using CPI and 37 using MALDI-TOF MS). Another 75 rare species were identified using molecular methods. MALDI-TOF MS reduced the time required for identification by 55-fold and 169-fold and the cost by 5-fold and 96-fold compared with CPI and gene sequencing, respectively. MALDI-TOF MS was a powerful tool not only for routine bacterial identification but also for identification of rare bacterial species implicated in human infectious diseases. The ability to rapidly identify bacterial species rarely described as pathogens in specific clinical specimens will help us to study the clinical burden resulting from the emergence of these species as human pathogens, and MALDI-TOF MS may be considered an alternative to molecular methods in clinical laboratories.
Seng, Piseth; Abat, Cedric; Rolain, Jean Marc; Colson, Philippe; Lagier, Jean-Christophe; Gouriet, Frédérique; Fournier, Pierre Edouard; Drancourt, Michel; La Scola, Bernard
2013-01-01
During the past 5 years, matrix-assisted laser desorption ionization–time of flight (MALDI-TOF) mass spectrometry (MS) has become a powerful tool for routine identification in many clinical laboratories. We analyzed our 11-year experience in routine identification of clinical isolates (40 months using MALDI-TOF MS and 91 months using conventional phenotypic identification [CPI]). Among the 286,842 clonal isolates, 284,899 isolates of 459 species were identified. The remaining 1,951 isolates were misidentified and required confirmation using a second phenotypic identification for 670 isolates and using a molecular technique for 1,273 isolates of 339 species. MALDI-TOF MS annually identified 112 species, i.e., 36 species/10,000 isolates, compared to 44 species, i.e., 19 species/10,000 isolates, for CPI. Only 50 isolates required second phenotypic identifications during the MALDI-TOF MS period (i.e., 4.5 reidentifications/10,000 isolates) compared with 620 isolates during the CPI period (i.e., 35.2/10,000 isolates). We identified 128 bacterial species rarely reported as human pathogens, including 48 using phenotypic techniques (22 using CPI and 37 using MALDI-TOF MS). Another 75 rare species were identified using molecular methods. MALDI-TOF MS reduced the time required for identification by 55-fold and 169-fold and the cost by 5-fold and 96-fold compared with CPI and gene sequencing, respectively. MALDI-TOF MS was a powerful tool not only for routine bacterial identification but also for identification of rare bacterial species implicated in human infectious diseases. The ability to rapidly identify bacterial species rarely described as pathogens in specific clinical specimens will help us to study the clinical burden resulting from the emergence of these species as human pathogens, and MALDI-TOF MS may be considered an alternative to molecular methods in clinical laboratories. PMID:23637301
Gu, Jinghua; Xuan, Jianhua; Riggins, Rebecca B; Chen, Li; Wang, Yue; Clarke, Robert
2012-08-01
Identification of transcriptional regulatory networks (TRNs) is of significant importance in computational biology for cancer research, providing a critical building block to unravel disease pathways. However, existing methods for TRN identification suffer from the inclusion of excessive 'noise' in microarray data and false-positives in binding data, especially when applied to human tumor-derived cell line studies. More robust methods that can counteract the imperfection of data sources are therefore needed for reliable identification of TRNs in this context. In this article, we propose to establish a link between the quality of one target gene to represent its regulator and the uncertainty of its expression to represent other target genes. Specifically, an outlier sum statistic was used to measure the aggregated evidence for regulation events between target genes and their corresponding transcription factors. A Gibbs sampling method was then developed to estimate the marginal distribution of the outlier sum statistic, hence, to uncover underlying regulatory relationships. To evaluate the effectiveness of our proposed method, we compared its performance with that of an existing sampling-based method using both simulation data and yeast cell cycle data. The experimental results show that our method consistently outperforms the competing method in different settings of signal-to-noise ratio and network topology, indicating its robustness for biological applications. Finally, we applied our method to breast cancer cell line data and demonstrated its ability to extract biologically meaningful regulatory modules related to estrogen signaling and action in breast cancer. The Gibbs sampler MATLAB package is freely available at http://www.cbil.ece.vt.edu/software.htm. Contact: xuan@vt.edu. Supplementary data are available at Bioinformatics online.
Gu, Jinghua; Xuan, Jianhua; Riggins, Rebecca B.; Chen, Li; Wang, Yue; Clarke, Robert
2012-01-01
Motivation: Identification of transcriptional regulatory networks (TRNs) is of significant importance in computational biology for cancer research, providing a critical building block to unravel disease pathways. However, existing methods for TRN identification suffer from the inclusion of excessive ‘noise’ in microarray data and false-positives in binding data, especially when applied to human tumor-derived cell line studies. More robust methods that can counteract the imperfection of data sources are therefore needed for reliable identification of TRNs in this context. Results: In this article, we propose to establish a link between the quality of one target gene to represent its regulator and the uncertainty of its expression to represent other target genes. Specifically, an outlier sum statistic was used to measure the aggregated evidence for regulation events between target genes and their corresponding transcription factors. A Gibbs sampling method was then developed to estimate the marginal distribution of the outlier sum statistic, hence, to uncover underlying regulatory relationships. To evaluate the effectiveness of our proposed method, we compared its performance with that of an existing sampling-based method using both simulation data and yeast cell cycle data. The experimental results show that our method consistently outperforms the competing method in different settings of signal-to-noise ratio and network topology, indicating its robustness for biological applications. Finally, we applied our method to breast cancer cell line data and demonstrated its ability to extract biologically meaningful regulatory modules related to estrogen signaling and action in breast cancer. Availability and implementation: The Gibbs sampler MATLAB package is freely available at http://www.cbil.ece.vt.edu/software.htm. Contact: xuan@vt.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:22595208
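As a rough illustration of what an outlier sum statistic measures, the sketch below uses the generic form from the differential-expression literature: values are standardized by median and median absolute deviation, and standardized values above the third quartile plus the interquartile range are summed within a group of interest. This is an assumption for illustration; the abstract does not specify the exact variant the authors used, and the expression values and group labels are made up.

```python
import statistics

def outlier_sum(values, in_group):
    """Sum of robustly standardized values that exceed q3 + IQR within a group."""
    med = statistics.median(values)
    mad = statistics.median([abs(v - med) for v in values])
    if mad == 0:
        raise ValueError("median absolute deviation is zero; cannot standardize")
    z = [(v - med) / mad for v in values]
    q1, _, q3 = statistics.quantiles(z, n=4)
    threshold = q3 + (q3 - q1)
    return sum(zi for zi, g in zip(z, in_group) if g and zi > threshold)

# Hypothetical expression values for one gene across 12 samples;
# the last two samples belong to the group of interest and are strongly elevated.
expr = [0.9, 0.95, 1.0, 1.0, 1.05, 1.05, 1.1, 1.1, 1.15, 1.2, 5.0, 6.0]
group = [False] * 10 + [True, True]

print(outlier_sum(expr, group))
```

Because the statistic uses the median and quartiles rather than the mean and variance, a few extreme samples raise the score without inflating the scale used to standardize them.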
Zhang, Guojun; Zheng, Guanghui; Zhang, Yan; Ma, Ruimin; Kang, Xixiong
2018-05-01
Post-neurosurgical meningitis (PNM) is one of the most severe hospital-acquired infections worldwide, and a large number of pathogens, especially those possessing multi-resistance genes, are related to these infections. Existing methods for detecting bacteria and measuring their response to antibiotics lack sensitivity and stability, and laboratory-based detection methods are inconvenient, requiring at least 24 h to complete. Rapid identification of bacteria and the determination of their susceptibility to antibiotics are urgently needed, in order to combat the emergence of multi-resistant bacterial strains. This study evaluated a novel, fast, and easy-to-use micro/nanofluidic chip platform (MNCP), which overcomes the difficulties of diagnosing bacterial infections in neurosurgery. This platform can identify 10 genus or species targets and 13 genetic resistance determinants within 1 h, and it is very simple to operate. A total of 108 bacterium-containing cerebrospinal fluid (CSF) cultures were tested using the MNCP for the identification of bacteria and determinants of genetic resistance. The results were compared to those obtained with conventional identification and antimicrobial susceptibility testing methods. For the 108 CSF cultures, the concordance rate between the MNCP and the conventional identification method was 94.44%; six species attained 100% consistency. For carbapenemase- and extended-spectrum beta-lactamase (ESBL)-related antibiotic resistance genes, both the sensitivity and specificity of the MNCP tests were high (>90.0%) and could fully meet the requirements of clinical diagnosis. The MNCP is fast, accurate, and easy to use, and has great clinical potential in the treatment of post-neurosurgical meningitis. Copyright © 2018 The Author(s). Published by Elsevier Ltd. All rights reserved.
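The agreement metrics quoted in the abstract above can be sketched as follows. The count of 102 concordant cultures is inferred from the reported 94.44% of 108 and is not stated in the abstract; the confusion-matrix counts used for sensitivity and specificity are entirely hypothetical.

```python
def concordance_rate(concordant, total):
    """Percentage of samples on which two methods agree."""
    return 100.0 * concordant / total

def sensitivity(true_pos, false_neg):
    """Percentage of truly positive samples that the test detects."""
    return 100.0 * true_pos / (true_pos + false_neg)

def specificity(true_neg, false_pos):
    """Percentage of truly negative samples that the test reports as negative."""
    return 100.0 * true_neg / (true_neg + false_pos)

# 102 of 108 is inferred from the reported concordance rate (assumption).
print(round(concordance_rate(102, 108), 2))

# Hypothetical counts for one resistance determinant.
print(round(sensitivity(true_pos=28, false_neg=2), 1))
print(round(specificity(true_neg=75, false_pos=3), 1))
```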
GeneSigDB—a curated database of gene expression signatures
Culhane, Aedín C.; Schwarzl, Thomas; Sultana, Razvan; Picard, Kermshlise C.; Picard, Shaita C.; Lu, Tim H.; Franklin, Katherine R.; French, Simon J.; Papenhausen, Gerald; Correll, Mick; Quackenbush, John
2010-01-01
The primary objective of most gene expression studies is the identification of one or more gene signatures; lists of genes whose transcriptional levels are uniquely associated with a specific biological phenotype. Whilst thousands of experimentally derived gene signatures are published, their potential value to the community is limited by their computational inaccessibility. Gene signatures are embedded in published article figures, tables or in supplementary materials, and are frequently presented using non-standard gene or probeset nomenclature. We present GeneSigDB (http://compbio.dfci.harvard.edu/genesigdb) a manually curated database of gene expression signatures. GeneSigDB release 1.0 focuses on cancer and stem cells gene signatures and was constructed from more than 850 publications from which we manually transcribed 575 gene signatures. Most gene signatures (n = 560) were successfully mapped to the genome to extract standardized lists of EnsEMBL gene identifiers. GeneSigDB provides the original gene signature, the standardized gene list and a fully traceable gene mapping history for each gene from the original transcribed data table through to the standardized list of genes. The GeneSigDB web portal is easy to search, allows users to compare their own gene list to those in the database, and download gene signatures in most common gene identifier formats. PMID:19934259
Sánchez-Juanes, Fernando; Ferreira, Laura; Alonso de la Vega, Pablo; Valverde, Angel; Barrios, Milagros León; Rivas, Raúl; Mateos, Pedro F; Martínez-Molina, Eustoquio; González-Buitrago, José Manuel; Trujillo, Martha E; Velázquez, Encarna
2013-12-01
The genus Bradyrhizobium includes slow-growing bacteria able to nodulate different legumes as well as species isolated from plant tumours. The slow growth of the members of this genus and the phylogenetic closeness of most of its species make their identification difficult. In the present work we applied for the first time Matrix-Assisted Laser Desorption Ionization-Time-of-Flight Mass Spectrometry (MALDI-TOF MS) to the analysis of Bradyrhizobium species after extending the MALDI Biotyper 2.0 database with the currently valid species of this genus. With this methodology it was possible to identify strains belonging to phylogenetically closely related species of the genus Bradyrhizobium, allowing discrimination among species with rrs gene identities higher than 99%. The application of MALDI-TOF MS to strains isolated from nodules of different Lupinus species in diverse geographical locations allowed their correct identification when compared with the results of rrs gene and ITS analyses. The nodulation of Lupinus gredensis, an endemic species of the west of Spain, by B. canariense supports the European origin of this species. Copyright © 2013. Published by Elsevier GmbH.
Lim, Shu Yong; Yap, Kien-Pong; Thong, Kwai Lin
2016-01-01
Listeria monocytogenes is an important foodborne pathogen that causes considerable morbidity in humans with high mortality rates. In this study, we have sequenced the genomes and performed comparative genomics analyses on two strains, LM115 and LM41, isolated from ready-to-eat food in Malaysia. The genome sizes of LM115 and LM41 were 2,959,041 and 2,963,111 bp, respectively. These two strains shared approximately 90% homologous genes. Comparative genomics and phylogenomic analyses revealed that LM115 and LM41 were more closely related to the reference strains F2365 and EGD-e, respectively. Our virulence profiling indicated a total of 31 virulence genes shared by both analysed strains. These shared genes included those that encode internalins and L. monocytogenes pathogenicity island 1 (LIPI-1). Both Malaysian L. monocytogenes strains also harboured several genes associated with stress tolerance to counter adverse conditions. Seven antibiotic and efflux pump related genes which may confer resistance against lincomycin, erythromycin, fosfomycin, quinolone, tetracycline, penicillin, and macrolides were identified in the genomes of both strains. Whole genome sequencing and comparative genomics analyses revealed two virulent L. monocytogenes strains isolated from ready-to-eat foods in Malaysia. The identification of strains with pathogenic, persistent, and antibiotic resistant potentials from minimally processed food warrants close attention from both healthcare and the food industry.
Identification of three duplicated Spin genes in medaka (Oryzias latipes).
Wang, Xiao-Lei; Mei, Jie; Sun, Min; Hong, Yun-Han; Gui, Jian-Fang
2005-05-09
Gene and genomic duplications are very important and frequent events in fish evolution, and the divergence of duplicated genes in sequences and functions is a focus of research on gene evolution. Here, we report the identification and characterization of three duplicated Spindlin (Spin) genes from medaka (Oryzias latipes): OlSpinA, OlSpinB, and OlSpinC. Molecular cloning, genomic DNA Blast analysis and phylogenetic relationship analysis indicated that the three OlSpin genes arose by gene duplication. Furthermore, Western blot analysis revealed significant expression differences of the three OlSpins among different tissues and during embryogenesis in medaka, and suggested that sequence and functional divergence might have occurred among them during evolution.
DNA barcoding for molecular identification of Demodex based on mitochondrial genes.
Hu, Li; Yang, YuanJun; Zhao, YaE; Niu, DongLing; Yang, Rui; Wang, RuiLing; Lu, Zhaohui; Li, XiaoQi
2017-12-01
There has been no widely accepted DNA barcode for species identification of Demodex. In this study, we attempted to solve this issue. First, mitochondrial cox1-5' and 12S gene fragments of Demodex folliculorum, D. brevis, D. canis, and D. caprae were amplified, cloned, and sequenced for the first time; intra/interspecific divergences were computed and phylogenetic trees were reconstructed. Then, divergence frequency distribution plots of those two gene fragments were drawn together with mtDNA cox1-middle region and 16S obtained in previous studies. Finally, their identification efficiency was evaluated by comparing barcoding gaps. Results indicated that 12S had the higher identification efficiency. Specifically, for the cox1-5' region of the four Demodex species, intraspecific divergences were less than 2.0%, and interspecific divergences were 21.1-31.0%; for 12S, intraspecific divergences were less than 1.4%, and interspecific divergences were 20.8-26.9%. The phylogenetic trees demonstrated that the four Demodex species clustered separately, and the divergence frequency distribution plot showed that the largest intraspecific divergence of 12S (1.4%) was less than that of the cox1-5' region (2.0%), cox1-middle region (3.1%), and 16S (2.8%). The barcoding gap of 12S was 19.4%, larger than that of the cox1-5' region (19.1%), cox1-middle region (11.3%), and 16S (13.0%); the interspecific divergence span of 12S was 6.2%, smaller than that of the cox1-5' region (10.0%), cox1-middle region (14.1%), and 16S (11.4%). Moreover, 12S has a moderate length (517 bp) that can be sequenced in a single run. Therefore, we proposed that mtDNA 12S is more suitable than cox1 and 16S as a DNA barcode for classification and identification of Demodex at lower taxonomic levels.
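The barcoding gaps reported in the abstract above appear to follow from subtracting the largest intraspecific divergence from the smallest interspecific divergence. A minimal sketch under that assumption, using only percentages given in the abstract for the two markers whose ranges are stated:

```python
# Percent divergences as reported in the abstract.
markers = {
    "12S": {"max_intra": 1.4, "min_inter": 20.8},
    "cox1-5'": {"max_intra": 2.0, "min_inter": 21.1},
}

def barcoding_gap(marker):
    """Smallest interspecific divergence minus largest intraspecific divergence."""
    return round(marker["min_inter"] - marker["max_intra"], 1)

for name, values in markers.items():
    print(name, barcoding_gap(values))
```

Under this definition the sketch reproduces the reported gaps of 19.4% for 12S and 19.1% for the cox1-5' region; a larger gap means less risk that within-species variation is mistaken for a species boundary.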
Schuurs-Hoeijmakers, Janneke H M; Vulto-van Silfhout, Anneke T; Vissers, Lisenka E L M; van de Vondervoort, Ilse I G M; van Bon, Bregje W M; de Ligt, Joep; Gilissen, Christian; Hehir-Kwa, Jayne Y; Neveling, Kornelia; del Rosario, Marisol; Hira, Gausiya; Reitano, Santina; Vitello, Aurelio; Failla, Pinella; Greco, Donatella; Fichera, Marco; Galesi, Ornella; Kleefstra, Tjitske; Greally, Marie T; Ockeloen, Charlotte W; Willemsen, Marjolein H; Bongers, Ernie M H F; Janssen, Irene M; Pfundt, Rolph; Veltman, Joris A; Romano, Corrado; Willemsen, Michèl A; van Bokhoven, Hans; Brunner, Han G; de Vries, Bert B A; de Brouwer, Arjan P M
2013-12-01
Intellectual disability (ID) is a common neurodevelopmental disorder affecting 1-3% of the general population. Mutations in more than 10% of all human genes are considered to be involved in this disorder, although the majority of these genes are still unknown. We investigated 19 small non-consanguineous families with two to five affected siblings in order to identify pathogenic gene variants in known, novel and potential ID candidate genes. Non-consanguineous families have been largely ignored in gene identification studies as small family size precludes prior mapping of the genetic defect. Using exome sequencing, we identified pathogenic mutations in three genes, DDHD2, SLC6A8, and SLC9A6, of which the latter two have previously been implicated in X-linked ID phenotypes. In addition, we identified potentially pathogenic mutations in BCORL1 on the X-chromosome and in MCM3AP, PTPRT, SYNE1, and ZNF528 on autosomes. We show that potentially pathogenic gene variants can be identified in small, non-consanguineous families with as few as two affected siblings, thus emphasising their value in the identification of syndromic and non-syndromic ID genes.
Identification of pathogen avirulence genes in the fusiform rust pathosystem
John M. Davis; Katherine E. Smith; Amanda Pendleton; Jason A. Smith; C. Dana Nelson
2012-01-01
The Cronartium quercuum f.sp. fusiforme (Cqf) whole genome sequencing project will enable identification of avirulence genes in the most devastating pine fungal pathogen in the southeastern United States. Amerson and colleagues (unpublished) have mapped nine fusiform rust resistance genes in loblolly pine,...
Florio, Marta; Heide, Michael; Pinson, Anneline; Brandl, Holger; Albert, Mareike; Winkler, Sylke; Wimberger, Pauline; Huttner, Wieland B; Hiller, Michael
2018-03-21
Understanding the molecular basis that underlies the expansion of the neocortex during primate, and notably human, evolution requires the identification of genes that are particularly active in the neural stem and progenitor cells of the developing neocortex. Here, we have used existing transcriptome datasets to carry out a comprehensive screen for protein-coding genes preferentially expressed in progenitors of fetal human neocortex. We show that 15 human-specific genes exhibit such expression, and many of them evolved distinct neural progenitor cell-type expression profiles and levels compared to their ancestral paralogs. Functional studies on one such gene, NOTCH2NL, demonstrate its ability to promote basal progenitor proliferation in mice. An additional 35 human genes with progenitor-enriched expression are shown to have orthologs only in primates. Our study provides a resource of genes that are promising candidates to exert specific, and novel, roles in neocortical development during primate, and notably human, evolution. © 2018, Florio et al.
Identification of genes involved in cold-shock response in rainbow trout (Oncorhynchus mykiss).
Borchel, Andreas; Verleih, Marieke; Rebl, Alexander; Goldammer, Tom
2017-09-01
A rapid decline in temperature poses a major challenge for poikilothermic fish, as their entire metabolism depends on ambient temperature. The gene expression of rainbow trout Oncorhynchus mykiss having undergone such a cold shock (0 °C) was compared to a control (5 °C) in a microarray and quantitative real-time PCR based study. The tissues of gill, kidney and liver were examined. The most differently expressed genes were found in liver, many of them contributing to the network 'cellular compromise, cellular growth and proliferation'. However, the number of genes found to be regulated at 0 °C was surprisingly low. Instead of classical genes involved in temperature shock, the three genes encoding fibroblast growth factor 1 (fgf1), growth arrest and DNA-damage-inducible, alpha (gadd45a) and sclerostin domain-containing protein 1 (sostdc1) were upregulated in the liver upon cold shock in two different rainbow trout strains, suggesting that these genes may be considered as general biomarkers for cold shock in rainbow trout.
Pinson, Anneline; Brandl, Holger; Albert, Mareike; Winkler, Sylke; Wimberger, Pauline
2018-01-01
Understanding the molecular basis that underlies the expansion of the neocortex during primate, and notably human, evolution requires the identification of genes that are particularly active in the neural stem and progenitor cells of the developing neocortex. Here, we have used existing transcriptome datasets to carry out a comprehensive screen for protein-coding genes preferentially expressed in progenitors of fetal human neocortex. We show that 15 human-specific genes exhibit such expression, and many of them evolved distinct neural progenitor cell-type expression profiles and levels compared to their ancestral paralogs. Functional studies on one such gene, NOTCH2NL, demonstrate its ability to promote basal progenitor proliferation in mice. An additional 35 human genes with progenitor-enriched expression are shown to have orthologs only in primates. Our study provides a resource of genes that are promising candidates to exert specific, and novel, roles in neocortical development during primate, and notably human, evolution. PMID:29561261
20 years since the introduction of DNA barcoding: from theory to application.
Fišer Pečnikar, Živa; Buzan, Elena V
2014-02-01
Traditionally, taxonomic identification has relied upon morphological characters. In the last two decades, molecular tools based on DNA sequences of short standardised gene fragments, termed DNA barcodes, have been developed for species discrimination. The most common DNA barcode used in animals is a fragment of the cytochrome c oxidase (COI) mitochondrial gene, while for plants, two chloroplast gene fragments from the RuBisCo large subunit (rbcL) and maturase K (matK) genes are widely used. Information gathered from DNA barcodes can be used beyond taxonomic studies and will have far-reaching implications across many fields of biology, including ecology (rapid biodiversity assessment and food chain analysis), conservation biology (monitoring of protected species), biosecurity (early identification of invasive pest species), medicine (identification of medically important pathogens and their vectors) and pharmacology (identification of active compounds). However, it is important that the limitations of DNA barcoding are understood and techniques continually adapted and improved as this young science matures.
Brabec, Jan; Kostadinova, Aneta; Scholz, Tomáš; Littlewood, D Timothy J
2015-06-19
The genus Diplostomum (Platyhelminthes: Trematoda: Diplostomidae) is a diverse group of freshwater parasites with complex life-cycles and global distribution. The larval stages are important pathogens causing eye fluke disease implicated in substantial impacts on natural fish populations and losses in aquaculture. However, the problematic species delimitation and difficulties in the identification of larval stages hamper the assessment of the distributional and host ranges of Diplostomum spp. and their transmission ecology. Total genomic DNA was isolated from adult worms and shotgun sequenced using Illumina MiSeq technology. Mitochondrial (mt) genomes and nuclear ribosomal RNA (rRNA) operons were assembled using established bioinformatic tools and fully annotated. Mt protein-coding genes and nuclear rRNA genes were subjected to phylogenetic analysis by maximum likelihood and the resulting topologies compared. We characterised novel complete mt genomes and nuclear rRNA operons of two closely related species, Diplostomum spathaceum and D. pseudospathaceum. Comparative mt genome assessment revealed that the cox1 gene and its 'barcode' region used for molecular identification are the most conserved regions; instead, nad4 and nad5 genes were identified as most promising molecular diagnostic markers. Using the novel data, we provide the first genome wide estimation of the phylogenetic relationships of the order Diplostomida, one of the two fundamental lineages of the Digenea. Analyses of the mitogenomic data invariably recovered the Diplostomidae as a sister lineage of the order Plagiorchiida rather than as a basal lineage of the Diplostomida as inferred in rDNA phylogenies; this was concordant with the mt gene order of Diplostomum spp. exhibiting closer match to the conserved gene order of the Plagiorchiida. 
Complete sequences of the mt genome and rRNA operon of two species of Diplostomum provide a valuable resource for novel genetic markers for species delineation and large-scale molecular epidemiology and disease ecology studies based on the most accessible life-cycle stages of eye flukes.
Identification and comparative analysis of the epidermal differentiation complex in snakes
Brigit Holthaus, Karin; Mlitz, Veronika; Strasser, Bettina; Tschachler, Erwin; Alibardi, Lorenzo; Eckhart, Leopold
2017-01-01
The epidermis of snakes efficiently protects against dehydration and mechanical stress. However, only few proteins of the epidermal barrier to the environment have so far been identified in snakes. Here, we determined the organization of the Epidermal Differentiation Complex (EDC), a cluster of genes encoding protein constituents of cornified epidermal structures, in snakes and compared it to the EDCs of other squamates and non-squamate reptiles. The EDC of snakes displays shared synteny with that of the green anole lizard, including the presence of a cluster of corneous beta-protein (CBP)/beta-keratin genes. We found that a unique CBP comprising 4 putative beta-sheets and multiple cysteine-rich EDC proteins are conserved in all snakes and other squamates investigated. Comparative genomics of squamates suggests that the evolution of snakes was associated with a gene duplication generating two isoforms of the S100 fused-type protein, scaffoldin, the origin of distinct snake-specific EDC genes, and the loss of other genes that were present in the EDC of the last common ancestor of snakes and lizards. Taken together, our results provide new insights into the evolution of the skin in squamates and a basis for the characterization of the molecular composition of the epidermis in snakes. PMID:28345630
Comparative transcriptional profiling identifies takeout as a gene that regulates life span
Bauer, Johannes; Antosh, Michael; Chang, Chengyi; Schorl, Christoph; Kolli, Santharam; Neretti, Nicola; Helfand, Stephen L.
2010-01-01
A major challenge in translating the positive effects of dietary restriction (DR) for the improvement of human health is the development of therapeutic mimics. One approach to finding DR mimics is based upon identification of the proximal effectors of DR life span extension. Whole genome profiling of DR in Drosophila shows a large number of changes in gene expression, making it difficult to establish which changes are involved in life span determination as opposed to other unrelated physiological changes. We used comparative whole genome expression profiling to discover genes whose change in expression is shared between DR and two molecular genetic life span extending interventions related to DR, increased dSir2 and decreased Dmp53 activity. We find twenty-one genes shared among the three related life span extending interventions. One of these genes, takeout, thought to be involved in circadian rhythms, feeding behavior and juvenile hormone binding is also increased in four other life span extending conditions: Rpd3, Indy, chico and methuselah. We demonstrate takeout is involved in longevity determination by specifically increasing adult takeout expression and extending life span. These studies demonstrate the power of comparative whole genome transcriptional profiling for identifying specific downstream elements of the DR life span extending pathway. PMID:20519778
The complexity of gene expression dynamics revealed by permutation entropy
2010-01-01
Background High complexity is considered a hallmark of living systems. Here we investigate the complexity of temporal gene expression patterns using the concept of Permutation Entropy (PE) first introduced in dynamical systems theory. The analysis of gene expression data has so far focused primarily on the identification of differentially expressed genes, or on the elucidation of pathway and regulatory relationships. We aim to study gene expression time series data from the viewpoint of complexity. Results Applying the PE complexity metric to abiotic stress response time series data in Arabidopsis thaliana, genes involved in stress response and signaling were found to be associated with the highest complexity not only under stress, but surprisingly, also under reference, non-stress conditions. Genes with house-keeping functions exhibited lower PE complexity. Compared to reference conditions, the PE of temporal gene expression patterns generally increased upon stress exposure. High-complexity genes were found to have longer upstream intergenic regions and more cis-regulatory motifs in their promoter regions indicative of a more complex regulatory apparatus needed to orchestrate their expression, and to be associated with higher correlation network connectivity degree. Arabidopsis genes also present in other plant species were observed to exhibit decreased PE complexity compared to Arabidopsis specific genes. Conclusions We show that Permutation Entropy is a simple yet robust and powerful approach to identify temporal gene expression profiles of varying complexity that is equally applicable to other types of molecular profile data. PMID:21176199
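For readers unfamiliar with the metric named in the abstract above, here is a minimal sketch of permutation entropy in its usual ordinal-pattern (Bandt-Pompe) form. The embedding order, the normalization, and the handling of ties are assumptions chosen for illustration; the abstract does not state the settings used in the study.

```python
import math
from collections import Counter

def permutation_entropy(series, order=3, normalize=True):
    """Shannon entropy (bits) of ordinal patterns in sliding windows of length `order`."""
    n_windows = len(series) - order + 1
    if n_windows < 1:
        raise ValueError("series is shorter than the embedding order")
    # The ordinal pattern of a window is the index order that sorts its values.
    # Ties are broken by position (an assumption; other conventions exist).
    patterns = Counter(
        tuple(sorted(range(order), key=lambda k: series[i + k]))
        for i in range(n_windows)
    )
    entropy = -sum(
        (count / n_windows) * math.log2(count / n_windows)
        for count in patterns.values()
    )
    if normalize:
        # Divide by the maximum possible entropy, log2(order!), to map into [0, 1].
        entropy /= math.log2(math.factorial(order))
    return entropy

if __name__ == "__main__":
    print(permutation_entropy([1, 2, 3, 4, 5, 6]))       # monotonic profile: single pattern
    print(permutation_entropy([1, 2, 1, 2, 1], order=2))  # alternating profile: two patterns
```

A strictly monotonic expression profile yields a single ordinal pattern and therefore zero entropy, whereas a profile that visits many patterns with similar frequency approaches the normalized maximum of 1.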
Mahajan, Ameya S.; Kondhare, Kirtikumar R.; Rajabhoj, Mohit P.; Kumar, Amit; Ghate, Tejashree; Ravindran, Nevedha; Habib, Farhat; Siddappa, Sundaresha; Banerjee, Anjan K.
2016-01-01
Potato Homeobox 15 (POTH15) is a KNOX-I (Knotted1-like homeobox) family gene in potato that is orthologous to Shoot Meristemless (STM) in Arabidopsis. Despite numerous reports on KNOX genes from different species, studies in potato are limited. Here, we describe photoperiodic regulation of POTH15, its overexpression phenotype, and identification of its potential targets in potato (Solanum tuberosum ssp. andigena). qRT-PCR analysis showed a higher abundance of POTH15 mRNA in shoot tips and stolons under tuber-inducing short-day conditions. POTH15 promoter activity was detected in apical and axillary meristems, stolon tips, tuber eyes, and meristems of tuber sprouts, indicating its role in meristem maintenance and leaf development. POTH15 overexpression altered multiple morphological traits including leaf and stem development, leaflet number, and number of nodes and branches. In particular, the rachis of the leaf was completely reduced and leaves appeared as a bouquet of leaflets. Comparative transcriptomic analysis of 35S::GUS and two POTH15 overexpression lines identified more than 6000 differentially expressed genes, including 2014 common genes between the two overexpression lines. Functional analysis of these genes revealed their involvement in responses to hormones, biotic/abiotic stresses, transcription regulation, and signal transduction. qRT-PCR of selected candidate target genes validated their differential expression in both overexpression lines. Out of 200 randomly chosen POTH15 targets, 173 were found to have at least one tandem TGAC core motif, characteristic of KNOX interaction, within 3.0kb in the upstream sequence of the transcription start site. Overall, this study provides insights to the role of POTH15 in controlling diverse developmental processes in potato. PMID:27217546
Amiri, Azam; Bandani, Ali Reza; Alizadeh, Houshang
2016-04-01
Sunn pest, Eurygaster integriceps, is a serious pest of cereals across a wide area ranging from the Near and Middle East to Eastern and Southern Europe and North Africa. This study describes, for the first time, the identification of E. integriceps trypsin serine protease and cathepsin-L cysteine protease transcripts involved in digestion, which might serve as targets for pest control management. Putative trypsin and cysteine gene sequences of 478 and 500 base pairs, respectively, were characterized and named Tryp and Cys. In addition, the tissue-specific relative expression levels of these genes, as well as of gluten hydrolase (Gl), were determined under different host kernel feeding conditions. Results showed that mRNA expression of Cys, Tryp, and Gl was significantly affected after feeding on various host plant species. Transcript levels of these genes were most abundant in the wheat-fed E. integriceps larvae compared to other hosts. The Cys transcript was detected exclusively in the gut, whereas the Gl and Tryp transcripts were detectable in both salivary glands and gut. The possibility of gene silencing in Sunn pest was also studied by topical application of cysteine double-stranded RNA (dsRNA). The results indicated that dsRNA applied topically to the fifth nymphal stage can penetrate the cuticle of the insect and induce RNA interference. The Cys gene mRNA transcript in the gut was reduced to 83.8% 2 days posttreatment. It was also found that dsRNA of the Cys gene affected fifth nymphal stage development, suggesting the involvement of this protease in insect growth, development, and molting. © 2015 Wiley Periodicals, Inc.
Petit, Daniel; Teppa, Elin; Mir, Anne-Marie; Vicogne, Dorothée; Thisse, Christine; Thisse, Bernard; Filloux, Cyril; Harduin-Lepers, Anne
2015-01-01
Sialyltransferases are responsible for the synthesis of a diverse range of sialoglycoconjugates predicted to be pivotal to deuterostomes’ evolution. In this work, we reconstructed the evolutionary history of the metazoan α2,3-sialyltransferases family (ST3Gal), a subset of sialyltransferases encompassing six subfamilies (ST3Gal I–ST3Gal VI) functionally characterized in mammals. Exploration of genomic and expressed sequence tag databases and search of conserved sialylmotifs led to the identification of a large data set of st3gal-related gene sequences. Molecular phylogeny and large scale sequence similarity network analysis identified four new vertebrate subfamilies called ST3Gal III-r, ST3Gal VII, ST3Gal VIII, and ST3Gal IX. To address the issue of the origin and evolutionary relationships of the st3gal-related genes, we performed comparative syntenic mapping of st3gal gene loci combined to ancestral genome reconstruction. The ten vertebrate ST3Gal subfamilies originated from genome duplication events at the base of vertebrates and are organized in three distinct and ancient groups of genes predating the early deuterostomes. Inferring st3gal gene family history identified also several lineage-specific gene losses, the significance of which was explored in a functional context. Toward this aim, spatiotemporal distribution of st3gal genes was analyzed in zebrafish and bovine tissues. In addition, molecular evolutionary analyses using specificity determining position and coevolved amino acid predictions led to the identification of amino acid residues with potential implication in functional divergence of vertebrate ST3Gal. We propose a detailed scenario of the evolutionary relationships of st3gal genes coupled to a conceptual framework of the evolution of ST3Gal functions. PMID:25534026
Identifying module biomarkers from gastric cancer by differential correlation network
Liu, Xiaoping; Chang, Xiao
2016-01-01
Gastric cancer (stomach cancer) is a severe disease caused by dysregulation of many functionally correlated genes or pathways instead of the mutation of individual genes. Systematic identification of gastric cancer biomarkers can provide insights into the mechanisms underlying this deadly disease and help in the development of new drugs. In this paper, we present a novel network-based approach to predict module biomarkers of gastric cancer that can effectively distinguish the disease from normal samples. Specifically, by assuming that gastric cancer has mainly resulted from dysfunction of biomolecular networks rather than individual genes in an organism, the genes in the module biomarkers are potentially related to gastric cancer. Finally, we identified a module biomarker with 27 genes, and by comparing the module biomarker with known gastric cancer biomarkers, we found that our module biomarker exhibited a greater ability to diagnose the samples with gastric cancer. PMID:27703371
RatMap--rat genome tools and data.
Petersen, Greta; Johnson, Per; Andersson, Lars; Klinga-Levan, Karin; Gómez-Fabre, Pedro M; Ståhl, Fredrik
2005-01-01
The rat genome database RatMap (http://ratmap.org or http://ratmap.gen.gu.se) has been one of the main resources for rat genome information since 1994. The database is maintained by CMB-Genetics at Goteborg University in Sweden and provides information on rat genes, polymorphic rat DNA-markers and rat quantitative trait loci (QTLs), all curated at RatMap. The database is under the supervision of the Rat Gene and Nomenclature Committee (RGNC); thus much attention is paid to rat gene nomenclature. RatMap presents information on rat idiograms, karyotypes and provides a unified presentation of the rat genome sequence and integrated rat linkage maps. A set of tools is also available to facilitate the identification and characterization of rat QTLs, as well as the estimation of exon/intron number and sizes in individual rat genes. Furthermore, comparative gene maps of rat in regard to mouse and human are provided.
RatMap—rat genome tools and data
Petersen, Greta; Johnson, Per; Andersson, Lars; Klinga-Levan, Karin; Gómez-Fabre, Pedro M.; Ståhl, Fredrik
2005-01-01
The rat genome database RatMap (http://ratmap.org or http://ratmap.gen.gu.se) has been one of the main resources for rat genome information since 1994. The database is maintained by CMB–Genetics at Göteborg University in Sweden and provides information on rat genes, polymorphic rat DNA-markers and rat quantitative trait loci (QTLs), all curated at RatMap. The database is under the supervision of the Rat Gene and Nomenclature Committee (RGNC); thus much attention is paid to rat gene nomenclature. RatMap presents information on rat idiograms, karyotypes and provides a unified presentation of the rat genome sequence and integrated rat linkage maps. A set of tools is also available to facilitate the identification and characterization of rat QTLs, as well as the estimation of exon/intron number and sizes in individual rat genes. Furthermore, comparative gene maps of rat in regard to mouse and human are provided. PMID:15608244
Bayesian median regression for temporal gene expression data
NASA Astrophysics Data System (ADS)
Yu, Keming; Vinciotti, Veronica; Liu, Xiaohui; 't Hoen, Peter A. C.
2007-09-01
Most of the existing methods for the identification of biologically interesting genes in a temporal expression profiling dataset do not fully exploit the temporal ordering in the dataset and are based on normality assumptions for the gene expression. In this paper, we introduce a Bayesian median regression model to detect genes whose temporal profile is significantly different across a number of biological conditions. The regression model is defined by a polynomial function where both time and condition effects as well as interactions between the two are included. MCMC-based inference returns the posterior distribution of the polynomial coefficients. From this a simple Bayes factor test is proposed to test for significance. The estimation of the median rather than the mean, and within a Bayesian framework, increases the robustness of the method compared to a Hotelling T2-test previously suggested. This is shown on simulated data and on muscular dystrophy gene expression data.
Jeffrey R. Row; Kevin E. Doherty; Todd B. Cross; Michael K. Schwartz; Sara Oyler-McCance; Dave E. Naugle; Steven T. Knick; Bradley C. Fedy
2018-01-01
Functional connectivity, quantified using landscape genetics, can inform conservation through the identification of factors linking genetic structure to landscape mechanisms. We used breeding habitat metrics, landscape attributes and indices of grouse abundance, to compare fit between structural connectivity and genetic differentiation within five long-established Sage...
Ou, Hong-Yu; He, Xinyi; Harrison, Ewan M.; Kulasekara, Bridget R.; Thani, Ali Bin; Kadioglu, Aras; Lory, Stephen; Hinton, Jay C. D.; Barer, Michael R.; Rajakumar, Kumar
2007-01-01
MobilomeFINDER (http://mml.sjtu.edu.cn/MobilomeFINDER) is an interactive online tool that facilitates bacterial genomic island or ‘mobile genome’ (mobilome) discovery; it integrates the ArrayOme and tRNAcc software packages. ArrayOme utilizes a microarray-derived comparative genomic hybridization input data set to generate ‘inferred contigs’ produced by merging adjacent genes classified as ‘present’. Collectively these ‘fragments’ represent a hypothetical ‘microarray-visualized genome (MVG)’. ArrayOme permits recognition of discordances between physical genome and MVG sizes, thereby enabling identification of strains rich in microarray-elusive novel genes. Individual tRNAcc tools facilitate automated identification of genomic islands by comparative analysis of the contents and contexts of tRNA sites and other integration hotspots in closely related sequenced genomes. Accessory tools facilitate design of hotspot-flanking primers for in silico and/or wet-science-based interrogation of cognate loci in unsequenced strains and analysis of islands for features suggestive of foreign origins; island-specific and genome-contextual features are tabulated and represented in schematic and graphical forms. To date we have used MobilomeFINDER to analyse several Enterobacteriaceae, Pseudomonas aeruginosa and Streptococcus suis genomes. MobilomeFINDER enables high-throughput island identification and characterization through increased exploitation of emerging sequence data and PCR-based profiling of unsequenced test strains; subsequent targeted yeast recombination-based capture permits full-length sequencing and detailed functional studies of novel genomic islands. PMID:17537813
Cell type-selective disease-association of genes under high regulatory load.
Galhardo, Mafalda; Berninger, Philipp; Nguyen, Thanh-Phuong; Sauter, Thomas; Sinkkonen, Lasse
2015-10-15
We previously showed that disease-linked metabolic genes are often under combinatorial regulation. Using the genome-wide ChIP-Seq binding profiles for 93 transcription factors in nine different cell lines, we show that genes under high regulatory load are significantly enriched for disease-association across cell types. We find that transcription factor load correlates with the enhancer load of the genes and thereby allows the identification of genes under high regulatory load by epigenomic mapping of active enhancers. Identification of the high enhancer load genes across 139 samples from 96 different cell and tissue types reveals a consistent enrichment for disease-associated genes in a cell type-selective manner. The underlying genes are not limited to super-enhancer genes and show several types of disease-association evidence beyond genetic variation (such as biomarkers). Interestingly, the high regulatory load genes are involved in more KEGG pathways than expected by chance, exhibit increased betweenness centrality in the interaction network of liver disease genes, and carry longer 3' UTRs with more microRNA (miRNA) binding sites than genes on average, suggesting a role as hubs integrating signals within regulatory networks. In summary, epigenetic mapping of active enhancers presents a promising and unbiased approach for identification of novel disease genes in a cell type-selective manner. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Hofer, Peter; Boeszoermenyi, Andras; Jaeger, Doris; Feiler, Ursula; Arthanari, Haribabu; Mayer, Nicole; Zehender, Fabian; Rechberger, Gerald; Oberer, Monika; Zimmermann, Robert; Lass, Achim; Haemmerle, Guenter; Breinbauer, Rolf; Zechner, Rudolf; Preiss-Landl, Karina
2015-01-01
The coordinated breakdown of intracellular triglyceride (TG) stores requires the exquisitely regulated interaction of lipolytic enzymes with regulatory, accessory, and scaffolding proteins. Together they form a dynamic multiprotein network designated as the “lipolysome.” Adipose triglyceride lipase (Atgl) catalyzes the initiating step of TG hydrolysis and requires comparative gene identification-58 (Cgi-58) as a potent activator of enzyme activity. Here, we identify adipocyte-type fatty acid-binding protein (A-Fabp) and other members of the fatty acid-binding protein (Fabp) family as interaction partners of Cgi-58. Co-immunoprecipitation, microscale thermophoresis, and solid phase assays proved direct protein/protein interaction between A-Fabp and Cgi-58. Using nuclear magnetic resonance titration experiments and site-directed mutagenesis, we located a potential contact region on A-Fabp. In functional terms, A-Fabp stimulates Atgl-catalyzed TG hydrolysis in a Cgi-58-dependent manner. Additionally, transcriptional transactivation assays with a luciferase reporter system revealed that Fabps enhance the ability of Atgl/Cgi-58-mediated lipolysis to induce the activity of peroxisome proliferator-activated receptors. Our studies identify Fabps as crucial structural and functional components of the lipolysome. PMID:25953897
Buchan, Blake W.; Ginocchio, Christine C.; Manii, Ryhana; Cavagnolo, Robert; Pancholi, Preeti; Swyers, Lettie; Thomson, Richard B.; Anderson, Christopher; Kaul, Karen; Ledeboer, Nathan A.
2013-01-01
Background A multicenter study was conducted to evaluate the diagnostic accuracy (sensitivity and specificity) of the Verigene Gram-Positive Blood Culture Test (BC-GP) test to identify 12 Gram-positive bacterial gene targets and three genetic resistance determinants directly from positive blood culture broths containing Gram-positive bacteria. Methods and Findings 1,252 blood cultures containing Gram-positive bacteria were prospectively collected and tested at five clinical centers between April, 2011 and January, 2012. An additional 387 contrived blood cultures containing uncommon targets (e.g., Listeria spp., S. lugdunensis, vanB-positive Enterococci) were included to fully evaluate the performance of the BC-GP test. Sensitivity and specificity for the 12 specific genus or species targets identified by the BC-GP test ranged from 92.6%–100% and 95.4%–100%, respectively. Identification of the mecA gene in 599 cultures containing S. aureus or S. epidermidis was 98.6% sensitive and 94.3% specific compared to cefoxitin disk method. Identification of the vanA gene in 81 cultures containing Enterococcus faecium or E. faecalis was 100% sensitive and specific. Approximately 7.5% (87/1,157) of single-organism cultures contained Gram-positive bacteria not present on the BC-GP test panel. In 95 cultures containing multiple organisms the BC-GP test was in 71.6% (68/95) agreement with culture results. Retrospective analysis of 107 separate blood cultures demonstrated that identification of methicillin resistant S. aureus and vancomycin resistant Enterococcus spp. was completed an average of 41.8 to 42.4 h earlier using the BC-GP test compared to routine culture methods. The BC-GP test was unable to assign mecA to a specific organism in cultures containing more than one Staphylococcus isolate and does not identify common blood culture contaminants such as Micrococcus, Corynebacterium, and Bacillus. 
Conclusions The BC-GP test is a multiplex test capable of detecting most leading causes of Gram-positive bacterial blood stream infections as well as genetic markers of methicillin and vancomycin resistance directly from positive blood cultures. PMID:23843749
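The accuracy figures in the abstract above follow the standard definitions of sensitivity and specificity. The sketch below restates those definitions and re-derives the two percentages for which the abstract gives raw counts (87/1,157 and 68/95). The function names are hypothetical, and the confusion-matrix counts in the last two lines are invented purely to show the calculation; they are not data from the study.

```python
# Minimal sketch of the diagnostic-accuracy arithmetic; names are illustrative only.

def sensitivity(true_pos: int, false_neg: int) -> float:
    """Fraction of reference-positive samples that the test calls positive."""
    return true_pos / (true_pos + false_neg)

def specificity(true_neg: int, false_pos: int) -> float:
    """Fraction of reference-negative samples that the test calls negative."""
    return true_neg / (true_neg + false_pos)

def percent(numerator: int, denominator: int) -> float:
    """Express a proportion as a percentage rounded to one decimal place."""
    return round(100 * numerator / denominator, 1)

if __name__ == "__main__":
    # Counts quoted in the abstract.
    print(percent(87, 1157))  # single-organism cultures with off-panel organisms
    print(percent(68, 95))    # multi-organism cultures in agreement with culture
    # Hypothetical counts, for illustration only (not from the study).
    print(sensitivity(true_pos=9, false_neg=1))
    print(specificity(true_neg=19, false_pos=1))
```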
Moody, Michael L; Rieseberg, Loren H
2012-07-01
The annual sunflowers (Helianthus sect. Helianthus) present a formidable challenge for phylogenetic inference because of ancient hybrid speciation, recent introgression, and suspected issues with deep coalescence. Here we analyze sequence data from 11 nuclear DNA (nDNA) genes for multiple genotypes of species within the section to (1) reconstruct the phylogeny of this group, (2) explore the utility of nDNA gene trees for detecting hybrid speciation and introgression; and (3) test an empirical method of hybrid identification based on the phylogenetic congruence of nDNA gene trees from tightly linked genes. We uncovered considerable topological heterogeneity among gene trees with or without three previously identified hybrid species included in the analyses, as well as a general lack of reciprocal monophyly of species. Nonetheless, partitioned Bayesian analyses provided strong support for the reciprocal monophyly of all species except H. annuus (0.89 PP), the most widespread and abundant annual sunflower. Previous hypotheses of relationships among taxa were generally strongly supported (1.0 PP), except among taxa typically associated with H. annuus, apparently due to the paraphyly of the latter in all gene trees. While the individual nDNA gene trees provided a useful means for detecting recent hybridization, identification of ancient hybridization was problematic for all ancient hybrid species, even when linkage was considered. We discuss biological factors that affect the efficacy of phylogenetic methods for hybrid identification.
Sakai, Kanae; Komaki, Hisayuki; Gonoi, Tohru
2015-01-01
Nocardithiocin is a thiopeptide compound isolated from the opportunistic pathogen Nocardia pseudobrasiliensis. It shows a strong activity against acid-fast bacteria and is also active against rifampicin-resistant Mycobacterium tuberculosis. Here, we report the identification of the nocardithiocin gene cluster in N. pseudobrasiliensis IFM 0761 based on conserved thiopeptide biosynthesis gene sequence and the whole genome sequence. The predicted gene cluster was confirmed by gene disruption and complementation. As expected, strains containing the disrupted gene did not produce nocardithiocin while gene complementation restored nocardithiocin production in these strains. The predicted cluster was further analyzed using RNA-seq which showed that the nocardithiocin gene cluster contains 12 genes within a 15.2-kb region. This finding will promote the improvement of nocardithiocin productivity and its derivatives production. PMID:26588225
rpoB Gene Sequencing for Identification of Corynebacterium Species
Khamis, Atieh; Raoult, Didier; La Scola, Bernard
2004-01-01
The genus Corynebacterium is a heterogeneous group of species comprising human and animal pathogens and environmental bacteria. It is defined on the basis of several phenotypic characters and the results of DNA-DNA relatedness and, more recently, 16S rRNA gene sequencing. However, the 16S rRNA gene is not polymorphic enough to ensure reliable phylogenetic studies and needs to be completely sequenced for accurate identification. The almost complete rpoB sequences of 56 Corynebacterium species were determined by both PCR and genome walking methods. In all cases the percent similarities between different species were lower than those observed by 16S rRNA gene sequencing, even for those species with degrees of high similarity. Several clusters supported by high bootstrap values were identified. In order to propose a method for strain identification which does not require sequencing of the complete rpoB sequence (approximately 3,500 bp), we identified an area with a high degree of polymorphism, bordered by conserved sequences that can be used as universal primers for PCR amplification and sequencing. The sequence of this fragment (434 to 452 bp) allows accurate species identification and may be used in the future for routine sequence-based identification of Corynebacterium species. PMID:15364970
Comparative modular analysis of gene expression in vertebrate organs.
Piasecka, Barbara; Kutalik, Zoltán; Roux, Julien; Bergmann, Sven; Robinson-Rechavi, Marc
2012-03-29
The degree of conservation of gene expression between homologous organs largely remains an open question. Several recent studies reported some evidence in favor of such conservation. Most studies compute organs' similarity across all orthologous genes, whereas the expression levels of many genes are not informative about organ specificity. Here, we use a modularization algorithm to overcome this limitation through the identification of inter-species co-modules of organs and genes. We identify such co-modules using mouse and human microarray expression data. They are functionally coherent both in terms of genes and of organs from both organisms. We show that a large proportion of genes belonging to the same co-module are orthologous between mouse and human. Moreover, their zebrafish orthologs also tend to be expressed in the corresponding homologous organs. Notable exceptions to the general pattern of conservation are the testis and the olfactory bulb. Interestingly, some co-modules consist of single organs, while others combine several functionally related organs. For instance, amygdala, cerebral cortex, hypothalamus and spinal cord form a clearly discernible unit of expression, both in mouse and human. Our study provides a new framework for comparative analysis which will be applicable also to other sets of large-scale phenotypic data collected across different species.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dichgans, M.; Mayer, M.; Straube, A.
1996-02-15
This article reports on new information regarding the genetic mapping of the human CADASIL gene region. Previously, the gene had been mapped to human chromosome 19q12. Using the identification of a chromosomal crossover, the region has been refined to an 8-cM interval. 11 refs., 2 figs., 1 tab.
Large Scale Single Nucleotide Polymorphism Study of PD Susceptibility
2005-03-01
identification of eight genetic loci in the familial PD, the results of intensive investigations of polymorphisms in dozens of genes related to sporadic, late...1) investigate the association between classical, sporadic PD and 2386 SNPs in 23 genes implicated in the pathogenesis of PD; (2) construct...addition, experiences derived from this study may be applied in other complex disorders for the identification of susceptibility genes, as well as in genome
Loots, Gabriela G
2008-01-01
Despite remarkable recent advances in genomics that have enabled us to identify most of the genes in the human genome, comparable efforts to define transcriptional cis-regulatory elements that control gene expression are lagging behind. The difficulty of this task stems from two equally important problems: our knowledge of how regulatory elements are encoded in genomes remains elementary, and there is a vast genomic search space for regulatory elements, since most of mammalian genomes are noncoding. Comparative genomic approaches are having a remarkable impact on the study of transcriptional regulation in eukaryotes and currently represent the most efficient and reliable methods of predicting noncoding sequences likely to control the patterns of gene expression. By subjecting eukaryotic genomic sequences to computational comparisons and subsequent experimentation, we are inching our way toward a more comprehensive catalog of common regulatory motifs that lie behind fundamental biological processes. We are still far from comprehending how the transcriptional regulatory code is encrypted in the human genome and providing an initial global view of regulatory gene networks, but collectively, the continued development of comparative and experimental approaches will rapidly expand our knowledge of the transcriptional regulome.
Corbin, Cyrielle; Drouet, Samantha; Markulin, Lucija; Auguin, Daniel; Lainé, Éric; Davin, Laurence B; Cort, John R; Lewis, Norman G; Hano, Christophe
2018-05-01
Identification of DIR encoding genes in flax genome. Analysis of phylogeny, gene/protein structures and evolution. Identification of new conserved motifs linked to biochemical functions. Investigation of spatio-temporal gene expression and response to stress. Dirigent proteins (DIRs) were discovered during 8-8' lignan biosynthesis studies, through identification of stereoselective coupling to afford either (+)- or (-)-pinoresinols from E-coniferyl alcohol. DIRs are also involved or potentially involved in terpenoid, allyl/propenyl phenol lignan, pterocarpan and lignin biosynthesis. DIRs have very large multigene families in different vascular plants including flax, with most still of unknown function. DIR studies typically focus on a small subset of genes and identification of biochemical/physiological functions. Herein, a genome-wide analysis and characterization of the predicted flax DIR 44-membered multigene family was performed, this species being a rich natural grain source of 8-8' linked secoisolariciresinol-derived lignan oligomers. All predicted DIR sequences, including their promoters, were analyzed together with their public gene expression datasets. Expression patterns of selected DIRs were examined using qPCR, as well as through clustering analysis of DIR gene expression. These analyses further implicated roles for specific DIRs in (-)-pinoresinol formation in seed-coats, as well as (+)-pinoresinol in vegetative organs and/or specific responses to stress. Phylogeny and gene expression analysis segregated flax DIRs into six distinct clusters with new cluster-specific motifs identified. We propose that these findings can serve as a foundation to further systematically determine functions of DIRs, i.e. other than those already known in lignan biosynthesis in flax and other species. 
Given the differential expression profiles and inducibility of the flax DIR family, we provisionally propose that some DIR genes of unknown function could be involved in different aspects of secondary cell wall biosynthesis and plant defense.
Wojewoda, Christina M.; Sercia, Linda; Navas, Maria; Tuohy, Marion; Wilson, Deborah; Hall, Geraldine S.; Procop, Gary W.
2013-01-01
Rapid identification of pathogens from blood cultures can decrease lengths of stay and improve patient outcomes. We evaluated the accuracy of the Verigene Gram-positive blood culture (BC-GP) nucleic acid test for investigational use only (Nanosphere, Inc., Northbrook, IL) for the identification of Gram-positive bacteria from blood cultures. The detection of resistance genes (mecA in Staphylococcus aureus and Staphylococcus epidermidis and vanA or vanB in Enterococcus faecium and Enterococcus faecalis) by the BC-GP assay also was assessed. A total of 186 positive blood cultures (in BacT/Alert FA bottles) with Gram-positive cocci observed with Gram staining were analyzed using the BC-GP assay. The BC-GP results were compared with the identification and susceptibility profiles obtained with routine methods in the clinical laboratory. Discordant results were arbitrated with additional biochemical, cefoxitin disk, and repeat BC-GP testing. The initial BC-GP organism identification was concordant with routine method results for 94.6% of the blood cultures. Only 40% of the Streptococcus pneumoniae identifications were correct. The detection of the mecA gene for 69 blood cultures with only S. aureus or S. epidermidis was concordant with susceptibility testing results. For 3 of 6 cultures with multiple Staphylococcus spp., mecA detection was reported but was correlated with oxacillin resistance in a species other than S. aureus or S. epidermidis. The detection of vanA agreed with susceptibility testing results for 45 of 46 cultures with E. faecalis or E. faecium. Comparison of the mean times to results for each organism group showed that BC-GP results were available 31 to 42 h earlier than phenotypic identifications and 41 to 50 h earlier than susceptibility results. PMID:23596240
An overview to the investigative approach to species testing in wildlife forensic science
2011-01-01
The extent of wildlife crime is unknown but it is on the increase and has observable effects with the dramatic decline in many species of flora and fauna. The growing awareness of this area of criminal activity is reflected in the increase in research papers on animal DNA testing, either for the identification of species or for the genetic linkage of a sample to a particular organism. This review focuses on the use of species testing in wildlife crime investigations. Species identification relies primarily on genetic loci within the mitochondrial genome; focusing on the cytochrome b and cytochrome oxidase 1 genes. The use of cytochrome b gained early prominence in species identification through its use in taxonomic and phylogenetic studies, while the gene sequence for cytochrome oxidase was adopted by the Barcode for Life research group. This review compares how these two loci are used in species identification with respect to wildlife crime investigations. As more forensic science laboratories undertake work in the wildlife area, it is important that the quality of work is of the highest standard and that the conclusions reached are based on scientific principles. A key issue in reporting on the identification of a particular species is a knowledge of both the intraspecies variation and the possible overlap of sequence variation from one species to that of a closely related species. Recent data showing this degree of genetic separation in mammalian species will allow greater confidence when preparing a report on an alleged event where the identification of the species is of prime importance. The aim of this review is to illustrate aspects of species testing in wildlife forensic science and to explain how a knowledge of genetic variation at the genus and species level can aid in the reporting of results. PMID:21232099
Wojewoda, Christina M; Sercia, Linda; Navas, Maria; Tuohy, Marion; Wilson, Deborah; Hall, Geraldine S; Procop, Gary W; Richter, Sandra S
2013-07-01
Rapid identification of pathogens from blood cultures can decrease lengths of stay and improve patient outcomes. We evaluated the accuracy of the Verigene Gram-positive blood culture (BC-GP) nucleic acid test for investigational use only (Nanosphere, Inc., Northbrook, IL) for the identification of Gram-positive bacteria from blood cultures. The detection of resistance genes (mecA in Staphylococcus aureus and Staphylococcus epidermidis and vanA or vanB in Enterococcus faecium and Enterococcus faecalis) by the BC-GP assay also was assessed. A total of 186 positive blood cultures (in BacT/Alert FA bottles) with Gram-positive cocci observed with Gram staining were analyzed using the BC-GP assay. The BC-GP results were compared with the identification and susceptibility profiles obtained with routine methods in the clinical laboratory. Discordant results were arbitrated with additional biochemical, cefoxitin disk, and repeat BC-GP testing. The initial BC-GP organism identification was concordant with routine method results for 94.6% of the blood cultures. Only 40% of the Streptococcus pneumoniae identifications were correct. The detection of the mecA gene for 69 blood cultures with only S. aureus or S. epidermidis was concordant with susceptibility testing results. For 3 of 6 cultures with multiple Staphylococcus spp., mecA detection was reported but was correlated with oxacillin resistance in a species other than S. aureus or S. epidermidis. The detection of vanA agreed with susceptibility testing results for 45 of 46 cultures with E. faecalis or E. faecium. Comparison of the mean times to results for each organism group showed that BC-GP results were available 31 to 42 h earlier than phenotypic identifications and 41 to 50 h earlier than susceptibility results.
Zwaenepoel, Arthur; Diels, Tim; Amar, David; Van Parys, Thomas; Shamir, Ron; Van de Peer, Yves; Tzfadia, Oren
2018-01-01
Recent times have seen an enormous growth of "omics" data, of which high-throughput gene expression data are arguably the most important from a functional perspective. Despite huge improvements in computational techniques for the functional classification of gene sequences, common similarity-based methods often fall short of providing full and reliable functional information. Recently, the combination of comparative genomics with approaches in functional genomics has received considerable interest for gene function analysis, leveraging both gene expression based guilt-by-association methods and annotation efforts in closely related model organisms. Besides the identification of missing genes in pathways, these methods also typically enable the discovery of biological regulators (i.e., transcription factors or signaling genes). A previously built guilt-by-association method is MORPH, which was proven to be an efficient algorithm that performs particularly well in identifying and prioritizing missing genes in plant metabolic pathways. Here, we present MorphDB, a resource where MORPH-based candidate genes for large-scale functional annotations (Gene Ontology, MapMan bins) are integrated across multiple plant species. Besides a gene centric query utility, we present a comparative network approach that enables researchers to efficiently browse MORPH predictions across functional gene sets and species, facilitating efficient gene discovery and candidate gene prioritization. MorphDB is available at http://bioinformatics.psb.ugent.be/webtools/morphdb/morphDB/index/. We also provide a toolkit, named "MORPH bulk" (https://github.com/arzwa/morph-bulk), for running MORPH in bulk mode on novel data sets, enabling researchers to apply MORPH to their own species of interest.
Zou, Shanmei; Fei, Cong; Wang, Chun; Gao, Zhan; Bao, Yachao; He, Meilin; Wang, Changhai
2016-01-01
Microalgae identification is extremely difficult. The efficiency of DNA barcoding for microalgae identification depends on the gene markers and approaches employed, which are still being worked out. Although Scenedesmus has been widely studied for lipid production, its identification is difficult. Here we present a comprehensive coalescent, distance and character-based DNA barcoding analysis of 118 Scenedesmus strains based on rbcL, tufA, ITS and 16S. The four genes and their combined data sets (rbcL + tufA + ITS + 16S, rbcL + tufA, and ITS + 16S) were each analyzed by GMYC, P ID, PTP, ABGD, and character-based barcoding. The three combined data sets showed a higher proportion of resolution success than the single genes. In comparison, the GMYC and PTP analyses produced more taxonomic lineages. ABGD gave varying resolution in discrimination among the single and combined data sets. Character-based barcoding proved to be the most effective approach for species discrimination in both single and combined data, producing consistent species identification. The integrated results recovered 11 species, five of which were revealed as potential cryptic species. We suggest that character-based DNA barcoding, together with other approaches based on multiple genes and their combined data, could be more effective in revealing microalgae diversity. PMID:27827440
Lamy, Brigitte; Kodjo, Angeli; Laurent, Frédéric
2011-09-01
We evaluated the accuracy of matrix-assisted laser desorption/ionization time-of-flight mass spectrometry for identifying aeromonads with an extraction procedure. Genus-level accuracy was 100%. Compared to rpoB gene sequencing, species-level accuracy was 90.6% (29/32) for type and reference strains and 91.4% for a collection of 139 clinical and environmental isolates, making this system one of the most accurate and rapid methods for phenotypic identification. The reliability of this technique was very promising, although some improvements in database composition, taxonomy, and discriminatory power are needed. Copyright © 2011 Elsevier Inc. All rights reserved.
RNA-Seq Based Transcriptional Map of Bovine Respiratory Disease Pathogen “Histophilus somni 2336”
Kumar, Ranjit; Lawrence, Mark L.; Watt, James; Cooksey, Amanda M.; Burgess, Shane C.; Nanduri, Bindu
2012-01-01
Genome structural annotation, i.e., identification and demarcation of the boundaries for all the functional elements in a genome (e.g., genes, non-coding RNAs, proteins and regulatory elements), is a prerequisite for systems level analysis. Current genome annotation programs do not identify all of the functional elements of the genome, especially small non-coding RNAs (sRNAs). Whole genome transcriptome analysis is a complementary method to identify “novel” genes, small RNAs, regulatory regions, and operon structures, thus improving the structural annotation in bacteria. In particular, the identification of non-coding RNAs has revealed their widespread occurrence and functional importance in gene regulation, stress and virulence. However, very little is known about non-coding transcripts in Histophilus somni, one of the causative agents of Bovine Respiratory Disease (BRD) as well as bovine infertility, abortion, septicemia, arthritis, myocarditis, and thrombotic meningoencephalitis. In this study, we report a single nucleotide resolution transcriptome map of H. somni strain 2336 using RNA-Seq method. The RNA-Seq based transcriptome map identified 94 sRNAs in the H. somni genome of which 82 sRNAs were never predicted or reported in earlier studies. We also identified 38 novel potential protein coding open reading frames that were absent in the current genome annotation. The transcriptome map allowed the identification of 278 operon (total 730 genes) structures in the genome. When compared with the genome sequence of a non-virulent strain 129Pt, a disproportionate number of sRNAs (∼30%) were located in genomic region unique to strain 2336 (∼18% of the total genome). This observation suggests that a number of the newly identified sRNAs in strain 2336 may be involved in strain-specific adaptations. PMID:22276113
Ries, David; Holtgräwe, Daniela; Viehöver, Prisca; Weisshaar, Bernd
2016-03-15
The combination of bulk segregant analysis (BSA) and next generation sequencing (NGS), also known as mapping by sequencing (MBS), has been shown to significantly accelerate the identification of causal mutations for species with a reference genome sequence. The usual approach is to cross homozygous parents that differ for the monogenic trait to address, to perform deep sequencing of DNA from F2 plants pooled according to their phenotype, and subsequently to analyze the allele frequency distribution based on a marker table for the parents studied. The method has been successfully applied for EMS induced mutations as well as natural variation. Here, we show that pooling genetically diverse breeding lines according to a contrasting phenotype also allows high resolution mapping of the causal gene in a crop species. The test case was the monogenic locus causing red vs. green hypocotyl color in Beta vulgaris (R locus). We determined the allele frequencies of polymorphic sequences using sequence data from two diverging phenotypic pools of 180 B. vulgaris accessions each. A single interval of about 31 kbp among the nine chromosomes was identified which indeed contained the causative mutation. By applying a variation of the mapping by sequencing approach, we demonstrated that phenotype-based pooling of diverse accessions from breeding panels and subsequent direct determination of the allele frequency distribution can be successfully applied for gene identification in a crop species. Our approach made it possible to identify a small interval around the causative gene. Sequencing of parents or individual lines was not necessary. Whenever the appropriate plant material is available, the approach described saves time compared to the generation of an F2 population. In addition, we provide clues for planning similar experiments with regard to pool size and the sequencing depth required.
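The pool-based mapping idea described above can be illustrated with a small sketch. It assumes, hypothetically, that each polymorphic site comes with alternate-allele and total read counts from the two phenotypic pools. The `Site` record, the window size, and the toy counts are illustrative assumptions and not the authors' pipeline. The underlying idea is that the allele frequency difference between the pools should peak near the causal locus.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Site:
    """Read counts at one polymorphic position (hypothetical record layout)."""
    position: int
    pool_a_alt: int    # reads carrying the alternate allele in pool A
    pool_a_total: int  # all reads covering the site in pool A
    pool_b_alt: int
    pool_b_total: int


def delta_allele_frequency(site: Site) -> float:
    """Absolute difference in alternate-allele frequency between the two pools."""
    freq_a = site.pool_a_alt / site.pool_a_total
    freq_b = site.pool_b_alt / site.pool_b_total
    return abs(freq_a - freq_b)


def best_window(sites: List[Site], window: int) -> Tuple[int, int, float]:
    """Return (start, end, mean delta) for the run of `window` consecutive
    sites with the highest mean allele frequency difference."""
    if window < 1 or len(sites) < window:
        raise ValueError("need at least `window` sites and window >= 1")
    ordered = sorted(sites, key=lambda s: s.position)
    deltas = [delta_allele_frequency(s) for s in ordered]
    best = None
    for i in range(len(ordered) - window + 1):
        mean = sum(deltas[i:i + window]) / window
        if best is None or mean > best[2]:
            best = (ordered[i].position, ordered[i + window - 1].position, mean)
    return best


# Toy data (invented): the pools diverge strongly around positions 300-400.
toy_sites = [
    Site(100, 50, 100, 48, 100),
    Site(200, 60, 100, 40, 100),
    Site(300, 95, 100, 5, 100),
    Site(400, 98, 100, 4, 100),
    Site(500, 55, 100, 50, 100),
]
start, end, mean_delta = best_window(toy_sites, window=2)
```

On the toy data the best two-site window spans positions 300 to 400. A real analysis would also need to account for sequencing depth and pool size, which the abstract notes as planning considerations.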
Cefalù, Angelo B; Spina, Rossella; Noto, Davide; Ingrassia, Valeria; Valenti, Vincenza; Giammanco, Antonina; Fayer, Francesca; Misiano, Gabriella; Cocorullo, Gianfranco; Scrimali, Chiara; Palesano, Ornella; Altieri, Grazia I; Ganci, Antonina; Barbagallo, Carlo M; Averna, Maurizio R
Severe hypertriglyceridemia (HTG) may result from mutations in genes affecting the intravascular lipolysis of triglyceride (TG)-rich lipoproteins. The aim of this study was to develop a targeted next-generation sequencing panel for the molecular diagnosis of disorders characterized by severe HTG. We developed a customized targeted panel for next-generation sequencing on the Ion Torrent Personal Genome Machine to capture the coding exons and intron/exon boundaries of 18 genes affecting the main pathways of TG synthesis and metabolism. We sequenced samples from 11 patients with severe HTG (TG >885 mg/dL, i.e., 10 mmol/L): 4 positive controls in whom pathogenic mutations had previously been identified by Sanger sequencing and 7 patients in whom the molecular defect was still unknown. The customized panel was accurate and confirmed the genetic variants previously identified in all positive controls with primary severe HTG. Only 1 of the 7 patients with an unknown molecular defect was found to carry a homozygous pathogenic mutation, the third novel mutation of the LMF1 gene (c.1380C>G, p.Y460X). Clinical and molecular familial cascade screening identified 2 additional affected siblings and 7 heterozygous carriers of the mutation. Our targeted resequencing approach for genetic diagnosis of severe HTG appears to be accurate, less time consuming, and more economical than traditional Sanger resequencing. The identification of pathogenic mutations in candidate genes remains challenging, and clinical resequencing should be intended mainly for patients with strong clinical criteria for monogenic severe HTG. Copyright © 2017 National Lipid Association. Published by Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Catfish Genome Consortium; Wang, Shaolin; Peatman, Eric
2010-03-23
Background-Through the Community Sequencing Program, a catfish EST sequencing project was carried out through a collaboration between the catfish research community and the Department of Energy's Joint Genome Institute. Prior to this project, only a limited EST resource from catfish was available for the purpose of SNP identification. Results-A total of 438,321 quality ESTs were generated from 8 channel catfish (Ictalurus punctatus) and 4 blue catfish (Ictalurus furcatus) libraries, bringing the number of catfish ESTs to nearly 500,000. Assembly of all catfish ESTs resulted in 45,306 contigs and 66,272 singletons. Over 35 percent of the unique sequences had significant similarities to known genes, allowing the identification of 14,776 unique genes in catfish. Over 300,000 putative SNPs have been identified, of which approximately 48,000 are high-quality SNPs identified from contigs with at least four sequences and the minor allele presence of at least two sequences in the contig. The EST resource should be valuable for identification of microsatellites, genome annotation, large-scale expression analysis, and comparative genome analysis. Conclusions-This project generated a large EST resource for catfish that captured the majority of the catfish transcriptome. The parallel analysis of ESTs from two closely related Ictalurid catfishes should also provide powerful means for the evaluation of ancient and recent gene duplications, and for the development of high-density microarrays in catfish. The inter- and intra-specific SNPs identified from all catfish EST dataset assembly will greatly benefit the catfish introgression breeding program and whole genome association studies.
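The high-quality SNP criterion stated in the abstract above (contigs with at least four sequences, and the minor allele present in at least two of them) can be written as a short filter. This is a minimal sketch only: the function name and the representation of one contig column as a list of base calls are assumptions made for illustration, not the consortium's actual pipeline.

```python
from collections import Counter
from typing import Sequence


def is_high_quality_snp(base_calls: Sequence[str],
                        min_depth: int = 4,
                        min_minor: int = 2) -> bool:
    """Apply the two thresholds to the base calls observed at one contig position.

    base_calls holds one base per EST sequence covering the position
    (a hypothetical input format).
    """
    if len(base_calls) < min_depth:
        return False  # contig too shallow at this position
    counts = Counter(base_calls).most_common()
    if len(counts) < 2:
        return False  # only one allele observed, so not a SNP
    minor_count = counts[1][1]  # count of the second most frequent allele
    return minor_count >= min_minor


# Toy columns (invented) showing each outcome.
supported = is_high_quality_snp(["A", "A", "G", "G"])     # depth 4, minor allele seen twice
singleton = is_high_quality_snp(["A", "A", "A", "G"])     # minor allele seen only once
shallow = is_high_quality_snp(["A", "G", "G"])            # fewer than four sequences
monomorphic = is_high_quality_snp(["A", "A", "A", "A", "A"])
```

Only the first toy column passes. Requiring the minor allele in at least two sequences guards against counting a single sequencing error as a SNP.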
Singh, Param Priya; Arora, Jatin; Isambert, Hervé
2015-07-01
Whole genome duplications (WGD) have now been firmly established in all major eukaryotic kingdoms. In particular, all vertebrates descend from two rounds of WGDs, that occurred in their jawless ancestor some 500 MY ago. Paralogs retained from WGD, also coined 'ohnologs' after Susumu Ohno, have been shown to be typically associated with development, signaling and gene regulation. Ohnologs, which amount to about 20 to 35% of genes in the human genome, have also been shown to be prone to dominant deleterious mutations and frequently implicated in cancer and genetic diseases. Hence, identifying ohnologs is central to better understand the evolution of vertebrates and their susceptibility to genetic diseases. Early computational analyses to identify vertebrate ohnologs relied on content-based synteny comparisons between the human genome and a single invertebrate outgroup genome or within the human genome itself. These approaches are thus limited by lineage specific rearrangements in individual genomes. We report, in this study, the identification of vertebrate ohnologs based on the quantitative assessment and integration of synteny conservation between six amniote vertebrates and six invertebrate outgroups. Such a synteny comparison across multiple genomes is shown to enhance the statistical power of ohnolog identification in vertebrates compared to earlier approaches, by overcoming lineage specific genome rearrangements. Ohnolog gene families can be browsed and downloaded for three statistical confidence levels or recompiled for specific, user-defined, significance criteria at http://ohnologs.curie.fr/. In the light of the importance of WGD on the genetic makeup of vertebrates, our analysis provides a useful resource for researchers interested in gaining further insights on vertebrate evolution and genetic diseases.
Singh, Param Priya; Arora, Jatin; Isambert, Hervé
2015-01-01
Whole genome duplications (WGD) have now been firmly established in all major eukaryotic kingdoms. In particular, all vertebrates descend from two rounds of WGDs, that occurred in their jawless ancestor some 500 MY ago. Paralogs retained from WGD, also coined ‘ohnologs’ after Susumu Ohno, have been shown to be typically associated with development, signaling and gene regulation. Ohnologs, which amount to about 20 to 35% of genes in the human genome, have also been shown to be prone to dominant deleterious mutations and frequently implicated in cancer and genetic diseases. Hence, identifying ohnologs is central to better understand the evolution of vertebrates and their susceptibility to genetic diseases. Early computational analyses to identify vertebrate ohnologs relied on content-based synteny comparisons between the human genome and a single invertebrate outgroup genome or within the human genome itself. These approaches are thus limited by lineage specific rearrangements in individual genomes. We report, in this study, the identification of vertebrate ohnologs based on the quantitative assessment and integration of synteny conservation between six amniote vertebrates and six invertebrate outgroups. Such a synteny comparison across multiple genomes is shown to enhance the statistical power of ohnolog identification in vertebrates compared to earlier approaches, by overcoming lineage specific genome rearrangements. Ohnolog gene families can be browsed and downloaded for three statistical confidence levels or recompiled for specific, user-defined, significance criteria at http://ohnologs.curie.fr/. In the light of the importance of WGD on the genetic makeup of vertebrates, our analysis provides a useful resource for researchers interested in gaining further insights on vertebrate evolution and genetic diseases. PMID:26181593
2011-01-01
Background Alfalfa, [Medicago sativa (L.) sativa], a widely-grown perennial forage has potential for development as a cellulosic ethanol feedstock. However, the genomics of alfalfa, a non-model species, is still in its infancy. The recent advent of RNA-Seq, a massively parallel sequencing method for transcriptome analysis, provides an opportunity to expand the identification of alfalfa genes and polymorphisms, and conduct in-depth transcript profiling. Results Cell walls in stems of alfalfa genotype 708 have higher cellulose and lower lignin concentrations compared to cell walls in stems of genotype 773. Using the Illumina GA-II platform, a total of 198,861,304 expressed sequence tags (ESTs, 76 bp in length) were generated from cDNA libraries derived from elongating stem (ES) and post-elongation stem (PES) internodes of 708 and 773. In addition, 341,984 ESTs were generated from ES and PES internodes of genotype 773 using the GS FLX Titanium platform. The first alfalfa (Medicago sativa) gene index (MSGI 1.0) was assembled using the Sanger ESTs available from GenBank, the GS FLX Titanium EST sequences, and the de novo assembled Illumina sequences. MSGI 1.0 contains 124,025 unique sequences including 22,729 tentative consensus sequences (TCs), 22,315 singletons and 78,981 pseudo-singletons. We identified a total of 1,294 simple sequence repeats (SSR) among the sequences in MSGI 1.0. In addition, a total of 10,826 single nucleotide polymorphisms (SNPs) were predicted between the two genotypes. Out of 55 SNPs randomly selected for experimental validation, 47 (85%) were polymorphic between the two genotypes. We also identified numerous allelic variations within each genotype. Digital gene expression analysis identified numerous candidate genes that may play a role in stem development as well as candidate genes that may contribute to the differences in cell wall composition in stems of the two genotypes.
Conclusions Our results demonstrate that RNA-Seq can be successfully used for gene identification, polymorphism detection and transcript profiling in alfalfa, a non-model, allogamous, autotetraploid species. The alfalfa gene index assembled in this study, and the SNPs, SSRs and candidate genes identified can be used to improve alfalfa as a forage crop and cellulosic feedstock. PMID:21504589
Flynn, Christopher M; Schmidt-Dannert, Claudia
2018-06-01
The wood-rotting mushroom Stereum hirsutum is a known producer of a large number of namesake hirsutenoids, many with important bioactivities. Hirsutenoids form a structurally diverse and distinct class of sesquiterpenoids. No genes involved in hirsutenoid biosynthesis have yet been identified or their enzymes characterized. Here, we describe the cloning and functional characterization of a hirsutene synthase as an unexpected fusion protein of a sesquiterpene synthase (STS) with a C-terminal 3-hydroxy-3-methylglutaryl-coenzyme A (3-hydroxy-3-methylglutaryl-CoA) synthase (HMGS) domain. Both the full-length fusion protein and truncated STS domain are highly product-specific 1,11-cyclizing STS enzymes with kinetic properties typical of STSs. Complementation studies in Saccharomyces cerevisiae confirmed that the HMGS domain is also functional in vivo. Phylogenetic analysis shows that the hirsutene synthase domain does not form a clade with other previously characterized sesquiterpene synthases from Basidiomycota. Comparative gene structure analysis of this hirsutene synthase with characterized fungal enzymes reveals a significantly higher intron density, suggesting that this enzyme may have been acquired by horizontal gene transfer. In contrast, the HMGS domain is clearly related to other fungal homologs. This STS-HMGS fusion protein is part of a biosynthetic gene cluster that includes P450s and oxidases that are expressed and could be cloned from cDNA. Finally, this unusual fusion of a terpene synthase to an HMGS domain, which is not generally recognized as a key regulatory enzyme of the mevalonate isoprenoid precursor pathway, led to the identification of additional HMGS duplications in many fungal genomes, including the localization of HMGSs in other predicted sesquiterpenoid biosynthetic gene clusters. IMPORTANCE Hirsutenoids represent a structurally diverse class of bioactive sesquiterpenoids isolated from fungi.
Identification of their biosynthetic pathways will provide access to this chemodiversity for the discovery and synthesis of molecules with new bioactivities. The identification and successful cloning of the previously elusive hirsutene synthase from S. hirsutum provide important insights and strategies for biosynthetic gene discovery in Basidiomycota. The finding of a terpene synthase-HMGS fusion, the discovery of other sesquiterpenoid biosynthetic gene clusters with dedicated HMGS genes, and HMGS gene duplications in fungal genomes give new importance to the role of HMGS as a key regulatory enzyme in isoprenoid and sterol biosynthesis that should be exploited for metabolic engineering. Copyright © 2018 American Society for Microbiology.
Genome-Wide Analysis of Syntenic Gene Deletion in the Grasses
Schnable, James C.; Freeling, Michael; Lyons, Eric
2012-01-01
The grasses, Poaceae, are one of the largest and most successful angiosperm families. Like many radiations of flowering plants, the divergence of the major grass lineages was preceded by a whole-genome duplication (WGD), although these events are not rare for flowering plants. By combining identification of syntenic gene blocks with measures of gene pair divergence and different frequencies of ancient gene loss, we have separated the two subgenomes present in modern grasses. Reciprocal loss of duplicated genes or genomic regions has been hypothesized to reproductively isolate populations and thus to drive speciation. However, in contrast to previous studies in yeast and teleost fishes, we found very little evidence of reciprocal loss of homeologous genes between the grasses, suggesting that post-WGD gene loss may not be the cause of the grass radiation. The sets of homeologous and orthologous genes and predicted locations of deleted genes identified in this study, as well as links to the CoGe comparative genomics web platform for analyzing pan-grass syntenic regions, are provided along with this paper as a resource for the grass genetics community. PMID:22275519
Conditioned taste aversion dependent regulation of amygdala gene expression.
Panguluri, Siva K; Kuwabara, Nobuyuki; Kang, Yi; Cooper, Nigel; Lundy, Robert F
2012-02-28
The present experiments investigated gene expression in the amygdala following contingent taste/LiCl treatment that supports development of conditioned taste aversion (CTA). The use of whole genome chips and stringent data set filtering led to the identification of 168 genes regulated by CTA compared to non-contingent LiCl treatment that does not support CTA learning. Seventy-six of these genes were eligible for network analysis. Such analysis identified "behavior" as the top biological function, which was represented by 15 of the 76 genes. These genes included several neuropeptides, G protein-coupled receptors, ion channels, kinases, and phosphatases. Subsequent qRT-PCR analyses confirmed changes in mRNA expression for 5 of 7 selected genes. We were able to demonstrate directionally consistent changes in protein level for 3 of these genes; insulin 1, oxytocin, and major histocompatibility complex class I-C. Behavioral analyses demonstrated that blockade of central insulin receptors produced a weaker CTA that was less resistant to extinction. Together, these results support the notion that we have identified downstream genes in the amygdala that contribute to CTA learning. Copyright © 2011 Elsevier Inc. All rights reserved.
Johnston, Daniel S; Jelinsky, Scott A; Zhi, Yu; Finger, Joshua N; Kopf, Gregory S; Wright, William W
2007-12-01
In an effort to identify novel targets for the development of nonhormonal male contraceptives, genome-wide transcriptional profiling of the rat testis was performed. Specifically, enzymatically purified spermatogonia plus early spermatocytes, pachytene spermatocytes, round spermatids, and Sertoli cells were analyzed along with microdissected rat seminiferous tubules at stages I, II-III, IV-V, VI, VIIa,b, VIIc,d, VIII, IX-XI, XII, XIII-XIV of the cycle of the seminiferous epithelium using RAE 230_2.0 microarrays. The combined analysis of these studies identified 16,971 expressed probe sets on the array. How these expression data, combined with additional bioinformatic data analysis and quantitative reverse transcriptase polymerase chain reaction (qRT-PCR) analysis, led to the identification of 58 genes that have 1000-fold higher expression transcriptionally in the testis when compared to over 20 other nonreproductive tissues is described. The products of these genes may play important roles in testicular and/or sperm function, and further investigation of their utility as nonhormonal contraceptive targets is warranted. Moreover, these microarray data have been used to expedite the identification of a mutation in the RIKEN cDNA 2410004F06 gene as likely being responsible for spermatogenic failure in a line of infertile mice generated by N-ethyl-N-nitrosourea (ENU) mutagenesis. The microarray data and the qRT-PCR data described are available in the Mammalian Reproductive Genetics database (http://mrg.genetics.washington.edu/).
Mesquita, Rosilene Oliveira; de Almeida Soares, Eduardo; de Barros, Everaldo Gonçalves; Loureiro, Marcelo Ehlers
2012-01-01
The most critical step in any proteomic study is protein extraction and sample preparation. Better solubilization increases the separation and resolution of gels, allowing identification of a higher number of proteins and more accurate quantitation of differences in gene expression. Despite the existence of published results for the optimization of proteomic analyses of soybean seeds, no comparable data are available for proteomic studies of soybean leaf tissue. In this work we have tested the effects of modifications of a TCA-acetone method on the resolution of 2-DE gels of leaves and roots of soybean. Better focusing was obtained when both mercaptoethanol and dithiothreitol were used in the extraction buffer simultaneously. Increasing the number of washes of TCA-precipitated protein with acetone, using a final wash with 80% ethanol and using sonication to resuspend the pellet increased the number of detected proteins as well as the resolution of the 2-DE gels. Using this approach we have constructed a soybean protein map. The major group of identified proteins corresponded to genes of unknown function. The second and third most abundant groups of proteins were composed of photosynthesis and metabolism related genes. The resulting protocol improved protein solubility and gel resolution, allowing the identification of 122 soybean leaf proteins, 72 of which were not detected in other published soybean leaf 2-DE gel datasets, including a transcription factor and several signaling proteins. PMID:22802721
Król, Jaroslaw; Bania, Jacek; Florek, Magdalena; Pliszczak-Król, Aleksandra; Staroniewicz, Zdzislaw
2011-05-01
A set of polymerase chain reaction (PCR) assays for identification of the most important Pasteurellaceae species encountered in cats and dogs were developed. Primers for Pasteurella multocida were designed to detect a fragment of the kmt, a gene encoding the outer-membrane protein. Primers specific to Pasteurella canis, Pasteurella dagmatis, and Pasteurella stomatis were based on the manganese-dependent superoxide dismutase gene (sodA) and those specific to [Haemophilus] haemoglobinophilus on species-specific sequences of the 16S ribosomal RNA gene. All the primers were tested on respective reference and control strains and applied to the identification of 47 canine and feline field isolates of Pasteurellaceae. The PCR assays were shown to be species specific, providing a valuable supplement to phenotypic identification of species within this group of bacteria. © 2011 The Author(s)
Ríos, Gabino; Naranjo, Miguel A; Iglesias, Domingo J; Ruiz-Rivero, Omar; Geraud, Marion; Usach, Antonio; Talón, Manuel
2008-01-01
Background Many fruit-tree species, including relevant Citrus spp varieties, exhibit a reproductive biology that impairs breeding and strongly constrains genetic improvements. In citrus, juvenility increases the generation time while sexual sterility, inbreeding depression and self-incompatibility prevent the production of homozygous cultivars. Genomic technology may provide citrus researchers with a new set of tools to address these various restrictions. In this work, we report a valuable genomics-based protocol for the structural analysis of deletion mutations on a heterozygous background. Results Two independent fast neutron mutants of self-incompatible clementine (Citrus clementina Hort. Ex Tan. cv. Clemenules) were the subject of the study. Both mutants, named 39B3 and 39E7, were expected to carry DNA deletions in hemizygous dosage. Array-based Comparative Genomic Hybridization (array-CGH) using a Citrus cDNA microarray allowed the identification of underrepresented genes in these two mutants. Subsequent comparison of citrus deleted genes with annotated plant genomes, especially poplar, made it possible to predict the presence of a large deletion in 39B3 of about 700 kb and at least two deletions of approximately 100 and 500 kb in 39E7. The deletion in 39B3 was further characterized by PCR on available Citrus BACs, which helped us to build a partial physical map of the deletion. Among the deleted genes, a ClpC-like gene coding for a putative subunit of a multifunctional chloroplastic protease involved in the regulation of chlorophyll b synthesis was directly related to the mutated phenotype, since the mutant showed a reduced chlorophyll a/b ratio in green tissues. Conclusion In this work, we report the use of array-CGH for the successful identification of genes included in a hemizygous deletion induced by fast neutron irradiation on Citrus clementina.
The study of gene content and order within the 39B3 deletion also led to the unexpected conclusion that microsynteny and local gene colinearity in this species were higher with Populus trichocarpa than with the phylogenetically closer Arabidopsis thaliana. This work corroborates the potential of Citrus genomic resources to assist mutagenesis-based approaches for functional genetics, structural studies and comparative genomics, and hence to facilitate citrus variety improvement. PMID:18691431
Mallakin, Ali; Sugiyama, Takayuki; Kai, Fumitake; Taneja, Pankaj; Kendig, Robert D.; Frazier, Donna P.; Maglic, Dejan; Matise, Lauren A.; Willingham, Mark C.; Inoue, Kazushi
2009-01-01
Dmp1 (Dmtf1) encodes a Myb-like transcription factor implicated in tumor suppression through direct activation of the Arf-p53 pathway. The human DMP1 gene is frequently deleted in non-small cell lung cancers, especially those that retain wild-type INK4a/ARF and/or p53. To identify novel genes that are regulated by Dmp1, transcriptional profiles of lung tissue from Dmp1-null and wild-type mice were generated using the GeneChip Microarray. Comparative analysis of gene expression changes between the two groups resulted in identification of numerous genes that may be regulated by Dmp1. Notably, amphiregulin (Areg), thrombospondin-1 (Tsp-1), JunB, Egr1, adrenomedullin (Adm), Bcl-3 and methyl-CpG binding domain protein 1 (Mbd1) were downregulated in the lungs from Dmp1-null mice while Gas1 and Ect2 genes were upregulated. These target genes were chosen for further analyses since they are involved in cell proliferation, transcription, angiogenesis/metastasis, apoptosis, or DNA methylation, and thus could account for the tumor suppressor phenotype of Dmp1. Dmp1 directly bound to the genomic loci of Areg, Tsp-1, JunB and Egr1. Significant upregulation or downregulation of the novel Dmp1 target genes was observed upon transient expression of Dmp1 in alveolar epithelial cells, an effect which was nullified by the inhibition of de novo mRNA synthesis. Interestingly, these genes and their protein products were significantly downregulated or upregulated in the lungs from Dmp1-heterozygous mice as well. Identification of novel Dmp1 target genes not only provides insights into the effects of Dmp1 on global gene expression, but also sheds light on the mechanism of haploid insufficiency of Dmp1 in tumor suppression. PMID:19816943
A DNA barcode for land plants.
2009-08-04
DNA barcoding involves sequencing a standard region of DNA as a tool for species identification. However, there has been no agreement on which region(s) should be used for barcoding land plants. To provide a community recommendation on a standard plant barcode, we have compared the performance of 7 leading candidate plastid DNA regions (atpF-atpH spacer, matK gene, rbcL gene, rpoB gene, rpoC1 gene, psbK-psbI spacer, and trnH-psbA spacer). Based on assessments of recoverability, sequence quality, and levels of species discrimination, we recommend the 2-locus combination of rbcL+matK as the plant barcode. This core 2-locus barcode will provide a universal framework for the routine use of DNA sequence data to identify specimens and contribute toward the discovery of overlooked species of land plants.
Hollingsworth, Peter M.; Forrest, Laura L.; Spouge, John L.; Hajibabaei, Mehrdad; Ratnasingham, Sujeevan; van der Bank, Michelle; Chase, Mark W.; Cowan, Robyn S.; Erickson, David L.; Fazekas, Aron J.; Graham, Sean W.; James, Karen E.; Kim, Ki-Joong; Kress, W. John; Schneider, Harald; van AlphenStahl, Jonathan; Barrett, Spencer C.H.; van den Berg, Cassio; Bogarin, Diego; Burgess, Kevin S.; Cameron, Kenneth M.; Carine, Mark; Chacón, Juliana; Clark, Alexandra; Clarkson, James J.; Conrad, Ferozah; Devey, Dion S.; Ford, Caroline S.; Hedderson, Terry A.J.; Hollingsworth, Michelle L.; Husband, Brian C.; Kelly, Laura J.; Kesanakurti, Prasad R.; Kim, Jung Sung; Kim, Young-Dong; Lahaye, Renaud; Lee, Hae-Lim; Long, David G.; Madriñán, Santiago; Maurin, Olivier; Meusnier, Isabelle; Newmaster, Steven G.; Park, Chong-Wook; Percy, Diana M.; Petersen, Gitte; Richardson, James E.; Salazar, Gerardo A.; Savolainen, Vincent; Seberg, Ole; Wilkinson, Michael J.; Yi, Dong-Keun; Little, Damon P.
2009-01-01
DNA barcoding involves sequencing a standard region of DNA as a tool for species identification. However, there has been no agreement on which region(s) should be used for barcoding land plants. To provide a community recommendation on a standard plant barcode, we have compared the performance of 7 leading candidate plastid DNA regions (atpF–atpH spacer, matK gene, rbcL gene, rpoB gene, rpoC1 gene, psbK–psbI spacer, and trnH–psbA spacer). Based on assessments of recoverability, sequence quality, and levels of species discrimination, we recommend the 2-locus combination of rbcL+matK as the plant barcode. This core 2-locus barcode will provide a universal framework for the routine use of DNA sequence data to identify specimens and contribute toward the discovery of overlooked species of land plants. PMID:19666622
Guo, Chun Yu; Yin, Hui Jun; Jiang, Yue Rong; Xue, Mei; Zhang, Lu; Shi, Da Zhuo
2008-06-18
To construct the differential gene expression profile of ischemic myocardial tissue resulting from acute myocardial infarction (AMI), and to determine the biological functions of target genes. An AMI model was generated by ligation of the left anterior descending coronary artery in Wistar rats. Total RNA was extracted from the normal and the ischemic heart tissues under the ligation point 7 days after the operation. Differential gene expression profiles of the two samples were constructed using Long Serial Analysis of Gene Expression (LongSAGE). Real-time fluorescence quantitative PCR was used to verify the gene expression profile and to identify the expression of 2 functional genes. The activities of the enzymes encoded by the functional genes were determined by histochemistry. A total of 15,966 tags were screened from the normal and the ischemic LongSAGE maps. The similarities of the sequences were compared using the BLAST algorithm at NCBI, and 7,665 novel tags were found. In the ischemic tissue, 142 genes were significantly changed compared with those in the normal tissue (P<0.05). These differentially expressed genes represented proteins that might play important roles in the pathways of oxidation and phosphorylation, ATP synthesis and glycolysis. Some of the genes identified by LongSAGE were confirmed using real-time fluorescence quantitative PCR. Two genes related to energy metabolism, COX5a and ATP5e, were screened and quantified. Expression of the two functional genes was down-regulated at the mRNA level, and the activities of the corresponding enzymes were decreased compared with those in the normal tissue. AMI causes a series of changes in gene expression, in which the abnormal expression of genes related to energy metabolism could be one of the molecular mechanisms of AMI. Intervention in the expression of COX5a and ATP5e may be a new target for AMI therapy.
Molecular phylogeny of some avian species using Cytochrome b gene sequence analysis
Awad, A; Khalil, S. R; Abd-Elhakim, Y. M
2015-01-01
Reliable identification and differentiation of avian species is a vital step in conservation, taxonomic, forensic, legal and other ornithological interventions. Therefore, this study applied a molecular approach to identify some avian species, i.e. chicken (Gallus gallus), Muscovy duck (Cairina moschata), Japanese quail (Coturnix japonica), laughing dove (Streptopelia senegalensis), and rock pigeon (Columba livia). Genomic DNA was extracted from blood samples and a partial sequence of the mitochondrial cytochrome b gene (358 bp) was amplified and sequenced using universal primers. Sequence alignment and phylogenetic analyses were performed with the CLC Main Workbench program. The five sequences obtained were deposited in GenBank and compared with those previously registered in GenBank. The similarity percentage was 88.60% between Gallus gallus and Coturnix japonica and 80.46% between Gallus gallus and Columba livia. The percentage of identity between the studied species and GenBank species ranged from 77.20% (Columba oenas and Anas platyrhynchos) to 100% (Gallus gallus and Gallus sonneratii, Coturnix coturnix and Coturnix japonica, Meleagris gallopavo and Columba livia). Amplification of the partial sequence of the mitochondrial cytochrome b gene proved to be practical for unambiguous identification of avian species. PMID:27175180
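The similarity and identity percentages reported in the abstract above are pairwise comparisons of aligned sequences. The abstract does not say exactly how the CLC software computes them, so the following Python sketch assumes one common definition: matching positions divided by the number of aligned columns in which neither sequence has a gap. The function name and the toy sequences are hypothetical.

```python
def percent_identity(seq_a, seq_b, gap="-"):
    """Percent identity between two already-aligned sequences of equal length.

    Columns where either sequence has a gap are skipped (an assumption; other
    tools divide by the full alignment length or the shorter sequence length).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == gap or b == gap:
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy 4-column alignment: three of four positions match.
print(percent_identity("ACGT", "ACGA"))
```

With a different denominator convention the same pair of sequences can give a slightly different percentage, which is worth keeping in mind when comparing values across tools.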
NASA Astrophysics Data System (ADS)
Chen, Ye; Wolanyk, Nathaniel; Ilker, Tunc; Gao, Shouguo; Wang, Xujing
Methods based on bifurcation theory have demonstrated their potential in driving network identification for complex human diseases, including the work by Chen, et al. Recently, bifurcation theory has been successfully applied to model cellular differentiation. However, one often faces a technical challenge in driving network prediction there: a time-course cellular differentiation study often contains only one sample at each time point, while driving network prediction typically requires multiple samples at each time point to infer the variation and interaction structures of candidate genes for the driving network. In this study, we investigate several methods to identify both the critical time point and the driving network through examination of how each time point affects the autocorrelation and phase locking. We apply these methods to a high-throughput sequencing (RNA-Seq) dataset of 42 subsets of thymocytes and mature peripheral T cells at multiple time points during their differentiation (GSE48138 from GEO). We compare the predicted driving genes with known transcription regulators of cellular differentiation. We will discuss the advantages and limitations of our proposed methods, as well as potential further improvements of our methods.
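The abstract above does not give the authors' formulas, so the following Python sketch only illustrates one plausible reading of "how each time point affects the autocorrelation": compute the lag-1 autocorrelation of a single gene's time series, then recompute it with each time point left out and record the change. Both function names and the leave-one-out scheme are assumptions, not the published method, and dropping a point makes the spacing uneven, which this sketch ignores.

```python
def lag1_autocorrelation(values):
    """Lag-1 autocorrelation of a series with one sample per time point."""
    n = len(values)
    if n < 3:
        raise ValueError("need at least three time points")
    mean = sum(values) / n
    denom = sum((v - mean) ** 2 for v in values)
    if denom == 0:
        return 0.0  # constant series: treat autocorrelation as zero
    num = sum((values[i] - mean) * (values[i + 1] - mean) for i in range(n - 1))
    return num / denom


def leave_one_out_influence(values):
    """Change in lag-1 autocorrelation when each time point is removed in turn.

    Requires at least four time points so that three remain after removal.
    """
    full = lag1_autocorrelation(values)
    return [
        lag1_autocorrelation(values[:i] + values[i + 1:]) - full
        for i in range(len(values))
    ]


expression = [1.0, 2.0, 3.0, 4.0]  # hypothetical expression values over time
print(lag1_autocorrelation(expression))
print(leave_one_out_influence(expression))
```

A time point whose removal changes the autocorrelation most would be a candidate critical point under this reading; the phase-locking analysis mentioned in the abstract is not covered here.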
Espada, Margarida; Silva, Ana Cláudia; Eves van den Akker, Sebastian; Cock, Peter J A; Mota, Manuel; Jones, John T
2016-02-01
The migratory endoparasitic nematode Bursaphelenchus xylophilus, which is the causal agent of pine wilt disease, has phytophagous and mycetophagous phases during its life cycle. This highly unusual feature distinguishes it from other plant-parasitic nematodes and requires profound changes in biology between modes. During the phytophagous stage, the nematode migrates within pine trees, feeding on the contents of parenchymal cells. Like other plant pathogens, B. xylophilus secretes effectors from pharyngeal gland cells into the host during infection. We provide the first description of changes in the morphology of these gland cells between juvenile and adult life stages. Using a comparative transcriptomics approach and an effector identification pipeline, we identify numerous novel parasitism genes which may be important for the mediation of interactions of B. xylophilus with its host. In-depth characterization of all parasitism genes using in situ hybridization reveals two major categories of detoxification proteins, those specifically expressed in either the pharyngeal gland cells or the digestive system. These data suggest that B. xylophilus incorporates effectors in a multilayer detoxification strategy in order to protect itself from host defence responses during phytophagy. © 2015 BSPP AND JOHN WILEY & SONS LTD.
ESEA: Discovering the Dysregulated Pathways based on Edge Set Enrichment Analysis
Han, Junwei; Shi, Xinrui; Zhang, Yunpeng; Xu, Yanjun; Jiang, Ying; Zhang, Chunlong; Feng, Li; Yang, Haixiu; Shang, Desi; Sun, Zeguo; Su, Fei; Li, Chunquan; Li, Xia
2015-01-01
Pathway analyses are playing an increasingly important role in understanding biological mechanism, cellular function and disease states. Current pathway-identification methods generally focus on only the changes of gene expression levels; however, the biological relationships among genes are also the fundamental components of pathways, and the dysregulated relationships may also alter the pathway activities. We propose a powerful computational method, Edge Set Enrichment Analysis (ESEA), for the identification of dysregulated pathways. This provides a novel way of pathway analysis by investigating the changes of biological relationships of pathways in the context of gene expression data. Simulation studies illustrate the power and performance of ESEA under various simulated conditions. Using real datasets from p53 mutation, Type 2 diabetes and lung cancer, we validate effectiveness of ESEA in identifying dysregulated pathways. We further compare our results with five other pathway enrichment analysis methods. With these analyses, we show that ESEA is able to help uncover dysregulated biological pathways underlying complex traits and human diseases via specific use of the dysregulated biological relationships. We develop a freely available R-based tool of ESEA. Currently, ESEA can support pathway analysis of the seven public databases (KEGG; Reactome; Biocarta; NCI; SPIKE; HumanCyc; Panther). PMID:26267116
Pao, Sheng-Ying; Lin, Win-Li; Hwang, Ming-Jing
2006-01-01
Background Screening for differentially expressed genes on the genomic scale and comparative analysis of the expression profiles of orthologous genes between species to study gene function and regulation are becoming increasingly feasible. Expressed sequence tags (ESTs) are an excellent source of data for such studies using bioinformatic approaches because of the rich libraries and tremendous amount of data now available in the public domain. However, any large-scale EST-based bioinformatics analysis must deal with the heterogeneous, and often ambiguous, tissue and organ terms used to describe EST libraries. Results To deal with the issue of tissue source, in this work, we carefully screened and organized more than 8 million human and mouse ESTs into 157 human and 108 mouse tissue/organ categories, to which we applied an established statistical test using different thresholds of the p value to identify genes differentially expressed in different tissues. Further analysis of the tissue distribution and level of expression of human and mouse orthologous genes showed that tissue-specific orthologs tended to have more similar expression patterns than those lacking significant tissue specificity. On the other hand, a number of orthologs were found to have significant disparity in their expression profiles, hinting at novel functions, divergent regulation, or new ortholog relationships. Conclusion Comprehensive statistics on the tissue-specific expression of human and mouse genes were obtained in this very large-scale, EST-based analysis. These statistical results have been organized into a database, freely accessible at our website, for easy searching of human and mouse tissue-specific genes and for investigating gene expression profiles in the context of comparative genomics.
Comparative analysis showed that, although highly tissue-specific genes tend to exhibit similar expression profiles in human and mouse, there are significant exceptions, indicating that orthologous genes, while sharing basic genomic properties, could result in distinct phenotypes. PMID:16626500
2014-01-01
Background Diarrheagenic Escherichia coli (DEC), including Enterotoxigenic E. coli (ETEC), Enteroaggregative E. coli (EAEC), Enteropathogenic E. coli (EPEC), Enterohemolysin E. coli (EHEC) and Enteroinvasive E. coli (EIEC), cause diarrhea or hemolytic uremic syndromes among infants and travelers around the world. A rapid, reliable and repeatable method for identifying DEC is urgently needed to provide a reference for responding to diarrheal disease outbreaks and for treating patients with DEC-associated diarrhea. Methods In this study, specific primers and modified molecular beacon probes for nine specific virulence genes, with a homo tail sequence added to their 5′ ends, were designed; and a two-tube modified molecular beacon based multiplex real-time PCR (rtPCR) assay for the identification of five Escherichia coli pathotypes, including ETEC, EAEC, EPEC, EHEC and EIEC, was developed and optimized. A total of 102 bacterial strains, including 52 reference bacterial strains and 50 clinical strains, were tested to confirm whether the selected target genes were specific. Then the detection limits of the assay were tested. Lastly, the assay was applied to 11860 clinical samples to evaluate the specificity and sensitivity of the developed assay compared with conventional PCR. Results The target genes were 100% specific as assessed on 102 bacterial strains, since no cross-reactions were observed. The detection limits ranged from 88 CFU/mL (EHEC) to 880 CFU/mL (EPEC). Compared with conventional PCR, the specificity and sensitivity of the multiplex rtPCR were 100% and over 99%, respectively. The coefficient of variation (CV) for each target gene ranged from 0.45% to 1.53%. The 171 positive clinical samples were mostly identified as ETEC (n = 111, 64.9%) and EPEC (n = 38, 22.2%), which were the dominant pathotypes of DEC strains. 
Conclusion The developed multiplex rtPCR assay for the identification of DEC was highly sensitive and specific and could be applied to the rapid identification of DEC in clinical and public health laboratories. PMID:25023669
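The performance figures quoted in this abstract (sensitivity, specificity, coefficient of variation) follow from standard definitions. The sketch below shows those definitions only; the confusion-matrix counts and Ct replicates are hypothetical placeholders, since the abstract does not report the underlying numbers.

```python
from statistics import mean, stdev


def sensitivity(true_pos, false_neg):
    """Fraction of reference-positive samples the assay also calls positive."""
    return true_pos / (true_pos + false_neg)


def specificity(true_neg, false_pos):
    """Fraction of reference-negative samples the assay also calls negative."""
    return true_neg / (true_neg + false_pos)


def coefficient_of_variation(values):
    """CV in percent: sample standard deviation divided by the mean."""
    return 100.0 * stdev(values) / mean(values)


# Hypothetical counts versus conventional PCR (not taken from the study).
tp, fn, tn, fp = 99, 1, 900, 0
# Hypothetical replicate Ct values for one target gene.
ct_replicates = [24.1, 24.3, 23.9, 24.2]

print(f"sensitivity = {sensitivity(tp, fn):.3f}")
print(f"specificity = {specificity(tn, fp):.3f}")
print(f"CV = {coefficient_of_variation(ct_replicates):.2f}%")
```

Whether the authors used the sample or the population standard deviation for the CV is not stated; the sample form is assumed here.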
Costa-Alcalde, José Javier; Barbeito-Castiñeiras, Gema; González-Alba, José María; Aguilera, Antonio; Galán, Juan Carlos; Pérez-Del-Molino, María Luisa
2018-06-02
The American Thoracic Society and the Infectious Diseases Society of America recommend that clinically significant non-tuberculous mycobacteria (NTM) should be identified to the species level in order to determine their clinical significance. The aim of this study was to evaluate identification of rapidly growing NTM (RGM) isolated from clinical samples by using MALDI-TOF MS and a commercial molecular system. The results were compared with identification using a reference method. We included 46 clinical isolates of RGM and identified them using the commercial molecular system GenoType® CM/AS (Hain Lifescience, Germany), MALDI-TOF MS (Bruker) and, as reference method, partial rpoβ gene sequencing followed by BLAST and phylogenetic analysis with the 1093 sequences available in GenBank. The degree of agreement between GenoType® and MALDI-TOF MS and the reference method, partial rpoβ sequencing, was 27/43 (62.8%) and 38/43 cases (88.3%) respectively. For all the samples correctly classified by GenoType®, we obtained the same result with MALDI-TOF MS (27/27). However, MALDI-TOF MS also correctly identified 68.75% (11/16) of the samples that GenoType® had misclassified (p=0.005). MALDI-TOF MS classified significantly better than GenoType®. When a MALDI-TOF MS score >1.85 was achieved, MALDI-TOF MS and partial rpoβ gene sequencing were equivalent. GenoType® was not able to distinguish between species belonging to the M. fortuitum complex. MALDI-TOF MS methodology is simple, rapid and associated with lower consumable costs than GenoType®. The partial rpoβ sequencing methods with BLAST and phylogenetic analysis were not able to identify some RGM unequivocally. Therefore, sequencing of additional regions would be indicated in these cases. Copyright © 2018 Elsevier España, S.L.U. and Sociedad Española de Enfermedades Infecciosas y Microbiología Clínica. All rights reserved.
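The agreement figures in this abstract can be re-derived from the reported counts. The short check below uses only numbers stated in the abstract; the function and variable names are illustrative. Note that 38/43 rounds to 88.37%, so the reported 88.3% appears to be truncated rather than rounded.

```python
def percent(numerator, denominator):
    """Percentage rounded to two decimals."""
    return round(100 * numerator / denominator, 2)


total_compared = 43      # isolates compared against the reference method
genotype_correct = 27    # GenoType agreed with the reference
maldi_correct = 38       # MALDI-TOF MS agreed with the reference
genotype_wrong = total_compared - genotype_correct   # 16 isolates
rescued_by_maldi = 11    # GenoType misclassified, MALDI-TOF MS correct

# MALDI-TOF MS matched every GenoType-correct isolate (27/27), so its total
# should equal those 27 plus the 11 it additionally identified.
assert genotype_correct + rescued_by_maldi == maldi_correct

print(percent(genotype_correct, total_compared))   # 62.79
print(percent(maldi_correct, total_compared))      # 88.37
print(percent(rescued_by_maldi, genotype_wrong))   # 68.75
```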
Sharma, Akanksha; Sharma, Niharika; Bhalla, Prem; Singh, Mohan
2017-01-01
Comparative genomics has facilitated the mining of biological information from a genome sequence, through the detection of similarities and differences with genomes of closely or more distantly related species. By using such comparative approaches, knowledge can be transferred from model to non-model organisms and insights can be gained into the structural and evolutionary patterns of specific genes. In the absence of sequenced genomes for allergenic grasses, this study was aimed at understanding the structure, organisation and expression profiles of grass pollen allergens using the genomic data from Brachypodium distachyon, as it is phylogenetically related to the allergenic grasses. Combining genomic data with the anther RNA-Seq dataset revealed 24 pollen allergen genes belonging to eight allergen groups mapping on the five chromosomes in B. distachyon. High levels of anther-specific expression were observed for the 24 identified putative allergen-encoding genes in Brachypodium. The genomic evidence suggests that the gene encoding the group 5 allergen, the most potent trigger of hay fever and allergic asthma, originated as a pollen-specific orphan gene in a common grass ancestor of the Brachypodium and Triticeae clades. Gene structure analysis showed that the putative allergen-encoding genes in Brachypodium either lack introns or contain a reduced number of them. Promoter analysis of the identified Brachypodium genes revealed the presence of specific cis-regulatory sequences likely responsible for high anther/pollen-specific expression. With the identification of putative allergen-encoding genes in Brachypodium, this study has also described some important plant gene families (e.g. expansin superfamily, EF-Hand family, profilins etc.) for the first time in the model plant Brachypodium. 
Altogether, the present study provides new insights into structural characterization and evolution of pollen allergens and will further serve as a base for their functional characterization in related grass species.
Xu, Xing-Li; Cheng, Tian-Yin; Yang, Hu; Yan, Fen; Yang, Ya
2015-06-01
Saliva plays an important role in feeding and pathogen transmission, and the identification and analysis of tick salivary gland (SG) proteins is an active area of anti-tick research. Herein, we present the first description of the SG transcriptome of Haemaphysalis flava using next-generation sequencing (NGS). A total of over 143 million high-quality reads were assembled into 54,357 unigenes, of which 20,145 (37.06%) had significant similarities to proteins in the Swiss-Prot database. 13,513 annotated sequences were associated with GO terms. Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis showed that 14,280 unigenes were assigned to 279 KEGG pathways in total. Reads per kb per million reads (RPKM) analysis showed that there were 3035 down-regulated unigenes and 2260 up-regulated unigenes in the engorged ticks (ET) compared with the semi-engorged ticks (SET). Several important genes encoding secreted salivary proteins associated with blood feeding and ingestion, including cysteine, longipain, 4D8, calreticulin, metalloproteases, serine protease inhibitor, enolase, heat shock protein and AV422, were identified in SG. The qRT-PCR results confirmed that the expression patterns of these genes (except for the longipain gene) were consistent with the RNA-seq results. This de novo assembly of the SG transcriptome of H. flava not only provides more opportunities for screening and cloning functional genes, but also forms a solid basis for further insight into the changes of salivary proteins during blood-feeding. Copyright © 2015 Elsevier B.V. All rights reserved.
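The RPKM normalisation named in this abstract has a standard definition: read count scaled by transcript length in kilobases and by library size in millions of mapped reads. A minimal sketch follows; the example counts, length and library size are hypothetical and not taken from the study.

```python
def rpkm(read_count, transcript_length_bp, total_mapped_reads):
    """Reads per kilobase of transcript per million mapped reads."""
    return read_count * 1e9 / (transcript_length_bp * total_mapped_reads)


# Hypothetical unigene: 500 reads, 2,000 bp long, library of 10 million reads.
print(rpkm(500, 2_000, 10_000_000))  # 25.0
```

Comparing RPKM values between the engorged and semi-engorged libraries is what allows a unigene to be called up- or down-regulated; the thresholds the authors applied are not given in the abstract.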
Mootapally, Chandra Shekar; Nathani, Neelam M; Patel, Amrutlal K; Jakhesara, Subhash J; Joshi, Chaitanya G
2016-01-01
Phytases have been widely used as animal feed supplements to increase the availability of digestible phosphorus, especially in monogastric animals fed cereal grains. The present study describes the identification of a full-length phytase gene of Prevotella species present in Mehsani buffalo rumen. The gene, designated as RPHY1, consists of 1,251 bp and is expressed into protein with 417 amino acids. A homology search of the deduced amino acid sequence of the RPHY1 phytase gene in a nonredundant protein database showed that it shares 92% similarity with the histidine acid phosphatase domain. Subsequently, the RPHY1 gene was expressed using a pET32a expression vector in Escherichia coli BL21 and purified using a His60 Ni-NTA gravity column. The mass of the purified RPHY1 was estimated to be approximately 63 kDa by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE). The optimal RPHY1 enzyme activity was observed at 55°C (pH 5) and exhibited good stability at 5°C and within the acidic pH range. Significant inhibition of RPHY1 activity was observed for Mg2+ and K+ metal ions, while Ca2+, Mn2+, and Na+ slightly inhibited enzyme activity. The RPHY1 phytase was susceptible to SDS, and it was highly stimulated in the presence of EDTA. Overall, the observed comparatively high enzyme activity levels and characteristics of the RPHY1 gene mined from rumen prove its promising candidature as a feed supplement enzyme in animal farming. © 2016 S. Karger AG, Basel.
Smita, Shuchi; Katiyar, Amit; Pandey, Dev Mani; Chinnusamy, Viswanathan; Archak, Sunil; Bansal, Kailash Chander
2013-01-01
Identification of genes that are coexpressed across various tissues and environmental stresses is biologically interesting, since they may play coordinated roles in similar biological processes. Genes with correlated expression patterns can be best identified by using coexpression network analysis of transcriptome data. In the present study, we analyzed the temporal-spatial coordination of gene expression in root, leaf and panicle of rice under drought stress and constructed a network using WGCNA and Cytoscape. A total of 2199 differentially expressed genes (DEGs) were identified in three or more tissues, of which 88 genes had a coordinated expression profile among all six tissues under drought stress. These 88 highly coordinated genes were further subjected to module identification in the coexpression network. Based on chief topological properties we identified 18 hub genes such as ABC transporter, ATP-binding protein, dehydrin, protein phosphatase 2C, LTPL153 - Protease inhibitor, phosphatidylethanolamine-binding protein, lactose permease-related, NADP-dependent malic enzyme, etc. Motif enrichment analysis showed the presence of ABRE cis-elements in the promoters of > 62% of the coordinately expressed genes. Our results suggest that drought stress mediated upregulated gene expression was coordinated through an ABA-dependent signaling pathway across tissues, at least for the subset of genes identified in this study, while down regulation appears to be regulated by tissue specific pathways in rice.
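The hub-gene idea in this abstract can be illustrated with a weighted coexpression network in the WGCNA style, where the adjacency between two genes is the absolute Pearson correlation raised to a soft-threshold power and a gene's connectivity is the sum of its adjacencies. The study used the WGCNA R package and Cytoscape; the Python below is only a conceptual sketch, and the expression values, gene names and the power `beta=6` are assumptions.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def connectivity(expression, beta=6):
    """Sum of soft-thresholded adjacencies |r|**beta for every gene."""
    genes = list(expression)
    k = {gene: 0.0 for gene in genes}
    for i, g in enumerate(genes):
        for h in genes[i + 1:]:
            adjacency = abs(pearson(expression[g], expression[h])) ** beta
            k[g] += adjacency
            k[h] += adjacency
    return k


# Hypothetical expression profiles across five samples.
expression = {
    "geneA": [1, 2, 3, 4, 5],
    "geneB": [2, 4, 6, 8, 10.5],
    "geneC": [5, 4, 3, 2, 1],
    "geneD": [3, 1, 4, 1, 5],
}

k = connectivity(expression)
for gene, value in sorted(k.items(), key=lambda item: -item[1]):
    print(gene, round(value, 3))
```

Genes with the highest connectivity are the hub candidates; here `geneD`, whose profile is only weakly correlated with the others, ends up with connectivity near zero.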
Gatta, V; Zizzari, V L; D'Amico, V; Salini, L; D'Aurora, M; Franchi, S; Antonucci, I; Sberna, M T; Gherlone, E; Stuppia, L; Tetè, S
2012-01-01
Dental pulp undergoes a number of changes as it passes from a healthy state to inflammation caused by deep decay. These changes are regulated by several genes that are differentially expressed in inflamed and healthy dental pulp, and knowledge of the processes underlying this differential expression is of great relevance to understanding the pathogenesis of the disease. In this study, the gene expression profiles of inflamed and healthy dental pulps were compared by microarray analysis, and the data obtained were analyzed with Ingenuity Pathway Analysis (IPA) software. This analysis made it possible to focus on a variety of genes typically expressed in inflamed tissues. The comparison showed increased expression of several genes in inflamed pulp, among which IL1β and CD40 were of particular interest. These results indicate that the gene expression profile of human dental pulp in different physiological and pathological conditions may become a useful tool for improving our knowledge of the processes regulating pulp inflammation.
A TALE of shrimps: Genome-wide survey of homeobox genes in 120 species from diverse crustacean taxa.
Chang, Wai Hoong; Lai, Alvina G
2018-01-01
The homeodomain-containing proteins are an important group of transcription factors found in most eukaryotes including animals, plants and fungi. Homeobox genes are responsible for a wide range of critical developmental and physiological processes, ranging from embryonic development and innate immune homeostasis to whole-body regeneration. With continued fascination with this key class of proteins among developmental and evolutionary biologists, multiple efforts have thus far focused on the identification and characterization of homeobox orthologs from key model organisms in attempts to infer their evolutionary origin and how this underpins the evolution of complex body plans. Despite their importance, the genetic complement of homeobox genes has yet to be described in one of the most valuable groups of animals representing economically important food crops. With crustacean aquaculture being a growing industry worldwide, it is clear that systematic and cross-species identification of crustacean homeobox orthologs is necessary in order to harness this genetic circuitry for the improvement of aquaculture sustainability. Using publicly available transcriptome data sets, we identified a total of 4183 putative homeobox genes from 120 crustacean species that include food crop species, such as lobsters, shrimps, crayfish and crabs. Additionally, we identified 717 homeobox orthologs from 6 other non-crustacean arthropods, which include the scorpion, deer tick, mosquitoes and centipede. This high confidence set of homeobox genes will now serve as a key resource to the broader community for future functional and comparative genomics studies.
Array CGH analysis of a cohort of Russian patients with intellectual disability.
Kashevarova, Anna A; Nazarenko, Lyudmila P; Skryabin, Nikolay A; Salyukova, Olga A; Chechetkina, Nataliya N; Tolmacheva, Ekaterina N; Sazhenova, Elena A; Magini, Pamela; Graziano, Claudio; Romeo, Giovanni; Kučinskas, Vaidutis; Lebedev, Igor N
2014-02-15
The use of array comparative genomic hybridization (array CGH) as a diagnostic tool in molecular genetics has facilitated the identification of many new microdeletion/microduplication syndromes (MMSs). Furthermore, this method has allowed for the identification of copy number variations (CNVs) whose pathogenic role has yet to be uncovered. Here, we report on our application of array CGH for the identification of pathogenic CNVs in 79 Russian children with intellectual disability (ID). Twenty-six pathogenic or likely pathogenic changes in copy number were detected in 22 patients (28%): 8 CNVs corresponded to known MMSs, and 17 were not associated with previously described syndromes. In this report, we describe our findings and comment on genes potentially associated with ID that are located within the CNV regions. Copyright © 2013 Elsevier B.V. All rights reserved.
Identification of Novel Gene Signatures in Atopic Dermatitis Complicated by Eczema Herpeticum
Bin, Lianghua; Edwards, Michael G.; Heiser, Ryan; Streib, Joanne; Richers, Brittany; Hall, Cliff; Leung, Donald Y.M.
2014-01-01
Background A subset of patients with atopic dermatitis (AD) is prone to disseminated herpes simplex virus (HSV) infection, i.e. eczema herpeticum (ADEH+). Biomarkers that identify ADEH+ are lacking. Objective To search for novel ADEH+ gene signatures in peripheral blood mononuclear cells (PBMCs). Methods An RNA-sequencing (RNA-seq) approach was applied to evaluate global transcriptional changes using PBMCs from ADEH+ and AD without a history of EH (ADEH−). Candidate genes were confirmed by qPCR or ELISA. RESULTS ADEH+ PBMCs had distinct changes to the transcriptome when compared to ADEH− PBMCs following HSV-1 stimulation: 792 genes were differentially expressed at a false discovery rate (FDR) < 0.05 (ANOVA), and 15 type I and type III interferon (IFN) genes were among the top 20 most down-regulated genes in ADEH+. We further validated that IFN-α and IL-29 mRNA and protein levels were significantly decreased in HSV-1 stimulated PBMCs from ADEH+ compared to ADEH− and normal. Ingenuity pathway analysis (IPA) demonstrated that the up-stream regulators of type I and type III IFNs, IRF3 and IRF7, were significantly inhibited in ADEH+ based on the down-regulation of their target genes. Furthermore, we found that gene expression of IRF3 and IRF7 was significantly decreased in HSV-1 stimulated PBMCs from ADEH+ subjects. CONCLUSIONS PBMCs from ADEH+ have a distinct immune response following HSV-1 exposure compared to ADEH−. Inhibition of the IRF3 and IRF7 innate immune pathways in ADEH+ may be an important mechanism for increased susceptibility to disseminated viral infection. PMID:25159465
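The abstract reports genes at a false discovery rate below 0.05 but does not name the correction procedure. The sketch below assumes the widely used Benjamini-Hochberg step-up adjustment purely for illustration; the p-values are hypothetical.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so each adjusted value is the
    # minimum of p * m / rank over all ranks at or above the current one.
    for rank in range(m, 0, -1):
        index = order[rank - 1]
        running_min = min(running_min, pvalues[index] * m / rank)
        adjusted[index] = running_min
    return adjusted


# Hypothetical per-gene p-values from a differential expression test.
pvalues = [0.001, 0.008, 0.039, 0.041, 0.6]
qvalues = benjamini_hochberg(pvalues)
significant = [p for p, q in zip(pvalues, qvalues) if q < 0.05]
print(qvalues)
print(len(significant), "genes pass FDR < 0.05")
```

With these inputs the two smallest p-values pass the threshold, while 0.039 and 0.041 do not, even though both are below an unadjusted 0.05 cut-off.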
Wan, Pin-Jun; Yuan, San-Yue; Wang, Wei-Xia; Chen, Xu; Lai, Feng-Xiang; Fu, Qiang
2016-01-01
The basic helix-loop-helix (bHLH) transcription factors in insects play essential roles in multiple developmental processes including neurogenesis, sterol metabolism, circadian rhythms, organogenesis and formation of olfactory sensory neurons. The identification and functional analysis of bHLH family members of the most destructive insect pest of rice, Nilaparvata lugens, may provide novel tools for pest management. Here, a genome-wide survey for bHLH sequences identified 60 bHLH sequences (NlbHLHs) encoded in the draft genome of N. lugens. Phylogenetic analysis of the bHLH domains successfully classified these genes into 40 bHLH families in group A (25), B (14), C (10), D (1), E (8) and F (2). The number of NlbHLHs with introns is higher than in many other insect species, and the average intron length is shorter than that of Acyrthosiphon pisum. A high number of ortholog families of NlbHLHs was found, suggesting functional conservation for these proteins. Compared to other insect species studied, N. lugens has the highest number of bHLH members. Furthermore, gene duplication events of SREBP, Kn(col), Tap, Delilah, Sim, Ato and Crp were found in N. lugens. In addition, a putative full set of NlbHLH genes is defined and compared with another insect species. Thus, our classification of these NlbHLH members provides a platform for further investigations of bHLH protein functions in the regulation of N. lugens, and of insects in general. PMID:27869716
Crow, Megan; Paul, Anirban; Ballouz, Sara; Huang, Z Josh; Gillis, Jesse
2018-02-28
Single-cell RNA-sequencing (scRNA-seq) technology provides a new avenue to discover and characterize cell types; however, the experiment-specific technical biases and analytic variability inherent to current pipelines may undermine its replicability. Meta-analysis is further hampered by the use of ad hoc naming conventions. Here we demonstrate our replication framework, MetaNeighbor, that quantifies the degree to which cell types replicate across datasets, and enables rapid identification of clusters with high similarity. We first measure the replicability of neuronal identity, comparing results across eight technically and biologically diverse datasets to define best practices for more complex assessments. We then apply this to novel interneuron subtypes, finding that 24/45 subtypes have evidence of replication, which enables the identification of robust candidate marker genes. Across tasks we find that large sets of variably expressed genes can identify replicable cell types with high accuracy, suggesting a general route forward for large-scale evaluation of scRNA-seq data.
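Replicability of a cell type across datasets is commonly scored with an area under the ROC curve: how well cells of that type in a held-out dataset are ranked above all other cells. The sketch below shows only that AUROC calculation, in its pairwise (Mann-Whitney) form; it is not the MetaNeighbor implementation, and the scores and labels are hypothetical.

```python
def auroc(scores, labels):
    """Probability that a random positive outranks a random negative.

    Ties between a positive and a negative count as half a win.
    """
    positives = [s for s, is_pos in zip(scores, labels) if is_pos]
    negatives = [s for s, is_pos in zip(scores, labels) if not is_pos]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in positives
        for n in negatives
    )
    return wins / (len(positives) * len(negatives))


# Hypothetical similarity scores for four test cells; 1 marks cells that
# truly belong to the cell type being evaluated.
print(auroc([0.9, 0.8, 0.3, 0.1], [1, 1, 0, 0]))  # 1.0
print(auroc([0.9, 0.2, 0.8, 0.1], [1, 1, 0, 0]))  # 0.75
```

An AUROC near 1 indicates that the cell type is recovered across datasets, while a value near 0.5 indicates no better than chance.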
Improved PCR primers for the detection and identification of arbuscular mycorrhizal fungi.
Lee, Jaikoo; Lee, Sangsun; Young, J Peter W
2008-08-01
A set of PCR primers that should amplify all subgroups of arbuscular mycorrhizal fungi (AMF, Glomeromycota), but exclude sequences from other organisms, was designed to facilitate rapid detection and identification directly from field-grown plant roots. The small subunit rRNA gene was targeted for the new primers (AML1 and AML2) because phylogenetic relationships among the Glomeromycota are well understood for this gene. Sequence comparisons indicate that the new primers should amplify all published AMF sequences except those from Archaeospora trappei. The specificity of the new primers was tested using 23 different AMF spore morphotypes from trap cultures and Miscanthus sinensis, Glycine max and Panax ginseng roots sampled from the field. Non-AMF DNA of 14 plants, 14 Basidiomycota and 18 Ascomycota was also tested as negative controls. Sequences amplified from roots using the new primers were compared with those obtained using the established NS31 and AM1 primer combination. The new primers have much better specificity and coverage of all known AMF groups.
Khachane, Amit; Kumar, Ranjit; Jain, Sanyam; Jain, Samta; Banumathy, Gowrishankar; Singh, Varsha; Nagpal, Saurabh; Tatu, Utpal
2005-01-01
Bioinformatics tools to aid gene and protein sequence analysis have become an integral part of biology in the post-genomic era. Release of the Plasmodium falciparum genome sequence has allowed biologists to define the gene and the predicted protein content as well as their sequences in the parasite. Using pI and molecular weight as characteristics unique to each protein, we have developed a bioinformatics tool to aid identification of proteins from Plasmodium falciparum. The tool makes use of a Virtual 2-DE generated by plotting all of the proteins from the Plasmodium database on a pI versus molecular weight scale. Proteins are identified by comparing the position of migration of desired protein spots from an experimental 2-DE and that on a virtual 2-DE. The procedure has been automated in the form of user-friendly software called "Plasmo2D". The tool can be downloaded from http://144.16.89.25/Plasmo2D.zip.
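The matching step described here, comparing an experimental spot's position with predicted positions on a virtual 2-DE, can be sketched as a nearest-neighbour lookup in the pI versus molecular weight plane. This is not the Plasmo2D code: the protein names, the values, and the choice of an unweighted Euclidean distance on (pI, log10 MW) are all assumptions made for illustration.

```python
from math import hypot, log10


def nearest_proteins(spot_pi, spot_mw_kda, proteome, top=3):
    """Rank predicted proteins by distance to an experimental 2-DE spot.

    Molecular weight is compared on a log scale because migration in the
    second dimension is roughly logarithmic in mass.
    """
    def distance(protein):
        return hypot(
            protein["pI"] - spot_pi,
            log10(protein["mw_kda"]) - log10(spot_mw_kda),
        )

    return sorted(proteome, key=distance)[:top]


# Hypothetical virtual 2-DE entries (predicted pI and molecular weight).
proteome = [
    {"name": "protein_1", "pI": 5.4, "mw_kda": 72.0},
    {"name": "protein_2", "pI": 8.9, "mw_kda": 25.0},
    {"name": "protein_3", "pI": 5.6, "mw_kda": 40.0},
    {"name": "protein_4", "pI": 6.8, "mw_kda": 70.0},
]

for candidate in nearest_proteins(5.5, 70.0, proteome, top=2):
    print(candidate["name"])
```

In practice the two axes would need weighting to reflect experimental error in each dimension; an unweighted distance lets the pI difference dominate.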
Areeshi, Mohammed Yahya
2013-01-01
DNA repair capacity is crucial in maintaining cellular functions and homeostasis. However, it can be altered based on DNA sequence variations in DNA repair genes and this may lead to the development of many diseases including malignancies. Identification of genetic polymorphisms responsible for reduced DNA repair capacity is necessary for better prevention. Homologous recombination (HR), a major double strand break repair pathway, plays a critical role in maintaining the genome stability. The present study was performed to determine the frequency of the HR gene XRCC3 Exon 7 (C18067T, rs861539) polymorphisms in Saudi Arabian population in comparison with epidemiological studies by "MEDLINE" search to equate with global populations. The variant allelic (T) frequency of XRCC3 (C>T) was found to be 39%. Our results suggest that frequency of XRCC3 (C>T) DNA repair gene exhibits distinctive patterns compared with the Saudi Arabian population and this might be attributed to ethnic variation. The present findings may help in high-risk screening of humans exposed to environmental carcinogens and cancer predisposition in different ethnic groups.
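A variant allele frequency such as the 39% reported here is obtained from genotype counts by counting alleles. The sketch below shows that calculation together with Hardy-Weinberg expected genotype counts; the genotype counts are hypothetical and were chosen only so that the result matches the reported frequency.

```python
def variant_allele_frequency(n_cc, n_ct, n_tt):
    """Frequency of the T allele from C/C, C/T and T/T genotype counts."""
    total_alleles = 2 * (n_cc + n_ct + n_tt)
    return (2 * n_tt + n_ct) / total_alleles


def hardy_weinberg_expected(n_individuals, q):
    """Expected (CC, CT, TT) counts for variant allele frequency q."""
    p = 1.0 - q
    return (
        n_individuals * p * p,
        n_individuals * 2 * p * q,
        n_individuals * q * q,
    )


# Hypothetical genotype counts for 100 individuals (not from the study).
n_cc, n_ct, n_tt = 37, 48, 15
q = variant_allele_frequency(n_cc, n_ct, n_tt)
print(f"T allele frequency = {q:.2f}")
print([round(x, 1) for x in hardy_weinberg_expected(100, q)])
```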
Chaillou, Thomas; Jackson, Janna R; England, Jonathan H; Kirby, Tyler J; Richards-White, Jena; Esser, Karyn A; Dupont-Versteegden, Esther E; McCarthy, John J
2015-01-01
The purpose of this study was to compare the gene expression profile of mouse skeletal muscle undergoing two forms of growth (hypertrophy and regrowth) with the goal of identifying a conserved set of differentially expressed genes. Expression profiling by microarray was performed on the plantaris muscle subjected to 1, 3, 5, 7, 10, and 14 days of hypertrophy or regrowth following 2 wk of hind-limb suspension. We identified 97 differentially expressed genes (≥2-fold increase or ≥50% decrease compared with control muscle) that were conserved during the two forms of muscle growth. The vast majority (∼90%) of the differentially expressed genes was upregulated and occurred at a single time point (64 out of 86 genes), which most often was on the first day of the time course. Microarray analysis from the conserved upregulated genes showed a set of genes related to contractile apparatus and stress response at day 1, including three genes involved in mechanotransduction and four genes encoding heat shock proteins. Our analysis further identified three cell cycle-related genes at day and several genes associated with extracellular matrix (ECM) at both days 3 and 10. In conclusion, we have identified a core set of genes commonly upregulated in two forms of muscle growth that could play a role in the maintenance of sarcomere stability, ECM remodeling, cell proliferation, fast-to-slow fiber type transition, and the regulation of skeletal muscle growth. These findings suggest conserved regulatory mechanisms involved in the adaptation of skeletal muscle to increased mechanical loading. Copyright © 2015 the American Physiological Society.
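The selection rule stated in this abstract (at least a 2-fold increase or at least a 50% decrease relative to control, conserved across both growth models) can be written as a simple filter followed by a set intersection. The gene names and expression values below are hypothetical.

```python
def classify(treated, control):
    """Label a gene by its expression ratio relative to control muscle."""
    ratio = treated / control
    if ratio >= 2.0:
        return "up"        # at least a 2-fold increase
    if ratio <= 0.5:
        return "down"      # at least a 50% decrease
    return "unchanged"


def differentially_expressed(expression, control):
    """Map each changed gene to its direction of change."""
    calls = {g: classify(v, control[g]) for g, v in expression.items()}
    return {g: c for g, c in calls.items() if c != "unchanged"}


# Hypothetical normalised signal values for one time point.
control = {"gene_a": 100.0, "gene_b": 80.0, "gene_c": 50.0, "gene_d": 60.0}
hypertrophy = {"gene_a": 250.0, "gene_b": 30.0, "gene_c": 55.0, "gene_d": 130.0}
regrowth = {"gene_a": 210.0, "gene_b": 90.0, "gene_c": 20.0, "gene_d": 125.0}

hyp = differentially_expressed(hypertrophy, control)
reg = differentially_expressed(regrowth, control)

# Conserved genes change in the same direction in both forms of growth.
conserved = {g: hyp[g] for g in hyp.keys() & reg.keys() if hyp[g] == reg[g]}
print(sorted(conserved.items()))
```

Requiring the same direction of change in both models is an assumption of this sketch; the abstract only states that the genes were conserved between the two forms of growth.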
Chaillou, Thomas; Jackson, Janna R.; England, Jonathan H.; Kirby, Tyler J.; Richards-White, Jena; Esser, Karyn A.; Dupont-Versteegden, Esther E.
2014-01-01
The purpose of this study was to compare the gene expression profile of mouse skeletal muscle undergoing two forms of growth (hypertrophy and regrowth) with the goal of identifying a conserved set of differentially expressed genes. Expression profiling by microarray was performed on the plantaris muscle subjected to 1, 3, 5, 7, 10, and 14 days of hypertrophy or regrowth following 2 wk of hind-limb suspension. We identified 97 differentially expressed genes (≥2-fold increase or ≥50% decrease compared with control muscle) that were conserved during the two forms of muscle growth. The vast majority (∼90%) of the differentially expressed genes was upregulated and occurred at a single time point (64 out of 86 genes), which most often was on the first day of the time course. Microarray analysis from the conserved upregulated genes showed a set of genes related to contractile apparatus and stress response at day 1, including three genes involved in mechanotransduction and four genes encoding heat shock proteins. Our analysis further identified three cell cycle-related genes at day and several genes associated with extracellular matrix (ECM) at both days 3 and 10. In conclusion, we have identified a core set of genes commonly upregulated in two forms of muscle growth that could play a role in the maintenance of sarcomere stability, ECM remodeling, cell proliferation, fast-to-slow fiber type transition, and the regulation of skeletal muscle growth. These findings suggest conserved regulatory mechanisms involved in the adaptation of skeletal muscle to increased mechanical loading. PMID:25554798
d'Ersu, J; Aubin, G G; Mercier, P; Nicollet, P; Bémer, P; Corvec, S
2016-01-01
Staphylococcus caprae is an emerging microorganism in human bone and joint infections (BJI). The aim of this study is to describe the features of S. caprae isolates involved in BJI (H for human) compared with those of isolates recovered in goat mastitis (A for animal). Fourteen isolates of each origin were included. Identifications were performed using a Vitek 2 GP ID card, tuf gene sequencing, and matrix-assisted laser desorption ionization-time of flight (MALDI-TOF) Vitek MS. Molecular typing was carried out using pulsed-field gel electrophoresis (PFGE) and DiversiLab technology. The crystal violet method was used to determine biofilm-forming ability. Virulence factors were searched by PCR. Vitek MS technology provides an accurate identification for the two types of isolates compared to that of gold-standard sequencing (sensitivity, 96.4%), whereas the Vitek 2 GP ID card was more effective for H isolates. Molecular typing methods revealed two distinct lineages corresponding to the origin despite few overlaps: H and A. In our experimental conditions, no significant difference was observed in biofilm production ability between H and A isolates. Nine isolates (5 H isolates and 4 A isolates) behaved as weak producers while one A isolate was a strong producer. Concerning virulence factors, the autolysin atlC and the serine aspartate adhesin (sdrZ) genes were detected in 24 isolates (86%), whereas the lipase gene was always detected, except in one H isolate (96%). The ica operon was present in 23 isolates (82%). Fibrinogen-binding (fbe) or collagen-binding (cna) genes were not detected by using primers designed for Staphylococcus aureus or Staphylococcus epidermidis, even in low stringency conditions. Although S. caprae probably remains underestimated in human infections, further studies are needed to better understand the evolution and the adaptation of this species to its host. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Biomarkers of acute respiratory allergen exposure: Screening for sensitization potential
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pucheu-Haston, Cherie M., E-mail: Pucheu-Haston.Cherie@epa.go; Copeland, Lisa B.; Vallanat, Beena
2010-04-15
Effective hazard screening will require the development of high-throughput or in vitro assays for the identification of potential sensitizers. The goal of this preliminary study was to identify potential biomarkers that differentiate the response to allergens vs non-allergens following an acute exposure in naive individuals. Female BALB/c mice received a single intratracheal aspiration exposure to Metarhizium anisopliae crude antigen (MACA) or bovine serum albumin (BSA) in Hank's Balanced Salt Solution (HBSS) or HBSS alone. Mice were terminated after 1, 3, 6, 12, 18 and 24 h. Bronchoalveolar lavage fluid (BALF) was evaluated to determine total and differential cellularity, total protein concentration and LDH activity. RNA was isolated from lung tissue for microarray analysis and qRT-PCR. MACA administration induced a rapid increase in BALF neutrophils, lymphocytes, eosinophils and total protein compared to BSA or HBSS. Microarray analysis demonstrated differential expression of genes involved in cytokine production, signaling, inflammatory cell recruitment, adhesion and activation in 3 and 12 h MACA-treated samples compared to BSA or HBSS. Further analyses allowed identification of approx 100 candidate biomarker genes. Eleven genes were selected for further assessment by qRT-PCR. Of these, 6 demonstrated persistently increased expression (Ccl17, Ccl22, Ccl7, Cxcl10, Cxcl2, Saa1), while C3ar1 increased from 6-24 h. In conclusion, a single respiratory exposure of mice to an allergenic mold extract induces an inflammatory response which is distinct in phenotype and gene transcription from the response to a control protein. Further validation of these biomarkers with additional allergens and irritants is needed. These biomarkers may facilitate improvements in screening methods.
Identification of De Novo Copy Number Variants Associated with Human Disorders of Sexual Development
Tannour-Louet, Mounia; Han, Shuo; Corbett, Sean T.; Louet, Jean-Francois; Yatsenko, Svetlana; Meyers, Lindsay; Shaw, Chad A.; Kang, Sung-Hae L.; Cheung, Sau Wai; Lamb, Dolores J.
2010-01-01
Disorders of sexual development (DSD), ranging in severity from genital abnormalities to complete sex reversal, are among the most common human birth defects with incidence rates reaching almost 3%. Although causative alterations in key genes controlling gonad development have been identified, the majority of DSD cases remain unexplained. To improve the diagnosis, we screened 116 children born with idiopathic DSD using a clinically validated array-based comparative genomic hybridization platform. 8951 controls without urogenital defects were used to compare with our cohort of affected patients. Clinically relevant imbalances were found in 21.5% of the analyzed patients. Most anomalies (74.2%) evaded detection by the routinely ordered karyotype and were scattered across the genome in gene-enriched subtelomeric loci. Among these defects, confirmed de novo duplication and deletion events were noted on 1p36.33, 9p24.3 and 19q12-q13.11 for ambiguous genitalia, 10p14 and Xq28 for cryptorchidism and 12p13 and 16p11.2 for hypospadias. These variants were significantly associated with genitourinary defects (P = 6.08×10⁻¹²). The causality of defects observed in 5p15.3, 9p24.3, 22q12.1 and Xq28 was supported by the presence of overlapping chromosomal rearrangements in several unrelated patients. In addition to known gonad determining genes including SRY and DMRT1, novel candidate genes such as FGFR2, KANK1, ADCY2 and ZEB2 were encompassed. The identification of risk germline rearrangements for urogenital birth defects may impact diagnosis and genetic counseling and contribute to the elucidation of the molecular mechanisms underlying the pathogenesis of human sexual development. PMID:21048976
Identification and Functional Analysis of Healing Regulators in Drosophila
Álvarez-Fernández, Carmen; Tamirisa, Srividya; Prada, Federico; Chernomoretz, Ariel; Podhajcer, Osvaldo; Blanco, Enrique; Martín-Blanco, Enrique
2015-01-01
Wound healing is an essential homeostatic mechanism that maintains the epithelial barrier integrity after tissue damage. Although we know the overall steps in wound healing, many of the underlying molecular mechanisms remain unclear. Genetically amenable systems, such as wound healing in Drosophila imaginal discs, do not model all aspects of the repair process. However, they do allow the less understood aspects of the healing response to be explored, e.g., which signal(s) are responsible for initiating tissue remodeling? How is sealing of the epithelia achieved? Or, what inhibitory cues cancel the healing machinery upon completion? Answering these and other questions first requires the identification and functional analysis of wound-specific genes. A variety of microarray analyses in mice and humans have identified characteristic profiles of gene expression at the wound site; however, very few functional studies in healing regulation have been carried out. We developed an experimentally controlled method that is healing-permissive and that allows live imaging and biochemical analysis of cultured imaginal discs. We performed comparative genome-wide profiling between Drosophila imaginal cells actively involved in healing versus their non-engaged siblings. Sets of potential wound-specific genes were subsequently identified. Importantly, besides identifying and categorizing new genes, we functionally tested many of their gene products by genetic interference and overexpression in healing assays. This non-saturated analysis defines a relevant set of genes whose changes in expression level are functionally significant for proper tissue repair. Amongst these we identified the TCP1 chaperonin complex as a key regulator of the actin cytoskeleton essential for the wound healing response. There is promise that our newly identified wound-healing genes will guide future work in the more complex mammalian wound healing response. PMID:25647511
Identification of causal genes for complex traits
Hormozdiari, Farhad; Kichaev, Gleb; Yang, Wen-Yun; Pasaniuc, Bogdan; Eskin, Eleazar
2015-01-01
Motivation: Although genome-wide association studies (GWAS) have identified thousands of variants associated with common diseases and complex traits, only a handful of these variants are validated to be causal. We consider ‘causal variants’ as variants which are responsible for the association signal at a locus. As opposed to association studies that benefit from linkage disequilibrium (LD), the main challenge in identifying causal variants at associated loci lies in distinguishing among the many closely correlated variants due to LD. This is particularly important for model organisms such as inbred mice, where LD extends much further than in human populations, resulting in large stretches of the genome with significantly associated variants. Furthermore, these model organisms are highly structured and require correction for population structure to remove potential spurious associations. Results: In this work, we propose CAVIAR-Gene (CAusal Variants Identification in Associated Regions), a novel method that is able to operate across large LD regions of the genome while also correcting for population structure. A key feature of our approach is that it provides as output a minimally sized set of genes that captures the genes which harbor causal variants with probability ρ. Through extensive simulations, we demonstrate that our method not only speeds up computation, but also has an average of 10% higher recall rate compared with the existing approaches. We validate our method using a real mouse high-density lipoprotein (HDL) dataset and show that CAVIAR-Gene is able to identify Apoa2 (a gene known to harbor causal variants for HDL), while reducing the number of genes that need to be tested for functionality by a factor of 2. Availability and implementation: Software is freely available for download at genetics.cs.ucla.edu/caviar. Contact: eeskin@cs.ucla.edu PMID:26072484
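The idea of a minimally sized gene set that captures the causal gene with probability ρ can be illustrated with a small sketch. This is not the CAVIAR-Gene algorithm: it assumes, for simplicity, that each gene already has a posterior probability of being the single causal gene (so the events are mutually exclusive), and the gene names and probabilities other than the name Apoa2 are hypothetical. Under that assumption, picking genes in order of decreasing posterior until the total reaches ρ gives a smallest such set.

```python
def minimal_gene_set(posteriors, rho):
    """Return the smallest set of genes whose summed posterior reaches rho.

    Assumption (not from the paper): `posteriors` maps gene name to the
    probability that the gene is the single causal gene, so the events are
    mutually exclusive and the probabilities sum to at most 1.
    """
    if not 0 < rho <= 1:
        raise ValueError("rho must be in (0, 1]")
    chosen, total = [], 0.0
    # Taking genes in order of decreasing posterior yields a minimal-size set.
    ranked = sorted(posteriors.items(), key=lambda kv: kv[1], reverse=True)
    for gene, p in ranked:
        if total >= rho:
            break
        chosen.append(gene)
        total += p
    if total < rho:
        raise ValueError("posteriors do not reach rho")
    return chosen, total


# Hypothetical posteriors for illustration only; not values from the study.
example = {"Apoa2": 0.62, "GeneB": 0.20, "GeneC": 0.10, "GeneD": 0.05, "GeneE": 0.03}
genes, mass = minimal_gene_set(example, rho=0.9)
print(genes, round(mass, 2))
```

With these made-up numbers, three of the five genes are enough to reach ρ = 0.9, which is the kind of reduction in follow-up candidates the abstract describes.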
Identification of causal genes for complex traits.
Hormozdiari, Farhad; Kichaev, Gleb; Yang, Wen-Yun; Pasaniuc, Bogdan; Eskin, Eleazar
2015-06-15
Although genome-wide association studies (GWAS) have identified thousands of variants associated with common diseases and complex traits, only a handful of these variants are validated to be causal. We consider 'causal variants' as variants which are responsible for the association signal at a locus. As opposed to association studies that benefit from linkage disequilibrium (LD), the main challenge in identifying causal variants at associated loci lies in distinguishing among the many closely correlated variants due to LD. This is particularly important for model organisms such as inbred mice, where LD extends much further than in human populations, resulting in large stretches of the genome with significantly associated variants. Furthermore, these model organisms are highly structured and require correction for population structure to remove potential spurious associations. In this work, we propose CAVIAR-Gene (CAusal Variants Identification in Associated Regions), a novel method that is able to operate across large LD regions of the genome while also correcting for population structure. A key feature of our approach is that it provides as output a minimally sized set of genes that captures the genes which harbor causal variants with probability ρ. Through extensive simulations, we demonstrate that our method not only speeds up computation, but also has an average of 10% higher recall rate compared with the existing approaches. We validate our method using a real mouse high-density lipoprotein (HDL) dataset and show that CAVIAR-Gene is able to identify Apoa2 (a gene known to harbor causal variants for HDL), while reducing the number of genes that need to be tested for functionality by a factor of 2. Software is freely available for download at genetics.cs.ucla.edu/caviar. © The Author 2015. Published by Oxford University Press.
Genotype Diversity and Distribution of Orientia tsutsugamushi Causing Scrub Typhus in Thailand
2011-07-01
typhus assay and vaccine development. Orientia tsutsugamushi, formerly known as Rickettsia tsutsugamushi, is the causative agent of scrub typhus, a...Sunderland, MA. 13. Horinouchi, H., et al. 1996. Genotypic identification of Rickettsia tsutsugamushi by restriction fragment length polymorphism... Rickettsia tsutsugamushi. Sequence and comparative analyses of the genes encoding TSA homologues from four antigenic variants. J. Biol. Chem. 267:12728
SynFind: Compiling Syntenic Regions across Any Set of Genomes on Demand.
Tang, Haibao; Bomhoff, Matthew D; Briones, Evan; Zhang, Liangsheng; Schnable, James C; Lyons, Eric
2015-11-11
The identification of conserved syntenic regions enables discovery of predicted locations for orthologous and homeologous genes, even when no such gene is present. This capability means that synteny-based methods are far more effective than sequence similarity-based methods in identifying true-negatives, a necessity for studying gene loss and gene transposition. However, the identification of syntenic regions requires complex analyses which must be repeated for pairwise comparisons between any two species. Therefore, as the number of published genomes increases, there is a growing demand for scalable, simple-to-use applications to perform comparative genomic analyses that cater to both gene family studies and genome-scale studies. We implemented SynFind, a web-based tool that addresses this need. Given one query genome, SynFind is capable of identifying conserved syntenic regions in any set of target genomes. SynFind is capable of reporting per-gene information, useful for researchers studying specific gene families, as well as genome-wide data sets of syntenic gene and predicted gene locations, critical for researchers focused on large-scale genomic analyses. Inference of syntenic homologs provides the basis for correlation of functional changes around genes of interest between related organisms. Deployed on the CoGe online platform, SynFind is connected to the genomic data from over 15,000 organisms from all domains of life as well as supporting multiple releases of the same organism. SynFind makes use of a powerful job execution framework that promises scalability and reproducibility. SynFind can be accessed at http://genomevolution.org/CoGe/SynFind.pl. A video tutorial of SynFind using Phytophthora as an example is available at http://www.youtube.com/watch?v=2Agczny9Nyc. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Hettne, Kristina M; Boorsma, André; van Dartel, Dorien A M; Goeman, Jelle J; de Jong, Esther; Piersma, Aldert H; Stierum, Rob H; Kleinjans, Jos C; Kors, Jan A
2013-01-29
Availability of chemical response-specific lists of genes (gene sets) for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM), and that these can be used with gene set analysis (GSA) methods for chemical treatment identification, for pharmacological mechanism elucidation, and for comparing compound toxicity profiles. We created 30,211 chemical response-specific gene sets for human and mouse by next-gen TM, and derived 1,189 (human) and 588 (mouse) gene sets from the Comparative Toxicogenomics Database (CTD). We tested for significant differential expression (SDE) (false discovery rate -corrected p-values < 0.05) of the next-gen TM-derived gene sets and the CTD-derived gene sets in gene expression (GE) data sets of five chemicals (from experimental models). We tested for SDE of gene sets for six fibrates in a peroxisome proliferator-activated receptor alpha (PPARA) knock-out GE dataset and compared to results from the Connectivity Map. We tested for SDE of 319 next-gen TM-derived gene sets for environmental toxicants in three GE data sets of triazoles, and tested for SDE of 442 gene sets associated with embryonic structures. We compared the gene sets to triazole effects seen in the Whole Embryo Culture (WEC), and used principal component analysis (PCA) to discriminate triazoles from other chemicals. Next-gen TM-derived gene sets matching the chemical treatment were significantly altered in three GE data sets, and the corresponding CTD-derived gene sets were significantly altered in five GE data sets. Six next-gen TM-derived and four CTD-derived fibrate gene sets were significantly altered in the PPARA knock-out GE dataset. None of the fibrate signatures in cMap scored significant against the PPARA GE signature. 33 environmental toxicant gene sets were significantly altered in the triazole GE data sets. 
21 of these toxicants had a similar toxicity pattern as the triazoles. We confirmed embryotoxic effects, and discriminated triazoles from other chemicals. Gene set analysis with next-gen TM-derived chemical response-specific gene sets is a scalable method for identifying similarities in gene responses to other chemicals, from which one may infer potential mode of action and/or toxic effect.
2013-01-01
Background Availability of chemical response-specific lists of genes (gene sets) for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM), and that these can be used with gene set analysis (GSA) methods for chemical treatment identification, for pharmacological mechanism elucidation, and for comparing compound toxicity profiles. Methods We created 30,211 chemical response-specific gene sets for human and mouse by next-gen TM, and derived 1,189 (human) and 588 (mouse) gene sets from the Comparative Toxicogenomics Database (CTD). We tested for significant differential expression (SDE) (false discovery rate -corrected p-values < 0.05) of the next-gen TM-derived gene sets and the CTD-derived gene sets in gene expression (GE) data sets of five chemicals (from experimental models). We tested for SDE of gene sets for six fibrates in a peroxisome proliferator-activated receptor alpha (PPARA) knock-out GE dataset and compared to results from the Connectivity Map. We tested for SDE of 319 next-gen TM-derived gene sets for environmental toxicants in three GE data sets of triazoles, and tested for SDE of 442 gene sets associated with embryonic structures. We compared the gene sets to triazole effects seen in the Whole Embryo Culture (WEC), and used principal component analysis (PCA) to discriminate triazoles from other chemicals. Results Next-gen TM-derived gene sets matching the chemical treatment were significantly altered in three GE data sets, and the corresponding CTD-derived gene sets were significantly altered in five GE data sets. Six next-gen TM-derived and four CTD-derived fibrate gene sets were significantly altered in the PPARA knock-out GE dataset. None of the fibrate signatures in cMap scored significant against the PPARA GE signature. 33 environmental toxicant gene sets were significantly altered in the triazole GE data sets. 
21 of these toxicants had a similar toxicity pattern as the triazoles. We confirmed embryotoxic effects, and discriminated triazoles from other chemicals. Conclusions Gene set analysis with next-gen TM-derived chemical response-specific gene sets is a scalable method for identifying similarities in gene responses to other chemicals, from which one may infer potential mode of action and/or toxic effect. PMID:23356878
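The significance criterion in this abstract (false discovery rate-corrected p-values < 0.05) can be sketched as follows. The abstract does not say which FDR procedure was used; the Benjamini-Hochberg adjustment below is an assumption chosen because it is the most common one, and the gene set names and raw p-values are hypothetical.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest to the smallest p-value so that the adjusted
    # values are forced to be monotone in the rank.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical raw p-values for four gene sets; not data from the study.
raw = {"set_A": 0.001, "set_B": 0.012, "set_C": 0.03, "set_D": 0.2}
adj = benjamini_hochberg(list(raw.values()))
significant = [name for name, q in zip(raw, adj) if q < 0.05]
print(significant)
```

In practice a vetted implementation from a statistics library would normally be preferred over a hand-written one; the sketch only shows what "FDR-corrected p-value < 0.05" means operationally.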
USDA-ARS's Scientific Manuscript database
Repetitive sequence analysis has become an integral part of genome sequencing projects in addition to gene identification and annotation. Identification of repeats is important not only because it improves gene prediction, but also because of the role that repetitive sequences play in determining th...
2010-03-01
amino acid substitution in this gene has been associated with uric acid nephrolithiasis (32). Recent GWAS have identified another variant within this...Identification of a novel gene and a common variant associated with uric acid nephrolithiasis in a Sardinian genetic isolate. Am J Hum Genet 72
Possibilities in identification of genomic species of Burkholderia cepacia complex by PCR and RFLP.
Navrátilová, Lucie; Chromá, Magdalena; Hanulík, Vojtech; Raclavský, Vladislav
2013-01-01
The strains belonging to the Burkholderia cepacia complex are important opportunistic pathogens in immunocompromised patients and cause serious diseases. It is possible to obtain isolates from soil, water, plants and human samples. Taxonomy of this group is difficult. The Burkholderia cepacia complex consists of seventeen genomic species and the genetic scheme is based on the recA gene. Commonly, the first five genomovars occur in humans, mostly genomovars II and III, subdivision IIIA. In this study we tested identification of the first five genomovars by PCR followed by melting analysis and RFLP. The experiments targeted eubacterial 16S rDNA and the specific gene recA, which allowed identification of all five genomovars. The recA gene proved more suitable than 16S rDNA, which enabled direct identification of only genomovars II and V; genomovars I, III and IV were similar in their 16S rDNA sequence.
SeMPI: a genome-based secondary metabolite prediction and identification web server.
Zierep, Paul F; Padilla, Natàlia; Yonchev, Dimitar G; Telukunta, Kiran K; Klementz, Dennis; Günther, Stefan
2017-07-03
The secondary metabolism of bacteria, fungi and plants yields a vast number of bioactive substances. The constantly increasing amount of published genomic data provides the opportunity for an efficient identification of gene clusters by genome mining. Conversely, for many natural products with resolved structures, the encoding gene clusters have not been identified yet. Even though genome mining tools have become significantly more efficient in the identification of biosynthetic gene clusters, structural elucidation of the actual secondary metabolite is still challenging, especially due to as yet unpredictable post-modifications. Here, we introduce SeMPI, a web server providing a prediction and identification pipeline for natural products synthesized by type I modular polyketide synthases (PKS). In order to limit the possible structures of PKS products and to include putative tailoring reactions, a structural comparison with annotated natural products was introduced. Furthermore, a benchmark was designed based on 40 gene clusters with annotated PKS products. The web server of the pipeline (SeMPI) is freely available at: http://www.pharmaceutical-bioinformatics.de/sempi. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
2012-09-30
computational tools provide the ability to display, browse, select, filter and summarize spatio-temporal relationships of these individual-based...her research assistant at Esri, Shaun Walbridge, and members of the Marine Mammal Institute (MMI), including Tomas Follet and Debbie Steel. This...Genomics Laboratory, MMI, OSU. 4 As part of the geneGIS initiative, these SPLASH photo-identification records and the geneSPLASH DNA profiles
A TaqMan real-time PCR-based assay for the identification of Fasciola spp.
Alasaad, Samer; Soriguer, Ramón C; Abu-Madi, Marawan; El Behairy, Ahmed; Jowers, Michael J; Baños, Pablo Díez; Píriz, Ana; Fickel, Joerns; Zhu, Xing-Quan
2011-06-30
Real-time quantitative PCR (qPCR) is one of the key technologies of the post-genome era, with clear advantages compared to normal end-point PCR. In this paper, we report the first qPCR-based assay for the identification of Fasciola spp. Based on sequences of the second internal transcribed spacers (ITS-2) of the ribosomal rRNA gene, we used a set of genus-specific primers for Fasciola ITS-2 amplification, and we designed species-specific internal TaqMan probes to identify F. hepatica and F. gigantica, as well as the hybrid 'intermediate' Fasciola. These primers and probes were used for the highly specific, sensitive, and simple identification of Fasciola species collected from different animal hosts from China, Spain, Niger and Egypt. The novel qPCR-based technique for the identification of Fasciola spp. may provide a useful tool for the epidemiological investigation of Fasciola infection, including their intermediate snail hosts. Copyright © 2011 Elsevier B.V. All rights reserved.
Oligonucleotide microarray for the identification of potential mycotoxigenic fungi
2010-01-01
Background Mycotoxins are secondary metabolites which are produced by numerous fungi and pose a continuous challenge to the safety and quality of food commodities in South Africa. These toxins have toxicologically relevant effects on humans and animals that eat contaminated foods. In this study, a diagnostic DNA microarray was developed for the identification of the most common food-borne fungi, as well as the genes leading to toxin production. Results A total of 40 potentially mycotoxigenic fungi isolated from different food commodities, as well as the genes that are involved in the mycotoxin synthetic pathways, were analyzed. For fungal identification, oligonucleotide probes were designed by exploiting the sequence variations of the elongation factor 1-alpha (EF-1 α) coding regions and the internal transcribed spacer (ITS) regions of the rRNA gene cassette. For the detection of fungi able to produce mycotoxins, oligonucleotide probes directed towards genes leading to toxin production from different fungal strains were identified in data available in the public domain. The probes selected for fungal identification and the probes specific for toxin producing genes were spotted onto microarray slides. Conclusions The diagnostic microarray developed can be used to identify single pure strains or cultures of potentially mycotoxigenic fungi as well as genes leading to toxin production in both laboratory samples and maize-derived foods offering an interesting potential for microbiological laboratories. PMID:20307326
Bénit, Paule; Steffann, Julie; Lebon, Sophie; Chretien, Dominique; Kadhom, Noman; de Lonlay, Pascale; Goldenberg, Alice; Dumez, Yves; Dommergues, Marc; Rustin, Pierre; Munnich, Arnold; Rötig, Agnès
2003-05-01
Complex I deficiency, the most common cause of mitochondrial disorders, accounts for a variety of clinical symptoms and its genetic heterogeneity makes identification of the disease genes particularly tedious. Indeed, most of the 43 complex I subunits are encoded by nuclear genes, only seven of them being mitochondrially encoded. In order to offer urgent prenatal diagnosis, we have studied an inbred/multiplex family with complex I deficiency by using microsatellite DNA markers flanking the putative disease loci. Microsatellite DNA markers have allowed us to exclude the NDUFS7, NDUFS8, NDUFV1 and NDUFS1 genes and to find homozygosity at the NDUFS4 locus. Direct sequencing has led to identification of a homozygous splice acceptor site mutation in intron 1 of the NDUFS4 gene (IVS1nt -1, G-->A); this was not found in chorion villi of the ongoing pregnancy. We suggest that genotyping microsatellite DNA markers at putative disease loci in inbred/multiplex families helps to identify the disease-causing mutation. More generally, we suggest giving consideration to a more systematic microsatellite analysis of putative disease loci for identification of disease genes in inbred/multiplex families affected with genetically heterogeneous conditions.
Kumar, Kamal; Srivastava, Vikas; Purayannur, Savithri; Kaladhar, V Chandra; Cheruvu, Purnima Jaiswal; Verma, Praveen Kumar
2016-06-01
The WRKY genes have been identified as important transcriptional modulators predominantly during environmental stresses, but they also play a critical role at various stages of the plant life cycle. We report the identification of WRKY domain (WD)-encoding genes from galegoid clade legumes chickpea (Cicer arietinum L.) and barrel medic (Medicago truncatula). In total, 78 and 98 WD-encoding genes were found in chickpea and barrel medic, respectively. Comparative analysis suggests the presence of both conserved and unique WRKYs, and expansion of the WRKY family in M. truncatula primarily by tandem duplication. Exclusively found in galegoid legumes, CaWRKY16 and its orthologues encode a novel protein having transmembrane and partial Exo70 domains flanking a group-III WD. The genomic region of galegoids containing CaWRKY16 is more dynamic when compared with millettioids. In onion cells, fused CaWRKY16-EYFP showed punctate fluorescent signals in the cytoplasm. The chickpea WRKY group-III genes were further characterized for their transcript level modulation during pathogenic stress and treatments of abscisic acid, jasmonic acid, and salicylic acid (SA) by real-time PCR. Differential regulation of genes was observed during Ascochyta rabiei infection and SA treatment. Characterization of the A. rabiei- and SA-inducible gene CaWRKY50 showed that it localizes to the plant nucleus, binds to the W-box, and has a C-terminal transactivation domain. Overexpression of CaWRKY50 in tobacco plants resulted in early flowering and senescence. The in-depth comparative account presented here for two legume WRKY genes will be of great utility in hastening functional characterization of crop legume WRKYs and will also help in characterization of Exo70Js. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Stevenson, Lindsay G.; Drake, Steven K.; Murray, Patrick R.
2010-01-01
Matrix-assisted laser desorption ionization-time of flight (MALDI-TOF) mass spectrometry is a rapid, accurate method for identifying bacteria and fungi recovered on agar culture media. We report herein a method for the direct identification of bacteria in positive blood culture broths by MALDI-TOF mass spectrometry. A total of 212 positive cultures were examined, representing 32 genera and 60 species or groups. The identification of bacterial isolates by MALDI-TOF mass spectrometry was compared with biochemical testing, and discrepancies were resolved by gene sequencing. No identification (spectral score of <1.7) was obtained for 42 (19.8%) of the isolates, due most commonly to insufficient numbers of bacteria in the blood culture broth. Of the bacteria with a spectral score of ≥1.7, 162 (95.3%) of 170 isolates were correctly identified. All 8 isolates of Streptococcus mitis were misidentified as being Streptococcus pneumoniae isolates. This method provides a rapid, accurate, definitive identification of bacteria within 1 h of detection in positive blood cultures with the caveat that the identification of S. pneumoniae would have to be confirmed by an alternative test. PMID:19955282
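The percentages reported in this abstract follow directly from the stated counts, which a short arithmetic check makes explicit. The counts are taken from the abstract; the variable names are illustrative only.

```python
# Counts as stated in the abstract.
total_cultures = 212      # positive blood cultures examined
no_identification = 42    # spectral score < 1.7
correct = 162             # correctly identified among scored isolates

# Isolates with a spectral score of >= 1.7.
scored = total_cultures - no_identification

pct_no_id = 100 * no_identification / total_cultures
pct_correct = 100 * correct / scored
misidentified = scored - correct

print(f"scored isolates: {scored}")
print(f"no identification: {pct_no_id:.1f}%")
print(f"correct among scored: {pct_correct:.1f}%")
print(f"misidentified: {misidentified}")
```

The remainder of 8 misidentified isolates matches the number of Streptococcus mitis isolates that the abstract reports as misidentified.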
Stevenson, Lindsay G; Drake, Steven K; Murray, Patrick R
2010-02-01
Matrix-assisted laser desorption ionization-time of flight (MALDI-TOF) mass spectrometry is a rapid, accurate method for identifying bacteria and fungi recovered on agar culture media. We report herein a method for the direct identification of bacteria in positive blood culture broths by MALDI-TOF mass spectrometry. A total of 212 positive cultures were examined, representing 32 genera and 60 species or groups. The identification of bacterial isolates by MALDI-TOF mass spectrometry was compared with biochemical testing, and discrepancies were resolved by gene sequencing. No identification (spectral score of <1.7) was obtained for 42 (19.8%) of the isolates, due most commonly to insufficient numbers of bacteria in the blood culture broth. Of the bacteria with a spectral score of ≥1.7, 162 (95.3%) of 170 isolates were correctly identified. All 8 isolates of Streptococcus mitis were misidentified as being Streptococcus pneumoniae isolates. This method provides a rapid, accurate, definitive identification of bacteria within 1 h of detection in positive blood cultures with the caveat that the identification of S. pneumoniae would have to be confirmed by an alternative test.
Braem, G; De Vliegher, S; Supré, K; Haesebrouck, F; Leroy, F; De Vuyst, L
2011-01-10
Due to significant financial losses in the dairy cattle farming industry caused by mastitis and the possible influence of coagulase-negative staphylococci (CNS) in the development of this disease, accurate identification methods are needed that untangle the different species of the diverse CNS group. In this study, 39 Staphylococcus type strains and 253 field isolates were subjected to (GTG)(5)-PCR fingerprinting to construct a reference framework for the classification and identification of different CNS from (sub)clinical milk samples and teat apices swabs. Validation of the reference framework was performed by dividing the field isolates in two separate groups and testing whether one group of field isolates, in combination with type strains, could be used for a correct classification and identification of a second group of field isolates. (GTG)(5)-PCR fingerprinting achieved a typeability of 94.7% and an accuracy of 94.3% compared to identifications based on gene sequencing. The study shows the usefulness of the method to determine the identity of bovine Staphylococcus species, provided an identification framework updated with field isolates is available. Copyright © 2010 Elsevier B.V. All rights reserved.
Zhao, Jinshan; Li, Hegang; Liu, Kaidong; Zhang, Baoxun; Li, Peipei; He, Jianning; Cheng, Ming; De, Wei; Liu, Jifeng; Zhao, Yaofeng; Yang, Lihua; Liu, Nan
2016-10-01
Goats are an important source of fibers. In the present study microarray technology was used to investigate the potential genes primarily involved in hair and cashmere growth in the Laiwu black goat. A total of 655 genes differentially expressed in body (hair‑growing) and groin (hairless) skin were identified, and their potential association with hair and cashmere growth was analyzed. The majority of genes associated with hair growth regulation could be assigned to intracellular, intracellular organelle, membrane‑bound vesicle, cytoplasmic vesicle, pattern binding, heparin binding, polysaccharide binding, glycosaminoglycan binding and cytoplasmic membrane‑bound vesicle categories. Numerous genes upregulated in body compared with groin skin contained common motifs for nuclear factor 1A, Yi, E2 factor (E2F) and cyclic adenosine monophosphate response element binding (CREB)/CREBβ binding sites in their promoter region. The promoter region of certain genes downregulated in body compared with groin skin contained three common regions with LF‑A1, Yi, E2F, Collier/Olfactory‑1/early B‑cell factor 1, peroxisome proliferator‑activated receptor α or U sites. Thus, the present study identified molecules in the cashmere‑bearing skin area of the Laiwu black goat, which may contribute to hair and cashmere traits.
Higuera-Matas, A; Montoya, G. L; Coria, S.M; Miguéns, M; García-Lecumberri, C; Ambrosio, E
2011-01-01
Drug addiction results from the interplay between social and biological factors. Among these, genetic variables play a major role. The use of genetically related inbred rat strains that differ in their preference for drugs of abuse is one approach of great importance to explore genetic determinants. Lewis and Fischer 344 rats have been extensively studied and it has been shown that the Lewis strain is especially vulnerable to the addictive properties of several drugs when compared with the Fischer 344 strain. Here, we have used microarrays to analyze gene expression profiles in the frontal cortex and nucleus accumbens of Lewis and Fischer 344 rats. Our results show that only a very limited group of genes were differentially expressed in Lewis rats when compared with the Fischer 344 strain. The genes that were induced in the Lewis strain were related to oxygen transport, neurotransmitter processing and fatty acid metabolism. In contrast, genes that were repressed in Lewis rats were involved in physiological functions such as drug and proton transport, oligodendrocyte survival and lipid catabolism. These data might be useful for the identification of genes which could be potential markers of the vulnerability to the addictive properties of drugs of abuse. PMID:21886580
Lei, Wanjun; Ni, Dapeng; Wang, Yujun; Shao, Junjie; Wang, Xincun; Yang, Dan; Wang, Jinsheng; Chen, Haimei; Liu, Chang
2016-02-22
Astragalus membranaceus is an important medicinal plant in Asia. Several of its varieties have been used interchangeably as raw materials for commercial production. High-resolution genetic markers are urgently needed to distinguish these varieties. Here, we sequenced and analyzed the chloroplast genome of A. membranaceus (Fisch.) Bunge var. mongholicus (Bunge) P.K. Hsiao using next-generation DNA sequencing technology. The genome was assembled using Abyss and then subjected to gene prediction using CPGAVAS and repeat analysis using MISA, Tandem Repeats Finder, and REPuter. Finally, the genome was subjected to phylogenetic and comparative genomic analyses. The complete genome is 123,582 bp long, containing only one copy of the inverted repeat. Gene prediction revealed 110 genes encoding 76 proteins, 30 tRNAs, and four rRNAs. Five intra-specific hypermutation loci were identified, three of which are heteroplasmic. Furthermore, three gene losses and two large inversions were identified. Comparative genomic analyses demonstrated the dynamic nature of the Papilionoideae chloroplast genomes, which showed occurrence of numerous hypermutation loci, frequent gene losses, and fragment inversions. Results obtained herein elucidate the complex evolutionary history of chloroplast genomes and have laid the foundation for the identification of genetic markers to distinguish A. membranaceus varieties.
Yu, Hong; Soler, Marçal; Mila, Isabelle; San Clemente, Hélène; Savelli, Bruno; Dunand, Christophe; Paiva, Jorge A. P.; Myburg, Alexander A.; Bouzayen, Mondher; Grima-Pettenati, Jacqueline; Cassan-Wang, Hua
2014-01-01
Auxin is a central hormone involved in a wide range of developmental processes including the specification of vascular stem cells. Auxin Response Factors (ARF) are important actors of the auxin signalling pathway, regulating the transcription of auxin-responsive genes through direct binding to their promoters. The recent availability of the Eucalyptus grandis genome sequence allowed us to examine the characteristics and evolutionary history of this gene family in a woody plant of high economic importance. With 17 members, the E. grandis ARF gene family is slightly contracted, as compared to those of most angiosperms studied hitherto, lacking traces of duplication events. In silico analysis of alternative transcripts and gene truncation suggested that these two mechanisms were preeminent in shaping the functional diversity of the ARF family in Eucalyptus. Comparative phylogenetic analyses with genomes of other taxonomic lineages revealed the presence of a new ARF clade found preferentially in woody and/or perennial plants. High-throughput expression profiling among different organs and tissues and in response to environmental cues highlighted genes expressed in vascular cambium and/or developing xylem, responding dynamically to various environmental stimuli. Finally, this study allowed identification of three ARF candidates potentially involved in the auxin-regulated transcriptional program underlying wood formation. PMID:25269088
Syed, Khajamohiddin; Shale, Karabo; Pagadala, Nataraj Sekhar; Tuszynski, Jack
2014-01-01
Genome sequencing of basidiomycetes, a group of fungi capable of degrading/mineralizing plant material, revealed the presence of numerous cytochrome P450 monooxygenases (P450s) in their genomes, with some exceptions. Considering the large repertoire of P450s found in fungi, it is difficult to identify P450s that play an important role in fungal metabolism and the adaptation of fungi to diverse ecological niches. In this study, we followed Sir Charles Darwin’s theory of natural selection to identify such P450s in model basidiomycete fungi showing a preference for different types of plant components degradation. Any P450 family comprising a large number of member P450s compared to other P450 families indicates its natural selection over other P450 families by its important role in fungal physiology. Genome-wide comparative P450 analysis in the basidiomycete species, Phanerochaete chrysosporium, Phanerochaete carnosa, Agaricus bisporus, Postia placenta, Ganoderma sp. and Serpula lacrymans, revealed enrichment of 11 P450 families (out of 68 P450 families), CYP63, CYP512, CYP5035, CYP5037, CYP5136, CYP5141, CYP5144, CYP5146, CYP5150, CYP5348 and CYP5359. Phylogenetic analysis of the P450 family showed species-specific alignment of P450s across the P450 families with the exception of P450s of Phanerochaete chrysosporium and Phanerochaete carnosa, suggesting paralogous evolution of P450s in model basidiomycetes. P450 gene-structure analysis revealed high conservation in the size of exons and the location of introns. P450s with the same gene structure were found tandemly arranged in the genomes of selected fungi. This clearly suggests that extensive gene duplications, particularly tandem gene duplications, led to the enrichment of selective P450 families in basidiomycetes. Functional analysis and gene expression profiling data suggest that members of the P450 families are catalytically versatile and possibly involved in fungal colonization of plant material. 
To our knowledge, this is the first report on the identification and comparative-evolutionary analysis of P450 families enriched in model basidiomycetes. PMID:24466198
2012-01-01
Background: Natrialba magadii is an aerobic chemoorganotrophic member of the Euryarchaeota and is a dual extremophile requiring alkaline conditions and hypersalinity for optimal growth. The genome sequence of Nab. magadii type strain ATCC 43099 was deciphered to obtain a comprehensive insight into the genetic content of this haloarchaeon and to understand the basis of some of the cellular functions necessary for its survival. Results: The genome of Nab. magadii consists of four replicons with a total sequence of 4,443,643 bp and encodes 4,212 putative proteins, some of which contain peptide repeats of various lengths. Comparative genome analyses facilitated the identification of genes encoding putative proteins involved in adaptation to hypersalinity, stress response, glycosylation, and polysaccharide biosynthesis. A proton-driven ATP synthase and a variety of putative cytochromes and other proteins supporting aerobic respiration and electron transfer were encoded by one or more of Nab. magadii replicons. The genome encodes a number of putative proteases/peptidases as well as protein secretion functions. Genes encoding putative transcriptional regulators, basal transcription factors, signal perception/transduction proteins, and chemotaxis/phototaxis proteins were abundant in the genome. Pathways for the biosynthesis of thiamine, riboflavin, heme, cobalamin, coenzyme F420 and other essential co-factors were deduced by in depth sequence analyses. However, approximately 36% of Nab. magadii protein coding genes could not be assigned a function based on Blast analysis and have been annotated as encoding hypothetical or conserved hypothetical proteins. Furthermore, despite extensive comparative genomic analyses, genes necessary for survival in alkaline conditions could not be identified in Nab. magadii. Conclusions: Based on genomic analyses, Nab. magadii is predicted to be metabolically versatile and it could use different carbon and energy sources to sustain growth. Nab. magadii has the genetic potential to adapt to its milieu by intracellular accumulation of inorganic cations and/or neutral organic compounds. The identification of Nab. magadii genes involved in coenzyme biosynthesis is a necessary step toward further reconstruction of the metabolic pathways in halophilic archaea and other extremophiles. The knowledge gained from the genome sequence of this haloalkaliphilic archaeon is highly valuable in advancing the applications of extremophiles and their enzymes. PMID:22559199
Ohshima, Chihiro; Takahashi, Hajime; Phraephaisarn, Chirapiphat; Vesaratchavest, Mongkol; Keeratipibul, Suwimon; Kuda, Takashi; Kimura, Bon
2014-01-01
Listeria monocytogenes is the causative bacterium of listeriosis, which has a higher mortality rate than that of other causes of food poisoning. Listeria spp., of which L. monocytogenes is a member, have been isolated from food and manufacturing environments. Several methods have been published for identifying Listeria spp.; however, many of the methods cannot identify newly categorized Listeria spp. Additionally, they are often not suitable for the food industry, owing to their complexity, cost, or time consumption. Recently, high-resolution melting analysis (HRMA), which exploits DNA-sequence differences, has received attention as a simple and quick genomic typing method. In the present study, a new method for the simple, rapid, and low-cost identification of Listeria spp. is presented using the genes rarA and ldh as targets for HRMA. DNA sequences of 9 Listeria species were first compared, and polymorphisms were identified for each species for primer design. Species specificity of each HRM curve pattern was estimated using type strains of all the species. Among the 9 species, 7 were identified by HRMA using the rarA gene, including 3 new species. The remaining 2 species were identified by HRMA of the ldh gene. The newly developed HRMA method was then used to assess Listeria isolates from the food industry, and the method efficiency was compared to that of identification by 16S rDNA sequence analysis. The 2 methods were in agreement for 92.6% of the samples, demonstrating the high accuracy of HRMA. The time required for identifying Listeria spp. was substantially reduced, and the process was considerably simplified, providing a useful and precise method for processing multiple samples per day. Our newly developed method for identifying Listeria spp. is highly valuable; its use is not limited to the food industry, and it can be used for isolates from the natural environment. PMID:24918440
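The agreement figure quoted for HRMA versus 16S rDNA sequencing is a simple per-sample concordance. A minimal sketch of that calculation follows; the function name and the five species calls are hypothetical illustrations, not data or code from the study.

```python
def percent_agreement(calls_a, calls_b):
    """Return the percentage of samples on which two methods give the same call."""
    if len(calls_a) != len(calls_b):
        raise ValueError("both methods must be applied to the same samples")
    if not calls_a:
        raise ValueError("no samples to compare")
    matches = sum(a == b for a, b in zip(calls_a, calls_b))
    return 100.0 * matches / len(calls_a)


# Hypothetical species calls for five isolates (illustrative only).
hrma_calls = ["L. innocua", "L. monocytogenes", "L. welshimeri",
              "L. innocua", "L. seeligeri"]
seq16s_calls = ["L. innocua", "L. monocytogenes", "L. welshimeri",
                "L. ivanovii", "L. seeligeri"]

print(f"{percent_agreement(hrma_calls, seq16s_calls):.1f}% agreement")
```

Simple percent agreement does not correct for agreement expected by chance; a chance-corrected statistic would need the full table of paired calls.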
Reid, William R; Sun, Haina; Becnel, James J; Clark, Andrew G; Scott, Jeffrey G
2018-06-21
Neonicotinoids are the largest class of insecticides and are used for control of house fly populations at animal production facilities throughout the world. There have been several reports of neonicotinoid resistance in house fly populations, but identification of the factors involved in resistance has proven challenging. The KS8S3 population of house flies is highly resistant to the neonicotinoid insecticide imidacloprid due to two factors: one on chromosome 3 and one on chromosome 4. A comparative transcriptomic approach was used, followed by validation using transgenic Drosophila melanogaster, to investigate the genes responsible for resistance in the KS8S3 strain. Overexpression of a microsomal glutathione S-transferase (Mdgst) was identified as the factor likely responsible for resistance on chromosome 3. Resistance on chromosome 4 appears to be due to an unidentified trans-regulatory gene which causes overexpression of a galactosyltransferase-like gene (Mdgt1). No single nucleotide polymorphisms were found that could be associated with imidacloprid resistance. Identification of the underlying processes that cause imidacloprid resistance is an important first step towards the development of novel and sensitive resistance monitoring techniques. It will be valuable to investigate whether overexpression of Mdgst and Mdgt1 is found in other imidacloprid-resistant populations. This article is protected by copyright. All rights reserved.
Yao, Qiu-Yang; Xia, En-Hua; Liu, Fei-Hu; Gao, Li-Zhi
2015-02-15
WRKY transcription factors (TFs), one of the ten largest TF families in higher plants, play important roles in regulating plant development and resistance. To date, little is known about the WRKY TF family in Brassica oleracea. The recently completed genome sequence of cabbage (B. oleracea var. capitata) allows us to systematically analyze WRKY genes in this species. A total of 148 WRKY genes were characterized and classified into seven subgroups that belong to three major groups. Phylogenetic and synteny analyses revealed that the repertoire of cabbage WRKY genes was derived from a common ancestor shared with Arabidopsis thaliana. The B. oleracea WRKY genes were found to be preferentially retained after the whole-genome triplication (WGT) event in its recent ancestor, suggesting that the WGT event had largely contributed to a rapid expansion of the WRKY gene family in B. oleracea. The analysis of RNA-Seq data from various tissues (i.e., roots, stems, leaves, buds, flowers and siliques) revealed that most of the identified WRKY genes were expressed in cabbage, and a large portion of them exhibited patterns of differential and tissue-specific expression, suggesting that these gene members might play essential roles in plant developmental processes. Comparative analysis of the expression level among duplicated genes showed that gene expression divergence was evident among cabbage WRKY paralogs, indicating functional divergence of these duplicated WRKY genes. Copyright © 2014 Elsevier B.V. All rights reserved.
Liu, Xiang; Li, Shangqi; Peng, Wenzhu; Feng, Shuaisheng; Feng, Jianxin; Mahboob, Shahid; Al-Ghanim, Khalid A; Xu, Peng
2016-01-01
The ATP-binding cassette (ABC) gene family is considered to be one of the largest gene families in all forms of prokaryotic and eukaryotic life. Although the ABC transporter genes have been annotated in some species, detailed information about the ABC superfamily and the evolutionary characterization of ABC genes in common carp (Cyprinus carpio) are still unclear. In this research, we identified 61 ABC transporter genes in the common carp genome. Phylogenetic analysis revealed that they could be classified into seven subfamilies, namely 11 ABCAs, six ABCBs, 19 ABCCs, eight ABCDs, two ABCEs, four ABCFs, and 11 ABCGs. Comparative analysis of the ABC genes in seven vertebrate species including common carp, showed that at least 10 common carp genes were retained from the third round of whole genome duplication, while 12 duplicated ABC genes may have come from the fourth round of whole genome duplication. Gene losses were also observed for 14 ABC genes. Expression profiles of the 61 ABC genes in six common carp tissues (brain, heart, spleen, kidney, intestine, and gill) revealed extensive functional divergence among the ABC genes. Different copies of some genes had tissue-specific expression patterns, which may indicate some gene function specialization. This study provides essential genomic resources for future studies in common carp.
Peng, Wenzhu; Feng, Shuaisheng; Feng, Jianxin; Mahboob, Shahid; Al-Ghanim, Khalid A.
2016-01-01
The ATP-binding cassette (ABC) gene family is considered to be one of the largest gene families in all forms of prokaryotic and eukaryotic life. Although the ABC transporter genes have been annotated in some species, detailed information about the ABC superfamily and the evolutionary characterization of ABC genes in common carp (Cyprinus carpio) are still unclear. In this research, we identified 61 ABC transporter genes in the common carp genome. Phylogenetic analysis revealed that they could be classified into seven subfamilies, namely 11 ABCAs, six ABCBs, 19 ABCCs, eight ABCDs, two ABCEs, four ABCFs, and 11 ABCGs. Comparative analysis of the ABC genes in seven vertebrate species including common carp, showed that at least 10 common carp genes were retained from the third round of whole genome duplication, while 12 duplicated ABC genes may have come from the fourth round of whole genome duplication. Gene losses were also observed for 14 ABC genes. Expression profiles of the 61 ABC genes in six common carp tissues (brain, heart, spleen, kidney, intestine, and gill) revealed extensive functional divergence among the ABC genes. Different copies of some genes had tissue-specific expression patterns, which may indicate some gene function specialization. This study provides essential genomic resources for future studies in common carp. PMID:27058731
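As a quick consistency check on the subfamily breakdown reported above, the seven subfamily counts can be summed and compared with the stated total of 61 genes. The dictionary below only transcribes the numbers from the abstract; the variable names are illustrative.

```python
# Subfamily sizes as reported in the abstract.
subfamily_counts = {
    "ABCA": 11,
    "ABCB": 6,
    "ABCC": 19,
    "ABCD": 8,
    "ABCE": 2,
    "ABCF": 4,
    "ABCG": 11,
}

total_genes = sum(subfamily_counts.values())
print(f"{len(subfamily_counts)} subfamilies, {total_genes} ABC genes in total")
```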
Li, Chunquan; Han, Junwei; Yao, Qianlan; Zou, Chendan; Xu, Yanjun; Zhang, Chunlong; Shang, Desi; Zhou, Lingyun; Zou, Chaoxia; Sun, Zeguo; Li, Jing; Zhang, Yunpeng; Yang, Haixiu; Gao, Xu; Li, Xia
2013-05-01
Various 'omics' technologies, including microarrays and gas chromatography mass spectrometry, can be used to identify hundreds of interesting genes, proteins and metabolites, such as differential genes, proteins and metabolites associated with diseases. Identifying metabolic pathways has become an invaluable aid to understanding the genes and metabolites associated with the conditions under study. However, the classical methods used to identify pathways fail to accurately consider the joint power of interesting genes and metabolites and the key regions impacted by them within metabolic pathways. In this study, we propose a powerful analytical method referred to as Subpathway-GM for the identification of metabolic subpathways. This provides a more accurate level of pathway analysis by integrating information from genes and metabolites, and their positions and cascade regions within the given pathway. We analyzed two colorectal cancer data sets and one metastatic prostate cancer data set and demonstrated that Subpathway-GM was able to identify disease-relevant subpathways whose corresponding entire pathways might be ignored using classical entire-pathway identification methods. Further analysis indicated that the power of a joint genes/metabolites and subpathway strategy based on their topologies may play a key role in reliably recalling disease-relevant subpathways and finding novel subpathways.
Puthiyedth, Nisha; Riveros, Carlos; Berretta, Regina; Moscato, Pablo
2016-01-01
Alzheimer's disease (AD) is the most common form of dementia in older adults; it damages the brain and results in impaired memory, thinking and behaviour. The identification of differentially expressed genes and related pathways among affected brain regions can provide more information on the mechanisms of AD. In the past decade, several studies have reported many genes that are associated with AD. This wealth of information has become difficult to follow and interpret, as most of the results are conflicting. It is therefore worth conducting an integrated study of multiple datasets, which increases the total number of samples and the statistical power to detect biomarkers. In this study, we present an integrated analysis of five different brain region datasets and introduce new genes that warrant further investigation. The aim of our study is to apply a novel combinatorial optimisation based meta-analysis approach to identify differentially expressed genes that are associated with AD across brain regions. Microarray gene expression data from 161 samples (74 non-demented controls, 87 AD) from the Entorhinal Cortex (EC), Hippocampus (HIP), Middle temporal gyrus (MTG), Posterior cingulate cortex (PC), Superior frontal gyrus (SFG) and visual cortex (VCX) brain regions were integrated and analysed using our method. The results are then compared to two popular meta-analysis methods, RankProd and GeneMeta, and to what can be obtained by analysing the individual datasets. We find genes related to AD that are consistent with existing studies, and new candidate genes not previously related to AD. Our study confirms the up-regulation of INFAR2 and PTMA along with the down-regulation of GPHN, RAB2A, PSMD14 and FGF. Novel genes PSMB2, WNK1, RPL15, SEMA4C, RWDD2A and LARGE are found to be differentially expressed across all brain regions. Further investigation of these genes may provide new insights into the development of AD. In addition, we identified the presence of 23 non-coding features, including four miRNA precursors (miR-7, miR-570, miR-1229 and miR-6821), dysregulated across the brain regions. Furthermore, we compared our results with two popular meta-analysis methods, RankProd and GeneMeta, to validate our findings and performed a sensitivity analysis by removing one dataset at a time to assess the robustness of our results. These new findings may provide new insights into the disease mechanisms and thus make a significant contribution in the near future towards understanding, prevention and cure of AD.
Hamid, Rasmieh; Tomar, Rukam S; Marashi, Hassan; Shafaroudi, Saeid Malekzadeh; Golakiya, Balaji A; Mohsenpour, Motahhareh
2018-06-20
Cytoplasmic male sterility (CMS) is a maternally inherited trait in plants, characterized by failure to produce functional pollen during anther development. Anther development is modulated through the interaction of nuclear and mitochondrial genes. In the present study, differential gene expression of floral buds at the sporogenous stage (SS) and microsporocyte stage (MS) between CGMS and its fertile maintainer line of cotton plants was studied. A total of 320 significantly differentially expressed genes were identified, including 20 down-regulated and 37 up-regulated in CGMS compared with its maintainer line at the SS stage, as well as 89 down-regulated and 4 up-regulated in CGMS compared with the fertile line at the MS stage. Comparing the two stages in the same line, there were 6 down-regulated differentially expressed genes induced only in CGMS and 9 up-regulated differentially expressed genes induced only in its maintainer. GO analysis revealed that essential genes responsible for pollen development and genes in the cytoskeleton category show differential expression between the fertile and CGMS lines. Validation studies by qRT-PCR show concordance with the RNA-seq results. A set of novel SSRs identified in this study can be used in evaluating genetic relationships among cultivars, QTL mapping, and marker-assisted breeding. We report aberrant expression of genes related to pollen exine formation and to the synthesis of pectin lyase, myosin heavy chain, tubulin, actin-beta, heat shock protein and myeloblastosis (MYB) protein as targets for CMS in cotton. The results of this study provide basic information for future screening of genes and identification of molecular portraits responsible for CMS, and help to elucidate the molecular mechanisms that lead to CMS in cotton. Copyright © 2018 Elsevier B.V. All rights reserved.
Guelke, Eileen; Bucan, Vesna; Liebsch, Christina; Lazaridis, Andrea; Radtke, Christine; Vogt, Peter M; Reimers, Kerstin
2015-04-10
For precise quantitative RT-PCR normalization, a set of valid reference genes is obligatory. Moreover, the experimental conditions have to be taken into account, as they bias the regulation of reference genes. Until now, no reference targets have been described for the axolotl (Ambystoma mexicanum). In a search of the public database SalSite for genetic information on the axolotl, we identified fourteen presumptive reference genes, eleven of which were further tested for their gene expression stability. This study characterizes the expression patterns of 11 putative endogenous control genes during axolotl limb regeneration and in an axolotl tissue panel. All 11 reference genes showed variable expression. Strikingly, ACTB was found to be the most stably expressed in all comparative tissue groups, so we consider it suitable for all kinds of axolotl tissue-type investigations. We also suggest GAPDH and RPLP0 as suitable for certain axolotl tissue analyses. For axolotl limb regeneration, a validated pair of reference genes is ODC and RPLP0. With these findings, new insights into axolotl gene expression profiling might be gained. Copyright © 2015 Elsevier B.V. All rights reserved.
Vickers, Timothy A.; Freier, Susan M.; Bui, Huynh-Hoa; Watt, Andrew; Crooke, Stanley T.
2014-01-01
A new strategy for identifying potent RNase H-dependent antisense oligonucleotides (ASOs) is presented. Our analysis of the human transcriptome revealed that a significant proportion of genes contain unique repeated sequences of 16 or more nucleotides in length. Activities of ASOs targeting these repeated sites in several representative genes were compared to those of ASOs targeting unique single sites in the same transcript. Antisense activity at repeated sites was also evaluated in a highly controlled minigene system. Targeting both native and minigene repeat sites resulted in significant increases in potency as compared to targeting of non-repeated sites. The increased potency at these sites is a result of increased frequency of ASO/RNA interactions which, in turn, increases the probability of a productive interaction between the ASO/RNA heteroduplex and human RNase H1 in the cell. These results suggest a new, highly efficient strategy for rapid identification of highly potent ASOs. PMID:25334092
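The idea of target sites that recur within one transcript can be illustrated with a short k-mer scan. This is a minimal sketch under stated assumptions, not the authors' pipeline: the function name `repeated_sites`, the default window of 16 nucleotides (taken from the "16 or more" threshold in the abstract), and the toy sequence are all illustrative.

```python
from collections import defaultdict


def repeated_sites(transcript, k=16):
    """Map each k-mer that occurs more than once to its 0-based start positions.

    Overlapping occurrences are counted as separate sites.
    """
    positions = defaultdict(list)
    for start in range(len(transcript) - k + 1):
        positions[transcript[start:start + k]].append(start)
    return {kmer: starts for kmer, starts in positions.items() if len(starts) > 1}


# Toy transcript (hypothetical): one 16-nt motif placed twice around a spacer.
motif = "ACGUACGGAUCCGAUA"
toy_transcript = "GG" + motif + "CUCUCU" + motif + "AA"

for kmer, starts in repeated_sites(toy_transcript).items():
    print(kmer, starts)
```

A real screen would also need to check that a candidate site is absent from other transcripts, which this sketch does not attempt.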
Zhao, Chanjuan; Xie, Junqi; Li, Li; Cao, Chongjiang
2017-09-20
The transcriptomes of paddy rice in response to high temperature and humidity were studied using a high-throughput RNA sequencing approach. Effects of high temperature and humidity on the sucrose and starch contents and α/β-amylase activity were also investigated. Results showed that 6876 differentially expressed genes (DEGs) were identified in paddy rice under high temperature and humidity storage. Importantly, 12 DEGs that were downregulated fell into the "starch and sucrose pathway". The quantitative real-time polymerase chain reaction assays indicated that expression of these 12 DEGs was significantly decreased, which was in parallel with the reduced level of enzyme activities and the contents of sucrose and starch in paddy rice stored at high temperature and humidity conditions compared to the control group. Taken together, high temperature and humidity influence the quality of paddy rice at least partially by downregulating the expression of genes encoding sucrose transferases and hydrolases, which might result in the decrease of starch and sucrose contents.
Harnessing Whole Genome Sequencing in Medical Mycology.
Cuomo, Christina A
2017-01-01
Comparative genome sequencing studies of human fungal pathogens enable identification of genes and variants associated with virulence and drug resistance. This review describes current approaches, resources, and advances in applying whole genome sequencing to study clinically important fungal pathogens. Genomes for some important fungal pathogens were only recently assembled, revealing gene family expansions in many species and extreme gene loss in one obligate species. The scale and scope of species sequenced is rapidly expanding, leveraging technological advances to assemble and annotate genomes with higher precision. By using iteratively improved reference assemblies or those generated de novo for new species, recent studies have compared the sequence of isolates representing populations or clinical cohorts. Whole genome approaches provide the resolution necessary for comparison of closely related isolates, for example, in the analysis of outbreaks or sampled across time within a single host. Genomic analysis of fungal pathogens has enabled both basic research and diagnostic studies. The increased scale of sequencing can be applied across populations, and new metagenomic methods allow direct analysis of complex samples.
Origins of De Novo Genes in Human and Chimpanzee.
Ruiz-Orera, Jorge; Hernandez-Rodriguez, Jessica; Chiva, Cristina; Sabidó, Eduard; Kondova, Ivanela; Bontrop, Ronald; Marqués-Bonet, Tomàs; Albà, M Mar
2015-12-01
The birth of new genes is an important motor of evolutionary innovation. Whereas many new genes arise by gene duplication, others originate at genomic regions that did not contain any genes or gene copies. Some of these newly expressed genes may acquire coding or non-coding functions and be preserved by natural selection. However, the prevalence and underlying mechanisms of de novo gene emergence remain unclear. In order to obtain a comprehensive view of this process, we have performed in-depth sequencing of the transcriptomes of four mammalian species (human, chimpanzee, macaque, and mouse) and subsequently compared the assembled transcripts and the corresponding syntenic genomic regions. This has resulted in the identification of over five thousand new multiexonic transcriptional events in human and/or chimpanzee that are not observed in the other species. Using comparative genomics, we show that the expression of these transcripts is associated with the gain of regulatory motifs upstream of the transcription start site (TSS) and of U1 snRNP sites downstream of the TSS. In general, these transcripts show little evidence of purifying selection, suggesting that many of them are not functional. However, we find signatures of selection in a subset of de novo genes which have evidence of protein translation. Taken together, the data support a model in which frequently-occurring new transcriptional events in the genome provide the raw material for the evolution of new proteins.
Origins of De Novo Genes in Human and Chimpanzee
Ruiz-Orera, Jorge; Hernandez-Rodriguez, Jessica; Chiva, Cristina; Sabidó, Eduard; Kondova, Ivanela; Bontrop, Ronald; Marqués-Bonet, Tomàs; Albà, M.Mar
2015-01-01
The birth of new genes is an important motor of evolutionary innovation. Whereas many new genes arise by gene duplication, others originate at genomic regions that did not contain any genes or gene copies. Some of these newly expressed genes may acquire coding or non-coding functions and be preserved by natural selection. However, the prevalence and underlying mechanisms of de novo gene emergence remain unclear. In order to obtain a comprehensive view of this process, we have performed in-depth sequencing of the transcriptomes of four mammalian species (human, chimpanzee, macaque, and mouse) and subsequently compared the assembled transcripts and the corresponding syntenic genomic regions. This has resulted in the identification of over five thousand new multiexonic transcriptional events in human and/or chimpanzee that are not observed in the other species. Using comparative genomics, we show that the expression of these transcripts is associated with the gain of regulatory motifs upstream of the transcription start site (TSS) and of U1 snRNP sites downstream of the TSS. In general, these transcripts show little evidence of purifying selection, suggesting that many of them are not functional. However, we find signatures of selection in a subset of de novo genes which have evidence of protein translation. Taken together, the data support a model in which frequently-occurring new transcriptional events in the genome provide the raw material for the evolution of new proteins. PMID:26720152
Pesik, V Yu; Fedunin, A A; Agdzhoyan, A T; Utevska, O M; Chukhraeva, M I; Evseeva, I V; Churnosov, M I; Lependina, I N; Bogunov, Yu V; Bogunova, A A; Ignashkin, M A; Yankovsky, N K; Balanovska, E V; Orekhov, V A; Balanovsky, O P
2014-06-01
We conducted the first genetic analysis of a wide range of rural Russian populations in European Russia with a panel of DNA markers commonly used in forensic genetic identification. We examined a total of 647 samples from indigenous ethnic Russian populations in the Arkhangelsk, Belgorod, Voronezh, Kursk, Rostov, Ryazan, and Orel regions. We employed a multiplex genotyping kit, COrDIS Plus, to genotype Short Tandem Repeat (STR) loci, which included the genetic marker panel officially recommended for DNA identification in the Russian Federation, the United States, and the European Union. In the course of our study, we created a database of allelic frequencies, examined the distribution of alleles and genotypes in seven rural Russian populations, and defined the genetic relationships between these populations. We found that, although multidimensional analysis indicated a difference between the Northern gene pool and the rest of the Russian European populations, a pairwise comparison using 19 STR markers among all populations did not reveal significant differences. This is in concordance with previous studies, which examined up to 12 STR markers in urban Russian populations. Therefore, the database of allelic frequencies created in this study can be applied for forensic examinations and DNA identification among the ethnic Russian population across European Russia. We also noted a decrease in the levels of heterozygosity in the northern Russian population compared to ethnic populations in southern and central Russia, which is consistent with trends identified previously using classical gene markers and analysis of mitochondrial DNA.
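The heterozygosity comparison above rests on allele frequencies per STR locus. A minimal sketch of the textbook expected-heterozygosity formula, H = 1 - sum(p_i^2), is given below. Whether the study used this estimator, or a sample-size-corrected variant, is an assumption here, and the allele frequencies are hypothetical.

```python
def expected_heterozygosity(allele_freqs):
    """Expected heterozygosity at one locus: 1 minus the sum of squared allele frequencies."""
    total = sum(allele_freqs)
    if abs(total - 1.0) > 1e-6:
        raise ValueError(f"allele frequencies must sum to 1, got {total}")
    return 1.0 - sum(p * p for p in allele_freqs)


# Hypothetical allele frequencies at a single STR locus in two populations.
population_a = [0.5, 0.3, 0.2]   # fewer, more uneven alleles
population_b = [0.25, 0.25, 0.25, 0.25]  # more, evenly spread alleles

print(f"population A: H = {expected_heterozygosity(population_a):.3f}")
print(f"population B: H = {expected_heterozygosity(population_b):.3f}")
```

Averaging this value over all genotyped loci gives a per-population figure that can be compared between regions.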
Kim, Yeon-Hee; Lee, Si Young
2015-02-01
Mitis-salivarius (MS) agar has been used widely in microbial epidemiological studies because oral viridans streptococci can be selectively grown on this medium. Although previous findings reported the limited selective power of MS agar for streptococcal strains, the identities of non-streptococcal strains from human oral samples that can grow on this medium are not yet clear. In this study, we identified non-streptococcal organisms grown on MS agar plates by polymerase chain reaction (PCR) amplification and sequencing of the 16S ribosomal RNA (rRNA) gene. Eighty bacterial colonies on MS plates were isolated from plaque samples, and bacterial identification was achieved with the rapid ID 32 Strep system and mini API reader. The bacterial colonies identified as non-streptococci by the API system were selected for further identification. The 16S rRNA gene was amplified by PCR and verified using DNA sequencing analysis for identification. Sequences were compared with those of reference organisms in the genome database of the National Center for Biotechnology Information using the Basic Local Alignment Search Tool (BLAST). Among the 11 isolated non-streptococcal strains on MS plates, 3 strains were identified as Actinomyces naeslundii, 7 strains were identified as Actinomyces oris and 1 strain was identified as Actinomyces sp. using BLASTN. In this study, we showed that some oral Actinomyces species can grow on Streptococcus-selective MS agar plates. Copyright © 2014 Elsevier Ltd. All rights reserved.
Characteristics of invasive Acinetobacter species isolates recovered in a pediatric academic center.
Jain, Avish L; Harding, Christian M; Assani, Kaivon; Shrestha, Chandra L; Haga, Mercedees; Leber, Amy; Munson, Robert S; Kopp, Benjamin T
2016-07-22
Acinetobacter species are associated with increasing mortality due to emerging drug resistance. Pediatric Acinetobacter infections are largely undefined in developed countries, and clinical laboratory identification methods do not reliably differentiate between members of the Acinetobacter calcoaceticus-baumannii complex, leading to improper identification. Therefore we aimed to determine risk factors for invasive Acinetobacter infections within an academic, pediatric setting and to define the microbiologic characteristics of predominant strains. Twenty-four invasive Acinetobacter isolates were collected from 2009 to 2013. Comparative sequence analysis of the rpoB gene was performed, coupled with phenotypic characterization of antibiotic resistance, motility, biofilm production and clinical correlation. Affected patients had a median age of 3.5 years, and 71% had a central catheter infection source. rpoB gene sequencing revealed a predominance of A. pittii (45.8%) and A. baumannii (33.3%) strains. There was increasing incidence of A. pittii over the study. Two fatalities occurred in the A. pittii group. Seventeen percent of isolates were multi-drug resistant. A. pittii and A. baumannii strains were similar in motility, but A. pittii strains had significantly more biofilm production (P value = 0.018). A. pittii was the most isolated species, highlighting the need for proper species identification. The isolated strains had limited acute mortality in children, but the occurrence of more multi-drug resistant strains in the future is a distinct possibility, justifying continued research and accurate species identification.
Furlaneto-Maia, Luciana; Rocha, Kátia Real; Siqueira, Vera Lúcia Dias; Furlaneto, Márcia Cristina
2014-01-01
Enterococci are increasingly responsible for nosocomial infections worldwide. This study was undertaken to compare the identification and susceptibility profile of Enterococcus spp. using an automated MicroScan system, a PCR-based assay and a disk diffusion assay. We evaluated 30 clinical isolates of Enterococcus spp. Isolates were identified by the MicroScan system and the PCR-based assay. The presence of antibiotic resistance genes (vancomycin, gentamicin, tetracycline and erythromycin) was also determined by PCR. Antimicrobial susceptibilities to vancomycin (30 µg), gentamicin (120 µg), tetracycline (30 µg) and erythromycin (15 µg) were tested by the automated system and disk diffusion method, and were interpreted according to the criteria recommended in CLSI guidelines. Concerning Enterococcus identification, the general agreement between data obtained by the PCR method and by the automatic system was 90.0% (27/30). For all isolates of E. faecium and E. faecalis we observed 100% agreement. Resistance frequencies were higher in E. faecium than E. faecalis. Resistance rates were highest for erythromycin (86.7%), followed by vancomycin (80.0%), tetracycline (43.3%) and gentamicin (33.3%). The correlation between disk diffusion and automation revealed an agreement for the majority of the antibiotics with category agreement rates of > 80%. In the PCR-based assay, the van(A) gene was detected in 100% of vancomycin resistant enterococci. This assay is simple to conduct and reliable in the identification of clinically relevant enterococci. The data obtained reinforced the need for an improvement of the automated system to identify some enterococci. PMID:24626409
Adderson, Elisabeth E.; Boudreaux, Jan W.; Cummings, Jessica R.; Pounds, Stanley; Wilson, Deborah A.; Procop, Gary W.; Hayden, Randall T.
2008-01-01
We compared the relative levels of effectiveness of three commercial identification kits and three nucleic acid amplification tests for the identification of coryneform bacteria by testing 50 diverse isolates, including 12 well-characterized control strains and 38 organisms obtained from pediatric oncology patients at our institution. Between 33.3 and 75.0% of control strains were correctly identified to the species level by phenotypic systems or nucleic acid amplification assays. The most sensitive tests were the API Coryne system and amplification and sequencing of the 16S rRNA gene using primers optimized for coryneform bacteria, which correctly identified 9 of 12 control isolates to the species level, and all strains with a high-confidence call were correctly identified. Organisms not correctly identified were species not included in the test kit databases or not producing a pattern of reactions included in kit databases or which could not be differentiated among several genospecies based on reaction patterns. Nucleic acid amplification assays had limited abilities to identify some bacteria to the species level, and comparison of sequence homologies was complicated by the inclusion of allele sequences obtained from uncultivated and uncharacterized strains in databases. The utility of rpoB genotyping was limited by the small number of representative gene sequences that are currently available for comparison. The correlation between identifications produced by different classification systems was poor, particularly for clinical isolates. PMID:18160450
Mouse forward genetics in the study of the peripheral nervous system and human peripheral neuropathy
Douglas, Darlene S.; Popko, Brian
2009-01-01
Forward genetics, the phenotype-driven approach to investigating gene identity and function, has a long history in mouse genetics. Random mutations in the mouse transcend bias about gene function and provide avenues towards unique discoveries. The study of the peripheral nervous system is no exception; from historical strains such as the trembler mouse, which led to the identification of PMP22 as a human disease gene causing multiple forms of peripheral neuropathy, to the more recent identification of the claw paw and sprawling mutations, forward genetics has long been a tool for probing the physiology, pathogenesis, and genetics of the PNS. Even as spontaneous and mutagenized mice continue to enable the identification of novel genes, provide allelic series for detailed functional studies, and generate models useful for clinical research, new methods, such as the piggyBac transposon, are being developed to further harness the power of forward genetics. PMID:18481175
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ranjan, Priya; Yin, Tongming; Zhang, Xinye
2009-11-01
Quantitative trait locus (QTL) studies are an integral part of plant research and are used to characterize the genetic basis of phenotypic variation observed in structured populations and inform marker-assisted breeding efforts. These QTL intervals can span large physical regions on a chromosome comprising hundreds of genes, thereby hampering candidate gene identification. Genome history, evolution, and expression evidence can be used to narrow the genes in the interval to a smaller list that is manageable for detailed downstream functional genomics characterization. Our primary motivation for the present study was to address the need for a research methodology that identifies candidate genes within a broad QTL interval. Here we present a bioinformatics-based approach for subdividing candidate genes within QTL intervals into alternate groups of high probability candidates. Application of this approach in the context of studying cell wall traits, specifically lignin content and S/G ratios of stem and root in Populus plants, resulted in manageable sets of genes of both known and putative cell wall biosynthetic function. These results provide a roadmap for future experimental work leading to identification of new genes controlling cell wall recalcitrance and, ultimately, in the utility of plant biomass as an energy feedstock.
Lee, Wonmok; Kim, Myungsook; Yong, Dongeun; Jeong, Seok Hoon; Lee, Kyungwon; Chong, Yunsop
2015-01-01
By conventional methods, the identification of anaerobic bacteria is more time consuming and requires more expertise than the identification of aerobic bacteria. Although the matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOF MS) systems are relatively less studied, they have been reported to be a promising method for the identification of anaerobes. We evaluated the performance of the VITEK MS in vitro diagnostic (IVD; 1.1 database; bioMérieux, France) in the identification of anaerobes. We used 274 anaerobic bacteria isolated from various clinical specimens. The results for the identification of the bacteria by VITEK MS were compared to those obtained by phenotypic methods and 16S rRNA gene sequencing. Among the 249 isolates included in the IVD database, the VITEK MS correctly identified 209 (83.9%) isolates to the species level and an additional 18 (7.2%) at the genus level. In particular, the VITEK MS correctly identified clinically relevant and frequently isolated anaerobic bacteria to the species level. The remaining 22 isolates (8.8%) were either not identified or misidentified. The VITEK MS could not identify the 25 isolates absent from the IVD database to the species level. The VITEK MS showed reliable identifications for clinically relevant anaerobic bacteria.
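The percentages in the abstract above follow directly from the reported isolate counts. A minimal arithmetic sketch in Python, using only the counts stated in the abstract (the variable and key names are illustrative, not from the study):

```python
# Counts as reported in the abstract; names are illustrative only.
total_isolates = 274      # all anaerobic isolates tested
in_database = 249         # isolates whose species are in the IVD database

outcomes = {
    "species_level": 209,                 # correctly identified to species
    "genus_level_only": 18,               # correctly identified to genus only
    "unidentified_or_misidentified": 22,  # no identification or wrong one
}

# The three outcome groups should account for every in-database isolate.
assert sum(outcomes.values()) == in_database

# Percentages relative to the 249 in-database isolates, one decimal place.
rates = {name: round(100 * n / in_database, 1) for name, n in outcomes.items()}

# Isolates absent from the database (reported as 25 in the abstract).
not_in_database = total_isolates - in_database

print(rates)
print(not_in_database)
```

Note that the denominators are the 249 in-database isolates, not all 274, so the 25 isolates outside the database are accounted for separately.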
2018-01-01
Effect-directed analysis (EDA) is a commonly used approach for effect-based identification of endocrine disruptive chemicals in complex (environmental) mixtures. However, for routine toxicity assessment of, for example, water samples, current EDA approaches are considered time-consuming and laborious. We achieved faster EDA and identification by downscaling of sensitive cell-based hormone reporter gene assays and increasing fractionation resolution to allow testing of smaller fractions with reduced complexity. The high-resolution EDA approach is demonstrated by analysis of four environmental passive sampler extracts. Downscaling of the assays to a 384-well format allowed analysis of 64 fractions in triplicate (or 192 fractions without technical replicates) without affecting sensitivity compared to the standard 96-well format. Through a parallel exposure method, agonistic and antagonistic androgen and estrogen receptor activity could be measured in a single experiment following a single fractionation. From 16 selected candidate compounds, identified through nontargeted analysis, 13 could be confirmed chemically and 10 were found to be biologically active, of which the most potent nonsteroidal estrogens were identified as oxybenzone and piperine. The increased fractionation resolution and the higher throughput that downscaling provides allow for future application in routine high-resolution screening of large numbers of samples in order to accelerate identification of (emerging) endocrine disruptors. PMID:29547277
Tichy, Diana; Pickl, Julia Maria Anna; Benner, Axel; Sültmann, Holger
2017-03-31
The identification of microRNA (miRNA) target genes is crucial for understanding miRNA function. Many methods for genome-wide miRNA target identification have been developed in recent years; however, they have several limitations, including the dependence on low-confidence prediction programs and artificial miRNA manipulations. Ago-RNA immunoprecipitation combined with high-throughput sequencing (Ago-RIP-Seq) is a promising alternative. However, appropriate statistical data analysis algorithms taking into account the experimental design and the inherent noise of such experiments are largely lacking. Here, we investigate the experimental design for Ago-RIP-Seq and examine biostatistical methods to identify de novo miRNA target genes. Statistical approaches considered are either based on a negative binomial model fit to the read count data or applied to transformed data using a normal distribution-based generalized linear model. We compare them by a real data simulation study using plasmode data sets and evaluate the suitability of the approaches to detect true miRNA targets by sensitivity and false discovery rates. Our results suggest that simple approaches like linear regression models on (appropriately) transformed read count data are preferable. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Comparative genome analysis of entomopathogenic fungi reveals a complex set of secreted proteins.
Staats, Charley Christian; Junges, Angela; Guedes, Rafael Lucas Muniz; Thompson, Claudia Elizabeth; de Morais, Guilherme Loss; Boldo, Juliano Tomazzoni; de Almeida, Luiz Gonzaga Paula; Andreis, Fábio Carrer; Gerber, Alexandra Lehmkuhl; Sbaraini, Nicolau; da Paixão, Rana Louise de Andrade; Broetto, Leonardo; Landell, Melissa; Santi, Lucélia; Beys-da-Silva, Walter Orlando; Silveira, Carolina Pereira; Serrano, Thaiane Rispoli; de Oliveira, Eder Silva; Kmetzsch, Lívia; Vainstein, Marilene Henning; de Vasconcelos, Ana Tereza Ribeiro; Schrank, Augusto
2014-09-29
Metarhizium anisopliae is an entomopathogenic fungus used in the biological control of some agricultural insect pests, and efforts are underway to use this fungus in the control of insect-borne human diseases. A large repertoire of proteins must be secreted by M. anisopliae to cope with the various available nutrients as this fungus switches through different lifestyles, i.e., from a saprophytic, to an infectious, to a plant endophytic stage. To further evaluate the predicted secretome of M. anisopliae, we employed genomic and transcriptomic analyses, coupled with phylogenomic analysis, focusing on the identification and characterization of secreted proteins. We determined the M. anisopliae E6 genome sequence and compared this sequence to other entomopathogenic fungi genomes. A robust pipeline was generated to evaluate the predicted secretomes of M. anisopliae and 15 other filamentous fungi, leading to the identification of a core of secreted proteins. Transcriptomic analysis using the tick Rhipicephalus microplus cuticle as an infection model during two periods of infection (48 and 144 h) allowed the identification of several differentially expressed genes. This analysis concluded that a large proportion of the predicted secretome coding genes contained altered transcript levels in the conditions analyzed in this study. In addition, some specific secreted proteins from Metarhizium have an evolutionary history similar to orthologs found in Beauveria/Cordyceps. This similarity suggests that a set of secreted proteins has evolved to participate in entomopathogenicity. The data presented represents an important step to the characterization of the role of secreted proteins in the virulence and pathogenicity of M. anisopliae.
1999-09-01
I.. Zbar. B.. androle for the VHL gene in the development of hyperplasia in a number Lerman. I. I. Identification of the son Hippel-Lindau disease...of heterozy- gosity of chromosome 3p markers in small-cell lung cancer. Nature (Lond.). 329: eleguns produced hyperplasia in all tissues (26...central fibrovascular core lined by cuboidal tumor cells. Tumor weights were determined (Fig. 2d). At the end of 47 days after cells were
Ciok, Anna; Adamczuk, Marcin; Bartosik, Dariusz; Dziewit, Lukasz
2016-11-28
Pseudomonas strains isolated from the heavily contaminated Lubin copper mine and Zelazny Most post-flotation waste reservoir in Poland were screened for the presence of integrons. This analysis revealed that two strains carried homologous DNA regions composed of a gene encoding a DNA_BRE_C domain-containing tyrosine recombinase (with no significant sequence similarity to other integrases of integrons) plus a three-component array of putative integron gene cassettes. The predicted gene cassettes encode three putative polypeptides with homology to (i) transmembrane proteins, (ii) GCN5 family acetyltransferases, and (iii) hypothetical proteins of unknown function (homologous proteins are encoded by the gene cassettes of several class 1 integrons). Comparative sequence analyses identified three structural variants of these novel integron-like elements within the sequenced bacterial genomes. Analysis of their distribution revealed that they are found exclusively in strains of the genus Pseudomonas.
Hu, Jianhua; Wright, Fred A
2007-03-01
The identification of the genes that are differentially expressed in two-sample microarray experiments remains a difficult problem when the number of arrays is very small. We discuss the implications of using ordinary t-statistics and examine other commonly used variants. For oligonucleotide arrays with multiple probes per gene, we introduce a simple model relating the mean and variance of expression, possibly with gene-specific random effects. Parameter estimates from the model have natural shrinkage properties that guard against inappropriately small variance estimates, and the model is used to obtain a differential expression statistic. A limiting value to the positive false discovery rate (pFDR) for ordinary t-tests provides motivation for our use of the data structure to improve variance estimates. Our approach performs well compared to other proposed approaches in terms of the false discovery rate.
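To illustrate why shrinkage guards against inappropriately small variance estimates, here is a minimal Python sketch of a shrunken-variance two-sample t-statistic. This is not the authors' mean-variance model, which the abstract does not specify; it is a generic stand-in that pulls each gene's pooled variance toward an assumed prior variance. The function name, `prior_var`, `prior_df`, and the expression values are all hypothetical.

```python
from math import sqrt
from statistics import mean, variance


def shrunken_t(group_a, group_b, prior_var, prior_df):
    """Two-sample t-statistic with the pooled variance shrunk toward prior_var.

    Generic illustration only (assumed form, not the model in the abstract):
    shrunk_var = (prior_df * prior_var + df * pooled_var) / (prior_df + df).
    With prior_df = 0 this reduces to the ordinary pooled-variance t-statistic.
    """
    n_a, n_b = len(group_a), len(group_b)
    df = n_a + n_b - 2
    pooled_var = ((n_a - 1) * variance(group_a)
                  + (n_b - 1) * variance(group_b)) / df
    shrunk_var = (prior_df * prior_var + df * pooled_var) / (prior_df + df)
    std_err = sqrt(shrunk_var * (1 / n_a + 1 / n_b))
    return (mean(group_a) - mean(group_b)) / std_err


# Hypothetical log-expression values for one gene, two arrays per condition.
# The replicates happen to agree closely, so the raw variance is tiny.
a = [5.0, 5.1]
b = [4.0, 4.1]

ordinary = shrunken_t(a, b, prior_var=0.1, prior_df=0)   # no shrinkage
moderated = shrunken_t(a, b, prior_var=0.1, prior_df=4)  # shrunk toward 0.1

print(ordinary, moderated)
```

With so few arrays, the ordinary statistic is inflated by the chance-small variance, while the shrunken version is considerably more conservative.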
Diverse types of genetic variation converge on functional gene networks involved in schizophrenia.
Gilman, Sarah R; Chang, Jonathan; Xu, Bin; Bawa, Tejdeep S; Gogos, Joseph A; Karayiorgou, Maria; Vitkup, Dennis
2012-12-01
Despite the successful identification of several relevant genomic loci, the underlying molecular mechanisms of schizophrenia remain largely unclear. We developed a computational approach (NETBAG+) that allows an integrated analysis of diverse disease-related genetic data using a unified statistical framework. The application of this approach to schizophrenia-associated genetic variations, obtained using unbiased whole-genome methods, allowed us to identify several cohesive gene networks related to axon guidance, neuronal cell mobility, synaptic function and chromosomal remodeling. The genes forming the networks are highly expressed in the brain, with higher brain expression during prenatal development. The identified networks are functionally related to genes previously implicated in schizophrenia, autism and intellectual disability. A comparative analysis of copy number variants associated with autism and schizophrenia suggests that although the molecular networks implicated in these distinct disorders may be related, the mutations associated with each disease are likely to lead, at least on average, to different functional consequences.
Identification of embryonic pancreatic genes using Xenopus DNA microarrays.
Hayata, Tadayoshi; Blitz, Ira L; Iwata, Nahoko; Cho, Ken W Y
2009-06-01
The pancreas is both an exocrine and endocrine endodermal organ involved in digestion and glucose homeostasis. During embryogenesis, the anlagen of the pancreas arise from dorsal and ventral evaginations of the foregut that later fuse to form a single organ. To better understand the molecular genetics of early pancreas development, we sought to isolate markers that are uniquely expressed in this tissue. Microarray analysis was performed comparing dissected pancreatic buds, liver buds, and the stomach region of tadpole stage Xenopus embryos. A total of 912 genes were found to be differentially expressed between these organs during early stages of organogenesis. K-means clustering analysis predicted 120 of these genes to be specifically enriched in the pancreas. Of these, we report on the novel expression patterns of 24 genes. Our analyses implicate the involvement of previously unsuspected signaling pathways during early pancreas development. Developmental Dynamics 238:1455-1466, 2009. (c) 2009 Wiley-Liss, Inc.
Gene expression profile of human Down syndrome leukocytes.
Malagó, Wilson; Sommer, César A; Del Cistia Andrade, Camillo; Soares-Costa, Andrea; Abrao Possik, Patricia; Cassago, Alexandre; Santejo Silveira, Henrique C; Henrique-Silva, Flavio
2005-08-01
Identification of differences in the gene expression patterns of Down syndrome and normal leukocytes. We constructed the first Down syndrome leukocyte serial analysis of gene expression (SAGE) library from a 28-year-old patient. This library was analyzed and compared with a normal leukocyte SAGE library using the eSAGE software. Reverse transcriptase polymerase chain reaction (RT-PCR) was used to validate the results. We found that a large number of unidentified transcripts were overexpressed in Down syndrome leukocytes and some transcripts coding for growth factors (e.g., interleukin 8, IL-8), ribosomal proteins (e.g., L13a, L29, and L37), and transcription factors (e.g., Jun B, Jun D, and C/EBP beta) were underexpressed. The SAGE data were successfully validated for the genes IL-8, CXCR4, BCL2A1, L13a, L29, L37, and GTF3A using RT-PCR. Our analysis identified significant changes in the expression pattern of Down syndrome leukocytes compared with normal ones, including key regulators of growth and proliferation, ribosomal proteins, and a large number of overexpressed transcripts that were not matched in UniGene clusters and that may represent novel genes related to Down syndrome. This study offers a new insight into transcriptional changes in Down syndrome leukocytes and indicates candidate genes for further investigations into the molecular mechanism of Down syndrome pathology.
Zhu, X L; Yang, F; Li, H X; Dou, Y X; Meng, X L; Li, H; Luo, X N; Cai, X P
2013-05-14
An outbreak of sheep pox was investigated in the Ningxia Hui Autonomous Region in China. Through immunofluorescence testing, virus isolation, polymerase chain reaction identification, and electron microscopic examination, the isolated strain was identified as a sheep pox virus. The virus was further characterized through sequence and phylogenetic analysis of the P32 gene, open reading frame (ORF) 095, and ORF 103 genes. This study is the first to use the ORF 095 and ORF 103 genes as candidate genes for the analysis of sheep pox. The results showed that the ORF 095 and ORF 103 genes could be used for the genotyping of the sheep pox virus.
Identification of Bacterial Species in Kuwaiti Waters Through DNA Sequencing
NASA Astrophysics Data System (ADS)
Chen, K.
2017-01-01
With the objective of identifying the bacterial diversity associated with the ecosystems of various Kuwaiti seas, bacteria were cultured and isolated from 3 water samples. Because of difficulties in culturing and isolating fecal coliforms on the selective agar plates, bacterial isolates from marine agar plates were selected for molecular identification. 16S rRNA genes were successfully amplified from the genomes of the selected isolates using universal eubacterial 16S rRNA primers. The resulting amplification products were subjected to automated DNA sequencing. The partial 16S rDNA sequences obtained were compared directly with sequences in the NCBI database using BLAST, as well as with the sequences available from the Ribosomal Database Project (RDP).
Thimgan, Matthew S.; Seugnet, Laurent; Turk, John; Shaw, Paul J.
2015-01-01
Background and Study Objectives: Flies mutant for the canonical clock protein cycle (cyc01) exhibit a sleep rebound that is ∼10 times larger than wild-type flies and die after only 10 h of sleep deprivation. Surprisingly, when starved, cyc01 mutants can remain awake for 28 h without demonstrating negative outcomes. Thus, we hypothesized that identifying transcripts that are differentially regulated between waking induced by sleep deprivation and waking induced by starvation would identify genes that underlie the deleterious effects of sleep deprivation and/or protect flies from the negative consequences of waking. Design: We used partial complementary DNA microarrays to identify transcripts that are differentially expressed between cyc01 mutants that had been sleep deprived or starved for 7 h. We then used genetics to determine whether disrupting genes involved in lipid metabolism would exhibit alterations in their response to sleep deprivation. Setting: Laboratory. Patients or Participants: Drosophila melanogaster. Interventions: Sleep deprivation and starvation. Measurements and Results: We identified 84 genes with transcript levels that were differentially modulated by 7 h of sleep deprivation and starvation in cyc01 mutants and were confirmed in independent samples using quantitative polymerase chain reaction. Several of these genes were predicted to be lipid metabolism genes, including bubblegum, cueball, and CG4500, which based on our data we have renamed heimdall (hll). Using lipidomics we confirmed that knockdown of hll using RNA interference significantly decreased lipid stores. Importantly, genetically modifying bubblegum, cueball, or hll resulted in sleep rebound alterations following sleep deprivation compared to genetic background controls. Conclusions: We have identified a set of genes that may confer resilience/vulnerability to sleep deprivation and demonstrate that genes involved in lipid metabolism modulate sleep homeostasis. 
Citation: Thimgan MS, Seugnet L, Turk J, Shaw PJ. Identification of genes associated with resilience/vulnerability to sleep deprivation and starvation in Drosophila. SLEEP 2015;38(5):801–814. PMID:25409104
Jia, Tianqi; Wei, Danfeng; Meng, Shan; Allan, Andrew C.; Zeng, Lihui
2014-01-01
Longan (Dimocarpus longan L.) is a tropical/subtropical fruit tree of significant economic importance in Southeast Asia. However, a lack of transcriptomic and genomic information hinders research on longan traits, such as the control of flowering. In this study, high-throughput RNA sequencing (RNA-Seq) was used to investigate differentially expressed genes between a unique longan cultivar ‘Sijimi’ (S), which flowers throughout the year, and a more typical cultivar ‘Lidongben’ (L), which flowers only once in the season, with the aim of identifying candidate genes associated with continuous flowering. 36,527 and 40,982 unigenes were obtained by de novo assembly of the clean reads from cDNA libraries of L and S cultivars. Additionally, 40,513 unigenes were assembled from combined reads of these libraries. A total of 32,475 unigenes were annotated by BLAST search to NCBI non-redundant protein (NR), Swiss-Prot, Clusters of Orthologous Groups (COGs) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Of these, almost fifteen thousand unigenes were identified as significantly differentially expressed genes (DEGs) by using the Reads Per kb per Million reads (RPKM) method. A total of 6,415 DEGs were mapped to 128 KEGG pathways, and 8,743 DEGs were assigned to 54 Gene Ontology categories. After blasting the DEGs to public sequence databases, 539 potential flowering-related DEGs were identified. In addition, 107 flowering-time genes were identified in longan; their expression levels in the two longan samples were compared by the RPKM method, and the expression levels of 15 of them were confirmed by real-time quantitative PCR. Our results suggest longan homologues of SHORT VEGETATIVE PHASE (SVP), GIGANTEA (GI), F-BOX 1 (FKF1) and EARLY FLOWERING 4 (ELF4) may be involved in this flowering trait and ELF4 may be a key gene. The identification of candidate genes related to continuous flowering will provide new insight into the molecular process of regulating flowering time in woody plants.
PMID:25479005
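The RPKM normalization mentioned in the abstract above can be written out as a short Python sketch. The formula is the standard definition (reads per kilobase of transcript per million mapped reads); the function name and the example numbers are hypothetical and are not taken from the longan study.

```python
def rpkm(read_count: int, gene_length_bp: int, total_mapped_reads: int) -> float:
    """Reads Per Kilobase of transcript per Million mapped reads.

    RPKM = read_count * 1e9 / (gene_length_bp * total_mapped_reads),
    i.e. the read count divided by length in kb and by library size in millions.
    """
    if gene_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("gene length and library size must be positive")
    return read_count * 1e9 / (gene_length_bp * total_mapped_reads)


# Hypothetical unigene: 500 reads, 2,000 bp long, in a library of 10 million
# mapped reads. 500 / 2 kb / 10 M = 25 RPKM.
example = rpkm(read_count=500, gene_length_bp=2000, total_mapped_reads=10_000_000)
print(example)
```

Normalizing by both length and library size is what makes expression values comparable between unigenes and between the two cultivar libraries.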
Goettel, Wolfgang; Ramirez, Martha; Upchurch, Robert G; An, Yong-Qiang Charles
2016-08-01
Identification and characterization of a 254-kb genomic deletion on a duplicated chromosome segment that resulted in a low level of palmitic acid in soybean seeds using transcriptome sequencing. A large number of soybean genotypes varying in seed oil composition and content have been identified. Understanding the molecular mechanisms underlying these variations is important for breeders to effectively utilize them as a genetic resource. Through design and application of a bioinformatics approach, we identified nine co-regulated gene clusters by comparing seed transcriptomes of nine soybean genotypes varying in oil composition and content. We demonstrated that four gene clusters in the genotypes M23, Jack and N0304-303-3 coincided with large-scale genome rearrangements. The co-regulated gene clusters in M23 and Jack mapped to a previously described 164-kb deletion and a copy number amplification of the Rhg1 locus, respectively. The coordinately down-regulated gene clusters in N0304-303-3 were caused by a 254-kb deletion containing 19 genes including a fatty acyl-ACP thioesterase B gene (FATB1a). This deletion was associated with reduced palmitic acid content in seeds and was the molecular cause of a previously reported nonfunctional FATB1a allele, fap nc. The M23 and N0304-303-3 deletions were located in duplicated genome segments retained from the Glycine-specific whole genome duplication that occurred 13 million years ago. The homoeologous genes in these duplicated regions shared a strong similarity in both their encoded protein sequences and transcript accumulation levels, suggesting that they may have conserved and important functions in seeds. The functional conservation of homoeologous genes may result in genetic redundancy and gene dosage effects for their associated seed traits, explaining why the large deletion did not cause lethal effects or completely eliminate palmitic acid in N0304-303-3.
Du, Y F; Ding, Q L; Li, Y M; Fang, W R
2017-04-03
In the modern chicken industry, fast-growing broilers have undergone strong artificial selection for muscle growth, which has led to remarkable phenotypic variations compared with slow-growing chickens. However, the molecular mechanism underlying these phenotypic differences remains unknown. In this study, a systematic identification of candidate genes and new pathways related to myofiber development and composition in chicken soleus muscle (SOL) has been made using gene expression profiles of two distinct breeds: Qingyuan partridge (QY), a slow-growing Chinese breed possessing high meat quality, and Cobb 500 (CB), a commercial fast-growing broiler line. Agilent cDNA microarray analyses were conducted to determine gene expression profiles of soleus muscle sampled at the sexual maturity age of QY (112 d) and CB (42 d). A total of 1318 genes with at least 2-fold differences were identified (P < 0.05, FDR < 0.05, FC ≥ 2) in SOL muscles of QY and CB chickens. Differentially expressed genes (DEGs) related to muscle development, energy metabolism or lipid metabolism processes were examined further in each breed based on Gene Ontology (GO) analysis, and 11 genes involved in these processes were selected for further validation studies by qRT-PCR. In addition, based on KEGG pathway analysis of DEGs in both QY and CB chickens, it was found that in addition to pathways affecting myogenic fibre-type development and differentiation (pathways for Hedgehog & Calcium signaling), energy metabolism pathways (Phosphatidylinositol signaling system, VEGF signaling pathway, Purine metabolism, Pyrimidine metabolism) were also enriched and might form a network with pathways related to muscle metabolism to influence the development of myofibers. This study is the first stage in the understanding of molecular mechanisms underlying variations in poultry meat quality.
Large scale analyses are now required to validate the role of the genes identified and ultimately to find molecular markers that can be used for selection or to optimize rearing practices.
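The selection thresholds quoted in the abstract above (P < 0.05, FDR < 0.05, FC ≥ 2) can be illustrated with a minimal sketch. It assumes the FDR is a Benjamini-Hochberg adjusted p-value, which the abstract does not state, and the function names, gene names and expression values are hypothetical.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value to the smallest so that the adjusted
    # values never increase as the raw p-values decrease.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def select_degs(genes, p_cut=0.05, fdr_cut=0.05, fc_cut=2.0):
    """genes: list of (name, mean_a, mean_b, pvalue) with positive means.

    Keeps genes passing all three cutoffs; fold change is taken in
    either direction (up or down).
    """
    qvalues = benjamini_hochberg([g[3] for g in genes])
    hits = []
    for (name, mean_a, mean_b, p), q in zip(genes, qvalues):
        fold = max(mean_a, mean_b) / min(mean_a, mean_b)
        if p < p_cut and q < fdr_cut and fold >= fc_cut:
            hits.append(name)
    return hits


# Hypothetical expression values, for illustration only.
example = [
    ("g1", 10.0, 40.0, 0.001),
    ("g2", 10.0, 12.0, 0.001),
    ("g3", 10.0, 50.0, 0.5),
    ("g4", 30.0, 10.0, 0.01),
]
print(select_degs(example))
```

Applying the fold-change cutoff symmetrically means both up- and down-regulated genes are retained, which matches a "at least 2-fold difference" criterion.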
USDA-ARS's Scientific Manuscript database
The comprehensive identification of genes underlying phenotypic variation of complex traits remains a major challenge. Most genome-wide screens lack sufficient resolving power as they typically depend on linkage. An alternate method is to screen for allele-specific expression (ASE), a simple yet pow...
Lee, I-M; Bottner-Parker, K D; Zhao, Y; Bertaccini, A; Davis, R E
2012-09-01
The pigeon pea witches'-broom phytoplasma group (16SrIX) comprises diverse strains that cause numerous diseases in leguminous trees and herbaceous crops, vegetables, a fruit, a nut tree and a forest tree. At least 14 strains have been reported worldwide. Comparative phylogenetic analyses of the highly conserved 16S rRNA gene and the moderately conserved rplV (rpl22)-rpsC (rps3) and secY genes indicated that the 16SrIX group consists of at least six distinct genetic lineages. Some of these lineages cannot be readily differentiated based on analysis of 16S rRNA gene sequences alone. The relative genetic distances among these closely related lineages were better assessed by including more variable genes [e.g. ribosomal protein (rp) and secY genes]. The present study demonstrated that virtual RFLP analyses using rp and secY gene sequences allowed unambiguous identification of such lineages. A coding system is proposed to designate each distinct rp and secY subgroup in the 16SrIX group.
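The idea behind virtual RFLP analysis mentioned above can be sketched as an in silico digestion: find every recognition site in a sequence, cut there, and compare the resulting fragment-length patterns between strains. The sketch below is an illustration under stated assumptions, not the authors' procedure: it handles one enzyme and a linear sequence, the function names and toy sequence are hypothetical, and EcoRI (G^AATTC) is used only as a familiar example since the abstract does not list the enzymes.

```python
def cut_positions(seq, site, offset):
    """Positions where the enzyme cuts; offset is counted from the site start."""
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start + offset)
        start = seq.find(site, start + 1)
    return positions


def fragment_lengths(seq, site, offset):
    """Fragment lengths of a linear sequence, largest first (like a gel lane)."""
    cuts = [0] + cut_positions(seq, site, offset) + [len(seq)]
    lengths = [b - a for a, b in zip(cuts, cuts[1:]) if b > a]
    return sorted(lengths, reverse=True)


# Toy sequence with two EcoRI sites (GAATTC, cut after the first G).
toy = "AAA" + "GAATTC" + "CCCC" + "GAATTC" + "TT"
print(fragment_lengths(toy, "GAATTC", 1))
```

Two strains would be assigned to different subgroups when their fragment-length lists differ for at least one enzyme in the chosen set.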
Xie, Qi; Liu, Xue; Zhang, Yinbing; Tang, Jinfu; Yin, Dedong; Fan, Bo; Zhu, Lihuang; Han, Liebao; Song, Guilong; Li, Dayong
2017-01-01
Due to its high biomass yield, low environmental impact, and widespread adaptability to poor soils and harsh conditions, switchgrass (Panicum virgatum L.), a warm-region perennial herbaceous plant, has attracted much attention in recent years. However, little is known about microRNAs (miRNAs) and their functions in this bioenergy grass. Here, we identified and characterized a miRNA gene, Pvi-MIR319a, encoding microRNA319a in switchgrass. Transgenic rice lines generated by overexpressing the Pvi-MIR319a precursor gene exhibited broader leaves and delayed flowering compared with the control. Gene expression analysis indicated that at least four putative target genes were downregulated. Additionally, we cloned a putative target gene (PvPCF5) of Pvi-MIR319a from switchgrass. PvPCF5, a TCP transcription factor, is a nuclear-localized protein with transactivation activity and controls leaf development. Our results suggest that Pvi-MIR319a and its target genes may be used as potential genetic regulators for future switchgrass genetic improvement.
matK-QR classifier: a patterns based approach for plant species identification.
More, Ravi Prabhakar; Mane, Rupali Chandrashekhar; Purohit, Hemant J
2016-01-01
DNA barcoding is a widely used and highly efficient approach that facilitates rapid and accurate identification of plant species based on a short standardized segment of the genome. The nucleotide sequences of the maturaseK (matK) and ribulose-1,5-bisphosphate carboxylase (rbcL) marker loci are commonly used in plant species identification. Here, we present a new and highly efficient approach for identifying a unique set of discriminating nucleotide patterns to generate a signature (i.e. regular expression) for plant species identification. In order to generate molecular signatures, we used matK and rbcL loci datasets, which encompass 125 plant species in 52 genera reported by the CBOL plant working group. Initially, we performed Multiple Sequence Alignment (MSA) of all species followed by Position Specific Scoring Matrix (PSSM) for both loci to achieve a percentage of discrimination among species. Further, we detected Discriminating Patterns (DP) at genus and species level using PSSM for the matK dataset. Combining DP and consecutive pattern distances, we generated molecular signatures for each species. Finally, we performed a comparative assessment of these signatures with the existing methods including BLASTn, Support Vector Machines (SVM), Jrip-RIPPER, J48 (C4.5 algorithm), and the Naïve Bayes (NB) methods against the NCBI-GenBank matK dataset. Due to the higher discrimination success obtained with matK as compared to rbcL, we selected the matK gene for signature generation. We generated signatures for 60 species based on identified discriminating patterns at genus and species level. 
Our comparative assessment results suggest that a total of 46 out of 60 species could be correctly identified using generated signatures, followed by BLASTn (34 species), SVM (18 species), C4.5 (7 species), NB (4 species) and RIPPER (3 species) methods. As a final outcome of this study, we converted signatures into QR codes and developed a software tool, matK-QR Classifier (http://www.neeri.res.in/matk_classifier/index.htm), which searches for signatures in the query matK gene sequences and predicts the corresponding plant species. This novel approach of employing pattern-based signatures opens new avenues for the classification of species. In addition to existing methods, we believe that matK-QR Classifier would be a valuable tool for molecular taxonomists, enabling precise identification of plant species.
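A signature built from discriminating patterns and the distances between consecutive patterns can be expressed as a regular expression, as the abstract above describes. The following is a minimal sketch of that idea only; it is not the matK-QR Classifier's actual implementation, and the function names, patterns, gap sizes and species label are all hypothetical.

```python
import re


def build_signature(patterns, gaps):
    """Join patterns with fixed-length nucleotide gaps into one compiled regex.

    patterns: discriminating nucleotide patterns, in sequence order
    gaps:     number of bases between each consecutive pair of patterns
    """
    if len(gaps) != len(patterns) - 1:
        raise ValueError("need exactly one gap between each pair of patterns")
    parts = [re.escape(patterns[0])]
    for gap, pattern in zip(gaps, patterns[1:]):
        parts.append(f"[ACGT]{{{gap}}}")  # exactly `gap` unconstrained bases
        parts.append(re.escape(pattern))
    return re.compile("".join(parts))


def classify(query, signatures):
    """Return every species whose signature occurs in the query sequence."""
    return [name for name, sig in signatures.items() if sig.search(query)]


# Hypothetical signature and query, for illustration only.
signature = build_signature(["ATGGC", "TTAGC"], [3])
print(signature.pattern)
print(classify("CCATGGCAAATTAGCGG", {"species_A": signature}))
```

Fixing the gap length makes the signature sensitive to indels between patterns; a tolerant variant could use a range such as `{2,4}` instead, at the cost of specificity.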
Identification of three novel NHS mutations in families with Nance-Horan syndrome.
Huang, Kristen M; Wu, Junhua; Brooks, Simon P; Hardcastle, Alison J; Lewis, Richard Alan; Stambolian, Dwight
2007-03-27
Nance-Horan Syndrome (NHS) is an infrequent and often overlooked X-linked disorder characterized by dense congenital cataracts, microphthalmia, and dental abnormalities. The syndrome is caused by mutations in the NHS gene, whose function is not known. The purpose of this study was to identify the frequency and distribution of NHS gene mutations and compare genotype with Nance-Horan phenotype in five North American NHS families. Genomic DNA was isolated from white blood cells from NHS patients and family members. The NHS gene coding region and its splice site donor and acceptor regions were amplified from genomic DNA by PCR, and the amplicons were sequenced directly. We identified three unique NHS coding region mutations in these NHS families. This report extends the number of unique identified NHS mutations to 14.
Structural and functional partitioning of bread wheat chromosome 3B.
Choulet, Frédéric; Alberti, Adriana; Theil, Sébastien; Glover, Natasha; Barbe, Valérie; Daron, Josquin; Pingault, Lise; Sourdille, Pierre; Couloux, Arnaud; Paux, Etienne; Leroy, Philippe; Mangenot, Sophie; Guilhot, Nicolas; Le Gouis, Jacques; Balfourier, Francois; Alaux, Michael; Jamilloux, Véronique; Poulain, Julie; Durand, Céline; Bellec, Arnaud; Gaspin, Christine; Safar, Jan; Dolezel, Jaroslav; Rogers, Jane; Vandepoele, Klaas; Aury, Jean-Marc; Mayer, Klaus; Berges, Hélène; Quesneville, Hadi; Wincker, Patrick; Feuillet, Catherine
2014-07-18
We produced a reference sequence of the 1-gigabase chromosome 3B of hexaploid bread wheat. By sequencing 8452 bacterial artificial chromosomes in pools, we assembled a sequence of 774 megabases carrying 5326 protein-coding genes, 1938 pseudogenes, and 85% of transposable elements. The distribution of structural and functional features along the chromosome revealed partitioning correlated with meiotic recombination. Comparative analyses indicated high wheat-specific inter- and intrachromosomal gene duplication activities that are potential sources of variability for adaption. In addition to providing a better understanding of the organization, function, and evolution of a large and polyploid genome, the availability of a high-quality sequence anchored to genetic maps will accelerate the identification of genes underlying important agronomic traits. Copyright © 2014, American Association for the Advancement of Science.
Yu, Jingyin; Tehrim, Sadia; Zhang, Fengqi; Tong, Chaobo; Huang, Junyan; Cheng, Xiaohui; Dong, Caihua; Zhou, Yanqiu; Qin, Rui; Hua, Wei; Liu, Shengyi
2014-01-03
Plant disease resistance (R) genes with the nucleotide binding site (NBS) play an important role in conferring resistance to pathogens. The availability of complete genome sequences of Brassica oleracea and Brassica rapa provides an important opportunity for researchers to identify and characterize NBS-encoding R genes in Brassica species and to compare them with analogues in Arabidopsis thaliana using a comparative genomics approach. However, little is known about the evolutionary fate of NBS-encoding genes in the Brassica lineage after the split from A. thaliana. Here we present a genome-wide analysis of NBS-encoding genes in B. oleracea, B. rapa and A. thaliana. Through the employment of HMM search and manual curation, we identified 157, 206 and 167 NBS-encoding genes in the B. oleracea, B. rapa and A. thaliana genomes, respectively. Phylogenetic analysis among the 3 species classified NBS-encoding genes into 6 subgroups. Tandem duplication and whole genome triplication (WGT) analyses revealed that after WGT of the Brassica ancestor, NBS-encoding homologous gene pairs on triplicated regions in the Brassica ancestor were deleted or lost quickly, but NBS-encoding genes in Brassica species experienced species-specific gene amplification by tandem duplication after divergence of B. rapa and B. oleracea. Expression profiling of NBS-encoding orthologous gene pairs indicated differential expression patterns of retained orthologous gene copies in B. oleracea and B. rapa. Furthermore, evolutionary analysis of CNL type NBS-encoding orthologous gene pairs among the 3 species suggested that orthologous genes in B. rapa have undergone stronger negative selection than those in B. oleracea. For the TNL type, however, there are no significant differences in the orthologous gene pairs between the two species. This study is the first identification and characterization of NBS-encoding genes in B. rapa and B. oleracea based on whole genome sequences. 
Through tandem duplication and whole genome triplication analysis in B. oleracea, B. rapa and A. thaliana genomes, our study provides insight into the evolutionary history of NBS-encoding genes after divergence of A. thaliana and the Brassica lineage. These results together with expression pattern analysis of NBS-encoding orthologous genes provide useful resource for functional characterization of these genes and genetic improvement of relevant crops.
Sykes, Timothy; Yates, Steven; Nagy, Istvan; Asp, Torben; Small, Ian
2017-01-01
Perennial ryegrass (Lolium perenne L.) is widely used for forage production in both permanent and temporary grassland systems. To increase yields in perennial ryegrass, recent breeding efforts have been focused on strategies to more efficiently exploit heterosis by hybrid breeding. Cytoplasmic male sterility (CMS) is a widely applied mechanism to control pollination for commercial hybrid seed production and although CMS systems have been identified in perennial ryegrass, they are yet to be fully characterized. Here, we present a bioinformatics pipeline for efficient identification of candidate restorer of fertility (Rf) genes for CMS. From a high-quality draft of the perennial ryegrass genome, 373 pentatricopeptide repeat (PPR) genes were identified and classified, further identifying 25 restorer of fertility-like PPR (RFL) genes through a combination of DNA sequence clustering and comparison to known Rf genes. This extensive gene family was targeted as the majority of Rf genes in higher plants are RFL genes. These RFL genes were further investigated by phylogenetic analyses, identifying three groups of perennial ryegrass RFLs. These three groups likely represent genomic regions of active RFL generation and identify the probable location of perennial ryegrass PPR-Rf genes. This pipeline allows for the identification of candidate PPR-Rf genes from genomic sequence data and can be used in any plant species. Functional markers for PPR-Rf genes will facilitate map-based cloning of Rf genes and enable the use of CMS as an efficient tool to control pollination for hybrid crop production. PMID:26951780
Behr, Jürgen; Geissler, Andreas J; Preissler, Patrick; Ehrenreich, Armin; Angelov, Angel; Vogel, Rudi F
2015-10-01
The tolerance to hop compounds, which is mainly associated with inhibition of bacterial growth in beer, is a multi-factorial trait. Any approaches to predict the physiological differences between beer-spoiling and non-spoiling strains on the basis of a single marker gene are limited. We identified ecotype-specific genes related to the ability to grow in Pilsner beer via comparative genome sequencing. The genome sequences of four different strains of Lactobacillus brevis were compared, including newly established genomes of two highly hop tolerant beer isolates, one strain isolated from faeces and one published genome of a silage isolate. Gene fragments exclusively occurring in beer-spoiling strains as well as sequences only occurring in non-spoiling strains were identified. Comparative genomic arrays were established and hybridized with a set of L. brevis strains, which are characterized by their ability to spoil beer. As a result, sets of 33 and 4 oligonucleotide probes could be established that specifically detect beer-spoilers and non-spoilers, respectively. The detection of more than one of these marker sequences according to a genetic barcode enables scoring of L. brevis strains for their beer-spoiling potential and can thus assist in risk evaluation in the brewing industry. Copyright © 2015 Elsevier Ltd. All rights reserved.
Bottari, Benedetta; Felis, Giovanna E; Salvetti, Elisa; Castioni, Anna; Campedelli, Ilenia; Torriani, Sandra; Bernini, Valentina; Gatti, Monica
2017-07-01
Lactobacillus casei, Lactobacillus paracasei and Lactobacillus rhamnosus form a closely related taxonomic group (the L. casei group) within the facultatively heterofermentative lactobacilli. Strains of these species have been used for a long time as probiotics in a wide range of products, and they represent the dominant species of nonstarter lactic acid bacteria in ripened cheeses, where they contribute to flavour development. The close genetic relationship among those species, as well as the similarity of biochemical properties of the strains, hinders the development of an adequate selective method to identify these bacteria. Despite this being a hot topic, as demonstrated by the large amount of literature about it, the results of different proposed identification methods are often ambiguous and unsatisfactory. The aim of this study was to develop a more robust species-specific identification assay for differentiating the species of the L. casei group. A taxonomy-driven comparative genomic analysis was carried out to select the potential target genes whose similarity could better reflect genome-wide diversity. The gene mutL appeared to be the most promising one and, therefore, a novel species-specific multiplex PCR assay was developed to rapidly and effectively distinguish L. casei, L. paracasei and L. rhamnosus strains. The analysis of a collection of 76 wild dairy isolates, previously identified as members of the L. casei group combining the results of multiple approaches, revealed that the novel designed primers, especially in combination with already existing ones, were able to improve the discrimination power at the species level and reveal previously undiscovered intraspecific biodiversity.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Galeazzi, Luca; Bocci, Paolo; Amici, Adolfo
2011-09-27
The pyridine nucleotide cycle (PNC) is a network of salvage and recycling routes maintaining homeostasis of the NAD(P) cofactor pool in the cell. Nicotinamide mononucleotide (NMN) deamidase (EC 3.5.1.42), one of the key enzymes of the bacterial PNC, was originally described in Enterobacteria, but the corresponding gene eluded identification for over 30 years. A genomics-based reconstruction of NAD metabolism across hundreds of bacterial species suggested that the NMN deamidase reaction is the only possible way of nicotinamide salvage in the marine bacterium Shewanella oneidensis. This prediction was verified via purification of native NMN deamidase from S. oneidensis followed by the identification of the respective gene, termed pncC. Enzymatic characterization of the PncC protein, as well as phenotype analysis of deletion mutants, confirmed its proposed biochemical and physiological function in S. oneidensis. Of the three PncC homologs present in E. coli, NMN deamidase activity was confirmed only for the recombinant purified product of the ygaD gene. A comparative analysis at the level of sequence and three-dimensional structure, which is available for one of the PncC family members, shows no homology with any previously described amidohydrolases. Multiple alignment analysis of functional and nonfunctional PncC homologs, together with NMN docking experiments, allowed us to tentatively identify the active site area and conserved residues therein. An observed broad phylogenomic distribution of predicted functional PncCs in the bacterial kingdom is consistent with a possible role in detoxification of NMN, resulting from NAD utilization by DNA ligase.
Identification of cis-suppression of human disease mutations by comparative genomics.
Jordan, Daniel M; Frangakis, Stephan G; Golzio, Christelle; Cassa, Christopher A; Kurtzberg, Joanne; Davis, Erica E; Sunyaev, Shamil R; Katsanis, Nicholas
2015-08-13
Patterns of amino acid conservation have served as a tool for understanding protein evolution. The same principles have also found broad application in human genomics, driven by the need to interpret the pathogenic potential of variants in patients. Here we performed a systematic comparative genomics analysis of human disease-causing missense variants. We found that an appreciable fraction of disease-causing alleles are fixed in the genomes of other species, suggesting a role for genomic context. We developed a model of genetic interactions that predicts most of these to be simple pairwise compensations. Functional testing of this model on two known human disease genes revealed discrete cis amino acid residues that, although benign on their own, could rescue the human mutations in vivo. This approach was also applied to ab initio gene discovery to support the identification of a de novo disease driver in BTG2 that is subject to protective cis-modification in more than 50 species. Finally, on the basis of our data and models, we developed a computational tool to predict candidate residues subject to compensation. Taken together, our data highlight the importance of cis-genomic context as a contributor to protein evolution; they provide an insight into the complexity of allele effect on phenotype; and they are likely to assist methods for predicting allele pathogenicity.
Konishi, Kyoko; Joober, Ridha; Poirier, Judes; MacDonald, Kathleen; Chakravarty, Mallar; Patel, Raihaan; Breitner, John; Bohbot, Véronique D.
2018-01-01
Early detection of Alzheimer’s disease (AD) has been challenging as current biomarkers are invasive and costly. Strong predictors of future AD diagnosis include lower volume of the hippocampus and entorhinal cortex, as well as the ɛ4 allele of the Apolipoprotein E (APOE) gene. Therefore, studying functions that are critically mediated by the hippocampus and entorhinal cortex, such as spatial memory, in APOE ɛ4 allele carriers, may be key to the identification of individuals at risk of AD, prior to the manifestation of cognitive impairments. Using a virtual navigation task developed in-house, specifically designed to assess spatial versus non-spatial strategies, the current study is the first to differentiate functional and structural differences within APOE ɛ4 allele carriers. APOE ɛ4 allele carriers that predominantly use non-spatial strategies have decreased fMRI activity in the hippocampus and increased atrophy in the hippocampus, entorhinal cortex, and fimbria compared to APOE ɛ4 allele carriers who use spatial strategies. In contrast, APOE ɛ4 allele carriers who use spatial strategies have grey matter levels comparable to non-APOE ɛ4 allele carriers. Furthermore, in a leave-one-out analysis, grey matter in the entorhinal cortex could predict navigational strategy with 92% accuracy. PMID:29278888
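The leave-one-out analysis mentioned above can be illustrated generically: each subject is held out in turn, a classifier is fitted on the remaining subjects, and the held-out subject is predicted. The sketch below assumes a single feature and a nearest-class-mean classifier; the abstract does not say which classifier the authors used, and the function name, values and labels are synthetic.

```python
def loo_accuracy(values, labels):
    """Leave-one-out accuracy of a nearest-class-mean classifier on one feature."""
    correct = 0
    for i, (value, label) in enumerate(zip(values, labels)):
        # Fit on everything except sample i.
        train = [(v, l) for j, (v, l) in enumerate(zip(values, labels)) if j != i]
        means = {}
        for cls in {l for _, l in train}:
            members = [v for v, l in train if l == cls]
            means[cls] = sum(members) / len(members)
        # Predict the class whose training mean is closest to the held-out value.
        predicted = min(means, key=lambda cls: abs(value - means[cls]))
        correct += predicted == label
    return correct / len(values)


# Synthetic feature values and strategy labels, for illustration only.
grey_matter = [1.0, 1.1, 2.0, 2.1, 2.05]
strategy = ["non-spatial", "non-spatial", "spatial", "spatial", "non-spatial"]
print(loo_accuracy(grey_matter, strategy))
```

Because the held-out subject never contributes to the class means used to predict it, the reported accuracy is an estimate of out-of-sample performance rather than of fit.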
Garcia-Gonzalez, Eva; Müller, Sebastian; Ensle, Paul; Süssmuth, Roderich D; Genersch, Elke
2014-05-01
American foulbrood (AFB), caused by the bee pathogenic bacterium Paenibacillus larvae, is the most devastating bacterial disease of honey bees worldwide. From AFB-dead larvae, pure cultures of P. larvae can normally be cultivated, indicating that P. larvae is able to defend its niche against all other bacteria present. Recently, comparative genome analysis within the species P. larvae suggested the presence of gene clusters coding for multi-enzyme complexes, such as non-ribosomal peptide synthetases (NRPSs). The products of these enzyme complexes are known to have a wide range of biological activities including antibacterial activities. We here present our results on antibacterial activity exhibited by vegetative P. larvae and the identification and analysis of a novel antibacterially active P. larvae tripeptide (called sevadicin; Sev) produced by an NRPS encoded by a gene cluster found in the genome of P. larvae. Identification of Sev was ultimately achieved by comparing the secretome of wild-type P. larvae with knockout mutants of P. larvae lacking production of Sev. Subsequent mass spectrometric studies, enantiomer analytics and chemical synthesis revealed the sequence and configuration of the tripeptide, D-Phe-D-Ala-Trp, which was shown to have antibacterial activity. The relevance of our findings is discussed with respect to host-pathogen interactions.
Rolfe, Rebecca A; Nowlan, Niamh C; Kenny, Elaine M; Cormican, Paul; Morris, Derek W; Prendergast, Patrick J; Kelly, Daniel; Murphy, Paula
2014-01-20
Mechanical stimulation is necessary for regulating correct formation of the skeleton. Here we test the hypothesis that mechanical stimulation of the embryonic skeletal system impacts expression levels of genes implicated in developmentally important signalling pathways in a genome wide approach. We use a mutant mouse model with altered mechanical stimulation due to the absence of limb skeletal muscle (Splotch-delayed) where muscle-less embryos show specific defects in skeletal elements including delayed ossification, changes in the size and shape of cartilage rudiments and joint fusion. We used Microarray and RNA sequencing analysis tools to identify differentially expressed genes between muscle-less and control embryonic (TS23) humerus tissue. We found that 680 independent genes were down-regulated and 452 genes up-regulated in humeri from muscle-less Spd embryos compared to littermate controls (at least 2-fold; corrected p-value ≤0.05). We analysed the resulting differentially expressed gene sets using Gene Ontology annotations to identify significant enrichment of genes associated with particular biological processes, showing that removal of mechanical stimuli from muscle contractions affected genes associated with development and differentiation, cytoskeletal architecture and cell signalling. Among cell signalling pathways, the most strongly disturbed was Wnt signalling, with 34 genes including 19 pathway target genes affected. Spatial gene expression analysis showed that both a Wnt ligand encoding gene (Wnt4) and a pathway antagonist (Sfrp2) are up-regulated specifically in the developing joint line, while the expression of a Wnt target gene, Cd44, is no longer detectable in muscle-less embryos. 
The identification of 84 genes associated with the cytoskeleton that are down-regulated in the absence of muscle indicates a number of candidate genes that are both mechanoresponsive and potentially involved in mechanotransduction, converting a mechanical stimulus into a transcriptional response. This work identifies key developmental regulatory genes impacted by altered mechanical stimulation, sheds light on the molecular mechanisms that interpret mechanical stimulation during skeletal development and provides valuable resources for further investigation of the mechanistic basis of mechanoregulation. In particular it highlights the Wnt signalling pathway as a potential point of integration of mechanical and molecular signalling and cytoskeletal components as mediators of the response. PMID:24443808
Contreras Gutiérrez, María Angélica; Vivero, Rafael J.; Vélez, Iván D.; Porter, Charles H.; Uribe, Sandra
2014-01-01
Sand flies include a group of insects that are of medical importance and that vary in geographic distribution, ecology, and pathogen transmission. Approximately 163 species of sand flies have been reported in Colombia. Surveillance of the presence of sand fly species and the updating of species distribution records are important for predicting risks for and monitoring the expansion of diseases which sand flies can transmit. Currently, the identification of phlebotomine sand flies is based on morphological characters. However, morphological identification requires considerable skills and taxonomic expertise. In addition, significant morphological similarity between some species, especially among females, may cause difficulties during the identification process. DNA-based approaches have become increasingly useful and promising tools for estimating sand fly diversity and for ensuring the rapid and accurate identification of species. A partial sequence of the mitochondrial cytochrome oxidase gene subunit I (COI) is currently being used to differentiate species in different animal taxa, including insects, and it is referred to as a barcoding sequence. The present study explored the utility of the DNA barcode approach for the identification of phlebotomine sand flies in Colombia. We sequenced 700 bp of the COI gene from 36 species collected from different geographic localities. The COI barcode sequence divergence within a single species was <2% in most cases, whereas this divergence ranged from 9% to 26.6% among different species. These results indicated that the barcoding gene correctly discriminated among the previously morphologically identified species with an efficacy of nearly 100%. Analyses of the generated sequences indicated that the observed species groupings were consistent with the morphological identifications. In conclusion, the barcoding gene was useful for species discrimination in sand flies from Colombia. PMID:24454877
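The within-species versus between-species divergence comparison reported above is the core of a barcode analysis, and can be sketched as follows. This is a simplified illustration assuming pre-aligned sequences and an uncorrected p-distance; the abstract does not state which distance model the study used, and the function names, species labels and sequences are made up.

```python
from itertools import combinations


def p_distance(a, b):
    """Proportion of differing sites between two aligned sequences.

    Sites with a gap ('-') or ambiguous base ('N') in either sequence are skipped.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = differences = 0
    for x, y in zip(a, b):
        if x in "-N" or y in "-N":
            continue
        compared += 1
        differences += x != y
    if compared == 0:
        raise ValueError("no comparable sites")
    return differences / compared


def divergence_summary(records):
    """records: list of (species, aligned_sequence).

    Returns (largest within-species distance, smallest between-species distance);
    either value is None when no such pair exists.
    """
    intra, inter = [], []
    for (sp1, s1), (sp2, s2) in combinations(records, 2):
        (intra if sp1 == sp2 else inter).append(p_distance(s1, s2))
    return (max(intra) if intra else None, min(inter) if inter else None)


# Made-up aligned fragments, for illustration only.
records = [
    ("species_A", "ACGTACGTAC"),
    ("species_A", "ACGTACGTAT"),
    ("species_B", "TCGAACGAAC"),
]
print(divergence_summary(records))
```

A barcode separates species cleanly when the largest within-species distance is smaller than the smallest between-species distance, the pattern the abstract reports (<2% versus 9% to 26.6%).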
Hoshino, Tomonori; Fujiwara, Taku; Kilian, Mogens
2005-12-01
The aim of this study was to evaluate molecular and phenotypic methods for the identification of nonhemolytic streptococci. A collection of 148 strains consisting of 115 clinical isolates from cases of infective endocarditis, septicemia, and meningitis and 33 reference strains, including type strains of all relevant Streptococcus species, were examined. Identification was performed by phylogenetic analysis of nucleotide sequences of four housekeeping genes, ddl, gdh, rpoB, and sodA; by PCR analysis of the glucosyltransferase (gtf) gene; and by conventional phenotypic characterization and identification using two commercial kits, Rapid ID 32 STREP and STREPTOGRAM and the associated databases. A phylogenetic tree based on concatenated sequences of the four housekeeping genes allowed unequivocal differentiation of recognized species and was used as the reference. Analysis of single gene sequences revealed deviation clustering in eight strains (5.4%) due to homologous recombination with other species. This was particularly evident in S. sanguinis and in members of the anginosus group of streptococci. The rate of correct identification of the strains by both commercial identification kits was below 50% but varied significantly between species. The most significant problems were observed with S. mitis and S. oralis and 11 Streptococcus species described since 1991. Our data indicate that identification based on multilocus sequence analysis is optimal. As a more practical alternative we recommend identification based on sodA sequences with reference to a comprehensive set of sequences that is available for downloading from our server. An analysis of the species distribution of 107 nonhemolytic streptococci from bacteremic patients showed a predominance of S. oralis and S. anginosus with various underlying infections.
Identification of Mycoparasitism-Related Genes in Trichoderma atroviride
Reithner, Barbara; Ibarra-Laclette, Enrique; Mach, Robert L.; Herrera-Estrella, Alfredo
2011-01-01
A high-throughput sequencing approach was utilized to carry out a comparative transcriptome analysis of Trichoderma atroviride IMI206040 during mycoparasitic interactions with the plant-pathogenic fungus Rhizoctonia solani. In this study, transcript fragments of 7,797 Trichoderma genes were sequenced, 175 of which were host responsive. According to the functional annotation of these genes by KOG (eukaryotic orthologous groups), the most abundant group during direct contact was “metabolism.” Quantitative reverse transcription (RT)-PCR confirmed the differential transcription of 13 genes (including swo1, encoding an expansin-like protein; axe1, coding for an acetyl xylan esterase; and homologs of genes encoding the aspartyl protease papA and a trypsin-like protease, pra1) in the presence of R. solani. An additional relative gene expression analysis of these genes, conducted at different stages of mycoparasitism against Botrytis cinerea and Phytophthora capsici, revealed a synergistic transcription of various genes involved in cell wall degradation. The similarities in expression patterns and the occurrence of regulatory binding sites in the corresponding promoter regions suggest a possible analogous regulation of these genes during the mycoparasitism of T. atroviride. Furthermore, a chitin- and distance-dependent induction of pra1 was demonstrated. PMID:21531825
Wei, Ling; Yang, Chao; Tao, Wenjing; Wang, Deshou
2016-01-01
The Sox transcription factor family is characterized by the presence of a Sry-related high-mobility group (HMG) box and plays important roles in various biological processes in animals, including sex determination and differentiation, and the development of multiple organs. In this study, 27 Sox genes were identified in the genome of the Nile tilapia (Oreochromis niloticus), and were classified into seven groups. The members of each group of the tilapia Sox genes exhibited a relatively conserved exon-intron structure. Comparative analysis showed that the Sox gene family has undergone an expansion in tilapia and other teleost fishes following their whole genome duplication, and group K only exists in teleosts. Transcriptome-based analysis demonstrated that most of the tilapia Sox genes showed stage-specific and/or sex-dimorphic expression during gonadal development, and six of the group B Sox genes were specifically expressed in the adult brain. Our results provide a better understanding of gene structure and spatio-temporal expression of the Sox gene family in tilapia, and will be useful for further deciphering the roles of the Sox genes during sex determination and gonadal development in teleosts. PMID:26907269
Wei, Ling; Yang, Chao; Tao, Wenjing; Wang, Deshou
2016-02-23
The Sox transcription factor family is characterized by the presence of a Sry-related high-mobility group (HMG) box and plays important roles in various biological processes in animals, including sex determination and differentiation, and the development of multiple organs. In this study, 27 Sox genes were identified in the genome of the Nile tilapia (Oreochromis niloticus), and were classified into seven groups. The members of each group of the tilapia Sox genes exhibited a relatively conserved exon-intron structure. Comparative analysis showed that the Sox gene family has undergone an expansion in tilapia and other teleost fishes following their whole genome duplication, and group K only exists in teleosts. Transcriptome-based analysis demonstrated that most of the tilapia Sox genes showed stage-specific and/or sex-dimorphic expression during gonadal development, and six of the group B Sox genes were specifically expressed in the adult brain. Our results provide a better understanding of gene structure and spatio-temporal expression of the Sox gene family in tilapia, and will be useful for further deciphering the roles of the Sox genes during sex determination and gonadal development in teleosts.
A signature inferred from Drosophila mitotic genes predicts survival of breast cancer patients.
Damasco, Christian; Lembo, Antonio; Somma, Maria Patrizia; Gatti, Maurizio; Di Cunto, Ferdinando; Provero, Paolo
2011-02-28
The classification of breast cancer patients into risk groups provides a powerful tool for the identification of patients who will benefit from aggressive systemic therapy. The analysis of microarray data has generated several gene expression signatures that improve diagnosis and allow risk assessment. There is also evidence that cell proliferation-related genes have a high predictive power within these signatures. We thus constructed a gene expression signature (the DM signature) using the human orthologues of 108 Drosophila melanogaster genes required for either the maintenance of chromosome integrity (36 genes) or mitotic division (72 genes). The DM signature has minimal overlap with the extant signatures and is highly predictive of survival in 5 large breast cancer datasets. In addition, we show that the DM signature outperforms many widely used breast cancer signatures in predictive power, and performs comparably to other proliferation-based signatures. For most genes of the DM signature, an increased expression is negatively correlated with patient survival. The genes that provide the highest contribution to the predictive power of the DM signature are those involved in cytokinesis. This finding highlights cytokinesis as an important marker in breast cancer prognosis and as a possible target for antimitotic therapies.
Park, Ji Hye
2018-01-01
Estimation of postmortem interval (PMI) is paramount in modern forensic investigation. After the disappearance of the early postmortem phenomena conventionally used to estimate PMI, entomologic evidence provides important indicators for PMI estimation. The age of the oldest fly larvae or pupae can be estimated to pinpoint the time of oviposition, which is considered the minimum PMI (PMImin). The development rate of insects is usually temperature dependent and species specific. Therefore, species identification is mandatory for PMImin estimation using entomological evidence. The classical morphological identification method cannot be applied when specimens are damaged or have not yet matured. To overcome this limitation, some investigators employ molecular identification using mitochondrial cytochrome c oxidase subunit I (COI) nucleotide sequences. The molecular identification method commonly uses Sanger's nucleotide sequencing and molecular phylogeny, which are complex and time consuming and constitute another obstacle for forensic investigators. In this study, instead of using conventional Sanger's nucleotide sequencing, single-nucleotide polymorphisms (SNPs) in the COI gene region, which are unique between fly species, were selected and targeted for single-base extension (SBE) technology. These SNPs were genotyped using a SNaPshot® kit. Eleven Calliphoridae and seven Sarcophagidae species were covered. To validate this genotyping, fly DNA samples (103 adults, 84 larvae, and 4 pupae) previously confirmed by DNA barcoding were used. This method worked quickly with minimal DNA, providing a potential alternative to conventional DNA barcoding. Consisting of only a few simple electropherogram peaks, the results were more straightforward compared with those of the conventional DNA barcoding produced by Sanger's nucleotide sequencing. PMID:29682531
Feather Development Genes and Associated Regulatory Innovation Predate the Origin of Dinosauria
Lowe, Craig B.; Clarke, Julia A.; Baker, Allan J.; Haussler, David; Edwards, Scott V.
2015-01-01
The evolution of avian feathers has recently been illuminated by fossils and the identification of genes involved in feather patterning and morphogenesis. However, molecular studies have focused mainly on protein-coding genes. Using comparative genomics and more than 600,000 conserved regulatory elements, we show that patterns of genome evolution in the vicinity of feather genes are consistent with a major role for regulatory innovation in the evolution of feathers. Rates of innovation at feather regulatory elements exhibit an extended period of innovation with peaks in the ancestors of amniotes and archosaurs. We estimate that 86% of such regulatory elements and 100% of the nonkeratin feather gene set were present prior to the origin of Dinosauria. On the branch leading to modern birds, we detect a strong signal of regulatory innovation near insulin-like growth factor binding protein (IGFBP) 2 and IGFBP5, which have roles in body size reduction, and may represent a genomic signature for the miniaturization of dinosaurian body size preceding the origin of flight. PMID:25415961
DOE Office of Scientific and Technical Information (OSTI.GOV)
Duncan, Katherine R.; Crüsemann, Max; Lechner, Anna; ...
2015-04-09
Genome sequencing has revealed that bacteria contain many more biosynthetic gene clusters than predicted based on the number of secondary metabolites discovered to date. While this biosynthetic reservoir has fostered interest in new tools for natural product discovery, there remains a gap between gene cluster detection and compound discovery. In this paper, we apply molecular networking and the new concept of pattern-based genome mining to 35 Salinispora strains, including 30 for which draft genome sequences were either available or obtained for this study. The results provide a method to simultaneously compare large numbers of complex microbial extracts, which facilitated the identification of media components, known compounds and their derivatives, and new compounds that could be prioritized for structure elucidation. Finally, these efforts revealed considerable metabolite diversity and led to several molecular family-gene cluster pairings, of which the quinomycin-type depsipeptide retimycin A was characterized and linked to gene cluster NRPS40 using pattern-based bioinformatic approaches.
High-throughput identification of antigen-specific TCRs by TCR gene capture.
Linnemann, Carsten; Heemskerk, Bianca; Kvistborg, Pia; Kluin, Roelof J C; Bolotin, Dmitriy A; Chen, Xiaojing; Bresser, Kaspar; Nieuwland, Marja; Schotte, Remko; Michels, Samira; Gomez-Eerland, Raquel; Jahn, Lorenz; Hombrink, Pleun; Legrand, Nicolas; Shu, Chengyi Jenny; Mamedov, Ilgar Z; Velds, Arno; Blank, Christian U; Haanen, John B A G; Turchaninova, Maria A; Kerkhoven, Ron M; Spits, Hergen; Hadrup, Sine Reker; Heemskerk, Mirjam H M; Blankenstein, Thomas; Chudakov, Dmitriy M; Bendle, Gavin M; Schumacher, Ton N M
2013-11-01
The transfer of T cell receptor (TCR) genes into patient T cells is a promising approach for the treatment of both viral infections and cancer. Although efficient methods exist to identify antibodies for the treatment of these diseases, comparable strategies to identify TCRs have been lacking. We have developed a high-throughput DNA-based strategy to identify TCR sequences by the capture and sequencing of genomic DNA fragments encoding the TCR genes. We establish the value of this approach by assembling a large library of cancer germline tumor antigen-reactive TCRs. Furthermore, by exploiting the quantitative nature of TCR gene capture, we show the feasibility of identifying antigen-specific TCRs in oligoclonal T cell populations from either human material or TCR-humanized mice. Finally, we demonstrate the ability to identify tumor-reactive TCRs within intratumoral T cell subsets without knowledge of antigen specificities, which may be the first step toward the development of autologous TCR gene therapy to target patient-specific neoantigens in human cancer.
Cho, Ah Ra; Lim, Eun Jin; Veeranagouda, Yaligara; Lee, Kyoung
2011-11-01
In this study, the chromosome-encoded pcuRCAXB genes that are required for p-cresol degradation were identified by using a newly constructed green fluorescent protein (GFP)-based promoter probe transposon in the long-chain alkylphenol degrader Pseudomonas alkylphenolia. The deduced amino acid sequences of the genes showed highest identities of 65-93% with sequences in the databases. The transposon was found to be inserted in the pcuA gene, with the promoterless gfp gene under the control of the pcu catabolic gene promoter. The expression of GFP was positively induced by p-cresol and was about 10 times higher in cells grown on agar than in cells grown in liquid culture. In addition, p-hydroxybenzoic acid was detected during p-cresol degradation. These results indicate that P. alkylphenolia additionally possesses a protocatechuate ortho-cleavage route for p-cresol degradation that is dominantly expressed in colonies.
Duncan, Katherine R.; Crüsemann, Max; Lechner, Anna; Sarkar, Anindita; Li, Jie; Ziemert, Nadine; Wang, Mingxun; Bandeira, Nuno; Moore, Bradley S.; Dorrestein, Pieter C.; Jensen, Paul R.
2015-01-01
Summary: Genome sequencing has revealed that bacteria contain many more biosynthetic gene clusters than predicted based on the number of secondary metabolites discovered to date. While this biosynthetic reservoir has fostered interest in new tools for natural product discovery, there remains a gap between gene cluster detection and compound discovery. Here we apply molecular networking and the new concept of pattern-based genome mining to 35 Salinispora strains including 30 for which draft genome sequences were either available or obtained for this study. The results provide a method to simultaneously compare large numbers of complex microbial extracts, which facilitated the identification of media components, known compounds and their derivatives, and new compounds that could be prioritized for structure elucidation. These efforts revealed considerable metabolite diversity and led to several molecular family-gene cluster pairings, of which the quinomycin-type depsipeptide retimycin A was characterized and linked to gene cluster NRPS40 using pattern-based bioinformatic approaches. PMID:25865308
Wang, Jia-Hong; Zhao, Ling-Feng; Lin, Pei; Su, Xiao-Rong; Chen, Shi-Jun; Huang, Li-Qiang; Wang, Hua-Feng; Zhang, Hai; Hu, Zhen-Fu; Yao, Kai-Tai; Huang, Zhong-Xi
2014-09-01
Identifying biological functions and molecular networks in a gene list and how the genes may relate to various topics is of considerable value to biomedical researchers. Here, we present a web-based text-mining server, GenCLiP 2.0, which can analyze human genes with enriched keywords and molecular interactions. Compared with other similar tools, GenCLiP 2.0 offers two unique features: (i) analysis of gene functions with free terms (i.e. any terms in the literature) generated by literature mining or provided by the user and (ii) accurate identification and integration of comprehensive molecular interactions from Medline abstracts, to construct molecular networks and subnetworks related to the free terms. The server is available at http://ci.smu.edu.cn. Supplementary data are available at Bioinformatics online.
Hereditary breast cancer: from molecular pathology to tailored therapies.
Tan, D S P; Marchiò, C; Reis-Filho, J S
2008-10-01
Hereditary breast cancer accounts for 5-10% of all breast carcinomas. Recent studies have demonstrated that mutations in two high-penetrance genes, namely BRCA1 and BRCA2, are responsible for about 16% of the familial risk of breast cancer. Even though subsequent studies have failed to find another high-penetrance breast cancer susceptibility gene, several genes that confer a moderate to low risk of breast cancer development have been identified; moreover, hereditary breast cancer can be part of multiple cancer syndromes. In this review we will focus on the hereditary breast carcinomas caused by mutations in BRCA1, BRCA2, Fanconi anaemia (FANC) genes, CHK2 and ATM tumour suppressor genes. We describe the hallmark histological features of these carcinomas compared with non-hereditary breast cancers and show how an accurate histopathological diagnosis may help improve the identification of patients to be screened for mutations. Finally, novel therapeutic approaches to treat patients with BRCA1 and BRCA2 germ line mutations, including cross-linking agents and PARP inhibitors, are discussed.
Janeja, H S; Banga, S S; Lakshmikumaran, M
2003-06-01
The tournefortii cytoplasmic male-sterility system is being used as a method of pollination control to develop hybrids in Brassica napus. Genetic analyses have indicated that two dominant genes, one major (Rft1) and another minor (Rft2), were required to achieve complete fertility restoration. Though the major gene (Rft1) can cause complete fertility restoration on its own, its expression was significantly enhanced in the presence of the minor gene (Rft2). In the absence of Rft1, Rft2 caused only partial fertility restoration. We used a pair of near-isogenic lines (NILs), differing for the presence/absence of Rf genes, to identify AFLP markers linked to fertility restorer genes. A total of 64 EcoRI/MseI primer combinations were surveyed which produced 3,225 bands, of which 19 (0.6%) were polymorphic between parental NILs. Primer combinations which led to the identification of polymorphic bands present in fertile parental NILs were used for assaying a mapping population of 70 F2 plants for determining the segregation pattern of markers. Initial screening resulted in the identification of five AFLP markers. The recombination analyses of these AFLP markers revealed that at least two (EACC/MCTT(105), EAAG/MCTC(80)) were present in the same linkage group along with the Rf loci. Marker EACC/MCTT(105) was separated from the major gene (Rft1) by a distance of 18.1 cM, while it was 33.2 cM away from the minor fertility restorer gene (Rft2). Another marker, EAAG/MCTC(80), was also located adjacent to Rft1 at a distance of 18.1 cM, but on the other side. Identification of flanking markers (EACC/MCTT(105), EAAG/MCTC(80)) for the major fertility restorer gene (Rft1) provides a crucial component for marker-assisted selection and map-based cloning of the restorer genes, and can hence be used to construct elite restorer genotypes.
Suh, Yeunsu; Davis, Michael E.; Lee, Kichoon
2013-01-01
Understanding the tissue-specific pattern of gene expression is critical in elucidating the molecular mechanisms of tissue development, gene function, and transcriptional regulation of biological processes. Although tissue-specific gene expression information is available in several databases, follow-up strategies to integrate and use these data are limited. The objective of the current study was to identify and evaluate novel tissue-specific genes in human and mouse tissues by performing comparative microarray database analysis and semi-quantitative PCR analysis. We developed a powerful approach to predict tissue-specific genes by analyzing existing microarray data from the NCBI's Gene Expression Omnibus (GEO) public repository. We investigated and confirmed tissue-specific gene expression in the human and mouse kidney, liver, lung, heart, muscle, and adipose tissue. Applying our novel comparative microarray approach, we confirmed 10 kidney, 11 liver, 11 lung, 11 heart, 8 muscle, and 8 adipose specific genes. The accuracy of this approach was further verified by employing semi-quantitative PCR and by searching for gene function information in existing publications. Three novel tissue-specific genes were discovered by this approach including AMDHD1 (amidohydrolase domain containing 1) in the liver, PRUNE2 (prune homolog 2) in the heart, and ACVR1C (activin A receptor, type IC) in adipose tissue. We further confirmed the tissue-specific expression of these 3 novel genes by real-time PCR. Among them, ACVR1C is adipose tissue-specific and adipocyte-specific in adipose tissue, and can be used as an adipocyte developmental marker. From GEO profiles, we predicted the processes in which AMDHD1 and PRUNE2 may participate. Our approach provides a novel way to identify new sets of tissue-specific genes and to predict functions in which they may be involved. PMID:23741331
DNA-microarrays identification of Streptococcus mutans genes associated with biofilm thickness
Shemesh, Moshe; Tam, Avshalom; Kott-Gutkowski, Miriam; Feldman, Mark; Steinberg, Doron
2008-01-01
Background: A biofilm is a complex community of microorganisms that develop on surfaces in diverse environments. The thickness of the biofilm plays a crucial role in the physiology of the immobilized bacteria. The most cariogenic bacteria, mutans streptococci, are common inhabitants of a dental biofilm community. In this study, DNA-microarray analysis was used to identify differentially expressed genes associated with the thickness of S. mutans biofilms. Results: Comparative transcriptome analyses indicated that expression of 29 genes was differentially altered in 400- vs. 100-micron-deep biofilms and of 39 genes in 200- vs. 100-micron biofilms. Only 10 S. mutans genes showed differential expression in both 400- vs. 100-micron and 200- vs. 100-micron biofilms. All of these genes were upregulated. As sucrose is a predominant factor in oral biofilm development, its influence on the expression of selected genes was evaluated at the various biofilm depths. The presence of sucrose did not noticeably change the regulation of these genes in 400- vs. 100-micron and/or 200- vs. 100-micron biofilms tested by real-time RT-PCR. Furthermore, we analyzed the expression profile of selected biofilm thickness-associated genes in the luxS- mutant strain. The expression of those genes was not radically changed in the mutant strain compared to wild-type bacteria in planktonic condition. Only slight downregulation was recorded in the expression of the SMU.2146c, SMU.574, SMU.609, and SMU.987 genes in luxS- bacteria in biofilm vs. planktonic environments. Conclusion: These findings reveal genes associated with the thickness of biofilms of S. mutans. Expression of these genes is apparently not regulated directly by luxS and is not necessarily influenced by the presence of sucrose in the growth media. PMID:19114020
He, Yajun; Mao, Shaoshuai; Gao, Yulong; Zhu, Liying; Wu, Daoming; Cui, Yixin; Li, Jiana; Qian, Wei
2016-01-01
WRKY transcription factors play important roles in responses to environmental stress stimuli. Using a genome-wide domain analysis, we identified 287 WRKY genes with 343 WRKY domains in the sequenced genome of Brassica napus, 139 in the A sub-genome and 148 in the C sub-genome. These genes were classified into eight groups based on phylogenetic analysis. In the 343 WRKY domains, a total of 26 members showed divergence in the WRKY domain, and 21 belonged to group I. This finding suggested that WRKY genes in group I are more active and variable compared with genes in other groups. Using genome-wide identification and analysis of the WRKY gene family in Brassica napus, we observed genome duplication, chromosomal/segmental duplications and tandem duplication. All of these duplications contributed to the expansion of the WRKY gene family. The duplicate segments that were detected indicated that genome duplication events occurred in the two diploid progenitors B. rapa and B. oleracea before they combined to form B. napus. Analysis of the public microarray database and EST database for B. napus indicated that 74 WRKY genes were induced or preferentially expressed under stress conditions. According to the public QTL data, we identified 77 WRKY genes in 31 QTL regions related to tolerance to various stresses. We further evaluated the expression of 26 BnaWRKY genes under multiple stresses by qRT-PCR. Most of the genes were induced by low temperature, salinity and drought stress, indicating that the WRKYs play important roles in B. napus stress responses. Further, three BnaWRKY genes were strongly responsive to all three stresses simultaneously, which suggests that these three WRKY genes may have multi-functional roles in stress tolerance and can potentially be used in breeding new rapeseed cultivars. We also found six tandem repeat pairs exhibiting similar expression profiles under the various stress conditions, and three pairs were mapped in the stress-related QTL regions, indicating a role for tandem duplicate WRKYs in the adaptive responses to environmental stimuli during evolution. Our results provide a framework for future studies regarding the function of WRKY genes in response to stress in B. napus. PMID:27322342
Łastowska, M; Viprey, V; Santibanez-Koref, M; Wappler, I; Peters, H; Cullinane, C; Roberts, P; Hall, A G; Tweddle, D A; Pearson, A D J; Lewis, I; Burchill, S A; Jackson, M S
2007-11-22
Identifying genes whose expression is consistently altered by chromosomal gains or losses is an important step in defining genes of biological relevance in a wide variety of tumour types. However, additional criteria are needed to discriminate further among the large number of candidate genes identified. This is particularly true for neuroblastoma, where multiple genomic copy number changes of proven prognostic value exist. We have used Affymetrix microarrays and a combination of fluorescent in situ hybridization and single nucleotide polymorphism (SNP) microarrays to establish expression profiles and delineate copy number alterations in 30 primary neuroblastomas. Correlation of microarray data with patient survival and analysis of expression within rodent neuroblastoma cell lines were then used to define further genes likely to be involved in the disease process. Using this approach, we identify >1000 genes within eight recurrent genomic alterations (loss of 1p, 3p, 4p, 10q and 11q, 2p gain, 17q gain, and the MYCN amplicon) whose expression is consistently altered by copy number change. Of these, 84 correlate with patient survival, with the minimal regions of 17q gain and 4p loss being enriched significantly for such genes. These include genes involved in RNA and DNA metabolism, and apoptosis. Orthologues of all but one of these genes on 17q are overexpressed in rodent neuroblastoma cell lines. A significant excess of SNPs whose copy number correlates with survival is also observed on proximal 4p in stage 4 tumours, and we find that deletion of 4p is associated with improved outcome in an extended cohort of tumours. These results define the major impact of genomic copy number alterations upon transcription within neuroblastoma, and highlight genes on distal 17q and proximal 4p for downstream analyses. They also suggest that integration of discriminators, such as survival and comparative gene expression, with microarray data may be useful in the identification of critical genes within regions of loss or gain in many human cancers.
The role of retrotransposons in gene family expansions: insights from the mouse Abp gene family.
Janoušek, Václav; Karn, Robert C; Laukaitis, Christina M
2013-05-29
Retrotransposons have been suggested to provide a substrate for non-allelic homologous recombination (NAHR) and thereby promote gene family expansion. Their precise role, however, is controversial. Here we ask whether retrotransposons contributed to the recent expansions of the Androgen-binding protein (Abp) gene families that occurred independently in the mouse and rat genomes. Using dot plot analysis, we found that the most recent duplication in the Abp region of the mouse genome is flanked by L1Md_T elements. Analysis of the sequence of these elements revealed breakpoints that are the relicts of the recombination that caused the duplication, confirming that the duplication arose as a result of NAHR using L1 elements as substrates. L1 and ERVII retrotransposons are considerably denser in the Abp regions than in one Mb flanking regions, while other repeat types are depleted in the Abp regions compared to flanking regions. L1 retrotransposons preferentially accumulated in the Abp gene regions after lineage separation and roughly followed the pattern of Abp gene expansion. By contrast, the proportion of shared vs. lineage-specific ERVII repeats in the Abp region resembles the rest of the genome. We confirmed the role of L1 repeats in Abp gene duplication with the identification of recombinant L1Md_T elements at the edges of the most recent mouse Abp gene duplication. High densities of L1 and ERVII repeats were found in the Abp gene region with abrupt transitions at the region boundaries, suggesting that their higher densities are tightly associated with Abp gene duplication. We observed that the major accumulation of L1 elements occurred after the split of the mouse and rat lineages and that there is a striking overlap between the timing of L1 accumulation and expansion of the Abp gene family in the mouse genome. 
Establishing a link between the accumulation of L1 elements and the expansion of the Abp gene family and identification of an NAHR-related breakpoint in the most recent duplication are the main contributions of our study.
The role of retrotransposons in gene family expansions: insights from the mouse Abp gene family
2013-01-01
Background: Retrotransposons have been suggested to provide a substrate for non-allelic homologous recombination (NAHR) and thereby promote gene family expansion. Their precise role, however, is controversial. Here we ask whether retrotransposons contributed to the recent expansions of the Androgen-binding protein (Abp) gene families that occurred independently in the mouse and rat genomes. Results: Using dot plot analysis, we found that the most recent duplication in the Abp region of the mouse genome is flanked by L1Md_T elements. Analysis of the sequence of these elements revealed breakpoints that are the relicts of the recombination that caused the duplication, confirming that the duplication arose as a result of NAHR using L1 elements as substrates. L1 and ERVII retrotransposons are considerably denser in the Abp regions than in one Mb flanking regions, while other repeat types are depleted in the Abp regions compared to flanking regions. L1 retrotransposons preferentially accumulated in the Abp gene regions after lineage separation and roughly followed the pattern of Abp gene expansion. By contrast, the proportion of shared vs. lineage-specific ERVII repeats in the Abp region resembles the rest of the genome. Conclusions: We confirmed the role of L1 repeats in Abp gene duplication with the identification of recombinant L1Md_T elements at the edges of the most recent mouse Abp gene duplication. High densities of L1 and ERVII repeats were found in the Abp gene region with abrupt transitions at the region boundaries, suggesting that their higher densities are tightly associated with Abp gene duplication. We observed that the major accumulation of L1 elements occurred after the split of the mouse and rat lineages and that there is a striking overlap between the timing of L1 accumulation and expansion of the Abp gene family in the mouse genome.
Establishing a link between the accumulation of L1 elements and the expansion of the Abp gene family and identification of an NAHR-related breakpoint in the most recent duplication are the main contributions of our study. PMID:23718880
Piombo, Edoardo; Sela, Noa; Wisniewski, Michael; Hoffmann, Maria; Gullino, Maria L.; Allard, Marc W.; Levin, Elena; Spadaro, Davide; Droby, Samir
2018-01-01
The yeast Metschnikowia fructicola was reported as an efficient biological control agent of postharvest diseases of fruits and vegetables, and it is the basis of the commercially formulated product “Shemer.” Several mechanisms of action by which M. fructicola inhibits postharvest pathogens were suggested, including iron-binding compounds, induction of defense signaling genes, production of fungal cell wall degrading enzymes and relatively high amounts of superoxide anions. We assembled the whole genome sequence of two strains of M. fructicola using PacBio and Illumina shotgun sequencing technologies. Using PacBio, a high-quality draft genome consisting of 93 contigs, with an estimated genome size of approximately 26 Mb, was obtained. Comparative analysis of M. fructicola proteins with the other three available closely related genomes revealed a shared core of homologous proteins coded by 5,776 genes. Comparing the genomes of the two M. fructicola strains using a SNP calling approach resulted in the identification of 564,302 homologous SNPs with 2,004 predicted high impact mutations. The size of the genome is exceptionally large when compared with those of available closely related organisms, and the high rate of homology among M. fructicola genes points toward a recent whole-genome duplication event as the cause of this large genome. Based on the assembled genome, sequences were annotated with a gene description and gene ontology (GO term) and clustered in functional groups. Analysis of CAZymes family genes revealed 1,145 putative genes, and transcriptomic analysis of CAZyme expression levels in M. fructicola during its interaction with either grapefruit peel tissue or Penicillium digitatum revealed a high level of CAZyme gene expression when the yeast was placed in wounded fruit tissue. PMID:29666611
The expression of proinflammatory genes in epidermal keratinocytes is regulated by hydration status.
Xu, Wei; Jia, Shengxian; Xie, Ping; Zhong, Aimei; Galiano, Robert D; Mustoe, Thomas A; Hong, Seok J
2014-04-01
Mucosal wounds heal more rapidly, exhibit less inflammation, and are associated with minimal scarring when compared with equivalent cutaneous wounds. We previously demonstrated that cutaneous epithelium exhibits an exaggerated response to injury compared with mucosal epithelium. We hypothesized that treatment of injured skin with a semiocclusive dressing preserves the hydration of the skin and results in a wound healing phenotype that more closely resembles that of mucosa. Here we explored whether changes in hydration status alter epidermal gene expression patterns in rabbit partial-thickness incisional wounds. Using microarray studies on injured epidermis, we showed that global gene expression patterns in highly occluded versus non-occluded wounds are distinct. Many genes including IL-1β, IL-8, TNF-α (tumor necrosis factor-α), and COX-2 (cyclooxygenase 2) are upregulated in non-occluded wounds compared with highly occluded wounds. In addition, decreased levels of hydration resulted in an increased expression of proinflammatory genes in human ex vivo skin culture (HESC) and stratified keratinocytes. Hierarchical analysis of genes using RNA interference showed that both TNF-α and IL-1β regulate the expression of IL-8 through independent pathways in response to reduced hydration. Furthermore, both gene knockdown and pharmacological inhibition studies showed that COX-2 mediates the TNF-α/IL-8 pathway by increasing the production of prostaglandin E2 (PGE2). IL-8 in turn controls the production of matrix metalloproteinase-9 in keratinocytes. Our data show that hydration status directly affects the expression of inflammatory signaling in the epidermis. The identification of genes involved in the epithelial hydration pathway provides an opportunity to develop strategies to reduce scarring and optimize wound healing.
Olson, Nathan D.; Lund, Steven P.; Zook, Justin M.; Rojas-Cornejo, Fabiola; Beck, Brian; Foy, Carole; Huggett, Jim; Whale, Alexandra S.; Sui, Zhiwei; Baoutina, Anna; Dobeson, Michael; Partis, Lina; Morrow, Jayne B.
2015-01-01
This study presents the results from an interlaboratory sequencing study for which we developed a novel high-resolution method for comparing data from different sequencing platforms for a multi-copy, paralogous gene. The combination of PCR amplification and 16S ribosomal RNA gene (16S rRNA) sequencing has revolutionized bacteriology by enabling rapid identification, frequently without the need for culture. To assess variability between laboratories in sequencing 16S rRNA, six laboratories sequenced the gene encoding the 16S rRNA from Escherichia coli O157:H7 strain EDL933 and Listeria monocytogenes serovar 4b strain NCTC11994. Participants performed sequencing methods and protocols available in their laboratories: Sanger sequencing, Roche 454 pyrosequencing®, or Ion Torrent PGM®. The sequencing data were evaluated on three levels: (1) identity of biologically conserved position, (2) ratio of 16S rRNA gene copies featuring identified variants, and (3) the collection of variant combinations in a set of 16S rRNA gene copies. The same set of biologically conserved positions was identified for each sequencing method. Analytical methods using Bayesian and maximum likelihood statistics were developed to estimate variant copy ratios, which describe the ratio of nucleotides at each identified biologically variable position, as well as the likely set of variant combinations present in 16S rRNA gene copies. Our results indicate that estimated variant copy ratios at biologically variable positions were only reproducible for high throughput sequencing methods. Furthermore, the likely variant combination set was only reproducible with increased sequencing depth and longer read lengths. We also demonstrate novel methods for evaluating variable positions when comparing multi-copy gene sequence data from multiple laboratories generated using multiple sequencing technologies. PMID:27077030
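To make the idea of a variant copy ratio concrete, here is a minimal Python sketch of one generic maximum-likelihood approach: given read counts at a single variable position and an assumed number of 16S rRNA gene copies, it picks the number of variant-carrying copies that best explains the reads under a binomial model. This is not the analytical method developed in the study; the function name, the binomial model, the symmetric error rate, and the example counts are all assumptions made for illustration.

```python
from math import log


def ml_variant_copies(variant_reads, total_reads, n_copies, error_rate=0.01):
    """Return the number of gene copies (0..n_copies) carrying the variant
    that maximizes a binomial log-likelihood of the observed read counts.

    Hypothetical model: each read shows the variant with probability
    ratio * (1 - error_rate) + (1 - ratio) * error_rate, where ratio is the
    fraction of copies carrying the variant. Requires 0 < error_rate < 0.5
    so that the probabilities stay strictly between 0 and 1.
    """
    if not 0 <= variant_reads <= total_reads:
        raise ValueError("variant_reads must lie between 0 and total_reads")
    if not 0 < error_rate < 0.5:
        raise ValueError("error_rate must lie strictly between 0 and 0.5")
    best_k, best_ll = 0, float("-inf")
    for k in range(n_copies + 1):
        ratio = k / n_copies
        p = ratio * (1 - error_rate) + (1 - ratio) * error_rate
        # The binomial coefficient is the same for every k, so it is omitted.
        ll = variant_reads * log(p) + (total_reads - variant_reads) * log(1 - p)
        if ll > best_ll:
            best_k, best_ll = k, ll
    return best_k


# Hypothetical counts: 143 of 1000 reads show the variant, 7 gene copies assumed.
copies = ml_variant_copies(143, 1000, 7)
copy_ratio = copies / 7
```

Restricting the estimate to whole numbers of copies reflects the fact that a variant can only sit on an integer number of gene copies; a continuous estimate (variant reads divided by total reads) would be the simpler alternative when the copy number is unknown.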
Lv, Qiang; Chen, Ming; Xu, Haiyan; Song, Yuqin; Sun, Zhihong; Dan, Tong; Sun, Tiansong
2013-07-04
Using the 16S rRNA, dnaA, murC and pyrG gene sequences, we determined the phylogenetic relationships among closely related Leuconostoc citreum species. Seven Leu. citreum strains originally isolated from sourdough were characterized by PCR amplification of the dnaA, murC and pyrG genes, whose sequences were determined to assess their suitability as phylogenetic markers. We then estimated genetic distances and constructed phylogenetic trees for the 16S rRNA gene and the three housekeeping genes, combining our sequences with the corresponding published sequences. The topologies of the three housekeeping gene trees were consistent with that of the 16S rRNA gene tree. The sequence homology among closely related Leu. citreum species differed between genes, ranging from 75.5% to 97.2% for dnaA, 50.2% to 99.7% for murC, 65.0% to 99.8% for pyrG and 98.5% to 100% for the 16S rRNA gene. The phylogenetic relationships inferred from the three housekeeping genes were highly consistent with those from the 16S rRNA gene, while the genetic distances for the housekeeping genes were much higher than for the 16S rRNA gene. Consequently, the dnaA, murC and pyrG genes are suitable for the classification and identification of closely related Leu. citreum species.
Chen, Min; Tan, Qiuping; Sun, Mingyue; Li, Dongmei; Fu, Xiling; Chen, Xiude; Xiao, Wei; Li, Ling; Gao, Dongsheng
2016-06-01
Bud dormancy in deciduous fruit trees is an important adaptive mechanism for their survival in cold climates. The WRKY genes participate in several developmental and physiological processes, including dormancy. However, the dormancy mechanisms of WRKY genes have not been studied in detail. We conducted a genome-wide analysis and identified 58 WRKY genes in peach. These putative genes were located on all eight chromosomes. In bioinformatics analyses, we compared the sequences of WRKY genes from peach, rice, and Arabidopsis. In a cluster analysis, the gene sequences formed three groups, of which group II was further divided into five subgroups. Gene structure was highly conserved within each group, especially in groups IId and III. Gene expression analyses by qRT-PCR showed that WRKY genes showed different expression patterns in peach buds during dormancy. The mean expression levels of six WRKY genes (Prupe.6G286000, Prupe.1G393000, Prupe.1G114800, Prupe.1G071400, Prupe.2G185100, and Prupe.2G307400) increased during endodormancy and decreased during ecodormancy, indicating that these six WRKY genes may play a role in dormancy in a perennial fruit tree. This information will be useful for selecting fruit trees with desirable dormancy characteristics or for manipulating dormancy in genetic engineering programs.
Hussain, Tajammul; Plunkett, Blue; Ejaz, Mahwish; Espley, Richard V.; Kayser, Oliver
2018-01-01
The liverwort Radula marginata belongs to the bryophyte division of land plants and is a prospective alternate source of cannabinoid-like compounds. However, mechanistic insights into the molecular pathways directing the synthesis of these cannabinoid-like compounds have been hindered due to the lack of genetic information. This prompted us to do deep sequencing, de novo assembly and annotation of R. marginata transcriptome, which resulted in the identification and validation of the genes for cannabinoid biosynthetic pathway. In total, we have identified 11,421 putative genes encoding 1,554 enzymes from 145 biosynthetic pathways. Interestingly, we have identified all the upstream genes of the central precursor of cannabinoid biosynthesis, cannabigerolic acid (CBGA), including its two first intermediates, stilbene acid (SA) and geranyl diphosphate (GPP). Expression of all these genes was validated using quantitative real-time PCR. We have characterized the protein structure of stilbene synthase (STS), which is considered as a homolog of olivetolic acid in R. marginata. Moreover, the metabolomics approach enabled us to identify CBGA-analogous compounds using electrospray ionization mass spectrometry (ESI-MS/MS) and gas chromatography mass spectrometry (GC-MS). Transcriptomic analysis revealed 1085 transcription factors (TF) from 39 families. Comparative analysis showed that six TF families have been uniquely predicted in R. marginata. In addition, the bioinformatics analysis predicted a large number of simple sequence repeats (SSRs) and non-coding RNAs (ncRNAs). Our results collectively provide mechanistic insights into the putative precursor genes for the biosynthesis of cannabinoid-like compounds and a novel transcriptomic resource for R. marginata. The large-scale transcriptomic resource generated in this study would further serve as a reference transcriptome to explore the Radulaceae family.
Overview of the Genetics of Alcohol Use Disorder
Tawa, Elisabeth A.; Hall, Samuel D.; Lohoff, Falk W.
2016-01-01
Aims: Alcohol Use Disorder (AUD) is a chronic psychiatric illness characterized by harmful drinking patterns leading to negative emotional, physical, and social ramifications. While the underlying pathophysiology of AUD is poorly understood, there is substantial evidence for a genetic component; however, identification of universal genetic risk variants for AUD has been difficult. Recent efforts in the search for AUD susceptibility genes will be reviewed in this article. Methods: In this review, we provide an overview of genetic studies on AUD, including twin studies, linkage studies, candidate gene studies, and genome-wide association studies (GWAS). Results: Several potential genetic susceptibility factors for AUD have been identified, but the genes of alcohol metabolism, alcohol dehydrogenase (ADH) and aldehyde dehydrogenase (ALDH), have been found to be protective against the development of AUD. GWAS have also identified a heterogeneous list of SNPs associated with AUD and alcohol-related phenotypes, emphasizing the complexity and heterogeneity of the disorder. In addition, many of these findings have small effect sizes when compared to alcohol metabolism genes, and biological relevance is often unknown. Conclusions: Although studies spanning multiple approaches have suggested a genetic basis for AUD, identification of the genetic risk variants has been challenging. Some promising results are emerging from GWAS studies; however, larger sample sizes are needed to improve GWAS results and resolution. As the field of genetics is rapidly developing, whole genome sequencing could soon become the new standard of interrogation of the genes and neurobiological pathways which contribute to the complex phenotype of AUD. Short summary: This review examines the genetic underpinnings of Alcohol Use Disorder (AUD), with an emphasis on GWAS approaches for identifying genetic risk variants.
The most promising results associated with AUD and alcohol-related phenotypes have included SNPs of the alcohol metabolism genes ADH and ALDH. PMID:27445363
de Vos, B; Rijken, J A; Adank, M A; Hoksbergen, A W J; Bayley, J P; Leemans, C R; Hensen, E F
2018-06-01
In the Netherlands, the majority of hereditary head and neck paragangliomas (HNPGL) are caused by germline variants in the succinate dehydrogenase genes (SDHD, SDHB, SDHAF2). Here, we evaluate a four-generation family linked to a novel SDHB gene variant with the manifestation of a HNPGL. A family-based study. The VU University Medical Center (VUmc) Amsterdam, a tertiary clinic for Otolaryngology and Head and Neck Surgery. The index patients presented with an embryonic rhabdomyosarcoma and a non-Hodgkin lymphoma. Array-based comparative genomic hybridisation (aCGH) analysis and multiplex ligation-dependent probe amplification (MLPA) revealed a novel deletion of exon 1-3 in the SDHB gene, suspected to predispose to paraganglioma (PGL)/pheochromocytoma (PHEO) syndrome type 4. Subsequently, genetic counselling and DNA testing were offered to all family members at risk. Individuals that tested positive for this novel SDHB gene variant were counselled and additional clinical evaluation was offered for the identification of HNPGL and/or PHEO. The DNA of 18 family members was tested, resulting in the identification of 10 carriers of the exon 1-3 deletion in the SDHB gene. One carrier was diagnosed with a carotid body PGL and serum catecholamine excess, which was surgically excised. Negative SDHB immunostaining of the carotid body tumour confirmed that it was caused by the SDHB variant. The remaining 9 carriers showed no evidence of PGL/PHEO. Deletion of exon 1-3 in the SDHB gene is a novel germline variant associated with the formation of hereditary HNPGL. © 2018 The Authors. Clinical Otolaryngology Published by John Wiley & Sons Ltd.
Optimization of Multilocus Sequence Analysis for Identification of Species in the Genus Vibrio
Gabriel, Michael W.; Matsui, George Y.; Friedman, Robert
2014-01-01
Multilocus sequence analysis (MLSA) is an important method for identification of taxa that are not well differentiated by 16S rRNA gene sequences alone. In this procedure, concatenated sequences of selected genes are constructed and then analyzed. The effects that the number and the order of genes used in MLSA have on reconstruction of phylogenetic relationships were examined. The recA, rpoA, gapA, 16S rRNA gene, gyrB, and ftsZ sequences from 56 species of the genus Vibrio were used to construct molecular phylogenies, and these were evaluated individually and using various gene combinations. Phylogenies from two-gene sequences employing recA and rpoA in both possible gene orders were different. The addition of the gapA gene sequence, producing all six possible concatenated sequences, reduced the differences in phylogenies to degrees of statistical (bootstrap) support for some nodes. The overall statistical support for the phylogenetic tree, assayed on the basis of a reliability score (calculated from the number of nodes having bootstrap values of ≥80 divided by the total number of nodes) increased with increasing numbers of genes used, up to a maximum of four. No further improvement was observed from addition of the fifth gene sequence (ftsZ), and addition of the sixth gene (gyrB) resulted in lower proportions of strongly supported nodes. Reductions in the numbers of strongly supported nodes were also observed when maximum parsimony was employed for tree construction. Use of a small number of gene sequences in MLSA resulted in accurate identification of Vibrio species. PMID:24951781
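As a rough illustration of the reliability score described above (the number of nodes with bootstrap values of at least 80 divided by the total number of nodes), the following Python sketch computes the score from a list of per-node bootstrap values and shows how per-gene sequences might be concatenated in a chosen gene order. The function names, the input layout, and the example values are assumptions made for illustration and do not come from the study.

```python
def reliability_score(bootstrap_values, threshold=80):
    """Fraction of tree nodes whose bootstrap support is >= threshold."""
    if not bootstrap_values:
        raise ValueError("at least one node is required")
    strong = sum(1 for value in bootstrap_values if value >= threshold)
    return strong / len(bootstrap_values)


def concatenate_loci(sequences_by_gene, gene_order):
    """Join the per-gene sequences of one taxon in the given gene order."""
    return "".join(sequences_by_gene[gene] for gene in gene_order)


# Hypothetical bootstrap values for a tree with five internal nodes.
example_supports = [100, 95, 80, 79, 50]
score = reliability_score(example_supports)

# Hypothetical, very short sequences; the two gene orders yield different
# concatenated strings, which is why gene order can matter downstream.
loci = {"recA": "ATGC", "rpoA": "GGTA"}
forward = concatenate_loci(loci, ["recA", "rpoA"])
reverse = concatenate_loci(loci, ["rpoA", "recA"])
```

In practice the bootstrap values would be read from a tree file produced by a phylogenetics package rather than typed in by hand; the list input here simply keeps the sketch independent of any particular tool.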