Science.gov

Sample records for avian comparative genomics

  1. Comparative genomics reveals insights into avian genome evolution and adaptation.

    PubMed

    Zhang, Guojie; Li, Cai; Li, Qiye; Li, Bo; Larkin, Denis M; Lee, Chul; Storz, Jay F; Antunes, Agostinho; Greenwold, Matthew J; Meredith, Robert W; Ödeen, Anders; Cui, Jie; Zhou, Qi; Xu, Luohao; Pan, Hailin; Wang, Zongji; Jin, Lijun; Zhang, Pei; Hu, Haofu; Yang, Wei; Hu, Jiang; Xiao, Jin; Yang, Zhikai; Liu, Yang; Xie, Qiaolin; Yu, Hao; Lian, Jinmin; Wen, Ping; Zhang, Fang; Li, Hui; Zeng, Yongli; Xiong, Zijun; Liu, Shiping; Zhou, Long; Huang, Zhiyong; An, Na; Wang, Jie; Zheng, Qiumei; Xiong, Yingqi; Wang, Guangbiao; Wang, Bo; Wang, Jingjing; Fan, Yu; da Fonseca, Rute R; Alfaro-Núñez, Alonzo; Schubert, Mikkel; Orlando, Ludovic; Mourier, Tobias; Howard, Jason T; Ganapathy, Ganeshkumar; Pfenning, Andreas; Whitney, Osceola; Rivas, Miriam V; Hara, Erina; Smith, Julia; Farré, Marta; Narayan, Jitendra; Slavov, Gancho; Romanov, Michael N; Borges, Rui; Machado, João Paulo; Khan, Imran; Springer, Mark S; Gatesy, John; Hoffmann, Federico G; Opazo, Juan C; Håstad, Olle; Sawyer, Roger H; Kim, Heebal; Kim, Kyu-Won; Kim, Hyeon Jeong; Cho, Seoae; Li, Ning; Huang, Yinhua; Bruford, Michael W; Zhan, Xiangjiang; Dixon, Andrew; Bertelsen, Mads F; Derryberry, Elizabeth; Warren, Wesley; Wilson, Richard K; Li, Shengbin; Ray, David A; Green, Richard E; O'Brien, Stephen J; Griffin, Darren; Johnson, Warren E; Haussler, David; Ryder, Oliver A; Willerslev, Eske; Graves, Gary R; Alström, Per; Fjeldså, Jon; Mindell, David P; Edwards, Scott V; Braun, Edward L; Rahbek, Carsten; Burt, David W; Houde, Peter; Zhang, Yong; Yang, Huanming; Wang, Jian; Jarvis, Erich D; Gilbert, M Thomas P; Wang, Jun

    2014-12-12

    Birds are the most species-rich class of tetrapod vertebrates and have wide relevance across many research fields. We explored bird macroevolution using full genomes from 48 avian species representing all major extant clades. The avian genome is principally characterized by its constrained size, which predominantly arose because of lineage-specific erosion of repetitive elements, large segmental deletions, and gene loss. Avian genomes furthermore show a remarkably high degree of evolutionary stasis at the levels of nucleotide sequence, gene synteny, and chromosomal structure. Despite this pattern of conservation, we detected many non-neutral evolutionary changes in protein-coding genes and noncoding regions. These analyses reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits. PMID:25504712

  3. Comparative genomics reveals insights into avian genome evolution and adaptation

    PubMed Central

    Zhang, Guojie; Li, Cai; Li, Qiye; Li, Bo; Larkin, Denis M.; Lee, Chul; Storz, Jay F.; Antunes, Agostinho; Greenwold, Matthew J.; Meredith, Robert W.; Ödeen, Anders; Cui, Jie; Zhou, Qi; Xu, Luohao; Pan, Hailin; Wang, Zongji; Jin, Lijun; Zhang, Pei; Hu, Haofu; Yang, Wei; Hu, Jiang; Xiao, Jin; Yang, Zhikai; Liu, Yang; Xie, Qiaolin; Yu, Hao; Lian, Jinmin; Wen, Ping; Zhang, Fang; Li, Hui; Zeng, Yongli; Xiong, Zijun; Liu, Shiping; Zhou, Long; Huang, Zhiyong; An, Na; Wang, Jie; Zheng, Qiumei; Xiong, Yingqi; Wang, Guangbiao; Wang, Bo; Wang, Jingjing; Fan, Yu; da Fonseca, Rute R.; Alfaro-Núñez, Alonzo; Schubert, Mikkel; Orlando, Ludovic; Mourier, Tobias; Howard, Jason T.; Ganapathy, Ganeshkumar; Pfenning, Andreas; Whitney, Osceola; Rivas, Miriam V.; Hara, Erina; Smith, Julia; Farré, Marta; Narayan, Jitendra; Slavov, Gancho; Romanov, Michael N; Borges, Rui; Machado, João Paulo; Khan, Imran; Springer, Mark S.; Gatesy, John; Hoffmann, Federico G.; Opazo, Juan C.; Håstad, Olle; Sawyer, Roger H.; Kim, Heebal; Kim, Kyu-Won; Kim, Hyeon Jeong; Cho, Seoae; Li, Ning; Huang, Yinhua; Bruford, Michael W.; Zhan, Xiangjiang; Dixon, Andrew; Bertelsen, Mads F.; Derryberry, Elizabeth; Warren, Wesley; Wilson, Richard K; Li, Shengbin; Ray, David A.; Green, Richard E.; O’Brien, Stephen J.; Griffin, Darren; Johnson, Warren E.; Haussler, David; Ryder, Oliver A.; Willerslev, Eske; Graves, Gary R.; Alström, Per; Fjeldså, Jon; Mindell, David P.; Edwards, Scott V.; Braun, Edward L.; Rahbek, Carsten; Burt, David W.; Houde, Peter; Zhang, Yong; Yang, Huanming; Wang, Jian; Jarvis, Erich D.; Gilbert, M. Thomas P.; Wang, Jun

    2015-01-01

    Birds are the most species-rich class of tetrapod vertebrates and have wide relevance across many research fields. We explored bird macroevolution using full genomes from 48 avian species representing all major extant clades. The avian genome is principally characterized by its constrained size, which predominantly arose because of lineage-specific erosion of repetitive elements, large segmental deletions, and gene loss. Avian genomes furthermore show a remarkably high degree of evolutionary stasis at the levels of nucleotide sequence, gene synteny, and chromosomal structure. Despite this pattern of conservation, we detected many non-neutral evolutionary changes in protein-coding genes and noncoding regions. These analyses reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits. PMID:25504712

  4. Avian comparative genomics: reciprocal chromosome painting between domestic chicken (Gallus gallus) and the stone curlew (Burhinus oedicnemus, Charadriiformes)—An atypical species with low diploid number

    PubMed Central

    2009-01-01

    The chicken is the most extensively studied bird species and thus constitutes an ideal reference for comparative genomics in birds. Comparative cytogenetic studies indicate that the chicken has retained many chromosome characters of the ancestral avian karyotype. The homology between chicken macrochromosomes (1–9 and Z) and their counterparts in more than 40 avian species of 10 different orders has been established by chromosome painting. However, the avian homologues of chicken microchromosomes remain to be defined. Moreover, no reciprocal chromosome painting in birds has been performed due to the lack of chromosome-specific probes from other avian species. Here we have generated a set of chromosome-specific paints using flow cytometry that cover the whole genome of the stone curlew (Burhinus oedicnemus, Charadriiformes), a species with one of the lowest diploid numbers so far reported in birds, as well as paints from more microchromosomes of the chicken. A genome-wide comparative map between the chicken and the stone curlew has been constructed for the first time based on reciprocal chromosome painting. The results indicate that extensive chromosome fusions underlie the sharp decrease in the diploid number in the stone curlew. To a lesser extent, chromosome fissions and inversions also occurred during the evolution of the stone curlew. It is anticipated that this complete set of chromosome painting probes from the first Neoaves species will become an invaluable tool for avian comparative cytogenetics. PMID:19172404

  5. DNA Sequence and Comparative Genomics of pAPEC-O2-R, an Avian Pathogenic Escherichia coli Transmissible R Plasmid

    PubMed Central

    Johnson, Timothy J.; Siek, Kylie E.; Johnson, Sara J.; Nolan, Lisa K.

    2005-01-01

    In this study, a 101-kb IncF plasmid from an avian pathogenic Escherichia coli (APEC) strain (APEC O2) was sequenced and analyzed, providing the first completed APEC plasmid sequence. This plasmid, pAPEC-O2-R, has functional transfer and antimicrobial resistance-encoding regions. The resistance-encoding region encodes resistance to eight groups of antimicrobial agents, including silver and other heavy metals, quaternary ammonium compounds, tetracycline, sulfonamides, aminoglycosides, trimethoprim, and beta-lactam antimicrobial agents. This region of the plasmid is unique among previously described IncF plasmids in that it possesses a class 1 integron that harbors three gene cassettes and a heavy metal resistance operon. This region spans 33 kb and is flanked by the RepFII plasmid replicon and an assortment of plasmid maintenance genes. pAPEC-O2-R also contains a 32-kb transfer region that is nearly identical to that found in the E. coli F plasmid, rendering it transferable by conjugation to plasmid-less strains of bacteria, including an APEC strain, a fecal E. coli strain from an apparently healthy bird, a Salmonella enterica serovar Typhimurium strain, and a uropathogenic E. coli strain from humans. Differences in the G+C contents of individual open reading frames suggest that various regions of pAPEC-O2-R had dissimilar origins. The presence of pAPEC-O2-R-like plasmids that encode resistance to multiple antimicrobial agents and that are readily transmissible from APEC to other bacteria suggests the possibility that such plasmids may serve as a reservoir of resistance genes for other bacteria of animal and human health significance. PMID:16251312
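
    The G+C comparison mentioned in this abstract can be illustrated with a minimal sketch. The ORF names and sequences below are invented for illustration and are not taken from pAPEC-O2-R; the function only reports the fraction of G and C bases per open reading frame, which is the quantity one would compare across plasmid regions.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence (0.0 for empty input)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# Hypothetical ORFs (toy sequences, not real plasmid data).
orfs = {
    "orf_transfer": "ATGGCGCGCCGGTAA",
    "orf_resistance": "ATGAAATTTATATAA",
}

# G+C fraction per ORF; a large spread between ORFs is the kind of
# signal the abstract interprets as evidence of dissimilar origins.
gc_by_orf = {name: gc_content(seq) for name, seq in orfs.items()}

for name, gc in gc_by_orf.items():
    print(f"{name}: G+C = {gc:.2f}")
```

    In practice the per-ORF values would be compared against the plasmid-wide average; the threshold for calling a region atypical is left open here because the abstract does not give one.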

  6. Complete Genomic Sequence for an Avian Group G Rotavirus from South Africa

    PubMed Central

    Stucker, Karla M.; Stockwell, Timothy B.; Nyaga, Martin M.; Halpin, Rebecca A.; Fedorova, Nadia; Akopov, Asmik; Ngoveni, Harry; Peenze, Ina; Seheri, Mapaseka L.; Mphahlele, M. Jeffrey

    2015-01-01

    We report the first complete sequence for an avian group G rotavirus (RVG) genome from Africa, which is the third publicly available RVG genome. These RVG genomes are highly diverse, especially in their VP4, VP7, NSP4, and NSP3 segments, indicating that RVG diversity is comparable to that of rotavirus A. PMID:25767240

  7. Complete genomic sequence for an avian group G rotavirus from South Africa.

    PubMed

    Stucker, Karla M; Stockwell, Timothy B; Nyaga, Martin M; Halpin, Rebecca A; Fedorova, Nadia; Akopov, Asmik; Ngoveni, Harry; Peenze, Ina; Seheri, Mapaseka L; Mphahlele, M Jeffrey; Wentworth, David E

    2015-01-01

    We report the first complete sequence for an avian group G rotavirus (RVG) genome from Africa, which is the third publicly available RVG genome. These RVG genomes are highly diverse, especially in their VP4, VP7, NSP4, and NSP3 segments, indicating that RVG diversity is comparable to that of rotavirus A. PMID:25767240

  8. Comparative genomics - a perspective.

    PubMed

    Sivashankari, Selvarajan; Shanmughavel, Piramanayagam

    2007-03-27

    The rapidly emerging field of comparative genomics has yielded dramatic results. Comparative genome analysis has become feasible with the availability of a number of completely sequenced genomes. Comparison of complete genomes between organisms allows for global views on genome evolution, and the availability of many completely sequenced genomes increases the predictive power in deciphering the hidden information in genome design, function and evolution. Thus, comparison of human genes with genes from other genomes in a genomic landscape could help assign novel functions for un-annotated genes. Here, we discuss the recently used techniques for comparative genomics and their derived inferences in genome biology.

  9. Comparative genomics - A perspective

    PubMed Central

    Sivashankari, Selvarajan; Shanmughavel, Piramanayagam

    2007-01-01

    The rapidly emerging field of comparative genomics has yielded dramatic results. Comparative genome analysis has become feasible with the availability of a number of completely sequenced genomes. Comparison of complete genomes between organisms allows for global views on genome evolution, and the availability of many completely sequenced genomes increases the predictive power in deciphering the hidden information in genome design, function and evolution. Thus, comparison of human genes with genes from other genomes in a genomic landscape could help assign novel functions for un-annotated genes. Here, we discuss the recently used techniques for comparative genomics and their derived inferences in genome biology. PMID:17597925

  10. The value of avian genomics to the conservation of wildlife

    PubMed Central

    2009-01-01

    Background: Genomic studies in non-domestic avian models, such as the California condor and white-throated sparrow, can lead to more comprehensive conservation plans and provide clues for understanding mechanisms affecting genetic variation, adaptation and evolution. Developing genomic tools and resources including genomic libraries and a genetic map of the California condor is a prerequisite for identification of candidate loci for a heritable embryonic lethal condition. The white-throated sparrow exhibits a stable genetic polymorphism (i.e. chromosomal rearrangements) associated with variation in morphology, physiology, and behavior (e.g., aggression, social behavior, sexual behavior, parental care). In this paper we outline the utility of these species as well as report on recent advances in the study of their genomes. Results: Genotyping of the condor resource population at 17 microsatellite loci provided a better assessment of the current population's genetic variation. Specific New World vulture repeats were found in the condor genome. Using condor BAC library and clones, chicken-condor comparative maps were generated. A condor fibroblast cell line transcriptome was characterized using the 454 sequencing technology. Our karyotypic analyses of the sparrow in combination with other studies indicate that the rearrangements in both chromosomes 2m and 3a are complex and likely involve multiple inversions, interchromosomal linkage, and pleiotropy. At least a portion of the rearrangement in chromosome 2m existed in the common ancestor of the four North American species of Zonotrichia, but not in the one South American species, and the 2m form, originally thought to be the derived condition, might actually be the ancestral one. Conclusion: Mining and characterization of candidate loci in the California condor using molecular genetic and genomic techniques as well as linkage and comparative genomic mapping will eventually enable the identification of carriers of the...

  11. Ensembl comparative genomics resources

    PubMed Central

    Muffato, Matthieu; Beal, Kathryn; Fitzgerald, Stephen; Gordon, Leo; Pignatelli, Miguel; Vilella, Albert J.; Searle, Stephen M. J.; Amode, Ridwan; Brent, Simon; Spooner, William; Kulesha, Eugene; Yates, Andrew; Flicek, Paul

    2016-01-01

    Evolution provides the unifying framework with which to understand biology. The coherent investigation of genic and genomic data often requires comparative genomics analyses based on whole-genome alignments, sets of homologous genes and other relevant datasets in order to evaluate and answer evolutionary-related questions. However, the complexity and computational requirements of producing such data are substantial: this has led to only a small number of reference resources that are used for most comparative analyses. The Ensembl comparative genomics resources are one such reference set that facilitates comprehensive and reproducible analysis of chordate genome data. Ensembl computes pairwise and multiple whole-genome alignments from which large-scale synteny, per-base conservation scores and constrained elements are obtained. Gene alignments are used to define Ensembl Protein Families, GeneTrees and homologies for both protein-coding and non-coding RNA genes. These resources are updated frequently and have a consistent informatics infrastructure and data presentation across all supported species. Specialized web-based visualizations are also available including synteny displays, collapsible gene tree plots, a gene family locator and different alignment views. The Ensembl comparative genomics infrastructure is extensively reused for the analysis of non-vertebrate species by other projects including Ensembl Genomes and Gramene and much of the information here is relevant to these projects. The consistency of the annotation across species and the focus on vertebrates make Ensembl an ideal system to perform and support vertebrate comparative genomic analyses. We use robust software and pipelines to produce reference comparative data and make it freely available. Database URL: http://www.ensembl.org. PMID:26896847

  12. Ensembl comparative genomics resources.

    PubMed

    Herrero, Javier; Muffato, Matthieu; Beal, Kathryn; Fitzgerald, Stephen; Gordon, Leo; Pignatelli, Miguel; Vilella, Albert J; Searle, Stephen M J; Amode, Ridwan; Brent, Simon; Spooner, William; Kulesha, Eugene; Yates, Andrew; Flicek, Paul

    2016-01-01

    Evolution provides the unifying framework with which to understand biology. The coherent investigation of genic and genomic data often requires comparative genomics analyses based on whole-genome alignments, sets of homologous genes and other relevant datasets in order to evaluate and answer evolutionary-related questions. However, the complexity and computational requirements of producing such data are substantial: this has led to only a small number of reference resources that are used for most comparative analyses. The Ensembl comparative genomics resources are one such reference set that facilitates comprehensive and reproducible analysis of chordate genome data. Ensembl computes pairwise and multiple whole-genome alignments from which large-scale synteny, per-base conservation scores and constrained elements are obtained. Gene alignments are used to define Ensembl Protein Families, GeneTrees and homologies for both protein-coding and non-coding RNA genes. These resources are updated frequently and have a consistent informatics infrastructure and data presentation across all supported species. Specialized web-based visualizations are also available including synteny displays, collapsible gene tree plots, a gene family locator and different alignment views. The Ensembl comparative genomics infrastructure is extensively reused for the analysis of non-vertebrate species by other projects including Ensembl Genomes and Gramene and much of the information here is relevant to these projects. The consistency of the annotation across species and the focus on vertebrates make Ensembl an ideal system to perform and support vertebrate comparative genomic analyses. We use robust software and pipelines to produce reference comparative data and make it freely available. Database URL: http://www.ensembl.org.

  13. Genome sequencing of a virulent avian Pasteurella multocida strain GX-Pm reveals the candidate genes involved in the pathogenesis.

    PubMed

    Yu, Chengjie; Sizhu, Suolang; Luo, Qingping; Xu, Xuewen; Fu, Lei; Zhang, Anding

    2016-04-01

    Pasteurella multocida (P. multocida) was first shown to be the causative agent of fowl cholera by Louis Pasteur in 1881. The first genomic study was performed on an avirulent avian strain, Pm70, and by 2013 the genomes of two virulent avian strains, X73 and P1059, had been sequenced. Comparative genome study supplied important information for further study on the pathogenesis of fowl cholera. In a previous study, a capsular serotype A strain, GX-Pm, was isolated from the liver of a chicken that died during an outbreak of fowl cholera in 2011. The strain showed multiple drug resistance and was highly virulent to chickens. Therefore, the present study performed genome sequencing and a comparative genomic analysis to reveal the candidate genes involved in virulence of P. multocida. The draft genome sequence of GX-Pm was 2,292,886 bp and contained 2941 protein-coding genes, 5 genomic islands, 4 IS elements and 2 prophage regions. Notably, all the predicted drug-resistance genes were included in predicted genomic islands. A comparative genome study of the virulent avian strains P1059, X73 and GX-Pm with the avirulent avian strain Pm70 indicated that 475 unique genes were identified in at least one of the virulent strains but were absent from the avirulent strain. Among these genes, 20 were present in the genomes of all three virulent strains, including a few putative virulence genes. Further characterization of the pathogenic functions of these genes would benefit the understanding of pathogenesis of fowl cholera.
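
    The presence/absence comparison described in this abstract can be sketched with set operations. The gene identifiers below are placeholders, not real P. multocida annotations, and the sketch assumes that orthologous genes have already been assigned a shared identifier across strains, a step the abstract does not describe.

```python
# Hypothetical gene-presence sets per strain; identifiers are placeholders.
genes = {
    "Pm70": {"g1", "g2", "g3"},            # avirulent reference strain
    "P1059": {"g1", "g2", "g4", "g5"},     # virulent
    "X73": {"g1", "g3", "g4", "g6"},       # virulent
    "GX-Pm": {"g2", "g3", "g4", "g7"},     # virulent
}
virulent = ["P1059", "X73", "GX-Pm"]

# Genes found in at least one virulent strain but absent from the avirulent strain.
in_any_virulent = set.union(*(genes[s] for s in virulent)) - genes["Pm70"]

# Genes shared by all virulent strains and absent from the avirulent strain.
in_all_virulent = set.intersection(*(genes[s] for s in virulent)) - genes["Pm70"]

print(sorted(in_any_virulent))
print(sorted(in_all_virulent))
```

    With real data, the first set would correspond to the 475 genes reported above and the second to the 20 genes common to all three virulent strains.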

  14. Current genomic editing approaches in avian transgenesis.

    PubMed

    Park, Tae Sub; Kang, Kyung Soo; Han, Jae Yong

    2013-09-01

    The chicken was domesticated from Red Jungle Fowl over 8000 years ago and became one of the major food sources worldwide. At present, the poultry industry is one of the largest industrial animal stocks in the world, and its economic scale is expanding significantly with increasing consumption. Additionally, since Aristotle used chicken eggs as a model to provide remarkable insights into how life begins, chickens have been used as invaluable and powerful experimental materials for studying embryo development, immune systems, biomedical processes, and hormonal regulation. Combined with advancements in efficient transgenic technology, avian models have become even more important than would have been expected.

  15. Intrachromosomal rearrangements in avian genome evolution: evidence for regions prone to breakpoints

    PubMed Central

    Skinner, B M; Griffin, D K

    2012-01-01

    It is generally believed that the organization of avian genomes remains highly conserved in evolution as chromosome number is constant and comparative chromosome painting demonstrated there to be very few interchromosomal rearrangements. The recent sequencing of the zebra finch (Taeniopygia guttata) genome allowed an assessment of the number of intrachromosomal rearrangements between it and the chicken (Gallus gallus) genome, revealing a surprisingly high number of intrachromosomal rearrangements. With the publication of the turkey (Meleagris gallopavo) genome it has become possible to describe intrachromosomal rearrangements between these three important avian species, gain insight into the direction of evolutionary change and assess whether breakpoint regions are reused in birds. To this end, we aligned entire chromosomes between chicken, turkey and zebra finch, identifying syntenic blocks of at least 250 kb. Potential optimal pathways of rearrangements between each of the three genomes were determined, as was a potential Galliform ancestral organization. From this, our data suggest that around one-third of chromosomal breakpoint regions may recur during avian evolution, with 10% of breakpoints apparently recurring in different lineages. This agrees with our previous hypothesis that mechanisms of genome evolution are driven by hotspots of non-allelic homologous recombination. PMID:22045382
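
    The block-size threshold and the idea of breakpoint reuse in this abstract can be illustrated with a simplified sketch. It assumes syntenic blocks are already available as intervals in chicken coordinates, treats the gap between consecutive retained blocks as a breakpoint region, and flags a region as possibly reused when it overlaps a region from a second comparison. These are illustrative assumptions; the study itself aligned whole chromosomes and inferred rearrangement pathways, which this sketch does not attempt. All coordinates are invented.

```python
from typing import NamedTuple


class Block(NamedTuple):
    chrom: str
    start: int  # 0-based start in reference (chicken) coordinates
    end: int    # exclusive end


MIN_BLOCK = 250_000  # block-size threshold stated in the abstract (250 kb)


def breakpoint_regions(blocks):
    """Drop blocks shorter than MIN_BLOCK and return the gaps between
    consecutive retained blocks on the same chromosome."""
    kept = sorted(b for b in blocks if b.end - b.start >= MIN_BLOCK)
    regions = []
    for prev, cur in zip(kept, kept[1:]):
        if prev.chrom == cur.chrom and cur.start > prev.end:
            regions.append((prev.chrom, prev.end, cur.start))
    return regions


def overlaps(a, b):
    """True if two (chrom, start, end) regions share at least one base."""
    return a[0] == b[0] and a[1] < b[2] and b[1] < a[2]


# Toy blocks for two hypothetical comparisons against chicken chromosome 1.
turkey_blocks = [
    Block("1", 0, 1_000_000),
    Block("1", 1_050_000, 2_000_000),
    Block("1", 2_000_000, 2_100_000),  # below threshold, filtered out
    Block("1", 2_150_000, 3_000_000),
]
finch_blocks = [
    Block("1", 0, 1_020_000),
    Block("1", 1_040_000, 2_500_000),
    Block("1", 2_600_000, 3_000_000),
]

turkey_bp = breakpoint_regions(turkey_blocks)
finch_bp = breakpoint_regions(finch_blocks)

# Breakpoint regions seen in both comparisons are candidates for reuse.
reused = [r for r in turkey_bp if any(overlaps(r, q) for q in finch_bp)]
print(reused)
```

    Coordinate overlap alone cannot distinguish recurrence in different lineages from a single shared event; that distinction needs the ancestral reconstruction mentioned in the abstract.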

  16. Ebolavirus comparative genomics

    DOE PAGES

    Jun, Se-Ran; Leuze, Michael R.; Nookaew, Intawat; Uberbacher, Edward C.; Land, Miriam; Zhang, Qian; Wanchai, Visanu; Chai, Juanjuan; Nielsen, Morten; Trolle, Thomas; et al

    2015-07-14

    The 2014 Ebola outbreak in West Africa is the largest documented for this virus. We examine the dynamics of this genome, comparing more than one hundred currently available ebolavirus genomes to each other and to other viral genomes. Based on oligomer frequency analysis, the family Filoviridae forms a distinct group from all other sequenced viral genomes. All filovirus genomes sequenced to date encode proteins with similar functions and gene order, although there is considerable divergence in sequences between the three genera Ebolavirus, Cuevavirus, and Marburgvirus within the family Filoviridae. Whereas all ebolavirus genomes are quite similar (multiple sequences of the same strain are often identical), variation is most common in the intergenic regions and within specific areas of the genes encoding the glycoprotein (GP), nucleoprotein (NP), and polymerase (L). We predict regions that could contain epitope-binding sites, which might be good vaccine targets. In conclusion, this information, combined with glycosylation sites and experimentally determined epitopes, can identify the most promising regions for the development of therapeutic strategies.
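
    Oligomer frequency analysis, as mentioned in this abstract, can be sketched as follows. The abstract does not state the oligomer length or the distance measure, so k = 3 and Euclidean distance are assumptions made here for illustration, and the two sequences are invented rather than real viral genomes.

```python
from collections import Counter
from itertools import product
import math


def kmer_profile(seq, k=3):
    """Return the relative frequency of every A/C/G/T k-mer in seq,
    in a fixed lexicographic order. K-mers containing other symbols
    (for example N) are ignored."""
    seq = seq.upper()
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    total = sum(counts[m] for m in kmers)
    if total == 0:
        return [0.0] * len(kmers)
    return [counts[m] / total for m in kmers]


def euclidean(p, q):
    """Euclidean distance between two equal-length frequency profiles."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


# Toy sequences (invented; real genomes would be read from FASTA files).
seq_a = "ATGGATTCTCGTCCTCAGAAAGTCTGGATGACG"
seq_b = "ATGCCCGGGCCCGGGTTTAAACCCGGG"

profile_a = kmer_profile(seq_a)
profile_b = kmer_profile(seq_b)
print(f"distance = {euclidean(profile_a, profile_b):.3f}")
```

    Pairwise distances computed this way over many genomes could then be fed to a clustering method to see whether a family forms a distinct group; the choice of clustering method is likewise not specified in the abstract.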

  17. Comparative genome analysis of an avirulent and two virulent strains of avian Pasteurella multocida reveals candidate genes involved in fitness and pathogenicity

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Fowl cholera is a highly contagious systemic disease affecting wild and domestic birds, frequently resulting in high morbidity and mortality. The causative agent is Pasteurella multocida (P. multocida). The completed genome of P. multocida strain Pm70 has been available for over eleven years and has...

  18. Ebolavirus comparative genomics

    PubMed Central

    Jun, Se-Ran; Leuze, Michael R.; Nookaew, Intawat; Uberbacher, Edward C.; Land, Miriam; Zhang, Qian; Wanchai, Visanu; Chai, Juanjuan; Nielsen, Morten; Trolle, Thomas; Lund, Ole; Buzard, Gregory S.; Pedersen, Thomas D.; Wassenaar, Trudy M.; Ussery, David W.

    2015-01-01

    The 2014 Ebola outbreak in West Africa is the largest documented for this virus. To examine the dynamics of this genome, we compare more than 100 currently available ebolavirus genomes to each other and to other viral genomes. Based on oligomer frequency analysis, the family Filoviridae forms a distinct group from all other sequenced viral genomes. All filovirus genomes sequenced to date encode proteins with similar functions and gene order, although there is considerable divergence in sequences between the three genera Ebolavirus, Cuevavirus and Marburgvirus within the family Filoviridae. Whereas all ebolavirus genomes are quite similar (multiple sequences of the same strain are often identical), variation is most common in the intergenic regions and within specific areas of the genes encoding the glycoprotein (GP), nucleoprotein (NP) and polymerase (L). We predict regions that could contain epitope-binding sites, which might be good vaccine targets. This information, combined with glycosylation sites and experimentally determined epitopes, can identify the most promising regions for the development of therapeutic strategies. PMID:26175035

  20. The smallest avian genomes are found in hummingbirds.

    PubMed

    Gregory, T Ryan; Andrews, Chandler B; McGuire, Jimmy A; Witt, Christopher C

    2009-11-01

    It has often been suggested that the genome sizes of birds are constrained relative to other tetrapods owing to the high metabolic demands of powered flight and the link between nuclear DNA content and red blood cell size. This hypothesis predicts that hummingbirds, which engage in energy-intensive hovering flight, will display especially constrained genomes even relative to other birds. We report genome size measurements for 37 species of hummingbirds that confirm this prediction. Our results suggest that genome size was reduced before the divergence of extant hummingbird lineages, and that only minimal additional reduction occurred during hummingbird diversification. Unlike in some other avian taxa, the small amount of variation observed within hummingbirds is not explained by variation in respiratory and flight-related parameters. Unexpectedly, genome size appears to have increased in four unrelated hummingbird species whose distributions are centred on humid forests of the upper-tropical elevational zone on the eastern slope of the Andes. This suggests that the secondary expansion of the genome may have been mediated by biogeographical and demographic effects. PMID:19656792

  1. Phytozome Comparative Plant Genomics Portal

    SciTech Connect

    Goodstein, David; Batra, Sajeev; Carlson, Joseph; Hayes, Richard; Phillips, Jeremy; Shu, Shengqiang; Schmutz, Jeremy; Rokhsar, Daniel

    2014-09-09

    The Dept. of Energy Joint Genome Institute is a genomics user facility supporting DOE mission science in the areas of Bioenergy, Carbon Cycling, and Biogeochemistry. The Plant Program at the JGI applies genomic, analytical, computational and informatics platforms and methods to: (1) understand and accelerate the improvement (domestication) of bioenergy crops; (2) characterize and moderate plant response to climate change; (3) use comparative genomics to identify constrained elements and infer gene function; (4) build high quality genomic resource platforms of JGI Plant Flagship genomes for functional and experimental work; and (5) expand functional genomic resources for Plant Flagship genomes.

  2. Genome Mapping in Plant Comparative Genomics.

    PubMed

    Chaney, Lindsay; Sharp, Aaron R; Evans, Carrie R; Udall, Joshua A

    2016-09-01

    Genome mapping produces fingerprints of DNA sequences to construct a physical map of the whole genome. It provides contiguous, long-range information that complements and, in some cases, replaces sequencing data. Recent advances in genome-mapping technology will better allow researchers to detect large (>1kbp) structural variations between plant genomes. Some molecular and informatics complications need to be overcome for this novel technology to achieve its full utility. This technology will be useful for understanding phenotype responses due to DNA rearrangements and will yield insights into genome evolution, particularly in polyploids. In this review, we outline recent advances in genome-mapping technology, including the processes required for data collection and analysis, and applications in plant comparative genomics.

  3. Endogenous hepadnaviruses in the genome of the budgerigar (Melopsittacus undulatus) and the evolution of avian hepadnaviruses.

    PubMed

    Cui, Jie; Holmes, Edward C

    2012-07-01

    Endogenous hepadnaviruses (hepatitis B viruses [HBVs]) were recently discovered in the genomes of passerine birds. We mined six additional avian genomes and discovered multiple copies of endogenous HBVs in the budgerigar (order Psittaciformes), designated eBHBV. A phylogenetic analysis reveals that the endogenous hepadnaviruses are more diverse than their exogenous counterparts and that the endogenous and exogenous hepadnaviruses form distinct lineages even when sampled from the same avian order, indicative of multiple genomic integration events. PMID:22553337

  4. Endogenous Hepadnaviruses in the Genome of the Budgerigar (Melopsittacus undulatus) and the Evolution of Avian Hepadnaviruses

    PubMed Central

    Cui, Jie

    2012-01-01

    Endogenous hepadnaviruses (hepatitis B viruses [HBVs]) were recently discovered in the genomes of passerine birds. We mined six additional avian genomes and discovered multiple copies of endogenous HBVs in the budgerigar (order Psittaciformes), designated eBHBV. A phylogenetic analysis reveals that the endogenous hepadnaviruses are more diverse than their exogenous counterparts and that the endogenous and exogenous hepadnaviruses form distinct lineages even when sampled from the same avian order, indicative of multiple genomic integration events. PMID:22553337

  5. Complete Genome Sequence of an Avian Bornavirus Isolated from a Healthy Canadian Goose (Branta canadensis).

    PubMed

    Guo, Jianhua; Baroch, John; Randall, Adam; Tizard, Ian

    2013-10-24

    A recent survey among wild birds demonstrated the presence of a unique genotype of avian bornavirus (ABV) in wild geese and swans in North America. Here, we report the first complete genome sequence of an avian bornavirus of the goose genotype.

  6. Comparative genomics of Brassicaceae crops.

    PubMed

    Sharma, Ashutosh; Li, Xiaonan; Lim, Yong Pyo

    2014-05-01

    The family Brassicaceae is one of the major groups of the plant kingdom and comprises diverse species of great economic, agronomic and scientific importance, including the model plant Arabidopsis. The sequencing of the Arabidopsis genome has revolutionized our knowledge in the field of plant biology and provides a foundation in genomics and comparative biology. Genomic resources have been utilized in Brassica for diversity analyses, construction of genetic maps and identification of agronomic traits. In Brassicaceae, comparative sequence analysis across the species has been utilized to understand genome structure, evolution and the detection of conserved genomic segments. In this review, we focus on the progress made in genetic resource development, genome sequencing and comparative mapping in Brassica and related species. The utilization of genomic resources and next-generation sequencing approaches in improvement of Brassica crops is also discussed. PMID:24987286

  7. Complete Genome Sequence of an Avian Paramyxovirus Representative of Putative New Serotype 13.

    PubMed

    Goraichuk, Iryna; Sharma, Poonam; Stegniy, Borys; Muzyka, Denys; Pantin-Jackwood, Mary J; Gerilovych, Anton; Solodiankin, Olexii; Bolotin, Vitaliy; Miller, Patti J; Dimitrov, Kiril M; Afonso, Claudio L

    2016-01-01

    Here, we report the complete genome sequence of a virus of a putative new serotype of avian paramyxovirus (APMV). The virus was isolated from a white-fronted goose in Ukraine in 2011 and designated white-fronted goose/Ukraine/Askania-Nova/48-15-02/2011. The genomic characterization of the isolate suggests that it represents the novel avian paramyxovirus group APMV 13. PMID:27469958

  8. Complete Genome Sequence of an Avian Paramyxovirus Representative of Putative New Serotype 13

    PubMed Central

    Goraichuk, Iryna; Sharma, Poonam; Stegniy, Borys; Muzyka, Denys; Pantin-Jackwood, Mary J.; Gerilovych, Anton; Solodiankin, Olexii; Bolotin, Vitaliy; Miller, Patti J.; Dimitrov, Kiril M.

    2016-01-01

    Here, we report the complete genome sequence of a virus of a putative new serotype of avian paramyxovirus (APMV). The virus was isolated from a white-fronted goose in Ukraine in 2011 and designated white-fronted goose/Ukraine/Askania-Nova/48-15-02/2011. The genomic characterization of the isolate suggests that it represents the novel avian paramyxovirus group APMV 13. PMID:27469958

  9. Cloud computing for comparative genomics

    PubMed Central

    2010-01-01

    Background: Large comparative genomics studies and tools are becoming increasingly compute-expensive as the number of available genome sequences continues to rise. The capacity and cost of local computing infrastructures are likely to become prohibitive with the increase, especially as the breadth of questions continues to rise. Alternative computing architectures, in particular cloud computing environments, may help alleviate this increasing pressure and enable fast, large-scale, and cost-effective comparative genomics strategies going forward. To test this, we redesigned a typical comparative genomics algorithm, the reciprocal smallest distance algorithm (RSD), to run within Amazon's Elastic Computing Cloud (EC2). We then employed the RSD-cloud for ortholog calculations across a wide selection of fully sequenced genomes. Results: We ran more than 300,000 RSD-cloud processes within the EC2. These jobs were farmed simultaneously to 100 high capacity compute nodes using the Amazon Web Service Elastic Map Reduce and included a wide mix of large and small genomes. The total computation took just under 70 hours and cost a total of $6,302 USD. Conclusions: The effort to transform existing comparative genomics algorithms from local compute infrastructures is not trivial. However, the speed and flexibility of cloud computing environments provide a substantial boost with manageable cost. The procedure designed to transform the RSD algorithm into a cloud-ready application is readily adaptable to similar comparative genomics problems. PMID:20482786
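
    The reciprocal-smallest-distance idea named in this abstract can be illustrated with a minimal Python sketch. It assumes that pairwise evolutionary distances between genes of two genomes are already available in a dictionary; the function names and the toy distances are hypothetical, and the real RSD pipeline (sequence search, alignment, distance estimation) and its EC2 deployment are not reproduced here.

```python
from typing import Dict, Optional, Tuple

# Hypothetical precomputed distances keyed by (gene in genome A, gene in genome B).
# In a real RSD run these would be estimated from sequence data; the numbers
# below are made up for illustration only.
Distances = Dict[Tuple[str, str], float]


def closest(query: str, distances: Distances, query_is_a: bool) -> Optional[str]:
    """Return the gene in the other genome with the smallest distance to `query`."""
    best_gene, best_dist = None, float("inf")
    for (gene_a, gene_b), dist in distances.items():
        src, dst = (gene_a, gene_b) if query_is_a else (gene_b, gene_a)
        if src == query and dist < best_dist:
            best_gene, best_dist = dst, dist
    return best_gene


def reciprocal_smallest_distance(distances: Distances) -> Dict[str, str]:
    """Pair gene a with gene b only if each is the other's closest match."""
    orthologs = {}
    for gene_a in sorted({a for a, _ in distances}):
        gene_b = closest(gene_a, distances, query_is_a=True)
        if gene_b is not None and closest(gene_b, distances, query_is_a=False) == gene_a:
            orthologs[gene_a] = gene_b
    return orthologs


toy = {
    ("a1", "b1"): 0.10, ("a1", "b2"): 0.80,
    ("a2", "b1"): 0.30, ("a2", "b2"): 0.90,
}
pairs = reciprocal_smallest_distance(toy)
print(pairs)
```

    In the toy data, a2 is closest to b1, but b1 is closest to a1, so only the a1/b1 pair passes the reciprocal check. Because each genome pair can be processed independently, this kind of calculation lends itself to being farmed out across many compute nodes.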

  10. Molecular characterization of avian polyomavirus isolated from psittacine birds based on the whole genome sequence analysis.

    PubMed

    Katoh, Hiroshi; Ohya, Kenji; Une, Yumi; Yamaguchi, Tsuyoshi; Fukushi, Hideto

    2009-07-01

    Seven avian polyomaviruses (APVs) were isolated from seven psittacine birds of four species. Their whole genome sequences were genetically analyzed. Compared with the sequence of the BFDV1 strain, nucleotide substitutions in the sequences of the seven APV isolates were found at 63 loci, and a high level of conservation of amino acid sequence in each viral protein (VP1, VP2, VP3, VP4, and t/T antigen) was predicted. An A-to-T nucleotide substitution was observed in non-control region of all seven APV sequences in comparison with the BFDV1 strain. Two C-to-T nucleotide substitutions were also detected in non-coding regions of one isolate. A phylogenetic analysis of the whole genome sequences indicated that the sequences from the same species of bird were closely related. APV has been reported to have distinct tropism for cell cultures of various avian species. The present study indicated that a single amino acid substitution at position 221 in VP2 was essential for propagating in chicken embryonic fibroblast culture and this substitution was promoted by propagation on budgerigar embryonic fibroblast culture. For two isolates, three serial amino acids appeared to be deleted in VP4. However, this deletion had little effect on virus propagation.
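
    Counting substitution loci against a reference strain, as described in this abstract, can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the authors' method: it assumes the isolate sequences are already aligned to the reference and have equal length, and the function name and toy sequences are hypothetical.

```python
from typing import Dict, List


def variable_positions(reference: str, isolates: Dict[str, str]) -> List[int]:
    """1-based positions where at least one aligned isolate differs from the reference."""
    for name, seq in isolates.items():
        if len(seq) != len(reference):
            raise ValueError(f"{name} is not aligned to the reference length")
    return [
        i + 1
        for i, ref_base in enumerate(reference)
        if any(seq[i] != ref_base for seq in isolates.values())
    ]


# Made-up toy alignment, not sequences from the study.
reference = "ATGCATGC"
isolates = {"isolate_1": "ATGCTTGC", "isolate_2": "ATGCTTGT"}
sites = variable_positions(reference, isolates)
print(sites)
```

    A position is counted once even when several isolates differ there, which matches counting loci rather than individual substitutions. Gap characters are not treated specially in this sketch, so an insertion or deletion would simply appear as a difference.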

  11. Complete Genome Sequence of an Avian Paramyxovirus Representative of Putative New Serotype 13

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Here, we report the complete genome sequence of a virus of a putative new serotype of avian paramyxovirus (APMV). The virus was isolated from a white-fronted goose in Ukraine in 2011 and designated white-fronted goose/Ukraine/Askania-Nova/48-15-02/2011. The genomic characterization of the isolate s...

  12. Early penguin fossils, plus mitochondrial genomes, calibrate avian evolution.

    PubMed

    Slack, Kerryn E; Jones, Craig M; Ando, Tatsuro; Harrison, G L Abby; Fordyce, R Ewan; Arnason, Ulfur; Penny, David

    2006-06-01

    Testing models of macroevolution, and especially the sufficiency of microevolutionary processes, requires good collaboration between molecular biologists and paleontologists. We report such a test for events around the Late Cretaceous by describing the earliest penguin fossils, analyzing complete mitochondrial genomes from an albatross, a petrel, and a loon, and describe the gradual decline of pterosaurs at the same time modern birds radiate. The penguin fossils comprise four naturally associated skeletons from the New Zealand Waipara Greensand, a Paleocene (early Tertiary) formation just above a well-known Cretaceous/Tertiary boundary site. The fossils, in a new genus (Waimanu), provide a lower estimate of 61-62 Ma for the divergence between penguins and other birds and thus establish a reliable calibration point for avian evolution. Combining fossil calibration points, DNA sequences, maximum likelihood, and Bayesian analysis, the penguin calibrations imply a radiation of modern (crown group) birds in the Late Cretaceous. This includes a conservative estimate that modern sea and shorebird lineages diverged at least by the Late Cretaceous about 74 +/- 3 Ma (Campanian). It is clear that modern birds from at least the latest Cretaceous lived at the same time as archaic birds including Hesperornis, Ichthyornis, and the diverse Enantiornithiformes. Pterosaurs, which also coexisted with early crown birds, show notable changes through the Late Cretaceous. There was a decrease in taxonomic diversity, and small- to medium-sized species disappeared well before the end of the Cretaceous. A simple reading of the fossil record might suggest competitive interactions with birds, but much more needs to be understood about pterosaur life histories. Additional fossils and molecular data are still required to help understand the role of biotic interactions in the evolution of Late Cretaceous birds and thus to test that the mechanisms of microevolution are sufficient to explain

  13. Early penguin fossils, plus mitochondrial genomes, calibrate avian evolution.

    PubMed

    Slack, Kerryn E; Jones, Craig M; Ando, Tatsuro; Harrison, G L Abby; Fordyce, R Ewan; Arnason, Ulfur; Penny, David

    2006-06-01

    Testing models of macroevolution, and especially the sufficiency of microevolutionary processes, requires good collaboration between molecular biologists and paleontologists. We report such a test for events around the Late Cretaceous by describing the earliest penguin fossils, analyzing complete mitochondrial genomes from an albatross, a petrel, and a loon, and describe the gradual decline of pterosaurs at the same time modern birds radiate. The penguin fossils comprise four naturally associated skeletons from the New Zealand Waipara Greensand, a Paleocene (early Tertiary) formation just above a well-known Cretaceous/Tertiary boundary site. The fossils, in a new genus (Waimanu), provide a lower estimate of 61-62 Ma for the divergence between penguins and other birds and thus establish a reliable calibration point for avian evolution. Combining fossil calibration points, DNA sequences, maximum likelihood, and Bayesian analysis, the penguin calibrations imply a radiation of modern (crown group) birds in the Late Cretaceous. This includes a conservative estimate that modern sea and shorebird lineages diverged at least by the Late Cretaceous about 74 +/- 3 Ma (Campanian). It is clear that modern birds from at least the latest Cretaceous lived at the same time as archaic birds including Hesperornis, Ichthyornis, and the diverse Enantiornithiformes. Pterosaurs, which also coexisted with early crown birds, show notable changes through the Late Cretaceous. There was a decrease in taxonomic diversity, and small- to medium-sized species disappeared well before the end of the Cretaceous. A simple reading of the fossil record might suggest competitive interactions with birds, but much more needs to be understood about pterosaur life histories. Additional fossils and molecular data are still required to help understand the role of biotic interactions in the evolution of Late Cretaceous birds and thus to test that the mechanisms of microevolution are sufficient to explain

  14. Comparative genomics for biodiversity conservation.

    PubMed

    Grueber, Catherine E

    2015-01-01

    Genomic approaches are gathering momentum in biology and emerging opportunities lie in the creative use of comparative molecular methods for revealing the processes that influence diversity of wildlife. However, few comparative genomic studies are performed with explicit and specific objectives to aid conservation of wild populations. Here I provide a brief overview of comparative genomic approaches that offer specific benefits to biodiversity conservation. Because conservation examples are few, I draw on research from other areas to demonstrate how comparing genomic data across taxa may be used to inform the characterisation of conservation units and studies of hybridisation, as well as studies that provide conservation outcomes from a better understanding of the drivers of divergence. A comparative approach can also provide valuable insight into the threatening processes that impact rare species, such as emerging diseases and their management in conservation. In addition to these opportunities, I note areas where additional research is warranted. Overall, comparing and contrasting the genomic composition of threatened and other species provide several useful tools for helping to preserve the molecular biodiversity of the global ecosystem.

  15. Comparative genomics for biodiversity conservation

    PubMed Central

    Grueber, Catherine E.

    2015-01-01

    Genomic approaches are gathering momentum in biology and emerging opportunities lie in the creative use of comparative molecular methods for revealing the processes that influence diversity of wildlife. However, few comparative genomic studies are performed with explicit and specific objectives to aid conservation of wild populations. Here I provide a brief overview of comparative genomic approaches that offer specific benefits to biodiversity conservation. Because conservation examples are few, I draw on research from other areas to demonstrate how comparing genomic data across taxa may be used to inform the characterisation of conservation units and studies of hybridisation, as well as studies that provide conservation outcomes from a better understanding of the drivers of divergence. A comparative approach can also provide valuable insight into the threatening processes that impact rare species, such as emerging diseases and their management in conservation. In addition to these opportunities, I note areas where additional research is warranted. Overall, comparing and contrasting the genomic composition of threatened and other species provide several useful tools for helping to preserve the molecular biodiversity of the global ecosystem. PMID:26106461

  16. Complete Genome Sequence of the Avian Paramyxovirus Serotype 5 Strain APMV-5/budgerigar/Japan/TI/75

    PubMed Central

    Hiono, Takahiro; Matsuno, Keita; Tuchiya, Kotaro; Lin, Zhifeng; Okamatsu, Masatoshi

    2016-01-01

    Here, we report the complete genome sequence of the avian paramyxovirus serotype 5 strain APMV-5/budgerigar/Japan/TI/75, which was determined using the Illumina MiSeq platform. The determined sequence shares 97% homology and similar genetic features with the previously known genome sequence of avian paramyxovirus serotype 5 strain APMV-5/budgerigar/Japan/Kunitachi/74. PMID:27660785

  17. Complete Genome Sequence of the Avian Paramyxovirus Serotype 5 Strain APMV-5/budgerigar/Japan/TI/75.

    PubMed

    Hiono, Takahiro; Matsuno, Keita; Tuchiya, Kotaro; Lin, Zhifeng; Okamatsu, Masatoshi; Sakoda, Yoshihiro

    2016-01-01

    Here, we report the complete genome sequence of the avian paramyxovirus serotype 5 strain APMV-5/budgerigar/Japan/TI/75, which was determined using the Illumina MiSeq platform. The determined sequence shares 97% homology and similar genetic features with the previously known genome sequence of avian paramyxovirus serotype 5 strain APMV-5/budgerigar/Japan/Kunitachi/74. PMID:27660785

  18. Genomic Signatures for Avian H7N9 Viruses Adapting to Humans.

    PubMed

    Chen, Guang-Wu; Kuo, Shu-Ming; Yang, Shu-Li; Gong, Yu-Nong; Hsiao, Mei-Ren; Liu, Yi-Chun; Shih, Shin-Ru; Tsao, Kuo-Chien

    2016-01-01

    An avian influenza A H7N9 virus emerged in March 2013 and caused a remarkable number of human fatalities. Genome variability in these viruses may provide insights into host adaptability. We scanned over 140 genomes of the H7N9 viruses isolated from humans and identified 104 positions that exhibited seven or more amino acid substitutions. Approximately half of these substitutions were identified in the influenza ribonucleoprotein (RNP) complex. Although PB2 627K of the avian virus promotes replication in humans, 45 of the 147 investigated PB2 sequences retained the E signature at this position, which is an avian characteristic. We discovered 10 PB2 substitutions that covaried with K627E. An RNP activity assay showed that Q591K, D701N, and M535L restored the polymerase activity in human cells when 627K transformed to an avian-like E. Genomic analysis of the human-isolated avian influenza virus is crucial in assessing genome variability, because relationships between position-specific variations can be observed and explored. In this study, we observed alternative positions that can potentially compensate for PB2 627K, a well-known marker for cross-species infection. An RNP assay suggested Q591K, D701N, and M535L as potential markers for an H7N9 virus capable of infecting humans.

  19. Comparative genome mapping in Brassica.

    PubMed

    Lagercrantz, U; Lydiate, D J

    1996-12-01

    A Brassica nigra genetic linkage map was developed from a highly polymorphic cross analyzed with a set of low copy number Brassica RFLP probes. The Brassica genome is extensively duplicated with eight distinct sets of chromosomal segments, each present in three copies, covering virtually the whole genome. Thus, B. nigra could be descended from a hexaploid ancestor. A comparative analysis of B. nigra, B. oleracea and B. rapa genomes, based on maps developed using a common set of RFLP probes, was also performed. The three genomes have distinct chromosomal structures differentiated by a large number of rearrangements, but collinear regions involving virtually the whole of each of the three genomes were identified. The genic contents of B. nigra, B. oleracea and B. rapa were basically equivalent and differences in chromosome number (8, 9 and 10, respectively) are probably the result of chromosome fusions and/or fissions. The strong conservation of overall genic content across the three Brassica genomes mirrors the conservation of genic content observed over a much longer evolutionary span in cereals. However, the rate of chromosomal rearrangement in crucifers is much higher than that observed in cereal genomes.

  20. Comparative Genomic Analysis Shows That Avian Pathogenic Escherichia coli Isolate IMT5155 (O2:K1:H5; ST Complex 95, ST140) Shares Close Relationship with ST95 APEC O1:K1 and Human ExPEC O18:K1 Strains

    PubMed Central

    Pan, Zihao; Hu, Lin; Wang, Shaohui; Wang, Haojin; Leung, Frederick C.; Dai, Jianjun; Fan, Hongjie

    2014-01-01

    Avian pathogenic E. coli and human extraintestinal pathogenic E. coli serotypes O1, O2 and O18 strains isolated from different hosts are generally located in phylogroup B2 and ST complex 95, and they share similar genetic characteristics and pathogenicity, with no or minimal host specificity. They are popular objects for the study of ExPEC genetic characteristics and pathogenesis in recent years. Here, we investigated the evolution and genetic blueprint of APEC pathotype by performing phylogenetic and comparative genome analysis of avian pathogenic E. coli strain IMT5155 (O2:K1:H5; ST complex 95, ST140) with other E. coli pathotypes. Phylogeny analyses indicated that IMT5155 has closest evolutionary relationship with APEC O1, IHE3034, and UTI89. Comparative genomic analysis showed that IMT5155 and APEC O1 shared significant genetic overlap/similarities with human ExPEC dominant O18:K1 strains (IHE3034 and UTI89). Furthermore, the unique PAI I5155 (GI-12) was identified and found to be conserved in APEC O2 serotype isolates. GI-7 and GI-16 encoding two typical T6SSs in IMT5155 might be useful markers for the identification of ExPEC dominant serotypes (O1, O2, and O18) strains. IMT5155 contained a ColV plasmid p1ColV5155, which defined the APEC pathotype. The distribution analysis of 10 sequenced ExPEC pan-genome virulence factors among 47 sequenced E. coli strains provided meaningful information for B2 APEC/ExPEC-specific virulence factors, including several adhesins, invasins, toxins, iron acquisition systems, and so on. The pathogenicity tests of IMT5155 and other APEC O1:K1 and O2:K1 serotypes strains (isolated in China) through four animal models showed that they were highly virulent for avian colisepticemia and able to cause septicemia and meningitis in neonatal rats, suggesting zoonotic potential of these APEC O1:K1 and O2:K1 isolates. PMID:25397580

  1. Comparative genomic analyses in Asparagus.

    PubMed

    Kuhl, Joseph C; Havey, Michael J; Martin, William J; Cheung, Foo; Yuan, Qiaoping; Landherr, Lena; Hu, Yi; Leebens-Mack, James; Town, Christopher D; Sink, Kenneth C

    2005-12-01

    Garden asparagus (Asparagus officinalis L.) belongs to the monocot family Asparagaceae in the order Asparagales. Onion (Allium cepa L.) and Asparagus officinalis are 2 of the most economically important plants of the core Asparagales, a well supported monophyletic group within the Asparagales. Coding regions in onion have lower GC contents than the grasses. We compared the GC content of 3374 unique expressed sequence tags (ESTs) from A. officinalis with Lycoris longituba and onion (both members of the core Asparagales), Acorus americanus (sister to all other monocots), the grasses, and Arabidopsis. Although ESTs in A. officinalis and Acorus had a higher average GC content than Arabidopsis, Lycoris, and onion, all were clearly lower than the grasses. The Asparagaceae have the smallest nuclear genomes among all plants in the core Asparagales, which typically have huge genomes. Within the Asparagaceae, European Asparagus species have approximately twice the nuclear DNA of that of southern African Asparagus species. We cloned and sequenced 20 genomic amplicons from European A. officinalis and the southern African species Asparagus plumosus and observed no clear evidence for a recent genome doubling in A. officinalis relative to A. plumosus. These results indicate that members of the genus Asparagus with smaller genomes may be useful genomic models for plants in the core Asparagales. PMID:16391674
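
    The GC-content comparison described in this abstract rests on a simple per-sequence calculation, sketched below in Python. The function names and toy sequences are hypothetical, and skipping ambiguous bases such as N is an assumption of this sketch rather than something the abstract specifies.

```python
from typing import Sequence


def gc_content(sequence: str) -> float:
    """Fraction of G and C among the unambiguous bases (A, C, G, T) of a sequence."""
    seq = sequence.upper()
    gc = seq.count("G") + seq.count("C")
    total = gc + seq.count("A") + seq.count("T")
    if total == 0:
        raise ValueError("sequence contains no A, C, G, or T bases")
    return gc / total


def mean_gc(sequences: Sequence[str]) -> float:
    """Average of the per-sequence GC fractions across a set of ESTs."""
    return sum(gc_content(s) for s in sequences) / len(sequences)


# Made-up toy sequences, not ESTs from the study.
toy_ests = ["ATGCGC", "ATATAT", "GGCCNN"]
print(f"mean GC fraction: {mean_gc(toy_ests):.3f}")
```

    Averaging per-sequence fractions weights every EST equally regardless of length; pooling all bases before dividing is an alternative that weights longer sequences more heavily.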

  2. Enhancer Identification through Comparative Genomics

    SciTech Connect

    Visel, Axel; Bristow, James; Pennacchio, Len A.

    2006-10-01

    With the availability of genomic sequence from numerous vertebrates, a paradigm shift has occurred in the identification of distant-acting gene regulatory elements. In contrast to traditional gene-centric studies in which investigators randomly scanned genomic fragments that flank genes of interest in functional assays, the modern approach begins electronically with publicly available comparative sequence datasets that provide investigators with prioritized lists of putative functional sequences based on their evolutionary conservation. However, although a large number of tools and resources are now available, application of comparative genomic approaches remains far from trivial. In particular, it requires users to dynamically consider the species and methods for comparison depending on the specific biological question under investigation. While there is currently no single general rule to this end, it is clear that when applied appropriately, comparative genomic approaches exponentially increase our power in generating biological hypotheses for subsequent experimental testing.

  3. Avian Polyomavirus Genome Sequences Recovered from Parrots in Captive Breeding Facilities in Poland

    PubMed Central

    Dayaram, Anisha; Piasecki, Tomasz; Chrząstek, Klaudia; White, Robyn; Julian, Laurel; van Bysterveldt, Katherine

    2015-01-01

    Eight genomes of avian polyomaviruses (APVs) were recovered and sequenced from deceased Psittacula eupatria, Psittacula krameri, and Melopsittacus undulatus from various breeding facilities in Poland. Of these APV-positive samples, six had previously tested positive for beak and feather disease virus (BFDV) and/or parrot hepatitis B virus (PHBV). PMID:26404592

  4. Avian Polyomavirus Genome Sequences Recovered from Parrots in Captive Breeding Facilities in Poland.

    PubMed

    Dayaram, Anisha; Piasecki, Tomasz; Chrząstek, Klaudia; White, Robyn; Julian, Laurel; van Bysterveldt, Katherine; Varsani, Arvind

    2015-01-01

    Eight genomes of avian polyomaviruses (APVs) were recovered and sequenced from deceased Psittacula eupatria, Psittacula krameri, and Melopsittacus undulatus from various breeding facilities in Poland. Of these APV-positive samples, six had previously tested positive for beak and feather disease virus (BFDV) and/or parrot hepatitis B virus (PHBV). PMID:26404592

  5. Complete Genome Sequence of a Brazil-Type Avian coronavirus Detected in a Chicken

    PubMed Central

    Ayres, Giselle R. R.; Torres, Carolina A.; Villarreal, Laura Y. B.; Hora, Aline S.; Taniwaki, Sueli A.

    2016-01-01

    Avian coronavirus is the causative agent of infectious bronchitis in chickens, leading to multisystemic disease that might be controlled if adequate vaccine strains are used. This paper reports the first complete genome sequence of a Brazil type of this virus (27,615 nucleotides [nt]) isolated from the kidneys of a chicken. PMID:27738043

  6. Genomic organization of the crested ibis MHC provides new insight into ancestral avian MHC structure

    PubMed Central

    Chen, Li-Cheng; Lan, Hong; Sun, Li; Deng, Yan-Li; Tang, Ke-Yi; Wan, Qiu-Hong

    2015-01-01

    The major histocompatibility complex (MHC) plays an important role in immune response. Avian MHCs are not well characterized; only the highly compact Galliformes MHCs and the extensively fragmented zebra finch MHC have been reported. We report the first genomic structure of an endangered Pelecaniformes (crested ibis) MHC containing 54 genes in three regions spanning ~500 kb. In contrast to the loose BG (26 loci within 265 kb) and Class I (11 within 150 kb) genomic structures, the Core Region is condensed (17 within 85 kb). Furthermore, this Region exhibits a COL11A2 gene, followed by four tandem MHC class II αβ dyads retaining two suites of anciently duplicated “αβ” lineages. Thus, the crested ibis MHC structure is entirely different from the known avian MHC architectures but similar to that of mammalian MHCs, suggesting that the fundamental structure of ancestral avian class II MHCs should be “COL11A2-IIαβ1-IIαβ2.” The gene structures, residue characteristics, and expression levels of the five class I genes reveal inter-locus functional divergence. However, phylogenetic analysis indicates that these five genes generate a well-supported intra-species clade, showing evidence for recent duplications. Our analyses suggest dramatic structural variation among avian MHC lineages, help elucidate avian MHC evolution, and provide a foundation for future conservation studies. PMID:25608659
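
    The contrast between the "loose" and "condensed" regions in this abstract can be made concrete with a short worked calculation of locus density. The Python sketch below uses only the counts and spans quoted above; reading the Class I and Core Region spans as kilobases is an assumption, supported by the three spans summing to the stated ~500 kb, and the variable names are illustrative.

```python
# (locus count, span in kb) per region, as quoted in the abstract.
# The kb unit for the Class I and Core spans is assumed, not stated.
regions = {"BG": (26, 265), "Class I": (11, 150), "Core": (17, 85)}

# Loci per kilobase for each region.
density = {name: loci / span_kb for name, (loci, span_kb) in regions.items()}

total_loci = sum(loci for loci, _ in regions.values())
total_span_kb = sum(span_kb for _, span_kb in regions.values())

for name, value in density.items():
    print(f"{name}: {value:.3f} loci per kb")
print(f"total: {total_loci} loci over {total_span_kb} kb")
```

    Under these assumptions the Core Region has about 0.2 loci per kb, roughly twice the density of the BG region and nearly three times that of the Class I region, and the totals agree with the 54 genes and ~500 kb given in the abstract.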

  7. Prevalence of Avian-Pathogenic Escherichia coli Strain O1 Genomic Islands among Extraintestinal and Commensal E. coli Isolates

    PubMed Central

    Johnson, Timothy J.; Wannemuehler, Yvonne; Kariyawasam, Subhashinie; Johnson, James R.; Logue, Catherine M.

    2012-01-01

    Escherichia coli strains that cause disease outside the intestine are known as extraintestinal pathogenic E. coli (ExPEC) and include pathogens of humans and animals. Previously, the genome of avian-pathogenic E. coli (APEC) O1:K1:H7 strain O1, from ST95, was sequenced and compared to those of several other E. coli strains, identifying 43 genomic islands. Here, the genomic islands of APEC O1 were compared to those of other sequenced E. coli strains, and the distribution of 81 genes belonging to 12 APEC O1 genomic islands among 828 human and avian ExPEC and commensal E. coli isolates was determined. Multiple islands were highly prevalent among isolates belonging to the O1 and O18 serogroups within phylogenetic group B2, which are implicated in human neonatal meningitis. Because of the extensive genomic similarities between APEC O1 and other human ExPEC strains belonging to the ST95 phylogenetic lineage, its ability to cause disease in a rat model of sepsis and meningitis was assessed. Unlike other ST95 lineage strains, APEC O1 was unable to cause bacteremia or meningitis in the neonatal rat model and was significantly less virulent than uropathogenic E. coli (UPEC) CFT073 in a mouse sepsis model, despite carrying multiple neonatal meningitis E. coli (NMEC) virulence factors and belonging to the ST95 phylogenetic lineage. These results suggest that host adaptation or genome modifications have occurred either in APEC O1 or in highly virulent ExPEC isolates, resulting in differences in pathogenicity. Overall, the genomic islands examined provide targets for further discrimination of the different ExPEC subpathotypes, serogroups, phylogenetic types, and sequence types. PMID:22467781

  8. Comparative susceptibility of avian species to low pathogenic avian influenza viruses of the H13 subtype

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Gulls are widely recognized reservoirs for low pathogenic avian influenza (LPAI) viruses; however, the subtypes maintained in these populations and/or the transmission mechanisms involved are poorly understood. Although a wide diversity of influenza viruses has been isolated from gulls, two hemag...

  9. Avian Astrovirus

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Avian astroviruses comprise a diverse group of viruses affecting many avian species and causing enteritis, hepatitis and nephritis. To date, six different astroviruses have been identified in avian species based on the species of origin and viral genome characteristics: two turkey-origin astroviru...

  10. Comparative genomics of mycobacterial proteases.

    PubMed

    Ribeiro-Guimarães, Michelle Lopes; Pessolani, Maria Cristina Vidal

    2007-01-01

    Although proteases are recognized as important virulence factors in pathogenic microorganisms, little information is available so far regarding the potential role of these enzymes in diseases caused by mycobacteria. Here we use bioinformatic tools to compare the protease-coding genes present in the genomes of Mycobacterium leprae, Mycobacterium tuberculosis, Mycobacterium bovis and Mycobacterium avium paratuberculosis. This analysis allowed a review of the nomenclature of the protease family present in mycobacteria. Special attention was devoted to the 'decaying genome' of M. leprae, where a relatively high level of conservation of protease-coding genes was observed when compared to other gene families. A total of 39 genes out of the 49 found in M. bovis were identified in M. leprae. Of relevance, a core of 38 well-conserved protease genes shared by the four species was defined. This set of proteases is probably essential for survival in the host and disease outcome and may constitute novel targets for drug development leading to a more effective control of mycobacterial diseases.

  12. Comparative primate genomics: emerging patterns of genome content and dynamics.

    PubMed

    Rogers, Jeffrey; Gibbs, Richard A

    2014-05-01

    Advances in genome sequencing technologies have created new opportunities for comparative primate genomics. Genome assemblies have been published for various primate species, and analyses of several others are underway. Whole-genome assemblies for the great apes provide remarkable new information about the evolutionary origins of the human genome and the processes involved. Genomic data for macaques and other non-human primates offer valuable insights into genetic similarities and differences among species that are used as models for disease-related research. This Review summarizes current knowledge regarding primate genome content and dynamics, and proposes a series of goals for the near future. PMID:24709753

  13. Comparative primate genomics: emerging patterns of genome content and dynamics

    PubMed Central

    Rogers, Jeffrey; Gibbs, Richard A.

    2014-01-01

    Preface: Advances in genome sequencing technologies have created new opportunities for comparative primate genomics. Genome assemblies have been published for several primates, with analyses of several others underway. Whole genome assemblies for the great apes provide remarkable new information about the evolutionary origins of the human genome and the processes involved. Genomic data for macaques and other nonhuman primates provide valuable insight into genetic similarities and differences among species used as models for disease-related research. This review summarizes current knowledge regarding primate genome content and dynamics and offers a series of goals for the near future. PMID:24709753

  14. Complete genome sequences of an avian orthoreovirus isolated from Guangxi, China.

    PubMed

    Teng, Liqiong; Xie, Zhixun; Xie, Liji; Liu, Jiabo; Pang, Yaoshan; Deng, Xianwen; Xie, Zhiqin; Fan, Qing; Luo, Sisi

    2013-01-01

    We report the complete genomic sequences of an avian orthoreovirus, strain GuangxiR1, isolated from a chicken flock in Guangxi Province, southern China, in 2000. Phylogenetic analyses suggest that the strain is closely related to the S1133 strain, which is associated with tenosynovitis, but is far different from strain AVS-B, which is associated with runting-stunting syndrome in broilers. PMID:23846280

  15. Comparative genomics of Listeria species.

    PubMed

    Glaser, P; Frangeul, L; Buchrieser, C; Rusniok, C; Amend, A; Baquero, F; Berche, P; Bloecker, H; Brandt, P; Chakraborty, T; Charbit, A; Chetouani, F; Couvé, E; de Daruvar, A; Dehoux, P; Domann, E; Domínguez-Bernal, G; Duchaud, E; Durant, L; Dussurget, O; Entian, K D; Fsihi, H; García-del Portillo, F; Garrido, P; Gautier, L; Goebel, W; Gómez-López, N; Hain, T; Hauf, J; Jackson, D; Jones, L M; Kaerst, U; Kreft, J; Kuhn, M; Kunst, F; Kurapkat, G; Madueno, E; Maitournam, A; Vicente, J M; Ng, E; Nedjari, H; Nordsiek, G; Novella, S; de Pablos, B; Pérez-Diaz, J C; Purcell, R; Remmel, B; Rose, M; Schlueter, T; Simoes, N; Tierrez, A; Vázquez-Boland, J A; Voss, H; Wehland, J; Cossart, P

    2001-10-26

    Listeria monocytogenes is a food-borne pathogen with a high mortality rate that has also emerged as a paradigm for intracellular parasitism. We present and compare the genome sequences of L. monocytogenes (2,944,528 base pairs) and a nonpathogenic species, L. innocua (3,011,209 base pairs). We found a large number of predicted genes encoding surface and secreted proteins, transporters, and transcriptional regulators, consistent with the ability of both species to adapt to diverse environments. The presence of 270 L. monocytogenes and 149 L. innocua strain-specific genes (clustered in 100 and 63 islets, respectively) suggests that virulence in Listeria results from multiple gene acquisition and deletion events.

  16. Conservation and Losses of Non-Coding RNAs in Avian Genomes

    PubMed Central

    Gardner, Paul P.; Fasold, Mario; Burge, Sarah W.; Ninova, Maria; Hertel, Jana; Kehr, Stephanie; Steeves, Tammy E.; Griffiths-Jones, Sam; Stadler, Peter F.

    2015-01-01

    Here we present the results of a large-scale bioinformatics annotation of non-coding RNA loci in 48 avian genomes. Our approach uses probabilistic models of hand-curated families from the Rfam database to infer conserved RNA families within each avian genome. We supplement these annotations with predictions from the tRNA annotation tool, tRNAscan-SE and microRNAs from miRBase. We identify 34 lncRNA-associated loci that are conserved between birds and mammals and validate 12 of these in chicken. We report several intriguing cases where a reported mammalian lncRNA, but not its function, is conserved. We also demonstrate extensive conservation of classical ncRNAs (e.g., tRNAs) and more recently discovered ncRNAs (e.g., snoRNAs and miRNAs) in birds. Furthermore, we describe numerous “losses” of several RNA families, and attribute these to either genuine loss, divergence or missing data. In particular, we show that many of these losses are due to the challenges associated with assembling avian microchromosomes. These combined results illustrate the utility of applying homology-based methods for annotating novel vertebrate genomes. PMID:25822729

  17. Comparative Analysis of Uninhibited and Constrained Avian Wing Aerodynamics

    NASA Astrophysics Data System (ADS)

    Cox, Jordan A.

    The flight of birds has intrigued and motivated man for many years. Bird flight served as the primary inspiration of flying machines developed by Leonardo Da Vinci, Otto Lilienthal, and even the Wright brothers. Avian flight has once again drawn the attention of the scientific community as unmanned aerial vehicles (UAV) are not only becoming more popular, but smaller. Birds are once again influencing the designs of aircraft. Small UAVs operating within flight conditions and low Reynolds numbers common to birds are not yet capable of the high levels of control and agility that birds display with ease. Many researchers believe the potential to improve small UAV performance can be obtained by applying features common to birds such as feathers and flapping flight to small UAVs. Although the effects of feathers on a wing have received some attention, the effects of localized transient feather motion and surface geometry on the flight performance of a wing have been largely overlooked. In this research, the effects of freely moving feathers on a preserved red tailed hawk wing were studied. A series of experiments were conducted to measure the aerodynamic forces on a hawk wing with varying levels of feather movement permitted. Angle of attack and air speed were varied within the natural flight envelope of the hawk. Subsequent identical tests were performed with the feather motion constrained through the use of externally-applied surface treatments. Additional tests involved the study of an absolutely fixed geometry mold-and-cast wing model of the original bird wing. Final tests were also performed after applying surface coatings to the cast wing. High speed videos taken during tests revealed the extent of the feather movement between wing models. Images of the microscopic surface structure of each wing model were analyzed to establish variations in surface geometry between models. Recorded aerodynamic forces were then compared to the known feather motion and surface

  18. The comparison of pathology in ferrets infected by H9N2 avian influenza viruses with different genomic features.

    PubMed

    Gao, Rongbao; Bai, Tian; Li, Xiaodan; Xiong, Ying; Huang, Yiwei; Pan, Ming; Zhang, Ye; Bo, Hong; Zou, Shumei; Shu, Yuelong

    2016-01-15

    H9N2 avian influenza virus circulates widely in poultry and has been responsible for sporadic human infections in several regions. Few studies have been conducted on the pathogenicity of H9N2 AIV isolates that have different genomic features. We compared the pathology induced by a novel reassortant H9N2 virus and two currently circulating H9N2 viruses that have different genomic features in ferrets. The results showed that the three viruses can induce infections with various amounts of viral shedding in ferrets. The novel H9N2 induced respiratory infection, but no pathological lesions were observed in lung tissues. The other two viruses induced mild to intermediate pathological lesions in lung tissues, although the clinical signs presented mildly in ferrets. The pathological lesions presented a diversity consistent with viral replication in ferrets. PMID:26638019

  20. Interspecies transmission and limited persistence of low pathogenic avian influenza genomes among Alaska dabbling ducks

    USGS Publications Warehouse

    Reeves, Andrew B.; Pearce, John M.; Ramey, Andy M.; Meixell, Brandt; Runstadler, Jonathan A.

    2011-01-01

    The reassortment and geographic distribution of low pathogenic avian influenza (LPAI) virus genes are well documented, but little is known about the persistence of intact LPAI genomes among species and locations. To examine persistence of entire LPAI genome constellations in Alaska, we calculated the genetic identities among 161 full-genome LPAI viruses isolated across 4 years from five species of duck: northern pintail (Anas acuta), mallard (Anas platyrhynchos), American green-winged teal (Anas crecca), northern shoveler (Anas clypeata) and American wigeon (Anas americana). Based on pairwise genetic distance, highly similar LPAI genomes (>99 percent identity) were observed within and between species and across a range of geographic distances (up to and >1000 km), but most often between isolates collected 0-10 km apart. Highly similar viruses were detected between years, suggesting inter-annual persistence, but these were rare in our data set with the majority occurring within 0-9 days of sampling. These results identify LPAI transmission pathways in the context of species, space and time, an initial perspective into the extent of regional virus distribution and persistence, and insight into why no completely Eurasian genomes have ever been detected in Alaska. Such information will be useful in forecasting the movement of foreign-origin avian influenza strains should they be introduced to North America.
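
    The pairwise comparison described in this abstract (flagging genome pairs above 99 percent identity) might look roughly like the sketch below. It is not the authors' pipeline: it assumes the genomes are already aligned to equal length, skips gap and N positions, and uses toy sequences and hypothetical isolate names rather than real data.

```python
from itertools import combinations

# Alignment symbols ignored when counting positions (an assumption of this sketch).
SKIP = {"-", "N"}


def percent_identity(seq_a, seq_b):
    """Percent of comparable aligned positions at which two sequences agree."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in SKIP or b in SKIP:
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


def highly_similar_pairs(genomes, threshold=99.0):
    """Return (name_1, name_2, identity) for every pair above the threshold."""
    pairs = []
    for (name_1, seq_1), (name_2, seq_2) in combinations(sorted(genomes.items()), 2):
        identity = percent_identity(seq_1, seq_2)
        if identity > threshold:
            pairs.append((name_1, name_2, identity))
    return pairs


# Toy 200-base "genomes"; the names and sequences are invented for illustration.
base = "ACGT" * 50
toy_genomes = {
    "isolate_A": base,
    "isolate_B": "C" + base[1:],        # one substitution relative to isolate_A
    "isolate_C": "T" * 10 + base[10:],  # several substitutions near the start
}

for name_1, name_2, identity in highly_similar_pairs(toy_genomes):
    print(f"{name_1} vs {name_2}: {identity:.1f}% identity")
```

    A real analysis would also carry sampling location and date for each isolate so that highly similar pairs can be binned by distance and time, as the abstract describes.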

  1. The Chlamydia psittaci Genome: A Comparative Analysis of Intracellular Pathogens

    PubMed Central

    Saluz, Hans Peter

    2012-01-01

    Background: Chlamydiaceae are a family of obligate intracellular pathogens causing a wide range of diseases in animals and humans, and facing unique evolutionary constraints not encountered by free-living prokaryotes. To investigate genomic aspects of infection, virulence and host preference we have sequenced Chlamydia psittaci, the pathogenic agent of ornithosis. Results: A comparison of the genome of the avian Chlamydia psittaci isolate 6BC with the genomes of other chlamydial species, C. trachomatis, C. muridarum, C. pneumoniae, C. abortus, C. felis and C. caviae, revealed a high level of sequence conservation and synteny across taxa, with the major exception of the human pathogen C. trachomatis. Important differences manifest in the polymorphic membrane protein family specific for the Chlamydiae and in the highly variable chlamydial plasticity zone. We identified a number of psittaci-specific polymorphic membrane proteins of the G family that may be related to differences in host-range and/or virulence as compared to closely related Chlamydiaceae. We calculated non-synonymous to synonymous substitution rate ratios for pairs of orthologous genes to identify putative targets of adaptive evolution and predicted type III secreted effector proteins. Conclusions: This study is the first detailed analysis of the Chlamydia psittaci genome sequence. It provides insights in the genome architecture of C. psittaci and proposes a number of novel candidate genes mostly of yet unknown function that may be important for pathogen-host interactions. PMID:22506068

  3. Exploring novel candidate genes from the Mouse Genome Informatics database: Potential implications for avian migration research.

    PubMed

    Contina, Andrea; Bridge, Eli S; Kelly, Jeffrey F

    2016-07-01

    To search for genes associated with migratory phenotypes in songbirds, we selected candidate genes through annotations from the Mouse Genome Informatics database and assembled an extensive candidate-gene library. Then, we implemented a next-generation sequencing approach to obtain DNA sequences from the Painted Bunting genome. We focused on those sequences that were conserved across avian species and that aligned with candidate genes in our mouse library. We genotyped short sequence repeats from the following candidate genes: ADRA1d, ANKRD17, CISH and MYH7. We studied the possible correlations between allelic variations occurring in these novel candidate migration genes and avian migratory phenotypes available from the published literature. We found that allele variation at MYH7 correlated with a calculated index of speed of migration (km/day) across 11 species of songbirds. We highlight the potential of the Mouse Genome Informatics database in providing new candidate genes that might play a crucial role in regulating migration in birds and possibly in other taxa. Our research effort shows the benefits and limitations of working with extensive genomic datasets and offers a snapshot of the challenges related to cross-species validation in behavioral and molecular ecology studies. PMID:27061206

  4. Complete Genome Sequence of Avian Tembusu-Related Virus Strain WR Isolated from White Kaiya Ducks in Fujian, China

    PubMed Central

    Wan, Chunhe; Fu, Guanghua; Shi, Shaohua; Cheng, Longfei; Chen, Hongmei

    2012-01-01

    Avian tembusu-related virus, which was first identified in China, is an emerging virus causing serious economic loss to the Chinese poultry industry. We report here the complete genome sequences of avian tembusu-related virus strain WR, isolated from a White Kaiya duck with disease characterized by an abrupt decrease in egg laying with ovarian hemorrhage, which will help in further understanding the molecular and evolutionary characteristics and pathogenesis of avian tembusu-related virus, the new flavivirus affecting ducks in Southern China. PMID:22966199

  5. Genetic architecture dissection by genome-wide association analysis reveals avian eggshell ultrastructure traits

    PubMed Central

    Duan, Zhongyi; Sun, Congjiao; Shen, ManMan; Wang, Kehua; Yang, Ning; Zheng, Jiangxia; Xu, Guiyun

    2016-01-01

    The ultrastructure of an eggshell is considered the major determinant of eggshell quality, which has biological and economic significance for the avian and poultry industries. However, the interrelationships and genome-wide architecture of eggshell ultrastructure remain to be elucidated. Herein, we measured eggshell thickness (EST), effective layer thickness (ET), mammillary layer thickness (MT), and mammillary density (MD) and conducted genome-wide association studies in 927 F2 hens. The SNP-based heritabilities of eggshell ultrastructure traits were estimated to be 0.39, 0.36, 0.17 and 0.19 for EST, ET, MT and MD, respectively, and a total of 719, 784, 1 and 10 genome-wide significant SNPs were associated with EST, ET, MT and MD, respectively. ABCC9, ITPR2, KCNJ8 and WNK1, which are involved in ion transport, were suggested to be the key genes regulating EST and ET. ITM2C and KNDC1 likely affect MT and MD, respectively. Additionally, there were linear relationships between the chromosome lengths and the variance explained per chromosome for EST (R² = 0.57) and ET (R² = 0.67). In conclusion, the interrelationships and genetic architecture of eggshell ultrastructure traits revealed in this study are valuable for our understanding of the avian eggshell and contribute to research on a variety of other calcified shells. PMID:27456605
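
    The R² values quoted in this abstract summarize a simple linear regression of variance explained per chromosome on chromosome length. A minimal sketch of that calculation follows; the function name `linear_fit_r2` and the input numbers are placeholders invented for illustration, not values from the study.

```python
def linear_fit_r2(x, y):
    """Ordinary least-squares fit of y on x; returns (slope, intercept, R squared)."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equal-length series with at least two points")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # R squared = 1 - residual sum of squares / total sum of squares
    ss_res = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return slope, intercept, 1.0 - ss_res / ss_tot


# Placeholder inputs (hypothetical): chromosome lengths in Mb and the share of
# trait variance attributed to each chromosome.
chromosome_length_mb = [20.0, 40.0, 60.0, 90.0, 150.0, 195.0]
variance_explained = [0.4, 1.1, 0.9, 2.0, 2.6, 3.9]

slope, intercept, r_squared = linear_fit_r2(chromosome_length_mb, variance_explained)
print(f"slope={slope:.4f}, intercept={intercept:.4f}, R^2={r_squared:.2f}")
```

    An R² near 1 would mean that longer chromosomes explain proportionally more variance, which is the pattern expected for a trait influenced by many loci spread across the genome.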

  7. Comparative Reannotation of 21 Aspergillus Genomes

    SciTech Connect

    Salamov, Asaf; Riley, Robert; Kuo, Alan; Grigoriev, Igor

    2013-03-08

    We used comparative gene modeling to reannotate 21 Aspergillus genomes. Initial automatic annotation of individual genomes may contain errors of various kinds, e.g. missing genes, incorrect exon-intron structures, 'chimeras' that fuse two or more real genes, or, alternatively, real genes split into two or more models. The main premise behind the comparative modeling approach is that for closely related genomes most orthologous families have the same conserved gene structure. The algorithm maps all gene models predicted in each individual Aspergillus genome to the other genomes and, for each locus, selects from potentially many competing models the one that most closely resembles the orthologous genes from other genomes. This procedure is iterated until no further change in gene models is observed. For Aspergillus genomes we predicted in total 4503 new gene models (~2% per genome), supported by comparative analysis, additionally correcting ~18% of old gene models. This resulted in a total of 4065 more genes with annotated PFAM domains (~3% increase per genome). Analysis of a few genomes with EST/transcriptomics data shows that the new annotation sets also have a higher number of EST-supported splice sites at exon-intron boundaries.
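
    The select-and-iterate procedure in this abstract can be sketched for a single orthologous locus as below. This is an illustration only, not the authors' implementation: a gene model is reduced to a tuple of exon lengths, and the `model_distance` score, the `refine_locus` function and the example genomes are all hypothetical.

```python
def model_distance(model_a, model_b):
    """Hypothetical dissimilarity: exon-count difference plus relative length difference."""
    exon_diff = abs(len(model_a) - len(model_b))
    total_a, total_b = sum(model_a), sum(model_b)
    length_diff = abs(total_a - total_b) / max(total_a, total_b)
    return exon_diff + length_diff


def refine_locus(candidates, initial, max_rounds=50):
    """Repeatedly pick, per genome, the candidate closest to the other genomes' models.

    candidates: {genome: [model, ...]} competing models at one orthologous locus
    initial:    {genome: model} the current annotation
    Stops when a full pass changes nothing (or after max_rounds as a safety limit).
    """
    current = dict(initial)
    for _ in range(max_rounds):
        changed = False
        for genome, options in candidates.items():
            others = [m for g, m in current.items() if g != genome]
            best = min(options, key=lambda m: sum(model_distance(m, o) for o in others))
            if best != current[genome]:
                current[genome] = best
                changed = True
        if not changed:
            break
    return current


# Toy locus: genome "A" starts with a two-exon model, while its relatives
# carry a single-exon gene of similar total length.
candidates = {
    "A": [(300, 200), (500,)],
    "B": [(510,)],
    "C": [(495,), (250, 245)],
}
initial = {"A": (300, 200), "B": (510,), "C": (495,)}

refined = refine_locus(candidates, initial)
print(refined)
```

    In this toy case the model for genome "A" is replaced by the single-exon candidate that matches its orthologs, and a second pass confirms that nothing else changes.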

  8. Complete Genomic and Lysis-Cassette Characterization of the Novel Phage, KBNP1315, which Infects Avian Pathogenic Escherichia coli (APEC)

    PubMed Central

    Lee, Jung Seok; Jang, Ho Bin; Kim, Ki Sei; Kim, Tae Hwan; Im, Se Pyeong; Kim, Si Won; Lazarte, Jassy Mary S.; Kim, Jae Sung; Jung, Tae Sung

    2015-01-01

    Avian pathogenic Escherichia coli (APEC) is a major pathogen that causes avian colibacillosis and is associated with severe economic losses in the chicken-farming industry. Here, bacteriophage KBNP1315, infecting APEC strain KBP1315, was genomically and functionally characterized. The evolutionary relationships of KBNP1315 were analyzed at the genomic level using gene (protein)-sharing networks, the Markov clustering (MCL) algorithm, and comparative genomics. Our network analysis showed that KBNP1315 was connected to 30 members of the Autographivirinae subfamily, which comprises the SP6-, T7-, P60-, phiKMV-, GAP227- and KP34-related groups. Network decomposition suggested that KBNP1315 belongs to the SP6-like phages, but our comparison of putative encoded proteins revealed that key proteins of KBNP1315, including the tail spike protein and endolysin, had relatively low levels of amino acid sequence similarity with other members of the SP6-like phages. Thus KBNP1315 may only be distantly related to the SP6-like phages, and (based on the difference in endolysin) its lysis mechanism may differ from theirs. To characterize the lytic functions of the holin and endolysin proteins from KBNP1315, we expressed these proteins individually or simultaneously in E. coli BL21 (DE3) competent cells. Interestingly, the expressed endolysin was secreted into the periplasm and caused a high degree of host cell lysis that was dose-dependently delayed/blocked by NaN3-mediated inhibition of the SecA pathway. The expressed holin triggered only a moderate inhibition of cell growth, whereas coexpression of holin and endolysin enhanced the lytic effect of endolysin. Together, these results revealed that KBNP1315 appears to use a pin-holin/signal-arrest-release (SAR) endolysin pathway to trigger host cell lysis. PMID:26555076

  9. Comparative Transcriptomic Exploration Reveals Unique Molecular Adaptations of Neuropathogenic Trichobilharzia to Invade and Parasitize Its Avian Definitive Host

    PubMed Central

    Leontovyč, Roman; Young, Neil D.; Korhonen, Pasi K.; Hall, Ross S.; Tan, Patrick; Mikeš, Libor; Kašný, Martin; Horák, Petr; Gasser, Robin B.

    2016-01-01

    To date, most molecular investigations of schistosomatids have focused principally on blood flukes (schistosomes) of humans. Despite the clinical importance of cercarial dermatitis in humans caused by Trichobilharzia regenti and the serious neuropathologic disease that this parasite causes in its permissive avian hosts and accidental mammalian hosts, almost nothing is known about the molecular aspects of how this fluke invades its hosts, migrates in host tissues and how it interacts with its hosts’ immune system. Here, we explored selected aspects using a transcriptomic-bioinformatic approach. To do this, we sequenced, assembled and annotated the transcriptome representing two consecutive life stages (cercariae and schistosomula) of T. regenti involved in the first phases of infection of the avian host. We identified key biological and metabolic pathways specific to each of these two developmental stages and also undertook comparative analyses using data available for taxonomically related blood flukes of the genus Schistosoma. Detailed comparative analyses revealed the unique involvement of carbohydrate metabolism, translation and amino acid metabolism, and calcium in T. regenti cercariae during their invasion and in growth and development, as well as the roles of cell adhesion molecules, microaerobic metabolism (citrate cycle and oxidative phosphorylation), peptidases (cathepsins) and other histolytic and lysosomal proteins in schistosomula during their particular migration in neural tissues of the avian host. In conclusion, the present transcriptomic exploration provides new and significant insights into the molecular biology of T. regenti, which should underpin future genomic and proteomic investigations of T. regenti and, importantly, provides a useful starting point for a range of comparative studies of schistosomatids and other trematodes. PMID:26863542

  10. Gramene: a growing plant comparative genomics resource

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Gramene (www.gramene.org) is a curated genetic, genomic and comparative genome analysis resource for the major crop species, such as rice, maize, wheat and many other plant (mainly grass) species. Gramene is an open-source project, with all data and software freely downloadable through the ftp site ...

  11. Gramene 2013: Comparative plant genomics resources

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Gramene (http://www.gramene.org) is a curated online resource for comparative functional genomics in crops and model plant species, currently hosting 27 fully and 10 partially sequenced reference genomes in its build number 38. Its strength derives from the application of a phylogenetic framework fo...

  13. Comparative genomics of BCG vaccines.

    PubMed

    Behr, M A

    2001-01-01

    Bacille Calmette-Guérin (BCG) vaccines have been given to more people than any other vaccine. They have also probably resulted in as much controversy as any other vaccine. In clinical trials, the efficacy of BCG vaccination against pulmonary TB has been widely variable. At the same time, a number of investigators have observed phenotypic differences between BCG daughter strains, raising the possibility that differences between BCG products may in some way translate into different outcomes. With recent genomic analysis of BCG strains, it has become possible to piece together the molecular events that have resulted in current BCG vaccines. Between the derivation of BCG in 1921 and the lyophilization of BCG Pasteur 1173 in 1961, there have been at least seven genetic events, including deletions, duplications and a single nucleotide polymorphism. The phenotypic relevance of these changes in BCG vaccines remains to be explored. PMID:11463238

  14. Orthology for comparative genomics in the mouse genome database.

    PubMed

    Dolan, Mary E; Baldarelli, Richard M; Bello, Susan M; Ni, Li; McAndrews, Monica S; Bult, Carol J; Kadin, James A; Richardson, Joel E; Ringwald, Martin; Eppig, Janan T; Blake, Judith A

    2015-08-01

    The mouse genome database (MGD) is the model organism database component of the mouse genome informatics system at The Jackson Laboratory. MGD is the international data resource for the laboratory mouse and facilitates the use of mice in the study of human health and disease. Since its beginnings, MGD has included comparative genomics data with a particular focus on human-mouse orthology, an essential component of the use of mouse as a model organism. Over the past 25 years, novel algorithms and addition of orthologs from other model organisms have enriched comparative genomics in MGD data, extending the use of orthology data to support the laboratory mouse as a model of human biology. Here, we describe current comparative data in MGD and review the history and refinement of orthology representation in this resource.

  15. Orthology for comparative genomics in the mouse genome database.

    PubMed

    Dolan, Mary E; Baldarelli, Richard M; Bello, Susan M; Ni, Li; McAndrews, Monica S; Bult, Carol J; Kadin, James A; Richardson, Joel E; Ringwald, Martin; Eppig, Janan T; Blake, Judith A

    2015-08-01

    The mouse genome database (MGD) is the model organism database component of the mouse genome informatics system at The Jackson Laboratory. MGD is the international data resource for the laboratory mouse and facilitates the use of mice in the study of human health and disease. Since its beginnings, MGD has included comparative genomics data with a particular focus on human-mouse orthology, an essential component of the use of mouse as a model organism. Over the past 25 years, novel algorithms and addition of orthologs from other model organisms have enriched comparative genomics in MGD data, extending the use of orthology data to support the laboratory mouse as a model of human biology. Here, we describe current comparative data in MGD and review the history and refinement of orthology representation in this resource. PMID:26223881

  16. Genomicus: five genome browsers for comparative genomics in eukaryota.

    PubMed

    Louis, Alexandra; Muffato, Matthieu; Roest Crollius, Hugues

    2013-01-01

    Genomicus (http://www.dyogen.ens.fr/genomicus/) is a database and an online tool that allows easy comparative genomic visualization in >150 eukaryote genomes. It provides a way to explore spatial information related to gene organization within and between genomes and temporal relationships related to gene and genome evolution. For the specific vertebrate phylum, it also provides access to ancestral gene order reconstructions and conserved non-coding elements information. We extended the Genomicus database originally dedicated to vertebrate to four new clades, including plants, non-vertebrate metazoa, protists and fungi. This visualization tool allows evolutionary phylogenomics analysis and exploration. Here, we describe the graphical modules of Genomicus and show how it is capable of revealing differential gene loss and gain, segmental or genome duplications and study the evolution of a locus through homology relationships.

  17. Genomic Selection for the Improvement of Antibody Response to Newcastle Disease and Avian Influenza Virus in Chickens

    PubMed Central

    Luo, Chenglong; Li, Xuewei; Shu, Dingming; Lund, Mogens Sandø; Su, Guosheng

    2014-01-01

    Newcastle disease (ND) and avian influenza (AI) are the most feared diseases in the poultry industry worldwide. They can cause flock mortality up to 100%, resulting in a catastrophic economic loss. This is the first study to investigate the feasibility of genomic selection for antibody response to Newcastle disease virus (Ab-NDV) and antibody response to Avian Influenza virus (Ab-AIV) in chickens. The data were collected from a crossbred population. Breeding values for Ab-NDV and Ab-AIV were estimated using a pedigree-based best linear unbiased prediction model (BLUP) and a genomic best linear unbiased prediction model (GBLUP). Single-trait and multiple-trait analyses were implemented. According to the analysis using the pedigree-based model, the heritability for Ab-NDV estimated from the single-trait and multiple-trait models was 0.478 and 0.487, respectively. The heritability for Ab-AIV estimated from the two models was 0.301 and 0.291, respectively. The estimated genetic correlation between the two traits was 0.438. A four-fold cross-validation was used to assess the accuracy of the estimated breeding values (EBV) in the two validation scenarios. In the family sample scenario each half-sib family is randomly allocated to one of four subsets and in the random sample scenario the individuals are randomly divided into four subsets. In the family sample scenario, compared with the pedigree-based model, the accuracy of the genomic prediction increased from 0.086 to 0.237 for Ab-NDV and from 0.080 to 0.347 for Ab-AIV. In the random sample scenario, the accuracy was improved from 0.389 to 0.427 for Ab-NDV and from 0.281 to 0.367 for Ab-AIV. The multiple-trait GBLUP model led to a slightly higher accuracy of genomic prediction for both traits. These results indicate that genomic selection for antibody response to ND and AI in chickens is promising. PMID:25401767
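
    The two validation scenarios in this abstract differ only in how individuals are assigned to the four subsets. The sketch below illustrates that difference. It is not the authors' code: the function names, the `family_of` mapping, and the round-robin allocation of shuffled families (used here to keep fold sizes balanced) are assumptions made for illustration.

```python
import random

def random_sample_folds(ids, k=4, seed=0):
    """Random sample scenario: individuals are shuffled and dealt into k subsets."""
    rng = random.Random(seed)
    shuffled = list(ids)
    rng.shuffle(shuffled)
    return [shuffled[i::k] for i in range(k)]

def family_sample_folds(family_of, k=4, seed=0):
    """Family sample scenario: whole half-sib families are allocated to subsets.

    family_of maps an individual ID to its (hypothetical) family label.
    Families are shuffled, then dealt round-robin so no family is split.
    """
    rng = random.Random(seed)
    families = sorted(set(family_of.values()))
    rng.shuffle(families)
    fold_of_family = {fam: i % k for i, fam in enumerate(families)}
    folds = [[] for _ in range(k)]
    for individual, fam in family_of.items():
        folds[fold_of_family[fam]].append(individual)
    return folds

if __name__ == "__main__":
    # Hypothetical data: 40 birds from 8 sire families.
    family_of = {f"bird{i}": f"sire{i % 8}" for i in range(40)}
    for fold in family_sample_folds(family_of):
        print(sorted({family_of[b] for b in fold}))
```

    In the family scenario, relatives of a validation bird never appear in the training subsets, which is consistent with the lower prediction accuracies reported for that scenario.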

  18. Linking the genomes of nonmodel teleosts through comparative genomics.

    PubMed

    Sarropoulou, E; Nousdili, D; Magoulas, A; Kotoulas, G

    2008-01-01

    Recently the genomes of two more teleost species have been released: the medaka (Oryzias latipes), and the three-spined stickleback (Gasterosteus aculeatus). The rapid developments in genomics of fish species paved the way to new and valuable research in comparative genetics and genomics. With the accumulation of information in model species, the genetic and genomic characterization of nonmodel, but economically important species, is now feasible. Furthermore, comparison of low coverage gene maps of aquacultured fish species against fully sequenced fish species will enhance the efficiency of candidate gene identification projected for quantitative trait loci (QTL) scans for traits of commercial interest. This study shows the syntenic relationship between the genomes of six different teleost species, including three fully sequenced model species: Tetraodon nigroviridis, Oryzias latipes, Gasterosteus aculeatus, and three marine species of commercial and evolutionary interest: Sparus aurata, Dicentrarchus labrax, Oreochromis spp. All three commercial fish species belong to the order Perciformes, which is the richest in number of species (approximately 10,000) but poor in terms of available genomic information and tools. Syntenic relationships were established by using 800 EST and microsatellite sequences successfully mapped on the RH map of seabream. Comparison to the stickleback genome produced the most positive BLAT hits (58%) followed by medaka (32%) and Tetraodon (30%). Thus, stickleback was used as the major stepping stone to compare seabass and tilapia to seabream. In addition to the significance for the aquaculture industry, this approach can encompass important ecological and evolutionary implications. PMID:18297360

  19. Draft Genome Sequences of Two Avian Pathogenic Escherichia coli Strains of Clinical Importance, E44 and E51.

    PubMed

    Ronco, Troels; Stegger, Marc; Andersen, Paal S; Pedersen, Karl; Li, Lili; Thøfner, Ida C N; Olsen, Rikke H

    2016-01-01

    Avian pathogenic Escherichia coli strains have remarkable impacts on animal welfare and the production economy in the poultry industry worldwide. Here, we present the draft genomes of two isolates from chickens (E44 and E51) obtained from field outbreaks and subsequently investigated for their potential for use in autogenous vaccines for broiler breeders. PMID:27491996

  20. Complete Genome Sequence of a Novel Avian Paramyxovirus (APMV-13) Isolated from a Wild Bird in Kazakhstan.

    PubMed

    Karamendin, K; Kydyrmanov, A; Seidalina, A; Asanova, S; Sayatov, M; Kasymbekov, E; Khan, E; Daulbayeva, K; Harrison, S M; Carr, I M; Goodman, S J; Zhumatov, K

    2016-01-01

    A novel avian paramyxovirus was identified during annual viral surveillance of wild bird populations in Kazakhstan in 2013. The virus was isolated from a white fronted goose (Anser albifrons) in northern Kazakhstan. Here, we report the complete genome sequence of the isolate, which we suggest should constitute a novel serotype. PMID:27198008

  1. Complete Genome Sequence of the First H5N1 Avian Influenza Virus Isolated from Chickens in Lebanon in 2016

    PubMed Central

    Ibrahim, Elias; Sirawan, Abeer; El-Bazzal, Bassel; El Hage, Jeanne; Abi Said, Mounir; Kandeil, Ahmed; Ali, Mohamed A.

    2016-01-01

    We generated the full genome of a highly pathogenic H5N1 avian influenza virus that caused an outbreak on a chicken farm in Lebanon in April 2016. Analysis revealed that the virus belonged to clade 2.3.2.1c that recently caused outbreaks in West Africa and the United Arab Emirates. PMID:27795243

  2. Complete Genome Sequence of a Novel Avian Paramyxovirus (APMV-13) Isolated from a Wild Bird in Kazakhstan

    PubMed Central

    Kydyrmanov, A.; Seidalina, A.; Asanova, S.; Sayatov, M.; Kasymbekov, E.; Khan, E.; Daulbayeva, K.; Harrison, S. M.; Carr, I. M.; Goodman, S. J.; Zhumatov, K.

    2016-01-01

    A novel avian paramyxovirus was identified during annual viral surveillance of wild bird populations in Kazakhstan in 2013. The virus was isolated from a white fronted goose (Anser albifrons) in northern Kazakhstan. Here, we report the complete genome sequence of the isolate, which we suggest should constitute a novel serotype. PMID:27198008

  3. Draft Genome Sequences of Two Avian Pathogenic Escherichia coli Strains of Clinical Importance, E44 and E51

    PubMed Central

    Stegger, Marc; Andersen, Paal S.; Pedersen, Karl; Li, Lili; Thøfner, Ida C. N.; Olsen, Rikke H.

    2016-01-01

    Avian pathogenic Escherichia coli strains have remarkable impacts on animal welfare and the production economy in the poultry industry worldwide. Here, we present the draft genomes of two isolates from chickens (E44 and E51) obtained from field outbreaks and subsequently investigated for their potential for use in autogenous vaccines for broiler breeders. PMID:27491996

  4. Evidence that avian reovirus σNS is an RNA chaperone: implications for genome segment assortment.

    PubMed

    Borodavka, Alexander; Ault, James; Stockley, Peter G; Tuma, Roman

    2015-08-18

    Reoviruses are important human, animal and plant pathogens having 10-12 segments of double-stranded genomic RNA. The mechanisms controlling the assortment and packaging of genomic segments in these viruses remain poorly understood. RNA-protein and RNA-RNA interactions between viral genomic segment precursors have been implicated in the process. While non-structural viral RNA-binding proteins, such as avian reovirus σNS, are essential for virus replication, the mechanism by which they assist packaging is unclear. Here we demonstrate that σNS assembles into stable elongated hexamers in vitro, which bind single-stranded nucleic acids with high affinity, but little sequence specificity. Using ensemble and single molecule fluorescence spectroscopy, we show that σNS also binds to a partially double-stranded RNA, resulting in gradual helix unwinding. The hexamer can bind multiple RNA molecules and exhibits strand-annealing activity, thus mediating conversion of metastable, intramolecular stem-loops into more stable heteroduplexes. We demonstrate that the ARV σNS acts as an RNA chaperone facilitating specific RNA-RNA interactions between genomic precursors during segment assortment and packaging.

  5. Evidence that avian reovirus σNS is an RNA chaperone: implications for genome segment assortment

    PubMed Central

    Borodavka, Alexander; Ault, James; Stockley, Peter G.; Tuma, Roman

    2015-01-01

    Reoviruses are important human, animal and plant pathogens having 10–12 segments of double-stranded genomic RNA. The mechanisms controlling the assortment and packaging of genomic segments in these viruses remain poorly understood. RNA–protein and RNA–RNA interactions between viral genomic segment precursors have been implicated in the process. While non-structural viral RNA-binding proteins, such as avian reovirus σNS, are essential for virus replication, the mechanism by which they assist packaging is unclear. Here we demonstrate that σNS assembles into stable elongated hexamers in vitro, which bind single-stranded nucleic acids with high affinity, but little sequence specificity. Using ensemble and single molecule fluorescence spectroscopy, we show that σNS also binds to a partially double-stranded RNA, resulting in gradual helix unwinding. The hexamer can bind multiple RNA molecules and exhibits strand-annealing activity, thus mediating conversion of metastable, intramolecular stem-loops into more stable heteroduplexes. We demonstrate that the ARV σNS acts as an RNA chaperone facilitating specific RNA–RNA interactions between genomic precursors during segment assortment and packaging. PMID:26109354

  6. A White Paper on Nematode Comparative Genomics

    PubMed Central

    Bird, David McK.; Blaxter, Mark L.; McCarter, James P.; Mitreva, Makedonka; Sternberg, Paul W.; Thomas, W. Kelley

    2005-01-01

    In response to the new opportunities for genome sequencing and comparative genomics, the Society of Nematology (SON) formed a committee to develop a white paper in support of the broad scientific needs associated with this phylum and interests of SON members. Although genome sequencing is expensive, the data generated are unique in biological systems in that genomes have the potential to be complete (every base of the genome can be accounted for), accurate (the data are digital and not subject to stochastic variation), and permanent (once obtained, the genome of a species does not need to be experimentally re-sampled). The availability of complete, accurate, and permanent genome sequences from diverse nematode species will underpin future studies into the biology and evolution of this phylum and the ecological associations (particularly parasitic) nematodes have with other organisms. We anticipate that upwards of 100 nematode genomes will be solved to varying levels of completion in the coming decade and suggest biological and practical considerations to guide the selection of the most informative taxa for sequencing. PMID:19262884

  7. Comparative Genome Analysis of Enterobacter cloacae

    PubMed Central

    Liu, Wing-Yee; Wong, Chi-Fat; Chung, Karl Ming-Kar; Jiang, Jing-Wei; Leung, Frederick Chi-Ching

    2013-01-01

    The Enterobacter cloacae species includes an extremely diverse group of bacteria that are associated with plants, soil and humans. Publication of the complete genome sequence of the plant growth-promoting endophytic E. cloacae subsp. cloacae ENHKU01 provided an opportunity to perform the first comparative genome analysis between strains of this dynamic species. Examination of the pan-genome of E. cloacae showed that the conserved core genome retains the general physiological and survival genes of the species, while genomic factors in plasmids and variable regions determine the virulence of the human pathogenic E. cloacae strain; additionally, the diversity of fimbriae contributes to variation in colonization and host determination of different E. cloacae strains. Comparative genome analysis further illustrated that E. cloacae strains possess multiple mechanisms for antagonistic action against other microorganisms, which involve the production of siderophores and various antimicrobial compounds, such as bacteriocins, chitinases and antibiotic resistance proteins. The presence of Type VI secretion systems is expected to provide further fitness advantages for E. cloacae in microbial competition, thus allowing it to survive in different environments. Competition assays were performed to support our observations in genomic analysis, where E. cloacae subsp. cloacae ENHKU01 demonstrated antagonistic activities against a wide range of plant pathogenic fungal and bacterial species. PMID:24069314

  8. Avian influenza at both ends of a migratory flyway: characterizing viral genomic diversity to optimize surveillance plans for North America

    PubMed Central

    Pearce, John M; Ramey, Andrew M; Flint, Paul L; Koehler, Anson V; Fleskes, Joseph P; Franson, J Christian; Hall, Jeffrey S; Derksen, Dirk V; Ip, Hon S

    2009-01-01

    Although continental populations of avian influenza viruses are genetically distinct, transcontinental reassortment in low pathogenic avian influenza (LPAI) viruses has been detected in migratory birds. Thus, genomic analyses of LPAI viruses could serve as an approach to prioritize species and regions targeted by North American surveillance activities for foreign origin highly pathogenic avian influenza (HPAI). To assess the applicability of this approach, we conducted a phylogenetic and population genetic analysis of 68 viral genomes isolated from the northern pintail (Anas acuta) at opposite ends of the Pacific migratory flyway in North America. We found limited evidence for Asian LPAI lineages on wintering areas used by northern pintails in California in contrast to a higher frequency on breeding locales of Alaska. Our results indicate that the number of Asian LPAI lineages observed in Alaskan northern pintails, and the nucleotide composition of LPAI lineages, is not maintained through fall migration. Accordingly, our data indicate that surveillance of Pacific Flyway northern pintails to detect foreign avian influenza viruses would be most effective in Alaska. North American surveillance plans could be optimized through an analysis of LPAI genomics from species that demonstrate evolutionary linkages with European or Asian lineages and in regions that have overlapping migratory flyways with areas of HPAI outbreaks. PMID:25567891

  9. Avian influenza at both ends of a migratory flyway: characterizing viral genomic diversity to optimize surveillance plans for North America

    USGS Publications Warehouse

    Pearce, John M.; Ramey, Andrew M.; Flint, Paul L.; Koehler, Anson V.; Fleskes, Joseph P.; Franson, J. Christian; Hall, Jeffrey S.; Derksen, Dirk V.; Ip, Hon S.

    2009-01-01

    Although continental populations of avian influenza viruses are genetically distinct, transcontinental reassortment in low pathogenic avian influenza (LPAI) viruses has been detected in migratory birds. Thus, genomic analyses of LPAI viruses could serve as an approach to prioritize species and regions targeted by North American surveillance activities for foreign origin highly pathogenic avian influenza (HPAI). To assess the applicability of this approach, we conducted a phylogenetic and population genetic analysis of 68 viral genomes isolated from the northern pintail (Anas acuta) at opposite ends of the Pacific migratory flyway in North America. We found limited evidence for Asian LPAI lineages on wintering areas used by northern pintails in California in contrast to a higher frequency on breeding locales of Alaska. Our results indicate that the number of Asian LPAI lineages observed in Alaskan northern pintails, and the nucleotide composition of LPAI lineages, is not maintained through fall migration. Accordingly, our data indicate that surveillance of Pacific Flyway northern pintails to detect foreign avian influenza viruses would be most effective in Alaska. North American surveillance plans could be optimized through an analysis of LPAI genomics from species that demonstrate evolutionary linkages with European or Asian lineages and in regions that have overlapping migratory flyways with areas of HPAI outbreaks.

  10. Phytozome System for Comparative Plant Genomics

    2011-09-27

    Phytozome is a joint project of the Department of Energy's Joint Genome Institute and the UC Berkeley Center for Integrative Genomics to facilitate comparative genomic studies amongst green plants. Families of orthologous and paralogous genes that represent the modern descendants of ancestral gene sets are constructed at key phylogenetic nodes. These families allow easy access to clade specific orthology/paralogy relationships as well as clade specific genes and gene expansions. As of release 7.0, Phytozome provides access to twenty-five sequenced and annotated green plant genomes which have been clustered into gene families at eleven evolutionarily significant nodes. Where possible, each gene has been annotated with PFAM, KOG, KEGG, and PANTHER assignments, and publicly available annotations from RefSeq, UniProt, TAIR, JGI are hyper-linked and searchable.

  11. Phytozome System for Comparative Plant Genomics

    SciTech Connect

    2011-09-27

    Phytozome is a joint project of the Department of Energy's Joint Genome Institute and the UC Berkeley Center for Integrative Genomics to facilitate comparative genomic studies amongst green plants. Families of orthologous and paralogous genes that represent the modern descendants of ancestral gene sets are constructed at key phylogenetic nodes. These families allow easy access to clade specific orthology/paralogy relationships as well as clade specific genes and gene expansions. As of release 7.0, Phytozome provides access to twenty-five sequenced and annotated green plant genomes which have been clustered into gene families at eleven evolutionarily significant nodes. Where possible, each gene has been annotated with PFAM, KOG, KEGG, and PANTHER assignments, and publicly available annotations from RefSeq, UniProt, TAIR, JGI are hyper-linked and searchable.

  12. Homology-Independent Metrics for Comparative Genomics

    PubMed Central

    Coutinho, Tarcisio José Domingos; Franco, Glória Regina; Lobo, Francisco Pereira

    2015-01-01

    A mainstream procedure to analyze the wealth of genomic data available nowadays is the detection of homologous regions shared across genomes, followed by the extraction of biological information from the patterns of conservation and variation observed in such regions. Although of pivotal importance, comparative genomic procedures that rely on homology inference are obviously not applicable if no homologous regions are detectable. This fact excludes a considerable portion of “genomic dark matter” with no significant similarity (and, consequently, no inferred homology) to any other known sequence from several downstream comparative genomic methods. In this review we compile several sequence metrics that do not rely on homology inference and can be used to compare nucleotide sequences and extract biologically meaningful information from them. These metrics comprise several compositional parameters calculated from sequence data alone, such as GC content, dinucleotide odds ratio, and several codon bias metrics. They also share other interesting properties, such as pervasiveness (patterns persist on smaller scales) and phylogenetic signal. We also cite examples where these homology-independent metrics have been successfully applied to support several bioinformatics challenges, such as taxonomic classification of biological sequences without homology inference. They were also used to detect higher-order patterns of interactions in biological systems, ranging from detecting coevolutionary trends between the genomes of viruses and their hosts to characterization of gene pools of entire microbial communities. We argue that, if correctly understood and applied, homology-independent metrics can add important layers of biological information in comparative genomic studies without prior homology inference. PMID:26029354
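
    Two of the compositional metrics named in this abstract, GC content and the dinucleotide odds ratio, can be computed from a sequence alone. The sketch below is a minimal illustration, not code from the review. It assumes the common single-strand definition rho_xy = f_xy / (f_x * f_y) and ignores any character other than A, C, G, and T; the function names are hypothetical.

```python
from collections import Counter

BASES = set("ACGT")

def gc_content(seq):
    """Fraction of G and C among the unambiguous bases of seq."""
    bases = [b for b in seq.upper() if b in BASES]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    return sum(b in "GC" for b in bases) / len(bases)

def dinucleotide_odds_ratio(seq):
    """Observed/expected ratio for each dinucleotide present in seq.

    rho_xy = f_xy / (f_x * f_y), where f_x and f_y are mononucleotide
    frequencies and f_xy is the dinucleotide frequency (single strand).
    """
    seq = seq.upper()
    mono = Counter(b for b in seq if b in BASES)
    di = Counter(
        seq[i:i + 2]
        for i in range(len(seq) - 1)
        if set(seq[i:i + 2]) <= BASES
    )
    n_mono = sum(mono.values())
    n_di = sum(di.values())
    ratios = {}
    for pair, count in di.items():
        f_x = mono[pair[0]] / n_mono
        f_y = mono[pair[1]] / n_mono
        ratios[pair] = (count / n_di) / (f_x * f_y)
    return ratios

if __name__ == "__main__":
    example = "ATGCGCGATATTTACGCG"  # made-up sequence for illustration
    print(gc_content(example))
    print(dinucleotide_odds_ratio(example))
```

    A ratio well below 1 indicates an under-represented dinucleotide and a ratio well above 1 an over-represented one; very short sequences give noisy values.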

  13. Sequencing and comparing whole mitochondrial genomes of animals

    SciTech Connect

    Boore, Jeffrey L.; Macey, J. Robert; Medina, Monica

    2005-04-22

    Comparing complete animal mitochondrial genome sequences is becoming increasingly common for phylogenetic reconstruction and as a model for genome evolution. Not only are they much more informative than shorter sequences of individual genes for inferring evolutionary relatedness, but these data also provide sets of genome-level characters, such as the relative arrangements of genes, that can be especially powerful. We describe here the protocols commonly used for physically isolating mtDNA, for amplifying these by PCR or RCA, for cloning, sequencing, assembly, validation, and gene annotation, and for comparing both sequences and gene arrangements. On several topics, we offer general observations based on our experiences to date with determining and comparing complete mtDNA sequences.

  14. Using comparative genomics to reorder the human genome sequence into a virtual sheep genome

    PubMed Central

    Dalrymple, Brian P; Kirkness, Ewen F; Nefedov, Mikhail; McWilliam, Sean; Ratnakumar, Abhirami; Barris, Wes; Zhao, Shaying; Shetty, Jyoti; Maddox, Jillian F; O'Grady, Margaret; Nicholas, Frank; Crawford, Allan M; Smith, Tim; de Jong, Pieter J; McEwan, John; Oddy, V Hutton; Cockett, Noelle E

    2007-01-01

    Background: Is it possible to construct an accurate and detailed subgene-level map of a genome using bacterial artificial chromosome (BAC) end sequences, a sparse marker map, and the sequences of other genomes? Results: A sheep BAC library, CHORI-243, was constructed and the BAC end sequences were determined and mapped with high sensitivity and low specificity onto the frameworks of the human, dog, and cow genomes. To maximize genome coverage, the coordinates of all BAC end sequence hits to the cow and dog genomes were also converted to the equivalent human genome coordinates. The 84,624 sheep BACs (about 5.4-fold genome coverage) with paired ends in the correct orientation (tail-to-tail) and spacing, combined with information from sheep BAC comparative genome contigs (CGCs) built separately on the dog and cow genomes, were used to construct 1,172 sheep BAC-CGCs, covering 91.2% of the human genome. Clustered non-tail-to-tail and outsize BACs located close to the ends of many BAC-CGCs linked BAC-CGCs covering about 70% of the genome to at least one other BAC-CGC on the same chromosome. Using the BAC-CGCs, the intrachromosomal and interchromosomal BAC-CGC linkage information, human/cow and vertebrate synteny, and the sheep marker map, a virtual sheep genome was constructed. To identify BACs potentially located in gaps between BAC-CGCs, an additional set of 55,668 sheep BACs were positioned on the sheep genome with lower confidence. A coordinate conversion process allowed us to transfer human genes and other genome features to the virtual sheep genome to display on a sheep genome browser. Conclusion: We demonstrate that limited sequencing of BACs combined with positioning on a well assembled genome and integrating locations from other less well assembled genomes can yield extensive, detailed subgene-level maps of mammalian genomes, for which genomic resources are currently limited. PMID:17663790

  15. Comparative genomic hybridization with single cells after whole genome amplification

    SciTech Connect

    Haddad, B.R.; Baldini, A.; Hughes, M.R.

    1994-09-01

    Conventional karyotype analysis is the ideal way to diagnose chromosomal imbalances. However, it requires cell culture and chromosome preparation. There are instances where a very small number of cells are available for cytogenetic evaluation and chromosomes cannot be obtained. Comparative genomic hybridization (CGH) is a novel molecular cytogenetic technique that provides information about genetic imbalances affecting the genome. The power of this technique lies in its ability to detect genetic imbalances using total genomic DNA. We have previously demonstrated the feasibility of whole genome amplification from single cells for subsequent analysis of multiple genetic loci by PCR. In this present work, we combine whole genome amplification with CGH to detect chromosomal imbalances from small numbers of cells. Both cytogenetically normal and abnormal cells were individually picked by micromanipulation and subjected to whole genome amplification using random oligonucleotide primers. Amplified test and control DNA were differentially labeled by incorporation of digoxigenin or biotin, mixed together and hybridized to normal male metaphase spreads. Hybridization was detected with two fluorochromes, rhodamine-anti-digoxigenin and FITC-avidin. The ratio of intensities of the two fluorochromes along the target chromosomes was analyzed using locally developed computer imaging software. Using the combination of whole genome amplification and CGH, we were able to detect different chromosomal aneuploidies from 30, 20, and 10 cells. It can also be applied to the analysis of fetal cells sorted from maternal circulation, or to tumor cells obtained from needle biopsies or from different body fluids and effusions. Finally, its successful application to single cells will have a great impact on preimplantation diagnosis.

  16. VISTA - computational tools for comparative genomics

    SciTech Connect

    Frazer, Kelly A.; Pachter, Lior; Poliakov, Alexander; Rubin, Edward M.; Dubchak, Inna

    2004-01-01

    Comparison of DNA sequences from different species is a fundamental method for identifying functional elements in genomes. Here we describe the VISTA family of tools created to assist biologists in carrying out this task. Our first VISTA server at http://www-gsd.lbl.gov/VISTA/ was launched in the summer of 2000 and was designed to align long genomic sequences and visualize these alignments with associated functional annotations. Currently the VISTA site includes multiple comparative genomics tools and provides users with rich capabilities to browse pre-computed whole-genome alignments of large vertebrate genomes and other groups of organisms with VISTA Browser, submit their own sequences of interest to several VISTA servers for various types of comparative analysis, and obtain detailed comparative analysis results for a set of cardiovascular genes. We illustrate capabilities of the VISTA site by the analysis of a 180 kilobase (kb) interval on human chromosome 5 that encodes for the kinesin family member 3A (KIF3A) protein.

  17. VISTA: computational tools for comparative genomics.

    PubMed

    Frazer, Kelly A; Pachter, Lior; Poliakov, Alexander; Rubin, Edward M; Dubchak, Inna

    2004-07-01

    Comparison of DNA sequences from different species is a fundamental method for identifying functional elements in genomes. Here, we describe the VISTA family of tools created to assist biologists in carrying out this task. Our first VISTA server at http://www-gsd.lbl.gov/vista/ was launched in the summer of 2000 and was designed to align long genomic sequences and visualize these alignments with associated functional annotations. Currently the VISTA site includes multiple comparative genomics tools and provides users with rich capabilities to browse pre-computed whole-genome alignments of large vertebrate genomes and other groups of organisms with VISTA Browser, to submit their own sequences of interest to several VISTA servers for various types of comparative analysis and to obtain detailed comparative analysis results for a set of cardiovascular genes. We illustrate capabilities of the VISTA site by the analysis of a 180 kb interval on human chromosome 5 that encodes for the kinesin family member 3A (KIF3A) protein.

  18. Comparative analysis of the Borrelia garinii genome.

    PubMed

    Glöckner, G; Lehmann, R; Romualdi, A; Pradella, S; Schulte-Spechtel, U; Schilhabel, M; Wilske, B; Sühnel, J; Platzer, M

    2004-01-01

    Three members of the genus Borrelia (B.burgdorferi, B.garinii, B.afzelii) cause tick-borne borreliosis. Depending on the Borrelia species involved, the borreliosis differs in its clinical symptoms. Comparative genomics opens up a way to elucidate the underlying differences in Borrelia species. We analysed a low redundancy whole-genome shotgun (WGS) assembly of a B.garinii strain isolated from a patient with neuroborreliosis in comparison to the B.burgdorferi genome. This analysis reveals that most of the chromosome is conserved (92.7% identity on DNA as well as on amino acid level) in the two species, and no chromosomal rearrangement or larger insertions/deletions could be observed. Furthermore, two collinear plasmids (lp54 and cp26) seem to belong to the basic genome inventory of Borrelia species. These three collinear parts of the Borrelia genome encode 861 genes, which are orthologous in the two species examined. The majority of the genetic information of the other plasmids of B.burgdorferi is also present in B.garinii, although orthology is not easy to define due to a high redundancy of the plasmid fraction. Yet, we did not find counterparts of the B.burgdorferi plasmids lp36 and lp38 or their respective gene repertoire in the B.garinii genome. Thus, phenotypic differences between the two species could be attributable to the presence or absence of these two plasmids as well as to the potentially positively selected genes. PMID:15547252

  19. Ebolavirus comparative genomics

    SciTech Connect

    Jun, Se-Ran; Leuze, Michael R.; Nookaew, Intawat; Uberbacher, Edward C.; Land, Miriam; Zhang, Qian; Wanchai, Visanu; Chai, Juanjuan; Nielsen, Morten; Trolle, Thomas; Lund, Ole; Buzard, Gregory S.; Pedersen, Thomas D.; Ussery, David W.

    2015-07-14

    The 2014 Ebola outbreak in West Africa is the largest documented for this virus. We examine the dynamics of this genome, comparing more than one hundred currently available ebolavirus genomes to each other and to other viral genomes. Based on oligomer frequency analysis, the family Filoviridae forms a distinct group from all other sequenced viral genomes. All filovirus genomes sequenced to date encode proteins with similar functions and gene order, although there is considerable divergence in sequences between the three genera Ebolavirus, Cuevavirus, and Marburgvirus within the family Filoviridae. Whereas all ebolavirus genomes are quite similar (multiple sequences of the same strain are often identical), variation is most common in the intergenic regions and within specific areas of the genes encoding the glycoprotein (GP), nucleoprotein (NP), and polymerase (L). We predict regions that could contain epitope-binding sites, which might be good vaccine targets. In conclusion, this information, combined with glycosylation sites and experimentally determined epitopes, can identify the most promising regions for the development of therapeutic strategies.
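    The oligomer frequency analysis mentioned above can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names, the choice of k, and the Euclidean distance are assumptions made only for illustration. Each genome is reduced to a vector of relative k-mer frequencies, and two genomes are compared by the distance between their vectors.

```python
from collections import Counter
from itertools import product
import math


def kmer_frequencies(seq, k=4):
    """Relative frequency of every k-mer over the A/C/G/T alphabet.

    Windows containing other symbols (e.g. N) are ignored.
    """
    seq = seq.upper().replace("U", "T")  # accept RNA-style input as well
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    total = sum(counts[m] for m in kmers)
    if total == 0:
        return {m: 0.0 for m in kmers}
    return {m: counts[m] / total for m in kmers}


def frequency_distance(seq_a, seq_b, k=4):
    """Euclidean distance between two k-mer frequency vectors (an assumed metric)."""
    fa = kmer_frequencies(seq_a, k)
    fb = kmer_frequencies(seq_b, k)
    return math.sqrt(sum((fa[m] - fb[m]) ** 2 for m in fa))
```

    A pairwise distance matrix built this way could then be clustered to see whether a group of genomes separates from the rest; the clustering step is left out of the sketch.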

  20. Comparative genomics of Shiga toxin encoding bacteriophages

    PubMed Central

    2012-01-01

    Background: Stx bacteriophages are responsible for driving the dissemination of Stx toxin genes (stx) across their bacterial host range. Lysogens carrying Stx phages can cause severe, life-threatening disease and Stx toxin is an integral virulence factor. The Stx-bacteriophage vB_EcoP-24B, commonly referred to as Ф24B, is capable of multiply infecting a single bacterial host cell at a high frequency, with secondary infection increasing the rate at which subsequent bacteriophage infections can occur. This is biologically unusual; determining the genomic content and context of Ф24B compared to other lambdoid Stx phages is therefore important to understanding the factors controlling this phenomenon and determining whether they occur in other Stx phages. Results: The genome of the Stx2-encoding phage Ф24B was sequenced and annotated. The genomic organisation and general features are similar to other sequenced Stx bacteriophages induced from Enterohaemorrhagic Escherichia coli (EHEC); however, Ф24B possesses significant regions of heterogeneity, with implications for phage biology and behaviour. The Ф24B genome was compared to other sequenced Stx phages and the archetypal lambdoid phage, lambda, using the Circos genome comparison tool and a PCR-based multi-loci comparison system. Conclusions: The data support the hypothesis that Stx phages are mosaic, and recombination events between the host, phages and their remnants within the same infected bacterial cell will continue to drive the evolution of Stx phage variants and the subsequent dissemination of shigatoxigenic potential. PMID:22799768

  1. Comparative genomics of biotechnologically important yeasts

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Ascomycete yeasts are metabolically diverse, with great potential for biotechnology. Here, we report the comparative genome analysis of 29 taxonomically and biotechnologically important yeasts, including 16 newly sequenced. We identify a genetic code change, CUG-Ala, in Pachysolen tannophilus in the...

  2. Advances in genetic engineering of the avian genome: "Realising the promise".

    PubMed

    Doran, Timothy J; Cooper, Caitlin A; Jenkins, Kristie A; Tizard, Mark L V

    2016-06-01

    This review provides a historical perspective on the key steps in avian transgenesis, from those reported at the 1st Transgenic Animal Research Conference in 1997 through to the very latest developments. Eighteen years later, on the occasion of the 10th conference in this series, we have seen breakthrough advances in the use of viral vectors and transposons to transform the germline via the direct manipulation of the chicken embryo, through to the establishment of PGC cultures allowing in vitro modification, expansion into populations to analyse the genetic modifications and then injection of these cells into embryos to create germline chimeras. We have now reached an unprecedented time in the history of chicken transgenic research where we have the technology to introduce precise, targeted modifications into the chicken genome. These range from new transgenes that provide improved phenotypes, such as increased resilience to economically important diseases, through the targeted disruption of immunoglobulin genes and their replacement with human sequences to generate transgenic chickens that express "humanised" antibodies for biopharming, to the deletion of specific nucleotides to generate targeted gene knockout chickens for functional genomics. The impact of these advances is set to be realised through applications in chickens and other bird species: as models in scientific research, for novel biotechnology, and to protect and improve agricultural productivity.

  3. Genomic analysis of avian influenza viruses from waterfowl in Western Alaska, USA

    USGS Publications Warehouse

    Reeves, A.B.; Pearce, J.M.; Ramey, A.M.; Ely, C.R.; Schmutz, J.A.; Flint, P.L.; Derksen, D.V.; Ip, H.S.; Trust, K.A.

    2013-01-01

    The Yukon-Kuskokwim Delta (Y-K Delta) in western Alaska is an immense and important breeding ground for waterfowl. Migratory birds from the Pacific Americas, Central Pacific, and East Asian-Australasian flyways converge in this region, providing opportunities for intermixing of North American- and Eurasian-origin hosts and infectious agents, such as avian influenza virus (AIV). We characterized the genomes of 90 low pathogenic (LP) AIV isolates from 11 species of waterfowl sampled on the Y-K Delta between 2006 and 2009 as part of an interagency surveillance program for the detection of the H5N1 highly pathogenic (HP) strain of AIV. We found evidence for subtype and genetic differences between viruses from swans and geese, dabbling ducks, and sea ducks. At least one gene segment in 39% of all isolates was Eurasian in origin. Target species (those ranked as having a relatively high potential to introduce HP H5N1 AIV to North America) were no more likely than nontarget species to carry viruses with genes of Eurasian origin. These findings provide evidence that the frequency at which viral gene segments of Eurasian origin are detected does not result from a strong species effect, but rather we suspect it is linked to the geographic location of the Y-K Delta in western Alaska where flyways from different continents overlap. This study provides support for retaining the Y-K Delta as a high priority region for the surveillance of Asian avian pathogens such as HP H5N1 AIV.

  4. Isolation, genome sequencing and functional analysis of two T7-like coliphages of avian pathogenic Escherichia coli.

    PubMed

    Chen, Mianmian; Xu, Juntian; Yao, Huochun; Lu, Chengping; Zhang, Wei

    2016-05-10

    Avian pathogenic Escherichia coli (APEC) causes colibacillosis, which results in significant economic losses to the poultry industry worldwide. Because antibiotic use leaves drug residues and increases antibiotic resistance, bacteriophages and other alternative therapeutic agents are expected to control APEC infection in poultry. Two APEC phages, named P483 and P694, were isolated from feces collected at a farmers market in China. We then studied their biological properties, and carried out high-throughput genome sequencing and homology analyses of these phages. Assembly results of high-throughput sequencing showed that the genomes of both P483 and P694 consist of linear double-stranded DNA. Results of the electron microscopy and homology analysis revealed that both P483 and P694 belong to the T7-like viruses, a group within the Podoviridae family of the Caudovirales order. Comparative genomic analysis showed that most of the predicted proteins of these two phages showed strongest sequence similarity to the Enterobacteria phages BA14 and 285P, Erwinia phage FE44, and Kluyvera phage Kvp1; however, some proteins such as gp0.6a, gp1.7 and gp17 showed lower similarity (<85%) with the homologs of other phages in the T7 subgroup. We also found some unique characteristics of P483 and P694, such as the two types of the genes of P694 and no lytic activity of P694 against its host bacteria in liquid medium. Our results further our understanding of the evolution of T7-like coliphages and point to the potential application of these phages as therapeutic agents for the treatment of diseases. PMID:26828615

  6. Nucleotide sequences derived from pheasant DNA in the genome of recombinant avian leukosis viruses with subgroup F specificity.

    PubMed

    Keshet, E; Temin, H M

    1977-11-01

    Recombination between viral and cellular genes can give rise to new strains of retroviruses. For example, Rous-associated virus 61 (RAV-61) is a recombinant between the Bryan high-titer strain of Rous sarcoma virus (RSV) and normal pheasant DNA. Nucleic acid hybridization techniques were used to study the genome of RAV-61 and another RAV with subgroup F specificity (RAV-F) obtained by passage of RSV-RAV-0 in cells from a ring-necked pheasant embryo. The nucleotide sequences acquired by these two independent isolates of RAV-F that were not shared with the parental virus comprised 20 to 25% of the RAV-F genomes and were indistinguishable by nucleic acid hybridization. (In addition, RAV-F genomes had another set of nucleotide sequences that were homologous to some pheasant nucleotide sequences and also were present in the parental viruses.) A specific complementary DNA, containing only nucleotide sequences complementary to those acquired by RAV-61 through recombination, was prepared. These nucleotide sequences were pheasant derived and were not present in the genomes of reticuloendotheliosis viruses, pheasant viruses, and avian leukosis-sarcoma viruses of subgroups A, B, C, D, and E. They were partially endogenous, however, to avian DNA other than pheasant. The fraction of these nucleotide sequences present in other avian DNAs generally paralleled the genetic relatedness of these avian species to pheasants. However, there was a high degree of homology between these pheasant nucleotide sequences and related nucleotide sequences in the DNA of normal chickens as indicated by the identical melting profiles of the respective hybrids.

  7. IDENTIFICATION OF AVIAN-SPECIFIC FECAL METAGENOMIC SEQUENCES USING GENOME FRAGMENT ENRICHMENTS

    EPA Science Inventory

    Sequence analysis of microbial genomes has provided biologists the opportunity to compare genetic differences between closely related microorganisms. While random sequencing has also been used to study natural microbial communities, metagenomic comparisons via sequencing analysis...

  8. Comparative genomics in the Amoebozoa clade.

    PubMed

    Glöckner, Gernot; Noegel, Angelika A

    2013-02-01

    Amoeboid life forms can be found throughout the evolutionary tree. The greatest proportion of these life forms is found in the Amoebozoa clade, one of the six major eukaryote evolutionary branches. Despite its common origin, this clade exhibits a wide diversity of lifestyles, including free-living and parasitic species and species with multicellular and multinucleate life stages. In this group, development, cooperation, and social behaviour can be studied in addition to traits common to unicellular organisms. To date, only a few Amoebozoa genomes have been sequenced completely; however, a number of expressed sequence tags (ESTs) and complete and draft genomes have become available recently for several species that represent some of the major evolutionary lineages in this clade. If properly exploited, this resource allows us to compare and analyse the evolutionary history and fate of branch-specific genes. Despite the large evolutionary time scale since the emergence of the major groups, the genomic organization in Amoebozoa has retained common features. The number of Amoebozoa-specific genetic inventions seems to be rather small. The emergence of subgroups is accompanied by gene and domain losses and acquisitions of bacterial gene material. The sophisticated developmental cycles of Myxogastria and Dictyosteliida likely have a common origin and are deeply rooted in amoebozoan evolution. In this review we describe initial approaches to comparative genomics in Amoebozoa, summarize recent findings, and identify goals for further studies.

  9. A Comparative Map of the Zebrafish Genome

    PubMed Central

    Woods, Ian G.; Kelly, Peter D.; Chu, Felicia; Ngo-Hazelett, Phuong; Yan, Yi-Lin; Huang, Hui; Postlethwait, John H.; Talbot, William S.

    2000-01-01

    Zebrafish mutations define the functions of hundreds of essential genes in the vertebrate genome. To accelerate the molecular analysis of zebrafish mutations and to facilitate comparisons among the genomes of zebrafish and other vertebrates, we used a homozygous diploid meiotic mapping panel to localize polymorphisms in 691 previously unmapped genes and expressed sequence tags (ESTs). Together with earlier efforts, this work raises the total number of markers scored in the mapping panel to 2119, including 1503 genes and ESTs and 616 previously characterized simple-sequence length polymorphisms. Sequence analysis of zebrafish genes mapped in this study and in prior work identified putative human orthologs for 804 zebrafish genes and ESTs. Map comparisons revealed 139 new conserved syntenies, in which two or more genes are on the same chromosome in zebrafish and human. Although some conserved syntenies are quite large, there were changes in gene order within conserved groups, apparently reflecting the relatively frequent occurrence of inversions and other intrachromosomal rearrangements since the divergence of teleost and tetrapod ancestors. Comparative mapping also shows that there is not a one-to-one correspondence between zebrafish and human chromosomes. Mapping of duplicate gene pairs identified segments of 20 linkage groups that may have arisen during a genome duplication that occurred early in the evolution of teleosts after the divergence of teleost and mammalian ancestors. This comparative map will accelerate the molecular analysis of zebrafish mutations and enhance the understanding of the evolution of the vertebrate genome. PMID:11116086
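    The definition of a conserved synteny used above (two or more genes on the same chromosome in both zebrafish and human) lends itself to a short Python sketch. The input format, the function name, and the gene placements in the usage note are hypothetical and serve only to illustrate the counting rule, not the authors' mapping procedure.

```python
from collections import defaultdict


def conserved_syntenies(orthologs, min_genes=2):
    """Group orthologous genes by (zebrafish linkage group, human chromosome).

    `orthologs` is an iterable of (gene, zebrafish_LG, human_chromosome)
    tuples. A chromosome pair counts as a conserved synteny when at least
    `min_genes` genes map to it in both species.
    """
    groups = defaultdict(list)
    for gene, zf_lg, hs_chr in orthologs:
        groups[(zf_lg, hs_chr)].append(gene)
    return {
        pair: sorted(genes)
        for pair, genes in groups.items()
        if len(genes) >= min_genes
    }
```

    For example, with the made-up placements [("geneA", "LG9", "2"), ("geneB", "LG9", "2"), ("geneC", "LG3", "17")], only the pair ("LG9", "2") is reported, because it is the only one supported by two genes. Gene order within a group is deliberately ignored, in line with the observation that order often changes inside conserved groups.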

  10. Comparative genomics tools applied to bioterrorism defence.

    PubMed

    Slezak, Tom; Kuczmarski, Tom; Ott, Linda; Torres, Clinton; Medeiros, Dan; Smith, Jason; Truitt, Brian; Mulakken, Nisha; Lam, Marisa; Vitalis, Elizabeth; Zemla, Adam; Zhou, Carol Ecale; Gardner, Shea

    2003-06-01

    Rapid advances in the genomic sequencing of bacteria and viruses over the past few years have made it possible to consider sequencing the genomes of all pathogens that affect humans and the crops and livestock upon which our lives depend. Recent events make it imperative that full genome sequencing be accomplished as soon as possible for pathogens that could be used as weapons of mass destruction or disruption. This sequence information must be exploited to provide rapid and accurate diagnostics to identify pathogens and distinguish them from harmless near-neighbours and hoaxes. The Chem-Bio Non-Proliferation (CBNP) programme of the US Department of Energy (DOE) began a large-scale effort of pathogen detection in early 2000 when it was announced that the DOE would be providing bio-security at the 2002 Winter Olympic Games in Salt Lake City, Utah. Our team at the Lawrence Livermore National Lab (LLNL) was given the task of developing reliable and validated assays for a number of the most likely bioterrorist agents. The short timeline led us to devise a novel system that utilised whole-genome comparison methods to rapidly focus on parts of the pathogen genomes that had a high probability of being unique. Assays developed with this approach have been validated by the Centers for Disease Control (CDC). They were used at the 2002 Winter Olympics, have entered the public health system, and have been in continual use for non-publicised aspects of homeland defence since autumn 2001. Assays have been developed for all major threat list agents for which adequate genomic sequence is available, as well as for other pathogens requested by various government agencies. Collaborations with comparative genomics algorithm developers have enabled our LLNL team to make major advances in pathogen detection, since many of the existing tools simply did not scale well enough to be of practical use for this application. It is hoped that a discussion of a real-life practical application of...
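    The idea of focusing on genome regions that are likely to be unique to a pathogen can be sketched in Python with a simple k-mer set difference. This is a toy stand-in, not the LLNL system: the function names and the value of k are assumptions, reverse complements are ignored, and a real pipeline would need indexing structures that scale to whole genomes.

```python
def kmers(seq, k):
    """Set of all k-length substrings of a sequence."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def unique_kmers(target, neighbours, k=8):
    """k-mers present in the target genome but in none of the near-neighbour genomes."""
    background = set()
    for genome in neighbours:
        background |= kmers(genome, k)
    return kmers(target, k) - background
```

    Stretches of the target covered by many such k-mers would be candidate signature regions for assay design; the candidates would still need screening against a much broader background and laboratory validation.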

  11. Phytome: a platform for plant comparative genomics.

    PubMed

    Hartmann, Stefanie; Lu, Dihui; Phillips, Jason; Vision, Todd J

    2006-01-01

    Phytome is an online comparative genomics resource that can be applied to functional plant genomics, molecular breeding and evolutionary studies. It contains predicted protein sequences, protein family assignments, multiple sequence alignments, phylogenies and functional annotations for proteins from a large, phylogenetically diverse set of plant taxa. Phytome serves as a glue between disparate plant gene databases both by identifying the evolutionary relationships among orthologous and paralogous protein sequences from different species and by enabling cross-references between different versions of the same gene curated independently by different database groups. The web interface enables sophisticated queries on lineage-specific patterns of gene/protein family proliferation and loss. This rich dataset is serving as a platform for the unification of sequence-anchored comparative maps across taxonomic families of plants. The Phytome web interface can be accessed at the following URL: http://www.phytome.org. Batch homology searches and bulk downloads are available upon free registration.

  12. Whole-genome sequencing for comparative genomics and de novo genome assembly.

    PubMed

    Benjak, Andrej; Sala, Claudia; Hartkoorn, Ruben C

    2015-01-01

    Next-generation sequencing technologies for whole-genome sequencing of mycobacteria are rapidly becoming an attractive alternative to more traditional sequencing methods. In particular this technology is proving useful for genome-wide identification of mutations in mycobacteria (comparative genomics) as well as for de novo assembly of whole genomes. Next-generation sequencing however generates a vast quantity of data that can only be transformed into a usable and comprehensible form using bioinformatics. Here we describe the methodology one would use to prepare libraries for whole-genome sequencing, and the basic bioinformatics to identify mutations in a genome following Illumina HiSeq or MiSeq sequencing, as well as de novo genome assembly following sequencing using Pacific Biosciences (PacBio).

  14. Comparative Analysis of Genome Sequences with VISTA

    DOE Data Explorer

    Dubchak, Inna

    VISTA is a comprehensive suite of programs and databases developed by and hosted at the Genomics Division of Lawrence Berkeley National Laboratory. They provide information and tools designed to facilitate comparative analysis of genomic sequences. Users have two ways to interact with the suite of applications at the VISTA portal. They can submit their own sequences and alignments for analysis (VISTA servers) or examine pre-computed whole-genome alignments of different species. A key menu option is the Enhancer Browser and Database at http://enhancer.lbl.gov/. The VISTA Enhancer Browser is a central resource for experimentally validated human noncoding fragments with gene enhancer activity as assessed in transgenic mice. Most of these noncoding elements were selected for testing based on their extreme conservation with other vertebrates. The results of this enhancer screen are provided through this publicly available website. The browser also features relevant results by external contributors and a large collection of additional genome-wide conserved noncoding elements which are candidate enhancer sequences. The LBL developers invite external groups to submit computational predictions of developmental enhancers. As of 10/19/2009 the database contains information on 1109 in vivo tested elements - 508 elements with enhancer activity.

  15. Comparative genome analysis of Basidiomycete fungi

    SciTech Connect

    Riley, Robert; Salamov, Asaf; Henrissat, Bernard; Nagy, Laszlo; Brown, Daren; Held, Benjamin; Baker, Scott; Blanchette, Robert; Boussau, Bastien; Doty, Sharon L.; Fagnan, Kirsten; Floudas, Dimitris; Levasseur, Anthony; Manning, Gerard; Martin, Francis; Morin, Emmanuelle; Otillar, Robert; Pisabarro, Antonio; Walton, Jonathan; Wolfe, Ken; Hibbett, David; Grigoriev, Igor

    2013-08-07

    Fungi of the phylum Basidiomycota (basidiomycetes) make up some 37% of the described fungi and are important in forestry, agriculture, medicine, and bioenergy. This diverse phylum includes symbionts, pathogens, and saprotrophs, including the majority of wood-decaying and ectomycorrhizal species. To better understand the genetic diversity of this phylum we compared the genomes of 35 basidiomycetes, including 6 newly sequenced genomes. These genomes span extremes of genome size, gene number, and repeat content. Analysis of core genes reveals that some 48% of basidiomycete proteins are unique to the phylum, with nearly half of those (22%) found in only one organism. Correlations between lifestyle and certain gene families are evident. Phylogenetic patterns of plant biomass-degrading genes in Agaricomycotina suggest a continuum rather than a dichotomy between the white rot and brown rot modes of wood decay. Based on phylogenetically informed PCA of wood decay genes, we predict that Botryobasidium botryosum and Jaapia argillacea have properties similar to white rot species, although neither has typical ligninolytic class II fungal peroxidases (PODs). This prediction is supported by growth assays in which both fungi exhibit wood decay with white rot-like characteristics. Based on this, we suggest that the white/brown rot dichotomy may be inadequate to describe the full range of wood-decaying fungi. Analysis of the rate of discovery of proteins with no or few homologs suggests the value of continued sequencing of basidiomycete fungi.

  16. Comparative genomics of biotechnologically important yeasts.

    PubMed

    Riley, Robert; Haridas, Sajeet; Wolfe, Kenneth H; Lopes, Mariana R; Hittinger, Chris Todd; Göker, Markus; Salamov, Asaf A; Wisecaver, Jennifer H; Long, Tanya M; Calvey, Christopher H; Aerts, Andrea L; Barry, Kerrie W; Choi, Cindy; Clum, Alicia; Coughlan, Aisling Y; Deshpande, Shweta; Douglass, Alexander P; Hanson, Sara J; Klenk, Hans-Peter; LaButti, Kurt M; Lapidus, Alla; Lindquist, Erika A; Lipzen, Anna M; Meier-Kolthoff, Jan P; Ohm, Robin A; Otillar, Robert P; Pangilinan, Jasmyn L; Peng, Yi; Rokas, Antonis; Rosa, Carlos A; Scheuner, Carmen; Sibirny, Andriy A; Slot, Jason C; Stielow, J Benjamin; Sun, Hui; Kurtzman, Cletus P; Blackwell, Meredith; Grigoriev, Igor V; Jeffries, Thomas W

    2016-08-30

    Ascomycete yeasts are metabolically diverse, with great potential for biotechnology. Here, we report the comparative genome analysis of 29 taxonomically and biotechnologically important yeasts, including 16 newly sequenced. We identify a genetic code change, CUG-Ala, in Pachysolen tannophilus in the clade sister to the known CUG-Ser clade. Our well-resolved yeast phylogeny shows that some traits, such as methylotrophy, are restricted to single clades, whereas others, such as l-rhamnose utilization, have patchy phylogenetic distributions. Gene clusters, with variable organization and distribution, encode many pathways of interest. Genomics can predict some biochemical traits precisely, but the genomic basis of others, such as xylose utilization, remains unresolved. Our data also provide insight into early evolution of ascomycetes. We document the loss of H3K9me2/3 heterochromatin, the origin of ascomycete mating-type switching, and panascomycete synteny at the MAT locus. These data and analyses will facilitate the engineering of efficient biosynthetic and degradative pathways and gateways for genomic manipulation. PMID:27535936
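    What a CUG codon reassignment means for translation can be shown with a small Python sketch. The standard codon table is built from its usual compact string form; the `translate` function and its `cug` parameter are hypothetical names chosen for illustration. CUG (CTG in DNA) is read as leucine under the standard code, as serine in the CUG-Ser clade, and as alanine in Pachysolen tannophilus according to the abstract above.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
STANDARD_CODE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq, cug="L"):
    """Translate a coding sequence, reading CUG as the given amino acid.

    cug="L" is the standard code, "S" the CUG-Ser reading and "A" the
    CUG-Ala reading. Trailing bases that do not fill a codon are dropped.
    """
    table = dict(STANDARD_CODE, CTG=cug)
    dna = seq.upper().replace("U", "T")
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))
```

    The same open reading frame therefore yields a different protein depending on the clade: translate("ATGCTGTAA") gives "ML*" under the standard code, "MS*" with cug="S", and "MA*" with cug="A".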

  17. COMPARISON OF COMPARATIVE GENOMIC HYBRIDIZATIONS TECHNOLOGIES ACROSS MICROARRAY PLATFORMS

    EPA Science Inventory

    Comparative Genomic Hybridization (CGH) measures DNA copy number differences between a reference genome and a test genome. The DNA samples are differentially labeled and hybridized to an immobilized substrate. In early CGH experiments, the DNA targets were hybridized to metaphase...

  18. Comparative genomics and evolution of eukaryotic phospholipid biosynthesis

    SciTech Connect

    Lykidis, Athanasios

    2006-12-01

    Phospholipid biosynthetic enzymes produce diverse molecular structures and are often present in multiple forms encoded by different genes. This work utilizes comparative genomics and phylogenetics for exploring the distribution, structure and evolution of phospholipid biosynthetic genes and pathways in 26 eukaryotic genomes. Although the basic structure of the pathways was formed early in eukaryotic evolution, the emerging picture indicates that individual enzyme families followed unique evolutionary courses. For example, choline and ethanolamine kinases and cytidylyltransferases emerged in ancestral eukaryotes, whereas multiple forms of the corresponding phosphatidyltransferases evolved mainly in a lineage-specific manner. Furthermore, several unicellular eukaryotes maintain bacterial-type enzymes and reactions for the synthesis of phosphatidylglycerol and cardiolipin. Also, base-exchange phosphatidylserine synthases are widespread and ancestral enzymes. The multiplicity of phospholipid biosynthetic enzymes has been largely generated by gene expansion in a lineage-specific manner. Thus, these observations suggest that phospholipid biosynthesis has been an actively evolving system. Finally, comparative genomic analysis indicates the existence of novel phosphatidyltransferases and provides a candidate for the uncharacterized eukaryotic phosphatidylglycerol phosphate phosphatase.

  19. Image analysis in comparative genomic hybridization

    SciTech Connect

    Lundsteen, C.; Maahr, J.; Christensen, B.

    1995-01-01

    Comparative genomic hybridization (CGH) is a new technique by which genomic imbalances can be detected by combining in situ suppression hybridization of whole genomic DNA and image analysis. We have developed software for rapid, quantitative CGH image analysis by a modification and extension of the standard software used for routine karyotyping of G-banded metaphase spreads in the Magiscan chromosome analysis system. The DAPI-counterstained metaphase spread is karyotyped interactively. Corrections for image shifts between the DAPI, FITC, and TRITC images are done manually by moving the three images relative to each other. The fluorescence background is subtracted. A mean filter is applied to smooth the FITC and TRITC images before the fluorescence ratio between the individual FITC- and TRITC-stained chromosomes is computed pixel by pixel inside the area of the chromosomes determined by the DAPI boundaries. Fluorescence intensity ratio profiles are generated, and peaks and valleys indicating possible gains and losses of test DNA are marked where the ratio rises above 1.25 or falls below 0.75. By combining the analysis of several metaphase spreads, consistent findings of gains and losses in all or almost all spreads indicate chromosomal imbalance. Chromosomal imbalances are detected either by visual inspection of fluorescence ratio (FR) profiles or by a statistical approach that compares FR measurements of the individual case with measurements of normal chromosomes. The complete analysis of one metaphase can be carried out in approximately 10 minutes. 8 refs., 7 figs., 1 tab.
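
    The ratio-profile step described in this abstract can be illustrated with a minimal Python sketch. It is not the Magiscan software: it works on a one-dimensional intensity profile along a chromosome rather than on full images, and it assumes that FITC carries the test DNA and TRITC the reference DNA, which the abstract does not state. The function names, the window size, and the single shared background value are hypothetical choices made for illustration; only the 0.75 and 1.25 thresholds come from the abstract.

```python
from statistics import mean

LOSS_THRESHOLD = 0.75  # ratio below this marks a possible loss (from the abstract)
GAIN_THRESHOLD = 1.25  # ratio above this marks a possible gain (from the abstract)


def mean_filter(values, window=3):
    """Smooth a 1-D intensity profile with a centered moving average.

    The window is truncated at the ends of the profile.
    """
    half = window // 2
    smoothed = []
    for i in range(len(values)):
        lo = max(0, i - half)
        hi = min(len(values), i + half + 1)
        smoothed.append(mean(values[lo:hi]))
    return smoothed


def ratio_profile(fitc, tritc, background=0.0, window=3):
    """Return the FITC/TRITC ratio at each position along one chromosome.

    Assumption: FITC is the test signal and TRITC the reference signal.
    Background is subtracted (clamped at zero) before smoothing.
    Positions with no reference signal yield None.
    """
    if len(fitc) != len(tritc):
        raise ValueError("profiles must have equal length")
    test = mean_filter([max(v - background, 0.0) for v in fitc], window)
    ref = mean_filter([max(v - background, 0.0) for v in tritc], window)
    return [t / r if r > 0 else None for t, r in zip(test, ref)]


def classify(ratios):
    """Label each position as gain, loss, balanced, or undefined."""
    labels = []
    for r in ratios:
        if r is None:
            labels.append("undefined")
        elif r < LOSS_THRESHOLD:
            labels.append("loss")
        elif r > GAIN_THRESHOLD:
            labels.append("gain")
        else:
            labels.append("balanced")
    return labels


if __name__ == "__main__":
    profile = ratio_profile([20, 10, 5], [10, 10, 10], window=1)
    print(list(zip(profile, classify(profile))))
```

    In practice the labels from several metaphase spreads would be combined, as the abstract notes, so that only gains and losses seen consistently across spreads are reported.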

  20. Comparative genome map of human and cattle

    SciTech Connect

    Solinas-Toldo, S.; Fries, R.; Lengauer, C.

    1995-06-10

    Chromosomal homologies between individual human chromosomes and the bovine karyotype have been established by using a new approach termed Zoo-FISH. Labeled DNA libraries from flow-sorted human chromosomes were used as probes for fluorescence in situ hybridization on cattle chromosomes. All human DNA libraries, except the Y chromosome library, hybridized to one or more cattle chromosomes, identifying and delineating 50 segments of homology, most of them corresponding to the regions of homology as identified by the previous mapping of individual conserved loci. However, Zoo-FISH refines the comparative maps constructed by molecular gene mapping of individual loci by providing information on the boundaries of conserved regions in the absence of obvious cytogenetic homologies of human and bovine chromosomes. It allows study of karyotypic evolution and opens new avenues for genomic analysis by facilitating the extrapolation of results from the human genome initiative. 50 refs., 3 figs., 1 tab.

  1. Comparative genome analyses reveal distinct structure in the saltwater crocodile MHC.

    PubMed

    Jaratlerdsiri, Weerachai; Deakin, Janine; Godinez, Ricardo M; Shan, Xueyan; Peterson, Daniel G; Marthey, Sylvain; Lyons, Eric; McCarthy, Fiona M; Isberg, Sally R; Higgins, Damien P; Chong, Amanda Y; John, John St; Glenn, Travis C; Ray, David A; Gongora, Jaime

    2014-01-01

    The major histocompatibility complex (MHC) is a dynamic genome region with an essential role in the adaptive immunity of vertebrates, especially antigen presentation. The MHC is generally divided into subregions (classes I, II and III) containing genes of similar function across species, but with different gene number and organisation. Crocodylia (crocodilians) are widely distributed and represent an evolutionarily distinct group among higher vertebrates, but the genomic organisation of MHC within this lineage has been largely unexplored. Here, we studied the MHC region of the saltwater crocodile (Crocodylus porosus) and compared it with that of other taxa. We characterised genomic clusters encompassing MHC class I and class II genes in the saltwater crocodile based on sequencing of bacterial artificial chromosomes. Six gene clusters spanning ∼452 kb were identified to contain nine MHC class I genes, six MHC class II genes, three TAP genes, and a TRIM gene. These MHC class I and class II genes were in separate scaffold regions and were greater in length (2-6 times longer) than their counterparts in well-studied fowl B loci, suggesting that the compaction of avian MHC occurred after the crocodilian-avian split. Comparative analyses between the saltwater crocodile MHC and that from the alligator and gharial showed large syntenic areas (>80% identity) with similar gene order. Comparisons with other vertebrates showed that the saltwater crocodile had MHC class I genes located along with TAP, consistent with birds studied. Linkage between MHC class I and TRIM39 observed in the saltwater crocodile resembled that of the eutherians compared but was absent in avian MHC, suggesting that the saltwater crocodile MHC has a gene organisation intermediate between these two lineages. These observations suggest that the structure of the saltwater crocodile MHC, and other crocodilians, can help determine the MHC that was present in the ancestors of archosaurs.

  2. Comparative Genome Analyses Reveal Distinct Structure in the Saltwater Crocodile MHC

    PubMed Central

    Jaratlerdsiri, Weerachai; Deakin, Janine; Godinez, Ricardo M.; Shan, Xueyan; Peterson, Daniel G.; Marthey, Sylvain; Lyons, Eric; McCarthy, Fiona M.; Isberg, Sally R.; Higgins, Damien P.; Chong, Amanda Y.; John, John St; Glenn, Travis C.; Ray, David A.; Gongora, Jaime

    2014-01-01

    The major histocompatibility complex (MHC) is a dynamic genome region with an essential role in the adaptive immunity of vertebrates, especially antigen presentation. The MHC is generally divided into subregions (classes I, II and III) containing genes of similar function across species, but with different gene number and organisation. Crocodylia (crocodilians) are widely distributed and represent an evolutionarily distinct group among higher vertebrates, but the genomic organisation of MHC within this lineage has been largely unexplored. Here, we studied the MHC region of the saltwater crocodile (Crocodylus porosus) and compared it with that of other taxa. We characterised genomic clusters encompassing MHC class I and class II genes in the saltwater crocodile based on sequencing of bacterial artificial chromosomes. Six gene clusters spanning ∼452 kb were identified to contain nine MHC class I genes, six MHC class II genes, three TAP genes, and a TRIM gene. These MHC class I and class II genes were in separate scaffold regions and were greater in length (2–6 times longer) than their counterparts in well-studied fowl B loci, suggesting that the compaction of avian MHC occurred after the crocodilian-avian split. Comparative analyses between the saltwater crocodile MHC and that from the alligator and gharial showed large syntenic areas (>80% identity) with similar gene order. Comparisons with other vertebrates showed that the saltwater crocodile had MHC class I genes located along with TAP, consistent with birds studied. Linkage between MHC class I and TRIM39 observed in the saltwater crocodile resembled that of the eutherians compared but was absent in avian MHC, suggesting that the saltwater crocodile MHC has a gene organisation intermediate between these two lineages. These observations suggest that the structure of the saltwater crocodile MHC, and other crocodilians, can help determine the MHC that was present in the ancestors of archosaurs. PMID:25503521

  3. Survival analysis of infected mice reveals pathogenic variations in the genome of avian H1N1 viruses

    PubMed Central

    Koçer, Zeynep A.; Fan, Yiping; Huether, Robert; Obenauer, John; Webby, Richard J.; Zhang, Jinghui; Webster, Robert G.; Wu, Gang

    2014-01-01

    Most influenza pandemics have been caused by H1N1 viruses of purely or partially avian origin. Here, using a Cox proportional hazards model, we attempt to identify the genetic variations in the whole genome of wild-type North American avian H1N1 influenza A viruses that are associated with their virulence in mice by residue variations, host origins of virus (Anseriformes-ducks or Charadriiformes-shorebirds), and host-residue interactions. In addition, through structural modeling, we predicted that several polymorphic sites associated with pathogenicity were located in structurally important sites, especially in the polymerase complex and NS genes. Our study introduces a new approach to identify pathogenic variations in wild-type viruses circulating in the natural reservoirs and ultimately to understand their infectious risks to humans as part of risk assessment efforts towards the emergence of future pandemic strains. PMID:25503687

  4. Genome sequence of a natural reassortant H5N2 avian influenza virus from domestic mallard ducks in eastern China.

    PubMed

    Gu, Min; Huang, Junqing; Chen, Yuxin; Chen, Jian; Wang, Xiaoquan; Liu, Xiaowen; Liu, Xiufan

    2012-11-01

    Here, we report the genomic sequence of a Chinese reassortant H5N2 avian influenza virus which possessed the polybasic motif PLREKRRK-R/GL at the hemagglutinin cleavage site. Phylogenetic analysis showed that all eight genes were of the Eurasian lineage, five of which were highly homologous to the endemic clade 2.3.4 H5N1 viruses and their H5N5 reassortant descendants. These data suggested that novel multisubtypic NA reassortants bearing the H5N1 backbone could be generated through genetic reassortment in H5N1 circulating regions, which will help in understanding the evolution and segment reassortment mechanism of H5 subtype avian influenza viruses.

  5. Comparative bacterial genomics: defining the minimal core genome.

    PubMed

    Huang, C H; Hsiang, T; Trevors, J T

    2013-02-01

    A comparative genomics analysis revealed 702 genes present in the bacterial Gram-negative core gene set (92 species analyzed) and 959 genes in the Gram-positive core gene set (93 species analyzed). Mycoplasma genitalium, which has the smallest known genome (517 genes) of a non-symbiont, was used in a three-way reciprocal analysis with the Gram-negative core genes and the Gram-positive core genes, and 151 common bacterial core genes were found. Of these 151 core genes, 39 were putative genes encoding the 30S and 50S ribosomal subunits, whilst among recognized cell division genes, only one gene, the major ftsZ, was present. In addition, 86 reciprocal matches were identified between the 151 common bacterial genes and a previously determined 2,723 common eukaryotic core gene set. An analysis was also done to optimize the threshold bit score used to declare that genes were homologous, and a bit score cutoff of 40 was selected.
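
    The abstract does not spell out how its reciprocal matches were computed, so the following Python sketch shows one common approach, reciprocal best hits, purely as an illustration. It assumes that similarity search results are already available as (query, subject, bit score) tuples and that a hit scoring exactly at the cutoff is accepted; the function names and the toy gene identifiers are hypothetical. Only the bit score cutoff of 40 is taken from the abstract.

```python
BIT_SCORE_CUTOFF = 40.0  # homology threshold reported in the abstract


def best_hits(hits, cutoff=BIT_SCORE_CUTOFF):
    """Map each query gene to its highest-scoring subject gene.

    `hits` is an iterable of (query, subject, bit_score) tuples.
    Hits scoring below `cutoff` are ignored (assumption: a score equal
    to the cutoff counts as homologous).
    """
    best = {}
    for query, subject, score in hits:
        if score < cutoff:
            continue
        if query not in best or score > best[query][1]:
            best[query] = (subject, score)
    return {query: subject for query, (subject, _) in best.items()}


def reciprocal_best_hits(a_vs_b, b_vs_a, cutoff=BIT_SCORE_CUTOFF):
    """Return sorted (a, b) pairs that are each other's best hit."""
    forward = best_hits(a_vs_b, cutoff)
    reverse = best_hits(b_vs_a, cutoff)
    return sorted((a, b) for a, b in forward.items() if reverse.get(b) == a)


if __name__ == "__main__":
    # Toy data with made-up gene identifiers.
    a_vs_b = [("a1", "b1", 120.0), ("a1", "b2", 60.0),
              ("a2", "b2", 35.0), ("a3", "b3", 80.0)]
    b_vs_a = [("b1", "a1", 118.0), ("b2", "a2", 90.0), ("b3", "a1", 50.0)]
    print(reciprocal_best_hits(a_vs_b, b_vs_a))
```

    A three-way analysis of the kind the abstract mentions could be approximated by keeping only genes that form reciprocal pairs in every pairwise comparison, although whether the authors did exactly this is not stated.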

  6. A Novel Recombinant Retrovirus in the Genomes of Modern Birds Combines Features of Avian and Mammalian Retroviruses

    PubMed Central

    Henzy, Jamie E.; Gifford, Robert J.; Johnson, Welkin E.

    2014-01-01

    ABSTRACT Endogenous retroviruses (ERVs) represent ancestral sequences of modern retroviruses or their extinct relatives. The majority of ERVs cluster alongside exogenous retroviruses into two main groups based on phylogenetic analyses of the reverse transcriptase (RT) enzyme. Class I includes gammaretroviruses, and class II includes lentiviruses and alpha-, beta-, and deltaretroviruses. However, analyses of the transmembrane subunit (TM) of the envelope glycoprotein (env) gene result in a different topology for some retroviruses, suggesting recombination events in which heterologous env sequences have been acquired. We previously demonstrated that the TM sequences of five of the six genera of orthoretroviruses can be divided into three types, each of which infects a distinct set of vertebrate classes. Moreover, these classes do not always overlap the host range of the associated RT classes. Thus, recombination resulting in acquisition of a heterologous env gene could in theory facilitate cross-species transmissions across vertebrate classes, for example, from mammals to reptiles. Here we characterized a family of class II avian ERVs, “TgERV-F,” that acquired a mammalian gammaretroviral env sequence. Although TgERV-F clusters near a sister clade to alpharetroviruses, its genome also has some features of betaretroviruses. We offer evidence that this unusual recombinant has circulated among several avian orders and may still have infectious members. In addition to documenting the infection of a nongalliform avian species by a mammalian retrovirus, TgERV-F also underscores the importance of env sequences in reconstructing phylogenies and supports a possible role for env swapping in allowing cross-species transmissions across wide taxonomic distances. IMPORTANCE Retroviruses can sometimes acquire an envelope gene (env) from a distantly related retrovirus. Since env is a key determinant of host range, such an event affects the host range of the recombinant virus and

  8. Comparative genomics of ten solanaceous plastomes.

    PubMed

    Kaur, Harpreet; Singh, Bhupinder Pal; Singh, Harpreet; Nagpal, Avinash Kaur

    2014-01-01

    Availability of complete plastid genomes of ten solanaceous species, Atropa belladonna, Capsicum annuum, Datura stramonium, Nicotiana sylvestris, Nicotiana tabacum, Nicotiana tomentosiformis, Nicotiana undulata, Solanum bulbocastanum, Solanum lycopersicum, and Solanum tuberosum provided us with an opportunity to conduct their in silico comparative analysis in depth. The size of complete chloroplast genomes and LSC and SSC regions of three species of Solanum is comparatively smaller than that of any other species studied to date (exception: SSC region of A. belladonna). AT content of coding regions was found to be lower than that of noncoding regions. A duplicate copy of trnH gene in C. annuum and two alternative tRNA genes for proline in D. stramonium were observed for the first time in this analysis. Further, homology search revealed the presence of rps19 pseudogene and infA genes in A. belladonna and D. stramonium, a region identical to rps19 pseudogene in C. annuum and orthologues of sprA gene in another six species. Among the eighteen intron-containing genes, 3 genes have two introns and 15 genes have one intron. The longest insertion was found in accD gene in C. annuum. Phylogenetic analysis using concatenated protein coding sequences gave two clades, one for Nicotiana species and another for Solanum, Capsicum, Atropa, and Datura. PMID:25477958

  9. Temporal Dynamics of Avian Populations during Pleistocene Revealed by Whole-Genome Sequences.

    PubMed

    Nadachowska-Brzyska, Krystyna; Li, Cai; Smeds, Linnea; Zhang, Guojie; Ellegren, Hans

    2015-05-18

    Global climate fluctuations have significantly influenced the distribution and abundance of biodiversity. During unfavorable glacial periods, many species experienced range contraction and fragmentation, expanding again during interglacials. An understanding of the evolutionary consequences of both historical and ongoing climate changes requires knowledge of the temporal dynamics of population numbers during such climate cycles. Variation in abundance should have left clear signatures in the patterns of intraspecific genetic variation in extant species, from which historical effective population sizes (N(e)) can be estimated. We analyzed whole-genome sequences of 38 avian species in a pairwise sequentially Markovian coalescent (PSMC, [5]) framework to quantitatively reveal changes in N(e) from approximately 10 million to 10 thousand years ago. Significant fluctuations in N(e) over time were evident for most species. The most pronounced pattern observed in many species was a severe reduction in N(e) coinciding with the beginning of the last glacial period (LGP). Among species, N(e) varied by at least three orders of magnitude, exceeding 1 million in the most abundant species. Several species on the IUCN Red List of Threatened Species showed long-term reduction in population size, predating recent declines. We conclude that cycles of population expansions and contractions have been a common feature of many bird species during the Quaternary period, likely coinciding with climate cycles. Population size reduction should have increased the risk of extinction but may also have promoted speciation. Species that have experienced long-term declines may be especially vulnerable to recent anthropogenic threats. PMID:25891404

  10. Temporal Dynamics of Avian Populations during Pleistocene Revealed by Whole-Genome Sequences

    PubMed Central

    Nadachowska-Brzyska, Krystyna; Li, Cai; Smeds, Linnea; Zhang, Guojie; Ellegren, Hans

    2015-01-01

    Summary Global climate fluctuations have significantly influenced the distribution and abundance of biodiversity [1]. During unfavorable glacial periods, many species experienced range contraction and fragmentation, expanding again during interglacials [2–4]. An understanding of the evolutionary consequences of both historical and ongoing climate changes requires knowledge of the temporal dynamics of population numbers during such climate cycles. Variation in abundance should have left clear signatures in the patterns of intraspecific genetic variation in extant species, from which historical effective population sizes (Ne) can be estimated [3]. We analyzed whole-genome sequences of 38 avian species in a pairwise sequentially Markovian coalescent (PSMC, [5]) framework to quantitatively reveal changes in Ne from approximately 10 million to 10 thousand years ago. Significant fluctuations in Ne over time were evident for most species. The most pronounced pattern observed in many species was a severe reduction in Ne coinciding with the beginning of the last glacial period (LGP). Among species, Ne varied by at least three orders of magnitude, exceeding 1 million in the most abundant species. Several species on the IUCN Red List of Threatened Species showed long-term reduction in population size, predating recent declines. We conclude that cycles of population expansions and contractions have been a common feature of many bird species during the Quaternary period, likely coinciding with climate cycles. Population size reduction should have increased the risk of extinction but may also have promoted speciation. Species that have experienced long-term declines may be especially vulnerable to recent anthropogenic threats. PMID:25891404

  11. The comparative genomics of viral emergence

    PubMed Central

    Holmes, Edward C.

    2010-01-01

    RNA viruses are the main agents of emerging and re-emerging diseases. It is therefore important to reveal the evolutionary processes that underpin their ability to jump species boundaries and establish themselves in new hosts. Here, I discuss how comparative genomics can contribute to this endeavor. Arguably the most important evolutionary process in RNA virus evolution, abundant mutation, may even open up avenues for their control through “lethal mutagenesis.” Despite this remarkable mutational power, adaptation to diverse host species remains a major adaptive challenge, such that the most common outcome of host jumps are short-term “spillover” infections. A powerful case study of the utility of genomic approaches to studies of viral evolution and emergence is provided by influenza virus and brought into sharp focus by the ongoing epidemic of swine-origin H1N1 influenza A virus (A/H1N1pdm). Research here reveals a marked lack of surveillance of influenza viruses in pigs, coupled with the possibility of cryptic transmission before the first reported human cases, such that the exact genesis of A/H1N1pdm (where, when, how) is uncertain. PMID:19858482

  12. Comparative Genome Analysis in the Integrated Microbial Genomes (IMG) System

    SciTech Connect

    Kyrpides, Nikos C.; Markowitz, Victor M.

    2006-03-01

    Comparative genome analysis is critical for the effective exploration of a rapidly growing number of complete and draft sequences for microbial genomes. The Integrated Microbial Genomes (IMG) system (img.jgi.doe.gov) has been developed as a community resource that provides support for comparative analysis of microbial genomes in an integrated context. IMG allows users to navigate the multidimensional microbial genome data space and focus their analysis on a subset of genes, genomes, and functions of interest. IMG provides graphical viewers, summaries and occurrence profile tools for comparing genes, pathways and functions (terms) across specific genomes. Genes can be further examined using gene neighborhoods and compared with sequence alignment tools.

  13. Avian papillomaviruses: the parrot Psittacus erithacus papillomavirus (PePV) genome has a unique organization of the early protein region and is phylogenetically related to the chaffinch papillomavirus

    PubMed Central

    Tachezy, Ruth; Rector, Annabel; Havelkova, Marta; Wollants, Elke; Fiten, Pierre; Opdenakker, Ghislain; Jenson, A Bennett; Sundberg, John P; Van Ranst, Marc

    2002-01-01

    Background An avian papillomavirus genome has been cloned from a cutaneous exophytic papilloma from an African grey parrot (Psittacus erithacus). The nucleotide sequence, genome organization, and phylogenetic position of the Psittacus erithacus papillomavirus (PePV) were determined. This PePV sequence represents the first complete avian papillomavirus genome defined. Results The PePV genome (7304 basepairs) differs from other papillomaviruses, in that it has a unique organization of the early protein region lacking classical E6 and E7 open reading frames. Phylogenetic comparison of the PePV sequence with partial E1 and L1 sequences of the chaffinch (Fringilla coelebs) papillomavirus (FPV) reveals that these two avian papillomaviruses form a monophyletic cluster with a common branch that originates near the unresolved center of the papillomavirus evolutionary tree. Conclusions The PePV genome has a unique layout of the early protein region which represents a novel prototypic genomic organization for avian papillomaviruses. The close relationship between PePV and FPV, and between their Psittaciformes and Passeriformes hosts, supports the hypothesis that papillomaviruses have co-evolved and speciated together with their host species throughout evolution. PMID:12110158

  14. Avihepadnavirus diversity in parrots is comparable to that found amongst all other avian species.

    PubMed

    Piasecki, Tomasz; Harkins, Gordon W; Chrząstek, Klaudia; Julian, Laurel; Martin, Darren P; Varsani, Arvind

    2013-04-10

    Avihepadnaviruses have previously been isolated from various species of duck, goose, stork, heron and crane. Recently the first parrot avihepadnavirus was isolated from a Ring-necked Parakeet in Poland. In this study, 41 psittacine liver samples archived in Poland over the last nine years were tested for presence of Parrot hepatitis B virus (PHBV). We cloned and sequenced PHBV isolates from 18 birds including a Crimson Rosella, an African grey parrot and sixteen Ring-necked Parakeets. PHBV isolates display a degree of diversity (>78% genome-wide pairwise identity) that is comparable to that found amongst all other avihepadnaviruses (>79% genome-wide pairwise identity). The PHBV viruses can be subdivided into seven genetically distinct groups (tentatively named A-G), of which the two isolates of PHBV-G are the most divergent, sharing ∼79% genome-wide pairwise identity with all other PHBVs. All PHBV isolates display classical avihepadnavirus genome architecture.

  15. The bonobo genome compared with the chimpanzee and human genomes.

    PubMed

    Prüfer, Kay; Munch, Kasper; Hellmann, Ines; Akagi, Keiko; Miller, Jason R; Walenz, Brian; Koren, Sergey; Sutton, Granger; Kodira, Chinnappa; Winer, Roger; Knight, James R; Mullikin, James C; Meader, Stephen J; Ponting, Chris P; Lunter, Gerton; Higashino, Saneyuki; Hobolth, Asger; Dutheil, Julien; Karakoç, Emre; Alkan, Can; Sajjadian, Saba; Catacchio, Claudia Rita; Ventura, Mario; Marques-Bonet, Tomas; Eichler, Evan E; André, Claudine; Atencia, Rebeca; Mugisha, Lawrence; Junhold, Jörg; Patterson, Nick; Siebauer, Michael; Good, Jeffrey M; Fischer, Anne; Ptak, Susan E; Lachmann, Michael; Symer, David E; Mailund, Thomas; Schierup, Mikkel H; Andrés, Aida M; Kelso, Janet; Pääbo, Svante

    2012-06-28

    Two African apes are the closest living relatives of humans: the chimpanzee (Pan troglodytes) and the bonobo (Pan paniscus). Although they are similar in many respects, bonobos and chimpanzees differ strikingly in key social and sexual behaviours, and for some of these traits they show more similarity with humans than with each other. Here we report the sequencing and assembly of the bonobo genome to study its evolutionary relationship with the chimpanzee and human genomes. We find that more than three per cent of the human genome is more closely related to either the bonobo or the chimpanzee genome than these are to each other. These regions allow various aspects of the ancestry of the two ape species to be reconstructed. In addition, many of the regions that overlap genes may eventually help us understand the genetic basis of phenotypes that humans share with one of the two apes to the exclusion of the other.

  16. [Comparative genomic classification of human hepatocellular carcinoma].

    PubMed

    Kaposi-Novák, Pál

    2009-03-01

    Global transcriptome analysis has been successfully applied to characterize various human tumors, including hepatocellular carcinomas. This novel technology can facilitate early diagnosis, as well as prognostic and therapeutic diversification of cancer patients. To enhance access to the genomic information buried in archived pathology samples, we assessed RT-PCR amplification rates in paraffin-embedded tissues preserved in three different fixatives. Reliable amplification could be achieved from all paraffin-embedded specimens, when the amplicon size did not exceed 225 bp. A longer amplicon size resulted in rapid decrease of yield and reproducibility. In addition, formalin provided superior morphology and better reactivity with claudin-4 and -7 immunohistochemistry. Amplification of the initial sample is often required before transcriptome analysis of clinical specimens could be performed. We introduced a random nonamer primed T3 polymerase reaction into the conventional linear RNA amplification protocol. The modified T3T7 method generated a sense strand product ideal for synthesizing indirectly labeled cDNA templates. Microarray analysis of amplified frozen and laser-microdissected Myc and Myc/TGFalpha mouse liver tumors confirmed good reproducibility (r=0.9) of the reaction and conservation of original transcriptional patterns (r=0.78). Finally, we tested the utility of expression profiling for the classification of human HCC samples. By comparing expression data from HGF-treated c-Met conditional knock-out and control primary mouse hepatocytes, we identified 690 HGF/c-Met target genes. Functional analysis of the significant gene set implicated c-Met as key regulator of hepatocyte motility and oxidative homeostasis. Cross comparison of the c-Met-induced transcription signature with human HCC expression profiles revealed a group of tumors (27%) with potentially activated c-Met signaling (MET+). These tumors were characterized by higher vascular invasion rate

  17. Comparative Chloroplast Genomes of Camellia Species

    PubMed Central

    Li, Hong-Tao; Yang, Jing; Li, De-Zhu

    2013-01-01

    Background Camellia, comprising more than 200 species, is a valuable economic commodity due to its enormously popular commercial products: tea leaves, flowers, and high-quality edible oils. It is the largest and most important genus in the family Theaceae. However, phylogenetic resolution of the species has proven to be difficult. Consequently, the interspecies relationships of the genus Camellia are still hotly debated. Phylogenomics is an attractive avenue that can be used to reconstruct the tree of life, especially at low taxonomic levels. Methodology/Principal Findings Seven complete chloroplast (cp) genomes were sequenced from six species representing different subdivisions of the genus Camellia using Illumina sequencing technology. Four junctions between the single-copy segments and the inverted repeats were confirmed and genome assemblies were validated by PCR-based product sequencing using 123 pairs of primers covering preliminary cp genome assemblies. The length of the Camellia cp genome was found to be about 157kb, which contained 123 unique genes and 23 were duplicated in the IR regions. We determined that the complete Camellia cp genome was relatively well conserved, but contained enough genetic differences to provide useful phylogenetic information. Phylogenetic relationships were analyzed using seven complete cp genomes of six Camellia species. We also identified rapidly evolving regions of the cp genome that have the potential to be used for further species identification and phylogenetic resolution. Conclusions/Significance In this study, we wanted to determine if analyzing completely sequenced cp genomes could help settle these controversies of interspecies relationships in Camellia. The results demonstrate that cp genome data are beneficial in resolving species definition because they indicate that organelle-based “barcodes”, can be established for a species and then used to unmask interspecies phylogenetic relationships. It reveals that

  18. Comparative genomic hybridization in clinical cytogenetics

    SciTech Connect

    Bryndorf, T.; Kirchhoff, M.; Rose, H.

    1995-11-01

    We report the results of applying comparative genomic hybridization (CGH) in a cytogenetic service laboratory for (1) determination of the origin of extra and missing chromosomal material in intricate cases of unbalanced aberrations and (2) detection of common prenatal numerical chromosome aberrations. A total of 11 fetal samples were analyzed. Seven cases of complex unbalanced aberrations that could not be identified reliably by conventional cytogenetics were successfully resolved by CGH analysis. CGH results were validated by using FISH with chromosome-specific probes. Four cases representing common prenatal numerical aberrations (trisomy 21, 18, and 13 and monosomy X) were also successfully diagnosed by CGH. We conclude that CGH is a powerful adjunct to traditional cytogenetic techniques that makes it possible to solve clinical cases of intricate unbalanced aberrations in a single hybridization. CGH may also be a useful adjunct to screen for euchromatic involvement in marker chromosomes. Further technical development may render CGH applicable for routine aberration screening. 16 refs., 4 figs., 2 tabs.

  19. In vitro development of Haemoproteus columbae (Haemosporida: Haemoproteidae), with perspectives for genomic studies of avian haemosporidian parasites.

    PubMed

    Coral, Arelis A; Valkiūnas, Gediminas; González, Angie D; Matta, Nubia E

    2015-10-01

    The evolutionary origin of wildlife and human malaria parasites (Plasmodium spp.) has been discussed for several decades. The lack of genomic data about species of wildlife haemosporidian parasites related to Plasmodium limits the number of taxa available for phylogenetic analysis. Genomic data about avian parasites of the genus Haemoproteus, the sister genus to Plasmodium, are still not available, mainly due to difficulties in obtaining pure DNA of parasites inhabiting nucleated avian host cells. Recent studies show that microgametes of Haemoproteus (Parahaemoproteus) spp. develop in vitro and can be isolated by simple centrifugation, allowing the isolation of pure parasite DNA for genomic studies. However, in vitro development of Haemoproteus (Haemoproteus) spp. has not been investigated, and it is unclear if microgametes of these parasites also can be obtained under in vitro conditions. Here, we provide the first data about the in vitro development of Haemoproteus (Haemoproteus) columbae, a widespread avian haemosporidian parasite, which is specific to pigeons and doves (Columbiformes) and is transmitted by hippoboscid flies (Diptera, Hippoboscidae). In vitro gametogenesis and ookinete development of H. columbae were studied using a strain isolated from a feral Rock Pigeon (Columba livia) in Bogotá, Colombia. The morphological events leading to exflagellation, fertilization and ookinete formation, as well as the rate of development of these stages, were followed in vitro at 40 °C, 19 °C and 15 °C for 48 h. Macrogametes, microgametes, zygotes and initial stages of ookinete development were observed at all temperatures, but mature ookinetes were seen only at 40 °C. The largest diversity of sporogonic stages of H. columbae was present at 40 °C; however, exflagellation, fertilization of macrogametes and development of immature ookinetes were also observed at 15 °C and 19 °C. Morphological and morphometric features of these stages in vitro were

  20. Genome Wide Host Gene Expression Analysis in Chicken Lungs Infected with Avian Influenza Viruses

    PubMed Central

    Gandhale, Pradeep N.; Kumar, Himanshu; Kulkarni, Diwakar D.

    2016-01-01

    The molecular pathogenesis of avian influenza infection varies greatly with individual bird species and virus strain. The molecular pathogenesis of the highly pathogenic avian influenza virus (HPAIV) or the low pathogenic avian influenza virus (LPAIV) infection in avian species remains poorly understood. Thus, the global immune response of chickens infected with HPAI H5N1 (A/duck/India/02CA10/2011) and LPAI H9N2 (A/duck/India/249800/2010) viruses was studied using microarray to identify crucial host genetic components responsive to these infections. HPAI H5N1 virus induced excessive expression of type I IFNs (IFNA and IFNG), cytokines (IL1B, IL18, IL22, IL13, and IL12B), chemokines (CCL4, CCL19, CCL10, and CX3CL1) and IFN stimulated genes (OASL, MX1, RSAD2, IFITM5, IFIT5, GBP 1, and EIF2AK) in lung tissues. This dysregulation of host innate immune genes may be the critical determinant of the severity and the outcome of the influenza infection in chickens. In contrast, the expression of most of these genes was not induced in the lungs of LPAI H9N2 virus infected chickens. This study indicated the relationship between host immune genes and their roles in pathogenesis of HPAIV infection in chickens. PMID:27071061

  1. Comparative pathogenicity of early and recent isolates of avian metapneumovirus subtype C in turkeys

    PubMed Central

    Velayudhan, Binu T.; Noll, Sally L.; Thachil, Anil J.; Halvorson, David A.; Shaw, Daniel P.; Goyal, Sagar M.; Nagaraja, Kakambi V.

    2008-01-01

    The objective of the present study was to compare the pathogenicity of early and recent isolates of avian metapneumovirus subtype-C (aMPV-C) in turkeys. Two-week-old turkeys were inoculated with early and recent isolates of aMPV-C. Clinical signs were monitored. Tissues were examined for viral ribonucleic acid (RNA), lesions, and viral antigen by reverse transcription-polymerase chain reaction (RT-PCR), histopathology and immunohistochemistry, respectively. Birds infected with the recent isolate had higher clinical sign scores than those infected with the early isolate. Only the recent isolate produced a multifocal loss of cilia in the nasal turbinate of infected birds. Immunohistochemistry revealed intense staining of aMPV antigen in turbinate and trachea of birds infected with the recent isolate. The findings indicate that the recent isolate produced more severe clinical signs and lesions in turkeys compared to the early isolate. The recent isolate could be ideal for the development of a challenge model for aMPV infection in turkeys. PMID:18783028

  2. Comparative pathogenesis of a subtype A with a subtype B avian pneumovirus in turkeys.

    PubMed

    Van de Zande, S; Nauwynck, H; De Jonghe, S; Pensaert, M

    1999-06-01

    This paper describes a study in which the pathogenesis of avian pneumovirus strains isolated in Belgium and belonging to the two subtypes A and B was compared in 2-week-old turkeys. After oculonasal inoculation, animals were either observed for clinical signs or killed for pathological and virological examination. Virus titration and immunofluorescence were performed on the conjunctivae, turbinates, sinuses, upper and lower part of the trachea, lungs and air sacs. No differences were seen between the two subtypes concerning respiratory signs, or macroscopic and microscopic lesions in the respiratory tract. Slight variations were found in site and extent of virus replication. First, only subtype A was able to invade the lower parts of the respiratory tract (bronchi), whereas viral antigens were not detected in the lungs with subtype B. Secondly, the subtype A strain infected two times more epithelial cells at all levels of the upper respiratory tract compared to subtype B. Thirdly, the amount of virus produced at different sites along the respiratory tract was lower in subtype B-inoculated turkeys than in subtype A-inoculated ones.

  3. Comparative avian phylogeography of Cameroon and equatorial Guinea mountains: implications for conservation.

    PubMed

    Smith, T B; Holder, K; Girman, D; O'Keefe, K; Larison, B; Chan, Y

    2000-10-01

    We illustrate the use of Faith's 'Phylogenetic Diversity' measure to compare the phylogeographic structure of two bird species with patterns of avian endemism across six mountains in Cameroon and Equatorial Guinea. The Mountain Greenbul and Cameroon Blue-headed Sunbird showed phylogeographic patterns that together defined three biogeographic regions: Bioko, Mt. Cameroon, and the northern mountains of Cameroon. In contrast, the distributions of endemic species were largely a function of geographical distance, with close mountains sharing more endemic species than distant mountains. Moreover, for both species, populations on Mt. Cameroon were distinctive with respect to the ecologically relevant character bill size. Our results, while preliminary, illustrate the utility of a comparative approach for identifying geographical regions that harbour evolutionarily distinct populations and caution against using only the distributional patterns of endemics to prioritize regions for conservation. Results show that patterns of endemism may not be concordant with patterns of phylogenetic diversity or with morphological variation in a character important in fitness. While incorporation of additional species from unrelated taxa will be necessary to draw definitive conclusions about evolutionarily distinct regions, our preliminary results suggest a conservation approach for the Afromontane region of the Gulf of Guinea that would: (i) emphasize protection of both Bioko and Mt. Cameroon, thereby maximizing preservation of within-species phylogenetic and morphologic diversity; (ii) emphasize protection within the northern mountains to further conserve intraspecific phylogenetic diversity and maximize protection of endemic species.
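    As an illustration of the measure named in this record, the sketch below computes Faith's Phylogenetic Diversity for sets of lineages on a toy rooted tree: the sum of the branch lengths that connect the chosen tips to the root, with each branch counted once. The tree, the tip names, the branch lengths and the site groupings are all hypothetical, and the rooted convention used here is an assumption; the abstract does not state how the authors computed the measure.

```python
# Toy rooted tree stored as child -> (parent, branch_length).
# All node names and branch lengths are hypothetical illustration values.
TREE = {
    "A": ("n1", 1.0),
    "B": ("n1", 2.0),
    "n1": ("root", 0.5),
    "C": ("n2", 1.5),
    "D": ("n2", 1.0),
    "n2": ("root", 2.0),
}


def faith_pd(tree, taxa):
    """Sum the branch lengths on the paths from each taxon to the root.

    Each branch is counted once, even when several taxa share it.
    """
    seen = set()
    total = 0.0
    for taxon in taxa:
        node = taxon
        # Walk towards the root; stop at the root or at a branch already counted.
        while node in tree and node not in seen:
            seen.add(node)
            parent, length = tree[node]
            total += length
            node = parent
    return total


# Hypothetical lineage lists for three sampling sites.
SITES = {
    "site_1": ["A", "B"],
    "site_2": ["A", "C"],
    "site_3": ["A", "B", "C", "D"],
}
pd_by_site = {site: faith_pd(TREE, taxa) for site, taxa in SITES.items()}
```

    In this toy case site_1 and site_2 hold the same number of lineages, yet site_2 scores higher because its two lineages sit on opposite sides of the root. That is the kind of contrast with simple counts that the abstract draws between phylogenetic diversity and patterns of endemism.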

  4. Comparative Genomics of the Campylobacter lari Group

    PubMed Central

    Miller, William G.; Yee, Emma; Chapman, Mary H.; Smith, Timothy P.L.; Bono, James L.; Huynh, Steven; Parker, Craig T.; Vandamme, Peter; Luong, Khai; Korlach, Jonas

    2014-01-01

    The Campylobacter lari group is a phylogenetic clade within the epsilon subdivision of the Proteobacteria and is part of the thermotolerant Campylobacter spp., a division within the genus that includes the human pathogen Campylobacter jejuni. The C. lari group is currently composed of five species (C. lari, Campylobacter insulaenigrae, Campylobacter volucris, Campylobacter subantarcticus, and Campylobacter peloridis), as well as a group of strains termed the urease-positive thermophilic Campylobacter (UPTC) and other C. lari-like strains. Here we present the complete genome sequences of 11 C. lari group strains, including the five C. lari group species, four UPTC strains, and a lari-like strain isolated in this study. The genome of C. lari subsp. lari strain RM2100 was described previously. Analysis of the C. lari group genomes indicates that this group is highly related at the genome level. Furthermore, these genomes are strongly syntenic with minor rearrangements occurring only in 4 of the 12 genomes studied. The C. lari group can be bifurcated, based on the flagella and flagellar modification genes. Genomic analysis of the UPTC strains indicated that these organisms are variable but highly similar, closely related to but distinct from C. lari. Additionally, the C. lari group contains multiple genes encoding hemagglutination domain proteins, which are either contingency genes or linked to conserved contingency genes. Many of the features identified in strain RM2100, such as major deficiencies in amino acid biosynthesis and energy metabolism, are conserved across all 12 genomes, suggesting that these common features may play a role in the association of the C. lari group with coastal environments and watersheds. PMID:25381664

  5. Evolution of mammalian genome organization inferred from comparative gene mapping

    PubMed Central

    Murphy, William J; Stanyon, Roscoe; O'Brien, Stephen J

    2001-01-01

    Comparative genome analyses, including chromosome painting in over 40 diverse mammalian species, ordered gene maps from several representatives of different mammalian and vertebrate orders, and large-scale sequencing of the human and mouse genomes are beginning to provide insight into the rates and patterns of chromosomal evolution on a whole-genome scale, as well as into the forces that have sculpted the genomes of extant mammalian species. PMID:11423011

  6. Avian haemosporidian parasites (Haemosporida): A comparative analysis of different polymerase chain reaction assays in detection of mixed infections.

    PubMed

    Bernotienė, Rasa; Palinauskas, Vaidas; Iezhova, Tatjana; Murauskaitė, Dovilė; Valkiūnas, Gediminas

    2016-04-01

    Mixed infections of different species and genetic lineages of haemosporidian parasites (Haemosporida) predominate in wildlife, and such infections are particularly virulent. However, currently used polymerase chain reaction (PCR)-based detection methods often do not read mixed infections. Sensitivity of different PCR assays in detection of mixed infections has been insufficiently tested, but this knowledge is essential in studies addressing parasite diversity in wildlife. Here, we applied five different PCR assays, which are broadly used in wildlife avian haemosporidian research, and compared their sensitivity in detection of experimentally designed mixed infections of Haemoproteus and Plasmodium parasites. Three of these PCR assays use primer sets that amplify fragments of the cytochrome b gene (cyt b), one targets the cytochrome oxidase subunit I (COI) gene, and one targets the apicoplast genome. We collected blood from wild-caught birds and, using microscopic and PCR-based methods applied in parallel, identified single infections of ten haemosporidian species with similar parasitemia. Then, we prepared 15 experimental mixes of different haemosporidian parasites, which often are present simultaneously in wild birds. A similar concentration of total DNA was used for each parasite lineage during preparation of mixes. Positive amplifications were sequenced, and the presence of mixed infections was reported by visualising double-base calling in sequence electropherograms. This study shows that the use of each single PCR assay markedly underestimates biodiversity of haemosporidian parasites. The application of at least 3 PCR assays in parallel detected the majority, but still not all, of the lineages present in mixed infections. We determined preferences of different primers in detection of parasites belonging to different genera of haemosporidians during mixed infections.

  7. 3D genome tuner: compare multiple circular genomes in a 3D context.

    PubMed

    Wang, Qi; Liang, Qun; Zhang, Xiuqing

    2009-09-01

    Circular genomes, being the largest proportion of sequenced genomes, play an important role in genome analysis. However, the traditional 2D circular map provides only an overview and annotations of a genome and does not offer feature-based comparison. To remedy these shortcomings, we developed 3D Genome Tuner, a hybrid of circular map and comparative map tools. Its ability to show comparisons between multiple circular maps in a 3D space offers great benefits to the study of comparative genomics. The program is freely available (under an LGPL licence) at http://sourceforge.net/projects/dgenometuner.

  8. Initial sequencing and comparative analysis of the mouse genome

    SciTech Connect

    Waterston, Robert H.; Lindblad-Toh, Kerstin; Birney, Ewan; Rogers, Jane; Abril, Josep F.; Agarwal, Pankaj; Agarwala, Richa; Ainscough, Rachel; Alexandersson, Marina; An, Peter; Antonarakis, Stylianos E.; Attwood, John; Baertsch, Robert; Bailey, Jonathon; Barlow, Karen; Beck, Stephan; Berry, Eric; Birren, Bruce; Bloom, Toby; Bork, Peer; Botcherby, Marc; Bray, Nicolas; Brent, Michael R.; Brown, Daniel G.; Brown, Stephen D.; Bult, Carol; Burton, John; Butler, Jonathan; Campbell, Robert D.; Carninci, Piero; Cawley, Simon; Chiaromonte, Francesca; Chinwalla, Asif T.; Church, Deanna M.; Clamp, Michele; Clee, Christopher; Collins, Francis S.; Cook, Lisa L.; Copley, Richard R.; Coulson, Alan; Couronne, Olivier; Cuff, James; Curwen, Val; Cutts, Tim; Daly, Mark; David, Robert; Davies, Joy; Delehaunty, Kimberly D.; Deri, Justin; Dermitzakis, Emmanouil T.; Dewey, Colin; Dickens, Nicholas J.; Diekhans, Mark; Dodge, Sheila; Dubchak, Inna; Dunn, Diane M.; Eddy, Sean R.; Elnitski, Laura; Emes, Richard D.; Eswara, Pallavi; Eyras, Eduardo; Felsenfeld, Adam; Fewell, Ginger A.; Flicek, Paul; Foley, Karen; Frankel, Wayne N.; Fulton, Lucinda A.; Fulton, Robert S.; Furey, Terrence S.; Gage, Diane; Gibbs, Richard A.; Glusman, Gustavo; Gnerre, Sante; Goldman, Nick; Goodstadt, Leo; Grafham, Darren; Graves, Tina A.; Green, Eric D.; Gregory, Simon; Guigo, Roderic; Guyer, Mark; Hardison, Ross C.; Haussler, David; Hayashizaki, Yoshihide; Hillier, LaDeana W.; Hinrichs, Angela; Hlavina, Wratko; Holzer, Timothy; Hsu, Fan; Hua, Axin; Hubbard, Tim; Hunt, Adrienne; Jackson, Ian; Jaffe, David B.; Johnson, L. Steven; Jones, Matthew; Jones, Thomas A.; Joy, Ann; Kamal, Michael; Karlsson, Elinor K.; Karolchik, Donna; Kasprzyk, Arkadiusz; Kawai, Jun; Keibler, Evan; Kells, Cristyn; Kent, W. James; Kirby, Andrew; Kolbe, Diana L.; Korf, Ian; Kucherlapati, Raju S.; Kulbokas III, Edward J.; Kulp, David; Landers, Tom; Leger, J.P.; Leonard, Steven; Letunic, Ivica; Levine, Rosie; et al.

    2002-12-15

    The sequence of the mouse genome is a key informational tool for understanding the contents of the human genome and a key experimental tool for biomedical research. Here, we report the results of an international collaboration to produce a high-quality draft sequence of the mouse genome. We also present an initial comparative analysis of the mouse and human genomes, describing some of the insights that can be gleaned from the two sequences. We discuss topics including the analysis of the evolutionary forces shaping the size, structure and sequence of the genomes; the conservation of large-scale synteny across most of the genomes; the much lower extent of sequence orthology covering less than half of the genomes; the proportions of the genomes under selection; the number of protein-coding genes; the expansion of gene families related to reproduction and immunity; the evolution of proteins; and the identification of intraspecies polymorphism.

  9. Comparative Genomics of Cluster O Mycobacteriophages

    PubMed Central

    Cresawn, Steven G.; Pope, Welkin H.; Jacobs-Sera, Deborah; Bowman, Charles A.; Russell, Daniel A.; Dedrick, Rebekah M.; Adair, Tamarah; Anders, Kirk R.; Ball, Sarah; Bollivar, David; Breitenberger, Caroline; Burnett, Sandra H.; Butela, Kristen; Byrnes, Deanna; Carzo, Sarah; Cornely, Kathleen A.; Cross, Trevor; Daniels, Richard L.; Dunbar, David; Findley, Ann M.; Gissendanner, Chris R.; Golebiewska, Urszula P.; Hartzog, Grant A.; Hatherill, J. Robert; Hughes, Lee E.; Jalloh, Chernoh S.; De Los Santos, Carla; Ekanem, Kevin; Khambule, Sphindile L.; King, Rodney A.; King-Smith, Christina; Klyczek, Karen; Krukonis, Greg P.; Laing, Christian; Lapin, Jonathan S.; Lopez, A. Javier; Mkhwanazi, Sipho M.; Molloy, Sally D.; Moran, Deborah; Munsamy, Vanisha; Pacey, Eddie; Plymale, Ruth; Poxleitner, Marianne; Reyna, Nathan; Schildbach, Joel F.; Stukey, Joseph; Taylor, Sarah E.; Ware, Vassie C.; Wellmann, Amanda L.; Westholm, Daniel; Wodarski, Donna; Zajko, Michelle; Zikalala, Thabiso S.; Hendrix, Roger W.; Hatfull, Graham F.

    2015-01-01

    Mycobacteriophages – viruses of mycobacterial hosts – are genetically diverse but morphologically are all classified in the Caudovirales with double-stranded DNA and tails. We describe here a group of five closely related mycobacteriophages – Corndog, Catdawg, Dylan, Firecracker, and YungJamal – designated as Cluster O with long flexible tails but with unusual prolate capsids. Proteomic analysis of phage Corndog particles, Catdawg particles, and Corndog-infected cells confirms expression of half of the predicted gene products and indicates a non-canonical mechanism for translation of the Corndog tape measure protein. Bioinformatic analysis identifies 8–9 strongly predicted SigA promoters and all five Cluster O genomes contain more than 30 copies of a 17 bp repeat sequence with dyad symmetry located throughout the genomes. Comparison of the Cluster O phages provides insights into phage genome evolution including the processes of gene flux by horizontal genetic exchange. PMID:25742016

  10. Comparative genomics of Cluster O mycobacteriophages.

    PubMed

    Cresawn, Steven G; Pope, Welkin H; Jacobs-Sera, Deborah; Bowman, Charles A; Russell, Daniel A; Dedrick, Rebekah M; Adair, Tamarah; Anders, Kirk R; Ball, Sarah; Bollivar, David; Breitenberger, Caroline; Burnett, Sandra H; Butela, Kristen; Byrnes, Deanna; Carzo, Sarah; Cornely, Kathleen A; Cross, Trevor; Daniels, Richard L; Dunbar, David; Findley, Ann M; Gissendanner, Chris R; Golebiewska, Urszula P; Hartzog, Grant A; Hatherill, J Robert; Hughes, Lee E; Jalloh, Chernoh S; De Los Santos, Carla; Ekanem, Kevin; Khambule, Sphindile L; King, Rodney A; King-Smith, Christina; Klyczek, Karen; Krukonis, Greg P; Laing, Christian; Lapin, Jonathan S; Lopez, A Javier; Mkhwanazi, Sipho M; Molloy, Sally D; Moran, Deborah; Munsamy, Vanisha; Pacey, Eddie; Plymale, Ruth; Poxleitner, Marianne; Reyna, Nathan; Schildbach, Joel F; Stukey, Joseph; Taylor, Sarah E; Ware, Vassie C; Wellmann, Amanda L; Westholm, Daniel; Wodarski, Donna; Zajko, Michelle; Zikalala, Thabiso S; Hendrix, Roger W; Hatfull, Graham F

    2015-01-01

    Mycobacteriophages--viruses of mycobacterial hosts--are genetically diverse but morphologically are all classified in the Caudovirales with double-stranded DNA and tails. We describe here a group of five closely related mycobacteriophages--Corndog, Catdawg, Dylan, Firecracker, and YungJamal--designated as Cluster O with long flexible tails but with unusual prolate capsids. Proteomic analysis of phage Corndog particles, Catdawg particles, and Corndog-infected cells confirms expression of half of the predicted gene products and indicates a non-canonical mechanism for translation of the Corndog tape measure protein. Bioinformatic analysis identifies 8-9 strongly predicted SigA promoters and all five Cluster O genomes contain more than 30 copies of a 17 bp repeat sequence with dyad symmetry located throughout the genomes. Comparison of the Cluster O phages provides insights into phage genome evolution including the processes of gene flux by horizontal genetic exchange. PMID:25742016

  11. Neisseria Base: a comparative genomics database for Neisseria meningitidis.

    PubMed

    Katz, Lee S; Humphrey, Jay C; Conley, Andrew B; Nelakuditi, Viswateja; Kislyuk, Andrey O; Agrawal, Sonia; Jayaraman, Pushkala; Harcourt, Brian H; Olsen-Rasmussen, Melissa A; Frace, Michael; Sharma, Nitya V; Mayer, Leonard W; Jordan, I King

    2011-01-01

    Neisseria meningitidis is an important pathogen, causing life-threatening diseases including meningitis, septicemia and in some cases pneumonia. Genomic studies hold great promise for N. meningitidis research, but substantial database resources are needed to deal with the wealth of information that comes with completely sequenced and annotated genomes. To address this need, we developed Neisseria Base (NBase), a comparative genomics database and genome browser that houses and displays publicly available N. meningitidis genomes. In addition to existing N. meningitidis genome sequences, we sequenced and annotated 19 new genomes using 454 pyrosequencing and the CG-Pipeline genome analysis tool. In total, NBase hosts 27 complete N. meningitidis genome sequences along with their associated annotations. The NBase platform is designed to be scalable, via the underlying database schema and modular code architecture, such that it can readily incorporate new genomes and their associated annotations. The front page of NBase provides user access to these genomes through searching, browsing and downloading. The NBase search utility includes BLAST-based sequence similarity searches along with a variety of semantic search options. All genomes can be browsed using a modified version of the GBrowse platform, and a plethora of information on each gene can be viewed using a customized details page. NBase also has a whole-genome comparison tool that yields single-nucleotide polymorphism differences between two user-defined groups of genomes. Using the virulent ST-11 lineage as an example, we demonstrate how this comparative genomics utility can be used to identify novel genomic markers for molecular profiling of N. meningitidis. PMID:21930505
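    The idea of reporting single-nucleotide differences between two user-defined groups of genomes can be sketched as follows. This is not the NBase implementation, which the abstract does not describe; it is a minimal illustration that assumes the genomes are already aligned to a common coordinate system, and the function name `group_snps`, the genome labels and the sequences are all hypothetical. A column is reported only when the two groups share no allele at that position.

```python
def group_snps(aligned, group_a, group_b):
    """Return (position, alleles_a, alleles_b) for group-discriminating columns.

    `aligned` maps genome name -> aligned sequence (equal lengths).
    Positions are 1-based. Columns with gaps or ambiguous bases are skipped.
    """
    length = len(next(iter(aligned.values())))
    if any(len(seq) != length for seq in aligned.values()):
        raise ValueError("sequences must be aligned to the same length")
    hits = []
    for pos in range(length):
        alleles_a = {aligned[name][pos] for name in group_a}
        alleles_b = {aligned[name][pos] for name in group_b}
        # Only consider columns where every genome has an unambiguous base.
        if not (alleles_a | alleles_b) <= set("ACGT"):
            continue
        # Report the column when no allele is shared between the groups.
        if alleles_a.isdisjoint(alleles_b):
            hits.append(
                (pos + 1, "".join(sorted(alleles_a)), "".join(sorted(alleles_b)))
            )
    return hits


# Hypothetical aligned fragments from four genomes.
ALIGNED = {
    "g1": "ACGTACGT",
    "g2": "ACGTACGA",
    "g3": "ACCTACGT",
    "g4": "ACCTAC-T",
}
snps = group_snps(ALIGNED, ["g1", "g2"], ["g3", "g4"])
```

    Requiring disjoint allele sets is a deliberately strict choice: position 8 varies within the first group but is not reported, because it does not separate the two groups.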

  12. Comparative DNA Sequence Analysis of Wheat and Rice Genomes

    PubMed Central

    Sorrells, Mark E.; La Rota, Mauricio; Bermudez-Kandianis, Catherine E.; Greene, Robert A.; Kantety, Ramesh; Munkvold, Jesse D.; Miftahudin; Mahmoud, Ahmed; Ma, Xuefeng; Gustafson, Perry J.; Qi, Lili L.; Echalier, Benjamin; Gill, Bikram S.; Matthews, David E.; Lazo, Gerard R.; Chao, Shiaoman; Anderson, Olin D.; Edwards, Hugh; Linkiewicz, Anna M.; Dubcovsky, Jorge; Akhunov, Eduard D.; Dvorak, Jan; Zhang, Deshui; Nguyen, Henry T.; Peng, Junhua; Lapitan, Nora L.V.; Gonzalez-Hernandez, Jose L.; Anderson, James A.; Hossain, Khwaja; Kalavacharla, Venu; Kianian, Shahryar F.; Choi, Dong-Woog; Close, Timothy J.; Dilbirligi, Muharrem; Gill, Kulvinder S.; Steber, Camille; Walker-Simmons, Mary K.; McGuire, Patrick E.; Qualset, Calvin O.

    2003-01-01

    The use of DNA sequence-based comparative genomics for evolutionary studies and for transferring information from model species to crop species has revolutionized molecular genetics and crop improvement strategies. This study compared 4485 expressed sequence tags (ESTs) that were physically mapped in wheat chromosome bins to the public rice genome sequence data from 2251 ordered BAC/PAC clones using BLAST. A rice genome view of homologous wheat genome locations based on comparative sequence analysis revealed numerous chromosomal rearrangements that will significantly complicate the use of rice as a model for cross-species transfer of information in nonconserved regions. PMID:12902377

  13. Complete genome sequences of avian paramyxovirus serotype 2 (APMV-2) strains Bangor, England and Kenya: Evidence for the existence of subgroups within serotype 2

    PubMed Central

    Subbiah, Madhuri; Nayak, Sreeraj; Collins, Peter L.; Samal, Siba K.

    2010-01-01

    The complete consensus genome sequences of avian paramyxovirus (APMV) serotype 2 strains Bangor, England and Kenya were determined and compared with those of APMV-2 prototype strain Yucaipa and other paramyxoviruses. The genome lengths of APMV-2 strains Bangor, England and Kenya are 15024, 14904, and 14916 nucleotides (nt), respectively, compared to 14904 nt for Yucaipa. Each genome consists of six non-overlapping genes in the order of 3′N-P/V/W-M-F-HN-L5′, with a 55-nt leader at the 3′ end. The length of the trailer at the 5′ end of strain Bangor was 173 nt, compared to 154 nt for strains England, Kenya, and Yucaipa. In general, APMV-2 strains England and Kenya have 94.5 and 88% nt identity and 96 and 92% aggregate amino acid (aa) identity, respectively, with strain Yucaipa. In contrast, strain Bangor has a much lower percent nt identity (70.4, 69.4, and 70.8%) and aa identity (75.3, 76.2, and 76.3%) with strains Yucaipa, England, and Kenya, respectively. Furthermore, strain Bangor has a single basic aa residue (101TLPSAR↓F108) at the fusion protein cleavage site compared to the dibasic aa (93DKPASR↓F100) found in those of the other three strains. Reciprocal cross-hemagglutination inhibition (HI) and cross-neutralization assays using post-infection chicken sera indicated that strain Bangor is antigenically related to the other APMV-2 strains, but with a 4- to 8-fold difference in homologous versus heterologous HI titer. These differences in antigenic relatedness suggest that these four APMV-2 strains represent a single serotype with two antigenic subgroups, and this is strongly supported by the dimorphism in nt and aa sequence identity. PMID:20600395
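    Percent identity figures of the kind quoted in this record come from pairwise comparison of aligned sequences. The sketch below shows one common way to compute such a value, counting matching columns and leaving out columns where either sequence has a gap. The abstract does not say which alignment method or gap convention the authors used, so the gap handling, the function name `percent_identity` and the short example sequences are all assumptions for illustration.

```python
def percent_identity(seq_a, seq_b):
    """Percent of ungapped aligned columns in which the two residues match.

    Both inputs must already be aligned (equal length, '-' for gaps).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        # Skip columns where either sequence has a gap.
        if a == "-" or b == "-":
            continue
        compared += 1
        matches += a == b
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Hypothetical aligned fragments: 9 ungapped columns, 8 of them identical.
pid = percent_identity("ACGTACGTAC", "ACGTTCG-AC")
```

    Other conventions, such as dividing by the full alignment length or by the shorter sequence, give slightly different values, so identity percentages are comparable only when computed the same way.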

  14. Avian Retroviral Replication

    PubMed Central

    Justice, James; Beemon, Karen L.

    2013-01-01

    Avian retroviruses have undergone intense study since the beginning of the 20th century. They were originally identified as cancer-inducing filterable agents in chicken neoplasms. Since their discovery, the study of these simple retroviruses has contributed greatly to our understanding of retroviral replication and cancer. Avian retroviruses are continuing to evolve and have great economic importance in the poultry industry worldwide. The aim of this review is to provide a broad overview of the genome, pathology, and replication of avian retroviruses. Notable gaps in our current knowledge are highlighted, and areas where avian retroviruses differ from other retroviruses are also emphasized. PMID:24011707

  15. Comparative Genomic Analyses of Attenuated Strains of Mycoplasma gallisepticum

    PubMed Central

    Szczepanek, S. M.; Tulman, E. R.; Gorton, T. S.; Liao, X.; Lu, Z.; Zinski, J.; Aziz, F.; Frasca, S.; Kutish, G. F.; Geary, S. J.

    2010-01-01

    Mycoplasma gallisepticum is a significant respiratory and reproductive pathogen of domestic poultry. While the complete genomic sequence of the virulent, low-passage M. gallisepticum strain R (Rlow) has been reported, genomic determinants responsible for differences in virulence and host range remain to be completely identified. Here, we utilize genome sequencing and microarray-based comparative genomic data to identify these genomic determinants of virulence and to elucidate genomic variability among strains of M. gallisepticum. Analysis of the high-passage, attenuated derivative of Rlow, Rhigh, indicated that relatively few total genomic changes (64 loci) occurred, yet they are potentially responsible for the observed attenuation of this strain. In addition to previously characterized mutations in cytadherence-related proteins, changes included those in coding sequences of genes involved in sugar metabolism. Analyses of the genome of the M. gallisepticum vaccine strain F revealed numerous differences relative to strain R, including a highly divergent complement of vlhA surface lipoprotein genes, and at least 16 genes absent or significantly fragmented relative to strain R. Notably, an Rlow isogenic mutant in one of these genes (MGA_1107) caused significantly fewer severe tracheal lesions in the natural host compared to virulent M. gallisepticum Rlow. Comparative genomic hybridizations indicated few genetic loci commonly affected in F and vaccine strains ts-11 and 6/85, which would correlate with proteins affecting strain R virulence. Together, these data provide novel insights into inter- and intrastrain M. gallisepticum genomic variability and the genetic basis of M. gallisepticum virulence. PMID:20123709

  16. Comparative genomics of the lactic acid bacteria

    SciTech Connect

    Makarova, K.; Slesarev, A.; Wolf, Y.; Sorokin, A.; Mirkin, B.; Koonin, E.; Pavlov, A.; Pavlova, N.; Karamychev, V.; Polouchine, N.; Shakhova, V.; Grigoriev, I.; Lou, Y.; Rokhsar, D.; Lucas, S.; Huang, K.; Goodstein, D. M.; Hawkins, T.; Plengvidhya, V.; Welker, D.; Hughes, J.; Goh, Y.; Benson, A.; Baldwin, K.; Lee, J. -H.; Diaz-Muniz, I.; Dosti, B.; Smeianov, V; Wechter, W.; Barabote, R.; Lorca, G.; Altermann, E.; Barrangou, R.; Ganesan, B.; Xie, Y.; Rawsthorne, H.; Tamir, D.; Parker, C.; Breidt, F.; Broadbent, J.; Hutkins, R.; O'Sullivan, D.; Steele, J.; Unlu, G.; Saier, M.; Klaenhammer, T.; Richardson, P.; Kozyavkin, S.; Weimer, B.; Mills, D.

    2006-06-01

    Lactic acid-producing bacteria are associated with various plant and animal niches and play a key role in the production of fermented foods and beverages. We report nine genome sequences representing the phylogenetic and functional diversity of these bacteria. The small genomes of lactic acid bacteria encode a broad repertoire of transporters for efficient carbon and nitrogen acquisition from the nutritionally rich environments they inhabit and reflect a limited range of biosynthetic capabilities that indicate both prototrophic and auxotrophic strains. Phylogenetic analyses, comparison of gene content across the group, and reconstruction of ancestral gene sets indicate a combination of extensive gene loss and key gene acquisitions via horizontal gene transfer during the coevolution of lactic acid bacteria with their habitats.

  17. GenColors-based comparative genome databases for small eukaryotic genomes.

    PubMed

    Felder, Marius; Romualdi, Alessandro; Petzold, Andreas; Platzer, Matthias; Sühnel, Jürgen; Glöckner, Gernot

    2013-01-01

    Many sequence data repositories can give a quick and easily accessible overview of genomes and their annotations. Less widespread is the possibility to compare related genomes with each other in a common database environment. We have previously described the GenColors database system (http://gencolors.fli-leibniz.de) and its applications to a number of bacterial genomes such as Borrelia, Legionella, Leptospira and Treponema. This system has an emphasis on genome comparison. It combines data from related genomes and provides the user with an extensive set of visualization and analysis tools. Eukaryote genomes are normally larger than prokaryote genomes and thus pose additional challenges for such a system. We have, therefore, adapted GenColors to also handle larger datasets of small eukaryotic genomes and to display eukaryotic gene structures. Further recent developments include whole genome views, genome list options and, for bacterial genome browsers, the display of horizontal gene transfer predictions. Two new GenColors-based databases for two fungal species (http://fgb.fli-leibniz.de) and for four social amoebas (http://sacgb.fli-leibniz.de) were set up. Both new resources open up a single entry point for related genomes for the amoebozoa and fungal research communities and other interested users. Comparative genomics approaches are greatly facilitated by these resources.

  18. GenColors-based comparative genome databases for small eukaryotic genomes

    PubMed Central

    Felder, Marius; Romualdi, Alessandro; Petzold, Andreas; Platzer, Matthias; Sühnel, Jürgen; Glöckner, Gernot

    2013-01-01

    Many sequence data repositories can give a quick and easily accessible overview on genomes and their annotations. Less widespread is the possibility to compare related genomes with each other in a common database environment. We have previously described the GenColors database system (http://gencolors.fli-leibniz.de) and its applications to a number of bacterial genomes such as Borrelia, Legionella, Leptospira and Treponema. This system has an emphasis on genome comparison. It combines data from related genomes and provides the user with an extensive set of visualization and analysis tools. Eukaryote genomes are normally larger than prokaryote genomes and thus pose additional challenges for such a system. We have, therefore, adapted GenColors to also handle larger datasets of small eukaryotic genomes and to display eukaryotic gene structures. Further recent developments include whole genome views, genome list options and, for bacterial genome browsers, the display of horizontal gene transfer predictions. Two new GenColors-based databases for two fungal species (http://fgb.fli-leibniz.de) and for four social amoebas (http://sacgb.fli-leibniz.de) were set up. Both new resources open up a single entry point for related genomes for the amoebozoa and fungal research communities and other interested users. Comparative genomics approaches are greatly facilitated by these resources. PMID:23193285

  19. Comparative genetics and genomics of nematodes: genome structure, development, and lifestyle.

    PubMed

    Sommer, Ralf J; Streit, Adrian

    2011-01-01

    Nematodes are found in virtually all habitats on earth. Many of them are parasites of plants and animals, including humans. The free-living nematode, Caenorhabditis elegans, is one of the genetically best-studied model organisms and was the first metazoan whose genome was fully sequenced. In recent years, the draft genome sequences of another six nematodes representing four of the five major clades of nematodes were published. Compared to mammalian genomes, all these genomes are very small. Nevertheless, they contain almost the same number of genes as the human genome. Nematodes are therefore a very attractive system for comparative genetic and genomic studies, with C. elegans as an excellent baseline. Here, we review the efforts that were made to extend genetic analysis to nematodes other than C. elegans, and we compare the seven available nematode genomes. One of the most striking findings is the unexpectedly high incidence of gene acquisition through horizontal gene transfer (HGT). PMID:21721943

  20. The Avian Brain Nomenclature Forum: Terminology for a New Century in Comparative Neuroanatomy

    PubMed Central

    REINER, ANTON; PERKEL, DAVID J.; BRUCE, LAURA L.; BUTLER, ANN B.; CSILLAG, ANDRÁS; KUENZEL, WAYNE; MEDINA, LORETA; PAXINOS, GEORGE; SHIMIZU, TORU; STRIEDTER, GEORG; WILD, MARTIN; BALL, GREGORY F.; DURAND, SARAH; GÜNTÜRKÜN, ONUR; LEE, DIANE W.; MELLO, CLAUDIO V.; POWERS, ALICE; WHITE, STEPHANIE A.; HOUGH, GERALD; KUBIKOVA, LUBICA; SMULDERS, TOM V.; WADA, KAZUHIRO; DUGAS-FORD, JENNIFER; HUSBAND, SCOTT; YAMAMOTO, KEIKO; YU, JING; SIANG, CONNIE; JARVIS, ERICH D.

    2008-01-01

    Many of the assumptions of homology on which the standard nomenclature for the cell groups and fiber tracts of avian brains have been based are in error, and as a result that terminology promotes misunderstanding of the functional organization of avian brains and their evolutionary relationship to mammalian brains. Recognizing this problem, a number of avian brain researchers began an effort to revise the terminology, which culminated in the Avian Brain Nomenclature Forum, held at Duke University from July 18 to 20, 2002. In the new terminology approved at this Forum, the flawed conception that the telencephalon of birds consists nearly entirely of a hypertrophied basal ganglia has been purged from the telencephalic terminology, and the actual parts of the basal ganglia and its brainstem afferent cell groups have been given names reflecting their now evident homologies. The telencephalic regions that were erroneously named to reflect presumed homology to mammalian basal ganglia were renamed as parts of the pallium, using prefixes that retained most established abbreviations (to maintain continuity with the replaced nomenclature). Details of this meeting and its major conclusions are presented in this paper, and the details of the new terminology and its basis are presented in a longer companion paper. We urge all to use this new terminology, because we believe it will promote better communication among neuroscientists. PMID:19626136

  1. Ten years of bacterial genome sequencing: comparative-genomics-based discoveries.

    PubMed

    Binnewies, Tim T; Motro, Yair; Hallin, Peter F; Lund, Ole; Dunn, David; La, Tom; Hampson, David J; Bellgard, Matthew; Wassenaar, Trudy M; Ussery, David W

    2006-07-01

    It has been more than 10 years since the first bacterial genome sequence was published. Hundreds of bacterial genome sequences are now available for comparative genomics, and searching a given protein against more than a thousand genomes will soon be possible. The subject of this review will address a relatively straightforward question: "What have we learned from this vast amount of new genomic data?" Perhaps one of the most important lessons has been that genetic diversity, at the level of large-scale variation amongst even genomes of the same species, is far greater than was thought. The classical textbook view of evolution relying on the relatively slow accumulation of mutational events at the level of individual bases scattered throughout the genome has changed. One of the most obvious conclusions from examining the sequences from several hundred bacterial genomes is the enormous amount of diversity--even in different genomes from the same bacterial species. This diversity is generated by a variety of mechanisms, including mobile genetic elements and bacteriophages. An examination of the 20 Escherichia coli genomes sequenced so far dramatically illustrates this, with the genome size ranging from 4.6 to 5.5 Mbp; much of the variation appears to be of phage origin. This review also addresses mobile genetic elements, including pathogenicity islands and the structure of transposable elements. There are at least 20 different methods available to compare bacterial genomes. Metagenomics offers the chance to study genomic sequences found in ecosystems, including genomes of species that are difficult to culture. It has become clear that a genome sequence represents more than just a collection of gene sequences for an organism and that information concerning the environment and growth conditions for the organism are important for interpretation of the genomic data. 
    The newly proposed Minimal Information about a Genome Sequence standard has been developed to obtain this...

  2. Ten years of bacterial genome sequencing: comparative-genomics-based discoveries.

    PubMed

    Binnewies, Tim T; Motro, Yair; Hallin, Peter F; Lund, Ole; Dunn, David; La, Tom; Hampson, David J; Bellgard, Matthew; Wassenaar, Trudy M; Ussery, David W

    2006-07-01

    It has been more than 10 years since the first bacterial genome sequence was published. Hundreds of bacterial genome sequences are now available for comparative genomics, and searching a given protein against more than a thousand genomes will soon be possible. The subject of this review will address a relatively straightforward question: "What have we learned from this vast amount of new genomic data?" Perhaps one of the most important lessons has been that genetic diversity, at the level of large-scale variation amongst even genomes of the same species, is far greater than was thought. The classical textbook view of evolution relying on the relatively slow accumulation of mutational events at the level of individual bases scattered throughout the genome has changed. One of the most obvious conclusions from examining the sequences from several hundred bacterial genomes is the enormous amount of diversity--even in different genomes from the same bacterial species. This diversity is generated by a variety of mechanisms, including mobile genetic elements and bacteriophages. An examination of the 20 Escherichia coli genomes sequenced so far dramatically illustrates this, with the genome size ranging from 4.6 to 5.5 Mbp; much of the variation appears to be of phage origin. This review also addresses mobile genetic elements, including pathogenicity islands and the structure of transposable elements. There are at least 20 different methods available to compare bacterial genomes. Metagenomics offers the chance to study genomic sequences found in ecosystems, including genomes of species that are difficult to culture. It has become clear that a genome sequence represents more than just a collection of gene sequences for an organism and that information concerning the environment and growth conditions for the organism are important for interpretation of the genomic data. 
    The newly proposed Minimal Information about a Genome Sequence standard has been developed to obtain this...

  3. GenoSets: Visual Analytic Methods for Comparative Genomics

    PubMed Central

    Cain, Aurora A.; Kosara, Robert; Gibas, Cynthia J.

    2012-01-01

    Many important questions in biology are, fundamentally, comparative, and this extends to our analysis of a growing number of sequenced genomes. Existing genomic analysis tools are often organized around literal views of genomes as linear strings. Even when information is highly condensed, these views grow cumbersome as larger numbers of genomes are added. Data aggregation and summarization methods from the field of visual analytics can provide abstracted comparative views, suitable for sifting large multi-genome datasets to identify critical similarities and differences. We introduce a software system for visual analysis of comparative genomics data. The system automates the process of data integration, and provides the analysis platform to identify and explore features of interest within these large datasets. GenoSets borrows techniques from business intelligence and visual analytics to provide a rich interface of interactive visualizations supported by a multi-dimensional data warehouse. In GenoSets, visual analytic approaches are used to enable querying based on orthology, functional assignment, and taxonomic or user-defined groupings of genomes. GenoSets links this information together with coordinated, interactive visualizations for both detailed and high-level categorical analysis of summarized data. GenoSets has been designed to simplify the exploration of multiple genome datasets and to facilitate reasoning about genomic comparisons. Case examples are included showing the use of this system in the analysis of 12 Brucella genomes. GenoSets software and the case study dataset are freely available at http://genosets.uncc.edu. We demonstrate that the integration of genomic data using a coordinated multiple view approach can simplify the exploration of large comparative genomic data sets, and facilitate reasoning about comparisons and features of interest. PMID:23056299

  4. GenoSets: visual analytic methods for comparative genomics.

    PubMed

    Cain, Aurora A; Kosara, Robert; Gibas, Cynthia J

    2012-01-01

    Many important questions in biology are, fundamentally, comparative, and this extends to our analysis of a growing number of sequenced genomes. Existing genomic analysis tools are often organized around literal views of genomes as linear strings. Even when information is highly condensed, these views grow cumbersome as larger numbers of genomes are added. Data aggregation and summarization methods from the field of visual analytics can provide abstracted comparative views, suitable for sifting large multi-genome datasets to identify critical similarities and differences. We introduce a software system for visual analysis of comparative genomics data. The system automates the process of data integration, and provides the analysis platform to identify and explore features of interest within these large datasets. GenoSets borrows techniques from business intelligence and visual analytics to provide a rich interface of interactive visualizations supported by a multi-dimensional data warehouse. In GenoSets, visual analytic approaches are used to enable querying based on orthology, functional assignment, and taxonomic or user-defined groupings of genomes. GenoSets links this information together with coordinated, interactive visualizations for both detailed and high-level categorical analysis of summarized data. GenoSets has been designed to simplify the exploration of multiple genome datasets and to facilitate reasoning about genomic comparisons. Case examples are included showing the use of this system in the analysis of 12 Brucella genomes. GenoSets software and the case study dataset are freely available at http://genosets.uncc.edu. We demonstrate that the integration of genomic data using a coordinated multiple view approach can simplify the exploration of large comparative genomic data sets, and facilitate reasoning about comparisons and features of interest.

  5. Computational Methods for the Analysis of Array Comparative Genomic Hybridization

    PubMed Central

    Chari, Raj; Lockwood, William W.; Lam, Wan L.

    2006-01-01

    Array comparative genomic hybridization (array CGH) is a technique for assaying the copy number status of cancer genomes. The widespread use of this technology has led to a rapid accumulation of high-throughput data, which in turn has prompted the development of computational strategies for the analysis of array CGH data. Here we explain the principles behind array image processing, data visualization and genomic profile analysis, review currently available software packages, and raise considerations for future software development. PMID:17992253

  6. Comparative Genomics of an Emerging Amphibian Virus.

    PubMed

    Epstein, Brendan; Storfer, Andrew

    2015-11-03

    Ranaviruses, a genus of the Iridoviridae, are large double-stranded DNA viruses that infect cold-blooded vertebrates worldwide. Ranaviruses have caused severe epizootics in commercial frog and fish populations, and are currently classified as notifiable pathogens in international trade. Previous work shows that a ranavirus that infects tiger salamanders throughout Western North America (Ambystoma tigrinum virus, or ATV) is in high prevalence among salamanders in the fishing bait trade. Bait ATV strains have elevated virulence and are transported long distances by humans, providing widespread opportunities for pathogen pollution. We sequenced the genomes of 15 strains of ATV collected from tiger salamanders across western North America and performed phylogenetic and population genomic analyses and tests for recombination. We find that ATV forms a monophyletic clade within the rest of the Ranaviruses and that it likely emerged within the last several thousand years, before human activities influenced its spread. We also identify several genes under strong positive selection, some of which appear to be involved in viral virulence and/or host immune evasion. In addition, we provide support for the pathogen pollution hypothesis with evidence of recombination among ATV strains, and potential bait-endemic strain recombination.

  7. Comparative Genomics of an Emerging Amphibian Virus

    PubMed Central

    Epstein, Brendan; Storfer, Andrew

    2015-01-01

    Ranaviruses, a genus of the Iridoviridae, are large double-stranded DNA viruses that infect cold-blooded vertebrates worldwide. Ranaviruses have caused severe epizootics in commercial frog and fish populations, and are currently classified as notifiable pathogens in international trade. Previous work shows that a ranavirus that infects tiger salamanders throughout Western North America (Ambystoma tigrinum virus, or ATV) is in high prevalence among salamanders in the fishing bait trade. Bait ATV strains have elevated virulence and are transported long distances by humans, providing widespread opportunities for pathogen pollution. We sequenced the genomes of 15 strains of ATV collected from tiger salamanders across western North America and performed phylogenetic and population genomic analyses and tests for recombination. We find that ATV forms a monophyletic clade within the rest of the Ranaviruses and that it likely emerged within the last several thousand years, before human activities influenced its spread. We also identify several genes under strong positive selection, some of which appear to be involved in viral virulence and/or host immune evasion. In addition, we provide support for the pathogen pollution hypothesis with evidence of recombination among ATV strains, and potential bait-endemic strain recombination. PMID:26530419

  8. Comparative Genomics of an Emerging Amphibian Virus.

    PubMed

    Epstein, Brendan; Storfer, Andrew

    2016-01-01

    Ranaviruses, a genus of the Iridoviridae, are large double-stranded DNA viruses that infect cold-blooded vertebrates worldwide. Ranaviruses have caused severe epizootics in commercial frog and fish populations, and are currently classified as notifiable pathogens in international trade. Previous work shows that a ranavirus that infects tiger salamanders throughout Western North America (Ambystoma tigrinum virus, or ATV) is in high prevalence among salamanders in the fishing bait trade. Bait ATV strains have elevated virulence and are transported long distances by humans, providing widespread opportunities for pathogen pollution. We sequenced the genomes of 15 strains of ATV collected from tiger salamanders across western North America and performed phylogenetic and population genomic analyses and tests for recombination. We find that ATV forms a monophyletic clade within the rest of the Ranaviruses and that it likely emerged within the last several thousand years, before human activities influenced its spread. We also identify several genes under strong positive selection, some of which appear to be involved in viral virulence and/or host immune evasion. In addition, we provide support for the pathogen pollution hypothesis with evidence of recombination among ATV strains, and potential bait-endemic strain recombination. PMID:26530419

  9. Transgenic Quail as a Model for Research in the Avian Nervous System – A Comparative Study of the Auditory Brainstem

    PubMed Central

    Seidl, Armin H.; Sanchez, Jason Tait; Schecterson, Leslayann; Tabor, Kathryn M.; Wang, Yuan; Kashima, Daniel T.; Poynter, Greg; Huss, David; Fraser, Scott E.; Lansford, Rusty; Rubel, Edwin W

    2012-01-01

    Research performed on transgenic animals has led to numerous advances in biological research. However, using traditional retroviral methods to generate transgenic avian research models has proven problematic. As a result, experiments aimed at genetic manipulations on birds remained difficult for this popular research tool. Recently, lentiviral methods have enabled production of transgenic birds, including a transgenic Japanese quail (Coturnix coturnix japonica) line showing neuronal-specificity and stable expression of eGFP across generations (termed here as GFP quail). To test whether the GFP quail may serve as a viable alternative to the popular chicken model system, with the additional benefit of gene manipulation, we compared the development, organization, structure and function of a specific neuronal circuit in chicken (Gallus gallus domesticus) to that of the GFP quail. This study focuses on a well-defined avian brain region, the principal nuclei of the sound localization circuit in the auditory brainstem, nucleus magnocellularis (NM) and nucleus laminaris (NL). Our results demonstrate that structural and functional properties of NM and NL neurons in the GFP quail, as well as their dynamic properties in response to changes in the environment, are nearly identical to those in chickens. These similarities demonstrate that the GFP quail, as well as other transgenic quail lines, can serve as an attractive avian model system, with the advantage of being able to build on the wealth of information already available from the chicken. PMID:22806400

  10. Complete Genome Sequence and Comparative Genomics of a Novel Myxobacterium Myxococcus hansupus

    PubMed Central

    Sharma, Gaurav; Narwani, Tarun; Subramanian, Srikrishna

    2016-01-01

    Myxobacteria, a group of Gram-negative aerobes, belong to the class δ-proteobacteria and order Myxococcales. Unlike anaerobic δ-proteobacteria, they exhibit several unusual physiogenomic properties like gliding motility, desiccation-resistant myxospores and large genomes with high coding density. Here we report a 9.5 Mbp complete genome of Myxococcus hansupus that encodes 7,753 proteins. Phylogenomic and genome-genome distance based analysis suggest that Myxococcus hansupus is a novel member of the genus Myxococcus. Comparative genome analysis with other members of the genus Myxococcus was performed to explore their genome diversity. The variation in number of unique proteins observed across different species is suggestive of diversity at the genus level while the overrepresentation of several Pfam families indicates the extent and mode of genome expansion as compared to non-Myxococcales δ-proteobacteria. PMID:26900859

  11. Gramene 2016: comparative plant genomics and pathway resources

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Gramene (http://www.gramene.org) is an online resource for comparative functional genomics in crops and model plant species. Its two main frameworks are genomes (collaboration with Ensembl Plants) and pathways (The Plant Reactome and archival BioCyc databases). Since our last NAR update, the data...

  12. Cyberinfrastructure for (Comparative) Plant Genome Research Through PlantGDB

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Accurate and comprehensive gene structure annotation in emerging and assembled genomes is fundamental to comparative, functional, and translational genomics. We plan to build the cyberinfrastructure necessary for defining and accessing the plant gene space. Our Plant Genetic Data Base (PlantGDB) r...

  13. Genomic organization, transcriptomic analysis, and functional characterization of avian α- and β-keratins in diverse feather forms.

    PubMed

    Ng, Chen Siang; Wu, Ping; Fan, Wen-Lang; Yan, Jie; Chen, Chih-Kuan; Lai, Yu-Ting; Wu, Siao-Man; Mao, Chi-Tang; Chen, Jun-Jie; Lu, Mei-Yeh Jade; Ho, Meng-Ru; Widelitz, Randall B; Chen, Chih-Feng; Chuong, Cheng-Ming; Li, Wen-Hsiung

    2014-08-24

    Feathers are hallmark avian integument appendages, although they were also present on theropods. They are composed of flexible corneous materials made of α- and β-keratins, but their genomic organization and their functional roles in feathers have not been well studied. First, we made an exhaustive search of α- and β-keratin genes in the new chicken genome assembly (Galgal4). Then, using transcriptomic analysis, we studied α- and β-keratin gene expression patterns in five types of feather epidermis. The expression patterns of β-keratin genes were different in different feather types, whereas those of α-keratin genes were less variable. In addition, we obtained extensive α- and β-keratin mRNA in situ hybridization data, showing that α-keratins and β-keratins are preferentially expressed in different parts of the feather components. Together, our data suggest that feather morphological and structural diversity can largely be attributed to differential combinations of α- and β-keratin genes in different intrafeather regions and/or feather types from different body parts. The expression profiles provide new insights into the evolutionary origin and diversification of feathers. Finally, functional analysis using mutant chicken keratin forms based on those found in the human α-keratin mutation database led to abnormal phenotypes. This demonstrates that the chicken can be a convenient model for studying the molecular biology of human keratin-based diseases.

  14. Microbial NAD metabolism: lessons from comparative genomics.

    PubMed

    Gazzaniga, Francesca; Stebbins, Rebecca; Chang, Sheila Z; McPeek, Mark A; Brenner, Charles

    2009-09-01

    NAD is a coenzyme for redox reactions and a substrate of NAD-consuming enzymes, including ADP-ribose transferases, Sir2-related protein lysine deacetylases, and bacterial DNA ligases. Microorganisms that synthesize NAD from as few as one to as many as five of the six identified biosynthetic precursors have been identified. De novo NAD synthesis from aspartate or tryptophan is neither universal nor strictly aerobic. Salvage NAD synthesis from nicotinamide, nicotinic acid, nicotinamide riboside, and nicotinic acid riboside occurs via modules of different genes. Nicotinamide salvage genes nadV and pncA, found in distinct bacteria, appear to have spread throughout the tree of life via horizontal gene transfer. Biochemical, genetic, and genomic analyses have advanced to the point at which the precursors and pathways utilized by a microorganism can be predicted. Challenges remain in dissecting regulation of pathways. PMID:19721089

  15. Computational tools for Brassica-Arabidopsis comparative genomics.

    PubMed

    Beckett, Paul; Bancroft, Ian; Trick, Martin

    2005-01-01

    Recent advances, such as the availability of extensive genome survey sequence (GSS) data and draft physical maps, are radically transforming the means by which we can dissect Brassica genome structure and systematically relate it to the Arabidopsis model. Hitherto, our view of the co-linearities between these closely related genomes had been largely inferred from comparative RFLP data, necessitating substantial interpolation and expert interpretation. Sequencing of the Brassica rapa genome by the Multinational Brassica Genome Project will, however, enable an entirely computational approach to this problem. Meanwhile we have been developing databases and bioinformatics tools to support our work in Brassica comparative genomics, including a recently completed draft physical map of B. rapa integrated with anchor probes derived from the Arabidopsis genome sequence. We are also exploring new ways to display the emerging Brassica-Arabidopsis sequence homology data. We have mapped all publicly available Brassica sequences in silico to the Arabidopsis TIGR v5 genome sequence and published this in the ATIDB database that uses Generic Genome Browser (GBrowse). This in silico approach potentially identifies all paralogous sequences and so we colour-code the significance of the mappings and offer an integrated, real-time multiple alignment tool to partition them into paralogous groups. The MySQL database driving GBrowse can also be directly interrogated, using the powerful API offered by the Perl Bio::DB::GFF methods, facilitating a wide range of data-mining possibilities.

  16. Phytozome: a comparative platform for green plant genomics.

    PubMed

    Goodstein, David M; Shu, Shengqiang; Howson, Russell; Neupane, Rochak; Hayes, Richard D; Fazo, Joni; Mitros, Therese; Dirks, William; Hellsten, Uffe; Putnam, Nicholas; Rokhsar, Daniel S

    2012-01-01

    The number of sequenced plant genomes and associated genomic resources is growing rapidly with the advent of both an increased focus on plant genomics from funding agencies, and the application of inexpensive next generation sequencing. To interact with this increasing body of data, we have developed Phytozome (http://www.phytozome.net), a comparative hub for plant genome and gene family data and analysis. Phytozome provides a view of the evolutionary history of every plant gene at the level of sequence, gene structure, gene family and genome organization, while at the same time providing access to the sequences and functional annotations of a growing number (currently 25) of complete plant genomes, including all the land plants and selected algae sequenced at the Joint Genome Institute, as well as selected species sequenced elsewhere. Through a comprehensive plant genome database and web portal, these data and analyses are available to the broader plant science research community, providing powerful comparative genomics tools that help to link model systems with other plants of economic and ecological importance. PMID:22110026

  17. Comparative Genomics and Extensive Recombinations in Phage Communities

    NASA Astrophysics Data System (ADS)

    Poisson, Guylaine; Belcaid, Mahdi; Bergeron, Anne

    Comparing the genomes of two closely related viruses often produces mosaics where nearly identical sequences alternate with sequences that are unique to each genome. When several closely related genomes are compared, the unique sequences are likely to be shared with third genomes, leading to virus mosaic communities. Here we present comparative analysis of sets of Staphylococcus aureus phages that share large identical sequences with up to three other genomes, and with different partners along their genomes. We introduce mosaic graphs to represent these complex recombination events, and use them to illustrate the breadth and depth of sequence sharing: some genomes are almost completely made up of shared sequences, while genomes that share very large identical sequences can adopt alternate functional modules. Mosaic graphs also allow us to identify breakpoints that could eventually be used for the construction of recombination networks. These findings have several implications for phage metagenomics assembly, for the horizontal gene transfer paradigm, and more generally for the understanding of the composition and evolutionary dynamics of virus communities.

  18. Reference-Free Comparative Genomics of 174 Chloroplasts

    PubMed Central

    Kua, Chai-Shian; Ruan, Jue; Harting, John; Ye, Cheng-Xi; Helmus, Matthew R.; Yu, Jun; Cannon, Charles H.

    2012-01-01

    Direct analysis of unassembled genomic data could greatly increase the power of short read DNA sequencing technologies and allow comparative genomics of organisms without a completed reference available. Here, we compare 174 chloroplasts by analyzing the taxonomic distribution of short kmers across genomes [1]. We then assemble de novo contigs centered on informative variation. The localized de novo contigs can be separated into two major classes: tip = unique to a single genome and group = shared by a subset of genomes. Prior to assembly, we found that ~18% of the chloroplast was duplicated in the inverted repeat (IR) region across a four-fold difference in genome sizes, from a highly reduced parasitic orchid [2] to a massive algal chloroplast [3], including gnetophytes [4] and cycads [5]. The conservation of this ratio between single copy and duplicated sequence was basal among green plants, independent of photosynthesis and mechanism of genome size change, and different in gymnosperms and lower plants. Major lineages in the angiosperm clade differed in the pattern of shared kmers and de novo contigs. For example, parasitic plants demonstrated an expected accelerated overall rate of evolution, while the hemi-parasitic genomes contained a great deal more novel sequence than holo-parasitic plants, suggesting different mechanisms at different stages of genomic contraction. Additionally, the legumes are diverging more quickly and in different ways than other major families. Small duplicated fragments of the rrn23 genes were deeply conserved among seed plants, including among several species without the IR regions, indicating a crucial functional role of this duplication. Localized de novo assembly of informative kmers greatly reduces the complexity of large comparative analyses by confining the analysis to a small partition of data and genomes relevant to the specific question, allowing direct analysis of next-gen sequence data from previously unstudied...

  19. Reference-free comparative genomics of 174 chloroplasts.

    PubMed

    Kua, Chai-Shian; Ruan, Jue; Harting, John; Ye, Cheng-Xi; Helmus, Matthew R; Yu, Jun; Cannon, Charles H

    2012-01-01

    Direct analysis of unassembled genomic data could greatly increase the power of short read DNA sequencing technologies and allow comparative genomics of organisms without a completed reference available. Here, we compare 174 chloroplasts by analyzing the taxonomic distribution of short kmers across genomes [1]. We then assemble de novo contigs centered on informative variation. The localized de novo contigs can be separated into two major classes: tip = unique to a single genome and group = shared by a subset of genomes. Prior to assembly, we found that ~18% of the chloroplast was duplicated in the inverted repeat (IR) region across a four-fold difference in genome sizes, from a highly reduced parasitic orchid [2] to a massive algal chloroplast [3], including gnetophytes [4] and cycads [5]. The conservation of this ratio between single copy and duplicated sequence was basal among green plants, independent of photosynthesis and mechanism of genome size change, and different in gymnosperms and lower plants. Major lineages in the angiosperm clade differed in the pattern of shared kmers and de novo contigs. For example, parasitic plants demonstrated an expected accelerated overall rate of evolution, while the hemi-parasitic genomes contained a great deal more novel sequence than holo-parasitic plants, suggesting different mechanisms at different stages of genomic contraction. Additionally, the legumes are diverging more quickly and in different ways than other major families. Small duplicated fragments of the rrn23 genes were deeply conserved among seed plants, including among several species without the IR regions, indicating a crucial functional role of this duplication. Localized de novo assembly of informative kmers greatly reduces the complexity of large comparative analyses by confining the analysis to a small partition of data and genomes relevant to the specific question, allowing direct analysis of next-gen sequence data from previously unstudied genomes and
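
    The tip/group distinction described in this abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the function names, the toy sequences, and the extra "all" label for kmers present in every genome are assumptions made for illustration, and reverse complements are ignored for brevity.

```python
from collections import defaultdict


def kmers(seq, k):
    """Yield every overlapping k-mer of length k in seq."""
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


def classify_kmers(genomes, k):
    """Label each k-mer by how many genomes contain it.

    'tip'   = found in exactly one genome
    'group' = shared by a proper subset of genomes
    'all'   = found in every genome (label assumed here, not from the abstract)
    """
    owners = defaultdict(set)
    for name, seq in genomes.items():
        for kmer in kmers(seq, k):
            owners[kmer].add(name)

    classes = {}
    for kmer, names in owners.items():
        if len(names) == 1:
            classes[kmer] = "tip"
        elif len(names) < len(genomes):
            classes[kmer] = "group"
        else:
            classes[kmer] = "all"
    return classes


# Hypothetical toy sequences, far shorter than real chloroplast genomes.
toy_genomes = {"A": "ACGTAC", "B": "ACGTTC", "C": "GGGTAC"}
toy_classes = classify_kmers(toy_genomes, k=3)
```

    In this toy input, `GTT` occurs only in genome B and is therefore a tip kmer, while `ACG` occurs in A and B but not C and is a group kmer.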

  20. Sinbase: an integrated database to study genomics, genetics and comparative genomics in Sesamum indicum.

    PubMed

    Wang, Linhai; Yu, Jingyin; Li, Donghua; Zhang, Xiurong

    2015-01-01

    Sesame (Sesamum indicum L.) is an ancient and important oilseed crop grown widely in tropical and subtropical areas. It belongs to the gigantic order Lamiales, which includes many well-known or economically important species, such as olive (Olea europaea), leonurus (Leonurus japonicus) and lavender (Lavandula spica), many of which have important pharmacological properties. Despite their importance, genetic and genomic analyses on these species have been insufficient due to a lack of reference genome information. The now available S. indicum genome will provide an unprecedented opportunity for studying both S. indicum genetic traits and comparative genomics. To deliver S. indicum genomic information to the worldwide research community, we designed Sinbase, a web-based database with comprehensive sesame genomic, genetic and comparative genomic information. Sinbase includes sequences of assembled sesame pseudomolecular chromosomes, protein-coding genes (27,148), transposable elements (372,167) and non-coding RNAs (1,748). In particular, Sinbase provides unique and valuable information on colinear regions with various plant genomes, including Arabidopsis thaliana, Glycine max, Vitis vinifera and Solanum lycopersicum. Sinbase also provides a useful search function and data mining tools, including a keyword search and local BLAST service. Sinbase will be updated regularly with new features, improvements to genome annotation and new genomic sequences, and is freely accessible at http://ocri-genomics.org/Sinbase/. PMID:25480115

  1. Sinbase: an integrated database to study genomics, genetics and comparative genomics in Sesamum indicum.

    PubMed

    Wang, Linhai; Yu, Jingyin; Li, Donghua; Zhang, Xiurong

    2015-01-01

    Sesame (Sesamum indicum L.) is an ancient and important oilseed crop grown widely in tropical and subtropical areas. It belongs to the gigantic order Lamiales, which includes many well-known or economically important species, such as olive (Olea europaea), leonurus (Leonurus japonicus) and lavender (Lavandula spica), many of which have important pharmacological properties. Despite their importance, genetic and genomic analyses on these species have been insufficient due to a lack of reference genome information. The now available S. indicum genome will provide an unprecedented opportunity for studying both S. indicum genetic traits and comparative genomics. To deliver S. indicum genomic information to the worldwide research community, we designed Sinbase, a web-based database with comprehensive sesame genomic, genetic and comparative genomic information. Sinbase includes sequences of assembled sesame pseudomolecular chromosomes, protein-coding genes (27,148), transposable elements (372,167) and non-coding RNAs (1,748). In particular, Sinbase provides unique and valuable information on colinear regions with various plant genomes, including Arabidopsis thaliana, Glycine max, Vitis vinifera and Solanum lycopersicum. Sinbase also provides a useful search function and data mining tools, including a keyword search and local BLAST service. Sinbase will be updated regularly with new features, improvements to genome annotation and new genomic sequences, and is freely accessible at http://ocri-genomics.org/Sinbase/.

  2. Complete genome sequence of an avian-origin H3N2 canine influenza virus isolated from dogs in South Korea.

    PubMed

    Park, Seong-Jun; Moon, Hyoung-Joon; Kang, Bo-Kyu; Hong, Minki; Na, Woonseong; Kim, Jeong-Ki; Poo, Haryoung; Park, Bong-Kyun; Song, Dae-Sub

    2012-09-01

    An avian-origin Korean H3N2 canine influenza virus (CIV) strain, designated A/canine/Korea/01/2007 (H3N2), was isolated from nasal swabs of pet dogs exhibiting severe respiratory syndrome in 2007. In the present study, we report the first complete genome sequence containing 3' and 5' noncoding regions (NCRs) of H3N2 CIV, which will provide important insights into the molecular basis of pathogenesis, transmission, and evolution of CIV.

  3. Enhanced annotations and features for comparing thousands of Pseudomonas genomes in the Pseudomonas genome database.

    PubMed

    Winsor, Geoffrey L; Griffiths, Emma J; Lo, Raymond; Dhillon, Bhavjinder K; Shay, Julie A; Brinkman, Fiona S L

    2016-01-01

    The Pseudomonas Genome Database (http://www.pseudomonas.com) is well known for the application of community-based annotation approaches for producing a high-quality Pseudomonas aeruginosa PAO1 genome annotation, and facilitating whole-genome comparative analyses with other Pseudomonas strains. To aid analysis of potentially thousands of complete and draft genome assemblies, this database and analysis platform was upgraded to integrate curated genome annotations and isolate metadata with enhanced tools for larger scale comparative analysis and visualization. Manually curated gene annotations are supplemented with improved computational analyses that help identify putative drug targets and vaccine candidates or assist with evolutionary studies by identifying orthologs, pathogen-associated genes and genomic islands. The database schema has been updated to integrate isolate metadata that will facilitate more powerful analysis of genomes across datasets in the future. We continue to place an emphasis on providing high-quality updates to gene annotations through regular review of the scientific literature and using community-based approaches including a major new Pseudomonas community initiative for the assignment of high-quality gene ontology terms to genes. As we further expand from thousands of genomes, we plan to provide enhancements that will aid data visualization and analysis arising from whole-genome comparative studies including more pan-genome and population-based approaches. PMID:26578582

  4. Enhanced annotations and features for comparing thousands of Pseudomonas genomes in the Pseudomonas genome database

    PubMed Central

    Winsor, Geoffrey L.; Griffiths, Emma J.; Lo, Raymond; Dhillon, Bhavjinder K.; Shay, Julie A.; Brinkman, Fiona S. L.

    2016-01-01

    The Pseudomonas Genome Database (http://www.pseudomonas.com) is well known for the application of community-based annotation approaches for producing a high-quality Pseudomonas aeruginosa PAO1 genome annotation, and facilitating whole-genome comparative analyses with other Pseudomonas strains. To aid analysis of potentially thousands of complete and draft genome assemblies, this database and analysis platform was upgraded to integrate curated genome annotations and isolate metadata with enhanced tools for larger scale comparative analysis and visualization. Manually curated gene annotations are supplemented with improved computational analyses that help identify putative drug targets and vaccine candidates or assist with evolutionary studies by identifying orthologs, pathogen-associated genes and genomic islands. The database schema has been updated to integrate isolate metadata that will facilitate more powerful analysis of genomes across datasets in the future. We continue to place an emphasis on providing high-quality updates to gene annotations through regular review of the scientific literature and using community-based approaches including a major new Pseudomonas community initiative for the assignment of high-quality gene ontology terms to genes. As we further expand from thousands of genomes, we plan to provide enhancements that will aid data visualization and analysis arising from whole-genome comparative studies including more pan-genome and population-based approaches. PMID:26578582

  5. Comparative Pharmacokinetics and Allometric Scaling of Carboplatin in Different Avian Species

    PubMed Central

    De Baere, Siegrid; Hellebuyck, Tom; Van de Maele, Isabel; Rouffaer, Lieze; Stemkens, Hendrickus J. J.; De Backer, Patrick; Martel, An; Croubels, Siska

    2015-01-01

    The use of chemotherapeutics as a possible treatment strategy in avian oncology has been steadily increasing over recent years. Despite this, literature reports regarding dosing strategies and pharmacokinetic behaviour of chemotherapeutics in avian species are lacking. The aim of the present study was to investigate the pharmacokinetics of carboplatin in a representative species of each of the orders Galliformes, Anseriformes, Columbiformes and Psittaciformes. Eight chickens, ducks and pigeons and twenty-eight parakeets were administered carboplatin intravenously (5 mg/kg body weight). A specific and sensitive liquid chromatography-tandem mass spectrometry method was developed and validated for quantification of the free carboplatin in plasma of the four bird species (limit of quantification: 20 ng/mL for chicken and duck, 50 ng/mL for pigeon and 100 ng/mL for parakeets). Non-compartmental pharmacokinetic analysis and allometric scaling demonstrated a significant correlation (R² = 0.9769) between body weight (BW) and elimination half-life (T1/2el). T1/2el ranged from 0.41 h in parakeets (BW: 61 ± 8 g) to 1.16 h in chickens (BW: 1909 ± 619 g). T1/2el is a good parameter for dose optimization of carboplatin in other avian species, since the previously reported T1/2el in cockatoos (average BW: 769 ± 68 g) of 1.00 h also corresponds to the results obtained in the present study. PMID:26222777

  6. Comparative Pharmacokinetics and Allometric Scaling of Carboplatin in Different Avian Species.

    PubMed

    Antonissen, Gunther; Devreese, Mathias; De Baere, Siegrid; Hellebuyck, Tom; Van de Maele, Isabel; Rouffaer, Lieze; Stemkens, Hendrickus J J; De Backer, Patrick; Martel, An; Croubels, Siska

    2015-01-01

    The use of chemotherapeutics as a possible treatment strategy in avian oncology has been steadily increasing over recent years. Despite this, literature reports regarding dosing strategies and pharmacokinetic behaviour of chemotherapeutics in avian species are lacking. The aim of the present study was to investigate the pharmacokinetics of carboplatin in a representative species of each of the orders Galliformes, Anseriformes, Columbiformes and Psittaciformes. Eight chickens, ducks and pigeons and twenty-eight parakeets were administered carboplatin intravenously (5 mg/kg body weight). A specific and sensitive liquid chromatography-tandem mass spectrometry method was developed and validated for quantification of the free carboplatin in plasma of the four bird species (limit of quantification: 20 ng/mL for chicken and duck, 50 ng/mL for pigeon and 100 ng/mL for parakeets). Non-compartmental pharmacokinetic analysis and allometric scaling demonstrated a significant correlation (R² = 0.9769) between body weight (BW) and elimination half-life (T1/2el). T1/2el ranged from 0.41 h in parakeets (BW: 61 ± 8 g) to 1.16 h in chickens (BW: 1909 ± 619 g). T1/2el is a good parameter for dose optimization of carboplatin in other avian species, since the previously reported T1/2el in cockatoos (average BW: 769 ± 68 g) of 1.00 h also corresponds to the results obtained in the present study.
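
    Allometric scaling of this kind is commonly expressed as a power law, T1/2el = a · BW^b, fitted as a straight line on log-log axes. The sketch below assumes that form and uses only the two mean values quoted in the abstract (parakeet and chicken); the duck and pigeon values are not given here, so this is not the authors' four-species fit, and the function names are hypothetical.

```python
import math


def fit_power_law(weights_g, half_lives_h):
    """Least-squares fit of log(T) = log(a) + b * log(BW); returns (a, b)."""
    xs = [math.log(w) for w in weights_g]
    ys = [math.log(t) for t in half_lives_h]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    b = sxy / sxx                        # allometric exponent
    a = math.exp(mean_y - b * mean_x)    # allometric coefficient
    return a, b


def predict_half_life(a, b, weight_g):
    """Predicted elimination half-life (h) for a given body weight (g)."""
    return a * weight_g ** b


# Mean values from the abstract: parakeet (61 g, 0.41 h), chicken (1909 g, 1.16 h).
a, b = fit_power_law([61, 1909], [0.41, 1.16])

# Extrapolate to the cockatoo body weight quoted in the abstract (769 g).
cockatoo_estimate = predict_half_life(a, b, 769)
```

    With only two points the line passes through both exactly, so the result is a rough illustration; the estimate for a 769 g bird can be compared with the 1.00 h reported for cockatoos.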

  7. Whole Genome Amplification of Labeled Viable Single Cells Suited for Array-Comparative Genomic Hybridization.

    PubMed

    Kroneis, Thomas; El-Heliebi, Amin

    2015-01-01

    Understanding details of a complex biological system makes it necessary to dismantle it down to its components. Immunostaining techniques allow identification of several distinct cell types thereby giving an inside view of intercellular heterogeneity. Often staining reveals that the most remarkable cells are the rarest. To further characterize the target cells on a molecular level, single cell techniques are necessary. Here, we describe the immunostaining, micromanipulation, and whole genome amplification of single cells for the purpose of genomic characterization. First, we exemplify the preparation of cell suspensions from cultured cells as well as the isolation of peripheral mononucleated cells from blood. The target cell population is then subjected to immunostaining. After cytocentrifugation target cells are isolated by micromanipulation and forwarded to whole genome amplification. For whole genome amplification, we use GenomePlex® technology allowing downstream genomic analysis such as array-comparative genomic hybridization.

  8. The Korea brassica genome project: a glimpse of the brassica genome based on comparative genome analysis with Arabidopsis.

    PubMed

    Yang, Tae-Jin; Kim, Jung-Sun; Lim, Ki-Byung; Kwon, Soo-Jin; Kim, Jin-A; Jin, Mina; Park, Jee Young; Lim, Myung-Ho; Kim, Ho-Il; Kim, Seog Hyung; Lim, Yong Pyo; Park, Beom-Seok

    2005-01-01

    A complete genome sequence provides unlimited information in the sequenced organism as well as in related taxa. According to the guidance of the Multinational Brassica Genome Project (MBGP), the Korea Brassica Genome Project (KBGP) is sequencing chromosome 1 (cytogenetically oriented chromosome #1) of Brassica rapa. We have selected 48 seed BACs on chromosome 1 using EST genetic markers and FISH analyses. Among them, 30 BAC clones have been sequenced and 18 are on the way. Comparative genome analyses of the EST sequences and sequenced BAC clones from Brassica chromosome 1 revealed their homeologous partner regions on the Arabidopsis genome and a syntenic comparative map between Brassica chromosome 1 and Arabidopsis chromosomes. In silico chromosome walking and clone validation have been successfully applied to extending sequence contigs based on the comparative map and BAC end sequences. In addition, we have defined the (peri)centromeric heterochromatin blocks with centromeric tandem repeats, rDNA and centromeric retrotransposons. In-depth sequence analyses of five homeologous BAC clones and an Arabidopsis chromosomal region reveal overall co-linearity, with 82% sequence similarity. The data indicate that the Brassica genome has undergone triplication and subsequent gene losses after the divergence of Arabidopsis and Brassica. Based on in-depth comparative genome analyses, we propose a comparative genomics approach for conquering the Brassica genome. In 2005 we intend to construct an integrated physical map, including sequence information from 500 BAC clones and integration of fingerprinting data and end sequence data of more than 100,000 BAC clones.

  9. Comparative pathogenesis of an avian H5N2 and a swine H1N1 influenza virus in pigs.

    PubMed

    De Vleeschauwer, Annebel; Atanasova, Kalina; Van Borm, Steven; van den Berg, Thierry; Rasmussen, Thomas Bruun; Uttenthal, Ase; Van Reeth, Kristien

    2009-01-01

    Pigs are considered intermediate hosts for the transmission of avian influenza viruses (AIVs) to humans but the basic organ pathogenesis of AIVs in pigs has been barely studied. We have used 42 four-week-old influenza naive pigs and two different inoculation routes (intranasal and intratracheal) to compare the pathogenesis of a low pathogenic (LP) H5N2 AIV with that of an H1N1 swine influenza virus. The respiratory tract and selected extra-respiratory tissues were examined for virus replication by titration, immunofluorescence and RT-PCR throughout the course of infection. Both viruses caused a productive infection of the entire respiratory tract and epithelial cells in the lungs were the major target. Compared to the swine virus, the AIV produced lower virus titers and fewer antigen positive cells at all levels of the respiratory tract. The respiratory part of the nasal mucosa in particular showed only rare AIV positive cells and this was associated with reduced nasal shedding of the avian compared to the swine virus. The titers and distribution of the AIV varied extremely between individual pigs and were strongly affected by the route of inoculation. Gross lung lesions and clinical signs were milder with the avian than with the swine virus, corresponding with lower viral loads in the lungs. The brainstem was the single extra-respiratory tissue found positive for virus and viral RNA with both viruses. Our data do not reject the theory of the pig as an intermediate host for AIVs, but they suggest that AIVs need to undergo genetic changes to establish full replication potential in pigs. From a biomedical perspective, experimental LP H5 AIV infection of pigs may be useful to examine heterologous protection provided by H5 vaccines or other immunization strategies, as well as for further studies on the molecular pathogenesis and neurotropism of AIVs in mammals.

  10. Comparative genomic paleontology across plant kingdom reveals the dynamics of TE-driven genome evolution.

    PubMed

    El Baidouri, Moaine; Panaud, Olivier

    2013-01-01

    Long terminal repeat-retrotransposons (LTR-RTs) are the most abundant class of transposable elements (TEs) in plants. They strongly impact the structure, function, and evolution of their host genome, and, in particular, their role in genome size variation has been clearly established. However, the dynamics of the process through which LTR-RTs have differentially shaped plant genomes is still poorly understood because of a lack of comparative studies. Using a new robust and automated family classification procedure, we exhaustively characterized the LTR-RTs in eight plant genomes for which a high-quality sequence is available (i.e., Arabidopsis thaliana, A. lyrata, grapevine, soybean, rice, Brachypodium distachyon, sorghum, and maize). This allowed us to perform a comparative genome-wide study of the retrotranspositional landscape in these eight plant lineages from both monocots and dicots. We show that retrotransposition has recurrently occurred in all plant genomes investigated, regardless of their size, and through bursts, rather than a continuous process. Moreover, in each genome, only one or few LTR-RT families have been active in the recent past, and the difference in genome size among the species studied could thus mostly be accounted for by the extent of the latest transpositional burst(s). Following these bursts, LTR-RTs are efficiently eliminated from their host genomes through recombination and deletion, but we show that the removal rate is not lineage specific. These new findings lead us to propose a new model of TE-driven genome evolution in plants.

  11. IMGD: an integrated platform supporting comparative genomics and phylogenetics of insect mitochondrial genomes

    PubMed Central

    Lee, Wonhoon; Park, Jongsun; Choi, Jaeyoung; Jung, Kyongyong; Park, Bongsoo; Kim, Donghan; Lee, Jaeyoung; Ahn, Kyohun; Song, Wonho; Kang, Seogchan; Lee, Yong-Hwan; Lee, Seunghwan

    2009-01-01

    Background: Sequences and organization of the mitochondrial genome have been used as markers to investigate evolutionary history and relationships in many taxonomic groups. The rapidly increasing mitochondrial genome sequences from diverse insects provide ample opportunities to explore various global evolutionary questions in the superclass Hexapoda. To adequately support such questions, it is imperative to establish an informatics platform that facilitates the retrieval and utilization of available mitochondrial genome sequence data. Results: The Insect Mitochondrial Genome Database (IMGD) is a new integrated platform that archives the mitochondrial genome sequences from 25,747 hexapod species, including 112 completely sequenced and 20 nearly completed genomes and 113,985 partially sequenced mitochondrial genomes. The Species-driven User Interface (SUI) of IMGD supports data retrieval and diverse analyses at multi-taxon levels. The Phyloviewer implemented in IMGD provides three methods for drawing phylogenetic trees and displays the resulting trees on the web. The SNP database incorporated into IMGD presents the distribution of SNPs and INDELs in the mitochondrial genomes of multiple isolates within eight species. A newly developed comparative SNU Genome Browser supports the graphical presentation and interactive interface for the identified SNPs/INDELs. Conclusion: The IMGD provides a solid foundation for the comparative mitochondrial genomics and phylogenetics of insects. All data and functions described here are available at the web site . PMID:19351385

  12. The perennial ryegrass GenomeZipper: targeted use of genome resources for comparative grass genomics.

    PubMed

    Pfeifer, Matthias; Martis, Mihaela; Asp, Torben; Mayer, Klaus F X; Lübberstedt, Thomas; Byrne, Stephen; Frei, Ursula; Studer, Bruno

    2013-02-01

    Whole-genome sequences established for model and major crop species constitute a key resource for advanced genomic research. For outbreeding forage and turf grass species like ryegrasses (Lolium spp.), such resources have yet to be developed. Here, we present a model of the perennial ryegrass (Lolium perenne) genome on the basis of conserved synteny to barley (Hordeum vulgare) and the model grass genome Brachypodium (Brachypodium distachyon) as well as rice (Oryza sativa) and sorghum (Sorghum bicolor). A transcriptome-based genetic linkage map of perennial ryegrass served as a scaffold to establish the chromosomal arrangement of syntenic genes from model grass species. This scaffold revealed a high degree of synteny and macrocollinearity and was then utilized to anchor a collection of perennial ryegrass genes in silico to their predicted genome positions. This resulted in the unambiguous assignment of 3,315 out of 8,876 previously unmapped genes to the respective chromosomes. In total, the GenomeZipper incorporates 4,035 conserved grass gene loci, which were used for the first genome-wide sequence divergence analysis between perennial ryegrass, barley, Brachypodium, rice, and sorghum. The perennial ryegrass GenomeZipper is an ordered, information-rich genome scaffold, facilitating map-based cloning and genome assembly in perennial ryegrass and closely related Poaceae species. It also represents a milestone in describing synteny between perennial ryegrass and fully sequenced model grass genomes, thereby increasing our understanding of genome organization and evolution in the most important temperate forage and turf grass species.

  13. Complete genome sequences of avian paramyxovirus serotype 6 prototype strain Hong Kong and a recent novel strain from Italy: evidence for the existence of subgroups within the serotype

    PubMed Central

    Xiao, Sa; Subbiah, Madhuri; Kumar, Sachin; De Nardi, Roberta; Terregino, Calogero; Collins, Peter L.; Samal, Siba K.

    2010-01-01

    Complete genome sequences were determined for two strains of avian paramyxovirus serotype 6 (APMV-6): the prototype Hong Kong (HK) strain and a more recent isolate from Italy (IT4524-2). The genome length of strain HK is 16236 nucleotides (nt), which is the same as for the other two APMV-6 strains (FE and TW) that have been reported to date, whereas that of strain IT4524-2 is 16230 nt. The length difference in strain IT4524-2 is due to a 6-nt deletion in the downstream untranslated region of the F gene. All of these viruses follow the “rule of six”. Each genome consists of seven genes in the order of 3’N-P-M-F-SH-HN-L5’, which differs from other APMV serotypes in containing an additional gene encoding the small hydrophobic (SH) protein. Sequence comparisons revealed that strain IT4524-2 shares an unexpectedly low level of genome nt sequence identity (70%) and aggregate predicted amino acid (aa) sequence identity (79%) with the other three strains, which in contrast are more closely related to each other, with 94–98% nt sequence identity and 90–100% aggregate aa identity. Sequence analysis of the F-SH-HN genome region of two other recent Italian isolates showed that they fall in the HK/FE/TW group. The predicted signal peptide of IT4524-2 F protein lacks the N-terminal first 10 aa that are present in the other five strains. Also, the F protein cleavage site of strain IT4524-2, REPR↓L, has two basic aa (arginine, R) compared to the monobasic F protein cleavage site of PEPR↓L in the other strains. Reciprocal cross-hemagglutination inhibition (HI) assays using post infection chicken sera indicated that strain IT4524-2 is antigenically related to the other APMV-6 strains, but with 4- to 8-fold lower HI titers for the test sera between strain IT4524-2 and the other APMV-6 strains. Taken together, our results indicated that the APMV-6 strains represent a single serotype with two subgroups that differ substantially based on nt and aa sequences and can be
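
    The "rule of six" mentioned in this abstract refers to paramyxovirus genome lengths being an even multiple of six nucleotides. A minimal check using the two genome lengths quoted above is sketched here; the function name is hypothetical.

```python
def follows_rule_of_six(genome_length_nt):
    """Return True if the genome length is an exact multiple of six."""
    return genome_length_nt % 6 == 0


# Genome lengths (nt) as quoted in the abstract.
genome_lengths = {"HK": 16236, "IT4524-2": 16230}

rule_check = {
    strain: follows_rule_of_six(length)
    for strain, length in genome_lengths.items()
}

# The 6-nt deletion in IT4524-2 accounts for the whole length difference.
length_difference = genome_lengths["HK"] - genome_lengths["IT4524-2"]
```

    Because the deletion is itself six nucleotides long, both lengths remain multiples of six, which is consistent with the statement that all of these viruses follow the rule.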

  14. Mycobacterial species as case-study of comparative genome analysis.

    PubMed

    Zakham, F; Belayachi, L; Ussery, D; Akrim, M; Benjouad, A; El Aouad, R; Ennaji, M M

    2011-02-08

    The genus Mycobacterium comprises more than 120 species, including important human pathogens that cause major public health problems and illnesses. Further, with more than 100 genome sequences from this genus, comparative genome analysis can provide new insights for better understanding the evolutionary events of these species and improving drugs, vaccines, and diagnostic tools for controlling Mycobacterial diseases. In the present study we aim to outline a comparative genome analysis of fourteen Mycobacterial genomes: M. avium subsp. paratuberculosis K-10, M. bovis AF2122/97, M. bovis BCG str. Pasteur 1173P2, M. leprae Br4923, M. marinum M, M. sp. KMS, M. sp. MCS, M. tuberculosis CDC1551, M. tuberculosis F11, M. tuberculosis H37Ra, M. tuberculosis H37Rv, M. tuberculosis KZN 1435, M. ulcerans Agy99, and M. vanbaalenii PYR-1. For this purpose, the genomes were compared by length, GC content, and number of genes in different databases (GenBank, RefSeq, and Prodigal). A BLAST matrix of these genomes was constructed to summarize the similarity between species in a simple scheme. As a result of multiple genome analysis, the pan and core genome have been defined for twelve Mycobacterial species. We have also introduced the genome atlas of the reference strain M. tuberculosis H37Rv, which gives a good overview of this genome. To examine the phylogenetic relationships among these bacteria, a phylogenetic tree was constructed from the 16S rRNA gene for tuberculous and non-tuberculous Mycobacteria to understand the evolutionary events of these species.
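
    Two of the quantities named in this abstract have simple set-based and count-based definitions: the pan genome is the union of gene families across genomes, the core genome is their intersection, and GC content is the fraction of G and C bases. The sketch below illustrates those definitions only; the family identifiers and the short sequence are hypothetical and do not come from the study.

```python
def pan_and_core(families_by_genome):
    """Return (pan, core) gene-family sets for a dict of genome -> families."""
    sets = [set(families) for families in families_by_genome.values()]
    if not sets:
        return set(), set()
    pan = set().union(*sets)          # families seen in at least one genome
    core = set.intersection(*sets)    # families seen in every genome
    return pan, core


def gc_content(seq):
    """Fraction of bases in seq that are G or C (case-insensitive)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# Hypothetical gene-family assignments for three made-up genomes.
toy_families = {
    "genome_1": ["famA", "famB", "famC"],
    "genome_2": ["famA", "famB", "famD"],
    "genome_3": ["famA", "famC", "famD"],
}
pan, core = pan_and_core(toy_families)
toy_gc = gc_content("GGCCAT")
```

    In practice the gene families would come from a clustering step such as the all-against-all BLAST comparison the abstract describes; that step is outside the scope of this sketch.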

  15. Evolutionary and comparative analyses of the soybean genome

    PubMed Central

    Cannon, Steven B.; Shoemaker, Randy C.

    2012-01-01

    The soybean genome assembly has been available since the end of 2008. Significant features of the genome include large, gene-poor, repeat-dense pericentromeric regions, spanning roughly 57% of the genome sequence; a relatively large genome size of ~1.15 billion bases; remnants of a genome duplication that occurred ~13 million years ago (Mya); and fainter remnants of older polyploidies that occurred ~58 Mya and >130 Mya. The genome sequence has been used to identify the genetic basis for numerous traits, including disease resistance, nutritional characteristics, and developmental features. The genome sequence has provided a scaffold for placement of many genomic feature elements, both from within soybean and from related species. These may be accessed at several websites, including http://www.phytozome.net, http://soybase.org, http://comparative-legumes.org, and http://www.legumebase.brc.miyazaki-u.ac.jp. The taxonomic position of soybean in the Phaseoleae tribe of the legumes means that there are approximately two dozen other beans and relatives that have undergone independent domestication, and which may have traits that will be useful for transfer to soybean. Methods of translating information between species in the Phaseoleae range from design of markers for marker assisted selection, to transformation with Agrobacterium or with other experimental transformation methods. PMID:23136483

  16. Evolutionary and comparative analyses of the soybean genome.

    PubMed

    Cannon, Steven B; Shoemaker, Randy C

    2012-01-01

    The soybean genome assembly has been available since the end of 2008. Significant features of the genome include large, gene-poor, repeat-dense pericentromeric regions, spanning roughly 57% of the genome sequence; a relatively large genome size of ~1.15 billion bases; remnants of a genome duplication that occurred ~13 million years ago (Mya); and fainter remnants of older polyploidies that occurred ~58 Mya and >130 Mya. The genome sequence has been used to identify the genetic basis for numerous traits, including disease resistance, nutritional characteristics, and developmental features. The genome sequence has provided a scaffold for placement of many genomic feature elements, both from within soybean and from related species. These may be accessed at several websites, including http://www.phytozome.net, http://soybase.org, http://comparative-legumes.org, and http://www.legumebase.brc.miyazaki-u.ac.jp. The taxonomic position of soybean in the Phaseoleae tribe of the legumes means that there are approximately two dozen other beans and relatives that have undergone independent domestication, and which may have traits that will be useful for transfer to soybean. Methods of translating information between species in the Phaseoleae range from design of markers for marker assisted selection, to transformation with Agrobacterium or with other experimental transformation methods. PMID:23136483

  17. A Multi-Platform Draft de novo Genome Assembly and Comparative Analysis for the Scarlet Macaw (Ara macao)

    PubMed Central

    Seabury, Christopher M.; Dowd, Scot E.; Seabury, Paul M.; Raudsepp, Terje; Brightsmith, Donald J.; Liboriussen, Poul; Halley, Yvette; Fisher, Colleen A.; Owens, Elaine; Viswanathan, Ganesh; Tizard, Ian R.

    2013-01-01

    Data deposition to NCBI Genomes: This Whole Genome Shotgun project has been deposited at DDBJ/EMBL/GenBank under the accession AMXX00000000 (SMACv1.0, unscaffolded genome assembly). The version described in this paper is the first version (AMXX01000000). The scaffolded assembly (SMACv1.1) has been deposited at DDBJ/EMBL/GenBank under the accession AOUJ00000000, and is also the first version (AOUJ01000000). Strong biological interest in traits such as the acquisition and utilization of speech, cognitive abilities, and longevity catalyzed the utilization of two next-generation sequencing platforms to provide the first-draft de novo genome assembly for the large, new world parrot Ara macao (Scarlet Macaw). Despite the challenges associated with genome assembly for an outbred avian species, including 951,507 high-quality putative single nucleotide polymorphisms, the final genome assembly (>1.035 Gb) includes more than 997 Mb of unambiguous sequence data (excluding N’s). Cytogenetic analyses including ZooFISH revealed complex rearrangements associated with two scarlet macaw macrochromosomes (AMA6, AMA7), which supports the hypothesis that translocations, fusions, and intragenomic rearrangements are key factors associated with karyotype evolution among parrots. In silico annotation of the scarlet macaw genome provided robust evidence for 14,405 nuclear gene annotation models, their predicted transcripts and proteins, and a complete mitochondrial genome. Comparative analyses involving the scarlet macaw, chicken, and zebra finch genomes revealed high levels of nucleotide-based conservation as well as evidence for overall genome stability among the three highly divergent species. Application of a new whole-genome analysis of divergence involving all three species yielded prioritized candidate genes and noncoding regions for parrot traits of interest (i.e., speech, intelligence, longevity) which were independently supported by the results of previous human GWAS studies. We

  18. Comparative rates of evolution in endosymbiotic nuclear genomes

    PubMed Central

    Patron, Nicola J; Rogers, Matthew B; Keeling, Patrick J

    2006-01-01

    Background: The nucleomorphs associated with secondary plastids of cryptomonads and chlorarachniophytes are the sole examples of organelles with eukaryotic nuclear genomes. Although not as widespread as their prokaryotic equivalents in mitochondria and plastids, nucleomorph genomes share similarities in terms of reduction and compaction. They also differ in several aspects, not least in that they encode proteins that target to the plastid, and so function in a different compartment from that in which they are encoded. Results: Here, we test whether the phylogenetically distinct nucleomorph genomes of the cryptomonad, Guillardia theta, and the chlorarachniophyte, Bigelowiella natans, have experienced similar evolutionary pressures during their transformation to reduced organelles. We compared the evolutionary rates of genes from nuclear, nucleomorph, and plastid genomes, all of which encode proteins that function in the same cellular compartment, the plastid, and are thus subject to similar selection pressures. Furthermore, we investigated the divergence of nucleomorphs within cryptomonads by comparing G. theta and Rhodomonas salina. Conclusion: Chlorarachniophyte nucleomorph genes have accumulated errors at a faster rate than other genomes within the same cell, regardless of the compartment where the gene product functions. In contrast, most nucleomorph genes in cryptomonads have evolved faster than genes in other genomes on average, but genes for plastid-targeted proteins are not overly divergent, and it appears that cryptomonad nucleomorphs are not presently evolving rapidly and have therefore stabilized. Overall, these analyses suggest that the forces at work in the two lineages are different, despite the similarities between the structures of their genomes. PMID:16772046

  19. Comparative Genomics of a Parthenogenesis-Inducing Wolbachia Symbiont

    PubMed Central

    Lindsey, Amelia R. I.; Werren, John H.; Richards, Stephen; Stouthamer, Richard

    2016-01-01

    Wolbachia is an intracellular symbiont of invertebrates responsible for inducing a wide variety of phenotypes in its host. These host-Wolbachia relationships span the continuum from reproductive parasitism to obligate mutualism, and provide a unique system to study genomic changes associated with the evolution of symbiosis. We present the genome sequence from a parthenogenesis-inducing Wolbachia strain (wTpre) infecting the minute parasitoid wasp Trichogramma pretiosum. The wTpre genome is the most complete parthenogenesis-inducing Wolbachia genome available to date. We used comparative genomics across 16 Wolbachia strains, representing five supergroups, to identify a core Wolbachia genome of 496 sets of orthologous genes. Only 14 of these sets are unique to Wolbachia when compared to other bacteria from the Rickettsiales. We show that the B supergroup of Wolbachia, of which wTpre is a member, contains a significantly higher number of ankyrin repeat-containing genes than other supergroups. In the wTpre genome, there is evidence for truncation of the protein coding sequences in 20% of ORFs, mostly as a result of frameshift mutations. The wTpre strain represents a conversion from cytoplasmic incompatibility to a parthenogenesis-inducing lifestyle, and is required for reproduction in the Trichogramma host it infects. We hypothesize that the large number of coding frame truncations has accompanied the change in reproductive mode of the wTpre strain. PMID:27194801

  20. The MicrobesOnline Web site for comparative genomics

    SciTech Connect

    Alm, Eric J.; Huang, Katherine H.; Price, Morgan N.; Koche, Richard P.; Keller, Keith; Dubchak, Inna L.; Arkin, Adam P.

    2004-11-05

    At present, hundreds of microbial genomes have been sequenced, and hundreds more are currently in the pipeline. The Virtual Institute for Microbial Stress and Survival has developed a publicly available suite of Web-based comparative genomic tools (http://www.microbesonline.org) designed to facilitate multispecies comparison among prokaryotes. Highlights of the MicrobesOnline Web site include operon and regulon predictions, a multispecies genome browser, a multispecies Gene Ontology browser, a comparative KEGG metabolic pathway viewer, a Bioinformatics Workbench for in-depth sequence analysis, and Gene Carts that allow users to save genes of interest for further study while they browse. In addition, we provide an interface for genome annotation, which, like all of the tools reported here, is freely available to the scientific community.

  1. The MicrobesOnline Web site for comparative genomics

    PubMed Central

    Alm, Eric J.; Huang, Katherine H.; Price, Morgan N.; Koche, Richard P.; Keller, Keith; Dubchak, Inna L.; Arkin, Adam P.

    2005-01-01

    At present, hundreds of microbial genomes have been sequenced, and hundreds more are currently in the pipeline. The Virtual Institute for Microbial Stress and Survival has developed a publicly available suite of Web-based comparative genomic tools (http://www.microbesonline.org) designed to facilitate multispecies comparison among prokaryotes. Highlights of the MicrobesOnline Web site include operon and regulon predictions, a multispecies genome browser, a multispecies Gene Ontology browser, a comparative KEGG metabolic pathway viewer, a Bioinformatics Workbench for in-depth sequence analysis, and Gene Carts that allow users to save genes of interest for further study while they browse. In addition, we provide an interface for genome annotation, which, like all of the tools reported here, is freely available to the scientific community. PMID:15998914

  2. Sputnik: a database platform for comparative plant genomics.

    PubMed

    Rudd, Stephen; Mewes, Hans-Werner; Mayer, Klaus F X

    2003-01-01

    Two million plant ESTs, from 20 different plant species and totalling more than 1000 Mbp of DNA sequence, represent a formidable transcriptomic resource. Sputnik uses the potential of this sequence resource to fill some of the information gap in the un-sequenced plant genomes and to serve as the foundation for in silico comparative plant genomics. The complexity of the individual EST collections has been reduced using optimised EST clustering techniques. Annotation of cluster sequences is performed by exploiting and transferring information from the comprehensive knowledgebase already produced for the completed model plant genome (Arabidopsis thaliana) and by performing additional state-of-the-art sequence analyses relevant to today's plant biologist. Functional predictions, comparative analyses and associative annotations for 500 000 plant EST derived peptides make Sputnik (http://mips.gsf.de/proj/sputnik/) a valid platform for contemporary plant genomics.

  3. Sputnik: a database platform for comparative plant genomics.

    PubMed

    Rudd, Stephen; Mewes, Hans-Werner; Mayer, Klaus F X

    2003-01-01

    Two million plant ESTs, from 20 different plant species and totalling more than 1000 Mbp of DNA sequence, represent a formidable transcriptomic resource. Sputnik uses the potential of this sequence resource to fill some of the information gap in the un-sequenced plant genomes and to serve as the foundation for in silico comparative plant genomics. The complexity of the individual EST collections has been reduced using optimised EST clustering techniques. Annotation of cluster sequences is performed by exploiting and transferring information from the comprehensive knowledgebase already produced for the completed model plant genome (Arabidopsis thaliana) and by performing additional state-of-the-art sequence analyses relevant to today's plant biologist. Functional predictions, comparative analyses and associative annotations for 500 000 plant EST derived peptides make Sputnik (http://mips.gsf.de/proj/sputnik/) a valid platform for contemporary plant genomics. PMID:12519965

  4. Comparative genomics for the investigation of autoimmune diseases.

    PubMed

    Möller, Steffen; Zettl, Uwe K; Serrano-Fernández, Pablo; Goertsches, Robert

    2006-01-01

    The complete DNA sequences of the human genome and of several related mammals are now available, owing to the investment of enormous resources and advances in sequencing technology. Novel technologies have been developed to compare multiple genomes with each other, thus specifying regions of sequence similarity among mammals and with their pathogens. Larger blocks of sequence similarity (syntenic regions) have been determined and made publicly available. Novel insights can be gained from such data in many ways by combining them with external genetic or clinical information for these syntenic loci. These novel tools have proven to be successful in inferring functional equivalence between loci of multiple genomes. This review reports on the role of comparative genomics in research on autoimmune diseases, a field with strong dependencies on animal models of human diseases and the problem of an adequate information transfer between multiple organisms and research areas. PMID:17073670

  5. Sputnik: a database platform for comparative plant genomics

    PubMed Central

    Rudd, Stephen; Mewes, Hans-Werner; Mayer, Klaus F.X.

    2003-01-01

    Two million plant ESTs, from 20 different plant species and totalling more than 1000 Mbp of DNA sequence, represent a formidable transcriptomic resource. Sputnik uses the potential of this sequence resource to fill some of the information gap in the un-sequenced plant genomes and to serve as the foundation for in silico comparative plant genomics. The complexity of the individual EST collections has been reduced using optimised EST clustering techniques. Annotation of cluster sequences is performed by exploiting and transferring information from the comprehensive knowledgebase already produced for the completed model plant genome (Arabidopsis thaliana) and by performing additional state-of-the-art sequence analyses relevant to today's plant biologist. Functional predictions, comparative analyses and associative annotations for 500 000 plant EST derived peptides make Sputnik (http://mips.gsf.de/proj/sputnik/) a valid platform for contemporary plant genomics. PMID:12519965

  6. Comparative Genomics via Wavelet Analysis for Closely Related Bacteria

    NASA Astrophysics Data System (ADS)

    Song, Jiuzhou; Ware, Tony; Liu, Shu-Lin; Surette, M.

    2004-12-01

    Comparative genomics has been a valuable method for extracting and extrapolating genome information among closely related bacteria. The efficiency of traditional methods depends strongly on the software used. To overcome this problem, we propose using wavelet analysis to perform comparative genomics. First, global comparison using wavelet analysis quantifies the difference between genomes. Then, local comparison using keto-excess or purine-excess plots shows the precise positions of inversions, translocations, and horizontally transferred DNA fragments. We first found that the level of difference between energy spectra is related to the similarity of bacterial strains; it could serve as a quantitative index of genome similarity. The strategy is described in detail through comparisons of closely related strains: S. typhi CT18, S. typhi Ty2, S. typhimurium LT2, H. pylori 26695, and H. pylori J99.
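
    The purine-excess and keto-excess plots mentioned in this abstract can be illustrated with a minimal sketch. The abstract gives no formulas, so the code below assumes the commonly used cumulative definition: walk along the sequence and add 1 for each purine (A, G) or keto base (G, T), subtract 1 for each of the other two bases, and leave the running sum unchanged for ambiguous characters. The function names are hypothetical, and the wavelet step of the authors' method is not shown.

```python
from itertools import accumulate

PURINES = frozenset("AG")  # A and G are purines; C and T are pyrimidines
KETO = frozenset("GT")     # G and T are keto bases; A and C are amino bases


def cumulative_excess(seq, plus_bases):
    """Running sum along seq: +1 for bases in plus_bases, -1 for the other
    unambiguous bases, 0 for anything else (e.g. N)."""
    steps = []
    for base in seq.upper():
        if base in plus_bases:
            steps.append(1)
        elif base in "ACGT":
            steps.append(-1)
        else:
            steps.append(0)  # ambiguous base: leave the running sum unchanged
    return list(accumulate(steps))


def purine_excess(seq):
    """Cumulative (purines - pyrimidines) at each position (assumed definition)."""
    return cumulative_excess(seq, PURINES)


def keto_excess(seq):
    """Cumulative (keto - amino) at each position (assumed definition)."""
    return cumulative_excess(seq, KETO)


if __name__ == "__main__":
    toy = "AGCTN"  # toy sequence, not real genome data
    print(purine_excess(toy))
    print(keto_excess(toy))
```

    Plotting either list against position gives the kind of curve in which abrupt changes of slope can mark rearrangement boundaries; for whole genomes an array-based implementation would be more practical than Python lists.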

  7. Transcriptional analysis of the innate immune response using the avian innate immunity microarray

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The avian innate immunity microarray (AIIM) is a genomics tool designed to study the transcriptional activity of the avian immune response (Cytogenet. Genome Res. 117:139-145, 2007). It is an avian cDNA microarray representing 4,959 avian genes spotted in triplicate. The AIIM contains 25 avian int...

  8. Comparative Genome Analysis of Basidiomycete Fungi

    SciTech Connect

    Riley, Robert; Salamov, Asaf; Morin, Emmanuelle; Nagy, Laszlo; Manning, Gerard; Baker, Scott; Brown, Daren; Henrissat, Bernard; Levasseur, Anthony; Hibbett, David; Martin, Francis; Grigoriev, Igor

    2012-03-19

    Fungi of the phylum Basidiomycota (basidiomycetes) make up some 37 percent of the described fungi, and are important in forestry, agriculture, medicine, and bioenergy. This diverse phylum includes the mushrooms, wood rots, symbionts, and plant and animal pathogens. To better understand the diversity of phenotypes in basidiomycetes, we performed a comparative analysis of 35 basidiomycete fungi spanning the diversity of the phylum. Phylogenetic patterns of lignocellulose degrading genes suggest a continuum rather than a sharp dichotomy between the white rot and brown rot modes of wood decay. Patterns of secondary metabolic enzymes give additional insight into the broad array of phenotypes found in the basidiomycetes. We suggest that the profile of an organism in lignocellulose-targeting genes can be used to predict its nutritional mode, and predict Dacryopinax sp. as a brown rot; Botryobasidium botryosum and Jaapia argillacea as white rots.

  9. PLAZA: A Comparative Genomics Resource to Study Gene and Genome Evolution in Plants

    PubMed Central

    Proost, Sebastian; Van Bel, Michiel; Sterck, Lieven; Billiau, Kenny; Van Parys, Thomas; Van de Peer, Yves; Vandepoele, Klaas

    2009-01-01

    The number of sequenced genomes of representatives within the green lineage is rapidly increasing. Consequently, comparative sequence analysis has significantly altered our view on the complexity of genome organization, gene function, and regulatory pathways. To explore all this genome information, a centralized infrastructure is required where all data generated by different sequencing initiatives is integrated and combined with advanced methods for data mining. Here, we describe PLAZA, an online platform for plant comparative genomics (http://bioinformatics.psb.ugent.be/plaza/). This resource integrates structural and functional annotation of published plant genomes together with a large set of interactive tools to study gene function and gene and genome evolution. Precomputed data sets cover homologous gene families, multiple sequence alignments, phylogenetic trees, intraspecies whole-genome dot plots, and genomic colinearity between species. Through the integration of high confidence Gene Ontology annotations and tree-based orthology between related species, thousands of genes lacking any functional description are functionally annotated. Advanced query systems, as well as multiple interactive visualization tools, are available through a user-friendly and intuitive Web interface. In addition, detailed documentation and tutorials introduce the different tools, while the workbench provides an efficient means to analyze user-defined gene sets through PLAZA's interface. In conclusion, PLAZA provides a comprehensible and up-to-date research environment to aid researchers in the exploration of genome information within the green plant lineage. PMID:20040540

  10. PLAZA: a comparative genomics resource to study gene and genome evolution in plants.

    PubMed

    Proost, Sebastian; Van Bel, Michiel; Sterck, Lieven; Billiau, Kenny; Van Parys, Thomas; Van de Peer, Yves; Vandepoele, Klaas

    2009-12-01

    The number of sequenced genomes of representatives within the green lineage is rapidly increasing. Consequently, comparative sequence analysis has significantly altered our view on the complexity of genome organization, gene function, and regulatory pathways. To explore all this genome information, a centralized infrastructure is required where all data generated by different sequencing initiatives is integrated and combined with advanced methods for data mining. Here, we describe PLAZA, an online platform for plant comparative genomics (http://bioinformatics.psb.ugent.be/plaza/). This resource integrates structural and functional annotation of published plant genomes together with a large set of interactive tools to study gene function and gene and genome evolution. Precomputed data sets cover homologous gene families, multiple sequence alignments, phylogenetic trees, intraspecies whole-genome dot plots, and genomic colinearity between species. Through the integration of high confidence Gene Ontology annotations and tree-based orthology between related species, thousands of genes lacking any functional description are functionally annotated. Advanced query systems, as well as multiple interactive visualization tools, are available through a user-friendly and intuitive Web interface. In addition, detailed documentation and tutorials introduce the different tools, while the workbench provides an efficient means to analyze user-defined gene sets through PLAZA's interface. In conclusion, PLAZA provides a comprehensible and up-to-date research environment to aid researchers in the exploration of genome information within the green plant lineage.

  11. Genomic and comparative genomic analyses of Rickettsia heilongjiangensis provide insight into its evolution and pathogenesis.

    PubMed

    Duan, Changsong; Xiong, Xiaolu; Qi, Yong; Gong, Wenping; Jiao, Jun; Wen, Bohai

    2014-08-01

    Rickettsia heilongjiangensis, the causative agent of far eastern spotted fever, is an obligate intracellular gram-negative bacterium that belongs to the spotted fever group rickettsiae. To understand the evolution and pathogenesis of R. heilongjiangensis, we analyzed its genome and compared it with other rickettsial genomes available in GenBank. The R. heilongjiangensis chromosome contains 1333 genes, including 1297 protein coding genes and 36 RNA coding genes. The genome also contains 121 pseudogenes, 54 insertion sequences, and 39 tandem repeats. Sixteen genes encoding the major components of the type IV secretion systems were identified in the R. heilongjiangensis genome. In total, 37 β-barrel outer membrane proteins were predicted in the genome, eight of which have been previously confirmed to be outer membrane proteins. In addition, 266 potential virulence factor genes, seven partially deleted antibiotic resistance genes, and a genomic island were identified in the genome. The codon usage in the genome is compatible with its low GC content, and the amino acid usage shows apparent bias. A comparative genomic analysis showed that R. heilongjiangensis and R. japonica share one unique fragment that may be a target sequence for a diagnostic assay. The orthologs of 37 genes of R. heilongjiangensis were found in pathogenic R. rickettsii str. Sheila Smith but not in non-pathogenic R. rickettsii str. Iowa, which may explain why R. heilongjiangensis is pathogenic. Pan-genome analysis showed that R. heilongjiangensis and 42 other rickettsiae strains share 693 core genes with a pan-genome size of 4837 genes. The pan-genome-based phylogeny showed that R. heilongjiangensis was closely related to R. japonica.
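
    The core-gene and pan-genome counts reported in this abstract follow from simple set operations once every gene has been assigned to an ortholog group. The sketch below assumes that assignment has already been made by some orthology tool; the function name, strain labels, and group IDs are hypothetical toy values, not data from the study.

```python
def core_and_pan(genomes):
    """Return (core, pan) for a mapping of strain name -> iterable of
    ortholog-group IDs.

    core: groups present in every strain (set intersection)
    pan:  groups present in at least one strain (set union)
    """
    gene_sets = [set(groups) for groups in genomes.values()]
    if not gene_sets:
        return set(), set()
    core = set.intersection(*gene_sets)
    pan = set.union(*gene_sets)
    return core, pan


if __name__ == "__main__":
    # Toy input: three hypothetical strains with hypothetical ortholog groups.
    toy = {
        "strain_A": {"og1", "og2", "og3"},
        "strain_B": {"og1", "og2", "og4"},
        "strain_C": {"og1", "og2", "og5"},
    }
    core, pan = core_and_pan(toy)
    print(len(core), "core groups;", len(pan), "groups in the pan-genome")
```

    In a real analysis the difficult part is the ortholog clustering itself; the set arithmetic here only summarizes its result.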

  12. Gramene 2016: comparative plant genomics and pathway resources.

    PubMed

    Tello-Ruiz, Marcela K; Stein, Joshua; Wei, Sharon; Preece, Justin; Olson, Andrew; Naithani, Sushma; Amarasinghe, Vindhya; Dharmawardhana, Palitha; Jiao, Yinping; Mulvaney, Joseph; Kumari, Sunita; Chougule, Kapeel; Elser, Justin; Wang, Bo; Thomason, James; Bolser, Daniel M; Kerhornou, Arnaud; Walts, Brandon; Fonseca, Nuno A; Huerta, Laura; Keays, Maria; Tang, Y Amy; Parkinson, Helen; Fabregat, Antonio; McKay, Sheldon; Weiser, Joel; D'Eustachio, Peter; Stein, Lincoln; Petryszak, Robert; Kersey, Paul J; Jaiswal, Pankaj; Ware, Doreen

    2016-01-01

    Gramene (http://www.gramene.org) is an online resource for comparative functional genomics in crops and model plant species. Its two main frameworks are genomes (collaboration with Ensembl Plants) and pathways (The Plant Reactome and archival BioCyc databases). Since our last NAR update, the database website adopted a new Drupal management platform. The genomes section features 39 fully assembled reference genomes that are integrated using ontology-based annotation and comparative analyses, and accessed through both visual and programmatic interfaces. Additional community data, such as genetic variation, expression and methylation, are also mapped for a subset of genomes. The Plant Reactome pathway portal (http://plantreactome.gramene.org) provides a reference resource for analyzing plant metabolic and regulatory pathways. In addition to ∼200 curated rice reference pathways, the portal hosts gene homology-based pathway projections for 33 plant species. Both the genome and pathway browsers interface with the EMBL-EBI's Expression Atlas to enable the projection of baseline and differential expression data from curated expression studies in plants. Gramene's archive website (http://archive.gramene.org) continues to provide previously reported resources on comparative maps, markers and QTL. To further aid our users, we have also introduced a live monthly educational webinar series and a Gramene YouTube channel carrying video tutorials.

  13. Gramene 2016: comparative plant genomics and pathway resources

    PubMed Central

    Tello-Ruiz, Marcela K.; Stein, Joshua; Wei, Sharon; Preece, Justin; Olson, Andrew; Naithani, Sushma; Amarasinghe, Vindhya; Dharmawardhana, Palitha; Jiao, Yinping; Mulvaney, Joseph; Kumari, Sunita; Chougule, Kapeel; Elser, Justin; Wang, Bo; Thomason, James; Bolser, Daniel M.; Kerhornou, Arnaud; Walts, Brandon; Fonseca, Nuno A.; Huerta, Laura; Keays, Maria; Tang, Y. Amy; Parkinson, Helen; Fabregat, Antonio; McKay, Sheldon; Weiser, Joel; D'Eustachio, Peter; Stein, Lincoln; Petryszak, Robert; Kersey, Paul J.; Jaiswal, Pankaj; Ware, Doreen

    2016-01-01

    Gramene (http://www.gramene.org) is an online resource for comparative functional genomics in crops and model plant species. Its two main frameworks are genomes (collaboration with Ensembl Plants) and pathways (The Plant Reactome and archival BioCyc databases). Since our last NAR update, the database website adopted a new Drupal management platform. The genomes section features 39 fully assembled reference genomes that are integrated using ontology-based annotation and comparative analyses, and accessed through both visual and programmatic interfaces. Additional community data, such as genetic variation, expression and methylation, are also mapped for a subset of genomes. The Plant Reactome pathway portal (http://plantreactome.gramene.org) provides a reference resource for analyzing plant metabolic and regulatory pathways. In addition to ∼200 curated rice reference pathways, the portal hosts gene homology-based pathway projections for 33 plant species. Both the genome and pathway browsers interface with the EMBL-EBI's Expression Atlas to enable the projection of baseline and differential expression data from curated expression studies in plants. Gramene's archive website (http://archive.gramene.org) continues to provide previously reported resources on comparative maps, markers and QTL. To further aid our users, we have also introduced a live monthly educational webinar series and a Gramene YouTube channel carrying video tutorials. PMID:26553803

  14. Gramene 2016: comparative plant genomics and pathway resources.

    PubMed

    Tello-Ruiz, Marcela K; Stein, Joshua; Wei, Sharon; Preece, Justin; Olson, Andrew; Naithani, Sushma; Amarasinghe, Vindhya; Dharmawardhana, Palitha; Jiao, Yinping; Mulvaney, Joseph; Kumari, Sunita; Chougule, Kapeel; Elser, Justin; Wang, Bo; Thomason, James; Bolser, Daniel M; Kerhornou, Arnaud; Walts, Brandon; Fonseca, Nuno A; Huerta, Laura; Keays, Maria; Tang, Y Amy; Parkinson, Helen; Fabregat, Antonio; McKay, Sheldon; Weiser, Joel; D'Eustachio, Peter; Stein, Lincoln; Petryszak, Robert; Kersey, Paul J; Jaiswal, Pankaj; Ware, Doreen

    2016-01-01

    Gramene (http://www.gramene.org) is an online resource for comparative functional genomics in crops and model plant species. Its two main frameworks are genomes (collaboration with Ensembl Plants) and pathways (The Plant Reactome and archival BioCyc databases). Since our last NAR update, the database website adopted a new Drupal management platform. The genomes section features 39 fully assembled reference genomes that are integrated using ontology-based annotation and comparative analyses, and accessed through both visual and programmatic interfaces. Additional community data, such as genetic variation, expression and methylation, are also mapped for a subset of genomes. The Plant Reactome pathway portal (http://plantreactome.gramene.org) provides a reference resource for analyzing plant metabolic and regulatory pathways. In addition to ∼200 curated rice reference pathways, the portal hosts gene homology-based pathway projections for 33 plant species. Both the genome and pathway browsers interface with the EMBL-EBI's Expression Atlas to enable the projection of baseline and differential expression data from curated expression studies in plants. Gramene's archive website (http://archive.gramene.org) continues to provide previously reported resources on comparative maps, markers and QTL. To further aid our users, we have also introduced a live monthly educational webinar series and a Gramene YouTube channel carrying video tutorials. PMID:26553803

  15. Comparative osteohistology of Hesperornis with reference to pygoscelid penguins: the effects of climate and behaviour on avian bone microstructure

    PubMed Central

    Wilson, Laura E.; Chin, Karen

    2014-01-01

    The broad biogeographic distribution of Hesperornis fossils in Late Cretaceous Western Interior Seaway deposits has prompted questions about whether they endured polar winters or migrated between mid- and high latitudes. Here, we compare microstructures of hesperornithiform long bones from Kansas and the Arctic to investigate whether migration or Late Cretaceous polar climate affected bone growth. We also examine modern penguin bones to determine how migration and climate may influence bone growth in birds with known behaviours. Histological analysis of hesperornithiform samples reveals continuous bone deposition throughout the cortex, plus an outer circumferential layer in adults. No cyclic growth marks, zonation or differences in vasculature are apparent in the Hesperornis specimens. Comparatively, migratory Adélie and chinstrap penguin bones show no zonation or changes in microstructure, suggesting that migration is not necessarily recorded in avian bone microstructure. Non-migratory gentoos show evidence of rapid bone growth possibly associated with increased chick growth rates in high-latitude populations and large body size. The absence of histological evidence for migration in extinct Hesperornis and extant pygoscelid penguins may reflect that these birds reached skeletal maturity before migration or overwintering. This underscores the challenges of using bone microstructure to infer the effects of behaviour and climate on avian growth. PMID:26064560

  16. Comparative osteohistology of Hesperornis with reference to pygoscelid penguins: the effects of climate and behaviour on avian bone microstructure.

    PubMed

    Wilson, Laura E; Chin, Karen

    2014-11-01

    The broad biogeographic distribution of Hesperornis fossils in Late Cretaceous Western Interior Seaway deposits has prompted questions about whether they endured polar winters or migrated between mid- and high latitudes. Here, we compare microstructures of hesperornithiform long bones from Kansas and the Arctic to investigate whether migration or Late Cretaceous polar climate affected bone growth. We also examine modern penguin bones to determine how migration and climate may influence bone growth in birds with known behaviours. Histological analysis of hesperornithiform samples reveals continuous bone deposition throughout the cortex, plus an outer circumferential layer in adults. No cyclic growth marks, zonation or differences in vasculature are apparent in the Hesperornis specimens. Comparatively, migratory Adélie and chinstrap penguin bones show no zonation or changes in microstructure, suggesting that migration is not necessarily recorded in avian bone microstructure. Non-migratory gentoos show evidence of rapid bone growth possibly associated with increased chick growth rates in high-latitude populations and large body size. The absence of histological evidence for migration in extinct Hesperornis and extant pygoscelid penguins may reflect that these birds reached skeletal maturity before migration or overwintering. This underscores the challenges of using bone microstructure to infer the effects of behaviour and climate on avian growth.

  17. Complete genome sequencing and comparative genomic analysis of functionally diverse Lysinibacillus sphaericus III(3)7.

    PubMed

    Rey, Andrés; Silva-Quintero, Laura; Dussán, Jenny

    2016-09-01

    Lysinibacillus sphaericus III(3)7 is a native Colombian strain, the first one isolated from soil samples. This strain has shown high levels of pathogenic activity against Culex quinquefasciatus larvae in laboratory assays compared to other members of the same species. Using Pacific Biosciences sequencing technology, we sequenced, annotated (de novo) and described the genome of strain III(3)7, achieving complete genome sequence status. We then performed a comparative analysis between the newly sequenced genome and those previously reported for the Colombian isolates L. sphaericus OT4b.31, CBAM5 and OT4b.25, with the inclusion of L. sphaericus C3-41, which has been used as a reference genome for most previous genome sequencing projects. We concluded that L. sphaericus III(3)7 is highly similar to strain OT4b.25 and shares high levels of synteny with isolates CBAM5 and C3-41. PMID:27419068

  18. The ABC's of comparative genomics in the Brassicaceae: building blocks of crucifer genomes.

    PubMed

    Schranz, M Eric; Lysak, Martin A; Mitchell-Olds, Thomas

    2006-11-01

    In this review we summarize recent advances in our understanding of phylogenetics, polyploidization and comparative genomics in the family Brassicaceae. These findings pave the way for a unified comparative genomic framework. We integrate several of these findings into a simple system of 24 conserved chromosomal blocks (labeled A-X). The naming, order, orientation and color-coding of these blocks are based on their positions in a proposed ancestral karyotype (n=8), rather than by their position in the reduced genome of Arabidopsis thaliana (n=5). We show how these crucifer building blocks can be rearranged to model the genome structures of A. thaliana, Arabidopsis lyrata, Capsella rubella and Brassica rapa. A framework for comparison between species is timely because several crucifer genome-sequencing projects are underway.

  19. Comparative proteogenomics: combining mass spectrometry and comparative genomics to analyze multiple genomes

    SciTech Connect

    Gupta, Nitin; Benhamida, Jamal; Bhargava, Vipul; Goodman, Daniel; Kain, Elisabeth; Kerman, Ian; Nguyen, Ngan; Ollikainen, Noah; Rodriguez, Jesse; Wang, J.; Lipton, Mary S.; Romine, Margaret F.; Bafna, Vineet; Smith, Richard D.; Pevzner, Pavel A.

    2008-07-30

    While bacterial genome annotations have significantly improved in recent years, techniques for bacterial proteome annotation (including post-translational chemical modifications, signal peptides, proteolytic events, etc.) are still in their infancy. At the same time, the number of sequenced bacterial genomes is rising sharply, far outpacing our ability to validate the predicted genes, let alone annotate bacterial proteomes. In this study, we use tandem mass spectrometry (MS/MS) to annotate the proteome of Shewanella oneidensis MR-1, an important microbe for bioremediation. In particular, we provide the first comprehensive map of post-translational modifications in a bacterial genome, including a large number of chemical modifications, signal peptide cleavages and cleavage of N-terminal methionine residues. We also detect multiple genes that were missed or assigned incorrect start positions by gene prediction programs and suggest corrections to improve the gene annotation. This study demonstrates that complementing every genome sequencing project by an MS/MS project would significantly improve both genome and proteome annotations for a reasonable cost.

  20. SUPERFAMILY--sophisticated comparative genomics, data mining, visualization and phylogeny.

    PubMed

    Wilson, Derek; Pethica, Ralph; Zhou, Yiduo; Talbot, Charles; Vogel, Christine; Madera, Martin; Chothia, Cyrus; Gough, Julian

    2009-01-01

    SUPERFAMILY provides structural, functional and evolutionary information for proteins from all completely sequenced genomes, and large sequence collections such as UniProt. Protein domain assignments for over 900 genomes are included in the database, which can be accessed at http://supfam.org/. Hidden Markov models based on Structural Classification of Proteins (SCOP) domain definitions at the superfamily level are used to provide structural annotation. We recently produced a new model library based on SCOP 1.73. Family level assignments are also available. From the web site users can submit sequences for SCOP domain classification; search for keywords such as superfamilies, families, organism names, models and sequence identifiers; find over- and underrepresented families or superfamilies within a genome relative to other genomes or groups of genomes; compare domain architectures across selections of genomes and finally build multiple sequence alignments between Protein Data Bank (PDB), genomic and custom sequences. Recent extensions to the database include InterPro abstracts and Gene Ontology terms for superfamilies, taxonomic visualization of the distribution of families across the tree of life, searches for functionally similar domain architectures and phylogenetic trees. The database, models and associated scripts are available for download from the ftp site.

  1. Avian sex, sex chromosomes, and dosage compensation in the age of genomics.

    PubMed

    Graves, Jennifer A Marshall

    2014-04-01

    Comparisons of the sex chromosome systems in birds and mammals are widening our view and deepening our understanding of vertebrate sex chromosome organization, function, and evolution. Birds have a very conserved ZW system of sex determination in which males have two copies of a large, gene-rich Z chromosome, and females have a single Z and a female-specific W chromosome. The avian ZW system is quite the reverse of the well-studied mammalian XY chromosome system, and evolved independently from different autosomal blocs. Despite the different gene content of mammal and bird sex chromosomes, there are many parallels. Genes on the bird Z and the mammal X have both undergone selection for male-advantage functions, and there has been amplification of male-advantage genes and accumulation of LINEs. The bird W and mammal Y have both undergone extensive degradation, but some birds retain early stages and some mammals terminal stages of the process, suggesting that the process is more advanced in mammals. Different sex-determining genes, DMRT1 and SRY, define the ZW and XY systems, but DMRT1 is involved in downstream events in mammals. Birds show strong cell autonomous specification of somatic sex differences in ZZ and ZW tissue, but there is growing evidence for direct X chromosome effects on sexual phenotype in mammals. Dosage compensation in birds appears to be phenotypically and molecularly quite different from X inactivation, being partial and gene-specific, but both systems use tools from the same molecular toolbox and there are some signs that galliform birds represent an early stage in the evolution of a coordinated system.

  2. DCODE.ORG Anthology of Comparative Genomic Tools

    SciTech Connect

    Loots, G G; Ovcharenko, I

    2005-01-11

    Comparative genomics provides the means to demarcate functional regions in anonymous DNA sequences. The successful application of this method to identifying novel genes is currently shifting to deciphering the noncoding encryption of gene regulation across genomes. To facilitate the use of comparative genomics to practical applications in genetics and genomics we have developed several analytical and visualization tools for the analysis of arbitrary sequences and whole genomes. These tools include two alignment tools: zPicture and Mulan; a phylogenetic shadowing tool: eShadow for identifying lineage- and species-specific functional elements; two evolutionary conserved transcription factor analysis tools: rVista and multiTF; a tool for extracting cis-regulatory modules governing the expression of co-regulated genes, CREME; and a dynamic portal to multiple vertebrate and invertebrate genome alignments, the ECR Browser. Here we briefly describe each one of these tools and provide specific examples on their practical applications. All the tools are publicly available at the http://www.dcode.org/ web site.

  4. Comparative Genomic and Phylogenomic Analyses Reveal a Conserved Core Genome Shared by Estuarine and Oceanic Cyanopodoviruses.

    PubMed

    Huang, Sijun; Zhang, Si; Jiao, Nianzhi; Chen, Feng

    2015-01-01

    Podoviruses are among the major viral groups that infect marine picocyanobacteria Prochlorococcus and Synechococcus. Here, we reported the genome sequences of five Synechococcus podoviruses isolated from the estuarine environment, and performed comparative genomic and phylogenomic analyses based on a total of 20 cyanopodovirus genomes. The genomes of all the known marine cyanopodoviruses are highly syntenic. A pan-genome of 349 clustered orthologous groups was determined, among which 15 were core genes. These core genes make up nearly half of each genome in length, reflecting the high level of genome conservation among this cyanophage type. The whole genome phylogenies based on concatenated core genes and gene content were highly consistent and confirmed the separation of two discrete marine cyanopodovirus clusters MPP-A and MPP-B. The genomes within cluster MPP-B grouped into subclusters mainly corresponding to Prochlorococcus or Synechococcus host types. Auxiliary metabolic genes tend to occur in a specific phylogenetic group of these cyanopodoviruses. All the MPP-B phages analyzed here encode the photosynthesis gene psbA, which is absent in all the MPP-A genomes thus far. Interestingly, all the MPP-B and two MPP-A Synechococcus podoviruses encode the thymidylate synthase gene thyX, while at the same genome locus all the MPP-B Prochlorococcus podoviruses encode the transaldolase gene talC. Both genes are hypothesized to have the potential to facilitate the biosynthesis of deoxynucleotides for phage replication. Inheritance of specific functional genes could be important to the evolution and ecological fitness of certain cyanophage genotypes. Our analyses demonstrate that cyanopodoviruses of estuarine and oceanic origins share a conserved core genome and suggest that accessory genes may be related to environmental adaptation. PMID:26569403
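
    The pan-genome and core-genome counts quoted in this abstract can be illustrated with a minimal sketch. It assumes each genome has already been reduced to a set of orthologous-group identifiers; the genome names, group labels and the function name pan_and_core are hypothetical and are not taken from the study.

```python
def pan_and_core(genome_to_groups):
    """Return (pan, core) sets of orthologous-group IDs.

    pan  = groups present in at least one genome (set union)
    core = groups present in every genome (set intersection)
    """
    sets = [set(groups) for groups in genome_to_groups.values()]
    if not sets:
        return set(), set()
    pan = set().union(*sets)
    core = set.intersection(*sets)
    return pan, core


# Hypothetical toy input: three genomes, five orthologous groups in total.
toy_genomes = {
    "phage_A": {"cog1", "cog2", "cog3"},
    "phage_B": {"cog1", "cog2", "cog4"},
    "phage_C": {"cog1", "cog2", "cog5"},
}
pan, core = pan_and_core(toy_genomes)
```

    In this toy input the pan-genome holds five groups and the core holds two. How the orthologous groups were actually clustered in the study is not described in the abstract and is outside this sketch.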

  5. Genome Evolution in the Eremothecium Clade of the Saccharomyces Complex Revealed by Comparative Genomics

    PubMed Central

    Wendland, Jürgen; Walther, Andrea

    2011-01-01

    We used comparative genomics to elucidate the genome evolution within the pre–whole-genome duplication genus Eremothecium. To this end, we sequenced and assembled the complete genome of Eremothecium cymbalariae, a filamentous ascomycete representing the Eremothecium type strain. Genome annotation indicated 4712 gene models and 143 tRNAs. We compared the E. cymbalariae genome with that of its relative, the riboflavin overproducer Ashbya (Eremothecium) gossypii, and the reconstructed yeast ancestor. Decisive changes in the Eremothecium lineage leading to the evolution of the A. gossypii genome include the reduction from eight to seven chromosomes, the downsizing of the genome by removal of 10% or 900 kb of DNA, mostly in intergenic regions, the loss of a TY3-Gypsy–type transposable element, the re-arrangement of mating-type loci, and a massive increase of its GC content. Key species-specific events are the loss of MNN1-family of mannosyltransferases required to add the terminal fourth and fifth α-1,3-linked mannose residue to O-linked glycans and genes of the Ehrlich pathway in E. cymbalariae and the loss of ZMM-family of meiosis-specific proteins and acquisition of riboflavin overproduction in A. gossypii. This reveals that within the Saccharomyces complex genome, evolution is not only based on genome duplication with subsequent gene deletions and chromosomal rearrangements but also on fungi associated with specific environments (e.g. involving fungal-insect interactions as in Eremothecium), which have encountered challenges that may be reflected both in genome streamlining and their biosynthetic potential. PMID:22384365

  6. Cytogenetic analysis from DNA by comparative genomic hybridization.

    PubMed

    Tachdjian, G; Aboura, A; Lapierre, J M; Viguié, F

    2000-01-01

    Comparative genomic hybridization (CGH) is a modified in situ hybridization technique which allows detection and mapping of DNA sequence copy differences between two genomes in a single experiment. In CGH analysis, two differentially labelled genomic DNA (study and reference) are co-hybridized to normal metaphase spreads. Chromosomal locations of copy number changes in the DNA segments of the study genome are revealed by a variable fluorescence intensity ratio along each target chromosome. Since its development, CGH has been applied mostly as a research tool in the field of cancer cytogenetics to identify genetic changes in many previously unknown regions. CGH may also have a role in clinical cytogenetics for detection and identification of unbalanced chromosomal abnormalities.
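
    The fluorescence-ratio idea described above can be sketched as follows. This is only an illustration: the intensity values, the function name classify_cgh and the gain/loss thresholds of 1.25 and 0.8 are assumptions chosen for the example, not values stated in the abstract.

```python
def classify_cgh(study, reference, gain=1.25, loss=0.8):
    """Label each chromosomal position from the study/reference intensity ratio.

    study, reference: equal-length sequences of positive fluorescence intensities
    measured at the same positions along one target chromosome.
    gain, loss: hypothetical ratio thresholds for calling a copy-number change.
    """
    if len(study) != len(reference):
        raise ValueError("intensity profiles must have the same length")
    calls = []
    for s, r in zip(study, reference):
        if r <= 0:
            raise ValueError("reference intensity must be positive")
        ratio = s / r
        if ratio >= gain:
            calls.append("gain")
        elif ratio <= loss:
            calls.append("loss")
        else:
            calls.append("balanced")
    return calls


# Hypothetical intensities at four positions along one chromosome.
calls = classify_cgh([1.0, 1.5, 0.5, 1.1], [1.0, 1.0, 1.0, 1.0])
```

    Real CGH analysis also involves normalization and averaging over several metaphase spreads, which this sketch leaves out.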

  7. Phytozome: a Tool for Green Plant Comparative Genomics

    DOE Data Explorer

    Phytozome is a joint project of the Department of Energy's Joint Genome Institute and the Center for Integrative Genomics to facilitate comparative genomic studies amongst green plants. Clusters of orthologous and paralogous genes that represent the modern descendants of ancestral gene sets are constructed at key phylogenetic nodes. These clusters allow easy access to clade-specific orthology/paralogy relationships as well as clade-specific genes and gene expansions. As of release v4.0, Phytozome provides access to nine sequenced and annotated green plant genomes, eight of which have been clustered into gene families at six evolutionarily significant nodes. Where possible, each gene has been annotated with PFAM, KOG, KEGG, and PANTHER assignments, and publicly available annotations from RefSeq, UniProt, TAIR, and JGI are hyperlinked and searchable. [Copied from the Overview at http://www.phytozome.net/Phytozome_info.php]

  8. Assigning protein functions by comparative genome analysis protein phylogenetic profiles

    DOEpatents

    Pellegrini, Matteo; Marcotte, Edward M.; Thompson, Michael J.; Eisenberg, David; Grothe, Robert; Yeates, Todd O.

    2003-05-13

    A computational method, system, and computer program are provided for inferring functional links from genome sequences. One method is based on the observation that some pairs of proteins A' and B' have homologs in another organism fused into a single protein chain AB. A trans-genome comparison of sequences can reveal these AB sequences, which are Rosetta Stone sequences because they decipher an interaction between A' and B'. Another method compares the genomic sequences of two or more organisms to create a phylogenetic profile for each protein, indicating its presence or absence across all the genomes. The profile provides information regarding functional links between different families of proteins. In yet another method, a combination of the above two methods is used to predict functional links.
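
    The phylogenetic-profile method described above can be sketched in a few lines. This is a minimal illustration, assuming presence/absence calls are already available for each protein; the protein and genome names, the function names and the exact-match linking rule (max_mismatches=0) are hypothetical and are not taken from the patent.

```python
from itertools import combinations


def phylogenetic_profiles(presence, genomes):
    """Turn {protein: set of genomes containing a homolog} into 0/1 tuples.

    Each tuple has one entry per genome, in the fixed order given by `genomes`.
    """
    return {
        protein: tuple(1 if g in found else 0 for g in genomes)
        for protein, found in presence.items()
    }


def linked_pairs(profiles, max_mismatches=0):
    """Return protein pairs whose profiles differ in at most max_mismatches genomes."""
    pairs = []
    for a, b in combinations(sorted(profiles), 2):
        mismatches = sum(x != y for x, y in zip(profiles[a], profiles[b]))
        if mismatches <= max_mismatches:
            pairs.append((a, b))
    return pairs


# Hypothetical presence/absence calls across four genomes.
genomes = ["G1", "G2", "G3", "G4"]
presence = {
    "P1": {"G1", "G3"},
    "P2": {"G1", "G3"},
    "P3": {"G2", "G4"},
    "P4": {"G1", "G2", "G3", "G4"},
}
profiles = phylogenetic_profiles(presence, genomes)
links = linked_pairs(profiles)
```

    Here P1 and P2 share the same profile and would be proposed as functionally linked. A protein present in every genome, such as P4, matches only other ubiquitous proteins and so carries little discriminating signal.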

  9. Comparative Bacterial Proteomics: Analysis of the Core Genome Concept

    PubMed Central

    Callister, Stephen J.; McCue, Lee Ann; Turse, Joshua E.; Monroe, Matthew E.; Auberry, Kenneth J.; Smith, Richard D.; Adkins, Joshua N.; Lipton, Mary S.

    2008-01-01

    While comparative bacterial genomic studies commonly predict a set of genes indicative of common ancestry, experimental validation of the existence of this core genome requires extensive measurement and is typically not undertaken. Enabled by an extensive proteome database developed over six years, we have experimentally verified the expression of proteins predicted from genomic ortholog comparisons among 17 environmental and pathogenic bacteria. More exclusive relationships were observed among the expressed protein content of phenotypically related bacteria, which is indicative of the specific lifestyles associated with these organisms. Although genomic studies can establish relative orthologous relationships among a set of bacteria and propose a set of ancestral genes, our proteomics study establishes expressed lifestyle differences among conserved genes and proposes a set of expressed ancestral traits. PMID:18253490

  10. Comparative genomics of Brachyspira pilosicoli strains: genome rearrangements, reductions and correlation of genetic compliment with phenotypic diversity

    PubMed Central

    2012-01-01

    Background: The anaerobic spirochaete Brachyspira pilosicoli causes enteric disease in avian, porcine and human hosts, amongst others. To date, the only available genome sequence of B. pilosicoli is that of strain 95/1000, a porcine isolate. In the first intra-species genome comparison within the Brachyspira genus, we report the whole genome sequence of B. pilosicoli B2904, an avian isolate, the incomplete genome sequence of B. pilosicoli WesB, a human isolate, and the comparisons with B. pilosicoli 95/1000. We also draw on incomplete genome sequences from three other Brachyspira species. Finally we report the first application of the high-throughput Biolog phenotype screening tool on the B. pilosicoli strains for detailed comparisons between genotype and phenotype. Results: Feature and sequence genome comparisons revealed a high degree of similarity between the three B. pilosicoli strains, although the genomes of B2904 and WesB were larger than that of 95/1000 (~2.765, 2.890 and 2.596 Mb, respectively). Genome rearrangements were observed which correlated largely with the positions of mobile genetic elements. Through comparison of the B2904 and WesB genomes with the 95/1000 genome, features that we propose are non-essential due to their absence from 95/1000 include a peptidase, glycine reductase complex components and transposases. Novel bacteriophages were detected in the newly-sequenced genomes, which appeared to have involvement in intra- and inter-species horizontal gene transfer. Phenotypic differences predicted from genome analysis, such as the lack of genes for glucuronate catabolism in 95/1000, were confirmed by phenotyping. Conclusions: The availability of multiple B. pilosicoli genome sequences has allowed us to demonstrate the substantial genomic variation that exists between these strains, and provides an insight into genetic events that are shaping the species. In addition, phenotype screening allowed determination of how genotypic differences translated

  11. Genome Sequences of Two Tunisian Field Strains of Avian Mycoplasma, M. meleagridis and M. gallinarum

    PubMed Central

    Yacoub, Elhem; Sirand-Pugnet, Pascal; Barré, Aurélien; Blanchard, Alain; Hubert, Christophe; Maurier, Florence; Bouilhol, Emmanuel

    2016-01-01

    Mycoplasma meleagridis and Mycoplasma gallinarum are bacteria that affect birds, but little is known about the genetic basis of their interaction with chickens and other poultry. Here, we sequenced the genomes of M. meleagridis strain MM_26B8_IPT and M. gallinarum strain Mgn_IPT, both isolated from chickens showing respiratory symptoms, poor growth, reduction in hatchability, and loss of production. PMID:27313300

  12. Genome Sequences of Two Tunisian Field Strains of Avian Mycoplasma, M. meleagridis and M. gallinarum.

    PubMed

    Yacoub, Elhem; Sirand-Pugnet, Pascal; Barré, Aurélien; Blanchard, Alain; Hubert, Christophe; Maurier, Florence; Bouilhol, Emmanuel; Ben Abdelmoumen Mardassi, Boutheina

    2016-01-01

    Mycoplasma meleagridis and Mycoplasma gallinarum are bacteria that affect birds, but little is known about the genetic basis of their interaction with chickens and other poultry. Here, we sequenced the genomes of M. meleagridis strain MM_26B8_IPT and M. gallinarum strain Mgn_IPT, both isolated from chickens showing respiratory symptoms, poor growth, reduction in hatchability, and loss of production. PMID:27313300

  13. Complete genome sequence of a natural reassortant H9N2 avian influenza virus found in bean goose (Anser fabalis): direct evidence for virus exchange between Korea and China via wild birds.

    PubMed

    Lee, Dong-Hun; Park, Jae-Keun; Yuk, Seong-Su; Erdene-Ochir, Tseren-Ochir; Kwon, Jung-Hoon; Lee, Joong-Bok; Park, Seung-Yong; Choi, In-Soo; Lee, Sang-Won; Song, Chang-Seon

    2014-08-01

    In 2011, we isolated a natural recombinant H9N2 avian influenza virus from fecal droppings of bean goose (Anser fabalis) in Korea. Phylogenetic analyses showed that the A/bean goose/Korea/220/2011(H9N2) isolate is a reassortant of Eurasian and North American lineages of avian influenza virus. In addition, the complete genome sequence, including all 8 gene segments, was associated with Chinese H9N2 viruses isolated from wild birds in the Hunan East Dongting Lake National Nature Reserve. These data provide direct evidence for the exchange of avian influenza viruses between Korea and China via wild birds.

  14. Comparative genomics and proteomics of 13 Porphyromonas gingivalis strains.

    PubMed

    Chen, Tsute; Siddiqui, Huma; Olsen, Ingar

    2015-01-01

    At the current time, genome sequences of a total of 13 Porphyromonas gingivalis strains are available, including five completed genomes (strains ATCC 33277, HG66, TDC60, JCVISC001, and W83) and eight high-coverage draft sequences (F0185, F0566, F0568, F0569, F0570, SJD2, W4087, and W50) that are assembled into fewer than 300 contigs. This study compared these genomes at both nucleotide and protein sequence levels in order to understand their phylogenetic and functional relatedness. There are four copies of 16S rRNA gene sequences in each of the strains of ATCC 33277, HG66, TDC60, and W83 and one copy in the other nine genomes. These 25 16S rRNA sequences represent only 13 unique sequences. The five copies in W83 and W50 are identical and the three copies in HG66 are identical to the four copies in ATCC 33277, suggesting close evolutionary lineage between W83 and W50, as well as HG66 and ATCC 33277. Genome-wide comparison based on "Rapid Annotation using Subsystem Technology" (RAST) also showed that for the overall biological functions of the genomes, W83 is closer to W50, and HG66 to ATCC 33277, than to other genomes. The comparison of the RAST subsystems identified biological functions that are unique to individual, shared by some, or by all genomes. Functions unique to individual genomes include: a tetracycline resistance protein TetQ, DNA metabolism gene YcfH, and DNA repair gene exonuclease SbcC (only in SJD2); very-short-patch mismatch repair endonuclease and a phage packaging terminase similar to Bacteroides phage B124-14 (in W4087); an internalin similar to a Listeria surface virulence protein (W83); a Type I restriction-modification system (F0569); an iron acquisition/heme transport protein (F0566); colicin I receptor and carbamoylputrescine amidase (W50); L-serine dehydratase (TDC60); and spermidine synthase and ribokinase (JCVISC001). The results also identified biological functions that are missing in individual or several genomes. For example, JCVISC001

  15. Initial sequence and comparative analysis of the cat genome

    PubMed Central

    Pontius, Joan U.; Mullikin, James C.; Smith, Douglas R.; Lindblad-Toh, Kerstin; Gnerre, Sante; Clamp, Michele; Chang, Jean; Stephens, Robert; Neelam, Beena; Volfovsky, Natalia; Schäffer, Alejandro A.; Agarwala, Richa; Narfström, Kristina; Murphy, William J.; Giger, Urs; Roca, Alfred L.; Antunes, Agostinho; Menotti-Raymond, Marilyn; Yuhki, Naoya; Pecon-Slattery, Jill; Johnson, Warren E.; Bourque, Guillaume; Tesler, Glenn; O’Brien, Stephen J.

    2007-01-01

    The genome sequence (1.9-fold coverage) of an inbred Abyssinian domestic cat was assembled, mapped, and annotated with a comparative approach that involved cross-reference to annotated genome assemblies of six mammals (human, chimpanzee, mouse, rat, dog, and cow). The results resolved chromosomal positions for 663,480 contigs, 20,285 putative feline gene orthologs, and 133,499 conserved sequence blocks (CSBs). Additional annotated features include repetitive elements, endogenous retroviral sequences, nuclear mitochondrial (numt) sequences, micro-RNAs, and evolutionary breakpoints that suggest historic balancing of translocation and inversion incidences in distinct mammalian lineages. Large numbers of single nucleotide polymorphisms (SNPs), deletion insertion polymorphisms (DIPs), and short tandem repeats (STRs), suitable for linkage or association studies were characterized in the context of long stretches of chromosome homozygosity. In spite of the light coverage capturing ∼65% of euchromatin sequence from the cat genome, these comparative insights shed new light on the tempo and mode of gene/genome evolution in mammals, promise several research applications for the cat, and also illustrate that a comparative approach using more deeply covered mammals provides an informative, preliminary annotation of a light (1.9-fold) coverage mammal genome sequence. PMID:17975172

  17. The tiger genome and comparative analysis with lion and snow leopard genomes

    PubMed Central

    Cho, Yun Sung; Hu, Li; Hou, Haolong; Lee, Hang; Xu, Jiaohui; Kwon, Soowhan; Oh, Sukhun; Kim, Hak-Min; Jho, Sungwoong; Kim, Sangsoo; Shin, Young-Ah; Kim, Byung Chul; Kim, Hyunmin; Kim, Chang-uk; Luo, Shu-Jin; Johnson, Warren E.; Koepfli, Klaus-Peter; Schmidt-Küntzel, Anne; Turner, Jason A.; Marker, Laurie; Harper, Cindy; Miller, Susan M.; Jacobs, Wilhelm; Bertola, Laura D.; Kim, Tae Hyung; Lee, Sunghoon; Zhou, Qian; Jung, Hyun-Ju; Xu, Xiao; Gadhvi, Priyvrat; Xu, Pengwei; Xiong, Yingqi; Luo, Yadan; Pan, Shengkai; Gou, Caiyun; Chu, Xiuhui; Zhang, Jilin; Liu, Sanyang; He, Jing; Chen, Ying; Yang, Linfeng; Yang, Yulan; He, Jiaju; Liu, Sha; Wang, Junyi; Kim, Chul Hong; Kwak, Hwanjong; Kim, Jong-Soo; Hwang, Seungwoo; Ko, Junsu; Kim, Chang-Bae; Kim, Sangtae; Bayarlkhagva, Damdin; Paek, Woon Kee; Kim, Seong-Jin; O’Brien, Stephen J.; Wang, Jun; Bhak, Jong

    2013-01-01

    Tigers and their close relatives (Panthera) are some of the world’s most endangered species. Here we report the de novo assembly of an Amur tiger whole-genome sequence as well as the genomic sequences of a white Bengal tiger, African lion, white African lion and snow leopard. Through comparative genetic analyses of these genomes, we find genetic signatures that may reflect molecular adaptations consistent with the big cats’ hypercarnivorous diet and muscle strength. We report a snow leopard-specific genetic determinant in EGLN1 (Met39>Lys39), which is likely to be associated with adaptation to high altitude. We also detect a TYR260G>A mutation likely responsible for the white lion coat colour. Tiger and cat genomes show similar repeat composition and an appreciably conserved synteny. Genomic data from the five big cats provide an invaluable resource for resolving easily identifiable phenotypes evident in very close, but distinct, species. PMID:24045858

  18. The mitochondrial genome of Grateloupia taiwanensis (Halymeniaceae, Rhodophyta) and comparative mitochondrial genomics of red algae.

    PubMed

    DePriest, Michael S; Bhattacharya, Debashish; López-Bautista, Juan M

    2014-10-01

    Although red algae are economically highly valuable for their gelatinous cell wall compounds as well as being integral parts of marine benthic habitats, very little genome data are currently available. We present mitochondrial genome sequence data from the red alga Grateloupia taiwanensis S.-M. Lin & H.-Y. Liang. Comprising 28,906 nucleotide positions, the mitochondrial genome contig contains 25 protein-coding genes and 24 transfer RNA genes. It is highly similar to other red algal genomes in gene content as well as overall structure. An intron in the cox1 gene was found to be shared by G. taiwanensis and Grateloupia angusta (Okamura) S. Kawaguchi & H. W. Wang. We also used whole-genome alignments to compare G. taiwanensis to different groups of red algae, and these results are consistent with the currently accepted phylogeny of Rhodophyta.

  19. The tiger genome and comparative analysis with lion and snow leopard genomes.

    PubMed

    Cho, Yun Sung; Hu, Li; Hou, Haolong; Lee, Hang; Xu, Jiaohui; Kwon, Soowhan; Oh, Sukhun; Kim, Hak-Min; Jho, Sungwoong; Kim, Sangsoo; Shin, Young-Ah; Kim, Byung Chul; Kim, Hyunmin; Kim, Chang-Uk; Luo, Shu-Jin; Johnson, Warren E; Koepfli, Klaus-Peter; Schmidt-Küntzel, Anne; Turner, Jason A; Marker, Laurie; Harper, Cindy; Miller, Susan M; Jacobs, Wilhelm; Bertola, Laura D; Kim, Tae Hyung; Lee, Sunghoon; Zhou, Qian; Jung, Hyun-Ju; Xu, Xiao; Gadhvi, Priyvrat; Xu, Pengwei; Xiong, Yingqi; Luo, Yadan; Pan, Shengkai; Gou, Caiyun; Chu, Xiuhui; Zhang, Jilin; Liu, Sanyang; He, Jing; Chen, Ying; Yang, Linfeng; Yang, Yulan; He, Jiaju; Liu, Sha; Wang, Junyi; Kim, Chul Hong; Kwak, Hwanjong; Kim, Jong-Soo; Hwang, Seungwoo; Ko, Junsu; Kim, Chang-Bae; Kim, Sangtae; Bayarlkhagva, Damdin; Paek, Woon Kee; Kim, Seong-Jin; O'Brien, Stephen J; Wang, Jun; Bhak, Jong

    2013-01-01

    Tigers and their close relatives (Panthera) are some of the world's most endangered species. Here we report the de novo assembly of an Amur tiger whole-genome sequence as well as the genomic sequences of a white Bengal tiger, African lion, white African lion and snow leopard. Through comparative genetic analyses of these genomes, we find genetic signatures that may reflect molecular adaptations consistent with the big cats' hypercarnivorous diet and muscle strength. We report a snow leopard-specific genetic determinant in EGLN1 (Met39>Lys39), which is likely to be associated with adaptation to high altitude. We also detect a TYR260G>A mutation likely responsible for the white lion coat colour. Tiger and cat genomes show similar repeat composition and an appreciably conserved synteny. Genomic data from the five big cats provide an invaluable resource for resolving easily identifiable phenotypes evident in very close, but distinct, species. PMID:24045858

  20. Comparing Competitive Fitness of West Nile Virus Strains in Avian and Mosquito Hosts

    PubMed Central

    Worwa, Gabriella; Wheeler, Sarah S.; Brault, Aaron C.; Reisen, William K.

    2015-01-01

    Enzootic transmission of West Nile virus (WNV; Flaviviridae, Flavivirus) involves various species of birds and ornithophilic mosquitoes. Single nucleotide substitutions in the WNV genome may impact viral fitness necessary for WNV adaptation and evolution as previously shown for the WN02 genotype. In an effort to study phenotypic change, we developed an in vivo fitness competition model in two biologically relevant hosts for WNV. The House Finch (HOFI; Haemorhous mexicanus) and Culex tarsalis mosquitoes represent moderately susceptible hosts for WNV, are highly abundant in Western North America and are frequently infected with WNV in nature. Herein, we inoculated HOFIs and Cx. tarsalis competitively (dually) and singly with infectious-clone derived viruses of the founding California isolate COAV997-2003 (COAV997-IC), the founding North American isolate NY99 (NY99-IC), and a 2004 field isolate from California (CA-04), and compared the replicative capacities (fitness) of these viruses to a genetically marked virus of COAV997 (COAV997-5nt) by measuring RNA copy numbers. COAV997 and COAV997-5nt exhibited neutral fitness in HOFIs and Cx. tarsalis, and the temperature-sensitive phenotype of COAV997 did not affect replication in HOFIs as none of the infected birds became febrile. The NY99 and CA-04 isolates demonstrated elevated fitness in HOFIs compared to COAV997-5nt, whereas all viruses replicated to similar titers and RNA copies in Cx. tarsalis, and the only fitness differences were related to infection rates. Our data demonstrated that competitive replication allows for the sensitive comparison of fitness differences between two genetically closely related viruses using relevant hosts of WNV while eliminating host-to-host differences. In conclusion, our approach may be helpful in understanding the extent of phenotypic change in fitness associated with genetic changes in WNV. PMID:25965850

  1. Sequencing and comparative analyses of the genomes of zoysiagrasses

    PubMed Central

    Tanaka, Hidenori; Hirakawa, Hideki; Kosugi, Shunichi; Nakayama, Shinobu; Ono, Akiko; Watanabe, Akiko; Hashiguchi, Masatsugu; Gondo, Takahiro; Ishigaki, Genki; Muguerza, Melody; Shimizu, Katsuya; Sawamura, Noriko; Inoue, Takayasu; Shigeki, Yuichi; Ohno, Naoki; Tabata, Satoshi; Akashi, Ryo; Sato, Shusei

    2016-01-01

    Zoysia is a warm-season turfgrass, which comprises 11 allotetraploid species (2n = 4x = 40), each possessing different morphological and physiological traits. To characterize the genetic systems of Zoysia plants and to analyse their structural and functional differences in individual species and accessions, we sequenced the genomes of Zoysia species using HiSeq and MiSeq platforms. As a reference sequence of Zoysia species, we generated a high-quality draft sequence of the genome of Z. japonica accession ‘Nagirizaki’ (334 Mb) in which 59,271 protein-coding genes were predicted. In parallel, draft genome sequences of Z. matrella ‘Wakaba’ and Z. pacifica ‘Zanpa’ were also generated for comparative analyses. To investigate the genetic diversity among the Zoysia species, genome sequence reads of three additional accessions, Z. japonica ‘Kyoto’, Z. japonica ‘Miyagi’ and Z. matrella ‘Chiba Fair Green’, were accumulated, and aligned against the reference genome of ‘Nagirizaki’ along with those from ‘Wakaba’ and ‘Zanpa’. As a result, we detected 7,424,163 single-nucleotide polymorphisms and 852,488 short indels among these species. The information obtained in this study will be valuable for basic studies on zoysiagrass evolution and genetics as well as for the breeding of zoysiagrasses, and is made available in the ‘Zoysia Genome Database’ at http://zoysia.kazusa.or.jp. PMID:26975196

  2. Sequencing and comparative analyses of the genomes of zoysiagrasses.

    PubMed

    Tanaka, Hidenori; Hirakawa, Hideki; Kosugi, Shunichi; Nakayama, Shinobu; Ono, Akiko; Watanabe, Akiko; Hashiguchi, Masatsugu; Gondo, Takahiro; Ishigaki, Genki; Muguerza, Melody; Shimizu, Katsuya; Sawamura, Noriko; Inoue, Takayasu; Shigeki, Yuichi; Ohno, Naoki; Tabata, Satoshi; Akashi, Ryo; Sato, Shusei

    2016-04-01

    Zoysia is a warm-season turfgrass, which comprises 11 allotetraploid species (2n = 4x = 40), each possessing different morphological and physiological traits. To characterize the genetic systems of Zoysia plants and to analyse their structural and functional differences in individual species and accessions, we sequenced the genomes of Zoysia species using HiSeq and MiSeq platforms. As a reference sequence of Zoysia species, we generated a high-quality draft sequence of the genome of Z. japonica accession 'Nagirizaki' (334 Mb) in which 59,271 protein-coding genes were predicted. In parallel, draft genome sequences of Z. matrella 'Wakaba' and Z. pacifica 'Zanpa' were also generated for comparative analyses. To investigate the genetic diversity among the Zoysia species, genome sequence reads of three additional accessions, Z. japonica 'Kyoto', Z. japonica 'Miyagi' and Z. matrella 'Chiba Fair Green', were accumulated, and aligned against the reference genome of 'Nagirizaki' along with those from 'Wakaba' and 'Zanpa'. As a result, we detected 7,424,163 single-nucleotide polymorphisms and 852,488 short indels among these species. The information obtained in this study will be valuable for basic studies on zoysiagrass evolution and genetics as well as for the breeding of zoysiagrasses, and is made available in the 'Zoysia Genome Database' at http://zoysia.kazusa.or.jp.

  3. Comparative analysis of rosaceous genomes and the reconstruction of a putative ancestral genome for the family

    PubMed Central

    2011-01-01

    Background Comparative genome mapping studies in Rosaceae have been conducted until now by aligning genetic maps within the same genus, or closely related genera and using a limited number of common markers. The growing body of genomics resources and sequence data for both Prunus and Fragaria permits detailed comparisons between these genera and the recently released Malus × domestica genome sequence. Results We generated a comparative analysis using 806 molecular markers that are anchored genetically to the Prunus and/or Fragaria reference maps, and physically to the Malus genome sequence. Markers in common for Malus and Prunus, and Malus and Fragaria, respectively were 784 and 148. The correspondence between marker positions was high and conserved syntenic blocks were identified among the three genera in the Rosaceae. We reconstructed a proposed ancestral genome for the Rosaceae. Conclusions A genome containing nine chromosomes is the most likely candidate for the ancestral Rosaceae progenitor. The number of chromosomal translocations observed between the three genera investigated was low. However, the number of inversions identified among Malus and Prunus was much higher than any reported genome comparisons in plants, suggesting that small inversions have played an important role in the evolution of these two genera or of the Rosaceae. PMID:21226921

  4. Using comparative genomics to drive new discoveries in microbiology.

    PubMed

    Haft, Daniel H

    2015-02-01

    Bioinformatics looks to many microbiologists like a service industry. In this view, annotation starts with what is known from experiments in the lab, makes reasonable inferences of which genes match other genes in function, builds databases to make all that we know accessible, but creates nothing truly new. Experiments lead, then biocuration and computational biology follow. But the astounding success of genome sequencing is changing the annotation paradigm. Every genome sequenced is an intercepted coded message from the microbial world, and as all cryptographers know, it is easier to decode a thousand messages than a single message. Some biology is best discovered not by phenomenology, but by decoding genome content, forming hypotheses, and doing the first few rounds of validation computationally. Through such reasoning, a role and function may be assigned to a protein with no sequence similarity to any protein yet studied. Experimentation can follow after the discovery to cement and to extend the findings. Unfortunately, this approach remains so unfamiliar to most bench scientists that lab work and comparative genomics typically segregate to different teams working on unconnected projects. This review will discuss several themes in comparative genomics as a discovery method, including highly derived data, use of patterns of design to reason by analogy, and in silico testing of computationally generated hypotheses.

  5. Dcode.org anthology of comparative genomic tools.

    PubMed

    Loots, Gabriela G; Ovcharenko, Ivan

    2005-07-01

    Comparative genomics provides the means to demarcate functional regions in anonymous DNA sequences. The successful application of this method to identifying novel genes is currently shifting to deciphering the non-coding encryption of gene regulation across genomes. To facilitate the practical application of comparative sequence analysis to genetics and genomics, we have developed several analytical and visualization tools for the analysis of arbitrary sequences and whole genomes. These tools include two alignment tools, zPicture and Mulan; a phylogenetic shadowing tool, eShadow, for identifying lineage- and species-specific functional elements; two evolutionarily conserved transcription factor analysis tools, rVista and multiTF; a tool for extracting cis-regulatory modules governing the expression of co-regulated genes, Creme 2.0; and a dynamic portal to multiple vertebrate and invertebrate genome alignments, the ECR Browser. Here, we briefly describe each of these tools and provide specific examples of their practical application. All the tools are publicly available at the http://www.dcode.org/ website.

  6. CFGP: a web-based, comparative fungal genomics platform.

    PubMed

    Park, Jongsun; Park, Bongsoo; Jung, Kyongyong; Jang, Suwang; Yu, Kwangyul; Choi, Jaeyoung; Kong, Sunghyung; Park, Jaejin; Kim, Seryun; Kim, Hyojeong; Kim, Soonok; Kim, Jihyun F; Blair, Jaime E; Lee, Kwangwon; Kang, Seogchan; Lee, Yong-Hwan

    2008-01-01

    Since the completion of the Saccharomyces cerevisiae genome sequencing project in 1996, the genomes of over 80 fungal species have been sequenced or are currently being sequenced. Resulting data provide opportunities for studying and comparing fungal biology and evolution at the genome level. To support such studies, the Comparative Fungal Genomics Platform (CFGP; http://cfgp.snu.ac.kr), a web-based multifunctional informatics workbench, was developed. The CFGP comprises three layers, including the basal layer, middleware and the user interface. The data warehouse in the basal layer contains standardized genome sequences of 65 fungal species. The middleware processes queries via six analysis tools, including BLAST, ClustalW, InterProScan, SignalP 3.0, PSORT II and a newly developed tool named BLASTMatrix. The BLASTMatrix permits the identification and visualization of genes homologous to a query across multiple species. The Data-driven User Interface (DUI) of the CFGP was built on a new concept of pre-collecting data and post-executing analysis instead of the 'fill-in-the-form-and-press-SUBMIT' user interfaces utilized by most bioinformatics sites. A tool termed Favorite, which supports the management of encapsulated sequence data and provides a personalized data repository to users, is another novel feature in the DUI.

  7. Comparative cryopreservation of avian spermatozoa: effects of freezing and thawing rates on turkey and sandhill crane sperm cryosurvival.

    PubMed

    Blanco, Juan M; Long, Julie A; Gee, George; Wildt, David E; Donoghue, Ann M

    2012-03-01

    A comparative approach was used to evaluate the effects of semen cooling rates, thawing rates and freezing volume on the cryosurvival of avian sperm. Turkey (Meleagris gallopavo) and sandhill crane (Grus canadensis) sperm were cryopreserved with dimethylacetamide (DMA) concentrations ranging from 6% to 26%. Experiments evaluated the efficacy of (1) rapid, moderate and slow cooling rates, (2) rapid and slow thawing rates, and (3) final volume of semen frozen (0.2 mL compared to 0.5 mL). For crane sperm only, additional experiments were conducted to evaluate the effect of sucrose on cryosurvival. The functionality of frozen/thawed crane sperm was evaluated by fertility trials. For all studies, sperm viability was assessed using the nigrosin-eosin stain. Higher percentages of crane and turkey sperm maintained intact membranes when frozen with moderate or slow cooling rates compared to rapid cooling rates (P<0.05), regardless of DMA concentration. Turkey sperm viability was not affected by thawing rate at any DMA concentration (P>0.05). Crane sperm viability was only affected by thawing rate for the 24% DMA treatment, where moderate thawing was better than slow thawing (P<0.05). Sperm viability was not affected by the semen volume used for freezing for either species (P>0.05). The percentage of membrane-intact crane sperm at lower DMA concentrations was improved by addition of 0.1 M sucrose (P<0.05) but not 0.29 M NaCl. The mean fertility rate from frozen/thawed crane semen was 57.5%, and 71.4% of the fertile eggs hatched. The viability of crane sperm was always greater than that of turkey sperm, regardless of cooling rate, thawing rate or volume of semen frozen. These data verify avian-specific differences in sperm cryosurvival and further emphasize the need for species-specific studies to optimize cryopreservation protocols.

  8. The Genome Sequence of Avian Pathogenic Escherichia coli Strain O1:K1:H7 Shares Strong Similarities with Human Extraintestinal Pathogenic E. coli Genomes

    PubMed Central

    Johnson, Timothy J.; Kariyawasam, Subhashinie; Wannemuehler, Yvonne; Mangiamele, Paul; Johnson, Sara J.; Doetkott, Curt; Skyberg, Jerod A.; Lynne, Aaron M.; Johnson, James R.; Nolan, Lisa K.

    2007-01-01

    Escherichia coli strains that cause disease outside the intestine are known as extraintestinal pathogenic E. coli (ExPEC) and include human uropathogenic E. coli (UPEC) and avian pathogenic E. coli (APEC). Regardless of host of origin, ExPEC strains share many traits. It has been suggested that these commonalities may enable APEC to cause disease in humans. Here, we begin to test the hypothesis that certain APEC strains possess potential to cause human urinary tract infection through virulence genotyping of 1,000 APEC and UPEC strains, generation of the first complete genomic sequence of an APEC (APEC O1:K1:H7) strain, and comparison of this genome to all available human ExPEC genomic sequences. The genomes of APEC O1 and three human UPEC strains were found to be remarkably similar, with only 4.5% of APEC O1's genome not found in other sequenced ExPEC genomes. Also, use of multilocus sequence typing showed that some of the sequenced human ExPEC strains were more like APEC O1 than other human ExPEC strains. This work provides evidence that at least some human and avian ExPEC strains are highly similar to one another, and it supports the possibility that a food-borne link between some APEC and UPEC strains exists. Future studies are necessary to assess the ability of APEC to overcome the hurdles necessary for such a food-borne transmission, and epidemiological studies are required to confirm that such a phenomenon actually occurs. PMID:17293413

  9. Comparative genomics of wild type yeast strains unveils important genome diversity

    PubMed Central

    Carreto, Laura; Eiriz, Maria F; Gomes, Ana C; Pereira, Patrícia M; Schuller, Dorit; Santos, Manuel AS

    2008-01-01

    Background Genome variability generates phenotypic heterogeneity and is of relevance for adaptation to environmental change, but the extent of such variability in natural populations is still poorly understood. For example, selected Saccharomyces cerevisiae strains are variable at the ploidy level, have gene amplifications, changes in chromosome copy number, and gross chromosomal rearrangements. This suggests that genome plasticity provides important genetic diversity upon which natural selection mechanisms can operate. Results In this study, we have used wild-type S. cerevisiae (yeast) strains to investigate genome variation in natural and artificial environments. We have used comparative genome hybridization on array (aCGH) to characterize the genome variability of 16 yeast strains, of laboratory and commercial origin, isolated from vineyards and wine cellars, and from opportunistic human infections. Interestingly, sub-telomeric instability was associated with the clinical phenotype, while Ty element insertion regions determined genomic differences of natural wine fermentation strains. Copy number depletion of ASP3 and YRF1 genes was found in all wild-type strains. Other gene families involved in transmembrane transport, sugar and alcohol metabolism or drug resistance had copy number changes, which also distinguished wine from clinical isolates. Conclusion We have isolated and genotyped more than 1000 yeast strains from natural environments and carried out an aCGH analysis of 16 strains representative of distinct genotype clusters. Important genomic variability was identified between these strains, in particular in sub-telomeric regions and in Ty-element insertion sites, suggesting that this type of genome variability is the main source of genetic diversity in natural populations of yeast. The data highlights the usefulness of yeast as a model system to unravel intraspecific natural genome diversity and to elucidate how natural selection shapes the yeast genome.

  10. Whole genomic DNA sequencing and comparative genomic analysis of Arthrospira platensis: high genome plasticity and genetic diversity

    PubMed Central

    Xu, Teng; Qin, Song; Hu, Yongwu; Song, Zhijian; Ying, Jianchao; Li, Peizhen; Dong, Wei; Zhao, Fangqing; Yang, Huanming; Bao, Qiyu

    2016-01-01

    Arthrospira platensis is a multi-cellular and filamentous non-N2-fixing cyanobacterium that is capable of performing oxygenic photosynthesis. In this study, we determined the nearly complete genome sequence of A. platensis YZ. The A. platensis YZ genome is a single, circular chromosome of 6.62 Mb in size. Phylogenetic and comparative genomic analyses revealed that A. platensis YZ was more closely related to A. platensis NIES-39 than to Arthrospira sp. PCC 8005 and A. platensis C1. Broad gene gains were identified between A. platensis YZ and three other Arthrospira species, some of which, such as restriction-modification system-coding genes, have previously been shown to be laterally transferable among different species. Moreover, unprecedentedly extensive chromosomal rearrangements among different strains were observed. The chromosomal rearrangements, particularly the chromosomal inversions, were analysed and estimated to be closely related to palindromes that involved long inverted repeat sequences and the extensively distributed type IIR restriction enzyme in the Arthrospira genome. In addition, species from the genus Arthrospira consistently contained the highest rate of repetitive sequence compared with the other species of the order Oscillatoriales, suggesting that sequence duplication significantly contributed to Arthrospira genome phylogeny. These results provide in-depth views into the genomic phylogeny and structural variation of A. platensis, as well as a valuable resource for functional genomics studies. PMID:27330141

  11. Whole genomic DNA sequencing and comparative genomic analysis of Arthrospira platensis: high genome plasticity and genetic diversity.

    PubMed

    Xu, Teng; Qin, Song; Hu, Yongwu; Song, Zhijian; Ying, Jianchao; Li, Peizhen; Dong, Wei; Zhao, Fangqing; Yang, Huanming; Bao, Qiyu

    2016-08-01

    Arthrospira platensis is a multi-cellular and filamentous non-N2-fixing cyanobacterium that is capable of performing oxygenic photosynthesis. In this study, we determined the nearly complete genome sequence of A. platensis YZ. The A. platensis YZ genome is a single, circular chromosome of 6.62 Mb in size. Phylogenetic and comparative genomic analyses revealed that A. platensis YZ was more closely related to A. platensis NIES-39 than to Arthrospira sp. PCC 8005 and A. platensis C1. Broad gene gains were identified between A. platensis YZ and three other Arthrospira species, some of which, such as restriction-modification system-coding genes, have previously been shown to be laterally transferable among different species. Moreover, unprecedentedly extensive chromosomal rearrangements among different strains were observed. The chromosomal rearrangements, particularly the chromosomal inversions, were analysed and estimated to be closely related to palindromes that involved long inverted repeat sequences and the extensively distributed type IIR restriction enzyme in the Arthrospira genome. In addition, species from the genus Arthrospira consistently contained the highest rate of repetitive sequence compared with the other species of the order Oscillatoriales, suggesting that sequence duplication significantly contributed to Arthrospira genome phylogeny. These results provide in-depth views into the genomic phylogeny and structural variation of A. platensis, as well as a valuable resource for functional genomics studies. PMID:27330141

  13. Phylogeny and comparative genome analysis of a Basidiomycete fungi

    SciTech Connect

    Riley, Robert W.; Salamov, Asaf; Grigoriev, Igor; Hibbett, David

    2011-03-14

    Fungi of the phylum Basidiomycota make up some 37 percent of the described fungi and are important from the perspectives of forestry, agriculture, medicine, and bioenergy. This diverse phylum includes the mushrooms, wood rots, plant pathogenic rusts and smuts, and some human pathogens. To better understand these important fungi, we have undertaken a comparative genomic analysis of the Basidiomycetes with available sequenced genomes. We report a phylogeny that sheds light on previously unclear evolutionary relationships among the Basidiomycetes. We also define a 'core proteome' based on protein families conserved in all Basidiomycetes. We identify key expansions and contractions in protein families that may be responsible for the degradation of plant biomass such as cellulose, hemicellulose, and lignin. Finally, we speculate as to the genomic changes that drove such expansions and contractions.

  14. A web server for mining Comparative Genomic Hybridization (CGH) data

    NASA Astrophysics Data System (ADS)

    Liu, Jun; Ranka, Sanjay; Kahveci, Tamer

    2007-11-01

    Advances in cytogenetics and molecular biology have established that chromosomal alterations are critical in the pathogenesis of human cancer. Recurrent chromosomal alterations provide cytological and molecular markers for the diagnosis and prognosis of disease. They also facilitate the identification of genes that are important in carcinogenesis, which in the future may help in the development of targeted therapy. A large and growing amount of cancer genetic data is now publicly available. There is a need for public domain tools that allow users to analyze their data and visualize the results. This chapter describes a web-based software tool that will allow researchers to analyze and visualize Comparative Genomic Hybridization (CGH) datasets. It employs novel data mining methodologies for clustering and classification of CGH datasets as well as algorithms for identifying important markers (small sets of genomic intervals with aberrations) that are potentially cancer signatures. The developed software will help in understanding the relationships between genomic aberrations and cancer types.

  15. Comparative genomics reveals mobile pathogenicity chromosomes in Fusarium

    PubMed Central

    Ma, Li-Jun; van der Does, H. Charlotte; Borkovich, Katherine A.; Coleman, Jeffrey J.; Daboussi, Marie-Josée; Di Pietro, Antonio; Dufresne, Marie; Freitag, Michael; Grabherr, Manfred; Henrissat, Bernard; Houterman, Petra M.; Kang, Seogchan; Shim, Won-Bo; Woloshuk, Charles; Xie, Xiaohui; Xu, Jin-Rong; Antoniw, John; Baker, Scott E.; Bluhm, Burton H.; Breakspear, Andrew; Brown, Daren W.; Butchko, Robert A. E.; Chapman, Sinead; Coulson, Richard; Coutinho, Pedro M.; Danchin, Etienne G. J.; Diener, Andrew; Gale, Liane R.; Gardiner, Donald M.; Goff, Stephen; Hammond-Kosack, Kim E.; Hilburn, Karen; Hua-Van, Aurélie; Jonkers, Wilfried; Kazan, Kemal; Kodira, Chinnappa D.; Koehrsen, Michael; Kumar, Lokesh; Lee, Yong-Hwan; Li, Liande; Manners, John M.; Miranda-Saavedra, Diego; Mukherjee, Mala; Park, Gyungsoon; Park, Jongsun; Park, Sook-Young; Proctor, Robert H.; Regev, Aviv; Ruiz-Roldan, M. Carmen; Sain, Divya; Sakthikumar, Sharadha; Sykes, Sean; Schwartz, David C.; Turgeon, B. Gillian; Wapinski, Ilan; Yoder, Olen; Young, Sarah; Zeng, Qiandong; Zhou, Shiguo; Galagan, James; Cuomo, Christina A.; Kistler, H. Corby; Rep, Martijn

    2011-01-01

    Fusarium species are among the most important phytopathogenic and toxigenic fungi. To understand the molecular underpinnings of pathogenicity in the genus Fusarium, we compared the genomes of three phenotypically diverse species: Fusarium graminearum, Fusarium verticillioides and Fusarium oxysporum f. sp. lycopersici. Our analysis revealed lineage-specific (LS) genomic regions in F. oxysporum that include four entire chromosomes and account for more than one-quarter of the genome. LS regions are rich in transposons and genes with distinct evolutionary profiles but related to pathogenicity, indicative of horizontal acquisition. Experimentally, we demonstrate the transfer of two LS chromosomes between strains of F. oxysporum, converting a non-pathogenic strain into a pathogen. Transfer of LS chromosomes between otherwise genetically isolated strains explains the polyphyletic origin of host specificity and the emergence of new pathogenic lineages in F. oxysporum. These findings put the evolution of fungal pathogenicity into a new perspective. PMID:20237561

  16. CyanoClust: comparative genome resources of cyanobacteria and plastids.

    PubMed

    Sasaki, Naobumi V; Sato, Naoki

    2010-01-01

    Cyanobacteria, which perform oxygen-evolving photosynthesis as do chloroplasts of plants and algae, are one of the best-studied prokaryotic phyla and one from which many representative genomes have been sequenced. Lack of a suitable comparative genomic database has been a problem in cyanobacterial genomics because many proteins involved in physiological functions such as photosynthesis and nitrogen fixation are not catalogued in commonly used databases, such as Clusters of Orthologous Proteins (COG). CyanoClust is a database of homolog groups in cyanobacteria and plastids that are produced by the program Gclust. We have developed a web-server system for the protein homology database featuring cyanobacteria and plastids. Database URL: http://cyanoclust.c.u-tokyo.ac.jp/.

  17. Comparative analysis of methods for genome-wide nucleosome cartography.

    PubMed

    Quintales, Luis; Vázquez, Enrique; Antequera, Francisco

    2015-07-01

    Nucleosomes contribute to compacting the genome into the nucleus and regulate the physical access of regulatory proteins to DNA either directly or through the epigenetic modifications of the histone tails. Precise mapping of nucleosome positioning across the genome is, therefore, essential to understanding the genome regulation. In recent years, several experimental protocols have been developed for this purpose that include the enzymatic digestion, chemical cleavage or immunoprecipitation of chromatin followed by next-generation sequencing of the resulting DNA fragments. Here, we compare the performance and resolution of these methods from the initial biochemical steps through the alignment of the millions of short-sequence reads to a reference genome to the final computational analysis to generate genome-wide maps of nucleosome occupancy. Because of the lack of a unified protocol to process data sets obtained through the different approaches, we have developed a new computational tool (NUCwave), which facilitates their analysis, comparison and assessment and will enable researchers to choose the most suitable method for any particular purpose. NUCwave is freely available at http://nucleosome.usal.es/nucwave along with a step-by-step protocol for its use. PMID:25296770

  18. Comparative mitochondrial genomics within and among species of killifish

    PubMed Central

    Whitehead, Andrew

    2009-01-01

    Background This study was motivated by the observation of unusual mitochondrial haplotype distributions and associated physiological differences between populations of the killifish Fundulus heteroclitus distributed along the Atlantic coast of North America. A distinct "northern" haplotype is fixed in all populations north of New Jersey, and does not appear south of New Jersey except in extreme upper-estuary fresh water habitats, and northern individuals are known to be more tolerant of hyposmotic conditions than southern individuals. Complete mitochondrial genomes were sequenced from individuals from northern coastal, southern coastal, and fresh water populations (and from out-groups). Comparative genomics approaches were used to test multiple evolutionary hypotheses proposed to explain among-population genome variation including directional selection and hybridization. Results Structure and organization of the Fundulus mitochondrial genome is typical of animals, yet subtle differences in substitution patterns exist among populations. No signals of directional selection or hybridization were detected. Mitochondrial genes evolve at variable rates, but all genes exhibit very low dN/dS ratios across all lineages, and the southern population harbors more synonymous polymorphism than other populations. Conclusion Evolution of mitochondrial genomes within Fundulus is primarily governed by interaction between strong purifying selection and demographic influences, including larger historical population size in the south. Though directional selection and hybridization hypotheses were not supported, adaptive processes may indirectly contribute to partitioning of variation between populations. PMID:19144111

  20. Comparative Analysis of Genome Sequences Covering the Seven Cronobacter Species

    PubMed Central

    Cummings, Craig A.; Shih, Rita; Degoricija, Lovorka; Rico, Alain; Brzoska, Pius; Hamby, Stephen E.; Masood, Naqash; Hariri, Sumyya; Sonbol, Hana; Chuzhanova, Nadia; McClelland, Michael; Furtado, Manohar R.; Forsythe, Stephen J.

    2012-01-01

    Background Species of Cronobacter are widespread in the environment and are occasional food-borne pathogens associated with serious neonatal diseases, including bacteraemia, meningitis, and necrotising enterocolitis. The genus is composed of seven species: C. sakazakii, C. malonaticus, C. turicensis, C. dublinensis, C. muytjensii, C. universalis, and C. condimenti. Clinical cases are associated with three species, C. malonaticus, C. turicensis and, in particular, with C. sakazakii multilocus sequence type 4. Thus, it is plausible that virulence determinants have evolved in certain lineages. Methodology/Principal Findings We generated high quality sequence drafts for eleven Cronobacter genomes representing the seven Cronobacter species, including an ST4 strain of C. sakazakii. Comparative analysis of these genomes together with the two publicly available genomes revealed Cronobacter has over 6,000 genes in one or more strains and over 2,000 genes shared by all Cronobacter. Considerable variation in the presence of traits such as type six secretion systems, metal resistance (tellurite, copper and silver), and adhesins were found. C. sakazakii is unique in the Cronobacter genus in encoding genes enabling the utilization of exogenous sialic acid which may have clinical significance. The C. sakazakii ST4 strain 701 contained additional genes as compared to other C. sakazakii but none of them were known specific virulence-related genes. Conclusions/Significance Genome comparison revealed that pair-wise DNA sequence identity varies between 89 and 97% in the seven Cronobacter species, and also suggested various degrees of divergence. Sets of universal core genes and accessory genes unique to each strain were identified. These gene sequences can be used for designing genus/species specific detection assays. 
Genes encoding adhesins, T6SS, and metal resistance genes as well as prophages are found in only subsets of genomes and have contributed considerably to the variation of

  1. MicroScope: a platform for microbial genome annotation and comparative genomics.

    PubMed

    Vallenet, D; Engelen, S; Mornico, D; Cruveiller, S; Fleury, L; Lajus, A; Rouy, Z; Roche, D; Salvignol, G; Scarpelli, C; Médigue, C

    2009-01-01

    The initial outcome of genome sequencing is the creation of long text strings written in a four letter alphabet. The role of in silico sequence analysis is to assist biologists in the act of associating biological knowledge with these sequences, allowing investigators to make inferences and predictions that can be tested experimentally. A wide variety of software is available to the scientific community, and can be used to identify genomic objects, before predicting their biological functions. However, only a limited number of biologically interesting features can be revealed from an isolated sequence. Comparative genomics tools, on the other hand, by bringing together the information contained in numerous genomes simultaneously, allow annotators to make inferences based on the idea that evolution and natural selection are central to the definition of all biological processes. We have developed the MicroScope platform in order to offer a web-based framework for the systematic and efficient revision of microbial genome annotation and comparative analysis (http://www.genoscope.cns.fr/agc/microscope). Starting with the description of the flow chart of the annotation processes implemented in the MicroScope pipeline, and the development of traditional and novel microbial annotation and comparative analysis tools, this article emphasizes the essential role of expert annotation as a complement of automatic annotation. Several examples illustrate the use of implemented tools for the review and curation of annotations of both new and publicly available microbial genomes within MicroScope's rich integrated genome framework. The platform is used as a viewer in order to browse updated annotation information of available microbial genomes (more than 440 organisms to date), and in the context of new annotation projects (117 bacterial genomes). The human expertise gathered in the MicroScope database (about 280,000 independent annotations) contributes to improve the quality of

  2. Sequencing and Comparative Genome Analysis of Two Pathogenic Streptococcus gallolyticus Subspecies: Genome Plasticity, Adaptation and Virulence

    PubMed Central

    Teng, Yu-Ting; Wu, Hui-Lun; Liu, Yen-Ming; Wu, Keh-Ming; Chang, Chuan-Hsiung; Hsu, Ming-Ta

    2011-01-01

    Streptococcus gallolyticus infections in humans are often associated with bacteremia, infective endocarditis and colon cancers. The disease manifestations differ depending on the subspecies of S. gallolyticus causing the infection. Here, we present the complete genomes of S. gallolyticus ATCC 43143 (biotype I) and S. pasteurianus ATCC 43144 (biotype II.2). The genomic differences between the two biotypes were characterized with comparative genomic analyses. The chromosomes of ATCC 43143 and ATCC 43144 are 2.36 and 2.10 Mb in length and encode 2246 and 1869 CDS, respectively. The organization and genomic contents of both genomes were most similar to the recently published S. gallolyticus UCN34, where 2073 (92%) and 1607 (86%) of the ATCC 43143 and ATCC 43144 CDS were conserved in UCN34, respectively. There are around 600 CDS conserved in all Streptococcus genomes, indicating that the Streptococcus genus has a small core genome (constituting around 30% of total CDS) and substantial evolutionary plasticity. We identified eight and five regions of genome plasticity in ATCC 43143 and ATCC 43144, respectively. Within these regions, several proteins were recognized to contribute to the fitness and virulence of each of the two subspecies. We have also predicted putative cell-surface associated proteins that could play a role in adherence to host tissues, leading to persistent infections causing sub-acute and chronic diseases in humans. This study showed evidence that S. gallolyticus still possesses genes that make it suited to a rumen environment, whereas the ability of S. pasteurianus to live in the rumen is reduced. The genome heterogeneity and genetic diversity between the two biotypes, especially in membrane proteins and lipoproteins, most likely contribute to the differences in the pathogenesis of the two S. gallolyticus biotypes and the type of disease an infected patient eventually develops. PMID:21633709
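
    The core-genome and conservation figures in this record can be illustrated with a minimal sketch. It treats each genome as a set of gene identifiers, takes the core genome as the intersection and the pan-genome as the union, and recomputes the two conservation percentages quoted in the abstract. The strain and gene names in the toy data are hypothetical, and the function names are illustrative assumptions, not part of the study's pipeline.

```python
# Toy illustration of core-genome vs pan-genome sets.
# Strain and gene names below are hypothetical placeholders.

def core_and_pan(genomes):
    """Return (core, pan) gene sets for a dict of genome name -> set of gene IDs."""
    gene_sets = list(genomes.values())
    core = set.intersection(*gene_sets)  # genes present in every genome
    pan = set.union(*gene_sets)          # genes present in at least one genome
    return core, pan


def percent_conserved(conserved, total):
    """Percentage of CDS in one genome that are conserved in a reference genome."""
    return 100.0 * conserved / total


toy = {
    "strain_A": {"g1", "g2", "g3", "g4"},
    "strain_B": {"g1", "g2", "g5"},
    "strain_C": {"g1", "g2", "g3", "g6"},
}
core, pan = core_and_pan(toy)
print(sorted(core), len(pan))

# CDS counts quoted in the abstract (conserved in UCN34 / total CDS).
print(round(percent_conserved(2073, 2246)))  # ATCC 43143
print(round(percent_conserved(1607, 1869)))  # ATCC 43144
```

    Real analyses first cluster genes into ortholog groups before any set comparison; the sketch assumes that step has already produced shared identifiers.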

  3. Comparative genomics reveals diversity among xanthomonads infecting tomato and pepper

    PubMed Central

    2011-01-01

    Background Bacterial spot of tomato and pepper is caused by four Xanthomonas species and is a major plant disease in warm humid climates. The four species are distinct from each other based on physiological and molecular characteristics. The genome sequence of strain 85-10, a member of one of the species, Xanthomonas euvesicatoria (Xcv) has been previously reported. To determine the relationship of the four species at the genome level and to investigate the molecular basis of their virulence and differing host ranges, draft genomic sequences of members of the other three species were determined and compared to strain 85-10. Results We sequenced the genomes of X. vesicatoria (Xv) strain 1111 (ATCC 35937), X. perforans (Xp) strain 91-118 and X. gardneri (Xg) strain 101 (ATCC 19865). The genomes were compared with each other and with the previously sequenced Xcv strain 85-10. In addition, the molecular features were predicted that may be required for pathogenicity including the type III secretion apparatus, type III effectors, other secretion systems, quorum sensing systems, adhesins, extracellular polysaccharide, and lipopolysaccharide determinants. Several novel type III effectors from Xg strain 101 and Xv strain 1111 genomes were computationally identified and their translocation was validated using a reporter gene assay. A homolog to Ax21, the elicitor of XA21-mediated resistance in rice, and a functional Ax21 sulfation system were identified in Xcv. Genes encoding proteins with functions mediated by type II and type IV secretion systems have also been compared, including enzymes involved in cell wall deconstruction, as contributors to pathogenicity. Conclusions Comparative genomic analyses revealed considerable diversity among bacterial spot pathogens, providing new insights into differences and similarities that may explain the diverse nature of these strains. 
Genes specific to pepper pathogens, such as the O-antigen of the lipopolysaccharide cluster, and genes

  4. Comparative genomics of transcriptional regulation of methionine metabolism in Proteobacteria.

    PubMed

    Leyn, Semen A; Suvorova, Inna A; Kholina, Tatiana D; Sherstneva, Sofia S; Novichkov, Pavel S; Gelfand, Mikhail S; Rodionov, Dmitry A

    2014-01-01

    Methionine metabolism and uptake genes in Proteobacteria are controlled by a variety of RNA and DNA regulatory systems. We have applied comparative genomics to reconstruct regulons for three known transcription factors, MetJ, MetR, and SahR, and three known riboswitch motifs, SAH, SAM-SAH, and SAM_alpha, in ∼ 200 genomes from 22 taxonomic groups of Proteobacteria. We also identified two novel regulons: a SahR-like transcription factor SamR controlling various methionine biosynthesis genes in the Xanthomonadales group, and a potential RNA regulatory element with terminator-antiterminator mechanism controlling the metX or metZ genes in beta-proteobacteria. For each analyzed regulator we identified the core, taxon-specific and genome-specific regulon members. By analyzing the distribution of these regulators in bacterial genomes and by comparing their regulon contents we elucidated possible evolutionary scenarios for the regulation of the methionine metabolism genes in Proteobacteria. PMID:25411846

  5. Comparative genomics of transcriptional regulation of methionine metabolism in proteobacteria

    DOE PAGES

    Leyn, Semen A.; Suvorova, Inna A.; Kholina, Tatiana D.; Sherstneva, Sofia S.; Novichkov, Pavel S.; Gelfand, Mikhail S.; Rodionov, Dmitry A.; Kuipers, Oscar P.

    2014-11-20

    Methionine metabolism and uptake genes in Proteobacteria are controlled by a variety of RNA and DNA regulatory systems. We have applied comparative genomics to reconstruct regulons for three known transcription factors, MetJ, MetR, and SahR, and three known riboswitch motifs, SAH, SAM-SAH, and SAM_alpha, in ~200 genomes from 22 taxonomic groups of Proteobacteria. We also identified two novel regulons: a SahR-like transcription factor SamR controlling various methionine biosynthesis genes in the Xanthomonadales group, and a potential RNA regulatory element with terminator-antiterminator mechanism controlling the metX or metZ genes in beta-proteobacteria. For each analyzed regulator we identified the core, taxon-specific and genome-specific regulon members. By analyzing the distribution of these regulators in bacterial genomes and by comparing their regulon contents we elucidated possible evolutionary scenarios for the regulation of the methionine metabolism genes in Proteobacteria.

  6. Comparative genomics of transcriptional regulation of methionine metabolism in proteobacteria

    SciTech Connect

    Leyn, Semen A.; Suvorova, Inna A.; Kholina, Tatiana D.; Sherstneva, Sofia S.; Novichkov, Pavel S.; Gelfand, Mikhail S.; Rodionov, Dmitry A.; Kuipers, Oscar P.

    2014-11-20

    Methionine metabolism and uptake genes in Proteobacteria are controlled by a variety of RNA and DNA regulatory systems. We have applied comparative genomics to reconstruct regulons for three known transcription factors, MetJ, MetR, and SahR, and three known riboswitch motifs, SAH, SAM-SAH, and SAM_alpha, in ~200 genomes from 22 taxonomic groups of Proteobacteria. We also identified two novel regulons: a SahR-like transcription factor SamR controlling various methionine biosynthesis genes in the Xanthomonadales group, and a potential RNA regulatory element with terminator-antiterminator mechanism controlling the metX or metZ genes in beta-proteobacteria. For each analyzed regulator we identified the core, taxon-specific and genome-specific regulon members. By analyzing the distribution of these regulators in bacterial genomes and by comparing their regulon contents we elucidated possible evolutionary scenarios for the regulation of the methionine metabolism genes in Proteobacteria.

  7. Comparative genomics and transcriptomics of trait-gene association

    PubMed Central

    2012-01-01

    Background The Order Rickettsiales includes important tick-borne pathogens, from Rickettsia rickettsii, which causes Rocky Mountain spotted fever, to Anaplasma marginale, the most prevalent vector-borne pathogen of cattle. Although most pathogens in this Order are transmitted by arthropod vectors, little is known about the microbial determinants of transmission. A. marginale provides unique tools for studying the determinants of transmission, with multiple strain sequences available that display distinct and reproducible transmission phenotypes. The closed core A. marginale genome suggests that any phenotypic differences are due to single nucleotide polymorphisms (SNPs). We combined DNA/RNA comparative genomic approaches using strains with different tick transmission phenotypes and identified genes that segregate with transmissibility. Results Comparison of seven strains with different transmission phenotypes generated a list of SNPs affecting 18 genes and nine promoters. Transcriptional analysis found two candidate genes downstream from promoter SNPs that were differentially transcribed. To corroborate the comparative genomics approach we used three RNA-seq platforms to analyze the transcriptomes from two A. marginale strains with different transmission phenotypes. RNA-seq analysis confirmed the comparative genomics data and found 10 additional genes whose transcription between strains with distinct transmission efficiencies was significantly different. Six regions of the genome that contained no annotation were found to be transcriptionally active, and two of these newly identified transcripts were differentially transcribed. Conclusions This approach identified 30 genes and two novel transcripts potentially involved in tick transmission. We describe the transcriptome of an obligate intracellular bacterium in depth, while employing massive parallel sequencing to dissect an important trait in bacterial pathogenesis. PMID:23181781

  8. Lactobacillus paracasei Comparative Genomics: Towards Species Pan-Genome Definition and Exploitation of Diversity

    PubMed Central

    Smokvina, Tamara; Wels, Michiel; Polka, Justyna; Chervaux, Christian; Brisse, Sylvain; Boekhorst, Jos; Vlieg, Johan E. T. van Hylckama; Siezen, Roland J.

    2013-01-01

    Lactobacillus paracasei is a member of the normal human and animal gut microbiota and is used extensively in the food industry in starter cultures for dairy products or as probiotics. With the development of low-cost, high-throughput sequencing techniques it has become feasible to sequence many different strains of one species and to determine its “pan-genome”. We have sequenced the genomes of 34 different L. paracasei strains, and performed a comparative genomics analysis. We analysed genome synteny and content, focussing on the pan-genome, core genome and variable genome. Each genome was shown to contain around 2800–3100 protein-coding genes, and comparative analysis identified over 4200 ortholog groups that comprise the pan-genome of this species, of which about 1800 ortholog groups make up the conserved core. Several factors previously associated with host-microbe interactions such as pili, cell-envelope proteinase, hydrolases p40 and p75 or the capacity to produce short branched-chain fatty acids (bkd operon) are part of the L. paracasei core genome present in all analysed strains. The variome consists mainly of hypothetical proteins, phages, plasmids, transposon/conjugative elements, and known functions such as sugar metabolism, cell-surface proteins, transporters, CRISPR-associated proteins, and EPS biosynthesis proteins. An enormous variety and variability of sugar utilization gene cassettes were identified, with each strain harbouring between 25 and 53 cassettes, reflecting the high adaptability of L. paracasei to different niches. A phylogenomic tree was constructed based on total genome contents, and together with an analysis of horizontal gene transfer events we conclude that evolution of these L. paracasei strains is complex and not always related to niche adaptation. The results of this genome content comparison were used, together with high-throughput growth experiments on various carbohydrates, to perform gene-trait matching analysis, in order to

  9. Comparative dynamics of avian communities across edges and interiors of North American ecoregions

    USGS Publications Warehouse

    Karanth, K.K.; Nichols, J.D.; Sauer, J.R.; Hines, J.E.

    2006-01-01

    Aim Based on a priori hypotheses, we developed predictions about how avian communities might differ at the edges vs. interiors of ecoregions. Specifically, we predicted lower species richness and greater local turnover and extinction probabilities for regional edges. We tested these predictions using North American Breeding Bird Survey (BBS) data across nine ecoregions over a 20-year time period. Location Data from 2238 BBS routes within nine ecoregions of the United States were used. Methods The estimation methods used accounted for species detection probabilities < 1. Parameter estimates for species richness, local turnover and extinction probabilities were obtained using the program COMDYN. We examined the difference in community-level parameters estimated from within exterior edges (the habitat interface between ecoregions), interior edges (the habitat interface between two bird conservation regions within the same ecoregion) and interior (habitat excluding interfaces). General linear models were constructed to examine sources of variation in community parameters for five ecoregions (containing all three habitat types) and all nine ecoregions (containing two habitat types). Results Analyses provided evidence that interior habitats and interior edges had on average higher bird species richness than exterior edges, providing some evidence of reduced species richness near habitat edges. Lower average extinction probabilities and turnover rates in interior habitats (five-region analysis) provided some support for our predictions about these quantities. However, analyses directed at all three response variables, i.e. species richness, local turnover, and local extinction probability, provided evidence of an interaction between habitat and region, indicating that the relationships did not hold in all regions. 
Main conclusions The overall predictions of lower species richness, higher local turnover and extinction probabilities in regional edge habitats, as opposed to

  10. The Genomic Contributions of Avian H1N1 Influenza A Viruses to the Evolution of Mammalian Strains.

    PubMed

    Koçer, Zeynep A; Carter, Robert; Wu, Gang; Zhang, Jinghui; Webster, Robert G

    2015-01-01

    Among the influenza A viruses (IAVs) in wild aquatic birds, only H1, H2, and H3 subtypes have caused epidemics in humans. H1N1 viruses of avian origin have also caused 3 of 5 pandemics. To understand the reappearance of H1N1 in the context of pandemic emergence, we investigated whether avian H1N1 IAVs have contributed to the evolution of human, swine, and 2009 pandemic H1N1 IAVs. On the basis of phylogenetic analysis, we concluded that the polymerase gene segments (especially PB2 and PA) circulating in North American avian H1N1 IAVs have been reintroduced to swine multiple times, resulting in different lineages that led to the emergence of the 2009 pandemic H1N1 IAVs. Moreover, the similar topologies of hemagglutinin and nucleoprotein and neuraminidase and matrix gene segments suggest that each surface glycoprotein coevolved with an internal gene segment within the H1N1 subtype. The genotype of avian H1N1 IAVs of Charadriiformes origin isolated in 2009 differs from that of avian H1N1 IAVs of Anseriformes origin. When the antigenic sites in the hemagglutinin of all 31 North American avian H1N1 IAVs were considered, 60%-80% of the amino acids at the antigenic sites were identical to those in 1918 and/or 2009 pandemic H1N1 viruses. Thus, although the pathogenicity of avian H1N1 IAVs could not be inferred from the phylogeny due to the small dataset, the evolutionary process within the H1N1 IAV subtype suggests that the circulation of H1N1 IAVs in wild birds poses a continuous threat for future influenza pandemics in humans. PMID:26208281

  12. A Mitochondrial Genome of Rhyparochromidae (Hemiptera: Heteroptera) and a Comparative Analysis of Related Mitochondrial Genomes

    PubMed Central

    Li, Teng; Yang, Jie; Li, Yinwan; Cui, Ying; Xie, Qiang; Bu, Wenjun; Hillis, David M.

    2016-01-01

    The Rhyparochromidae, the largest family of Lygaeoidea, encompasses more than 1,850 described species, but no mitochondrial genome has been sequenced to date. Here we describe the first mitochondrial genome for Rhyparochromidae: a complete mitochondrial genome of Panaorus albomaculatus (Scott, 1874). This mitochondrial genome is comprised of 16,345 bp, and contains the expected 37 genes and control region. The majority of the control region is made up of a large tandem-repeat region, which has a novel pattern not previously observed in other insects. The tandem-repeats region of P. albomaculatus consists of 53 tandem duplications (including one partial repeat), which is the largest number of tandem repeats among all the known insect mitochondrial genomes. Slipped-strand mispairing during replication is likely to have generated this novel pattern of tandem repeats. Comparative analysis of tRNA gene families in sequenced Pentatomomorpha and Lygaeoidea species shows that the pattern of nucleotide conservation is markedly higher on the J-strand. Phylogenetic reconstruction based on mitochondrial genomes suggests that Rhyparochromidae is not the sister group to all the remaining Lygaeoidea, and supports the monophyly of Lygaeoidea. PMID:27756915

  13. The Whole Genome Assembly and Comparative Genomic Research of Thellungiella parvula (Extremophile Crucifer) Mitochondrion.

    PubMed

    Wang, Xuelin; Bi, Changwei; Xu, Yiqing; Wei, Suyun; Dai, Xiaogang; Yin, Tongming; Ye, Ning

    2016-01-01

    The complete nucleotide sequence of the mitochondrial (mt) genome of the extremophile species Thellungiella parvula (T. parvula) has been determined, with a length of 255,773 bp. The T. parvula mt genome is a circular sequence and contains 32 protein-coding genes, 19 tRNA genes, and three ribosomal RNA genes, with an 11.5% coding sequence. The base composition of 27.5% A, 27.5% T, 22.7% C, and 22.3% G in descending order shows a slight bias of 55% AT. Fifty-three repeats were identified in the mitochondrial genome of T. parvula, including 24 direct repeats, 28 tandem repeats (TRs), and one palindromic repeat. Furthermore, a total of 199 perfect microsatellites with a high A/T content (83.1%) have been mined through simple sequence repeat (SSR) analysis, and they were distributed unevenly within this mitochondrial genome. We also analyzed the evolution of other plant mitochondrial genomes in general, providing clues for understanding the evolution of organelle genomes in plants. Compared with other Brassicaceae species, T. parvula is related to Arabidopsis thaliana, whose low-temperature resistance has been well documented. This study will provide important genetic tools for research on other Brassicaceae species and for improving yields of economically important plants. PMID:27148547
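
    The AT bias quoted in this record follows directly from the reported base composition, as the short sketch below shows. It also computes a composition from a raw sequence string. The function names and the ten-base toy sequence are illustrative assumptions, not data or code from the study.

```python
from collections import Counter


def base_composition(seq):
    """Return the percentage of A, C, G and T in a DNA string (other symbols are ignored)."""
    counts = Counter(b for b in seq.upper() if b in "ACGT")
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    return {b: 100.0 * counts[b] / total for b in "ACGT"}


def at_content(comp):
    """AT content is the sum of the A and T percentages."""
    return comp["A"] + comp["T"]


# Percentages reported in the abstract for the T. parvula mt genome.
reported = {"A": 27.5, "T": 27.5, "C": 22.7, "G": 22.3}
print(at_content(reported))

# Hypothetical toy sequence, used only to exercise base_composition().
toy = base_composition("ATATGCGCAT")
print(toy, at_content(toy))
```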

  14. The Whole Genome Assembly and Comparative Genomic Research of Thellungiella parvula (Extremophile Crucifer) Mitochondrion

    PubMed Central

    Wang, Xuelin; Bi, Changwei; Xu, Yiqing; Wei, Suyun; Dai, Xiaogang; Yin, Tongming; Ye, Ning

    2016-01-01

    The complete nucleotide sequence of the mitochondrial (mt) genome of the extremophile species Thellungiella parvula (T. parvula) has been determined, with a length of 255,773 bp. The T. parvula mt genome is a circular sequence and contains 32 protein-coding genes, 19 tRNA genes, and three ribosomal RNA genes, with an 11.5% coding sequence. The base composition of 27.5% A, 27.5% T, 22.7% C, and 22.3% G in descending order shows a slight bias of 55% AT. Fifty-three repeats were identified in the mitochondrial genome of T. parvula, including 24 direct repeats, 28 tandem repeats (TRs), and one palindromic repeat. Furthermore, a total of 199 perfect microsatellites with a high A/T content (83.1%) have been mined through simple sequence repeat (SSR) analysis, and they were distributed unevenly within this mitochondrial genome. We also analyzed the evolution of other plant mitochondrial genomes in general, providing clues for understanding the evolution of organelle genomes in plants. Compared with other Brassicaceae species, T. parvula is related to Arabidopsis thaliana, whose low-temperature resistance has been well documented. This study will provide important genetic tools for research on other Brassicaceae species and for improving yields of economically important plants. PMID:27148547

  15. Comparative genomics of unintrogressed Campylobacter coli clades 2 and 3

    PubMed Central

    2014-01-01

    Background Campylobacter jejuni and C. coli share a multitude of risk factors associated with human gastrointestinal disease, yet their phylogeny differs significantly. C. jejuni is scattered into several lineages, with no apparent linkage, whereas C. coli clusters into three distinct phylogenetic groups (clades) of which clade 1 has shown extensive genome-wide introgression with C. jejuni, yet the other two clades (2 and 3) have less than 2% of C. jejuni ancestry. We characterized a C. coli strain (76339) with four novel multilocus sequence type alleles (ST-5088) and having the capability to express gamma-glutamyltranspeptidase (GGT); an accessory feature in C. jejuni. Our aim was to further characterize unintrogressed C. coli clades 2 and 3, using comparative genomics and with additional genome sequences available, to investigate the impact of horizontal gene transfer in shaping the accessory and core gene pools in unintrogressed C. coli. Results Here, we present the first fully closed C. coli clade 3 genome (76339). The phylogenomic analysis of strain 76339, revealed that it belonged to clade 3 of unintrogressed C. coli. A more extensive respiratory metabolism among unintrogressed C. coli strains was found compared to introgressed C. coli (clade 1). We also identified other genes, such as serine proteases and an active sialyltransferase in the lipooligosaccharide locus, not present in C. coli clade 1 and we further propose a unique scenario for the evolution of Campylobacter ggt. Conclusions We propose new insights into the evolution of the accessory genome of C. coli clade 3 and C. jejuni. Also, in silico analysis of the gene content revealed that C. coli clades 2 and 3 have genes associated with infection, suggesting they are a potent human pathogen, and may currently be underreported in human infections due to niche separation. PMID:24524824

  16. Comparative genomics of 9 novel Paenibacillus larvae bacteriophages

    PubMed Central

    Stamereilers, Casey; LeBlanc, Lucy; Yost, Diane; Amy, Penny S.; Tsourkas, Philippos K.

    2016-01-01

    ABSTRACT American Foulbrood Disease, caused by the bacterium Paenibacillus larvae, is one of the most destructive diseases of the honeybee, Apis mellifera. Our group recently published the sequences of 9 new phages with the ability to infect and lyse P. larvae. Here, we characterize the genomes of these P. larvae phages, compare them to each other and to other sequenced P. larvae phages, and putatively identify protein function. The phage genomes are 38–45 kb in size and contain 68–86 genes, most of which appear to be unique to P. larvae phages. We classify P. larvae phages into 2 main clusters and one singleton based on nucleotide sequence identity. Three of the new phages show sequence similarity to other sequenced P. larvae phages, while the remaining 6 do not. We identified functions for roughly half of the P. larvae phage proteins, including structural, assembly, host lysis, DNA replication/metabolism, regulatory, and host-related functions. Structural and assembly proteins are highly conserved among our phages and are located at the start of the genome. DNA replication/metabolism, regulatory, and host-related proteins are located in the middle and end of the genome, and are not conserved, with many of these genes found in some of our phages but not others. All nine phages code for a conserved N-acetylmuramoyl-L-alanine amidase. Comparative analysis showed the phages use the “cohesive ends with 3′ overhang” DNA packaging strategy. This work is the first in-depth study of P. larvae phage genomics, and serves as a marker for future work in this area. PMID:27738559

  17. Comparative analysis of Acinetobacters: three genomes for three lifestyles.

    PubMed

    Vallenet, David; Nordmann, Patrice; Barbe, Valérie; Poirel, Laurent; Mangenot, Sophie; Bataille, Elodie; Dossat, Carole; Gas, Shahinaz; Kreimeyer, Annett; Lenoble, Patricia; Oztas, Sophie; Poulain, Julie; Segurens, Béatrice; Robert, Catherine; Abergel, Chantal; Claverie, Jean-Michel; Raoult, Didier; Médigue, Claudine; Weissenbach, Jean; Cruveiller, Stéphane

    2008-03-19

    Acinetobacter baumannii is the source of numerous nosocomial infections in humans and therefore deserves close attention as multidrug or even pandrug resistant strains are increasingly being identified worldwide. Here we report the comparison of two newly sequenced genomes of A. baumannii. The human isolate A. baumannii AYE is multidrug resistant whereas strain SDF, which was isolated from body lice, is antibiotic susceptible. As reference for comparison in this analysis, the genome of the soil-living bacterium A. baylyi strain ADP1 was used. The most interesting dissimilarities we observed were that i) whereas strain AYE and A. baylyi genomes harbored very few Insertion Sequence elements which could promote expression of downstream genes, strain SDF sequence contains several hundred of them that have played a crucial role in its genome reduction (gene disruptions and simple DNA loss); ii) strain SDF has low catabolic capacities compared to strain AYE. Interestingly, the latter has even higher catabolic capacities than A. baylyi which has already been reported as a very nutritionally versatile organism. This metabolic performance could explain the persistence of A. baumannii nosocomial strains in environments where nutrients are scarce; iii) several processes known to play a key role during host infection (biofilm formation, iron uptake, quorum sensing, virulence factors) were either different or absent, the best example of which is iron uptake. Indeed, strain AYE and A. baylyi use siderophore-based systems to scavenge iron from the environment whereas strain SDF uses an alternate system similar to the Haem Acquisition System (HAS). Taken together, all these observations suggest that the genome contents of the 3 Acinetobacters compared are partly shaped by life in distinct ecological niches: human (and more largely hospital environment), louse, soil.

  18. Exploring the early origins of the synapse by comparative genomics

    PubMed Central

    Kosik, Kenneth S.

    2008-01-01

    One set of evolutionary features that has received less attention than the evolution of genes or species is the evolution of cellular machines, the self-contained structures in cells with dedicated functions. Here I suggest that domain expansion through shuffling, duplication, and changes in protein expression level are critical drivers in the evolution of cellular machines. Once established, evolutionary change in these cellular machines tends to occur by paralogy or expansion and modification of the existing core genes. A comparative genomics approach to one cellular machine—the post-synaptic complex—provided preliminary validation of these views. A comparative genomics approach to the entire cellulome may reveal the diversity of cellular machines and their inter-relationships. PMID:19049956

  19. Decoding the molecular evolution of human cognition using comparative genomics.

    PubMed

    Usui, Noriyoshi; Co, Marissa; Konopka, Genevieve

    2014-01-01

    Identification of genetic and molecular factors responsible for the specialized cognitive abilities of humans is expected to provide important insights into the mechanisms responsible for disorders of cognition such as autism, schizophrenia and Alzheimer's disease. Here, we discuss the use of comparative genomics for identifying salient genes and gene networks that may underlie cognition. We focus on the comparison of human and non-human primate brain gene expression and the utility of building gene coexpression networks for prioritizing hundreds of genes that differ in expression among the species queried. We also discuss the importance of and methods for functional studies of the individual genes identified. Together, this integration of comparative genomics with cellular and animal models should provide improved systems for developing effective therapeutics for disorders of cognition. PMID:25247723

  20. Floral gene resources from basal angiosperms for comparative genomics research

    PubMed Central

    Albert, Victor A; Soltis, Douglas E; Carlson, John E; Farmerie, William G; Wall, P Kerr; Ilut, Daniel C; Solow, Teri M; Mueller, Lukas A; Landherr, Lena L; Hu, Yi; Buzgo, Matyas; Kim, Sangtae; Yoo, Mi-Jeong; Frohlich, Michael W; Perl-Treves, Rafael; Schlarbaum, Scott E; Bliss, Barbara J; Zhang, Xiaohong; Tanksley, Steven D; Oppenheimer, David G; Soltis, Pamela S; Ma, Hong; dePamphilis, Claude W; Leebens-Mack, James H

    2005-01-01

    Background The Floral Genome Project was initiated to bridge the genomic gap between the most broadly studied plant model systems. Arabidopsis and rice, although now completely sequenced and under intensive comparative genomic investigation, are separated by at least 125 million years of evolutionary time, and cannot in isolation provide a comprehensive perspective on structural and functional aspects of flowering plant genome dynamics. Here we discuss new genomic resources available to the scientific community, comprising cDNA libraries and Expressed Sequence Tag (EST) sequences for a suite of phylogenetically basal angiosperms specifically selected to bridge the evolutionary gaps between model plants and provide insights into gene content and genome structure in the earliest flowering plants. Results Random sequencing of cDNAs from representatives of phylogenetically important eudicot, non-grass monocot, and gymnosperm lineages has so far (as of 12/1/04) generated 70,514 ESTs and 48,170 assembled unigenes. Efficient sorting of EST sequences into putative gene families based on whole Arabidopsis/rice proteome comparison has permitted ready identification of cDNA clones for finished sequencing. Preliminarily, (i) proportions of functional categories among sequenced floral genes seem representative of the entire Arabidopsis transcriptome, (ii) many known floral gene homologues have been captured, and (iii) phylogenetic analyses of ESTs are providing new insights into the process of gene family evolution in relation to the origin and diversification of the angiosperms. Conclusion Initial comparisons illustrate the utility of the EST data sets toward discovery of the basic floral transcriptome. These first findings also afford the opportunity to address a number of conspicuous evolutionary genomic questions, including reproductive organ transcriptome overlap between angiosperms and gymnosperms, genome-wide duplication history, lineage-specific gene duplication and

  1. Mosaic supernumerary ring chromosome 19 identified by comparative genomic hybridisation.

    PubMed Central

    Ghaffari, S R; Boyd, E; Connor, J M; Jones, A M; Tolmie, J L

    1998-01-01

    We report the use of comparative genomic hybridisation (CGH) to define the origin of a supernumerary ring chromosome which conventional cytogenetic banding and fluorescence in situ hybridisation (FISH) methods had failed to identify. Targeted FISH using whole chromosome 19 library arm and site specific probes then confirmed the CGH results. This study shows the feasibility of using CGH for the identification of supernumerary marker chromosomes, even in fewer than 50% of cells, where no clinical or cytogenetic clues are present. PMID:9783708

  2. Genome sequence and comparative genome analysis of Lactobacillus casei: insights into their niche-associated evolution.

    PubMed

    Cai, Hui; Thompson, Rebecca; Budinich, Mateo F; Broadbent, Jeff R; Steele, James L

    2009-01-01

    Lactobacillus casei is remarkably adaptable to diverse habitats and widely used in the food industry. To reveal the genomic features that contribute to its broad ecological adaptability and examine the evolution of the species, the genome sequence of L. casei ATCC 334 is analyzed and compared with other sequenced lactobacilli. This analysis reveals that ATCC 334 contains a high number of coding sequences involved in carbohydrate utilization and transcriptional regulation, reflecting its requirement for dealing with diverse environmental conditions. A comparison of the genome sequences of ATCC 334 to L. casei BL23 reveals 12 and 19 genomic islands, respectively. For a broader assessment of the genetic variability within L. casei, gene content of 21 L. casei strains isolated from various habitats (cheeses, n = 7; plant materials, n = 8; and human sources, n = 6) was examined by comparative genome hybridization with an ATCC 334-based microarray. This analysis resulted in identification of 25 hypervariable regions. One of these regions contains an overrepresentation of genes involved in carbohydrate utilization and transcriptional regulation and was thus proposed as a lifestyle adaptation island. Differences in L. casei genome inventory reveal both gene gain and gene decay. Gene gain, via acquisition of genomic islands, likely confers a fitness benefit in specific habitats. Gene decay, that is, loss of unnecessary ancestral traits, is observed in the cheese isolates and likely results in enhanced fitness in the dairy niche. This study gives the first picture of the stable versus variable regions in L. casei and provides valuable insights into evolution, lifestyle adaptation, and metabolic diversity of L. casei. PMID:20333194

  3. Comparative Omics-Driven Genome Annotation Refinement: Application across Yersiniae

    SciTech Connect

    Rutledge, Alexandra C.; Jones, Marcus B.; Chauhan, Sadhana; Purvine, Samuel O.; Sanford, James; Monroe, Matthew E.; Brewer, Heather M.; Payne, Samuel H.; Ansong, Charles; Frank, Bryan C.; Smith, Richard D.; Peterson, Scott; Motin, Vladimir L.; Adkins, Joshua N.

    2012-03-27

    Genome sequencing continues to be a rapidly evolving technology, yet most downstream aspects of genome annotation pipelines remain relatively stable or are even being abandoned. To date, the perceived value of manual curation for genome annotations is not offset by the real cost and time associated with the process. In order to balance the large number of sequences generated, the annotation process is now performed almost exclusively in an automated fashion for most genome sequencing projects. One possible way to reduce errors inherent to automated computational annotations is to apply data from 'omics' measurements (i.e. transcriptional and proteomic) to the un-annotated genome with a proteogenomic-based approach. This approach does require additional experimental and bioinformatics methods to include omics technologies; however, the approach is readily automatable and can benefit from rapid developments occurring in those research domains as well. The annotation process can be improved by experimental validation of transcription and translation and aid in the discovery of annotation errors. Here the concept of annotation refinement has been extended to include a comparative assessment of genomes across closely related species, as is becoming common in sequencing efforts. Transcriptomic and proteomic data derived from three highly similar pathogenic Yersiniae (Y. pestis CO92, Y. pestis pestoides F, and Y. pseudotuberculosis PB1/+) was used to demonstrate a comprehensive comparative omic-based annotation methodology. Peptide and oligo measurements experimentally validated the expression of nearly 40% of each strain's predicted proteome and revealed the identification of 28 novel and 68 previously incorrect protein-coding sequences (e.g., observed frameshifts, extended start sites, and translated pseudogenes) within the three current Yersinia genome annotations. Gene loss is presumed to play a major role in Y. pestis acquiring its niche as a virulent pathogen, thus

  4. Comparative genomics of toxigenic and non-toxigenic Staphylococcus hyicus.

    PubMed

    Leekitcharoenphon, Pimlapas; Pamp, Sünje Johanna; Andresen, Lars Ole; Aarestrup, Frank M

    2016-03-15

    The most common causative agent of exudative epidermitis (EE) in pigs is Staphylococcus hyicus. S. hyicus can be grouped into toxigenic and non-toxigenic strains based on their ability to cause EE in pigs and specific virulence genes have been identified. A genome wide comparison between non-toxigenic and toxigenic strains has never been performed. In this study, we sequenced eleven toxigenic and six non-toxigenic S. hyicus strains and performed comparative genomic and phylogenetic analysis. Our analyses revealed two genomic regions encoding genes that were predominantly found in toxigenic strains and are predicted to encode for virulence determinants for EE. All toxigenic strains encoded for one of the exfoliative toxins ExhA, ExhB, ExhC, or ExhD. In addition, one of these regions encoded for an ADP-ribosyltransferase (EDIN, epidermal cell differentiation inhibitor) and a novel putative RNase toxin (polymorphic toxin) and was associated with the gene encoding ExhA. A clear differentiation between toxigenic and non-toxigenic strains based on genomic and phylogenetic analyses was not apparent. The results of this study support the observation that exfoliative toxins of S. hyicus and S. aureus are located on genetic elements such as pathogenicity islands, phages, prophages and plasmids. PMID:26931389

  5. Comparative genomic analysis of seven Mycoplasma hyosynoviae strains

    PubMed Central

    Bumgardner, Eric A; Kittichotirat, Weerayuth; Bumgarner, Roger E; Lawrence, Paulraj K

    2015-01-01

    Infection with Mycoplasma hyosynoviae can result in debilitating arthritis in pigs, particularly those aged 10 weeks or older. Strategies for controlling this pathogen are becoming increasingly important due to the rise in the number of cases of arthritis that have been attributed to infection in recent years. In order to begin to develop interventions to prevent arthritis caused by M. hyosynoviae, more information regarding the specific proteins and potential virulence factors that its genome encodes was needed. However, the genome of this emerging swine pathogen had not been sequenced previously. In this report, we present a comparative analysis of the genomes of seven strains of M. hyosynoviae isolated from different locations in North America during the years 2010 to 2013. We identified several putative virulence factors that may contribute to the ability of this pathogen to adhere to host cells. Additionally, we discovered several prophage genes present within the genomes of three strains that show significant similarity to MAV1, a phage isolated from the related species, M. arthritidis. We also identified CRISPR-Cas and type III restriction and modification systems present in two strains that may contribute to their ability to defend against phage infection. PMID:25693846

  6. Comparative genome sequencing reveals genomic signature of extreme desiccation tolerance in the anhydrobiotic midge

    PubMed Central

    Gusev, Oleg; Suetsugu, Yoshitaka; Cornette, Richard; Kawashima, Takeshi; Logacheva, Maria D.; Kondrashov, Alexey S.; Penin, Aleksey A.; Hatanaka, Rie; Kikuta, Shingo; Shimura, Sachiko; Kanamori, Hiroyuki; Katayose, Yuichi; Matsumoto, Takashi; Shagimardanova, Elena; Alexeev, Dmitry; Govorun, Vadim; Wisecaver, Jennifer; Mikheyev, Alexander; Koyanagi, Ryo; Fujie, Manabu; Nishiyama, Tomoaki; Shigenobu, Shuji; Shibata, Tomoko F.; Golygina, Veronika; Hasebe, Mitsuyasu; Okuda, Takashi; Satoh, Nori; Kikawada, Takahiro

    2014-01-01

    Anhydrobiosis represents an extreme example of tolerance adaptation to water loss, where an organism can survive in an ametabolic state until water returns. Here we report the first comparative analysis examining the genomic background of extreme desiccation tolerance, which is exclusively found in larvae of the only anhydrobiotic insect, Polypedilum vanderplanki. We compare the genomes of P. vanderplanki and a congeneric desiccation-sensitive midge P. nubifer. We determine that the genome of the anhydrobiotic species specifically contains clusters of multi-copy genes with products that act as molecular shields. In addition, the genome possesses several groups of genes with high similarity to known protective proteins. However, these genes are located in distinct paralogous clusters in the genome apart from the classical orthologues of the corresponding genes shared by both chironomids and other insects. The transcripts of these clustered paralogues contribute to a large majority of the mRNA pool in the desiccating larvae and most likely define successful anhydrobiosis. Comparison of expression patterns of orthologues between two chironomid species provides evidence for the existence of desiccation-specific gene expression systems in P. vanderplanki. PMID:25216354

  7. Whole-genome sequence of the Tibetan frog Nanorana parkeri and the comparative evolution of tetrapod genomes.

    PubMed

    Sun, Yan-Bo; Xiong, Zi-Jun; Xiang, Xue-Yan; Liu, Shi-Ping; Zhou, Wei-Wei; Tu, Xiao-Long; Zhong, Li; Wang, Lu; Wu, Dong-Dong; Zhang, Bao-Lin; Zhu, Chun-Ling; Yang, Min-Min; Chen, Hong-Man; Li, Fang; Zhou, Long; Feng, Shao-Hong; Huang, Chao; Zhang, Guo-Jie; Irwin, David; Hillis, David M; Murphy, Robert W; Yang, Huan-Ming; Che, Jing; Wang, Jun; Zhang, Ya-Ping

    2015-03-17

    The development of efficient sequencing techniques has resulted in large numbers of genomes being available for evolutionary studies. However, only one genome is available for all amphibians, that of Xenopus tropicalis, which is distantly related to the majority of frogs. More than 96% of frogs belong to the Neobatrachia, and no genome exists for this group. This dearth of amphibian genomes greatly restricts genomic studies of amphibians and, more generally, our understanding of tetrapod genome evolution. To fill this gap, we provide the de novo genome of a Tibetan Plateau frog, Nanorana parkeri, and compare it to that of X. tropicalis and other vertebrates. This genome encodes more than 20,000 protein-coding genes, a number similar to that of Xenopus. Although the genome size of Nanorana is considerably larger than that of Xenopus (2.3 vs. 1.5 Gb), most of the difference is due to the respective number of transposable elements in the two genomes. The two frogs exhibit considerable conserved whole-genome synteny despite having diverged approximately 266 Ma, indicating a slow rate of DNA structural evolution in anurans. Multigenome synteny blocks further show that amphibians have fewer interchromosomal rearrangements than mammals but have a comparable rate of intrachromosomal rearrangements. Our analysis also identifies 11 Mb of anuran-specific highly conserved elements that will be useful for comparative genomic analyses of frogs. The Nanorana genome offers an improved understanding of evolution of tetrapod genomes and also provides a genomic reference for other evolutionary studies.

  8. Whole-genome sequence of the Tibetan frog Nanorana parkeri and the comparative evolution of tetrapod genomes

    PubMed Central

    Sun, Yan-Bo; Xiong, Zi-Jun; Xiang, Xue-Yan; Liu, Shi-Ping; Zhou, Wei-Wei; Tu, Xiao-Long; Zhong, Li; Wang, Lu; Wu, Dong-Dong; Zhang, Bao-Lin; Zhu, Chun-Ling; Yang, Min-Min; Chen, Hong-Man; Li, Fang; Zhou, Long; Feng, Shao-Hong; Huang, Chao; Zhang, Guo-Jie; Irwin, David; Hillis, David M.; Murphy, Robert W.; Yang, Huan-Ming; Che, Jing; Wang, Jun; Zhang, Ya-Ping

    2015-01-01

    The development of efficient sequencing techniques has resulted in large numbers of genomes being available for evolutionary studies. However, only one genome is available for all amphibians, that of Xenopus tropicalis, which is distantly related to the majority of frogs. More than 96% of frogs belong to the Neobatrachia, and no genome exists for this group. This dearth of amphibian genomes greatly restricts genomic studies of amphibians and, more generally, our understanding of tetrapod genome evolution. To fill this gap, we provide the de novo genome of a Tibetan Plateau frog, Nanorana parkeri, and compare it to that of X. tropicalis and other vertebrates. This genome encodes more than 20,000 protein-coding genes, a number similar to that of Xenopus. Although the genome size of Nanorana is considerably larger than that of Xenopus (2.3 vs. 1.5 Gb), most of the difference is due to the respective number of transposable elements in the two genomes. The two frogs exhibit considerable conserved whole-genome synteny despite having diverged approximately 266 Ma, indicating a slow rate of DNA structural evolution in anurans. Multigenome synteny blocks further show that amphibians have fewer interchromosomal rearrangements than mammals but have a comparable rate of intrachromosomal rearrangements. Our analysis also identifies 11 Mb of anuran-specific highly conserved elements that will be useful for comparative genomic analyses of frogs. The Nanorana genome offers an improved understanding of evolution of tetrapod genomes and also provides a genomic reference for other evolutionary studies. PMID:25733869

  9. Whole-genome sequence of the Tibetan frog Nanorana parkeri and the comparative evolution of tetrapod genomes.

    PubMed

    Sun, Yan-Bo; Xiong, Zi-Jun; Xiang, Xue-Yan; Liu, Shi-Ping; Zhou, Wei-Wei; Tu, Xiao-Long; Zhong, Li; Wang, Lu; Wu, Dong-Dong; Zhang, Bao-Lin; Zhu, Chun-Ling; Yang, Min-Min; Chen, Hong-Man; Li, Fang; Zhou, Long; Feng, Shao-Hong; Huang, Chao; Zhang, Guo-Jie; Irwin, David; Hillis, David M; Murphy, Robert W; Yang, Huan-Ming; Che, Jing; Wang, Jun; Zhang, Ya-Ping

    2015-03-17

    The development of efficient sequencing techniques has resulted in large numbers of genomes being available for evolutionary studies. However, only one genome is available for all amphibians, that of Xenopus tropicalis, which is distantly related to the majority of frogs. More than 96% of frogs belong to the Neobatrachia, and no genome exists for this group. This dearth of amphibian genomes greatly restricts genomic studies of amphibians and, more generally, our understanding of tetrapod genome evolution. To fill this gap, we provide the de novo genome of a Tibetan Plateau frog, Nanorana parkeri, and compare it to that of X. tropicalis and other vertebrates. This genome encodes more than 20,000 protein-coding genes, a number similar to that of Xenopus. Although the genome size of Nanorana is considerably larger than that of Xenopus (2.3 vs. 1.5 Gb), most of the difference is due to the respective number of transposable elements in the two genomes. The two frogs exhibit considerable conserved whole-genome synteny despite having diverged approximately 266 Ma, indicating a slow rate of DNA structural evolution in anurans. Multigenome synteny blocks further show that amphibians have fewer interchromosomal rearrangements than mammals but have a comparable rate of intrachromosomal rearrangements. Our analysis also identifies 11 Mb of anuran-specific highly conserved elements that will be useful for comparative genomic analyses of frogs. The Nanorana genome offers an improved understanding of evolution of tetrapod genomes and also provides a genomic reference for other evolutionary studies. PMID:25733869

  10. Comparative genome analysis of Pseudomonas genomes including Populus-associated isolates

    SciTech Connect

    Jun, Se Ran; Wassenaar, Trudy; Nookaew, Intawat; Hauser, Loren John; Wanchai, Visanu; Land, Miriam L.; Timm, Collin M.; Lu, Tse-Yuan S.; Schadt, Christopher Warren; Doktycz, Mitchel John; Pelletier, Dale A; Ussery, David W

    2016-01-01

    The Pseudomonas genus contains a metabolically versatile group of organisms that are known to occupy numerous ecological niches including the rhizosphere and endosphere of many plants influencing phylogenetic diversity and heterogeneity. In this study, comparative genome analysis was performed on over one thousand Pseudomonas genomes, including 21 Pseudomonas strains isolated from the roots of native Populus deltoides. Based on average amino acid identity, genomic clusters were identified within the Pseudomonas genus, which showed agreements with clades by NCBI and cliques by IMG. The P. fluorescens group was organized into 20 distinct genomic clusters, representing enormous diversity and heterogeneity. The species P. aeruginosa showed clear distinction in their genomic relatedness compared to other Pseudomonas species groups based on the pan and core genome analysis. The 19 isolates of our 21 Populus-associated isolates formed three distinct subgroups within the P. fluorescens major group, supported by pathway profiles analysis, while two isolates were more closely related to P. chlororaphis and P. putida. The specific genes to Populus-associated subgroups were identified where genes specific to subgroup 1 include several sensory systems such as proteins which act in two-component signal transduction, a TonB-dependent receptor, and a phosphorelay sensor; specific genes to subgroup 2 contain unique hypothetical genes; and genes specific to subgroup 3 organisms have a different hydrolase activity. IMPORTANCE The comparative genome analyses of the genus Pseudomonas that included Populus-associated isolates resulted in novel insights into high diversity of Pseudomonas. Consistent and robust genomic clusters with phylogenetic homogeneity were identified, which resolved species-clades that are not clearly defined by 16S rRNA gene sequence analysis alone. The genomic clusters may be reflective of distinct ecological niches to which the organisms have adapted, but this

  11. Comparative genome analysis of Pseudomonas genomes including Populus-associated isolates

    DOE PAGES

    Jun, Se Ran; Wassenaar, Trudy; Nookaew, Intawat; Hauser, Loren John; Wanchai, Visanu; Land, Miriam L.; Timm, Collin M.; Lu, Tse-Yuan S.; Schadt, Christopher Warren; Doktycz, Mitchel John; et al

    2016-01-01

    The Pseudomonas genus contains a metabolically versatile group of organisms that are known to occupy numerous ecological niches including the rhizosphere and endosphere of many plants influencing phylogenetic diversity and heterogeneity. In this study, comparative genome analysis was performed on over one thousand Pseudomonas genomes, including 21 Pseudomonas strains isolated from the roots of native Populus deltoides. Based on average amino acid identity, genomic clusters were identified within the Pseudomonas genus, which showed agreements with clades by NCBI and cliques by IMG. The P. fluorescens group was organized into 20 distinct genomic clusters, representing enormous diversity and heterogeneity. The species P. aeruginosa showed clear distinction in their genomic relatedness compared to other Pseudomonas species groups based on the pan and core genome analysis. The 19 isolates of our 21 Populus-associated isolates formed three distinct subgroups within the P. fluorescens major group, supported by pathway profiles analysis, while two isolates were more closely related to P. chlororaphis and P. putida. The specific genes to Populus-associated subgroups were identified where genes specific to subgroup 1 include several sensory systems such as proteins which act in two-component signal transduction, a TonB-dependent receptor, and a phosphorelay sensor; specific genes to subgroup 2 contain unique hypothetical genes; and genes specific to subgroup 3 organisms have a different hydrolase activity. IMPORTANCE The comparative genome analyses of the genus Pseudomonas that included Populus-associated isolates resulted in novel insights into high diversity of Pseudomonas. Consistent and robust genomic clusters with phylogenetic homogeneity were identified, which resolved species-clades that are not clearly defined by 16S rRNA gene sequence analysis alone. The genomic clusters may be reflective of distinct ecological niches to which the organisms have adapted, but

  12. Microarray Comparative Genomic Hybridisation Analysis Incorporating Genomic Organisation, and Application to Enterobacterial Plant Pathogens

    PubMed Central

    Pritchard, Leighton; Liu, Hui; Booth, Clare; Douglas, Emma; François, Patrice; Schrenzel, Jacques; Hedley, Peter E.; Birch, Paul R. J.; Toth, Ian K.

    2009-01-01

    Microarray comparative genomic hybridisation (aCGH) provides an estimate of the relative abundance of genomic DNA (gDNA) taken from comparator and reference organisms by hybridisation to a microarray containing probes that represent sequences from the reference organism. The experimental method is used in a number of biological applications, including the detection of human chromosomal aberrations, and in comparative genomic analysis of bacterial strains, but optimisation of the analysis is desirable in each problem domain. We present a method for analysis of bacterial aCGH data that encodes spatial information from the reference genome in a hidden Markov model. This technique is the first such method to be validated in comparisons of sequenced bacteria that diverge at the strain and at the genus level: Pectobacterium atrosepticum SCRI1043 (Pba1043) and Dickeya dadantii 3937 (Dda3937); and Lactococcus lactis subsp. lactis IL1403 and L. lactis subsp. cremoris MG1363. In all cases our method is found to outperform common and widely used aCGH analysis methods that do not incorporate spatial information. This analysis is applied to comparisons between commercially important plant pathogenic soft-rotting enterobacteria (SRE) Pba1043, P. atrosepticum SCRI1039, P. carotovorum 193, and Dda3937. Our analysis indicates that it should not be assumed that hybridisation strength is a reliable proxy for sequence identity in aCGH experiments, and robustly extends the applicability of aCGH to bacterial comparisons at the genus level. Our results in the SRE further provide evidence for a dynamic, plastic ‘accessory’ genome, revealing major genomic islands encoding gene products that provide insight into, and may play a direct role in determining, variation amongst the SRE in terms of their environmental survival, host range and aetiology, such as phytotoxin synthesis, multidrug resistance, and nitrogen fixation. PMID:19696881

  13. A Comparative Analysis of Mitochondrial Genomes in Eustigmatophyte Algae

    PubMed Central

    Ševčíková, Tereza; Klimeš, Vladimír; Zbránková, Veronika; Strnad, Hynek; Hroudová, Miluše; Vlček, Čestmír; Eliáš, Marek

    2016-01-01

    Eustigmatophyceae (Ochrophyta, Stramenopiles) is a small algal group with species of the genus Nannochloropsis being its best studied representatives. Nuclear and organellar genomes have been recently sequenced for several Nannochloropsis spp., but phylogenetically wider genomic studies are missing for eustigmatophytes. We sequenced mitochondrial genomes (mitogenomes) of three species representing most major eustigmatophyte lineages, Monodopsis sp. MarTras21, Vischeria sp. CAUP Q 202 and Trachydiscus minutus, and carried out their comparative analysis in the context of available data from Nannochloropsis and other stramenopiles, revealing a number of noticeable findings. First, mitogenomes of most eustigmatophytes are highly collinear and similar in the gene content, but extensive rearrangements and loss of three otherwise ubiquitous genes happened in the Vischeria lineage; this correlates with an accelerated evolution of mitochondrial gene sequences in this lineage. Second, eustigmatophytes appear to be the only ochrophyte group with the Atp1 protein encoded by the mitogenome. Third, eustigmatophyte mitogenomes uniquely share a truncated nad11 gene encoding only the C-terminal part of the Nad11 protein, while the N-terminal part is encoded by a separate gene in the nuclear genome. Fourth, UGA as a termination codon and the cognate release factor mRF2 were lost from mitochondria independently by the Nannochloropsis and T. minutus lineages. Finally, the rps3 gene in the mitogenome of Vischeria sp. is interrupted by the UAG codon, but the genome includes a gene for an unusual tRNA with an extended anticodon loop that we speculate may serve as a suppressor tRNA to properly decode the rps3 gene. PMID:26872774

  14. A Comparative Analysis of Mitochondrial Genomes in Eustigmatophyte Algae.

    PubMed

    Ševčíková, Tereza; Klimeš, Vladimír; Zbránková, Veronika; Strnad, Hynek; Hroudová, Miluše; Vlček, Čestmír; Eliáš, Marek

    2016-03-01

    Eustigmatophyceae (Ochrophyta, Stramenopiles) is a small algal group with species of the genus Nannochloropsis being its best studied representatives. Nuclear and organellar genomes have been recently sequenced for several Nannochloropsis spp., but phylogenetically wider genomic studies are missing for eustigmatophytes. We sequenced mitochondrial genomes (mitogenomes) of three species representing most major eustigmatophyte lineages, Monodopsis sp. MarTras21, Vischeria sp. CAUP Q 202 and Trachydiscus minutus, and carried out their comparative analysis in the context of available data from Nannochloropsis and other stramenopiles, revealing a number of noticeable findings. First, mitogenomes of most eustigmatophytes are highly collinear and similar in the gene content, but extensive rearrangements and loss of three otherwise ubiquitous genes happened in the Vischeria lineage; this correlates with an accelerated evolution of mitochondrial gene sequences in this lineage. Second, eustigmatophytes appear to be the only ochrophyte group with the Atp1 protein encoded by the mitogenome. Third, eustigmatophyte mitogenomes uniquely share a truncated nad11 gene encoding only the C-terminal part of the Nad11 protein, while the N-terminal part is encoded by a separate gene in the nuclear genome. Fourth, UGA as a termination codon and the cognate release factor mRF2 were lost from mitochondria independently by the Nannochloropsis and T. minutus lineages. Finally, the rps3 gene in the mitogenome of Vischeria sp. is interrupted by the UAG codon, but the genome includes a gene for an unusual tRNA with an extended anticodon loop that we speculate may serve as a suppressor tRNA to properly decode the rps3 gene. PMID:26872774

  15. Comparative genomics of Cylindrospermopsis raciborskii strains with differential toxicities

    PubMed Central

    2014-01-01

    Background: Cylindrospermopsis raciborskii is an invasive filamentous freshwater cyanobacterium, some strains of which produce toxins. Sporadic toxicity may be the result of gene deletion events, the horizontal transfer of toxin biosynthesis gene clusters, or other genomic variables, yet the evolutionary drivers for cyanotoxin production remain a mystery. Through examining the genomes of toxic and non-toxic strains of C. raciborskii, we hoped to gain a better understanding of the degree of similarity between these strains of common geographical origin, and what the primary differences between these strains might be. Additionally, we hoped to ascertain why some cyanobacteria possess the cylindrospermopsin biosynthesis (cyr) gene cluster and produce toxin, while others do not. It has been hypothesised that toxicity or lack thereof might confer a selective advantage to cyanobacteria under certain environmental conditions. Results: In order to examine the fundamental differences between toxic and non-toxic C. raciborskii strains, we sequenced the genomes of two closely related isolates, CS-506 (CYN+) and CS-509 (CYN-) sourced from different lakes in tropical Queensland, Australia. These genomes were then compared to a third (reference) genome from C. raciborskii CS-505 (CYN+). Genome sizes were similar across all three strains and their G + C contents were almost identical. At least 2,767 genes were shared among all three strains, including the taxonomically important rpoC1, ssuRNA, lsuRNA, cpcA, cpcB, nifB and nifH, which exhibited 99.8-100% nucleotide identity. Strains CS-506 and CS-509 contained at least 176 and 101 strain-specific (or non-homologous) genes, respectively, most of which were associated with DNA repair and modification, nutrient uptake and transport, or adaptive measures such as osmoregulation. However, the only significant genetic difference observed between the two strains was the presence or absence of the cylindrospermopsin biosynthesis gene

  16. The genome sequence of Caenorhabditis briggsae: a platform for comparative genomics.

    PubMed

    Stein, Lincoln D; Bao, Zhirong; Blasiar, Darin; Blumenthal, Thomas; Brent, Michael R; Chen, Nansheng; Chinwalla, Asif; Clarke, Laura; Clee, Chris; Coghlan, Avril; Coulson, Alan; D'Eustachio, Peter; Fitch, David H A; Fulton, Lucinda A; Fulton, Robert E; Griffiths-Jones, Sam; Harris, Todd W; Hillier, LaDeana W; Kamath, Ravi; Kuwabara, Patricia E; Mardis, Elaine R; Marra, Marco A; Miner, Tracie L; Minx, Patrick; Mullikin, James C; Plumb, Robert W; Rogers, Jane; Schein, Jacqueline E; Sohrmann, Marc; Spieth, John; Stajich, Jason E; Wei, C; Willey, David; Wilson, Richard K; Durbin, Richard; Waterston, Robert H

    2003-11-01

    The soil nematodes Caenorhabditis briggsae and Caenorhabditis elegans diverged from a common ancestor roughly 100 million years ago and yet are almost indistinguishable by eye. They have the same chromosome number and genome sizes, and they occupy the same ecological niche. To explore the basis for this striking conservation of structure and function, we have sequenced the C. briggsae genome to a high-quality draft stage and compared it to the finished C. elegans sequence. We predict approximately 19,500 protein-coding genes in the C. briggsae genome, roughly the same as in C. elegans. Of these, 12,200 have clear C. elegans orthologs, a further 6,500 have one or more clearly detectable C. elegans homologs, and approximately 800 C. briggsae genes have no detectable matches in C. elegans. Almost all of the noncoding RNAs (ncRNAs) known are shared between the two species. The two genomes exhibit extensive colinearity, and the rate of divergence appears to be higher in the chromosomal arms than in the centers. Operons, a distinctive feature of C. elegans, are highly conserved in C. briggsae, with the arrangement of genes being preserved in 96% of cases. The difference in size between the C. briggsae (estimated at approximately 104 Mbp) and C. elegans (100.3 Mbp) genomes is almost entirely due to repetitive sequence, which accounts for 22.4% of the C. briggsae genome in contrast to 16.5% of the C. elegans genome. Few, if any, repeat families are shared, suggesting that most were acquired after the two species diverged or are undergoing rapid evolution. Coclustering the C. elegans and C. briggsae proteins reveals 2,169 protein families of two or more members. Most of these are shared between the two species, but some appear to be expanding or contracting, and there seem to be as many as several hundred novel C. briggsae gene families. The C. briggsae draft sequence will greatly improve the annotation of the C. elegans genome. Based on similarity to C. briggsae, we found

  17. Diversity of Pseudomonas Genomes, Including Populus-Associated Isolates, as Revealed by Comparative Genome Analysis

    PubMed Central

    Jun, Se-Ran; Wassenaar, Trudy M.; Nookaew, Intawat; Hauser, Loren; Wanchai, Visanu; Land, Miriam; Timm, Collin M.; Lu, Tse-Yuan S.; Schadt, Christopher W.; Doktycz, Mitchel J.; Pelletier, Dale A.

    2015-01-01

    The Pseudomonas genus contains a metabolically versatile group of organisms that are known to occupy numerous ecological niches, including the rhizosphere and endosphere of many plants. Their diversity influences the phylogenetic diversity and heterogeneity of these communities. On the basis of average amino acid identity, comparative genome analysis of >1,000 Pseudomonas genomes, including 21 Pseudomonas strains isolated from the roots of native Populus deltoides (eastern cottonwood) trees resulted in consistent and robust genomic clusters with phylogenetic homogeneity. All Pseudomonas aeruginosa genomes clustered together, and these were clearly distinct from other Pseudomonas species groups on the basis of pangenome and core genome analyses. In contrast, the genomes of Pseudomonas fluorescens were organized into 20 distinct genomic clusters, representing enormous diversity and heterogeneity. Most of our 21 Populus-associated isolates formed three distinct subgroups within the major P. fluorescens group, supported by pathway profile analysis, while two isolates were more closely related to Pseudomonas chlororaphis and Pseudomonas putida. Genes specific to Populus-associated subgroups were identified. Genes specific to subgroup 1 include several sensory systems that act in two-component signal transduction, a TonB-dependent receptor, and a phosphorelay sensor. Genes specific to subgroup 2 contain hypothetical genes, and genes specific to subgroup 3 were annotated with hydrolase activity. This study justifies the need to sequence multiple isolates, especially from P. fluorescens, which displays the most genetic variation, in order to study functional capabilities from a pangenomic perspective. This information will prove useful when choosing Pseudomonas strains for use to promote growth and increase disease resistance in plants. PMID:26519390
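
    The grouping of genomes into clusters from pairwise identity values can be pictured with a small single-linkage sketch. The abstract does not state which clustering procedure or cutoff was used, so the union-find approach, the 95% threshold, and the genome names below are all hypothetical and serve only to show the general idea.

```python
def cluster_by_identity(names, identity, threshold=95.0):
    """Group genomes by single linkage on pairwise percent identity.

    names: list of genome names.
    identity: dict mapping (name_a, name_b) tuples to percent identity,
              for example average amino acid identity values.
    threshold: assumed cutoff; pairs at or above it are joined.
    """
    parent = {name: name for name in names}

    def find(name):
        # Follow parent links to the cluster representative, compressing the path.
        while parent[name] != name:
            parent[name] = parent[parent[name]]
            name = parent[name]
        return name

    for (a, b), value in identity.items():
        if value >= threshold:
            parent[find(a)] = find(b)

    clusters = {}
    for name in names:
        clusters.setdefault(find(name), []).append(name)
    return sorted(sorted(members) for members in clusters.values())


# Hypothetical identity values for four made-up genomes.
example_identity = {
    ("A", "B"): 97.0,
    ("B", "C"): 96.0,
    ("A", "C"): 90.0,
    ("A", "D"): 79.0,
    ("B", "D"): 78.0,
    ("C", "D"): 80.0,
}
example_clusters = cluster_by_identity(["A", "B", "C", "D"], example_identity)
```

    Single linkage joins A and C through B even though their direct identity is below the cutoff; other linkage rules would split them, which is one reason cluster counts depend on the method chosen.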

  18. Enabling comparative modeling of closely related genomes: Example genus Brucella

    DOE PAGES

    Faria, José P.; Edirisinghe, Janaka N.; Davis, James J.; Disz, Terrence; Hausmann, Anna; Henry, Christopher S.; Olson, Robert; Overbeek, Ross A.; Pusch, Gordon D.; Shukla, Maulik; et al

    2014-03-08

    For many scientific applications, it is highly desirable to be able to compare metabolic models of closely related genomes. In this study, we attempt to raise awareness of the fact that taking annotated genomes from public repositories and using them for metabolic model reconstructions is far from trivial because of annotation inconsistencies. We propose a protocol for comparative analysis of metabolic models of closely related genomes, using fifteen strains of the genus Brucella, which contains pathogens of both humans and livestock. This study led to the identification and subsequent correction of inconsistent annotations in the SEED database, as well as the identification of 31 biochemical reactions that are common to Brucella but were not originally identified by automated metabolic reconstructions. We are currently implementing this protocol to improve automated annotations within the SEED database, and these improvements have been propagated into PATRIC, Model-SEED, KBase and RAST. This method is an enabling step for the future creation of consistent annotation systems and high-quality model reconstructions that will support the prediction of accurate phenotypes such as pathogenicity, media requirements or type of respiration.

  19. Genomic profiling of invasive melanoma cell lines by array comparative genomic hybridization.

    PubMed

    Koroknai, Viktória; Ecsedi, Szilvia; Vízkeleti, Laura; Kiss, Tímea; Szász, István; Lukács, Andrea; Papp, Orsolya; Ádány, Róza; Balázs, Margit

    2016-04-01

    Malignant melanoma is one of the most aggressive human cancers. Invasion of cells is the first step in metastasis, resulting in cell migration through tissue compartments. We aimed to evaluate genomic alterations specifically associated with the invasive characteristics of melanoma cells. Matrigel invasion assays were used to determine the invasive properties of cell lines that originated from primary melanomas. Array comparative genomic hybridization analyses were carried out to define the chromosome copy number alterations (CNAs). Several recurrent CNAs were identified by array comparative genomic hybridization that affected melanoma-related genes. Invasive primary cell lines showed high frequencies of CNAs, including the loss of 7q and gain of 12q chromosomal regions targeting PTPN12, ADAM22, FZD1, TFPI2, GNG11, COL1A2, SMURF1, VGF, RELN and GLIPR1 genes. Gain of the GDNF (5p13.1), GPAA1, PLEC and SHARPIN (8q24.3) genes was significantly more frequent in invasive cell lines compared with the noninvasive ones. Importantly, copy number gains of these genes were also found in cell lines that originated from metastases, suggesting their role in melanoma metastasis formation. The present study describes genomic differences between invasive and noninvasive melanoma cell lines that may contribute toward the aggressive phenotype of human melanoma cells. PMID:26656572

  20. Draft genome sequence of Cellulomonas carbonis T26(T) and comparative analysis of six Cellulomonas genomes.

    PubMed

    Zhuang, Weiping; Zhang, Shengzhe; Xia, Xian; Wang, Gejiao

    2015-01-01

    Most Cellulomonas strains are cellulolytic, and this feature may be applied in straw degradation and bioremediation. In this study, Cellulomonas carbonis T26(T), Cellulomonas bogoriensis DSM 16987(T) and Cellulomonas cellasea 20108(T) were sequenced. Here we describe the draft genomic information of C. carbonis T26(T) and compare it to the related Cellulomonas genomes. Strain T26(T) has a genome size of 3,990,666 bp with a G + C content of 73.4%, containing 3418 protein-coding genes and 59 RNA genes. The results showed good correlation between the genotypes and the physiological phenotypes. This information is useful for the better application of Cellulomonas strains.
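
    Genome size and G + C content, as reported above, are simple summaries of an assembly. The sketch below shows one common way to compute them from contig sequences; the function name and the choice to leave ambiguous bases out of the G + C denominator are assumptions, not details taken from the study.

```python
def genome_stats(contigs):
    """Return (total length in bp, G + C content in percent).

    contigs: iterable of sequence strings, one per contig or scaffold.
    Ambiguous bases such as N count towards the length but are left out
    of the G + C calculation (an assumed convention).
    """
    total_length = 0
    gc_count = 0
    acgt_count = 0
    for contig in contigs:
        sequence = contig.upper()
        total_length += len(sequence)
        gc_count += sequence.count("G") + sequence.count("C")
        acgt_count += sum(sequence.count(base) for base in "ACGT")
    gc_percent = 100.0 * gc_count / acgt_count if acgt_count else 0.0
    return total_length, gc_percent


# Toy contigs, not real Cellulomonas sequence.
toy_length, toy_gc = genome_stats(["GGCC", "atgc"])
```

    For the toy input, six of the eight bases are G or C, so the function reports a length of 8 bp and a G + C content of 75%.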

  1. Genomic analysis by oligonucleotide array Comparative Genomic Hybridization utilizing formalin-fixed, paraffin-embedded tissues.

    PubMed

    Savage, Stephanie J; Hostetter, Galen

    2011-01-01

    Formalin fixation has been used to preserve tissues for more than a hundred years, and there are currently more than 300 million archival samples in the United States alone. The application of genomic protocols such as high-density oligonucleotide array Comparative Genomic Hybridization (aCGH) to formalin-fixed, paraffin-embedded (FFPE) tissues, therefore, opens an untapped resource of available tissues for research and facilitates utilization of existing clinical data in a research sample set. However, formalin fixation results in cross-linking of proteins and DNA, typically leading to such a significant degradation of DNA template that little is available for use in molecular applications. Here, we describe a protocol to circumvent formalin fixation artifact by utilizing enzymatic reactions to obtain quality DNA from a wide range of FFPE tissues for successful genome-wide discovery of gene dosage alterations in archival clinical samples.

  2. The genome sequence of Blochmannia floridanus: Comparative analysis of reduced genomes

    PubMed Central

    Gil, Rosario; Silva, Francisco J.; Zientz, Evelyn; Delmotte, François; González-Candelas, Fernando; Latorre, Amparo; Rausell, Carolina; Kamerbeek, Judith; Gadau, Jürgen; Hölldobler, Bert; van Ham, Roeland C. H. J.; Gross, Roy; Moya, Andrés

    2003-01-01

    Bacterial symbioses are widespread among insects, probably being one of the key factors of their evolutionary success. We present the complete genome sequence of Blochmannia floridanus, the primary endosymbiont of carpenter ants. Although these ants feed on a complex diet, this symbiosis very likely has a nutritional basis: Blochmannia is able to supply nitrogen and sulfur compounds to the host while it takes advantage of the host metabolic machinery. Remarkably, these bacteria lack all known genes involved in replication initiation (dnaA, priA, and recA). The phylogenetic analysis of a set of conserved protein-coding genes shows that Bl. floridanus is phylogenetically related to Buchnera aphidicola and Wigglesworthia glossinidia, the other endosymbiotic bacteria whose complete genomes have been sequenced so far. Comparative analysis of the five known genomes from insect endosymbiotic bacteria reveals they share only 313 genes, a number that may be close to the minimum gene set necessary to sustain endosymbiotic life. PMID:12886019

  3. The genome sequence of Blochmannia floridanus: comparative analysis of reduced genomes.

    PubMed

    Gil, Rosario; Silva, Francisco J; Zientz, Evelyn; Delmotte, François; González-Candelas, Fernando; Latorre, Amparo; Rausell, Carolina; Kamerbeek, Judith; Gadau, Jürgen; Hölldobler, Bert; van Ham, Roeland C H J; Gross, Roy; Moya, Andrés

    2003-08-01

    Bacterial symbioses are widespread among insects, probably being one of the key factors of their evolutionary success. We present the complete genome sequence of Blochmannia floridanus, the primary endosymbiont of carpenter ants. Although these ants feed on a complex diet, this symbiosis very likely has a nutritional basis: Blochmannia is able to supply nitrogen and sulfur compounds to the host while it takes advantage of the host metabolic machinery. Remarkably, these bacteria lack all known genes involved in replication initiation (dnaA, priA, and recA). The phylogenetic analysis of a set of conserved protein-coding genes shows that Bl. floridanus is phylogenetically related to Buchnera aphidicola and Wigglesworthia glossinidia, the other endosymbiotic bacteria whose complete genomes have been sequenced so far. Comparative analysis of the five known genomes from insect endosymbiotic bacteria reveals they share only 313 genes, a number that may be close to the minimum gene set necessary to sustain endosymbiotic life. PMID:12886019

  4. A Comparative Encyclopedia of DNA Elements in the Mouse Genome

    PubMed Central

    Yue, Feng; Cheng, Yong; Breschi, Alessandra; Vierstra, Jeff; Wu, Weisheng; Ryba, Tyrone; Sandstrom, Richard; Ma, Zhihai; Davis, Carrie; Pope, Benjamin D.; Shen, Yin; Pervouchine, Dmitri D.; Djebali, Sarah; Thurman, Bob; Kaul, Rajinder; Rynes, Eric; Kirilusha, Anthony; Marinov, Georgi K.; Williams, Brian A.; Trout, Diane; Amrhein, Henry; Fisher-Aylor, Katherine; Antoshechkin, Igor; DeSalvo, Gilberto; See, Lei-Hoon; Fastuca, Meagan; Drenkow, Jorg; Zaleski, Chris; Dobin, Alex; Prieto, Pablo; Lagarde, Julien; Bussotti, Giovanni; Tanzer, Andrea; Denas, Olgert; Li, Kanwei; Bender, M. A.; Zhang, Miaohua; Byron, Rachel; Groudine, Mark T.; McCleary, David; Pham, Long; Ye, Zhen; Kuan, Samantha; Edsall, Lee; Wu, Yi-Chieh; Rasmussen, Matthew D.; Bansal, Mukul S.; Keller, Cheryl A.; Morrissey, Christapher S.; Mishra, Tejaswini; Jain, Deepti; Dogan, Nergiz; Harris, Robert S.; Cayting, Philip; Kawli, Trupti; Boyle, Alan P.; Euskirchen, Ghia; Kundaje, Anshul; Lin, Shin; Lin, Yiing; Jansen, Camden; Malladi, Venkat S.; Cline, Melissa S.; Erickson, Drew T.; Kirkup, Vanessa M.; Learned, Katrina; Sloan, Cricket A.; Rosenbloom, Kate R.; de Sousa, Beatriz Lacerda; Beal, Kathryn; Pignatelli, Miguel; Flicek, Paul; Lian, Jin; Kahveci, Tamer; Lee, Dongwon; Kent, W. James; Santos, Miguel Ramalho; Herrero, Javier; Notredame, Cedric; Johnson, Audra; Vong, Shinny; Lee, Kristen; Bates, Daniel; Neri, Fidencio; Diegel, Morgan; Canfield, Theresa; Sabo, Peter J.; Wilken, Matthew S.; Reh, Thomas A.; Giste, Erika; Shafer, Anthony; Kutyavin, Tanya; Haugen, Eric; Dunn, Douglas; Reynolds, Alex P.; Neph, Shane; Humbert, Richard; Hansen, R. Scott; De Bruijn, Marella; Selleri, Licia; Rudensky, Alexander; Josefowicz, Steven; Samstein, Robert; Eichler, Evan E.; Orkin, Stuart H.; Levasseur, Dana; Papayannopoulou, Thalia; Chang, Kai-Hsin; Skoultchi, Arthur; Gosh, Srikanta; Disteche, Christine; Treuting, Piper; Wang, Yanli; Weiss, Mitchell J.; Blobel, Gerd A.; Good, Peter J.; Lowdon, Rebecca F.; Adams, Leslie B.; Zhou, Xiao-Qiao; Pazin, Michael J.; Feingold, Elise A.; Wold, Barbara; Taylor, James; Kellis, Manolis; Mortazavi, Ali; Weissman, Sherman M.; Stamatoyannopoulos, John; Snyder, Michael P.; Guigo, Roderic; Gingeras, Thomas R.; Gilbert, David M.; Hardison, Ross C.; Beer, Michael A.; Ren, Bing

    2014-01-01

    Summary: As the premier model organism in biomedical research, the laboratory mouse shares the majority of protein-coding genes with humans, yet the two mammals differ in significant ways. To gain greater insights into both shared and species-specific transcriptional and cellular regulatory programs in the mouse, the Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications, and replication domains throughout the mouse genome in diverse cell and tissue types. By comparing with the human genome, we not only confirm substantial conservation in the newly annotated potential functional sequences, but also find a large degree of divergence of other sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization. Our results illuminate the wide range of evolutionary forces acting on genes and their regulatory regions, and provide a general resource for research into mammalian biology and mechanisms of human diseases. PMID:25409824

  5. Beyond the thale: comparative genomics and genetics of Arabidopsis relatives.

    PubMed

    Koenig, Daniel; Weigel, Detlef

    2015-05-01

    For decades a small number of model species have rightly occupied a privileged position in laboratory experiments, but it is becoming increasingly clear that our knowledge of biology is greatly improved when informed by a broader diversity of species and evolutionary context. Arabidopsis thaliana has been the primary model organism for plants, benefiting from a high-quality reference genome sequence and resources for reverse genetics. However, recent studies have made a group of species also in the Brassicaceae family and closely related to A. thaliana a focal point for comparative molecular, genomic, phenotypic and evolutionary studies. In this Review, we emphasize how such studies complement continued study of the model plant itself, provide an evolutionary perspective and summarize our current understanding of genetic and phenotypic diversity in plants.

  6. A comparative encyclopedia of DNA elements in the mouse genome.

    PubMed

    Yue, Feng; Cheng, Yong; Breschi, Alessandra; Vierstra, Jeff; Wu, Weisheng; Ryba, Tyrone; Sandstrom, Richard; Ma, Zhihai; Davis, Carrie; Pope, Benjamin D; Shen, Yin; Pervouchine, Dmitri D; Djebali, Sarah; Thurman, Robert E; Kaul, Rajinder; Rynes, Eric; Kirilusha, Anthony; Marinov, Georgi K; Williams, Brian A; Trout, Diane; Amrhein, Henry; Fisher-Aylor, Katherine; Antoshechkin, Igor; DeSalvo, Gilberto; See, Lei-Hoon; Fastuca, Meagan; Drenkow, Jorg; Zaleski, Chris; Dobin, Alex; Prieto, Pablo; Lagarde, Julien; Bussotti, Giovanni; Tanzer, Andrea; Denas, Olgert; Li, Kanwei; Bender, M A; Zhang, Miaohua; Byron, Rachel; Groudine, Mark T; McCleary, David; Pham, Long; Ye, Zhen; Kuan, Samantha; Edsall, Lee; Wu, Yi-Chieh; Rasmussen, Matthew D; Bansal, Mukul S; Kellis, Manolis; Keller, Cheryl A; Morrissey, Christapher S; Mishra, Tejaswini; Jain, Deepti; Dogan, Nergiz; Harris, Robert S; Cayting, Philip; Kawli, Trupti; Boyle, Alan P; Euskirchen, Ghia; Kundaje, Anshul; Lin, Shin; Lin, Yiing; Jansen, Camden; Malladi, Venkat S; Cline, Melissa S; Erickson, Drew T; Kirkup, Vanessa M; Learned, Katrina; Sloan, Cricket A; Rosenbloom, Kate R; Lacerda de Sousa, Beatriz; Beal, Kathryn; Pignatelli, Miguel; Flicek, Paul; Lian, Jin; Kahveci, Tamer; Lee, Dongwon; Kent, W James; Ramalho Santos, Miguel; Herrero, Javier; Notredame, Cedric; Johnson, Audra; Vong, Shinny; Lee, Kristen; Bates, Daniel; Neri, Fidencio; Diegel, Morgan; Canfield, Theresa; Sabo, Peter J; Wilken, Matthew S; Reh, Thomas A; Giste, Erika; Shafer, Anthony; Kutyavin, Tanya; Haugen, Eric; Dunn, Douglas; Reynolds, Alex P; Neph, Shane; Humbert, Richard; Hansen, R Scott; De Bruijn, Marella; Selleri, Licia; Rudensky, Alexander; Josefowicz, Steven; Samstein, Robert; Eichler, Evan E; Orkin, Stuart H; Levasseur, Dana; Papayannopoulou, Thalia; Chang, Kai-Hsin; Skoultchi, Arthur; Gosh, Srikanta; Disteche, Christine; Treuting, Piper; Wang, Yanli; Weiss, Mitchell J; Blobel, Gerd A; Cao, Xiaoyi; Zhong, Sheng; Wang, Ting; Good, Peter J; Lowdon, Rebecca F; Adams, Leslie B; Zhou, Xiao-Qiao; Pazin, Michael J; Feingold, Elise A; Wold, Barbara; Taylor, James; Mortazavi, Ali; Weissman, Sherman M; Stamatoyannopoulos, John A; Snyder, Michael P; Guigo, Roderic; Gingeras, Thomas R; Gilbert, David M; Hardison, Ross C; Beer, Michael A; Ren, Bing

    2014-11-20

    The laboratory mouse shares the majority of its protein-coding genes with humans, making it the premier model organism in biomedical research, yet the two mammals differ in significant ways. To gain greater insights into both shared and species-specific transcriptional and cellular regulatory programs in the mouse, the Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications and replication domains throughout the mouse genome in diverse cell and tissue types. By comparing with the human genome, we not only confirm substantial conservation in the newly annotated potential functional sequences, but also find a large degree of divergence of sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization. Our results illuminate the wide range of evolutionary forces acting on genes and their regulatory regions, and provide a general resource for research into mammalian biology and mechanisms of human diseases.

  7. Towards integration of population and comparative genomics in forest trees.

    PubMed

    Ingvarsson, Pär K; Hvidsten, Torgeir R; Street, Nathaniel R

    2016-10-01

    The past decade saw the initiation of an ongoing revolution in sequencing technologies that is transforming all fields of biology. This has been driven by the advent and widespread availability of high-throughput, massively parallel short-read sequencing (MPS) platforms. These technologies have enabled previously unimaginable studies, including draft assemblies of the massive genomes of coniferous species and population-scale resequencing. Transcriptomics studies have likewise been transformed, with RNA-sequencing enabling studies in nonmodel organisms, the discovery of previously unannotated genes (novel transcripts), entirely new classes of RNAs and previously unknown regulatory mechanisms. Here we touch upon current developments in the areas of genome assembly, comparative regulomics and population genetics as they relate to studies of forest tree species. PMID:27575589

  8. WormBase: methods for data mining and comparative genomics.

    PubMed

    Harris, Todd W; Stein, Lincoln D

    2006-01-01

    WormBase is a comprehensive repository for information on Caenorhabditis elegans and related nematodes. Although the primary web-based interface of WormBase (http://www.wormbase.org/) is familiar to most C. elegans researchers, WormBase also offers powerful data-mining features for addressing questions of comparative genomics, genome structure, and evolution. In this chapter, we focus on data mining at WormBase through the use of flexible web interfaces, custom queries, and scripts. The intended audience includes users wishing to query the database beyond the confines of the web interface or fetch data en masse. No knowledge of programming is necessary or assumed, although users with intermediate skills in the Perl scripting language will be able to utilize additional data-mining approaches.

  9. Industrial Acetogenic Biocatalysts: A Comparative Metabolic and Genomic Analysis.

    PubMed

    Bengelsdorf, Frank R; Poehlein, Anja; Linder, Sonja; Erz, Catarina; Hummel, Tim; Hoffmeister, Sabrina; Daniel, Rolf; Dürre, Peter

    2016-01-01

    Synthesis gas (syngas) fermentation by anaerobic acetogenic bacteria employing the Wood-Ljungdahl pathway is a bioprocess for production of biofuels and biocommodities. The major fermentation products of the most relevant biocatalytic strains (Clostridium ljungdahlii, C. autoethanogenum, C. ragsdalei, and C. coskatii) are acetic acid and ethanol. A comparative metabolic and genomic analysis using the mentioned biocatalysts might offer targets for metabolic engineering and thus improve the production of compounds apart from ethanol. Autotrophic growth and product formation of the four wild type (WT) strains were compared in uncontrolled batch experiments. The genomes of C. ragsdalei and C. coskatii were sequenced and the genome sequences of all four biocatalytic strains analyzed in comparative manner. Growth and product spectra (acetate, ethanol, 2,3-butanediol) of C. autoethanogenum, C. ljungdahlii, and C. ragsdalei were rather similar. In contrast, C. coskatii produced significantly less ethanol and its genome sequence lacks two genes encoding aldehyde:ferredoxin oxidoreductases (AOR). Comparative genome sequence analysis of the four WT strains revealed high average nucleotide identity (ANI) of C. ljungdahlii and C. autoethanogenum (99.3%) and C. coskatii (98.3%). In contrast, C. ljungdahlii WT and C. ragsdalei WT showed an ANI-based similarity of only 95.8%. Additionally, recombinant C. ljungdahlii strains were constructed that harbor an artificial acetone synthesis operon (ASO) consisting of the following genes: adc, ctfA, ctfB, and thlA (encoding acetoacetate decarboxylase, acetoacetyl-CoA:acetate/butyrate:CoA-transferase subunits A and B, and thiolase) under the control of thlA promoter (PthlA) from C. acetobutylicum or native pta-ack promoter (Ppta-ack) from C. ljungdahlii. Respective recombinant strains produced 2-propanol rather than acetone, due to the presence of a NADPH-dependent primary-secondary alcohol dehydrogenase that converts acetone to 2

  10. Industrial Acetogenic Biocatalysts: A Comparative Metabolic and Genomic Analysis

    PubMed Central

    Bengelsdorf, Frank R.; Poehlein, Anja; Linder, Sonja; Erz, Catarina; Hummel, Tim; Hoffmeister, Sabrina; Daniel, Rolf; Dürre, Peter

    2016-01-01

    Synthesis gas (syngas) fermentation by anaerobic acetogenic bacteria employing the Wood–Ljungdahl pathway is a bioprocess for production of biofuels and biocommodities. The major fermentation products of the most relevant biocatalytic strains (Clostridium ljungdahlii, C. autoethanogenum, C. ragsdalei, and C. coskatii) are acetic acid and ethanol. A comparative metabolic and genomic analysis using the mentioned biocatalysts might offer targets for metabolic engineering and thus improve the production of compounds apart from ethanol. Autotrophic growth and product formation of the four wild type (WT) strains were compared in uncontrolled batch experiments. The genomes of C. ragsdalei and C. coskatii were sequenced and the genome sequences of all four biocatalytic strains analyzed in comparative manner. Growth and product spectra (acetate, ethanol, 2,3-butanediol) of C. autoethanogenum, C. ljungdahlii, and C. ragsdalei were rather similar. In contrast, C. coskatii produced significantly less ethanol and its genome sequence lacks two genes encoding aldehyde:ferredoxin oxidoreductases (AOR). Comparative genome sequence analysis of the four WT strains revealed high average nucleotide identity (ANI) of C. ljungdahlii and C. autoethanogenum (99.3%) and C. coskatii (98.3%). In contrast, C. ljungdahlii WT and C. ragsdalei WT showed an ANI-based similarity of only 95.8%. Additionally, recombinant C. ljungdahlii strains were constructed that harbor an artificial acetone synthesis operon (ASO) consisting of the following genes: adc, ctfA, ctfB, and thlA (encoding acetoacetate decarboxylase, acetoacetyl-CoA:acetate/butyrate:CoA-transferase subunits A and B, and thiolase) under the control of thlA promoter (PthlA) from C. acetobutylicum or native pta-ack promoter (Ppta-ack) from C. ljungdahlii. Respective recombinant strains produced 2-propanol rather than acetone, due to the presence of a NADPH-dependent primary-secondary alcohol dehydrogenase that converts acetone to 2

  12. Comparative genome analysis of Bacillus cereus group genomes with Bacillus subtilis

    SciTech Connect

    Anderson, Iain; Sorokin, Alexei; Kapatral, Vinayak; Reznik, Gary; Bhattacharya, Anamitra; Mikhailova, Natalia; Burd, Henry; Joukov, Victor; Kaznadzey, Denis; Walunas, Theresa; D'Souza, Mark; Larsen, Niels; Pusch, Gordon; Liolios, Konstantinos; Grechkin, Yuri; Lapidus, Alla; Goltsman, Eugene; Chu, Lien; Fonstein, Michael; Ehrlich, S. Dusko; Overbeek, Ross; Kyrpides, Nikos; Ivanova, Natalia

    2005-09-14

    Genome features of the Bacillus cereus group genomes (representative strains of Bacillus cereus, Bacillus anthracis and Bacillus thuringiensis subsp. israelensis) were analyzed and compared with the Bacillus subtilis genome. A core set of 1,381 protein families among the four Bacillus genomes, with an additional set of 933 families common to the B. cereus group, was identified. Differences in signal transduction pathways, membrane transporters, cell surface structures, cell wall, and S-layer proteins, suggesting differences in their phenotype, were identified. The B. cereus group has signal transduction systems including a tyrosine kinase related to two-component system histidine kinases from B. subtilis. A model for regulation of the stress-responsive sigma factor sigmaB in the B. cereus group, different from the well-studied regulation in B. subtilis, has been proposed. Despite a high degree of chromosomal synteny among these genomes, significant differences in cell wall and spore coat proteins that contribute to survival and adaptation in specific hosts have been identified.

  13. Comparative genomics of Serratia spp.: two paths towards endosymbiotic life.

    PubMed

    Manzano-Marín, Alejandro; Lamelas, Araceli; Moya, Andrés; Latorre, Amparo

    2012-01-01

    Symbiosis is a widespread phenomenon in nature, in which insects show a great number of these associations. Buchnera aphidicola, the obligate endosymbiont of aphids, coexists in some species with another intracellular bacterium, Serratia symbiotica. Of particular interest is the case of the cedar aphid Cinara cedri, where B. aphidicola BCc and S. symbiotica SCc need each other to fulfil their symbiotic role with the insect. Moreover, various features seem to indicate that S. symbiotica SCc is closer to an obligate endosymbiont than to other facultative S. symbiotica, such as the one described for the aphid Acyrthosiphon pisum (S. symbiotica SAp). This work is based on the comparative genomics of five strains of Serratia, three free-living and two endosymbiotic ones (one facultative and one obligate), which should allow us to dissect the genome reduction taking place in the adaptive process to an intracellular life-style. Using a pan-genome approach, we have identified shared and strain-specific genes from both endosymbiotic strains and gained insight into the different genetic reduction both S. symbiotica have undergone. We have identified both retained and reduced functional categories in S. symbiotica compared to the Free-Living Serratia (FLS) that seem to be related to its endosymbiotic role in their specific host-symbiont systems. By means of a phylogenomic reconstruction we have solved the position of both endosymbionts with confidence, established the probable insect-pathogen origin of the symbiotic clade as well as the high amino-acid substitution rate in S. symbiotica SCc. Finally, we were able to quantify the minimal number of rearrangements suffered in the endosymbiotic lineages and reconstruct a minimal rearrangement phylogeny. All these findings provide important evidence for the existence of at least two distinctive S. symbiotica lineages that are characterized by different rearrangements, gene content, genome size and branch lengths.

  14. Array comparative genomic hybridization in retinoma and retinoblastoma tissues.

    PubMed

    Sampieri, Katia; Amenduni, Mariangela; Papa, Filomena Tiziana; Katzaki, Eleni; Mencarelli, Maria Antonietta; Marozza, Annabella; Epistolato, Maria Carmela; Toti, Paolo; Lazzi, Stefano; Bruttini, Mirella; De Filippis, Roberta; De Francesco, Sonia; Longo, Ilaria; Meloni, Ilaria; Mari, Francesca; Acquaviva, Antonio; Hadjistilianou, Theodora; Renieri, Alessandra; Ariani, Francesca

    2009-03-01

    In retinoblastoma, two RB1 mutations are necessary for tumor development. Recurrent genomic rearrangements may represent subsequent events required for retinoblastoma progression. Array-comparative genomic hybridization was carried out in 18 eye samples, 10 from bilateral and eight from unilateral retinoblastoma patients. Two unilateral cases also showed areas of retinoma. The most frequent imbalance in retinoblastomas was 6p gain (40%), followed by gains at 1q12-q25.3, 2p24.3-p24.2, 9q22.2, and 9q33.1 and losses at 11q24.3, 13q13.2-q22.3, and 16q12.1-q21. Bilateral cases showed a lower number of imbalances than unilateral cases (P = 0.002). Unilateral cases were divided into low-level (≤ 4) and high-level (≥ 7) chromosomal instability groups. The first group presented with younger age at diagnosis (mean 511 days) compared with the second group (mean 1606 days). In one retinoma case ophthalmoscopically diagnosed as a benign lesion no rearrangements were detected, whereas the adjacent retinoblastoma displayed seven aberrations. The other retinoma case identified by retrospective histopathological examination shared three rearrangements with the adjacent retinoblastoma. Two other gene-free rearrangements were retinoma specific. One rearrangement, dup5p, was retinoblastoma specific and included the SKP2 gene. Genomic profiling indicated that the first retinoma was a pretumoral lesion, whereas the other represents a subclone of cells bearing 'benign' rearrangements overwhelmed by another subclone presenting aberrations with higher 'oncogenic' potential. In summary, the present study shows that bilateral and unilateral retinoblastoma have different chromosomal instability that correlates with the age of tumor onset in unilateral cases. This is the first report of genomic profiling in retinoma tissue, shedding light on the different nature of lesions named 'retinoma'.

  15. Comparative Genomics of the Ubiquitous, Hydrocarbon-degrading Genus Marinobacter

    NASA Astrophysics Data System (ADS)

    Singer, E.; Webb, E.; Edwards, K. J.

    2012-12-01

    The genus Marinobacter is amongst the most ubiquitous in the global oceans and strains have been isolated from a wide variety of marine environments, including offshore oil-well heads, coastal thermal springs, Antarctic sea water, saline soils and associations with diatoms and dinoflagellates. Many strains have been recognized to be important hydrocarbon degraders in various marine habitats presenting sometimes extreme pH or salinity conditions. Analysis of the genome of M. aquaeolei revealed enormous adaptation versatility with an assortment of strategies for carbon and energy acquisition, sensation, and defense. In an effort to elucidate the ecological and biogeochemical significance of the Marinobacters, seven Marinobacter strains from diverse environments were included in a comparative genomics study. Genomes were screened for metabolic and adaptation potential to elucidate the strategies responsible for the omnipresence of the Marinobacter genus and their remedial action potential in hydrocarbon-polluted waters. The core genome predominantly encodes for key genes involved in hydrocarbon degradation, biofilm-relevant processes, including utilization of external DNA, halotolerance, as well as defense mechanisms against heavy metals, antibiotics, and toxins. All Marinobacter strains were observed to degrade a wide spectrum of hydrocarbon species, including aliphatic, polycyclic aromatic as well as acyclic isoprenoid compounds. Various genes predicted to facilitate hydrocarbon degradation, e.g. alkane 1-monooxygenase, appear to have originated from lateral gene transfer as they are located on gene clusters of 10-20% lower GC-content compared to genome averages and are flanked by transposases. Top ortholog hits are found in other hydrocarbon degrading organisms, e.g. Alcanivorax borkumensis. 
    Strategies for hydrocarbon uptake encoded by various Marinobacter strains include cell surface hydrophobicity adaptation via capsular polysaccharide biosynthesis and attachment

  16. MGcV: the microbial genomic context viewer for comparative genome analysis

    PubMed Central

    2013-01-01

    Background: Conserved gene context is used in many types of comparative genome analyses. It is used to provide leads on gene function, to guide the discovery of regulatory sequences, but also to aid in the reconstruction of metabolic networks. We present the Microbial Genomic context Viewer (MGcV), an interactive, web-based application tailored to strengthen the practice of manual comparative genome context analysis for bacteria. Results: MGcV is a versatile, easy-to-use tool that renders a visualization of the genomic context of any set of selected genes, genes within a phylogenetic tree, genomic segments, or regulatory elements. It is tailored to facilitate laborious tasks such as the interactive annotation of gene function, the discovery of regulatory elements, or the sequence-based reconstruction of gene regulatory networks. We illustrate that MGcV can be used in gene function annotation by visually integrating information on prokaryotic genes, like their annotation as available from NCBI, with other annotation data such as Pfam domains, sub-cellular location predictions and gene-sequence characteristics such as GC content. We also illustrate the usefulness of the interactive features that allow the graphical selection of genes to facilitate data gathering (e.g. upstream regions, IDs or annotation) in the analysis and reconstruction of transcription regulation. Moreover, putative regulatory elements and their corresponding scores or data from RNA-seq and microarray experiments can be uploaded, visualized and interpreted in (ranked-) comparative context maps. The ranked maps allow the interpretation of predicted regulatory elements and experimental data in light of each other. Conclusion: MGcV advances the manual comparative analysis of genes and regulatory elements by providing fast and flexible integration of gene-related data combined with straightforward data retrieval. MGcV is available at http://mgcv.cmbi.ru.nl. PMID:23547764

  17. Comparative analysis of essential genes in prokaryotic genomic islands.

    PubMed

    Zhang, Xi; Peng, Chong; Zhang, Ge; Gao, Feng

    2015-07-30

    Essential genes are thought to encode proteins that carry out the basic functions needed to sustain cellular life, and genomic islands (GIs) usually contain clusters of horizontally transferred genes. It has been assumed that essential genes are unlikely to be located in GIs, but a systematic analysis of essential genes in GIs has not been performed before. Here, we analyzed the essential genes in 28 prokaryotes using statistical methods and concluded that essential genes are significantly fewer inside GIs than outside them. The functions of the 362 essential genes found in GIs were explored further by BLAST searches against the Virulence Factor Database (VFDB) and the phage/prophage sequence database of the PHAge Search Tool (PHAST). Consequently, 64 and 60 eligible essential genes were found to share sequence similarity with virulence factors and phage/prophage-related genes, respectively. We also found several toxin-related proteins and repressors encoded by these essential genes in GIs. The comparative analysis of essential genes in genomic islands will not only shed new light on the development of prediction algorithms for essential genes, but also give clues to the functionality of essential genes in genomic islands.

  18. Comparative genomics reveals mobile pathogenicity chromosomes in Fusarium

    SciTech Connect

    Ma, Li Jun; van der Does, H. C.; Borkovich, Katherine A.; Coleman, Jeffrey J.; Daboussi, Marie-Jose; Di Pietro, Antonio; Dufresne, Marie; Freitag, Michael; Grabherr, Manfred; Henrissat, Bernard; Houterman, Petra M.; Kang, Seogchan; Shim, Won-Bo; Wolochuk, Charles; Xie, Xiaohui; Xu, Jin Rong; Antoniw, John; Baker, Scott E.; Bluhm, Burton H.; Breakspear, Andrew; Brown, Daren W.; Butchko, Robert A.; Chapman, Sinead; Coulson, Richard; Coutinho, Pedro M.; Danchin, Etienne G.; Diener, Andrew; Gale, Liane R.; Gardiner, Donald; Goff, Steven; Hammond-Kossack, Kim; Hilburn, Karen; Hua-Van, Aurelie; Jonkers, Wilfried; Kazan, Kemal; Kodira, Chinnappa D.; Koehrsen, Michael; Kumar, Lokesh; Lee, Yong Hwan; Li, Liande; Manners, John M.; Miranda-Saavedra, Diego; Mukherjee, Mala; Park, Gyungsoon; Park, Jongsun; Park, Sook Young; Proctor, Robert H.; Regev, Aviv; Ruiz-Roldan, M. C.; Sain, Divya; Sakthikumar, Sharadha; Sykes, Sean; Schwartz, David C.; Turgeon, Barbara G.; Wapinski, Ilan; Yoder, Olen; Young, Sarah; Zeng, Qiandong; Zhou, Shiguo; Galagan, James; Cuomo, Christina A.; Kistler, H. Corby; Rep, Martijn

    2010-03-18

    Fusarium species are among the most important phytopathogenic and toxigenic fungi, having significant impact on crop production and animal health. Distinctively, members of the F. oxysporum species complex exhibit wide host range but discontinuously distributed host specificity, reflecting remarkable genetic adaptability. To understand the molecular underpinnings of diverse phenotypic traits and their evolution in Fusarium, we compared the genomes of three economically important and phylogenetically related, yet phenotypically diverse plant-pathogenic species, F. graminearum, F. verticillioides and F. oxysporum f. sp. lycopersici. Our analysis revealed greatly expanded lineage-specific (LS) genomic regions in F. oxysporum that include four entire chromosomes, accounting for more than one-quarter of the genome. LS regions are rich in transposons and genes with distinct evolutionary profiles but related to pathogenicity. Experimentally, we demonstrate for the first time the transfer of two LS chromosomes between strains of F. oxysporum, resulting in the conversion of a non-pathogenic strain into a pathogen. Transfer of LS chromosomes between otherwise genetically isolated strains explains the polyphyletic origin of host specificity and the emergence of new pathogenic lineages in the F. oxysporum species complex, putting the evolution of fungal pathogenicity into a new perspective.

  19. Comparative genomic analysis of ten Streptococcus pneumoniae temperate bacteriophages.

    PubMed

    Romero, Patricia; Croucher, Nicholas J; Hiller, N Luisa; Hu, Fen Z; Ehrlich, Garth D; Bentley, Stephen D; García, Ernesto; Mitchell, Tim J

    2009-08-01

    Streptococcus pneumoniae is an important human pathogen that often carries temperate bacteriophages. As part of a program to characterize the genetic makeup of prophages associated with clinical strains and to assess the potential roles that they play in the biology and pathogenesis in their host, we performed comparative genomic analysis of 10 temperate pneumococcal phages. All of the genomes are organized into five major gene clusters: lysogeny, replication, packaging, morphogenesis, and lysis clusters. All of the phage particles observed showed a Siphoviridae morphology. The only genes that are well conserved in all the genomes studied are those involved in the integration and the lysis of the host in addition to two genes, of unknown function, within the replication module. We observed that a high percentage of the open reading frames contained no similarities to any sequences catalogued in public databases; however, genes that were homologous to known phage virulence genes, including the pblB gene of Streptococcus mitis and the vapE gene of Dichelobacter nodosus, were also identified. Interestingly, bioinformatic tools showed the presence of a toxin-antitoxin system in the phage phiSpn_6, and this represents the first time that an addiction system in a pneumophage has been identified. Collectively, the temperate pneumophages contain a diverse set of genes with various levels of similarity among them. PMID:19502408

  20. Comparative Genomics of Flatworms (Platyhelminthes) Reveals Shared Genomic Features of Ecto- and Endoparastic Neodermata

    PubMed Central

    Hahn, Christoph; Fromm, Bastian; Bachmann, Lutz

    2014-01-01

    The ectoparasitic Monogenea comprise a major part of the obligate parasitic flatworm diversity. Although genomic adaptations to parasitism have been studied in the endoparasitic tapeworms (Cestoda) and flukes (Trematoda), no representative of the Monogenea has been investigated yet. We present the high-quality draft genome of Gyrodactylus salaris, an economically important monogenean ectoparasite of wild Atlantic salmon (Salmo salar). A total of 15,488 gene models were identified, of which 7,102 were functionally annotated. The controversial phylogenetic relationships within the obligate parasitic Neodermata were resolved in a phylogenomic analysis using 1,719 gene models (alignment length of >500,000 amino acids) for a set of 16 metazoan taxa. The Monogenea were found basal to the Cestoda and Trematoda, which implies ectoparasitism being plesiomorphic within the Neodermata and strongly supports a common origin of complex life cycles. Comparative analysis of seven parasitic flatworm genomes identified shared genomic features for the ecto- and endoparasitic lineages, such as a substantial reduction of the core bilaterian gene complement, including the homeodomain-containing genes, and a loss of the piwi and vasa genes, which are considered essential for animal development. Furthermore, the shared loss of functional fatty acid biosynthesis pathways and the absence of peroxisomes, the latter organelles presumed ubiquitous in eukaryotes except for parasitic protozoans, were inferred. The draft genome of G. salaris opens for future in-depth analyses of pathogenicity and host specificity of poorly characterized G. salaris strains, and will enhance studies addressing the genomics of host–parasite interactions and speciation in the highly diverse monogenean flatworms. PMID:24732282

  1. Comparative genomics of flatworms (platyhelminthes) reveals shared genomic features of ecto- and endoparastic neodermata.

    PubMed

    Hahn, Christoph; Fromm, Bastian; Bachmann, Lutz

    2014-05-01

    The ectoparasitic Monogenea comprise a major part of the obligate parasitic flatworm diversity. Although genomic adaptations to parasitism have been studied in the endoparasitic tapeworms (Cestoda) and flukes (Trematoda), no representative of the Monogenea has been investigated yet. We present the high-quality draft genome of Gyrodactylus salaris, an economically important monogenean ectoparasite of wild Atlantic salmon (Salmo salar). A total of 15,488 gene models were identified, of which 7,102 were functionally annotated. The controversial phylogenetic relationships within the obligate parasitic Neodermata were resolved in a phylogenomic analysis using 1,719 gene models (alignment length of >500,000 amino acids) for a set of 16 metazoan taxa. The Monogenea were found basal to the Cestoda and Trematoda, which implies ectoparasitism being plesiomorphic within the Neodermata and strongly supports a common origin of complex life cycles. Comparative analysis of seven parasitic flatworm genomes identified shared genomic features for the ecto- and endoparasitic lineages, such as a substantial reduction of the core bilaterian gene complement, including the homeodomain-containing genes, and a loss of the piwi and vasa genes, which are considered essential for animal development. Furthermore, the shared loss of functional fatty acid biosynthesis pathways and the absence of peroxisomes, the latter organelles presumed ubiquitous in eukaryotes except for parasitic protozoans, were inferred. The draft genome of G. salaris opens for future in-depth analyses of pathogenicity and host specificity of poorly characterized G. salaris strains, and will enhance studies addressing the genomics of host-parasite interactions and speciation in the highly diverse monogenean flatworms.

  2. Comparative Genomic Analysis of Mannheimia haemolytica from Bovine Sources.

    PubMed

    Klima, Cassidy L; Cook, Shaun R; Zaheer, Rahat; Laing, Chad; Gannon, Vick P; Xu, Yong; Rasmussen, Jay; Potter, Andrew; Hendrick, Steve; Alexander, Trevor W; McAllister, Tim A

    2016-01-01

    Bovine respiratory disease is a common health problem in beef production. The primary bacterial agent involved, Mannheimia haemolytica, is a target for antimicrobial therapy and at risk for associated antimicrobial resistance development. The role of M. haemolytica in pathogenesis is linked to serotype with serotypes 1 (S1) and 6 (S6) isolated from pneumonic lesions and serotype 2 (S2) found in the upper respiratory tract of healthy animals. Here, we sequenced the genomes of 11 strains of M. haemolytica, representing all three serotypes and performed comparative genomics analysis to identify genetic features that may contribute to pathogenesis. Possible virulence associated genes were identified within 14 distinct prophage, including a periplasmic chaperone, a lipoprotein, peptidoglycan glycosyltransferase and a stress response protein. Prophage content ranged from 2-8 per genome, but was higher in S1 and S6 strains. A type I-C CRISPR-Cas system was identified in each strain with spacer diversity and organization conserved among serotypes. The majority of spacers occur in S1 and S6 strains and originate from phage suggesting that serotypes 1 and 6 may be more resistant to phage predation. However, two spacers complementary to the host chromosome targeting a UDP-N-acetylglucosamine 2-epimerase and a glycosyl transferases group 1 gene are present in S1 and S6 strains only indicating these serotypes may employ CRISPR-Cas to regulate gene expression to avoid host immune responses or enhance adhesion during infection. Integrative conjugative elements are present in nine of the eleven genomes. Three of these harbor extensive multi-drug resistance cassettes encoding resistance against the majority of drugs used to combat infection in beef cattle, including macrolides and tetracyclines used in human medicine. 
    The findings here identify key features that are likely contributing to serotype related pathogenesis and specific targets for vaccine design intended to reduce the

  3. Identification, characterization and comparative genomics of chimpanzee endogenous retroviruses

    PubMed Central

    Polavarapu, Nalini; Bowen, Nathan J; McDonald, John F

    2006-01-01

    Background Retrotransposons, the most abundant and widespread class of eukaryotic transposable elements, are believed to play a significant role in mutation and disease and to have contributed significantly to the evolution of genome structure and function. The recent sequencing of the chimpanzee genome is providing an unprecedented opportunity to study the functional significance of these elements in two closely related primate species and to better evaluate their role in primate evolution. Results We report here that the chimpanzee genome contains at least 42 separate families of endogenous retroviruses, nine of which were not previously identified. All but two (CERV 1/PTERV1 and CERV 2) of the 42 families of chimpanzee endogenous retroviruses were found to have orthologs in humans. Molecular analysis (PCR and Southern hybridization) of CERV 2 elements demonstrates that this family is present in chimpanzee, bonobo, gorilla and old-world monkeys but absent in human, orangutan and new-world monkeys. A survey of endogenous retroviral positional variation between chimpanzees and humans determined that approximately 7% of all chimpanzee-human INDEL variation is associated with endogenous retroviral sequences. Conclusion Nine families of chimpanzee endogenous retroviruses have been transpositionally active since chimpanzees and humans diverged from a common ancestor. Seven of these transpositionally active families have orthologs in humans, one of which has also been transpositionally active in humans since the human-chimpanzee divergence about six million years ago. Comparative analyses of orthologous regions of the human and chimpanzee genomes have revealed that a significant portion of INDEL variation between chimpanzees and humans is attributable to endogenous retroviruses and may be of evolutionary significance. PMID:16805923

  4. Comparative Genomic Analysis of Mannheimia haemolytica from Bovine Sources

    PubMed Central

    Klima, Cassidy L.; Cook, Shaun R.; Zaheer, Rahat; Laing, Chad; Gannon, Vick P.; Xu, Yong; Rasmussen, Jay; Potter, Andrew; Hendrick, Steve; Alexander, Trevor W.; McAllister, Tim A.

    2016-01-01

    Bovine respiratory disease is a common health problem in beef production. The primary bacterial agent involved, Mannheimia haemolytica, is a target for antimicrobial therapy and at risk for associated antimicrobial resistance development. The role of M. haemolytica in pathogenesis is linked to serotype with serotypes 1 (S1) and 6 (S6) isolated from pneumonic lesions and serotype 2 (S2) found in the upper respiratory tract of healthy animals. Here, we sequenced the genomes of 11 strains of M. haemolytica, representing all three serotypes and performed comparative genomics analysis to identify genetic features that may contribute to pathogenesis. Possible virulence associated genes were identified within 14 distinct prophage, including a periplasmic chaperone, a lipoprotein, peptidoglycan glycosyltransferase and a stress response protein. Prophage content ranged from 2–8 per genome, but was higher in S1 and S6 strains. A type I-C CRISPR-Cas system was identified in each strain with spacer diversity and organization conserved among serotypes. The majority of spacers occur in S1 and S6 strains and originate from phage suggesting that serotypes 1 and 6 may be more resistant to phage predation. However, two spacers complementary to the host chromosome targeting a UDP-N-acetylglucosamine 2-epimerase and a glycosyl transferases group 1 gene are present in S1 and S6 strains only indicating these serotypes may employ CRISPR-Cas to regulate gene expression to avoid host immune responses or enhance adhesion during infection. Integrative conjugative elements are present in nine of the eleven genomes. Three of these harbor extensive multi-drug resistance cassettes encoding resistance against the majority of drugs used to combat infection in beef cattle, including macrolides and tetracyclines used in human medicine. 
    The findings here identify key features that are likely contributing to serotype related pathogenesis and specific targets for vaccine design intended to reduce the

  5. Reconstructing the Evolution of Brachypodium Genomes Using Comparative Chromosome Painting

    PubMed Central

    Betekhtin, Alexander; Jenkins, Glyn; Hasterok, Robert

    2014-01-01

    Brachypodium distachyon is a model for the temperate cereals and grasses and has a biology, genomics infrastructure and cytogenetic platform fit for purpose. It is a member of a genus with fewer than 20 species, which have different genome sizes, basic chromosome numbers and ploidy levels. The phylogeny and interspecific relationships of this group have not to date been resolved by sequence comparisons and karyotypical studies. The aims of this study are not only to reconstruct the evolution of Brachypodium karyotypes to resolve the phylogeny, but also to highlight the mechanisms that shape the evolution of grass genomes. This was achieved through the use of comparative chromosome painting (CCP), which hybridises fluorescent, chromosome-specific probes derived from B. distachyon to homoeologous meiotic chromosomes of its close relatives. The study included five diploids (B. distachyon 2n = 10, B. sylvaticum 2n = 18, B. pinnatum 2n = 16; 2n = 18, B. arbuscula 2n = 18 and B. stacei 2n = 20), three allotetraploids (B. pinnatum 2n = 28, B. phoenicoides 2n = 28 and B. hybridum 2n = 30), and two species of unknown ploidy (B. retusum 2n = 38 and B. mexicanum 2n = 40). On the basis of the patterns of hybridisation and incorporating published data, we propose two alternative, but similar, models of karyotype evolution in the genus Brachypodium. According to the first model, the extant genome of B. distachyon derives from B. mexicanum or B. stacei by several rounds of descending dysploidy, and the other diploids evolve from B. distachyon via ascending dysploidy. The allotetraploids arise by interspecific hybridisation and chromosome doubling between B. distachyon and other diploids. The second model differs from the first insofar as it incorporates an intermediate 2n = 18 species between the B. mexicanum or B. stacei progenitors and the dysploidic B. distachyon. PMID:25493646

  6. Reconstructing the Evolution of Brachypodium Genomes Using Comparative Chromosome Painting.

    PubMed

    Betekhtin, Alexander; Jenkins, Glyn; Hasterok, Robert

    2014-01-01

    Brachypodium distachyon is a model for the temperate cereals and grasses and has a biology, genomics infrastructure and cytogenetic platform fit for purpose. It is a member of a genus with fewer than 20 species, which have different genome sizes, basic chromosome numbers and ploidy levels. The phylogeny and interspecific relationships of this group have not to date been resolved by sequence comparisons and karyotypical studies. The aims of this study are not only to reconstruct the evolution of Brachypodium karyotypes to resolve the phylogeny, but also to highlight the mechanisms that shape the evolution of grass genomes. This was achieved through the use of comparative chromosome painting (CCP), which hybridises fluorescent, chromosome-specific probes derived from B. distachyon to homoeologous meiotic chromosomes of its close relatives. The study included five diploids (B. distachyon 2n = 10, B. sylvaticum 2n = 18, B. pinnatum 2n = 16; 2n = 18, B. arbuscula 2n = 18 and B. stacei 2n = 20), three allotetraploids (B. pinnatum 2n = 28, B. phoenicoides 2n = 28 and B. hybridum 2n = 30), and two species of unknown ploidy (B. retusum 2n = 38 and B. mexicanum 2n = 40). On the basis of the patterns of hybridisation and incorporating published data, we propose two alternative, but similar, models of karyotype evolution in the genus Brachypodium. According to the first model, the extant genome of B. distachyon derives from B. mexicanum or B. stacei by several rounds of descending dysploidy, and the other diploids evolve from B. distachyon via ascending dysploidy. The allotetraploids arise by interspecific hybridisation and chromosome doubling between B. distachyon and other diploids. The second model differs from the first insofar as it incorporates an intermediate 2n = 18 species between the B. mexicanum or B. stacei progenitors and the dysploidic B. distachyon. PMID:25493646

  7. Comparative effect of thymol or its glucose conjugate, thymol-β-D-glucopyranoside, on Campylobacter in avian gut contents.

    PubMed

    Epps, Sharon V R; Harvey, Roger B; Byrd, J Allen; Petrujkić, Branko T; Sedej, Ivana; Beier, Ross C; Phillips, Timothy D; Hume, Michael E; Anderson, Robin C; Nisbet, David J

    2015-01-01

    Campylobacter jejuni is an important human food-borne pathogen that can contaminate meat and poultry during processing. Consequently, strategies are sought to reduce the carriage of C. jejuni in food animals before they arrive at the abattoir. Thymol is a natural product that reduces survivability of Campylobacter in vitro, but its rapid absorption from the proximal alimentary tract limits its bactericidal efficacy in vivo. Thymol-β-D-glucopyranoside is more resistant to absorption than free thymol, but its administration to chickens has not been reported. In the present studies, 1 mM thymol-β-D-glucopyranoside was shown to exhibit nearly the same anti-Campylobacter activity as 1 mM thymol when incubated anaerobically in avian crop or cecal contents in vitro, resulting in reductions of 1.10-2.32 log10 colony forming units mL⁻¹ in C. jejuni concentrations after 24 h incubation. In a follow-up live animal study, oral administration of thymol-β-D-glucopyranoside, but not free thymol, significantly lowered (>10-fold) recovery of Campylobacter from the crop of market-aged broilers when compared to placebo-treated controls (n = 6 broilers/treatment). Neither thymol-β-D-glucopyranoside nor thymol affected recovery of Campylobacter from cecal contents of the treated broilers. These results indicate that rapid absorption or passage of free thymol from the crop precluded its anti-Campylobacter activity at this site and throughout the entire gastrointestinal tract. Conversely, lower recovery of Campylobacter from the crop of birds treated with thymol-β-D-glucopyranoside indicates this conjugate was retained and able to be hydrolyzed to biologically active free thymol at this site as intended, yet was not sufficiently protected to allow passage of efficacious amounts of the intact glycoside to the lower gut. Nevertheless, these results warrant further research to see if higher doses or encapsulation of thymol-β-D-glucopyranoside or similar glycosides may yield an

  8. Genomic Sequencing of Orientia tsutsugamushi Strain Karp, an Assembly Comparable to the Genome Size of the Strain Ikeda.

    PubMed

    Liao, Hsiao-Mei; Chao, Chien-Chung; Lei, Haiyan; Li, Bingjie; Tsai, Shien; Hung, Guo-Chiuan; Ching, Wei-Mei; Lo, Shyh-Ching

    2016-01-01

    Orientia tsutsugamushi, an intracellular bacterium, belongs to the family Rickettsiaceae. This study presents the draft genome sequence of strain Karp, with 2.0 Mb as the size of the completed genome. This nearly finished draft genome sequence was annotated with the RAST server and the contents compared to those of the other strains. PMID:27540052

  9. Genomic Sequencing of Orientia tsutsugamushi Strain Karp, an Assembly Comparable to the Genome Size of the Strain Ikeda

    PubMed Central

    Liao, Hsiao-Mei; Chao, Chien-Chung; Lei, Haiyan; Li, Bingjie; Tsai, Shien; Hung, Guo-Chiuan

    2016-01-01

    Orientia tsutsugamushi, an intracellular bacterium, belongs to the family Rickettsiaceae. This study presents the draft genome sequence of strain Karp, with 2.0 Mb as the size of the completed genome. This nearly finished draft genome sequence was annotated with the RAST server and the contents compared to those of the other strains. PMID:27540052

  10. Genomic Sequencing of Orientia tsutsugamushi Strain Karp, an Assembly Comparable to the Genome Size of the Strain Ikeda.

    PubMed

    Liao, Hsiao-Mei; Chao, Chien-Chung; Lei, Haiyan; Li, Bingjie; Tsai, Shien; Hung, Guo-Chiuan; Ching, Wei-Mei; Lo, Shyh-Ching

    2016-08-18

    Orientia tsutsugamushi, an intracellular bacterium, belongs to the family Rickettsiaceae. This study presents the draft genome sequence of strain Karp, with 2.0 Mb as the size of the completed genome. This nearly finished draft genome sequence was annotated with the RAST server and the contents compared to those of the other strains.

  11. Avian Influenza H7N9/13 and H7N7/13: a Comparative Virulence Study in Chickens, Pigeons, and Ferrets

    PubMed Central

    Kalthoff, Donata; Bogs, Jessica; Grund, Christian; Tauscher, Kerstin; Teifke, Jens P.; Starick, Elke; Harder, Timm

    2014-01-01

    ABSTRACT Human influenza cases caused by a novel avian H7N9 virus in China emphasize the zoonotic potential of that subtype. We compared the infectivity and pathogenicity of the novel H7N9 virus with those of a recent European avian H7N7 strain in chickens, pigeons, and ferrets. Neither virus induced signs of disease despite substantial replication in inoculated chickens and rapid transmission to contact chickens. Evidence of the replication of both viruses in pigeons, albeit at lower levels of RNA excretion, was also detected. No clear-cut differences between the two H7 isolates emerged regarding replication and antibody development in avian hosts. In ferrets, in contrast, greater replication of the avian H7N9 virus than of the H7N7 strain was observed with significant differences in viral presence, e.g., in nasal wash, lung, and cerebellum samples. Importantly, both viruses showed the potential to spread to the mammal brain. We conclude that efficient asymptomatic viral replication and shedding, as shown in chickens, facilitate the spread of H7 viruses that may harbor zoonotic potential. Biosafety measures are required for the handling of poultry infected with avian influenza viruses of the H7 subtype, independently of their pathogenicity for gallinaceous poultry. IMPORTANCE This study is important to the field since it provides data about the behavior of the novel H7N9 avian influenza virus in chickens, pigeons, and ferrets in comparison with that of a recent low-pathogenicity H7N7 strain isolated from poultry. We clearly show that chickens, but not pigeons, are highly permissive hosts of both H7 viruses, allowing high-titer replication and virus shedding without any relevant clinical signs. In the ferret model, the potential of both viruses to infect mammals could be demonstrated, including infection of the brain. However, the replication efficiency of the H7N9 virus in ferrets was higher than that of the H7N7 strain. In conclusion, valuable data for the risk

  12. Comparative analysis of genomic signal processing for microarray data clustering.

    PubMed

    Istepanian, Robert S H; Sungoor, Ala; Nebel, Jean-Christophe

    2011-12-01

    Genomic signal processing is a new area of research that combines advanced digital signal processing methodologies for enhanced genetic data analysis. It has many promising applications in bioinformatics and next generation of healthcare systems, in particular, in the field of microarray data clustering. In this paper we present a comparative performance analysis of enhanced digital spectral analysis methods for robust clustering of gene expression across multiple microarray data samples. Three digital signal processing methods: linear predictive coding, wavelet decomposition, and fractal dimension are studied to provide a comparative evaluation of the clustering performance of these methods on several microarray datasets. The results of this study show that the fractal approach provides the best clustering accuracy compared to other digital signal processing and well known statistical methods.
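
    As a rough illustration of the fractal-dimension idea mentioned above, the sketch below treats one gene's expression profile across samples as a signal and computes a single fractal-dimension feature that could then be passed to any clustering routine. The abstract does not say which estimator the authors used; the Katz estimator, the function name katz_fd and the toy profiles are all assumptions made for this example.

```python
import math


def katz_fd(signal):
    """Katz fractal dimension of a 1-D profile sampled at unit spacing.

    Assumption: the Katz estimator is used here only as an illustration;
    the abstract does not name the estimator applied in the study.
    """
    n = len(signal) - 1  # number of steps along the curve
    if n < 2:
        raise ValueError("need at least three samples")
    # Total curve length: sum of distances between successive points.
    length = sum(math.hypot(1.0, signal[i + 1] - signal[i]) for i in range(n))
    # Extent: largest distance from the first point to any later point.
    extent = max(math.hypot(i, signal[i] - signal[0]) for i in range(1, n + 1))
    return math.log10(n) / (math.log10(n) + math.log10(extent / length))


# Hypothetical expression profiles (arbitrary units) for two genes.
flat_profile = [0.0, 0.0, 0.0, 0.0, 0.0]
zigzag_profile = [0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0]

print(katz_fd(flat_profile))
print(katz_fd(zigzag_profile))
```

    A flat profile gives a dimension of 1, and more irregular profiles give larger values, which is what makes the quantity usable as a clustering feature.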

  13. Characterization of copy number variation in genomic regions containing STR loci using array comparative genomic hybridization.

    PubMed

    Repnikova, Elena A; Rosenfeld, Jill A; Bailes, Andrea; Weber, Cecilia; Erdman, Linda; McKinney, Aimee; Ramsey, Sarah; Hashimoto, Sayaka; Lamb Thrush, Devon; Astbury, Caroline; Reshmi, Shalini C; Shaffer, Lisa G; Gastier-Foster, Julie M; Pyatt, Robert E

    2013-09-01

    Short tandem repeat (STR) loci are commonly used in forensic casework, familial analysis for human identification, and for monitoring hematopoietic cell engraftment after bone marrow transplant. Unexpected genetic variation leading to sequence and length differences in STR loci can complicate STR typing, and presents challenges in casework interpretation. Copy number variation (CNV) is a relatively recently identified form of genetic variation consisting of genomic regions present at variable copy numbers within an individual compared to a reference genome. Large scale population studies have demonstrated that likely all individuals carry multiple regions with CNV of 1 kb in size or greater in their genome. To date, no study correlating genomic regions containing STR loci with CNV has been conducted. In this study, we analyzed results from 32,850 samples sent for clinical array comparative genomic hybridization (CGH) analysis for the presence of CNV at regions containing the 13 CODIS (Combined DNA Index System) STR loci and the Amelogenin X (AMELX) and Amelogenin Y (AMELY) loci. Thirty-two individuals with CNV involving STR loci on chromosomes 2, 4, 7, 11, 12, 13, 16, and 21, and twelve with CNV involving the AMELX/AMELY loci were identified. These results were correlated with data from publicly available databases housing information on CNV identified in normal populations and additional clinical cases. These collective results demonstrate the presence of CNV in regions containing 9 of the 13 CODIS STR and AMELX/Y loci. Further characterization of STR profiles within regions of CNV, additional cataloging of these variants in multiple populations, and contributing such examples to the public domain will provide valuable information for reliable use of these loci.

  14. Systematic prediction of cis-regulatory elements in the Chlamydomonas reinhardtii genome using comparative genomics.

    PubMed

    Ding, Jun; Li, Xiaoman; Hu, Haiyan

    2012-10-01

    Chlamydomonas reinhardtii is one of the most important microalgae model organisms and has been widely studied toward the understanding of chloroplast functions and various cellular processes. Further exploitation of C. reinhardtii as a model system to elucidate various molecular mechanisms and pathways requires systematic study of gene regulation. However, there is a general lack of genome-scale gene regulation study, such as global cis-regulatory element (CRE) identification, in C. reinhardtii. Recently, large-scale genomic data in microalgae species have become available, which enable the development of efficient computational methods to systematically identify CREs and characterize their roles in microalgae gene regulation. Here, we performed in silico CRE identification at the whole genome level in C. reinhardtii using a comparative genomics-based method. We predicted a large number of CREs in C. reinhardtii that are consistent with experimentally verified CREs. We also discovered that a large percentage of these CREs form combinations and have the potential to work together for coordinated gene regulation in C. reinhardtii. Multiple lines of evidence from literature, gene transcriptional profiles, and gene annotation resources support our prediction. The predicted CREs will serve, to our knowledge, as the first large-scale collection of CREs in C. reinhardtii to facilitate further experimental study of microalgae gene regulation. The accompanying software tool and the predictions in C. reinhardtii are also made available through a Web-accessible database (http://hulab.ucf.edu/research/projects/Microalgae/sdcre/motifcomb.html).

  15. High resolution comparative genomic hybridisation in clinical cytogenetics

    PubMed Central

    Kirchhoff, M.; Rose, H.; Lundsteen, C.

    2001-01-01

    High resolution comparative genomic hybridisation (HR-CGH) is a diagnostic tool in our clinical cytogenetics laboratory. The present survey reports the results of 253 clinical cases in which 47 abnormalities were detected. Among 144 dysmorphic and mentally retarded subjects with a normal conventional karyotype, 15 (10%) had small deletions or duplications, of which 11 were interstitial. In addition, a case of mosaic trisomy 9 was detected. Among 25 dysmorphic and mentally retarded subjects carrying apparently balanced de novo translocations, four had deletions at translocation breakpoints and two had deletions elsewhere in the genome. Seventeen of 19 complex rearrangements were clarified by HR-CGH. A small supernumerary marker chromosome occurring with low frequency and the breakpoint of a mosaic r(18) case could not be clarified. Three of 19 other abnormalities could not be confirmed by HR-CGH. One was a Williams syndrome deletion and two were DiGeorge syndrome deletions, which were apparently below the resolution of HR-CGH. However, we were able to confirm Angelman and Prader-Willi syndrome deletions, which are about 3-5 Mb. We conclude that HR-CGH should be used for the evaluation of (1) dysmorphic and mentally retarded subjects where normal karyotyping has failed to show abnormalities, (2) dysmorphic and mentally retarded subjects carrying apparently balanced de novo translocations, (3) apparently balanced de novo translocations detected prenatally, and (4) for clarification of complex structural rearrangements.


    Keywords: comparative genomic hybridisation; chromosome analysis; chromosome aberrations; dysmorphism PMID:11694545

  16. Canine urothelial carcinoma: genomically aberrant and comparatively relevant

    PubMed Central

    Shapiro, S. G.; Raghunath, S.; Williams, C.; Motsinger-Reif, A. A.; Cullen, J. M.; Liu, T.; Albertson, D.; Ruvolo, M.; Lucas, A. Bergstrom; Jin, J.; Knapp, D. W.; Schiffman, J. D.

    2015-01-01

    Urothelial carcinoma (UC), also referred to as transitional cell carcinoma (TCC), is the most common bladder malignancy in both human and canine populations. In human UC, numerous studies have demonstrated the prevalence of chromosomal imbalances. Although the histopathology of the disease is similar in both species, studies evaluating the genomic profile of canine UC are lacking, limiting the discovery of key comparative molecular markers associated with driving UC pathogenesis. In the present study, we evaluated 31 primary canine UC biopsies by oligonucleotide array comparative genomic hybridization (oaCGH). Results highlighted the presence of three highly recurrent numerical aberrations: gain of dog chromosome (CFA) 13 and 36 and loss of CFA 19. Regional gains of CFA 13 and 36 were present in 97% and 84% of cases, respectively, and losses on CFA 19 were present in 77% of cases. Fluorescence in situ hybridization (FISH), using targeted bacterial artificial chromosome (BAC) clones and custom Agilent SureFISH probes, was performed to detect and quantify these regions in paraffin-embedded biopsy sections and urine-derived urothelial cells. The data indicate that these three aberrations are potentially diagnostic of UC. Comparison of our canine oaCGH data with that of 285 human cases identified a series of shared copy number aberrations. Using an informatics approach to interrogate the frequency of copy number aberrations across both species, we identified those that had the highest joint probability of association with UC. The most significant joint region contained the gene PABPC1, which should be considered further for its role in UC progression. In addition, cross-species filtering of genome-wide copy number data highlighted several genes as high-profile candidates for further analysis, including CDKN2A, S100A8/9, and LRP1B. We propose that these common aberrations are indicative of an evolutionarily conserved mechanism of pathogenesis and harbor genes key to

  17. Applications of comparative genomic hybridisation in constitutional chromosome studies.

    PubMed

    Breen, C J; Barton, L; Carey, A; Dunlop, A; Glancy, M; Hall, K; Hegarty, A M; Khokhar, M T; Power, M; Ryan, K; Green, A J; Stallings, R L

    1999-07-01

    G band cytogenetic analysis often leads to the discovery of unbalanced karyotypes that require further characterisation by molecular cytogenetic studies. In particular, G band analysis usually does not show the chromosomal origin of small marker chromosomes or of a small amount of extra material detected on otherwise normal chromosomes. Comparative genomic hybridisation (CGH) is one of several molecular approaches that can be applied to ascertain the origin of extra chromosomal material. CGH is also capable of detecting loss of material and thus is also applicable to confirming or further characterising subtle deletions. We have used comparative genomic hybridisation to analyse 19 constitutional chromosome abnormalities detected by G band analysis, including seven deletions, five supernumerary marker chromosomes, two interstitial duplications, and five chromosomes presenting with abnormal terminal banding patterns. CGH was successful in elucidating the origin of extra chromosomal material in 10 out of 11 non-mosaic cases, and permitted further characterisation of all of the deletions that could be detected by GTG banding. CGH appears to be a useful adjunct tool for either confirming deletions or defining their breakpoints and for determining the origin of extra chromosomal material, even in cases where abnormalities are judged to be subtle. We discuss internal quality control measures, such as the mismatching of test and reference DNA in order to assess the quality of the competitive hybridisation effect on the X chromosome.

  18. New apolipoprotein A-V: comparative genomics meets metabolism.

    PubMed

    Seda, O; Sedová, L

    2003-01-01

    The availability of the human genome sequence and the recently completed draft sequences of two major mammalian model species, the mouse (Mus musculus) and the rat (Rattus norvegicus), allows researchers to apply novel approaches for gene identification and characterization, using methods of comparative and functional genomics. Recently, a new gene coding for apolipoprotein A-V was identified in the vicinity of the APOA-I/C-III/A-IV cluster on human chromosome 11q23 by a comparative sequencing method. In a relatively short time, compelling evidence accumulated for the substantial role of APOA-V in lipid metabolism. Studies in knock-out and transgenic mice revealed that its expression pattern correlates negatively with triglyceride levels. This observation was verified in human population studies in a variety of ethnic and age groups. Several single nucleotide polymorphisms were described, and particular SNP alleles and haplotypes in the APOA-V gene region were shown to be associated with dyslipidemia. The discovery and characterization of APOA-V demonstrate the current possibilities of integrative approaches in biology, boosted by the available bioinformatic tools.

  19. The complete mitochondrial genomes sequences of Asio flammeus and Asio otus and comparative analysis.

    PubMed

    Sun, Yi; Ma, Fei; Xiao, Bing; Zheng, Junjie; Yuan, Xiaodong; Tang, Minqian; Wang, Li; Yu, Yefei; Li, Qingwei

    2004-12-01

    The complete mitochondrial genomes of Asio flammeus and Asio otus were sequenced and found to span 18858 bp and 18493 bp, respectively. Surprisingly, the former is the largest among all avian mitochondrial genomes sequenced so far. The two genomes have a gene order very similar to that of Gallus gallus; neither contains the pseudo control region, but both have a single extra base, a cytidine, at position 174 in the ND3 gene. The control regions of the Asio flammeus and Asio otus mitochondrial genomes span 3288 bp and 2926 bp, respectively; they are the longest among vertebrates except for Myxine glutinosa and contribute to the large size of the two genomes. The 3' end of the control region of Asio flammeus and Asio otus contains many tandemly repeated sequences, which are highly similar to a putative control element, i.e. Mt5, and may form stable stem-loop secondary structures. Such repeated sequences probably play an important role in regulating transcription and replication of the mitochondrial genome. Our results may provide important clues for uncovering the origin and evolutionary mechanisms of the mitochondrial genome.
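
    The sizes quoted in this abstract allow a quick check of how much of each genome the control region occupies. The sketch below uses only the four lengths reported above; the dictionary layout and the function name control_region_share are choices made for this example.

```python
# Genome and control-region lengths (bp) as reported in the abstract.
GENOMES = {
    "Asio flammeus": {"genome_bp": 18858, "control_bp": 3288},
    "Asio otus": {"genome_bp": 18493, "control_bp": 2926},
}


def control_region_share(genome_bp, control_bp):
    """Fraction of the mitochondrial genome taken up by the control region."""
    return control_bp / genome_bp


for species, sizes in GENOMES.items():
    share = control_region_share(sizes["genome_bp"], sizes["control_bp"])
    print(f"{species}: control region is {share:.1%} of the genome")

# Length difference between the two species, overall and in the control region.
genome_diff = GENOMES["Asio flammeus"]["genome_bp"] - GENOMES["Asio otus"]["genome_bp"]
control_diff = GENOMES["Asio flammeus"]["control_bp"] - GENOMES["Asio otus"]["control_bp"]
print(genome_diff, control_diff)
```

    The two differences are 365 bp and 362 bp, so nearly all of the size difference between the two genomes lies in the control region, in line with the abstract's remark that the control regions contribute to the large genome size.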

  20. Comparative genomics of mitochondria in chlorarachniophyte algae: endosymbiotic gene transfer and organellar genome dynamics.

    PubMed

    Tanifuji, Goro; Archibald, John M; Hashimoto, Tetsuo

    2016-01-01

    Chlorarachniophyte algae possess four DNA-containing compartments per cell, the nucleus, mitochondrion, plastid and nucleomorph, the latter being a relic nucleus derived from a secondary endosymbiont. While the evolutionary dynamics of plastid and nucleomorph genomes have been investigated, a comparative investigation of mitochondrial genomes (mtDNAs) has not been carried out. We have sequenced the complete mtDNA of Lotharella oceanica and compared it to that of another chlorarachniophyte, Bigelowiella natans. The linear mtDNA of L. oceanica is 36.7 kbp in size and contains 35 protein genes, three rRNAs and 24 tRNAs. The codons GUG and UUG appear to be capable of acting as initiation codons in the chlorarachniophyte mtDNAs, in addition to AUG. Rpl16, rps4 and atp8 genes are missing in L. oceanica mtDNA, despite being present in B. natans mtDNA. We searched for, and found, mitochondrial rpl16 and rps4 genes with spliceosomal introns in the L. oceanica nuclear genome, indicating that mitochondrion-to-host-nucleus gene transfer occurred after the divergence of these two genera. Despite being of similar size and coding capacity, the level of synteny between L. oceanica and B. natans mtDNA is low, suggesting frequent rearrangements. Overall, our results suggest that chlorarachniophyte mtDNAs are more evolutionarily dynamic than their plastid counterparts. PMID:26888293

  1. Comparative genomics of mitochondria in chlorarachniophyte algae: endosymbiotic gene transfer and organellar genome dynamics.

    PubMed

    Tanifuji, Goro; Archibald, John M; Hashimoto, Tetsuo

    2016-02-18

    Chlorarachniophyte algae possess four DNA-containing compartments per cell, the nucleus, mitochondrion, plastid and nucleomorph, the latter being a relic nucleus derived from a secondary endosymbiont. While the evolutionary dynamics of plastid and nucleomorph genomes have been investigated, a comparative investigation of mitochondrial genomes (mtDNAs) has not been carried out. We have sequenced the complete mtDNA of Lotharella oceanica and compared it to that of another chlorarachniophyte, Bigelowiella natans. The linear mtDNA of L. oceanica is 36.7 kbp in size and contains 35 protein genes, three rRNAs and 24 tRNAs. The codons GUG and UUG appear to be capable of acting as initiation codons in the chlorarachniophyte mtDNAs, in addition to AUG. Rpl16, rps4 and atp8 genes are missing in L. oceanica mtDNA, despite being present in B. natans mtDNA. We searched for, and found, mitochondrial rpl16 and rps4 genes with spliceosomal introns in the L. oceanica nuclear genome, indicating that mitochondrion-to-host-nucleus gene transfer occurred after the divergence of these two genera. Despite being of similar size and coding capacity, the level of synteny between L. oceanica and B. natans mtDNA is low, suggesting frequent rearrangements. Overall, our results suggest that chlorarachniophyte mtDNAs are more evolutionarily dynamic than their plastid counterparts.

  2. Comparative genomics of mitochondria in chlorarachniophyte algae: endosymbiotic gene transfer and organellar genome dynamics

    NASA Astrophysics Data System (ADS)

    Tanifuji, Goro; Archibald, John M.; Hashimoto, Tetsuo

    2016-02-01

    Chlorarachniophyte algae possess four DNA-containing compartments per cell, the nucleus, mitochondrion, plastid and nucleomorph, the latter being a relic nucleus derived from a secondary endosymbiont. While the evolutionary dynamics of plastid and nucleomorph genomes have been investigated, a comparative investigation of mitochondrial genomes (mtDNAs) has not been carried out. We have sequenced the complete mtDNA of Lotharella oceanica and compared it to that of another chlorarachniophyte, Bigelowiella natans. The linear mtDNA of L. oceanica is 36.7 kbp in size and contains 35 protein genes, three rRNAs and 24 tRNAs. The codons GUG and UUG appear to be capable of acting as initiation codons in the chlorarachniophyte mtDNAs, in addition to AUG. Rpl16, rps4 and atp8 genes are missing in L. oceanica mtDNA, despite being present in B. natans mtDNA. We searched for, and found, mitochondrial rpl16 and rps4 genes with spliceosomal introns in the L. oceanica nuclear genome, indicating that mitochondrion-to-host-nucleus gene transfer occurred after the divergence of these two genera. Despite being of similar size and coding capacity, the level of synteny between L. oceanica and B. natans mtDNA is low, suggesting frequent rearrangements. Overall, our results suggest that chlorarachniophyte mtDNAs are more evolutionarily dynamic than their plastid counterparts.

  3. Comparative genomics of mitochondria in chlorarachniophyte algae: endosymbiotic gene transfer and organellar genome dynamics

    PubMed Central

    Tanifuji, Goro; Archibald, John M.; Hashimoto, Tetsuo

    2016-01-01

    Chlorarachniophyte algae possess four DNA-containing compartments per cell, the nucleus, mitochondrion, plastid and nucleomorph, the latter being a relic nucleus derived from a secondary endosymbiont. While the evolutionary dynamics of plastid and nucleomorph genomes have been investigated, a comparative investigation of mitochondrial genomes (mtDNAs) has not been carried out. We have sequenced the complete mtDNA of Lotharella oceanica and compared it to that of another chlorarachniophyte, Bigelowiella natans. The linear mtDNA of L. oceanica is 36.7 kbp in size and contains 35 protein genes, three rRNAs and 24 tRNAs. The codons GUG and UUG appear to be capable of acting as initiation codons in the chlorarachniophyte mtDNAs, in addition to AUG. Rpl16, rps4 and atp8 genes are missing in L. oceanica mtDNA, despite being present in B. natans mtDNA. We searched for, and found, mitochondrial rpl16 and rps4 genes with spliceosomal introns in the L. oceanica nuclear genome, indicating that mitochondrion-to-host-nucleus gene transfer occurred after the divergence of these two genera. Despite being of similar size and coding capacity, the level of synteny between L. oceanica and B. natans mtDNA is low, suggesting frequent rearrangements. Overall, our results suggest that chlorarachniophyte mtDNAs are more evolutionarily dynamic than their plastid counterparts. PMID:26888293

  4. Isolation and full genome characterization of avian influenza subtype H9N2 from poultry respiratory disease outbreak in Egypt.

    PubMed

    Shehata, Awad A; Parvin, Rokshana; Sultan, Hesham; Halami, Mohamed Y; Talaat, Shaimaa; Abd Elrazek, Alaa; Ibrahim, Mahmoud; Heenemann, Kristin; Vahlenkamp, Thomas

    2015-06-01

    Low pathogenic avian influenza virus of subtype H9N2 is panzootic in multiple avian species, causing respiratory manifestations and severe economic losses. H9N2 co-circulates with highly pathogenic avian influenza virus subtype H5N1 in Egyptian chicken farms, suggesting the possibility of reassortment. The aim of the present study was to isolate and characterize H9N2 from the recent outbreaks in chicken farms. The diversity of amantadine-resistant mutants among these isolates was also tested by in situ ELISA and sequence analysis. Three influenza H9N2 viruses, designated A/chicken/Egypt/SCU8/2014, A/chicken/Egypt/SCU9/2014 and A/chicken/Egypt/SCU20/2014, were isolated from commercial broiler and broiler breeder chickens in specific pathogen free embryonated chicken eggs. The eight gene segments were amplified by RT-PCR, cloned, and subjected to full length sequencing. Phylogenetic analysis of these viruses revealed a close relationship between Egyptian, Middle Eastern and Israeli isolates, with an average of 96-99% nucleotide homology, and identified an ancestral relationship to the low pathogenic H9N2 Quail/HK/G1/1997 prototype. The internal segments of the currently isolated viruses were derived from the same sub-lineage with no new evidence of reassortment. The three isolates were sensitive to amantadine, as suggested by the absence of mutations in M2 and confirmed by a phenotypic assay. In conclusion, avian influenza H9N2 virus is circulating in Egyptian chicken farms, causing respiratory manifestations. Continuous monitoring of the molecular epidemiology and its impact on the virulence, as well as of the emergence of new strains, is necessary.

  5. Sources for Comparative Studies of Placentation. II. Genomic Resources

    PubMed Central

    Wildman, Derek E.

    2008-01-01

    The genomes of dozens of placental mammal species are now publicly available. These genome sequences have the potential to provide insight into the development and evolution of the placenta. In particular, the variable anatomy of the placenta has likely been affected by natural selection on the genomes of living and extinct mammals. In this note the current availability of mammal genome sequences is reviewed, and strengths and limitations of these data are discussed. Additionally, museums, zoos, and commercial entities are available to provide genomic resources to the placental research community. Recommendations for tissue storage conditions of placentas in genomic research are given. PMID:18155141

  6. Comparative genomics on Vangl1 and Vangl2 genes.

    PubMed

    Katoh, Yuriko; Katoh, Masaru

    2005-05-01

    WNT signals are transduced to the beta-catenin pathway or the planar cell polarity (PCP) pathway. The WNT-beta-catenin pathway is implicated in carcinogenesis, while the WNT-PCP pathway is implicated in cell motility and metastasis. Drosophila Van Gogh (Vang), Frizzled (Fz), Starry night (Stan), Prickle (Pk) and Diego (Dgo) are PCP signaling molecules. Vangl1 (Strabismus 2) and Vangl2 (Strabismus 1 or Ltap) are mammalian homologs of Drosophila Vang interacting with PRICKLE1, PRICKLE2, ANKRD6, DVL1, DVL2, DVL3, KAI1 and MAGI3. Here we identified and characterized rat Vangl1 and Vangl2 genes by using bioinformatics. The rat Vangl1 gene, consisting of eight exons, was located within the AC098913.7 and AC108524.6 genome sequences. The rat Vangl2 gene, consisting of eight exons, was located within the AC118856.3 and AC115243.5 genome sequences. The exon-intron structure of mammalian Vangl1 and Vangl2 orthologs was well conserved. E47 and double ELK1-binding sites were conserved among promoters of mammalian Vangl1 orthologs. PAX4, NFkappaB, HNF4, SOX9, RFX1, and POU2F1 (OCT1)-binding sites were conserved among promoters of mammalian Vangl2 orthologs. Rat Vangl1 (526 aa) and Vangl2 (521 aa) were four-transmembrane proteins with 71.5% total-amino-acid identity. The Ser cluster motif (SxxSxxSxxSxxSxxS) in the N-terminal cytoplasmic region and the PDZ-binding motif in the C-terminal cytoplasmic tail were evolutionarily conserved among vertebrate Vangl1 and Vangl2 orthologs. This is the first report on rat Vangl1 and Vangl2 genes as well as on comparative genomics for Vangl1 and Vangl2 orthologs.
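
    The Ser cluster motif quoted above (SxxSxxSxxSxxSxxS, with x standing for any residue) can be located in a protein sequence with a regular expression. The sketch below is only an illustration: the function name find_ser_clusters and the toy sequence are invented for this example and are not real Vangl sequences.

```python
import re

# One serine followed by five repeats of "any two residues, then serine",
# i.e. SxxSxxSxxSxxSxxS (16 residues, six serines).
SER_CLUSTER = re.compile(r"S(?:..S){5}")


def find_ser_clusters(protein):
    """Return (0-based start, matched text) for each non-overlapping motif hit."""
    return [(m.start(), m.group()) for m in SER_CLUSTER.finditer(protein)]


# Hypothetical sequence fragment, written in one-letter amino acid code.
toy_sequence = "MDTESGYSYYSGHSKKSHRSQGER"
print(find_ser_clusters(toy_sequence))
```

    Note that finditer reports non-overlapping hits only, so a longer run of serines spaced three residues apart is reported once rather than at every possible offset.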

  7. Genome Sequencing and Comparative Genomics of the Broad Host-Range Pathogen Rhizoctonia solani AG8

    PubMed Central

    Hane, James K.; Anderson, Jonathan P.; Williams, Angela H.; Sperschneider, Jana; Singh, Karam B.

    2014-01-01

    Rhizoctonia solani is a soil-borne basidiomycete fungus with a necrotrophic lifestyle which is classified into fourteen reproductively incompatible anastomosis groups (AGs). One of these, AG8, is a devastating pathogen causing bare patch of cereals, brassicas and legumes. R. solani is a multinucleate heterokaryon containing significant heterozygosity within a single cell. This complexity posed significant challenges for the assembly of its genome. We present a high quality genome assembly of R. solani AG8 and a manually curated set of 13,964 genes supported by RNA-seq. The AG8 genome assembly used novel methods to produce a haploid representation of its heterokaryotic state. The whole-genomes of AG8, the rice pathogen AG1-IA and the potato pathogen AG3 were observed to be syntenic and co-linear. Genes and functions putatively relevant to pathogenicity were highlighted by comparing AG8 to known pathogenicity genes, orthology databases spanning 197 phytopathogenic taxa and AG1-IA. We also observed SNP-level “hypermutation” of CpG dinucleotides to TpG between AG8 nuclei, with similarities to repeat-induced point mutation (RIP). Interestingly, gene-coding regions were widely affected along with repetitive DNA, which has not been previously observed for RIP in mononuclear fungi of the Pezizomycotina. The rate of heterozygous SNP mutations within this single isolate of AG8 was observed to be higher than SNP mutation rates observed across populations of most fungal species compared. Comparative analyses were combined to predict biological processes relevant to AG8 and 308 proteins with effector-like characteristics, forming a valuable resource for further study of this pathosystem. Predicted effector-like proteins had elevated levels of non-synonymous point mutations relative to synonymous mutations (dN/dS), suggesting that they may be under diversifying selection pressures. In addition, the distant relationship to sequenced necrotrophs of the Ascomycota suggests the

  8. Whole genome alignment based one-step real-time RT-PCR for universal detection of avian orthoreoviruses of chicken, pheasant and turkey origins.

    PubMed

    Tang, Yi; Lu, Huaguang

    2016-04-01

    Newly emerging avian orthoreovirus (ARV) variants have been continuously detected in Pennsylvania poultry since 2011. In this paper, we report our recent diagnostic assay development of one-step real-time RT-PCR (rRT-PCR) for the rapid and universal detection of all ARVs or reference strains of chicken, pheasant and turkey origins and six σC genotypes of the newly emerging field ARV variants in Pennsylvania (PA) poultry. Primers and probes for the rRT-PCR were designed from the conserved region of the M1 genome segment 5' end based on the whole-genome alignment of various ARV strains, including six field variants or novel strains obtained in PA poultry. The detection limit of the newly developed rRT-PCR for ARV was as low as 10 copies/reaction of viral RNA, and 10^0.50-10^0.88 tissue culture infectious dose (TCID50)/100 μL of viruses. This new rRT-PCR detected all six σC genotypes from the 66 ARV field variant strains and reference strains tested in this study. There were no cross-reactions with other avian viruses. Reproducibility of the assay was confirmed by intra- and inter-assay tests with variability from 0.12% to 2.19%. Sensitivity and specificity of this new rRT-PCR for ARV were achieved at 100% and 88%, respectively, in comparison with virus isolation as the "gold standard" in testing poultry tissue specimens. PMID:26812128
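
    Sensitivity and specificity against a gold standard, as quoted above, follow from the four cells of a 2x2 comparison table. The abstract does not give the underlying counts, so the numbers in the sketch below are hypothetical and were chosen only so that the formulas reproduce the reported 100% and 88%.

```python
def sensitivity(true_pos, false_neg):
    """Share of gold-standard positives that the new assay also calls positive."""
    return true_pos / (true_pos + false_neg)


def specificity(true_neg, false_pos):
    """Share of gold-standard negatives that the new assay also calls negative."""
    return true_neg / (true_neg + false_pos)


# Hypothetical counts (not from the paper): virus isolation is the reference,
# rRT-PCR is the test being evaluated.
tp, fn = 40, 0   # isolation-positive specimens: PCR positive / PCR negative
tn, fp = 22, 3   # isolation-negative specimens: PCR negative / PCR positive

print(f"sensitivity = {sensitivity(tp, fn):.0%}")
print(f"specificity = {specificity(tn, fp):.0%}")
```

    With virus isolation as the reference, a specificity below 100% means that some isolation-negative specimens were positive by rRT-PCR.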

  10. High-Resolution Mapping of Crossover and Non-crossover Recombination Events by Whole-Genome Re-sequencing of an Avian Pedigree

    PubMed Central

    Qvarnström, Anna; Ellegren, Hans

    2016-01-01

    Recombination is an engine of genetic diversity and therefore constitutes a key process in evolutionary biology and genetics. While the outcome of crossover recombination can readily be detected as shuffled alleles by following the inheritance of markers in pedigreed families, the more precise location of both crossover and non-crossover recombination events has been difficult to pinpoint. As a consequence, we lack a detailed portrait of the recombination landscape for most organisms and knowledge on how this landscape impacts on sequence evolution at a local scale. To localize recombination events with high resolution in an avian system, we performed whole-genome re-sequencing at high coverage of a complete three-generation collared flycatcher pedigree. We identified 325 crossovers at a median resolution of 1.4 kb, with 86% of the events localized to <10 kb intervals. Observed crossover rates were in excellent agreement with data from linkage mapping, were 52% higher in male (3.56 cM/Mb) than in female meiosis (2.28 cM/Mb), and increased towards chromosome ends in male but not female meiosis. Crossover events were non-randomly distributed in the genome with several distinct hot-spots and a concentration to genic regions, with the highest density in promoters and CpG islands. We further identified 267 non-crossovers, whose location was significantly associated with crossover locations. We detected a significant transmission bias (0.18) in favour of ‘strong’ (G, C) over ‘weak’ (A, T) alleles at non-crossover events, providing direct evidence for the process of GC-biased gene conversion in an avian system. The approach taken in this study should be applicable to any species and would thereby help to provide a more comprehensive portrait of the recombination landscape across organism groups. PMID:27219623

  11. Comparative Genomics of Host-Specific Virulence in Pseudomonas syringae

    PubMed Central

    Sarkar, Sara F.; Gordon, Jeffrey S.; Martin, Gregory B.; Guttman, David S.

    2006-01-01

    While much study has gone into characterizing virulence factors that play a general role in disease, less work has been directed at identifying pathogen factors that act in a host-specific manner. Understanding these factors will help reveal the variety of mechanisms used by pathogens to suppress or avoid host defenses. We identified candidate Pseudomonas syringae host-specific virulence genes by searching for genes whose distribution among natural P. syringae isolates was statistically associated with hosts of isolation. We analyzed 91 strains isolated from 39 plant hosts by DNA microarray-based comparative genomic hybridization against an array containing 353 virulence-associated (VA) genes, including 53 type III secretion system effectors (T3SEs). We identified individual genes and gene profiles that were significantly associated with strains isolated from cauliflower, Chinese cabbage, soybean, rice, and tomato. We also identified specific horizontal gene acquisition events associated with host shifts by mapping the array data onto the core genome phylogeny of the species. This study provides the largest suite of candidate host-specificity factors from any pathogen, suggests that there are multiple ways in which P. syringae isolates can adapt to the same host, and provides insight into the evolutionary mechanisms underlying host adaptation. PMID:16951068

  12. Genome-wide Comparative Analysis of Annexin Superfamily in Plants

    PubMed Central

    Jami, Sravan Kumar; Clark, Greg B.; Ayele, Belay T.; Ashe, Paula; Kirti, Pulugurtha Bharadwaja

    2012-01-01

    Most annexins are calcium-dependent, phospholipid-binding proteins with suggested functions in response to environmental stresses and signaling during plant growth and development. They have previously been identified and characterized in Arabidopsis and rice, and constitute a multigene family in plants. In this study, we performed a comparative analysis of annexin gene families in the sequenced genomes of Viridiplantae ranging from unicellular green algae to multicellular plants, and identified 149 genes. Phylogenetic studies of these deduced annexins classified them into nine different arbitrary groups. The occurrence and distribution of bona fide type II calcium binding sites within the four annexin domains were found to be different in each of these groups. Analysis of chromosomal distribution of annexin genes in rice, Arabidopsis and poplar revealed their localization on various chromosomes with some members also found on duplicated chromosomal segments leading to gene family expansion. Analysis of gene structure suggests sequential or differential loss of introns during the evolution of land plant annexin genes. Intron positions and phases are well conserved in annexin genes from representative genomes ranging from Physcomitrella to higher plants. The occurrence of alternative motifs such as K/R/HGD was found to be overlapping or at the mutated regions of the type II calcium binding sites indicating potential functional divergence in certain plant annexins. This study provides a basis for further functional analysis and characterization of annexin multigene families in the plant lineage. PMID:23133603

  13. Comparative genomics of Arabidopsis and maize: prospects and limitations

    PubMed Central

    Brendel, Volker; Kurtz, Stefan; Walbot, Virginia

    2002-01-01

    The completed Arabidopsis genome seems to be of limited value as a model for maize genomics. In addition to the expansion of repetitive sequences in maize and the lack of genomic micro-colinearity, maize-specific or highly-diverged proteins contribute to a predicted maize proteome of about 50,000 proteins, twice the size of that of Arabidopsis. PMID:11897028

  14. Laminar segregation of GABAergic neurons in the avian nucleus isthmi pars magnocellularis: a retrograde tracer and comparative study.

    PubMed

    Faunes, Macarena; Fernández, Sara; Gutiérrez-Ibáñez, Cristián; Iwaniuk, Andrew N; Wylie, Douglas R; Mpodozis, Jorge; Karten, Harvey J; Marín, Gonzalo

    2013-06-01

    The isthmic complex is part of a visual midbrain circuit thought to be involved in stimulus selection and spatial attention. In birds, this circuit is composed of the nuclei isthmi pars magnocellularis (Imc), pars parvocellularis (Ipc), and pars semilunaris (SLu), all of them reciprocally connected to the ipsilateral optic tectum (TeO). The Imc conveys heterotopic inhibition to the TeO, Ipc, and SLu via widespread γ-aminobutyric acid (GABA)ergic axons that allow global competitive interactions among simultaneous sensory inputs. Anatomical studies in the chick have described a cytoarchitectonically uniform Imc nucleus containing two intermingled cell types: one projecting to the Ipc and SLu and the other to the TeO. Here we report that in passerine species, the Imc is segregated into an internal division displaying larger, sparsely distributed cells, and an external division displaying smaller, more densely packed cells. In vivo and in vitro injections of neural tracers in the TeO and the Ipc of the zebra finch demonstrated that neurons from the external and internal subdivisions project to the Ipc and the TeO, respectively, indicating that each Imc subdivision contains one of the two cell types hodologically defined in the chick. In an extensive survey across avian orders, we found that, in addition to passerines, only species of Piciformes and Rallidae exhibited a segregated Imc, whereas all other groups exhibited a uniform Imc. These results offer a comparative basis to investigate the functional role played by each Imc neural type in the competitive interactions mediated by this nucleus.

  15. Characterization of Three Mycobacterium spp. with Potential Use in Bioremediation by Genome Sequencing and Comparative Genomics

    PubMed Central

    Das, Sarbashis; Pettersson, B.M. Fredrik; Behra, Phani Rama Krishna; Ramesh, Malavika; Dasgupta, Santanu; Bhattacharya, Alok; Kirsebom, Leif A.

    2015-01-01

    We provide the genome sequences of the type strains of the polychlorophenol-degrading Mycobacterium chlorophenolicum (DSM43826), the degrader of chlorinated aliphatics Mycobacterium chubuense (DSM44219) and Mycobacterium obuense (DSM44075) that has been tested for use in cancer immunotherapy. The genome sizes of M. chlorophenolicum, M. chubuense, and M. obuense are 6.93, 5.95, and 5.58 Mb with GC-contents of 68.4%, 69.2%, and 67.9%, respectively. Comparative genomic analysis revealed that 3,254 genes are common and we predicted approximately 250 genes acquired through horizontal gene transfer from different sources including proteobacteria. The data also showed that the biodegrading Mycobacterium spp. NBB4, also referred to as M. chubuense NBB4, is distantly related to the M. chubuense type strain and should be considered a separate species; we suggest that it be named Mycobacterium ethylenense NBB4. Among different categories we identified genes with potential roles in biodegradation of aromatic compounds and in copper homeostasis. These are the first nonpathogenic Mycobacterium spp. found harboring genes involved in copper homeostasis. These findings would therefore provide insight into the role of this group of Mycobacterium spp. in bioremediation as well as the evolution of copper homeostasis within the Mycobacterium genus. PMID:26079817

  16. Characterization of Three Mycobacterium spp. with Potential Use in Bioremediation by Genome Sequencing and Comparative Genomics.

    PubMed

    Das, Sarbashis; Pettersson, B M Fredrik; Behra, Phani Rama Krishna; Ramesh, Malavika; Dasgupta, Santanu; Bhattacharya, Alok; Kirsebom, Leif A

    2015-07-01

    We provide the genome sequences of the type strains of the polychlorophenol-degrading Mycobacterium chlorophenolicum (DSM43826), the degrader of chlorinated aliphatics Mycobacterium chubuense (DSM44219) and Mycobacterium obuense (DSM44075) that has been tested for use in cancer immunotherapy. The genome sizes of M. chlorophenolicum, M. chubuense, and M. obuense are 6.93, 5.95, and 5.58 Mb with GC-contents of 68.4%, 69.2%, and 67.9%, respectively. Comparative genomic analysis revealed that 3,254 genes are common and we predicted approximately 250 genes acquired through horizontal gene transfer from different sources including proteobacteria. The data also showed that the biodegrading Mycobacterium spp. NBB4, also referred to as M. chubuense NBB4, is distantly related to the M. chubuense type strain and should be considered a separate species; we suggest that it be named Mycobacterium ethylenense NBB4. Among different categories we identified genes with potential roles in biodegradation of aromatic compounds and in copper homeostasis. These are the first nonpathogenic Mycobacterium spp. found harboring genes involved in copper homeostasis. These findings would therefore provide insight into the role of this group of Mycobacterium spp. in bioremediation as well as the evolution of copper homeostasis within the Mycobacterium genus.

  17. Comparative genomics of pectinacetylesterases: Insight on function and biology

    PubMed Central

    de Souza, Amancio José; Pauly, Markus

    2015-01-01

    Pectin acetylation influences the gelling ability of this important plant polysaccharide for the food industry. Plant apoplastic pectinacetylesterases (PAEs) play a key role in regulating the degree of pectin acetylation, and modifying their expression thus represents one way to engineer plant polysaccharides for food applications. Identifying the major active enzymes within the PAE gene family will aid in our understanding of this biological phenomenon as well as provide the tools for direct trait manipulation. Using comparative genomics we propose that there is a minimal set of 4 distinct PAEs in plants. Possible functional diversification of the PAE family in the grasses is also explored with the identification of 3 groups of PAE genes specific to grasses. PMID:26237162

  18. Establishing a framework for comparative analysis of genome sequences

    SciTech Connect

    Bansal, A.K.

    1995-06-01

    This paper describes a framework and a high-level language toolkit for comparative analysis of genome sequence alignment. The framework integrates the information derived from multiple sequence alignment and phylogenetic tree (hypothetical tree of evolution) to derive new properties about sequences. Multiple sequence alignments are treated as an abstract data type. Abstract operations have been described to manipulate a multiple sequence alignment and to derive mutation related information from a phylogenetic tree by superimposing parsimonious analysis. The framework has been applied to protein alignments to derive constrained columns (in a multiple sequence alignment) that exhibit evolutionary pressure to preserve a common property in a column despite mutation. A Prolog toolkit based on the framework has been implemented and demonstrated on alignments containing 3000 sequences and 3904 columns.
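
    The notion of a "constrained column" in the record above can be illustrated with a small sketch. This is not the paper's Prolog toolkit, and it ignores the phylogenetic tree entirely: it is a Python approximation that treats an alignment as a list of equal-length strings and reports columns whose residues vary yet all fall in one property class. The property classes and the function name are assumptions made for illustration.

```python
# Hypothetical, simplified amino-acid property classes (an assumption,
# not the classification used in the paper).
PROPERTY_CLASSES = {
    "hydrophobic": set("AVLIMFWC"),
    "positive": set("KRH"),
    "negative": set("DE"),
    "polar": set("STNQ"),
}


def constrained_columns(alignment, classes=PROPERTY_CLASSES):
    """Return (column_index, class_name) for columns that mutate
    but keep every residue inside a single property class."""
    if len({len(row) for row in alignment}) > 1:
        raise ValueError("all aligned sequences must have the same length")
    found = []
    for idx, column in enumerate(zip(*alignment)):
        residues = set(column)
        # Skip gapped columns and invariant columns (no mutation observed).
        if "-" in residues or len(residues) < 2:
            continue
        for name, members in classes.items():
            if residues <= members:
                found.append((idx, name))
                break
    return found


msa = ["AKDS", "VRES", "LKDA"]
print(constrained_columns(msa))
```

    In this toy alignment the first three columns vary within one class each, while the last column mixes a polar and a hydrophobic residue and is therefore not reported.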

  19. Reptile genomes open the frontier for comparative analysis of amniote development and regeneration.

    PubMed

    Tollis, Marc; Hutchins, Elizabeth D; Kusumi, Kenro

    2014-01-01

    Developmental genetic studies of vertebrates have focused primarily on zebrafish, frog and mouse models, which have clear application to medicine and well-developed genomic resources. In contrast, reptiles represent the most diverse amniote group, but have only recently begun to gather the attention of genome sequencing efforts. Extant reptilian groups last shared a common ancestor ~280 million years ago and include lepidosaurs, turtles and crocodilians. This phylogenetic diversity is reflected in great morphological and behavioral diversity capturing the attention of biologists interested in mechanisms regulating developmental processes such as somitogenesis and spinal patterning, regeneration, the evolution of "snake-like" morphology, the formation of the unique turtle shell, and the convergent evolution of the four-chambered heart shared by mammals and archosaurs. The complete genome of the first non-avian reptile, the green anole lizard, was published in 2011 and has provided insights into the origin and evolution of amniotes. Since then, the genomes of multiple snakes, turtles, and crocodilians have also been completed. Here we will review the current diversity of available reptile genomes, with an emphasis on their evolutionary relationships, and will highlight how these genomes have and will continue to facilitate research in developmental and regenerative biology.

  20. Testing avian compass calibration: comparative experiments with diurnal and nocturnal passerine migrants in South Sweden.

    PubMed

    Åkesson, Susanne; Odin, Catharina; Hegedüs, Ramón; Ilieva, Mihaela; Sjöholm, Christoffer; Farkas, Alexandra; Horváth, Gábor

    2014-01-01

    Cue-conflict experiments were performed to study the compass calibration of one predominantly diurnal migrant, the dunnock (Prunella modularis), and two species of nocturnal passerine migrants, the sedge warbler (Acrocephalus schoenobaenus), and the European robin (Erithacus rubecula) during autumn migration in South Sweden. The birds' orientation was recorded in circular cages under natural clear and simulated overcast skies in the local geomagnetic field, and thereafter the birds were exposed to a cue-conflict situation where the horizontal component of the magnetic field (mN) was shifted +90° or -90° at two occasions, one session starting shortly after sunrise and the other ca. 90 min before sunset and lasting for 60 min. The patterns of the degree and angle of skylight polarization were measured by full-sky imaging polarimetry during the cue-conflict exposures and orientation tests. All species showed orientation both under clear and overcast skies that correlated with the expected migratory orientation towards southwest to south. For the European robin the orientation under clear skies was significantly different from that recorded under overcast skies, showing a tendency that the orientation under clear skies was influenced by the position of the Sun at sunset resulting in more westerly orientation. This sun attraction was not observed for the sedge warbler and the dunnock, both orientating south. All species showed similar orientation after the cue-conflict as compared to the preferred orientation recorded before the cue-conflict, with the clearest results in the European robin and thus, the results did not support recalibration of the celestial nor the magnetic compasses as a result of the cue-conflict exposure. PMID:25505150

  1. Testing avian compass calibration: comparative experiments with diurnal and nocturnal passerine migrants in South Sweden

    PubMed Central

    Åkesson, Susanne; Odin, Catharina; Hegedüs, Ramón; Ilieva, Mihaela; Sjöholm, Christoffer; Farkas, Alexandra; Horváth, Gábor

    2015-01-01

    Cue-conflict experiments were performed to study the compass calibration of one predominantly diurnal migrant, the dunnock (Prunella modularis), and two species of nocturnal passerine migrants, the sedge warbler (Acrocephalus schoenobaenus), and the European robin (Erithacus rubecula) during autumn migration in South Sweden. The birds' orientation was recorded in circular cages under natural clear and simulated overcast skies in the local geomagnetic field, and thereafter the birds were exposed to a cue-conflict situation where the horizontal component of the magnetic field (mN) was shifted +90° or −90° at two occasions, one session starting shortly after sunrise and the other ca. 90 min before sunset and lasting for 60 min. The patterns of the degree and angle of skylight polarization were measured by full-sky imaging polarimetry during the cue-conflict exposures and orientation tests. All species showed orientation both under clear and overcast skies that correlated with the expected migratory orientation towards southwest to south. For the European robin the orientation under clear skies was significantly different from that recorded under overcast skies, showing a tendency that the orientation under clear skies was influenced by the position of the Sun at sunset resulting in more westerly orientation. This sun attraction was not observed for the sedge warbler and the dunnock, both orientating south. All species showed similar orientation after the cue-conflict as compared to the preferred orientation recorded before the cue-conflict, with the clearest results in the European robin and thus, the results did not support recalibration of the celestial nor the magnetic compasses as a result of the cue-conflict exposure. PMID:25505150

  2. Reproductive Biology and Its Impact on Body Size: Comparative Analysis of Mammalian, Avian and Dinosaurian Reproduction

    PubMed Central

    Werner, Jan; Griebeler, Eva Maria

    2011-01-01

    Janis and Carrano (1992) suggested that large dinosaurs might have faced a lower risk of extinction under ecological changes than similar-sized mammals because large dinosaurs had a higher potential reproductive output than similar-sized mammals (JC hypothesis). First, we tested the assumption underlying the JC hypothesis. We therefore analysed the potential reproductive output (reflected in clutch/litter size and annual offspring number) of extant terrestrial mammals and birds (as “dinosaur analogs”) and of extinct dinosaurs. With the exception of rodents, the differences in the reproductive output of similar-sized birds and mammals proposed by Janis and Carrano (1992) existed even at the level of single orders. Fossil dinosaur clutches were larger than litters of similar-sized mammals, and dinosaur clutch sizes were comparable to those of similar-sized birds. Because the extinction risk of extant species often correlates with a low reproductive output, the latter difference suggests a lower risk of population extinction in dinosaurs than in mammals. Second, we present a very simple, mathematical model that demonstrates the advantage of a high reproductive output underlying the JC hypothesis. It predicts that a species with a high reproductive output that usually faces very high juvenile mortalities will benefit more strongly in terms of population size from reduced juvenile mortalities (e.g., resulting from a stochastic reduction in population size) than a species with a low reproductive output that usually comprises low juvenile mortalities. Based on our results, we suggest that reproductive strategy could have contributed to the evolution of the exceptional gigantism seen in dinosaurs that does not exist in extant terrestrial mammals. Large dinosaurs, e.g., the sauropods, may have easily sustained populations of very large-bodied species over evolutionary time. PMID:22194835

  3. Evolutionary insights into scleractinian corals using comparative genomic hybridizations

    PubMed Central

    2012-01-01

    Background: Coral reefs belong to the most ecologically and economically important ecosystems on our planet. Yet, they are under steady decline worldwide due to rising sea surface temperatures, disease, and pollution. Understanding the molecular impact of these stressors on different coral species is imperative in order to predict how coral populations will respond to this continued disturbance. The use of molecular tools such as microarrays has provided deep insight into the molecular stress response of corals. Here, we have performed comparative genomic hybridizations (CGH) with different coral species to an Acropora palmata microarray platform containing 13,546 cDNA clones in order to identify potentially rapidly evolving genes and to determine the suitability of existing microarray platforms for use in gene expression studies (via heterologous hybridization). Results: Our results showed that the current microarray platform for A. palmata is able to provide biologically relevant information for a wide variety of coral species covering both the complex clade as well as the robust clade. Analysis of the fraction of highly diverged genes showed a significantly higher number of genes without annotation, corroborating previous findings that point towards a higher rate of divergence for taxonomically restricted genes. Among the genes with annotation, we found many mitochondrial genes to be highly diverged in M. faveolata when compared to A. palmata, while the majority of nuclear encoded genes maintained an average divergence rate. Conclusions: The use of present microarray platforms for transcriptional analyses in different coral species will greatly enhance the understanding of the molecular basis of stress and health and highlight evolutionary differences between scleractinian coral species. On a genomic basis, we show that cDNA arrays can be used to identify patterns of divergence. Mitochondrion-encoded genes seem to have diverged faster than nuclear encoded genes in robust

  4. Comparative genomic hybridizations reveal absence of large Streptomyces coelicolor genomic islands in Streptomyces lividans

    PubMed Central

    Jayapal, Karthik P; Lian, Wei; Glod, Frank; Sherman, David H; Hu, Wei-Shou

    2007-01-01

    Background: The genomes of Streptomyces coelicolor and Streptomyces lividans bear a considerable degree of synteny. While S. coelicolor is the model streptomycete for studying antibiotic synthesis and differentiation, S. lividans is almost exclusively considered as the preferred host, among actinomycetes, for cloning and expression of exogenous DNA. We used whole genome microarrays as a comparative genomics tool for identifying the subtle differences between these two chromosomes. Results: We identified five large S. coelicolor genomic islands (larger than 25 kb) and 18 smaller islets absent from the S. lividans chromosome. Many of these regions show anomalous GC bias and codon usage patterns. Six of them are in close vicinity of tRNA genes while nine are flanked with near perfect repeat sequences indicating that these are probable recent evolutionary acquisitions into S. coelicolor. Embedded within these segments are at least four DNA methylases and two probable methyl-sensing restriction endonucleases. Comparison with S. coelicolor transcriptome and proteome data revealed that some of the missing genes are active during the course of growth and differentiation in S. coelicolor. In particular, a pair of methylmalonyl CoA mutase (mcm) genes involved in polyketide precursor biosynthesis, an acyl-CoA dehydrogenase implicated in timing of actinorhodin synthesis and bldB, a developmentally significant regulator whose mutation causes complete abrogation of antibiotic synthesis, belong to this category. Conclusion: Our findings provide tangible hints for elucidating the genetic basis of important phenotypic differences between these two streptomycetes. Importantly, absence of certain genes in S. lividans identified here could potentially explain the relative ease of DNA transformations and the conditional lack of actinorhodin synthesis in S. lividans. PMID:17623098

  5. Comparative analysis of trichomonad genome sizes and karyotypes.

    PubMed

    Zubácová, Zuzana; Cimbůrek, Zdenek; Tachezy, Jan

    2008-09-01

    In parasitic protists, the genome sizes range from 2.9Mb in Encephalitozoon cuniculi to about 160Mb in Trichomonas vaginalis. The surprisingly large genome size of the latter human parasite resulted from the expansion of various repetitive elements, specific gene families, and possibly from large-scale genome duplication. The reason for this phenomenon, as well as whether other trichomonad species have undergone a similar genome expansion, is not known. In this work we studied the genomes of nine selected species of the Trichomonadea group. We found that each species has a characteristic karyotype with a stable and haploid number of chromosomes. Relatively large genome sizes were found in all the tested species, although over a rather broad range (86-177Mb). The largest genomes were typically observed in the Trichomonas and Tritrichomonas genera (133-177Mb), while Tetratrichomonas gallinarum contains the smallest genome (86Mb). The genome size correlated with the cell volume; however, no relationship between genome size and the site of infection or trichomonad phagocytic ability was observed. The data presented here provide primary information towards selecting a trichomonad species for future large-scale sequencing to elucidate the evolution of unusual parabasalid genomes. PMID:18606195

  7. Automated comparative auditing of NCIT genomic roles using NCBI.

    PubMed

    Cohen, Barry; Oren, Marc; Min, Hua; Perl, Yehoshua; Halper, Michael

    2008-12-01

    Biomedical research has identified many human genes and various knowledge about them. The National Cancer Institute Thesaurus (NCIT) represents such knowledge as concepts and roles (relationships). Due to the rapid advances in this field, it is to be expected that the NCIT's Gene hierarchy will contain role errors. A comparative methodology to audit the Gene hierarchy with the use of the National Center for Biotechnology Information's (NCBI's) Entrez Gene database is presented. The two knowledge sources are accessed via a pair of Web crawlers to ensure up-to-date data. Our algorithms then compare the knowledge gathered from each, identify discrepancies that represent probable errors, and suggest corrective actions. The primary focus is on two kinds of gene-roles: (1) the chromosomal locations of genes, and (2) the biological processes in which genes play a role. Regarding chromosomal locations, the discrepancies revealed are striking and systematic, suggesting a structurally common origin. In regard to the biological processes, difficulties arise because genes frequently play roles in multiple processes, and processes may have many designations (such as synonymous terms). Our algorithms make use of the roles defined in the NCIT Biological Process hierarchy to uncover many probable gene-role errors in the NCIT. These results show that automated comparative auditing is a promising technique that can identify a large number of probable errors and corrections for them in a terminological genomic knowledge repository, thus facilitating its overall maintenance.
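
    The comparison step described in the record above (gather the same kind of fact from two sources, then flag disagreements as probable errors) can be sketched in a few lines. This is only a schematic: the gene identifiers and locations are placeholders rather than real NCIT or Entrez Gene data, the function name is hypothetical, and the Web crawling and synonym handling mentioned in the abstract are not modeled.

```python
def audit_locations(audited, reference):
    """Compare gene -> chromosomal-location maps from two sources.

    Returns (discrepancies, missing):
      discrepancies: (gene, audited_value, reference_value) where the two disagree
      missing: genes in the audited source that the reference does not cover
    """
    discrepancies = []
    missing = []
    for gene in sorted(audited):
        if gene not in reference:
            missing.append(gene)
        elif audited[gene] != reference[gene]:
            discrepancies.append((gene, audited[gene], reference[gene]))
    return discrepancies, missing


# Placeholder data (hypothetical identifiers and locations).
thesaurus = {"GENE_A": "1p36", "GENE_B": "7q31", "GENE_C": "Xq28"}
gene_db = {"GENE_A": "1p36", "GENE_B": "7q22"}

diffs, uncovered = audit_locations(thesaurus, gene_db)
print(diffs)      # candidate errors, with the reference value as suggested fix
print(uncovered)  # entries that cannot be checked against the reference
```

    Exact string comparison is the simplest possible rule; a real audit would need to normalize notation before comparing, since the same location can be written at different levels of precision.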

  8. Comparative genomic analysis of hyperthermophilic archaeal fuselloviridae viruses

    SciTech Connect

    B. Wiedenheft; K. Stedman; F. Roberto; D. Willits; A. K. Gleske; L. Zoeller; J. Snyder; T. Douglas; M. Young

    2004-02-01

    The complete genome sequences of two Sulfolobus spindle-shaped viruses (SSVs) from acidic hot springs in Kamchatka (Russia) and Yellowstone National Park (United States) have been determined. These nonlytic temperate viruses were isolated from hyperthermophilic Sulfolobus hosts, and both viruses share the spindle-shaped morphology characteristic of the Fuselloviridae family. These two genomes, in combination with the previously determined SSV1 genome from Japan and the SSV2 genome from Iceland, have allowed us to carry out a phylogenetic comparison of these geographically distributed hyperthermal viruses. Each virus contains a circular double-stranded DNA genome of ~15 kbp with approximately 34 open reading frames (ORFs). These Fusellovirus ORFs show little or no similarity to genes in the public databases. In contrast, 18 ORFs are common to all four isolates and may represent the minimal gene set defining this viral group. In general, ORFs on one half of the genome are colinear and highly conserved, while ORFs on the other half are not. One shared ORF among all four genomes is an integrase of the tyrosine recombinase family. All four viral genomes integrate into their host tRNA genes. The specific tRNA gene used for integration varies, and one genome integrates into multiple loci. Several unique ORFs are found in the genome of each isolate.
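
    The "ORFs common to all four isolates" in the record above amount to a set intersection once each ORF has been assigned to a homology family. The sketch below assumes that assignment has already been done; the family labels are hypothetical, the two new isolates are keyed by their collection site because the abstract does not name them, and the function name is an assumption.

```python
def core_orfs(orfs_by_genome):
    """Return the ORF families present in every genome."""
    family_sets = [set(families) for families in orfs_by_genome.values()]
    if not family_sets:
        return set()
    return set.intersection(*family_sets)


# Hypothetical ORF family labels, for illustration only.
genomes = {
    "SSV1": {"orf_a", "orf_b", "orf_c", "integrase"},
    "SSV2": {"orf_a", "orf_c", "integrase", "orf_x"},
    "Kamchatka isolate": {"orf_a", "orf_c", "integrase", "orf_y"},
    "Yellowstone isolate": {"orf_a", "orf_c", "integrase"},
}

shared = core_orfs(genomes)
print(sorted(shared))
```

    Families left over after subtracting the shared set from a single genome correspond to the isolate-specific ORFs the abstract mentions.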

  9. Comparative Genomic Analyses of the Human NPHP1 Locus Reveal Complex Genomic Architecture and Its Regional Evolution in Primates

    PubMed Central

    Yuan, Bo; Liu, Pengfei; Gupta, Aditya; Beck, Christine R.; Tejomurtula, Anusha; Campbell, Ian M.; Gambin, Tomasz; Simmons, Alexandra D.; Withers, Marjorie A.; Harris, R. Alan; Rogers, Jeffrey; Schwartz, David C.; Lupski, James R.

    2015-01-01

    Many loci in the human genome harbor complex genomic structures that can result in susceptibility to genomic rearrangements leading to various genomic disorders. Nephronophthisis 1 (NPHP1, MIM# 256100) is an autosomal recessive disorder that can be caused by defects of NPHP1; the gene maps within the human 2q13 region where low copy repeats (LCRs) are abundant. Loss of function of NPHP1 is responsible for approximately 85% of the NPHP1 cases—about 80% of such individuals carry a large recurrent homozygous NPHP1 deletion that occurs via nonallelic homologous recombination (NAHR) between two flanking directly oriented ~45 kb LCRs. Published data revealed a non-pathogenic inversion polymorphism involving the NPHP1 gene flanked by two inverted ~358 kb LCRs. Using optical mapping and array-comparative genomic hybridization, we identified three potential novel structural variant (SV) haplotypes at the NPHP1 locus that may protect a haploid genome from the NPHP1 deletion. Inter-species comparative genomic analyses among primate genomes revealed massive genomic changes during evolution. The aggregated data suggest that dynamic genomic rearrangements occurred historically within the NPHP1 locus and generated SV haplotypes observed in the human population today, which may confer differential susceptibility to genomic instability and the NPHP1 deletion within a personal genome. Our study documents diverse SV haplotypes at a complex LCR-laden human genomic region. Comparative analyses provide a model for how this complex region arose during primate evolution, and studies among humans suggest that intra-species polymorphism may potentially modulate an individual’s susceptibility to acquiring disease-associated alleles. PMID:26641089

  10. Array Comparative Genomic Hybridizations: Assessing the ability to recapture evolutionary relationships using an in silico approach

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Comparative Genomic Hybridization (CGH) with DNA microarrays has many biological applications including surveys of copy number changes in tumorigenesis, species detection and identification, and functional genomics studies among related organisms. Array CGH has also been used to infer phylogenetic r...

  11. Faustoviruses: Comparative Genomics of New Megavirales Family Members

    PubMed Central

    Benamar, Samia; Reteno, Dorine G. I.; Bandaly, Victor; Labas, Noémie; Raoult, Didier; La Scola, Bernard

    2016-01-01

    An emerging interest for the giant virus discovery process, genome sequencing and analysis has allowed an expansion of the number of known Megavirales members. Using the protist Vermamoeba sp. as cell support, a new giant virus named Faustovirus has been isolated. In this study, we describe the genome sequences of nine Faustoviruses and build a genomic comparison in order to have a comprehensive overview of genomic composition and diversity among this new virus family. The average sequence length of these viruses is 467,592.44 bp (ranging from 455,803 to 491,024 bp), making them the fourth largest Megavirales genome after Mimiviruses, Pandoraviruses, and Pithovirus sibericum. Faustovirus genomes displayed an average G+C content of 37.14% (ranging from 36.22 to 39.59%), which is close to the G+C content range of the Asfarviridae genomes (38%). The proportion of best matches and the phylogenetic analysis suggest a shared origin with Asfarviridae without belonging to the same family. The core-gene-based phylogeny of Faustoviruses study has identified four lineages. These results were confirmed by the analysis of amino acids and COGs category distribution. The diversity of the gene composition of these lineages is mainly explained by gene deletion or acquisition and some exceptions for gene duplications. The high proportion of best matches from Bacteria and Phycodnaviridae on the pan-genome and unique genes may be explained by an interaction occurring after the separation of the lineages. The Faustovirus core-genome appears to consolidate the surrounding of 207 genes whereas the pan-genome is described as an open pan-genome; its enrichment via the discovery of new Faustoviruses is required to better seize all the genomic diversity of this family. PMID:26903952
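
    The summary statistics quoted in this record (average genome length with its range, and average G+C content) can be computed along the following lines. This is a sketch under stated assumptions: the sequences are toy strings rather than Faustovirus genomes, the function names are hypothetical, and the mean G+C is taken as the unweighted mean of per-genome percentages, which may differ from how the authors computed it.

```python
def gc_percent(sequence):
    """G+C content as a percentage of unambiguous bases (A, C, G, T)."""
    counted = [base for base in sequence.upper() if base in "ACGT"]
    if not counted:
        return 0.0
    gc = sum(1 for base in counted if base in "GC")
    return 100.0 * gc / len(counted)


def summarize(genomes):
    """Length and G+C statistics for a {name: sequence} mapping."""
    lengths = [len(seq) for seq in genomes.values()]
    gc_values = [gc_percent(seq) for seq in genomes.values()]
    return {
        "mean_length": sum(lengths) / len(lengths),
        "min_length": min(lengths),
        "max_length": max(lengths),
        "mean_gc": sum(gc_values) / len(gc_values),
    }


# Toy sequences, invented purely for illustration.
toy_genomes = {
    "toy_a": "ATGC",
    "toy_b": "ATATATGC",
    "toy_c": "GGCCGGCCGATA",
}
summary = summarize(toy_genomes)
print(summary)
```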

  12. Comparative Genomic and Sequence Analysis Provides Insight into the Molecular Functionality of NOD1 and NOD2

    PubMed Central

    Boyle, Joseph P.; Mayle, Sophie; Parkhouse, Rhiannon; Monie, Tom P.

    2013-01-01

    Amino acids with functional or key structural roles display higher degrees of conservation through evolution. The comparative analysis of protein sequences from multiple species and/or between homologous proteins can be highly informative in the identification of key structural and functional residues, which in turn provide insight into the molecular mechanisms of protein function. We have explored the genomic and amino acid conservation of the prototypic innate immune genes NOD1 and NOD2. NOD1 orthologs were found in all vertebrate species analyzed, whilst NOD2 was absent from the genomes of avian, reptilian and amphibian species. Evolutionary trace analysis was used to identify highly conserved regions of NOD1 and NOD2 across multiple species. Consistent with the known functions of NOD1 and NOD2, highly conserved patches were identified that matched the Walker A and B motifs and provided interaction surfaces for the adaptor protein RIP2. Other patches of high conservation reflect key structural functions as predicted by homology models. In addition, the pattern of residue conservation within the leucine-rich repeat (LRR) region of NOD1 and NOD2 is indicative of a conserved mechanism of ligand recognition involving the concave surface of the LRRs. PMID:24109482

  13. Germline Modification and Engineering in Avian Species.

    PubMed

    Lee, Hong Jo; Lee, Hyung Chul; Han, Jae Yong

    2015-09-01

    Production of genome-edited animals using germline-competent cells and genetic modification tools has provided opportunities for investigation of biological mechanisms in various organisms. The recently reported programmed genome editing technology that can induce gene modification at a target locus in an efficient and precise manner facilitates establishment of animal models. In this regard, the demand for genome-edited avian species, which are some of the most suitable model animals due to their unique embryonic development, has also increased. Furthermore, germline chimera production through long-term culture of chicken primordial germ cells (PGCs) has facilitated research on production of genome-edited chickens. Thus, use of avian germline modification is promising for development of novel avian models for research of disease control and various biological mechanisms. Here, we discuss recent progress in genome modification technology in avian species and its applications and future strategies. PMID:26333275

  14. Germline Modification and Engineering in Avian Species

    PubMed Central

    Lee, Hong Jo; Lee, Hyung Chul; Han, Jae Yong

    2015-01-01

    Production of genome-edited animals using germline-competent cells and genetic modification tools has provided opportunities for investigation of biological mechanisms in various organisms. The recently reported programmed genome editing technology that can induce gene modification at a target locus in an efficient and precise manner facilitates establishment of animal models. In this regard, the demand for genome-edited avian species, which are some of the most suitable model animals due to their unique embryonic development, has also increased. Furthermore, germline chimera production through long-term culture of chicken primordial germ cells (PGCs) has facilitated research on production of genome-edited chickens. Thus, use of avian germline modification is promising for development of novel avian models for research of disease control and various biological mechanisms. Here, we discuss recent progress in genome modification technology in avian species and its applications and future strategies. PMID:26333275

  15. Genome-wide association mapping in a wild avian population identifies a link between genetic and phenotypic variation in a life-history trait

    PubMed Central

    Husby, Arild; Kawakami, Takeshi; Rönnegård, Lars; Smeds, Linnéa; Ellegren, Hans; Qvarnström, Anna

    2015-01-01

    Understanding the genetic basis of traits involved in adaptation is a major challenge in evolutionary biology but remains poorly understood. Here, we use genome-wide association mapping using a custom 50 k single nucleotide polymorphism (SNP) array in a natural population of collared flycatchers to examine the genetic basis of clutch size, an important life-history trait in many animal species. We found evidence for an association on chromosome 18 where one SNP significant at the genome-wide level explained 3.9% of the phenotypic variance. We also detected two suggestive quantitative trait loci (QTLs) on chromosomes 9 and 26. Fitness differences among genotypes were generally weak and not significant, although there was some indication of a sex-by-genotype interaction for lifetime reproductive success at the suggestive QTL on chromosome 26. This implies that sexual antagonism may play a role in maintaining genetic variation at this QTL. Our findings provide candidate regions for a classic avian life-history trait that will be useful for future studies examining the molecular and cellular function of, as well as evolutionary mechanisms operating at, these loci. PMID:25833857

  16. Genome Based Phylogeny and Comparative Genomic Analysis of Intra-Mammary Pathogenic Escherichia coli

    PubMed Central

    Richards, Vincent P.; Lefébure, Tristan; Pavinski Bitar, Paulina D.; Dogan, Belgin; Simpson, Kenneth W.; Schukken, Ynte H.; Stanhope, Michael J.

    2015-01-01

    Escherichia coli is an important cause of bovine mastitis and can cause both severe inflammation with a short-term transient infection, as well as less severe, but more chronic inflammation and infection persistence. E. coli is a highly diverse organism that has been classified into a number of different pathotypes or pathovars, and mammary pathogenic E. coli (MPEC) has been proposed as a new such pathotype. The purpose of this study was to use genome sequence data derived from both transient and persistent MPEC isolates (two isolates of each phenotype) to construct a genome-based phylogeny that places MPEC in its phylogenetic context with other E. coli pathovars. A subsidiary goal was to conduct comparative genomic analyses of these MPEC isolates with other E. coli pathovars to provide a preliminary perspective on loci that might be correlated with the MPEC phenotype. Both concatenated and consensus tree phylogenies did not support MPEC monophyly or the monophyly of either transient or persistent phenotypes. Three of the MPEC isolates (ECA-727, ECC-Z, and ECA-O157) originated from within the predominately commensal clade of E. coli, referred to as phylogroup A. The fourth MPEC isolate, of the persistent phenotype (ECC-1470), was sister group to an isolate of ETEC, falling within the E. coli B1 clade. This suggests that the MPEC phenotype has arisen on numerous independent occasions and that this has often, although not invariably, occurred from commensal ancestry. Examination of the genes present in the MPEC strains relative to the commensal strains identified a consistent presence of the type VI secretion system (T6SS) in the MPEC strains, with only occasional representation in commensal strains, suggesting that T6SS may be associated with MPEC pathogenesis and/or as an inter-bacterial competitive attribute and therefore could represent a useful target to explore for the development of MPEC specific inhibitors. PMID:25807497

  17. Genome Stability of Lyme Disease Spirochetes: Comparative Genomics of Borrelia burgdorferi Plasmids

    SciTech Connect

    Casjens S. R.; Dunn J.; Mongodin, E. F.; Qiu, W.-G.; Luft, B. J.; Schutzer, S. E.; Gilcrease, E. B.; Huang, W. M.; Vujadinovic, M.; Aron, J. K.; Vargas, L. C.; Freeman, S.; Radune, D.; Weidman, J. F.; Dimitrov, G. I.; Khouri, H. M.; Sosa, J. E.; Halpin, R. A.; Fraser, C. M.

    2012-03-14

    Lyme disease is the most common tick-borne human illness in North America. In order to understand the molecular pathogenesis, natural diversity, population structure and epizootic spread of the North American Lyme agent, Borrelia burgdorferi sensu stricto, a much better understanding of the natural diversity of its genome will be required. Towards this end we present a comparative analysis of the nucleotide sequences of the numerous plasmids of B. burgdorferi isolates B31, N40, JD1 and 297. These strains were chosen because they include the three most commonly studied laboratory strains, and because they represent different major genetic lineages and so are informative regarding the genetic diversity and evolution of this organism. A unique feature of Borrelia genomes is that they carry a large number of linear and circular plasmids, and this work shows that strains N40, JD1, 297 and B31 carry related but non-identical sets of 16, 20, 19 and 21 plasmids, respectively, that comprise 33-40% of their genomes. We deduce that there are at least 28 plasmid compatibility types among the four strains. The B. burgdorferi ~900 Kbp linear chromosomes are evolutionarily exceptionally stable, except for a short ≤20 Kbp plasmid-like section at the right end. A few of the plasmids, including the linear lp54 and circular cp26, are also very stable. We show here that the other plasmids, especially the linear ones, are considerably more variable. Nearly all of the linear plasmids have undergone one or more substantial inter-plasmid rearrangements since their last common ancestor. In spite of these rearrangements and differences in plasmid contents, the overall gene complement of the different isolates has remained relatively constant.

  18. Complete nucleotide sequence of the Cryptomeria japonica D. Don. chloroplast genome and comparative chloroplast genomics: diversified genomic structure of coniferous species

    PubMed Central

    Hirao, Tomonori; Watanabe, Atsushi; Kurita, Manabu; Kondo, Teiji; Takata, Katsuhiko

    2008-01-01

    Background: The recent determination of complete chloroplast (cp) genomic sequences of various plant species has enabled numerous comparative analyses as well as advances in plant and genome evolutionary studies. In angiosperms, the complete cp genome sequences of about 70 species have been determined, whereas those of only three gymnosperm species, Cycas taitungensis, Pinus thunbergii, and Pinus koraiensis have been established. The lack of information regarding the gene content and genomic structure of gymnosperm cp genomes may severely hamper further progress of plant and cp genome evolutionary studies. To address this need, we report here the complete nucleotide sequence of the cp genome of Cryptomeria japonica, the first in the Cupressaceae sensu lato of gymnosperms, and provide a comparative analysis of their gene content and genomic structure that illustrates the unique genomic features of gymnosperms. Results: The C. japonica cp genome is 131,810 bp in length, with 112 single copy genes and two duplicated (trnI-CAU, trnQ-UUG) genes that give a total of 116 genes. Compared to other land plant cp genomes, the C. japonica cp has lost one of the relevant large inverted repeats (IRs) found in angiosperms, fern, liverwort, and gymnosperms, such as Cycas and Ginkgo, and additionally has completely lost its trnR-CCG, partially lost its trnT-GGU, and shows diversification of accD. The genomic structure of the C. japonica cp genome also differs significantly from those of other plant species. For example, we estimate that a minimum of 15 inversions would be required to transform the gene organization of the Pinus thunbergii cp genome into that of C. japonica. In the C. japonica cp genome, direct repeat and inverted repeat sequences are observed at the inversion and translocation endpoints, and these sequences may be associated with the genomic rearrangements. Conclusion: The observed differences in genomic structure between C. japonica and other land plants, including

  19. CrusView: a Java-based visualization platform for comparative genomics analyses in Brassicaceae species.

    PubMed

    Chen, Hao; Wang, Xiangfeng

    2013-09-01

    In plants and animals, chromosomal breakage and fusion events based on conserved syntenic genomic blocks lead to conserved patterns of karyotype evolution among species of the same family. However, karyotype information has not been well utilized in genomic comparison studies. We present CrusView, a Java-based bioinformatic application utilizing Standard Widget Toolkit/Swing graphics libraries and a SQLite database for performing visualized analyses of comparative genomics data in Brassicaceae (crucifer) plants. Compared with similar software and databases, one of the unique features of CrusView is its integration of karyotype information when comparing two genomes. This feature allows users to perform karyotype-based genome assembly and karyotype-assisted genome synteny analyses with preset karyotype patterns of the Brassicaceae genomes. Additionally, CrusView is a local program, which gives its users high flexibility when analyzing unpublished genomes and allows users to upload self-defined genomic information so that they can visually study the associations between genome structural variations and genetic elements, including chromosomal rearrangements, genomic macrosynteny, gene families, high-frequency recombination sites, and tandem and segmental duplications between related species. This tool will greatly facilitate karyotype, chromosome, and genome evolution studies using visualized comparative genomics approaches in Brassicaceae species. CrusView is freely available at http://www.cmbb.arizona.edu/CrusView/.

  20. GPAC-genome presence/absence compiler: a web application to comparatively visualize multiple genome-level changes.

    PubMed

    Noll, Angela; Grundmann, Norbert; Churakov, Gennady; Brosius, Jürgen; Makałowski, Wojciech; Schmitz, Jürgen

    2015-01-01

    Our understanding of genome-wide and comparative sequence information has been broadened considerably by the databases available from the University of California Santa Cruz (UCSC) Genome Bioinformatics Department. In particular, the identification and visualization of genomic sequences, present in some species but absent in others, led to fundamental insights into gene and genome evolution. However, the UCSC tools currently enable one to visualize orthologous genomic loci for a range of species in only a single locus. For large-scale comparative analyses of such presence/absence patterns a multilocus view would be more desirable. Such a tool would enable us to compare thousands of relevant loci simultaneously and to resolve many different questions about, for example, phylogeny, specific aspects of genome and gene evolution, such as the gain or loss of exons and introns, the emergence of novel transposed elements, nonprotein-coding RNAs, and viral genomic particles. Here, we present the first tool to facilitate the parallel analysis of thousands of genomic loci for cross-species presence/absence patterns based on multiway genome alignments. This genome presence/absence compiler uses annotated or other compilations of coordinates of genomic locations and compiles all presence/absence patterns in a flexible, color-coded table linked to the individual UCSC Genome Browser alignments. We provide examples of the versatile information content of such a screening system especially for 7SL-derived transposed elements, nuclear mitochondrial DNA, DNA transposons, and miRNAs in primates (http://www.bioinformatics.uni-muenster.de/tools/gpac, last accessed October 1, 2014).
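
    The kind of presence/absence table this record describes can be illustrated with a short sketch. It is not GPAC's implementation: the locus names, species names, and the function `presence_absence_table` are all hypothetical, and the input is assumed to be a pre-computed set of loci detected in each species rather than a multiway genome alignment.

```python
def presence_absence_table(loci_by_species):
    """Build {locus: {species: '+' or '-'}} from per-species sets of detected loci."""
    species = sorted(loci_by_species)
    all_loci = sorted(set().union(*loci_by_species.values()))
    return {
        locus: {sp: "+" if locus in loci_by_species[sp] else "-" for sp in species}
        for locus in all_loci
    }


# Invented example data: which hypothetical loci were detected in which species.
detected = {
    "human": {"locus_1", "locus_2", "locus_3"},
    "chimp": {"locus_1", "locus_2"},
    "macaque": {"locus_1"},
}
table = presence_absence_table(detected)

# Print one row per locus, one column per species.
columns = sorted(detected)
print("locus".ljust(10) + " ".join(name.ljust(8) for name in columns))
for locus, row in table.items():
    print(locus.ljust(10) + " ".join(row[name].ljust(8) for name in columns))
```

    A locus present in human and chimp but absent in macaque, such as `locus_2` here, is the sort of pattern that can inform phylogenetic questions when thousands of loci are screened at once.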

  1. The Aspergillus Genome Database, a curated comparative genomics resource for gene, protein and sequence information for the Aspergillus research community.

    PubMed

    Arnaud, Martha B; Chibucos, Marcus C; Costanzo, Maria C; Crabtree, Jonathan; Inglis, Diane O; Lotia, Adil; Orvis, Joshua; Shah, Prachi; Skrzypek, Marek S; Binkley, Gail; Miyasato, Stuart R; Wortman, Jennifer R; Sherlock, Gavin

    2010-01-01

    The Aspergillus Genome Database (AspGD) is an online genomics resource for researchers studying the genetics and molecular biology of the Aspergilli. AspGD combines high-quality manual curation of the experimental scientific literature examining the genetics and molecular biology of Aspergilli, cutting-edge comparative genomics approaches to iteratively refine and improve structural gene annotations across multiple Aspergillus species, and web-based research tools for accessing and exploring the data. All of these data are freely available at http://www.aspgd.org. We welcome feedback from users and the research community at aspergillus-curator@genome.stanford.edu.

  2. Comparative Genomic Analysis of Meningitis- and Bacteremia-Causing Pneumococci Identifies a Common Core Genome.

    PubMed

    Kulohoma, Benard W; Cornick, Jennifer E; Chaguza, Chrispin; Yalcin, Feyruz; Harris, Simon R; Gray, Katherine J; Kiran, Anmol M; Molyneux, Elizabeth; French, Neil; Parkhill, Julian; Faragher, Brian E; Everett, Dean B; Bentley, Stephen D; Heyderman, Robert S

    2015-10-01

    Streptococcus pneumoniae is a nasopharyngeal commensal that occasionally invades normally sterile sites to cause bloodstream infection and meningitis. Although the pneumococcal population structure and evolutionary genetics are well defined, it is not clear whether pneumococci that cause meningitis are genetically distinct from those that do not. Here, we used whole-genome sequencing of 140 isolates of S. pneumoniae recovered from bloodstream infection (n = 70) and meningitis (n = 70) to compare their genetic contents. By fitting a double-exponential decaying-function model, we show that these isolates share a core of 1,427 genes (95% confidence interval [CI], 1,425 to 1,435 genes) and that there is no difference in the core genome or accessory gene content from these disease manifestations. Gene presence/absence alone therefore does not explain the virulence behavior of pneumococci that reach the meninges. Our analysis, however, supports the requirement of a range of previously described virulence factors and vaccine candidates for both meningitis- and bacteremia-causing pneumococci. This high-resolution view suggests that, despite considerable competency for genetic exchange, all pneumococci are under considerable pressure to retain key components advantageous for colonization and transmission and that these components are essential for access to and survival in sterile sites. PMID:26259813

  3. Comparative Genomic Analysis of Meningitis- and Bacteremia-Causing Pneumococci Identifies a Common Core Genome.

    PubMed

    Kulohoma, Benard W; Cornick, Jennifer E; Chaguza, Chrispin; Yalcin, Feyruz; Harris, Simon R; Gray, Katherine J; Kiran, Anmol M; Molyneux, Elizabeth; French, Neil; Parkhill, Julian; Faragher, Brian E; Everett, Dean B; Bentley, Stephen D; Heyderman, Robert S

    2015-10-01

    Streptococcus pneumoniae is a nasopharyngeal commensal that occasionally invades normally sterile sites to cause bloodstream infection and meningitis. Although the pneumococcal population structure and evolutionary genetics are well defined, it is not clear whether pneumococci that cause meningitis are genetically distinct from those that do not. Here, we used whole-genome sequencing of 140 isolates of S. pneumoniae recovered from bloodstream infection (n = 70) and meningitis (n = 70) to compare their genetic contents. By fitting a double-exponential decaying-function model, we show that these isolates share a core of 1,427 genes (95% confidence interval [CI], 1,425 to 1,435 genes) and that there is no difference in the core genome or accessory gene content from these disease manifestations. Gene presence/absence alone therefore does not explain the virulence behavior of pneumococci that reach the meninges. Our analysis, however, supports the requirement of a range of previously described virulence factors and vaccine candidates for both meningitis- and bacteremia-causing pneumococci. This high-resolution view suggests that, despite considerable competency for genetic exchange, all pneumococci are under considerable pressure to retain key components advantageous for colonization and transmission and that these components are essential for access to and survival in sterile sites.

  4. Comparative genomic analysis reveals bilateral breast cancers are genetically independent.

    PubMed

    Song, Fangfang; Li, Xiangchun; Song, Fengju; Zhao, Yanrui; Li, Haixin; Zheng, Hong; Gao, Zhibo; Wang, Jun; Zhang, Wei; Chen, Kexin

    2015-10-13

    Bilateral breast cancer (BBC) poses a major challenge for oncologists because of the cryptic relationship between the two lesions. The purpose of this study was to determine the origin of the contralateral breast cancer (either dependent or independent of the index tumor). Here, we used ultra-deep whole-exome sequencing and array comparative genomic hybridization (aCGH) to study four paired samples of BBCs with different tumor subtypes and time intervals between the development of each tumor. We used two paired primary breast tumors and corresponding metastatic liver lesions as the control. We tested the origin-independent nature of BBC in three ways: mutational concordance, mutational signature clustering, and clonality analysis using copy number profiles. We found that the paired BBC samples had near-zero concordant mutation rates, which were much lower than those of the paired primary/metastasis samples. The results of a mutational signature analysis also suggested that BBCs are independent of one another. A clonality analysis using aCGH data further revealed that paired BBC samples were clonally independent, in contrast to the clonally related origin found for paired primary/metastasis samples. Our preliminary findings show that BBCs in Han Chinese women are origin independent and thus should be treated separately. PMID:26378809

  5. Comparative Transcriptional and Genomic Analysis of Plasmodium falciparum Field Isolates

    PubMed Central

    Mackinnon, Margaret J.; Li, Jinguang; Mok, Sachel; Kortok, Moses M.; Marsh, Kevin; Preiser, Peter R.; Bozdech, Zbynek

    2009-01-01

    Mechanisms for differential regulation of gene expression may underlie much of the phenotypic variation and adaptability of malaria parasites. Here we describe transcriptional variation among culture-adapted field isolates of Plasmodium falciparum, the species responsible for most malarial disease. It was found that genes coding for parasite protein export into the red cell cytosol and onto its surface, and genes coding for sexual stage proteins involved in parasite transmission are up-regulated in field isolates compared with long-term laboratory isolates. Much of this variability was associated with the loss of small or large chromosomal segments, or other forms of gene copy number variation that are prevalent in the P. falciparum genome (copy number variants, CNVs). Expression levels of genes inside these segments were correlated to that of genes outside and adjacent to the segment boundaries, and this association declined with distance from the CNV boundary. This observation could not be explained by copy number variation in these adjacent genes. This suggests a local-acting regulatory role for CNVs in transcription of neighboring genes and helps explain the chromosomal clustering that we observed here. Transcriptional co-regulation of physical clusters of adaptive genes may provide a way for the parasite to readily adapt to its highly heterogeneous and strongly selective environment. PMID:19898609

  6. Sequence and comparative genomic analysis of actin-related proteins.

    PubMed

    Muller, Jean; Oma, Yukako; Vallar, Laurent; Friederich, Evelyne; Poch, Olivier; Winsor, Barbara

    2005-12-01

    Actin-related proteins (ARPs) are key players in cytoskeleton activities and nuclear functions. Two complexes, ARP2/3 and ARP1/11, also known as dynactin, are implicated in actin dynamics and in microtubule-based trafficking, respectively. ARP4 to ARP9 are components of many chromatin-modulating complexes. Conventional actins and ARPs codefine a large family of homologous proteins, the actin superfamily, with a tertiary structure known as the actin fold. Because ARPs and actin share high sequence conservation, clear family definition requires distinct features to easily and systematically identify each subfamily. In this study we performed an in depth sequence and comparative genomic analysis of ARP subfamilies. A high-quality multiple alignment of approximately 700 complete protein sequences homologous to actin, including 148 ARP sequences, allowed us to extend the ARP classification to new organisms. Sequence alignments revealed conserved residues, motifs, and inserted sequence signatures to define each ARP subfamily. These discriminative characteristics allowed us to develop ARPAnno (http://bips.u-strasbg.fr/ARPAnno), a new web server dedicated to the annotation of ARP sequences. Analyses of sequence conservation among actins and ARPs highlight part of the actin fold and suggest interactions between ARPs and actin-binding proteins. Finally, analysis of ARP distribution across eukaryotic phyla emphasizes the central importance of nuclear ARPs, particularly the multifunctional ARP4.

  7. Comparative genomics of Geobacter chemotaxis genes reveals diverse signaling function

    PubMed Central

    Tran, Hoa T; Krushkal, Julia; Antommattei, Frances M; Lovley, Derek R; Weis, Robert M

    2008-01-01

    Background Geobacter species are δ-Proteobacteria and are often the predominant species in a variety of sedimentary environments where Fe(III) reduction is important. Their ability to remediate contaminated environments and produce electricity makes them attractive for further study. Cell motility, biofilm formation, and type IV pili all appear important for the growth of Geobacter in changing environments and for electricity production. Recent studies in other bacteria have demonstrated that signaling pathways homologous to the paradigm established for Escherichia coli chemotaxis can regulate type IV pili-dependent motility, the synthesis of flagella and type IV pili, the production of extracellular matrix material, and biofilm formation. The classification of these pathways by comparative genomics improves the ability to understand how Geobacter thrives in natural environments and better their use in microbial fuel cells. Results The genomes of G. sulfurreducens, G. metallireducens, and G. uraniireducens contain multiple (~70) homologs of chemotaxis genes arranged in several major clusters (six, seven, and seven, respectively). Unlike the single gene cluster of E. coli, the Geobacter clusters are not all located near the flagellar genes. The probable functions of some Geobacter clusters are assignable by homology to known pathways; others appear to be unique to the Geobacter sp. and contain genes of unknown function. We identified large numbers of methyl-accepting chemotaxis protein (MCP) homologs that have diverse sensing domain architectures and generate a potential for sensing a great variety of environmental signals. We discuss mechanisms for class-specific segregation of the MCPs in the cell membrane, which serve to maintain pathway specificity and diminish crosstalk. Finally, the regulation of gene expression in Geobacter differs from E. coli. The sequences of predicted promoter elements suggest that the alternative sigma factors σ28 and σ54 play a role

  8. Complete genome sequencing and comparative analysis of the linezolid-resistant Enterococcus faecalis strain DENG1.

    PubMed

    Yu, Zhijian; Chen, Zhong; Cheng, Hang; Zheng, Jinxin; Li, Duoyun; Deng, Xiangbin; Pan, Weiguang; Yang, Weizhi; Deng, Qiwen

    2014-07-01

    Genome level analysis of bacterial strains provides information on genetic composition and resistance mechanisms to clinically relevant antibiotics. To date, whole genome characterization of linezolid-resistant Enterococcus faecalis isolated in the clinic is lacking. In this study, we report the entire genome sequence, genomic characteristics and virulence factors of a pathogenic E. faecalis strain, DENG1. Our results showed considerable differences in genomic characteristics and virulence factors compared with other E. faecalis strains (V583 and OG1RF). The genome of this LZD-resistant E. faecalis strain can be used as a reference to study the mechanism of LZD resistance and the phylogenetic relationship of E. faecalis strains worldwide.

  9. Functional and Comparative Genomics of Lignocellulose Degradation by Schizophyllum commune

    SciTech Connect

    Ohm, Robin A.; Lee, Hanbyul; Park, Hongjae; Brewer, Heather M.; Carver, Akiko; Copeland, Alex; Grimwood, Jane; Lindquist, Erika; Lipzen, Anna; Martin, Joel; Purvine, Samuel O.; Schackwitz, Wendy; Tegelaar, Martin; Tritt, Andrew; Baker, Scott; Choi, In-Geol; Lugones, Luis G.; Wosten, Han A. B.; Grigoriev, Igor V.

    2014-03-14

    The Basidiomycete fungus Schizophyllum commune is a wood-decaying fungus and is used as a model system to study lignocellulose degradation. Version 3.0 of the genome assembly filled 269 of 316 sequence gaps and added 680 kb of sequence. This new assembly was reannotated using RNAseq transcriptomics data, and this resulted in 3110 (24%) more genes. Two additional S. commune strains with different wood-decaying properties were sequenced, from Tattone (France) and Loenen (The Netherlands). Sequence comparison shows remarkably high sequence diversity between the strains. The overall SNP rate of > 100 SNPs/kb is among the highest rates of within-species polymorphisms in Basidiomycetes. Some well-described proteins like hydrophobins and transcription factors have less than 70% sequence identity among the strains. Some chromosomes are better conserved than others and in some cases large parts of chromosomes are missing from one or more strains. Gene expression on glucose, cellulose and wood was analyzed in two S. commune strains. Overall, gene expression correlated between the two strains, but there were some notable exceptions. Of particular interest are CAZymes (carbohydrate-active enzymes) that are regulated in different ways in the different strains. In both strains the transcription factor Fsp1 was strongly up-regulated during growth on cellulose and wood, when compared to glucose. Over-expression of Fsp1 using a constitutive promoter resulted in higher cellulose and xylose-degrading enzyme activity, which suggests that Fsp1 is involved in regulating CAZyme gene expression. Two CAZyme genes (of family GH61 and GH11) were shown to be strongly up-regulated during growth on cellulose, compared to glucose. Proteomics on the secreted proteins in the growth medium confirmed this. A promoter analysis revealed the shortest active promoters for these two genes, as well as putative transcription factor binding sites.

  10. Genomic-associated Markers and comparative Genome Maps of Xanthomonas oryzae pv. oryzae and X. oryzae pv. oryzicola.

    PubMed

    Feng, Wenjie; Wang, Yi; Huang, Lisha; Feng, Chuanshun; Chu, Zhaohui; Ding, Xinhua; Yang, Long

    2015-09-01

    Xanthomonas oryzae pv. oryzae (Xoo) and X. oryzae pv. oryzicola (Xoc) cause two major seed quarantine diseases in rice, bacterial blight and bacterial leaf streak, respectively. Xoo and Xoc share high similarity in genomic sequence, which makes the two pathogens hard to differentiate. The Genomic-associated Markers and comparative Genome Maps database (GMGM) is an integrated database providing comprehensive information including compared genome maps and full genomic-coverage molecular markers of Xoo and Xoc. This database was established based on bioinformatic analysis of completely sequenced genomes of several X. oryzae pathovars, of which the similarity of the genomes was up to 91.39%. The program was designed with a series of specific PCR primers, including 286 pairs of Xoo dominant markers, 288 pairs of Xoc dominant markers, and 288 pairs of Xoo and Xoc co-dominant markers, which were predicted to distinguish the two pathovars. A test on a total of 40 donor pathogen strains using 120 randomly selected pairs of primers demonstrated that over 52.5% of the primers were efficacious. The GMGM web portal ( http://biodb.sdau.edu.cn/gmgm/ ) will be a powerful tool that can present highly specific diagnostic markers, and it also provides information about comparative genome maps of the two pathogens for future evolutionary studies.

  11. A high-density linkage map enables a second-generation collared flycatcher genome assembly and reveals the patterns of avian recombination rate variation and chromosomal evolution

    PubMed Central

    Kawakami, Takeshi; Smeds, Linnéa; Backström, Niclas; Husby, Arild; Qvarnström, Anna; Mugal, Carina F; Olason, Pall; Ellegren, Hans

    2014-01-01

    Detailed linkage and recombination rate maps are necessary to use the full potential of genome sequencing and population genomic analyses. We used a custom collared flycatcher 50 K SNP array to develop a high-density linkage map with 37 262 markers assigned to 34 linkage groups in 33 autosomes and the Z chromosome. The best-order map contained 4215 markers, with a total distance of 3132 cM and a mean genetic distance between markers of 0.12 cM. Facilitated by the array being designed to include markers from most scaffolds, we obtained a second-generation assembly of the flycatcher genome that approaches full chromosome sequences (N50 super-scaffold size 20.2 Mb and with 1.042 Gb (of 1.116 Gb) anchored to and mostly ordered and oriented along chromosomes). We found that flycatcher and zebra finch chromosomes are entirely syntenic but that inversions at mean rates of 1.5–2.0 events (6.6–7.5 Mb) per My have changed the organization within chromosomes, rates high enough for inversions to potentially have been involved with many speciation events during avian evolution. The mean recombination rate was 3.1 cM/Mb and correlated closely with chromosome size, from 2 cM/Mb for chromosomes >100 Mb to >10 cM/Mb for chromosomes <10 Mb. This size dependence seemed entirely due to an obligate recombination event per chromosome; if 50 cM was subtracted from the genetic lengths of chromosomes, the rate per physical unit DNA was constant across chromosomes. Flycatcher recombination rate showed similar variation along chromosomes as chicken but lacked the large interior recombination deserts characteristic of zebra finch chromosomes. PMID:24863701
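
    The size dependence described in this abstract can be illustrated with a small worked example. The sketch below assumes a simple model in which every chromosome carries one obligate crossover (50 centimorgans) plus a contribution proportional to its physical size. The 50-unit offset comes from the abstract; the baseline of 1.5 per Mb and the function names are hypothetical, chosen only so that the numbers land near the ranges quoted above. It is not the authors' analysis.

```python
# Toy model: genetic length = obligate crossover + baseline * physical size.
OBLIGATE_CROSSOVER = 50.0   # genetic-map units per chromosome (from the abstract)
BASELINE_PER_MB = 1.5       # hypothetical constant rate per Mb (assumption)


def genetic_length(size_mb, baseline=BASELINE_PER_MB):
    """Genetic length of a chromosome of `size_mb` megabases under the toy model."""
    return OBLIGATE_CROSSOVER + baseline * size_mb


def recombination_rate(size_mb, baseline=BASELINE_PER_MB):
    """Raw rate per Mb: small chromosomes look 'hot' because of the fixed offset."""
    return genetic_length(size_mb, baseline) / size_mb


def adjusted_rate(size_mb, baseline=BASELINE_PER_MB):
    """Rate per Mb after subtracting the obligate crossover from the genetic length."""
    return (genetic_length(size_mb, baseline) - OBLIGATE_CROSSOVER) / size_mb


if __name__ == "__main__":
    for size_mb in (5, 20, 100):
        print(size_mb, recombination_rate(size_mb), adjusted_rate(size_mb))
```

    Under these assumed values a 100 Mb chromosome has a raw rate of 2 per Mb and a 5 Mb chromosome exceeds 10 per Mb, while the adjusted rate is the same for both, which is the pattern the abstract reports.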

  12. Comparative ruminant genomics highlights segmental duplication and mobile element insertion diversity

    Technology Transfer Automated Retrieval System (TEKTRAN)

    We have expanded upon a previously reported comparative genomics approach using a read-depth (JaRMs) and a hybrid read-pair, split-read (RAPTR-SV) copy number variation (CNV) detection method that uses read alignments to the cattle reference genome in order to identify species-specific genomic rearr...

  13. Avian Influenza

    MedlinePlus

    ... infectious viral disease of birds. Most avian influenza viruses do not infect humans; however some, such as ... often causing no apparent signs of illness. AI viruses can sometimes spread to domestic poultry and cause ...

  14. Avian Wings

    NASA Technical Reports Server (NTRS)

    Liu, Tianshu; Kuykendoll, K.; Rhew, R.; Jones, S.

    2004-01-01

    This paper describes the avian wing geometry (Seagull, Merganser, Teal and Owl) extracted from non-contact surface measurements using a three-dimensional laser scanner. The geometric quantities, including the camber line and thickness distribution of airfoil, wing planform, chord distribution, and twist distribution, are given in convenient analytical expressions. Thus, the avian wing surfaces can be generated and the wing kinematics can be simulated. The aerodynamic characteristics of avian airfoils in steady inviscid flows are briefly discussed. The avian wing kinematics is recovered from videos of three level-flying birds (Crane, Seagull and Goose) based on a two-jointed arm model. A flapping seagull wing in the 3D physical space is re-constructed from the extracted wing geometry and kinematics.

  15. Avian botulism

    USGS Publications Warehouse

    Friend, Milton; Locke, Louis N.; Kennelly, James J.

    1985-01-01

    What is avian botulism? Avian botulism, or Western duck sickness, is one of the three most important disease problems of wild migratory birds. Each year, many birds are paralyzed or die after exposure to a toxin produced by the botulinum bacterium. Two of the seven toxin types that have been identified cause mortality in wild birds; one of these types, type C, is most often associated with die-offs of ducks, while type E primarily affects gulls and loons.

  16. Avian Flu

    SciTech Connect

    Eckburg, Paul

    2006-11-06

    Since 2003, a severe form of H5N1 avian influenza has rapidly spread throughout Asia and Europe, infecting over 200 humans in 10 countries. The spread of H5N1 virus from person-to-person has been rare, thus preventing the emergence of a widespread pandemic. However, this ongoing epidemic continues to pose an important public health threat. Avian flu and its pandemic potential in humans will be discussed.

  17. Comparative genomic survey of microbial arylamine N-acetyltransferases

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Introduction: Microorganisms are constantly exposed to exogenous chemical influences. Our previous genomic surveys have identified putative NAT genes across a phylogenetic spectrum of prokaryotic and eukaryotic microorganisms. We are currently pursuing two lines of investigation: The first looks int...

  18. Metagenome Skimming of Insect Specimen Pools: Potential for Comparative Genomics.

    PubMed

    Linard, Benjamin; Crampton-Platt, Alex; Gillett, Conrad P D T; Timmermans, Martijn J T N; Vogler, Alfried P

    2015-06-01

    Metagenomic analyses are challenging in metazoans, but high-copy number and repeat regions can be assembled from low-coverage sequencing by "genome skimming," which is applied here as a new way of characterizing metagenomes obtained in an ecological or taxonomic context. Illumina shotgun sequences from two pools of Coleoptera (beetles) of approximately 200 species each were assembled into tens of thousands of scaffolds. Repeated low-coverage sequencing recovered similar scaffold sets consistently, although approximately 70% of scaffolds could not be identified against existing genome databases. Identifiable scaffolds included mitochondrial DNA, conserved sequences with hits to expressed sequence tag and protein databases, and known repeat elements of high and low complexity, including numerous copies of rRNA and histone genes. Assemblies of histones captured a diversity of gene order and primary sequence in Coleoptera. Scaffolds with similarity to multiple sites in available coleopteran genome sequences for Dendroctonus and Tribolium revealed high specificity of scaffolds to either of these genomes, in particular for high-copy number repeats. Numerous "clusters" of scaffolds mapped to the same genomic site revealed intra- and/or intergenomic variation within a metagenome pool. In addition to the effect of taxonomic composition of the metagenomes, the number of mapped scaffolds also revealed structural differences between the two reference genomes, although the significance of this striking finding remains unclear. Finally, apparently exogenous sequences were recovered, including potential food plants, fungal pathogens, and bacterial symbionts. The "metagenome skimming" approach is useful for capturing the genomic diversity of poorly studied, species-rich lineages and opens new prospects in environmental genomics. PMID:25979752

  20. PlantGDB: a resource for comparative plant genomics

    PubMed Central

    Duvick, Jon; Fu, Ann; Muppirala, Usha; Sabharwal, Mukul; Wilkerson, Matthew D.; Lawrence, Carolyn J.; Lushbough, Carol; Brendel, Volker

    2008-01-01

    PlantGDB (http://www.plantgdb.org/) is a genomics database encompassing sequence data for green plants (Viridiplantae). PlantGDB provides annotated transcript assemblies for >100 plant species, with transcripts mapped to their cognate genomic context where available, integrated with a variety of sequence analysis tools and web services. For 14 plant species with emerging or complete genome sequence, PlantGDB's genome browsers (xGDB) serve as a graphical interface for viewing, evaluating and annotating transcript and protein alignments to chromosome or bacterial artificial chromosome (BAC)-based genome assemblies. Annotation is facilitated by the integrated yrGATE module for community curation of gene models. Novel web services at PlantGDB include Tracembler, an iterative alignment tool that generates contigs from GenBank trace file data and BioExtract Server, a web-based server for executing custom sequence analysis workflows. PlantGDB also hosts a plant genomics research outreach portal (PGROP) that facilitates access to a large number of resources for research and training. PMID:18063570

  1. Kiwifruit Information Resource (KIR): a comparative platform for kiwifruit genomics.

    PubMed

    Yue, Junyang; Liu, Jian; Ban, Rongjun; Tang, Wei; Deng, Lin; Fei, Zhangjun; Liu, Yongsheng

    2015-01-01

    The Kiwifruit Information Resource (KIR) is dedicated to maintaining and integrating comprehensive datasets on genomics, functional genomics and transcriptomics of kiwifruit (Actinidiaceae). KIR serves as a central access point for existing/new genomic and genetic data. KIR also provides researchers with a variety of visualization and analysis tools. Current developments include the updated genome structure of Actinidia chinensis cv. Hongyang and its newest genome annotation, putative transcripts, gene expression, physical markers of genetic traits as well as relevant publications based on the latest genome assembly. Nine thousand five hundred and forty-seven new transcripts are detected and 21 132 old transcripts are changed. At the present release, the next-generation transcriptome sequencing data have been incorporated into gene models and splice variants. Protein-protein interactions are also identified based on experimentally determined orthologous interactions. Furthermore, the experimental results reported in peer-reviewed literature are manually extracted and integrated within a well-developed query page. In total, 122 identifications are currently associated, including commonly used gene names and symbols. All KIR datasets help to facilitate a broad range of kiwifruit research topics and are freely available to the research community. Database URL: http://bdg.hfut.edu.cn/kir/index.html. PMID:26656885

  3. Hidden Markov models for evolution and comparative genomics analysis.

    PubMed

    Bykova, Nadezda A; Favorov, Alexander V; Mironov, Andrey A

    2013-01-01

    The problem of reconstruction of ancestral states given a phylogeny and data from extant species arises in a wide range of biological studies. The continuous-time Markov model for the evolution of discrete states is generally used for the reconstruction of ancestral states. We modify this model to account for a case when the states of the extant species are uncertain. This situation appears, for example, if the states for extant species are predicted by some program and thus are known only with some level of reliability; this is common in bioinformatics. The main idea is to formulate the problem as a hidden Markov model on a tree (tree HMM, tHMM), where the basic continuous-time Markov model is expanded with the introduction of emission probabilities of observed data (e.g. prediction scores) for each underlying discrete state. Our tHMM decoding algorithm allows us to predict states at the ancestral nodes as well as to refine states at the leaves on the basis of quantitative comparative genomics. The test on the simulated data shows that the tHMM approach applied to the continuous variable reflecting the probabilities of the states (i.e. prediction score) appears to be more accurate than the reconstruction from the discrete state assignment defined by the best score threshold. We provide examples of applying our model to the evolutionary analysis of N-terminal signal peptides and transcription factor binding sites in bacteria. The program is freely available at http://bioinf.fbb.msu.ru/~nadya/tHMM and via web-service at http://bioinf.fbb.msu.ru/treehmmweb.
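
    To make the tree-HMM idea concrete, here is a minimal sketch, not the authors' program, of the upward (pruning) pass for a two-state continuous-time Markov model in which each leaf contributes an emission likelihood per hidden state instead of a hard state call. The function names, the dictionary-based tree layout, the rates and the emission values are all assumptions made for illustration, and the downward pass that would refine the leaf states is omitted.

```python
import math


def transition_matrix(rate01, rate10, t):
    """Two-state continuous-time Markov chain: P[i][j] = Pr(state j at time t | state i at 0)."""
    total = rate01 + rate10
    decay = math.exp(-total * t)
    pi0, pi1 = rate10 / total, rate01 / total  # stationary frequencies
    return [
        [pi0 + pi1 * decay, pi1 * (1.0 - decay)],
        [pi0 * (1.0 - decay), pi1 + pi0 * decay],
    ]


def partial_likelihood(node, rate01, rate10):
    """Likelihood of the data below `node` for each hidden state of `node`."""
    if "emission" in node:
        # Leaf: Pr(observed score | hidden state), supplied for states 0 and 1.
        return list(node["emission"])
    result = [1.0, 1.0]
    for child, branch_length in node["children"]:
        child_like = partial_likelihood(child, rate01, rate10)
        p = transition_matrix(rate01, rate10, branch_length)
        for state in (0, 1):
            result[state] *= sum(p[state][c] * child_like[c] for c in (0, 1))
    return result


def root_posterior(root, rate01, rate10, prior=(0.5, 0.5)):
    """Posterior probability of each hidden state at the root."""
    like = partial_likelihood(root, rate01, rate10)
    joint = [prior[s] * like[s] for s in (0, 1)]
    total = sum(joint)
    return [j / total for j in joint]


# Hypothetical three-leaf tree; emission values stand in for prediction scores.
leaf_a = {"emission": (0.1, 0.9)}
leaf_b = {"emission": (0.2, 0.8)}
leaf_c = {"emission": (0.6, 0.4)}
inner = {"children": [(leaf_a, 0.1), (leaf_b, 0.1)]}
root = {"children": [(inner, 0.2), (leaf_c, 0.3)]}

if __name__ == "__main__":
    print(root_posterior(root, rate01=1.0, rate10=1.0))
```

    Keeping the soft emission likelihoods at the leaves, rather than thresholding the scores into 0 or 1 first, is the distinction the abstract draws between the tHMM and reconstruction from discrete state assignments.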

  4. Comparative genomics boosts target prediction for bacterial small RNAs.

    PubMed

    Wright, Patrick R; Richter, Andreas S; Papenfort, Kai; Mann, Martin; Vogel, Jörg; Hess, Wolfgang R; Backofen, Rolf; Georg, Jens

    2013-09-10

    Small RNAs (sRNAs) constitute a large and heterogeneous class of bacterial gene expression regulators. Much like eukaryotic microRNAs, these sRNAs typically target multiple mRNAs through short seed pairing, thereby acting as global posttranscriptional regulators. In some bacteria, evidence for hundreds to possibly more than 1,000 different sRNAs has been obtained by transcriptome sequencing. However, the experimental identification of possible targets and, therefore, their confirmation as functional regulators of gene expression has remained laborious. Here, we present a strategy that integrates phylogenetic information to predict sRNA targets at the genomic scale and reconstructs regulatory networks upon functional enrichment and network analysis (CopraRNA, for Comparative Prediction Algorithm for sRNA Targets). Furthermore, CopraRNA precisely predicts the sRNA domains for target recognition and interaction. When applied to several model sRNAs, CopraRNA revealed additional targets and functions for the sRNAs CyaR, FnrS, RybB, RyhB, SgrS, and Spot42. Moreover, the mRNAs gdhA, lrp, marA, nagZ, ptsI, sdhA, and yobF-cspC were suggested as regulatory hubs targeted by up to seven different sRNAs. The verification of many previously undetected targets by CopraRNA, even for extensively investigated sRNAs, demonstrates its advantages and shows that CopraRNA-based analyses can compete with experimental target prediction approaches. A Web interface allows high-confidence target prediction and efficient classification of bacterial sRNAs.

  5. Genomicus update 2015: KaryoView and MatrixView provide a genome-wide perspective to multispecies comparative genomics.

    PubMed

    Louis, Alexandra; Nguyen, Nga Thi Thuy; Muffato, Matthieu; Roest Crollius, Hugues

    2015-01-01

    The Genomicus web server (http://www.genomicus.biologie.ens.fr/genomicus) is a visualization tool allowing comparative genomics in four different phyla (Vertebrate, Fungi, Metazoan and Plants). It provides access to genomic information from extant species, as well as ancestral gene content and gene order for vertebrates and flowering plants. Here we present the new features available for vertebrate genomes with a focus on new graphical tools. The interface to enter the database has been improved, two pairwise genome comparison tools are now available (KaryoView and MatrixView) and the multiple genome comparison tools (PhyloView and AlignView) propose three new kinds of representation and a more intuitive menu. These new developments have been implemented for the Genomicus portal dedicated to vertebrates. This allows the analysis of 68 extant animal genomes, as well as 58 ancestral reconstructed genomes. The Genomicus server also provides access to ancestral gene orders, to facilitate evolutionary and comparative genomics studies, as well as computationally predicted regulatory interactions, thanks to the representation of conserved non-coding elements with their putative gene targets.

  6. Genomicus update 2015: KaryoView and MatrixView provide a genome-wide perspective to multispecies comparative genomics

    PubMed Central

    Louis, Alexandra; Nguyen, Nga Thi Thuy; Muffato, Matthieu; Roest Crollius, Hugues

    2015-01-01

    The Genomicus web server (http://www.genomicus.biologie.ens.fr/genomicus) is a visualization tool allowing comparative genomics in four different phyla (Vertebrate, Fungi, Metazoan and Plants). It provides access to genomic information from extant species, as well as ancestral gene content and gene order for vertebrates and flowering plants. Here we present the new features available for vertebrate genomes with a focus on new graphical tools. The interface to enter the database has been improved, two pairwise genome comparison tools are now available (KaryoView and MatrixView) and the multiple genome comparison tools (PhyloView and AlignView) propose three new kinds of representation and a more intuitive menu. These new developments have been implemented for the Genomicus portal dedicated to vertebrates. This allows the analysis of 68 extant animal genomes, as well as 58 ancestral reconstructed genomes. The Genomicus server also provides access to ancestral gene orders, to facilitate evolutionary and comparative genomics studies, as well as computationally predicted regulatory interactions, thanks to the representation of conserved non-coding elements with their putative gene targets. PMID:25378326

  7. Construction of a California condor BAC library and first-generation chicken-condor comparative physical map as an endangered species conservation genomics resource.

    PubMed

    Romanov, Michael N; Koriabine, Maxim; Nefedov, Mikhail; de Jong, Pieter J; Ryder, Oliver A

    2006-12-01

    To support genomic analysis of the endangered California condor (Gymnogyps californianus), a BAC library (CHORI-262) was generated using DNA from the blood of a female. The library consists of 89,665 recombinant BAC clones providing approximately 14-fold coverage of the presumed approximately 1.48-Gb genome. Taking advantage of recent progress in chicken genomics, we developed a first-generation comparative chicken-condor physical map using an overgo hybridization approach. The overgos were derived from chicken (164 probes) and New World vulture (8 probes) sequences. Screening a 2.8x subset of the total library resulted in 236 BAC-gene assignments with 2.5 positive BAC clones per successful probe. A preliminary comparative chicken-condor BAC-based map included 93 genes. Comparison of selected condor BAC sequences with orthologous chicken sequences suggested a high degree of conserved synteny between the two avian genomes. This work will aid in identification and characterization of candidate loci for the chondrodystrophy mutation to advance genetic management of this disease.
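
    The library figures quoted in this abstract can be cross-checked with some back-of-envelope arithmetic. The sketch below uses only the numbers given above (89,665 clones, roughly 14-fold coverage, a presumed 1.48-Gb genome, a 2.8x screened subset, 236 assignments at 2.5 clones per successful probe). The function names are hypothetical, and the derived quantities are rough implications of those approximate figures, not values reported by the authors.

```python
# Figures taken from the abstract (coverage and genome size are approximate there).
TOTAL_CLONES = 89_665
GENOME_BP = 1.48e9
TOTAL_COVERAGE = 14.0
SUBSET_COVERAGE = 2.8
BAC_GENE_ASSIGNMENTS = 236
CLONES_PER_SUCCESSFUL_PROBE = 2.5


def implied_mean_insert_bp(clones, genome_bp, coverage):
    """Mean insert size implied by coverage = clones * insert / genome size."""
    return coverage * genome_bp / clones


def clones_in_subset(clones, total_coverage, subset_coverage):
    """Number of clones needed to reach `subset_coverage` from the same library."""
    return clones * subset_coverage / total_coverage


def successful_probes(assignments, clones_per_probe):
    """Approximate number of probes that yielded at least one positive clone."""
    return assignments / clones_per_probe


if __name__ == "__main__":
    print(round(implied_mean_insert_bp(TOTAL_CLONES, GENOME_BP, TOTAL_COVERAGE)))
    print(round(clones_in_subset(TOTAL_CLONES, TOTAL_COVERAGE, SUBSET_COVERAGE)))
    print(round(successful_probes(BAC_GENE_ASSIGNMENTS, CLONES_PER_SUCCESSFUL_PROBE)))
```

    Read this way, the stated coverage implies a mean insert of roughly 231 kb, and the screened 2.8x subset corresponds to about one fifth of the library.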

  8. Metabolic peculiarities of Aspergillus niger disclosed by comparative metabolic genomics

    PubMed Central

    Sun, Jibin; Lu, Xin; Rinas, Ursula; Zeng, An Ping

    2007-01-01

    Background Aspergillus niger is an important industrial microorganism for the production of both metabolites, such as citric acid, and proteins, such as fungal enzymes or heterologous proteins. Despite its extensive industrial applications, the genetic inventory of this fungus is only partially understood. The recently released genome sequence opens a new horizon for both scientific studies and biotechnological applications. Results Here, we present the first genome-scale metabolic network for A. niger and an in-depth genomic comparison of this species to seven other fungi to disclose its metabolic peculiarities. The raw genomic sequences of A. niger ATCC 9029 were first annotated. The reconstructed metabolic network is based on the annotation of two A. niger genomes, CBS 513.88 and ATCC 9029, including enzymes with 988 unique EC numbers, 2,443 reactions and 2,349 metabolites. More than 1,100 enzyme-coding genes are unique to A. niger in comparison to the other seven fungi. For example, we identified additional copies of genes such as those encoding alternative mitochondrial oxidoreductase and citrate synthase in A. niger, which might contribute to the high citric acid production efficiency of this species. Moreover, nine genes were identified as encoding enzymes with EC numbers exclusively found in A. niger, mostly involved in the biosynthesis of complex secondary metabolites and degradation of aromatic compounds. Conclusion The genome-level reconstruction of the metabolic network and genome-based metabolic comparison disclose peculiarities of A. niger highly relevant to its biotechnological applications and should contribute to future rational metabolic design and systems biology studies of this black mold and related species. PMID:17784953

  9. Comparative Genomic Analysis Identifies Divergent Genomic Features of Pathogenic Enterococcus cecorum Including a Type IC CRISPR-Cas System, a Capsule Locus, an epa-Like Locus, and Putative Host Tissue Binding Proteins

    PubMed Central

    Borst, Luke B.; Suyemoto, M. Mitsu; Scholl, Elizabeth H.; Fuller, Fredrick J.; Barnes, H. John

    2015-01-01

    Enterococcus cecorum (EC) is the dominant enteric commensal of adult chickens and contributes to the gut consortia of many avian and mammalian species. While EC infection is an uncommon zoonosis, like other enterococcal species it can cause life-threatening nosocomial infection in people. In contrast to other enterococci which are considered opportunistic pathogens, emerging pathogenic strains of EC cause outbreaks of musculoskeletal disease in broiler chickens. Typical morbidity and mortality is comparable to other important infectious diseases of poultry. In molecular epidemiologic studies, pathogenic EC strains were found to be genetically clonal. These findings suggested acquisition of specific virulence determinants by pathogenic EC. To identify divergent genomic features and acquired virulence determinants in pathogenic EC, comparative genomic analysis was performed on genomes of 3 pathogenic and 3 commensal strains of EC. Pathogenic isolates had smaller genomes with a higher GC content, and they demonstrated large regions of synteny compared to commensal isolates. A molecular phylogenetic analysis demonstrated sequence divergence in pathogenic EC genomes. At a threshold of 98% identity, 414 predicted proteins were identified that were highly conserved in pathogenic EC but not in commensal EC. Among these, divergent CRISPR-cas defense loci were observed. In commensal EC, the type IIA arrangement typical for enterococci was present; however, pathogenic EC had a type IC locus, which is novel in enterococci but commonly observed in streptococci. Potential mediators of virulence identified in this analysis included a polysaccharide capsular locus similar to that recently described for E. faecium, an epa-like locus, and cell wall associated proteins which may bind host extracellular matrix. This analysis identified specific genomic regions, coding sequences, and predicted proteins which may be related to the divergent evolution and increased virulence of emerging

  10. Comparative genomic analysis identifies divergent genomic features of pathogenic Enterococcus cecorum including a type IC CRISPR-Cas system, a capsule locus, an epa-like locus, and putative host tissue binding proteins.

    PubMed

    Borst, Luke B; Suyemoto, M Mitsu; Scholl, Elizabeth H; Fuller, Fredrick J; Barnes, H John

    2015-01-01

    Enterococcus cecorum (EC) is the dominant enteric commensal of adult chickens and contributes to the gut consortia of many avian and mammalian species. While EC infection is an uncommon zoonosis, like other enterococcal species it can cause life-threatening nosocomial infection in people. In contrast to other enterococci which are considered opportunistic pathogens, emerging pathogenic strains of EC cause outbreaks of musculoskeletal disease in broiler chickens. Typical morbidity and mortality is comparable to other important infectious diseases of poultry. In molecular epidemiologic studies, pathogenic EC strains were found to be genetically clonal. These findings suggested acquisition of specific virulence determinants by pathogenic EC. To identify divergent genomic features and acquired virulence determinants in pathogenic EC, comparative genomic analysis was performed on genomes of 3 pathogenic and 3 commensal strains of EC. Pathogenic isolates had smaller genomes with a higher GC content, and they demonstrated large regions of synteny compared to commensal isolates. A molecular phylogenetic analysis demonstrated sequence divergence in pathogenic EC genomes. At a threshold of 98% identity, 414 predicted proteins were identified that were highly conserved in pathogenic EC but not in commensal EC. Among these, divergent CRISPR-cas defense loci were observed. In commensal EC, the type IIA arrangement typical for enterococci was present; however, pathogenic EC had a type IC locus, which is novel in enterococci but commonly observed in streptococci. Potential mediators of virulence identified in this analysis included a polysaccharide capsular locus similar to that recently described for E. faecium, an epa-like locus, and cell wall associated proteins which may bind host extracellular matrix. This analysis identified specific genomic regions, coding sequences, and predicted proteins which may be related to the divergent evolution and increased virulence of emerging

  11. e-Fungi: a data resource for comparative analysis of fungal genomes

    PubMed Central

    Hedeler, Cornelia; Wong, Han Min; Cornell, Michael J; Alam, Intikhab; Soanes, Darren M; Rattray, Magnus; Hubbard, Simon J; Talbot, Nicholas J; Oliver, Stephen G; Paton, Norman W

    2007-01-01

    Background: The number of sequenced fungal genomes is ever increasing, with about 200 genomes already fully sequenced or in progress. Only a small percentage of those genomes have been comprehensively studied, for example using techniques from functional genomics. Comparative analysis has proven to be a useful strategy for enhancing our understanding of evolutionary biology and of the less well understood genomes. However, the data required for these analyses tend to be distributed across various heterogeneous data sources, making systematic comparative studies a cumbersome task. Furthermore, comparative analyses benefit from close integration of derived data sets that cluster genes or organisms in a way that eases the expression of requests that clarify points of similarity or difference between species. Description: To support systematic comparative analyses of fungal genomes we have developed the e-Fungi database, which integrates a variety of data for more than 30 fungal genomes. Publicly available genome data, functional annotations, and pathway information have been integrated into a single data repository and complemented with results of comparative analyses, such as MCL and OrthoMCL cluster analysis, and predictions of signaling proteins and the sub-cellular localisation of proteins. To access the data, a library of analysis tasks is available through a web interface. The analysis tasks are motivated by recent comparative genomics studies, and aim to support the study of evolutionary biology as well as community efforts for improving the annotation of genomes. Web services for each query are also available, enabling the tasks to be incorporated into workflows. Conclusion: The e-Fungi database provides fungal biologists with a resource for comparative studies of a large range of fungal genomes. Its analysis library supports the comparative study of genome data, functional annotation, and results of large scale analyses over all the genomes stored in the database

  12. Comparative genomics and evolution of the tailed-bacteriophages.

    PubMed

    Casjens, Sherwood R

    2005-08-01

    The number of completely sequenced tailed-bacteriophage genomes that have been published increased to more than 125 last year. The comparison of these genomes has brought their highly mosaic nature into much sharper focus. Furthermore, reports of the complete sequences of about 150 bacterial genomes have shown that the many prophage and parts thereof that reside in these bacterial genomes must comprise a significant fraction of Earth's phage gene pool. These phage and prophage genomes are fertile ground for attempts to deduce the nature of viral evolutionary processes, and such analyses have made it clear that these phage have enjoyed a significant level of horizontal exchange of genetic information throughout their long histories. The strength of these evolutionary deductions rests largely on the extensive knowledge that has accumulated during intensive study into the molecular nature of the life cycles of a few 'model system' phages over the past half century. Recent molecular studies of phages other than these model system phages have made it clear that much remains to be learnt about the variety of lifestyle strategies utilized by the tailed-phage. PMID:16019256

  13. Applications of avian transgenesis.

    PubMed

    Scott, Benjamin B; Velho, Tarciso A; Sim, Shuyin; Lois, Carlos

    2010-01-01

    The ability to introduce foreign DNA into the genome of an organism has proven to be one of the most powerful tools in modern biology. Methods for the manipulation of the animal genome have been developed at an impressive pace for 3 decades, but only in the past 5 years have useful tools for avian transgenesis emerged. The most efficient technique involves the use of replication-deficient lentiviral vectors to deliver foreign DNA into the avian germline. Although lentiviral-mediated transgenesis presents some constraints, progress in this area has garnered interest in both industry and academia for its potential applications in biological research, biotechnology, and agriculture. In this review we evaluate methods for the production of transgenic birds, focusing on the advantages and limitations of lentiviral-mediated transgenesis. We also provide an overview of future applications of this technology. The most exciting of these include disease-resistant transgenic poultry, genetically modified hens that produce therapeutic proteins in their eggs, and transgenic songbirds that serve as a model to study communication disorders. Finally, we discuss technological advances that will be necessary to make avian transgenesis a more versatile tool.

  14. Complete genome sequences and comparative genome analysis of Lactobacillus plantarum strain 5-2 isolated from fermented soybean.

    PubMed

    Liu, Chen-Jian; Wang, Rui; Gong, Fu-Ming; Liu, Xiao-Feng; Zheng, Hua-Jun; Luo, Yi-Yong; Li, Xiao-Ran

    2015-12-01

    Lactobacillus plantarum is an important probiotic and is mostly isolated from fermented foods. We sequenced the genome of L. plantarum strain 5-2, which was derived from fermented soybean isolated from Yunnan province, China. The strain was determined to contain 3114 genes. Fourteen complete insertion sequence (IS) elements were found in the 5-2 chromosome. There were 24 DNA replication proteins and 76 DNA repair proteins in the 5-2 genome. Consistent with the classification of L. plantarum as a facultative heterofermentative lactobacillus, the 5-2 genome encodes key enzymes required for the EMP (Embden-Meyerhof-Parnas) and phosphoketolase (PK) pathways. Several components of the secretion machinery are found in the 5-2 genome, which was compared with L. plantarum ST-III, JDM1 and WCFS1. Most of the specific proteins in the four genomes appeared to be related to their prophage elements.

  15. IMG 4 version of the integrated microbial genomes comparative analysis system.

    PubMed

    Markowitz, Victor M; Chen, I-Min A; Palaniappan, Krishna; Chu, Ken; Szeto, Ernest; Pillay, Manoj; Ratner, Anna; Huang, Jinghua; Woyke, Tanja; Huntemann, Marcel; Anderson, Iain; Billis, Konstantinos; Varghese, Neha; Mavromatis, Konstantinos; Pati, Amrita; Ivanova, Natalia N; Kyrpides, Nikos C

    2014-01-01

    The Integrated Microbial Genomes (IMG) data warehouse integrates genomes from all three domains of life, as well as plasmids, viruses and genome fragments. IMG provides tools for analyzing and reviewing the structural and functional annotations of genomes in a comparative context. IMG's data content and analytical capabilities have increased continuously since its first version released in 2005. Since the last report published in the 2012 NAR Database Issue, IMG's annotation and data integration pipelines have evolved while new tools have been added for recording and analyzing single cell genomes, RNA Seq and biosynthetic cluster data. Different IMG datamarts provide support for the analysis of publicly available genomes (IMG/W: http://img.jgi.doe.gov/w), expert review of genome annotations (IMG/ER: http://img.jgi.doe.gov/er) and teaching and training in the area of microbial genome analysis (IMG/EDU: http://img.jgi.doe.gov/edu). PMID:24165883

  16. IMG 4 version of the integrated microbial genomes comparative analysis system

    SciTech Connect

    Markowitz, Victor M.; Chen, I-Min A.; Palaniappan, Krishna; Chu, Ken; Szeto, Ernest; Pillay, Manoj; Ratner, Anna; Huang, Jinghua; Woyke, Tanja; Huntemann, Marcel; Anderson, Iain; Billis, Konstantinos; Varghese, Neha; Mavromatis, Konstantinos; Pati, Amrita; Ivanova, Natalia N.; Kyrpides, Nikos C.

    2013-10-27

    The Integrated Microbial Genomes (IMG) data warehouse integrates genomes from all three domains of life, as well as plasmids, viruses and genome fragments. IMG provides tools for analyzing and reviewing the structural and functional annotations of genomes in a comparative context. IMG’s data content and analytical capabilities have increased continuously since its first version released in 2005. Since the last report published in the 2012 NAR Database Issue, IMG’s annotation and data integration pipelines have evolved while new tools have been added for recording and analyzing single cell genomes, RNA Seq and biosynthetic cluster data. Finally, different IMG datamarts provide support for the analysis of publicly available genomes (IMG/W: http://img.jgi.doe.gov/w), expert review of genome annotations (IMG/ER: http://img.jgi.doe.gov/er) and teaching and training in the area of microbial genome analysis (IMG/EDU: http://img.jgi.doe.gov/edu).

  17. Natural Product Biosynthetic Diversity and Comparative Genomics of the Cyanobacteria.

    PubMed

    Dittmann, Elke; Gugger, Muriel; Sivonen, Kaarina; Fewer, David P

    2015-10-01

    Cyanobacteria are an ancient lineage of slow-growing photosynthetic bacteria and a prolific source of natural products with intricate chemical structures and potent biological activities. The bulk of these natural products are known from just a handful of genera. Recent efforts have elucidated the mechanisms underpinning the biosynthesis of a diverse array of natural products from cyanobacteria. Many of the biosynthetic mechanisms are unique to cyanobacteria or rarely described from other organisms. Advances in genome sequence technology have precipitated a deluge of genome sequences for cyanobacteria. This makes it possible to link known natural products to biosynthetic gene clusters but also accelerates the discovery of new natural products through genome mining. These studies demonstrate that cyanobacteria encode a huge variety of cryptic gene clusters for the production of natural products, and the known chemical diversity is likely to be just a fraction of the true biosynthetic capabilities of this fascinating and ancient group of organisms.

  18. Comparative genomics of Vibrio cholerae from Haiti, Asia, and Africa.

    PubMed

    Reimer, Aleisha R; Van Domselaar, Gary; Stroika, Steven; Walker, Matthew; Kent, Heather; Tarr, Cheryl; Talkington, Deborah; Rowe, Lori; Olsen-Rasmussen, Melissa; Frace, Michael; Sammons, Scott; Dahourou, Georges Anicet; Boncy, Jacques; Smith, Anthony M; Mabon, Philip; Petkau, Aaron; Graham, Morag; Gilmour, Matthew W; Gerner-Smidt, Peter

    2011-11-01

    Cholera was absent from the island of Hispaniola at least a century before an outbreak that began in Haiti in the fall of 2010. Pulsed-field gel electrophoresis (PFGE) analysis of clinical isolates from the Haiti outbreak and recent global travelers returning to the United States showed indistinguishable PFGE fingerprints. To better explore the genetic ancestry of the Haiti outbreak strain, we acquired 23 whole-genome Vibrio cholerae sequences: 9 isolates obtained in Haiti or the Dominican Republic, 12 PFGE pattern-matched isolates linked to Asia or Africa, and 2 nonmatched outliers from the Western Hemisphere. Phylogenies for whole-genome sequences and core genome single-nucleotide polymorphisms showed that the Haiti outbreak strain is genetically related to strains originating in India and Cameroon. However, because no identical genetic match was found among sequenced contemporary isolates, a definitive genetic origin for the outbreak in Haiti remains speculative.
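
    The core-genome SNP comparison mentioned in this abstract can be illustrated with a minimal sketch. It assumes the isolates are already aligned to equal length and simply counts differing unambiguous bases; the function names and the toy sequences are hypothetical and are not taken from the study, whose actual pipeline is not described here.

```python
from itertools import combinations


def snp_distance(seq_a, seq_b):
    """Count positions where two aligned sequences carry different A/C/G/T bases.

    Positions with gaps or ambiguous calls (e.g. N) in either sequence are skipped.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(
        1
        for x, y in zip(seq_a.upper(), seq_b.upper())
        if x != y and x in "ACGT" and y in "ACGT"
    )


def distance_matrix(aligned):
    """Pairwise SNP distances for a dict of {isolate_name: aligned_sequence}."""
    return {
        (m, n): snp_distance(aligned[m], aligned[n])
        for m, n in combinations(sorted(aligned), 2)
    }


# Hypothetical toy alignment, for illustration only.
toy = {
    "isolate_1": "ACGTACGT",
    "isolate_2": "ACGTACGA",
    "isolate_3": "ACCTANGA",
}
for pair, dist in distance_matrix(toy).items():
    print(pair, dist)
```

    A matrix like this is one possible input to a distance-based tree; a pair of isolates with distance 0 would correspond to the "identical genetic match" that the abstract reports was not found.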

  19. GenColors: annotation and comparative genomics of prokaryotes made easy.

    PubMed

    Romualdi, Alessandro; Felder, Marius; Rose, Dominic; Gausmann, Ulrike; Schilhabel, Markus; Glöckner, Gernot; Platzer, Matthias; Sühnel, Jürgen

    2007-01-01

    GenColors (gencolors.fli-leibniz.de) is a new web-based software/database system aimed at an improved and accelerated annotation of prokaryotic genomes considering information on related genomes and making extensive use of genome comparison. It offers a seamless integration of data from ongoing sequencing projects and annotated genomic sequences obtained from GenBank. A variety of export/import filters manages an effective data flow from sequence assembly and manipulation programs (e.g., GAP4) to GenColors and back as well as to standard GenBank file(s). The genome comparison tools include best bidirectional hits, gene conservation, syntenies, and gene core sets. Precomputed UniProt matches allow annotation and analysis in an effective manner. In addition to these analysis options, base-specific quality data (coverage and confidence) can also be handled if available. The GenColors system can be used both for annotation purposes in ongoing genome projects and as an analysis tool for finished genomes. GenColors comes in two types, as dedicated genome browsers and as the Jena Prokaryotic Genome Viewer (JPGV). Dedicated genome browsers contain genomic information on a set of related genomes and offer a large number of options for genome comparison. The system has been efficiently used in the genomic sequencing of Borrelia garinii and is currently applied to various ongoing genome projects on Borrelia, Legionella, Escherichia, and Pseudomonas genomes. One of these dedicated browsers, the Spirochetes Genome Browser (sgb.fli-leibniz.de) with Borrelia, Leptospira, and Treponema genomes, is freely accessible. The others will be released after finalization of the corresponding genome projects. JPGV (jpgv.fli-leibniz.de) offers information on almost all finished bacterial genomes, as compared to the dedicated browsers with reduced genome comparison functionality, however. As of January 2006, this viewer includes 632 genomic elements (e.g., chromosomes and plasmids) of 293

  20. Comparative population genomics: power and principles for the inference of functionality.

    PubMed

    Lawrie, David S; Petrov, Dmitri A

    2014-04-01

    The availability of sequenced genomes from multiple related organisms allows the detection and localization of functional genomic elements based on the idea that such elements evolve more slowly than neutral sequences. Although such comparative genomics methods have proven useful in discovering functional elements and ascertaining levels of functional constraint in the genome as a whole, here we outline limitations intrinsic to this approach that cannot be overcome by sequencing more species. We argue that it is essential to supplement comparative genomics with ultra-deep sampling of populations from closely related species to enable substantially more powerful genomic scans for functional elements. The convergence of sequencing technology and population genetics theory has made such projects feasible and has exciting implications for functional genomics.

  1. Comparative susceptibility of introduced forest-dwelling mosquitoes in Hawai'i to avian malaria, Plasmodium relictum

    USGS Publications Warehouse

    Lapointe, D.A.; Goff, M.L.; Atkinson, C.T.

    2005-01-01

    To identify potential vectors of avian malaria in Hawaiian native forests, the innate susceptibility of Aedes albopictus, Wyeomyia mitchellii, and Culex quinquefasciatus from 3 geographical sites along an altitudinal gradient was evaluated using local isolates of Plasmodium relictum. Mosquitoes were dissected 5-8 and 9-13 days postinfective blood meal and microscopically examined for oocysts and salivary-gland sporozoites. Sporogony was completed in all 3 species, but prevalence between species varied significantly. Oocysts were detected in 1-2% and sporozoites in 1-7% of Aedes albopictus that fed on infected ducklings. Wyeomyia mitchellii was slightly more susceptible, with 7-19% and 7% infected with oocysts and sporozoites, respectively. In both species, the median oocyst number was 5 or below. This is only the second Wyeomyia species reported to support development of a malarial parasite. Conversely, Culex quinquefasciatus from all 3 sites proved very susceptible. Prevalence of oocysts and sporozoites consistently exceeded 70%, regardless of gametocytemia or origin of the P. relictum isolate. In trials for which a maximum 200 oocysts were recorded, the median number of oocysts ranged from 144 to 200. It was concluded that Culex quinquefasciatus is the primary vector of avian malaria in Hawai'i. © American Society of Parasitologists 2005.

  2. INVESTIGATIONS INTO MOLECULAR PATHWAYS IN THE POST GENOME ERA: CROSS SPECIES COMPARATIVE GENOMICS APPROACH

    EPA Science Inventory


    Genome sequencing efforts in the past decade were aimed at generating draft sequences of many prokaryotic and eukaryotic model organisms. Successful completion of unicellular eukaryotes, worm, fly and human genome have opened up the new field of molecular biology and function...

  3. Genome Information Broker for Viruses (GIB-V): database for comparative analysis of virus genomes

    PubMed Central

    Hirahata, Masaki; Abe, Takashi; Tanaka, Naoto; Kuwana, Yoshikazu; Shigemoto, Yasumasa; Miyazaki, Satoru; Suzuki, Yoshiyuki; Sugawara, Hideaki

    2007-01-01

    Genome Information Broker for Viruses (GIB-V) is a comprehensive virus genome/segment database. We extracted 18 418 complete virus genomes/segments from the International Nucleotide Sequence Database Collaboration (INSDC, ) by DNA Data Bank of Japan (DDBJ), EMBL and GenBank and stored them in our system. The list of registered viruses is arranged hierarchically according to taxonomy. Keyword searches can be performed for genome/segment data or biological features of any virus stored in GIB-V. GIB-V is equipped with a BLAST search function, and search results are displayed graphically or in list form. Moreover, the BLAST results can be used online with the ClustalW feature of the DDBJ. All available virus genome/segment data can be collected by the GIB-V download function. GIB-V can be accessed at no charge at . PMID:17158166

  4. Comparative genomics reveals conserved positioning of essential genomic clusters in highly rearranged Thermococcales chromosomes

    PubMed Central

    Cossu, Matteo; Da Cunha, Violette; Toffano-Nioche, Claire; Forterre, Patrick; Oberto, Jacques

    2015-01-01

    The genomes of the 21 completely sequenced Thermococcales display a characteristic high level of rearrangements. As a result, the prediction of their origin and termination of replication on the sole basis of chromosomal DNA composition or skew is inoperative. Using a different approach based on biologically relevant sequences, we were able to determine oriC position in all 21 genomes. The position of dif, the site where chromosome dimers are resolved before DNA segregation, could be predicted in 19 genomes. Computation of the core genome uncovered a number of essential gene clusters with a remarkably stable chromosomal position across species, in sharp contrast with the scrambled nature of their genomes. The active chromosomal reorganization of numerous genes acquired by horizontal transfer, mainly from mobile elements, could explain this phenomenon. PMID:26166067
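
    For context, the composition-skew method that this abstract describes as inoperative for Thermococcales can be sketched as follows. This is a minimal illustration of conventional cumulative GC skew, not the authors' sequence-based approach; the function names, window size, and toy sequence are assumptions made for the example.

```python
def cumulative_gc_skew(seq, window=1000):
    """Running sum of per-window GC skew, (G - C) / (G + C)."""
    seq = seq.upper()
    totals = []
    running = 0.0
    for start in range(0, len(seq), window):
        chunk = seq[start:start + window]
        g = chunk.count("G")
        c = chunk.count("C")
        # Windows without any G or C contribute no skew.
        skew = (g - c) / (g + c) if (g + c) else 0.0
        running += skew
        totals.append(running)
    return totals


def skew_extremes(seq, window=1000):
    """Window indices of the minimum and maximum of the cumulative skew."""
    totals = cumulative_gc_skew(seq, window)
    lowest = min(range(len(totals)), key=totals.__getitem__)
    highest = max(range(len(totals)), key=totals.__getitem__)
    return lowest, highest


# Artificial toy sequence: a C-rich half followed by a G-rich half.
toy = "C" * 3000 + "G" * 3000
print(skew_extremes(toy, window=1000))  # (2, 5)
```

    In many bacterial chromosomes the minimum and maximum of this curve are read as candidate origin and terminus regions. In a heavily rearranged chromosome the curve may show no single clear extremum, which is consistent with the abstract's point that skew alone was not usable for these genomes.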

  5. Comparative genomics of mutualistic viruses of Glyptapanteles parasitic wasps

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Polydnaviruses, a family of double-stranded DNA viruses with segmented genomes, have evolved as obligate endosymbionts of endoparasitoid wasps, and are some of the few viruses known to share mutualistic relationships with eukaryotic hosts. Virus particles are replication deficient and are produced o...

  6. Genomic Comparative Study of Bovine Mastitis Escherichia coli.

    PubMed

    Kempf, Florent; Slugocki, Cindy; Blum, Shlomo E; Leitner, Gabriel; Germon, Pierre

    2016-01-01

    Escherichia coli, one of the main causative agents of bovine mastitis, is responsible for significant losses on dairy farms. In order to better understand the pathogenicity of E. coli mastitis, an accurate characterization of E. coli strains isolated from mastitis cases is required. By using phylogenetic analyses and whole genome comparison of 5 currently available mastitis E. coli genome sequences, we searched for genotypic traits specific for mastitis isolates. Our data confirm that there is a bias in the distribution of mastitis isolates in the different phylogenetic groups of the E. coli species, with the majority of strains belonging to phylogenetic groups A and B1. An interesting feature is that clustering of strains based on their accessory genome is very similar to that obtained using the core genome. This finding illustrates the fact that phenotypic properties of strains from different phylogroups are likely to be different. As a consequence, it is possible that different strategies could be used by mastitis isolates of different phylogroups to trigger mastitis. Our results indicate that mastitis E. coli isolates analyzed in this study carry very few of the virulence genes described in other pathogenic E. coli strains. A more detailed analysis of the presence/absence of genes involved in LPS synthesis, iron acquisition and type 6 secretion systems did not uncover specific properties of mastitis isolates. Altogether, these results indicate that mastitis E. coli isolates are rather characterized by a lack of bona fide currently described virulence genes.

  7. Comparative Analysis of Genome Diversity in Bullmastiff Dogs.

    PubMed

    Mortlock, Sally-Anne; Khatkar, Mehar S; Williamson, Peter

    2016-01-01

    Management and preservation of genomic diversity in dog breeds is a major objective for maintaining health. The present study was undertaken to characterise genomic diversity in Bullmastiff dogs using both genealogical and molecular analysis. Genealogical analysis of diversity was conducted using a database consisting of 16,378 Bullmastiff pedigrees from 1980 to 2013. Additionally, a total of 188 Bullmastiff dogs were genotyped using the 170,000 SNP Illumina CanineHD Beadchip. Genealogical parameters revealed a mean inbreeding coefficient of 0.047; 142 total founders (f); an effective number of founders (fe) of 79; an effective number of ancestors (fa) of 62; and an effective population size of the reference population of 41. Genetic diversity and the degree of genome-wide homogeneity within the breed were also investigated using molecular data. Multiple-locus heterozygosity (MLH) was equal to 0.206; runs of homozygosity (ROH), as a proportion of the genome, averaged 16.44%; effective population size was 29.1, with an average inbreeding coefficient of 0.035, all estimated using SNP data. Fine-scale population structure was analysed using NETVIEW, a population analysis pipeline. Visualisation of the high definition network captured relationships among individuals within and between subpopulations. Effects of unequal founder use, and ancestral inbreeding and selection, were evident. While current levels of Bullmastiff heterozygosity, inbreeding and homozygosity are not unusual, a relatively small effective population size indicates that a breeding strategy to reduce the inbreeding rate may be beneficial. PMID:26824579
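
    Two of the molecular summary statistics named in this abstract, multiple-locus heterozygosity and the proportion of the genome in runs of homozygosity, can be sketched in a few lines. The definitions below are the common textbook ones and are assumed rather than taken from the paper; the function names and the toy genotypes are hypothetical.

```python
def multilocus_heterozygosity(genotypes):
    """Fraction of successfully typed loci that are heterozygous.

    genotypes: list of (allele_a, allele_b) tuples; None marks a failed call.
    """
    typed = [g for g in genotypes if g is not None]
    if not typed:
        raise ValueError("no typed loci")
    het = sum(1 for a, b in typed if a != b)
    return het / len(typed)


def roh_fraction(roh_segments, genome_length):
    """Proportion of the genome covered by runs of homozygosity.

    roh_segments: list of (start, end) positions in base pairs,
    assumed not to overlap one another.
    """
    covered = sum(end - start for start, end in roh_segments)
    return covered / genome_length


# Hypothetical individual: five loci, one of which failed to genotype.
dog = [("A", "A"), ("A", "G"), None, ("C", "C"), ("T", "C")]
print(multilocus_heterozygosity(dog))               # 2 of 4 typed loci -> 0.5
print(roh_fraction([(0, 150), (400, 450)], 1000))   # 200 of 1000 bp -> 0.2
```

    Averaging each statistic over all genotyped individuals would give breed-level figures of the kind the abstract reports, although the study's own marker filtering and ROH-calling thresholds are not stated here.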

  8. Cloud Computing for Comparative Genomics with Windows Azure Platform

    PubMed Central

    Kim, Insik; Jung, Jae-Yoon; DeLuca, Todd F.; Nelson, Tristan H.; Wall, Dennis P.

    2012-01-01

    Cloud computing services have emerged as a cost-effective alternative for cluster systems as the number of genomes and required computation power to analyze them increased in recent years. Here we introduce the Microsoft Azure platform with detailed execution steps and a cost comparison with Amazon Web Services. PMID:23032609

  9. Comparative Analysis of Alu Repeats in Primate Genomes

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Background: Alu repeats are SINEs (Short interspersed repetitive elements) which enjoy a successful application in genome evolution, population biology, phylogenetics and forensics. Human Alu consensus sequences were widely used as surrogates in nonhuman primate studies with an assumption that all p...

  10. Evaluating Theobroma grandiflorum for comparative genomic studies with Theobroma cacao

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The seeds of Theobroma cacao (cacao) are the source of cocoa, the raw material for the multi-billion dollar chocolate industry. Cacao’s two most important traits are its unique seed storage triglyceride (cocoa butter) and the flavor of its fermented beans (chocolate). The genome of T. cacao is bei...

  11. Comparative functional genomics of plant pathogenic Fusarium species

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Fusarium species are among the most economically important group of plant pathogenic fungi. Comparison of the four currently available Fusarium genome sequences allows an unsurpassed and unprecedented ability to predict genes, determine synteny and define regulatory sequences for genes in phytopatho...

  12. Comparative Plant Genomics Resources at PlantGDB

    PubMed Central

    Dong, Qunfeng; Lawrence, Carolyn J.; Schlueter, Shannon D.; Wilkerson, Matthew D.; Kurtz, Stefan; Lushbough, Carol; Brendel, Volker

    2005-01-01

    PlantGDB (http://www.plantgdb.org/) is a database of plant molecular sequences. Expressed sequence tag (EST) sequences are assembled into contigs that represent tentative unique genes. EST contigs are functionally annotated with information derived from known protein sequences that are highly similar to the putative translation products. Tentative Gene Ontology terms are assigned to match those of the similar sequences identified. Genome survey sequences are assembled similarly. The resulting genome survey sequence contigs are matched to ESTs and conserved protein homologs to identify putative full-length open reading frame-containing genes, which are subsequently provisionally classified according to established gene family designations. For Arabidopsis (Arabidopsis thaliana) and rice (Oryza sativa), the exon-intron boundaries for gene structures are annotated by spliced alignment of ESTs and full-length cDNAs to their respective complete genome sequences. Unique genome browsers have been developed to present all available EST and cDNA evidence for current transcript models (for Arabidopsis, see the AtGDB site at http://www.plantgdb.org/AtGDB/; for rice, see the OsGDB site at http://www.plantgdb.org/OsGDB/). In addition, a number of bioinformatic tools have been integrated at PlantGDB that enable researchers to carry out sequence analyses on-site using both their own data and data residing within the database. PMID:16219921

  13. Genomic Comparative Study of Bovine Mastitis Escherichia coli

    PubMed Central

    Kempf, Florent; Slugocki, Cindy; Blum, Shlomo E.; Leitner, Gabriel; Germon, Pierre

    2016-01-01

    Escherichia coli, one of the main causative agents of bovine mastitis, is responsible for significant losses on dairy farms. In order to better understand the pathogenicity of E. coli mastitis, an accurate characterization of E. coli strains isolated from mastitis cases is required. By using phylogenetic analyses and whole genome comparison of 5 currently available mastitis E. coli genome sequences, we searched for genotypic traits specific for mastitis isolates. Our data confirm that there is a bias in the distribution of mastitis isolates in the different phylogenetic groups of the E. coli species, with the majority of strains belonging to phylogenetic groups A and B1. An interesting feature is that clustering of strains based on their accessory genome is very similar to that obtained using the core genome. This finding illustrates the fact that phenotypic properties of strains from different phylogroups are likely to be different. As a consequence, it is possible that different strategies could be used by mastitis isolates of different phylogroups to trigger mastitis. Our results indicate that mastitis E. coli isolates analyzed in this study carry very few of the virulence genes described in other pathogenic E. coli strains. A more detailed analysis of the presence/absence of genes involved in LPS synthesis, iron acquisition and type 6 secretion systems did not uncover specific properties of mastitis isolates. Altogether, these results indicate that mastitis E. coli isolates are rather characterized by a lack of bona fide currently described virulence genes. PMID:26809117

  14. Comparative Analysis of Genome Diversity in Bullmastiff Dogs

    PubMed Central

    Mortlock, Sally-Anne; Khatkar, Mehar S.; Williamson, Peter

    2016-01-01

    Management and preservation of genomic diversity in dog breeds is a major objective for maintaining health. The present study was undertaken to characterise genomic diversity in Bullmastiff dogs using both genealogical and molecular analysis. Genealogical analysis of diversity was conducted using a database consisting of 16,378 Bullmastiff pedigrees from 1980 to 2013. Additionally, a total of 188 Bullmastiff dogs were genotyped using the 170,000 SNP Illumina CanineHD Beadchip. Genealogical parameters revealed a mean inbreeding coefficient of 0.047; 142 total founders (f); an effective number of founders (fe) of 79; an effective number of ancestors (fa) of 62; and an effective population size of the reference population of 41. Genetic diversity and the degree of genome-wide homogeneity within the breed were also investigated using molecular data. Multiple-locus heterozygosity (MLH) was equal to 0.206; runs of homozygosity (ROH), as a proportion of the genome, averaged 16.44%; effective population size was 29.1, with an average inbreeding coefficient of 0.035, all estimated using SNP data. Fine-scale population structure was analysed using NETVIEW, a population analysis pipeline. Visualisation of the high definition network captured relationships among individuals within and between subpopulations. Effects of unequal founder use, and ancestral inbreeding and selection, were evident. While current levels of Bullmastiff heterozygosity, inbreeding and homozygosity are not unusual, a relatively small effective population size indicates that a breeding strategy to reduce the inbreeding rate may be beneficial. PMID:26824579

  15. Comparative genomic analysis of novel Acinetobacter symbionts: A combined systems biology and genomics approach

    PubMed Central

    Gupta, Vipin; Haider, Shazia; Sood, Utkarsh; Gilbert, Jack A.; Ramjee, Meenakshi; Forbes, Ken; Singh, Yogendra; Lopes, Bruno S.; Lal, Rup

    2016-01-01

    The increasing trend of antibiotic resistance in Acinetobacter drastically limits the range of therapeutic agents required to treat multidrug resistant (MDR) infections. This study focused on analysis of novel Acinetobacter strains using a genomics and systems biology approach. Here we used a network theory method for pathogenic and non-pathogenic Acinetobacter spp. to identify the key regulatory proteins (hubs) in each strain. We identified nine key regulatory proteins, guaA, guaB, rpsB, rpsI, rpsL, rpsE, rpsC, rplM and trmD, which have functional roles as hubs in a hierarchical scale-free fractal protein-protein interaction network. Two key hubs (guaA and guaB) were important for insect-associated strains, and comparative analysis identified guaA as more important than guaB due to its role in effective module regulation. rpsI played a significant role in all the novel strains, while rplM was unique to sheep-associated strains. rpsM, rpsB and rpsI were involved in the regulation of overall network topology across all Acinetobacter strains analyzed in this study. Future analysis will investigate whether these hubs are useful as drug targets for treating Acinetobacter infections. PMID:27378055
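
    The idea of hubs in a protein-protein interaction network can be illustrated with a minimal sketch that ranks nodes by degree. Degree ranking is an assumed stand-in: the abstract does not state which network measure the study used, and the function names and toy edge list (with placeholder protein labels p1 to p5) are hypothetical.

```python
from collections import defaultdict


def degree_table(edges):
    """Map each node to its number of distinct interaction partners."""
    neighbours = defaultdict(set)
    for a, b in edges:
        if a == b:
            continue  # ignore self-interactions
        neighbours[a].add(b)
        neighbours[b].add(a)
    return {node: len(partners) for node, partners in neighbours.items()}


def top_hubs(edges, k=2):
    """Return the k highest-degree nodes; ties are broken alphabetically."""
    degrees = degree_table(edges)
    return sorted(degrees, key=lambda n: (-degrees[n], n))[:k]


# Placeholder interactions, for illustration only.
toy_edges = [("p1", "p2"), ("p1", "p3"), ("p1", "p4"), ("p2", "p3"), ("p4", "p5")]
print(degree_table(toy_edges))
print(top_hubs(toy_edges, k=2))
```

    In a scale-free network a small number of nodes hold most of the connections, so a simple ranking like this tends to surface the same few candidates that more elaborate centrality measures would.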

  16. Comparative Genomics and Molecular Dynamics of DNA Repeats in Eukaryotes

    PubMed Central

    Richard, Guy-Franck; Kerrest, Alix; Dujon, Bernard

    2008-01-01

    Summary: Repeated elements can be widely abundant in eukaryotic genomes, composing more than 50% of the human genome, for example. It is possible to classify repeated sequences into two large families, “tandem repeats” and “dispersed repeats.” Each of these two families can be itself divided into subfamilies. Dispersed repeats contain transposons, tRNA genes, and gene paralogues, whereas tandem repeats contain gene tandems, ribosomal DNA repeat arrays, and satellite DNA, itself subdivided into satellites, minisatellites, and microsatellites. Remarkably, the molecular mechanisms that create and propagate dispersed and tandem repeats are specific to each class and usually do not overlap. In the present review, we have chosen in the first section to describe the nature and distribution of dispersed and tandem repeats in eukaryotic genomes in the light of complete (or nearly complete) available genome sequences. In the second part, we focus on the molecular mechanisms responsible for the fast evolution of two specific classes of tandem repeats: minisatellites and microsatellites. Given that a growing number of human neurological disorders involve the expansion of a particular class of microsatellites, called trinucleotide repeats, a large part of the recent experimental work on microsatellites has focused on these particular repeats, and thus we also review the current knowledge in this area. Finally, we propose a unified definition for mini- and microsatellites that takes into account their biological properties and try to point out new directions that should be explored in a near future on our road to understanding the genetics of repeated sequences. PMID:19052325

  17. Comparative Genomics of the Staphylococcus intermedius Group of Animal Pathogens

    PubMed Central

    Ben Zakour, Nouri L.; Beatson, Scott A.; van den Broek, Adri H. M.; Thoday, Keith L.; Fitzgerald, J. Ross

    2012-01-01

    The Staphylococcus intermedius group consists of three closely related coagulase-positive bacterial species including S. intermedius, Staphylococcus pseudintermedius, and Staphylococcus delphini. S. pseudintermedius is a major skin pathogen of dogs, which occasionally causes severe zoonotic infections of humans. S. delphini has been isolated from an array of different animals including horses, mink, and pigeons, whereas S. intermedius has been isolated only from pigeons to date. Here we provide a detailed analysis of the S. pseudintermedius whole genome sequence in comparison to high quality draft S. intermedius and S. delphini genomes, and to other sequenced staphylococcal species. The core genome of the SIG was highly conserved with average nucleotide identity (ANI) between the three species of 93.61%, which is very close to the threshold of species delineation (95% ANI), highlighting the close-relatedness of the SIG species. However, considerable variation was identified in the content of mobile genetic elements, cell wall-associated proteins, and iron and sugar transporters, reflecting the distinct ecological niches inhabited. Of note, S. pseudintermedius ED99 contained a clustered regularly interspaced short palindromic repeat locus of the Nmeni subtype and S. intermedius contained both Nmeni and Mtube subtypes. In contrast to S. intermedius and S. delphini and most other staphylococci examined to date, S. pseudintermedius contained at least nine predicted reverse transcriptase Group II introns. Furthermore, S. pseudintermedius ED99 encoded several transposons which were largely responsible for its multi-resistant phenotype. Overall, the study highlights extensive differences in accessory genome content between closely related staphylococcal species inhabiting distinct host niches, providing new avenues for research into pathogenesis and bacterial host-adaptation. PMID:22919635

  18. Jam packed genomes--a preliminary, comparative analysis of nucleomorphs.

    PubMed

    Gilson, Paul R; McFadden, Geoffrey I

    2002-05-01

    There are two ways eukaryotic cells can permanently acquire chloroplasts. They can take up a cyanobacterium and turn it into a chloroplast or they can engulf an alga that already has a chloroplast. The second method is far more common and there are at least seven major groups of protists that have obtained their chloroplasts this way. In most cases little remains of the engulfed alga apart from its chloroplast, but in two groups, the cryptomonads and chlorarachniophytes, a small remnant nucleus of the engulfed alga is still present. These tiny nuclei, called nucleomorphs, are the smallest and most compact eukaryotic genomes known and recently the nucleomorph of the cryptomonad alga Guillardia theta was completely sequenced (551 kilobases). The nucleomorph of the chlorarachniophyte Bigelowiella natans (380 kilobases) is also being sequenced and is about half complete. We discuss some of the similarities and differences that are emerging between these two nucleomorph genomes. Both genomes contain just three chromosomes that encode mainly housekeeping genes and a few proteins for chloroplast functions. The bulk of nucleomorph gene coding capacity, therefore, appears to be devoted to self perpetuation and creating gene and protein expression machineries to make a small number of essential chloroplast proteins. We discuss reasons why both nucleomorphs are extraordinarily compact and why their gene sequences are evolving rapidly.

  19. Mitochondrial genome sequences and comparative genomics of Phytophthora ramorum and P. sojae

    SciTech Connect

    Martin, Frank N.; Douda, Bensasson; Tyler, Brett M.; Boore, Jeffrey L.

    2007-01-01

    The complete sequences of the mitochondrial genomes of the oomycetes of Phytophthora ramorum and P. sojae were determined during the course of their complete nuclear genome sequencing (Tyler, et al. 2006). Both are circular, with sizes of 39,314 bp for P. ramorum and 42,975 bp for P. sojae. Each contains a total of 37 identifiable protein-encoding genes, 25 or 26 tRNAs (P. sojae and P. ramorum, respectively) specifying 19 amino acids, and a variable number of ORFs (7 for P. ramorum and 12 for P. sojae) which are potentially additional functional genes. Non-coding regions comprise approximately 11.5 percent and 18.4 percent of the genomes of P. ramorum and P. sojae, respectively. Relative to P. sojae, there is an inverted repeat of 1,150 bp in P. ramorum that includes an unassigned unique ORF, a tRNA gene, and adjacent non-coding sequences, but otherwise the gene order in both species is identical. Comparisons of these genomes with published sequences of the P. infestans mitochondrial genome reveals a number of similarities, but the gene order in P. infestans differs in two adjacent locations due to inversions. Sequence alignments of the three genomes indicated sequence conservation ranging from 75 to 85 percent and that specific regions were more variable than others.

  20. SynTView — an interactive multi-view genome browser for next-generation comparative microorganism genomics

    PubMed Central

    2013-01-01

    Background Dynamic visualisation interfaces are required to explore the multiple microbial genome data now available, especially those obtained by high-throughput sequencing — a.k.a. “Next-Generation Sequencing” (NGS) — technologies; they would also be useful for “standard” annotated genomes whose chromosome organizations may be compared. Although various software systems are available, few offer an optimal combination of feature-rich capabilities, non-static user interfaces and multi-genome data handling. Results We developed SynTView, a comparative and interactive viewer for microbial genomes, designed to run as either a web-based tool (Flash technology) or a desktop application (AIR environment). The basis of the program is a generic genome browser with sub-maps holding information about genomic objects (annotations). The software is characterised by the presentation of syntenic organisations of microbial genomes and the visualisation of polymorphism data (typically Single Nucleotide Polymorphisms — SNPs) along these genomes; these features are accessible to the user in an integrated way. A variety of specialised views are available and are all dynamically inter-connected (including linear and circular multi-genome representations, dot plots, phylogenetic profiles, SNP density maps, and more). SynTView is not linked to any particular database, allowing the user to plug his own data into the system seamlessly, and use external web services for added functionalities. SynTView has now been used in several genome sequencing projects to help biologists make sense out of huge data sets. Conclusions The most important assets of SynTView are: (i) the interactivity due to the Flash technology; (ii) the capabilities for dynamic interaction between many specialised views; and (iii) the flexibility allowing various user data sets to be integrated. It can thus be used to investigate massive amounts of information efficiently at the chromosome level. This

  1. Comparative Genomics Analysis of Streptomyces Species Reveals Their Adaptation to the Marine Environment and Their Diversity at the Genomic Level

    PubMed Central

    Tian, Xinpeng; Zhang, Zhewen; Yang, Tingting; Chen, Meili; Li, Jie; Chen, Fei; Yang, Jin; Li, Wenjie; Zhang, Bing; Zhang, Zhang; Wu, Jiayan; Zhang, Changsheng; Long, Lijuan; Xiao, Jingfa

    2016-01-01

    Over 200 genomes of streptomycete strains that were isolated from various environments are available from the NCBI. However, little is known about the characteristics that are linked to marine adaptation in marine-derived streptomycetes. The particularity and complexity of the marine environment suggest that marine streptomycetes are genetically diverse. Here, we sequenced nine strains from the Streptomyces genus that were isolated from different longitudes, latitudes, and depths of the South China Sea. Then we compared these strains to 22 NCBI downloaded streptomycete strains. Thirty-one streptomycete strains are clearly grouped into a marine-derived subgroup and multiple source subgroup-based phylogenetic tree. The phylogenetic analyses have revealed the dynamic process underlying streptomycete genome evolution, and lateral gene transfer is an important driving force during the process. Pan-genomics analyses have revealed that streptomycetes have an open pan-genome, which reflects the diversity of these streptomycetes and guarantees the species a quick and economical response to diverse environments. Functional and comparative genomics analyses indicate that the marine-derived streptomycetes subgroup possesses some common characteristics of marine adaptation. Our findings have expanded our knowledge of how ocean isolates of streptomycete strains adapt to marine environments. The availability of streptomycete genomes from the South China Sea will be beneficial for further analysis on marine streptomycetes and will enrich the South China Sea’s genetic data sources. PMID:27446038

  2. Comparative Genomics Analysis of Streptomyces Species Reveals Their Adaptation to the Marine Environment and Their Diversity at the Genomic Level.

    PubMed

    Tian, Xinpeng; Zhang, Zhewen; Yang, Tingting; Chen, Meili; Li, Jie; Chen, Fei; Yang, Jin; Li, Wenjie; Zhang, Bing; Zhang, Zhang; Wu, Jiayan; Zhang, Changsheng; Long, Lijuan; Xiao, Jingfa

    2016-01-01

    Over 200 genomes of streptomycete strains that were isolated from various environments are available from the NCBI. However, little is known about the characteristics that are linked to marine adaptation in marine-derived streptomycetes. The particularity and complexity of the marine environment suggest that marine streptomycetes are genetically diverse. Here, we sequenced nine strains from the Streptomyces genus that were isolated from different longitudes, latitudes, and depths of the South China Sea. Then we compared these strains to 22 NCBI downloaded streptomycete strains. Thirty-one streptomycete strains are clearly grouped into a marine-derived subgroup and multiple source subgroup-based phylogenetic tree. The phylogenetic analyses have revealed the dynamic process underlying streptomycete genome evolution, and lateral gene transfer is an important driving force during the process. Pan-genomics analyses have revealed that streptomycetes have an open pan-genome, which reflects the diversity of these streptomycetes and guarantees the species a quick and economical response to diverse environments. Functional and comparative genomics analyses indicate that the marine-derived streptomycetes subgroup possesses some common characteristics of marine adaptation. Our findings have expanded our knowledge of how ocean isolates of streptomycete strains adapt to marine environments. The availability of streptomycete genomes from the South China Sea will be beneficial for further analysis on marine streptomycetes and will enrich the South China Sea's genetic data sources. PMID:27446038

  4. The Integrated Microbial Genomes (IMG) System: An Expanding Comparative Analysis Resource

    SciTech Connect

    Markowitz, Victor M.; Chen, I-Min A.; Palaniappan, Krishna; Chu, Ken; Szeto, Ernest; Grechkin, Yuri; Ratner, Anna; Anderson, Iain; Lykidis, Athanasios; Mavromatis, Konstantinos; Ivanova, Natalia N.; Kyrpides, Nikos C.

    2009-09-13

    The integrated microbial genomes (IMG) system serves as a community resource for comparative analysis of publicly available genomes in a comprehensive integrated context. IMG contains both draft and complete microbial genomes integrated with other publicly available genomes from all three domains of life, together with a large number of plasmids and viruses. IMG provides tools and viewers for analyzing and reviewing the annotations of genes and genomes in a comparative context. Since its first release in 2005, IMG's data content and analytical capabilities have been constantly expanded through regular releases. Several companion IMG systems have been set up in order to serve domain specific needs, such as expert review of genome annotations. IMG is available at .

  5. The Aspergillus Genome Database (AspGD): recent developments in comprehensive multispecies curation, comparative genomics and community resources.

    PubMed

    Arnaud, Martha B; Cerqueira, Gustavo C; Inglis, Diane O; Skrzypek, Marek S; Binkley, Jonathan; Chibucos, Marcus C; Crabtree, Jonathan; Howarth, Clinton; Orvis, Joshua; Shah, Prachi; Wymore, Farrell; Binkley, Gail; Miyasato, Stuart R; Simison, Matt; Sherlock, Gavin; Wortman, Jennifer R

    2012-01-01

    The Aspergillus Genome Database (AspGD; http://www.aspgd.org) is a freely available, web-based resource for researchers studying fungi of the genus Aspergillus, which includes organisms of clinical, agricultural and industrial importance. AspGD curators have now completed comprehensive review of the entire published literature about Aspergillus nidulans and Aspergillus fumigatus, and this annotation is provided with streamlined, ortholog-based navigation of the multispecies information. AspGD facilitates comparative genomics by providing a full-featured genomics viewer, as well as matched and standardized sets of genomic information for the sequenced aspergilli. AspGD also provides resources to foster interaction and dissemination of community information and resources. We welcome and encourage feedback at aspergillus-curator@lists.stanford.edu.

  6. Five Complete Chloroplast Genome Sequences from Diospyros: Genome Organization and Comparative Analysis

    PubMed Central

    Hu, Jingjing; Liang, Yuqin; Liang, Jinjun; Wuyun, Tana; Tan, Xiaofeng

    2016-01-01

    Diospyros is the largest genus in Ebenaceae, comprising more than 500 species with remarkable economic value, especially Diospyros kaki Thunb., which has traditionally been an important food resource in China, Korea, and Japan. Complete chloroplast (cp) genomes from D. kaki, D. lotus L., D. oleifera Cheng., D. glaucifolia Metc., and Diospyros ‘Jinzaoshi’ were sequenced using Illumina sequencing technology. This is the first cp genome reported in Ebenaceae. The cp genome sequences of Diospyros ranged from 157,300 to 157,784 bp in length, presenting a typical quadripartite structure with two inverted repeats each separated by one large and one small single-copy region. For each cp genome, 134 genes were annotated, including 80 protein-coding, 31 tRNA, and 4 rRNA unique genes. In all, 179 repeats and 283 single sequence repeats were identified. Four hypervariable regions, namely, intergenic region of trnQ_rps16, trnV_ndhC, and psbD_trnT, and intron of ndhA, were identified in the Diospyros genomes. Phylogenetic analyses based on the whole cp genome, protein-coding, and intergenic and intron sequences indicated that D. oleifera is closely related to D. kaki and could be used as a model plant for future research on D. kaki; to our knowledge, this is proposed for the first time. Further, these analyses together with two large deletions (301 and 140 bp) in the cp genome of D. ‘Jinzaoshi’, support its placement as a new species in Diospyros. Both maximum parsimony and likelihood analyses for 19 taxa indicated the basal position of Ericales in asterids and suggested that Ebenaceae is monophyletic in Ericales. PMID:27442423

  7. Five Complete Chloroplast Genome Sequences from Diospyros: Genome Organization and Comparative Analysis.

    PubMed

    Fu, Jianmin; Liu, Huimin; Hu, Jingjing; Liang, Yuqin; Liang, Jinjun; Wuyun, Tana; Tan, Xiaofeng

    2016-01-01

    Diospyros is the largest genus in Ebenaceae, comprising more than 500 species with remarkable economic value, especially Diospyros kaki Thunb., which has traditionally been an important food resource in China, Korea, and Japan. Complete chloroplast (cp) genomes from D. kaki, D. lotus L., D. oleifera Cheng., D. glaucifolia Metc., and Diospyros 'Jinzaoshi' were sequenced using Illumina sequencing technology. This is the first cp genome reported in Ebenaceae. The cp genome sequences of Diospyros ranged from 157,300 to 157,784 bp in length, presenting a typical quadripartite structure with two inverted repeats each separated by one large and one small single-copy region. For each cp genome, 134 genes were annotated, including 80 protein-coding, 31 tRNA, and 4 rRNA unique genes. In all, 179 repeats and 283 single sequence repeats were identified. Four hypervariable regions, namely, intergenic region of trnQ_rps16, trnV_ndhC, and psbD_trnT, and intron of ndhA, were identified in the Diospyros genomes. Phylogenetic analyses based on the whole cp genome, protein-coding, and intergenic and intron sequences indicated that D. oleifera is closely related to D. kaki and could be used as a model plant for future research on D. kaki; to our knowledge, this is proposed for the first time. Further, these analyses together with two large deletions (301 and 140 bp) in the cp genome of D. 'Jinzaoshi', support its placement as a new species in Diospyros. Both maximum parsimony and likelihood analyses for 19 taxa indicated the basal position of Ericales in asterids and suggested that Ebenaceae is monophyletic in Ericales. PMID:27442423

  9. Complete Genome Sequence of Borrelia afzelii K78 and Comparative Genome Analysis

    PubMed Central

    Schüler, Wolfgang; Bunikis, Ignas; Weber-Lehman, Jacqueline; Comstedt, Pär; Kutschan-Bunikis, Sabrina; Stanek, Gerold; Huber, Jutta; Meinke, Andreas; Bergström, Sven; Lundberg, Urban

    2015-01-01

    The main Borrelia species causing Lyme borreliosis in Europe and Asia are Borrelia afzelii, B. garinii, B. burgdorferi and B. bavariensis. This is in contrast to the United States, where infections are exclusively caused by B. burgdorferi. To date, the genome sequences of four B. afzelii strains are available, of which only two include the numerous plasmids. In order to further assess the genetic diversity of B. afzelii, the most common species in Europe, responsible for the large variety of clinical manifestations of Lyme borreliosis, we have determined the full genome sequence of the B. afzelii strain K78, a clinical isolate from Austria. The K78 genome contains a linear chromosome (905,949 bp) and 13 plasmids (8 linear and 5 circular) together presenting 1,309 open reading frames of which 496 are located on plasmids. With the exception of lp28-8, all linear replicons in their full length including their telomeres have been sequenced. The comparison with the genomes of the four other B. afzelii strains, ACA-1, PKo, HLJ01 and Tom3107, as well as that of B. burgdorferi strain B31, confirmed a high degree of conservation within the linear chromosome of B. afzelii, whereas plasmid-encoded genes showed a much larger diversity. Since some plasmids present in B. burgdorferi are missing in the B. afzelii genomes, the corresponding virulence factors of B. burgdorferi are found in B. afzelii on other unrelated plasmids. In addition, we have identified a species-specific region in the circular plasmid, cp26, which could be used for species determination. Different non-coding RNAs have been located on the B. afzelii K78 genome, which have not previously been annotated in any of the published Borrelia genomes. PMID:25798594

  10. Pan-vertebrate comparative genomics unmasks retrovirus macroevolution.

    PubMed

    Hayward, Alexander; Cornwallis, Charlie K; Jern, Patric

    2015-01-13

    Although extensive research has demonstrated host-retrovirus microevolutionary dynamics, it has been difficult to gain a deeper understanding of the macroevolutionary patterns of host-retrovirus interactions. Here we use recent technological advances to infer broad patterns in retroviral diversity, evolution, and host-virus relationships by using a large-scale phylogenomic approach using endogenous retroviruses (ERVs). Retroviruses insert a proviral DNA copy into the host cell genome to produce new viruses. ERVs are provirus insertions in germline cells that are inherited down the host lineage and consequently present a record of past host-viral associations. By mining ERVs from 65 host genomes sampled across vertebrate diversity, we uncover a great diversity of ERVs, indicating that retroviral sequences are much more prevalent and widespread across vertebrates than previously appreciated. The majority of ERV clades that we recover do not contain known retroviruses, implying either that retroviral lineages are highly transient over evolutionary time or that a considerable number of retroviruses remain to be identified. By characterizing the distribution of ERVs, we show that no major vertebrate lineage has escaped retroviral activity and that retroviruses are extreme host generalists, having an unprecedented ability for rampant host switching among distantly related vertebrates. In addition, we examine whether the distribution of ERVs can be explained by host factors predicted to influence viral transmission and find that internal fertilization has a pronounced effect on retroviral colonization of host genomes. By capturing the mode and pattern of retroviral evolution and contrasting ERV diversity with known retroviral diversity, our study provides a cohesive framework to understand host-virus coevolution better.

  11. Pan-vertebrate comparative genomics unmasks retrovirus macroevolution

    PubMed Central

    Hayward, Alexander; Cornwallis, Charlie K.; Jern, Patric

    2015-01-01

    Although extensive research has demonstrated host-retrovirus microevolutionary dynamics, it has been difficult to gain a deeper understanding of the macroevolutionary patterns of host–retrovirus interactions. Here we use recent technological advances to infer broad patterns in retroviral diversity, evolution, and host–virus relationships by using a large-scale phylogenomic approach using endogenous retroviruses (ERVs). Retroviruses insert a proviral DNA copy into the host cell genome to produce new viruses. ERVs are provirus insertions in germline cells that are inherited down the host lineage and consequently present a record of past host–viral associations. By mining ERVs from 65 host genomes sampled across vertebrate diversity, we uncover a great diversity of ERVs, indicating that retroviral sequences are much more prevalent and widespread across vertebrates than previously appreciated. The majority of ERV clades that we recover do not contain known retroviruses, implying either that retroviral lineages are highly transient over evolutionary time or that a considerable number of retroviruses remain to be identified. By characterizing the distribution of ERVs, we show that no major vertebrate lineage has escaped retroviral activity and that retroviruses are extreme host generalists, having an unprecedented ability for rampant host switching among distantly related vertebrates. In addition, we examine whether the distribution of ERVs can be explained by host factors predicted to influence viral transmission and find that internal fertilization has a pronounced effect on retroviral colonization of host genomes. By capturing the mode and pattern of retroviral evolution and contrasting ERV diversity with known retroviral diversity, our study provides a cohesive framework to understand host–virus coevolution better. PMID:25535393

  12. AVIAN IMMUNOTOXICOLOGY

    EPA Science Inventory

    Methods for studying the avian immune system have matured during the past two decades, with laboratory studies predominating in earlier years and field studies being conducted only in the past decade. One application has been to determine the potential for environmental contamina...

  13. Comparative analysis of microsatellites in chloroplast genomes of lower and higher plants.

    PubMed

    George, Biju; Bhatt, Bhavin S; Awasthi, Mayur; George, Binu; Singh, Achuit K

    2015-11-01

    Microsatellites, or simple sequence repeats (SSRs), are repetitive DNA sequences in which tandem repeats of one to six base pairs are present a number of times. Chloroplast genome sequences have been shown to possess extensive variations in the length, number and distribution of SSRs. However, a comparative analysis of chloroplast microsatellites is not available. Considering their potential importance in generating genomic diversity, we have systematically analysed the abundance and distribution of simple and compound microsatellites in 164 sequenced chloroplast genomes from a wide range of plants. The key findings of these studies are (1) a large number of mononucleotide repeats as compared to SSR(2-6) (di-, tri-, tetra-, penta-, hexanucleotide repeats) are present in all chloroplast genomes investigated, (2) lower plants such as algae show wide variation in relative abundance, density and distribution of microsatellite repeats as compared to flowering plants, (3) longer SSRs are excluded from coding regions of most chloroplast genomes, (4) GC content has a weak influence on number, relative abundance and relative density of mononucleotide repeats as well as SSR(2-6). However, GC content showed a strong negative correlation with relative density (R² = 0.5, P < 0.05) and relative abundance (R² = 0.6, P < 0.05) of cSSRs. In summary, our comparative studies of chloroplast genomes illustrate the variable distribution of microsatellites and revealed that the chloroplast genomes of smaller plants possess relatively more genomic diversity compared to higher plants.

  14. Delineation of Steroid-Degrading Microorganisms through Comparative Genomic Analysis

    PubMed Central

    Bergstrand, Lee H.; Cardenas, Erick; Holert, Johannes; Van Hamme, Jonathan D.

    2016-01-01

    ABSTRACT Steroids are ubiquitous in natural environments and are a significant growth substrate for microorganisms. Microbial steroid metabolism is also important for some pathogens and for biotechnical applications. This study delineated the distribution of aerobic steroid catabolism pathways among over 8,000 microorganisms whose genomes are available in the NCBI RefSeq database. Combined analysis of bacterial, archaeal, and fungal genomes with both hidden Markov models and reciprocal BLAST identified 265 putative steroid degraders within only Actinobacteria and Proteobacteria, which mainly originated from soil, eukaryotic host, and aquatic environments. These bacteria include members of 17 genera not previously known to contain steroid degraders. A pathway for cholesterol degradation was conserved in many actinobacterial genera, particularly in members of the Corynebacterineae, and a pathway for cholate degradation was conserved in members of the genus Rhodococcus. A pathway for testosterone and, sometimes, cholate degradation had a patchy distribution among Proteobacteria. The steroid degradation genes tended to occur within large gene clusters. Growth experiments confirmed bioinformatic predictions of steroid metabolism capacity in nine bacterial strains. The results indicate there was a single ancestral 9,10-seco-steroid degradation pathway. Gene duplication, likely in a progenitor of Rhodococcus, later gave rise to a cholate degradation pathway. Proteobacteria and additional Actinobacteria subsequently obtained a cholate degradation pathway via horizontal gene transfer, in some cases facilitated by plasmids. Catabolism of steroids appears to be an important component of the ecological niches of broad groups of Actinobacteria and individual species of Proteobacteria. PMID:26956583

  15. Comparative genomics provide insights into evolution of Trichoderma nutrition style.

    PubMed

    Xie, Bin-Bin; Qin, Qi-Long; Shi, Mei; Chen, Lei-Lei; Shu, Yan-Li; Luo, Yan; Wang, Xiao-Wei; Rong, Jin-Cheng; Gong, Zhi-Ting; Li, Dan; Sun, Cai-Yun; Liu, Gui-Ming; Dong, Xiao-Wei; Pang, Xiu-Hua; Huang, Feng; Liu, Weifeng; Chen, Xiu-Lan; Zhou, Bai-Cheng; Zhang, Yu-Zhong; Song, Xiao-Yan

    2014-02-01

    Saprotrophy on plant biomass is a recently developed nutrition strategy for Trichoderma. However, the physiology and evolution of this new nutrition strategy is still elusive. We report the deep sequencing and analysis of the genome of Trichoderma longibrachiatum, an efficient cellulase producer. The 31.7-Mb genome, smallest among the sequenced Trichoderma species, encodes fewer nutrition-related genes than saprotrophic T. reesei (Tr), including glycoside hydrolases and nonribosomal peptide synthetase-polyketide synthase. Homology and phylogenetic analyses suggest that a large number of nutrition-related genes, including GH18 chitinases, β-1,3/1,6-glucanases, cellulolytic enzymes, and hemicellulolytic enzymes, were lost in the common ancestor of T. longibrachiatum (Tl) and Tr. dN/dS (ω) calculation indicates that all the nutrition-related genes analyzed are under purifying selection. Cellulolytic enzymes, the key enzymes for saprotrophy on plant biomass, are under stronger purifying selection pressure in Tl and Tr than in mycoparasitic species, suggesting that development of the nutrition strategy of saprotrophy on plant biomass has increased the selection pressure. In addition, aspartic proteases, serine proteases, and metalloproteases are subject to stronger purifying selection pressure in Tl and Tr, suggesting that these enzymes may also play important roles in the nutrition. This study provides insights into the physiology and evolution of the nutrition strategy of Trichoderma.

  16. Comparative Genomics Provide Insights into Evolution of Trichoderma Nutrition Style

    PubMed Central

    Xie, Bin-Bin; Qin, Qi-Long; Shi, Mei; Chen, Lei-Lei; Shu, Yan-Li; Luo, Yan; Wang, Xiao-Wei; Rong, Jin-Cheng; Gong, Zhi-Ting; Li, Dan; Sun, Cai-Yun; Liu, Gui-Ming; Dong, Xiao-Wei; Pang, Xiu-Hua; Huang, Feng; Liu, Weifeng; Chen, Xiu-Lan; Zhou, Bai-Cheng; Zhang, Yu-Zhong; Song, Xiao-Yan

    2014-01-01

    Saprotrophy on plant biomass is a recently developed nutrition strategy for Trichoderma. However, the physiology and evolution of this new nutrition strategy is still elusive. We report the deep sequencing and analysis of the genome of Trichoderma longibrachiatum, an efficient cellulase producer. The 31.7-Mb genome, smallest among the sequenced Trichoderma species, encodes fewer nutrition-related genes than saprotrophic T. reesei (Tr), including glycoside hydrolases and nonribosomal peptide synthetase–polyketide synthase. Homology and phylogenetic analyses suggest that a large number of nutrition-related genes, including GH18 chitinases, β-1,3/1,6-glucanases, cellulolytic enzymes, and hemicellulolytic enzymes, were lost in the common ancestor of T. longibrachiatum (Tl) and Tr. dN/dS (ω) calculation indicates that all the nutrition-related genes analyzed are under purifying selection. Cellulolytic enzymes, the key enzymes for saprotrophy on plant biomass, are under stronger purifying selection pressure in Tl and Tr than in mycoparasitic species, suggesting that development of the nutrition strategy of saprotrophy on plant biomass has increased the selection pressure. In addition, aspartic proteases, serine proteases, and metalloproteases are subject to stronger purifying selection pressure in Tl and Tr, suggesting that these enzymes may also play important roles in the nutrition. This study provides insights into the physiology and evolution of the nutrition strategy of Trichoderma. PMID:24482532

  17. Comparative genomics on nemo-like kinase gene.

    PubMed

    Katoh, Masuko; Katoh, Masaru

    2005-06-01

    WNT signals are transduced to the planar cell polarity (PCP) pathway or the beta-catenin pathway. Drosophila Frizzled (Fz), Starry night (Stan), Van Gogh (Vang), Dishevelled (Dsh), Prickle (Pk), Diego (Dgo) and Nemo (Nmo) are implicated in the PCP signaling pathway. Choi and Benzer identified Drosophila Nmo in 1994, and Brott et al identified mouse Nemo-like kinase (Nlk) in 1998. Nlk positively regulates the PCP pathway, and negatively regulates the beta-catenin pathway. Here, we identified and characterized rat Nlk gene, Nlk2 gene and Nlkp pseudogene by using bioinformatics. Nlk gene, consisting of 11 exons, was mapped to rat chromosome 10q25. Rat Nlk gene encoded 515-aa Nlk protein with the serine/threonine kinase domain, poly(His) tracts and poly(Ala) tract, which showed 100, 99.8, 97.1 and 89.5% total-amino-acid identity with mouse Nlk, human NLK, Xenopus nlk and zebrafish nlk, respectively. Rat Nlk2 gene and Nlkp pseudogene were mapped to rat chromosome 13p13 and 2q44, respectively. Nlk2 gene and Nlkp pseudogene, consisting of a single exon, were not evolutionarily conserved. Nlk2 gene and Nlkp pseudogene were predicted as retrotransposed Nlk homologs within the rat genome. Nlk2 gene encoded a 480-aa Nlk2 protein with partial deletion within the kinase domain, which was predicted as the dominant negative Nlk homolog. This is the first report on the Nlk gene and retrotransposed Nlk homologs within the rat genome.

  18. Comparative genomics of the neglected human malaria parasite Plasmodium vivax.

    PubMed

    Carlton, Jane M; Adams, John H; Silva, Joana C; Bidwell, Shelby L; Lorenzi, Hernan; Caler, Elisabet; Crabtree, Jonathan; Angiuoli, Samuel V; Merino, Emilio F; Amedeo, Paolo; Cheng, Qin; Coulson, Richard M R; Crabb, Brendan S; Del Portillo, Hernando A; Essien, Kobby; Feldblyum, Tamara V; Fernandez-Becerra, Carmen; Gilson, Paul R; Gueye, Amy H; Guo, Xiang; Kang'a, Simon; Kooij, Taco W A; Korsinczky, Michael; Meyer, Esmeralda V-S; Nene, Vish; Paulsen, Ian; White, Owen; Ralph, Stuart A; Ren, Qinghu; Sargeant, Tobias J; Salzberg, Steven L; Stoeckert, Christian J; Sullivan, Steven A; Yamamoto, Marcio M; Hoffman, Stephen L; Wortman, Jennifer R; Gardner, Malcolm J; Galinski, Mary R; Barnwell, John W; Fraser-Liggett, Claire M

    2008-10-01

    The human malaria parasite Plasmodium vivax is responsible for 25-40% of the approximately 515 million annual cases of malaria worldwide. Although seldom fatal, the parasite elicits severe and incapacitating clinical symptoms and often causes relapses months after a primary infection has cleared. Despite its importance as a major human pathogen, P. vivax is little studied because it cannot be propagated continuously in the laboratory except in non-human primates. We sequenced the genome of P. vivax to shed light on its distinctive biological features, and as a means to drive development of new drugs and vaccines. Here we describe the synteny and isochore structure of P. vivax chromosomes, and show that the parasite resembles other malaria parasites in gene content and metabolic potential, but possesses novel gene families and potential alternative invasion pathways not recognized previously. Completion of the P. vivax genome provides the scientific community with a valuable resource that can be used to advance investigation into this neglected species.

  19. Evolution of Prdm Genes in Animals: Insights from Comparative Genomics.

    PubMed

    Vervoort, Michel; Meulemeester, David; Béhague, Julien; Kerner, Pierre

    2016-03-01

    Prdm genes encode transcription factors with a subtype of SET domain known as the PRDF1-RIZ (PR) homology domain and a variable number of zinc finger motifs. These genes are involved in a wide variety of functions during animal development. As most Prdm genes have been studied in vertebrates, especially in mice, little is known about the evolution of this gene family. We searched for Prdm genes in the fully sequenced genomes of 93 different species representative of all the main metazoan lineages. A total of 976 Prdm genes were identified in these species. The number of Prdm genes per species ranges from 2 to 19. To better understand how the Prdm gene family has evolved in metazoans, we performed phylogenetic analyses using this large set of id