USDA-ARS?s Scientific Manuscript database
Genome evolution influences a parasite’s’s pathogenicity, host-pathogen interactions, environmental constraints, and invasion biology, while genome assemblies form the basis of comparative sequence analyses. Given that closely related organisms typically maintain appreciable synteny, the genome asse...
Wang, Xiao-Wei; Zhao, Qiong-Yi; Luan, Jun-Bo; Wang, Yu-Jun; Yan, Gen-Hong; Liu, Shu-Sheng
2012-10-04
Genomic divergence between invasive and native species may provide insight into the molecular basis underlying specific characteristics that drive the invasion and displacement of closely related species. In this study, we sequenced the transcriptome of an indigenous species, Asia II 3, of the Bemisia tabaci complex and compared its genetic divergence with the transcriptomes of two invasive whiteflies species, Middle East Asia Minor 1 (MEAM1) and Mediterranean (MED), respectively. More than 16 million reads of 74 base pairs in length were obtained for the Asia II 3 species using the Illumina sequencing platform. These reads were assembled into 52,535 distinct sequences (mean size: 466 bp) and 16,596 sequences were annotated with an E-value above 10-5. Protein family comparisons revealed obvious diversification among the transcriptomes of these species suggesting species-specific adaptations during whitefly evolution. On the contrary, substantial conservation of the whitefly transcriptomes was also evident, despite their differences. The overall divergence of coding sequences between the orthologous gene pairs of Asia II 3 and MEAM1 is 1.73%, which is comparable to the average divergence of Asia II 3 and MED transcriptomes (1.84%) and much higher than that of MEAM1 and MED (0.83%). This is consistent with the previous phylogenetic analyses and crossing experiments suggesting these are distinct species. We also identified hundreds of highly diverged genes and compiled sequence identify data into gene functional groups and found the most divergent gene classes are Cytochrome P450, Glutathione metabolism and Oxidative phosphorylation. These results strongly suggest that the divergence of genes related to metabolism might be the driving force of the MEAM1 and Asia II 3 differentiation. We also analyzed single nucleotide polymorphisms within the orthologous gene pairs of indigenous and invasive whiteflies which are helpful for the investigation of association between allelic and phenotypes. Our data present the most comprehensive sequences for the indigenous whitefly species Asia II 3. The extensive comparisons of Asia II 3, MEAM1 and MED transcriptomes will serve as an invaluable resource for revealing the genetic basis of whitefly invasion and the molecular mechanisms underlying their biological differences.
2012-01-01
Background Genomic divergence between invasive and native species may provide insight into the molecular basis underlying specific characteristics that drive the invasion and displacement of closely related species. In this study, we sequenced the transcriptome of an indigenous species, Asia II 3, of the Bemisia tabaci complex and compared its genetic divergence with the transcriptomes of two invasive whiteflies species, Middle East Asia Minor 1 (MEAM1) and Mediterranean (MED), respectively. Results More than 16 million reads of 74 base pairs in length were obtained for the Asia II 3 species using the Illumina sequencing platform. These reads were assembled into 52,535 distinct sequences (mean size: 466 bp) and 16,596 sequences were annotated with an E-value above 10-5. Protein family comparisons revealed obvious diversification among the transcriptomes of these species suggesting species-specific adaptations during whitefly evolution. On the contrary, substantial conservation of the whitefly transcriptomes was also evident, despite their differences. The overall divergence of coding sequences between the orthologous gene pairs of Asia II 3 and MEAM1 is 1.73%, which is comparable to the average divergence of Asia II 3 and MED transcriptomes (1.84%) and much higher than that of MEAM1 and MED (0.83%). This is consistent with the previous phylogenetic analyses and crossing experiments suggesting these are distinct species. We also identified hundreds of highly diverged genes and compiled sequence identify data into gene functional groups and found the most divergent gene classes are Cytochrome P450, Glutathione metabolism and Oxidative phosphorylation. These results strongly suggest that the divergence of genes related to metabolism might be the driving force of the MEAM1 and Asia II 3 differentiation. We also analyzed single nucleotide polymorphisms within the orthologous gene pairs of indigenous and invasive whiteflies which are helpful for the investigation of association between allelic and phenotypes. Conclusions Our data present the most comprehensive sequences for the indigenous whitefly species Asia II 3. The extensive comparisons of Asia II 3, MEAM1 and MED transcriptomes will serve as an invaluable resource for revealing the genetic basis of whitefly invasion and the molecular mechanisms underlying their biological differences. PMID:23036081
Sex Chromosome Turnover Contributes to Genomic Divergence between Incipient Stickleback Species
Yoshida, Kohta; Makino, Takashi; Yamaguchi, Katsushi; Shigenobu, Shuji; Hasebe, Mitsuyasu; Kawata, Masakado; Kume, Manabu; Mori, Seiichi; Peichel, Catherine L.; Toyoda, Atsushi; Fujiyama, Asao; Kitano, Jun
2014-01-01
Sex chromosomes turn over rapidly in some taxonomic groups, where closely related species have different sex chromosomes. Although there are many examples of sex chromosome turnover, we know little about the functional roles of sex chromosome turnover in phenotypic diversification and genomic evolution. The sympatric pair of Japanese threespine stickleback (Gasterosteus aculeatus) provides an excellent system to address these questions: the Japan Sea species has a neo-sex chromosome system resulting from a fusion between an ancestral Y chromosome and an autosome, while the sympatric Pacific Ocean species has a simple XY sex chromosome system. Furthermore, previous quantitative trait locus (QTL) mapping demonstrated that the Japan Sea neo-X chromosome contributes to phenotypic divergence and reproductive isolation between these sympatric species. To investigate the genomic basis for the accumulation of genes important for speciation on the neo-X chromosome, we conducted whole genome sequencing of males and females of both the Japan Sea and the Pacific Ocean species. No substantial degeneration has yet occurred on the neo-Y chromosome, but the nucleotide sequence of the neo-X and the neo-Y has started to diverge, particularly at regions near the fusion. The neo-sex chromosomes also harbor an excess of genes with sex-biased expression. Furthermore, genes on the neo-X chromosome showed higher non-synonymous substitution rates than autosomal genes in the Japan Sea lineage. Genomic regions of higher sequence divergence between species, genes with divergent expression between species, and QTL for inter-species phenotypic differences were found not only at the regions near the fusion site, but also at other regions along the neo-X chromosome. Neo-sex chromosomes can therefore accumulate substitutions causing species differences even in the absence of substantial neo-Y degeneration. PMID:24625862
Lim, K Yoong; Kovarik, Ales; Matyasek, Roman; Chase, Mark W; Knapp, Sandra; McCarthy, Elizabeth; Clarkson, James J; Leitch, Andrew R
2006-12-01
Combining phylogenetic reconstructions of species relationships with comparative genomic approaches is a powerful way to decipher evolutionary events associated with genome divergence. Here, we reconstruct the history of karyotype and tandem repeat evolution in species of diploid Nicotiana section Alatae. By analysis of plastid DNA, we resolved two clades with high bootstrap support, one containing N. alata, N. langsdorffii, N. forgetiana and N. bonariensis (called the n = 9 group) and another containing N. plumbaginifolia and N. longiflora (called the n = 10 group). Despite little plastid DNA sequence divergence, we observed, via fluorescent in situ hybridization, substantial chromosomal repatterning, including altered chromosome numbers, structure and distribution of repeats. Effort was focussed on 35S and 5S nuclear ribosomal DNA (rDNA) and the HRS60 satellite family of tandem repeats comprising the elements HRS60, NP3R and NP4R. We compared divergence of these repeats in diploids and polyploids of Nicotiana. There are dramatic shifts in the distribution of the satellite repeats and complete replacement of intergenic spacers (IGSs) of 35S rDNA associated with divergence of the species in section Alatae. We suggest that sequence homogenization has replaced HRS60 family repeats at sub-telomeric regions, but that this process may not occur, or occurs more slowly, when the repeats are found at intercalary locations. Sequence homogenization acts more rapidly (at least two orders of magnitude) on 35S rDNA than 5S rDNA and sub-telomeric satellite sequences. This rapid rate of divergence is analogous to that found in polyploid species, and is therefore, in plants, not only associated with polyploidy.
Vorticity and divergence in the solar photosphere
NASA Technical Reports Server (NTRS)
Wang, YI; Noyes, Robert W.; Tarbell, Theodore D.; Title, Alan M.
1995-01-01
We have studied an outstanding sequence of continuum images of the solar granulation from Pic du Midi Observatory. We have calculated the horizontal vector flow field using a correlation tracking algorithm, and from this determined three scalar field: the vertical component of the curl; the horizontal divergence; and the horizontal flow speed. The divergence field has substantially longer coherence time and more power than does the curl field. Statistically, curl is better correlated with regions of negative divergence - that is, the vertical vorticity is higher in downflow regions, suggesting excess vorticity in intergranular lanes. The average value of the divergence is largest (i.e., outflow is largest) where the horizontal speed is large; we associate these regions with exploding granules. A numerical simulation of general convection also shows similar statistical differences between curl and divergence. Some individual small bright points in the granulation pattern show large local vorticities.
New genes from old: asymmetric divergence of gene duplicates and the evolution of development.
Holland, Peter W H; Marlétaz, Ferdinand; Maeso, Ignacio; Dunwell, Thomas L; Paps, Jordi
2017-02-05
Gene duplications and gene losses have been frequent events in the evolution of animal genomes, with the balance between these two dynamic processes contributing to major differences in gene number between species. After gene duplication, it is common for both daughter genes to accumulate sequence change at approximately equal rates. In some cases, however, the accumulation of sequence change is highly uneven with one copy radically diverging from its paralogue. Such 'asymmetric evolution' seems commoner after tandem gene duplication than after whole-genome duplication, and can generate substantially novel genes. We describe examples of asymmetric evolution in duplicated homeobox genes of moths, molluscs and mammals, in each case generating new homeobox genes that were recruited to novel developmental roles. The prevalence of asymmetric divergence of gene duplicates has been underappreciated, in part, because the origin of highly divergent genes can be difficult to resolve using standard phylogenetic methods.This article is part of the themed issue 'Evo-devo in the genomics era, and the origins of morphological diversity'. © 2016 The Author(s).
Han, Xiang Y; Sizer, Kurt C; Thompson, Erika J; Kabanja, Juma; Li, Jun; Hu, Peter; Gómez-Valero, Laura; Silva, Francisco J
2009-10-01
Mycobacterium lepromatosis is a newly discovered leprosy-causing organism. Preliminary phylogenetic analysis of its 16S rRNA gene and a few other gene segments revealed significant divergence from Mycobacterium leprae, a well-known cause of leprosy, that justifies the status of M. lepromatosis as a new species. In this study we analyzed the sequences of 20 genes and pseudogenes (22,814 nucleotides). Overall, the level of matching of these sequences with M. leprae sequences was 90.9%, which substantiated the species-level difference; the levels of matching for the 16S rRNA genes and 14 protein-encoding genes were 98.0% and 93.1%, respectively, but the level of matching for five pseudogenes was only 79.1%. Five conserved protein-encoding genes were selected to construct phylogenetic trees and to calculate the numbers of synonymous substitutions (dS values) and nonsynonymous substitutions (dN values) in the two species. Robust phylogenetic trees constructed using concatenated alignment of these genes placed M. lepromatosis and M. leprae in a tight cluster with long terminal branches, implying that the divergence occurred long ago. The dS and dN values were also much higher than those for other closest pairs of mycobacteria. The dS values were 14 to 28% of the dS values for M. leprae and Mycobacterium tuberculosis, a more divergent pair of species. These results thus indicate that M. lepromatosis and M. leprae diverged approximately 10 million years ago. The M. lepromatosis pseudogenes analyzed that were also pseudogenes in M. leprae showed nearly neutral evolution, and their relative ages were similar to those of M. leprae pseudogenes, suggesting that they were pseudogenes before divergence. Taken together, the results described above indicate that M. lepromatosis and M. leprae diverged from a common ancestor after the massive gene inactivation event described previously for M. leprae.
Decelerated genome evolution in modern vertebrates revealed by analysis of multiple lancelet genomes
Huang, Shengfeng; Chen, Zelin; Yan, Xinyu; Yu, Ting; Huang, Guangrui; Yan, Qingyu; Pontarotti, Pierre Antoine; Zhao, Hongchen; Li, Jie; Yang, Ping; Wang, Ruihua; Li, Rui; Tao, Xin; Deng, Ting; Wang, Yiquan; Li, Guang; Zhang, Qiujin; Zhou, Sisi; You, Leiming; Yuan, Shaochun; Fu, Yonggui; Wu, Fenfang; Dong, Meiling; Chen, Shangwu; Xu, Anlong
2014-01-01
Vertebrates diverged from other chordates ~500 Myr ago and experienced successful innovations and adaptations, but the genomic basis underlying vertebrate origins are not fully understood. Here we suggest, through comparison with multiple lancelet (amphioxus) genomes, that ancient vertebrates experienced high rates of protein evolution, genome rearrangement and domain shuffling and that these rates greatly slowed down after the divergence of jawed and jawless vertebrates. Compared with lancelets, modern vertebrates retain, at least relatively, less protein diversity, fewer nucleotide polymorphisms, domain combinations and conserved non-coding elements (CNE). Modern vertebrates also lost substantial transposable element (TE) diversity, whereas lancelets preserve high TE diversity that includes even the long-sought RAG transposon. Lancelets also exhibit rapid gene turnover, pervasive transcription, fastest exon shuffling in metazoans and substantial TE methylation not observed in other invertebrates. These new lancelet genome sequences provide new insights into the chordate ancestral state and the vertebrate evolution. PMID:25523484
Huang, Shengfeng; Chen, Zelin; Yan, Xinyu; Yu, Ting; Huang, Guangrui; Yan, Qingyu; Pontarotti, Pierre Antoine; Zhao, Hongchen; Li, Jie; Yang, Ping; Wang, Ruihua; Li, Rui; Tao, Xin; Deng, Ting; Wang, Yiquan; Li, Guang; Zhang, Qiujin; Zhou, Sisi; You, Leiming; Yuan, Shaochun; Fu, Yonggui; Wu, Fenfang; Dong, Meiling; Chen, Shangwu; Xu, Anlong
2014-12-19
Vertebrates diverged from other chordates ~500 Myr ago and experienced successful innovations and adaptations, but the genomic basis underlying vertebrate origins are not fully understood. Here we suggest, through comparison with multiple lancelet (amphioxus) genomes, that ancient vertebrates experienced high rates of protein evolution, genome rearrangement and domain shuffling and that these rates greatly slowed down after the divergence of jawed and jawless vertebrates. Compared with lancelets, modern vertebrates retain, at least relatively, less protein diversity, fewer nucleotide polymorphisms, domain combinations and conserved non-coding elements (CNE). Modern vertebrates also lost substantial transposable element (TE) diversity, whereas lancelets preserve high TE diversity that includes even the long-sought RAG transposon. Lancelets also exhibit rapid gene turnover, pervasive transcription, fastest exon shuffling in metazoans and substantial TE methylation not observed in other invertebrates. These new lancelet genome sequences provide new insights into the chordate ancestral state and the vertebrate evolution.
Extensive Local Gene Duplication and Functional Divergence among Paralogs in Atlantic Salmon
Warren, Ian A.; Ciborowski, Kate L.; Casadei, Elisa; Hazlerigg, David G.; Martin, Sam; Jordan, William C.; Sumner, Seirian
2014-01-01
Many organisms can generate alternative phenotypes from the same genome, enabling individuals to exploit diverse and variable environments. A prevailing hypothesis is that such adaptation has been favored by gene duplication events, which generate redundant genomic material that may evolve divergent functions. Vertebrate examples of recent whole-genome duplications are sparse although one example is the salmonids, which have undergone a whole-genome duplication event within the last 100 Myr. The life-cycle of the Atlantic salmon, Salmo salar, depends on the ability to produce alternating phenotypes from the same genome, to facilitate migration and maintain its anadromous life history. Here, we investigate the hypothesis that genome-wide and local gene duplication events have contributed to the salmonid adaptation. We used high-throughput sequencing to characterize the transcriptomes of three key organs involved in regulating migration in S. salar: Brain, pituitary, and olfactory epithelium. We identified over 10,000 undescribed S. salar sequences and designed an analytic workflow to distinguish between paralogs originating from local gene duplication events or from whole-genome duplication events. These data reveal that substantial local gene duplications took place shortly after the whole-genome duplication event. Many of the identified paralog pairs have either diverged in function or become noncoding. Future functional genomics studies will reveal to what extent this rich source of divergence in genetic sequence is likely to have facilitated the evolution of extreme phenotypic plasticity required for an anadromous life-cycle. PMID:24951567
DOE Office of Scientific and Technical Information (OSTI.GOV)
Podsiadlowski, L.; Carapelli, A.; Nardi, F.
2005-12-01
Mitochondrial genomes from two dipluran hexapods of the genus Campodea have been sequenced. Gene order is the same as in most other hexapods and crustaceans. Secondary structures of tRNAs reveal specific structural changes in tRNA-C, tRNA-R, tRNA-S1 and tRNA-S2. Comparative analyses of nucleotide and amino acid composition, as well as structural features of both ribosomal RNA subunits, reveal substantial differences among the analyzed taxa. Although the two Campodea species are morphologically highly uniform, genetic divergence is larger than expected, suggesting a long evolutionary history under stable ecological conditions.
Inversions and Gene Order Shuffling in Anopheles gambiae and A. funestus
NASA Astrophysics Data System (ADS)
Sharakhov, Igor V.; Serazin, Andrew C.; Grushko, Olga G.; Dana, Ali; Lobo, Neil; Hillenmeyer, Maureen E.; Westerman, Richard; Romero-Severson, Jeanne; Costantini, Carlo; Sagnon, N'Fale; Collins, Frank H.; Besansky, Nora J.
2002-10-01
In tropical Africa, Anopheles funestus is one of the three most important malaria vectors. We physically mapped 157 A. funestus complementary DNAs (cDNAs) to the polytene chromosomes of this species. Sequences of the cDNAs were mapped in silico to the A. gambiae genome as part of a comparative genomic study of synteny, gene order, and sequence conservation between A. funestus and A. gambiae. These species are in the same subgenus and diverged about as recently as humans and chimpanzees. Despite nearly perfect preservation of synteny, we found substantial shuffling of gene order along corresponding chromosome arms. Since the divergence of these species, at least 70 chromosomal inversions have been fixed, the highest rate of rearrangement of any eukaryote studied to date. The high incidence of paracentric inversions and limited colinearity suggests that locating genes in one anopheline species based on gene order in another may be limited to closely related taxa.
Laskar, Boni A.; Bhattacharjee, Maloyjo J.; Dhar, Bishal; Mahadani, Pradosh; Kundu, Shantanu; Ghosh, Sankar K.
2013-01-01
Background The taxonomic validity of Northeast Indian endemic Mahseer species, Tor progeneius and Neolissochilus hexastichus, has been argued repeatedly. This is mainly due to disagreements in recognizing the species based on morphological characters. Consequently, both the species have been concealed for many decades. DNA barcoding has become a promising and an independent technique for accurate species level identification. Therefore, utilization of such technique in association with the traditional morphotaxonomic description can resolve the species dilemma of this important group of sport fishes. Methodology/Principal Findings Altogether, 28 mahseer specimens including paratypes were studied from different locations in Northeast India, and 24 morphometric characters were measured invariably. The Principal Component Analysis with morphometric data revealed five distinct groups of sample that were taxonomically categorized into 4 species, viz., Tor putitora, T. progeneius, Neolissochilus hexagonolepis and N. hexastichus. Analysis with a dataset of 76 DNA barcode sequences of different mahseer species exhibited that the queries of T. putitora and N. hexagonolepis clustered cohesively with the respective conspecific database sequences maintaining 0.8% maximum K2P divergence. The closest congeneric divergence was 3 times higher than the mean conspecific divergence and was considered as barcode gap. The maximum divergence among the samples of T. progeneius and T. putitora was 0.8% that was much below the barcode gap, indicating them being synonymous. The query sequences of N. hexastichus invariably formed a discrete and a congeneric clade with the database sequences and maintained the interspecific divergence that supported its distinct species status. Notably, N. hexastichus was encountered in a single site and seemed to be under threat. Conclusion This study substantiated the identification of N. hexastichus to be a true species, and tentatively regarded T. progeneius to be a synonym of T. putitora. It would guide the conservationists to initiate priority conservation of N. hexastichus and T. putitora. PMID:23341979
2013-01-01
Background Deep sequencing of viruses isolated from infected hosts is an efficient way to measure population-genetic variation and can reveal patterns of dispersal and natural selection. In this study, we mined existing Illumina sequence reads to investigate single-nucleotide polymorphisms (SNPs) within two RNA viruses of the Western honey bee (Apis mellifera), deformed wing virus (DWV) and Israel acute paralysis virus (IAPV). All viral RNA was extracted from North American samples of honey bees or, in one case, the ectoparasitic mite Varroa destructor. Results Coverage depth was generally lower for IAPV than DWV, and marked gaps in coverage occurred in several narrow regions (< 50 bp) of IAPV. These coverage gaps occurred across sequencing runs and were virtually unchanged when reads were re-mapped with greater permissiveness (up to 8% divergence), suggesting a recurrent sequencing artifact rather than strain divergence. Consensus sequences of DWV for each sample showed little phylogenetic divergence, low nucleotide diversity, and strongly negative values of Fu and Li’s D statistic, suggesting a recent population bottleneck and/or purifying selection. The Kakugo strain of DWV fell outside of all other DWV sequences at 100% bootstrap support. IAPV consensus sequences supported the existence of multiple clades as had been previously reported, and Fu and Li’s D was closer to neutral expectation overall, although a sliding-window analysis identified a significantly positive D within the protease region, suggesting selection maintains diversity in that region. Within-sample mean diversity was comparable between the two viruses on average, although for both viruses there was substantial variation among samples in mean diversity at third codon positions and in the number of high-diversity sites. FST values were bimodal for DWV, likely reflecting neutral divergence in two low-diversity populations, whereas IAPV had several sites that were strong outliers with very low FST. Conclusions This initial survey of genetic variation within honey bee RNA viruses suggests future directions for studies examining the underlying causes of population-genetic structure in these economically important pathogens. PMID:23497218
Cornman, Robert Scott; Boncristiani, Humberto; Dainat, Benjamin; Chen, Yanping; vanEngelsdorp, Dennis; Weaver, Daniel; Evans, Jay D
2013-03-07
Deep sequencing of viruses isolated from infected hosts is an efficient way to measure population-genetic variation and can reveal patterns of dispersal and natural selection. In this study, we mined existing Illumina sequence reads to investigate single-nucleotide polymorphisms (SNPs) within two RNA viruses of the Western honey bee (Apis mellifera), deformed wing virus (DWV) and Israel acute paralysis virus (IAPV). All viral RNA was extracted from North American samples of honey bees or, in one case, the ectoparasitic mite Varroa destructor. Coverage depth was generally lower for IAPV than DWV, and marked gaps in coverage occurred in several narrow regions (< 50 bp) of IAPV. These coverage gaps occurred across sequencing runs and were virtually unchanged when reads were re-mapped with greater permissiveness (up to 8% divergence), suggesting a recurrent sequencing artifact rather than strain divergence. Consensus sequences of DWV for each sample showed little phylogenetic divergence, low nucleotide diversity, and strongly negative values of Fu and Li's D statistic, suggesting a recent population bottleneck and/or purifying selection. The Kakugo strain of DWV fell outside of all other DWV sequences at 100% bootstrap support. IAPV consensus sequences supported the existence of multiple clades as had been previously reported, and Fu and Li's D was closer to neutral expectation overall, although a sliding-window analysis identified a significantly positive D within the protease region, suggesting selection maintains diversity in that region. Within-sample mean diversity was comparable between the two viruses on average, although for both viruses there was substantial variation among samples in mean diversity at third codon positions and in the number of high-diversity sites. FST values were bimodal for DWV, likely reflecting neutral divergence in two low-diversity populations, whereas IAPV had several sites that were strong outliers with very low FST. This initial survey of genetic variation within honey bee RNA viruses suggests future directions for studies examining the underlying causes of population-genetic structure in these economically important pathogens.
A Large Pseudoautosomal Region on the Sex Chromosomes of the Frog Silurana tropicalis
Bewick, Adam J.; Chain, Frédéric J.J.; Zimmerman, Lyle B.; Sesay, Abdul; Gilchrist, Michael J.; Owens, Nick D.L.; Seifertova, Eva; Krylov, Vladimir; Macha, Jaroslav; Tlapakova, Tereza; Kubickova, Svatava; Cernohorska, Halina; Zarsky, Vojtech; Evans, Ben J.
2013-01-01
Sex chromosome divergence has been documented across phylogenetically diverse species, with amphibians typically having cytologically nondiverged (“homomorphic”) sex chromosomes. With an aim of further characterizing sex chromosome divergence of an amphibian, we used “RAD-tags” and Sanger sequencing to examine sex specificity and heterozygosity in the Western clawed frog Silurana tropicalis (also known as Xenopus tropicalis). Our findings based on approximately 20 million genotype calls and approximately 200 polymerase chain reaction-amplified regions across multiple male and female genomes failed to identify a substantially sized genomic region with genotypic hallmarks of sex chromosome divergence, including in regions known to be tightly linked to the sex-determining region. We also found that expression and molecular evolution of genes linked to the sex-determining region did not differ substantially from genes in other parts of the genome. This suggests that the pseudoautosomal region, where recombination occurs, comprises a large portion of the sex chromosomes of S. tropicalis. These results may in part explain why African clawed frogs have such a high incidence of polyploidization, shed light on why amphibians have a high rate of sex chromosome turnover, and raise questions about why homomorphic sex chromosomes are so prevalent in amphibians. PMID:23666865
Sooty mangabey genome sequence provides insight into AIDS resistance in a natural SIV host.
Palesch, David; Bosinger, Steven E; Tharp, Gregory K; Vanderford, Thomas H; Paiardini, Mirko; Chahroudi, Ann; Johnson, Zachary P; Kirchhoff, Frank; Hahn, Beatrice H; Norgren, Robert B; Patel, Nirav B; Sodora, Donald L; Dawoud, Reem A; Stewart, Caro-Beth; Seepo, Sara M; Harris, R Alan; Liu, Yue; Raveendran, Muthuswamy; Han, Yi; English, Adam; Thomas, Gregg W C; Hahn, Matthew W; Pipes, Lenore; Mason, Christopher E; Muzny, Donna M; Gibbs, Richard A; Sauter, Daniel; Worley, Kim; Rogers, Jeffrey; Silvestri, Guido
2018-01-03
In contrast to infections with human immunodeficiency virus (HIV) in humans and simian immunodeficiency virus (SIV) in macaques, SIV infection of a natural host, sooty mangabeys (Cercocebus atys), is non-pathogenic despite high viraemia. Here we sequenced and assembled the genome of a captive sooty mangabey. We conducted genome-wide comparative analyses of transcript assemblies from C. atys and AIDS-susceptible species, such as humans and macaques, to identify candidates for host genetic factors that influence susceptibility. We identified several immune-related genes in the genome of C. atys that show substantial sequence divergence from macaques or humans. One of these sequence divergences, a C-terminal frameshift in the toll-like receptor-4 (TLR4) gene of C. atys, is associated with a blunted in vitro response to TLR-4 ligands. In addition, we found a major structural change in exons 3-4 of the immune-regulatory protein intercellular adhesion molecule 2 (ICAM-2); expression of this variant leads to reduced cell surface expression of ICAM-2. These data provide a resource for comparative genomic studies of HIV and/or SIV pathogenesis and may help to elucidate the mechanisms by which SIV-infected sooty mangabeys avoid AIDS.
Sooty mangabey genome sequence provides insight into AIDS resistance in a natural SIV host
Palesch, David; Bosinger, Steven E.; Tharp, Gregory K.; Vanderford, Thomas H.; Paiardini, Mirko; Chahroudi, Ann; Johnson, Zachary P.; Kirchhoff, Frank; Hahn, Beatrice H.; Norgren, Robert B.; Patel, Nirav B.; Sodora, Donald L.; Dawoud, Reem A.; Stewart, Caro-Beth; Seepo, Sara M.; Harris, R. Alan; Liu, Yue; Raveendran, Muthuswamy; Han, Yi; English, Adam; Thomas, Gregg W. C.; Hahn, Matthew W.; Pipes, Lenore; Mason, Christopher E.; Muzny, Donna M.; Gibbs, Richard A.; Sauter, Daniel; Worley, Kim; Rogers, Jeffrey; Silvestri, Guido
2018-01-01
In contrast to infections with human immunodeficiency virus (HIV) in humans and simian immunodeficiency virus (SIV) in macaques, SIV infection of a natural host, sooty mangabeys (Cercocebus atys), is non-pathogenic despite high viraemia1. Here we sequenced and assembled the genome of a captive sooty mangabey. We conducted genome-wide comparative analyses of transcript assemblies from C. atys and AIDS-susceptible species, such as humans and macaques, to identify candidates for host genetic factors that influence susceptibility. We identified several immune-related genes in the genome of C. atys that show substantial sequence divergence from macaques or humans. One of these sequence divergences, a C-terminal frameshift in the toll-like receptor-4 (TLR4) gene of C. atys, is associated with a blunted in vitro response to TLR-4 ligands. In addition, we found a major structural change in exons 3–4 of the immune-regulatory protein intercellular adhesion molecule 2 (ICAM-2); expression of this variant leads to reduced cell surface expression of ICAM-2. These data provide a resource for comparative genomic studies of HIV and/or SIV pathogenesis and may help to elucidate the mechanisms by which SIV-infected sooty mangabeys avoid AIDS. PMID:29300007
Li, Peng; Wang, Dechen; Yan, Jinli; Zhou, Jianuan; Deng, Yinyue; Jiang, Zide; Cao, Bihao; He, Zifu; Zhang, Lianhui
2016-01-01
Ralstonia solanacearum species complex is a devastating group of phytopathogens with an unusually wide host range and broad geographical distribution. R. solanacearum isolates may differ considerably in various properties including host range and pathogenicity, but the underlying genetic bases remain vague. Here, we conducted the genome sequencing of strain EP1 isolated from Guangdong Province of China, which belongs to phylotype I and is highly virulent to a range of solanaceous crops. Its complete genome contains a 3.95-Mb chromosome and a 2.05-Mb mega-plasmid, which is considerably bigger than reported genomes of other R. solanacearum strains. Both the chromosome and the mega-plasmid have essential house-keeping genes and many virulence genes. Comparative analysis of strain EP1 with other 3 phylotype I and 3 phylotype II, III, IV strains unveiled substantial genome rearrangements, insertions and deletions. Genome sequences are relatively conserved among the 4 phylotype I strains, but more divergent among strains of different phylotypes. Moreover, the strains exhibited considerable variations in their key virulence genes, including those encoding secretion systems and type III effectors. Our results provide valuable information for further elucidation of the genetic basis of diversified virulences and host range of R. solanacearum species. PMID:27833603
Hübner, Sariel; Rashkovetsky, Eugenia; Kim, Young Bun; Oh, Jung Hun; Michalak, Katarzyna; Weiner, Dmitry; Korol, Abraham B.; Nevo, Eviatar; Michalak, Pawel
2013-01-01
The opposite slopes of “Evolution Canyon” in Israel have served as a natural model system of adaptation to a microclimate contrast. Long-term studies of Drosophila melanogaster populations inhabiting the canyon have exhibited significant interslope divergence in thermal and drought stress resistance, candidate genes, mobile elements, habitat choice, mating discrimination, and wing-shape variation, all despite close physical proximity of the contrasting habitats, as well as substantial interslope migration. To examine patterns of genetic differentiation at the genome-wide level, we used high coverage sequencing of the flies’ genomes. A total of 572 genes were significantly different in allele frequency between the slopes, 106 out of which were associated with 74 significantly overrepresented gene ontology (GO) terms, particularly so with response to stimulus and developmental and reproductive processes, thus corroborating previous observations of interslope divergence in stress response, life history, and mating functions. There were at least 37 chromosomal “islands” of interslope divergence and low sequence polymorphism, plausible signatures of selective sweeps, more abundant in flies derived from one (north-facing) of the slopes. Positive correlation between local recombination rate and the level of nucleotide polymorphism was also found. PMID:24324170
A pronounced evolutionary shift of the pseudoautosomal region boundary in house mice
White, Michael A.; Ikeda, Akihiro; Payseur, Bret A.
2012-01-01
The pseudoautosomal region (PAR) is essential for the accurate pairing and segregation of the X and Y chromosomes during meiosis. Despite its functional significance, the PAR shows substantial evolutionary divergence in structure and sequence between mammalian species. An instructive example of PAR evolution is the house mouse Mus musculus domesticus (represented by the C57BL/6J strain), which has the smallest PAR among those that have been mapped. In C57BL/6J, the PAR boundary is located just ~700 kb from the distal end of the X chromosome, whereas the boundary is found at a more proximal position in Mus spretus, a species that diverged from house mice 2–4 million years ago. Here, we use a combination of genetic and physical mapping to document a pronounced shift in the PAR boundary in a second house mouse subspecies, Mus musculus castaneus (represented by the CAST/EiJ strain), ~430 kb proximal of the M. m. domesticus boundary. We demonstrate molecular evolutionary consequences of this shift, including a marked lineage-specific increase in sequence divergence within Mid1, a gene that resides entirely within the M. m. castaneus PAR but straddles the boundary in other subspecies. Our results extend observations of structural divergence in the PAR to closely related subspecies, pointing to major evolutionary changes in this functionally important genomic region over a short time period. PMID:22763584
A pronounced evolutionary shift of the pseudoautosomal region boundary in house mice.
White, Michael A; Ikeda, Akihiro; Payseur, Bret A
2012-08-01
The pseudoautosomal region (PAR) is essential for the accurate pairing and segregation of the X and Y chromosomes during meiosis. Despite its functional significance, the PAR shows substantial evolutionary divergence in structure and sequence between mammalian species. An instructive example of PAR evolution is the house mouse Mus musculus domesticus (represented by the C57BL/6J strain), which has the smallest PAR among those that have been mapped. In C57BL/6J, the PAR boundary is located just ~700 kb from the distal end of the X chromosome, whereas the boundary is found at a more proximal position in Mus spretus, a species that diverged from house mice 2-4 million years ago. In this study we used a combination of genetic and physical mapping to document a pronounced shift in the PAR boundary in a second house mouse subspecies, Mus musculus castaneus (represented by the CAST/EiJ strain), ~430 kb proximal of the M. m. domesticus boundary. We demonstrate molecular evolutionary consequences of this shift, including a marked lineage-specific increase in sequence divergence within Mid1, a gene that resides entirely within the M. m. castaneus PAR but straddles the boundary in other subspecies. Our results extend observations of structural divergence in the PAR to closely related subspecies, pointing to major evolutionary changes in this functionally important genomic region over a short time period.
Kim, Young-Kyu; Park, Chong-wook; Kim, Ki-Joong
2009-03-31
The chloroplast DNA sequences of Megaleranthis saniculifolia, an endemic and monotypic endangered plant species, were completed in this study (GenBank FJ597983). The genome is 159,924 bp in length. It harbors a pair of IR regions consisting of 26,608 bp each. The lengths of the LSC and SSC regions are 88,326 bp and 18,382 bp, respectively. The structural organizations, gene and intron contents, gene orders, AT contents, codon usages, and transcription units of the Megaleranthis chloroplast genome are similar to those of typical land plant cp DNAs. However, the detailed features of Megaleranthis chloroplast genomes are substantially different from that of Ranunculus, which belongs to the same family, the Ranunculaceae. First, the Megaleranthis cp DNA was 4,797 bp longer than that of Ranunculus due to an expanded IR region into the SSC region and duplicated sequence elements in several spacer regions of the Megaleranthis cp genome. Second, the chloroplast genomes of Megaleranthis and Ranunculus evidence 5.6% sequence divergence in the coding regions, 8.9% sequence divergence in the intron regions, and 18.7% sequence divergence in the intergenic spacer regions, respectively. In both the coding and noncoding regions, average nucleotide substitution rates differed markedly, depending on the genome position. Our data strongly implicate the positional effects of the evolutionary modes of chloroplast genes. The genes evidencing higher levels of base substitutions also have higher incidences of indel mutations and low Ka/Ks ratios. A total of 54 simple sequence repeat loci were identified from the Megaleranthis cp genome. The existence of rich cp SSR loci in the Megaleranthis cp genome provides a rare opportunity to study the population genetic structures of this endangered species. Our phylogenetic trees based on the two independent markers, the nuclear ITS and chloroplast matK sequences, strongly support the inclusion of the Megaleranthis to the Trollius. Therefore, our molecular trees support Ohwi's original treatment of Megaleranthis saniculiforia to Trollius chosenensis Ohwi.
The contribution of alu elements to mutagenic DNA double-strand break repair.
Morales, Maria E; White, Travis B; Streva, Vincent A; DeFreece, Cecily B; Hedges, Dale J; Deininger, Prescott L
2015-03-01
Alu elements make up the largest family of human mobile elements, numbering 1.1 million copies and comprising 11% of the human genome. As a consequence of evolution and genetic drift, Alu elements of various sequence divergence exist throughout the human genome. Alu/Alu recombination has been shown to cause approximately 0.5% of new human genetic diseases and contribute to extensive genomic structural variation. To begin understanding the molecular mechanisms leading to these rearrangements in mammalian cells, we constructed Alu/Alu recombination reporter cell lines containing Alu elements ranging in sequence divergence from 0%-30% that allow detection of both Alu/Alu recombination and large non-homologous end joining (NHEJ) deletions that range from 1.0 to 1.9 kb in size. Introduction of as little as 0.7% sequence divergence between Alu elements resulted in a significant reduction in recombination, which indicates even small degrees of sequence divergence reduce the efficiency of homology-directed DNA double-strand break (DSB) repair. Further reduction in recombination was observed in a sequence divergence-dependent manner for diverged Alu/Alu recombination constructs with up to 10% sequence divergence. With greater levels of sequence divergence (15%-30%), we observed a significant increase in DSB repair due to a shift from Alu/Alu recombination to variable-length NHEJ which removes sequence between the two Alu elements. This increase in NHEJ deletions depends on the presence of Alu sequence homeology (similar but not identical sequences). Analysis of recombination products revealed that Alu/Alu recombination junctions occur more frequently in the first 100 bp of the Alu element within our reporter assay, just as they do in genomic Alu/Alu recombination events. This is the first extensive study characterizing the influence of Alu element sequence divergence on DNA repair, which will inform predictions regarding the effect of Alu element sequence divergence on both the rate and nature of DNA repair events.
Savard, L; Li, P; Strauss, S H; Chase, M W; Michaud, M; Bousquet, J
1994-01-01
We have estimated the time for the last common ancestor of extant seed plants by using molecular clocks constructed from the sequences of the chloroplastic gene coding for the large subunit of ribulose-1,5-bisphosphate carboxylase/oxygenase (rbcL) and the nuclear gene coding for the small subunit of rRNA (Rrn18). Phylogenetic analyses of nucleotide sequences indicated that the earliest divergence of extant seed plants is likely represented by a split between conifer-cycad and angiosperm lineages. Relative-rate tests were used to assess homogeneity of substitution rates among lineages, and annual angiosperms were found to evolve at a faster rate than other taxa for rbcL and, thus, these sequences were excluded from construction of molecular clocks. Five distinct molecular clocks were calibrated using substitution rates for the two genes and four divergence times based on fossil and published molecular clock estimates. The five estimated times for the last common ancestor of extant seed plants were in agreement with one another, with an average of 285 million years and a range of 275-290 million years. This implies a substantially more recent ancestor of all extant seed plants than suggested by some theories of plant evolution. PMID:8197201
Gallie, Daniel R.
2016-01-01
The eukaryotic translation initiation factor (eIF) 4G is required during protein synthesis to promote the assembly of several factors involved in the recruitment of a 40S ribosomal subunit to an mRNA. Although many eukaryotes express two eIF4G isoforms that are highly similar, the eIF4G isoforms in plants, referred to as eIF4G and eIFiso4G, are highly divergent in size, sequence, and domain organization but both can interact with eIF4A, eIF4B, eIF4E isoforms, and the poly(A)-binding protein. Nevertheless, eIF4G and eIFiso4G from wheat exhibit preferences in the mRNAs they translate optimally. For example, mRNA containing the 5′-leader (called Ω) of tobacco mosaic virus preferentially uses eIF4G in wheat germ lysate. In this study, the eIF4G isoform specificity of Ω was used to examine functional differences of the eIF4G isoforms in Arabidopsis. As in wheat, Ω-mediated translation was reduced in an eif4g null mutant. Loss of the eIFiso4G1 isoform, which is similar in sequence to wheat eIFiso4G, did not substantially affect Ω-mediated translation. However, loss of the eIFiso4G2 isoform substantially reduced Ω-mediated translation. eIFiso4G2 is substantially divergent from eIFiso4G1 and is present only in the Brassicaceae, suggesting a recent evolution. eIFiso4G2 isoforms exhibit sequence-specific differences in regions representing partner protein and RNA binding sites. Loss of any eIF4G isoform also resulted in a substantial reduction in reporter transcript level. These results suggest that eIFiso4G2 appeared late in plant evolution and exhibits more functional similarity with eIF4G than with eIFiso4G1 during Ω-mediated translation. PMID:26578519
Medzihradszky, K F; Gibson, B W; Kaur, S; Yu, Z H; Medzihradszky, D; Burlingame, A L; Bass, N M
1992-02-01
The primary structure of a fatty-acid-binding protein (FABP) isolated from the liver of the nurse shark (Ginglymostoma cirratum) was determined by high-performance tandem mass spectrometry (employing multichannel array detection) and Edman degradation. Shark liver FABP consists of 132 amino acids with an acetylated N-terminal valine. The chemical molecular mass of the intact protein determined by electrospray ionization mass spectrometry (Mr = 15124 +/- 2.5) was in good agreement with that calculated from the amino acid sequence (Mr = 15121.3). The amino acid sequence of shark liver FABP displays significantly greater similarity to the FABP expressed in mammalian heart, peripheral nerve myelin and adipose tissue (61-53% sequence similarity) than to the FABP expressed in mammalian liver (22% similarity). Phylogenetic trees derived from the comparison of the shark liver FABP amino acid sequence with the members of the mammalian fatty-acid/retinoid-binding protein gene family indicate the initial divergence of an ancestral gene into two major subfamilies: one comprising the genes for mammalian liver FABP and gastrotropin, the other comprising the genes for mammalian cellular retinol-binding proteins I and II, cellular retinoic-acid-binding protein myelin P2 protein, adipocyte FABP, heart FABP and shark liver FABP, the latter having diverged from the ancestral gene that ultimately gave rise to the present day mammalian heart-FABP, adipocyte FABP and myelin P2 protein sequences. The sequence for intestinal FABP from the rat could be assigned to either subfamily, depending on the approach used for phylogenetic tree construction, but clearly diverged at a relatively early evolutionary time point. Indeed, sequences proximately ancestral or closely related to mammalian intestinal FABP, liver FABP, gastrotropin and the retinoid-binding group of proteins appear to have arisen prior to the divergence of shark liver FABP and should therefore also be present in elasmobranchs. The presence in shark liver of an FABP which differs substantially in primary structure from mammalian liver FABP, while being closely related to the FABP expressed in mammalian heart muscle, peripheral nerve myelin and adipocytes, opens a further dimension regarding the question of the existence of structure-dependent and tissue-specific specialization of FABP function in lipid metabolism.
TRStalker: an efficient heuristic for finding fuzzy tandem repeats.
Pellegrini, Marco; Renda, M Elena; Vecchio, Alessio
2010-06-15
Genomes in higher eukaryotic organisms contain a substantial amount of repeated sequences. Tandem Repeats (TRs) constitute a large class of repetitive sequences that are originated via phenomena such as replication slippage and are characterized by close spatial contiguity. They play an important role in several molecular regulatory mechanisms, and also in several diseases (e.g. in the group of trinucleotide repeat disorders). While for TRs with a low or medium level of divergence the current methods are rather effective, the problem of detecting TRs with higher divergence (fuzzy TRs) is still open. The detection of fuzzy TRs is propaedeutic to enriching our view of their role in regulatory mechanisms and diseases. Fuzzy TRs are also important as tools to shed light on the evolutionary history of the genome, where higher divergence correlates with more remote duplication events. We have developed an algorithm (christened TRStalker) with the aim of detecting efficiently TRs that are hard to detect because of their inherent fuzziness, due to high levels of base substitutions, insertions and deletions. To attain this goal, we developed heuristics to solve a Steiner version of the problem for which the fuzziness is measured with respect to a motif string not necessarily present in the input string. This problem is akin to the 'generalized median string' that is known to be an NP-hard problem. Experiments with both synthetic and biological sequences demonstrate that our method performs better than current state of the art for fuzzy TRs and that the fuzzy TRs of the type we detect are indeed present in important biological sequences. TRStalker will be integrated in the web-based TRs Discovery Service (TReaDS) at bioalgo.iit.cnr.it. Supplementary data are available at Bioinformatics online.
Drevecky, C J; Falco, R; Aguirre, W E
2013-07-01
The genetic relationship between sympatric, morphologically divergent populations of anadromous and lake-resident three-spined stickleback Gasterosteus aculeatus in the Jim Creek drainage of Cook Inlet, Alaska, was examined using microsatellite loci and mitochondrial d-loop sequence data. Resident samples differed substantially from sympatric anadromous samples in the Jim Creek drainage with the magnitude of the genetic divergence being similar to that between allopatric resident and anadromous populations in other areas. Resident samples were genetically similar within the Jim Creek drainage, as were the anadromous samples surveyed. Neighbour-joining and Structure cluster analysis grouped the samples into four genetic clusters by ecomorph (anadromous v. all resident) and geographic location of the resident samples (Jim Creek, Mat-Su and Kenai). There was no evidence of hybridization between resident and anadromous G. aculeatus in the Jim Creek drainage, which thus appear to be reproductively isolated. © 2013 The Authors. Journal of Fish Biology © 2013 The Fisheries Society of the British Isles.
A longitudinal analysis of the relationship between fertility timing and schooling.
Stange, Kevin
2011-08-01
This article quantifies the contribution of pre-treatment dynamic selection to the relationship between fertility timing and postsecondary attainment, after controlling for a rich set of predetermined characteristics. Eventual mothers and nonmothers are matched using their predicted birth hazard rate, which shares the desirable properties of a propensity score but in a multivalued treatment setting. I find that eventual mothers and matched nonmothers enter college at the same rate, but their educational paths diverge well before the former become pregnant. This pre-pregnancy divergence creates substantial differences in ultimate educational attainment that cannot possibly be due to the childbirth itself. Controls for predetermined characteristics and fixed effects do not address this form of dynamic selection bias. A dynamic model of the simultaneous childbirth-education sequencing decision is necessary to address it.
Sequence space and the ongoing expansion of the protein universe.
Povolotskaya, Inna S; Kondrashov, Fyodor A
2010-06-17
The need to maintain the structural and functional integrity of an evolving protein severely restricts the repertoire of acceptable amino-acid substitutions. However, it is not known whether these restrictions impose a global limit on how far homologous protein sequences can diverge from each other. Here we explore the limits of protein evolution using sequence divergence data. We formulate a computational approach to study the rate of divergence of distant protein sequences and measure this rate for ancient proteins, those that were present in the last universal common ancestor. We show that ancient proteins are still diverging from each other, indicating an ongoing expansion of the protein sequence universe. The slow rate of this divergence is imposed by the sparseness of functional protein sequences in sequence space and the ruggedness of the protein fitness landscape: approximately 98 per cent of sites cannot accept an amino-acid substitution at any given moment but a vast majority of all sites may eventually be permitted to evolve when other, compensatory, changes occur. Thus, approximately 3.5 x 10(9) yr has not been enough to reach the limit of divergent evolution of proteins, and for most proteins the limit of sequence similarity imposed by common function may not exceed that of random sequences.
Patterns of Positive Selection of the Myogenic Regulatory Factor Gene Family in Vertebrates
Zhao, Xiao; Yu, Qi; Huang, Ling; Liu, Qing-Xin
2014-01-01
The functional divergence of transcriptional factors is critical in the evolution of transcriptional regulation. However, the mechanism of functional divergence among these factors remains unclear. Here, we performed an evolutionary analysis for positive selection in members of the myogenic regulatory factor (MRF) gene family of vertebrates. We selected 153 complete vertebrate MRF nucleotide sequences from our analyses, which revealed substantial evidence of positive selection. Here, we show that sites under positive selection were more frequently detected and identified from the genes encoding the myogenic differentiation factors (MyoG and Myf6) than the genes encoding myogenic determination factors (Myf5 and MyoD). Additionally, the functional divergence within the myogenic determination factors or differentiation factors was also under positive selection pressure. The positive selection sites were more frequently detected from MyoG and MyoD than Myf6 and Myf5, respectively. Amino acid residues under positive selection were identified mainly in their transcription activation domains and on the surface of protein three-dimensional structures. These data suggest that the functional gain and divergence of myogenic regulatory factors were driven by distinct positive selection of their transcription activation domains, whereas the function of the DNA binding domains was conserved in evolution. Our study evaluated the mechanism of functional divergence of the transcriptional regulation factors within a family, whereby the functions of their transcription activation domains diverged under positive selection during evolution. PMID:24651579
Resolution of the African hominoid trichotomy by use of a mitochondrial gene sequence
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ruvolo, M.; Disotell, T.R.; Allard, M.W.
1991-02-15
Mitochondrial DNA sequences encoding the cytochrome oxidase subunit II gene have been determined for five primate species, siamang (Hylobates syndactylus), lowland gorilla (Gorilla gorilla), pygmy chimpanzee (Pan paniscus), crab-eating macaque (Macaca fascicularis), and green monkey (Cercopithecus aethiops), and compared with published sequences of other primate and nonprimate species. Comparisons of cytochrome oxidase subunit II gene sequences provide clear-cut evidence from the mitochondrial genome for the separation of the African ape trichotomy into two evolutionary lineages, one leading to gorillas and the other to humans and chimpanzees. Several different tree-building methods support this same phylogenetic tree topology. The comparisons also yieldmore » trees in which a substantial length separates the divergence point of gorillas from that of humans and chimpanzees, suggesting that the lineage most immediately ancestral to humans and chimpanzees may have been in existence for a relatively long time.« less
Tzika, Athanasia C; Helaers, Raphaël; Schramm, Gerrit; Milinkovitch, Michel C
2011-09-26
Reptiles are largely under-represented in comparative genomics despite the fact that they are substantially more diverse in many respects than mammals. Given the high divergence of reptiles from classical model species, next-generation sequencing of their transcriptomes is an approach of choice for gene identification and annotation. Here, we use 454 technology to sequence the brain transcriptome of four divergent reptilian and one reference avian species: the Nile crocodile, the corn snake, the bearded dragon, the red-eared turtle, and the chicken. Using an in-house pipeline for recursive similarity searches of >3,000,000 reads against multiple databases from 7 reference vertebrates, we compile a reptilian comparative transcriptomics dataset, with homology assignment for 20,000 to 31,000 transcripts per species and a cumulated non-redundant sequence length of 248.6 Mbases. Our approach identifies the majority (87%) of chicken brain transcripts and about 50% of de novo assembled reptilian transcripts. In addition to 57,502 microsatellite loci, we identify thousands of SNP and indel polymorphisms for population genetic and linkage analyses. We also build very large multiple alignments for Sauropsida and mammals (two million residues per species) and perform extensive phylogenetic analyses suggesting that turtles are not basal living reptiles but are rather associated with Archosaurians, hence, potentially answering a long-standing question in the phylogeny of Amniotes. The reptilian transcriptome (freely available at http://www.reptilian-transcriptomes.org) should prove a useful new resource as reptiles are becoming important new models for comparative genomics, ecology, and evolutionary developmental genetics.
Shakya, Akhalesh Kumar; Shukla, Vibha; Maan, Harjeet Singh; Dhole, Tapan N
2012-10-16
Genetic analysis of measles viruses associated with recent cases and outbreaks has proven to bridge information gaps in routine outbreak investigations and has made a substantial contribution to measles control efforts by helping to identify the transmission pathways of the virus. The present study describes the genetic characterization of wild type measles viruses from Uttar Pradesh, India isolated between January 2008 and January 2011. In the study, 526 suspected measles cases from 15 outbreaks were investigated. Blood samples were collected from suspected measles outbreaks and tested for the presence of measles specific IgM; throat swab and urine samples were collected for virus isolation and RT-PCR. Genotyping of circulating measles viruses in Uttar Pradesh was performed by sequencing a 450-bp region encompassing the nucleoprotein hypervariable region and phylogenetic analysis. Based on serological results, all the outbreaks were confirmed as measles. Thirty eight strains were obtained. Genetic analysis of circulating measles strains (n = 38) in Uttar Pradesh from 235 cases of laboratory-confirmed cases from 526 suspected measles cases between 2008 and 2011 showed that all viruses responsible for outbreaks were within clade D and all were genotype D8.Analysis of this region showed that it is highly divergent (up to 3.4% divergence in the nucleotide sequence and 4.1% divergence in the amino acid sequence between most distant strains). Considerable genetic heterogeneity was observed in the MV genotype D8 viruses in North India and underscores the need for continued surveillance and in particular increases in vaccination levels to decrease morbidity and mortality attributable to measles.
Zehra, Rabail; Abbasi, Amir Ali
2018-03-01
Empirical assessments of human accelerated noncoding DNA frgaments have delineated presence of many cis-regulatory elements. Enhancers make up an important category of such accelerated cis-regulatory elements that efficiently control the spatiotemporal expression of many developmental genes. Establishing plausible reasons for accelerated enhancer sequence divergence in Homo sapiens has been termed significant in various previously published studies. This acceleration by including closely related primates and archaic human data has the potential to open up evolutionary avenues for deducing present-day brain structure. This study relied on empirically confirmed brain exclusive enhancers to avoid any misjudgments about their regulatory status and categorized among them a subset of enhancers with an exceptionally accelerated rate of lineage specific divergence in humans. In this assorted set, 13 distinct transcription factor binding sites were located that possessed unique existence in humans. Three of 13 such sites belonging to transcription factors SOX2, RUNX1/3, and FOS/JUND possessed single nucleotide variants that made them unique to H. sapiens upon comparisons with Neandertal and Denisovan orthologous sequences. These variants modifying the binding sites in modern human lineage were further substantiated as single nucleotide polymorphisms via exploiting 1000 Genomes Project Phase3 data. Long range haplotype based tests laid out evidence of positive selection to be governing in African population on two of the modern human motif modifying alleles with strongest results for SOX2 binding site. In sum, our study acknowledges acceleration in noncoding regulatory landscape of the genome and highlights functional parts within it to have undergone accelerated divergence in present-day human population.
Conceptual issues in Bayesian divergence time estimation
2016-01-01
Bayesian inference of species divergence times is an unusual statistical problem, because the divergence time parameters are not identifiable unless both fossil calibrations and sequence data are available. Commonly used marginal priors on divergence times derived from fossil calibrations may conflict with node order on the phylogenetic tree causing a change in the prior on divergence times for a particular topology. Care should be taken to avoid confusing this effect with changes due to informative sequence data. This effect is illustrated with examples. A topology-consistent prior that preserves the marginal priors is defined and examples are constructed. Conflicts between fossil calibrations and relative branch lengths (based on sequence data) can cause estimates of divergence times that are grossly incorrect, yet have a narrow posterior distribution. An example of this effect is given; it is recommended that overly narrow posterior distributions of divergence times should be carefully scrutinized. This article is part of the themed issue ‘Dating species divergences using rocks and clocks’. PMID:27325831
Conceptual issues in Bayesian divergence time estimation.
Rannala, Bruce
2016-07-19
Bayesian inference of species divergence times is an unusual statistical problem, because the divergence time parameters are not identifiable unless both fossil calibrations and sequence data are available. Commonly used marginal priors on divergence times derived from fossil calibrations may conflict with node order on the phylogenetic tree causing a change in the prior on divergence times for a particular topology. Care should be taken to avoid confusing this effect with changes due to informative sequence data. This effect is illustrated with examples. A topology-consistent prior that preserves the marginal priors is defined and examples are constructed. Conflicts between fossil calibrations and relative branch lengths (based on sequence data) can cause estimates of divergence times that are grossly incorrect, yet have a narrow posterior distribution. An example of this effect is given; it is recommended that overly narrow posterior distributions of divergence times should be carefully scrutinized.This article is part of the themed issue 'Dating species divergences using rocks and clocks'. © 2016 The Author(s).
Screening of duplicated loci reveals hidden divergence patterns in a complex salmonid genome
Limborg, Morten T.; Larson, Wesley; Seeb, Lisa W.; Seeb, James E.
2017-01-01
A whole-genome duplication (WGD) doubles the entire genomic content of a species and is thought to have catalysed adaptive radiation in some polyploid-origin lineages. However, little is known about general consequences of a WGD because gene duplicates (i.e., paralogs) are commonly filtered in genomic studies; such filtering may remove substantial portions of the genome in data sets from polyploid-origin species. We demonstrate a new method that enables genome-wide scans for signatures of selection at both nonduplicated and duplicated loci by taking locus-specific copy number into account. We apply this method to RAD sequence data from different ecotypes of a polyploid-origin salmonid (Oncorhynchus nerka) and reveal signatures of divergent selection that would have been missed if duplicated loci were filtered. We also find conserved signatures of elevated divergence at pairs of homeologous chromosomes with residual tetrasomic inheritance, suggesting that joint evolution of some nondiverged gene duplicates may affect the adaptive potential of these genes. These findings illustrate that including duplicated loci in genomic analyses enables novel insights into the evolutionary consequences of WGDs and local segmental gene duplications.
2011-01-01
Background Reptiles are largely under-represented in comparative genomics despite the fact that they are substantially more diverse in many respects than mammals. Given the high divergence of reptiles from classical model species, next-generation sequencing of their transcriptomes is an approach of choice for gene identification and annotation. Results Here, we use 454 technology to sequence the brain transcriptome of four divergent reptilian and one reference avian species: the Nile crocodile, the corn snake, the bearded dragon, the red-eared turtle, and the chicken. Using an in-house pipeline for recursive similarity searches of >3,000,000 reads against multiple databases from 7 reference vertebrates, we compile a reptilian comparative transcriptomics dataset, with homology assignment for 20,000 to 31,000 transcripts per species and a cumulated non-redundant sequence length of 248.6 Mbases. Our approach identifies the majority (87%) of chicken brain transcripts and about 50% of de novo assembled reptilian transcripts. In addition to 57,502 microsatellite loci, we identify thousands of SNP and indel polymorphisms for population genetic and linkage analyses. We also build very large multiple alignments for Sauropsida and mammals (two million residues per species) and perform extensive phylogenetic analyses suggesting that turtles are not basal living reptiles but are rather associated with Archosaurians, hence, potentially answering a long-standing question in the phylogeny of Amniotes. Conclusions The reptilian transcriptome (freely available at http://www.reptilian-transcriptomes.org) should prove a useful new resource as reptiles are becoming important new models for comparative genomics, ecology, and evolutionary developmental genetics. PMID:21943375
Planes, S; Doherty, P J; Bernardi, G
2001-11-11
Acanthochromis polyacanthus is an unusual tropical marine damselfish that uniquely lacks pelagic larvae and has lost the capacity for broad-scale dispersal among coral reefs. On the modern Great Barrier Reef (GBR), three color morphs meet and hydridize at two zones of secondary contact. Allozyme electrophoreses revealed strong differences between morphs from the southern zone but few differences between morphs from the northern counterpart, thus suggesting different contact histories. We explore the phylogeography of Acanthochromis polyacanthus with mitochondrial cytochrome b region sequences (alignment of 565 positions) obtained from 126 individuals representing seven to 12 fish from 13 sites distributed over 12 reefs of the GBR and the Coral Sea. The samples revealed three major clades: (1) black fish collected from the southern GBR; (2) bicolored fish collected from the GBR and one reef (Osprey) from the northern Coral Sea; (3) black and white monomorphs collected from six reefs in the Coral Sea. All three clades were well supported (72-100%) by bootstrap analyses. Sequence divergences were very high between the major clades (mean = 7.6%) as well as within them (2.0-3.6%). Within clades, most reefs segregated as monophyletic assemblages. This was revealed both by phylogenetic analyses and AMOVAs that showed that 72-90% of the variance originated from differences among groups, whereas only 5-13% originated within populations. These patterns are discussed in relation to the known geological history of coral reefs of the GBR and the Coral Sea. Finally, we ask whether the monospecific status of Acanthochromis should be revisited because the sequence divergences found among our samples is substantially greater than those recorded among well-recognized species in other reef fishes.
Kim, Min Jee; Hong, Eui Jeong; Kim, Iksoo
2016-01-01
We sequenced the complete mitochondrial (mt) genome of Camponotus atrox (Hymenoptera: Formicidae), which is only distributed in Korea. The genome was 16 540 bp in size and contained typical sets of genes (13 protein-coding genes, 22 tRNAs, and 2 rRNAs). The C. atrox A+T-rich region, at 1402 bp, was the longest of all sequenced ant genomes and was composed of an identical tandem repeat consisting of six 100-bp copies and one 96-bp copy. A total of 315 bp of intergenic spacer sequence was spread over 23 regions. An alignment of the spacer sequences in ants was largely feasible among congeneric species, and there was substantial sequence divergence, indicating their potential use as molecular markers for congeneric species. The A/T contents at the first and second codon positions of protein-coding genes (PCGs) were similar for ant species, including C. atrox (73.9% vs. 72.3%, on average). With increased taxon sampling among hymenopteran superfamilies, differences in the divergence rates (i.e., the non-synonymous substitution rates) between the suborders Symphyta and Apocrita were detected, consistent with previous results. The C. atrox mt genome had a unique gene arrangement, trnI-trnM-trnQ, at the A+T-rich region and ND2 junction (underline indicates inverted gene). This may have originated from a tandem duplication of trnM-trnI, resulting in trnM-trnI-trnM-trnI-trnQ, and the subsequent loss of the first trnM and second trnI, resulting in trnI-trnM-trnQ.
Foster, Charles S P; Henwood, Murray J; Ho, Simon Y W
2018-05-25
Data sets comprising small numbers of genetic markers are not always able to resolve phylogenetic relationships. This has frequently been the case in molecular systematic studies of plants, with many analyses being based on sequence data from only two or three chloroplast genes. An example of this comes from the riceflowers Pimelea Banks & Sol. ex Gaertn. (Thymelaeaceae), a large genus of flowering plants predominantly distributed in Australia. Despite the considerable morphological variation in the genus, low sequence divergence in chloroplast markers has led to the phylogeny of Pimelea remaining largely uncertain. In this study, we resolve the backbone of the phylogeny of Pimelea in comprehensive Bayesian and maximum-likelihood analyses of plastome sequences from 41 taxa. However, some relationships received only moderate to poor support, and the Pimelea clade contained extremely short internal branches. By using topology-clustering analyses, we demonstrate that conflicting phylogenetic signals can be found across the trees estimated from individual chloroplast protein-coding genes. A relaxed-clock dating analysis reveals that Pimelea arose in the mid-Miocene, with most divergences within the genus occurring during a subsequent rapid diversification. Our new phylogenetic estimate offers better resolution and is more strongly supported than previous estimates, providing a platform for future taxonomic revisions of both Pimelea and the broader subfamily. Our study has demonstrated the substantial improvements in phylogenetic resolution that can be achieved using plastome-scale data sets in plant molecular systematics. Copyright © 2018 Elsevier Inc. All rights reserved.
Lauber, Chris
2012-01-01
The recent advent of genome sequences as the only source available to classify many newly discovered viruses challenges the development of virus taxonomy by expert virologists who traditionally rely on extensive virus characterization. In this proof-of-principle study, we address this issue by presenting a computational approach (DEmARC) to classify viruses of a family into groups at hierarchical levels using a sole criterion—intervirus genetic divergence. To quantify genetic divergence, we used pairwise evolutionary distances (PEDs) estimated by maximum likelihood inference on a multiple alignment of family-wide conserved proteins. PEDs were calculated for all virus pairs, and the resulting distribution was modeled via a mixture of probability density functions. The model enables the quantitative inference of regions of distance discontinuity in the family-wide PED distribution, which define the levels of hierarchy. For each level, a limit on genetic divergence, below which two viruses join the same group, was objectively selected among a set of candidates by minimizing violations of intragroup PEDs to the limit. In a case study, we applied the procedure to hundreds of genome sequences of picornaviruses and extensively evaluated it by modulating four key parameters. It was found that the genetics-based classification largely tolerates variations in virus sampling and multiple alignment construction but is affected by the choice of protein and the measure of genetic divergence. In an accompanying paper (C. Lauber and A. E. Gorbalenya, J. Virol. 86:3905–3915, 2012), we analyze the substantial insight gained with the genetics-based classification approach by comparing it with the expert-based picornavirus taxonomy. PMID:22278230
Daniels, Savel R
2011-11-01
The endemic, monotypic freshwater crab species Seychellum alluaudi was used as a template to examine the initial colonisation and evolutionary history among the major islands in the Seychelles Archipelago. Five of the "inner" islands in the Seychelles Archipelago including Mahé, Praslin, Silhouette, La Digue and Frégate were sampled. Two partial mtDNA fragments, 16S rRNA and cytochrome oxidase subunit I (COI) was sequenced for 83 specimens of S. alluaudi. Evolutionary relationships between populations were inferred from the combined mtDNA dataset using maximum parsimony, maximum likelihood and Bayesian inferences. Analyses of molecular variance (AMOVA) were used to examine genetic variation among and within clades. A haplotype network was constructed using TCS while BEAST was employed to date the colonisation and divergence of lineages on the islands. Phylogenetic analyses of the combined mtDNA data set of 1103 base pairs retrieved a monophyletic S. alluaudi group comprised three statistically well-supported monophyletic clades. Clade one was exclusive to Silhouette; clade two included samples from Praslin sister to La Digue, while clade three comprised samples from Mahé sister to Frégate. The haplotype network corresponded to the three clades. Within Mahé, substantial phylogeographic substructure was evident. AMOVA results revealed limited genetic variation within localities with most variation occurring among localities. Divergence time estimations predated the Holocene sea level regressions and indicated a Pliocene/Pleistocene divergence between the three clades evident within S. alluaudi. The monophyly of each clade suggests that transoceanic dispersal is rare. The absence of shared haplotypes between the three clades, coupled with marked sequence divergence values suggests the presence of three allospecies within S. alluaudi. Copyright © 2011 Elsevier Inc. All rights reserved.
Patterns of DNA barcode variation in Canadian marine molluscs.
Layton, Kara K S; Martel, André L; Hebert, Paul D N
2014-01-01
Molluscs are the most diverse marine phylum and this high diversity has resulted in considerable taxonomic problems. Because the number of species in Canadian oceans remains uncertain, there is a need to incorporate molecular methods into species identifications. A 648 base pair segment of the cytochrome c oxidase subunit I gene has proven useful for the identification and discovery of species in many animal lineages. While the utility of DNA barcoding in molluscs has been demonstrated in other studies, this is the first effort to construct a DNA barcode registry for marine molluscs across such a large geographic area. This study examines patterns of DNA barcode variation in 227 species of Canadian marine molluscs. Intraspecific sequence divergences ranged from 0-26.4% and a barcode gap existed for most taxa. Eleven cases of relatively deep (>2%) intraspecific divergence were detected, suggesting the possible presence of overlooked species. Structural variation was detected in COI with indels found in 37 species, mostly bivalves. Some indels were present in divergent lineages, primarily in the region of the first external loop, suggesting certain areas are hotspots for change. Lastly, mean GC content varied substantially among orders (24.5%-46.5%), and showed a significant positive correlation with nearest neighbour distances. DNA barcoding is an effective tool for the identification of Canadian marine molluscs and for revealing possible cases of overlooked species. Some species with deep intraspecific divergence showed a biogeographic partition between lineages on the Atlantic, Arctic and Pacific coasts, suggesting the role of Pleistocene glaciations in the subdivision of their populations. Indels were prevalent in the barcode region of the COI gene in bivalves and gastropods. This study highlights the efficacy of DNA barcoding for providing insights into sequence variation across a broad taxonomic group on a large geographic scale.
Kim, Shin-Hee; Nayak, Subhashree; Paldurai, Anandan; Nayak, Baibaswata; Samuel, Arthur; Aplogan, Gilbert L.; Awoume, Kodzo A.; Webby, Richard J.; Ducatez, Mariette F.; Collins, Peter L.
2012-01-01
The complete genome sequence of an African Newcastle disease virus (NDV) strain isolated from a chicken in Togo in 2009 was determined. The genome is 15,198 nucleotides (nt) in length and is classified in genotype VII in the class II cluster. Compared to common vaccine strains, the African strain contains a previously described 6-nt insert in the downstream untranslated region of the N gene and a novel 6-nt insert in the HN-L intergenic region. Genome length differences are a marker of the natural history of NDV. This is the first description of a class II NDV strain with a genome of 15,198 nt and a 6-nt insert in the HN-L intergenic region. Sequence divergence relative to vaccine strains was substantial, likely contributes to outbreaks, and illustrates the continued evolution of new NDV strains in West Africa. PMID:22997417
Divergent clonal selection dominates medulloblastoma at recurrence
Morrissy, A. Sorana; Garzia, Livia; Shih, David J. H.; Zuyderduyn, Scott; Huang, Xi; Skowron, Patryk; Remke, Marc; Cavalli, Florence M. G.; Ramaswamy, Vijay; Lindsay, Patricia E.; Jelveh, Salomeh; Donovan, Laura K.; Wang, Xin; Luu, Betty; Zayne, Kory; Li, Yisu; Mayoh, Chelsea; Thiessen, Nina; Mercier, Eloi; Mungall, Karen L.; Ma, Yusanne; Tse, Kane; Zeng, Thomas; Shumansky, Karey; Roth, Andrew J. L.; Shah, Sohrab; Farooq, Hamza; Kijima, Noriyuki; Holgado, Borja L.; Lee, John J. Y.; Matan-Lithwick, Stuart; Liu, Jessica; Mack, Stephen C.; Manno, Alex; Michealraj, K. A.; Nor, Carolina; Peacock, John; Qin, Lei; Reimand, Juri; Rolider, Adi; Thompson, Yuan Y.; Wu, Xiaochong; Pugh, Trevor; Ally, Adrian; Bilenky, Mikhail; Butterfield, Yaron S. N.; Carlsen, Rebecca; Cheng, Young; Chuah, Eric; Corbett, Richard D.; Dhalla, Noreen; He, An; Lee, Darlene; Li, Haiyan I.; Long, William; Mayo, Michael; Plettner, Patrick; Qian, Jenny Q.; Schein, Jacqueline E.; Tam, Angela; Wong, Tina; Birol, Inanc; Zhao, Yongjun; Faria, Claudia C.; Pimentel, José; Nunes, Sofia; Shalaby, Tarek; Grotzer, Michael; Pollack, Ian F.; Hamilton, Ronald L.; Li, Xiao-Nan; Bendel, Anne E.; Fults, Daniel W.; Walter, Andrew W.; Kumabe, Toshihiro; Tominaga, Teiji; Collins, V. Peter; Cho, Yoon-Jae; Hoffman, Caitlin; Lyden, David; Wisoff, Jeffrey H.; Garvin, James H.; Stearns, Duncan S.; Massimi, Luca; Schüller, Ulrich; Sterba, Jaroslav; Zitterbart, Karel; Puget, Stephanie; Ayrault, Olivier; Dunn, Sandra E.; Tirapelli, Daniela P. C.; Carlotti, Carlos G.; Wheeler, Helen; Hallahan, Andrew R.; Ingram, Wendy; MacDonald, Tobey J.; Olson, Jeffrey J.; Van Meir, Erwin G.; Lee, Ji-Yeoun; Wang, Kyu-Chang; Kim, Seung-Ki; Cho, Byung-Kyu; Pietsch, Torsten; Fleischhack, Gudrun; Tippelt, Stephan; Ra, Young Shin; Bailey, Simon; Lindsey, Janet C.; Clifford, Steven C.; Eberhart, Charles G.; Cooper, Michael K.; Packer, Roger J.; Massimino, Maura; Garre, Maria Luisa; Bartels, Ute; Tabori, Uri; Hawkins, Cynthia E.; Dirks, Peter; Bouffet, Eric; Rutka, James T.; Wechsler-Reya, Robert J.; Weiss, William A.; Collier, Lara S.; Dupuy, Adam J.; Korshunov, Andrey; Jones, David T. W.; Kool, Marcel; Northcott, Paul A.; Pfister, Stefan M.; Largaespada, David A.; Mungall, Andrew J.; Moore, Richard A.; Jabado, Nada; Bader, Gary D.; Jones, Steven J. M.; Malkin, David; Marra, Marco A.; Taylor, Michael D.
2016-01-01
The development of targeted anti-cancer therapies through the study of cancer genomes is intended to increase survival rates and decrease treatment-related toxicity. We treated a transposon–driven, functional genomic mouse model of medulloblastoma with ‘humanized’ in vivo therapy (microneurosurgical tumour resection followed by multi-fractionated, image-guided radiotherapy). Genetic events in recurrent murine medulloblastoma exhibit a very poor overlap with those in matched murine diagnostic samples (<5%). Whole-genome sequencing of 33 pairs of human diagnostic and post-therapy medulloblastomas demonstrated substantial genetic divergence of the dominant clone after therapy (<12% diagnostic events were retained at recurrence). In both mice and humans, the dominant clone at recurrence arose through clonal selection of a pre-existing minor clone present at diagnosis. Targeted therapy is unlikely to be effective in the absence of the target, therefore our results offer a simple, proximal, and remediable explanation for the failure of prior clinical trials of targeted therapy. PMID:26760213
Landry, Jean-François; Hebert, Paul DN
2013-01-01
Abstract The genus Plutella was thought to be represented in Australia by a single introduced species, Plutella xylostella (Linnaeus), the diamondback moth. Its status as a major pest of cruciferous crops, and the difficulty in developing control strategies has motivated broad-ranging studies on its biology. Prior genetic work has generally supported the conclusion that populations of this migratory species are connected by substantial gene flow. However, the present study reveals the presence of two genetically divergent lineages of this taxonin Australia. One shows close genetic and morphological similarity with the nearly cosmopolitan Plutella xylostella. The second lineage possesses a similar external morphology, but marked sequence divergence in the barcode region of the cytochrome c oxidase I gene, coupled with clear differences in genitalia. As a consequence, members of this lineage are described as a new species, Plutella australiana Landry & Hebert, which is broadly distributed in the eastern half of Australia. PMID:24167421
Landry, Jean-François; Hebert, Paul Dn
2013-01-01
The genus Plutella was thought to be represented in Australia by a single introduced species, Plutella xylostella (Linnaeus), the diamondback moth. Its status as a major pest of cruciferous crops, and the difficulty in developing control strategies has motivated broad-ranging studies on its biology. Prior genetic work has generally supported the conclusion that populations of this migratory species are connected by substantial gene flow. However, the present study reveals the presence of two genetically divergent lineages of this taxonin Australia. One shows close genetic and morphological similarity with the nearly cosmopolitan Plutella xylostella. The second lineage possesses a similar external morphology, but marked sequence divergence in the barcode region of the cytochrome c oxidase I gene, coupled with clear differences in genitalia. As a consequence, members of this lineage are described as a new species, Plutella australiana Landry & Hebert, which is broadly distributed in the eastern half of Australia.
Chen, Chao; Wang, Huihua; Liu, Zhiguang; Chen, Xiao; Tang, Jiao; Meng, Fanming; Shi, Wei
2018-06-20
The mechanisms by which organisms adapt to variable environments are a fundamental question in evolutionary biology and are important to protect important species in response to a changing climate. An interesting candidate to study this question is the honey bee Apis cerana, a keystone pollinator with a wide distribution throughout a large variety of climates, that exhibits rapid dispersal. Here, we re-sequenced the genome of 180 A. cerana individuals from eighteen populations throughout China. Using a population genomics approach, we observed considerable genetic variation in A. cerana. Patterns of genetic differentiation indicate high divergence at the subspecies level, and physical barriers rather than distance are the driving force for population divergence. Estimations of divergence time suggested that the main branches diverged between 300 and 500 ka. Analyses of the population history revealed a substantial influence of the Earth's climate on the effective population size of A. cerana, as increased population sizes were observed during warmer periods. Further analyses identified candidate genes under natural selection that are potentially related to honey bee cognition, temperature adaptation, and olfactory. Based on our results, A. cerana may have great potential in response to climate change. Our study provides fundamental knowledge of the evolution and adaptation of A. cerana.
Xenopus in Space and Time: Fossils, Node Calibrations, Tip-Dating, and Paleobiogeography.
Cannatella, David
2015-01-01
Published data from DNA sequences, morphology of 11 extant and 15 extinct frog taxa, and stratigraphic ranges of fossils were integrated to open a window into the deep-time evolution of Xenopus. The ages and morphological characters of fossils were used as independent datasets to calibrate a chronogram. We found that DNA sequences, either alone or in combination with morphological data and fossils, tended to support a close relationship between Xenopus and Hymenochirus, although in some analyses this topology was not significantly better than the Pipa + Hymenochirus topology. Analyses that excluded DNA data found strong support for the Pipa + Hymenochirus tree. The criterion for selecting the maximum age of the calibration prior influenced the age estimates, and our age estimates of early divergences in the tree of frogs are substantially younger than those of published studies. Node-dating and tip-dating calibrations, either alone or in combination, yielded older dates for nodes than did a root calibration alone. Our estimates of divergence times indicate that overwater dispersal, rather than vicariance due to the splitting of Africa and South America, may explain the presence of Xenopus in Africa and its closest fossil relatives in South America.
Goicoechea, P G; Herrán, A; Durand, J; Bodénès, C; Plomion, C; Kremer, A
2015-01-01
We analyzed the genetic mosaic of speciation in two hybridizing Mediterranean white oaks from the Iberian Peninsula (Quercus faginea Lamb. and Quercus pyrenaica Willd.). The two species show ecological divergence in flowering phenology, leaf morphology and composition, and in their basic or acidic soil preferences. Ninety expressed sequence tag-simple sequence repeats (EST-SSRs) and eight nuclear SSRs were genotyped in 96 trees from each species. Genotyping was designed in two steps. First, we used 69 markers evenly distributed over the 12 linkage groups (LGs) of the oak linkage map to confirm the species genetic identity of the sampled genotypes, and searched for differentiation outliers. Then, we genotyped 29 additional markers from the chromosome bins containing the outliers and repeated the multilocus scans. We found one or two additional outliers within four saturated bins, thus confirming that outliers are organized into clusters. Linkage disequilibrium (LD) was extensive; even for loosely linked and for independent markers. Consequently, score tests for association between two-marker haplotypes and the ‘species trait' showed a broad genomic divergence, although substantial variation across the genome and within LGs was also observed. We discuss the influence of several confounding effects on neutrality tests and review the evolutionary processes leading to extensive LD. Finally, we examine how LD analyses within regions that contain outlier clusters and quantitative trait loci can help to identify regions of divergence and/or genomic hitchhiking in the light of predictions from ecological speciation theory. PMID:25515016
Camunas-Soler, Joan; Kertesz, Michael; De Vlaminck, Iwijn; Koh, Winston; Pan, Wenying; Martin, Lance; Neff, Norma F.; Okamoto, Jennifer; Wong, Ronald J.; Kharbanda, Sandhya; El-Sayed, Yasser; Blumenfeld, Yair; Stevenson, David K.; Shaw, Gary M.; Wolfe, Nathan D.; Quake, Stephen R.
2017-01-01
Blood circulates throughout the human body and contains molecules drawn from virtually every tissue, including the microbes and viruses which colonize the body. Through massive shotgun sequencing of circulating cell-free DNA from the blood, we identified hundreds of new bacteria and viruses which represent previously unidentified members of the human microbiome. Analyzing cumulative sequence data from 1,351 blood samples collected from 188 patients enabled us to assemble 7,190 contiguous regions (contigs) larger than 1 kbp, of which 3,761 are novel with little or no sequence homology in any existing databases. The vast majority of these novel contigs possess coding sequences, and we have validated their existence both by finding their presence in independent experiments and by performing direct PCR amplification. When their nearest neighbors are located in the tree of life, many of the organisms represent entirely novel taxa, showing that microbial diversity within the human body is substantially broader than previously appreciated. PMID:28830999
Platt, Roy N.; Amman, Brian R.; Keith, Megan S.; Thompson, Cody W.; Bradley, Robert D.
2015-01-01
The evolutionary relationships between Peromyscus, Habromys, Isthmomys, Megadontomys, Neotomodon, Osgoodomys, and Podomys are poorly understood. In order to further explore the evolutionary boundaries of Peromyscus and compare potential taxonomic solutions for this diverse group and its relatives, we conducted phylogenetic analyses of DNA sequence data from alcohol dehydrogenase (Adh1-I2), beta fibrinogen (Fgb-I7), interphotoreceptor retinoid-binding protein (Rbp3), and cytochrome-b (Cytb). Phylogenetic analyses of mitochondrial and nuclear genes produced similar topologies although levels of nodal support varied. The best-supported topology was obtained by combining nuclear and mitochondrial sequences. No monophyletic Peromyscus clade was supported. Instead, support was found for a clade containing Habromys, Megadontomys, Neotomodon, Osgoodomys, Podomys, and Peromyscus suggesting paraphyly of Peromyscus and confirming previous observations. Our analyses indicated an early divergence of Isthmomys from Peromyscus (approximately 8 million years ago), whereas most other peromyscine taxa emerged within the last 6 million years. To recover a monophyletic taxonomy from Peromyscus and affiliated lineages, we detail 3 taxonomic options in which Habromys, Megadontomys, Neotomodon, Osgoodomys, and Podomys are retained as genera, subsumed as subgenera, or subsumed as species groups within Peromyscus. Each option presents distinct taxonomic challenges, and the appropriate taxonomy must reflect the substantial levels of morphological divergence that characterize this group while maintaining the monophyletic relationships obtained from genetic data. PMID:26937047
Platt, Roy N; Amman, Brian R; Keith, Megan S; Thompson, Cody W; Bradley, Robert D
2015-08-03
The evolutionary relationships between Peromyscus , Habromys , Isthmomys , Megadontomys , Neotomodon , Osgoodomys , and Podomys are poorly understood. In order to further explore the evolutionary boundaries of Peromyscus and compare potential taxonomic solutions for this diverse group and its relatives, we conducted phylogenetic analyses of DNA sequence data from alcohol dehydrogenase ( Adh 1-I2), beta fibrinogen ( Fgb -I7), interphotoreceptor retinoid-binding protein ( Rbp 3), and cytochrome- b ( Cytb ). Phylogenetic analyses of mitochondrial and nuclear genes produced similar topologies although levels of nodal support varied. The best-supported topology was obtained by combining nuclear and mitochondrial sequences. No monophyletic Peromyscus clade was supported. Instead, support was found for a clade containing Habromys , Megadontomys , Neotomodon , Osgoodomys , Podomys , and Peromyscus suggesting paraphyly of Peromyscus and confirming previous observations. Our analyses indicated an early divergence of Isthmomys from Peromyscus (approximately 8 million years ago), whereas most other peromyscine taxa emerged within the last 6 million years. To recover a monophyletic taxonomy from Peromyscus and affiliated lineages, we detail 3 taxonomic options in which Habromys , Megadontomys , Neotomodon , Osgoodomys , and Podomys are retained as genera, subsumed as subgenera, or subsumed as species groups within Peromyscus . Each option presents distinct taxonomic challenges, and the appropriate taxonomy must reflect the substantial levels of morphological divergence that characterize this group while maintaining the monophyletic relationships obtained from genetic data.
Makowsky, Robert; Cox, Christian L; Roelke, Corey; Chippindale, Paul T
2010-11-01
Determining the appropriate gene for phylogeny reconstruction can be a difficult process. Rapidly evolving genes tend to resolve recent relationships, but suffer from alignment issues and increased homoplasy among distantly related species. Conversely, slowly evolving genes generally perform best for deeper relationships, but lack sufficient variation to resolve recent relationships. We determine the relationship between sequence divergence and Bayesian phylogenetic reconstruction ability using both natural and simulated datasets. The natural data are based on 28 well-supported relationships within the subphylum Vertebrata. Sequences of 12 genes were acquired and Bayesian analyses were used to determine phylogenetic support for correct relationships. Simulated datasets were designed to determine whether an optimal range of sequence divergence exists across extreme phylogenetic conditions. Across all genes we found that an optimal range of divergence for resolving the correct relationships does exist, although this level of divergence expectedly depends on the distance metric. Simulated datasets show that an optimal range of sequence divergence exists across diverse topologies and models of evolution. We determine that a simple to measure property of genetic sequences (genetic distance) is related to phylogenic reconstruction ability in Bayesian analyses. This information should be useful for selecting the most informative gene to resolve any relationships, especially those that are difficult to resolve, as well as minimizing both cost and confounding information during project design. Copyright © 2010. Published by Elsevier Inc.
Kumar, Girish; Kocour, Martin; Kunal, Swaraj Priyaranjan
2016-05-01
In order to assess the DNA sequence variation and phylogenetic relationship among five tuna species (Auxis thazard, Euthynnus affinis, Katsuwonus pelamis, Thunnus tonggol, and T. albacares) out of all four tuna genera, partial sequences of the mitochondrial DNA (mtDNA) D-loop region were analyzed. The estimate of intra-specific sequence variation in studied species was low, ranging from 0.027 to 0.080 [Kimura's two parameter distance (K2P)], whereas values of inter-specific variation ranged from 0.049 to 0.491. The longtail tuna (T. tonggol) and yellowfin tuna (T. albacares) were found to share a close relationship (K2P = 0.049) while skipjack tuna (K. pelamis) was most divergent studied species. Phylogenetic analysis using Maximum-Likelihood (ML) and Neighbor-Joining (NJ) methods supported the monophyletic origin of Thunnus species. Similarly, phylogeny of Auxis and Euthynnus species substantiate the monophyly. However, results showed a distinct origin of K. pelamis from genus Thunnus as well as Auxis and Euthynnus. Thus, the mtDNA D-loop region sequence data supports the polyphyletic origin of tuna species.
Tong, Ying; Zheng, Kang; Zhao, Shufang; Xiao, Guanxiu; Luo, Chen
2012-11-01
Recent studies demonstrated that sequence divergence in both transcriptional regulatory region and coding region contributes to the subfunctionalization of duplicate gene. However, whether sequence divergence in the 3'-untranslated region (3'-UTR) has an impact on the subfunctionalization of duplicate genes remains unclear. Here, we identified two diverging duplicate vsx1 (visual system homeobox-1) loci in goldfish, named vsx1A1 and vsx1A2. Phylogenetic analysis suggests that vsx1A1 and vsx1A2 may arise from a duplication of vsx1 after the separation of goldfish and zebrafish. Sequence comparison revealed that divergence in both transcriptional and translational regulatory regions is higher than divergence in the introns. vsx1A2 expresses during blastula and gastrula stages and in adult retina but silences from segmentation stage to hatching stage, vsx1A1 starts expression from segmentation onward. Comparing to that zebrafish vsx1 expresses in all the developmental stages and in the adult retina, it appears that goldfish vsx1A1 and vsx1A2 are under going to share the functions of ancestral vsx1. The different but overlapping temporal expression patterns of vsx1A1 and vsx1A2 suggest that sequence divergence in the promoter region of duplicate vsx1 is not sufficient for partitioning the functions of ancestral vsx1. By comparing vsx1A1 and vsx1A2 3'-UTR-linked green fluorescent protein gene expression patterns, we demonstrated that the 3'-UTR of vsx1A1 remains but the 3'-UTR of vsx1A2 has lost the capability of mediating bipolar cell specific expression during retina development. These results indicate that sequence divergence in the 3'-UTRs has a clear effect on subfunctionalization of the duplicate genes. © 2012 WILEY PERIODICALS, INC.
Chromosome rearrangements via template switching between diverged repeated sequences
Anand, Ranjith P.; Tsaponina, Olga; Greenwell, Patricia W.; Lee, Cheng-Sheng; Du, Wei; Petes, Thomas D.
2014-01-01
Recent high-resolution genome analyses of cancer and other diseases have revealed the occurrence of microhomology-mediated chromosome rearrangements and copy number changes. Although some of these rearrangements appear to involve nonhomologous end-joining, many must have involved mechanisms requiring new DNA synthesis. Models such as microhomology-mediated break-induced replication (MM-BIR) have been invoked to explain these rearrangements. We examined BIR and template switching between highly diverged sequences in Saccharomyces cerevisiae, induced during repair of a site-specific double-strand break (DSB). Our data show that such template switches are robust mechanisms that give rise to complex rearrangements. Template switches between highly divergent sequences appear to be mechanistically distinct from the initial strand invasions that establish BIR. In particular, such jumps are less constrained by sequence divergence and exhibit a different pattern of microhomology junctions. BIR traversing repeated DNA sequences frequently results in complex translocations analogous to those seen in mammalian cells. These results suggest that template switching among repeated genes is a potent driver of genome instability and evolution. PMID:25367035
2010-01-01
Background Multiple sequence alignments are used to study gene or protein function, phylogenetic relations, genome evolution hypotheses and even gene polymorphisms. Virtually without exception, all available tools focus on conserved segments or residues. Small divergent regions, however, are biologically important for specific quantitative polymerase chain reaction, genotyping, molecular markers and preparation of specific antibodies, and yet have received little attention. As a consequence, they must be selected empirically by the researcher. AlignMiner has been developed to fill this gap in bioinformatic analyses. Results AlignMiner is a Web-based application for detection of conserved and divergent regions in alignments of conserved sequences, focusing particularly on divergence. It accepts alignments (protein or nucleic acid) obtained using any of a variety of algorithms, which does not appear to have a significant impact on the final results. AlignMiner uses different scoring methods for assessing conserved/divergent regions, Entropy being the method that provides the highest number of regions with the greatest length, and Weighted being the most restrictive. Conserved/divergent regions can be generated either with respect to the consensus sequence or to one master sequence. The resulting data are presented in a graphical interface developed in AJAX, which provides remarkable user interaction capabilities. Users do not need to wait until execution is complete and can.even inspect their results on a different computer. Data can be downloaded onto a user disk, in standard formats. In silico and experimental proof-of-concept cases have shown that AlignMiner can be successfully used to designing specific polymerase chain reaction primers as well as potential epitopes for antibodies. Primer design is assisted by a module that deploys several oligonucleotide parameters for designing primers "on the fly". Conclusions AlignMiner can be used to reliably detect divergent regions via several scoring methods that provide different levels of selectivity. Its predictions have been verified by experimental means. Hence, it is expected that its usage will save researchers' time and ensure an objective selection of the best-possible divergent region when closely related sequences are analysed. AlignMiner is freely available at http://www.scbi.uma.es/alignminer. PMID:20525162
Bodewes, R; Kik, M J L; Raj, V Stalin; Schapendonk, C M E; Haagmans, B L; Smits, S L; Osterhaus, A D M E
2013-06-01
Arenaviruses are bi-segmented negative-stranded RNA viruses, which were until recently only detected in rodents and humans. Now highly divergent arenaviruses have been identified in boid snakes with inclusion body disease (IBD). Here, we describe the identification of a new species and variants of the highly divergent arenaviruses, which were detected in tissues of captive boid snakes with IBD in The Netherlands by next-generation sequencing. Phylogenetic analysis of the complete sequence of the open reading frames of the four predicted proteins of one of the detected viruses revealed that this virus was most closely related to the recently identified Golden Gate virus, while considerable sequence differences were observed between the highly divergent arenaviruses detected in this study. These findings add to the recent identification of the highly divergent arenaviruses in boid snakes with IBD in the United States and indicate that these viruses also circulate among boid snakes in Europe.
Laughter and the Management of Divergent Positions in Peer Review Interactions
Raclaw, Joshua; Ford, Cecilia E.
2017-01-01
In this paper we focus on how participants in peer review interactions use laughter as a resource as they publicly report divergence of evaluative positions, divergence that is typical in the give and take of joint grant evaluation. Using the framework of conversation analysis, we examine the infusion of laughter and multimodal laugh-relevant practices into sequences of talk in meetings of grant reviewers deliberating on the evaluation and scoring of high-level scientific grant applications. We focus on a recurrent sequence in these meetings, what we call the score-reporting sequence, in which the assigned reviewers first announce the preliminary scores they have assigned to the grant. We demonstrate that such sequences are routine sites for the use of laugh practices to navigate the initial moments in which divergence of opinion is made explicit. In the context of meetings convened for the purposes of peer review, laughter thus serves as a valuable resource for managing the socially delicate but institutionally required reporting of divergence and disagreement that is endemic to meetings where these types of evaluative tasks are a focal activity. PMID:29170594
Wood, Dustin A.; Vandergast, Amy G.; Barr, Kelly R.; Inman, Richard D.; Esque, Todd C.; Nussear, Kenneth E.; Fisher, Robert N.
2013-01-01
Aim: We explored lineage diversification within desert-dwelling fauna. Our goals were (1) to determine whether phylogenetic lineages and population expansions were consistent with younger Pleistocene climate fluctuation hypotheses or much older events predicted by pre-Pleistocene vicariance hypotheses, (2) to assess concordance in spatial patterns of genetic divergence and diversity among species and (3) to identify regional evolutionary hotspots of divergence and diversity and assess their conservation status. Location: Mojave, Colorado, and Sonoran Deserts, USA. Methods: We analysed previously published gene sequence data for twelve species. We used Bayesian gene tree methods to estimate lineages and divergence times. Within each lineage, we tested for population expansion and age of expansion using coalescent approaches. We mapped interpopulation genetic divergence and intra-population genetic diversity in a GIS to identify hotspots of highest genetic divergence and diversity and to assess whether protected lands overlapped with evolutionary hotspots. Results: In seven of the 12 species, lineage divergence substantially predated the Pleistocene. Historical population expansion was found in eight species, but expansion events postdated the Last Glacial Maximum (LGM) in only four. For all species assessed, six hotspots of high genetic divergence and diversity were concentrated in the Colorado Desert, along the Colorado River and in the Mojave/Sonoran ecotone. At least some proportion of the land within each recovered hotspot was categorized as protected, yet four of the six also overlapped with major areas of human development. Main conclusions: Most of the species studied here diversified into distinct Mojave and Sonoran lineages prior to the LGM – supporting older diversification hypotheses. Several evolutionary hotspots were recovered but are not strategically paired with areas of protected land. Long-term preservation of species-level biodiversity would entail selecting areas for protection in Mojave and Sonoran Deserts to retain divergent genetic diversity and ensure connectedness across environmental gradients.
Analysis of the Macaca mulatta transcriptome and the sequence divergence between Macaca and human.
Magness, Charles L; Fellin, P Campion; Thomas, Matthew J; Korth, Marcus J; Agy, Michael B; Proll, Sean C; Fitzgibbon, Matthew; Scherer, Christina A; Miner, Douglas G; Katze, Michael G; Iadonato, Shawn P
2005-01-01
We report the initial sequencing and comparative analysis of the Macaca mulatta transcriptome. Cloned sequences from 11 tissues, nine animals, and three species (M. mulatta, M. fascicularis, and M. nemestrina) were sampled, resulting in the generation of 48,642 sequence reads. These data represent an initial sampling of the putative rhesus orthologs for 6,216 human genes. Mean nucleotide diversity within M. mulatta and sequence divergence among M. fascicularis, M. nemestrina, and M. mulatta are also reported.
Middleton, Christopher P.; Senerchia, Natacha; Stein, Nils; Akhunov, Eduard D.; Keller, Beat
2014-01-01
Using Roche/454 technology, we sequenced the chloroplast genomes of 12 Triticeae species, including bread wheat, barley and rye, as well as the diploid progenitors and relatives of bread wheat Triticum urartu, Aegilops speltoides and Ae. tauschii. Two wild tetraploid taxa, Ae. cylindrica and Ae. geniculata, were also included. Additionally, we incorporated wild Einkorn wheat Triticum boeoticum and its domesticated form T. monococcum and two Hordeum spontaneum (wild barley) genotypes. Chloroplast genomes were used for overall sequence comparison, phylogenetic analysis and dating of divergence times. We estimate that barley diverged from rye and wheat approximately 8–9 million years ago (MYA). The genome donors of hexaploid wheat diverged between 2.1–2.9 MYA, while rye diverged from Triticum aestivum approximately 3–4 MYA, more recently than previously estimated. Interestingly, the A genome taxa T. boeoticum and T. urartu were estimated to have diverged approximately 570,000 years ago. As these two have a reproductive barrier, the divergence time estimate also provides an upper limit for the time required for the formation of a species boundary between the two. Furthermore, we conclusively show that the chloroplast genome of hexaploid wheat was contributed by the B genome donor and that this unknown species diverged from Ae. speltoides about 980,000 years ago. Additionally, sequence alignments identified a translocation of a chloroplast segment to the nuclear genome which is specific to the rye/wheat lineage. We propose the presented phylogeny and divergence time estimates as a reference framework for future studies on Triticeae. PMID:24614886
Determining divergence times with a protein clock: update and reevaluation
NASA Technical Reports Server (NTRS)
Feng, D. F.; Cho, G.; Doolittle, R. F.; Bada, J. L. (Principal Investigator)
1997-01-01
A recent study of the divergence times of the major groups of organisms as gauged by amino acid sequence comparison has been expanded and the data have been reanalyzed with a distance measure that corrects for both constraints on amino acid interchange and variation in substitution rate at different sites. Beyond that, the availability of complete genome sequences for several eubacteria and an archaebacterium has had a great impact on the interpretation of certain aspects of the data. Thus, the majority of the archaebacterial sequences are not consistent with currently accepted views of the Tree of Life which cluster the archaebacteria with eukaryotes. Instead, they are either outliers or mixed in with eubacterial orthologs. The simplest resolution of the problem is to postulate that many of these sequences were carried into eukaryotes by early eubacterial endosymbionts about 2 billion years ago, only very shortly after or even coincident with the divergence of eukaryotes and archaebacteria. The strong resemblances of these same enzymes among the major eubacterial groups suggest that the cyanobacteria and Gram-positive and Gram-negative eubacteria also diverged at about this same time, whereas the much greater differences between archaebacterial and eubacterial sequences indicate these two groups may have diverged between 3 and 4 billion years ago.
Sequence-Level Mechanisms of Human Epigenome Evolution
Prendergast, James G.D.; Chambers, Emily V.; Semple, Colin A.M.
2014-01-01
DNA methylation and chromatin states play key roles in development and disease. However, the extent of recent evolutionary divergence in the human epigenome and the influential factors that have shaped it are poorly understood. To determine the links between genome sequence and human epigenome evolution, we examined the divergence of DNA methylation and chromatin states following segmental duplication events in the human lineage. Chromatin and DNA methylation states were found to have been generally well conserved following a duplication event, with the evolution of the epigenome largely uncoupled from the total number of genetic changes in the surrounding DNA sequence. However, the epigenome at tissue-specific, distal regulatory regions was observed to be unusually prone to diverge following duplication, with particular sequence differences, altering known sequence motifs, found to be associated with divergence in patterns of DNA methylation and chromatin. Alu elements were found to have played a particularly prominent role in shaping human epigenome evolution, and we show that human-specific AluY insertion events are strongly linked to the evolution of the DNA methylation landscape and gene expression levels, including at key neurological genes in the human brain. Studying paralogous regions within the same sample enables the study of the links between genome and epigenome evolution while controlling for biological and technical variation. We show DNA methylation and chromatin divergence between duplicated regions are linked to the divergence of particular genetic motifs, with Alu elements having played a disproportionate role in the evolution of the epigenome in the human lineage. PMID:24966180
Karin, Benjamin R; Freitas, Elyse S; Shonleben, Samuel; Grismer, L Lee; Bauer, Aaron M; Das, Indraneil
2018-01-12
We collected two specimens of an undescribed species of Lygosoma from pitfall traps in an urban rainforest in Kuching and from the base of a forested hill in western Sarawak, East Malaysia. The new species is diagnosable from all south-east Asian congeners by morphological characters, and most closely resembles Lygosoma herberti from the Thai-Malay Peninsula. The new species shows substantial molecular divergence from its closest relatives in two protein-coding genes, one mitochondrial (ND1) and one nuclear (R35) that we sequenced for several south-east Asian congeners. We describe the new species on the basis of this distinct morphology and genetic divergence. It is the third species of Lygosoma known from Borneo, and highlights the continuing rise in lizard species diversity on the island. In addition, the discovery of this species from a small urban rainforest underscores the importance of preserving intact rainforest areas of any size in maintaining species diversity.
Duim, Birgitta; van der Graaf-van Bloois, Linda; Wagenaar, Jaap A; Zomer, Aldert L
2018-01-01
Abstract Homologous recombination is a major driver of bacterial speciation. Genetic divergence and host association are important factors influencing homologous recombination. Here, we study these factors for Campylobacter fetus, which shows a distinct intraspecific host dichotomy. Campylobacter fetus subspecies fetus (Cff) and venerealis are associated with mammals, whereas C. fetus subsp. testudinum (Cft) is associated with reptiles. Recombination between these genetically divergent C. fetus lineages is extremely rare. Previously it was impossible to show whether this barrier to recombination was determined by the differential host preferences, by the genetic divergence between both lineages or by other factors influencing recombination, such as restriction-modification, CRISPR/Cas, and transformation systems. Fortuitously, a distinct C. fetus lineage (ST69) was found, which was highly related to mammal-associated C. fetus, yet isolated from a chelonian. The whole genome sequences of two C. fetus ST69 isolates were compared with those of mammal- and reptile-associated C. fetus strains for phylogenetic and recombination analysis. In total, 5.1–5.5% of the core genome of both ST69 isolates showed signs of recombination. Of the predicted recombination regions, 80.4% were most closely related to Cft, 14.3% to Cff, and 5.6% to C. iguaniorum. Recombination from C. fetus ST69 to Cft was also detected, but to a lesser extent and only in chelonian-associated Cft strains. This study shows that despite substantial genetic divergence no absolute barrier to homologous recombination exists between two distinct C. fetus lineages when occurring in the same host type, which provides valuable insights in bacterial speciation and evolution. PMID:29608720
Ned B. Klopfenstein; John W. Hanna; Amy L. Ross-Davis; Jane E. Stewart; Yuko Ota; Rosario Medel-Ortiz; Miguel Armando Lopez-Ramirez; Ruben Damian Elias-Roman; Dionicio Alvarado-Rosales; Mee-Sook Kim
2013-01-01
Armillaria plays diverse ecological roles in forests worldwide, which has inspired interest in understanding phylogenetic relationships within and among species of this genus. Previous rDNA sequence-based phylogenetic analyses of Armillaria have shown general relationships among widely divergent taxa, but rDNA sequences were not reliable for separating closely related...
Lukhaup, Christian; Panteleit, Jörn; Schrimpf, Anne
2015-01-01
Abstract A new species, Cherax snowden sp. n., from the Oinsok River Drainage, Sawiat District in the central part of the Kepala Burung (Vogelkop) Peninsula, West Papua, Indonesia, is described, figured and compared with the closest related species, Cherax holthuisi Lukhaup & Pekny, 2006. This species is collected and exported for ornamental purposes and its commercial name in the pet trade is “orange tip” or “green orange tip”. Both species may be easily distinguished morphologically or by using sequence divergence, which is substantial, for considering Cherax snowden sp. n. to be a new species. PMID:26448698
Genome sequencing highlights the dynamic early history of dogs.
Freedman, Adam H; Gronau, Ilan; Schweizer, Rena M; Ortega-Del Vecchyo, Diego; Han, Eunjung; Silva, Pedro M; Galaverni, Marco; Fan, Zhenxin; Marx, Peter; Lorente-Galdos, Belen; Beale, Holly; Ramirez, Oscar; Hormozdiari, Farhad; Alkan, Can; Vilà, Carles; Squire, Kevin; Geffen, Eli; Kusak, Josip; Boyko, Adam R; Parker, Heidi G; Lee, Clarence; Tadigotla, Vasisht; Wilton, Alan; Siepel, Adam; Bustamante, Carlos D; Harkins, Timothy T; Nelson, Stanley F; Ostrander, Elaine A; Marques-Bonet, Tomas; Wayne, Robert K; Novembre, John
2014-01-01
To identify genetic changes underlying dog domestication and reconstruct their early evolutionary history, we generated high-quality genome sequences from three gray wolves, one from each of the three putative centers of dog domestication, two basal dog lineages (Basenji and Dingo) and a golden jackal as an outgroup. Analysis of these sequences supports a demographic model in which dogs and wolves diverged through a dynamic process involving population bottlenecks in both lineages and post-divergence gene flow. In dogs, the domestication bottleneck involved at least a 16-fold reduction in population size, a much more severe bottleneck than estimated previously. A sharp bottleneck in wolves occurred soon after their divergence from dogs, implying that the pool of diversity from which dogs arose was substantially larger than represented by modern wolf populations. We narrow the plausible range for the date of initial dog domestication to an interval spanning 11-16 thousand years ago, predating the rise of agriculture. In light of this finding, we expand upon previous work regarding the increase in copy number of the amylase gene (AMY2B) in dogs, which is believed to have aided digestion of starch in agricultural refuse. We find standing variation for amylase copy number variation in wolves and little or no copy number increase in the Dingo and Husky lineages. In conjunction with the estimated timing of dog origins, these results provide additional support to archaeological finds, suggesting the earliest dogs arose alongside hunter-gathers rather than agriculturists. Regarding the geographic origin of dogs, we find that, surprisingly, none of the extant wolf lineages from putative domestication centers is more closely related to dogs, and, instead, the sampled wolves form a sister monophyletic clade. This result, in combination with dog-wolf admixture during the process of domestication, suggests that a re-evaluation of past hypotheses regarding dog origins is necessary.
Genome Sequencing Highlights the Dynamic Early History of Dogs
Freedman, Adam H.; Gronau, Ilan; Schweizer, Rena M.; Ortega-Del Vecchyo, Diego; Han, Eunjung; Silva, Pedro M.; Galaverni, Marco; Fan, Zhenxin; Marx, Peter; Lorente-Galdos, Belen; Beale, Holly; Ramirez, Oscar; Hormozdiari, Farhad; Alkan, Can; Vilà, Carles; Squire, Kevin; Geffen, Eli; Kusak, Josip; Boyko, Adam R.; Parker, Heidi G.; Lee, Clarence; Tadigotla, Vasisht; Siepel, Adam; Bustamante, Carlos D.; Harkins, Timothy T.; Nelson, Stanley F.; Ostrander, Elaine A.; Marques-Bonet, Tomas; Wayne, Robert K.; Novembre, John
2014-01-01
To identify genetic changes underlying dog domestication and reconstruct their early evolutionary history, we generated high-quality genome sequences from three gray wolves, one from each of the three putative centers of dog domestication, two basal dog lineages (Basenji and Dingo) and a golden jackal as an outgroup. Analysis of these sequences supports a demographic model in which dogs and wolves diverged through a dynamic process involving population bottlenecks in both lineages and post-divergence gene flow. In dogs, the domestication bottleneck involved at least a 16-fold reduction in population size, a much more severe bottleneck than estimated previously. A sharp bottleneck in wolves occurred soon after their divergence from dogs, implying that the pool of diversity from which dogs arose was substantially larger than represented by modern wolf populations. We narrow the plausible range for the date of initial dog domestication to an interval spanning 11–16 thousand years ago, predating the rise of agriculture. In light of this finding, we expand upon previous work regarding the increase in copy number of the amylase gene (AMY2B) in dogs, which is believed to have aided digestion of starch in agricultural refuse. We find standing variation for amylase copy number variation in wolves and little or no copy number increase in the Dingo and Husky lineages. In conjunction with the estimated timing of dog origins, these results provide additional support to archaeological finds, suggesting the earliest dogs arose alongside hunter-gathers rather than agriculturists. Regarding the geographic origin of dogs, we find that, surprisingly, none of the extant wolf lineages from putative domestication centers is more closely related to dogs, and, instead, the sampled wolves form a sister monophyletic clade. This result, in combination with dog-wolf admixture during the process of domestication, suggests that a re-evaluation of past hypotheses regarding dog origins is necessary. PMID:24453982
Concerted evolution at the population level: pupfish HindIII satellite DNA sequences.
Elder, J F; Turner, B J
1994-01-01
The canonical monomers (approximately 170 bp) of an abundant (1.9 x 10(6) copies per diploid genome) satellite DNA sequence family in the genome of Cyprinodon variegatus, a "pupfish" that ranges along the Atlantic coast from Cape Cod to central Mexico, are divergent in base sequence in 10 of 12 samples collected from natural populations. The divergence involves substitutions, deletions, and insertions, is marked in scope (mean pairwise sequence similarity = 61.6%; range = 35-95.9%), is largely confined to the 3' half of the monomer, and is not correlated with the distance among collecting sites. Repetitive cloning and direct genomic sequencing experiments failed to detect intrapopulation and intraindividual variation, suggesting high levels of sequence homogeneity within populations. The satellite sequence has therefore undergone "concerted evolution," at the level of the local population. Concerted evolution has previously almost always been discussed in terms of the divergence of species or higher taxa; its intraspecific occurrence apparently has not been reported previously. The generality of the observation is difficult to evaluate, for although satellite DNAs from a large number of organisms have been studied in detail, there appear to be little or no other data on their sequence variation in natural populations. The relationship (if any) between concerted, population level, satellite DNA divergence and the extent of gene flow/genetic isolation among conspecific natural populations remains to be established. Images PMID:8302879
Joseph, Sneha; Poriya, Paresh; Kundu, Rahul
2016-11-01
The present study reports the phylogenetic relationship of six zoanthid species belonging to three genera, Isaurus, Palythoa, and Zoanthus identified using systematic computational analysis of mtDNA gene sequences. All six species are first recorded from the coasts of Kathiawar Peninsula, India. Genus: Isaurus is represented by Isaurus tuberculatus, genus Zoanthus is represented by Zoanthus kuroshio and Zoanthus sansibaricus, while genus Palythoa is represented by Palythoa tuberculosa, P. sp. JVK-2006 and Palythoa heliodiscus. Results of the present study revealed that among the various species observed along the coastline, a minimum of 99% sequence divergence and a maximum of 96% sequence divergence were seen. An interspecific divergence of 1-4% and negligible intraspecific divergence was observed. These results not only highlighted the efficiency of the COI gene region in species identification but also demonstrated the genetic variability of zoanthids along the Saurashtra coastline of the west coast of India.
NASA Astrophysics Data System (ADS)
Davison, John; Ho, Simon Y. W.; Bray, Sarah C.; Korsten, Marju; Tammeleht, Egle; Hindrikson, Maris; Østbye, Kjartan; Østbye, Eivind; Lauritzen, Stein-Erik; Austin, Jeremy; Cooper, Alan; Saarma, Urmas
2011-02-01
This review provides an up-to-date synthesis of the matrilineal phylogeography of a uniquely well-studied Holarctic mammal, the brown bear. We extend current knowledge by presenting a DNA sequence derived from one of the earliest known fossils of a polar bear (dated to 115 000 years before present), a species that shares a paraphyletic mitochondrial association with brown bears. A molecular clock analysis of 140 mitochondrial DNA sequences, including our new polar bear sequence, provides novel insights into the times of origin for different brown bear clades. We propose a number of regional biogeographic scenarios based on genetic data, divergence time estimates and paleontological records. The case of the brown bear provides an example for researchers working with less well-studied taxa: it shows clearly that phylogeographic models based on patterns of modern genetic variation alone can be substantially improved by including data on historical patterns of genetic diversity in the form of ancient DNA sequences derived from accurately dated samples and by using an approach to divergence-time estimation that suits the data under analysis. Using such approaches it has been possible to (i) establish that the processes shaping modern genetic diversity in brown bears acted recently, within the last three glacial cycles; (ii) distinguish among hypotheses concerning species' responses to climatic oscillations in accordance with the lack of phylogeographic structure that existed in brown bears prior to the last glacial maximum (LGM); (iii) reassess theories linking monophyletic brown bear populations to particular LGM refuge areas; and (iv) identify vicariance events and track analogous patterns of migration by brown bears out of Eurasia to North America and Japan.
Rutter, William B; Salcedo, Andres; Akhunova, Alina; He, Fei; Wang, Shichen; Liang, Hanquan; Bowden, Robert L; Akhunov, Eduard
2017-04-12
Two opposing evolutionary constraints exert pressure on plant pathogens: one to diversify virulence factors in order to evade plant defenses, and the other to retain virulence factors critical for maintaining a compatible interaction with the plant host. To better understand how the diversified arsenals of fungal genes promote interaction with the same compatible wheat line, we performed a comparative genomic analysis of two North American isolates of Puccinia graminis f. sp. tritici (Pgt). The patterns of inter-isolate divergence in the secreted candidate effector genes were compared with the levels of conservation and divergence of plant-pathogen gene co-expression networks (GCN) developed for each isolate. Comprative genomic analyses revealed substantial level of interisolate divergence in effector gene complement and sequence divergence. Gene Ontology (GO) analyses of the conserved and unique parts of the isolate-specific GCNs identified a number of conserved host pathways targeted by both isolates. Interestingly, the degree of inter-isolate sub-network conservation varied widely for the different host pathways and was positively associated with the proportion of conserved effector candidates associated with each sub-network. While different Pgt isolates tended to exploit similar wheat pathways for infection, the mode of plant-pathogen interaction varied for different pathways with some pathways being associated with the conserved set of effectors and others being linked with the diverged or isolate-specific effectors. Our data suggest that at the intra-species level pathogen populations likely maintain divergent sets of effectors capable of targeting the same plant host pathways. This functional redundancy may play an important role in the dynamic of the "arms-race" between host and pathogen serving as the basis for diverse virulence strategies and creating conditions where mutations in certain effector groups will not have a major effect on the pathogen's ability to infect the host.
A Comparative Encyclopedia of DNA Elements in the Mouse Genome
Yue, Feng; Cheng, Yong; Breschi, Alessandra; Vierstra, Jeff; Wu, Weisheng; Ryba, Tyrone; Sandstrom, Richard; Ma, Zhihai; Davis, Carrie; Pope, Benjamin D.; Shen, Yin; Pervouchine, Dmitri D.; Djebali, Sarah; Thurman, Bob; Kaul, Rajinder; Rynes, Eric; Kirilusha, Anthony; Marinov, Georgi K.; Williams, Brian A.; Trout, Diane; Amrhein, Henry; Fisher-Aylor, Katherine; Antoshechkin, Igor; DeSalvo, Gilberto; See, Lei-Hoon; Fastuca, Meagan; Drenkow, Jorg; Zaleski, Chris; Dobin, Alex; Prieto, Pablo; Lagarde, Julien; Bussotti, Giovanni; Tanzer, Andrea; Denas, Olgert; Li, Kanwei; Bender, M. A.; Zhang, Miaohua; Byron, Rachel; Groudine, Mark T.; McCleary, David; Pham, Long; Ye, Zhen; Kuan, Samantha; Edsall, Lee; Wu, Yi-Chieh; Rasmussen, Matthew D.; Bansal, Mukul S.; Keller, Cheryl A.; Morrissey, Christapher S.; Mishra, Tejaswini; Jain, Deepti; Dogan, Nergiz; Harris, Robert S.; Cayting, Philip; Kawli, Trupti; Boyle, Alan P.; Euskirchen, Ghia; Kundaje, Anshul; Lin, Shin; Lin, Yiing; Jansen, Camden; Malladi, Venkat S.; Cline, Melissa S.; Erickson, Drew T.; Kirkup, Vanessa M; Learned, Katrina; Sloan, Cricket A.; Rosenbloom, Kate R.; de Sousa, Beatriz Lacerda; Beal, Kathryn; Pignatelli, Miguel; Flicek, Paul; Lian, Jin; Kahveci, Tamer; Lee, Dongwon; Kent, W. James; Santos, Miguel Ramalho; Herrero, Javier; Notredame, Cedric; Johnson, Audra; Vong, Shinny; Lee, Kristen; Bates, Daniel; Neri, Fidencio; Diegel, Morgan; Canfield, Theresa; Sabo, Peter J.; Wilken, Matthew S.; Reh, Thomas A.; Giste, Erika; Shafer, Anthony; Kutyavin, Tanya; Haugen, Eric; Dunn, Douglas; Reynolds, Alex P.; Neph, Shane; Humbert, Richard; Hansen, R. Scott; De Bruijn, Marella; Selleri, Licia; Rudensky, Alexander; Josefowicz, Steven; Samstein, Robert; Eichler, Evan E.; Orkin, Stuart H.; Levasseur, Dana; Papayannopoulou, Thalia; Chang, Kai-Hsin; Skoultchi, Arthur; Gosh, Srikanta; Disteche, Christine; Treuting, Piper; Wang, Yanli; Weiss, Mitchell J.; Blobel, Gerd A.; Good, Peter J.; Lowdon, Rebecca F.; Adams, Leslie B.; Zhou, Xiao-Qiao; Pazin, Michael J.; Feingold, Elise A.; Wold, Barbara; Taylor, James; Kellis, Manolis; Mortazavi, Ali; Weissman, Sherman M.; Stamatoyannopoulos, John; Snyder, Michael P.; Guigo, Roderic; Gingeras, Thomas R.; Gilbert, David M.; Hardison, Ross C.; Beer, Michael A.; Ren, Bing
2014-01-01
Summary As the premier model organism in biomedical research, the laboratory mouse shares the majority of protein-coding genes with humans, yet the two mammals differ in significant ways. To gain greater insights into both shared and species-specific transcriptional and cellular regulatory programs in the mouse, the Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications, and replication domains throughout the mouse genome in diverse cell and tissue types. By comparing with the human genome, we not only confirm substantial conservation in the newly annotated potential functional sequences, but also find a large degree of divergence of other sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization. Our results illuminate the wide range of evolutionary forces acting on genes and their regulatory regions, and provide a general resource for research into mammalian biology and mechanisms of human diseases. PMID:25409824
A comparative encyclopedia of DNA elements in the mouse genome.
Yue, Feng; Cheng, Yong; Breschi, Alessandra; Vierstra, Jeff; Wu, Weisheng; Ryba, Tyrone; Sandstrom, Richard; Ma, Zhihai; Davis, Carrie; Pope, Benjamin D; Shen, Yin; Pervouchine, Dmitri D; Djebali, Sarah; Thurman, Robert E; Kaul, Rajinder; Rynes, Eric; Kirilusha, Anthony; Marinov, Georgi K; Williams, Brian A; Trout, Diane; Amrhein, Henry; Fisher-Aylor, Katherine; Antoshechkin, Igor; DeSalvo, Gilberto; See, Lei-Hoon; Fastuca, Meagan; Drenkow, Jorg; Zaleski, Chris; Dobin, Alex; Prieto, Pablo; Lagarde, Julien; Bussotti, Giovanni; Tanzer, Andrea; Denas, Olgert; Li, Kanwei; Bender, M A; Zhang, Miaohua; Byron, Rachel; Groudine, Mark T; McCleary, David; Pham, Long; Ye, Zhen; Kuan, Samantha; Edsall, Lee; Wu, Yi-Chieh; Rasmussen, Matthew D; Bansal, Mukul S; Kellis, Manolis; Keller, Cheryl A; Morrissey, Christapher S; Mishra, Tejaswini; Jain, Deepti; Dogan, Nergiz; Harris, Robert S; Cayting, Philip; Kawli, Trupti; Boyle, Alan P; Euskirchen, Ghia; Kundaje, Anshul; Lin, Shin; Lin, Yiing; Jansen, Camden; Malladi, Venkat S; Cline, Melissa S; Erickson, Drew T; Kirkup, Vanessa M; Learned, Katrina; Sloan, Cricket A; Rosenbloom, Kate R; Lacerda de Sousa, Beatriz; Beal, Kathryn; Pignatelli, Miguel; Flicek, Paul; Lian, Jin; Kahveci, Tamer; Lee, Dongwon; Kent, W James; Ramalho Santos, Miguel; Herrero, Javier; Notredame, Cedric; Johnson, Audra; Vong, Shinny; Lee, Kristen; Bates, Daniel; Neri, Fidencio; Diegel, Morgan; Canfield, Theresa; Sabo, Peter J; Wilken, Matthew S; Reh, Thomas A; Giste, Erika; Shafer, Anthony; Kutyavin, Tanya; Haugen, Eric; Dunn, Douglas; Reynolds, Alex P; Neph, Shane; Humbert, Richard; Hansen, R Scott; De Bruijn, Marella; Selleri, Licia; Rudensky, Alexander; Josefowicz, Steven; Samstein, Robert; Eichler, Evan E; Orkin, Stuart H; Levasseur, Dana; Papayannopoulou, Thalia; Chang, Kai-Hsin; Skoultchi, Arthur; Gosh, Srikanta; Disteche, Christine; Treuting, Piper; Wang, Yanli; Weiss, Mitchell J; Blobel, Gerd A; Cao, Xiaoyi; Zhong, Sheng; Wang, Ting; Good, Peter J; Lowdon, Rebecca F; Adams, Leslie B; Zhou, Xiao-Qiao; Pazin, Michael J; Feingold, Elise A; Wold, Barbara; Taylor, James; Mortazavi, Ali; Weissman, Sherman M; Stamatoyannopoulos, John A; Snyder, Michael P; Guigo, Roderic; Gingeras, Thomas R; Gilbert, David M; Hardison, Ross C; Beer, Michael A; Ren, Bing
2014-11-20
The laboratory mouse shares the majority of its protein-coding genes with humans, making it the premier model organism in biomedical research, yet the two mammals differ in significant ways. To gain greater insights into both shared and species-specific transcriptional and cellular regulatory programs in the mouse, the Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications and replication domains throughout the mouse genome in diverse cell and tissue types. By comparing with the human genome, we not only confirm substantial conservation in the newly annotated potential functional sequences, but also find a large degree of divergence of sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization. Our results illuminate the wide range of evolutionary forces acting on genes and their regulatory regions, and provide a general resource for research into mammalian biology and mechanisms of human diseases.
Giraffe genome sequence reveals clues to its unique morphology and physiology
Agaba, Morris; Ishengoma, Edson; Miller, Webb C.; McGrath, Barbara C.; Hudson, Chelsea N.; Bedoya Reina, Oscar C.; Ratan, Aakrosh; Burhans, Rico; Chikhi, Rayan; Medvedev, Paul; Praul, Craig A.; Wu-Cavener, Lan; Wood, Brendan; Robertson, Heather; Penfold, Linda; Cavener, Douglas R.
2016-01-01
The origins of giraffe's imposing stature and associated cardiovascular adaptations are unknown. Okapi, which lacks these unique features, is giraffe's closest relative and provides a useful comparison, to identify genetic variation underlying giraffe's long neck and cardiovascular system. The genomes of giraffe and okapi were sequenced, and through comparative analyses genes and pathways were identified that exhibit unique genetic changes and likely contribute to giraffe's unique features. Some of these genes are in the HOX, NOTCH and FGF signalling pathways, which regulate both skeletal and cardiovascular development, suggesting that giraffe's stature and cardiovascular adaptations evolved in parallel through changes in a small number of genes. Mitochondrial metabolism and volatile fatty acids transport genes are also evolutionarily diverged in giraffe and may be related to its unusual diet that includes toxic plants. Unexpectedly, substantial evolutionary changes have occurred in giraffe and okapi in double-strand break repair and centrosome functions. PMID:27187213
Nucleotide sequences of bovine alpha S1- and kappa-casein cDNAs.
Stewart, A F; Willis, I M; Mackinlay, A G
1984-01-01
The nucleotide sequences corresponding to bovine alpha S1- and kappa-casein mRNAs are presented. An unusual alpha S1-casein cDNA has been characterised whose 5' end commences upstream from its putative TATA box. The alpha S1-casein mRNA is compared to rat alpha-casein mRNA and two components of divergence are identified. Firstly, the two sequences have diverged at a high point mutation rate and the rate of amino acid replacement by this mechanism is at least as great as the rate of divergence of any other part of the mRNAs. Secondly, the protein coding sequence has been subjected to several insertion/deletion events, one of which may be an example of exon shuffling . The kappa-casein mRNA sequence verifies the proposition that it has arisen from a different ancestral gene to the other caseins. Images PMID:6328443
Skoglund, Pontus; Ersmark, Erik; Palkopoulou, Eleftheria; Dalén, Love
2015-06-01
The origin of domestic dogs is poorly understood [1-15], with suggested evidence of dog-like features in fossils that predate the Last Glacial Maximum [6, 9, 10, 14, 16] conflicting with genetic estimates of a more recent divergence between dogs and worldwide wolf populations [13, 15, 17-19]. Here, we present a draft genome sequence from a 35,000-year-old wolf from the Taimyr Peninsula in northern Siberia. We find that this individual belonged to a population that diverged from the common ancestor of present-day wolves and dogs very close in time to the appearance of the domestic dog lineage. We use the directly dated ancient wolf genome to recalibrate the molecular timescale of wolves and dogs and find that the mutation rate is substantially slower than assumed by most previous studies, suggesting that the ancestors of dogs were separated from present-day wolves before the Last Glacial Maximum. We also find evidence of introgression from the archaic Taimyr wolf lineage into present-day dog breeds from northeast Siberia and Greenland, contributing between 1.4% and 27.3% of their ancestry. This demonstrates that the ancestry of present-day dogs is derived from multiple regional wolf populations. Copyright © 2015 Elsevier Ltd. All rights reserved.
Lerner, Heather R L; Meyer, Matthias; James, Helen F; Hofreiter, Michael; Fleischer, Robert C
2011-11-08
Evolutionary theory has gained tremendous insight from studies of adaptive radiations. High rates of speciation, morphological divergence, and hybridization, combined with low sequence variability, however, have prevented phylogenetic reconstruction for many radiations. The Hawaiian honeycreepers are an exceptional adaptive radiation, with high phenotypic diversity and speciation that occurred within the geologically constrained setting of the Hawaiian Islands. Here we analyze a new data set of 13 nuclear loci and pyrosequencing of mitochondrial genomes that resolves the Hawaiian honeycreeper phylogeny. We show that they are a sister taxon to Eurasian rosefinches (Carpodacus) and probably came to Hawaii from Asia. We use island ages to calibrate DNA substitution rates, which vary substantially among gene regions, and calculate divergence times, showing that the radiation began roughly when the oldest of the current large Hawaiian Islands (Kauai and Niihau) formed, ~5.7 million years ago (mya). We show that most of the lineages that gave rise to distinctive morphologies diverged after Oahu emerged (4.0-3.7 mya) but before the formation of Maui and adjacent islands (2.4-1.9 mya). Thus, the formation of Oahu, and subsequent cycles of colonization and speciation between Kauai and Oahu, played key roles in generating the morphological diversity of the extant honeycreepers. Copyright © 2011 Elsevier Ltd. All rights reserved.
Inbred or Outbred? Genetic Diversity in Laboratory Rodent Colonies
Brekke, Thomas D.; Steele, Katherine A.; Mulley, John F.
2017-01-01
Nonmodel rodents are widely used as subjects for both basic and applied biological research, but the genetic diversity of the study individuals is rarely quantified. University-housed colonies tend to be small and subject to founder effects and genetic drift; so they may be highly inbred or show substantial genetic divergence from other colonies, even those derived from the same source. Disregard for the levels of genetic diversity in an animal colony may result in a failure to replicate results if a different colony is used to repeat an experiment, as different colonies may have fixed alternative variants. Here we use high throughput sequencing to demonstrate genetic divergence in three isolated colonies of Mongolian gerbil (Meriones unguiculatus) even though they were all established recently from the same source. We also show that genetic diversity in allegedly “outbred” colonies of nonmodel rodents (gerbils, hamsters, house mice, deer mice, and rats) varies considerably from nearly no segregating diversity to very high levels of polymorphism. We conclude that genetic divergence in isolated colonies may play an important role in the “replication crisis.” In a more positive light, divergent rodent colonies represent an opportunity to leverage genetically distinct individuals in genetic crossing experiments. In sum, awareness of the genetic diversity of an animal colony is paramount as it allows researchers to properly replicate experiments and also to capitalize on other genetically distinct individuals to explore the genetic basis of a trait. PMID:29242387
El-Sherry, Shiem; Ogedengbe, Mosun E; Hafeez, Mian A; Barta, John R
2013-07-01
Multiple 18S rDNA sequences were obtained from two single-oocyst-derived lines of each of Eimeria meleagrimitis and Eimeria adenoeides. After analysing the 15 new 18S rDNA sequences from two lines of E. meleagrimitis and 17 new sequences from two lines of E. adenoeides, there were clear indications that divergent, paralogous 18S rDNA copies existed within the nuclear genome of E. meleagrimitis. In contrast, mitochondrial cytochrome c oxidase subunit I (COI) partial sequences from all lines of a particular Eimeria sp. were identical and, in phylogenetic analyses, COI sequences clustered unambiguously in monophyletic and highly-supported clades specific to individual Eimeria sp. Phylogenetic analysis of the new 18S rDNA sequences from E. meleagrimitis showed that they formed two distinct clades: Type A with four new sequences; and Type B with nine new sequences; both Types A and B sequences were obtained from each of the single-oocyst-derived lines of E. meleagrimitis. Together these rDNA types formed a well-supported E. meleagrimitis clade. Types A and B 18S rDNA sequences from E. meleagrimitis had a mean sequence identity of only 97.4% whereas mean sequence identity within types was 99.1-99.3%. The observed intraspecific sequence divergence among E. meleagrimitis 18S rDNA sequence types was even higher (approximately 2.6%) than the interspecific sequence divergence present between some well-recognized species such as Eimeria tenella and Eimeria necatrix (1.1%). Our observations suggest that, unlike COI sequences, 18S rDNA sequences are not reliable molecular markers to be used alone for species identification with coccidia, although 18S rDNA sequences have clear utility for phylogenetic reconstruction of apicomplexan parasites at the genus and higher taxonomic ranks. Copyright © 2013. Published by Elsevier Ltd.
Finnerty, John R; Mazza, Maureen E; Jezewski, Peter A
2009-01-01
Background Msx originated early in animal evolution and is implicated in human genetic disorders. To reconstruct the functional evolution of Msx and inform the study of human mutations, we analyzed the phylogeny and synteny of 46 metazoan Msx proteins and tracked the duplication, diversification and loss of conserved motifs. Results Vertebrate Msx sequences sort into distinct Msx1, Msx2 and Msx3 clades. The sister-group relationship between MSX1 and MSX2 reflects their derivation from the 4p/5q chromosomal paralogon, a derivative of the original "MetaHox" cluster. We demonstrate physical linkage between Msx and other MetaHox genes (Hmx, NK1, Emx) in a cnidarian. Seven conserved domains, including two Groucho repression domains (N- and C-terminal), were present in the ancestral Msx. In cnidarians, the Groucho domains are highly similar. In vertebrate Msx1, the N-terminal Groucho domain is conserved, while the C-terminal domain diverged substantially, implying a novel function. In vertebrate Msx2 and Msx3, the C-terminal domain was lost. MSX1 mutations associated with ectodermal dysplasia or orofacial clefting disorders map to conserved domains in a non-random fashion. Conclusion Msx originated from a MetaHox ancestor that also gave rise to Tlx, Demox, NK, and possibly EHGbox, Hox and ParaHox genes. Duplication, divergence or loss of domains played a central role in the functional evolution of Msx. Duplicated domains allow pleiotropically expressed proteins to evolve new functions without disrupting existing interaction networks. Human missense sequence variants reside within evolutionarily conserved domains, likely disrupting protein function. This phylogenomic evaluation of candidate disease markers will inform clinical and functional studies. PMID:19154605
Finnerty, John R; Mazza, Maureen E; Jezewski, Peter A
2009-01-20
Msx originated early in animal evolution and is implicated in human genetic disorders. To reconstruct the functional evolution of Msx and inform the study of human mutations, we analyzed the phylogeny and synteny of 46 metazoan Msx proteins and tracked the duplication, diversification and loss of conserved motifs. Vertebrate Msx sequences sort into distinct Msx1, Msx2 and Msx3 clades. The sister-group relationship between MSX1 and MSX2 reflects their derivation from the 4p/5q chromosomal paralogon, a derivative of the original "MetaHox" cluster. We demonstrate physical linkage between Msx and other MetaHox genes (Hmx, NK1, Emx) in a cnidarian. Seven conserved domains, including two Groucho repression domains (N- and C-terminal), were present in the ancestral Msx. In cnidarians, the Groucho domains are highly similar. In vertebrate Msx1, the N-terminal Groucho domain is conserved, while the C-terminal domain diverged substantially, implying a novel function. In vertebrate Msx2 and Msx3, the C-terminal domain was lost. MSX1 mutations associated with ectodermal dysplasia or orofacial clefting disorders map to conserved domains in a non-random fashion. Msx originated from a MetaHox ancestor that also gave rise to Tlx, Demox, NK, and possibly EHGbox, Hox and ParaHox genes. Duplication, divergence or loss of domains played a central role in the functional evolution of Msx. Duplicated domains allow pleiotropically expressed proteins to evolve new functions without disrupting existing interaction networks. Human missense sequence variants reside within evolutionarily conserved domains, likely disrupting protein function. This phylogenomic evaluation of candidate disease markers will inform clinical and functional studies.
Blazier, J Chris; Ruhlman, Tracey A; Weng, Mao-Lun; Rehman, Sumaiyah K; Sabir, Jamal S M; Jansen, Robert K
2016-04-18
Genes for the plastid-encoded RNA polymerase (PEP) persist in the plastid genomes of all photosynthetic angiosperms. However, three unrelated lineages (Annonaceae, Passifloraceae and Geraniaceae) have been identified with unusually divergent open reading frames (ORFs) in the conserved region of rpoA, the gene encoding the PEP α subunit. We used sequence-based approaches to evaluate whether these genes retain function. Both gene sequences and complete plastid genome sequences were assembled and analyzed from each of the three angiosperm families. Multiple lines of evidence indicated that the rpoA sequences are likely functional despite retaining as low as 30% nucleotide sequence identity with rpoA genes from outgroups in the same angiosperm order. The ratio of non-synonymous to synonymous substitutions indicated that these genes are under purifying selection, and bioinformatic prediction of conserved domains indicated that functional domains are preserved. One of the lineages (Pelargonium, Geraniaceae) contains species with multiple rpoA-like ORFs that show evidence of ongoing inter-paralog gene conversion. The plastid genomes containing these divergent rpoA genes have experienced extensive structural rearrangement, including large expansions of the inverted repeat. We propose that illegitimate recombination, not positive selection, has driven the divergence of rpoA.
Phenotypic and Genetic Divergence among Poison Frog Populations in a Mimetic Radiation
Twomey, Evan; Yeager, Justin; Brown, Jason Lee; Morales, Victor; Cummings, Molly; Summers, Kyle
2013-01-01
The evolution of Müllerian mimicry is, paradoxically, associated with high levels of diversity in color and pattern. In a mimetic radiation, different populations of a species evolve to resemble different models, which can lead to speciation. Yet there are circumstances under which initial selection for divergence under mimicry may be reversed. Here we provide evidence for the evolution of extensive phenotypic divergence in a mimetic radiation in Ranitomeya imitator, the mimic poison frog, in Peru. Analyses of color hue (spectral reflectance) and pattern reveal substantial divergence between morphs. However, we also report that there is a “transition-zone” with mixed phenotypes. Analyses of genetic structure using microsatellite variation reveals some differentiation between populations, but this does not strictly correspond to color pattern divergence. Analyses of gene flow between populations suggest that, while historical levels of gene flow were low, recent levels are high in some cases, including substantial gene flow between some color pattern morphs. We discuss possible explanations for these observations. PMID:23405150
Clustering evolving proteins into homologous families.
Chan, Cheong Xin; Mahbob, Maisarah; Ragan, Mark A
2013-04-08
Clustering sequences into groups of putative homologs (families) is a critical first step in many areas of comparative biology and bioinformatics. The performance of clustering approaches in delineating biologically meaningful families depends strongly on characteristics of the data, including content bias and degree of divergence. New, highly scalable methods have recently been introduced to cluster the very large datasets being generated by next-generation sequencing technologies. However, there has been little systematic investigation of how characteristics of the data impact the performance of these approaches. Using clusters from a manually curated dataset as reference, we examined the performance of a widely used graph-based Markov clustering algorithm (MCL) and a greedy heuristic approach (UCLUST) in delineating protein families coded by three sets of bacterial genomes of different G+C content. Both MCL and UCLUST generated clusters that are comparable to the reference sets at specific parameter settings, although UCLUST tends to under-cluster compositionally biased sequences (G+C content 33% and 66%). Using simulated data, we sought to assess the individual effects of sequence divergence, rate heterogeneity, and underlying G+C content. Performance decreased with increasing sequence divergence, decreasing among-site rate variation, and increasing G+C bias. Two MCL-based methods recovered the simulated families more accurately than did UCLUST. MCL using local alignment distances is more robust across the investigated range of sequence features than are greedy heuristics using distances based on global alignment. Our results demonstrate that sequence divergence, rate heterogeneity and content bias can individually and in combination affect the accuracy with which MCL and UCLUST can recover homologous protein families. For application to data that are more divergent, and exhibit higher among-site rate variation and/or content bias, MCL may often be the better choice, especially if computational resources are not limiting.
Jennings, W Bryan; Wogel, Henrique; Bilate, Marcos; Salles, Rodrigo de O L; Buckup, Paulo A
2016-09-01
The microhylid frogs belonging to the genus Arcovomer have been reported from lowland Atlantic Rainforest in the Brazilian states of Espírito Santo, Rio de Janeiro, and São Paulo. Here, we use DNA barcoding to assess levels of genetic divergence between apparently isolated populations in Espírito Santo and Rio de Janeiro. Our mtDNA data consisting of cytochrome oxidase subunit I (COI) nucleotide sequences reveals 13.2% uncorrected and 30.4% TIM2 + I + Γ corrected genetic divergences between these two populations. This level of divergence exceeds the suggested 10% uncorrected divergence threshold for elevating amphibian populations to candidate species using this marker, which implies that the Espírito Santo population is a species distinct from Arcovomer passarellii. Calibration of our model-corrected sequence divergence estimates suggests that the time of population divergence falls between 12 and 29 million years ago.
Yamada, Kazuhiko; Kamimura, Eikichi; Kondo, Mariko; Tsuchiya, Kimiyuki; Nishida-Umehara, Chizuko; Matsuda, Yoichi
2006-02-01
We molecularly cloned new families of site-specific repetitive DNA sequences from BglII- and EcoRI-digested genomic DNA of the Syrian hamster (Mesocricetus auratus, Cricetrinae, Rodentia) and characterized them by chromosome in situ hybridization and filter hybridization. They were classified into six different types of repetitive DNA sequence families according to chromosomal distribution and genome organization. The hybridization patterns of the sequences were consistent with the distribution of C-positive bands and/or Hoechst-stained heterochromatin. The centromeric major satellite DNA and sex chromosome-specific and telomeric region-specific repetitive sequences were conserved in the same genus (Mesocricetus) but divergent in different genera. The chromosome-2-specific sequence was conserved in two genera, Mesocricetus and Cricetulus, and a low copy number of repetitive sequences on the heterochromatic chromosome arms were conserved in the subfamily Cricetinae but not in the subfamily Calomyscinae. By contrast, the other type of repetitive sequences on the heterochromatic chromosome arms, which had sequence similarities to a LINE sequence of rodents, was conserved through the three subfamilies, Cricetinae, Calomyscinae and Murinae. The nucleotide divergence of the repetitive sequences of heterochromatin was well correlated with the phylogenetic relationships of the Cricetinae species, and each sequence has been independently amplified and diverged in the same genome.
DNA barcodes for dragonflies and damselflies (Odonata) of Mindanao, Philippines.
Casas, Princess Angelie S; Sing, Kong-Wah; Lee, Ping-Shin; Nuñeza, Olga M; Villanueva, Reagan Joseph T; Wilson, John-James
2018-03-01
Reliable species identification provides a sounder basis for use of species in the order Odonata as biological indicators and for their conservation, an urgent concern as many species are threatened with imminent extinction. We generated 134 COI barcodes from 36 morphologically identified species of Odonata collected from Mindanao Island, representing 10 families and 19 genera. Intraspecific sequence divergences ranged from 0 to 6.7% with four species showing more than 2%, while interspecific sequence divergences ranged from 0.5 to 23.3% with seven species showing less than 2%. Consequently, no distinct gap was observed between intraspecific and interspecific DNA barcode divergences. The numerous islands of the Philippine archipelago may have facilitated rapid speciation in the Odonata and resulted in low interspecific sequence divergences among closely related groups of species. This study contributes DNA barcodes for 36 morphologically identified species of Odonata reported from Mindanao including 31 species with no previous DNA barcode records.
Phylogenomic evidence for ancient hybridization in the genomes of living cats (Felidae)
Li, Gang; Davis, Brian W.; Eizirik, Eduardo; Murphy, William J.
2016-01-01
Inter-species hybridization has been recently recognized as potentially common in wild animals, but the extent to which it shapes modern genomes is still poorly understood. Distinguishing historical hybridization events from other processes leading to phylogenetic discordance among different markers requires a well-resolved species tree that considers all modes of inheritance and overcomes systematic problems due to rapid lineage diversification by sampling large genomic character sets. Here, we assessed genome-wide phylogenetic variation across a diverse mammalian family, Felidae (cats). We combined genotypes from a genome-wide SNP array with additional autosomal, X- and Y-linked variants to sample ∼150 kb of nuclear sequence, in addition to complete mitochondrial genomes generated using light-coverage Illumina sequencing. We present the first robust felid time tree that accounts for unique maternal, paternal, and biparental evolutionary histories. Signatures of phylogenetic discordance were abundant in the genomes of modern cats, in many cases indicating hybridization as the most likely cause. Comparison of big cat whole-genome sequences revealed a substantial reduction of X-linked divergence times across several large recombination cold spots, which were highly enriched for signatures of selection-driven post-divergence hybridization between the ancestors of the snow leopard and lion lineages. These results highlight the mosaic origin of modern felid genomes and the influence of sex chromosomes and sex-biased dispersal in post-speciation gene flow. A complete resolution of the tree of life will require comprehensive genomic sampling of biparental and sex-limited genetic variation to identify and control for phylogenetic conflict caused by ancient admixture and sex-biased differences in genomic transmission. PMID:26518481
Vitamin K epoxide reductase complex subunit 1 (Vkorc1) haplotype diversity in mouse priority strains
Song, Ying; Vera, Nicole; Kohn, Michael H
2008-01-01
Background Polymorphisms in the vitamin K-epoxide reductase complex subunit 1 gene, Vkorc1, could affect blood coagulation and other vitamin K-dependent proteins, such as osteocalcin (bone Gla protein, BGP). Here we sequenced the Vkorc1 gene in 40 mouse priority strains. We analyzed Vkorc1 haplotypes with respect to prothrombin time (PT) and bone mineral density and composition (BMD and BMC); phenotypes expected to be vitamin K-dependent and represented by data in the Mouse Phenome Database (MPD). Findings In the commonly used laboratory strains of Mus musculus domesticus we identified only four haplotypes differing in the intron or 5' region sequence of the Vkorc1. Six haplotypes differing by coding and non-coding polymorphisms were identified in the other subspecies of Mus. We detected no significant association of Vkorc1 haplotypes with PT, BMD and BMC within each subspecies of Mus. Vkorc1 haplotype sequences divergence between subspecies was associated with PT, BMD and BMC. Conclusion Phenotypic variation in PT, BMD and BMC within subspecies of Mus, while substantial, appears to be dominated by genetic variation in genes other than the Vkorc1. This was particularly evident for M. m. domesticus, where a single haplotype was observed in conjunction with virtually the entire range of PT, BMD and BMC values of all 5 subspecies of Mus included in this study. Differences in these phenotypes between subspecies also should not be attributed to Vkorc1 variants, but should be viewed as a result of genome wide genetic divergence. PMID:19046458
Llopart, Ana
2018-05-01
The hemizygosity of the X (Z) chromosome fully exposes the fitness effects of mutations on that chromosome and has evolutionary consequences on the relative rates of evolution of X and autosomes. Specifically, several population genetics models predict increased rates of evolution in X-linked loci relative to autosomal loci. This prediction of faster-X evolution has been evaluated and confirmed for both protein coding sequences and gene expression. In the case of faster-X evolution for gene expression divergence, it is often assumed that variation in 5' noncoding sequences is associated with variation in transcript abundance between species but a formal, genomewide test of this hypothesis is still missing. Here, I use whole genome sequence data in Drosophila yakuba and D. santomea to evaluate this hypothesis and report positive correlations between sequence divergence at 5' noncoding sequences and gene expression divergence. I also examine polymorphism and divergence in 9,279 noncoding sequences located at the 5' end of annotated genes and detected multiple signals of positive selection. Notably, I used the traditional synonymous sites as neutral reference to test for adaptive evolution, but I also used bases 8-30 of introns <65 bp, which have been proposed to be a better neutral choice. X-linked genes with high degree of male-biased expression show the most extreme adaptive pattern at 5' noncoding regions, in agreement with faster-X evolution for gene expression divergence and a higher incidence of positively selected recessive mutations. © 2018 The Authors. Molecular Ecology Published by John Wiley & Sons Ltd.
Lobo, Jorge; Ferreira, Maria S; Antunes, Ilisa C; Teixeira, Marcos A L; Borges, Luisa M S; Sousa, Ronaldo; Gomes, Pedro A; Costa, Maria Helena; Cunha, Marina R; Costa, Filipe O
2017-02-01
In this study we compared DNA barcode-suggested species boundaries with morphology-based species identifications in the amphipod fauna of the southern European Atlantic coast. DNA sequences of the cytochrome c oxidase subunit I barcode region (COI-5P) were generated for 43 morphospecies (178 specimens) collected along the Portuguese coast which, together with publicly available COI-5P sequences, produced a final dataset comprising 68 morphospecies and 295 sequences. Seventy-five BINs (Barcode Index Numbers) were assigned to these morphospecies, of which 48 were concordant (i.e., 1 BIN = 1 species), 8 were taxonomically discordant, and 19 were singletons. Twelve species had matching sequences (<2% distance) with conspecifics from distant locations (e.g., North Sea). Seven morphospecies were assigned to multiple, and highly divergent, BINs, including specimens of Corophium multisetosum (18% divergence) and Dexamine spiniventris (16% divergence), which originated from sampling locations on the west coast of Portugal (only about 36 and 250 km apart, respectively). We also found deep divergence (4%-22%) among specimens of seven species from Portugal compared to those from the North Sea and Italy. The detection of evolutionarily meaningful divergence among populations of several amphipod species from southern Europe reinforces the need for a comprehensive re-assessment of the diversity of this faunal group.
Chambers, E Anne; Hebert, Paul D N
2016-01-01
High rates of species discovery and loss have led to the urgent need for more rapid assessment of species diversity in the herpetofauna. DNA barcoding allows for the preliminary identification of species based on sequence divergence. Prior DNA barcoding work on reptiles and amphibians has revealed higher biodiversity counts than previously estimated due to cases of cryptic and undiscovered species. Past studies have provided DNA barcodes for just 14% of the North American herpetofauna, revealing the need for expanded coverage. This study extends the DNA barcode reference library for North American herpetofauna, assesses the utility of this approach in aiding species delimitation, and examines the correspondence between current species boundaries and sequence clusters designated by the BIN system. Sequences were obtained from 730 specimens, representing 274 species (43%) from the North American herpetofauna. Mean intraspecific divergences were 1% and 3%, while average congeneric sequence divergences were 16% and 14% in amphibians and reptiles, respectively. BIN assignments corresponded with current species boundaries in 79% of amphibians, 100% of turtles, and 60% of squamates. Deep divergences (>2%) were noted in 35% of squamate and 16% of amphibian species, and low divergences (<2%) occurred in 12% of reptiles and 23% of amphibians, patterns reflected in BIN assignments. Sequence recovery declined with specimen age, and variation in recovery success was noted among collections. Within collections, barcodes effectively flagged seven mislabeled tissues, and barcode fragments were recovered from five formalin-fixed specimens. This study demonstrates that DNA barcodes can effectively flag errors in museum collections, while BIN splits and merges reveal taxa belonging to deeply diverged or hybridizing lineages. This study is the first effort to compile a reference library of DNA barcodes for herpetofauna on a continental scale.
Chambers, E. Anne; Hebert, Paul D. N.
2016-01-01
Background High rates of species discovery and loss have led to the urgent need for more rapid assessment of species diversity in the herpetofauna. DNA barcoding allows for the preliminary identification of species based on sequence divergence. Prior DNA barcoding work on reptiles and amphibians has revealed higher biodiversity counts than previously estimated due to cases of cryptic and undiscovered species. Past studies have provided DNA barcodes for just 14% of the North American herpetofauna, revealing the need for expanded coverage. Methodology/Principal Findings This study extends the DNA barcode reference library for North American herpetofauna, assesses the utility of this approach in aiding species delimitation, and examines the correspondence between current species boundaries and sequence clusters designated by the BIN system. Sequences were obtained from 730 specimens, representing 274 species (43%) from the North American herpetofauna. Mean intraspecific divergences were 1% and 3%, while average congeneric sequence divergences were 16% and 14% in amphibians and reptiles, respectively. BIN assignments corresponded with current species boundaries in 79% of amphibians, 100% of turtles, and 60% of squamates. Deep divergences (>2%) were noted in 35% of squamate and 16% of amphibian species, and low divergences (<2%) occurred in 12% of reptiles and 23% of amphibians, patterns reflected in BIN assignments. Sequence recovery declined with specimen age, and variation in recovery success was noted among collections. Within collections, barcodes effectively flagged seven mislabeled tissues, and barcode fragments were recovered from five formalin-fixed specimens. Conclusions/Significance This study demonstrates that DNA barcodes can effectively flag errors in museum collections, while BIN splits and merges reveal taxa belonging to deeply diverged or hybridizing lineages. This study is the first effort to compile a reference library of DNA barcodes for herpetofauna on a continental scale. PMID:27116180
A greedy, graph-based algorithm for the alignment of multiple homologous gene lists.
Fostier, Jan; Proost, Sebastian; Dhoedt, Bart; Saeys, Yvan; Demeester, Piet; Van de Peer, Yves; Vandepoele, Klaas
2011-03-15
Many comparative genomics studies rely on the correct identification of homologous genomic regions using accurate alignment tools. In such case, the alphabet of the input sequences consists of complete genes, rather than nucleotides or amino acids. As optimal multiple sequence alignment is computationally impractical, a progressive alignment strategy is often employed. However, such an approach is susceptible to the propagation of alignment errors in early pairwise alignment steps, especially when dealing with strongly diverged genomic regions. In this article, we present a novel accurate and efficient greedy, graph-based algorithm for the alignment of multiple homologous genomic segments, represented as ordered gene lists. Based on provable properties of the graph structure, several heuristics are developed to resolve local alignment conflicts that occur due to gene duplication and/or rearrangement events on the different genomic segments. The performance of the algorithm is assessed by comparing the alignment results of homologous genomic segments in Arabidopsis thaliana to those obtained by using both a progressive alignment method and an earlier graph-based implementation. Especially for datasets that contain strongly diverged segments, the proposed method achieves a substantially higher alignment accuracy, and proves to be sufficiently fast for large datasets including a few dozens of eukaryotic genomes. http://bioinformatics.psb.ugent.be/software. The algorithm is implemented as a part of the i-ADHoRe 3.0 package.
Whole genome investigation of a divergent clade of the pathogen Streptococcus suis
Baig, Abiyad; Weinert, Lucy A.; Peters, Sarah E.; Howell, Kate J.; Chaudhuri, Roy R.; Wang, Jinhong; Holden, Matthew T. G.; Parkhill, Julian; Langford, Paul R.; Rycroft, Andrew N.; Wren, Brendan W.; Tucker, Alexander W.; Maskell, Duncan J.
2015-01-01
Streptococcus suis is a major porcine and zoonotic pathogen responsible for significant economic losses in the pig industry and an increasing number of human cases. Multiple isolates of S. suis show marked genomic diversity. Here, we report the analysis of whole genome sequences of nine pig isolates that caused disease typical of S. suis and had phenotypic characteristics of S. suis, but their genomes were divergent from those of many other S. suis isolates. Comparison of protein sequences predicted from divergent genomes with those from normal S. suis reduced the size of core genome from 793 to only 397 genes. Divergence was clear if phylogenetic analysis was performed on reduced core genes and MLST alleles. Phylogenies based on certain other genes (16S rRNA, sodA, recN, and cpn60) did not show divergence for all isolates, suggesting recombination between some divergent isolates with normal S. suis for these genes. Indeed, there is evidence of recent recombination between the divergent and normal S. suis genomes for 249 of 397 core genes. In addition, phylogenetic analysis based on the 16S rRNA gene and 132 genes that were conserved between the divergent isolates and representatives of the broader Streptococcus genus showed that divergent isolates were more closely related to S. suis. Six out of nine divergent isolates possessed a S. suis-like capsule region with variation in capsular gene sequences but the remaining three did not have a discrete capsule locus. The majority (40/70), of virulence-associated genes in normal S. suis were present in the divergent genomes. Overall, the divergent isolates extend the current diversity of S. suis species but the phenotypic similarities and the large amount of gene exchange with normal S. suis gives insufficient evidence to assign these isolates to a new species or subspecies. Further, sampling and whole genome analysis of more isolates is warranted to understand the diversity of the species. PMID:26583006
Tracking the origins of the cave bear (Ursus spelaeus) by mitochondrial DNA sequencing.
Hänni, C; Laudet, V; Stehelin, D; Taberlet, P
1994-01-01
The different European populations of Ursus arctos, the brown bear, were recently studied for mitochondrial DNA polymorphism. Two clearly distinct lineages (eastern and western) were found, which may have diverged approximately 850,000 years ago. In this context, it was interesting to study the cave bear, Ursus spelaeus, a species which became extinct 20,000 years ago. In this study, we have amplified and sequenced a fragment of 139-bp in the mitochondrial DNA control region of a 40,000-year-old specimen of U. spelaeus. Phylogenetic reconstructions using this sequence and the European brown bear sequences already published suggest that U. spelaeus diverged from an early offshoot of U. arctos--i.e., approximately at the same time as the divergence of the two main lineages of U. arctos. This divergence probably took place at the earliest glaciation, likely due to geographic separation during the earlier Quaternary cold periods. This result is in agreement with the paleontological data available and suggests a good correspondence between molecular and morphological data. Images PMID:7991628
Zhang, Honghai; Chen, Lei
2011-03-01
The dhole (Cuon alpinus) is the only existent species in the genus Cuon (Carnivora: Canidae). In the present study, the complete mitochondrial genome of the dhole was sequenced. The total length is 16672 base pairs which is the shortest in Canidae. Sequence analysis revealed that most mitochondrial genomic functional regions were highly consistent among canid animals except the CSB domain of the control region. The difference in length among the Canidae mitochondrial genome sequences is mainly due to the number of short segments of tandem repeated in the CSB domain. Phylogenetic analysis was progressed based on the concatenated data set of 14 mitochondrial genes of 8 canid animals by using maximum parsimony (MP), maximum likelihood (ML) and Bayesian (BI) inference methods. The genera Vulpes and Nyctereutes formed a sister group and split first within Canidae, followed by that in the Cuon. The divergence in the genus Canis was the latest. The divarication of domestic dogs after that of the Canis lupus laniger is completely supported by all the three topologies. Pairwise sequence divergence data of different mitochondrial genes among canid animals were also determined. Except for the synonymous substitutions in protein-coding genes, the control region exhibits the highest sequence divergences. The synonymous rates are approximately two to six times higher than those of the non-synonymous sites except for a slightly higher rate in the non-synonymous substitution between Cuon alpinus and Vulpes vulpes. 16S rRNA genes have a slightly faster sequence divergence than 12S rRNA and tRNA genes. Based on nucleotide substitutions of tRNA genes and rRNA genes, the times since divergence between dhole and other canid animals, and between domestic dogs and three subspecies of wolves were evaluated. The result indicates that Vulpes and Nyctereutes have a close phylogenetic relationship and the divergence of Nyctereutes is a little earlier. The Tibetan wolf may be an archaic pedigree within wolf subspecies. The genetic distance between wolves and domestic dogs is less than that among different subspecies of wolves. The domestication of dogs was about 1.56-1.92 million years ago or even earlier.
Bayesian estimation of post-Messinian divergence times in Balearic Island lizards.
Brown, R P; Terrasa, B; Pérez-Mellado, V; Castro, J A; Hoskisson, P A; Picornell, A; Ramon, M M
2008-07-01
Phylogenetic relationships and timings of major cladogenesis events are investigated in the Balearic Island lizards Podarcislilfordi and P.pityusensis using 2675bp of mitochondrial and nuclear DNA sequences. Partitioned Bayesian and Maximum Parsimony analyses provided a well-resolved phylogeny with high node-support values. Bayesian MCMC estimation of node dates was investigated by comparing means of posterior distributions from different subsets of the sequence against the most robust analysis which used multiple partitions and allowed for rate heterogeneity among branches under a rate-drift model. Evolutionary rates were systematically underestimated and thus divergence times overestimated when sequences containing lower numbers of variable sites were used (based on ingroup node constraints). The following analyses allowed the best recovery of node times under the constant-rate (i.e., perfect clock) model: (i) all cytochrome b sequence (partitioned by codon position), (ii) cytochrome b (codon position 3 alone), (iii) NADH dehydrogenase (subunits 1 and 2; partitioned by codon position), (iv) cytochrome b and NADH dehydrogenase sequence together (six gene-codon partitions), (v) all unpartitioned sequence, (vi) a full multipartition analysis (nine partitions). Of these, only (iv) and (vi) performed well under the rate-drift model. These findings have significant implications for dating of recent divergence times in other taxa. The earliest P.lilfordi cladogenesis event (divergence of Menorcan populations), occurred before the end of the Pliocene, some 2.6Ma. Subsequent events led to a West Mallorcan lineage (2.0Ma ago), followed 1.2Ma ago by divergence of populations from the southern part of the Cabrera archipelago from a widely-distributed group from north Cabrera, northern and southern Mallorcan islets. Divergence within P.pityusensis is more recent with the main Ibiza and Formentera clades sharing a common ancestor at about 1.0Ma ago. Climatic and sea level changes are likely to have initiated cladogenesis, with lineages making secondary contact during periodic landbridge formation. This oscillating cross-archipelago pattern in which ancient divergence is followed by repeated contact resembles that seen between East-West refugia populations from mainland Europe.
Near, Thomas J; Dornburg, Alex; Friedman, Matt
2014-11-01
The Gonorynchiformes are the sister lineage of the species-rich Otophysi and provide important insights into the diversification of ostariophysan fishes. Phylogenies of gonorynchiforms inferred using morphological characters and mtDNA gene sequences provide differing resolutions with regard to the sister lineage of all other gonorynchiforms (Chanos vs. Gonorynchus) and support for monophyly of the two miniaturized lineages Cromeria and Grasseichthys. In this study the phylogeny and divergence times of gonorynchiforms are investigated with DNA sequences sampled from nine nuclear genes and a published morphological character matrix. Bayesian phylogenetic analyses reveal substantial congruence among individual gene trees with inferences from eight genes placing Gonorynchus as the sister lineage to all other gonorynchiforms. Seven gene trees resolve Cromeria and Grasseichthys as a clade, supporting previous inferences using morphological characters. Phylogenies resulting from either concatenating the nuclear genes, performing a multispecies coalescent species tree analysis, or combining the morphological and nuclear gene DNA sequences resolve Gonorynchus as the living sister lineage of all other gonorynchiforms, strongly support the monophyly of Cromeria and Grasseichthys, and resolve a clade containing Parakneria, Cromeria, and Grasseichthys. The morphological dataset, which includes 13 gonorynchiform fossil taxa that range in age from Early Cretaceous to Eocene, was analyzed in combination with DNA sequences from the nine nuclear genes and a relaxed molecular clock to estimate times of evolutionary divergence. This "tip dating" strategy accommodates uncertainty in the phylogenetic resolution of fossil taxa that provide calibration information in the relaxed molecular clock analysis. The estimated age of the most recent common ancestor (MRCA) of living gonorynchiforms is slightly older than estimates from previous node dating efforts, but the molecular tip dating estimated ages of Kneriinae (Kneria, Parakneria, Cromeria, and Grasseichthys) and the two paedomorphic lineages, Cromeria and Grasseichthys, are considerably younger. Copyright © 2014 Elsevier Inc. All rights reserved.
The Genetic Diversity of Influenza A Viruses in Wild Birds in Peru
Nelson, Martha I.; Pollett, Simon; Ghersi, Bruno; Silva, Maria; Simons, Mark P.; Icochea, Eliana; Gonzalez, Armando E.; Segovia, Karen; Kasper, Matthew R.; Montgomery, Joel M.; Bausch, Daniel G.
2016-01-01
Our understanding of the global ecology of avian influenza A viruses (AIVs) is impeded by historically low levels of viral surveillance in Latin America. Through sampling and whole-genome sequencing of 31 AIVs from wild birds in Peru, we identified 10 HA subtypes (H1-H4, H6-H7, H10-H13) and 8 NA subtypes (N1-N3, N5-N9). The majority of Peruvian AIVs were closely related to AIVs found in North America. However, unusual reassortants, including a H13 virus containing a PA segment related to extremely divergent Argentinian viruses, suggest that substantial AIV diversity circulates undetected throughout South America. PMID:26784331
Koloniuk, Igor; Fránová, Jana; Sarkisova, Tatiana; Přibylová, Jaroslava
2018-05-04
Strawberry crinkle disease is one of the major diseases that threatens strawberry production. Although the biological properties of the agent, strawberry crinkle virus (SCV), have been thoroughly investigated, its complete genome sequence has never been published. Existing RT-PCR-based detection relies on a partial sequence of the L protein gene, presumably the least expressed viral gene. Here, we present complete sequences of two divergent SCV isolates co-infecting a single plant, Fragaria x ananassa cv. Čačanská raná.
Lopez, Philippe; Halary, Sébastien; Bapteste, Eric
2015-10-26
Microbial genetic diversity is often investigated via the comparison of relatively similar 16S molecules through multiple alignments between reference sequences and novel environmental samples using phylogenetic trees, direct BLAST matches, or phylotypes counts. However, are we missing novel lineages in the microbial dark universe by relying on standard phylogenetic and BLAST methods? If so, how can we probe that universe using alternative approaches? We performed a novel type of multi-marker analysis of genetic diversity exploiting the topology of inclusive sequence similarity networks. Our protocol identified 86 ancient gene families, well distributed and rarely transferred across the 3 domains of life, and retrieved their environmental homologs among 10 million predicted ORFs from human gut samples and other metagenomic projects. Numerous highly divergent environmental homologs were observed in gut samples, although the most divergent genes were over-represented in non-gut environments. In our networks, most divergent environmental genes grouped exclusively with uncultured relatives, in maximal cliques. Sequences within these groups were under strong purifying selection and presented a range of genetic variation comparable to that of a prokaryotic domain. Many genes families included environmental homologs that were highly divergent from cultured homologs: in 79 gene families (including 18 ribosomal proteins), Bacteria and Archaea were less divergent than some groups of environmental sequences were to any cultured or viral homologs. Moreover, some groups of environmental homologs branched very deeply in phylogenetic trees of life, when they were not too divergent to be aligned. These results underline how limited our understanding of the most diverse elements of the microbial world remains, and encourage a deeper exploration of natural communities and their genetic resources, hinting at the possibility that still unknown yet major divisions of life have yet to be discovered.
A field ornithologist’s guide to genomics: Practical considerations for ecology and conservation
Oyler-McCance, Sara J.; Oh, Kevin; Langin, Kathryn; Aldridge, Cameron L.
2016-01-01
Vast improvements in sequencing technology have made it practical to simultaneously sequence millions of nucleotides distributed across the genome, opening the door for genomic studies in virtually any species. Ornithological research stands to benefit in three substantial ways. First, genomic methods enhance our ability to parse and simultaneously analyze both neutral and non-neutral genomic regions, thus providing insight into adaptive evolution and divergence. Second, the sheer quantity of sequence data generated by current sequencing platforms allows increased precision and resolution in analyses. Third, high-throughput sequencing can benefit applications that focus on a small number of loci that are otherwise prohibitively expensive, time-consuming, and technically difficult using traditional sequencing methods. These advances have improved our ability to understand evolutionary processes like speciation and local adaptation, but they also offer many practical applications in the fields of population ecology, migration tracking, conservation planning, diet analyses, and disease ecology. This review provides a guide for field ornithologists interested in incorporating genomic approaches into their research program, with an emphasis on techniques related to ecology and conservation. We present a general overview of contemporary genomic approaches and methods, as well as important considerations when selecting a genomic technique. We also discuss research questions that are likely to benefit from utilizing high-throughput sequencing instruments, highlighting select examples from recent avian studies.
Park, Chang-Gyu; Min, Sujeong; Lee, Gwan-Seok; Kim, Sora; Lee, Yerim; Lee, Seunghwan; Hong, Ki-Jeong; Wilson, Stephen W; Akimoto, Shin-Ichi; Lee, Wonhoon
2016-08-01
Metcalfa pruinosa (Say, 1830) (Hemiptera: Flatidae) has caused substantial agricultural damage since its recent introduction to the Republic of Korea; however, the source of this introduction is still unclear. To examine the genetic divergence and phylogenetic relationships among several populations of M. pruinosa from Korea and foreign countries, 251 COI sequences from 251 samples collected from Korea, France, Italy, Spain, Slovenia, and the United States were newly analyzed, together with seven published COI sequences from Canada. In total, 19 haplotypes were detected from the 258 COI sequences, and three haplotypes, H1, H3, and H9, were detected from samples in Korea. The MJ network and Bayesian inference revealed that the three haplotypes of Korea were closely connected with samples of Italy, Spain, Slovenia, France, and the United States. Our study revealed the possibility of multiple invasions of M. pruinosa from Europe and/or North America into Korea. © The Authors 2016. Published by Oxford University Press on behalf of Entomological Society of America. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Blazier, J. Chris; Ruhlman, Tracey A.; Weng, Mao-Lun; Rehman, Sumaiyah K.; Sabir, Jamal S. M.; Jansen, Robert K.
2016-01-01
Genes for the plastid-encoded RNA polymerase (PEP) persist in the plastid genomes of all photosynthetic angiosperms. However, three unrelated lineages (Annonaceae, Passifloraceae and Geraniaceae) have been identified with unusually divergent open reading frames (ORFs) in the conserved region of rpoA, the gene encoding the PEP α subunit. We used sequence-based approaches to evaluate whether these genes retain function. Both gene sequences and complete plastid genome sequences were assembled and analyzed from each of the three angiosperm families. Multiple lines of evidence indicated that the rpoA sequences are likely functional despite retaining as low as 30% nucleotide sequence identity with rpoA genes from outgroups in the same angiosperm order. The ratio of non-synonymous to synonymous substitutions indicated that these genes are under purifying selection, and bioinformatic prediction of conserved domains indicated that functional domains are preserved. One of the lineages (Pelargonium, Geraniaceae) contains species with multiple rpoA-like ORFs that show evidence of ongoing inter-paralog gene conversion. The plastid genomes containing these divergent rpoA genes have experienced extensive structural rearrangement, including large expansions of the inverted repeat. We propose that illegitimate recombination, not positive selection, has driven the divergence of rpoA. PMID:27087667
Ashfaq, Muhammad; Prosser, Sean; Nasir, Saima; Masood, Mariyam; Ratnasingham, Sujeevan; Hebert, Paul D. N.
2015-01-01
The study analyzes sequence variation of two mitochondrial genes (COI, cytb) in Pediculus humanus from three countries (Egypt, Pakistan, South Africa) that have received little prior attention, and integrates these results with prior data. Analysis indicates a maximum K2P distance of 10.3% among 960 COI sequences and 13.8% among 479 cytb sequences. Three analytical methods (BIN, PTP, ABGD) reveal five concordant OTUs for COI and cytb. Neighbor-Joining analysis of the COI sequences confirm five clusters; three corresponding to previously recognized mitochondrial clades A, B, C and two new clades, “D” and “E”, showing 2.3% and 2.8% divergence from their nearest neighbors (NN). Cytb data corroborate five clusters showing that clades “D” and “E” are both 4.6% divergent from their respective NN clades. Phylogenetic analysis supports the monophyly of all clusters recovered by NJ analysis. Divergence time estimates suggest that the earliest split of P. humanus clades occured slightly more than one million years ago (MYa) and the latest about 0.3 MYa. Sequence divergences in COI and cytb among the five clades of P. humanus are 10X those in their human host, a difference that likely reflects both rate acceleration and the acquisition of lice clades from several archaic hominid lineages. PMID:26373806
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kibui, J.
Osteoarthritis (OA) is a debilitating joint disease characterized by cartilage degradation which prompts pain, stiffness and swelling. Contributing factors include age, genetics, obesity, injury and overuse of joints. OA is defined by an acute phase and a chronic phase whereby inflammation and degeneration of articular cartilage and other tissues is followed by joint pain and limited mobility. Patients remain asymptomatic until substantial joint damage has occurred and therefore rely on long term surgical joint replacement and pain management as their sole treatment options. For this reason, there is an increasing need to identify early stage osteoarthritis biomarkers. Our study aimedmore » to identify and characterize gene expression variances in 3 different mouse strains (STR/ort, C57BL/6 and MRL/MpJ) with different susceptibility to post traumatic osteoarthritis (PTOA). Through RNA sequence analysis of whole knee joint RNA, we identified differentially expressed genes associated with the initial stages of PTOA in relation to mice with divergent phenotypes. These results will help elucidate potential mechanisms responsible for PTOA outcomes.« less
Perfus-Barbeoch, Laetitia; Da Rocha, Martine; Sallet, Erika; Bailly-Bechet, Marc; Castagnone-Sereno, Philippe; Flot, Jean-François; Kozlowski, Djampa K.; Cazareth, Julie; Couloux, Arnaud; Da Silva, Corinne; Guy, Julie; Kim-Jo, Yu-Jin; Rancurel, Corinne; Abad, Pierre; Wincker, Patrick
2017-01-01
Root-knot nematodes (genus Meloidogyne) exhibit a diversity of reproductive modes ranging from obligatory sexual to fully asexual reproduction. Intriguingly, the most widespread and devastating species to global agriculture are those that reproduce asexually, without meiosis. To disentangle this surprising parasitic success despite the absence of sex and genetic exchanges, we have sequenced and assembled the genomes of three obligatory ameiotic and asexual Meloidogyne. We have compared them to those of relatives able to perform meiosis and sexual reproduction. We show that the genomes of ameiotic asexual Meloidogyne are large, polyploid and made of duplicated regions with a high within-species average nucleotide divergence of ~8%. Phylogenomic analysis of the genes present in these duplicated regions suggests that they originated from multiple hybridization events and are thus homoeologs. We found that up to 22% of homoeologous gene pairs were under positive selection and these genes covered a wide spectrum of predicted functional categories. To biologically assess functional divergence, we compared expression patterns of homoeologous gene pairs across developmental life stages using an RNAseq approach in the most economically important asexually-reproducing nematode. We showed that >60% of homoeologous gene pairs display diverged expression patterns. These results suggest a substantial functional impact of the genome structure. Contrasting with high within-species nuclear genome divergence, mitochondrial genome divergence between the three ameiotic asexuals was very low, signifying that these putative hybrids share a recent common maternal ancestor. Transposable elements (TE) cover a ~1.7 times higher proportion of the genomes of the ameiotic asexual Meloidogyne compared to the sexual relative and might also participate in their plasticity. The intriguing parasitic success of asexually-reproducing Meloidogyne species could be partly explained by their TE-rich composite genomes, resulting from allopolyploidization events, and promoting plasticity and functional divergence between gene copies in the absence of sex and meiosis. PMID:28594822
Bagley, Justin C; Johnson, Jerald B
2014-01-01
A central goal of comparative phylogeography is determining whether codistributed species experienced (1) concerted evolutionary responses to past geological and climatic events, indicated by congruent spatial and temporal patterns (“concerted-response hypothesis”); (2) independent responses, indicated by spatial incongruence (“independent-response hypothesis”); or (3) multiple responses (“multiple-response hypothesis”), indicated by spatial congruence but temporal incongruence (“pseudocongruence”) or spatial and temporal incongruence (“pseudoincongruence”). We tested these competing hypotheses using DNA sequence data from three livebearing fish species codistributed in the Nicaraguan depression of Central America (Alfaro cultratus, Poecilia gillii, and Xenophallus umbratilis) that we predicted might display congruent responses due to co-occurrence in identical freshwater drainages. Spatial analyses recovered different subdivisions of genetic structure for each species, despite shared finer-scale breaks in northwestern Costa Rica (also supported by phylogenetic results). Isolation-with-migration models estimated incongruent timelines of among-region divergences, with A. cultratus and Xenophallus populations diverging over Miocene–mid-Pleistocene while P. gillii populations diverged over mid-late Pleistocene. Approximate Bayesian computation also lent substantial support to multiple discrete divergences over a model of simultaneous divergence across shared spatial breaks (e.g., Bayes factor [B10] = 4.303 for Ψ [no. of divergences] > 1 vs. Ψ = 1). Thus, the data support phylogeographic pseudoincongruence consistent with the multiple-response hypothesis. Model comparisons also indicated incongruence in historical demography, for example, support for intraspecific late Pleistocene population growth was unique to P. gillii, despite evidence for finer-scale population expansions in the other taxa. Empirical tests for phylogeographic congruence indicate that multiple evolutionary responses to historical events have shaped the population structure of freshwater species codistributed within the complex landscapes in/around the Nicaraguan depression. Recent community assembly through different routes (i.e., different past distributions or colonization routes), and intrinsic ecological differences among species, has likely contributed to the unique phylogeographical patterns displayed by these Neotropical fishes. PMID:24967085
Phan, Isabelle Q. H.; Scheib, Holger; Subramanian, Sandhya; Edwards, Thomas E.; Lehman, Stephanie S.; Piitulainen, Hanna; Sayeedur Rahman, M.; Rennoll-Bankert, Kristen E.; Staker, Bart L.; Taira, Suvi; Stacy, Robin; Myler, Peter J.; Azad, Abdu F.
2015-01-01
ABSTRACT Prokaryotes use type IV secretion systems (T4SSs) to translocate substrates (e.g., nucleoprotein, DNA, and protein) and/or elaborate surface structures (i.e., pili or adhesins). Bacterial genomes may encode multiple T4SSs, e.g., there are three functionally divergent T4SSs in some Bartonella species (vir, vbh, and trw). In a unique case, most rickettsial species encode a T4SS (rvh) enriched with gene duplication. Within single genomes, the evolutionary and functional implications of cross-system interchangeability of analogous T4SS protein components remains poorly understood. To lend insight into cross-system interchangeability, we analyzed the VirB8 family of T4SS channel proteins. Crystal structures of three VirB8 and two TrwG Bartonella proteins revealed highly conserved C-terminal periplasmic domain folds and dimerization interfaces, despite tremendous sequence divergence. This implies remarkable structural constraints for VirB8 components in the assembly of a functional T4SS. VirB8/TrwG heterodimers, determined via bacterial two-hybrid assays and molecular modeling, indicate that differential expression of trw and vir systems is the likely barrier to VirB8-TrwG interchangeability. We also determined the crystal structure of Rickettsia typhi RvhB8-II and modeled its coexpressed divergent paralog RvhB8-I. Remarkably, while RvhB8-I dimerizes and is structurally similar to other VirB8 proteins, the RvhB8-II dimer interface deviates substantially from other VirB8 structures, potentially preventing RvhB8-I/RvhB8-II heterodimerization. For the rvh T4SS, the evolution of divergent VirB8 paralogs implies a functional diversification that is unknown in other T4SSs. Collectively, our data identify two different constraints (spatiotemporal for Bartonella trw and vir T4SSs and structural for rvh T4SSs) that mediate the functionality of multiple divergent T4SSs within a single bacterium. PMID:26646013
Bernard, Guillaume; Chan, Cheong Xin; Ragan, Mark A
2016-07-01
Alignment-free (AF) approaches have recently been highlighted as alternatives to methods based on multiple sequence alignment in phylogenetic inference. However, the sensitivity of AF methods to genome-scale evolutionary scenarios is little known. Here, using simulated microbial genome data we systematically assess the sensitivity of nine AF methods to three important evolutionary scenarios: sequence divergence, lateral genetic transfer (LGT) and genome rearrangement. Among these, AF methods are most sensitive to the extent of sequence divergence, less sensitive to low and moderate frequencies of LGT, and most robust against genome rearrangement. We describe the application of AF methods to three well-studied empirical genome datasets, and introduce a new application of the jackknife to assess node support. Our results demonstrate that AF phylogenomics is computationally scalable to multi-genome data and can generate biologically meaningful phylogenies and insights into microbial evolution.
Limborg, Morten T.; Larson, Wesley; Shedd, Kyle; Seeb, Lisa W.; Seeb, James E.
2017-01-01
Preservation of heritable ecological diversity within species and populations is a key challenge for managing natural resources and wild populations. Salmonid fish are iconic and socio-economically important species for commercial, aquaculture, and recreational fisheries across the globe. Many salmonids are known to exhibit ecological divergence within species, including distinct feeding ecotypes within the same lakes. Here we used 5559 SNPs, derived from RAD sequencing, to perform population genetic comparisons between two dietary ecotypes of sockeye salmon (Oncorhynchus nerka) in Jo-Jo Lake, Alaska (USA). We tested the standing hypothesis that these two ecotypes are currently diverging as a result of adaptation to distinct dietary niches; results support earlier conclusions of a single panmictic population. The RAD sequence data revealed 40 new SNPs not previously detected in the species, and our sequence data can be used in future studies of ecotypic diversity in salmonid species.
Wu, Baojun; Buljic, Adnan; Hao, Weilong
2015-10-01
The frequency of horizontal gene transfer (HGT) in mitochondrial DNA varies substantially. In plants, HGT is relatively common, whereas in animals it appears to be quite rare. It is of considerable importance to understand mitochondrial HGT across the major groups of eukaryotes at a genome-wide level, but so far this has been well studied only in plants. In this study, we generated ten new mitochondrial genome sequences and analyzed 40 mitochondrial genomes from the Saccharomycetaceae to assess the magnitude and nature of mitochondrial HGT in yeasts. We provide evidence for extensive, homologous-recombination-mediated, mitochondrial-to-mitochondrial HGT occurring throughout yeast mitochondrial genomes, leading to genomes that are highly chimeric evolutionarily. This HGT has led to substantial intraspecific polymorphism in both sequence content and sequence divergence, which to our knowledge has not been previously documented in any mitochondrial genome. The unexpectedly high frequency of mitochondrial HGT in yeast may be driven by frequent mitochondrial fusion, relatively low mitochondrial substitution rates and pseudohyphal fusion to produce heterokaryons. These findings suggest that mitochondrial HGT may play an important role in genome evolution of a much broader spectrum of eukaryotes than previously appreciated and that there is a critical need to systematically study the frequency, extent, and importance of mitochondrial HGT across eukaryotes. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Aversano, Riccardo; Contaldi, Felice; Ercolano, Maria Raffaella; Grosso, Valentina; Iorizzo, Massimo; Tatino, Filippo; Xumerle, Luciano; Dal Molin, Alessandra; Avanzato, Carla; Ferrarini, Alberto; Delledonne, Massimo; Sanseverino, Walter; Cigliano, Riccardo Aiese; Capella-Gutierrez, Salvador; Gabaldón, Toni; Frusciante, Luigi; Bradeen, James M.; Carputo, Domenico
2015-01-01
Here, we report the draft genome sequence of Solanum commersonii, which consists of ∼830 megabases with an N50 of 44,303 bp anchored to 12 chromosomes, using the potato (Solanum tuberosum) genome sequence as a reference. Compared with potato, S. commersonii shows a striking reduction in heterozygosity (1.5% versus 53 to 59%), and differences in genome sizes were mainly due to variations in intergenic sequence length. Gene annotation by ab initio prediction supported by RNA-seq data produced a catalog of 1703 predicted microRNAs, 18,882 long noncoding RNAs of which 20% are shown to target cold-responsive genes, and 39,290 protein-coding genes with a significant repertoire of nonredundant nucleotide binding site-encoding genes and 126 cold-related genes that are lacking in S. tuberosum. Phylogenetic analyses indicate that domesticated potato and S. commersonii lineages diverged ∼2.3 million years ago. Three duplication periods corresponding to genome enrichment for particular gene families related to response to salt stress, water transport, growth, and defense response were discovered. The draft genome sequence of S. commersonii substantially increases our understanding of the domesticated germplasm, facilitating translation of acquired knowledge into advances in crop stability in light of global climate and environmental changes. PMID:25873387
Molecular phylogenetic analysis of non-sexually transmitted strains of Haemophilus ducreyi.
Gaston, Jordan R; Roberts, Sally A; Humphreys, Tricia L
2015-01-01
Haemophilus ducreyi, the etiologic agent of chancroid, has been previously reported to show genetic variance in several key virulence factors, placing strains of the bacterium into two genetically distinct classes. Recent studies done in yaws-endemic areas of the South Pacific have shown that H. ducreyi is also a major cause of cutaneous limb ulcers (CLU) that are not sexually transmitted. To genetically assess CLU strains relative to the previously described class I, class II phylogenetic hierarchy, we examined nucleotide sequence diversity at 11 H. ducreyi loci, including virulence and housekeeping genes, which encompass approximately 1% of the H. ducreyi genome. Sequences for all 11 loci indicated that strains collected from leg ulcers exhibit DNA sequences homologous to class I strains of H. ducreyi. However, sequences for 3 loci, including a hemoglobin receptor (hgbA), serum resistance protein (dsrA), and a collagen adhesin (ncaA) contained informative amounts of variation. Phylogenetic analyses suggest that these non-sexually transmitted strains of H. ducreyi comprise a sub-clonal population within class I strains of H. ducreyi. Molecular dating suggests that CLU strains are the most recently developed, having diverged approximately 0.355 million years ago, fourteen times more recently than the class I/class II divergence. The CLU strains' divergence falls after the divergence of humans from chimpanzees, making it the first known H. ducreyi divergence event directly influenced by the selective pressures accompanying human hosts.
Jiang, Haowei; Barker, Stephen C; Shao, Renfu
2013-01-01
Blood-sucking lice of humans have extensively fragmented mitochondrial (mt) genomes. Human head louse and body louse have their 37 mt genes on 20 minichromosomes. In human pubic louse, the 34 mt genes known are on 14 minichromosomes. To understand the process of mt genome fragmentation in the blood-sucking lice of mammals, we sequenced the mt genomes of the domestic pig louse, Haematopinus suis, and the wild pig louse, H. apri, which diverged from human lice approximately 65 Ma. The 37 mt genes of the pig lice are on nine circular minichromosomes; each minichromosome is 3-4 kb in size. The pig lice have four genes per minichromosome on average, in contrast to two genes per minichromosome in the human lice. One minichromosome of the pig lice has eight genes and is the most gene-rich minichromosome found in the sucking lice. Our results indicate substantial variation in the rate and extent of mt genome fragmentation among different lineages of the sucking lice.
Jiang, Haowei; Barker, Stephen C.; Shao, Renfu
2013-01-01
Blood-sucking lice of humans have extensively fragmented mitochondrial (mt) genomes. Human head louse and body louse have their 37 mt genes on 20 minichromosomes. In human pubic louse, the 34 mt genes known are on 14 minichromosomes. To understand the process of mt genome fragmentation in the blood-sucking lice of mammals, we sequenced the mt genomes of the domestic pig louse, Haematopinus suis, and the wild pig louse, H. apri, which diverged from human lice approximately 65 Ma. The 37 mt genes of the pig lice are on nine circular minichromosomes; each minichromosome is 3–4 kb in size. The pig lice have four genes per minichromosome on average, in contrast to two genes per minichromosome in the human lice. One minichromosome of the pig lice has eight genes and is the most gene-rich minichromosome found in the sucking lice. Our results indicate substantial variation in the rate and extent of mt genome fragmentation among different lineages of the sucking lice. PMID:23781098
Duplication and selection in the evolution of primate β-defensin genes
Semple, Colin AM; Rolfe, Mark; Dorin, Julia R
2003-01-01
Background Innate immunity is the first line of defense against microorganisms in vertebrates and acts by providing an initial barrier to microorganisms and triggering adaptive immune responses. Peptides such as β-defensins are an important component of this defense, providing a broad spectrum of antimicrobial activity against bacteria, fungi, mycobacteria and several enveloped viruses. β-defensins are small cationic peptides that vary in their expression patterns and spectrum of pathogen specificity. Disruptions in β-defensin function have been implicated in human diseases, including cystic fibrosis, and a fuller understanding of the variety, function and evolution of human β-defensins might form the basis for novel therapies. Here we use a combination of laboratory and computational techniques to characterize the main human β-defensin locus on chromosome 8p22-p23. Results In addition to known genes in the region we report the genomic structures and expression patterns of four novel human β-defensin genes and a related pseudogene. These genes show an unusual pattern of evolution, with rapid divergence between second exon sequences that encode the mature β-defensin peptides matched by relative stasis in first exons that encode signal peptides. Conclusions We conclude that the 8p22-p23 locus has evolved by successive rounds of duplication followed by substantial divergence involving positive selection, to produce a diverse cluster of paralogous genes established before the human-baboon divergence more than 23 million years ago. Positive selection, disproportionately favoring alterations in the charge of amino-acid residues, is implicated as driving second exon divergence in these genes. PMID:12734011
Divergent transcription is associated with promoters of transcriptional regulators
2013-01-01
Background Divergent transcription is a wide-spread phenomenon in mammals. For instance, short bidirectional transcripts are a hallmark of active promoters, while longer transcripts can be detected antisense from active genes in conditions where the RNA degradation machinery is inhibited. Moreover, many described long non-coding RNAs (lncRNAs) are transcribed antisense from coding gene promoters. However, the general significance of divergent lncRNA/mRNA gene pair transcription is still poorly understood. Here, we used strand-specific RNA-seq with high sequencing depth to thoroughly identify antisense transcripts from coding gene promoters in primary mouse tissues. Results We found that a substantial fraction of coding-gene promoters sustain divergent transcription of long non-coding RNA (lncRNA)/mRNA gene pairs. Strikingly, upstream antisense transcription is significantly associated with genes related to transcriptional regulation and development. Their promoters share several characteristics with those of transcriptional developmental genes, including very large CpG islands, high degree of conservation and epigenetic regulation in ES cells. In-depth analysis revealed a unique GC skew profile at these promoter regions, while the associated coding genes were found to have large first exons, two genomic features that might enforce bidirectional transcription. Finally, genes associated with antisense transcription harbor specific H3K79me2 epigenetic marking and RNA polymerase II enrichment profiles linked to an intensified rate of early transcriptional elongation. Conclusions We concluded that promoters of a class of transcription regulators are characterized by a specialized transcriptional control mechanism, which is directly coupled to relaxed bidirectional transcription. PMID:24365181
Comparative analysis of gene regulatory networks: from network reconstruction to evolution.
Thompson, Dawn; Regev, Aviv; Roy, Sushmita
2015-01-01
Regulation of gene expression is central to many biological processes. Although reconstruction of regulatory circuits from genomic data alone is therefore desirable, this remains a major computational challenge. Comparative approaches that examine the conservation and divergence of circuits and their components across strains and species can help reconstruct circuits as well as provide insights into the evolution of gene regulatory processes and their adaptive contribution. In recent years, advances in genomic and computational tools have led to a wealth of methods for such analysis at the sequence, expression, pathway, module, and entire network level. Here, we review computational methods developed to study transcriptional regulatory networks using comparative genomics, from sequence to functional data. We highlight how these methods use evolutionary conservation and divergence to reliably detect regulatory components as well as estimate the extent and rate of divergence. Finally, we discuss the promise and open challenges in linking regulatory divergence to phenotypic divergence and adaptation.
Divergence of Gene Body DNA Methylation and Evolution of Plant Duplicate Genes
Wang, Jun; Marowsky, Nicholas C.; Fan, Chuanzhu
2014-01-01
It has been shown that gene body DNA methylation is associated with gene expression. However, whether and how deviation of gene body DNA methylation between duplicate genes can influence their divergence remains largely unexplored. Here, we aim to elucidate the potential role of gene body DNA methylation in the fate of duplicate genes. We identified paralogous gene pairs from Arabidopsis and rice (Oryza sativa ssp. japonica) genomes and reprocessed their single-base resolution methylome data. We show that methylation in paralogous genes nonlinearly correlates with several gene properties including exon number/gene length, expression level and mutation rate. Further, we demonstrated that divergence of methylation level and pattern in paralogs indeed positively correlate with their sequence and expression divergences. This result held even after controlling for other confounding factors known to influence the divergence of paralogs. We observed that methylation level divergence might be more relevant to the expression divergence of paralogs than methylation pattern divergence. Finally, we explored the mechanisms that might give rise to the divergence of gene body methylation in paralogs. We found that exonic methylation divergence more closely correlates with expression divergence than intronic methylation divergence. We show that genomic environments (e.g., flanked by transposable elements and repetitive sequences) of paralogs generated by various duplication mechanisms are associated with the methylation divergence of paralogs. Overall, our results suggest that the changes in gene body DNA methylation could provide another avenue for duplicate genes to develop differential expression patterns and undergo different evolutionary fates in plant genomes. PMID:25310342
Phylogeography of the Central American lancehead Bothrops asper (SERPENTES: VIPERIDAE)
Parkinson, Christopher L.; Daza, Juan M.; Wüster, Wolfgang
2017-01-01
The uplift and final connection of the Central American land bridge is considered the major event that allowed biotic exchange between vertebrate lineages of northern and southern origin in the New World. However, given the complex tectonics that shaped Middle America, there is still substantial controversy over details of this geographical reconnection, and its role in determining biogeographic patterns in the region. Here, we examine the phylogeography of Bothrops asper, a widely distributed pitviper in Middle America and northwestern South America, in an attempt to evaluate how the final Isthmian uplift and other biogeographical boundaries in the region influenced genealogical lineage divergence in this species. We examined sequence data from two mitochondrial genes (MT-CYB and MT-ND4) from 111 specimens of B. asper, representing 70 localities throughout the species’ distribution. We reconstructed phylogeographic patterns using maximum likelihood and Bayesian methods and estimated divergence time using the Bayesian relaxed clock method. Within the nominal species, an early split led to two divergent lineages of B. asper: one includes five phylogroups distributed in Caribbean Middle America and southwestern Ecuador, and the other comprises five other groups scattered in the Pacific slope of Isthmian Central America and northwestern South America. Our results provide evidence of a complex transition that involves at least two dispersal events into Middle America during the final closure of the Isthmus. PMID:29176806
Phylogeography of the Central American lancehead Bothrops asper (SERPENTES: VIPERIDAE).
Saldarriaga-Córdoba, Mónica; Parkinson, Christopher L; Daza, Juan M; Wüster, Wolfgang; Sasa, Mahmood
2017-01-01
The uplift and final connection of the Central American land bridge is considered the major event that allowed biotic exchange between vertebrate lineages of northern and southern origin in the New World. However, given the complex tectonics that shaped Middle America, there is still substantial controversy over details of this geographical reconnection, and its role in determining biogeographic patterns in the region. Here, we examine the phylogeography of Bothrops asper, a widely distributed pitviper in Middle America and northwestern South America, in an attempt to evaluate how the final Isthmian uplift and other biogeographical boundaries in the region influenced genealogical lineage divergence in this species. We examined sequence data from two mitochondrial genes (MT-CYB and MT-ND4) from 111 specimens of B. asper, representing 70 localities throughout the species' distribution. We reconstructed phylogeographic patterns using maximum likelihood and Bayesian methods and estimated divergence time using the Bayesian relaxed clock method. Within the nominal species, an early split led to two divergent lineages of B. asper: one includes five phylogroups distributed in Caribbean Middle America and southwestern Ecuador, and the other comprises five other groups scattered in the Pacific slope of Isthmian Central America and northwestern South America. Our results provide evidence of a complex transition that involves at least two dispersal events into Middle America during the final closure of the Isthmus.
Christensen, Steen; Serbus, Laura Renee
2015-01-01
Two-component regulatory systems are commonly used by bacteria to coordinate intracellular responses with environmental cues. These systems are composed of functional protein pairs consisting of a sensor histidine kinase and cognate response regulator. In contrast to the well-studied Caulobacter crescentus system, which carries dozens of these pairs, the streamlined bacterial endosymbiont Wolbachia pipientis encodes only two pairs: CckA/CtrA and PleC/PleD. Here, we used bioinformatic tools to compare characterized two-component system relays from C. crescentus, the related Anaplasmataceae species Anaplasma phagocytophilum and Ehrlichia chaffeensis, and 12 sequenced Wolbachia strains. We found the core protein pairs and a subset of interacting partners to be highly conserved within Wolbachia and these other Anaplasmataceae. Genes involved in two-component signaling were positioned differently within the various Wolbachia genomes, whereas the local context of each gene was conserved. Unlike Anaplasma and Ehrlichia, Wolbachia two-component genes were more consistently found clustered with metabolic genes. The domain architecture and key functional residues standard for two-component system proteins were well-conserved in Wolbachia, although residues that specify cognate pairing diverged substantially from other Anaplasmataceae. These findings indicate that Wolbachia two-component signaling pairs share considerable functional overlap with other α-proteobacterial systems, whereas their divergence suggests the potential for regulatory differences and cross-talk. PMID:25809075
Chi, Hongshu; Taik, Patricia; Foley, Emily J; Racicot, Alycia C; Gray, Hilary M; Guzzetta, Katherine E; Lin, Hsin-Yun; Song, Yen-Ling; Tung, Che-Huang; Zenke, Kosuke; Yoshinaga, Tomoyoshi; Cheng, Chao-Yin; Chang, Wei-Jen; Gong, Hui
2017-07-01
The ciliate protozoan Cryptocaryon irritans parasitizes marine fish and causes lethal white spot disease. Sporadic infections as well as large-scale outbreaks have been reported globally and the parasite's broad host range poses particular threat to the aquaculture and ornamental fish markets. In order to better understand C. irritans' population structure, we sequenced and compared mitochondrial cox-1, SSU rRNA, and ITS-1 sequences from 8 new isolates of C. irritans collected in China, Japan, and Taiwan. We detected two SSU rRNA haplotypes, which differ at three positions, separating the isolates into two main groups (I and II). Cox-1 sequences also support the division into two groups, and the cox-1 divergence between these two groups is unexpectedly high (9.28% for 1582 nucleotide positions). The divergence is much greater than that detected in Ichthyophthirius multifiliis, the ciliate protozoan causing freshwater white spot disease in fish, where intraspecies divergence on cox-1 sequence is only 1.95%. ITS-1 sequences derived from these eight isolates and from all other C. irritans isolates (deposited in the GenBank) not only support the two groups, but further suggest the presence of a third group with even greater sequence divergence. Finally, a small Ka/Ks ratio estimated from cox-1 sequences suggests that this gene in C. irritans remains under strong purifying selection. Taken together, the C. irritans species may consists of many subspecies and/or syngens. Further work is needed to determine if there is reproductive isolation between the groups we have defined. Copyright © 2017 Elsevier Inc. All rights reserved.
Schönberg, Anna; Theunert, Christoph; Li, Mingkun; Stoneking, Mark; Nasidze, Ivan
2011-09-01
To investigate the demographic history of human populations from the Caucasus and surrounding regions, we used high-throughput sequencing to generate 147 complete mtDNA genome sequences from random samples of individuals from three groups from the Caucasus (Armenians, Azeri and Georgians), and one group each from Iran and Turkey. Overall diversity is very high, with 144 different sequences that fall into 97 different haplogroups found among the 147 individuals. Bayesian skyline plots (BSPs) of population size change through time show a population expansion around 40-50 kya, followed by a constant population size, and then another expansion around 15-18 kya for the groups from the Caucasus and Iran. The BSP for Turkey differs the most from the others, with an increase from 35 to 50 kya followed by a prolonged period of constant population size, and no indication of a second period of growth. An approximate Bayesian computation approach was used to estimate divergence times between each pair of populations; the oldest divergence times were between Turkey and the other four groups from the South Caucasus and Iran (~400-600 generations), while the divergence time of the three Caucasus groups from each other was comparable to their divergence time from Iran (average of ~360 generations). These results illustrate the value of random sampling of complete mtDNA genome sequences that can be obtained with high-throughput sequencing platforms.
Skoglund, Pontus; Götherström, Anders; Jakobsson, Mattias
2011-04-01
Despite recent technological advances in DNA sequencing, incomplete coverage remains to be an issue in population genomics, in particular for studies that include ancient samples. Here, we describe an approach to estimate population divergence times for non-overlapping sequence data that is based on probabilities of different genealogical topologies under a structured coalescent model. We show that the approach can be adapted to accommodate common problems such as sequencing errors and postmortem nucleotide misincorporations, and we use simulations to investigate biases involved with estimating genealogical topologies from empirical data. The approach relies on three reference genomes and should be particularly useful for future analysis of genomic data that comprise of nonoverlapping sets of sequences, potentially from different points in time. We applied the method to shotgun sequence data from an ancient wolf together with extant dogs and wolves and found striking resemblance to previously described fine-scale population structure among dog breeds. When comparing modern dogs to four geographically distinct wolves, we find that the divergence time between dogs and an Indian wolf is smallest, followed by the divergence times to a Chinese wolf and a Spanish wolf, and a relatively long divergence time to an Alaskan wolf, suggesting that the origin of modern dogs is somewhere in Eurasia, potentially southern Asia. We find that less than two-thirds of all loci in the boxer and poodle genomes are more similar to each other than to a modern gray wolf and that--assuming complete isolation without gene flow--the divergence time between gray wolves and modern European dogs extends to 3,500 generations before the present, corresponding to approximately 10,000 years ago (95% confidence interval [CI]: 9,000-13,000). We explicitly study the effect of gene flow between dogs and wolves on our estimates and show that a low rate of gene flow is compatible with an even earlier domestication date ∼30,000 years ago (95% CI: 15,000-90,000). This observation is in agreement with recent archaeological findings and indicates that human behavior necessary for domestication of wild animals could have appeared much earlier than the development of agriculture.
Wu, G. Albert; Prochnik, Simon; Jenkins, Jerry; Salse, Jerome; Hellsten, Uffe; Murat, Florent; Perrier, Xavier; Ruiz, Manuel; Scalabrin, Simone; Terol, Javier; Takita, Marco Aurélio; Labadie, Karine; Poulain, Julie; Couloux, Arnaud; Jabbari, Kamel; Cattonaro, Federica; Del Fabbro, Cristian; Pinosio, Sara; Zuccolo, Andrea; Chapman, Jarrod; Grimwood, Jane; Tadeo, Francisco R.; Estornell, Leandro H.; Muñoz-Sanz, Juan V.; Ibanez, Victoria; Herrero-Ortega, Amparo; Aleza, Pablo; Pérez-Pérez, Julián; Ramón, Daniel; Brunel, Dominique; Luro, François; Chen, Chunxian; Farmerie, William G.; Desany, Brian; Kodira, Chinnappa; Mohiuddin, Mohammed; Harkins, Tim; Fredrikson, Karin; Burns, Paul; Lomsadze, Alexandre; Borodovsky, Mark; Reforgiato, Giuseppe; Freitas-Astúa, Juliana; Quetier, Francis; Navarro, Luis; Roose, Mikeal; Wincker, Patrick; Schmutz, Jeremy; Morgante, Michele; Machado, Marcos Antonio; Talon, Manuel; Jaillon, Olivier; Ollitrault, Patrick; Gmitter, Frederick; Rokhsar, Daniel
2014-01-01
The domestication of citrus, is poorly understood. Cultivated types are selections from, or hybrids of, wild progenitor species, whose identities and contributions remain controversial. By comparative analysis of a collection of citrus genomes, including a high quality haploid reference, we show that cultivated types were derived from two progenitor species. Though cultivated pummelos represent selections from a single progenitor species, C. maxima, cultivated mandarins are introgressions of C. maxima into the ancestral mandarin species, C. reticulata. The most widely cultivated citrus, sweet orange, is the offspring of previously admixed individuals, but sour orange is an F1 hybrid of pure C. maxima and C. reticulata parents, implying that wild mandarins were part of the early breeding germplasm. A wild “mandarin” from China exhibited substantial divergence from C. reticulata, suggesting the possibility of other unrecognized wild citrus species. Understanding citrus phylogeny through genome analysis clarifies taxonomic relationships and enables sequence-directed genetic improvement. PMID:24908277
The Metamorphosis of Amphibian Toxicogenomics
Helbing, Caren C.
2012-01-01
Amphibians are important vertebrates in toxicology often representing both aquatic and terrestrial forms within the life history of the same species. Of the thousands of species, only two have substantial genomics resources: the recently published genome of the Pipid, Xenopus (Silurana) tropicalis, and transcript information (and ongoing genome sequencing project) of Xenopus laevis. However, many more species representative of regional ecological niches and life strategies are used in toxicology worldwide. Since Xenopus species diverged from the most populous frog family, the Ranidae, ~200 million years ago, there are notable differences between them and the even more distant Caudates (salamanders) and Caecilians. These differences include genome size, gene composition, and extent of polyploidization. Application of toxicogenomics to amphibians requires the mobilization of resources and expertise to develop de novo sequence assemblies and analysis strategies for a broader range of amphibian species. The present mini-review will present the advances in toxicogenomics as pertains to amphibians with particular emphasis upon the development and use of genomic techniques (inclusive of transcriptomics, proteomics, and metabolomics) and the challenges inherent therein. PMID:22435070
2012-01-01
Background A detailed knowledge about spatial and temporal gene expression is important for understanding both the function of genes and their evolution. For the vast majority of species, transcriptomes are still largely uncharacterized and even in those where substantial information is available it is often in the form of partially sequenced transcriptomes. With the development of next generation sequencing, a single experiment can now simultaneously identify the transcribed part of a species genome and estimate levels of gene expression. Results mRNA from actively growing needles of Norway spruce (Picea abies) was sequenced using next generation sequencing technology. In total, close to 70 million fragments with a length of 76 bp were sequenced resulting in 5 Gbp of raw data. A de novo assembly of these reads, together with publicly available expressed sequence tag (EST) data from Norway spruce, was used to create a reference transcriptome. Of the 38,419 PUTs (putative unique transcripts) longer than 150 bp in this reference assembly, 83.5% show similarity to ESTs from other spruce species and of the remaining PUTs, 3,704 show similarity to protein sequences from other plant species, leaving 4,167 PUTs with limited similarity to currently available plant proteins. By predicting coding frames and comparing not only the Norway spruce PUTs, but also PUTs from the close relatives Picea glauca and Picea sitchensis to both Pinus taeda and Taxus mairei, we obtained estimates of synonymous and non-synonymous divergence among conifer species. In addition, we detected close to 15,000 SNPs of high quality and estimated gene expression differences between samples collected under dark and light conditions. Conclusions Our study yielded a large number of single nucleotide polymorphisms as well as estimates of gene expression on transcriptome scale. In agreement with a recent study we find that the synonymous substitution rate per year (0.6 × 10−09 and 1.1 × 10−09) is an order of magnitude smaller than values reported for angiosperm herbs. However, if one takes generation time into account, most of this difference disappears. The estimates of the dN/dS ratio (non-synonymous over synonymous divergence) reported here are in general much lower than 1 and only a few genes showed a ratio larger than 1. PMID:23122049
Chapple, David G; Birkett, Alisha; Miller, Kimberly A; Daugherty, Charles H; Gleeson, Dianne M
2012-01-01
Climatic cooling and substantial tectonic activity since the late Miocene have had a pronounced influence on the evolutionary history of the fauna of New Zealand's South Island. However, many species have recently experienced dramatic range reductions due to habitat fragmentation and the introduction of mammalian predators and competitors. These anthropogenic impacts have been particularly severe in the tussock grasslands of the Otago region. The Otago skink (Oligosoma otagense), endemic to the region, is one of the most critically endangered vertebrates in New Zealand. We use mitochondrial DNA sequence data to investigate the evolutionary history of the Otago skink, examine its population genetic structure, and assess the level of genetic diversity in the individuals in the captive breeding program. Our data indicate that the Otago skink diverged from its closest relatives in the Miocene, consistent with the commencement of tectonic uplift of the Southern Alps. However, there is evidence for past introgression with the scree skink (O. waimatense) in the northern Otago-southern Canterbury region. The remnant populations in eastern Otago and western Otago are estimated to have diverged in the mid-Pliocene, with no haplotypes shared between these two regions. This divergence accounts for 95% of the genetic diversity in the species. Within both regions there is strong genetic structure among populations, although shared haplotypes are generally evident between adjacent localities. Although substantial genetic diversity is present in the captive population, all individuals originate from the eastern region and the majority had haplotypes that were not evident in the intensively managed populations at Macraes Flat. Our data indicate that eastern and western populations should continue to be regarded as separate management units. Knowledge of the genetic diversity of the breeding stock will act to inform the captive management of the Otago skink and contribute to a key recovery action for the species.
Bateman, Richard M; Rudall, Paula J; Bidartondo, Martin I; Cozzolino, Salvatore; Tranchida-Lombardo, Valentina; Carine, Mark A; Moura, Mónica
2014-06-01
• Premise of the study: Most orchid species native to the Macaronesian islands reflect immigration from western Europe or North Africa followed by anagenesis. The only putative exception is the butterfly orchids (Platanthera) of the Azores, where three species apparently reflect at least one cladogenetic speciation event. This multidisciplinary study explores the origin, speciation, phenotypic, and genotypic cohesion of these Azorean species and their mainland relatives.• Methods: Plants of Platanthera from 30 localities spanning all nine Azorean islands were compared with those of four continental European relatives for 38 morphometric characters; substantial subsets were also analyzed for plastid microsatellites, and for nrITS of both the orchids and their mycorrhizae.• Key results: Although the three Azorean and four mainland species are all readily distinguished morphometrically using several floral characters, and hybridization appears rare, divergence in ITS and especially plastid sequences is small. Despite occupying similar laurisilva habitats, the Azorean species differ radically in the identities and diversity of their mycorrhizal partners; specialism apparently increases rarity.• Conclusions: Although morphological evidence suggests two invasions of the islands from NW Africa and/or SW Europe, ITS data imply only one. As the molecular data are unable to distinguish among the potential mainland ancestors, two scenarios of relationship are explored that imply different ancestors. Both scenarios require both anagenetic and cladogenetic speciation events, involving homoplastic shifts in overall flower size and (often substantial) changes in the relative dimensions of individual floral organs. Limited genotypic divergence among the three species compared with greater phenotypic divergence suggests comparatively recent speciation. Mycorrhizae may be the most critical factor dictating the respective ecological tolerances, and thus the relative frequencies, of these species. The recent IUCN Red-List amalgamation of Azorean Platanthera taxa into a single species urgently requires reappraisal, as P. micrantha is an excellent indicator species of seminatural laurisilva forest and P. azorica is arguably Europe's rarest orchid. © 2014 Botanical Society of America, Inc.
Hopple, J S; Vilgalys, R
1999-10-01
Phylogenetic relationships were investigated in the mushroom genus Coprinus based on sequence data from the nuclear encoded large-subunit rDNA gene. Forty-seven species of Coprinus and 19 additional species from the families Coprinaceae, Strophariaceae, Bolbitiaceae, Agaricaceae, Podaxaceae, and Montagneaceae were studied. A total of 1360 sites was sequenced across seven divergent domains and intervening sequences. A total of 302 phylogenetically informative characters was found. Ninety-eight percent of the average divergence between taxa was located within the divergent domains, with domains D2 and D8 being most divergent and domains D7 and D10 the least divergent. An empirical test of phylogenetic signal among divergent domains also showed that domains D2 and D3 had the lowest levels of homoplasy. Two equally most parsimonious trees were resolved using Wagner parsimony. A character-state weighted analysis produced 12 equally most parsimonious trees similar to those generated by Wagner parsimony. Phylogenetic analyses employing topological constraints suggest that none of the major taxonomic systems proposed for subgeneric classification is able to completely reflect phylogenetic relationships in Coprinus. A strict consensus integration of the two Wagner trees demonstrates the problematic nature of choosing outgroups within dark-spored mushrooms. The genus Coprinus is found to be polyphyletic and is separated into three distinct clades. Most Coprinus taxa belong to the first two clades, which together form a larger monophyletic group with Lacrymaria and Psathyrella in basal positions. A third clade contains members of Coprinus section Comati as well as the genus Leucocoprinus, Podaxis pistillaris, Montagnea arenaria, and Agaricus pocillator. This third clade is separated from the other species of Coprinus by members of the families Strophariaceae and Bolbitiaceae and the genus Panaeolus. Copyright 1999 Academic Press.
Richards, Stephen; Liu, Yue; Bettencourt, Brian R.; Hradecky, Pavel; Letovsky, Stan; Nielsen, Rasmus; Thornton, Kevin; Hubisz, Melissa J.; Chen, Rui; Meisel, Richard P.; Couronne, Olivier; Hua, Sujun; Smith, Mark A.; Zhang, Peili; Liu, Jing; Bussemaker, Harmen J.; van Batenburg, Marinus F.; Howells, Sally L.; Scherer, Steven E.; Sodergren, Erica; Matthews, Beverly B.; Crosby, Madeline A.; Schroeder, Andrew J.; Ortiz-Barrientos, Daniel; Rives, Catharine M.; Metzker, Michael L.; Muzny, Donna M.; Scott, Graham; Steffen, David; Wheeler, David A.; Worley, Kim C.; Havlak, Paul; Durbin, K. James; Egan, Amy; Gill, Rachel; Hume, Jennifer; Morgan, Margaret B.; Miner, George; Hamilton, Cerissa; Huang, Yanmei; Waldron, Lenée; Verduzco, Daniel; Clerc-Blankenburg, Kerstin P.; Dubchak, Inna; Noor, Mohamed A.F.; Anderson, Wyatt; White, Kevin P.; Clark, Andrew G.; Schaeffer, Stephen W.; Gelbart, William; Weinstock, George M.; Gibbs, Richard A.
2005-01-01
We have sequenced the genome of a second Drosophila species, Drosophila pseudoobscura, and compared this to the genome sequence of Drosophila melanogaster, a primary model organism. Throughout evolution the vast majority of Drosophila genes have remained on the same chromosome arm, but within each arm gene order has been extensively reshuffled, leading to a minimum of 921 syntenic blocks shared between the species. A repetitive sequence is found in the D. pseudoobscura genome at many junctions between adjacent syntenic blocks. Analysis of this novel repetitive element family suggests that recombination between offset elements may have given rise to many paracentric inversions, thereby contributing to the shuffling of gene order in the D. pseudoobscura lineage. Based on sequence similarity and synteny, 10,516 putative orthologs have been identified as a core gene set conserved over 25–55 million years (Myr) since the pseudoobscura/melanogaster divergence. Genes expressed in the testes had higher amino acid sequence divergence than the genome-wide average, consistent with the rapid evolution of sex-specific proteins. Cis-regulatory sequences are more conserved than random and nearby sequences between the species—but the difference is slight, suggesting that the evolution of cis-regulatory elements is flexible. Overall, a pattern of repeat-mediated chromosomal rearrangement, and high coadaptation of both male genes and cis-regulatory sequences emerges as important themes of genome divergence between these species of Drosophila. PMID:15632085
2010-01-01
Background Cryptic species complexes are common among anophelines. Previous phylogenetic analysis based on the complete mtDNA COI gene sequences detected paraphyly in the Neotropical malaria vector Anopheles marajoara. The "Folmer region" detects a single taxon using a 3% divergence threshold. Methods To test the paraphyletic hypothesis and examine the utility of the Folmer region, genealogical trees based on a concatenated (white + 3' COI sequences) dataset and pairwise differentiation of COI fragments were examined. The population structure and demographic history were based on partial COI sequences for 294 individuals from 14 localities in Amazonian Brazil. 109 individuals from 12 localities were sequenced for the nDNA white gene, and 57 individuals from 11 localities were sequenced for the ribosomal DNA (rDNA) internal transcribed spacer 2 (ITS2). Results Distinct A. marajoara lineages were detected by combined genealogical analysis and were also supported among COI haplotypes using a median joining network and AMOVA, with time since divergence during the Pleistocene (<100,000 ya). COI sequences at the 3' end were more variable, demonstrating significant pairwise differentiation (3.82%) compared to the more moderate 2.92% detected by the Folmer region. Lineage 1 was present in all localities, whereas lineage 2 was restricted mainly to the west. Mismatch distributions for both lineages were bimodal, likely due to multiple colonization events and spatial expansion (~798 - 81,045 ya). There appears to be gene flow within, not between lineages, and a partial barrier was detected near Rio Jari in Amapá state, separating western and eastern populations. In contrast, both nDNA data sets (white gene sequences with or without the retention of the 4th intron, and ITS2 sequences and length) detected a single A. marajoara lineage. Conclusions Strong support for combined data with significant differentiation detected in the COI and absent in the nDNA suggest that the divergence is recent, and detectable only by the faster evolving mtDNA. A within subgenus threshold of >2% may be more appropriate among sister taxa in cryptic anopheline complexes than the standard 3%. Differences in demographic history and climatic changes may have contributed to mtDNA lineage divergence in A. marajoara. PMID:20929572
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sakoyama, Y.; Hong, K.J.; Byun, S.M.
To determine the phylogenetic relationships among hominoids and the dates of their divergence, the complete nucleotide sequences of the constant region of the immunoglobulin eta-chain (C/sub eta1/) genes from chimpanzee and orangutan have been determined. These sequences were compared with the human eta-chain constant-region sequence. A molecular clock (silent molecular clock), measured by the degree of sequence divergence at the synonymous (silent) positions of protein-encoding regions, was introduced for the present study. From the comparison of nucleotide sequences of ..cap alpha../sub 1/-antitrypsin and ..beta..- and delta-globulin genes between humans and Old World monkeys, the silent molecular clock was calibrated: themore » mean evolutionary rate of silent substitution was determined to be 1.56 x 10/sup -9/ substitutions per site per year. Using the silent molecular clock, the mean divergence dates of chimpanzee and orangutan from the human lineage were estimated as 6.4 +/- 2.6 million years and 17.3 +/- 4.5 million years, respectively. It was also shown that the evolutionary rate of primate genes is considerably slower than those of other mammalian genes.« less
Bloom DNA Helicase Facilitates Homologous Recombination between Diverged Homologous Sequences*
Kikuchi, Koji; Abdel-Aziz, H. Ismail; Taniguchi, Yoshihito; Yamazoe, Mitsuyoshi; Takeda, Shunichi; Hirota, Kouji
2009-01-01
Bloom syndrome caused by inactivation of the Bloom DNA helicase (Blm) is characterized by increases in the level of sister chromatid exchange, homologous recombination (HR) associated with cross-over. It is therefore believed that Blm works as an anti-recombinase. Meanwhile, in Drosophila, DmBlm is required specifically to promote the synthesis-dependent strand anneal (SDSA), a type of HR not associating with cross-over. However, conservation of Blm function in SDSA through higher eukaryotes has been a matter of debate. Here, we demonstrate the function of Blm in SDSA type HR in chicken DT40 B lymphocyte line, where Ig gene conversion diversifies the immunoglobulin V gene through intragenic HR between diverged homologous segments. This reaction is initiated by the activation-induced cytidine deaminase enzyme-mediated uracil formation at the V gene, which in turn converts into abasic site, presumably leading to a single strand gap. Ig gene conversion frequency was drastically reduced in BLM−/− cells. In addition, BLM−/− cells used limited donor segments harboring higher identity compared with other segments in Ig gene conversion event, suggesting that Blm can promote HR between diverged sequences. To further understand the role of Blm in HR between diverged homologous sequences, we measured the frequency of gene targeting induced by an I-SceI-endonuclease-mediated double-strand break. BLM−/− cells showed a severer defect in the gene targeting frequency as the number of heterologous sequences increased at the double-strand break site. Conversely, the overexpression of Blm, even an ATPase-defective mutant, strongly stimulated gene targeting. In summary, Blm promotes HR between diverged sequences through a novel ATPase-independent mechanism. PMID:19661064
Olmsted, R A; Langley, R; Roelke, M E; Goeken, R M; Adger-Johnson, D; Goff, J P; Albert, J P; Packer, C; Laurenson, M K; Caro, T M
1992-10-01
The natural occurrence of lentiviruses closely related to feline immunodeficiency virus (FIV) in nondomestic felid species is shown here to be worldwide. Cross-reactive antibodies to FIV were common in several free-ranging populations of large cats, including East African lions and cheetahs of the Serengeti ecosystem and in puma (also called cougar or mountain lion) populations throughout North America. Infectious puma lentivirus (PLV) was isolated from several Florida panthers, a severely endangered relict puma subspecies inhabiting the Big Cypress Swamp and Everglades ecosystems in southern Florida. Phylogenetic analysis of PLV genomic sequences from disparate geographic isolates revealed appreciable divergence from domestic cat FIV sequences as well as between PLV sequences found in different North American locales. The level of sequence divergence between PLV and FIV was greater than the level of divergence between human and certain simian immunodeficiency viruses, suggesting that the transmission of FIV between feline species is infrequent and parallels in time the emergence of HIV from simian ancestors.
Pohl, Nélida; Sison-Mangus, Marilou P; Yee, Emily N; Liswi, Saif W; Briscoe, Adriana D
2009-05-13
The increase in availability of genomic sequences for a wide range of organisms has revealed gene duplication to be a relatively common event. Encounters with duplicate gene copies have consequently become almost inevitable in the context of collecting gene sequences for inferring species trees. Here we examine the effect of incorporating duplicate gene copies evolving at different rates on tree reconstruction and time estimation of recent and deep divergences in butterflies. Sequences from ultraviolet-sensitive (UVRh), blue-sensitive (BRh), and long-wavelength sensitive (LWRh) opsins,EF-1 and COI were obtained from 27 taxa representing the five major butterfly families (5535 bp total). Both BRh and LWRh are present in multiple copies in some butterfly lineages and the different copies evolve at different rates. Regardless of the phylogenetic reconstruction method used, we found that analyses of combined data sets using either slower or faster evolving copies of duplicate genes resulted in a single topology in agreement with our current understanding of butterfly family relationships based on morphology and molecules. Interestingly, individual analyses of BRh and LWRh sequences also recovered these family-level relationships. Two different relaxed clock methods resulted in similar divergence time estimates at the shallower nodes in the tree, regardless of whether faster or slower evolving copies were used, with larger discrepancies observed at deeper nodes in the phylogeny. The time of divergence between the monarch butterfly Danaus plexippus and the queen D. gilippus (15.3-35.6 Mya) was found to be much older than the time of divergence between monarch co-mimic Limenitis archippus and red-spotted purple L. arthemis (4.7-13.6 Mya), and overlapping with the time of divergence of the co-mimetic passionflower butterflies Heliconius erato and H. melpomene (13.5-26.1 Mya). Our family-level results are congruent with recent estimates found in the literature and indicate an age of 84-113 million years for the divergence of all butterfly families. These results are consistent with diversification of the butterfly families following the radiation of angiosperms and suggest that some classes of opsin genes may be usefully employed for both phylogenetic reconstruction and divergence time estimation.
Horner, David S; Lefkimmiatis, Konstantinos; Reyes, Aurelio; Gissi, Carmela; Saccone, Cecilia; Pesole, Graziano
2007-01-01
Background Phylogenetic relationships between Lagomorpha, Rodentia and Primates and their allies (Euarchontoglires) have long been debated. While it is now generally agreed that Rodentia constitutes a monophyletic sister-group of Lagomorpha and that this clade (Glires) is sister to Primates and Dermoptera, higher-level relationships within Rodentia remain contentious. Results We have sequenced and performed extensive evolutionary analyses on the mitochondrial genome of the scaly-tailed flying squirrel Anomalurus sp., an enigmatic rodent whose phylogenetic affinities have been obscure and extensively debated. Our phylogenetic analyses of the coding regions of available complete mitochondrial genome sequences from Euarchontoglires suggest that Anomalurus is a sister taxon to the Hystricognathi, and that this clade represents the most basal divergence among sampled Rodentia. Bayesian dating methods incorporating a relaxed molecular clock provide divergence-time estimates which are consistently in agreement with the fossil record and which indicate a rapid radiation within Glires around 60 million years ago. Conclusion Taken together, the data presented provide a working hypothesis as to the phylogenetic placement of Anomalurus, underline the utility of mitochondrial sequences in the resolution of even relatively deep divergences and go some way to explaining the difficulty of conclusively resolving higher-level relationships within Glires with available data and methodologies. PMID:17288612
When are pathogen genome sequences informative of transmission events?
Ferguson, Neil; Jombart, Thibaut
2018-01-01
Recent years have seen the development of numerous methodologies for reconstructing transmission trees in infectious disease outbreaks from densely sampled whole genome sequence data. However, a fundamental and as of yet poorly addressed limitation of such approaches is the requirement for genetic diversity to arise on epidemiological timescales. Specifically, the position of infected individuals in a transmission tree can only be resolved by genetic data if mutations have accumulated between the sampled pathogen genomes. To quantify and compare the useful genetic diversity expected from genetic data in different pathogen outbreaks, we introduce here the concept of ‘transmission divergence’, defined as the number of mutations separating whole genome sequences sampled from transmission pairs. Using parameter values obtained by literature review, we simulate outbreak scenarios alongside sequence evolution using two models described in the literature to describe transmission divergence of ten major outbreak-causing pathogens. We find that while mean values vary significantly between the pathogens considered, their transmission divergence is generally very low, with many outbreaks characterised by large numbers of genetically identical transmission pairs. We describe the impact of transmission divergence on our ability to reconstruct outbreaks using two outbreak reconstruction tools, the R packages outbreaker and phybreak, and demonstrate that, in agreement with previous observations, genetic sequence data of rapidly evolving pathogens such as RNA viruses can provide valuable information on individual transmission events. Conversely, sequence data of pathogens with lower mean transmission divergence, including Streptococcus pneumoniae, Shigella sonnei and Clostridium difficile, provide little to no information about individual transmission events. Our results highlight the informational limitations of genetic sequence data in certain outbreak scenarios, and demonstrate the need to expand the toolkit of outbreak reconstruction tools to integrate other types of epidemiological data. PMID:29420641
Pereira, J O P; Freitas, B M; Jorge, D M M; Torres, D C; Soares, C E A; Grangeiro, T B
2009-01-01
Melipona quinquefasciata is a ground-nesting South American stingless bee whose geographic distribution was believed to comprise only the central and southern states of Brazil. We obtained partial sequences (about 500-570 bp) of first internal transcribed spacer (ITS1) nuclear ribosomal DNA from Melipona specimens putatively identified as M. quinquefasciata collected from different localities in northeastern Brazil. To confirm the taxonomic identity of the northeastern samples, specimens from the state of Goiás (Central region of Brazil) were included for comparison. All sequences were deposited in GenBank (accession numbers EU073751-EU073759). The mean nucleotide divergence (excluding sites with insertions/deletions) in the ITS1 sequences was only 1.4%, ranging from 0 to 4.1%. When the sites with insertions/deletions were also taken into account, sequence divergences varied from 0 to 5.3%. In all pairwise comparisons, the ITS1 sequence from the specimens collected in Goiás was most divergent compared to the ITS1 sequences of the bees from the other locations. However, neighbor-joining phylogenetic analysis showed that all ITS1 sequences from northeastern specimens along with the sample of Goiás were resolved in a single clade with a bootstrap support of 100%. The ITS1 sequencing data thus support the occurrence of M. quinquefasciata in northeast Brazil.
Stoesser, N; Sheppard, A E; Moore, C E; Golubchik, T; Parry, C M; Nget, P; Saroeun, M; Day, N P J; Giess, A; Johnson, J R; Peto, T E A; Crook, D W; Walker, A S
2015-07-01
Studies of the transmission epidemiology of antimicrobial-resistant Escherichia coli, such as strains harboring extended-spectrum beta-lactamase (ESBL) genes, frequently use selective culture of rectal surveillance swabs to identify isolates for molecular epidemiological investigation. Typically, only single colonies are evaluated, which risks underestimating species diversity and transmission events. We sequenced the genomes of 16 E. coli colonies from each of eight fecal samples (n = 127 genomes; one failure), taken from different individuals in Cambodia, a region of high ESBL-producing E. coli prevalence. Sequence data were used to characterize both the core chromosomal diversity of E. coli isolates and their resistance/virulence gene content as a proxy measure of accessory genome diversity. The 127 E. coli genomes represented 31 distinct sequence types (STs). Seven (88%) of eight subjects carried ESBL-positive isolates, all containing blaCTX-M variants. Diversity was substantial, with a median of four STs/individual (range, 1 to 10) and wide genetic divergence at the nucleotide level within some STs. In 2/8 (25%) individuals, the same blaCTX-M variant occurred in different clones, and/or different blaCTX-M variants occurred in the same clone. Patterns of other resistance genes and common virulence factors, representing differences in the accessory genome, were also diverse within and between clones. The substantial diversity among intestinally carried ESBL-positive E. coli bacteria suggests that fecal surveillance, particularly if based on single-colony subcultures, will likely underestimate transmission events, especially in high-prevalence settings. Copyright © 2015, Stoesser et al.
Evolution of exceptional species richness among lineages of fleshy-fruited Myrtaceae
Biffin, Ed; Lucas, Eve J.; Craven, Lyn A.; Ribeiro da Costa, Itayguara; Harrington, Mark G.; Crisp, Michael D.
2010-01-01
Background and Aims The angiosperm family Myrtaceae comprises 17 tribes with more than half of the estimated 5500 species being referred to the fleshy-fruited and predominantly rainforest associated Syzygieae and Myrteae. Previous studies suggest that fleshy fruits have evolved separately in these lineages, whereas generally shifts in fruit morphology have been variously implicated in diversification rate shifts among angiosperms. A phylogenetic hypothesis and estimate divergence times for Myrtaceae is developed as a basis to explore the evidence for, and drivers of, elevated diversification rates among the fleshy-fruited tribes of Myrtaceae. Methods Bayesian phylogenetic analyses of plastid and nuclear DNA sequences were used to estimate intertribal relationships and lineage divergence times in Myrtaceae. Focusing on the fleshy-fruited tribes, a variety of statistical approaches were used to assess diversification rates and diversification rate shifts across the family. Key Results Analyses of the sequence data provide a strongly supported phylogenetic hypothesis for Myrtaceae. Relative to previous studies, substantially younger ages for many of the clades are reported, and it is argued that the use of flexible calibrations to incorporate fossil data provides more realistic divergence estimates than the use of errorless point calibrations. It is found that Syzygieae and Myrteae have experienced elevated diversification rates relative to other lineages of Myrtaceae. Positive shifts in diversification rate have occurred separately in each lineage, associated with a shift from dry to fleshy fruit. Conclusions Fleshy fruits have evolved independently in Syzygieae and Myrteae, and this is accompanied by exceptional diversification rate shifts in both instances, suggesting that the evolution of fleshy fruits is a key innovation for rainforest Myrtaceae. Noting the scale dependency of this hypothesis, more complex explanations may be required to explain diversification rate shifts occurring within the fleshy-fruited tribes, and the suggested phylogenetic hypothesis provides an appropriate framework for this undertaking. PMID:20462850
Wan, Yizhen; Schwaninger, Heidi R; Baldo, Angela M; Labate, Joanne A; Zhong, Gan-Yuan; Simon, Charles J
2013-07-05
Grapes are one of the most economically important fruit crops. There are about 60 species in the genus Vitis. The phylogenetic relationships among these species are of keen interest for the conservation and use of this germplasm. We selected 309 accessions from 48 Vitis species,varieties, and outgroups, examined ~11 kb (~3.4 Mb total) of aligned nuclear DNA sequences from 27 unlinked genes in a phylogenetic context, and estimated divergence times based on fossil calibrations. Vitis formed a strongly supported clade. There was substantial support for species and less for the higher-level groupings (series). As estimated from extant taxa, the crown age of Vitis was 28 Ma and the divergence of subgenera (Vitis and Muscadinia) occurred at ~18 Ma. Higher clades in subgenus Vitis diverged 16 - 5 Ma with overlapping confidence intervals, and ongoing divergence formed extant species at 12 - 1.3 Ma. Several species had species-specific SNPs. NeighborNet analysis showed extensive reticulation at the core of subgenus Vitis representing the deeper nodes, with extensive reticulation radiating outward. Fitch Parsimony identified North America as the origin of the most recent common ancestor of extant Vitis species. Phylogenetic patterns suggested origination of the genus in North America, fragmentation of an ancestral range during the Miocene, formation of extant species in the late Miocene-Pleistocene, and differentiation of species in the context of Pliocene-Quaternary tectonic and climatic change. Nuclear SNPs effectively resolved relationships at and below the species level in grapes and rectified several misclassifications of accessions in the repositories. Our results challenge current higher-level classifications, reveal the abundance of genetic diversity in the genus that is potentially available for crop improvement, and provide a valuable resource for species delineation, germplasm conservation and use.
Mhc class II B gene evolution in East African cichlid fishes.
Figueroa, F; Mayer, W E; Sültmann, H; O'hUigin, C; Tichy, H; Satta, Y; Takezaki, N; Takahata, N; Klein, J
2000-06-01
A distinctive feature of essential major histocompatibility complex (Mhc) loci is their polymorphism characterized by large genetic distances between alleles and long persistence times of allelic lineages. Since the lineages often span several successive speciations, we investigated the behavior of the Mhc alleles during or close to the speciation phase. We sequenced exon 2 of the class II B locus 4 from 232 East African cichlid fishes representing 32 related species. The divergence times of the (sub)species ranged from 6,000 to 8.4 million years. Two types of evolutionary analysis were used to elucidate the pattern of exon 2 sequence divergence. First, phylogenetic methods were applied to reconstruct the most likely evolutionary pathways leading from the last common ancestor of the set to the extant sequences, and to assess the probable mechanisms involved in allelic diversification. Second, pairwise comparisons of sequences were carried out to detect differences seemingly incompatible with origin by nonparallel point mutations. The analysis revealed point mutations to be the most important mechanism behind allelic divergences, with recombination playing only an auxiliary part. Comparison of sequences from related species revealed evidence of random allelic (lineage) losses apparently associated with speciation. Sharing of identical alleles could be demonstrated between species that diverged 2 million years ago. The phylogeny of the exon was incongruent with that of the flanking introns, indicating either a high degree of convergent evolution at the peptide-binding region-encoding sites, or intron homogenization.
Amoikon, Tiemele Laurent Simon; Grondin, Cécile; Djéni, Théodore N'Dédé; Jacques, Noémie; Casaregola, Serge
2018-05-21
Analysis of yeasts isolated from various biotopes in French Guiana led to the identification of two strains isolated from flowers and designated CLIB 1634 T and CLIB 1707 T . Comparison of the D1/D2 domain of the large subunit (LSU D1/D2) rRNA gene sequences of CLIB 1634 T and CLIB 1707 T to those in the GenBank database revealed that these strains belong to the Starmerella clade. Strain CLIB 1634 T was shown to diverge from the closely related Starmerella apicola type strain CBS 2868 T with a sequence divergence of 1.34 and 1.30 %, in the LSU D1/D2 rRNA gene and internal transcribed spacer (ITS) sequences respectively. Strain CLIB 1634 T and Candida apicola CBS 2868 T diverged by 3.81 and 14.96 % at the level of the protein-coding gene partial sequences EF-1α and RPB2, respectively. CLIB 1707 T was found to have sequence divergence of 3.88 and 9.16 % in the LSU D1/D2 rRNA gene and ITS, respectively, from that of the most closely related species Starmerella ratchasimensis type strain CBS 10611 T . The species Starmerella reginensis f.a., sp. nov. and Starmerella kourouensis f.a., sp. nov. are proposed to accommodate strains CLIB 1634 T (=CBS 15247 T ) and CLIB 1707 T (=CBS 15257 T ), respectively.
Intraspecific variation in Cryptocaryon irritans.
Diggles, B K; Adlard, R D
1997-01-01
Intraspecific variation in the ciliate Cryptocaryon irritans was examined using sequences of the first internal transcribed spacer region (ITS-1) of ribosomal DNA (rDNA) combined with developmental and morphological characters. Amplified rDNA sequences consisting of 151 bases of the flanking 18 S and 5.8 S regions, and the entire ITS-1 region (169 or 170 bases), were determined and compared for 16 isolates of C. irritans from Australia, Israel and the USA. There was one variable base between isolates in the 18 S region and 11 variable bases in the ITS-1 region. Despite their similar morphology, significant sequence variation (4.1% divergence) and developmental differences indicate that Australian C. irritans isolates from estuarine (Moreton Bay) and coral reef (Heron Island) environments are distinct. The Heron Island isolate was genetically closer to morphologically dissimilar isolates from Israel (1.8% divergence) and the USA (2.3% divergence) than it was to the Moreton Bay isolates. Three isolates maintained in our laboratory since February 1994 differed in sequence from earlier laboratory isolates (2.9% to 3.5% divergence), even though all were similar morphologically and originated from the same source. During this time the sequence of the isolates from wild fish in Moreton Bay remained unchanged. These genetic differences indicate the existence of a founder effect in laboratory populations of C. irritans. The genetic variation found here, combined with known morphological and developmental differences, is used to characterise four strains of C. irritans.
Single reaction, real time RT-PCR detection of all known avian and human metapneumoviruses.
Lemaitre, E; Allée, C; Vabret, A; Eterradossi, N; Brown, P A
2018-01-01
Current molecular methods for the detection of avian and human metapneumovirus (AMPV, HMPV) are specifically targeted towards each virus species or individual subgroups of these. Here a broad range SYBR Green I real time RT-PCR was developed which amplified a highly conserved fragment of sequence in the N open reading frame. This method was sufficiently efficient and specific in detecting all MPVs. Its validation according to the NF U47-600 norm for the four AMPV subgroups estimated low limits of detection between 1000 and 10copies/μL, similar with detection levels described previously for real time RT-PCRs targeting specific subgroups. RNA viruses present a challenge for the design of durable molecular diagnostic test due to the rate of change in their genome sequences which can vary substantially in different areas and over time. The fact that the regions of sequence for primer hybridization in the described method have remained sufficiently conserved since the AMPV and HMPV diverged, should give the best chance of continued detection of current subgroups and of potential unknown or future emerging MPV strains. Copyright © 2017 Elsevier B.V. All rights reserved.
Mostafa, Ahmed; Abdelwhab, El-Sayed M; Slanina, Heiko; Hussein, Mohamed A; Kuznetsova, Irina; Schüttler, Christian G; Ziebuhr, John; Pleschka, Stephan
2016-06-01
Infections by H3N2-type influenza A viruses (IAV) resulted in significant numbers of hospitalization in several countries in 2014-2015, causing disease also in vaccinated individuals and, in some cases, fatal outcomes. In this study, sequence analysis of H3N2 viruses isolated in Germany from 1998 to 2015, including eleven H3N2 isolates collected early in 2015, was performed. Compared to the vaccine strain A/Texas/50/2012 (H3N2), the 2015 strains from Germany showed up to 4.5 % sequence diversity in their HA1 protein, indicating substantial genetic drift. The data further suggest that two distinct phylogroups, 3C.2 and 3C.3, with 1.6-2.3 % and 0.3-2.4 % HA1 nucleotide and amino acid sequence diversity, respectively, co-circulated in Germany in the 2014/2015 season. Distinct glycosylation patterns and amino acid substitutions in the hemagglutinin and neuraminidase proteins were identified, possibly contributing to the unusually high number of H3N2 infections in this season and providing important information for developing vaccines that are effective against both genotypes.
Van Nguyen, Dung; Van Nguyen, Cuong; Bonsall, David; Ngo, Tue Tri; Carrique-Mas, Juan; Pham, Anh Hong; Bryant, Juliet E; Thwaites, Guy; Baker, Stephen; Woolhouse, Mark; Simmonds, Peter
2018-02-28
Rodents and bats are now widely recognised as important sources of zoonotic virus infections in other mammals, including humans. Numerous surveys have expanded our knowledge of diverse viruses in a range of rodent and bat species, including their origins, evolution, and range of hosts. In this study of pegivirus and human hepatitis-related viruses, liver and serum samples from Vietnamese rodents and bats were examined by PCR and sequencing. Nucleic acids homologous to human hepatitis B, C, E viruses were detected in liver samples of 2 (1.3%) of 157 bats, 38 (8.1%), and 14 (3%) of 470 rodents, respectively. Hepacivirus-like viruses were frequently detected (42.7%) in the bamboo rat, Rhizomys pruinosus , while pegivirus RNA was only evident in 2 (0.3%) of 638 rodent serum samples. Complete or near-complete genome sequences of HBV, HEV and pegivirus homologues closely resembled those previously reported from rodents and bats. However, complete coding region sequences of the rodent hepacivirus-like viruses substantially diverged from all of the currently classified variants and potentially represent a new species in the Hepacivirus genus. Of the viruses identified, their routes of transmission and potential to establish zoonoses remain to be determined.
Cryptic species? Patterns of maternal and paternal gene flow in eight neotropical bats.
Clare, Elizabeth L
2011-01-01
Levels of sequence divergence at mitochondrial loci are frequently used in phylogeographic analysis and species delimitation though single marker systems cannot assess bi-parental gene flow. In this investigation I compare the phylogeographic patterns revealed through the maternally inherited mitochondrial COI region and the paternally inherited 7(th) intron region of the Dby gene on the Y-chromosome in eight common Neotropical bat species. These species are diverse and include members of two families from the feeding guilds of sanguivores, nectarivores, frugivores, carnivores and insectivores. In each case, the currently recognized taxon is comprised of distinct, substantially divergent intraspecific mitochondrial lineages suggesting cryptic species complexes. In Chrotopterus auritus, and Saccopteryx bilineata I observed congruent patterns of divergence in both genetic regions suggesting a cessation of gene flow between intraspecific groups. This evidence supports the existence of cryptic species complexes which meet the criteria of the genetic species concept. In Glossophaga soricina two intraspecific groups with largely sympatric South American ranges show evidence for incomplete lineage sorting or frequent hybridization while a third group with a Central American distribution appears to diverge congruently at both loci suggesting speciation. Within Desmodus rotundus and Trachops cirrhosus the paternally inherited region was monomorphic and thus does not support or refute the potential for cryptic speciation. In Uroderma bilobatum, Micronycteris megalotis and Platyrrhinus helleri the gene regions show conflicting patterns of divergence and I cannot exclude ongoing gene flow between intraspecific groups. This analysis provides a comprehensive comparison across taxa and employs both maternally and paternally inherited gene regions to validate patterns of gene flow. I present evidence for previously unrecognized species meeting the criteria of the genetic species concept but demonstrate that estimates of mitochondrial diversity alone do not accurately represent gene flow in these species and that contact/hybrid zones must be explored to evaluate reproductive isolation.
Chakona, Albert; Swartz, Ernst R.; Gouws, Gavin
2013-01-01
This study used phylogenetic analyses of mitochondrial cytochrome b sequences to investigate genetic diversity within three broadly co-distributed freshwater fish genera (Galaxias, Pseudobarbus and Sandelia) to shed some light on the processes that promoted lineage diversification and shaped geographical distribution patterns. A total of 205 sequences of Galaxias, 177 sequences of Pseudobarbus and 98 sequences of Sandelia from 146 localities across nine river systems in the south-western Cape Floristic Region (South Africa) were used. The data were analysed using phylogenetic and haplotype network methods and divergence times for the clades retrieved were estimated using *BEAST. Nine extremely divergent (3.5–25.3%) lineages were found within Galaxias. Similarly, deep phylogeographic divergence was evident within Pseudobarbus, with four markedly distinct (3.8–10.0%) phylogroups identified. Sandelia had two deeply divergent (5.5–5.9%) lineages, but seven minor lineages with strong geographical congruence were also identified. The Miocene-Pliocene major sea-level transgression and the resultant isolation of populations in upland refugia appear to have driven widespread allopatric divergence within the three genera. Subsequent coalescence of rivers during the Pleistocene major sea-level regression as well as intermittent drainage connections during wet periods are proposed to have facilitated range expansion of lineages that currently occur across isolated river systems. The high degree of genetic differentiation recovered from the present and previous studies suggest that freshwater fish diversity within the south-western CFR may be vastly underestimated, and taxonomic revisions are required. PMID:23951050
Bertioli, David J; Moretzsohn, Marcio C; Madsen, Lene H; Sandal, Niels; Leal-Bertioli, Soraya CM; Guimarães, Patricia M; Hougaard, Birgit K; Fredslund, Jakob; Schauser, Leif; Nielsen, Anna M; Sato, Shusei; Tabata, Satoshi; Cannon, Steven B; Stougaard, Jens
2009-01-01
Background Most agriculturally important legumes fall within two sub-clades of the Papilionoid legumes: the Phaseoloids and Galegoids, which diverged about 50 Mya. The Phaseoloids are mostly tropical and include crops such as common bean and soybean. The Galegoids are mostly temperate and include clover, fava bean and the model legumes Lotus and Medicago (both with substantially sequenced genomes). In contrast, peanut (Arachis hypogaea) falls in the Dalbergioid clade which is more basal in its divergence within the Papilionoids. The aim of this work was to integrate the genetic map of Arachis with Lotus and Medicago and improve our understanding of the Arachis genome and legume genomes in general. To do this we placed on the Arachis map, comparative anchor markers defined using a previously described bioinformatics pipeline. Also we investigated the possible role of transposons in the patterns of synteny that were observed. Results The Arachis genetic map was substantially aligned with Lotus and Medicago with most synteny blocks presenting a single main affinity to each genome. This indicates that the last common whole genome duplication within the Papilionoid legumes predated the divergence of Arachis from the Galegoids and Phaseoloids sufficiently that the common ancestral genome was substantially diploidized. The Arachis and model legume genomes comparison made here, together with a previously published comparison of Lotus and Medicago allowed all possible Arachis-Lotus-Medicago species by species comparisons to be made and genome syntenies observed. Distinct conserved synteny blocks and non-conserved regions were present in all genome comparisons, implying that certain legume genomic regions are consistently more stable during evolution than others. We found that in Medicago and possibly also in Lotus, retrotransposons tend to be more frequent in the variable regions. Furthermore, while these variable regions generally have lower densities of single copy genes than the more conserved regions, some harbor high densities of the fast evolving disease resistance genes. Conclusion We suggest that gene space in Papilionoids may be divided into two broadly defined components: more conserved regions which tend to have low retrotransposon densities and are relatively stable during evolution; and variable regions that tend to have high retrotransposon densities, and whose frequent restructuring may fuel the evolution of some gene families. PMID:19166586
Echave, Julian; Wilke, Claus O.
2018-01-01
For decades, rates of protein evolution have been interpreted in terms of the vague concept of “functional importance”. Slowly evolving proteins or sites within proteins were assumed to be more functionally important and thus subject to stronger selection pressure. More recently, biophysical models of protein evolution, which combine evolutionary theory with protein biophysics, have completely revolutionized our view of the forces that shape sequence divergence. Slowly evolving proteins have been found to evolve slowly because of selection against toxic misfolding and misinteractions, linking their rate of evolution primarily to their abundance. Similarly, most slowly evolving sites in proteins are not directly involved in function, but mutating them has large impacts on protein structure and stability. Here, we review the studies of the emergent field of biophysical protein evolution that have shaped our current understanding of sequence divergence patterns. We also propose future research directions to develop this nascent field. PMID:28301766
Windsor, Aaron J.; Schranz, M. Eric; Formanová, Nataša; Gebauer-Jung, Steffi; Bishop, John G.; Schnabelrauch, Domenica; Kroymann, Juergen; Mitchell-Olds, Thomas
2006-01-01
Comparative genomics provides insight into the evolutionary dynamics that shape discrete sequences as well as whole genomes. To advance comparative genomics within the Brassicaceae, we have end sequenced 23,136 medium-sized insert clones from Boechera stricta, a wild relative of Arabidopsis (Arabidopsis thaliana). A significant proportion of these sequences, 18,797, are nonredundant and display highly significant similarity (BLASTn e-value ≤ 10−30) to low copy number Arabidopsis genomic regions, including more than 9,000 annotated coding sequences. We have used this dataset to identify orthologous gene pairs in the two species and to perform a global comparison of DNA regions 5′ to annotated coding regions. On average, the 500 nucleotides upstream to coding sequences display 71.4% identity between the two species. In a similar analysis, 61.4% identity was observed between 5′ noncoding sequences of Brassica oleracea and Arabidopsis, indicating that regulatory regions are not as diverged among these lineages as previously anticipated. By mapping the B. stricta end sequences onto the Arabidopsis genome, we have identified nearly 2,000 conserved blocks of microsynteny (bracketing 26% of the Arabidopsis genome). A comparison of fully sequenced B. stricta inserts to their homologous Arabidopsis genomic regions indicates that indel polymorphisms >5 kb contribute substantially to the genome size difference observed between the two species. Further, we demonstrate that microsynteny inferred from end-sequence data can be applied to the rapid identification and cloning of genomic regions of interest from nonmodel species. These results suggest that among diploid relatives of Arabidopsis, small- to medium-scale shotgun sequencing approaches can provide rapid and cost-effective benefits to evolutionary and/or functional comparative genomic frameworks. PMID:16607030
Antell, Gregory C.; Zhong, Wen; Kercher, Katherine; Passic, Shendra; Williams, Jean; Liu, Yucheng; James, Tony; Jacobson, Jeffrey M.; Szep, Zsofia
2017-01-01
Vpr is an HIV-1 accessory protein that plays numerous roles during viral replication, and some of which are cell type dependent. To test the hypothesis that HIV-1 tropism extends beyond the envelope into the vpr gene, studies were performed to identify the associations between coreceptor usage and Vpr variation in HIV-1-infected patients. Colinear HIV-1 Env-V3 and Vpr amino acid sequences were obtained from the LANL HIV-1 sequence database and from well-suppressed patients in the Drexel/Temple Medicine CNS AIDS Research and Eradication Study (CARES) Cohort. Genotypic classification of Env-V3 sequences as X4 (CXCR4-utilizing) or R5 (CCR5-utilizing) was used to group colinear Vpr sequences. To reveal the sequences associated with a specific coreceptor usage genotype, Vpr amino acid sequences were assessed for amino acid diversity and Jensen-Shannon divergence between the two groups. Five amino acid alphabets were used to comprehensively examine the impact of amino acid substitutions involving side chains with similar physiochemical properties. Positions 36, 37, 41, 89, and 96 of Vpr were characterized by statistically significant divergence across multiple alphabets when X4 and R5 sequence groups were compared. In addition, consensus amino acid switches were found at positions 37 and 41 in comparisons of the R5 and X4 sequence populations. These results suggest an evolutionary link between Vpr and gp120 in HIV-1-infected patients. PMID:28620613
NASA Astrophysics Data System (ADS)
Nallaseth, Ferez Soli
The Y-chromosome presents a unique cytogenetic framework for the evolution of nucleotide sequences. Alignment of nine Y-chromosomal fragments in their increasing Y-specific/non Y-specific (male/female) sequence divergence ratios was directly and inversely related to their interspersion on these two respective genomic fractions. Sequence analysis confirmed a direct relationship between divergence ratios and the Alu, LINE-1, Satellite and their derivative oligonucleotide contents. Thus their relocation on the Y-chromosome is followed by sequence divergence rather than the well documented concerted evolution of these non-coding progenitor repeated sequences. Five of the nine Y-chromosomal fragments are non-pseudoautosomal and transcribed into heterogeneous PolyA^+ RNA and thus can be retrotransposed. Evolutionary and computer analysis identified homologous oligonucleotide tracts in several human loci suggesting common and random mechanistic origins. Dysgenic genomes represent the accelerated evolution driving sequence divergence (McClintock, 1984). Sex reversal and sterility characterizing dysgenesis occurs in C57BL/6JY ^{rm Pos} but not in 129/SvY^{rm Pos} derivative strains. High frequency, random, multi-locus deletion products of the feral Y^{ rm Pos}-chromosome are generated in the germlines of F1(C57BL/6J X 129/SvY^{ rm Pos})(male) and C57BL/6JY ^{rm Pos}(male) but not in 129/SvY^{rm Pos}(male). Equal, 10^{-1}, 10^ {-2}, and 0 copies (relative to males) of Y^{rm Pos}-specific deletion products respectively characterize C57BL/6JY ^{rm Pos} (HC), (LC), (T) and (F) females. The testes determining loci of inactive Y^{rm Pos}-chromosomes in C57BL/6JY^{rm Pos} HC females are the preferentially deleted/rearranged Y ^{rm Pos}-sequences. Disruption of regulation of plasma testosterone and hepatic MUP-A mRNA levels, TRD of a 4.7 Kbp EcoR1 fragment suggest disruption of autosomal/X-chromosomal sequences. These data and the highly repeated progenitor (Alu, GATA, LINE-1) sequence content of deletion products confirmed the previously unidentified loss of genetic control of mammalian chromosome biology and hybrid dysgenesis.
Fernández, Helena; Hughes, Sandrine; Vigne, Jean-Denis; Helmer, Daniel; Hodgins, Greg; Miquel, Christian; Hänni, Catherine; Luikart, Gordon; Taberlet, Pierre
2006-01-01
Goats were among the first farm animals domesticated, ≈10,500 years ago, contributing to the rise of the “Neolithic revolution.” Previous genetic studies have revealed that contemporary domestic goats (Capra hircus) show far weaker intercontinental population structuring than other livestock species, suggesting that goats have been transported more extensively. However, the timing of these extensive movements in goats remains unknown. To address this question, we analyzed mtDNA sequences from 19 ancient goat bones (7,300–6,900 years old) from one of the earliest Neolithic sites in southwestern Europe. Phylogenetic analysis revealed that two highly divergent goat lineages coexisted in each of the two Early Neolithic layers of this site. This finding indicates that high mtDNA diversity was already present >7,000 years ago in European goats, far from their areas of initial domestication in the Near East. These results argue for substantial gene flow among goat populations dating back to the early neolithisation of Europe and for a dual domestication scenario in the Near East, with two independent but essentially contemporary origins (of both A and C domestic lineages) and several more remote and/or later origins. PMID:17030824
Weikard, Rosemarie; Demasius, Wiebke; Hadlich, Frieder; Kühn, Christa
2015-01-01
Periparturient cows have been found to reveal immunosuppression, frequently associated with increased susceptibility to uterine and mammary infections. To improve understanding of the causes and molecular regulatory mechanisms accounting for this phenomenon around calving, we examined the effect of an antigen challenge on gene expression modulation on cows prior to (BC) or after calving (AC) using whole transcriptome sequencing (RNAseq). The transcriptome analysis of the cows’ blood identified a substantially higher number of loci affected in BC cows (2,235) in response to vaccination compared to AC cows (208) and revealed a divergent transcriptional profile specific for each group. In BC cows, a variety of loci involved in immune defense and cellular signaling processes were transcriptionally activated, whereas protein biosynthesis and posttranslational processes were tremendously impaired in response to vaccination. Furthermore, energy metabolism in the blood cells of BC cows was shifted from oxidative phosphorylation to the glycolytic system. In AC cows, the number and variety of regulated pathways involved in immunomodulation and maintenance of immnunocompetence are considerably lower after vaccination, and upregulation of arginine degradation was suggested as an immunosuppressive mechanism. Elevated transcript levels of erythrocyte-specific genes involved in gas exchange processes were a specific transcriptional signature in AC cows pointing to hematopoiesis activation. The divergent and substantially lower magnitude of transcriptional modulation in response to vaccination in AC cows provides evidence for a suppressed immune capacity of early lactating cows on the molecular level and demonstrates that an efficient immune response of cows is related to their physiological and metabolic status. PMID:26317664
Iftikhar, Romana; Ashfaq, Muhammad; Rasool, Akhtar; Hebert, Paul D N
2016-01-01
Although thrips are globally important crop pests and vectors of viral disease, species identifications are difficult because of their small size and inconspicuous morphological differences. Sequence variation in the mitochondrial COI-5' (DNA barcode) region has proven effective for the identification of species in many groups of insect pests. We analyzed barcode sequence variation among 471 thrips from various plant hosts in north-central Pakistan. The Barcode Index Number (BIN) system assigned these sequences to 55 BINs, while the Automatic Barcode Gap Discovery detected 56 partitions, a count that coincided with the number of monophyletic lineages recognized by Neighbor-Joining analysis and Bayesian inference. Congeneric species showed an average of 19% sequence divergence (range = 5.6% - 27%) at COI, while intraspecific distances averaged 0.6% (range = 0.0% - 7.6%). BIN analysis suggested that all intraspecific divergence >3.0% actually involved a species complex. In fact, sequences for three major pest species (Haplothrips reuteri, Thrips palmi, Thrips tabaci), and one predatory thrips (Aeolothrips intermedius) showed deep intraspecific divergences, providing evidence that each is a cryptic species complex. The study compiles the first barcode reference library for the thrips of Pakistan, and examines global haplotype diversity in four important pest thrips.
Extensive concerted evolution of rice paralogs and the road to regaining independence.
Wang, Xiyin; Tang, Haibao; Bowers, John E; Feltus, Frank A; Paterson, Andrew H
2007-11-01
Many genes duplicated by whole-genome duplications (WGDs) are more similar to one another than expected. We investigated whether concerted evolution through conversion and crossing over, well-known to affect tandem gene clusters, also affects dispersed paralogs. Genome sequences for two Oryza subspecies reveal appreciable gene conversion in the approximately 0.4 MY since their divergence, with a gradual progression toward independent evolution of older paralogs. Since divergence from subspecies indica, approximately 8% of japonica paralogs produced 5-7 MYA on chromosomes 11 and 12 have been affected by gene conversion and several reciprocal exchanges of chromosomal segments, while approximately 70-MY-old "paleologs" resulting from a genome duplication (GD) show much less conversion. Sequence similarity analysis in proximal gene clusters also suggests more conversion between younger paralogs. About 8% of paleologs may have been converted since rice-sorghum divergence approximately 41 MYA. Domain-encoding sequences are more frequently converted than nondomain sequences, suggesting a sort of circularity--that sequences conserved by selection may be further conserved by relatively frequent conversion. The higher level of concerted evolution in the 5-7 MY-old segmental duplication may reflect the behavior of many genomes within the first few million years after duplication or polyploidization.
Comparative sequence analyses of sixteen reptilian paramyxoviruses
Ahne, W.; Batts, W.N.; Kurath, G.; Winton, J.R.
1999-01-01
Viral genomic RNA of Fer-de-Lance virus (FDLV), a paramyxovirus highly pathogenic for reptiles, was reverse transcribed and cloned. Plasmids with significant sequence similarities to the hemagglutinin-neuraminidase (HN) and polymerase (L) genes of mammalian paramyxoviruses were identified by BLAST search. Partial sequences of the FDLV genes were used to design primers for amplification by nested polymerase chain reaction (PCR) and sequencing of 518-bp L gene and 352-bp HN gene fragments from a collection of 15 previously uncharacterized reptilian paramyxoviruses. Phylogenetic analyses of the partial L and HN sequences produced similar trees in which there were two distinct subgroups of isolates that were supported with maximum bootstrap values, and several intermediate isolates. Within each subgroup the nucleotide divergence values were less than 2.5%, while the divergence between the two subgroups was 20-22%. This indicated that the two subgroups represent distinct virus species containing multiple virus strains. The five intermediate isolates had nucleotide divergence values of 11-20% and may represent additional distinct species. In addition to establishing diversity among reptilian paramyxoviruses, the phylogenetic groupings showed some correlation with geographic location, and clearly demonstrated a low level of host species-specificity within these viruses. Copyright (C) 1999 Elsevier Science B.V.
srRNA evolution and phylogenetic relationships of the genus Naegleria (Protista: Rhizopoda).
Baverstock, P R; Illana, S; Christy, P E; Robinson, B S; Johnson, A M
1989-05-01
A rapid RNA sequencing technique was used to partially sequence the small-subunit ribosomal RNA (srRNA) of four species of the amoeboid genus Naegleria. The extent of nucleotide sequence divergence between the two most divergent species was roughly similar to that found between mammals and frogs. However, the pattern of variation among the Naegleria species was quite different from that found for those species of tetrapods characterized to date. A phylogenetic analysis of the consensus Naegleria sequence showed that Naegleria was not monophyletic with either Acanthamoeba castellanii or Dictyostelium discoideum, two other amoebas for which sequences were available. It was shown that the semiconserved regions of the srRNA molecule evolve in a clocklike fashion and that the clock is time dependent rather than generation dependent.
2015-01-01
Culex pipiens, an invasive mosquito and vector of West Nile virus in the US, has two morphologically indistinguishable forms that differ dramatically in behavior and physiology. Cx. pipiens form pipiens is primarily a bird-feeding temperate mosquito, while the sub-tropical Cx. pipiens form molestus thrives in sewers and feeds on mammals. Because the feral form can diapause during the cold winters but the domestic form cannot, the two Cx. pipiens forms are allopatric in northern Europe and, although viable, hybrids are rare. Cx. pipiens form molestus has spread across all inhabited continents and hybrids of the two forms are common in the US. Here we elucidate the genes and gene families with the greatest divergence rates between these phenotypically diverged mosquito populations, and discuss them in light of their potential biological and ecological effects. After generating and assembling novel transcriptome data for each population, we performed pairwise tests for nonsynonymous divergence (Ka) of homologous coding sequences and examined gene ontology terms that were statistically over-represented in those sequences with the greatest divergence rates. We identified genes involved in digestion (serine endopeptidases), innate immunity (fibrinogens and α-macroglobulins), hemostasis (D7 salivary proteins), olfaction (odorant binding proteins) and chitin binding (peritrophic matrix proteins). By examining molecular divergence between closely related yet phenotypically divergent forms of the same species, our results provide insights into the identity of rapidly-evolving genes between incipient species. Additionally, we found that families of signal transducers, ATP synthases and transcription regulators remained identical at the amino acid level, thus constituting conserved components of the Cx. pipiens proteome. We provide a reference with which to gauge the divergence reported in this analysis by performing a comparison of transcriptome sequences from conspecific (yet allopatric) populations of another member of the Cx. pipiens complex, Cx. quinquefasciatus. PMID:25755934
[Hepatitis C virus: sequence homology of a European isolate and divergence from the prototype].
Seelig, R; Seelig, H P; Renz, M
1991-08-01
The polymerase chain reaction (PCR) detected specific hepatitis C viral (HCV) RNA sequences in liver biopsies from two patients with chronic hepatitis, in the tissue of a liver implantate, in plasma from four chronic non-A, non-B hepatitis (NANBH) patients and, for the first time, in an infectious anti-D-immunoglobulin preparation. A comparison of the viral sequences coding for a region for the nonstructural NS3 protein from the liver tissues revealed only a very small degree of sequence divergence on the cDNA as well as on the amino acid level (between 0 and 5%). The sequence similarities of the RNA isolated from plasma of the four chronic NANBH patients and the anti-D-immunoglobulin preparation were partly somewhat lower but altogether also high (between 90 and 100%). In contrast, all eight cDNA and amino acid sequences exhibited a significantly higher degree of divergence in comparison with the HCV prototype sequence (between 29 and 32%) than among themselves (between 0 and 10%). This unexpected high sequence similarity of the eight European isolates and their low homology to the Northamerican prototype sequence is indicative for the existence of different types of HCV. This will be important not only for epidemiological studies but also for the development of effective diagnostic procedures and vaccines. Concerning the pathogenesis of NANBH, a double infection or a helper mechanism has to be considered: in addition to the C virus, sequences of an other virus particle were found in the infectious IgG preparation as well as in the liver biopsies.
Horai, S; Hayasaka, K; Kondo, R; Tsugane, K; Takahata, N
1995-01-01
We analyzed the complete mitochondrial DNA (mtDNA) sequences of three humans (African, European, and Japanese), three African apes (common and pygmy chimpanzees, and gorilla), and one orangutan in an attempt to estimate most accurately the substitution rates and divergence times of hominoid mtDNAs. Nonsynonymous substitutions and substitutions in RNA genes have accumulated with an approximately clock-like regularity. From these substitutions and under the assumption that the orangutan and African apes diverged 13 million years ago, we obtained a divergence time for humans and chimpanzees of 4.9 million years. This divergence time permitted calibration of the synonymous substitution rate (3.89 x 10(-8)/site per year). To obtain the substitution rate in the displacement (D)-loop region, we compared the three human mtDNAs and measured the relative abundance of substitutions in the D-loop region and at synonymous sites. The estimated substitution rate in the D-loop region was 7.00 x 10(-8)/site per year. Using both synonymous and D-loop substitutions, we inferred the age of the last common ancestor of the human mtDNAs as 143,000 +/- 18,000 years. The shallow ancestry of human mtDNAs, together with the observation that the African sequence is the most diverged among humans, strongly supports the recent African origin of modern humans, Homo sapiens sapiens. PMID:7530363
Population genomics of parallel hybrid zones in the mimetic butterflies, H. melpomene and H. erato
Ruiz, Mayté; Salazar, Patricio; Counterman, Brian; Medina, Jose Alejandro; Ortiz-Zuazaga, Humberto; Morrison, Anna; Papa, Riccardo
2014-01-01
Hybrid zones can be valuable tools for studying evolution and identifying genomic regions responsible for adaptive divergence and underlying phenotypic variation. Hybrid zones between subspecies of Heliconius butterflies can be very narrow and are maintained by strong selection acting on color pattern. The comimetic species, H. erato and H. melpomene, have parallel hybrid zones in which both species undergo a change from one color pattern form to another. We use restriction-associated DNA sequencing to obtain several thousand genome-wide sequence markers and use these to analyze patterns of population divergence across two pairs of parallel hybrid zones in Peru and Ecuador. We compare two approaches for analysis of this type of data—alignment to a reference genome and de novo assembly—and find that alignment gives the best results for species both closely (H. melpomene) and distantly (H. erato, ∼15% divergent) related to the reference sequence. Our results confirm that the color pattern controlling loci account for the majority of divergent regions across the genome, but we also detect other divergent regions apparently unlinked to color pattern differences. We also use association mapping to identify previously unmapped color pattern loci, in particular the Ro locus. Finally, we identify a new cryptic population of H. timareta in Ecuador, which occurs at relatively low altitude and is mimetic with H. melpomene malleti. PMID:24823669
Chloroplast Genome Evolution in Early Diverged Leptosporangiate Ferns
Kim, Hyoung Tae; Chung, Myong Gi; Kim, Ki-Joong
2014-01-01
In this study, the chloroplast (cp) genome sequences from three early diverged leptosporangiate ferns were completed and analyzed in order to understand the evolution of the genome of the fern lineages. The complete cp genome sequence of Osmunda cinnamomea (Osmundales) was 142,812 base pairs (bp). The cp genome structure was similar to that of eusporangiate ferns. The gene/intron losses that frequently occurred in the cp genome of leptosporangiate ferns were not found in the cp genome of O. cinnamomea. In addition, putative RNA editing sites in the cp genome were rare in O. cinnamomea, even though the sites were frequently predicted to be present in leptosporangiate ferns. The complete cp genome sequence of Diplopterygium glaucum (Gleicheniales) was 151,007 bp and has a 9.7 kb inversion between the trnL-CAA and trnV-GCA genes when compared to O. cinnamomea. Several repeated sequences were detected around the inversion break points. The complete cp genome sequence of Lygodium japonicum (Schizaeales) was 157,142 bp and a deletion of the rpoC1 intron was detected. This intron loss was shared by all of the studied species of the genus Lygodium. The GC contents and the effective numbers of co-dons (ENCs) in ferns varied significantly when compared to seed plants. The ENC values of the early diverged leptosporangiate ferns showed intermediate levels between eusporangiate and core leptosporangiate ferns. However, our phylogenetic tree based on all of the cp gene sequences clearly indicated that the cp genome similarity between O. cinnamomea (Osmundales) and eusporangiate ferns are symplesiomorphies, rather than synapomorphies. Therefore, our data is in agreement with the view that Osmundales is a distinct early diverged lineage in the leptosporangiate ferns. PMID:24823358
Chloroplast genome evolution in early diverged leptosporangiate ferns.
Kim, Hyoung Tae; Chung, Myong Gi; Kim, Ki-Joong
2014-05-01
In this study, the chloroplast (cp) genome sequences from three early diverged leptosporangiate ferns were completed and analyzed in order to understand the evolution of the genome of the fern lineages. The complete cp genome sequence of Osmunda cinnamomea (Osmundales) was 142,812 base pairs (bp). The cp genome structure was similar to that of eusporangiate ferns. The gene/intron losses that frequently occurred in the cp genome of leptosporangiate ferns were not found in the cp genome of O. cinnamomea. In addition, putative RNA editing sites in the cp genome were rare in O. cinnamomea, even though the sites were frequently predicted to be present in leptosporangiate ferns. The complete cp genome sequence of Diplopterygium glaucum (Gleicheniales) was 151,007 bp and has a 9.7 kb inversion between the trnL-CAA and trnVGCA genes when compared to O. cinnamomea. Several repeated sequences were detected around the inversion break points. The complete cp genome sequence of Lygodium japonicum (Schizaeales) was 157,142 bp and a deletion of the rpoC1 intron was detected. This intron loss was shared by all of the studied species of the genus Lygodium. The GC contents and the effective numbers of codons (ENCs) in ferns varied significantly when compared to seed plants. The ENC values of the early diverged leptosporangiate ferns showed intermediate levels between eusporangiate and core leptosporangiate ferns. However, our phylogenetic tree based on all of the cp gene sequences clearly indicated that the cp genome similarity between O. cinnamomea (Osmundales) and eusporangiate ferns are symplesiomorphies, rather than synapomorphies. Therefore, our data is in agreement with the view that Osmundales is a distinct early diverged lineage in the leptosporangiate ferns.
Fourment, Mathieu; Holmes, Edward C
2014-07-24
Early methods for estimating divergence times from gene sequence data relied on the assumption of a molecular clock. More sophisticated methods were created to model rate variation and used auto-correlation of rates, local clocks, or the so called "uncorrelated relaxed clock" where substitution rates are assumed to be drawn from a parametric distribution. In the case of Bayesian inference methods the impact of the prior on branching times is not clearly understood, and if the amount of data is limited the posterior could be strongly influenced by the prior. We develop a maximum likelihood method--Physher--that uses local or discrete clocks to estimate evolutionary rates and divergence times from heterochronous sequence data. Using two empirical data sets we show that our discrete clock estimates are similar to those obtained by other methods, and that Physher outperformed some methods in the estimation of the root age of an influenza virus data set. A simulation analysis suggests that Physher can outperform a Bayesian method when the real topology contains two long branches below the root node, even when evolution is strongly clock-like. These results suggest it is advisable to use a variety of methods to estimate evolutionary rates and divergence times from heterochronous sequence data. Physher and the associated data sets used here are available online at http://code.google.com/p/physher/.
Lashbrook, C C; Gonzalez-Bosch, C; Bennett, A B
1994-01-01
Two structurally divergent endo-beta-1,4-glucanase (EGase) cDNAs were cloned from tomato. Although both cDNAs (Cel1 and Cel2) encode potentially glycosylated, basic proteins of 51 to 53 kD and possess multiple amino acid domains conserved in both plant and microbial EGases, Cel1 and Cel2 exhibit only 50% amino acid identity at the overall sequence level. Amino acid sequence comparisons to other plant EGases indicate that tomato Cel1 is most similar to bean abscission zone EGase (68%), whereas Cel2 exhibits greatest sequence identity to avocado fruit EGase (57%). Sequence comparisons suggest the presence of at least two structurally divergent EGase families in plants. Unlike ripening avocado fruit and bean abscission zones in which a single EGase mRNA predominates, EGase expression in tomato reflects the overlapping accumulation of both Cel1 and Cel2 transcripts in ripening fruit and in plant organs undergoing cell separation. Cel1 mRNA contributes significantly to total EGase mRNA accumulation within plant organs undergoing cell separation (abscission zones and mature anthers), whereas Cel2 mRNA is most abundant in ripening fruit. The overlapping expression of divergent EGase genes within a single species may suggest that multiple activities are required for the cooperative disassembly of cell wall components during fruit ripening, floral abscission, and anther dehiscence. PMID:7994180
A DNA Barcode Library for North American Ephemeroptera: Progress and Prospects
Webb, Jeffrey M.; Jacobus, Luke M.; Funk, David H.; Zhou, Xin; Kondratieff, Boris; Geraci, Christy J.; DeWalt, R. Edward; Baird, Donald J.; Richard, Barton; Phillips, Iain; Hebert, Paul D. N.
2012-01-01
DNA barcoding of aquatic macroinvertebrates holds much promise as a tool for taxonomic research and for providing the reliable identifications needed for water quality assessment programs. A prerequisite for identification using barcodes is a reliable reference library. We gathered 4165 sequences from the barcode region of the mitochondrial cytochrome c oxidase subunit I gene representing 264 nominal and 90 provisional species of mayflies (Insecta: Ephemeroptera) from Canada, Mexico, and the United States. No species shared barcode sequences and all can be identified with barcodes with the possible exception of some Caenis. Minimum interspecific distances ranged from 0.3–24.7% (mean: 12.5%), while the average intraspecific divergence was 1.97%. The latter value was inflated by the presence of very high divergences in some taxa. In fact, nearly 20% of the species included two or three haplotype clusters showing greater than 5.0% sequence divergence and some values are as high as 26.7%. Many of the species with high divergences are polyphyletic and likely represent species complexes. Indeed, many of these polyphyletic species have numerous synonyms and individuals in some barcode clusters show morphological attributes characteristic of the synonymized species. In light of our findings, it is imperative that type or topotype specimens be sequenced to correctly associate barcode clusters with morphological species concepts and to determine the status of currently synonymized species. PMID:22666447
Czesny, Sergiusz; Epifanio, John; Michalak, Pawel
2012-01-01
Alewife Alosa pseudoharengus, a small clupeid fish native to Atlantic Ocean, has recently (∼150 years ago) invaded the North American Great Lakes and despite challenges of freshwater environment its populations exploded and disrupted local food web structures. This range expansion has been accompanied by dramatic changes at all levels of organization. Growth rates, size at maturation, or fecundity are only a few of the most distinct morphological and life history traits that contrast the two alewife morphs. A question arises to what extent these rapidly evolving differences between marine and freshwater varieties result from regulatory (including phenotypic plasticity) or structural mutations. To gain insights into expression changes and sequence divergence between marine and freshwater alewives, we sequenced transcriptomes of individuals from Lake Michigan and Atlantic Ocean. Population specific single nucleotide polymorphisms were rare but interestingly occurred in sequences of genes that also tended to show large differences in expression. Our results show that the striking phenotypic divergence between anadromous and lake alewives can be attributed to massive regulatory modifications rather than coding changes.
Czesny, Sergiusz; Epifanio, John; Michalak, Pawel
2012-01-01
Alewife Alosa pseudoharengus, a small clupeid fish native to Atlantic Ocean, has recently (∼150 years ago) invaded the North American Great Lakes and despite challenges of freshwater environment its populations exploded and disrupted local food web structures. This range expansion has been accompanied by dramatic changes at all levels of organization. Growth rates, size at maturation, or fecundity are only a few of the most distinct morphological and life history traits that contrast the two alewife morphs. A question arises to what extent these rapidly evolving differences between marine and freshwater varieties result from regulatory (including phenotypic plasticity) or structural mutations. To gain insights into expression changes and sequence divergence between marine and freshwater alewives, we sequenced transcriptomes of individuals from Lake Michigan and Atlantic Ocean. Population specific single nucleotide polymorphisms were rare but interestingly occurred in sequences of genes that also tended to show large differences in expression. Our results show that the striking phenotypic divergence between anadromous and lake alewives can be attributed to massive regulatory modifications rather than coding changes. PMID:22438868
Lischer, Heidi E L; Excoffier, Laurent; Heckel, Gerald
2014-04-01
Phylogenetic reconstruction of the evolutionary history of closely related organisms may be difficult because of the presence of unsorted lineages and of a relatively high proportion of heterozygous sites that are usually not handled well by phylogenetic programs. Genomic data may provide enough fixed polymorphisms to resolve phylogenetic trees, but the diploid nature of sequence data remains analytically challenging. Here, we performed a phylogenomic reconstruction of the evolutionary history of the common vole (Microtus arvalis) with a focus on the influence of heterozygosity on the estimation of intraspecific divergence times. We used genome-wide sequence information from 15 voles distributed across the European range. We provide a novel approach to integrate heterozygous information in existing phylogenetic programs by repeated random haplotype sampling from sequences with multiple unphased heterozygous sites. We evaluated the impact of the use of full, partial, or no heterozygous information for tree reconstructions on divergence time estimates. All results consistently showed four deep and strongly supported evolutionary lineages in the vole data. These lineages undergoing divergence processes split only at the end or after the last glacial maximum based on calibration with radiocarbon-dated paleontological material. However, the incorporation of information from heterozygous sites had a significant impact on absolute and relative branch length estimations. Ignoring heterozygous information led to an overestimation of divergence times between the evolutionary lineages of M. arvalis. We conclude that the exclusion of heterozygous sites from evolutionary analyses may cause biased and misleading divergence time estimates in closely related taxa.
Hirata, Daisuke; Mano, Tsutomu; Abramov, Alexei V; Baryshnikov, Gennady F; Kosintsev, Pavel A; Vorobiev, Alexandr A; Raichev, Evgeny G; Tsunoda, Hiroshi; Kaneko, Yayoi; Murata, Koichi; Fukui, Daisuke; Masuda, Ryuichi
2013-07-01
To further elucidate the migration history of the brown bears (Ursus arctos) on Hokkaido Island, Japan, we analyzed the complete mitochondrial DNA (mtDNA) sequences of 35 brown bears from Hokkaido, the southern Kuril Islands (Etorofu and Kunashiri), Sakhalin Island, and the Eurasian Continent (continental Russia, Bulgaria, and Tibet), and those of four polar bears. Based on these sequences, we reconstructed the maternal phylogeny of the brown bear and estimated divergence times to investigate the timing of brown bear migrations, especially in northeastern Eurasia. Our gene tree showed the mtDNA haplotypes of all 73 brown and polar bears to be divided into eight divergent lineages. The brown bear on Hokkaido was divided into three lineages (central, eastern, and southern). The Sakhalin brown bear grouped with eastern European and western Alaskan brown bears. Etorofu and Kunashiri brown bears were closely related to eastern Hokkaido brown bears and could have diverged from the eastern Hokkaido lineage after formation of the channel between Hokkaido and the southern Kuril Islands. Tibetan brown bears diverged early in the eastern lineage. Southern Hokkaido brown bears were closely related to North American brown bears.
Pohl, Nélida; Sison-Mangus, Marilou P; Yee, Emily N; Liswi, Saif W; Briscoe, Adriana D
2009-01-01
Background The increase in availability of genomic sequences for a wide range of organisms has revealed gene duplication to be a relatively common event. Encounters with duplicate gene copies have consequently become almost inevitable in the context of collecting gene sequences for inferring species trees. Here we examine the effect of incorporating duplicate gene copies evolving at different rates on tree reconstruction and time estimation of recent and deep divergences in butterflies. Results Sequences from ultraviolet-sensitive (UVRh), blue-sensitive (BRh), and long-wavelength sensitive (LWRh) opsins,EF-1α and COI were obtained from 27 taxa representing the five major butterfly families (5535 bp total). Both BRh and LWRh are present in multiple copies in some butterfly lineages and the different copies evolve at different rates. Regardless of the phylogenetic reconstruction method used, we found that analyses of combined data sets using either slower or faster evolving copies of duplicate genes resulted in a single topology in agreement with our current understanding of butterfly family relationships based on morphology and molecules. Interestingly, individual analyses of BRh and LWRh sequences also recovered these family-level relationships. Two different relaxed clock methods resulted in similar divergence time estimates at the shallower nodes in the tree, regardless of whether faster or slower evolving copies were used, with larger discrepancies observed at deeper nodes in the phylogeny. The time of divergence between the monarch butterfly Danaus plexippus and the queen D. gilippus (15.3–35.6 Mya) was found to be much older than the time of divergence between monarch co-mimic Limenitis archippus and red-spotted purple L. arthemis (4.7–13.6 Mya), and overlapping with the time of divergence of the co-mimetic passionflower butterflies Heliconius erato and H. melpomene (13.5–26.1 Mya). Our family-level results are congruent with recent estimates found in the literature and indicate an age of 84–113 million years for the divergence of all butterfly families. Conclusion These results are consistent with diversification of the butterfly families following the radiation of angiosperms and suggest that some classes of opsin genes may be usefully employed for both phylogenetic reconstruction and divergence time estimation. PMID:19439087
Chan, Yvonne H.; Venev, Sergey V.; Zeldovich, Konstantin B.; Matthews, C. Robert
2017-01-01
Sequence divergence of orthologous proteins enables adaptation to environmental stresses and promotes evolution of novel functions. Limits on evolution imposed by constraints on sequence and structure were explored using a model TIM barrel protein, indole-3-glycerol phosphate synthase (IGPS). Fitness effects of point mutations in three phylogenetically divergent IGPS proteins during adaptation to temperature stress were probed by auxotrophic complementation of yeast with prokaryotic, thermophilic IGPS. Analysis of beneficial mutations pointed to an unexpected, long-range allosteric pathway towards the active site of the protein. Significant correlations between the fitness landscapes of distant orthologues implicate both sequence and structure as primary forces in defining the TIM barrel fitness landscape and suggest that fitness landscapes can be translocated in sequence space. Exploration of fitness landscapes in the context of a protein fold provides a strategy for elucidating the sequence-structure-fitness relationships in other common motifs. PMID:28262665
Generalization of Entropy Based Divergence Measures for Symbolic Sequence Analysis
Ré, Miguel A.; Azad, Rajeev K.
2014-01-01
Entropy based measures have been frequently used in symbolic sequence analysis. A symmetrized and smoothed form of Kullback-Leibler divergence or relative entropy, the Jensen-Shannon divergence (JSD), is of particular interest because of its sharing properties with families of other divergence measures and its interpretability in different domains including statistical physics, information theory and mathematical statistics. The uniqueness and versatility of this measure arise because of a number of attributes including generalization to any number of probability distributions and association of weights to the distributions. Furthermore, its entropic formulation allows its generalization in different statistical frameworks, such as, non-extensive Tsallis statistics and higher order Markovian statistics. We revisit these generalizations and propose a new generalization of JSD in the integrated Tsallis and Markovian statistical framework. We show that this generalization can be interpreted in terms of mutual information. We also investigate the performance of different JSD generalizations in deconstructing chimeric DNA sequences assembled from bacterial genomes including that of E. coli, S. enterica typhi, Y. pestis and H. influenzae. Our results show that the JSD generalizations bring in more pronounced improvements when the sequences being compared are from phylogenetically proximal organisms, which are often difficult to distinguish because of their compositional similarity. While small but noticeable improvements were observed with the Tsallis statistical JSD generalization, relatively large improvements were observed with the Markovian generalization. In contrast, the proposed Tsallis-Markovian generalization yielded more pronounced improvements relative to the Tsallis and Markovian generalizations, specifically when the sequences being compared arose from phylogenetically proximal organisms. PMID:24728338
NASA Technical Reports Server (NTRS)
Romano, Laura A.; Wray, Gregory A.
2003-01-01
Evolutionary changes in transcriptional regulation undoubtedly play an important role in creating morphological diversity. However, there is little information about the evolutionary dynamics of cis-regulatory sequences. This study examines the functional consequence of evolutionary changes in the Endo16 promoter of sea urchins. The Endo16 gene encodes a large extracellular protein that is expressed in the endoderm and may play a role in cell adhesion. Its promoter has been characterized in exceptional detail in the purple sea urchin, Strongylocentrotus purpuratus. We have characterized the structure and function of the Endo16 promoter from a second sea urchin species, Lytechinus variegatus. The Endo16 promoter sequences have evolved in a strongly mosaic manner since these species diverged approximately 35 million years ago: the most proximal region (module A) is conserved, but the remaining modules (B-G) are unalignable. Despite extensive divergence in promoter sequences, the pattern of Endo16 transcription is largely conserved during embryonic and larval development. Transient expression assays demonstrate that 2.2 kb of upstream sequence in either species is sufficient to drive GFP reporter expression that correctly mimics this pattern of Endo16 transcription. Reciprocal cross-species transient expression assays imply that changes have also evolved in the set of transcription factors that interact with the Endo16 promoter. Taken together, these results suggest that stabilizing selection on the transcriptional output may have operated to maintain a similar pattern of Endo16 expression in S. purpuratus and L. variegatus, despite dramatic divergence in promoter sequence and mechanisms of transcriptional regulation.
Generalization of entropy based divergence measures for symbolic sequence analysis.
Ré, Miguel A; Azad, Rajeev K
2014-01-01
Entropy based measures have been frequently used in symbolic sequence analysis. A symmetrized and smoothed form of Kullback-Leibler divergence or relative entropy, the Jensen-Shannon divergence (JSD), is of particular interest because of its sharing properties with families of other divergence measures and its interpretability in different domains including statistical physics, information theory and mathematical statistics. The uniqueness and versatility of this measure arise because of a number of attributes including generalization to any number of probability distributions and association of weights to the distributions. Furthermore, its entropic formulation allows its generalization in different statistical frameworks, such as, non-extensive Tsallis statistics and higher order Markovian statistics. We revisit these generalizations and propose a new generalization of JSD in the integrated Tsallis and Markovian statistical framework. We show that this generalization can be interpreted in terms of mutual information. We also investigate the performance of different JSD generalizations in deconstructing chimeric DNA sequences assembled from bacterial genomes including that of E. coli, S. enterica typhi, Y. pestis and H. influenzae. Our results show that the JSD generalizations bring in more pronounced improvements when the sequences being compared are from phylogenetically proximal organisms, which are often difficult to distinguish because of their compositional similarity. While small but noticeable improvements were observed with the Tsallis statistical JSD generalization, relatively large improvements were observed with the Markovian generalization. In contrast, the proposed Tsallis-Markovian generalization yielded more pronounced improvements relative to the Tsallis and Markovian generalizations, specifically when the sequences being compared arose from phylogenetically proximal organisms.
Yasukochi, Yoshiki; Satta, Yoko
2014-05-02
An extraordinary diversity of amino acid sequences in the peptide-binding region (PBR) of human leukocyte antigen [HLA; human major histocompatibility complex (MHC)] molecules has been maintained by balancing selection. The process of accumulation of amino acid diversity in the PBR for six HLA genes (HLA-A, B, C, DRB1, DQB1, and DPB1) shows that the number of amino acid substitutions in the PBR among alleles does not linearly correlate with the divergence time of alleles at the six HLA loci. At these loci, some pairs of alleles show significantly less nonsynonymous substitutions at the PBR than expected from the divergence time. The same phenomenon was observed not only in the HLA but also in the rat MHC. To identify the cause for this, DRB1 sequences, a representative case of a typical nonlinear pattern of substitutions, were examined. When the amino acid substitutions in the PBR were placed with maximum parsimony on a maximum likelihood tree based on the non-PBR substitutions, heterogeneous rates of nonsynonymous substitutions in the PBR were observed on several branches. A computer simulation supported the hypothesis that allelic pairs with low PBR substitution rates were responsible for the stagnation of accumulation of PBR nonsynonymous substitutions. From these observations, we conclude that the nonsynonymous substitution rate at the PBR sites is not constant among the allelic lineages. The deceleration of the rate may be caused by the coexistence of certain pathogens for a substantially long time during HLA evolution. Copyright © 2014 Yasukochi and Satta.
Beaudet, Denis; Terrat, Yves; Halary, Sébastien; de la Providencia, Ivan Enrique; Hijri, Mohamed
2013-01-01
Comparative mitochondrial genomics of arbuscular mycorrhizal fungi (AMF) provide new avenues to overcome long-lasting obstacles that have hampered studies aimed at understanding the community structure, diversity, and evolution of these multinucleated and genetically polymorphic organisms.AMF mitochondrial (mt) genomes are homogeneous within isolates, and their intergenic regions harbor numerous mobile elements that have rapidly diverged, including homing endonuclease genes, small inverted repeats, and plasmid-related DNA polymerase genes (dpo), making them suitable targets for the development of reliable strain-specific markers. However, these elements may also lead to genome rearrangements through homologous recombination, although this has never previously been reported in this group of obligate symbiotic fungi. To investigate whether such rearrangements are present and caused by mobile elements in AMF, the mitochondrial genomes from two Glomeraceae members (i.e., Glomus cerebriforme and Glomus sp.) with substantial mtDNA synteny divergence,were sequenced and compared with available glomeromycotan mitochondrial genomes. We used an extensive nucleotide/protein similarity network-based approach to investigated podiversity in AMF as well as in other organisms for which sequences are publicly available. We provide strong evidence of dpo-induced inter-haplotype recombination, leading to a reshuffled mitochondrial genome in Glomus sp. These findings raise questions as to whether AMF single spore cultivations artificially underestimate mtDNA genetic diversity.We assessed potential dpo dispersal mechanisms in AMF and inferred a robust phylogenetic relationship with plant mitochondrial plasmids. Along with other indirect evidence, our analyses indicate that members of the Glomeromycota phylum are potential donors of mitochondrial plasmids to plants.
Beaudet, Denis; Terrat, Yves; Halary, Sébastien; de la Providencia, Ivan Enrique; Hijri, Mohamed
2013-01-01
Comparative mitochondrial genomics of arbuscular mycorrhizal fungi (AMF) provide new avenues to overcome long-lasting obstacles that have hampered studies aimed at understanding the community structure, diversity, and evolution of these multinucleated and genetically polymorphic organisms. AMF mitochondrial (mt) genomes are homogeneous within isolates, and their intergenic regions harbor numerous mobile elements that have rapidly diverged, including homing endonuclease genes, small inverted repeats, and plasmid-related DNA polymerase genes (dpo), making them suitable targets for the development of reliable strain-specific markers. However, these elements may also lead to genome rearrangements through homologous recombination, although this has never previously been reported in this group of obligate symbiotic fungi. To investigate whether such rearrangements are present and caused by mobile elements in AMF, the mitochondrial genomes from two Glomeraceae members (i.e., Glomus cerebriforme and Glomus sp.) with substantial mtDNA synteny divergence, were sequenced and compared with available glomeromycotan mitochondrial genomes. We used an extensive nucleotide/protein similarity network-based approach to investigate dpo diversity in AMF as well as in other organisms for which sequences are publicly available. We provide strong evidence of dpo-induced inter-haplotype recombination, leading to a reshuffled mitochondrial genome in Glomus sp. These findings raise questions as to whether AMF single spore cultivations artificially underestimate mtDNA genetic diversity. We assessed potential dpo dispersal mechanisms in AMF and inferred a robust phylogenetic relationship with plant mitochondrial plasmids. Along with other indirect evidence, our analyses indicate that members of the Glomeromycota phylum are potential donors of mitochondrial plasmids to plants. PMID:23925788
Jeukens, Julie; Bernatchez, Louis
2012-01-01
While gene expression divergence is known to be involved in adaptive phenotypic divergence and speciation, the relative importance of regulatory and structural evolution of genes is poorly understood. A recent next-generation sequencing experiment allowed identifying candidate genes potentially involved in the ongoing speciation of sympatric dwarf and normal lake whitefish (Coregonus clupeaformis), such as cytosolic malate dehydrogenase (MDH1), which showed both significant expression and sequence divergence. The main goal of this study was to investigate into more details the signatures of natural selection in the regulatory and coding sequences of MDH1 in lake whitefish and test for parallelism of these signatures with other coregonine species. Sequencing of the two regions in 118 fish from four sympatric pairs of whitefish and two cisco species revealed a total of 35 single nucleotide polymorphisms (SNPs), with more genetic diversity in European compared to North American coregonine species. While the coding region was found to be under purifying selection, an SNP in the proximal promoter exhibited significant allele frequency divergence in a parallel manner among independent sympatric pairs of North American lake whitefish and European whitefish (C. lavaretus). According to transcription factor binding simulation for 22 regulatory haplotypes of MDH1, putative binding profiles were fairly conserved among species, except for the region around this SNP. Moreover, we found evidence for the role of this SNP in the regulation of MDH1 expression level. Overall, these results provide further evidence for the role of natural selection in gene regulation evolution among whitefish species pairs and suggest its possible link with patterns of phenotypic diversity observed in coregonine species. PMID:22408741
Jeukens, Julie; Bernatchez, Louis
2012-01-01
While gene expression divergence is known to be involved in adaptive phenotypic divergence and speciation, the relative importance of regulatory and structural evolution of genes is poorly understood. A recent next-generation sequencing experiment allowed identifying candidate genes potentially involved in the ongoing speciation of sympatric dwarf and normal lake whitefish (Coregonus clupeaformis), such as cytosolic malate dehydrogenase (MDH1), which showed both significant expression and sequence divergence. The main goal of this study was to investigate into more details the signatures of natural selection in the regulatory and coding sequences of MDH1 in lake whitefish and test for parallelism of these signatures with other coregonine species. Sequencing of the two regions in 118 fish from four sympatric pairs of whitefish and two cisco species revealed a total of 35 single nucleotide polymorphisms (SNPs), with more genetic diversity in European compared to North American coregonine species. While the coding region was found to be under purifying selection, an SNP in the proximal promoter exhibited significant allele frequency divergence in a parallel manner among independent sympatric pairs of North American lake whitefish and European whitefish (C. lavaretus). According to transcription factor binding simulation for 22 regulatory haplotypes of MDH1, putative binding profiles were fairly conserved among species, except for the region around this SNP. Moreover, we found evidence for the role of this SNP in the regulation of MDH1 expression level. Overall, these results provide further evidence for the role of natural selection in gene regulation evolution among whitefish species pairs and suggest its possible link with patterns of phenotypic diversity observed in coregonine species.
Expression Divergence Is Correlated with Sequence Evolution but Not Positive Selection in Conifers.
Hodgins, Kathryn A; Yeaman, Sam; Nurkowski, Kristin A; Rieseberg, Loren H; Aitken, Sally N
2016-06-01
The evolutionary and genomic determinants of sequence evolution in conifers are poorly understood, and previous studies have found only limited evidence for positive selection. Using RNAseq data, we compared gene expression profiles to patterns of divergence and polymorphism in 44 seedlings of lodgepole pine (Pinus contorta) and 39 seedlings of interior spruce (Picea glauca × engelmannii) to elucidate the evolutionary forces that shape their genomes and their plastic responses to abiotic stress. We found that rapidly diverging genes tend to have greater expression divergence, lower expression levels, reduced levels of synonymous site diversity, and longer proteins than slowly diverging genes. Similar patterns were identified for the untranslated regions, but with some exceptions. We found evidence that genes with low expression levels had a larger fraction of nearly neutral sites, suggesting a primary role for negative selection in determining the association between evolutionary rate and expression level. There was limited evidence for differences in the rate of positive selection among genes with divergent versus conserved expression profiles and some evidence supporting relaxed selection in genes diverging in expression between the species. Finally, we identified a small number of genes that showed evidence of site-specific positive selection using divergence data alone. However, estimates of the proportion of sites fixed by positive selection (α) were in the range of other plant species with large effective population sizes suggesting relatively high rates of adaptive divergence among conifers. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Patterns and rates of intron divergence between humans and chimpanzees
Gazave, Elodie; Marqués-Bonet, Tomàs; Fernando, Olga; Charlesworth, Brian; Navarro, Arcadi
2007-01-01
Background Introns, which constitute the largest fraction of eukaryotic genes and which had been considered to be neutral sequences, are increasingly acknowledged as having important functions. Several studies have investigated levels of evolutionary constraint along introns and across classes of introns of different length and location within genes. However, thus far these studies have yielded contradictory results. Results We present the first analysis of human-chimpanzee intron divergence, in which differences in the number of substitutions per intronic site (Ki) can be interpreted as the footprint of different intensities and directions of the pressures of natural selection. Our main findings are as follows: there was a strong positive correlation between intron length and divergence; there was a strong negative correlation between intron length and GC content; and divergence rates vary along introns and depending on their ordinal position within genes (for instance, first introns are more GC rich, longer and more divergent, and divergence is lower at the 3' and 5' ends of all types of introns). Conclusion We show that the higher divergence of first introns is related to their larger size. Also, the lower divergence of short introns suggests that they may harbor a relatively greater proportion of regulatory elements than long introns. Moreover, our results are consistent with the presence of functionally relevant sequences near the 5' and 3' ends of introns. Finally, our findings suggest that other parts of introns may also be under selective constraints. PMID:17309804
Lexer, C; Wüest, R O; Mangili, S; Heuertz, M; Stölting, K N; Pearman, P B; Forest, F; Salamin, N; Zimmermann, N E; Bossolini, E
2014-09-01
Understanding the drivers of population divergence, speciation and species persistence is of great interest to molecular ecology, especially for species-rich radiations inhabiting the world's biodiversity hotspots. The toolbox of population genomics holds great promise for addressing these key issues, especially if genomic data are analysed within a spatially and ecologically explicit context. We have studied the earliest stages of the divergence continuum in the Restionaceae, a species-rich and ecologically important plant family of the Cape Floristic Region (CFR) of South Africa, using the widespread CFR endemic Restio capensis (L.) H.P. Linder & C.R. Hardy as an example. We studied diverging populations of this morphotaxon for plastid DNA sequences and >14 400 nuclear DNA polymorphisms from Restriction site Associated DNA (RAD) sequencing and analysed the results jointly with spatial, climatic and phytogeographic data, using a Bayesian generalized linear mixed modelling (GLMM) approach. The results indicate that population divergence across the extreme environmental mosaic of the CFR is mostly driven by isolation by environment (IBE) rather than isolation by distance (IBD) for both neutral and non-neutral markers, consistent with genome hitchhiking or coupling effects during early stages of divergence. Mixed modelling of plastid DNA and single divergent outlier loci from a Bayesian genome scan confirmed the predominant role of climate and pointed to additional drivers of divergence, such as drift and ecological agents of selection captured by phytogeographic zones. Our study demonstrates the usefulness of population genomics for disentangling the effects of IBD and IBE along the divergence continuum often found in species radiations across heterogeneous ecological landscapes. © 2014 John Wiley & Sons Ltd.
2014-01-01
Background A common assumption of microorganisms is that laboratory stocks will remain genetically and phenotypically constant over time, and across laboratories. It is becoming increasingly clear, however, that mutations can ruin strain integrity and drive the divergence or “domestication” of stocks. Since its discovery in 1960, a stock of Methylobacterium extorquens AM1 (“AM1”) has remained in the lab, propagated across numerous growth and storage conditions, researchers, and facilities. To explore the extent to which this lineage has diverged, we compared our own “Modern” stock of AM1 to a sample archived at a culture stock center shortly after the strain’s discovery. Stored as a lyophilized sample, we hypothesized that this Archival strain would better reflect the first-ever isolate of AM1 and reveal ways in which our Modern stock has changed through laboratory domestication or other means. Results Using whole-genome re-sequencing, we identified some 29 mutations – including single nucleotide polymorphisms, small indels, the insertion of mobile elements, and the loss of roughly 36 kb of DNA - that arose in the laboratory-maintained Modern lineage. Contrary to our expectations, Modern was both slower and less fit than Archival across a variety of growth substrates, and showed no improvement during long-term growth and storage. Modern did, however, outperform Archival during growth on nutrient broth, and in resistance to rifamycin, which was selected for by researchers in the 1980s. Recapitulating selection for rifamycin resistance in replicate Archival populations showed that mutations to RNA polymerase B (rpoB) substantially decrease growth in the absence of antibiotic, offering an explanation for slower growth in Modern stocks. Given the large number of genomic changes arising from domestication (28), it is somewhat surprising that the single other mutation attributed to purposeful laboratory selection accounts for much of the phenotypic divergence between strains. Conclusions These results highlight the surprising degree to which AM1 has diverged through a combination of unintended laboratory domestication and purposeful selection for rifamycin resistance. Instances of strain divergence are important, not only to ensure consistency of experimental results, but also to explore how microbes in the lab diverge from one another and from their wild counterparts. PMID:24384040
Baurens, Franc-Christophe; Bocs, Stéphanie; Rouard, Mathieu; Matsumoto, Takashi; Miller, Robert N G; Rodier-Goud, Marguerite; MBéguié-A-MBéguié, Didier; Yahiaoui, Nabila
2010-07-16
Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M. balbisiana haplotypes. A large RGA08 gene cluster identified in wild banana corresponds to a highly variable genomic region between haplotypes surrounded by conserved flanking regions. High level of sequence identity (70 to 99%) of the genic and intergenic regions suggests a recent and rapid evolution of this cluster in M. balbisiana.
Singhal, Dinesh K; Singhal, Raxita; Malik, Hruda N; Kumar, Surender; Kumar, Sudarshan; Mohanty, Ashok K; Kaushik, Jai K; Malakar, Dhruba
2014-01-01
Nanog is a homeodomain containing protein which plays important roles in regulation of signaling pathways for maintenance and induction of pluripotency in stem cells. Because of its unique expression in stem cells it is also regarded as pluripotency marker. In this study goat Nanog (gNanog) gene has been amplified, cloned and characterized at sequence level with successful over-expression in CHO-K1 cell line using a lentiviral based system. gNanog ORF is 903 bp long which codes for Nanog protein of size 300 amino acids (aas). Complete nucleotide sequence shows some evolutionary mutation in goat in comparision to other species. Protein sequence of goat is highly similar to other species. Overall, gNanog nucleotide sequence and predicted protein sequence showed high similarity and minimum divergence with cattle (96 % identity/4 % divergence) and buffalo (94/5 %) while low similarity and high divergence with pig (84/15 %), human (81/23 %) and mouse (69/40 %) indicating evolutionary closeness of gNanog to cattle and buffalo. gNanog lentiviral expression construct was prepared for over-expression of Nanog gene in adult goat fibroblast cells. Lentiviral expression construct of Nanog enabled continuous protein expression for induction and maintenance of pluripotency. Western blotting revealed the expression of Nanog gene at protein level which supported that the lentiviral expression system is highly promising for Nanog protein expression in differentiated goat cell.
Shahin, Arwa; Smulders, Marinus J. M.; van Tuyl, Jaap M.; Arens, Paul; Bakker, Freek T.
2014-01-01
Next Generation Sequencing (NGS) may enable estimating relationships among genotypes using allelic variation of multiple nuclear genes simultaneously. We explored the potential and caveats of this strategy in four genetically distant Lilium cultivars to estimate their genetic divergence from transcriptome sequences using three approaches: POFAD (Phylogeny of Organisms from Allelic Data, uses allelic information of sequence data), RAxML (Randomized Accelerated Maximum Likelihood, tree building based on concatenated consensus sequences) and Consensus Network (constructing a network summarizing among gene tree conflicts). Twenty six gene contigs were chosen based on the presence of orthologous sequences in all cultivars, seven of which also had an orthologous sequence in Tulipa, used as out-group. The three approaches generated the same topology. Although the resolution offered by these approaches is high, in this case there was no extra benefit in using allelic information. We conclude that these 26 genes can be widely applied to construct a species tree for the genus Lilium. PMID:25368628
Novel genotype of Ehrlichia canis detected in samples of human blood bank donors in Costa Rica.
Bouza-Mora, Laura; Dolz, Gaby; Solórzano-Morales, Antony; Romero-Zuñiga, Juan José; Salazar-Sánchez, Lizbeth; Labruna, Marcelo B; Aguiar, Daniel M
2017-01-01
This study focuses on the detection and identification of DNA and antibodies to Ehrlichia spp. in samples of blood bank donors in Costa Rica using molecular and serological techniques. Presence of Ehrlichia canis was determined in 10 (3.6%) out of 280 blood samples using polymerase chain reaction (PCR) targeting the ehrlichial dsb conserved gene. Analysis of the ehrlichial trp36 polymorphic gene in these 10 samples revealed substantial polymorphism among the E. canis genotypes, including divergent tandem repeat sequences. Nucleotide sequences of dsb and trp36 amplicons revealed a novel genotype of E. canis in blood bank donors from Costa Rica. Indirect immunofluorescence assay (IFA) detected antibodies in 35 (35%) of 100 serum samples evaluated. Thirty samples showed low endpoint titers (64-256) to E. canis, whereas five sera yielded high endpoint titers (1024-8192); these five samples were also E. canis-PCR positive. These findings represent the first report of the presence of E. canis in humans in Central America. Copyright © 2016 Elsevier GmbH. All rights reserved.
Specification of anteroposterior cell fates in Caenorhabditis elegans by Drosophila Hox proteins.
Hunter, C P; Kenyon, C
1995-09-21
Antennapedia class homeobox (Hox) genes specify cell fates in successive anteroposterior body domains in vertebrates, insects and nematodes. The DNA-binding homeodomain sequences are very similar between vertebrate and Drosophila Hox proteins, and this similarity allows vertebrate Hox proteins to function in Drosophila. In contrast, the Caenorhabditis elegans homeodomains are substantially divergent. Further, C. elegans differs from both insects and vertebrates in having a non-segmented body as well as a distinctive mode of development that involves asymmetric early cleavages and invariant cell lineages. Here we report that, despite these differences, Drosophila Hox proteins expressed in C. elegans can substitute for C. elegans Hox proteins in the control of three different cell-fate decisions: the regulation of cell migration, the specification of serotonergic neurons, and the specification of a sensory structure. We also show that the specificity of one C. elegans Hox protein is partly determined by two amino acids that have been implicated in sequence-specific DNA binding. Together these findings suggest that factors important for target recognition by specific Hox proteins have been conserved throughout much of the animal kingdom.
Extensive Concerted Evolution of Rice Paralogs and the Road to Regaining Independence
Wang, Xiyin; Tang, Haibao; Bowers, John E.; Feltus, Frank A.; Paterson, Andrew H.
2007-01-01
Many genes duplicated by whole-genome duplications (WGDs) are more similar to one another than expected. We investigated whether concerted evolution through conversion and crossing over, well-known to affect tandem gene clusters, also affects dispersed paralogs. Genome sequences for two Oryza subspecies reveal appreciable gene conversion in the ∼0.4 MY since their divergence, with a gradual progression toward independent evolution of older paralogs. Since divergence from subspecies indica, ∼8% of japonica paralogs produced 5–7 MYA on chromosomes 11 and 12 have been affected by gene conversion and several reciprocal exchanges of chromosomal segments, while ∼70-MY-old “paleologs” resulting from a genome duplication (GD) show much less conversion. Sequence similarity analysis in proximal gene clusters also suggests more conversion between younger paralogs. About 8% of paleologs may have been converted since rice–sorghum divergence ∼41 MYA. Domain-encoding sequences are more frequently converted than nondomain sequences, suggesting a sort of circularity—that sequences conserved by selection may be further conserved by relatively frequent conversion. The higher level of concerted evolution in the 5–7 MY-old segmental duplication may reflect the behavior of many genomes within the first few million years after duplication or polyploidization. PMID:18039882
A Generalized Least-Squares Estimate for the Origin of Sporophytic Self-Incompatibility
Uyenoyama, M. K.
1995-01-01
Analysis of nucleotide sequences that regulate the expression of self-incompatibility in flowering plants affords a direct means of examining classical hypotheses for the origin and evolution of this major feature of mating systems. Departing from the classical view of monophyly of all forms of self-incompatibility, the current paradigm for the origin of self-incompatibility postulates multiple episodes of recruitment and modification of preexisting genes. In Brassica, the S locus, which regulates sporophytic self-incompatibility, shows homology to a multigene family present both in self-compatible congeners and in groups for which this form of self-incompatibility is atypical. A phylogenetic analysis of S-allele sequences together with homologous sequences that do not cosegregate with self-incompatibility permits dating the change of function that marked the origin of self-incompatibility. A generalized least-squares method is introduced that provides closed-form expressions for estimates and standard errors for function-specific divergence rates and times of divergence among sequences. This analysis suggests that the age of the sporophytic self-incompatibility system expressed in Brassica exceeds species divergence within the genus by four- to fivefold. The extraordinarily high levels of sequence diversity exhibited by S alleles appears to reflect their ancient derivation, with the alternative hypothesis of hypermutability rejected by the analysis. PMID:7713446
LeDuc, Richard G; Robertson, Kelly M; Pitman, Robert L
2008-08-23
Recently, three visually distinct forms of killer whales (Orcinus orca) were described from Antarctic waters and designated as types A, B and C. Based on consistent differences in prey selection and habitat preferences, morphological divergence and apparent lack of interbreeding among these broadly sympatric forms, it was suggested that they may represent separate species. To evaluate this hypothesis, we compared complete sequences of the mitochondrial control region from 81 Antarctic killer whale samples, including 9 type A, 18 type B, 47 type C and 7 type-undetermined individuals. We found three fixed differences that separated type A from B and C, and a single fixed difference that separated type C from A and B. These results are consistent with reproductive isolation among the different forms, although caution is needed in drawing further conclusions. Despite dramatic differences in morphology and ecology, the relatively low levels of sequence divergence in Antarctic killer whales indicate that these evolutionary changes occurred relatively rapidly and recently.
Özdemir, Ebru; Altındağ, Ahmet; Kandemir, İrfan
2017-05-01
Daphnia is a freshwater zooplankton species with controversial taxonomy due to its high morphological variation linked to environmental factors and inter-specific hybridization and polyploidy in some groups. The aim of the present study is to examine molecular diversity of some Daphnia species in Turkey and to establish DNA barcodes of Turkish Daphnia species. Sequence analysis was performed using 540 bp region of cytochrome oxidase subunit I gene of mitochondrial DNA. A total of 34 haplotypes have been identified for Turkey. Daphnia pulex complex was divided into two clades with 16.1% sequence divergence according to molecular taxonomy based on Kimura 2-parameter. The clade which was molecularly diverged from Daphnia pulex with 16.1% sequence divergence was found to show 99% similarity with Daphnia cf. pulicaria (sensu Alonso 1996) instead of Daphnia pulicaria Forbes, 1893. Furthermore, this study has contributed to Turkish zoogeography by demonstrating the distribution of Daphnia species in Turkey.
Archaebacterial rhodopsin sequences: Implications for evolution
NASA Technical Reports Server (NTRS)
Lanyi, J. K.
1991-01-01
It was proposed over 10 years ago that the archaebacteria represent a separate kingdom which diverged very early from the eubacteria and eukaryotes. It follows that investigations of archaebacterial characteristics might reveal features of early evolution. So far, two genes, one for bacteriorhodopsin and another for halorhodopsin, both from Halobacterium halobium, have been sequenced. We cloned and sequenced the gene coding for the polypeptide of another one of these rhodopsins, a halorhodopsin in Natronobacterium pharaonis. Peptide sequencing of cyanogen bromide fragments, and immuno-reactions of the protein and synthetic peptides derived from the C-terminal gene sequence, confirmed that the open reading frame was the structural gene for the pharaonis halorhodopsin polypeptide. The flanking DNA sequences of this gene, as well as those of other bacterial rhodopsins, were compared to previously proposed archaebacterial consensus sequences. In pairwise comparisons of the open reading frame with DNA sequences for bacterio-opsin and halo-opsin from Halobacterium halobium, silent divergences were calculated. These indicate very considerable evolutionary distance between each pair of genes, even in the dame organism. In spite of this, three protein sequences show extensive similarities, indicating strong selective pressures.
Mohien, Ceereena Ubaida; Colquhoun, David R.; Mathias, Derrick K.; Gibbons, John G.; Armistead, Jennifer S.; Rodriguez, Maria C.; Rodriguez, Mario Henry; Edwards, Nathan J.; Hartler, Jürgen; Thallinger, Gerhard G.; Graham, David R.; Martinez-Barnetche, Jesus; Rokas, Antonis; Dinglasan, Rhoel R.
2013-01-01
Malaria morbidity and mortality caused by both Plasmodium falciparum and Plasmodium vivax extend well beyond the African continent, and although P. vivax causes between 80 and 300 million severe cases each year, vivax transmission remains poorly understood. Plasmodium parasites are transmitted by Anopheles mosquitoes, and the critical site of interaction between parasite and host is at the mosquito's luminal midgut brush border. Although the genome of the “model” African P. falciparum vector, Anopheles gambiae, has been sequenced, evolutionary divergence limits its utility as a reference across anophelines, especially non-sequenced P. vivax vectors such as Anopheles albimanus. Clearly, technologies and platforms that bridge this substantial scientific gap are required in order to provide public health scientists with key transcriptomic and proteomic information that could spur the development of novel interventions to combat this disease. To our knowledge, no approaches have been published that address this issue. To bolster our understanding of P. vivax–An. albimanus midgut interactions, we developed an integrated bioinformatic-hybrid RNA-Seq-LC-MS/MS approach involving An. albimanus transcriptome (15,764 contigs) and luminal midgut subproteome (9,445 proteins) assembly, which, when used with our custom Diptera protein database (685,078 sequences), facilitated a comparative proteomic analysis of the midgut brush borders of two important malaria vectors, An. gambiae and An. albimanus. PMID:23082028
Ubaida Mohien, Ceereena; Colquhoun, David R; Mathias, Derrick K; Gibbons, John G; Armistead, Jennifer S; Rodriguez, Maria C; Rodriguez, Mario Henry; Edwards, Nathan J; Hartler, Jürgen; Thallinger, Gerhard G; Graham, David R; Martinez-Barnetche, Jesus; Rokas, Antonis; Dinglasan, Rhoel R
2013-01-01
Malaria morbidity and mortality caused by both Plasmodium falciparum and Plasmodium vivax extend well beyond the African continent, and although P. vivax causes between 80 and 300 million severe cases each year, vivax transmission remains poorly understood. Plasmodium parasites are transmitted by Anopheles mosquitoes, and the critical site of interaction between parasite and host is at the mosquito's luminal midgut brush border. Although the genome of the "model" African P. falciparum vector, Anopheles gambiae, has been sequenced, evolutionary divergence limits its utility as a reference across anophelines, especially non-sequenced P. vivax vectors such as Anopheles albimanus. Clearly, technologies and platforms that bridge this substantial scientific gap are required in order to provide public health scientists with key transcriptomic and proteomic information that could spur the development of novel interventions to combat this disease. To our knowledge, no approaches have been published that address this issue. To bolster our understanding of P. vivax-An. albimanus midgut interactions, we developed an integrated bioinformatic-hybrid RNA-Seq-LC-MS/MS approach involving An. albimanus transcriptome (15,764 contigs) and luminal midgut subproteome (9,445 proteins) assembly, which, when used with our custom Diptera protein database (685,078 sequences), facilitated a comparative proteomic analysis of the midgut brush borders of two important malaria vectors, An. gambiae and An. albimanus.
Divergence with gene flow within the recent chipmunk radiation (Tamias)
Sullivan, J; Demboski, J R; Bell, K C; Hird, S; Sarver, B; Reid, N; Good, J M
2014-01-01
Increasing data have supported the importance of divergence with gene flow (DGF) in the generation of biological diversity. In such cases, lineage divergence occurs on a shorter timescale than does the completion of reproductive isolation. Although it is critical to explore the mechanisms driving divergence and preventing homogenization by hybridization, it is equally important to document cases of DGF in nature. Here we synthesize data that have accumulated over the last dozen or so years on DGF in the chipmunk (Tamias) radiation with new data that quantify very high rates of mitochondrial DNA (mtDNA) introgression among para- and sympatric species in the T. quadrivittatus group in the central and southern Rocky Mountains. These new data (188 cytochrome b sequences) bring the total number of sequences up to 1871; roughly 16% (298) of the chipmunks we have sequenced exhibit introgressed mtDNA. This includes ongoing introgression between subspecies and between both closely related and distantly related taxa. In addition, we have identified several taxa that are apparently fixed for ancient introgressions and in which there is no evidence of ongoing introgression. A recurrent observation is that these introgressions occur between ecologically and morphologically diverged, sometimes non-sister taxa that engage in well-documented niche partitioning. Thus, the chipmunk radiation in western North America represents an excellent mammalian example of speciation in the face of recurrent gene flow among lineages and where biogeography, habitat differentiation and mating systems suggest important roles for both ecological and sexual selection. PMID:24781803
Mitochondrial divergence between slow- and fast-aging garter snakes.
Schwartz, Tonia S; Arendsee, Zebulun W; Bronikowski, Anne M
2015-11-01
Mitochondrial function has long been hypothesized to be intimately involved in aging processes--either directly through declining efficiency of mitochondrial respiration and ATP production with advancing age, or indirectly, e.g., through increased mitochondrial production of damaging free radicals with age. Yet we lack a comprehensive understanding of the evolution of mitochondrial genotypes and phenotypes across diverse animal models, particularly in species that have extremely labile physiology. Here, we measure mitochondrial genome-types and transcription in ecotypes of garter snakes (Thamnophis elegans) that are adapted to disparate habitats and have diverged in aging rates and lifespans despite residing in close proximity. Using two RNA-seq datasets, we (1) reconstruct the garter snake mitochondrial genome sequence and bioinformatically identify regulatory elements, (2) test for divergence of mitochondrial gene expression between the ecotypes and in response to heat stress, and (3) test for sequence divergence in mitochondrial protein-coding regions in these slow-aging (SA) and fast-aging (FA) naturally occurring ecotypes. At the nucleotide sequence level, we confirmed two (duplicated) mitochondrial control regions one of which contains a glucocorticoid response element (GRE). Gene expression of protein-coding genes was higher in FA snakes relative to SA snakes for most genes, but was neither affected by heat stress nor an interaction between heat stress and ecotype. SA and FA ecotypes had unique mitochondrial haplotypes with amino acid substitutions in both CYTB and ND5. The CYTB amino acid change (Isoleucine → Threonine) was highly segregated between ecotypes. This divergence of mitochondrial haplotypes between SA and FA snakes contrasts with nuclear gene-flow estimates, but correlates with previously reported divergence in mitochondrial function (mitochondrial oxygen consumption, ATP production, and reactive oxygen species consequences). Copyright © 2015 Elsevier Inc. All rights reserved.
Chromosomal Speciation in the Genomics Era: Disentangling Phylogenetic Evolution of Rock-wallabies.
Potter, Sally; Bragg, Jason G; Blom, Mozes P K; Deakin, Janine E; Kirkpatrick, Mark; Eldridge, Mark D B; Moritz, Craig
2017-01-01
The association of chromosome rearrangements (CRs) with speciation is well established, and there is a long history of theory and evidence relating to "chromosomal speciation." Genomic sequencing has the potential to provide new insights into how reorganization of genome structure promotes divergence, and in model systems has demonstrated reduced gene flow in rearranged segments. However, there are limits to what we can understand from a small number of model systems, which each only tell us about one episode of chromosomal speciation. Progressing from patterns of association between chromosome (and genic) change, to understanding processes of speciation requires both comparative studies across diverse systems and integration of genome-scale sequence comparisons with other lines of evidence. Here, we showcase a promising example of chromosomal speciation in a non-model organism, the endemic Australian marsupial genus Petrogale . We present initial phylogenetic results from exon-capture that resolve a history of divergence associated with extensive and repeated CRs. Yet it remains challenging to disentangle gene tree heterogeneity caused by recent divergence and gene flow in this and other such recent radiations. We outline a way forward for better integration of comparative genomic sequence data with evidence from molecular cytogenetics, and analyses of shifts in the recombination landscape and potential disruption of meiotic segregation and epigenetic programming. In all likelihood, CRs impact multiple cellular processes and these effects need to be considered together, along with effects of genic divergence. Understanding the effects of CRs together with genic divergence will require development of more integrative theory and inference methods. Together, new data and analysis tools will combine to shed light on long standing questions of how chromosome and genic divergence promote speciation.
Kim, Young Kyun; Kim, Seung Hyeon; Yi, Joo Mi; Kang, Chang-Keun; Short, Frederick; Lee, Kun-Seop
2017-01-01
Although seagrass species in the genus Halophila are generally distributed in tropical or subtropical regions, H. nipponica has been reported to occur in temperate coastal waters of the northwestern Pacific. Because H. nipponica occurs only in the warm temperate areas influenced by the Kuroshio Current and shows a tropical seasonal growth pattern, such as severely restricted growth in low water temperatures, it was hypothesized that this temperate Halophila species diverged from tropical species in the relatively recent evolutionary past. We used a phylogenetic analysis of internal transcribed spacer (ITS) regions to examine the genetic variability and evolutionary trend of H. nipponica. ITS sequences of H. nipponica from various locations in Korea and Japan were identical or showed very low sequence divergence (less than 3-base pair, bp, difference), confirming that H. nipponica from Japan and Korea are the same species. Halophila species in the section Halophila, which have simple phyllotaxy (a pair of petiolate leaves at the rhizome node), were separated into five well-supported clades by maximum parsimony analysis. H. nipponica grouped with H. okinawensis and H. gaudichaudii from the subtropical regions in the same clade, the latter two species having quite low ITS sequence divergence from H. nipponica (7-15-bp). H. nipponica in Clade I diverged 2.95 ± 1.08 million years ago from species in Clade II, which includes H. ovalis. According to geographical distribution and genetic similarity, H. nipponica appears to have diverged from a tropical species like H. ovalis and adapted to warm temperate environments. The results of divergence time estimates suggest that the temperate H. nipponica is an older species than the subtropical H. okinawensis and H. gaudichaudii and they may have different evolutionary histories.
Kim, Young Kyun; Kim, Seung Hyeon; Yi, Joo Mi; Kang, Chang-Keun; Short, Frederick; Lee, Kun-Seop
2017-01-01
Although seagrass species in the genus Halophila are generally distributed in tropical or subtropical regions, H. nipponica has been reported to occur in temperate coastal waters of the northwestern Pacific. Because H. nipponica occurs only in the warm temperate areas influenced by the Kuroshio Current and shows a tropical seasonal growth pattern, such as severely restricted growth in low water temperatures, it was hypothesized that this temperate Halophila species diverged from tropical species in the relatively recent evolutionary past. We used a phylogenetic analysis of internal transcribed spacer (ITS) regions to examine the genetic variability and evolutionary trend of H. nipponica. ITS sequences of H. nipponica from various locations in Korea and Japan were identical or showed very low sequence divergence (less than 3-base pair, bp, difference), confirming that H. nipponica from Japan and Korea are the same species. Halophila species in the section Halophila, which have simple phyllotaxy (a pair of petiolate leaves at the rhizome node), were separated into five well-supported clades by maximum parsimony analysis. H. nipponica grouped with H. okinawensis and H. gaudichaudii from the subtropical regions in the same clade, the latter two species having quite low ITS sequence divergence from H. nipponica (7–15-bp). H. nipponica in Clade I diverged 2.95 ± 1.08 million years ago from species in Clade II, which includes H. ovalis. According to geographical distribution and genetic similarity, H. nipponica appears to have diverged from a tropical species like H. ovalis and adapted to warm temperate environments. The results of divergence time estimates suggest that the temperate H. nipponica is an older species than the subtropical H. okinawensis and H. gaudichaudii and they may have different evolutionary histories. PMID:28505209
Yang, Zujun; Zhang, Tao; Bolshoy, Alexander; Beharav, Alexander; Nevo, Eviatar
2009-05-01
'Evolution Canyon' (ECI) at Lower Nahal Oren, Mount Carmel, Israel, is an optimal natural microscale model for unravelling evolution in action highlighting the twin evolutionary processes of adaptation and speciation. A major model organism in ECI is wild barley, Hordeum spontaneum, the progenitor of cultivated barley, which displays dramatic interslope adaptive and speciational divergence on the 'African' dry slope (AS) and the 'European' humid slope (ES), separated on average by 200 m. Here we examined interslope single nucleotide polymorphism (SNP) sequences and the expression diversity of the drought resistant dehydrin 1 gene (Dhn1) between the opposite slopes. We analysed 47 plants (genotypes), 4-10 individuals in each of seven stations (populations) in an area of 7000 m(2), for Dhn1 sequence diversity located in the 5' upstream flanking region of the gene. We found significant levels of Dhn1 genic diversity represented by 29 haplotypes, derived from 45 SNPs in a total of 708 bp sites. Most of the haplotypes, 25 out of 29 (= 86.2%), were represented by one genotype; hence, unique to one population. Only a single haplotype was common to both slopes. Genetic divergence of sequence and haplotype diversity was generally and significantly different among the populations and slopes. Nucleotide diversity was higher on the AS, whereas haplotype diversity was higher on the ES. Interslope divergence was significantly higher than intraslope divergence. The applied Tajima D rejected neutrality of the SNP diversity. The Dhn1 expression under dehydration indicated interslope divergent expression between AS and ES genotypes, reinforcing Dhn1 associated with drought resistance of wild barley at 'Evolution Canyon'. These results are inexplicable by mutation, gene flow, or chance effects, and support adaptive natural microclimatic selection as the major evolutionary divergent driving force.
Singh, Prashant; Singh, Satya Shila; Elster, Josef; Mishra, Arun Kumar
2013-06-01
In order to assess phylogeny, population genetics, and approximation of future course of cyanobacterial evolution based on nifH gene sequences, 41 heterocystous cyanobacterial strains collected from all over India have been used in the present study. NifH gene sequence analysis data confirm that the heterocystous cyanobacteria are monophyletic while the stigonematales show polyphyletic origin with grave intermixing. Further, analysis of nifH gene sequence data using intricate mathematical extrapolations revealed that the nucleotide diversity and recombination frequency is much greater in Nostocales than the Stigonematales. Similarly, DNA divergence studies showed significant values of divergence with greater gene conversion tracts in the unbranched (Nostocales) than the branched (Stigonematales) strains. Our data strongly support the origin of true branching cyanobacterial strains from the unbranched strains.
USDA-ARS?s Scientific Manuscript database
The complete nucleotide sequence of a recently discovered Florida (FL) isolate of Hibiscus infecting Cilevirus (HiCV) was determined by Sanger sequencing. The movement- and coat- protein gene sequences of the HiCV-FL isolate are more divergent than other genes of the previously sequenced HiCV-HA (Ha...
Reeves, R H; O'Brien, S J
1984-01-01
RD-114 is a replication-competent, xenotropic retrovirus which is homologous to a family of moderately repetitive DNA sequences present at ca. 20 copies in the normal cellular genome of domestic cats. To examine the extent and character of genomic divergence of the RD-114 gene family as well as to assess their positional association within the cat genome, we have prepared a series of molecular clones of endogenous RD-114 DNA segments from a genomic library of cat cellular DNA. Their restriction endonuclease maps were compared with each other as well as to that of the prototype-inducible RD-114 which was molecularly cloned from a chronically infected human cell line. The endogenous sequences analyzed were similar to each other in that they were colinear with RD-114 proviral DNA, were bounded by long terminal redundancies, and conserved many restriction sites in the gag and pol regions. However, the env regions of many of the sequences examined were substantially deleted. Several of the endogenous RD-114 genomes contained a novel envelope sequence which was unrelated to the env gene of the prototype RD-114 env gene but which, like RD-114 and endogenous feline leukemia virus provirus, was found only in species of the genus Felis, and not in other closely related Felidae genera. The endogenous RD-114 sequences each had a distinct cellular flank which indicates that these sequences are not tandem but dispersed nonspecifically throughout the genome. Southern analysis of cat cellular DNA confirmed the conclusions about conserved restriction sites in endogenous sequences and indicated that a single locus may be responsible for the production of the major inducible form of RD-114. Images PMID:6090693
Miniprimer PCR, a New Lens for Viewing the Microbial World▿ †
Isenbarger, Thomas A.; Finney, Michael; Ríos-Velázquez, Carlos; Handelsman, Jo; Ruvkun, Gary
2008-01-01
Molecular methods based on the 16S rRNA gene sequence are used widely in microbial ecology to reveal the diversity of microbial populations in environmental samples. Here we show that a new PCR method using an engineered polymerase and 10-nucleotide “miniprimers” expands the scope of detectable sequences beyond those detected by standard methods using longer primers and Taq polymerase. After testing the method in silico to identify divergent ribosomal genes in previously cloned environmental sequences, we applied the method to soil and microbial mat samples, which revealed novel 16S rRNA gene sequences that would not have been detected with standard primers. Deeply divergent sequences were discovered with high frequency and included representatives that define two new division-level taxa, designated CR1 and CR2, suggesting that miniprimer PCR may reveal new dimensions of microbial diversity. PMID:18083877
Candida ficus sp. nov., a novel yeast species from the gut of Apriona germari larvae.
Hui, Feng-Li; Niu, Qiu-Hong; Ke, Tao; Liu, Zheng
2012-11-01
A novel yeast species is described based on three strains from the gut of wood-boring larvae collected in a tree trunk of Ficus carica cultivated in parks near Nanyang, central China. Phylogenetic analysis based on sequences of the D1/D2 domains of the large subunit rRNA gene showed that these strains occurred in a separate clade that was genetically distinct from all known ascomycetous yeasts. In terms of pairwise sequence divergence, the novel strains differed by 15.3% divergence from the type strain of Pichia terricola, and by 15.8% divergence from the type strains of Pichia exigua and Candida rugopelliculosa in the D1/D2 domains. All three are ascomycetous yeasts in the Pichia clade. Unlike P. terricola, P. exigua and C. rugopelliculosa, the novel isolates did not ferment glucose. The name Candida ficus sp. nov. is proposed to accommodate these highly divergent organisms, with STN-8(T) (=CICC 1980(T)=CBS 12638(T)) as the type strain.
Single sample resolution of rare microbial dark matter in a marine invertebrate metagenome
DOE Office of Scientific and Technical Information (OSTI.GOV)
Miller, Ian J.; Weyna, Theodore R.; Fong, Stephen S.
Direct, untargeted sequencing of environmental samples (metagenomics) and de novo genome assembly enable the study of uncultured and phylogenetically divergent organisms. However, separating individual genomes from a mixed community has often relied on the differential-coverage analysis of multiple, deeply sequenced samples. In the metagenomic investigation of the marine bryozoan Bugula neritina, we uncovered seven bacterial genomes associated with a single B. neritina individual that appeared to be transient associates, two of which were unique to one individual and undetectable using certain “universal” 16S rRNA primers and probes. We recovered high quality genome assemblies for several rare instances of “microbial darkmore » matter,” or phylogenetically divergent bacteria lacking genomes in reference databases, from a single tissue sample that was not subjected to any physical or chemical pre-treatment. One of these rare, divergent organisms has a small (593 kbp), poorly annotated genome with low GC content (20.9%) and a 16S rRNA gene with just 65% sequence similarity to the closest reference sequence. Lastly, our findings illustrate the importance of sampling strategy and de novo assembly of metagenomic reads to understand the extent and function of bacterial biodiversity.« less
Single sample resolution of rare microbial dark matter in a marine invertebrate metagenome
Miller, Ian J.; Weyna, Theodore R.; Fong, Stephen S.; ...
2016-09-29
Direct, untargeted sequencing of environmental samples (metagenomics) and de novo genome assembly enable the study of uncultured and phylogenetically divergent organisms. However, separating individual genomes from a mixed community has often relied on the differential-coverage analysis of multiple, deeply sequenced samples. In the metagenomic investigation of the marine bryozoan Bugula neritina, we uncovered seven bacterial genomes associated with a single B. neritina individual that appeared to be transient associates, two of which were unique to one individual and undetectable using certain “universal” 16S rRNA primers and probes. We recovered high quality genome assemblies for several rare instances of “microbial darkmore » matter,” or phylogenetically divergent bacteria lacking genomes in reference databases, from a single tissue sample that was not subjected to any physical or chemical pre-treatment. One of these rare, divergent organisms has a small (593 kbp), poorly annotated genome with low GC content (20.9%) and a 16S rRNA gene with just 65% sequence similarity to the closest reference sequence. Lastly, our findings illustrate the importance of sampling strategy and de novo assembly of metagenomic reads to understand the extent and function of bacterial biodiversity.« less
Poomtien, Jamroonsri; Jindamorakot, Sasitorn; Limtong, Savitree; Pinphanichakarn, Pairoh; Thaniyavarn, Jiraporn
2013-01-01
Three yeast strains were isolated from industrial wastes in Thailand. Based on the phylogenetic sequence analysis of the D1/D2 region of the large subunit rRNA gene, the internal transcribed spacer (ITS1-5.8S rRNA gene-ITS2; ITS1-2) region, and their physiological characteristics, the three strains were found to represent two novel species of the ascomycetous anamorphic yeast. Strain JP52(T) represent a novel species which was named Cyberlindnera samutprakarnensis sp. nov. (type strain JP52(T); = BCC 46825(T) = JCM 17816(T) = CBS 12528(T), MycoBank no. MB800879), which was differentiated from the closely related species Cyberlindnera mengyuniae CBS 10845(T) by 2.9 % sequence divergence in the D1/D2 region and 4.4 % sequence divergence in the ITS1-2. Strain JP59(T) and JP60 were identical in their D1/D2 and ITS1-2 regions, which were closely related to those of Scheffersomyces spartinae CBS 6059(T) by 0.9 and 1.0 % sequence divergence, respectively. In addition, supportive evidence of actin gene and translational elongation factor gene by sequence divergence of 6.5 % each confirmed their distinct status. Furthermore, JP59(T) and JP60 differentiated from the closely related species in some biochemical and physiological characteristics. These two strains were assigned as a single novel species which was named Candida thasaenensis sp. nov. (type JP59(T) = BCC 46828(T) = JCM 17817(T) = CBS 12529(T), MycoBank no. MB800880).
Cheng, Ji-Hong; Liu, Wen-Chun; Chang, Ting-Tsung; Hsieh, Sun-Yuan; Tseng, Vincent S
2017-10-01
Many studies have suggested that deletions of Hepatitis B Viral (HBV) are associated with the development of progressive liver diseases, even ultimately resulting in hepatocellular carcinoma (HCC). Among the methods for detecting deletions from next-generation sequencing (NGS) data, few methods considered the characteristics of virus, such as high evolution rates and high divergence among the different HBV genomes. Sequencing high divergence HBV genome sequences using the NGS technology outputs millions of reads. Thus, detecting exact breakpoints of deletions from these big and complex data incurs very high computational cost. We proposed a novel analytical method named VirDelect (Virus Deletion Detect), which uses split read alignment base to detect exact breakpoint and diversity variable to consider high divergence in single-end reads data, such that the computational cost can be reduced without losing accuracy. We use four simulated reads datasets and two real pair-end reads datasets of HBV genome sequence to verify VirDelect accuracy by score functions. The experimental results show that VirDelect outperforms the state-of-the-art method Pindel in terms of accuracy score for all simulated datasets and VirDelect had only two base errors even in real datasets. VirDelect is also shown to deliver high accuracy in analyzing the single-end read data as well as pair-end data. VirDelect can serve as an effective and efficient bioinformatics tool for physiologists with high accuracy and efficient performance and applicable to further analysis with characteristics similar to HBV on genome length and high divergence. The software program of VirDelect can be downloaded at https://sourceforge.net/projects/virdelect/. Copyright © 2017. Published by Elsevier Inc.
Guillet-Claude, Carine; Isabel, Nathalie; Pelgas, Betty; Bousquet, Jean
2004-12-01
Class I knox genes code for transcription factors that play an essential role in plant growth and development as central regulators of meristem cell identity. Based on the analysis of new cDNA sequences from various tissues and genomic DNA sequences, we identified a highly diversified group of class I knox genes in conifers. Phylogenetic analyses of complete amino acid sequences from various seed plants indicated that all conifer sequences formed a monophyletic group. Within conifers, four subgroups here named genes KN1 to KN4 were well delineated, each regrouping pine and spruce sequences. KN4 was sister group to KN3, which was sister group to KN1 and KN2. Genetic mapping on the genomes of two divergent Picea species indicated that KN1 and KN2 are located close to each other on the same linkage group, whereas KN3 and KN4 mapped on different linkage groups, correlating the more ancient divergence of these two genes. The proportion of synonymous and nonsynonymous substitutions suggested intense purifying selection for the four genes. However, rates of substitution per year indicated an evolution in two steps: faster rates were noted after gene duplications, followed subsequently by lower rates. Positive directional selection was detected for most of the internal branches harboring an accelerated rate of evolution. In addition, many sites with highly significant amino acid rate shift were identified between these branches. However, the tightly linked KN1 and KN2 did not diverge as much from each other. The implications of the correlation between phylogenetic, structural, and functional information are discussed in relation to the diversification of the knox-I gene family in conifers.
Low X/Y divergence in four pairs of papaya sex-linked genes.
Yu, Qingyi; Hou, Shaobin; Feltus, F Alex; Jones, Meghan R; Murray, Jan E; Veatch, Olivia; Lemke, Cornelia; Saw, Jimmy H; Moore, Richard C; Thimmapuram, Jyothi; Liu, Lei; Moore, Paul H; Alam, Maqsudul; Jiang, Jiming; Paterson, Andrew H; Ming, Ray
2008-01-01
Sex chromosomes in flowering plants, in contrast to those in animals, evolved relatively recently and only a few are heteromorphic. The homomorphic sex chromosomes of papaya show features of incipient sex chromosome evolution. We investigated the features of paired X- and Y-specific bacterial artificial chromosomes (BACs), and estimated the time of divergence in four pairs of sex-linked genes. We report the results of a comparative analysis of long contiguous genomic DNA sequences between the X and hermaphrodite Y (Y(h)) chromosomes. Numerous chromosomal rearrangements were detected in the male-specific region of the Y chromosome (MSY), including inversions, deletions, insertions, duplications and translocations, showing the dynamic evolutionary process on the MSY after recombination ceased. DNA sequence expansion was documented in the two regions of the MSY, demonstrating that the cytologically homomorphic sex chromosomes are heteromorphic at the molecular level. Analysis of sequence divergence between four X and Y(h) gene pairs resulted in a estimated age of divergence of between 0.5 and 2.2 million years, supporting a recent origin of the papaya sex chromosomes. Our findings indicate that sex chromosomes did not evolve at the family level in Caricaceae, and reinforce the theory that sex chromosomes evolve at the species level in some lineages.
Mohandesan, Elmira; Fitak, Robert R; Corander, Jukka; Yadamsuren, Adiya; Chuluunbat, Battsetseg; Abdelhadi, Omer; Raziq, Abdul; Nagy, Peter; Stalder, Gabrielle; Walzer, Chris; Faye, Bernard; Burger, Pamela A
2017-08-30
The genus Camelus is an interesting model to study adaptive evolution in the mitochondrial genome, as the three extant Old World camel species inhabit hot and low-altitude as well as cold and high-altitude deserts. We sequenced 24 camel mitogenomes and combined them with three previously published sequences to study the role of natural selection under different environmental pressure, and to advance our understanding of the evolutionary history of the genus Camelus. We confirmed the heterogeneity of divergence across different components of the electron transport system. Lineage-specific analysis of mitochondrial protein evolution revealed a significant effect of purifying selection in the concatenated protein-coding genes in domestic Bactrian camels. The estimated dN/dS < 1 in the concatenated protein-coding genes suggested purifying selection as driving force for shaping mitogenome diversity in camels. Additional analyses of the functional divergence in amino acid changes between species-specific lineages indicated fixed substitutions in various genes, with radical effects on the physicochemical properties of the protein products. The evolutionary time estimates revealed a divergence between domestic and wild Bactrian camels around 1.1 [0.58-1.8] million years ago (mya). This has major implications for the conservation and management of the critically endangered wild species, Camelus ferus.
De-Nova, José Arturo; Sánchez-Reyes, Luna L; Eguiarte, Luis E; Magallón, Susana
2018-09-01
Arid biomes are particularly prominent in the Neotropics providing some of its most emblematic landscapes and a substantial part of its species diversity. To understand some of the evolutionary processes underlying the speciation of lineages in the Mexican Deserts, the diversification of Fouquieria is investigated, which includes eleven species, all endemic to the warm deserts and dry subtropical regions of North America. Using a phylogeny from plastid DNA sequences with samples of individuals from populations of all the species recognized in Fouquieria, we estimate divergence times, test for temporal diversification heterogeneity, test for geographical structure, and conduct ancestral area reconstruction. Fouquieria is an ancient lineage that diverged from Polemoniaceae ca. 75.54 Ma. A Mio-Pliocene diversification of Fouquieria with vicariance, associated with Neogene orogenesis underlying the early development of regional deserts is strongly supported. Test for temporal diversification heterogeneity indicates that during its evolutionary history, Fouquieria had a drastic diversification rate shift at ca.12.72 Ma, agreeing with hypotheses that some of the lineages in North American deserts diversified as early as the late Miocene to Pliocene, and not during the Pleistocene. Long-term diversification dynamics analyses suggest that extinction also played a significant role in Fouquieria's evolution, with a very high rate at the onset of the process. From the late Miocene onwards, Fouquieria underwent substantial diversification change, involving high speciation decreasing to the present and negligible extinction, which is congruent with its scant fossil record during this period. Geographic phylogenetic structure and the pattern of most sister species inhabiting different desert nucleus support that isolation by distance could be the main driver of speciation. Copyright © 2018 Elsevier Inc. All rights reserved.
Ninomiya, Masashi; Takahashi, Masaharu; Hoshino, Yu; Ichiyama, Koji; Simmonds, Peter; Okamoto, Hiroaki
2009-02-01
Humans are frequently infected with three anelloviruses which have circular DNA genomes of 3.6-3.9 kb [Torque teno virus (TTV)], 2.8-2.9 kb [Torque teno mini virus (TTMV)] and 3.2 kb [a recently discovered anellovirus named Torque teno midi virus (TTMDV)]. Unexpectedly, human TTMDV DNA was not detectable in any of 74 chimpanzees tested, although all but one tested positive for both human TTV and TTMV DNA. Using universal primers for anelloviruses, novel variants of TTMDV that are phylogenetically clearly separate from human TTMDV were identified from chimpanzees, and over the entire genome, three chimpanzee TTMDV variants differed by 17.9-20.3 % from each other and by 40.4-43.6 % from all 18 reported human TTMDVs. A newly developed PCR assay that uses chimpanzee TTMDV-specific primers revealed the high prevalence of chimpanzee TTMDV in chimpanzees (63/74, 85 %) but low prevalence in humans (1/100). While variants of TTV and TTMV from chimpanzees and humans were phylogenetically interspersed, those of TTMDV were monophyletic for each species, with sequence diversity of <33 and <20 % within the 18 human and three chimpanzee TTMDV variants, respectively. Maximum within-group divergence values for TTV and TTMV were 51 and 57 %, respectively; both of these values were substantially greater than the maximum divergence among TTMDV variants (44 %), consistent with a later evolutionary emergence of TTMDV. However, substantiation of this hypothesis will require further analysis of genetic diversity using an expanded dataset of TTMDV variants in humans and chimpanzees. Similarly, the underlying mechanism of observed infrequent cross-species infection of TTMDV between humans and chimpanzees deserves further analysis.
Jones, Christopher M; Stres, Blaz; Rosenquist, Magnus; Hallin, Sara
2008-09-01
Denitrification is a facultative respiratory pathway in which nitrite (NO2(-)), nitric oxide (NO), and nitrous oxide (N2O) are successively reduced to nitrogen gas (N(2)), effectively closing the nitrogen cycle. The ability to denitrify is widely dispersed among prokaryotes, and this polyphyletic distribution has raised the possibility of horizontal gene transfer (HGT) having a substantial role in the evolution of denitrification. Comparisons of 16S rRNA and denitrification gene phylogenies in recent studies support this possibility; however, these results remain speculative as they are based on visual comparisons of phylogenies from partial sequences. We reanalyzed publicly available nirS, nirK, norB, and nosZ partial sequences using Bayesian and maximum likelihood phylogenetic inference. Concomitant analysis of denitrification genes with 16S rRNA sequences from the same organisms showed substantial differences between the trees, which were supported by examining the posterior probability of monophyletic constraints at different taxonomic levels. Although these differences suggest HGT of denitrification genes, the presence of structural variants for nirK, norB, and nosZ makes it difficult to determine HGT from other evolutionary events. Additional analysis using phylogenetic networks and likelihood ratio tests of phylogenies based on full-length sequences retrieved from genomes also revealed significant differences in tree topologies among denitrification and 16S rRNA gene phylogenies, with the exception of the nosZ gene phylogeny within the data set of the nirK-harboring genomes. However, inspection of codon usage and G + C content plots from complete genomes gave no evidence for recent HGT. Instead, the close proximity of denitrification gene copies in the genomes of several denitrifying bacteria suggests duplication. Although HGT cannot be ruled out as a factor in the evolution of denitrification genes, our analysis suggests that other phenomena, such gene duplication/divergence and lineage sorting, may have differently influenced the evolution of each denitrification gene.
The complete chloroplast genome sequence of the medicinal plant Salvia miltiorrhiza.
Qian, Jun; Song, Jingyuan; Gao, Huanhuan; Zhu, Yingjie; Xu, Jiang; Pang, Xiaohui; Yao, Hui; Sun, Chao; Li, Xian'en; Li, Chuyuan; Liu, Juyan; Xu, Haibin; Chen, Shilin
2013-01-01
Salvia miltiorrhiza is an important medicinal plant with great economic and medicinal value. The complete chloroplast (cp) genome sequence of Salvia miltiorrhiza, the first sequenced member of the Lamiaceae family, is reported here. The genome is 151,328 bp in length and exhibits a typical quadripartite structure of the large (LSC, 82,695 bp) and small (SSC, 17,555 bp) single-copy regions, separated by a pair of inverted repeats (IRs, 25,539 bp). It contains 114 unique genes, including 80 protein-coding genes, 30 tRNAs and four rRNAs. The genome structure, gene order, GC content and codon usage are similar to the typical angiosperm cp genomes. Four forward, three inverted and seven tandem repeats were detected in the Salvia miltiorrhiza cp genome. Simple sequence repeat (SSR) analysis among the 30 asterid cp genomes revealed that most SSRs are AT-rich, which contribute to the overall AT richness of these cp genomes. Additionally, fewer SSRs are distributed in the protein-coding sequences compared to the non-coding regions, indicating an uneven distribution of SSRs within the cp genomes. Entire cp genome comparison of Salvia miltiorrhiza and three other Lamiales cp genomes showed a high degree of sequence similarity and a relatively high divergence of intergenic spacers. Sequence divergence analysis discovered the ten most divergent and ten most conserved genes as well as their length variation, which will be helpful for phylogenetic studies in asterids. Our analysis also supports that both regional and functional constraints affect gene sequence evolution. Further, phylogenetic analysis demonstrated a sister relationship between Salvia miltiorrhiza and Sesamum indicum. The complete cp genome sequence of Salvia miltiorrhiza reported in this paper will facilitate population, phylogenetic and cp genetic engineering studies of this medicinal plant.
Lo, Wen-Sui; Lin, Chan-Pin; Kuo, Chih-Horng
2013-01-01
Phytoplasmas are a group of bacteria that are associated with hundreds of plant diseases. Due to their economical importance and the difficulties involved in the experimental study of these obligate pathogens, genome sequencing and comparative analysis have been utilized as powerful tools to understand phytoplasma biology. To date four complete phytoplasma genome sequences have been published. However, these four strains represent limited phylogenetic diversity. In this study, we report the shotgun sequencing and evolutionary analysis of a peanut witches'-broom (PnWB) phytoplasma genome. The availability of this genome provides the first representative of the 16SrII group and substantially improves the taxon sampling to investigate genome evolution. The draft genome assembly contains 13 chromosomal contigs with a total size of 562,473 bp, covering ∼90% of the chromosome. Additionally, a complete plasmid sequence is included. Comparisons among the five available phytoplasma genomes reveal the differentiations in gene content and metabolic capacity. Notably, phylogenetic inferences of the potential mobile units (PMUs) in these genomes indicate that horizontal transfer may have occurred between divergent phytoplasma lineages. Because many effectors are associated with PMUs, the horizontal transfer of these transposon-like elements can contribute to the adaptation and diversification of these pathogens. In summary, the findings from this study highlight the importance of improving taxon sampling when investigating genome evolution. Moreover, the currently available sequences are inadequate to fully characterize the pan-genome of phytoplasmas. Future genome sequencing efforts to expand phylogenetic diversity are essential in improving our understanding of phytoplasma evolution. PMID:23626855
Behnke, Anke; Bunge, John; Barger, Kathryn; Breiner, Hans-Werner; Alla, Victoria; Stoeck, Thorsten
2006-01-01
To resolve the fine-scale architecture of anoxic protistan communities, we conducted a cultivation-independent 18S rRNA survey in the superanoxic Framvaren Fjord in Norway. We generated three clone libraries along the steep O2/H2S gradient, using the multiple-primer approach. Of 1,100 clones analyzed, 753 proved to be high-quality protistan target sequences. These sequences were grouped into 92 phylotypes, which displayed high protistan diversity in the fjord (17 major eukaryotic phyla). Only a few were closely related to known taxa. Several sequences were dissimilar to all previously described sequences and occupied a basal position in the inferred phylogenies, suggesting that the sequences recovered were derived from novel, deeply divergent eukaryotes. We detected sequence clades with evolutionary importance (for example, clades in the euglenozoa) and clades that seem to be specifically adapted to anoxic environments, challenging the hypothesis that the global dispersal of protists is uniform. Moreover, with the detection of clones affiliated with jakobid flagellates, we present evidence that primitive descendants of early eukaryotes are present in this anoxic environment. To estimate sample coverage and phylotype richness, we used parametric and nonparametric statistical methods. The results show that although our data set is one of the largest published inventories, our sample missed a substantial proportion of the protistan diversity. Nevertheless, statistical and phylogenetic analyses of the three libraries revealed the fine-scale architecture of anoxic protistan communities, which may exhibit adaptation to different environmental conditions along the O2/H2S gradient. PMID:16672511
2009-01-01
Background The full power of modern genetics has been applied to the study of speciation in only a small handful of genetic model species - all of which speciated allopatrically. Here we report the first large expressed sequence tag (EST) study of a candidate for ecological sympatric speciation, the apple maggot Rhagoletis pomonella, using massively parallel pyrosequencing on the Roche 454-FLX platform. To maximize transcript diversity we created and sequenced separate libraries from larvae, pupae, adult heads, and headless adult bodies. Results We obtained 239,531 sequences which assembled into 24,373 contigs. A total of 6810 unique protein coding genes were identified among the contigs and long singletons, corresponding to 48% of all known Drosophila melanogaster protein-coding genes. Their distribution across GO classes suggests that we have obtained a representative sample of the transcriptome. Among these sequences are many candidates for potential R. pomonella "speciation genes" (or "barrier genes") such as those controlling chemosensory and life-history timing processes. Furthermore, we identified important marker loci including more than 40,000 single nucleotide polymorphisms (SNPs) and over 100 microsatellites. An initial search for SNPs at which the apple and hawthorn host races differ suggested at least 75 loci warranting further work. We also determined that developmental expression differences remained even after normalization; transcripts expected to show different expression levels between larvae and pupae in D. melanogaster also did so in R. pomonella. Preliminary comparative analysis of transcript presences and absences revealed evidence of gene loss in Drosophila and gain in the higher dipteran clade Schizophora. Conclusions These data provide a much needed resource for exploring mechanisms of divergence in this important model for sympatric ecological speciation. Our description of ESTs from a substantial portion of the R. pomonella transcriptome will facilitate future functional studies of candidate genes for olfaction and diapause-related life history timing, and will enable large scale expression studies. Similarly, the identification of new SNP and microsatellite markers will facilitate future population and quantitative genetic studies of divergence between the apple and hawthorn-infesting host races. PMID:20035631
Mamos, Tomasz; Bącela-Spychalska, Karolina; Rewicz, Tomasz; Wattier, Remi A.
2017-01-01
Background The Balkans are a major worldwide biodiversity and endemism hotspot. Among the freshwater biota, amphipods are known for their high cryptic diversity. However, little is known about the temporal and paleogeographic aspects of their evolutionary history. We used paleogeography as a framework for understanding the onset of diversification in Gammarus roeselii: (1) we hypothesised that, given the high number of isolated waterbodies in the Balkans, the species is characterised by high level of cryptic diversity, even on a local scale; (2) the long geological history of the region might promote pre-Pleistocene divergence between lineages; (3) given that G. roeselii thrives both in lakes and rivers, its evolutionary history could be linked to the Balkan Neogene paleolake system; (4) we inspected whether the Pleistocene decline of hydrological networks could have any impact on the diversification of G. roeselii. Material and Methods DNA was extracted from 177 individuals collected from 26 sites all over Balkans. All individuals were amplified for ca. 650 bp long fragment of the mtDNA cytochrome oxidase subunit I (COI). After defining molecular operational taxonomic units (MOTU) based on COI, 50 individuals were amplified for ca. 900 bp long fragment of the nuclear 28S rDNA. Molecular diversity, divergence, differentiation and historical demography based on COI sequences were estimated for each MOTU. The relative frequency, geographic distribution and molecular divergence between COI haplotypes were presented as a median-joining network. COI was used also to reconstruct time-calibrated phylogeny with Bayesian inference. Probabilities of ancestors’ occurrence in riverine or lacustrine habitats, as well their possible geographic locations, were estimated with the Bayesian method. A Neighbour Joining tree was constructed to illustrate the phylogenetic relationships between 28S rDNA haplotypes. Results We revealed that G. roeselii includes at least 13 cryptic species or molecular operational taxonomic units (MOTUs), mostly of Miocene origin. A substantial Pleistocene diversification within-MOTUs was observed in several cases. We evidenced secondary contacts between very divergent MOTUs and introgression of nDNA. The Miocene ancestors could live in either lacustrine or riverine habitats yet their presumed geographic localisations overlapped with those of the Neogene lakes. Several extant riverine populations had Pleistocene lacustrine ancestors. Discussion Neogene divergence of lineages resulting in substantial cryptic diversity may be a common phenomenon in extant freshwater benthic crustaceans occupying areas that were not glaciated during the Pleistocene. Evolution of G. roeselii could be associated with gradual deterioration of the paleolakes. The within-MOTU diversification might be driven by fragmentation of river systems during the Pleistocene. Extant ancient lakes could serve as local microrefugia during that time. PMID:28265503
DOE Office of Scientific and Technical Information (OSTI.GOV)
Castelle, Cindy J.; Brown, Christopher T.; Thomas, Brian C.
The Candidate Phyla Radiation (CPR) is a large group of bacteria, the scale of which approaches that of all other bacteria. CPR organisms are inferred to depend on other community members for many basic cellular building blocks and all appear to be obligate anaerobes. To date, there has been no evidence for any significant respiratory capacity in an organism from this radiation. Here we report a curated draft genome for Candidatus Parcunitrobacter nitroensis' a member of the Parcubacteria (OD1) superphylum of the CPR. The genome encodes versatile energy pathways, including fermentative and respiratory capacities, nitrogen and fatty acid metabolism, asmore » well as the first complete electron transport chain described for a member of the CPR. The sequences of all of these enzymes are highly divergent from sequences found in other organisms, suggesting that these capacities were not recently acquired from non-CPR organisms. Although the wide respiration-based repertoire points to a different lifestyle compared to other CPR bacteria, we predict similar obligate dependence on other organisms or the microbial community. The results substantially expand the known metabolic potential of CPR bacteria, although sequence comparisons indicate that these capacities are very rare in members of this radiation.« less
Castelle, Cindy J.; Brown, Christopher T.; Thomas, Brian C.; ...
2017-01-09
The Candidate Phyla Radiation (CPR) is a large group of bacteria, the scale of which approaches that of all other bacteria. CPR organisms are inferred to depend on other community members for many basic cellular building blocks and all appear to be obligate anaerobes. To date, there has been no evidence for any significant respiratory capacity in an organism from this radiation. Here we report a curated draft genome for Candidatus Parcunitrobacter nitroensis' a member of the Parcubacteria (OD1) superphylum of the CPR. The genome encodes versatile energy pathways, including fermentative and respiratory capacities, nitrogen and fatty acid metabolism, asmore » well as the first complete electron transport chain described for a member of the CPR. The sequences of all of these enzymes are highly divergent from sequences found in other organisms, suggesting that these capacities were not recently acquired from non-CPR organisms. Although the wide respiration-based repertoire points to a different lifestyle compared to other CPR bacteria, we predict similar obligate dependence on other organisms or the microbial community. The results substantially expand the known metabolic potential of CPR bacteria, although sequence comparisons indicate that these capacities are very rare in members of this radiation.« less
Comparative Analyses of DNA Methylation and Sequence Evolution Using Nasonia Genomes
Park, Jungsun; Peng, Zuogang; Zeng, Jia; Elango, Navin; Park, Taesung; Wheeler, Dave; Werren, John H.; Yi, Soojin V.
2011-01-01
The functional and evolutionary significance of DNA methylation in insect genomes remains to be resolved. Nasonia is well situated for comparative analyses of DNA methylation and genome evolution, since the genomes of a moderately distant outgroup species as well as closely related sibling species are available. Using direct sequencing of bisulfite-converted DNA, we uncovered a substantial level of DNA methylation in 17 of 18 Nasonia vitripennis genes and a strong correlation between methylation level and CpG depletion. Notably, in the sex-determining locus transformer, the exon that is alternatively spliced between the sexes is heavily methylated in both males and females, whereas other exons are only sparsely methylated. Orthologous genes of the honeybee and Nasonia show highly similar relative levels of CpG depletion, despite ∼190 My divergence. Densely and sparsely methylated genes in these species also exhibit similar functional enrichments. We found that the degree of CpG depletion is negatively correlated with substitution rates between closely related Nasonia species for synonymous, nonsynonymous, and intron sites. This suggests that mutation rates increase with decreasing levels of germ line methylation. Thus, DNA methylation is prevalent in the Nasonia genome, may participate in regulatory processes such as sex determination and alternative splicing, and is correlated with several aspects of genome and sequence evolution. PMID:21693438
Van Nguyen, Cuong; Bonsall, David; Ngo, Tue Tri; Carrique-Mas, Juan; Pham, Anh Hong; Bryant, Juliet E.; Thwaites, Guy; Baker, Stephen; Woolhouse, Mark; Simmonds, Peter
2018-01-01
Rodents and bats are now widely recognised as important sources of zoonotic virus infections in other mammals, including humans. Numerous surveys have expanded our knowledge of diverse viruses in a range of rodent and bat species, including their origins, evolution, and range of hosts. In this study of pegivirus and human hepatitis-related viruses, liver and serum samples from Vietnamese rodents and bats were examined by PCR and sequencing. Nucleic acids homologous to human hepatitis B, C, E viruses were detected in liver samples of 2 (1.3%) of 157 bats, 38 (8.1%), and 14 (3%) of 470 rodents, respectively. Hepacivirus-like viruses were frequently detected (42.7%) in the bamboo rat, Rhizomys pruinosus, while pegivirus RNA was only evident in 2 (0.3%) of 638 rodent serum samples. Complete or near-complete genome sequences of HBV, HEV and pegivirus homologues closely resembled those previously reported from rodents and bats. However, complete coding region sequences of the rodent hepacivirus-like viruses substantially diverged from all of the currently classified variants and potentially represent a new species in the Hepacivirus genus. Of the viruses identified, their routes of transmission and potential to establish zoonoses remain to be determined. PMID:29495551
Thompson, Peter C; Zarlenga, Dante S; Liu, Ming-Yuan; Rosenthal, Benjamin M
2017-09-01
Genome assemblies can form the basis of comparative analyses fostering insight into the evolutionary genetics of a parasite's pathogenicity, host-pathogen interactions, environmental constraints and invasion biology; however, the length and complexity of many parasite genomes has hampered the development of well-resolved assemblies. In order to improve Trichinella genome assemblies, the genome of the sylvatic encapsulated species Trichinella murrelli was sequenced using third-generation, long-read technology and, using syntenic comparisons, scaffolded to a reference genome assembly of Trichinella spiralis, markedly improving both. A high-quality draft assembly for T. murrelli was achieved that totalled 63·2 Mbp, half of which was condensed into 26 contigs each longer than 571 000 bp. When compared with previous assemblies for parasites in the genus, ours required 10-fold fewer contigs, which were five times longer, on average. Better assembly across repetitive regions also enabled resolution of 8 Mbp of previously indeterminate sequence. Furthermore, syntenic comparisons identified widespread scaffold misassemblies in the T. spiralis reference genome. The two new assemblies, organized for the first time into three chromosomal scaffolds, will be valuable resources for future studies linking phenotypic traits within each species to their underlying genetic bases.
Deep reefs are not universal refuges: Reseeding potential varies among coral species
Bongaerts, Pim; Riginos, Cynthia; Brunner, Ramona; Englebert, Norbert; Smith, Struan R.; Hoegh-Guldberg, Ove
2017-01-01
Deep coral reefs (that is, mesophotic coral ecosystems) can act as refuges against major disturbances affecting shallow reefs. It has been proposed that, through the provision of coral propagules, such deep refuges may aid in shallow reef recovery; however, this “reseeding” hypothesis remains largely untested. We conducted a genome-wide assessment of two scleractinian coral species with contrasting reproductive modes, to assess the potential for connectivity between mesophotic (40 m) and shallow (12 m) depths on an isolated reef system in the Western Atlantic (Bermuda). To overcome the pervasive issue of endosymbiont contamination associated with de novo sequencing of corals, we used a novel subtraction reference approach. We have demonstrated that strong depth-associated selection has led to genome-wide divergence in the brooding species Agaricia fragilis (with divergence by depth exceeding divergence by location). Despite introgression from shallow into deep populations, a lack of first-generation migrants indicates that effective connectivity over ecological time scales is extremely limited for this species and thus precludes reseeding of shallow reefs from deep refuges. In contrast, no genetic structuring between depths (or locations) was observed for the broadcasting species Stephanocoenia intersepta, indicating substantial potential for vertical connectivity. Our findings demonstrate that vertical connectivity within the same reef system can differ greatly between species and that the reseeding potential of deep reefs in Bermuda may apply to only a small number of scleractinian species. Overall, we argue that the “deep reef refuge hypothesis” holds for individual coral species during episodic disturbances but should not be assumed as a broader ecosystem-wide phenomenon. PMID:28246645
Yew, Chee-Wei; Lu, Dongsheng; Deng, Lian; Wong, Lai-Ping; Ong, Rick Twee-Hee; Lu, Yan; Wang, Xiaoji; Yunus, Yushimah; Aghakhanian, Farhang; Mokhtar, Siti Shuhada; Hoque, Mohammad Zahirul; Voo, Christopher Lok-Yung; Abdul Rahman, Thuhairah; Bhak, Jong; Phipps, Maude E; Xu, Shuhua; Teo, Yik-Ying; Kumar, Subbiah Vijay; Hoh, Boon-Peng
2018-02-01
Southeast Asia (SEA) is enriched with a complex history of peopling. Malaysia, which is located at the crossroads of SEA, has been recognized as one of the hubs for early human migration. To unravel the genomic complexity of the native inhabitants of Malaysia, we sequenced 12 samples from 3 indigenous populations from Peninsular Malaysia and 4 native populations from North Borneo to a high coverage of 28-37×. We showed that the Negritos from Peninsular Malaysia shared a common ancestor with the East Asians, but exhibited some level of gene flow from South Asia, while the North Borneo populations exhibited closer genetic affinity towards East Asians than the Malays. The analysis of time of divergence suggested that ancestors of Negrito were the earliest settlers in the Malay Peninsula, whom first separated from the Papuans ~ 50-33 thousand years ago (kya), followed by East Asian (~ 40-15 kya), while the divergence time frame between North Borneo and East Asia populations predates the Austronesian expansion period implies a possible pre-Neolithic colonization. Substantial Neanderthal ancestry was confirmed in our genomes, as was observed in other East Asians. However, no significant difference was observed, in terms of the proportion of Denisovan gene flow into these native inhabitants from Malaysia. Judging from the similar amount of introgression in the Southeast Asians and East Asians, our findings suggest that the Denisovan gene flow may have occurred before the divergence of these populations and that the shared similarities are likely an ancestral component.
Sequence analysis of MHC class I α2 from sockeye salmon (Oncorhynchus nerka).
McClelland, Erin K; Ming, Tobi J; Tabata, Amy; Miller, Kristina M
2011-09-01
Most studies assessing adaptive MHC diversity in salmon populations have focused on the classical class II DAB or DAA loci, as these have been most amenable to single PCR amplifications due to their relatively low level of sequence divergence. Herein, we report the characterization of the classical class I UBA α2 locus based on collections taken throughout the species range of sockeye salmon (Oncorhynchus nerka). Through use of multiple lineage-specific primer sets, denaturing gradient gel electrophoresis and sequencing, we identified thirty-four alleles from three highly divergent lineages. Sequence identity between lineages ranged from 30.0% to 56.8% but was relatively high within lineages. Allelic identity within the antigen recognition site (ARS) was greater than for the longer sequence. Global positive selection on UBA was seen at the sequence level (dN:dS = 1.012) with four codons under positive selection and 12 codons under negative selection. Crown Copyright © 2011. Published by Elsevier Ltd. All rights reserved.
Bass, David; Moureau, Gregory; Tang, Shuoya; McAlister, Erica; Culverwell, C. Lorna; Glücksman, Edvard; Wang, Hui; Brown, T. David K.; Gould, Ernest A.; Harbach, Ralph E.; de Lamballerie, Xavier; Firth, Andrew E.
2013-01-01
We investigated whether small RNA (sRNA) sequenced from field-collected mosquitoes and chironomids (Diptera) can be used as a proxy signature of viral prevalence within a range of species and viral groups, using sRNAs sequenced from wild-caught specimens, to inform total RNA deep sequencing of samples of particular interest. Using this strategy, we sequenced from adult Anopheles maculipennis s.l. mosquitoes the apparently nearly complete genome of one previously undescribed virus related to chronic bee paralysis virus, and, from a pool of Ochlerotatus caspius and Oc. detritus mosquitoes, a nearly complete entomobirnavirus genome. We also reconstructed long sequences (1503-6557 nt) related to at least nine other viruses. Crucially, several of the sequences detected were reconstructed from host organisms highly divergent from those in which related viruses have been previously isolated or discovered. It is clear that viral transmission and maintenance cycles in nature are likely to be significantly more complex and taxonomically diverse than previously expected. PMID:24260463
Estimation of primate speciation dates using local molecular clocks.
Yoder, A D; Yang, Z
2000-07-01
Protein-coding genes of the mitochondrial genomes from 31 mammalian species were analyzed to estimate the speciation dates within primates and also between rats and mice. Three calibration points were used based on paleontological data: one at 20-25 MYA for the hominoid/cercopithecoid divergence, one at 53-57 MYA for the cetacean/artiodactyl divergence, and the third at 110-130 MYA for the metatherian/eutherian divergence. Both the nucleotide and the amino acid sequences were analyzed, producing conflicting results. The global molecular clock was clearly violated for both the nucleotide and the amino acid data. Models of local clocks were implemented using maximum likelihood, allowing different evolutionary rates for some lineages while assuming rate constancy in others. Surprisingly, the highly divergent third codon positions appeared to contain phylogenetic information and produced more sensible estimates of primate divergence dates than did the amino acid sequences. Estimated dates varied considerably depending on the data type, the calibration point, and the substitution model but differed little among the four tree topologies used. We conclude that the calibration derived from the primate fossil record is too recent to be reliable; we also point out a number of problems in date estimation when the molecular clock does not hold. Despite these obstacles, we derived estimates of primate divergence dates that were well supported by the data and were generally consistent with the paleontological record. Estimation of the mouse-rat divergence date, however, was problematic.
In-situ X-ray diffraction system using sources and detectors at fixed angular positions
Gibson, David M [Voorheesville, NY; Gibson, Walter M [Voorheesville, NY; Huang, Huapeng [Latham, NY
2007-06-26
An x-ray diffraction technique for measuring a known characteristic of a sample of a material in an in-situ state. The technique includes using an x-ray source for emitting substantially divergent x-ray radiation--with a collimating optic disposed with respect to the fixed source for producing a substantially parallel beam of x-ray radiation by receiving and redirecting the divergent paths of the divergent x-ray radiation. A first x-ray detector collects radiation diffracted from the sample; wherein the source and detector are fixed, during operation thereof, in position relative to each other and in at least one dimension relative to the sample according to a-priori knowledge about the known characteristic of the sample. A second x-ray detector may be fixed relative to the first x-ray detector according to the a-priori knowledge about the known characteristic of the sample, especially in a phase monitoring embodiment of the present invention.
Teng, Huajing; Zhang, Yaohua; Shi, Chengmin; Mao, Fengbiao; Cai, Wanshi; Lu, Liang; Zhao, Fangqing; Sun, Zhongsheng; Zhang, Jianxu
2017-01-01
Abstract Murine rodents are excellent models for study of adaptive radiations and speciation. Brown Norway rats (Rattus norvegicus) are successful global colonizers and the contributions of their domesticated laboratory strains to biomedical research are well established. To identify nucleotide-based speciation timing of the rat and genomic information contributing to its colonization capabilities, we analyzed 51 whole-genome sequences of wild-derived Brown Norway rats and their sibling species, R. nitidus, and identified over 20 million genetic variants in the wild Brown Norway rats that were absent in the laboratory strains, which substantially expand the reservoir of rat genetic diversity. We showed that divergence of the rat and its siblings coincided with drastic climatic changes that occurred during the Middle Pleistocene. Further, we revealed that there was a geographically widespread influx of genes between Brown Norway rats and the sibling species following the divergence, resulting in numerous introgressed regions in the genomes of admixed Brown Norway rats. Intriguing, genes related to chemical communications among these introgressed regions appeared to contribute to the population-specific adaptations of the admixed Brown Norway rats. Our data reveals evolutionary history of the Brown Norway rat, and offers new insights into the role of climatic changes in speciation of animals and the effect of interspecies introgression on animal adaptation. PMID:28482038
Remarkable ancient divergences amongst neglected lorisiform primates
Nekaris, K. Anne‐Isola; Perkin, Andrew; Bearder, Simon K.; Pimley, Elizabeth R.; Schulze, Helga; Streicher, Ulrike; Nadler, Tilo; Kitchener, Andrew; Zischler, Hans; Zinner, Dietmar; Roos, Christian
2015-01-01
Lorisiform primates (Primates: Strepsirrhini: Lorisiformes) represent almost 10% of the living primate species and are widely distributed in sub‐Saharan Africa and South/South‐East Asia; however, their taxonomy, evolutionary history, and biogeography are still poorly understood. In this study we report the largest molecular phylogeny in terms of the number of represented taxa. We sequenced the complete mitochondrial cytochrome b gene for 86 lorisiform specimens, including ∼80% of all the species currently recognized. Our results support the monophyly of the Galagidae, but a common ancestry of the Lorisinae and Perodicticinae (family Lorisidae) was not recovered. These three lineages have early origins, with the Galagidae and the Lorisinae diverging in the Oligocene at about 30 Mya and the Perodicticinae emerging in the early Miocene. Our mitochondrial phylogeny agrees with recent studies based on nuclear data, and supports Euoticus as the oldest galagid lineage and the polyphyletic status of Galagoides. Moreover, we have elucidated phylogenetic relationships for several species never included before in a molecular phylogeny. The results obtained in this study suggest that lorisiform diversity remains substantially underestimated and that previously unnoticed cryptic diversity might be present within many lineages, thus urgently requiring a comprehensive taxonomic revision of this primate group. © 2015 The Linnean Society of London PMID:26900177
Morphological and molecular evidence reveals recent hybridization between gorilla taxa.
Ackermann, Rebecca Rogers; Bishop, Jacqueline M
2010-01-01
Molecular studies have demonstrated a deep lineage split between the two gorilla species, as well as divisions within these taxa; estimates place this divergence in the mid-Pleistocene, with gene flow continuing until approximately 80,000 years ago. Here, we present analyses of skeletal data indicating the presence of substantial recent gene flow among gorillas at all taxonomic levels: between populations, subspecies, and species. Complementary analyses of DNA sequence variation suggest that low-level migration occurred primarily in a westerly-to-easterly direction. In western gorillas, the locations of hybrid phenotypes map closely to expectations based on population refugia and riverine barrier hypotheses, supporting the presence of significant vicariance-driven structuring and occasional admixture within this taxon. In eastern lowland gorillas, the high frequency of hybrid phenotypes is surprising, suggesting that this region represents a zone of introgression between eastern gorillas and migrants from the west, and underscoring the conservation priority of this critically endangered group. These results highlight the complex nature of evolutionary divergence in this genus, indicate that historical gene flow has played a major role in structuring gorilla diversity, and demonstrate that our understanding of the evolutionary processes responsible for shaping biodiversity can benefit immensely from consideration of morphological and molecular data in conjunction.
Horn, T; Chang, C A; Urdea, M S
1997-12-01
The divergent synthesis of branched DNA (bDNA) comb structures is described. This new type of bDNA contains one unique oligonucleotide, the primary sequence, covalently attached through a comb-like branch network to many identical copies of a different oligonucleotide, the secondary sequence. The bDNA comb structures were assembled on a solid support and several synthesis parameters were investigated and optimized. The bDNA comb molecules were characterized by polyacrylamide gel electrophoretic methods and by controlled cleavage at periodate-cleavable moieties incorporated during synthesis. The developed chemistry allows synthesis of bDNA comb molecules containing multiple secondary sequences. In the accompanying article we describe the synthesis and characterization of large bDNA combs containing all four deoxynucleotides for use as signal amplifiers in nucleic acid quantification assays.
Horn, T; Chang, C A; Urdea, M S
1997-01-01
The divergent synthesis of branched DNA (bDNA) comb structures is described. This new type of bDNA contains one unique oligonucleotide, the primary sequence, covalently attached through a comb-like branch network to many identical copies of a different oligonucleotide, the secondary sequence. The bDNA comb structures were assembled on a solid support and several synthesis parameters were investigated and optimized. The bDNA comb molecules were characterized by polyacrylamide gel electrophoretic methods and by controlled cleavage at periodate-cleavable moieties incorporated during synthesis. The developed chemistry allows synthesis of bDNA comb molecules containing multiple secondary sequences. In the accompanying article we describe the synthesis and characterization of large bDNA combs containing all four deoxynucleotides for use as signal amplifiers in nucleic acid quantification assays. PMID:9365265
Picard, François J.; Ke, Danbing; Boudreau, Dominique K.; Boissinot, Maurice; Huletsky, Ann; Richard, Dave; Ouellette, Marc; Roy, Paul H.; Bergeron, Michel G.
2004-01-01
A 761-bp portion of the tuf gene (encoding the elongation factor Tu) from 28 clinically relevant streptococcal species was obtained by sequencing amplicons generated using broad-range PCR primers. These tuf sequences were used to select Streptococcus-specific PCR primers and to perform phylogenetic analysis. The specificity of the PCR assay was verified using 102 different bacterial species, including the 28 streptococcal species. Genomic DNA purified from all streptococcal species was efficiently detected, whereas there was no amplification with DNA from 72 of the 74 nonstreptococcal bacterial species tested. There was cross-amplification with DNAs from Enterococcus durans and Lactococcus lactis. However, the 15 to 31% nucleotide sequence divergence in the 761-bp tuf portion of these two species compared to any streptococcal tuf sequence provides ample sequence divergence to allow the development of internal probes specific to streptococci. The Streptococcus-specific assay was highly sensitive for all 28 streptococcal species tested (i.e., detection limit of 1 to 10 genome copies per PCR). The tuf sequence data was also used to perform extensive phylogenetic analysis, which was generally in agreement with phylogeny determined on the basis of 16S rRNA gene data. However, the tuf gene provided a better discrimination at the streptococcal species level that should be particularly useful for the identification of very closely related species. In conclusion, tuf appears more suitable than the 16S ribosomal RNA gene for the development of diagnostic assays for the detection and identification of streptococcal species because of its higher level of species-specific genetic divergence. PMID:15297518
Centromere location in Arabidopsis is unaltered by extreme divergence in CENH3 protein sequence
2017-01-01
During cell division, spindle fibers attach to chromosomes at centromeres. The DNA sequence at regional centromeres is fast evolving with no conserved genetic signature for centromere identity. Instead CENH3, a centromere-specific histone H3 variant, is the epigenetic signature that specifies centromere location across both plant and animal kingdoms. Paradoxically, CENH3 is also adaptively evolving. An ongoing question is whether CENH3 evolution is driven by a functional relationship with the underlying DNA sequence. Here, we demonstrate that despite extensive protein sequence divergence, CENH3 histones from distant species assemble centromeres on the same underlying DNA sequence. We first characterized the organization and diversity of centromere repeats in wild-type Arabidopsis thaliana. We show that A. thaliana CENH3-containing nucleosomes exhibit a strong preference for a unique subset of centromeric repeats. These sequences are largely missing from the genome assemblies and represent the youngest and most homogeneous class of repeats. Next, we tested the evolutionary specificity of this interaction in a background in which the native A. thaliana CENH3 is replaced with CENH3s from distant species. Strikingly, we find that CENH3 from Lepidium oleraceum and Zea mays, although specifying epigenetically weaker centromeres that result in genome elimination upon outcrossing, show a binding pattern on A. thaliana centromere repeats that is indistinguishable from the native CENH3. Our results demonstrate positional stability of a highly diverged CENH3 on independently evolved repeats, suggesting that the sequence specificity of centromeres is determined by a mechanism independent of CENH3. PMID:28223399
Maruyama, Sandra Regina; Castro-Jorge, Luiza Antunes; Ribeiro, José Marcos Chaves; Gardinassi, Luiz Gustavo; Garcia, Gustavo Rocha; Brandão, Lucinda Giampietro; Rodrigues, Aline Rezende; Okada, Marcos Ituo; Abrão, Emiliana Pereira; Ferreira, Beatriz Rossetti; da Fonseca, Benedito Antonio Lopes; de Miranda-Santos, Isabel Kinney Ferreira
2013-01-01
Transcripts similar to those that encode the nonstructural (NS) proteins NS3 and NS5 from flaviviruses were found in a salivary gland (SG) complementary DNA (cDNA) library from the cattle tick Rhipicephalus microplus. Tick extracts were cultured with cells to enable the isolation of viruses capable of replicating in cultured invertebrate and vertebrate cells. Deep sequencing of the viral RNA isolated from culture supernatants provided the complete coding sequences for the NS3 and NS5 proteins and their molecular characterisation confirmed similarity with the NS3 and NS5 sequences from other flaviviruses. Despite this similarity, phylogenetic analyses revealed that this potentially novel virus may be a highly divergent member of the genus Flavivirus. Interestingly, we detected the divergent NS3 and NS5 sequences in ticks collected from several dairy farms widely distributed throughout three regions of Brazil. This is the first report of flavivirus-like transcripts in R. microplus ticks. This novel virus is a potential arbovirus because it replicated in arthropod and mammalian cells; furthermore, it was detected in a cDNA library from tick SGs and therefore may be present in tick saliva. It is important to determine whether and by what means this potential virus is transmissible and to monitor the virus as a potential emerging tick-borne zoonotic pathogen. PMID:24626302
Candida ruelliae sp. nov., a novel yeast species isolated from flowers of Ruellia sp. (Acanthaceae).
Saluja, Puja; Prasad, Gandham S
2008-06-01
Two novel yeast strains designated as 16Q1 and 16Q3 were isolated from flowers of the Ruellia species of the Acanthaceae family. The D1/D2 domain and ITS sequences of these two strains were identical. Sequence analysis of the D1/D2 domain of large-subunit rRNA gene indicated their relationship to species of the Candida haemulonii cluster. However, they differ from C. haemulonii by 14% nucleotide sequence divergence, from Candida pseudohaemulonii by 16.1% and from C. haemulonii type II by 16.5%. These strains also differ in 18 physiological tests from the type strain of C. haemulonii, and 12 and 16 tests, respectively, from C. pseudohaemulonii and C. haemulonii type II. They also differ from C. haemulonii and other related species by more than 13% sequence divergence in the internal transcribed spacer region. In the SSU rRNA gene sequences, strain 16Q1 differs by 1.7% nucleotide divergence from C. haemulonii. Sporulation was not observed in pure or mixed cultures on several media examined. All these data support the assignment of these strains to a novel species; we have named them as Candida ruelliae sp. nov., and designate strain 16Q1(T)=MTCC 7739(T)=CBS10815(T) as type strain of the novel species.
LinkFinder: An expert system that constructs phylogenic trees
NASA Technical Reports Server (NTRS)
Inglehart, James; Nelson, Peter C.
1991-01-01
An expert system has been developed using the C Language Integrated Production System (CLIPS) that automates the process of constructing DNA sequence based phylogenies (trees or lineages) that indicate evolutionary relationships. LinkFinder takes as input homologous DNA sequences from distinct individual organisms. It measures variations between the sequences, selects appropriate proportionality constants, and estimates the time that has passed since each pair of organisms diverged from a common ancestor. It then designs and outputs a phylogenic map summarizing these results. LinkFinder can find genetic relationships between different species, and between individuals of the same species, including humans. It was designed to take advantage of the vast amount of sequence data being produced by the Genome Project, and should be of value to evolution theorists who wish to utilize this data, but who have no formal training in molecular genetics. Evolutionary theory holds that distinct organisms carrying a common gene inherited that gene from a common ancestor. Homologous genes vary from individual to individual and species to species, and the amount of variation is now believed to be directly proportional to the time that has passed since divergence from a common ancestor. The proportionality constant must be determined experimentally; it varies considerably with the types of organisms and DNA molecules under study. Given an appropriate constant, and the variation between two DNA sequences, a simple linear equation gives the divergence time.
Next generation sequencing and analysis of a conserved transcriptome of New Zealand's kiwi.
Subramanian, Sankar; Huynen, Leon; Millar, Craig D; Lambert, David M
2010-12-15
Kiwi is a highly distinctive, flightless and endangered ratite bird endemic to New Zealand. To understand the patterns of molecular evolution of the nuclear protein-coding genes in brown kiwi (Apteryx australis mantelli) and to determine the timescale of avian history we sequenced a transcriptome obtained from a kiwi embryo using next generation sequencing methods. We then assembled the conserved protein-coding regions using the chicken proteome as a scaffold. Using 1,543 conserved protein coding genes we estimated the neutral evolutionary divergence between the kiwi and chicken to be ~45%, which is approximately equal to the divergence computed for the human-mouse pair using the same set of genes. A large fraction of genes was found to be under high selective constraint, as most of the expressed genes appeared to be involved in developmental gene regulation. Our study suggests a significant relationship between gene expression levels and protein evolution. Using sequences from over 700 nuclear genes we estimated the divergence between the two basal avian groups, Palaeognathae and Neognathae to be 132 million years, which is consistent with previous studies using mitochondrial genes. The results of this investigation revealed patterns of mutation and purifying selection in conserved protein coding regions in birds. Furthermore this study suggests a relatively cost-effective way of obtaining a glimpse into the fundamental molecular evolutionary attributes of a genome, particularly when no closely related genomic sequence is available.
Govindarajulu, Rajanikanth; Hughes, Colin E; Alexander, Patrick J; Bailey, C Donovan
2011-12-01
The evolutionary history of Leucaena has been impacted by polyploidy, hybridization, and divergent allopatric species diversification, suggesting that this is an ideal group to investigate the evolutionary tempo of polyploidy and the complexities of reticulation and divergence in plant diversification. Parsimony- and ML-based phylogenetic approaches were applied to 105 accessions sequenced for six sequence characterized amplified region-based nuclear encoded loci, nrDNA ITS, and four cpDNA regions. Hypotheses for the origin of tetraploid species were inferred using results derived from a novel species tree and established gene tree methods and from data on genome sizes and geographic distributions. The combination of comprehensively sampled multilocus DNA sequence data sets and a novel methodology provide strong resolution and support for the origins of all five tetraploid species. A minimum of four allopolyploidization events are required to explain the origins of these species. The origin(s) of one tetraploid pair (L. involucrata/L. pallida) can be equally explained by two unique allopolyploidizations or a single event followed by divergent speciation. Alongside other recent findings, a comprehensive picture of the complex evolutionary dynamics of polyploidy in Leucaena is emerging that includes paleotetraploidization, diploidization of the last common ancestor to Leucaena, allopatric divergence among diploids, and recent allopolyploid origins for tetraploid species likely associated with human translocation of seed. These results provide insights into the role of divergence and reticulation in a well-characterized angiosperm lineage and into traits of diploid parents and derived tetraploids (particularly self-compatibility and year-round flowering) favoring the formation and establishment of novel tetraploids combinations.
Hornok, Sándor; Wang, Yuanzhi; Otranto, Domenico; Keskin, Adem; Lia, Riccardo Paolo; Kontschán, Jenő; Takács, Nóra; Farkas, Róbert; Sándor, Attila D
2016-12-15
Haemaphysalis erinacei is one of the few ixodid tick species for which valid names of subspecies exist. Despite their disputed taxonomic status in the literature, these subspecies have not yet been compared with molecular methods. The aim of the present study was to investigate the phylogenetic relationships of H. erinacei subspecies, in the context of the first finding of this tick species in Romania. After morphological identification, DNA was extracted from five adults of H. e. taurica (from Romania and Turkey), four adults of H. e. erinacei (from Italy) and 17 adults of H. e. turanica (from China). From these samples fragments of the cytochrome c oxidase subunit 1 (cox1) and 16S rRNA genes were amplified via PCR and sequenced. Results showed that cox1 and 16S rRNA gene sequence divergences between H. e. taurica from Romania and H. e. erinacei from Italy were below 2%. However, the sequence divergences between H. e. taurica from Romania and H. e. turanica from China were high (up to 7.3% difference for the 16S rRNA gene), exceeding the reported level of sequence divergence between closely related tick species. At the same time, two adults of H. e. taurica from Turkey had higher 16S rRNA gene similarity to H. e. turanica from China (up to 97.5%) than to H. e. taurica from Romania (96.3%), but phylogenetically clustered more closely to H. e. taurica than to H. e. turanica. This is the first finding of H. erinacei in Romania, and the first (although preliminary) phylogenetic comparison of H. erinacei subspecies. Phylogenetic analyses did not support that the three H. erinacei subspecies evaluated here are of equal taxonomic rank, because the genetic divergence between H. e. turanica from China and H. e. taurica from Romania exceeded the usual level of sequence divergence between closely related tick species, suggesting that they might represent different species. Therefore, the taxonomic status of the subspecies of H. erinacei needs to be revised based on a larger number of specimens collected throughout its geographical range.
Phylogenetic analysis of Demodex caprae based on mitochondrial 16S rDNA sequence.
Zhao, Ya-E; Hu, Li; Ma, Jun-Xian
2013-11-01
Demodex caprae infests the hair follicles and sebaceous glands of goats worldwide, which not only seriously impairs goat farming, but also causes a big economic loss. However, there are few reports on the DNA level of D. caprae. To reveal the taxonomic position of D. caprae within the genus Demodex, the present study conducted phylogenetic analysis of D. caprae based on mt16S rDNA sequence data. D. caprae adults and eggs were obtained from a skin nodule of the goat suffering demodicidosis. The mt16S rDNA sequences of individual mite were amplified using specific primers, and then cloned, sequenced, and aligned. The sequence divergence, genetic distance, and transition/transversion rate were computed, and the phylogenetic trees in Demodex were reconstructed. Results revealed the 339-bp partial sequences of six D. caprae isolates were obtained, and the sequence identity was 100% among isolates. The pairwise divergences between D. caprae and Demodex canis or Demodex folliculorum or Demodex brevis were 22.2-24.0%, 24.0-24.9%, and 22.9-23.2%, respectively. The corresponding average genetic distances were 2.840, 2.926, and 2.665, and the average transition/transversion rates were 0.70, 0.55, and 0.54, respectively. The divergences, genetic distances, and transition/transversion rates of D. caprae versus the other three species all reached interspecies level. The five phylogenetic trees all presented that D. caprae clustered with D. brevis first, and then with D. canis, D. folliculorum, and Demodex injai in sequence. In conclusion, D. caprae is an independent species, and it is closer to D. brevis than to D. canis, D. folliculorum, or D. injai.
Hybridization Reveals the Evolving Genomic Architecture of Speciation
Kronforst, Marcus R.; Hansen, Matthew E.B.; Crawford, Nicholas G.; Gallant, Jason R.; Zhang, Wei; Kulathinal, Rob J.; Kapan, Durrell D.; Mullen, Sean P.
2014-01-01
SUMMARY The rate at which genomes diverge during speciation is unknown, as are the physical dynamics of the process. Here, we compare full genome sequences of 32 butterflies, representing five species from a hybridizing Heliconius butterfly community, to examine genome-wide patterns of introgression and infer how divergence evolves during the speciation process. Our analyses reveal that initial divergence is restricted to a small fraction of the genome, largely clustered around known wing-patterning genes. Over time, divergence evolves rapidly, due primarily to the origin of new divergent regions. Furthermore, divergent genomic regions display signatures of both selection and adaptive introgression, demonstrating the link between microevolutionary processes acting within species and the origin of species across macroevolutionary timescales. Our results provide a uniquely comprehensive portrait of the evolving species boundary due to the role that hybridization plays in reducing the background accumulation of divergence at neutral sites. PMID:24183670
Dissecting the relationship between protein structure and sequence variation
NASA Astrophysics Data System (ADS)
Shahmoradi, Amir; Wilke, Claus; Wilke Lab Team
2015-03-01
Over the past decade several independent works have shown that some structural properties of proteins are capable of predicting protein evolution. The strength and significance of these structure-sequence relations, however, appear to vary widely among different proteins, with absolute correlation strengths ranging from 0 . 1 to 0 . 8 . Here we present the results from a comprehensive search for the potential biophysical and structural determinants of protein evolution by studying more than 200 structural and evolutionary properties in a dataset of 209 monomeric enzymes. We discuss the main protein characteristics responsible for the general patterns of protein evolution, and identify sequence divergence as the main determinant of the strengths of virtually all structure-evolution relationships, explaining ~ 10 - 30 % of observed variation in sequence-structure relations. In addition to sequence divergence, we identify several protein structural properties that are moderately but significantly coupled with the strength of sequence-structure relations. In particular, proteins with more homogeneous back-bone hydrogen bond energies, large fractions of helical secondary structures and low fraction of beta sheets tend to have the strongest sequence-structure relation. BEACON-NSF center for the study of evolution in action.
Abergel, Chantal; Blanc, Guillaume; Monchois, Vincent; Renesto, Patricia; Sigoillot, Cécile; Ogata, Hiroyuki; Raoult, Didier; Claverie, Jean-Michel
2006-11-01
The genomic sequencing of Rickettsia conorii revealed a new family of Rickettsia-specific palindromic elements (RPEs) capable of in-frame insertion in preexisting open reading frames (ORFs). Many of these altered ORFs correspond to proteins with well-characterized or essential functions in other microorganisms. Previous experiments indicated that RPE-containing genes are normally transcribed and that no excision of the repeat occurs at the mRNA level. Using mass spectrometry, we now confirmed the retention of the RPE-derived amino acid residues in 4 proteins successfully expressed in Escherichia coli, raising the general question of the consequences of this common insertion event on the fitness of Rickettsia enzymes. The predicted guanylate kinase activity of the R. conorii gmk gene product was measured both on the RPE-containing and RPE-excised recombinant proteins. We show that the 2 proteins are active but exhibit substantial differences in their affinity for adenosine triphosphate, guanosine monophosphate, and catalytic constants. The distribution of the RPEgmk insert among Rickettsia species indicates that the insertion event is ancient and occurred after the divergence of Rickettsia felis and R. conorii but before that of Rickettsia helvetica and R. conorii. We found no evidence that the gmk gene fixed adaptive changes to compensate the RPE peptide insertion. Furthermore, the analysis of the rates of divergence in 23 RPE-containing genes indicates that coding RPE repeats tend to evolve under weak selective constraint, at a rate similar to intergenic noncoding RPE sequences. Altogether, these results suggest that the insertion of RPE-encoded "selfish peptides," although respecting the original fold and activity of the host proteins, might be slightly detrimental to the enzyme efficiency within limits tolerable for slow-growing intracellular parasites such as Rickettsia.
Swift, Camm C.; Spies, Brenton; Ellingson, Ryan A.; Jacobs, David K.
2016-01-01
A geographically isolated set of southern localities of the formerly monotypic goby genus Eucyclogobius is known to be reciprocally monophyletic and substantially divergent in mitochondrial sequence and nuclear microsatellite-based phylogenies relative to populations to the north along the California coast. To clarify taxonomic and conservation status, we conducted a suite of analyses on a comprehensive set of morphological counts and measures from across the range of Eucyclogobius and describe the southern populations as a new species, the Southern Tidewater Goby, Eucyclogobius kristinae, now separate from the Northern Tidewater Goby Eucyclogobius newberryi (Girard 1856). In addition to molecular distinction, adults of E. kristinae are diagnosed by: 1) loss of the anterior supratemporal lateral-line canals resulting in higher neuromast counts, 2) lower pectoral and branched caudal ray counts, and 3) sets of measurements identified via discriminant analysis. These differences suggest ecological distinction of the two species. Previous studies estimated lineage separation at 2–4 million years ago, and mitochondrial sequence divergence exceeds that of other recognized fish species. Fish from Santa Monica Artesian Springs (Los Angeles County) northward belong to E. newberryi; those from Aliso Creek (Orange County) southward constitute E. kristinae. The lagoonal habitat of Eucyclogobius has been diminished or degraded, leading to special conservation status at state and federal levels beginning in 1980. Habitat of the newly described species has been impacted by a range of anthropogenic activities, including the conversion of closing lagoons to open tidal systems in the name of restoration. In the last 30 years, E. kristinae has only been observed in nine intermittently occupied lagoonal systems in northern San Diego County; it currently persists in only three sites. Thus, the new species is in imminent danger of extinction and will require ongoing active management. PMID:27462700
Garcia-Rodriguez, A. I.; Bowen, B.W.; Domning, D.; Mignucci-Giannoni, A. A.; Marmontel, M.; Montoya-Ospina, R. A.; Morales-Vela, B.; Rudin, M.; Bonde, R.K.; McGuire, P.M.
1998-01-01
To resolve the population genetic structure and phylogeography of the West Indian manatee (Trichechus manatus), mitochondrial (mt) DNA control region sequences were compared among eight locations across the western Atlantic region. Fifteen haplotypes were identified among 86 individuals from Florida, Puerto Rico, the Dominican Republic, Mexico, Colombia, Venezuela, Guyana and Brazil. Despite the manatee's ability to move thousands of kilometres along continental margins, strong population separations between most locations were demonstrated with significant haplotype frequency shifts. These findings are consistent with tagging studies which indicate that stretches of open water and unsuitable coastal habitats constitute substantial barriers to gene flow and colonization. Low levels of genetic diversity within Florida and Brazilian samples might be explained by recent colonization into high latitudes or bottleneck effects. Three distinctive mtDNA lineages were observed in an intraspecific phylogeny of T. manatus, corresponding approximately to: (i) Florida and the West Indies; (ii) the Gulf of Mexico to the Caribbean rivers of South America; and (iii) the northeast Atlantic coast of South America. These lineages, which are not concordant with previous subspecies designations, are separated by sequence divergence estimates of d = 0.04-0.07, approximately the same level of divergence observed between T. manatus and the Amazonian manatee (T. inunguis, n = 16). Three individuals from Guyana, identified as T. manatus, had mtDNA haplotypes which are affiliated with the endemic Amazon form T. inunguis. The three primary T. manatus lineages and the T. inunguis lineage may represent relatively deep phylogeographic partitions which have been bridged recently due to changes in habitat availability (after the Wisconsin glacial period, 10 000 BP), natural colonization, and human-mediated transplantation.
Park, Jihye; Zhang, Ying; Chen, Chun; Dudley, Edward G; Harvill, Eric T
2015-12-01
Secretion systems are key virulence factors, modulating interactions between pathogens and the host's immune response. Six potential secretion systems (types 1-6; T1SS-T6SS) have been discussed in classical bordetellae, respiratory commensals/pathogens of mammals. The prototypical Bordetella bronchiseptica strain RB50 genome seems to contain all six systems, whilst two human-restricted subspecies, Bordetella parapertussis and Bordetella pertussis, have lost different subsets of these. This implicates secretion systems in the divergent evolutionary histories that have led to their success in different niches. Based on our previous work demonstrating that changes in secretion systems are associated with virulence characteristics, we hypothesized there would be substantial divergence of the loci encoding each amongst sequenced strains. Here, we describe extensive differences in secretion system loci; 10 of the 11 sequenced strains had lost subsets of genes or one entire secretion system locus. These loci contained genes homologous to those present in the respective loci in distantly related organisms, as well as genes unique to bordetellae, suggesting novel and/or auxiliary functions. The high degree of conservation of the T3SS locus, a complex machine with interdependent parts that must be conserved, stands in dramatic contrast to repeated loss of T5aSS 'autotransporters', which function as an autonomous unit. This comparative analysis provided insights into critical aspects of each pathogen's adaptation to its different niche, and the relative contributions of recombination, mutation and horizontal gene transfer. In addition, the relative conservation of various secretion systems is an important consideration in the ongoing search for more highly conserved protective antigens for the next generation of pertussis vaccines.
The Phylogenetics and Ecology of the Orthopoxviruses Endemic to North America
Emerson, Ginny L.; Li, Yu; Frace, Michael A.; Olsen-Rasmussen, Melissa A.; Khristova, Marina L.; Govil, Dhwani; Sammons, Scott A.; Regnery, Russell L.; Karem, Kevin L.; Damon, Inger K.; Carroll, Darin S.
2009-01-01
The data presented herein support the North American orthopoxviruses (NA OPXV) in a sister relationship to all other currently described Orthopoxvirus (OPXV) species. This phylogenetic analysis reaffirms the identification of the NA OPXV as close relatives of “Old World” (Eurasian and African) OPXV and presents high support for deeper nodes within the Chordopoxvirinae family. The natural reservoir host(s) for many of the described OPXV species remains unknown although a clear virus-host association exists between the genus OPXV and several mammalian taxa. The hypothesized host associations and the deep divergence of the OPXV/NA OPXV clades depicted in this study may reflect the divergence patterns of the mammalian faunas of the Old and New World and reflect a more ancient presence of OPXV on what are now the American continents. Genes from the central region of the poxvirus genome are generally more conserved than genes from either end of the linear genome due to functional constraints imposed on viral replication abilities. The relatively slower evolution of these genes may more accurately reflect the deeper history among the poxvirus group, allowing for robust placement of the NA OPXV within Chordopoxvirinae. Sequence data for nine genes were compiled from three NA OPXV strains plus an additional 50 genomes collected from Genbank. The current, gene sequence based phylogenetic analysis reaffirms the identification of the NA OPXV as the nearest relatives of “Old World” OPXV and presents high support for deeper nodes within the Chordopoxvirinae family. Additionally, the substantial genetic distances that separate the currently described NA OPXV species indicate that it is likely that many more undescribed OPXV/NA OPXV species may be circulating among wild animals in North America. PMID:19865479
Wóycicki, Rafał; Witkowicz, Justyna; Gawroński, Piotr; Dąbrowska, Joanna; Lomsadze, Alexandre; Pawełkowicz, Magdalena; Siedlecka, Ewa; Yagi, Kohei; Pląder, Wojciech; Seroczyńska, Anna; Śmiech, Mieczysław; Gutman, Wojciech; Niemirowicz-Szczytt, Katarzyna; Bartoszewski, Grzegorz; Tagashira, Norikazu; Hoshi, Yoshikazu; Borodovsky, Mark; Karpiński, Stanisław; Malepszy, Stefan; Przybecki, Zbigniew
2011-01-01
Cucumber (Cucumis sativus L.), a widely cultivated crop, has originated from Eastern Himalayas and secondary domestication regions includes highly divergent climate conditions e.g. temperate and subtropical. We wanted to uncover adaptive genome differences between the cucumber cultivars and what sort of evolutionary molecular mechanisms regulate genetic adaptation of plants to different ecosystems and organism biodiversity. Here we present the draft genome sequence of the Cucumis sativus genome of the North-European Borszczagowski cultivar (line B10) and comparative genomics studies with the known genomes of: C. sativus (Chinese cultivar – Chinese Long (line 9930)), Arabidopsis thaliana, Populus trichocarpa and Oryza sativa. Cucumber genomes show extensive chromosomal rearrangements, distinct differences in quantity of the particular genes (e.g. involved in photosynthesis, respiration, sugar metabolism, chlorophyll degradation, regulation of gene expression, photooxidative stress tolerance, higher non-optimal temperatures tolerance and ammonium ion assimilation) as well as in distributions of abscisic acid-, dehydration- and ethylene-responsive cis-regulatory elements (CREs) in promoters of orthologous group of genes, which lead to the specific adaptation features. Abscisic acid treatment of non-acclimated Arabidopsis and C. sativus seedlings induced moderate freezing tolerance in Arabidopsis but not in C. sativus. This experiment together with analysis of abscisic acid-specific CRE distributions give a clue why C. sativus is much more susceptible to moderate freezing stresses than A. thaliana. Comparative analysis of all the five genomes showed that, each species and/or cultivars has a specific profile of CRE content in promoters of orthologous genes. Our results constitute the substantial and original resource for the basic and applied research on environmental adaptations of plants, which could facilitate creation of new crops with improved growth and yield in divergent conditions. PMID:21829493
Wóycicki, Rafał; Witkowicz, Justyna; Gawroński, Piotr; Dąbrowska, Joanna; Lomsadze, Alexandre; Pawełkowicz, Magdalena; Siedlecka, Ewa; Yagi, Kohei; Pląder, Wojciech; Seroczyńska, Anna; Śmiech, Mieczysław; Gutman, Wojciech; Niemirowicz-Szczytt, Katarzyna; Bartoszewski, Grzegorz; Tagashira, Norikazu; Hoshi, Yoshikazu; Borodovsky, Mark; Karpiński, Stanisław; Malepszy, Stefan; Przybecki, Zbigniew
2011-01-01
Cucumber (Cucumis sativus L.), a widely cultivated crop, has originated from Eastern Himalayas and secondary domestication regions includes highly divergent climate conditions e.g. temperate and subtropical. We wanted to uncover adaptive genome differences between the cucumber cultivars and what sort of evolutionary molecular mechanisms regulate genetic adaptation of plants to different ecosystems and organism biodiversity. Here we present the draft genome sequence of the Cucumis sativus genome of the North-European Borszczagowski cultivar (line B10) and comparative genomics studies with the known genomes of: C. sativus (Chinese cultivar--Chinese Long (line 9930)), Arabidopsis thaliana, Populus trichocarpa and Oryza sativa. Cucumber genomes show extensive chromosomal rearrangements, distinct differences in quantity of the particular genes (e.g. involved in photosynthesis, respiration, sugar metabolism, chlorophyll degradation, regulation of gene expression, photooxidative stress tolerance, higher non-optimal temperatures tolerance and ammonium ion assimilation) as well as in distributions of abscisic acid-, dehydration- and ethylene-responsive cis-regulatory elements (CREs) in promoters of orthologous group of genes, which lead to the specific adaptation features. Abscisic acid treatment of non-acclimated Arabidopsis and C. sativus seedlings induced moderate freezing tolerance in Arabidopsis but not in C. sativus. This experiment together with analysis of abscisic acid-specific CRE distributions give a clue why C. sativus is much more susceptible to moderate freezing stresses than A. thaliana. Comparative analysis of all the five genomes showed that, each species and/or cultivars has a specific profile of CRE content in promoters of orthologous genes. Our results constitute the substantial and original resource for the basic and applied research on environmental adaptations of plants, which could facilitate creation of new crops with improved growth and yield in divergent conditions.
Swift, Camm C; Spies, Brenton; Ellingson, Ryan A; Jacobs, David K
2016-01-01
A geographically isolated set of southern localities of the formerly monotypic goby genus Eucyclogobius is known to be reciprocally monophyletic and substantially divergent in mitochondrial sequence and nuclear microsatellite-based phylogenies relative to populations to the north along the California coast. To clarify taxonomic and conservation status, we conducted a suite of analyses on a comprehensive set of morphological counts and measures from across the range of Eucyclogobius and describe the southern populations as a new species, the Southern Tidewater Goby, Eucyclogobius kristinae, now separate from the Northern Tidewater Goby Eucyclogobius newberryi (Girard 1856). In addition to molecular distinction, adults of E. kristinae are diagnosed by: 1) loss of the anterior supratemporal lateral-line canals resulting in higher neuromast counts, 2) lower pectoral and branched caudal ray counts, and 3) sets of measurements identified via discriminant analysis. These differences suggest ecological distinction of the two species. Previous studies estimated lineage separation at 2-4 million years ago, and mitochondrial sequence divergence exceeds that of other recognized fish species. Fish from Santa Monica Artesian Springs (Los Angeles County) northward belong to E. newberryi; those from Aliso Creek (Orange County) southward constitute E. kristinae. The lagoonal habitat of Eucyclogobius has been diminished or degraded, leading to special conservation status at state and federal levels beginning in 1980. Habitat of the newly described species has been impacted by a range of anthropogenic activities, including the conversion of closing lagoons to open tidal systems in the name of restoration. In the last 30 years, E. kristinae has only been observed in nine intermittently occupied lagoonal systems in northern San Diego County; it currently persists in only three sites. Thus, the new species is in imminent danger of extinction and will require ongoing active management.
McConnell, Sean C.; Hernandez, Kyle M.; Wcisel, Dustin J.; Kettleborough, Ross N.; Stemple, Derek L.; Andrade, Jorge; de Jong, Jill L. O.
2016-01-01
Antigen processing and presentation genes found within the MHC are among the most highly polymorphic genes of vertebrate genomes, providing populations with diverse immune responses to a wide array of pathogens. Here, we describe transcriptome, exome, and whole-genome sequencing of clonal zebrafish, uncovering the most extensive diversity within the antigen processing and presentation genes of any species yet examined. Our CG2 clonal zebrafish assembly provides genomic context within a remarkably divergent haplotype of the core MHC region on chromosome 19 for six expressed genes not found in the zebrafish reference genome: mhc1uga, proteasome-β 9b (psmb9b), psmb8f, and previously unknown genes psmb13b, tap2d, and tap2e. We identify ancient lineages for Psmb13 within a proteasome branch previously thought to be monomorphic and provide evidence of substantial lineage diversity within each of three major trifurcations of catalytic-type proteasome subunits in vertebrates: Psmb5/Psmb8/Psmb11, Psmb6/Psmb9/Psmb12, and Psmb7/Psmb10/Psmb13. Strikingly, nearby tap2 and MHC class I genes also retain ancient sequence lineages, indicating that alternative lineages may have been preserved throughout the entire MHC pathway since early diversification of the adaptive immune system ∼500 Mya. Furthermore, polymorphisms within the three MHC pathway steps (antigen cleavage, transport, and presentation) are each predicted to alter peptide specificity. Lastly, comparative analysis shows that antigen processing gene diversity is far more extensive than previously realized (with ancient coelacanth psmb8 lineages, shark psmb13, and tap2t and psmb10 outside the teleost MHC), implying distinct immune functions and conserved roles in shaping MHC pathway evolution throughout vertebrates. PMID:27493218
Full-genome sequence and analysis of a novel human rhinovirus strain within a divergent HRV-A clade.
Rathe, Jennifer A; Liu, Xinyue; Tallon, Luke J; Gern, James E; Liggett, Stephen B
2010-01-01
Genome sequences of human rhinoviruses (HRV) have primarily been from stocks collected in the 1960s, with genomes and phylogeny of modern HRVs remaining undefined. Here, two modern isolates (hrv-A101 and hrv-A101-v1) collected approximately 8 years apart were sequenced in their entirety. Incorporation into our full-genome HRV alignment with subsequent phylogenetic network inference indicated that these represent a unique HRV-A, localized within a distinct divergent clade. They appear to have resulted from recombination of the hrv-65 and hrv-78 lineages. These results support our contention that there are unrecognized distinct HRV-A strains, and that recombination is evident in currently circulating strains.
2010-01-01
Background Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Results Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M. balbisiana haplotypes. Conclusions A large RGA08 gene cluster identified in wild banana corresponds to a highly variable genomic region between haplotypes surrounded by conserved flanking regions. High level of sequence identity (70 to 99%) of the genic and intergenic regions suggests a recent and rapid evolution of this cluster in M. balbisiana. PMID:20637079
2013-01-01
Background Grapes are one of the most economically important fruit crops. There are about 60 species in the genus Vitis. The phylogenetic relationships among these species are of keen interest for the conservation and use of this germplasm. We selected 309 accessions from 48 Vitis species,varieties, and outgroups, examined ~11 kb (~3.4 Mb total) of aligned nuclear DNA sequences from 27 unlinked genes in a phylogenetic context, and estimated divergence times based on fossil calibrations. Results Vitis formed a strongly supported clade. There was substantial support for species and less for the higher-level groupings (series). As estimated from extant taxa, the crown age of Vitis was 28 Ma and the divergence of subgenera (Vitis and Muscadinia) occurred at ~18 Ma. Higher clades in subgenus Vitis diverged 16 – 5 Ma with overlapping confidence intervals, and ongoing divergence formed extant species at 12 – 1.3 Ma. Several species had species-specific SNPs. NeighborNet analysis showed extensive reticulation at the core of subgenus Vitis representing the deeper nodes, with extensive reticulation radiating outward. Fitch Parsimony identified North America as the origin of the most recent common ancestor of extant Vitis species. Conclusions Phylogenetic patterns suggested origination of the genus in North America, fragmentation of an ancestral range during the Miocene, formation of extant species in the late Miocene-Pleistocene, and differentiation of species in the context of Pliocene-Quaternary tectonic and climatic change. Nuclear SNPs effectively resolved relationships at and below the species level in grapes and rectified several misclassifications of accessions in the repositories. Our results challenge current higher-level classifications, reveal the abundance of genetic diversity in the genus that is potentially available for crop improvement, and provide a valuable resource for species delineation, germplasm conservation and use. PMID:23826735
Nilsson, Maria A; Härlid, Anna; Kullberg, Morgan; Janke, Axel
2010-05-01
The native rodents are the most species-rich placental mammal group on the Australian continent. Fossils of native Australian rodents belonging to the group Conilurini are known from Northern Australia at 4.5Ma. These fossil assemblages already display a rich diversity of rodents, but the exact timing of their arrival on the Australian continent is not yet established. The complete mitochondrial genomes of two native Australian rodents, Leggadina lakedownensis (Lakeland Downs mouse) and Pseudomys chapmani (Western Pebble-mound mouse) were sequenced for investigating their evolutionary history. The molecular data were used for studying the phylogenetic position and divergence times of the Australian rodents, using 12 calibration points and various methods. Phylogenetic analyses place the native Australian rodents as the sister-group to the genus Mus. The Mus-Conilurini calibration point (7.3-11.0Ma) is highly critical for estimating rodent divergence times, while the influence of the different algorithms on estimating divergence times is negligible. The influence of the data type was investigated, indicating that amino acid data are more likely to reflect the correct divergence times than nucleotide sequences. The study on the problems related to estimating divergence times in fast-evolving lineages such as rodents, emphasize the choice of data and calibration points as being critical. Furthermore, it is essential to include accurate calibration points for fast-evolving groups, because the divergence times can otherwise be estimated to be significantly older. The divergence times of the Australian rodents are highly congruent and are estimated to 6.5-7.2Ma, a date that is compatible with their fossil record.
Brettanomyces acidodurans sp. nov., a new acetic acid producing yeast species from olive oil.
Péter, Gábor; Dlauchy, Dénes; Tóbiás, Andrea; Fülöp, László; Podgoršek, Martina; Čadež, Neža
2017-05-01
Two yeast strains representing a hitherto undescribed yeast species were isolated from olive oil and spoiled olive oil originating from Spain and Israel, respectively. Both strains are strong acetic acid producers, equipped with considerable tolerance to acetic acid. The cultures are not short-lived. Cellobiose is fermented as well as several other sugars. The sequences of their large subunit (LSU) rRNA gene D1/D2 domain are very divergent from the sequences available in the GenBank. They differ from the closest hit, Brettanomyces naardenensis by about 27%, mainly substitutions. Sequence analyses of the concatenated dataset from genes of the small subunit (SSU) rRNA, LSU rRNA and translation elongation factor-1α (EF-1α) placed the two strains as an early diverging member of the Brettanomyces/Dekkera clade with high bootstrap support. Sexual reproduction was not observed. The name Brettanomyces acidodurans sp. nov. (holotype: NCAIM Y.02178 T ; isotypes: CBS 14519 T = NRRL Y-63865 T = ZIM 2626 T , MycoBank no.: MB 819608) is proposed for this highly divergent new yeast species.
Geological dates and molecular rates: rapid divergence of rivers and their biotas.
Waters, Jonathan M; Rowe, Diane L; Apte, Smita; King, Tania M; Wallis, Graham P; Anderson, Leigh; Norris, Richard J; Craw, Dave; Burridge, Christopher P
2007-04-01
We highlight a novel molecular clock calibration system based on geologically dated river reversal and river capture events. Changes in drainage pattern may effect vicariant isolation of freshwater taxa, and thus provide a predictive framework for associated phylogeographic study. As a case in point, New Zealand's Pelorus and Kaituna rivers became geologically isolated from the larger Wairau River system 70 to 130 kyr BP. We conducted mitochondrial DNA phylogeographic analyses of two unrelated freshwater-limited fish taxa native to these river systems (Gobiomorphus breviceps, n = 63; Galaxias divergens, n = 95). Phylogenetic analysis of combined control region and cytochrome b sequences yielded reciprocally monophyletic clades of Pelorus-Kaituna and Wairau haplotypes for each species. Calibrated rates of molecular change based on this freshwater vicariant event are substantially faster than traditionally accepted rates for fishes but consistent with other recent inferences based on geologically young calibration points. A survey of freshwater phylogeographic literature reveals numerous examples in which the ages of recent evolutionary events may have been substantially overestimated through the use of "accepted" calibrations. We recommend that--wherever possible--biologists should start to reassess the conclusions of such studies by using more appropriate molecular calibrations derived from recent geological events.
Ancient wolf lineages in India.
Sharma, Dinesh K; Maldonado, Jesus E; Jhala, Yadrendradev V; Fleischer, Robert C
2004-01-01
All previously obtained wolf (Canis lupus) and dog (Canis familiaris) mitochondrial (mt) DNA sequences fall within an intertwined and shallow clade (the 'wolf-dog' clade). We sequenced mtDNA of recent and historical samples from 45 wolves from throughout lowland peninsular India and 23 wolves from the Himalayas and Tibetan Plateau and compared these sequences with all available wolf and dog sequences. All 45 lowland Indian wolves have one of four closely related haplotypes that form a well-supported, divergent sister lineage to the wolf-dog clade. This unique lineage may have been independent for more than 400,000 years. Although seven Himalayan wolves from western and central Kashmir fall within the widespread wolf-dog clade, one from Ladakh in eastern Kashmir, nine from Himachal Pradesh, four from Nepal and two from Tibet form a very different basal clade. This lineage contains five related haplotypes that probably diverged from other canids more than 800,000 years ago, but we find no evidence of current barriers to admixture. Thus, the Indian subcontinent has three divergent, ancient and apparently parapatric mtDNA lineages within the morphologically delineated wolf. No haplotypes of either novel lineage are found within a sample of 37 Indian (or other) dogs. Thus, we find no evidence that these two taxa played a part in the domestication of canids. PMID:15101402
Amazonian phylogeography: mtDNA sequence variation in arboreal echimyid rodents (Caviomorpha).
da Silva, M N; Patton, J L
1993-09-01
Patterns of evolutionary relationships among haplotype clades of sequences of the mitochondrial cytochrome b DNA gene are examined for five genera of arboreal rodents of the Caviomorph family Echimyidae from the Amazon Basin. Data are available for 798 bp of sequence from a total of 24 separate localities in Peru, Venezuela, Bolivia, and Brazil for Mesomys, Isothrix, Makalata, Dactylomys, and Echimys. Sequence divergence, corrected for multiple hits, is extensive, ranging from less than 1% for comparisons within populations of over 20% among geographic units within genera. Both the degree of differentiation and the geographic patterning of the variation suggest that more than one species composes the Amazonian distribution of the currently recognized Mesomys hispidus, Isothrix bistriata, Makalata didelphoides, and Dactylomys dactylinus. There is general concordance in the geographic range of haplotype clades for each of these taxa, and the overall level of differentiation within them is largely equivalent. These observations suggest that a common vicariant history underlies the respective diversification of each genus. However, estimated times of divergence based on the rate of third position transversion substitutions for the major clades within each genus typically range above 1 million years. Thus, allopatric isolation precipitating divergence must have been considerably earlier than the late Pleistocene forest fragmentation events commonly invoked for Amazonian biota.
Ancient wolf lineages in India.
Sharma, Dinesh K; Maldonado, Jesus E; Jhala, Yadrendradev V; Fleischer, Robert C
2004-02-07
All previously obtained wolf (Canis lupus) and dog (Canis familiaris) mitochondrial (mt) DNA sequences fall within an intertwined and shallow clade (the 'wolf-dog' clade). We sequenced mtDNA of recent and historical samples from 45 wolves from throughout lowland peninsular India and 23 wolves from the Himalayas and Tibetan Plateau and compared these sequences with all available wolf and dog sequences. All 45 lowland Indian wolves have one of four closely related haplotypes that form a well-supported, divergent sister lineage to the wolf-dog clade. This unique lineage may have been independent for more than 400,000 years. Although seven Himalayan wolves from western and central Kashmir fall within the widespread wolf-dog clade, one from Ladakh in eastern Kashmir, nine from Himachal Pradesh, four from Nepal and two from Tibet form a very different basal clade. This lineage contains five related haplotypes that probably diverged from other canids more than 800,000 years ago, but we find no evidence of current barriers to admixture. Thus, the Indian subcontinent has three divergent, ancient and apparently parapatric mtDNA lineages within the morphologically delineated wolf. No haplotypes of either novel lineage are found within a sample of 37 Indian (or other) dogs. Thus, we find no evidence that these two taxa played a part in the domestication of canids.
NASA Astrophysics Data System (ADS)
Xu, Jiajie; Jiang, Bo; Chai, Sanming; He, Yuan; Zhu, Jianyi; Shen, Zonggen; Shen, Songdong
2016-09-01
Filamentous Bangia, which are distributed extensively throughout the world, have simple and similar morphological characteristics. Scientists can classify these organisms using molecular markers in combination with morphology. We successfully sequenced the complete nuclear ribosomal DNA, approximately 13 kb in length, from a marine Bangia population. We further analyzed the small subunit ribosomal DNA gene (nrSSU) and the internal transcribed spacer (ITS) sequence regions along with nine other marine, and two freshwater Bangia samples from China. Pairwise distances of the nrSSU and 5.8S ribosomal DNA gene sequences show the marine samples grouping together with low divergences (00.003; 0-0.006, respectively) from each other, but high divergences (0.123-0.126; 0.198, respectively) from freshwater samples. An exception is the marine sample collected from Weihai, which shows high divergence from both other marine samples (0.063-0.065; 0.129, respectively) and the freshwater samples (0.097; 0.120, respectively). A maximum likelihood phylogenetic tree based on a combined SSU-ITS dataset with maximum likelihood method shows the samples divided into three clades, with the two marine sample clades containing Bangia spp. from North America, Europe, Asia, and Australia; and one freshwater clade, containing Bangia atropurpurea from North America and China.
Ren, Jindong; Du, Xue; Zeng, Tao; Chen, Li; Shen, Junda; Lu, Lizhi; Hu, Jianhong
2017-10-01
Long noncoding RNAs (lncRNAs) and divergently expressed genes exist widely in different tissues of mammals and birds, in which they are involved in various biological processes. However, there is limited information on their role in the regulation of normal biological processes during differentiation, development, and reproduction in birds. In this study, whole transcriptome strand-specific RNA sequencing of the ovary from young ducks (60days), first-laying ducks (160days), and old ducks, i.e., ducks that stopped laying eggs (490days) was performed. The lncRNAs and mRNAs from these ducks were systematically analyzed and identified by duck genome sequencing in the three study groups. The transcriptome from the duck ovary comprised 15,011 protein-coding genes and 2905 lncRNAs; all the lncRNAs were identified as novel long noncoding transcripts. The comparison of transcriptome data from different study groups identified 2240 divergent transcription genes and 135 divergently expressed lncRNAs, which differed among the groups; most of them were significantly downregulated with age. Among the divergent genes, 38 genes were related to the reproductive process and 6 genes were upregulated. Further prediction analysis revealed that 52 lncRNAs were closely correlated with divergent reproductive mRNAs. More importantly, 6 remarkable lncRNAs were correlated significantly with the conversion of the ovary in different phases. Our results aid in the understanding of the divergent transcriptome of duck ovary in different phases and the underlying mechanisms that drive the specificity of protein-coding genes and lncRNAs in duck ovary. Copyright © 2017. Published by Elsevier B.V.
USDA-ARS?s Scientific Manuscript database
Porcine reproductive and respiratory syndrome virus (PRRSV) is widespread with a high variation in sequence and virulence among the divergent strains and causes an economically destructive disease. A viral ovarian domain protease (vOTU) has been previously identified within the nonstructural protein...
Xu, Jianping; Yan, Zhun; Guo, Hong
2009-06-01
The inheritance of mitochondrial genes and genomes are uniparental in most sexual eukaryotes. This pattern of inheritance makes mitochondrial genomes in natural populations effectively clonal. Here, we examined the mitochondrial population genetics of the emerging human pathogenic fungus Cryptococcus gattii. The DNA sequences for five mitochondrial DNA fragments were obtained from each of 50 isolates belonging to two evolutionary divergent lineages, VGI and VGII. Our analyses revealed a greater sequence diversity within VGI than that within VGII, consistent with observations of the nuclear genes. The combined analyses of all five gene fragments indicated significant divergence between VGI and VGII. However, the five individual genealogies showed different relationships among the isolates, consistent with recent hybridization and mitochondrial gene transfer between the two lineages. Population genetic analyses of the multilocus data identified evidence for predominantly clonal mitochondrial population structures within both lineages. Interestingly, there were clear signatures of recombination among mitochondrial genes within the VGII lineage. Our analyses suggest historical mitochondrial genome divergence within C. gattii, but there is evidence for recent hybridization and recombination in the mitochondrial genome of this important human yeast pathogen.
Ribeiro, Priciane C; Souza, Matheus L; Muller, Larissa A C; Ellis, Vincenzo A; Heuertz, Myriam; Lemos-Filho, José P; Lovato, Maria Bernadete
2016-11-01
The Cerrado is the largest South American savanna and encompasses substantial species diversity and environmental variation. Nevertheless, little is known regarding the influence of the environment on population divergence of Cerrado species. Here, we searched for climatic drivers of genetic (nuclear microsatellites) and leaf trait divergence in Annona crassiflora, a widespread tree in the Cerrado. The sampling encompassed all phytogeographic provinces of the continuous area of the Cerrado and included 397 individuals belonging to 21 populations. Populations showed substantial genetic and leaf trait divergence across the species' range. Our data revealed three spatially defined genetic groups (eastern, western and southern) and two morphologically distinct groups (eastern and western only). The east-west split in both the morphological and genetic data closely mirrors previously described phylogeographic patterns of Cerrado species. Generalized linear mixed effects models and multiple regression analyses revealed several climatic factors associated with both genetic and leaf trait divergence among populations of A. crassiflora. Isolation by environment (IBE) was mainly due to temperature seasonality and precipitation of the warmest quarter. Populations that experienced lower precipitation summers and hotter winters had heavier leaves and lower specific leaf area. The southwestern area of the Cerrado had the highest genetic diversity of A. crassiflora, suggesting that this region may have been climatically stable. Overall, we demonstrate that a combination of current climate and past climatic changes have shaped the population divergence and spatial structure of A. crassiflora. However, the genetic structure of A. crassiflora reflects the biogeographic history of the species more strongly than leaf traits, which are more related to current climate. © 2016 John Wiley & Sons Ltd.
Evaluating, Comparing, and Interpreting Protein Domain Hierarchies
2014-01-01
Abstract Arranging protein domain sequences hierarchically into evolutionarily divergent subgroups is important for investigating evolutionary history, for speeding up web-based similarity searches, for identifying sequence determinants of protein function, and for genome annotation. However, whether or not a particular hierarchy is optimal is often unclear, and independently constructed hierarchies for the same domain can often differ significantly. This article describes methods for statistically evaluating specific aspects of a hierarchy, for probing the criteria underlying its construction and for direct comparisons between hierarchies. Information theoretical notions are used to quantify the contributions of specific hierarchical features to the underlying statistical model. Such features include subhierarchies, sequence subgroups, individual sequences, and subgroup-associated signature patterns. Underlying properties are graphically displayed in plots of each specific feature's contributions, in heat maps of pattern residue conservation, in “contrast alignments,” and through cross-mapping of subgroups between hierarchies. Together, these approaches provide a deeper understanding of protein domain functional divergence, reveal uncertainties caused by inconsistent patterns of sequence conservation, and help resolve conflicts between competing hierarchies. PMID:24559108
Genome Sequences of Akhmeta Virus, an Early Divergent Old World Orthopoxvirus.
Gao, Jinxin; Gigante, Crystal; Khmaladze, Ekaterine; Liu, Pengbo; Tang, Shiyuyun; Wilkins, Kimberly; Zhao, Kun; Davidson, Whitni; Nakazawa, Yoshinori; Maghlakelidze, Giorgi; Geleishvili, Marika; Kokhreidze, Maka; Carroll, Darin S; Emerson, Ginny; Li, Yu
2018-05-12
Annotated whole genome sequences of three isolates of the Akhmeta virus (AKMV), a novel species of orthopoxvirus (OPXV), isolated from the Akhmeta and Vani regions of the country Georgia, are presented and discussed. The AKMV genome is similar in genomic content and structure to that of the cowpox virus (CPXV), but a lower sequence identity was found between AKMV and Old World OPXVs than between other known species of Old World OPXVs. Phylogenetic analysis showed that AKMV diverged prior to other Old World OPXV. AKMV isolates formed a monophyletic clade in the OPXV phylogeny, yet the sequence variability between AKMV isolates was higher than between the monkeypox virus strains in the Congo basin and West Africa. An AKMV isolate from Vani contained approximately six kb sequence in the left terminal region that shared a higher similarity with CPXV than with other AKMV isolates, whereas the rest of the genome was most similar to AKMV, suggesting recombination between AKMV and CPXV in a region containing several host range and virulence genes.
Evolution of nuclear rDNA ITS sequences in the Cladophora albida/sericea clade (Chlorophyta).
Bakker, F T; Olsen, J L; Stam, W T
1995-06-01
Ribosomal DNA ITS sequences were compared among 13 different species and biogeographic isolates from the monophyletic "albida/sericea clade" in the green algal genus Cladophora. Six distinct ITS sequence types were found, characterized by multiple insertions and deletions and high levels of nucleotide substitution. Conserved domains within the ITS regions indicate the presence of ITS secondary structure. Low transition/transversion ratios among the six types and nearly symmetrical tree-length frequency distributions indicate some saturation, and low phylogenetic signal. Although branching order among five of the six ITS sequence types could not be resolved, estimates of ITS sequence divergence as compared with 18S divergence in a subset of the taxa suggests that the origin of the different ITS types is probably in the mid-Miocene (12 Ma ago) but that biogeographic isolates within a single ITS type (including both Pacific and Atlantic representatives) have probably dispersed on a time scale of thousands rather than millions of years.
RECOVIR Software for Identifying Viruses
NASA Technical Reports Server (NTRS)
Chakravarty, Sugoto; Fox, George E.; Zhu, Dianhui
2013-01-01
Most single-stranded RNA (ssRNA) viruses mutate rapidly to generate a large number of strains with highly divergent capsid sequences. Determining the capsid residues or nucleotides that uniquely characterize these strains is critical in understanding the strain diversity of these viruses. RECOVIR (an acronym for "recognize viruses") software predicts the strains of some ssRNA viruses from their limited sequence data. Novel phylogenetic-tree-based databases of protein or nucleic acid residues that uniquely characterize these virus strains are created. Strains of input virus sequences (partial or complete) are predicted through residue-wise comparisons with the databases. RECOVIR uses unique characterizing residues to identify automatically strains of partial or complete capsid sequences of picorna and caliciviruses, two of the most highly diverse ssRNA virus families. Partition-wise comparisons of the database residues with the corresponding residues of more than 300 complete and partial sequences of these viruses resulted in correct strain identification for all of these sequences. This study shows the feasibility of creating databases of hitherto unknown residues uniquely characterizing the capsid sequences of two of the most highly divergent ssRNA virus families. These databases enable automated strain identification from partial or complete capsid sequences of these human and animal pathogens.
Catalano, Sarah R; Whittington, Ian D; Donnellan, Stephen C; Bertozzi, Terry; Gillanders, Bronwyn M
2015-07-01
Dicyemids, poorly known parasites of benthic cephalopods, are one of the few phyla in which mitochondrial (mt) genome architecture departs from the typical ~16 kb circular metazoan genome. In addition to a putative circular genome, a series of mt minicircles that each comprises the mt encoded units (I-III) of the cytochrome c oxidase complex have been reported. Whether the structure of the mt minicircles is a consistent feature among dicyemid species is unknown. Here we analyse the complete cytochrome c oxidase subunit I (COI) minicircle molecule, containing the COI gene and an associated non-coding region (NCR), for ten dicyemid species, allowing for first time comparisons between species of minicircle architecture, NCR function and inferences of minicircle replication. Divergence in COI nucleotide sequences between dicyemid species was high (average net divergence = 31.6%) while within species diversity was lower (average net divergence = 0.2%). The NCR and putative 5' section of the COI gene were highly divergent between dicyemid species (average net nucleotide divergence of putative 5' COI section = 61.1%). No tRNA genes were found in the NCR, although palindrome sequences with the potential to form stem-loop structures were identified in some species, which may play a role in transcription or other biological processes.
CNL Disease Resistance Genes in Soybean and Their Evolutionary Divergence
Nepal, Madhav P; Benson, Benjamin V
2015-01-01
Disease resistance genes (R-genes) encode proteins involved in detecting pathogen attack and activating downstream defense molecules. Recent availability of soybean genome sequences makes it possible to examine the diversity of gene families including disease-resistant genes. The objectives of this study were to identify coiled-coil NBS-LRR (= CNL) R-genes in soybean, infer their evolutionary relationships, and assess structural as well as functional divergence of the R-genes. Profile hidden Markov models were used for sequence identification and model-based maximum likelihood was used for phylogenetic analysis, and variation in chromosomal positioning, gene clustering, and functional divergence were assessed. We identified 188 soybean CNL genes nested into four clades consistent to their orthologs in Arabidopsis. Gene clustering analysis revealed the presence of 41 gene clusters located on 13 different chromosomes. Analyses of the Ks-values and chromosomal positioning suggest duplication events occurring at varying timescales, and an extrapericentromeric positioning may have facilitated their rapid evolution. Each of the four CNL clades exhibited distinct patterns of gene expression. Phylogenetic analysis further supported the extrapericentromeric positioning effect on the divergence and retention of the CNL genes. The results are important for understanding the diversity and divergence of CNL genes in soybean, which would have implication in soybean crop improvement in future. PMID:25922568
CNL Disease Resistance Genes in Soybean and Their Evolutionary Divergence.
Nepal, Madhav P; Benson, Benjamin V
2015-01-01
Disease resistance genes (R-genes) encode proteins involved in detecting pathogen attack and activating downstream defense molecules. Recent availability of soybean genome sequences makes it possible to examine the diversity of gene families including disease-resistant genes. The objectives of this study were to identify coiled-coil NBS-LRR (= CNL) R-genes in soybean, infer their evolutionary relationships, and assess structural as well as functional divergence of the R-genes. Profile hidden Markov models were used for sequence identification and model-based maximum likelihood was used for phylogenetic analysis, and variation in chromosomal positioning, gene clustering, and functional divergence were assessed. We identified 188 soybean CNL genes nested into four clades consistent to their orthologs in Arabidopsis. Gene clustering analysis revealed the presence of 41 gene clusters located on 13 different chromosomes. Analyses of the K s-values and chromosomal positioning suggest duplication events occurring at varying timescales, and an extrapericentromeric positioning may have facilitated their rapid evolution. Each of the four CNL clades exhibited distinct patterns of gene expression. Phylogenetic analysis further supported the extrapericentromeric positioning effect on the divergence and retention of the CNL genes. The results are important for understanding the diversity and divergence of CNL genes in soybean, which would have implication in soybean crop improvement in future.
Evolution of meiotic recombination genes in maize and teosinte.
Sidhu, Gaganpreet K; Warzecha, Tomasz; Pawlowski, Wojciech P
2017-01-25
Meiotic recombination is a major source of genetic variation in eukaryotes. The role of recombination in evolution is recognized but little is known about how evolutionary forces affect the recombination pathway itself. Although the recombination pathway is fundamentally conserved across different species, genetic variation in recombination components and outcomes has been observed. Theoretical predictions and empirical studies suggest that changes in the recombination pathway are likely to provide adaptive abilities to populations experiencing directional or strong selection pressures, such as those occurring during species domestication. We hypothesized that adaptive changes in recombination may be associated with adaptive evolution patterns of genes involved in meiotic recombination. To examine how maize evolution and domestication affected meiotic recombination genes, we studied patterns of sequence polymorphism and divergence in eleven genes controlling key steps in the meiotic recombination pathway in a diverse set of maize inbred lines and several accessions of teosinte, the wild ancestor of maize. We discovered that, even though the recombination genes generally exhibited high sequence conservation expected in a pathway controlling a key cellular process, they showed substantial levels and diverse patterns of sequence polymorphism. Among others, we found differences in sequence polymorphism patterns between tropical and temperate maize germplasms. Several recombination genes displayed patterns of polymorphism indicative of adaptive evolution. Despite their ancient origin and overall sequence conservation, meiotic recombination genes can exhibit extensive and complex patterns of molecular evolution. Changes in these genes could affect the functioning of the recombination pathway, and may have contributed to the successful domestication of maize and its expansion to new cultivation areas.
The Genome of Naegleria gruberi Illuminates Early Eukaryotic Versatility
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fritz-Laylin, Lillian K.; Prochnik, Simon E.; Ginger, Michael L.
2010-03-01
Genome sequences of diverse free-living protists are essential for understanding eukaryotic evolution and molecular and cell biology. The free-living amoeboflagellate Naegleria gruberi belongs to a varied and ubiquitous protist clade (Heterolobosea) that diverged from other eukaryotic lineages over a billion years ago. Analysis of the 15,727 protein-coding genes encoded by Naegleria's 41 Mb nuclear genome indicates a capacity for both aerobic respiration and anaerobic metabolism with concomitant hydrogen production, with fundamental implications for the evolution of organelle metabolism. The Naegleria genome facilitates substantially broader phylogenomic comparisons of free-living eukaryotes than previously possible, allowing us to identify thousands of genes likelymore » present in the pan-eukaryotic ancestor, with 40% likely eukaryotic inventions. Moreover, we construct a comprehensive catalog of amoeboid-motility genes. The Naegleria genome, analyzed in the context of other protists, reveals a remarkably complex ancestral eukaryote with a rich repertoire of cytoskeletal, sexual, signaling, and metabolic modules.« less
Functionally conserved enhancers with divergent sequences in distant vertebrates
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yang, Song; Oksenberg, Nir; Takayama, Sachiko
To examine the contributions of sequence and function conservation in the evolution of enhancers, we systematically identified enhancers whose sequences are not conserved among distant groups of vertebrate species, but have homologous function and are likely to be derived from a common ancestral sequence. In conclusion, our approach combined comparative genomics and epigenomics to identify potential enhancer sequences in the genomes of three groups of distantly related vertebrate species.
Functionally conserved enhancers with divergent sequences in distant vertebrates
Yang, Song; Oksenberg, Nir; Takayama, Sachiko; ...
2015-10-30
To examine the contributions of sequence and function conservation in the evolution of enhancers, we systematically identified enhancers whose sequences are not conserved among distant groups of vertebrate species, but have homologous function and are likely to be derived from a common ancestral sequence. In conclusion, our approach combined comparative genomics and epigenomics to identify potential enhancer sequences in the genomes of three groups of distantly related vertebrate species.
Noorani, Ayesha; Lynch, Andy G.; Achilleos, Achilleas; Eldridge, Matthew; Bower, Lawrence; Weaver, Jamie M.J.; Crawte, Jason; Ong, Chin-Ann; Shannon, Nicholas; MacRae, Shona; Grehan, Nicola; Nutzinger, Barbara; O'Donovan, Maria; Hardwick, Richard; Tavaré, Simon; Fitzgerald, Rebecca C.
2017-01-01
The scientific community has avoided using tissue samples from patients that have been exposed to systemic chemotherapy to infer the genomic landscape of a given cancer. Esophageal adenocarcinoma is a heterogeneous, chemoresistant tumor for which the availability and size of pretreatment endoscopic samples are limiting. This study compares whole-genome sequencing data obtained from chemo-naive and chemo-treated samples. The quality of whole-genomic sequencing data is comparable across all samples regardless of chemotherapy status. Inclusion of samples collected post-chemotherapy increased the proportion of late-stage tumors. When comparing matched pre- and post-chemotherapy samples from 10 cases, the mutational signatures, copy number, and SNV mutational profiles reflect the expected heterogeneity in this disease. Analysis of SNVs in relation to allele-specific copy-number changes pinpoints the common ancestor to a point prior to chemotherapy. For cases in which pre- and post-chemotherapy samples do show substantial differences, the timing of the divergence is near-synchronous with endoreduplication. Comparison across a large prospective cohort (62 treatment-naive, 58 chemotherapy-treated samples) reveals no significant differences in the overall mutation rate, mutation signatures, specific recurrent point mutations, or copy-number events in respect to chemotherapy status. In conclusion, whole-genome sequencing of samples obtained following neoadjuvant chemotherapy is representative of the genomic landscape of esophageal adenocarcinoma. Excluding these samples reduces the material available for cataloging and introduces a bias toward the earlier stages of cancer. PMID:28465312
Accelerated Evolution of the ASPM Gene Controlling Brain Size Begins Prior to Human Brain Expansion
Solomon, Gregory; Gersch, William; Yoon, Young-Ho; Collura, Randall; Ruvolo, Maryellen; Barrett, J. Carl; Woods, C. Geoffrey; Walsh, Christopher A
2004-01-01
Primary microcephaly (MCPH) is a neurodevelopmental disorder characterized by global reduction in cerebral cortical volume. The microcephalic brain has a volume comparable to that of early hominids, raising the possibility that some MCPH genes may have been evolutionary targets in the expansion of the cerebral cortex in mammals and especially primates. Mutations in ASPM, which encodes the human homologue of a fly protein essential for spindle function, are the most common known cause of MCPH. Here we have isolated large genomic clones containing the complete ASPM gene, including promoter regions and introns, from chimpanzee, gorilla, orangutan, and rhesus macaque by transformation-associated recombination cloning in yeast. We have sequenced these clones and show that whereas much of the sequence of ASPM is substantially conserved among primates, specific segments are subject to high Ka/Ks ratios (nonsynonymous/synonymous DNA changes) consistent with strong positive selection for evolutionary change. The ASPM gene sequence shows accelerated evolution in the African hominoid clade, and this precedes hominid brain expansion by several million years. Gorilla and human lineages show particularly accelerated evolution in the IQ domain of ASPM. Moreover, ASPM regions under positive selection in primates are also the most highly diverged regions between primates and nonprimate mammals. We report the first direct application of TAR cloning technology to the study of human evolution. Our data suggest that evolutionary selection of specific segments of the ASPM sequence strongly relates to differences in cerebral cortical size. PMID:15045028
Horn, Susanne; Durka, Walter; Wolf, Ronny; Ermala, Aslak; Stubbe, Annegret; Stubbe, Michael; Hofreiter, Michael
2011-01-01
Background Beavers are one of the largest and ecologically most distinct rodent species. Little is known about their evolution and even their closest phylogenetic relatives have not yet been identified with certainty. Similarly, little is known about the timing of divergence events within the genus Castor. Methodology/Principal Findings We sequenced complete mitochondrial genomes from both extant beaver species and used these sequences to place beavers in the phylogenetic tree of rodents and date their divergence from other rodents as well as the divergence events within the genus Castor. Our analyses support the phylogenetic position of beavers as a sister lineage to the scaly tailed squirrel Anomalurus within the mouse related clade. Molecular dating places the divergence time of the lineages leading to beavers and Anomalurus as early as around 54 million years ago (mya). The living beaver species, Castor canadensis from North America and Castor fiber from Eurasia, although similar in appearance, appear to have diverged from a common ancestor more than seven mya. This result is consistent with the hypothesis that a migration of Castor from Eurasia to North America as early as 7.5 mya could have initiated their speciation. We date the common ancestor of the extant Eurasian beaver relict populations to around 210,000 years ago, much earlier than previously thought. Finally, the substitution rate of Castor mitochondrial DNA is considerably lower than that of other rodents. We found evidence that this is correlated with the longer life span of beavers compared to other rodents. Conclusions/Significance A phylogenetic analysis of mitochondrial genome sequences suggests a sister-group relationship between Castor and Anomalurus, and allows molecular dating of species divergence in congruence with paleontological data. The implementation of a relaxed molecular clock enabled us to estimate mitochondrial substitution rates and to evaluate the effect of life history traits on it. PMID:21307956
Centromere location in Arabidopsis is unaltered by extreme divergence in CENH3 protein sequence.
Maheshwari, Shamoni; Ishii, Takayoshi; Brown, C Titus; Houben, Andreas; Comai, Luca
2017-03-01
During cell division, spindle fibers attach to chromosomes at centromeres. The DNA sequence at regional centromeres is fast evolving with no conserved genetic signature for centromere identity. Instead CENH3, a centromere-specific histone H3 variant, is the epigenetic signature that specifies centromere location across both plant and animal kingdoms. Paradoxically, CENH3 is also adaptively evolving. An ongoing question is whether CENH3 evolution is driven by a functional relationship with the underlying DNA sequence. Here, we demonstrate that despite extensive protein sequence divergence, CENH3 histones from distant species assemble centromeres on the same underlying DNA sequence. We first characterized the organization and diversity of centromere repeats in wild-type Arabidopsis thaliana We show that A. thaliana CENH3-containing nucleosomes exhibit a strong preference for a unique subset of centromeric repeats. These sequences are largely missing from the genome assemblies and represent the youngest and most homogeneous class of repeats. Next, we tested the evolutionary specificity of this interaction in a background in which the native A. thaliana CENH3 is replaced with CENH3s from distant species. Strikingly, we find that CENH3 from Lepidium oleraceum and Zea mays , although specifying epigenetically weaker centromeres that result in genome elimination upon outcrossing, show a binding pattern on A. thaliana centromere repeats that is indistinguishable from the native CENH3. Our results demonstrate positional stability of a highly diverged CENH3 on independently evolved repeats, suggesting that the sequence specificity of centromeres is determined by a mechanism independent of CENH3. © 2017 Maheshwari et al.; Published by Cold Spring Harbor Laboratory Press.
Genotype imputation in a coalescent model with infinitely-many-sites mutation
Huang, Lucy; Buzbas, Erkan O.; Rosenberg, Noah A.
2012-01-01
Empirical studies have identified population-genetic factors as important determinants of the properties of genotype-imputation accuracy in imputation-based disease association studies. Here, we develop a simple coalescent model of three sequences that we use to explore the theoretical basis for the influence of these factors on genotype-imputation accuracy, under the assumption of infinitely-many-sites mutation. Employing a demographic model in which two populations diverged at a given time in the past, we derive the approximate expectation and variance of imputation accuracy in a study sequence sampled from one of the two populations, choosing between two reference sequences, one sampled from the same population as the study sequence and the other sampled from the other population. We show that under this model, imputation accuracy—as measured by the proportion of polymorphic sites that are imputed correctly in the study sequence—increases in expectation with the mutation rate, the proportion of the markers in a chromosomal region that are genotyped, and the time to divergence between the study and reference populations. Each of these effects derives largely from an increase in information available for determining the reference sequence that is genetically most similar to the sequence targeted for imputation. We analyze as a function of divergence time the expected gain in imputation accuracy in the target using a reference sequence from the same population as the target rather than from the other population. Together with a growing body of empirical investigations of genotype imputation in diverse human populations, our modeling framework lays a foundation for extending imputation techniques to novel populations that have not yet been extensively examined. PMID:23079542
Genetic and phylogenetic divergence of feline immunodeficiency virus in the puma (Puma concolor).
Carpenter, M A; Brown, E W; Culver, M; Johnson, W E; Pecon-Slattery, J; Brousset, D; O'Brien, S J
1996-01-01
Feline immunodeficiency virus (FIV) is a lentivirus which causes an AIDS-like disease in domestic cats (Felis catus). A number of other felid species, including the puma (Puma concolor), carry a virus closely related to domestic cat FIV. Serological testing revealed the presence of antibodies to FIV in 22% of 434 samples from throughout the geographic range of the puma. FIV-Pco pol gene sequences isolated from pumas revealed extensive sequence diversity, greater than has been documented in the domestic cat. The puma sequences formed two highly divergent groups, analogous to the clades which have been defined for domestic cat and lion (Panthera leo) FIV. The puma clade A was made up of samples from Florida and California, whereas clade B consisted of samples from other parts of North America, Central America, and Brazil. The difference between these two groups was as great as that reported among three lion FIV clades. Within puma clades, sequence variation is large, comparable to between-clade differences seen for domestic cat clades, allowing recognition of 15 phylogenetic lineages (subclades) among puma FIV-Pco. Large sequence divergence among isolates, nearly complete species monophyly, and widespread geographic distribution suggest that FIV-Pco has evolved within the puma species for a long period. The sequence data provided evidence for vertical transmission of FIV-Pco from mothers to their kittens, for coinfection of individuals by two different viral strains, and for cross-species transmission of FIV from a domestic cat to a puma. These factors may all be important for understanding the epidemiology and natural history of FIV in the puma. PMID:8794304
USDA-ARS?s Scientific Manuscript database
High-throughput sequencing of reduced representation genomic libraries has ushered in an era of genotyping-by-sequencing (GBS), where genome-wide genotype data can be obtained for nearly any species. However, there remains a need for imputation-free GBS methods for genotyping large samples taken fr...
Complete genome sequence of a divergent strain of Japanese yam mosaic virus from China
USDA-ARS?s Scientific Manuscript database
A novel strain of Japanese yam mosaic virus (JYMV-CN) was identified in a yam plant with foliar mottle symptoms in China. The complete genomic sequence of JYMV-CN was determined. Its genomic sequence of 9701 nucleotides encodes a polyprotein of 3247 amino acids. Its organization was virtually identi...
DNA barcoding for molecular identification of Demodex based on mitochondrial genes.
Hu, Li; Yang, YuanJun; Zhao, YaE; Niu, DongLing; Yang, Rui; Wang, RuiLing; Lu, Zhaohui; Li, XiaoQi
2017-12-01
There has been no widely accepted DNA barcode for species identification of Demodex. In this study, we attempted to solve this issue. First, mitochondrial cox1-5' and 12S gene fragments of Demodex folloculorum, D. brevis, D. canis, and D. caprae were amplified, cloned, and sequenced for the first time; intra/interspecific divergences were computed and phylogenetic trees were reconstructed. Then, divergence frequency distribution plots of those two gene fragments were drawn together with mtDNA cox1-middle region and 16S obtained in previous studies. Finally, their identification efficiency was evaluated by comparing barcoding gap. Results indicated that 12S had the higher identification efficiency. Specifically, for cox1-5' region of the four Demodex species, intraspecific divergences were less than 2.0%, and interspecific divergences were 21.1-31.0%; for 12S, intraspecific divergences were less than 1.4%, and interspecific divergences were 20.8-26.9%. The phylogenetic trees demonstrated that the four Demodex species clustered separately, and divergence frequency distribution plot showed that the largest intraspecific divergence of 12S (1.4%) was less than cox1-5' region (2.0%), cox1-middle region (3.1%), and 16S (2.8%). The barcoding gap of 12S was 19.4%, larger than cox1-5' region (19.1%), cox1-middle region (11.3%), and 16S (13.0%); the interspecific divergence span of 12S was 6.2%, smaller than cox1-5' region (10.0%), cox1-middle region (14.1%), and 16S (11.4%). Moreover, 12S has a moderate length (517 bp) for sequencing at once. Therefore, we proposed mtDNA 12S was more suitable than cox1 and 16S to be a DNA barcode for classification and identification of Demodex at lower category level.
Keskin, Emre; Atar, Hasan Huseyin
2012-04-01
Mitochondrial DNA sequence variation in 655 bpfragments of the cytochrome oxidase c subunit I gene, known as the DNA barcode, of European anchovy (Engraulis encrasicolus) was evaluated by analyzing 1529 individuals representing 16 populations from the Black Sea, through the Marmara Sea and the Aegean Sea to the Mediterranean Sea. A total of 19 (2.9%) variable sites were found among individuals, and these defined 10 genetically diverged populations with an overall mean distance of 1.2%. The highest nucleotide divergence was found between samples of eastern Mediterranean and northern Aegean (2.2%). Evolutionary history analysis among 16 populations clustered the Mediterranean Sea clades in one main branch and the other clades in another branch. Diverging pattern of the European anchovy populations correlated with geographic dispersion supports the genetic structuring through the Black Sea-Marmara Sea-Aegean Sea-Mediterranean Sea quad.
Seeing chordate evolution through the Ciona genome sequence
Cañestro, Cristian; Bassham, Susan; Postlethwait, John H
2003-01-01
A draft sequence of the compact genome of the sea squirt Ciona intestinalis, a non-vertebrate chordate that diverged very early from other chordates, including vertebrates, illuminates how chordates originated and how vertebrate developmental innovations evolved. PMID:12620098
Akın, Ciğdem; Bilgin, C Can; Beerli, Peter; Westaway, Rob; Ohst, Torsten; Litvinchuk, Spartak N; Uzzell, Thomas; Bilgin, Metin; Hotz, Hansjürg; Guex, Gaston-Denis; Plötner, Jörg
2010-11-01
AIM: Our aims were to assess the phylogeographic patterns of genetic diversity in eastern Mediterranean water frogs and to estimate divergence times using different geological scenarios. We related divergence times to past geological events and discuss the relevance of our data for the systematics of eastern Mediterranean water frogs. LOCATION: The eastern Mediterranean region. METHODS: Genetic diversity and divergence were calculated using sequences of two protein-coding mitochondrial (mt) genes: ND2 (1038 bp, 119 sequences) and ND3 (340 bp, 612 sequences). Divergence times were estimated in a Bayesian framework under four geological scenarios representing alternative possible geological histories for the eastern Mediterranean. We then compared the different scenarios using Bayes factors and additional geological data. RESULTS: Extensive genetic diversity in mtDNA divides eastern Mediterranean water frogs into six main haplogroups (MHG). Three MHGs were identified on the Anatolian mainland; the most widespread MHG with the highest diversity is distributed from western Anatolia to the northern shore of the Caspian Sea, including the type locality of Pelophylax ridibundus. The other two Anatolian MHGs are restricted to south-eastern Turkey, occupying localities west and east of the Amanos mountain range. One of the remaining three MHGs is restricted to Cyprus; a second to the Levant; the third was found in the distribution area of European lake frogs (P. ridibundus group), including the Balkans. MAIN CONCLUSIONS: Based on geological evidence and estimates of genetic divergence we hypothesize that the water frogs of Cyprus have been isolated from the Anatolian mainland populations since the end of the Messinian salinity crisis (MSC), i.e. since c. 5.5-5.3 Ma, while our divergence time estimates indicate that the isolation of Crete from the mainland populations (Peloponnese, Anatolia) most likely pre-dates the MSC. The observed rates of divergence imply a time window of c. 1.6-1.1 million years for diversification of the largest Anatolian MHG; divergence between the two other Anatolian MHGs may have begun about 3.0 Ma, apparently as a result of uplift of the Amanos Mountains. Our mtDNA data suggest that the Anatolian water frogs and frogs from Cyprus represent several undescribed species.
Poortvliet, Marloes; Olsen, Jeanine L; Croll, Donald A; Bernardi, Giacomo; Newton, Kelly; Kollias, Spyros; O'Sullivan, John; Fernando, Daniel; Stevens, Guy; Galván Magaña, Felipe; Seret, Bernard; Wintner, Sabine; Hoarau, Galice
2015-02-01
Manta and devil rays are an iconic group of globally distributed pelagic filter feeders, yet their evolutionary history remains enigmatic. We employed next generation sequencing of mitogenomes for nine of the 11 recognized species and two outgroups; as well as additional Sanger sequencing of two mitochondrial and two nuclear genes in an extended taxon sampling set. Analysis of the mitogenome coding regions in a Maximum Likelihood and Bayesian framework provided a well-resolved phylogeny. The deepest divergences distinguished three clades with high support, one containing Manta birostris, Manta alfredi, Mobula tarapacana, Mobula japanica and Mobula mobular; one containing Mobula kuhlii, Mobula eregoodootenkee and Mobula thurstoni; and one containing Mobula munkiana, Mobula hypostoma and Mobula rochebrunei. Mobula remains paraphyletic with the inclusion of Manta, a result that is in agreement with previous studies based on molecular and morphological data. A fossil-calibrated Bayesian random local clock analysis suggests that mobulids diverged from Rhinoptera around 30 Mya. Subsequent divergences are characterized by long internodes followed by short bursts of speciation extending from an initial episode of divergence in the Early and Middle Miocene (19-17 Mya) to a second episode during the Pliocene and Pleistocene (3.6 Mya - recent). Estimates of divergence dates overlap significantly with periods of global warming, during which upwelling intensity - and related high primary productivity in upwelling regions - decreased markedly. These periods are hypothesized to have led to fragmentation and isolation of feeding regions leading to possible regional extinctions, as well as the promotion of allopatric speciation. The closely shared evolutionary history of mobulids in combination with ongoing threats from fisheries and climate change effects on upwelling and food supply, reinforces the case for greater protection of this charismatic family of pelagic filter feeders. Copyright © 2014 Elsevier Inc. All rights reserved.
Hellberg, M E; Moy, G W; Vacquier, V D
2000-03-01
Male-specific proteins have increasingly been reported as targets of positive selection and are of special interest because of the role they may play in the evolution of reproductive isolation. We report the rapid interspecific divergence of cDNA encoding a major acrosomal protein of unknown function (TMAP) of sperm from five species of teguline gastropods. A mitochondrial DNA clock (calibrated by congeneric species divided by the Isthmus of Panama) estimates that these five species diverged 2-10 MYA. Inferred amino acid sequences reveal a propeptide that has diverged rapidly between species. The mature protein has diverged faster still due to high nonsynonymous substitution rates (> 25 nonsynonymous substitutions per site per 10(9) years). cDNA encoding the mature protein (89-100 residues) shows evidence of positive selection (Dn/Ds > 1) for 4 of 10 pairwise species comparisons. cDNA and predicted secondary-structure comparisons suggest that TMAP is neither orthologous nor paralogous to abalone lysin, and thus marks a second, phylogenetically independent, protein subject to strong positive selection in free-spawning marine gastropods. In addition, an internal repeat in one species (Tegula aureotincta) produces a duplicated cleavage site which results in two alternatively processed mature proteins differing by nine amino acid residues. Such alternative processing may provide a mechanism for introducing novel amino acid sequence variation at the amino-termini of proteins. Highly divergent TMAP N-termini from two other tegulines (Tegula regina and Norrisia norrisii) may have originated by such a mechanism.
Troggio, Michela; Surbanovski, Nada; Bianco, Luca; Moretto, Marco; Giongo, Lara; Banchi, Elisa; Viola, Roberto; Fernández, Felicdad Fernández; Costa, Fabrizio; Velasco, Riccardo; Cestaro, Alessandro; Sargent, Daniel James
2013-01-01
High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.
Zhang, Huarong; Miller, Mark P.; Yang, Feng; Chan, Hon Ki; Gaubert, Philippe; Ades, Gary; Fischer, Gunter A
2015-01-01
Despite being protected by both international and national regulations, pangolins are threatened by illegal trade. Here we report mitochondrial DNA identification and haplotype richness estimation, using 239 pangolin scale samples from two confiscations in Hong Kong. We found a total of 13 genetically distinct cytochrome c oxidase I (COI) haplotypes in two confiscations (13 and ten haplotypes respectively, with ten shared haplotypes between confiscations). These haplotypes clustered in two distinct clades with one clade representing the Sunda pangolin (Manisjavanica). The other clade did not match with any known Asian pangolin sequences, and likely represented a cryptic pangolin lineage in Asia. By fitting sample coverage and rarefaction/regression models to our sample data, we predicted that the total number of COI haplotypes in two confiscations were 14.86 and 11.06 respectively, suggesting that our sampling caught the majority of haplotypes and that we had adequately characterized each confiscation. We detected substantial sequence divergence among the seized scales, likely evidencing that the Sunda pangolins were harvested over wide geographical areas across Southeast Asia. Our study illustrates the value of applying DNA forensics for illegal wildlife trade monitoring.
Sturhan, Dieter; Shutova, Tatyana S; Akimov, Vladimir N; Subbotin, Sergei A
2005-01-01
Parasitic bacteria of the genus Pasteuria are reported for three Anaplectus and four identified and several unidentified Plectus species found in eight countries in various habitats. The pasteurias from plectids agree in essential morphological characters of sporangia and endospores as well as in developmental cycle with those of the Pasteuria species and strains described from tylenchid nematodes, but appear to be mainly distinguished from these by absence of a distinct perisporium in the spores and the endospores obviously not being cup- or saucer-shaped. The wide range of measurements and morphological peculiarities of sporangia and endospores suggest that probably several Pasteuria species have to be distinguished as parasites in Plectidae. From an infected juvenile of an unidentified plectid species the 16S rRNA gene sequence of Pasteuria sp. was obtained. Substantial sequence divergence from described Pasteuria species and its phylogenetic position on molecular trees indicate that this Pasteuria sp. could be considered as a new species. Preliminary results of the analysis of DNA phylogeny of Pasteuria spp. and their nematode hosts provide evidence for incongruence of their phylogenetic history and of host switching events during evolution of the bacterial parasites.
rpoB-Based Identification of Nonpigmented and Late-Pigmenting Rapidly Growing Mycobacteria
Adékambi, Toïdi; Colson, Philippe; Drancourt, Michel
2003-01-01
Nonpigmented and late-pigmenting rapidly growing mycobacteria (RGM) are increasingly isolated in clinical microbiology laboratories. Their accurate identification remains problematic because classification is labor intensive work and because new taxa are not often incorporated into classification databases. Also, 16S rRNA gene sequence analysis underestimates RGM diversity and does not distinguish between all taxa. We determined the complete nucleotide sequence of the rpoB gene, which encodes the bacterial β subunit of the RNA polymerase, for 20 RGM type strains. After using in-house software which analyzes and graphically represents variability stretches of 60 bp along the nucleotide sequence, our analysis focused on a 723-bp variable region exhibiting 83.9 to 97% interspecies similarity and 0 to 1.7% intraspecific divergence. Primer pair Myco-F-Myco-R was designed as a tool for both PCR amplification and sequencing of this region for molecular identification of RGM. This tool was used for identification of 63 RGM clinical isolates previously identified at the species level on the basis of phenotypic characteristics and by 16S rRNA gene sequence analysis. Of 63 clinical isolates, 59 (94%) exhibited <2% partial rpoB gene sequence divergence from 1 of 20 species under study and were regarded as correctly identified at the species level. Mycobacterium abscessus and Mycobacterium mucogenicum isolates were clearly distinguished from Mycobacterium chelonae; Mycobacterium mageritense isolates were clearly distinguished from “Mycobacterium houstonense.” Four isolates were not identified at the species level because they exhibited >3% partial rpoB gene sequence divergence from the corresponding type strain; they belonged to three taxa related to M. mucogenicum, Mycobacterium smegmatis, and Mycobacterium porcinum. For M. abscessus and M. mucogenicum, this partial sequence yielded a high genetic heterogeneity within the clinical isolates. We conclude that molecular identification by analysis of the 723-bp rpoB sequence is a rapid and accurate tool for identification of RGM. PMID:14662964
Webb, Kristen M; Rosenthal, Benjamin M
2011-01-01
The mitochondrial genome's non-recombinant mode of inheritance and relatively rapid rate of evolution has promoted its use as a marker for studying the biogeographic history and evolutionary interrelationships among many metazoan species. A modest portion of the mitochondrial genome has been defined for 12 species and genotypes of parasites in the genus Trichinella, but its adequacy in representing the mitochondrial genome as a whole remains unclear, as the complete coding sequence has been characterized only for Trichinella spiralis. Here, we sought to comprehensively describe the extent and nature of divergence between the mitochondrial genomes of T. spiralis (which poses the most appreciable zoonotic risk owing to its capacity to establish persistent infections in domestic pigs) and Trichinella murrelli (which is the most prevalent species in North American wildlife hosts, but which poses relatively little risk to the safety of pork). Next generation sequencing methodologies and scaffold and de novo assembly strategies were employed. The entire protein-coding region was sequenced (13,917 bp), along with a portion of the highly repetitive non-coding region (1524 bp) of the mitochondrial genome of T. murrelli with a combined average read depth of 250 reads. The accuracy of base calling, estimated from coding region sequence was found to exceed 99.3%. Genome content and gene order was not found to be significantly different from that of T. spiralis. An overall inter-species sequence divergence of 9.5% was estimated. Significant variation was identified when the amount of variation between species at each gene is compared to the average amount of variation between species across the coding region. Next generation sequencing is a highly effective means to obtain previously unknown mitochondrial genome sequence. Particular to parasites, the extremely deep coverage achieved through this method allows for the detection of sequence heterogeneity between the multiple individuals that necessarily comprise such templates. Copyright © 2010 Elsevier B.V. All rights reserved.
Extraordinary Sequence Divergence at Tsga8, an X-linked Gene Involved in Mouse Spermiogenesis
Good, Jeffrey M.; Vanderpool, Dan; Smith, Kimberly L.; Nachman, Michael W.
2011-01-01
The X chromosome plays an important role in both adaptive evolution and speciation. We used a molecular evolutionary screen of X-linked genes potentially involved in reproductive isolation in mice to identify putative targets of recurrent positive selection. We then sequenced five very rapidly evolving genes within and between several closely related species of mice in the genus Mus. All five genes were involved in male reproduction and four of the genes showed evidence of recurrent positive selection. The most remarkable evolutionary patterns were found at Testis-specific gene a8 (Tsga8), a spermatogenesis-specific gene expressed during postmeiotic chromatin condensation and nuclear transformation. Tsga8 was characterized by extremely high levels of insertion–deletion variation of an alanine-rich repetitive motif in natural populations of Mus domesticus and M. musculus, differing in length from the reference mouse genome by up to 89 amino acids (27% of the total protein length). This population-level variation was coupled with striking divergence in protein sequence and length between closely related mouse species. Although no clear orthologs had previously been described for Tsga8 in other mammalian species, we have identified a highly divergent hypothetical gene on the rat X chromosome that shares clear orthology with the 5′ and 3′ ends of Tsga8. Further inspection of this ortholog verified that it is expressed in rat testis and shares remarkable similarity with mouse Tsga8 across several general features of the protein sequence despite no conservation of nucleotide sequence across over 60% of the rat-coding domain. Overall, Tsga8 appears to be one of the most rapidly evolving genes to have been described in rodents. We discuss the potential evolutionary causes and functional implications of this extraordinary divergence and the possible contribution of Tsga8 and the other four genes we examined to reproductive isolation in mice. PMID:21186189
Three Divergent Subpopulations of the Malaria Parasite Plasmodium knowlesi
Lin, Lee C.; Rovie-Ryan, Jeffrine J.; Kadir, Khamisah A.; Anderios, Fread; Hisam, Shamilah; Sharma, Reuben S.K.; Singh, Balbir; Conway, David J.
2017-01-01
Multilocus microsatellite genotyping of Plasmodium knowlesi isolates previously indicated 2 divergent parasite subpopulations in humans on the island of Borneo, each associated with a different macaque reservoir host species. Geographic divergence was also apparent, and independent sequence data have indicated particularly deep divergence between parasites from mainland Southeast Asia and Borneo. To resolve the overall population structure, multilocus microsatellite genotyping was conducted on a new sample of 182 P. knowlesi infections (obtained from 134 humans and 48 wild macaques) from diverse areas of Malaysia, first analyzed separately and then in combination with previous data. All analyses confirmed 2 divergent clusters of human cases in Malaysian Borneo, associated with long-tailed macaques and pig-tailed macaques, and a third cluster in humans and most macaques in peninsular Malaysia. High levels of pairwise divergence between each of these sympatric and allopatric subpopulations have implications for the epidemiology and control of this zoonotic species. PMID:28322705
Gender-based screening for chlamydial infection and divergent infection trends in men and women.
Rogers, Susan M; Turner, Charles F; Miller, William C; Erbelding, Emily; Eggleston, Elizabeth; Tan, Sylvia; Roman, Anthony; Hobbs, Marcia; Chromy, James; Muvva, Ravikiran; Ganapathi, Laxminarayana
2014-01-01
To assess the potential impact of chlamydial screening policy that recommends routine screening of women but not men. Population surveys of probability samples of Baltimore adults aged 18 to 35 years in 1997-1998 and 2006-2009 collected biospecimens to estimate trends in undiagnosed chlamydial infection. Survey estimates are compared to surveillance data on diagnosed chlamydial infections reported to the Health Department. Prevalence of undiagnosed chlamydial infection among men increased from 1.6% to 4.0%, but it declined from 4.3% to 3.1% among women (p = 0.028 for test of interaction). The annual (average) number of diagnosed infections was substantially higher among women than men in both time periods and increased among both men and women. Undiagnosed infection prevalence was substantially higher among black than non-black adults (4.0% vs 1.2%, p = 0.042 in 1997-98 and 5.5% vs 0.7%, p<0.001 in 2006-09). Divergent trends in undiagnosed chlamydial infection by gender parallel divergent screening recommendations that encourage chlamydial testing for women but not for men.
USDA-ARS?s Scientific Manuscript database
The Noctuid moth, Spodoptera frugiperda (the fall armyworm), is endemic to the Western Hemisphere and appears to be undergoing sympatric speciation to produce two subpopulations that differ in their choice of host plants. The diverging “rice strain” and “corn strain” are morphologically indistinguis...
Dynamics of actin evolution in dinoflagellates.
Kim, Sunju; Bachvaroff, Tsvetan R; Handy, Sara M; Delwiche, Charles F
2011-04-01
Dinoflagellates have unique nuclei and intriguing genome characteristics with very high DNA content making complete genome sequencing difficult. In dinoflagellates, many genes are found in multicopy gene families, but the processes involved in the establishment and maintenance of these gene families are poorly understood. Understanding the dynamics of gene family evolution in dinoflagellates requires comparisons at different evolutionary scales. Studies of closely related species provide fine-scale information relative to species divergence, whereas comparisons of more distantly related species provides broad context. We selected the actin gene family as a highly expressed conserved gene previously studied in dinoflagellates. Of the 142 sequences determined in this study, 103 were from the two closely related species, Dinophysis acuminata and D. caudata, including full length and partial cDNA sequences as well as partial genomic amplicons. For these two Dinophysis species, at least three types of sequences could be identified. Most copies (79%) were relatively similar and in nucleotide trees, the sequences formed two bushy clades corresponding to the two species. In comparisons within species, only eight to ten nucleotide differences were found between these copies. The two remaining types formed clades containing sequences from both species. One type included the most similar sequences in between-species comparisons with as few as 12 nucleotide differences between species. The second type included the most divergent sequences in comparisons between and within species with up to 93 nucleotide differences between sequences. In all the sequences, most variation occurred in synonymous sites or the 5' UnTranslated Region (UTR), although there was still limited amino acid variation between most sequences. Several potential pseudogenes were found (approximately 10% of all sequences depending on species) with incomplete open reading frames due to frameshifts or early stop codons. Overall, variation in the actin gene family fits best with the "birth and death" model of evolution based on recent duplications, pseudogenes, and incomplete lineage sorting. Divergence between species was similar to variation within species, so that actin may be too conserved to be useful for phylogenetic estimation of closely related species.
Mott, I; Ivarie, R
2002-06-01
Decades of selective breeding have yielded lines of poultry with substantial myofiber hyperplasia, vet little is known about what genes have been altered during the course of selection. Myostatin is a strong negative regulator of muscle mass in mice and cattle and could have been one of many genetic factors contributing to increased myofiber deposition in growth-selected lines of poultry. To test this hypothesis, the sequence and expression patterns of myostatin were analyzed in growth-selected lines of chickens and quail. The sequence of broiler myostatin cDNA, amplified via reverse transcription (RT)-PCR from embryonic muscle RNA, contained no missense mutations in the coding sequence when compared to that of White Leghorn layers, although two silent single nucleotide polymorphisms (SNP) were found. Northern analysis of myostatin transcripts from embryonic pectoralis and quadriceps showed no significant differences in expression levels between broiler and layer muscle RNA. However, levels of myostatin transcripts were greatly reduced in muscles of posthatch chicks compared to embryonic muscle. Myostatin protein was also present in broiler and layer embryonic muscle at similar levels. No significant polymorphisms or differences in RNA expression levels were found in embryonic muscles of divergently selected lines of Japanese quail. These results indicate that intense artificial selection in these growth-selected lines of poultry has neither silenced the expression of myostatin nor created null alleles via mutation in the lines analyzed.
Borsa, Philippe; Arlyza, Irma S; Chen, Wei-Jen; Durand, Jean-Dominique; Meekan, Mark G; Shen, Kang-Ning
2013-04-01
The maskray from New Caledonia, Neotrygon trigonoides Castelnau, 1873, has been recently synonymized with the blue-spotted maskray, N. kuhlii (Müller and Henle, 1841), a species with wide Indo-West Pacific distribution, but the reasons for this are unclear. Blue-spotted maskray specimens were collected from the Indian Ocean (Tanzania, Sumatra) and the Coral Triangle (Indonesia, Taiwan, and West Papua), and N. trigonoides specimens were collected from New Caledonia (Coral-Sea). Their partial COI gene sequences were generated to expand the available DNA-barcode database on this species, which currently comprises homologous sequences from Ningaloo Reef, the Coral Triangle and the Great Barrier Reef (Coral-Sea). Spotting patterns were also compared across regions. Haplotypes from the Coral-Sea formed a haplogroup phylogenetically distinct from all other haplotypes sampled in the Indo-West Pacific. No clear-cut geographic composition relative to DNA-barcodes or spotting patterns was apparent in N. kuhlii samples across the Indian Ocean and the Coral Triangle. The New Caledonian maskray had spotting patterns markedly different from all the other samples. This, added to a substantial level of net nucleotide divergence (2.6%) with typical N. kuhlii justifies considering the New Caledonian maskray as a separate species, for which we propose to resurrect the name Neotrygon trigonoides. Copyright © 2013. Published by Elsevier SAS.
Zhu, Tianqi; Dos Reis, Mario; Yang, Ziheng
2015-03-01
Genetic sequence data provide information about the distances between species or branch lengths in a phylogeny, but not about the absolute divergence times or the evolutionary rates directly. Bayesian methods for dating species divergences estimate times and rates by assigning priors on them. In particular, the prior on times (node ages on the phylogeny) incorporates information in the fossil record to calibrate the molecular tree. Because times and rates are confounded, our posterior time estimates will not approach point values even if an infinite amount of sequence data are used in the analysis. In a previous study we developed a finite-sites theory to characterize the uncertainty in Bayesian divergence time estimation in analysis of large but finite sequence data sets under a strict molecular clock. As most modern clock dating analyses use more than one locus and are conducted under relaxed clock models, here we extend the theory to the case of relaxed clock analysis of data from multiple loci (site partitions). Uncertainty in posterior time estimates is partitioned into three sources: Sampling errors in the estimates of branch lengths in the tree for each locus due to limited sequence length, variation of substitution rates among lineages and among loci, and uncertainty in fossil calibrations. Using a simple but analogous estimation problem involving the multivariate normal distribution, we predict that as the number of loci ([Formula: see text]) goes to infinity, the variance in posterior time estimates decreases and approaches the infinite-data limit at the rate of 1/[Formula: see text], and the limit is independent of the number of sites in the sequence alignment. We then confirmed the predictions by using computer simulation on phylogenies of two or three species, and by analyzing a real genomic data set for six primate species. Our results suggest that with the fossil calibrations fixed, analyzing multiple loci or site partitions is the most effective way for improving the precision of posterior time estimation. However, even if a huge amount of sequence data is analyzed, considerable uncertainty will persist in time estimates. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society of Systematic Biologists.
2012-01-01
Background Adaptive divergence driven by environmental heterogeneity has long been a fascinating topic in ecology and evolutionary biology. The study of the genetic basis of adaptive divergence has, however, been greatly hampered by a lack of genomic information. The recent development of transcriptome sequencing provides an unprecedented opportunity to generate large amounts of genomic data for detailed investigations of the genetics of adaptive divergence in non-model organisms. Herein, we used the Illumina sequencing platform to sequence the transcriptome of brain and liver tissues from a single individual of the Vinous-throated Parrotbill, Paradoxornis webbianus bulomachus, an ecologically important avian species in Taiwan with a wide elevational range of sea level to 3100 m. Results Our 10.1 Gbp of sequences were first assembled based on Zebra Finch (Taeniopygia guttata) and chicken (Gallus gallus) RNA references. The remaining reads were then de novo assembled. After filtering out contigs with low coverage (<10X), we retained 67,791 of 487,336 contigs, which covered approximately 5.3% of the P. w. bulomachus genome. Of 7,779 contigs retained for a top-hit species distribution analysis, the majority (about 86%) were matched to known Zebra Finch and chicken transcripts. We also annotated 6,365 contigs to gene ontology (GO) terms: in total, 122 GO-slim terms were assigned, including biological process (41%), molecular function (32%), and cellular component (27%). Many potential genetic markers for future adaptive genomic studies were also identified: 8,589 single nucleotide polymorphisms, 1,344 simple sequence repeats and 109 candidate genes that might be involved in elevational or climate adaptation. Conclusions Our study shows that transcriptome data can serve as a rich genetic resource, even for a single run of short-read sequencing from a single individual of a non-model species. This is the first study providing transcriptomic information for species in the avian superfamily Sylvioidea, which comprises more than 1,000 species. Our data can be used to study adaptive divergence in heterogeneous environments and investigate other important ecological and evolutionary questions in parrotbills from different populations and even in other species in the Sylvioidea. PMID:22530590
J.B. Whittall; J. Syring; M. Parks; J. Buenrostro; C. Dick; A. Liston; R. Cronn
2010-01-01
Critical to conservation efforts and other investigations at low taxonomic levels, DNA sequence data offer important insights into the distinctiveness, biogeographic partitioning, and evolutionary histories of species. The resolving power of DNA sequences is often limited by insufficient variability at the intraspecific level. This is particularly true of studies...
Genome Sequence of the Yeast Clavispora lusitaniae Type Strain CBS 6936.
Durrens, Pascal; Klopp, Christophe; Biteau, Nicolas; Fitton-Ouhabi, Valérie; Dementhon, Karine; Accoceberry, Isabelle; Sherman, David J; Noël, Thierry
2017-08-03
Clavispora lusitaniae , an environmental saprophytic yeast belonging to the CTG clade of Candida , can behave occasionally as an opportunistic pathogen in humans. We report here the genome sequence of the type strain CBS 6936. Comparison with sequences of strain ATCC 42720 indicates conservation of chromosomal structure but significant nucleotide divergence. Copyright © 2017 Durrens et al.
Genome Sequence of the Yeast Clavispora lusitaniae Type Strain CBS 6936
Klopp, Christophe; Biteau, Nicolas; Fitton-Ouhabi, Valérie; Dementhon, Karine; Accoceberry, Isabelle; Sherman, David J.; Noël, Thierry
2017-01-01
ABSTRACT Clavispora lusitaniae, an environmental saprophytic yeast belonging to the CTG clade of Candida, can behave occasionally as an opportunistic pathogen in humans. We report here the genome sequence of the type strain CBS 6936. Comparison with sequences of strain ATCC 42720 indicates conservation of chromosomal structure but significant nucleotide divergence. PMID:28774979
Penny, D; Hasegawa, M; Waddell, P J; Hendy, M D
1999-03-01
We explore the tree of mammalian mtDNA sequences, using particularly the LogDet transform on amino acid sequences, the distance Hadamard transform, and the Closest Tree selection criterion. The amino acid composition of different species show significant differences, even within mammals. After compensating for these differences, nearest-neighbor bootstrap results suggest that the tree is locally stable, though a few groups show slightly greater rearrangements when a large proportion of the constant sites are removed. Many parts of the trees we obtain agree with those on published protein ML trees. Interesting results include a preference for rodent monophyly. The detection of a few alternative signals to those on the optimal tree were obtained using the distance Hadamard transform (with results expressed as a Lento plot). One rearrangement suggested was the interchange of the position of primates and rodents on the optimal tree. The basic stability of the tree, combined with two calibration points (whale/cow and horse/rhinoceros), together with a distant secondary calibration from the mammal/bird divergence, allows inferences of the times of divergence of putative clades. Allowing for sampling variances due to finite sequence length, most major divergences amongst lineages leading to modern orders, appear to occur well before the Cretaceous/Tertiary (K/T) boundary. Implications arising from these early divergences are discussed, particularly the possibility of competition between the small dinosaurs and the new mammal clades.
Biological function in the twilight zone of sequence conservation.
Ponting, Chris P
2017-08-16
Strong DNA conservation among divergent species is an indicator of enduring functionality. With weaker sequence conservation we enter a vast 'twilight zone' in which sequence subject to transient or lower constraint cannot be distinguished easily from neutrally evolving, non-functional sequence. Twilight zone functional sequence is illuminated instead by principles of selective constraint and positive selection using genomic data acquired from within a species' population. Application of these principles reveals that despite being biochemically active, most twilight zone sequence is not functional.
A highly divergent Puumala virus lineage in southern Poland.
Rosenfeld, Ulrike M; Drewes, Stephan; Ali, Hanan Sheikh; Sadowska, Edyta T; Mikowska, Magdalena; Heckel, Gerald; Koteja, Paweł; Ulrich, Rainer G
2017-05-01
Puumala virus (PUUV) represents one of the most important hantaviruses in Central Europe. Phylogenetic analyses of PUUV strains indicate a strong genetic structuring of this hantavirus. Recently, PUUV sequences were identified in the natural reservoir, the bank vole (Myodes glareolus), collected in the northern part of Poland. The objective of this study was to evaluate the presence of PUUV in bank voles from southern Poland. A total of 72 bank voles were trapped in 2009 at six sites in this part of Poland. RT-PCR and IgG-ELISA analyses detected three PUUV positive voles at one trapping site. The PUUV-infected animals were identified by cytochrome b gene analysis to belong to the Carpathian and Eastern evolutionary lineages of bank vole. The novel PUUV S, M and L segment nucleotide sequences showed the closest similarity to sequences of the Russian PUUV lineage from Latvia, but were highly divergent to those previously found in northern Poland, Slovakia and Austria. In conclusion, the detection of a highly divergent PUUV lineage in southern Poland indicates the necessity of further bank vole monitoring in this region allowing rational public health measures to prevent human infections.
Testing the molecular clock using mechanistic models of fossil preservation and molecular evolution
2017-01-01
Molecular sequence data provide information about relative times only, and fossil-based age constraints are the ultimate source of information about absolute times in molecular clock dating analyses. Thus, fossil calibrations are critical to molecular clock dating, but competing methods are difficult to evaluate empirically because the true evolutionary time scale is never known. Here, we combine mechanistic models of fossil preservation and sequence evolution in simulations to evaluate different approaches to constructing fossil calibrations and their impact on Bayesian molecular clock dating, and the relative impact of fossil versus molecular sampling. We show that divergence time estimation is impacted by the model of fossil preservation, sampling intensity and tree shape. The addition of sequence data may improve molecular clock estimates, but accuracy and precision is dominated by the quality of the fossil calibrations. Posterior means and medians are poor representatives of true divergence times; posterior intervals provide a much more accurate estimate of divergence times, though they may be wide and often do not have high coverage probability. Our results highlight the importance of increased fossil sampling and improved statistical approaches to generating calibrations, which should incorporate the non-uniform nature of ecological and temporal fossil species distributions. PMID:28637852
Erickson, Harold P.
2009-01-01
Summary The eukaryotic cytoskeleton appears to have evolved from ancestral precursors related to prokaryotic FtsZ and MreB. FtsZ and MreB show 40−50% sequence identity across different bacterial and archaeal species. Here I suggest that this represents the limit of divergence that is consistent with maintaining their functions for cytokinesis and cell shape. Previous analyses have noted that tubulin and actin are highly conserved across eukaryotic species, but so divergent from their prokaryotic relatives as to be hardly recognizable from sequence comparisons. One suggestion for this extreme divergence of tubulin and actin is that it occurred as they evolved very different functions from FtsZ and MreB. I will present new arguments favoring this suggestion, and speculate on pathways. Moreover, the extreme conservation of tubulin and actin across eukaryotic species is not due to an intrinsic lack of variability, but is attributed to their acquisition of elaborate mechanisms for assembly dynamics and their interactions with multiple motor and binding proteins. A new structure-based sequence alignment identifies amino acids that are conserved from FtsZ to tubulins. The highly conserved amino acids are not those forming the subunit core or protofilament interface, but those involved in binding and hydrolysis of GTP. PMID:17563102
Mitochondrial genomes reveal the extinct Hippidion as an outgroup to all living equids.
Der Sarkissian, Clio; Vilstrup, Julia T; Schubert, Mikkel; Seguin-Orlando, Andaine; Eme, David; Weinstock, Jacobo; Alberdi, Maria Teresa; Martin, Fabiana; Lopez, Patricio M; Prado, Jose L; Prieto, Alfredo; Douady, Christophe J; Stafford, Tom W; Willerslev, Eske; Orlando, Ludovic
2015-03-01
Hippidions were equids with very distinctive anatomical features. They lived in South America 2.5 million years ago (Ma) until their extinction approximately 10 000 years ago. The evolutionary origin of the three known Hippidion morphospecies is still disputed. Based on palaeontological data, Hippidion could have diverged from the lineage leading to modern equids before 10 Ma. In contrast, a much later divergence date, with Hippidion nesting within modern equids, was indicated by partial ancient mitochondrial DNA sequences. Here, we characterized eight Hippidion complete mitochondrial genomes at 3.4-386.3-fold coverage using target-enrichment capture and next-generation sequencing. Our dataset reveals that the two morphospecies sequenced (H. saldiasi and H. principale) formed a monophyletic clade, basal to extant and extinct Equus lineages. This contrasts with previous genetic analyses and supports Hippidion as a distinct genus, in agreement with palaeontological models. We date the Hippidion split from Equus at 5.6-6.5 Ma, suggesting an early divergence in North America prior to the colonization of South America, after the formation of the Panamanian Isthmus 3.5 Ma and the Great American Biotic Interchange. © 2015 The Author(s) Published by the Royal Society. All rights reserved.
Mitochondrial genomes reveal the extinct Hippidion as an outgroup to all living equids
Der Sarkissian, Clio; Vilstrup, Julia T.; Schubert, Mikkel; Seguin-Orlando, Andaine; Eme, David; Weinstock, Jacobo; Alberdi, Maria Teresa; Martin, Fabiana; Lopez, Patricio M.; Prado, Jose L.; Prieto, Alfredo; Douady, Christophe J.; Stafford, Tom W.; Willerslev, Eske; Orlando, Ludovic
2015-01-01
Hippidions were equids with very distinctive anatomical features. They lived in South America 2.5 million years ago (Ma) until their extinction approximately 10 000 years ago. The evolutionary origin of the three known Hippidion morphospecies is still disputed. Based on palaeontological data, Hippidion could have diverged from the lineage leading to modern equids before 10 Ma. In contrast, a much later divergence date, with Hippidion nesting within modern equids, was indicated by partial ancient mitochondrial DNA sequences. Here, we characterized eight Hippidion complete mitochondrial genomes at 3.4–386.3-fold coverage using target-enrichment capture and next-generation sequencing. Our dataset reveals that the two morphospecies sequenced (H. saldiasi and H. principale) formed a monophyletic clade, basal to extant and extinct Equus lineages. This contrasts with previous genetic analyses and supports Hippidion as a distinct genus, in agreement with palaeontological models. We date the Hippidion split from Equus at 5.6–6.5 Ma, suggesting an early divergence in North America prior to the colonization of South America, after the formation of the Panamanian Isthmus 3.5 Ma and the Great American Biotic Interchange. PMID:25762573
Smith, M. Alex; Fisher, Brian L; Hebert, Paul D.N
2005-01-01
The role of DNA barcoding as a tool to accelerate the inventory and analysis of diversity for hyperdiverse arthropods is tested using ants in Madagascar. We demonstrate how DNA barcoding helps address the failure of current inventory methods to rapidly respond to pressing biodiversity needs, specifically in the assessment of richness and turnover across landscapes with hyperdiverse taxa. In a comparison of inventories at four localities in northern Madagascar, patterns of richness were not significantly different when richness was determined using morphological taxonomy (morphospecies) or sequence divergence thresholds (Molecular Operational Taxonomic Unit(s); MOTU). However, sequence-based methods tended to yield greater richness and significantly lower indices of similarity than morphological taxonomy. MOTU determined using our molecular technique were a remarkably local phenomenon—indicative of highly restricted dispersal and/or long-term isolation. In cases where molecular and morphological methods differed in their assignment of individuals to categories, the morphological estimate was always more conservative than the molecular estimate. In those cases where morphospecies descriptions collapsed distinct molecular groups, sequence divergences of 16% (on average) were contained within the same morphospecies. Such high divergences highlight taxa for further detailed genetic, morphological, life history, and behavioral studies. PMID:16214741
A DNA Barcode Library for North American Pyraustinae (Lepidoptera: Pyraloidea: Crambidae).
Yang, Zhaofu; Landry, Jean-François; Hebert, Paul D N
2016-01-01
Although members of the crambid subfamily Pyraustinae are frequently important crop pests, their identification is often difficult because many species lack conspicuous diagnostic morphological characters. DNA barcoding employs sequence diversity in a short standardized gene region to facilitate specimen identifications and species discovery. This study provides a DNA barcode reference library for North American pyraustines based upon the analysis of 1589 sequences recovered from 137 nominal species, 87% of the fauna. Data from 125 species were barcode compliant (>500bp, <1% n), and 99 of these taxa formed a distinct cluster that was assigned to a single BIN. The other 26 species were assigned to 56 BINs, reflecting frequent cases of deep intraspecific sequence divergence and a few instances of barcode sharing, creating a total of 155 BINs. Two systems for OTU designation, ABGD and BIN, were examined to check the correspondence between current taxonomy and sequence clusters. The BIN system performed better than ABGD in delimiting closely related species, while OTU counts with ABGD were influenced by the value employed for relative gap width. Different species with low or no interspecific divergence may represent cases of unrecognized synonymy, whereas those with high intraspecific divergence require further taxonomic scrutiny as they may involve cryptic diversity. The barcode library developed in this study will also help to advance understanding of relationships among species of Pyraustinae.
Teng, Huajing; Zhang, Yaohua; Shi, Chengmin; Mao, Fengbiao; Cai, Wanshi; Lu, Liang; Zhao, Fangqing; Sun, Zhongsheng; Zhang, Jianxu
2017-09-01
Murine rodents are excellent models for study of adaptive radiations and speciation. Brown Norway rats (Rattus norvegicus) are successful global colonizers and the contributions of their domesticated laboratory strains to biomedical research are well established. To identify nucleotide-based speciation timing of the rat and genomic information contributing to its colonization capabilities, we analyzed 51 whole-genome sequences of wild-derived Brown Norway rats and their sibling species, R. nitidus, and identified over 20 million genetic variants in the wild Brown Norway rats that were absent in the laboratory strains, which substantially expand the reservoir of rat genetic diversity. We showed that divergence of the rat and its siblings coincided with drastic climatic changes that occurred during the Middle Pleistocene. Further, we revealed that there was a geographically widespread influx of genes between Brown Norway rats and the sibling species following the divergence, resulting in numerous introgressed regions in the genomes of admixed Brown Norway rats. Intriguing, genes related to chemical communications among these introgressed regions appeared to contribute to the population-specific adaptations of the admixed Brown Norway rats. Our data reveals evolutionary history of the Brown Norway rat, and offers new insights into the role of climatic changes in speciation of animals and the effect of interspecies introgression on animal adaptation. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Genetics of alternative splicing evolution during sunflower domestication.
Smith, Chris C R; Tittes, Silas; Mendieta, J Paul; Collier-Zans, Erin; Rowe, Heather C; Rieseberg, Loren H; Kane, Nolan C
2018-06-11
Alternative splicing enables organisms to produce the diversity of proteins necessary for multicellular life by using relatively few protein-coding genes. Although differences in splicing have been identified among divergent taxa, the shorter-term evolution of splicing is understudied. The origins of novel splice forms, and the contributions of alternative splicing to major evolutionary transitions, are largely unknown. This study used transcriptomes of wild and domesticated sunflowers to examine splice differentiation and regulation during domestication. We identified substantial splicing divergence between wild and domesticated sunflowers, mainly in the form of intron retention. Transcripts with divergent splicing were enriched for seed-development functions, suggesting that artificial selection impacted splicing patterns. Mapping of quantitative trait loci (QTLs) associated with 144 differential splicing cases revealed primarily trans -acting variation affecting splicing patterns. A large proportion of identified QTLs contain known spliceosome proteins and are associated with splicing variation in multiple genes. Examining a broader set of wild and domesticated sunflower genotypes revealed that most differential splicing patterns in domesticated sunflowers likely arose from standing variation in wild Helianthus annuus and gained frequency during the domestication process. However, several domesticate-associated splicing patterns appear to be introgressed from other Helianthus species. These results suggest that sunflower domestication involved selection on pleiotropic regulatory alleles. More generally, our findings indicate that substantial differences in isoform abundances arose rapidly during a recent evolutionary transition and appear to contribute to adaptation and population divergence.
Dennenmoser, Stefan; Vamosi, Steven M; Nolte, Arne W; Rogers, Sean M
2017-01-01
Understanding the genomic basis of adaptive divergence in the presence of gene flow remains a major challenge in evolutionary biology. In prickly sculpin (Cottus asper), an abundant euryhaline fish in northwestern North America, high genetic connectivity among brackish-water (estuarine) and freshwater (tributary) habitats of coastal rivers does not preclude the build-up of neutral genetic differentiation and emergence of different life history strategies. Because these two habitats present different osmotic niches, we predicted high genetic differentiation at known teleost candidate genes underlying salinity tolerance and osmoregulation. We applied whole-genome sequencing of pooled DNA samples (Pool-Seq) to explore adaptive divergence between two estuarine and two tributary habitats. Paired-end sequence reads were mapped against genomic contigs of European Cottus, and the gene content of candidate regions was explored based on comparisons with the threespine stickleback genome. Genes showing signals of repeated differentiation among brackish-water and freshwater habitats included functions such as ion transport and structural permeability in freshwater gills, which suggests that local adaptation to different osmotic niches might contribute to genomic divergence among habitats. Overall, the presence of both repeated and unique signatures of differentiation across many loci scattered throughout the genome is consistent with polygenic adaptation from standing genetic variation and locally variable selection pressures in the early stages of life history divergence. © 2016 John Wiley & Sons Ltd.
Kibenge, Molly J T; Iwamoto, Tokinori; Wang, Yingwei; Morton, Alexandra; Godoy, Marcos G; Kibenge, Frederick S B
2013-07-11
Piscine reovirus (PRV) is a newly discovered fish reovirus of anadromous and marine fish ubiquitous among fish in Norwegian salmon farms, and likely the causative agent of heart and skeletal muscle inflammation (HSMI). HSMI is an increasingly economically significant disease in Atlantic salmon (Salmo salar) farms. The nucleotide sequence data available for PRV are limited, and there is no genetic information on this virus outside of Norway and none from wild fish. RT-PCR amplification and sequencing were used to obtain the complete viral genome of PRV (10 segments) from western Canada and Chile. The genetic diversity among the PRV strains and their relationship to Norwegian PRV isolates were determined by phylogenetic analyses and sequence identity comparisons. PRV is distantly related to members of the genera Orthoreovirus and Aquareovirus and an unambiguous new genus within the family Reoviridae. The Canadian and Norwegian PRV strains are most divergent in the segment S1 and S4 encoded proteins. Phylogenetic analysis of PRV S1 sequences, for which the largest number of complete sequences from different "isolates" is available, grouped Norwegian PRV strains into a single genotype, Genotype I, with sub-genotypes, Ia and Ib. The Canadian PRV strains matched sub-genotype Ia and Chilean PRV strains matched sub-genotype Ib. PRV should be considered as a member of a new genus within the family Reoviridae with two major Norwegian sub-genotypes. The Canadian PRV diverged from Norwegian sub-genotype Ia around 2007 ± 1, whereas the Chilean PRV diverged from Norwegian sub-genotype Ib around 2008 ± 1.
2013-01-01
Background Piscine reovirus (PRV) is a newly discovered fish reovirus of anadromous and marine fish ubiquitous among fish in Norwegian salmon farms, and likely the causative agent of heart and skeletal muscle inflammation (HSMI). HSMI is an increasingly economically significant disease in Atlantic salmon (Salmo salar) farms. The nucleotide sequence data available for PRV are limited, and there is no genetic information on this virus outside of Norway and none from wild fish. Methods RT-PCR amplification and sequencing were used to obtain the complete viral genome of PRV (10 segments) from western Canada and Chile. The genetic diversity among the PRV strains and their relationship to Norwegian PRV isolates were determined by phylogenetic analyses and sequence identity comparisons. Results PRV is distantly related to members of the genera Orthoreovirus and Aquareovirus and an unambiguous new genus within the family Reoviridae. The Canadian and Norwegian PRV strains are most divergent in the segment S1 and S4 encoded proteins. Phylogenetic analysis of PRV S1 sequences, for which the largest number of complete sequences from different “isolates” is available, grouped Norwegian PRV strains into a single genotype, Genotype I, with sub-genotypes, Ia and Ib. The Canadian PRV strains matched sub-genotype Ia and Chilean PRV strains matched sub-genotype Ib. Conclusions PRV should be considered as a member of a new genus within the family Reoviridae with two major Norwegian sub-genotypes. The Canadian PRV diverged from Norwegian sub-genotype Ia around 2007 ± 1, whereas the Chilean PRV diverged from Norwegian sub-genotype Ib around 2008 ± 1. PMID:23844948
Sikorav, J L; Duval, N; Anselmet, A; Bon, S; Krejci, E; Legay, C; Osterlund, M; Reimund, B; Massoulié, J
1988-01-01
In this paper, we show the existence of alternative splicing in the 3' region of the coding sequence of Torpedo acetylcholinesterase (AChE). We describe two cDNA structures which both diverge from the previously described coding sequence of the catalytic subunit of asymmetric (A) forms (Schumacher et al., 1986; Sikorav et al., 1987). They both contain a coding sequence followed by a non-coding sequence and a poly(A) stretch. Both of these structures were shown to exist in poly(A)+ RNAs, by S1 mapping experiments. The divergent region encoded by the first sequence corresponds to the precursor of the globular dimeric form (G2a), since it contains the expected C-terminal amino acids, Ala-Cys. These amino acids are followed by a 29 amino acid extension which contains a hydrophobic segment and must be replaced by a glycolipid in the mature protein. Analyses of intact G2a AChE showed that the common domain of the protein contains intersubunit disulphide bonds. The divergent region of the second type of cDNA consists of an adjacent genomic sequence, which is removed as an intron in A and Ga mRNAs, but may encode a distinct, less abundant catalytic subunit. The structures of the cDNA clones indicate that they are derived from minor mRNAs, shorter than the three major transcripts which have been described previously (14.5, 10.5 and 5.5 kb). Oligonucleotide probes specific for the asymmetric and globular terminal regions hybridize with the three major transcripts, indicating that their size is determined by 3'-untranslated regions which are not related to the differential splicing leading to A and Ga forms. Images PMID:3181125
Jiang, Yuan; Yang, Zhongqi; Wang, Xiaoyi; Hou, Yuxia
2015-01-01
The species belonging to Sclerodermus (Hymenoptera: Bethylidae) are currently the most important insect natural enemies of wood borer pests, mainly buprestid and cerambycid beetles, in China. However, some sibling species of this genus are very difficult to distinguish because of their similar morphological features. To address this issue, we conducted phylogenetic and genetic analyses of cytochrome oxidase subunit I (COI) and 28S RNA gene sequences from eight species of Sclerodermus reared from different wood borer pests. The eight sibling species were as follows: S. guani Xiao et Wu, S. sichuanensis Xiao, S. pupariae Yang et Yao, and Sclerodermus spp. (Nos. 1–5). A 594-bp fragment of COI and 750-bp fragment of 28S were subsequently sequenced. For COI, the G-C content was found to be low in all the species, averaging to about 30.0%. Sequence divergences (Kimura-2-parameter distances) between congeneric species averaged to 4.5%, and intraspecific divergences averaged to about 0.09%. Further, the maximum sequence divergences between congeneric species and Sclerodermus sp. (No. 5) averaged to about 16.5%. All 136 samples analyzed were included in six reciprocally monophyletic clades in the COI neighbor-joining (NJ) tree. The NJ tree inferred from the 28S rRNA sequence yielded almost identical results, but the samples from S. guani, S. sichuanensis, S. pupariae, and Sclerodermus spp. (Nos. 1–4) clustered together and only Sclerodermus sp. (No. 5) clustered separately. Our findings indicate that the standard barcode region of COI can be efficiently used to distinguish morphologically similar Sclerodermus species. Further, we speculate that Sclerodermus sp. (No. 5) might be a new species of Sclerodermus. PMID:25782000
Identification of a divergent genotype of equine arteritis virus from South American donkeys.
Rivas, J; Neira, V; Mena, J; Brito, B; Garcia, A; Gutierrez, C; Sandoval, D; Ortega, R
2017-12-01
A novel equine arteritis virus (EAV) was isolated and sequenced from feral donkeys in Chile. Phylogenetic analysis indicates that the new virus and South African asinine strains diverged at least 100 years from equine EAV strains. The results indicate that asinine strains belonged to a different EAV genotype. © 2017 Blackwell Verlag GmbH.
Evolution of the arginase fold and functional diversity
Dowling, Daniel P.; Costanzo, Luigi Di; Gennadios, Heather A.; Christianson, David W.
2009-01-01
The large number of protein structures deposited in the Protein Data Bank allows for the identification of novel structural superfamilies based on conservation of fold in addition to conservation of amino acid sequence. Since sequence diverges more rapidly than fold in protein evolution, proteins with little or no significant sequence identity are occasionally observed to adopt similar folds, thereby reflecting unanticipated evolutionary relationships. Here, we review the unique α/β fold first observed in the manganese metalloenzyme rat liver arginase, consisting of a parallel 8 stranded β-sheet surrounded by several helices, and its evolutionary relationship with the zinc-requiring and/or iron-requiring histone deacetylases and acetylpolyamine amidohydrolases. Structural comparisons reveal key features of the core α/β fold that contribute to the divergent metal ion specificity and stoichiometry required for the chemical and biological functions of these enzymes. PMID:18360740
Hass-Jacobus, Barbara L; Futrell-Griggs, Montona; Abernathy, Brian; Westerman, Rick; Goicoechea, Jose-Luis; Stein, Joshua; Klein, Patricia; Hurwitz, Bonnie; Zhou, Bin; Rakhshan, Fariborz; Sanyal, Abhijit; Gill, Navdeep; Lin, Jer-Young; Walling, Jason G; Luo, Mei Zhong; Ammiraju, Jetty Siva S; Kudrna, Dave; Kim, Hye Ran; Ware, Doreen; Wing, Rod A; Miguel, Phillip San; Jackson, Scott A
2006-01-01
Background With the completion of the genome sequence for rice (Oryza sativa L.), the focus of rice genomics research has shifted to the comparison of the rice genome with genomes of other species for gene cloning, breeding, and evolutionary studies. The genus Oryza includes 23 species that shared a common ancestor 8–10 million years ago making this an ideal model for investigations into the processes underlying domestication, as many of the Oryza species are still undergoing domestication. This study integrates high-throughput, hybridization-based markers with BAC end sequence and fingerprint data to construct physical maps of rice chromosome 1 orthologues in two wild Oryza species. Similar studies were undertaken in Sorghum bicolor, a species which diverged from cultivated rice 40–50 million years ago. Results Overgo markers, in conjunction with fingerprint and BAC end sequence data, were used to build sequence-ready BAC contigs for two wild Oryza species. The markers drove contig merges to construct physical maps syntenic to rice chromosome 1 in the wild species and provided evidence for at least one rearrangement on chromosome 1 of the O. sativa versus Oryza officinalis comparative map. When rice overgos were aligned to available S. bicolor sequence, 29% of the overgos aligned with three or fewer mismatches; of these, 41% gave positive hybridization signals. Overgo hybridization patterns supported colinearity of loci in regions of sorghum chromosome 3 and rice chromosome 1 and suggested that a possible genomic inversion occurred in this syntenic region in one of the two genomes after the divergence of S. bicolor and O. sativa. Conclusion The results of this study emphasize the importance of identifying conserved sequences in the reference sequence when designing overgo probes in order for those probes to hybridize successfully in distantly related species. As interspecific markers, overgos can be used successfully to construct physical maps in species which diverged less than 8 million years ago, and can be used in a more limited fashion to examine colinearity among species which diverged as much as 40 million years ago. Additionally, overgos are able to provide evidence of genomic rearrangements in comparative physical mapping studies. PMID:16895597
Detection of a divergent variant of grapevine virus F by next-generation sequencing.
Molenaar, Nicholas; Burger, Johan T; Maree, Hans J
2015-08-01
The complete genome sequence of a South African isolate of grapevine virus F (GVF) is presented. It was first detected by metagenomic next-generation sequencing of field samples and validated through direct Sanger sequencing. The genome sequence of GVF isolate V5 consists of 7539 nucleotides and contains a poly(A) tail. It has a typical vitivirus genome arrangement that comprises five open reading frames (ORFs), which share only 88.96 % nucleotide sequence identity with the existing complete GVF genome sequence (JX105428).
[A study on identification of edible bird's nests by DNA barcodes].
Chen, Yue-Juan; Liu, Wen-Jian; Chen, Dan-Na; Chieng, Sing-Hock; Jiang, Lin
2017-12-01
To provide theoretical basis for the traceability and quality evaluation of edible bird's nests (EBNs), the Cytb sequence was applied to identify the origin of EBNs. A total of 39 experiment samples were collected from Malaysia, Indonesia, Vietnam and Thailand. Genomic DNA was extracted for the PCR reaction. The amplified products were sequenced. 36 sequences were downloaded from Gen Bank including edible nest swiftlet, black nest swiftlet, mascarene swiftlet, pacific swiftlet and germain's swiftlet. MEGA 7.0 was used to analyze the distinction of sequences by the method of calculating the distances in intraspecific and interspecific divergences and constructing NJ and UPMGA phylogenetic tree based on Kimera-2-parameter model. The results showed that 39 samples were from three kinds of EBNs. Interspecific divergences were significantly greater than the intraspecific one. Samples could be successfully distinguished by NJ and UPMGA phylogenetic tree. In conclusion, Cytb sequence could be used to distinguish the origin of EBNs and it is efficient for tracing the origin species of EBNs. Copyright© by the Chinese Pharmaceutical Association.
Comparing and combining distance-based and character-based approaches for barcoding turtles.
Reid, B N; LE, M; McCord, W P; Iverson, J B; Georges, A; Bergmann, T; Amato, G; Desalle, R; Naro-Maciel, E
2011-11-01
Molecular barcoding can serve as a powerful tool in wildlife forensics and may prove to be a vital aid in conserving organisms that are threatened by illegal wildlife trade, such as turtles (Order Testudines). We produced cytochrome oxidase subunit one (COI) sequences (650 bp) for 174 turtle species and combined these with publicly available sequences for 50 species to produce a data set representative of the breadth of the order. Variability within the barcode region was assessed, and the utility of both distance-based and character-based methods for species identification was evaluated. For species in which genetic material from more than one individual was available (n = 69), intraspecific divergences were 1.3% on average, although divergences greater than the customary 2% barcode threshold occurred within 15 species. High intraspecific divergences could indicate species with a high degree of internal genetic structure or possibly even cryptic species, although introgression is also probable in some of these taxa. Divergences between species of the same genus were 6.4% on average; however, 49 species were <2% divergent from congeners. Low levels of interspecific divergence could be caused by recent evolutionary radiations coupled with the low rates of mtDNA evolution previously observed in turtles. Complementing distance-based barcoding with character-based methods for identifying diagnostic sets of nucleotides provided better resolution in several cases where distance-based methods failed to distinguish species. An online identification engine was created to provide character-based identifications. This study constitutes the first comprehensive barcoding effort for this seriously threatened order. © 2011 Blackwell Publishing Ltd.
Zill, Oliver A.; Scannell, Devin R.; Kuei, Jeffrey; Sadhu, Meru; Rine, Jasper
2012-01-01
The genetic bases for species-specific traits are widely sought, but reliable experimental methods with which to identify functionally divergent genes are lacking. In the Saccharomyces genus, interspecies complementation tests can be used to evaluate functional conservation and divergence of biological pathways or networks. Silent information regulator (SIR) proteins in S. bayanus provide an ideal test case for this approach because they show remarkable divergence in sequence and paralog number from those found in the closely related S. cerevisiae. We identified genes required for silencing in S. bayanus using a genetic screen for silencing-defective mutants. Complementation tests in interspecies hybrids identified an evolutionarily conserved Sir-protein-based silencing machinery, as defined by two interspecies complementation groups (SIR2 and SIR3). However, recessive mutations in S. bayanus SIR4 isolated from this screen could not be complemented by S. cerevisiae SIR4, revealing species-specific functional divergence in the Sir4 protein despite conservation of the overall function of the Sir2/3/4 complex. A cladistic complementation series localized the occurrence of functional changes in SIR4 to the S. cerevisiae and S. paradoxus branches of the Saccharomyces phylogeny. Most of this functional divergence mapped to sequence changes in the Sir4 PAD. Finally, a hemizygosity modifier screen in the interspecies hybrids identified additional genes involved in S. bayanus silencing. Thus, interspecies complementation tests can be used to identify (1) mutations in genetically underexplored organisms, (2) loci that have functionally diverged between species, and (3) evolutionary events of functional consequence within a genus. PMID:22923378
Wei, Chaoling; Yang, Hua; Wang, Songbo; Zhao, Jian; Liu, Chun; Gao, Liping; Xia, Enhua; Lu, Ying; Tai, Yuling; She, Guangbiao; Sun, Jun; Cao, Haisheng; Tong, Wei; Gao, Qiang; Li, Yeyun; Deng, Weiwei; Jiang, Xiaolan; Wang, Wenzhao; Chen, Qi; Zhang, Shihua; Li, Haijing; Wu, Junlan; Wang, Ping; Li, Penghui; Shi, Chengying; Zheng, Fengya; Jian, Jianbo; Huang, Bei; Shan, Dai; Shi, Mingming; Fang, Congbing; Yue, Yi; Li, Fangdong; Li, Daxiang; Wei, Shu; Han, Bin; Jiang, Changjun; Yin, Ye; Xia, Tao; Zhang, Zhengzhu; Bennetzen, Jeffrey L; Zhao, Shancen; Wan, Xiaochun
2018-05-01
Tea, one of the world's most important beverage crops, provides numerous secondary metabolites that account for its rich taste and health benefits. Here we present a high-quality sequence of the genome of tea, Camellia sinensis var. sinensis (CSS), using both Illumina and PacBio sequencing technologies. At least 64% of the 3.1-Gb genome assembly consists of repetitive sequences, and the rest yields 33,932 high-confidence predictions of encoded proteins. Divergence between two major lineages, CSS and Camellia sinensis var. assamica (CSA), is calculated to ∼0.38 to 1.54 million years ago (Mya). Analysis of genic collinearity reveals that the tea genome is the product of two rounds of whole-genome duplications (WGDs) that occurred ∼30 to 40 and ∼90 to 100 Mya. We provide evidence that these WGD events, and subsequent paralogous duplications, had major impacts on the copy numbers of secondary metabolite genes, particularly genes critical to producing three key quality compounds: catechins, theanine, and caffeine. Analyses of transcriptome and phytochemistry data show that amplification and transcriptional divergence of genes encoding a large acyltransferase family and leucoanthocyanidin reductases are associated with the characteristic young leaf accumulation of monomeric galloylated catechins in tea, while functional divergence of a single member of the glutamine synthetase gene family yielded theanine synthetase. This genome sequence will facilitate understanding of tea genome evolution and tea metabolite pathways, and will promote germplasm utilization for breeding improved tea varieties. Copyright © 2018 the Author(s). Published by PNAS.
Chakraborty, Ujani; George, Carolyn M.; Lyndaker, Amy M.; Alani, Eric
2016-01-01
Single-strand annealing (SSA) is an important homologous recombination mechanism that repairs DNA double strand breaks (DSBs) occurring between closely spaced repeat sequences. During SSA, the DSB is acted upon by exonucleases to reveal complementary sequences that anneal and are then repaired through tail clipping, DNA synthesis, and ligation steps. In baker’s yeast, the Msh DNA mismatch recognition complex and the Sgs1 helicase act to suppress SSA between divergent sequences by binding to mismatches present in heteroduplex DNA intermediates and triggering a DNA unwinding mechanism known as heteroduplex rejection. Using baker’s yeast as a model, we have identified new factors and regulatory steps in heteroduplex rejection during SSA. First we showed that Top3-Rmi1, a topoisomerase complex that interacts with Sgs1, is required for heteroduplex rejection. Second, we found that the replication processivity clamp proliferating cell nuclear antigen (PCNA) is dispensable for heteroduplex rejection, but is important for repairing mismatches formed during SSA. Third, we showed that modest overexpression of Msh6 results in a significant increase in heteroduplex rejection; this increase is due to a compromise in Msh2-Msh3 function required for the clipping of 3′ tails. Thus 3′ tail clipping during SSA is a critical regulatory step in the repair vs. rejection decision; rejection is favored before the 3′ tails are clipped. Unexpectedly, Msh6 overexpression, through interactions with PCNA, disrupted heteroduplex rejection between divergent sequences in another recombination substrate. These observations illustrate the delicate balance that exists between repair and replication factors to optimize genome stability. PMID:26680658
Wei, Chaoling; Yang, Hua; Wang, Songbo; Zhao, Jian; Liu, Chun; Gao, Liping; Xia, Enhua; Lu, Ying; Tai, Yuling; She, Guangbiao; Sun, Jun; Cao, Haisheng; Tong, Wei; Gao, Qiang; Li, Yeyun; Deng, Weiwei; Jiang, Xiaolan; Wang, Wenzhao; Chen, Qi; Zhang, Shihua; Li, Haijing; Wu, Junlan; Wang, Ping; Li, Penghui; Shi, Chengying; Zheng, Fengya; Jian, Jianbo; Huang, Bei; Shan, Dai; Shi, Mingming; Fang, Congbing; Yue, Yi; Li, Fangdong; Li, Daxiang; Wei, Shu; Han, Bin; Jiang, Changjun; Yin, Ye; Xia, Tao; Zhang, Zhengzhu; Bennetzen, Jeffrey L.; Zhao, Shancen; Wan, Xiaochun
2018-01-01
Tea, one of the world’s most important beverage crops, provides numerous secondary metabolites that account for its rich taste and health benefits. Here we present a high-quality sequence of the genome of tea, Camellia sinensis var. sinensis (CSS), using both Illumina and PacBio sequencing technologies. At least 64% of the 3.1-Gb genome assembly consists of repetitive sequences, and the rest yields 33,932 high-confidence predictions of encoded proteins. Divergence between two major lineages, CSS and Camellia sinensis var. assamica (CSA), is calculated to ∼0.38 to 1.54 million years ago (Mya). Analysis of genic collinearity reveals that the tea genome is the product of two rounds of whole-genome duplications (WGDs) that occurred ∼30 to 40 and ∼90 to 100 Mya. We provide evidence that these WGD events, and subsequent paralogous duplications, had major impacts on the copy numbers of secondary metabolite genes, particularly genes critical to producing three key quality compounds: catechins, theanine, and caffeine. Analyses of transcriptome and phytochemistry data show that amplification and transcriptional divergence of genes encoding a large acyltransferase family and leucoanthocyanidin reductases are associated with the characteristic young leaf accumulation of monomeric galloylated catechins in tea, while functional divergence of a single member of the glutamine synthetase gene family yielded theanine synthetase. This genome sequence will facilitate understanding of tea genome evolution and tea metabolite pathways, and will promote germplasm utilization for breeding improved tea varieties. PMID:29678829
DRS is far less divergent than streptococcal inhibitor of complement of group A streptococcus.
Sagar, Vivek; Kumar, Rajesh; Ganguly, Nirmal K; Menon, Thangam; Chakraborti, Anuradha
2007-04-01
When 100 group A streptococcus isolates were screened, drs, a variant of sic, was identified in emm12 and emm55 isolates. Molecular characterization showed that the drs gene sequence is highly conserved, unlike the sic gene sequence. However, the variation in gene size observed was due to the presence of extra internal repeat sequences.
DRS Is Far Less Divergent than Streptococcal Inhibitor of Complement of Group A Streptococcus▿
Sagar, Vivek; Kumar, Rajesh; Ganguly, Nirmal K.; Menon, Thangam; Chakraborti, Anuradha
2007-01-01
When 100 group A streptococcus isolates were screened, drs, a variant of sic, was identified in emm12 and emm55 isolates. Molecular characterization showed that the drs gene sequence is highly conserved, unlike the sic gene sequence. However, the variation in gene size observed was due to the presence of extra internal repeat sequences. PMID:17237170
Deep Sequencing Reveals a Divergent Ugandan cassava brown streak virus Isolate from Malawi
Winter, Stephan; Mukasa, Settumba; Tairo, Fred; Sseruwagi, Peter; Ndunguru, Joseph; Duffy, Siobain
2017-01-01
ABSTRACT Illumina sequencing of RNA from a cassava cutting from northern Malawi produced a genome of Ugandan cassava brown streak virus (UCBSV-MW-NB7_2013). Sequence comparisons revealed stronger similarity to an isolate from nearby Tanzania (93.4% pairwise nucleotide identity) than to those previously reported from Malawi (86.9 to 87.0%). PMID:28818908
Evolution of Enzyme Superfamilies: Comprehensive Exploration of Sequence-Function Relationships.
Baier, F; Copp, J N; Tokuriki, N
2016-11-22
The sequence and functional diversity of enzyme superfamilies have expanded through billions of years of evolution from a common ancestor. Understanding how protein sequence and functional "space" have expanded, at both the evolutionary and molecular level, is central to biochemistry, molecular biology, and evolutionary biology. Integrative approaches that examine protein sequence, structure, and function have begun to provide comprehensive views of the functional diversity and evolutionary relationships within enzyme superfamilies. In this review, we outline the recent advances in our understanding of enzyme evolution and superfamily functional diversity. We describe the tools that have been used to comprehensively analyze sequence relationships and to characterize sequence and function relationships. We also highlight recent large-scale experimental approaches that systematically determine the activity profiles across enzyme superfamilies. We identify several intriguing insights from this recent body of work. First, promiscuous activities are prevalent among extant enzymes. Second, many divergent proteins retain "function connectivity" via enzyme promiscuity, which can be used to probe the evolutionary potential and history of enzyme superfamilies. Finally, we discuss open questions regarding the intricacies of enzyme divergence, as well as potential research directions that will deepen our understanding of enzyme superfamily evolution.
Diversity and phylogenetic relationships among Bartonella strains from Thai bats.
McKee, Clifton D; Kosoy, Michael Y; Bai, Ying; Osikowicz, Lynn M; Franka, Richard; Gilbert, Amy T; Boonmar, Sumalee; Rupprecht, Charles E; Peruski, Leonard F
2017-01-01
Bartonellae are phylogenetically diverse, intracellular bacteria commonly found in mammals. Previous studies have demonstrated that bats have a high prevalence and diversity of Bartonella infections globally. Isolates (n = 42) were obtained from five bat species in four provinces of Thailand and analyzed using sequences of the citrate synthase gene (gltA). Sequences clustered into seven distinct genogroups; four of these genogroups displayed similarity with Bartonella spp. sequences from other bats in Southeast Asia, Africa, and Eastern Europe. Thirty of the isolates representing these seven genogroups were further characterized by sequencing four additional loci (ftsZ, nuoG, rpoB, and ITS) to clarify their evolutionary relationships with other Bartonella species and to assess patterns of diversity among strains. Among the seven genogroups, there were differences in the number of sequence variants, ranging from 1-5, and the amount of nucleotide divergence, ranging from 0.035-3.9%. Overall, these seven genogroups meet the criteria for distinction as novel Bartonella species, with sequence divergence among genogroups ranging from 6.4-15.8%. Evidence of intra- and intercontinental phylogenetic relationships and instances of homologous recombination among Bartonella genogroups in related bat species were found in Thai bats.
Janes, Holly; Frahm, Nicole; DeCamp, Allan; Rolland, Morgane; Gabriel, Erin; Wolfson, Julian; Hertz, Tomer; Kallas, Esper; Goepfert, Paul; Friedrich, David P.; Corey, Lawrence; Mullins, James I.; McElrath, M. Juliana; Gilbert, Peter
2012-01-01
Background The sieve analysis for the Step trial found evidence that breakthrough HIV-1 sequences for MRKAd5/HIV-1 Gag/Pol/Nef vaccine recipients were more divergent from the vaccine insert than placebo sequences in regions with predicted epitopes. We linked the viral sequence data with immune response and acute viral load data to explore mechanisms for and consequences of the observed sieve effect. Methods Ninety-one male participants (37 placebo and 54 vaccine recipients) were included; viral sequences were obtained at the time of HIV-1 diagnosis. T-cell responses were measured 4 weeks post-second vaccination and at the first or second week post-diagnosis. Acute viral load was obtained at RNA-positive and antibody-negative visits. Findings Vaccine recipients had a greater magnitude of post-infection CD8+ T cell response than placebo recipients (median 1.68% vs 1.18%; p = 0·04) and greater breadth of post-infection response (median 4.5 vs 2; p = 0·06). Viral sequences for vaccine recipients were marginally more divergent from the insert than placebo sequences in regions of Nef targeted by pre-infection immune responses (p = 0·04; Pol p = 0·13; Gag p = 0·89). Magnitude and breadth of pre-infection responses did not correlate with distance of the viral sequence to the insert (p>0·50). Acute log viral load trended lower in vaccine versus placebo recipients (estimated mean 4·7 vs 5·1) but the difference was not significant (p = 0·27). Neither was acute viral load associated with distance of the viral sequence to the insert (p>0·30). Interpretation Despite evidence of anamnestic responses, the sieve effect was not well explained by available measures of T-cell immunogenicity. Sequence divergence from the vaccine was not significantly associated with acute viral load. While point estimates suggested weak vaccine suppression of viral load, the result was not significant and more viral load data would be needed to detect suppression. PMID:22952672
Prado, Blanca R.; Pozo, Carmen; Valdez-Moreno, Martha; Hebert, Paul D. N.
2011-01-01
Background Recent studies have demonstrated the utility of DNA barcoding in the discovery of overlooked species and in the connection of immature and adult stages. In this study, we use DNA barcoding to examine diversity patterns in 121 species of Nymphalidae from the Yucatan Peninsula in Mexico. Our results suggest the presence of cryptic species in 8 of these 121 taxa. As well, the reference database derived from the analysis of adult specimens allowed the identification of nymphalid caterpillars providing new details on host plant use. Methodology/Principal Findings We gathered DNA barcode sequences from 857 adult Nymphalidae representing 121 different species. This total includes four species (Adelpha iphiclus, Adelpha malea, Hamadryas iphtime and Taygetis laches) that were initially overlooked because of their close morphological similarity to other species. The barcode results showed that each of the 121 species possessed a diagnostic array of barcode sequences. In addition, there was evidence of cryptic taxa; seven species included two barcode clusters showing more than 2% sequence divergence while one species included three clusters. All 71 nymphalid caterpillars were identified to a species level by their sequence congruence to adult sequences. These caterpillars represented 16 species, and included Hamadryas julitta, an endemic species from the Yucatan Peninsula whose larval stages and host plant (Dalechampia schottii, also endemic to the Yucatan Peninsula) were previously unknown. Conclusions/Significance This investigation has revealed overlooked species in a well-studied museum collection of nymphalid butterflies and suggests that there is a substantial incidence of cryptic species that await full characterization. The utility of barcoding in the rapid identification of caterpillars also promises to accelerate the assembly of information on life histories, a particularly important advance for hyperdiverse tropical insect assemblages. PMID:22132140
Veltsos, Paris; Cossard, Guillaume; Beaudoing, Emmanuel; Beydon, Genséric; Savova Bianchi, Dessislava; Roux, Camille; C González-Martínez, Santiago; R Pannell, John
2018-05-29
Dioecious plants vary in whether their sex chromosomes are heteromorphic or homomorphic, but even homomorphic sex chromosomes may show divergence between homologues in the non-recombining, sex-determining region (SDR). Very little is known about the SDR of these species, which might represent particularly early stages of sex-chromosome evolution. Here, we assess the size and content of the SDR of the diploid dioecious herb Mercurialis annua , a species with homomorphic sex chromosomes and mild Y-chromosome degeneration. We used RNA sequencing (RNAseq) to identify new Y-linked markers for M. annua. Twelve of 24 transcripts showing male-specific expression in a previous experiment could be amplified by polymerase chain reaction (PCR) only from males, and are thus likely to be Y-linked. Analysis of genome-capture data from multiple populations of M. annua pointed to an additional six male-limited (and thus Y-linked) sequences. We used these markers to identify and sequence 17 sex-linked bacterial artificial chromosomes (BACs), which form 11 groups of non-overlapping sequences, covering a total sequence length of about 1.5 Mb. Content analysis of this region suggests that it is enriched for repeats, has low gene density, and contains few candidate sex-determining genes. The BACs map to a subset of the sex-linked region of the genetic map, which we estimate to be at least 14.5 Mb. This is substantially larger than estimates for other dioecious plants with homomorphic sex chromosomes, both in absolute terms and relative to their genome sizes. Our data provide a rare, high-resolution view of the homomorphic Y chromosome of a dioecious plant.
Global Genomic Diversity of Oryza sativa Varieties Revealed by Comparative Physical Mapping
Wang, Xiaoming; Kudrna, David A.; Pan, Yonglong; Wang, Hao; Liu, Lin; Lin, Haiyan; Zhang, Jianwei; Song, Xiang; Goicoechea, Jose Luis; Wing, Rod A.; Zhang, Qifa; Luo, Meizhong
2014-01-01
Bacterial artificial chromosome (BAC) physical maps embedding a large number of BAC end sequences (BESs) were generated for Oryza sativa ssp. indica varieties Minghui 63 (MH63) and Zhenshan 97 (ZS97) and were compared with the genome sequences of O. sativa spp. japonica cv. Nipponbare and O. sativa ssp. indica cv. 93-11. The comparisons exhibited substantial diversities in terms of large structural variations and small substitutions and indels. Genome-wide BAC-sized and contig-sized structural variations were detected, and the shared variations were analyzed. In the expansion regions of the Nipponbare reference sequence, in comparison to the MH63 and ZS97 physical maps, as well as to the previously constructed 93-11 physical map, the amounts and types of the repeat contents, and the outputs of gene ontology analysis, were significantly different from those of the whole genome. Using the physical maps of four wild Oryza species from OMAP (http://www.omap.org) as a control, we detected many conserved and divergent regions related to the evolution process of O. sativa. Between the BESs of MH63 and ZS97 and the two reference sequences, a total of 1532 polymorphic simple sequence repeats (SSRs), 71,383 SNPs, 1767 multiple nucleotide polymorphisms, 6340 insertions, and 9137 deletions were identified. This study provides independent whole-genome resources for intra- and intersubspecies comparisons and functional genomics studies in O. sativa. Both the comparative physical maps and the GBrowse, which integrated the QTL and molecular markers from GRAMENE (http://www.gramene.org) with our physical maps and analysis results, are open to the public through our Web site (http://gresource.hzau.edu.cn/resource/resource.html). PMID:24424778
Broderick, Nichole A; Raffa, Kenneth F; Goodman, Robert M; Handelsman, Jo
2004-01-01
Little is known about bacteria associated with Lepidoptera, the large group of mostly phytophagous insects comprising the moths and butterflies. We inventoried the larval midgut bacteria of a polyphagous foliivore, the gypsy moth (Lymantria dispar L.), whose gut is highly alkaline, by using traditional culturing and culture-independent methods. We also examined the effects of diet on microbial composition. Analysis of individual third-instar larvae revealed a high degree of similarity of microbial composition among insects fed on the same diet. DNA sequence analysis indicated that most of the PCR-amplified 16S rRNA genes belong to the gamma-Proteobacteria and low G+C gram-positive divisions and that the cultured members represented more than half of the phylotypes identified. Less frequently detected taxa included members of the alpha-Proteobacterium, Actinobacterium, and Cytophaga/Flexibacter/Bacteroides divisions. The 16S rRNA gene sequences from 7 of the 15 cultured organisms and 8 of the 9 sequences identified by PCR amplification diverged from previously reported bacterial sequences. The microbial composition of midguts differed substantially among larvae feeding on a sterilized artificial diet, aspen, larch, white oak, or willow. 16S rRNA analysis of cultured isolates indicated that an Enterococcus species and culture-independent analysis indicated that an Entbacter sp. were both present in all larvae, regardless of the feeding substrate; the sequences of these two phylotypes varied less than 1% among individual insects. These results provide the first comprehensive description of the microbial diversity of a lepidopteran midgut and demonstrate that the plant species in the diet influences the composition of the gut bacterial community.
Highton, Richard; Hastings, Amy Picard; Palmer, Catherine; Watts, Richard; Hass, Carla A.; Culver, Melanie; Arnold, Stevan
2012-01-01
Salamanders of the North American plethodontid genus Plethodon are important model organisms in a variety of studies that depend on a phylogenetic framework (e.g., chemical communication, ecological competition, life histories, hybridization, and speciation), and consequently their systematics has been intensively investigated over several decades. Nevertheless, we lack a synthesis of relationships among the species. In the analyses reported here we use new DNA sequence data from the complete nuclear albumin gene (1818 bp) and the 12s mitochondrial gene (355 bp), as well as published data for four other genes (Wiens et al., 2006), up to a total of 6989 bp, to infer relationships. We relate these results to past systematic work based on morphology, allozymes, and DNA sequences. Although basal relationships show a strong consensus across studies, many terminal relationships remain in flux despite substantial sequencing and other molecular and morphological studies. This systematic instability appears to be a consequence of contemporaneous bursts of speciation in the late Miocene and Pliocene, yielding many closely related extant species in each of the four eastern species groups. Therefore we conclude that many relationships are likely to remain poorly resolved in the face of additional sequencing efforts. On the other hand, the current classification of the 45 eastern species into four species groups is supported. The Plethodon cinereus group (10 species) is the sister group to the clade comprising the other three groups, but these latter groups (Plethodon glutinosus [28 species], Plethodon welleri [5 species], and Plethodon wehrlei [2 species]) probably diverged from each other at approximately the same time.
Comparative RNA sequencing reveals substantial genetic variation in endangered primates
Perry, George H.; Melsted, Páll; Marioni, John C.; Wang, Ying; Bainer, Russell; Pickrell, Joseph K.; Michelini, Katelyn; Zehr, Sarah; Yoder, Anne D.; Stephens, Matthew; Pritchard, Jonathan K.; Gilad, Yoav
2012-01-01
Comparative genomic studies in primates have yielded important insights into the evolutionary forces that shape genetic diversity and revealed the likely genetic basis for certain species-specific adaptations. To date, however, these studies have focused on only a small number of species. For the majority of nonhuman primates, including some of the most critically endangered, genome-level data are not yet available. In this study, we have taken the first steps toward addressing this gap by sequencing RNA from the livers of multiple individuals from each of 16 mammalian species, including humans and 11 nonhuman primates. Of the nonhuman primate species, five are lemurs and two are lorisoids, for which little or no genomic data were previously available. To analyze these data, we developed a method for de novo assembly and alignment of orthologous gene sequences across species. We assembled an average of 5721 gene sequences per species and characterized diversity and divergence of both gene sequences and gene expression levels. We identified patterns of variation that are consistent with the action of positive or directional selection, including an 18-fold enrichment of peroxisomal genes among genes whose regulation likely evolved under directional selection in the ancestral primate lineage. Importantly, we found no relationship between genetic diversity and endangered status, with the two most endangered species in our study, the black and white ruffed lemur and the Coquerel's sifaka, having the highest genetic diversity among all primates. Our observations imply that many endangered lemur populations still harbor considerable genetic variation. Timely efforts to conserve these species alongside their habitats have, therefore, strong potential to achieve long-term success. PMID:22207615
Nadjar-Boger, Elisabeth; Funkenstein, Bruria
2011-02-01
Myostatin (MSTN) is a member of the transforming growth factor-ß superfamily that functions as a negative regulator of skeletal muscle development and growth in mammals. Fish express at least two genes for MSTN: MSTN-1 and MSTN-2. To date, MSTN-2 promoters have been cloned only from salmonids and zebrafish. Here we described the cloning and sequence analysis of MSTN-2 gene and its 5' flanking region in the marine fish Sparus aurata (saMSTN-2). We demonstrate the existence of three alleles of the promoter and three alleles of the first intron. Sequence comparison of the promoter region in the three alleles revealed that although the sequences of the first 1050 bp upstream of the translation start site are almost identical in the three alleles, a substantial sequence divergence is seen further upstream. Careful sequence analysis of the region upstream of the first 1050 bp in the three alleles identified several elements that appear to be repeated in some or all sequences, at different positions. This suggests that the promoter region of saMSTN-2 has been subjected to various chromosomal rearrangements during the course of evolution, reflecting either insertion or deletion events. Screening of several genomic DNA collections indicated differences in allele frequency, with allele 'b' being the most abundant, followed by allele 'c', whereas allele 'a' is relatively rare. Sequence analysis of saMSTN-2 gene also revealed polymorphism in the first intron, identifying three alleles. The length difference in alleles '1R' and '2R' of the first intron is due to the presence of one or two copies of a repeated block of approximately 150 bp, located at the 5' end of the first intron. The third allele, '4R', has an additional insertion of 323 bp located 116 bp upstream of the 3' end of the first intron. Analysis of several DNA collections showed that the '2R' allele is the most common, followed by the '4R' allele, whereas the '1R' allele is relatively rare. Progeny analysis of a full-sib family showed a Mendelian mode of inheritance of the two genetic loci. No clear association was found between the two genetic markers and growth rate. These results show for the first time a substantial degree of polymorphism in both the promoter and first intron of MSTN-2 gene in a perciform fish species which points to chromosomal rearrangements that took place during evolution.
Phylogeny and divergence of the pinnipeds (Carnivora: Mammalia) assessed using a multigene dataset
Higdon, Jeff W; Bininda-Emonds, Olaf RP; Beck, Robin MD; Ferguson, Steven H
2007-01-01
Background Phylogenetic comparative methods are often improved by complete phylogenies with meaningful branch lengths (e.g., divergence dates). This study presents a dated molecular supertree for all 34 world pinniped species derived from a weighted matrix representation with parsimony (MRP) supertree analysis of 50 gene trees, each determined under a maximum likelihood (ML) framework. Divergence times were determined by mapping the same sequence data (plus two additional genes) on to the supertree topology and calibrating the ML branch lengths against a range of fossil calibrations. We assessed the sensitivity of our supertree topology in two ways: 1) a second supertree with all mtDNA genes combined into a single source tree, and 2) likelihood-based supermatrix analyses. Divergence dates were also calculated using a Bayesian relaxed molecular clock with rate autocorrelation to test the sensitivity of our supertree results further. Results The resulting phylogenies all agreed broadly with recent molecular studies, in particular supporting the monophyly of Phocidae, Otariidae, and the two phocid subfamilies, as well as an Odobenidae + Otariidae sister relationship; areas of disagreement were limited to four more poorly supported regions. Neither the supertree nor supermatrix analyses supported the monophyly of the two traditional otariid subfamilies, supporting suggestions for the need for taxonomic revision in this group. Phocid relationships were similar to other recent studies and deeper branches were generally well-resolved. Halichoerus grypus was nested within a paraphyletic Pusa, although relationships within Phocina tend to be poorly supported. Divergence date estimates for the supertree were in good agreement with other studies and the available fossil record; however, the Bayesian relaxed molecular clock divergence date estimates were significantly older. Conclusion Our results join other recent studies and highlight the need for a re-evaluation of pinniped taxonomy, especially as regards the subfamilial classification of otariids and the generic nomenclature of Phocina. Even with the recent publication of new sequence data, the available genetic sequence information for several species, particularly those in Arctocephalus, remains very limited, especially for nuclear markers. However, resolution of parts of the tree will probably remain difficult, even with additional data, due to apparent rapid radiations. Our study addresses the lack of a recent pinniped phylogeny that includes all species and robust divergence dates for all nodes, and will therefore prove indispensable to comparative and macroevolutionary studies of this group of carnivores. PMID:17996107
Virus Identification in Unknown Tropical Febrile Illness Cases Using Deep Sequencing
Balmaseda, Angel; Harris, Eva; DeRisi, Joseph L.
2012-01-01
Dengue virus is an emerging infectious agent that infects an estimated 50–100 million people annually worldwide, yet current diagnostic practices cannot detect an etiologic pathogen in ∼40% of dengue-like illnesses. Metagenomic approaches to pathogen detection, such as viral microarrays and deep sequencing, are promising tools to address emerging and non-diagnosable disease challenges. In this study, we used the Virochip microarray and deep sequencing to characterize the spectrum of viruses present in human sera from 123 Nicaraguan patients presenting with dengue-like symptoms but testing negative for dengue virus. We utilized a barcoding strategy to simultaneously deep sequence multiple serum specimens, generating on average over 1 million reads per sample. We then implemented a stepwise bioinformatic filtering pipeline to remove the majority of human and low-quality sequences to improve the speed and accuracy of subsequent unbiased database searches. By deep sequencing, we were able to detect virus sequence in 37% (45/123) of previously negative cases. These included 13 cases with Human Herpesvirus 6 sequences. Other samples contained sequences with similarity to sequences from viruses in the Herpesviridae, Flaviviridae, Circoviridae, Anelloviridae, Asfarviridae, and Parvoviridae families. In some cases, the putative viral sequences were virtually identical to known viruses, and in others they diverged, suggesting that they may derive from novel viruses. These results demonstrate the utility of unbiased metagenomic approaches in the detection of known and divergent viruses in the study of tropical febrile illness. PMID:22347512
Coulthart, Michael B; Posada, David; Crandall, Keith A; Dekaban, Gregory A
2006-03-01
Recently, the putative finding of ancient human T cell leukemia virus type 1 (HTLV-1) long terminal repeat (LTR) DNA sequences in association with a 1500-year-old Chilean mummy has stirred vigorous debate. The debate is based partly on the inherent uncertainties associated with phylogenetic reconstruction when only short sequences of closely related genotypes are available. However, a full analysis of what phylogenetic information is present in the mummy data has not previously been published, leaving open the question of what precisely is the range of admissible interpretation. To fulfill this need, we re-analyzed the mummy data in a new way. We first performed phylogenetic analysis of 188 published LTR DNA sequences from extant strains belonging to the HTLV-1 Cosmopolitan clade, using the method of statistical parsimony which is designed both to optimize phylogenetic resolution among sequences with little evolutionary divergence, and to permit precise mapping of individual sequence mutations onto branches of a divergence network. We then deduced possible phylogenetic positions for the two main categories of published Chilean mummy sequences, based on their published 157-nucleotide LTR sequences. The possible phylogenetic placements for one of the mummy sequence categories are consistent with a modern origin. However, one of these placements for the other mummy sequence category falls very close to the root of the Cosmopolitan clade, consistent with an ancient origin for both this mummy sequence and the Cosmopolitan clade.
Navigating the tip of the genomic iceberg: Next-generation sequencing for plant systematics.
Straub, Shannon C K; Parks, Matthew; Weitemier, Kevin; Fishbein, Mark; Cronn, Richard C; Liston, Aaron
2012-02-01
Just as Sanger sequencing did more than 20 years ago, next-generation sequencing (NGS) is poised to revolutionize plant systematics. By combining multiplexing approaches with NGS throughput, systematists may no longer need to choose between more taxa or more characters. Here we describe a genome skimming (shallow sequencing) approach for plant systematics. Through simulations, we evaluated optimal sequencing depth and performance of single-end and paired-end short read sequences for assembly of nuclear ribosomal DNA (rDNA) and plastomes and addressed the effect of divergence on reference-guided plastome assembly. We also used simulations to identify potential phylogenetic markers from low-copy nuclear loci at different sequencing depths. We demonstrated the utility of genome skimming through phylogenetic analysis of the Sonoran Desert clade (SDC) of Asclepias (Apocynaceae). Paired-end reads performed better than single-end reads. Minimum sequencing depths for high quality rDNA and plastome assemblies were 40× and 30×, respectively. Divergence from the reference significantly affected plastome assembly, but relatively similar references are available for most seed plants. Deeper rDNA sequencing is necessary to characterize intragenomic polymorphism. The low-copy fraction of the nuclear genome was readily surveyed, even at low sequencing depths. Nearly 160000 bp of sequence from three organelles provided evidence of phylogenetic incongruence in the SDC. Adoption of NGS will facilitate progress in plant systematics, as whole plastome and rDNA cistrons, partial mitochondrial genomes, and low-copy nuclear markers can now be efficiently obtained for molecular phylogenetics studies.
The Evolution of Ribosomal DNA: Divergent Paralogues and Phylogenetic Implications
Buckler-IV, E. S.; Ippolito, A.; Holtsford, T. P.
1997-01-01
Although nuclear ribosomal DNA (rDNA) repeats evolve together through concerted evolution, some genomes contain a considerable diversity of paralogous rDNA. This diversity includes not only multiple functional loci but also putative pseudogenes and recombinants. We examined the occurrence of divergent paralogues and recombinants in Gossypium, Nicotiana, Tripsacum, Winteraceae, and Zea ribosomal internal transcribed spacer (ITS) sequences. Some of the divergent paralogues are probably rDNA pseudogenes, since they have low predicted secondary structure stability, high substitution rates, and many deamination-driven substitutions at methylation sites. Under standard PCR conditions, the low stability paralogues amplified well, while many high-stability paralogues amplified poorly. Under highly denaturing PCR conditions (i.e., with dimethylsulfoxide), both low- and high-stability paralogues amplified well. We also found recombination between divergent paralogues. For phylogenetics, divergent ribosomal paralogues can aid in reconstructing ancestral states and thus serve as good outgroups. Divergent paralogues can also provide companion rDNA phylogenies. However, phylogeneticists must discriminate among families of divergent paralogues and recombinants or suffer from muddled and inaccurate organismal phylogenies. PMID:9055091
Landry, C; Geyer, L B; Arakaki, Y; Uehara, T; Palumbi, Stephen R
2003-01-01
The rich species diversity of the marine Indo-West Pacific (IWP) has been explained largely on the basis of historical observation of large-scale diversity gradients. Careful study of divergence among closely related species can reveal important new information about the pace and mechanisms of their formation, and can illuminate the genesis of biogeographic patterns. Young species inhabiting the IWP include urchins of the genus Echinometra, which diverged over the past 1-5 Myr. Here, we report the most recent divergence of two cryptic species of Echinometra inhabiting this region. Mitochondrial cytochrome oxidase 1 (CO1) sequence data show that in Echinometra oblonga, species-level divergence in sperm morphology, gamete recognition proteins and gamete compatibility arose between central and western Pacific populations in the past 250 000 years. Divergence in sperm attachment proteins suggests rapid evolution of the fertilization system. Divergence of sperm morphology may be a common feature of free-spawning animals, and offers opportunities to simultaneously understand genetic divergence, changes in protein expression patterns and morphological evolution in traits directly related to reproductive isolation. PMID:12964987
Auguste, Albert J.; Liria, Jonathan; Forrester, Naomi L.; Giambalvo, Dileyvic; Moncada, Maria; Long, Kanya C.; Morón, Dulce; de Manzione, Nuris; Tesh, Robert B.; Halsey, Eric S.; Kochel, Tadeusz J.; Hernandez, Rosa; Navarro, Juan-Carlos
2015-01-01
In 2010, an outbreak of febrile illness with arthralgic manifestations was detected at La Estación village, Portuguesa State, Venezuela. The etiologic agent was determined to be Mayaro virus (MAYV), a reemerging South American alphavirus. A total of 77 cases was reported and 19 were confirmed as seropositive. MAYV was isolated from acute-phase serum samples from 6 symptomatic patients. We sequenced 27 complete genomes representing the full spectrum of MAYV genetic diversity, which facilitated detection of a new genotype, designated N. Phylogenetic analysis of genomic sequences indicated that etiologic strains from Venezuela belong to genotype D. Results indicate that MAYV is highly conserved genetically, showing ≈17% nucleotide divergence across all 3 genotypes and 4% among genotype D strains in the most variable genes. Coalescent analyses suggested genotypes D and L diverged ≈150 years ago and genotype diverged N ≈250 years ago. This virus commonly infects persons residing near enzootic transmission foci because of anthropogenic incursions. PMID:26401714
Auguste, Albert J; Liria, Jonathan; Forrester, Naomi L; Giambalvo, Dileyvic; Moncada, Maria; Long, Kanya C; Morón, Dulce; de Manzione, Nuris; Tesh, Robert B; Halsey, Eric S; Kochel, Tadeusz J; Hernandez, Rosa; Navarro, Juan-Carlos; Weaver, Scott C
2015-10-01
In 2010, an outbreak of febrile illness with arthralgic manifestations was detected at La Estación village, Portuguesa State, Venezuela. The etiologic agent was determined to be Mayaro virus (MAYV), a reemerging South American alphavirus. A total of 77 cases was reported and 19 were confirmed as seropositive. MAYV was isolated from acute-phase serum samples from 6 symptomatic patients. We sequenced 27 complete genomes representing the full spectrum of MAYV genetic diversity, which facilitated detection of a new genotype, designated N. Phylogenetic analysis of genomic sequences indicated that etiologic strains from Venezuela belong to genotype D. Results indicate that MAYV is highly conserved genetically, showing ≈17% nucleotide divergence across all 3 genotypes and 4% among genotype D strains in the most variable genes. Coalescent analyses suggested genotypes D and L diverged ≈150 years ago and genotype diverged N ≈250 years ago. This virus commonly infects persons residing near enzootic transmission foci because of anthropogenic incursions.
Park, D-S; Suh, S-J; Hebert, P D N; Oh, H-W; Hong, K-J
2011-08-01
Although DNA barcode coverage has grown rapidly for many insect orders, there are some groups, such as scale insects, where sequence recovery has been difficult. However, using a recently developed primer set, we recovered barcode records from 373 specimens, providing coverage for 75 species from 31 genera in two families. Overall success was >90% for mealybugs and >80% for armored scale species. The G·C content was very low in most species, averaging just 16.3%. Sequence divergences (K2P) between congeneric species averaged 10.7%, while intra-specific divergences averaged 0.97%. However, the latter value was inflated by high intra-specific divergence in nine taxa, cases that may indicate species overlooked by current taxonomic treatments. Our study establishes the feasibility of developing a comprehensive barcode library for scale insects and indicates that its construction will both create an effective system for identifying scale insects and reveal taxonomic situations worthy of deeper analysis.
McNeal, Joel R; Kuehl, Jennifer V; Boore, Jeffrey L; de Pamphilis, Claude W
2007-10-24
Plastid genome content and protein sequence are highly conserved across land plants and their closest algal relatives. Parasitic plants, which obtain some or all of their nutrition through an attachment to a host plant, are often a striking exception. Heterotrophy can lead to relaxed constraint on some plastid genes or even total gene loss. We sequenced plastid genomes of two species in the parasitic genus Cuscuta along with a non-parasitic relative, Ipomoea purpurea, to investigate changes in the plastid genome that may result from transition to the parasitic lifestyle. Aside from loss of all ndh genes, Cuscuta exaltata retains photosynthetic and photorespiratory genes that evolve under strong selective constraint. Cuscuta obtusiflora has incurred substantially more change to its plastid genome, including loss of all genes for the plastid-encoded RNA polymerase. Despite extensive change in gene content and greatly increased rate of overall nucleotide substitution, C. obtusiflora also retains all photosynthetic and photorespiratory genes with only one minor exception. Although Epifagus virginiana, the only other parasitic plant with its plastid genome sequenced to date, has lost a largely overlapping set of transfer-RNA and ribosomal genes as Cuscuta, it has lost all genes related to photosynthesis and maintains a set of genes which are among the most divergent in Cuscuta. Analyses demonstrate photosynthetic genes are under the highest constraint of any genes within the plastid genomes of Cuscuta, indicating a function involving RuBisCo and electron transport through photosystems is still the primary reason for retention of the plastid genome in these species.
McNeal, Joel R; Kuehl, Jennifer V; Boore, Jeffrey L; de Pamphilis, Claude W
2007-01-01
Background Plastid genome content and protein sequence are highly conserved across land plants and their closest algal relatives. Parasitic plants, which obtain some or all of their nutrition through an attachment to a host plant, are often a striking exception. Heterotrophy can lead to relaxed constraint on some plastid genes or even total gene loss. We sequenced plastid genomes of two species in the parasitic genus Cuscuta along with a non-parasitic relative, Ipomoea purpurea, to investigate changes in the plastid genome that may result from transition to the parasitic lifestyle. Results Aside from loss of all ndh genes, Cuscuta exaltata retains photosynthetic and photorespiratory genes that evolve under strong selective constraint. Cuscuta obtusiflora has incurred substantially more change to its plastid genome, including loss of all genes for the plastid-encoded RNA polymerase. Despite extensive change in gene content and greatly increased rate of overall nucleotide substitution, C. obtusiflora also retains all photosynthetic and photorespiratory genes with only one minor exception. Conclusion Although Epifagus virginiana, the only other parasitic plant with its plastid genome sequenced to date, has lost a largely overlapping set of transfer-RNA and ribosomal genes as Cuscuta, it has lost all genes related to photosynthesis and maintains a set of genes which are among the most divergent in Cuscuta. Analyses demonstrate photosynthetic genes are under the highest constraint of any genes within the plastid genomes of Cuscuta, indicating a function involving RuBisCo and electron transport through photosystems is still the primary reason for retention of the plastid genome in these species. PMID:17956636
Waye, J S; Willard, H F
1986-09-01
The centromeric regions of all human chromosomes are characterized by distinct subsets of a diverse tandemly repeated DNA family, alpha satellite. On human chromosome 17, the predominant form of alpha satellite is a 2.7-kilobase-pair higher-order repeat unit consisting of 16 alphoid monomers. We present the complete nucleotide sequence of the 16-monomer repeat, which is present in 500 to 1,000 copies per chromosome 17, as well as that of a less abundant 15-monomer repeat, also from chromosome 17. These repeat units were approximately 98% identical in sequence, differing by the exclusion of precisely 1 monomer from the 15-monomer repeat. Homologous unequal crossing-over is suggested as a probable mechanism by which the different repeat lengths on chromosome 17 were generated, and the putative site of such a recombination event is identified. The monomer organization of the chromosome 17 higher-order repeat unit is based, in part, on tandemly repeated pentamers. A similar pentameric suborganization has been previously demonstrated for alpha satellite of the human X chromosome. Despite the organizational similarities, substantial sequence divergence distinguishes these subsets. Hybridization experiments indicate that the chromosome 17 and X subsets are more similar to each other than to the subsets found on several other human chromosomes. We suggest that the chromosome 17 and X alpha satellite subsets may be related components of a larger alphoid subfamily which have evolved from a common ancestral repeat into the contemporary chromosome-specific subsets.
Knief, Claudia
2015-01-01
Methane-oxidizing bacteria are characterized by their capability to grow on methane as sole source of carbon and energy. Cultivation-dependent and -independent methods have revealed that this functional guild of bacteria comprises a substantial diversity of organisms. In particular the use of cultivation-independent methods targeting a subunit of the particulate methane monooxygenase (pmoA) as functional marker for the detection of aerobic methanotrophs has resulted in thousands of sequences representing “unknown methanotrophic bacteria.” This limits data interpretation due to restricted information about these uncultured methanotrophs. A few groups of uncultivated methanotrophs are assumed to play important roles in methane oxidation in specific habitats, while the biology behind other sequence clusters remains still largely unknown. The discovery of evolutionary related monooxygenases in non-methanotrophic bacteria and of pmoA paralogs in methanotrophs requires that sequence clusters of uncultivated organisms have to be interpreted with care. This review article describes the present diversity of cultivated and uncultivated aerobic methanotrophic bacteria based on pmoA gene sequence diversity. It summarizes current knowledge about cultivated and major clusters of uncultivated methanotrophic bacteria and evaluates habitat specificity of these bacteria at different levels of taxonomic resolution. Habitat specificity exists for diverse lineages and at different taxonomic levels. Methanotrophic genera such as Methylocystis and Methylocaldum are identified as generalists, but they harbor habitat specific methanotrophs at species level. This finding implies that future studies should consider these diverging preferences at different taxonomic levels when analyzing methanotrophic communities. PMID:26696968
Molecular insights into the colonization and chromosomal diversification of Madeiran house mice.
Förster, D W; Gündüz, I; Nunes, A C; Gabriel, S; Ramalhinho, M G; Mathias, M L; Britton-Davidian, J; Searle, J B
2009-11-01
The colonization history of Madeiran house mice was investigated by analysing the complete mitochondrial (mt) D-loop sequences of 156 mice from the island of Madeira and mainland Portugal, extending on previous studies. The numbers of mtDNA haplotypes from Madeira and mainland Portugal were substantially increased (17 and 14 new haplotypes respectively), and phylogenetic analysis confirmed the previously reported link between the Madeiran archipelago and northern Europe. Sequence analysis revealed the presence of four mtDNA lineages in mainland Portugal, of which one was particularly common and widespread (termed the 'Portugal Main Clade'). There was no support for population bottlenecks during the formation of the six Robertsonian chromosome races on the island of Madeira, and D-loop sequence variation was not found to be structured according to karyotype. The colonization time of the Madeiran archipelago by Mus musculus domesticus was approached using two molecular dating methods (mismatch distribution and Bayesian skyline plot). Time estimates based on D-loop sequence variation at mainland sites (including previously published data from France and Turkey) were evaluated in the context of the zooarchaeological record of M. m. domesticus. A range of values for mutation rate (mu) and number of mouse generations per year was considered in these analyses because of the uncertainty surrounding these two parameters. The colonization of Portugal and Madeira by house mice is discussed in the context of the best-supported parameter values. In keeping with recent studies, our results suggest that mutation rate estimates based on interspecific divergence lead to gross overestimates concerning the timing of recent within-species events.
Tennessen, Jacob A.; Govindarajulu, Rajanikanth; Ashman, Tia-Lynn; Liston, Aaron
2014-01-01
Whole-genome duplications are radical evolutionary events that have driven speciation and adaptation in many taxa. Higher-order polyploids have complex histories often including interspecific hybridization and dynamic genomic changes. This chromosomal reshuffling is poorly understood for most polyploid species, despite their evolutionary and agricultural importance, due to the challenge of distinguishing homologous sequences from each other. Here, we use dense linkage maps generated with targeted sequence capture to improve the diploid strawberry (Fragaria vesca) reference genome and to disentangle the subgenomes of the wild octoploid progenitors of cultivated strawberry, Fragaria virginiana and Fragaria chiloensis. Our novel approach, POLiMAPS (Phylogenetics Of Linkage-Map-Anchored Polyploid Subgenomes), leverages sequence reads to associate informative interhomeolog phylogenetic markers with linkage groups and reference genome positions. In contrast to a widely accepted model, we find that one of the four subgenomes originates with the diploid cytoplasm donor F. vesca, one with the diploid Fragaria iinumae, and two with an unknown ancestor close to F. iinumae. Extensive unidirectional introgression has converted F. iinumae-like subgenomes to be more F. vesca-like, but never the reverse, due either to homoploid hybridization in the F. iinumae-like diploid ancestors or else strong selection spreading F. vesca-like sequence among subgenomes through homeologous exchange. In addition, divergence between homeologous chromosomes has been substantially augmented by interchromosomal rearrangements. Our phylogenetic approach reveals novel aspects of the complicated web of genetic exchanges that occur during polyploid evolution and suggests a path forward for unraveling other agriculturally and ecologically important polyploid genomes. PMID:25477420
Miller, Hilary C.; O’Meally, Denis; Ezaz, Tariq; Amemiya, Chris; Marshall-Graves, Jennifer A.; Edwards, Scott
2015-01-01
Major histocompatibility complex (MHC) genes are a central component of the vertebrate immune system and usually exist in a single genomic region. However, considerable differences in MHC organization and size exist between different vertebrate lineages. Reptiles occupy a key evolutionary position for understanding how variation in MHC structure evolved in vertebrates, but information on the structure of the MHC region in reptiles is limited. In this study, we investigate the organization and cytogenetic location of MHC genes in the tuatara (Sphenodon punctatus), the sole extant representative of the early-diverging reptilian order Rhynchocephalia. Sequencing and mapping of 12 clones containing class I and II MHC genes from a bacterial artificial chromosome library indicated that the core MHC region is located on chromosome 13q. However, duplication and translocation of MHC genes outside of the core region was evident, because additional class I MHC genes were located on chromosome 4p. We found a total of seven class I sequences and 11 class II β sequences, with evidence for duplication and pseudogenization of genes within the tuatara lineage. The tuatara MHC is characterized by high repeat content and low gene density compared with other species and we found no antigen processing or MHC framework genes on the MHC gene-containing clones. Our findings indicate substantial differences in MHC organization in tuatara compared with mammalian and avian MHCs and highlight the dynamic nature of the MHC. Further sequencing and annotation of tuatara and other reptile MHCs will determine if the tuatara MHC is representative of nonavian reptiles in general. PMID:25953959
Glinsky, Gennadi V.
2016-01-01
Abstract Thousands of candidate human-specific regulatory sequences (HSRS) have been identified, supporting the hypothesis that unique to human phenotypes result from human-specific alterations of genomic regulatory networks. Collectively, a compendium of multiple diverse families of HSRS that are functionally and structurally divergent from Great Apes could be defined as the backbone of human-specific genomic regulatory networks. Here, the conservation patterns analysis of 18,364 candidate HSRS was carried out requiring that 100% of bases must remap during the alignments of human, chimpanzee, and bonobo sequences. A total of 5,535 candidate HSRS were identified that are: (i) highly conserved in Great Apes; (ii) evolved by the exaptation of highly conserved ancestral DNA; (iii) defined by either the acceleration of mutation rates on the human lineage or the functional divergence from non-human primates. The exaptation of highly conserved ancestral DNA pathway seems mechanistically distinct from the evolution of regulatory DNA segments driven by the species-specific expansion of transposable elements. Genome-wide proximity placement analysis of HSRS revealed that a small fraction of topologically associating domains (TADs) contain more than half of HSRS from four distinct families. TADs that are enriched for HSRS and termed rapidly evolving in humans TADs (revTADs) comprise 0.8–10.3% of 3,127 TADs in the hESC genome. RevTADs manifest distinct correlation patterns between placements of human accelerated regions, human-specific transcription factor-binding sites, and recombination rates. There is a significant enrichment within revTAD boundaries of hESC-enhancers, primate-specific CTCF-binding sites, human-specific RNAPII-binding sites, hCONDELs, and H3K4me3 peaks with human-specific enrichment at TSS in prefrontal cortex neurons (P < 0.0001 in all instances). Present analysis supports the idea that phenotypic divergence of Homo sapiens is driven by the evolution of human-specific genomic regulatory networks via at least two mechanistically distinct pathways of creation of divergent sequences of regulatory DNA: (i) recombination-associated exaptation of the highly conserved ancestral regulatory DNA segments; (ii) human-specific insertions of transposable elements. PMID:27503290
Microbial evolution of sulphate reduction when lateral gene transfer is geographically restricted.
Chi Fru, E
2011-07-01
Lateral gene transfer (LGT) is an important mechanism by which micro-organisms acquire new functions. This process has been suggested to be central to prokaryotic evolution in various environments. However, the influence of geographical constraints on the evolution of laterally acquired genes in microbial metabolic evolution is not yet well understood. In this study, the influence of geographical isolation on the evolution of laterally acquired dissimilatory sulphite reductase (dsr) gene sequences in the sulphate-reducing micro-organisms (SRM) was investigated. Sequences on four continental blocks related to SRM known to have received dsr by LGT were analysed using standard phylogenetic and multidimensional statistical methods. Sequences related to lineages with large genetic diversity correlated positively with habitat divergence. Those affiliated to Thermodesulfobacterium indicated strong biogeographical delineation; hydrothermal-vent sequences clustered independently from hot-spring sequences. Some of the hydrothermal-vent and hot-spring sequences suggested to have been acquired from a common ancestral source may have diverged upon isolation within distinct habitats. In contrast, analysis of some Desulfotomaculum sequences indicated they could have been transferred from different ancestral sources but converged upon isolation within the same niche. These results hint that, after lateral acquisition of dsr genes, barriers to gene flow probably play a strong role in their subsequent evolution.
Evolutionary distances in the twilight zone--a rational kernel approach.
Schwarz, Roland F; Fletcher, William; Förster, Frank; Merget, Benjamin; Wolf, Matthias; Schultz, Jörg; Markowetz, Florian
2010-12-31
Phylogenetic tree reconstruction is traditionally based on multiple sequence alignments (MSAs) and heavily depends on the validity of this information bottleneck. With increasing sequence divergence, the quality of MSAs decays quickly. Alignment-free methods, on the other hand, are based on abstract string comparisons and avoid potential alignment problems. However, in general they are not biologically motivated and ignore our knowledge about the evolution of sequences. Thus, it is still a major open question how to define an evolutionary distance metric between divergent sequences that makes use of indel information and known substitution models without the need for a multiple alignment. Here we propose a new evolutionary distance metric to close this gap. It uses finite-state transducers to create a biologically motivated similarity score which models substitutions and indels, and does not depend on a multiple sequence alignment. The sequence similarity score is defined in analogy to pairwise alignments and additionally has the positive semi-definite property. We describe its derivation and show in simulation studies and real-world examples that it is more accurate in reconstructing phylogenies than competing methods. The result is a new and accurate way of determining evolutionary distances in and beyond the twilight zone of sequence alignments that is suitable for large datasets.
Roessler, Christian G.; Hall, Branwen M.; Anderson, William J.; Ingram, Wendy M.; Roberts, Sue A.; Montfort, William R.; Cordes, Matthew H. J.
2008-01-01
Proteins that share common ancestry may differ in structure and function because of divergent evolution of their amino acid sequences. For a typical diverse protein superfamily, the properties of a few scattered members are known from experiment. A satisfying picture of functional and structural evolution in relation to sequence changes, however, may require characterization of a larger, well chosen subset. Here, we employ a “stepping-stone” method, based on transitive homology, to target sequences intermediate between two related proteins with known divergent properties. We apply the approach to the question of how new protein folds can evolve from preexisting folds and, in particular, to an evolutionary change in secondary structure and oligomeric state in the Cro family of bacteriophage transcription factors, initially identified by sequence-structure comparison of distant homologs from phages P22 and λ. We report crystal structures of two Cro proteins, Xfaso 1 and Pfl 6, with sequences intermediate between those of P22 and λ. The domains show 40% sequence identity but differ by switching of α-helix to β-sheet in a C-terminal region spanning ≈25 residues. Sedimentation analysis also suggests a correlation between helix-to-sheet conversion and strengthened dimerization. PMID:18227506
Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen).
Rambaut, Andrew; Lam, Tommy T; Max Carvalho, Luiz; Pybus, Oliver G
2016-01-01
Gene sequences sampled at different points in time can be used to infer molecular phylogenies on a natural timescale of months or years, provided that the sequences in question undergo measurable amounts of evolutionary change between sampling times. Data sets with this property are termed heterochronous and have become increasingly common in several fields of biology, most notably the molecular epidemiology of rapidly evolving viruses. Here we introduce the cross-platform software tool, TempEst (formerly known as Path-O-Gen), for the visualization and analysis of temporally sampled sequence data. Given a molecular phylogeny and the dates of sampling for each sequence, TempEst uses an interactive regression approach to explore the association between genetic divergence through time and sampling dates. TempEst can be used to (1) assess whether there is sufficient temporal signal in the data to proceed with phylogenetic molecular clock analysis, and (2) identify sequences whose genetic divergence and sampling date are incongruent. Examination of the latter can help identify data quality problems, including errors in data annotation, sample contamination, sequence recombination, or alignment error. We recommend that all users of the molecular clock models implemented in BEAST first check their data using TempEst prior to analysis.
Identification of three duplicated Spin genes in medaka (Oryzias latipes).
Wang, Xiao-Lei; Mei, Jie; Sun, Min; Hong, Yun-Han; Gui, Jian-Fang
2005-05-09
Gene and genomic duplications are very important and frequent events in fish evolution, and the divergence of duplicated genes in sequences and functions is a focus of research on gene evolution. Here, we report the identification and characterization of three duplicated Spindlin (Spin) genes from medaka (Oryzias latipes): OlSpinA, OlSpinB, and OlSpinC. Molecular cloning, genomic DNA Blast analysis and phylogenetic relationship analysis demonstrated that the three duplicated OlSpin genes should belong to gene duplication. Furthermore, Western blot analysis revealed significant expression differences of the three OlSpins among different tissues and during embryogenesis in medaka, and suggested that sequence and functional divergence might have occurred in evolution among them.
Finding functional features in Saccharomyces genomes by phylogenetic footprinting.
Cliften, Paul; Sudarsanam, Priya; Desikan, Ashwin; Fulton, Lucinda; Fulton, Bob; Majors, John; Waterston, Robert; Cohen, Barak A; Johnston, Mark
2003-07-04
The sifting and winnowing of DNA sequence that occur during evolution cause nonfunctional sequences to diverge, leaving phylogenetic footprints of functional sequence elements in comparisons of genome sequences. We searched for such footprints among the genome sequences of six Saccharomyces species and identified potentially functional sequences. Comparison of these sequences allowed us to revise the catalog of yeast genes and identify sequence motifs that may be targets of transcriptional regulatory proteins. Some of these conserved sequence motifs reside upstream of genes with similar functional annotations or similar expression patterns or those bound by the same transcription factor and are thus good candidates for functional regulatory sequences.
Maddock, Simon T; Day, Julia J; Nussbaum, Ronald A; Wilkinson, Mark; Gower, David J
2014-06-01
The hyperoliid frog Tachycnemis seychellensis, the only species of its genus, is endemic to the four largest granitic islands of the Seychelles archipelago and is reliant on freshwater bodies for reproduction. Its presence in the Seychelles is thought to be the product of a transoceanic dispersal, diverging from the genus Heterixalus, its closest living relative (currently endemic to Madagascar), between approximately 10-35Ma. A previous study documented substantial intraspecific morphological variation among island populations and also among populations within the largest island (Mahé). To assess intraspecific genetic variation and to infer the closest living relative(s) of T. seychellensis, DNA sequence data were generated for three mitochondrial and four nuclear markers. These data support a sister-group relationship between T. seychellensis and Heterixalus, with the divergence between the two occurring between approximately 11-19Ma based on cytb p-distances. Low levels of genetic variation were found among major mitochondrial haplotype clades of T. seychellensis (maximum 0.7% p-distance concatenated mtDNA), and samples from each of the islands (except La Digue) comprised multiple mitochondrial haplotype clades. Two nuclear genes (rag1 and tyr) showed no variation, and the other two (rho and pomc) lacked any notable geographic structuring, counter to patterns observed within presumably more vagile Seychelles taxa such as lizards. The low levels of genetic variation and phylogeographic structure support an interpretation that there is a single but morphologically highly variable species of Seychelles treefrog. The contrasting genetic and morphological intraspecific variation may be attributable to relatively recent admixture during low sea-level stands, ecophenotypic plasticity, local adaptation to different environmental conditions, and/or current and previously small population sizes. Low genetic phylogeographic structure but substantial morphological variation is unusual within anurans. Copyright © 2014 Elsevier Inc. All rights reserved.
Tobler, Michael; Dewitt, Thomas J; Schlupp, Ingo; García de León, Francisco J; Herrmann, Roger; Feulner, Philine G D; Tiedemann, Ralph; Plath, Martin
2008-10-01
Divergent natural selection drives evolutionary diversification. It creates phenotypic diversity by favoring developmental plasticity within populations or genetic differentiation and local adaptation among populations. We investigated phenotypic and genetic divergence in the livebearing fish Poecilia mexicana along two abiotic environmental gradients. These fish typically inhabit nonsulfidic surface rivers, but also colonized sulfidic and cave habitats. We assessed phenotypic variation among a factorial combination of habitat types using geometric and traditional morphometrics, and genetic divergence using quantitative and molecular genetic analyses. Fish in caves (sulfidic or not) exhibited reduced eyes and slender bodies. Fish from sulfidic habitats (surface or cave) exhibited larger heads and longer gill filaments. Common-garden rearing suggested that these morphological differences are partly heritable. Population genetic analyses using microsatellites as well as cytochrome b gene sequences indicate high population differentiation over small spatial scale and very low rates of gene flow, especially among different habitat types. This suggests that divergent environmental conditions constitute barriers to gene flow. Strong molecular divergence over short distances as well as phenotypic and quantitative genetic divergence across habitats in directions classic to fish ecomorphology suggest that divergent selection is structuring phenotypic variation in this system.
Phylogenetic divergence of cell biological features
2018-01-01
Most cellular features have a range of states, but understanding the mechanisms responsible for interspecific divergence is a challenge for evolutionary cell biology. Models are developed for the distribution of mean phenotypes likely to evolve under the joint forces of mutation and genetic drift in the face of constant selection pressures. Mean phenotypes will deviate from optimal states to a degree depending on the effective population size, potentially leading to substantial divergence in the absence of diversifying selection. The steady-state distribution for the mean can even be bimodal, with one domain being largely driven by selection and the other by mutation pressure, leading to the illusion of phenotypic shifts being induced by movement among alternative adaptive domains. These results raise questions as to whether lineage-specific selective pressures are necessary to account for interspecific divergence, providing a possible platform for the establishment of null models for the evolution of cell-biological traits. PMID:29927740
Neopolyploidy and diversification in Heuchera grossulariifolia
Oswald, Benjamin P.; Nuismer, Scott L.
2013-01-01
Newly formed polyploid lineages must contend with several obstacles to avoid extinction, including minority cytotype exclusion, competition, and inbreeding depression. If polyploidization results in immediate divergence of phenotypic characters these hurdles may be reduced and establishment made more likely. In addition, if polyploidization alters the phenotypic and genotypic associations between traits, i.e. the P and G matrices, polyploids may be able to explore novel evolutionary paths, facilitating their divergence and successful establishment. Here we report results from a study of the perennial plant Heuchera grossulariifolia in which the phenotypic divergence and changes in phenotypic and genotypic covariance matrices caused by neopolyploidization have been estimated. Our results reveal that polyploidization causes immediate divergence for traits relevant to establishment and results in significant changes in the structure of the phenotypic covariance matrix. In contrast, our results do not provide evidence that polyploidization results in immediate and substantial shifts in the genetic covariance matrix. PMID:21143472
Genomics of parallel adaptation at two timescales in Drosophila
Begun, David J.
2017-01-01
Two interesting unanswered questions are the extent to which both the broad patterns and genetic details of adaptive divergence are repeatable across species, and the timescales over which parallel adaptation may be observed. Drosophila melanogaster is a key model system for population and evolutionary genomics. Findings from genetics and genomics suggest that recent adaptation to latitudinal environmental variation (on the timescale of hundreds or thousands of years) associated with Out-of-Africa colonization plays an important role in maintaining biological variation in the species. Additionally, studies of interspecific differences between D. melanogaster and its sister species D. simulans have revealed that a substantial proportion of proteins and amino acid residues exhibit adaptive divergence on a roughly few million years long timescale. Here we use population genomic approaches to attack the problem of parallelism between D. melanogaster and a highly diverged conger, D. hydei, on two timescales. D. hydei, a member of the repleta group of Drosophila, is similar to D. melanogaster, in that it too appears to be a recently cosmopolitan species and recent colonizer of high latitude environments. We observed parallelism both for genes exhibiting latitudinal allele frequency differentiation within species and for genes exhibiting recurrent adaptive protein divergence between species. Greater parallelism was observed for long-term adaptive protein evolution and this parallelism includes not only the specific genes/proteins that exhibit adaptive evolution, but extends even to the magnitudes of the selective effects on interspecific protein differences. Thus, despite the roughly 50 million years of time separating D. melanogaster and D. hydei, and despite their considerably divergent biology, they exhibit substantial parallelism, suggesting the existence of a fundamental predictability of adaptive evolution in the genus. PMID:28968391
Genomic architecture of adaptive color pattern divergence and convergence in Heliconius butterflies
Supple, Megan A.; Hines, Heather M.; Dasmahapatra, Kanchon K.; Lewis, James J.; Nielsen, Dahlia M.; Lavoie, Christine; Ray, David A.; Salazar, Camilo; McMillan, W. Owen; Counterman, Brian A.
2013-01-01
Identifying the genetic changes driving adaptive variation in natural populations is key to understanding the origins of biodiversity. The mosaic of mimetic wing patterns in Heliconius butterflies makes an excellent system for exploring adaptive variation using next-generation sequencing. In this study, we use a combination of techniques to annotate the genomic interval modulating red color pattern variation, identify a narrow region responsible for adaptive divergence and convergence in Heliconius wing color patterns, and explore the evolutionary history of these adaptive alleles. We use whole genome resequencing from four hybrid zones between divergent color pattern races of Heliconius erato and two hybrid zones of the co-mimic Heliconius melpomene to examine genetic variation across 2.2 Mb of a partial reference sequence. In the intergenic region near optix, the gene previously shown to be responsible for the complex red pattern variation in Heliconius, population genetic analyses identify a shared 65-kb region of divergence that includes several sites perfectly associated with phenotype within each species. This region likely contains multiple cis-regulatory elements that control discrete expression domains of optix. The parallel signatures of genetic differentiation in H. erato and H. melpomene support a shared genetic architecture between the two distantly related co-mimics; however, phylogenetic analysis suggests mimetic patterns in each species evolved independently. Using a combination of next-generation sequencing analyses, we have refined our understanding of the genetic architecture of wing pattern variation in Heliconius and gained important insights into the evolution of novel adaptive phenotypes in natural populations. PMID:23674305
Deciphering amphibian diversity through DNA barcoding: chances and challenges.
Vences, Miguel; Thomas, Meike; Bonett, Ronald M; Vieites, David R
2005-10-29
Amphibians globally are in decline, yet there is still a tremendous amount of unrecognized diversity, calling for an acceleration of taxonomic exploration. This process will be greatly facilitated by a DNA barcoding system; however, the mitochondrial population structure of many amphibian species presents numerous challenges to such a standardized, single locus, approach. Here we analyse intra- and interspecific patterns of mitochondrial variation in two distantly related groups of amphibians, mantellid frogs and salamanders, to determine the promise of DNA barcoding with cytochrome oxidase subunit I (cox1) sequences in this taxon. High intraspecific cox1 divergences of 7-14% were observed (18% in one case) within the whole set of amphibian sequences analysed. These high values are not caused by particularly high substitution rates of this gene but by generally deep mitochondrial divergences within and among amphibian species. Despite these high divergences, cox1 sequences were able to correctly identify species including disparate geographic variants. The main problems with cox1 barcoding of amphibians are (i) the high variability of priming sites that hinder the application of universal primers to all species and (ii) the observed distinct overlap of intraspecific and interspecific divergence values, which implies difficulties in the definition of threshold values to identify candidate species. Common discordances between geographical signatures of mitochondrial and nuclear markers in amphibians indicate that a single-locus approach can be problematic when high accuracy of DNA barcoding is required. We suggest that a number of mitochondrial and nuclear genes may be used as DNA barcoding markers to complement cox1.
Genome Evolution in the Primary Endosymbiont of Whiteflies Sheds Light on Their Divergence
Santos-Garcia, Diego; Vargas-Chavez, Carlos; Moya, Andrés; Latorre, Amparo; Silva, Francisco J.
2015-01-01
Whiteflies are important agricultural insect pests, whose evolutionary success is related to a long-term association with a bacterial endosymbiont, Candidatus Portiera aleyrodidarum. To completely characterize this endosymbiont clade, we sequenced the genomes of three new Portiera strains covering the two extant whitefly subfamilies. Using endosymbiont and mitochondrial sequences we estimated the divergence dates in the clade and used these values to understand the molecular evolution of the endosymbiont coding sequences. Portiera genomes were maintained almost completely stable in gene order and gene content during more than 125 Myr of evolution, except in the Bemisia tabaci lineage. The ancestor had already lost the genetic information transfer autonomy but was able to participate in the synthesis of all essential amino acids and carotenoids. The time of divergence of the B. tabaci complex was much more recent than previous estimations. The recent divergence of biotypes B (MEAM1 species) and Q (MED species) suggests that they still could be considered strains of the same species. We have estimated the rates of evolution of Portiera genes, synonymous and nonsynonymous, and have detected significant differences among-lineages, with most Portiera lineages evolving very slowly. Although the nonsynonymous rates were much smaller than the synonymous, the genomic dN/dS ratios were similar, discarding selection as the driver of among-lineage variation. We suggest variation in mutation rate and generation time as the responsible factors. In conclusion, the slow evolutionary rates of Portiera may have contributed to its long-term association with whiteflies, avoiding its replacement by a novel and more efficient endosymbiont. PMID:25716826
Gender-Based Screening for Chlamydial Infection and Divergent Infection Trends in Men and Women
Rogers, Susan M.; Turner, Charles F.; Miller, William C.; Erbelding, Emily; Eggleston, Elizabeth; Tan, Sylvia; Roman, Anthony; Hobbs, Marcia; Chromy, James; Muvva, Ravikiran; Ganapathi, Laxminarayana
2014-01-01
Objectives To assess the potential impact of chlamydial screening policy that recommends routine screening of women but not men. Methods Population surveys of probability samples of Baltimore adults aged 18 to 35 years in 1997–1998 and 2006–2009 collected biospecimens to estimate trends in undiagnosed chlamydial infection. Survey estimates are compared to surveillance data on diagnosed chlamydial infections reported to the Health Department. Results Prevalence of undiagnosed chlamydial infection among men increased from 1.6% to 4.0%, but it declined from 4.3% to 3.1% among women (p = 0.028 for test of interaction). The annual (average) number of diagnosed infections was substantially higher among women than men in both time periods and increased among both men and women. Undiagnosed infection prevalence was substantially higher among black than non-black adults (4.0% vs 1.2%, p = 0.042 in 1997–98 and 5.5% vs 0.7%, p<0.001 in 2006–09). Conclusion Divergent trends in undiagnosed chlamydial infection by gender parallel divergent screening recommendations that encourage chlamydial testing for women but not for men. PMID:24586491
Phylogenetic position of avian nocturnal and diurnal raptors.
Mahmood, Muhammad Tariq; McLenachan, Patricia A; Gibb, Gillian C; Penny, David
2014-02-01
We report three new avian mitochondrial genomes, two from widely separated groups of owls and a falcon relative (the Secretarybird). We then report additional progress in resolving Neoavian relationships in that the two groups of owls do come together (it is not just long-branch attraction), and the Secretarybird is the deepest divergence on the Accipitridae lineage. This is now agreed between mitochondrial and nuclear sequences. There is no evidence for the monophyly of the combined three groups of raptors (owls, eagles, and falcons), and again this is agreed by nuclear and mitochondrial sequences. All three groups (owls, accipitrids [eagles], and falcons) do appear to be members of the "higher land birds," and though there may not yet be full "consilience" between mitochondrial and nuclear sequences for the precise order of divergences of the eagles, falcons, and the owls, there is good progress on their relationships.
Phylogenetic Position of Avian Nocturnal and Diurnal Raptors
Mahmood, Muhammad Tariq; McLenachan, Patricia A.; Gibb, Gillian C.; Penny, David
2014-01-01
We report three new avian mitochondrial genomes, two from widely separated groups of owls and a falcon relative (the Secretarybird). We then report additional progress in resolving Neoavian relationships in that the two groups of owls do come together (it is not just long-branch attraction), and the Secretarybird is the deepest divergence on the Accipitridae lineage. This is now agreed between mitochondrial and nuclear sequences. There is no evidence for the monophyly of the combined three groups of raptors (owls, eagles, and falcons), and again this is agreed by nuclear and mitochondrial sequences. All three groups (owls, accipitrids [eagles], and falcons) do appear to be members of the “higher land birds,” and though there may not yet be full “consilience” between mitochondrial and nuclear sequences for the precise order of divergences of the eagles, falcons, and the owls, there is good progress on their relationships. PMID:24448983
Evolution of the chalcone synthase gene family in the genus Ipomoea.
Durbin, M L; Learn, G H; Huttley, G A; Clegg, M T
1995-01-01
The evolution of the chalcone synthase [CHS; malonyl-CoA:4-coumaroyl-CoA malonyltransferase (cyclizing), EC 2.3.1.74] multigene family in the genus Ipomoea is explored. Thirteen CHS genes from seven Ipomoea species (family Convolvulaceae) were sequenced--three from genomic clones and the remainder from PCR amplification with primers designed from the 5' flanking region and the end of the 3' coding region of Ipomoea purpurea Roth. Analysis of the data indicates a duplication of CHS that predates the divergence of the Ipomoea species in this study. The Ipomoea CHS genes are among the most rapidly evolving of the CHS genes sequenced to date. The CHS genes in this study are most closely related to the Petunia CHS-B gene, which is also rapidly evolving and highly divergent from the rest of the Petunia CHS sequences. PMID:7724563
Genome Sequencing and Analysis of Geographically Diverse Clinical Isolates of Herpes Simplex Virus 2
Lamers, Susanna L.; Weiner, Brian; Ray, Stuart C.; Colgrove, Robert C.; Diaz, Fernando; Jing, Lichen; Wang, Kening; Saif, Sakina; Young, Sarah; Henn, Matthew; Laeyendecker, Oliver; Tobian, Aaron A. R.; Cohen, Jeffrey I.; Koelle, David M.; Quinn, Thomas C.; Knipe, David M.
2015-01-01
ABSTRACT Herpes simplex virus 2 (HSV-2), the principal causative agent of recurrent genital herpes, is a highly prevalent viral infection worldwide. Limited information is available on the amount of genomic DNA variation between HSV-2 strains because only two genomes have been determined, the HG52 laboratory strain and the newly sequenced SD90e low-passage-number clinical isolate strain, each from a different geographical area. In this study, we report the nearly complete genome sequences of 34 HSV-2 low-passage-number and laboratory strains, 14 of which were collected in Uganda, 1 in South Africa, 11 in the United States, and 8 in Japan. Our analyses of these genomes demonstrated remarkable sequence conservation, regardless of geographic origin, with the maximum nucleotide divergence between strains being 0.4% across the genome. In contrast, prior studies indicated that HSV-1 genomes exhibit more sequence diversity, as well as geographical clustering. Additionally, unlike HSV-1, little viral recombination between HSV-2 strains could be substantiated. These results are interpreted in light of HSV-2 evolution, epidemiology, and pathogenesis. Finally, the newly generated sequences more closely resemble the low-passage-number SD90e than HG52, supporting the use of the former as the new reference genome of HSV-2. IMPORTANCE Herpes simplex virus 2 (HSV-2) is a causative agent of genital and neonatal herpes. Therefore, knowledge of its DNA genome and genetic variability is central to preventing and treating genital herpes. However, only two full-length HSV-2 genomes have been reported. In this study, we sequenced 34 additional HSV-2 low-passage-number and laboratory viral genomes and initiated analysis of the genetic diversity of HSV-2 strains from around the world. The analysis of these genomes will facilitate research aimed at vaccine development, diagnosis, and the evaluation of clinical manifestations and transmission of HSV-2. This information will also contribute to our understanding of HSV evolution. PMID:26018166
Datasets for evolutionary comparative genomics
Liberles, David A
2005-01-01
Many decisions about genome sequencing projects are directed by perceived gaps in the tree of life, or towards model organisms. With the goal of a better understanding of biology through the lens of evolution, however, there are additional genomes that are worth sequencing. One such rationale for whole-genome sequencing is discussed here, along with other important strategies for understanding the phenotypic divergence of species. PMID:16086856
New Hepatitis B Virus of Cranes That Has an Unexpected Broad Host Range
Prassolov, Alexej; Hohenberg, Heinz; Kalinina, Tatyana; Schneider, Carola; Cova, Lucyna; Krone, Oliver; Frölich, Kai; Will, Hans; Sirma, Hüseyin
2003-01-01
All hepadnaviruses known so far have a very limited host range, restricted to their natural hosts and a few closely related species. This is thought to be due mainly to sequence divergence in the large envelope protein and species-specific differences in host components essential for virus propagation. Here we report an infection of cranes with a novel hepadnavirus, designated CHBV, that has an unexpectedly broad host range and is only distantly evolutionarily related to avihepadnaviruses of related hosts. Direct DNA sequencing of amplified CHBV DNA as well a sequencing of cloned viral genomes revealed that CHBV is most closely related to, although distinct from, Ross' goose hepatitis B virus (RGHBV) and slightly less closely related to duck hepatitis B virus (DHBV). Phylogenetically, cranes are very distant from geese and ducks and are most closely related to herons and storks. Naturally occurring hepadnaviruses in the last two species are highly divergent in sequence from RGHBV and DHBV and do not infect ducks or do so only marginally. In contrast, CHBV from crane sera and recombinant CHBV produced from LMH cells infected primary duck hepatocytes almost as efficiently as DHBV did. This is the first report of a rather broad host range of an avihepadnavirus. Our data imply either usage of similar or identical entry pathways and receptors by DHBV and CHBV, unusual host and virus adaptation mechanisms, or divergent evolution of the host genomes and cellular components required for virus propagation. PMID:12525630
New hepatitis B virus of cranes that has an unexpected broad host range.
Prassolov, Alexej; Hohenberg, Heinz; Kalinina, Tatyana; Schneider, Carola; Cova, Lucyna; Krone, Oliver; Frölich, Kai; Will, Hans; Sirma, Hüseyin
2003-02-01
All hepadnaviruses known so far have a very limited host range, restricted to their natural hosts and a few closely related species. This is thought to be due mainly to sequence divergence in the large envelope protein and species-specific differences in host components essential for virus propagation. Here we report an infection of cranes with a novel hepadnavirus, designated CHBV, that has an unexpectedly broad host range and is only distantly evolutionarily related to avihepadnaviruses of related hosts. Direct DNA sequencing of amplified CHBV DNA as well a sequencing of cloned viral genomes revealed that CHBV is most closely related to, although distinct from, Ross' goose hepatitis B virus (RGHBV) and slightly less closely related to duck hepatitis B virus (DHBV). Phylogenetically, cranes are very distant from geese and ducks and are most closely related to herons and storks. Naturally occurring hepadnaviruses in the last two species are highly divergent in sequence from RGHBV and DHBV and do not infect ducks or do so only marginally. In contrast, CHBV from crane sera and recombinant CHBV produced from LMH cells infected primary duck hepatocytes almost as efficiently as DHBV did. This is the first report of a rather broad host range of an avihepadnavirus. Our data imply either usage of similar or identical entry pathways and receptors by DHBV and CHBV, unusual host and virus adaptation mechanisms, or divergent evolution of the host genomes and cellular components required for virus propagation.
Divergence and Mosaicism among Virulent Soil Phages of the Burkholderia cepacia Complex‡
Summer, Elizabeth J.; Gonzalez, Carlos F.; Bomer, Morgan; Carlile, Thomas; Embry, Addie; Kucherka, Amalie M.; Lee, Jonte; Mebane, Leslie; Morrison, William C.; Mark, Louise; King, Maria D.; LiPuma, John J.; Vidaver, Anne K.; Young, Ry
2006-01-01
We have determined the genomic sequences of four virulent myophages, Bcep1, Bcep43, BcepB1A, and Bcep781, whose hosts are soil isolates of the Burkholderia cepacia complex. Despite temporal and spatial separations between initial isolations, three of the phages (Bcep1, Bcep43, and Bcep781, designated the Bcep781 group) exhibit 87% to 99% sequence identity to one another and most coding region differences are due to synonymous nucleotide substitutions, a hallmark of neutral genetic drift. Phage BcepB1A has a very different genome organization but is clearly a mosaic with respect to many of the genes of the Bcep781 group, as is a defective prophage element in Photorhabdus luminescens. Functions were assigned to 27 out of 71 predicted genes of Bcep1 despite extreme sequence divergence. Using a lambda repressor fusion technique, 10 Bcep781-encoded proteins were identified for their ability to support homotypic interactions. While head and tail morphogenesis genes have retained canonical gene order despite extreme sequence divergence, genes involved in DNA metabolism and host lysis are not organized as in other phages. This unusual genome arrangement may contribute to the ability of the Bcep781-like phages to maintain a unified genomic type. However, the Bcep781 group phages can also engage in lateral gene transfer events with otherwise unrelated phages, a process that contributes to the broader-scale genomic mosaicism prevalent among the tailed phages. PMID:16352842
Divergence, differential methylation and interspersion of melon satellite DNA sequences.
Shmookler Reis, R; Timmis, J N; Ingle, J
1981-01-01
Melon (Cucumis melo) satellite DNA consists of two components, Q and S, each with a buoyant density in CsCl of 1.707 g/ml, but differing by 9 degrees C in "melting" temperature. These physical properties appear to be in contradiction, since both depend on G + C content. In order to resolve this anomaly, base compositions were directly determined for isolated fractions. the low-"melting" component S contains 41.8% G + C, with 6% of C present as 5-methylcytosine, whereas Q DNA contains 54% G + C, with 41% of C methylated. Analyses of restriction site loss agreed well with the direct determinations of methylation and divergence, and indicated some clustering of methylated sites in Q DNA. Analysis of restricted main-band DNA by hydridization with RNA complementary to Q satellite DNA ("Southern transfer") showed satellite Q tandem arrays interspersed in DNA of main-band density. Sequence divergence and extent of methylation did not appear to depend on whether a repeat array was present as satellite or interspersed in main-band DNA. Hydridization in situ indicated considerable heterogeneity in the genomic proportion of the Q-DNA sequences in melon fruit nuclei, implying over- and under-representation consistent with extensive unequal recombination in satellite Q tandem arrays. The cucumber, Cucumis sativus, contains less than 8% as much Q-homologous DNA per genome as the melon, suggesting rapid evolutionary gain or loss of these tandem repeat sequences. Images Fig. 2. PLATE 1 Fig. 4. Fig. 10. PMID:6172117
Testing the molecular clock using mechanistic models of fossil preservation and molecular evolution.
Warnock, Rachel C M; Yang, Ziheng; Donoghue, Philip C J
2017-06-28
Molecular sequence data provide information about relative times only, and fossil-based age constraints are the ultimate source of information about absolute times in molecular clock dating analyses. Thus, fossil calibrations are critical to molecular clock dating, but competing methods are difficult to evaluate empirically because the true evolutionary time scale is never known. Here, we combine mechanistic models of fossil preservation and sequence evolution in simulations to evaluate different approaches to constructing fossil calibrations and their impact on Bayesian molecular clock dating, and the relative impact of fossil versus molecular sampling. We show that divergence time estimation is impacted by the model of fossil preservation, sampling intensity and tree shape. The addition of sequence data may improve molecular clock estimates, but accuracy and precision is dominated by the quality of the fossil calibrations. Posterior means and medians are poor representatives of true divergence times; posterior intervals provide a much more accurate estimate of divergence times, though they may be wide and often do not have high coverage probability. Our results highlight the importance of increased fossil sampling and improved statistical approaches to generating calibrations, which should incorporate the non-uniform nature of ecological and temporal fossil species distributions. © 2017 The Authors.
Gómez, Africa; Serra, Manuel; Carvalho, Gary R; Lunt, David H
2002-07-01
Continental lake-dwelling zooplanktonic organisms have long been considered cosmopolitan species with little geographic variation in spite of the isolation of their habitats. Evidence of morphological cohesiveness and high dispersal capabilities support this interpretation. However, this view has been challenged recently as many such species have been shown either to comprise cryptic species complexes or to exhibit marked population genetic differentiation and strong phylogeographic structuring at a regional scale. Here we investigate the molecular phylogeny of the cosmopolitan passively dispersing rotifer Brachionus plicatilis (Rotifera: Monogononta) species complex using nucleotide sequence variation from both nuclear (ribosomal internal transcribed spacer 1, ITS1) and mitochondrial (cytochrome c oxidase subunit I, COI) genes. Analysis of rotifer resting eggs from 27 salt lakes in the Iberian Peninsula plus lakes from four continents revealed nine genetically divergent lineages. The high level of sequence divergence, absence of hybridization, and extensive sympatry observed support the specific status of these lineages. Sequence divergence estimates indicate that the B. plicatilis complex began diversifying many millions of years ago, yet has showed relatively high levels of morphological stasis. We discuss these results in relation to the ecology and genetics of aquatic invertebrates possessing dispersive resting propagules and address the apparent contradiction between zooplanktonic population structure and their morphological stasis.
Colihueque, Nelson; Gantz, Alberto; Rau, Jaime Ricardo; Parraguez, Margarita
2015-01-01
Abstract In this paper new mitochondrial COI sequences of Common Barn Owl Tyto alba (Scopoli, 1769) and Short-eared Owl Asio flammeus (Pontoppidan, 1763) from southern Chile are reported and compared with sequences from other parts of the World. The intraspecific genetic divergence (mean p-distance) was 4.6 to 5.5% for the Common Barn Owl in comparison with specimens from northern Europe and Australasia and 3.1% for the Short-eared Owl with respect to samples from north America, northern Europe and northern Asia. Phylogenetic analyses revealed three distinctive groups for the Common Barn Owl: (i) South America (Chile and Argentina) plus Central and North America, (ii) northern Europe and (iii) Australasia, and two distinctive groups for the Short-eared Owl: (i) South America (Chile and Argentina) and (ii) north America plus northern Europe and northern Asia. The level of genetic divergence observed in both species exceeds the upper limit of intraspecific comparisons reported previously for Strigiformes. Therefore, this suggests that further research is needed to assess the taxonomic status, particularly for the Chilean populations that, to date, have been identified as belonging to these species through traditional taxonomy. PMID:26668551
Wang, Qian; Abbott, Richard J; Yu, Qiu-Shi; Lin, Kao; Liu, Jian-Quan
2013-07-01
Pleistocene climate change has had an important effect in shaping intraspecific genetic variation in many species; however, its role in driving speciation is less clear. We examined the possibility of a Pleistocene origin of the only two representatives of the genus Pugionium (Brassicaceae), Pugionium cornutum and Pugionium dolabratum, which occupy different desert habitats in northwest China. We surveyed sequence variation for internal transcribed spacer (ITS), three chloroplast (cp) DNA fragments, and eight low-copy nuclear genes among individuals sampled from 11 populations of each species across their geographic ranges. One ITS mutation distinguished the two species, whereas mutations in cpDNA and the eight low-copy nuclear gene sequences were not species-specific. Although interspecific divergence varied greatly among nuclear gene sequences, in each case divergence was estimated to have occurred within the Pleistocene when deserts expanded in northwest China. Our findings point to the importance of Pleistocene climate change, in this case an increase in aridity, as a cause of speciation in Pugionium as a result of divergence in different habitats that formed in association with the expansion of deserts in China. © 2013 The Authors. New Phytologist © 2013 New Phytologist Trust.
LCC demons with divergence term for liver MRI motion correction
NASA Astrophysics Data System (ADS)
Oh, Jihun; Martin, Diego; Skrinjar, Oskar
2010-03-01
Contrast-enhanced liver MR image sequences acquired at multiple times before and after contrast administration have been shown to be critically important for the diagnosis and monitoring of liver tumors and may be used for the quantification of liver inflammation and fibrosis. However, over multiple acquisitions, the liver moves and deforms due to patient and respiratory motion. In order to analyze contrast agent uptake one first needs to correct for liver motion. In this paper we present a method for the motion correction of dynamic contrastenhanced liver MR images. For this purpose we use a modified version of the Local Correlation Coefficient (LCC) Demons non-rigid registration method. Since the liver is nearly incompressible its displacement field has small divergence. For this reason we add a divergence term to the energy that is minimized in the LCC Demons method. We applied the method to four sequences of contrast-enhanced liver MR images. Each sequence had a pre-contrast scan and seven post-contrast scans. For each post-contrast scan we corrected for the liver motion relative to the pre-contrast scan. Quantitative evaluation showed that the proposed method improved the liver alignment relative to the non-corrected and translation-corrected scans and visual inspection showed no visible misalignment of the motion corrected contrast-enhanced scans and pre-contrast scan.
Yamada, Kazuhiko; Nishida-Umehara, Chizuko; Matsuda, Yoichi
2004-03-01
We isolated a new family of satellite DNA sequences from HaeIII- and EcoRI-digested genomic DNA of the Blakiston's fish owl ( Ketupa blakistoni). The repetitive sequences were organized in tandem arrays of the 174 bp element, and localized to the centromeric regions of all macrochromosomes, including the Z and W chromosomes, and microchromosomes. This hybridization pattern was consistent with the distribution of C-band-positive centromeric heterochromatin, and the satellite DNA sequences occupied 10% of the total genome as a major component of centromeric heterochromatin. The sequences were homogenized between macro- and microchromosomes in this species, and therefore intraspecific divergence of the nucleotide sequences was low. The 174 bp element cross-hybridized to the genomic DNA of six other Strigidae species, but not to that of the Tytonidae, suggesting that the satellite DNA sequences are conserved in the same family but fairly divergent between the different families in the Strigiformes. Secondly, the centromeric satellite DNAs were cloned from eight Strigidae species, and the nucleotide sequences of 41 monomer fragments were compared within and between species. Molecular phylogenetic relationships of the nucleotide sequences were highly correlated with both the taxonomy based on morphological traits and the phylogenetic tree constructed by DNA-DNA hybridization. These results suggest that the satellite DNA sequence has evolved by concerted evolution in the Strigidae and that it is a good taxonomic and phylogenetic marker to examine genetic diversity between Strigiformes species.
Renner, S S; Grimm, Guido W; Kapli, Paschalia; Denk, Thomas
2016-07-19
The fossilized birth-death (FBD) model can make use of information contained in multiple fossils representing the same clade, and we here apply this model to infer divergence times in beeches (genus Fagus), using 53 fossils and nuclear sequences for all nine species. We also apply FBD dating to the fern clade Osmundaceae, with about 12 living species and 36 fossils. Fagus nuclear sequences cannot be aligned with those of other Fagaceae, and we therefore use Bayes factors to choose among alternative root positions. The crown group of Fagus is dated to 53 (62-43) Ma; divergence of the sole American species to 44 (51-39) Ma and divergence between Central European F. sylvatica and Eastern Mediterranean F. orientalis to 8.7 (20-1.8) Ma, unexpectedly old. The FBD model can accommodate fossils as sampled ancestors or as extinct or unobserved lineages; however, this makes its raw output, which shows all fossils on short or long branches, problematic to interpret. We use hand-drawn depictions and a bipartition network to illustrate the uncertain placements of fossils. Inferred speciation and extinction rates imply approximately 5× higher evolutionary turnover in Fagus than in Osmundaceae, fitting a hypothesized low turnover in plants adapted to low-nutrient conditions.This article is part of the themed issue 'Dating species divergences using rocks and clocks'. © 2016 The Author(s).
Kapli, Paschalia; Denk, Thomas
2016-01-01
The fossilized birth–death (FBD) model can make use of information contained in multiple fossils representing the same clade, and we here apply this model to infer divergence times in beeches (genus Fagus), using 53 fossils and nuclear sequences for all nine species. We also apply FBD dating to the fern clade Osmundaceae, with about 12 living species and 36 fossils. Fagus nuclear sequences cannot be aligned with those of other Fagaceae, and we therefore use Bayes factors to choose among alternative root positions. The crown group of Fagus is dated to 53 (62–43) Ma; divergence of the sole American species to 44 (51–39) Ma and divergence between Central European F. sylvatica and Eastern Mediterranean F. orientalis to 8.7 (20–1.8) Ma, unexpectedly old. The FBD model can accommodate fossils as sampled ancestors or as extinct or unobserved lineages; however, this makes its raw output, which shows all fossils on short or long branches, problematic to interpret. We use hand-drawn depictions and a bipartition network to illustrate the uncertain placements of fossils. Inferred speciation and extinction rates imply approximately 5× higher evolutionary turnover in Fagus than in Osmundaceae, fitting a hypothesized low turnover in plants adapted to low-nutrient conditions. This article is part of the themed issue ‘Dating species divergences using rocks and clocks’. PMID:27325832
Divergence between human populations estimated from linkage disequilibrium.
Sved, John A; McRae, Allan F; Visscher, Peter M
2008-12-01
Observed linkage disequilibrium (LD) between genetic markers in different populations descended independently from a common ancestral population can be used to estimate their absolute time of divergence, because the correlation of LD between populations will be reduced each generation by an amount that, approximately, depends only on the recombination rate between markers. Although drift leads to divergence in allele frequencies, it has less effect on divergence in LD values. We derived the relationship between LD and time of divergence and verified it with coalescent simulations. We then used HapMap Phase II data to estimate time of divergence between human populations. Summed over large numbers of pairs of loci, we find a positive correlation of LD between African and non-African populations at levels of up to approximately 0.3 cM. We estimate that the observed correlation of LD is consistent with an effective separation time of approximately 1,000 generations or approximately 25,000 years before present. The most likely explanation for such relatively low separation times is the existence of substantial levels of migration between populations after the initial separation. Theory and results from coalescent simulations confirm that low levels of migration can lead to a downward bias in the estimate of separation time.
Mukherjee, Nabanita; Beati, Lorenza; Sellers, Michael; Burton, Laquita; Adamson, Steven; Robbins, Richard G; Moore, Frank; Karim, Shahid
2014-03-01
Birds are capable of carrying ticks and, consequently, tick-transmitted microorganisms over long distances and across geographical barriers such as oceans and deserts. Ticks are hosts for several species of spotted fever group rickettsiae (SFGR), which can be transmitted to vertebrates during blood meals. In this study, the prevalence of this group of rickettsiae was examined in ticks infesting migratory songbirds by using polymerase chain reaction (PCR). During the 2009 and 2010 spring migration season, 2064 northward-migrating passerine songbirds were examined for ticks at Johnson Bayou, Louisiana. A total of 91 ticks was removed from 35 individual songbirds for tick species identification and spotted fever group rickettsia detection. Ticks were identified as Haemaphysalis juxtakochi (n=38, 42%), Amblyomma longirostre (n=22, 24%), Amblyomma nodosum (n=17, 19%), Amblyomma calcaratum (n=11, 12%), Amblyomma maculatum (n=2, 2%), and Haemaphysalis leporispalustris (n=1, 1%) by comparing their 12S rDNA gene sequence to homologous sequences in GenBank. Most of the identified ticks were exotic species originating outside of the United States. The phylogenetic analysis of the 71 ompA gene sequences of the rickettsial strains detected in the ticks revealed the occurrence of 6 distinct rickettsial genotypes. Two genotypes (corresponding to a total of 28 samples) were included in the Candidatus Rickettsia amblyommii clade (less than 1% divergence), 2 of them (corresponding to a total of 14 samples) clustered with Rickettsia sp. "Argentina" with less than 0.2% sequence divergence, and 2 of them (corresponding to a total of 27 samples), although closely related to the R. parkeri-R. africae lineage (2.50-3.41% divergence), exhibited sufficient genetic divergence from its members to possibly constitute a new rickettsial genotype. Overall, there does not seem to be a specific relationship between exotic tick species, the rickettsiae they harbor, or the reservoir competence of the corresponding bird species. Copyright © 2013 Elsevier GmbH. All rights reserved.
Zhang, Yinan; Samee, Md. Abul Hassan; Halfon, Marc S.; Sinha, Saurabh
2014-01-01
Many genes familiar from Drosophila development, such as the so-called gap, pair-rule, and segment polarity genes, play important roles in the development of other insects and in many cases appear to be deployed in a similar fashion, despite the fact that Drosophila-like “long germband” development is highly derived and confined to a subset of insect families. Whether or not these similarities extend to the regulatory level is unknown. Identification of regulatory regions beyond the well-studied Drosophila has been challenging as even within the Diptera (flies, including mosquitoes) regulatory sequences have diverged past the point of recognition by standard alignment methods. Here, we demonstrate that methods we previously developed for computational cis-regulatory module (CRM) discovery in Drosophila can be used effectively in highly diverged (250–350 Myr) insect species including Anopheles gambiae, Tribolium castaneum, Apis mellifera, and Nasonia vitripennis. In Drosophila, we have successfully used small sets of known CRMs as “training data” to guide the search for other CRMs with related function. We show here that although species-specific CRM training data do not exist, training sets from Drosophila can facilitate CRM discovery in diverged insects. We validate in vivo over a dozen new CRMs, roughly doubling the number of known CRMs in the four non-Drosophila species. Given the growing wealth of Drosophila CRM annotation, these results suggest that extensive regulatory sequence annotation will be possible in newly sequenced insects without recourse to costly and labor-intensive genome-scale experiments. We develop a new method, Regulus, which computes a probabilistic score of similarity based on binding site composition (despite the absence of nucleotide-level sequence alignment), and demonstrate similarity between functionally related CRMs from orthologous loci. Our work represents an important step toward being able to trace the evolutionary history of gene regulatory networks and defining the mechanisms underlying insect evolution. PMID:25173756
Kazemian, Majid; Suryamohan, Kushal; Chen, Jia-Yu; Zhang, Yinan; Samee, Md Abul Hassan; Halfon, Marc S; Sinha, Saurabh
2014-09-01
Many genes familiar from Drosophila development, such as the so-called gap, pair-rule, and segment polarity genes, play important roles in the development of other insects and in many cases appear to be deployed in a similar fashion, despite the fact that Drosophila-like "long germband" development is highly derived and confined to a subset of insect families. Whether or not these similarities extend to the regulatory level is unknown. Identification of regulatory regions beyond the well-studied Drosophila has been challenging as even within the Diptera (flies, including mosquitoes) regulatory sequences have diverged past the point of recognition by standard alignment methods. Here, we demonstrate that methods we previously developed for computational cis-regulatory module (CRM) discovery in Drosophila can be used effectively in highly diverged (250-350 Myr) insect species including Anopheles gambiae, Tribolium castaneum, Apis mellifera, and Nasonia vitripennis. In Drosophila, we have successfully used small sets of known CRMs as "training data" to guide the search for other CRMs with related function. We show here that although species-specific CRM training data do not exist, training sets from Drosophila can facilitate CRM discovery in diverged insects. We validate in vivo over a dozen new CRMs, roughly doubling the number of known CRMs in the four non-Drosophila species. Given the growing wealth of Drosophila CRM annotation, these results suggest that extensive regulatory sequence annotation will be possible in newly sequenced insects without recourse to costly and labor-intensive genome-scale experiments. We develop a new method, Regulus, which computes a probabilistic score of similarity based on binding site composition (despite the absence of nucleotide-level sequence alignment), and demonstrate similarity between functionally related CRMs from orthologous loci. Our work represents an important step toward being able to trace the evolutionary history of gene regulatory networks and defining the mechanisms underlying insect evolution. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Troggio, Michela; Šurbanovski, Nada; Bianco, Luca; Moretto, Marco; Giongo, Lara; Banchi, Elisa; Viola, Roberto; Fernández, Felicdad Fernández; Costa, Fabrizio; Velasco, Riccardo; Cestaro, Alessandro; Sargent, Daniel James
2013-01-01
High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the ‘Golden Delicious’ genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies. PMID:23826289
Wen, B; Rikihisa, Y; Fuerst, P A; Chaichanasiriwithaya, W
1995-04-01
Ehrlichia risticii is the causative agent of Potomac horse fever. Variations among the major antigens of different local E. risticii strains have been detected previously. To further assess genetic variability in this species or species complex, the sequences of the 16S rRNA genes of several isolates obtained from sick horses diagnosed as having Potomac horse fever were determined. The sequences of six isolates obtained from Ohio and three isolates obtained from Kentucky were amplified by PCR. Three groups of sequences were identified. The sequences of five of the Ohio isolates were identical to the sequence of the type strain of E. risticii, the Illinois strain. The sequence of one Ohio isolate, isolate 081, was unique; this sequence differed in 10 nucleotides from the sequence of the type strain (level of similarity, 99.3%). The sequences of the three Kentucky isolates were identical to each other, but differed by five bases from the sequence of the type strain (level of similarity, 99.6%). The levels of sequence similarity of isolate 081, the Kentucky isolates, and the type strain to the next most closely related Ehrlichia sp., Ehrlichia sennetsu, were 99.3, 99.2, and 99.2%, respectively. On the basis of the distinct antigenic profiles and the levels of 16S rRNA sequence divergence, isolate 081 is as divergent from the type strain of E. risticii as E. sennetsu is. Therefore, we suggest that strain 081 and the Kentucky isolates may represent two new distinct Ehrlichia species.
Gruber, Karl; Schöning, Caspar; Otte, Marianne; Kinuthia, Wanja; Hasselmann, Martin
2013-09-01
Identifying the forces shaping intraspecific phenotypic and genotypic divergence are of key importance in evolutionary biology. Phenotypic divergence may result from local adaptation or, especially in species with strong gene flow, from pronounced phenotypic plasticity. Here, we examine morphological and genetic divergence among populations of the western honey bee Apis mellifera in the topographically heterogeneous East African region. The currently accepted "mountain refugia hypothesis" states that populations living in disjunct montane forests belong to a different lineage than those in savanna habitats surrounding these forests. We obtained microsatellite data, mitochondrial sequences, and morphometric data from worker honey bees collected from feral colonies in three montane forests and corresponding neighboring savanna regions in Kenya. Honey bee colonies from montane forests showed distinct worker morphology compared with colonies in savanna areas. Mitochondrial sequence data did not support the existence of the two currently accepted subspecies. Furthermore, analyses of the microsatellite data with a Bayesian clustering method did not support the existence of two source populations as it would be expected under the mountain refugia scenario. Our findings suggest that phenotypic plasticity rather than distinct ancestry is the leading cause behind the phenotypic divergence observed between montane forest and savanna honey bees. Our study thus corroborates the idea that high gene flow may select for increased plasticity.
Beet, Clare R; Hogg, Ian D; Collins, Gemma E; Cowan, Don A; Wall, Diana H; Adams, Byron J
2016-09-01
Climate changes are likely to have major influences on the distribution and abundance of Antarctic terrestrial biota. To assess arthropod distribution and diversity within the Ross Sea region, we examined mitochondrial DNA (COI) sequences for three currently recognized species of springtail (Collembola) collected from sites in the vicinity, and to the north of, the Mackay Glacier (77°S). This area acts as a transition between two biogeographic regions (northern and southern Victoria Land). We found populations of highly divergent individuals (5%-11.3% intraspecific sequence divergence) for each of the three putative springtail species, suggesting the possibility of cryptic diversity. Based on molecular clock estimates, these divergent lineages are likely to have been isolated for 3-5 million years. It was during this time that the Western Antarctic Ice Sheet (WAIS) was likely to have completely collapsed, potentially facilitating springtail dispersal via rafting on running waters and open seaways. The reformation of the WAIS would have isolated newly established populations, with subsequent dispersal restricted by glaciers and ice-covered areas. Given the currently limited distributions for these genetically divergent populations, any future changes in species' distributions can be easily tracked through the DNA barcoding of springtails from within the Mackay Glacier ecotone.
Stone, Anne C; Battistuzzi, Fabia U; Kubatko, Laura S; Perry, George H; Trudeau, Evan; Lin, Hsiuman; Kumar, Sudhir
2010-10-27
Here, we report the sequencing and analysis of eight complete mitochondrial genomes of chimpanzees (Pan troglodytes) from each of the three established subspecies (P. t. troglodytes, P. t. schweinfurthii and P. t. verus) and the proposed fourth subspecies (P. t. ellioti). Our population genetic analyses are consistent with neutral patterns of evolution that have been shaped by demography. The high levels of mtDNA diversity in western chimpanzees are unlike those seen at nuclear loci, which may reflect a demographic history of greater female to male effective population sizes possibly owing to the characteristics of the founding population. By using relaxed-clock methods, we have inferred a timetree of chimpanzee species and subspecies. The absolute divergence times vary based on the methods and calibration used, but relative divergence times show extensive uniformity. Overall, mtDNA produces consistently older times than those known from nuclear markers, a discrepancy that is reduced significantly by explicitly accounting for chimpanzee population structures in time estimation. Assuming the human-chimpanzee split to be between 7 and 5 Ma, chimpanzee time estimates are 2.1-1.5, 1.1-0.76 and 0.25-0.18 Ma for the chimpanzee/bonobo, western/(eastern + central) and eastern/central chimpanzee divergences, respectively.
Wang, Sibao; Leclerque, Andreas; Pava-Ripoll, Monica; Fang, Weiguo; St Leger, Raymond J
2009-06-01
Many strains of Metarhizium anisopliae have broad host ranges, but others are specialists and adapted to particular hosts. Patterns of gene duplication, divergence, and deletion in three generalist and three specialist strains were investigated by heterologous hybridization of genomic DNA to genes from the generalist strain Ma2575. As expected, major life processes are highly conserved, presumably due to purifying selection. However, up to 7% of Ma2575 genes were highly divergent or absent in specialist strains. Many of these sequences are conserved in other fungal species, suggesting that there has been rapid evolution and loss in specialist Metarhizium genomes. Some poorly hybridizing genes in specialists were functionally coordinated, indicative of reductive evolution. These included several involved in toxin biosynthesis and sugar metabolism in root exudates, suggesting that specialists are losing genes required to live in alternative hosts or as saprophytes. Several components of mobile genetic elements were also highly divergent or lost in specialists. Exceptionally, the genome of the specialist cricket pathogen Ma443 contained extra insertion elements that might play a role in generating evolutionary novelty. This study throws light on the abundance of orphans in genomes, as 15% of orphan sequences were found to be rapidly evolving in the Ma2575 lineage.
Diehl, Adam G
2018-01-01
Abstract The mouse is widely used as system to study human genetic mechanisms. However, extensive rewiring of transcriptional regulatory networks often confounds translation of findings between human and mouse. Site-specific gain and loss of individual transcription factor binding sites (TFBS) has caused functional divergence of orthologous regulatory loci, and so we must look beyond this positional conservation to understand common themes of regulatory control. Fortunately, transcription factor co-binding patterns shared across species often perform conserved regulatory functions. These can be compared to ‘regulatory sentences’ that retain the same meanings regardless of sequence and species context. By analyzing TFBS co-occupancy patterns observed in four human and mouse cell types, we learned a regulatory grammar: the rules by which TFBS are combined into meaningful regulatory sentences. Different parts of this grammar associate with specific sets of functional annotations regardless of sequence conservation and predict functional signatures more accurately than positional conservation. We further show that both species-specific and conserved portions of this grammar are involved in gene expression divergence and human disease risk. These findings expand our understanding of transcriptional regulatory mechanisms, suggesting that phenotypic divergence and disease risk are driven by a complex interplay between deeply conserved and species-specific transcriptional regulatory pathways. PMID:29361190
Roy, Scott William
2015-12-01
In the deadly human malaria parasite Plasmodium falciparum, several major merozoite surface proteins (MSPs) show a striking pattern of allelic diversity called allelic dimorphism (AD). In AD, the vast majority of observed alleles fall into two highly divergent allelic classes, with recombinant alleles being rare or not observed, presumably due to repression by natural selection (recombination suppression, or RS). The three AD loci, merozoite surface proteins (MSPs) 1, 2, and 6, along with MSP3, which also exhibits RS among four allelic classes, can be collectively called AD/RS. The causes of AD/RS and the evolutionary history of allelic diversity at these loci remain mysterious. The few available sequences from a single closely related chimpanzee parasite, P. reichenowi, have suggested that for 3/4 loci, AD/RS is an ancient state that has been retained in P. falciparum since well before the P. falciparum-P. reichenowi ancestor. On the other hand, based on comparative sequence analysis, we recently suggested that (i) AD/RS P. falciparum loci have undergone interallelic recombination over longer evolutionary times (on the timescale of recent speciation events), and thus (ii) AD/RS may be a recent phenomenon. The recent publication of genomic sequencing efforts for P. gaboni, an outgroup to P. falciparum and P. reichenowi, allows for improved reconstruction of the evolutionary history of these loci. In this work, I report genic sequence for P. gaboni for all four AD/RS P. falciparum loci (MSP1, 2, 3, and 6). Comparison of these sequences with available P. falciparum and P. reichenowi data strengthens the evidence for interallelic recombination over the evolutionary history of these species and also strengthens the case that AD/RS at these loci is ancient. Combined with previous results, these data provide evidence that AD/RS at different loci has evolved at several different times in the evolutionary history of P. falciparum: (i) before the P. gaboni-P. falciparum divergence, for much of MSP1 and MSP3; (ii) between the P. gaboni-P. falciparum and P. reichenowi-P. falciparum divergences, for the 5' end of the AD region of MSP6 and block 3 of MSP1; (iii) near the P. reichenowi-P. falciparum divergence, for the 3' end of the AD region of MSP6; and (iv) after the P. reichenowi-P. falciparum divergence, for MSP2. Based on these results, I suggest a new hypothesis for long-term evolutionary maintenance of AD/RS by recombination within allelic groups. Copyright © 2015 Elsevier B.V. All rights reserved.
The D1-D2 region of the large subunit ribosomal DNA as barcode for ciliates.
Stoeck, T; Przybos, E; Dunthorn, M
2014-05-01
Ciliates are a major evolutionary lineage within the alveolates, which are distributed in nearly all habitats on our planet and are an essential component for ecosystem function, processes and stability. Accurate identification of these unicellular eukaryotes through, for example, microscopy or mating type reactions is reserved to few specialists. To satisfy the demand for a DNA barcode for ciliates, which meets the standard criteria for DNA barcodes defined by the Consortium for the Barcode of Life (CBOL), we here evaluated the D1-D2 region of the ribosomal DNA large subunit (LSU-rDNA). Primer universality for the phylum Ciliophora was tested in silico with available database sequences as well as in the laboratory with 73 ciliate species, which represented nine of 12 ciliate classes. Primers tested in this study were successful for all tested classes. To test the ability of the D1-D2 region to resolve conspecific and congeneric sequence divergence, 63 Paramecium strains were sampled from 24 mating species. The average conspecific D1-D2 variation was 0.18%, whereas congeneric sequence divergence averaged 4.83%. In pairwise genetic distance analyses, we identified a D1-D2 sequence divergence of <0.6% as an ideal threshold to discriminate Paramecium species. Using this definition, only 3.8% of all conspecific and 3.9% of all congeneric sequence comparisons had the potential of false assignments. Neighbour-joining analyses inferred monophyly for all taxa but for two Paramecium octaurelia strains. Here, we present a protocol for easy DNA amplification of single cells and voucher deposition. In conclusion, the presented data pinpoint the D1-D2 region as an excellent candidate for an official CBOL barcode for ciliated protists. © 2013 John Wiley & Sons Ltd.
Morrison, Cheryl L; Iwanowicz, Luke; Work, Thierry M; Fahsbender, Elizabeth; Breitbart, Mya; Adams, Cynthia; Iwanowicz, Deb; Sanders, Lakyn; Ackermann, Mathias; Cornman, Robert S
2018-01-01
Chelonid alphaherpesvirus 5 (ChHV5) is a herpesvirus associated with fibropapillomatosis (FP) in sea turtles worldwide. Single-locus typing has previously shown differentiation between Atlantic and Pacific strains of this virus, with low variation within each geographic clade. However, a lack of multi-locus genomic sequence data hinders understanding of the rate and mechanisms of ChHV5 evolutionary divergence, as well as how these genomic changes may contribute to differences in disease manifestation. To assess genomic variation in ChHV5 among five Hawaii and three Florida green sea turtles, we used high-throughput short-read sequencing of long-range PCR products amplified from tumor tissue using primers designed from the single available ChHV5 reference genome from a Hawaii green sea turtle. This strategy recovered sequence data from both geographic regions for approximately 75% of the predicted ChHV5 coding sequences. The average nucleotide divergence between geographic populations was 1.5%; most of the substitutions were fixed differences between regions. Protein divergence was generally low (average 0.08%), and ranged between 0 and 5.3%. Several atypical genes originally identified and annotated in the reference genome were confirmed in ChHV5 genomes from both geographic locations. Unambiguous recombination events between geographic regions were identified, and clustering of private alleles suggests the prevalence of recombination in the evolutionary history of ChHV5. This study significantly increased the amount of sequence data available from ChHV5 strains, enabling informed selection of loci for future population genetic and natural history studies, and suggesting the (possibly latent) co-infection of individuals by well-differentiated geographic variants.
Morrison, Cheryl L.; Iwanowicz, Luke R.; Work, Thierry M.; Fahsbender, Elizabeth; Breitbart, Mya; Adams, Cynthia; Iwanowicz, Deborah; Sanders, Lakyn; Ackermann, Mathias; Cornman, Robert S.
2018-01-01
Chelonid alphaherpesvirus 5 (ChHV5) is a herpesvirus associated with fibropapillomatosis (FP) in sea turtles worldwide. Single-locus typing has previously shown differentiation between Atlantic and Pacific strains of this virus, with low variation within each geographic clade. However, a lack of multi-locus genomic sequence data hinders understanding of the rate and mechanisms of ChHV5 evolutionary divergence, as well as how these genomic changes may contribute to differences in disease manifestation. To assess genomic variation in ChHV5 among five Hawaii and three Florida green sea turtles, we used high-throughput short-read sequencing of long-range PCR products amplified from tumor tissue using primers designed from the single available ChHV5 reference genome from a Hawaii green sea turtle. This strategy recovered sequence data from both geographic regions for approximately 75% of the predicted ChHV5 coding sequences. The average nucleotide divergence between geographic populations was 1.5%; most of the substitutions were fixed differences between regions. Protein divergence was generally low (average 0.08%), and ranged between 0 and 5.3%. Several atypical genes originally identified and annotated in the reference genome were confirmed in ChHV5 genomes from both geographic locations. Unambiguous recombination events between geographic regions were identified, and clustering of private alleles suggests the prevalence of recombination in the evolutionary history of ChHV5. This study significantly increased the amount of sequence data available from ChHV5 strains, enabling informed selection of loci for future population genetic and natural history studies, and suggesting the (possibly latent) co-infection of individuals by well-differentiated geographic variants.
Population Genomics of Paramecium Species.
Johri, Parul; Krenek, Sascha; Marinov, Georgi K; Doak, Thomas G; Berendonk, Thomas U; Lynch, Michael
2017-05-01
Population-genomic analyses are essential to understanding factors shaping genomic variation and lineage-specific sequence constraints. The dearth of such analyses for unicellular eukaryotes prompted us to assess genomic variation in Paramecium, one of the most well-studied ciliate genera. The Paramecium aurelia complex consists of ∼15 morphologically indistinguishable species that diverged subsequent to two rounds of whole-genome duplications (WGDs, as long as 320 MYA) and possess extremely streamlined genomes. We examine patterns of both nuclear and mitochondrial polymorphism, by sequencing whole genomes of 10-13 worldwide isolates of each of three species belonging to the P. aurelia complex: P. tetraurelia, P. biaurelia, P. sexaurelia, as well as two outgroup species that do not share the WGDs: P. caudatum and P. multimicronucleatum. An apparent absence of global geographic population structure suggests continuous or recent dispersal of Paramecium over long distances. Intergenic regions are highly constrained relative to coding sequences, especially in P. caudatum and P. multimicronucleatum that have shorter intergenic distances. Sequence diversity and divergence are reduced up to ∼100-150 bp both upstream and downstream of genes, suggesting strong constraints imposed by the presence of densely packed regulatory modules. In addition, comparison of sequence variation at non-synonymous and synonymous sites suggests similar recent selective pressures on paralogs within and orthologs across the deeply diverging species. This study presents the first genome-wide population-genomic analysis in ciliates and provides a valuable resource for future studies in evolutionary and functional genetics in Paramecium. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Iwanowicz, Luke; Work, Thierry M.; Fahsbender, Elizabeth; Breitbart, Mya; Adams, Cynthia; Iwanowicz, Deb; Sanders, Lakyn; Ackermann, Mathias; Cornman, Robert S.
2018-01-01
Chelonid alphaherpesvirus 5 (ChHV5) is a herpesvirus associated with fibropapillomatosis (FP) in sea turtles worldwide. Single-locus typing has previously shown differentiation between Atlantic and Pacific strains of this virus, with low variation within each geographic clade. However, a lack of multi-locus genomic sequence data hinders understanding of the rate and mechanisms of ChHV5 evolutionary divergence, as well as how these genomic changes may contribute to differences in disease manifestation. To assess genomic variation in ChHV5 among five Hawaii and three Florida green sea turtles, we used high-throughput short-read sequencing of long-range PCR products amplified from tumor tissue using primers designed from the single available ChHV5 reference genome from a Hawaii green sea turtle. This strategy recovered sequence data from both geographic regions for approximately 75% of the predicted ChHV5 coding sequences. The average nucleotide divergence between geographic populations was 1.5%; most of the substitutions were fixed differences between regions. Protein divergence was generally low (average 0.08%), and ranged between 0 and 5.3%. Several atypical genes originally identified and annotated in the reference genome were confirmed in ChHV5 genomes from both geographic locations. Unambiguous recombination events between geographic regions were identified, and clustering of private alleles suggests the prevalence of recombination in the evolutionary history of ChHV5. This study significantly increased the amount of sequence data available from ChHV5 strains, enabling informed selection of loci for future population genetic and natural history studies, and suggesting the (possibly latent) co-infection of individuals by well-differentiated geographic variants. PMID:29479497
Thompson, Owen A.; Snoek, L. Basten; Nijveen, Harm; Sterken, Mark G.; Volkers, Rita J. M.; Brenchley, Rachel; van’t Hof, Arjen; Bevers, Roel P. J.; Cossins, Andrew R.; Yanai, Itai; Hajnal, Alex; Schmid, Tobias; Perkins, Jaryn D.; Spencer, David; Kruglyak, Leonid; Andersen, Erik C.; Moerman, Donald G.; Hillier, LaDeana W.; Kammenga, Jan E.; Waterston, Robert H.
2015-01-01
The Hawaiian strain (CB4856) of Caenorhabditis elegans is one of the most divergent from the canonical laboratory strain N2 and has been widely used in developmental, population, and evolutionary studies. To enhance the utility of the strain, we have generated a draft sequence of the CB4856 genome, exploiting a variety of resources and strategies. When compared against the N2 reference, the CB4856 genome has 327,050 single nucleotide variants (SNVs) and 79,529 insertion–deletion events that result in a total of 3.3 Mb of N2 sequence missing from CB4856 and 1.4 Mb of sequence present in CB4856 but not present in N2. As previously reported, the density of SNVs varies along the chromosomes, with the arms of chromosomes showing greater average variation than the centers. In addition, we find 61 regions totaling 2.8 Mb, distributed across all six chromosomes, which have a greatly elevated SNV density, ranging from 2 to 16% SNVs. A survey of other wild isolates show that the two alternative haplotypes for each region are widely distributed, suggesting they have been maintained by balancing selection over long evolutionary times. These divergent regions contain an abundance of genes from large rapidly evolving families encoding F-box, MATH, BATH, seven-transmembrane G-coupled receptors, and nuclear hormone receptors, suggesting that they provide selective advantages in natural environments. The draft sequence makes available a comprehensive catalog of sequence differences between the CB4856 and N2 strains that will facilitate the molecular dissection of their phenotypic differences. Our work also emphasizes the importance of going beyond simple alignment of reads to a reference genome when assessing differences between genomes. PMID:25995208
López-Alvarez, Diana; López-Herranz, Maria Luisa; Betekhtin, Alexander; Catalán, Pilar
2012-01-01
Background Brachypodium distachyon s. l. has been widely investigated across the world as a model plant for temperate cereals and biofuel grasses. However, this annual plant shows three cytotypes that have been recently recognized as three independent species, the diploids B. distachyon (2n = 10) and B. stacei (2n = 20) and their derived allotetraploid B. hybridum (2n = 30). Methodology/Principal Findings We propose a DNA barcoding approach that consists of a rapid, accurate and automatable species identification method using the standard DNA sequences of complementary plastid (trnLF) and nuclear (ITS, GI) loci. The highly homogenous but largely divergent B. distachyon and B. stacei diploids could be easily distinguished (100% identification success) using direct trnLF (2.4%), ITS (5.5%) or GI (3.8%) sequence divergence. By contrast, B. hybridum could only be unambiguously identified through the use of combined trnLF+ITS sequences (90% of identification success) or by cloned GI sequences (96.7%) that showed 5.4% (ITS) and 4% (GI) rate divergence between the two parental sequences found in the allopolyploid. Conclusion/Significance Our data provide an unbiased and effective barcode to differentiate these three closely-related species from one another. This procedure overcomes the taxonomic uncertainty generated from methods based on morphology or flow cytometry identifications that have resulted in some misclassifications of the model plant and its allies. Our study also demonstrates that the allotetraploid B. hybridum has resulted from bi-directional crosses of B. distachyon and B. stacei plants acting either as maternal or paternal parents. PMID:23240000
Sample storage conditions significantly influence faecal microbiome profiles
Choo, Jocelyn M; Leong, Lex EX; Rogers, Geraint B
2015-01-01
Sequencing-based studies of the human faecal microbiota are increasingly common. Appropriate storage of sample material is essential to avoid the introduction of post-collection bias in microbial community composition. Rapid freezing to −80 °C is commonly considered to be best-practice. However, this is not feasible in many studies, particularly those involving sample collection in participants’ homes. We determined the extent to which a range of stabilisation and storage strategies maintained the composition of faecal microbial community structure relative to freezing to −80 °C. Refrigeration at 4 °C, storage at ambient temperature, and the use of several common preservative buffers (RNAlater, OMNIgene.GUT, Tris-EDTA) were assessed relative to freezing. Following 72 hours of storage, faecal microbial composition was assessed by 16 S rRNA amplicon sequencing. Refrigeration was associated with no significant alteration in faecal microbiota diversity or composition. However, samples stored using other conditions showed substantial divergence compared to −80 °C control samples. Aside from refrigeration, the use of OMNIgene.GUT resulted in the least alteration, while the greatest change was seen in samples stored in Tris-EDTA buffer. The commercially available OMNIgene.GUT kit may provide an important alternative where refrigeration and cold chain transportation is not available. PMID:26572876
Sample storage conditions significantly influence faecal microbiome profiles.
Choo, Jocelyn M; Leong, Lex E X; Rogers, Geraint B
2015-11-17
Sequencing-based studies of the human faecal microbiota are increasingly common. Appropriate storage of sample material is essential to avoid the introduction of post-collection bias in microbial community composition. Rapid freezing to -80 °C is commonly considered to be best-practice. However, this is not feasible in many studies, particularly those involving sample collection in participants' homes. We determined the extent to which a range of stabilisation and storage strategies maintained the composition of faecal microbial community structure relative to freezing to -80 °C. Refrigeration at 4 °C, storage at ambient temperature, and the use of several common preservative buffers (RNAlater, OMNIgene.GUT, Tris-EDTA) were assessed relative to freezing. Following 72 hours of storage, faecal microbial composition was assessed by 16 S rRNA amplicon sequencing. Refrigeration was associated with no significant alteration in faecal microbiota diversity or composition. However, samples stored using other conditions showed substantial divergence compared to -80 °C control samples. Aside from refrigeration, the use of OMNIgene.GUT resulted in the least alteration, while the greatest change was seen in samples stored in Tris-EDTA buffer. The commercially available OMNIgene.GUT kit may provide an important alternative where refrigeration and cold chain transportation is not available.
Godfrey, Rita E.; Lee, David J.; Busby, Stephen J. W.
2017-01-01
Summary The Escherichia coli K‐12 nrf operon encodes a periplasmic nitrite reductase, the expression of which is driven from a single promoter, pnrf. Expression from pnrf is activated by the FNR transcription factor in response to anaerobiosis and further increased in response to nitrite by the response regulator proteins, NarL and NarP. FNR‐dependent transcription is suppressed by the binding of two nucleoid associated proteins, IHF and Fis. As Fis levels increase in cells grown in rich medium, the positioning of its binding site, overlapping the promoter −10 element, ensures that pnrf is sharply repressed. Here, we investigate the expression of the nrf operon promoter from various pathogenic enteric bacteria. We show that pnrf from enterohaemorrhagic E. coli is more active than its K‐12 counterpart, exhibits substantial FNR‐independent activity and is insensitive to nutrient quality, due to an improved −10 element. We also demonstrate that the Salmonella enterica serovar Typhimurium core promoter is more active than previously thought, due to differences around the transcription start site, and that its expression is repressed by downstream sequences. We identify the CsrA RNA binding protein as being responsible for this, and show that CsrA differentially regulates the E. coli K‐12 and Salmonella nrf operons. PMID:28211111
HIV populations are large and accumulate high genetic diversity in a nonlinear fashion.
Maldarelli, Frank; Kearney, Mary; Palmer, Sarah; Stephens, Robert; Mican, JoAnn; Polis, Michael A; Davey, Richard T; Kovacs, Joseph; Shao, Wei; Rock-Kress, Diane; Metcalf, Julia A; Rehm, Catherine; Greer, Sarah E; Lucey, Daniel L; Danley, Kristen; Alter, Harvey; Mellors, John W; Coffin, John M
2013-09-01
HIV infection is characterized by rapid and error-prone viral replication resulting in genetically diverse virus populations. The rate of accumulation of diversity and the mechanisms involved are under intense study to provide useful information to understand immune evasion and the development of drug resistance. To characterize the development of viral diversity after infection, we carried out an in-depth analysis of single genome sequences of HIV pro-pol to assess diversity and divergence and to estimate replicating population sizes in a group of treatment-naive HIV-infected individuals sampled at single (n = 22) or multiple, longitudinal (n = 11) time points. Analysis of single genome sequences revealed nonlinear accumulation of sequence diversity during the course of infection. Diversity accumulated in recently infected individuals at rates 30-fold higher than in patients with chronic infection. Accumulation of synonymous changes accounted for most of the diversity during chronic infection. Accumulation of diversity resulted in population shifts, but the rates of change were low relative to estimated replication cycle times, consistent with relatively large population sizes. Analysis of changes in allele frequencies revealed effective population sizes that are substantially higher than previous estimates of approximately 1,000 infectious particles/infected individual. Taken together, these observations indicate that HIV populations are large, diverse, and slow to change in chronic infection and that the emergence of new mutations, including drug resistance mutations, is governed by both selection forces and drift.
Nadachowska-Brzyska, Krystyna; Burri, Reto; Olason, Pall I.; Kawakami, Takeshi; Smeds, Linnéa; Ellegren, Hans
2013-01-01
Profound knowledge of demographic history is a prerequisite for the understanding and inference of processes involved in the evolution of population differentiation and speciation. Together with new coalescent-based methods, the recent availability of genome-wide data enables investigation of differentiation and divergence processes at unprecedented depth. We combined two powerful approaches, full Approximate Bayesian Computation analysis (ABC) and pairwise sequentially Markovian coalescent modeling (PSMC), to reconstruct the demographic history of the split between two avian speciation model species, the pied flycatcher and collared flycatcher. Using whole-genome re-sequencing data from 20 individuals, we investigated 15 demographic models including different levels and patterns of gene flow, and changes in effective population size over time. ABC provided high support for recent (mode 0.3 my, range <0.7 my) species divergence, declines in effective population size of both species since their initial divergence, and unidirectional recent gene flow from pied flycatcher into collared flycatcher. The estimated divergence time and population size changes, supported by PSMC results, suggest that the ancestral species persisted through one of the glacial periods of middle Pleistocene and then split into two large populations that first increased in size before going through severe bottlenecks and expanding into their current ranges. Secondary contact appears to have been established after the last glacial maximum. The severity of the bottlenecks at the last glacial maximum is indicated by the discrepancy between current effective population sizes (20,000–80,000) and census sizes (5–50 million birds) of the two species. The recent divergence time challenges the supposition that avian speciation is a relatively slow process with extended times for intrinsic postzygotic reproductive barriers to evolve. Our study emphasizes the importance of using genome-wide data to unravel tangled demographic histories. Moreover, it constitutes one of the first examples of the inference of divergence history from genome-wide data in non-model species. PMID:24244198
Nadachowska-Brzyska, Krystyna; Burri, Reto; Olason, Pall I; Kawakami, Takeshi; Smeds, Linnéa; Ellegren, Hans
2013-11-01
Profound knowledge of demographic history is a prerequisite for the understanding and inference of processes involved in the evolution of population differentiation and speciation. Together with new coalescent-based methods, the recent availability of genome-wide data enables investigation of differentiation and divergence processes at unprecedented depth. We combined two powerful approaches, full Approximate Bayesian Computation analysis (ABC) and pairwise sequentially Markovian coalescent modeling (PSMC), to reconstruct the demographic history of the split between two avian speciation model species, the pied flycatcher and collared flycatcher. Using whole-genome re-sequencing data from 20 individuals, we investigated 15 demographic models including different levels and patterns of gene flow, and changes in effective population size over time. ABC provided high support for recent (mode 0.3 my, range <0.7 my) species divergence, declines in effective population size of both species since their initial divergence, and unidirectional recent gene flow from pied flycatcher into collared flycatcher. The estimated divergence time and population size changes, supported by PSMC results, suggest that the ancestral species persisted through one of the glacial periods of middle Pleistocene and then split into two large populations that first increased in size before going through severe bottlenecks and expanding into their current ranges. Secondary contact appears to have been established after the last glacial maximum. The severity of the bottlenecks at the last glacial maximum is indicated by the discrepancy between current effective population sizes (20,000-80,000) and census sizes (5-50 million birds) of the two species. The recent divergence time challenges the supposition that avian speciation is a relatively slow process with extended times for intrinsic postzygotic reproductive barriers to evolve. Our study emphasizes the importance of using genome-wide data to unravel tangled demographic histories. Moreover, it constitutes one of the first examples of the inference of divergence history from genome-wide data in non-model species.
Slatyer, Rachel A; Nash, Michael A; Miller, Adam D; Endo, Yoshinori; Umbers, Kate D L; Hoffmann, Ary A
2014-10-02
Mountain landscapes are topographically complex, creating discontinuous 'islands' of alpine and sub-alpine habitat with a dynamic history. Changing climatic conditions drive their expansion and contraction, leaving signatures on the genetic structure of their flora and fauna. Australia's high country covers a small, highly fragmented area. Although the area is thought to have experienced periods of relative continuity during Pleistocene glacial periods, small-scale studies suggest deep lineage divergence across low-elevation gaps. Using both DNA sequence data and microsatellite markers, we tested the hypothesis that genetic partitioning reflects observable geographic structuring across Australia's mainland high country, in the widespread alpine grasshopper Kosciuscola tristis (Sjösted). We found broadly congruent patterns of regional structure between the DNA sequence and microsatellite datasets, corresponding to strong divergence among isolated mountain regions. Small and isolated mountains in the south of the range were particularly distinct, with well-supported divergence corresponding to climate cycles during the late Pliocene and Pleistocene. We found mixed support, however, for divergence among other mountain regions. Interestingly, within areas of largely contiguous alpine and sub-alpine habitat around Mt Kosciuszko, microsatellite data suggested significant population structure, accompanied by a strong signature of isolation-by-distance. Consistent patterns of strong lineage divergence among different molecular datasets indicate genetic breaks between populations inhabiting geographically distinct mountain regions. Three primary phylogeographic groups were evident in the highly fragmented Victorian high country, while within-region structure detected with microsatellites may reflect more recent population isolation. Despite the small area of Australia's alpine and sub-alpine habitats, their low topographic relief and lack of extensive glaciation, divergence among populations was on the same scale as that detected in much more extensive Northern hemisphere mountain systems. The processes driving divergence in the Australian mountains might therefore differ from their Northern hemisphere counterparts.
Barrett, Craig F; Specht, Chelsea D; Leebens-Mack, Jim; Stevenson, Dennis Wm; Zomlefer, Wendy B; Davis, Jerrold I
2014-01-01
Zingiberales comprise a clade of eight tropical monocot families including approx. 2500 species and are hypothesized to have undergone an ancient, rapid radiation during the Cretaceous. Zingiberales display substantial variation in floral morphology, and several members are ecologically and economically important. Deep phylogenetic relationships among primary lineages of Zingiberales have proved difficult to resolve in previous studies, representing a key region of uncertainty in the monocot tree of life. Next-generation sequencing was used to construct complete plastid gene sets for nine taxa of Zingiberales, which were added to five previously sequenced sets in an attempt to resolve deep relationships among families in the order. Variation in taxon sampling, process partition inclusion and partition model parameters were examined to assess their effects on topology and support. Codon-based likelihood analysis identified a strongly supported clade of ((Cannaceae, Marantaceae), (Costaceae, Zingiberaceae)), sister to (Musaceae, (Lowiaceae, Strelitziaceae)), collectively sister to Heliconiaceae. However, the deepest divergences in this phylogenetic analysis comprised short branches with weak support. Additionally, manipulation of matrices resulted in differing deep topologies in an unpredictable fashion. Alternative topology testing allowed statistical rejection of some of the topologies. Saturation fails to explain observed topological uncertainty and low support at the base of Zingiberales. Evidence for conflict among the plastid data was based on a support metric that accounts for conflicting resampled topologies. Many relationships were resolved with robust support, but the paucity of character information supporting the deepest nodes and the existence of conflict suggest that plastid coding regions are insufficient to resolve and support the earliest divergences among families of Zingiberales. Whole plastomes will continue to be highly useful in plant phylogenetics, but the current study adds to a growing body of literature suggesting that they may not provide enough character information for resolving ancient, rapid radiations.
Kaeding, Allison J.; Ast, Jennifer C.; Pearce, Meghan M.; Urbanczyk, Henryk; Kimura, Seishi; Endo, Hiromitsu; Nakamura, Masaru; Dunlap, Paul V.
2007-01-01
“Photobacterium mandapamensis” (proposed name) and Photobacterium leiognathi are closely related, phenotypically similar marine bacteria that form bioluminescent symbioses with marine animals. Despite their similarity, however, these bacteria can be distinguished phylogenetically by sequence divergence of their luminescence genes, luxCDAB(F)E, by the presence (P. mandapamensis) or the absence (P. leiognathi) of luxF and, as shown here, by the sequence divergence of genes involved in the synthesis of riboflavin, ribBHA. To gain insight into the possibility that P. mandapamensis and P. leiognathi are ecologically distinct, we used these phylogenetic criteria to determine the incidence of P. mandapamensis as a bioluminescent symbiont of marine animals. Five fish species, Acropoma japonicum (Perciformes, Acropomatidae), Photopectoralis panayensis and Photopectoralis bindus (Perciformes, Leiognathidae), Siphamia versicolor (Perciformes, Apogonidae), and Gadella jordani (Gadiformes, Moridae), were found to harbor P. mandapamensis in their light organs. Specimens of A. japonicus, P. panayensis, and P. bindus harbored P. mandapamensis and P. leiognathi together as cosymbionts of the same light organ. Regardless of cosymbiosis, P. mandapamensis was the predominant symbiont of A. japonicum, and it was the apparently exclusive symbiont of S. versicolor and G. jordani. In contrast, P. leiognathi was found to be the predominant symbiont of P. panayensis and P. bindus, and it appears to be the exclusive symbiont of other leiognathid fishes and a loliginid squid. A phylogenetic test for cospeciation revealed no evidence of codivergence between P. mandapamensis and its host fishes, indicating that coevolution apparently is not the basis for this bacterium's host preferences. These results, which are the first report of bacterial cosymbiosis in fish light organs and the first demonstration that P. leiognathi is not the exclusive light organ symbiont of leiognathid fishes, demonstrate that the host species ranges of P. mandapamensis and P. leiognathi are substantially distinct. The host range difference underscores possible differences in the environmental distributions and physiologies of these two bacterial species. PMID:17369329
Barrett, Craig F.; Specht, Chelsea D.; Leebens-Mack, Jim; Stevenson, Dennis Wm.; Zomlefer, Wendy B.; Davis, Jerrold I.
2014-01-01
Background and Aims Zingiberales comprise a clade of eight tropical monocot families including approx. 2500 species and are hypothesized to have undergone an ancient, rapid radiation during the Cretaceous. Zingiberales display substantial variation in floral morphology, and several members are ecologically and economically important. Deep phylogenetic relationships among primary lineages of Zingiberales have proved difficult to resolve in previous studies, representing a key region of uncertainty in the monocot tree of life. Methods Next-generation sequencing was used to construct complete plastid gene sets for nine taxa of Zingiberales, which were added to five previously sequenced sets in an attempt to resolve deep relationships among families in the order. Variation in taxon sampling, process partition inclusion and partition model parameters were examined to assess their effects on topology and support. Key Results Codon-based likelihood analysis identified a strongly supported clade of ((Cannaceae, Marantaceae), (Costaceae, Zingiberaceae)), sister to (Musaceae, (Lowiaceae, Strelitziaceae)), collectively sister to Heliconiaceae. However, the deepest divergences in this phylogenetic analysis comprised short branches with weak support. Additionally, manipulation of matrices resulted in differing deep topologies in an unpredictable fashion. Alternative topology testing allowed statistical rejection of some of the topologies. Saturation fails to explain observed topological uncertainty and low support at the base of Zingiberales. Evidence for conflict among the plastid data was based on a support metric that accounts for conflicting resampled topologies. Conclusions Many relationships were resolved with robust support, but the paucity of character information supporting the deepest nodes and the existence of conflict suggest that plastid coding regions are insufficient to resolve and support the earliest divergences among families of Zingiberales. Whole plastomes will continue to be highly useful in plant phylogenetics, but the current study adds to a growing body of literature suggesting that they may not provide enough character information for resolving ancient, rapid radiations. PMID:24280362
Aslamkhan, Amy G.; Thompson, Deborah M.; Perry, Jennifer L.; Bleasby, Kelly; Wolff, Natascha A.; Barros, Scott; Miller, David S.; Pritchard, John B.
2007-01-01
The flounder renal organic anion transporter (fOat) has substantial sequence homology to mammalian basolateral organic anion transporter orthologs (OAT1/Oat1 and OAT3/Oat3), suggesting that fOat may have functional properties of both mammalian forms. We therefore compared uptake of various substrates by rat Oat1 and Oat3 and human OAT1 and OAT3 with the fOat clone expressed in Xenopus oocytes. These data confirm that estrone sulfate is an excellent substrate for mammalian OAT3/Oat3 transporters but not for OAT1/Oat1 transporters. In contrast, 2,4-dichlorophenoxyacetic acid and adefovir are better transported by mammalian OAT1/Oat1 than by the OAT3/Oat3 clones. All three substrates were well transported by fOat-expressing Xenopus oocytes. fOat Km values were comparable to those obtained for mammalian OAT/Oat1/3 clones. We also characterized the ability of these substrates to inhibit uptake of the fluorescent substrate fluorescein in intact teleost proximal tubules isolated from the winter flounder (Pseudopleuronectes americanus) and killifish (Fundulus heteroclitus). The rank order of the IC50 values for inhibition of cellular fluorescein accumulation was similar to that for the Km values obtained in fOat-expressing oocytes, suggesting that fOat may be the primary teleost renal basolateral Oat. Assessment of the zebrafish (Danio rerio) genome indicated the presence of a single Oat (zfOat) with similarity to both mammalian OAT1/Oat1 and OAT3/Oat3. The puffer fish (Takifugu rubripes) also has an Oat (pfOat) similar to mammalian OAT1/Oat1 and OAT3/Oat3 members. Furthermore, phylogenetic analyses argue that the teleost Oat1/3-like genes diverged from a common ancestral gene in advance of the divergence of the mammalian OAT1/Oat1, OAT3/Oat3, and, possibly, Oat6 genes. PMID:16857889
Zienius, D; Lelešius, R; Kavaliauskis, H; Stankevičius, A; Šalomskas, A
2016-01-01
The aim of the present study was to detect canine parvovirus (CPV) from faecal samples of clinically ill domestic dogs by polymerase chain reaction (PCR) followed by VP2 gene partial sequencing and molecular characterization of circulating strains in Lithuania. Eleven clinically and antigen-tested positive dog faecal samples, collected during the period of 2014-2015, were investigated by using PCR. The phylogenetic investigations indicated that the Lithuanian CPV VP2 partial sequences (3025-3706 cds) were closely related and showed 99.0-99.9% identity. All Lithuanian sequences were associated with one phylogroup, but grouped in different clusters. Ten of investigated Lithuanian CPV VP2 sequences were closely associated with CPV 2a antigenic variant (99.4% nt identity). Five CPV VP2 sequences from Lithuania were related to CPV-2a, but were rather divergent (6.8 nt differences). Only one CPV VP2 sequence from Lithuania was associated (99.3% nt identity) with CPV-2b VP2 sequences from France, Italy, USA and Korea. The four of eleven investigated Lithuanian dogs with CPV infection symptoms were vaccinated with CPV-2 vaccine, but their VP2 sequences were phylogenetically distantly associated with CPV vaccine strains VP2 sequences (11.5-15.8 nt differences). Ten Lithuanian CPV VP2 sequences had monophyletic relations among the close geographically associated samples, but five of them were rather divergent (1.0% less sequence similarity). The one Lithuanian CPV VP2 sequence was closely related with CPV-2b antigenic variant. All the Lithuanian CPV VP2 partial sequences were conservative and phylogenetically low associated with most commonly used CPV vaccine strains.
Recursive sequences in first-year calculus
NASA Astrophysics Data System (ADS)
Krainer, Thomas
2016-02-01
This article provides ready-to-use supplementary material on recursive sequences for a second-semester calculus class. It equips first-year calculus students with a basic methodical procedure based on which they can conduct a rigorous convergence or divergence analysis of many simple recursive sequences on their own without the need to invoke inductive arguments as is typically required in calculus textbooks. The sequences that are accessible to this kind of analysis are predominantly (eventually) monotonic, but also certain recursive sequences that alternate around their limit point as they converge can be considered.
Molecular analysis of Aspergillus section Flavi isolated from Brazil nuts.
Gonçalves, Juliana Soares; Ferracin, Lara Munique; Carneiro Vieira, Maria Lucia; Iamanaka, Beatriz Thie; Taniwaki, Marta Hiromi; Pelegrinelli Fungaro, Maria Helena
2012-04-01
Brazil nuts are an important export market in its main producing countries, including Brazil, Bolivia, and Peru. Approximately 30,000 tons of Brazil nuts are harvested each year. However, substantial nut contamination by Aspergillus section Flavi occurs with subsequent production of aflatoxins. In our study, Aspergillus section Flavi were isolated from Brazil nuts (Bertholletia excelsa), and identified by morphological and molecular means. We obtained 241 isolates from nut samples, 41% positive for aflatoxin production. Eighty-one isolates were selected for molecular investigation. Pairwise genetic distances among isolates and phylogenetic relationships were assessed. The following Aspergillus species were identified: A. flavus, A. caelatus, A. nomius, A. tamarii, A. bombycis, and A. arachidicola. Additionally, molecular profiles indicated a high level of nucleotide variation within β-tubulin and calmodulin gene sequences associated with high genetic divergence from RAPD data. Among the 81 isolates analyzed by molecular means, three of them were phylogenetically distinct from all other isolates representing the six species of section Flavi. A putative novel species was identified based on molecular profiles.
The Revolution Continues: Newly Discovered Systems Expand the CRISPR-Cas Toolkit.
Murugan, Karthik; Babu, Kesavan; Sundaresan, Ramya; Rajan, Rakhi; Sashital, Dipali G
2017-10-05
CRISPR-Cas systems defend prokaryotes against bacteriophages and mobile genetic elements and serve as the basis for revolutionary tools for genetic engineering. Class 2 CRISPR-Cas systems use single Cas endonucleases paired with guide RNAs to cleave complementary nucleic acid targets, enabling programmable sequence-specific targeting with minimal machinery. Recent discoveries of previously unidentified CRISPR-Cas systems have uncovered a deep reservoir of potential biotechnological tools beyond the well-characterized Type II Cas9 systems. Here we review the current mechanistic understanding of newly discovered single-protein Cas endonucleases. Comparison of these Cas effectors reveals substantial mechanistic diversity, underscoring the phylogenetic divergence of related CRISPR-Cas systems. This diversity has enabled further expansion of CRISPR-Cas biotechnological toolkits, with wide-ranging applications from genome editing to diagnostic tools based on various Cas endonuclease activities. These advances highlight the exciting prospects for future tools based on the continually expanding set of CRISPR-Cas systems. Copyright © 2017 Elsevier Inc. All rights reserved.
Gordon, Kacy L.; Arthur, Robert K.; Ruvinsky, Ilya
2015-01-01
Gene regulatory information guides development and shapes the course of evolution. To test conservation of gene regulation within the phylum Nematoda, we compared the functions of putative cis-regulatory sequences of four sets of orthologs (unc-47, unc-25, mec-3 and elt-2) from distantly-related nematode species. These species, Caenorhabditis elegans, its congeneric C. briggsae, and three parasitic species Meloidogyne hapla, Brugia malayi, and Trichinella spiralis, represent four of the five major clades in the phylum Nematoda. Despite the great phylogenetic distances sampled and the extensive sequence divergence of nematode genomes, all but one of the regulatory elements we tested are able to drive at least a subset of the expected gene expression patterns. We show that functionally conserved cis-regulatory elements have no more extended sequence similarity to their C. elegans orthologs than would be expected by chance, but they do harbor motifs that are important for proper expression of the C. elegans genes. These motifs are too short to be distinguished from the background level of sequence similarity, and while identical in sequence they are not conserved in orientation or position. Functional tests reveal that some of these motifs contribute to proper expression. Our results suggest that conserved regulatory circuitry can persist despite considerable turnover within cis elements. PMID:26020930
Monoparametric family of metrics derived from classical Jensen-Shannon divergence
NASA Astrophysics Data System (ADS)
Osán, Tristán M.; Bussandri, Diego G.; Lamberti, Pedro W.
2018-04-01
Jensen-Shannon divergence is a well known multi-purpose measure of dissimilarity between probability distributions. It has been proven that the square root of this quantity is a true metric in the sense that, in addition to the basic properties of a distance, it also satisfies the triangle inequality. In this work we extend this last result to prove that in fact it is possible to derive a monoparametric family of metrics from the classical Jensen-Shannon divergence. Motivated by our results, an application into the field of symbolic sequences segmentation is explored. Additionally, we analyze the possibility to extend this result into the quantum realm.
Jensen, Annette Bruun; Eilenberg, Jørgen; López Lastra, Claudia
2009-11-01
Three DNA regions (ITS 1, LSU rRNA and GPD) of isolates from the insect-pathogenic fungus genus Entomophthora originating from different fly (Diptera) and aphid (Hemiptera) host taxa were sequenced. The results documented a large genetic diversity among the fly-pathogenic Entomophthora and only minor differences among aphid-pathogenic Entomophthora. The evolutionary time of divergence of the fly and the aphid host taxa included cannot account for this difference. The host-driven divergence of Entomophthora, therefore, has been much greater in flies than in aphids. Host-range differences or a recent host shift to aphid are possible explanations.
Fatal Metacestode Infection in Bornean Orangutan Caused by Unknown Versteria Species
Gendron-Fitzpatrick, Annette; Deering, Kathleen M.; Wallace, Roberta S.; Clyde, Victoria L.; Lauck, Michael; Rosen, Gail E.; Bennett, Andrew J.; Greiner, Ellis C.; O’Connor, David H.
2014-01-01
A captive juvenile Bornean orangutan (Pongo pygmaeus) died from an unknown disseminated parasitic infection. Deep sequencing of DNA from infected tissues, followed by gene-specific PCR and sequencing, revealed a divergent species within the newly proposed genus Versteria (Cestoda: Taeniidae). Versteria may represent a previously unrecognized risk to primate health. PMID:24377497
USDA-ARS?s Scientific Manuscript database
We report on the assembly of the 14,146 base pairs (bp) near complete mitochondrial sequencing of the legume pod borer (LPB), Maruca vitrata (Lepidoptera: Crambidae), which was used to estimate divergence and relationships within the lepidopteran lineage. Arrangement and orientation of 13 protein c...
Komatsu, Ken; Yamashita, Kazuo; Sugawara, Kota; Verbeek, Martin; Fujita, Naoko; Hanada, Kaoru; Uehara-Ichiki, Tamaki; Fuji, Shin-Ichi
2017-02-01
Plantago asiatica mosaic virus (PlAMV) is a member of the genus Potexvirus and has an exceptionally wide host range. It causes severe damage to lilies. Here we report on the complete nucleotide sequences of two new Japanese PlAMV isolates, one from the eudicot weed Viola grypoceras (PlAMV-Vi), and the other from the eudicot shrub Nandina domestica Thunb. (PlAMV-NJ). Their genomes contain five open reading frames (ORFs), which is characteristic of potexviruses. Surprisingly, the isolates showed only 76.0-78.0 % sequence identity with each other and with other PlAMV isolates, including isolates from Japanese lily and American nandina. Amino acid alignments of the replicase coding region encoded by ORF1 showed that the regions between the methyltransferase and helicase domains were less conserved than other regions, with several insertions and/or deletions. Phylogenetic analyses of the full-length nucleotide sequences revealed a moderate correlation between phylogenetic clustering and the original host plants of the PlAMV isolates. This study revealed the presence of two highly divergent PlAMV isolates in Japan.
Choudhary, Kumari S.; Mih, Nathan; Monk, Jonathan; Kavvas, Erol; Yurkovich, James T.; Sakoulas, George; Palsson, Bernhard O.
2018-01-01
Two-component systems (TCSs) consist of a histidine kinase and a response regulator. Here, we evaluated the conservation of the AgrAC TCS among 149 completely sequenced Staphylococcus aureus strains. It is composed of four genes: agrBDCA. We found that: (i) AgrAC system (agr) was found in all but one of the 149 strains, (ii) the agr positive strains were further classified into four agr types based on AgrD protein sequences, (iii) the four agr types not only specified the chromosomal arrangement of the agr genes but also the sequence divergence of AgrC histidine kinase protein, which confers signal specificity, (iv) the sequence divergence was reflected in distinct structural properties especially in the transmembrane region and second extracellular binding domain, and (v) there was a strong correlation between the agr type and the virulence genomic profile of the organism. Taken together, these results demonstrate that bioinformatic analysis of the agr locus leads to a classification system that correlates with the presence of virulence factors and protein structural properties. PMID:29887846
Kim, Dae Hun; Ko, Kwan Soo
2015-07-01
To investigate pmrCAB sequence divergence in 5 species of Acinetobacter baumannii complex, a total of 80 isolates from a Korean hospital were explored. We evaluated nucleotide and amino acid polymorphisms of pmrCAB operon, and phylogenetic trees were constructed for each gene of prmCAB operon. Colistin and polymyxin B susceptibility was determined for all isolates, and multilocus sequence typing was also performed for A. baumannii isolates. Our results showed that each species of A. baumannii complex has divergent pmrCAB operon sequences. We identified a distinct pmrCAB allele allied with Acinetobacter nosocomialis in gene trees. Different grouping in each gene tree suggests sporadic recombination or emergence of pmrCAB genes among Acinetobacter species. Sequence polymorphisms among Acinetobacter species might not be associated with colistin resistance. We revealed that a distinct pmrCAB allele may be widespread across the continents such as North America and Asia and that sporadic genetic recombination or emergence of pmrCAB genes might occur. Copyright © 2015 Elsevier Inc. All rights reserved.
An improved approximate-Bayesian model-choice method for estimating shared evolutionary history
2014-01-01
Background To understand biological diversification, it is important to account for large-scale processes that affect the evolutionary history of groups of co-distributed populations of organisms. Such events predict temporally clustered divergences times, a pattern that can be estimated using genetic data from co-distributed species. I introduce a new approximate-Bayesian method for comparative phylogeographical model-choice that estimates the temporal distribution of divergences across taxa from multi-locus DNA sequence data. The model is an extension of that implemented in msBayes. Results By reparameterizing the model, introducing more flexible priors on demographic and divergence-time parameters, and implementing a non-parametric Dirichlet-process prior over divergence models, I improved the robustness, accuracy, and power of the method for estimating shared evolutionary history across taxa. Conclusions The results demonstrate the improved performance of the new method is due to (1) more appropriate priors on divergence-time and demographic parameters that avoid prohibitively small marginal likelihoods for models with more divergence events, and (2) the Dirichlet-process providing a flexible prior on divergence histories that does not strongly disfavor models with intermediate numbers of divergence events. The new method yields more robust estimates of posterior uncertainty, and thus greatly reduces the tendency to incorrectly estimate models of shared evolutionary history with strong support. PMID:24992937
de Carvalho, Danila Blanco; Congrains, Carlos; Chahad-Ehlers, Samira; Pinotti, Heloisa; Brito, Reinaldo Alves de; da Rosa, João Aristeu
2017-01-01
Chagas disease is one of the main parasitic diseases found in Latin America and it is estimated that between six and seven million people are infected worldwide. Its etiologic agent, the protozoan Trypanosoma cruzi, is transmitted by triatomines, some of which from the genus Rhodnius. Twenty species are currently recognized in this genus, including some closely related species with low levels of morphological differentiation, such as Rhodnius montenegrensis and Rhodnius robustus. In order to investigate genetic differences between these two species, we generated large-scale RNA-sequencing data (consisting of four RNA-seq libraries) from the heads and salivary glands of males of R. montenegrensis and R. robustus. Transcriptome assemblies produced for each species resulted in 64,952 contigs for R. montenegrensis and 70,894 contigs for R. robustus, with N50 of approximately 2,100 for both species. SNP calling based on the more complete R. robustus assembly revealed 3,055 fixed interspecific differences and 216 transcripts with high levels of divergence which contained only fixed differences between the two species. A gene ontology enrichment analysis revealed that these highly differentiated transcripts were enriched for eight GO terms related to AP-2 adaptor complex, as well as other interesting genes that could be involved in their differentiation. The results show that R. montenegrensis and R. robustus have a substantial quantity of fixed interspecific polymorphisms, which suggests a high degree of genetic divergence between the two species and likely corroborates the species status of R. montenegrensis.
Barry, Elizabeth G; Witherspoon, David J; Lampe, David J
2004-02-01
Transposons of the mariner family are widespread in animal genomes and have apparently infected them by horizontal transfer. Most species carry only old defective copies of particular mariner transposons that have diverged greatly from their active horizontally transferred ancestor, while a few contain young, very similar, and active copies. We report here the use of a whole-genome screen in bacteria to isolate somewhat diverged Famar1 copies from the European earwig, Forficula auricularia, that encode functional transposases. Functional and nonfunctional coding sequences of Famar1 and nonfunctional copies of Ammar1 from the European honey bee, Apis mellifera, were sequenced to examine their molecular evolution. No selection for sequence conservation was detected in any clade of a tree derived from these sequences, not even on branches leading to functional copies. This agrees with the current model for mariner transposon evolution that expects neutral evolution within particular hosts, with selection for function occurring only upon horizontal transfer to a new host. Our results further suggest that mariners are not finely tuned genetic entities and that a greater amount of sequence diversification than had previously been appreciated can occur in functional copies in a single host lineage. Finally, this method of isolating active copies can be used to isolate other novel active transposons without resorting to reconstruction of ancestral sequences.
FRAGS: estimation of coding sequence substitution rates from fragmentary data
Swart, Estienne C; Hide, Winston A; Seoighe, Cathal
2004-01-01
Background Rates of substitution in protein-coding sequences can provide important insights into evolutionary processes that are of biomedical and theoretical interest. Increased availability of coding sequence data has enabled researchers to estimate more accurately the coding sequence divergence of pairs of organisms. However the use of different data sources, alignment protocols and methods to estimate substitution rates leads to widely varying estimates of key parameters that define the coding sequence divergence of orthologous genes. Although complete genome sequence data are not available for all organisms, fragmentary sequence data can provide accurate estimates of substitution rates provided that an appropriate and consistent methodology is used and that differences in the estimates obtainable from different data sources are taken into account. Results We have developed FRAGS, an application framework that uses existing, freely available software components to construct in-frame alignments and estimate coding substitution rates from fragmentary sequence data. Coding sequence substitution estimates for human and chimpanzee sequences, generated by FRAGS, reveal that methodological differences can give rise to significantly different estimates of important substitution parameters. The estimated substitution rates were also used to infer upper-bounds on the amount of sequencing error in the datasets that we have analysed. Conclusion We have developed a system that performs robust estimation of substitution rates for orthologous sequences from a pair of organisms. Our system can be used when fragmentary genomic or transcript data is available from one of the organisms and the other is a completely sequenced genome within the Ensembl database. As well as estimating substitution statistics our system enables the user to manage and query alignment and substitution data. PMID:15005802
Orlando, Ludovic; Mauffrey, Jean-François; Cuisin, Jacques; Patton, James L; Hänni, Catherine; Catzeflis, François
2003-04-01
The spiny rat Mesomys hispidus is one of many South American rodents that lack adequate taxonomic definition. The few sampled populations of this broadly distributed trans-Amazonian arboreal rat have come from widely separated regions and are typically highly divergent. The holotype was described in 1817 by A.-G. Desmarest, after Napoleon's army brought it to Paris following the plunder of Lisbon in 1808; however, the locality of origin has remained unknown. Here we examine the taxonomic status of this species by direct comparison of 50 extant individuals with the holotype at the morphometric and genetic levels, the latter based on 331 bp of the mitochondrial cytochrome b gene retrieved from a small skin fragment of the holotype with ancient DNA technology. Extensive sequence divergence is present among samples of M. hispidus collected from throughout its range, from French Guiana across Amazonia to Bolivia and Peru, with at least seven mitochondrial clades recognized (average divergence of 7.7% Kimura 2-parameter distance). Sequence from the holotype is, however, only weakly divergent from those of recent samples from French Guiana. Moreover, the holotype clusters with greater that 99% posterior probability with samples from this part of Amazonia in a discriminant analysis based on 22 cranial and dental measurements. Thus, we suggest that the holotype was originally obtained in eastern Amazonia north of the Amazon River, most likely in the Brazilian state of Amapá. Despite the high level of sequence diversity and marked morphological differences in size across the range of M. hispidus, we continue to regard this assemblage as a single species until additional samples and analyses suggest otherwise. Copyright 2002 Elsevier Science (USA)
Molecular identification and phylogenetic study of Demodex caprae.
Zhao, Ya-E; Cheng, Juan; Hu, Li; Ma, Jun-Xian
2014-10-01
The DNA barcode has been widely used in species identification and phylogenetic analysis since 2003, but there have been no reports in Demodex. In this study, to obtain an appropriate DNA barcode for Demodex, molecular identification of Demodex caprae based on mitochondrial cox1 was conducted. Firstly, individual adults and eggs of D. caprae were obtained for genomic DNA (gDNA) extraction; Secondly, mitochondrial cox1 fragment was amplified, cloned, and sequenced; Thirdly, cox1 fragments of D. caprae were aligned with those of other Demodex retrieved from GenBank; Finally, the intra- and inter-specific divergences were computed and the phylogenetic trees were reconstructed to analyze phylogenetic relationship in Demodex. Results obtained from seven 429-bp fragments of D. caprae showed that sequence identities were above 99.1% among three adults and four eggs. The intraspecific divergences in D. caprae, Demodex folliculorum, Demodex brevis, and Demodex canis were 0.0-0.9, 0.5-0.9, 0.0-0.2, and 0.0-0.5%, respectively, while the interspecific divergences between D. caprae and D. folliculorum, D. canis, and D. brevis were 20.3-20.9, 21.8-23.0, and 25.0-25.3, respectively. The interspecific divergences were 10 times higher than intraspecific ones, indicating considerable barcoding gap. Furthermore, the phylogenetic trees showed that four Demodex species gathered separately, representing independent species; and Demodex folliculorum gathered with canine Demodex, D. caprae, and D. brevis in sequence. In conclusion, the selected 429-bp mitochondrial cox1 gene is an appropriate DNA barcode for molecular classification, identification, and phylogenetic analysis of Demodex. D. caprae is an independent species and D. folliculorum is closer to D. canis than to D. caprae or D. brevis.
2012-01-01
Background The NCBI Conserved Domain Database (CDD) consists of a collection of multiple sequence alignments of protein domains that are at various stages of being manually curated into evolutionary hierarchies based on conserved and divergent sequence and structural features. These domain models are annotated to provide insights into the relationships between sequence, structure and function via web-based BLAST searches. Results Here we automate the generation of conserved domain (CD) hierarchies using a combination of heuristic and Markov chain Monte Carlo (MCMC) sampling procedures and starting from a (typically very large) multiple sequence alignment. This procedure relies on statistical criteria to define each hierarchy based on the conserved and divergent sequence patterns associated with protein functional-specialization. At the same time this facilitates the sequence and structural annotation of residues that are functionally important. These statistical criteria also provide a means to objectively assess the quality of CD hierarchies, a non-trivial task considering that the protein subgroups are often very distantly related—a situation in which standard phylogenetic methods can be unreliable. Our aim here is to automatically generate (typically sub-optimal) hierarchies that, based on statistical criteria and visual comparisons, are comparable to manually curated hierarchies; this serves as the first step toward the ultimate goal of obtaining optimal hierarchical classifications. A plot of runtimes for the most time-intensive (non-parallelizable) part of the algorithm indicates a nearly linear time complexity so that, even for the extremely large Rossmann fold protein class, results were obtained in about a day. Conclusions This approach automates the rapid creation of protein domain hierarchies and thus will eliminate one of the most time consuming aspects of conserved domain database curation. At the same time, it also facilitates protein domain annotation by identifying those pattern residues that most distinguish each protein domain subgroup from other related subgroups. PMID:22726767
Resolving the tips of the tree of life: How much mitochondrialdata doe we need?
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bonett, Ronald M.; Macey, J. Robert; Boore, Jeffrey L.
2005-04-29
Mitochondrial (mt) DNA sequences are used extensively to reconstruct evolutionary relationships among recently diverged animals,and have constituted the most widely used markers for species- and generic-level relationships for the last decade or more. However, most studies to date have employed relatively small portions of the mt-genome. In contrast, complete mt-genomes primarily have been used to investigate deep divergences, including several studies of the amount of mt sequence necessary to recover ancient relationships. We sequenced and analyzed 24 complete mt-genomes from a group of salamander species exhibiting divergences typical of those in many species-level studies. We present the first comprehensive investigationmore » of the amount of mt sequence data necessary to consistently recover the mt-genome tree at this level, using parsimony and Bayesian methods. Both methods of phylogenetic analysis revealed extremely similar results. A surprising number of well supported, yet conflicting, relationships were found in trees based on fragments less than {approx}2000 nucleotides (nt), typical of the vast majority of the thousands of mt-based studies published to date. Large amounts of data (11,500+ nt) were necessary to consistently recover the whole mt-genome tree. Some relationships consistently were recovered with fragments of all sizes, but many nodes required the majority of the mt-genome to stabilize, particularly those associated with short internal branches. Although moderate amounts of data (2000-3000 nt) were adequate to recover mt-based relationships for which most nodes were congruent with the whole mt-genome tree, many thousands of nucleotides were necessary to resolve rapid bursts of evolution. Recent advances in genomics are making collection of large amounts of sequence data highly feasible, and our results provide the basis for comparative studies of other closely related groups to optimize mt sequence sampling and phylogenetic resolution at the ''tips'' of the Tree of Life.« less
2014-01-01
Background Plasmodium vivax is a protozoan parasite with an extensive worldwide distribution, being highly prevalent in Asia as well as in Mesoamerica and South America. In southern Mexico, P. vivax transmission has been endemic and recent studies suggest that these parasites have unique biological and genetic features. The msp1 gene has shown high rate of nucleotide substitutions, deletions, insertions, and its mosaic structure reveals frequent events of recombination, maybe between highly divergent parasite isolates. Methods The nucleotide sequence variation in the polymorphic icb5-6 fragment of the msp1 gene of Mexican and worldwide isolates was analysed. To understand how genotype diversity arises, disperses and persists in Mexico, the genetic structure and genealogical relationships of local isolates were examined. To identify new sequence hybrids and their evolutionary relationships with other P. vivax isolates circulating worldwide two haplotype networks were constructed questioning that two portions of the icb5-6 have different evolutionary history. Results Twelve new msp1 icb5-6 haplotypes of P. vivax from Mexico were identified. These nucleotide sequences show mosaic structure comprising three partially conserved and two variable subfragments and resulted into five different sequence types. The variable subfragment sV1 has undergone recombination events and resulted in hybrid sequences and the haplotype network allocated the Mexican haplotypes to three lineages, corresponding to the Sal I and Belem types, and other more divergent group. In contrast, the network from icb5-6 fragment but not sV1 revealed that the Mexican haplotypes belong to two separate lineages, none of which are closely related to Sal I or Belem sequences. Conclusions These results suggest that the new hybrid haplotypes from southern Mexico were the result of at least three different recombination events. These rearrangements likely resulted from the recombination between haplotypes of highly divergent lineages that are frequently distributed in South America and Asia and diversified rapidly. PMID:24472213
Yi, Zhenzhen; Song, Weibo; Clamp, John C; Chen, Zigui; Gao, Shan; Zhang, Qianqian
2009-03-01
Comprehensive molecular analyses of phylogenetic relationships within euplotid ciliates are relatively rare, and the relationships among some families remain questionable. We performed phylogenetic analyses of the order Euplotida based on new sequences of the gene coding for small-subunit RNA (SSrRNA) from a variety of taxa across the entire order as well as sequences from some of these taxa of other genes (ITS1-5.8S-ITS2 region and histone H4) that have not been included in previous analyses. Phylogenetic trees based on SSrRNA gene sequences constructed with four different methods had a consistent branching pattern that included the following features: (1) the "typical" euplotids comprised a paraphyletic assemblage composed of two divergent clades (family Uronychiidae and families Euplotidae-Certesiidae-Aspidiscidae-Gastrocirrhidae), (2) in the family Uronychiidae, the genera Uronychia and Paradiophrys formed a clearly outlined, well-supported clade that seemed to be rather divergent from Diophrys and Diophryopsis, suggesting that the Diophrys-complex may have had a longer and more separate evolutionary history than previously supposed, (3) inclusion of 12 new SSrRNA sequences in analyses of Euplotidae revealed two new clades of species within the family and cast additional doubt on the present classification of genera within the family, and (4) the intraspecific divergence among five species of Aspidisca was far greater than those of closely related genera. The ITS1-5.8S-ITS2 coding regions and partial histone H4 genes of six morphospecies in the Diophrys-complex were sequenced along with their SSrRNA genes and used to compare phylogenies constructed from single data sets to those constructed from combined sets. Results indicated that combined analyses could be used to construct more reliable, less ambiguous phylogenies of complex groups like the order Euplotida, because they provide a greater amount and diversity of information.
Turmel, Monique; Otis, Christian; Lemieux, Claude
2007-01-01
Background The Streptophyta comprises all land plants and six groups of charophycean green algae. The scaly biflagellate Mesostigma viride (Mesostigmatales) and the sarcinoid Chlorokybus atmophyticus (Chlorokybales) represent the earliest diverging lineages of this phylum. In trees based on chloroplast genome data, these two charophycean green algae are nested in the same clade. To validate this relationship and gain insight into the ancestral state of the mitochondrial genome in the Charophyceae, we sequenced the mitochondrial DNA (mtDNA) of Chlorokybus and compared this genome sequence with those of three other charophycean green algae and the bryophytes Marchantia polymorpha and Physcomitrella patens. Results The Chlorokybus genome differs radically from its 42,424-bp Mesostigma counterpart in size, gene order, intron content and density of repeated elements. At 201,763-bp, it is the largest mtDNA yet reported for a green alga. The 70 conserved genes represent 41.4% of the genome sequence and include nad10 and trnL(gag), two genes reported for the first time in a streptophyte mtDNA. At the gene order level, the Chlorokybus genome shares with its Chara, Chaetosphaeridium and bryophyte homologues eight to ten gene clusters including about 20 genes. Notably, some of these clusters exhibit gene linkages not previously found outside the Streptophyta, suggesting that they originated early during streptophyte evolution. In addition to six group I and 14 group II introns, short repeated sequences accounting for 7.5% of the genome were identified. Mitochondrial trees were unable to resolve the correct position of Mesostigma, due to analytical problems arising from accelerated sequence evolution in this lineage. Conclusion The Chlorokybus and Mesostigma mtDNAs exemplify the marked fluidity of the mitochondrial genome in charophycean green algae. The notion that the mitochondrial genome was constrained to remain compact during charophycean evolution is no longer tenable. Our data raise the possibility that the emergence of land plants was not associated with a substantial gain of intergenic sequences by the mitochondrial genome. PMID:17537252
Segmenting the human genome based on states of neutral genetic divergence.
Kuruppumullage Don, Prabhani; Ananda, Guruprasad; Chiaromonte, Francesca; Makova, Kateryna D
2013-09-03
Many studies have demonstrated that divergence levels generated by different mutation types vary and covary across the human genome. To improve our still-incomplete understanding of the mechanistic basis of this phenomenon, we analyze several mutation types simultaneously, anchoring their variation to specific regions of the genome. Using hidden Markov models on insertion, deletion, nucleotide substitution, and microsatellite divergence estimates inferred from human-orangutan alignments of neutrally evolving genomic sequences, we segment the human genome into regions corresponding to different divergence states--each uniquely characterized by specific combinations of divergence levels. We then parsed the mutagenic contributions of various biochemical processes associating divergence states with a broad range of genomic landscape features. We find that high divergence states inhabit guanine- and cytosine (GC)-rich, highly recombining subtelomeric regions; low divergence states cover inner parts of autosomes; chromosome X forms its own state with lowest divergence; and a state of elevated microsatellite mutability is interspersed across the genome. These general trends are mirrored in human diversity data from the 1000 Genomes Project, and departures from them highlight the evolutionary history of primate chromosomes. We also find that genes and noncoding functional marks [annotations from the Encyclopedia of DNA Elements (ENCODE)] are concentrated in high divergence states. Our results provide a powerful tool for biomedical data analysis: segmentations can be used to screen personal genome variants--including those associated with cancer and other diseases--and to improve computational predictions of noncoding functional elements.
de Souza, Gustavo A.; Arntzen, Magnus Ø.; Fortuin, Suereta; Schürch, Anita C.; Målen, Hiwa; McEvoy, Christopher R. E.; van Soolingen, Dick; Thiede, Bernd; Warren, Robin M.; Wiker, Harald G.
2011-01-01
Precise annotation of genes or open reading frames is still a difficult task that results in divergence even for data generated from the same genomic sequence. This has an impact in further proteomic studies, and also compromises the characterization of clinical isolates with many specific genetic variations that may not be represented in the selected database. We recently developed software called multistrain mass spectrometry prokaryotic database builder (MSMSpdbb) that can merge protein databases from several sources and be applied on any prokaryotic organism, in a proteomic-friendly approach. We generated a database for the Mycobacterium tuberculosis complex (using three strains of Mycobacterium bovis and five of M. tuberculosis), and analyzed data collected from two laboratory strains and two clinical isolates of M. tuberculosis. We identified 2561 proteins, of which 24 were present in M. tuberculosis H37Rv samples, but not annotated in the M. tuberculosis H37Rv genome. We were also able to identify 280 nonsynonymous single amino acid polymorphisms and confirm 367 translational start sites. As a proof of concept we applied the database to whole-genome DNA sequencing data of one of the clinical isolates, which allowed the validation of 116 predicted single amino acid polymorphisms and the annotation of 131 N-terminal start sites. Moreover we identified regions not present in the original M. tuberculosis H37Rv sequence, indicating strain divergence or errors in the reference sequence. In conclusion, we demonstrated the potential of using a merged database to better characterize laboratory or clinical bacterial strains. PMID:21030493
Renz, Adina J.; Meyer, Axel; Kuraku, Shigehiro
2013-01-01
Cartilaginous fishes, divided into Holocephali (chimaeras) and Elasmoblanchii (sharks, rays and skates), occupy a key phylogenetic position among extant vertebrates in reconstructing their evolutionary processes. Their accurate evolutionary time scale is indispensable for better understanding of the relationship between phenotypic and molecular evolution of cartilaginous fishes. However, our current knowledge on the time scale of cartilaginous fish evolution largely relies on estimates using mitochondrial DNA sequences. In this study, making the best use of the still partial, but large-scale sequencing data of cartilaginous fish species, we estimate the divergence times between the major cartilaginous fish lineages employing nuclear genes. By rigorous orthology assessment based on available genomic and transcriptomic sequence resources for cartilaginous fishes, we selected 20 protein-coding genes in the nuclear genome, spanning 2973 amino acid residues. Our analysis based on the Bayesian inference resulted in the mean divergence time of 421 Ma, the late Silurian, for the Holocephali-Elasmobranchii split, and 306 Ma, the late Carboniferous, for the split between sharks and rays/skates. By applying these results and other documented divergence times, we measured the relative evolutionary rate of the Hox A cluster sequences in the cartilaginous fish lineages, which resulted in a lower substitution rate with a factor of at least 2.4 in comparison to tetrapod lineages. The obtained time scale enables mapping phenotypic and molecular changes in a quantitative framework. It is of great interest to corroborate the less derived nature of cartilaginous fish at the molecular level as a genome-wide phenomenon. PMID:23825540
Renz, Adina J; Meyer, Axel; Kuraku, Shigehiro
2013-01-01
Cartilaginous fishes, divided into Holocephali (chimaeras) and Elasmoblanchii (sharks, rays and skates), occupy a key phylogenetic position among extant vertebrates in reconstructing their evolutionary processes. Their accurate evolutionary time scale is indispensable for better understanding of the relationship between phenotypic and molecular evolution of cartilaginous fishes. However, our current knowledge on the time scale of cartilaginous fish evolution largely relies on estimates using mitochondrial DNA sequences. In this study, making the best use of the still partial, but large-scale sequencing data of cartilaginous fish species, we estimate the divergence times between the major cartilaginous fish lineages employing nuclear genes. By rigorous orthology assessment based on available genomic and transcriptomic sequence resources for cartilaginous fishes, we selected 20 protein-coding genes in the nuclear genome, spanning 2973 amino acid residues. Our analysis based on the Bayesian inference resulted in the mean divergence time of 421 Ma, the late Silurian, for the Holocephali-Elasmobranchii split, and 306 Ma, the late Carboniferous, for the split between sharks and rays/skates. By applying these results and other documented divergence times, we measured the relative evolutionary rate of the Hox A cluster sequences in the cartilaginous fish lineages, which resulted in a lower substitution rate with a factor of at least 2.4 in comparison to tetrapod lineages. The obtained time scale enables mapping phenotypic and molecular changes in a quantitative framework. It is of great interest to corroborate the less derived nature of cartilaginous fish at the molecular level as a genome-wide phenomenon.
Briosio-Aguilar, R; Pinto, H A; Rodríguez-Santiago, M A; López-García, K; García-Varela, M; de León, G Pérez-Ponce
2018-06-01
The phylogenetic position of Clinostomum heluans Braun, 1899 within the genus Clinostomum Leidy, 1856 is reported in this study based on sequences of the barcoding region of the mitochondrial cytochrome c oxidase subunit 1 gene ( COX1). Additionally, molecular data are used to link the adult and the metacercariae of the species. The metacercariae of C. heluans were found encysted infecting the cichlid fish Australoheros sp. in Minas Gerais, Brazil, whereas the adults were obtained from the mouth cavity of the Great White Egret, Ardea alba, in Campeche, Mexico. The COX1 sequences obtained for the Mexican clinostomes and the Brazilian metacercaria were almost identical (0.2% molecular divergence), indicating conspecificity. Similar molecular divergence (0.2-0.4%) was found between sequences of C. heluans reported here and Clinostomum sp. 6 previously obtained from a metacercaria recovered from the cichlid Cichlasoma boliviense in Santa Cruz, Bolivia. Both maximum likelihood and Bayesian inference analyses unequivocally showed the conspecificity between C. heluans and Clinostomum sp. 6, which form a monophyletic clade with high nodal support and very low genetic divergence. Moreover, tree topology reveals that C. heluans occupies a basal position with respect to New World species of Clinostomum, although a denser taxon sampling of species within the genus is further required. The metacercaria of C. heluans seems to be specific to cichlid fish because both samples from South America were recovered from species of this fish family, although not closely related.
Martínez-Castilla, León Patricio; Alvarez-Buylla, Elena R.
2003-01-01
Gene duplication is a substrate of evolution. However, the relative importance of positive selection versus relaxation of constraints in the functional divergence of gene copies is still under debate. Plant MADS-box genes encode transcriptional regulators key in various aspects of development and have undergone extensive duplications to form a large family. We recovered 104 MADS sequences from the Arabidopsis genome. Bayesian phylogenetic trees recover type II lineage as a monophyletic group and resolve a branching sequence of monophyletic groups within this lineage. The type I lineage is comprised of several divergent groups. However, contrasting gene structure and patterns of chromosomal distribution between type I and II sequences suggest that they had different evolutionary histories and support the placement of the root of the gene family between these two groups. Site-specific and site-branch analyses of positive Darwinian selection (PDS) suggest that different selection regimes could have affected the evolution of these lineages. We found evidence for PDS along the branch leading to flowering time genes that have a direct impact on plant fitness. Sites with high probabilities of having been under PDS were found in the MADS and K domains, suggesting that these played important roles in the acquisition of novel functions during MADS-box diversification. Detected sites are targets for further experimental analyses. We argue that adaptive changes in MADS-domain protein sequences have been important for their functional divergence, suggesting that changes within coding regions of transcriptional regulators have influenced phenotypic evolution of plants. PMID:14597714
Tiwari, Pratibha; Singh, Noopur; Dixit, Aparna; Choudhury, Devapriya
2014-10-01
The "extended" type of short chain dehydrogenases/reductases (SDR), share a remarkable similarity in their tertiary structures inspite of being highly divergent in their functions and sequences. We have carried out principal component analysis (PCA) on structurally equivalent residue positions of 10 SDR families using information theoretic measures like Jensen-Shannon divergence and average shannon entropy as variables. The results classify residue positions in the SDR fold into six groups, one of which is characterized by low Shannon entropies but high Jensen-Shannon divergence against the reference family SDR1E, suggesting that these positions are responsible for the specific functional identities of individual SDR families, distinguishing them from the reference family SDR1E. Site directed mutagenesis of three residues from this group in the enzyme UDP-Galactose 4-epimerase belonging to SDR1E shows that the mutants promote the formation of NADH containing abortive complexes. Finally, molecular dynamics simulations have been used to suggest a mechanism by which the mutants interfere with the re-oxidation of NADH leading to the formation of abortive complexes. © 2014 Wiley Periodicals, Inc.
Cao, Ya-Nan; Wang, Ian J; Chen, Lu-Yao; Ding, Yan-Qian; Liu, Lu-Xian; Qiu, Ying-Xiong
2018-04-17
The relative roles of geography, climate and ecology in driving population divergence and (incipient) speciation has so far been largely neglected in studies addressing the evolution of East Asia's island flora. Here, we employed chloroplast and ribosomal DNA sequences and restriction site-associated DNA sequencing (RADseq) loci to investigate the phylogeography and drivers of population divergence of Neolitsea sericea. These data sets support the subdivision of N. sericea populations into the Southern and Northern lineages across the 'Tokara gap'. Two distinct sublineages were further identified for the Northern lineage of N. sericea from the RADseq data. RADseq was also used along with approximate Bayesian computation to show that the current distribution and differentiation of N. sericea populations resulted from a combination of relatively ancient migration and successive vicariant events that likely occurred during the mid to late Pleistocene. Landscape genomic analyses showed that, apart from geographic barriers, barrier, potentially local adaptation to different climatic conditions appears to be one of the major drivers for lineage diversification of N. sericea. Copyright © 2018 Elsevier Inc. All rights reserved.
A little bit of sex matters for genome evolution in asexual plants.
Hojsgaard, Diego; Hörandl, Elvira
2015-01-01
Genome evolution in asexual organisms is theoretically expected to be shaped by various factors: first, hybrid origin, and polyploidy confer a genomic constitution of highly heterozygous genotypes with multiple copies of genes; second, asexuality confers a lack of recombination and variation in populations, which reduces the efficiency of selection against deleterious mutations; hence, the accumulation of mutations and a gradual increase in mutational load (Muller's ratchet) would lead to rapid extinction of asexual lineages; third, allelic sequence divergence is expected to result in rapid divergence of lineages (Meselson effect). Recent transcriptome studies on the asexual polyploid complex Ranunculus auricomus using single-nucleotide polymorphisms confirmed neutral allelic sequence divergence within a short time frame, but rejected a hypothesis of a genome-wide accumulation of mutations in asexuals compared to sexuals, except for a few genes related to reproductive development. We discuss a general model that the observed incidence of facultative sexuality in plants may unmask deleterious mutations with partial dominance and expose them efficiently to purging selection. A little bit of sex may help to avoid genomic decay and extinction.
NASA Technical Reports Server (NTRS)
Lanyi, J. K.
1986-01-01
The archaebacteria occupy a unique place in phylogenetic trees constructed from analyses of sequences from key informational macromolecules, and their study continues to yield interesting ideas on the early evolution and divergence of biological forms. It is now known that the halobacteria among these species contain various retinal-proteins, resembling eukaryotic rhodopsins, but with different functions. Two of these pigments, located in the cytoplasmic membranes of the bacteria, are bacteriorhodopsin (a light-driven proton pump) and halorhodopsin (a light-driven chloride pump). Comparison of these systems is expected to reveal structure/function relationships in these simple (primitive?) energy transducing membrane components and evolutionary relationships which had produced the structural features which allow the divergent functions. Findings indicate that very different primary structures are needed for these proteins to accomplish their different functions. Indeed, analysis of partial amino acid sequences from halo-opsin shows already that few if any long segments exist which are homologous to bacterio-opsin. Either these proteins diverged a very long time ago to allow for the observed differences, or the evolutionary clock in the halobacteria runs faster than usual.
Divergent evolution of multiple virus-resistance genes from a progenitor in Capsicum spp.
Kim, Saet-Byul; Kang, Won-Hee; Huy, Hoang Ngoc; Yeom, Seon-In; An, Jeong-Tak; Kim, Seungill; Kang, Min-Young; Kim, Hyun Jung; Jo, Yeong Deuk; Ha, Yeaseong; Choi, Doil; Kang, Byoung-Cheorl
2017-01-01
Plants have evolved hundreds of nucleotide-binding and leucine-rich domain proteins (NLRs) as potential intracellular immune receptors, but the evolutionary mechanism leading to the ability to recognize specific pathogen effectors is elusive. Here, we cloned Pvr4 (a Potyvirus resistance gene in Capsicum annuum) and Tsw (a Tomato spotted wilt virus resistance gene in Capsicum chinense) via a genome-based approach using independent segregating populations. The genes both encode typical NLRs and are located at the same locus on pepper chromosome 10. Despite the fact that these two genes recognize completely different viral effectors, the genomic structures and coding sequences of the two genes are strikingly similar. Phylogenetic studies revealed that these two immune receptors diverged from a progenitor gene of a common ancestor. Our results suggest that sequence variations caused by gene duplication and neofunctionalization may underlie the evolution of the ability to specifically recognize different effectors. These findings thereby provide insight into the divergent evolution of plant immune receptors. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.
Comparative Genome and Proteome Analysis of Anopheles gambiae and Drosophila melanogaster
NASA Astrophysics Data System (ADS)
Zdobnov, Evgeny M.; von Mering, Christian; Letunic, Ivica; Torrents, David; Suyama, Mikita; Copley, Richard R.; Christophides, George K.; Thomasova, Dana; Holt, Robert A.; Subramanian, G. Mani; Mueller, Hans-Michael; Dimopoulos, George; Law, John H.; Wells, Michael A.; Birney, Ewan; Charlab, Rosane; Halpern, Aaron L.; Kokoza, Elena; Kraft, Cheryl L.; Lai, Zhongwu; Lewis, Suzanna; Louis, Christos; Barillas-Mury, Carolina; Nusskern, Deborah; Rubin, Gerald M.; Salzberg, Steven L.; Sutton, Granger G.; Topalis, Pantelis; Wides, Ron; Wincker, Patrick; Yandell, Mark; Collins, Frank H.; Ribeiro, Jose; Gelbart, William M.; Kafatos, Fotis C.; Bork, Peer
2002-10-01
Comparison of the genomes and proteomes of the two diptera Anopheles gambiae and Drosophila melanogaster, which diverged about 250 million years ago, reveals considerable similarities. However, numerous differences are also observed; some of these must reflect the selection and subsequent adaptation associated with different ecologies and life strategies. Almost half of the genes in both genomes are interpreted as orthologs and show an average sequence identity of about 56%, which is slightly lower than that observed between the orthologs of the pufferfish and human (diverged about 450 million years ago). This indicates that these two insects diverged considerably faster than vertebrates. Aligned sequences reveal that orthologous genes have retained only half of their intron/exon structure, indicating that intron gains or losses have occurred at a rate of about one per gene per 125 million years. Chromosomal arms exhibit significant remnants of homology between the two species, although only 34% of the genes colocalize in small ``microsyntenic'' clusters, and major interarm transfers as well as intra-arm shuffling of gene order are detected.
Nikaido, Masato; Matsuno, Fumio; Hamilton, Healy; Brownell, Robert L.; Cao, Ying; Ding, Wang; Zuoyan, Zhu; Shedlock, Andrew M.; Fordyce, R. Ewan; Hasegawa, Masami; Okada, Norihiro
2001-01-01
SINE (short interspersed element) insertion analysis elucidates contentious aspects in the phylogeny of toothed whales and dolphins (Odontoceti), especially river dolphins. Here, we characterize 25 informative SINEs inserted into unique genomic loci during evolution of odontocetes to construct a cladogram, and determine a total of 2.8 kb per taxon of the flanking sequences of these SINE loci to estimate divergence times among lineages. We demonstrate that: (i) Odontocetes are monophyletic; (ii) Ganges River dolphins, beaked whales, and ocean dolphins diverged (in this order) after sperm whales; (iii) three other river dolphin taxa, namely the Amazon, La Plata, and Yangtze river dolphins, form a monophyletic group with Yangtze River dolphins being the most basal; and (iv) the rapid radiation of extant cetacean lineages occurred some 28–33 million years B.P., in strong accord with the fossil record. The combination of SINE and flanking sequence analysis suggests a topology and set of divergence times for odontocete relationships, offering alternative explanations for several long-standing problems in cetacean evolution. PMID:11416211
Wilson, Wade D; Turner, Thomas F
2009-08-01
The genus Oncorhynchus includes Pacific salmon and trout (anadromous and land-locked) species of the western United States and Mexico. All species and subspecies in this group are threatened, endangered, sensitive, or species of conservation concern in portions of their native ranges. To examine the relationships of the species within Oncorhynchus we sequenced a 768 bp fragment of the protein-encoding ND4 mtDNA region. We included all six recognized subspecies of O. clarki (cutthroat trout), O. gilaegilae (Gila trout) and O. g. apache (Apache trout). Gene trees from likelihood and Bayesian phylogenetic analyses revealed that Salvelinus was the sister group to Oncorhynchus, and as expected based on previous studies, O. clarki was sister to a clade that consisted of O. mykiss plus O. g. gilae and O. g. apache. Within the cutthroat clade (O. clarki), the coastal form O. c. clarki was basal with the Rio Grande cutthroat (O. c. virginalis) most derived. Divergence dating based on a fossil calibration molecular clock showed the oldest clade (mean node age) was O. masou ssp., which diverged roughly 7.6 MYA. Highest probability density intervals for divergence of O. masou overlapped with divergence (6.3 MYA) of Pacific salmon clades ((O. gorbuscha + O. nerka) and (O. tshawytscha + O. kisutch)). The Pacific trout clade ((O. mykiss + O. gilae ssp.) + (O. clarki ssp.)) diverged from the Pacific salmon around 6.3 MYA, with most of the diversification within the O. clarki clade occurring in the last 1 MY.
Kropáčková, Lucie; Těšický, Martin; Albrecht, Tomáš; Kubovčiak, Jan; Čížková, Dagmar; Tomášek, Oldřich; Martin, Jean-François; Bobek, Lukáš; Králová, Tereza; Procházka, Petr; Kreisinger, Jakub
2017-10-01
Vertebrate gut microbiota (GM) is comprised of a taxonomically diverse consortium of symbiotic and commensal microorganisms that have a pronounced effect on host physiology, immune system function and health status. Despite much research on interactions between hosts and their GM, the factors affecting inter- and intraspecific GM variation in wild populations are still poorly known. We analysed data on faecal microbiota composition in 51 passerine species (319 individuals) using Illumina MiSeq sequencing of bacterial 16S rRNA (V3-V4 variable region). Despite pronounced interindividual variation, GM composition exhibited significant differences at the interspecific level, accounting for approximately 20%-30% of total GM variation. We also observed a significant correlation between GM composition divergence and host's phylogenetic divergence, with strength of correlation higher than that of GM vs. ecological or life history traits and geographic variation. The effect of host's phylogeny on GM composition was significant, even after statistical control for these confounding factors. Hence, our data do not support codiversification of GM and passerine phylogeny solely as a by-product of their ecological divergence. Furthermore, our findings do not support that GM vs. host's phylogeny codiversification is driven primarily through trans-generational GM transfer as the GM vs. phylogeny correlation does not increase with higher sequence similarity used when delimiting operational taxonomic units. Instead, we hypothesize that the GM vs. phylogeny correlation may arise as a consequence of interspecific divergence of genes that directly or indirectly modulate composition of GM. © 2017 John Wiley & Sons Ltd.
Neuwald, Andrew F
2009-08-01
The patterns of sequence similarity and divergence present within functionally diverse, evolutionarily related proteins contain implicit information about corresponding biochemical similarities and differences. A first step toward accessing such information is to statistically analyze these patterns, which, in turn, requires that one first identify and accurately align a very large set of protein sequences. Ideally, the set should include many distantly related, functionally divergent subgroups. Because it is extremely difficult, if not impossible for fully automated methods to align such sequences correctly, researchers often resort to manual curation based on detailed structural and biochemical information. However, multiply-aligning vast numbers of sequences in this way is clearly impractical. This problem is addressed using Multiply-Aligned Profiles for Global Alignment of Protein Sequences (MAPGAPS). The MAPGAPS program uses a set of multiply-aligned profiles both as a query to detect and classify related sequences and as a template to multiply-align the sequences. It relies on Karlin-Altschul statistics for sensitivity and on PSI-BLAST (and other) heuristics for speed. Using as input a carefully curated multiple-profile alignment for P-loop GTPases, MAPGAPS correctly aligned weakly conserved sequence motifs within 33 distantly related GTPases of known structure. By comparison, the sequence- and structurally based alignment methods hmmalign and PROMALS3D misaligned at least 11 and 23 of these regions, respectively. When applied to a dataset of 65 million protein sequences, MAPGAPS identified, classified and aligned (with comparable accuracy) nearly half a million putative P-loop GTPase sequences. A C++ implementation of MAPGAPS is available at http://mapgaps.igs.umaryland.edu. Supplementary data are available at Bioinformatics online.
Lenz, Tobias L.; Mueller, Birte; Trillmich, Fritz; Wolf, Jochen B. W.
2013-01-01
It is still debated whether main individual fitness differences in natural populations can be attributed to genome-wide effects or to particular loci of outstanding functional importance such as the major histocompatibility complex (MHC). In a long-term monitoring project on Galápagos sea lions (Zalophus wollebaeki), we collected comprehensive fitness and mating data for a total of 506 individuals. Controlling for genome-wide inbreeding, we find strong associations between the MHC locus and nearly all fitness traits. The effect was mainly attributable to MHC sequence divergence and could be decomposed into contributions of own and maternal genotypes. In consequence, the population seems to have evolved a pool of highly divergent alleles conveying near-optimal MHC divergence even by random mating. Our results demonstrate that a single locus can significantly contribute to fitness in the wild and provide conclusive evidence for the ‘divergent allele advantage’ hypothesis, a special form of balancing selection with interesting evolutionary implications. PMID:23677346
Narang, Pooja; Wilson Sayres, Melissa A.
2016-01-01
Male mutation bias, when more mutations are passed on via the male germline than via the female germline, is observed across mammals. One common way to infer the magnitude of male mutation bias, α, is to compare levels of neutral sequence divergence between genomic regions that spend different amounts of time in the male and female germline. For great apes, including human, we show that estimates of divergence are reduced in putatively unconstrained regions near genes relative to unconstrained regions far from genes. Divergence increases with increasing distance from genes on both the X chromosome and autosomes, but increases faster on the X chromosome than autosomes. As a result, ratios of X/A divergence increase with increasing distance from genes and corresponding estimates of male mutation bias are significantly higher in intergenic regions near genes versus far from genes. Future studies in other species will need to carefully consider the effect that genomic location will have on estimates of male mutation bias. PMID:27702816
DOE Office of Scientific and Technical Information (OSTI.GOV)
Toye, P.G.; Metzelaar, M.J.; Wijngaard, P.L.J.
1995-08-01
Theileria parva, a tick-transmitted protozoan parasite related to Plasmodium spp., causes the disease East Coast fever, an acute and usually fatal lymphoproliferative disorder of cattle in Africa. Previous studies using sera from cattle that have survived infection identified a polymorphic immunodominant molecule (PIM) that is expressed by both the infective sporozoite stage of the parasite and the intracellular schizont. Here we show that mAb specific for the PIM Ag can inhibit sporozoite invasion of lymphocytes in vitro. A cDNA clone encoding the PIM Ag of the T. parva (Muguga) stock was obtained by using these mAb in a novel eukaryoticmore » expression cloning system that allows isolation of cDNA encoding cytoplasmic or surface Ags. To establish the molecular basis of the polymorphism of PIM, the cDNA of the PIM Ag from a buffalo-derived T. parva stock was isolated and its sequence was compared with that of the cattle-derived Muguga PIM. The two cDNAs showed considerable identity in both the 5{prime} and 3{prime} regions, but there was substantial sequence divergence in the central regions. Several types of repeated sequences were identified in the variant regions. In the Muguga form of the molecule, there were five tandem repeats of the tetrapeptide, QPEP, that were shown, by transfection of a deleted version of the PIM gene, not to react with several anti-PIM mAbs. By isolating and sequencing the genomic version of the gene, we identified two small introns in the 3{prime} region of the gene. Finally, we showed that polyclonal rat Abs against recombinant PIM neutralize sporozoite infectivity in vitro, suggesting that the PIM Ag should be evaluated for its capacity to immunize cattle against East Coast Fever.« less
Miller, Hilary C; O'Meally, Denis; Ezaz, Tariq; Amemiya, Chris; Marshall-Graves, Jennifer A; Edwards, Scott
2015-05-07
Major histocompatibility complex (MHC) genes are a central component of the vertebrate immune system and usually exist in a single genomic region. However, considerable differences in MHC organization and size exist between different vertebrate lineages. Reptiles occupy a key evolutionary position for understanding how variation in MHC structure evolved in vertebrates, but information on the structure of the MHC region in reptiles is limited. In this study, we investigate the organization and cytogenetic location of MHC genes in the tuatara (Sphenodon punctatus), the sole extant representative of the early-diverging reptilian order Rhynchocephalia. Sequencing and mapping of 12 clones containing class I and II MHC genes from a bacterial artificial chromosome library indicated that the core MHC region is located on chromosome 13q. However, duplication and translocation of MHC genes outside of the core region was evident, because additional class I MHC genes were located on chromosome 4p. We found a total of seven class I sequences and 11 class II β sequences, with evidence for duplication and pseudogenization of genes within the tuatara lineage. The tuatara MHC is characterized by high repeat content and low gene density compared with other species and we found no antigen processing or MHC framework genes on the MHC gene-containing clones. Our findings indicate substantial differences in MHC organization in tuatara compared with mammalian and avian MHCs and highlight the dynamic nature of the MHC. Further sequencing and annotation of tuatara and other reptile MHCs will determine if the tuatara MHC is representative of nonavian reptiles in general. Copyright © 2015 Miller et al.
DNA Barcoding the Geometrid Fauna of Bavaria (Lepidoptera): Successes, Surprises, and Questions
Hausmann, Axel; Haszprunar, Gerhard; Hebert, Paul D. N.
2011-01-01
Background The State of Bavaria is involved in a research program that will lead to the construction of a DNA barcode library for all animal species within its territorial boundaries. The present study provides a comprehensive DNA barcode library for the Geometridae, one of the most diverse of insect families. Methodology/Principal Findings This study reports DNA barcodes for 400 Bavarian geometrid species, 98 per cent of the known fauna, and approximately one per cent of all Bavarian animal species. Although 98.5% of these species possess diagnostic barcode sequences in Bavaria, records from neighbouring countries suggest that species-level resolution may be compromised in up to 3.5% of cases. All taxa which apparently share barcodes are discussed in detail. One case of modest divergence (1.4%) revealed a species overlooked by the current taxonomic system: Eupithecia goossensiata Mabille, 1869 stat.n. is raised from synonymy with Eupithecia absinthiata (Clerck, 1759) to species rank. Deep intraspecific sequence divergences (>2%) were detected in 20 traditionally recognized species. Conclusions/Significance The study emphasizes the effectiveness of DNA barcoding as a tool for monitoring biodiversity. Open access is provided to a data set that includes records for 1,395 geometrid specimens (331 species) from Bavaria, with 69 additional species from neighbouring regions. Taxa with deep intraspecific sequence divergences are undergoing more detailed analysis to ascertain if they represent cases of cryptic diversity. PMID:21423340
Detection and Analysis of Circular RNAs by RT-PCR.
Panda, Amaresh C; Gorospe, Myriam
2018-03-20
Gene expression in eukaryotic cells is tightly regulated at the transcriptional and posttranscriptional levels. Posttranscriptional processes, including pre-mRNA splicing, mRNA export, mRNA turnover, and mRNA translation, are controlled by RNA-binding proteins (RBPs) and noncoding (nc)RNAs. The vast family of ncRNAs comprises diverse regulatory RNAs, such as microRNAs and long noncoding (lnc)RNAs, but also the poorly explored class of circular (circ)RNAs. Although first discovered more than three decades ago by electron microscopy, only the advent of high-throughput RNA-sequencing (RNA-seq) and the development of innovative bioinformatic pipelines have begun to allow the systematic identification of circRNAs (Szabo and Salzman, 2016; Panda et al ., 2017b; Panda et al ., 2017c). However, the validation of true circRNAs identified by RNA sequencing requires other molecular biology techniques including reverse transcription (RT) followed by conventional or quantitative (q) polymerase chain reaction (PCR), and Northern blot analysis (Jeck and Sharpless, 2014). RT-qPCR analysis of circular RNAs using divergent primers has been widely used for the detection, validation, and sometimes quantification of circRNAs (Abdelmohsen et al ., 2015 and 2017; Panda et al ., 2017b). As detailed here, divergent primers designed to span the circRNA backsplice junction sequence can specifically amplify the circRNAs and not the counterpart linear RNA. In sum, RT-PCR analysis using divergent primers allows direct detection and quantification of circRNAs.
Radhika, R; Bijoy Nandan, S; Harikrishnan, M
2017-11-01
Morphological identification of the marine cyclopoid copepod Dioithona rigida in combination with sequencing a 645 bp fragment of mitochondrial cytochrome oxidase c subunit I (mtCOI) gene, collected from offshore waters of Kavarathi Island, Lakshadweep Sea, is presented in this study. Kiefer in 1935 classified Dioithona as a separate genus from Oithona. The main distinguishing characters observed in the collected samples, such as the presence of well-developed P5 with 2 setae, 5 segmented urosome, 12 segmented antennule, compact dagger-like setae on the inner margin of proximal segment of exopod ramus in P1-P4 and engorged portion of P1-bearing a spine, confirmed their morphology to D. rigida. A comparison of setal formulae of the exopod and endopod of D. rigida with those recorded previously by various authors are also presented here. Maximum likelihood Tree analysis exhibited the clustering of D. rigida sequences into a single clade (accession numbers KP972540.1-KR528588.1), which in contrast was 37-42% divergent from other Oithona species. Further intra-specific divergence values of 0-2% also confirmed the genetic identity of D. rigida species. Paracyclopina nana was selected as an out group displayed a diverged array. The present results distinctly differentiated D. rigida from other Oithona species.
Jonniaux, Pierre; Kumazawa, Yoshinori
2008-01-15
Mitochondrial DNA sequences of approximately 2.3 kbp including the complete NADH dehydrogenase subunit 2 gene and its flanking genes, as well as parts of 12S and 16S rRNA genes were determined from major species of the eyelid gecko family Eublepharidae sensu [Kluge, A.G. 1987. Cladistic relationships in the Gekkonoidea (Squamata, Sauria). Misc. Publ. Mus. Zool. Univ. Michigan 173, 1-54.]. In contrast to previous morphological studies, phylogenetic analyses based on these sequences supported that Eublepharidae and Gekkonidae form a sister group with Pygopodidae, raising the possibility of homoplasious character change in some key features of geckos, such as reduction of movable eyelids and innovation of climbing toe pads. The phylogenetic analyses also provided a well-resolved tree for relationships between the eublepharid species. The Bayesian estimation of divergence times without assuming the molecular clock suggested the Jurassic divergence of Eublepharidae from Gekkonidae and radiations of most eublepharid genera around the Cretaceous. These dating results appeared to be robust against some conditional changes for time estimation, such as gene regions used, taxon representation, and data partitioning. Taken together with geological evidence, these results support the vicariant divergence of Eublepharidae and Gekkonidae by the breakup of Pangea into Laurasia and Gondwanaland, and recent dispersal of two African eublepharid genera from Eurasia to Africa after these landmasses were connected in the Early Miocene.
Sperm Bindin Divergence under Sexual Selection and Concerted Evolution in Sea Stars.
Patiño, Susana; Keever, Carson C; Sunday, Jennifer M; Popovic, Iva; Byrne, Maria; Hart, Michael W
2016-08-01
Selection associated with competition among males or sexual conflict between mates can create positive selection for high rates of molecular evolution of gamete recognition genes and lead to reproductive isolation between species. We analyzed coding sequence and repetitive domain variation in the gene encoding the sperm acrosomal protein bindin in 13 diverse sea star species. We found that bindin has a conserved coding sequence domain structure in all 13 species, with several repeated motifs in a large central region that is similar among all sea stars in organization but highly divergent among genera in nucleotide and predicted amino acid sequence. More bindin codons and lineages showed positive selection for high relative rates of amino acid substitution in genera with gonochoric outcrossing adults (and greater expected strength of sexual selection) than in selfing hermaphrodites. That difference is consistent with the expectation that selfing (a highly derived mating system) may moderate the strength of sexual selection and limit the accumulation of bindin amino acid differences. The results implicate both positive selection on single codons and concerted evolution within the repetitive region in bindin divergence, and suggest that both single amino acid differences and repeat differences may affect sperm-egg binding and reproductive compatibility. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Can DNA barcoding accurately discriminate megadiverse Neotropical freshwater fish fauna?
2013-01-01
Background The megadiverse Neotropical freshwater ichthyofauna is the richest in the world with approximately 6,000 recognized species. Interestingly, they are distributed among only 17 orders, and almost 80% of them belong to only three orders: Characiformes, Siluriformes and Perciformes. Moreover, evidence based on molecular data has shown that most of the diversification of the Neotropical ichthyofauna occurred recently. These characteristics make the taxonomy and identification of this fauna a great challenge, even when using molecular approaches. In this context, the present study aimed to test the effectiveness of the barcoding methodology (COI gene) to identify the mega diverse freshwater fish fauna from the Neotropical region. For this purpose, 254 species of fishes were analyzed from the Upper Parana River basin, an area representative of the larger Neotropical region. Results Of the 254 species analyzed, 252 were correctly identified by their barcode sequences (99.2%). The main K2P intra- and inter-specific genetic divergence values (0.3% and 6.8%, respectively) were relatively low compared with similar values reported in the literature, reflecting the higher number of closely related species belonging to a few higher taxa and their recent radiation. Moreover, for 84 pairs of species that showed low levels of genetic divergence (<2%), application of a complementary character-based nucleotide diagnostic approach proved useful in discriminating them. Additionally, 14 species displayed high intra-specific genetic divergence (>2%), pointing to at least 23 strong candidates for new species. Conclusions Our study is the first to examine a large number of freshwater fish species from the Neotropical area, including a large number of closely related species. The results confirmed the efficacy of the barcoding methodology to identify a recently radiated, megadiverse fauna, discriminating 99.2% of the analyzed species. The power of the barcode sequences to identify species, even with low interspecific divergence, gives us an idea of the distribution of inter-specific genetic divergence in these megadiverse fauna. The results also revealed hidden genetic divergences suggestive of reproductive isolation and putative cryptic speciation in some species (23 candidates for new species). Finally, our study constituted an important contribution to the international Barcoding of Life (iBOL.org) project, providing barcode sequences for use in identification of these species by experts and non-experts, and allowing them to be available for use in other applications. PMID:23497346
Glinsky, Gennadi V
2016-09-19
Thousands of candidate human-specific regulatory sequences (HSRS) have been identified, supporting the hypothesis that unique to human phenotypes result from human-specific alterations of genomic regulatory networks. Collectively, a compendium of multiple diverse families of HSRS that are functionally and structurally divergent from Great Apes could be defined as the backbone of human-specific genomic regulatory networks. Here, the conservation patterns analysis of 18,364 candidate HSRS was carried out requiring that 100% of bases must remap during the alignments of human, chimpanzee, and bonobo sequences. A total of 5,535 candidate HSRS were identified that are: (i) highly conserved in Great Apes; (ii) evolved by the exaptation of highly conserved ancestral DNA; (iii) defined by either the acceleration of mutation rates on the human lineage or the functional divergence from non-human primates. The exaptation of highly conserved ancestral DNA pathway seems mechanistically distinct from the evolution of regulatory DNA segments driven by the species-specific expansion of transposable elements. Genome-wide proximity placement analysis of HSRS revealed that a small fraction of topologically associating domains (TADs) contain more than half of HSRS from four distinct families. TADs that are enriched for HSRS and termed rapidly evolving in humans TADs (revTADs) comprise 0.8-10.3% of 3,127 TADs in the hESC genome. RevTADs manifest distinct correlation patterns between placements of human accelerated regions, human-specific transcription factor-binding sites, and recombination rates. There is a significant enrichment within revTAD boundaries of hESC-enhancers, primate-specific CTCF-binding sites, human-specific RNAPII-binding sites, hCONDELs, and H3K4me3 peaks with human-specific enrichment at TSS in prefrontal cortex neurons (P < 0.0001 in all instances). Present analysis supports the idea that phenotypic divergence of Homo sapiens is driven by the evolution of human-specific genomic regulatory networks via at least two mechanistically distinct pathways of creation of divergent sequences of regulatory DNA: (i) recombination-associated exaptation of the highly conserved ancestral regulatory DNA segments; (ii) human-specific insertions of transposable elements. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Neopolyploidy and diversification in Heuchera grossulariifolia.
Oswald, Benjamin P; Nuismer, Scott L
2011-06-01
Newly formed polyploid lineages must contend with several obstacles to avoid extinction, including minority cytotype exclusion, competition, and inbreeding depression. If polyploidization results in immediate divergence of phenotypic characters these hurdles may be reduced and establishment made more likely. In addition, if polyploidization alters the phenotypic and genotypic associations between traits, that is, the P and G matrices, polyploids may be able to explore novel evolutionary paths, facilitating their divergence and successful establishment. Here, we report results from a study of the perennial plant Heuchera grossulariifolia in which the phenotypic divergence and changes in phenotypic and genotypic covariance matrices caused by neopolyploidization have been estimated. Our results reveal that polyploidization causes immediate divergence for traits relevant to establishment and results in significant changes in the structure of the phenotypic covariance matrix. In contrast, our results do not provide evidence that polyploidization results in immediate and substantial shifts in the genetic covariance matrix. © 2010 The Author(s). Evolution© 2010 The Society for the Study of Evolution.
Fenner, Jack N
2005-10-01
The length of the human generation interval is a key parameter when using genetics to date population divergence events. However, no consensus exists regarding the generation interval length, and a wide variety of interval lengths have been used in recent studies. This makes comparison between studies difficult, and questions the accuracy of divergence date estimations. Recent genealogy-based research suggests that the male generation interval is substantially longer than the female interval, and that both are greater than the values commonly used in genetics studies. This study evaluates each of these hypotheses in a broader cross-cultural context, using data from both nation states and recent hunter-gatherer societies. Both hypotheses are supported by this study; therefore, revised estimates of male, female, and overall human generation interval lengths are proposed. The nearly universal, cross-cultural nature of the evidence justifies using these proposed estimates in Y-chromosomal, mitochondrial, and autosomal DNA-based population divergence studies.
Shirani, Sahar; Hellweger, Ferdi L
2017-08-01
Molecular observations reveal substantial biogeographic patterns of cyanobacteria within systems of connected lakes. An important question is the relative role of environmental selection and neutral processes in the biogeography of these systems. Here, we quantify the effect of genetic drift and dispersal limitation by simulating individual cyanobacteria cells using an agent-based model (ABM). In the model, cells grow (divide), die, and migrate between lakes. Each cell has a full genome that is subject to neutral mutation (i.e., the growth rate is independent of the genome). The model is verified by simulating simplified lake systems, for which theoretical solutions are available. Then, it is used to simulate the biogeography of the cyanobacterium Microcystis aeruginosa in a number of real systems, including the Great Lakes, Klamath River, Yahara River, and Chattahoochee River. Model output is analyzed using standard bioinformatics tools (BLAST, MAFFT). The emergent patterns of nucleotide divergence between lakes are dynamic, including gradual increases due to accumulation of mutations and abrupt changes due to population takeovers by migrant cells (coalescence events). The model predicted nucleotide divergence is heterogeneous within systems, and for weakly connected lakes, it can be substantial. For example, Lakes Superior and Michigan are predicted to have an average genomic nucleotide divergence of 8200 bp or 0.14%. The divergence between more strongly connected lakes is much lower. Our results provide a quantitative baseline for future biogeography studies. They show that dispersal limitation can be an important factor in microbe biogeography, which is contrary to the common belief, and could affect how a system responds to environmental change.