Sample records for sequence divergence estimates

  1. Sequencing of Chloroplast Genomes from Wheat, Barley, Rye and Their Relatives Provides a Detailed Insight into the Evolution of the Triticeae Tribe

    PubMed Central

    Middleton, Christopher P.; Senerchia, Natacha; Stein, Nils; Akhunov, Eduard D.; Keller, Beat

    2014-01-01

    Using Roche/454 technology, we sequenced the chloroplast genomes of 12 Triticeae species, including bread wheat, barley and rye, as well as the diploid progenitors and relatives of bread wheat Triticum urartu, Aegilops speltoides and Ae. tauschii. Two wild tetraploid taxa, Ae. cylindrica and Ae. geniculata, were also included. Additionally, we incorporated wild Einkorn wheat Triticum boeoticum and its domesticated form T. monococcum and two Hordeum spontaneum (wild barley) genotypes. Chloroplast genomes were used for overall sequence comparison, phylogenetic analysis and dating of divergence times. We estimate that barley diverged from rye and wheat approximately 8–9 million years ago (MYA). The genome donors of hexaploid wheat diverged between 2.1–2.9 MYA, while rye diverged from Triticum aestivum approximately 3–4 MYA, more recently than previously estimated. Interestingly, the A genome taxa T. boeoticum and T. urartu were estimated to have diverged approximately 570,000 years ago. As these two have a reproductive barrier, the divergence time estimate also provides an upper limit for the time required for the formation of a species boundary between the two. Furthermore, we conclusively show that the chloroplast genome of hexaploid wheat was contributed by the B genome donor and that this unknown species diverged from Ae. speltoides about 980,000 years ago. Additionally, sequence alignments identified a translocation of a chloroplast segment to the nuclear genome which is specific to the rye/wheat lineage. We propose the presented phylogeny and divergence time estimates as a reference framework for future studies on Triticeae. PMID:24614886

  2. Conceptual issues in Bayesian divergence time estimation

    PubMed Central

    2016-01-01

    Bayesian inference of species divergence times is an unusual statistical problem, because the divergence time parameters are not identifiable unless both fossil calibrations and sequence data are available. Commonly used marginal priors on divergence times derived from fossil calibrations may conflict with node order on the phylogenetic tree causing a change in the prior on divergence times for a particular topology. Care should be taken to avoid confusing this effect with changes due to informative sequence data. This effect is illustrated with examples. A topology-consistent prior that preserves the marginal priors is defined and examples are constructed. Conflicts between fossil calibrations and relative branch lengths (based on sequence data) can cause estimates of divergence times that are grossly incorrect, yet have a narrow posterior distribution. An example of this effect is given; it is recommended that overly narrow posterior distributions of divergence times should be carefully scrutinized. This article is part of the themed issue ‘Dating species divergences using rocks and clocks’. PMID:27325831

  3. Conceptual issues in Bayesian divergence time estimation.

    PubMed

    Rannala, Bruce

    2016-07-19

    Bayesian inference of species divergence times is an unusual statistical problem, because the divergence time parameters are not identifiable unless both fossil calibrations and sequence data are available. Commonly used marginal priors on divergence times derived from fossil calibrations may conflict with node order on the phylogenetic tree causing a change in the prior on divergence times for a particular topology. Care should be taken to avoid confusing this effect with changes due to informative sequence data. This effect is illustrated with examples. A topology-consistent prior that preserves the marginal priors is defined and examples are constructed. Conflicts between fossil calibrations and relative branch lengths (based on sequence data) can cause estimates of divergence times that are grossly incorrect, yet have a narrow posterior distribution. An example of this effect is given; it is recommended that overly narrow posterior distributions of divergence times should be carefully scrutinized. This article is part of the themed issue 'Dating species divergences using rocks and clocks'. © 2016 The Author(s).

  4. Novel non-parametric models to estimate evolutionary rates and divergence times from heterochronous sequence data.

    PubMed

    Fourment, Mathieu; Holmes, Edward C

    2014-07-24

    Early methods for estimating divergence times from gene sequence data relied on the assumption of a molecular clock. More sophisticated methods were created to model rate variation and used auto-correlation of rates, local clocks, or the so-called "uncorrelated relaxed clock" where substitution rates are assumed to be drawn from a parametric distribution. In the case of Bayesian inference methods the impact of the prior on branching times is not clearly understood, and if the amount of data is limited the posterior could be strongly influenced by the prior. We develop a maximum likelihood method, Physher, that uses local or discrete clocks to estimate evolutionary rates and divergence times from heterochronous sequence data. Using two empirical data sets we show that our discrete clock estimates are similar to those obtained by other methods, and that Physher outperformed some methods in the estimation of the root age of an influenza virus data set. A simulation analysis suggests that Physher can outperform a Bayesian method when the real topology contains two long branches below the root node, even when evolution is strongly clock-like. These results suggest it is advisable to use a variety of methods to estimate evolutionary rates and divergence times from heterochronous sequence data. Physher and the associated data sets used here are available online at http://code.google.com/p/physher/.

  5. Estimation of primate speciation dates using local molecular clocks.

    PubMed

    Yoder, A D; Yang, Z

    2000-07-01

    Protein-coding genes of the mitochondrial genomes from 31 mammalian species were analyzed to estimate the speciation dates within primates and also between rats and mice. Three calibration points were used based on paleontological data: one at 20-25 MYA for the hominoid/cercopithecoid divergence, one at 53-57 MYA for the cetacean/artiodactyl divergence, and the third at 110-130 MYA for the metatherian/eutherian divergence. Both the nucleotide and the amino acid sequences were analyzed, producing conflicting results. The global molecular clock was clearly violated for both the nucleotide and the amino acid data. Models of local clocks were implemented using maximum likelihood, allowing different evolutionary rates for some lineages while assuming rate constancy in others. Surprisingly, the highly divergent third codon positions appeared to contain phylogenetic information and produced more sensible estimates of primate divergence dates than did the amino acid sequences. Estimated dates varied considerably depending on the data type, the calibration point, and the substitution model but differed little among the four tree topologies used. We conclude that the calibration derived from the primate fossil record is too recent to be reliable; we also point out a number of problems in date estimation when the molecular clock does not hold. Despite these obstacles, we derived estimates of primate divergence dates that were well supported by the data and were generally consistent with the paleontological record. Estimation of the mouse-rat divergence date, however, was problematic.

  6. Ignoring heterozygous sites biases phylogenomic estimates of divergence times: implications for the evolutionary history of microtus voles.

    PubMed

    Lischer, Heidi E L; Excoffier, Laurent; Heckel, Gerald

    2014-04-01

    Phylogenetic reconstruction of the evolutionary history of closely related organisms may be difficult because of the presence of unsorted lineages and of a relatively high proportion of heterozygous sites that are usually not handled well by phylogenetic programs. Genomic data may provide enough fixed polymorphisms to resolve phylogenetic trees, but the diploid nature of sequence data remains analytically challenging. Here, we performed a phylogenomic reconstruction of the evolutionary history of the common vole (Microtus arvalis) with a focus on the influence of heterozygosity on the estimation of intraspecific divergence times. We used genome-wide sequence information from 15 voles distributed across the European range. We provide a novel approach to integrate heterozygous information in existing phylogenetic programs by repeated random haplotype sampling from sequences with multiple unphased heterozygous sites. We evaluated the impact of the use of full, partial, or no heterozygous information for tree reconstructions on divergence time estimates. All results consistently showed four deep and strongly supported evolutionary lineages in the vole data. These lineages undergoing divergence processes split only at the end or after the last glacial maximum based on calibration with radiocarbon-dated paleontological material. However, the incorporation of information from heterozygous sites had a significant impact on absolute and relative branch length estimations. Ignoring heterozygous information led to an overestimation of divergence times between the evolutionary lineages of M. arvalis. We conclude that the exclusion of heterozygous sites from evolutionary analyses may cause biased and misleading divergence time estimates in closely related taxa.
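
    The repeated random haplotype sampling described in the abstract above can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it assumes heterozygous sites are encoded with two-base IUPAC ambiguity codes and that each site is resolved independently and uniformly at random. The function names `sample_haplotype` and `sample_replicates` are hypothetical.

```python
import random

# Two-base IUPAC ambiguity codes, assumed here to mark unphased
# heterozygous sites in a diploid consensus sequence.
IUPAC_HET = {
    "R": "AG", "Y": "CT", "S": "CG",
    "W": "AT", "K": "GT", "M": "AC",
}


def sample_haplotype(seq, rng):
    """Return one haplotype in which every heterozygous code is replaced
    by one of its two bases, chosen independently and uniformly.
    All other characters (A, C, G, T, N, gaps) pass through unchanged."""
    return "".join(
        rng.choice(IUPAC_HET[base]) if base in IUPAC_HET else base
        for base in seq.upper()
    )


def sample_replicates(seq, n_replicates, seed=0):
    """Draw n_replicates random haplotypes from one unphased sequence.
    A fixed seed makes the replicate set reproducible."""
    rng = random.Random(seed)
    return [sample_haplotype(seq, rng) for _ in range(n_replicates)]


if __name__ == "__main__":
    for haplotype in sample_replicates("ACRTYGGM", 5):
        print(haplotype)
```

    Each replicate alignment built this way could then be passed to an existing phylogenetic program, and the spread of branch-length estimates across replicates would reflect the uncertainty introduced by the unphased sites.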

  7. Impact of duplicate gene copies on phylogenetic analysis and divergence time estimates in butterflies.

    PubMed

    Pohl, Nélida; Sison-Mangus, Marilou P; Yee, Emily N; Liswi, Saif W; Briscoe, Adriana D

    2009-05-13

    The increase in availability of genomic sequences for a wide range of organisms has revealed gene duplication to be a relatively common event. Encounters with duplicate gene copies have consequently become almost inevitable in the context of collecting gene sequences for inferring species trees. Here we examine the effect of incorporating duplicate gene copies evolving at different rates on tree reconstruction and time estimation of recent and deep divergences in butterflies. Sequences from ultraviolet-sensitive (UVRh), blue-sensitive (BRh), and long-wavelength sensitive (LWRh) opsins, EF-1α and COI were obtained from 27 taxa representing the five major butterfly families (5535 bp total). Both BRh and LWRh are present in multiple copies in some butterfly lineages and the different copies evolve at different rates. Regardless of the phylogenetic reconstruction method used, we found that analyses of combined data sets using either slower or faster evolving copies of duplicate genes resulted in a single topology in agreement with our current understanding of butterfly family relationships based on morphology and molecules. Interestingly, individual analyses of BRh and LWRh sequences also recovered these family-level relationships. Two different relaxed clock methods resulted in similar divergence time estimates at the shallower nodes in the tree, regardless of whether faster or slower evolving copies were used, with larger discrepancies observed at deeper nodes in the phylogeny. The time of divergence between the monarch butterfly Danaus plexippus and the queen D. gilippus (15.3-35.6 Mya) was found to be much older than the time of divergence between monarch co-mimic Limenitis archippus and red-spotted purple L. arthemis (4.7-13.6 Mya), and overlapping with the time of divergence of the co-mimetic passionflower butterflies Heliconius erato and H. melpomene (13.5-26.1 Mya). Our family-level results are congruent with recent estimates found in the literature and indicate an age of 84-113 million years for the divergence of all butterfly families. These results are consistent with diversification of the butterfly families following the radiation of angiosperms and suggest that some classes of opsin genes may be usefully employed for both phylogenetic reconstruction and divergence time estimation.

  8. Characterization of the uncertainty of divergence time estimation under relaxed molecular clock models using multiple loci.

    PubMed

    Zhu, Tianqi; Dos Reis, Mario; Yang, Ziheng

    2015-03-01

    Genetic sequence data provide information about the distances between species or branch lengths in a phylogeny, but not about the absolute divergence times or the evolutionary rates directly. Bayesian methods for dating species divergences estimate times and rates by assigning priors on them. In particular, the prior on times (node ages on the phylogeny) incorporates information in the fossil record to calibrate the molecular tree. Because times and rates are confounded, our posterior time estimates will not approach point values even if an infinite amount of sequence data are used in the analysis. In a previous study we developed a finite-sites theory to characterize the uncertainty in Bayesian divergence time estimation in analysis of large but finite sequence data sets under a strict molecular clock. As most modern clock dating analyses use more than one locus and are conducted under relaxed clock models, here we extend the theory to the case of relaxed clock analysis of data from multiple loci (site partitions). Uncertainty in posterior time estimates is partitioned into three sources: sampling errors in the estimates of branch lengths in the tree for each locus due to limited sequence length, variation of substitution rates among lineages and among loci, and uncertainty in fossil calibrations. Using a simple but analogous estimation problem involving the multivariate normal distribution, we predict that as the number of loci ($L$) goes to infinity, the variance in posterior time estimates decreases and approaches the infinite-data limit at the rate of $1/L$, and the limit is independent of the number of sites in the sequence alignment. We then confirmed the predictions by using computer simulation on phylogenies of two or three species, and by analyzing a real genomic data set for six primate species. Our results suggest that with the fossil calibrations fixed, analyzing multiple loci or site partitions is the most effective way for improving the precision of posterior time estimation. However, even if a huge amount of sequence data is analyzed, considerable uncertainty will persist in time estimates. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society of Systematic Biologists.
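
    One way to read the scaling claim in the abstract above is sketched below. This is an interpretation, not the authors' derivation: the symbols $V_L$ (posterior variance of a node age when $L$ loci are analyzed), $V_\infty$ (the infinite-data limit) and $c$ (an unspecified positive constant) are notation introduced here for illustration.

```latex
\[
  V_L \;\approx\; V_\infty + \frac{c}{L},
  \qquad
  \lim_{L \to \infty} V_L = V_\infty > 0 .
\]
```

    Under this reading, doubling the number of loci halves the excess variance $V_L - V_\infty$, while the residual $V_\infty$, attributed in the abstract to the confounding of times and rates and to calibration uncertainty, remains however much sequence data is added.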

  9. Estimation of population divergence times from non-overlapping genomic sequences: examples from dogs and wolves.

    PubMed

    Skoglund, Pontus; Götherström, Anders; Jakobsson, Mattias

    2011-04-01

    Despite recent technological advances in DNA sequencing, incomplete coverage remains an issue in population genomics, in particular for studies that include ancient samples. Here, we describe an approach to estimate population divergence times for non-overlapping sequence data that is based on probabilities of different genealogical topologies under a structured coalescent model. We show that the approach can be adapted to accommodate common problems such as sequencing errors and postmortem nucleotide misincorporations, and we use simulations to investigate biases involved with estimating genealogical topologies from empirical data. The approach relies on three reference genomes and should be particularly useful for future analysis of genomic data that comprise non-overlapping sets of sequences, potentially from different points in time. We applied the method to shotgun sequence data from an ancient wolf together with extant dogs and wolves and found striking resemblance to previously described fine-scale population structure among dog breeds. When comparing modern dogs to four geographically distinct wolves, we find that the divergence time between dogs and an Indian wolf is smallest, followed by the divergence times to a Chinese wolf and a Spanish wolf, and a relatively long divergence time to an Alaskan wolf, suggesting that the origin of modern dogs is somewhere in Eurasia, potentially southern Asia. We find that less than two-thirds of all loci in the boxer and poodle genomes are more similar to each other than to a modern gray wolf and that, assuming complete isolation without gene flow, the divergence time between gray wolves and modern European dogs extends to 3,500 generations before the present, corresponding to approximately 10,000 years ago (95% confidence interval [CI]: 9,000-13,000). We explicitly study the effect of gene flow between dogs and wolves on our estimates and show that a low rate of gene flow is compatible with an even earlier domestication date ∼30,000 years ago (95% CI: 15,000-90,000). This observation is in agreement with recent archaeological findings and indicates that human behavior necessary for domestication of wild animals could have appeared much earlier than the development of agriculture.

  10. The impact of fossil calibrations, codon positions and relaxed clocks on the divergence time estimates of the native Australian rodents (Conilurini).

    PubMed

    Nilsson, Maria A; Härlid, Anna; Kullberg, Morgan; Janke, Axel

    2010-05-01

    The native rodents are the most species-rich placental mammal group on the Australian continent. Fossils of native Australian rodents belonging to the group Conilurini are known from Northern Australia at 4.5 Ma. These fossil assemblages already display a rich diversity of rodents, but the exact timing of their arrival on the Australian continent is not yet established. The complete mitochondrial genomes of two native Australian rodents, Leggadina lakedownensis (Lakeland Downs mouse) and Pseudomys chapmani (Western Pebble-mound mouse), were sequenced to investigate their evolutionary history. The molecular data were used for studying the phylogenetic position and divergence times of the Australian rodents, using 12 calibration points and various methods. Phylogenetic analyses place the native Australian rodents as the sister-group to the genus Mus. The Mus-Conilurini calibration point (7.3-11.0 Ma) is highly critical for estimating rodent divergence times, while the influence of the different algorithms on estimating divergence times is negligible. The influence of the data type was investigated, indicating that amino acid data are more likely to reflect the correct divergence times than nucleotide sequences. This study of the problems related to estimating divergence times in fast-evolving lineages such as rodents emphasizes that the choice of data and calibration points is critical. Furthermore, it is essential to include accurate calibration points for fast-evolving groups, because the divergence times can otherwise be estimated to be significantly older. The divergence times of the Australian rodents are highly congruent and are estimated at 6.5-7.2 Ma, a date that is compatible with their fossil record.

  11. FRAGS: estimation of coding sequence substitution rates from fragmentary data

    PubMed Central

    Swart, Estienne C; Hide, Winston A; Seoighe, Cathal

    2004-01-01

    Background: Rates of substitution in protein-coding sequences can provide important insights into evolutionary processes that are of biomedical and theoretical interest. Increased availability of coding sequence data has enabled researchers to estimate more accurately the coding sequence divergence of pairs of organisms. However, the use of different data sources, alignment protocols and methods to estimate substitution rates leads to widely varying estimates of key parameters that define the coding sequence divergence of orthologous genes. Although complete genome sequence data are not available for all organisms, fragmentary sequence data can provide accurate estimates of substitution rates provided that an appropriate and consistent methodology is used and that differences in the estimates obtainable from different data sources are taken into account. Results: We have developed FRAGS, an application framework that uses existing, freely available software components to construct in-frame alignments and estimate coding substitution rates from fragmentary sequence data. Coding sequence substitution estimates for human and chimpanzee sequences, generated by FRAGS, reveal that methodological differences can give rise to significantly different estimates of important substitution parameters. The estimated substitution rates were also used to infer upper bounds on the amount of sequencing error in the datasets that we have analysed. Conclusion: We have developed a system that performs robust estimation of substitution rates for orthologous sequences from a pair of organisms. Our system can be used when fragmentary genomic or transcript data is available from one of the organisms and the other is a completely sequenced genome within the Ensembl database. As well as estimating substitution statistics, our system enables the user to manage and query alignment and substitution data. PMID:15005802

  12. Using multi-locus allelic sequence data to estimate genetic divergence among four Lilium (Liliaceae) cultivars

    PubMed Central

    Shahin, Arwa; Smulders, Marinus J. M.; van Tuyl, Jaap M.; Arens, Paul; Bakker, Freek T.

    2014-01-01

    Next Generation Sequencing (NGS) may enable estimating relationships among genotypes using allelic variation of multiple nuclear genes simultaneously. We explored the potential and caveats of this strategy in four genetically distant Lilium cultivars to estimate their genetic divergence from transcriptome sequences using three approaches: POFAD (Phylogeny of Organisms from Allelic Data, uses allelic information of sequence data), RAxML (Randomized Accelerated Maximum Likelihood, tree building based on concatenated consensus sequences) and Consensus Network (constructing a network summarizing among-gene-tree conflicts). Twenty-six gene contigs were chosen based on the presence of orthologous sequences in all cultivars, seven of which also had an orthologous sequence in Tulipa, used as out-group. The three approaches generated the same topology. Although the resolution offered by these approaches is high, in this case there was no extra benefit in using allelic information. We conclude that these 26 genes can be widely applied to construct a species tree for the genus Lilium. PMID:25368628

  13. Impact of duplicate gene copies on phylogenetic analysis and divergence time estimates in butterflies

    PubMed Central

    Pohl, Nélida; Sison-Mangus, Marilou P; Yee, Emily N; Liswi, Saif W; Briscoe, Adriana D

    2009-01-01

    Background: The increase in availability of genomic sequences for a wide range of organisms has revealed gene duplication to be a relatively common event. Encounters with duplicate gene copies have consequently become almost inevitable in the context of collecting gene sequences for inferring species trees. Here we examine the effect of incorporating duplicate gene copies evolving at different rates on tree reconstruction and time estimation of recent and deep divergences in butterflies. Results: Sequences from ultraviolet-sensitive (UVRh), blue-sensitive (BRh), and long-wavelength sensitive (LWRh) opsins, EF-1α and COI were obtained from 27 taxa representing the five major butterfly families (5535 bp total). Both BRh and LWRh are present in multiple copies in some butterfly lineages and the different copies evolve at different rates. Regardless of the phylogenetic reconstruction method used, we found that analyses of combined data sets using either slower or faster evolving copies of duplicate genes resulted in a single topology in agreement with our current understanding of butterfly family relationships based on morphology and molecules. Interestingly, individual analyses of BRh and LWRh sequences also recovered these family-level relationships. Two different relaxed clock methods resulted in similar divergence time estimates at the shallower nodes in the tree, regardless of whether faster or slower evolving copies were used, with larger discrepancies observed at deeper nodes in the phylogeny. The time of divergence between the monarch butterfly Danaus plexippus and the queen D. gilippus (15.3–35.6 Mya) was found to be much older than the time of divergence between monarch co-mimic Limenitis archippus and red-spotted purple L. arthemis (4.7–13.6 Mya), and overlapping with the time of divergence of the co-mimetic passionflower butterflies Heliconius erato and H. melpomene (13.5–26.1 Mya). Our family-level results are congruent with recent estimates found in the literature and indicate an age of 84–113 million years for the divergence of all butterfly families. Conclusion: These results are consistent with diversification of the butterfly families following the radiation of angiosperms and suggest that some classes of opsin genes may be usefully employed for both phylogenetic reconstruction and divergence time estimation. PMID:19439087

  14. An improved approximate-Bayesian model-choice method for estimating shared evolutionary history

    PubMed Central

    2014-01-01

    Background: To understand biological diversification, it is important to account for large-scale processes that affect the evolutionary history of groups of co-distributed populations of organisms. Such events predict temporally clustered divergence times, a pattern that can be estimated using genetic data from co-distributed species. I introduce a new approximate-Bayesian method for comparative phylogeographical model-choice that estimates the temporal distribution of divergences across taxa from multi-locus DNA sequence data. The model is an extension of that implemented in msBayes. Results: By reparameterizing the model, introducing more flexible priors on demographic and divergence-time parameters, and implementing a non-parametric Dirichlet-process prior over divergence models, I improved the robustness, accuracy, and power of the method for estimating shared evolutionary history across taxa. Conclusions: The results demonstrate that the improved performance of the new method is due to (1) more appropriate priors on divergence-time and demographic parameters that avoid prohibitively small marginal likelihoods for models with more divergence events, and (2) the Dirichlet-process providing a flexible prior on divergence histories that does not strongly disfavor models with intermediate numbers of divergence events. The new method yields more robust estimates of posterior uncertainty, and thus greatly reduces the tendency to incorrectly estimate models of shared evolutionary history with strong support. PMID:24992937

  15. Bayesian estimation of post-Messinian divergence times in Balearic Island lizards.

    PubMed

    Brown, R P; Terrasa, B; Pérez-Mellado, V; Castro, J A; Hoskisson, P A; Picornell, A; Ramon, M M

    2008-07-01

    Phylogenetic relationships and timings of major cladogenesis events are investigated in the Balearic Island lizards Podarcis lilfordi and P. pityusensis using 2675 bp of mitochondrial and nuclear DNA sequences. Partitioned Bayesian and Maximum Parsimony analyses provided a well-resolved phylogeny with high node-support values. Bayesian MCMC estimation of node dates was investigated by comparing means of posterior distributions from different subsets of the sequence against the most robust analysis which used multiple partitions and allowed for rate heterogeneity among branches under a rate-drift model. Evolutionary rates were systematically underestimated and thus divergence times overestimated when sequences containing lower numbers of variable sites were used (based on ingroup node constraints). The following analyses allowed the best recovery of node times under the constant-rate (i.e., perfect clock) model: (i) all cytochrome b sequence (partitioned by codon position), (ii) cytochrome b (codon position 3 alone), (iii) NADH dehydrogenase (subunits 1 and 2; partitioned by codon position), (iv) cytochrome b and NADH dehydrogenase sequence together (six gene-codon partitions), (v) all unpartitioned sequence, (vi) a full multipartition analysis (nine partitions). Of these, only (iv) and (vi) performed well under the rate-drift model. These findings have significant implications for dating of recent divergence times in other taxa. The earliest P. lilfordi cladogenesis event (divergence of Menorcan populations) occurred before the end of the Pliocene, some 2.6 Ma. Subsequent events led to a West Mallorcan lineage (2.0 Ma ago), followed 1.2 Ma ago by divergence of populations from the southern part of the Cabrera archipelago from a widely-distributed group from north Cabrera, northern and southern Mallorcan islets. Divergence within P. pityusensis is more recent with the main Ibiza and Formentera clades sharing a common ancestor at about 1.0 Ma ago. Climatic and sea level changes are likely to have initiated cladogenesis, with lineages making secondary contact during periodic landbridge formation. This oscillating cross-archipelago pattern in which ancient divergence is followed by repeated contact resembles that seen between East-West refugia populations from mainland Europe.

  16. Variable Autosomal and X Divergence Near and Far from Genes Affects Estimates of Male Mutation Bias in Great Apes

    PubMed Central

    Narang, Pooja; Wilson Sayres, Melissa A.

    2016-01-01

    Male mutation bias, when more mutations are passed on via the male germline than via the female germline, is observed across mammals. One common way to infer the magnitude of male mutation bias, α, is to compare levels of neutral sequence divergence between genomic regions that spend different amounts of time in the male and female germline. For great apes, including human, we show that estimates of divergence are reduced in putatively unconstrained regions near genes relative to unconstrained regions far from genes. Divergence increases with increasing distance from genes on both the X chromosome and autosomes, but increases faster on the X chromosome than autosomes. As a result, ratios of X/A divergence increase with increasing distance from genes and corresponding estimates of male mutation bias are significantly higher in intergenic regions near genes versus far from genes. Future studies in other species will need to carefully consider the effect that genomic location will have on estimates of male mutation bias. PMID:27702816
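
    The link between the X/A divergence ratio and α mentioned in the abstract above can be illustrated with the textbook relation usually attributed to Miyata and colleagues. This is a hedged sketch: the abstract does not state which estimator the authors used, so the formula and the function name `alpha_from_x_to_a_ratio` are assumptions made for illustration. The relation assumes the X chromosome spends two-thirds of its time in females and one-third in males, while autosomes spend half in each.

```python
def alpha_from_x_to_a_ratio(ratio):
    """Estimate male mutation bias (alpha = male rate / female rate)
    from the ratio of X-linked to autosomal neutral divergence.

    Assumed model:
        X/A = 2 * (2 + alpha) / (3 * (1 + alpha))
    which rearranges to
        alpha = (4 - 3 * ratio) / (3 * ratio - 2).

    The model only yields a finite, non-negative alpha when
    2/3 < ratio <= 4/3, so other inputs are rejected.
    """
    if not (2.0 / 3.0 < ratio <= 4.0 / 3.0):
        raise ValueError("X/A ratio must lie in (2/3, 4/3] under this model")
    return (4.0 - 3.0 * ratio) / (3.0 * ratio - 2.0)


if __name__ == "__main__":
    # Illustrative ratios only, not values from the paper.
    for ratio in (0.80, 0.85, 0.90):
        print(ratio, alpha_from_x_to_a_ratio(ratio))
```

    Under this relation a larger X/A ratio gives a smaller α, which matches the direction reported in the abstract: X/A rises with distance from genes, and α estimates are higher near genes than far from them.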

  17. Testing the molecular clock using mechanistic models of fossil preservation and molecular evolution

    PubMed Central

    2017-01-01

    Molecular sequence data provide information about relative times only, and fossil-based age constraints are the ultimate source of information about absolute times in molecular clock dating analyses. Thus, fossil calibrations are critical to molecular clock dating, but competing methods are difficult to evaluate empirically because the true evolutionary time scale is never known. Here, we combine mechanistic models of fossil preservation and sequence evolution in simulations to evaluate different approaches to constructing fossil calibrations and their impact on Bayesian molecular clock dating, and the relative impact of fossil versus molecular sampling. We show that divergence time estimation is impacted by the model of fossil preservation, sampling intensity and tree shape. The addition of sequence data may improve molecular clock estimates, but accuracy and precision is dominated by the quality of the fossil calibrations. Posterior means and medians are poor representatives of true divergence times; posterior intervals provide a much more accurate estimate of divergence times, though they may be wide and often do not have high coverage probability. Our results highlight the importance of increased fossil sampling and improved statistical approaches to generating calibrations, which should incorporate the non-uniform nature of ecological and temporal fossil species distributions. PMID:28637852

  18. Recent African origin of modern humans revealed by complete sequences of hominoid mitochondrial DNAs.

    PubMed Central

    Horai, S; Hayasaka, K; Kondo, R; Tsugane, K; Takahata, N

    1995-01-01

    We analyzed the complete mitochondrial DNA (mtDNA) sequences of three humans (African, European, and Japanese), three African apes (common and pygmy chimpanzees, and gorilla), and one orangutan in an attempt to estimate most accurately the substitution rates and divergence times of hominoid mtDNAs. Nonsynonymous substitutions and substitutions in RNA genes have accumulated with an approximately clock-like regularity. From these substitutions and under the assumption that the orangutan and African apes diverged 13 million years ago, we obtained a divergence time for humans and chimpanzees of 4.9 million years. This divergence time permitted calibration of the synonymous substitution rate (3.89 x 10(-8)/site per year). To obtain the substitution rate in the displacement (D)-loop region, we compared the three human mtDNAs and measured the relative abundance of substitutions in the D-loop region and at synonymous sites. The estimated substitution rate in the D-loop region was 7.00 x 10(-8)/site per year. Using both synonymous and D-loop substitutions, we inferred the age of the last common ancestor of the human mtDNAs as 143,000 +/- 18,000 years. The shallow ancestry of human mtDNAs, together with the observation that the African sequence is the most diverged among humans, strongly supports the recent African origin of modern humans, Homo sapiens sapiens. PMID:7530363
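
    The calibrate-then-date arithmetic described in this abstract can be sketched as follows. This is a minimal illustration under a strict-clock assumption, not the authors' procedure: it assumes the rate is expressed per lineage (so two lineages separated for T years differ by 2rT substitutions per site), and the function names and input distances are hypothetical values chosen for round arithmetic.

```python
def calibrate_rate(pairwise_distance, split_time_years):
    """Per-lineage substitution rate implied by a pairwise distance
    and an externally known split time (strict clock assumed)."""
    if split_time_years <= 0:
        raise ValueError("split time must be positive")
    return pairwise_distance / (2.0 * split_time_years)


def divergence_time(pairwise_distance, rate_per_site_per_year):
    """Divergence time implied by a pairwise distance and a per-lineage rate."""
    if rate_per_site_per_year <= 0:
        raise ValueError("rate must be positive")
    return pairwise_distance / (2.0 * rate_per_site_per_year)


# Hypothetical inputs, NOT values from the study:
calibration_distance = 0.13   # substitutions/site between the calibration pair
calibration_time = 13e6       # assumed split time of the calibration pair (years)
query_distance = 0.04         # substitutions/site between the pair to be dated

rate = calibrate_rate(calibration_distance, calibration_time)
print(f"calibrated rate: {rate:.3e} per site per year")
print(f"estimated split: {divergence_time(query_distance, rate):.3e} years")
```

    The same two functions apply to any class of sites (synonymous, D-loop, and so on), provided the distance and the rate refer to the same class.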

  19. DNA barcoding reveals species level divergence between populations of the microhylid frog genus Arcovomer (Anura: Microhylidae) in the Atlantic Rainforest of southeastern Brazil.

    PubMed

    Jennings, W Bryan; Wogel, Henrique; Bilate, Marcos; Salles, Rodrigo de O L; Buckup, Paulo A

    2016-09-01

    The microhylid frogs belonging to the genus Arcovomer have been reported from lowland Atlantic Rainforest in the Brazilian states of Espírito Santo, Rio de Janeiro, and São Paulo. Here, we use DNA barcoding to assess levels of genetic divergence between apparently isolated populations in Espírito Santo and Rio de Janeiro. Our mtDNA data, consisting of cytochrome oxidase subunit I (COI) nucleotide sequences, reveal 13.2% uncorrected and 30.4% TIM2 + I + Γ corrected genetic divergences between these two populations. This level of divergence exceeds the suggested 10% uncorrected divergence threshold for elevating amphibian populations to candidate species using this marker, which implies that the Espírito Santo population is a species distinct from Arcovomer passarellii. Calibration of our model-corrected sequence divergence estimates suggests that the time of population divergence falls between 12 and 29 million years ago.
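
    The gap between uncorrected and model-corrected divergence can be illustrated with a much simpler correction than the one used in the study. The sketch below applies the Jukes-Cantor (JC69) formula, which is a swapped-in stand-in for TIM2 + I + Γ and assumes equal base frequencies and a single substitution rate; the function name is hypothetical.

```python
import math


def jc69_distance(p):
    """Jukes-Cantor (1969) distance from the uncorrected proportion
    of differing sites p: d = -(3/4) * ln(1 - 4p/3)."""
    if not 0.0 <= p < 0.75:
        raise ValueError("p must be in [0, 0.75) for the JC69 correction")
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


p_uncorrected = 0.132  # uncorrected COI divergence quoted in the abstract
d_jc = jc69_distance(p_uncorrected)
print(f"uncorrected: {p_uncorrected:.3f}  JC69-corrected: {d_jc:.3f}")
```

    JC69 raises the estimate only modestly. Models that also allow unequal rates, invariant sites, and gamma-distributed rate variation generally infer more hidden substitutions, which is consistent with the much larger corrected value reported above.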

  20. The fossilized birth–death process for coherent calibration of divergence-time estimates

    PubMed Central

    Heath, Tracy A.; Huelsenbeck, John P.; Stadler, Tanja

    2014-01-01

    Time-calibrated species phylogenies are critical for addressing a wide range of questions in evolutionary biology, such as those that elucidate historical biogeography or uncover patterns of coevolution and diversification. Because molecular sequence data are not informative on absolute time, external data—most commonly, fossil age estimates—are required to calibrate estimates of species divergence dates. For Bayesian divergence time methods, the common practice for calibration using fossil information involves placing arbitrarily chosen parametric distributions on internal nodes, often disregarding most of the information in the fossil record. We introduce the “fossilized birth–death” (FBD) process—a model for calibrating divergence time estimates in a Bayesian framework, explicitly acknowledging that extant species and fossils are part of the same macroevolutionary process. Under this model, absolute node age estimates are calibrated by a single diversification model and arbitrary calibration densities are not necessary. Moreover, the FBD model allows for inclusion of all available fossils. We performed analyses of simulated data and show that node age estimation under the FBD model results in robust and accurate estimates of species divergence times with realistic measures of statistical uncertainty, overcoming major limitations of standard divergence time estimation methods. We used this model to estimate the speciation times for a dataset composed of all living bears, indicating that the genus Ursus diversified in the Late Miocene to Middle Pliocene. PMID:25009181

  1. A Generalized Least-Squares Estimate for the Origin of Sporophytic Self-Incompatibility

    PubMed Central

    Uyenoyama, M. K.

    1995-01-01

    Analysis of nucleotide sequences that regulate the expression of self-incompatibility in flowering plants affords a direct means of examining classical hypotheses for the origin and evolution of this major feature of mating systems. Departing from the classical view of monophyly of all forms of self-incompatibility, the current paradigm for the origin of self-incompatibility postulates multiple episodes of recruitment and modification of preexisting genes. In Brassica, the S locus, which regulates sporophytic self-incompatibility, shows homology to a multigene family present both in self-compatible congeners and in groups for which this form of self-incompatibility is atypical. A phylogenetic analysis of S-allele sequences together with homologous sequences that do not cosegregate with self-incompatibility permits dating the change of function that marked the origin of self-incompatibility. A generalized least-squares method is introduced that provides closed-form expressions for estimates and standard errors for function-specific divergence rates and times of divergence among sequences. This analysis suggests that the age of the sporophytic self-incompatibility system expressed in Brassica exceeds species divergence within the genus by four- to fivefold. The extraordinarily high levels of sequence diversity exhibited by S alleles appear to reflect their ancient derivation, with the alternative hypothesis of hypermutability rejected by the analysis. PMID:7713446

  2. Phylogeographic patterns of genetic diversity in eastern Mediterranean water frogs have been determined by geological processes and climate change in the Late Cenozoic.

    PubMed

    Akın, Ciğdem; Bilgin, C Can; Beerli, Peter; Westaway, Rob; Ohst, Torsten; Litvinchuk, Spartak N; Uzzell, Thomas; Bilgin, Metin; Hotz, Hansjürg; Guex, Gaston-Denis; Plötner, Jörg

    2010-11-01

    AIM: Our aims were to assess the phylogeographic patterns of genetic diversity in eastern Mediterranean water frogs and to estimate divergence times using different geological scenarios. We related divergence times to past geological events and discuss the relevance of our data for the systematics of eastern Mediterranean water frogs. LOCATION: The eastern Mediterranean region. METHODS: Genetic diversity and divergence were calculated using sequences of two protein-coding mitochondrial (mt) genes: ND2 (1038 bp, 119 sequences) and ND3 (340 bp, 612 sequences). Divergence times were estimated in a Bayesian framework under four geological scenarios representing alternative possible geological histories for the eastern Mediterranean. We then compared the different scenarios using Bayes factors and additional geological data. RESULTS: Extensive genetic diversity in mtDNA divides eastern Mediterranean water frogs into six main haplogroups (MHG). Three MHGs were identified on the Anatolian mainland; the most widespread MHG with the highest diversity is distributed from western Anatolia to the northern shore of the Caspian Sea, including the type locality of Pelophylax ridibundus. The other two Anatolian MHGs are restricted to south-eastern Turkey, occupying localities west and east of the Amanos mountain range. One of the remaining three MHGs is restricted to Cyprus; a second to the Levant; the third was found in the distribution area of European lake frogs (P. ridibundus group), including the Balkans. MAIN CONCLUSIONS: Based on geological evidence and estimates of genetic divergence we hypothesize that the water frogs of Cyprus have been isolated from the Anatolian mainland populations since the end of the Messinian salinity crisis (MSC), i.e. since c. 5.5-5.3 Ma, while our divergence time estimates indicate that the isolation of Crete from the mainland populations (Peloponnese, Anatolia) most likely pre-dates the MSC. 
    The observed rates of divergence imply a time window of c. 1.6-1.1 million years for diversification of the largest Anatolian MHG; divergence between the two other Anatolian MHGs may have begun about 3.0 Ma, apparently as a result of uplift of the Amanos Mountains. Our mtDNA data suggest that the Anatolian water frogs and frogs from Cyprus represent several undescribed species.

  3. Comparative analysis of gene regulatory networks: from network reconstruction to evolution.

    PubMed

    Thompson, Dawn; Regev, Aviv; Roy, Sushmita

    2015-01-01

    Regulation of gene expression is central to many biological processes. Although reconstruction of regulatory circuits from genomic data alone is therefore desirable, this remains a major computational challenge. Comparative approaches that examine the conservation and divergence of circuits and their components across strains and species can help reconstruct circuits as well as provide insights into the evolution of gene regulatory processes and their adaptive contribution. In recent years, advances in genomic and computational tools have led to a wealth of methods for such analysis at the sequence, expression, pathway, module, and entire network level. Here, we review computational methods developed to study transcriptional regulatory networks using comparative genomics, from sequence to functional data. We highlight how these methods use evolutionary conservation and divergence to reliably detect regulatory components as well as estimate the extent and rate of divergence. Finally, we discuss the promise and open challenges in linking regulatory divergence to phenotypic divergence and adaptation.

  4. Low X/Y divergence in four pairs of papaya sex-linked genes.

    PubMed

    Yu, Qingyi; Hou, Shaobin; Feltus, F Alex; Jones, Meghan R; Murray, Jan E; Veatch, Olivia; Lemke, Cornelia; Saw, Jimmy H; Moore, Richard C; Thimmapuram, Jyothi; Liu, Lei; Moore, Paul H; Alam, Maqsudul; Jiang, Jiming; Paterson, Andrew H; Ming, Ray

    2008-01-01

    Sex chromosomes in flowering plants, in contrast to those in animals, evolved relatively recently and only a few are heteromorphic. The homomorphic sex chromosomes of papaya show features of incipient sex chromosome evolution. We investigated the features of paired X- and Y-specific bacterial artificial chromosomes (BACs), and estimated the time of divergence in four pairs of sex-linked genes. We report the results of a comparative analysis of long contiguous genomic DNA sequences between the X and hermaphrodite Y (Y(h)) chromosomes. Numerous chromosomal rearrangements were detected in the male-specific region of the Y chromosome (MSY), including inversions, deletions, insertions, duplications and translocations, showing the dynamic evolutionary process on the MSY after recombination ceased. DNA sequence expansion was documented in the two regions of the MSY, demonstrating that the cytologically homomorphic sex chromosomes are heteromorphic at the molecular level. Analysis of sequence divergence between four X and Y(h) gene pairs resulted in an estimated age of divergence of between 0.5 and 2.2 million years, supporting a recent origin of the papaya sex chromosomes. Our findings indicate that sex chromosomes did not evolve at the family level in Caricaceae, and reinforce the theory that sex chromosomes evolve at the species level in some lineages.

  5. Mitogenome Sequencing in the Genus Camelus Reveals Evidence for Purifying Selection and Long-term Divergence between Wild and Domestic Bactrian Camels.

    PubMed

    Mohandesan, Elmira; Fitak, Robert R; Corander, Jukka; Yadamsuren, Adiya; Chuluunbat, Battsetseg; Abdelhadi, Omer; Raziq, Abdul; Nagy, Peter; Stalder, Gabrielle; Walzer, Chris; Faye, Bernard; Burger, Pamela A

    2017-08-30

    The genus Camelus is an interesting model to study adaptive evolution in the mitochondrial genome, as the three extant Old World camel species inhabit hot and low-altitude as well as cold and high-altitude deserts. We sequenced 24 camel mitogenomes and combined them with three previously published sequences to study the role of natural selection under different environmental pressure, and to advance our understanding of the evolutionary history of the genus Camelus. We confirmed the heterogeneity of divergence across different components of the electron transport system. Lineage-specific analysis of mitochondrial protein evolution revealed a significant effect of purifying selection in the concatenated protein-coding genes in domestic Bactrian camels. The estimated dN/dS < 1 in the concatenated protein-coding genes suggested purifying selection as driving force for shaping mitogenome diversity in camels. Additional analyses of the functional divergence in amino acid changes between species-specific lineages indicated fixed substitutions in various genes, with radical effects on the physicochemical properties of the protein products. The evolutionary time estimates revealed a divergence between domestic and wild Bactrian camels around 1.1 [0.58-1.8] million years ago (mya). This has major implications for the conservation and management of the critically endangered wild species, Camelus ferus.

  6. Testing the molecular clock using mechanistic models of fossil preservation and molecular evolution.

    PubMed

    Warnock, Rachel C M; Yang, Ziheng; Donoghue, Philip C J

    2017-06-28

    Molecular sequence data provide information about relative times only, and fossil-based age constraints are the ultimate source of information about absolute times in molecular clock dating analyses. Thus, fossil calibrations are critical to molecular clock dating, but competing methods are difficult to evaluate empirically because the true evolutionary time scale is never known. Here, we combine mechanistic models of fossil preservation and sequence evolution in simulations to evaluate different approaches to constructing fossil calibrations and their impact on Bayesian molecular clock dating, and the relative impact of fossil versus molecular sampling. We show that divergence time estimation is impacted by the model of fossil preservation, sampling intensity and tree shape. The addition of sequence data may improve molecular clock estimates, but accuracy and precision are dominated by the quality of the fossil calibrations. Posterior means and medians are poor representatives of true divergence times; posterior intervals provide a much more accurate estimate of divergence times, though they may be wide and often do not have high coverage probability. Our results highlight the importance of increased fossil sampling and improved statistical approaches to generating calibrations, which should incorporate the non-uniform nature of ecological and temporal fossil species distributions. © 2017 The Authors.

  7. More reliable estimates of divergence times in Pan using complete mtDNA sequences and accounting for population structure.

    PubMed

    Stone, Anne C; Battistuzzi, Fabia U; Kubatko, Laura S; Perry, George H; Trudeau, Evan; Lin, Hsiuman; Kumar, Sudhir

    2010-10-27

    Here, we report the sequencing and analysis of eight complete mitochondrial genomes of chimpanzees (Pan troglodytes) from each of the three established subspecies (P. t. troglodytes, P. t. schweinfurthii and P. t. verus) and the proposed fourth subspecies (P. t. ellioti). Our population genetic analyses are consistent with neutral patterns of evolution that have been shaped by demography. The high levels of mtDNA diversity in western chimpanzees are unlike those seen at nuclear loci, which may reflect a demographic history of greater female to male effective population sizes possibly owing to the characteristics of the founding population. By using relaxed-clock methods, we have inferred a timetree of chimpanzee species and subspecies. The absolute divergence times vary based on the methods and calibration used, but relative divergence times show extensive uniformity. Overall, mtDNA produces consistently older times than those known from nuclear markers, a discrepancy that is reduced significantly by explicitly accounting for chimpanzee population structures in time estimation. Assuming the human-chimpanzee split to be between 7 and 5 Ma, chimpanzee time estimates are 2.1-1.5, 1.1-0.76 and 0.25-0.18 Ma for the chimpanzee/bonobo, western/(eastern + central) and eastern/central chimpanzee divergences, respectively.

  8. Theoretical Foundation of the RelTime Method for Estimating Divergence Times from Variable Evolutionary Rates

    PubMed Central

    Tamura, Koichiro; Tao, Qiqing; Kumar, Sudhir

    2018-01-01

    Abstract RelTime estimates divergence times by relaxing the assumption of a strict molecular clock in a phylogeny. It shows excellent performance in estimating divergence times for both simulated and empirical molecular sequence data sets in which evolutionary rates varied extensively throughout the tree. RelTime is computationally efficient and scales well with increasing size of data sets. Until now, however, RelTime has not had a formal mathematical foundation. Here, we show that the basis of the RelTime approach is a relative rate framework (RRF) that combines comparisons of evolutionary rates in sister lineages with the principle of minimum rate change between evolutionary lineages and their respective descendants. We present analytical solutions for estimating relative lineage rates and divergence times under RRF. We also discuss the relationship of RRF with other approaches, including the Bayesian framework. We conclude that RelTime will be useful for phylogenies with branch lengths derived not only from molecular data, but also morphological and biochemical traits. PMID:29893954

  9. Population genetics of polymorphism and divergence for diploid selection models with arbitrary dominance.

    PubMed

    Williamson, Scott; Fledel-Alon, Adi; Bustamante, Carlos D

    2004-09-01

    We develop a Poisson random-field model of polymorphism and divergence that allows arbitrary dominance relations in a diploid context. This model provides a maximum-likelihood framework for estimating both selection and dominance parameters of new mutations using information on the frequency spectrum of sequence polymorphisms. This is the first DNA sequence-based estimator of the dominance parameter. Our model also leads to a likelihood-ratio test for distinguishing nongenic from genic selection; simulations indicate that this test is quite powerful when a large number of segregating sites are available. We also use simulations to explore the bias in selection parameter estimates caused by unacknowledged dominance relations. When inference is based on the frequency spectrum of polymorphisms, genic selection estimates of the selection parameter can be very strongly biased even for minor deviations from the genic selection model. Surprisingly, however, when inference is based on polymorphism and divergence (McDonald-Kreitman) data, genic selection estimates of the selection parameter are nearly unbiased, even for completely dominant or recessive mutations. Further, we find that weak overdominant selection can increase, rather than decrease, the substitution rate relative to levels of polymorphism. This nonintuitive result has major implications for the interpretation of several popular tests of neutrality.

  10. Next generation sequencing and analysis of a conserved transcriptome of New Zealand's kiwi.

    PubMed

    Subramanian, Sankar; Huynen, Leon; Millar, Craig D; Lambert, David M

    2010-12-15

    Kiwi is a highly distinctive, flightless and endangered ratite bird endemic to New Zealand. To understand the patterns of molecular evolution of the nuclear protein-coding genes in brown kiwi (Apteryx australis mantelli) and to determine the timescale of avian history we sequenced a transcriptome obtained from a kiwi embryo using next generation sequencing methods. We then assembled the conserved protein-coding regions using the chicken proteome as a scaffold. Using 1,543 conserved protein coding genes we estimated the neutral evolutionary divergence between the kiwi and chicken to be ~45%, which is approximately equal to the divergence computed for the human-mouse pair using the same set of genes. A large fraction of genes was found to be under high selective constraint, as most of the expressed genes appeared to be involved in developmental gene regulation. Our study suggests a significant relationship between gene expression levels and protein evolution. Using sequences from over 700 nuclear genes we estimated the divergence between the two basal avian groups, Palaeognathae and Neognathae to be 132 million years, which is consistent with previous studies using mitochondrial genes. The results of this investigation revealed patterns of mutation and purifying selection in conserved protein coding regions in birds. Furthermore this study suggests a relatively cost-effective way of obtaining a glimpse into the fundamental molecular evolutionary attributes of a genome, particularly when no closely related genomic sequence is available.

  11. Genome Evolution in the Primary Endosymbiont of Whiteflies Sheds Light on Their Divergence

    PubMed Central

    Santos-Garcia, Diego; Vargas-Chavez, Carlos; Moya, Andrés; Latorre, Amparo; Silva, Francisco J.

    2015-01-01

    Whiteflies are important agricultural insect pests, whose evolutionary success is related to a long-term association with a bacterial endosymbiont, Candidatus Portiera aleyrodidarum. To completely characterize this endosymbiont clade, we sequenced the genomes of three new Portiera strains covering the two extant whitefly subfamilies. Using endosymbiont and mitochondrial sequences we estimated the divergence dates in the clade and used these values to understand the molecular evolution of the endosymbiont coding sequences. Portiera genomes were maintained almost completely stable in gene order and gene content during more than 125 Myr of evolution, except in the Bemisia tabaci lineage. The ancestor had already lost the genetic information transfer autonomy but was able to participate in the synthesis of all essential amino acids and carotenoids. The time of divergence of the B. tabaci complex was much more recent than previous estimations. The recent divergence of biotypes B (MEAM1 species) and Q (MED species) suggests that they still could be considered strains of the same species. We have estimated the rates of evolution of Portiera genes, synonymous and nonsynonymous, and have detected significant differences among lineages, with most Portiera lineages evolving very slowly. Although the nonsynonymous rates were much smaller than the synonymous, the genomic dN/dS ratios were similar, discarding selection as the driver of among-lineage variation. We suggest variation in mutation rate and generation time as the responsible factors. In conclusion, the slow evolutionary rates of Portiera may have contributed to its long-term association with whiteflies, avoiding its replacement by a novel and more efficient endosymbiont. PMID:25716826

  12. High diversity and rapid diversification in the head louse, Pediculus humanus (Pediculidae: Phthiraptera)

    PubMed Central

    Ashfaq, Muhammad; Prosser, Sean; Nasir, Saima; Masood, Mariyam; Ratnasingham, Sujeevan; Hebert, Paul D. N.

    2015-01-01

    The study analyzes sequence variation of two mitochondrial genes (COI, cytb) in Pediculus humanus from three countries (Egypt, Pakistan, South Africa) that have received little prior attention, and integrates these results with prior data. Analysis indicates a maximum K2P distance of 10.3% among 960 COI sequences and 13.8% among 479 cytb sequences. Three analytical methods (BIN, PTP, ABGD) reveal five concordant OTUs for COI and cytb. Neighbor-Joining analysis of the COI sequences confirms five clusters; three corresponding to previously recognized mitochondrial clades A, B, C and two new clades, “D” and “E”, showing 2.3% and 2.8% divergence from their nearest neighbors (NN). Cytb data corroborate five clusters showing that clades “D” and “E” are both 4.6% divergent from their respective NN clades. Phylogenetic analysis supports the monophyly of all clusters recovered by NJ analysis. Divergence time estimates suggest that the earliest split of P. humanus clades occurred slightly more than one million years ago (MYa) and the latest about 0.3 MYa. Sequence divergences in COI and cytb among the five clades of P. humanus are 10X those in their human host, a difference that likely reflects both rate acceleration and the acquisition of lice clades from several archaic hominid lineages. PMID:26373806
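
    The K2P (Kimura two-parameter) distance quoted in this abstract can be computed from a pair of aligned sequences as sketched below. This is a minimal illustration, not the pipeline used in the study: the function name is hypothetical, and the choice to skip any site with a gap or ambiguity code is an assumption made here.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(seq1, seq2):
    """Kimura two-parameter distance between two aligned DNA sequences:
    d = -(1/2) ln(1 - 2P - Q) - (1/4) ln(1 - 2Q),
    where P and Q are the proportions of transitions and transversions."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    transitions = transversions = compared = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue  # assumption: ignore gaps and ambiguity codes
        compared += 1
        if a == b:
            continue
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1      # purine<->purine or pyrimidine<->pyrimidine
        else:
            transversions += 1    # purine<->pyrimidine
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    term1 = 1.0 - 2.0 * p - q
    term2 = 1.0 - 2.0 * q
    if term1 <= 0 or term2 <= 0:
        raise ValueError("sequences too divergent for the K2P correction")
    return -0.5 * math.log(term1) - 0.25 * math.log(term2)


# Toy alignment (hypothetical): one transition and one transversion in 10 sites.
print(round(k2p_distance("AAAAAAAAAA", "GAAAAAAAAC"), 4))
```

    Taking the maximum of this quantity over all sequence pairs within a marker would give a figure comparable in kind to the maximum K2P distances reported above.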

  13. DNA barcoding for effective biodiversity assessment of a hyperdiverse arthropod group: the ants of Madagascar

    PubMed Central

    Smith, M. Alex; Fisher, Brian L; Hebert, Paul D.N

    2005-01-01

    The role of DNA barcoding as a tool to accelerate the inventory and analysis of diversity for hyperdiverse arthropods is tested using ants in Madagascar. We demonstrate how DNA barcoding helps address the failure of current inventory methods to rapidly respond to pressing biodiversity needs, specifically in the assessment of richness and turnover across landscapes with hyperdiverse taxa. In a comparison of inventories at four localities in northern Madagascar, patterns of richness were not significantly different when richness was determined using morphological taxonomy (morphospecies) or sequence divergence thresholds (Molecular Operational Taxonomic Unit(s); MOTU). However, sequence-based methods tended to yield greater richness and significantly lower indices of similarity than morphological taxonomy. MOTU determined using our molecular technique were a remarkably local phenomenon—indicative of highly restricted dispersal and/or long-term isolation. In cases where molecular and morphological methods differed in their assignment of individuals to categories, the morphological estimate was always more conservative than the molecular estimate. In those cases where morphospecies descriptions collapsed distinct molecular groups, sequence divergences of 16% (on average) were contained within the same morphospecies. Such high divergences highlight taxa for further detailed genetic, morphological, life history, and behavioral studies. PMID:16214741

  14. Variable Autosomal and X Divergence Near and Far from Genes Affects Estimates of Male Mutation Bias in Great Apes.

    PubMed

    Narang, Pooja; Wilson Sayres, Melissa A

    2016-12-31

    Male mutation bias, when more mutations are passed on via the male germline than via the female germline, is observed across mammals. One common way to infer the magnitude of male mutation bias, α, is to compare levels of neutral sequence divergence between genomic regions that spend different amounts of time in the male and female germline. For great apes, including human, we show that estimates of divergence are reduced in putatively unconstrained regions near genes relative to unconstrained regions far from genes. Divergence increases with increasing distance from genes on both the X chromosome and autosomes, but increases faster on the X chromosome than autosomes. As a result, ratios of X/A divergence increase with increasing distance from genes and corresponding estimates of male mutation bias are significantly higher in intergenic regions near genes versus far from genes. Future studies in other species will need to carefully consider the effect that genomic location will have on estimates of male mutation bias. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
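
    The link between X/A divergence ratios and α can be made concrete with the standard expectation usually attributed to Miyata and colleagues: the X chromosome spends two thirds of its time in females and autosomes spend half, so X/A = (2/3)(2 + α)/(1 + α). The sketch below inverts that relationship. It is an illustration under that simple model only (no correction for ancestral polymorphism or other factors), the function names are hypothetical, and the example ratios are made up.

```python
def xa_ratio_from_alpha(alpha):
    """Expected X/A divergence ratio for a male-to-female mutation
    rate ratio alpha: (2/3) * (2 + alpha) / (1 + alpha)."""
    if alpha < 0:
        raise ValueError("alpha must be non-negative")
    return (2.0 / 3.0) * (2.0 + alpha) / (1.0 + alpha)


def alpha_from_xa_ratio(ratio):
    """Invert the relationship above: alpha = (4 - 3R) / (3R - 2).
    Defined for 2/3 < R <= 4/3 (R -> 2/3 as alpha -> infinity)."""
    if not (2.0 / 3.0 < ratio <= 4.0 / 3.0):
        raise ValueError("X/A ratio must lie in (2/3, 4/3]")
    return (4.0 - 3.0 * ratio) / (3.0 * ratio - 2.0)


# Hypothetical X/A ratios, not values from the study.
for ratio in (0.80, 0.85, 0.90):
    print(f"X/A = {ratio:.2f}  ->  alpha = {alpha_from_xa_ratio(ratio):.2f}")
```

    Under this model a lower X/A ratio maps to a larger α, which matches the direction reported in the abstract: X/A ratios rise with distance from genes, and α estimates are higher near genes.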

  15. The impact of the rate prior on Bayesian estimation of divergence times with multiple Loci.

    PubMed

    Dos Reis, Mario; Zhu, Tianqi; Yang, Ziheng

    2014-07-01

    Bayesian methods provide a powerful way to estimate species divergence times by combining information from molecular sequences with information from the fossil record. With the explosive increase of genomic data, divergence time estimation increasingly uses data of multiple loci (genes or site partitions). Widely used computer programs to estimate divergence times use independent and identically distributed (i.i.d.) priors on the substitution rates for different loci. The i.i.d. prior is problematic. As the number of loci (L) increases, the prior variance of the average rate across all loci goes to zero at the rate 1/L. As a consequence, the rate prior dominates posterior time estimates when many loci are analyzed, and if the rate prior is misspecified, the estimated divergence times will converge to wrong values with very narrow credibility intervals. Here we develop a new prior on the locus rates based on the Dirichlet distribution that corrects the problematic behavior of the i.i.d. prior. We use computer simulation and real data analysis to highlight the differences between the old and new priors. For a dataset for six primate species, we show that with the old i.i.d. prior, if the prior rate is too high (or too low), the estimated divergence times are too young (or too old), outside the bounds imposed by the fossil calibrations. In contrast, with the new Dirichlet prior, posterior time estimates are insensitive to the rate prior and are compatible with the fossil calibrations. We re-analyzed a phylogenomic data set of 36 mammal species and show that using many fossil calibrations can alleviate the adverse impact of a misspecified rate prior to some extent. We recommend the use of the new Dirichlet prior in Bayesian divergence time estimation. [Bayesian inference, divergence time, relaxed clock, rate prior, partition analysis.]. © The Author(s) 2014. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.
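
    The 1/L behaviour of the i.i.d. prior described in this abstract follows from the variance of a sample mean and can be checked numerically. The sketch below assumes a gamma prior on each locus rate with hypothetical shape and scale values; it illustrates only the shrinking prior variance, not the Dirichlet prior proposed by the authors, and the function names are invented for this example.

```python
import random
import statistics


def analytic_var_of_mean_rate(shape, scale, n_loci):
    """Variance of the average of n_loci i.i.d. Gamma(shape, scale) rates.
    A single gamma rate has variance shape * scale**2; the mean of
    n_loci independent copies has that variance divided by n_loci."""
    return shape * scale ** 2 / n_loci


def simulated_var_of_mean_rate(shape, scale, n_loci, n_draws=2000, seed=1):
    """Monte Carlo estimate of the same quantity."""
    rng = random.Random(seed)
    means = [
        sum(rng.gammavariate(shape, scale) for _ in range(n_loci)) / n_loci
        for _ in range(n_draws)
    ]
    return statistics.pvariance(means)


# Hypothetical prior: shape 2, scale 0.5 (prior mean rate 1.0, variance 0.5).
for n_loci in (1, 10, 100):
    exact = analytic_var_of_mean_rate(2.0, 0.5, n_loci)
    approx = simulated_var_of_mean_rate(2.0, 0.5, n_loci)
    print(f"L = {n_loci:3d}  analytic = {exact:.4f}  simulated = {approx:.4f}")
```

    As L grows, the prior on the average rate becomes sharply concentrated around its mean, so a misspecified prior mean increasingly constrains the posterior times, which is the problem the abstract describes.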

  16. Fossils matter: improved estimates of divergence times in Pinus reveal older diversification.

    PubMed

    Saladin, Bianca; Leslie, Andrew B; Wüest, Rafael O; Litsios, Glenn; Conti, Elena; Salamin, Nicolas; Zimmermann, Niklaus E

    2017-04-04

    The taxonomy of pines (genus Pinus) is widely accepted and a robust gene tree based on entire plastome sequences exists. However, there is a large discrepancy in estimated divergence times of major pine clades among existing studies, mainly due to differences in fossil placement and dating methods used. We currently lack a dated molecular phylogeny that makes use of the rich pine fossil record, and this study is the first to estimate the divergence dates of pines based on a large number of fossils (21) evenly distributed across all major clades, in combination with applying both node and tip dating methods. We present a range of molecular phylogenetic trees of Pinus generated within a Bayesian framework. We find the origin of crown Pinus is likely up to 30 Myr older (Early Cretaceous) than inferred in most previous studies (Late Cretaceous) and propose generally older divergence times for major clades within Pinus than previously thought. Our age estimates vary significantly between the different dating approaches, but the results generally agree on older divergence times. We present a revised list of 21 fossils that are suitable to use in dating or comparative analyses of pines. Reliable estimates of divergence times in pines are essential if we are to link diversification processes and functional adaptation of this genus to geological events or to changing climates. In addition to older divergence times in Pinus, our results also indicate that node age estimates in pines depend on dating approaches and the specific fossil sets used, reflecting inherent differences in various dating approaches. The sets of dated phylogenetic trees of pines presented here provide a way to account for uncertainties in age estimations when applying comparative phylogenetic methods.

  17. Nucleotide sequences of immunoglobulin eta genes of chimpanzee and orangutan: DNA molecular clock and hominoid evolution

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sakoyama, Y.; Hong, K.J.; Byun, S.M.

    To determine the phylogenetic relationships among hominoids and the dates of their divergence, the complete nucleotide sequences of the constant region of the immunoglobulin eta-chain (C_eta1) genes from chimpanzee and orangutan have been determined. These sequences were compared with the human eta-chain constant-region sequence. A molecular clock (silent molecular clock), measured by the degree of sequence divergence at the synonymous (silent) positions of protein-encoding regions, was introduced for the present study. From the comparison of nucleotide sequences of α1-antitrypsin and β- and delta-globulin genes between humans and Old World monkeys, the silent molecular clock was calibrated: the mean evolutionary rate of silent substitution was determined to be 1.56 × 10^-9 substitutions per site per year. Using the silent molecular clock, the mean divergence dates of chimpanzee and orangutan from the human lineage were estimated as 6.4 ± 2.6 million years and 17.3 ± 4.5 million years, respectively. It was also shown that the evolutionary rate of primate genes is considerably slower than those of other mammalian genes.
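
    The clock arithmetic described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' procedure: it assumes a strict clock and that the reported rate applies per lineage, so that pairwise silent-site divergence accumulates at twice that rate. The function names are hypothetical, and the divergence value is back-computed from the reported date rather than taken from the paper.

```python
# Rate reported in the abstract: silent substitutions per site per year.
SILENT_RATE = 1.56e-9


def expected_divergence(t_years, rate=SILENT_RATE):
    """Pairwise silent-site divergence expected after t_years.

    Assumption: `rate` is per lineage, so two lineages together
    accumulate differences at 2 * rate.
    """
    if t_years < 0 or rate <= 0:
        raise ValueError("t_years must be >= 0 and rate must be > 0")
    return 2.0 * rate * t_years


def divergence_time_years(k_s, rate=SILENT_RATE):
    """Invert the strict-clock relation: T = K / (2 * rate)."""
    if k_s < 0 or rate <= 0:
        raise ValueError("k_s must be >= 0 and rate must be > 0")
    return k_s / (2.0 * rate)


if __name__ == "__main__":
    # Back-compute the divergence implied by the reported 6.4-million-year
    # human-chimpanzee estimate, then confirm the round trip.
    k = expected_divergence(6.4e6)
    print(f"implied silent divergence: {k:.6f} substitutions per site")
    print(f"recovered time: {divergence_time_years(k) / 1e6:.2f} million years")
```

    If the rate were instead defined per pair of lineages, the factor of 2 would be dropped and every date would double, which is why the convention needs to be checked against the original paper before reuse.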

  18. Mechanisms of haplotype divergence at the RGA08 nucleotide-binding leucine-rich repeat gene locus in wild banana (Musa balbisiana).

    PubMed

    Baurens, Franc-Christophe; Bocs, Stéphanie; Rouard, Mathieu; Matsumoto, Takashi; Miller, Robert N G; Rodier-Goud, Marguerite; MBéguié-A-MBéguié, Didier; Yahiaoui, Nabila

    2010-07-16

    Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M. balbisiana haplotypes. A large RGA08 gene cluster identified in wild banana corresponds to a highly variable genomic region between haplotypes surrounded by conserved flanking regions. High level of sequence identity (70 to 99%) of the genic and intergenic regions suggests a recent and rapid evolution of this cluster in M. balbisiana.

  19. Molecular survey of basidiomycetes and divergence time estimation: An Indian perspective

    PubMed Central

    Bhatt, Meghna; Mistri, Pankti; Joshi, Ishita; Ram, Hemal; Raval, Rinni; Thoota, Sruthi; Patel, Ankur; Raval, Dhrupa; Bhargava, Poonam; Soni, Subhash; Bagatharia, Snehal

    2018-01-01

    This study outlines the biodiversity of mushrooms of India. It reveals the molecular biodiversity and divergence time estimation of basidiomycetes from Gujarat, India. A total of 267 mushrooms were collected from 10 locations across the state. 225 ITS sequences were generated belonging to 105 species, 59 genera and 29 families. Phylogenetic analysis of Agaricaceae reveals monophyletic clade of Podaxis differentiating it from Coprinus. Further, the ancient nature of Podaxis supports the hypothesis that gasteroid forms evolved from secotioid forms. Members of Polyporaceae appeared polyphyletic. Further, our results of a close phylogenetic relationship between Trametes and Lenzites lead us to propose that the genus Trametes may be enlarged to include Lenzites. The tricholomatoid clade shows a clear demarcation for Entolomataceae. However, Lyophyllaceae and Tricholomataceae could not be distinguished clearly. Distribution studies of the mushrooms showed omnipresence of Ganoderma and Schizophyllum. Further, divergence time estimation shows that Dacrymycetes evolved in the Neoproterozoic Era and Hymenochaetales diverged from Agaricomycetes during the Silurian period. PMID:29771956

  20. Assessing DNA Barcodes for Species Identification in North American Reptiles and Amphibians in Natural History Collections.

    PubMed

    Chambers, E Anne; Hebert, Paul D N

    2016-01-01

    High rates of species discovery and loss have led to the urgent need for more rapid assessment of species diversity in the herpetofauna. DNA barcoding allows for the preliminary identification of species based on sequence divergence. Prior DNA barcoding work on reptiles and amphibians has revealed higher biodiversity counts than previously estimated due to cases of cryptic and undiscovered species. Past studies have provided DNA barcodes for just 14% of the North American herpetofauna, revealing the need for expanded coverage. This study extends the DNA barcode reference library for North American herpetofauna, assesses the utility of this approach in aiding species delimitation, and examines the correspondence between current species boundaries and sequence clusters designated by the BIN system. Sequences were obtained from 730 specimens, representing 274 species (43%) from the North American herpetofauna. Mean intraspecific divergences were 1% and 3%, while average congeneric sequence divergences were 16% and 14% in amphibians and reptiles, respectively. BIN assignments corresponded with current species boundaries in 79% of amphibians, 100% of turtles, and 60% of squamates. Deep divergences (>2%) were noted in 35% of squamate and 16% of amphibian species, and low divergences (<2%) occurred in 12% of reptiles and 23% of amphibians, patterns reflected in BIN assignments. Sequence recovery declined with specimen age, and variation in recovery success was noted among collections. Within collections, barcodes effectively flagged seven mislabeled tissues, and barcode fragments were recovered from five formalin-fixed specimens. This study demonstrates that DNA barcodes can effectively flag errors in museum collections, while BIN splits and merges reveal taxa belonging to deeply diverged or hybridizing lineages. This study is the first effort to compile a reference library of DNA barcodes for herpetofauna on a continental scale.

  1. Assessing DNA Barcodes for Species Identification in North American Reptiles and Amphibians in Natural History Collections

    PubMed Central

    Chambers, E. Anne; Hebert, Paul D. N.

    2016-01-01

    Background High rates of species discovery and loss have led to the urgent need for more rapid assessment of species diversity in the herpetofauna. DNA barcoding allows for the preliminary identification of species based on sequence divergence. Prior DNA barcoding work on reptiles and amphibians has revealed higher biodiversity counts than previously estimated due to cases of cryptic and undiscovered species. Past studies have provided DNA barcodes for just 14% of the North American herpetofauna, revealing the need for expanded coverage. Methodology/Principal Findings This study extends the DNA barcode reference library for North American herpetofauna, assesses the utility of this approach in aiding species delimitation, and examines the correspondence between current species boundaries and sequence clusters designated by the BIN system. Sequences were obtained from 730 specimens, representing 274 species (43%) from the North American herpetofauna. Mean intraspecific divergences were 1% and 3%, while average congeneric sequence divergences were 16% and 14% in amphibians and reptiles, respectively. BIN assignments corresponded with current species boundaries in 79% of amphibians, 100% of turtles, and 60% of squamates. Deep divergences (>2%) were noted in 35% of squamate and 16% of amphibian species, and low divergences (<2%) occurred in 12% of reptiles and 23% of amphibians, patterns reflected in BIN assignments. Sequence recovery declined with specimen age, and variation in recovery success was noted among collections. Within collections, barcodes effectively flagged seven mislabeled tissues, and barcode fragments were recovered from five formalin-fixed specimens. Conclusions/Significance This study demonstrates that DNA barcodes can effectively flag errors in museum collections, while BIN splits and merges reveal taxa belonging to deeply diverged or hybridizing lineages. This study is the first effort to compile a reference library of DNA barcodes for herpetofauna on a continental scale. PMID:27116180

  2. Bayesian Divergence-Time Estimation with Genome-Wide SNP Data of Sea Catfishes (Ariidae) Supports Miocene Closure of the Panamanian Isthmus.

    PubMed

    Stange, Madlen; Sánchez-Villagra, Marcelo R; Salzburger, Walter; Matschiner, Michael

    2018-01-27

    The closure of the Isthmus of Panama has long been considered to be one of the best defined biogeographic calibration points for molecular divergence-time estimation. However, geological and biological evidence has recently cast doubt on the presumed timing of the initial isthmus closure around 3 Ma but has instead suggested the existence of temporary land bridges as early as the Middle or Late Miocene. The biological evidence supporting these earlier land bridges was based either on only few molecular markers or on concatenation of genome-wide sequence data, an approach that is known to result in potentially misleading branch lengths and divergence times, which could compromise the reliability of this evidence. To allow divergence-time estimation with genomic data using the more appropriate multi-species coalescent model, we here develop a new method combining the SNP-based Bayesian species-tree inference of the software SNAPP with a molecular clock model that can be calibrated with fossil or biogeographic constraints. We validate our approach with simulations and use our method to reanalyze genomic data of Neotropical army ants (Dorylinae) that previously supported divergence times of Central and South American populations before the isthmus closure around 3 Ma. Our reanalysis with the multi-species coalescent model shifts all of these divergence times to ages younger than 3 Ma, suggesting that the older estimates supporting the earlier existence of temporary land bridges were artifacts resulting at least partially from the use of concatenation. We then apply our method to a new RAD-sequencing data set of Neotropical sea catfishes (Ariidae) and calibrate their species tree with extensive information from the fossil record. We identify a series of divergences between groups of Caribbean and Pacific sea catfishes around 10 Ma, indicating that processes related to the emergence of the isthmus led to vicariant speciation already in the Late Miocene, millions of years before the final isthmus closure. © The Author(s) 2018. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.

  3. Mitochondrial genome sequence and expression profiling for the legume pod borer Maruca vitrata (Lepidoptera: Crambidae)

    USDA-ARS's Scientific Manuscript database

    We report on the assembly of the 14,146 base pairs (bp) near complete mitochondrial sequencing of the legume pod borer (LPB), Maruca vitrata (Lepidoptera: Crambidae), which was used to estimate divergence and relationships within the lepidopteran lineage. Arrangement and orientation of 13 protein c...

  4. Phylogenetic analyses of complete mitochondrial genome sequences suggest a basal divergence of the enigmatic rodent Anomalurus

    PubMed Central

    Horner, David S; Lefkimmiatis, Konstantinos; Reyes, Aurelio; Gissi, Carmela; Saccone, Cecilia; Pesole, Graziano

    2007-01-01

    Background Phylogenetic relationships between Lagomorpha, Rodentia and Primates and their allies (Euarchontoglires) have long been debated. While it is now generally agreed that Rodentia constitutes a monophyletic sister-group of Lagomorpha and that this clade (Glires) is sister to Primates and Dermoptera, higher-level relationships within Rodentia remain contentious. Results We have sequenced and performed extensive evolutionary analyses on the mitochondrial genome of the scaly-tailed flying squirrel Anomalurus sp., an enigmatic rodent whose phylogenetic affinities have been obscure and extensively debated. Our phylogenetic analyses of the coding regions of available complete mitochondrial genome sequences from Euarchontoglires suggest that Anomalurus is a sister taxon to the Hystricognathi, and that this clade represents the most basal divergence among sampled Rodentia. Bayesian dating methods incorporating a relaxed molecular clock provide divergence-time estimates which are consistently in agreement with the fossil record and which indicate a rapid radiation within Glires around 60 million years ago. Conclusion Taken together, the data presented provide a working hypothesis as to the phylogenetic placement of Anomalurus, underline the utility of mitochondrial sequences in the resolution of even relatively deep divergences and go some way to explaining the difficulty of conclusively resolving higher-level relationships within Glires with available data and methodologies. PMID:17288612

  5. High-throughput sequencing of complete human mtDNA genomes from the Caucasus and West Asia: high diversity and demographic inferences.

    PubMed

    Schönberg, Anna; Theunert, Christoph; Li, Mingkun; Stoneking, Mark; Nasidze, Ivan

    2011-09-01

    To investigate the demographic history of human populations from the Caucasus and surrounding regions, we used high-throughput sequencing to generate 147 complete mtDNA genome sequences from random samples of individuals from three groups from the Caucasus (Armenians, Azeri and Georgians), and one group each from Iran and Turkey. Overall diversity is very high, with 144 different sequences that fall into 97 different haplogroups found among the 147 individuals. Bayesian skyline plots (BSPs) of population size change through time show a population expansion around 40-50 kya, followed by a constant population size, and then another expansion around 15-18 kya for the groups from the Caucasus and Iran. The BSP for Turkey differs the most from the others, with an increase from 35 to 50 kya followed by a prolonged period of constant population size, and no indication of a second period of growth. An approximate Bayesian computation approach was used to estimate divergence times between each pair of populations; the oldest divergence times were between Turkey and the other four groups from the South Caucasus and Iran (~400-600 generations), while the divergence time of the three Caucasus groups from each other was comparable to their divergence time from Iran (average of ~360 generations). These results illustrate the value of random sampling of complete mtDNA genome sequences that can be obtained with high-throughput sequencing platforms.

  6. High genetic diversities between isolates of the fish parasite Cryptocaryon irritans (Ciliophora) suggest multiple cryptic species.

    PubMed

    Chi, Hongshu; Taik, Patricia; Foley, Emily J; Racicot, Alycia C; Gray, Hilary M; Guzzetta, Katherine E; Lin, Hsin-Yun; Song, Yen-Ling; Tung, Che-Huang; Zenke, Kosuke; Yoshinaga, Tomoyoshi; Cheng, Chao-Yin; Chang, Wei-Jen; Gong, Hui

    2017-07-01

    The ciliate protozoan Cryptocaryon irritans parasitizes marine fish and causes lethal white spot disease. Sporadic infections as well as large-scale outbreaks have been reported globally and the parasite's broad host range poses particular threat to the aquaculture and ornamental fish markets. In order to better understand C. irritans' population structure, we sequenced and compared mitochondrial cox-1, SSU rRNA, and ITS-1 sequences from 8 new isolates of C. irritans collected in China, Japan, and Taiwan. We detected two SSU rRNA haplotypes, which differ at three positions, separating the isolates into two main groups (I and II). Cox-1 sequences also support the division into two groups, and the cox-1 divergence between these two groups is unexpectedly high (9.28% for 1582 nucleotide positions). The divergence is much greater than that detected in Ichthyophthirius multifiliis, the ciliate protozoan causing freshwater white spot disease in fish, where intraspecies divergence on cox-1 sequence is only 1.95%. ITS-1 sequences derived from these eight isolates and from all other C. irritans isolates (deposited in the GenBank) not only support the two groups, but further suggest the presence of a third group with even greater sequence divergence. Finally, a small Ka/Ks ratio estimated from cox-1 sequences suggests that this gene in C. irritans remains under strong purifying selection. Taken together, the C. irritans species may consist of many subspecies and/or syngens. Further work is needed to determine if there is reproductive isolation between the groups we have defined. Copyright © 2017 Elsevier Inc. All rights reserved.
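
    A percentage divergence over a stated number of aligned positions, as quoted in this abstract, is commonly an uncorrected pairwise distance (p-distance). The abstract does not say which distance measure the authors used, so the sketch below is an assumption for illustration only; the function name, the gap characters and the toy sequences are hypothetical. If the figure is uncorrected, 9.28% of 1582 positions corresponds to roughly 147 differing sites.

```python
def p_distance(seq_a, seq_b, skip_chars="-N?"):
    """Uncorrected pairwise distance between two aligned sequences.

    Returns the fraction of compared sites that differ. Sites where
    either sequence has a gap or ambiguity code are skipped
    (an assumption; other tools may treat them differently).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")

    compared = 0
    differences = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in skip_chars or b in skip_chars:
            continue  # site not comparable
        compared += 1
        if a != b:
            differences += 1

    if compared == 0:
        raise ValueError("no comparable sites")
    return differences / compared


if __name__ == "__main__":
    # Toy alignment: 1 difference over 8 comparable sites.
    print(p_distance("ACGTACGT", "ACGTACGA"))
    # Rough site count implied by the reported cox-1 figure.
    print(round(0.0928 * 1582))
```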

  7. Chloroplast and nuclear gene sequences indicate late Pennsylvanian time for the last common ancestor of extant seed plants.

    PubMed Central

    Savard, L; Li, P; Strauss, S H; Chase, M W; Michaud, M; Bousquet, J

    1994-01-01

    We have estimated the time for the last common ancestor of extant seed plants by using molecular clocks constructed from the sequences of the chloroplastic gene coding for the large subunit of ribulose-1,5-bisphosphate carboxylase/oxygenase (rbcL) and the nuclear gene coding for the small subunit of rRNA (Rrn18). Phylogenetic analyses of nucleotide sequences indicated that the earliest divergence of extant seed plants is likely represented by a split between conifer-cycad and angiosperm lineages. Relative-rate tests were used to assess homogeneity of substitution rates among lineages, and annual angiosperms were found to evolve at a faster rate than other taxa for rbcL and, thus, these sequences were excluded from construction of molecular clocks. Five distinct molecular clocks were calibrated using substitution rates for the two genes and four divergence times based on fossil and published molecular clock estimates. The five estimated times for the last common ancestor of extant seed plants were in agreement with one another, with an average of 285 million years and a range of 275-290 million years. This implies a substantially more recent ancestor of all extant seed plants than suggested by some theories of plant evolution. PMID:8197201

  8. Molecular phylogeography of the brown bear (Ursus arctos) in Northeastern Asia based on analyses of complete mitochondrial DNA sequences.

    PubMed

    Hirata, Daisuke; Mano, Tsutomu; Abramov, Alexei V; Baryshnikov, Gennady F; Kosintsev, Pavel A; Vorobiev, Alexandr A; Raichev, Evgeny G; Tsunoda, Hiroshi; Kaneko, Yayoi; Murata, Koichi; Fukui, Daisuke; Masuda, Ryuichi

    2013-07-01

    To further elucidate the migration history of the brown bears (Ursus arctos) on Hokkaido Island, Japan, we analyzed the complete mitochondrial DNA (mtDNA) sequences of 35 brown bears from Hokkaido, the southern Kuril Islands (Etorofu and Kunashiri), Sakhalin Island, and the Eurasian Continent (continental Russia, Bulgaria, and Tibet), and those of four polar bears. Based on these sequences, we reconstructed the maternal phylogeny of the brown bear and estimated divergence times to investigate the timing of brown bear migrations, especially in northeastern Eurasia. Our gene tree showed the mtDNA haplotypes of all 73 brown and polar bears to be divided into eight divergent lineages. The brown bear on Hokkaido was divided into three lineages (central, eastern, and southern). The Sakhalin brown bear grouped with eastern European and western Alaskan brown bears. Etorofu and Kunashiri brown bears were closely related to eastern Hokkaido brown bears and could have diverged from the eastern Hokkaido lineage after formation of the channel between Hokkaido and the southern Kuril Islands. Tibetan brown bears diverged early in the eastern lineage. Southern Hokkaido brown bears were closely related to North American brown bears.

  9. Evolutionary trends of European bat lyssavirus type 2 including genetic characterization of Finnish strains of human and bat origin 24 years apart.

    PubMed

    Jakava-Viljanen, Miia; Nokireki, Tiina; Sironen, Tarja; Vapalahti, Olli; Sihvonen, Liisa; Huovilainen, Anita

    2015-06-01

    Among other Lyssaviruses, Daubenton's and pond-bat-related European bat lyssavirus type 2 (EBLV-2) can cause human rabies. To investigate the diversity and evolutionary trends of EBLV-2, complete genome sequences of two Finnish isolates were analysed. One originated from a human case in 1985, and the other originated from a bat in 2009. The overall nucleotide and deduced amino acid sequence identity of the two Finnish isolates were high, as well as the similarity to fully sequenced EBLV-2 strains originating from the UK and the Netherlands. In phylogenetic analysis, the EBLV-2 strains formed a monophyletic group that was separate from other bat-type lyssaviruses, with significant support. EBLV-2 shared the most recent common ancestry with Bokeloh bat lyssavirus (BBLV) and Khujan virus (KHUV). EBLV-2 showed limited diversity compared to RABV and appears to be well adapted to its host bat species. The slow tempo of viral evolution was evident in the estimations of divergence times for EBLV-2: the current diversity was estimated to have built up during the last 2000 years, and EBLV-2 diverged from KHUV about 8000 years ago. In a phylogenetic tree of partial N gene sequences, the Finnish EBLV-2 strains clustered with strains from Central Europe, supporting the hypothesis that EBLV-2 circulating in Finland might have a Central European origin. The Finnish EBLV-2 strains and a Swiss strain were estimated to have diverged from other EBLV-2 strains during the last 1000 years, and the two Finnish strains appear to have evolved from a common ancestor during the last 200 years.

  10. Next-generation sequencing of the Trichinella murrelli mitochondrial genome allows comprehensive comparison of its divergence from the principal agent of human trichinellosis, Trichinella spiralis.

    PubMed

    Webb, Kristen M; Rosenthal, Benjamin M

    2011-01-01

    The mitochondrial genome's non-recombinant mode of inheritance and relatively rapid rate of evolution has promoted its use as a marker for studying the biogeographic history and evolutionary interrelationships among many metazoan species. A modest portion of the mitochondrial genome has been defined for 12 species and genotypes of parasites in the genus Trichinella, but its adequacy in representing the mitochondrial genome as a whole remains unclear, as the complete coding sequence has been characterized only for Trichinella spiralis. Here, we sought to comprehensively describe the extent and nature of divergence between the mitochondrial genomes of T. spiralis (which poses the most appreciable zoonotic risk owing to its capacity to establish persistent infections in domestic pigs) and Trichinella murrelli (which is the most prevalent species in North American wildlife hosts, but which poses relatively little risk to the safety of pork). Next generation sequencing methodologies and scaffold and de novo assembly strategies were employed. The entire protein-coding region was sequenced (13,917 bp), along with a portion of the highly repetitive non-coding region (1524 bp) of the mitochondrial genome of T. murrelli with a combined average read depth of 250 reads. The accuracy of base calling, estimated from coding region sequence was found to exceed 99.3%. Genome content and gene order was not found to be significantly different from that of T. spiralis. An overall inter-species sequence divergence of 9.5% was estimated. Significant variation was identified when the amount of variation between species at each gene is compared to the average amount of variation between species across the coding region. Next generation sequencing is a highly effective means to obtain previously unknown mitochondrial genome sequence. Particular to parasites, the extremely deep coverage achieved through this method allows for the detection of sequence heterogeneity between the multiple individuals that necessarily comprise such templates. Copyright © 2010 Elsevier B.V. All rights reserved.

  11. The origin and diversification of eukaryotes: problems with molecular phylogenetics and molecular clock estimation

    PubMed Central

    Roger, Andrew J; Hug, Laura A

    2006-01-01

    Determining the relationships among and divergence times for the major eukaryotic lineages remains one of the most important and controversial outstanding problems in evolutionary biology. The sequencing and phylogenetic analyses of ribosomal RNA (rRNA) genes led to the first nearly comprehensive phylogenies of eukaryotes in the late 1980s, and supported a view where cellular complexity was acquired during the divergence of extant unicellular eukaryote lineages. More recently, however, refinements in analytical methods coupled with the availability of many additional genes for phylogenetic analysis showed that much of the deep structure of early rRNA trees was artefactual. Recent phylogenetic analyses of multiple genes and the discovery of important molecular and ultrastructural phylogenetic characters have resolved eukaryotic diversity into six major hypothetical groups. Yet relationships among these groups remain poorly understood because of saturation of sequence changes on the billion-year time-scale, possible rapid radiations of major lineages, phylogenetic artefacts and endosymbiotic or lateral gene transfer among eukaryotes. Estimating the divergence dates between the major eukaryote lineages using molecular analyses is even more difficult than phylogenetic estimation. Error in such analyses comes from a myriad of sources including: (i) calibration fossil dates, (ii) the assumed phylogenetic tree, (iii) the nucleotide or amino acid substitution model, (iv) substitution number (branch length) estimates, (v) the model of how rates of evolution change over the tree, (vi) error inherent in the time estimates for a given model and (vii) how multiple gene data are treated. By reanalysing datasets from recently published molecular clock studies, we show that when errors from these various sources are properly accounted for, the confidence intervals on inferred dates can be very large. Furthermore, estimated dates of divergence vary hugely depending on the methods used and their assumptions. Accurate dating of divergence times among the major eukaryote lineages will require a robust tree of eukaryotes, a much richer Proterozoic fossil record of microbial eukaryotes assignable to extant groups for calibration, more sophisticated relaxed molecular clock methods and many more genes sampled from the full diversity of microbial eukaryotes. PMID:16754613
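
    The saturation problem and the dependence on the substitution model mentioned in this abstract can be illustrated with the simplest correction, the Jukes-Cantor (JC69) distance, d = -(3/4) ln(1 - 4p/3), where p is the observed proportion of differing sites. This is a generic textbook sketch, not the model used in the paper; the function name is hypothetical. As p approaches 0.75 the corrected distance grows without bound, so small errors in p translate into very large errors in the estimated number of substitutions.

```python
import math


def jc69_distance(p):
    """Jukes-Cantor corrected distance for an observed p-distance.

    Assumes equal base frequencies and equal substitution rates.
    Undefined once p reaches 0.75, the saturation point for
    four-state nucleotide data under this model.
    """
    if p < 0:
        raise ValueError("p must be non-negative")
    if p >= 0.75:
        raise ValueError("sequences are saturated under JC69 (p >= 0.75)")
    return -0.75 * math.log(1.0 - (4.0 / 3.0) * p)


if __name__ == "__main__":
    # The correction is negligible for small p and explodes near saturation.
    for p in (0.01, 0.10, 0.30, 0.60, 0.74):
        print(f"p = {p:.2f}  ->  d = {jc69_distance(p):.4f}")
```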

  12. Mechanisms of haplotype divergence at the RGA08 nucleotide-binding leucine-rich repeat gene locus in wild banana (Musa balbisiana)

    PubMed Central

    2010-01-01

    Background Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Results Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M. balbisiana haplotypes. Conclusions A large RGA08 gene cluster identified in wild banana corresponds to a highly variable genomic region between haplotypes surrounded by conserved flanking regions. High level of sequence identity (70 to 99%) of the genic and intergenic regions suggests a recent and rapid evolution of this cluster in M. balbisiana. PMID:20637079

  13. The pipid root.

    PubMed

    Bewick, Adam J; Chain, Frédéric J J; Heled, Joseph; Evans, Ben J

    2012-12-01

    The estimation of phylogenetic relationships is an essential component of understanding evolution. Accurate phylogenetic estimation is difficult, however, when internodes are short and old, when genealogical discordance is common due to large ancestral effective population sizes or ancestral population structure, and when homoplasy is prevalent. Inference of divergence times is also hampered by unknown and uneven rates of evolution, the incomplete fossil record, uncertainty in relationships between fossil and extant lineages, and uncertainty in the age of fossils. Ideally, these challenges can be overcome by developing large "phylogenomic" data sets and by analyzing them with methods that accommodate features of the evolutionary process, such as genealogical discordance, recurrent substitution, recombination, ancestral population structure, gene flow after speciation among sampled and unsampled taxa, and variation in evolutionary rates. In some phylogenetic problems, it is possible to use information that is independent of fossils, such as the geological record, to identify putative triggers for diversification whose associated estimated divergence times can then be compared a posteriori with estimated relationships and ages of fossils. The history of diversification of pipid frog genera Pipa, Hymenochirus, Silurana, and Xenopus, for instance, is characterized by many of these evolutionary and analytical challenges. These frogs diversified dozens of millions of years ago, they have a relatively rich fossil record, their distributions span continental plates with a well characterized geological record of ancient connectivity, and there is considerable disagreement across studies in estimated evolutionary relationships. We used high throughput sequencing and public databases to generate a large phylogenomic data set with which we estimated evolutionary relationships using multilocus coalescence methods. 
We collected sequence data from Pipa, Hymenochirus, Silurana, and Xenopus and the outgroup taxon Rhinophrynus dorsalis from coding sequence of 113 autosomal regions, averaging ∼300 bp in length (range: 102-1695 bp) and also a portion of the mitochondrial genome. Analysis of these data using multiple approaches recovers strong support for the ((Xenopus, Silurana)(Pipa, Hymenochirus)) topology, and geologically calibrated divergence time estimates that are consistent with estimated ages and phylogenetic affinities of many fossils. These results provide new insights into the biogeography and chronology of pipid diversification during the breakup of Gondwanaland and illustrate how phylogenomic data may be necessary to tackle tough problems in molecular systematics. [Coalescence; gene tree; high-throughput sequencing; lineage sorting; pipid; species tree; Xenopus.]

  14. Evolutionary Drivers of Diversification and Distribution of a Southern Temperate Stream Fish Assemblage: Testing the Role of Historical Isolation and Spatial Range Expansion

    PubMed Central

    Chakona, Albert; Swartz, Ernst R.; Gouws, Gavin

    2013-01-01

    This study used phylogenetic analyses of mitochondrial cytochrome b sequences to investigate genetic diversity within three broadly co-distributed freshwater fish genera (Galaxias, Pseudobarbus and Sandelia) to shed some light on the processes that promoted lineage diversification and shaped geographical distribution patterns. A total of 205 sequences of Galaxias, 177 sequences of Pseudobarbus and 98 sequences of Sandelia from 146 localities across nine river systems in the south-western Cape Floristic Region (South Africa) were used. The data were analysed using phylogenetic and haplotype network methods and divergence times for the clades retrieved were estimated using *BEAST. Nine extremely divergent (3.5–25.3%) lineages were found within Galaxias. Similarly, deep phylogeographic divergence was evident within Pseudobarbus, with four markedly distinct (3.8–10.0%) phylogroups identified. Sandelia had two deeply divergent (5.5–5.9%) lineages, but seven minor lineages with strong geographical congruence were also identified. The Miocene-Pliocene major sea-level transgression and the resultant isolation of populations in upland refugia appear to have driven widespread allopatric divergence within the three genera. Subsequent coalescence of rivers during the Pleistocene major sea-level regression as well as intermittent drainage connections during wet periods are proposed to have facilitated range expansion of lineages that currently occur across isolated river systems. The high degree of genetic differentiation recovered from the present and previous studies suggest that freshwater fish diversity within the south-western CFR may be vastly underestimated, and taxonomic revisions are required. PMID:23951050

  15. Molecular phylogenetic and dating analyses using mitochondrial DNA sequences of eyelid geckos (Squamata: Eublepharidae).

    PubMed

    Jonniaux, Pierre; Kumazawa, Yoshinori

    2008-01-15

    Mitochondrial DNA sequences of approximately 2.3 kbp including the complete NADH dehydrogenase subunit 2 gene and its flanking genes, as well as parts of 12S and 16S rRNA genes were determined from major species of the eyelid gecko family Eublepharidae sensu [Kluge, A.G. 1987. Cladistic relationships in the Gekkonoidea (Squamata, Sauria). Misc. Publ. Mus. Zool. Univ. Michigan 173, 1-54.]. In contrast to previous morphological studies, phylogenetic analyses based on these sequences supported that Eublepharidae and Gekkonidae form a sister group with Pygopodidae, raising the possibility of homoplasious character change in some key features of geckos, such as reduction of movable eyelids and innovation of climbing toe pads. The phylogenetic analyses also provided a well-resolved tree for relationships between the eublepharid species. The Bayesian estimation of divergence times without assuming the molecular clock suggested the Jurassic divergence of Eublepharidae from Gekkonidae and radiations of most eublepharid genera around the Cretaceous. These dating results appeared to be robust against some conditional changes for time estimation, such as gene regions used, taxon representation, and data partitioning. Taken together with geological evidence, these results support the vicariant divergence of Eublepharidae and Gekkonidae by the breakup of Pangea into Laurasia and Gondwanaland, and recent dispersal of two African eublepharid genera from Eurasia to Africa after these landmasses were connected in the Early Miocene.

  16. Evolution of nuclear rDNA ITS sequences in the Cladophora albida/sericea clade (Chlorophyta).

    PubMed

    Bakker, F T; Olsen, J L; Stam, W T

    1995-06-01

    Ribosomal DNA ITS sequences were compared among 13 different species and biogeographic isolates from the monophyletic "albida/sericea clade" in the green algal genus Cladophora. Six distinct ITS sequence types were found, characterized by multiple insertions and deletions and high levels of nucleotide substitution. Conserved domains within the ITS regions indicate the presence of ITS secondary structure. Low transition/transversion ratios among the six types and nearly symmetrical tree-length frequency distributions indicate some saturation and low phylogenetic signal. Although branching order among five of the six ITS sequence types could not be resolved, estimates of ITS sequence divergence as compared with 18S divergence in a subset of the taxa suggest that the origin of the different ITS types is probably in the mid-Miocene (12 Ma ago) but that biogeographic isolates within a single ITS type (including both Pacific and Atlantic representatives) have probably dispersed on a time scale of thousands rather than millions of years.
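
    The transition/transversion ratio mentioned in this abstract can be illustrated with a minimal sketch. It assumes two pre-aligned DNA sequences and counts A<->G and C<->T differences as transitions and all other base differences as transversions. The function name `count_ts_tv` and the example sequences are hypothetical and are not taken from the study.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def count_ts_tv(seq_a: str, seq_b: str) -> tuple[int, int]:
    """Count transitions and transversions between two aligned DNA sequences.

    Sites with gaps or ambiguous bases are skipped (an assumption made
    for this illustration).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to equal length")
    transitions = transversions = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == b or a not in "ACGT" or b not in "ACGT":
            continue
        # A transition stays within purines or within pyrimidines.
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1
        else:
            transversions += 1
    return transitions, transversions


# Hypothetical aligned sequences, for illustration only.
ts, tv = count_ts_tv("AGCTAGCT", "GGTTACCA")
ratio = ts / tv if tv else float("inf")
print(ts, tv, ratio)
```

    Because transitions generally accumulate faster than transversions, a ratio that falls toward the value expected for random sequence is commonly read as a sign of multiple substitutions at the same sites.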

  17. Estimation of divergence times in cnidarian evolution based on mitochondrial protein-coding genes and the fossil record.

    PubMed

    Park, Eunji; Hwang, Dae-Sik; Lee, Jae-Seong; Song, Jun-Im; Seo, Tae-Kun; Won, Yong-Jin

    2012-01-01

    The phylum Cnidaria is comprised of remarkably diverse and ecologically significant taxa, such as the reef-forming corals, and occupies a basal position in metazoan evolution. The origin of this phylum and the most recent common ancestors (MRCAs) of its modern classes remain mostly unknown, although scattered fossil evidence provides some insights on this topic. Here, we investigate the molecular divergence times of the major taxonomic groups of Cnidaria (27 Hexacorallia, 16 Octocorallia, and 5 Medusozoa) on the basis of mitochondrial DNA sequences of 13 protein-coding genes. For this analysis, the complete mitochondrial genomes of seven octocoral and two scyphozoan species were newly sequenced and combined with all available mitogenomic data from GenBank. Five reliable fossil dates were used to calibrate the Bayesian estimates of divergence times. The molecular evidence suggests that cnidarians originated 741 million years ago (Ma) (95% credible region of 686-819), and the major taxa diversified prior to the Cambrian (543 Ma). The Octocorallia and Scleractinia may have originated from radiations of survivors of the Permian-Triassic mass extinction, which matches their fossil record well. Copyright © 2011 Elsevier Inc. All rights reserved.

  18. Revealing Less Derived Nature of Cartilaginous Fish Genomes with Their Evolutionary Time Scale Inferred with Nuclear Genes

    PubMed Central

    Renz, Adina J.; Meyer, Axel; Kuraku, Shigehiro

    2013-01-01

    Cartilaginous fishes, divided into Holocephali (chimaeras) and Elasmobranchii (sharks, rays and skates), occupy a key phylogenetic position among extant vertebrates in reconstructing their evolutionary processes. Their accurate evolutionary time scale is indispensable for better understanding of the relationship between phenotypic and molecular evolution of cartilaginous fishes. However, our current knowledge on the time scale of cartilaginous fish evolution largely relies on estimates using mitochondrial DNA sequences. In this study, making the best use of the still partial, but large-scale sequencing data of cartilaginous fish species, we estimate the divergence times between the major cartilaginous fish lineages employing nuclear genes. By rigorous orthology assessment based on available genomic and transcriptomic sequence resources for cartilaginous fishes, we selected 20 protein-coding genes in the nuclear genome, spanning 2973 amino acid residues. Our analysis based on the Bayesian inference resulted in the mean divergence time of 421 Ma, the late Silurian, for the Holocephali-Elasmobranchii split, and 306 Ma, the late Carboniferous, for the split between sharks and rays/skates. By applying these results and other documented divergence times, we measured the relative evolutionary rate of the Hox A cluster sequences in the cartilaginous fish lineages, which resulted in a lower substitution rate with a factor of at least 2.4 in comparison to tetrapod lineages. The obtained time scale enables mapping phenotypic and molecular changes in a quantitative framework. It is of great interest to corroborate the less derived nature of cartilaginous fish at the molecular level as a genome-wide phenomenon. PMID:23825540

  19. Revealing less derived nature of cartilaginous fish genomes with their evolutionary time scale inferred with nuclear genes.

    PubMed

    Renz, Adina J; Meyer, Axel; Kuraku, Shigehiro

    2013-01-01

    Cartilaginous fishes, divided into Holocephali (chimaeras) and Elasmobranchii (sharks, rays and skates), occupy a key phylogenetic position among extant vertebrates in reconstructing their evolutionary processes. Their accurate evolutionary time scale is indispensable for better understanding of the relationship between phenotypic and molecular evolution of cartilaginous fishes. However, our current knowledge on the time scale of cartilaginous fish evolution largely relies on estimates using mitochondrial DNA sequences. In this study, making the best use of the still partial, but large-scale sequencing data of cartilaginous fish species, we estimate the divergence times between the major cartilaginous fish lineages employing nuclear genes. By rigorous orthology assessment based on available genomic and transcriptomic sequence resources for cartilaginous fishes, we selected 20 protein-coding genes in the nuclear genome, spanning 2973 amino acid residues. Our analysis based on the Bayesian inference resulted in the mean divergence time of 421 Ma, the late Silurian, for the Holocephali-Elasmobranchii split, and 306 Ma, the late Carboniferous, for the split between sharks and rays/skates. By applying these results and other documented divergence times, we measured the relative evolutionary rate of the Hox A cluster sequences in the cartilaginous fish lineages, which resulted in a lower substitution rate with a factor of at least 2.4 in comparison to tetrapod lineages. The obtained time scale enables mapping phenotypic and molecular changes in a quantitative framework. It is of great interest to corroborate the less derived nature of cartilaginous fish at the molecular level as a genome-wide phenomenon.

  20. MRKAd5 HIV-1 Gag/Pol/Nef Vaccine-Induced T-Cell Responses Inadequately Predict Distance of Breakthrough HIV-1 Sequences to the Vaccine or Viral Load

    PubMed Central

    Janes, Holly; Frahm, Nicole; DeCamp, Allan; Rolland, Morgane; Gabriel, Erin; Wolfson, Julian; Hertz, Tomer; Kallas, Esper; Goepfert, Paul; Friedrich, David P.; Corey, Lawrence; Mullins, James I.; McElrath, M. Juliana; Gilbert, Peter

    2012-01-01

    Background: The sieve analysis for the Step trial found evidence that breakthrough HIV-1 sequences for MRKAd5/HIV-1 Gag/Pol/Nef vaccine recipients were more divergent from the vaccine insert than placebo sequences in regions with predicted epitopes. We linked the viral sequence data with immune response and acute viral load data to explore mechanisms for and consequences of the observed sieve effect. Methods: Ninety-one male participants (37 placebo and 54 vaccine recipients) were included; viral sequences were obtained at the time of HIV-1 diagnosis. T-cell responses were measured 4 weeks post-second vaccination and at the first or second week post-diagnosis. Acute viral load was obtained at RNA-positive and antibody-negative visits. Findings: Vaccine recipients had a greater magnitude of post-infection CD8+ T cell response than placebo recipients (median 1.68% vs 1.18%; p = 0.04) and greater breadth of post-infection response (median 4.5 vs 2; p = 0.06). Viral sequences for vaccine recipients were marginally more divergent from the insert than placebo sequences in regions of Nef targeted by pre-infection immune responses (p = 0.04; Pol p = 0.13; Gag p = 0.89). Magnitude and breadth of pre-infection responses did not correlate with distance of the viral sequence to the insert (p>0.50). Acute log viral load trended lower in vaccine versus placebo recipients (estimated mean 4.7 vs 5.1) but the difference was not significant (p = 0.27). Neither was acute viral load associated with distance of the viral sequence to the insert (p>0.30). Interpretation: Despite evidence of anamnestic responses, the sieve effect was not well explained by available measures of T-cell immunogenicity. Sequence divergence from the vaccine was not significantly associated with acute viral load. While point estimates suggested weak vaccine suppression of viral load, the result was not significant and more viral load data would be needed to detect suppression. PMID:22952672

  1. Southern African ancient genomes estimate modern human divergence to 350,000 to 260,000 years ago.

    PubMed

    Schlebusch, Carina M; Malmström, Helena; Günther, Torsten; Sjödin, Per; Coutinho, Alexandra; Edlund, Hanna; Munters, Arielle R; Vicente, Mário; Steyn, Maryna; Soodyall, Himla; Lombard, Marlize; Jakobsson, Mattias

    2017-11-03

    Southern Africa is consistently placed as a potential region for the evolution of Homo sapiens. We present genome sequences, up to 13x coverage, from seven ancient individuals from KwaZulu-Natal, South Africa. The remains of three Stone Age hunter-gatherers (about 2000 years old) were genetically similar to current-day southern San groups, and those of four Iron Age farmers (300 to 500 years old) were genetically similar to present-day Bantu-language speakers. We estimate that all modern-day Khoe-San groups have been influenced by 9 to 30% genetic admixture from East Africans/Eurasians. Using traditional and new approaches, we estimate the first modern human population divergence time to between 350,000 and 260,000 years ago. This estimate increases the deepest divergence among modern humans, coinciding with anatomical developments of archaic humans into modern humans, as represented in the local fossil record. Copyright © 2017 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.

  2. Expression Divergence Is Correlated with Sequence Evolution but Not Positive Selection in Conifers.

    PubMed

    Hodgins, Kathryn A; Yeaman, Sam; Nurkowski, Kristin A; Rieseberg, Loren H; Aitken, Sally N

    2016-06-01

    The evolutionary and genomic determinants of sequence evolution in conifers are poorly understood, and previous studies have found only limited evidence for positive selection. Using RNAseq data, we compared gene expression profiles to patterns of divergence and polymorphism in 44 seedlings of lodgepole pine (Pinus contorta) and 39 seedlings of interior spruce (Picea glauca × engelmannii) to elucidate the evolutionary forces that shape their genomes and their plastic responses to abiotic stress. We found that rapidly diverging genes tend to have greater expression divergence, lower expression levels, reduced levels of synonymous site diversity, and longer proteins than slowly diverging genes. Similar patterns were identified for the untranslated regions, but with some exceptions. We found evidence that genes with low expression levels had a larger fraction of nearly neutral sites, suggesting a primary role for negative selection in determining the association between evolutionary rate and expression level. There was limited evidence for differences in the rate of positive selection among genes with divergent versus conserved expression profiles and some evidence supporting relaxed selection in genes diverging in expression between the species. Finally, we identified a small number of genes that showed evidence of site-specific positive selection using divergence data alone. However, estimates of the proportion of sites fixed by positive selection (α) were in the range of other plant species with large effective population sizes suggesting relatively high rates of adaptive divergence among conifers. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  3. Methods for the quantitative comparison of molecular estimates of clade age and the fossil record.

    PubMed

    Clarke, Julia A; Boyd, Clint A

    2015-01-01

    Approaches quantifying the relative congruence, or incongruence, of molecular divergence estimates and the fossil record have been limited. Previously proposed methods are largely node specific, assessing incongruence at particular nodes for which both fossil data and molecular divergence estimates are available. These existing metrics, and other methods that quantify incongruence across topologies including entirely extinct clades, have so far not taken into account uncertainty surrounding both the divergence estimates and the ages of fossils. They have also treated molecular divergence estimates younger than previously assessed fossil minimum estimates of clade age as if they were the same as cases in which they were older. However, these cases are not the same. Recovered divergence dates younger than compared oldest known occurrences require prior hypotheses regarding the phylogenetic position of the compared fossil record and standard assumptions about the relative timing of morphological and molecular change to be incorrect. Older molecular dates, by contrast, are consistent with an incomplete fossil record and do not require prior assessments of the fossil record to be unreliable in some way. Here, we compare previous approaches and introduce two new descriptive metrics. Both metrics explicitly incorporate information on uncertainty by utilizing the 95% confidence intervals on estimated divergence dates and data on stratigraphic uncertainty concerning the age of the compared fossils. Metric scores are maximized when these ranges are overlapping. MDI (minimum divergence incongruence) discriminates between situations where molecular estimates are younger or older than known fossils reporting both absolute fit values and a number score for incompatible nodes. DIG range (divergence implied gap range) allows quantification of the minimum increase in implied missing fossil record induced by enforcing a given set of molecular-based estimates. 
These metrics are used together to describe the relationship between time trees and a set of fossil data, which we recommend be phylogenetically vetted and referred on the basis of apomorphy. Differences from previously proposed metrics and the utility of MDI and DIG range are illustrated in three empirical case studies from angiosperms, ostracods, and birds. These case studies also illustrate the ways in which MDI and DIG range may be used to assess time trees resultant from analyses varying in calibration regime, divergence dating approach or molecular sequence data analyzed. © The Author(s) 2014. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  4. A revised timescale for human evolution based on ancient mitochondrial genomes.

    PubMed

    Fu, Qiaomei; Mittnik, Alissa; Johnson, Philip L F; Bos, Kirsten; Lari, Martina; Bollongino, Ruth; Sun, Chengkai; Giemsch, Liane; Schmitz, Ralf; Burger, Joachim; Ronchitelli, Anna Maria; Martini, Fabio; Cremonesi, Renata G; Svoboda, Jiří; Bauer, Peter; Caramelli, David; Castellano, Sergi; Reich, David; Pääbo, Svante; Krause, Johannes

    2013-04-08

    Recent analyses of de novo DNA mutations in modern humans have suggested a nuclear substitution rate that is approximately half that of previous estimates based on fossil calibration. This result has led to suggestions that major events in human evolution occurred far earlier than previously thought. Here, we use mitochondrial genome sequences from ten securely dated ancient modern humans spanning 40,000 years as calibration points for the mitochondrial clock, thus yielding a direct estimate of the mitochondrial substitution rate. Our clock yields mitochondrial divergence times that are in agreement with earlier estimates based on calibration points derived from either fossils or archaeological material. In particular, our results imply a separation of non-Africans from the most closely related sub-Saharan African mitochondrial DNAs (haplogroup L3) that occurred less than 62-95 kya. Though single loci like mitochondrial DNA (mtDNA) can only provide biased estimates of population divergence times, they can provide valid upper bounds. Our results exclude most of the older dates for African and non-African population divergences recently suggested by de novo mutation rate estimates in the nuclear genome. Copyright © 2013 Elsevier Ltd. All rights reserved.

  5. The contribution of alu elements to mutagenic DNA double-strand break repair.

    PubMed

    Morales, Maria E; White, Travis B; Streva, Vincent A; DeFreece, Cecily B; Hedges, Dale J; Deininger, Prescott L

    2015-03-01

    Alu elements make up the largest family of human mobile elements, numbering 1.1 million copies and comprising 11% of the human genome. As a consequence of evolution and genetic drift, Alu elements of various sequence divergence exist throughout the human genome. Alu/Alu recombination has been shown to cause approximately 0.5% of new human genetic diseases and contribute to extensive genomic structural variation. To begin understanding the molecular mechanisms leading to these rearrangements in mammalian cells, we constructed Alu/Alu recombination reporter cell lines containing Alu elements ranging in sequence divergence from 0%-30% that allow detection of both Alu/Alu recombination and large non-homologous end joining (NHEJ) deletions that range from 1.0 to 1.9 kb in size. Introduction of as little as 0.7% sequence divergence between Alu elements resulted in a significant reduction in recombination, which indicates even small degrees of sequence divergence reduce the efficiency of homology-directed DNA double-strand break (DSB) repair. Further reduction in recombination was observed in a sequence divergence-dependent manner for diverged Alu/Alu recombination constructs with up to 10% sequence divergence. With greater levels of sequence divergence (15%-30%), we observed a significant increase in DSB repair due to a shift from Alu/Alu recombination to variable-length NHEJ which removes sequence between the two Alu elements. This increase in NHEJ deletions depends on the presence of Alu sequence homeology (similar but not identical sequences). Analysis of recombination products revealed that Alu/Alu recombination junctions occur more frequently in the first 100 bp of the Alu element within our reporter assay, just as they do in genomic Alu/Alu recombination events. 
This is the first extensive study characterizing the influence of Alu element sequence divergence on DNA repair, which will inform predictions regarding the effect of Alu element sequence divergence on both the rate and nature of DNA repair events.

  6. Genetic evidence for contribution of human dispersal to the genetic diversity of EBA-175 in Plasmodium falciparum.

    PubMed

    Yasukochi, Yoshiki; Naka, Izumi; Patarapotikul, Jintana; Hananantachai, Hathairad; Ohashi, Jun

    2015-08-01

    The 175-kDa erythrocyte binding antigen (EBA-175) of Plasmodium falciparum plays a crucial role in merozoite invasion into human erythrocytes. EBA-175 is believed to have been under diversifying selection; however, there have been no studies investigating the effect of dispersal of humans out of Africa on the genetic variation of EBA-175 in P. falciparum. PCR direct sequencing was performed for a part of the eba-175 gene (regions II and III) using DNA samples obtained from Thai patients infected with P. falciparum. The divergence times for the P. falciparum eba-175 alleles were estimated assuming that P. falciparum/Plasmodium reichenowi divergence occurred 6 million years ago (MYA). To examine the possibility of diversifying selection, nonsynonymous and synonymous substitution rates for Plasmodium species were also estimated. A total of 32 eba-175 alleles were identified from 131 Thai P. falciparum isolates. Their estimated divergence time was 0.13-0.14 MYA, before the exodus of humans from Africa. A phylogenetic tree for a large sequence dataset of P. falciparum eba-175 alleles from across the world showed the presence of a basal Asian-specific cluster for all P. falciparum sequences. Markedly more nonsynonymous than synonymous substitutions were also detected in region II in P. falciparum, but not within Plasmodium species parasitizing African apes, suggesting that diversifying selection has acted specifically on P. falciparum eba-175. Plasmodium falciparum eba-175 genetic diversity appeared to increase following the exodus of Asian ancestors from Africa. Diversifying selection may have played an important role in the diversification of eba-175 allelic lineages. The present results suggest that the dispersals of humans out of Africa significantly influenced the molecular evolution of P. falciparum EBA-175.
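
    The abstract dates the eba-175 alleles against an assumed 6 MYA split between P. falciparum and P. reichenowi. A minimal sketch of that kind of calibration follows, assuming a strict molecular clock in which pairwise distance equals 2 x rate x time. The abstract does not state the exact procedure the authors used, so the function name, the clock assumption, and all distance values here are hypothetical.

```python
def calibrated_divergence_time(
    d_target: float,
    d_calibration: float,
    t_calibration_mya: float,
) -> float:
    """Scale a pairwise distance to time using one calibration split.

    Assumes a strict clock: distance = 2 * rate * time, so the rate is
    inferred from the calibration pair and applied to the target pair.
    """
    if d_calibration <= 0 or t_calibration_mya <= 0:
        raise ValueError("calibration distance and time must be positive")
    rate = d_calibration / (2.0 * t_calibration_mya)  # per site per My per lineage
    return d_target / (2.0 * rate)


# Made-up distances for illustration; only the 6 MYA calibration date
# comes from the abstract.
t = calibrated_divergence_time(d_target=0.002, d_calibration=0.04,
                               t_calibration_mya=6.0)
print(round(t, 3))
```

    Under these assumptions the estimate reduces to the ratio of the two distances multiplied by the calibration date, so any error in the assumed 6 MYA split carries through proportionally.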

  7. Phylogeny and divergence of the pinnipeds (Carnivora: Mammalia) assessed using a multigene dataset

    PubMed Central

    Higdon, Jeff W; Bininda-Emonds, Olaf RP; Beck, Robin MD; Ferguson, Steven H

    2007-01-01

    Background: Phylogenetic comparative methods are often improved by complete phylogenies with meaningful branch lengths (e.g., divergence dates). This study presents a dated molecular supertree for all 34 world pinniped species derived from a weighted matrix representation with parsimony (MRP) supertree analysis of 50 gene trees, each determined under a maximum likelihood (ML) framework. Divergence times were determined by mapping the same sequence data (plus two additional genes) on to the supertree topology and calibrating the ML branch lengths against a range of fossil calibrations. We assessed the sensitivity of our supertree topology in two ways: 1) a second supertree with all mtDNA genes combined into a single source tree, and 2) likelihood-based supermatrix analyses. Divergence dates were also calculated using a Bayesian relaxed molecular clock with rate autocorrelation to test the sensitivity of our supertree results further. Results: The resulting phylogenies all agreed broadly with recent molecular studies, in particular supporting the monophyly of Phocidae, Otariidae, and the two phocid subfamilies, as well as an Odobenidae + Otariidae sister relationship; areas of disagreement were limited to four more poorly supported regions. Neither the supertree nor supermatrix analyses supported the monophyly of the two traditional otariid subfamilies, supporting suggestions for the need for taxonomic revision in this group. Phocid relationships were similar to other recent studies and deeper branches were generally well-resolved. Halichoerus grypus was nested within a paraphyletic Pusa, although relationships within Phocina tend to be poorly supported. Divergence date estimates for the supertree were in good agreement with other studies and the available fossil record; however, the Bayesian relaxed molecular clock divergence date estimates were significantly older. 
Conclusion: Our results join other recent studies and highlight the need for a re-evaluation of pinniped taxonomy, especially as regards the subfamilial classification of otariids and the generic nomenclature of Phocina. Even with the recent publication of new sequence data, the available genetic sequence information for several species, particularly those in Arctocephalus, remains very limited, especially for nuclear markers. However, resolution of parts of the tree will probably remain difficult, even with additional data, due to apparent rapid radiations. Our study addresses the lack of a recent pinniped phylogeny that includes all species and robust divergence dates for all nodes, and will therefore prove indispensable to comparative and macroevolutionary studies of this group of carnivores. PMID:17996107

  8. LinkFinder: An expert system that constructs phylogenic trees

    NASA Technical Reports Server (NTRS)

    Inglehart, James; Nelson, Peter C.

    1991-01-01

    An expert system has been developed using the C Language Integrated Production System (CLIPS) that automates the process of constructing DNA sequence based phylogenies (trees or lineages) that indicate evolutionary relationships. LinkFinder takes as input homologous DNA sequences from distinct individual organisms. It measures variations between the sequences, selects appropriate proportionality constants, and estimates the time that has passed since each pair of organisms diverged from a common ancestor. It then designs and outputs a phylogenic map summarizing these results. LinkFinder can find genetic relationships between different species, and between individuals of the same species, including humans. It was designed to take advantage of the vast amount of sequence data being produced by the Genome Project, and should be of value to evolution theorists who wish to utilize this data, but who have no formal training in molecular genetics. Evolutionary theory holds that distinct organisms carrying a common gene inherited that gene from a common ancestor. Homologous genes vary from individual to individual and species to species, and the amount of variation is now believed to be directly proportional to the time that has passed since divergence from a common ancestor. The proportionality constant must be determined experimentally; it varies considerably with the types of organisms and DNA molecules under study. Given an appropriate constant, and the variation between two DNA sequences, a simple linear equation gives the divergence time.
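
    The "simple linear equation" described at the end of this abstract can be sketched as follows. This is an illustration only, not LinkFinder's CLIPS implementation: it assumes the variation is measured as the proportion of differing sites between two aligned sequences, and that variation = constant x time. The function names, the example sequences, and the constant 0.02 are hypothetical.

```python
def p_distance(seq_a: str, seq_b: str) -> float:
    """Proportion of aligned sites that differ; gapped sites are skipped."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to equal length")
    compared = differing = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue
        compared += 1
        if a != b:
            differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared


def divergence_time(variation: float, constant: float) -> float:
    """Invert variation = constant * time to obtain the divergence time.

    The time unit is whatever unit the proportionality constant is
    expressed in (e.g. variation per million years).
    """
    if constant <= 0:
        raise ValueError("proportionality constant must be positive")
    return variation / constant


# Hypothetical sequences and constant, for illustration only.
d = p_distance("ACGTACGTAC", "ACGTTCGTAA")
print(d, divergence_time(d, constant=0.02))
```

    As the abstract notes, the constant varies with the organisms and DNA molecules under study, so the result is only as reliable as the constant supplied.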

  9. Hemocyanin gene family evolution in spiders (Araneae), with implications for phylogenetic relationships and divergence times in the infraorder Mygalomorphae.

    PubMed

    Starrett, James; Hedin, Marshal; Ayoub, Nadia; Hayashi, Cheryl Y

    2013-07-25

    Hemocyanins are multimeric copper-containing hemolymph proteins involved in oxygen binding and transport in all major arthropod lineages. Most arachnids have seven primary subunits (encoded by paralogous genes a-g), which combine to form a 24-mer (4×6) quaternary structure. Within some spider lineages, however, hemocyanin evolution has been a dynamic process with extensive paralog duplication and loss. We have obtained hemocyanin gene sequences from numerous representatives of the spider infraorders Mygalomorphae and Araneomorphae in order to infer the evolution of the hemocyanin gene family and estimate spider relationships using these conserved loci. Our hemocyanin gene tree is largely consistent with the previous hypotheses of paralog relationships based on immunological studies, but reveals some discrepancies in which paralog types have been lost or duplicated in specific spider lineages. Analyses of concatenated hemocyanin sequences resolved deep nodes in the spider phylogeny and recovered a number of clades that are supported by other molecular studies, particularly for mygalomorph taxa. The concatenated data set is also used to estimate dates of higher-level spider divergences and suggests that the diversification of extant mygalomorphs preceded that of extant araneomorphs. Spiders are diverse in behavior and respiratory morphology, and our results are beneficial for comparative analyses of spider respiration. Lastly, the conserved hemocyanin sequences allow for the inference of spider relationships and ancient divergence dates. Copyright © 2013 Elsevier B.V. All rights reserved.

  10. Amazonian phylogeography: mtDNA sequence variation in arboreal echimyid rodents (Caviomorpha).

    PubMed

    da Silva, M N; Patton, J L

    1993-09-01

    Patterns of evolutionary relationships among haplotype clades of sequences of the mitochondrial cytochrome b gene are examined for five genera of arboreal rodents of the Caviomorph family Echimyidae from the Amazon Basin. Data are available for 798 bp of sequence from a total of 24 separate localities in Peru, Venezuela, Bolivia, and Brazil for Mesomys, Isothrix, Makalata, Dactylomys, and Echimys. Sequence divergence, corrected for multiple hits, is extensive, ranging from less than 1% for comparisons within populations to over 20% among geographic units within genera. Both the degree of differentiation and the geographic patterning of the variation suggest that more than one species composes the Amazonian distribution of the currently recognized Mesomys hispidus, Isothrix bistriata, Makalata didelphoides, and Dactylomys dactylinus. There is general concordance in the geographic range of haplotype clades for each of these taxa, and the overall level of differentiation within them is largely equivalent. These observations suggest that a common vicariant history underlies the respective diversification of each genus. However, estimated times of divergence based on the rate of third position transversion substitutions for the major clades within each genus typically range above 1 million years. Thus, allopatric isolation precipitating divergence must have been considerably earlier than the late Pleistocene forest fragmentation events commonly invoked for Amazonian biota.
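
    The abstract does not say which correction for multiple hits was applied. As an illustration only, the sketch below uses the Jukes-Cantor (JC69) formula, the simplest such correction, to show how an observed proportion of differing sites is converted into an estimated number of substitutions per site. The function name is hypothetical.

```python
import math


def jukes_cantor_distance(p_observed: float) -> float:
    """Correct an observed proportion of differing sites for multiple hits.

    Uses the Jukes-Cantor formula d = -(3/4) * ln(1 - (4/3) * p), which
    assumes equal base frequencies and equal rates for all substitutions.
    This model is an assumption made for illustration; the study above may
    have used a different correction.
    """
    if not 0.0 <= p_observed < 0.75:
        raise ValueError("observed proportion must lie in [0, 0.75)")
    return -0.75 * math.log(1.0 - 4.0 * p_observed / 3.0)


# The correction is negligible for small differences and grows with p.
low = jukes_cantor_distance(0.01)
high = jukes_cantor_distance(0.20)
```

    At an observed difference of 20%, the corrected distance is noticeably larger than the raw value, whereas at 1% the two are nearly identical, which is why the correction matters most for the deeper comparisons.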

  11. Mitochondrial divergence between slow- and fast-aging garter snakes.

    PubMed

    Schwartz, Tonia S; Arendsee, Zebulun W; Bronikowski, Anne M

    2015-11-01

    Mitochondrial function has long been hypothesized to be intimately involved in aging processes--either directly through declining efficiency of mitochondrial respiration and ATP production with advancing age, or indirectly, e.g., through increased mitochondrial production of damaging free radicals with age. Yet we lack a comprehensive understanding of the evolution of mitochondrial genotypes and phenotypes across diverse animal models, particularly in species that have extremely labile physiology. Here, we measure mitochondrial genome-types and transcription in ecotypes of garter snakes (Thamnophis elegans) that are adapted to disparate habitats and have diverged in aging rates and lifespans despite residing in close proximity. Using two RNA-seq datasets, we (1) reconstruct the garter snake mitochondrial genome sequence and bioinformatically identify regulatory elements, (2) test for divergence of mitochondrial gene expression between the ecotypes and in response to heat stress, and (3) test for sequence divergence in mitochondrial protein-coding regions in these slow-aging (SA) and fast-aging (FA) naturally occurring ecotypes. At the nucleotide sequence level, we confirmed two (duplicated) mitochondrial control regions one of which contains a glucocorticoid response element (GRE). Gene expression of protein-coding genes was higher in FA snakes relative to SA snakes for most genes, but was neither affected by heat stress nor an interaction between heat stress and ecotype. SA and FA ecotypes had unique mitochondrial haplotypes with amino acid substitutions in both CYTB and ND5. The CYTB amino acid change (Isoleucine → Threonine) was highly segregated between ecotypes. This divergence of mitochondrial haplotypes between SA and FA snakes contrasts with nuclear gene-flow estimates, but correlates with previously reported divergence in mitochondrial function (mitochondrial oxygen consumption, ATP production, and reactive oxygen species consequences). 
    Copyright © 2015 Elsevier Inc. All rights reserved.

  12. Genetic identification and evolutionary trends of the seagrass Halophila nipponica in temperate coastal waters of Korea.

    PubMed

    Kim, Young Kyun; Kim, Seung Hyeon; Yi, Joo Mi; Kang, Chang-Keun; Short, Frederick; Lee, Kun-Seop

    2017-01-01

    Although seagrass species in the genus Halophila are generally distributed in tropical or subtropical regions, H. nipponica has been reported to occur in temperate coastal waters of the northwestern Pacific. Because H. nipponica occurs only in the warm temperate areas influenced by the Kuroshio Current and shows a tropical seasonal growth pattern, such as severely restricted growth in low water temperatures, it was hypothesized that this temperate Halophila species diverged from tropical species in the relatively recent evolutionary past. We used a phylogenetic analysis of internal transcribed spacer (ITS) regions to examine the genetic variability and evolutionary trend of H. nipponica. ITS sequences of H. nipponica from various locations in Korea and Japan were identical or showed very low sequence divergence (less than 3-base pair, bp, difference), confirming that H. nipponica from Japan and Korea are the same species. Halophila species in the section Halophila, which have simple phyllotaxy (a pair of petiolate leaves at the rhizome node), were separated into five well-supported clades by maximum parsimony analysis. H. nipponica grouped with H. okinawensis and H. gaudichaudii from the subtropical regions in the same clade, the latter two species having quite low ITS sequence divergence from H. nipponica (7-15-bp). H. nipponica in Clade I diverged 2.95 ± 1.08 million years ago from species in Clade II, which includes H. ovalis. According to geographical distribution and genetic similarity, H. nipponica appears to have diverged from a tropical species like H. ovalis and adapted to warm temperate environments. The results of divergence time estimates suggest that the temperate H. nipponica is an older species than the subtropical H. okinawensis and H. gaudichaudii and they may have different evolutionary histories.

  13. Genetic identification and evolutionary trends of the seagrass Halophila nipponica in temperate coastal waters of Korea

    PubMed Central

    Kim, Young Kyun; Kim, Seung Hyeon; Yi, Joo Mi; Kang, Chang-Keun; Short, Frederick; Lee, Kun-Seop

    2017-01-01

    Although seagrass species in the genus Halophila are generally distributed in tropical or subtropical regions, H. nipponica has been reported to occur in temperate coastal waters of the northwestern Pacific. Because H. nipponica occurs only in the warm temperate areas influenced by the Kuroshio Current and shows a tropical seasonal growth pattern, such as severely restricted growth in low water temperatures, it was hypothesized that this temperate Halophila species diverged from tropical species in the relatively recent evolutionary past. We used a phylogenetic analysis of internal transcribed spacer (ITS) regions to examine the genetic variability and evolutionary trend of H. nipponica. ITS sequences of H. nipponica from various locations in Korea and Japan were identical or showed very low sequence divergence (less than 3-base pair, bp, difference), confirming that H. nipponica from Japan and Korea are the same species. Halophila species in the section Halophila, which have simple phyllotaxy (a pair of petiolate leaves at the rhizome node), were separated into five well-supported clades by maximum parsimony analysis. H. nipponica grouped with H. okinawensis and H. gaudichaudii from the subtropical regions in the same clade, the latter two species having quite low ITS sequence divergence from H. nipponica (7–15-bp). H. nipponica in Clade I diverged 2.95 ± 1.08 million years ago from species in Clade II, which includes H. ovalis. According to geographical distribution and genetic similarity, H. nipponica appears to have diverged from a tropical species like H. ovalis and adapted to warm temperate environments. The results of divergence time estimates suggest that the temperate H. nipponica is an older species than the subtropical H. okinawensis and H. gaudichaudii and they may have different evolutionary histories. PMID:28505209

  14. A Coalescent-Based Estimator of Admixture From DNA Sequences

    PubMed Central

    Wang, Jinliang

    2006-01-01

    A variety of estimators have been developed to use genetic marker information in inferring the admixture proportions (parental contributions) of a hybrid population. The majority of these estimators used allele frequency data, ignored molecular information that is available in markers such as microsatellites and DNA sequences, and assumed that mutations are absent since the admixture event. As a result, these estimators may fail to deliver an estimate or give rather poor estimates when admixture is ancient and thus mutations are not negligible. A previous molecular estimator based its inference of admixture proportions on the average coalescent times between pairs of genes taken from within and between populations. In this article I propose an estimator that considers the entire genealogy of all of the sampled genes and infers admixture proportions from the numbers of segregating sites in DNA sequence samples. By considering the genealogy of all sequences rather than pairs of sequences, this new estimator also allows the joint estimation of other interesting parameters in the admixture model, such as admixture time, divergence time, population size, and mutation rate. Comparative analyses of simulated data indicate that the new coalescent estimator generally yields better estimates of admixture proportions than the previous molecular estimator, especially when the parental populations are not highly differentiated. It also gives reasonably accurate estimates of other admixture parameters. A human mtDNA sequence data set was analyzed to demonstrate the method, and the analysis results are discussed and compared with those from previous studies. PMID:16624918

  15. Phylogenetic relationships and divergence dates of softshell turtles (Testudines: Trionychidae) inferred from complete mitochondrial genomes.

    PubMed

    Li, H; Liu, J; Xiong, L; Zhang, H; Zhou, H; Yin, H; Jing, W; Li, J; Shi, Q; Wang, Y; Liu, J; Nie, L

    2017-05-01

    The softshell turtles (Trionychidae) are one of the most widely distributed reptile groups in the world, and fossils have been found on all continents except Antarctica. The phylogenetic relationships among members of this group have been previously studied; however, disagreements remain regarding its taxonomy, and its phylogeography and divergence times are still poorly understood. Here, we present a comprehensive mitogenomic study of softshell turtles. We sequenced the complete mitochondrial genomes of 10 softshell turtles, in addition to the GenBank sequences of Dogania subplana, Lissemys punctata, and Trionyx triunguis, which together cover all extant genera within Trionychidae except for Cyclanorbis and Cycloderma. These data were combined with other mitogenomes of turtles for phylogenetic analyses. Divergence time calibration and ancestral reconstruction were calculated using BEAST and RASP software, respectively. Our phylogenetic analyses indicate that Trionychidae is the sister taxon of Carettochelyidae, and support the monophyly of Trionychinae and Cyclanorbinae, which is consistent with morphological data and molecular analysis. Our phylogenetic analyses have established a sister taxon relationship between the Asian Rafetus and the Asian Palea + Pelodiscus + Dogania + Nilssonia + Amyda, whereas a previous study grouped the Asian Rafetus with the American Apalone. The results of divergence time estimates and area ancestral reconstruction show that extant Trionychidae originated in Asia around 108 million years ago (Ma), and radiations mainly occurred during two warm periods, namely Late Cretaceous-Early Eocene and Oligocene. By combining the estimated divergence time and the reconstructed ancestral area of softshell turtles, we determined that the dispersal of softshell turtles out of Asia may have taken three routes. Furthermore, the times of dispersal seem to be in agreement with the time of the India-Asia collision and opening of the Bering Strait, which provides evidence for the accuracy of our estimation of divergence time. Overall, the mitogenomes of this group were used to explore the origin and dispersal route of Trionychidae and have provided new insights on the evolution of this group. Journal of Evolutionary Biology © 2017 European Society For Evolutionary Biology.

  16. Plastome sequences and exploration of tree-space help to resolve the phylogeny of riceflowers (Thymelaeaceae: Pimelea).

    PubMed

    Foster, Charles S P; Henwood, Murray J; Ho, Simon Y W

    2018-05-25

    Data sets comprising small numbers of genetic markers are not always able to resolve phylogenetic relationships. This has frequently been the case in molecular systematic studies of plants, with many analyses being based on sequence data from only two or three chloroplast genes. An example of this comes from the riceflowers Pimelea Banks & Sol. ex Gaertn. (Thymelaeaceae), a large genus of flowering plants predominantly distributed in Australia. Despite the considerable morphological variation in the genus, low sequence divergence in chloroplast markers has led to the phylogeny of Pimelea remaining largely uncertain. In this study, we resolve the backbone of the phylogeny of Pimelea in comprehensive Bayesian and maximum-likelihood analyses of plastome sequences from 41 taxa. However, some relationships received only moderate to poor support, and the Pimelea clade contained extremely short internal branches. By using topology-clustering analyses, we demonstrate that conflicting phylogenetic signals can be found across the trees estimated from individual chloroplast protein-coding genes. A relaxed-clock dating analysis reveals that Pimelea arose in the mid-Miocene, with most divergences within the genus occurring during a subsequent rapid diversification. Our new phylogenetic estimate offers better resolution and is more strongly supported than previous estimates, providing a platform for future taxonomic revisions of both Pimelea and the broader subfamily. Our study has demonstrated the substantial improvements in phylogenetic resolution that can be achieved using plastome-scale data sets in plant molecular systematics. Copyright © 2018 Elsevier Inc. All rights reserved.

  17. Reevaluation of a classic phylogeographic barrier: new techniques reveal the influence of microgeographic climate variation on population divergence

    PubMed Central

    Soto-Centeno, J Angel; Barrow, Lisa N; Allen, Julie M; Reed, David L

    2013-01-01

    We evaluated the mtDNA divergence and relationships within Geomys pinetis to assess the status of formerly recognized Geomys taxa. Additionally, we integrated new hypothesis-based tests in ecological niche models (ENM) to provide greater insight into causes for divergence and potential barriers to gene flow in Southeastern United States (Alabama, Florida, and Georgia). Our DNA sequence dataset confirmed and strongly supported two distinct lineages within G. pinetis occurring east and west of the ARD. Divergence date estimates showed that eastern and western lineages diverged about 1.37 Ma (1.9 Ma–830 ka). Predicted distributions from ENMs were consistent with molecular data and defined each population east and west of the ARD with little overlap. Niche identity and background similarity tests were statistically significant suggesting that ENMs from eastern and western lineages are not identical or more similar than expected based on random localities drawn from the environmental background. ENMs also support the hypothesis that the ARD represents a ribbon of unsuitable climate between more suitable areas where these populations are distributed. The estimated age of divergence between eastern and western lineages of G. pinetis suggests that the divergence was driven by climatic conditions during Pleistocene glacial–interglacial cycles. The ARD at the contact zone of eastern and western lineages of G. pinetis forms a significant barrier promoting microgeographic isolation that helps maintain ecological and genetic divergence. PMID:23789071

  18. The Divergence of Neandertal and Modern Human Y Chromosomes

    PubMed Central

    Mendez, Fernando L.; Poznik, G. David; Castellano, Sergi; Bustamante, Carlos D.

    2016-01-01

    Sequencing the genomes of extinct hominids has reshaped our understanding of modern human origins. Here, we analyze ∼120 kb of exome-captured Y-chromosome DNA from a Neandertal individual from El Sidrón, Spain. We investigate its divergence from orthologous chimpanzee and modern human sequences and find strong support for a model that places the Neandertal lineage as an outgroup to modern human Y chromosomes—including A00, the highly divergent basal haplogroup. We estimate that the time to the most recent common ancestor (TMRCA) of Neandertal and modern human Y chromosomes is ∼588 thousand years ago (kya) (95% confidence interval [CI]: 447–806 kya). This is ∼2.1 (95% CI: 1.7–2.9) times longer than the TMRCA of A00 and other extant modern human Y-chromosome lineages. This estimate suggests that the Y-chromosome divergence mirrors the population divergence of Neandertals and modern human ancestors, and it refutes alternative scenarios of a relatively recent or super-archaic origin of Neandertal Y chromosomes. The fact that the Neandertal Y we describe has never been observed in modern humans suggests that the lineage is most likely extinct. We identify protein-coding differences between Neandertal and modern human Y chromosomes, including potentially damaging changes to PCDH11Y, TMSB4Y, USP9Y, and KDM5D. Three of these changes are missense mutations in genes that produce male-specific minor histocompatibility (H-Y) antigens. Antigens derived from KDM5D, for example, are thought to elicit a maternal immune response during gestation. It is possible that incompatibilities at one or more of these genes played a role in the reproductive isolation of the two groups. PMID:27058445

  19. The Divergence of Neandertal and Modern Human Y Chromosomes.

    PubMed

    Mendez, Fernando L; Poznik, G David; Castellano, Sergi; Bustamante, Carlos D

    2016-04-07

    Sequencing the genomes of extinct hominids has reshaped our understanding of modern human origins. Here, we analyze ∼120 kb of exome-captured Y-chromosome DNA from a Neandertal individual from El Sidrón, Spain. We investigate its divergence from orthologous chimpanzee and modern human sequences and find strong support for a model that places the Neandertal lineage as an outgroup to modern human Y chromosomes-including A00, the highly divergent basal haplogroup. We estimate that the time to the most recent common ancestor (TMRCA) of Neandertal and modern human Y chromosomes is ∼588 thousand years ago (kya) (95% confidence interval [CI]: 447-806 kya). This is ∼2.1 (95% CI: 1.7-2.9) times longer than the TMRCA of A00 and other extant modern human Y-chromosome lineages. This estimate suggests that the Y-chromosome divergence mirrors the population divergence of Neandertals and modern human ancestors, and it refutes alternative scenarios of a relatively recent or super-archaic origin of Neandertal Y chromosomes. The fact that the Neandertal Y we describe has never been observed in modern humans suggests that the lineage is most likely extinct. We identify protein-coding differences between Neandertal and modern human Y chromosomes, including potentially damaging changes to PCDH11Y, TMSB4Y, USP9Y, and KDM5D. Three of these changes are missense mutations in genes that produce male-specific minor histocompatibility (H-Y) antigens. Antigens derived from KDM5D, for example, are thought to elicit a maternal immune response during gestation. It is possible that incompatibilities at one or more of these genes played a role in the reproductive isolation of the two groups. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.

  20. Prediction of industrial tomato hybrids from agronomic traits and ISSR molecular markers.

    PubMed

    Figueiredo, A S T; Resende, J T V; Faria, M V; Da-Silva, P R; Fagundes, B S; Morales, R G F

    2016-05-13

    Heterosis is a highly relevant phenomenon in plant breeding. This condition is usually established in hybrids derived from crosses of highly divergent parents. The success of a breeder in obtaining heterosis is directly related to the correct identification of genetically contrasting parents. Currently, the diallel cross is the most commonly used methodology to detect contrasting parents; however, it is a time- and cost-consuming procedure. Therefore, new tools capable of performing this task quickly and accurately are required. Thus, the purpose of this study was to estimate the genetic divergence in industrial tomato lines, based on agronomic traits, and to compare with estimates obtained using inter-simple sequence repeat (ISSR) molecular markers. The genetic divergence among 10 industrial tomato lines, based on nine morphological characters and 12 ISSR primers was analyzed. For data analysis, Pearson and Spearman correlation coefficients were calculated between the genetic dissimilarity measures estimated by Mahalanobis distance and Jaccard's coefficient of genetic dissimilarity from the heterosis estimates, combining ability, and means of important traits of industrial tomato. The ISSR markers efficiently detected contrasting parents for hybrid production in tomato. Parent RVTD-08 was indicated as the most divergent, both by molecular and morphological markers, that positively contributed to increased heterosis and by the specific combining ability in the crosses in which it participated. The genetic dissimilarity estimated by ISSR molecular markers aided the identification of the best hybrids of the experiment in terms of total fruit yield, pulp yield, and soluble solids content.
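
    Jaccard's coefficient of dissimilarity, as applied to dominant markers such as ISSR bands, can be sketched as follows. This is a minimal illustration and not the authors' analysis pipeline: the line names and band profiles are invented, and each profile is assumed to be a list of 0/1 scores (band absent/present) over the same set of loci.

```python
from itertools import combinations
from typing import Dict, Sequence, Tuple


def jaccard_dissimilarity(a: Sequence[int], b: Sequence[int]) -> float:
    """1 - (bands shared by both) / (bands present in at least one).

    Positions where both profiles score 0 are ignored, so shared absences
    do not count as similarity.
    """
    if len(a) != len(b):
        raise ValueError("profiles must score the same loci")
    shared = sum(1 for x, y in zip(a, b) if x == 1 and y == 1)
    either = sum(1 for x, y in zip(a, b) if x == 1 or y == 1)
    if either == 0:
        raise ValueError("no bands present in either profile")
    return 1.0 - shared / either


def pairwise_dissimilarities(
    profiles: Dict[str, Sequence[int]]
) -> Dict[Tuple[str, str], float]:
    """Dissimilarity for every unordered pair of lines, keyed by sorted names."""
    return {
        (m, n): jaccard_dissimilarity(profiles[m], profiles[n])
        for m, n in combinations(sorted(profiles), 2)
    }


# Hypothetical band profiles for three lines scored at five loci.
profiles = {
    "line_A": [1, 1, 0, 1, 0],
    "line_B": [1, 0, 0, 1, 1],
    "line_C": [1, 1, 0, 1, 0],
}
distances = pairwise_dissimilarities(profiles)
```

    Pairs with the largest values in such a matrix would be the candidates for genetically contrasting parents; the study then compares these values with heterosis and combining-ability estimates through correlation coefficients.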

  1. Detection and correction of false segmental duplications caused by genome mis-assembly

    PubMed Central

    2010-01-01

    Diploid genomes with divergent chromosomes present special problems for assembly software as two copies of especially polymorphic regions may be mistakenly constructed, creating the appearance of a recent segmental duplication. We developed a method for identifying such false duplications and applied it to four vertebrate genomes. For each genome, we corrected mis-assemblies, improved estimates of the amount of duplicated sequence, and recovered polymorphisms between the sequenced chromosomes. PMID:20219098

  2. Approximate likelihood calculation on a phylogeny for Bayesian estimation of divergence times.

    PubMed

    dos Reis, Mario; Yang, Ziheng

    2011-07-01

    The molecular clock provides a powerful way to estimate species divergence times. If information on some species divergence times is available from the fossil or geological record, it can be used to calibrate a phylogeny and estimate divergence times for all nodes in the tree. The Bayesian method provides a natural framework to incorporate different sources of information concerning divergence times, such as information in the fossil and molecular data. Current models of sequence evolution are intractable in a Bayesian setting, and Markov chain Monte Carlo (MCMC) is used to generate the posterior distribution of divergence times and evolutionary rates. This method is computationally expensive, as it involves the repeated calculation of the likelihood function. Here, we explore the use of Taylor expansion to approximate the likelihood during MCMC iteration. The approximation is much faster than conventional likelihood calculation. However, the approximation is expected to be poor when the proposed parameters are far from the likelihood peak. We explore the use of parameter transforms (square root, logarithm, and arcsine) to improve the approximation to the likelihood curve. We found that the new methods, particularly the arcsine-based transform, provided very good approximations under relaxed clock models and also under the global clock model when the global clock is not seriously violated. The approximation is poorer for analysis under the global clock when the global clock is seriously wrong and should thus not be used. The results suggest that the approximate method may be useful for Bayesian dating analysis using large data sets.
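
    The idea of replacing the exact likelihood with a Taylor expansion around its peak can be illustrated on a toy problem. The sketch below is not the authors' implementation: it assumes a single branch length under the Jukes-Cantor model, uses a second-order expansion with a finite-difference curvature, and omits the parameter transforms discussed in the abstract. All function names and the example counts are hypothetical.

```python
import math


def prob_site_differs(d: float) -> float:
    """Jukes-Cantor probability that a site differs at distance d."""
    return 0.75 * (1.0 - math.exp(-4.0 * d / 3.0))


def log_likelihood(d: float, n_sites: int, n_diff: int) -> float:
    """Exact log-likelihood of distance d given n_diff differences in n_sites."""
    p = prob_site_differs(d)
    return n_diff * math.log(p) + (n_sites - n_diff) * math.log(1.0 - p)


def mle_distance(n_sites: int, n_diff: int) -> float:
    """Distance at the likelihood peak (requires 0 < n_diff < 0.75 * n_sites)."""
    return -0.75 * math.log(1.0 - 4.0 * n_diff / (3.0 * n_sites))


def taylor_log_likelihood(d: float, n_sites: int, n_diff: int, h: float = 1e-4) -> float:
    """Second-order Taylor approximation of the log-likelihood around the peak.

    The first derivative vanishes at the peak, so only the curvature term
    remains; the curvature is estimated by a central finite difference.
    """
    d_hat = mle_distance(n_sites, n_diff)
    peak = log_likelihood(d_hat, n_sites, n_diff)
    curvature = (
        log_likelihood(d_hat + h, n_sites, n_diff)
        - 2.0 * peak
        + log_likelihood(d_hat - h, n_sites, n_diff)
    ) / (h * h)
    return peak + 0.5 * curvature * (d - d_hat) ** 2


# Hypothetical data: 100 differences observed in 1000 aligned sites.
N_SITES, N_DIFF = 1000, 100
d_hat = mle_distance(N_SITES, N_DIFF)
error_near = abs(
    taylor_log_likelihood(d_hat + 0.005, N_SITES, N_DIFF)
    - log_likelihood(d_hat + 0.005, N_SITES, N_DIFF)
)
error_far = abs(
    taylor_log_likelihood(0.5, N_SITES, N_DIFF)
    - log_likelihood(0.5, N_SITES, N_DIFF)
)
```

    Comparing error_near with error_far shows the behaviour the abstract describes: the approximation tracks the exact value closely near the peak and degrades for proposals far from it. Once the peak and curvature are stored, each evaluation during MCMC needs only a few arithmetic operations.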

  3. Gone with the plate: the opening of the Western Mediterranean basin drove the diversification of ground-dweller spiders

    PubMed Central

    2011-01-01

    Background: The major islands of the Western Mediterranean--Corsica, Sardinia, and the Balearic Islands--are continental terranes that drifted towards their present day location following a retreat from their original position on the eastern Iberian Peninsula about 30 million years ago. Several studies have taken advantage of this well-dated geological scenario to calibrate molecular rates in species for which distributions seemed to match this tectonic event. Nevertheless, the use of external calibration points has revealed that most of the present-day fauna on these islands post-dated the opening of the western Mediterranean basin. In this study, we use sequence information of the cox1, nad1, 16S, L1, and 12S mitochondrial genes and the 18S, 28S, and h3 nuclear genes, along with relaxed clock models and a combination of biogeographic and fossil external calibration points, to test alternative historical scenarios of the evolutionary history of the ground-dweller spider genus Parachtes (Dysderidae), which is endemic to the region. Results: We analyse 49 specimens representing populations of most Parachtes species and close relatives. Our results reveal that both the sequence of species formation in Parachtes and the estimated divergence times match the geochronological sequence of separation of the main islands, suggesting that the diversification of the group was driven by Tertiary plate tectonics. In addition, the confirmation that Parachtes diversification matches well-dated geological events provides a model framework to infer substitution rates of molecular markers. Divergence rate estimates ranged from 3.5% My⁻¹ (nad1) to 0.12% My⁻¹ (28S), and the average divergence rate for the mitochondrial genes was 2.25% My⁻¹, very close to the "standard" arthropod mitochondrial rate (2.3% My⁻¹). Conclusions: Our study provides the first unequivocal evidence of terrestrial endemic fauna of the major western Mediterranean islands, whose origin can be traced back to the Oligocene separation of these islands from the continent. Moreover, our study provides useful information on the divergence rate estimates of the most commonly used genes for phylogenetic inference in non-model arthropods. PMID:22039781

  4. Gone with the plate: the opening of the Western Mediterranean basin drove the diversification of ground-dweller spiders.

    PubMed

    Bidegaray-Batista, Leticia; Arnedo, Miquel A

    2011-10-31

    The major islands of the Western Mediterranean--Corsica, Sardinia, and the Balearic Islands--are continental terranes that drifted towards their present day location following a retreat from their original position on the eastern Iberian Peninsula about 30 million years ago. Several studies have taken advantage of this well-dated geological scenario to calibrate molecular rates in species for which distributions seemed to match this tectonic event. Nevertheless, the use of external calibration points has revealed that most of the present-day fauna on these islands post-dated the opening of the western Mediterranean basin. In this study, we use sequence information of the cox1, nad1, 16S, L1, and 12S mitochondrial genes and the 18S, 28S, and h3 nuclear genes, along with relaxed clock models and a combination of biogeographic and fossil external calibration points, to test alternative historical scenarios of the evolutionary history of the ground-dweller spider genus Parachtes (Dysderidae), which is endemic to the region. We analyse 49 specimens representing populations of most Parachtes species and close relatives. Our results reveal that both the sequence of species formation in Parachtes and the estimated divergence times match the geochronological sequence of separation of the main islands, suggesting that the diversification of the group was driven by Tertiary plate tectonics. In addition, the confirmation that Parachtes diversification matches well-dated geological events provides a model framework to infer substitution rates of molecular markers. Divergence rate estimates ranged from 3.5% My⁻¹ (nad1) to 0.12% My⁻¹ (28S), and the average divergence rate for the mitochondrial genes was 2.25% My⁻¹, very close to the "standard" arthropod mitochondrial rate (2.3% My⁻¹). Our study provides the first unequivocal evidence of terrestrial endemic fauna of the major western Mediterranean islands, whose origin can be traced back to the Oligocene separation of these islands from the continent. Moreover, our study provides useful information on the divergence rate estimates of the most commonly used genes for phylogenetic inference in non-model arthropods.

  5. Mitogenome Phylogenetics: The Impact of Using Single Regions and Partitioning Schemes on Topology, Substitution Rate and Divergence Time Estimation

    PubMed Central

    Duchêne, Sebastián; Archer, Frederick I.; Vilstrup, Julia; Caballero, Susana; Morin, Phillip A.

    2011-01-01

    The availability of mitochondrial genome sequences is growing as a result of recent technological advances in molecular biology. In phylogenetic analyses, the complete mitogenome is increasingly becoming the marker of choice, usually providing better phylogenetic resolution and precision relative to traditional markers such as cytochrome b (CYTB) and the control region (CR). In some cases, the differences in phylogenetic estimates between mitogenomic and single-gene markers have yielded incongruent conclusions. By comparing phylogenetic estimates made from different genes, we identified the most informative mitochondrial regions and evaluated the minimum amount of data necessary to reproduce the same results as the mitogenome. We compared results among individual genes and the mitogenome for recently published complete mitogenome datasets of selected delphinids (Delphinidae) and killer whales (genus Orcinus). Using Bayesian phylogenetic methods, we investigated differences in estimation of topologies, divergence dates, and clock-like behavior among genes for both datasets. Although the most informative regions were not the same for each taxonomic group (COX1, CYTB, ND3 and ATP6 for Orcinus, and ND1, COX1 and ND4 for Delphinidae), in both cases they were equivalent to less than a quarter of the complete mitogenome. This suggests that gene information content can vary among groups, but can be adequately represented by a portion of the complete sequence. Although our results indicate that complete mitogenomes provide the highest phylogenetic resolution and most precise date estimates, a minimum amount of data can be selected using our approach when the complete sequence is unavailable. Studies based on single genes can benefit from the addition of a few more mitochondrial markers, producing topologies and date estimates similar to those obtained using the entire mitogenome. PMID:22073275

  6. Xenopus in Space and Time: Fossils, Node Calibrations, Tip-Dating, and Paleobiogeography.

    PubMed

    Cannatella, David

    2015-01-01

    Published data from DNA sequences, morphology of 11 extant and 15 extinct frog taxa, and stratigraphic ranges of fossils were integrated to open a window into the deep-time evolution of Xenopus. The ages and morphological characters of fossils were used as independent datasets to calibrate a chronogram. We found that DNA sequences, either alone or in combination with morphological data and fossils, tended to support a close relationship between Xenopus and Hymenochirus, although in some analyses this topology was not significantly better than the Pipa + Hymenochirus topology. Analyses that excluded DNA data found strong support for the Pipa + Hymenochirus tree. The criterion for selecting the maximum age of the calibration prior influenced the age estimates, and our age estimates of early divergences in the tree of frogs are substantially younger than those of published studies. Node-dating and tip-dating calibrations, either alone or in combination, yielded older dates for nodes than did a root calibration alone. Our estimates of divergence times indicate that overwater dispersal, rather than vicariance due to the splitting of Africa and South America, may explain the presence of Xenopus in Africa and its closest fossil relatives in South America.

  7. Positive selection and propeptide repeats promote rapid interspecific divergence of a gastropod sperm protein.

    PubMed

    Hellberg, M E; Moy, G W; Vacquier, V D

    2000-03-01

    Male-specific proteins have increasingly been reported as targets of positive selection and are of special interest because of the role they may play in the evolution of reproductive isolation. We report the rapid interspecific divergence of cDNA encoding a major acrosomal protein of unknown function (TMAP) of sperm from five species of teguline gastropods. A mitochondrial DNA clock (calibrated by congeneric species divided by the Isthmus of Panama) estimates that these five species diverged 2-10 MYA. Inferred amino acid sequences reveal a propeptide that has diverged rapidly between species. The mature protein has diverged faster still due to high nonsynonymous substitution rates (> 25 nonsynonymous substitutions per site per 10^9 years). cDNA encoding the mature protein (89-100 residues) shows evidence of positive selection (Dn/Ds > 1) for 4 of 10 pairwise species comparisons. cDNA and predicted secondary-structure comparisons suggest that TMAP is neither orthologous nor paralogous to abalone lysin, and thus marks a second, phylogenetically independent, protein subject to strong positive selection in free-spawning marine gastropods. In addition, an internal repeat in one species (Tegula aureotincta) produces a duplicated cleavage site which results in two alternatively processed mature proteins differing by nine amino acid residues. Such alternative processing may provide a mechanism for introducing novel amino acid sequence variation at the amino-termini of proteins. Highly divergent TMAP N-termini from two other tegulines (Tegula regina and Norrisia norrisii) may have originated by such a mechanism.

  8. Estimating phylogenetic relationships despite discordant gene trees across loci: the species tree of a diverse species group of feather mites (Acari: Proctophyllodidae).

    PubMed

    Knowles, Lacey L; Klimov, Pavel B

    2011-11-01

    With the increased availability of multilocus sequence data, the lack of concordance of gene trees estimated for independent loci has focused attention on both the biological processes producing the discord and the methodologies used to estimate phylogenetic relationships. What has emerged is a suite of new analytical tools for phylogenetic inference--species tree approaches. In contrast to traditional phylogenetic methods that are stymied by the idiosyncrasies of gene trees, approaches for estimating species trees explicitly take into account the cause of discord among loci and, in the process, provide a direct estimate of phylogenetic history (i.e. the history of species divergence, not divergence of specific loci). We illustrate the utility of species tree estimates with an analysis of a diverse group of feather mites, the pinnatus species group (genus Proctophyllodes). Discord among four sequenced nuclear loci is consistent with theoretical expectations, given the short time separating speciation events (as evidenced by short internodes relative to terminal branch lengths in the trees). Nevertheless, many of the relationships are well resolved in a Bayesian estimate of the species tree; the analysis also highlights ambiguous aspects of the phylogeny that require additional loci. The broad utility of species tree approaches is discussed, and specifically, their application to groups with high speciation rates--a history of diversification with particular prevalence in host/parasite systems where species interactions can drive rapid diversification.

  9. Intercontinental divergence in the Populus-associated ectomycorrhizal fungus, Tricholoma populinum.

    PubMed

    Grubisha, Lisa C; Levsen, Nicholas; Olson, Matthew S; Taylor, D Lee

    2012-04-01

    The ectomycorrhizal fungus Tricholoma populinum is host-specific with Populus species. T. populinum has wind-dispersed propagules and may be capable of long-distance dispersal. In this study, we tested the hypothesis of a panmictic population between Scandinavia and North America. DNA sequences from five nuclear loci were used to assess phylogeographic structure and nucleotide divergence between continents. Tricholoma populinum was composed of Scandinavian and North American lineages with complete absence of shared haplotypes and only one shared nucleotide mutation. Divergence of these lineages was estimated at approx. 1.7-1.0 million yr ago (Ma), which occurred after the estimated divergence of host species Populus tremula and Populus balsamifera/Populus trichocarpa at 5 Ma. Phylogeographic structure was not observed within Scandinavian or North American lineages of T. populinum. Intercontinental divergence appears to have resulted from either allopatric isolation; a recent, rare long-distance dispersal founding event followed by genetic drift; or the response in an obligate mycorrhizal fungus with a narrow host range to contractions and expansion of host distribution during glacial and interglacial episodes within continents. Understanding present genetic variation in populations is important for predicting how obligate symbiotic fungi will adapt to present and future changing climatic conditions. © 2012 The Authors. New Phytologist © 2012 New Phytologist Trust.

  10. Sequence space and the ongoing expansion of the protein universe.

    PubMed

    Povolotskaya, Inna S; Kondrashov, Fyodor A

    2010-06-17

    The need to maintain the structural and functional integrity of an evolving protein severely restricts the repertoire of acceptable amino-acid substitutions. However, it is not known whether these restrictions impose a global limit on how far homologous protein sequences can diverge from each other. Here we explore the limits of protein evolution using sequence divergence data. We formulate a computational approach to study the rate of divergence of distant protein sequences and measure this rate for ancient proteins, those that were present in the last universal common ancestor. We show that ancient proteins are still diverging from each other, indicating an ongoing expansion of the protein sequence universe. The slow rate of this divergence is imposed by the sparseness of functional protein sequences in sequence space and the ruggedness of the protein fitness landscape: approximately 98 per cent of sites cannot accept an amino-acid substitution at any given moment but a vast majority of all sites may eventually be permitted to evolve when other, compensatory, changes occur. Thus, approximately 3.5 × 10^9 yr has not been enough to reach the limit of divergent evolution of proteins, and for most proteins the limit of sequence similarity imposed by common function may not exceed that of random sequences.

  11. Sequencing of the needle transcriptome from Norway spruce (Picea abies Karst L.) reveals lower substitution rates, but similar selective constraints in gymnosperms and angiosperms

    PubMed Central

    2012-01-01

    Background A detailed knowledge about spatial and temporal gene expression is important for understanding both the function of genes and their evolution. For the vast majority of species, transcriptomes are still largely uncharacterized and even in those where substantial information is available it is often in the form of partially sequenced transcriptomes. With the development of next generation sequencing, a single experiment can now simultaneously identify the transcribed part of a species genome and estimate levels of gene expression. Results mRNA from actively growing needles of Norway spruce (Picea abies) was sequenced using next generation sequencing technology. In total, close to 70 million fragments with a length of 76 bp were sequenced resulting in 5 Gbp of raw data. A de novo assembly of these reads, together with publicly available expressed sequence tag (EST) data from Norway spruce, was used to create a reference transcriptome. Of the 38,419 PUTs (putative unique transcripts) longer than 150 bp in this reference assembly, 83.5% show similarity to ESTs from other spruce species and of the remaining PUTs, 3,704 show similarity to protein sequences from other plant species, leaving 4,167 PUTs with limited similarity to currently available plant proteins. By predicting coding frames and comparing not only the Norway spruce PUTs, but also PUTs from the close relatives Picea glauca and Picea sitchensis to both Pinus taeda and Taxus mairei, we obtained estimates of synonymous and non-synonymous divergence among conifer species. In addition, we detected close to 15,000 SNPs of high quality and estimated gene expression differences between samples collected under dark and light conditions. Conclusions Our study yielded a large number of single nucleotide polymorphisms as well as estimates of gene expression on transcriptome scale. 
In agreement with a recent study we find that the synonymous substitution rate per year (0.6 × 10^-9 and 1.1 × 10^-9) is an order of magnitude smaller than values reported for angiosperm herbs. However, if one takes generation time into account, most of this difference disappears. The estimates of the dN/dS ratio (non-synonymous over synonymous divergence) reported here are in general much lower than 1 and only a few genes showed a ratio larger than 1. PMID:23122049
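
    The generation-time argument in the abstract above can be illustrated with a short sketch. Only the two per-year spruce rates come from the abstract; the generation times and the herb rate below are hypothetical placeholders chosen for illustration, not values reported by the study.

```python
def per_generation_rate(rate_per_year: float, generation_time_years: float) -> float:
    """Convert a per-site, per-year substitution rate to a per-generation rate."""
    return rate_per_year * generation_time_years


# Per-year synonymous rates quoted in the abstract (substitutions per site per year).
SPRUCE_RATES_PER_YEAR = (0.6e-9, 1.1e-9)

# HYPOTHETICAL values, assumed only for this illustration.
ASSUMED_SPRUCE_GENERATION_YEARS = 25.0  # assumption: long-lived conifer
ASSUMED_HERB_RATE_PER_YEAR = 1.0e-8     # assumption: roughly 10x the spruce rate
ASSUMED_HERB_GENERATION_YEARS = 1.0     # assumption: annual herb

herb_per_generation = per_generation_rate(
    ASSUMED_HERB_RATE_PER_YEAR, ASSUMED_HERB_GENERATION_YEARS
)

for rate in SPRUCE_RATES_PER_YEAR:
    spruce_per_generation = per_generation_rate(rate, ASSUMED_SPRUCE_GENERATION_YEARS)
    print(
        f"per-year ratio herb/spruce: {ASSUMED_HERB_RATE_PER_YEAR / rate:.1f}, "
        f"per-generation ratio herb/spruce: {herb_per_generation / spruce_per_generation:.2f}"
    )
```

    Under these assumed inputs the per-year gap of about an order of magnitude shrinks to a ratio near one once rates are expressed per generation. Different generation times would change the numbers.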

  12. Influence of gene flow on divergence dating - implications for the speciation history of Takydromus grass lizards.

    PubMed

    Tseng, Shu-Ping; Li, Shou-Hsien; Hsieh, Chia-Hung; Wang, Hurng-Yi; Lin, Si-Min

    2014-10-01

    Dating the time of divergence and understanding speciation processes are central to the study of the evolutionary history of organisms but are notoriously difficult. The difficulty is largely rooted in variations in the ancestral population size or in the genealogy variation across loci. To depict the speciation processes and divergence histories of three monophyletic Takydromus species endemic to Taiwan, we sequenced 20 nuclear loci and combined them with one mitochondrial locus published in GenBank. They were analysed by a multispecies coalescent approach within a Bayesian framework. Divergence dating based on the gene tree approach showed high variation among loci, and the divergence was estimated at an earlier date than when derived by the species-tree approach. To test whether variations in the ancestral population size accounted for the majority of this variation, we conducted computer inferences using isolation-with-migration (IM) and approximate Bayesian computation (ABC) frameworks. The results revealed that gene flow during the early stage of speciation was strongly favoured over the isolation model, and the initiation of the speciation process was far earlier than the dates estimated by gene- and species-based divergence dating. Due to their limited dispersal ability, it is suggested that geographical isolation may have played a major role in the divergence of these Takydromus species. Nevertheless, this study reveals a more complex situation and demonstrates that gene flow during the speciation process cannot be overlooked and may have a great impact on divergence dating. By using multilocus data and incorporating Bayesian coalescence approaches, we provide a more biologically realistic framework for delineating the divergence history of Takydromus. © 2014 John Wiley & Sons Ltd.

  13. Phylogenetic relationships and timing of diversification in gonorynchiform fishes inferred using nuclear gene DNA sequences (Teleostei: Ostariophysi).

    PubMed

    Near, Thomas J; Dornburg, Alex; Friedman, Matt

    2014-11-01

    The Gonorynchiformes are the sister lineage of the species-rich Otophysi and provide important insights into the diversification of ostariophysan fishes. Phylogenies of gonorynchiforms inferred using morphological characters and mtDNA gene sequences provide differing resolutions with regard to the sister lineage of all other gonorynchiforms (Chanos vs. Gonorynchus) and support for monophyly of the two miniaturized lineages Cromeria and Grasseichthys. In this study the phylogeny and divergence times of gonorynchiforms are investigated with DNA sequences sampled from nine nuclear genes and a published morphological character matrix. Bayesian phylogenetic analyses reveal substantial congruence among individual gene trees with inferences from eight genes placing Gonorynchus as the sister lineage to all other gonorynchiforms. Seven gene trees resolve Cromeria and Grasseichthys as a clade, supporting previous inferences using morphological characters. Phylogenies resulting from either concatenating the nuclear genes, performing a multispecies coalescent species tree analysis, or combining the morphological and nuclear gene DNA sequences resolve Gonorynchus as the living sister lineage of all other gonorynchiforms, strongly support the monophyly of Cromeria and Grasseichthys, and resolve a clade containing Parakneria, Cromeria, and Grasseichthys. The morphological dataset, which includes 13 gonorynchiform fossil taxa that range in age from Early Cretaceous to Eocene, was analyzed in combination with DNA sequences from the nine nuclear genes and a relaxed molecular clock to estimate times of evolutionary divergence. This "tip dating" strategy accommodates uncertainty in the phylogenetic resolution of fossil taxa that provide calibration information in the relaxed molecular clock analysis. 
The estimated age of the most recent common ancestor (MRCA) of living gonorynchiforms is slightly older than estimates from previous node dating efforts, but the molecular tip dating estimated ages of Kneriinae (Kneria, Parakneria, Cromeria, and Grasseichthys) and the two paedomorphic lineages, Cromeria and Grasseichthys, are considerably younger. Copyright © 2014 Elsevier Inc. All rights reserved.

  14. A dated molecular phylogeny of manta and devil rays (Mobulidae) based on mitogenome and nuclear sequences.

    PubMed

    Poortvliet, Marloes; Olsen, Jeanine L; Croll, Donald A; Bernardi, Giacomo; Newton, Kelly; Kollias, Spyros; O'Sullivan, John; Fernando, Daniel; Stevens, Guy; Galván Magaña, Felipe; Seret, Bernard; Wintner, Sabine; Hoarau, Galice

    2015-02-01

    Manta and devil rays are an iconic group of globally distributed pelagic filter feeders, yet their evolutionary history remains enigmatic. We employed next generation sequencing of mitogenomes for nine of the 11 recognized species and two outgroups; as well as additional Sanger sequencing of two mitochondrial and two nuclear genes in an extended taxon sampling set. Analysis of the mitogenome coding regions in a Maximum Likelihood and Bayesian framework provided a well-resolved phylogeny. The deepest divergences distinguished three clades with high support, one containing Manta birostris, Manta alfredi, Mobula tarapacana, Mobula japanica and Mobula mobular; one containing Mobula kuhlii, Mobula eregoodootenkee and Mobula thurstoni; and one containing Mobula munkiana, Mobula hypostoma and Mobula rochebrunei. Mobula remains paraphyletic with the inclusion of Manta, a result that is in agreement with previous studies based on molecular and morphological data. A fossil-calibrated Bayesian random local clock analysis suggests that mobulids diverged from Rhinoptera around 30 Mya. Subsequent divergences are characterized by long internodes followed by short bursts of speciation extending from an initial episode of divergence in the Early and Middle Miocene (19-17 Mya) to a second episode during the Pliocene and Pleistocene (3.6 Mya - recent). Estimates of divergence dates overlap significantly with periods of global warming, during which upwelling intensity - and related high primary productivity in upwelling regions - decreased markedly. These periods are hypothesized to have led to fragmentation and isolation of feeding regions leading to possible regional extinctions, as well as the promotion of allopatric speciation. 
The closely shared evolutionary history of mobulids in combination with ongoing threats from fisheries and climate change effects on upwelling and food supply, reinforces the case for greater protection of this charismatic family of pelagic filter feeders. Copyright © 2014 Elsevier Inc. All rights reserved.

  15. Molecular Phylogenetics and Temporal Diversification in the Genus Aeromonas Based on the Sequences of Five Housekeeping Genes

    PubMed Central

    Lorén, J. Gaspar; Farfán, Maribel; Fusté, M. Carmen

    2014-01-01

    Several approaches have been developed to estimate both the relative and absolute rates of speciation and extinction within clades based on molecular phylogenetic reconstructions of evolutionary relationships, according to an underlying model of diversification. However, the macroevolutionary models established for eukaryotes have scarcely been used with prokaryotes. We have investigated the rate and pattern of cladogenesis in the genus Aeromonas (γ-Proteobacteria, Proteobacteria, Bacteria) using the sequences of five housekeeping genes and an uncorrelated relaxed-clock approach. To our knowledge, until now this analysis has never been applied to all the species described in a bacterial genus and thus opens up the possibility of establishing models of speciation from sequence data commonly used in phylogenetic studies of prokaryotes. Our results suggest that the genus Aeromonas began to diverge between 248 and 266 million years ago, exhibiting a constant divergence rate through the Phanerozoic, which could be described as a pure birth process. PMID:24586399

  16. Origin, evolution, and biogeography of Juglans: a phylogenetic perspective

    USDA-ARS's Scientific Manuscript database

    The eastern Asian and eastern North American disjunction in Juglans offers an opportunity to estimate the time since divergence of the Eurasian and American lineages and to compare it with paleobotanical evidences. Five chloroplast DNA non-coding spacer (NCS) sequences: trnT-trnF, psbA-trnH, atpB-r...

  17. Phylogeny and evolution of the auks (subfamily Alcinae) based on mitochondrial DNA sequences

    USGS Publications Warehouse

    Moum, Truls; Johansen, Steinar; Erikstad, Kjell Einar; Piatt, John F.

    1994-01-01

    The genetic divergence and phylogeny of the auks was assessed by mitochondrial DNA sequence comparisons in a study using 19 of the 22 auk species and two outgroup representatives. We compared more than 500 nucleotides from each of two mitochondrial genes encoding 12S rRNA and the NADH dehydrogenase subunit 6. Divergence times were estimated from transversional substitutions. The dovekie (Alle alle) is related to the razorbill (Alca torda) and the murres (Uria spp). Furthermore, the Xantus's murrelet (Synthliboramphus hypoleucus) and the ancient (Synthliboramphus antiquus) and Japanese murrelets (Synthliboramphus wumizusume) are genetically distinct members of the same main lineage, whereas brachyramphine and synthliboramphine murrelets are not closely related. An early adaptive radiation of six main species groups of auks seems to trace back to Middle Miocene. Later speciation probably involved ecological differentiations and geographical isolations.

  18. Speciation in ancient cryptic species complexes: evidence from the molecular phylogeny of Brachionus plicatilis (Rotifera).

    PubMed

    Gómez, Africa; Serra, Manuel; Carvalho, Gary R; Lunt, David H

    2002-07-01

    Continental lake-dwelling zooplanktonic organisms have long been considered cosmopolitan species with little geographic variation in spite of the isolation of their habitats. Evidence of morphological cohesiveness and high dispersal capabilities support this interpretation. However, this view has been challenged recently as many such species have been shown either to comprise cryptic species complexes or to exhibit marked population genetic differentiation and strong phylogeographic structuring at a regional scale. Here we investigate the molecular phylogeny of the cosmopolitan passively dispersing rotifer Brachionus plicatilis (Rotifera: Monogononta) species complex using nucleotide sequence variation from both nuclear (ribosomal internal transcribed spacer 1, ITS1) and mitochondrial (cytochrome c oxidase subunit I, COI) genes. Analysis of rotifer resting eggs from 27 salt lakes in the Iberian Peninsula plus lakes from four continents revealed nine genetically divergent lineages. The high level of sequence divergence, absence of hybridization, and extensive sympatry observed support the specific status of these lineages. Sequence divergence estimates indicate that the B. plicatilis complex began diversifying many millions of years ago, yet has shown relatively high levels of morphological stasis. We discuss these results in relation to the ecology and genetics of aquatic invertebrates possessing dispersive resting propagules and address the apparent contradiction between zooplanktonic population structure and their morphological stasis.

  19. Pleistocene climate change and the origin of two desert plant species, Pugionium cornutum and Pugionium dolabratum (Brassicaceae), in northwest China.

    PubMed

    Wang, Qian; Abbott, Richard J; Yu, Qiu-Shi; Lin, Kao; Liu, Jian-Quan

    2013-07-01

    Pleistocene climate change has had an important effect in shaping intraspecific genetic variation in many species; however, its role in driving speciation is less clear. We examined the possibility of a Pleistocene origin of the only two representatives of the genus Pugionium (Brassicaceae), Pugionium cornutum and Pugionium dolabratum, which occupy different desert habitats in northwest China. We surveyed sequence variation for internal transcribed spacer (ITS), three chloroplast (cp) DNA fragments, and eight low-copy nuclear genes among individuals sampled from 11 populations of each species across their geographic ranges. One ITS mutation distinguished the two species, whereas mutations in cpDNA and the eight low-copy nuclear gene sequences were not species-specific. Although interspecific divergence varied greatly among nuclear gene sequences, in each case divergence was estimated to have occurred within the Pleistocene when deserts expanded in northwest China. Our findings point to the importance of Pleistocene climate change, in this case an increase in aridity, as a cause of speciation in Pugionium as a result of divergence in different habitats that formed in association with the expansion of deserts in China. © 2013 The Authors. New Phytologist © 2013 New Phytologist Trust.

  20. Retroposon analysis of major cetacean lineages: The monophyly of toothed whales and the paraphyly of river dolphins

    PubMed Central

    Nikaido, Masato; Matsuno, Fumio; Hamilton, Healy; Brownell, Robert L.; Cao, Ying; Ding, Wang; Zuoyan, Zhu; Shedlock, Andrew M.; Fordyce, R. Ewan; Hasegawa, Masami; Okada, Norihiro

    2001-01-01

    SINE (short interspersed element) insertion analysis elucidates contentious aspects in the phylogeny of toothed whales and dolphins (Odontoceti), especially river dolphins. Here, we characterize 25 informative SINEs inserted into unique genomic loci during evolution of odontocetes to construct a cladogram, and determine a total of 2.8 kb per taxon of the flanking sequences of these SINE loci to estimate divergence times among lineages. We demonstrate that: (i) Odontocetes are monophyletic; (ii) Ganges River dolphins, beaked whales, and ocean dolphins diverged (in this order) after sperm whales; (iii) three other river dolphin taxa, namely the Amazon, La Plata, and Yangtze river dolphins, form a monophyletic group with Yangtze River dolphins being the most basal; and (iv) the rapid radiation of extant cetacean lineages occurred some 28–33 million years B.P., in strong accord with the fossil record. The combination of SINE and flanking sequence analysis suggests a topology and set of divergence times for odontocete relationships, offering alternative explanations for several long-standing problems in cetacean evolution. PMID:11416211

  1. Genetic diversity among populations of Antarctic springtails (Collembola) within the Mackay Glacier ecotone.

    PubMed

    Beet, Clare R; Hogg, Ian D; Collins, Gemma E; Cowan, Don A; Wall, Diana H; Adams, Byron J

    2016-09-01

    Climate changes are likely to have major influences on the distribution and abundance of Antarctic terrestrial biota. To assess arthropod distribution and diversity within the Ross Sea region, we examined mitochondrial DNA (COI) sequences for three currently recognized species of springtail (Collembola) collected from sites in the vicinity, and to the north of, the Mackay Glacier (77°S). This area acts as a transition between two biogeographic regions (northern and southern Victoria Land). We found populations of highly divergent individuals (5%-11.3% intraspecific sequence divergence) for each of the three putative springtail species, suggesting the possibility of cryptic diversity. Based on molecular clock estimates, these divergent lineages are likely to have been isolated for 3-5 million years. It was during this time that the Western Antarctic Ice Sheet (WAIS) was likely to have completely collapsed, potentially facilitating springtail dispersal via rafting on running waters and open seaways. The reformation of the WAIS would have isolated newly established populations, with subsequent dispersal restricted by glaciers and ice-covered areas. Given the currently limited distributions for these genetically divergent populations, any future changes in species' distributions can be easily tracked through the DNA barcoding of springtails from within the Mackay Glacier ecotone.

  2. Mitochondrial Genomes Reveal Slow Rates of Molecular Evolution and the Timing of Speciation in Beavers (Castor), One of the Largest Rodent Species

    PubMed Central

    Horn, Susanne; Durka, Walter; Wolf, Ronny; Ermala, Aslak; Stubbe, Annegret; Stubbe, Michael; Hofreiter, Michael

    2011-01-01

    Background Beavers are one of the largest and ecologically most distinct rodent species. Little is known about their evolution and even their closest phylogenetic relatives have not yet been identified with certainty. Similarly, little is known about the timing of divergence events within the genus Castor. Methodology/Principal Findings We sequenced complete mitochondrial genomes from both extant beaver species and used these sequences to place beavers in the phylogenetic tree of rodents and date their divergence from other rodents as well as the divergence events within the genus Castor. Our analyses support the phylogenetic position of beavers as a sister lineage to the scaly tailed squirrel Anomalurus within the mouse related clade. Molecular dating places the divergence time of the lineages leading to beavers and Anomalurus as early as around 54 million years ago (mya). The living beaver species, Castor canadensis from North America and Castor fiber from Eurasia, although similar in appearance, appear to have diverged from a common ancestor more than seven mya. This result is consistent with the hypothesis that a migration of Castor from Eurasia to North America as early as 7.5 mya could have initiated their speciation. We date the common ancestor of the extant Eurasian beaver relict populations to around 210,000 years ago, much earlier than previously thought. Finally, the substitution rate of Castor mitochondrial DNA is considerably lower than that of other rodents. We found evidence that this is correlated with the longer life span of beavers compared to other rodents. Conclusions/Significance A phylogenetic analysis of mitochondrial genome sequences suggests a sister-group relationship between Castor and Anomalurus, and allows molecular dating of species divergence in congruence with paleontological data. 
The implementation of a relaxed molecular clock enabled us to estimate mitochondrial substitution rates and to evaluate the effect of life history traits on it. PMID:21307956

  3. Siberian population of the New Stone Age: mtDNA haplotype diversity in the ancient population from the Ust'-Ida I burial ground, dated 4020-3210 BC by 14C.

    PubMed

    Naumova, O Yu; Rychkov, S Yu

    1998-03-01

    On the basis of analysis of mtDNA from skeletal remains, dated by 14C to 4020-3210 BC, from the Ust'-Ida I Neolithic burial ground in the Cis-Baikal area of Siberia, we obtained genetic characteristics of the ancient Mongoloid population. Using seven restriction enzymes to analyze restriction-site polymorphism in the 16,106-16,545 region of mtDNA, we studied the structure of the most frequent DNA haplotypes and estimated the intrapopulational nucleotide diversity of the Neolithic population. Comparison of the Neolithic and modern indigenous populations from Siberia, Mongolia and the Urals showed that the ancient Siberian population is one of the ancestors of the modern population of Siberia. From genetic distance, under the assumption of a constant nucleotide substitution rate, we estimated the divergence time between the Neolithic and the modern Siberian population. This divergence time (5572 years ago) is consistent with the age of the skeletal remains (5542-5652 years). Using the 14C dates of the skeletal remains, the nucleotide substitution rate in mtDNA was estimated as 1% sequence divergence per 8938-9115 years.
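
    The constant-rate (strict clock) reasoning in the abstract above amounts to a linear conversion between percent sequence divergence and elapsed time. The sketch below is only an illustration of that arithmetic, not the authors' procedure; the function names are hypothetical, and the only inputs are the rate range (1% per 8938-9115 years) and the divergence time (5572 years) quoted in the abstract.

```python
def years_from_divergence(divergence_percent: float, years_per_percent: float) -> float:
    """Elapsed time implied by a divergence, assuming a constant substitution rate."""
    return divergence_percent * years_per_percent


def divergence_from_years(years: float, years_per_percent: float) -> float:
    """Percent divergence expected after `years`, assuming a constant rate."""
    return years / years_per_percent


# Values quoted in the abstract.
YEARS_PER_PERCENT_FAST = 8938.0   # 1% divergence per 8938 years
YEARS_PER_PERCENT_SLOW = 9115.0   # 1% divergence per 9115 years
REPORTED_DIVERGENCE_TIME = 5572.0  # years

# Percent divergence implied by the reported time under each end of the rate range.
implied_low = divergence_from_years(REPORTED_DIVERGENCE_TIME, YEARS_PER_PERCENT_SLOW)
implied_high = divergence_from_years(REPORTED_DIVERGENCE_TIME, YEARS_PER_PERCENT_FAST)

print(f"implied divergence: {implied_low:.3f}% to {implied_high:.3f}%")
```

    The conversion is linear, so any uncertainty in the calibrated rate carries over proportionally to the estimated time. The abstract does not report the observed genetic distance itself, so the implied values here are derived quantities only.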

  4. Virus Identification in Unknown Tropical Febrile Illness Cases Using Deep Sequencing

    PubMed Central

    Balmaseda, Angel; Harris, Eva; DeRisi, Joseph L.

    2012-01-01

    Dengue virus is an emerging infectious agent that infects an estimated 50–100 million people annually worldwide, yet current diagnostic practices cannot detect an etiologic pathogen in ∼40% of dengue-like illnesses. Metagenomic approaches to pathogen detection, such as viral microarrays and deep sequencing, are promising tools to address emerging and non-diagnosable disease challenges. In this study, we used the Virochip microarray and deep sequencing to characterize the spectrum of viruses present in human sera from 123 Nicaraguan patients presenting with dengue-like symptoms but testing negative for dengue virus. We utilized a barcoding strategy to simultaneously deep sequence multiple serum specimens, generating on average over 1 million reads per sample. We then implemented a stepwise bioinformatic filtering pipeline to remove the majority of human and low-quality sequences to improve the speed and accuracy of subsequent unbiased database searches. By deep sequencing, we were able to detect virus sequence in 37% (45/123) of previously negative cases. These included 13 cases with Human Herpesvirus 6 sequences. Other samples contained sequences with similarity to sequences from viruses in the Herpesviridae, Flaviviridae, Circoviridae, Anelloviridae, Asfarviridae, and Parvoviridae families. In some cases, the putative viral sequences were virtually identical to known viruses, and in others they diverged, suggesting that they may derive from novel viruses. These results demonstrate the utility of unbiased metagenomic approaches in the detection of known and divergent viruses in the study of tropical febrile illness. PMID:22347512

  5. Phylogeographic Analyses of American Black Bears (Ursus americanus) Suggest Four Glacial Refugia and Complex Patterns of Postglacial Admixture.

    PubMed

    Puckett, Emily E; Etter, Paul D; Johnson, Eric A; Eggert, Lori S

    2015-09-01

    Studies of species with continental distributions continue to identify intraspecific lineages despite continuous habitat. Lineages may form due to isolation by distance, adaptation, divergence across barriers, or genetic drift following range expansion. We investigated lineage diversification and admixture within American black bears (Ursus americanus) across their range using 22 k single nucleotide polymorphisms and mitochondrial DNA sequences. We identified three subcontinental nuclear clusters which we further divided into nine geographic regions: Alaskan (Alaska-East), eastern (Central Interior Highlands, Great Lakes, Northeast, Southeast), and western (Alaska-West, West, Pacific Coast, Southwest). We estimated that the western cluster diverged 67 ka, before eastern and Alaskan divergence 31 ka; these divergence dates contrasted with those from the mitochondrial genome where clades A and B diverged 1.07 Ma, and clades A-east and A-west diverged 169 ka. We combined estimates of divergence timing with hindcast species distribution models to infer glacial refugia for the species in Beringia, Pacific Northwest, Southwest, and Southeast. Our results show a complex arrangement of admixture due to expansion out of multiple refugia. The delineation of the genomic population clusters was inconsistent with the ranges for 16 previously described subspecies. Ranges for U. a. pugnax and U. a. cinnamomum were concordant with admixed clusters, calling into question how to order taxa below the species level. Additionally, our finding that U. a. floridanus has not diverged from U. a. americanus also suggests that morphology and genetics should be reanalyzed to assess taxonomic designations relevant to the conservation management of the species. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved.

  6. Polymorphism discovery and allele frequency estimation using high-throughput DNA sequencing of target-enriched pooled DNA samples

    PubMed Central

    2012-01-01

    Background: The central role of the somatotrophic axis in animal post-natal growth, development and fertility is well established. Therefore, the identification of genetic variants affecting quantitative traits within this axis is an attractive goal. However, large sample numbers are a pre-requisite for the identification of genetic variants underlying complex traits and although technologies are improving rapidly, high-throughput sequencing of large numbers of complete individual genomes remains prohibitively expensive. Therefore, using a pooled DNA approach coupled with target enrichment and high-throughput sequencing, the aim of this study was to identify polymorphisms and estimate allele frequency differences across 83 candidate genes of the somatotrophic axis, in 150 Holstein-Friesian dairy bulls divided into two groups divergent for genetic merit for fertility. Results: In total, 4,135 SNPs and 893 indels were identified during the resequencing of the 83 candidate genes. Nineteen percent (n = 952) of variants were located within 5' and 3' UTRs. Seventy-two percent (n = 3,612) were intronic and 9% (n = 464) were exonic, including 65 indels and 236 SNPs resulting in non-synonymous substitutions (NSS). Significant (P < 0.01) mean allele frequency differentials between the low and high fertility groups were observed for 720 SNPs (58 NSS). Allele frequencies for 43 of the SNPs were also determined by genotyping the 150 individual animals (Sequenom® MassARRAY). No significant differences (P > 0.1) were observed between the two methods for any of the 43 SNPs across both pools (i.e., 86 tests in total). Conclusions: The results of the current study support previous findings of the use of DNA sample pooling and high-throughput sequencing as a viable strategy for polymorphism discovery and allele frequency estimation. 
Using this approach we have characterised the genetic variation within genes of the somatotrophic axis and related pathways, central to mammalian post-natal growth and development and subsequent lactogenesis and fertility. We have identified a large number of variants segregating at significantly different frequencies between cattle groups divergent for calving interval plausibly harbouring causative variants contributing to heritable variation. To our knowledge, this is the first report describing sequencing of targeted genomic regions in any livestock species using groups with divergent phenotypes for an economically important trait. PMID:22235840

  7. Mitochondrial and nuclear DNA sequences support a Cretaceous origin of Columbiformes and a dispersal-driven radiation in the Paleocene.

    PubMed

    Pereira, Sergio L; Johnson, Kevin P; Clayton, Dale H; Baker, Allan J

    2007-08-01

    Phylogenetic relationships among genera of pigeons and doves (Aves, Columbiformes) have not been fully resolved because of limited sampling of taxa and characters in previous studies. We therefore sequenced multiple nuclear and mitochondrial DNA genes totaling over 9000 bp from 33 of 41 genera plus 8 outgroup taxa, and, together with sequences from 5 other pigeon genera retrieved from GenBank, recovered a strong phylogenetic hypothesis for the Columbiformes. Three major clades were recovered with the combined data set, comprising the basally branching New World pigeons and allies (clade A) that are sister to Neotropical ground doves (clade B), and the Afro-Eurasian and Australasian taxa (clade C). None of these clades supports the monophyly of current families and subfamilies. The extinct, flightless dodo and solitaires (Raphidae) were embedded within pigeons and doves (Columbidae) in clade C, and monophyly of the subfamily Columbinae was refuted because the remaining subfamilies were nested within it. Divergence times estimated using a Bayesian framework suggest that Columbiformes diverged from outgroups such as Apodiformes and Caprimulgiformes in the Cretaceous before the mass extinction that marks the end of this period. Bayesian and maximum likelihood inferences of ancestral areas, accounting for phylogenetic uncertainty and divergence times, respectively, favor an ancient origin of Columbiformes in the Neotropical portion of what was then Gondwana. The radiation of modern genera of Columbiformes started in the Early Eocene to the Middle Miocene, as previously estimated for other avian groups such as ratites, tinamous, galliform birds, penguins, shorebirds, parrots, passerine birds, and toucans. Multiple dispersals of more derived Columbiformes between Australasian and Afro-Eurasian regions are required to explain current distributions.

  8. A genomic timescale of prokaryote evolution: insights into the origin of methanogenesis, phototrophy, and the colonization of land

    NASA Technical Reports Server (NTRS)

    Battistuzzi, Fabia U.; Feijao, Andreia; Hedges, S. Blair

    2004-01-01

    BACKGROUND: The timescale of prokaryote evolution has been difficult to reconstruct because of a limited fossil record and complexities associated with molecular clocks and deep divergences. However, the relatively large number of genome sequences currently available has provided a better opportunity to control for potential biases such as horizontal gene transfer and rate differences among lineages. We assembled a data set of sequences from 32 proteins (approximately 7600 amino acids) common to 72 species and estimated phylogenetic relationships and divergence times with a local clock method. RESULTS: Our phylogenetic results support most of the currently recognized higher-level groupings of prokaryotes. Of particular interest is a well-supported group of three major lineages of eubacteria (Actinobacteria, Deinococcus, and Cyanobacteria) that we call Terrabacteria and associate with an early colonization of land. Divergence time estimates for the major groups of eubacteria are between 2.5-3.2 billion years ago (Ga) while those for archaebacteria are mostly between 3.1-4.1 Ga. The time estimates suggest a Hadean origin of life (prior to 4.1 Ga), an early origin of methanogenesis (3.8-4.1 Ga), an origin of anaerobic methanotrophy after 3.1 Ga, an origin of phototrophy prior to 3.2 Ga, an early colonization of land 2.8-3.1 Ga, and an origin of aerobic methanotrophy 2.5-2.8 Ga. CONCLUSIONS: Our early time estimates for methanogenesis support the consideration of methane, in addition to carbon dioxide, as a greenhouse gas responsible for the early warming of the Earth's surface. Our divergence times for the origin of anaerobic methanotrophy are compatible with highly depleted carbon isotopic values found in rocks dated 2.8-2.6 Ga. 
An early origin of phototrophy is consistent with the earliest bacterial mats and structures identified as stromatolites, but a 2.6 Ga origin of cyanobacteria suggests that those Archean structures, if biologically produced, were made by anoxygenic photosynthesizers. The resistance to desiccation of Terrabacteria and their elaboration of photoprotective compounds suggests that the common ancestor of this group inhabited land. If true, then oxygenic photosynthesis may owe its origin to terrestrial adaptations.

  9. Phylogeny and Bayesian divergence time estimations of small-headed flies (Diptera: Acroceridae) using multiple molecular markers.

    PubMed

    Winterton, Shaun L; Wiegmann, Brian M; Schlinger, Evert I

    2007-06-01

    The first formal analysis of phylogenetic relationships among small-headed flies (Acroceridae) is presented based on DNA sequence data from two ribosomal (16S and 28S) and two protein-encoding genes: carbamoylphosphate synthase (CPS) domain of CAD (i.e., rudimentary locus) and cytochrome oxidase I (COI). DNA sequences from 40 species in 22 genera of Acroceridae (representing all three subfamilies) were compared with outgroup exemplars from Nemestrinidae, Stratiomyidae, Tabanidae, and Xylophagidae. Parsimony and Bayesian simultaneous analyses of the full data set recover a well-resolved and strongly supported hypothesis of phylogenetic relationships for major lineages within the family. Molecular evidence supports the monophyly of traditionally recognised subfamilies Philopotinae and Panopinae, but Acrocerinae are polyphyletic. Panopinae, sometimes considered "primitive" based on morphology and host-use, are always placed in a more derived position in the current study. Furthermore, these data support emerging morphological evidence that the type genus Acrocera Meigen, and its sister genus Sphaerops, are atypical acrocerids, comprising a sister lineage to all other Acroceridae. Based on the phylogeny generated in the simultaneous analysis, historical divergence times were estimated using Bayesian methodology constrained with fossil data. These estimates indicate Acroceridae likely evolved during the late Triassic but did not diversify greatly until the Cretaceous.

  10. On the Evolutionary and Biogeographic History of Saxifraga sect. Trachyphyllum (Gaud.) Koch (Saxifragaceae Juss.)

    PubMed Central

    DeChaine, Eric G.; Anderson, Stacy A.; McNew, Jennifer M.; Wendling, Barry M.

    2013-01-01

    Arctic-alpine plants in the genus Saxifraga L. (Saxifragaceae Juss.) provide an excellent system for investigating the process of diversification in northern regions. Yet, sect. Trachyphyllum (Gaud.) Koch, which comprises about 8 to 26 species, has still not been explored by molecular systematists even though taxonomists concur that the section needs to be thoroughly re-examined. Our goals were to use chloroplast trnL-F and nuclear ITS DNA sequence data to circumscribe the section phylogenetically, test models of geographically-based population divergence, and assess the utility of morphological characters in estimating evolutionary relationships. To do so, we sequenced both genetic markers for 19 taxa within the section. The phylogenetic inferences of sect. Trachyphyllum using maximum likelihood and Bayesian analyses showed that the section is polyphyletic, with S. aspera L. and S. bryoides L. falling outside the main clade. In addition, the analyses supported several taxonomic re-classifications to prior names. We used two approaches to test biogeographic hypotheses: i) a coalescent approach in Mesquite to test the fit of our reconstructed gene trees to geographically-based models of population divergence and ii) a maximum likelihood inference in Lagrange. These tests uncovered strong support for an origin of the clade in the Southern Rocky Mountains of North America followed by dispersal and divergence episodes across refugia. Finally, we adopted a stochastic character mapping approach in SIMMAP to investigate the utility of morphological characters in estimating evolutionary relationships among taxa. We found that few morphological characters were phylogenetically informative and many were misleading. Our molecular analyses provide a foundation for the diversity and evolutionary relationships within sect. 
Trachyphyllum and hypotheses for better understanding the patterns and processes of divergence in this section, other saxifrages, and plants inhabiting the North Pacific Rim. PMID:23922810

  11. Evolutionary Roots and Diversification of the Genus Aeromonas.

    PubMed

    Sanglas, Ariadna; Albarral, Vicenta; Farfán, Maribel; Lorén, J G; Fusté, M C

    2017-01-01

    Despite the importance of diversification rates in the study of prokaryote evolution, they have not been quantitatively assessed for the majority of microorganism taxa. The investigation of evolutionary patterns in prokaryotes constitutes a challenge due to a very scarce fossil record, limited morphological differentiation and frequently complex taxonomic relationships, which make even species recognition difficult. Although the speciation models and speciation rates in eukaryotes have traditionally been established by analyzing the fossil record data, this is frequently incomplete, and not always available. More recently, several methods based on molecular sequence data have been developed to estimate speciation and extinction rates from phylogenies reconstructed from contemporary taxa. In this work, we determined the divergence time and temporal diversification of the genus Aeromonas by applying these methods widely used with eukaryotic taxa. Our analysis involved 150 Aeromonas strains using the concatenated sequences of two housekeeping genes (approximately 2,000 bp). Dating and diversification model analyses were performed using two different approaches: obtaining the consensus sequence from the concatenated sequences corresponding to all the strains belonging to the same species, or generating the species tree from multiple alignments of each gene. We used BEAST to perform a Bayesian analysis to estimate both the phylogeny and the divergence times. A global molecular clock cannot be assumed for any gene. From the chronograms obtained, we carried out a diversification analysis using several approaches. The results suggest that the genus Aeromonas began to diverge approximately 250 million years ago (Ma). 
All methods used to determine Aeromonas diversification gave similar results, suggesting that the speciation process in this bacterial genus followed a rate-constant (Yule) diversification model, although there is a small probability that a slight deceleration occurred in recent times. We also determined the constant of diversification (λ) values, which in all cases were very similar, about 0.01 species/Ma, a value clearly lower than those described for different eukaryotes.

  12. Evolutionary Roots and Diversification of the Genus Aeromonas

    PubMed Central

    Sanglas, Ariadna; Albarral, Vicenta; Farfán, Maribel; Lorén, J. G.; Fusté, M. C.

    2017-01-01

    Despite the importance of diversification rates in the study of prokaryote evolution, they have not been quantitatively assessed for the majority of microorganism taxa. The investigation of evolutionary patterns in prokaryotes constitutes a challenge due to a very scarce fossil record, limited morphological differentiation and frequently complex taxonomic relationships, which make even species recognition difficult. Although the speciation models and speciation rates in eukaryotes have traditionally been established by analyzing the fossil record data, this is frequently incomplete, and not always available. More recently, several methods based on molecular sequence data have been developed to estimate speciation and extinction rates from phylogenies reconstructed from contemporary taxa. In this work, we determined the divergence time and temporal diversification of the genus Aeromonas by applying these methods widely used with eukaryotic taxa. Our analysis involved 150 Aeromonas strains using the concatenated sequences of two housekeeping genes (approximately 2,000 bp). Dating and diversification model analyses were performed using two different approaches: obtaining the consensus sequence from the concatenated sequences corresponding to all the strains belonging to the same species, or generating the species tree from multiple alignments of each gene. We used BEAST to perform a Bayesian analysis to estimate both the phylogeny and the divergence times. A global molecular clock cannot be assumed for any gene. From the chronograms obtained, we carried out a diversification analysis using several approaches. The results suggest that the genus Aeromonas began to diverge approximately 250 million years ago (Ma). 
All methods used to determine Aeromonas diversification gave similar results, suggesting that the speciation process in this bacterial genus followed a rate-constant (Yule) diversification model, although there is a small probability that a slight deceleration occurred in recent times. We also determined the constant of diversification (λ) values, which in all cases were very similar, about 0.01 species/Ma, a value clearly lower than those described for different eukaryotes. PMID:28228750
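
    As a rough illustration of what a rate-constant (Yule) model with λ of about 0.01 species/Ma implies over 250 Ma, the sketch below uses the textbook pure-birth expectation N(t) = N0 · exp(λt). This is not the authors' BEAST-based analysis; the function names and the choice of two starting lineages (N0 = 2, i.e. counting from the crown node) are assumptions made only for this example.

```python
import math


def yule_expected_lineages(rate, time, n0=2):
    """Expected number of lineages under a pure-birth (Yule) process.

    rate: speciation rate (species per lineage per Ma)
    time: elapsed time (Ma)
    n0:   starting number of lineages (assumed 2, the two crown branches)
    """
    return n0 * math.exp(rate * time)


def yule_rate_from_crown(n_species, crown_age, n0=2):
    """Back-of-the-envelope rate estimate: ln(n / n0) / t.

    A simple inversion of the expectation above; it ignores extinction
    and incomplete sampling, so treat it as a sanity check only.
    """
    if n_species < n0 or crown_age <= 0:
        raise ValueError("need n_species >= n0 and crown_age > 0")
    return math.log(n_species / n0) / crown_age


if __name__ == "__main__":
    # Inputs taken from the abstract: lambda ~ 0.01 species/Ma, ~250 Ma.
    expected = yule_expected_lineages(0.01, 250.0)
    print(f"expected lineages after 250 Ma: {expected:.1f}")
    # Inverting the same relation recovers the rate we started from.
    print(f"recovered rate: {yule_rate_from_crown(expected, 250.0):.4f}")
```

    Under these assumptions the expectation is a few dozen lineages, which is the kind of order-of-magnitude check such a model allows; it says nothing about the deceleration the abstract mentions.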

  13. Bayesian phylogenetic estimation of fossil ages.

    PubMed

    Drummond, Alexei J; Stadler, Tanja

    2016-07-19

    Recent advances have allowed for both morphological fossil evidence and molecular sequences to be integrated into a single combined inference of divergence dates under the rule of Bayesian probability. In particular, the fossilized birth-death tree prior and the Lewis-Mk model of discrete morphological evolution allow for the estimation of both divergence times and phylogenetic relationships between fossil and extant taxa. We exploit this statistical framework to investigate the internal consistency of these models by producing phylogenetic estimates of the age of each fossil in turn, within two rich and well-characterized datasets of fossil and extant species (penguins and canids). We find that the estimation accuracy of fossil ages is generally high with credible intervals seldom excluding the true age and median relative error in the two datasets of 5.7% and 13.2%, respectively. The median relative standard error (RSD) was 9.2% and 7.2%, respectively, suggesting good precision, although with some outliers. In fact, in the two datasets we analyse, the phylogenetic estimate of fossil age is on average less than 2 Myr from the mid-point age of the geological strata from which it was excavated. The high level of internal consistency found in our analyses suggests that the Bayesian statistical model employed is an adequate fit for both the geological and morphological data, and provides evidence from real data that the framework used can accurately model the evolution of discrete morphological traits coded from fossil and extant taxa. 
We anticipate that this approach will have diverse applications beyond divergence time dating, including dating fossils that are temporally unconstrained, testing of the 'morphological clock', and for uncovering potential model misspecification and/or data errors when controversial phylogenetic hypotheses are obtained based on combined divergence dating analyses. This article is part of the themed issue 'Dating species divergences using rocks and clocks'. © 2016 The Authors.

  14. Bayesian phylogenetic estimation of fossil ages

    PubMed Central

    Drummond, Alexei J.; Stadler, Tanja

    2016-01-01

    Recent advances have allowed for both morphological fossil evidence and molecular sequences to be integrated into a single combined inference of divergence dates under the rule of Bayesian probability. In particular, the fossilized birth–death tree prior and the Lewis-Mk model of discrete morphological evolution allow for the estimation of both divergence times and phylogenetic relationships between fossil and extant taxa. We exploit this statistical framework to investigate the internal consistency of these models by producing phylogenetic estimates of the age of each fossil in turn, within two rich and well-characterized datasets of fossil and extant species (penguins and canids). We find that the estimation accuracy of fossil ages is generally high with credible intervals seldom excluding the true age and median relative error in the two datasets of 5.7% and 13.2%, respectively. The median relative standard error (RSD) was 9.2% and 7.2%, respectively, suggesting good precision, although with some outliers. In fact, in the two datasets we analyse, the phylogenetic estimate of fossil age is on average less than 2 Myr from the mid-point age of the geological strata from which it was excavated. The high level of internal consistency found in our analyses suggests that the Bayesian statistical model employed is an adequate fit for both the geological and morphological data, and provides evidence from real data that the framework used can accurately model the evolution of discrete morphological traits coded from fossil and extant taxa. We anticipate that this approach will have diverse applications beyond divergence time dating, including dating fossils that are temporally unconstrained, testing of the ‘morphological clock’, and for uncovering potential model misspecification and/or data errors when controversial phylogenetic hypotheses are obtained based on combined divergence dating analyses. 
This article is part of the themed issue ‘Dating species divergences using rocks and clocks’. PMID:27325827

  15. Dynamics of actin evolution in dinoflagellates.

    PubMed

    Kim, Sunju; Bachvaroff, Tsvetan R; Handy, Sara M; Delwiche, Charles F

    2011-04-01

    Dinoflagellates have unique nuclei and intriguing genome characteristics with very high DNA content making complete genome sequencing difficult. In dinoflagellates, many genes are found in multicopy gene families, but the processes involved in the establishment and maintenance of these gene families are poorly understood. Understanding the dynamics of gene family evolution in dinoflagellates requires comparisons at different evolutionary scales. Studies of closely related species provide fine-scale information relative to species divergence, whereas comparisons of more distantly related species provide broad context. We selected the actin gene family as a highly expressed conserved gene previously studied in dinoflagellates. Of the 142 sequences determined in this study, 103 were from the two closely related species, Dinophysis acuminata and D. caudata, including full length and partial cDNA sequences as well as partial genomic amplicons. For these two Dinophysis species, at least three types of sequences could be identified. Most copies (79%) were relatively similar and in nucleotide trees, the sequences formed two bushy clades corresponding to the two species. In comparisons within species, only eight to ten nucleotide differences were found between these copies. The two remaining types formed clades containing sequences from both species. One type included the most similar sequences in between-species comparisons with as few as 12 nucleotide differences between species. The second type included the most divergent sequences in comparisons between and within species with up to 93 nucleotide differences between sequences. In all the sequences, most variation occurred in synonymous sites or the 5' untranslated region (UTR), although there was still limited amino acid variation between most sequences. 
Several potential pseudogenes were found (approximately 10% of all sequences depending on species) with incomplete open reading frames due to frameshifts or early stop codons. Overall, variation in the actin gene family fits best with the "birth and death" model of evolution based on recent duplications, pseudogenes, and incomplete lineage sorting. Divergence between species was similar to variation within species, so that actin may be too conserved to be useful for phylogenetic estimation of closely related species.

  16. Ancient papillomavirus-host co-speciation in Felidae

    PubMed Central

    Rector, Annabel; Lemey, Philippe; Tachezy, Ruth; Mostmans, Sara; Ghim, Shin-Je; Van Doorslaer, Koenraad; Roelke, Melody; Bush, Mitchell; Montali, Richard J; Joslin, Janis; Burk, Robert D; Jenson, Alfred B; Sundberg, John P; Shapiro, Beth; Van Ranst, Marc

    2007-01-01

    Background: Estimating evolutionary rates for slowly evolving viruses such as papillomaviruses (PVs) is not possible using fossil calibrations directly or sequences sampled over a time-scale of decades. An ability to correlate their divergence with a host species, however, can provide a means to estimate evolutionary rates for these viruses accurately. To determine whether such an approach is feasible, we sequenced complete feline PV genomes, previously available only for the domestic cat (Felis domesticus, FdPV1), from four additional, globally distributed feline species: Lynx rufus PV type 1, Puma concolor PV type 1, Panthera leo persica PV type 1, and Uncia uncia PV type 1. Results: The feline PVs all belong to the Lambdapapillomavirus genus, and contain an unusual second noncoding region between the early and late protein region, which is only present in members of this genus. Our maximum likelihood and Bayesian phylogenetic analyses demonstrate that the evolutionary relationships between feline PVs perfectly mirror those of their feline hosts, despite a complex and dynamic phylogeographic history. By applying host species divergence times, we provide the first precise estimates for the rate of evolution for each PV gene, with an overall evolutionary rate of 1.95 × 10⁻⁸ (95% confidence interval 1.32 × 10⁻⁸ to 2.47 × 10⁻⁸) nucleotide substitutions per site per year for the viral coding genome. Conclusion: Our work provides evidence for long-term virus-host co-speciation of feline PVs, indicating that viral diversity in slowly evolving viruses can be used to investigate host species evolution. These findings, however, should not be extrapolated to other viral lineages without prior confirmation of virus-host co-divergence. PMID:17430578
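
    The idea of calibrating a viral rate with host divergence times can be sketched with the simplest strict-clock relation, rate = d / (2T), where d is the pairwise distance between two viral lineages (substitutions per site) and T is the time since their hosts split. This is a simplified stand-in, not the maximum likelihood and Bayesian procedure the abstract describes; the function name and the input values are hypothetical and chosen only for illustration.

```python
def substitution_rate(pairwise_distance, divergence_time_years):
    """Strict-clock rate estimate in substitutions per site per year.

    pairwise_distance:     substitutions per site separating two lineages
    divergence_time_years: time since the two lineages (here, the hosts) split

    The factor 2 accounts for change accumulating along both branches
    since the common ancestor.
    """
    if divergence_time_years <= 0:
        raise ValueError("divergence time must be positive")
    if pairwise_distance < 0:
        raise ValueError("distance cannot be negative")
    return pairwise_distance / (2.0 * divergence_time_years)


if __name__ == "__main__":
    # Hypothetical inputs (not from the study): 0.4 substitutions/site
    # between two viral genomes whose hosts diverged 10 million years ago.
    rate = substitution_rate(0.4, 10e6)
    print(f"rate = {rate:.2e} substitutions/site/year")
```

    With these made-up inputs the result lands in the 10⁻⁸ range, the same order of magnitude as the rate reported above, which is all the sketch is meant to convey.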

  17. Molecular-based estimate of species number, phylogenetic relationships and divergence times for the genus Stenotaenia (Chilopoda, Geophilomorpha) in the Italian region

    PubMed Central

    Del Latte, Laura; Bortolin, Francesca; Rota-Stabelli, Omar; Fusco, Giuseppe; Bonato, Lucio

    2015-01-01

    Stenotaenia is one of the largest and most widespread genera of geophilid centipedes in the Western Palearctic, with a very uniform morphology and about fifteen species provisionally recognized. For a better understanding of Stenotaenia species-level taxonomy, we have explored the possibility of using molecular data. As a preliminary assay, we sampled twelve populations, mainly from the Italian region, and analyzed partial sequences of the two genes COI and 28S. We employed a DNA-barcoding approach, complemented by a phylogenetic analysis coupled with divergence time estimation. Assuming a barcoding gap of 10–16% K2P pairwise distances, we found evidence for the presence of at least six Stenotaenia species in the Italian region, which started diverging about 50 million years ago, only partially matching with previously recognized species. We found that small-sized oligopodous species belong to a single clade that originated about 33 million years ago, and obtained some preliminary evidence of the related genus Tuoba being nested within Stenotaenia. PMID:26257533
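
    For readers unfamiliar with the K2P (Kimura two-parameter) distance behind the barcoding gap, a minimal sketch follows. It implements the standard formula d = -1/2 · ln((1 - 2P - Q) · sqrt(1 - 2Q)), where P and Q are the proportions of sites showing transitions and transversions. The function name and the handling of gaps (sites with anything other than A, C, G or T are skipped) are assumptions of this example, not details taken from the study.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(seq1, seq2):
    """Kimura two-parameter distance between two aligned DNA sequences."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")

    transitions = transversions = compared = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        # Skip gaps and ambiguity codes (assumption of this sketch).
        if a not in "ACGT" or b not in "ACGT":
            continue
        compared += 1
        if a == b:
            continue
        # Purine<->purine or pyrimidine<->pyrimidine is a transition.
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1
        else:
            transversions += 1

    if compared == 0:
        raise ValueError("no comparable sites")

    p = transitions / compared    # proportion of transitional differences
    q = transversions / compared  # proportion of transversional differences
    term1 = 1.0 - 2.0 * p - q
    term2 = 1.0 - 2.0 * q
    if term1 <= 0 or term2 <= 0:
        raise ValueError("sequences too divergent for the K2P correction")
    return -0.5 * math.log(term1 * math.sqrt(term2))


if __name__ == "__main__":
    # Toy alignment: one transition (A/G) and one transversion (A/C) in 10 sites.
    d = k2p_distance("AAAAAAAAAA", "GAAAAAAAAC")
    print(f"K2P distance: {d:.4f}")
```

    In a barcoding workflow, pairwise distances like this would be compared against a chosen threshold (the abstract assumes a gap of 10–16%) to decide whether two populations are treated as the same or different candidate species.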

  18. Multilocus phylogeny, divergence times, and a major role for the benthic-to-pelagic axis in the diversification of grunts (Haemulidae).

    PubMed

    Tavera, Jose; Acero P, Arturo; Wainwright, Peter C

    2018-04-01

    We present a phylogenetic analysis with divergence time estimates, and an ecomorphological assessment of the role of the benthic-to-pelagic axis of diversification in the history of haemulid fishes. Phylogenetic analyses were performed on 97 grunt species based on sequence data collected from seven loci. Divergence time estimation indicates that Haemulidae originated during the mid Eocene (54.7-42.3 Ma) but that the major lineages were formed during the mid-Oligocene 30-25 Ma. We propose a new classification that reflects the phylogenetic history of grunts. Overall the pattern of morphological and functional diversification in grunts appears to be strongly linked with feeding ecology. Feeding traits and the first principal component of body shape strongly separate species that feed in benthic and pelagic habitats. The benthic-to-pelagic axis has been the major axis of ecomorphological diversification in this important group of tropical shoreline fishes, with about 13 transitions between feeding habitats that have had major consequences for head and body morphology. Copyright © 2017 Elsevier Inc. All rights reserved.

  19. Mitogenomic analysis of the genus Panthera.

    PubMed

    Wei, Lei; Wu, Xiaobing; Zhu, Lixin; Jiang, Zhigang

    2011-10-01

    The complete sequences of the mitochondrial DNA genomes of Panthera tigris, Panthera pardus, and Panthera uncia were determined using the polymerase chain reaction method. The lengths of the complete mitochondrial DNA sequences of the three species were 16990, 16964, and 16773 bp, respectively. Each of the three mitochondrial DNA genomes included 13 protein-coding genes, 22 tRNA, two rRNA, one O(L)R, and one control region. The structures of the genomes were highly similar to those of Felis catus, Acinonyx jubatus, and Neofelis nebulosa. The phylogenies of the genus Panthera were inferred from two combined mitochondrial sequence data sets and the complete mitochondrial genome sequences, by MP (maximum parsimony), ML (maximum likelihood), and Bayesian analysis. The results showed that Panthera was composed of Panthera leo, P. uncia, P. pardus, Panthera onca, P. tigris, and N. nebulosa, which was included as the most basal member. The phylogeny within the genus Panthera was N. nebulosa (P. tigris (P. onca (P. pardus, (P. leo, P. uncia)))). The divergence times for the genus Panthera were estimated based on the ML branch lengths and four well-established calibration points. The results showed that at about 11.3 MYA, the genus Panthera separated from other felid species and then evolved into the several species of the genus. In detail, N. nebulosa was estimated to have originated about 8.66 MYA, P. tigris about 6.55 MYA, P. uncia about 4.63 MYA, and P. pardus about 4.35 MYA. All these estimated times were older than those estimated from the fossil records. The divergence event, evolutionary process, speciation, and distribution pattern of P. uncia, a species endemic to Central Asia with core habitats on the Qinghai-Tibetan Plateau and surrounding highlands, mostly correlated with the geological tectonic events and intensive climate shifts that happened at 8, 3.6, 2.5, and 1.7 MYA on the plateau during the late Cenozoic period.

  20. Estimating Divergence Parameters With Small Samples From a Large Number of Loci

    PubMed Central

    Wang, Yong; Hey, Jody

    2010-01-01

    Most methods for studying divergence with gene flow rely upon data from many individuals at few loci. Such data can be useful for inferring recent population history but they are unlikely to contain sufficient information about older events. However, the growing availability of genome sequences suggests a different kind of sampling scheme, one that may be more suited to studying relatively ancient divergence. Data sets extracted from whole-genome alignments may represent very few individuals but contain a very large number of loci. To take advantage of such data we developed a new maximum-likelihood method for genomic data under the isolation-with-migration model. Unlike many coalescent-based likelihood methods, our method does not rely on Monte Carlo sampling of genealogies, but rather provides a precise calculation of the likelihood by numerical integration over all genealogies. We demonstrate that the method works well on simulated data sets. We also consider two models for accommodating mutation rate variation among loci and find that the model that treats mutation rates as random variables leads to better estimates. We applied the method to the divergence of Drosophila melanogaster and D. simulans and detected a low, but statistically significant, signal of gene flow from D. simulans to D. melanogaster. PMID:19917765

  1. Divergence times in the termite genus Macrotermes (Isoptera: Termitidae).

    PubMed

    Brandl, R; Hyodo, F; Korff-Schmising, M von; Maekawa, K; Miura, T; Takematsu, Y; Matsumoto, T; Abe, T; Bagine, R; Kaib, M

    2007-10-01

    The evolution of fungus-growing termites is supposed to have started in the African rain forests with multiple invasions of semi-arid habitats as well as multiple invasions of the Oriental region. We used sequences of the mitochondrial COII gene and Bayesian dating to investigate the time frame of the evolution of Macrotermes, an important genus of fungus-growing termites. We found that the genus Macrotermes consists of at least 6 distantly related clades. Furthermore, the COII sequences suggested some cryptic diversity within the analysed African Macrotermes species. The dates calculated with the COII data using a fossilized termite mound to calibrate the clock were in good agreement with dates calculated with COI sequences using the split between Locusta and Chorthippus as calibration point, which supports the consistency of the calibration points. The clades from the Oriental region dated back to the early Tertiary. These estimates of divergence times suggested that Macrotermes invaded Asia during periods with humid climates. For Africa, many speciation events predated the Pleistocene and fall in the range of 6-23 million years ago. These estimates suggest that savannah-adapted African clades radiated with the spread of the semi-arid ecosystems during the Miocene. Apparently, events during the Pleistocene were of little importance for speciation within the genus Macrotermes. However, further investigations are necessary to increase the number of taxa for phylogenetic analysis.

  2. Host Jumps and Radiation, Not Co-Divergence Drives Diversification of Obligate Pathogens. A Case Study in Downy Mildews and Asteraceae.

    PubMed

    Choi, Young-Joon; Thines, Marco

    2015-01-01

    Even though the microevolution of plant hosts and pathogens has been intensely studied, knowledge regarding macro-evolutionary patterns is limited. Having the highest species diversity and host-specificity among Oomycetes, downy mildews are a useful model for investigating long-term host-pathogen coevolution. We show that phylogenies of Bremia and Asteraceae are significantly congruent. The accepted hypothesis is that pathogens have diverged contemporaneously with their hosts. But maximum clade age estimation and sequence divergence comparison reveal that congruence is not due to long-term coevolution but rather due to host-shift driven speciation (pseudo-cospeciation). This pattern results from parasite radiation in related hosts, long after radiation and speciation of the hosts. As large host shifts free pathogens from hosts with effector-triggered immunity, subsequent radiation and diversification in related hosts with similar innate immunity may follow, resulting in a pattern mimicking true co-divergence, which is probably limited to the terminal nodes in many pathogen groups.

  3. Host Jumps and Radiation, Not Co‐Divergence Drives Diversification of Obligate Pathogens. A Case Study in Downy Mildews and Asteraceae

    PubMed Central

    Choi, Young-Joon; Thines, Marco

    2015-01-01

    Even though the microevolution of plant hosts and pathogens has been intensely studied, knowledge regarding macro-evolutionary patterns is limited. Having the highest species diversity and host-specificity among Oomycetes, downy mildews are a useful model for investigating long-term host-pathogen coevolution. We show that phylogenies of Bremia and Asteraceae are significantly congruent. The accepted hypothesis is that pathogens have diverged contemporaneously with their hosts. But maximum clade age estimation and sequence divergence comparison reveal that congruence is not due to long-term coevolution but rather due to host-shift driven speciation (pseudo-cospeciation). This pattern results from parasite radiation in related hosts, long after radiation and speciation of the hosts. As large host shifts free pathogens from hosts with effector-triggered immunity, subsequent radiation and diversification in related hosts with similar innate immunity may follow, resulting in a pattern mimicking true co-divergence, which is probably limited to the terminal nodes in many pathogen groups. PMID:26230508

  4. Phylogeny and evolution of ferns (monilophytes) with a focus on the early leptosporangiate divergences.

    PubMed

    Pryer, Kathleen M; Schuettpelz, Eric; Wolf, Paul G; Schneider, Harald; Smith, Alan R; Cranfill, Raymond

    2004-10-01

    The phylogenetic structure of ferns (= monilophytes) is explored here, with a special focus on the early divergences among leptosporangiate lineages. Despite considerable progress in our understanding of fern relationships, a rigorous and comprehensive analysis of the early leptosporangiate divergences was lacking. Therefore, a data set was designed here to include critical taxa that were not included in earlier studies. More than 5000 bp from the plastid (rbcL, atpB, rps4) and the nuclear (18S rDNA) genomes were sequenced for 62 taxa. Phylogenetic analyses of these data (1) confirm that Osmundaceae are sister to the rest of the leptosporangiates, (2) resolve a diverse set of ferns formerly thought to be a subsequent grade as possibly monophyletic (((Dipteridaceae, Matoniaceae), Gleicheniaceae), Hymenophyllaceae), and (3) place schizaeoid ferns as sister to a large clade of "core leptosporangiates" that includes heterosporous ferns, tree ferns, and polypods. Divergence time estimates for ferns are reported from penalized likelihood analyses of our molecular data, with constraints from a reassessment of the fossil record.

  5. Genomic patterns of species diversity and divergence in Eucalyptus.

    PubMed

    Hudson, Corey J; Freeman, Jules S; Myburg, Alexander A; Potts, Brad M; Vaillancourt, René E

    2015-06-01

    We examined genome-wide patterns of DNA sequence diversity and divergence among six species of the important tree genus Eucalyptus and investigated their relationship with genomic architecture. Using c. 90 range-wide individuals of each Eucalyptus species (E. grandis, E. urophylla, E. globulus, E. nitens, E. dunnii and E. camaldulensis), genetic diversity and divergence were estimated from 2840 polymorphic diversity arrays technology markers covering the 11 chromosomes. Species differentiating markers (SDMs) identified in each of 15 pairwise species comparisons, along with species diversity (HHW) and divergence (FST), were projected onto the E. grandis reference genome. Across all species comparisons, SDMs totalled 1.1-5.3% of markers and were widely distributed throughout the genome. Marker divergence (FST and SDMs) and diversity differed among and within chromosomes. Patterns of diversity and divergence were broadly conserved across species and significantly associated with genomic features, including the proximity of markers to genes, the relative number of clusters of tandem duplications, and gene density within or among chromosomes. These results suggest that genomic architecture influences patterns of species diversity and divergence in the genus. This influence is evident across the six species, encompassing diverse phylogenetic lineages, geography and ecology. © 2015 University of Tasmania New Phytologist © 2015 New Phytologist Trust.

  6. Segmenting the human genome based on states of neutral genetic divergence.

    PubMed

    Kuruppumullage Don, Prabhani; Ananda, Guruprasad; Chiaromonte, Francesca; Makova, Kateryna D

    2013-09-03

    Many studies have demonstrated that divergence levels generated by different mutation types vary and covary across the human genome. To improve our still-incomplete understanding of the mechanistic basis of this phenomenon, we analyze several mutation types simultaneously, anchoring their variation to specific regions of the genome. Using hidden Markov models on insertion, deletion, nucleotide substitution, and microsatellite divergence estimates inferred from human-orangutan alignments of neutrally evolving genomic sequences, we segment the human genome into regions corresponding to different divergence states--each uniquely characterized by specific combinations of divergence levels. We then parsed the mutagenic contributions of various biochemical processes associating divergence states with a broad range of genomic landscape features. We find that high divergence states inhabit guanine- and cytosine (GC)-rich, highly recombining subtelomeric regions; low divergence states cover inner parts of autosomes; chromosome X forms its own state with lowest divergence; and a state of elevated microsatellite mutability is interspersed across the genome. These general trends are mirrored in human diversity data from the 1000 Genomes Project, and departures from them highlight the evolutionary history of primate chromosomes. We also find that genes and noncoding functional marks [annotations from the Encyclopedia of DNA Elements (ENCODE)] are concentrated in high divergence states. Our results provide a powerful tool for biomedical data analysis: segmentations can be used to screen personal genome variants--including those associated with cancer and other diseases--and to improve computational predictions of noncoding functional elements.

  7. Phylogenomic Analysis Resolves the Interordinal Relationships and Rapid Diversification of the Laurasiatherian Mammals

    PubMed Central

    Zhou, Xuming; Xu, Shixia; Xu, Junxiao; Chen, Bingyao; Zhou, Kaiya; Yang, Guang

    2012-01-01

    Although great progress has been made in resolving the relationships of placental mammals, the positions of several clades in Laurasiatheria remain controversial. In this study, we performed a phylogenetic analysis of 97 orthologs (46,152 bp) for 15 taxa, representing all laurasiatherian orders. Additionally, phylogenetic trees of laurasiatherian mammals with draft genome sequences were reconstructed based on 1608 exons (2,175,102 bp). Our reconstructions resolve the interordinal relationships within Laurasiatheria and corroborate the clades Scrotifera, Fereuungulata, and Cetartiodactyla. Furthermore, we tested alternative topologies within Laurasiatheria, and among alternatives for the phylogenetic position of Perissodactyla, a sister-group relationship with Cetartiodactyla receives the highest support. Thus, Pegasoferae (Perissodactyla + Carnivora + Pholidota + Chiroptera) does not appear to be a natural group. Divergence time estimates from these genes were compared with published estimates for splits within Laurasiatheria. Our estimates were similar to those of several studies and suggest that the divergences among these orders occurred within just a few million years. PMID:21900649

  8. The historical biogeography of the freshwater knifefishes using mitogenomic approaches: a mesozoic origin of the Asian notopterids (Actinopterygii: Osteoglossomorpha).

    PubMed

    Inoue, Jun G; Kumazawa, Yoshinori; Miya, Masaki; Nishida, Mutsumi

    2009-06-01

    The continental distributions of freshwater fishes in the family Notopteridae (Osteoglossomorpha) across Africa, India, and Southeast Asia constitute a long standing and enigmatic problem of freshwater biogeography. The migrational pathway of the Asian notopterids has been discussed in light of two competing schemes: the first posits recent transcontinental dispersal while the second relies on distributions being shaped by ancient vicariance associated with plate-tectonic events. In this study, we determined complete mitochondrial DNA sequences from 10 osteoglossomorph fishes to estimate phylogenetic relationships using partitioned Bayesian and maximum likelihood methods and divergence dates of the family Notopteridae with a partitioned Bayesian approach. We used six species representing the major lineages of the Notopteridae and seven species from the remaining osteoglossomorph families. Fourteen more-derived teleosts, nine basal actinopterygians, two coelacanths, and one shark were used as outgroups. Phylogenetic analyses indicated that the African and Asian notopterids formed a sister group to each other and that these notopterids were a sister to a clade comprising two African families (Mormyridae and Gymnarchidae). Estimated divergence time between the African and Asian notopterids dated back to the early Cretaceous when India-Madagascar separated from the African part of Gondwanaland. Thus, estimated time of divergence based on the molecular evidence is at odds with the recent dispersal model. It can be reconciled with the geological and paleontological evidence to support the vicariance model in which the Asian notopterids diverged from the African notopterids in Gondwanaland and migrated into Eurasia on the Indian subcontinent from the Cretaceous to the Tertiary. 
    However, we could not exclude an alternative explanation that the African and Asian notopterids diverged in Pangea before its complete separation into Laurasia and Gondwanaland, to which these two lineages were later confined, respectively.

  9. Chloroplast DNA Structural Variation, Phylogeny, and Age of Divergence among Diploid Cotton Species.

    PubMed

    Chen, Zhiwen; Feng, Kun; Grover, Corrinne E; Li, Pengbo; Liu, Fang; Wang, Yumei; Xu, Qin; Shang, Mingzhao; Zhou, Zhongli; Cai, Xiaoyan; Wang, Xingxing; Wendel, Jonathan F; Wang, Kunbo; Hua, Jinping

    2016-01-01

    The cotton genus (Gossypium spp.) contains 8 monophyletic diploid genome groups (A, B, C, D, E, F, G, K) and a single allotetraploid clade (AD). To gain insight into the phylogeny of Gossypium and molecular evolution of the chloroplast genome in this group, we performed a comparative analysis of 19 Gossypium chloroplast genomes, six reported here for the first time. Nucleotide distance in non-coding regions was about three times that of coding regions. As expected, distances were smaller within than among genome groups. Phylogenetic topologies based on nucleotide and indel data support the resolution of the 8 genome groups into 6 clades. Phylogenetic analysis of indel distribution among the 19 genomes demonstrates contrasting evolutionary dynamics in different clades, with a parallel genome downsizing in two genome groups and a biased accumulation of insertions in the clade containing the cultivated cottons leading to large (for Gossypium) chloroplast genomes. Divergence time estimates derived from the cpDNA sequence suggest that the major diploid clades had diverged approximately 10 to 11 million years ago. The complete nucleotide sequences of 6 cpDNA genomes are provided, offering a resource for cytonuclear studies in Gossypium.

  10. Chloroplast DNA Structural Variation, Phylogeny, and Age of Divergence among Diploid Cotton Species

    PubMed Central

    Li, Pengbo; Liu, Fang; Wang, Yumei; Xu, Qin; Shang, Mingzhao; Zhou, Zhongli; Cai, Xiaoyan; Wang, Xingxing; Wendel, Jonathan F.; Wang, Kunbo

    2016-01-01

    The cotton genus (Gossypium spp.) contains 8 monophyletic diploid genome groups (A, B, C, D, E, F, G, K) and a single allotetraploid clade (AD). To gain insight into the phylogeny of Gossypium and molecular evolution of the chloroplast genome in this group, we performed a comparative analysis of 19 Gossypium chloroplast genomes, six reported here for the first time. Nucleotide distance in non-coding regions was about three times that of coding regions. As expected, distances were smaller within than among genome groups. Phylogenetic topologies based on nucleotide and indel data support the resolution of the 8 genome groups into 6 clades. Phylogenetic analysis of indel distribution among the 19 genomes demonstrates contrasting evolutionary dynamics in different clades, with a parallel genome downsizing in two genome groups and a biased accumulation of insertions in the clade containing the cultivated cottons leading to large (for Gossypium) chloroplast genomes. Divergence time estimates derived from the cpDNA sequence suggest that the major diploid clades had diverged approximately 10 to 11 million years ago. The complete nucleotide sequences of 6 cpDNA genomes are provided, offering a resource for cytonuclear studies in Gossypium. PMID:27309527

  11. Evolutionary history of Otophysi (Teleostei), a major clade of the modern freshwater fishes: Pangaean origin and Mesozoic radiation

    PubMed Central

    2011-01-01

    Background: Freshwater harbors approximately 12,000 fish species accounting for 43% of the diversity of all modern fish. A single ancestral lineage evolved into about two-thirds of this enormous biodiversity (≈ 7900 spp.) and is currently distributed throughout the world's continents except Antarctica. Despite such remarkable species diversity and ubiquity, the evolutionary history of this major freshwater fish clade, Otophysi, remains largely unexplored. To gain insight into the history of otophysan diversification, we constructed a timetree based on whole mitogenome sequences across 110 species representing 55 of the 64 families. Results: Partitioned maximum likelihood analysis based on unambiguously aligned sequences (9923 bp) confidently recovered the monophyly of Otophysi and the two constituent subgroups (Cypriniformes and Characiphysi). The latter clade comprised three orders (Gymnotiformes, Characiformes, Siluriformes), and Gymnotiformes was sister to the latter two groups. One of the two suborders in Characiformes (Characoidei) was more closely related to Siluriformes than to its own suborder (Citharinoidei), rendering the characiforms paraphyletic. Although this novel relationship did not receive strong statistical support, it was supported by analyzing independent nuclear markers. A relaxed molecular clock Bayesian analysis of the divergence times and reconstruction of ancestral habitats on the timetree suggest a Pangaean origin and Mesozoic radiation of otophysans. Conclusions: The present timetree demonstrates that survival of the ancestral lineages through the two consecutive mass extinctions on Pangaea, and subsequent radiations during the Jurassic through early Cretaceous shaped the modern familial diversity of otophysans. This evolutionary scenario is consistent with recent arguments based on biogeographic inferences and molecular divergence time estimates. No fossil otophysan, however, has been recorded before the Albian, the early Cretaceous 100-112 Ma, creating an over 100 million year time span without fossil evidence. This formidable ghost range partially reflects a genuine difference between the estimated ages of stem group origin (molecular divergence time) and crown group morphological diversification (fossil divergence time); the ghost range, however, would be filled with discoveries of older fossils that can be used as more reasonable time constraints as well as with developments of more realistic models that capture the rates of molecular sequences accurately. PMID:21693066

  12. Evolutionary history of Otophysi (Teleostei), a major clade of the modern freshwater fishes: Pangaean origin and Mesozoic radiation.

    PubMed

    Nakatani, Masanori; Miya, Masaki; Mabuchi, Kohji; Saitoh, Kenji; Nishida, Mutsumi

    2011-06-22

    Freshwater harbors approximately 12,000 fish species accounting for 43% of the diversity of all modern fish. A single ancestral lineage evolved into about two-thirds of this enormous biodiversity (≈ 7900 spp.) and is currently distributed throughout the world's continents except Antarctica. Despite such remarkable species diversity and ubiquity, the evolutionary history of this major freshwater fish clade, Otophysi, remains largely unexplored. To gain insight into the history of otophysan diversification, we constructed a timetree based on whole mitogenome sequences across 110 species representing 55 of the 64 families. Partitioned maximum likelihood analysis based on unambiguously aligned sequences (9923 bp) confidently recovered the monophyly of Otophysi and the two constituent subgroups (Cypriniformes and Characiphysi). The latter clade comprised three orders (Gymnotiformes, Characiformes, Siluriformes), and Gymnotiformes was sister to the latter two groups. One of the two suborders in Characiformes (Characoidei) was more closely related to Siluriformes than to its own suborder (Citharinoidei), rendering the characiforms paraphyletic. Although this novel relationship did not receive strong statistical support, it was supported by analyzing independent nuclear markers. A relaxed molecular clock Bayesian analysis of the divergence times and reconstruction of ancestral habitats on the timetree suggest a Pangaean origin and Mesozoic radiation of otophysans. The present timetree demonstrates that survival of the ancestral lineages through the two consecutive mass extinctions on Pangaea, and subsequent radiations during the Jurassic through early Cretaceous shaped the modern familial diversity of otophysans. This evolutionary scenario is consistent with recent arguments based on biogeographic inferences and molecular divergence time estimates. 
    No fossil otophysan, however, has been recorded before the Albian, the early Cretaceous 100-112 Ma, creating an over 100 million year time span without fossil evidence. This formidable ghost range partially reflects a genuine difference between the estimated ages of stem group origin (molecular divergence time) and crown group morphological diversification (fossil divergence time); the ghost range, however, would be filled with discoveries of older fossils that can be used as more reasonable time constraints as well as with developments of more realistic models that capture the rates of molecular sequences accurately.

  13. Evolution of exceptional species richness among lineages of fleshy-fruited Myrtaceae

    PubMed Central

    Biffin, Ed; Lucas, Eve J.; Craven, Lyn A.; Ribeiro da Costa, Itayguara; Harrington, Mark G.; Crisp, Michael D.

    2010-01-01

    Background and Aims: The angiosperm family Myrtaceae comprises 17 tribes with more than half of the estimated 5500 species being referred to the fleshy-fruited and predominantly rainforest associated Syzygieae and Myrteae. Previous studies suggest that fleshy fruits have evolved separately in these lineages, whereas, more generally, shifts in fruit morphology have been variously implicated in diversification rate shifts among angiosperms. A phylogenetic hypothesis and estimated divergence times for Myrtaceae are developed as a basis to explore the evidence for, and drivers of, elevated diversification rates among the fleshy-fruited tribes of Myrtaceae. Methods: Bayesian phylogenetic analyses of plastid and nuclear DNA sequences were used to estimate intertribal relationships and lineage divergence times in Myrtaceae. Focusing on the fleshy-fruited tribes, a variety of statistical approaches were used to assess diversification rates and diversification rate shifts across the family. Key Results: Analyses of the sequence data provide a strongly supported phylogenetic hypothesis for Myrtaceae. Relative to previous studies, substantially younger ages for many of the clades are reported, and it is argued that the use of flexible calibrations to incorporate fossil data provides more realistic divergence estimates than the use of errorless point calibrations. It is found that Syzygieae and Myrteae have experienced elevated diversification rates relative to other lineages of Myrtaceae. Positive shifts in diversification rate have occurred separately in each lineage, associated with a shift from dry to fleshy fruit. Conclusions: Fleshy fruits have evolved independently in Syzygieae and Myrteae, and this is accompanied by exceptional diversification rate shifts in both instances, suggesting that the evolution of fleshy fruits is a key innovation for rainforest Myrtaceae. Noting the scale dependency of this hypothesis, more complex explanations may be required to explain diversification rate shifts occurring within the fleshy-fruited tribes, and the suggested phylogenetic hypothesis provides an appropriate framework for this undertaking. PMID:20462850

  14. Analyzing the relationship between sequence divergence and nodal support using Bayesian phylogenetic analyses.

    PubMed

    Makowsky, Robert; Cox, Christian L; Roelke, Corey; Chippindale, Paul T

    2010-11-01

    Determining the appropriate gene for phylogeny reconstruction can be a difficult process. Rapidly evolving genes tend to resolve recent relationships, but suffer from alignment issues and increased homoplasy among distantly related species. Conversely, slowly evolving genes generally perform best for deeper relationships, but lack sufficient variation to resolve recent relationships. We determine the relationship between sequence divergence and Bayesian phylogenetic reconstruction ability using both natural and simulated datasets. The natural data are based on 28 well-supported relationships within the subphylum Vertebrata. Sequences of 12 genes were acquired and Bayesian analyses were used to determine phylogenetic support for correct relationships. Simulated datasets were designed to determine whether an optimal range of sequence divergence exists across extreme phylogenetic conditions. Across all genes we found that an optimal range of divergence for resolving the correct relationships does exist, although this level of divergence expectedly depends on the distance metric. Simulated datasets show that an optimal range of sequence divergence exists across diverse topologies and models of evolution. We determine that a simple to measure property of genetic sequences (genetic distance) is related to phylogenic reconstruction ability in Bayesian analyses. This information should be useful for selecting the most informative gene to resolve any relationships, especially those that are difficult to resolve, as well as minimizing both cost and confounding information during project design. Copyright © 2010. Published by Elsevier Inc.
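
    As an illustration of the "simple to measure" genetic distance mentioned in the abstract above, the following minimal Python sketch computes two common pairwise measures for aligned nucleotide sequences: the uncorrected p-distance and the Jukes-Cantor (JC69) corrected distance. The function names and the toy sequences are hypothetical, and the abstract does not state which distance metrics the study used, so this should be read as one plausible instance rather than the authors' procedure.

```python
import math


def p_distance(seq_a: str, seq_b: str) -> float:
    """Uncorrected p-distance: fraction of differing sites.

    Only positions where both sequences carry A, C, G or T are compared,
    so gaps and ambiguity codes are skipped (an assumption of this sketch).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    valid = set("ACGT")
    compared = 0
    mismatches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in valid and b in valid:
            compared += 1
            if a != b:
                mismatches += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return mismatches / compared


def jc69_distance(p: float) -> float:
    """Jukes-Cantor correction: d = -(3/4) * ln(1 - 4p/3).

    The correction is undefined once p reaches 0.75 (saturation),
    so infinity is returned in that case.
    """
    if p >= 0.75:
        return math.inf
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


if __name__ == "__main__":
    # Toy aligned sequences (hypothetical data, one difference in ten sites).
    p = p_distance("ACGTACGTAC", "ACGTACGTAA")
    print(f"p-distance: {p:.3f}, JC69 distance: {jc69_distance(p):.4f}")
```

    The corrected distance exceeds the raw p-distance increasingly as divergence grows, which is one reason the apparent optimal range of divergence can depend on the metric chosen.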

  15. Spatio-Temporal History of HIV-1 CRF35_AD in Afghanistan and Iran.

    PubMed

    Eybpoosh, Sana; Bahrampour, Abbas; Karamouzian, Mohammad; Azadmanesh, Kayhan; Jahanbakhsh, Fatemeh; Mostafavi, Ehsan; Zolala, Farzaneh; Haghdoost, Ali Akbar

    2016-01-01

    HIV-1 Circulating Recombinant Form 35_AD (CRF35_AD) has an important position in the epidemiological profile of Afghanistan and Iran. Despite the presence of this clade in Afghanistan and Iran for over a decade, our understanding of its origin and dissemination patterns is limited. In this study, we performed a Bayesian phylogeographic analysis to reconstruct the spatio-temporal dispersion pattern of this clade using eligible CRF35_AD gag and pol sequences available in the Los Alamos HIV database (432 sequences available from Iran, 16 sequences available from Afghanistan, and a single CRF35_AD-like pol sequence available from USA). Bayesian Markov Chain Monte Carlo algorithm was implemented in BEAST v1.8.1. Between-country dispersion rates were tested with Bayesian stochastic search variable selection method and were considered significant where Bayes factor values were greater than three. The findings suggested that CRF35_AD sequences were genetically similar to parental sequences from Kenya and Uganda, and to a set of subtype A1 sequences available from Afghan refugees living in Pakistan. Our results also showed that across all phylogenies, Afghan and Iranian CRF35_AD sequences formed a monophyletic cluster (posterior clade credibility> 0.7). The divergence date of this cluster was estimated to be between 1990 and 1992. Within this cluster, a bidirectional dispersion of the virus was observed across Afghanistan and Iran. We could not clearly identify if Afghanistan or Iran first established or received this epidemic, as the root location of this cluster could not be robustly estimated. Three CRF35_AD sequences from Afghan refugees living in Pakistan nested among Afghan and Iranian CRF35_AD branches. However, the CRF35_AD-like sequence available from USA diverged independently from Kenyan subtype A1 sequences, suggesting it not to be a true CRF35_AD lineage. 
    Potential factors contributing to viral exchange between Afghanistan and Iran could be injection drug networks and mass migration of Afghan refugees and labourers to Iran, which calls for extensive preventive efforts.

  16. Spatio-Temporal History of HIV-1 CRF35_AD in Afghanistan and Iran

    PubMed Central

    Eybpoosh, Sana; Bahrampour, Abbas; Karamouzian, Mohammad; Azadmanesh, Kayhan; Jahanbakhsh, Fatemeh; Mostafavi, Ehsan; Zolala, Farzaneh; Haghdoost, Ali Akbar

    2016-01-01

    HIV-1 Circulating Recombinant Form 35_AD (CRF35_AD) has an important position in the epidemiological profile of Afghanistan and Iran. Despite the presence of this clade in Afghanistan and Iran for over a decade, our understanding of its origin and dissemination patterns is limited. In this study, we performed a Bayesian phylogeographic analysis to reconstruct the spatio-temporal dispersion pattern of this clade using eligible CRF35_AD gag and pol sequences available in the Los Alamos HIV database (432 sequences available from Iran, 16 sequences available from Afghanistan, and a single CRF35_AD-like pol sequence available from USA). Bayesian Markov Chain Monte Carlo algorithm was implemented in BEAST v1.8.1. Between-country dispersion rates were tested with Bayesian stochastic search variable selection method and were considered significant where Bayes factor values were greater than three. The findings suggested that CRF35_AD sequences were genetically similar to parental sequences from Kenya and Uganda, and to a set of subtype A1 sequences available from Afghan refugees living in Pakistan. Our results also showed that across all phylogenies, Afghan and Iranian CRF35_AD sequences formed a monophyletic cluster (posterior clade credibility> 0.7). The divergence date of this cluster was estimated to be between 1990 and 1992. Within this cluster, a bidirectional dispersion of the virus was observed across Afghanistan and Iran. We could not clearly identify if Afghanistan or Iran first established or received this epidemic, as the root location of this cluster could not be robustly estimated. Three CRF35_AD sequences from Afghan refugees living in Pakistan nested among Afghan and Iranian CRF35_AD branches. However, the CRF35_AD-like sequence available from USA diverged independently from Kenyan subtype A1 sequences, suggesting it not to be a true CRF35_AD lineage. 
    Potential factors contributing to viral exchange between Afghanistan and Iran could be injection drug networks and mass migration of Afghan refugees and labourers to Iran, which calls for extensive preventive efforts. PMID:27280293

  17. Demographic Divergence History of Pied Flycatcher and Collared Flycatcher Inferred from Whole-Genome Re-sequencing Data

    PubMed Central

    Nadachowska-Brzyska, Krystyna; Burri, Reto; Olason, Pall I.; Kawakami, Takeshi; Smeds, Linnéa; Ellegren, Hans

    2013-01-01

    Profound knowledge of demographic history is a prerequisite for the understanding and inference of processes involved in the evolution of population differentiation and speciation. Together with new coalescent-based methods, the recent availability of genome-wide data enables investigation of differentiation and divergence processes at unprecedented depth. We combined two powerful approaches, full Approximate Bayesian Computation analysis (ABC) and pairwise sequentially Markovian coalescent modeling (PSMC), to reconstruct the demographic history of the split between two avian speciation model species, the pied flycatcher and collared flycatcher. Using whole-genome re-sequencing data from 20 individuals, we investigated 15 demographic models including different levels and patterns of gene flow, and changes in effective population size over time. ABC provided high support for recent (mode 0.3 my, range <0.7 my) species divergence, declines in effective population size of both species since their initial divergence, and unidirectional recent gene flow from pied flycatcher into collared flycatcher. The estimated divergence time and population size changes, supported by PSMC results, suggest that the ancestral species persisted through one of the glacial periods of middle Pleistocene and then split into two large populations that first increased in size before going through severe bottlenecks and expanding into their current ranges. Secondary contact appears to have been established after the last glacial maximum. The severity of the bottlenecks at the last glacial maximum is indicated by the discrepancy between current effective population sizes (20,000–80,000) and census sizes (5–50 million birds) of the two species. The recent divergence time challenges the supposition that avian speciation is a relatively slow process with extended times for intrinsic postzygotic reproductive barriers to evolve. 
    Our study emphasizes the importance of using genome-wide data to unravel tangled demographic histories. Moreover, it constitutes one of the first examples of the inference of divergence history from genome-wide data in non-model species. PMID:24244198

  18. Demographic divergence history of pied flycatcher and collared flycatcher inferred from whole-genome re-sequencing data.

    PubMed

    Nadachowska-Brzyska, Krystyna; Burri, Reto; Olason, Pall I; Kawakami, Takeshi; Smeds, Linnéa; Ellegren, Hans

    2013-11-01

    Profound knowledge of demographic history is a prerequisite for the understanding and inference of processes involved in the evolution of population differentiation and speciation. Together with new coalescent-based methods, the recent availability of genome-wide data enables investigation of differentiation and divergence processes at unprecedented depth. We combined two powerful approaches, full Approximate Bayesian Computation analysis (ABC) and pairwise sequentially Markovian coalescent modeling (PSMC), to reconstruct the demographic history of the split between two avian speciation model species, the pied flycatcher and collared flycatcher. Using whole-genome re-sequencing data from 20 individuals, we investigated 15 demographic models including different levels and patterns of gene flow, and changes in effective population size over time. ABC provided high support for recent (mode 0.3 my, range <0.7 my) species divergence, declines in effective population size of both species since their initial divergence, and unidirectional recent gene flow from pied flycatcher into collared flycatcher. The estimated divergence time and population size changes, supported by PSMC results, suggest that the ancestral species persisted through one of the glacial periods of middle Pleistocene and then split into two large populations that first increased in size before going through severe bottlenecks and expanding into their current ranges. Secondary contact appears to have been established after the last glacial maximum. The severity of the bottlenecks at the last glacial maximum is indicated by the discrepancy between current effective population sizes (20,000-80,000) and census sizes (5-50 million birds) of the two species. The recent divergence time challenges the supposition that avian speciation is a relatively slow process with extended times for intrinsic postzygotic reproductive barriers to evolve. 
Our study emphasizes the importance of using genome-wide data to unravel tangled demographic histories. Moreover, it constitutes one of the first examples of the inference of divergence history from genome-wide data in non-model species.

  19. Analysis of a native whitefly transcriptome and its sequence divergence with two invasive whitefly species.

    PubMed

    Wang, Xiao-Wei; Zhao, Qiong-Yi; Luan, Jun-Bo; Wang, Yu-Jun; Yan, Gen-Hong; Liu, Shu-Sheng

    2012-10-04

    Genomic divergence between invasive and native species may provide insight into the molecular basis underlying specific characteristics that drive the invasion and displacement of closely related species. In this study, we sequenced the transcriptome of an indigenous species, Asia II 3, of the Bemisia tabaci complex and compared its genetic divergence with the transcriptomes of two invasive whitefly species, Middle East Asia Minor 1 (MEAM1) and Mediterranean (MED), respectively. More than 16 million reads of 74 base pairs in length were obtained for the Asia II 3 species using the Illumina sequencing platform. These reads were assembled into 52,535 distinct sequences (mean size: 466 bp) and 16,596 sequences were annotated with an E-value above 10^-5. Protein family comparisons revealed obvious diversification among the transcriptomes of these species suggesting species-specific adaptations during whitefly evolution. On the contrary, substantial conservation of the whitefly transcriptomes was also evident, despite their differences. The overall divergence of coding sequences between the orthologous gene pairs of Asia II 3 and MEAM1 is 1.73%, which is comparable to the average divergence of Asia II 3 and MED transcriptomes (1.84%) and much higher than that of MEAM1 and MED (0.83%). This is consistent with the previous phylogenetic analyses and crossing experiments suggesting these are distinct species. We also identified hundreds of highly diverged genes and compiled sequence identity data into gene functional groups and found the most divergent gene classes are Cytochrome P450, Glutathione metabolism and Oxidative phosphorylation. These results strongly suggest that the divergence of genes related to metabolism might be the driving force of the MEAM1 and Asia II 3 differentiation.
We also analyzed single nucleotide polymorphisms within the orthologous gene pairs of indigenous and invasive whiteflies, which are helpful for investigating associations between alleles and phenotypes. Our data present the most comprehensive sequences for the indigenous whitefly species Asia II 3. The extensive comparisons of Asia II 3, MEAM1 and MED transcriptomes will serve as an invaluable resource for revealing the genetic basis of whitefly invasion and the molecular mechanisms underlying their biological differences.
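
    As a rough illustration of how a percentage divergence between orthologous coding sequences (such as the 1.73% figure reported in this record) could be computed, the sketch below calculates an uncorrected p-distance for one pair of pre-aligned sequences. The function name and the toy sequences are hypothetical and are not taken from the study, which may have used a different distance measure or correction.

```python
def p_distance(seq_a, seq_b):
    """Proportion of differing sites between two aligned sequences.

    Columns containing a gap ('-') or an ambiguous base ('N') in either
    sequence are skipped. This is an uncorrected distance: it does not
    account for multiple substitutions at the same site.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    mismatches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in "-N" or b in "-N":
            continue  # skip columns that cannot be compared
        compared += 1
        if a != b:
            mismatches += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return mismatches / compared


# Hypothetical toy alignment: one difference over ten comparable sites.
divergence = p_distance("ACGTACGTAC", "ACGTACGTAT")
print(f"{divergence:.2%}")
```

    Averaging such per-pair values over all orthologous gene pairs would give one overall figure per species pair; whether the study weighted pairs by length is not stated in the abstract.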

  20. Analysis of a native whitefly transcriptome and its sequence divergence with two invasive whitefly species

    PubMed Central

    2012-01-01

    Background Genomic divergence between invasive and native species may provide insight into the molecular basis underlying specific characteristics that drive the invasion and displacement of closely related species. In this study, we sequenced the transcriptome of an indigenous species, Asia II 3, of the Bemisia tabaci complex and compared its genetic divergence with the transcriptomes of two invasive whitefly species, Middle East Asia Minor 1 (MEAM1) and Mediterranean (MED), respectively. Results More than 16 million reads of 74 base pairs in length were obtained for the Asia II 3 species using the Illumina sequencing platform. These reads were assembled into 52,535 distinct sequences (mean size: 466 bp) and 16,596 sequences were annotated with an E-value above 10^-5. Protein family comparisons revealed obvious diversification among the transcriptomes of these species suggesting species-specific adaptations during whitefly evolution. On the contrary, substantial conservation of the whitefly transcriptomes was also evident, despite their differences. The overall divergence of coding sequences between the orthologous gene pairs of Asia II 3 and MEAM1 is 1.73%, which is comparable to the average divergence of Asia II 3 and MED transcriptomes (1.84%) and much higher than that of MEAM1 and MED (0.83%). This is consistent with the previous phylogenetic analyses and crossing experiments suggesting these are distinct species. We also identified hundreds of highly diverged genes and compiled sequence identity data into gene functional groups and found the most divergent gene classes are Cytochrome P450, Glutathione metabolism and Oxidative phosphorylation. These results strongly suggest that the divergence of genes related to metabolism might be the driving force of the MEAM1 and Asia II 3 differentiation.
We also analyzed single nucleotide polymorphisms within the orthologous gene pairs of indigenous and invasive whiteflies, which are helpful for investigating associations between alleles and phenotypes. Conclusions Our data present the most comprehensive sequences for the indigenous whitefly species Asia II 3. The extensive comparisons of Asia II 3, MEAM1 and MED transcriptomes will serve as an invaluable resource for revealing the genetic basis of whitefly invasion and the molecular mechanisms underlying their biological differences. PMID:23036081

  1. When did anoles diverge? An analysis of multiple dating strategies.

    PubMed

    Román-Palacios, Cristian; Tavera, Jose; Del Rosario Castañeda, María

    2018-06-12

    Whereas most of the studies that discuss the evolutionary divergence of Anolis lizards have dated the clade's crown group to between 31 and 64 Ma, a single study has recovered a significantly older age for the same node (87 Ma). These differences also entail notable consequences for the preferred biogeographical hypothesis for the whole clade. Here we analyze a total of seven dating strategies by combining three calibration sources in independent BEAST runs to infer the most probable divergence timing for anole lizards (a mitochondrial rate for the ND2 gene, the Anolis dominicanus fossil, and a group of fossils assigned to the Priscagamines, Iguanines, and Idontosaurus clades). Based on the estimated timing, we also addressed whether chronograms differ the most in deeper or shallower nodes by exploring the trend in the standard deviation of mean ages between chronograms across time. Next, we focus on the pattern for a single shallow node by hypothesizing the biogeography of the island-endemic Malpelo anole (Anolis agassizi), and evaluating the temporal congruence between the species' divergence and the island geology. The estimated set of ages suggests that anoles most likely diverged 72 Ma (71-73 Ma), with the crown group established around 58 Ma (51-65 Ma). Dispersal is therefore supported as the major driver in the biogeography of the group (and in Caribbean lineages in particular). Our analyses also indicated that (1) rate-based analyses pulled dates toward younger ages, (2) the differences in node ages between chronograms decrease towards the tips regardless of the position of the constrained node, and that (3) the estimated age for deep nodes (e.g. Anolis stem) is highly influenced when deep nodes are also constrained. The latter two results imply that the estimated age for shallower nodes is largely unaffected by the used temporal constraint. The congruence of all chronograms for the Malpelo anole also supports this finding.
Anolis agassizi was found to have diverged before the emergence of Malpelo island in each analysis (anole: 19-31 Ma vs. Malpelo island: 16-17 Ma). When performing absolute dating analyses, we recommend first testing for sequence saturation in the analyzed dataset (especially when calibrations are based on molecular rates). Our study also points out the importance of using multiple node constraints, especially when placed deeply in the tree, for fossil-based divergence dating analyses.

  2. A genomic timescale of prokaryote evolution: insights into the origin of methanogenesis, phototrophy, and the colonization of land

    PubMed Central

    Battistuzzi, Fabia U; Feijao, Andreia; Hedges, S Blair

    2004-01-01

    Background The timescale of prokaryote evolution has been difficult to reconstruct because of a limited fossil record and complexities associated with molecular clocks and deep divergences. However, the relatively large number of genome sequences currently available has provided a better opportunity to control for potential biases such as horizontal gene transfer and rate differences among lineages. We assembled a data set of sequences from 32 proteins (~7600 amino acids) common to 72 species and estimated phylogenetic relationships and divergence times with a local clock method. Results Our phylogenetic results support most of the currently recognized higher-level groupings of prokaryotes. Of particular interest is a well-supported group of three major lineages of eubacteria (Actinobacteria, Deinococcus, and Cyanobacteria) that we call Terrabacteria and associate with an early colonization of land. Divergence time estimates for the major groups of eubacteria are between 2.5–3.2 billion years ago (Ga) while those for archaebacteria are mostly between 3.1–4.1 Ga. The time estimates suggest a Hadean origin of life (prior to 4.1 Ga), an early origin of methanogenesis (3.8–4.1 Ga), an origin of anaerobic methanotrophy after 3.1 Ga, an origin of phototrophy prior to 3.2 Ga, an early colonization of land 2.8–3.1 Ga, and an origin of aerobic methanotrophy 2.5–2.8 Ga. Conclusions Our early time estimates for methanogenesis support the consideration of methane, in addition to carbon dioxide, as a greenhouse gas responsible for the early warming of the Earth's surface. Our divergence times for the origin of anaerobic methanotrophy are compatible with highly depleted carbon isotopic values found in rocks dated 2.8–2.6 Ga.
An early origin of phototrophy is consistent with the earliest bacterial mats and structures identified as stromatolites, but a 2.6 Ga origin of cyanobacteria suggests that those Archean structures, if biologically produced, were made by anoxygenic photosynthesizers. The resistance to desiccation of Terrabacteria and their elaboration of photoprotective compounds suggests that the common ancestor of this group inhabited land. If true, then oxygenic photosynthesis may owe its origin to terrestrial adaptations. PMID:15535883

  3. Complete mitochondrial genome phylogeographic analysis of killer whales (Orcinus orca) indicates multiple species.

    PubMed

    Morin, Phillip A; Archer, Frederick I; Foote, Andrew D; Vilstrup, Julia; Allen, Eric E; Wade, Paul; Durban, John; Parsons, Kim; Pitman, Robert; Li, Lewyn; Bouffard, Pascal; Abel Nielsen, Sandra C; Rasmussen, Morten; Willerslev, Eske; Gilbert, M Thomas P; Harkins, Timothy

    2010-07-01

    Killer whales (Orcinus orca) currently comprise a single, cosmopolitan species with a diverse diet. However, studies over the last 30 yr have revealed populations of sympatric "ecotypes" with discrete prey preferences, morphology, and behaviors. Although these ecotypes avoid social interactions and are not known to interbreed, genetic studies to date have found extremely low levels of diversity in the mitochondrial control region, and few clear phylogeographic patterns worldwide. This low level of diversity is likely due to low mitochondrial mutation rates that are common to cetaceans. Using killer whales as a case study, we have developed a method to readily sequence, assemble, and analyze complete mitochondrial genomes from large numbers of samples to more accurately assess phylogeography and estimate divergence times. This represents an important tool for wildlife management, not only for killer whales but for many marine taxa. We used high-throughput sequencing to survey whole mitochondrial genome variation of 139 samples from the North Pacific, North Atlantic, and southern oceans. Phylogenetic analysis indicated that each of the known ecotypes represents a strongly supported clade with divergence times ranging from approximately 150,000 to 700,000 yr ago. We recommend that three named ecotypes be elevated to full species, and that the remaining types be recognized as subspecies pending additional data. Establishing appropriate taxonomic designations will greatly aid in understanding the ecological impacts and conservation needs of these important marine predators. We predict that phylogeographic mitogenomics will become an important tool for improved statistical phylogeography and more precise estimates of divergence times.

  4. Complete mitochondrial genome phylogeographic analysis of killer whales (Orcinus orca) indicates multiple species

    PubMed Central

    Morin, Phillip A.; Archer, Frederick I.; Foote, Andrew D.; Vilstrup, Julia; Allen, Eric E.; Wade, Paul; Durban, John; Parsons, Kim; Pitman, Robert; Li, Lewyn; Bouffard, Pascal; Abel Nielsen, Sandra C.; Rasmussen, Morten; Willerslev, Eske; Gilbert, M. Thomas P.; Harkins, Timothy

    2010-01-01

    Killer whales (Orcinus orca) currently comprise a single, cosmopolitan species with a diverse diet. However, studies over the last 30 yr have revealed populations of sympatric “ecotypes” with discrete prey preferences, morphology, and behaviors. Although these ecotypes avoid social interactions and are not known to interbreed, genetic studies to date have found extremely low levels of diversity in the mitochondrial control region, and few clear phylogeographic patterns worldwide. This low level of diversity is likely due to low mitochondrial mutation rates that are common to cetaceans. Using killer whales as a case study, we have developed a method to readily sequence, assemble, and analyze complete mitochondrial genomes from large numbers of samples to more accurately assess phylogeography and estimate divergence times. This represents an important tool for wildlife management, not only for killer whales but for many marine taxa. We used high-throughput sequencing to survey whole mitochondrial genome variation of 139 samples from the North Pacific, North Atlantic, and southern oceans. Phylogenetic analysis indicated that each of the known ecotypes represents a strongly supported clade with divergence times ranging from ∼150,000 to 700,000 yr ago. We recommend that three named ecotypes be elevated to full species, and that the remaining types be recognized as subspecies pending additional data. Establishing appropriate taxonomic designations will greatly aid in understanding the ecological impacts and conservation needs of these important marine predators. We predict that phylogeographic mitogenomics will become an important tool for improved statistical phylogeography and more precise estimates of divergence times. PMID:20413674

  5. Genetic divergence and phylogeographic history of two closely related species (Leucomeris decora and Nouelia insignis) across the 'Tanaka Line' in Southwest China.

    PubMed

    Zhao, Yu-Juan; Gong, Xun

    2015-07-08

    Leucomeris decora and Nouelia insignis (Asteraceae) are narrowly and allopatrically distributed species, separated by the important biogeographic boundary Tanaka Line in Southwest China. Previous morphological, cytogenetic and molecular studies suggested that L. decora is sister to N. insignis. However, it is less clear how the two species diverged, whether in full isolation or with gene flow across the Tanaka Line. Here, we performed a molecular study at the population level to characterize genetic differentiation and decipher phylogeographic history in two closely related species based on variation examined in plastid and nuclear DNAs using a coalescent-based approach. These morphologically distinct species share plastid DNA (cpDNA) haplotypes. In contrast, Bayesian analysis of nuclear DNA (nDNA) uncovered two distinct clusters corresponding to L. decora and N. insignis. The IMa analysis detected no strong indication of migration in either the cpDNA or the nDNA sequences. The molecular data pointed to a major west-east split in nuclear DNA between the two species corresponding with the Tanaka Line. The coalescent time estimate for all cpDNA haplotypes dated to the Mid-Late Pleistocene. The estimated demographic parameters showed that the population size of L. decora was similar to that of N. insignis and both experienced limited demographic fluctuations recently. The study revealed comprehensive species divergence and phylogeographic histories of N. insignis and L. decora divided by the Tanaka Line. The phylogeographic pattern inferred from cpDNA reflected ancestrally shared polymorphisms without post-divergence gene flow between species. The marked genealogical lineage divergence in nDNA provided some indication of the Tanaka Line's role as a barrier to plant dispersal, and lent support to its importance in promoting strong population structure and allopatric divergence.

  6. Molecular phylogeny and biogeography of West Indian frogs of the genus Leptodactylus (Anura, Leptodactylidae).

    PubMed

    Hedges, S Blair; Heinicke, Matthew P

    2007-07-01

    Three endemic species of the aquatic-breeding frog genus Leptodactylus are recognized from the West Indies: Leptodactylus albilabris (Puerto Rico and the Virgin Islands), Leptodactylus dominicensis (Hispaniola), and Leptodactylus fallax (Lesser Antilles). DNA sequences were obtained from several mitochondrial genes to resolve taxonomic questions involving these species and to provide insights into their origin and distribution in the islands. We found low levels of sequence divergence between L. dominicensis and L. albilabris, supporting morphological evidence that the former species is a junior synonym of the latter species. Phylogenetic analysis supported previous species-group allocations, finding that L. albilabris is a member of the fuscus group and L. fallax is a member of the pentadactylus group. Molecular time estimates for the divergence of L. albilabris from its closest relative in South America (24-58 million years ago, Ma) and for L. fallax from its closest relative in South America (23-34 Ma) indicate that they colonized the West Indies independently by over-water dispersal in the mid-Cenozoic. The absence of detectable sequence divergence between the two extant populations of L. fallax (Dominica and Montserrat), a species used for human food and now critically endangered, suggests that one or both arose by human introduction from an island or islands where that species originated. The relatively minor genetic differentiation of populations of L. albilabris can be explained by vicariance and dispersal in the Pleistocene and Holocene, although human introduction of some populations cannot be ruled out.

  7. Stochastic precision analysis of 2D cardiac strain estimation in vivo

    NASA Astrophysics Data System (ADS)

    Bunting, E. A.; Provost, J.; Konofagou, E. E.

    2014-11-01

    Ultrasonic strain imaging has been applied to echocardiography and carries great potential to be used as a tool in the clinical setting. Two-dimensional (2D) strain estimation may be useful when studying the heart due to the complex, 3D deformation of the cardiac tissue. Increasing the framerate used for motion estimation, i.e. motion estimation rate (MER), has been shown to improve the precision of the strain estimation, although maintaining the spatial resolution necessary to view the entire heart structure in a single heartbeat remains challenging at high MERs. Two previously developed methods, the temporally unequispaced acquisition sequence (TUAS) and the diverging beam sequence (DBS), have been used in the past to successfully estimate in vivo axial strain at high MERs without compromising spatial resolution. In this study, a stochastic assessment of 2D strain estimation precision is performed in vivo for both sequences at varying MERs (65, 272, 544, 815 Hz for TUAS; 250, 500, 1000, 2000 Hz for DBS). 2D incremental strains were estimated during left ventricular contraction in five healthy volunteers using a normalized cross-correlation function and a least-squares strain estimator. Both sequences were shown capable of estimating 2D incremental strains in vivo. The conditional expected value of the elastographic signal-to-noise ratio (E(SNRe|ɛ)) was used to compare strain estimation precision of both sequences at multiple MERs over a wide range of clinical strain values. The results here indicate that axial strain estimation precision is much more dependent on MER than lateral strain estimation, while lateral estimation is more affected by strain magnitude. MER should be increased at least above 544 Hz to avoid suboptimal axial strain estimation. Radial and circumferential strain estimations were influenced by the axial and lateral strain in different ways. Furthermore, the TUAS and DBS were found to be of comparable precision at similar MERs.
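
    To make the normalized cross-correlation step mentioned in this record more concrete, the sketch below estimates the displacement of a one-dimensional signal window by searching for the shift with the highest correlation score. It is a simplified, hypothetical illustration: the function names, the integer-sample search and the toy signals are assumptions, and the study's 2D estimation and least-squares strain estimator are not reproduced here.

```python
import math


def ncc(x, y):
    """Normalized cross-correlation of two equal-length windows (range -1 to 1)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    num = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    den = math.sqrt(
        sum((a - mean_x) ** 2 for a in x) * sum((b - mean_y) ** 2 for b in y)
    )
    return num / den if den else 0.0  # flat windows carry no information


def best_shift(reference, target, window_start, window_len, max_shift):
    """Return (shift, score) maximizing NCC between a reference window and the target."""
    ref = reference[window_start:window_start + window_len]
    best = (None, -2.0)  # any real NCC score is greater than -2
    for shift in range(-max_shift, max_shift + 1):
        start = window_start + shift
        if start < 0 or start + window_len > len(target):
            continue  # shifted window would fall outside the target signal
        score = ncc(ref, target[start:start + window_len])
        if score > best[1]:
            best = (shift, score)
    return best


# Hypothetical toy signals: the target is the reference moved by two samples.
reference = [0, 0, 1, 3, 1, 0, 0, 0, 0, 0]
target = [0, 0, 0, 0, 1, 3, 1, 0, 0, 0]
shift, score = best_shift(reference, target, window_start=1, window_len=5, max_shift=3)
print(shift, round(score, 3))
```

    Strain would then follow from the spatial gradient of such displacement estimates across neighboring windows, which is the role a least-squares strain estimator plays.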

  8. Evolutionary Distance of Amino Acid Sequence Orthologs across Macaque Subspecies: Identifying Candidate Genes for SIV Resistance in Chinese Rhesus Macaques

    PubMed Central

    Ross, Cody T.; Roodgar, Morteza; Smith, David Glenn

    2015-01-01

    We use the Reciprocal Smallest Distance (RSD) algorithm to identify amino acid sequence orthologs in the Chinese and Indian rhesus macaque draft sequences and estimate the evolutionary distance between such orthologs. We then use GOanna to map gene function annotations and human gene identifiers to the rhesus macaque amino acid sequences. We conclude methodologically by cross-tabulating a list of amino acid orthologs with large divergence scores with a list of genes known to be involved in SIV or HIV pathogenesis. We find that many of the amino acid sequences with large evolutionary divergence scores, as calculated by the RSD algorithm, have been shown to be related to HIV pathogenesis in previous laboratory studies. Four of the strongest candidate genes for SIVmac resistance in Chinese rhesus macaques identified in this study are CDK9, CXCL12, TRIM21, and TRIM32. Additionally, ANKRD30A, CTSZ, GORASP2, GTF2H1, IL13RA1, MUC16, NMDAR1, Notch1, NT5M, PDCD5, RAD50, and TM9SF2 were identified as possible candidates, among others. We failed to find many laboratory experiments contrasting the effects of Indian and Chinese orthologs at these sites on SIVmac pathogenesis, but future comparative studies might hold fertile ground for research into the biological mechanisms underlying innate resistance to SIVmac in Chinese rhesus macaques. PMID:25884674
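
    The core idea behind a reciprocal-smallest-distance pairing can be sketched as follows: a gene in one genome and a gene in the other are treated as putative orthologs only when each is the other's closest match. This is a minimal sketch over a precomputed distance table; the function name, the data layout and the toy gene labels are hypothetical, and the alignment search and evolutionary distance estimation that produce the distances in the actual RSD algorithm are not shown.

```python
def reciprocal_smallest_distance(dist):
    """Pair genes that are mutually closest across two genomes.

    dist maps (gene_in_A, gene_in_B) -> evolutionary distance.
    Returns a sorted list of (gene_in_A, gene_in_B, distance) tuples.
    Ties are resolved in favor of the first pair encountered.
    """
    best_for_a = {}  # gene in A -> (closest gene in B, distance)
    best_for_b = {}  # gene in B -> (closest gene in A, distance)
    for (a, b), d in dist.items():
        if a not in best_for_a or d < best_for_a[a][1]:
            best_for_a[a] = (b, d)
        if b not in best_for_b or d < best_for_b[b][1]:
            best_for_b[b] = (a, d)
    pairs = []
    for a, (b, d) in best_for_a.items():
        if best_for_b[b][0] == a:  # keep only reciprocal best matches
            pairs.append((a, b, d))
    return sorted(pairs)


# Hypothetical toy distances between two genes in each of two genomes.
toy = {
    ("a1", "b1"): 0.1,
    ("a1", "b2"): 0.5,
    ("a2", "b1"): 0.2,
    ("a2", "b2"): 0.9,
}
print(reciprocal_smallest_distance(toy))
```

    Sorting the resulting pairs by distance in descending order would surface the most diverged orthologs, which is the kind of list the record describes cross-tabulating against known pathogenesis genes.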

  9. De novo identification of highly diverged protein repeats by probabilistic consistency.

    PubMed

    Biegert, A; Söding, J

    2008-03-15

    An estimated 25% of all eukaryotic proteins contain repeats, which underlines the importance of duplication for evolving new protein functions. Internal repeats often correspond to structural or functional units in proteins. Methods capable of identifying diverged repeated segments or domains at the sequence level can therefore assist in predicting domain structures, inferring hypotheses about function and mechanism, and investigating the evolution of proteins from smaller fragments. We present HHrepID, a method for the de novo identification of repeats in protein sequences. It is able to detect the sequence signature of structural repeats in many proteins that have not yet been known to possess internal sequence symmetry, such as outer membrane beta-barrels. HHrepID uses HMM-HMM comparison to exploit evolutionary information in the form of multiple sequence alignments of homologs. In contrast to a previous method, the new method (1) generates a multiple alignment of repeats; (2) utilizes the transitive nature of homology through a novel merging procedure with fully probabilistic treatment of alignments; (3) improves alignment quality through an algorithm that maximizes the expected accuracy; (4) is able to identify different kinds of repeats within complex architectures by a probabilistic domain boundary detection method and (5) improves sensitivity through a new approach to assess statistical significance. Server: http://toolkit.tuebingen.mpg.de/hhrepid; Executables: ftp://ftp.tuebingen.mpg.de/pub/protevo/HHrepID

  10. The taxonomic placement of three fossil Fundulus species and the timing of divergence within the North American topminnows (Teleostei: Fundulidae).

    PubMed

    Ghedotti, Michael J; Davis, Matthew P

    2017-04-10

    The fossil species †Fundulus detillae, †F. lariversi, and †F. nevadensis from localities in the western United States are represented by well-preserved material with date estimations. We combined morphological data for these fossil taxa with morphological and DNA-sequence data to conduct a phylogenetic analysis and a tip-based divergence-time estimation for the family Fundulidae. The resultant phylogeny is largely concordant with the prior total-evidence phylogeny. The fossil species do not form a monophyletic group, and do not represent a discrete western radiation of Fundulus as previously proposed. The genus Fundulus diverged into subgeneric clades likely in the Eocene or Oligocene (mean age 34.6 mya, 53-23 mya), and all subgeneric and most species-group clades had evolved by the middle Miocene. †Fundulus lariversi is a member of subgenus Fundulus in which all extant species are found only in eastern North America, demonstrating that fundulids had a complicated biogeographic history. We confirmed †Fundulus detillae as a member of the subgenus Plancterus. †F. nevadensis is not classified in a subgenus but likely is related to the subgenera Plancterus and Wileyichthys.

  11. Sequence divergence in the 3'-untranslated region has an effect on the subfunctionalization of duplicate genes.

    PubMed

    Tong, Ying; Zheng, Kang; Zhao, Shufang; Xiao, Guanxiu; Luo, Chen

    2012-11-01

    Recent studies demonstrated that sequence divergence in both the transcriptional regulatory region and the coding region contributes to the subfunctionalization of duplicate genes. However, whether sequence divergence in the 3'-untranslated region (3'-UTR) has an impact on the subfunctionalization of duplicate genes remains unclear. Here, we identified two diverging duplicate vsx1 (visual system homeobox-1) loci in goldfish, named vsx1A1 and vsx1A2. Phylogenetic analysis suggests that vsx1A1 and vsx1A2 may have arisen from a duplication of vsx1 after the separation of goldfish and zebrafish. Sequence comparison revealed that divergence in both transcriptional and translational regulatory regions is higher than divergence in the introns. vsx1A2 is expressed during the blastula and gastrula stages and in the adult retina but is silenced from the segmentation stage to the hatching stage, whereas vsx1A1 is expressed from segmentation onward. Given that zebrafish vsx1 is expressed in all developmental stages and in the adult retina, it appears that goldfish vsx1A1 and vsx1A2 are coming to share the functions of ancestral vsx1 between them. The different but overlapping temporal expression patterns of vsx1A1 and vsx1A2 suggest that sequence divergence in the promoter region of duplicate vsx1 is not sufficient for partitioning the functions of ancestral vsx1. By comparing vsx1A1 and vsx1A2 3'-UTR-linked green fluorescent protein gene expression patterns, we demonstrated that the 3'-UTR of vsx1A1 retains, but the 3'-UTR of vsx1A2 has lost, the capability of mediating bipolar cell specific expression during retina development. These results indicate that sequence divergence in the 3'-UTRs has a clear effect on subfunctionalization of the duplicate genes.

  12. Mitochondrial DNA Detects a Complex Evolutionary History with Pleistocene Epoch Divergence for the Neotropical Malaria Vector Anopheles nuneztovari Sensu Lato

    PubMed Central

    Scarpassa, Vera Margarete; Conn, Jan E.

    2011-01-01

    Cryptic species and lineages characterize Anopheles nuneztovari s.l. Gabaldón, an important malaria vector in South America. We investigated the phylogeographic structure across the range of this species with cytochrome oxidase subunit I (COI) mitochondrial DNA sequences to estimate the number of clades and levels of divergence. Bayesian and maximum-likelihood phylogenetic analyses detected four groups distributed in two major monophyletic clades (I and II). Samples from the Amazon Basin were clustered in clade I, as were subclades II-A and II-B, whereas those from Bolivia/Colombia/Venezuela were restricted to one basal subclade (II-C). These data, together with a statistical parsimony network, confirm results of previous studies that An. nuneztovari is a species complex consisting of at least two cryptic taxa, one occurring in Colombia and Venezuela and the other occurring in the Amazon Basin. These data also suggest that additional incipient species may exist in the Amazon Basin. Divergence time and expansion tests suggested that these groups separated and expanded in the Pleistocene Epoch. In addition, the COI sequences clearly separated An. nuneztovari s.l. from the closely related species An. dunhami Causey, and three new records are reported for An. dunhami in Amazonian Brazil. These findings are relevant for vector control programs in areas where both species occur. Our analyses support dynamic geologic and landscape changes in northern South America, and infer particularly active divergence during the Pleistocene Epoch for New World anophelines. PMID:22049039

  13. Historical connectivity, contemporary isolation and local adaptation in a widespread but discontinuously distributed species endemic to Taiwan, Rhododendron oldhamii (Ericaceae)

    PubMed Central

    Hsieh, Y-C; Chung, J-D; Wang, C-N; Chang, C-T; Chen, C-Y; Hwang, S-Y

    2013-01-01

    Elucidation of the evolutionary processes that constrain or facilitate adaptive divergence is a central goal in evolutionary biology, especially in non-model organisms. We tested whether changes in dynamics of gene flow (historical vs contemporary) caused population isolation and examined local adaptation in response to environmental selective forces in fragmented Rhododendron oldhamii populations. Variation in 26 expressed sequence tag-simple sequence repeat loci from 18 populations in Taiwan was investigated by examining patterns of genetic diversity, inbreeding, geographic structure, recent bottlenecks, and historical and contemporary gene flow. Selection associated with environmental variables was also examined. Bayesian clustering analysis revealed four regional population groups of north, central, south and southeast with significant genetic differentiation. Historical bottlenecks beginning 9168–13,092 years ago and ending 1584–3504 years ago were revealed by estimates using approximate Bayesian computation for all four regional samples analyzed. Recent migration within and across geographic regions was limited. However, major dispersal sources were found within geographic regions. Altitudinal clines of allelic frequencies of environmentally associated positively selected outliers were found, indicating adaptive divergence. Our results point to a transition from historical population connectivity toward contemporary population isolation and divergence on a regional scale. Spatial and temporal dispersal differences may have resulted in regional population divergence and local adaptation associated with environmental variables, which may have played roles as selective forces at a regional scale. PMID:23591517

  14. Chromosome rearrangements via template switching between diverged repeated sequences

    PubMed Central

    Anand, Ranjith P.; Tsaponina, Olga; Greenwell, Patricia W.; Lee, Cheng-Sheng; Du, Wei; Petes, Thomas D.

    2014-01-01

    Recent high-resolution genome analyses of cancer and other diseases have revealed the occurrence of microhomology-mediated chromosome rearrangements and copy number changes. Although some of these rearrangements appear to involve nonhomologous end-joining, many must have involved mechanisms requiring new DNA synthesis. Models such as microhomology-mediated break-induced replication (MM-BIR) have been invoked to explain these rearrangements. We examined BIR and template switching between highly diverged sequences in Saccharomyces cerevisiae, induced during repair of a site-specific double-strand break (DSB). Our data show that such template switches are robust mechanisms that give rise to complex rearrangements. Template switches between highly divergent sequences appear to be mechanistically distinct from the initial strand invasions that establish BIR. In particular, such jumps are less constrained by sequence divergence and exhibit a different pattern of microhomology junctions. BIR traversing repeated DNA sequences frequently results in complex translocations analogous to those seen in mammalian cells. These results suggest that template switching among repeated genes is a potent driver of genome instability and evolution. PMID:25367035

  15. Genome sequencing highlights the dynamic early history of dogs.

    PubMed

    Freedman, Adam H; Gronau, Ilan; Schweizer, Rena M; Ortega-Del Vecchyo, Diego; Han, Eunjung; Silva, Pedro M; Galaverni, Marco; Fan, Zhenxin; Marx, Peter; Lorente-Galdos, Belen; Beale, Holly; Ramirez, Oscar; Hormozdiari, Farhad; Alkan, Can; Vilà, Carles; Squire, Kevin; Geffen, Eli; Kusak, Josip; Boyko, Adam R; Parker, Heidi G; Lee, Clarence; Tadigotla, Vasisht; Wilton, Alan; Siepel, Adam; Bustamante, Carlos D; Harkins, Timothy T; Nelson, Stanley F; Ostrander, Elaine A; Marques-Bonet, Tomas; Wayne, Robert K; Novembre, John

    2014-01-01

    To identify genetic changes underlying dog domestication and reconstruct their early evolutionary history, we generated high-quality genome sequences from three gray wolves, one from each of the three putative centers of dog domestication, two basal dog lineages (Basenji and Dingo) and a golden jackal as an outgroup. Analysis of these sequences supports a demographic model in which dogs and wolves diverged through a dynamic process involving population bottlenecks in both lineages and post-divergence gene flow. In dogs, the domestication bottleneck involved at least a 16-fold reduction in population size, a much more severe bottleneck than estimated previously. A sharp bottleneck in wolves occurred soon after their divergence from dogs, implying that the pool of diversity from which dogs arose was substantially larger than represented by modern wolf populations. We narrow the plausible range for the date of initial dog domestication to an interval spanning 11-16 thousand years ago, predating the rise of agriculture. In light of this finding, we expand upon previous work regarding the increase in copy number of the amylase gene (AMY2B) in dogs, which is believed to have aided digestion of starch in agricultural refuse. We find standing variation for amylase copy number variation in wolves and little or no copy number increase in the Dingo and Husky lineages. In conjunction with the estimated timing of dog origins, these results provide additional support to archaeological finds, suggesting the earliest dogs arose alongside hunter-gatherers rather than agriculturists. Regarding the geographic origin of dogs, we find that, surprisingly, none of the extant wolf lineages from putative domestication centers is more closely related to dogs, and, instead, the sampled wolves form a sister monophyletic clade. This result, in combination with dog-wolf admixture during the process of domestication, suggests that a re-evaluation of past hypotheses regarding dog origins is necessary.

  16. Genome Sequencing Highlights the Dynamic Early History of Dogs

    PubMed Central

    Freedman, Adam H.; Gronau, Ilan; Schweizer, Rena M.; Ortega-Del Vecchyo, Diego; Han, Eunjung; Silva, Pedro M.; Galaverni, Marco; Fan, Zhenxin; Marx, Peter; Lorente-Galdos, Belen; Beale, Holly; Ramirez, Oscar; Hormozdiari, Farhad; Alkan, Can; Vilà, Carles; Squire, Kevin; Geffen, Eli; Kusak, Josip; Boyko, Adam R.; Parker, Heidi G.; Lee, Clarence; Tadigotla, Vasisht; Siepel, Adam; Bustamante, Carlos D.; Harkins, Timothy T.; Nelson, Stanley F.; Ostrander, Elaine A.; Marques-Bonet, Tomas; Wayne, Robert K.; Novembre, John

    2014-01-01

    To identify genetic changes underlying dog domestication and reconstruct their early evolutionary history, we generated high-quality genome sequences from three gray wolves, one from each of the three putative centers of dog domestication, two basal dog lineages (Basenji and Dingo) and a golden jackal as an outgroup. Analysis of these sequences supports a demographic model in which dogs and wolves diverged through a dynamic process involving population bottlenecks in both lineages and post-divergence gene flow. In dogs, the domestication bottleneck involved at least a 16-fold reduction in population size, a much more severe bottleneck than estimated previously. A sharp bottleneck in wolves occurred soon after their divergence from dogs, implying that the pool of diversity from which dogs arose was substantially larger than represented by modern wolf populations. We narrow the plausible range for the date of initial dog domestication to an interval spanning 11–16 thousand years ago, predating the rise of agriculture. In light of this finding, we expand upon previous work regarding the increase in copy number of the amylase gene (AMY2B) in dogs, which is believed to have aided digestion of starch in agricultural refuse. We find standing variation for amylase copy number variation in wolves and little or no copy number increase in the Dingo and Husky lineages. In conjunction with the estimated timing of dog origins, these results provide additional support to archaeological finds, suggesting the earliest dogs arose alongside hunter-gatherers rather than agriculturists. Regarding the geographic origin of dogs, we find that, surprisingly, none of the extant wolf lineages from putative domestication centers is more closely related to dogs, and, instead, the sampled wolves form a sister monophyletic clade. This result, in combination with dog-wolf admixture during the process of domestication, suggests that a re-evaluation of past hypotheses regarding dog origins is necessary. PMID:24453982

  17. Genetic divergence of common bean cultivars.

    PubMed

    Veloso, J S; Silva, W; Pinheiro, L R; Dos Santos, J B; Fonseca, N S; Euzebio, M P

    2015-09-22

    The aim of this study was to evaluate genetic divergence in the 'Carioca' (beige with brown stripes) common bean cultivar used by different institutions and in 16 other common bean cultivars used in the Rede Cooperativa de Pesquisa de Feijão (Cooperative Network of Common Bean Research), by using simple sequence repeats associated with agronomic traits that are highly distributed in the common bean genome. We evaluated 22 polymorphic loci using bulks containing DNA from 30 plants. There was genetic divergence among the Carioca cultivar provided by the institutions. Nevertheless, there was lower divergence among them than among the other cultivars. The cultivar used by Instituto Agronômico do Paraná was the most divergent in relation to the Carioca samples. The least divergence was observed among the samples used by Universidade Federal de Lavras and by Embrapa Arroz e Feijão. Of all the cultivars, 'CNFP 10104' and 'BRSMG Realce' showed the greatest dissimilarity. The cultivars were separated in two groups of greatest similarity using the Structure software. Genetic variation among cultivars was greater than the variation within or between the groups formed. This fact, together with the high estimate of heterozygosity observed and the genetic divergence of the samples of the Carioca cultivar in relation to the original provided by Instituto Agronômico de Campinas, indicates a mixture of cultivars. The high divergence among cultivars provides potential for the utilization of this genetic variability in plant breeding.

  18. Patterns of Z chromosome divergence among Heliconius species highlight the importance of historical demography.

    PubMed

    Van Belleghem, Steven M; Baquero, Margarita; Papa, Riccardo; Salazar, Camilo; McMillan, W Owen; Counterman, Brian A; Jiggins, Chris D; Martin, Simon H

    2018-03-22

    Sex chromosomes are disproportionately involved in reproductive isolation and adaptation. In support of such a "large-X" effect, genome scans between recently diverged populations and species pairs often identify distinct patterns of divergence on the sex chromosome compared to autosomes. When measures of divergence between populations are higher on the sex chromosome compared to autosomes, such patterns could be interpreted as evidence for faster divergence on the sex chromosome, that is "faster-X", or for barriers to gene flow on the sex chromosome. However, demographic changes can strongly skew divergence estimates and are not always taken into consideration. We used 224 whole-genome sequences representing 36 populations from two Heliconius butterfly clades (H. erato and H. melpomene) to explore patterns of Z chromosome divergence. We show that increased divergence compared to equilibrium expectations can in many cases be explained by demographic change. Among Heliconius erato populations, for instance, population size increase in the ancestral population can explain increased absolute divergence measures on the Z chromosome compared to the autosomes, as a result of increased ancestral Z chromosome genetic diversity. Nonetheless, we do identify increased divergence on the Z chromosome relative to the autosomes in parapatric or sympatric species comparisons that imply postzygotic reproductive barriers. Using simulations, we show that this is consistent with reduced gene flow on the Z chromosome, perhaps due to greater accumulation of incompatibilities. Our work demonstrates the importance of taking demography into account to interpret patterns of divergence on the Z chromosome, but nonetheless provides evidence to support the Z chromosome as a strong barrier to gene flow in incipient Heliconius butterfly species. © 2018 The Authors. Molecular Ecology Published by John Wiley & Sons Ltd.

  19. AlignMiner: a Web-based tool for detection of divergent regions in multiple sequence alignments of conserved sequences

    PubMed Central

    2010-01-01

    Background Multiple sequence alignments are used to study gene or protein function, phylogenetic relations, genome evolution hypotheses and even gene polymorphisms. Virtually without exception, all available tools focus on conserved segments or residues. Small divergent regions, however, are biologically important for specific quantitative polymerase chain reaction, genotyping, molecular markers and preparation of specific antibodies, and yet have received little attention. As a consequence, they must be selected empirically by the researcher. AlignMiner has been developed to fill this gap in bioinformatic analyses. Results AlignMiner is a Web-based application for detection of conserved and divergent regions in alignments of conserved sequences, focusing particularly on divergence. It accepts alignments (protein or nucleic acid) obtained using any of a variety of algorithms, which does not appear to have a significant impact on the final results. AlignMiner uses different scoring methods for assessing conserved/divergent regions, Entropy being the method that provides the highest number of regions with the greatest length, and Weighted being the most restrictive. Conserved/divergent regions can be generated either with respect to the consensus sequence or to one master sequence. The resulting data are presented in a graphical interface developed in AJAX, which provides remarkable user interaction capabilities. Users do not need to wait until execution is complete and can even inspect their results on a different computer. Data can be downloaded onto a user disk, in standard formats. In silico and experimental proof-of-concept cases have shown that AlignMiner can be successfully used to design specific polymerase chain reaction primers as well as potential epitopes for antibodies. Primer design is assisted by a module that deploys several oligonucleotide parameters for designing primers "on the fly". Conclusions AlignMiner can be used to reliably detect divergent regions via several scoring methods that provide different levels of selectivity. Its predictions have been verified by experimental means. Hence, it is expected that its usage will save researchers' time and ensure an objective selection of the best-possible divergent region when closely related sequences are analysed. AlignMiner is freely available at http://www.scbi.uma.es/alignminer. PMID:20525162
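
    The entropy-based scoring mentioned in this abstract can be illustrated with a minimal sketch. It is not AlignMiner's implementation: the function names `column_entropy` and `divergent_regions`, the threshold, and the minimum run length are all assumptions chosen for illustration. The sketch scores each alignment column by Shannon entropy and reports runs of consecutive high-entropy columns as candidate divergent regions.

```python
import math
from collections import Counter


def column_entropy(column):
    """Shannon entropy (in bits) of one alignment column, ignoring gaps ('-')."""
    residues = [c for c in column.upper() if c != "-"]
    if not residues:
        return 0.0
    total = len(residues)
    counts = Counter(residues)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def divergent_regions(alignment, threshold=0.5, min_length=3):
    """Return half-open (start, end) column ranges whose entropy stays above
    `threshold` for at least `min_length` consecutive columns.

    `alignment` is a list of equal-length aligned sequences (hypothetical input
    format; the threshold and minimum length are illustrative defaults).
    """
    n_cols = len(alignment[0])
    scores = [
        column_entropy("".join(seq[i] for seq in alignment)) for i in range(n_cols)
    ]
    regions = []
    start = None
    # A trailing sentinel of -inf closes any run that reaches the last column.
    for i, score in enumerate(scores + [float("-inf")]):
        if score > threshold and start is None:
            start = i
        elif score <= threshold and start is not None:
            if i - start >= min_length:
                regions.append((start, i))
            start = None
    return regions
```

    For a four-sequence toy alignment that is identical except in columns 4 to 6, `divergent_regions` returns a single range covering those three columns. Lowering `threshold` makes the scan more permissive, which loosely mirrors the abstract's point that different scoring methods give different levels of selectivity.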

  20. Detection of novel divergent arenaviruses in boid snakes with inclusion body disease in The Netherlands.

    PubMed

    Bodewes, R; Kik, M J L; Raj, V Stalin; Schapendonk, C M E; Haagmans, B L; Smits, S L; Osterhaus, A D M E

    2013-06-01

    Arenaviruses are bi-segmented negative-stranded RNA viruses, which were until recently only detected in rodents and humans. Now highly divergent arenaviruses have been identified in boid snakes with inclusion body disease (IBD). Here, we describe the identification of a new species and variants of the highly divergent arenaviruses, which were detected in tissues of captive boid snakes with IBD in The Netherlands by next-generation sequencing. Phylogenetic analysis of the complete sequence of the open reading frames of the four predicted proteins of one of the detected viruses revealed that this virus was most closely related to the recently identified Golden Gate virus, while considerable sequence differences were observed between the highly divergent arenaviruses detected in this study. These findings add to the recent identification of the highly divergent arenaviruses in boid snakes with IBD in the United States and indicate that these viruses also circulate among boid snakes in Europe.

  1. Evolutionary History and Phylodynamics of Influenza A and B Neuraminidase (NA) Genes Inferred from Large-Scale Sequence Analyses

    PubMed Central

    Xu, Jianpeng; Davis, C. Todd; Christman, Mary C.; Rivailler, Pierre; Zhong, Haizhen; Donis, Ruben O.; Lu, Guoqing

    2012-01-01

    Background Influenza neuraminidase (NA) is an important surface glycoprotein and plays a vital role in viral replication and drug development. The NA is found in influenza A and B viruses, with nine subtypes classified in influenza A. The complete knowledge of influenza NA evolutionary history and phylodynamics, although critical for the prevention and control of influenza epidemics and pandemics, remains lacking. Methodology/Principal findings Evolutionary and phylogenetic analyses of influenza NA sequences using Maximum Likelihood and Bayesian MCMC methods demonstrated that the divergence of influenza viruses into types A and B occurred earlier than the divergence of influenza A NA subtypes. Twenty-three lineages were identified within influenza A, two lineages were classified within influenza B, and most lineages were specific to host, subtype or geographical location. Interestingly, evolutionary rates vary not only among lineages but also among branches within lineages. The estimated tMRCAs of influenza lineages suggest that the viruses of different lineages emerge several months or even years before their initial detection. The dN/dS ratios ranged from 0.062 to 0.313 for influenza A lineages, and 0.257 to 0.259 for influenza B lineages. Structural analyses revealed that all positively selected sites are at the surface of the NA protein, with a number of sites found to be important for host antibody and drug binding. Conclusions/Significance The divergence into influenza type A and B from a putative ancestral NA was followed by the divergence of type A into nine NA subtypes, of which 23 lineages subsequently diverged. This study provides a better understanding of influenza NA lineages and their evolutionary dynamics, which may facilitate early detection of newly emerging influenza viruses and thus improve influenza surveillance. PMID:22808012

  2. Multilocus approach to clarify species status and the divergence history of the Bemisia tabaci (Hemiptera: Aleyrodidae) species complex.

    PubMed

    Hsieh, Chia-Hung; Ko, Chiun-Cheng; Chung, Cheng-Han; Wang, Hurng-Yi

    2014-07-01

    The sweet potato whitefly, Bemisia tabaci, is a highly differentiated species complex. Despite consisting of several morphologically indistinguishable entities and frequent invasions on all continents with important associated economic losses, the phylogenetic relationships, species status, and evolutionary history of this species complex are still debated. We sequenced and analyzed one mitochondrial and three single-copy nuclear genes from 9 of the 12 genetic groups of B. tabaci and 5 closely related species. Bayesian species delimitation was applied to investigate the speciation events of B. tabaci. The species statuses of the different genetic groups were strongly supported under different prior settings and phylogenetic scenarios. Divergence histories were estimated by a multispecies coalescence approach implemented in *BEAST. Based on the mitochondrial locus, B. tabaci originated 6.47 million years ago (MYA). Nevertheless, the estimated time was 1.25 MYA based on nuclear loci. According to the method of approximate Bayesian computation, this difference is probably due to different degrees of migration among loci; i.e., although the mitochondrial locus had differentiated, gene flow at nuclear loci was still possible, a scenario similar to the parapatric mode of speciation. This is the first study in whiteflies using multilocus data and incorporating Bayesian coalescence approaches, both of which provide a more biologically realistic framework for delimiting species status and delineating the divergence history of B. tabaci. Our study illustrates that gene flow during species divergence should not be overlooked and has a great impact on divergence time estimation. Copyright © 2014 Elsevier Inc. All rights reserved.

  3. Genotyping by sequencing resolves shallow population structure to inform conservation of Chinook salmon (Oncorhynchus tshawytscha)

    PubMed Central

    Larson, Wesley A; Seeb, Lisa W; Everett, Meredith V; Waples, Ryan K; Templin, William D; Seeb, James E

    2014-01-01

    Recent advances in population genomics have made it possible to detect previously unidentified structure, obtain more accurate estimates of demographic parameters, and explore adaptive divergence, potentially revolutionizing the way genetic data are used to manage wild populations. Here, we identified 10 944 single-nucleotide polymorphisms using restriction-site-associated DNA (RAD) sequencing to explore population structure, demography, and adaptive divergence in five populations of Chinook salmon (Oncorhynchus tshawytscha) from western Alaska. Patterns of population structure were similar to those of past studies, but our ability to assign individuals back to their region of origin was greatly improved (>90% accuracy for all populations). We also calculated effective size with and without removing physically linked loci identified from a linkage map, a novel method for nonmodel organisms. Estimates of effective size were generally above 1000 and were biased downward when physically linked loci were not removed. Outlier tests based on genetic differentiation identified 733 loci and three genomic regions under putative selection. These markers and genomic regions are excellent candidates for future research and can be used to create high-resolution panels for genetic monitoring and population assignment. This work demonstrates the utility of genomic data to inform conservation in highly exploited species with shallow population structure. PMID:24665338

  4. Laughter and the Management of Divergent Positions in Peer Review Interactions

    PubMed Central

    Raclaw, Joshua; Ford, Cecilia E.

    2017-01-01

    In this paper we focus on how participants in peer review interactions use laughter as a resource as they publicly report divergence of evaluative positions, divergence that is typical in the give and take of joint grant evaluation. Using the framework of conversation analysis, we examine the infusion of laughter and multimodal laugh-relevant practices into sequences of talk in meetings of grant reviewers deliberating on the evaluation and scoring of high-level scientific grant applications. We focus on a recurrent sequence in these meetings, what we call the score-reporting sequence, in which the assigned reviewers first announce the preliminary scores they have assigned to the grant. We demonstrate that such sequences are routine sites for the use of laugh practices to navigate the initial moments in which divergence of opinion is made explicit. In the context of meetings convened for the purposes of peer review, laughter thus serves as a valuable resource for managing the socially delicate but institutionally required reporting of divergence and disagreement that is endemic to meetings where these types of evaluative tasks are a focal activity. PMID:29170594

  5. The estimation of genetic divergence

    NASA Technical Reports Server (NTRS)

    Holmquist, R.; Conroy, T.

    1981-01-01

    Consideration is given to the criticism of Nei and Tateno (1978) of the REH (random evolutionary hits) theory of genetic divergence in nucleic acids and proteins, and to their proposed alternative estimator of total fixed mutations designated X2. It is argued that the assumption of nonuniform amino acid or nucleotide substitution will necessarily increase REH estimates relative to those made for a model where each locus has an equal likelihood of fixing mutations, thus the resulting value will not be an overestimation. The relative values of X2 and measures calculated on the basis of the PAM and REH theories for the number of nucleotide substitutions necessary to explain a given number of observed amino acid differences between two homologous proteins are compared, and the smaller values of X2 are attributed to (1) a mathematical model based on the incorrect assumption that an entire structural gene is free to fix mutations and (2) the assumptions of different numbers of variable codons for the X2 and REH calculations. Results of a repeat of the computer simulations of Nei and Tateno are presented which, in contrast to the original results, confirm the REH theory. It is pointed out that while a negative correlation is observed between estimations of the fixation intensity per varion and the number of varions for a given pair of sequences, the correlation between the two fixation intensities and varion numbers of two different pairs of sequences need not be negative. Finally, REH theory is used to resolve a paradox concerning the high rate of covarion turnover and the nature of general function sites as permanent covarions.

  6. SOMKE: kernel density estimation over data streams by sequences of self-organizing maps.

    PubMed

    Cao, Yuan; He, Haibo; Man, Hong

    2012-08-01

    In this paper, we propose a novel method SOMKE, for kernel density estimation (KDE) over data streams based on sequences of self-organizing map (SOM). In many stream data mining applications, the traditional KDE methods are infeasible because of the high computational cost, processing time, and memory requirement. To reduce the time and space complexity, we propose a SOM structure in this paper to obtain well-defined data clusters to estimate the underlying probability distributions of incoming data streams. The main idea of this paper is to build a series of SOMs over the data streams via two operations, that is, creating and merging the SOM sequences. The creation phase produces the SOM sequence entries for windows of the data, which obtains clustering information of the incoming data streams. The size of the SOM sequences can be further reduced by combining the consecutive entries in the sequence based on the measure of Kullback-Leibler divergence. Finally, the probability density functions over arbitrary time periods along the data streams can be estimated using such SOM sequences. We compare SOMKE with two other KDE methods for data streams, the M-kernel approach and the cluster kernel approach, in terms of accuracy and processing time for various stationary data streams. Furthermore, we also investigate the use of SOMKE over nonstationary (evolving) data streams, including a synthetic nonstationary data stream, a real-world financial data stream and a group of network traffic data streams. The simulation results illustrate the effectiveness and efficiency of the proposed approach.
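
    The merging step described in this abstract, combining consecutive sequence entries when their Kullback-Leibler divergence is small, can be sketched as follows. This is not the SOMKE algorithm: it substitutes plain discrete histograms for SOM entries, and the names `kl_divergence`, `symmetric_kl` and `merge_similar`, the symmetrised divergence, the averaging rule, and the threshold are all assumptions made for illustration.

```python
import math


def kl_divergence(p, q, eps=1e-12):
    """D_KL(P || Q) in nats for two discrete distributions over the same bins.

    Bins where p is zero contribute nothing; q is floored at `eps` so that a
    zero in q gives a large finite value instead of a division error.
    """
    if len(p) != len(q):
        raise ValueError("distributions must have the same number of bins")
    return sum(pi * math.log(pi / max(qi, eps)) for pi, qi in zip(p, q) if pi > 0)


def symmetric_kl(p, q):
    """Average of the two directed divergences (an illustrative choice)."""
    return 0.5 * (kl_divergence(p, q) + kl_divergence(q, p))


def merge_similar(entries, threshold):
    """Greedily merge each window's histogram into the previous summary when
    their symmetric KL divergence is below `threshold`; otherwise start a new
    summary. Merging here is a plain bin-wise average (an assumption)."""
    if not entries:
        return []
    merged = [list(entries[0])]
    for hist in entries[1:]:
        if symmetric_kl(merged[-1], hist) < threshold:
            merged[-1] = [(a + b) / 2 for a, b in zip(merged[-1], hist)]
        else:
            merged.append(list(hist))
    return merged
```

    With windows `[0.5, 0.5]`, `[0.5, 0.5]`, `[0.9, 0.1]` and a threshold of 0.05, the first two windows collapse into one summary and the third is kept separate, so the stored sequence shrinks from three entries to two.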

  7. Natural Allelic Variations in Highly Polyploidy Saccharum Complex

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Song, Jian; Yang, Xiping; Resende, Jr., Marcio F. R.

    Sugarcane (Saccharum spp.) is an important sugar and biofuel crop with high polyploid and complex genomes. The Saccharum complex, comprised of Saccharum genus and a few related genera, are important genetic resources for sugarcane breeding. A large amount of natural variation exists within the Saccharum complex. Though understanding their allelic variation has been challenging, it is critical to dissect allelic structure and to identify the alleles controlling important traits in sugarcane. To characterize natural variations in Saccharum complex, a target enrichment sequencing approach was used to assay 12 representative germplasm accessions. In total, 55,946 highly efficient probes were designed based on the sorghum genome and sugarcane unigene set targeting a total of 6 Mb of the sugarcane genome. A pipeline specifically tailored for polyploid sequence variants and genotype calling was established. BWAmem and sorghum genome proved to be an acceptable aligner and reference for sugarcane target enrichment sequence analysis, respectively. Genetic variations including 1,166,066 non-redundant SNPs, 150,421 InDels, 919 gene copy number variations, and 1,257 gene presence/absence variations were detected. SNPs from three different callers (Samtools, Freebayes, and GATK) were compared and the validation rates were nearly 90%. Based on the SNP loci of each accession and their ploidy levels, 999,258 single dosage SNPs were identified and most loci were estimated as largely homozygotes. An average of 34,397 haplotype blocks for each accession was inferred. The highest divergence time among the Saccharum spp. was estimated as 1.2 million years ago (MYA). Saccharum spp. diverged from Erianthus and Sorghum approximately 5 and 6 MYA, respectively. Furthermore, the target enrichment sequencing approach provided an effective way to discover and catalog natural allelic variation in highly polyploid or heterozygous genomes.

  8. Late-Quaternary biogeographic scenarios for the brown bear (Ursus arctos), a wild mammal model species

    NASA Astrophysics Data System (ADS)

    Davison, John; Ho, Simon Y. W.; Bray, Sarah C.; Korsten, Marju; Tammeleht, Egle; Hindrikson, Maris; Østbye, Kjartan; Østbye, Eivind; Lauritzen, Stein-Erik; Austin, Jeremy; Cooper, Alan; Saarma, Urmas

    2011-02-01

    This review provides an up-to-date synthesis of the matrilineal phylogeography of a uniquely well-studied Holarctic mammal, the brown bear. We extend current knowledge by presenting a DNA sequence derived from one of the earliest known fossils of a polar bear (dated to 115 000 years before present), a species that shares a paraphyletic mitochondrial association with brown bears. A molecular clock analysis of 140 mitochondrial DNA sequences, including our new polar bear sequence, provides novel insights into the times of origin for different brown bear clades. We propose a number of regional biogeographic scenarios based on genetic data, divergence time estimates and paleontological records. The case of the brown bear provides an example for researchers working with less well-studied taxa: it shows clearly that phylogeographic models based on patterns of modern genetic variation alone can be substantially improved by including data on historical patterns of genetic diversity in the form of ancient DNA sequences derived from accurately dated samples and by using an approach to divergence-time estimation that suits the data under analysis. Using such approaches it has been possible to (i) establish that the processes shaping modern genetic diversity in brown bears acted recently, within the last three glacial cycles; (ii) distinguish among hypotheses concerning species' responses to climatic oscillations in accordance with the lack of phylogeographic structure that existed in brown bears prior to the last glacial maximum (LGM); (iii) reassess theories linking monophyletic brown bear populations to particular LGM refuge areas; and (iv) identify vicariance events and track analogous patterns of migration by brown bears out of Eurasia to North America and Japan.

  9. Natural Allelic Variations in Highly Polyploidy Saccharum Complex

    DOE PAGES

    Song, Jian; Yang, Xiping; Resende, Jr., Marcio F. R.; ...

    2016-06-08

    Sugarcane (Saccharum spp.) is an important sugar and biofuel crop with high polyploid and complex genomes. The Saccharum complex, comprised of Saccharum genus and a few related genera, are important genetic resources for sugarcane breeding. A large amount of natural variation exists within the Saccharum complex. Though understanding their allelic variation has been challenging, it is critical to dissect allelic structure and to identify the alleles controlling important traits in sugarcane. To characterize natural variations in Saccharum complex, a target enrichment sequencing approach was used to assay 12 representative germplasm accessions. In total, 55,946 highly efficient probes were designed based on the sorghum genome and sugarcane unigene set targeting a total of 6 Mb of the sugarcane genome. A pipeline specifically tailored for polyploid sequence variants and genotype calling was established. BWAmem and sorghum genome proved to be an acceptable aligner and reference for sugarcane target enrichment sequence analysis, respectively. Genetic variations including 1,166,066 non-redundant SNPs, 150,421 InDels, 919 gene copy number variations, and 1,257 gene presence/absence variations were detected. SNPs from three different callers (Samtools, Freebayes, and GATK) were compared and the validation rates were nearly 90%. Based on the SNP loci of each accession and their ploidy levels, 999,258 single dosage SNPs were identified and most loci were estimated as largely homozygotes. An average of 34,397 haplotype blocks for each accession was inferred. The highest divergence time among the Saccharum spp. was estimated as 1.2 million years ago (MYA). Saccharum spp. diverged from Erianthus and Sorghum approximately 5 and 6 MYA, respectively. Furthermore, the target enrichment sequencing approach provided an effective way to discover and catalog natural allelic variation in highly polyploid or heterozygous genomes.

  10. Analysis of the Macaca mulatta transcriptome and the sequence divergence between Macaca and human.

    PubMed

    Magness, Charles L; Fellin, P Campion; Thomas, Matthew J; Korth, Marcus J; Agy, Michael B; Proll, Sean C; Fitzgibbon, Matthew; Scherer, Christina A; Miner, Douglas G; Katze, Michael G; Iadonato, Shawn P

    2005-01-01

    We report the initial sequencing and comparative analysis of the Macaca mulatta transcriptome. Cloned sequences from 11 tissues, nine animals, and three species (M. mulatta, M. fascicularis, and M. nemestrina) were sampled, resulting in the generation of 48,642 sequence reads. These data represent an initial sampling of the putative rhesus orthologs for 6,216 human genes. Mean nucleotide diversity within M. mulatta and sequence divergence among M. fascicularis, M. nemestrina, and M. mulatta are also reported.
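
    The two summary statistics named in this abstract, nucleotide diversity within a species and sequence divergence between species, can be illustrated with a minimal sketch on pre-aligned sequences. This is not the authors' pipeline: the function names are hypothetical, the distance is the uncorrected proportion of differing sites (no multiple-hit correction), and sites with gaps or ambiguous bases are simply skipped.

```python
from itertools import combinations


def p_distance(a, b):
    """Uncorrected proportion of differing sites between two aligned sequences.

    Only sites where both sequences carry A, C, G or T are compared.
    """
    compared = 0
    diffs = 0
    for x, y in zip(a.upper(), b.upper()):
        if x in "ACGT" and y in "ACGT":
            compared += 1
            diffs += x != y
    if compared == 0:
        raise ValueError("no comparable sites")
    return diffs / compared


def nucleotide_diversity(seqs):
    """Mean pairwise p-distance among sequences sampled from one species."""
    pairs = list(combinations(seqs, 2))
    if not pairs:
        raise ValueError("need at least two sequences")
    return sum(p_distance(a, b) for a, b in pairs) / len(pairs)


def mean_divergence(group_a, group_b):
    """Mean p-distance over all between-group sequence pairs."""
    total = sum(p_distance(a, b) for a in group_a for b in group_b)
    return total / (len(group_a) * len(group_b))
```

    For three aligned toy sequences `AAAA`, `AAAT` and `AATT`, the pairwise distances are 0.25, 0.5 and 0.25, so `nucleotide_diversity` returns their mean of one third. For real data, a substitution-model correction would usually be applied once divergence is large enough for multiple hits at the same site to matter.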

  11. Determining divergence times with a protein clock: update and reevaluation

    NASA Technical Reports Server (NTRS)

    Feng, D. F.; Cho, G.; Doolittle, R. F.; Bada, J. L. (Principal Investigator)

    1997-01-01

    A recent study of the divergence times of the major groups of organisms as gauged by amino acid sequence comparison has been expanded and the data have been reanalyzed with a distance measure that corrects for both constraints on amino acid interchange and variation in substitution rate at different sites. Beyond that, the availability of complete genome sequences for several eubacteria and an archaebacterium has had a great impact on the interpretation of certain aspects of the data. Thus, the majority of the archaebacterial sequences are not consistent with currently accepted views of the Tree of Life which cluster the archaebacteria with eukaryotes. Instead, they are either outliers or mixed in with eubacterial orthologs. The simplest resolution of the problem is to postulate that many of these sequences were carried into eukaryotes by early eubacterial endosymbionts about 2 billion years ago, only very shortly after or even coincident with the divergence of eukaryotes and archaebacteria. The strong resemblances of these same enzymes among the major eubacterial groups suggest that the cyanobacteria and Gram-positive and Gram-negative eubacteria also diverged at about this same time, whereas the much greater differences between archaebacterial and eubacterial sequences indicate these two groups may have diverged between 3 and 4 billion years ago.

  12. Sequence-Level Mechanisms of Human Epigenome Evolution

    PubMed Central

    Prendergast, James G.D.; Chambers, Emily V.; Semple, Colin A.M.

    2014-01-01

    DNA methylation and chromatin states play key roles in development and disease. However, the extent of recent evolutionary divergence in the human epigenome and the influential factors that have shaped it are poorly understood. To determine the links between genome sequence and human epigenome evolution, we examined the divergence of DNA methylation and chromatin states following segmental duplication events in the human lineage. Chromatin and DNA methylation states were found to have been generally well conserved following a duplication event, with the evolution of the epigenome largely uncoupled from the total number of genetic changes in the surrounding DNA sequence. However, the epigenome at tissue-specific, distal regulatory regions was observed to be unusually prone to diverge following duplication, with particular sequence differences, altering known sequence motifs, found to be associated with divergence in patterns of DNA methylation and chromatin. Alu elements were found to have played a particularly prominent role in shaping human epigenome evolution, and we show that human-specific AluY insertion events are strongly linked to the evolution of the DNA methylation landscape and gene expression levels, including at key neurological genes in the human brain. Studying paralogous regions within the same sample enables the study of the links between genome and epigenome evolution while controlling for biological and technical variation. We show DNA methylation and chromatin divergence between duplicated regions are linked to the divergence of particular genetic motifs, with Alu elements having played a disproportionate role in the evolution of the epigenome in the human lineage. PMID:24966180

  13. Evidence for a recent origin of penguins

    PubMed Central

    Subramanian, Sankar; Beans-Picón, Gabrielle; Swaminathan, Siva K.; Millar, Craig D.; Lambert, David M.

    2013-01-01

    Penguins are a remarkable group of birds, with the 18 extant species living in diverse climatic zones from the tropics to Antarctica. The timing of the origin of these extant penguins remains controversial. Previous studies based on DNA sequences and fossil records have suggested widely differing times for the origin of the group. This has given rise to widely differing biogeographic narratives about their evolution. To resolve this problem, we sequenced five introns from 11 species representing all genera of living penguins. Using these data and other available DNA sequences, together with the ages of multiple penguin fossils to calibrate the molecular clock, we estimated the age of the most recent common ancestor of extant penguins to be 20.4 Myr (17.0–23.8 Myr). This time is half of the previous estimates based on molecular sequence data. Our results suggest that most of the major groups of extant penguins diverged 11–16 Ma. This overlaps with the sharp decline in Antarctic temperatures that began approximately 12 Ma, suggesting a possible relationship between climate change and penguin evolution. PMID:24227045

  14. A complete Neandertal mitochondrial genome sequence determined by high-throughput sequencing

    PubMed Central

    Green, Richard E.; Malaspinas, Anna-Sapfo; Krause, Johannes; Briggs, Adrian W.; Johnson, Philip L. F.; Uhler, Caroline; Meyer, Matthias; Good, Jeffrey M.; Maricic, Tomislav; Stenzel, Udo; Prüfer, Kay; Siebauer, Michael; Burbano, Hernán A.; Ronan, Michael; Rothberg, Jonathan M.; Egholm, Michael; Rudan, Pavao; Brajković, Dejana; Kućan, Željko; Gušić, Ivan; Wikström, Mårten; Laakkonen, Liisa; Kelso, Janet; Slatkin, Montgomery; Pääbo, Svante

    2008-01-01

    A complete mitochondrial (mt) genome sequence was reconstructed from a 38,000-year-old Neandertal individual using 8,341 mtDNA sequences identified among 4.8 Gb of DNA generated from ~0.3 grams of bone. Analysis of the assembled sequence unequivocally establishes that the Neandertal mtDNA falls outside the variation of extant human mtDNAs and allows an estimate of the divergence date between the two mtDNA lineages of 660,000±140,000 years. Of the 13 proteins encoded in the mtDNA, subunit 2 of cytochrome c oxidase of the mitochondrial electron transport chain has experienced the largest number of amino acid substitutions in human ancestors since the separation from Neandertals. There is evidence that purifying selection in the Neandertal mtDNA was reduced compared to other primate lineages suggesting that the effective population size of Neandertals was small. PMID:18692465

  15. Armillaria phylogeny based on tef-1α sequences suggests ongoing divergent speciation within the boreal floristic kingdom

    Treesearch

    Ned B. Klopfenstein; John W. Hanna; Amy L. Ross-Davis; Jane E. Stewart; Yuko Ota; Rosario Medel-Ortiz; Miguel Armando Lopez-Ramirez; Ruben Damian Elias-Roman; Dionicio Alvarado-Rosales; Mee-Sook Kim

    2013-01-01

    Armillaria plays diverse ecological roles in forests worldwide, which has inspired interest in understanding phylogenetic relationships within and among species of this genus. Previous rDNA sequence-based phylogenetic analyses of Armillaria have shown general relationships among widely divergent taxa, but rDNA sequences were not reliable for separating closely related...

  16. Biogeography of Speciation of Two Sister Species of Neotropical Amazona (Aves, Psittaciformes) Based on Mitochondrial Sequence Data

    PubMed Central

    Rocha, Amanda V.; Rivera, Luis O.; Martinez, Jaime; Prestes, Nêmora P.; Caparroz, Renato

    2014-01-01

    Coalescent theory provides powerful models for population genetic inference and is now increasingly important in estimates of divergence times and speciation research. We use molecular data and methods based on coalescent theory to investigate whether genetic evidence supports the hypothesis of A. pretrei and A. tucumana as separate species and whether genetic data allow us to assess which allopatric model seems to better explain the diversification process in these taxa. We sampled 13 A. tucumana from two provinces in northern Argentina and 28 A. pretrei from nine localities of Rio Grande do Sul, Brazil. A 491 bp segment of the mitochondrial gene cytochrome c oxidase I was evaluated using the haplotype network and phylogenetic methods. The divergence time and other demographic quantities were estimated using the isolation and migration model based on coalescent theory. The network and phylogenetic reconstructions showed similar results, supporting reciprocal monophyly for these two taxa. The divergence time of lineage separation was estimated to be approximately 1.3 million years ago, which corresponds to the lower Pleistocene. Our results enforce the current taxonomic status for these two Amazon species. They also support that A. pretrei and A. tucumana diverged with little or no gene flow approximately 1.3 million years ago, most likely after the establishment of a small population in the Southern Yungas forest by dispersion of a few founders from the A. pretrei ancestral population. This process may have been favored by habitat corridors formed in hot and humid periods of the Quaternary. Considering that these two species are considered threatened, the results were evaluated for their implications for the conservation of these two species. PMID:25251765

  17. Molecular clocks and the early evolution of metazoan nervous systems.

    PubMed

    Wray, Gregory A

    2015-12-19

    The timing of early animal evolution is poorly resolved, yet remains critical for understanding nervous system evolution. Methods for estimating divergence times from sequence data have improved considerably, providing a more refined understanding of key divergences. The best molecular estimates point to the origin of metazoans and bilaterians tens to hundreds of millions of years earlier than their first appearances in the fossil record. Both the molecular and fossil records are compatible, however, with the possibility of tiny, unskeletonized, low energy budget animals during the Proterozoic that had planktonic, benthic, or meiofaunal lifestyles. Such animals would likely have had relatively simple nervous systems equipped primarily to detect food, avoid inhospitable environments and locate mates. The appearance of the first macropredators during the Cambrian would have changed the selective landscape dramatically, likely driving the evolution of complex sense organs, sophisticated sensory processing systems, and diverse effector systems involved in capturing prey and avoiding predation.

  18. A HIGH COVERAGE GENOME SEQUENCE FROM AN ARCHAIC DENISOVAN INDIVIDUAL

    PubMed Central

    Meyer, Matthias; Kircher, Martin; Gansauge, Marie-Theres; Li, Heng; Racimo, Fernando; Mallick, Swapan; Schraiber, Joshua G.; Jay, Flora; Prüfer, Kay; de Filippo, Cesare; Sudmant, Peter H.; Alkan, Can; Fu, Qiaomei; Do, Ron; Rohland, Nadin; Tandon, Arti; Siebauer, Michael; Green, Richard E.; Bryc, Katarzyna; Briggs, Adrian W.; Stenzel, Udo; Dabney, Jesse; Shendure, Jay; Kitzman, Jacob; Hammer, Michael F.; Shunkov, Michael V.; Derevianko, Anatoli P.; Patterson, Nick; Andrés, Aida M.; Eichler, Evan E.; Slatkin, Montgomery; Reich, David; Kelso, Janet; Pääbo, Svante

    2013-01-01

    We present a DNA library preparation method that has allowed us to reconstruct a high coverage (30X) genome sequence of a Denisovan, an extinct relative of Neandertals. The quality of this genome allows a direct estimation of Denisovan heterozygosity indicating that genetic diversity in these archaic hominins was extremely low. It also allows tentative dating of the specimen on the basis of “missing evolution” in its genome, detailed measurements of Denisovan and Neandertal admixture into present-day human populations, and the generation of a near-complete catalog of genetic changes that swept to high frequency in modern humans since their divergence from Denisovans. PMID:22936568

  19. Connecting Amazonian, Cerrado, and Atlantic Forest histories: Paraphyly, old divergences, and modern population dynamics in tyrant-manakins (Neopelma/Tyranneutes, Aves: Pipridae).

    PubMed

    Capurucho, João Marcos Guimarães; Ashley, Mary V; Ribas, Camila C; Bates, John M

    2018-06-11

    Several biogeographic hypotheses have been proposed to explain connections between Amazonian and Atlantic forest biotas. These hypotheses are related to the timing of the connections and their geographic patterns. We performed a phylogeographic investigation of Tyrant-manakins (Aves: Pipridae, Neopelma/Tyranneutes) which include species inhabiting the Amazon and Atlantic forests, as well as gallery forests of the Cerrado. Using DNA sequence data, we determined phylogenetic relationships, temporal and geographic patterns of diversification, and recent intraspecific population genetic patterns, relative to the history of these biomes. We found Neopelma to be a paraphyletic genus, as N. chrysolophum is sister to Neopelma + Tyranneutes, with an estimated divergence of approximately 18 Myrs BP, within the oldest estimated divergence times of other Amazonian and Atlantic forest avian taxa. Subsequent divergences in the group occurred from Mid Miocene to Early Pliocene and involved mainly the Amazonian species, with an expansion into and subsequent speciation in the Cerrado gallery forests by N. pallescens. We found additional structure within N. chrysocephalum and N. sulphureiventer. Analysis of recent population dynamics in N. chrysocephalum, N. sulphureiventer, and N. pallescens revealed recent demographic fluctuations and restrictions to gene flow related to environmental changes since the last glacial cycle. No genetic structure was detected across the Amazon River in N. pallescens. The tyrant-manakins represent an old historical connection between the Amazon and Atlantic Forest.

  20. Concerted evolution at the population level: pupfish HindIII satellite DNA sequences.

    PubMed Central

    Elder, J F; Turner, B J

    1994-01-01

    The canonical monomers (approximately 170 bp) of an abundant (1.9 × 10^6 copies per diploid genome) satellite DNA sequence family in the genome of Cyprinodon variegatus, a "pupfish" that ranges along the Atlantic coast from Cape Cod to central Mexico, are divergent in base sequence in 10 of 12 samples collected from natural populations. The divergence involves substitutions, deletions, and insertions, is marked in scope (mean pairwise sequence similarity = 61.6%; range = 35-95.9%), is largely confined to the 3' half of the monomer, and is not correlated with the distance among collecting sites. Repetitive cloning and direct genomic sequencing experiments failed to detect intrapopulation and intraindividual variation, suggesting high levels of sequence homogeneity within populations. The satellite sequence has therefore undergone "concerted evolution," at the level of the local population. Concerted evolution has previously almost always been discussed in terms of the divergence of species or higher taxa; its intraspecific occurrence apparently has not been reported previously. The generality of the observation is difficult to evaluate, for although satellite DNAs from a large number of organisms have been studied in detail, there appear to be little or no other data on their sequence variation in natural populations. The relationship (if any) between concerted, population level, satellite DNA divergence and the extent of gene flow/genetic isolation among conspecific natural populations remains to be established. PMID:8302879

  1. Probing the phylogenetic relationships of a few newly recorded intertidal zoanthids of Gujarat coast (India) with mtDNA COI sequences.

    PubMed

    Joseph, Sneha; Poriya, Paresh; Kundu, Rahul

    2016-11-01

    The present study reports the phylogenetic relationship of six zoanthid species belonging to three genera, Isaurus, Palythoa, and Zoanthus, identified using systematic computational analysis of mtDNA gene sequences. All six species are first records from the coasts of Kathiawar Peninsula, India. The genus Isaurus is represented by Isaurus tuberculatus, the genus Zoanthus by Zoanthus kuroshio and Zoanthus sansibaricus, and the genus Palythoa by Palythoa tuberculosa, P. sp. JVK-2006 and Palythoa heliodiscus. Among the species observed along the coastline, pairwise sequence similarity ranged from a minimum of 96% to a maximum of 99%, corresponding to an interspecific divergence of 1-4%; intraspecific divergence was negligible. These results not only highlight the efficiency of the COI gene region in species identification but also demonstrate the genetic variability of zoanthids along the Saurashtra coastline of the west coast of India.
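
    Percent divergence and percent similarity between two aligned sequences are complements of each other, which is why the figures in the abstract above need to be read together. A minimal sketch of the uncorrected distance (p-distance) follows, assuming pairwise deletion of gapped or ambiguous sites. The abstract does not state which distance measure the authors used, and the function name and toy sequences are hypothetical.

```python
def p_distance(seq_a: str, seq_b: str) -> float:
    """Proportion of differing sites between two aligned sequences.

    Sites where either sequence has a gap ('-') or an ambiguous base ('N')
    are skipped (pairwise deletion).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    mismatches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in "-N" or b in "-N":
            continue
        compared += 1
        if a != b:
            mismatches += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return mismatches / compared


# Toy aligned fragments (hypothetical, not real COI data).
seq1 = "ATGGCATTAGCCTTAGGAAT"
seq2 = "ATGGCATTGGCCTTAGGAAT"
d = p_distance(seq1, seq2)
print(f"divergence = {d:.1%}, similarity = {1 - d:.1%}")
```

    Published barcoding studies often apply a substitution-model correction on top of this raw proportion, so values computed this way may differ slightly from reported ones.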

  2. Primate phylogenetic relationships and divergence dates inferred from complete mitochondrial genomes.

    PubMed

    Pozzi, Luca; Hodgson, Jason A; Burrell, Andrew S; Sterner, Kirstin N; Raaum, Ryan L; Disotell, Todd R

    2014-06-01

    The origins and the divergence times of the most basal lineages within primates have been difficult to resolve mainly due to the incomplete sampling of early fossil taxa. The main source of contention is related to the discordance between molecular and fossil estimates: while there are no crown primate fossils older than 56 Ma, most molecule-based estimates extend the origins of crown primates into the Cretaceous. Here we present a comprehensive mitogenomic study of primates. We assembled 87 mammalian mitochondrial genomes, including 62 primate species representing all the families of the order. We newly sequenced eleven mitochondrial genomes, including eight Old World monkeys and three strepsirrhines. Phylogenetic analyses support a strong topology, confirming the monophyly for all the major primate clades. In contrast to previous mitogenomic studies, the positions of tarsiers and colugos relative to strepsirrhines and anthropoids are well resolved. In order to improve our understanding of how fossil calibrations affect age estimates within primates, we explore the effect of seventeen fossil calibrations across primates and other mammalian groups and we select a subset of calibrations to date our mitogenomic tree. The divergence date estimates of the Strepsirrhine/Haplorhine split support an origin of crown primates in the Late Cretaceous, at around 74 Ma. This result supports a short-fuse model of primate origins, whereby relatively little time passed between the origin of the order and the diversification of its major clades. It also suggests that the early primate fossil record is likely poorly sampled.

  3. Evolution of a Planktonic Foraminifer during Environmental Changes in the Tropical Oceans.

    PubMed

    Ujiié, Yurika; Ishitani, Yoshiyuki

    2016-01-01

    Ecological adaptation to environmental changes is a strong driver of evolution, enabling speciation of pelagic plankton in the open ocean without the presence of effective physical barriers to gene flow. The tropical ocean environment, which plays an important role in shaping marine biodiversity, has drastically and frequently changed since the Pliocene. Nevertheless, the evolutionary history of tropical pelagic plankton has been poorly understood, as phylogeographic investigations are still in the developing state and paleontological approaches are insufficient to obtain a sequential record from the deep-sea sediments. The planktonic foraminifer Pulleniatina obliquiloculata is widely distributed in the tropical area throughout the world's oceans, and its phylogeography is well established. It is thus one of the best candidates to examine how past environmental changes may have shifted the spatial distribution and affected the diversification of tropical pelagic plankton. Such an examination requires the divergence history of the planktonic foraminifers, yet the gene marker (partial small subunit (SSU) rDNA) previously used for phylogeographic studies was not powerful enough to achieve a high accuracy in estimating the divergence times. The present study focuses on improving the precision of divergence time estimates for the splits between sibling species (genetic types) of planktonic foraminifers by increasing the number of genes as well as the number of nucleotide bases used for molecular clock estimates. We have amplified the entire coding regions of two ribosomal RNA genes (SSU rDNA and large subunit (LSU) rDNA) of three genetic types of P. obliquiloculata and two closely related species for the first time and applied them to the Bayesian relaxed clock method. 
    The comparison of the credible intervals of the four datasets consisting either of sequences of the partial SSU rDNA, the complete SSU rDNA, LSU rDNA, or a combination of both genes (SSU+LSU) clearly demonstrated that the two-gene dataset improved the accuracy of divergence time estimates. The P. obliquiloculata lineage diverged twice, first at the end of the Pliocene (3.1 Ma) and again in the middle Pleistocene (1.4 Ma). Both timings coincided with the environmental changes, which indirectly involved geographic separation of populations. The habitat of P. obliquiloculata was expanded toward the higher latitudinal zones during the stable warm periods and subsequently placed on the steep environmental gradients following the global cooling. Different environmental conditions in the stable warm tropics and unstable higher latitudes may have triggered ecological divergence among the populations, leading to adaptive differentiation and eventually speciation. A comprehensive analysis of divergence time estimates combined with phylogeography enabled us to reveal the evolutionary history of the pelagic plankton and to find the potential paleoenvironmental events, which could have changed their biogeography and ecology.

  4. Whole-genome analyses of DS-1-like human G2P[4] and G8P[4] rotavirus strains from Eastern, Western and Southern Africa

    PubMed Central

    Nyaga, Martin M.; Stucker, Karla M.; Esona, Mathew D.; Jere, Khuzwayo C.; Mwinyi, Bakari; Shonhai, Annie; Tsolenyanu, Enyonam; Mulindwa, Augustine; Chibumbya, Julia N.; Adolfine, Hokororo; Halpin, Rebecca A.; Roy, Sunando; Stockwell, Timothy B.; Berejena, Chipo; Seheri, Mapaseka L.; Mwenda, Jason M.; Steele, A. Duncan; Wentworth, David E.

    2018-01-01

    Group A rotaviruses (RVAs) with distinct G and P genotype combinations have been reported globally. We report the genome composition and possible origin of seven G8P[4] and five G2P[4] human RVA strains based on the genetic evolution of all 11 genome segments at the nucleotide level. Twelve RVA ELISA positive stool samples collected in the representative countries of Eastern, Southern and West Africa during the 2007–2012 surveillance seasons were subjected to sequencing using the Ion Torrent PGM and Illumina MiSeq platforms. A reference-based assembly was performed using CLC Bio’s clc_ref_assemble_long program, and full-genome consensus sequences were obtained. With the exception of the neutralising antigen, VP7, all study strains exhibited the DS-1-like genome constellation (P[4]-I2-R2-C2-M2-A2-N2-T2-E2-H2) and clustered phylogenetically with reference strains having a DS-1-like genetic backbone. Comparison of the nucleotide and amino acid sequences with selected global cognate genome segments revealed nucleotide and amino acid sequence identities of 81.7–100 % and 90.6–100 %, respectively, with NSP4 gene segment showing the most diversity among the strains. Bayesian analyses of all gene sequences to estimate the time of divergence of the lineage indicated that divergence times ranged from 16 to 44 years, except for the NSP4 gene where the lineage seemed to arise in the more distant past at an estimated 203 years ago. However, the long-term effects of changes found within the NSP4 genome segment should be further explored, and thus we recommend continued whole-genome analyses from larger sample sets to determine the evolutionary mechanisms of the DS-1-like strains collected in Africa. PMID:24952422

  5. Molecular phylogeny of osteoglossoids: a new model for Gondwanian origin and plate tectonic transportation of the Asian arowana.

    PubMed

    Kumazawa, Y; Nishida, M

    2000-12-01

    One of the traditional enigmas in freshwater zoogeography has been the evolutionary origin of Scleropages formosus inhabiting Southeast Asia (the Asian arowana), which is a species threatened with extinction among the highly freshwater-adapted fishes from the order Osteoglossiformes. Dispersalists have hypothesized that it originated from the recent (the Miocene or later) transmarine dispersal of morphologically quite similar Australasian arowanas across Wallace's Line, but this hypothesis has been questioned due to their remarkable adaptation to freshwater. We determined the complete nucleotide sequences of two mitochondrial protein genes from 12 osteoglossiform species, including all members of the suborder Osteoglossoidei, with which robust molecular phylogeny was constructed and divergence times were estimated. In agreement with previous morphology-based phylogenetic studies, our molecular phylogeny suggested that the osteoglossiforms diverged from a basal position of the teleostean lineage, that heterotidines (the Nile arowana and the pirarucu) form a sister group of osteoglossines (arowanas in South America, Australasia, and Southeast Asia), and that the Asian arowana is more closely related to Australasian arowanas than to South American ones. However, molecular distances between the Asian and Australasian arowanas were much larger than expected from the fact that they are classified within the same genus. By using the molecular clock of bony fishes, tested for its good performance for rather deep divergences and calibrated using some reasonable assumptions, the divergence between the Asian and Australasian arowanas was estimated to date back to the early Cretaceous. Based on the molecular and geological evidence, we propose a new model whereby the Asian arowana vicariantly diverged from the Australasian arowanas in the eastern margin of Gondwanaland and migrated into Eurasia on the Indian subcontinent or smaller continental blocks. 
    This study also implicates the relatively long absence of osteoglossiform fossil records from the Mesozoic.

  6. Relative information content of polymorphic microsatellites and mitochondrial DNA for inferring dispersal and population genetic structure in the olive sea snake, Aipysurus laevis.

    PubMed

    Lukoschek, V; Waycott, M; Keogh, J S

    2008-07-01

    Polymorphic microsatellites are widely considered more powerful for resolving population structure than mitochondrial DNA (mtDNA) markers, particularly for recently diverged lineages or geographically proximate populations. Weaker population subdivision for biparentally inherited nuclear markers than maternally inherited mtDNA may signal male-biased dispersal but can also be attributed to marker-specific evolutionary characteristics and sampling properties. We discriminated between these competing explanations with a population genetic study on olive sea snakes, Aipysurus laevis. A previous mtDNA study revealed strong regional population structure for A. laevis around northern Australia, where Pleistocene sea-level fluctuations have influenced the genetic signatures of shallow-water marine species. Divergences among phylogroups dated to the Late Pleistocene, suggesting recent range expansions by previously isolated matrilines. Fine-scale population structure within regions was, however, poorly resolved for mtDNA. In order to improve estimates of fine-scale genetic divergence and to compare population structure between nuclear and mtDNA, 354 olive sea snakes (previously sequenced for mtDNA) were genotyped for five microsatellite loci. F statistics and Bayesian multilocus genotype clustering analyses found similar regional population structure as mtDNA and, after standardizing microsatellite F statistics for high heterozygosities, regional divergence estimates were quantitatively congruent between marker classes. Over small spatial scales, however, microsatellites recovered almost no genetic structure and standardized F statistics were orders of magnitude smaller than for mtDNA. Three tests for male-biased dispersal were not significant, suggesting that recent demographic expansions to the typically large population sizes of A. laevis have prevented microsatellites from reaching mutation-drift equilibrium and local populations may still be diverging.

  7. APPROXIMATION AND ESTIMATION OF s-CONCAVE DENSITIES VIA RÉNYI DIVERGENCES.

    PubMed

    Han, Qiyang; Wellner, Jon A

    2016-01-01

    In this paper, we study the approximation and estimation of s-concave densities via Rényi divergence. We first show that the approximation of a probability measure Q by an s-concave density exists and is unique via the procedure of minimizing a divergence functional proposed by [Ann. Statist. 38 (2010) 2998-3027] if and only if Q admits full-dimensional support and a first moment. We also show continuity of the divergence functional in Q: if Qn → Q in the Wasserstein metric, then the projected densities converge in weighted L1 metrics and uniformly on closed subsets of the continuity set of the limit. Moreover, directional derivatives of the projected densities also enjoy local uniform convergence. This contains both on-the-model and off-the-model situations, and entails strong consistency of the divergence estimator of an s-concave density under mild conditions. One interesting and important feature for the Rényi divergence estimator of an s-concave density is that the estimator is intrinsically related with the estimation of log-concave densities via maximum likelihood methods. In fact, we show that for d = 1 at least, the Rényi divergence estimators for s-concave densities converge to the maximum likelihood estimator of a log-concave density as s ↗ 0. The Rényi divergence estimator shares similar characterizations as the MLE for log-concave distributions, which allows us to develop pointwise asymptotic distribution theory assuming that the underlying density is s-concave.

  8. APPROXIMATION AND ESTIMATION OF s-CONCAVE DENSITIES VIA RÉNYI DIVERGENCES

    PubMed Central

    Han, Qiyang; Wellner, Jon A.

    2017-01-01

    In this paper, we study the approximation and estimation of s-concave densities via Rényi divergence. We first show that the approximation of a probability measure Q by an s-concave density exists and is unique via the procedure of minimizing a divergence functional proposed by [Ann. Statist. 38 (2010) 2998–3027] if and only if Q admits full-dimensional support and a first moment. We also show continuity of the divergence functional in Q: if Qn → Q in the Wasserstein metric, then the projected densities converge in weighted L1 metrics and uniformly on closed subsets of the continuity set of the limit. Moreover, directional derivatives of the projected densities also enjoy local uniform convergence. This contains both on-the-model and off-the-model situations, and entails strong consistency of the divergence estimator of an s-concave density under mild conditions. One interesting and important feature for the Rényi divergence estimator of an s-concave density is that the estimator is intrinsically related with the estimation of log-concave densities via maximum likelihood methods. In fact, we show that for d = 1 at least, the Rényi divergence estimators for s-concave densities converge to the maximum likelihood estimator of a log-concave density as s ↗ 0. The Rényi divergence estimator shares similar characterizations as the MLE for log-concave distributions, which allows us to develop pointwise asymptotic distribution theory assuming that the underlying density is s-concave. PMID:28966410

  9. Nucleotide sequences of bovine alpha S1- and kappa-casein cDNAs.

    PubMed Central

    Stewart, A F; Willis, I M; Mackinlay, A G

    1984-01-01

    The nucleotide sequences corresponding to bovine alpha S1- and kappa-casein mRNAs are presented. An unusual alpha S1-casein cDNA has been characterised whose 5' end commences upstream from its putative TATA box. The alpha S1-casein mRNA is compared to rat alpha-casein mRNA and two components of divergence are identified. Firstly, the two sequences have diverged at a high point mutation rate and the rate of amino acid replacement by this mechanism is at least as great as the rate of divergence of any other part of the mRNAs. Secondly, the protein coding sequence has been subjected to several insertion/deletion events, one of which may be an example of exon shuffling. The kappa-casein mRNA sequence verifies the proposition that it has arisen from a different ancestral gene to the other caseins. PMID:6328443

  10. An Evolutionarily Young Polar Bear (Ursus maritimus) Endogenous Retrovirus Identified from Next Generation Sequence Data.

    PubMed

    Tsangaras, Kyriakos; Mayer, Jens; Alquezar-Planas, David E; Greenwood, Alex D

    2015-11-24

    Transcriptome analysis of polar bear (Ursus maritimus) tissues identified sequences with similarity to Porcine Endogenous Retroviruses (PERV). Based on these sequences, four proviral copies and 15 solo long terminal repeats (LTRs) of a newly described endogenous retrovirus were characterized from the polar bear draft genome sequence. Closely related sequences were identified by PCR analysis of brown bear (Ursus arctos) and black bear (Ursus americanus) but were absent in non-Ursinae bear species. The virus was therefore designated UrsusERV. Two distinct groups of LTRs were observed including a recombinant ERV that contained one LTR belonging to each group indicating that genomic invasions by at least two UrsusERV variants have recently occurred. Age estimates based on proviral LTR divergence and conservation of integration sites among ursids suggest the viral group is only a few million years old. The youngest provirus was polar bear specific, had intact open reading frames (ORFs) and could potentially encode functional proteins. Phylogenetic analyses of UrsusERV consensus protein sequences suggest that it is part of a pig, gibbon and koala retrovirus clade. The young age estimates and lineage specificity of the virus suggest UrsusERV is a recent cross species transmission from an unknown reservoir and place the viral group among the youngest of ERVs identified in mammals.
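
    Age estimates from proviral LTR divergence rest on the fact that the two LTRs of a provirus are identical at the moment of integration and accumulate substitutions independently afterwards, so the insertion age is roughly T = K / (2r), where K is the corrected divergence between the 5' and 3' LTRs and r is the substitution rate per site per year. The sketch below assumes a Jukes-Cantor correction and a placeholder rate; the abstract does not state the correction or the rate the authors used, and the function names are hypothetical.

```python
import math


def jukes_cantor(p: float) -> float:
    """Jukes-Cantor corrected distance from an observed proportion p."""
    if not 0.0 <= p < 0.75:
        raise ValueError("p must be in [0, 0.75) for the JC correction")
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


def ltr_age_years(p: float, rate_per_site_per_year: float) -> float:
    """Provirus age from LTR-LTR divergence: T = K / (2 * r)."""
    if rate_per_site_per_year <= 0:
        raise ValueError("substitution rate must be positive")
    return jukes_cantor(p) / (2.0 * rate_per_site_per_year)


# Placeholder inputs for illustration only: 1% observed LTR divergence
# and an assumed neutral rate of 1e-9 substitutions/site/year.
age = ltr_age_years(0.01, 1e-9)
print(f"estimated insertion age = {age / 1e6:.2f} million years")
```

    The estimate scales inversely with the assumed rate, so the choice of r dominates the uncertainty for young insertions with few differences between LTRs.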

  11. An Evolutionarily Young Polar Bear (Ursus maritimus) Endogenous Retrovirus Identified from Next Generation Sequence Data

    PubMed Central

    Tsangaras, Kyriakos; Mayer, Jens; Alquezar-Planas, David E.; Greenwood, Alex D.

    2015-01-01

    Transcriptome analysis of polar bear (Ursus maritimus) tissues identified sequences with similarity to Porcine Endogenous Retroviruses (PERV). Based on these sequences, four proviral copies and 15 solo long terminal repeats (LTRs) of a newly described endogenous retrovirus were characterized from the polar bear draft genome sequence. Closely related sequences were identified by PCR analysis of brown bear (Ursus arctos) and black bear (Ursus americanus) but were absent in non-Ursinae bear species. The virus was therefore designated UrsusERV. Two distinct groups of LTRs were observed including a recombinant ERV that contained one LTR belonging to each group indicating that genomic invasions by at least two UrsusERV variants have recently occurred. Age estimates based on proviral LTR divergence and conservation of integration sites among ursids suggest the viral group is only a few million years old. The youngest provirus was polar bear specific, had intact open reading frames (ORFs) and could potentially encode functional proteins. Phylogenetic analyses of UrsusERV consensus protein sequences suggest that it is part of a pig, gibbon and koala retrovirus clade. The young age estimates and lineage specificity of the virus suggest UrsusERV is a recent cross species transmission from an unknown reservoir and place the viral group among the youngest of ERVs identified in mammals. PMID:26610552

  12. Resolution of ray-finned fish phylogeny and timing of diversification.

    PubMed

    Near, Thomas J; Eytan, Ron I; Dornburg, Alex; Kuhn, Kristen L; Moore, Jon A; Davis, Matthew P; Wainwright, Peter C; Friedman, Matt; Smith, W Leo

    2012-08-21

    Ray-finned fishes make up half of all living vertebrate species. Nearly all ray-finned fishes are teleosts, which include most commercially important fish species, several model organisms for genomics and developmental biology, and the dominant component of marine and freshwater vertebrate faunas. Despite the economic and scientific importance of ray-finned fishes, the lack of a single comprehensive phylogeny with corresponding divergence-time estimates has limited our understanding of the evolution and diversification of this radiation. Our analyses, which use multiple nuclear gene sequences in conjunction with 36 fossil age constraints, result in a well-supported phylogeny of all major ray-finned fish lineages and molecular age estimates that are generally consistent with the fossil record. This phylogeny informs three long-standing problems: specifically identifying elopomorphs (eels and tarpons) as the sister lineage of all other teleosts, providing a unique hypothesis on the radiation of early euteleosts, and offering a promising strategy for resolution of the "bush at the top of the tree" that includes percomorphs and other spiny-finned teleosts. Contrasting our divergence time estimates with studies using a single nuclear gene or whole mitochondrial genomes, we find that the former underestimates ages of the oldest ray-finned fish divergences, but the latter dramatically overestimates ages for derived teleost lineages. Our time-calibrated phylogeny reveals that much of the diversification leading to extant groups of teleosts occurred between the late Mesozoic and early Cenozoic, identifying this period as the "Second Age of Fishes."

  13. Extended mitogenomic phylogenetic analyses yield new insight into crocodylian evolution and their survival of the Cretaceous-Tertiary boundary.

    PubMed

    Roos, Jonas; Aggarwal, Ramesh K; Janke, Axel

    2007-11-01

    The mitochondrial genomes of the dwarf crocodile, Osteolaemus tetraspis, and two species of dwarf caimans, the smooth-fronted caiman, Paleosuchus trigonatus, and Cuvier's dwarf caiman, Paleosuchus palpebrosus, were sequenced and included in a mitogenomic phylogenetic study. The phylogenetic analyses, which included a total of ten crocodylian species, yielded strong support to a basal split between Crocodylidae and Alligatoridae. Osteolaemus fell within the Crocodylidae as the sister group to Crocodylus. Gavialis and Tomistoma, which joined on a common branch, constituted a sister group to Crocodylus/Osteolaemus. This suggests that extant crocodylians are organized in two families: Alligatoridae and Crocodylidae. Within the Alligatoridae there was a basal split between Alligator and a branch that contained Paleosuchus and Caiman. The analyses also provided molecular estimates of various divergences applying recently established crocodylian and outgroup fossil calibration points. Molecular estimates based on amino acid data placed the divergence between Crocodylidae and Alligatoridae at 97-103 million years ago and that between Alligator and Caiman/Paleosuchus at 65-72 million years ago. Other crocodilian divergences were placed after the Cretaceous-Tertiary boundary. Thus, according to the molecular estimates, three extant crocodylian lineages have their roots in the Cretaceous. Considering the crocodylian diversification in the Cretaceous the molecular datings suggest that the extinction of the dinosaurs was also to some extent paralleled in the crocodylian evolution. However, for whatever reason, some crocodylian lineages survived into the Tertiary.

  14. Resolution of ray-finned fish phylogeny and timing of diversification

    PubMed Central

    Near, Thomas J.; Eytan, Ron I.; Dornburg, Alex; Kuhn, Kristen L.; Moore, Jon A.; Davis, Matthew P.; Wainwright, Peter C.; Friedman, Matt; Smith, W. Leo

    2012-01-01

    Ray-finned fishes make up half of all living vertebrate species. Nearly all ray-finned fishes are teleosts, which include most commercially important fish species, several model organisms for genomics and developmental biology, and the dominant component of marine and freshwater vertebrate faunas. Despite the economic and scientific importance of ray-finned fishes, the lack of a single comprehensive phylogeny with corresponding divergence-time estimates has limited our understanding of the evolution and diversification of this radiation. Our analyses, which use multiple nuclear gene sequences in conjunction with 36 fossil age constraints, result in a well-supported phylogeny of all major ray-finned fish lineages and molecular age estimates that are generally consistent with the fossil record. This phylogeny informs three long-standing problems: specifically identifying elopomorphs (eels and tarpons) as the sister lineage of all other teleosts, providing a unique hypothesis on the radiation of early euteleosts, and offering a promising strategy for resolution of the “bush at the top of the tree” that includes percomorphs and other spiny-finned teleosts. Contrasting our divergence time estimates with studies using a single nuclear gene or whole mitochondrial genomes, we find that the former underestimates ages of the oldest ray-finned fish divergences, but the latter dramatically overestimates ages for derived teleost lineages. Our time-calibrated phylogeny reveals that much of the diversification leading to extant groups of teleosts occurred between the late Mesozoic and early Cenozoic, identifying this period as the “Second Age of Fishes.” PMID:22869754

  15. Divergent nuclear 18S rDNA paralogs in a turkey coccidium, Eimeria meleagrimitis, complicate molecular systematics and identification.

    PubMed

    El-Sherry, Shiem; Ogedengbe, Mosun E; Hafeez, Mian A; Barta, John R

    2013-07-01

    Multiple 18S rDNA sequences were obtained from two single-oocyst-derived lines of each of Eimeria meleagrimitis and Eimeria adenoeides. After analysing the 15 new 18S rDNA sequences from two lines of E. meleagrimitis and 17 new sequences from two lines of E. adenoeides, there were clear indications that divergent, paralogous 18S rDNA copies existed within the nuclear genome of E. meleagrimitis. In contrast, mitochondrial cytochrome c oxidase subunit I (COI) partial sequences from all lines of a particular Eimeria sp. were identical and, in phylogenetic analyses, COI sequences clustered unambiguously in monophyletic and highly-supported clades specific to individual Eimeria sp. Phylogenetic analysis of the new 18S rDNA sequences from E. meleagrimitis showed that they formed two distinct clades: Type A with four new sequences; and Type B with nine new sequences; both Types A and B sequences were obtained from each of the single-oocyst-derived lines of E. meleagrimitis. Together these rDNA types formed a well-supported E. meleagrimitis clade. Types A and B 18S rDNA sequences from E. meleagrimitis had a mean sequence identity of only 97.4% whereas mean sequence identity within types was 99.1-99.3%. The observed intraspecific sequence divergence among E. meleagrimitis 18S rDNA sequence types was even higher (approximately 2.6%) than the interspecific sequence divergence present between some well-recognized species such as Eimeria tenella and Eimeria necatrix (1.1%). Our observations suggest that, unlike COI sequences, 18S rDNA sequences are not reliable molecular markers to be used alone for species identification with coccidia, although 18S rDNA sequences have clear utility for phylogenetic reconstruction of apicomplexan parasites at the genus and higher taxonomic ranks. Copyright © 2013. Published by Elsevier Ltd.

  16. Identifying Genetic Signatures of Natural Selection Using Pooled Population Sequencing in Picea abies

    PubMed Central

    Chen, Jun; Källman, Thomas; Ma, Xiao-Fei; Zaina, Giusi; Morgante, Michele; Lascoux, Martin

    2016-01-01

    The joint inference of selection and past demography remains a costly and demanding task. We used next generation sequencing of two pools of 48 Norway spruce mother trees, one corresponding to the Fennoscandian domain, and the other to the Alpine domain, to assess nucleotide polymorphism at 88 nuclear genes. These genes are candidate genes for phenological traits, and most belong to the photoperiod pathway. Estimates of population genetic summary statistics from the pooled data are similar to previous estimates, suggesting that pooled sequencing is reliable. The nonsynonymous SNPs tended to have both lower frequency differences and lower FST values between the two domains than silent ones. These results suggest the presence of purifying selection. The divergence between the two domains based on synonymous changes was around 5 million yr, a time similar to a recent phylogenetic estimate of 6 million yr, but much larger than earlier estimates based on isozymes. Two approaches, one of them novel and that considers both FST and difference in allele frequencies between the two domains, were used to identify SNPs potentially under diversifying selection. SNPs from around 20 genes were detected, including genes previously identified as main target for selection, such as PaPRR3 and PaGI. PMID:27172202

  17. Identifying Genetic Signatures of Natural Selection Using Pooled Population Sequencing in Picea abies.

    PubMed

    Chen, Jun; Källman, Thomas; Ma, Xiao-Fei; Zaina, Giusi; Morgante, Michele; Lascoux, Martin

    2016-07-07

    The joint inference of selection and past demography remains a costly and demanding task. We used next generation sequencing of two pools of 48 Norway spruce mother trees, one corresponding to the Fennoscandian domain, and the other to the Alpine domain, to assess nucleotide polymorphism at 88 nuclear genes. These genes are candidate genes for phenological traits, and most belong to the photoperiod pathway. Estimates of population genetic summary statistics from the pooled data are similar to previous estimates, suggesting that pooled sequencing is reliable. The nonsynonymous SNPs tended to have both lower frequency differences and lower FST values between the two domains than silent ones. These results suggest the presence of purifying selection. The divergence between the two domains based on synonymous changes was around 5 million yr, a time similar to a recent phylogenetic estimate of 6 million yr, but much larger than earlier estimates based on isozymes. Two approaches, one of them novel and that considers both FST and difference in allele frequencies between the two domains, were used to identify SNPs potentially under diversifying selection. SNPs from around 20 genes were detected, including genes previously identified as main target for selection, such as PaPRR3 and PaGI. Copyright © 2016 Chen et al.

  18. Complete Chloroplast Genome Sequence of Coptis chinensis Franch. and Its Evolutionary History

    PubMed Central

    He, Yang; Deng, Cao; Fan, Gang; Qin, Shishang

    2017-01-01

    The Coptis chinensis Franch. is an important medicinal plant from the Ranunculales. We used next generation sequencing technology to determine the complete chloroplast genome of C. chinensis. This genome is 155,484 bp long with 38.17% GC content. Two 26,758 bp long inverted repeats separated the genome into a typical quadripartite structure. The C. chinensis chloroplast genome consists of 128 gene loci, including eight rRNA gene loci, 28 tRNA gene loci, and 92 protein-coding gene loci. Most of the SSRs in C. chinensis are poly-A/T. The numbers of mononucleotide SSRs in C. chinensis and other Ranunculaceae species are fewer than those in Berberidaceae species, while the number of dinucleotide SSRs is greater than that in the Berberidaceae. C. chinensis diverged from other Ranunculaceae species an estimated 81 million years ago (Mya). The divergence between Ranunculaceae and Berberidaceae was ~111 Mya, while the Ranunculales and Magnoliaceae shared a common ancestor during the Jurassic, ~153 Mya. Position 104 of the C. chinensis ndhG protein was identified as a positively selected site, indicating possible selection for the photosystem-chlororespiration system in C. chinensis. In summary, the complete sequencing and annotation of the C. chinensis chloroplast genome will facilitate future studies on this important medicinal species. PMID:28698879

  19. Insights into the evolution of Yersinia pestis through whole-genome comparison with Yersinia pseudotuberculosis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chain, Patrick S. G.; Carniel, E.; Larimer, Frank W

    2004-09-01

    Yersinia pestis, the causative agent of plague, is a highly uniform clone that diverged recently from the enteric pathogen Yersinia pseudotuberculosis. Despite their close genetic relationship, they differ radically in their pathogenicity and transmission. Here, we report the complete genomic sequence of Y. pseudotuberculosis IP32953 and its use for detailed genome comparisons with available Y. pestis sequences. Analyses of identified differences across a panel of Yersinia isolates from around the world reveal 32 Y. pestis chromosomal genes that, together with the two Y. pestis-specific plasmids, to our knowledge, represent the only new genetic material in Y. pestis acquired since the divergence from Y. pseudotuberculosis. In contrast, 149 other pseudogenes (doubling the previous estimate) and 317 genes absent from Y. pestis were detected, indicating that as many as 13% of Y. pseudotuberculosis genes no longer function in Y. pestis. Extensive insertion sequence-mediated genome rearrangements and reductive evolution through massive gene loss, resulting in elimination and modification of preexisting gene expression pathways, appear to be more important than acquisition of genes in the evolution of Y. pestis. These results provide a sobering example of how a highly virulent epidemic clone can suddenly emerge from a less virulent, closely related progenitor.

  20. Divergence of RNA polymerase α subunits in angiosperm plastid genomes is mediated by genomic rearrangement.

    PubMed

    Blazier, J Chris; Ruhlman, Tracey A; Weng, Mao-Lun; Rehman, Sumaiyah K; Sabir, Jamal S M; Jansen, Robert K

    2016-04-18

    Genes for the plastid-encoded RNA polymerase (PEP) persist in the plastid genomes of all photosynthetic angiosperms. However, three unrelated lineages (Annonaceae, Passifloraceae and Geraniaceae) have been identified with unusually divergent open reading frames (ORFs) in the conserved region of rpoA, the gene encoding the PEP α subunit. We used sequence-based approaches to evaluate whether these genes retain function. Both gene sequences and complete plastid genome sequences were assembled and analyzed from each of the three angiosperm families. Multiple lines of evidence indicated that the rpoA sequences are likely functional despite retaining as low as 30% nucleotide sequence identity with rpoA genes from outgroups in the same angiosperm order. The ratio of non-synonymous to synonymous substitutions indicated that these genes are under purifying selection, and bioinformatic prediction of conserved domains indicated that functional domains are preserved. One of the lineages (Pelargonium, Geraniaceae) contains species with multiple rpoA-like ORFs that show evidence of ongoing inter-paralog gene conversion. The plastid genomes containing these divergent rpoA genes have experienced extensive structural rearrangement, including large expansions of the inverted repeat. We propose that illegitimate recombination, not positive selection, has driven the divergence of rpoA.

  1. Clustering evolving proteins into homologous families.

    PubMed

    Chan, Cheong Xin; Mahbob, Maisarah; Ragan, Mark A

    2013-04-08

    Clustering sequences into groups of putative homologs (families) is a critical first step in many areas of comparative biology and bioinformatics. The performance of clustering approaches in delineating biologically meaningful families depends strongly on characteristics of the data, including content bias and degree of divergence. New, highly scalable methods have recently been introduced to cluster the very large datasets being generated by next-generation sequencing technologies. However, there has been little systematic investigation of how characteristics of the data impact the performance of these approaches. Using clusters from a manually curated dataset as reference, we examined the performance of a widely used graph-based Markov clustering algorithm (MCL) and a greedy heuristic approach (UCLUST) in delineating protein families coded by three sets of bacterial genomes of different G+C content. Both MCL and UCLUST generated clusters that are comparable to the reference sets at specific parameter settings, although UCLUST tends to under-cluster compositionally biased sequences (G+C content 33% and 66%). Using simulated data, we sought to assess the individual effects of sequence divergence, rate heterogeneity, and underlying G+C content. Performance decreased with increasing sequence divergence, decreasing among-site rate variation, and increasing G+C bias. Two MCL-based methods recovered the simulated families more accurately than did UCLUST. MCL using local alignment distances is more robust across the investigated range of sequence features than are greedy heuristics using distances based on global alignment. Our results demonstrate that sequence divergence, rate heterogeneity and content bias can individually and in combination affect the accuracy with which MCL and UCLUST can recover homologous protein families. For application to data that are more divergent, and exhibit higher among-site rate variation and/or content bias, MCL may often be the better choice, especially if computational resources are not limiting.

  2. New families of site-specific repetitive DNA sequences that comprise constitutive heterochromatin of the Syrian hamster (Mesocricetus auratus, Cricetinae, Rodentia).

    PubMed

    Yamada, Kazuhiko; Kamimura, Eikichi; Kondo, Mariko; Tsuchiya, Kimiyuki; Nishida-Umehara, Chizuko; Matsuda, Yoichi

    2006-02-01

    We molecularly cloned new families of site-specific repetitive DNA sequences from BglII- and EcoRI-digested genomic DNA of the Syrian hamster (Mesocricetus auratus, Cricetinae, Rodentia) and characterized them by chromosome in situ hybridization and filter hybridization. They were classified into six different types of repetitive DNA sequence families according to chromosomal distribution and genome organization. The hybridization patterns of the sequences were consistent with the distribution of C-positive bands and/or Hoechst-stained heterochromatin. The centromeric major satellite DNA and sex chromosome-specific and telomeric region-specific repetitive sequences were conserved in the same genus (Mesocricetus) but divergent in different genera. The chromosome-2-specific sequence was conserved in two genera, Mesocricetus and Cricetulus, and a low copy number of repetitive sequences on the heterochromatic chromosome arms were conserved in the subfamily Cricetinae but not in the subfamily Calomyscinae. By contrast, the other type of repetitive sequences on the heterochromatic chromosome arms, which had sequence similarities to a LINE sequence of rodents, was conserved through the three subfamilies, Cricetinae, Calomyscinae and Murinae. The nucleotide divergence of the repetitive sequences of heterochromatin was well correlated with the phylogenetic relationships of the Cricetinae species, and each sequence has been independently amplified and diverged in the same genome.

  3. DNA barcodes for dragonflies and damselflies (Odonata) of Mindanao, Philippines.

    PubMed

    Casas, Princess Angelie S; Sing, Kong-Wah; Lee, Ping-Shin; Nuñeza, Olga M; Villanueva, Reagan Joseph T; Wilson, John-James

    2018-03-01

    Reliable species identification provides a sounder basis for use of species in the order Odonata as biological indicators and for their conservation, an urgent concern as many species are threatened with imminent extinction. We generated 134 COI barcodes from 36 morphologically identified species of Odonata collected from Mindanao Island, representing 10 families and 19 genera. Intraspecific sequence divergences ranged from 0 to 6.7% with four species showing more than 2%, while interspecific sequence divergences ranged from 0.5 to 23.3% with seven species showing less than 2%. Consequently, no distinct gap was observed between intraspecific and interspecific DNA barcode divergences. The numerous islands of the Philippine archipelago may have facilitated rapid speciation in the Odonata and resulted in low interspecific sequence divergences among closely related groups of species. This study contributes DNA barcodes for 36 morphologically identified species of Odonata reported from Mindanao including 31 species with no previous DNA barcode records.

  4. Determining the Effect of Natural Selection on Linked Neutral Divergence across Species

    PubMed Central

    Phung, Tanya N.; Lohmueller, Kirk E.

    2016-01-01

    A major goal in evolutionary biology is to understand how natural selection has shaped patterns of genetic variation across genomes. Studies in a variety of species have shown that neutral genetic diversity (intra-species differences) has been reduced at sites linked to those under direct selection. However, the effect of linked selection on neutral sequence divergence (inter-species differences) remains ambiguous. While empirical studies have reported correlations between divergence and recombination, which is interpreted as evidence for natural selection reducing linked neutral divergence, theory argues otherwise, especially for species that have diverged long ago. Here we address these outstanding issues by examining whether natural selection can affect divergence between both closely and distantly related species. We show that neutral divergence between closely related species (e.g. human-primate) is negatively correlated with functional content and positively correlated with human recombination rate. We also find that neutral divergence between distantly related species (e.g. human-rodent) is negatively correlated with functional content and positively correlated with estimates of background selection from primates. These patterns persist after accounting for the confounding factors of hypermutable CpG sites, GC content, and biased gene conversion. Coalescent models indicate that even when the contribution of ancestral polymorphism to divergence is small, background selection in the ancestral population can still explain a large proportion of the variance in divergence across the genome, generating the observed correlations. Our findings reveal that, contrary to previous intuition, natural selection can indirectly affect linked neutral divergence between both closely and distantly related species. Though we cannot formally exclude the possibility that the direct effects of purifying selection drive some of these patterns, such a scenario would be possible only if more of the genome is under purifying selection than currently believed. Our work has implications for understanding the evolution of genomes and interpreting patterns of genetic variation. PMID:27508305

  5. Determining the Effect of Natural Selection on Linked Neutral Divergence across Species.

    PubMed

    Phung, Tanya N; Huber, Christian D; Lohmueller, Kirk E

    2016-08-01

    A major goal in evolutionary biology is to understand how natural selection has shaped patterns of genetic variation across genomes. Studies in a variety of species have shown that neutral genetic diversity (intra-species differences) has been reduced at sites linked to those under direct selection. However, the effect of linked selection on neutral sequence divergence (inter-species differences) remains ambiguous. While empirical studies have reported correlations between divergence and recombination, which is interpreted as evidence for natural selection reducing linked neutral divergence, theory argues otherwise, especially for species that have diverged long ago. Here we address these outstanding issues by examining whether natural selection can affect divergence between both closely and distantly related species. We show that neutral divergence between closely related species (e.g. human-primate) is negatively correlated with functional content and positively correlated with human recombination rate. We also find that neutral divergence between distantly related species (e.g. human-rodent) is negatively correlated with functional content and positively correlated with estimates of background selection from primates. These patterns persist after accounting for the confounding factors of hypermutable CpG sites, GC content, and biased gene conversion. Coalescent models indicate that even when the contribution of ancestral polymorphism to divergence is small, background selection in the ancestral population can still explain a large proportion of the variance in divergence across the genome, generating the observed correlations. Our findings reveal that, contrary to previous intuition, natural selection can indirectly affect linked neutral divergence between both closely and distantly related species. Though we cannot formally exclude the possibility that the direct effects of purifying selection drive some of these patterns, such a scenario would be possible only if more of the genome is under purifying selection than currently believed. Our work has implications for understanding the evolution of genomes and interpreting patterns of genetic variation.

  6. Faster-X evolution of gene expression is driven by recessive adaptive cis-regulatory variation in Drosophila.

    PubMed

    Llopart, Ana

    2018-05-01

    The hemizygosity of the X (Z) chromosome fully exposes the fitness effects of mutations on that chromosome and has evolutionary consequences on the relative rates of evolution of X and autosomes. Specifically, several population genetics models predict increased rates of evolution in X-linked loci relative to autosomal loci. This prediction of faster-X evolution has been evaluated and confirmed for both protein coding sequences and gene expression. In the case of faster-X evolution for gene expression divergence, it is often assumed that variation in 5' noncoding sequences is associated with variation in transcript abundance between species but a formal, genomewide test of this hypothesis is still missing. Here, I use whole genome sequence data in Drosophila yakuba and D. santomea to evaluate this hypothesis and report positive correlations between sequence divergence at 5' noncoding sequences and gene expression divergence. I also examine polymorphism and divergence in 9,279 noncoding sequences located at the 5' end of annotated genes and detected multiple signals of positive selection. Notably, I used the traditional synonymous sites as neutral reference to test for adaptive evolution, but I also used bases 8-30 of introns <65 bp, which have been proposed to be a better neutral choice. X-linked genes with high degree of male-biased expression show the most extreme adaptive pattern at 5' noncoding regions, in agreement with faster-X evolution for gene expression divergence and a higher incidence of positively selected recessive mutations. © 2018 The Authors. Molecular Ecology Published by John Wiley & Sons Ltd.

  7. Molecular taxonomy, phylogeny and evolution in the family Stichopodidae (Aspidochirotida: Holothuroidea) based on COI and 16S mitochondrial DNA.

    PubMed

    Byrne, Maria; Rowe, Frank; Uthicke, Sven

    2010-09-01

    The Stichopodidae comprise a diverse assemblage of holothuroids most of which occur in the Indo-Pacific. Phylogenetic analyses of mitochondrial gene (COI, 16S rRNA) sequence for 111 individuals (7 genera, 17 species) clarified taxonomic uncertainties, species relationships, biogeography and evolution of the family. A monophyly of the genus Stichopus was supported with the exception of Stichopus ellipes. Molecular analyses confirmed genus level taxonomy based on morphology. Most specimens harvested as S. horrens fell in the S. monotuberculatus clade, a morphologically variable assemblage with others from the S. naso clade. Taxonomic clarification of species fished as S. horrens will assist conservation measures. Evolutionary rates based on comparison of sequence from trans-isthmian Isostichopus species estimated that Stichopus and Isostichopus diverged ca. 5.5-10.7 Ma (Miocene). More recent splits were estimated to be younger than 1 Ma. Copyright 2010 Elsevier Inc. All rights reserved.

  8. Contrasting morphological and DNA barcode-suggested species boundaries among shallow-water amphipod fauna from the southern European Atlantic coast.

    PubMed

    Lobo, Jorge; Ferreira, Maria S; Antunes, Ilisa C; Teixeira, Marcos A L; Borges, Luisa M S; Sousa, Ronaldo; Gomes, Pedro A; Costa, Maria Helena; Cunha, Marina R; Costa, Filipe O

    2017-02-01

    In this study we compared DNA barcode-suggested species boundaries with morphology-based species identifications in the amphipod fauna of the southern European Atlantic coast. DNA sequences of the cytochrome c oxidase subunit I barcode region (COI-5P) were generated for 43 morphospecies (178 specimens) collected along the Portuguese coast which, together with publicly available COI-5P sequences, produced a final dataset comprising 68 morphospecies and 295 sequences. Seventy-five BINs (Barcode Index Numbers) were assigned to these morphospecies, of which 48 were concordant (i.e., 1 BIN = 1 species), 8 were taxonomically discordant, and 19 were singletons. Twelve species had matching sequences (<2% distance) with conspecifics from distant locations (e.g., North Sea). Seven morphospecies were assigned to multiple, and highly divergent, BINs, including specimens of Corophium multisetosum (18% divergence) and Dexamine spiniventris (16% divergence), which originated from sampling locations on the west coast of Portugal (only about 36 and 250 km apart, respectively). We also found deep divergence (4%-22%) among specimens of seven species from Portugal compared to those from the North Sea and Italy. The detection of evolutionarily meaningful divergence among populations of several amphipod species from southern Europe reinforces the need for a comprehensive re-assessment of the diversity of this faunal group.

  9. iGLASS: An Improvement to the GLASS Method for Estimating Species Trees from Gene Trees

    PubMed Central

    Rosenberg, Noah A.

    2012-01-01

    Several methods have been designed to infer species trees from gene trees while taking into account gene tree/species tree discordance. Although some of these methods provide consistent species tree topology estimates under a standard model, most either do not estimate branch lengths or are computationally slow. An exception, the GLASS method of Mossel and Roch, is consistent for the species tree topology, estimates branch lengths, and is computationally fast. However, GLASS systematically overestimates divergence times, leading to biased estimates of species tree branch lengths. By assuming a multispecies coalescent model in which multiple lineages are sampled from each of two taxa at L independent loci, we derive the distribution of the waiting time until the first interspecific coalescence occurs between the two taxa, considering all loci and measuring from the divergence time. We then use the mean of this distribution to derive a correction to the GLASS estimator of pairwise divergence times. We show that our improved estimator, which we call iGLASS, consistently estimates the divergence time between a pair of taxa as the number of loci approaches infinity, and that it is an unbiased estimator of divergence times when one lineage is sampled per taxon. We also show that many commonly used clustering methods can be combined with the iGLASS estimator of pairwise divergence times to produce a consistent estimator of the species tree topology. Through simulations, we show that iGLASS can greatly reduce the bias and mean squared error in obtaining estimates of divergence times in a species tree. PMID:22216756
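
    The bias described above can be reproduced in a few lines for the simplest case. The sketch below is an illustration, not the paper's general formula: it assumes one lineage per taxon, times measured in coalescent units (so a pair of lineages coalesces at rate 1 after the divergence time), and it takes the GLASS-style estimate to be the minimum interspecific coalescence time over loci. Under those assumptions the waiting time past the divergence is the minimum of L independent Exp(1) variables, whose mean is 1/L, so subtracting 1/L removes the bias. All function names are hypothetical.

```python
import random


def glass_estimate(coalescence_times):
    """GLASS-style estimate: minimum interspecific coalescence time over loci."""
    return min(coalescence_times)


def corrected_estimate(coalescence_times):
    """Subtract the mean waiting time 1/L (one lineage per taxon, coalescent units).

    This is the single-lineage special case only; the estimate can be
    negative when the true divergence time is very small.
    """
    n_loci = len(coalescence_times)
    return min(coalescence_times) - 1.0 / n_loci


def simulate_means(tau, n_loci, n_reps=20000, seed=1):
    """Average both estimators over simulated data sets with true divergence tau."""
    rng = random.Random(seed)
    glass_sum = 0.0
    corrected_sum = 0.0
    for _ in range(n_reps):
        # Each locus coalesces an Exp(1) waiting time before the divergence.
        times = [tau + rng.expovariate(1.0) for _ in range(n_loci)]
        glass_sum += glass_estimate(times)
        corrected_sum += corrected_estimate(times)
    return glass_sum / n_reps, corrected_sum / n_reps


glass_mean, corrected_mean = simulate_means(tau=1.0, n_loci=5)
```

    With a true divergence time of 1.0 and 5 loci, the uncorrected mean should sit near 1.2 and the corrected mean near 1.0. The overestimate shrinks as loci are added, which is consistent with the abstract's statement that the estimator is consistent as the number of loci grows.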

  10. Mito-nuclear discord in six congeneric lineages of Holarctic ducks (genus Anas).

    PubMed

    Peters, Jeffrey L; Winker, Kevin; Millam, Kendra C; Lavretsky, Philip; Kulikova, Irina; Wilson, Robert E; Zhuravlev, Yuri N; McCracken, Kevin G

    2014-06-01

    Many species have Holarctic distributions that extend across Europe, Asia and North America. Most genetics research on these species has examined only mitochondrial (mt) DNA, which has revealed wide variance in divergence between Old World (OW) and New World (NW) populations, ranging from shallow, unstructured genealogies to deeply divergent lineages. In this study, we sequenced 20 nuclear introns to test for concordant patterns of OW-NW differentiation between mtDNA and nuclear (nu) DNA for six lineages of Holarctic ducks (genus Anas). Genetic differentiation for both marker types varied widely among these lineages (idiosyncratic population histories), but mtDNA and nuDNA divergence within lineages was not significantly correlated. Moreover, compared with the association between mtDNA and nuDNA divergence observed among different species, OW-NW nuDNA differentiation was generally lower than mtDNA divergence, at least for lineages with deeply divergent mtDNA. Furthermore, coalescent estimates indicated significantly higher rates of gene flow for nuDNA than mtDNA for four of the six lineages. Thus, Holarctic ducks show prominent mito-nuclear discord between OW and NW populations, and we reject differences in sorting rates as the sole cause of the within-species discord. Male-mediated intercontinental gene flow is likely a leading contributor to this discord, although selection could also cause increased mtDNA divergence relative to weak nuDNA differentiation. The population genetics of these ducks contribute to growing evidence that mtDNA can be an unreliable indicator of stage of speciation and that more holistic approaches are needed for species delimitation. © 2014 John Wiley & Sons Ltd.

  11. Comparative genomics and repetitive sequence divergence in the species of diploid Nicotiana section Alatae.

    PubMed

    Lim, K Yoong; Kovarik, Ales; Matyasek, Roman; Chase, Mark W; Knapp, Sandra; McCarthy, Elizabeth; Clarkson, James J; Leitch, Andrew R

    2006-12-01

    Combining phylogenetic reconstructions of species relationships with comparative genomic approaches is a powerful way to decipher evolutionary events associated with genome divergence. Here, we reconstruct the history of karyotype and tandem repeat evolution in species of diploid Nicotiana section Alatae. By analysis of plastid DNA, we resolved two clades with high bootstrap support, one containing N. alata, N. langsdorffii, N. forgetiana and N. bonariensis (called the n = 9 group) and another containing N. plumbaginifolia and N. longiflora (called the n = 10 group). Despite little plastid DNA sequence divergence, we observed, via fluorescent in situ hybridization, substantial chromosomal repatterning, including altered chromosome numbers, structure and distribution of repeats. Effort was focussed on 35S and 5S nuclear ribosomal DNA (rDNA) and the HRS60 satellite family of tandem repeats comprising the elements HRS60, NP3R and NP4R. We compared divergence of these repeats in diploids and polyploids of Nicotiana. There are dramatic shifts in the distribution of the satellite repeats and complete replacement of intergenic spacers (IGSs) of 35S rDNA associated with divergence of the species in section Alatae. We suggest that sequence homogenization has replaced HRS60 family repeats at sub-telomeric regions, but that this process may not occur, or occurs more slowly, when the repeats are found at intercalary locations. Sequence homogenization acts more rapidly (at least two orders of magnitude) on 35S rDNA than 5S rDNA and sub-telomeric satellite sequences. This rapid rate of divergence is analogous to that found in polyploid species, and is therefore, in plants, not only associated with polyploidy.

  12. Whole genome investigation of a divergent clade of the pathogen Streptococcus suis

    PubMed Central

    Baig, Abiyad; Weinert, Lucy A.; Peters, Sarah E.; Howell, Kate J.; Chaudhuri, Roy R.; Wang, Jinhong; Holden, Matthew T. G.; Parkhill, Julian; Langford, Paul R.; Rycroft, Andrew N.; Wren, Brendan W.; Tucker, Alexander W.; Maskell, Duncan J.

    2015-01-01

    Streptococcus suis is a major porcine and zoonotic pathogen responsible for significant economic losses in the pig industry and an increasing number of human cases. Multiple isolates of S. suis show marked genomic diversity. Here, we report the analysis of whole genome sequences of nine pig isolates that caused disease typical of S. suis and had phenotypic characteristics of S. suis, but their genomes were divergent from those of many other S. suis isolates. Comparison of protein sequences predicted from divergent genomes with those from normal S. suis reduced the size of core genome from 793 to only 397 genes. Divergence was clear if phylogenetic analysis was performed on reduced core genes and MLST alleles. Phylogenies based on certain other genes (16S rRNA, sodA, recN, and cpn60) did not show divergence for all isolates, suggesting recombination between some divergent isolates with normal S. suis for these genes. Indeed, there is evidence of recent recombination between the divergent and normal S. suis genomes for 249 of 397 core genes. In addition, phylogenetic analysis based on the 16S rRNA gene and 132 genes that were conserved between the divergent isolates and representatives of the broader Streptococcus genus showed that divergent isolates were more closely related to S. suis. Six out of nine divergent isolates possessed a S. suis-like capsule region with variation in capsular gene sequences but the remaining three did not have a discrete capsule locus. The majority (40/70) of virulence-associated genes in normal S. suis were present in the divergent genomes. Overall, the divergent isolates extend the current diversity of S. suis species but the phenotypic similarities and the large amount of gene exchange with normal S. suis give insufficient evidence to assign these isolates to a new species or subspecies. Further sampling and whole genome analysis of more isolates are warranted to understand the diversity of the species. PMID:26583006

  13. Next Generation Semiconductor Based Sequencing of the Donkey (Equus asinus) Genome Provided Comparative Sequence Data against the Horse Genome and a Few Millions of Single Nucleotide Polymorphisms

    PubMed Central

    Bertolini, Francesca; Scimone, Concetta; Geraci, Claudia; Schiavo, Giuseppina; Utzeri, Valerio Joe; Chiofalo, Vincenzo; Fontanesi, Luca

    2015-01-01

    Few studies have investigated the donkey (Equus asinus) at the whole genome level so far. Here, we sequenced the genome of two male donkeys using a next generation semiconductor based sequencing platform (the Ion Proton sequencer) and compared the obtained sequence information with the available donkey draft genome (and the Illumina reads from which it originated) and with the EquCab2.0 assembly of the horse genome. Moreover, the Ion Torrent Personal Genome Analyzer was used to sequence reduced representation libraries (RRL) obtained from a DNA pool including donkeys of different breeds (Grigio Siciliano, Ragusano and Martina Franca). The number of next generation sequencing reads aligned with the EquCab2.0 horse genome was larger than the number aligned with the draft donkey genome. This was due to the larger N50 for contigs and scaffolds of the horse genome. Nucleotide divergence between E. caballus and E. asinus was estimated to be ~ 0.52-0.57%. Regions with low nucleotide divergence were identified in several autosomal chromosomes and in the whole chromosome X. These regions might be evolutionarily important in equids. Comparing Y-chromosome regions we identified variants that could be useful to track donkey paternal lineages. Moreover, about 4.8 million single nucleotide polymorphisms (SNPs) in the donkey genome were identified and annotated combining sequencing data from Ion Proton (whole genome sequencing) and Ion Torrent (RRL) runs with Illumina reads. A higher density of SNPs was present in regions homologous to horse chromosome 12, in which several studies reported a high frequency of copy number variants. The SNPs we identified constitute a first resource useful to describe variability at the population genomic level in E. asinus and to establish monitoring systems for the conservation of donkey genetic resources. PMID:26151450

  14. Next Generation Semiconductor Based Sequencing of the Donkey (Equus asinus) Genome Provided Comparative Sequence Data against the Horse Genome and a Few Millions of Single Nucleotide Polymorphisms.

    PubMed

    Bertolini, Francesca; Scimone, Concetta; Geraci, Claudia; Schiavo, Giuseppina; Utzeri, Valerio Joe; Chiofalo, Vincenzo; Fontanesi, Luca

    2015-01-01

    Few studies have investigated the donkey (Equus asinus) at the whole genome level so far. Here, we sequenced the genome of two male donkeys using a next generation semiconductor based sequencing platform (the Ion Proton sequencer) and compared the obtained sequence information with the available donkey draft genome (and the Illumina reads from which it originated) and with the EquCab2.0 assembly of the horse genome. Moreover, the Ion Torrent Personal Genome Analyzer was used to sequence reduced representation libraries (RRL) obtained from a DNA pool including donkeys of different breeds (Grigio Siciliano, Ragusano and Martina Franca). The number of next generation sequencing reads aligned with the EquCab2.0 horse genome was larger than the number aligned with the draft donkey genome. This was due to the larger N50 for contigs and scaffolds of the horse genome. Nucleotide divergence between E. caballus and E. asinus was estimated to be ~ 0.52-0.57%. Regions with low nucleotide divergence were identified in several autosomal chromosomes and in the whole chromosome X. These regions might be evolutionarily important in equids. Comparing Y-chromosome regions we identified variants that could be useful to track donkey paternal lineages. Moreover, about 4.8 million single nucleotide polymorphisms (SNPs) in the donkey genome were identified and annotated combining sequencing data from Ion Proton (whole genome sequencing) and Ion Torrent (RRL) runs with Illumina reads. A higher density of SNPs was present in regions homologous to horse chromosome 12, in which several studies reported a high frequency of copy number variants. The SNPs we identified constitute a first resource useful to describe variability at the population genomic level in E. asinus and to establish monitoring systems for the conservation of donkey genetic resources.

  15. Tracking the origins of the cave bear (Ursus spelaeus) by mitochondrial DNA sequencing.

    PubMed Central

    Hänni, C; Laudet, V; Stehelin, D; Taberlet, P

    1994-01-01

    The different European populations of Ursus arctos, the brown bear, were recently studied for mitochondrial DNA polymorphism. Two clearly distinct lineages (eastern and western) were found, which may have diverged approximately 850,000 years ago. In this context, it was interesting to study the cave bear, Ursus spelaeus, a species which became extinct 20,000 years ago. In this study, we have amplified and sequenced a fragment of 139-bp in the mitochondrial DNA control region of a 40,000-year-old specimen of U. spelaeus. Phylogenetic reconstructions using this sequence and the European brown bear sequences already published suggest that U. spelaeus diverged from an early offshoot of U. arctos--i.e., approximately at the same time as the divergence of the two main lineages of U. arctos. This divergence probably took place at the earliest glaciation, likely due to geographic separation during the earlier Quaternary cold periods. This result is in agreement with the paleontological data available and suggests a good correspondence between molecular and morphological data. PMID:7991628

  16. The augmentation algorithm and molecular phylogenetic trees

    NASA Technical Reports Server (NTRS)

    Holmquist, R.

    1978-01-01

    Moore's (1977) augmentation procedure is discussed, and it is concluded that the procedure is valid for obtaining estimates of the total number of fixed nucleotide substitutions both theoretically and in practice, for both simulated and real data, and in agreement, for experimentally dense data sets, with stochastic estimates of the divergence, provided the restrictions on codon mutability resulting from natural selection are explicitly allowed for. Tateno and Nei's (1978) critique that the augmentation procedure has a systematic bias toward overestimation of the total number of nucleotide replacements is disputed, and a data analysis suggests that ancestral sequences inferred by the method of parsimony contain a large number of incorrectly assigned nucleotides.

  17. The complete mitochondrial genome of dhole Cuon alpinus: phylogenetic analysis and dating evolutionary divergence within Canidae.

    PubMed

    Zhang, Honghai; Chen, Lei

    2011-03-01

    The dhole (Cuon alpinus) is the only extant species in the genus Cuon (Carnivora: Canidae). In the present study, the complete mitochondrial genome of the dhole was sequenced. The total length is 16672 base pairs, which is the shortest in Canidae. Sequence analysis revealed that most mitochondrial genomic functional regions were highly consistent among canid animals except the CSB domain of the control region. The difference in length among the Canidae mitochondrial genome sequences is mainly due to the number of short tandemly repeated segments in the CSB domain. Phylogenetic analysis was performed on the concatenated data set of 14 mitochondrial genes of 8 canid animals using maximum parsimony (MP), maximum likelihood (ML) and Bayesian inference (BI) methods. The genera Vulpes and Nyctereutes formed a sister group and split first within Canidae, followed by Cuon. The divergence in the genus Canis was the latest. The divergence of domestic dogs after that of Canis lupus laniger is completely supported by all three topologies. Pairwise sequence divergence data of different mitochondrial genes among canid animals were also determined. Except for the synonymous substitutions in protein-coding genes, the control region exhibits the highest sequence divergences. The synonymous rates are approximately two to six times higher than those of the non-synonymous sites except for a slightly higher rate in the non-synonymous substitution between Cuon alpinus and Vulpes vulpes. 16S rRNA genes have a slightly faster sequence divergence than 12S rRNA and tRNA genes. Based on nucleotide substitutions of tRNA genes and rRNA genes, the times since divergence between dhole and other canid animals, and between domestic dogs and three subspecies of wolves were evaluated. The result indicates that Vulpes and Nyctereutes have a close phylogenetic relationship and the divergence of Nyctereutes is a little earlier. The Tibetan wolf may be an archaic lineage within wolf subspecies. The genetic distance between wolves and domestic dogs is less than that among different subspecies of wolves. The domestication of dogs occurred about 1.56-1.92 million years ago or even earlier.

  18. Complete genome sequences of two divergent isolates of strawberry crinkle virus coinfecting a single strawberry plant.

    PubMed

    Koloniuk, Igor; Fránová, Jana; Sarkisova, Tatiana; Přibylová, Jaroslava

    2018-05-04

    Strawberry crinkle disease is one of the major diseases that threatens strawberry production. Although the biological properties of the agent, strawberry crinkle virus (SCV), have been thoroughly investigated, its complete genome sequence has never been published. Existing RT-PCR-based detection relies on a partial sequence of the L protein gene, presumably the least expressed viral gene. Here, we present complete sequences of two divergent SCV isolates co-infecting a single plant, Fragaria x ananassa cv. Čačanská raná.

  19. Highly divergent ancient gene families in metagenomic samples are compatible with additional divisions of life.

    PubMed

    Lopez, Philippe; Halary, Sébastien; Bapteste, Eric

    2015-10-26

    Microbial genetic diversity is often investigated via the comparison of relatively similar 16S molecules through multiple alignments between reference sequences and novel environmental samples using phylogenetic trees, direct BLAST matches, or phylotype counts. However, are we missing novel lineages in the microbial dark universe by relying on standard phylogenetic and BLAST methods? If so, how can we probe that universe using alternative approaches? We performed a novel type of multi-marker analysis of genetic diversity exploiting the topology of inclusive sequence similarity networks. Our protocol identified 86 ancient gene families, well distributed and rarely transferred across the 3 domains of life, and retrieved their environmental homologs among 10 million predicted ORFs from human gut samples and other metagenomic projects. Numerous highly divergent environmental homologs were observed in gut samples, although the most divergent genes were over-represented in non-gut environments. In our networks, most divergent environmental genes grouped exclusively with uncultured relatives, in maximal cliques. Sequences within these groups were under strong purifying selection and presented a range of genetic variation comparable to that of a prokaryotic domain. Many gene families included environmental homologs that were highly divergent from cultured homologs: in 79 gene families (including 18 ribosomal proteins), Bacteria and Archaea were less divergent than some groups of environmental sequences were to any cultured or viral homologs. Moreover, some groups of environmental homologs branched very deeply in phylogenetic trees of life, when they were not too divergent to be aligned. These results underline how limited our understanding of the most diverse elements of the microbial world remains, and encourage a deeper exploration of natural communities and their genetic resources, hinting at the possibility that still unknown yet major divisions of life have yet to be discovered.

  20. A synthetic phylogeny of freshwater crayfish: insights for conservation.

    PubMed

    Owen, Christopher L; Bracken-Grissom, Heather; Stern, David; Crandall, Keith A

    2015-02-19

    Phylogenetic systematics is heading for a renaissance where we shift from considering our phylogenetic estimates as a static image in a published paper and taxonomies as a hardcopy checklist to treating both the phylogenetic estimate and dynamic taxonomies as metadata for further analyses. The Open Tree of Life project (opentreeoflife.org) is developing synthesis tools for harnessing the power of phylogenetic inference and robust taxonomy to develop a synthetic tree of life. We capitalize on this approach to estimate a synthesis tree for the freshwater crayfish. The crayfish make an exceptional group to demonstrate the utility of the synthesis approach, as there recently have been a number of phylogenetic studies on the crayfishes along with a robust underlying taxonomic framework. Importantly, the crayfish have also been extensively assessed by an IUCN Red List team and therefore have accurate and up-to-date area and conservation status data available for analysis within a phylogenetic context. Here, we develop a synthesis phylogeny for the world's freshwater crayfish and examine the phylogenetic distribution of threat. We also estimate a molecular phylogeny based on all available GenBank crayfish sequences and use this tree to estimate divergence times and test for divergence rate variation. Finally, we conduct EDGE and HEDGE analyses and identify a number of species of freshwater crayfish of highest priority in conservation efforts. © 2015 The Author(s) Published by the Royal Society. All rights reserved.

  1. A synthetic phylogeny of freshwater crayfish: insights for conservation

    PubMed Central

    Owen, Christopher L.; Bracken-Grissom, Heather; Stern, David; Crandall, Keith A.

    2015-01-01

    Phylogenetic systematics is heading for a renaissance where we shift from considering our phylogenetic estimates as a static image in a published paper and taxonomies as a hardcopy checklist to treating both the phylogenetic estimate and dynamic taxonomies as metadata for further analyses. The Open Tree of Life project (opentreeoflife.org) is developing synthesis tools for harnessing the power of phylogenetic inference and robust taxonomy to develop a synthetic tree of life. We capitalize on this approach to estimate a synthesis tree for the freshwater crayfish. The crayfish make an exceptional group to demonstrate the utility of the synthesis approach, as there recently have been a number of phylogenetic studies on the crayfishes along with a robust underlying taxonomic framework. Importantly, the crayfish have also been extensively assessed by an IUCN Red List team and therefore have accurate and up-to-date area and conservation status data available for analysis within a phylogenetic context. Here, we develop a synthesis phylogeny for the world's freshwater crayfish and examine the phylogenetic distribution of threat. We also estimate a molecular phylogeny based on all available GenBank crayfish sequences and use this tree to estimate divergence times and test for divergence rate variation. Finally, we conduct EDGE and HEDGE analyses and identify a number of species of freshwater crayfish of highest priority in conservation efforts. PMID:25561670

  2. Mitochondrial Analysis of the Most Basal Canid Reveals Deep Divergence between Eastern and Western North American Gray Foxes (Urocyon spp.) and Ancient Roots in Pleistocene California.

    PubMed

    Goddard, Natalie S; Statham, Mark J; Sacks, Benjamin N

    2015-01-01

    Pleistocene aridification in central North America caused many temperate forest-associated vertebrates to split into eastern and western lineages. Such divisions can be cryptic when Holocene expansions have closed the gaps between once-disjunct ranges or when local morphological variation obscures deeper regional divergences. We investigated such cryptic divergence in the gray fox (Urocyon cinereoargenteus), the most basal extant canid in the world. We also investigated the phylogeography of this species and its diminutive relative, the island fox (U. littoralis), in California. The California Floristic Province was a significant source of Pleistocene diversification for a wide range of taxa and, we hypothesized, for the gray fox as well. Alternatively, gray foxes in California potentially reflected a recent Holocene expansion from further south. We sequenced mitochondrial DNA from 169 gray foxes from the southeastern and southwestern United States and 11 island foxes from three of the Channel Islands. We estimated a 1.3% sequence divergence in the cytochrome b gene between eastern and western foxes and used coalescent simulations to date the divergence to approximately 500,000 years before present (YBP), which is comparable to that between recognized sister species within the Canidae. Gray fox samples collected from throughout California exhibited high haplotype diversity, phylogeographic structure, and genetic signatures of a late-Holocene population decline. Bayesian skyline analysis also indicated an earlier population increase dating to the early Wisconsin glaciation (~70,000 YBP) and a root height extending back to the previous interglacial (~100,000 YBP). Together these findings support California's role as a long-term Pleistocene refugium for western Urocyon. Lastly, based both on our results and re-interpretation of those of another study, we conclude that island foxes of the Channel Islands trace their origins to at least 3 distinct female founders from the mainland rather than to a single matriline, as previously suggested.

  3. Mitochondrial Analysis of the Most Basal Canid Reveals Deep Divergence between Eastern and Western North American Gray Foxes (Urocyon spp.) and Ancient Roots in Pleistocene California

    PubMed Central

    Goddard, Natalie S.; Statham, Mark J.; Sacks, Benjamin N.

    2015-01-01

    Pleistocene aridification in central North America caused many temperate forest-associated vertebrates to split into eastern and western lineages. Such divisions can be cryptic when Holocene expansions have closed the gaps between once-disjunct ranges or when local morphological variation obscures deeper regional divergences. We investigated such cryptic divergence in the gray fox (Urocyon cinereoargenteus), the most basal extant canid in the world. We also investigated the phylogeography of this species and its diminutive relative, the island fox (U. littoralis), in California. The California Floristic Province was a significant source of Pleistocene diversification for a wide range of taxa and, we hypothesized, for the gray fox as well. Alternatively, gray foxes in California potentially reflected a recent Holocene expansion from further south. We sequenced mitochondrial DNA from 169 gray foxes from the southeastern and southwestern United States and 11 island foxes from three of the Channel Islands. We estimated a 1.3% sequence divergence in the cytochrome b gene between eastern and western foxes and used coalescent simulations to date the divergence to approximately 500,000 years before present (YBP), which is comparable to that between recognized sister species within the Canidae. Gray fox samples collected from throughout California exhibited high haplotype diversity, phylogeographic structure, and genetic signatures of a late-Holocene population decline. Bayesian skyline analysis also indicated an earlier population increase dating to the early Wisconsin glaciation (~70,000 YBP) and a root height extending back to the previous interglacial (~100,000 YBP). Together these findings support California’s role as a long-term Pleistocene refugium for western Urocyon. Lastly, based both on our results and re-interpretation of those of another study, we conclude that island foxes of the Channel Islands trace their origins to at least 3 distinct female founders from the mainland rather than to a single matriline, as previously suggested. PMID:26288066

  4. Nonnegative matrix factorization with the Itakura-Saito divergence: with application to music analysis.

    PubMed

    Févotte, Cédric; Bertin, Nancy; Durrieu, Jean-Louis

    2009-03-01

    This letter presents theoretical, algorithmic, and experimental results about nonnegative matrix factorization (NMF) with the Itakura-Saito (IS) divergence. We describe how IS-NMF is underlaid by a well-defined statistical model of superimposed gaussian components and is equivalent to maximum likelihood estimation of variance parameters. This setting can accommodate regularization constraints on the factors through Bayesian priors. In particular, inverse-gamma and gamma Markov chain priors are considered in this work. Estimation can be carried out using a space-alternating generalized expectation-maximization (SAGE) algorithm; this leads to a novel type of NMF algorithm, whose convergence to a stationary point of the IS cost function is guaranteed. We also discuss the links between the IS divergence and other cost functions used in NMF, in particular, the Euclidean distance and the generalized Kullback-Leibler (KL) divergence. As such, we describe how IS-NMF can also be performed using a gradient multiplicative algorithm (a standard algorithm structure in NMF) whose convergence is observed in practice, though not proven. Finally, we report a furnished experimental comparative study of Euclidean-NMF, KL-NMF, and IS-NMF algorithms applied to the power spectrogram of a short piano sequence recorded in real conditions, with various initializations and model orders. Then we show how IS-NMF can successfully be employed for denoising and upmix (mono to stereo conversion) of an original piece of early jazz music. These experiments indicate that IS-NMF correctly captures the semantics of audio and is better suited to the representation of music signals than NMF with the usual Euclidean and KL costs.
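
    The multiplicative algorithm mentioned above can be sketched compactly. The code below is not the authors' implementation and does not cover the SAGE algorithm or the Bayesian priors: it assumes the commonly used multiplicative update rules for the beta-divergence with beta = 0 (the Itakura-Saito case), a strictly positive input matrix, and plain Python lists in place of a numerical library. All function names and the toy matrix are hypothetical.

```python
import math
import random


def matmul(a, b):
    """Plain-Python product of two matrices stored as lists of rows."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def transpose(m):
    """Return the transpose of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*m)]


def is_divergence(v, v_hat):
    """Sum over all entries of d_IS(x | y) = x/y - log(x/y) - 1."""
    total = 0.0
    for row_v, row_hat in zip(v, v_hat):
        for x, y in zip(row_v, row_hat):
            ratio = x / y
            total += ratio - math.log(ratio) - 1.0
    return total


def _ratio_terms(v, v_hat):
    """Return the elementwise matrices V / V_hat**2 and 1 / V_hat."""
    num = [[x / (y * y) for x, y in zip(rv, rh)] for rv, rh in zip(v, v_hat)]
    den = [[1.0 / y for y in rh] for rh in v_hat]
    return num, den


def _scale(m, num, den):
    """Elementwise multiplicative update m * num / den."""
    return [[e * n / d for e, n, d in zip(rm, rn, rd)]
            for rm, rn, rd in zip(m, num, den)]


def is_nmf(v, rank, n_iter=200, seed=0):
    """Factor a strictly positive matrix V as W H with multiplicative updates."""
    rng = random.Random(seed)
    w = [[rng.uniform(0.5, 1.5) for _ in range(rank)] for _ in v]
    h = [[rng.uniform(0.5, 1.5) for _ in v[0]] for _ in range(rank)]
    costs = [is_divergence(v, matmul(w, h))]  # cost before any update
    for _ in range(n_iter):
        # H <- H * (W^T (V / V_hat^2)) / (W^T (1 / V_hat))
        num, den = _ratio_terms(v, matmul(w, h))
        wt = transpose(w)
        h = _scale(h, matmul(wt, num), matmul(wt, den))
        # W <- W * ((V / V_hat^2) H^T) / ((1 / V_hat) H^T)
        num, den = _ratio_terms(v, matmul(w, h))
        ht = transpose(h)
        w = _scale(w, matmul(num, ht), matmul(den, ht))
        costs.append(is_divergence(v, matmul(w, h)))
    return w, h, costs


# Hypothetical toy "power spectrogram": the third row is the sum of the first two.
V = [[1.0, 2.0, 3.0, 4.0], [2.0, 1.0, 4.0, 3.0], [3.0, 3.0, 7.0, 7.0]]
W, H, COSTS = is_nmf(V, rank=2)
```

    Because each update only multiplies by ratios of positive quantities, W and H stay positive without any projection step. Tracking `COSTS` gives an empirical check on the behaviour the abstract describes, where convergence is observed in practice though not proven.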

  5. Partitioning the Genetic Diversity of a Virus Family: Approach and Evaluation through a Case Study of Picornaviruses

    PubMed Central

    Lauber, Chris

    2012-01-01

    The recent advent of genome sequences as the only source available to classify many newly discovered viruses challenges the development of virus taxonomy by expert virologists who traditionally rely on extensive virus characterization. In this proof-of-principle study, we address this issue by presenting a computational approach (DEmARC) to classify viruses of a family into groups at hierarchical levels using a sole criterion—intervirus genetic divergence. To quantify genetic divergence, we used pairwise evolutionary distances (PEDs) estimated by maximum likelihood inference on a multiple alignment of family-wide conserved proteins. PEDs were calculated for all virus pairs, and the resulting distribution was modeled via a mixture of probability density functions. The model enables the quantitative inference of regions of distance discontinuity in the family-wide PED distribution, which define the levels of hierarchy. For each level, a limit on genetic divergence, below which two viruses join the same group, was objectively selected among a set of candidates by minimizing violations of intragroup PEDs to the limit. In a case study, we applied the procedure to hundreds of genome sequences of picornaviruses and extensively evaluated it by modulating four key parameters. It was found that the genetics-based classification largely tolerates variations in virus sampling and multiple alignment construction but is affected by the choice of protein and the measure of genetic divergence. In an accompanying paper (C. Lauber and A. E. Gorbalenya, J. Virol. 86:3905–3915, 2012), we analyze the substantial insight gained with the genetics-based classification approach by comparing it with the expert-based picornavirus taxonomy. PMID:22278230

  6. Reconstructing the colonisation and diversification history of the endemic freshwater crab (Seychellum alluaudi) in the granitic and volcanic Seychelles Archipelago.

    PubMed

    Daniels, Savel R

    2011-11-01

    The endemic, monotypic freshwater crab species Seychellum alluaudi was used as a template to examine the initial colonisation and evolutionary history among the major islands in the Seychelles Archipelago. Five of the "inner" islands in the Seychelles Archipelago including Mahé, Praslin, Silhouette, La Digue and Frégate were sampled. Two partial mtDNA fragments, 16S rRNA and cytochrome oxidase subunit I (COI), were sequenced for 83 specimens of S. alluaudi. Evolutionary relationships between populations were inferred from the combined mtDNA dataset using maximum parsimony, maximum likelihood and Bayesian inferences. Analyses of molecular variance (AMOVA) were used to examine genetic variation among and within clades. A haplotype network was constructed using TCS while BEAST was employed to date the colonisation and divergence of lineages on the islands. Phylogenetic analyses of the combined mtDNA data set of 1103 base pairs retrieved a monophyletic S. alluaudi group comprising three statistically well-supported monophyletic clades. Clade one was exclusive to Silhouette; clade two included samples from Praslin sister to La Digue, while clade three comprised samples from Mahé sister to Frégate. The haplotype network corresponded to the three clades. Within Mahé, substantial phylogeographic substructure was evident. AMOVA results revealed limited genetic variation within localities with most variation occurring among localities. Divergence time estimations predated the Holocene sea level regressions and indicated a Pliocene/Pleistocene divergence between the three clades evident within S. alluaudi. The monophyly of each clade suggests that transoceanic dispersal is rare. The absence of shared haplotypes between the three clades, coupled with marked sequence divergence values, suggests the presence of three allospecies within S. alluaudi. Copyright © 2011 Elsevier Inc. All rights reserved.

  7. Multi-locus phylogeny and divergence time estimates of Enallagma damselflies (Odonata: Coenagrionidae).

    PubMed

    Callahan, Melissa S; McPeek, Mark A

    2016-01-01

    Reconstructing evolutionary patterns of species and populations provides a framework for asking questions about the impacts of climate change. Here we use a multilocus dataset to estimate gene trees under maximum likelihood and Bayesian models to obtain a robust estimate of relationships for a genus of North American damselflies, Enallagma. Using a relaxed molecular clock, we estimate the divergence times for this group. Furthermore, to account for the fact that gene tree analyses can overestimate ages of population divergences, we use a multi-population coalescent model to gain a more accurate estimate of divergence times. We also infer diversification rates using a method that allows for variation in diversification rate through time and among lineages. Our results reveal a complex evolutionary history of Enallagma, in which divergence events both predate and occur during Pleistocene climate fluctuations. There is also evidence of diversification rate heterogeneity across the tree. These divergence time estimates provide a foundation for addressing the relative significance of historical climatic events in the diversification of this genus. Copyright © 2015 Elsevier Inc. All rights reserved.

  8. Sequence divergence of the red and green visual pigments in great apes and humans.

    PubMed Central

    Deeb, S S; Jorgensen, A L; Battisti, L; Iwasaki, L; Motulsky, A G

    1994-01-01

    We have determined the coding sequences of red and green visual pigment genes of the chimpanzee, gorilla, and orangutan. The deduced amino acid sequences of these pigments are highly homologous to the equivalent human pigments. None of the amino acid differences occurred at sites that were previously shown to influence pigment absorption characteristics. Therefore, we predict the spectra of red and green pigments of the apes to have wavelengths of maximum absorption that differ by < 2 nm from the equivalent human pigments and that color vision in these nonhuman primates will be very similar, if not identical, to that in humans. A total of 14 within-species polymorphisms (6 involving silent substitutions) were observed in the coding sequences of the red and green pigment genes of the great apes. Remarkably, the polymorphisms at 6 of these sites had been observed in human populations, suggesting that they predated the evolution of higher primates. Alleles at polymorphic sites were often shared between the red and green pigment genes. The average synonymous rate of divergence of red from green sequences was approximately 1/10th that estimated for other proteins of higher primates, indicating the involvement of gene conversion in generating these polymorphisms. The high degree of homology and juxtaposition of these two genes on the X chromosome has promoted unequal recombination and/or gene conversion that led to sequence homogenization. However, natural selection operated to maintain the degree of separation in peak absorbance between the red and green pigments that resulted in optimal chromatic discrimination. This represents a unique case of molecular coevolution between two homologous genes that functionally interact at the behavioral level. PMID:8041777

  9. Origin and domestication of papaya Yh chromosome.

    PubMed

    VanBuren, Robert; Zeng, Fanchang; Chen, Cuixia; Zhang, Jisen; Wai, Ching Man; Han, Jennifer; Aryal, Rishi; Gschwend, Andrea R; Wang, Jianping; Na, Jong-Kuk; Huang, Lixian; Zhang, Lingmao; Miao, Wenjing; Gou, Jiqing; Arro, Jie; Guyot, Romain; Moore, Richard C; Wang, Ming-Li; Zee, Francis; Charlesworth, Deborah; Moore, Paul H; Yu, Qingyi; Ming, Ray

    2015-04-01

    Sex in papaya is controlled by a pair of nascent sex chromosomes. Females are XX, and two slightly different Y chromosomes distinguish males (XY) and hermaphrodites (XY(h)). The hermaphrodite-specific region of the Y(h) chromosome (HSY) and its X chromosome counterpart were sequenced and analyzed previously. We now report the sequence of the entire male-specific region of the Y (MSY). We used a BAC-by-BAC approach to sequence the MSY and resequence the Y regions of 24 wild males and the Y(h) regions of 12 cultivated hermaphrodites. The MSY and HSY regions have highly similar gene content and structure, and only 0.4% sequence divergence. The MSY sequences from wild males include three distinct haplotypes, associated with the populations' geographic locations, but gene flow is detected for other genomic regions. The Y(h) sequence is highly similar to one Y haplotype (MSY3) found only in wild dioecious populations from the north Pacific region of Costa Rica. The low MSY3-Y(h) divergence supports the hypothesis that hermaphrodite papaya is a product of human domestication. We estimate that Y(h) arose only ∼ 4000 yr ago, well after crop plant domestication in Mesoamerica >6200 yr ago but coinciding with the rise of the Maya civilization. The Y(h) chromosome has lower nucleotide diversity than the Y, or the genome regions that are not fully sex-linked, consistent with a domestication bottleneck. The identification of the ancestral MSY3 haplotype will expedite investigation of the mutation leading to the domestication of the hermaphrodite Y(h) chromosome. In turn, this mutation should identify the gene that was affected by the carpel-suppressing mutation that was involved in the evolution of males. © 2015 VanBuren et al.; Published by Cold Spring Harbor Laboratory Press.

  10. Origin and domestication of papaya Yh chromosome

    PubMed Central

    VanBuren, Robert; Zeng, Fanchang; Chen, Cuixia; Zhang, Jisen; Wai, Ching Man; Han, Jennifer; Aryal, Rishi; Gschwend, Andrea R.; Wang, Jianping; Na, Jong-Kuk; Huang, Lixian; Zhang, Lingmao; Miao, Wenjing; Gou, Jiqing; Arro, Jie; Guyot, Romain; Moore, Richard C.; Wang, Ming-Li; Zee, Francis; Charlesworth, Deborah; Moore, Paul H.; Yu, Qingyi; Ming, Ray

    2015-01-01

    Sex in papaya is controlled by a pair of nascent sex chromosomes. Females are XX, and two slightly different Y chromosomes distinguish males (XY) and hermaphrodites (XYh). The hermaphrodite-specific region of the Yh chromosome (HSY) and its X chromosome counterpart were sequenced and analyzed previously. We now report the sequence of the entire male-specific region of the Y (MSY). We used a BAC-by-BAC approach to sequence the MSY and resequence the Y regions of 24 wild males and the Yh regions of 12 cultivated hermaphrodites. The MSY and HSY regions have highly similar gene content and structure, and only 0.4% sequence divergence. The MSY sequences from wild males include three distinct haplotypes, associated with the populations’ geographic locations, but gene flow is detected for other genomic regions. The Yh sequence is highly similar to one Y haplotype (MSY3) found only in wild dioecious populations from the north Pacific region of Costa Rica. The low MSY3-Yh divergence supports the hypothesis that hermaphrodite papaya is a product of human domestication. We estimate that Yh arose only ∼4000 yr ago, well after crop plant domestication in Mesoamerica >6200 yr ago but coinciding with the rise of the Maya civilization. The Yh chromosome has lower nucleotide diversity than the Y, or the genome regions that are not fully sex-linked, consistent with a domestication bottleneck. The identification of the ancestral MSY3 haplotype will expedite investigation of the mutation leading to the domestication of the hermaphrodite Yh chromosome. In turn, this mutation should identify the gene that was affected by the carpel-suppressing mutation that was involved in the evolution of males. PMID:25762551

  11. The appearance of Ulva laetevirens (Ulvophyceae, Chlorophyta) in the northeast coast of the United States of America

    NASA Astrophysics Data System (ADS)

    Mao, Yunxiang; Kim, Jang Kyun; Wilson, Roderick; Yarish, Charles

    2014-10-01

    Introduced species may outcompete or hybridize with native species, resulting in the loss of native biodiversity or even alteration of ecosystem processes. In this study, we reported an alien distromatic Ulva species, which was found in an embayment (Holly Pond) connected with Long Island Sound, USA. The morphological and anatomical observations in combination with molecular data were used for its identification to species. Anatomy of collected specimens showed that the cell shape in rhizoidal and basal regions was round and the marginal teeth along the basal and median region were not found. These characteristics were primarily identical to the diagnostic characteristics of Ulva laetevirens Areschoug (Chlorophyta). The plastid-encoding tufA and nucleus-encoding ITS1 were used for its molecular identification. Phylogenetic analysis for the tufA gene placed the specimens from Holly Pond in a well-supported clade along with published sequences of U. laetevirens identified earlier, without any sequence divergence. In the ITS tree, the sample also formed well-supported clades with the sequences of U. laetevirens, with an estimated sequence divergence among the taxa in these clades as low as 1%. These findings confirmed the morpho-anatomical conclusion. Native to Australia, this species was reported in several countries along the Mediterranean coast after the late 1990s. This is the first time that U. laetevirens has been found on the northeast coast of the United States and the second record for Atlantic North America.

  12. Divergence of RNA polymerase α subunits in angiosperm plastid genomes is mediated by genomic rearrangement

    PubMed Central

    Blazier, J. Chris; Ruhlman, Tracey A.; Weng, Mao-Lun; Rehman, Sumaiyah K.; Sabir, Jamal S. M.; Jansen, Robert K.

    2016-01-01

    Genes for the plastid-encoded RNA polymerase (PEP) persist in the plastid genomes of all photosynthetic angiosperms. However, three unrelated lineages (Annonaceae, Passifloraceae and Geraniaceae) have been identified with unusually divergent open reading frames (ORFs) in the conserved region of rpoA, the gene encoding the PEP α subunit. We used sequence-based approaches to evaluate whether these genes retain function. Both gene sequences and complete plastid genome sequences were assembled and analyzed from each of the three angiosperm families. Multiple lines of evidence indicated that the rpoA sequences are likely functional despite retaining as low as 30% nucleotide sequence identity with rpoA genes from outgroups in the same angiosperm order. The ratio of non-synonymous to synonymous substitutions indicated that these genes are under purifying selection, and bioinformatic prediction of conserved domains indicated that functional domains are preserved. One of the lineages (Pelargonium, Geraniaceae) contains species with multiple rpoA-like ORFs that show evidence of ongoing inter-paralog gene conversion. The plastid genomes containing these divergent rpoA genes have experienced extensive structural rearrangement, including large expansions of the inverted repeat. We propose that illegitimate recombination, not positive selection, has driven the divergence of rpoA. PMID:27087667

  13. Alignment-free microbial phylogenomics under scenarios of sequence divergence, genome rearrangement and lateral genetic transfer.

    PubMed

    Bernard, Guillaume; Chan, Cheong Xin; Ragan, Mark A

    2016-07-01

    Alignment-free (AF) approaches have recently been highlighted as alternatives to methods based on multiple sequence alignment in phylogenetic inference. However, the sensitivity of AF methods to genome-scale evolutionary scenarios is little known. Here, using simulated microbial genome data we systematically assess the sensitivity of nine AF methods to three important evolutionary scenarios: sequence divergence, lateral genetic transfer (LGT) and genome rearrangement. Among these, AF methods are most sensitive to the extent of sequence divergence, less sensitive to low and moderate frequencies of LGT, and most robust against genome rearrangement. We describe the application of AF methods to three well-studied empirical genome datasets, and introduce a new application of the jackknife to assess node support. Our results demonstrate that AF phylogenomics is computationally scalable to multi-genome data and can generate biologically meaningful phylogenies and insights into microbial evolution.

  14. Novel RAD sequence data reveal a lack of genomic divergence between dietary ecotypes in a landlocked salmonid population

    USGS Publications Warehouse

    Limborg, Morten T.; Larson, Wesley; Shedd, Kyle; Seeb, Lisa W.; Seeb, James E.

    2017-01-01

    Preservation of heritable ecological diversity within species and populations is a key challenge for managing natural resources and wild populations. Salmonid fish are iconic and socio-economically important species for commercial, aquaculture, and recreational fisheries across the globe. Many salmonids are known to exhibit ecological divergence within species, including distinct feeding ecotypes within the same lakes. Here we used 5559 SNPs, derived from RAD sequencing, to perform population genetic comparisons between two dietary ecotypes of sockeye salmon (Oncorhynchus nerka) in Jo-Jo Lake, Alaska (USA). We tested the standing hypothesis that these two ecotypes are currently diverging as a result of adaptation to distinct dietary niches; results support earlier conclusions of a single panmictic population. The RAD sequence data revealed 40 new SNPs not previously detected in the species, and our sequence data can be used in future studies of ecotypic diversity in salmonid species.

  15. Evolution of the viral hemorrhagic septicemia virus: divergence, selection and origin.

    PubMed

    He, Mei; Yan, Xue-Chun; Liang, Yang; Sun, Xiao-Wen; Teng, Chun-Bo

    2014-08-01

    Viral hemorrhagic septicemia virus (VHSV) is an economically significant rhabdovirus that affects an increasing number of freshwater and marine fish species. Extensive studies have been conducted on the molecular epizootiology, genetic diversity, and phylogeny of VHSV. However, there are discrepancies between the reported estimates of the nucleotide substitution rate for the G gene and the divergence times for the genotypes. Herein, Bayesian coalescent analyses were conducted to the time-stamped entire coding sequences of the six VHSV genes. Rate estimates based on the G gene indicated that the marine genotypes/subtypes might not all evolve slower than their major European freshwater counterpart. Age calculations on the six genes revealed that the first bifurcation event of the analyzed isolates might have taken place within the last 300 years, which was much younger than previously thought. Selection analyses suggested that two codons of the G gene might be positively selected. Surveys of codon usage bias showed that the P, M and NV genes exhibited genotype-specific variations. Furthermore, we proposed that VHSV originated from the Pacific Northwest of North America. Copyright © 2014 Elsevier Inc. All rights reserved.

  16. The tale of a modern animal plague: Tracing the evolutionary history and determining the time-scale for foot and mouth disease virus

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tully, Damien C.; Fares, Mario A.

    2008-12-20

    Despite significant advances made in the understanding of its epidemiology, foot and mouth disease virus (FMDV) is among the most unexpected agricultural devastating plagues. While the disease manifests itself as seven immunologically distinct strains their origin, population dynamics, migration patterns and divergence times remain unknown. Herein we have assembled a comprehensive data set of gene sequences representing the global diversity of the disease and inferred the time-scale and evolutionary history for FMDV. Serotype-specific rates of evolution and divergence times were estimated using a Bayesian coalescent framework. We report that an ancient precursor FMDV gave rise to two major diversification events spanning a relatively short interval of time. This radiation event is estimated to have taken place towards the end of the 17th and the beginning of the 18th century giving us the present circulating Euro-Asiatic and South African viral strains. Furthermore our results hint that Europe acted as a possible hub for the disease from where it successfully dispersed elsewhere via exploration and trading routes.

  17. Karyotype Stability and Unbiased Fractionation in the Paleo-Allotetraploid Cucurbita Genomes.

    PubMed

    Sun, Honghe; Wu, Shan; Zhang, Guoyu; Jiao, Chen; Guo, Shaogui; Ren, Yi; Zhang, Jie; Zhang, Haiying; Gong, Guoyi; Jia, Zhangcai; Zhang, Fan; Tian, Jiaxing; Lucas, William J; Doyle, Jeff J; Li, Haizhen; Fei, Zhangjun; Xu, Yong

    2017-10-09

    The Cucurbita genus contains several economically important species in the Cucurbitaceae family. Here, we report high-quality genome sequences of C. maxima and C. moschata and provide evidence supporting an allotetraploidization event in Cucurbita. We are able to partition the genome into two homoeologous subgenomes based on different genetic distances to melon, cucumber, and watermelon in the Benincaseae tribe. We estimate that the two diploid progenitors successively diverged from Benincaseae around 31 and 26 million years ago (Mya), respectively, and the allotetraploidization happened at some point between 26 Mya and 3 Mya, the estimated date when C. maxima and C. moschata diverged. The subgenomes have largely maintained the chromosome structures of their diploid progenitors. Such long-term karyotype stability after polyploidization has not been commonly observed in plant polyploids. The two subgenomes have retained similar numbers of genes, and neither subgenome is globally dominant in gene expression. Allele-specific expression analysis in the C. maxima × C. moschata interspecific F1 hybrid and their two parents indicates the predominance of trans-regulatory effects underlying expression divergence of the parents, and detects transgressive gene expression changes in the hybrid correlated with heterosis in important agronomic traits. Our study provides insights into polyploid genome evolution and valuable resources for genetic improvement of cucurbit crops. Copyright © 2017 The Author. Published by Elsevier Inc. All rights reserved.

  18. Molecular phylogenetic analysis of non-sexually transmitted strains of Haemophilus ducreyi.

    PubMed

    Gaston, Jordan R; Roberts, Sally A; Humphreys, Tricia L

    2015-01-01

    Haemophilus ducreyi, the etiologic agent of chancroid, has been previously reported to show genetic variance in several key virulence factors, placing strains of the bacterium into two genetically distinct classes. Recent studies done in yaws-endemic areas of the South Pacific have shown that H. ducreyi is also a major cause of cutaneous limb ulcers (CLU) that are not sexually transmitted. To genetically assess CLU strains relative to the previously described class I, class II phylogenetic hierarchy, we examined nucleotide sequence diversity at 11 H. ducreyi loci, including virulence and housekeeping genes, which encompass approximately 1% of the H. ducreyi genome. Sequences for all 11 loci indicated that strains collected from leg ulcers exhibit DNA sequences homologous to class I strains of H. ducreyi. However, sequences for 3 loci, including a hemoglobin receptor (hgbA), serum resistance protein (dsrA), and a collagen adhesin (ncaA) contained informative amounts of variation. Phylogenetic analyses suggest that these non-sexually transmitted strains of H. ducreyi comprise a sub-clonal population within class I strains of H. ducreyi. Molecular dating suggests that CLU strains are the most recently developed, having diverged approximately 0.355 million years ago, fourteen times more recently than the class I/class II divergence. The CLU strains' divergence falls after the divergence of humans from chimpanzees, making it the first known H. ducreyi divergence event directly influenced by the selective pressures accompanying human hosts.

  19. Codon Usage Selection Can Bias Estimation of the Fraction of Adaptive Amino Acid Fixations.

    PubMed

    Matsumoto, Tomotaka; John, Anoop; Baeza-Centurion, Pablo; Li, Boyang; Akashi, Hiroshi

    2016-06-01

    A growing number of molecular evolutionary studies are estimating the proportion of adaptive amino acid substitutions (α) from comparisons of ratios of polymorphic and fixed DNA mutations. Here, we examine how violations of two of the model assumptions, neutral evolution of synonymous mutations and stationary base composition, affect α estimation. We simulated the evolution of coding sequences assuming weak selection on synonymous codon usage bias and neutral protein evolution, α = 0. We show that weak selection on synonymous mutations can give polymorphism/divergence ratios that yield α-hat (estimated α) considerably larger than its true value. Nonstationary evolution (changes in population size, selection, or mutation) can exacerbate such biases or, in some scenarios, give biases in the opposite direction, α-hat < α. These results demonstrate that two factors that appear to be prevalent among taxa, weak selection on synonymous mutations and non-steady-state nucleotide composition, should be considered when estimating α. Estimates of the proportion of adaptive amino acid fixations from large-scale analyses of Drosophila melanogaster polymorphism and divergence data are positively correlated with codon usage bias. Such patterns are consistent with α-hat inflation from weak selection on synonymous mutations and/or mutational changes within the examined gene trees. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
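
    The kind of estimate discussed in this abstract can be illustrated with the widely used McDonald-Kreitman-style formula, alpha = 1 - (Ds * Pn) / (Dn * Ps), where Dn and Ds are nonsynonymous and synonymous fixed differences and Pn and Ps are the corresponding polymorphism counts. The abstract does not say which estimator the authors used, so the formula is an assumption here, and the counts below are hypothetical numbers picked for arithmetic convenience. The second call simply lowers Ds by hand to mimic a deficit of synonymous fixations; it is not a simulation of the authors' model.

```python
def mk_alpha(dn, ds, pn, ps):
    """McDonald-Kreitman-style estimate of the adaptive fraction of
    amino acid substitutions: 1 - (Ds * Pn) / (Dn * Ps)."""
    if dn <= 0 or ps <= 0:
        raise ValueError("Dn and Ps must be positive")
    return 1.0 - (ds * pn) / (dn * ps)


# Hypothetical neutral case: the divergence ratio Dn/Ds equals the
# polymorphism ratio Pn/Ps, so the estimate is zero.
neutral = mk_alpha(dn=50, ds=100, pn=25, ps=50)

# Hypothetical case in which synonymous fixations are reduced (Ds = 80)
# while protein evolution is still neutral: the estimate becomes
# positive although no adaptive substitutions were added.
biased = mk_alpha(dn=50, ds=80, pn=25, ps=50)

print(f"alpha-hat, neutral counts:      {neutral:.3f}")
print(f"alpha-hat, reduced Ds (assumed): {biased:.3f}")
```

    Because synonymous counts enter the formula only through the ratio Ds/Ps, anything that shifts that ratio changes the estimate even when nonsynonymous counts are untouched.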

  20. Molecular phylogenetic analysis of the Persea group (Lauraceae) and its biogeographic implications on the evolution of tropical and subtropical Amphi-Pacific disjunctions.

    PubMed

    Li, Lang; Li, Jie; Rohwer, Jens G; van der Werff, Henk; Wang, Zhi-Hua; Li, Hsi-Wen

    2011-09-01

    The Persea group (Lauraceae) has a tropical and subtropical amphi-pacific disjunct distribution with most of its members, and it includes two Macaronesian species. The relationships within the group are still controversial, and its intercontinental disjunction has not been investigated with extensive sampling and precise time dating. • ITS and LEAFY intron II sequences of 78 Persea group species and nine other Lauraceae species were analyzed with maximum parsimony and Bayesian inference. Divergence time estimation employed Bayesian Markov chain Monte Carlo method under a relaxed clock. • Several traditional genera or subgenera within the Persea group form well-supported monophyletic groups except Alseodaphne and Dehaasia. The divergence time of the Persea group is estimated as ∼55.3 (95% highest posterior density [HPD] 41.4-69.9) million years ago (mya). Two major divergences within the Persea group are estimated as ∼51.9 (95% HPD 38.9-63.9) mya and ∼48.5 (95% HPD 35.9-59.9) mya. • Persea can be retained as a genus by the inclusion of Apollonias barbujana and exclusion of a few species that do not fit into the established subgenera. A major revision is recommended for the delimitation between Alseodaphne, Dehaasia, and Nothaphoebe. We suggest that the Persea group originated from the Perseeae-Laureae radiation in early Eocene Laurasia. Its amphi-pacific disjunction results from the disruption of boreotropical flora by climatic cooling during the mid- to late Eocene. The American-Macaronesian disjunction may be explained by the long-distance dispersal.

  1. Mesozoic fossils (>145 Mya) suggest the antiquity of the subgenera of Daphnia and their coevolution with chaoborid predators.

    PubMed

    Kotov, Alexey A; Taylor, Derek J

    2011-05-19

    The timescale of the origins of Daphnia O. F. Mueller (Crustacea: Cladocera) remains controversial. The origin of the two main subgenera has been associated with the breakup of the supercontinent Pangaea. This vicariance hypothesis is supported by reciprocal monophyly, present day associations with the former Gondwanaland and Laurasia regions, and mitochondrial DNA divergence estimates. However, previous multilocus nuclear DNA sequence divergence estimates at < 10 Million years are inconsistent with the breakup of Pangaea. We examined new and existing cladoceran fossils from a Mesozoic Mongolian site, in hopes of gaining insights into the timescale of the evolution of Daphnia. We describe new fossils of ephippia from the Khotont site in Mongolia associated with the Jurassic-Cretaceous boundary (about 145 MYA) that are morphologically similar to several modern genera of the family Daphniidae, including the two major subgenera of Daphnia, i.e., Daphnia s. str. and Ctenodaphnia. The daphniid fossils co-occurred with fossils of the predaceous phantom midge (Chaoboridae). Our findings indicate that the main subgenera of Daphnia are likely much older than previously known from fossils (at least 100 MY older) or from nuclear DNA estimates of divergence. The results showing co-occurrence of the main subgenera far from the presumed Laurasia/Gondwanaland dispersal barrier shortly after formation suggests that vicariance from the breakup of Pangaea is an unlikely explanation for the origin of the main subgenera. The fossil impressions also reveal that the coevolution of a dipteran predator (Chaoboridae) with the subgenus Daphnia is much older than previously known -- since the Mesozoic.

  2. DNA Barcoding in Pencilfishes (Lebiasinidae: Nannostomus) Reveals Cryptic Diversity across the Brazilian Amazon

    PubMed Central

    Benzaquem, Denise Corrêa; Oliveira, Claudio; da Silva Batista, Jaqueline; Zuanon, Jansen; Porto, Jorge Ivan Rebelo

    2015-01-01

    Nannostomus is comprised of 20 species. Popularly known as pencilfishes, the vast majority of these species live in the flooded forests of the Amazon basin and are popular in the ornamental trade. Among the lebiasinids, it is the only genus to have undergone more than one taxonomic revision. Even so, it still possesses poorly defined species. Here, we report the results of an application of DNA barcoding to the identification of pencilfishes and highlight the deeply divergent clades within four nominal species. We surveyed the sequence variation in the mtDNA cytochrome c oxidase subunit I gene among 110 individuals representing 14 nominal species that were collected from several rivers along the Amazon basin. The mean Kimura-2-parameter distances within species and genus were 2% and 19.0%, respectively. The deep lineage divergences detected in N. digrammus, N. trifasciatus, N. unifasciatus and N. eques suggest the existence of hidden diversity in Nannostomus species. For N. digrammus and N. trifasciatus, in particular, the estimated divergences in some lineages were so high that doubt about their conspecific status is raised. PMID:25658694
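
    The Kimura-2-parameter distance named in this abstract has a closed form: with P the proportion of sites that differ by a transition and Q the proportion that differ by a transversion, d = -(1/2) ln(1 - 2P - Q) - (1/4) ln(1 - 2Q). The sketch below applies that formula to two short made-up aligned sequences. The sequences and the function name `k2p_distance` are assumptions for illustration, and skipping sites with gaps or ambiguity codes is one possible convention, not necessarily the one used in the study.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(seq1, seq2):
    """Kimura-2-parameter distance between two aligned DNA sequences."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")

    transitions = transversions = compared = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue  # skip gaps and ambiguity codes
        compared += 1
        if a == b:
            continue
        # A<->G and C<->T are transitions; every other change is a transversion.
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1
        else:
            transversions += 1

    if compared == 0:
        raise ValueError("no comparable sites")

    p = transitions / compared
    q = transversions / compared
    term1 = 1.0 - 2.0 * p - q
    term2 = 1.0 - 2.0 * q
    if term1 <= 0 or term2 <= 0:
        raise ValueError("sequences too divergent for the K2P correction")
    return -0.5 * math.log(term1) - 0.25 * math.log(term2)


# Hypothetical 20-bp alignment with two transitions and one transversion.
seq_a = "ACGTACGTACGTACGTACGT"
seq_b = "GCGTATGTACTTACGTACGT"

raw = sum(x != y for x, y in zip(seq_a, seq_b)) / len(seq_a)
print(f"uncorrected p-distance: {raw:.3f}")
print(f"K2P distance:           {k2p_distance(seq_a, seq_b):.3f}")
```

    The corrected distance exceeds the raw proportion of differing sites because the model accounts for multiple substitutions at the same site.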

  3. Recent oceanic long-distance dispersal and divergence in the amphi-Atlantic rain forest genus Renealmia L.f. (Zingiberaceae).

    PubMed

    Särkinen, Tiina E; Newman, Mark F; Maas, Paul J M; Maas, Hiltje; Poulsen, Axel D; Harris, David J; Richardson, James E; Clark, Alexandra; Hollingsworth, Michelle; Pennington, R Toby

    2007-09-01

    Renealmia L.f. (Zingiberaceae) is one of the few tropical plant genera with numerous species in both Africa and South America but not in Asia. Based on phylogenetic analysis of nuclear ribosomal internal transcribed spacer (ITS) and chloroplast trnL-F DNA, Renealmia is shown to be monophyletic with high branch support. Low sequence divergence found in the two genome regions (ITS: 0-2.4%; trnL-F: 0-1.9%) suggests recent diversification within the genus. Molecular divergence age estimates give further support to the recent origin of the genus and show that Renealmia has attained its amphi-Atlantic distribution by an oceanic long-distance dispersal event from Africa to South America during the Miocene or Pliocene (15.8-2.7 My ago). Some support is found for the hypothesis that speciation in neotropical Renealmia was influenced by the Andean orogeny. Speciation has been approximately simultaneous on both sides of the Atlantic, but increased taxon sampling is required to compare the speciation rates between the New World and Old World tropics.

  4. Phylogenomic Analyses Indicate that Early Fungi Evolved Digesting Cell Walls of Algal Ancestors of Land Plants

    PubMed Central

    Chang, Ying; Wang, Sishuo; Sekimoto, Satoshi; Aerts, Andrea L.; Choi, Cindy; Clum, Alicia; LaButti, Kurt M.; Lindquist, Erika A.; Yee Ngan, Chew; Ohm, Robin A.; Salamov, Asaf A.; Grigoriev, Igor V.; Spatafora, Joseph W.; Berbee, Mary L.

    2015-01-01

    As decomposers, fungi are key players in recycling plant material in global carbon cycles. We hypothesized that genomes of early diverging fungi may have inherited pectinases from an ancestral species that had been able to extract nutrients from pectin-containing land plants and their algal allies (Streptophytes). We aimed to infer, based on pectinase gene expansions and on the organismal phylogeny, the geological timing of the plant–fungus association. We analyzed 40 fungal genomes, three of which, including Gonapodya prolifera, were sequenced for this study. In the organismal phylogeny from 136 housekeeping loci, Rozella diverged first from all other fungi. Gonapodya prolifera was included among the flagellated, predominantly aquatic fungal species in Chytridiomycota. Sister to Chytridiomycota were the predominantly terrestrial fungi including zygomycota I and zygomycota II, along with the ascomycetes and basidiomycetes that comprise Dikarya. The Gonapodya genome has 27 genes representing five of the seven classes of pectin-specific enzymes known from fungi. Most of these share a common ancestry with pectinases from Dikarya. Indicating functional and sequence similarity, Gonapodya, like many Dikarya, can use pectin as a carbon source for growth in pure culture. Shared pectinases of Dikarya and Gonapodya provide evidence that even ancient aquatic fungi had adapted to extract nutrients from the plants in the green lineage. This implies that 750 million years, the estimated maximum age of origin of the pectin-containing streptophytes, represents a maximum age for the divergence of Chytridiomycota from the lineage including Dikarya. PMID:25977457

  5. Hybridization and massive mtDNA unidirectional introgression between the closely related Neotropical toads Rhinella marina and R. schneideri inferred from mtDNA and nuclear markers

    PubMed Central

    2011-01-01

    Background: The classical perspective that interspecific hybridization in animals is rare has been changing due to a growing list of empirical examples showing the occurrence of gene flow between closely related species. Using sequence data from cyt b mitochondrial gene and three intron nuclear genes (RPL9, c-myc, and RPL3) we investigated patterns of nucleotide polymorphism and divergence between two closely related toad species R. marina and R. schneideri. By comparing levels of differentiation at nuclear and mtDNA levels we were able to describe patterns of introgression and infer the history of hybridization between these species. Results: All nuclear loci are essentially concordant in revealing two well differentiated groups of haplotypes, corresponding to the morphologically-defined species R. marina and R. schneideri. Mitochondrial DNA analysis also revealed two well-differentiated groups of haplotypes but, in stark contrast with the nuclear genealogies, all R. schneideri sequences are clustered with sequences of R. marina from the right Amazon bank (RAB), while R. marina sequences from the left Amazon bank (LAB) are monophyletic. An Isolation-with-Migration (IM) analysis using nuclear data showed that R. marina and R. schneideri diverged at ≈ 1.69 Myr (early Pleistocene), while R. marina populations from LAB and RAB diverged at ≈ 0.33 Myr (middle Pleistocene). This time of divergence is not consistent with the split between LAB and RAB populations obtained with mtDNA data (≈ 1.59 Myr), which is notably similar to the estimate obtained with nuclear genes between R. marina and R. schneideri. Coalescent simulations of mtDNA phylogeny under the speciation history inferred from nuclear genes rejected the hypothesis of incomplete lineage sorting to explain the conflicting signal between mtDNA and nuclear-based phylogenies. Conclusions: The cytonuclear discordance seems to reflect the occurrence of interspecific hybridization between these two closely related toad species. Overall, our results suggest a phenomenon of extensive mtDNA unidirectional introgression from the previously occurring R. schneideri into the invading R. marina. We hypothesize that climatic-induced range shifts during the Pleistocene/Holocene may have played an important role in the observed patterns of introgression. PMID:21939538

  6. History, geography and host use shape genomewide patterns of genetic variation in the redheaded pine sawfly (Neodiprion lecontei).

    PubMed

    Bagley, Robin K; Sousa, Vitor C; Niemiller, Matthew L; Linnen, Catherine R

    2017-02-01

    Divergent host use has long been suspected to drive population differentiation and speciation in plant-feeding insects. Evaluating the contribution of divergent host use to genetic differentiation can be difficult, however, as dispersal limitation and population structure may also influence patterns of genetic variation. In this study, we use double-digest restriction-associated DNA (ddRAD) sequencing to test the hypothesis that divergent host use contributes to genetic differentiation among populations of the redheaded pine sawfly (Neodiprion lecontei), a widespread pest that uses multiple Pinus hosts throughout its range in eastern North America. Because this species has a broad range and specializes on host plants known to have migrated extensively during the Pleistocene, we first assess overall genetic structure using model-based and model-free clustering methods and identify three geographically distinct genetic clusters. Next, using a composite-likelihood approach based on the site frequency spectrum and a novel strategy for maximizing the utility of linked RAD markers, we infer the population topology and date divergence to the Pleistocene. Based on existing knowledge of Pinus refugia, estimated demographic parameters and patterns of diversity among sawfly populations, we propose a Pleistocene divergence scenario for N. lecontei. Finally, using Mantel and partial Mantel tests, we identify a significant relationship between genetic distance and geography in all clusters, and between genetic distance and host use in two of three clusters. Overall, our results indicate that Pleistocene isolation, dispersal limitation and ecological divergence all contribute to genomewide differentiation in this species and support the hypothesis that host use is a common driver of population divergence in host-specialized insects. © 2016 John Wiley & Sons Ltd.

  7. Population genomics provide insights into the evolution and adaptation of the eastern honey bee (Apis cerana).

    PubMed

    Chen, Chao; Wang, Huihua; Liu, Zhiguang; Chen, Xiao; Tang, Jiao; Meng, Fanming; Shi, Wei

    2018-06-20

    The mechanisms by which organisms adapt to variable environments are a fundamental question in evolutionary biology and are important for protecting key species in response to a changing climate. An interesting candidate to study this question is the honey bee Apis cerana, a keystone pollinator with a wide distribution throughout a large variety of climates, that exhibits rapid dispersal. Here, we re-sequenced the genome of 180 A. cerana individuals from eighteen populations throughout China. Using a population genomics approach, we observed considerable genetic variation in A. cerana. Patterns of genetic differentiation indicate high divergence at the subspecies level, and physical barriers rather than distance are the driving force for population divergence. Estimations of divergence time suggested that the main branches diverged between 300 and 500 ka. Analyses of the population history revealed a substantial influence of the Earth's climate on the effective population size of A. cerana, as increased population sizes were observed during warmer periods. Further analyses identified candidate genes under natural selection that are potentially related to honey bee cognition, temperature adaptation, and olfaction. Based on our results, A. cerana may have great potential to respond to climate change. Our study provides fundamental knowledge of the evolution and adaptation of A. cerana.

  8. Multilocus analysis of nucleotide variation and speciation in three closely related Populus (Salicaceae) species.

    PubMed

    Du, Shuhui; Wang, Zhaoshan; Ingvarsson, Pär K; Wang, Dongsheng; Wang, Junhui; Wu, Zhiqiang; Tembrock, Luke R; Zhang, Jianguo

    2015-10-01

    Historical tectonism and climate oscillations can isolate and contract the geographical distributions of many plant species, and they are even known to trigger species divergence and ultimately speciation. Here, we estimated the nucleotide variation and speciation in three closely related Populus species, Populus tremuloides, P. tremula and P. davidiana, distributed in North America and Eurasia. We analysed the sequence variation in six single-copy nuclear loci and three chloroplast (cpDNA) fragments in 497 individuals sampled from 33 populations of these three species across their geographic distributions. These three Populus species harboured relatively high levels of nucleotide diversity and showed high levels of nucleotide differentiation. Phylogenetic analysis revealed that P. tremuloides diverged earlier than the other two species. The cpDNA haplotype network result clearly illustrated the dispersal route from North America to eastern Asia and then into Europe. Molecular dating results confirmed that the divergence of these three species coincided with the sundering of the Bering land bridge in the late Miocene and a rapid uplift of the Qinghai-Tibetan Plateau around the Miocene/Pliocene boundary. Vicariance-driven successful allopatric speciation resulting from historical tectonism and climate oscillations most likely played roles in the formation of the disjunct distributions and divergence of these three Populus species. © 2015 John Wiley & Sons Ltd.

  9. Divergence of Gene Body DNA Methylation and Evolution of Plant Duplicate Genes

    PubMed Central

    Wang, Jun; Marowsky, Nicholas C.; Fan, Chuanzhu

    2014-01-01

    It has been shown that gene body DNA methylation is associated with gene expression. However, whether and how deviation of gene body DNA methylation between duplicate genes can influence their divergence remains largely unexplored. Here, we aim to elucidate the potential role of gene body DNA methylation in the fate of duplicate genes. We identified paralogous gene pairs from Arabidopsis and rice (Oryza sativa ssp. japonica) genomes and reprocessed their single-base resolution methylome data. We show that methylation in paralogous genes nonlinearly correlates with several gene properties including exon number/gene length, expression level and mutation rate. Further, we demonstrated that divergence of methylation level and pattern in paralogs indeed positively correlate with their sequence and expression divergences. This result held even after controlling for other confounding factors known to influence the divergence of paralogs. We observed that methylation level divergence might be more relevant to the expression divergence of paralogs than methylation pattern divergence. Finally, we explored the mechanisms that might give rise to the divergence of gene body methylation in paralogs. We found that exonic methylation divergence more closely correlates with expression divergence than intronic methylation divergence. We show that genomic environments (e.g., flanked by transposable elements and repetitive sequences) of paralogs generated by various duplication mechanisms are associated with the methylation divergence of paralogs. Overall, our results suggest that the changes in gene body DNA methylation could provide another avenue for duplicate genes to develop differential expression patterns and undergo different evolutionary fates in plant genomes. PMID:25310342

  10. Discordant genetic diversity and geographic patterns between Crassicutis cichlasomae (Digenea: Apocreadiidae) and its cichlid host, "Cichlasoma" urophthalmus (Osteichthyes: Cichlidae), in Middle-America.

    PubMed

    Razo-Mendivil, Ulises; Vázquez-Domínguez, Ella; de León, Gerardo Pérez-Ponce

    2013-12-01

    Genetic analyses of hosts and their parasites are key to understanding the evolutionary patterns and processes that have shaped host-parasite associations. We evaluated the genetic structure of the digenean Crassicutis cichlasomae and its most common host, the Mayan cichlid "Cichlasoma" urophthalmus, encompassing most of their geographical range in Middle-America (river basins in southeastern Mexico, Belize, and Guatemala together with the Yucatan Peninsula). Genetic diversity and structure analyses were done based on 167 cytochrome c oxidase subunit 1 sequences (330 bp) for C. cichlasomae from 21 populations and 161 cytochrome b sequences (599 bp) for "C." urophthalmus from 26 populations. Analyses performed included phylogenetic tree estimation under Bayesian inference and maximum likelihood analysis, genetic diversity, distance and structure estimates, haplotype networks, and demographic evaluations. Crassicutis cichlasomae showed high genetic diversity values and genetic structuring, corresponding with 4 groups clearly differentiated and highly divergent. Conversely, "C." urophthalmus showed low levels of genetic diversity and genetic differentiation, defined as 2 groups with low divergence and with no correspondence with geographical distribution. Our results show that species of cichlids parasitized by C. cichlasomae other than "C." urophthalmus, along with multiple colonization events and subsequent isolation in different basins, are likely factors that shaped the genetic structure of the parasite. Meanwhile, historical long-distance dispersal and drought periods during the Holocene, with significant population size reductions and fragmentations, are factors that could have shaped the genetic structure of the Mayan cichlid.

  11. Mitochondrial Divergence between Western and Eastern Great Bustards: Implications for Conservation and Species Status.

    PubMed

    Kessler, Aimee Elizabeth; Santos, Malia A; Flatz, Ramona; Batbayar, Nyambayar; Natsagdorj, Tseveenmyadag; Batsuur, Dashnyam; Bidashko, Fyodor G; Galbadrakh, Natsag; Goroshko, Oleg; Khrokov, Valery V; Unenbat, Tuvshin; Vagner, Ivan I; Wang, Muyang; Smith, Christopher Irwin

    2018-06-02

    The Great Bustard is the heaviest bird capable of flight and an iconic species of the Eurasian steppe. Populations of both currently recognized subspecies are highly fragmented and critically small in Asia. We used DNA sequence data from the mitochondrial cytochrome b gene and the mitochondrial control region to estimate the degree of mitochondrial differentiation and rates of female gene flow between the subspecies. We obtained genetic samples from 51 individuals of Otis tarda dybowskii representing multiple populations, including the first samples from Kazakhstan and Mongolia and samples from near the Altai Mountains, the proposed geographic divide between the subspecies, allowing for better characterization of the boundary between the two subspecies. We compared these with existing sequence data (n=66) from O. t. tarda. Our results suggest, though do not conclusively prove, that O. t. dybowskii and O. t. tarda may be distinct species. The geographic distribution of haplotypes, phylogenetic analysis, analyses of molecular variance, and coalescent estimation of divergence time and female migration rates indicate that O. t. tarda and O. t. dybowskii are highly differentiated in the mitochondrial genome, have been isolated for approximately 1.4 million years, and exchange much less than one female migrant per generation. Our findings indicate that the two forms should at least be recognized and managed as separate evolutionary units. Populations in Xinjiang, China and Khövsgöl and Bulgan, Mongolia exhibited the highest levels of genetic diversity and should be prioritized in conservation planning.

  12. Dating of the human-ape splitting by a molecular clock of mitochondrial DNA.

    PubMed

    Hasegawa, M; Kishino, H; Yano, T

    1985-01-01

    A new statistical method for estimating divergence dates of species from DNA sequence data by a molecular clock approach is developed. This method takes into account effectively the information contained in a set of DNA sequence data. The molecular clock of mitochondrial DNA (mtDNA) was calibrated by setting the date of divergence between primates and ungulates at the Cretaceous-Tertiary boundary (65 million years ago), when the extinction of dinosaurs occurred. A generalized least-squares method was applied in fitting a model to mtDNA sequence data, and the clock gave dates of 92.3 +/- 11.7, 13.3 +/- 1.5, 10.9 +/- 1.2, 3.7 +/- 0.6, and 2.7 +/- 0.6 million years ago (where the second of each pair of numbers is the standard deviation) for the separation of mouse, gibbon, orangutan, gorilla, and chimpanzee, respectively, from the line leading to humans. Although there is some uncertainty in the clock, this dating may pose a problem for the widely believed hypothesis that the bipedal creature Australopithecus afarensis, which lived some 3.7 million years ago at Laetoli in Tanzania and at Hadar in Ethiopia, was ancestral to man and evolved after the human-ape splitting. Another likelier possibility is that mtDNA was transferred through hybridization between a proto-human and a proto-chimpanzee after the former had developed bipedalism.
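
    The calibration idea in this record can be illustrated with a minimal strict-clock sketch. It is not the generalized least-squares method of the abstract: it assumes a single constant rate and simply divides a pairwise distance by twice that rate. The function names and both distance values are hypothetical, chosen only so the arithmetic is easy to follow; only the 65 million year calibration date comes from the abstract.

```python
def calibrate_rate(distance, calibration_time_myr):
    """Substitutions per site per million years along one lineage.

    A pairwise distance accumulates on both lineages since the split,
    hence the factor of two.
    """
    return distance / (2.0 * calibration_time_myr)


def divergence_time(distance, rate):
    """Divergence time (million years) implied by a strict clock."""
    return distance / (2.0 * rate)


# Hypothetical corrected distance for the calibration pair (illustrative only).
calibration_distance = 0.26
rate = calibrate_rate(calibration_distance, 65.0)

# Hypothetical distance for a pair whose split we want to date.
query_distance = 0.04
print(f"rate = {rate:.4f} subst/site/Myr")
print(f"estimated split = {divergence_time(query_distance, rate):.1f} Myr ago")
```

    In practice distances would first be corrected for multiple substitutions, and the uncertainty in both the calibration and the clock would be propagated into the date, which is what the standard deviations quoted above reflect.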

  13. High levels of Y-chromosome nucleotide diversity in the genus Pan

    PubMed Central

    Stone, Anne C.; Griffiths, Robert C.; Zegura, Stephen L.; Hammer, Michael F.

    2002-01-01

    Although some mitochondrial, X chromosome, and autosomal sequence diversity data are available for our closest relatives, Pan troglodytes and Pan paniscus, data from the nonrecombining portion of the Y chromosome (NRY) are more limited. We examined ≈3 kb of NRY DNA from 101 chimpanzees, seven bonobos, and 42 humans to investigate: (i) relative levels of intraspecific diversity; (ii) the degree of paternal lineage sorting among species and subspecies of the genus Pan; and (iii) the date of the chimpanzee/bonobo divergence. We identified 10 informative sequence-tagged sites associated with 23 polymorphisms on the NRY from the genus Pan. Nucleotide diversity was significantly higher on the NRY of chimpanzees and bonobos than on the human NRY. Similar to mtDNA, but unlike X-linked and autosomal loci, lineages defined by mutations on the NRY were not shared among subspecies of P. troglodytes. Comparisons with mtDNA ND2 sequences from some of the same individuals revealed a larger female versus male effective population size for chimpanzees. The NRY-based divergence time between chimpanzees and bonobos was estimated at ≈1.8 million years ago. In contrast to human populations who appear to have had a low effective size and a recent origin with subsequent population growth, some taxa within the genus Pan may be characterized by large populations of relatively constant size, more ancient origins, and high levels of subdivision. PMID:11756656

  14. Genomic analysis of a new mammalian distal-less gene: Dlx7.

    PubMed

    Nakamura, S; Stock, D W; Wydner, K L; Bollekens, J A; Takeshita, K; Nagai, B M; Chiba, S; Kitamura, T; Freeland, T M; Zhao, Z; Minowada, J; Lawrence, J B; Weiss, K M; Ruddle, F H

    1996-12-15

    We have cloned a new Dlx gene (Dlx7) from human and mouse that may represent the mammalian orthologue of the newt gene NvHBox-5. The homeodomains of these genes are highly similar to all other vertebrate Dlx genes, and regions of similarity also exist between mammalian Dlx7 and a subset of vertebrate Dlx genes downstream of the homeodomain. The sequence divergence between human and mouse Dlx7 in these regions is greater than that predicted from comparisons of other vertebrate Dlx genes, however, and there is little sequence similarity upstream of the homeodomain both between these two genes and with other Dlx genes. We present evidence for alternative splicing of mouse Dlx7 upstream of the homeodomain that may account for some of this divergence. We have mapped human DLX7 distal to the 5' end of the HOXB cluster at an estimated distance of between 1 and 2 Mb by FISH. Both the human and the mouse Dlx7 are shown to be closely linked to Dlx3 in a convergently transcribed orientation. These mapping results support the possibility that vertebrate distal-less genes have been duplicated in concert with the Hox clusters.

  15. SD-MSAEs: Promoter recognition in human genome based on deep feature extraction.

    PubMed

    Xu, Wenxuan; Zhang, Li; Lu, Yaping

    2016-06-01

    The prediction and recognition of promoters in the human genome play an important role in DNA sequence analysis. Shannon entropy, from information theory, is a versatile tool in bioinformatic analysis. Relative entropy estimators based on statistical divergence (SD) are used to extract meaningful features that distinguish different regions of DNA sequences. In this paper, we choose context features and use a set of SD methods to select the most effective n-mers for distinguishing promoter regions from other DNA regions in the human genome. From the total possible combinations of n-mers, we obtain four sparse distributions based on promoter and non-promoter training samples. The informative n-mers are selected by optimizing the extent to which these distributions differ. Specifically, we combine the advantages of statistical divergence and multiple sparse auto-encoders (MSAEs) in deep learning to extract deep features for promoter recognition. We then apply multiple SVMs and a decision model to construct a human promoter recognition method called SD-MSAEs. The framework is flexible in that it can freely integrate new feature extraction or classification models. Experimental results show that our method has high sensitivity and specificity. Copyright © 2016 Elsevier Inc. All rights reserved.
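
    The n-mer selection step can be sketched roughly as follows. This is an illustration of ranking n-mers by their contribution to the Kullback-Leibler divergence (relative entropy) between two n-mer distributions, not the SD-MSAEs pipeline itself: the abstract does not specify the exact divergence measures or the optimization used. The function names, the pseudocount, and the toy sequences are all assumptions made for this example.

```python
import itertools
import math
from collections import Counter

ALPHABET = "ACGT"


def nmer_distribution(seqs, n, pseudocount=1.0):
    """Smoothed relative frequency of every possible n-mer in seqs."""
    all_nmers = ["".join(p) for p in itertools.product(ALPHABET, repeat=n)]
    counts = Counter()
    for seq in seqs:
        for i in range(len(seq) - n + 1):
            nmer = seq[i:i + n]
            if set(nmer) <= set(ALPHABET):  # skip windows with N or other symbols
                counts[nmer] += 1
    total = sum(counts.values()) + pseudocount * len(all_nmers)
    return {k: (counts[k] + pseudocount) / total for k in all_nmers}


def kl_contributions(p, q):
    """Per-n-mer terms of KL(p || q) in bits; their sum is the divergence."""
    return {k: p[k] * math.log2(p[k] / q[k]) for k in p}


def top_nmers(p, q, m):
    """The m n-mers that contribute most to KL(p || q)."""
    contrib = kl_contributions(p, q)
    return sorted(contrib, key=contrib.get, reverse=True)[:m]


# Toy sequences (hypothetical, not real promoter data).
promoter_like = ["TATAAATATA"]
background = ["GCGCGCGCGC"]

p = nmer_distribution(promoter_like, 2)
q = nmer_distribution(background, 2)
print("KL(p || q) =", round(sum(kl_contributions(p, q).values()), 3), "bits")
print("most discriminative 2-mers:", top_nmers(p, q, 3))
```

    The pseudocount keeps every probability positive so the logarithm is always defined; the selected n-mers would then serve as input features for a downstream classifier.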

  16. Genetic variation and evolutionary demography of Fenneropenaeus chinensis populations, as revealed by the analysis of mitochondrial control region sequences

    PubMed Central

    2010-01-01

    Genetic variation and evolutionary demography of the shrimp Fenneropenaeus chinensis were investigated using sequence data of the complete mitochondrial control region (CR). Fragments of 993 bp of the CR were sequenced for 93 individuals from five localities over most of the species' range in the Yellow Sea and the Bohai Sea. There were 84 variable sites defining 68 haplotypes. Haplotype diversity levels were very high (0.95 ± 0.03-0.99 ± 0.02) in F. chinensis populations, whereas those of nucleotide diversity were moderate to low (0.66 ± 0.36%-0.84 ± 0.46%). Analysis of molecular variance and conventional population statistics (FST) revealed no significant genetic structure throughout the range of F. chinensis. Mismatch distribution, estimates of population parameters and neutrality tests revealed that the significant fluctuations and shallow coalescence of mtDNA genealogies observed were coincident with estimated demographic parameters and neutrality tests, in implying important past-population size fluctuations or range expansion. Isolation with Migration (IM) coalescence results suggest that F. chinensis, distributed along the coasts of northern China and the Korean Peninsula (about 1000 km apart), diverged recently, the estimated time-split being 12,800 (7,400-18,600) years ago. PMID:21637498
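
    The two summary statistics reported in this record can be computed from aligned sequences as sketched below, using the standard definitions attributed to Nei: haplotype diversity h = n/(n-1) * (1 - sum of squared haplotype frequencies), and nucleotide diversity as the mean number of pairwise differences per site. The function names and the toy alignment are hypothetical; the abstract does not say which software the authors used, and this sketch ignores gaps and standard errors.

```python
from collections import Counter
from itertools import combinations


def haplotype_diversity(seqs):
    """Probability that two sampled sequences are different haplotypes."""
    n = len(seqs)
    freqs = [count / n for count in Counter(seqs).values()]
    return n / (n - 1) * (1.0 - sum(f * f for f in freqs))


def nucleotide_diversity(seqs):
    """Mean number of pairwise differences per site (equal-length sequences)."""
    length = len(seqs[0])
    pairs = list(combinations(seqs, 2))
    differences = sum(
        sum(a != b for a, b in zip(x, y)) for x, y in pairs
    )
    return differences / (len(pairs) * length)


# Toy alignment (hypothetical): four sequences, three distinct haplotypes.
alignment = ["ACGT", "ACGT", "ACGA", "TCGA"]
print("h  =", round(haplotype_diversity(alignment), 4))
print("pi =", round(nucleotide_diversity(alignment), 4))
```

    High haplotype diversity combined with low nucleotide diversity, as reported here, means many distinct haplotypes that differ from one another at only a few sites.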

  17. Unconstrained cranial evolution in Neandertals and modern humans compared to common chimpanzees

    PubMed Central

    Weaver, Timothy D.; Stringer, Chris B.

    2015-01-01

    A variety of lines of evidence support the idea that neutral evolutionary processes (genetic drift, mutation) have been important in generating cranial differences between Neandertals and modern humans. But how do Neandertals and modern humans compare with other species? And how do these comparisons illuminate the evolutionary processes underlying cranial diversification? To address these questions, we used 27 standard cranial measurements collected on 2524 recent modern humans, 20 Neandertals and 237 common chimpanzees to estimate split times between Neandertals and modern humans, and between Pan troglodytes verus and two other subspecies of common chimpanzee. Consistent with a neutral divergence, the Neandertal versus modern human split-time estimates based on cranial measurements are similar to those based on DNA sequences. By contrast, the common chimpanzee cranial estimates are much lower than DNA-sequence estimates. Apparently, cranial evolution has been unconstrained in Neandertals and modern humans compared with common chimpanzees. Based on these and additional analyses, it appears that cranial differentiation in common chimpanzees has been restricted by stabilizing natural selection. Alternatively, this restriction could be due to genetic and/or developmental constraints on the amount of within-group variance (relative to effective population size) available for genetic drift to act on. PMID:26468243

  18. Exploring the effect of asymmetric mitochondrial DNA introgression on estimating niche divergence in morphologically cryptic species.

    PubMed

    Wielstra, Ben; Arntzen, Jan W

    2014-01-01

    If potential morphologically cryptic species, identified based on differentiated mitochondrial DNA, express ecological divergence, this increases support for their treatment as distinct species. However, mitochondrial DNA introgression hampers the correct estimation of ecological divergence. We test the hypothesis that estimated niche divergence differs when considering nuclear DNA composition or mitochondrial DNA type as representing the true species range. We use empirical data of two crested newt species (Amphibia: Triturus) which possess introgressed mitochondrial DNA from a third species in part of their ranges. We analyze the data in environmental space by determining Fisher distances in a principal component analysis and in geographical space by determining geographical overlap of species distribution models. We find that under mtDNA guidance in one of the two study cases niche divergence is overestimated, whereas in the other it is underestimated. In the light of our results we discuss the role of estimated niche divergence in species delineation.

  19. Putting scales into evolutionary time: the divergence of major scale insect lineages (Hemiptera) predates the radiation of modern angiosperm hosts

    PubMed Central

    Vea, Isabelle M.; Grimaldi, David A.

    2016-01-01

    The radiation of flowering plants in the mid-Cretaceous transformed landscapes and is widely believed to have fuelled the radiations of major groups of phytophagous insects. An excellent group to test this assertion is the scale insects (Coccomorpha: Hemiptera), with some 8,000 described Recent species and probably the most diverse fossil record of any phytophagous insect group preserved in amber. We used here a total-evidence approach (by tip-dating) employing 174 morphological characters of 73 Recent and 43 fossil taxa (48 families) and DNA sequences of three gene regions, to obtain divergence time estimates and compare the chronology of the most diverse lineage of scale insects, the neococcoid families, with the timing of the main angiosperm radiation. An estimated origin of the Coccomorpha occurred at the beginning of the Triassic, about 245 Ma [228–273], and of the neococcoids 60 million years later [210–165 Ma]. A total-evidence approach allows the integration of extinct scale insects into a phylogenetic framework, resulting in slightly younger median estimates than analyses using Recent taxa, calibrated with fossil ages only. From these estimates, we hypothesise that most major lineages of coccoids shifted from gymnosperms onto angiosperms when the latter became diverse and abundant in the mid- to Late Cretaceous. PMID:27000526

  20. A phylogenetic analysis of the grape genus (Vitis L.) reveals broad reticulation and concurrent diversification during neogene and quaternary climate change.

    PubMed

    Wan, Yizhen; Schwaninger, Heidi R; Baldo, Angela M; Labate, Joanne A; Zhong, Gan-Yuan; Simon, Charles J

    2013-07-05

    Grapes are one of the most economically important fruit crops. There are about 60 species in the genus Vitis. The phylogenetic relationships among these species are of keen interest for the conservation and use of this germplasm. We selected 309 accessions from 48 Vitis species, varieties, and outgroups, examined ~11 kb (~3.4 Mb total) of aligned nuclear DNA sequences from 27 unlinked genes in a phylogenetic context, and estimated divergence times based on fossil calibrations. Vitis formed a strongly supported clade. There was substantial support for species and less for the higher-level groupings (series). As estimated from extant taxa, the crown age of Vitis was 28 Ma and the divergence of subgenera (Vitis and Muscadinia) occurred at ~18 Ma. Higher clades in subgenus Vitis diverged 16 - 5 Ma with overlapping confidence intervals, and ongoing divergence formed extant species at 12 - 1.3 Ma. Several species had species-specific SNPs. NeighborNet analysis showed extensive reticulation at the core of subgenus Vitis representing the deeper nodes, with extensive reticulation radiating outward. Fitch Parsimony identified North America as the origin of the most recent common ancestor of extant Vitis species. Phylogenetic patterns suggested origination of the genus in North America, fragmentation of an ancestral range during the Miocene, formation of extant species in the late Miocene-Pleistocene, and differentiation of species in the context of Pliocene-Quaternary tectonic and climatic change. Nuclear SNPs effectively resolved relationships at and below the species level in grapes and rectified several misclassifications of accessions in the repositories. Our results challenge current higher-level classifications, reveal the abundance of genetic diversity in the genus that is potentially available for crop improvement, and provide a valuable resource for species delineation, germplasm conservation and use.

  1. The little shrimp that could: phylogeography of the circumtropical Stenopus hispidus (Crustacea: Decapoda), reveals divergent Atlantic and Pacific lineages

    PubMed Central

    Iacchei, Matthew; Coleman, Richard R.; Gaither, Michelle R.; Browne, William E.; Bowen, Brian W.; Toonen, Robert J.

    2018-01-01

    The banded coral shrimp, Stenopus hispidus (Crustacea: Decapoda: Stenopodidea) is a popular marine ornamental species with a circumtropical distribution. The planktonic larval stage lasts ∼120–253 days, indicating considerable dispersal potential, but few studies have investigated genetic connectivity on a global scale in marine invertebrates. To resolve patterns of divergence and phylogeography of S. hispidus, we surveyed 525 bp of mitochondrial cytochrome c oxidase subunit I (COI) from 198 individuals sampled at 10 locations across ∼27,000 km of the species range. Phylogenetic analyses reveal that S. hispidus has a Western Atlantic lineage and a widely distributed Indo-Pacific lineage, separated by sequence divergence of 2.1%. Genetic diversity is much higher in the Western Atlantic (h = 0.929; π = 0.004) relative to the Indo-Pacific (h = 0.105; π < 0.001), and coalescent analyses indicate that the Indo-Pacific population expanded more recently (95% HPD (highest posterior density) = 60,000–400,000 yr) than the Western Atlantic population (95% HPD = 300,000–760,000 yr). Divergence of the Western Atlantic and Pacific lineages is estimated at 710,000–1.8 million years ago, which does not readily align with commonly implicated colonization events between the ocean basins. The estimated age of populations contradicts the prevailing dispersal route for tropical marine biodiversity (Indo-Pacific to Atlantic) with the oldest and most diverse population in the Atlantic, and a recent population expansion with a single common haplotype shared throughout the vast Indian and Pacific oceans. In contrast to the circumtropical fishes, this diminutive reef shrimp challenges our understanding of conventional dispersal capabilities of marine species. PMID:29527409

  2. A Passerine Bird's evolution corroborates the geologic history of the island of New Guinea.

    PubMed

    Deiner, Kristy; Lemmon, Alan R; Mack, Andrew L; Fleischer, Robert C; Dumbacher, John P

    2011-05-06

    New Guinea is a biologically diverse island, with a unique geologic history and topography that has likely played a role in the evolution of species. Few island-wide studies, however, have examined the phylogeographic history of lowland species. The objective of this study was to examine patterns of phylogeographic variation of a common and widespread New Guinean bird species (Colluricincla megarhyncha). Specifically, we test the mechanisms hypothesized to cause geographic and genetic variation (e.g., vicariance, isolation by distance and founder-effect with dispersal). To accomplish this, we surveyed three regions of the mitochondrial genome and a nuclear intron and assessed differences among 23 of the 30 described subspecies from throughout their range. We found support for eight highly divergent lineages within C. megarhyncha. Genetic lineages were found within continuous lowland habitat or on smaller islands, but all individuals within clades were not necessarily structured by predicted biogeographic barriers. There was some evidence of isolation by distance and potential founder-effects. Mitochondrial DNA sequence divergence among lineages was at a level often observed among different species or even genera of birds (5-11%), suggesting lineages within regions have been isolated for long periods of time. When topographical barriers were associated with divergence patterns, the estimated divergence date for the clade coincided with the estimated time of barrier formation. We also found that dispersal distance and range size are positively correlated across lineages. Evidence from this research suggests that different phylogeographic mechanisms concurrently structure lineages of C. megarhyncha and are not mutually exclusive. These lineages are a result of evolutionary forces acting at different temporal and spatial scales concordant with New Guinea's geological history.

  3. A Passerine Bird's Evolution Corroborates the Geologic History of the Island of New Guinea

    PubMed Central

    Deiner, Kristy; Lemmon, Alan R.; Mack, Andrew L.; Fleischer, Robert C.; Dumbacher, John P.

    2011-01-01

    New Guinea is a biologically diverse island, with a unique geologic history and topography that has likely played a role in the evolution of species. Few island-wide studies, however, have examined the phylogeographic history of lowland species. The objective of this study was to examine patterns of phylogeographic variation of a common and widespread New Guinean bird species (Colluricincla megarhyncha). Specifically, we test the mechanisms hypothesized to cause geographic and genetic variation (e.g., vicariance, isolation by distance and founder-effect with dispersal). To accomplish this, we surveyed three regions of the mitochondrial genome and a nuclear intron and assessed differences among 23 of the 30 described subspecies from throughout their range. We found support for eight highly divergent lineages within C. megarhyncha. Genetic lineages were found within continuous lowland habitat or on smaller islands, but all individuals within clades were not necessarily structured by predicted biogeographic barriers. There was some evidence of isolation by distance and potential founder-effects. Mitochondrial DNA sequence divergence among lineages was at a level often observed among different species or even genera of birds (5–11%), suggesting lineages within regions have been isolated for long periods of time. When topographical barriers were associated with divergence patterns, the estimated divergence date for the clade coincided with the estimated time of barrier formation. We also found that dispersal distance and range size are positively correlated across lineages. Evidence from this research suggests that different phylogeographic mechanisms concurrently structure lineages of C. megarhyncha and are not mutually exclusive. These lineages are a result of evolutionary forces acting at different temporal and spatial scales concordant with New Guinea's geological history. PMID:21573115

  4. A practical divergence measure for survival distributions that can be estimated from Kaplan-Meier curves.

    PubMed

    Cox, Trevor F; Czanner, Gabriela

    2016-06-30

    This paper introduces a new simple divergence measure between two survival distributions. For two groups of patients, the divergence measure between their associated survival distributions is based on the integral of the absolute difference in probabilities that a patient from one group dies at time t and a patient from the other group survives beyond time t and vice versa. In the case of non-crossing hazard functions, the divergence measure is closely linked to the Harrell concordance index, C, the Mann-Whitney test statistic and the area under a receiver operating characteristic curve. The measure can be used in a dynamic way where the divergence between two survival distributions from time zero up to time t is calculated enabling real-time monitoring of treatment differences. The divergence can be found for theoretical survival distributions or can be estimated non-parametrically from survival data using Kaplan-Meier estimates of the survivor functions. The estimator of the divergence is shown to be generally unbiased and approximately normally distributed. For the case of proportional hazards, the constituent parts of the divergence measure can be used to assess the proportional hazards assumption. The use of the divergence measure is illustrated on the survival of pancreatic cancer patients. Copyright © 2016 John Wiley & Sons, Ltd.

  5. DNA barcoding for the identification of sand fly species (Diptera, Psychodidae, Phlebotominae) in Colombia.

    PubMed

    Contreras Gutiérrez, María Angélica; Vivero, Rafael J; Vélez, Iván D; Porter, Charles H; Uribe, Sandra

    2014-01-01

    Sand flies include a group of insects that are of medical importance and that vary in geographic distribution, ecology, and pathogen transmission. Approximately 163 species of sand flies have been reported in Colombia. Surveillance of the presence of sand fly species and the actualization of species distribution are important for predicting risks for and monitoring the expansion of diseases which sand flies can transmit. Currently, the identification of phlebotomine sand flies is based on morphological characters. However, morphological identification requires considerable skills and taxonomic expertise. In addition, significant morphological similarity between some species, especially among females, may cause difficulties during the identification process. DNA-based approaches have become increasingly useful and promising tools for estimating sand fly diversity and for ensuring the rapid and accurate identification of species. A partial sequence of the mitochondrial cytochrome oxidase gene subunit I (COI) is currently being used to differentiate species in different animal taxa, including insects, and it is referred to as a barcoding sequence. The present study explored the utility of the DNA barcode approach for the identification of phlebotomine sand flies in Colombia. We sequenced 700 bp of the COI gene from 36 species collected from different geographic localities. The COI barcode sequence divergence within a single species was <2% in most cases, whereas this divergence ranged from 9% to 26.6% among different species. These results indicated that the barcoding gene correctly discriminated among the previously morphologically identified species with an efficacy of nearly 100%. Analyses of the generated sequences indicated that the observed species groupings were consistent with the morphological identifications. In conclusion, the barcoding gene was useful for species discrimination in sand flies from Colombia.

  6. DNA Barcoding for the Identification of Sand Fly Species (Diptera, Psychodidae, Phlebotominae) in Colombia

    PubMed Central

    Contreras Gutiérrez, María Angélica; Vivero, Rafael J.; Vélez, Iván D.; Porter, Charles H.; Uribe, Sandra

    2014-01-01

    Sand flies include a group of insects that are of medical importance and that vary in geographic distribution, ecology, and pathogen transmission. Approximately 163 species of sand flies have been reported in Colombia. Surveillance of the presence of sand fly species and the actualization of species distribution are important for predicting risks for and monitoring the expansion of diseases which sand flies can transmit. Currently, the identification of phlebotomine sand flies is based on morphological characters. However, morphological identification requires considerable skills and taxonomic expertise. In addition, significant morphological similarity between some species, especially among females, may cause difficulties during the identification process. DNA-based approaches have become increasingly useful and promising tools for estimating sand fly diversity and for ensuring the rapid and accurate identification of species. A partial sequence of the mitochondrial cytochrome oxidase gene subunit I (COI) is currently being used to differentiate species in different animal taxa, including insects, and it is referred to as a barcoding sequence. The present study explored the utility of the DNA barcode approach for the identification of phlebotomine sand flies in Colombia. We sequenced 700 bp of the COI gene from 36 species collected from different geographic localities. The COI barcode sequence divergence within a single species was <2% in most cases, whereas this divergence ranged from 9% to 26.6% among different species. These results indicated that the barcoding gene correctly discriminated among the previously morphologically identified species with an efficacy of nearly 100%. Analyses of the generated sequences indicated that the observed species groupings were consistent with the morphological identifications. In conclusion, the barcoding gene was useful for species discrimination in sand flies from Colombia. PMID:24454877

  7. Phylogenetic relationships in the mushroom genus Coprinus and dark-spored allies based on sequence data from the nuclear gene coding for the large ribosomal subunit RNA: divergent domains, outgroups, and monophyly.

    PubMed

    Hopple, J S; Vilgalys, R

    1999-10-01

    Phylogenetic relationships were investigated in the mushroom genus Coprinus based on sequence data from the nuclear encoded large-subunit rDNA gene. Forty-seven species of Coprinus and 19 additional species from the families Coprinaceae, Strophariaceae, Bolbitiaceae, Agaricaceae, Podaxaceae, and Montagneaceae were studied. A total of 1360 sites was sequenced across seven divergent domains and intervening sequences. A total of 302 phylogenetically informative characters was found. Ninety-eight percent of the average divergence between taxa was located within the divergent domains, with domains D2 and D8 being most divergent and domains D7 and D10 the least divergent. An empirical test of phylogenetic signal among divergent domains also showed that domains D2 and D3 had the lowest levels of homoplasy. Two equally most parsimonious trees were resolved using Wagner parsimony. A character-state weighted analysis produced 12 equally most parsimonious trees similar to those generated by Wagner parsimony. Phylogenetic analyses employing topological constraints suggest that none of the major taxonomic systems proposed for subgeneric classification is able to completely reflect phylogenetic relationships in Coprinus. A strict consensus integration of the two Wagner trees demonstrates the problematic nature of choosing outgroups within dark-spored mushrooms. The genus Coprinus is found to be polyphyletic and is separated into three distinct clades. Most Coprinus taxa belong to the first two clades, which together form a larger monophyletic group with Lacrymaria and Psathyrella in basal positions. A third clade contains members of Coprinus section Comati as well as the genus Leucocoprinus, Podaxis pistillaris, Montagnea arenaria, and Agaricus pocillator. This third clade is separated from the other species of Coprinus by members of the families Strophariaceae and Bolbitiaceae and the genus Panaeolus. Copyright 1999 Academic Press.

  8. Estimation of divergence from Hardy-Weinberg form.

    PubMed

    Stark, Alan E

    2015-08-01

    The Hardy–Weinberg (HW) principle explains how random mating (RM) can produce and maintain a population in equilibrium, that is, with constant genotypic proportions. When proportions diverge from HW form, it is of interest to estimate the fixation index F, which reflects the degree of divergence. Starting from a sample of genotypic counts, a mixed procedure gives first the orthodox estimate of gene frequency q and then a Bayesian estimate of F, based on a credible prior distribution of F, which is described here.
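
    As a rough illustration of the quantities named in this abstract, the sketch below computes the gene-counting estimate of the allele frequency q and a simple moment estimate of the fixation index F from genotype counts. This is not the Bayesian procedure the paper describes; the function name and the textbook estimator F = 1 - H_obs / (2pq) are assumptions chosen for illustration only.

```python
def fixation_index(n_AA, n_Aa, n_aa):
    """Return (q, F) from genotype counts at a biallelic locus.

    q is the gene-counting estimate of the frequency of allele 'a'.
    F is the moment estimate 1 - H_obs / (2pq), NOT the Bayesian
    estimate described in the abstract (illustrative assumption).
    """
    n = n_AA + n_Aa + n_aa
    if n == 0:
        raise ValueError("no genotypes observed")
    q = (2 * n_aa + n_Aa) / (2 * n)   # each aa carries two 'a' alleles, each Aa one
    p = 1.0 - q
    expected_het = 2 * p * q          # heterozygote proportion under HW form
    if expected_het == 0:
        raise ValueError("locus is monomorphic; F is undefined")
    observed_het = n_Aa / n
    return q, 1.0 - observed_het / expected_het


# Counts in exact HW proportions give F = 0; no heterozygotes at all gives F = 1.
print(fixation_index(25, 50, 25))
print(fixation_index(50, 0, 50))
```

    A positive F indicates a deficit of heterozygotes relative to HW form, and a negative F an excess.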

  9. Comparative genome sequencing of Drosophila pseudoobscura: Chromosomal, gene, and cis-element evolution

    PubMed Central

    Richards, Stephen; Liu, Yue; Bettencourt, Brian R.; Hradecky, Pavel; Letovsky, Stan; Nielsen, Rasmus; Thornton, Kevin; Hubisz, Melissa J.; Chen, Rui; Meisel, Richard P.; Couronne, Olivier; Hua, Sujun; Smith, Mark A.; Zhang, Peili; Liu, Jing; Bussemaker, Harmen J.; van Batenburg, Marinus F.; Howells, Sally L.; Scherer, Steven E.; Sodergren, Erica; Matthews, Beverly B.; Crosby, Madeline A.; Schroeder, Andrew J.; Ortiz-Barrientos, Daniel; Rives, Catharine M.; Metzker, Michael L.; Muzny, Donna M.; Scott, Graham; Steffen, David; Wheeler, David A.; Worley, Kim C.; Havlak, Paul; Durbin, K. James; Egan, Amy; Gill, Rachel; Hume, Jennifer; Morgan, Margaret B.; Miner, George; Hamilton, Cerissa; Huang, Yanmei; Waldron, Lenée; Verduzco, Daniel; Clerc-Blankenburg, Kerstin P.; Dubchak, Inna; Noor, Mohamed A.F.; Anderson, Wyatt; White, Kevin P.; Clark, Andrew G.; Schaeffer, Stephen W.; Gelbart, William; Weinstock, George M.; Gibbs, Richard A.

    2005-01-01

    We have sequenced the genome of a second Drosophila species, Drosophila pseudoobscura, and compared this to the genome sequence of Drosophila melanogaster, a primary model organism. Throughout evolution the vast majority of Drosophila genes have remained on the same chromosome arm, but within each arm gene order has been extensively reshuffled, leading to a minimum of 921 syntenic blocks shared between the species. A repetitive sequence is found in the D. pseudoobscura genome at many junctions between adjacent syntenic blocks. Analysis of this novel repetitive element family suggests that recombination between offset elements may have given rise to many paracentric inversions, thereby contributing to the shuffling of gene order in the D. pseudoobscura lineage. Based on sequence similarity and synteny, 10,516 putative orthologs have been identified as a core gene set conserved over 25–55 million years (Myr) since the pseudoobscura/melanogaster divergence. Genes expressed in the testes had higher amino acid sequence divergence than the genome-wide average, consistent with the rapid evolution of sex-specific proteins. Cis-regulatory sequences are more conserved than random and nearby sequences between the species—but the difference is slight, suggesting that the evolution of cis-regulatory elements is flexible. Overall, a pattern of repeat-mediated chromosomal rearrangement, and high coadaptation of both male genes and cis-regulatory sequences emerges as important themes of genome divergence between these species of Drosophila. PMID:15632085

  10. Lineage divergence detected in the malaria vector Anopheles marajoara (Diptera: Culicidae) in Amazonian Brazil

    PubMed Central

    2010-01-01

    Background Cryptic species complexes are common among anophelines. Previous phylogenetic analysis based on the complete mtDNA COI gene sequences detected paraphyly in the Neotropical malaria vector Anopheles marajoara. The "Folmer region" detects a single taxon using a 3% divergence threshold. Methods To test the paraphyletic hypothesis and examine the utility of the Folmer region, genealogical trees based on a concatenated (white + 3' COI sequences) dataset and pairwise differentiation of COI fragments were examined. The population structure and demographic history were based on partial COI sequences for 294 individuals from 14 localities in Amazonian Brazil. 109 individuals from 12 localities were sequenced for the nDNA white gene, and 57 individuals from 11 localities were sequenced for the ribosomal DNA (rDNA) internal transcribed spacer 2 (ITS2). Results Distinct A. marajoara lineages were detected by combined genealogical analysis and were also supported among COI haplotypes using a median joining network and AMOVA, with time since divergence during the Pleistocene (<100,000 ya). COI sequences at the 3' end were more variable, demonstrating significant pairwise differentiation (3.82%) compared to the more moderate 2.92% detected by the Folmer region. Lineage 1 was present in all localities, whereas lineage 2 was restricted mainly to the west. Mismatch distributions for both lineages were bimodal, likely due to multiple colonization events and spatial expansion (~798 - 81,045 ya). There appears to be gene flow within, not between lineages, and a partial barrier was detected near Rio Jari in Amapá state, separating western and eastern populations. In contrast, both nDNA data sets (white gene sequences with or without the retention of the 4th intron, and ITS2 sequences and length) detected a single A. marajoara lineage. 
    Conclusions Strong support for combined data with significant differentiation detected in the COI and absent in the nDNA suggest that the divergence is recent, and detectable only by the faster evolving mtDNA. A within subgenus threshold of >2% may be more appropriate among sister taxa in cryptic anopheline complexes than the standard 3%. Differences in demographic history and climatic changes may have contributed to mtDNA lineage divergence in A. marajoara. PMID:20929572

  11. Inferring Species Richness and Turnover by Statistical Multiresolution Texture Analysis of Satellite Imagery

    DTIC Science & Technology

    2012-10-24

    representative pdf’s via the Kullback-Leibler divergence (KL). Species turnover, or β diversity, is estimated using both this KL divergence and the...multiresolution analysis provides a means for estimating divergence between two textures, specifically the Kullback-Leibler divergence between the pair of ...and open challenges. Ecological Informatics 5: 318–329. 19. Ludovisi A, Taticchi M (2006) Investigating beta diversity by Kullback-Leibler information
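
    For readers unfamiliar with the Kullback-Leibler divergence mentioned in this excerpt, the following is a minimal sketch for two discrete distributions, for example normalized histograms of texture features. The function name and the histogram interpretation are assumptions for illustration; the report's own estimator is not reproduced here.

```python
import math


def kl_divergence(p, q):
    """Kullback-Leibler divergence D(p || q) in nats for discrete distributions.

    p and q are sequences of probabilities over the same bins (assumed to
    sum to 1). Bins with p_i == 0 contribute nothing; a bin with p_i > 0
    and q_i == 0 makes the divergence infinite.
    """
    if len(p) != len(q):
        raise ValueError("p and q must have the same number of bins")
    total = 0.0
    for pi, qi in zip(p, q):
        if pi == 0:
            continue
        if qi == 0:
            return math.inf
        total += pi * math.log(pi / qi)
    return total


# Identical distributions have zero divergence; the measure is not symmetric.
print(kl_divergence([0.5, 0.5], [0.5, 0.5]))
print(kl_divergence([1.0, 0.0], [0.5, 0.5]))
print(kl_divergence([0.5, 0.5], [1.0, 0.0]))
```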

  12. Barcoding of fresh water fishes from Pakistan.

    PubMed

    Karim, Asma; Iqbal, Asad; Akhtar, Rehan; Rizwan, Muhammad; Amar, Ali; Qamar, Usman; Jahan, Shah

    2016-07-01

    DNA barcoding is a taxonomic method that uses small genetic markers in organisms' mitochondrial DNA (mtDNA) for identification of particular species. It uses sequence diversity in a 658-base pair fragment near the 5' end of the mitochondrial cytochrome c oxidase subunit 1 (CO1) gene as a tool for species identification. DNA barcoding is a more accurate and reliable method than morphological identification. It is equally useful in juvenile and adult stages of fishes. The present study was conducted to genetically identify three farm fish species of Pakistan (Cyprinus carpio, Cirrhinus mrigala, and Ctenopharyngodon idella). All of them belong to the family Cyprinidae. The CO1 gene was amplified. PCR products were sequenced and analyzed with bioinformatic software. Conspecific, congeneric, and confamilial K2P nucleotide divergence was estimated. From these findings, it was concluded that the CO1 gene sequence may serve as a milestone for the identification of related species at the molecular level.
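
    The K2P (Kimura two-parameter) divergence referred to above can be sketched as follows for a pair of pre-aligned sequences, using the standard formula d = -1/2 ln[(1 - 2P - Q) sqrt(1 - 2Q)], where P and Q are the proportions of transitions and transversions. The function name and the choice to skip gaps and ambiguous bases are assumptions; the study's own software and settings are not stated in the abstract.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(seq1, seq2):
    """Kimura two-parameter distance between two aligned DNA sequences.

    Sites where either sequence has a gap or ambiguous base are skipped
    (an illustrative choice, not taken from the abstract).
    """
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    transitions = transversions = compared = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue
        compared += 1
        if a == b:
            continue
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1      # A<->G or C<->T
        else:
            transversions += 1    # purine <-> pyrimidine
    if compared == 0:
        raise ValueError("no comparable sites")
    P = transitions / compared
    Q = transversions / compared
    a_term = 1 - 2 * P - Q
    b_term = 1 - 2 * Q
    if a_term <= 0 or b_term <= 0:
        raise ValueError("sequences too divergent for the K2P correction")
    return -0.5 * math.log(a_term * math.sqrt(b_term))


# One transition in ten aligned sites.
print(k2p_distance("AAAAAAAAAA", "GAAAAAAAAA"))
```

    Conspecific, congeneric, and confamilial divergences would then be averages of this pairwise distance within the corresponding taxonomic groupings.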

  13. Defining functional distance using manifold embeddings of gene ontology annotations

    PubMed Central

    Lerman, Gilad; Shakhnovich, Boris E.

    2007-01-01

    Although rigorous measures of similarity for sequence and structure are now well established, the problem of defining functional relationships has been particularly daunting. Here, we present several manifold embedding techniques to compute distances between Gene Ontology (GO) functional annotations and consequently estimate functional distances between protein domains. To evaluate accuracy, we correlate the functional distance to the well established measures of sequence, structural, and phylogenetic similarities. Finally, we show that manual classification of structures into folds and superfamilies is mirrored by proximity in the newly defined function space. We show how functional distances place structure–function relationships in biological context resulting in insight into divergent and convergent evolution. The methods and results in this paper can be readily generalized and applied to a wide array of biologically relevant investigations, such as accuracy of annotation transference, the relationship between sequence, structure, and function, or coherence of expression modules. PMID:17595300

  14. High-quality de novo assembly of the apple genome and methylome dynamics of early fruit development.

    PubMed

    Daccord, Nicolas; Celton, Jean-Marc; Linsmith, Gareth; Becker, Claude; Choisne, Nathalie; Schijlen, Elio; van de Geest, Henri; Bianco, Luca; Micheletti, Diego; Velasco, Riccardo; Di Pierro, Erica Adele; Gouzy, Jérôme; Rees, D Jasper G; Guérif, Philippe; Muranty, Hélène; Durel, Charles-Eric; Laurens, François; Lespinasse, Yves; Gaillard, Sylvain; Aubourg, Sébastien; Quesneville, Hadi; Weigel, Detlef; van de Weg, Eric; Troggio, Michela; Bucher, Etienne

    2017-07-01

    Using the latest sequencing and optical mapping technologies, we have produced a high-quality de novo assembly of the apple (Malus domestica Borkh.) genome. Repeat sequences, which represented over half of the assembly, provided an unprecedented opportunity to investigate the uncharacterized regions of a tree genome; we identified a new hyper-repetitive retrotransposon sequence that was over-represented in heterochromatic regions and estimated that a major burst of different transposable elements (TEs) occurred 21 million years ago. Notably, the timing of this TE burst coincided with the uplift of the Tian Shan mountains, which is thought to be the center of the location where the apple originated, suggesting that TEs and associated processes may have contributed to the diversification of the apple ancestor and possibly to its divergence from pear. Finally, genome-wide DNA methylation data suggest that epigenetic marks may contribute to agronomically relevant aspects, such as apple fruit development.

  15. Identification of an ancient endogenous retrovirus, predating the divergence of the placental mammals.

    PubMed

    Lee, Adam; Nolan, Alison; Watson, Jason; Tristem, Michael

    2013-09-19

    The evolutionary arms race between mammals and retroviruses has long been recognized as one of the oldest host-parasite interactions. Rapid evolution rates in exogenous retroviruses have often made accurate viral age estimations highly problematic. Endogenous retroviruses (ERVs), however, integrate into the germline of their hosts, and are subjected to their evolutionary rates. This study describes, for the first time, a retroviral orthologue predating the divergence of placental mammals, giving it a minimum age of 104-110 Myr. Simultaneously, other orthologous selfish genetic elements (SGEs), inserted into the ERV sequence, provide evidence for the oldest individual mammalian-wide interspersed repeat and medium-reiteration frequency interspersed repeat mammalian repeats, with the same minimum age. The combined use of shared SGEs and reconstruction of viral orthologies defines new limits and increases maximum 'lookback' times, with subsequent implications for the field of paleovirology.

  16. Bloom DNA Helicase Facilitates Homologous Recombination between Diverged Homologous Sequences*

    PubMed Central

    Kikuchi, Koji; Abdel-Aziz, H. Ismail; Taniguchi, Yoshihito; Yamazoe, Mitsuyoshi; Takeda, Shunichi; Hirota, Kouji

    2009-01-01

    Bloom syndrome caused by inactivation of the Bloom DNA helicase (Blm) is characterized by increases in the level of sister chromatid exchange, homologous recombination (HR) associated with cross-over. It is therefore believed that Blm works as an anti-recombinase. Meanwhile, in Drosophila, DmBlm is required specifically to promote the synthesis-dependent strand anneal (SDSA), a type of HR not associating with cross-over. However, conservation of Blm function in SDSA through higher eukaryotes has been a matter of debate. Here, we demonstrate the function of Blm in SDSA type HR in chicken DT40 B lymphocyte line, where Ig gene conversion diversifies the immunoglobulin V gene through intragenic HR between diverged homologous segments. This reaction is initiated by the activation-induced cytidine deaminase enzyme-mediated uracil formation at the V gene, which in turn converts into abasic site, presumably leading to a single strand gap. Ig gene conversion frequency was drastically reduced in BLM−/− cells. In addition, BLM−/− cells used limited donor segments harboring higher identity compared with other segments in Ig gene conversion event, suggesting that Blm can promote HR between diverged sequences. To further understand the role of Blm in HR between diverged homologous sequences, we measured the frequency of gene targeting induced by an I-SceI-endonuclease-mediated double-strand break. BLM−/− cells showed a severer defect in the gene targeting frequency as the number of heterologous sequences increased at the double-strand break site. Conversely, the overexpression of Blm, even an ATPase-defective mutant, strongly stimulated gene targeting. In summary, Blm promotes HR between diverged sequences through a novel ATPase-independent mechanism. PMID:19661064

  17. Strategies for improving approximate Bayesian computation tests for synchronous diversification.

    PubMed

    Overcast, Isaac; Bagley, Justin C; Hickerson, Michael J

    2017-08-24

    Estimating the variability in isolation times across co-distributed taxon pairs that may have experienced the same allopatric isolating mechanism is a core goal of comparative phylogeography. The use of hierarchical Approximate Bayesian Computation (ABC) and coalescent models to infer temporal dynamics of lineage co-diversification has been a contentious topic in recent years. Key issues that remain unresolved include the choice of an appropriate prior on the number of co-divergence events (Ψ), as well as the optimal strategies for data summarization. Through simulation-based cross validation we explore the impact of the strategy for sorting summary statistics and the choice of prior on Ψ on the estimation of co-divergence variability. We also introduce a new setting (β) that can potentially improve estimation of Ψ by enforcing a minimal temporal difference between pulses of co-divergence. We apply this new method to three empirical datasets: one dataset each of co-distributed taxon pairs of Panamanian frogs and freshwater fishes, and a large set of Neotropical butterfly sister-taxon pairs. We demonstrate that the choice of prior on Ψ has little impact on inference, but that sorting summary statistics yields substantially more reliable estimates of co-divergence variability despite violations of assumptions about exchangeability. We find the implementation of β improves estimation of Ψ, with improvement being most dramatic given larger numbers of taxon pairs. We find equivocal support for synchronous co-divergence for both of the Panamanian groups, but we find considerable support for asynchronous divergence among the Neotropical butterflies. Our simulation experiments demonstrate that using sorted summary statistics results in improved estimates of the variability in divergence times, whereas the choice of hyperprior on Ψ has negligible effect. 
    Additionally, we demonstrate that estimating the number of pulses of co-divergence across co-distributed taxon-pairs is improved by applying a flexible buffering regime over divergence times. This improves the correlation between Ψ and the true variability in isolation times and allows for more meaningful interpretation of this hyperparameter. This will allow for more accurate identification of the number of temporally distinct pulses of co-divergence that generated the diversification pattern of a given regional assemblage of sister-taxon-pairs.

  18. The impact of calibration and clock-model choice on molecular estimates of divergence times.

    PubMed

    Duchêne, Sebastián; Lanfear, Robert; Ho, Simon Y W

    2014-09-01

    Phylogenetic estimates of evolutionary timescales can be obtained from nucleotide sequence data using the molecular clock. These estimates are important for our understanding of evolutionary processes across all taxonomic levels. The molecular clock needs to be calibrated with an independent source of information, such as fossil evidence, to allow absolute ages to be inferred. Calibration typically involves fixing or constraining the age of at least one node in the phylogeny, enabling the ages of the remaining nodes to be estimated. We conducted an extensive simulation study to investigate the effects of the position and number of calibrations on the resulting estimate of the timescale. Our analyses focused on Bayesian estimates obtained using relaxed molecular clocks. Our findings suggest that an effective strategy is to include multiple calibrations and to prefer those that are close to the root of the phylogeny. Under these conditions, we found that evolutionary timescales could be estimated accurately even when the relaxed-clock model was misspecified and when the sequence data were relatively uninformative. We tested these findings in a case study of simian foamy virus, where we found that shallow calibrations caused the overall timescale to be underestimated by up to three orders of magnitude. Finally, we provide some recommendations for improving the practice of molecular-clock calibration. Copyright © 2014 Elsevier Inc. All rights reserved.

  19. Worldwide prevalence of lentivirus infection in wild feline species: epidemiologic and phylogenetic aspects.

    PubMed

    Olmsted, R A; Langley, R; Roelke, M E; Goeken, R M; Adger-Johnson, D; Goff, J P; Albert, J P; Packer, C; Laurenson, M K; Caro, T M

    1992-10-01

    The natural occurrence of lentiviruses closely related to feline immunodeficiency virus (FIV) in nondomestic felid species is shown here to be worldwide. Cross-reactive antibodies to FIV were common in several free-ranging populations of large cats, including East African lions and cheetahs of the Serengeti ecosystem and in puma (also called cougar or mountain lion) populations throughout North America. Infectious puma lentivirus (PLV) was isolated from several Florida panthers, a severely endangered relict puma subspecies inhabiting the Big Cypress Swamp and Everglades ecosystems in southern Florida. Phylogenetic analysis of PLV genomic sequences from disparate geographic isolates revealed appreciable divergence from domestic cat FIV sequences as well as between PLV sequences found in different North American locales. The level of sequence divergence between PLV and FIV was greater than the level of divergence between human and certain simian immunodeficiency viruses, suggesting that the transmission of FIV between feline species is infrequent and parallels in time the emergence of HIV from simian ancestors.

  20. Calibrated tree priors for relaxed phylogenetics and divergence time estimation.

    PubMed

    Heled, Joseph; Drummond, Alexei J

    2012-01-01

    The use of fossil evidence to calibrate divergence time estimation has a long history. More recently, Bayesian Markov chain Monte Carlo has become the dominant method of divergence time estimation, and fossil evidence has been reinterpreted as the specification of prior distributions on the divergence times of calibration nodes. These so-called "soft calibrations" have become widely used but the statistical properties of calibrated tree priors in a Bayesian setting have not been carefully investigated. Here, we clarify that calibration densities, such as those defined in BEAST 1.5, do not represent the marginal prior distribution of the calibration node. We illustrate this with a number of analytical results on small trees. We also describe an alternative construction for a calibrated Yule prior on trees that allows direct specification of the marginal prior distribution of the calibrated divergence time, with or without the restriction of monophyly. This method requires the computation of the Yule prior conditional on the height of the divergence being calibrated. Unfortunately, a practical solution for multiple calibrations remains elusive. Our results suggest that direct estimation of the prior induced by specifying multiple calibration densities should be a prerequisite of any divergence time dating analysis.

  1. Asymptotic response of observables from divergent weak-coupling expansions: a fractional-calculus-assisted Padé technique.

    PubMed

    Dhatt, Sharmistha; Bhattacharyya, Kamal

    2012-08-01

    Appropriate constructions of Padé approximants are believed to provide reasonable estimates of the asymptotic (large-coupling) amplitude and exponent of an observable, given its weak-coupling expansion to some desired order. In many instances, however, sequences of such approximants are seen to converge very poorly. We outline here a strategy that exploits the idea of fractional calculus to considerably improve the convergence behavior. Pilot calculations on the ground-state perturbative energy series of quartic, sextic, and octic anharmonic oscillators reveal clearly the worth of our endeavor.

  2. Genotypes and phylogeographical relationships of infectious hematopoietic necrosis virus in California, USA

    USGS Publications Warehouse

    Kelley, G.O.; Bendorf, C.M.; Yun, S.C.; Kurath, G.; Hedrick, R.P.

    2007-01-01

    Infectious hematopoietic necrosis virus (IHNV) contains 3 major genogroups in North America with discrete geographic ranges designated as upper (U), middle (M), and lower (L). A comprehensive genotyping of 237 IHNV isolates from hatchery and wild salmonids in California revealed 25 different sequence types (a to y) all in the L genogroup; specifically, the genogroup contained 14 sequence types that were unique to individual isolates as well as 11 sequence types representing 2 or more identical isolates. The most evident trend was the phylogenetic and geographical division of the L genogroup into 2 distinct subgroups designated as LI and LII. Isolates within Subgroup LI were primarily found within waterways linked to southern Oregon and northern California coastal rivers. Isolates in Subgroup LII were concentrated within inland valley watersheds that included the Sacramento River, San Joaquin River, and their tributaries. The temporal and spatial patterns of virus occurrence suggested that infections among adult Chinook salmon in the hatchery or that spawn in the river are a major source of virus potentially infecting other migrating or resident salmonids in California. Serum neutralization results of the California isolates of IHNV corroborated a temporal trend of sequence divergence; specifically, 2 progressive shifts in which more recent virus isolates represent new serotypes. A comparison of the estimates of divergence rates for Subgroup LI (1 × 10^-5 mutations per nucleotide site per year) indicated stasis similar to that observed in the U genogroup, while the Subgroup LII rate (1 × 10^-3 mutations per nucleotide site per year) suggested a more active evolution similar to that of the M genogroup. © Inter-Research 2007.

  3. Divergence between human populations estimated from linkage disequilibrium.

    PubMed

    Sved, John A; McRae, Allan F; Visscher, Peter M

    2008-12-01

    Observed linkage disequilibrium (LD) between genetic markers in different populations descended independently from a common ancestral population can be used to estimate their absolute time of divergence, because the correlation of LD between populations will be reduced each generation by an amount that, approximately, depends only on the recombination rate between markers. Although drift leads to divergence in allele frequencies, it has less effect on divergence in LD values. We derived the relationship between LD and time of divergence and verified it with coalescent simulations. We then used HapMap Phase II data to estimate time of divergence between human populations. Summed over large numbers of pairs of loci, we find a positive correlation of LD between African and non-African populations at levels of up to approximately 0.3 cM. We estimate that the observed correlation of LD is consistent with an effective separation time of approximately 1,000 generations or approximately 25,000 years before present. The most likely explanation for such relatively low separation times is the existence of substantial levels of migration between populations after the initial separation. Theory and results from coalescent simulations confirm that low levels of migration can lead to a downward bias in the estimate of separation time.
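
    As a rough check on the figures quoted above, the conversion from generations to calendar years is a single multiplication. The sketch below is illustrative only: the function name is hypothetical, and the 25-year generation time is simply the value implied by the abstract's two numbers (about 1,000 generations and about 25,000 years), not a parameter stated in the record.

```python
def generations_to_years(generations: float, years_per_generation: float) -> float:
    """Convert a separation time in generations to calendar years."""
    if generations < 0 or years_per_generation <= 0:
        raise ValueError("generations must be >= 0 and generation time must be > 0")
    return generations * years_per_generation


# Generation time implied by the abstract's figures (an inference, not a quoted value).
implied_generation_time = 25_000 / 1_000  # years per generation

separation_years = generations_to_years(1_000, implied_generation_time)
print(separation_years)
```

    A different assumed generation time scales the estimate in years proportionally, so the figure in years is only as reliable as that assumption.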

  4. Evaluation of beam divergence of a negative hydrogen ion beam using Doppler shift spectroscopy diagnostics

    NASA Astrophysics Data System (ADS)

    Deka, A. J.; Bharathi, P.; Pandya, K.; Bandyopadhyay, M.; Bhuyan, M.; Yadav, R. K.; Tyagi, H.; Gahlaut, A.; Chakraborty, A.

    2018-01-01

    The Doppler Shift Spectroscopy (DSS) diagnostic is in the conceptual stage to estimate beam divergence, stripping losses, and beam uniformity of the 100 keV hydrogen Diagnostics Neutral Beam of International Thermonuclear Experimental Reactor. This DSS diagnostic is used to measure the above-mentioned parameters with an error of less than 10%. To aid the design calculations and to establish a methodology for estimation of the beam divergence, DSS measurements were carried out on the existing prototype ion source RF Operated Beam Source in India for Negative ion Research. Emissions of the fast-excited neutrals that are generated from the extracted negative ions were collected in the target tank, and the line broadening of these emissions was used for estimating beam divergence. The observed broadening is a convolution of broadenings due to beam divergence, collection optics, voltage ripple, beam focusing, and instrumental broadening. Hence, for estimating the beam divergence from the observed line broadening, a systematic line profile analysis was performed. To minimize the error in the divergence measurements, a study on error propagation in the beam divergence measurements was carried out and the error was estimated. The measurements of beam divergence were done at a constant RF power of 50 kW and a source pressure of 0.6 Pa by varying the extraction voltage from 4 kV to 10 kV and the acceleration voltage from 10 kV to 15 kV. These measurements were then compared with the calorimetric divergence, and the results seemed to agree within 10%. A minimum beam divergence of ~3° was obtained when the source was operated at an extraction voltage of ~5 kV and at a ~10 kV acceleration voltage, i.e., at a total applied voltage of 15 kV. This is in agreement with the values reported in experiments carried out on similar sources elsewhere.

  5. When are pathogen genome sequences informative of transmission events?

    PubMed Central

    Ferguson, Neil; Jombart, Thibaut

    2018-01-01

    Recent years have seen the development of numerous methodologies for reconstructing transmission trees in infectious disease outbreaks from densely sampled whole genome sequence data. However, a fundamental and as of yet poorly addressed limitation of such approaches is the requirement for genetic diversity to arise on epidemiological timescales. Specifically, the position of infected individuals in a transmission tree can only be resolved by genetic data if mutations have accumulated between the sampled pathogen genomes. To quantify and compare the useful genetic diversity expected from genetic data in different pathogen outbreaks, we introduce here the concept of ‘transmission divergence’, defined as the number of mutations separating whole genome sequences sampled from transmission pairs. Using parameter values obtained by literature review, we simulate outbreak scenarios alongside sequence evolution using two models described in the literature to describe transmission divergence of ten major outbreak-causing pathogens. We find that while mean values vary significantly between the pathogens considered, their transmission divergence is generally very low, with many outbreaks characterised by large numbers of genetically identical transmission pairs. We describe the impact of transmission divergence on our ability to reconstruct outbreaks using two outbreak reconstruction tools, the R packages outbreaker and phybreak, and demonstrate that, in agreement with previous observations, genetic sequence data of rapidly evolving pathogens such as RNA viruses can provide valuable information on individual transmission events. Conversely, sequence data of pathogens with lower mean transmission divergence, including Streptococcus pneumoniae, Shigella sonnei and Clostridium difficile, provide little to no information about individual transmission events. Our results highlight the informational limitations of genetic sequence data in certain outbreak scenarios, and demonstrate the need to expand the toolkit of outbreak reconstruction tools to integrate other types of epidemiological data. PMID:29420641

  6. Comparative phylogeography reveals deep lineages and regional evolutionary hotspots in the Mojave and Sonoran Deserts

    USGS Publications Warehouse

    Wood, Dustin A.; Vandergast, Amy G.; Barr, Kelly R.; Inman, Richard D.; Esque, Todd C.; Nussear, Kenneth E.; Fisher, Robert N.

    2013-01-01

    Aim: We explored lineage diversification within desert-dwelling fauna. Our goals were (1) to determine whether phylogenetic lineages and population expansions were consistent with younger Pleistocene climate fluctuation hypotheses or much older events predicted by pre-Pleistocene vicariance hypotheses, (2) to assess concordance in spatial patterns of genetic divergence and diversity among species and (3) to identify regional evolutionary hotspots of divergence and diversity and assess their conservation status. Location: Mojave, Colorado, and Sonoran Deserts, USA. Methods: We analysed previously published gene sequence data for twelve species. We used Bayesian gene tree methods to estimate lineages and divergence times. Within each lineage, we tested for population expansion and age of expansion using coalescent approaches. We mapped interpopulation genetic divergence and intra-population genetic diversity in a GIS to identify hotspots of highest genetic divergence and diversity and to assess whether protected lands overlapped with evolutionary hotspots. Results: In seven of the 12 species, lineage divergence substantially predated the Pleistocene. Historical population expansion was found in eight species, but expansion events postdated the Last Glacial Maximum (LGM) in only four. For all species assessed, six hotspots of high genetic divergence and diversity were concentrated in the Colorado Desert, along the Colorado River and in the Mojave/Sonoran ecotone. At least some proportion of the land within each recovered hotspot was categorized as protected, yet four of the six also overlapped with major areas of human development. Main conclusions: Most of the species studied here diversified into distinct Mojave and Sonoran lineages prior to the LGM – supporting older diversification hypotheses. Several evolutionary hotspots were recovered but are not strategically paired with areas of protected land. Long-term preservation of species-level biodiversity would entail selecting areas for protection in Mojave and Sonoran Deserts to retain divergent genetic diversity and ensure connectedness across environmental gradients.

  7. Divergences and estimating tight bounds on Bayes error with applications to multivariate Gaussian copula and latent Gaussian copula

    NASA Astrophysics Data System (ADS)

    Thelen, Brian J.; Xique, Ismael J.; Burns, Joseph W.; Goley, G. Steven; Nolan, Adam R.; Benson, Jonathan W.

    2017-04-01

    In Bayesian decision theory, there has been a great amount of research into theoretical frameworks and information-theoretic quantities that can be used to provide lower and upper bounds for the Bayes error. These include well-known bounds such as Chernoff, Bhattacharyya, and J-divergence. Part of the challenge of utilizing these various metrics in practice is (i) whether they are "loose" or "tight" bounds, (ii) how they might be estimated via either parametric or non-parametric methods, and (iii) how accurate the estimates are for limited amounts of data. In general what is desired is a methodology for generating relatively tight lower and upper bounds, and then an approach to estimate these bounds efficiently from data. In this paper, we explore the so-called triangle divergence which has been around for a while, but was made more prominent in some recent research on non-parametric estimation of information metrics. Part of this work is motivated by applications for quantifying fundamental information content in SAR/LIDAR data, and to help in this, we have developed a flexible multivariate modeling framework based on multivariate Gaussian copula models which can be combined with the triangle divergence framework to quantify this information, and provide approximate bounds on Bayes error. In this paper we present an overview of the bounds, including those based on triangle divergence, and verify that under a number of multivariate models, the upper and lower bounds derived from triangle divergence are significantly tighter than the other common bounds, and oftentimes, dramatically so. We also propose some simple but effective means for computing the triangle divergence using Monte Carlo methods, and then discuss estimation of the triangle divergence from empirical data based on Gaussian copula models.

  8. Phylogeographic patterns of the desert poplar in Northwest China shaped by both geology and climatic oscillations.

    PubMed

    Zeng, Yan-Fei; Zhang, Jian-Guo; Abuduhamiti, Bawerjan; Wang, Wen-Ting; Jia, Zhi-Qing

    2018-05-25

    The effects of historical geology and climatic events on the evolution of plants around the Qinghai-Tibetan Plateau region have been at the center of debate for years. To identify the influence of the uplift of the Tianshan Mountains and/or climatic oscillations on the evolution of plants in arid northwest China, we investigated the phylogeography of the Euphrates poplar (Populus euphratica) using chloroplast DNA (cpDNA) sequences and nuclear microsatellites, and estimated its historical distribution using Ecological Niche Modeling (ENM). We found that the Euphrates poplar differed from another desert poplar, P. pruinosa, in both nuclear and chloroplast DNA. The low clonal diversity in both populations reflected the low regeneration rate by seed/seedlings in many locations. Both cpDNA and nuclear markers demonstrated a clear divergence between the Euphrates poplar populations from northern and southern Xinjiang regions. The divergence time was estimated to be early Pleistocene based on cpDNA, and late Pleistocene using an Approximate Bayesian Computation analysis based on microsatellites. Estimated gene flow was low between these two regions, and the limited gene flow occurred mainly via dispersal from eastern regions. ENM analysis supported a wider distribution of the Euphrates poplar at 3 Ma, but a more constricted distribution during both the glacial period and the interglacial period. These results indicate that the deformation of the Tianshan Mountains has impeded gene flow of the Euphrates poplar populations from northern and southern Xinjiang, and the distribution constriction due to climatic oscillations further accelerated the divergence of populations from these regions. To protect the desert poplars, more effort is needed to encourage seed germination and seedling establishment, and to conserve endemic gene resources in the northern Xinjiang region.

  9. Purifying selection and genetic drift shaped Pleistocene evolution of the mitochondrial genome in an endangered Australian freshwater fish.

    PubMed

    Pavlova, A; Gan, H M; Lee, Y P; Austin, C M; Gilligan, D M; Lintermans, M; Sunnucks, P

    2017-05-01

    Genetic variation in mitochondrial genes could underlie metabolic adaptations because mitochondrially encoded proteins are directly involved in a pathway supplying energy to metabolism. Macquarie perch from river basins exposed to different climates differ in size and growth rate, suggesting potential presence of adaptive metabolic differences. We used complete mitochondrial genome sequences to build a phylogeny, estimate lineage divergence times and identify signatures of purifying and positive selection acting on mitochondrial genes for 25 Macquarie perch from three basins: Murray-Darling Basin (MDB), Hawkesbury-Nepean Basin (HNB) and Shoalhaven Basin (SB). Phylogenetic analysis resolved basin-level clades, supporting incipient speciation previously inferred from differentiation in allozymes, microsatellites and mitochondrial control region. The estimated time of lineage divergence suggested an early- to mid-Pleistocene split between SB and the common ancestor of HNB+MDB, followed by mid-to-late Pleistocene splitting between HNB and MDB. These divergence estimates are more recent than previous ones. Our analyses suggested that evolutionary drivers differed between inland MDB and coastal HNB. In the cooler and more climatically variable MDB, mitogenomes evolved under strong purifying selection, whereas in the warmer and more climatically stable HNB, purifying selection was relaxed. Evidence for relaxed selection in the HNB includes elevated transfer RNA and 16S ribosomal RNA polymorphism, presence of potentially mildly deleterious mutations and a codon (ATP6 113) displaying signatures of positive selection (ratio of nonsynonymous to synonymous substitution rates (dN/dS) >1, radical change of an amino-acid property and phylogenetic conservation across the Percichthyidae). In addition, the difference could be because of stronger genetic drift in the smaller and historically more subdivided HNB with low per-population effective population sizes.

  10. Genomic timetree and historical biogeography of Caribbean island ameiva lizards (Pholidoscelis: Teiidae).

    PubMed

    Tucker, Derek B; Hedges, Stephen Blair; Colli, Guarino R; Pyron, Robert Alexander; Sites, Jack W

    2017-09-01

    The phylogenetic relationships and biogeographic history of Caribbean island ameivas (Pholidoscelis) are not well-known because of incomplete sampling, conflicting datasets, and poor support for many clades. Here, we use phylogenomic and mitochondrial DNA datasets to reconstruct a well-supported phylogeny and assess historical colonization patterns in the group. We obtained sequence data from 316 nuclear loci and one mitochondrial marker for 16 of 19 extant species of the Caribbean endemic genus Pholidoscelis. Phylogenetic analyses were carried out using both concatenation and species tree approaches. To estimate divergence times, we used fossil teiids to calibrate a timetree which was used to elucidate the historical biogeography of these lizards. All phylogenetic analyses recovered four well-supported species groups (clades) recognized previously and supported novel relationships of those groups, including a (P. auberi + P. lineolatus) clade (western + central Caribbean), and a (P. exsul + P. plei) clade (eastern Caribbean). Divergence between Pholidoscelis and its sister clade was estimated to have occurred ~25 Ma, with subsequent diversification on Caribbean islands occurring over the last 11 Myr. Of the six models compared in the biogeographic analyses, the scenario which considered the distance among islands and allowed dispersal in all directions best fit the data. These reconstructions suggest that the ancestor of this group colonized either Hispaniola or Puerto Rico from Middle America. We provide a well-supported phylogeny of Pholidoscelis with novel relationships not reported in previous studies that were based on significantly smaller datasets. We propose that Pholidoscelis colonized the eastern Greater Antilles from Middle America based on our biogeographic analysis, phylogeny, and divergence time estimates. The closing of the Central American Seaway and subsequent formation of the modern Atlantic meridional overturning circulation may have promoted dispersal in this group.

  11. Genetic variability in Melipona quinquefasciata (Hymenoptera, Apidae, Meliponini) from northeastern Brazil determined using the first internal transcribed spacer (ITS1).

    PubMed

    Pereira, J O P; Freitas, B M; Jorge, D M M; Torres, D C; Soares, C E A; Grangeiro, T B

    2009-01-01

    Melipona quinquefasciata is a ground-nesting South American stingless bee whose geographic distribution was believed to comprise only the central and southern states of Brazil. We obtained partial sequences (about 500-570 bp) of first internal transcribed spacer (ITS1) nuclear ribosomal DNA from Melipona specimens putatively identified as M. quinquefasciata collected from different localities in northeastern Brazil. To confirm the taxonomic identity of the northeastern samples, specimens from the state of Goiás (Central region of Brazil) were included for comparison. All sequences were deposited in GenBank (accession numbers EU073751-EU073759). The mean nucleotide divergence (excluding sites with insertions/deletions) in the ITS1 sequences was only 1.4%, ranging from 0 to 4.1%. When the sites with insertions/deletions were also taken into account, sequence divergences varied from 0 to 5.3%. In all pairwise comparisons, the ITS1 sequence from the specimens collected in Goiás was most divergent compared to the ITS1 sequences of the bees from the other locations. However, neighbor-joining phylogenetic analysis showed that all ITS1 sequences from northeastern specimens along with the sample of Goiás were resolved in a single clade with a bootstrap support of 100%. The ITS1 sequencing data thus support the occurrence of M. quinquefasciata in northeast Brazil.

  12. Big and slow: phylogenetic estimates of molecular evolution in baleen whales (suborder Mysticeti).

    PubMed

    Jackson, J A; Baker, C S; Vant, M; Steel, D J; Medrano-González, L; Palumbi, S R

    2009-11-01

    Baleen whales are the largest animals that have ever lived. To develop an improved estimation of substitution rate for nuclear and mitochondrial DNA for this taxon, we implemented a relaxed-clock phylogenetic approach using three fossil calibration dates: the divergence between odontocetes and mysticetes approximately 34 million years ago (Ma), between the balaenids and balaenopterids approximately 28 Ma, and the time to most recent common ancestor within the Balaenopteridae approximately 12 Ma. We examined seven mitochondrial genomes, a large number of mitochondrial control region sequences (219 haplotypes for 465 bp) and nine nuclear introns representing five species of whales, within which multiple species-specific alleles were sequenced to account for within-species diversity (1-15 for each locus). The total data set represents >1.65 Mbp of mitogenome and nuclear genomic sequence. The estimated substitution rate for the humpback whale control region (3.9%/million years, My) was higher than previous estimates for baleen whales but slow relative to other mammal species with similar generation times (e.g., human-chimp mean rate > 20%/My). The mitogenomic third codon position rate was also slow relative to other mammals (mean estimate 1%/My compared with a mammalian average of 9.8%/My for the cytochrome b gene). The mean nuclear genomic substitution rate (0.05%/My) was substantially slower than average synonymous estimates for other mammals (0.21-0.37%/My across a range of studies). The nuclear and mitogenome rate estimates for baleen whales were thus roughly consistent with an 8- to 10-fold slowing due to a combination of large body size and long generation times. Surprisingly, despite the large data set of nuclear intron sequences, there was only weak and conflicting support for alternate hypotheses about the phylogeny of balaenopterid whales, suggesting that interspecies introgressions or a rapid radiation has obscured species relationships in the nuclear genome.

  13. Limited genetic divergence among Australian alpine Poa tussock grasses coupled with regional structuring points to ongoing gene flow and taxonomic challenges

    PubMed Central

    Griffin, Philippa C.; Hoffmann, Ary A.

    2014-01-01

    Background and Aims While molecular approaches can often accurately reconstruct species relationships, taxa that are incompletely differentiated pose a challenge even with extensive data. Such taxa are functionally differentiated, but may be genetically differentiated only at small and/or patchy regions of the genome. This issue is considered here in Poa tussock grass species that dominate grassland and herbfields in the Australian alpine zone. Methods Previously reported tetraploidy was confirmed in all species by sequencing seven nuclear regions and five microsatellite markers. A Bayesian approach was used to co-estimate nuclear and chloroplast gene trees with an overall dated species tree. The resulting species tree was used to examine species structure and recent hybridization, and intertaxon fertility was tested by experimental crosses. Key Results Species tree estimation revealed Poa gunnii, a Tasmanian endemic species, as sister to the rest of the Australian alpine Poa. The taxa have radiated in the last 0·5–1·2 million years and the non-gunnii taxa are not supported as genetically distinct. Recent hybridization following past species divergence was also not supported. Ongoing gene flow is suggested, with some broad-scale geographic structure within the group. Conclusions The Australian alpine Poa species are not genetically distinct despite being distinguishable phenotypically, suggesting recent adaptive divergence with ongoing intertaxon gene flow. This highlights challenges in using conventional molecular taxonomy to infer species relationships in recent, rapid radiations. PMID:24607721

  14. Mitochondrial DNA variation and phylogenetic relationships among five tuna species based on sequencing of D-loop region.

    PubMed

    Kumar, Girish; Kocour, Martin; Kunal, Swaraj Priyaranjan

    2016-05-01

    In order to assess the DNA sequence variation and phylogenetic relationship among five tuna species (Auxis thazard, Euthynnus affinis, Katsuwonus pelamis, Thunnus tonggol, and T. albacares) out of all four tuna genera, partial sequences of the mitochondrial DNA (mtDNA) D-loop region were analyzed. The estimate of intra-specific sequence variation in the studied species was low, ranging from 0.027 to 0.080 [Kimura's two-parameter distance (K2P)], whereas values of inter-specific variation ranged from 0.049 to 0.491. The longtail tuna (T. tonggol) and yellowfin tuna (T. albacares) were found to share a close relationship (K2P = 0.049), while skipjack tuna (K. pelamis) was the most divergent species studied. Phylogenetic analysis using Maximum-Likelihood (ML) and Neighbor-Joining (NJ) methods supported the monophyletic origin of Thunnus species. Similarly, the phylogeny of Auxis and Euthynnus species substantiates their monophyly. However, results showed a distinct origin of K. pelamis from genus Thunnus as well as Auxis and Euthynnus. Thus, the mtDNA D-loop region sequence data support the polyphyletic origin of tuna species.

  15. Phylogeny and Divergence Times of Gymnosperms Inferred from Single-Copy Nuclear Genes

    PubMed Central

    Guo, Dong-Mei; Yang, Zu-Yu; Wang, Xiao-Quan

    2014-01-01

    Phylogenetic reconstruction is fundamental to study evolutionary biology and historical biogeography. However, there was not a molecular phylogeny of gymnosperms represented by extensive sampling at the genus level, and most published phylogenies of this group were constructed based on cytoplasmic DNA markers and/or the multi-copy nuclear ribosomal DNA. In this study, we use LFY and NLY, two single-copy nuclear genes that originated from an ancient gene duplication in the ancestor of seed plants, to reconstruct the phylogeny and estimate divergence times of gymnosperms based on a complete sampling of extant genera. The results indicate that the combined LFY and NLY coding sequences can resolve interfamilial relationships of gymnosperms and intergeneric relationships of most families. Moreover, the addition of intron sequences can improve the resolution in Podocarpaceae but not in cycads, although divergence times of the cycad genera are similar to or longer than those of the Podocarpaceae genera. Our study strongly supports cycads as the basal-most lineage of gymnosperms rather than sister to Ginkgoaceae, and a sister relationship between Podocarpaceae and Araucariaceae and between Cephalotaxaceae-Taxaceae and Cupressaceae. In addition, intergeneric relationships of some families that were controversial, and the relationships between Taxaceae and Cephalotaxaceae and between conifers and Gnetales are discussed based on the nuclear gene evidence. The molecular dating analysis suggests that drastic extinctions occurred in the early evolution of gymnosperms, and extant coniferous genera in the Northern Hemisphere are older than those in the Southern Hemisphere on average. This study provides an evolutionary framework for future studies on gymnosperms. PMID:25222863

  16. Phylogeography of the sand dollar genus Mellita: cryptic speciation along the coasts of the Americas.

    PubMed

    Coppard, Simon E; Zigler, Kirk S; Lessios, H A

    2013-12-01

    Sand dollars of the genus Mellita are members of the sandy shallow-water fauna. The genus ranges in tropical and subtropical regions on the two coasts of the Americas. To reconstruct the phylogeography of the genus we sequenced parts of the mitochondrial cytochrome oxidase I and of 16S rRNA as well as part of the nuclear 28S rRNA gene from a total of 185 specimens of all ten described morphospecies from 31 localities. Our analyses revealed the presence of eleven species, including six cryptic species. Sequences of five morphospecies do not constitute monophyletic molecular units and thus probably represent ecophenotypic variants. The fossil-calibrated phylogeny showed that the ancestor of Mellita diverged into a Pacific lineage and an Atlantic+Pacific lineage close to the Miocene/Pliocene boundary. Atlantic M. tenuis, M. quinquiesperforata and two undescribed species of Mellita have non-overlapping distributions. Pacific Mellita consist of two highly divergent lineages that became established at different times, resulting in sympatric M. longifissa and M. notabilis. Judged by modern day ranges, not all divergence in this genus conforms to an allopatric speciation model. Only the separation of M. quinquiesperforata from M. notabilis is clearly due to vicariance as the result of the completion of the Isthmus of Panama. The molecular phylogeny calibrated on fossil evidence estimated this event as having occurred ~3 Ma, thus providing evidence that, contrary to a recent proposal, the central American Isthmus was not completed until this date. Copyright © 2013 Elsevier Inc. All rights reserved.

  17. Investigating the Genetic Diversity, Population Differentiation and Population Dynamics of Cycas segmentifida (Cycadaceae) Endemic to Southwest China by Multiple Molecular Markers

    PubMed Central

    Feng, Xiuyan; Liu, Jian; Chiang, Yu-Chung; Gong, Xun

    2017-01-01

    Climate change, species dispersal ability and habitat fragmentation are major factors influencing species distribution and genetic diversity, especially for the range-restricted and threatened taxa. Here, using four sequences of chloroplast DNAs (cpDNAs), three nuclear genes (nDNAs) and 12 nuclear microsatellites (SSRs), we investigated the genetic diversity, genetic structure, divergence time and population dynamics of Cycas segmentifida D. Y. Wang and C. Y. Deng, a threatened cycad species endemic to Southwest China. High levels of genetic diversity and genetic differentiation were revealed in C. segmentifida. Haplotypes of networks showed two evolutionary units in C. segmentifida, with the exception of the nuclear gene GTP network. Meanwhile, the UPGMA tree, structure and PCoA analyses suggested that 14 populations of C. segmentifida were divided into two clades. There was significant effect of isolation by distance (IBD) in this species. However, this species did not display a significant phylogeographic structure. The divergence time estimation suggested that its haplotypes diverged during the Middle Pleistocene. Additionally, the population dynamics inferred from different DNA sequences analyses were discordant. Bottleneck analysis showed that populations of C. segmentifida did not experience any recent bottleneck effect, but rather pointed to a contraction of its effective population size over time. Furthermore, our results suggested that the population BM which held an intact population structure and occupied undisturbed habitat was at the Hardy–Weinberg equilibrium, implying that this population is a free-mating system. These genetic features provide important information for the sustainable management of C. segmentifida. PMID:28580005

  18. Mhc class II B gene evolution in East African cichlid fishes.

    PubMed

    Figueroa, F; Mayer, W E; Sültmann, H; O'hUigin, C; Tichy, H; Satta, Y; Takezaki, N; Takahata, N; Klein, J

    2000-06-01

    A distinctive feature of essential major histocompatibility complex (Mhc) loci is their polymorphism characterized by large genetic distances between alleles and long persistence times of allelic lineages. Since the lineages often span several successive speciations, we investigated the behavior of the Mhc alleles during or close to the speciation phase. We sequenced exon 2 of the class II B locus 4 from 232 East African cichlid fishes representing 32 related species. The divergence times of the (sub)species ranged from 6,000 to 8.4 million years. Two types of evolutionary analysis were used to elucidate the pattern of exon 2 sequence divergence. First, phylogenetic methods were applied to reconstruct the most likely evolutionary pathways leading from the last common ancestor of the set to the extant sequences, and to assess the probable mechanisms involved in allelic diversification. Second, pairwise comparisons of sequences were carried out to detect differences seemingly incompatible with origin by nonparallel point mutations. The analysis revealed point mutations to be the most important mechanism behind allelic divergences, with recombination playing only an auxiliary part. Comparison of sequences from related species revealed evidence of random allelic (lineage) losses apparently associated with speciation. Sharing of identical alleles could be demonstrated between species that diverged 2 million years ago. The phylogeny of the exon was incongruent with that of the flanking introns, indicating either a high degree of convergent evolution at the peptide-binding region-encoding sites, or intron homogenization.

  19. Starmerella reginensis f.a., sp. nov. and Starmerella kourouensis f.a., sp. nov., isolated from flowers in French Guiana.

    PubMed

    Amoikon, Tiemele Laurent Simon; Grondin, Cécile; Djéni, Théodore N'Dédé; Jacques, Noémie; Casaregola, Serge

    2018-05-21

    Analysis of yeasts isolated from various biotopes in French Guiana led to the identification of two strains isolated from flowers and designated CLIB 1634T and CLIB 1707T. Comparison of the D1/D2 domain of the large subunit (LSU D1/D2) rRNA gene sequences of CLIB 1634T and CLIB 1707T to those in the GenBank database revealed that these strains belong to the Starmerella clade. Strain CLIB 1634T was shown to diverge from the closely related Starmerella apicola type strain CBS 2868T with a sequence divergence of 1.34 and 1.30 % in the LSU D1/D2 rRNA gene and internal transcribed spacer (ITS) sequences, respectively. Strain CLIB 1634T and Candida apicola CBS 2868T diverged by 3.81 and 14.96 % at the level of the protein-coding gene partial sequences EF-1α and RPB2, respectively. CLIB 1707T was found to have sequence divergence of 3.88 and 9.16 % in the LSU D1/D2 rRNA gene and ITS, respectively, from that of the most closely related species Starmerella ratchasimensis type strain CBS 10611T. The species Starmerella reginensis f.a., sp. nov. and Starmerella kourouensis f.a., sp. nov. are proposed to accommodate strains CLIB 1634T (=CBS 15247T) and CLIB 1707T (=CBS 15257T), respectively.

  20. Intraspecific variation in Cryptocaryon irritans.

    PubMed

    Diggles, B K; Adlard, R D

    1997-01-01

    Intraspecific variation in the ciliate Cryptocaryon irritans was examined using sequences of the first internal transcribed spacer region (ITS-1) of ribosomal DNA (rDNA) combined with developmental and morphological characters. Amplified rDNA sequences consisting of 151 bases of the flanking 18 S and 5.8 S regions, and the entire ITS-1 region (169 or 170 bases), were determined and compared for 16 isolates of C. irritans from Australia, Israel and the USA. There was one variable base between isolates in the 18 S region and 11 variable bases in the ITS-1 region. Despite their similar morphology, significant sequence variation (4.1% divergence) and developmental differences indicate that Australian C. irritans isolates from estuarine (Moreton Bay) and coral reef (Heron Island) environments are distinct. The Heron Island isolate was genetically closer to morphologically dissimilar isolates from Israel (1.8% divergence) and the USA (2.3% divergence) than it was to the Moreton Bay isolates. Three isolates maintained in our laboratory since February 1994 differed in sequence from earlier laboratory isolates (2.9% to 3.5% divergence), even though all were similar morphologically and originated from the same source. During this time the sequence of the isolates from wild fish in Moreton Bay remained unchanged. These genetic differences indicate the existence of a founder effect in laboratory populations of C. irritans. The genetic variation found here, combined with known morphological and developmental differences, is used to characterise four strains of C. irritans.
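
    Percent divergence figures like those quoted in the record above are commonly reported as uncorrected pairwise differences over aligned sites. The sketch below illustrates that calculation under stated assumptions: the sequences are already aligned, gap columns are skipped, and the function name and toy sequences are hypothetical. It is not necessarily the procedure the authors used.

```python
def percent_divergence(seq_a, seq_b, gap="-"):
    """Uncorrected p-distance (in %) between two pre-aligned sequences.

    Columns where either sequence has a gap are ignored (an assumption of
    this sketch; other conventions count gaps as differences).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differences = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == gap or b == gap:
            continue  # skip gap columns
        compared += 1
        if a != b:
            differences += 1
    if compared == 0:
        raise ValueError("no comparable (ungapped) sites")
    return 100.0 * differences / compared


# Toy aligned sequences (hypothetical, not data from the record above).
toy_a = "ACGTACGTAC"
toy_b = "ACGTACGTAT"
print(percent_divergence(toy_a, toy_b))
```

    Skipping gap columns keeps the denominator to sites that can actually be compared, which matters when aligned regions differ in length by a base or two.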

  1. Branch length estimation and divergence dating: estimates of error in Bayesian and maximum likelihood frameworks.

    PubMed

    Schwartz, Rachel S; Mueller, Rachel L

    2010-01-11

    Estimates of divergence dates between species improve our understanding of processes ranging from nucleotide substitution to speciation. Such estimates are frequently based on molecular genetic differences between species; therefore, they rely on accurate estimates of the number of such differences (i.e. substitutions per site, measured as branch length on phylogenies). We used simulations to determine the effects of dataset size, branch length heterogeneity, branch depth, and analytical framework on branch length estimation across a range of branch lengths. We then reanalyzed an empirical dataset for plethodontid salamanders to determine how inaccurate branch length estimation can affect estimates of divergence dates. The accuracy of branch length estimation varied with branch length, dataset size (both number of taxa and sites), branch length heterogeneity, branch depth, dataset complexity, and analytical framework. For simple phylogenies analyzed in a Bayesian framework, branches were increasingly underestimated as branch length increased; in a maximum likelihood framework, longer branch lengths were somewhat overestimated. Longer datasets improved estimates in both frameworks; however, when the number of taxa was increased, estimation accuracy for deeper branches was less than for tip branches. Increasing the complexity of the dataset produced more misestimated branches in a Bayesian framework; however, in an ML framework, more branches were estimated more accurately. Using ML branch length estimates to re-estimate plethodontid salamander divergence dates generally resulted in an increase in the estimated age of older nodes and a decrease in the estimated age of younger nodes. Branch lengths are misestimated in both statistical frameworks for simulations of simple datasets. However, for complex datasets, length estimates are quite accurate in ML (even for short datasets), whereas few branches are estimated accurately in a Bayesian framework. 
Our reanalysis of empirical data demonstrates the magnitude of effects of Bayesian branch length misestimation on divergence date estimates. Because the length of branches for empirical datasets can be estimated most reliably in an ML framework when branches are <1 substitution/site and datasets are ≥1 kb, we suggest that divergence date estimates using datasets, branch lengths, and/or analytical techniques that fall outside of these parameters should be interpreted with caution.

  2. Empirical and Bayesian approaches to fossil-only divergence times: A study across three reptile clades.

    PubMed

    Turner, Alan H; Pritchard, Adam C; Matzke, Nicholas J

    2017-01-01

    Estimating divergence times on phylogenies is critical in paleontological and neontological studies. Chronostratigraphically-constrained fossils are the only direct evidence of absolute timing of species divergence. Strict temporal calibration of fossil-only phylogenies provides minimum divergence estimates, and various methods have been proposed to estimate divergences beyond these minimum values. We explore the utility of simultaneous estimation of tree topology and divergence times using BEAST tip-dating on datasets consisting only of fossils by using relaxed morphological clocks and birth-death tree priors that include serial sampling (BDSS) at a constant rate through time. We compare BEAST results to those from the traditional maximum parsimony (MP) and undated Bayesian inference (BI) methods. Three overlapping datasets were used that span 250 million years of archosauromorph evolution leading to crocodylians. The first dataset focuses on early Sauria (31 taxa, 240 chars.), the second on early Archosauria (76 taxa, 400 chars.) and the third on Crocodyliformes (101 taxa, 340 chars.). For each dataset three time-calibrated trees (timetrees) were calculated: a minimum-age timetree with node ages based on earliest occurrences in the fossil record; a 'smoothed' timetree using a range of time added to the root that is then averaged over zero-length internodes; and a tip-dated timetree. Comparisons within datasets show that the smoothed and tip-dated timetrees provide similar estimates. Only near the root node do BEAST estimates fall outside the smoothed timetree range. The BEAST model is not able to overcome limited sampling to correctly estimate divergences considerably older than sampled fossil occurrence dates. Conversely, the smoothed timetrees consistently provide node-ages far older than the strict dates or BEAST estimates for morphologically conservative sister-taxa when they sit on long ghost lineages. 
In this latter case, the relaxed-clock model appears to be correctly moderating the node-age estimate based on the limited morphological divergence. Topologies are generally similar across analyses, but BEAST trees for crocodyliforms differ when clades are deeply nested but contain very old taxa. It appears that the constant-rate sampling assumption of the BDSS tree prior influences topology inference by disfavoring long, unsampled branches.

  3. Empirical and Bayesian approaches to fossil-only divergence times: A study across three reptile clades

    PubMed Central

    Turner, Alan H.; Pritchard, Adam C.; Matzke, Nicholas J.

    2017-01-01

    Estimating divergence times on phylogenies is critical in paleontological and neontological studies. Chronostratigraphically-constrained fossils are the only direct evidence of absolute timing of species divergence. Strict temporal calibration of fossil-only phylogenies provides minimum divergence estimates, and various methods have been proposed to estimate divergences beyond these minimum values. We explore the utility of simultaneous estimation of tree topology and divergence times using BEAST tip-dating on datasets consisting only of fossils by using relaxed morphological clocks and birth-death tree priors that include serial sampling (BDSS) at a constant rate through time. We compare BEAST results to those from the traditional maximum parsimony (MP) and undated Bayesian inference (BI) methods. Three overlapping datasets were used that span 250 million years of archosauromorph evolution leading to crocodylians. The first dataset focuses on early Sauria (31 taxa, 240 chars.), the second on early Archosauria (76 taxa, 400 chars.) and the third on Crocodyliformes (101 taxa, 340 chars.). For each dataset three time-calibrated trees (timetrees) were calculated: a minimum-age timetree with node ages based on earliest occurrences in the fossil record; a ‘smoothed’ timetree using a range of time added to the root that is then averaged over zero-length internodes; and a tip-dated timetree. Comparisons within datasets show that the smoothed and tip-dated timetrees provide similar estimates. Only near the root node do BEAST estimates fall outside the smoothed timetree range. The BEAST model is not able to overcome limited sampling to correctly estimate divergences considerably older than sampled fossil occurrence dates. Conversely, the smoothed timetrees consistently provide node-ages far older than the strict dates or BEAST estimates for morphologically conservative sister-taxa when they sit on long ghost lineages. 
In this latter case, the relaxed-clock model appears to be correctly moderating the node-age estimate based on the limited morphological divergence. Topologies are generally similar across analyses, but BEAST trees for crocodyliforms differ when clades are deeply nested but contain very old taxa. It appears that the constant-rate sampling assumption of the BDSS tree prior influences topology inference by disfavoring long, unsampled branches. PMID:28187191

  4. Identification of cardiac rhythm features by mathematical analysis of vector fields.

    PubMed

    Fitzgerald, Tamara N; Brooks, Dana H; Triedman, John K

    2005-01-01

    Automated techniques for locating cardiac arrhythmia features are limited, and cardiologists generally rely on isochronal maps to infer patterns in the cardiac activation sequence during an ablation procedure. Velocity vector mapping has been proposed as an alternative method to study cardiac activation in both clinical and research environments. In addition to the visual cues that vector maps can provide, vector fields can be analyzed using mathematical operators such as the divergence and curl. In the current study, conduction features were extracted from velocity vector fields computed from cardiac mapping data. The divergence was used to locate ectopic foci and wavefront collisions, and the curl to identify central obstacles in reentrant circuits. Both operators were applied to simulated rhythms created from a two-dimensional cellular automaton model, to measured data from an in situ experimental canine model, and to complex three-dimensional human cardiac mapping data sets. Analysis of simulated vector fields indicated that the divergence is useful in identifying ectopic foci, with a relatively small number of vectors and with errors of up to 30 degrees in the angle measurements. The curl was useful for identifying central obstacles in reentrant circuits, and the number of velocity vectors needed increased as the rhythm became more complex. The divergence was able to accurately identify canine in situ pacing sites, areas of breakthrough activation, and wavefront collisions. In data from human arrhythmias, the divergence reliably estimated origins of electrical activity and wavefront collisions, but the curl was less reliable at locating central obstacles in reentrant circuits, possibly due to the retrospective nature of data collection. The results indicate that the curl and divergence operators applied to velocity vector maps have the potential to add valuable information in cardiac mapping and can be used to supplement human pattern recognition.
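
    The divergence and curl operators mentioned in the record above can be approximated on a regular grid with central differences. The following is a minimal sketch, assuming a 2-D field sampled on a uniform grid with spacing h; the function name and the toy fields are hypothetical and are not taken from the study.

```python
def divergence_and_curl(vx, vy, h=1.0):
    """Central-difference divergence and curl (z-component) of a 2-D field.

    vx[i][j], vy[i][j] are the x and y velocity components at row i
    (y index) and column j (x index). Only interior points are computed;
    boundary entries are left at 0.0.
    """
    rows, cols = len(vx), len(vx[0])
    div = [[0.0] * cols for _ in range(rows)]
    curl = [[0.0] * cols for _ in range(rows)]
    for i in range(1, rows - 1):
        for j in range(1, cols - 1):
            dvx_dx = (vx[i][j + 1] - vx[i][j - 1]) / (2 * h)
            dvy_dy = (vy[i + 1][j] - vy[i - 1][j]) / (2 * h)
            dvy_dx = (vy[i][j + 1] - vy[i][j - 1]) / (2 * h)
            dvx_dy = (vx[i + 1][j] - vx[i - 1][j]) / (2 * h)
            div[i][j] = dvx_dx + dvy_dy   # positive at a source
            curl[i][j] = dvy_dx - dvx_dy  # nonzero around a rotation centre
    return div, curl


# Toy field: vectors pointing away from the grid centre, v = (x, y).
n = 5
source_vx = [[float(j - 2) for j in range(n)] for i in range(n)]
source_vy = [[float(i - 2) for j in range(n)] for i in range(n)]
div, curl = divergence_and_curl(source_vx, source_vy)
print(div[2][2], curl[2][2])
```

    In this toy case the outward-pointing field has positive divergence and zero curl at the centre, which matches the intuition of a focal source; a purely rotating field gives the opposite pattern.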

  5. Bias Reduction and Filter Convergence for Long Range Stereo

    NASA Technical Reports Server (NTRS)

    Sibley, Gabe; Matthies, Larry; Sukhatme, Gaurav

    2005-01-01

    We are concerned here with improving long range stereo by filtering image sequences. Traditionally, measurement errors from stereo camera systems have been approximated as 3-D Gaussians, where the mean is derived by triangulation and the covariance by linearized error propagation. However, there are two problems that arise when filtering such 3-D measurements. First, stereo triangulation suffers from a range dependent statistical bias; when filtering this leads to over-estimating the true range. Second, filtering 3-D measurements derived via linearized error propagation leads to apparent filter divergence; the estimator is biased to under-estimate range. To address the first issue, we examine the statistical behavior of stereo triangulation and show how to remove the bias by series expansion. The solution to the second problem is to filter with image coordinates as measurements instead of triangulated 3-D coordinates.
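
    The range-dependent bias described in the record above can be illustrated with a second-order Taylor expansion of the triangulation formula z = f·b/d. If the disparity d carries zero-mean noise with standard deviation σ, the expected range is approximately f·b/d + f·b·σ²/d³, so triangulated range is overestimated on average. The sketch below assumes a rectified pinhole stereo pair and Gaussian disparity noise; all names are hypothetical, and this is not necessarily the expansion used by the authors.

```python
def stereo_range(focal_px, baseline_m, disparity_px):
    """Triangulated range z = f * b / d for a rectified stereo pair."""
    return focal_px * baseline_m / disparity_px


def second_order_range_bias(focal_px, baseline_m, disparity_px, sigma_px):
    """Approximate bias E[z] - z_true from a second-order Taylor expansion.

    Assumes zero-mean disparity noise with standard deviation sigma_px.
    The bias grows as 1/d^3, i.e. rapidly with range.
    """
    return focal_px * baseline_m * sigma_px ** 2 / disparity_px ** 3


def bias_corrected_range(focal_px, baseline_m, disparity_px, sigma_px):
    """Triangulated range with the approximate bias term subtracted."""
    return (stereo_range(focal_px, baseline_m, disparity_px)
            - second_order_range_bias(focal_px, baseline_m,
                                      disparity_px, sigma_px))


# Hypothetical camera: 1000 px focal length, 0.5 m baseline, 0.5 px noise.
print(stereo_range(1000.0, 0.5, 5.0))
print(second_order_range_bias(1000.0, 0.5, 5.0, 0.5))
```

    Because the bias term scales with 1/d³, it is negligible at short range and dominant at long range, which is why it matters specifically for long range stereo.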

  6. Biophysical models of protein evolution: Understanding the patterns of evolutionary sequence divergence

    PubMed Central

    Echave, Julian; Wilke, Claus O.

    2018-01-01

    For decades, rates of protein evolution have been interpreted in terms of the vague concept of “functional importance”. Slowly evolving proteins or sites within proteins were assumed to be more functionally important and thus subject to stronger selection pressure. More recently, biophysical models of protein evolution, which combine evolutionary theory with protein biophysics, have completely revolutionized our view of the forces that shape sequence divergence. Slowly evolving proteins have been found to evolve slowly because of selection against toxic misfolding and misinteractions, linking their rate of evolution primarily to their abundance. Similarly, most slowly evolving sites in proteins are not directly involved in function, but mutating them has large impacts on protein structure and stability. Here, we review the studies of the emergent field of biophysical protein evolution that have shaped our current understanding of sequence divergence patterns. We also propose future research directions to develop this nascent field. PMID:28301766

  7. Evidence of Divergent Amino Acid Usage in Comparative Analyses of R5- and X4-Associated HIV-1 Vpr Sequences

    PubMed Central

    Antell, Gregory C.; Zhong, Wen; Kercher, Katherine; Passic, Shendra; Williams, Jean; Liu, Yucheng; James, Tony; Jacobson, Jeffrey M.; Szep, Zsofia

    2017-01-01

    Vpr is an HIV-1 accessory protein that plays numerous roles during viral replication, some of which are cell type dependent. To test the hypothesis that HIV-1 tropism extends beyond the envelope into the vpr gene, studies were performed to identify the associations between coreceptor usage and Vpr variation in HIV-1-infected patients. Colinear HIV-1 Env-V3 and Vpr amino acid sequences were obtained from the LANL HIV-1 sequence database and from well-suppressed patients in the Drexel/Temple Medicine CNS AIDS Research and Eradication Study (CARES) Cohort. Genotypic classification of Env-V3 sequences as X4 (CXCR4-utilizing) or R5 (CCR5-utilizing) was used to group colinear Vpr sequences. To reveal the sequences associated with a specific coreceptor usage genotype, Vpr amino acid sequences were assessed for amino acid diversity and Jensen-Shannon divergence between the two groups. Five amino acid alphabets were used to comprehensively examine the impact of amino acid substitutions involving side chains with similar physicochemical properties. Positions 36, 37, 41, 89, and 96 of Vpr were characterized by statistically significant divergence across multiple alphabets when X4 and R5 sequence groups were compared. In addition, consensus amino acid switches were found at positions 37 and 41 in comparisons of the R5 and X4 sequence populations. These results suggest an evolutionary link between Vpr and gp120 in HIV-1-infected patients. PMID:28620613
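
    The per-position Jensen-Shannon divergence named in the record above compares the residue frequency distributions of two sequence groups at one alignment column. The following is a minimal sketch, assuming equal weighting of the two groups and base-2 logarithms (so the result lies between 0 and 1); the function name and toy columns are hypothetical, and the study's significance testing and reduced alphabets are not reproduced.

```python
from collections import Counter
from math import log2


def js_divergence(column_a, column_b):
    """Jensen-Shannon divergence (bits) between two alignment columns.

    column_a and column_b are iterables of residues observed at the same
    position in two sequence groups (for example, R5 and X4 sequences).
    """
    counts_a, counts_b = Counter(column_a), Counter(column_b)
    total_a, total_b = sum(counts_a.values()), sum(counts_b.values())
    if total_a == 0 or total_b == 0:
        raise ValueError("both columns must contain at least one residue")
    jsd = 0.0
    for residue in set(counts_a) | set(counts_b):
        p = counts_a[residue] / total_a
        q = counts_b[residue] / total_b
        m = 0.5 * (p + q)  # mixture distribution
        if p > 0:
            jsd += 0.5 * p * log2(p / m)
        if q > 0:
            jsd += 0.5 * q * log2(q / m)
    return jsd


# Toy columns (hypothetical residues, not data from the study).
print(js_divergence("IIIIIL", "RRRRRI"))
```

    Identical columns give 0 and columns with no shared residues give 1, so positions can be ranked on a common scale regardless of alphabet size.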

  8. Crown Group Lejeuneaceae and Pleurocarpous Mosses in Early Eocene (Ypresian) Indian Amber.

    PubMed

    Heinrichs, Jochen; Scheben, Armin; Bechteler, Julia; Lee, Gaik Ee; Schäfer-Verwimp, Alfons; Hedenäs, Lars; Singh, Hukam; Pócs, Tamás; Nascimbene, Paul C; Peralta, Denilson F; Renner, Matt; Schmidt, Alexander R

    2016-01-01

    Cambay amber originates from the warmest period of the Eocene, which is also well known for the appearance of early angiosperm-dominated megathermal forests. The humid climate of these forests may have triggered the evolution of epiphytic lineages of bryophytes; however, early Eocene fossils of bryophytes are rare. Here, we present evidence for lejeuneoid liverworts and pleurocarpous mosses in Cambay amber. The preserved morphology of the moss fossil is inconclusive for a detailed taxonomic treatment. The liverwort fossil is, however, distinctive; its zig-zagged stems, suberect complicate-bilobed leaves, large leaf lobules, and small, deeply bifid underleaves suggest a member of Lejeuneaceae subtribe Lejeuneinae (Harpalejeunea, Lejeunea, Microlejeunea). We tested alternative classification possibilities by conducting divergence time estimates based on DNA sequence variation of Lejeuneinae using the age of the fossil for corresponding age constraints. Consideration of the fossil as a stem group member of Microlejeunea or Lejeunea resulted in an Eocene to Late Cretaceous age of the Lejeuneinae crown group. This reconstruction is in good accordance with published divergence time estimates generated without the newly presented fossil evidence. Balancing available evidence, we describe the liverwort fossil as the extinct species Microlejeunea nyiahae, representing the oldest crown group fossil of Lejeuneaceae.

  9. Inferring the demographic history of European Ficedula flycatcher populations

    PubMed Central

    2013-01-01

    Background: Inference of population and species histories and population stratification using genetic data is important for discriminating between different speciation scenarios and for correct interpretation of genome scans for signs of adaptive evolution and trait association. Here we use data from 24 intronic loci re-sequenced in population samples of two closely related species, the pied flycatcher and the collared flycatcher. Results: We applied Isolation-Migration models, assignment analyses and estimated the genetic differentiation and diversity between species and between populations within species. The data indicate a divergence time between the species of <1 million years, significantly shorter than previous estimates using mtDNA, point to a scenario with unidirectional gene-flow from the pied flycatcher into the collared flycatcher and imply that barriers to hybridisation are still permeable in a recently established hybrid zone. Furthermore, we detect significant population stratification, predominantly between the Spanish population and other pied flycatcher populations. Conclusions: Our results provide further evidence for a divergence process where different genomic regions may be at different stages of speciation. We also conclude that forthcoming analyses of genotype-phenotype relations in these ecological model species should be designed to take population stratification into account. PMID:23282063

  10. Diversification in the northern neotropics: mitochondrial and nuclear DNA phylogeography of the iguana Ctenosaura pectinata and related species.

    PubMed

    Zarza, Eugenia; Reynoso, Victor H; Emerson, Brent C

    2008-07-01

    While Quaternary climatic changes are considered by some to have been a major factor promoting speciation within the neotropics, others suggest that much of the neotropical species diversity originated before the Pleistocene. Using mitochondrial and nuclear sequence data, we evaluate the relative importance of Pleistocene and pre-Pleistocene events within the evolutionary history of the Mexican iguana Ctenosaura pectinata, and related species. Results support the existence of cryptic lineages with strong mitochondrial divergence (> 4%) among them. Some of these lineages form zones of secondary contact, with one of them hybridizing with C. hemilopha. Evolutionary network analyses reveal the oldest populations of C. pectinata to be those of the northern and southern Mexican coastal regions. Inland and mid-latitudinal coastal populations are younger in age as a consequence of a history of local extinction within these regions followed by re-colonization. Estimated divergence times suggest that C. pectinata originated during the Pliocene, whereas geographically distinct mitochondrial DNA lineages first started to diverge during the Pliocene, with subsequent divergence continuing through the Pleistocene. Our results highlight the influence of both Pliocene and Pleistocene events in shaping the geographical distribution of genetic variation within neotropical lowland organisms. Areas of high genetic diversity in southern Mexico were detected; this finding, together with the high levels of genetic diversity within C. pectinata, has implications for the conservation of this threatened species.

  11. The Role of the Y-Chromosome in the Establishment of Murine Hybrid Dysgenesis and in the Analysis of the Nucleotide Sequence Organization, Genetic Transmission and Evolution of Repeated Sequences.

    NASA Astrophysics Data System (ADS)

    Nallaseth, Ferez Soli

    The Y-chromosome presents a unique cytogenetic framework for the evolution of nucleotide sequences. Alignment of nine Y-chromosomal fragments in their increasing Y-specific/non Y-specific (male/female) sequence divergence ratios was directly and inversely related to their interspersion on these two respective genomic fractions. Sequence analysis confirmed a direct relationship between divergence ratios and the Alu, LINE-1, Satellite and their derivative oligonucleotide contents. Thus their relocation on the Y-chromosome is followed by sequence divergence rather than the well documented concerted evolution of these non-coding progenitor repeated sequences. Five of the nine Y-chromosomal fragments are non-pseudoautosomal and transcribed into heterogeneous PolyA+ RNA and thus can be retrotransposed. Evolutionary and computer analysis identified homologous oligonucleotide tracts in several human loci suggesting common and random mechanistic origins. Dysgenic genomes represent the accelerated evolution driving sequence divergence (McClintock, 1984). Sex reversal and sterility characterizing dysgenesis occurs in C57BL/6JY^Pos but not in 129/SvY^Pos derivative strains. High frequency, random, multi-locus deletion products of the feral Y^Pos-chromosome are generated in the germlines of F1(C57BL/6J X 129/SvY^Pos)(male) and C57BL/6JY^Pos(male) but not in 129/SvY^Pos(male). Equal, 10^-1, 10^-2, and 0 copies (relative to males) of Y^Pos-specific deletion products respectively characterize C57BL/6JY^Pos (HC), (LC), (T) and (F) females. The testes determining loci of inactive Y^Pos-chromosomes in C57BL/6JY^Pos HC females are the preferentially deleted/rearranged Y^Pos-sequences. Disruption of regulation of plasma testosterone and hepatic MUP-A mRNA levels, TRD of a 4.7 Kbp EcoR1 fragment suggest disruption of autosomal/X-chromosomal sequences. 
These data and the highly repeated progenitor (Alu, GATA, LINE-1) sequence content of deletion products confirmed the previously unidentified loss of genetic control of mammalian chromosome biology and hybrid dysgenesis.

  12. Phylogeny of world stag beetles (Coleoptera: Lucanidae) reveals a Gondwanan origin of Darwin's stag beetle.

    PubMed

    Kim, Sang Il; Farrell, Brian D

    2015-05-01

    Stag beetles (family Lucanidae Latreille, 1804) are one of the earliest branching lineages of scarab beetles that are characterized by the striking development of the male mandibles. Despite stag beetles' popularity among traditional taxonomists and amateur collectors, there has been almost no study of lucanid relationships and evolution. Entomologists, including Jeannel (1942), have long recognized resemblance between the austral stag beetles of the tribes Chiasognathini, Colophonini, Lamprimini, Pholidotini, Rhyssonotini, and Streptocerini, but this hypothesis of their close relationship across the continents has never been tested. To gain further insight into lucanid phylogeny and biogeography, we reconstructed the first molecular phylogeny of world stag beetles using DNA sequences from mitochondrial 16S rDNA, nuclear 18S and 28S rDNA, and the nuclear protein-coding (NPC) gene wingless for 93 lucanid species representing all extant subfamilies and 24 out of the 27 tribes, together with 14 representative samples of other early branching scarabaeoid families and two staphyliniform beetle families as outgroups. Both Bayesian inference (BI) and maximum likelihood inference (MLI) strongly supported the monophyly of Lucanidae sensu lato that includes Diphyllostomatidae. Within Lucanidae sensu stricto, the subfamilies Lucaninae and Lampriminae appeared monophyletic under both methods of phylogenetic inferences; however, Aesalinae and Syndesinae were found to be polyphyletic. A time-calibrated phylogeny based on five fossil data estimated the origin of crown group Lucanidae as circa 160 million years ago (MYA). Divergence between the Neotropical and Australasian groups of the Chiasognathini was estimated to be circa 47MYA, with the South African Colophonini branching off from the ancient Chiasognathini lineage around 87MYA. Another Gondwanan relationship was recovered between the Australasian Eucarteria and the Neotropical Casignetus, which diverged circa 58MYA. 
Lastly, as Jeannel's hypothesis predicted, divergence within Lampriminae between the Australasian Lamprima and the Neotropical Streptocerus was estimated to be circa 37MYA. The splits of these lineages were generally concordant with the pattern of continental break-up of the super-continent Gondwana, and our biogeographic reconstructions based on the dispersal-extinction-cladogenesis model (DEC) corroborate our view that the divergences in these austral lineages were caused by vicariance events following the Gondwanan break-up. In addition, the phylogenetic position and geographic origin of the Hawaiian genus Apterocyclus were revealed for the first time. Overall, our results provide the framework toward studying lucanid relationships and divergence time estimates, which allowed for more accurate biogeographic explanations and discussions on ancestral lucanids and the evolutionary origin of the enlarged male mandibles.

  13. The phylogenetic relationships and molecular systematics of scincid lizards of the genus Heremites (Sauria, Scincidae) in the Middle East based on mtDNA sequences.

    PubMed

    Bahmani, Zahed; Rastegar-Pouyani, Eskandar; Rastegar-Pouyani, Nasrullah

    2017-09-08

    The taxonomic status of species included in the genus Heremites in Iran and Iraq is uncertain. Three of these species have been assigned to the genus based on morphology: Heremites auratus transcaucasica, H. vittatus, and H. septemtaeniatus. We examined the phylogenetic relationships and taxonomic status of the Iranian and Iraqi species of Heremites by performing phylogenetic analyses using mitochondrial DNA sequences (cytochrome b and 16S rRNA). Phylogenetic relationships and estimated genetic distances indicated that the Heremites populations of the area (Iran and Iraq) form five distinct clades. Three of these clades are found only in Iran, specifically in: (1) Fars and Hormozgan provinces; (2) Northeastern Khuzestan; and (3) Khorasan and Isfahan provinces. The fourth clade (H. septemtaeniatus) is found in western Iran and Mahshahr as well as in eastern and northern parts of Iraq. The fifth clade, Heremites vittatus, is found in Iran and Iraq. We also confirm the absence of H. auratus in Iran and Iraq. The analyses also indicated that H. vittatus is the sister taxon to the other groups and estimated the divergence of this clade in the Middle Miocene (15.9 Mya). The clade containing the Fars-Hormozgan and Khuzestan populations diverged at the end of the Miocene (8.5 Mya). The Isfahan and Khorasan populations separated in the Pliocene (4.2 Mya) from the western Iranian group, the group in Mahshahr, Iran and the groups in northern and eastern Iraq.

  14. Phylogenomics and Divergence Dating of Fungus-Farming Ants (Hymenoptera: Formicidae) of the Genera Sericomyrmex and Apterostigma.

    PubMed

    Ješovnik, Ana; González, Vanessa L; Schultz, Ted R

    2016-01-01

    Fungus-farming ("attine") ants are model systems for studies of symbiosis, coevolution, and advanced eusociality. A New World clade of nearly 300 species in 15 genera, all attine ants cultivate fungal symbionts for food. In order to better understand the evolution of ant agriculture, we sequenced, assembled, and analyzed transcriptomes of four different attine ant species in two genera: three species in the higher-attine genus Sericomyrmex and a single lower-attine ant species, Apterostigma megacephala, representing the first genomic data for either genus. These data were combined with published genomes of nine other ant species and the honey bee Apis mellifera for phylogenomic and divergence-dating analyses. The resulting phylogeny confirms relationships inferred in previous studies of fungus-farming ants. Divergence-dating analyses recovered slightly older dates than most prior analyses, estimating that attine ants originated 53.6-66.7 million years ago, and recovered a very long branch subtending a very recent, rapid radiation of the genus Sericomyrmex. This result is further confirmed by a separate analysis of the three Sericomyrmex species, which reveals that 92.71% of orthologs have 99% - 100% pairwise-identical nucleotide sequences. We searched the transcriptomes for genes of interest, most importantly argininosuccinate synthase and argininosuccinate lyase, which are functional in other ants but which are known to have been lost in seven previously studied attine ant species. Loss of the ability to produce the amino acid arginine has been hypothesized to contribute to the obligate dependence of attine ants upon their cultivated fungi, but the point in fungus-farming ant evolution at which these losses occurred has remained unknown. We did not find these genes in any of the sequenced transcriptomes. 
Although expected for Sericomyrmex species, the absence of arginine anabolic genes in the lower-attine ant Apterostigma megacephala strongly suggests that the loss coincided with the origin of attine ants.

  15. Meiotic drive impacts expression and evolution of X-linked genes in stalk-eyed flies.

    PubMed

    Reinhardt, Josephine A; Brand, Cara L; Paczolt, Kimberly A; Johns, Philip M; Baker, Richard H; Wilkinson, Gerald S

    2014-01-01

    Although sex chromosome meiotic drive has been observed in a variety of species for over 50 years, the genes causing drive are only known in a few cases, and none of these cases cause distorted sex-ratios in nature. In stalk-eyed flies (Teleopsis dalmanni), driving X chromosomes are commonly found at frequencies approaching 30% in the wild, but the genetic basis of drive has remained elusive due to reduced recombination between driving and non-driving X chromosomes. Here, we used RNAseq to identify transcripts that are differentially expressed between males carrying either a driving X (XSR) or a standard X chromosome (XST), and found hundreds of these, the majority of which are X-linked. Drive-associated transcripts show increased levels of sequence divergence (dN/dS) compared to a control set, and are predominantly expressed either in testes or in the gonads of both sexes. Finally, we confirmed that XSR and XST are highly divergent by estimating sequence differentiation between the RNAseq pools. We found that X-linked transcripts were often strongly differentiated (whereas most autosomal transcripts were not), supporting the presence of a relatively large region of recombination suppression on XSR presumably caused by one or more inversions. We have identified a group of genes that are good candidates for further study into the causes and consequences of sex-chromosome drive, and demonstrated that meiotic drive has had a profound effect on sequence evolution and gene expression of X-linked genes in this species.

  16. DNA Barcode Analysis of Thrips (Thysanoptera) Diversity in Pakistan Reveals Cryptic Species Complexes.

    PubMed

    Iftikhar, Romana; Ashfaq, Muhammad; Rasool, Akhtar; Hebert, Paul D N

    2016-01-01

    Although thrips are globally important crop pests and vectors of viral disease, species identifications are difficult because of their small size and inconspicuous morphological differences. Sequence variation in the mitochondrial COI-5' (DNA barcode) region has proven effective for the identification of species in many groups of insect pests. We analyzed barcode sequence variation among 471 thrips from various plant hosts in north-central Pakistan. The Barcode Index Number (BIN) system assigned these sequences to 55 BINs, while the Automatic Barcode Gap Discovery detected 56 partitions, a count that coincided with the number of monophyletic lineages recognized by Neighbor-Joining analysis and Bayesian inference. Congeneric species showed an average of 19% sequence divergence (range = 5.6% - 27%) at COI, while intraspecific distances averaged 0.6% (range = 0.0% - 7.6%). BIN analysis suggested that all intraspecific divergence >3.0% actually involved a species complex. In fact, sequences for three major pest species (Haplothrips reuteri, Thrips palmi, Thrips tabaci), and one predatory thrips (Aeolothrips intermedius) showed deep intraspecific divergences, providing evidence that each is a cryptic species complex. The study compiles the first barcode reference library for the thrips of Pakistan, and examines global haplotype diversity in four important pest thrips.

  17. Extensive concerted evolution of rice paralogs and the road to regaining independence.

    PubMed

    Wang, Xiyin; Tang, Haibao; Bowers, John E; Feltus, Frank A; Paterson, Andrew H

    2007-11-01

    Many genes duplicated by whole-genome duplications (WGDs) are more similar to one another than expected. We investigated whether concerted evolution through conversion and crossing over, well-known to affect tandem gene clusters, also affects dispersed paralogs. Genome sequences for two Oryza subspecies reveal appreciable gene conversion in the approximately 0.4 MY since their divergence, with a gradual progression toward independent evolution of older paralogs. Since divergence from subspecies indica, approximately 8% of japonica paralogs produced 5-7 MYA on chromosomes 11 and 12 have been affected by gene conversion and several reciprocal exchanges of chromosomal segments, while approximately 70-MY-old "paleologs" resulting from a genome duplication (GD) show much less conversion. Sequence similarity analysis in proximal gene clusters also suggests more conversion between younger paralogs. About 8% of paleologs may have been converted since rice-sorghum divergence approximately 41 MYA. Domain-encoding sequences are more frequently converted than nondomain sequences, suggesting a sort of circularity--that sequences conserved by selection may be further conserved by relatively frequent conversion. The higher level of concerted evolution in the 5-7 MY-old segmental duplication may reflect the behavior of many genomes within the first few million years after duplication or polyploidization.

  18. Comparative sequence analyses of sixteen reptilian paramyxoviruses

    USGS Publications Warehouse

    Ahne, W.; Batts, W.N.; Kurath, G.; Winton, J.R.

    1999-01-01

    Viral genomic RNA of Fer-de-Lance virus (FDLV), a paramyxovirus highly pathogenic for reptiles, was reverse transcribed and cloned. Plasmids with significant sequence similarities to the hemagglutinin-neuraminidase (HN) and polymerase (L) genes of mammalian paramyxoviruses were identified by BLAST search. Partial sequences of the FDLV genes were used to design primers for amplification by nested polymerase chain reaction (PCR) and sequencing of 518-bp L gene and 352-bp HN gene fragments from a collection of 15 previously uncharacterized reptilian paramyxoviruses. Phylogenetic analyses of the partial L and HN sequences produced similar trees in which there were two distinct subgroups of isolates that were supported with maximum bootstrap values, and several intermediate isolates. Within each subgroup the nucleotide divergence values were less than 2.5%, while the divergence between the two subgroups was 20-22%. This indicated that the two subgroups represent distinct virus species containing multiple virus strains. The five intermediate isolates had nucleotide divergence values of 11-20% and may represent additional distinct species. In addition to establishing diversity among reptilian paramyxoviruses, the phylogenetic groupings showed some correlation with geographic location, and clearly demonstrated a low level of host species-specificity within these viruses. Copyright (C) 1999 Elsevier Science B.V.

  19. srRNA evolution and phylogenetic relationships of the genus Naegleria (Protista: Rhizopoda).

    PubMed

    Baverstock, P R; Illana, S; Christy, P E; Robinson, B S; Johnson, A M

    1989-05-01

    A rapid RNA sequencing technique was used to partially sequence the small-subunit ribosomal RNA (srRNA) of four species of the amoeboid genus Naegleria. The extent of nucleotide sequence divergence between the two most divergent species was roughly similar to that found between mammals and frogs. However, the pattern of variation among the Naegleria species was quite different from that found for those species of tetrapods characterized to date. A phylogenetic analysis of the consensus Naegleria sequence showed that Naegleria was not monophyletic with either Acanthamoeba castellanii or Dictyostelium discoideum, two other amoebas for which sequences were available. It was shown that the semiconserved regions of the srRNA molecule evolve in a clocklike fashion and that the clock is time dependent rather than generation dependent.

  20. Rate variation and estimation of divergence times using strict and relaxed clocks.

    PubMed

    Brown, Richard P; Yang, Ziheng

    2011-09-26

    Understanding causes of biological diversity may be greatly enhanced by knowledge of divergence times. Strict and relaxed clock models are used in Bayesian estimation of divergence times. We examined whether: i) strict clock models are generally more appropriate in shallow phylogenies where rate variation is expected to be low, ii) the likelihood ratio test of the clock (LRT) reliably informs which model is appropriate for dating divergence times. Strict and relaxed models were used to analyse sequences simulated under different levels of rate variation. Published shallow phylogenies (Black bass, Primate-sucking lice, Podarcis lizards, Gallotiinae lizards, and Caprinae mammals) were also analysed to determine natural levels of rate variation relative to the performance of the different models. Strict clock analyses performed well on data simulated under the independent rates model when the standard deviation of log rate on branches, σ, was low (≤ 0.1), but were inappropriate when σ > 0.1 (95% of rates fall within 0.0082-0.0121 subs/site/Ma when σ = 0.1, for a mean rate of 0.01). The independent rates relaxed clock model performed well at all levels of rate variation, although posterior intervals on times were significantly wider than for the strict clock. The strict clock is therefore superior when rate variation is low. The performance of a correlated rates relaxed clock model was similar to the strict clock. Increased numbers of independent loci led to slightly narrower posteriors under the relaxed clock while older root ages provided proportionately narrower posteriors. The LRT had low power for σ = 0.01-0.1, but high power for σ = 0.5-2.0. Posterior means of σ² were useful for assessing rate variation in published datasets. Estimates of natural levels of rate variation ranged from 0.05-3.38 for different partitions. Differences in divergence times between relaxed and strict clock analyses were greater in two datasets with higher σ² for one or more partitions, supporting the simulation results. The strict clock can be superior for trees with shallow roots because of low levels of rate variation between branches. The LRT allows robust assessment of suitability of the clock model as does examination of posteriors on σ².
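
    The rate interval quoted in the abstract above (0.0082-0.0121 subs/site/Ma for σ = 0.1 and a mean rate of 0.01) can be reproduced with a short calculation. The sketch below is illustrative only: it assumes that branch rates are log-normally distributed, that 0.01 is the arithmetic mean of that distribution, and that σ is the standard deviation of the log rate. The function name lognormal_rate_interval is hypothetical and is not taken from the cited study.

```python
import math

def lognormal_rate_interval(mean_rate, sigma, z=1.959963984540054):
    """Central 95% interval of a log-normal rate with a given arithmetic mean.

    Assumption: rate ~ LogNormal(mu, sigma^2), with mu chosen so that the
    arithmetic mean exp(mu + sigma^2 / 2) equals mean_rate.
    """
    mu = math.log(mean_rate) - sigma ** 2 / 2.0
    # z is the 97.5% quantile of the standard normal distribution.
    return math.exp(mu - z * sigma), math.exp(mu + z * sigma)

low, high = lognormal_rate_interval(0.01, 0.1)
print(f"{low:.4f}-{high:.4f} subs/site/Ma")  # prints 0.0082-0.0121 subs/site/Ma
```

    Under these assumptions the interval matches the figures in the abstract to four decimal places. Calling the same function with a larger σ shows how quickly the spread of rates widens.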

  1. Allotetraploid origin and divergence in Eleusine (Chloridoideae, Poaceae): evidence from low-copy nuclear gene phylogenies and a plastid gene chronogram.

    PubMed

    Liu, Qing; Triplett, Jimmy K; Wen, Jun; Peterson, Paul M

    2011-11-01

    Eleusine (Poaceae) is a small genus of the subfamily Chloridoideae exhibiting considerable morphological and ecological diversity in East Africa and the Americas. The interspecific phylogenetic relationships of Eleusine are investigated in order to identify its allotetraploid origin, and a chronogram is estimated to infer temporal relationships between palaeoenvironment changes and divergence of Eleusine in East Africa. Two low-copy nuclear (LCN) markers, Pepc4 and EF-1α, were analysed using parsimony, likelihood and Bayesian approaches. A chronogram of Eleusine was inferred from a combined data set of six plastid DNA markers (ndhA intron, ndhF, rps16-trnK, rps16 intron, rps3, and rpl32-trnL) using the Bayesian dating method. The monophyly of Eleusine is strongly supported by sequence data from two LCN markers. In the cpDNA phylogeny, three tetraploid species (E. africana, E. coracana and E. kigeziensis) share a common ancestor with the E. indica-E. tristachya clade, which is considered a source of maternal parents for allotetraploids. Two homoeologous loci are isolated from three tetraploid species in the Pepc4 phylogeny, and the maternal parents receive further support. The A-type EF-1α sequences possess three characters, i.e. a large number of variations of intron 2; clade E-A distantly diverged from clade E-B and other diploid species; and seven deletions in intron 2, implying a possible derivation through a gene duplication event. The crown age of Eleusine and the allotetraploid lineage are 3·89 million years ago (mya) and 1·40 mya, respectively. The molecular data support independent allotetraploid origins for E. kigeziensis and the E. africana-E. coracana clade. Both events may have involved diploids E. indica and E. tristachya as the maternal parents, but the paternal parents remain unidentified. The habitat-specific hypothesis is proposed to explain the divergence of Eleusine and its allotetraploid lineage.

  2. Allotetraploid origin and divergence in Eleusine (Chloridoideae, Poaceae): evidence from low-copy nuclear gene phylogenies and a plastid gene chronogram

    PubMed Central

    Liu, Qing; Triplett, Jimmy K.; Wen, Jun; Peterson, Paul M.

    2011-01-01

    Background and Aims Eleusine (Poaceae) is a small genus of the subfamily Chloridoideae exhibiting considerable morphological and ecological diversity in East Africa and the Americas. The interspecific phylogenetic relationships of Eleusine are investigated in order to identify its allotetraploid origin, and a chronogram is estimated to infer temporal relationships between palaeoenvironment changes and divergence of Eleusine in East Africa. Methods Two low-copy nuclear (LCN) markers, Pepc4 and EF-1α, were analysed using parsimony, likelihood and Bayesian approaches. A chronogram of Eleusine was inferred from a combined data set of six plastid DNA markers (ndhA intron, ndhF, rps16-trnK, rps16 intron, rps3, and rpl32-trnL) using the Bayesian dating method. Key Results The monophyly of Eleusine is strongly supported by sequence data from two LCN markers. In the cpDNA phylogeny, three tetraploid species (E. africana, E. coracana and E. kigeziensis) share a common ancestor with the E. indica–E. tristachya clade, which is considered a source of maternal parents for allotetraploids. Two homoeologous loci are isolated from three tetraploid species in the Pepc4 phylogeny, and the maternal parents receive further support. The A-type EF-1α sequences possess three characters, i.e. a large number of variations of intron 2; clade E-A distantly diverged from clade E-B and other diploid species; and seven deletions in intron 2, implying a possible derivation through a gene duplication event. The crown age of Eleusine and the allotetraploid lineage are 3·89 million years ago (mya) and 1·40 mya, respectively. Conclusions The molecular data support independent allotetraploid origins for E. kigeziensis and the E. africana–E. coracana clade. Both events may have involved diploids E. indica and E. tristachya as the maternal parents, but the paternal parents remain unidentified. 
The habitat-specific hypothesis is proposed to explain the divergence of Eleusine and its allotetraploid lineage. PMID:21880659

  3. Identification of divergent protein domains by combining HMM-HMM comparisons and co-occurrence detection.

    PubMed

    Ghouila, Amel; Florent, Isabelle; Guerfali, Fatma Zahra; Terrapon, Nicolas; Laouini, Dhafer; Yahia, Sadok Ben; Gascuel, Olivier; Bréhélin, Laurent

    2014-01-01

    Identification of protein domains is a key step for understanding protein function. Hidden Markov Models (HMMs) have proved to be a powerful tool for this task. The Pfam database notably provides a large collection of HMMs which are widely used for the annotation of proteins in sequenced organisms. This is done via sequence/HMM comparisons. However, this approach may lack sensitivity when searching for domains in divergent species. Recently, methods for HMM/HMM comparisons have been proposed and proved to be more sensitive than sequence/HMM approaches in certain cases. However, these approaches are usually not used for protein domain discovery at a genome scale, and the benefit that could be expected from their utilization for this problem has not been investigated. Using proteins of P. falciparum and L. major as examples, we investigate the extent to which HMM/HMM comparisons can identify new domain occurrences not already identified by sequence/HMM approaches. We show that although HMM/HMM comparisons are much more sensitive than sequence/HMM comparisons, they are not sufficiently accurate to be used as a standalone complement of sequence/HMM approaches at the genome scale. Hence, we propose to use domain co-occurrence--the general domain tendency to preferentially appear along with some favorite domains in the proteins--to improve the accuracy of the approach. We show that the combination of HMM/HMM comparisons and co-occurrence domain detection boosts protein annotations. At an estimated False Discovery Rate of 5%, it revealed 901 and 1098 new domains in Plasmodium and Leishmania proteins, respectively. Manual inspection of part of these predictions shows that it contains several domain families that were missing in the two organisms. All new domain occurrences have been integrated in the EuPathDomains database, along with the GO annotations that can be deduced.

  4. Identification of Divergent Protein Domains by Combining HMM-HMM Comparisons and Co-Occurrence Detection

    PubMed Central

    Ghouila, Amel; Florent, Isabelle; Guerfali, Fatma Zahra; Terrapon, Nicolas; Laouini, Dhafer; Yahia, Sadok Ben; Gascuel, Olivier; Bréhélin, Laurent

    2014-01-01

    Identification of protein domains is a key step for understanding protein function. Hidden Markov Models (HMMs) have proved to be a powerful tool for this task. The Pfam database notably provides a large collection of HMMs which are widely used for the annotation of proteins in sequenced organisms. This is done via sequence/HMM comparisons. However, this approach may lack sensitivity when searching for domains in divergent species. Recently, methods for HMM/HMM comparisons have been proposed and proved to be more sensitive than sequence/HMM approaches in certain cases. However, these approaches are usually not used for protein domain discovery at a genome scale, and the benefit that could be expected from their utilization for this problem has not been investigated. Using proteins of P. falciparum and L. major as examples, we investigate the extent to which HMM/HMM comparisons can identify new domain occurrences not already identified by sequence/HMM approaches. We show that although HMM/HMM comparisons are much more sensitive than sequence/HMM comparisons, they are not sufficiently accurate to be used as a standalone complement of sequence/HMM approaches at the genome scale. Hence, we propose to use domain co-occurrence — the general domain tendency to preferentially appear along with some favorite domains in the proteins — to improve the accuracy of the approach. We show that the combination of HMM/HMM comparisons and co-occurrence domain detection boosts protein annotations. At an estimated False Discovery Rate of 5%, it revealed 901 and 1098 new domains in Plasmodium and Leishmania proteins, respectively. Manual inspection of part of these predictions shows that it contains several domain families that were missing in the two organisms. All new domain occurrences have been integrated in the EuPathDomains database, along with the GO annotations that can be deduced. PMID:24901648

  5. Integrative taxonomy of Metrichia Ross (Trichoptera: Hydroptilidae: Ochrotrichiinae) microcaddisflies from Brazil: descriptions of twenty new species

    PubMed Central

    Takiya, Daniela M.; Nessimian, Jorge L.

    2016-01-01

    Metrichia is assigned to the Ochrotrichiinae, a group of almost exclusively Neotropical microcaddisflies. Metrichia comprises over 100 described species and, despite its diversity, only one species has been described from Brazil so far. In this paper, we provide descriptions for 20 new species from 8 Brazilian states: M. acuminata sp. nov., M. azul sp. nov., M. bonita sp. nov., M. bracui sp. nov., M. caraca sp. nov., M. circuliforme sp. nov., M. curta sp. nov., M. farofa sp. nov., M. forceps sp. nov., M. formosinha sp. nov., M. goiana sp. nov., M. itabaiana sp. nov., M. longissima sp. nov., M. peluda sp. nov., M. rafaeli sp. nov., M. simples sp. nov., M. talhada sp. nov., M. tere sp. nov., M. ubajara sp. nov., and M. vulgaris sp. nov. DNA barcode sequences (577 bp of the mitochondrial gene COI) were generated for 13 of the new species and two previously known species of Metrichia resulting in 64 sequences. In addition, COI sequences were obtained for other genera of Ochrotrichiinae (Angrisanoia, Nothotrichia, Ochrotrichia, Ragatrichia, and Rhyacopsyche). DNA sequences and morphological data were integrated to evaluate species delimitations. K2P pairwise distances were calculated to generate a neighbor-joining tree. COI sequences also were submitted to ABGD and GMYC methods to assess ‘potential species’ delimitation. Analyses showed a conspicuous barcoding gap among Metrichia sequences (highest intraspecific divergence: 4.8%; lowest interspecific divergence: 12.6%). Molecular analyses also allowed the association of larvae and adults of Metrichia bonita sp. nov. from Mato Grosso do Sul, representing the first record of microcaddisfly larvae occurring in calcareous tufa (or travertine). ABGD results agreed with the morphological delimitation of Metrichia species, while GMYC estimated a slightly higher number of species, suggesting the division of two morphological species, each one into two potential species. 
Because this could be due to unbalanced sampling and the lack of morphological diagnostic characters, we have maintained these two species as undivided. PMID:27169001
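
    The K2P (Kimura two-parameter) pairwise distances mentioned in the abstract above follow a standard formula, d = -1/2 ln((1 - 2P - Q) sqrt(1 - 2Q)), where P and Q are the proportions of sites that differ by a transition and by a transversion. The following is a minimal sketch of that formula for two pre-aligned sequences. The function name k2p_distance and the choice to skip gaps and ambiguous bases are assumptions made for illustration, not the procedure used by the authors.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}

def k2p_distance(seq1, seq2):
    """Kimura two-parameter distance between two aligned DNA sequences."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    transitions = transversions = compared = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue  # assumption: skip gaps and ambiguous bases
        compared += 1
        if a == b:
            continue
        pair = {a, b}
        if pair <= PURINES or pair <= PYRIMIDINES:
            transitions += 1      # A<->G or C<->T
        else:
            transversions += 1    # purine <-> pyrimidine
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    # math.log raises ValueError if the sequences are too divergent
    # (saturated) for the correction to be defined.
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))

# One transition in ten sites: the corrected distance is slightly above 0.1.
print(k2p_distance("AAAAAAAAAA", "GAAAAAAAAA"))
```

    A barcoding gap of the kind reported above would then correspond to the largest within-species distance being smaller than the smallest between-species distance.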

  6. Similarity of Symbol Frequency Distributions with Heavy Tails

    NASA Astrophysics Data System (ADS)

    Gerlach, Martin; Font-Clos, Francesc; Altmann, Eduardo G.

    2016-04-01

    Quantifying the similarity between symbolic sequences is a traditional problem in information theory which requires comparing the frequencies of symbols in different sequences. In numerous modern applications, ranging from DNA over music to texts, the distribution of symbol frequencies is characterized by heavy-tailed distributions (e.g., Zipf's law). The large number of low-frequency symbols in these distributions poses major difficulties to the estimation of the similarity between sequences; e.g., they hinder an accurate finite-size estimation of entropies. Here, we show analytically how the systematic (bias) and statistical (fluctuations) errors in these estimations depend on the sample size N and on the exponent γ of the heavy-tailed distribution. Our results are valid for the Shannon entropy (α = 1), its corresponding similarity measures (e.g., the Jensen-Shannon divergence), and also for measures based on the generalized entropy of order α. For small α's, including α = 1, the errors decay slower than the 1/N decay observed in short-tailed distributions. For α larger than a critical value α* = 1 + 1/γ ≤ 2, the 1/N decay is recovered. We show the practical significance of our results by quantifying the evolution of the English language over the last two centuries using a complete α spectrum of measures. We find that frequent words change more slowly than less frequent words and that α = 2 provides the most robust measure to quantify language change.
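
    As a concrete illustration of the similarity measure named in the abstract above, the sketch below computes the plug-in (naive) Jensen-Shannon divergence between the symbol frequency distributions of two sequences, using the Shannon entropy in bits. It is a minimal example, not the authors' estimator: the function names are hypothetical, and no finite-size bias correction of the kind analysed in the paper is applied, so the value is subject to the bias the abstract describes when many symbols are rare.

```python
import math
from collections import Counter

def shannon_entropy(probs):
    """Shannon entropy in bits of a probability vector (zero terms skipped)."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

def jensen_shannon_divergence(seq_a, seq_b):
    """Plug-in Jensen-Shannon divergence between symbol frequencies."""
    counts_a, counts_b = Counter(seq_a), Counter(seq_b)
    n_a, n_b = sum(counts_a.values()), sum(counts_b.values())
    if n_a == 0 or n_b == 0:
        raise ValueError("both sequences must be non-empty")
    symbols = set(counts_a) | set(counts_b)
    p = [counts_a[s] / n_a for s in symbols]
    q = [counts_b[s] / n_b for s in symbols]
    m = [(x + y) / 2 for x, y in zip(p, q)]  # equal-weight mixture
    # JSD = H(mixture) - average of the two individual entropies.
    return shannon_entropy(m) - (shannon_entropy(p) + shannon_entropy(q)) / 2

text_a = "the cat sat on the mat".split()
text_b = "the dog sat on the log".split()
print(jensen_shannon_divergence(text_a, text_b))
```

    With base-2 logarithms the result lies between 0 (identical frequency distributions) and 1 (no shared symbols).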

  7. Next generation semiconductor based-sequencing of a nutrigenetics target gene (GPR120) and association with growth rate in Italian Large White pigs.

    PubMed

    Fontanesi, Luca; Bertolini, Francesca; Scotti, Emilio; Schiavo, Giuseppina; Colombo, Michela; Trevisi, Paolo; Ribani, Anisa; Buttazzoni, Luca; Russo, Vincenzo; Dall'Olio, Stefania

    2015-01-01

    The GPR120 gene (also known as FFAR4 or O3FAR1) encodes a functional omega-3 fatty acid receptor/sensor that mediates potent insulin sensitizing effects by repressing macrophage-induced tissue inflammation. Because of its functional role, GPR120 could be considered a potential target gene in animal nutrigenetics. In this work we resequenced the porcine GPR120 gene by high throughput Ion Torrent semiconductor sequencing of amplified fragments obtained from 8 DNA pools derived, on the whole, from 153 pigs of different breeds/populations (two Italian Large White pools, Italian Duroc, Italian Landrace, Casertana, Pietrain, Meishan, and wild boars). Three single nucleotide polymorphisms (SNPs), two synonymous substitutions and one in the putative 3'-untranslated region (g.114765469C > T), were identified and their allele frequencies were estimated from sequencing read counts. The g.114765469C > T SNP was also genotyped by PCR-RFLP, confirming the estimated frequency in Italian Large White pools. Then, this SNP was analyzed in two Italian Large White cohorts using a selective genotyping approach based on extreme and divergent pigs for back fat thickness (BFT) estimated breeding value (EBV) and average daily gain (ADG) EBV. A significant difference in the distribution of allele and genotype frequencies was observed between the extreme ADG-EBV groups (P < 0.001) whereas this marker was not associated with BFT-EBV.

  8. Genetic Divergence Disclosing a Rapid Prehistorical Dispersion of Native Americans in Central and South America

    PubMed Central

    He, Yungang; Wang, Wei R.; Li, Ran; Wang, Sijia; Jin, Li

    2012-01-01

    An accurate estimate of the divergence time between Native Americans is important for understanding the initial entry and early dispersion of human beings in the New World. Current methods for estimating the genetic divergence time of populations could seriously depart from a linear relationship with the true divergence for multiple populations of a different population size and significant population expansion. Here, to address this problem, we propose a novel measure to estimate the genetic divergence time of populations. Computer simulation revealed that the new measure maintained an excellent linear correlation with the population divergence time in complicated multi-population scenarios with population expansion. Utilizing the new measure and microsatellite data of 21 Native American populations, we investigated the genetic divergences of the Native American populations. The results indicated that genetic divergences between North American populations are greater than those between Central and South American populations. None of the divergences, however, were large enough to constitute convincing evidence supporting the two-wave or multi-wave migration model for the initial entry of human beings into America. The genetic affinity of the Native American populations was further explored using Neighbor-Net and the genetic divergences suggested that these populations could be categorized into four genetic groups living in four different ecologic zones. The divergence of the population groups suggests that the early dispersion of human beings in America was a multi-step procedure. Further, the divergences suggest the rapid dispersion of Native Americans in Central and South America after a long standstill period in North America. PMID:22970308

  9. Genetic divergence between populations of feral and domestic forms of a mosquito disease vector assessed by transcriptomics

    PubMed Central

    2015-01-01

    Culex pipiens, an invasive mosquito and vector of West Nile virus in the US, has two morphologically indistinguishable forms that differ dramatically in behavior and physiology. Cx. pipiens form pipiens is primarily a bird-feeding temperate mosquito, while the sub-tropical Cx. pipiens form molestus thrives in sewers and feeds on mammals. Because the feral form can diapause during the cold winters but the domestic form cannot, the two Cx. pipiens forms are allopatric in northern Europe and, although viable, hybrids are rare. Cx. pipiens form molestus has spread across all inhabited continents and hybrids of the two forms are common in the US. Here we elucidate the genes and gene families with the greatest divergence rates between these phenotypically diverged mosquito populations, and discuss them in light of their potential biological and ecological effects. After generating and assembling novel transcriptome data for each population, we performed pairwise tests for nonsynonymous divergence (Ka) of homologous coding sequences and examined gene ontology terms that were statistically over-represented in those sequences with the greatest divergence rates. We identified genes involved in digestion (serine endopeptidases), innate immunity (fibrinogens and α-macroglobulins), hemostasis (D7 salivary proteins), olfaction (odorant binding proteins) and chitin binding (peritrophic matrix proteins). By examining molecular divergence between closely related yet phenotypically divergent forms of the same species, our results provide insights into the identity of rapidly-evolving genes between incipient species. Additionally, we found that families of signal transducers, ATP synthases and transcription regulators remained identical at the amino acid level, thus constituting conserved components of the Cx. pipiens proteome. 
We provide a reference with which to gauge the divergence reported in this analysis by performing a comparison of transcriptome sequences from conspecific (yet allopatric) populations of another member of the Cx. pipiens complex, Cx. quinquefasciatus. PMID:25755934

  10. [Hepatitis C virus: sequence homology of a European isolate and divergence from the prototype].

    PubMed

    Seelig, R; Seelig, H P; Renz, M

    1991-08-01

    The polymerase chain reaction (PCR) detected specific hepatitis C viral (HCV) RNA sequences in liver biopsies from two patients with chronic hepatitis, in the tissue of a liver implant, in plasma from four chronic non-A, non-B hepatitis (NANBH) patients and, for the first time, in an infectious anti-D-immunoglobulin preparation. A comparison of the viral sequences coding for a region of the nonstructural NS3 protein from the liver tissues revealed only a very small degree of sequence divergence at the cDNA as well as at the amino acid level (between 0 and 5%). The sequence similarities of the RNA isolated from plasma of the four chronic NANBH patients and the anti-D-immunoglobulin preparation were somewhat lower in part but still high overall (between 90 and 100%). In contrast, all eight cDNA and amino acid sequences exhibited a significantly higher degree of divergence in comparison with the HCV prototype sequence (between 29 and 32%) than among themselves (between 0 and 10%). This unexpectedly high sequence similarity of the eight European isolates and their low homology to the North American prototype sequence is indicative of the existence of different types of HCV. This will be important not only for epidemiological studies but also for the development of effective diagnostic procedures and vaccines. Concerning the pathogenesis of NANBH, a double infection or a helper mechanism has to be considered: in addition to the C virus, sequences of another virus particle were found in the infectious IgG preparation as well as in the liver biopsies.

  11. Population genomics of parallel hybrid zones in the mimetic butterflies, H. melpomene and H. erato

    PubMed Central

    Ruiz, Mayté; Salazar, Patricio; Counterman, Brian; Medina, Jose Alejandro; Ortiz-Zuazaga, Humberto; Morrison, Anna; Papa, Riccardo

    2014-01-01

    Hybrid zones can be valuable tools for studying evolution and identifying genomic regions responsible for adaptive divergence and underlying phenotypic variation. Hybrid zones between subspecies of Heliconius butterflies can be very narrow and are maintained by strong selection acting on color pattern. The comimetic species, H. erato and H. melpomene, have parallel hybrid zones in which both species undergo a change from one color pattern form to another. We use restriction-associated DNA sequencing to obtain several thousand genome-wide sequence markers and use these to analyze patterns of population divergence across two pairs of parallel hybrid zones in Peru and Ecuador. We compare two approaches for analysis of this type of data—alignment to a reference genome and de novo assembly—and find that alignment gives the best results for species both closely (H. melpomene) and distantly (H. erato, ∼15% divergent) related to the reference sequence. Our results confirm that the color pattern controlling loci account for the majority of divergent regions across the genome, but we also detect other divergent regions apparently unlinked to color pattern differences. We also use association mapping to identify previously unmapped color pattern loci, in particular the Ro locus. Finally, we identify a new cryptic population of H. timareta in Ecuador, which occurs at relatively low altitude and is mimetic with H. melpomene malleti. PMID:24823669

  12. Chloroplast Genome Evolution in Early Diverged Leptosporangiate Ferns

    PubMed Central

    Kim, Hyoung Tae; Chung, Myong Gi; Kim, Ki-Joong

    2014-01-01

    In this study, the chloroplast (cp) genome sequences from three early diverged leptosporangiate ferns were completed and analyzed in order to understand the evolution of the genome of the fern lineages. The complete cp genome sequence of Osmunda cinnamomea (Osmundales) was 142,812 base pairs (bp). The cp genome structure was similar to that of eusporangiate ferns. The gene/intron losses that frequently occurred in the cp genome of leptosporangiate ferns were not found in the cp genome of O. cinnamomea. In addition, putative RNA editing sites in the cp genome were rare in O. cinnamomea, even though the sites were frequently predicted to be present in leptosporangiate ferns. The complete cp genome sequence of Diplopterygium glaucum (Gleicheniales) was 151,007 bp and has a 9.7 kb inversion between the trnL-CAA and trnV-GCA genes when compared to O. cinnamomea. Several repeated sequences were detected around the inversion break points. The complete cp genome sequence of Lygodium japonicum (Schizaeales) was 157,142 bp and a deletion of the rpoC1 intron was detected. This intron loss was shared by all of the studied species of the genus Lygodium. The GC contents and the effective numbers of codons (ENCs) in ferns varied significantly when compared to seed plants. The ENC values of the early diverged leptosporangiate ferns showed intermediate levels between eusporangiate and core leptosporangiate ferns. However, our phylogenetic tree based on all of the cp gene sequences clearly indicated that the cp genome similarity between O. cinnamomea (Osmundales) and eusporangiate ferns are symplesiomorphies, rather than synapomorphies. Therefore, our data is in agreement with the view that Osmundales is a distinct early diverged lineage in the leptosporangiate ferns. PMID:24823358

  13. Chloroplast genome evolution in early diverged leptosporangiate ferns.

    PubMed

    Kim, Hyoung Tae; Chung, Myong Gi; Kim, Ki-Joong

    2014-05-01

    In this study, the chloroplast (cp) genome sequences from three early diverged leptosporangiate ferns were completed and analyzed in order to understand the evolution of the genome of the fern lineages. The complete cp genome sequence of Osmunda cinnamomea (Osmundales) was 142,812 base pairs (bp). The cp genome structure was similar to that of eusporangiate ferns. The gene/intron losses that frequently occurred in the cp genome of leptosporangiate ferns were not found in the cp genome of O. cinnamomea. In addition, putative RNA editing sites in the cp genome were rare in O. cinnamomea, even though the sites were frequently predicted to be present in leptosporangiate ferns. The complete cp genome sequence of Diplopterygium glaucum (Gleicheniales) was 151,007 bp and has a 9.7 kb inversion between the trnL-CAA and trnV-GCA genes when compared to O. cinnamomea. Several repeated sequences were detected around the inversion break points. The complete cp genome sequence of Lygodium japonicum (Schizaeales) was 157,142 bp and a deletion of the rpoC1 intron was detected. This intron loss was shared by all of the studied species of the genus Lygodium. The GC contents and the effective numbers of codons (ENCs) in ferns varied significantly when compared to seed plants. The ENC values of the early diverged leptosporangiate ferns showed intermediate levels between eusporangiate and core leptosporangiate ferns. However, our phylogenetic tree based on all of the cp gene sequences clearly indicated that the cp genome similarity between O. cinnamomea (Osmundales) and eusporangiate ferns are symplesiomorphies, rather than synapomorphies. Therefore, our data is in agreement with the view that Osmundales is a distinct early diverged lineage in the leptosporangiate ferns.

  14. Two divergent endo-beta-1,4-glucanase genes exhibit overlapping expression in ripening fruit and abscising flowers.

    PubMed Central

    Lashbrook, C C; Gonzalez-Bosch, C; Bennett, A B

    1994-01-01

    Two structurally divergent endo-beta-1,4-glucanase (EGase) cDNAs were cloned from tomato. Although both cDNAs (Cel1 and Cel2) encode potentially glycosylated, basic proteins of 51 to 53 kD and possess multiple amino acid domains conserved in both plant and microbial EGases, Cel1 and Cel2 exhibit only 50% amino acid identity at the overall sequence level. Amino acid sequence comparisons to other plant EGases indicate that tomato Cel1 is most similar to bean abscission zone EGase (68%), whereas Cel2 exhibits greatest sequence identity to avocado fruit EGase (57%). Sequence comparisons suggest the presence of at least two structurally divergent EGase families in plants. Unlike ripening avocado fruit and bean abscission zones in which a single EGase mRNA predominates, EGase expression in tomato reflects the overlapping accumulation of both Cel1 and Cel2 transcripts in ripening fruit and in plant organs undergoing cell separation. Cel1 mRNA contributes significantly to total EGase mRNA accumulation within plant organs undergoing cell separation (abscission zones and mature anthers), whereas Cel2 mRNA is most abundant in ripening fruit. The overlapping expression of divergent EGase genes within a single species may suggest that multiple activities are required for the cooperative disassembly of cell wall components during fruit ripening, floral abscission, and anther dehiscence. PMID:7994180

  15. A DNA Barcode Library for North American Ephemeroptera: Progress and Prospects

    PubMed Central

    Webb, Jeffrey M.; Jacobus, Luke M.; Funk, David H.; Zhou, Xin; Kondratieff, Boris; Geraci, Christy J.; DeWalt, R. Edward; Baird, Donald J.; Richard, Barton; Phillips, Iain; Hebert, Paul D. N.

    2012-01-01

    DNA barcoding of aquatic macroinvertebrates holds much promise as a tool for taxonomic research and for providing the reliable identifications needed for water quality assessment programs. A prerequisite for identification using barcodes is a reliable reference library. We gathered 4165 sequences from the barcode region of the mitochondrial cytochrome c oxidase subunit I gene representing 264 nominal and 90 provisional species of mayflies (Insecta: Ephemeroptera) from Canada, Mexico, and the United States. No species shared barcode sequences and all can be identified with barcodes with the possible exception of some Caenis. Minimum interspecific distances ranged from 0.3–24.7% (mean: 12.5%), while the average intraspecific divergence was 1.97%. The latter value was inflated by the presence of very high divergences in some taxa. In fact, nearly 20% of the species included two or three haplotype clusters showing greater than 5.0% sequence divergence and some values are as high as 26.7%. Many of the species with high divergences are polyphyletic and likely represent species complexes. Indeed, many of these polyphyletic species have numerous synonyms and individuals in some barcode clusters show morphological attributes characteristic of the synonymized species. In light of our findings, it is imperative that type or topotype specimens be sequenced to correctly associate barcode clusters with morphological species concepts and to determine the status of currently synonymized species. PMID:22666447

  16. Genetic divergence between freshwater and marine morphs of alewife (Alosa pseudoharengus): a 'next-generation' sequencing analysis.

    PubMed

    Czesny, Sergiusz; Epifanio, John; Michalak, Pawel

    2012-01-01

    Alewife Alosa pseudoharengus, a small clupeid fish native to Atlantic Ocean, has recently (∼150 years ago) invaded the North American Great Lakes and despite challenges of freshwater environment its populations exploded and disrupted local food web structures. This range expansion has been accompanied by dramatic changes at all levels of organization. Growth rates, size at maturation, or fecundity are only a few of the most distinct morphological and life history traits that contrast the two alewife morphs. A question arises to what extent these rapidly evolving differences between marine and freshwater varieties result from regulatory (including phenotypic plasticity) or structural mutations. To gain insights into expression changes and sequence divergence between marine and freshwater alewives, we sequenced transcriptomes of individuals from Lake Michigan and Atlantic Ocean. Population specific single nucleotide polymorphisms were rare but interestingly occurred in sequences of genes that also tended to show large differences in expression. Our results show that the striking phenotypic divergence between anadromous and lake alewives can be attributed to massive regulatory modifications rather than coding changes.

  17. Genetic Divergence between Freshwater and Marine Morphs of Alewife (Alosa pseudoharengus): A ‘Next-Generation’ Sequencing Analysis

    PubMed Central

    Czesny, Sergiusz; Epifanio, John; Michalak, Pawel

    2012-01-01

    Alewife Alosa pseudoharengus, a small clupeid fish native to Atlantic Ocean, has recently (∼150 years ago) invaded the North American Great Lakes and despite challenges of freshwater environment its populations exploded and disrupted local food web structures. This range expansion has been accompanied by dramatic changes at all levels of organization. Growth rates, size at maturation, or fecundity are only a few of the most distinct morphological and life history traits that contrast the two alewife morphs. A question arises to what extent these rapidly evolving differences between marine and freshwater varieties result from regulatory (including phenotypic plasticity) or structural mutations. To gain insights into expression changes and sequence divergence between marine and freshwater alewives, we sequenced transcriptomes of individuals from Lake Michigan and Atlantic Ocean. Population specific single nucleotide polymorphisms were rare but interestingly occurred in sequences of genes that also tended to show large differences in expression. Our results show that the striking phenotypic divergence between anadromous and lake alewives can be attributed to massive regulatory modifications rather than coding changes. PMID:22438868

  18. Convergent evolution of Hawaiian and Australo-Pacific honeyeaters from distant songbird ancestors.

    PubMed

    Fleischer, Robert C; James, Helen F; Olson, Storrs L

    2008-12-23

    The Hawaiian "honeyeaters," five endemic species of recently extinct, nectar-feeding songbirds in the genera Moho and Chaetoptila, looked and acted like Australasian honeyeaters (Meliphagidae), and no taxonomist since their discovery on James Cook's third voyage has classified them as anything else. We obtained DNA sequences from museum specimens of Moho and Chaetoptila collected in Hawaii 115-158 years ago. Phylogenetic analysis of these sequences supports monophyly of the two Hawaiian genera but, surprisingly, reveals that neither taxon is a meliphagid honeyeater, nor even in the same part of the songbird radiation as meliphagids. Instead, the Hawaiian species are divergent members of a passeridan group that includes deceptively dissimilar families of songbirds (Holarctic waxwings, neotropical silky flycatchers, and palm chats). Here we designate them as a new family, the Mohoidae. A nuclear-DNA rate calibration suggests that mohoids diverged from their closest living ancestor 14-17 mya, coincident with the estimated earliest arrival in Hawaii of a bird-pollinated plant lineage. Convergent evolution, the evolution of similar traits in distantly related taxa because of common selective pressures, is illustrated well by nectar-feeding birds, but the morphological, behavioral, and ecological similarity of the mohoids to the Australasian honeyeaters makes them a particularly striking example of the phenomenon.

  19. Insights into the genome evolution of Yersinia pestis through whole genome comparison with Yersinia pseudotuberculosis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Souza, B; Stoutland, P; Derbise, A

    2004-01-24

    Yersinia pestis, the causative agent of plague, is a highly uniform clone that diverged recently from the enteric pathogen Yersinia pseudotuberculosis. Despite their close genetic relationship, they differ radically in their pathogenicity and transmission. Here we report the complete genomic sequence of Y. pseudotuberculosis IP32953 and its use for detailed genome comparisons to available Y. pestis sequences. Analyses of identified differences across a panel of Yersinia isolates from around the world reveal 32 Y. pestis chromosomal genes that, together with the two Y. pestis-specific plasmids, represent the only new genetic material in Y. pestis acquired since the divergence from Y. pseudotuberculosis. In contrast, 149 new pseudogenes (doubling the previous estimate) and 317 genes absent from Y. pestis were detected, indicating that as many as 13% of Y. pseudotuberculosis genes no longer function in Y. pestis. Extensive IS-mediated genome rearrangements and reductive evolution through massive gene loss, resulting in elimination and modification of pre-existing gene expression pathways, appear to be more important than acquisition of new genes in the evolution of Y. pestis. These results provide a sobering example of how a highly virulent epidemic clone can suddenly emerge from a less virulent, closely related progenitor.

  20. Global diversity and oceanic divergence of humpback whales (Megaptera novaeangliae).

    PubMed

    Jackson, Jennifer A; Steel, Debbie J; Beerli, P; Congdon, Bradley C; Olavarría, Carlos; Leslie, Matthew S; Pomilla, Cristina; Rosenbaum, Howard; Baker, C Scott

    2014-07-07

    Humpback whales (Megaptera novaeangliae) annually undertake the longest migrations between seasonal feeding and breeding grounds of any mammal. Despite this dispersal potential, discontinuous seasonal distributions and migratory patterns suggest that humpbacks form discrete regional populations within each ocean. To better understand the worldwide population history of humpbacks, and the interplay of this species with the oceanic environment through geological time, we assembled mitochondrial DNA control region sequences representing approximately 2700 individuals (465 bp, 219 haplotypes) and eight nuclear intronic sequences representing approximately 70 individuals (3700 bp, 140 alleles) from the North Pacific, North Atlantic and Southern Hemisphere. Bayesian divergence time reconstructions date the origin of humpback mtDNA lineages to the Pleistocene (880 ka, 95% posterior intervals 550-1320 ka) and estimate radiation of current Northern Hemisphere lineages between 50 and 200 ka, indicating colonization of the northern oceans prior to the Last Glacial Maximum. Coalescent analyses reveal restricted gene flow between ocean basins, with long-term migration rates (individual migrants per generation) of less than 3.3 for mtDNA and less than 2 for nuclear genomic DNA. Genetic evidence suggests that humpbacks in the North Pacific, North Atlantic and Southern Hemisphere are on independent evolutionary trajectories, supporting taxonomic revision of M. novaeangliae to three subspecies. © 2014 The Author(s) Published by the Royal Society. All rights reserved.

  1. Global diversity and oceanic divergence of humpback whales (Megaptera novaeangliae)

    PubMed Central

    Jackson, Jennifer A.; Steel, Debbie J.; Beerli, P.; Congdon, Bradley C.; Olavarría, Carlos; Leslie, Matthew S.; Pomilla, Cristina; Rosenbaum, Howard; Baker, C. Scott

    2014-01-01

    Humpback whales (Megaptera novaeangliae) annually undertake the longest migrations between seasonal feeding and breeding grounds of any mammal. Despite this dispersal potential, discontinuous seasonal distributions and migratory patterns suggest that humpbacks form discrete regional populations within each ocean. To better understand the worldwide population history of humpbacks, and the interplay of this species with the oceanic environment through geological time, we assembled mitochondrial DNA control region sequences representing approximately 2700 individuals (465 bp, 219 haplotypes) and eight nuclear intronic sequences representing approximately 70 individuals (3700 bp, 140 alleles) from the North Pacific, North Atlantic and Southern Hemisphere. Bayesian divergence time reconstructions date the origin of humpback mtDNA lineages to the Pleistocene (880 ka, 95% posterior intervals 550–1320 ka) and estimate radiation of current Northern Hemisphere lineages between 50 and 200 ka, indicating colonization of the northern oceans prior to the Last Glacial Maximum. Coalescent analyses reveal restricted gene flow between ocean basins, with long-term migration rates (individual migrants per generation) of less than 3.3 for mtDNA and less than 2 for nuclear genomic DNA. Genetic evidence suggests that humpbacks in the North Pacific, North Atlantic and Southern Hemisphere are on independent evolutionary trajectories, supporting taxonomic revision of M. novaeangliae to three subspecies. PMID:24850919

  2. Sparse Reconstruction of Electric Fields from Radial Magnetic Data

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yeates, Anthony R.

    2017-02-10

    Accurate estimates of the horizontal electric field on the Sun’s visible surface are important not only for estimating the Poynting flux of magnetic energy into the corona but also for driving time-dependent magnetohydrodynamic models of the corona. In this paper, a method is developed for estimating the horizontal electric field from a sequence of radial-component magnetic field maps. This problem of inverting Faraday’s law has no unique solution. Unfortunately, the simplest solution (a divergence-free electric field) is not realistically localized in regions of nonzero magnetic field, as would be expected from Ohm’s law. Our new method generates instead a localized solution, using a basis pursuit algorithm to find a sparse solution for the electric field. The method is shown to perform well on test cases where the input magnetic maps are flux balanced in both Cartesian and spherical geometries. However, we show that if the input maps have a significant imbalance of flux (usually arising from data assimilation) then it is not possible to find a localized, realistic, electric field solution. This is the main obstacle to driving coronal models from time sequences of solar surface magnetic maps.

  3. Hidden genetic history of the Japanese sand dollar Peronella (Echinoidea: Laganidae) revealed by nuclear intron sequences.

    PubMed

    Endo, Megumi; Hirose, Mamiko; Honda, Masanao; Koga, Hiroyuki; Morino, Yoshiaki; Kiyomoto, Masato; Wada, Hiroshi

    2018-06-15

    The marine environment around Japan experienced significant changes during the Cenozoic Era. In this study, we report findings suggesting that this dynamic history left behind traces in the genome of the Japanese sand dollar species Peronella japonica and P. rubra. Although mitochondrial Cytochrome C Oxidase I sequences did not indicate fragmentation of the current local populations of P. japonica around Japan, two different types of intron sequence were found in the Alx1 locus. We inferred that past fragmentation of the populations accounts for the presence of two types of nuclear sequences as alleles in the Alx1 intron of P. japonica. It is likely that the split populations have intermixed in recent times; hence, we did not detect polymorphisms in the sequences reflecting the current localization of the species. In addition, we found two allelic sequences of the Alx1 intron in the sister species P. rubra. The divergence times of the two types of Alx1 intron sequences were estimated at approximately 14.9 and 4.0 million years ago for P. japonica and P. rubra, respectively. Our study indicates that information from the intron sequences of nuclear genes can enhance our understanding of past genetic events in organisms. Copyright © 2018 Elsevier B.V. All rights reserved.

  4. Comparative sequence analysis of Mycobacterium leprae and the new leprosy-causing Mycobacterium lepromatosis.

    PubMed

    Han, Xiang Y; Sizer, Kurt C; Thompson, Erika J; Kabanja, Juma; Li, Jun; Hu, Peter; Gómez-Valero, Laura; Silva, Francisco J

    2009-10-01

    Mycobacterium lepromatosis is a newly discovered leprosy-causing organism. Preliminary phylogenetic analysis of its 16S rRNA gene and a few other gene segments revealed significant divergence from Mycobacterium leprae, a well-known cause of leprosy, that justifies the status of M. lepromatosis as a new species. In this study we analyzed the sequences of 20 genes and pseudogenes (22,814 nucleotides). Overall, the level of matching of these sequences with M. leprae sequences was 90.9%, which substantiated the species-level difference; the levels of matching for the 16S rRNA genes and 14 protein-encoding genes were 98.0% and 93.1%, respectively, but the level of matching for five pseudogenes was only 79.1%. Five conserved protein-encoding genes were selected to construct phylogenetic trees and to calculate the numbers of synonymous substitutions (dS values) and nonsynonymous substitutions (dN values) in the two species. Robust phylogenetic trees constructed using concatenated alignment of these genes placed M. lepromatosis and M. leprae in a tight cluster with long terminal branches, implying that the divergence occurred long ago. The dS and dN values were also much higher than those for other closest pairs of mycobacteria. The dS values were 14 to 28% of the dS values for M. leprae and Mycobacterium tuberculosis, a more divergent pair of species. These results thus indicate that M. lepromatosis and M. leprae diverged approximately 10 million years ago. The M. lepromatosis pseudogenes analyzed that were also pseudogenes in M. leprae showed nearly neutral evolution, and their relative ages were similar to those of M. leprae pseudogenes, suggesting that they were pseudogenes before divergence. Taken together, the results described above indicate that M. lepromatosis and M. leprae diverged from a common ancestor after the massive gene inactivation event described previously for M. leprae.

  5. [Molecular evolution of the tick-borne encephalitis and Powassan viruses].

    PubMed

    Subbotina, E L; Loktev, V B

    2012-01-01

    The problem of emerging viruses, their genetic diversity and viral evolution in nature is attracting increasing attention. Phylogenetic analysis and evolutionary rate estimation were performed for the pathogenic flaviviruses tick-borne encephalitis virus (TBEV) and Powassan virus (PV) circulating in natural foci in Russia. Forty-seven nucleotide sequences encoding protein E of TBEV and 17 sequences of the NS5 genome region of PV were used. The rate of accumulation of nucleotide substitutions was approximately 1.4 × 10^-4 substitutions per site per year for the E genome region of TBEV and 5.4 × 10^-5 substitutions per site per year for the NS5 genome region of PV. The ratio of non-synonymous to synonymous nucleotide substitutions (dN/dS) was estimated at 0.049 for TBEV and 0.098 for PV. The maximum dN/dS value was 0.201-0.220 for a sub-cluster of Russian and Canadian strains of PV, and the minimum was 0.024 for a cluster of Russian and Chinese strains of the Far Eastern genotype of TBEV. Evaluation of the time intervals of evolutionary events associated with these viruses showed that the European subtype of TBEV diverged from the all-TBEV ancestor approximately 2750 years ago and that the Siberian and Far Eastern subtypes emerged about 2250 years ago. PV was introduced into natural foci of the Primorsky Krai of Russia only about 70 years ago and is very close to Canadian strains of PV. The evolutionary picture for PV in North America is similar to the evolution of the Siberian and Far Eastern subtypes of TBEV in Asia. The divergence times for the main genetic groups of TBEV and PV correlate with historical periods of warming and cooling. This allows us to propose the hypothesis that climate changes were essential to the evolution of the flaviviruses over the past millennia.

  6. Genetic signatures of adaptation revealed from transcriptome sequencing of Arctic and red foxes.

    PubMed

    Kumar, Vikas; Kutschera, Verena E; Nilsson, Maria A; Janke, Axel

    2015-08-07

    The genus Vulpes (true foxes) comprises numerous species that inhabit a wide range of habitats and climatic conditions, including one species, the Arctic fox (Vulpes lagopus) which is adapted to the arctic region. A close relative to the Arctic fox, the red fox (Vulpes vulpes), occurs in subarctic to subtropical habitats. To study the genetic basis of their adaptations to different environments, transcriptome sequences from two Arctic foxes and one red fox individual were generated and analyzed for signatures of positive selection. In addition, the data allowed for a phylogenetic analysis and divergence time estimate between the two fox species. The de novo assembly of reads resulted in more than 160,000 contigs/transcripts per individual. Approximately 17,000 homologous genes were identified using human and the non-redundant databases. Positive selection analyses revealed several genes involved in various metabolic and molecular processes such as energy metabolism, cardiac gene regulation, apoptosis and blood coagulation to be under positive selection in foxes. Branch site tests identified four genes to be under positive selection in the Arctic fox transcriptome, two of which are fat metabolism genes. In the red fox transcriptome eight genes are under positive selection, including molecular process genes, notably genes involved in ATP metabolism. Analysis of the three transcriptomes and five Sanger re-sequenced genes in additional individuals identified a lower genetic variability within Arctic foxes compared to red foxes, which is consistent with distribution range differences and demographic responses to past climatic fluctuations. A phylogenomic analysis estimated that the Arctic and red fox lineages diverged about three million years ago. Transcriptome data are an economic way to generate genomic resources for evolutionary studies. Despite not representing an entire genome, this transcriptome analysis identified numerous genes that are relevant to arctic adaptation in foxes. Fat metabolism seems to play a central role in adaptation of Arctic foxes to the cold climate, as has been identified in the polar bear, another arctic specialist.

  7. Correlation of fitness landscapes from three orthologous TIM barrels originates from sequence and structure constraints

    PubMed Central

    Chan, Yvonne H.; Venev, Sergey V.; Zeldovich, Konstantin B.; Matthews, C. Robert

    2017-01-01

    Sequence divergence of orthologous proteins enables adaptation to environmental stresses and promotes evolution of novel functions. Limits on evolution imposed by constraints on sequence and structure were explored using a model TIM barrel protein, indole-3-glycerol phosphate synthase (IGPS). Fitness effects of point mutations in three phylogenetically divergent IGPS proteins during adaptation to temperature stress were probed by auxotrophic complementation of yeast with prokaryotic, thermophilic IGPS. Analysis of beneficial mutations pointed to an unexpected, long-range allosteric pathway towards the active site of the protein. Significant correlations between the fitness landscapes of distant orthologues implicate both sequence and structure as primary forces in defining the TIM barrel fitness landscape and suggest that fitness landscapes can be translocated in sequence space. Exploration of fitness landscapes in the context of a protein fold provides a strategy for elucidating the sequence-structure-fitness relationships in other common motifs. PMID:28262665

  8. Bayesian relaxed clock estimation of divergence times in foraminifera.

    PubMed

    Groussin, Mathieu; Pawlowski, Jan; Yang, Ziheng

    2011-10-01

    Accurate and precise estimation of divergence times during the Neo-Proterozoic is necessary to understand the speciation dynamic of early Eukaryotes. However such deep divergences are difficult to date, as the molecular clock is seriously violated. Recent improvements in Bayesian molecular dating techniques allow the relaxation of the molecular clock hypothesis as well as incorporation of multiple and flexible fossil calibrations. Divergence times can then be estimated even when the evolutionary rate varies among lineages and even when the fossil calibrations involve substantial uncertainties. In this paper, we used a Bayesian method to estimate divergence times in Foraminifera, a group of unicellular eukaryotes, known for their excellent fossil record but also for the high evolutionary rates of their genomes. Based on multigene data we reconstructed the phylogeny of Foraminifera and dated their origin and the major radiation events. Our estimates suggest that Foraminifera emerged during the Cryogenian (650-920 Ma, Neo-Proterozoic), with a mean time around 770 Ma, about 220 Myr before the first appearance of reliable foraminiferal fossils in sediments (545 Ma). Most dates are in agreement with the fossil record, but in general our results suggest earlier origins of foraminiferal orders. We found that the posterior time estimates were robust to specifications of the prior. Our results highlight inter-species variations of evolutionary rates in Foraminifera. Their effect was partially overcome by using the partitioned Bayesian analysis to accommodate rate heterogeneity among data partitions and using the relaxed molecular clock to account for changing evolutionary rates. However, more coding genes appear necessary to obtain more precise estimates of divergence times and to resolve the conflicts between fossil and molecular date estimates. Copyright © 2011 Elsevier Inc. All rights reserved.

  9. Generalization of Entropy Based Divergence Measures for Symbolic Sequence Analysis

    PubMed Central

    Ré, Miguel A.; Azad, Rajeev K.

    2014-01-01

    Entropy based measures have been frequently used in symbolic sequence analysis. A symmetrized and smoothed form of Kullback-Leibler divergence or relative entropy, the Jensen-Shannon divergence (JSD), is of particular interest because of its sharing properties with families of other divergence measures and its interpretability in different domains including statistical physics, information theory and mathematical statistics. The uniqueness and versatility of this measure arise because of a number of attributes including generalization to any number of probability distributions and association of weights to the distributions. Furthermore, its entropic formulation allows its generalization in different statistical frameworks, such as, non-extensive Tsallis statistics and higher order Markovian statistics. We revisit these generalizations and propose a new generalization of JSD in the integrated Tsallis and Markovian statistical framework. We show that this generalization can be interpreted in terms of mutual information. We also investigate the performance of different JSD generalizations in deconstructing chimeric DNA sequences assembled from bacterial genomes including that of E. coli, S. enterica typhi, Y. pestis and H. influenzae. Our results show that the JSD generalizations bring in more pronounced improvements when the sequences being compared are from phylogenetically proximal organisms, which are often difficult to distinguish because of their compositional similarity. While small but noticeable improvements were observed with the Tsallis statistical JSD generalization, relatively large improvements were observed with the Markovian generalization. In contrast, the proposed Tsallis-Markovian generalization yielded more pronounced improvements relative to the Tsallis and Markovian generalizations, specifically when the sequences being compared arose from phylogenetically proximal organisms. PMID:24728338
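
    As an illustration of the measure this abstract builds on, the sketch below computes the standard weighted Jensen-Shannon divergence, JSD = H(sum_i w_i p_i) - sum_i w_i H(p_i), for any number of distributions. It is not the authors' Tsallis or Markovian generalization; the function names, the nucleotide alphabet and the use of single-symbol frequencies are assumptions made for the example.

```python
import math
from collections import Counter


def shannon_entropy(p):
    """Shannon entropy in bits of a discrete distribution p."""
    return -sum(x * math.log2(x) for x in p if x > 0)


def symbol_distribution(seq, alphabet="ACGT"):
    """Relative frequency of each alphabet symbol in seq (hypothetical helper)."""
    counts = Counter(seq)
    total = sum(counts[a] for a in alphabet)
    return [counts[a] / total for a in alphabet]


def jensen_shannon_divergence(distributions, weights=None):
    """Weighted JSD: entropy of the mixture minus the weighted mean entropy."""
    n = len(distributions)
    if weights is None:
        weights = [1.0 / n] * n  # equal weights unless the caller supplies them
    size = len(distributions[0])
    mixture = [
        sum(w * p[i] for w, p in zip(weights, distributions))
        for i in range(size)
    ]
    mean_entropy = sum(w * shannon_entropy(p) for w, p in zip(weights, distributions))
    return shannon_entropy(mixture) - mean_entropy
```

    With equal weights and base-2 logarithms the value lies between 0 bits (identical compositions) and 1 bit (two compositions with no symbols in common). Weights proportional to segment length are one common choice when comparing sequence segments of unequal size.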

  10. Conservation of Endo16 expression in sea urchins despite evolutionary divergence in both cis and trans-acting components of transcriptional regulation

    NASA Technical Reports Server (NTRS)

    Romano, Laura A.; Wray, Gregory A.

    2003-01-01

    Evolutionary changes in transcriptional regulation undoubtedly play an important role in creating morphological diversity. However, there is little information about the evolutionary dynamics of cis-regulatory sequences. This study examines the functional consequence of evolutionary changes in the Endo16 promoter of sea urchins. The Endo16 gene encodes a large extracellular protein that is expressed in the endoderm and may play a role in cell adhesion. Its promoter has been characterized in exceptional detail in the purple sea urchin, Strongylocentrotus purpuratus. We have characterized the structure and function of the Endo16 promoter from a second sea urchin species, Lytechinus variegatus. The Endo16 promoter sequences have evolved in a strongly mosaic manner since these species diverged approximately 35 million years ago: the most proximal region (module A) is conserved, but the remaining modules (B-G) are unalignable. Despite extensive divergence in promoter sequences, the pattern of Endo16 transcription is largely conserved during embryonic and larval development. Transient expression assays demonstrate that 2.2 kb of upstream sequence in either species is sufficient to drive GFP reporter expression that correctly mimics this pattern of Endo16 transcription. Reciprocal cross-species transient expression assays imply that changes have also evolved in the set of transcription factors that interact with the Endo16 promoter. Taken together, these results suggest that stabilizing selection on the transcriptional output may have operated to maintain a similar pattern of Endo16 expression in S. purpuratus and L. variegatus, despite dramatic divergence in promoter sequence and mechanisms of transcriptional regulation.

  11. Generalization of entropy based divergence measures for symbolic sequence analysis.

    PubMed

    Ré, Miguel A; Azad, Rajeev K

    2014-01-01

    Entropy based measures have been frequently used in symbolic sequence analysis. A symmetrized and smoothed form of Kullback-Leibler divergence or relative entropy, the Jensen-Shannon divergence (JSD), is of particular interest because it shares properties with families of other divergence measures and is interpretable in different domains including statistical physics, information theory and mathematical statistics. The uniqueness and versatility of this measure arise from a number of attributes including generalization to any number of probability distributions and association of weights to the distributions. Furthermore, its entropic formulation allows its generalization in different statistical frameworks, such as non-extensive Tsallis statistics and higher order Markovian statistics. We revisit these generalizations and propose a new generalization of JSD in the integrated Tsallis and Markovian statistical framework. We show that this generalization can be interpreted in terms of mutual information. We also investigate the performance of different JSD generalizations in deconstructing chimeric DNA sequences assembled from bacterial genomes including those of E. coli, S. enterica typhi, Y. pestis and H. influenzae. Our results show that the JSD generalizations bring in more pronounced improvements when the sequences being compared are from phylogenetically proximal organisms, which are often difficult to distinguish because of their compositional similarity. While small but noticeable improvements were observed with the Tsallis statistical JSD generalization, relatively large improvements were observed with the Markovian generalization. In contrast, the proposed Tsallis-Markovian generalization yielded more pronounced improvements relative to the Tsallis and Markovian generalizations, specifically when the sequences being compared arose from phylogenetically proximal organisms.

  12. Regulatory versus coding signatures of natural selection in a candidate gene involved in the adaptive divergence of whitefish species pairs (Coregonus spp.)

    PubMed Central

    Jeukens, Julie; Bernatchez, Louis

    2012-01-01

    While gene expression divergence is known to be involved in adaptive phenotypic divergence and speciation, the relative importance of regulatory and structural evolution of genes is poorly understood. A recent next-generation sequencing experiment made it possible to identify candidate genes potentially involved in the ongoing speciation of sympatric dwarf and normal lake whitefish (Coregonus clupeaformis), such as cytosolic malate dehydrogenase (MDH1), which showed both significant expression and sequence divergence. The main goal of this study was to investigate in more detail the signatures of natural selection in the regulatory and coding sequences of MDH1 in lake whitefish and test for parallelism of these signatures with other coregonine species. Sequencing of the two regions in 118 fish from four sympatric pairs of whitefish and two cisco species revealed a total of 35 single nucleotide polymorphisms (SNPs), with more genetic diversity in European compared to North American coregonine species. While the coding region was found to be under purifying selection, an SNP in the proximal promoter exhibited significant allele frequency divergence in a parallel manner among independent sympatric pairs of North American lake whitefish and European whitefish (C. lavaretus). According to transcription factor binding simulation for 22 regulatory haplotypes of MDH1, putative binding profiles were fairly conserved among species, except for the region around this SNP. Moreover, we found evidence for the role of this SNP in the regulation of MDH1 expression level. Overall, these results provide further evidence for the role of natural selection in gene regulation evolution among whitefish species pairs and suggest its possible link with patterns of phenotypic diversity observed in coregonine species. PMID:22408741

  13. Regulatory versus coding signatures of natural selection in a candidate gene involved in the adaptive divergence of whitefish species pairs (Coregonus spp.).

    PubMed

    Jeukens, Julie; Bernatchez, Louis

    2012-01-01

    While gene expression divergence is known to be involved in adaptive phenotypic divergence and speciation, the relative importance of regulatory and structural evolution of genes is poorly understood. A recent next-generation sequencing experiment made it possible to identify candidate genes potentially involved in the ongoing speciation of sympatric dwarf and normal lake whitefish (Coregonus clupeaformis), such as cytosolic malate dehydrogenase (MDH1), which showed both significant expression and sequence divergence. The main goal of this study was to investigate in more detail the signatures of natural selection in the regulatory and coding sequences of MDH1 in lake whitefish and test for parallelism of these signatures with other coregonine species. Sequencing of the two regions in 118 fish from four sympatric pairs of whitefish and two cisco species revealed a total of 35 single nucleotide polymorphisms (SNPs), with more genetic diversity in European compared to North American coregonine species. While the coding region was found to be under purifying selection, an SNP in the proximal promoter exhibited significant allele frequency divergence in a parallel manner among independent sympatric pairs of North American lake whitefish and European whitefish (C. lavaretus). According to transcription factor binding simulation for 22 regulatory haplotypes of MDH1, putative binding profiles were fairly conserved among species, except for the region around this SNP. Moreover, we found evidence for the role of this SNP in the regulation of MDH1 expression level. Overall, these results provide further evidence for the role of natural selection in gene regulation evolution among whitefish species pairs and suggest its possible link with patterns of phenotypic diversity observed in coregonine species.

  14. Patterns and rates of intron divergence between humans and chimpanzees

    PubMed Central

    Gazave, Elodie; Marqués-Bonet, Tomàs; Fernando, Olga; Charlesworth, Brian; Navarro, Arcadi

    2007-01-01

    Background: Introns, which constitute the largest fraction of eukaryotic genes and which had been considered to be neutral sequences, are increasingly acknowledged as having important functions. Several studies have investigated levels of evolutionary constraint along introns and across classes of introns of different length and location within genes. However, thus far these studies have yielded contradictory results. Results: We present the first analysis of human-chimpanzee intron divergence, in which differences in the number of substitutions per intronic site (Ki) can be interpreted as the footprint of different intensities and directions of the pressures of natural selection. Our main findings are as follows: there was a strong positive correlation between intron length and divergence; there was a strong negative correlation between intron length and GC content; and divergence rates vary along introns and depending on their ordinal position within genes (for instance, first introns are more GC rich, longer and more divergent, and divergence is lower at the 3' and 5' ends of all types of introns). Conclusion: We show that the higher divergence of first introns is related to their larger size. Also, the lower divergence of short introns suggests that they may harbor a relatively greater proportion of regulatory elements than long introns. Moreover, our results are consistent with the presence of functionally relevant sequences near the 5' and 3' ends of introns. Finally, our findings suggest that other parts of introns may also be under selective constraints. PMID:17309804

  15. Molecular phylogeny and biogeography of the Qinghai-Tibet Plateau endemic Nannoglottis (Asteraceae).

    PubMed

    Liu, Jian-Quan; Gao, Tian-Gang; Chen, Zhi-Duan; Lu, An-Ming

    2002-06-01

    All taxa endemic to the Qinghai-Tibet Plateau are hypothesized to have originated in situ or from immediately adjacent areas because of the relatively recent formation of the plateau since the Pliocene, followed by the large-scale biota extinction and recession caused by the Quaternary ice sheet. However, identification of specific progenitors remains difficult for some endemics, especially some endemic genera. Nannoglottis, with about eight species endemic to this region, is one such genus. Past taxonomic treatments have suggested its relationships with four different tribes of Asteraceae. We intend to identify the closest relatives of Nannoglottis by evaluating the level of monophyly, tribal delimitation, and systematic position of the genus by using molecular data from ndhF gene, trnL-F, and ITS region sequences. We find that all sampled species of Nannoglottis form a well-defined monophyletic group. This supports all recent taxonomic treatments of Nannoglottis, in which all sampled species were placed in one broadly re-circumscribed genus. Nannoglottis is most closely related to the Astereae, but stands as an isolated genus as the first diverging lineage of the tribe, without close relatives. A tentative relationship, based on the ITS topology, was suggested between Nannoglottis and the next lineage of the tribe, the "basal group," which consists of seven genera from the Southern Hemisphere. Such a relationship is supported by some commonly shared plesiomorphic morphological characters. Despite the very early divergence of Nannoglottis in the Astereae, the tribe must be regarded to have its origin in Southern Hemisphere rather than in Asia, because based on all morphological, molecular, biogeographical, and fossil data, the Asteraceae and its major lineages (tribes) are supposed to have originated in the former area. Long-distance dispersal using Southeast Asia as a steppingstone from Southern Hemisphere to the Qinghai-Tibet Plateau is the most likely explanation for this unusual biogeographic link of Nannoglottis. The 23-32-million-year divergence time between Nannoglottis and the other Astereae estimated by DNA sequences predated the formation of the plateau. This estimation is further favored by the fossil record of the Asteraceae and the possible time of origin of the Astereae. Nannoglottis seems to have reached the Qinghai-Tibet area in the Oligocene-Eocene and then re-diversified with the uplift of the plateau. The molecular infrageneric phylogeny of the genus identifies two distinct clades, which reject the earlier infrageneric classification based on the arrangement of the involucral bracts and the length of the ligules, but agree well with the habits and ecological preferences of its current species. The "alpine shrub" vs. "coniferous forest" divergence within Nannoglottis was estimated at about 3.4 million years ago when the plateau began its first large-scale uplifting and the coniferous vegetation began to appear. Most of the current species at the "coniferous forest" clade of the genus are estimated to have originated from 1.02 to 1.94 million years ago, when the second and third uprisings of the plateau occurred, the climate oscillated and the habitats were strongly changed. The assumed evolution, speciation diversity, and radiation of Nannoglottis based on molecular phylogeny and divergence times agree well with the known geological and paleobotanical histories of the Qinghai-Tibet Plateau. (c) 2002 Elsevier Science (USA).

  16. Multilocus methods for estimating population sizes, migration rates and divergence time, with applications to the divergence of Drosophila pseudoobscura and D. persimilis.

    PubMed Central

    Hey, Jody; Nielsen, Rasmus

    2004-01-01

    The genetic study of diverging, closely related populations is required for basic questions on demography and speciation, as well as for biodiversity and conservation research. However, it is often unclear whether divergence is due simply to separation or whether populations have also experienced gene flow. These questions can be addressed with a full model of population separation with gene flow, by applying a Markov chain Monte Carlo method for estimating the posterior probability distribution of model parameters. We have generalized this method and made it applicable to data from multiple unlinked loci. These loci can vary in their modes of inheritance, and inheritance scalars can be implemented either as constants or as parameters to be estimated. By treating inheritance scalars as parameters it is also possible to address variation among loci in the impact via linkage of recurrent selective sweeps or background selection. These methods are applied to a large multilocus data set from Drosophila pseudoobscura and D. persimilis. The species are estimated to have diverged approximately 500,000 years ago. Several loci have nonzero estimates of gene flow since the initial separation of the species, with considerable variation in gene flow estimates among loci, in both directions between the species. PMID:15238526

  17. Genomics of the divergence continuum in an African plant biodiversity hotspot, I: drivers of population divergence in Restio capensis (Restionaceae).

    PubMed

    Lexer, C; Wüest, R O; Mangili, S; Heuertz, M; Stölting, K N; Pearman, P B; Forest, F; Salamin, N; Zimmermann, N E; Bossolini, E

    2014-09-01

    Understanding the drivers of population divergence, speciation and species persistence is of great interest to molecular ecology, especially for species-rich radiations inhabiting the world's biodiversity hotspots. The toolbox of population genomics holds great promise for addressing these key issues, especially if genomic data are analysed within a spatially and ecologically explicit context. We have studied the earliest stages of the divergence continuum in the Restionaceae, a species-rich and ecologically important plant family of the Cape Floristic Region (CFR) of South Africa, using the widespread CFR endemic Restio capensis (L.) H.P. Linder & C.R. Hardy as an example. We studied diverging populations of this morphotaxon for plastid DNA sequences and >14 400 nuclear DNA polymorphisms from Restriction site Associated DNA (RAD) sequencing and analysed the results jointly with spatial, climatic and phytogeographic data, using a Bayesian generalized linear mixed modelling (GLMM) approach. The results indicate that population divergence across the extreme environmental mosaic of the CFR is mostly driven by isolation by environment (IBE) rather than isolation by distance (IBD) for both neutral and non-neutral markers, consistent with genome hitchhiking or coupling effects during early stages of divergence. Mixed modelling of plastid DNA and single divergent outlier loci from a Bayesian genome scan confirmed the predominant role of climate and pointed to additional drivers of divergence, such as drift and ecological agents of selection captured by phytogeographic zones. Our study demonstrates the usefulness of population genomics for disentangling the effects of IBD and IBE along the divergence continuum often found in species radiations across heterogeneous ecological landscapes. © 2014 John Wiley & Sons Ltd.

  18. Probabilistic divergence time estimation without branch lengths: dating the origins of dinosaurs, avian flight and crown birds.

    PubMed

    Lloyd, G T; Bapst, D W; Friedman, M; Davis, K E

    2016-11-01

    Branch lengths, measured in character changes, are an essential requirement of clock-based divergence estimation, regardless of whether the fossil calibrations used represent nodes or tips. However, a separate set of divergence time approaches are typically used to date palaeontological trees, which may lack such branch lengths. Among these methods, sophisticated probabilistic approaches have recently emerged, in contrast with simpler algorithms relying on minimum node ages. Here, using a novel phylogenetic hypothesis for Mesozoic dinosaurs, we apply two such approaches to estimate divergence times for: (i) Dinosauria, (ii) Avialae (the earliest birds) and (iii) Neornithes (crown birds). We find: (i) the plausibility of a Permian origin for dinosaurs to be dependent on whether Nyasasaurus is the oldest dinosaur, (ii) a Middle to Late Jurassic origin of avian flight regardless of whether Archaeopteryx or Aurornis is considered the first bird and (iii) a Late Cretaceous origin for Neornithes that is broadly congruent with other node- and tip-dating estimates. Demonstrating the feasibility of probabilistic time-scaling further opens up divergence estimation to the rich histories of extinct biodiversity in the fossil record, even in the absence of detailed character data. © 2016 The Authors.

  19. The influence of ignoring secondary structure on divergence time estimates from ribosomal RNA genes.

    PubMed

    Dohrmann, Martin

    2014-02-01

    Genes coding for ribosomal RNA molecules (rDNA) are among the most popular markers in molecular phylogenetics and evolution. However, coevolution of sites that code for pairing regions (stems) in the RNA secondary structure can make it challenging to obtain accurate results from such loci. While the influence of ignoring secondary structure on multiple sequence alignment and tree topology has been investigated in numerous studies, its effect on molecular divergence time estimates is still poorly known. Here, I investigate this issue in Bayesian Markov Chain Monte Carlo (BMCMC) and penalized likelihood (PL) frameworks, using empirical datasets from dragonflies (Odonata: Anisoptera) and glass sponges (Porifera: Hexactinellida). My results indicate that highly biased inferences under substitution models that ignore secondary structure only occur if maximum-likelihood estimates of branch lengths are used as input to PL dating, whereas in a BMCMC framework and in PL dating based on Bayesian consensus branch lengths, the effect is far less severe. I conclude that accounting for coevolution of paired sites in molecular dating studies is not as important as previously suggested, as long as the estimates are based on Bayesian consensus branch lengths instead of ML point estimates. This finding is especially relevant for studies where computational limitations do not allow the use of secondary-structure specific substitution models, or where accurate consensus structures cannot be predicted. I also found that the magnitude and direction (over- vs. underestimating node ages) of bias in age estimates when secondary structure is ignored was not distributed randomly across the nodes of the phylogenies, a phenomenon that requires further investigation. Copyright © 2013 Elsevier Inc. All rights reserved.

  20. Do island plant populations really have lower genetic variation than mainland populations? Effects of selection and distribution range on genetic diversity estimates.

    PubMed

    García-Verdugo, C; Sajeva, M; La Mantia, T; Harrouni, C; Msanda, F; Caujapé-Castells, J

    2015-02-01

    Ecological and evolutionary studies largely assume that island populations display low levels of neutral genetic variation. However, this notion has only been formally tested in a few cases involving plant taxa, and the confounding effect of selection on genetic diversity (GD) estimates based on putatively neutral markers has typically been overlooked. Here, we generated nuclear microsatellite and plastid DNA sequence data in Periploca laevigata, a plant taxon with an island-mainland distribution area, to (i) investigate whether selection affects GD estimates of populations across contrasting habitats; and (ii) test the long-standing idea that island populations have lower GD than their mainland counterparts. Plastid data showed that colonization of the Canary Islands promoted strong lineage divergence within P. laevigata, which was accompanied by selective sweeps at several nuclear microsatellite loci. Inclusion of loci affected by strong divergent selection produced a significant downward bias in the GD estimates of the mainland lineage, but such underestimates were substantial (>14%) only when more than one locus under selection was included in the computations. When loci affected by selection were removed, we did not find evidence that insular Periploca populations have less GD than their mainland counterparts. The analysis of data obtained from a comprehensive literature survey reinforced this result, as overall comparisons of GD estimates between island and mainland populations were not significant across plant taxa (N = 66), with the only exception of island endemics with narrow distributions. This study suggests that identification and removal of markers potentially affected by selection should be routinely implemented in estimates of GD, particularly if different lineages are compared. Furthermore, it provides compelling evidence that the expectation of low GD cannot be generalized to island plant populations. © 2015 John Wiley & Sons Ltd.

  1. HYBRIDCHECK: software for the rapid detection, visualization and dating of recombinant regions in genome sequence data.

    PubMed

    Ward, Ben J; van Oosterhout, Cock

    2016-03-01

    HYBRIDCHECK is a software package to visualize the recombination signal in large DNA sequence data sets, and it can be used to analyse recombination, genetic introgression, hybridization and horizontal gene transfer. It can scan large (multiple kb) contigs and whole-genome sequences of three or more individuals. HYBRIDCHECK is written in R for OS X, Linux and Windows operating systems, and it has a simple graphical user interface. In addition, the R code can be readily incorporated in scripts and analysis pipelines. HYBRIDCHECK implements several ABBA-BABA tests and visualizes the effects of hybridization and the resulting mosaic-like genome structure in high-density graphics. The package also reports the following: (i) the breakpoint positions, (ii) the number of mutations in each introgressed block, (iii) the probability that the identified region is not caused by recombination and (iv) the estimated age of each recombination event. The divergence times between the donor and recombinant sequence are calculated using a JC, K80, F81, HKY or GTR correction, and the dating algorithm is exceedingly fast. By estimating the coalescence time of introgressed blocks, it is possible to distinguish between hybridization and incomplete lineage sorting. HYBRIDCHECK is libre software and it and its manual are free to download from http://ward9250.github.io/HybridCheck/. © 2015 John Wiley & Sons Ltd.
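    The simplest of the corrections named in the abstract above, the Jukes-Cantor (JC) model, can be sketched in Python as follows. This is not HYBRIDCHECK's own code (the package is written in R, and its dating algorithm is not described in the abstract); it only illustrates the standard JC distance d = -(3/4) ln(1 - 4p/3) for an observed proportion p of differing sites, followed by the common conversion T = d / (2 mu). The function names and the mutation rate in the usage note are hypothetical.

```python
import math

VALID_BASES = set("ACGT")  # assumption: gaps and ambiguity codes are skipped


def p_distance(seq_a, seq_b):
    """Proportion of differing sites between two aligned sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differing = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in VALID_BASES and b in VALID_BASES:
            compared += 1
            if a != b:
                differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared


def jukes_cantor_distance(p):
    """JC-corrected substitutions per site for an observed proportion p."""
    if p >= 0.75:
        # The correction is undefined once the sequences look saturated.
        raise ValueError("p must be below 0.75 for the JC correction")
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


def divergence_time(distance, mutation_rate):
    """Time since divergence, T = d / (2 * mu).

    The factor 2 reflects substitutions accumulating independently on both
    lineages; the result is in the time unit of the mutation rate.
    """
    return distance / (2.0 * mutation_rate)
```

    For example, `divergence_time(jukes_cantor_distance(p_distance(a, b)), 1e-8)` would give an age in generations if 1e-8 substitutions per site per generation were an appropriate rate for the organism, which is an assumption to check against the literature for each study system.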

  2. A comprehensive bioinformatic analysis of hepatitis D virus full-length genomes.

    PubMed

    Delfino, C M; Cerrudo, C S; Biglione, M; Oubiña, J R; Ghiringhelli, P D; Mathet, V L

    2018-02-06

    In association with hepatitis B virus (HBV), hepatitis delta virus (HDV) is a subviral agent that may promote severe acute and chronic forms of liver disease. Based on the percentage of nucleotide identity of the genome, HDV was initially classified into three genotypes. However, since 2006, the original classification has been further expanded into eight clades/genotypes. The intergenotype divergence may be as high as 35%-40% over the entire RNA genome, whereas sequence heterogeneity among the isolates of a given genotype is <20%; furthermore, HDV recombinants have been clearly demonstrated. The genetic diversity of HDV is related to the geographic origin of the isolates. This study shows the first comprehensive bioinformatic analysis of the complete available set of HDV sequences, using both nucleotide and protein phylogenies (based on an evolutionary model selection, gamma distribution estimation, tree inference and phylogenetic distance estimation), protein composition analysis and comparison (based on the presence of invariant residues, molecular signatures, amino acid frequencies and mono- and di-amino acid compositional distances), as well as amino acid changes in sequence evolution. Taking into account the congruent and consistent results of both nucleotide and amino acid analyses of GenBank available sequences (recorded as of January, 2017), we propose that the eight hepatitis D virus genotypes may be grouped into three large genogroups fully supported by their shared characteristics. © 2018 John Wiley & Sons Ltd.

  3. Potential for bias and low precision in molecular divergence time estimation of the Canopy of Life: an example from aquatic bird families

    PubMed Central

    van Tuinen, Marcel; Torres, Christopher R.

    2015-01-01

    Uncertainty in divergence time estimation is frequently studied from many angles but rarely from the perspective of phylogenetic node age. If appropriate molecular models and fossil priors are used, a multi-locus, partitioned analysis is expected to equally minimize error in accuracy and precision across all nodes of a given phylogeny. In contrast, if available models fail to completely account for rate heterogeneity, substitution saturation and incompleteness of the fossil record, uncertainty in divergence time estimation may increase with node age. While many studies have stressed this concern with regard to deep nodes in the Tree of Life, the inference that molecular divergence time estimation of shallow nodes is less sensitive to erroneous model choice has not been tested explicitly in a Bayesian framework. Because of available divergence time estimation methods that permit fossil priors across any phylogenetic node and the present increase in efficient, cheap collection of species-level genomic data, insight is needed into the performance of divergence time estimation of shallow (<10 MY) nodes. Here, we performed multiple sensitivity analyses in a multi-locus data set of aquatic birds with six fossil constraints. Comparison across divergence time analyses that varied taxon and locus sampling, number and position of fossil constraint and shape of prior distribution yielded several insights. Deviation from node ages obtained from a reference analysis was generally highest for the shallowest nodes but determined more by temporal placement than number of fossil constraints. Calibration with only the shallowest nodes significantly underestimated the aquatic bird fossil record, indicating the presence of saturation. Although joint calibration with all six priors yielded ages most consistent with the fossil record, ages of shallow nodes were overestimated. This bias was found in both mtDNA and nDNA regions. Thus, divergence time estimation of shallow nodes may suffer from bias and low precision, even when appropriate fossil priors and best available substitution models are chosen. Much care must be taken to address the possible ramifications of substitution saturation across the entire Tree of Life. PMID:26106406

  4. TITAN: inference of copy number architectures in clonal cell populations from tumor whole-genome sequence data

    PubMed Central

    Roth, Andrew; Khattra, Jaswinder; Ho, Julie; Yap, Damian; Prentice, Leah M.; Melnyk, Nataliya; McPherson, Andrew; Bashashati, Ali; Laks, Emma; Biele, Justina; Ding, Jiarui; Le, Alan; Rosner, Jamie; Shumansky, Karey; Marra, Marco A.; Gilks, C. Blake; Huntsman, David G.; McAlpine, Jessica N.; Aparicio, Samuel

    2014-01-01

    The evolution of cancer genomes within a single tumor creates mixed cell populations with divergent somatic mutational landscapes. Inference of tumor subpopulations has been disproportionately focused on the assessment of somatic point mutations, whereas computational methods targeting evolutionary dynamics of copy number alterations (CNA) and loss of heterozygosity (LOH) in whole-genome sequencing data remain underdeveloped. We present a novel probabilistic model, TITAN, to infer CNA and LOH events while accounting for mixtures of cell populations, thereby estimating the proportion of cells harboring each event. We evaluate TITAN on idealized mixtures, simulating clonal populations from whole-genome sequences taken from genomically heterogeneous ovarian tumor sites collected from the same patient. In addition, we show in 23 whole genomes of breast tumors that the inference of CNA and LOH using TITAN critically informs population structure and the nature of the evolving cancer genome. Finally, we experimentally validated subclonal predictions using fluorescence in situ hybridization (FISH) and single-cell sequencing from an ovarian cancer patient sample, thereby recapitulating the key modeling assumptions of TITAN. PMID:25060187

  5. Genetic structure and historical diversification of catfish Brachyplatystoma platynemum (Siluriformes: Pimelodidae) in the Amazon basin with implications for its conservation.

    PubMed

    Ochoa, Luz Eneida; Pereira, Luiz Henrique G; Costa-Silva, Guilherme Jose; Roxo, Fábio F; Batista, Jacqueline S; Formiga, Kyara; Foresti, Fausto; Oliveira, Claudio

    2015-05-01

    Brachyplatystoma platynemum is a catfish species widely distributed in the Amazon basin. Despite being considered of little commercial interest, the decline in other fish populations has contributed to the increase in the catches of this species. The structure, population genetic variability, and evolutionary process that have driven the diversification of this species are presently unknown. Considering that, in order to better understand the genetic structure of this species, we analyzed individuals from seven locations of the Amazon basin using eight molecular markers: control region and cytochrome b mtDNA sequences, and a set of six nuclear microsatellite loci. The results show high levels of haplotype diversity and point to the occurrence of two structured populations (Amazon River and the Madeira River) with high values for FST. Divergence time estimates based on mtDNA indicated that these populations diverged about 1.0 Mya (0.2-2.5 Mya 95% HPD) using cytochrome b and 1.4 Mya (0.2-2.7 Mya 95% HPD) using control region. During that time, the influence of climate changes and hydrological events such as sea level oscillations and drainage isolation as a result of geological processes in the Pleistocene may have contributed to the current structure of B. platynemum populations, as well as of differences in water chemistry in Madeira River. The strong genetic structure and the time of genetic divergence estimated for the groups may indicate the existence of strongly structured populations of B. platynemum in the Amazon basin.

  6. Genetic structure and historical diversification of catfish Brachyplatystoma platynemum (Siluriformes: Pimelodidae) in the Amazon basin with implications for its conservation

    PubMed Central

    Ochoa, Luz Eneida; Pereira, Luiz Henrique G; Costa-Silva, Guilherme Jose; Roxo, Fábio F; Batista, Jacqueline S; Formiga, Kyara; Foresti, Fausto; Oliveira, Claudio

    2015-01-01

    Brachyplatystoma platynemum is a catfish species widely distributed in the Amazon basin. Despite being considered of little commercial interest, the decline in other fish populations has contributed to the increase in the catches of this species. The structure, population genetic variability, and evolutionary process that have driven the diversification of this species are presently unknown. Considering that, in order to better understand the genetic structure of this species, we analyzed individuals from seven locations of the Amazon basin using eight molecular markers: control region and cytochrome b mtDNA sequences, and a set of six nuclear microsatellite loci. The results show high levels of haplotype diversity and point to the occurrence of two structured populations (Amazon River and the Madeira River) with high values for FST. Divergence time estimates based on mtDNA indicated that these populations diverged about 1.0 Mya (0.2–2.5 Mya 95% HPD) using cytochrome b and 1.4 Mya (0.2–2.7 Mya 95% HPD) using control region. During that time, the influence of climate changes and hydrological events such as sea level oscillations and drainage isolation as a result of geological processes in the Pleistocene may have contributed to the current structure of B. platynemum populations, as well as of differences in water chemistry in Madeira River. The strong genetic structure and the time of genetic divergence estimated for the groups may indicate the existence of strong structure populations of B. platynemum in the Amazon basin. PMID:26045952

  7. Molecular cloning, sequence characterization and recombinant expression of Nanog gene in goat fibroblast cells using lentiviral based expression system.

    PubMed

    Singhal, Dinesh K; Singhal, Raxita; Malik, Hruda N; Kumar, Surender; Kumar, Sudarshan; Mohanty, Ashok K; Kaushik, Jai K; Malakar, Dhruba

    2014-01-01

    Nanog is a homeodomain containing protein which plays important roles in regulation of signaling pathways for maintenance and induction of pluripotency in stem cells. Because of its unique expression in stem cells it is also regarded as pluripotency marker. In this study goat Nanog (gNanog) gene has been amplified, cloned and characterized at sequence level with successful over-expression in CHO-K1 cell line using a lentiviral based system. gNanog ORF is 903 bp long which codes for Nanog protein of size 300 amino acids (aas). Complete nucleotide sequence shows some evolutionary mutation in goat in comparison to other species. Protein sequence of goat is highly similar to other species. Overall, gNanog nucleotide sequence and predicted protein sequence showed high similarity and minimum divergence with cattle (96 % identity/4 % divergence) and buffalo (94/5 %) while low similarity and high divergence with pig (84/15 %), human (81/23 %) and mouse (69/40 %) indicating evolutionary closeness of gNanog to cattle and buffalo. gNanog lentiviral expression construct was prepared for over-expression of Nanog gene in adult goat fibroblast cells. Lentiviral expression construct of Nanog enabled continuous protein expression for induction and maintenance of pluripotency. Western blotting revealed the expression of Nanog gene at protein level which supported that the lentiviral expression system is highly promising for Nanog protein expression in differentiated goat cell.

  8. Flying with the birds? Recent large-area dispersal of four Australian Limnadopsis species (Crustacea: Branchiopoda: Spinicaudata)

    PubMed Central

    Schwentner, Martin; Timms, Brian V; Richter, Stefan

    2012-01-01

    Temporary water bodies are important freshwater habitats in the arid zone of Australia. They harbor a distinct fauna and provide important feeding and breeding grounds for water birds. This paper assesses, on the basis of haplotype networks, analyses of molecular variation and relaxed molecular clock divergence time estimates, the phylogeographic history, and population structure of four common temporary water species of the Australian endemic clam shrimp taxon Limnadopsis in eastern and central Australia (an area of >1,350,000 km2). Mitochondrial cytochrome c oxidase subunit I sequences of 413 individuals and a subset of 63 nuclear internal transcribed spacer 2 sequences were analyzed. Genetic differentiation was observed between populations inhabiting southeastern and central Australia and those inhabiting the northern Lake Eyre Basin and Western Australia. However, over large parts of the study area and across river drainage systems in southeastern and central Australia (the Murray–Darling Basin, Bulloo River, and southern Lake Eyre Basin), no evidence of population subdivision was observed in any of the four Limnadopsis species. This indicates recent gene flow across an area of ∼800,000 km2. This finding contrasts with patterns observed in other Australian arid zone taxa, particularly freshwater species, whose populations are often structured according to drainage systems. The lack of genetic differentiation within the area in question may be linked to the huge number of highly nomadic water birds that potentially disperse the resting eggs of Limnadopsis among temporary water bodies. Genetically undifferentiated populations on a large geographic scale contrast starkly with findings for many other large branchiopods in other parts of the world, where pronounced genetic structure is often observed even in populations inhabiting pools separated by a few kilometers. Due to its divergent genetic lineages (up to 5.6% uncorrected p-distance) and the relaxed molecular clock divergence time estimates obtained, Limnadopsis parvispinus is assumed to have inhabited the Murray–Darling Basin continuously since the mid-Pliocene (∼4 million years ago). This means that suitable temporary water bodies would have existed in this area throughout the wet–dry cycles of the Pleistocene. PMID:22957166
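
    The "uncorrected p-distance" quoted in this record is simply the proportion of aligned sites at which two sequences differ. The sketch below is a minimal illustration under stated assumptions, not the authors' pipeline: the function name p_distance is hypothetical, the inputs are assumed to be pre-aligned DNA strings of equal length, and any site where either sequence has a gap or ambiguity code is skipped.

```python
def p_distance(seq1, seq2):
    """Uncorrected p-distance: fraction of compared aligned sites that differ.

    Assumption: seq1 and seq2 are already aligned and of equal length.
    Sites with a gap or ambiguity code in either sequence are ignored.
    """
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differences = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        # Skip anything that is not an unambiguous nucleotide.
        if a not in "ACGT" or b not in "ACGT":
            continue
        compared += 1
        if a != b:
            differences += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differences / compared
```

    On this definition, a value of 0.056 would correspond to the 5.6% figure reported above, that is, roughly 56 differing sites per 1,000 compared.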

  9. Extensive Concerted Evolution of Rice Paralogs and the Road to Regaining Independence

    PubMed Central

    Wang, Xiyin; Tang, Haibao; Bowers, John E.; Feltus, Frank A.; Paterson, Andrew H.

    2007-01-01

    Many genes duplicated by whole-genome duplications (WGDs) are more similar to one another than expected. We investigated whether concerted evolution through conversion and crossing over, well-known to affect tandem gene clusters, also affects dispersed paralogs. Genome sequences for two Oryza subspecies reveal appreciable gene conversion in the ∼0.4 MY since their divergence, with a gradual progression toward independent evolution of older paralogs. Since divergence from subspecies indica, ∼8% of japonica paralogs produced 5–7 MYA on chromosomes 11 and 12 have been affected by gene conversion and several reciprocal exchanges of chromosomal segments, while ∼70-MY-old “paleologs” resulting from a genome duplication (GD) show much less conversion. Sequence similarity analysis in proximal gene clusters also suggests more conversion between younger paralogs. About 8% of paleologs may have been converted since rice–sorghum divergence ∼41 MYA. Domain-encoding sequences are more frequently converted than nondomain sequences, suggesting a sort of circularity—that sequences conserved by selection may be further conserved by relatively frequent conversion. The higher level of concerted evolution in the 5–7 MY-old segmental duplication may reflect the behavior of many genomes within the first few million years after duplication or polyploidization. PMID:18039882

  10. An integrative taxonomic study reveals a new species of Tylodelphys Diesing, 1950 (Digenea: Diplostomidae) in central and northern Mexico.

    PubMed

    García-Varela, M; Sereno-Uribe, A L; Pinacho-Pinacho, C D; Hernández-Cruz, E; Pérez-Ponce de León, G

    2016-11-01

    Tylodelphys aztecae n. sp. (Digenea: Diplostomidae) is described from adult specimens obtained from the intestine of the pied-billed grebe (Podilymbus podiceps) and the metacercariae found in the body cavity of freshwater fishes of the families Goodeidae and Cyprinidae in eight localities across central and northern Mexico. The new species is mainly distinguished from the other four described species of Tylodelphys from the Americas (T. adulta, T. americana, T. elongata and T. brevis) by having a forebody slightly concave, a larger ventral sucker, two larger pseudosuckers and by having between 2 and 7 eggs in the uterus. Partial DNA sequences of the mitochondrial gene cytochrome c oxidase subunit I (cox1), and the internal transcribed spacers (ITS1+5.8S+ ITS2) of the ribosomal DNA, were generated for both developmental stages and compared with available sequences in GenBank of other congeners. The genetic divergence estimated among Tylodelphys aztecae n. sp. and other congeneric species varied from 12 to 15% for cox1, and from 3 to 11% for ITS. In contrast, the genetic divergence among metacercariae and adults of the new species was very low, ranging between 0 and 1% for cox1 and between 0 and 0.3% for ITS. Phylogenetic analyses inferred with both molecular markers using maximum likelihood and Bayesian inference placed the adults and their metacercariae in a single clade, confirming that both stages are conspecific. The morphological evidence and the genetic divergence, in combination with the reciprocal monophyly in both phylogenetic trees, support the hypothesis that the diplostomids found in the intestines of the pied-billed grebe bird and the body cavity from goodeid and cyprinid fishes in central and northern Mexico represent a new species.

  11. Mitochondrial sequence divergence among Antarctic killer whale ecotypes is consistent with multiple species.

    PubMed

    LeDuc, Richard G; Robertson, Kelly M; Pitman, Robert L

    2008-08-23

    Recently, three visually distinct forms of killer whales (Orcinus orca) were described from Antarctic waters and designated as types A, B and C. Based on consistent differences in prey selection and habitat preferences, morphological divergence and apparent lack of interbreeding among these broadly sympatric forms, it was suggested that they may represent separate species. To evaluate this hypothesis, we compared complete sequences of the mitochondrial control region from 81 Antarctic killer whale samples, including 9 type A, 18 type B, 47 type C and 7 type-undetermined individuals. We found three fixed differences that separated type A from B and C, and a single fixed difference that separated type C from A and B. These results are consistent with reproductive isolation among the different forms, although caution is needed in drawing further conclusions. Despite dramatic differences in morphology and ecology, the relatively low levels of sequence divergence in Antarctic killer whales indicate that these evolutionary changes occurred relatively rapidly and recently.

  12. Molecular diversity of some species belonging to the genus Daphnia O. F. Müller, 1785 (Crustacea: Cladocera) in Turkey.

    PubMed

    Özdemir, Ebru; Altındağ, Ahmet; Kandemir, İrfan

    2017-05-01

    Daphnia is a freshwater zooplankton species with controversial taxonomy due to its high morphological variation linked to environmental factors and inter-specific hybridization and polyploidy in some groups. The aim of the present study is to examine molecular diversity of some Daphnia species in Turkey and to establish DNA barcodes of Turkish Daphnia species. Sequence analysis was performed using 540 bp region of cytochrome oxidase subunit I gene of mitochondrial DNA. A total of 34 haplotypes have been identified for Turkey. Daphnia pulex complex was divided into two clades with 16.1% sequence divergence according to molecular taxonomy based on Kimura 2-parameter. The clade which was molecularly diverged from Daphnia pulex with 16.1% sequence divergence was found to show 99% similarity with Daphnia cf. pulicaria (sensu Alonso 1996) instead of Daphnia pulicaria Forbes, 1893. Furthermore, this study has contributed to Turkish zoogeography by demonstrating the distribution of Daphnia species in Turkey.
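
    The Kimura 2-parameter (K2P) distance mentioned in this record corrects the raw proportion of differences for multiple substitutions at the same site while treating transitions and transversions separately. With P the proportion of transitional differences and Q the proportion of transversional differences, the standard formula is d = -1/2 ln(1 - 2P - Q) - 1/4 ln(1 - 2Q). The sketch below is an illustration only; the function name k2p_distance is hypothetical, and the inputs are assumed to be pre-aligned DNA strings.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}

def k2p_distance(seq1, seq2):
    """Kimura 2-parameter distance between two aligned DNA sequences.

    Assumption: sequences are pre-aligned and of equal length; sites with
    gaps or ambiguity codes in either sequence are skipped.
    """
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    compared = transitions = transversions = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue
        compared += 1
        if a == b:
            continue
        pair = {a, b}
        # A<->G and C<->T are transitions; every other change is a transversion.
        if pair <= PURINES or pair <= PYRIMIDINES:
            transitions += 1
        else:
            transversions += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    term1 = 1 - 2 * p - q
    term2 = 1 - 2 * q
    if term1 <= 0 or term2 <= 0:
        raise ValueError("sequences too divergent for the K2P correction")
    return -0.5 * math.log(term1) - 0.25 * math.log(term2)
```

    Because of the correction, the K2P value is always at least as large as the uncorrected proportion of differences, and the gap between the two widens as sequences become more divergent.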

  13. Archaebacterial rhodopsin sequences: Implications for evolution

    NASA Technical Reports Server (NTRS)

    Lanyi, J. K.

    1991-01-01

    It was proposed over 10 years ago that the archaebacteria represent a separate kingdom which diverged very early from the eubacteria and eukaryotes. It follows that investigations of archaebacterial characteristics might reveal features of early evolution. So far, two genes, one for bacteriorhodopsin and another for halorhodopsin, both from Halobacterium halobium, have been sequenced. We cloned and sequenced the gene coding for the polypeptide of another one of these rhodopsins, a halorhodopsin in Natronobacterium pharaonis. Peptide sequencing of cyanogen bromide fragments, and immuno-reactions of the protein and synthetic peptides derived from the C-terminal gene sequence, confirmed that the open reading frame was the structural gene for the pharaonis halorhodopsin polypeptide. The flanking DNA sequences of this gene, as well as those of other bacterial rhodopsins, were compared to previously proposed archaebacterial consensus sequences. In pairwise comparisons of the open reading frame with DNA sequences for bacterio-opsin and halo-opsin from Halobacterium halobium, silent divergences were calculated. These indicate very considerable evolutionary distance between each pair of genes, even in the same organism. In spite of this, the three protein sequences show extensive similarities, indicating strong selective pressures.

  14. Divergence and codon usage bias of Betanodavirus, a neurotropic pathogen in fish.

    PubMed

    He, Mei; Teng, Chun-Bo

    2015-02-01

    Betanodavirus is a small bipartite RNA virus of global economic significance that can cause severe neurological disorders in an increasing number of marine fish species. Herein, to further the understanding of the evolution of betanodavirus, Bayesian coalescent analyses were conducted on the time-stamped entire coding sequences of their RNA polymerase and coat protein genes. Similar moderate nucleotide substitution rates were then estimated for the two genes. According to age calculations, the divergence of the two genes into the four genotypes initiated nearly simultaneously at ∼700 years ago, despite the different scenarios, whereas the seven analyzed chimeric isolates might be the outcomes of a single genetic reassortment event taking place in the early 1980s in Southern Europe. Furthermore, codon usage bias analyses indicated that each gene had influences in addition to mutational bias and that the codon choice of betanodavirus did not completely comply with that of the fish host. Copyright © 2014 Elsevier Inc. All rights reserved.

  15. Timing major conflict between mitochondrial and nuclear genes in species relationships of Polygonia butterflies (Nymphalidae: Nymphalini)

    PubMed Central

    Wahlberg, Niklas; Weingartner, Elisabet; Warren, Andrew D; Nylin, Sören

    2009-01-01

    Background Major conflict between mitochondrial and nuclear genes in estimating species relationships is an increasingly common finding in animals. Usually this is attributed to incomplete lineage sorting, but recently the possibility has been raised that hybridization is important in generating such phylogenetic patterns. Just how widespread ancient and/or recent hybridization is in animals and how it affects estimates of species relationships is still not well-known. Results We investigate the species relationships and their evolutionary history over time in the genus Polygonia using DNA sequences from two mitochondrial gene regions (COI and ND1, total 1931 bp) and four nuclear gene regions (EF-1α, wingless, GAPDH and RpS5, total 2948 bp). We found clear, strongly supported conflict between mitochondrial and nuclear DNA sequences in estimating species relationships in the genus Polygonia. Nodes at which there was no conflict tended to have diverged at the same time when analyzed separately, while nodes at which conflict was present diverged at different times. We find that two species create most of the conflict, and attribute the conflict found in Polygonia satyrus to ancient hybridization and conflict found in Polygonia oreas to recent or ongoing hybridization. In both examples, the nuclear gene regions tended to give the phylogenetic relationships of the species supported by morphology and biology. Conclusion Studies inferring species-level relationships using molecular data should never be based on a single locus. Here we show that the phylogenetic hypothesis generated using mitochondrial DNA gives a very different interpretation of the evolutionary history of Polygonia species compared to that generated from nuclear DNA. We show that possible cases of hybridization in Polygonia are not limited to sister species, but may be inferred further back in time. Furthermore, we provide more evidence that Haldane's effect might not be as strong a process in preventing hybridization in butterflies as has been previously thought. PMID:19422691

  16. Divergence with gene flow within the recent chipmunk radiation (Tamias)

    PubMed Central

    Sullivan, J; Demboski, J R; Bell, K C; Hird, S; Sarver, B; Reid, N; Good, J M

    2014-01-01

    Increasing data have supported the importance of divergence with gene flow (DGF) in the generation of biological diversity. In such cases, lineage divergence occurs on a shorter timescale than does the completion of reproductive isolation. Although it is critical to explore the mechanisms driving divergence and preventing homogenization by hybridization, it is equally important to document cases of DGF in nature. Here we synthesize data that have accumulated over the last dozen or so years on DGF in the chipmunk (Tamias) radiation with new data that quantify very high rates of mitochondrial DNA (mtDNA) introgression among para- and sympatric species in the T. quadrivittatus group in the central and southern Rocky Mountains. These new data (188 cytochrome b sequences) bring the total number of sequences up to 1871; roughly 16% (298) of the chipmunks we have sequenced exhibit introgressed mtDNA. This includes ongoing introgression between subspecies and between both closely related and distantly related taxa. In addition, we have identified several taxa that are apparently fixed for ancient introgressions and in which there is no evidence of ongoing introgression. A recurrent observation is that these introgressions occur between ecologically and morphologically diverged, sometimes non-sister taxa that engage in well-documented niche partitioning. Thus, the chipmunk radiation in western North America represents an excellent mammalian example of speciation in the face of recurrent gene flow among lineages and where biogeography, habitat differentiation and mating systems suggest important roles for both ecological and sexual selection. PMID:24781803

  17. PopHuman: the human population genomics browser

    PubMed Central

    Mulet, Roger; Villegas-Mirón, Pablo; Hervas, Sergi; Sanz, Esteve; Velasco, Daniel; Bertranpetit, Jaume; Laayouni, Hafid

    2018-01-01

    Abstract The 1000 Genomes Project (1000GP) represents the most comprehensive world-wide nucleotide variation data set so far in humans, providing the sequencing and analysis of 2504 genomes from 26 populations and reporting >84 million variants. The availability of this sequence data provides the human lineage with an invaluable resource for population genomics studies, allowing the testing of molecular population genetics hypotheses and eventually the understanding of the evolutionary dynamics of genetic variation in human populations. Here we present PopHuman, a new population genomics-oriented genome browser based on JBrowse that allows the interactive visualization and retrieval of an extensive inventory of population genetics metrics. Efficient and reliable parameter estimates have been computed using a novel pipeline that faces the unique features and limitations of the 1000GP data, and include a battery of nucleotide variation measures, divergence and linkage disequilibrium parameters, as well as different tests of neutrality, estimated in non-overlapping windows along the chromosomes and in annotated genes for all 26 populations of the 1000GP. PopHuman is open and freely available at http://pophuman.uab.cat. PMID:29059408

  18. Chromosomal Speciation in the Genomics Era: Disentangling Phylogenetic Evolution of Rock-wallabies.

    PubMed

    Potter, Sally; Bragg, Jason G; Blom, Mozes P K; Deakin, Janine E; Kirkpatrick, Mark; Eldridge, Mark D B; Moritz, Craig

    2017-01-01

    The association of chromosome rearrangements (CRs) with speciation is well established, and there is a long history of theory and evidence relating to "chromosomal speciation." Genomic sequencing has the potential to provide new insights into how reorganization of genome structure promotes divergence, and in model systems has demonstrated reduced gene flow in rearranged segments. However, there are limits to what we can understand from a small number of model systems, which each only tell us about one episode of chromosomal speciation. Progressing from patterns of association between chromosome (and genic) change, to understanding processes of speciation requires both comparative studies across diverse systems and integration of genome-scale sequence comparisons with other lines of evidence. Here, we showcase a promising example of chromosomal speciation in a non-model organism, the endemic Australian marsupial genus Petrogale. We present initial phylogenetic results from exon-capture that resolve a history of divergence associated with extensive and repeated CRs. Yet it remains challenging to disentangle gene tree heterogeneity caused by recent divergence and gene flow in this and other such recent radiations. We outline a way forward for better integration of comparative genomic sequence data with evidence from molecular cytogenetics, and analyses of shifts in the recombination landscape and potential disruption of meiotic segregation and epigenetic programming. In all likelihood, CRs impact multiple cellular processes and these effects need to be considered together, along with effects of genic divergence. Understanding the effects of CRs together with genic divergence will require development of more integrative theory and inference methods. Together, new data and analysis tools will combine to shed light on long standing questions of how chromosome and genic divergence promote speciation.

  19. Adaptive microclimatic structural and expressional dehydrin 1 evolution in wild barley, Hordeum spontaneum, at 'Evolution Canyon', Mount Carmel, Israel.

    PubMed

    Yang, Zujun; Zhang, Tao; Bolshoy, Alexander; Beharav, Alexander; Nevo, Eviatar

    2009-05-01

    'Evolution Canyon' (ECI) at Lower Nahal Oren, Mount Carmel, Israel, is an optimal natural microscale model for unravelling evolution in action highlighting the twin evolutionary processes of adaptation and speciation. A major model organism in ECI is wild barley, Hordeum spontaneum, the progenitor of cultivated barley, which displays dramatic interslope adaptive and speciational divergence on the 'African' dry slope (AS) and the 'European' humid slope (ES), separated on average by 200 m. Here we examined interslope single nucleotide polymorphism (SNP) sequences and the expression diversity of the drought resistant dehydrin 1 gene (Dhn1) between the opposite slopes. We analysed 47 plants (genotypes), 4-10 individuals in each of seven stations (populations) in an area of 7000 m², for Dhn1 sequence diversity located in the 5' upstream flanking region of the gene. We found significant levels of Dhn1 genic diversity represented by 29 haplotypes, derived from 45 SNPs in a total of 708 bp sites. Most of the haplotypes, 25 out of 29 (= 86.2%), were represented by one genotype; hence, unique to one population. Only a single haplotype was common to both slopes. Genetic divergence of sequence and haplotype diversity was generally and significantly different among the populations and slopes. Nucleotide diversity was higher on the AS, whereas haplotype diversity was higher on the ES. Interslope divergence was significantly higher than intraslope divergence. The applied Tajima D rejected neutrality of the SNP diversity. The Dhn1 expression under dehydration indicated interslope divergent expression between AS and ES genotypes, reinforcing Dhn1 associated with drought resistance of wild barley at 'Evolution Canyon'. These results are inexplicable by mutation, gene flow, or chance effects, and support adaptive natural microclimatic selection as the major evolutionary divergent driving force.

  20. Molecular phylogeny, population genetics, and evolution of heterocystous cyanobacteria using nifH gene sequences.

    PubMed

    Singh, Prashant; Singh, Satya Shila; Elster, Josef; Mishra, Arun Kumar

    2013-06-01

    In order to assess phylogeny, population genetics, and approximation of future course of cyanobacterial evolution based on nifH gene sequences, 41 heterocystous cyanobacterial strains collected from all over India have been used in the present study. NifH gene sequence analysis data confirm that the heterocystous cyanobacteria are monophyletic while the Stigonematales show polyphyletic origin with grave intermixing. Further, analysis of nifH gene sequence data using intricate mathematical extrapolations revealed that the nucleotide diversity and recombination frequency are much greater in Nostocales than in the Stigonematales. Similarly, DNA divergence studies showed significant values of divergence with greater gene conversion tracts in the unbranched (Nostocales) than the branched (Stigonematales) strains. Our data strongly support the origin of true branching cyanobacterial strains from the unbranched strains.

  1. DNA-Sequence Variation Among Schistosoma mekongi Populations and Related Taxa; Phylogeography and the Current Distribution of Asian Schistosomiasis

    PubMed Central

    Attwood, Stephen W.; Fatih, Farrah A.; Upatham, E. Suchart

    2008-01-01

    Background Schistosomiasis in humans along the lower Mekong River has proven a persistent public health problem in the region. The causative agent is the parasite Schistosoma mekongi (Trematoda: Digenea). A new transmission focus is reported, as well as the first study of genetic variation among S. mekongi populations. The aim is to confirm the identity of the species involved at each known focus of Mekong schistosomiasis transmission, to examine historical relationships among the populations and related taxa, and to provide data for use (a priori) in further studies of the origins, radiation, and future dispersal capabilities of S. mekongi. Methodology/Principal Findings DNA sequence data are presented for four populations of S. mekongi from Cambodia and southern Laos, three of which were distinguishable at the COI (cox1) and 12S (rrnS) mitochondrial loci sampled. A phylogeny was estimated for these populations and the other members of the Schistosoma sinensium group. The study provides new DNA sequence data for three new populations and one new locus/population combination. A Bayesian approach is used to estimate divergence dates for events within the S. sinensium group and among the S. mekongi populations. Conclusions/Significance The date estimates are consistent with phylogeographical hypotheses describing a Pliocene radiation of the S. sinensium group and a mid-Pleistocene invasion of Southeast Asia by S. mekongi. The date estimates also provide Bayesian priors for future work on the evolution of S. mekongi. The public health implications of S. mekongi transmission outside the lower Mekong River are also discussed. PMID:18350111

  2. Complete nucleotide sequence of a novel Hibiscus-infecting Cilevirus from Florida and its relationship with closely associated Cileviruses

    USDA-ARS's Scientific Manuscript database

    The complete nucleotide sequence of a recently discovered Florida (FL) isolate of Hibiscus infecting Cilevirus (HiCV) was determined by Sanger sequencing. The movement- and coat- protein gene sequences of the HiCV-FL isolate are more divergent than other genes of the previously sequenced HiCV-HA (Ha...

  3. Proteomics on the rims; insights into the biology of the nuclear envelope and flagellar pocket of trypanosomes

    PubMed Central

    Field, Mark C.; Adung’a, Vincent; Obado, Samson; Chait, Brian T.; Rout, Michael P.

    2014-01-01

    SUMMARY Trypanosomatids represent the causative agents of major diseases in humans, livestock and plants, with inevitable suffering and economic hardship as a result. They are also evolutionarily highly divergent organisms, and the many unique aspects of trypanosome biology provide opportunities in terms of identification of drug targets, the challenge of exploiting these putative targets, and at the same time significant scope for exploration of novel and divergent cell biology. We can estimate from genome sequences that the degree of divergence of trypanosomes from animals and fungi is extreme, with perhaps one third to one half of predicted trypanosome proteins having no known function based on homology or recognizable protein domains/architecture. Two highly important aspects of trypanosome biology are the flagellar pocket and the nuclear envelope, where in silico analysis clearly suggests great potential divergence in the proteome. The flagellar pocket is the sole site of endo- and exocytosis in trypanosomes and plays important roles in immune evasion via variant surface glycoprotein (VSG) trafficking and providing a location for sequestration of various invariant receptors. The trypanosome nuclear envelope has been largely unexplored, but by analogy with higher eukaryotes, roles in the regulation of chromatin and most significantly, in controlling VSG gene expression are expected. Here we discuss recent successful proteomics-based approaches towards characterization of the nuclear envelope and the endocytic apparatus, the identification of conserved and novel trypanosomatid-specific features, and the implications of these findings. PMID:22309600

  4. Estimating Divergence Dates and Substitution Rates in the Drosophila Phylogeny

    PubMed Central

    Obbard, Darren J.; Maclennan, John; Kim, Kang-Wook; Rambaut, Andrew; O’Grady, Patrick M.; Jiggins, Francis M.

    2012-01-01

    An absolute timescale for evolution is essential if we are to associate evolutionary phenomena, such as adaptation or speciation, with potential causes, such as geological activity or climatic change. Timescales in most phylogenetic studies use geologically dated fossils or phylogeographic events as calibration points, but more recently, it has also become possible to use experimentally derived estimates of the mutation rate as a proxy for substitution rates. The large radiation of drosophilid taxa endemic to the Hawaiian islands has provided multiple calibration points for the Drosophila phylogeny, thanks to the "conveyor belt" process by which this archipelago forms and is colonized by species. However, published date estimates for key nodes in the Drosophila phylogeny vary widely, and many are based on simplistic models of colonization and coalescence or on estimates of island age that are not current. In this study, we use new sequence data from seven species of Hawaiian Drosophila to examine a range of explicit coalescent models and estimate substitution rates. We use these rates, along with a published experimentally determined mutation rate, to date key events in drosophilid evolution. Surprisingly, our estimate for the date for the most recent common ancestor of the genus Drosophila based on mutation rate (25–40 Ma) is closer to being compatible with independent fossil-derived dates (20–50 Ma) than are most of the Hawaiian-calibration models and also has smaller uncertainty. We find that Hawaiian-calibrated dates are extremely sensitive to model choice and give rise to point estimates that range between 26 and 192 Ma, depending on the details of the model. Potential problems with the Hawaiian calibration may arise from systematic variation in the molecular clock due to the long generation time of Hawaiian Drosophila compared with other Drosophila and/or uncertainty in linking island formation dates with colonization dates. As either source of error will bias estimates of divergence time, we suggest mutation rate estimates be used until better models are available. PMID:22683811

  5. Active learning for noisy oracle via density power divergence.

    PubMed

    Sogawa, Yasuhiro; Ueno, Tsuyoshi; Kawahara, Yoshinobu; Washio, Takashi

    2013-10-01

    The accuracy of active learning is critically influenced by the existence of noisy labels given by a noisy oracle. In this paper, we propose a novel pool-based active learning framework through robust measures based on density power divergence. By minimizing density power divergence, such as β-divergence and γ-divergence, one can estimate the model accurately even under the existence of noisy labels within data. Accordingly, we develop query selecting measures for pool-based active learning using these divergences. In addition, we propose an evaluation scheme for these measures based on asymptotic statistical analyses, which enables us to perform active learning by evaluating an estimation error directly. Experiments with benchmark datasets and real-world image datasets show that our active learning scheme performs better than several baseline methods. Copyright © 2013 Elsevier Ltd. All rights reserved.

  6. A phylogenetic analysis of the grape genus (Vitis L.) reveals broad reticulation and concurrent diversification during neogene and quaternary climate change

    PubMed Central

    2013-01-01

    Background Grapes are one of the most economically important fruit crops. There are about 60 species in the genus Vitis. The phylogenetic relationships among these species are of keen interest for the conservation and use of this germplasm. We selected 309 accessions from 48 Vitis species, varieties, and outgroups, examined ~11 kb (~3.4 Mb total) of aligned nuclear DNA sequences from 27 unlinked genes in a phylogenetic context, and estimated divergence times based on fossil calibrations. Results Vitis formed a strongly supported clade. There was substantial support for species and less for the higher-level groupings (series). As estimated from extant taxa, the crown age of Vitis was 28 Ma and the divergence of subgenera (Vitis and Muscadinia) occurred at ~18 Ma. Higher clades in subgenus Vitis diverged 16 – 5 Ma with overlapping confidence intervals, and ongoing divergence formed extant species at 12 – 1.3 Ma. Several species had species-specific SNPs. NeighborNet analysis showed extensive reticulation at the core of subgenus Vitis representing the deeper nodes, with extensive reticulation radiating outward. Fitch Parsimony identified North America as the origin of the most recent common ancestor of extant Vitis species. Conclusions Phylogenetic patterns suggested origination of the genus in North America, fragmentation of an ancestral range during the Miocene, formation of extant species in the late Miocene-Pleistocene, and differentiation of species in the context of Pliocene-Quaternary tectonic and climatic change. Nuclear SNPs effectively resolved relationships at and below the species level in grapes and rectified several misclassifications of accessions in the repositories. Our results challenge current higher-level classifications, reveal the abundance of genetic diversity in the genus that is potentially available for crop improvement, and provide a valuable resource for species delineation, germplasm conservation and use. PMID:23826735

  7. Application of Johnson et al.'s speciation threshold model to apparent colonization times of island biotas.

    PubMed

    Ricklefs, Robert E; Bermingham, Eldredge

    2004-08-01

    Understanding patterns of diversity can be furthered by analysis of the dynamics of colonization, speciation, and extinction on islands using historical information provided by molecular phylogeography. The land birds of the Lesser Antilles are one of the most thoroughly described regional faunas in this context. In an analysis of colonization times, Ricklefs and Bermingham (2001) found that the cumulative distribution of lineages with respect to increasing time since colonization exhibits a striking change in slope at a genetic distance of about 2% mitochondrial DNA sequence divergence (about one million years). They further showed how this heterogeneity could be explained by either an abrupt increase in colonization rates or a mass extinction event. Cherry et al. (2002), referring to a model developed by Johnson et al. (2000), argued instead that the pattern resulted from a speciation threshold for reproductive isolation of island populations from their continental source populations. Prior to this threshold, genetic divergence is slowed by migration from the source, and species of varying age accumulate at a low genetic distance. After the threshold is reached, source and island populations diverge more rapidly, creating heterogeneity in the distribution of apparent ages of island taxa. We simulated Johnson et al.'s speciation-threshold model, incorporating genetic divergence at rate k and fixation at rate M of genes that have migrated between the source and the island population. Fixation resets the divergence clock to zero. The speciation-threshold model fits the distribution of divergence times of Lesser Antillean birds well with biologically plausible parameter estimates. Application of the model to the Hawaiian avifauna, which does not exhibit marked heterogeneity of genetic divergence, and the West Indian herpetofauna, which does, required unreasonably high migration-fixation rates, several orders of magnitude greater than the colonization rate. 
However, the plausibility of the speciation-divergence model for Lesser Antillean birds emphasizes the importance of further investigation of historical biogeography on a regional scale for whole biotas, as well as the migration of genes between populations on long time scales and the achievement of reproductive isolation.
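
The clock-resetting mechanism described in this abstract can be illustrated with a minimal sketch. It is not the authors' simulation: the function name, the exponential waiting times between fixation events, and the parameter values are assumptions made for illustration. Only the ingredients named in the abstract are used: divergence at rate k, fixation at rate M that resets divergence to zero, and a threshold beyond which resets no longer occur.

```python
import random


def apparent_divergence(true_age, k, M, threshold, rng):
    """Divergence observed today for a lineage that colonized `true_age` ago.

    Assumptions (illustrative, not from the source): fixation events arrive
    as a Poisson process with rate M, and divergence grows linearly at rate k.
    """
    if M <= 0:
        # No migration-fixation at all: the clock is never reset.
        return k * true_age
    time_to_threshold = threshold / k  # uninterrupted time needed to reach the threshold
    t = 0.0           # time elapsed since colonization
    last_reset = 0.0  # time of the most recent fixation (clock reset)
    while True:
        wait = rng.expovariate(M)  # waiting time until the next fixation event
        if wait >= time_to_threshold:
            break  # threshold reached first: no further resets
        if t + wait >= true_age:
            break  # the next fixation would fall after the present
        t += wait
        last_reset = t
    return k * (true_age - last_reset)


rng = random.Random(1)
# Hypothetical values: k = 0.02 per Myr and a 2% threshold echo the
# "about 2% ... about one million years" figure quoted in the abstract.
sample = [apparent_divergence(5.0, 0.02, 2.0, 0.02, rng) for _ in range(5)]
```

With a high fixation rate relative to the time needed to reach the threshold, most simulated lineages show a divergence well below k times their true age, which is the "apparent age" effect the abstract refers to.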

  8. Genome Size, Molecular Phylogeny, and Evolutionary History of the Tribe Aquilarieae (Thymelaeaceae), the Natural Source of Agarwood

    PubMed Central

    Farah, Azman H.; Lee, Shiou Yih; Gao, Zhihui; Yao, Tze Leong; Madon, Maria; Mohamed, Rozi

    2018-01-01

    The tribe Aquilarieae of the family Thymelaeaceae consists of two genera, Aquilaria and Gyrinops, with a total of 30 species, distributed from northeast India, through southeast Asia and the south of China, to Papua New Guinea. They are an important botanical resource for fragrant agarwood, a prized product derived from injured or infected stems of these species. The aim of this study was to estimate the genome size of selected Aquilaria species and comprehend the evolutionary history of Aquilarieae speciation through molecular phylogeny. Five non-coding chloroplast DNA regions and a nuclear region were sequenced from 12 Aquilaria and three Gyrinops species. Phylogenetic trees constructed using combined chloroplast DNA sequences revealed relationships of the studied 15 members in Aquilarieae, while nuclear ribosomal DNA internal transcribed spacer (ITS) sequences showed a paraphyletic relationship between Aquilaria species from Indochina and Malesian. We exposed, for the first time, the estimated divergence time for Aquilarieae speciation, which was speculated to happen during the Miocene Epoch. The ancestral split and biogeographic pattern of studied species were discussed. Results showed no large variation in the 2C-values for the five Aquilaria species (1.35–2.23 pg). Further investigation into the genome size may provide additional information regarding ancestral traits and its evolution history. PMID:29896211

  9. Miniprimer PCR, a New Lens for Viewing the Microbial World

    PubMed Central

    Isenbarger, Thomas A.; Finney, Michael; Ríos-Velázquez, Carlos; Handelsman, Jo; Ruvkun, Gary

    2008-01-01

    Molecular methods based on the 16S rRNA gene sequence are used widely in microbial ecology to reveal the diversity of microbial populations in environmental samples. Here we show that a new PCR method using an engineered polymerase and 10-nucleotide “miniprimers” expands the scope of detectable sequences beyond those detected by standard methods using longer primers and Taq polymerase. After testing the method in silico to identify divergent ribosomal genes in previously cloned environmental sequences, we applied the method to soil and microbial mat samples, which revealed novel 16S rRNA gene sequences that would not have been detected with standard primers. Deeply divergent sequences were discovered with high frequency and included representatives that define two new division-level taxa, designated CR1 and CR2, suggesting that miniprimer PCR may reveal new dimensions of microbial diversity. PMID:18083877

  10. Candida ficus sp. nov., a novel yeast species from the gut of Apriona germari larvae.

    PubMed

    Hui, Feng-Li; Niu, Qiu-Hong; Ke, Tao; Liu, Zheng

    2012-11-01

    A novel yeast species is described based on three strains from the gut of wood-boring larvae collected in a tree trunk of Ficus carica cultivated in parks near Nanyang, central China. Phylogenetic analysis based on sequences of the D1/D2 domains of the large subunit rRNA gene showed that these strains occurred in a separate clade that was genetically distinct from all known ascomycetous yeasts. In terms of pairwise sequence divergence, the novel strains differed by 15.3% divergence from the type strain of Pichia terricola, and by 15.8% divergence from the type strains of Pichia exigua and Candida rugopelliculosa in the D1/D2 domains. All three are ascomycetous yeasts in the Pichia clade. Unlike P. terricola, P. exigua and C. rugopelliculosa, the novel isolates did not ferment glucose. The name Candida ficus sp. nov. is proposed to accommodate these highly divergent organisms, with STN-8(T) (=CICC 1980(T)=CBS 12638(T)) as the type strain.
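
Percent pairwise sequence divergence of the kind quoted above (for example 15.3% in the D1/D2 domains) is commonly computed as the proportion of differing sites in a pairwise alignment. The sketch below assumes two pre-aligned sequences of equal length and skips gapped or ambiguous positions; the function name is hypothetical, and the abstract does not state which exact method the authors used.

```python
def p_distance(seq_a: str, seq_b: str) -> float:
    """Proportion of differing sites between two aligned DNA sequences.

    Positions where either sequence has a gap or an ambiguous base are skipped.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    diffs = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue  # ignore gaps and ambiguity codes
        compared += 1
        if a != b:
            diffs += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return diffs / compared


# Toy alignment (made-up sequences): 1 difference over 10 comparable sites.
percent = 100 * p_distance("ACGTACGTAC", "ACGTACGTAA")
```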

  11. Single sample resolution of rare microbial dark matter in a marine invertebrate metagenome

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Miller, Ian J.; Weyna, Theodore R.; Fong, Stephen S.

    Direct, untargeted sequencing of environmental samples (metagenomics) and de novo genome assembly enable the study of uncultured and phylogenetically divergent organisms. However, separating individual genomes from a mixed community has often relied on the differential-coverage analysis of multiple, deeply sequenced samples. In the metagenomic investigation of the marine bryozoan Bugula neritina, we uncovered seven bacterial genomes associated with a single B. neritina individual that appeared to be transient associates, two of which were unique to one individual and undetectable using certain “universal” 16S rRNA primers and probes. We recovered high quality genome assemblies for several rare instances of “microbial dark matter,” or phylogenetically divergent bacteria lacking genomes in reference databases, from a single tissue sample that was not subjected to any physical or chemical pre-treatment. One of these rare, divergent organisms has a small (593 kbp), poorly annotated genome with low GC content (20.9%) and a 16S rRNA gene with just 65% sequence similarity to the closest reference sequence. Lastly, our findings illustrate the importance of sampling strategy and de novo assembly of metagenomic reads to understand the extent and function of bacterial biodiversity.

  12. Single sample resolution of rare microbial dark matter in a marine invertebrate metagenome

    DOE PAGES

    Miller, Ian J.; Weyna, Theodore R.; Fong, Stephen S.; ...

    2016-09-29

    Direct, untargeted sequencing of environmental samples (metagenomics) and de novo genome assembly enable the study of uncultured and phylogenetically divergent organisms. However, separating individual genomes from a mixed community has often relied on the differential-coverage analysis of multiple, deeply sequenced samples. In the metagenomic investigation of the marine bryozoan Bugula neritina, we uncovered seven bacterial genomes associated with a single B. neritina individual that appeared to be transient associates, two of which were unique to one individual and undetectable using certain “universal” 16S rRNA primers and probes. We recovered high quality genome assemblies for several rare instances of “microbial dark matter,” or phylogenetically divergent bacteria lacking genomes in reference databases, from a single tissue sample that was not subjected to any physical or chemical pre-treatment. One of these rare, divergent organisms has a small (593 kbp), poorly annotated genome with low GC content (20.9%) and a 16S rRNA gene with just 65% sequence similarity to the closest reference sequence. Lastly, our findings illustrate the importance of sampling strategy and de novo assembly of metagenomic reads to understand the extent and function of bacterial biodiversity.

  13. Two new anamorphic yeasts species, Cyberlindnera samutprakarnensis sp. nov. and Candida thasaenensis sp. nov., isolated from industrial wastes in Thailand.

    PubMed

    Poomtien, Jamroonsri; Jindamorakot, Sasitorn; Limtong, Savitree; Pinphanichakarn, Pairoh; Thaniyavarn, Jiraporn

    2013-01-01

    Three yeast strains were isolated from industrial wastes in Thailand. Based on the phylogenetic sequence analysis of the D1/D2 region of the large subunit rRNA gene, the internal transcribed spacer (ITS1-5.8S rRNA gene-ITS2; ITS1-2) region, and their physiological characteristics, the three strains were found to represent two novel species of the ascomycetous anamorphic yeast. Strain JP52(T) represents a novel species, which was named Cyberlindnera samutprakarnensis sp. nov. (type strain JP52(T); = BCC 46825(T) = JCM 17816(T) = CBS 12528(T), MycoBank no. MB800879), which was differentiated from the closely related species Cyberlindnera mengyuniae CBS 10845(T) by 2.9 % sequence divergence in the D1/D2 region and 4.4 % sequence divergence in the ITS1-2. Strain JP59(T) and JP60 were identical in their D1/D2 and ITS1-2 regions, which were closely related to those of Scheffersomyces spartinae CBS 6059(T) by 0.9 and 1.0 % sequence divergence, respectively. In addition, supportive evidence of actin gene and translational elongation factor gene by sequence divergence of 6.5 % each confirmed their distinct status. Furthermore, JP59(T) and JP60 differed from the closely related species in some biochemical and physiological characteristics. These two strains were assigned as a single novel species which was named Candida thasaenensis sp. nov. (type JP59(T) = BCC 46828(T) = JCM 17817(T) = CBS 12529(T), MycoBank no. MB800880).

  14. Detecting exact breakpoints of deletions with diversity in hepatitis B viral genomic DNA from next-generation sequencing data.

    PubMed

    Cheng, Ji-Hong; Liu, Wen-Chun; Chang, Ting-Tsung; Hsieh, Sun-Yuan; Tseng, Vincent S

    2017-10-01

    Many studies have suggested that deletions of Hepatitis B Viral (HBV) are associated with the development of progressive liver diseases, even ultimately resulting in hepatocellular carcinoma (HCC). Among the methods for detecting deletions from next-generation sequencing (NGS) data, few methods considered the characteristics of virus, such as high evolution rates and high divergence among the different HBV genomes. Sequencing high divergence HBV genome sequences using the NGS technology outputs millions of reads. Thus, detecting exact breakpoints of deletions from these big and complex data incurs very high computational cost. We proposed a novel analytical method named VirDelect (Virus Deletion Detect), which uses split read alignment base to detect exact breakpoint and diversity variable to consider high divergence in single-end reads data, such that the computational cost can be reduced without losing accuracy. We use four simulated reads datasets and two real pair-end reads datasets of HBV genome sequence to verify VirDelect accuracy by score functions. The experimental results show that VirDelect outperforms the state-of-the-art method Pindel in terms of accuracy score for all simulated datasets and VirDelect had only two base errors even in real datasets. VirDelect is also shown to deliver high accuracy in analyzing the single-end read data as well as pair-end data. VirDelect can serve as an effective and efficient bioinformatics tool for physiologists with high accuracy and efficient performance and applicable to further analysis with characteristics similar to HBV on genome length and high divergence. The software program of VirDelect can be downloaded at https://sourceforge.net/projects/virdelect/. Copyright © 2017. Published by Elsevier Inc.

  15. The evolutionary implications of knox-I gene duplications in conifers: correlated evidence from phylogeny, gene mapping, and analysis of functional divergence.

    PubMed

    Guillet-Claude, Carine; Isabel, Nathalie; Pelgas, Betty; Bousquet, Jean

    2004-12-01

    Class I knox genes code for transcription factors that play an essential role in plant growth and development as central regulators of meristem cell identity. Based on the analysis of new cDNA sequences from various tissues and genomic DNA sequences, we identified a highly diversified group of class I knox genes in conifers. Phylogenetic analyses of complete amino acid sequences from various seed plants indicated that all conifer sequences formed a monophyletic group. Within conifers, four subgroups here named genes KN1 to KN4 were well delineated, each regrouping pine and spruce sequences. KN4 was sister group to KN3, which was sister group to KN1 and KN2. Genetic mapping on the genomes of two divergent Picea species indicated that KN1 and KN2 are located close to each other on the same linkage group, whereas KN3 and KN4 mapped on different linkage groups, correlating the more ancient divergence of these two genes. The proportion of synonymous and nonsynonymous substitutions suggested intense purifying selection for the four genes. However, rates of substitution per year indicated an evolution in two steps: faster rates were noted after gene duplications, followed subsequently by lower rates. Positive directional selection was detected for most of the internal branches harboring an accelerated rate of evolution. In addition, many sites with highly significant amino acid rate shift were identified between these branches. However, the tightly linked KN1 and KN2 did not diverge as much from each other. The implications of the correlation between phylogenetic, structural, and functional information are discussed in relation to the diversification of the knox-I gene family in conifers.

  16. Resolving Recent Plant Radiations: Power and Robustness of Genotyping-by-Sequencing.

    PubMed

    Fernández-Mazuecos, Mario; Mellers, Greg; Vigalondo, Beatriz; Sáez, Llorenç; Vargas, Pablo; Glover, Beverley J

    2018-03-01

    Disentangling species boundaries and phylogenetic relationships within recent evolutionary radiations is a challenge due to the poor morphological differentiation and low genetic divergence between species, frequently accompanied by phenotypic convergence, interspecific gene flow and incomplete lineage sorting. Here we employed a genotyping-by-sequencing (GBS) approach, in combination with morphometric analyses, to investigate a small western Mediterranean clade in the flowering plant genus Linaria that radiated in the Quaternary. After confirming the morphological and genetic distinctness of eight species, we evaluated the relative performances of concatenation and coalescent methods to resolve phylogenetic relationships. Specifically, we focused on assessing the robustness of both approaches to variations in the parameter used to estimate sequence homology (clustering threshold). Concatenation analyses suffered from strong systematic bias, as revealed by the high statistical support for multiple alternative topologies depending on clustering threshold values. By contrast, topologies produced by two coalescent-based methods (NJst, SVDquartets) were robust to variations in the clustering threshold. Reticulate evolution may partly explain incongruences between NJst, SVDquartets and concatenated trees. Integration of morphometric and coalescent-based phylogenetic results revealed (i) extensive morphological divergence associated with recent splits between geographically close or sympatric sister species and (ii) morphological convergence in geographically disjunct species. These patterns are particularly true for floral traits related to pollinator specialization, including nectar spur length, tube width and corolla color, suggesting pollinator-driven diversification. 
Given its relatively simple and inexpensive implementation, GBS is a promising technique for the phylogenetic and systematic study of recent radiations, but care must be taken to evaluate the robustness of results to variation of data assembly parameters.

  17. Phylogeny and temporal diversification of darters (Percidae: Etheostomatinae).

    PubMed

    Near, Thomas J; Bossu, Christen M; Bradburd, Gideon S; Carlson, Rose L; Harrington, Richard C; Hollingsworth, Phillip R; Keck, Benjamin P; Etnier, David A

    2011-10-01

    Discussions aimed at resolution of the Tree of Life are most often focused on the interrelationships of major organismal lineages. In this study, we focus on the resolution of some of the most apical branches in the Tree of Life through exploration of the phylogenetic relationships of darters, a species-rich clade of North American freshwater fishes. With a near-complete taxon sampling of close to 250 species, we aim to investigate strategies for efficient multilocus data sampling and the estimation of divergence times using relaxed-clock methods when a clade lacks a fossil record. Our phylogenetic data set comprises a single mitochondrial DNA (mtDNA) gene and two nuclear genes sampled from 245 of the 248 darter species. This dense sampling allows us to determine if a modest amount of nuclear DNA sequence data can resolve relationships among closely related animal species. Darters lack a fossil record to provide age calibration priors in relaxed-clock analyses. Therefore, we use a near-complete species-sampled phylogeny of the perciform clade Centrarchidae, which has a rich fossil record, to assess two distinct strategies of external calibration in relaxed-clock divergence time estimates of darters: using ages inferred from the fossil record and molecular evolutionary rate estimates. Comparison of Bayesian phylogenies inferred from mtDNA and nuclear genes reveals that heterospecific mtDNA is present in approximately 12.5% of all darter species. 
We identify three patterns of mtDNA introgression in darters: proximal mtDNA transfer, which involves the transfer of mtDNA among extant and sympatric darter species, indeterminate introgression, which involves the transfer of mtDNA from a lineage that cannot be confidently identified because the introgressed haplotypes are not clearly referable to mtDNA haplotypes in any recognized species, and deep introgression, which is characterized by species diversification within a recipient clade subsequent to the transfer of heterospecific mtDNA. The results of our analyses indicate that DNA sequences sampled from single-copy nuclear genes can provide appreciable phylogenetic resolution for closely related animal species. A well-resolved near-complete species-sampled phylogeny of darters was estimated with Bayesian methods using a concatenated mtDNA and nuclear gene data set with all identified heterospecific mtDNA haplotypes treated as missing data. The relaxed-clock analyses resulted in very similar posterior age estimates across the three sampled genes and methods of calibration and therefore offer a viable strategy for estimating divergence times for clades that lack a fossil record. In addition, an informative rank-free clade-based classification of darters that preserves the rich history of nomenclature in the group and provides formal taxonomic communication of darter clades was constructed using the mtDNA and nuclear gene phylogeny. On the whole, the appeal of mtDNA for phylogeny inference among closely related animal species is diminished by the observations of extensive mtDNA introgression and by finding appreciable phylogenetic signal in a modest sampling of nuclear genes in our phylogenetic analyses of darters.

  18. The complete chloroplast genome sequence of the medicinal plant Salvia miltiorrhiza.

    PubMed

    Qian, Jun; Song, Jingyuan; Gao, Huanhuan; Zhu, Yingjie; Xu, Jiang; Pang, Xiaohui; Yao, Hui; Sun, Chao; Li, Xian'en; Li, Chuyuan; Liu, Juyan; Xu, Haibin; Chen, Shilin

    2013-01-01

    Salvia miltiorrhiza is an important medicinal plant with great economic and medicinal value. The complete chloroplast (cp) genome sequence of Salvia miltiorrhiza, the first sequenced member of the Lamiaceae family, is reported here. The genome is 151,328 bp in length and exhibits a typical quadripartite structure of the large (LSC, 82,695 bp) and small (SSC, 17,555 bp) single-copy regions, separated by a pair of inverted repeats (IRs, 25,539 bp). It contains 114 unique genes, including 80 protein-coding genes, 30 tRNAs and four rRNAs. The genome structure, gene order, GC content and codon usage are similar to the typical angiosperm cp genomes. Four forward, three inverted and seven tandem repeats were detected in the Salvia miltiorrhiza cp genome. Simple sequence repeat (SSR) analysis among the 30 asterid cp genomes revealed that most SSRs are AT-rich, which contribute to the overall AT richness of these cp genomes. Additionally, fewer SSRs are distributed in the protein-coding sequences compared to the non-coding regions, indicating an uneven distribution of SSRs within the cp genomes. Entire cp genome comparison of Salvia miltiorrhiza and three other Lamiales cp genomes showed a high degree of sequence similarity and a relatively high divergence of intergenic spacers. Sequence divergence analysis discovered the ten most divergent and ten most conserved genes as well as their length variation, which will be helpful for phylogenetic studies in asterids. Our analysis also supports that both regional and functional constraints affect gene sequence evolution. Further, phylogenetic analysis demonstrated a sister relationship between Salvia miltiorrhiza and Sesamum indicum. The complete cp genome sequence of Salvia miltiorrhiza reported in this paper will facilitate population, phylogenetic and cp genetic engineering studies of this medicinal plant.

  19. Drift-driven evolution of electric signals in a Neotropical knifefish.

    PubMed

    Picq, Sophie; Alda, Fernando; Bermingham, Eldredge; Krahe, Rüdiger

    2016-09-01

    Communication signals are highly diverse traits. This diversity is usually assumed to be shaped by selective forces, whereas the null hypothesis of divergence through drift is often not considered. In Panama, the weakly electric fish Brachyhypopomus occidentalis is widely distributed in multiple independent drainage systems, which provide a natural evolutionary laboratory for the study of genetic and signal divergence in separate populations. We quantified geographic variation in the electric signals of 109 fish from five populations, and compared it to the neutral genetic variation estimated from cytochrome oxidase I (COI) sequences of the same individuals, to test whether drift may be driving divergence of their signals. Signal distances were highly correlated with genetic distances, even after controlling for geographic distances, suggesting that drift alone is sufficient to explain geographic variation in electric signals. Significant differences at smaller geographic scales (within drainages) showed, however, that electric signals may evolve at a faster rate than expected under drift, raising the possibility that additional adaptive forces may be contributing to their evolution. Overall, our data point to stochastic forces as main drivers of signal evolution in this species and extend the role of drift in the evolution of communication systems to fish and electrocommunication. © 2016 The Author(s). Evolution © 2016 The Society for the Study of Evolution.

  20. Phylogenetic analysis of the GST family in Anopheles (Nyssorhynchus) darlingi.

    PubMed

    Azevedo-Júnior, Gilson Martins de; Guimarães-Marques, Giselle Moura; Cegatti Bridi, Leticia; Christine Ohse, Ketlen; Vicentini, Renato; Tadei, Wanderli; Rafael, Míriam Silva

    2014-08-01

    Anopheles darlingi Root, 1926 and Anopheles gambiae (Diptera: Culicidae) are the most important human malaria vectors in South America and Africa, respectively. The two species are estimated to have diverged 100 million years ago. Studies on the phylogenetics and evolution of gene sequences, such as glutathione S-transferase (GST) in disease-transmitting mosquitoes are scarce. The sigma class GST (KC890767) from the transcriptome of An. darlingi captured in the Brazilian Amazon was studied by in silico hybridization, and mapped to chromosome 3 of An. gambiae. The sigma class GST of An. darlingi was used for phylogenetic analyses to understand the GST base composition of the most recent common ancestor between An. darlingi, Anopheles gambiae, Aedes aegypti and Culex quinquefasciatus. The GST (KC890767) of An. darlingi was studied to generate the main divergence branches using Neighbor-Joining and bootstrapping approaches to confirm confidence levels on the tree nodes that separate An. darlingi and other mosquito species. The results showed divergence between An. gambiae, Ae. aegypti, Cx. quinquefasciatus, and Phlebotomus papatasi as outgroup, and the homology relationship between sigma class GST of An. darlingi and GSTS1_1 gene of An. gambiae was valuable for phylogenetic and evolutionary studies. Copyright © 2014 Elsevier B.V. All rights reserved.

  1. Divergence of Lutzomyia (Psathyromyia) shannoni (Diptera: Psychodidae: Phlebotominae) is indicated by morphometric and molecular analyses when examined between taxa from the southeastern United States and southern Mexico.

    PubMed

    Florin, David A; Rebollar-Téllez, Eduardo A

    2013-11-01

    The medically important sand fly Lutzomyia shannoni (Dyar 1929) was collected at eight different sites: seven within the southeastern United States and one in the state of Quintana Roo, Mexico. A canonical discriminant analysis was conducted on 40 female L. shannoni specimens from each of the eight collection sites (n = 320) using 49 morphological characters. Four L. shannoni specimens from each of the eight collection sites (n = 32) were sent to the Barcode of Life Data systems where a 654-base pair segment of the cytochrome c oxidase subunit 1 (CO1) genetic marker was sequenced from each sand fly. Phylogeny estimation based on the COI segments, in addition to genetic distance, divergence, and differentiation values were calculated. Results of both the morphometric and molecular analyses indicate that the species has undergone divergence when examined between the taxa of the United States and Quintana Roo, Mexico. Although purely speculative, the arid or semiarid expanse from southern Texas to Mexico City could be an allopatric barrier that has impeded migration and hence gene flow, resulting in different morphology and genetic makeup between the two purported populations. A high degree of intragroup variability was noted in the Quintana Roo sand flies.

  2. Ancient wolf genome reveals an early divergence of domestic dog ancestors and admixture into high-latitude breeds.

    PubMed

    Skoglund, Pontus; Ersmark, Erik; Palkopoulou, Eleftheria; Dalén, Love

    2015-06-01

    The origin of domestic dogs is poorly understood [1-15], with suggested evidence of dog-like features in fossils that predate the Last Glacial Maximum [6, 9, 10, 14, 16] conflicting with genetic estimates of a more recent divergence between dogs and worldwide wolf populations [13, 15, 17-19]. Here, we present a draft genome sequence from a 35,000-year-old wolf from the Taimyr Peninsula in northern Siberia. We find that this individual belonged to a population that diverged from the common ancestor of present-day wolves and dogs very close in time to the appearance of the domestic dog lineage. We use the directly dated ancient wolf genome to recalibrate the molecular timescale of wolves and dogs and find that the mutation rate is substantially slower than assumed by most previous studies, suggesting that the ancestors of dogs were separated from present-day wolves before the Last Glacial Maximum. We also find evidence of introgression from the archaic Taimyr wolf lineage into present-day dog breeds from northeast Siberia and Greenland, contributing between 1.4% and 27.3% of their ancestry. This demonstrates that the ancestry of present-day dogs is derived from multiple regional wolf populations. Copyright © 2015 Elsevier Ltd. All rights reserved.

  3. Phylogeography and population genetic structure of double-crested cormorants (Phalacrocorax auritus)

    USGS Publications Warehouse

    Mercer, Dacey; Haig, Susan M.; Roby, Daniel D.

    2013-01-01

    is genetically divergent from other populations in North America (net sequence divergence = 5.85%; ΦST for mitochondrial control region = 0.708; FST for microsatellite loci = 0.052). Historical records, contemporary population estimates, and field observations are consistent with recognition of the Alaskan subspecies as distinct and potentially of conservation interest. Our data also indicated the presence of another divergent lineage, associated with the southwestern portion of the species range, as evidenced by highly unique haplotypes sampled in southern California. In contrast, there was little support for recognition of subspecies within the conterminous U.S. and Canada. Rather than genetically distinct regions corresponding to the putative subspecies [P. a. albociliatus (Pacific), P. a. auritus (Interior and North Atlantic), and P. a. floridanus (Southeast)], we observed a distribution of genetic variation consistent with a pattern of isolation by distance. This pattern implies that genetic differences across the range are due to geographic distance, rather than discrete subspecific breaks. Although three of the four traditional subspecies were not genetically distinct, possible demographic separation, habitat differences, and documented declines at some colonies within the regions suggest that the Pacific and possibly North Atlantic portions of the breeding range may warrant differential consideration from the Interior and Southeast breeding regions.
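
    The "net sequence divergence" reported above is commonly defined as the mean between-population distance minus the average of the two within-population means, d_A = d_XY - (d_X + d_Y)/2. The abstract does not say which estimator or distance correction the authors used, so the following is only a minimal sketch that assumes uncorrected p-distances and aligned, equal-length sequences; all function names and the toy sequences are hypothetical.

```python
from itertools import combinations, product


def p_distance(a, b):
    """Proportion of aligned sites that differ (uncorrected p-distance)."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def mean_within(pop):
    """Mean pairwise distance among sequences of one population (d_X)."""
    pairs = list(combinations(pop, 2))
    if not pairs:
        return 0.0  # a single sequence carries no within-population signal
    return sum(p_distance(a, b) for a, b in pairs) / len(pairs)


def mean_between(pop_x, pop_y):
    """Mean distance over all cross-population pairs (d_XY)."""
    pairs = list(product(pop_x, pop_y))
    return sum(p_distance(a, b) for a, b in pairs) / len(pairs)


def net_divergence(pop_x, pop_y):
    """Net divergence d_A = d_XY - (d_X + d_Y) / 2."""
    within = (mean_within(pop_x) + mean_within(pop_y)) / 2
    return mean_between(pop_x, pop_y) - within


# Toy example with made-up 4-bp haplotypes (not data from the study).
pop_x = ["AAAA", "AAAT"]
pop_y = ["TTAA", "TTAT"]
net = net_divergence(pop_x, pop_y)
```

    Subtracting the within-population term removes the part of the between-population distance that reflects polymorphism already present in the ancestral population, which is why the net value is smaller than the raw between-population mean.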

  4. Evidence of shallow mitochondrial divergence in the slender armorhead, Pentaceros wheeleri (Pisces, Pentacerotidae) from the Emperor Seamount Chain.

    PubMed

    Bae, Seung Eun; Kim, Hanna; Choi, Seok-Gwan; Kim, Jin-Koo

    2018-01-12

    Competitive overexploitation of the slender armorhead, Pentaceros wheeleri, a deep-sea fish inhabiting the Emperor Seamount Chain, caused a serious population decline. It is therefore urgently necessary to clarify the genetic diversity and connectivity of P. wheeleri populations for appropriate stock management. To this end, we compared 677 base pairs (bp) of mitochondrial (mt) DNA control region (CR) sequences of 80 individuals from three seamounts (the Milwaukee, Kinmei, and Koko Seamounts) in the southern part of the Emperor Seamount Chain. Contrary to our expectation, the three seamount populations showed high genetic diversity, either because the effects of the recent population decline are not yet reflected or because two clades are mixed. Analysis of molecular variance indicated no significant genetic differentiation between seamount populations; however, the neighbour-joining tree and minimum spanning network showed significant separation into two clades (K2P distance = 1.2-3.2%, ΦST = 0.5739, p < .05) regardless of seamount. The divergence time between the two clades was estimated to be 0.3-0.8 Mya, during the period of Pleistocene glacial cycles, suggesting that associated environmental changes and the unique life history traits of Pentaceros spp. might have resulted in the initiation of divergence between these clades.
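
    The K2P distance quoted above is the Kimura two-parameter distance, which corrects the observed proportions of transitions (P) and transversions (Q) for multiple hits: d = -(1/2) ln(1 - 2P - Q) - (1/4) ln(1 - 2Q). A minimal sketch follows, assuming two pre-aligned sequences and skipping any site with a gap or ambiguity code; the function name and the toy sequences are hypothetical and not taken from the study.

```python
import math

# Purine<->purine and pyrimidine<->pyrimidine changes are transitions.
TRANSITIONS = {("A", "G"), ("G", "A"), ("C", "T"), ("T", "C")}


def k2p_distance(seq1, seq2):
    """Kimura two-parameter distance between two aligned DNA sequences."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    sites = transitions = transversions = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue  # skip gaps and ambiguity codes
        sites += 1
        if a == b:
            continue
        if (a, b) in TRANSITIONS:
            transitions += 1
        else:
            transversions += 1
    if sites == 0:
        raise ValueError("no comparable sites")
    p = transitions / sites
    q = transversions / sites
    term1 = 1 - 2 * p - q
    term2 = 1 - 2 * q
    if term1 <= 0 or term2 <= 0:
        raise ValueError("sequences too divergent for the K2P correction")
    return -0.5 * math.log(term1) - 0.25 * math.log(term2)


# Toy example: one transition (A->G) in ten aligned sites.
d = k2p_distance("AAAAACCCCC", "GAAAACCCCC")
```

    For closely related sequences the corrected distance is only slightly larger than the raw proportion of differing sites; the correction matters more as divergence grows.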

  5. Population-genomic variation within RNA viruses of the Western honey bee, Apis mellifera, inferred from deep sequencing

    PubMed Central

    2013-01-01

    Background: Deep sequencing of viruses isolated from infected hosts is an efficient way to measure population-genetic variation and can reveal patterns of dispersal and natural selection. In this study, we mined existing Illumina sequence reads to investigate single-nucleotide polymorphisms (SNPs) within two RNA viruses of the Western honey bee (Apis mellifera), deformed wing virus (DWV) and Israel acute paralysis virus (IAPV). All viral RNA was extracted from North American samples of honey bees or, in one case, the ectoparasitic mite Varroa destructor. Results: Coverage depth was generally lower for IAPV than DWV, and marked gaps in coverage occurred in several narrow regions (< 50 bp) of IAPV. These coverage gaps occurred across sequencing runs and were virtually unchanged when reads were re-mapped with greater permissiveness (up to 8% divergence), suggesting a recurrent sequencing artifact rather than strain divergence. Consensus sequences of DWV for each sample showed little phylogenetic divergence, low nucleotide diversity, and strongly negative values of Fu and Li’s D statistic, suggesting a recent population bottleneck and/or purifying selection. The Kakugo strain of DWV fell outside of all other DWV sequences at 100% bootstrap support. IAPV consensus sequences supported the existence of multiple clades as had been previously reported, and Fu and Li’s D was closer to neutral expectation overall, although a sliding-window analysis identified a significantly positive D within the protease region, suggesting selection maintains diversity in that region. Within-sample mean diversity was comparable between the two viruses on average, although for both viruses there was substantial variation among samples in mean diversity at third codon positions and in the number of high-diversity sites. FST values were bimodal for DWV, likely reflecting neutral divergence in two low-diversity populations, whereas IAPV had several sites that were strong outliers with very low FST. Conclusions: This initial survey of genetic variation within honey bee RNA viruses suggests future directions for studies examining the underlying causes of population-genetic structure in these economically important pathogens. PMID:23497218

  6. Population-genomic variation within RNA viruses of the Western honey bee, Apis mellifera, inferred from deep sequencing.

    PubMed

    Cornman, Robert Scott; Boncristiani, Humberto; Dainat, Benjamin; Chen, Yanping; vanEngelsdorp, Dennis; Weaver, Daniel; Evans, Jay D

    2013-03-07

    Deep sequencing of viruses isolated from infected hosts is an efficient way to measure population-genetic variation and can reveal patterns of dispersal and natural selection. In this study, we mined existing Illumina sequence reads to investigate single-nucleotide polymorphisms (SNPs) within two RNA viruses of the Western honey bee (Apis mellifera), deformed wing virus (DWV) and Israel acute paralysis virus (IAPV). All viral RNA was extracted from North American samples of honey bees or, in one case, the ectoparasitic mite Varroa destructor. Coverage depth was generally lower for IAPV than DWV, and marked gaps in coverage occurred in several narrow regions (< 50 bp) of IAPV. These coverage gaps occurred across sequencing runs and were virtually unchanged when reads were re-mapped with greater permissiveness (up to 8% divergence), suggesting a recurrent sequencing artifact rather than strain divergence. Consensus sequences of DWV for each sample showed little phylogenetic divergence, low nucleotide diversity, and strongly negative values of Fu and Li's D statistic, suggesting a recent population bottleneck and/or purifying selection. The Kakugo strain of DWV fell outside of all other DWV sequences at 100% bootstrap support. IAPV consensus sequences supported the existence of multiple clades as had been previously reported, and Fu and Li's D was closer to neutral expectation overall, although a sliding-window analysis identified a significantly positive D within the protease region, suggesting selection maintains diversity in that region. Within-sample mean diversity was comparable between the two viruses on average, although for both viruses there was substantial variation among samples in mean diversity at third codon positions and in the number of high-diversity sites. FST values were bimodal for DWV, likely reflecting neutral divergence in two low-diversity populations, whereas IAPV had several sites that were strong outliers with very low FST. This initial survey of genetic variation within honey bee RNA viruses suggests future directions for studies examining the underlying causes of population-genetic structure in these economically important pathogens.

  7. Complex longitudinal diversification across South China and Vietnam in Stejneger's pit viper, Viridovipera stejnegeri (Schmidt, 1925) (Reptilia: Serpentes: Viperidae).

    PubMed

    Guo, Peng; Liu, Qin; Zhu, Fei; Zhong, Guang H; Chen, Xin; Myers, Edward A; Che, Jing; Zhang, Liang; Ziegler, Thomas; Nguyen, Truong Q; Burbrink, Frank T

    2016-06-01

    Viridovipera stejnegeri is one of the most common pit vipers in Asia, with a wide distribution in southern China and Vietnam. We investigated historical demography and explored how the environment and climatic factors have shaped genetic diversity and the evolutionary history of this venomous snake. A total of 171 samples from 47 localities were sequenced and analysed for two mitochondrial gene fragments and three nuclear genes. Gene trees reveal the existence of two well-supported clades (Southwest China and Southeast China) with seven distinct and strongly supported, geographically structured subclades within V. stejnegeri. Estimation of divergence time and ancestral area suggests that V. stejnegeri originated at ~6.0 Ma in the late Miocene on the Yunnan-Guizhou Plateau. The estimated date of origin and divergence of the island populations of Taiwan and Hainan closely matches the geological origin of both islands. The mtDNA gene tree reveals the presence of west-east diversification in V. stejnegeri populations. Complex orogenesis and heterogeneous habitats, as well as climate-mediated habitat differentiation including glacial cycles, have all influenced population structure and the distribution of this taxon. The validity of V. stejnegeri chenbihuii is questionable, and this subspecies most probably represents an invalid taxon. © 2016 John Wiley & Sons Ltd.

  8. Dynamics and Differential Proliferation of Transposable Elements During the Evolution of the B and A Genomes of Wheat

    PubMed Central

    Charles, Mathieu; Belcram, Harry; Just, Jérémy; Huneau, Cécile; Viollet, Agnès; Couloux, Arnaud; Segurens, Béatrice; Carter, Meredith; Huteau, Virginie; Coriton, Olivier; Appels, Rudi; Samain, Sylvie; Chalhoub, Boulos

    2008-01-01

    Transposable elements (TEs) constitute >80% of the wheat genome but their dynamics and contribution to size variation and evolution of wheat genomes (Triticum and Aegilops species) remain unexplored. In this study, 10 genomic regions have been sequenced from wheat chromosome 3B and used to constitute, along with all publicly available genomic sequences of wheat, 1.98 Mb of sequence (from 13 BAC clones) of the wheat B genome and 3.63 Mb of sequence (from 19 BAC clones) of the wheat A genome. Analysis of TE sequence proportions (as percentages), ratios of complete to truncated copies, and estimation of insertion dates of class I retrotransposons showed that specific types of TEs have undergone waves of differential proliferation in the B and A genomes of wheat. While both genomes show similar rates and relatively ancient proliferation periods for the Athila retrotransposons, the Copia retrotransposons proliferated more recently in the A genome whereas Gypsy retrotransposon proliferation is more recent in the B genome. It was possible to estimate for the first time the proliferation periods of the abundant CACTA class II DNA transposons, relative to that of the three main retrotransposon superfamilies. Proliferation of these TEs started prior to and overlapped with that of the Athila retrotransposons in both genomes. However, they also proliferated during the same periods as Gypsy and Copia retrotransposons in the A genome, but not in the B genome. As estimated from their insertion dates and confirmed by PCR-based tracing analysis, the majority of differential proliferation of TEs in B and A genomes of wheat (87 and 83%, respectively), leading to rapid sequence divergence, occurred prior to the allotetraploidization event that brought them together in Triticum turgidum and Triticum aestivum, <0.5 million years ago. More importantly, the allotetraploidization event appears to have neither enhanced nor repressed retrotranspositions. We discuss the apparent proliferation of TEs as resulting from their insertion, removal, and/or combinations of both evolutionary forces. PMID:18780739

  9. HIV populations are large and accumulate high genetic diversity in a nonlinear fashion.

    PubMed

    Maldarelli, Frank; Kearney, Mary; Palmer, Sarah; Stephens, Robert; Mican, JoAnn; Polis, Michael A; Davey, Richard T; Kovacs, Joseph; Shao, Wei; Rock-Kress, Diane; Metcalf, Julia A; Rehm, Catherine; Greer, Sarah E; Lucey, Daniel L; Danley, Kristen; Alter, Harvey; Mellors, John W; Coffin, John M

    2013-09-01

    HIV infection is characterized by rapid and error-prone viral replication resulting in genetically diverse virus populations. The rate of accumulation of diversity and the mechanisms involved are under intense study to provide useful information to understand immune evasion and the development of drug resistance. To characterize the development of viral diversity after infection, we carried out an in-depth analysis of single genome sequences of HIV pro-pol to assess diversity and divergence and to estimate replicating population sizes in a group of treatment-naive HIV-infected individuals sampled at single (n = 22) or multiple, longitudinal (n = 11) time points. Analysis of single genome sequences revealed nonlinear accumulation of sequence diversity during the course of infection. Diversity accumulated in recently infected individuals at rates 30-fold higher than in patients with chronic infection. Accumulation of synonymous changes accounted for most of the diversity during chronic infection. Accumulation of diversity resulted in population shifts, but the rates of change were low relative to estimated replication cycle times, consistent with relatively large population sizes. Analysis of changes in allele frequencies revealed effective population sizes that are substantially higher than previous estimates of approximately 1,000 infectious particles/infected individual. Taken together, these observations indicate that HIV populations are large, diverse, and slow to change in chronic infection and that the emergence of new mutations, including drug resistance mutations, is governed by both selection forces and drift.

  10. Tracing the colonization history of the Indian Ocean scops-owls (Strigiformes: Otus) with further insight into the spatio-temporal origin of the Malagasy avifauna.

    PubMed

    Fuchs, Jérôme; Pons, Jean-Marc; Goodman, Steven M; Bretagnolle, Vincent; Melo, Martim; Bowie, Rauri C K; Currie, David; Safford, Roger; Virani, Munir Z; Thomsett, Simon; Hija, Alawi; Cruaud, Corinne; Pasquet, Eric

    2008-07-09

    The island of Madagascar and surrounding volcanic and coralline islands are considered to form a biodiversity hotspot with large numbers of unique taxa. The origin of this endemic fauna can be explained by two different factors: vicariance or over-water-dispersal. Deciphering which factor explains the current distributional pattern of a given taxonomic group requires robust phylogenies as well as estimates of divergence times. The lineage of Indian Ocean scops-owls (Otus: Strigidae) includes six or seven species that are endemic to Madagascar and portions of the Comoros and Seychelles archipelagos; little is known about the species limits, biogeographic affinities and relationships to each other. In the present study, using DNA sequence data gathered from six loci, we examine the biogeographic history of the Indian Ocean scops-owls. We also compare the pattern and timing of colonization of the Indian Ocean islands by scops-owls with divergence times already proposed for other bird taxa. Our analyses revealed that Indian Ocean islands scops-owls do not form a monophyletic assemblage: the Seychelles Otus insularis is genetically closer to the South-East Asian endemic O. sunia than to species from the Comoros and Madagascar. The Pemba Scops-owl O. pembaensis, often considered closely related to, if not conspecific with O. rutilus of Madagascar, is instead closely related to the African mainland O. senegalensis. Relationships among the Indian Ocean taxa from the Comoros and Madagascar are unresolved, despite the analysis of over 4000 bp, suggesting a diversification burst after the initial colonization event. We also highlight one case of putative back-colonization to the Asian mainland from an island ancestor (O. sunia). Our divergence date estimates, using a Bayesian relaxed clock method, suggest that all these events occurred during the last 3.6 myr; albeit colonization of the Indian Ocean islands was not synchronous, O. pembaensis diverged from O. senegalensis about 1.7 mya while species from Madagascar and the Comoros diverged from their continental sister-group about 3.6 mya. We highlight that our estimates coincide with estimates of diversification from other bird lineages. Our analyses revealed the occurrence of multiple synchronous colonization events of the Indian Ocean islands by scops-owls, at a time when faunistic exchanges involving Madagascar were common as a result of lowered sea-level that would have allowed the formation of stepping-stone islands. Patterns of diversification that emerged from the scops-owls data are: 1) a star-like pattern concerning the order of colonization of the Indian Ocean islands and 2) the high genetic distinctiveness among all Indian Ocean taxa, reinforcing their recognition as distinct species.

  11. Tracing the colonization history of the Indian Ocean scops-owls (Strigiformes: Otus) with further insight into the spatio-temporal origin of the Malagasy avifauna

    PubMed Central

    2008-01-01

    Background: The island of Madagascar and surrounding volcanic and coralline islands are considered to form a biodiversity hotspot with large numbers of unique taxa. The origin of this endemic fauna can be explained by two different factors: vicariance or over-water-dispersal. Deciphering which factor explains the current distributional pattern of a given taxonomic group requires robust phylogenies as well as estimates of divergence times. The lineage of Indian Ocean scops-owls (Otus: Strigidae) includes six or seven species that are endemic to Madagascar and portions of the Comoros and Seychelles archipelagos; little is known about the species limits, biogeographic affinities and relationships to each other. In the present study, using DNA sequence data gathered from six loci, we examine the biogeographic history of the Indian Ocean scops-owls. We also compare the pattern and timing of colonization of the Indian Ocean islands by scops-owls with divergence times already proposed for other bird taxa. Results: Our analyses revealed that Indian Ocean islands scops-owls do not form a monophyletic assemblage: the Seychelles Otus insularis is genetically closer to the South-East Asian endemic O. sunia than to species from the Comoros and Madagascar. The Pemba Scops-owl O. pembaensis, often considered closely related to, if not conspecific with O. rutilus of Madagascar, is instead closely related to the African mainland O. senegalensis. Relationships among the Indian Ocean taxa from the Comoros and Madagascar are unresolved, despite the analysis of over 4000 bp, suggesting a diversification burst after the initial colonization event. We also highlight one case of putative back-colonization to the Asian mainland from an island ancestor (O. sunia). Our divergence date estimates, using a Bayesian relaxed clock method, suggest that all these events occurred during the last 3.6 myr; albeit colonization of the Indian Ocean islands was not synchronous, O. pembaensis diverged from O. senegalensis about 1.7 mya while species from Madagascar and the Comoros diverged from their continental sister-group about 3.6 mya. We highlight that our estimates coincide with estimates of diversification from other bird lineages. Conclusion: Our analyses revealed the occurrence of multiple synchronous colonization events of the Indian Ocean islands by scops-owls, at a time when faunistic exchanges involving Madagascar were common as a result of lowered sea-level that would have allowed the formation of stepping-stone islands. Patterns of diversification that emerged from the scops-owls data are: 1) a star-like pattern concerning the order of colonization of the Indian Ocean islands and 2) the high genetic distinctiveness among all Indian Ocean taxa, reinforcing their recognition as distinct species. PMID:18611281

  12. MOLECULAR DEMOGRAPHIC HISTORY OF THE ANNUAL SUNFLOWERS HELIANTHUS ANNUUS AND H. PETIOLARIS—LARGE EFFECTIVE POPULATION SIZES AND RATES OF LONG-TERM GENE FLOW

    PubMed Central

    Strasburg, Jared L.; Rieseberg, Loren H.

    2008-01-01

    Hybridization between distinct species may lead to introgression of genes across species boundaries, and this pattern can potentially persist for extended periods as long as selection at some loci or genomic regions prevents thorough mixing of gene pools. However, very few reliable estimates of long-term levels of effective migration are available between hybridizing species throughout their history. Accurate estimates of divergence dates and levels of gene flow require data from multiple unlinked loci as well as an analytical framework that can distinguish between lineage sorting and gene flow and incorporate the effects of demographic changes within each species. Here we use sequence data from 18 anonymous nuclear loci in two broadly sympatric sunflower species, Helianthus annuus and H. petiolaris, analyzed within an “isolation with migration” framework to make genome-wide estimates of the ages of these two species, long-term rates of gene flow between them, and effective population sizes and historical patterns of population growth. Our results indicate that H. annuus and H. petiolaris are approximately one million years old and have exchanged genes at a surprisingly high rate (long-term Nef m estimates of approximately 0.5 in each direction), with somewhat higher rates of introgression from H. annuus into H. petiolaris than vice versa. In addition, each species has undergone dramatic population expansion since divergence, and both species have among the highest levels of genetic diversity reported for flowering plants. Our results provide the most comprehensive estimate to date of long-term patterns of gene flow and historical demography in a nonmodel plant system, and they indicate that species integrity can be maintained even in the face of extensive gene flow over a prolonged period. PMID:18462213

  13. Sequence analysis of MHC class I α2 from sockeye salmon (Oncorhynchus nerka).

    PubMed

    McClelland, Erin K; Ming, Tobi J; Tabata, Amy; Miller, Kristina M

    2011-09-01

    Most studies assessing adaptive MHC diversity in salmon populations have focused on the classical class II DAB or DAA loci, as these have been most amenable to single PCR amplifications due to their relatively low level of sequence divergence. Herein, we report the characterization of the classical class I UBA α2 locus based on collections taken throughout the species range of sockeye salmon (Oncorhynchus nerka). Through use of multiple lineage-specific primer sets, denaturing gradient gel electrophoresis and sequencing, we identified thirty-four alleles from three highly divergent lineages. Sequence identity between lineages ranged from 30.0% to 56.8% but was relatively high within lineages. Allelic identity within the antigen recognition site (ARS) was greater than for the longer sequence. Global positive selection on UBA was seen at the sequence level (dN:dS = 1.012) with four codons under positive selection and 12 codons under negative selection. Crown Copyright © 2011. Published by Elsevier Ltd. All rights reserved.

  14. Novel Virus Discovery and Genome Reconstruction from Field RNA Samples Reveals Highly Divergent Viruses in Dipteran Hosts

    PubMed Central

    Bass, David; Moureau, Gregory; Tang, Shuoya; McAlister, Erica; Culverwell, C. Lorna; Glücksman, Edvard; Wang, Hui; Brown, T. David K.; Gould, Ernest A.; Harbach, Ralph E.; de Lamballerie, Xavier; Firth, Andrew E.

    2013-01-01

    We investigated whether small RNA (sRNA) sequenced from field-collected mosquitoes and chironomids (Diptera) can be used as a proxy signature of viral prevalence within a range of species and viral groups, using sRNAs sequenced from wild-caught specimens, to inform total RNA deep sequencing of samples of particular interest. Using this strategy, we sequenced from adult Anopheles maculipennis s.l. mosquitoes the apparently nearly complete genome of one previously undescribed virus related to chronic bee paralysis virus, and, from a pool of Ochlerotatus caspius and Oc. detritus mosquitoes, a nearly complete entomobirnavirus genome. We also reconstructed long sequences (1503-6557 nt) related to at least nine other viruses. Crucially, several of the sequences detected were reconstructed from host organisms highly divergent from those in which related viruses have been previously isolated or discovered. It is clear that viral transmission and maintenance cycles in nature are likely to be significantly more complex and taxonomically diverse than previously expected. PMID:24260463

  15. BLAST and FASTA similarity searching for multiple sequence alignment.

    PubMed

    Pearson, William R

    2014-01-01

    BLAST, FASTA, and other similarity searching programs seek to identify homologous proteins and DNA sequences based on excess sequence similarity. If two sequences share much more similarity than expected by chance, the simplest explanation for the excess similarity is common ancestry (homology). The most effective similarity searches compare protein sequences, rather than DNA sequences, for sequences that encode proteins, and use expectation values, rather than percent identity, to infer homology. The BLAST and FASTA packages of sequence comparison programs provide programs for comparing protein and DNA sequences to protein databases (the most sensitive searches). Protein and translated-DNA comparisons to protein databases routinely allow evolutionary look back times from 1 to 2 billion years; DNA:DNA searches are 5-10-fold less sensitive. BLAST and FASTA can be run on popular web sites, but can also be downloaded and installed on local computers. With local installation, target databases can be customized for the sequence data being characterized. With today's very large protein databases, search sensitivity can also be improved by searching smaller comprehensive databases, for example, a complete protein set from an evolutionarily neighboring model organism. By default, BLAST and FASTA use scoring strategies targeted for distant evolutionary relationships; for comparisons involving short domains or queries, or searches that seek relatively close homologs (e.g. mouse-human), shallower scoring matrices will be more effective. Both BLAST and FASTA provide very accurate statistical estimates, which can be used to reliably identify protein sequences that diverged more than 2 billion years ago.

  16. Mouse Vk gene classification by nucleic acid sequence similarity.

    PubMed

    Strohal, R; Helmberg, A; Kroemer, G; Kofler, R

    1989-01-01

    Analyses of immunoglobulin (Ig) variable (V) region gene usage in the immune response, estimates of V gene germline complexity, and other nucleic acid hybridization-based studies depend on the extent to which such genes are related (i.e., sequence similarity) and their organization in gene families. While mouse Igh heavy chain V region (VH) gene families are relatively well-established, a corresponding systematic classification of Igk light chain V region (Vk) genes has not been reported. The present analysis, in the course of which we reviewed the known extent of the Vk germline gene repertoire and Vk gene usage in a variety of responses to foreign and self antigens, provides a classification of mouse Vk genes in gene families composed of members with greater than 80% overall nucleic acid sequence similarity. This classification differed in several aspects from that of VH genes: only some Vk gene families were as clearly separated (by greater than 25% sequence dissimilarity) as typical VH gene families; most Vk gene families were closely related and, in several instances, members from different families were very similar (greater than 80%) over large sequence portions; frequently, classification by nucleic acid sequence similarity diverged from existing classifications based on amino-terminal protein sequence similarity. Our data have implications for Vk gene analyses by nucleic acid hybridization and describe potentially important differences in sequence organization between VH and Vk genes.

  17. The Past Sure is Tense: On Interpreting Phylogenetic Divergence Time Estimates.

    PubMed

    Brown, Joseph W; Smith, Stephen A

    2018-03-01

    Divergence time estimation (the calibration of a phylogeny to geological time) is an integral first step in modeling the tempo of biological evolution (traits and lineages). However, despite increasingly sophisticated methods to infer divergence times from molecular genetic sequences, the estimated ages of many nodes across the tree of life contrast significantly and consistently with timeframes conveyed by the fossil record. This is perhaps best exemplified by crown angiosperms, where molecular clock (Triassic) estimates predate the oldest (Early Cretaceous) undisputed angiosperm fossils by tens of millions of years or more. While the incompleteness of the fossil record is a common concern, issues of data limitation and model inadequacy are viable (if underexplored) alternative explanations. In this vein, Beaulieu et al. (2015) convincingly demonstrated how methods of divergence time inference can be misled by both (i) extreme state-dependent molecular substitution rate heterogeneity and (ii) biased sampling of representative major lineages. These results demonstrate the impact of (potentially common) model violations. Here, we suggest another potential challenge: that the configuration of the statistical inference problem (i.e., the parameters, their relationships, and associated priors) alone may preclude the reconstruction of the paleontological timeframe for the crown age of angiosperms. We demonstrate, through sampling from the joint prior (formed by combining the tree (diversification) prior with the calibration densities specified for fossil-calibrated nodes), that with no data present at all, an Early Cretaceous age for crown angiosperms is rejected (i.e., has essentially zero probability). More worrisome, however, is that for the 24 nodes calibrated by fossils, almost all have indistinguishable marginal prior and posterior age distributions when employing routine lognormal fossil calibration priors. These results indicate that there is inadequate information in the data to over-rule the joint prior. Given that these calibrated nodes are strategically placed in disparate regions of the tree, they act to anchor the tree scaffold, and so the posterior inference for the tree as a whole is largely determined by the pseudodata present in the (often arbitrary) calibration densities. We recommend, as for any Bayesian analysis, that marginal prior and posterior distributions be carefully compared to determine whether signal is coming from the data or prior belief, especially for parameters of direct interest. This recommendation is not novel. However, given how rarely such checks are carried out in evolutionary biology, it bears repeating. Our results demonstrate the fundamental importance of prior/posterior comparisons in any Bayesian analysis, and we hope that they further encourage both researchers and journals to consistently adopt this crucial step as standard practice. Finally, we note that the results presented here do not refute the biological modeling concerns identified by Beaulieu et al. (2015). Both sets of issues remain apposite to the goals of accurate divergence time estimation, and only by considering them in tandem can we move forward more confidently.
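
    The prior/posterior comparison recommended above can be made concrete in many ways. The abstract does not say how the authors quantified "indistinguishable", so the sketch below is an illustrative assumption: it takes two lists of sampled node ages (one drawn from the marginal prior, one from the posterior) and computes a histogram-based overlap coefficient, where 1 means identical distributions and 0 means no shared mass. The function name, the bin count, and the simulated lognormal samples are all hypothetical.

```python
import random


def overlap_coefficient(prior, posterior, bins=30):
    """Shared probability mass of two samples, using a common histogram grid."""
    lo = min(min(prior), min(posterior))
    hi = max(max(prior), max(posterior))
    if hi == lo:
        return 1.0  # every sampled value is identical
    width = (hi - lo) / bins

    def proportions(samples):
        counts = [0] * bins
        for s in samples:
            # Clamp the maximum value into the last bin.
            index = min(int((s - lo) / width), bins - 1)
            counts[index] += 1
        return [c / len(samples) for c in counts]

    p = proportions(prior)
    q = proportions(posterior)
    return sum(min(a, b) for a, b in zip(p, q))


# Simulated node ages (Ma) standing in for MCMC output; not real results.
rng = random.Random(1)
prior_ages = [100 + rng.lognormvariate(2.0, 0.5) for _ in range(5000)]
posterior_same = [100 + rng.lognormvariate(2.0, 0.5) for _ in range(5000)]
posterior_shifted = [140 + rng.lognormvariate(2.0, 0.5) for _ in range(5000)]

uninformative = overlap_coefficient(prior_ages, posterior_same)
informative = overlap_coefficient(prior_ages, posterior_shifted)
```

    A value near 1 for a calibrated node would suggest that the sequence data contributed little beyond the calibration density, which is the situation the abstract warns about; a low value suggests the data moved the estimate away from the prior.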

  18. Phylogenetic relationships of Malassezia species based on multilocus sequence analysis.

    PubMed

    Castellá, Gemma; Coutinho, Selene Dall' Acqua; Cabañes, F Javier

    2014-01-01

    Members of the genus Malassezia are lipophilic basidiomycetous yeasts, which are part of the normal cutaneous microbiota of humans and other warm-blooded animals. Currently, this genus consists of 14 species that have been characterized by phenetic and molecular methods. Although several molecular methods have been used to identify and/or differentiate Malassezia species, sequencing of the rRNA genes and the chitin synthase-2 gene (CHS2) is the most widely employed. There is little information about the β-tubulin gene in the genus Malassezia, a gene that has been used for the analysis of complex species groups. The aim of the present study was to sequence a fragment of the β-tubulin gene of Malassezia species and analyze their phylogenetic relationships using a multilocus sequence approach based on two rRNA genes (ITS including 5.8S rRNA and D1/D2 region of 26S rRNA) together with two protein encoding genes (CHS2 and β-tubulin). The phylogenetic study of the partial β-tubulin gene sequences indicated that this molecular marker can be used to assess diversity and identify new species. The multilocus sequence analysis of the four loci provides robust support to delineate species at the terminal nodes and could help to estimate divergence times for the origin and diversification of Malassezia species.

  19. Sexual selection and population divergence II. Divergence in different sexual traits and signal modalities in field crickets (Teleogryllus oceanicus).

    PubMed

    Pascoal, Sonia; Mendrok, Magdalena; Wilson, Alastair J; Hunt, John; Bailey, Nathan W

    2017-06-01

    Sexual selection can target many different types of traits. However, the relative influence of different sexually selected traits during evolutionary divergence is poorly understood. We used the field cricket Teleogryllus oceanicus to quantify and compare how five traits from each of three sexual signal modalities and components diverge among allopatric populations: male advertisement song, cuticular hydrocarbon (CHC) profiles and forewing morphology. Population divergence was unexpectedly consistent: we estimated the among-population (genetic) variance-covariance matrix, D, for all 15 traits, and D max explained nearly two-thirds of its variation. CHC and wing traits were most tightly integrated, whereas song varied more independently. We modeled the dependence of among-population trait divergence on genetic distance estimated from neutral markers to test for signatures of selection versus neutral divergence. For all three sexual trait types, phenotypic variation among populations was largely explained by a neutral model of divergence. Our findings illustrate how phenotypic integration across different types of sexual traits might impose constraints on the evolution of mating isolation and divergence via sexual selection. © 2017 The Author(s). Evolution © 2017 The Society for the Study of Evolution.

  20. An improved divergent synthesis of comb-type branched oligodeoxyribonucleotides (bDNA) containing multiple secondary sequences.

    PubMed

    Horn, T; Chang, C A; Urdea, M S

    1997-12-01

    The divergent synthesis of branched DNA (bDNA) comb structures is described. This new type of bDNA contains one unique oligonucleotide, the primary sequence, covalently attached through a comb-like branch network to many identical copies of a different oligonucleotide, the secondary sequence. The bDNA comb structures were assembled on a solid support and several synthesis parameters were investigated and optimized. The bDNA comb molecules were characterized by polyacrylamide gel electrophoretic methods and by controlled cleavage at periodate-cleavable moieties incorporated during synthesis. The developed chemistry allows synthesis of bDNA comb molecules containing multiple secondary sequences. In the accompanying article we describe the synthesis and characterization of large bDNA combs containing all four deoxynucleotides for use as signal amplifiers in nucleic acid quantification assays.

  1. An improved divergent synthesis of comb-type branched oligodeoxyribonucleotides (bDNA) containing multiple secondary sequences.

    PubMed Central

    Horn, T; Chang, C A; Urdea, M S

    1997-01-01

    The divergent synthesis of branched DNA (bDNA) comb structures is described. This new type of bDNA contains one unique oligonucleotide, the primary sequence, covalently attached through a comb-like branch network to many identical copies of a different oligonucleotide, the secondary sequence. The bDNA comb structures were assembled on a solid support and several synthesis parameters were investigated and optimized. The bDNA comb molecules were characterized by polyacrylamide gel electrophoretic methods and by controlled cleavage at periodate-cleavable moieties incorporated during synthesis. The developed chemistry allows synthesis of bDNA comb molecules containing multiple secondary sequences. In the accompanying article we describe the synthesis and characterization of large bDNA combs containing all four deoxynucleotides for use as signal amplifiers in nucleic acid quantification assays. PMID:9365265

  2. Feasibility of Measuring Mean Vertical Motion for Estimating Advection. Chapter 6

    NASA Technical Reports Server (NTRS)

    Vickers, Dean; Mahrt, L.

    2005-01-01

    Numerous recent studies calculate horizontal and vertical advection terms for budget studies of net ecosystem exchange of carbon. One potential uncertainty in such studies is the estimate of mean vertical motion. This work addresses the reliability of vertical advection estimates by contrasting the vertical motion obtained from the standard practise of measuring the vertical velocity and applying a tilt correction, to the vertical motion calculated from measurements of the horizontal divergence of the flow using a network of towers. Results are compared for three different tilt correction methods. Estimates of mean vertical motion are sensitive to the choice of tilt correction method. The short-term mean (10 to 60 minutes) vertical motion based on the horizontal divergence is more realistic compared to the estimates derived from the standard practise. The divergence shows long-term mean (days to months) sinking motion at the site, apparently due to the surface roughness change. Because all the tilt correction methods rely on the assumption that the long-term mean vertical motion is zero for a given wind direction, they fail to reproduce the vertical motion based on the divergence.

  3. Use of tuf Sequences for Genus-Specific PCR Detection and Phylogenetic Analysis of 28 Streptococcal Species

    PubMed Central

    Picard, François J.; Ke, Danbing; Boudreau, Dominique K.; Boissinot, Maurice; Huletsky, Ann; Richard, Dave; Ouellette, Marc; Roy, Paul H.; Bergeron, Michel G.

    2004-01-01

    A 761-bp portion of the tuf gene (encoding the elongation factor Tu) from 28 clinically relevant streptococcal species was obtained by sequencing amplicons generated using broad-range PCR primers. These tuf sequences were used to select Streptococcus-specific PCR primers and to perform phylogenetic analysis. The specificity of the PCR assay was verified using 102 different bacterial species, including the 28 streptococcal species. Genomic DNA purified from all streptococcal species was efficiently detected, whereas there was no amplification with DNA from 72 of the 74 nonstreptococcal bacterial species tested. There was cross-amplification with DNAs from Enterococcus durans and Lactococcus lactis. However, the 15 to 31% nucleotide sequence divergence in the 761-bp tuf portion of these two species compared to any streptococcal tuf sequence provides ample sequence divergence to allow the development of internal probes specific to streptococci. The Streptococcus-specific assay was highly sensitive for all 28 streptococcal species tested (i.e., detection limit of 1 to 10 genome copies per PCR). The tuf sequence data was also used to perform extensive phylogenetic analysis, which was generally in agreement with phylogeny determined on the basis of 16S rRNA gene data. However, the tuf gene provided a better discrimination at the streptococcal species level that should be particularly useful for the identification of very closely related species. In conclusion, tuf appears more suitable than the 16S ribosomal RNA gene for the development of diagnostic assays for the detection and identification of streptococcal species because of its higher level of species-specific genetic divergence. PMID:15297518

  4. Centromere location in Arabidopsis is unaltered by extreme divergence in CENH3 protein sequence

    PubMed Central

    2017-01-01

    During cell division, spindle fibers attach to chromosomes at centromeres. The DNA sequence at regional centromeres is fast evolving with no conserved genetic signature for centromere identity. Instead CENH3, a centromere-specific histone H3 variant, is the epigenetic signature that specifies centromere location across both plant and animal kingdoms. Paradoxically, CENH3 is also adaptively evolving. An ongoing question is whether CENH3 evolution is driven by a functional relationship with the underlying DNA sequence. Here, we demonstrate that despite extensive protein sequence divergence, CENH3 histones from distant species assemble centromeres on the same underlying DNA sequence. We first characterized the organization and diversity of centromere repeats in wild-type Arabidopsis thaliana. We show that A. thaliana CENH3-containing nucleosomes exhibit a strong preference for a unique subset of centromeric repeats. These sequences are largely missing from the genome assemblies and represent the youngest and most homogeneous class of repeats. Next, we tested the evolutionary specificity of this interaction in a background in which the native A. thaliana CENH3 is replaced with CENH3s from distant species. Strikingly, we find that CENH3 from Lepidium oleraceum and Zea mays, although specifying epigenetically weaker centromeres that result in genome elimination upon outcrossing, show a binding pattern on A. thaliana centromere repeats that is indistinguishable from the native CENH3. Our results demonstrate positional stability of a highly diverged CENH3 on independently evolved repeats, suggesting that the sequence specificity of centromeres is determined by a mechanism independent of CENH3. PMID:28223399

  5. The genetic signature of recent speciation in manta rays (Manta alfredi and M. birostris).

    PubMed

    Kashiwagi, Tom; Marshall, Andrea D; Bennett, Michael B; Ovenden, Jennifer R

    2012-07-01

    Manta rays have been taxonomically revised as two species, Manta alfredi and M. birostris, on the basis of morphological and meristic data, yet the two species occur in extensive mosaic sympatry. We analysed the genetic signatures of the species boundary using a portion of the nuclear RAG1 (681 base pairs), mitochondrial CO1 (574 bp) and ND5 genes (1188 bp). The assay with CO1 sequences, widely used in DNA barcoding, failed to distinguish the two species. The two species were clearly distinguishable, however, with no shared RAG1 or ND5 haplotypes. The species were reciprocally monophyletic for RAG1, but paraphyletic for ND5 sequences. Qualitative evidence and statistical inferences using the 'Isolation-with-Migration models' indicated that these results were better explained with post-divergence gene flow in the recent past rather than incomplete lineage sorting with zero gene flow since speciation. An estimate of divergence time was less than 0.5 Ma with an upper confidence limit of within 1 Ma. Recent speciation of highly mobile species in the marine environment is of great interest, as it suggests that speciation may have occurred in the absence of long-term physical barriers to gene flow. We propose that the ecologically driven forces such as habitat choice played a significant role in speciation in manta rays. Copyright © 2012 Elsevier Inc. All rights reserved.

  6. Origin and Diversification of Major Clades in Parmelioid Lichens (Parmeliaceae, Ascomycota) during the Paleogene Inferred by Bayesian Analysis

    PubMed Central

    Amo de Paz, Guillermo; Cubas, Paloma; Divakar, Pradeep K.; Lumbsch, H. Thorsten; Crespo, Ana

    2011-01-01

    There is a long-standing debate on the extent of vicariance and long-distance dispersal events to explain the current distribution of organisms, especially in those with small diaspores potentially prone to long-distance dispersal. Age estimates of clades play a crucial role in evaluating the impact of these processes. The aim of this study is to understand the evolutionary history of the largest clade of macrolichens, the parmelioid lichens (Parmeliaceae, Lecanoromycetes, Ascomycota) by dating the origin of the group and its major lineages. They have a worldwide distribution with centers of distribution in the Neo- and Paleotropics, and semi-arid subtropical regions of the Southern Hemisphere. Phylogenetic analyses were performed using DNA sequences of nuLSU and mtSSU rDNA, and the protein-coding RPB1 gene. The three DNA regions had different evolutionary rates: RPB1 gave a rate two to four times higher than nuLSU and mtSSU. Divergence times of the major clades were estimated with partitioned BEAST analyses allowing different rates for each DNA region and using a relaxed clock model. Three calibration points were used to date the tree: an inferred age at the stem of Lecanoromycetes, and two dated fossils: Parmelia in the parmelioid group, and Alectoria. Palaeoclimatic conditions and the palaeogeological area cladogram were compared to the dated phylogeny of parmelioids. The parmelioid group diversified around the K/T boundary, and the major clades diverged during the Eocene and Oligocene. The radiation of the genera occurred through globally changing climatic conditions of the early Oligocene, Miocene and early Pliocene. The estimated divergence times are consistent with long-distance dispersal events being the major factor to explain the biogeographical distribution patterns of Southern Hemisphere parmelioids, especially for Africa-Australia disjunctions, because the sequential break-up of Gondwana started much earlier than the origin of these clades.
However, our data cannot reject vicariance to explain South America-Australia disjunctions. PMID:22174775

  7. Testing for shared biogeographic history in the lower Central American freshwater fish assemblage using comparative phylogeography: concerted, independent, or multiple evolutionary responses?

    PubMed Central

    Bagley, Justin C; Johnson, Jerald B

    2014-01-01

    A central goal of comparative phylogeography is determining whether codistributed species experienced (1) concerted evolutionary responses to past geological and climatic events, indicated by congruent spatial and temporal patterns (“concerted-response hypothesis”); (2) independent responses, indicated by spatial incongruence (“independent-response hypothesis”); or (3) multiple responses (“multiple-response hypothesis”), indicated by spatial congruence but temporal incongruence (“pseudocongruence”) or spatial and temporal incongruence (“pseudoincongruence”). We tested these competing hypotheses using DNA sequence data from three livebearing fish species codistributed in the Nicaraguan depression of Central America (Alfaro cultratus, Poecilia gillii, and Xenophallus umbratilis) that we predicted might display congruent responses due to co-occurrence in identical freshwater drainages. Spatial analyses recovered different subdivisions of genetic structure for each species, despite shared finer-scale breaks in northwestern Costa Rica (also supported by phylogenetic results). Isolation-with-migration models estimated incongruent timelines of among-region divergences, with A. cultratus and Xenophallus populations diverging over Miocene–mid-Pleistocene while P. gillii populations diverged over mid-late Pleistocene. Approximate Bayesian computation also lent substantial support to multiple discrete divergences over a model of simultaneous divergence across shared spatial breaks (e.g., Bayes factor [B10] = 4.303 for Ψ [no. of divergences] > 1 vs. Ψ = 1). Thus, the data support phylogeographic pseudoincongruence consistent with the multiple-response hypothesis. Model comparisons also indicated incongruence in historical demography, for example, support for intraspecific late Pleistocene population growth was unique to P. gillii, despite evidence for finer-scale population expansions in the other taxa. 
Empirical tests for phylogeographic congruence indicate that multiple evolutionary responses to historical events have shaped the population structure of freshwater species codistributed within the complex landscapes in/around the Nicaraguan depression. Recent community assembly through different routes (i.e., different past distributions or colonization routes), and intrinsic ecological differences among species, has likely contributed to the unique phylogeographical patterns displayed by these Neotropical fishes. PMID:24967085

  8. Molecular and Paleontological Evidence for a Post-Cretaceous Origin of Rodents

    PubMed Central

    Wu, Shaoyuan; Wu, Wenyu; Zhang, Fuchun; Ye, Jie; Ni, Xijun; Sun, Jimin; Edwards, Scott V.; Meng, Jin; Organ, Chris L.

    2012-01-01

    The timing of the origin and diversification of rodents remains controversial, due to conflicting results from molecular clocks and paleontological data. The fossil record tends to support an early Cenozoic origin of crown-group rodents. In contrast, most molecular studies place the origin and initial diversification of crown-Rodentia deep in the Cretaceous, although some molecular analyses have recovered estimated divergence times that are more compatible with the fossil record. Here we attempt to resolve this conflict by carrying out a molecular clock investigation based on a nine-gene sequence dataset and a novel set of seven fossil constraints, including two new rodent records (the earliest known representatives of Cardiocraniinae and Dipodinae). Our results indicate that rodents originated around 61.7–62.4 Ma, shortly after the Cretaceous/Paleogene (K/Pg) boundary, and diversified at the intraordinal level around 57.7–58.9 Ma. These estimates are broadly consistent with the paleontological record, but challenge previous molecular studies that place the origin and early diversification of rodents in the Cretaceous. This study demonstrates that, with reliable fossil constraints, the incompatibility between paleontological and molecular estimates of rodent divergence times can be eliminated using currently available tools and genetic markers. Similar conflicts between molecular and paleontological evidence bedevil attempts to establish the origination times of other placental groups. The example of the present study suggests that more reliable fossil calibration points may represent the key to resolving these controversies. PMID:23071573

  9. Snake mitochondrial genomes: phylogenetic relationships and implications of extended taxon sampling for interpretations of mitogenomic evolution

    PubMed Central

    2010-01-01

    Background Snake mitochondrial genomes are of great interest in understanding mitogenomic evolution because of gene duplications and rearrangements and the fast evolutionary rate of their genes compared to other vertebrates. Mitochondrial gene sequences have also played an important role in attempts to resolve the contentious phylogenetic relationships of especially the early divergences among alethinophidian snakes. Two recent innovative studies found dramatic gene- and branch-specific relative acceleration in snake protein-coding gene evolution, particularly along internal branches leading to Serpentes and Alethinophidia. It has been hypothesized that some of these rate shifts are temporally (and possibly causally) associated with control region duplication and/or major changes in ecology and anatomy. Results The near-complete mitochondrial (mt) genomes of three henophidian snakes were sequenced: Anilius scytale, Rhinophis philippinus, and Charina trivirgata. All three genomes share a duplicated control region and translocated tRNALEU, derived features found in all alethinophidian snakes studied to date. The new sequence data were aligned with mt genome data for 21 other species of snakes and used in phylogenetic analyses. Phylogenetic results agreed with many other studies in recovering several robust clades, including Colubroidea, Caenophidia, and Cylindrophiidae+Uropeltidae. Nodes within Henophidia that have been difficult to resolve robustly in previous analyses remained uncompellingly resolved here. Comparisons of relative rates of evolution of rRNA vs. protein-coding genes were conducted by estimating branch lengths across the tree. 
Our expanded sampling revealed dramatic acceleration along the branch leading to Typhlopidae, particularly long rRNA terminal branches within Scolecophidia, and that most of the dramatic acceleration in protein-coding gene rate along Serpentes and Alethinophidia branches occurred before Anilius diverged from other alethinophidians. Conclusions Mitochondrial gene sequence data alone may not be able to robustly resolve basal divergences among alethinophidian snakes. Taxon sampling plays an important role in identifying mitogenomic evolutionary events within snakes, and in testing hypotheses explaining their origin. Dramatic rate shifts in mitogenomic evolution occur within Scolecophidia as well as Alethinophidia, thus falsifying the hypothesis that these shifts in snakes are associated exclusively with evolution of a non-burrowing lifestyle, macrostomatan feeding ecology and/or duplication of the control region, both restricted to alethinophidians among living snakes. PMID:20055998

  10. Cross-cultural estimation of the human generation interval for use in genetics-based population divergence studies.

    PubMed

    Fenner, Jack N

    2005-10-01

    The length of the human generation interval is a key parameter when using genetics to date population divergence events. However, no consensus exists regarding the generation interval length, and a wide variety of interval lengths have been used in recent studies. This makes comparison between studies difficult, and questions the accuracy of divergence date estimations. Recent genealogy-based research suggests that the male generation interval is substantially longer than the female interval, and that both are greater than the values commonly used in genetics studies. This study evaluates each of these hypotheses in a broader cross-cultural context, using data from both nation states and recent hunter-gatherer societies. Both hypotheses are supported by this study; therefore, revised estimates of male, female, and overall human generation interval lengths are proposed. The nearly universal, cross-cultural nature of the evidence justifies using these proposed estimates in Y-chromosomal, mitochondrial, and autosomal DNA-based population divergence studies.
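
    The role of the generation interval in dating can be shown with a short worked conversion. This is only a sketch: the number of generations and the three interval lengths below are hypothetical values chosen for illustration, not the estimates proposed in the study, and the function name is an assumption.

```python
def divergence_in_years(generations, generation_interval_years):
    """Convert a divergence time in generations into calendar years."""
    if generations < 0 or generation_interval_years <= 0:
        raise ValueError("generations must be >= 0 and the interval > 0")
    return generations * generation_interval_years


# Hypothetical inputs for illustration only; none come from the study.
generations_since_split = 2000
for interval in (20, 25, 30):
    years = divergence_in_years(generations_since_split, interval)
    print(f"{interval}-year interval: {years} years")
```

    Because the conversion is a simple product, any relative error in the assumed interval carries over unchanged to the divergence date: moving from a 20-year to a 30-year interval lengthens the estimate by half.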

  11. Characterizing chaotic dynamics from integrate-and-fire interspike intervals at the presence of noise

    NASA Astrophysics Data System (ADS)

    Mohammad, Yasir K.; Pavlova, Olga N.; Pavlov, Alexey N.

    2016-04-01

    We discuss the problem of quantifying chaotic dynamics at the input of the "integrate-and-fire" (IF) model from the output sequences of interspike intervals (ISIs) for the case when the fluctuating threshold level leads to the appearance of noise in ISI series. We propose a way to detect an ability of computing dynamical characteristics of the input dynamics and the level of noise in the output point processes. The proposed approach is based on the dependence of the largest Lyapunov exponent on the maximal orientation error used in estimating the averaged rate of divergence of nearby phase trajectories.

  12. Scale dependence of the 200-mb divergence inferred from EOLE data.

    NASA Technical Reports Server (NTRS)

    Morel, P.; Necco, G.

    1973-01-01

    The EOLE experiment with 480 constant-volume balloons distributed over the Southern Hemisphere approximately at the 200-mb level, has provided a unique, highly accurate set of tracer trajectories in the general westerly circulation. The trajectories of neighboring balloons are analyzed to estimate the horizontal divergence from the Lagrangian derivative of the area of one cluster. The variance of the divergence estimates results from two almost comparable effects: the true divergence of the horizontal flow and eddy diffusion due to small-scale, two-dimensional turbulence. Taking this into account, the rms divergence is found to be of the order of 0.00001 per sec and decreases logarithmically with cluster size. This scale dependence is shown to be consistent with the quasi-geostrophic turbulence model of the general circulation in midlatitudes.
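
    The cluster-area estimate described above rests on the relation D = (1/A) dA/dt, where A is the area enclosed by a cluster of balloons. A minimal sketch follows, assuming planar (x, y) positions in metres for the same balloons at two times and a finite-difference form ln(A1/A0)/dt; the function names and the example triangle are assumptions for illustration, and the eddy-diffusion contribution discussed in the abstract is not modelled.

```python
import math


def polygon_area(points):
    """Shoelace formula for a simple polygon given as ordered (x, y) vertices."""
    total = 0.0
    n = len(points)
    for i in range(n):
        x1, y1 = points[i]
        x2, y2 = points[(i + 1) % n]
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0


def cluster_divergence(points_t0, points_t1, dt_seconds):
    """Estimate (1/A) dA/dt as ln(A1/A0)/dt between two cluster snapshots."""
    a0 = polygon_area(points_t0)
    a1 = polygon_area(points_t1)
    if a0 <= 0 or a1 <= 0 or dt_seconds <= 0:
        raise ValueError("areas and time step must be positive")
    return math.log(a1 / a0) / dt_seconds


# Hypothetical triangle of three balloons that dilates by 1% per axis in 1000 s.
before = [(0.0, 0.0), (100000.0, 0.0), (0.0, 100000.0)]
after = [(1.01 * x, 1.01 * y) for x, y in before]
print(cluster_divergence(before, after, 1000.0))  # per second
```

    The logarithmic form is exact for a uniform dilation and reduces to (A1 - A0)/(A0 dt) when the area change is small.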

  13. Characterisation of divergent flavivirus NS3 and NS5 protein sequences detected in Rhipicephalus microplus ticks from Brazil

    PubMed Central

    Maruyama, Sandra Regina; Castro-Jorge, Luiza Antunes; Ribeiro, José Marcos Chaves; Gardinassi, Luiz Gustavo; Garcia, Gustavo Rocha; Brandão, Lucinda Giampietro; Rodrigues, Aline Rezende; Okada, Marcos Ituo; Abrão, Emiliana Pereira; Ferreira, Beatriz Rossetti; da Fonseca, Benedito Antonio Lopes; de Miranda-Santos, Isabel Kinney Ferreira

    2013-01-01

    Transcripts similar to those that encode the nonstructural (NS) proteins NS3 and NS5 from flaviviruses were found in a salivary gland (SG) complementary DNA (cDNA) library from the cattle tick Rhipicephalus microplus. Tick extracts were cultured with cells to enable the isolation of viruses capable of replicating in cultured invertebrate and vertebrate cells. Deep sequencing of the viral RNA isolated from culture supernatants provided the complete coding sequences for the NS3 and NS5 proteins and their molecular characterisation confirmed similarity with the NS3 and NS5 sequences from other flaviviruses. Despite this similarity, phylogenetic analyses revealed that this potentially novel virus may be a highly divergent member of the genus Flavivirus. Interestingly, we detected the divergent NS3 and NS5 sequences in ticks collected from several dairy farms widely distributed throughout three regions of Brazil. This is the first report of flavivirus-like transcripts in R. microplus ticks. This novel virus is a potential arbovirus because it replicated in arthropod and mammalian cells; furthermore, it was detected in a cDNA library from tick SGs and therefore may be present in tick saliva. It is important to determine whether and by what means this potential virus is transmissible and to monitor the virus as a potential emerging tick-borne zoonotic pathogen. PMID:24626302

  14. Candida ruelliae sp. nov., a novel yeast species isolated from flowers of Ruellia sp. (Acanthaceae).

    PubMed

    Saluja, Puja; Prasad, Gandham S

    2008-06-01

    Two novel yeast strains designated as 16Q1 and 16Q3 were isolated from flowers of the Ruellia species of the Acanthaceae family. The D1/D2 domain and ITS sequences of these two strains were identical. Sequence analysis of the D1/D2 domain of large-subunit rRNA gene indicated their relationship to species of the Candida haemulonii cluster. However, they differ from C. haemulonii by 14% nucleotide sequence divergence, from Candida pseudohaemulonii by 16.1% and from C. haemulonii type II by 16.5%. These strains also differ in 18 physiological tests from the type strain of C. haemulonii, and 12 and 16 tests, respectively, from C. pseudohaemulonii and C. haemulonii type II. They also differ from C. haemulonii and other related species by more than 13% sequence divergence in the internal transcribed spacer region. In the SSU rRNA gene sequences, strain 16Q1 differs by 1.7% nucleotide divergence from C. haemulonii. Sporulation was not observed in pure or mixed cultures on several media examined. All these data support the assignment of these strains to a novel species; we have named them as Candida ruelliae sp. nov., and designate strain 16Q1(T)=MTCC 7739(T)=CBS10815(T) as type strain of the novel species.

  15. The complex evolutionary dynamics of ancient and recent polyploidy in Leucaena (Leguminosae; Mimosoideae).

    PubMed

    Govindarajulu, Rajanikanth; Hughes, Colin E; Alexander, Patrick J; Bailey, C Donovan

    2011-12-01

    The evolutionary history of Leucaena has been impacted by polyploidy, hybridization, and divergent allopatric species diversification, suggesting that this is an ideal group to investigate the evolutionary tempo of polyploidy and the complexities of reticulation and divergence in plant diversification. Parsimony- and ML-based phylogenetic approaches were applied to 105 accessions sequenced for six sequence characterized amplified region-based nuclear encoded loci, nrDNA ITS, and four cpDNA regions. Hypotheses for the origin of tetraploid species were inferred using results derived from a novel species tree and established gene tree methods and from data on genome sizes and geographic distributions. The combination of comprehensively sampled multilocus DNA sequence data sets and a novel methodology provides strong resolution and support for the origins of all five tetraploid species. A minimum of four allopolyploidization events are required to explain the origins of these species. The origin(s) of one tetraploid pair (L. involucrata/L. pallida) can be equally explained by two unique allopolyploidizations or a single event followed by divergent speciation. Alongside other recent findings, a comprehensive picture of the complex evolutionary dynamics of polyploidy in Leucaena is emerging that includes paleotetraploidization, diploidization of the last common ancestor to Leucaena, allopatric divergence among diploids, and recent allopolyploid origins for tetraploid species likely associated with human translocation of seed. These results provide insights into the role of divergence and reticulation in a well-characterized angiosperm lineage and into traits of diploid parents and derived tetraploids (particularly self-compatibility and year-round flowering) favoring the formation and establishment of novel tetraploid combinations.

  16. Phylogenetic analysis of Haemaphysalis erinacei Pavesi, 1884 (Acari: Ixodidae) from China, Turkey, Italy and Romania.

    PubMed

    Hornok, Sándor; Wang, Yuanzhi; Otranto, Domenico; Keskin, Adem; Lia, Riccardo Paolo; Kontschán, Jenő; Takács, Nóra; Farkas, Róbert; Sándor, Attila D

    2016-12-15

    Haemaphysalis erinacei is one of the few ixodid tick species for which valid names of subspecies exist. Despite their disputed taxonomic status in the literature, these subspecies have not yet been compared with molecular methods. The aim of the present study was to investigate the phylogenetic relationships of H. erinacei subspecies, in the context of the first finding of this tick species in Romania. After morphological identification, DNA was extracted from five adults of H. e. taurica (from Romania and Turkey), four adults of H. e. erinacei (from Italy) and 17 adults of H. e. turanica (from China). From these samples fragments of the cytochrome c oxidase subunit 1 (cox1) and 16S rRNA genes were amplified via PCR and sequenced. Results showed that cox1 and 16S rRNA gene sequence divergences between H. e. taurica from Romania and H. e. erinacei from Italy were below 2%. However, the sequence divergences between H. e. taurica from Romania and H. e. turanica from China were high (up to 7.3% difference for the 16S rRNA gene), exceeding the reported level of sequence divergence between closely related tick species. At the same time, two adults of H. e. taurica from Turkey had higher 16S rRNA gene similarity to H. e. turanica from China (up to 97.5%) than to H. e. taurica from Romania (96.3%), but phylogenetically clustered more closely to H. e. taurica than to H. e. turanica. This is the first finding of H. erinacei in Romania, and the first (although preliminary) phylogenetic comparison of H. erinacei subspecies. Phylogenetic analyses did not support that the three H. erinacei subspecies evaluated here are of equal taxonomic rank, because the genetic divergence between H. e. turanica from China and H. e. taurica from Romania exceeded the usual level of sequence divergence between closely related tick species, suggesting that they might represent different species. Therefore, the taxonomic status of the subspecies of H. erinacei needs to be revised based on a larger number of specimens collected throughout its geographical range.

  17. Phylogenetic analysis of Demodex caprae based on mitochondrial 16S rDNA sequence.

    PubMed

    Zhao, Ya-E; Hu, Li; Ma, Jun-Xian

    2013-11-01

    Demodex caprae infests the hair follicles and sebaceous glands of goats worldwide, which seriously impairs goat farming and causes substantial economic losses. However, there are few reports on D. caprae at the DNA level. To reveal the taxonomic position of D. caprae within the genus Demodex, the present study conducted a phylogenetic analysis of D. caprae based on mt16S rDNA sequence data. D. caprae adults and eggs were obtained from a skin nodule of a goat suffering from demodicidosis. The mt16S rDNA sequences of individual mites were amplified using specific primers, and then cloned, sequenced, and aligned. The sequence divergence, genetic distance, and transition/transversion rate were computed, and the phylogenetic trees in Demodex were reconstructed. Partial sequences of 339 bp were obtained from six D. caprae isolates, and the sequence identity was 100% among isolates. The pairwise divergences between D. caprae and Demodex canis or Demodex folliculorum or Demodex brevis were 22.2-24.0%, 24.0-24.9%, and 22.9-23.2%, respectively. The corresponding average genetic distances were 2.840, 2.926, and 2.665, and the average transition/transversion rates were 0.70, 0.55, and 0.54, respectively. The divergences, genetic distances, and transition/transversion rates of D. caprae versus the other three species all reached interspecies level. All five phylogenetic trees showed that D. caprae clustered first with D. brevis, and then with D. canis, D. folliculorum, and Demodex injai in sequence. In conclusion, D. caprae is an independent species, and it is closer to D. brevis than to D. canis, D. folliculorum, or D. injai.
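
    The quantities named in this abstract (pairwise sequence divergence and the transition/transversion count) can be illustrated with a minimal sketch. It assumes two pre-aligned DNA strings and skips sites with gaps or ambiguous bases; the function name and the toy sequences are hypothetical, and this is not the software the authors used.

```python
# Illustrative sketch only; not the software used in the study.
PURINES = frozenset("AG")
PYRIMIDINES = frozenset("CT")


def divergence_and_ts_tv(seq_a, seq_b):
    """Return (p_distance, transitions, transversions) for two aligned sequences.

    Sites with a gap or an ambiguous base in either sequence are skipped.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    sites = transitions = transversions = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue
        sites += 1
        if a == b:
            continue
        pair = {a, b}
        # A<->G and C<->T are transitions; every other change is a transversion.
        if pair <= PURINES or pair <= PYRIMIDINES:
            transitions += 1
        else:
            transversions += 1
    if sites == 0:
        raise ValueError("no comparable sites")
    return (transitions + transversions) / sites, transitions, transversions


# Hypothetical toy alignment (not data from the abstract).
p, ts, tv = divergence_and_ts_tv("ACGTACGT", "GCGTACTT")
print(f"divergence = {p:.1%}, transitions = {ts}, transversions = {tv}")
```

    The uncorrected proportion of differing sites is only one possible definition of divergence; the abstract does not state which distance model produced its reported genetic distances.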

  18. Early Divergent Strains of Yersinia pestis in Eurasia 5,000 Years Ago

    PubMed Central

    Rasmussen, Simon; Allentoft, Morten Erik; Nielsen, Kasper; Orlando, Ludovic; Sikora, Martin; Sjögren, Karl-Göran; Pedersen, Anders Gorm; Schubert, Mikkel; Van Dam, Alex; Kapel, Christian Moliin Outzen; Nielsen, Henrik Bjørn; Brunak, Søren; Avetisyan, Pavel; Epimakhov, Andrey; Khalyapin, Mikhail Viktorovich; Gnuni, Artak; Kriiska, Aivar; Lasak, Irena; Metspalu, Mait; Moiseyev, Vyacheslav; Gromov, Andrei; Pokutta, Dalia; Saag, Lehti; Varul, Liivi; Yepiskoposyan, Levon; Sicheritz-Pontén, Thomas; Foley, Robert A.; Lahr, Marta Mirazón; Nielsen, Rasmus; Kristiansen, Kristian; Willerslev, Eske

    2015-01-01

    Summary The bacteria Yersinia pestis is the etiological agent of plague and has caused human pandemics with millions of deaths in historic times. How and when it originated remains contentious. Here, we report the oldest direct evidence of Yersinia pestis identified by ancient DNA in human teeth from Asia and Europe dating from 2,800 to 5,000 years ago. By sequencing the genomes, we find that these ancient plague strains are basal to all known Yersinia pestis. We find the origins of the Yersinia pestis lineage to be at least two times older than previous estimates. We also identify a temporal sequence of genetic changes that lead to increased virulence and the emergence of the bubonic plague. Our results show that plague infection was endemic in the human populations of Eurasia at least 3,000 years before any historical recordings of pandemics. PMID:26496604

  19. Mitochondrial phylogenomics of Hemiptera reveals adaptive innovations driving the diversification of true bugs

    PubMed Central

    Li, Hu; Leavengood, John M.; Chapman, Eric G.; Burkhardt, Daniel; Song, Fan; Jiang, Pei; Liu, Jinpeng; Cai, Wanzhi

    2017-01-01

    Hemiptera, the largest non-holometabolous order of insects, represents approximately 7% of metazoan diversity. With extraordinary life histories and highly specialized morphological adaptations, hemipterans have exploited diverse habitats and food sources through approximately 300 Myr of evolution. To elucidate the phylogeny and evolutionary history of Hemiptera, we carried out the most comprehensive mitogenomics analysis on the richest taxon sampling to date covering all the suborders and infraorders, including 34 newly sequenced and 94 published mitogenomes. With optimized branch length and sequence heterogeneity, Bayesian analyses using a site-heterogeneous mixture model resolved the higher-level hemipteran phylogeny as (Sternorrhyncha, (Auchenorrhyncha, (Coleorrhyncha, Heteroptera))). Ancestral character state reconstruction and divergence time estimation suggest that the success of true bugs (Heteroptera) is probably due to angiosperm coevolution, but key adaptive innovations (e.g. prognathous mouthpart, predatory behaviour, and haemelytron) facilitated multiple independent shifts among diverse feeding habits and multiple independent colonizations of aquatic habitats. PMID:28878063

  20. Hybridization Reveals the Evolving Genomic Architecture of Speciation

    PubMed Central

    Kronforst, Marcus R.; Hansen, Matthew E.B.; Crawford, Nicholas G.; Gallant, Jason R.; Zhang, Wei; Kulathinal, Rob J.; Kapan, Durrell D.; Mullen, Sean P.

    2014-01-01

    SUMMARY The rate at which genomes diverge during speciation is unknown, as are the physical dynamics of the process. Here, we compare full genome sequences of 32 butterflies, representing five species from a hybridizing Heliconius butterfly community, to examine genome-wide patterns of introgression and infer how divergence evolves during the speciation process. Our analyses reveal that initial divergence is restricted to a small fraction of the genome, largely clustered around known wing-patterning genes. Over time, divergence evolves rapidly, due primarily to the origin of new divergent regions. Furthermore, divergent genomic regions display signatures of both selection and adaptive introgression, demonstrating the link between microevolutionary processes acting within species and the origin of species across macroevolutionary timescales. Our results provide a uniquely comprehensive portrait of the evolving species boundary due to the role that hybridization plays in reducing the background accumulation of divergence at neutral sites. PMID:24183670

  1. Dissecting the relationship between protein structure and sequence variation

    NASA Astrophysics Data System (ADS)

    Shahmoradi, Amir; Wilke, Claus; Wilke Lab Team

    2015-03-01

    Over the past decade several independent works have shown that some structural properties of proteins are capable of predicting protein evolution. The strength and significance of these structure-sequence relations, however, appear to vary widely among different proteins, with absolute correlation strengths ranging from 0.1 to 0.8. Here we present the results from a comprehensive search for the potential biophysical and structural determinants of protein evolution by studying more than 200 structural and evolutionary properties in a dataset of 209 monomeric enzymes. We discuss the main protein characteristics responsible for the general patterns of protein evolution, and identify sequence divergence as the main determinant of the strengths of virtually all structure-evolution relationships, explaining ~10-30% of observed variation in sequence-structure relations. In addition to sequence divergence, we identify several protein structural properties that are moderately but significantly coupled with the strength of sequence-structure relations. In particular, proteins with more homogeneous backbone hydrogen bond energies, large fractions of helical secondary structures and low fraction of beta sheets tend to have the strongest sequence-structure relation. BEACON-NSF center for the study of evolution in action.

  2. Evolutionary and preservational constraints on origins of biologic groups: divergence times of eutherian mammals

    NASA Technical Reports Server (NTRS)

    Foote, M.; Hunter, J. P.; Janis, C. M.; Sepkoski, J. J. Jr

    1999-01-01

    Some molecular clock estimates of divergence times of taxonomic groups undergoing evolutionary radiation are much older than the groups' first observed fossil record. Mathematical models of branching evolution are used to estimate the maximal rate of fossil preservation consistent with a postulated missing history, given the sum of species durations implied by early origins under a range of species origination and extinction rates. The plausibility of postulated divergence times depends on origination, extinction, and preservation rates estimated from the fossil record. For eutherian mammals, this approach suggests that it is unlikely that many modern orders arose much earlier than their oldest fossil records.

  3. Full-genome sequence and analysis of a novel human rhinovirus strain within a divergent HRV-A clade.

    PubMed

    Rathe, Jennifer A; Liu, Xinyue; Tallon, Luke J; Gern, James E; Liggett, Stephen B

    2010-01-01

    Genome sequences of human rhinoviruses (HRV) have primarily been from stocks collected in the 1960s, with genomes and phylogeny of modern HRVs remaining undefined. Here, two modern isolates (hrv-A101 and hrv-A101-v1) collected approximately 8 years apart were sequenced in their entirety. Incorporation into our full-genome HRV alignment with subsequent phylogenetic network inference indicated that these represent a unique HRV-A, localized within a distinct divergent clade. They appear to have resulted from recombination of the hrv-65 and hrv-78 lineages. These results support our contention that there are unrecognized distinct HRV-A strains, and that recombination is evident in currently circulating strains.

  4. The Species Dilemma of Northeast Indian Mahseer (Actinopterygii: Cyprinidae): DNA Barcoding in Clarifying the Riddle

    PubMed Central

    Laskar, Boni A.; Bhattacharjee, Maloyjo J.; Dhar, Bishal; Mahadani, Pradosh; Kundu, Shantanu; Ghosh, Sankar K.

    2013-01-01

    Background The taxonomic validity of Northeast Indian endemic Mahseer species, Tor progeneius and Neolissochilus hexastichus, has been argued repeatedly. This is mainly due to disagreements in recognizing the species based on morphological characters. Consequently, both the species have been concealed for many decades. DNA barcoding has become a promising and an independent technique for accurate species level identification. Therefore, utilization of such technique in association with the traditional morphotaxonomic description can resolve the species dilemma of this important group of sport fishes. Methodology/Principal Findings Altogether, 28 mahseer specimens including paratypes were studied from different locations in Northeast India, and 24 morphometric characters were measured invariably. The Principal Component Analysis with morphometric data revealed five distinct groups of sample that were taxonomically categorized into 4 species, viz., Tor putitora, T. progeneius, Neolissochilus hexagonolepis and N. hexastichus. Analysis with a dataset of 76 DNA barcode sequences of different mahseer species exhibited that the queries of T. putitora and N. hexagonolepis clustered cohesively with the respective conspecific database sequences maintaining 0.8% maximum K2P divergence. The closest congeneric divergence was 3 times higher than the mean conspecific divergence and was considered as barcode gap. The maximum divergence among the samples of T. progeneius and T. putitora was 0.8% that was much below the barcode gap, indicating them being synonymous. The query sequences of N. hexastichus invariably formed a discrete and a congeneric clade with the database sequences and maintained the interspecific divergence that supported its distinct species status. Notably, N. hexastichus was encountered in a single site and seemed to be under threat. Conclusion This study substantiated the identification of N. hexastichus to be a true species, and tentatively regarded T. progeneius to be a synonym of T. putitora. It would guide the conservationists to initiate priority conservation of N. hexastichus and T. putitora. PMID:23341979
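
    The K2P (Kimura two-parameter) divergence mentioned in this abstract has a standard closed form, d = -1/2 ln[(1 - 2P - Q) sqrt(1 - 2Q)], where P and Q are the proportions of transitional and transversional differences. A minimal sketch follows, assuming two pre-aligned sequences; the function name and the toy sequences are hypothetical and are not taken from the study.

```python
import math

# Illustrative sketch only; not the pipeline used in the study.
PURINES = frozenset("AG")
PYRIMIDINES = frozenset("CT")


def k2p_distance(seq_a, seq_b):
    """Kimura two-parameter distance between two aligned DNA sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    sites = transitions = transversions = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue  # skip gaps and ambiguous bases
        sites += 1
        if a == b:
            continue
        pair = {a, b}
        if pair <= PURINES or pair <= PYRIMIDINES:
            transitions += 1
        else:
            transversions += 1
    if sites == 0:
        raise ValueError("no comparable sites")
    p = transitions / sites    # proportion of transitional differences (P)
    q = transversions / sites  # proportion of transversional differences (Q)
    inner = (1 - 2 * p - q) * math.sqrt(max(1 - 2 * q, 0.0))
    if inner <= 0:
        raise ValueError("sequences too divergent for the K2P correction")
    return -0.5 * math.log(inner)


# Hypothetical toy pair: one transition (A<->G) in ten sites.
d = k2p_distance("AAAAACCCCC", "GAAAACCCCC")
print(f"K2P distance = {d:.4f}")
```

    With such a function, a barcode gap check in the sense used above would compare the closest congeneric distance against the mean conspecific distance across all sequence pairs.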

  5. Brettanomyces acidodurans sp. nov., a new acetic acid producing yeast species from olive oil.

    PubMed

    Péter, Gábor; Dlauchy, Dénes; Tóbiás, Andrea; Fülöp, László; Podgoršek, Martina; Čadež, Neža

    2017-05-01

    Two yeast strains representing a hitherto undescribed yeast species were isolated from olive oil and spoiled olive oil originating from Spain and Israel, respectively. Both strains are strong acetic acid producers, equipped with considerable tolerance to acetic acid. The cultures are not short-lived. Cellobiose is fermented as well as several other sugars. The sequences of their large subunit (LSU) rRNA gene D1/D2 domain are very divergent from the sequences available in the GenBank. They differ from the closest hit, Brettanomyces naardenensis by about 27%, mainly substitutions. Sequence analyses of the concatenated dataset from genes of the small subunit (SSU) rRNA, LSU rRNA and translation elongation factor-1α (EF-1α) placed the two strains as an early diverging member of the Brettanomyces/Dekkera clade with high bootstrap support. Sexual reproduction was not observed. The name Brettanomyces acidodurans sp. nov. (holotype: NCAIM Y.02178T; isotypes: CBS 14519T = NRRL Y-63865T = ZIM 2626T, MycoBank no.: MB 819608) is proposed for this highly divergent new yeast species.

  6. Molecular phylogeny and taxonomy of wood mice (genus Apodemus Kaup, 1829) based on complete mtDNA cytochrome b sequences, with emphasis on Chinese species.

    PubMed

    Liu, Xiaoming; Wei, Fuwen; Li, Ming; Jiang, Xuelong; Feng, Zuojian; Hu, Jinchu

    2004-10-01

    Phylogenetic relationships among 15 species of wood mice (genus Apodemus) were reconstructed to explore some long-standing taxonomic problems. The results provided support for the monophyly of the genus Apodemus, but could not reject the hypothesis of paraphyly for this genus. Our data divided the 15 species into four major groups: (1) the Sylvaemus group (A. sylvaticus, A. flavicollis, A. alpicola, and A. uralensis), (2) the Apodemus group (A. peninsulae, A. chevrieri, A. agrarius, A. speciosus, A. draco, A. ilex, A. semotus, A. latronum, and A. mystacinus), (3) A. argenteus, and (4) A. gurkha. Our results also suggested that orestes should be a valid subspecies of A. draco rather than an independent species; in contrast, A. ilex from Yunnan may be regarded as a separate species rather than a synonym of orestes or draco. The species-level status of A. latronum, the status of tscherga as a synonym of A. uralensis, and the status of A. chevrieri as a valid species and the closest sibling species of A. agrarius were further corroborated by our data. Applying a molecular clock with the divergence of Mus and Rattus set at 12 million years ago (Mya) as a calibration point, it was estimated that five old lineages (A. mystacinus and four major groups above) diverged in the late Miocene (7.82-12.74 Mya). Then the Apodemus group (excluding A. mystacinus) split into two subgroups: agrarius and draco, at about 7.17-9.95 Mya. Four species of the Sylvaemus group were estimated to diverge at about 2.92-5.21 Mya. The Hengduan Mountains Region was hypothesized to have played important roles in Apodemus evolutionary histories since the Pleistocene.
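
    The calibration step described in this abstract can be sketched under the simplest possible assumption, a strict molecular clock: a calibration pair with a known split time gives a per-lineage substitution rate, which then converts any other pairwise distance into a time. The function names and every distance value below are hypothetical; only the 12 Mya Mus/Rattus calibration comes from the abstract, and the authors' actual dating method may differ.

```python
# Illustrative strict-clock sketch; the study's dating method may differ.


def calibrate_rate(calibration_distance, calibration_time_mya):
    """Per-lineage rate (substitutions per site per Myr) from a dated split.

    The factor 2 reflects that both lineages accumulate changes after the split.
    """
    return calibration_distance / (2.0 * calibration_time_mya)


def divergence_time(distance, rate):
    """Split time in Myr for a pairwise distance under a strict clock."""
    return distance / (2.0 * rate)


# Hypothetical distances; only the 12 Mya calibration is from the abstract.
mus_rattus_distance = 0.24
rate = calibrate_rate(mus_rattus_distance, 12.0)
for label, dist in [("pair X", 0.10), ("pair Y", 0.18)]:
    print(f"{label}: ~{divergence_time(dist, rate):.1f} Mya")
```

    Under these assumptions the estimate reduces to the calibration time scaled by the ratio of the two distances, so any saturation or rate variation among lineages carries straight into the result.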

  7. Beyond genomic variation--comparison and functional annotation of three Brassica rapa genomes: a turnip, a rapid cycling and a Chinese cabbage.

    PubMed

    Lin, Ke; Zhang, Ningwen; Severing, Edouard I; Nijveen, Harm; Cheng, Feng; Visser, Richard G F; Wang, Xiaowu; de Ridder, Dick; Bonnema, Guusje

    2014-03-31

    Brassica rapa is an economically important crop species. During its long breeding history, a large number of morphotypes have been generated, including leafy vegetables such as Chinese cabbage and pakchoi, turnip tuber crops and oil crops. To investigate the genetic variation underlying this morphological variation, we re-sequenced, assembled and annotated the genomes of two B. rapa subspecies, turnip crops (turnip) and a rapid cycling. We then analysed the two resulting genomes together with the Chinese cabbage Chiifu reference genome to obtain an impression of the B. rapa pan-genome. The number of genes with protein-coding changes between the three genotypes was lower than that among different accessions of Arabidopsis thaliana, which can be explained by the smaller effective population size of B. rapa due to its domestication. Based on orthology to a number of non-brassica species, we estimated the date of divergence among the three B. rapa morphotypes at approximately 250,000 YA, far predating Brassica domestication (5,000-10,000 YA). By analysing genes unique to turnip we found evidence for copy number differences in peroxidases, pointing to a role for the phenylpropanoid biosynthesis pathway in the generation of morphological variation. The estimated date of divergence among three B. rapa morphotypes implies that prior to domestication there was already considerable divergence among B. rapa genotypes. Our study thus provides two new B. rapa reference genomes, delivers a set of computer tools to analyse the resulting pan-genome and uses these to shed light on genetic drivers behind the rich morphological variation found in B. rapa.

  8. A comprehensive and integrative reconstruction of evolutionary history for Anomura (Crustacea: Decapoda)

    PubMed Central

    2013-01-01

    Background The infraorder Anomura has long captivated the attention of evolutionary biologists due to its impressive morphological diversity and ecological adaptations. To date, 2500 extant species have been described but phylogenetic relationships at high taxonomic levels remain unresolved. Here, we reconstruct the evolutionary history—phylogeny, divergence times, character evolution and diversification—of this speciose clade. For this purpose, we sequenced two mitochondrial (16S and 12S) and three nuclear (H3, 18S and 28S) markers for 19 of the 20 extant families, using traditional Sanger and next-generation 454 sequencing methods. Molecular data were combined with 156 morphological characters in order to estimate the largest anomuran phylogeny to date. The anomuran fossil record allowed us to incorporate 31 fossils for divergence time analyses. Results Our best phylogenetic hypothesis (morphological + molecular data) supports most anomuran superfamilies and families as monophyletic. However, three families and eleven genera are recovered as para- and polyphyletic. Divergence time analysis dates the origin of Anomura to the Late Permian ~259 (224–296) MYA with many of the present day families radiating during the Jurassic and Early Cretaceous. Ancestral state reconstruction suggests that carcinization occurred independently 3 times within the group. The invasion of freshwater and terrestrial environments both occurred between the Late Cretaceous and Tertiary. Diversification analyses found the speciation rate to be low across Anomura, and we identify 2 major changes in the tempo of diversification; the most significant at the base of a clade that includes the squat-lobster family Chirostylidae. Conclusions Our findings are compared against current classifications and previous hypotheses of anomuran relationships. Many families and genera appear to be poly- or paraphyletic suggesting a need for further taxonomic revisions at these levels. 
A divergence time analysis provides key insights into the origins of major lineages and events and the timing of morphological (body form) and ecological (habitat) transitions. Living anomuran biodiversity is the product of 2 major changes in the tempo of diversification; our initial insights suggest that the acquisition of a crab-like form did not act as a key innovation. PMID:23786343

  9. The phylogeny of brown lacewings (Neuroptera: Hemerobiidae) reveals multiple reductions in wing venation.

    PubMed

    Garzón-Orduña, Ivonne J; Menchaca-Armenta, Imelda; Contreras-Ramos, Atilano; Liu, Xingyue; Winterton, Shaun L

    2016-09-20

    The last time the phylogenetic relationships among members of the family Hemerobiidae were studied quantitatively was over 12 years ago and based exclusively on morphology. Our study builds upon this morphological evidence by adding sequence data from three gene loci to provide a total evidence phylogeny of brown lacewings (Neuroptera: Hemerobiidae). Thirty-seven species representing nineteen Hemerobiidae genera were compared with outgroups from the families Ithonidae, Psychopsidae and Chrysopidae in Bayesian and parsimony analyses using a single nuclear gene (CAD) and two mitochondrial (16S rDNA and Cytochrome Oxidase I) genes. We compare divergence time estimates of Hemerobiidae cladogenesis under the two most commonly used relaxed clock models and discuss the evolution of wing venation in the family. We recovered a phylogeny largely incongruent with previously published morphological studies, although all but two subfamilies (i.e., Notiobiellinae and Drepanacrinae) were recovered as monophyletic. We found the subfamily Drepanacrinae paraphyletic with respect to Psychobiellinae, and Notiobiellinae to be polyphyletic. We thus offer a revised concept of Notiobiellinae, comprising only Notiobiella Banks, and erect a new subfamily Zachobiellinae including the remaining genera previously placed in Notiobiellinae. Psychobiellinae is synonymized with Drepanacrinae. Unlike the previous hypothesis that proposed a remarkably laddered topology, our tree suggests that hemerobiids diverged as three main clades. 
    Moreover, in contrast to the vein proliferation hypothesis, we found that hemerobiids have instead undergone multiple reductions in the number of radial veins; this scenario questions the relevance of this character as diagnostic of various subfamilies. Our phylogenetic hypothesis and divergence times analysis suggest that extant hemerobiids originated around the end of the Triassic and evolved as three distinct clades that diverged from one another during the Late Jurassic to Early Cretaceous. Contrary to earlier phylogenetic hypotheses, Carobius Banks (Carobiinae) is sister to the previously unplaced genus Notherobius New in a clade more closely related to Sympherobiinae, Megalominae and Zachobiellinae subfam. nov. The addition of taxa which are not available for DNA sequencing should be the focus of future studies, especially Adelphohemerobius Oswald, which is particularly important to test our inferences regarding the evolution of wing venation in Hemerobiidae.

  10. Maternal and child mortality indicators across 187 countries of the world: converging or diverging.

    PubMed

    Goli, Srinivas; Arokiasamy, Perianayagam

    2014-01-01

    This study reassessed the progress achieved since 1990 in maternal and child mortality indicators to test whether the progress is converging or diverging across countries worldwide. The convergence process is examined using standard parametric and non-parametric econometric models of convergence. The results of absolute convergence estimates reveal that progress in maternal and child mortality indicators is diverging for the entire period of 1990-2010 [maternal mortality ratio (MMR) - β = .00033, p < .574; neonatal mortality rate (NNMR) - β = .04367, p < .000; post-neonatal mortality rate (PNMR) - β = .02677, p < .000; under-five mortality rate (U5MR) - β = .00828, p < .000]. In the recent period, this divergence is replaced with convergence for MMR, but divergence persists for all the child mortality indicators. The results of kernel density estimates reveal a considerable reduction in divergence of MMR for the recent period; however, the kernel density distribution plots show more than one 'peak', which indicates the emergence of convergence clubs based on mortality levels. For child mortality indicators, the kernel estimates suggest that divergence is in progress across countries worldwide, with a tendency to converge among countries with low mortality levels. Mere progress in global averages of maternal and child mortality indicators among a global cross-section of countries does not guarantee convergence unless there is a considerable reduction in variance, skewness and range of change.
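
    The absolute convergence estimates reported in this abstract are β coefficients; in the usual textbook formulation, β is the slope from regressing each country's annualised log change in an indicator on the log of its initial level, with a negative slope read as convergence and a positive slope as divergence. The sketch below assumes that formulation, which may not match the authors' exact specification; the function name and all country values are hypothetical.

```python
import math

# Illustrative sketch of a textbook absolute beta-convergence regression.
# The exact model specification used in the study is an assumption here.


def beta_convergence(initial, final, years):
    """OLS slope of annualised log change on log initial level."""
    if len(initial) != len(final) or len(initial) < 2:
        raise ValueError("need matching series with at least two countries")
    x = [math.log(v) for v in initial]
    y = [(math.log(f) - math.log(i)) / years for i, f in zip(initial, final)]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    if sxx == 0:
        raise ValueError("initial levels must not all be equal")
    return sxy / sxx


# Hypothetical mortality rates for three countries over 20 years.
start = [100.0, 50.0, 25.0]
catching_up = [50.0, 40.0, 25.0]     # high-mortality countries improve fastest
falling_behind = [100.0, 40.0, 12.5]  # low-mortality countries improve fastest
print(f"beta = {beta_convergence(start, catching_up, 20):+.4f}")
print(f"beta = {beta_convergence(start, falling_behind, 20):+.4f}")
```

    Because mortality is an indicator where lower is better, a negative slope here means that countries starting with the highest rates are declining fastest.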

  11. Full Genome Sequencing Reveals New Southern African Territories Genotypes Bringing Us Closer to Understanding True Variability of Foot-and-Mouth Disease Virus in Africa

    PubMed Central

    Lasecka-Dykes, Lidia; Wright, Caroline F.; Di Nardo, Antonello; Logan, Grace; Mioulet, Valerie; Jackson, Terry; Tuthill, Tobias J.; Knowles, Nick J.; King, Donald P.

    2018-01-01

    Foot-and-mouth disease virus (FMDV) causes a highly contagious disease of cloven-hooved animals that poses a constant burden on farmers in endemic regions and threatens the livestock industries in disease-free countries. Despite the increased number of publicly available whole genome sequences, FMDV data are biased by the opportunistic nature of sampling. Since whole genomic sequences of Southern African Territories (SAT) are particularly underrepresented, this study sequenced 34 isolates from eastern and southern Africa. Phylogenetic analyses revealed two novel genotypes (that comprised 8/34 of these SAT isolates) which contained unusual 5′ untranslated and non-structural encoding regions. While recombination has occurred between these sequences, phylogeny violation analyses indicated that the high degree of sequence diversity for the novel SAT genotypes has not solely arisen from recombination events. Based on estimates of the timing of ancestral divergence, these data are interpreted as being representative of un-sampled FMDV isolates that have been subjected to geographical isolation within Africa by the effects of the Great African Rinderpest Pandemic (1887–1897), which caused a mass die-out of FMDV-susceptible hosts. These findings demonstrate that further sequencing of African FMDV isolates is likely to reveal more unusual genotypes and will allow for better understanding of natural variability and evolution of FMDV. PMID:29652800

  12. Ancient wolf lineages in India.

    PubMed Central

    Sharma, Dinesh K; Maldonado, Jesus E; Jhala, Yadrendradev V; Fleischer, Robert C

    2004-01-01

    All previously obtained wolf (Canis lupus) and dog (Canis familiaris) mitochondrial (mt) DNA sequences fall within an intertwined and shallow clade (the 'wolf-dog' clade). We sequenced mtDNA of recent and historical samples from 45 wolves from throughout lowland peninsular India and 23 wolves from the Himalayas and Tibetan Plateau and compared these sequences with all available wolf and dog sequences. All 45 lowland Indian wolves have one of four closely related haplotypes that form a well-supported, divergent sister lineage to the wolf-dog clade. This unique lineage may have been independent for more than 400,000 years. Although seven Himalayan wolves from western and central Kashmir fall within the widespread wolf-dog clade, one from Ladakh in eastern Kashmir, nine from Himachal Pradesh, four from Nepal and two from Tibet form a very different basal clade. This lineage contains five related haplotypes that probably diverged from other canids more than 800,000 years ago, but we find no evidence of current barriers to admixture. Thus, the Indian subcontinent has three divergent, ancient and apparently parapatric mtDNA lineages within the morphologically delineated wolf. No haplotypes of either novel lineage are found within a sample of 37 Indian (or other) dogs. Thus, we find no evidence that these two taxa played a part in the domestication of canids. PMID:15101402

  13. Ancient wolf lineages in India.

    PubMed

    Sharma, Dinesh K; Maldonado, Jesus E; Jhala, Yadrendradev V; Fleischer, Robert C

    2004-02-07

    All previously obtained wolf (Canis lupus) and dog (Canis familiaris) mitochondrial (mt) DNA sequences fall within an intertwined and shallow clade (the 'wolf-dog' clade). We sequenced mtDNA of recent and historical samples from 45 wolves from throughout lowland peninsular India and 23 wolves from the Himalayas and Tibetan Plateau and compared these sequences with all available wolf and dog sequences. All 45 lowland Indian wolves have one of four closely related haplotypes that form a well-supported, divergent sister lineage to the wolf-dog clade. This unique lineage may have been independent for more than 400,000 years. Although seven Himalayan wolves from western and central Kashmir fall within the widespread wolf-dog clade, one from Ladakh in eastern Kashmir, nine from Himachal Pradesh, four from Nepal and two from Tibet form a very different basal clade. This lineage contains five related haplotypes that probably diverged from other canids more than 800,000 years ago, but we find no evidence of current barriers to admixture. Thus, the Indian subcontinent has three divergent, ancient and apparently parapatric mtDNA lineages within the morphologically delineated wolf. No haplotypes of either novel lineage are found within a sample of 37 Indian (or other) dogs. Thus, we find no evidence that these two taxa played a part in the domestication of canids.

  14. Complete nuclear ribosomal DNA sequence amplification and molecular analyses of Bangia (Bangiales, Rhodophyta) from China

    NASA Astrophysics Data System (ADS)

    Xu, Jiajie; Jiang, Bo; Chai, Sanming; He, Yuan; Zhu, Jianyi; Shen, Zonggen; Shen, Songdong

    2016-09-01

    Filamentous Bangia, which are distributed extensively throughout the world, have simple and similar morphological characteristics. Scientists can classify these organisms using molecular markers in combination with morphology. We successfully sequenced the complete nuclear ribosomal DNA, approximately 13 kb in length, from a marine Bangia population. We further analyzed the small subunit ribosomal DNA gene (nrSSU) and the internal transcribed spacer (ITS) sequence regions along with nine other marine, and two freshwater Bangia samples from China. Pairwise distances of the nrSSU and 5.8S ribosomal DNA gene sequences show the marine samples grouping together with low divergences (0-0.003; 0-0.006, respectively) from each other, but high divergences (0.123-0.126; 0.198, respectively) from freshwater samples. An exception is the marine sample collected from Weihai, which shows high divergence from both other marine samples (0.063-0.065; 0.129, respectively) and the freshwater samples (0.097; 0.120, respectively). A maximum likelihood phylogenetic tree based on a combined SSU-ITS dataset shows the samples divided into three clades, with the two marine sample clades containing Bangia spp. from North America, Europe, Asia, and Australia; and one freshwater clade, containing Bangia atropurpurea from North America and China.

  15. Divergently expressed gene identification and interaction prediction of long noncoding RNA and mRNA involved in duck reproduction.

    PubMed

    Ren, Jindong; Du, Xue; Zeng, Tao; Chen, Li; Shen, Junda; Lu, Lizhi; Hu, Jianhong

    2017-10-01

    Long noncoding RNAs (lncRNAs) and divergently expressed genes exist widely in different tissues of mammals and birds, in which they are involved in various biological processes. However, there is limited information on their role in the regulation of normal biological processes during differentiation, development, and reproduction in birds. In this study, whole transcriptome strand-specific RNA sequencing of the ovary from young ducks (60 days), first-laying ducks (160 days), and old ducks, i.e., ducks that stopped laying eggs (490 days) was performed. The lncRNAs and mRNAs from these ducks were systematically analyzed and identified by duck genome sequencing in the three study groups. The transcriptome from the duck ovary comprised 15,011 protein-coding genes and 2905 lncRNAs; all the lncRNAs were identified as novel long noncoding transcripts. The comparison of transcriptome data from different study groups identified 2240 divergent transcription genes and 135 divergently expressed lncRNAs, which differed among the groups; most of them were significantly downregulated with age. Among the divergent genes, 38 genes were related to the reproductive process and 6 genes were upregulated. Further prediction analysis revealed that 52 lncRNAs were closely correlated with divergent reproductive mRNAs. More importantly, 6 remarkable lncRNAs were correlated significantly with the conversion of the ovary in different phases. Our results aid in the understanding of the divergent transcriptome of duck ovary in different phases and the underlying mechanisms that drive the specificity of protein-coding genes and lncRNAs in duck ovary.

  16. Assessing the potential of RAD-sequencing to resolve phylogenetic relationships within species radiations: The fly genus Chiastocheta (Diptera: Anthomyiidae) as a case study.

    PubMed

    Suchan, Tomasz; Espíndola, Anahí; Rutschmann, Sereina; Emerson, Brent C; Gori, Kevin; Dessimoz, Christophe; Arrigo, Nils; Ronikier, Michał; Alvarez, Nadir

    2017-09-01

    Determining phylogenetic relationships among recently diverged species has long been a challenge in evolutionary biology. Cytoplasmic DNA markers, which have been widely used, notably in the context of molecular barcoding, have not always proved successful in resolving such phylogenies. However, with the advent of next-generation-sequencing technologies and associated techniques of reduced genome representation, phylogenies of closely related species have been resolved at a much higher detail in the last couple of years. Here we examine the potential and limitations of one of such techniques-Restriction-site Associated DNA (RAD) sequencing, a method that produces thousands of (mostly) anonymous nuclear markers, in disentangling the phylogeny of the fly genus Chiastocheta (Diptera: Anthomyiidae). In Europe, this genus encompasses seven species of seed predators, which have been widely studied in the context of their ecological and evolutionary interactions with the plant Trollius europaeus (Ranunculaceae). So far, phylogenetic analyses using mitochondrial markers failed to resolve monophyly of most of the species from this recently diversified genus, suggesting that their taxonomy may need a revision. However, relying on a single, non-recombining marker and ignoring potential incongruences between mitochondrial and nuclear loci may provide an incomplete account of the lineage history. In this study, we applied both classical Sanger sequencing of three mtDNA regions and RAD-sequencing, for reconstructing the phylogeny of the genus. Contrasting with results based on mitochondrial markers, RAD-sequencing analyses retrieved the monophyly of all seven species, in agreement with the morphological species assignment. We found robust nuclear-based species assignment of individual samples, and low levels of estimated contemporary gene flow among them. 
However, despite recovering species' monophyly, interspecific relationships varied depending on the set of RAD loci considered, producing contradictory topologies. Moreover, coalescence-based phylogenetic analyses revealed low supports for most of the interspecific relationships. Our results indicate that despite the higher performance of RAD-sequencing in terms of species trees resolution compared to cytoplasmic markers, reconstructing inter-specific relationships among recently-diverged lineages may lie beyond the possibilities offered by large sets of RAD-sequencing markers in cases of strong gene tree incongruence. Copyright © 2017 Elsevier Inc. All rights reserved.

  17. Analysis of Complete Nucleotide Sequences of 12 Gossypium Chloroplast Genomes: Origin and Evolution of Allotetraploids

    PubMed Central

    Xu, Qin; Xiong, Guanjun; Li, Pengbo; He, Fei; Huang, Yi; Wang, Kunbo; Li, Zhaohu; Hua, Jinping

    2012-01-01

    Background Cotton (Gossypium spp.) is a model system for the analysis of polyploidization. Although ascertaining the donor species of allotetraploid cotton has been intensively studied, sequence comparison of Gossypium chloroplast genomes is still of interest to understand the mechanisms underlining the evolution of Gossypium allotetraploids, while it is generally accepted that the parents were A- and D-genome containing species. Here we performed a comparative analysis of 13 Gossypium chloroplast genomes, twelve of which are presented here for the first time. Methodology/Principal Findings The size of 12 chloroplast genomes under study varied from 159,959 bp to 160,433 bp. The chromosomes were highly similar having >98% sequence identity. They encoded the same set of 112 unique genes which occurred in a uniform order with only slightly different boundary junctions. Divergence due to indels as well as substitutions was examined separately for genome, coding and noncoding sequences. The genome divergence was estimated as 0.374% to 0.583% between allotetraploid species and A-genome, and 0.159% to 0.454% within allotetraploids. Forty protein-coding genes were completely identical at the protein level, and 20 intergenic sequences were completely conserved. The 9 allotetraploids shared 5 insertions and 9 deletions in whole genome, and 7-bp substitutions in protein-coding genes. The phylogenetic tree confirmed a close relationship between allotetraploids and the ancestor of A-genome, and the allotetraploids were divided into four separate groups. Progenitor allotetraploid cotton originated 0.43–0.68 million years ago (MYA). Conclusion Despite high degree of conservation between the Gossypium chloroplast genomes, sequence variations among species could still be detected. Gossypium chloroplast genomes preferred for 5-bp indels and 1–3-bp indels are mainly attributed to the SSR polymorphisms. 
This study supports that the common ancestor of diploid A-genome species in Gossypium is the maternal source of extant allotetraploid species and allotetraploids have a monophyletic origin. G. hirsutum AD1 lineages have experienced more sequence variations than other allotetraploids in intergenic regions. The available complete nucleotide sequences of 12 Gossypium chloroplast genomes should facilitate studies to uncover the molecular mechanisms of compartmental co-evolution and speciation of Gossypium allotetraploids. PMID:22876273

  18. Divergence in substrate specificity by the vOTU domain of various strains of highly-pathogenic PRRSV and the implications to pathogenicity

    USDA-ARS's Scientific Manuscript database

    Porcine reproductive and respiratory syndrome virus (PRRSV) is widespread, varies widely in sequence and virulence among divergent strains, and causes an economically destructive disease. A viral ovarian tumor domain protease (vOTU) has been previously identified within the nonstructural protein...

  19. New genes from old: asymmetric divergence of gene duplicates and the evolution of development.

    PubMed

    Holland, Peter W H; Marlétaz, Ferdinand; Maeso, Ignacio; Dunwell, Thomas L; Paps, Jordi

    2017-02-05

    Gene duplications and gene losses have been frequent events in the evolution of animal genomes, with the balance between these two dynamic processes contributing to major differences in gene number between species. After gene duplication, it is common for both daughter genes to accumulate sequence change at approximately equal rates. In some cases, however, the accumulation of sequence change is highly uneven with one copy radically diverging from its paralogue. Such 'asymmetric evolution' seems commoner after tandem gene duplication than after whole-genome duplication, and can generate substantially novel genes. We describe examples of asymmetric evolution in duplicated homeobox genes of moths, molluscs and mammals, in each case generating new homeobox genes that were recruited to novel developmental roles. The prevalence of asymmetric divergence of gene duplicates has been underappreciated, in part, because the origin of highly divergent genes can be difficult to resolve using standard phylogenetic methods. This article is part of the themed issue 'Evo-devo in the genomics era, and the origins of morphological diversity'. © 2016 The Author(s).

  20. Divergence, hybridization, and recombination in the mitochondrial genome of the human pathogenic yeast Cryptococcus gattii.

    PubMed

    Xu, Jianping; Yan, Zhun; Guo, Hong

    2009-06-01

    The inheritance of mitochondrial genes and genomes is uniparental in most sexual eukaryotes. This pattern of inheritance makes mitochondrial genomes in natural populations effectively clonal. Here, we examined the mitochondrial population genetics of the emerging human pathogenic fungus Cryptococcus gattii. The DNA sequences for five mitochondrial DNA fragments were obtained from each of 50 isolates belonging to two evolutionarily divergent lineages, VGI and VGII. Our analyses revealed a greater sequence diversity within VGI than within VGII, consistent with observations of the nuclear genes. The combined analyses of all five gene fragments indicated significant divergence between VGI and VGII. However, the five individual genealogies showed different relationships among the isolates, consistent with recent hybridization and mitochondrial gene transfer between the two lineages. Population genetic analyses of the multilocus data identified evidence for predominantly clonal mitochondrial population structures within both lineages. Interestingly, there were clear signatures of recombination among mitochondrial genes within the VGII lineage. Our analyses suggest historical mitochondrial genome divergence within C. gattii, but there is evidence for recent hybridization and recombination in the mitochondrial genome of this important human yeast pathogen.

  1. A Hidden Markov Model Approach for Simultaneously Estimating Local Ancestry and Admixture Time Using Next Generation Sequence Data in Samples of Arbitrary Ploidy

    PubMed Central

    Nielsen, Rasmus

    2017-01-01

    Admixture—the mixing of genomes from divergent populations—is increasingly appreciated as a central process in evolution. To characterize and quantify patterns of admixture across the genome, a number of methods have been developed for local ancestry inference. However, existing approaches have a number of shortcomings. First, all local ancestry inference methods require some prior assumption about the expected ancestry tract lengths. Second, existing methods generally require genotypes, which is not feasible to obtain for many next-generation sequencing projects. Third, many methods assume samples are diploid, however a wide variety of sequencing applications will fail to meet this assumption. To address these issues, we introduce a novel hidden Markov model for estimating local ancestry that models the read pileup data, rather than genotypes, is generalized to arbitrary ploidy, and can estimate the time since admixture during local ancestry inference. We demonstrate that our method can simultaneously estimate the time since admixture and local ancestry with good accuracy, and that it performs well on samples of high ploidy—i.e. 100 or more chromosomes. As this method is very general, we expect it will be useful for local ancestry inference in a wider variety of populations than what previously has been possible. We then applied our method to pooled sequencing data derived from populations of Drosophila melanogaster on an ancestry cline on the east coast of North America. We find that regions of local recombination rates are negatively correlated with the proportion of African ancestry, suggesting that selection against foreign ancestry is the least efficient in low recombination regions. Finally we show that clinal outlier loci are enriched for genes associated with gene regulatory functions, consistent with a role of regulatory evolution in ecological adaptation of admixed D. melanogaster populations. 
Our results illustrate the potential of local ancestry inference for elucidating fundamental evolutionary processes. PMID:28045893

  2. Evaluating, Comparing, and Interpreting Protein Domain Hierarchies

    PubMed Central

    2014-01-01

    Abstract Arranging protein domain sequences hierarchically into evolutionarily divergent subgroups is important for investigating evolutionary history, for speeding up web-based similarity searches, for identifying sequence determinants of protein function, and for genome annotation. However, whether or not a particular hierarchy is optimal is often unclear, and independently constructed hierarchies for the same domain can often differ significantly. This article describes methods for statistically evaluating specific aspects of a hierarchy, for probing the criteria underlying its construction and for direct comparisons between hierarchies. Information theoretical notions are used to quantify the contributions of specific hierarchical features to the underlying statistical model. Such features include subhierarchies, sequence subgroups, individual sequences, and subgroup-associated signature patterns. Underlying properties are graphically displayed in plots of each specific feature's contributions, in heat maps of pattern residue conservation, in “contrast alignments,” and through cross-mapping of subgroups between hierarchies. Together, these approaches provide a deeper understanding of protein domain functional divergence, reveal uncertainties caused by inconsistent patterns of sequence conservation, and help resolve conflicts between competing hierarchies. PMID:24559108

  3. Genome Sequences of Akhmeta Virus, an Early Divergent Old World Orthopoxvirus.

    PubMed

    Gao, Jinxin; Gigante, Crystal; Khmaladze, Ekaterine; Liu, Pengbo; Tang, Shiyuyun; Wilkins, Kimberly; Zhao, Kun; Davidson, Whitni; Nakazawa, Yoshinori; Maghlakelidze, Giorgi; Geleishvili, Marika; Kokhreidze, Maka; Carroll, Darin S; Emerson, Ginny; Li, Yu

    2018-05-12

    Annotated whole genome sequences of three isolates of the Akhmeta virus (AKMV), a novel species of orthopoxvirus (OPXV), isolated from the Akhmeta and Vani regions of the country Georgia, are presented and discussed. The AKMV genome is similar in genomic content and structure to that of the cowpox virus (CPXV), but a lower sequence identity was found between AKMV and Old World OPXVs than between other known species of Old World OPXVs. Phylogenetic analysis showed that AKMV diverged prior to other Old World OPXV. AKMV isolates formed a monophyletic clade in the OPXV phylogeny, yet the sequence variability between AKMV isolates was higher than between the monkeypox virus strains in the Congo basin and West Africa. An AKMV isolate from Vani contained approximately six kb sequence in the left terminal region that shared a higher similarity with CPXV than with other AKMV isolates, whereas the rest of the genome was most similar to AKMV, suggesting recombination between AKMV and CPXV in a region containing several host range and virulence genes.

  4. RECOVIR Software for Identifying Viruses

    NASA Technical Reports Server (NTRS)

    Chakravarty, Sugoto; Fox, George E.; Zhu, Dianhui

    2013-01-01

    Most single-stranded RNA (ssRNA) viruses mutate rapidly to generate a large number of strains with highly divergent capsid sequences. Determining the capsid residues or nucleotides that uniquely characterize these strains is critical in understanding the strain diversity of these viruses. RECOVIR (an acronym for "recognize viruses") software predicts the strains of some ssRNA viruses from their limited sequence data. Novel phylogenetic-tree-based databases of protein or nucleic acid residues that uniquely characterize these virus strains are created. Strains of input virus sequences (partial or complete) are predicted through residue-wise comparisons with the databases. RECOVIR uses unique characterizing residues to identify automatically strains of partial or complete capsid sequences of picorna and caliciviruses, two of the most highly diverse ssRNA virus families. Partition-wise comparisons of the database residues with the corresponding residues of more than 300 complete and partial sequences of these viruses resulted in correct strain identification for all of these sequences. This study shows the feasibility of creating databases of hitherto unknown residues uniquely characterizing the capsid sequences of two of the most highly divergent ssRNA virus families. These databases enable automated strain identification from partial or complete capsid sequences of these human and animal pathogens.

  5. EAPhy: A Flexible Tool for High-throughput Quality Filtering of Exon-alignments and Data Processing for Phylogenetic Methods.

    PubMed

    Blom, Mozes P K

    2015-08-05

    Recently developed molecular methods enable geneticists to target and sequence thousands of orthologous loci and infer evolutionary relationships across the tree of life. Large numbers of genetic markers benefit species tree inference but visual inspection of alignment quality, as traditionally conducted, is challenging with thousands of loci. Furthermore, due to the impracticality of repeated visual inspection with alternative filtering criteria, the potential consequences of using datasets with different degrees of missing data remain nominally explored in most empirical phylogenomic studies. In this short communication, I describe a flexible high-throughput pipeline designed to assess alignment quality and filter exonic sequence data for subsequent inference. The stringency criteria for alignment quality and missing data can be adapted based on the expected level of sequence divergence. Each alignment is automatically evaluated based on the stringency criteria specified, significantly reducing the number of alignments that require visual inspection. By developing a rapid method for alignment filtering and quality assessment, the consistency of phylogenetic estimation based on exonic sequence alignments can be further explored across distinct inference methods, while accounting for different degrees of missing data.

  6. TITAN: inference of copy number architectures in clonal cell populations from tumor whole-genome sequence data.

    PubMed

    Ha, Gavin; Roth, Andrew; Khattra, Jaswinder; Ho, Julie; Yap, Damian; Prentice, Leah M; Melnyk, Nataliya; McPherson, Andrew; Bashashati, Ali; Laks, Emma; Biele, Justina; Ding, Jiarui; Le, Alan; Rosner, Jamie; Shumansky, Karey; Marra, Marco A; Gilks, C Blake; Huntsman, David G; McAlpine, Jessica N; Aparicio, Samuel; Shah, Sohrab P

    2014-11-01

    The evolution of cancer genomes within a single tumor creates mixed cell populations with divergent somatic mutational landscapes. Inference of tumor subpopulations has been disproportionately focused on the assessment of somatic point mutations, whereas computational methods targeting evolutionary dynamics of copy number alterations (CNA) and loss of heterozygosity (LOH) in whole-genome sequencing data remain underdeveloped. We present a novel probabilistic model, TITAN, to infer CNA and LOH events while accounting for mixtures of cell populations, thereby estimating the proportion of cells harboring each event. We evaluate TITAN on idealized mixtures, simulating clonal populations from whole-genome sequences taken from genomically heterogeneous ovarian tumor sites collected from the same patient. In addition, we show in 23 whole genomes of breast tumors that the inference of CNA and LOH using TITAN critically informs population structure and the nature of the evolving cancer genome. Finally, we experimentally validated subclonal predictions using fluorescence in situ hybridization (FISH) and single-cell sequencing from an ovarian cancer patient sample, thereby recapitulating the key modeling assumptions of TITAN. © 2014 Ha et al.; Published by Cold Spring Harbor Laboratory Press.

  7. First comparative insight into the architecture of COI mitochondrial minicircle molecules of dicyemids reveals marked inter-species variation.

    PubMed

    Catalano, Sarah R; Whittington, Ian D; Donnellan, Stephen C; Bertozzi, Terry; Gillanders, Bronwyn M

    2015-07-01

    Dicyemids, poorly known parasites of benthic cephalopods, are one of the few phyla in which mitochondrial (mt) genome architecture departs from the typical ~16 kb circular metazoan genome. In addition to a putative circular genome, a series of mt minicircles that each comprises the mt encoded units (I-III) of the cytochrome c oxidase complex have been reported. Whether the structure of the mt minicircles is a consistent feature among dicyemid species is unknown. Here we analyse the complete cytochrome c oxidase subunit I (COI) minicircle molecule, containing the COI gene and an associated non-coding region (NCR), for ten dicyemid species, allowing for first time comparisons between species of minicircle architecture, NCR function and inferences of minicircle replication. Divergence in COI nucleotide sequences between dicyemid species was high (average net divergence = 31.6%) while within species diversity was lower (average net divergence = 0.2%). The NCR and putative 5' section of the COI gene were highly divergent between dicyemid species (average net nucleotide divergence of putative 5' COI section = 61.1%). No tRNA genes were found in the NCR, although palindrome sequences with the potential to form stem-loop structures were identified in some species, which may play a role in transcription or other biological processes.
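The "net divergence" figures in this record can be made concrete with a short sketch. It assumes the term refers to the usual net between-group distance, in which the mean within-group distance is subtracted from the mean between-group distance; the abstract does not spell out the formula. The sequences, names, and the use of uncorrected p-distance are hypothetical choices for illustration only.

```python
from itertools import combinations, product
from statistics import mean


def p_distance(a: str, b: str) -> float:
    """Fraction of aligned sites that differ (uncorrected p-distance)."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def mean_within(seqs: list[str]) -> float:
    """Mean pairwise distance inside one group (needs at least two sequences)."""
    return mean(p_distance(a, b) for a, b in combinations(seqs, 2))


def mean_between(xs: list[str], ys: list[str]) -> float:
    """Mean distance over all pairs with one sequence from each group."""
    return mean(p_distance(a, b) for a, b in product(xs, ys))


def net_divergence(xs: list[str], ys: list[str]) -> float:
    """Net divergence: d_XY - (d_X + d_Y) / 2."""
    return mean_between(xs, ys) - (mean_within(xs) + mean_within(ys)) / 2


# Hypothetical aligned COI fragments from two species.
species_x = ["ACGTACGTAC", "ACGTACGTAT"]
species_y = ["TCGAACCTAC", "TCGAACCTAT"]
d_net = net_divergence(species_x, species_y)
```

Here the mean between-species distance is 0.35 and each species has a within-species distance of 0.1, so the net divergence is 0.25. Subtracting the within-group term removes the share of between-group difference that is simply polymorphism already present inside each species.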

  8. CNL Disease Resistance Genes in Soybean and Their Evolutionary Divergence

    PubMed Central

    Nepal, Madhav P; Benson, Benjamin V

    2015-01-01

    Disease resistance genes (R-genes) encode proteins involved in detecting pathogen attack and activating downstream defense molecules. Recent availability of soybean genome sequences makes it possible to examine the diversity of gene families including disease-resistant genes. The objectives of this study were to identify coiled-coil NBS-LRR (= CNL) R-genes in soybean, infer their evolutionary relationships, and assess structural as well as functional divergence of the R-genes. Profile hidden Markov models were used for sequence identification and model-based maximum likelihood was used for phylogenetic analysis, and variation in chromosomal positioning, gene clustering, and functional divergence were assessed. We identified 188 soybean CNL genes nested into four clades consistent to their orthologs in Arabidopsis. Gene clustering analysis revealed the presence of 41 gene clusters located on 13 different chromosomes. Analyses of the Ks-values and chromosomal positioning suggest duplication events occurring at varying timescales, and an extrapericentromeric positioning may have facilitated their rapid evolution. Each of the four CNL clades exhibited distinct patterns of gene expression. Phylogenetic analysis further supported the extrapericentromeric positioning effect on the divergence and retention of the CNL genes. The results are important for understanding the diversity and divergence of CNL genes in soybean, which would have implication in soybean crop improvement in future. PMID:25922568

  9. CNL Disease Resistance Genes in Soybean and Their Evolutionary Divergence.

    PubMed

    Nepal, Madhav P; Benson, Benjamin V

    2015-01-01

    Disease resistance genes (R-genes) encode proteins involved in detecting pathogen attack and activating downstream defense molecules. Recent availability of soybean genome sequences makes it possible to examine the diversity of gene families including disease-resistant genes. The objectives of this study were to identify coiled-coil NBS-LRR (= CNL) R-genes in soybean, infer their evolutionary relationships, and assess structural as well as functional divergence of the R-genes. Profile hidden Markov models were used for sequence identification and model-based maximum likelihood was used for phylogenetic analysis, and variation in chromosomal positioning, gene clustering, and functional divergence were assessed. We identified 188 soybean CNL genes nested into four clades consistent to their orthologs in Arabidopsis. Gene clustering analysis revealed the presence of 41 gene clusters located on 13 different chromosomes. Analyses of the Ks-values and chromosomal positioning suggest duplication events occurring at varying timescales, and an extrapericentromeric positioning may have facilitated their rapid evolution. Each of the four CNL clades exhibited distinct patterns of gene expression. Phylogenetic analysis further supported the extrapericentromeric positioning effect on the divergence and retention of the CNL genes. The results are important for understanding the diversity and divergence of CNL genes in soybean, which would have implication in soybean crop improvement in future.

  10. Functionally conserved enhancers with divergent sequences in distant vertebrates

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yang, Song; Oksenberg, Nir; Takayama, Sachiko

    To examine the contributions of sequence and function conservation in the evolution of enhancers, we systematically identified enhancers whose sequences are not conserved among distant groups of vertebrate species, but have homologous function and are likely to be derived from a common ancestral sequence. In conclusion, our approach combined comparative genomics and epigenomics to identify potential enhancer sequences in the genomes of three groups of distantly related vertebrate species.

  11. Functionally conserved enhancers with divergent sequences in distant vertebrates

    DOE PAGES

    Yang, Song; Oksenberg, Nir; Takayama, Sachiko; ...

    2015-10-30

    To examine the contributions of sequence and function conservation in the evolution of enhancers, we systematically identified enhancers whose sequences are not conserved among distant groups of vertebrate species, but have homologous function and are likely to be derived from a common ancestral sequence. In conclusion, our approach combined comparative genomics and epigenomics to identify potential enhancer sequences in the genomes of three groups of distantly related vertebrate species.

  12. Statistics of surface divergence and their relation to air-water gas transfer velocity

    NASA Astrophysics Data System (ADS)

    Asher, William E.; Liang, Hanzhuang; Zappa, Christopher J.; Loewen, Mark R.; Mukto, Moniz A.; Litchendorf, Trina M.; Jessup, Andrew T.

    2012-05-01

    Air-sea gas fluxes are generally defined in terms of the air/water concentration difference of the gas and the gas transfer velocity, kL. Because it is difficult to measure kL in the ocean, it is often parameterized using more easily measured physical properties. Surface divergence theory suggests that infrared (IR) images of the water surface, which contain information concerning the movement of water very near the air-water interface, might be used to estimate kL. Therefore, a series of experiments testing whether IR imagery could provide a convenient means for estimating the surface divergence applicable to air-sea exchange were conducted in a synthetic jet array tank embedded in a wind tunnel. Gas transfer velocities were measured as a function of wind stress and mechanically generated turbulence; laser-induced fluorescence was used to measure the concentration of carbon dioxide in the top 300 μm of the water surface; IR imagery was used to measure the spatial and temporal distribution of the aqueous skin temperature; and particle image velocimetry (PIV) was used to measure turbulence at a depth of 1 cm below the air-water interface. It is shown that an estimate of the surface divergence for both wind-shear driven turbulence and mechanically generated turbulence can be derived from the surface skin temperature. The estimates derived from the IR images are compared to velocity field divergences measured by the PIV and to independent estimates of the divergence made using the laser-induced fluorescence data. Divergence is shown to scale with kL values measured using gaseous tracers as predicted by conceptual models for both wind-driven and mechanically generated turbulence.
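As background to the relationships this record refers to, the flux definition and one commonly quoted form of the surface divergence scaling can be written as follows. This is a sketch of the general conceptual model, not the authors' specific parameterization; the symbols $\Delta C$, $D$, $\beta_{\mathrm{rms}}$ and the constant $c$ are notation assumed here and do not appear in the abstract.

```latex
% Flux across the interface: transfer velocity times the air/water
% concentration difference of the gas.
F = k_L \,\Delta C

% Surface divergence scaling (assumed general form): the transfer velocity
% grows with the square root of the molecular diffusivity D and of the
% root-mean-square surface divergence, where the surface divergence is
% \beta = \partial u/\partial x + \partial v/\partial y evaluated at the interface.
k_L = c \,\sqrt{D \,\beta_{\mathrm{rms}}}
```

Under this form, a fourfold increase in the rms surface divergence would double the predicted transfer velocity for a given gas, which is the kind of scaling the experiments described above set out to test.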

  13. Centromere location in Arabidopsis is unaltered by extreme divergence in CENH3 protein sequence.

    PubMed

    Maheshwari, Shamoni; Ishii, Takayoshi; Brown, C Titus; Houben, Andreas; Comai, Luca

    2017-03-01

    During cell division, spindle fibers attach to chromosomes at centromeres. The DNA sequence at regional centromeres is fast evolving with no conserved genetic signature for centromere identity. Instead CENH3, a centromere-specific histone H3 variant, is the epigenetic signature that specifies centromere location across both plant and animal kingdoms. Paradoxically, CENH3 is also adaptively evolving. An ongoing question is whether CENH3 evolution is driven by a functional relationship with the underlying DNA sequence. Here, we demonstrate that despite extensive protein sequence divergence, CENH3 histones from distant species assemble centromeres on the same underlying DNA sequence. We first characterized the organization and diversity of centromere repeats in wild-type Arabidopsis thaliana. We show that A. thaliana CENH3-containing nucleosomes exhibit a strong preference for a unique subset of centromeric repeats. These sequences are largely missing from the genome assemblies and represent the youngest and most homogeneous class of repeats. Next, we tested the evolutionary specificity of this interaction in a background in which the native A. thaliana CENH3 is replaced with CENH3s from distant species. Strikingly, we find that CENH3 from Lepidium oleraceum and Zea mays, although specifying epigenetically weaker centromeres that result in genome elimination upon outcrossing, show a binding pattern on A. thaliana centromere repeats that is indistinguishable from the native CENH3. Our results demonstrate positional stability of a highly diverged CENH3 on independently evolved repeats, suggesting that the sequence specificity of centromeres is determined by a mechanism independent of CENH3. © 2017 Maheshwari et al.; Published by Cold Spring Harbor Laboratory Press.

  14. Genotype imputation in a coalescent model with infinitely-many-sites mutation

    PubMed Central

    Huang, Lucy; Buzbas, Erkan O.; Rosenberg, Noah A.

    2012-01-01

    Empirical studies have identified population-genetic factors as important determinants of the properties of genotype-imputation accuracy in imputation-based disease association studies. Here, we develop a simple coalescent model of three sequences that we use to explore the theoretical basis for the influence of these factors on genotype-imputation accuracy, under the assumption of infinitely-many-sites mutation. Employing a demographic model in which two populations diverged at a given time in the past, we derive the approximate expectation and variance of imputation accuracy in a study sequence sampled from one of the two populations, choosing between two reference sequences, one sampled from the same population as the study sequence and the other sampled from the other population. We show that under this model, imputation accuracy—as measured by the proportion of polymorphic sites that are imputed correctly in the study sequence—increases in expectation with the mutation rate, the proportion of the markers in a chromosomal region that are genotyped, and the time to divergence between the study and reference populations. Each of these effects derives largely from an increase in information available for determining the reference sequence that is genetically most similar to the sequence targeted for imputation. We analyze as a function of divergence time the expected gain in imputation accuracy in the target using a reference sequence from the same population as the target rather than from the other population. Together with a growing body of empirical investigations of genotype imputation in diverse human populations, our modeling framework lays a foundation for extending imputation techniques to novel populations that have not yet been extensively examined. PMID:23079542

  15. Genetic and phylogenetic divergence of feline immunodeficiency virus in the puma (Puma concolor).

    PubMed Central

    Carpenter, M A; Brown, E W; Culver, M; Johnson, W E; Pecon-Slattery, J; Brousset, D; O'Brien, S J

    1996-01-01

    Feline immunodeficiency virus (FIV) is a lentivirus which causes an AIDS-like disease in domestic cats (Felis catus). A number of other felid species, including the puma (Puma concolor), carry a virus closely related to domestic cat FIV. Serological testing revealed the presence of antibodies to FIV in 22% of 434 samples from throughout the geographic range of the puma. FIV-Pco pol gene sequences isolated from pumas revealed extensive sequence diversity, greater than has been documented in the domestic cat. The puma sequences formed two highly divergent groups, analogous to the clades which have been defined for domestic cat and lion (Panthera leo) FIV. The puma clade A was made up of samples from Florida and California, whereas clade B consisted of samples from other parts of North America, Central America, and Brazil. The difference between these two groups was as great as that reported among three lion FIV clades. Within puma clades, sequence variation is large, comparable to between-clade differences seen for domestic cat clades, allowing recognition of 15 phylogenetic lineages (subclades) among puma FIV-Pco. Large sequence divergence among isolates, nearly complete species monophyly, and widespread geographic distribution suggest that FIV-Pco has evolved within the puma species for a long period. The sequence data provided evidence for vertical transmission of FIV-Pco from mothers to their kittens, for coinfection of individuals by two different viral strains, and for cross-species transmission of FIV from a domestic cat to a puma. These factors may all be important for understanding the epidemiology and natural history of FIV in the puma. PMID:8794304

  16. Cretaceous origin of giant rhinoceros beetles (Dynastini; Coleoptera) and correlation of their evolution with the Pangean breakup.

    PubMed

    Jin, Haofei; Yonezawa, Takahiro; Zhong, Yang; Kishino, Hirohisa; Hasegawa, Masami

    2017-03-17

    The giant rhinoceros beetles (Dynastini, Scarabaeidae, Coleoptera) are distributed in tropical and temperate regions in Asia, America and Africa. Recent molecular phylogenetic studies have revealed that the giant rhinoceros beetles can be divided into three clades representing Asia, America and Africa. Although a correlation between their evolution and the continental drift during the Pangean breakup was suggested, there is no accurate divergence time estimation among the three clades based on molecular data. Moreover, there is a long chronological gap between the timing of the Pangean breakup (Cretaceous: 110-148 Ma) and the emergence of the oldest fossil record (Oligocene: 33 Ma). In this study, we estimated their divergence times based on molecular data, using several combinations of fossil calibration sets, and obtained robust estimates. The inter-continental divergence events among the clades were estimated to have occurred about 99 Ma (Asian clade and others) and 78 Ma (American clade and African clade), both of which are after the Pangean breakup. These estimates suggest their inter-continental divergences occurred by overseas sweepstakes dispersal, rather than by vicariances of the population caused by the Pangean breakup.

  17. Whale phylogeny and rapid radiation events revealed using novel retroposed elements and their flanking sequences.

    PubMed

    Chen, Zhuo; Xu, Shixia; Zhou, Kaiya; Yang, Guang

    2011-10-27

    A diversity of hypotheses have been proposed based on both morphological and molecular data to reveal phylogenetic relationships within the order Cetacea (dolphins, porpoises, and whales), and great progress has been made in the past two decades. However, there is still some controversy concerning relationships among certain cetacean taxa such as river dolphins and delphinoid species, which needs to be further addressed with more markers in an effort to address unresolved portions of the phylogeny. An analysis of additional SINE insertions and SINE-flanking sequences supported the monophyly of the order Cetacea as well as Odontocete, Delphinoidea (Delphinidae + Phocoenidae + Monodontidae), and Delphinidae. A sister relationship between Delphinidae and Phocoenidae + Monodontidae was supported, and members of classical river dolphins and the genera Tursiops and Stenella were found to be paraphyletic. Estimates of divergence times revealed rapid divergences of basal Odontocete lineages in the Oligocene and Early Miocene, and a recent rapid diversification of Delphinidae in the Middle-Late Miocene and Pliocene within a narrow time frame. Several novel SINEs were found to differentiate Delphinidae from the other two families (Monodontidae and Phocoenidae), whereas the sister grouping of the latter two families with exclusion of Delphinidae was further revealed using the SINE-flanking sequences. Interestingly, some anomalous PCR amplification patterns of SINE insertions were detected, which can be explained as the result of potential ancestral SINE polymorphisms and incomplete lineage sorting. Although a few loci were potentially anomalous, this study demonstrated that the SINE-based approach is a powerful tool in phylogenetic studies. Identifying additional SINE elements that resolve the relationships in the superfamily Delphinoidea and family Delphinidae will be important steps forward in completely resolving cetacean phylogenetic relationships in the future.

  18. Novel microsatellite DNA markers indicate strict parthenogenesis and few genotypes in the invasive willow sawfly Nematus oligospilus.

    PubMed

    Caron, V; Norgate, M; Ede, F J; Nyman, T; Sunnucks, P

    2013-02-01

    Invasive organisms can have major impacts on the environment. Some invasive organisms are parthenogenetic in their invasive range and, therefore, exist as a number of asexual lineages (=clones). Determining the reproductive mode of invasive species has important implications for understanding the evolutionary genetics of such species, especially for management-relevant traits. The willow sawfly Nematus oligospilus Förster (Hymenoptera: Tenthredinidae) has been introduced unintentionally into several countries in the Southern Hemisphere where it has subsequently become invasive. To assess the population expansion, reproductive mode and host-plant relationships of this insect, microsatellite markers were developed and applied to natural populations sampled from the native and expanded range, along with sequencing of the cytochrome-oxidase I mitochondrial DNA (mtDNA) region. Other tenthredinids across a spectrum of taxonomic similarity to N. oligospilus and having a range of life strategies were also tested. Strict parthenogenesis was apparent within invasive N. oligospilus populations throughout the Southern Hemisphere, which comprised only a small number of genotypes. Sequences of mtDNA were identical for all individuals tested in the invasive range. The microsatellite markers were used successfully in several sawfly species, especially Nematus spp. and other genera of the Nematini tribe, with the degree of success inversely related to genetic divergence as estimated from COI sequences. The confirmation of parthenogenetic reproduction in N. oligospilus and the fact that it has a very limited pool of genotypes have important implications for understanding and managing this species and its biology, including in terms of phenotypic diversity, host relationships, implications for spread and future adaptive change. It would appear to be an excellent model study system for understanding evolution of invasive parthenogens that diverge without sexual reproduction and genetic recombination.

  19. Evolutionary rate of a gene affected by chromosomal position.

    PubMed

    Perry, J; Ashworth, A

    1999-09-09

    Genes evolve at different rates depending on the strength of selective pressure to maintain their function. Chromosomal position can also have an influence [1] [2]. The pseudoautosomal region (PAR) of mammalian sex chromosomes is a small region of sequence identity that is the site of an obligatory pairing and recombination event between the X and Y chromosomes during male meiosis [3] [4] [5] [6]. During female meiosis, X chromosomes can pair and recombine along their entire length. Recombination in the PAR is therefore approximately 10 times greater in male meiosis compared with female meiosis [4] [5] [6]. The gene Fxy (also known as MID1 [7]) spans the pseudoautosomal boundary (PAB) in the laboratory mouse (Mus musculus domesticus, C57BL/6) such that the 5' three exons of the gene are located on the X chromosome but the seven exons encoding the carboxy-terminal two-thirds of the protein are located within the PAR and are therefore present on both the X and Y chromosomes [8]. In humans [7] [9], the rat, and the wild mouse species Mus spretus, the gene is entirely X-unique. Here, we report that the rate of sequence divergence of the 3' end of the Fxy gene is much higher (estimated at 170-fold higher for synonymous sites) when pseudoautosomal (present on both the X and Y chromosomes) than when X-unique. Thus, chromosomal position can directly affect the rate of evolution of a gene. This finding also provides support for the suggestion that regions of the genome with a high recombination frequency, such as the PAR, may have an intrinsically elevated rate of sequence divergence.

  20. Whale phylogeny and rapid radiation events revealed using novel retroposed elements and their flanking sequences

    PubMed Central

    2011-01-01

    Background A diversity of hypotheses have been proposed based on both morphological and molecular data to reveal phylogenetic relationships within the order Cetacea (dolphins, porpoises, and whales), and great progress has been made in the past two decades. However, there is still some controversy concerning relationships among certain cetacean taxa such as river dolphins and delphinoid species, which needs to be further addressed with more markers in an effort to address unresolved portions of the phylogeny. Results An analysis of additional SINE insertions and SINE-flanking sequences supported the monophyly of the order Cetacea as well as Odontocete, Delphinoidea (Delphinidae + Phocoenidae + Monodontidae), and Delphinidae. A sister relationship between Delphinidae and Phocoenidae + Monodontidae was supported, and members of classical river dolphins and the genera Tursiops and Stenella were found to be paraphyletic. Estimates of divergence times revealed rapid divergences of basal Odontocete lineages in the Oligocene and Early Miocene, and a recent rapid diversification of Delphinidae in the Middle-Late Miocene and Pliocene within a narrow time frame. Conclusions Several novel SINEs were found to differentiate Delphinidae from the other two families (Monodontidae and Phocoenidae), whereas the sister grouping of the latter two families with exclusion of Delphinidae was further revealed using the SINE-flanking sequences. Interestingly, some anomalous PCR amplification patterns of SINE insertions were detected, which can be explained as the result of potential ancestral SINE polymorphisms and incomplete lineage sorting. Although a few loci were potentially anomalous, this study demonstrated that the SINE-based approach is a powerful tool in phylogenetic studies. Identifying additional SINE elements that resolve the relationships in the superfamily Delphinoidea and family Delphinidae will be important steps forward in completely resolving cetacean phylogenetic relationships in the future. PMID:22029548

  1. Phased genotyping-by-sequencing enhances analysis of genetic diversity and reveals divergent copy number variants in maize

    USDA-ARS's Scientific Manuscript database

    High-throughput sequencing of reduced representation genomic libraries has ushered in an era of genotyping-by-sequencing (GBS), where genome-wide genotype data can be obtained for nearly any species. However, there remains a need for imputation-free GBS methods for genotyping large samples taken fr...

  2. Complete genome sequence of a divergent strain of Japanese yam mosaic virus from China

    USDA-ARS's Scientific Manuscript database

    A novel strain of Japanese yam mosaic virus (JYMV-CN) was identified in a yam plant with foliar mottle symptoms in China. The complete genomic sequence of JYMV-CN was determined. Its genomic sequence of 9701 nucleotides encodes a polyprotein of 3247 amino acids. Its organization was virtually identi...

  3. DNA barcoding for molecular identification of Demodex based on mitochondrial genes.

    PubMed

    Hu, Li; Yang, YuanJun; Zhao, YaE; Niu, DongLing; Yang, Rui; Wang, RuiLing; Lu, Zhaohui; Li, XiaoQi

    2017-12-01

    There has been no widely accepted DNA barcode for species identification of Demodex. In this study, we attempted to solve this issue. First, mitochondrial cox1-5' and 12S gene fragments of Demodex folliculorum, D. brevis, D. canis, and D. caprae were amplified, cloned, and sequenced for the first time; intra/interspecific divergences were computed and phylogenetic trees were reconstructed. Then, divergence frequency distribution plots of those two gene fragments were drawn together with mtDNA cox1-middle region and 16S obtained in previous studies. Finally, their identification efficiency was evaluated by comparing barcoding gap. Results indicated that 12S had the higher identification efficiency. Specifically, for cox1-5' region of the four Demodex species, intraspecific divergences were less than 2.0%, and interspecific divergences were 21.1-31.0%; for 12S, intraspecific divergences were less than 1.4%, and interspecific divergences were 20.8-26.9%. The phylogenetic trees demonstrated that the four Demodex species clustered separately, and divergence frequency distribution plot showed that the largest intraspecific divergence of 12S (1.4%) was less than cox1-5' region (2.0%), cox1-middle region (3.1%), and 16S (2.8%). The barcoding gap of 12S was 19.4%, larger than cox1-5' region (19.1%), cox1-middle region (11.3%), and 16S (13.0%); the interspecific divergence span of 12S was 6.2%, smaller than cox1-5' region (10.0%), cox1-middle region (14.1%), and 16S (11.4%). Moreover, 12S has a moderate length (517 bp) for sequencing at once. Therefore, we proposed mtDNA 12S was more suitable than cox1 and 16S to be a DNA barcode for classification and identification of Demodex at lower category level.
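
    The barcoding-gap comparison in this abstract can be reproduced with simple arithmetic. The sketch below assumes the gap is the smallest interspecific divergence minus the largest intraspecific divergence, which matches the reported values for cox1-5' and 12S. The function name and dictionary layout are illustrative, not taken from the study.

```python
def barcoding_gap(max_intraspecific, min_interspecific):
    """Gap (in percentage points) between the largest within-species
    divergence and the smallest between-species divergence."""
    return min_interspecific - max_intraspecific


# (largest intraspecific %, smallest interspecific %) as quoted in the abstract.
markers = {
    "cox1-5'": (2.0, 21.1),
    "12S": (1.4, 20.8),
}

# Round to one decimal place to match the precision used in the abstract.
gaps = {name: round(barcoding_gap(intra, inter), 1)
        for name, (intra, inter) in markers.items()}

# The marker with the widest gap separates species most cleanly.
best_marker = max(gaps, key=gaps.get)
```

    A wider gap means within-species and between-species divergences are less likely to overlap, which is the sense in which the abstract rates 12S above cox1-5'.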

  4. Genetic structuring of European anchovy (Engraulis encrasicolus) populations through mitochondrial DNA sequences.

    PubMed

    Keskin, Emre; Atar, Hasan Huseyin

    2012-04-01

    Mitochondrial DNA sequence variation in 655 bp fragments of the cytochrome oxidase c subunit I gene, known as the DNA barcode, of European anchovy (Engraulis encrasicolus) was evaluated by analyzing 1529 individuals representing 16 populations from the Black Sea, through the Marmara Sea and the Aegean Sea to the Mediterranean Sea. A total of 19 (2.9%) variable sites were found among individuals, and these defined 10 genetically diverged populations with an overall mean distance of 1.2%. The highest nucleotide divergence was found between samples of eastern Mediterranean and northern Aegean (2.2%). Evolutionary history analysis among 16 populations clustered the Mediterranean Sea clades in one main branch and the other clades in another branch. The diverging pattern of the European anchovy populations, correlated with geographic dispersion, supports the genetic structuring through the Black Sea-Marmara Sea-Aegean Sea-Mediterranean Sea quad.

  5. Seeing chordate evolution through the Ciona genome sequence

    PubMed Central

    Cañestro, Cristian; Bassham, Susan; Postlethwait, John H

    2003-01-01

    A draft sequence of the compact genome of the sea squirt Ciona intestinalis, a non-vertebrate chordate that diverged very early from other chordates, including vertebrates, illuminates how chordates originated and how vertebrate developmental innovations evolved. PMID:12620098

  6. PlantFuncSSR: Integrating First and Next Generation Transcriptomics for Mining of SSR-Functional Domains Markers

    PubMed Central

    Sablok, Gaurav; Pérez-Pulido, Antonio J.; Do, Thac; Seong, Tan Y.; Casimiro-Soriguer, Carlos S.; La Porta, Nicola; Ralph, Peter J.; Squartini, Andrea; Muñoz-Merida, Antonio; Harikrishna, Jennifer A.

    2016-01-01

    Analysis of repetitive DNA sequence content and divergence among the repetitive functional classes is a well-accepted approach for estimation of inter- and intra-generic differences in plant genomes. Among these elements, microsatellites, or Simple Sequence Repeats (SSRs), have been widely demonstrated as powerful genetic markers for species and varieties discrimination. We present the PlantFuncSSRs platform, which covers more than 364 plant species and more than 2 million functional SSRs. They are provided with detailed annotations for easy functional browsing of SSRs and with information on primer pairs and associated functional domains. PlantFuncSSRs can be leveraged to identify functional-based genic variability among the species of interest, which might be of particular interest in developing functional markers in plants. This comprehensive on-line portal unifies mining of SSRs from first and next generation sequencing datasets, corresponding primer pairs and associated in-depth functional annotation such as gene ontology annotation, gene interactions and its identification from reference protein databases. PlantFuncSSRs is freely accessible at: http://www.bioinfocabd.upo.es/plantssr. PMID:27446111

  7. Sex Chromosome Turnover Contributes to Genomic Divergence between Incipient Stickleback Species

    PubMed Central

    Yoshida, Kohta; Makino, Takashi; Yamaguchi, Katsushi; Shigenobu, Shuji; Hasebe, Mitsuyasu; Kawata, Masakado; Kume, Manabu; Mori, Seiichi; Peichel, Catherine L.; Toyoda, Atsushi; Fujiyama, Asao; Kitano, Jun

    2014-01-01

    Sex chromosomes turn over rapidly in some taxonomic groups, where closely related species have different sex chromosomes. Although there are many examples of sex chromosome turnover, we know little about the functional roles of sex chromosome turnover in phenotypic diversification and genomic evolution. The sympatric pair of Japanese threespine stickleback (Gasterosteus aculeatus) provides an excellent system to address these questions: the Japan Sea species has a neo-sex chromosome system resulting from a fusion between an ancestral Y chromosome and an autosome, while the sympatric Pacific Ocean species has a simple XY sex chromosome system. Furthermore, previous quantitative trait locus (QTL) mapping demonstrated that the Japan Sea neo-X chromosome contributes to phenotypic divergence and reproductive isolation between these sympatric species. To investigate the genomic basis for the accumulation of genes important for speciation on the neo-X chromosome, we conducted whole genome sequencing of males and females of both the Japan Sea and the Pacific Ocean species. No substantial degeneration has yet occurred on the neo-Y chromosome, but the nucleotide sequence of the neo-X and the neo-Y has started to diverge, particularly at regions near the fusion. The neo-sex chromosomes also harbor an excess of genes with sex-biased expression. Furthermore, genes on the neo-X chromosome showed higher non-synonymous substitution rates than autosomal genes in the Japan Sea lineage. Genomic regions of higher sequence divergence between species, genes with divergent expression between species, and QTL for inter-species phenotypic differences were found not only at the regions near the fusion site, but also at other regions along the neo-X chromosome. Neo-sex chromosomes can therefore accumulate substitutions causing species differences even in the absence of substantial neo-Y degeneration. PMID:24625862

  8. PopHuman: the human population genomics browser.

    PubMed

    Casillas, Sònia; Mulet, Roger; Villegas-Mirón, Pablo; Hervas, Sergi; Sanz, Esteve; Velasco, Daniel; Bertranpetit, Jaume; Laayouni, Hafid; Barbadilla, Antonio

    2018-01-04

    The 1000 Genomes Project (1000GP) represents the most comprehensive world-wide nucleotide variation data set so far in humans, providing the sequencing and analysis of 2504 genomes from 26 populations and reporting >84 million variants. The availability of this sequence data provides the human lineage with an invaluable resource for population genomics studies, allowing the testing of molecular population genetics hypotheses and eventually the understanding of the evolutionary dynamics of genetic variation in human populations. Here we present PopHuman, a new population genomics-oriented genome browser based on JBrowse that allows the interactive visualization and retrieval of an extensive inventory of population genetics metrics. Efficient and reliable parameter estimates have been computed using a novel pipeline that faces the unique features and limitations of the 1000GP data, and include a battery of nucleotide variation measures, divergence and linkage disequilibrium parameters, as well as different tests of neutrality, estimated in non-overlapping windows along the chromosomes and in annotated genes for all 26 populations of the 1000GP. PopHuman is open and freely available at http://pophuman.uab.cat. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  9. Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin.

    PubMed

    Troggio, Michela; Surbanovski, Nada; Bianco, Luca; Moretto, Marco; Giongo, Lara; Banchi, Elisa; Viola, Roberto; Fernández, Felicdad Fernández; Costa, Fabrizio; Velasco, Riccardo; Cestaro, Alessandro; Sargent, Daniel James

    2013-01-01

    High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.

  10. Extensive Conserved Synteny of Genes between the Karyotypes of Manduca sexta and Bombyx mori Revealed by BAC-FISH Mapping

    PubMed Central

    Tanaka-Okuyama, Makiko; Shibata, Fukashi; Yoshido, Atsuo; Marec, František; Wu, Chengcang; Zhang, Hongbin; Goldsmith, Marian R.

    2009-01-01

    Background Genome sequencing projects have been completed for several species representing four highly diverged holometabolous insect orders, Diptera, Hymenoptera, Coleoptera, and Lepidoptera. The striking evolutionary diversity of insects argues a need for efficient methods to apply genome information from such models to genetically uncharacterized species. Constructing conserved synteny maps plays a crucial role in this task. Here, we demonstrate the use of fluorescence in situ hybridization with bacterial artificial chromosome probes as a powerful tool for physical mapping of genes and comparative genome analysis in Lepidoptera, which have numerous and morphologically uniform holokinetic chromosomes. Methodology/Principal Findings We isolated 214 clones containing 159 orthologs of well conserved single-copy genes of a sequenced lepidopteran model, the silkworm, Bombyx mori, from a BAC library of a sphingid with an unexplored genome, the tobacco hornworm, Manduca sexta. We then constructed a BAC-FISH karyotype identifying all 28 chromosomes of M. sexta by mapping 124 loci using the corresponding BAC clones. BAC probes from three M. sexta chromosomes also generated clear signals on the corresponding chromosomes of the convolvulus hawk moth, Agrius convolvuli, which belongs to the same subfamily, Sphinginae, as M. sexta. Conclusions/Significance Comparison of the M. sexta BAC physical map with the linkage map and genome sequence of B. mori pointed to extensive conserved synteny including conserved gene order in most chromosomes. Only a few rearrangements, including three inversions, three translocations, and two fission/fusion events were estimated to have occurred after the divergence of Bombycidae and Sphingidae. These results add to accumulating evidence for the stability of lepidopteran genomes. Generating signals on A. convolvuli chromosomes using heterologous M. sexta probes demonstrated that BAC-FISH with orthologous sequences can be used for karyotyping a wide range of related and genetically uncharacterized species, significantly extending the ability to develop synteny maps for comparative and functional genomics. PMID:19829706

  11. rpoB-Based Identification of Nonpigmented and Late-Pigmenting Rapidly Growing Mycobacteria

    PubMed Central

    Adékambi, Toïdi; Colson, Philippe; Drancourt, Michel

    2003-01-01

    Nonpigmented and late-pigmenting rapidly growing mycobacteria (RGM) are increasingly isolated in clinical microbiology laboratories. Their accurate identification remains problematic because classification is labor-intensive work and because new taxa are not often incorporated into classification databases. Also, 16S rRNA gene sequence analysis underestimates RGM diversity and does not distinguish between all taxa. We determined the complete nucleotide sequence of the rpoB gene, which encodes the bacterial β subunit of the RNA polymerase, for 20 RGM type strains. After using in-house software which analyzes and graphically represents variability stretches of 60 bp along the nucleotide sequence, our analysis focused on a 723-bp variable region exhibiting 83.9 to 97% interspecies similarity and 0 to 1.7% intraspecific divergence. Primer pair Myco-F-Myco-R was designed as a tool for both PCR amplification and sequencing of this region for molecular identification of RGM. This tool was used for identification of 63 RGM clinical isolates previously identified at the species level on the basis of phenotypic characteristics and by 16S rRNA gene sequence analysis. Of 63 clinical isolates, 59 (94%) exhibited <2% partial rpoB gene sequence divergence from 1 of 20 species under study and were regarded as correctly identified at the species level. Mycobacterium abscessus and Mycobacterium mucogenicum isolates were clearly distinguished from Mycobacterium chelonae; Mycobacterium mageritense isolates were clearly distinguished from “Mycobacterium houstonense.” Four isolates were not identified at the species level because they exhibited >3% partial rpoB gene sequence divergence from the corresponding type strain; they belonged to three taxa related to M. mucogenicum, Mycobacterium smegmatis, and Mycobacterium porcinum. For M. abscessus and M. mucogenicum, this partial sequence yielded a high genetic heterogeneity within the clinical isolates. We conclude that molecular identification by analysis of the 723-bp rpoB sequence is a rapid and accurate tool for identification of RGM. PMID:14662964
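
    The threshold rule described in this abstract (<2% divergence from a type strain counts as identified, >3% as not identified) can be sketched as follows. This is an illustration only: the function names, the simple uncorrected p-distance on pre-aligned sequences, the "indeterminate" label for the 2-3% range, and the toy sequences are all assumptions, not the authors' software.

```python
def p_distance(seq_a, seq_b):
    """Uncorrected percent divergence between two aligned, equal-length sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    differences = sum(a != b for a, b in zip(seq_a, seq_b))
    return 100.0 * differences / len(seq_a)


def identify(isolate, type_strains, identified_below=2.0, unidentified_above=3.0):
    """Find the closest type strain and apply the divergence thresholds."""
    species, divergence = min(
        ((name, p_distance(isolate, seq)) for name, seq in type_strains.items()),
        key=lambda pair: pair[1],
    )
    if divergence < identified_below:
        call = "identified"
    elif divergence > unidentified_above:
        call = "not identified"
    else:
        call = "indeterminate"  # assumed label; the abstract does not cover this range
    return species, divergence, call


# Hypothetical 100-bp toy "type strains" (not real rpoB sequences).
type_strains = {"species_A": "ACGT" * 25, "species_B": "TGCA" * 25}
close_isolate = "T" + type_strains["species_A"][1:]        # 1 mismatch (1%)
distant_isolate = "TTTTT" + type_strains["species_A"][5:]  # 4 mismatches (4%)

close_call = identify(close_isolate, type_strains)
distant_call = identify(distant_isolate, type_strains)
```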

  12. Isolation and characterization of a highly evolved type 3 vaccine-derived poliovirus in China.

    PubMed

    Zhang, Xiaowei; Qin, Chong; Li, Wei; Zheng, Zhenhua; Wang, Hanzhong; Cui, Zongqiang

    2017-06-15

    In this study, we report the identification and characterization of a highly evolved type 3 vaccine-derived poliovirus (VDPV) strain designated as WIV14, isolated in 2014 from a 4-year-old child suspected of having an enteroviral infection in China. Complete genome sequence of WIV14 revealed multiple nucleotide substitutions when compared with the attenuated poliovirus (PV) Sabin 3, including the reversion of three major attenuation sites to wild type. From the nucleotide divergence for the P1/capsid region, we estimated that the evolution time of WIV14 was more than 7 years, indicating the possible long time of replication. WIV14 strain seemed to have differences in biological characteristics compared with attenuated PV strains, such as being non-temperature-sensitive and producing large plaques. The current isolation of a highly divergent type 3 VDPV gives an idea of the risk of emergent VDPV strains, and emphasizes the importance of maintaining high vaccination coverage and herd immunity against PVs in China. Copyright © 2017 Elsevier B.V. All rights reserved.

  13. Extraordinary Sequence Divergence at Tsga8, an X-linked Gene Involved in Mouse Spermiogenesis

    PubMed Central

    Good, Jeffrey M.; Vanderpool, Dan; Smith, Kimberly L.; Nachman, Michael W.

    2011-01-01

    The X chromosome plays an important role in both adaptive evolution and speciation. We used a molecular evolutionary screen of X-linked genes potentially involved in reproductive isolation in mice to identify putative targets of recurrent positive selection. We then sequenced five very rapidly evolving genes within and between several closely related species of mice in the genus Mus. All five genes were involved in male reproduction and four of the genes showed evidence of recurrent positive selection. The most remarkable evolutionary patterns were found at Testis-specific gene a8 (Tsga8), a spermatogenesis-specific gene expressed during postmeiotic chromatin condensation and nuclear transformation. Tsga8 was characterized by extremely high levels of insertion–deletion variation of an alanine-rich repetitive motif in natural populations of Mus domesticus and M. musculus, differing in length from the reference mouse genome by up to 89 amino acids (27% of the total protein length). This population-level variation was coupled with striking divergence in protein sequence and length between closely related mouse species. Although no clear orthologs had previously been described for Tsga8 in other mammalian species, we have identified a highly divergent hypothetical gene on the rat X chromosome that shares clear orthology with the 5′ and 3′ ends of Tsga8. Further inspection of this ortholog verified that it is expressed in rat testis and shares remarkable similarity with mouse Tsga8 across several general features of the protein sequence despite no conservation of nucleotide sequence across over 60% of the rat-coding domain. Overall, Tsga8 appears to be one of the most rapidly evolving genes to have been described in rodents. We discuss the potential evolutionary causes and functional implications of this extraordinary divergence and the possible contribution of Tsga8 and the other four genes we examined to reproductive isolation in mice. PMID:21186189

  14. Three Divergent Subpopulations of the Malaria Parasite Plasmodium knowlesi

    PubMed Central

    Lin, Lee C.; Rovie-Ryan, Jeffrine J.; Kadir, Khamisah A.; Anderios, Fread; Hisam, Shamilah; Sharma, Reuben S.K.; Singh, Balbir; Conway, David J.

    2017-01-01

    Multilocus microsatellite genotyping of Plasmodium knowlesi isolates previously indicated 2 divergent parasite subpopulations in humans on the island of Borneo, each associated with a different macaque reservoir host species. Geographic divergence was also apparent, and independent sequence data have indicated particularly deep divergence between parasites from mainland Southeast Asia and Borneo. To resolve the overall population structure, multilocus microsatellite genotyping was conducted on a new sample of 182 P. knowlesi infections (obtained from 134 humans and 48 wild macaques) from diverse areas of Malaysia, first analyzed separately and then in combination with previous data. All analyses confirmed 2 divergent clusters of human cases in Malaysian Borneo, associated with long-tailed macaques and pig-tailed macaques, and a third cluster in humans and most macaques in peninsular Malaysia. High levels of pairwise divergence between each of these sympatric and allopatric subpopulations have implications for the epidemiology and control of this zoonotic species. PMID:28322705

  15. Vorticity and divergence in the solar photosphere

    NASA Technical Reports Server (NTRS)

    Wang, Yi; Noyes, Robert W.; Tarbell, Theodore D.; Title, Alan M.

    1995-01-01

    We have studied an outstanding sequence of continuum images of the solar granulation from Pic du Midi Observatory. We have calculated the horizontal vector flow field using a correlation tracking algorithm, and from this determined three scalar fields: the vertical component of the curl; the horizontal divergence; and the horizontal flow speed. The divergence field has substantially longer coherence time and more power than does the curl field. Statistically, curl is better correlated with regions of negative divergence - that is, the vertical vorticity is higher in downflow regions, suggesting excess vorticity in intergranular lanes. The average value of the divergence is largest (i.e., outflow is largest) where the horizontal speed is large; we associate these regions with exploding granules. A numerical simulation of general convection also shows similar statistical differences between curl and divergence. Some individual small bright points in the granulation pattern show large local vorticities.
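
    The three scalar fields named in this abstract can be computed from a gridded horizontal flow field with finite differences. The sketch below is a minimal illustration using NumPy, assuming velocity arrays indexed as [y, x] on a regular grid. The function name and test fields are hypothetical, and this is not the correlation-tracking pipeline used in the study.

```python
import numpy as np


def flow_scalars(vx, vy, dx=1.0, dy=1.0):
    """Return (vertical curl, horizontal divergence, horizontal speed)
    for a 2D flow sampled on a regular grid with arrays indexed [y, x]."""
    # np.gradient returns derivatives along axis 0 (y) first, then axis 1 (x).
    dvx_dy, dvx_dx = np.gradient(vx, dy, dx)
    dvy_dy, dvy_dx = np.gradient(vy, dy, dx)
    curl_z = dvy_dx - dvx_dy      # vertical component of the curl
    divergence = dvx_dx + dvy_dy  # horizontal divergence
    speed = np.hypot(vx, vy)      # horizontal flow speed
    return curl_z, divergence, speed


# Two idealized test flows on a small grid.
y, x = np.mgrid[-2:3, -2:3].astype(float)

# Pure outflow from the origin: divergence 2 everywhere, no vorticity.
outflow_curl, outflow_div, _ = flow_scalars(x, y)

# Rigid rotation about the origin: vorticity 2 everywhere, no divergence.
rotation_curl, rotation_div, _ = flow_scalars(-y, x)
```

    Positive divergence marks outflow, so in the abstract's terms the outflow case resembles an expanding granule, while negative divergence would mark converging downflow regions.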

  16. Out of the Pacific and back again: insights into the matrilineal history of Pacific killer whale ecotypes.

    PubMed

    Foote, Andrew D; Morin, Phillip A; Durban, John W; Willerslev, Eske; Orlando, Ludovic; Gilbert, M Thomas P

    2011-01-01

    Killer whales (Orcinus orca) are the most widely distributed marine mammals and have radiated to occupy a range of ecological niches. Disparate sympatric types are found in the North Atlantic, Antarctic and North Pacific oceans; however, little is known about the underlying mechanisms driving divergence. Previous phylogeographic analysis using complete mitogenomes yielded a bifurcating tree of clades corresponding to described ecotypes. However, there was low support at two nodes at which two Pacific and two Atlantic clades diverged. Here we apply further phylogenetic and coalescent analyses to partitioned mitochondrial genome sequences to better resolve the pattern of past radiations in this species. Our phylogenetic reconstructions indicate that in the North Pacific, sympatry between the maternal lineages that make up each ecotype arises from secondary contact. Both the phylogenetic reconstructions and a clinal decrease in diversity suggest a North Pacific to North Atlantic founding event, and the later return of killer whales to the North Pacific. Therefore, ecological divergence could have occurred during the allopatric phase through drift or selection, and/or may have either commenced or been consolidated upon secondary contact due to resource competition. The estimated timing of bidirectional migration between the North Pacific and North Atlantic coincided with the previous inter-glacial, when the leakage of fauna from the Indo-Pacific into the Atlantic via the Agulhas current was particularly vigorous.

  17. Phylogeography of the Central American lancehead Bothrops asper (SERPENTES: VIPERIDAE)

    PubMed Central

    Parkinson, Christopher L.; Daza, Juan M.; Wüster, Wolfgang

    2017-01-01

    The uplift and final connection of the Central American land bridge is considered the major event that allowed biotic exchange between vertebrate lineages of northern and southern origin in the New World. However, given the complex tectonics that shaped Middle America, there is still substantial controversy over details of this geographical reconnection, and its role in determining biogeographic patterns in the region. Here, we examine the phylogeography of Bothrops asper, a widely distributed pitviper in Middle America and northwestern South America, in an attempt to evaluate how the final Isthmian uplift and other biogeographical boundaries in the region influenced genealogical lineage divergence in this species. We examined sequence data from two mitochondrial genes (MT-CYB and MT-ND4) from 111 specimens of B. asper, representing 70 localities throughout the species’ distribution. We reconstructed phylogeographic patterns using maximum likelihood and Bayesian methods and estimated divergence time using the Bayesian relaxed clock method. Within the nominal species, an early split led to two divergent lineages of B. asper: one includes five phylogroups distributed in Caribbean Middle America and southwestern Ecuador, and the other comprises five other groups scattered in the Pacific slope of Isthmian Central America and northwestern South America. Our results provide evidence of a complex transition that involves at least two dispersal events into Middle America during the final closure of the Isthmus. PMID:29176806

  18. Genetic evidence for subspecies differentiation of the Himalayan marmot, Marmota himalayana, in the Qinghai-Tibet Plateau

    PubMed Central

    Lin, Gonghua; Li, Qian; Chen, Jiarui; Qin, Wen; Su, Jianping; Zhang, Tongzuo

    2017-01-01

    The primary host of plague in the Qinghai-Tibet Plateau (QTP), China, is Marmota himalayana, which plays an essential role in the maintenance, transmission, and prevalence of plague. To gain clearer insight into the differentiation of M. himalayana, the complete cytochrome b (cyt b) gene and 11 microsatellite loci were analyzed for a total of 423 individuals from 43 localities in the northeast of the QTP. Phylogenetic analyses with maximum likelihood and Bayesian inference methods showed that all derived haplotypes diverged into two primary well-supported monophyletic lineages, I and II, which corresponded to the reference sequences of two recognized subspecies, M. h. himalayana and M. h. robusta, respectively. The divergence between the two lineages was estimated at about 1.03 million years ago, nearly synchronous with the divergence between M. baibacina and M. kastschenkoi and much earlier than that between M. vancouverensis and M. caligata. Genetic structure analyses based on the microsatellite dataset detected significant admixture between the two lineages in the mixed region, which verified the intraspecies level of the differentiation between the two lineages. Our results demonstrated for the first time the coexistence of M. h. himalayana and M. h. robusta, and determined the distribution range of the two subspecies in the northeast of the QTP. We provided fundamental information for more effective plague control in the QTP. PMID:28809943

  19. Improved analytical methods for microarray-based genome-composition analysis

    PubMed Central

    Kim, Charles C; Joyce, Elizabeth A; Chan, Kaman; Falkow, Stanley

    2002-01-01

    Background: Whereas genome sequencing has given us high-resolution pictures of many different species of bacteria, microarrays provide a means of obtaining information on genome composition for many strains of a given species. Genome-composition analysis using microarrays, or 'genomotyping', can be used to categorize genes into 'present' and 'divergent' categories based on the level of hybridization signal. This typically involves selecting a signal value that is used as a cutoff to discriminate present (high signal) and divergent (low signal) genes. Current methodology uses empirical determination of cutoffs for classification into these categories, but this methodology is subject to several problems that can result in the misclassification of many genes. Results: We describe a method that depends on the shape of the signal-ratio distribution and does not require empirical determination of a cutoff. Moreover, the cutoff is determined on an array-to-array basis, accounting for variation in strain composition and hybridization quality. The algorithm also provides an estimate of the probability that any given gene is present, which provides a measure of confidence in the categorical assignments. Conclusions: Many genes previously classified as present using static methods are in fact divergent on the basis of microarray signal; this is corrected by our algorithm. We have reassigned hundreds of genes from previous genomotyping studies of Helicobacter pylori and Campylobacter jejuni strains, and expect that the algorithm should be widely applicable to genomotyping data. PMID:12429064

  20. Diversification of Rice Yellow Mottle Virus and Related Viruses Spans the History of Agriculture from the Neolithic to the Present

    PubMed Central

    Fargette, Denis; Pinel-Galzi, Agnès; Sérémé, Drissa; Lacombe, Séverine; Hébrard, Eugénie; Traoré, Oumar; Konaté, Gnissa

    2008-01-01

    The mechanisms of evolution of plant viruses are being unraveled, yet the timescale of their evolution remains an enigma. To address this critical issue, the divergence time of plant viruses at the intra- and inter-specific levels was assessed. The time of the most recent common ancestor (TMRCA) of Rice yellow mottle virus (RYMV; genus Sobemovirus) was calculated by a Bayesian coalescent analysis of the coat protein sequences of 253 isolates collected between 1966 and 2006 from all over Africa. It is inferred that RYMV diversified approximately 200 years ago in Africa, i.e., centuries after rice was domesticated or introduced, and decades before epidemics were reported. The divergence time of sobemoviruses and viruses of related genera was subsequently assessed using the age of RYMV under a relaxed molecular clock for calibration. The divergence time between sobemoviruses and related viruses was estimated to be approximately 9,000 years, that between sobemoviruses and poleroviruses approximately 5,000 years, and that among sobemoviruses approximately 3,000 years. The TMRCA of closely related pairs of sobemoviruses, poleroviruses, and luteoviruses was approximately 500 years, which is a measure of the time associated with plant virus speciation. It is concluded that the diversification of RYMV and related viruses has spanned the history of agriculture, from the Neolithic age to the present. PMID:18704169
