On the quirks of maximum parsimony and likelihood on phylogenetic networks.
Bryant, Christopher; Fischer, Mareike; Linz, Simone; Semple, Charles
2017-03-21
Maximum parsimony is one of the most frequently-discussed tree reconstruction methods in phylogenetic estimation. However, in recent years it has become more and more apparent that phylogenetic trees are often not sufficient to describe evolution accurately. For instance, processes like hybridization or lateral gene transfer that are commonplace in many groups of organisms and result in mosaic patterns of relationships cannot be represented by a single phylogenetic tree. This is why phylogenetic networks, which can display such events, are becoming of more and more interest in phylogenetic research. It is therefore necessary to extend concepts like maximum parsimony from phylogenetic trees to networks. Several suggestions for possible extensions can be found in recent literature, for instance the softwired and the hardwired parsimony concepts. In this paper, we analyze the so-called big parsimony problem under these two concepts, i.e. we investigate maximum parsimonious networks and analyze their properties. In particular, we show that finding a softwired maximum parsimony network is possible in polynomial time. We also show that the set of maximum parsimony networks for the hardwired definition always contains at least one phylogenetic tree. Lastly, we investigate some parallels of parsimony to different likelihood concepts on phylogenetic networks. Copyright © 2017 Elsevier Ltd. All rights reserved.
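As a rough illustration of the hardwired concept mentioned in this abstract, the sketch below assumes the usual definition: the hardwired score of a character on a network is the minimum, over all state assignments to the unlabelled nodes, of the number of edges whose endpoints carry different states. The function name `hardwired_score`, the toy network, and the brute-force search are illustrative assumptions, not the authors' algorithm, and the search is only practical for very small networks.

```python
from itertools import product

def hardwired_score(edges, leaf_states, alphabet):
    """Brute-force hardwired parsimony score of one character.

    edges: list of (parent, child) pairs describing the network.
    leaf_states: dict mapping each leaf name to its observed state.
    alphabet: iterable of states allowed at unlabelled nodes.
    """
    nodes = sorted({v for edge in edges for v in edge})
    internal = [v for v in nodes if v not in leaf_states]
    best = None
    # Try every assignment of states to the unlabelled (internal) nodes.
    for assignment in product(alphabet, repeat=len(internal)):
        states = dict(leaf_states)
        states.update(zip(internal, assignment))
        # Hardwired cost: every edge with differing endpoint states counts once.
        cost = sum(states[u] != states[v] for u, v in edges)
        if best is None or cost < best:
            best = cost
    return best

# Hypothetical toy network: root r, tree nodes u and v,
# and one reticulation node h with the two parents u and v.
edges = [("r", "u"), ("r", "v"), ("u", "a"), ("u", "h"),
         ("v", "h"), ("v", "c"), ("h", "b")]

print(hardwired_score(edges, {"a": 0, "b": 1, "c": 0}, (0, 1)))
```

Because both edges entering the reticulation node are counted, an assignment that changes state at `h` pays for each parent edge separately; this is the feature that distinguishes the hardwired from the softwired score.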
Improved Maximum Parsimony Models for Phylogenetic Networks.
Van Iersel, Leo; Jones, Mark; Scornavacca, Celine
2018-05-01
Phylogenetic networks are well suited to represent evolutionary histories comprising reticulate evolution. Several methods aiming at reconstructing explicit phylogenetic networks have been developed in the last two decades. In this article, we propose a new definition of maximum parsimony for phylogenetic networks that makes it possible to model biological scenarios that cannot be modeled by the definitions currently present in the literature (namely, the "hardwired" and "softwired" parsimony). Building on this new definition, we provide several algorithmic results that lay the foundations for new parsimony-based methods for phylogenetic network reconstruction.
Kamneva, Olga K; Rosenberg, Noah A
2017-01-01
Hybridization events generate reticulate species relationships, giving rise to species networks rather than species trees. We report a comparative study of consensus, maximum parsimony, and maximum likelihood methods of species network reconstruction using gene trees simulated assuming a known species history. We evaluate the role of the divergence time between species involved in a hybridization event, the relative contributions of the hybridizing species, and the error in gene tree estimation. When gene tree discordance is mostly due to hybridization and not due to incomplete lineage sorting (ILS), most of the methods can detect even highly skewed hybridization events between highly divergent species. For recent divergences between hybridizing species, when the influence of ILS is sufficiently high, likelihood methods outperform parsimony and consensus methods, which erroneously identify extra hybridizations. The more sophisticated likelihood methods, however, are affected by gene tree errors to a greater extent than are consensus and parsimony. PMID:28469378
Generalized Buneman Pruning for Inferring the Most Parsimonious Multi-state Phylogeny
NASA Astrophysics Data System (ADS)
Misra, Navodit; Blelloch, Guy; Ravi, R.; Schwartz, Russell
Accurate reconstruction of phylogenies remains a key challenge in evolutionary biology. Most biologically plausible formulations of the problem are formally NP-hard, with no known efficient solution. Standard practice relies on fast heuristic methods that are empirically known to work very well in general, but can yield results arbitrarily far from optimal. Practical exact methods, which yield exponential worst-case running times but generally much better times in practice, provide an important alternative. We report progress in this direction by introducing a provably optimal method for the weighted multi-state maximum parsimony phylogeny problem. The method is based on generalizing the notion of the Buneman graph, a construction key to efficient exact methods for binary sequences, so as to apply to sequences with arbitrary finite numbers of states with arbitrary state transition weights. We implement an integer linear programming (ILP) method for the multi-state problem using this generalized Buneman graph and demonstrate that the resulting method is able to solve data sets that are intractable for prior exact methods in run times comparable with popular heuristics. Our work provides the first method for provably optimal maximum parsimony phylogeny inference that is practical for multi-state data sets of more than a few characters.
Padial, José M; Grant, Taran; Frost, Darrel R
2014-06-26
Brachycephaloidea is a monophyletic group of frogs with more than 1000 species distributed throughout the New World tropics, subtropics, and Andean regions. Recently, the group has been the target of multiple molecular phylogenetic analyses, resulting in extensive changes in its taxonomy. Here, we test previous hypotheses of phylogenetic relationships for the group by combining available molecular evidence (sequences of 22 genes representing 431 ingroup and 25 outgroup terminals) and performing a tree-alignment analysis under the parsimony optimality criterion using the program POY. To elucidate the effects of alignment and optimality criterion on phylogenetic inferences, we also used the program MAFFT to obtain a similarity-alignment for analysis under both parsimony and maximum likelihood using the programs TNT and GARLI, respectively. Although all three analytical approaches agreed on numerous points, there was also extensive disagreement. Tree-alignment under parsimony supported the monophyly of the ingroup and the sister group relationship of the monophyletic marsupial frogs (Hemiphractidae), while maximum likelihood and parsimony analyses of the MAFFT similarity-alignment did not. All three methods differed with respect to the position of Ceuthomantis smaragdinus (Ceuthomantidae), with tree-alignment using parsimony recovering this species as the sister of Pristimantis + Yunganastes. All analyses rejected the monophyly of Strabomantidae and Strabomantinae as originally defined, and the tree-alignment analysis under parsimony further rejected the recently redefined Craugastoridae and Pristimantinae. Despite the greater emphasis in the systematics literature placed on the choice of optimality criterion for evaluating trees than on the choice of method for aligning DNA sequences, we found that the topological differences attributable to the alignment method were as great as those caused by the optimality criterion. 
Further, the optimal tree-alignment indicates that insertions and deletions occurred in twice as many aligned positions as implied by the optimal similarity-alignment, confirming previous findings that sequence turnover through insertion and deletion events plays a greater role in molecular evolution than indicated by similarity-alignments. Our results also provide a clear empirical demonstration of the different effects of wildcard taxa produced by missing data in parsimony and maximum likelihood analyses. Specifically, maximum likelihood analyses consistently (81% bootstrap frequency) provided spurious resolution despite a lack of evidence, whereas parsimony correctly depicted the ambiguity due to missing data by collapsing unsupported nodes. We provide a new taxonomy for the group that retains previously recognized Linnaean taxa except for Ceuthomantidae, Strabomantidae, and Strabomantinae. A phenotypically diagnosable superfamily is recognized formally as Brachycephaloidea, with the informal, unranked name terrarana retained as the standard common name for these frogs. We recognize three families within Brachycephaloidea that are currently diagnosable solely on molecular grounds (Brachycephalidae, Craugastoridae, and Eleutherodactylidae), as well as five subfamilies (Craugastorinae, Eleutherodactylinae, Holoadeninae, Phyzelaphryninae, and Pristimantinae) corresponding in large part to previous families and subfamilies. Our analyses upheld the monophyly of all tested genera, but we found numerous subgeneric taxa to be non-monophyletic and modified the taxonomy accordingly.
Multiple optimality criteria support Ornithoscelida
NASA Astrophysics Data System (ADS)
Parry, Luke A.; Baron, Matthew G.; Vinther, Jakob
2017-10-01
A recent study of early dinosaur evolution using equal-weights parsimony recovered a scheme of dinosaur interrelationships and classification that differed from historical consensus in a single, but significant, respect: Ornithischia and Saurischia were not recovered as monophyletic sister-taxa, but rather Ornithischia and Theropoda formed a novel clade named Ornithoscelida. However, these analyses only used maximum parsimony, and numerous recent simulation studies have questioned the accuracy of parsimony under equal weights. Here, we provide additional support for this alternative hypothesis using a Bayesian implementation of the Mkv model, as well as through a number of additional parsimony analyses, including implied weighting. Using Bayesian inference and implied weighting, we recover the same fundamental topology for Dinosauria as the original study, with a monophyletic Ornithoscelida, demonstrating that the main suite of methods used in morphological phylogenetics recovers this novel hypothesis. This result was further scrutinized through the systematic exclusion of different character sets. Novel characters from the original study (those not taken or adapted from previous phylogenetic studies) were found to be more important for resolving the relationships within Dinosauromorpha than the relationships within Dinosauria. Reanalysis of a modified version of the character matrix that supports the Ornithischia-Saurischia dichotomy under maximum parsimony also supports this hypothesis under implied weighting, but not under the Mkv model, with both Theropoda and Sauropodomorpha becoming paraphyletic with respect to Ornithischia.
A Review of System Identification Methods Applied to Aircraft
NASA Technical Reports Server (NTRS)
Klein, V.
1983-01-01
Airplane identification, equation error method, maximum likelihood method, parameter estimation in frequency domain, extended Kalman filter, aircraft equations of motion, aerodynamic model equations, criteria for the selection of a parsimonious model, and online aircraft identification are addressed.
Towards improving searches for optimal phylogenies.
Ford, Eric; St John, Katherine; Wheeler, Ward C
2015-01-01
Finding the optimal evolutionary history for a set of taxa is a challenging computational problem, even when restricting possible solutions to be "tree-like" and focusing on the maximum-parsimony optimality criterion. This has led to much work on using heuristic tree searches to find approximate solutions. We present an approach for finding exact optimal solutions that employs and complements the current heuristic methods for finding optimal trees. Given a set of taxa and a set of aligned sequences of characters, there may be subsets of characters that are compatible, and for each such subset there is an associated (possibly partially resolved) phylogeny with edges corresponding to each character state change. These perfect phylogenies serve as anchor trees for our constrained search space. We show that, for sequences with compatible sites, the parsimony score of any tree T is at least the parsimony score of the anchor trees plus the number of inferred changes between T and the anchor trees. As the maximum-parsimony optimality score is additive, the sum of the lower bounds on compatible character partitions provides a lower bound on the complete alignment of characters. This yields a region in the space of trees within which the best tree is guaranteed to be found; limiting the search for the optimal tree to this region can significantly reduce the number of trees that must be examined in a search of the space of trees. We analyze this method empirically using four different biological data sets as well as surveying 400 data sets from the TreeBASE repository, demonstrating the effectiveness of our technique in reducing the number of steps in exact heuristic searches for trees under the maximum-parsimony optimality criterion. © The Author(s) 2014. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved.
The relationships of the Euparkeriidae and the rise of Archosauria
NASA Astrophysics Data System (ADS)
Sookias, Roland B.
2016-03-01
For the first time, a phylogenetic analysis including all putative euparkeriid taxa is conducted, using a large data matrix analysed with maximum parsimony and Bayesian analysis. Using parsimony, the putative euparkeriid Dorosuchus neoetus from Russia is the sister taxon to Archosauria + Phytosauria. Euparkeria capensis is placed one node further from the crown, and forms a euparkeriid clade with the Chinese taxa Halazhaisuchus qiaoensis and 'Turfanosuchus shageduensis' and the Polish taxon Osmolskina czatkowicensis. Using Bayesian methods, Osmolskina and Halazhaisuchus are sister taxa within Euparkeriidae, in turn sister to 'Turfanosuchus shageduensis' and then Euparkeria capensis. Dorosuchus is placed in a polytomy with Euparkeriidae and Archosauria + Phytosauria. Although conclusions remain tentative owing to low node support and incompleteness, a broad phylogenetic position close to the base of Archosauria is confirmed for all putative euparkeriids, and the ancestor of Archosauria + Phytosauria is optimized as similar to euparkeriids in its morphology. Ecomorphological characters and traits are optimized onto the maximum parsimony strict consensus phylogeny presented using squared change parsimony. This optimization indicates that the ancestral archosaur was probably similar in many respects to euparkeriids, being relatively small, terrestrial, carnivorous and showing relatively cursorial limb morphology; this Bauplan may have underlain the exceptional radiation and success of crown Archosauria.
Maximum parsimony, substitution model, and probability phylogenetic trees.
Weng, J F; Thomas, D A; Mareels, I
2011-01-01
The problem of inferring phylogenies (phylogenetic trees) is one of the main problems in computational biology. There are three main methods for inferring phylogenies: Maximum Parsimony (MP), Distance Matrix (DM) and Maximum Likelihood (ML), of which the MP method is the most thoroughly studied and the most popular. In the MP method the optimization criterion is the number of nucleotide substitutions, computed from the differences between the investigated nucleotide sequences. However, the MP method is often criticized because it counts only the substitutions observable at the current time and omits all the unobservable substitutions that really occurred in the evolutionary history. In order to take the unobservable substitutions into account, substitution models have been established and are now widely used in the DM and ML methods, but these substitution models cannot be used within the classical MP method. Recently the authors proposed a probability representation model for phylogenetic trees; the trees reconstructed in this model are called probability phylogenetic trees. One of the advantages of the probability representation model is that it can include a substitution model to infer phylogenetic trees based on the MP principle. In this paper we explain how to use a substitution model in the reconstruction of probability phylogenetic trees and show the advantage of this approach with examples.
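The classical MP criterion described in this abstract, the number of substitutions a tree requires, can be sketched with Fitch's small-parsimony algorithm. The code below is a minimal illustration under stated assumptions: the tree is a rooted binary tree written as nested tuples of taxon names, all substitutions cost 1, and the names `fitch`, `parsimony_score`, and the toy alignment are hypothetical. It is not the authors' probability representation model.

```python
def fitch(tree, leaf_states):
    """Return (candidate_states, changes) for one site on a rooted binary tree.

    tree: a taxon name (str) or a pair (left_subtree, right_subtree).
    leaf_states: dict mapping taxon name to its state at this site.
    """
    if isinstance(tree, str):
        return {leaf_states[tree]}, 0
    left, right = tree
    left_set, left_changes = fitch(left, leaf_states)
    right_set, right_changes = fitch(right, leaf_states)
    common = left_set & right_set
    if common:
        # Children agree on at least one state: no extra substitution needed.
        return common, left_changes + right_changes
    # Children disagree: take the union and count one substitution.
    return left_set | right_set, left_changes + right_changes + 1

def parsimony_score(tree, alignment):
    """Sum the per-site Fitch counts over all columns of the alignment."""
    n_sites = len(next(iter(alignment.values())))
    return sum(
        fitch(tree, {taxon: seq[i] for taxon, seq in alignment.items()})[1]
        for i in range(n_sites)
    )

# Hypothetical toy data: four taxa, three aligned sites.
alignment = {"A": "AAG", "B": "AAG", "C": "ACT", "D": "ACT"}
tree_1 = (("A", "B"), ("C", "D"))
tree_2 = (("A", "C"), ("B", "D"))

print(parsimony_score(tree_1, alignment), parsimony_score(tree_2, alignment))
```

On this toy alignment the first tree needs fewer substitutions than the second, so MP prefers it. The count includes only the changes needed to explain the observed states, which is the limitation the abstract refers to.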
Inferring Phylogenetic Networks Using PhyloNet.
Wen, Dingqiao; Yu, Yun; Zhu, Jiafan; Nakhleh, Luay
2018-07-01
PhyloNet was released in 2008 as a software package for representing and analyzing phylogenetic networks. At the time of its release, the main functionalities in PhyloNet consisted of measures for comparing network topologies and a single heuristic for reconciling gene trees with a species tree. Since then, PhyloNet has grown significantly. The software package now includes a wide array of methods for inferring phylogenetic networks from data sets of unlinked loci while accounting for both reticulation (e.g., hybridization) and incomplete lineage sorting. In particular, PhyloNet now allows for maximum parsimony, maximum likelihood, and Bayesian inference of phylogenetic networks from gene tree estimates. Furthermore, Bayesian inference directly from sequence data (sequence alignments or biallelic markers) is implemented. Maximum parsimony is based on an extension of the "minimizing deep coalescences" criterion to phylogenetic networks, whereas maximum likelihood and Bayesian inference are based on the multispecies network coalescent. All methods allow for multiple individuals per species. As computing the likelihood of a phylogenetic network is computationally hard, PhyloNet allows for evaluation and inference of networks using a pseudolikelihood measure. PhyloNet summarizes the results of the various analyses and generates phylogenetic networks in the extended Newick format that is readily viewable by existing visualization software.
On defining a unique phylogenetic tree with homoplastic characters.
Goloboff, Pablo A; Wilkinson, Mark
2018-05-01
This paper discusses the problem of whether creating a matrix with all the character state combinations that have a fixed number of steps (or extra steps) on a given tree T, produces the same tree T when analyzed with maximum parsimony or maximum likelihood. Exhaustive enumeration of cases up to 20 taxa for binary characters, and up to 12 taxa for 4-state characters, shows that the same tree is recovered (as unique most likely or most parsimonious tree) as long as the number of extra steps is within 1/4 of the number of taxa. This dependence, 1/4 of the number of taxa, is discussed with a general argumentation, in terms of the spread of the character changes on the tree used to select character state distributions. The present finding allows creating matrices which have as much homoplasy as possible for the most parsimonious or likely tree to be predictable, and examination of these matrices with hill-climbing search algorithms provides additional evidence on the (lack of a) necessary relationship between homoplasy and the ability of search methods to find optimal trees. Copyright © 2018 Elsevier Inc. All rights reserved.
Galtier, N; Boursot, P
2000-03-01
A new, model-based method was devised to locate nucleotide changes in a given phylogenetic tree. For each site, the posterior probability of any possible change in each branch of the tree is computed. This probabilistic method is a valuable alternative to the maximum parsimony method when base composition is skewed (i.e., different from 25% A, 25% C, 25% G, 25% T): computer simulations showed that parsimony misses more rare → common than common → rare changes, resulting in biased inferred change matrices, whereas the new method appeared unbiased. The probabilistic method was applied to the analysis of the mutation and substitution processes in the mitochondrial control region of mouse. Distinct change patterns were found at the polymorphism (within species) and divergence (between species) levels, rejecting the hypothesis of a neutral evolution of base composition in mitochondrial DNA.
Krishnan, Neeraja M; Seligmann, Hervé; Stewart, Caro-Beth; De Koning, A P Jason; Pollock, David D
2004-10-01
Reconstruction of ancestral DNA and amino acid sequences is an important means of inferring information about past evolutionary events. Such reconstructions suggest changes in molecular function and evolutionary processes over the course of evolution and are used to infer adaptation and convergence. Maximum likelihood (ML) is generally thought to provide relatively accurate reconstructed sequences compared to parsimony, but both methods lead to the inference of multiple directional changes in nucleotide frequencies in primate mitochondrial DNA (mtDNA). To better understand this surprising result, as well as to better understand how parsimony and ML differ, we constructed a series of computationally simple "conditional pathway" methods that differed in the number of substitutions allowed per site along each branch, and we also evaluated the entire Bayesian posterior frequency distribution of reconstructed ancestral states. We analyzed primate mitochondrial cytochrome b (Cyt-b) and cytochrome oxidase subunit I (COI) genes and found that ML reconstructs ancestral frequencies that are often more different from tip sequences than are parsimony reconstructions. In contrast, frequency reconstructions based on the posterior ensemble more closely resemble extant nucleotide frequencies. Simulations indicate that these differences in ancestral sequence inference are probably due to deterministic bias caused by high uncertainty in the optimization-based ancestral reconstruction methods (parsimony, ML, Bayesian maximum a posteriori). In contrast, ancestral nucleotide frequencies based on an average of the Bayesian set of credible ancestral sequences are much less biased. 
The methods involving simpler conditional pathway calculations have slightly reduced likelihood values compared to full likelihood calculations, but they can provide fairly unbiased nucleotide reconstructions and may be useful in more complex phylogenetic analyses than considered here due to their speed and flexibility. To determine whether biased reconstructions using optimization methods might affect inferences of functional properties, ancestral primate mitochondrial tRNA sequences were inferred and helix-forming propensities for conserved pairs were evaluated in silico. For ambiguously reconstructed nucleotides at sites with high base composition variability, ancestral tRNA sequences from Bayesian analyses were more compatible with canonical base pairing than were those inferred by other methods. Thus, nucleotide bias in reconstructed sequences apparently can lead to serious bias and inaccuracies in functional predictions.
Parsimonious nonstationary flood frequency analysis
NASA Astrophysics Data System (ADS)
Serago, Jake M.; Vogel, Richard M.
2018-02-01
There is now widespread awareness of the impact of anthropogenic influences on extreme floods (and droughts) and thus an increasing need for methods to account for such influences when estimating a frequency distribution. We introduce a parsimonious approach to nonstationary flood frequency analysis (NFFA) based on a bivariate regression equation which describes the relationship between annual maximum floods, x, and an exogenous variable which may explain the nonstationary behavior of x. The conditional mean, variance and skewness of both x and y = ln (x) are derived, and combined with numerous common probability distributions including the lognormal, generalized extreme value and log Pearson type III models, resulting in a very simple and general approach to NFFA. Our approach offers several advantages over existing approaches including: parsimony, ease of use, graphical display, prediction intervals, and opportunities for uncertainty analysis. We introduce nonstationary probability plots and document how such plots can be used to assess the improved goodness of fit associated with a NFFA.
McCann, Jamie; Stuessy, Tod F.; Villaseñor, Jose L.; Weiss-Schneeweiss, Hanna
2016-01-01
Chromosome number change (polyploidy and dysploidy) plays an important role in plant diversification and speciation. Investigating chromosome number evolution commonly entails ancestral state reconstruction performed within a phylogenetic framework, which is, however, prone to uncertainty, whose effects on evolutionary inferences are insufficiently understood. Using the chromosomally diverse plant genus Melampodium (Asteraceae) as model group, we assess the impact of reconstruction method (maximum parsimony, maximum likelihood, Bayesian methods), branch length model (phylograms versus chronograms) and phylogenetic uncertainty (topological and branch length uncertainty) on the inference of chromosome number evolution. We also address the suitability of the maximum clade credibility (MCC) tree as single representative topology for chromosome number reconstruction. Each of the listed factors causes considerable incongruence among chromosome number reconstructions. Discrepancies between inferences on the MCC tree from those made by integrating over a set of trees are moderate for ancestral chromosome numbers, but severe for the difference of chromosome gains and losses, a measure of the directionality of dysploidy. Therefore, reliance on single trees, such as the MCC tree, is strongly discouraged and model averaging, taking both phylogenetic and model uncertainty into account, is recommended. For studying chromosome number evolution, dedicated models implemented in the program ChromEvol and ordered maximum parsimony may be most appropriate. Chromosome number evolution in Melampodium follows a pattern of bidirectional dysploidy (starting from x = 11 to x = 9 and x = 14, respectively) with no prevailing direction. PMID:27611687
Zhi-Bin Wen; Ming-Li Zhang; Ge-Lin Zhu; Stewart C. Sanderson
2010-01-01
To reconstruct phylogeny and verify the monophyly of major subgroups, a total of 52 species representing almost all species of Salsoleae s.l. in China were sampled, with analysis based on three molecular markers (nrDNA ITS, cpDNA psbB-psbH and rbcL), using maximum parsimony, maximum likelihood, and Bayesian inference methods. Our molecular evidence provides strong...
Yamaguchi, M; Miya, M; Okiyama, M; Nishida, M
2000-04-01
Larvae of the deep-sea lanternfish genus Hygophum (Myctophidae) exhibit a remarkable morphological diversity that is quite unexpected, considering their homogeneous adult morphology. In an attempt to elucidate the evolutionary patterns of such larval morphological diversity, nucleotide sequences of a portion of the mitochondrially encoded 16S ribosomal RNA gene were determined for seven Hygophum species and three outgroup taxa. Secondary structure-based alignment resulted in a character matrix consisting of 1172 bp of unambiguously aligned sequences, which were subjected to phylogenetic analyses using maximum-parsimony, maximum-likelihood, and neighbor-joining methods. The resultant tree topologies from the three methods were congruent, with most nodes, including that of the genus Hygophum, being strongly supported by various tree statistics. The most parsimonious reconstruction of the three previously recognized, distinct larval morphs onto the molecular phylogeny revealed that one of the morphs had originated as the common ancestor of the genus, the other two having diversified separately in two subsequent major clades. The patterns of such diversification are discussed in terms of the unusual larval eye morphology and geographic distribution. Copyright 2000 Academic Press.
Majority rule has transition ratio 4 on Yule trees under a 2-state symmetric model.
Mossel, Elchanan; Steel, Mike
2014-11-07
Inferring the ancestral state at the root of a phylogenetic tree from states observed at the leaves is a problem arising in evolutionary biology. The simplest technique - majority rule - estimates the root state by the most frequently occurring state at the leaves. Alternative methods - such as maximum parsimony - explicitly take the tree structure into account. Since either method can outperform the other on particular trees, it is useful to consider the accuracy of the methods on trees generated under some evolutionary null model, such as a Yule pure-birth model. In this short note, we answer a recently posed question concerning the performance of majority rule on Yule trees under a symmetric 2-state Markovian substitution model of character state change. We show that majority rule is accurate precisely when the ratio of the birth (speciation) rate of the Yule process to the substitution rate exceeds the value 4. By contrast, maximum parsimony has been shown to be accurate only when this ratio is at least 6. Our proof relies on a second moment calculation, coupling, and a novel application of a reflection principle. Copyright © 2014 Elsevier Ltd. All rights reserved.
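The majority-rule estimator described in this abstract is simple enough to sketch directly. The code below is an illustrative assumption rather than anything taken from the paper: `majority_rule` returns the most frequent leaf state (or `None` on a tie), and `exceeds_majority_threshold` merely encodes the abstract's stated condition that the ratio of birth rate to substitution rate must exceed 4. Both function names are hypothetical.

```python
from collections import Counter

def majority_rule(leaf_states):
    """Estimate the root state as the most frequent state at the leaves.

    Returns None when the top two states are tied, since the plain
    majority rule gives no unique answer in that case.
    """
    counts = Counter(leaf_states).most_common()
    if not counts:
        raise ValueError("at least one leaf state is required")
    top_state, top_count = counts[0]
    if len(counts) > 1 and counts[1][1] == top_count:
        return None
    return top_state

def exceeds_majority_threshold(birth_rate, substitution_rate):
    """Check the abstract's condition: birth rate / substitution rate > 4."""
    return birth_rate / substitution_rate > 4

# Hypothetical leaf observations under a 2-state model.
print(majority_rule([0, 0, 1, 0, 1]))
print(exceeds_majority_threshold(5.0, 1.0))
```

Unlike maximum parsimony, this estimator ignores the tree structure entirely; it uses only the state frequencies at the leaves.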
Cao, Y; Adachi, J; Yano, T; Hasegawa, M
1994-07-01
Graur et al.'s (1991) hypothesis that the guinea pig-like rodents have an evolutionary origin within mammals that is separate from that of other rodents (the rodent-polyphyly hypothesis) was reexamined by the maximum-likelihood method for protein phylogeny, as well as by the maximum-parsimony and neighbor-joining methods. The overall evidence does not support Graur et al.'s hypothesis, which radically contradicts the traditional view of rodent monophyly. This work demonstrates that we must be careful in choosing a proper method for phylogenetic inference and that an argument based on a small data set (with respect to the length of the sequence and especially the number of species) may be unstable.
Salas-Leiva, Dayana E; Meerow, Alan W; Calonje, Michael; Griffith, M Patrick; Francisco-Ortega, Javier; Nakamura, Kyoko; Stevenson, Dennis W; Lewis, Carl E; Namoff, Sandra
2013-11-01
Despite a recent new classification, a stable phylogeny for the cycads has been elusive, particularly regarding resolution of Bowenia, Stangeria and Dioon. In this study, five single-copy nuclear genes (SCNGs) are applied to the phylogeny of the order Cycadales. The specific aim is to evaluate several gene tree-species tree reconciliation approaches for developing an accurate phylogeny of the order, to contrast them with concatenated parsimony analysis and to resolve the erstwhile problematic phylogenetic position of these three genera. DNA sequences of five SCNGs were obtained for 20 cycad species representing all ten genera of Cycadales. These were analysed with parsimony, maximum likelihood (ML) and three Bayesian methods of gene tree-species tree reconciliation, using Cycas as the outgroup. A calibrated date estimation was developed with Bayesian methods, and biogeographic analysis was also conducted. Concatenated parsimony, ML and three species tree inference methods resolve exactly the same tree topology with high support at most nodes. Dioon and Bowenia are the first and second branches of Cycadales after Cycas, respectively, followed by an encephalartoid clade (Macrozamia-Lepidozamia-Encephalartos), which is sister to a zamioid clade, of which Ceratozamia is the first branch, and in which Stangeria is sister to Microcycas and Zamia. A single, well-supported phylogenetic hypothesis of the generic relationships of the Cycadales is presented. However, massive extinction events inferred from the fossil record that eliminated broader ancestral distributions within Zamiaceae compromise accurate optimization of ancestral biogeographical areas for that hypothesis. While major lineages of Cycadales are ancient, crown ages of all modern genera are no older than 12 million years, supporting a recent hypothesis of mostly Miocene radiations. This phylogeny can contribute to an accurate infrafamilial classification of Zamiaceae.
Naushad, Sohail; Barkema, Herman W.; Luby, Christopher; Condas, Larissa A. Z.; Nobrega, Diego B.; Carson, Domonique A.; De Buck, Jeroen
2016-01-01
Non-aureus staphylococci (NAS), a heterogeneous group of a large number of species and subspecies, are the most frequently isolated pathogens from intramammary infections in dairy cattle. Phylogenetic relationships among bovine NAS species are controversial and have mostly been determined based on single-gene trees. Herein, we analyzed phylogeny of bovine NAS species using whole-genome sequencing (WGS) of 441 distinct isolates. In addition, evolutionary relationships among bovine NAS were estimated from multilocus data of 16S rRNA, hsp60, rpoB, sodA, and tuf genes and sequences from these and numerous other single genes/proteins. All phylogenies were created with FastTree, Maximum-Likelihood, Maximum-Parsimony, and Neighbor-Joining methods. Regardless of methodology, WGS-trees clearly separated bovine NAS species into five monophyletic coherent clades. Furthermore, there were consistent interspecies relationships within clades in all WGS phylogenetic reconstructions. Except for the Maximum-Parsimony tree, multilocus data analysis similarly produced five clades. There were large variations in determining clades and interspecies relationships in single gene/protein trees, under different methods of tree constructions, highlighting limitations of using single genes for determining bovine NAS phylogeny. However, based on WGS data, we established a robust phylogeny of bovine NAS species, unaffected by method or model of evolutionary reconstructions. Therefore, it is now possible to determine associations between phylogeny and many biological traits, such as virulence, antimicrobial resistance, environmental niche, geographical distribution, and host specificity. PMID:28066335
Molecular phylogenetic trees - On the validity of the Goodman-Moore augmentation algorithm
NASA Technical Reports Server (NTRS)
Holmquist, R.
1979-01-01
A response is made to the reply of Nei and Tateno (1979) to the letter of Holmquist (1978) supporting the validity of the augmentation algorithm of Moore (1977) in reconstructions of nucleotide substitutions by means of the maximum parsimony principle. It is argued that the overestimation of the augmented numbers of nucleotide substitutions (augmented distances) found by Tateno and Nei (1978) is due to an unrepresentative data sample and that it is only necessary that evolution be stochastically uniform in different regions of the phylogenetic network for the augmentation method to be useful. The importance of the average value of the true distance over all links is explained, and the relative variances of the true and augmented distances are calculated to be almost identical. The effects of topological changes in the phylogenetic tree on the augmented distance and the question of the correctness of ancestral sequences inferred by the method of parsimony are also clarified.
Salas-Leiva, Dayana E.; Meerow, Alan W.; Calonje, Michael; Griffith, M. Patrick; Francisco-Ortega, Javier; Nakamura, Kyoko; Stevenson, Dennis W.; Lewis, Carl E.; Namoff, Sandra
2013-01-01
Background and aims Despite a recent new classification, a stable phylogeny for the cycads has been elusive, particularly regarding resolution of Bowenia, Stangeria and Dioon. In this study, five single-copy nuclear genes (SCNGs) are applied to the phylogeny of the order Cycadales. The specific aim is to evaluate several gene tree–species tree reconciliation approaches for developing an accurate phylogeny of the order, to contrast them with concatenated parsimony analysis and to resolve the erstwhile problematic phylogenetic position of these three genera. Methods DNA sequences of five SCNGs were obtained for 20 cycad species representing all ten genera of Cycadales. These were analysed with parsimony, maximum likelihood (ML) and three Bayesian methods of gene tree–species tree reconciliation, using Cycas as the outgroup. A calibrated date estimation was developed with Bayesian methods, and biogeographic analysis was also conducted. Key Results Concatenated parsimony, ML and three species tree inference methods resolve exactly the same tree topology with high support at most nodes. Dioon and Bowenia are the first and second branches of Cycadales after Cycas, respectively, followed by an encephalartoid clade (Macrozamia–Lepidozamia–Encephalartos), which is sister to a zamioid clade, of which Ceratozamia is the first branch, and in which Stangeria is sister to Microcycas and Zamia. Conclusions A single, well-supported phylogenetic hypothesis of the generic relationships of the Cycadales is presented. However, massive extinction events inferred from the fossil record that eliminated broader ancestral distributions within Zamiaceae compromise accurate optimization of ancestral biogeographical areas for that hypothesis. While major lineages of Cycadales are ancient, crown ages of all modern genera are no older than 12 million years, supporting a recent hypothesis of mostly Miocene radiations. 
This phylogeny can contribute to an accurate infrafamilial classification of Zamiaceae. PMID:23997230
Convergence among cave catfishes: long-branch attraction and a Bayesian relative rates test.
Wilcox, T P; García de León, F J; Hendrickson, D A; Hillis, D M
2004-06-01
Convergence has long been of interest to evolutionary biologists. Cave organisms appear to be ideal candidates for studying convergence in morphological, physiological, and developmental traits. Here we report apparent convergence in two cave-catfishes that were described on morphological grounds as congeners: Prietella phreatophila and Prietella lundbergi. We collected mitochondrial DNA sequence data from 10 species of catfishes, representing five of the seven genera in Ictaluridae, as well as seven species from a broad range of siluriform outgroups. Analysis of the sequence data under parsimony supports a monophyletic Prietella. However, both maximum-likelihood and Bayesian analyses support polyphyly of the genus, with P. lundbergi sister to Ictalurus and P. phreatophila sister to Ameiurus. The topological difference between parsimony and the other methods appears to result from long-branch attraction between the Prietella species. Similarly, the sequence data do not support several other relationships within Ictaluridae supported by morphology. We develop a new Bayesian method for examining variation in molecular rates of evolution across a phylogeny.
Licona-Vera, Yuyini; Ornelas, Juan Francisco
2017-06-05
Geographical and temporal patterns of diversification in bee hummingbirds (Mellisugini) were assessed with respect to the evolution of migration, critical for colonization of North America. We generated a dated multilocus phylogeny of the Mellisugini based on a dense sampling using Bayesian inference, maximum-likelihood and maximum parsimony methods, and reconstructed the ancestral states of distributional areas in a Bayesian framework and migratory behavior using maximum parsimony, maximum-likelihood and re-rooting methods. All phylogenetic analyses confirmed monophyly of the Mellisugini and the inclusion of Atthis, Calothorax, Doricha, Eulidia, Mellisuga, Microstilbon, Myrmia, Tilmatura, and Thaumastura. Mellisugini consists of two clades: (1) South American species (including Tilmatura dupontii), and (2) species distributed in North and Central America and the Caribbean islands. The second clade consists of four subclades: Mexican (Calothorax, Doricha) and Caribbean (Archilochus, Calliphlox, Mellisuga) sheartails, Calypte, and Selasphorus (incl. Atthis). Coalescent-based dating places the origin of the Mellisugini in the mid-to-late Miocene, with crown ages of most subclades in the early Pliocene, and subsequent species splits in the Pleistocene. Bee hummingbirds reached western North America by the end of the Miocene and the ancestral mellisuginid (bee hummingbirds) was reconstructed as sedentary, with four independent gains of migratory behavior during the evolution of the Mellisugini. Early colonization of North America and subsequent evolution of migration best explained biogeographic and diversification patterns within the Mellisugini. The repeated evolution of long-distance migration by different lineages was critical for the colonization of North America, contributing to the radiation of bee hummingbirds. 
Comparative phylogeography is needed to test whether the repeated evolution of migration resulted from northward expansion of southern sedentary populations.
MRL and SuperFine+MRL: new supertree methods
2012-01-01
Background Supertree methods combine trees on subsets of the full taxon set together to produce a tree on the entire set of taxa. Of the many supertree methods, the most popular is MRP (Matrix Representation with Parsimony), a method that operates by first encoding the input set of source trees by a large matrix (the "MRP matrix") over {0,1, ?}, and then running maximum parsimony heuristics on the MRP matrix. Experimental studies evaluating MRP in comparison to other supertree methods have established that for large datasets, MRP generally produces trees of equal or greater accuracy than other methods, and can run on larger datasets. A recent development in supertree methods is SuperFine+MRP, a method that combines MRP with a divide-and-conquer approach, and produces more accurate trees in less time than MRP. In this paper we consider a new approach for supertree estimation, called MRL (Matrix Representation with Likelihood). MRL begins with the same MRP matrix, but then analyzes the MRP matrix using heuristics (such as RAxML) for 2-state Maximum Likelihood. Results We compared MRP and SuperFine+MRP with MRL and SuperFine+MRL on simulated and biological datasets. We examined the MRP and MRL scores of each method on a wide range of datasets, as well as the resulting topological accuracy of the trees. Our experimental results show that MRL, coupled with a very good ML heuristic such as RAxML, produced more accurate trees than MRP, and MRL scores were more strongly correlated with topological accuracy than MRP scores. Conclusions SuperFine+MRP, when based upon a good MP heuristic, such as TNT, produces among the best scores for both MRP and MRL, and is generally faster and more topologically accurate than other supertree methods we tested. PMID:22280525
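The matrix encoding described in this abstract can be sketched as follows. This is a minimal illustration and not the authors' implementation: it assumes the usual clade-based coding (one column per internal edge of each rooted source tree, with 1 for taxa inside the clade, 0 for other taxa in that tree, and ? for taxa absent from the tree), omits the all-zero outgroup row that is often added, and uses hypothetical names (`leaves`, `clades`, `mrp_matrix`) and nested tuples as a stand-in tree format.

```python
def leaves(tree):
    """Return the set of leaf labels of a nested-tuple tree."""
    if isinstance(tree, str):
        return {tree}
    return set().union(*(leaves(child) for child in tree))


def clades(tree):
    """Yield the leaf set under every internal node except the root."""
    def walk(node, is_root):
        if isinstance(node, str):
            return
        if not is_root:
            yield frozenset(leaves(node))
        for child in node:
            yield from walk(child, False)
    yield from walk(tree, True)


def mrp_matrix(source_trees):
    """Encode rooted source trees as a mapping taxon -> string over {0, 1, ?}."""
    all_taxa = sorted(set().union(*(leaves(t) for t in source_trees)))
    rows = {taxon: [] for taxon in all_taxa}
    for tree in source_trees:
        present = leaves(tree)
        for clade in clades(tree):          # one matrix column per clade
            for taxon in all_taxa:
                if taxon not in present:
                    rows[taxon].append("?")  # taxon missing from this source tree
                elif taxon in clade:
                    rows[taxon].append("1")
                else:
                    rows[taxon].append("0")
    return {taxon: "".join(chars) for taxon, chars in rows.items()}


# Two small source trees on overlapping taxon sets (made-up example data).
trees = [(("A", "B"), ("C", "D")), (("B", "C"), "E")]
matrix = mrp_matrix(trees)
for taxon, row in matrix.items():
    print(taxon, row)
```

The resulting matrix is what a parsimony heuristic (for MRP) or a 2-state likelihood heuristic (for MRL) would then analyze.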
Phylogenetically marking the limits of the genus Fusarium for post-Article 59 usage
USDA-ARS's Scientific Manuscript database
Fusarium (Hypocreales, Nectriaceae) is one of the most important and systematically challenging groups of mycotoxigenic, plant pathogenic, and human pathogenic fungi. We conducted maximum likelihood (ML), maximum parsimony (MP) and Bayesian (B) analyses on partial nucleotide sequences of genes encod...
Liu, Luxian; Jin, Xinjie; Chen, Nan; Li, Xian; Li, Pan; Fu, Chengxin
2015-01-01
Phylogenetic relationships among Chinese species of Morella (Myricaceae) are unresolved. Here, we use restriction site-associated DNA sequencing (RAD-seq) to identify candidate loci that will help in determining phylogenetic relationships among Morella rubra, M. adenophora, M. nana and M. esculenta. Three methods for inferring phylogeny, maximum parsimony (MP), maximum likelihood (ML) and Bayesian concordance, were applied to data sets including as many as 4253 RAD loci with 8360 parsimony informative variable sites. All three methods significantly favored the topology of (((M. rubra, M. adenophora), M. nana), M. esculenta). Two species from North America (M. cerifera and M. pensylvanica) were placed as sister to the four Chinese species. According to BEAST analysis, we deduced speciation of M. rubra to be at about the Miocene-Pliocene boundary (5.28 Ma). Intraspecific divergence in M. rubra occurred in the late Pliocene (3.39 Ma). From pooled data, we assembled 29378, 21902 and 23552 de novo contigs with an average length of 229, 234 and 234 bp for M. rubra, M. nana and M. esculenta respectively. The contigs were used to investigate functional classification of RAD tags in a BLASTX search. Additionally, we identified 3808 unlinked SNP sites across the four populations of M. rubra and discovered genes associated with fruit ripening and senescence, fruit quality and disease/defense metabolism based on KEGG database. PMID:26431030
ERIC Educational Resources Information Center
Casabianca, Jodi M.; Lewis, Charles
2015-01-01
Loglinear smoothing (LLS) estimates the latent trait distribution while making fewer assumptions about its form and maintaining parsimony, thus leading to more precise item response theory (IRT) item parameter estimates than standard marginal maximum likelihood (MML). This article provides the expectation-maximization algorithm for MML estimation…
Gâteblé, Gildas; Villegente, Matthieu; Fabre, Isabelle; Klein, Nicolas; Anger, Nicolas; Baskin, Carol C; Scutt, Charlie P
2017-01-01
Background and Aims Recent parsimony-based reconstructions suggest that seeds of early angiosperms had either morphophysiological or physiological dormancy, with the former considered as more probable. The aim of this study was to determine the class of seed dormancy present in Amborella trichopoda, the sole living representative of the most basal angiosperm lineage Amborellales, with a view to resolving fully the class of dormancy present at the base of the angiosperm clade. Methods Drupes of A. trichopoda without fleshy parts were germinated and dissected to observe their structure and embryo growth. Pre-treatments including acid scarification, gibberellin treatment and seed excision were tested to determine their influence on dormancy breakage and germination. Character-state mapping by maximum parsimony, incorporating data from the present work and published sources, was then used to determine the likely class of dormancy present in early angiosperms. Key Results Germination in A. trichopoda requires a warm stratification period of at least approx. 90 d, which is followed by endosperm swelling, causing the water-permeable pericarp–mesocarp envelope to split open. The embryo then grows rapidly within the seed, to radicle emergence some 17 d later and cotyledon emergence after an additional 24 d. Gibberellin treatment, acid scarification and excision of seeds from the surrounding drupe tissues all promoted germination by shortening the initial phase of dormancy, prior to embryo growth. Conclusions Seeds of A. trichopoda have non-deep simple morphophysiological dormancy, in which mechanical resistance of the pericarp–mesocarp envelope plays a key role in the initial physiological phase. Maximum parsimony analyses, including data obtained in the present work, indicate that morphophysiological dormancy is likely to be a plesiomorphic trait in flowering plants. The significance of this conclusion for studies of early angiosperm evolution is discussed. PMID:28087660
Fast Construction of Near Parsimonious Hybridization Networks for Multiple Phylogenetic Trees.
Mirzaei, Sajad; Wu, Yufeng
2016-01-01
Hybridization networks represent plausible evolutionary histories of species that are affected by reticulate evolutionary processes. An established computational problem on hybridization networks is constructing the most parsimonious hybridization network such that each of the given phylogenetic trees (called gene trees) is "displayed" in the network. There have been several previous approaches, including an exact method and several heuristics, for this NP-hard problem. However, the exact method is only applicable to a limited range of data, and heuristic methods can be less accurate and also slow sometimes. In this paper, we develop a new algorithm for constructing near parsimonious networks for multiple binary gene trees. This method is more efficient for large numbers of gene trees than previous heuristics. This new method also produces more parsimonious results on many simulated datasets as well as a real biological dataset than a previous method. We also show that our method produces topologically more accurate networks for many datasets.
USDA-ARS's Scientific Manuscript database
The phylogeny of Amaryllidaceae tribe Hippeastreae was inferred using chloroplast (3’ycf1, ndhF, trnL-F) and nuclear (ITS rDNA) sequence data under maximum parsimony and maximum likelihood frameworks. Network analyses were applied to resolve conflicting signals among data sets and putative scenarios...
USDA-ARS's Scientific Manuscript database
Fusarium (Hypocreales, Nectriaceae) is one of the most economically important and systematically challenging groups of mycotoxigenic phytopathogens and emergent human pathogens. We conducted maximum likelihood (ML), maximum parsimony (MP) and Bayesian (B) analyses on partial RNA polymerase largest (...
Statistical parsimony networks and species assemblages in Cephalotrichid nemerteans (nemertea).
Chen, Haixia; Strand, Malin; Norenburg, Jon L; Sun, Shichun; Kajihara, Hiroshi; Chernyshev, Alexey V; Maslakova, Svetlana A; Sundberg, Per
2010-09-21
It has been suggested that statistical parsimony network analysis could be used to get an indication of species represented in a set of nucleotide data, and the approach has been used to discuss species boundaries in some taxa. Based on 635 base pairs of the mitochondrial protein-coding gene cytochrome c oxidase I (COI), we analyzed 152 nemertean specimens using statistical parsimony network analysis with the connection probability set to 95%. The analysis revealed 15 distinct networks together with seven singletons. Statistical parsimony yielded three networks supporting the species status of Cephalothrix rufifrons, C. major and C. spiralis as they currently have been delineated by morphological characters and geographical location. Many other networks contained haplotypes from nearby geographical locations. Cladistic structure by maximum likelihood analysis overall supported the network analysis, but indicated a false positive result where subnetworks should have been connected into one network/species. This probably is caused by undersampling of the intraspecific haplotype diversity. Statistical parsimony network analysis provides a rapid and useful tool for detecting possible undescribed/cryptic species among cephalotrichid nemerteans based on COI gene. It should be combined with phylogenetic analysis to get indications of false positive results, i.e., subnetworks that would have been connected with more extensive haplotype sampling.
Ali, Syeda Kauser; Baig, Lubna Ansari; Violato, Claudio; Zahid, Onaiza
2017-01-01
Objectives: This study was conducted to adduce evidence of validity for admissions tests and processes and to identify a parsimonious model that predicts students’ academic achievement in Medical College. Methods: Psychometric study done on admission data and assessment scores for five years of medical studies at Aga Khan University Medical College, Pakistan using confirmatory factor analysis (CFA) and structural equation modeling (SEM). Sample included 276 medical students admitted in 2003, 2004 and 2005. Results: The SEM supported the existence of covariance between verbal reasoning, science and clinical knowledge for predicting achievement in medical school employing Maximum Likelihood (ML) estimations (n=112). Fit indices: χ²(21) = 59.70, p =<.0001; CFI = .873; RMSEA = 0.129; SRMR = 0.093. Conclusions: This study shows that, in addition to biology and chemistry, which have traditionally been used as major criteria for admission to medical colleges in Pakistan, mathematics has proven to be a better predictor for higher achievements in medical college. PMID:29067063
Using MOEA with Redistribution and Consensus Branches to Infer Phylogenies.
Min, Xiaoping; Zhang, Mouzhao; Yuan, Sisi; Ge, Shengxiang; Liu, Xiangrong; Zeng, Xiangxiang; Xia, Ningshao
2017-12-26
In recent years, more and more research on inferring phylogenies, an NP-hard problem, has focused on metaheuristics. Maximum Parsimony and Maximum Likelihood are two effective ways to conduct inference. Based on these methods, which can also be considered as the optimality criteria for phylogenies, various kinds of multi-objective metaheuristics have been used to reconstruct phylogenies. However, combining these two time-consuming methods makes those multi-objective metaheuristics slower than single-objective ones. Therefore, we propose a novel multi-objective optimization algorithm, MOEA-RC, to accelerate the process of rebuilding phylogenies using structural information of elites in current populations. We compare MOEA-RC with two representative multi-objective algorithms, MOEA/D and NSGA-II, and a non-consensus version of MOEA-RC on three real-world datasets. The results show that, within a given number of iterations, MOEA-RC achieves better solutions than the other algorithms.
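To make the parsimony criterion mentioned in this abstract concrete, the sketch below scores a fixed tree with Fitch's algorithm, the standard way to count the minimum number of state changes on a rooted binary tree. It is an illustration only and unrelated to MOEA-RC's own code; the function names, the nested-tuple tree format and the toy alignment are all assumptions made for the example.

```python
def fitch_score(tree, states):
    """Minimum number of state changes for one character on a rooted binary tree."""
    changes = 0

    def state_set(node):
        nonlocal changes
        if isinstance(node, str):           # leaf: its observed state
            return {states[node]}
        left, right = node
        a, b = state_set(left), state_set(right)
        if a & b:                           # children agree on some state
            return a & b
        changes += 1                        # disagreement costs one change
        return a | b

    state_set(tree)
    return changes


def parsimony_score(tree, alignment):
    """Sum the Fitch score over all alignment columns (equal site weights)."""
    length = len(next(iter(alignment.values())))
    return sum(
        fitch_score(tree, {taxon: seq[i] for taxon, seq in alignment.items()})
        for i in range(length)
    )


# Toy alignment (made-up data) and two candidate topologies.
alignment = {"A": "AAG", "B": "AAG", "C": "CAT", "D": "CGT"}
tree_1 = (("A", "B"), ("C", "D"))
tree_2 = (("A", "C"), ("B", "D"))
print(parsimony_score(tree_1, alignment), parsimony_score(tree_2, alignment))
```

A search heuristic, single- or multi-objective, would evaluate many candidate topologies with a score of this kind and keep the lowest-scoring ones.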
A proof of the DBRF-MEGN method, an algorithm for deducing minimum equivalent gene networks
2011-01-01
Background We previously developed the DBRF-MEGN (difference-based regulation finding-minimum equivalent gene network) method, which deduces the most parsimonious signed directed graphs (SDGs) consistent with expression profiles of single-gene deletion mutants. However, until the present study, we have not presented the details of the method's algorithm or a proof of the algorithm. Results We describe in detail the algorithm of the DBRF-MEGN method and prove that the algorithm deduces all of the exact solutions of the most parsimonious SDGs consistent with expression profiles of gene deletion mutants. Conclusions The DBRF-MEGN method provides all of the exact solutions of the most parsimonious SDGs consistent with expression profiles of gene deletion mutants. PMID:21699737
Johnson, Tania Aspasia; Iyengar, Arati
2015-01-01
Sturgeons and paddlefish are freshwater fish which are highly valued for their caviar. Despite the fact that every single species of sturgeon and paddlefish is listed under CITES, there are reports of illegal trade in caviar where products are deliberately mislabeled. Three samples of caviar purchased in the United Kingdom were investigated for accurate CITES labeling using COI and cyt b sequencing. Initial species identification was carried out using BLAST followed by phylogenetic analyses using both maximum parsimony and maximum likelihood methods. Results showed no evidence for mislabeling with respect to CITES labels in any of the three samples, but we observed clear evidence for a case of misleading the customer in one sample. © 2014 American Academy of Forensic Sciences.
Merz, Clayton; Catchen, Julian M; Hanson-Smith, Victor; Emerson, Kevin J; Bradshaw, William E; Holzapfel, Christina M
2013-01-01
Herein we tested the repeatability of phylogenetic inference based on high throughput sequencing by increased taxon sampling using our previously published techniques in the pitcher-plant mosquito, Wyeomyia smithii in North America. We sampled 25 natural populations drawn from different localities nearby 21 previous collection localities and used these new data to construct a second, independent phylogeny, expressly to test the reproducibility of phylogenetic patterns. Comparison of trees between the two data sets based on both maximum parsimony and maximum likelihood with Bayesian posterior probabilities showed close correspondence in the grouping of the most southern populations into clear clades. However, discrepancies emerged, particularly in the middle of W. smithii's current range near the previous maximum extent of the Laurentide Ice Sheet, especially concerning the most recent common ancestor to mountain and northern populations. Combining all 46 populations from both studies into a single maximum parsimony tree and taking into account the post-glacial historical biogeography of associated flora provided an improved picture of W. smithii's range expansion in North America. In a more general sense, we propose that extensive taxon sampling, especially in areas of known geological disruption is key to a comprehensive approach to phylogenetics that leads to biologically meaningful phylogenetic inference.
A simplified parsimonious higher order multivariate Markov chain model
NASA Astrophysics Data System (ADS)
Wang, Chao; Yang, Chuan-sheng
2017-09-01
In this paper, a simplified parsimonious higher-order multivariate Markov chain model (SPHOMMCM) is presented. Moreover, a parameter estimation method for TPHOMMCM is given. Numerical experiments show the effectiveness of TPHOMMCM.
A tridiagonal parsimonious higher order multivariate Markov chain model
NASA Astrophysics Data System (ADS)
Wang, Chao; Yang, Chuan-sheng
2017-09-01
In this paper, we present a tridiagonal parsimonious higher-order multivariate Markov chain model (TPHOMMCM). Moreover, an estimation method for the parameters in TPHOMMCM is given. Numerical experiments illustrate the effectiveness of TPHOMMCM.
Choi, Ickwon; Kattan, Michael W; Wells, Brian J; Yu, Changhong
2012-01-01
In medical society, the prognostic models, which use clinicopathologic features and predict prognosis after a certain treatment, have been externally validated and used in practice. In recent years, most research has focused on high dimensional genomic data and small sample sizes. Since clinically similar but molecularly heterogeneous tumors may produce different clinical outcomes, the combination of clinical and genomic information, which may be complementary, is crucial to improve the quality of prognostic predictions. However, there is a lack of an integrating scheme for clinic-genomic models due to the P ≥ N problem, in particular, for a parsimonious model. We propose a methodology to build a reduced yet accurate integrative model using a hybrid approach based on the Cox regression model, which uses several dimension reduction techniques, L₂ penalized maximum likelihood estimation (PMLE), and resampling methods to tackle the problem. The predictive accuracy of the modeling approach is assessed by several metrics via an independent and thorough scheme to compare competing methods. In breast cancer data studies on a metastasis and death event, we show that the proposed methodology can improve prediction accuracy and build a final model with a hybrid signature that is parsimonious when integrating both types of variables.
Khan, Haseeb A; Arif, Ibrahim A; Bahkali, Ali H; Al Farhan, Ahmad H; Al Homaidan, Ali A
2008-10-06
This investigation aimed to compare the inference of antelope phylogenies resulting from the 16S rRNA, cytochrome-b (cyt-b) and d-loop segments of mitochondrial DNA using three different computational models including Bayesian (BA), maximum parsimony (MP) and unweighted pair group method with arithmetic mean (UPGMA). The respective nucleotide sequences of three Oryx species (Oryx leucoryx, Oryx dammah and Oryx gazella) and an out-group (Addax nasomaculatus) were aligned and subjected to BA, MP and UPGMA models for comparing the topologies of respective phylogenetic trees. The 16S rRNA region possessed the highest frequency of conserved sequences (97.65%) followed by cyt-b (94.22%) and d-loop (87.29%). There were few transitions (2.35%) and no transversions in 16S rRNA as compared to cyt-b (5.61% transitions and 0.17% transversions) and d-loop (11.57% transitions and 1.14% transversions) while comparing the four taxa. All three mitochondrial segments clearly differentiated the genus Addax from Oryx using the BA or UPGMA models. The topologies of all the gamma-corrected Bayesian trees were identical irrespective of the marker type. The UPGMA trees resulting from 16S rRNA and d-loop sequences were also identical (Oryx dammah grouped with Oryx leucoryx) to Bayesian trees except that the UPGMA tree based on cyt-b showed a slightly different phylogeny (Oryx dammah grouped with Oryx gazella) with a low bootstrap support. However, the MP model failed to differentiate the genus Addax from Oryx. These findings demonstrate the efficiency and robustness of BA and UPGMA methods for phylogenetic analysis of antelopes using mitochondrial markers.
Khan, Haseeb A.; Arif, Ibrahim A.; Bahkali, Ali H.; Al Farhan, Ahmad H.; Al Homaidan, Ali A.
2008-01-01
This investigation aimed to compare the inference of antelope phylogenies resulting from the 16S rRNA, cytochrome-b (cyt-b) and d-loop segments of mitochondrial DNA using three different computational models including Bayesian (BA), maximum parsimony (MP) and unweighted pair group method with arithmetic mean (UPGMA). The respective nucleotide sequences of three Oryx species (Oryx leucoryx, Oryx dammah and Oryx gazella) and an out-group (Addax nasomaculatus) were aligned and subjected to BA, MP and UPGMA models for comparing the topologies of respective phylogenetic trees. The 16S rRNA region possessed the highest frequency of conserved sequences (97.65%) followed by cyt-b (94.22%) and d-loop (87.29%). There were few transitions (2.35%) and no transversions in 16S rRNA as compared to cyt-b (5.61% transitions and 0.17% transversions) and d-loop (11.57% transitions and 1.14% transversions) while comparing the four taxa. All three mitochondrial segments clearly differentiated the genus Addax from Oryx using the BA or UPGMA models. The topologies of all the gamma-corrected Bayesian trees were identical irrespective of the marker type. The UPGMA trees resulting from 16S rRNA and d-loop sequences were also identical (Oryx dammah grouped with Oryx leucoryx) to Bayesian trees except that the UPGMA tree based on cyt-b showed a slightly different phylogeny (Oryx dammah grouped with Oryx gazella) with a low bootstrap support. However, the MP model failed to differentiate the genus Addax from Oryx. These findings demonstrate the efficiency and robustness of BA and UPGMA methods for phylogenetic analysis of antelopes using mitochondrial markers. PMID:19204824
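The transition and transversion frequencies reported in this abstract rest on a simple classification of aligned sites, which can be sketched as below. This is a hedged pairwise illustration, not the study's procedure (which compared four taxa and does not state its software here); the function name `count_substitutions` and the two short sequences are invented for the example.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def count_substitutions(seq_a, seq_b):
    """Count transitions and transversions between two aligned DNA sequences.

    Returns (transitions, transversions, compared_sites); sites containing a
    gap or an ambiguity code in either sequence are skipped.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    transitions = transversions = compared = 0
    for x, y in zip(seq_a.upper(), seq_b.upper()):
        if x not in "ACGT" or y not in "ACGT":
            continue                        # gap or ambiguity code
        compared += 1
        if x == y:
            continue                        # conserved site
        if {x, y} <= PURINES or {x, y} <= PYRIMIDINES:
            transitions += 1                # purine<->purine or pyrimidine<->pyrimidine
        else:
            transversions += 1              # purine<->pyrimidine
    return transitions, transversions, compared


ts, tv, n = count_substitutions("ACGTACGT-A", "GCGTATGTAC")
print(f"transitions: {100 * ts / n:.2f}%  transversions: {100 * tv / n:.2f}%")
```

Dividing each count by the number of compared sites gives percentages of the kind quoted above.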
Schminkey, Donna L; von Oertzen, Timo; Bullock, Linda
2016-08-01
With increasing access to population-based data and electronic health records for secondary analysis, missing data are common. In the social and behavioral sciences, missing data frequently are handled with multiple imputation methods or full information maximum likelihood (FIML) techniques, but healthcare researchers have not embraced these methodologies to the same extent and more often use either traditional imputation techniques or complete case analysis, which can compromise power and introduce unintended bias. This article is a review of options for handling missing data, concluding with a case study demonstrating the utility of multilevel structural equation modeling using full information maximum likelihood (MSEM with FIML) to handle large amounts of missing data. MSEM with FIML is a parsimonious and hypothesis-driven strategy to cope with large amounts of missing data without compromising power or introducing bias. This technique is relevant for nurse researchers faced with ever-increasing amounts of electronic data and decreasing research budgets. © 2016 Wiley Periodicals, Inc.
A parsimonious tree-grow method for haplotype inference.
Li, Zhenping; Zhou, Wenfeng; Zhang, Xiang-Sun; Chen, Luonan
2005-09-01
Haplotype information has become increasingly important in analyzing fine-scale molecular genetics data, for applications such as disease gene mapping and drug design. Parsimony haplotyping is an NP-hard haplotyping problem. In this paper, we aim to develop a novel algorithm for the haplotype inference problem with the parsimony criterion, based on a parsimonious tree-grow method (PTG). PTG is a heuristic algorithm that can find the minimum number of distinct haplotypes based on the criterion of keeping all genotypes resolved during the tree-grow process. In addition, a block-partitioning method is also proposed to improve the computational efficiency. We show that the proposed approach is not only effective with a high accuracy, but also very efficient, with a computational complexity on the order of O(m^2 n) time for n single nucleotide polymorphism sites in m individual genotypes. The software is available upon request from the authors, or from http://zhangroup.aporc.org/bioinfo/ptg/. Contact: chen@elec.osaka-sandai.ac.jp. Supporting materials are available from http://zhangroup.aporc.org/bioinfo/ptg/bti572supplementary.pdf
Using tree diversity to compare phylogenetic heuristics.
Sul, Seung-Jin; Matthews, Suzanne; Williams, Tiffani L
2009-04-29
Evolutionary trees are family trees that represent the relationships between a group of organisms. Phylogenetic heuristics are used to search stochastically for the best-scoring trees in tree space. Given that better tree scores are believed to be better approximations of the true phylogeny, traditional evaluation techniques have used tree scores to determine the heuristics that find the best scores in the fastest time. We develop new techniques to evaluate phylogenetic heuristics based on both tree scores and topologies to compare Pauprat and Rec-I-DCM3, two popular Maximum Parsimony search algorithms. Our results show that although Pauprat and Rec-I-DCM3 find the trees with the same best scores, topologically these trees are quite different. Furthermore, the Rec-I-DCM3 trees cluster distinctly from the Pauprat trees. In addition to our heatmap visualizations of using parsimony scores and the Robinson-Foulds distance to compare best-scoring trees found by the two heuristics, we also develop entropy-based methods to show the diversity of the trees found. Overall, Pauprat identifies more diverse trees than Rec-I-DCM3. Our work shows that there is value in comparing heuristics beyond the parsimony scores that they find. Pauprat is a slower heuristic than Rec-I-DCM3. However, our work shows that there is tremendous value in using Pauprat to reconstruct trees, especially since it finds identically scoring but topologically distinct trees. Hence, instead of discounting Pauprat, effort should go into improving its implementation. Ultimately, improved performance measures lead to better phylogenetic heuristics and will result in better approximations of the true evolutionary history of the organisms of interest.
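The idea of comparing equally scoring trees by topology can be sketched with a small Robinson-Foulds computation. This is an illustrative simplification under stated assumptions, not the authors' pipeline: trees are written as nested Python tuples of leaf names, and the distance is the rooted (clade-based) variant, whereas phylogenetic software often uses the unrooted bipartition form. The names `clades` and `rf_distance` are hypothetical.

```python
def clades(tree):
    """Collect the non-trivial clades of a rooted tree given as nested tuples."""
    found = set()

    def leaves(node):
        if isinstance(node, str):            # a leaf is a taxon name
            return frozenset([node])
        members = frozenset().union(*(leaves(child) for child in node))
        found.add(members)                   # record the clade under this node
        return members

    all_leaves = leaves(tree)
    found.discard(all_leaves)                # the root clade is shared by every tree
    return found

def rf_distance(tree_a, tree_b):
    """Rooted Robinson-Foulds distance: clades present in exactly one tree."""
    return len(clades(tree_a) ^ clades(tree_b))

# Hypothetical four-taxon trees.
balanced = (("A", "B"), ("C", "D"))
swapped = (("A", "C"), ("B", "D"))
ladder = ((("A", "B"), "C"), "D")
```

Two trees can have the same parsimony score and still be far apart under this measure, which is the situation the abstract reports for Pauprat and Rec-I-DCM3.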
Turner, Barbara; Paun, Ovidiu; Munzinger, Jérôme; Chase, Mark W.; Samuel, Rosabelle
2016-01-01
Background and Aims: Some plant groups, especially on islands, have been shaped by strong ancestral bottlenecks and rapid, recent radiation of phenotypic characters. Single molecular markers are often not informative enough for phylogenetic reconstruction in such plant groups. Whole plastid genomes and nuclear ribosomal DNA (nrDNA) are viewed by many researchers as sources of information for phylogenetic reconstruction of groups in which expected levels of divergence in standard markers are low. Here we evaluate the usefulness of these data types to resolve phylogenetic relationships among closely related Diospyros species. Methods: Twenty-two closely related Diospyros species from New Caledonia were investigated using whole plastid genomes and nrDNA data from low-coverage next-generation sequencing (NGS). Phylogenetic trees were inferred using maximum parsimony, maximum likelihood and Bayesian inference on separate plastid and nrDNA and combined matrices. Key Results: The plastid and nrDNA sequences were, singly and together, unable to provide well supported phylogenetic relationships among the closely related New Caledonian Diospyros species. In the nrDNA, a 6-fold greater percentage of parsimony-informative characters compared with plastid DNA was found, but the total number of informative sites was greater for the much larger plastid DNA genomes. Combining the plastid and nuclear data improved resolution. Plastid results showed a trend towards geographical clustering of accessions rather than following taxonomic species. Conclusions: In plant groups in which multiple plastid markers are not sufficiently informative, an investigation at the level of the entire plastid genome may also not be sufficient for detailed phylogenetic reconstruction.
Sequencing of complete plastid genomes and nrDNA repeats seems to clarify some relationships among the New Caledonian Diospyros species, but the higher percentage of parsimony-informative characters in nrDNA compared with plastid DNA did not help to resolve the phylogenetic tree because the total number of variable sites was much lower than in the entire plastid genome. The geographical clustering of the individuals against a background of overall low sequence divergence could indicate transfer of plastid genomes due to hybridization and introgression following secondary contact. PMID:27098088
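The distinction drawn here between the percentage and the total number of parsimony-informative characters can be illustrated with a short sketch. It uses the standard definition (a site is parsimony-informative when at least two character states each occur in at least two sequences); the function names, the choice to ignore gap and ambiguity symbols, and the toy alignment are assumptions made for illustration and are not taken from the study.

```python
from collections import Counter

def is_parsimony_informative(column, ignore="-?N"):
    """True if at least two states each occur in at least two sequences."""
    counts = Counter(c for c in column.upper() if c not in ignore)
    return sum(1 for n in counts.values() if n >= 2) >= 2

def informative_summary(alignment):
    """Return (number of informative sites, fraction of informative sites)."""
    columns = ["".join(col) for col in zip(*alignment)]
    n_informative = sum(is_parsimony_informative(col) for col in columns)
    return n_informative, n_informative / len(columns)

# Hypothetical alignment: four sequences, four aligned sites.
alignment = ["ACGT", "ACGA", "AAGA", "AATT"]
count, fraction = informative_summary(alignment)
```

A short region with a high fraction can still contribute fewer informative sites in total than a much longer region with a low fraction, which is the pattern the abstract describes for nrDNA versus whole plastid genomes.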
NASA Astrophysics Data System (ADS)
Wang, Chao; Yang, Chuan-sheng
2017-09-01
In this paper, we present a simplified parsimonious higher-order multivariate Markov chain model with a new convergence condition (TPHOMMCM-NCC). Moreover, an estimation method for the parameters in TPHOMMCM-NCC is given. Numerical experiments illustrate the effectiveness of TPHOMMCM-NCC.
Varela, Eduardo S; Lima, João P M S; Galdino, Alexsandro S; Pinto, Luciano da S; Bezerra, Walderly M; Nunes, Edson P; Alves, Maria A O; Grangeiro, Thalles B
2004-01-01
The complete sequences of nuclear ribosomal DNA (nrDNA) internal transcribed spacer regions (ITS/5.8S) were determined for species belonging to six genera from the subtribe Diocleinae as well as for the anomalous genera Calopogonium and Pachyrhizus. Phylogenetic trees constructed by distance matrix, maximum parsimony and maximum likelihood methods showed that Calopogonium and Pachyrhizus were outside the clade Diocleinae (Canavalia, Camptosema, Cratylia, Dioclea, Cymbosema, and Galactia). This finding supports previous morphological, phytochemical, and molecular evidence that Calopogonium and Pachyrhizus do not belong to the subtribe Diocleinae. Within the true Diocleinae clade, the clustering of genera and species was congruent with morphology-based classifications, suggesting that ITS/5.8S sequences can provide enough informative sites to allow resolution below the genus level. This is the first evidence of the phylogeny of subtribe Diocleinae based on nuclear DNA sequences.
Mugleston, Joseph D; Song, Hojun; Whiting, Michael F
2013-12-01
The phylogenetic relationships of Tettigoniidae (katydids and bush-crickets) were inferred using molecular sequence data. Six genes (18S rDNA, 28S rDNA, Cytochrome Oxidase II, Histone 3, Tubulin Alpha I, and Wingless) were sequenced for 135 ingroup taxa representing 16 of the 19 extant katydid subfamilies. Five subfamilies (Tettigoniinae, Pseudophyllinae, Mecopodinae, Meconematinae, and Listroscelidinae) were found to be paraphyletic under various tree reconstruction methods (Maximum Likelihood, Bayesian Inference and Maximum Parsimony). Seven subfamilies - Conocephalinae, Hetrodinae, Hexacentrinae, Saginae, Phaneropterinae, Phyllophorinae, and Lipotactinae - were each recovered as well-supported monophyletic groups. We mapped the small and exposed thoracic auditory spiracle (a defining character of the subfamily Pseudophyllinae) and found it to be homoplasious. We also found that the leaf-like wings of katydids have been derived independently in at least six lineages. Copyright © 2013 Elsevier Inc. All rights reserved.
Hu, Chao; Tian, Huaizhen; Li, Hongqing; Hu, Aiqun; Xing, Fuwu; Bhattacharjee, Avishek; Hsu, Tianchuan; Kumar, Pankaj; Chung, Shihwen
2016-01-01
A molecular phylogeny of Asiatic species of Goodyera (Orchidaceae, Cranichideae, Goodyerinae) based on the nuclear ribosomal internal transcribed spacer (ITS) region and two chloroplast loci (matK and trnL-F) was presented. Thirty-five species represented by 132 samples of Goodyera were analyzed, along with other 27 genera/48 species, using Pterostylis longifolia and Chloraea gaudichaudii as outgroups. Bayesian inference, maximum parsimony and maximum likelihood methods were used to reveal the intrageneric relationships of Goodyera and its intergeneric relationships to related genera. The results indicate that: 1) Goodyera is not monophyletic; 2) Goodyera could be divided into four sections, viz., Goodyera, Otosepalum, Reticulum and a new section; 3) sect. Reticulum can be further divided into two subsections, viz., Reticulum and Foliosum, whereas sect. Goodyera can in turn be divided into subsections Goodyera and a new subsection. PMID:26927946
Seeking parsimony in hydrology and water resources technology
NASA Astrophysics Data System (ADS)
Koutsoyiannis, D.
2009-04-01
The principle of parsimony, also known as the principle of simplicity, the principle of economy and Ockham's razor, advises scientists to prefer the simplest theory among those that fit the data equally well. In this, it is an epistemic principle but reflects an ontological characterization that the universe is ultimately parsimonious. Is this principle useful and can it really be reconciled with, and implemented to, our modelling approaches of complex hydrological systems, whose elements and events are extraordinarily numerous, different and unique? The answer underlying the mainstream hydrological research of the last two decades seems to be negative. Hopes were invested to the power of computers that would enable faithful and detailed representation of the diverse system elements and the hydrological processes, based on merely "first principles" and resulting in "physically-based" models that tend to approach in complexity the real world systems. Today the account of such research endeavour seems not positive, as it did not improve model predictive capacity and processes comprehension. A return to parsimonious modelling seems to be again the promising route. The experience from recent research and from comparisons of parsimonious and complicated models indicates that the former can facilitate insight and comprehension, improve accuracy and predictive capacity, and increase efficiency. In addition - and despite aspiration that "physically based" models will have lower data requirements and, even, they ultimately become "data-free" - parsimonious models require fewer data to achieve the same accuracy with more complicated models. Naturally, the concepts that reconcile the simplicity of parsimonious models with the complexity of hydrological systems are probability theory and statistics. 
Probability theory provides the theoretical basis for moving from a microscopic to a macroscopic view of phenomena, by mapping sets of diverse elements and events of hydrological systems to single numbers (a probability or an expected value), and statistics provides the empirical basis of summarizing data, making inference from them, and supporting decision making in water resource management. Unfortunately, the current state of the art in probability, statistics and their union, often called stochastics, is not fully satisfactory for the needs of modelling of hydrological and water resource systems. A first problem is that stochastic modelling has traditionally relied on classical statistics, which is based on the independent "coin-tossing" prototype, rather than on the study of real-world systems whose behaviour is very different from the classical prototype. A second problem is that the stochastic models (particularly the multivariate ones) are often not parsimonious themselves. Therefore, substantial advancement of stochastics is necessary in a new paradigm of parsimonious hydrological modelling. These ideas are illustrated using several examples, namely: (a) hydrological modelling of a karst system in Bosnia and Herzegovina using three different approaches ranging from parsimonious to detailed "physically-based"; (b) parsimonious modelling of a peculiar modified catchment in Greece; (c) a stochastic approach that can replace parameter-excessive ARMA-type models with a generalized algorithm that produces any shape of autocorrelation function (consistent with the accuracy provided by the data) using a couple of parameters; (d) a multivariate stochastic approach which replaces a huge number of parameters estimated from data with coefficients estimated by the principle of maximum entropy; and (e) a parsimonious approach for decision making in multi-reservoir systems using a handful of parameters instead of thousands of decision variables.
A Distance Measure for Genome Phylogenetic Analysis
NASA Astrophysics Data System (ADS)
Cao, Minh Duc; Allison, Lloyd; Dix, Trevor
Phylogenetic analyses of species based on single genes or parts of the genomes are often inconsistent because of factors such as variable rates of evolution and horizontal gene transfer. The availability of more and more sequenced genomes allows phylogeny construction from complete genomes that is less sensitive to such inconsistency. For such long sequences, construction methods like maximum parsimony and maximum likelihood are often not possible due to their intensive computational requirements. Another class of tree construction methods, namely distance-based methods, requires a measure of distance between any two genomes. Some measures, such as evolutionary edit distance of gene order and gene content, are computationally expensive or do not perform well when the gene content of the organisms is similar. This study presents an information theoretic measure of genetic distances between genomes based on the biological compression algorithm expert model. We demonstrate that our distance measure can be applied to reconstruct the consensus phylogenetic tree of a number of Plasmodium parasites from their genomes, the statistical bias of which would mislead conventional analysis methods. Our approach is also used to successfully construct a plausible evolutionary tree for the γ-Proteobacteria group whose genomes are known to contain many horizontally transferred genes.
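The general idea of a compression-based distance can be sketched as follows. This is not the expert-model measure described in the abstract: it swaps in the generic normalized compression distance (NCD) with the general-purpose `zlib` compressor from the Python standard library, purely to show how shared information between two sequences lowers the distance. The helper names and the synthetic sequences are assumptions.

```python
import random
import zlib

def compressed_size(data: bytes) -> int:
    """Length in bytes of the zlib-compressed input (maximum compression level)."""
    return len(zlib.compress(data, 9))

def ncd(x: bytes, y: bytes) -> float:
    """Normalized compression distance between two byte strings."""
    cx, cy = compressed_size(x), compressed_size(y)
    cxy = compressed_size(x + y)
    return (cxy - min(cx, cy)) / max(cx, cy)

def random_genome(length: int, seed: int) -> bytes:
    """Synthetic DNA-like sequence; stands in for real genome data."""
    rng = random.Random(seed)
    return "".join(rng.choice("ACGT") for _ in range(length)).encode()

ancestor = random_genome(2000, seed=1)
relative = ancestor[:1000] + b"T" + ancestor[1001:]  # differs at no more than one site
unrelated = random_genome(2000, seed=2)

d_close = ncd(ancestor, relative)
d_far = ncd(ancestor, unrelated)
```

A pairwise matrix of such distances could then be passed to any distance-based tree construction method. A compressor designed for DNA, as in the study, would be expected to capture biological signal far better than `zlib`.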
Rylková, K; Tůmová, E; Brožová, A; Jankovská, I; Vadlejch, J; Čadková, Z; Frýdlová, J; Peřinková, P; Langrová, I; Chodová, D; Nechybová, S; Scháňková, Š
2015-11-01
Trichuris sp. individuals were collected from Myocastor coypus from fancy breeder farms in the Czech Republic. Using morphological and biometrical methods, 30 female and 30 male nematodes were identified as Trichuris myocastoris. This paper presents the first molecular description of this species. The ribosomal DNA (rDNA) region, consisting of internal transcribed spacer (ITS)-1, the 5.8S gene and ITS-2, was sequenced. Based on an analysis of 651 bp, T. myocastoris was found to be different from any other Trichuris species for which published sequencing of the ITS region is available. The phylogenetic relationships were estimated using maximum parsimony methods and Bayesian analyses. T. myocastoris was found to be significantly more closely related to Trichuris of rodents than to those of ruminants.
Goggin, C L; Barker, S C
1993-07-01
Parasites of the genus Perkinsus destroy marine molluscs worldwide. Their phylogenetic position within the kingdom Protista is controversial. Nucleotide sequence data (1792 bp) from the small subunit rRNA gene of Perkinsus sp. from Anadara trapezia (Mollusca: Bivalvia) from Moreton Bay, Queensland, was used to examine the phylogenetic affinities of this enigmatic genus. These data were aligned with nucleotide sequences from 6 apicomplexans, 3 ciliates, 3 flagellates, a dinoflagellate, 3 fungi, maize and human. Phylogenetic trees were constructed after analysis with maximum parsimony and distance matrix methods. Our analyses indicate that Perkinsus is phylogenetically closer to dinoflagellates and to coccidean and piroplasm apicomplexans than to fungi or flagellates.
Tamura, Koichiro; Peterson, Daniel; Peterson, Nicholas; Stecher, Glen; Nei, Masatoshi; Kumar, Sudhir
2011-01-01
Comparative analysis of molecular sequence data is essential for reconstructing the evolutionary histories of species and inferring the nature and extent of selective forces shaping the evolution of genes and species. Here, we announce the release of Molecular Evolutionary Genetics Analysis version 5 (MEGA5), which is a user-friendly software for mining online databases, building sequence alignments and phylogenetic trees, and using methods of evolutionary bioinformatics in basic biology, biomedicine, and evolution. The newest addition in MEGA5 is a collection of maximum likelihood (ML) analyses for inferring evolutionary trees, selecting best-fit substitution models (nucleotide or amino acid), inferring ancestral states and sequences (along with probabilities), and estimating evolutionary rates site-by-site. In computer simulation analyses, ML tree inference algorithms in MEGA5 compared favorably with other software packages in terms of computational efficiency and the accuracy of the estimates of phylogenetic trees, substitution parameters, and rate variation among sites. The MEGA user interface has now been enhanced to be activity driven to make it easier for the use of both beginners and experienced scientists. This version of MEGA is intended for the Windows platform, and it has been configured for effective use on Mac OS X and Linux desktops. It is available free of charge from http://www.megasoftware.net. PMID:21546353
Supertrees Based on the Subtree Prune-and-Regraft Distance
Whidden, Christopher; Zeh, Norbert; Beiko, Robert G.
2014-01-01
Supertree methods reconcile a set of phylogenetic trees into a single structure that is often interpreted as a branching history of species. A key challenge is combining conflicting evolutionary histories that are due to artifacts of phylogenetic reconstruction and phenomena such as lateral gene transfer (LGT). Many supertree approaches use optimality criteria that do not reflect underlying processes, have known biases, and may be unduly influenced by LGT. We present the first method to construct supertrees by using the subtree prune-and-regraft (SPR) distance as an optimality criterion. Although calculating the rooted SPR distance between a pair of trees is NP-hard, our new maximum agreement forest-based methods can reconcile trees with hundreds of taxa and > 50 transfers in fractions of a second, which enables repeated calculations during the course of an iterative search. Our approach can accommodate trees in which uncertain relationships have been collapsed to multifurcating nodes. Using a series of benchmark datasets simulated under plausible rates of LGT, we show that SPR supertrees are more similar to correct species histories than supertrees based on parsimony or Robinson–Foulds distance criteria. We successfully constructed an SPR supertree from a phylogenomic dataset of 40,631 gene trees that covered 244 genomes representing several major bacterial phyla. Our SPR-based approach also allowed direct inference of highways of gene transfer between bacterial classes and genera. A small number of these highways connect genera in different phyla and can highlight specific genes implicated in long-distance LGT. [Lateral gene transfer; matrix representation with parsimony; phylogenomics; prokaryotic phylogeny; Robinson–Foulds; subtree prune-and-regraft; supertrees.] PMID:24695589
USDA-ARS's Scientific Manuscript database
Theileria equi is a tick-borne Apicomplexan hemoparasite that causes equine piroplasmosis (EP). This parasite has a worldwide distribution, but until recent outbreaks the United States has been considered to be free of EP. Maximum parsimony analysis of 18S rRNA gene sequences of North American T. eq...
Szövényi, Péter; Hock, Zsófia; Urmi, Edwin; Schneller, Jakob J
2006-01-01
The chloroplast phylogeography of two peat mosses (Sphagnum fimbriatum and Sphagnum squarrosum) with similar distributions but different life history characteristics was investigated in Europe. Our main aim was to test whether similar distributions reflect similar phylogeographic and phylodemographic processes. Accessions covering the European distributions of the species were collected and approx. 2000 bp of the chloroplast genome of each species was sequenced. Maximum parsimony, statistical parsimony and phylodemographic analyses were used to address the question of whether these species with similar distributions show evidence of similar phylogeographic and phylodemographic processes. The chloroplast haplotypes of the currently spreading species S. fimbriatum showed strong geographic structure, whereas those of S. squarrosum, which has stable historical population sizes, showed only very weak geographic affinity and were widely distributed. We hypothesize that S. fimbriatum survived the last glaciations along the Atlantic coast of Europe, whereas S. squarrosum had numerous, scattered refugia in Europe. The dominance of one haplotype of S. fimbriatum across almost all of Europe suggests rapid colonization after the last glacial maximum. We hypothesize that high colonizing ability is an inherent characteristic of the species and its recent expansion in Europe is a response to climate change.
Yuri, Tamaki; Kimball, Rebecca T.; Harshman, John; Bowie, Rauri C. K.; Braun, Michael J.; Chojnowski, Jena L.; Han, Kin-Lan; Hackett, Shannon J.; Huddleston, Christopher J.; Moore, William S.; Reddy, Sushma; Sheldon, Frederick H.; Steadman, David W.; Witt, Christopher C.; Braun, Edward L.
2013-01-01
Insertion/deletion (indel) mutations, which are represented by gaps in multiple sequence alignments, have been used to examine phylogenetic hypotheses for some time. However, most analyses combine gap data with the nucleotide sequences in which they are embedded, probably because most phylogenetic datasets include few gap characters. Here, we report analyses of 12,030 gap characters from an alignment of avian nuclear genes using maximum parsimony (MP) and a simple maximum likelihood (ML) framework. Both trees were similar, and they exhibited almost all of the strongly supported relationships in the nucleotide tree, although neither gap tree supported many relationships that have proven difficult to recover in previous studies. Moreover, independent lines of evidence typically corroborated the nucleotide topology instead of the gap topology when they disagreed, although the number of conflicting nodes with high bootstrap support was limited. Filtering to remove short indels did not substantially reduce homoplasy or reduce conflict. Combined analyses of nucleotides and gaps resulted in the nucleotide topology, but with increased support, suggesting that gap data may prove most useful when analyzed in combination with nucleotide substitutions. PMID:24832669
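How a single gap character is scored under maximum parsimony, and what homoplasy looks like in that score, can be sketched with Fitch's small-parsimony algorithm. The abstract does not say which software or character coding was used, so this is a generic illustration under assumptions: gaps are coded as binary presence/absence (1/0), the tree is a rooted binary tree written as nested tuples, and the taxon names and function name `fitch_score` are hypothetical.

```python
def fitch_score(tree, states):
    """Minimum number of state changes for one character on a rooted binary tree.

    tree:   nested 2-tuples of leaf names, e.g. (("A", "B"), "C")
    states: dict mapping each leaf name to its observed state
    """
    changes = 0

    def state_set(node):
        nonlocal changes
        if isinstance(node, str):            # leaf: its observed state
            return {states[node]}
        left, right = (state_set(child) for child in node)
        common = left & right
        if common:                           # children agree: no change needed here
            return common
        changes += 1                         # children disagree: count one change
        return left | right

    state_set(tree)
    return changes

# Hypothetical five-taxon tree and two binary gap characters (1 = gap present).
tree = ((("A", "B"), "C"), ("D", "E"))
clean_gap = {"A": 1, "B": 1, "C": 0, "D": 0, "E": 0}
homoplasious_gap = {"A": 1, "B": 0, "C": 1, "D": 0, "E": 1}
```

A binary character that fits the tree perfectly needs one change. A score above one on a given tree indicates homoplasy, the quantity the abstract reports as not being substantially reduced by filtering short indels.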
Modeling the Evolution of Protein Domain Architectures Using Maximum Parsimony
Fong, Jessica H.; Geer, Lewis Y.; Panchenko, Anna R.; Bryant, Stephen H.
2007-01-01
Domains are basic evolutionary units of proteins and most proteins have more than one domain. Advances in domain modeling and collection are making it possible to annotate a large fraction of known protein sequences by a linear ordering of their domains, yielding their architecture. Protein domain architectures link evolutionarily related proteins and underscore their shared functions. Here, we attempt to better understand this association by identifying the evolutionary pathways by which extant architectures may have evolved. We propose a model of evolution in which architectures arise through rearrangements of inferred precursor architectures and acquisition of new domains. These pathways are ranked using a parsimony principle, whereby scenarios requiring the fewest number of independent recombination events, namely fission and fusion operations, are assumed to be more likely. Using a data set of domain architectures present in 159 proteomes that represent all three major branches of the tree of life allows us to estimate the history of over 85% of all architectures in the sequence database. We find that the distribution of rearrangement classes is robust with respect to alternative parsimony rules for inferring the presence of precursor architectures in ancestral species. Analyzing the most parsimonious pathways, we find 87% of architectures to gain complexity over time through simple changes, among which fusion events account for 5.6 times as many architectures as fission. Our results may be used to compute domain architecture similarities, for example, based on the number of historical recombination events separating them. Domain architecture “neighbors” identified in this way may lead to new insights about the evolution of protein function. PMID:17166515
Robles, María del Rosario; Cutillas, Cristina; Panei, Carlos Javier; Callejón, Rocío
2014-01-01
Populations of Trichuris spp. isolated from six species of sigmodontine rodents from Argentina were analyzed based on morphological characteristics and ITS2 (rDNA) region sequences. Molecular data provided an opportunity to discuss the phylogenetic relationships among the Trichuris spp. from North and South America (mainly from Argentina). Trichuris specimens were identified morphologically as Trichuris pardinasi, T. navonae, Trichuris sp. and Trichuris new species, described in this paper. Sequences analyzed by Maximum Parsimony, Maximum Likelihood and Bayesian inference methods showed four main clades corresponding with the four different species regardless of geographical origin and host species. These four species from sigmodontine rodents clustered together and separated from Trichuris species isolated from murine and arvicoline rodents (outgroup). Different genetic lineages were observed among Trichuris species from sigmodontine rodents, which supported the proposal of a new species. Moreover, host distribution showed correspondence with the different tribes within the subfamily Sigmodontinae. PMID:25393618
Callejón, Rocío; Robles, María Del Rosario; Panei, Carlos Javier; Cutillas, Cristina
2016-08-01
A molecular phylogenetic hypothesis is presented for the genus Trichuris based on sequence data from mitochondrial cytochrome c oxidase 1 (cox1) and cytochrome b (cob). The taxa consisted of nine populations of whipworm from five species of Sigmodontinae rodents from Argentina. Bayesian Inference, Maximum Parsimony, and Maximum Likelihood methods were used to infer phylogenies for each gene separately and also for the combined mitochondrial data and the combined mitochondrial and nuclear dataset. Phylogenetic results based on cox1 and cob mitochondrial DNA (mtDNA) revealed three strongly resolved clades corresponding to three different species (Trichuris navonae, Trichuris bainae, and Trichuris pardinasi) showing phylogeographic variation, but relationships among Trichuris species were poorly resolved. Phylogenetic reconstruction based on concatenated sequences had greater phylogenetic resolution for delimiting species and intraspecific populations of Trichuris than reconstruction based on partitioned genes. Thus, populations of T. bainae and T. pardinasi could be affected by geographical factors and parasite-host co-divergence.
Joint amalgamation of most parsimonious reconciled gene trees
Scornavacca, Celine; Jacox, Edwin; Szöllősi, Gergely J.
2015-01-01
Motivation: Traditionally, gene phylogenies have been reconstructed solely on the basis of molecular sequences; this, however, often does not provide enough information to distinguish between statistically equivalent relationships. To address this problem, several recent methods have incorporated information on the species phylogeny in gene tree reconstruction, leading to dramatic improvements in accuracy. Although probabilistic methods are able to estimate all model parameters but are computationally expensive, parsimony methods—generally computationally more efficient—require a prior estimate of parameters and of the statistical support. Results: Here, we present the Tree Estimation using Reconciliation (TERA) algorithm, a parsimony based, species tree aware method for gene tree reconstruction based on a scoring scheme combining duplication, transfer and loss costs with an estimate of the sequence likelihood. TERA explores all reconciled gene trees that can be amalgamated from a sample of gene trees. Using a large scale simulated dataset, we demonstrate that TERA achieves the same accuracy as the corresponding probabilistic method while being faster, and outperforms other parsimony-based methods in both accuracy and speed. Running TERA on a set of 1099 homologous gene families from complete cyanobacterial genomes, we find that incorporating knowledge of the species tree results in a two thirds reduction in the number of apparent transfer events. Availability and implementation: The algorithm is implemented in our program TERA, which is freely available from http://mbb.univ-montp2.fr/MBB/download_sources/16__TERA. Contact: celine.scornavacca@univ-montp2.fr, ssolo@angel.elte.hu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25380957
Kress, W J; Prince, L M; Hahn, W J; Zimmer, E A
2001-01-01
The Zingiberales are a tropical group of monocotyledons that includes bananas, gingers, and their relatives. The phylogenetic relationships among the eight families currently recognized are investigated here by using parsimony and maximum likelihood analyses of four character sets: (1) morphological features, and sequence data of the (2) chloroplast rbcL gene, (3) chloroplast atpB gene, and (4) nuclear 18S rDNA gene. Outgroups for the analyses include the closely related Commelinaceae + Philydraceae + Haemodoraceae + Pontederiaceae + Hanguanaceae as well as seven more distantly related monocots and paleoherbs. Only slightly different estimates of evolutionary relationships result from the analysis of each character set. The morphological data yield a single fully resolved most-parsimonious tree. None of the molecular datasets alone completely resolves interfamilial relationships. The analyses of the combined molecular dataset provide more resolution than do those of individual genes, and the addition of the morphological data provides a well-supported estimate of phylogenetic relationships: (Musaceae ((Strelitziaceae, Lowiaceae) (Heliconiaceae ((Zingiberaceae, Costaceae) (Cannaceae, Marantaceae))))). Evidence from branch lengths in the parsimony analyses and from the fossil record suggests that the Zingiberales originated in the Early Cretaceous and underwent a rapid radiation in the mid-Cretaceous, by which time most extant family lineages had diverged.
Molecular Phylogeny of the Bamboo Sharks (Chiloscyllium spp.)
Masstor, Noor Haslina; Samat, Abdullah; Nor, Shukor Md; Md-Zain, Badrul Munir
2014-01-01
Chiloscyllium, commonly called bamboo sharks, inhabit the waters of the Indo-West Pacific around East Asian countries such as Malaysia, Myanmar, Thailand, Singapore, and Indonesia. The International Union for Conservation of Nature (IUCN) Red List has categorized them as Near Threatened because their populations are declining due to overexploitation. A molecular study was carried out to portray the systematic relationships within Chiloscyllium species using 12S rRNA and cytochrome b gene sequences. Maximum parsimony and Bayesian inference were used to reconstruct phylogenetic trees. A total of 381 bp were successfully aligned in the 12S rRNA region, with 41 sites being parsimony-informative. In the cytochrome b region, a total of 1120 bp were aligned, with 352 parsimony-informative characters. All analyses yielded phylogenetic trees in which C. indicum is closely related to C. plagiosum. C. punctatum is the sister taxon to both C. indicum and C. plagiosum, while C. griseum and C. hasseltii formed their own clade as sister taxa. These Chiloscyllium classifications are supported by some morphological characters (lateral dermal ridges on the body, coloring patterns, and appearance of hypobranchials and basibranchial plate) that can clearly be used to differentiate each species. PMID:25013766
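The counts of parsimony-informative sites quoted above follow a standard definition: a column is informative when at least two character states each occur in at least two sequences. A minimal sketch of that count, assuming gaps and ambiguity codes are simply ignored; the function names and the toy alignment are hypothetical and are not taken from the study.

```python
from collections import Counter


def is_parsimony_informative(column, ignore="-?N"):
    """True if at least two states each occur in at least two sequences."""
    counts = Counter(c for c in column.upper() if c not in ignore)
    return sum(1 for n in counts.values() if n >= 2) >= 2


def count_informative_sites(alignment):
    """Count parsimony-informative columns in equal-length aligned sequences."""
    return sum(is_parsimony_informative("".join(col)) for col in zip(*alignment))


# Toy alignment (made-up sequences, not data from the study).
toy = ["ACGTA", "ACGTC", "ATGAC", "ATGAA"]
print(count_informative_sites(toy))  # columns 2, 4 and 5 qualify
```

Constant columns and columns where only one sequence differs contribute nothing to the comparison of tree topologies under parsimony, which is why only the informative subset is usually reported.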
Masters, J C; Anthony, N M; de Wit, M J; Mitchell, A
2005-08-01
Major aspects of lorisid phylogeny and systematics remain unresolved, despite several studies (involving morphology, histology, karyology, immunology, and DNA sequencing) aimed at elucidating them. Our study is the first to investigate the evolution of this enigmatic group using molecular and morphological data for all four well-established genera: Arctocebus, Loris, Nycticebus, and Perodicticus. Data sets consisting of 386 bp of 12S rRNA, 535 bp of 16S rRNA, and 36 craniodental characters were analyzed separately and in combination, using maximum parsimony and maximum likelihood. Outgroups, consisting of two galagid taxa (Otolemur and Galagoides) and a lemuroid (Microcebus), were also varied. The morphological data set yielded a paraphyletic lorisid clade with the robust Nycticebus and Perodicticus grouped as sister taxa, and the galagids allied with Arctocebus. All molecular analyses, whether maximum parsimony (MP) or maximum likelihood (ML), that included Microcebus as an outgroup rendered a paraphyletic lorisid clade, with one exception: the 12S + 16S data set analyzed with ML. The position of the galagids in these paraphyletic topologies was inconsistent, however, and bootstrap values were low. Exclusion of Microcebus generated a monophyletic Lorisidae with Asian and African subclades; bootstrap values for all three clades in the total evidence tree were over 90%. We estimated mean genetic distances for lemuroids vs. lorisoids, lorisids vs. galagids, and Asian vs. African lorisids as a guide to relative divergence times. We present information regarding a temporary land bridge that linked the two now widely separated regions inhabited by lorisids that may explain their distribution. Finally, we make taxonomic recommendations based on our results. (c) 2005 Wiley-Liss, Inc.
Learning Efficient Sparse and Low Rank Models.
Sprechmann, P; Bronstein, A M; Sapiro, G
2015-09-01
Parsimony, including sparsity and low rank, has been shown to successfully model data in numerous machine learning and signal processing tasks. Traditionally, such modeling approaches rely on an iterative algorithm that minimizes an objective function with parsimony-promoting terms. The inherently sequential structure and data-dependent complexity and latency of iterative optimization constitute a major limitation in many applications requiring real-time performance or involving large-scale data. Another limitation encountered by these modeling techniques is the difficulty of their inclusion in discriminative learning scenarios. In this work, we propose to move the emphasis from the model to the pursuit algorithm, and develop a process-centric view of parsimonious modeling, in which a learned deterministic fixed-complexity pursuit process is used in lieu of iterative optimization. We show a principled way to construct learnable pursuit process architectures for structured sparse and robust low rank models, derived from the iteration of proximal descent algorithms. These architectures learn to approximate the exact parsimonious representation at a fraction of the complexity of the standard optimization methods. We also show that appropriate training regimes allow to naturally extend parsimonious models to discriminative settings. State-of-the-art results are demonstrated on several challenging problems in image and audio processing with several orders of magnitude speed-up compared to the exact optimization algorithms.
Antle, John M.; Stoorvogel, Jetse J.; Valdivia, Roberto O.
2014-01-01
This article presents conceptual and empirical foundations for new parsimonious simulation models that are being used to assess future food and environmental security of farm populations. The conceptual framework integrates key features of the biophysical and economic processes on which the farming systems are based. The approach represents a methodological advance by coupling important behavioural processes, for example, self-selection in adaptive responses to technological and environmental change, with aggregate processes, such as changes in market supply and demand conditions or environmental conditions as climate. Suitable biophysical and economic data are a critical limiting factor in modelling these complex systems, particularly for the characterization of out-of-sample counterfactuals in ex ante analyses. Parsimonious, population-based simulation methods are described that exploit available observational, experimental, modelled and expert data. The analysis makes use of a new scenario design concept called representative agricultural pathways. A case study illustrates how these methods can be used to assess food and environmental security. The concluding section addresses generalizations of parametric forms and linkages of regional models to global models. PMID:24535388
Antle, John M; Stoorvogel, Jetse J; Valdivia, Roberto O
2014-04-05
This article presents conceptual and empirical foundations for new parsimonious simulation models that are being used to assess future food and environmental security of farm populations. The conceptual framework integrates key features of the biophysical and economic processes on which the farming systems are based. The approach represents a methodological advance by coupling important behavioural processes, for example, self-selection in adaptive responses to technological and environmental change, with aggregate processes, such as changes in market supply and demand conditions or environmental conditions as climate. Suitable biophysical and economic data are a critical limiting factor in modelling these complex systems, particularly for the characterization of out-of-sample counterfactuals in ex ante analyses. Parsimonious, population-based simulation methods are described that exploit available observational, experimental, modelled and expert data. The analysis makes use of a new scenario design concept called representative agricultural pathways. A case study illustrates how these methods can be used to assess food and environmental security. The concluding section addresses generalizations of parametric forms and linkages of regional models to global models.
Goloboff, Pablo A
2014-10-01
Three different types of data sets, for which the uniquely most parsimonious tree can be known exactly but is hard to find with heuristic tree search methods, are studied. Tree searches are complicated more by the shape of the tree landscape (i.e. the distribution of homoplasy on different trees) than by the sheer abundance of homoplasy or character conflict. Data sets of Type 1 are those constructed by Radel et al. (2013). Data sets of Type 2 present a very rugged landscape, with narrow peaks and valleys, but relatively low amounts of homoplasy. For such a tree landscape, subjecting the trees to TBR and saving suboptimal trees produces much better results when the sequence of clipping for the tree branches is randomized instead of fixed. An unexpected finding for data sets of Types 1 and 2 is that starting a search from a random tree instead of a random addition sequence Wagner tree may increase the probability that the search finds the most parsimonious tree; a small artificial example where these probabilities can be calculated exactly is presented. Data sets of Type 3, the most difficult data sets studied here, comprise only congruent characters, and a single island with only one most parsimonious tree. Even if there is a single island, missing entries create a very flat landscape which is difficult to traverse with tree search algorithms because the number of equally parsimonious trees that need to be saved and swapped to effectively move around the plateaus is too large. Minor modifications of the parameters of tree drifting, ratchet, and sectorial searches allow travelling around these plateaus much more efficiently than saving and swapping large numbers of equally parsimonious trees with TBR. For these data sets, two new related criteria for selecting taxon addition sequences in Wagner trees (the "selected" and "informative" addition sequences) produce much better results than the standard random or closest addition sequences. 
These new methods for Wagner trees and for moving around plateaus can be useful when analyzing phylogenomic data sets formed by concatenation of genes with uneven taxon representation ("sparse" supermatrices), which are likely to present a tree landscape with extensive plateaus. Copyright © 2014 Elsevier Inc. All rights reserved.
Parsimonious surface wave interferometry
NASA Astrophysics Data System (ADS)
Li, Jing; Hanafy, Sherif; Schuster, Gerard T.
2018-03-01
To decrease the recording time of a 2-D seismic survey from a few days to one hour or less, we present a parsimonious surface wave interferometry method. Interferometry allows for the creation of a large number of virtual shot gathers from just two reciprocal shot gathers by crosscoherence of trace pairs. Then, the virtual surface waves can be inverted for the S-wave velocity model by wave-equation dispersion inversion (WD). Synthetic and field data tests suggest that parsimonious WD (PWD) gives S-velocity tomograms that are comparable to those obtained from a conventional survey with a shot at each receiver. The limitation of PWD is that the virtual data lose some information so that the resolution of the S-velocity tomogram can be modestly lower than that of the S-velocity tomogram inverted from a conventional survey.
Applying a multiobjective metaheuristic inspired by honey bees to phylogenetic inference.
Santander-Jiménez, Sergio; Vega-Rodríguez, Miguel A
2013-10-01
The development of increasingly popular multiobjective metaheuristics has allowed bioinformaticians to deal with optimization problems in computational biology where multiple objective functions must be taken into account. One of the most relevant research topics that can benefit from these techniques is phylogenetic inference. Throughout the years, different researchers have proposed their own view about the reconstruction of ancestral evolutionary relationships among species. As a result, biologists often report different phylogenetic trees from a same dataset when considering distinct optimality principles. In this work, we detail a multiobjective swarm intelligence approach based on the novel Artificial Bee Colony algorithm for inferring phylogenies. The aim of this paper is to propose a complementary view of phylogenetics according to the maximum parsimony and maximum likelihood criteria, in order to generate a set of phylogenetic trees that represent a compromise between these principles. Experimental results on a variety of nucleotide data sets and statistical studies highlight the relevance of the proposal with regard to other multiobjective algorithms and state-of-the-art biological methods. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Hrbek, Tomas; Stölting, Kai N; Bardakci, Fevzi; Küçük, Fahrettin; Wildekamp, Rudolf H; Meyer, Axel
2004-07-01
We investigated the phylogenetic relationships of Pseudophoxinus (Cyprinidae: Leuciscinae) species from central Anatolia, Turkey to test the hypothesis of geographic speciation driven by early Pliocene orogenic events. We analyzed 1141 aligned base pairs of the complete cytochrome b mitochondrial gene. Phylogenetic relationships reconstructed by maximum likelihood, Bayesian likelihood, and maximum parsimony methods are identical, and generally well supported. Species and clades are restricted to geologically well-defined units, and are deeply divergent from each other. The basal diversification of central Anatolian Pseudophoxinus is estimated to have occurred approximately 15 million years ago. Our results are in agreement with a previous study of the Anatolian fish genus Aphanius that also shows a diversification pattern driven by the Pliocene orogenic events. The distribution of clades of Aphanius and Pseudophoxinus overlap, and areas of distribution comprise the same geological units. The geological history of Anatolia is likely to have had a major impact on the diversification history of many taxa occupying central Anatolia; many of these taxa are likely to be still unrecognized as distinct. Copyright 2004 Elsevier Inc.
Atibalentja, N; Noel, G R; Domier, L L
2000-03-01
A 1341 bp sequence of the 16S rDNA of an undescribed species of Pasteuria that parasitizes the soybean cyst nematode, Heterodera glycines, was determined and then compared with a homologous sequence of Pasteuria ramosa, a parasite of cladoceran water fleas of the family Daphnidae. The two Pasteuria sequences, which diverged from each other by a dissimilarity index of 7%, also were compared with the 16S rDNA sequences of 30 other bacterial species to determine the phylogenetic position of the genus Pasteuria among the Gram-positive eubacteria. Phylogenetic analyses using maximum-likelihood, maximum-parsimony and neighbour-joining methods showed that the Heterodera glycines-infecting Pasteuria and its sister species, P. ramosa, form a distinct line of descent within the Alicyclobacillus group of the Bacillaceae. These results are consistent with the view that the genus Pasteuria is a deeply rooted member of the Clostridium-Bacillus-Streptococcus branch of the Gram-positive eubacteria, neither related to the actinomycetes nor closely related to true endospore-forming bacteria.
Ali, Syeda Kauser; Baig, Lubna Ansari; Violato, Claudio; Zahid, Onaiza
2017-01-01
This study was conducted to adduce evidence of validity for admissions tests and processes and to identify a parsimonious model that predicts students' academic achievement in Medical College. A psychometric study was done on admission data and assessment scores for five years of medical studies at Aga Khan University Medical College, Pakistan, using confirmatory factor analysis (CFA) and structural equation modeling (SEM). The sample included 276 medical students admitted in 2003, 2004 and 2005. The SEM supported the existence of covariance between verbal reasoning, science and clinical knowledge for predicting achievement in medical school employing Maximum Likelihood (ML) estimations (n=112). Fit indices: χ²(21) = 59.70, p < .0001; CFI = .873; RMSEA = 0.129; SRMR = 0.093. This study shows that, in addition to biology and chemistry, which have traditionally been used as major criteria for admission to medical colleges in Pakistan, mathematics has proven to be a better predictor for higher achievements in medical college.
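The reported RMSEA can be checked from the chi-square statistic, its degrees of freedom and the estimation sample size. A minimal sketch, assuming the common point-estimate formula sqrt(max(chi2 - df, 0) / (df * (n - 1))) and assuming that n = 112 is the sample behind the fit indices; the abstract does not say which convention the software applied, and the function name is hypothetical.

```python
import math


def rmsea(chi2, df, n):
    """Point estimate of RMSEA using the (n - 1) convention (an assumption)."""
    return math.sqrt(max(chi2 - df, 0.0) / (df * (n - 1)))


# Values as reported in the abstract: chi-square 59.70 on 21 df, n = 112.
value = rmsea(chi2=59.70, df=21, n=112)
print(round(value, 3))
```

Under these assumptions the result rounds to the 0.129 quoted in the abstract, a value above the cut-offs usually taken to indicate close fit.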
Nalbantoglu, Sinem; Abu-Asab, Mones; Tan, Ming; Zhang, Xuemin; Cai, Ling; Amri, Hakima
2016-07-01
Pancreatic ductal adenocarcinoma (PDAC) is one of the rapidly growing forms of pancreatic cancer with a poor prognosis and less than 5% 5-year survival rate. In this study, we characterized the genetic signatures and signaling pathways related to survival from PDAC, using a parsimony phylogenetic algorithm. We applied the parsimony phylogenetic algorithm to analyze the publicly available whole-genome in silico array analysis of a gene expression data set in 25 early-stage human PDAC specimens. We explain here that the parsimony phylogenetics is an evolutionary analytical method that offers important promise to uncover clonal (driver) and nonclonal (passenger) aberrations in complex diseases. In our analysis, parsimony and statistical analyses did not identify significant correlations between survival times and gene expression values. Thus, the survival rankings did not appear to be significantly different between patients for any specific gene (p > 0.05). Also, we did not find correlation between gene expression data and tumor stage in the present data set. While the present analysis was unable to identify in this relatively small sample of patients a molecular signature associated with pancreatic cancer prognosis, we suggest that future research and analyses with the parsimony phylogenetic algorithm in larger patient samples are worthwhile, given the devastating nature of pancreatic cancer and its early diagnosis, and the need for novel data analytic approaches. The future research practices might want to place greater emphasis on phylogenetics as one of the analytical paradigms, as our findings presented here are on the cusp of this shift, especially in the current era of Big Data and innovation policies advocating for greater data sharing and reanalysis.
Artificial neural network model for ozone concentration estimation and Monte Carlo analysis
NASA Astrophysics Data System (ADS)
Gao, Meng; Yin, Liting; Ning, Jicai
2018-07-01
Air pollution in the urban atmosphere directly affects public health; therefore, it is essential to predict air pollutant concentrations. Air quality is a complex function of emissions, meteorology and topography, and artificial neural networks (ANNs) provide a sound framework for relating these variables. In this study, we investigated the feasibility of using an ANN model with meteorological parameters as input variables to predict ozone concentration in the urban area of Jinan, a metropolis in Northern China. We first found that the architecture of the network of neurons had little effect on the predictive capability of the ANN model. A parsimonious ANN model with 6 routinely monitored meteorological parameters and one temporal covariate (the category of day, i.e. working day, legal holiday and regular weekend) as input variables was identified, where the 7 input variables were selected following the forward selection procedure. Compared with the benchmark ANN model with 9 meteorological and photochemical parameters as input variables, the predictive capability of the parsimonious ANN model was acceptable. Its predictive capability was also verified in terms of the warning success ratio during the pollution episodes. Finally, uncertainty and sensitivity analyses were also performed based on Monte Carlo simulations (MCS). It was concluded that the ANN could properly predict the ambient ozone level. Maximum temperature, atmospheric pressure, sunshine duration and maximum wind speed were identified as the predominant input variables significantly influencing the prediction of ambient ozone concentrations.
Chen, Jing; Jiang, Li-Yun; Qiao, Ge-Xia
2011-01-01
The taxonomic position of Hormaphis similibetulae Qiao & Zhang, 2004 has been reexamined. The phylogenetic position of Hormaphis similibetulae was inferred by maximum parsimony, maximum likelihood and Bayesian analyses on the basis of partial nuclear elongation factor-1α and mitochondrial tRNA leucine/cytochrome oxidase II sequences. The results showed that this species fell into the clade of Hamamelistes species, occupying a basal position, and was clearly distinct from other Hormaphis species. A closer relationship between Hormaphis similibetulae and Hamamelistes species was also revealed by life cycle analysis. Therefore, we conclude that Hormaphis similibetulae should be transferred to the genus Hamamelistes as Hamamelistes similibetulae (Qiao & Zhang), comb. n. PMID:21852935
NASA Astrophysics Data System (ADS)
Relan, Rishi; Tiels, Koen; Marconato, Anna; Dreesen, Philippe; Schoukens, Johan
2018-05-01
Many real world systems exhibit a quasi linear or weakly nonlinear behavior during normal operation, and a hard saturation effect for high peaks of the input signal. In this paper, a methodology to identify a parsimonious discrete-time nonlinear state space model (NLSS) for the nonlinear dynamical system with relatively short data record is proposed. The capability of the NLSS model structure is demonstrated by introducing two different initialisation schemes, one of them using multivariate polynomials. In addition, a method using first-order information of the multivariate polynomials and tensor decomposition is employed to obtain the parsimonious decoupled representation of the set of multivariate real polynomials estimated during the identification of NLSS model. Finally, the experimental verification of the model structure is done on the cascaded water-benchmark identification problem.
NASA Technical Reports Server (NTRS)
Stolzer, Alan J.; Halford, Carl
2007-01-01
In a previous study, multiple regression techniques were applied to Flight Operations Quality Assurance-derived data to develop parsimonious model(s) for fuel consumption on the Boeing 757 airplane. The present study examined several data mining algorithms, including neural networks, on the fuel consumption problem and compared them to the multiple regression results obtained earlier. Using regression methods, parsimonious models were obtained that explained approximately 85% of the variation in fuel flow. In general data mining methods were more effective in predicting fuel consumption. Classification and Regression Tree methods reported correlation coefficients of .91 to .92, and General Linear Models and Multilayer Perceptron neural networks reported correlation coefficients of about .99. These data mining models show great promise for use in further examining large FOQA databases for operational and safety improvements.
Guindon, Stéphane; Dufayard, Jean-François; Lefort, Vincent; Anisimova, Maria; Hordijk, Wim; Gascuel, Olivier
2010-05-01
PhyML is a phylogeny software based on the maximum-likelihood principle. Early PhyML versions used a fast algorithm performing nearest neighbor interchanges to improve a reasonable starting tree topology. Since the original publication (Guindon S., Gascuel O. 2003. A simple, fast and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst. Biol. 52:696-704), PhyML has been widely used (>2500 citations in ISI Web of Science) because of its simplicity and a fair compromise between accuracy and speed. In the meantime, research around PhyML has continued, and this article describes the new algorithms and methods implemented in the program. First, we introduce a new algorithm to search the tree space with user-defined intensity using subtree pruning and regrafting topological moves. The parsimony criterion is used here to filter out the least promising topology modifications with respect to the likelihood function. The analysis of a large collection of real nucleotide and amino acid data sets of various sizes demonstrates the good performance of this method. Second, we describe a new test to assess the support of the data for internal branches of a phylogeny. This approach extends the recently proposed approximate likelihood-ratio test and relies on a nonparametric, Shimodaira-Hasegawa-like procedure. A detailed analysis of real alignments sheds light on the links between this new approach and the more classical nonparametric bootstrap method. Overall, our tests show that the last version (3.0) of PhyML is fast, accurate, stable, and ready to use. A Web server and binary files are available from http://www.atgc-montpellier.fr/phyml/.
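A parsimony score is cheap to compute, which is what makes it usable as a pre-filter for candidate topologies before the more expensive likelihood evaluation. The sketch below is not PhyML's code: it is Fitch's classical algorithm for a rooted binary tree, with a hypothetical nested-tuple tree encoding and a made-up alignment, shown only to illustrate how two candidate topologies can be ranked by parsimony length.

```python
def fitch_score(tree, states):
    """Minimum number of state changes for one character (Fitch's algorithm).

    tree   -- nested 2-tuples with leaf names (str) at the tips
    states -- dict mapping leaf name -> observed state
    """
    def visit(node):
        if isinstance(node, str):
            return {states[node]}, 0
        left_set, left_cost = visit(node[0])
        right_set, right_cost = visit(node[1])
        common = left_set & right_set
        if common:  # children agree on some state: no extra change
            return common, left_cost + right_cost
        return left_set | right_set, left_cost + right_cost + 1

    return visit(tree)[1]


def parsimony_length(tree, alignment):
    """Sum the Fitch score over every column of the alignment."""
    n_sites = len(next(iter(alignment.values())))
    return sum(
        fitch_score(tree, {name: seq[i] for name, seq in alignment.items()})
        for i in range(n_sites)
    )


# Made-up data: four taxa, three sites, two candidate topologies.
aln = {"a": "AAG", "b": "AAG", "c": "CAT", "d": "CGT"}
tree1 = (("a", "b"), ("c", "d"))
tree2 = (("a", "c"), ("b", "d"))
print(parsimony_length(tree1, aln), parsimony_length(tree2, aln))
```

In a search of the kind the abstract describes, a rearrangement whose parsimony length is much worse than the current tree's could be discarded without evaluating its likelihood; the threshold for "much worse" is a tuning choice not specified here.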
Geller, J B; Walton, E D
2001-09-01
Clonal growth and symbiosis with photosynthetic zooxanthellae typify many genera of marine organisms, suggesting that these traits are usually conserved. However, some, such as Anthopleura, a genus of sea anemones, contain members lacking one or both of these traits. The evolutionary origins of these traits in 13 species of Anthopleura were inferred from a molecular phylogeny derived from 395 bp of the mitochondrial 16S rRNA gene and 410 bp of the mitochondrial cytochrome oxidase subunit III gene. Sequences from these genes were combined and analyzed by maximum-parsimony, maximum-likelihood, and neighbor-joining methods. Best trees from each method indicated a minimum of four changes in growth mode and that symbiosis with zooxanthellae has arisen independently in eastern and western Pacific species. Alternative trees in which species sharing growth modes or the symbiotic condition were constrained to be monophyletic were significantly worse than best trees. Although clade composition was mostly consistent with geographic sympatry, A. artemisia from California was included in the western Pacific clade. Likewise, A. midori from Japan was not placed in a clade containing only other Asian congeners. The history of Anthopleura includes repeated shifts between clonality and solitariness, repeated attainment of symbiosis with zooxanthellae, and intercontinental dispersal.
Efficiency of nuclear and mitochondrial markers recovering and supporting known amniote groups.
Lambret-Frotté, Julia; Perini, Fernando Araújo; de Moraes Russo, Claudia Augusta
2012-01-01
We have analysed the efficiency of all mitochondrial protein coding genes and six nuclear markers (Adora3, Adrb2, Bdnf, Irbp, Rag2 and Vwf) in reconstructing and statistically supporting known amniote groups (murines, rodents, primates, eutherians, metatherians, therians). The efficiencies of maximum likelihood, Bayesian inference, maximum parsimony, neighbor-joining and UPGMA were also evaluated, by assessing the number of correct and incorrect recovered groupings. In addition, we have compared support values using the conservative bootstrap test and the Bayesian posterior probabilities. First, no correlation was observed between gene size and marker efficiency in recovering or supporting correct nodes. As expected, tree-building methods performed similarly, even UPGMA that, in some cases, outperformed other most extensively used methods. Bayesian posterior probabilities tend to show much higher support values than the conservative bootstrap test, for correct and incorrect nodes. Our results also suggest that nuclear markers do not necessarily show a better performance than mitochondrial genes. The so-called dependency among mitochondrial markers was not observed comparing genome performances. Finally, the amniote groups with lowest recovery rates were therians and rodents, despite the morphological support for their monophyletic status. We suggest that, regardless of the tree-building method, a few carefully selected genes are able to unfold a detailed and robust scenario of phylogenetic hypotheses, particularly if taxon sampling is increased.
Seifali, Mahvash; Arshad, Aziz; Moghaddam, Faezeh Yazdani; Esmaeili, Hamid Reza; Kiabi, Bahram H.; Daud, Siti Khalijah; Aliabadian, Mansour
2012-01-01
Background: Knowledge about Alburnoides remains lacking relative to many other species, resulting in a lack of a systematic position and taxonomic diagnosis. Basic biological information for Alburnoides has been constructed, and it is necessary to understand further and obtain more information about this species. Its phylogenetic relationships are still debated and no molecular data have been used to study this taxon in Iran. A holistic approach for genetic methods was adopted to analyze possible spirlin population differences at selected centers in the south Caspian Sea basin of Iran. Methods: The phylogenetic relationships were determined based on 774 base pairs of the mitochondrial cytochrome b gene of 32 specimens of spirlin from nine locations in the south Caspian Sea drainage basin of Iran. The nucleotide sequences were subjected to phylogenetic analysis using the neighbor-joining, maximum parsimony, maximum likelihood, and Bayesian methods. Results: The mitochondrial gene tree largely supports the existence of three major clades. The western populations (clade I) may be considered as Alburnoides eichwaldii, whereas the Talar river populations (clade II) are represented as Alburnoides sp.1 and the eastern populations (clade III) may be distinct taxa of Alburnoides sp.2. Conclusion: This molecular evidence supports the hypothesis that A. bipunctatus does not exist in the south Caspian Sea basin of Iran, and that the western and eastern populations are distinct taxa. PMID:22654487
Csuros, Miklos; Rogozin, Igor B.; Koonin, Eugene V.
2011-01-01
Protein-coding genes in eukaryotes are interrupted by introns, but intron densities widely differ between eukaryotic lineages. Vertebrates, some invertebrates and green plants have intron-rich genes, with 6–7 introns per kilobase of coding sequence, whereas most of the other eukaryotes have intron-poor genes. We reconstructed the history of intron gain and loss using a probabilistic Markov model (Markov Chain Monte Carlo, MCMC) on 245 orthologous genes from 99 genomes representing the three of the five supergroups of eukaryotes for which multiple genome sequences are available. Intron-rich ancestors are confidently reconstructed for each major group, with 53 to 74% of the human intron density inferred with 95% confidence for the Last Eukaryotic Common Ancestor (LECA). The results of the MCMC reconstruction are compared with the reconstructions obtained using Maximum Likelihood (ML) and Dollo parsimony methods. An excellent agreement between the MCMC and ML inferences is demonstrated whereas Dollo parsimony introduces a noticeable bias in the estimations, typically yielding lower ancestral intron densities than MCMC and ML. Evolution of eukaryotic genes was dominated by intron loss, with substantial gain only at the bases of several major branches including plants and animals. The highest intron density, 120 to 130% of the human value, is inferred for the last common ancestor of animals. The reconstruction shows that the entire line of descent from LECA to mammals was intron-rich, a state conducive to the evolution of alternative splicing. PMID:21935348
Ma, Junying; Wang, Hu; Lin, Gonghua; Craig, Philip S; Ito, Akira; Cai, Zhenyuan; Zhang, Tongzuo; Han, Xiumin; Ma, Xiao; Zhang, Jingxiao; Liu, Yufang; Zhao, Yanmei; Wang, Yongshun
2012-07-01
The Qinghai-Tibetan Plateau (QTP, in western China), which is the largest and highest plateau on Earth, is a highly epidemic region for Echinococcus spp. We collected 70 Echinococcus samples from humans, dogs, sheep, yaks, plateau pikas, and voles in eastern and southern Qinghai and genotyped them using the mitochondrial DNA marker cytochrome oxidase subunit I gene and maximum parsimony and Bayesian reconstruction methods. Based on the 792-bp sequence matrix, we recorded 124 variable sites, of which, 115 were parsimony-informative. Thirty-four haplotypes (H1-H34) were detected, of which H1-H15, H16-H17, and H18-H34 belonged to Echinococcus shiquicus, Echinococcus multilocularis, and Echinococcus granulosus, respectively. Within 26 human isolates, three were identified as E. multilocularis and 23 were E. granulosus. We also detected a dual infection case in a dog with E. multilocularis and E. granulosus. The intraspecific haplotype (Hd ± SD) and nucleotide (Nd ± SD) diversity of E. shiquicus (0.947 ± 0.021; 0.00441 ± 0.00062) was higher than that for E. granulosus (0.896 ± 0.038; 0.00221 ± 0.00031) and E. multilocularis (0.286 ± 0.196; 0.00036 ± 0.00025). Moreover, the haplotype network of E. shiquicus showed a radial feature rather than a divergent feature in a previous study, indicating this species in the QTP has also evolved with bottleneck effects.
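The haplotype diversity values (Hd) quoted above are typically computed with Nei's estimator, Hd = n / (n - 1) * (1 - sum of squared haplotype frequencies). A minimal sketch under that assumption; the abstract does not name the estimator or software, and the sample below is invented for illustration.

```python
from collections import Counter


def haplotype_diversity(haplotypes):
    """Nei's haplotype diversity for a list of haplotype labels."""
    n = len(haplotypes)
    if n < 2:
        raise ValueError("need at least two sampled sequences")
    freq_sq = sum((count / n) ** 2 for count in Counter(haplotypes).values())
    return n / (n - 1) * (1.0 - freq_sq)


# Invented sample: four isolates carrying three distinct haplotypes.
sample = ["H1", "H1", "H2", "H3"]
print(round(haplotype_diversity(sample), 3))
```

The statistic is the probability that two sequences drawn at random from the sample differ, so it is 0 when every isolate shares one haplotype and 1 when every isolate is unique.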
A LEAST ABSOLUTE SHRINKAGE AND SELECTION OPERATOR (LASSO) FOR NONLINEAR SYSTEM IDENTIFICATION
NASA Technical Reports Server (NTRS)
Kukreja, Sunil L.; Lofberg, Johan; Brenner, Martin J.
2006-01-01
Identification of parametric nonlinear models involves estimating unknown parameters and detecting the underlying model structure. Structure computation is concerned with selecting a subset of parameters to give a parsimonious description of the system, which may afford greater insight into the functionality of the system or a simpler controller design. In this study, a least absolute shrinkage and selection operator (LASSO) technique is investigated for computing efficient model descriptions of nonlinear systems. The LASSO minimises the residual sum of squares by the addition of an ℓ1 penalty term on the parameter vector of the traditional ℓ2 minimisation problem. Its use for structure detection is a natural extension of this constrained minimisation approach to pseudolinear regression problems which produces some model parameters that are exactly zero and, therefore, yields a parsimonious system description. The performance of this LASSO structure detection method was evaluated by using it to estimate the structure of a nonlinear polynomial model. Applicability of the method to more complex systems such as those encountered in aerospace applications was shown by identifying a parsimonious system description of the F/A-18 Active Aeroelastic Wing using flight test data.
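The idea of structure detection by an ℓ1 penalty can be sketched as follows. The abstract does not say which solver was used, so cyclic coordinate descent with soft-thresholding is an assumption made here for illustration, as are the objective scaling (0.5 * RSS + lam * ||theta||_1), the function names, and the toy polynomial system.

```python
def soft_threshold(rho, lam):
    """Shrink rho towards zero by lam; values within [-lam, lam] become exactly 0."""
    if rho > lam:
        return rho - lam
    if rho < -lam:
        return rho + lam
    return 0.0

def lasso_cd(X, y, lam, sweeps=200):
    """Minimise 0.5 * RSS + lam * ||theta||_1 by cyclic coordinate descent."""
    n, p = len(X), len(X[0])
    theta = [0.0] * p
    for _ in range(sweeps):
        for j in range(p):
            rho = z = 0.0
            for i in range(n):
                # Prediction from all regressors except column j.
                partial = sum(X[i][k] * theta[k] for k in range(p) if k != j)
                rho += X[i][j] * (y[i] - partial)
                z += X[i][j] ** 2
            theta[j] = soft_threshold(rho, lam) / z
    return theta

# Hypothetical polynomial model: candidate regressors u, u^2, u^3,
# with data generated from y = 1.5*u - 0.5*u^3 (the u^2 term is absent).
u = [-2.0, -1.5, -1.0, -0.5, 0.0, 0.5, 1.0, 1.5, 2.0]
X = [[v, v ** 2, v ** 3] for v in u]
y = [1.5 * v - 0.5 * v ** 3 for v in u]
theta = lasso_cd(X, y, lam=0.1)
```

On this toy problem the coefficient of the spurious u^2 regressor is driven to exactly zero, while the two true coefficients are retained with a small shrinkage bias, which is the behaviour the abstract describes as yielding a parsimonious model.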
An experimental phylogeny to benchmark ancestral sequence reconstruction
Randall, Ryan N.; Radford, Caelan E.; Roof, Kelsey A.; Natarajan, Divya K.; Gaucher, Eric A.
2016-01-01
Ancestral sequence reconstruction (ASR) is a still-burgeoning method that has revealed many key mechanisms of molecular evolution. One criticism of the approach is an inability to validate its algorithms within a biological context as opposed to a computer simulation. Here we build an experimental phylogeny using the gene of a single red fluorescent protein to address this criticism. The evolved phylogeny consists of 19 operational taxonomic units (leaves) and 17 ancestral bifurcations (nodes) that display a wide variety of fluorescent phenotypes. The 19 leaves then serve as ‘modern’ sequences that we subject to ASR analyses using various algorithms and to benchmark against the known ancestral genotypes and ancestral phenotypes. We confirm computer simulations that show all algorithms infer ancient sequences with high accuracy, yet we also reveal wide variation in the phenotypes encoded by incorrectly inferred sequences. Specifically, Bayesian methods incorporating rate variation significantly outperform the maximum parsimony criterion in phenotypic accuracy. Subsampling of extant sequences had a minor effect on the inference of ancestral sequences. PMID:27628687
Schuster, Tanja M.; Setaro, Sabrina D.; Tibbits, Josquin F. G.; Batty, Erin L.; Fowler, Rachael M.; McLay, Todd G. B.; Wilcox, Stephen; Ades, Peter K.
2018-01-01
Previous molecular phylogenetic analyses have resolved the Australian bloodwood eucalypt genus Corymbia (~100 species) as either monophyletic or paraphyletic with respect to Angophora (9–10 species). Here we assess relationships of Corymbia and Angophora using a large dataset of chloroplast DNA sequences (121,016 base pairs; from 90 accessions representing 55 Corymbia and 8 Angophora species, plus 33 accessions of related genera), skimmed from high throughput sequencing of genomic DNA, and compare results with new analyses of nuclear ITS sequences (119 accessions) from previous studies. Maximum likelihood and maximum parsimony analyses of cpDNA resolve well supported trees with most nodes having >95% bootstrap support. These trees strongly reject monophyly of Corymbia, its two subgenera (Corymbia and Blakella), most taxonomic sections (Abbreviatae, Maculatae, Naviculares, Septentrionales), and several species. ITS trees weakly indicate paraphyly of Corymbia (bootstrap support <50% for maximum likelihood, and 71% for parsimony), but are highly incongruent with the cpDNA analyses, in that they support monophyly of both subgenera and some taxonomic sections of Corymbia. The striking incongruence between cpDNA trees and both morphological taxonomy and ITS trees is attributed largely to chloroplast introgression between taxa, because of geographic sharing of chloroplast clades across taxonomic groups. Such introgression has been widely inferred in studies of the related genus Eucalyptus. This is the first report of its likely prevalence in Corymbia and Angophora, but this is consistent with previous morphological inferences of hybridisation between species. 
Our findings (based on continent-wide sampling) highlight a need for more focussed studies to assess the extent of hybridisation and introgression in the evolutionary history of these genera, and show that critical testing of the classification of Corymbia and Angophora requires additional sequence data from nuclear genomes. PMID:29668710
DiMeglio, Laura M.; Yu, Hongrun; Davis, Thomas M.
2014-01-01
The genus Fragaria encompasses species at ploidy levels ranging from diploid to decaploid. The cultivated strawberry, Fragaria×ananassa, and its two immediate progenitors, F. chiloensis and F. virginiana, are octoploids. To elucidate the ancestries of these octoploid species, we performed a phylogenetic analysis using intron-containing sequences of the nuclear ADH-1 gene from 39 germplasm accessions representing nineteen Fragaria species and one outgroup species, Dasiphora fruticosa. All trees from Maximum Parsimony and Maximum Likelihood analyses showed two major clades, Clade A and Clade B. Each of the sampled octoploids contributed alleles to both major clades. All octoploid-derived alleles in Clade A clustered with alleles of diploid F. vesca, with the exception of one octoploid allele that clustered with the alleles of diploid F. mandshurica. All octoploid-derived alleles in clade B clustered with the alleles of only one diploid species, F. iinumae. When gaps encoded as binary characters were included in the Maximum Parsimony analysis, tree resolution was improved with the addition of six nodes, and the bootstrap support was generally higher, rising above the 50% threshold for an additional nine branches. These results, coupled with the congruence of the sequence data and the coded gap data, validate and encourage the employment of sequence sets containing gaps for phylogenetic analysis. Our phylogenetic conclusions, based upon sequence data from the ADH-1 gene located on F. vesca linkage group II, complement and generally agree with those obtained from analyses of protein-encoding genes GBSSI-2 and DHAR located on F. vesca linkage groups V and VII, respectively, but differ from a previous study that utilized rDNA sequences and did not detect the ancestral role of F. iinumae. PMID:25078607
Cutcliffe, John R; Harder, Henry G
2009-10-01
While it appears that the term parsimony has been used in the context of qualitative research and qualitative research methodology, there is a distinct absence of writing that actually explores, seeks to define, understand, critique, apply and/or evaluate the concept in qualitative research literature. This paper explores a number of issues pertaining to parsimony in qualitative research. It is the hope of the authors that this paper might raise awareness of the hitherto unexplored issues, stimulate some further interest in these and prompt other qualitative researchers to contribute to the ensuing debate. While there are currently no definitive criteria for determining the parsimony of qualitative research findings, it would be epistemologically inappropriate and philosophically incongruent to import and translate quantitative notions of parsimony. However, the ideas, principles and epistemological functions that parsimony serves can and should be applied to the qualitative paradigm. The authors suggest that more than one type of qualitative parsimony is required. The authors advance the argument that there is a relationship between the degree of parsimony and the elegance, ease of accessibility and straightforwardness (some might say - beauty) of the writing/findings; the level of expertise of the researcher; and the quality of the data collection interview. The authors also assert that there are a number of practices which, when adhered to, can enhance the parsimony of the findings and that there are a number of major implications arising from qualitative findings that lack parsimony.
Concepts of Classification and Taxonomy Phylogenetic Classification
NASA Astrophysics Data System (ADS)
Fraix-Burnet, D.
2016-05-01
Phylogenetic approaches to classification have been heavily developed in biology by bioinformaticians. But these techniques have applications in other fields, in particular in linguistics. Their main characteristic is to search for relationships between the objects or species under study, instead of grouping them by similarity. They are thus rather well suited for any kind of evolutionary objects. For nearly fifteen years, astrocladistics has explored the use of Maximum Parsimony (or cladistics) for astronomical objects like galaxies or globular clusters. In this lesson we will learn how it works.
2006-09-27
Maximum parsimony; Sibling species; Species complex; Myxomatosis; DNA barcoding; Australia; Papua New Guinea; ITS2; COI; COII; EF-11. Introduction... myxomatosis to control rabbits (Fenner and Ratcliffe, 1965). Chris Green used data from cross-matings and the banding pattern of polytene chromosomes to... myxomatosis based on distribution but more sampling is required to confirm this. Many of the sampling locations in this study and the allozyme study of
NAVARRO, F. B.; SUÁREZ-SANTIAGO, V. N.; BLANCA, G.
2004-01-01
• Background and Aims The discovery of a new species, Haplophyllum bastetanum F.B. Navarro, V.N. Suárez-Santiago & Blanca sp. nov., in the south-east of Spain has prompted the comparative study of species of the Iberian Peninsula, and others related, through morphological, cytogenetic, molecular, distributional and ecological characterization. • Methods The morphological study involved a quantitative analysis of the species present in the Iberian Peninsula and a comparative analysis of the morphological characteristics between H. bastetanum and other related species. Mitotic analyses were made with root meristems taken from germinating seeds. Phylogenetic analyses of the internal transcribed spacer sequences of nuclear ribosomal DNA were performed using neighbour-joining (NJ) and maximum-parsimony methods. • Key Results Haplophyllum bastetanum is a diploid species (2n = 18) distinguished primarily for its non-trifoliate glabrous leaves, lanceolate sepals, dark-green petals with a dorsal band of hairs, and a highly hairy ovary with round-apex locules. The other two Iberian species (H. linifolium and H. rosmarinifolium) are tetraploid (2n = 36) and have yellow petals. Both phylogenetic methods generated a well-supported clade grouping H. linifolium with H. rosmarinifolium. In the NJ tree, the H. linifolium–H. rosmarinifolium clade is a sister group to H. bastetanum, while in the parsimony analysis this occurred only when the gaps were coded as a fifth base and the characters were reweighted according to the rescaled consistency index. This latter group is supported by the sequence divergence among taxa. • Conclusions The phylogenies established from DNA sequences together with morphological and cytogenetic analyses support the separation of H. bastetanum as a new species. The results suggest that the change in the number of chromosomes may be the key mechanism of speciation of the genus Haplophyllum in the Iberian Peninsula. 
An evolutionary scheme for them is propounded. PMID:15306560
A supermatrix analysis of genomic, morphological, and paleontological data from crown Cetacea
2011-01-01
Background Cetacea (dolphins, porpoises, and whales) is a clade of aquatic species that includes the most massive, deepest diving, and largest brained mammals. Understanding the temporal pattern of diversification in the group as well as the evolution of cetacean anatomy and behavior requires a robust and well-resolved phylogenetic hypothesis. Although a large body of molecular data has accumulated over the past 20 years, DNA sequences of cetaceans have not been directly integrated with the rich, cetacean fossil record to reconcile discrepancies among molecular and morphological characters. Results We combined new nuclear DNA sequences, including segments of six genes (~2800 basepairs) from the functionally extinct Yangtze River dolphin, with an expanded morphological matrix and published genomic data. Diverse analyses of these data resolved the relationships of 74 taxa that represent all extant families and 11 extinct families of Cetacea. The resulting supermatrix (61,155 characters) and its sub-partitions were analyzed using parsimony methods. Bayesian and maximum likelihood (ML) searches were conducted on the molecular partition, and a molecular scaffold obtained from these searches was used to constrain a parsimony search of the morphological partition. Based on analysis of the supermatrix and model-based analyses of the molecular partition, we found overwhelming support for 15 extant clades. When extinct taxa are included, we recovered trees that are significantly correlated with the fossil record. These trees were used to reconstruct the timing of cetacean diversification and the evolution of characters shared by "river dolphins," a non-monophyletic set of species according to all of our phylogenetic analyses. Conclusions The parsimony analysis of the supermatrix and the analysis of morphology constrained to fit the ML/Bayesian molecular tree yielded broadly congruent phylogenetic hypotheses. 
In trees from both analyses, all Oligocene taxa included in our study fell outside crown Mysticeti and crown Odontoceti, suggesting that these two clades radiated in the late Oligocene or later, contra some recent molecular clock studies. Our trees also imply that many character states shared by river dolphins evolved in their oceanic ancestors, contradicting the hypothesis that these characters are convergent adaptations to fluvial habitats. PMID:21518443
A supermatrix analysis of genomic, morphological, and paleontological data from crown Cetacea.
Geisler, Jonathan H; McGowen, Michael R; Yang, Guang; Gatesy, John
2011-04-25
Cetacea (dolphins, porpoises, and whales) is a clade of aquatic species that includes the most massive, deepest diving, and largest brained mammals. Understanding the temporal pattern of diversification in the group as well as the evolution of cetacean anatomy and behavior requires a robust and well-resolved phylogenetic hypothesis. Although a large body of molecular data has accumulated over the past 20 years, DNA sequences of cetaceans have not been directly integrated with the rich, cetacean fossil record to reconcile discrepancies among molecular and morphological characters. We combined new nuclear DNA sequences, including segments of six genes (~2800 basepairs) from the functionally extinct Yangtze River dolphin, with an expanded morphological matrix and published genomic data. Diverse analyses of these data resolved the relationships of 74 taxa that represent all extant families and 11 extinct families of Cetacea. The resulting supermatrix (61,155 characters) and its sub-partitions were analyzed using parsimony methods. Bayesian and maximum likelihood (ML) searches were conducted on the molecular partition, and a molecular scaffold obtained from these searches was used to constrain a parsimony search of the morphological partition. Based on analysis of the supermatrix and model-based analyses of the molecular partition, we found overwhelming support for 15 extant clades. When extinct taxa are included, we recovered trees that are significantly correlated with the fossil record. These trees were used to reconstruct the timing of cetacean diversification and the evolution of characters shared by "river dolphins," a non-monophyletic set of species according to all of our phylogenetic analyses. The parsimony analysis of the supermatrix and the analysis of morphology constrained to fit the ML/Bayesian molecular tree yielded broadly congruent phylogenetic hypotheses. 
In trees from both analyses, all Oligocene taxa included in our study fell outside crown Mysticeti and crown Odontoceti, suggesting that these two clades radiated in the late Oligocene or later, contra some recent molecular clock studies. Our trees also imply that many character states shared by river dolphins evolved in their oceanic ancestors, contradicting the hypothesis that these characters are convergent adaptations to fluvial habitats.
Liu, Tianyu; Liang, Yinan; Zhong, Xiuqin; Wang, Ning; Hu, Dandan; Zhou, Xuan; Gu, Xiaobin; Peng, Xuerong; Yang, Guangyou
2014-01-01
Dirofilaria immitis (heartworm) is the causative agent of an important zoonotic disease that is spread by mosquitoes. In this study, molecular and phylogenetic characterization of D. immitis were performed based on complete ND1 and 16S rDNA gene sequences, which provided the foundation for more advanced molecular diagnosis, prevention, and control of heartworm diseases. The mutation rate and evolutionary divergence in adult heartworm samples from seven dogs in western China were analyzed to obtain information on genetic diversity and variability. Phylogenetic relationships were inferred using both maximum parsimony (MP) and Bayes methods based on the complete gene sequences. The results suggest that D. immitis formed an independent monophyletic group in which the 16S rDNA gene has mutated more rapidly than has ND1. PMID:24639299
Chen, Weicai; Zhang, Wei; Zhou, Shichu; Li, Ning; Huang, Yong; Mo, Yunming
2013-01-01
Leptobrachium guangxiense Fei, Mo, Ye and Jiang, 2009 (Anura: Megophryidae), is presently thought to be endemic to Shangsi, Guangxi Province, China. A molecular phylogenetic analysis was performed, together with an examination of morphological data, to gain insight into the phylogenetic position of this species. Maximum parsimony, maximum likelihood, and Bayesian inference methods were employed to reconstruct phylogenetic relationships, using 1914 bp of sequences from the mtDNA genes 12S rRNA, tRNAVal and 16S rRNA. Topologies revealed that L. guangxiense and the Tam Dao (Vietnam) L. chapaense lineage (3A) formed a monophyletic group with well-supported values. The uncorrected p-distance of the ~1.4k bp 16S rRNA data-sets between the Tam Dao L. chapaense lineage (3A) and L. guangxiense is only 0.1%. Morphologically, L. guangxiense and the Tam Dao L. chapaense lineage (3A) shared the same characters, and are distinguishable from "true" L. chapaense from the type locality in Sa Pa, Vietnam. Based on morphological characters and mitochondrial DNA, we suggest that the Tam Dao lineages of L. chapaense are conspecific with L. guangxiense. This represents a range extension for L. guangxiense, and a new country record for Vietnam.
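The uncorrected p-distance cited in the abstract above is simply the proportion of aligned sites at which two sequences differ. A minimal sketch follows, assuming that sites with a gap in either sequence are skipped (pairwise deletion); the abstract does not state how gaps were handled, and the function name and example sequences are hypothetical.

```python
def p_distance(seq_a, seq_b, gap="-"):
    """Proportion of differing sites between two aligned sequences, ignoring gapped sites."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = differing = 0
    for a, b in zip(seq_a, seq_b):
        if a == gap or b == gap:
            continue  # pairwise deletion: skip sites with a gap in either sequence
        compared += 1
        if a != b:
            differing += 1
    return differing / compared

# Hypothetical aligned fragments differing at one of ten sites.
d = p_distance("ACGTACGTAC", "ACGTACGTAT")
```

On this scale, the 0.1% reported for ~1.4k bp of 16S rRNA corresponds to roughly one or two differing sites.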
López-Wilchis, Ricardo; Del Río-Portilla, Miguel Ángel; Guevara-Chumacero, Luis Manuel
2017-02-01
We described the complete mitochondrial genome (mitogenome) of the Wagner's mustached bat, Pteronotus personatus, a species belonging to the family Mormoopidae, and compared it with other published mitogenomes of bats (Chiroptera). The mitogenome of P. personatus was 16,570 bp long and contained a typically conserved structure including 13 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes, and one control region (D-loop). Most of the genes were encoded on the H-strand, except for eight tRNA and the ND6 genes. The order of protein-coding and rRNA genes was highly conserved in all mitogenomes. All protein-coding genes started with an ATG codon, except for ND2, ND3, and ND5, which initiated with ATA, and terminated with the typical stop codon TAA/TAG or the codon AGA. Phylogenetic trees constructed using Maximum Parsimony, Maximum Likelihood, and Bayesian inference methods showed an identical topology and indicated the monophyly of different families of bats (Mormoopidae, Phyllostomidae, Vespertilionidae, Rhinolophidae, and Pteropodidae) and the existence of two major clades corresponding to the suborders Yangochiroptera and Yinpterochiroptera. The mitogenome sequence provided here will be useful for further phylogenetic analyses and population genetic studies in mormoopid bats.
Molecular phylogeny of the armored catfish family Callichthyidae (Ostariophysi, Siluriformes).
Shimabukuro-Dias, Cristiane Kioko; Oliveira, Claudio; Reis, Roberto E; Foresti, Fausto
2004-07-01
The family Callichthyidae comprises eight genera of fishes widely distributed across the Neotropical region. In the present study, sequences of the mitochondrial genes 12S rRNA, 16S rRNA, ND4, tRNAHis, and tRNASer were obtained from 28 callichthyid specimens. The sample included 12 species of Corydoras, three species of Aspidoras, two species of Brochis, Dianema, Lepthoplosternum, and Megalechis, and two local populations of Callichthys and Hoplosternum. Sequences of Nematogenys inermis (Nematogenyidae), Trichomycterus areolatus, and Henonemus punctatus (Trichomycteridae), Astroblepus sp. (Astroblepidae), and Neoplecostomus paranensis, Delturus parahybae, and Hemipsilichthys nimius (Loricariidae) were included as the outgroup. Phylogenetic analyses were performed by using the methods of maximum parsimony and maximum likelihood. The results of almost all analyses were very similar. The family Callichthyidae is monophyletic and comprises two natural groups: the subfamilies Corydoradinae (Aspidoras, Brochis, and Corydoras) and Callichthyinae (Callichthys, Dianema, Hoplosternum, Lepthoplosternum, and Megalechis), as previously demonstrated by morphological studies. The relationships observed within these subfamilies are in several ways different from those previously proposed on the basis of morphological data. Molecular results were compared with the morphologic and cytogenetic data available on the family. Copyright 2003 Elsevier Inc.
Martin, Donald S; Wright, André-Denis G; Barta, John R; Desser, Sherwin S
2002-06-01
Phylogenetic relationships within the kinetoplastid flagellates were inferred from comparisons of small-subunit ribosomal RNA gene sequences. These included 5 new gene sequences, Trypanosoma fallisi (2,239 bp), Trypanosoma chattoni (2,180 bp), Trypanosoma mega (2,211 bp), Trypanosoma neveulemairei (2,197 bp), and Trypanosoma ranarum (2,203 bp). Trees produced using maximum-parsimony and distance-matrix methods (least-squares, neighbor-joining, and maximum-likelihood), supported by strong bootstrap and quartet-puzzle analyses, indicated that the trypanosomes are a monophyletic group that divides into 2 major lineages, the salivarian trypanosomes and the nonsalivarian trypanosomes. The nonsalivarian trypanosomes further divide into 2 lineages, 1 containing trypanosomes of birds, mammals, and reptiles and the other containing trypanosomes of fish, reptiles, and anurans. Among the giant trypanosomes, T. chattoni is clearly shown to be distantly related to all the other anuran trypanosome species. Trypanosoma mega is closely associated with T. fallisi and T. ranarum, whereas T. neveulemairei and Trypanosoma rotatorium are sister taxa. The branching order of the anuran trypanosomes suggests that some toad trypanosomes may have evolved by host switching from frogs to toads.
Bärmann, Eva Verena; Rössner, Gertrud Elisabeth; Wörheide, Gert
2013-05-01
Antilopini (gazelles and their allies) are one of the most diverse but phylogenetically controversial groups of bovids. Here we provide a molecular phylogeny of this poorly understood taxon using combined analyses of mitochondrial (CYTB, COIII, 12S, 16S) and nuclear (KCAS, SPTBN1, PRKCI, MC1R, THYR) genes. We explore the influence of data partitioning and different analytical methods, including Bayesian inference, maximum likelihood and maximum parsimony, on the inferred relationships within Antilopini. We achieve increased resolution and support compared to previous analyses especially in the two most problematic parts of their tree. First, taxa commonly referred to as "gazelles" are recovered as paraphyletic, as the genus Gazella appears more closely related to the Indian blackbuck (Antilope cervicapra) than to the other two gazelle genera (Nanger and Eudorcas). Second, we recovered a strongly supported sister relationship between one of the dwarf antelopes (Ourebia) and the Antilopini subgroup Antilopina (Saiga, Gerenuk, Springbok, Blackbuck and gazelles). The assessment of the influence of taxon sampling, outgroup rooting, and data partitioning in Bayesian analyses helps explain the contradictory results of previous studies. Copyright © 2013 Elsevier Inc. All rights reserved.
Vink, Cor J; Paterson, Adrian M
2003-09-01
Datasets from the mitochondrial gene regions NADH dehydrogenase subunit I (ND1) and cytochrome c oxidase subunit I (COI) of the 20 species in the New Zealand wolf spider (Lycosidae) genus Anoteropsis were generated. Sequence data were phylogenetically analysed using parsimony and maximum likelihood analyses. The phylogenies generated from the ND1 and COI sequence data and a previously generated morphological dataset were significantly congruent (p<0.001). Sequence data were combined with morphological data and phylogenetically analysed using parsimony. The ND1 region sequenced included part of tRNA(Leu(CUN)), which appears to have an unstable amino-acyl arm and no TΨC arm in lycosids. Analyses supported the existence of five species groups within Anoteropsis and the monophyly of species represented by multiple samples. A radiation of Anoteropsis species within the last five million years is inferred from the ND1 and COI likelihood phylograms, habitat and geological data, which also indicates that Anoteropsis arrived in New Zealand some time after it separated from Gondwana.
Evolution of complex fruiting-body morphologies in homobasidiomycetes.
Hibbett, David S; Binder, Manfred
2002-01-01
The fruiting bodies of homobasidiomycetes include some of the most complex forms that have evolved in the fungi, such as gilled mushrooms, bracket fungi and puffballs ('pileate-erect' forms). Homobasidiomycetes also include relatively simple crust-like 'resupinate' forms, however, which account for ca. 13-15% of the described species in the group. Resupinate homobasidiomycetes have been interpreted either as a paraphyletic grade of plesiomorphic forms or a polyphyletic assemblage of reduced forms. The former view suggests that morphological evolution in homobasidiomycetes has been marked by independent elaboration in many clades, whereas the latter view suggests that parallel simplification has been a common mode of evolution. To infer patterns of morphological evolution in homobasidiomycetes, we constructed phylogenetic trees from a dataset of 481 species and performed ancestral state reconstruction (ASR) using parsimony and maximum likelihood (ML) methods. ASR with both parsimony and ML implies that the ancestor of the homobasidiomycetes was resupinate, and that there have been multiple gains and losses of complex forms in the homobasidiomycetes. We also used ML to address whether there is an asymmetry in the rate of transformations between simple and complex forms. Models of morphological evolution inferred with ML indicate that the rate of transformations from simple to complex forms is about three to six times greater than the rate of transformations in the reverse direction. A null model of morphological evolution, in which there is no asymmetry in transformation rates, was rejected. These results suggest that there is a 'driven' trend towards the evolution of complex forms in homobasidiomycetes. PMID:12396494
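Parsimony-based ancestral state reconstruction of a two-state character, as described in the abstract above, can be sketched with the bottom-up pass of Fitch's algorithm for unordered characters. This is an illustration only: the abstract does not specify the software or parsimony variant used, and the five-taxon tree, taxon labels and state codes (R for resupinate, P for pileate-erect) are hypothetical.

```python
def fitch(node, states):
    """Return (candidate state set, minimum number of changes) for the subtree at node.

    A node is either a leaf name (str) or a pair (left, right) of child nodes.
    """
    if isinstance(node, str):
        return {states[node]}, 0
    left, right = node
    left_set, left_changes = fitch(left, states)
    right_set, right_changes = fitch(right, states)
    common = left_set & right_set
    if common:
        # Children agree on at least one state: no extra change needed here.
        return common, left_changes + right_changes
    # Children disagree: keep both options and count one change.
    return left_set | right_set, left_changes + right_changes + 1

# Hypothetical rooted binary tree and tip states.
tree = ((("A", "B"), "C"), ("D", "E"))
tip_states = {"A": "R", "B": "P", "C": "R", "D": "R", "E": "P"}
root_states, changes = fitch(tree, tip_states)
```

For this toy tree the root is reconstructed as R with two state changes, which mirrors the kind of inference reported in the abstract (a resupinate ancestor with repeated origins of complex forms), although it says nothing about the actual 481-species analysis.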
Takamiya, Tomoko; Wongsawad, Pheravut; Sathapattayanon, Apirada; Tajima, Natsuko; Suzuki, Shunichiro; Kitamura, Saki; Shioda, Nao; Handa, Takashi; Kitanaka, Susumu; Iijima, Hiroshi; Yukawa, Tomohisa
2014-01-01
It is always difficult to construct coherent classification systems for plant lineages having diverse morphological characters. The genus Dendrobium, one of the largest genera in the Orchidaceae, includes ∼1100 species, and enormous morphological diversification has hindered the establishment of consistent classification systems covering all major groups of this genus. Given the particular importance of species in Dendrobium section Dendrobium and allied groups as floriculture and crude drug genetic resources, there is an urgent need to establish a stable classification system. To clarify phylogenetic relationships in Dendrobium section Dendrobium and allied groups, we analysed the macromolecular characters of the group. Phylogenetic analyses of 210 taxa of Dendrobium were conducted on DNA sequences of internal transcribed spacer (ITS) regions of 18S–26S nuclear ribosomal DNA and the maturase-coding gene (matK) located in an intron of the plastid gene trnK using maximum parsimony and Bayesian methods. The parsimony and Bayesian analyses revealed 13 distinct clades in the group comprising section Dendrobium and its allied groups. Results also showed paraphyly or polyphyly of sections Amblyanthus, Aporum, Breviflores, Calcarifera, Crumenata, Dendrobium, Densiflora, Distichophyllae, Dolichocentrum, Holochrysa, Oxyglossum and Pedilonum. On the other hand, the monophyly of section Stachyobium was well supported. It was found that many of the morphological characters that have been believed to reflect phylogenetic relationships are, in fact, the result of convergence. As such, many of the sections that have been recognized up to this point were found to not be monophyletic, so recircumscription of sections is required. PMID:25107672
Surveillance of Echinococcus isolates from Qinghai, China.
Ma, Junying; Wang, Hu; Lin, Gonghua; Zhao, Fang; Li, Chao; Zhang, Tongzuo; Ma, Xiao; Zhang, Yongguo; Hou, Zhibin; Cai, Huixia; Liu, Peiyun; Wang, Yongshun
2015-01-15
Echinococcosis is highly endemic over large parts of the Qinghai-Tibet Plateau (QTP), China. Based on a large number of samples, we present data on the current presence, host distribution, and genetic diversity of Echinococcus in the Qinghai Province, located in the northeastern corner of the QTP and constituting >25% of the area of the plateau. We used 521 samples (including 451 newly collected samples and 70 samples from our previous study) from humans, yaks, sheep, goats, dogs, foxes, plateau pikas, and voles in 36 counties, and genotyped them using the mitochondrial DNA marker cytochrome oxidase subunit I (cox1) gene and the maximum parsimony and Bayesian reconstruction methods. Based on the 792 bp sequence matrix, we recorded 177 variable sites; 157 were parsimony-informative. A total of 105 haplotypes (H1-H105) were detected, of which H1-H15 and H90-H104, H16-H17, H18-H89, and H105 belonged to Echinococcus shiquicus, Echinococcus multilocularis, Echinococcus granulosus, and Echinococcus canadensis, respectively. Our results showed that (i) the Qinghai Province was under a high burden of Echinococcus epidemiology; (ii) E. granulosus was the main echinococcosis threat to the local people, followed by E. multilocularis; (iii) there are a considerable number of haplotypes shared by domestic animals (sheep, yaks, and dogs) and humans, demonstrating the close relationship between human and domestic animal epidemiology; (iv) the threat of E. shiquicus to humans and livestock can be mostly ignored, while the infection risk of E. canadensis echinococcosis should not be neglected. Copyright © 2014 Elsevier B.V. All rights reserved.
Reconstructing the origin and elaboration of insect-trapping inflorescences in the Araceae
Bröderbauer, David; Diaz, Anita; Weber, Anton
2016-01-01
Premise of the study Floral traps are among the most sophisticated devices that have evolved in angiosperms in the context of pollination, but the evolution of trap pollination has not yet been studied in a phylogenetic context. We aim to determine the evolutionary history of morphological traits that facilitate trap pollination and to elucidate the impact of pollinators on the evolution of inflorescence traps in the family Araceae. Methods Inflorescence morphology was investigated to determine the presence of trapping devices and to classify functional types of traps. We inferred phylogenetic relationships in the family using maximum likelihood and Bayesian methods. Character evolution of trapping devices, trap types, and pollinator types was then assessed with maximum parsimony and Bayesian methods. We also tested for an association of trap pollination with specific pollinator types. Key results Inflorescence traps have evolved independently at least 10 times within the Araceae. Trapping devices were found in 27 genera. On the basis of different combinations of trapping devices, six functional types of traps were identified. Trap pollination in Araceae is correlated with pollination by flies. Conclusions Trap pollination in the Araceae is more common than was previously thought. Preadaptations such as papillate cells or elongated sterile flowers facilitated the evolution of inflorescence traps. In some clades, imperfect traps served as a precursor for the evolution of more elaborate traps. Traps that evolved in association with fly pollination were most probably derived from mutualistic ancestors, offering a brood-site to their pollinators. PMID:22965851
Stasis and convergence characterize morphological evolution in eupolypod II ferns
Sundue, Michael A.; Rothfels, Carl J.
2014-01-01
Background and Aims: Patterns of morphological evolution at levels above family rank remain underexplored in the ferns. The present study seeks to address this gap through analysis of 79 morphological characters for 81 taxa, including representatives of all ten families of eupolypod II ferns. Recent molecular phylogenetic studies demonstrate that the evolution of the large eupolypod II clade (which includes nearly one-third of extant fern species) features unexpected patterns. The traditional ‘athyrioid’ ferns are scattered across the phylogeny despite their apparent morphological cohesiveness, and mixed among these seemingly conservative taxa are morphologically dissimilar groups that lack any obvious features uniting them with their relatives. Maximum-likelihood and maximum-parsimony character optimizations are used to determine characters that unite the seemingly disparate groups, and to test whether the polyphyly of the traditional athyrioid ferns is due to evolutionary stasis (symplesiomorphy) or convergent evolution. The major events in eupolypod II character evolution are reviewed, and character and character state concepts are reappraised, as a basis for further inquiries into fern morphology. Methods: Characters were scored from the literature, live plants and herbarium specimens, and optimized using maximum-parsimony and maximum-likelihood, onto a highly supported topology derived from maximum-likelihood and Bayesian analysis of molecular data. Phylogenetic signal of characters was tested using randomization methods and fitdiscrete. Key Results: The majority of character state changes within the eupolypod II phylogeny occur at the family level or above. Relative branch lengths for the morphological data resemble those from molecular data and fit an ancient rapid radiation model (long branches subtended by very short backbone internodes), with few characters uniting the morphologically disparate clades.
The traditional athyrioid ferns were circumscribed based upon a combination of symplesiomorphic and homoplastic characters. Petiole vasculature consisting of two bundles is ancestral for eupolypods II and a synapomorphy for eupolypods II under deltran optimization. Sori restricted to one side of the vein defines the recently recognized clade comprising Rhachidosoraceae through Aspleniaceae, and sori present on both sides of the vein is a synapomorphy for the Athyriaceae sensu stricto. The results indicate that a chromosome base number of x = 41 is synapomorphic for all eupolypods, a clade that includes over two-thirds of extant fern species. Conclusions: The integrated approach synthesizes morphological studies with current phylogenetic hypotheses and provides explicit statements of character evolution in the eupolypod II fern families. Strong character support is found for previously recognized clades, whereas few characters support previously unrecognized clades. Sorus position appears to be less complicated than previously hypothesized, and linear sori restricted to one side of the vein support the clade comprising Aspleniaceae, Diplaziopsidaceae, Hemidictyaceae and Rhachidosoraceae – a lineage only recently identified. Despite x = 41 being a frequent number among extant species, to our knowledge it has not previously been demonstrated as the ancestral state. This is the first synapomorphy proposed for the eupolypod clade, a lineage comprising 67 % of extant fern species. This study provides some of the first hypotheses of character evolution at the family level and above in light of recent phylogenetic results, and promotes further study in an area that remains open for original observation. PMID:24197753
Sex and the Catasetinae (Darwin's favourite orchids).
Pérez-Escobar, Oscar Alejandro; Gottschling, Marc; Whitten, W Mark; Salazar, Gerardo; Gerlach, Günter
2016-04-01
Two sexual systems are predominant in Catasetinae (Orchidaceae), namely protandry (which has evolved in other orchid lineages as well) and environmental sex determination (ESD), a unique trait among Orchidaceae. Yet, the lack of a robust phylogenetic framework for Catasetinae has hampered deeper insights into the origin and evolution of sexual systems. To investigate the origins of protandry and ESD in Catasetinae, we sequenced nuclear and chloroplast loci from 77 species, providing the most extensive data matrix of Catasetinae available so far with all major lineages represented. We used Maximum Parsimony, Maximum Likelihood and Bayesian methods to infer phylogenetic relationships and evolution of sexual systems. Irrespective of the methods used, Catasetinae were monophyletic in molecular phylogenies, with all established generic lineages and their relationships resolved and highly supported. According to comparative reconstruction approaches, the last common ancestor of Catasetinae was inferred as having bisexual flowers (i.e., lacking both protandry and ESD), and protandry originated once in core Catasetinae (comprising Catasetum, Clowesia, Cycnoches, Dressleria and Mormodes). In addition, three independent gains of ESD are reliably inferred, linked to corresponding loss of protandry within core Catasetinae. Thus, prior gain of protandry appears as the necessary prerequisite for gain of ESD in orchids. Our results contribute to a comprehensive evolutionary scenario for sexual systems in Catasetinae and more generally in orchids as well. Copyright © 2015 Elsevier Inc. All rights reserved.
Tan, D Y; Hair-Bejo, M; Omar, A R; Aini, I
2004-01-01
The characteristics of the pathogenic infectious bursal disease virus (IBDV) that infected avian species other than commercial chickens were largely unknown. In this study, by using in vivo and molecular methods, we characterized an IBDV isolate (named 94268) isolated from an infectious bursal disease (IBD) outbreak in Malaysian village chickens--the adulterated descendant of the Southeast Asian jungle fowl (Gallus bankiva) that were commonly reared in the backyard. The 94268 isolate was grouped as a very virulent IBDV (vvIBDV) strain because it caused severe lesions and a high mortality rate in village chickens (>88%) and experimentally infected specific-pathogen-free chickens (>66%). In addition, it possessed all of the vvIBDV molecular markers in its VP2 gene. Phylogenetic analysis using distance, maximum parsimony, and maximum likelihood methods revealed that 94268 was monophyletic with other vvIBDV isolates and closely related to the Malaysian vvIBDV isolates. Given that the VP2 gene of the 94268 isolate was almost identical and evolutionarily closely related to other field IBDV isolates that affected commercial chickens, we concluded that IBD infections had spread across the farm boundary. IBD infection in the village chicken may represent an important part of the IBD epidemiology because these birds could harbor the vvIBDV strain and should not be overlooked in the control and prevention of the disease.
Wang, Li; Yokoyama, Koji; Miyaji, Makoto; Nishimura, Kazuko
2001-01-01
We analyzed a 402-bp sequence of the mitochondrial cytochrome b gene of 34 strains of Exophiala jeanselmei and 16 strains representing 12 related species. The strains of E. jeanselmei were classified into 20 DNA types and 17 amino acid types. The differences between these strains were found in 1 to 60 nucleotides and 1 to 17 amino acids. On the basis of the identities and similarities of nucleotide and amino acid sequences, some strains were reidentified: i.e., two strains of E. jeanselmei var. heteromorpha and one strain of E. castellanii as E. dermatitidis (including the type strain), three strains of E. jeanselmei as E. jeanselmei var. lecanii-corni (including the type strain), three strains of E. jeanselmei as E. bergeri (including the type strain), seven strains of E. jeanselmei as E. pisciphila (including the type strain), seven strains of E. jeanselmei as E. jeanselmei var. jeanselmei (including the type strain), one strain of E. jeanselmei as Fonsecaea pedrosoi (including the type strain), and one strain of E. jeanselmei as E. spinifera (including the type strain). Some E. jeanselmei strains showed distinct nucleotide and amino acid sequences. The amino-acid-based UPGMA (unweighted pair group method with the arithmetic mean) tree exhibited nearly the same topology as those of the DNA-based trees obtained by neighbor joining, maximum parsimony, and maximum likelihood methods. PMID:11724862
Neutral changes during divergent evolution of hemoglobins
NASA Technical Reports Server (NTRS)
Jukes, T. H.
1978-01-01
A comparison of the mRNAs for rabbit and human beta-hemoglobins shows that synonymous changes in codons have accumulated three times as rapidly as nucleotide replacements that produced changes in amino acids. This agrees with predictions based on the so-called neutral theory. In addition, seven codon changes that appear to be single-base changes (according to maximum parsimony) are actually two-base changes. This indicates that the construction of primordial sequences is of limited significance when based on inferences that assume minimum base changes for amino acid replacements.
Xylopsora canopeorum (Umbilicariaceae), a new lichen species from the canopy of Sequoia sempervirens
Bendiksby, Mika; Næsborg, Rikke Reese; Timdal, Einar
2018-01-01
Xylopsora canopeorum Timdal, Reese Næsborg & Bendiksby is described as a new species occupying the crowns of large Sequoia sempervirens trees in California, USA. The new species is supported by morphology, anatomy, secondary chemistry and DNA sequence data. While similar in external appearance to X. friesii, it is distinguished by forming smaller, partly coralloid squamules, by the occurrence of soralia and, in some specimens, by the presence of thamnolic acid in addition to friesiic acid in the thallus. Molecular phylogenetic results are based on nuclear (ITS and LSU) as well as mitochondrial (SSU) ribosomal DNA sequence alignments. Phylogenetic hypotheses obtained using Bayesian Inference, Maximum Likelihood and Maximum Parsimony all support X. canopeorum as a distinct evolutionary lineage belonging to the X. caradocensis–X. friesii clade. PMID:29559828
Parsimonious extreme learning machine using recursive orthogonal least squares.
Wang, Ning; Er, Meng Joo; Han, Min
2014-10-01
Novel constructive and destructive parsimonious extreme learning machines (CP- and DP-ELM) are proposed in this paper. By virtue of the proposed ELMs, parsimonious structure and excellent generalization of multi-input multi-output single hidden-layer feedforward networks (SLFNs) are obtained. The proposed ELMs are developed by innovative decomposition of the recursive orthogonal least squares procedure into sequential partial orthogonalization (SPO). The salient features of the proposed approaches are as follows: 1) Initial hidden nodes are randomly generated by the ELM methodology and recursively orthogonalized into an upper triangular matrix with dramatic reduction in matrix size; 2) the constructive SPO in the CP-ELM focuses on the partial matrix with the subcolumn of the selected regressor including nonzeros as the first column while the destructive SPO in the DP-ELM operates on the partial matrix including elements determined by the removed regressor; 3) termination criteria for CP- and DP-ELM are simplified by the additional residual error reduction method; and 4) the output weights of the SLFN need not be solved in the model selection procedure and are derived from the final upper triangular equation by backward substitution. Both single- and multi-output real-world regression data sets are used to verify the effectiveness and superiority of the CP- and DP-ELM in terms of parsimonious architecture and generalization accuracy. Innovative applications to nonlinear time-series modeling demonstrate superior identification results.
Pyron, R Alexander; Hendry, Catriona R; Chou, Vincent M; Lemmon, Emily M; Lemmon, Alan R; Burbrink, Frank T
2014-12-01
Next-generation genomic sequencing promises to quickly and cheaply resolve remaining contentious nodes in the Tree of Life, and facilitates species-tree estimation while taking into account stochastic genealogical discordance among loci. Recent methods for estimating species trees bypass full likelihood-based estimates of the multi-species coalescent, and approximate the true species-tree using simpler summary metrics. These methods converge on the true species-tree with sufficient genomic sampling, even in the anomaly zone. However, no studies have yet evaluated their efficacy on a large-scale phylogenomic dataset, and compared them to previous concatenation strategies. Here, we generate such a dataset for Caenophidian snakes, a group with >2500 species that contains several rapid radiations that were poorly resolved with fewer loci. We generate sequence data for 333 single-copy nuclear loci with ∼100% coverage (∼0% missing data) for 31 major lineages. We estimate phylogenies using neighbor joining, maximum parsimony, maximum likelihood, and three summary species-tree approaches (NJst, STAR, and MP-EST). All methods yield similar resolution and support for most nodes. However, not all methods support monophyly of Caenophidia, with Acrochordidae placed as the sister taxon to Pythonidae in some analyses. Thus, phylogenomic species-tree estimation may occasionally disagree with well-supported relationships from concatenated analyses of small numbers of nuclear or mitochondrial genes, a consideration for future studies. In contrast, for at least two diverse, rapid radiations (Lamprophiidae and Colubridae), phylogenomic data and species-tree inference do little to improve resolution and support. Thus, certain nodes may lack strong signal, and larger datasets and more sophisticated analyses may still fail to resolve them. Copyright © 2014 Elsevier Inc. All rights reserved.
Phylogenetic relationships, diversification and expansion of chili peppers (Capsicum, Solanaceae)
Carrizo García, Carolina; Barfuss, Michael H. J.; Sehr, Eva M.; Barboza, Gloria E.; Samuel, Rosabelle; Moscone, Eduardo A.; Ehrendorfer, Friedrich
2016-01-01
Background and Aims: Capsicum (Solanaceae), native to the tropical and temperate Americas, comprises the well-known sweet and hot chili peppers and several wild species. So far, only partial taxonomic and phylogenetic analyses have been done for the genus. Here, the phylogenetic relationships between nearly all taxa of Capsicum were explored to test the monophyly of the genus and to obtain a better knowledge of species relationships, diversification and expansion. Methods: Thirty-four of approximately 35 Capsicum species were sampled. Maximum parsimony and Bayesian inference analyses were performed using two plastid markers (matK and psbA-trnH) and one single-copy nuclear gene (waxy). The evolutionary changes of nine key features were reconstructed following the parsimony ancestral states method. Ancestral areas were reconstructed through a Bayesian Markov chain Monte Carlo analysis. Key Results: Capsicum forms a monophyletic clade, with Lycianthes as a sister group, following both phylogenetic approaches. Eleven well-supported clades (four of them monotypic) can be recognized within Capsicum, although some interspecific relationships need further analysis. A few features are useful to characterize different clades (e.g. fruit anatomy, chromosome base number), whereas some others are highly homoplastic (e.g. seed colour). The origin of Capsicum is postulated in an area along the Andes of western to north-western South America. The expansion of the genus has followed a clockwise direction around the Amazon basin, towards central and south-eastern Brazil, then back to western South America, and finally northwards to Central America. Conclusions: New insights are provided regarding interspecific relationships, character evolution, and geographical origin and expansion of Capsicum. A clearly distinct early-diverging clade can be distinguished, centred in western–north-western South America. Subsequent rapid speciation has led to the origin of the remaining clades.
The diversification of Capsicum has culminated in the origin of the main cultivated species in several regions of South to Central America. PMID:27245634
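Parsimony reconstruction of ancestral states, as mentioned in the Methods of the abstract above, can be sketched for a single unordered character with the bottom-up pass of Fitch's algorithm. This is a generic illustration in Python, not the software or settings used in the study: the tree shape, the taxon names A to E, and the state labels are hypothetical, and equal weights for all state changes are assumed.

```python
def fitch(tree, states):
    """Bottom-up Fitch pass on a rooted binary tree.

    tree: a leaf name (str) or a pair (left_subtree, right_subtree).
    states: dict mapping each leaf name to its observed character state.
    Returns (candidate state set at this node, minimum number of changes below it).
    """
    if isinstance(tree, str):
        # A leaf carries its observed state and requires no changes.
        return {states[tree]}, 0
    left, right = tree
    left_set, left_cost = fitch(left, states)
    right_set, right_cost = fitch(right, states)
    common = left_set & right_set
    if common:
        # The children agree on at least one state: no extra change needed.
        return common, left_cost + right_cost
    # The children disagree: take the union and count one change.
    return left_set | right_set, left_cost + right_cost + 1


# Hypothetical five-taxon tree and one two-state character.
tree = ((("A", "B"), "C"), ("D", "E"))
seed_colour = {"A": "red", "B": "red", "C": "yellow", "D": "yellow", "E": "yellow"}
print(fitch(tree, seed_colour))  # → ({'yellow'}, 1)
```

The returned set lists the states that are equally parsimonious at the root; a full reconstruction would add a top-down pass to assign states to every internal node.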
Dutheil, Julien; Gaillard, Sylvain; Bazin, Eric; Glémin, Sylvain; Ranwez, Vincent; Galtier, Nicolas; Belkhir, Khalid
2006-04-04
A large number of bioinformatics applications in the fields of bio-sequence analysis, molecular evolution and population genetics typically share input/output methods, data storage requirements and data analysis algorithms. Such common features may be conveniently bundled into re-usable libraries, which enable the rapid development of new methods and robust applications. We present Bio++, a set of object-oriented libraries written in C++. Available components include classes for data storage and handling (nucleotide/amino-acid/codon sequences, trees, distance matrices, population genetics datasets), various input/output formats, basic sequence manipulation (concatenation, transcription, translation, etc.), phylogenetic analysis (maximum parsimony, Markov models, distance methods, likelihood computation and maximization), population genetics/genomics (diversity statistics, neutrality tests, various multi-locus analyses) and various algorithms for numerical calculus. Implementation of methods aims at being both efficient and user-friendly. Special attention was given to the library design to enable easy extension and new methods development. We defined a general hierarchy of classes that allows developers to implement their own algorithms while remaining compatible with the rest of the libraries. Bio++ source code is distributed free of charge under the CeCILL general public licence from its website http://kimura.univ-montp2.fr/BioPP.
Parsimony and goodness-of-fit in multi-dimensional NMR inversion
NASA Astrophysics Data System (ADS)
Babak, Petro; Kryuchkov, Sergey; Kantzas, Apostolos
2017-01-01
Multi-dimensional nuclear magnetic resonance (NMR) experiments are often used to study the molecular structure and dynamics of matter in core analysis and reservoir evaluation. Industrial applications of multi-dimensional NMR involve a high-dimensional measurement dataset with complicated correlation structure and require rapid and stable inversion algorithms from the time domain to the relaxation rate and/or diffusion domains. In practice, applying existing inverse algorithms with a large number of parameter values leads to an infinite number of solutions with a reasonable fit to the NMR data. Interpreting this variability among multiple solutions and selecting the most appropriate one can be a very complex problem. In most cases the characteristics of materials have sparse signatures, and investigators would like to distinguish the most significant relaxation and diffusion values of the materials. To produce an easy-to-interpret and unique NMR distribution with a finite number of principal parameter values, we introduce a new method for NMR inversion. The method is based on the trade-off between the conventional goodness-of-fit approach to multivariate data and the principle of parsimony, which guarantees inversion with the least number of parameter values. We suggest performing the inversion of NMR data using the forward stepwise regression selection algorithm. To account for the trade-off between goodness-of-fit and parsimony, the objective function is selected based on the Akaike Information Criterion (AIC). The performance of the developed multi-dimensional NMR inversion method and its comparison with conventional methods are illustrated using real data for samples with bitumen, water and clay.
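The trade-off between goodness-of-fit and parsimony described in the abstract above can be sketched as a generic forward stepwise selection that stops when the AIC no longer improves. The Python sketch below is not the authors' NMR inversion: it assumes ordinary least squares with the Gaussian form AIC = n ln(RSS/n) + 2k, ignores any non-negativity or kernel structure an NMR inversion would need, and uses hypothetical candidate columns and data. The function and variable names are illustrative.

```python
import math


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def forward_stepwise_aic(columns, y):
    """Greedy forward selection of regressors, stopping when AIC stops improving.

    columns: list of candidate regressors, each a list of n floats.
    y: list of n observations.
    Returns (indices of selected columns in order of entry, AIC of the final model).
    """
    n = len(y)
    residual = list(y)
    # Working copies of the candidates, kept orthogonal to the selected columns.
    work = [list(c) for c in columns]
    selected = []
    rss = max(dot(residual, residual), 1e-300)
    best_aic = n * math.log(rss / n)  # empty model, k = 0
    while len(selected) < len(columns):
        # Find the candidate that would reduce the residual sum of squares most.
        best_j, best_drop = None, 0.0
        for j, q in enumerate(work):
            qq = dot(q, q)
            if j in selected or qq < 1e-12:
                continue
            drop = dot(residual, q) ** 2 / qq
            if drop > best_drop:
                best_j, best_drop = j, drop
        if best_j is None:
            break
        new_rss = max(rss - best_drop, 1e-300)
        aic = n * math.log(new_rss / n) + 2 * (len(selected) + 1)
        if aic >= best_aic:
            break  # the extra parameter is not worth the improvement in fit
        q = work[best_j]
        qq = dot(q, q)
        # Remove the component explained by the new column from the residual.
        coef = dot(residual, q) / qq
        residual = [r - coef * a for r, a in zip(residual, q)]
        # Orthogonalize the remaining candidates against the new column.
        for j, c in enumerate(work):
            if j != best_j and j not in selected:
                g = dot(c, q) / qq
                work[j] = [a - g * b for a, b in zip(c, q)]
        selected.append(best_j)
        rss, best_aic = new_rss, aic
    return selected, best_aic


# Hypothetical data: y is roughly 2 * columns[1] plus small noise.
columns = [
    [1.0, 1.0, 1.0, 1.0, 1.0, 1.0],
    [0.0, 1.0, 2.0, 3.0, 4.0, 5.0],
    [1.0, -1.0, 1.0, -1.0, 1.0, -1.0],
]
y = [0.1, 1.9, 4.0, 6.1, 7.9, 10.0]
selected, aic = forward_stepwise_aic(columns, y)
print(selected)
```

With these toy data only the second column enters the model: adding a further column lowers the residual slightly but not enough to offset the AIC penalty of two units per parameter.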
Molecular phylogeny of the red panda (Ailurus fulgens).
Slattery, J P; O'Brien, S J
1995-01-01
The phylogenetic placement of the red panda (Ailurus fulgens) and the giant panda (Ailuropoda melanoleuca) has been an evolutionary enigma since their original descriptions in the nineteenth century. A series of recent molecular analyses led to a consensus that the giant panda's ancestors were derived from early bears (Ursidae), but left unsettled the phylogenetic relationship of the red panda. Previous molecular and morphological phylogenies were inconclusive and varied among placement of the red panda within the raccoon family (Procyonidae), within the bear family (Ursidae), or in a separate family of carnivores equidistant between the two. To examine a relatively ancient (circa 20-30 million years before the present, MYBP) phylogenetic divergence, we used two slowly evolving genetic markers: mitochondrial 12S rRNA sequence and 592 fibroblast proteins resolved by two dimensional gel electrophoresis. Four different carnivore outgroup species, including dog (Canidae: Canis familiaris), cat (Felidae: Felis catus), fanaloka (Viverridae: Fossa fossa), and mongoose (Herpestidae: Galidia elegans), were selected to identify the root of the phylogenetic topologies. Phylogenetic reconstruction by distance-based methods, maximum parsimony, and maximum likelihood clearly indicate a distinct bifurcation forming the Ursidae and the Procyonidae. Further, our data consistently place the red panda as an early divergence within the Procyonidae radiation and confirm the inclusion of giant panda in the Ursidae lineage.
Structural analysis of the α subunit of Na(+)/K(+) ATPase genes in invertebrates.
Thabet, Rahma; Rouault, J-D; Ayadi, Habib; Leignel, Vincent
2016-01-01
The Na(+)/K(+) ATPase is a ubiquitous pump coordinating the transport of Na(+) and K(+) across the membrane of cells and its role is fundamental to cellular functions. In eukaryotes it is a heteromer comprising two or three subunits (α, β and γ, the last being specific to the vertebrates). The catalytic functions of the enzyme have been attributed to the α subunit. Several complete α protein sequences are available, but only a few gene structures have been characterized. We identified the genomic sequences coding the α-subunit of the Na(+)/K(+) ATPase, from the whole-genome shotgun contigs (WGS), NCBI Genomes (chromosome), Genomic Survey Sequences (GSS) and High Throughput Genomic Sequences (HTGS) databases across distinct phyla. One copy of the α subunit gene was found in Annelida, Arthropoda, Cnidaria, Echinodermata, Hemichordata, Mollusca, Placozoa, Porifera, Platyhelminthes, Urochordata, but the nematodes seem to possess 2 to 4 copies. The number of introns varied from 0 (Platyhelminthes) to 26 (Porifera), and their localization and length are also highly variable. Molecular phylogenies (Maximum Likelihood and Maximum Parsimony methods) showed some clusters constituted by (Chordata/(Echinodermata/Hemichordata)) or (Platyhelminthes/(Annelida/Mollusca)) and a basal position for Porifera. These structural analyses increase our knowledge about the evolutionary events of the α subunit genes in the invertebrates. Copyright © 2016 Elsevier Inc. All rights reserved.
Yuan, Le-Yang; Liu, Xiao-Xiang; Zhang, E
2015-12-21
Sequences from the mitochondrial control region of 14 putative species of Acrossocheilus (Cyprinidae) were examined to elucidate phylogenetic relationships within species of the barred group in that genus. Phylogenetic reconstructions were generated using three tree-building methods: maximum parsimony, maximum likelihood, and Bayesian inference. The resultant phylogenies were consistent with monophyly of the majority of the morphologically recognized species. However, mitochondrial DNA sequence evidence is incongruent with monophyly of A. fasciatus, as currently conceived. This species occurs only in the upper Qiantang-Jiang basin in Zhejiang and Anhui provinces, and coastal rivers in the Zhejiang Province. The species formerly recognized as A. paradoxus from Zhejiang Province is A. fasciatus. The specimens previously reported as A. fasciatus from river basins in Fujian Province are misidentified A. wuyiensis. The barred group of Acrossocheilus is shown to be polyphyletic. Acrossocheilus is restricted to the barred species here placed in "Clade II," containing A. paradoxus and relatives. Separate generic status is recommended for A. monticola and for A. longipinnis and their closest relatives, although more information on phylogenetic relationships based on multiple genes is required to develop robust phylogenetic hypotheses and diagnoses. Masticbarbus Tang, 1942 is available for A. longipinnis and three allied species (A. iridescens, A. microstomus and A. lamus).
Subbotin, S A; Vierstraete, A; De Ley, P; Rowe, J; Waeyenberge, L; Moens, M; Vanfleteren, J R
2001-10-01
The ITS1, ITS2, and 5.8S gene sequences of nuclear ribosomal DNA from 40 taxa of the family Heteroderidae (including the genera Afenestrata, Cactodera, Heterodera, Globodera, Punctodera, Meloidodera, Cryphodera, and Thecavermiculatus) were sequenced and analyzed. The ITS regions displayed high levels of sequence divergence within Heteroderinae and compared to outgroup taxa. Unlike recent findings in root knot nematodes, ITS sequence polymorphism does not appear to complicate phylogenetic analysis of cyst nematodes. Phylogenetic analyses with maximum-parsimony, minimum-evolution, and maximum-likelihood methods were performed with a range of computer alignments, including elision and culled alignments. All multiple alignments and phylogenetic methods yielded similar basic structure for phylogenetic relationships of Heteroderidae. The cyst-forming nematodes are represented by six main clades corresponding to morphological characters and host specialization, with certain clades assuming different positions depending on alignment procedure and/or method of phylogenetic inference. Hypotheses of monophyly of Punctoderinae and Heteroderinae are, respectively, strongly and moderately supported by the ITS data across most alignments. Close relationships were revealed between the Avenae and the Sacchari groups and between the Humuli group and the species H. salixophila within Heteroderinae. The Goettingiana group occupies a basal position within this subfamily. The validity of the genera Afenestrata and Bidera was tested and is discussed based on molecular data. We conclude that ITS sequence data are appropriate for studies of relationships within the different species groups and less so for recovery of more ancient speciations within Heteroderidae. Copyright 2001 Academic Press.
Weisrock, David W; Macey, J Robert; Matsui, Masafumi; Mulcahy, Daniel G; Papenfuss, Theodore J
2013-01-01
The salamander family Hynobiidae contains over 50 species and has been the subject of a number of molecular phylogenetic investigations aimed at reconstructing branches across the entire family. In general, studies using the greatest amount of sequence data have used reduced taxon sampling, while the study with the greatest taxon sampling has used a limited sequence data set. Here, we provide insights into the phylogenetic history of the Hynobiidae using both dense taxon sampling and a large mitochondrial DNA sequence data set. We report exclusive new mitochondrial DNA data of 2566 aligned bases (with 151 excluded sites; of the included sites, 1157 are variable and 957 are parsimony informative). This is sampled from two genic regions encoding a 12S-16S region (the 3' end of 12S rRNA, tRNA(Val), and the 5' end of 16S rRNA), and a ND2-COI region (ND2, tRNA(Trp), tRNA(Ala), tRNA(Asn), the origin for light strand replication--O(L), tRNA(Cys), tRNA(Tyr), and the 5' end of COI). Analyses using parsimony, Bayesian, and maximum likelihood optimality criteria produce similar phylogenetic trees, with discordant branches generally receiving low levels of branch support. Monophyly of the Hynobiidae is strongly supported across all analyses, as is the sister relationship and deep divergence between the genus Onychodactylus and all remaining hynobiids. Within this latter grouping our phylogenetic results identify six clades that are relatively divergent from one another, but for which there is minimal support for their phylogenetic placement. This includes the genus Batrachuperus, the genus Hynobius, the genus Pachyhynobius, the genus Salamandrella, a clade containing the genera Ranodon and Paradactylodon, and a clade containing the genera Liua and Pseudohynobius. This latter clade receives low bootstrap support in the parsimony analysis, but is consistent across all three analytical methods.
Our results also clarify a number of well-supported relationships within the larger Batrachuperus and Hynobius clades. While the relationships identified in this study do much to clarify the phylogenetic history of the Hynobiidae, the poor resolution among major hynobiid clades, and the contrast of mtDNA-derived relationships with recent phylogenetic results from a small number of nuclear genes, highlights the need for continued phylogenetic study with larger numbers of nuclear loci.
The phylogenetic relationships of known mosquito (Diptera: Culicidae) mitogenomes.
Chu, Hongliang; Li, Chunxiao; Guo, Xiaoxia; Zhang, Hengduan; Luo, Peng; Wu, Zhonghua; Wang, Gang; Zhao, Tongyan
2018-01-01
The known mosquito mitogenomes, containing a total of 34 species, which belong to five genera, were collected from GenBank, and the practicality and effectiveness of the variation in the complete mitochondrial DNA genome and portions of the mitochondrial COI gene were assessed to reconstruct the phylogeny of mosquitoes. Phylogenetic trees were reconstructed on the basis of parsimony, maximum likelihood, and Bayesian (BI) methods. It is concluded that: (1) Both mitogenomes and the COI gene support the monophyly of the following taxa: Subgenus Nyssorhynchus, Subgenus Cellia, Anopheles albitarsis complex, Anopheles gambiae complex, and Anopheles punctulatus group; (2) Genus Aedes is not monophyletic relative to Ochlerotatus vigilax; (3) The mitogenome results indicate a close relationship between Anopheles epiroticus and Anopheles gambiae complex, Anopheles dirus complex and Anopheles punctulatus group, respectively; (4) The Bayesian posterior probability (BPP) within the phylogenetic tree reconstructed from mitogenomes is higher than in the COI tree. The results show that phylogenetic relationships reconstructed using the mitogenomes were more similar to those based on morphological data.
Masuda, R; Lopez, J V; Slattery, J P; Yuhki, N; O'Brien, S J
1996-12-01
Molecular phylogeny of the cat family Felidae is derived using two mitochondrial genes, cytochrome b and 12S rRNA. Phylogenetic methods of weighted maximum parsimony and minimum evolution estimated by neighbor-joining are employed to reconstruct topologies among 20 extant felid species. Sequence analyses of 363 bp of cytochrome b and 376 bp of the 12S rRNA genes yielded average pair-wise similarity values between felids ranging from 94 to 99% and from 85 to 99%, respectively. Phylogenetic reconstruction supports more recent, intralineage associations but fails to completely resolve interlineage relationships. Both genes produce a monophyletic group of Felis species but vary in the placement of the pallas cat. The ocelot lineage represents an early divergence within the Felidae, with strong associations between ocelot and margay, Geoffroy's cat and kodkod, and pampas cat and tigrina. Implications of the relative recency of felid evolution, presence of ancestral polymorphisms, and influence of outgroups in placement of the topological root are discussed.
Castilho, Flávio J D; Torres, Rodrigo A; Barbosa, Aneli M; Dekker, Robert F H; Garcia, José E
2009-02-01
The present study is the first describing the sequencing of a fragment of the copper-oxidase domain of a laccase gene in the family Botryosphaeriaceae. The aim of this work was to assess the degree of genetic and evolutionary relationships of a laccase gene from Botryosphaeria rhodina MAMB-05 with other ascomycete and basidiomycete laccase genes. The 193-amino acid sequences of the copper-oxidase domain from several different fungi, insects, a plant, and a bacterial species were retrieved from GenBank and aligned. Phylogenetic analyses were performed using neighbor-joining, maximum parsimony, and Bayesian inference methods. The organisms studied clustered into five gene clades: fungi (ascomycetes and basidiomycetes), insects, plants, and bacteria. Also, the topologies showed that fungal laccases of the ascomycetes and basidiomycetes are clearly separated into two distinct clusters. This evidence indicated that B. rhodina MAMB-05 and other closely related ascomycetes are a new biological resource given the biotechnological potential of their laccase genes.
Genotyping of Giardia lamblia isolates from humans in China and Korea using ribosomal DNA Sequences.
Yong, T S; Park, S J; Hwang, U W; Yang, H W; Lee, K W; Min, D Y; Rim, H J; Wang, Y; Zheng, F
2000-08-01
Genetic characterization of a total of 15 Giardia lamblia isolates, 8 from Anhui Province, China (all from purified cysts) and 7 from Seoul, Korea (2 from axenic cultures and 5 from purified cysts), was performed by polymerase chain reaction amplification and sequencing of a 295-bp region near the 5' end of the small subunit ribosomal DNA (eukaryotic 16S rDNA). Phylogenetic analyses were subsequently conducted using sequence data obtained in this study, as well as sequences published from other Giardia isolates. The maximum parsimony method revealed that G. lamblia isolates from humans in China and Korea are divided into 2 major lineages, assemblages A and B. All 7 Korean isolates were grouped into assemblage A, whereas 4 Chinese isolates were grouped into assemblage A and 4 into assemblage B. Two Giardia microti isolates and 2 dog-derived Giardia isolates also grouped into assemblage B, whereas Giardia ardeae and Giardia muris were unique.
Torres-Carvajal, Omar; Schulte, James A; Cadle, John E
2006-04-01
The South American iguanian lizard genus Stenocercus includes 54 species occurring mostly in the Andes and adjacent lowland areas from northern Venezuela and Colombia to central Argentina at elevations of 0-4000m. Small taxon or character sampling has characterized all phylogenetic analyses of Stenocercus, which has long been recognized as sister taxon to the Tropidurus Group. In this study, we use mtDNA sequence data to perform phylogenetic analyses that include 32 species of Stenocercus and 12 outgroup taxa. Monophyly of this genus is strongly supported by maximum parsimony and Bayesian analyses. Evolutionary relationships within Stenocercus are further analyzed with a Bayesian implementation of a general mixture model, which accommodates variability in the pattern of evolution across sites. These analyses indicate a basal split of Stenocercus into two clades, one of which receives very strong statistical support. In addition, we test previous hypotheses using non-parametric and parametric statistical methods, and provide a phylogenetic classification for Stenocercus.
Xiang, Kun-Li; Wu, Sheng-Dan; Yu, Sheng-Xian; Liu, Yang; Jabbour, Florian; Erst, Andrey S.; Zhao, Liang; Wang, Wei; Chen, Zhi-Duan
2016-01-01
Coptis (Ranunculaceae) contains 15 species and is one of the pharmaceutically most important plant genera in eastern Asia. Understanding of the evolution of morphological characters and phylogenetic relationships within the genus is very limited. Here, we present the first comprehensive phylogenetic analysis of the genus based on two plastid markers and one nuclear marker. The phylogeny was reconstructed using Bayesian inference, as well as maximum parsimony and maximum likelihood methods. The Swofford-Olsen-Waddell-Hillis and Bayesian tests were used to assess the strength of the conflicts between traditional taxonomic units and those suggested by the phylogenetic inferences. Evolution of morphological characters was inferred using a Bayesian method to identify synapomorphies for the infrageneric lineages. Our data recognize two strongly supported clades within Coptis. The first clade contains subgenus Coptis and section Japonocoptis of subgenus Metacoptis, supported by morphological characters, such as traits of the central leaflet base, petal color, and petal shape. The second clade consists of section Metacoptis of subgenus Metacoptis. Coptis morii is not united with C. quinquefolia, in contrast with the view that C. morii is a synonym of C. quinquefolia. Two varieties of C. chinensis do not cluster together. Coptis groenlandica and C. lutescens are reduced to C. trifolia and C. japonica, respectively. Central leaflet base, sepal shape, and petal blade carry a strong phylogenetic signal in Coptis, while leaf type, sepal and petal color, and petal shape exhibit relatively higher levels of evolutionary flexibility. PMID:27044035
Bapst, D W; Wright, A M; Matzke, N J; Lloyd, G T
2016-07-01
Dated phylogenies of fossil taxa allow palaeobiologists to estimate the timing of major divergences and placement of extinct lineages, and to test macroevolutionary hypotheses. Recently developed Bayesian 'tip-dating' methods simultaneously infer and date the branching relationships among fossil taxa, and infer putative ancestral relationships. Using a previously published dataset for extinct theropod dinosaurs, we contrast the dated relationships inferred by several tip-dating approaches and evaluate potential downstream effects on phylogenetic comparative methods. We also compare tip-dating analyses to maximum-parsimony trees time-scaled via alternative a posteriori approaches including via the probabilistic cal3 method. Among tip-dating analyses, we find opposing but strongly supported relationships, despite similarity in inferred ancestors. Overall, tip-dating methods infer divergence dates often millions (or tens of millions) of years older than the earliest stratigraphic appearance of that clade. Model-comparison analyses of the pattern of body-size evolution found that the support for evolutionary mode can vary across and between tree samples from cal3 and tip-dating approaches. These differences suggest that model and software choice in dating analyses can have a substantial impact on the dated phylogenies obtained and broader evolutionary inferences. © 2016 The Author(s).
Lustgarten, Jonathan Lyle; Balasubramanian, Jeya Balaji; Visweswaran, Shyam; Gopalakrishnan, Vanathi
2017-03-01
The comprehensibility of good predictive models learned from high-dimensional gene expression data is attractive because it can lead to biomarker discovery. Several good classifiers provide comparable predictive performance but differ in their abilities to summarize the observed data. We extend a Bayesian Rule Learning (BRL-GSS) algorithm, previously shown to be a significantly better predictor than other classical approaches in this domain. It searches a space of Bayesian networks using a decision tree representation of its parameters with global constraints, and infers a set of IF-THEN rules. The number of parameters, and therefore the number of rules, is combinatorial in the number of predictor variables in the model. We relax these global constraints to a more generalizable local structure (BRL-LSS). BRL-LSS entails a more parsimonious set of rules because it does not have to generate all combinatorial rules. The search space of local structures is much richer than the space of global structures. We design the BRL-LSS with the same worst-case time-complexity as BRL-GSS while exploring a richer and more complex model space. We measure predictive performance using Area Under the ROC curve (AUC) and Accuracy. We measure model parsimony performance by noting the average number of rules and variables needed to describe the observed data. We evaluate the predictive and parsimony performance of BRL-GSS, BRL-LSS and the state-of-the-art C4.5 decision tree algorithm, across 10-fold cross-validation using ten microarray gene-expression diagnostic datasets. In these experiments, we observe that BRL-LSS is similar to BRL-GSS in terms of predictive performance, while generating a much more parsimonious set of rules to explain the same observed data. BRL-LSS also needs fewer variables than C4.5 to explain the data with similar predictive performance. We also conduct a feasibility study to demonstrate the general applicability of our BRL methods on the newer RNA sequencing gene-expression data.
Parsimonious description for predicting high-dimensional dynamics
Hirata, Yoshito; Takeuchi, Tomoya; Horai, Shunsuke; Suzuki, Hideyuki; Aihara, Kazuyuki
2015-01-01
When we observe a system, we often cannot observe all its variables and may have some of its limited measurements. Under such a circumstance, delay coordinates, vectors made of successive measurements, are useful to reconstruct the states of the whole system. Although the method of delay coordinates is theoretically supported for high-dimensional dynamical systems, practically there is a limitation because the calculation for higher-dimensional delay coordinates becomes more expensive. Here, we propose a parsimonious description of virtually infinite-dimensional delay coordinates by evaluating their distances with exponentially decaying weights. This description enables us to predict the future values of the measurements faster because we can reuse the calculated distances, and more accurately because the description naturally reduces the bias of the classical delay coordinates toward the stable directions. We demonstrate the proposed method with toy models of the atmosphere and real datasets related to renewable energy. PMID:26510518
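The remark above that calculated distances can be reused may be easier to see in a small sketch. The Python below is only an illustration under stated assumptions, not the authors' implementation: it assumes a scalar time series, a squared distance, and a single decay factor `lam` (all hypothetical choices), and it truncates the "virtually infinite" sum at the start of the record. Under those assumptions the weighted distance between the delay vectors ending at times t and s satisfies D(t, s) = (x[t] - x[s])^2 + lam * D(t-1, s-1), so each new distance costs one multiplication and one addition.

```python
def direct_distance(x, t, s, lam):
    """Weighted squared distance summed explicitly over all available lags.

    Computes sum_k lam**k * (x[t-k] - x[s-k])**2 for k = 0 .. min(t, s).
    """
    depth = min(t, s) + 1
    return sum(lam ** k * (x[t - k] - x[s - k]) ** 2 for k in range(depth))


def recursive_distances(x, lam):
    """Same distances for every pair (t, s), reusing the previous diagonal entry."""
    n = len(x)
    d = [[0.0] * n for _ in range(n)]
    for t in range(n):
        for s in range(n):
            # Assumed recursion: the older lags are already summarized in d[t-1][s-1].
            prev = d[t - 1][s - 1] if t > 0 and s > 0 else 0.0
            d[t][s] = (x[t] - x[s]) ** 2 + lam * prev
    return d


series = [0.1, 0.5, 0.9, 0.3, 0.7]  # made-up measurements
table = recursive_distances(series, lam=0.5)
```

The recursive table agrees with the direct sum for every pair of times, while avoiding the inner loop over lags.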
Voigt, Kerstin; Olsson, L
2008-09-01
A multi-gene genealogy based on maximum parsimony and distance analyses of the exonic genes for actin (act) and translation elongation factor 1 alpha (tef), the nuclear genes for the small (18S) and large (28S) subunit ribosomal RNA (comprising 807, 1092, 1863, 389 characters, respectively) of all 50 genera of the Mucorales (Zygomycetes) suggests that the Choanephoraceae is a monophyletic group. The monotypic Gilbertellaceae appears in close phylogenetic relatedness to the Choanephoraceae. The monophyly of the Choanephoraceae has moderate to strong support (bootstrap proportions 67% and 96% in distance and maximum parsimony analyses, respectively), whereas the monophyly of the Choanephoraceae-Gilbertellaceae clade is supported by high bootstrap values (100% and 98%). This suggests that the two families can be joined into one family, which leads to the elimination of the Gilbertellaceae as a separate family. In order to test this hypothesis single-locus neighbor-joining analyses were performed on nuclear genes of the 18S, 5.8S, 28S and internal transcribed spacer (ITS) 1 ribosomal RNA and the translation elongation factor 1 alpha (tef) and beta tubulin (betatub) nucleotide sequences. The common monophyletic origin of the Choanephoraceae-Gilbertellaceae clade could be confirmed in all gene trees and by investigation of their ultrastructure. Sporangia with persistent, sutured walls splitting in half at maturity and ellipsoidal sporangiospores with striated ornamentations and polar ciliate appendages arising from spores in persistent sporangia and dehiscent sporangiola represent synapomorphic characters of this group. We discuss our data in the context of the historical development of their taxonomy and physiology and propose a reduction of the two families to one family, the Choanephoraceae sensu lato comprising species which are facultative plant pathogens and parasites, especially in subtropical to tropical regions.
Dealing with Complex Causality in Realist Synthesis: The Promise of Qualitative Comparative Analysis
ERIC Educational Resources Information Center
Sager, Fritz; Andereggen, Celine
2012-01-01
In this article, the authors state two arguments: first, that the four categories of context, politics, polity, and policy make an adequate framework for systematic review being both exhaustive and parsimonious; second, that the method of qualitative comparative analysis (QCA) is an appropriate methodical approach for gaining realistic results…
A polyphasic taxonomic approach in isolated strains of Cyanobacteria from thermal springs of Greece.
Bravakos, Panos; Kotoulas, Georgios; Skaraki, Katerina; Pantazidou, Adriani; Economou-Amilli, Athena
2016-05-01
Strains of Cyanobacteria isolated from mats of 9 thermal springs of Greece have been studied for their taxonomic evaluation. A polyphasic taxonomic approach was employed which included: morphological observations by light microscopy and scanning electron microscopy, maximum parsimony, maximum likelihood and Bayesian analysis of 16S rDNA sequences, secondary structural comparisons of 16S-23S rRNA Internal Transcribed Spacer sequences, and finally environmental data. The 17 cyanobacterial isolates formed a diverse group that contained filamentous, coccoid and heterocytous strains. These included representatives of the polyphyletic genera of Synechococcus and Phormidium, and the orders Oscillatoriales, Spirulinales, Chroococcales and Nostocales. After analysis, at least 6 new taxa at the genus level provide new evidence in the taxonomy of Cyanobacteria and highlight the abundant diversity of thermal spring environments with many potential endemic species or ecotypes. Copyright © 2016 Elsevier Inc. All rights reserved.
Things fall apart: biological species form unconnected parsimony networks.
Hart, Michael W; Sunday, Jennifer
2007-10-22
The generality of operational species definitions is limited by problematic definitions of between-species divergence. A recent phylogenetic species concept based on a simple objective measure of statistically significant genetic differentiation uses between-species application of statistical parsimony networks that are typically used for population genetic analysis within species. Here we review recent phylogeographic studies and reanalyse several mtDNA barcoding studies using this method. We found that (i) alignments of DNA sequences typically fall apart into a separate subnetwork for each Linnean species (but with a higher rate of true positives for mtDNA data) and (ii) DNA sequences from single species typically stick together in a single haplotype network. Departures from these patterns are usually consistent with hybridization or cryptic species diversity.
Pareto-optimal phylogenetic tree reconciliation
Libeskind-Hadas, Ran; Wu, Yi-Chieh; Bansal, Mukul S.; Kellis, Manolis
2014-01-01
Motivation: Phylogenetic tree reconciliation is a widely used method for reconstructing the evolutionary histories of gene families and species, hosts and parasites and other dependent pairs of entities. Reconciliation is typically performed using maximum parsimony, in which each evolutionary event type is assigned a cost and the objective is to find a reconciliation of minimum total cost. It is generally understood that reconciliations are sensitive to event costs, but little is understood about the relationship between event costs and solutions. Moreover, choosing appropriate event costs is a notoriously difficult problem. Results: We address this problem by giving an efficient algorithm for computing Pareto-optimal sets of reconciliations, thus providing the first systematic method for understanding the relationship between event costs and reconciliations. This, in turn, results in new techniques for computing event support values and, for cophylogenetic analyses, performing robust statistical tests. We provide new software tools and demonstrate their use on a number of datasets from evolutionary genomic and cophylogenetic studies. Availability and implementation: Our Python tools are freely available at www.cs.hmc.edu/~hadas/xscape. Contact: mukul@engr.uconn.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:24932009
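To make the notion of Pareto-optimality over event counts concrete, here is a minimal Python sketch. It is not the algorithm from the paper or from the xscape tools: it simply filters an explicit, made-up list of candidate reconciliations, each summarized by a hypothetical tuple of event counts (duplications, transfers, losses), and keeps those that no other candidate beats on every event type. The function names and the example counts are assumptions for illustration.

```python
from typing import List, Sequence, Tuple

Counts = Tuple[int, int, int]  # assumed order: (duplications, transfers, losses)


def dominates(a: Counts, b: Counts) -> bool:
    """True if a uses no more of any event type than b and differs from b."""
    return all(x <= y for x, y in zip(a, b)) and a != b


def pareto_front(candidates: Sequence[Counts]) -> List[Counts]:
    """Keep only the count vectors that no other candidate dominates."""
    unique = sorted(set(candidates))
    return [c for c in unique if not any(dominates(o, c) for o in unique)]


def total_cost(counts: Counts, costs: Tuple[float, float, float]) -> float:
    """Parsimony score: sum of event counts weighted by per-event costs."""
    return sum(n * w for n, w in zip(counts, costs))


# Made-up candidates for illustration only.
candidates = [(2, 0, 5), (1, 1, 3), (0, 3, 1), (2, 1, 5), (1, 2, 3)]
front = pareto_front(candidates)
best = min(candidates, key=lambda c: total_cost(c, (2.0, 3.0, 1.0)))
```

For any choice of positive event costs, a minimum-cost candidate lies on the front, which is why the front summarizes how solutions depend on the costs.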
Functional characteristics of the calcium modulated proteins seen from an evolutionary perspective
NASA Technical Reports Server (NTRS)
Kretsinger, R. H.; Nakayama, S.; Moncrief, N. D.
1991-01-01
We have constructed dendrograms relating 173 EF-hand proteins of known amino acid sequence. We aligned all of these proteins by their EF-hand domains, omitting interdomain regions. Initial dendrograms were computed by minimum mutation distance methods. Using these as starting points, we determined the best dendrogram by the method of maximum parsimony, scored by minimum mutation distance. We identified 14 distinct subfamilies as well as 6 unique proteins that are perhaps the sole representatives of other subfamilies. This information is given in tabular form. Within subfamilies one can easily align interdomain regions. The resulting dendrograms are very similar to those computed using domains only. Dendrograms constructed using pairs of domains show general congruence. However, there are enough exceptions to caution against an overly simple scheme in which one pair of gene duplications leads from a one-domain precursor to a four-domain prototype from which all other forms evolved. The ability to bind calcium was lost and acquired several times during evolution. The distribution of introns does not conform to the dendrogram based on amino acid sequences. The rates of evolution appear to be much slower within subfamilies, especially within calmodulin, than those prior to the definition of subfamily.
Stasis and convergence characterize morphological evolution in eupolypod II ferns.
Sundue, Michael A; Rothfels, Carl J
2014-01-01
Patterns of morphological evolution at levels above family rank remain underexplored in the ferns. The present study seeks to address this gap through analysis of 79 morphological characters for 81 taxa, including representatives of all ten families of eupolypod II ferns. Recent molecular phylogenetic studies demonstrate that the evolution of the large eupolypod II clade (which includes nearly one-third of extant fern species) features unexpected patterns. The traditional 'athyrioid' ferns are scattered across the phylogeny despite their apparent morphological cohesiveness, and mixed among these seemingly conservative taxa are morphologically dissimilar groups that lack any obvious features uniting them with their relatives. Maximum-likelihood and maximum-parsimony character optimizations are used to determine characters that unite the seemingly disparate groups, and to test whether the polyphyly of the traditional athyrioid ferns is due to evolutionary stasis (symplesiomorphy) or convergent evolution. The major events in eupolypod II character evolution are reviewed, and character and character state concepts are reappraised, as a basis for further inquiries into fern morphology. Characters were scored from the literature, live plants and herbarium specimens, and optimized using maximum-parsimony and maximum-likelihood, onto a highly supported topology derived from maximum-likelihood and Bayesian analysis of molecular data. Phylogenetic signal of characters was tested for using randomization methods and fitdiscrete. The majority of character state changes within the eupolypod II phylogeny occur at the family level or above. Relative branch lengths for the morphological data resemble those from molecular data and fit an ancient rapid radiation model (long branches subtended by very short backbone internodes), with few characters uniting the morphologically disparate clades. The traditional athyrioid ferns were circumscribed based upon a combination of symplesiomorphic and homoplastic characters. Petiole vasculature consisting of two bundles is ancestral for eupolypods II and a synapomorphy for eupolypods II under deltran optimization. Sori restricted to one side of the vein define the recently recognized clade comprising Rhachidosoraceae through Aspleniaceae, and sori present on both sides of the vein is a synapomorphy for the Athyriaceae sensu stricto. The results indicate that a chromosome base number of x = 41 is synapomorphic for all eupolypods, a clade that includes over two-thirds of extant fern species. The integrated approach synthesizes morphological studies with current phylogenetic hypotheses and provides explicit statements of character evolution in the eupolypod II fern families. Strong character support is found for previously recognized clades, whereas few characters support previously unrecognized clades. Sorus position appears to be less complicated than previously hypothesized, and linear sori restricted to one side of the vein support the clade comprising Aspleniaceae, Diplaziopsidaceae, Hemidictyaceae and Rhachidosoraceae - a lineage only recently identified. Despite x = 41 being a frequent number among extant species, to our knowledge it has not previously been demonstrated as the ancestral state. This is the first synapomorphy proposed for the eupolypod clade, a lineage comprising 67% of extant fern species. This study provides some of the first hypotheses of character evolution at the family level and above in light of recent phylogenetic results, and promotes further study in an area that remains open for original observation.
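Maximum-parsimony optimization of a discrete character on a fixed topology, as used above, can be illustrated with the bottom-up pass of Fitch's algorithm for a single unordered character. The Python sketch below is a generic textbook illustration, not the study's workflow: the nested-tuple tree encoding, the tip names and the character states are all hypothetical, and it only counts the minimum number of state changes (it does not resolve ambiguous nodes as acctran or deltran would).

```python
def fitch(tree, tip_states):
    """Bottom-up Fitch pass on a rooted binary tree.

    tree: a tip name (str) or a pair (left_subtree, right_subtree).
    tip_states: dict mapping tip name -> observed state.
    Returns (candidate state set at this node, minimum number of changes below it).
    """
    if isinstance(tree, str):
        return {tip_states[tree]}, 0
    left, right = tree
    left_set, left_changes = fitch(left, tip_states)
    right_set, right_changes = fitch(right, tip_states)
    shared = left_set & right_set
    if shared:
        # Children agree on at least one state: no extra change is needed here.
        return shared, left_changes + right_changes
    # Children disagree: keep the union and count one additional change.
    return left_set | right_set, left_changes + right_changes + 1


# Hypothetical four-taxon tree and two made-up characters.
tree = (("A", "B"), ("C", "D"))
clean = {"A": "0", "B": "0", "C": "1", "D": "1"}       # fits the tree with one change
homoplastic = {"A": "0", "B": "1", "C": "0", "D": "1"}  # needs two changes
root_clean, steps_clean = fitch(tree, clean)
root_homo, steps_homo = fitch(tree, homoplastic)
```

A character that needs more changes than its number of states minus one on the given tree, like the second one here, is homoplastic on that topology.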
Flores, B S; Siddall, M E; Burreson, E M
1996-08-01
The phylogenetic position of the phylum Haplosporidia was investigated with the complete small subunit rRNA gene sequences from 5 species in the phylum: Haplosporidium nelsoni and Haplosporidium costale, parasites of the eastern oyster Crassostrea virginica; Haplosporidium louisiana, a parasite of the mudcrab Panopeus herbstii; Minchinia teredinis, a parasite of shipworms (Teredo spp.) and Urosporidium crescens, a hyperparasite found in metacercariae of the trematode Megalophallus sp. in the blue crab, Callinectes sapidus. Multiple alignments of small subunit rRNA gene sequences included the 5 haplosporidian taxa and 14 taxa in the alveolate phyla Ciliophora, Dinoflagellida, and Apicomplexa. Maximum parsimony analysis placed the phylum Haplosporidia as a monophyletic group within the alveolate clade, as a taxon of equal rank with the other 3 alveolate phyla, and as a sister taxon to the clade composed of the phyla Dinoflagellida and Apicomplexa. Transversionally weighted parsimony placed the haplosporidians as a sister taxon to the ciliates. A separate analysis focused on the relationships of species in the genus Haplosporidium. Analyses were conducted with the haplosporidians as a functional ingroup, using each of the alveolate phyla individually as functional outgroups. The results indicated that species in the genus Haplosporidium do not form a monophyletic assemblage. As such, the present morphological criteria for distinguishing the genera Haplosporidium and Minchinia are insufficient.
Hardman, Michael; Hardman, Lotta M
2008-02-01
We applied Bayesian phylogenetics, divergence time estimation, diversification pattern analysis, and parsimony-based methods of ancestral state reconstruction to a combination of nucleotide sequences, maximum body sizes, fossils, and paleoclimate data to explore the influence of an extrinsic (climate change) and an intrinsic (maximum body size) factor on diversification rates in a North American clade of catfishes (Ictaluridae). We found diversification rate to have been significantly variable over time, with significant (or nearly significant) rate increases in the early history of Noturus. Though the latter coincided closely with a period of dramatic climate change at the Eocene-Oligocene boundary, we did not detect evidence for a general association between climate change and diversification rate during the entire history of Ictaluridae. Within Ictaluridae, small body size was found to be a near significant predictor of species richness. Morphological stasis of several species appears to be a consequence of a homoplastic increase in body size. We estimated the maximum standard length of the ictalurid ancestor to be approximately 50 cm, comparable to Eocene ictalurids (Astephus) and similar to modern sizes of Ameiurus and their Asian sister-taxon Cranoglanis. During the late Paleocene and early Eocene, the ictalurid ancestor diversified into the lineages represented by the modern epigean genera. The majority of modern species originated in the Oligocene and Miocene, most likely according to a peripheral isolates model of speciation. We discuss the difficulties of detecting macroevolutionary patterns within a lineage history and encourage the scrutiny of the terminal Eocene climatic event as a direct promoter of diversification.
Making sense: duty hours, work flow, and waste in graduate medical education.
Bush, Roger W; Philibert, Ingrid
2009-12-01
"Parsimony, and not industry, is the immediate cause of the increase of capital. Industry, indeed, provides the subject which parsimony accumulates. But whatever industry might acquire, if parsimony did not save and store up, the capital would never be the greater." (Adam Smith, The Wealth of Nations, book 2, chapter 3) In 2003, the Accreditation Council for Graduate Medical Education implemented resident duty hour limits that included a weekly limit and limits on continuous hours. Recent recommendations for added reductions in resident duty hours have produced concern about concomitant reductions in future graduates' preparedness for independent practice. The current debate about resident hours largely does not consider whether all hours residents spend in the educational and clinical-care environment contribute meaningfully either to residents' learning or to effective patient care. This may distract the community from waste in the current clinical-education model. We propose that use of "lean production" and quality improvement methods may assist teaching institutions in attaining a deeper understanding of work flow and waste. These methods can be used to assign value to patient- and learner-centered activities and outputs and to optimize the competing and synergistic aspects of all desired outcomes to produce the care the Institute of Medicine recommends: safe, effective, efficient, patient-centered, timely, and equitable. Finally, engagement of senior clinical faculty in determining the culture of the care and education system will contribute to an advanced social-learning and care network.
Making Sense: Duty Hours, Work Flow, and Waste in Graduate Medical Education
Bush, Roger W.; Philibert, Ingrid
2009-01-01
"Parsimony, and not industry, is the immediate cause of the increase of capital. Industry, indeed, provides the subject which parsimony accumulates. But whatever industry might acquire, if parsimony did not save and store up, the capital would never be the greater." (Adam Smith, The Wealth of Nations, book 2, chapter 3) In 2003, the Accreditation Council for Graduate Medical Education implemented resident duty hour limits that included a weekly limit and limits on continuous hours. Recent recommendations for added reductions in resident duty hours have produced concern about concomitant reductions in future graduates' preparedness for independent practice. The current debate about resident hours largely does not consider whether all hours residents spend in the educational and clinical-care environment contribute meaningfully either to residents' learning or to effective patient care. This may distract the community from waste in the current clinical-education model. We propose that use of “lean production” and quality improvement methods may assist teaching institutions in attaining a deeper understanding of work flow and waste. These methods can be used to assign value to patient- and learner-centered activities and outputs and to optimize the competing and synergistic aspects of all desired outcomes to produce the care the Institute of Medicine recommends: safe, effective, efficient, patient-centered, timely, and equitable. Finally, engagement of senior clinical faculty in determining the culture of the care and education system will contribute to an advanced social-learning and care network. PMID:21976000
Velazco, Paúl M; Patterson, Bruce D
2013-09-01
The Yellow-shouldered bats, Genus Sturnira, are widespread, diverse, and abundant throughout the Neotropical Region, but little is known of their phylogeny and biogeography. We collected 4409 bp of DNA from three mitochondrial (cyt-b, ND2, D-loop) and two nuclear (RAG1, RAG2) sequences from 138 individuals representing all but two recognized species of Sturnira and five other phyllostomid bats used as outgroups. The sequence data were subjected to maximum parsimony, maximum likelihood, and Bayesian inference analyses. Results overwhelmingly support the monophyly of the genus Sturnira but not continued recognition of Corvira as a subgenus; the two species (bidens and nana) allocated to that group constitute separate, basal branches on the phylogeny. A total of 21 monophyletic putatively species-level groups were recovered; pairs were separated by an average 7.09% (SD=1.61) pairwise genetic distance in cyt-b, and three of these groups are apparently unnamed. Several well-supported clades are evident, including a complex of seven species formerly confused with S. lilium, a species that is actually limited to the Brazilian Shield. We used four calibration points to construct a time-tree for Sturnira, using BEAST. Sturnira diverged from other stenodermatines in the mid-Miocene, and by the end of that epoch (5.3 Ma), three basal lineages were present. Most living species belong to one of two clades, A and B, which appeared and diversified shortly afterwards, during the Pliocene. Both parsimony (DIVA) and likelihood (Lagrange) methods for reconstructing ancestral ranges indicate that the radiation of Sturnira is rooted in the Andes; all three basal lineages (in order, bidens, nana, and aratathomasi) have strictly or mainly Andean distributions. Only later did Sturnira colonize the Pacific lowlands (Chocó) and thence Central America. Sturnira species that are endemic to Central America appeared after the final emergence of the Panamanian landbridge ~3 Ma. Despite its ability to fly and to colonize the Antilles overwater, this genus probably accompanied the "legions" of South American taxa that moved overland during the Great American Biotic Interchange. Its eventual colonization of the Lesser Antilles and the appearance of two endemic lineages there did not take place until the Pleistocene. Because of its continual residence and diversification in South America, Andean assemblages of Sturnira contain both basal and highly derived members of the genus. Copyright © 2013 Elsevier Inc. All rights reserved.
A comparative study of clock rate and drift estimation
NASA Technical Reports Server (NTRS)
Breakiron, Lee A.
1994-01-01
Five different methods of drift determination and four different methods of rate determination were compared using months of hourly phase and frequency data from a sample of cesium clocks and active hydrogen masers. Linear least squares on frequency is selected as the optimal method of determining both drift and rate, more on the basis of parameter parsimony and confidence measures than on random and systematic errors.
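As a rough illustration of what "linear least squares on frequency" can mean, the Python sketch below fits a straight line to frequency readings against time and reads the drift off the slope and the rate off the intercept. This is an assumption-laden sketch, not the report's procedure: the model y(t) = rate + drift * t, the function name, the units and the sample values are all hypothetical, and no weighting or confidence measure is included.

```python
def fit_rate_and_drift(times, freqs):
    """Ordinary least-squares line through (time, frequency) samples.

    Assumed model: freq(t) = rate + drift * t, so the slope is the frequency
    drift and the intercept is the rate (frequency offset) at t = 0.
    """
    n = len(times)
    t_mean = sum(times) / n
    y_mean = sum(freqs) / n
    sxx = sum((t - t_mean) ** 2 for t in times)
    sxy = sum((t - t_mean) * (y - y_mean) for t, y in zip(times, freqs))
    drift = sxy / sxx
    rate = y_mean - drift * t_mean
    return rate, drift


# Made-up hourly fractional-frequency readings following an exact line.
hours = list(range(10))
readings = [2e-13 + 3e-15 * h for h in hours]
rate, drift = fit_rate_and_drift(hours, readings)
```

Only two parameters are estimated, which is one way to read the abstract's emphasis on parameter parsimony.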
Cloning and sequence analysis of chitin synthase gene fragments of Demodex mites.
Zhao, Ya-e; Wang, Zheng-hang; Xu, Yang; Xu, Ji-ru; Liu, Wen-yan; Wei, Meng; Wang, Chu-ying
2012-10-01
To our knowledge, few molecular-level studies of Demodex are available at present. In this study our group, for the first time, cloned, sequenced and analyzed the chitin synthase (CHS) gene fragments of Demodex folliculorum, Demodex brevis, and Demodex canis (three isolates from each species) from Xi'an, China, by designing specific primers based on the only partial sequence of the CHS gene of D. canis from Japan, retrieved from GenBank. Results show that amplification was successful only in three D. canis isolates and one D. brevis isolate out of the nine Demodex isolates. The obtained fragments were sequenced to be 339 bp for D. canis and 338 bp for D. brevis. The CHS gene sequence similarities between the three Xi'an D. canis isolates and one Japanese D. canis isolate ranged from 99.7% to 100.0%, and those between four D. canis isolates and one D. brevis isolate were 99.1%-99.4%. Phylogenetic trees based on maximum parsimony (MP) and maximum likelihood (ML) methods shared the same clusters, in accordance with the traditional classification. Two open reading frames (ORFs) were identified in each CHS gene sequenced, and their corresponding amino acid sequences were located at the catalytic domain. The relatively conserved sequences could be deduced to be a CHS class A gene, which is associated with chitin synthesis in the integument of Demodex mites.
Tartar, Aurélien; Boucias, Drion G; Becnel, James J; Adams, Byron J
2003-11-01
The Helicosporidia are invertebrate pathogens that have recently been identified as non-photosynthetic green algae (Chlorophyta). In order to confirm the algal nature of the genus Helicosporidium, the presence of a retained chloroplast genome in Helicosporidia cells was investigated. Fragments homologous to plastid 16S rRNA (rrn16) genes were amplified successfully from cellular DNA extracted from two different Helicosporidium isolates. The fragment sequences are 1269 and 1266 bp long, are very AT-rich (60.7 %) and are similar to homologous genes sequenced from non-photosynthetic green algae. Maximum-parsimony, maximum-likelihood and neighbour-joining methods were used to infer phylogenetic trees from an rrn16 sequence alignment. All trees depicted the Helicosporidia as sister taxa to the non-photosynthetic, pathogenic alga Prototheca zopfii. Moreover, the trees identified Helicosporidium spp. as members of a clade that included the heterotrophic species Prototheca spp. and the mesotrophic species Chlorella protothecoides. The clade is always strongly supported by bootstrap values, suggesting that all these organisms share a most recent common ancestor. Phylogenetic analyses inferred from plastid 16S rRNA genes confirmed that the Helicosporidia are non-photosynthetic green algae, close relatives of the genus Prototheca (Chlorophyta, Trebouxiophyceae). Such phylogenetic affinities suggest that Helicosporidium spp. are likely to possess Prototheca-like organelles and organelle genomes.
Paleogene Radiation of a Plant Pathogenic Mushroom
Coetzee, Martin P. A.; Bloomer, Paulette; Wingfield, Michael J.; Wingfield, Brenda D.
2011-01-01
Background The global movement and speciation of fungal plant pathogens is important, especially because of the economic losses they cause and the ease with which they are able to spread across large areas. Understanding the biogeography and origin of these plant pathogens can provide insights regarding their dispersal and current day distribution. We tested the hypothesis of a Gondwanan origin of the plant pathogenic mushroom genus Armillaria and the currently accepted premise that vicariance accounts for the extant distribution of the species. Methods The phylogeny of a selection of Armillaria species was reconstructed based on Maximum Parsimony (MP), Maximum Likelihood (ML) and Bayesian Inference (BI). A timeline was then placed on the divergence of lineages using a Bayesian relaxed molecular clock approach. Results Phylogenetic analyses of sequenced data for three combined nuclear regions provided strong support for three major geographically defined clades: Holarctic, South American-Australasian and African. Molecular dating placed the initial radiation of the genus at 54 million years ago within the Early Paleogene, postdating the tectonic break-up of Gondwana. Conclusions The distribution of extant Armillaria species is the result of ancient long-distance dispersal rather than vicariance due to continental drift. As these findings are contrary to most prior vicariance hypotheses for fungi, our results highlight the important role of long-distance dispersal in the radiation of fungal pathogens from the Southern Hemisphere. PMID:22216099
Cloning and sequence analysis of chitin synthase gene fragments of Demodex mites*
Zhao, Ya-e; Wang, Zheng-hang; Xu, Yang; Xu, Ji-ru; Liu, Wen-yan; Wei, Meng; Wang, Chu-ying
2012-01-01
To our knowledge, few reports on Demodex studied at the molecular level are available at present. In this study our group, for the first time, cloned, sequenced and analyzed the chitin synthase (CHS) gene fragments of Demodex folliculorum, Demodex brevis, and Demodex canis (three isolates from each species) from Xi’an China, by designing specific primers based on the only partial sequence of the CHS gene of D. canis from Japan, retrieved from GenBank. Results show that amplification was successful only in three D. canis isolates and one D. brevis isolate out of the nine Demodex isolates. The obtained fragments were 339 bp long for D. canis and 338 bp long for D. brevis. The CHS gene sequence similarities between the three Xi’an D. canis isolates and one Japanese D. canis isolate ranged from 99.7% to 100.0%, and those between four D. canis isolates and one D. brevis isolate were 99.1%–99.4%. Phylogenetic trees based on maximum parsimony (MP) and maximum likelihood (ML) methods shared the same clusters, in accordance with the traditional classification. Two open reading frames (ORFs) were identified in each CHS gene sequenced, and their corresponding amino acid sequences were located at the catalytic domain. The relatively conserved sequences could be deduced to be a CHS class A gene, which is associated with chitin synthesis in the integument of Demodex mites. PMID:23024043
Phylogeography of the California mountain kingsnake, Lampropeltis zonata (Colubridae).
Rodríguez-Robles, J A; Denardo, D F; Staub, R E
1999-11-01
The phylogeography of the California mountain kingsnake, Lampropeltis zonata, was studied using mitochondrial DNA sequences from specimens belonging to the seven recognized subspecies and collected throughout the range of the species. Maximum parsimony and maximum likelihood methods identified a basal split within L. zonata that corresponds to southern and northern segments of its distribution. The southern clade is composed of populations from southern California (USA) and northern Baja California, Mexico. The northern clade is divided into two subclades, a 'coastal' subclade, consisting of populations from the central coast of California and the southern Sierra Nevada Mountains of eastern California, and a 'northeastern' subclade, mainly comprised of populations north of the San Francisco Bay and from the majority of the Sierra Nevada. We suggest that past inland seaways in southwestern California and the embayment of central California constituted barriers to gene flow that resulted in the two deepest divergences within L. zonata. Throughout its evolutionary history, the northern clade apparently has undergone instances of range contraction, isolation, differentiation, and then expansion and secondary contact. Examination of colour pattern variation in 321 living and preserved specimens indicated that the two main colour pattern characters used to define the subspecies of L. zonata are so variable that they cannot be reliably used to differentiate taxonomic units within this complex, which calls into question the recognition of seven geographical races of this snake.
Kuch, Ulrich; Keogh, J Scott; Weigel, John; Smith, Laurie A; Mebs, Dietrich
2005-03-01
King brown snakes or mulga snakes (Pseudechis australis) are the largest and among the most dangerous and wide-ranging venomous snakes in Australia and New Guinea. They occur in diverse habitats, are important predators, and exhibit considerable morphological variation. We infer the relationships and historical biogeography of P. australis based on phylogenetic analysis of 1,249 base pairs from the mitochondrial cytochrome b, NADH dehydrogenase subunit 4 and three adjacent tRNA genes using Bayesian, maximum-likelihood, and maximum-parsimony methods. All methods reveal deep phylogenetic structure with four strongly supported clades comprising snakes from New Guinea (I), localities all over Australia (II), the Kimberleys of Western Australia (III), and north-central Australia (IV), suggesting a much more ancient radiation than previously believed. This conclusion is robust to different molecular clock estimations indicating divergence in Pliocene or Late Miocene, after landbridge dispersal to New Guinea had occurred. While members of clades I, III and IV are medium-sized, slender snakes, those of clade II attain large sizes and a robust build, rendering them top predators in their ecosystems. Genetic differentiation within clade II is low and haplotype distribution largely incongruent with geography or colour morphs, suggesting Pleistocene dispersal and recent ecomorph evolution. Significant haplotype diversity exists in clades III and IV, implying that clade IV comprises two species. Members of clade II are broadly sympatric with members of both northern Australian clades. Thus, our data support the recognition of at least five species from within P. australis (auct.) under various criteria. We discuss biogeographical, ecological and medical implications of our findings.
DNA Barcode for Identifying Folium Artemisiae Argyi from Counterfeits.
Mei, Quanxi; Chen, Xiaolu; Xiang, Li; Liu, Yue; Su, Yanyan; Gao, Yuqiao; Dai, Weibo; Dong, Pengpeng; Chen, Shilin
2016-01-01
Folium Artemisiae Argyi is an important herb in traditional Chinese medicine. It is commonly used in moxibustion, medicine, etc. However, identifying Artemisia argyi is difficult because this herb exhibits similar morphological characteristics to closely related species and counterfeits. To verify the applicability of DNA barcoding, ITS2 and psbA-trnH were used to identify A. argyi from 15 closely related species and counterfeits. Results indicated that total DNA was easily extracted from all the samples and that both ITS2 and psbA-trnH fragments can be easily amplified. ITS2 was a more ideal barcode than psbA-trnH and ITS2+psbA-trnH to identify A. argyi from closely related species and counterfeits on the basis of sequence character, genetic distance, and tree methods. The sequence length was 225 bp for the 56 ITS2 sequences of A. argyi, and no variable site was detected. For the ITS2 sequences, A. capillaris, A. anomala, A. annua, A. igniaria, A. maximowicziana, A. princeps, Dendranthema vestitum, and D. indicum had single nucleotide polymorphisms (SNPs). The intraspecific Kimura 2-Parameter distance was zero, which is lower than the minimum interspecific distance (0.005). A. argyi, the closely related species, and counterfeits, except for Artemisia maximowicziana and Artemisia sieversiana, were separated into pairs of divergent clusters by using the neighbor joining, maximum parsimony, and maximum likelihood tree methods. Thus, the ITS2 sequence was an ideal barcode to identify A. argyi from closely related species and counterfeits to ensure the safe use of this plant.
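The Kimura 2-Parameter distance reported above can be sketched as follows, using the standard formula d = -1/2 ln(1 - 2P - Q) - 1/4 ln(1 - 2Q), where P and Q are the proportions of transitional and transversional differences. The function name and the choice to skip gapped or ambiguous sites are assumptions made for illustration; the abstract does not describe the software or settings the authors used.

```python
import math

PURINES = {"A", "G"}


def k2p_distance(seq_a, seq_b):
    """Kimura 2-Parameter distance between two aligned DNA sequences.

    Sites where either sequence has a character other than A, C, G or T
    (gaps, ambiguity codes) are skipped; this is an illustrative choice.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    transitions = transversions = compared = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue
        compared += 1
        if a == b:
            continue
        # Purine<->purine or pyrimidine<->pyrimidine is a transition.
        if (a in PURINES) == (b in PURINES):
            transitions += 1
        else:
            transversions += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    # math.log raises ValueError if the sequences are too divergent
    # for the correction to be defined.
    return -0.5 * math.log(1 - 2 * p - q) - 0.25 * math.log(1 - 2 * q)
```

Identical sequences give a distance of zero under this formula, which matches the intraspecific value reported for the A. argyi ITS2 sequences.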
Cau, Andrea
2017-01-01
Bayesian phylogenetic methods integrating simultaneously morphological and stratigraphic information have been applied increasingly among paleontologists. Most of these studies have used Bayesian methods as an alternative to the widely-used parsimony analysis, to infer macroevolutionary patterns and relationships among species-level or higher taxa. Among recently introduced Bayesian methodologies, the Fossilized Birth-Death (FBD) model allows incorporation of hypotheses on ancestor-descendant relationships in phylogenetic analyses including fossil taxa. Here, the FBD model is used to infer the relationships among an ingroup formed exclusively by fossil individuals, i.e., dipnoan tooth plates from four localities in the Ain el Guettar Formation of Tunisia. Previous analyses of this sample compared the results of phylogenetic analysis using parsimony with stratigraphic methods, inferred a high diversity (five or more genera) in the Ain el Guettar Formation, and interpreted it as an artifact inflated by depositional factors. In the analysis performed here, the uncertainty on the chronostratigraphic relationships among the specimens was included among the prior settings. The results of the analysis confirm the referral of most of the specimens to the taxa Asiatoceratodus, Equinoxiodus, Lavocatodus and Neoceratodus, but reject those to Ceratodus and Ferganoceratodus. The resulting phylogeny constrained the evolution of the Tunisian sample exclusively in the Early Cretaceous, contrasting with the previous scenario inferred by the stratigraphically-calibrated topology resulting from parsimony analysis. The phylogenetic framework also suggests that (1) the sampled localities are laterally equivalent, but (2) three localities are restricted to the youngest part of the section; both results are in agreement with previous stratigraphic analyses of these localities.
The FBD model of specimen-level units provides a novel tool for phylogenetic inference among fossils but also for independent tests of stratigraphic scenarios.
Complete Mitochondrial Genome of Echinostoma hortense (Digenea: Echinostomatidae).
Liu, Ze-Xuan; Zhang, Yan; Liu, Yu-Ting; Chang, Qiao-Cheng; Su, Xin; Fu, Xue; Yue, Dong-Mei; Gao, Yuan; Wang, Chun-Ren
2016-04-01
Echinostoma hortense (Digenea: Echinostomatidae) is one of the intestinal flukes with medical importance in humans. However, the mitochondrial (mt) genome of this fluke has not yet been determined. The present study has determined the complete mt genome sequences of E. hortense and assessed the phylogenetic relationships with other digenean species for which the complete mt genome sequences are available in GenBank using concatenated amino acid sequences inferred from 12 protein-coding genes. The mt genome of E. hortense contained 12 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA genes, and 1 non-coding region. The length of the mt genome of E. hortense was 14,994 bp, which was somewhat smaller than those of other trematode species. Phylogenetic analyses based on concatenated nucleotide sequence datasets for all 12 protein-coding genes using the maximum parsimony (MP) method showed that E. hortense and Hypoderaeum conoideum grouped together, and they were closer to each other than to Fasciolidae and other echinostomatid trematodes. The availability of the complete mt genome sequences of E. hortense provides important genetic markers for diagnostics, population genetics, and evolutionary studies of digeneans.
Complete Mitochondrial Genome of Echinostoma hortense (Digenea: Echinostomatidae)
Liu, Ze-Xuan; Zhang, Yan; Liu, Yu-Ting; Chang, Qiao-Cheng; Su, Xin; Fu, Xue; Yue, Dong-Mei; Gao, Yuan; Wang, Chun-Ren
2016-01-01
Echinostoma hortense (Digenea: Echinostomatidae) is one of the intestinal flukes with medical importance in humans. However, the mitochondrial (mt) genome of this fluke has not yet been determined. The present study has determined the complete mt genome sequences of E. hortense and assessed the phylogenetic relationships with other digenean species for which the complete mt genome sequences are available in GenBank using concatenated amino acid sequences inferred from 12 protein-coding genes. The mt genome of E. hortense contained 12 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA genes, and 1 non-coding region. The length of the mt genome of E. hortense was 14,994 bp, which was somewhat smaller than those of other trematode species. Phylogenetic analyses based on concatenated nucleotide sequence datasets for all 12 protein-coding genes using the maximum parsimony (MP) method showed that E. hortense and Hypoderaeum conoideum grouped together, and they were closer to each other than to Fasciolidae and other echinostomatid trematodes. The availability of the complete mt genome sequences of E. hortense provides important genetic markers for diagnostics, population genetics, and evolutionary studies of digeneans. PMID:27180575
Kusumi, J; Tsumura, Y; Yoshimaru, H; Tachida, H
2000-10-01
Nucleotide sequences from four chloroplast genes, the matK, chlL, intergenic spacer (IGS) region between trnL and trnF, and an intron of trnL, were determined from all species of Taxodiaceae and five species of Cupressaceae sensu stricto (s.s.). Phylogenetic trees were constructed using the maximum parsimony and the neighbor-joining methods with Cunninghamia as an outgroup. These analyses provided greater resolution of relationships among genera and higher bootstrap supports for clades compared to previous analyses. Results indicate that Taiwania diverged first, and then Athrotaxis diverged from the remaining genera. Metasequoia, Sequoia, and Sequoiadendron form a clade. Taxodium and Glyptostrobus form a clade, which is the sister to Cryptomeria. Cupressaceae s.s. are derived from within Taxodiaceae, being the most closely related to the Cryptomeria/Taxodium/Glyptostrobus clade. These relationships are consistent with previous morphological groupings and the analyses of molecular data. In addition, we found acceleration of evolutionary rates in Cupressaceae s.s. Possible causes for the acceleration are discussed.
The first iguanian lizard from the Mesozoic of Africa
NASA Astrophysics Data System (ADS)
Apesteguía, Sebastián; Daza, Juan D.; Simões, Tiago R.; Rage, Jean Claude
2016-09-01
The fossil record shows that iguanian lizards were widely distributed during the Late Cretaceous. However, the biogeographic history and early evolution of one of its most diverse and peculiar clades (acrodontans) remain poorly known. Here, we present the first Mesozoic acrodontan from Africa, which also represents the oldest iguanian lizard from that continent. The new taxon comes from the Kem Kem Beds in Morocco (Cenomanian, Late Cretaceous) and is based on a partial lower jaw. The new taxon presents a number of features that are found only among acrodontan lizards and shares greatest similarities with uromastycines, specifically. In a combined evidence phylogenetic dataset comprehensive of all major acrodontan lineages using multiple tree inference methods (traditional and implied weighting maximum-parsimony, and Bayesian inference), we found support for the placement of the new species within uromastycines, along with Gueragama sulamericana (Late Cretaceous of Brazil). The new fossil supports the previously hypothesized widespread geographical distribution of acrodontans in Gondwana during the Mesozoic. Additionally, it provides the first fossil evidence of uromastycines in the Cretaceous, and the ancestry of acrodontan iguanians in Africa.
A New Heterogeneous Multidimensional Unfolding Procedure
ERIC Educational Resources Information Center
Park, Joonwook; Rajagopal, Priyali; DeSarbo, Wayne S.
2012-01-01
A variety of joint space multidimensional scaling (MDS) methods have been utilized for the spatial analysis of two- or three-way dominance data involving subjects' preferences, choices, considerations, intentions, etc. so as to provide a parsimonious spatial depiction of the underlying relevant dimensions, attributes, stimuli, and/or subjects'…
Relaxations to Sparse Optimization Problems and Applications
NASA Astrophysics Data System (ADS)
Skau, Erik West
Parsimony is a fundamental property that is applied to many characteristics in a variety of fields. Of particular interest are optimization problems that apply rank, dimensionality, or support in a parsimonious manner. In this thesis we study some optimization problems and their relaxations, and focus on properties and qualities of the solutions of these problems. The Gramian tensor decomposition problem attempts to decompose a symmetric tensor as a sum of rank one tensors. We approach the Gramian tensor decomposition problem with a relaxation to a semidefinite program. We study conditions which ensure that the solution of the relaxed semidefinite problem gives the minimal Gramian rank decomposition. Sparse representations with learned dictionaries are one of the leading image modeling techniques for image restoration. When learning these dictionaries from a set of training images, the sparsity parameter of the dictionary learning algorithm strongly influences the content of the dictionary atoms. We describe geometrically the content of trained dictionaries and how it changes with the sparsity parameter. We use statistical analysis to characterize how the different content is used in sparse representations. Finally, a method to control the structure of the dictionaries is demonstrated, allowing us to learn a dictionary which can later be tailored for specific applications. Variations of dictionary learning can be broadly applied to a variety of applications. We explore a pansharpening problem with a triple factorization variant of coupled dictionary learning. Another application of dictionary learning is computer vision. Computer vision relies heavily on object detection, which we explore with a hierarchical convolutional dictionary learning model. Data fusion of disparate modalities is a growing topic of interest. We do a case study to demonstrate the benefit of using social media data with satellite imagery to estimate hazard extents.
In this case study analysis we apply a maximum entropy model, guided by the social media data, to estimate the flooded regions during a 2013 flood in Boulder, CO and show that the results are comparable to those obtained using expert information.
Phylogenetic relationships among arecoid palms (Arecaceae: Arecoideae)
Baker, William J.; Norup, Maria V.; Clarkson, James J.; Couvreur, Thomas L. P.; Dowe, John L.; Lewis, Carl E.; Pintaud, Jean-Christophe; Savolainen, Vincent; Wilmot, Tomas; Chase, Mark W.
2011-01-01
Background and Aims The Arecoideae is the largest and most diverse of the five subfamilies of palms (Arecaceae/Palmae), containing >50 % of the species in the family. Despite its importance, phylogenetic relationships among Arecoideae are poorly understood. Here the most densely sampled phylogenetic analysis of Arecoideae available to date is presented. The results are used to test the current classification of the subfamily and to identify priority areas for future research. Methods DNA sequence data for the low-copy nuclear genes PRK and RPB2 were collected from 190 palm species, covering 103 (96 %) genera of Arecoideae. The data were analysed using the parsimony ratchet, maximum likelihood, and both likelihood and parsimony bootstrapping. Key Results and Conclusions Despite the recovery of paralogues and pseudogenes in a small number of taxa, PRK and RPB2 were both highly informative, producing well-resolved phylogenetic trees with many nodes well supported by bootstrap analyses. Simultaneous analyses of the combined data sets provided additional resolution and support. Two areas of incongruence between PRK and RPB2 were strongly supported by the bootstrap relating to the placement of tribes Chamaedoreeae, Iriarteeae and Reinhardtieae; the causes of this incongruence remain uncertain. The current classification within Arecoideae was strongly supported by the present data. Of the 14 tribes and 14 sub-tribes in the classification, only five sub-tribes from tribe Areceae (Basseliniinae, Linospadicinae, Oncospermatinae, Rhopalostylidinae and Verschaffeltiinae) failed to receive support. Three major higher level clades were strongly supported: (1) the RRC clade (Roystoneeae, Reinhardtieae and Cocoseae), (2) the POS clade (Podococceae, Oranieae and Sclerospermeae) and (3) the core arecoid clade (Areceae, Euterpeae, Geonomateae, Leopoldinieae, Manicarieae and Pelagodoxeae). 
However, new data sources are required to elucidate ambiguities that remain in phylogenetic relationships among and within the major groups of Arecoideae, as well as within the Areceae, the largest tribe in the palm family. PMID:21325340
Redefining the WISC-R: Implications for Professional Practice and Public Policy.
ERIC Educational Resources Information Center
Macmann, Gregg M.; Barnett, David W.
1992-01-01
The factor structure of the Wechsler Intelligence Scale for Children (Revised) was examined in the standardization sample using new methods of factor analysis. The substantial overlap across factors was most parsimoniously represented by a single general factor. Implications for public policy regarding the purposes and outcomes of special…
Molecular phylogeny of the Achatinoidea (Mollusca: Gastropoda).
Fontanilla, Ian Kendrich; Naggs, Fred; Wade, Christopher Mark
2017-09-01
This study presents a multi-gene phylogenetic analysis of the Achatinoidea and provides an initial basis for a taxonomic re-evaluation of family level groups within the superfamily. A total of 5028 nucleotides from the nuclear rRNA, actin and histone 3 genes and the 1st and 2nd codon positions of the mitochondrial cytochrome c oxidase subunit I gene were sequenced from 24 species, representing six currently recognised families. Results from maximum likelihood, neighbour joining, maximum parsimony and Bayesian inference trees revealed that, of currently recognised families, only the Achatinidae are monophyletic. For the Ferussaciidae, Ferussacia folliculus fell separately to Cecilioides gokweanus and formed a sister taxon to the rest of the Achatinoidea. For the Coeliaxidae, Coeliaxis blandii and Pyrgina umbilicata did not group together. The Subulinidae was not resolved, with some subulinids clustering with the Coeliaxidae and Thyrophorellidae. Three subfamilies currently included within the Subulinidae based on current taxonomy likewise did not form monophyletic groups. Copyright © 2017 Elsevier Inc. All rights reserved.
Swart, Belinda L; von der Heyden, Sophie; Bester-van der Merwe, Aletta; Roodt-Wilding, Rouvay
2015-12-01
The genus Seriola includes several important commercially exploited species and has a disjunct distribution globally; yet phylogenetic relationships within this genus have not been thoroughly investigated. This study reports the first comprehensive molecular phylogeny for this genus based on mitochondrial (Cytb) and nuclear gene (RAG1 and Rhod) DNA sequence data for all extant Seriola species (nine species, n=27). All species were found to be monophyletic based on Maximum parsimony, Maximum likelihood and Bayesian inference. The closure of the Tethys Sea (12-20 MYA) coincides with the divergence of a clade containing ((S. fasciata and S. peruana), S. carpenteri) from the rest of the Seriola species, while the formation of the Isthmus of Panama (±3 MYA) played an important role in the divergence of S. fasciata and S. peruana. Furthermore, factors such as climate and water temperature fluctuations during the Pliocene played important roles during the divergence of the remaining Seriola species. Copyright © 2015 Elsevier Inc. All rights reserved.
Phylogenetic study on Shiraia bambusicola by rDNA sequence analyses.
Cheng, Tian-Fan; Jia, Xiao-Ming; Ma, Xiao-Hang; Lin, Hai-Ping; Zhao, Yu-Hua
2004-01-01
In this study, 18S rDNA and ITS-5.8S rDNA regions of four Shiraia bambusicola isolates collected from different species of bamboos were amplified by PCR with universal primer pairs NS1/NS8 and ITS5/ITS4, respectively, and sequenced. Phylogenetic analyses were conducted on three selected datasets of rDNA sequences. Maximum parsimony, distance and maximum likelihood criteria were used to infer trees. Morphological characteristics were also observed. The positioning of Shiraia in the order Pleosporales was well supported by bootstrap, which agreed with the placement by Amano (1980) according to their morphology. We did not find significant inter-hostal differences among these four isolates from different species of bamboos. From the results of analyses and comparison of their rDNA sequences, we conclude that Shiraia should be classified into Pleosporales as Amano (1980) proposed and suggest that it might be positioned in the family Phaeosphaeriaceae. Copyright 2004 WILEY-VCH Verlag GmbH & Co.
Canedo, Clarissa; Haddad, Célio F B
2012-11-01
We present a phylogenetic hypothesis of the anuran clade Terrarana based on partial sequences of nuclear (Tyr and RAG1) and mitochondrial (12S, tRNA-Val, and 16S) genes, testing the monophyly of Ischnocnema and its species series. We performed maximum parsimony, maximum likelihood, and Bayesian inference analyses on 364 terminals: 11 outgroup terminals and 353 ingroup Terrarana terminals, including 139 Ischnocnema terminals (accounting for 29 of the 35 named Ischnocnema species) and 214 other Terrarana terminals within the families Brachycephalidae, Ceuthomantidae, Craugastoridae, and Eleutherodactylidae. Different optimality criteria produced similar results and mostly recovered the currently accepted families and genera. According to these topologies, Ischnocnema is not a monophyletic group. We propose new combinations for three species, relocating them to Pristimantis, and render Eleutherodactylus bilineatus Bokermann, 1975 incertae sedis status within Holoadeninae. The rearrangements in Ischnocnema place it outside the northernmost Brazilian Atlantic rainforest, where the fauna of Terrarana comprises typical Amazonian genera. Copyright © 2012 Elsevier Inc. All rights reserved.
2013-01-01
Background Phylogeny estimation from aligned haplotype sequences has attracted more and more attention in recent years due to its importance in analysis of many fine-scale genetic data. Its application fields range from medical research, to drug discovery, to epidemiology, to population dynamics. The literature on molecular phylogenetics proposes a number of criteria for selecting a phylogeny from among plausible alternatives. Usually, such criteria can be expressed by means of objective functions, and the phylogenies that optimize them are referred to as optimal. One of the most important estimation criteria is the parsimony criterion, which states that the optimal phylogeny T∗ for a set H of n haplotype sequences over a common set of variable loci is the one that satisfies the following requirements: (i) it has the shortest length and (ii) it is such that, for each pair of distinct haplotypes hi, hj ∈ H, the sum of the edge weights belonging to the path from hi to hj in T∗ is not smaller than the observed number of changes between hi and hj. Finding the most parsimonious phylogeny for H involves solving an optimization problem, called the Most Parsimonious Phylogeny Estimation Problem (MPPEP), which is NP-hard in many of its versions. Results In this article we investigate a recent version of the MPPEP that arises when input data consist of single nucleotide polymorphism haplotypes extracted from a population of individuals on a common genomic region. Specifically, we explore the prospects for improving on the implicit enumeration strategy used in previous work using a novel problem formulation and a series of strengthening valid inequalities and preliminary symmetry breaking constraints to more precisely bound the solution space and accelerate implicit enumeration of possible optimal phylogenies. We present the basic formulation and then introduce a series of provable valid constraints to reduce the solution space.
We then prove that these constraints can often lead to significant reductions in the gap between the optimal solution and its non-integral linear programming bound relative to the prior art as well as often substantially faster processing of moderately hard problem instances. Conclusion We provide an indication of the conditions under which such an optimal enumeration approach is likely to be feasible, suggesting that these strategies are usable for relatively large numbers of taxa, although with stricter limits on numbers of variable sites. The work thus provides methodology suitable for provably optimal solution of some harder instances that resist all prior approaches. PMID:23343437
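Requirements (i) and (ii) of the parsimony criterion stated in this abstract can be made concrete with a small check. The following is only a minimal sketch, not the authors' integer programming formulation: the tree is assumed to be given as a nested dictionary of edge weights, and the function names, the node labels (`h1`, `h2`, `h3`, `s`) and the three-site haplotypes are all hypothetical illustrations.

```python
from itertools import combinations


def hamming(a, b):
    """Observed number of changes between two equal-length haplotypes."""
    if len(a) != len(b):
        raise ValueError("haplotypes must have the same number of loci")
    return sum(x != y for x, y in zip(a, b))


def path_length(tree, src, dst):
    """Sum of edge weights on the unique src-dst path of an undirected tree."""
    stack = [(src, None, 0)]
    while stack:
        node, parent, dist = stack.pop()
        if node == dst:
            return dist
        for nxt, weight in tree[node].items():
            if nxt != parent:  # never walk back along the edge we came from
                stack.append((nxt, node, dist + weight))
    raise ValueError("no path between the two nodes")


def tree_length(tree):
    """Total edge weight, i.e. the quantity minimised in requirement (i)."""
    # Each undirected edge is stored twice, so count it only when u < v.
    return sum(w for u in tree for v, w in tree[u].items() if u < v)


def satisfies_requirement_ii(tree, haplotypes):
    """True if every pairwise path is at least as long as the Hamming distance."""
    return all(
        path_length(tree, a, b) >= hamming(haplotypes[a], haplotypes[b])
        for a, b in combinations(haplotypes, 2)
    )


# Hypothetical example: three observed haplotypes joined through one
# unobserved internal node "s" (which would carry the sequence 001).
haplotypes = {"h1": "000", "h2": "011", "h3": "101"}
tree = {
    "h1": {"s": 1},
    "h2": {"s": 1},
    "h3": {"s": 1},
    "s": {"h1": 1, "h2": 1, "h3": 1},
}
```

In this toy instance every pair of haplotypes differs at two sites and every pairwise path has length two, so requirement (ii) holds with equality; shrinking any edge weight would violate it.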
GASP: Gapped Ancestral Sequence Prediction for proteins
Edwards, Richard J; Shields, Denis C
2004-01-01
Background The prediction of ancestral protein sequences from multiple sequence alignments is useful for many bioinformatics analyses. Predicting ancestral sequences is not a simple procedure and relies on accurate alignments and phylogenies. Several algorithms exist based on Maximum Parsimony or Maximum Likelihood methods but many current implementations are unable to process residues with gaps, which may represent insertion/deletion (indel) events or sequence fragments. Results Here we present a new algorithm, GASP (Gapped Ancestral Sequence Prediction), for predicting ancestral sequences from phylogenetic trees and the corresponding multiple sequence alignments. Alignments may be of any size and contain gaps. GASP first assigns the positions of gaps in the phylogeny before using a likelihood-based approach centred on amino acid substitution matrices to assign ancestral amino acids. Important outgroup information is used by first working down from the tips of the tree to the root, using descendant data only to assign probabilities, and then working back up from the root to the tips using descendant and outgroup data to make predictions. GASP was tested on a number of simulated datasets based on real phylogenies. Prediction accuracy for ungapped data was similar to three alternative algorithms tested, with GASP performing better in some cases and worse in others. Adding simple insertions and deletions to the simulated data did not have a detrimental effect on GASP accuracy. Conclusions GASP (Gapped Ancestral Sequence Prediction) will predict ancestral sequences from multiple protein alignments of any size. Although not as accurate in all cases as some of the more sophisticated maximum likelihood approaches, it can process a wide range of input phylogenies and will predict ancestral sequences for gapped and ungapped residues alike. PMID:15350199
Chloroplast heterogeneity and historical admixture within the genus Malus.
Volk, Gayle M; Henk, Adam D; Baldo, Angela; Fazio, Gennaro; Chao, C Thomas; Richards, Christopher M
2015-07-01
• The genus Malus represents a unique and complex evolutionary context in which to study domestication. Several Malus species have provided novel alleles and traits to the cultivars. The extent of admixture among wild Malus species has not been well described, due in part to limited sampling of individuals within a taxon. • Four chloroplast regions (1681 bp total) were sequenced and aligned for 412 Malus individuals from 30 species. Phylogenetic relationships were reconstructed using maximum parsimony. The distribution of chloroplast haplotypes among species was examined using statistical parsimony, phylogenetic trees, and a median-joining network. • Chloroplast haplotypes are shared among species within Malus. Three major haplotype-sharing networks were identified. One includes species native to China, Western North America, as well as Malus domestica Borkh, and its four primary progenitor species: M. sieversii (Ledeb.) M. Roem., M. orientalis Uglitzk., M. sylvestris (L.) Mill., and M. prunifolia (Willd.) Borkh; another includes five Chinese Malus species, and a third includes the three Malus species native to Eastern North America. • Chloroplast haplotypes found in M. domestica belong to a single, highly admixed network. Haplotypes shared between the domesticated apple and its progenitors may reflect historical introgression or the retention of ancestral polymorphisms. Multiple individuals should be sampled within Malus species to reveal haplotype heterogeneity, if complex maternal contributions to named species are to be recognized. © 2015 Botanical Society of America, Inc.
Riethmüller, A; Voglmayr, H; Göker, M; Weiß, M; Oberwinkler, F
2002-01-01
In order to investigate phylogenetic relationships of the Peronosporomycetes (Oomycetes), nuclear large subunit ribosomal DNA sequences containing the D1 and D2 regions were analyzed for 92 species belonging to the orders Peronosporales, Pythiales, Leptomitales, Rhipidiales, Saprolegniales and Sclerosporales. The data were analyzed using neighbor-joining as well as maximum parsimony, both statistically supported using the bootstrap method. The results confirm the major division between the Pythiales and Peronosporales on the one hand and the Saprolegniales, Leptomitales, and Rhipidiales on the other. The Sclerosporales were shown to be polyphyletic; while Sclerosporaceae are nested within the Peronosporaceae, the Verrucalvaceae are merged within the Saprolegniales. Within the Peronosporomycetidae, Pythiales as well as Peronosporales as currently defined are polyphyletic. The well supported Albugo clade appears to be the most basal lineage, followed by a Pythium-Lagenidium clade. The third, highly supported clade comprises the Peronosporaceae together with Sclerospora, Phytophthora, and Peronophythora. Peronophythora is placed within Phytophthora, indicating that both genera should be merged. Bremiella seems to be polyphyletic within the genus Plasmopara, suggesting a transfer to Plasmopara. The species of Peronospora do not appear as a monophyletic group. Peronospora species growing on Brassicaceae form a highly supported clade.
Ramnath; Jyrwa, D B; Dutta, A K; Das, B; Tandon, V
2014-03-01
The nodular tapeworm, Raillietina echinobothrida is a well studied avian gastrointestinal parasite of family Davaineidae (Cestoda: Cyclophyllidea). It is reported to be the largest in size and second most prevalent species infecting chicken in north-east India. In the present study, morphometrical methods coupled with the molecular analysis of the second internal transcribed spacer (ITS2) region of ribosomal DNA were employed for precise identification of the parasite. The annotated ITS2 region was found to be 446 bp long and further utilized to elucidate the phylogenetic relationships and its species-interrelationships at the molecular level. In phylogenetic analysis similar topology was observed among the trees obtained by distance-based neighbor-joining as well as character-based maximum parsimony tree building methods. The query sequence R. echinobothrida is well aligned and placed within the Davaineidae group, with all Raillietina species well separated from the other cyclophyllidean (taeniid and hymenolepid) cestodes, while Diphyllobothrium latum (Pseudophyllidea: Diphyllobothriidae) was rooted as an out-group. Sequence similarities indeed confirmed our hypothesis that Raillietina spp. are neighboring the position with other studied species of order Cyclophyllidea against the out-group order Pseudophyllidea. The present study strengthens the potential of ITS2 as a reliable marker for phylogenetic reconstructions.
el-Showk, Sedeer; Help-Rinta-Rahko, Hanna; Blomster, Tiina; Siligato, Riccardo; Marée, Athanasius F. M.; Mähönen, Ari Pekka; Grieneisen, Verônica A.
2015-01-01
An auxin maximum is positioned along the xylem axis of the Arabidopsis root tip. The pattern depends on mutual feedback between auxin and cytokinins mediated by the PIN class of auxin efflux transporters and AHP6, an inhibitor of cytokinin signalling. This interaction has been proposed to regulate the size and the position of the hormones’ respective signalling domains and specify distinct boundaries between them. To understand the dynamics of this regulatory network, we implemented a parsimonious computational model of auxin transport that considers hormonal regulation of the auxin transporters within a spatial context, explicitly taking into account cell shape and polarity and the presence of cell walls. Our analysis reveals that an informative spatial pattern in cytokinin levels generated by diffusion is a theoretically unlikely scenario. Furthermore, our model shows that such a pattern is not required for correct and robust auxin patterning. Instead, auxin-dependent modifications of cytokinin response, rather than variations in cytokinin levels, allow for the necessary feedbacks, which can amplify and stabilise the auxin maximum. Our simulations demonstrate the importance of hormonal regulation of auxin efflux for pattern robustness. While involvement of the PIN proteins in vascular patterning is well established, we predict and experimentally verify a role of AUX1 and LAX1/2 auxin influx transporters in this process. Furthermore, we show that polar localisation of PIN1 generates an auxin flux circuit that not only stabilises the accumulation of auxin within the xylem axis, but also provides a mechanism for auxin to accumulate specifically in the xylem-pole pericycle cells, an important early step in lateral root initiation. The model also revealed that pericycle cells on opposite xylem poles compete for auxin accumulation, consistent with the observation that lateral roots are not initiated opposite to each other. PMID:26505899
Rubinoff, Daniel; Le Roux, Johannes J.
2008-01-01
Background Saltational evolution in which a particular lineage undergoes relatively rapid, significant, and unparalleled change as compared with its closest relatives is rarely invoked as an alternative model to the dominant paradigm of gradualistic evolution. Identifying saltational events is an important first-step in assessing the importance of this discontinuous model in generating evolutionary novelty. We offer evidence for three independent instances of saltational evolution in a charismatic moth genus with only eight species. Methodology/Principal Findings Maximum parsimony, maximum likelihood and Bayesian search criteria offered congruent, well supported phylogenies based on 1,965 base pairs of DNA sequence using the mitochondrial gene cytochrome oxidase subunit I, and the nuclear genes elongation factor-1 alpha and wingless. Using a comparative methods approach, we examined three taxa exhibiting novelty in the form of Batesian mimicry, host plant shift, and dramatic physiological differences in light of the phylogenetic data. All three traits appear to have evolved relatively rapidly and independently in three different species of Proserpinus. Each saltational species exhibits a markedly different and discrete example of discontinuous trait evolution while remaining canalized for other typical traits shared by the rest of the genus. All three saltational taxa show insignificantly different levels of overall genetic change as compared with their congeners, implying that their divergence is targeted to particular traits and not genome-wide. Conclusions/Significance Such rapid evolution of novel traits in individual species suggests that the pace of evolution can be quick, dramatic, and isolated—even on the species level. These results may be applicable to other groups in which specific taxa have generated pronounced evolutionary novelty. Genetic mechanisms and methods for assessing such relatively rapid changes are postulated. PMID:19107205
Kress, W John; Erickson, David L; Swenson, Nathan G; Thompson, Jill; Uriarte, Maria; Zimmerman, Jess K
2010-11-09
Species number, functional traits, and phylogenetic history all contribute to characterizing the biological diversity in plant communities. The phylogenetic component of diversity has been particularly difficult to quantify in species-rich tropical tree assemblages. The compilation of previously published (and often incomplete) data on evolutionary relationships of species into a composite phylogeny of the taxa in a forest, through such programs as Phylomatic, has proven useful in building community phylogenies although often of limited resolution. Recently, DNA barcodes have been used to construct a robust community phylogeny for nearly 300 tree species in a forest dynamics plot in Panama using a supermatrix method. In that study sequence data from three barcode loci were used to generate a well-resolved species-level phylogeny. Here we expand upon this earlier investigation and present results on the use of a phylogenetic constraint tree to generate a community phylogeny for a diverse, tropical forest dynamics plot in Puerto Rico. This enhanced method of phylogenetic reconstruction ensures the congruence of the barcode phylogeny with broadly accepted hypotheses on the phylogeny of flowering plants (i.e., APG III) regardless of the number and taxonomic breadth of the taxa sampled. We also compare maximum parsimony versus maximum likelihood estimates of community phylogenetic relationships as well as evaluate the effectiveness of one- versus two- versus three-gene barcodes in resolving community evolutionary history. As first demonstrated in the Panamanian forest dynamics plot, the results for the Puerto Rican plot illustrate that highly resolved phylogenies derived from DNA barcode sequence data combined with a constraint tree based on APG III are particularly useful in comparative analysis of phylogenetic diversity and will enhance research on the interface between community ecology and evolution.
Sites, J.W.; Morando, M.; Highton, R.; Huber, F.; Jung, R.E.
2004-01-01
The Shenandoah salamander (Plethodon shenandoah), known from isolated talus slopes on three of the highest mountains in Shenandoah National Park, is listed as state-endangered in Virginia and federally endangered under the U.S. Endangered Species Act. A 1999 paper by G. R. Thurow described P. shenandoah-like salamanders from three localities further south in the Blue Ridge Physiographic Province, which, if confirmed, would represent a range extension for P. shenandoah of approximately 90 km from its nearest known locality. Samples collected from two of these three localities were included in a molecular phylogenetic study of the known populations of P. shenandoah, and all other recognized species in the Plethodon cinereus group, using a 792 bp region of the mitochondrial cytochrome-b gene. Phylogenetic estimates were based on Bayesian, maximum likelihood, and maximum parsimony methods and topologies examined for placement of the new P. shenandoah-like samples relative to all others. All topologies recovered all haplotypes of the P. shenandoah-like animals nested within P. cinereus, and a statistical comparison of the best likelihood tree topology with one with an enforced (Thurow + Shenandoah P. shenandoah) clade revealed that the unconstrained tree had a significantly lower -ln L score (P < 0.05, using the Shimodaira-Hasegawa test) than the constraint tree. This result and other anecdotal information give us no solid reason to consider the Thurow report valid. The current recovery program for P. shenandoah should remain focused on populations in Shenandoah National Park.
2014-01-01
Background Nematodirus spp. are among the most common nematodes of ruminants worldwide. N. oiratianus and N. spathiger are distributed worldwide as highly prevalent gastrointestinal nematodes, which cause emerging health problems and economic losses. Accurate identification of Nematodirus species is essential to develop effective control strategies for Nematodirus infection in ruminants. Mitochondrial DNA (mtDNA) could provide powerful genetic markers for identifying these closely related species and resolving phylogenetic relationships at different taxonomic levels. Methods In the present study, the complete mitochondrial (mt) genomes of N. oiratianus and N. spathiger from small ruminants in China were obtained using Long-range PCR and sequencing. Results The complete mt genomes of N. oiratianus and N. spathiger were 13,765 bp and 13,519 bp in length, respectively. Both mt genomes were circular and consisted of 36 genes, including 12 genes encoding proteins, 2 genes encoding rRNA, and 22 genes encoding tRNA. Phylogenetic analyses based on the concatenated amino acid sequence data of all 12 protein-coding genes by Bayesian inference (BI), Maximum likelihood (ML) and Maximum parsimony (MP) showed that the two Nematodirus species (Molineidae) were closely related to Dictyocaulidae. Conclusions The availability of the complete mtDNA sequences of N. oiratianus and N. spathiger not only provides new mtDNA sources for a better understanding of nematode mt genomics and phylogeny, but also provides novel and useful genetic markers for studying diagnosis, population genetics and molecular epidemiology of Nematodirus spp. in small ruminants. PMID:25015379
Parsimonious mathematical characterization of channel shape and size
USDA-ARS's Scientific Manuscript database
This work has two purposes: 1) using a Leopold and Maddock (1953) hydraulic geometry approach, present a mathematically parsimonious, two parameter, characterization of channel shape and size; and 2) analytically quantify cross-sectional area, top width, average depth, critical energy, and bankfull ...
A laid-back trip through the Hennigian Forests
2017-01-01
Background This paper is a comment on the idea of matrix-free Cladistics. Demonstration of this idea’s efficiency is a major goal of the study. Within the proposed framework, the ordinary (phenetic) matrix is necessary only as a “source” of Hennigian trees, not as a primary subject of the analysis. Switching from the matrix-based thinking to the matrix-free Cladistic approach clearly reveals that optimizations of the character-state changes are related not to the real processes, but to the form of the data representation. Methods We focused our study on the binary data. We wrote the simple Ruby-based script FORESTER version 1.0 that helps represent a binary matrix as an array of the rooted trees (as a “Hennigian forest”). The binary representations of the genomic (DNA) data have been made by script 1001. The Average Consensus method as well as the standard Maximum Parsimony (MP) approach has been used to analyze the data. Principal findings The binary matrix may be easily re-written as a set of rooted trees (maximal relationships). The latter might be analyzed by the Average Consensus method. Paradoxically, this method, if applied to the Hennigian forests, in principle can help to identify clades despite the absence of the direct evidence from the primary data. Our approach may handle clock-like or non-clock-like matrices, as well as the hypothetical, molecular or morphological data. Discussion Our proposal clearly differs from the numerous phenetic alignment-free techniques of the construction of the phylogenetic trees. Dealing with the relations, not with the actual “data” also distinguishes our approach from all optimization-based methods, if the optimization is defined as a way to reconstruct the sequences of the character-state changes on a tree, either the standard alignment-based techniques or the “direct” alignment-free procedure.
We are not viewing our recent framework as an alternative to the three-taxon statement analysis (3TA), but there are two major differences between our recent proposal and the 3TA, as originally designed and implemented: (1) the 3TA deals with the three-taxon statements or minimal relationships. According to the logic of 3TA, the set of the minimal trees must be established as a binary matrix and used as an input for the parsimony program. In this paper, we operate directly with maximal relationships written just as trees, not as binary matrices, while also using the Average Consensus method instead of the MP analysis. The solely ‘reversal’-based groups can always be found by our method without the separate scoring of the putative reversals before analyses. PMID:28740753
Homology and the optimization of DNA sequence data
NASA Technical Reports Server (NTRS)
Wheeler, W.
2001-01-01
Three methods of nucleotide character analysis are discussed. Their implications for molecular sequence homology and phylogenetic analysis are compared. The criteria of inter-data set congruence, both character-based and topological, are applied to two data sets to elucidate and potentially discriminate among these parsimony-based ideas. © 2001 The Willi Hennig Society.
A Molecular Phylogeny of the Chalcidoidea (Hymenoptera)
Munro, James B.; Heraty, John M.; Burks, Roger A.; Hawks, David; Mottern, Jason; Cruaud, Astrid; Rasplus, Jean-Yves; Jansta, Petr
2011-01-01
Chalcidoidea (Hymenoptera) are extremely diverse with more than 23,000 species described and over 500,000 species estimated to exist. This is the first comprehensive phylogenetic analysis of the superfamily based on a molecular analysis of 18S and 28S ribosomal gene regions for 19 families, 72 subfamilies, 343 genera and 649 species. The 56 outgroups are comprised of Ceraphronoidea and most proctotrupomorph families, including Mymarommatidae. Data alignment and the impact of ambiguous regions are explored using a secondary structure analysis and automated (MAFFT) alignments of the core and pairing regions and regions of ambiguous alignment. Both likelihood and parsimony approaches are used to analyze the data. Overall there is no impact of alignment method, and few but substantial differences between likelihood and parsimony approaches. Monophyly of Chalcidoidea and a sister group relationship between Mymaridae and the remaining Chalcidoidea is strongly supported in all analyses. Either Mymarommatoidea or Diaprioidea are the sister group of Chalcidoidea depending on the analysis. Likelihood analyses place Rotoitidae as the sister group of the remaining Chalcidoidea after Mymaridae, whereas parsimony nests them within Chalcidoidea. Some traditional family groups are supported as monophyletic (Agaonidae, Eucharitidae, Encyrtidae, Eulophidae, Leucospidae, Mymaridae, Ormyridae, Signiphoridae, Tanaostigmatidae and Trichogrammatidae). Several other families are paraphyletic (Perilampidae) or polyphyletic (Aphelinidae, Chalcididae, Eupelmidae, Eurytomidae, Pteromalidae, Tetracampidae and Torymidae). Evolutionary scenarios discussed for Chalcidoidea include the evolution of phytophagy, egg parasitism, sternorrhynchan parasitism, hypermetamorphic development and heteronomy. PMID:22087244
Influenza epidemics, seasonality, and the effects of cold weather on cardiac mortality
2012-01-01
Background More people die in the winter from cardiac disease, and there are competing hypotheses to explain this. The authors conducted a study in 48 US cities to determine how much of the seasonal pattern in cardiac deaths could be explained by influenza epidemics, whether that allowed a more parsimonious control for season than traditional spline models, and whether such control changed the short term association with temperature. Methods The authors obtained counts of daily cardiac deaths and of emergency hospital admissions of the elderly for influenza during 1992–2000. Quasi-Poisson regression models were conducted estimating the association between daily cardiac mortality, and temperature. Results Controlling for influenza admissions provided a more parsimonious model with better Generalized Cross-Validation, lower residual serial correlation, and better captured Winter peaks. The temperature-response function was not greatly affected by adjusting for influenza. The pooled estimated increase in risk for a temperature decrease from 0 to −5°C was 1.6% (95% confidence interval (CI) 1.1-2.1%). Influenza accounted for 2.3% of cardiac deaths over this period. Conclusions The results suggest that including epidemic data explained most of the irregular seasonal pattern (about 18% of the total seasonal variation), allowing more parsimonious models than when adjusting for seasonality only with smooth functions of time. The effect of cold temperature is not confounded by epidemics. PMID:23025494
Singular Spectrum Analysis for Astronomical Time Series: Constructing a Parsimonious Hypothesis Test
NASA Astrophysics Data System (ADS)
Greco, G.; Kondrashov, D.; Kobayashi, S.; Ghil, M.; Branchesi, M.; Guidorzi, C.; Stratta, G.; Ciszak, M.; Marino, F.; Ortolan, A.
We present a data-adaptive spectral method - Monte Carlo Singular Spectrum Analysis (MC-SSA) - and its modification to tackle astrophysical problems. Through numerical simulations we show the ability of the MC-SSA in dealing with 1/f^β power-law noise affected by photon counting statistics. Such a noise process is simulated by a first-order autoregressive, AR(1), process corrupted by intrinsic Poisson noise. In doing so, we statistically estimate a basic stochastic variation of the source and the corresponding fluctuations due to the quantum nature of light. In addition, the MC-SSA test retains its effectiveness even when a significant percentage of the signal falls below a certain level of detection, e.g., caused by the instrument sensitivity. The parsimonious approach presented here may be broadly applied, from the search for extrasolar planets to the extraction of low-intensity coherent phenomena probably hidden in high energy transients.
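The noise model named in this abstract, an AR(1) process corrupted by Poisson counting noise, can be illustrated with a short simulation. This is a sketch under stated assumptions rather than the authors' procedure: the abstract does not say how the AR(1) signal sets the Poisson rate, so the code below assumes the rate is a constant `mean_rate` plus the AR(1) value, clipped at zero. All function and parameter names are hypothetical, and the Poisson sampler uses Knuth's multiplication method, which is only suitable for small rates.

```python
import math
import random


def ar1_series(n, phi, sigma, rng):
    """Return n samples of x_t = phi * x_{t-1} + sigma * eps_t, eps_t ~ N(0, 1)."""
    x, out = 0.0, []
    for _ in range(n):
        x = phi * x + sigma * rng.gauss(0.0, 1.0)
        out.append(x)
    return out


def poisson_sample(lam, rng):
    """Draw one Poisson(lam) count with Knuth's method (small lam only)."""
    threshold = math.exp(-lam)
    k, p = 0, 1.0
    while True:
        p *= rng.random()
        if p <= threshold:
            return k
        k += 1


def photon_counts(n, phi, sigma, mean_rate, seed=0):
    """Simulate counts whose Poisson rate follows an AR(1) signal.

    Assumption of this sketch: rate_t = max(0, mean_rate + x_t).
    """
    rng = random.Random(seed)
    signal = ar1_series(n, phi, sigma, rng)
    return [poisson_sample(max(0.0, mean_rate + x), rng) for x in signal]
```

A call such as `photon_counts(200, phi=0.9, sigma=1.0, mean_rate=20.0)` yields a list of non-negative integer counts that could serve as surrogate input when testing a spectral method against this kind of null hypothesis.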
King, Benedict; Qiao, Tuo; Lee, Michael S Y; Zhu, Min; Long, John A
2017-07-01
The phylogeny of early gnathostomes provides an important framework for understanding one of the most significant evolutionary events, the origin and diversification of jawed vertebrates. A series of recent cladistic analyses have suggested that the placoderms, an extinct group of armoured fish, form a paraphyletic group basal to all other jawed vertebrates. We revised and expanded this morphological data set, most notably by sampling autapomorphies in a similar way to parsimony-informative traits, thus ensuring this data (unlike most existing morphological data sets) satisfied an important assumption of Bayesian tip-dated morphological clock approaches. We also found problems with characters supporting placoderm paraphyly, including character correlation and incorrect codings. Analysis of this data set reveals that paraphyly and monophyly of core placoderms (excluding maxillate forms) are essentially equally parsimonious. The two alternative topologies have different root positions for the jawed vertebrates but are otherwise similar. However, analysis using tip-dated clock methods reveals strong support for placoderm monophyly, due to this analysis favoring trees with more balanced rates of evolution. Furthermore, enforcing placoderm paraphyly results in higher levels and unusual patterns of rate heterogeneity among branches, similar to that generated from simulated trees reconstructed with incorrect root positions. These simulations also show that Bayesian tip-dated clock methods outperform parsimony when the outgroup is largely uninformative (e.g., due to inapplicable characters), as might be the case here. The analysis also reveals that gnathostomes underwent a rapid burst of evolution during the Silurian period which declined during the Early Devonian. This rapid evolution during a period with few articulated fossils might partly explain the difficulty in ascertaining the root position of jawed vertebrates. © The Author(s) 2016. 
Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Kim, Joo-Hwan; Kim, Dong-Kap; Forest, Felix; Fay, Michael F.; Chase, Mark W.
2010-01-01
Background Previous phylogenetic studies of Asparagales, although extensive and generally well supported, have left several sets of taxa unclearly placed and have not addressed all relationships within certain clades thoroughly (some clades were relatively sparsely sampled). One of the most important of these is sampling within and placement of Nolinoideae (Ruscaceae s.l.) of Asparagaceae sensu Angiosperm Phylogeny Group (APG) III, which subfamily includes taxa previously referred to Convallariaceae, Dracaenaceae, Eriospermaceae, Nolinaceae and Ruscaceae. Methods A phylogenetic analysis of a combined data set for 126 taxa of Ruscaceae s.l. and related groups in Asparagales based on three nuclear and plastid DNA coding genes, 18S rDNA (1796 bp), rbcL (1338 bp) and matK (1668 bp), representing a total of approx. 4·8 kb is presented. Parsimony and Bayesian inference analyses were conducted to elucidate relationships of Ruscaceae s.l. and related groups, and parsimony bootstrap analysis was performed to assess support of clades. Key Results The combination of the three genes results in the most highly resolved and strongly supported topology yet obtained for Asparagales including Ruscaceae s.l. Asparagales relationships are nearly congruent with previous combined gene analyses, which were reflected in the APG III classification. Parsimony and Bayesian analyses yield identical relationships except for some slight variation among the core asparagoid families, which nevertheless form a strongly supported group in both types of analyses. In core asparagoids, five major clades are identified: (1) Alliaceae s.l. (sensu APG III, Amaryllidaceae–Agapanthaceae–Alliaceae); (2) Asparagaceae–Laxmanniaceae–Ruscaceae s.l.; (3) Themidaceae; (4) Hyacinthaceae; (5) Anemarrhenaceae–Behniaceae–Herreriaceae–Agavaceae (clades 2–5 collectively Asparagaceae s.l. sensu APG III).
The position of Aphyllanthes is labile, but it is sister to Themidaceae in the combined maximum-parsimony tree and sister to Anemarrhenaceae in the Bayesian analysis. The highly supported clade of Xanthorrhoeaceae s.l. (sensu APG III, including Asphodelaceae and Hemerocallidaceae) is sister to the core asparagoids. Ruscaceae s.l. are a well-supported group. Asparagaceae s.s. are sister to Ruscaceae s.l., even though the clade of the two families is weakly supported; Laxmanniaceae are strongly supported as sister to Ruscaceae s.l. and Asparagaceae. Ruscaceae s.l. include six principal clades that often reflect previously named groups: (1) tribe Polygonateae (excluding Disporopsis); (2) tribe Ophiopogoneae; (3) tribe Convallarieae (excluding Theropogon); (4) Ruscaceae s.s. + Dracaenaceae + Theropogon + Disporopsis + Comospermum; (5) Nolinaceae, (6) Eriospermum. Conclusions The analyses here were largely conducted with new data collected for the same loci as in previous studies, but in this case from different species/DNA accessions and greater sampling in many cases than in previously published analyses; nonetheless, the results largely mirror those of previously conducted studies. This demonstrates the robustness of these results and answers questions often raised about reproducibility of DNA results, given the often sparse sampling of taxa in some studies, particularly the earliest ones. The results also provide a clear set of patterns on which to base a new classification of the subfamilies of Asparagaceae s.l., particularly Ruscaceae s.l. (= Nolinoideae of Asparagaceae s.l.), and examine other putatively important characters of Asparagales. PMID:20929900
Kassian, Alexei
2015-01-01
A lexicostatistical classification is proposed for 20 languages and dialects of the Lezgian group of the North Caucasian family, based on meticulously compiled 110-item wordlists, published as part of the Global Lexicostatistical Database project. The lexical data have been subsequently analyzed with the aid of the principal phylogenetic methods, both distance-based and character-based: Starling neighbor joining (StarlingNJ), Neighbor joining (NJ), Unweighted pair group method with arithmetic mean (UPGMA), Bayesian Markov chain Monte Carlo (MCMC), Unweighted maximum parsimony (UMP). Cognation indexes within the input matrix were marked by two different algorithms: traditional etymological approach and phonetic similarity, i.e., the automatic method of consonant classes (Levenshtein distances). Due to certain reasons (first of all, high lexicographic quality of the wordlists and a consensus about the Lezgian phylogeny among Caucasologists), the Lezgian database is a perfect testing area for appraisal of phylogenetic methods. For the etymology-based input matrix, all the phylogenetic methods, with the possible exception of UMP, have yielded trees that are sufficiently compatible with each other to generate a consensus phylogenetic tree of the Lezgian lects. The obtained consensus tree agrees with the traditional expert classification as well as some of the previously proposed formal classifications of this linguistic group. Contrary to theoretical expectations, the UMP method has suggested the least plausible tree of all. In the case of the phonetic similarity-based input matrix, the distance-based methods (StarlingNJ, NJ, UPGMA) have produced the trees that are rather close to the consensus etymology-based tree and the traditional expert classification, whereas the character-based methods (Bayesian MCMC, UMP) have yielded less likely topologies. PMID:25719456
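The phonetic-similarity matrix described above relies on Levenshtein distances over consonant-class transcriptions. The abstract does not spell out the procedure, so the following is only a minimal sketch of plain Levenshtein distance with a length normalization; the function names and the consonant-class strings in the usage note are hypothetical and are not taken from the Global Lexicostatistical Database.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance where insertion, deletion and substitution each cost 1."""
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, char_a in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, char_b in enumerate(b, start=1):
            substitution = previous[j - 1] + (0 if char_a == char_b else 1)
            deletion = previous[j] + 1
            insertion = current[j - 1] + 1
            current.append(min(substitution, deletion, insertion))
        previous = current
    return previous[-1]


def normalized_distance(a: str, b: str) -> float:
    """Levenshtein distance divided by the longer length (an assumed normalization)."""
    longest = max(len(a), len(b))
    return levenshtein(a, b) / longest if longest else 0.0
```

For two hypothetical consonant-class strings such as "KTR" and "KTN", `normalized_distance` gives one substitution over three symbols; a pairwise matrix of such values could then be passed to a distance-based method such as NJ or UPGMA.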
Phylogeny of Marsileaceous Ferns and Relationships of the Fossil Hydropteris pinnata Reconsidered.
Pryer
1999-09-01
Recent phylogenetic studies have provided compelling evidence that confirms the once disputed hypothesis of monophyly for heterosporous leptosporangiate ferns (Marsileaceae and Salviniaceae). Hypotheses for relationships among the three genera of Marsileaceae (Marsilea, Regnellidium, and Pilularia), however, have continued to be in conflict. The phylogeny of Marsileaceae is investigated here using information from morphology and rbcL sequence data. In addition, relationships among all heterosporous ferns, including the whole-plant fossil Hydropteris pinnata are reconsidered. Data sets of 71 morphological and 1239 rbcL characters for 23 leptosporangiate ferns, including eight heterosporous ingroup taxa and 15 homosporous outgroup taxa, were subjected to maximum parsimony analysis. Morphological analyses were carried out both with and without the fossil Hydropteris, and it was excluded from all analyses with rbcL data. An annotated list of the 71 morphological characters is provided in the appendix. For comparative purposes, the Rothwell and Stockey (1994) data set was also reanalyzed here. The best estimate of phylogenetic relationships for Marsileaceae in all analyses is that Pilularia and Regnellidium are sister taxa and Marsilea is sister to that clade. Morphological synapomorphies for various nodes are discussed. Analyses that included Hydropteris resulted in two most-parsimonious trees that differ only in the placement of the fossil. One topology is identical to the relationship found by Rothwell and Stockey (1994), placing the fossil sister to the Azolla plus Salvinia clade. The alternative topology places Hydropteris as the most basal member of the heterosporous fern clade. Equivocal interpretations for character evolution in heterosporous ferns are discussed in the context of these two most-parsimonious trees. 
Because of the observed degree of character ambiguity, the phylogenetic placement of Hydropteris is best viewed as unresolved, and recognition of the suborder Hydropteridineae, as circumscribed by Rothwell and Stockey (1994), is regarded as premature. The two competing hypotheses of relationships for heterosporous ferns are also compared with the known temporal distribution of relevant taxa. Stratigraphic fit of the phylogenetic estimates is measured by using the Stratigraphic Consistency Index and by comparison with minimum divergence times.
We tested two methods for dataset generation and model construction, and three tree-classifier variants to identify the most parsimonious and thematically accurate mapping methodology for the SW ReGAP project. Competing methodologies were tested in the East Great Basin mapping un...
Anger, Nicolas; Fogliani, Bruno; Scutt, Charles P; Gâteblé, Gildas
2017-03-01
This work aimed to gain insight into the breeding system at the base of living angiosperms through both character state reconstructions and the study of sex ratios and phenotypes in the likely sister to all other living angiosperms, Amborella trichopoda. Sex phenotypes were mapped onto a phylogeny of basally diverging angiosperms using maximum parsimony. In parallel, sex ratios and phenotypes were studied over two consecutive flowering seasons in an ex situ population of A. trichopoda, while the sex ratio of an in situ population was also assessed. Parsimony analyses failed to resolve the breeding system present at the base of living angiosperms, but indicated the importance of A. trichopoda for the future elucidation of this question. The ex situ A. trichopoda population studied showed a primary sex ratio close to 1:1, though sex ratio bias was found in the in situ population studied. Instances of sexual instability were quantified in both populations. Sex ratio data support the presence of genetic sex determination in A. trichopoda, whose further elucidation may guide inferences on the breeding system at the base of living angiosperms. Sexual instability in A. trichopoda suggests the operation of epigenetic mechanisms, and the evolution of dioecy via a gynodioecious intermediate. © The Author 2017. Published by Oxford University Press on behalf of the Annals of Botany Company. All rights reserved.
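Mapping a discrete character onto a fixed tree under maximum parsimony is commonly done with Fitch's algorithm, and an unresolved root state of the kind reported above corresponds to more than one state surviving at the root. The sketch below assumes a rooted binary tree written as nested tuples and unordered character states; the tip names and state labels are hypothetical and do not reproduce the study's data matrix.

```python
def fitch(tree, states):
    """Return (candidate root states, minimum number of state changes).

    tree   -- a leaf name (str) or a pair (left_subtree, right_subtree)
    states -- dict mapping each leaf name to its observed state
    """
    if isinstance(tree, str):
        return {states[tree]}, 0
    left, right = tree
    left_set, left_cost = fitch(left, states)
    right_set, right_cost = fitch(right, states)
    shared = left_set & right_set
    if shared:
        # The children agree on at least one state: no extra change needed.
        return shared, left_cost + right_cost
    # The children disagree: keep both options and count one change.
    return left_set | right_set, left_cost + right_cost + 1


# Hypothetical four-tip example with an ambiguous root.
example_tree = (("tip1", "tip2"), ("tip3", "tip4"))
example_states = {
    "tip1": "dioecious",
    "tip2": "dioecious",
    "tip3": "bisexual",
    "tip4": "bisexual",
}
root_states, changes = fitch(example_tree, example_states)
```

Here both states remain equally parsimonious at the root with a single change on the tree, which is the situation in which parsimony cannot decide the ancestral condition.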
DupTree: a program for large-scale phylogenetic analyses using gene tree parsimony.
Wehe, André; Bansal, Mukul S; Burleigh, J Gordon; Eulenstein, Oliver
2008-07-01
DupTree is a new software program for inferring rooted species trees from collections of gene trees using the gene tree parsimony approach. The program implements a novel algorithm that significantly improves upon the run time of standard search heuristics for gene tree parsimony, and enables the first truly genome-scale phylogenetic analyses. In addition, DupTree allows users to examine alternate rootings and to weight the reconciliation costs for gene trees. DupTree is an open source project written in C++. DupTree for Mac OS X, Windows, and Linux along with a sample dataset and an on-line manual are available at http://genome.cs.iastate.edu/CBL/DupTree
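Gene tree parsimony scores a candidate species tree by the number of duplications needed to reconcile each gene tree with it. The sketch below is not DupTree's algorithm (the abstract describes a fast search heuristic that is not reproduced here); it only illustrates, under simple assumptions, how the duplication cost of one rooted binary gene tree against one fixed species tree can be counted with the standard least-common-ancestor mapping. Trees are nested tuples, gene-tree leaves are labelled with species names, and all names are hypothetical.

```python
def clades(tree):
    """All clades of a rooted tree, including single leaves and the root."""
    found = set()

    def visit(node):
        if isinstance(node, str):
            clade = frozenset([node])
        else:
            clade = frozenset().union(*(visit(child) for child in node))
        found.add(clade)
        return clade

    visit(tree)
    return found


def duplication_cost(gene_tree, species_tree):
    """Count gene-tree nodes that map to the same species-tree node as a child."""
    species_clades = clades(species_tree)

    def lca(species):
        # The smallest species-tree clade containing all the given species.
        return min((c for c in species_clades if species <= c), key=len)

    duplications = 0

    def visit(node):
        nonlocal duplications
        if isinstance(node, str):
            return frozenset([node])
        child_sets = [visit(child) for child in node]
        here = frozenset().union(*child_sets)
        if any(lca(child) == lca(here) for child in child_sets):
            duplications += 1
        return here

    visit(gene_tree)
    return duplications
```

Summing `duplication_cost` over a collection of gene trees gives the score that a search over species trees would try to minimize.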
Kang, Jong-Soo; Lee, Byoung Yoon; Kwak, Myounghai
2017-01-01
The complete chloroplast genomes of Lychnis wilfordii and Silene capitata were determined and compared with ten previously reported Caryophyllaceae chloroplast genomes. The chloroplast genome sequences of L. wilfordii and S. capitata contain 152,320 bp and 150,224 bp, respectively. The gene contents and orders among 12 Caryophyllaceae species are consistent, but several microstructural changes have occurred. Expansion of the inverted repeat (IR) regions at the large single copy (LSC)/IRb and small single copy (SSC)/IR boundaries led to partial or entire gene duplications. Additionally, rearrangements of the LSC region were caused by gene inversions and/or transpositions. The 18 kb inversions, which occurred three times in different lineages of tribe Sileneae, were thought to be facilitated by the intermolecular duplicated sequences. Sequence analyses of the L. wilfordii and S. capitata genomes revealed 39 and 43 repeats, respectively, including forward, palindromic, and reverse repeats. In addition, a total of 67 and 56 simple sequence repeats were discovered in the L. wilfordii and S. capitata chloroplast genomes, respectively. Finally, we constructed phylogenetic trees of the 12 Caryophyllaceae species and two Amaranthaceae species based on 73 protein-coding genes using both maximum parsimony and likelihood methods.
Spread of cattle led to the loss of matrilineal descent in Africa: a coevolutionary analysis.
Holden, Clare Janaki; Mace, Ruth
2003-01-01
Matrilineal descent is rare in human societies that keep large livestock. However, this negative correlation does not provide reliable evidence that livestock and descent rules are functionally related, because human cultures are not statistically independent owing to their historical relationships (Galton's problem). We tested the hypothesis that when matrilineal cultures acquire cattle they become patrilineal using a sample of 68 Bantu- and Bantoid-speaking populations from sub-Saharan Africa. We used a phylogenetic comparative method to control for Galton's problem, and a maximum-parsimony Bantu language tree as a model of population history. We tested for coevolution between cattle and descent. We also tested the direction of cultural evolution--were cattle acquired before matriliny was lost? The results support the hypothesis that acquiring cattle led formerly matrilineal Bantu-speaking cultures to change to patrilineal or mixed descent. We discuss possible reasons for matriliny's association with horticulture and its rarity in pastoralist societies. We outline the daughter-biased parental investment hypothesis for matriliny, which is supported by data on sex, wealth and reproductive success from two African societies, the matrilineal Chewa in Malawi and the patrilineal Gabbra in Kenya. PMID:14667331
Zhong, Hua-Ming; Zhang, Hong-Hai; Sha, Wei-Lai; Zhang, Cheng-De; Chen, Yu-Cai
2010-04-01
The whole mitochondrial genome sequence of red fox (Vulpes vulpes) was determined. It had a total length of 16 723 bp. As in most mammal mitochondrial genomes, it contained 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes and one control region. The base composition was 31.3% A, 26.1% C, 14.8% G and 27.8% T. The codon usage of red fox, arctic fox, gray wolf, domestic dog and coyote followed the same pattern except for an unusual ATT start codon, which initiates the NADH dehydrogenase subunit 3 gene in the red fox. A long tandem repeat rich in AC was found between conserved sequence blocks 1 and 2 in the control region. In order to confirm the phylogenetic relationships of red fox to other canids, phylogenetic trees were reconstructed by neighbor-joining and maximum parsimony methods using 12 concatenated heavy-strand protein-coding genes. The result indicated that arctic fox was the sister group of red fox and that they both belong to the red fox-like clade in family Canidae, while gray wolf, domestic dog and coyote belong to the wolf-like clade. The result was in accordance with existing phylogenetic results.
Phylogeny of Celastrus L. (Celastraceae) inferred from two nuclear and three plastid markers.
Mu, Xian-Yun; Zhao, Liang-Cheng; Zhang, Zhi-Xiang
2012-09-01
This is the first comprehensive molecular investigation of the genus Celastrus L. Phylogenetic relationships within the genus were assessed based on sequences of two nuclear (ETS, ITS) and three plastid (psbA-trnH, rpl16 and trnL-F) regions using the Bayesian inference and the maximum parsimony methods. Our results show that Celastrus, together with Tripterygium, formed a maximally supported clade. Within the cluster, Celastrus is composed of a basal clade and a core Celastrus clade, and the latter consists of six subclades. Relationships among species are more influenced by latitude than continental distribution patterns. The cauline cyme and lunate seeds are distinct characters of one of the maximally supported subclades. Their close relationship, similar geographical pattern and habitat imply that C. flagellaris may be a potential invasive species threatening C. scandens in North America. Celastrus leiocarpus, C. oblanceifolius and C. rugosus are confirmed as synonyms of C. punctatus, C. aculeatus and C. glaucophyllus, respectively. Discordance between the molecular data and previous morphology-based subgeneric classifications is noted. More work is needed to clarify the relationship between Celastrus and Tripterygium and the species within Celastrus.
Genetic Identification of Orientobilharzia turkestanicum from Sheep Isolates in Iran.
Tabaripour, Reza; Youssefi, Mohammad Reza; Tabaripour, Rabeeh
2015-01-01
Adult worms of Orientobilharzia turkestanicum live in the portal or intestinal veins of cattle, sheep, goats and many other mammals, causing orientobilharziasis. Orientobilharziasis causes significant economic losses to the livestock industry of Iran. However, there is limited information about genotypes of O. turkestanicum in Iran. In this study, 30 isolates of O. turkestanicum obtained from sheep were characterized by sequencing the mitochondrial cytochrome c oxidase subunit 1 (cox1) and nicotinamide adenine dinucleotide dehydrogenase subunit 1 (nad1) genes. The mitochondrial cox1 and nad1 DNA were amplified by polymerase chain reaction (PCR) and then sequenced and compared with those of O. turkestanicum and other members of the Schistosomatidae available in GenBank. Phylogenetic relationships between them were reconstructed using the maximum parsimony method. Phylogenetic analyses in the present study placed O. turkestanicum within the genus Schistosoma, and indicated that O. turkestanicum was phylogenetically closer to the African schistosome group than to the Asian schistosome group. Comparison of nad1 and cox1 sequences of O. turkestanicum obtained in this study with corresponding sequences available in GenBank revealed some sequence variations and provided evidence for the presence of microvariants in Iran.
Chen, Yuan
2017-01-01
In this study, we sequenced fragments of cytochrome oxidase subunit 1 (CO1), internal transcribed spacer 1 (ITS1), and internal transcribed spacer 2 (ITS2) genes from 150 specimens belonging to 16 species of the ant genus Formica from China. Odontoponera transversa from Ponerinae and Polyergus samurai from Formicinae were added as distant relative and close relative outgroups, respectively. Neighbor-joining, maximum parsimony, and Bayesian inference methods were used to analyze their phylogenetic relationships based on CO1 gene sequence as well as combined sequence data of CO1 + ITS1, CO1 + ITS2, and CO1 + ITS1 + ITS2. The results showed that nine Formica species (i.e., Formica sinensis, Formica manchu, Formica uralensis, Formica sanguinea, Formica gagatoides, Formica candida, Formica fusca, Formica glauca, and Formica sp.) formed monophyletic clades, which is in agreement with the results based on morphological taxonomy. By comparing the results of DNA barcoding and morphological taxonomy, we propose that Formica aquilonia may be a junior synonym of F. polyctena and that cryptic species likely exist in Formica sinae. Further studies on morphology, biology, and geography are needed to confirm this notion.
Miao, Miao; Song, Weibo; Clamp, John C; Al-Rasheid, Khaled A S; Al-Khedhairy, Abdulaziz A; Al-Arifi, Saud
2009-01-01
The systematic relationships and taxonomic positions of the traditional heterotrich genera Condylostentor, Climacostomum, Fabrea, Folliculina, Peritromus, and Condylostoma, as well as the licnophorid genus Licnophora, were re-examined using new data from sequences of the gene coding for small subunit ribosomal RNA. Trees constructed using distance-matrix, Bayesian inference, and maximum-parsimony methods all showed the following relationships: (1) the "traditional" heterotrichs consist of several paraphyletic groups, including the current classes Heterotrichea, Armophorea and part of the Spirotrichea; (2) the class Heterotrichea was confirmed as a monophyletic assemblage based on our analyses of 31 taxa, and the genus Peritromus was demonstrated to be a peripheral group; (3) the genus Licnophora occupied an isolated branch on one side of the deepest divergence in the subphylum Intramacronucleata and was closely affiliated with spirotrichs, armophoreans, and clevelandellids; (4) Condylostentor, a recently defined genus with several truly unique morphological features, is more closely related to Condylostoma than to Stentor; (5) Folliculina, Eufolliculina, and Maristentor always clustered together with high bootstrap support; and (6) Climacostomum occupied a paraphyletic position distant from Fabrea, showing a close relationship with Condylostomatidae and Chattonidiidae despite modest support.
Phylogenetic changes in soil microbial and diazotrophic diversity with application of butachlor.
Yen, Jui-Hung; Wang, Yei-Shung; Hsu, Wey-Shin; Chen, Wen-Ching
2013-01-01
We investigated changes in population and taxonomic distribution of cultivable bacteria and diazotrophs with butachlor application in rice paddy soils. Population changes were measured by the traditional plate-count method, and taxonomic distribution was studied by 16S rDNA sequencing, then maximum parsimony phylogenetic analysis with bootstrapping (1,000 replications). The bacterial population was higher after 39 than 7 days of rice cultivation, which indicated the augmentation of soil microbes by rice root exudates. The application of butachlor increased the diazotrophic population in both upper (0-3 cm) and lower (3-15 cm) layers of soils. Especially at day 39, the population of diazotrophs was 1.8 and 1.6 times that of the control in upper and lower layer soils, respectively. We found several bacterial strains only with butachlor application; examples are strains closest to Bacillus arsenicus, B. marisflavi, B. luciferensis, B. pumilus, and Pseudomonas alvei. Among diazotrophs, three strains closely related to Streptomyces sp. or Rhizobium sp. were found only with butachlor application. The population of cultivable bacteria and the species composition were both changed with butachlor application, which explains in part the contribution of butachlor to augmenting soil nitrogen-fixing ability.
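Bootstrapping in a phylogenetic analysis usually means resampling alignment columns with replacement and rebuilding a tree from each pseudoreplicate. The sketch below shows only the resampling step, assuming an alignment stored as a list of equal-length strings; the function names, the seed and the toy alignment in the usage note are assumptions, and the tree search itself is left out.

```python
import random


def bootstrap_alignment(alignment, rng):
    """Resample the columns of an alignment with replacement."""
    n_sites = len(alignment[0])
    picks = [rng.randrange(n_sites) for _ in range(n_sites)]
    # Apply the same column picks to every sequence so columns stay intact.
    return ["".join(sequence[i] for i in picks) for sequence in alignment]


def bootstrap_replicates(alignment, n_replicates=1000, seed=0):
    """Generate n_replicates pseudoreplicate alignments (seeded for repeatability)."""
    rng = random.Random(seed)
    return [bootstrap_alignment(alignment, rng) for _ in range(n_replicates)]
```

Calling `bootstrap_replicates(["ACGT", "ACGA", "TCGA"])` would yield 1,000 resampled alignments of the same size; clade support is then the fraction of replicate trees in which a clade appears.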
Brammer, Colin A; von Dohlen, Carol D
2007-05-01
Stratiomyidae is a cosmopolitan family of Brachycera (Diptera) that contains over 2800 species. This study focused on the relationships of members of the subfamily Clitellariinae, which has had a complicated taxonomic history. To investigate the monophyly of the Clitellariinae, the relationships of its genera, and the ages of Stratiomyidae lineages, representatives for all 12 subfamilies of Stratiomyidae, totaling 68 taxa, were included in a phylogenetic reconstruction. A Xylomyidae representative, Solva sp., was used as an outgroup. Sequences of EF-1alpha and 28S rRNA genes were analyzed under maximum parsimony with bootstrapping, and Bayesian methods to recover the best estimate of phylogeny. A chronogram with estimated dates for all nodes in the phylogeny was generated with the program, r8s, and divergence dates and confidence intervals were further explored with the program, multidivtime. All subfamilies of Stratiomyidae with more than one representative were found to be monophyletic, except for Stratiomyinae and Clitellariinae. Clitellariinae were distributed among five separate clades in the phylogeny, and Raphiocerinae were nested within Stratiomyinae. Dating analysis suggested an early Cretaceous origin for the common ancestor of extant Stratiomyidae, and a radiation of several major Stratiomyidae lineages in the Late Cretaceous.
Estimation of the ARNO model baseflow parameters using daily streamflow data
NASA Astrophysics Data System (ADS)
Abdulla, F. A.; Lettenmaier, D. P.; Liang, Xu
1999-09-01
An approach is described for estimation of baseflow parameters of the ARNO model, using historical baseflow recession sequences extracted from daily streamflow records. This approach allows four of the model parameters to be estimated without rainfall data, and effectively facilitates partitioning of the parameter estimation procedure so that parsimonious search procedures can be used to estimate the remaining storm response parameters separately. Three methods of optimization are evaluated for estimation of four baseflow parameters. These methods are the downhill Simplex (S), Simulated Annealing combined with the Simplex method (SA) and Shuffled Complex Evolution (SCE). These estimation procedures are explored in conjunction with four objective functions: (1) ordinary least squares; (2) ordinary least squares with Box-Cox transformation; (3) ordinary least squares on prewhitened residuals; (4) ordinary least squares applied to prewhitened residuals with Box-Cox transformation. The effects of changing the random-generator seed for both SA and SCE methods are also explored, as are the effects of the bounds of the parameters. Although all schemes converge to the same values of the objective function, the SCE method was found to be less sensitive to these issues than both the SA and the Simplex schemes. Parameter uncertainty and interactions are investigated through estimation of the variance-covariance matrix and confidence intervals. As expected, the parameters were found to be correlated and the covariance matrix was found to be not diagonal. Furthermore, the linearized confidence interval theory failed for about one-fourth of the catchments while the maximum likelihood theory did not fail for any of the catchments.
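The first two objective functions listed above can be sketched as follows. This is an illustration under stated assumptions rather than the authors' implementation: the Box-Cox parameter `lam` is a free input here because the abstract does not report the value used, the function names are hypothetical, and the prewhitened variants (3) and (4) are omitted.

```python
import math


def box_cox(value, lam):
    """Box-Cox transform: (y**lam - 1) / lam, or log(y) when lam == 0."""
    if value <= 0:
        raise ValueError("Box-Cox transform requires positive values")
    if lam == 0:
        return math.log(value)
    return (value ** lam - 1.0) / lam


def ols(observed, simulated):
    """Ordinary least squares: sum of squared residuals."""
    return sum((o - s) ** 2 for o, s in zip(observed, simulated))


def ols_box_cox(observed, simulated, lam):
    """Least squares computed on Box-Cox transformed flows."""
    return sum(
        (box_cox(o, lam) - box_cox(s, lam)) ** 2
        for o, s in zip(observed, simulated)
    )
```

With `lam = 1` the transformed objective equals plain OLS, while values below 1 reduce the weight of high flows relative to low flows, which is one common motivation for using the transform on recession data.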
Inferring species trees from incongruent multi-copy gene trees using the Robinson-Foulds distance
2013-01-01
Background Constructing species trees from multi-copy gene trees remains a challenging problem in phylogenetics. One difficulty is that the underlying genes can be incongruent due to evolutionary processes such as gene duplication and loss, deep coalescence, or lateral gene transfer. Gene tree estimation errors may further exacerbate the difficulties of species tree estimation. Results We present a new approach for inferring species trees from incongruent multi-copy gene trees that is based on a generalization of the Robinson-Foulds (RF) distance measure to multi-labeled trees (mul-trees). We prove that it is NP-hard to compute the RF distance between two mul-trees; however, it is easy to calculate this distance between a mul-tree and a singly-labeled species tree. Motivated by this, we formulate the RF problem for mul-trees (MulRF) as follows: Given a collection of multi-copy gene trees, find a singly-labeled species tree that minimizes the total RF distance from the input mul-trees. We develop and implement a fast SPR-based heuristic algorithm for the NP-hard MulRF problem. We compare the performance of the MulRF method (available at http://genome.cs.iastate.edu/CBL/MulRF/) with several gene tree parsimony approaches using gene tree simulations that incorporate gene tree error, gene duplications and losses, and/or lateral transfer. The MulRF method produces more accurate species trees than gene tree parsimony approaches. We also demonstrate that the MulRF method infers in minutes a credible plant species tree from a collection of nearly 2,000 gene trees. Conclusions Our new phylogenetic inference method, based on a generalized RF distance, makes it possible to quickly estimate species trees from large genomic data sets. 
Since the MulRF method, unlike gene tree parsimony, is based on a generic tree distance measure, it is appealing for analyses of genomic data sets, in which many processes such as deep coalescence, recombination, gene duplication and losses as well as phylogenetic error may contribute to gene tree discord. In experiments, the MulRF method estimated species trees accurately and quickly, demonstrating MulRF as an efficient alternative approach for phylogenetic inference from large-scale genomic data sets. PMID:24180377
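For orientation, the ordinary Robinson-Foulds distance that MulRF generalizes can be computed by comparing the clades of two trees. The sketch below covers only the simple case of two rooted, singly-labeled trees on the same leaf set, written as nested tuples; it does not implement the mul-tree generalization or the SPR-based search described above, and the leaf names in the usage note are hypothetical.

```python
def nontrivial_clades(tree):
    """Clades (as frozensets of leaf names) of a rooted tree, excluding the full leaf set."""
    found = set()

    def visit(node):
        if isinstance(node, str):
            return frozenset([node])  # single leaves are shared by all trees
        clade = frozenset().union(*(visit(child) for child in node))
        found.add(clade)
        return clade

    found.discard(visit(tree))  # the root clade carries no information
    return found


def rf_distance(tree_a, tree_b):
    """Number of clades present in exactly one of the two trees."""
    return len(nontrivial_clades(tree_a) ^ nontrivial_clades(tree_b))
```

For example, `rf_distance(((("A", "B"), "C"), "D"), ((("A", "C"), "B"), "D"))` counts the clade {A, B} in the first tree and {A, C} in the second, while the shared clade {A, B, C} contributes nothing.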
TOWARD A MOLECULAR PHYLOGENY FOR PEROMYSCUS: EVIDENCE FROM MITOCHONDRIAL CYTOCHROME-b SEQUENCES
Bradley, Robert D.; Durish, Nevin D.; Rogers, Duke S.; Miller, Jacqueline R.; Engstrom, Mark D.; Kilpatrick, C. William
2009-01-01
One hundred DNA sequences from the mitochondrial cytochrome-b gene of 44 species of deer mice (Peromyscus sensu stricto), 1 of Habromys, 1 of Isthmomys, 2 of Megadontomys, and the monotypic genera Neotomodon, Osgoodomys, and Podomys were used to develop a molecular phylogeny for Peromyscus. Phylogenetic analyses (maximum parsimony, maximum likelihood, and Bayesian inference) were conducted to evaluate alternative hypotheses concerning taxonomic arrangements (sensu stricto versus sensu lato) of the genus. In all analyses, monophyletic clades were obtained that corresponded to species groups proposed by previous authors; however, relationships among species groups generally were poorly resolved. The concept of the genus Peromyscus based on molecular data differed significantly from the most current taxonomic arrangement. Maximum-likelihood and Bayesian trees depicted strong support for a clade placing Habromys, Megadontomys, Neotomodon, Osgoodomys, and Podomys within Peromyscus. If Habromys, Megadontomys, Neotomodon, Osgoodomys, and Podomys are regarded as genera, then several species groups within Peromyscus (sensu stricto) should be elevated to generic rank. Isthmomys was associated with the genus Reithrodontomys; in turn this clade was sister to Baiomys, indicating a distant relationship of Isthmomys to Peromyscus. A formal taxonomic revision awaits synthesis of additional sequence data from nuclear markers together with inclusion of available allozymic and karyotypic data. PMID:19924266
Murdock, Andrew G
2008-05-01
Closely related outgroups are optimal for rooting phylogenetic trees; however, such ideal outgroups are not always available. A phylogeny of the marattioid ferns (Marattiaceae), an ancient lineage with no close relatives, was reconstructed using nucleotide sequences of multiple chloroplast regions (rps4 + rps4-trnS spacer, trnS-trnG spacer + trnG intron, rbcL, atpB), from 88 collections, selected to cover the broadest possible range of morphologies and geographic distributions within the extant taxa. Because marattioid ferns are phylogenetically isolated from other lineages, and internal branches are relatively short, rooting was problematic. Root placement was strongly affected by long-branch attraction under maximum parsimony and by model choice under maximum likelihood. A multifaceted approach to rooting was employed to isolate the sources of bias and produce a consensus root position. In a statistical comparison of all possible root positions with three different outgroups, most root positions were not significantly less optimal than the maximum likelihood root position, including the consensus root position. This phylogeny has several important taxonomic implications for marattioid ferns: Marattia in the broad sense is paraphyletic; the Hawaiian endemic Marattia douglasii is most closely related to tropical American taxa; and Angiopteris is monophyletic only if Archangiopteris and Macroglossum are included.
Teletchea, Fabrice; Laudet, Vincent; Hänni, Catherine
2006-01-01
Although Codfishes are probably one of the most studied groups of all teleost fishes worldwide owing to their great importance to fisheries, their phylogeny and classification are still far from being firmly established. In this study, we present phylogenetic relationships of 19 out of 22 genera traditionally included in the Gadidae based on the analysis of entire cytochrome b and partial cytochrome oxidase I genes (1530 bp). Maximum Parsimony, Maximum Likelihood, and Bayesian analyses all recovered five main clades that correspond to traditionally recognized groupings within Gadoids. The same clades were recovered with MP analysis based on 30 morphological characters (collected from the literature). Given these findings, we propose a revised provisional classification of Gadoids: one suborder Gadoidei containing two families, the Merlucciidae (1 genus) and the Gadidae (21 genera) distributed into four subfamilies: the Gadinae (12 genera), the Lotinae (3 genera), the Gaidropsarinae (3 genera), and the Phycinae (3 genera). Lastly, nuclear inserts of mitochondrial DNA (Numts) were identified in two species, i.e., Gadiculus argenteus and Melanogrammus aeglefinus.
Colli, Guarino R; Hoogmoed, Marinus S; Cannatella, David C; Cassimiro, José; Gomes, Jerriane Oliveira; Ghellere, José Mário; Nunes, Pedro M Sales; Pellegrino, Kátia C M; Salerno, Patricia; Souza, Sergio Marques De; Rodrigues, Miguel Trefaut
2015-08-18
We describe a new genus and two new species of gymnophthalmid lizards based on specimens collected from Brazilian Amazonia, mostly in the "arc of deforestation". The new genus is easily distinguished from other Gymnophthalmidae by having very wide, smooth, and imbricate nuchals, arranged in two longitudinal and 6-10 transverse rows from nape to brachium level, followed by much narrower, strongly keeled, lanceolate, and mucronate scales. It also differs from all other Gymnophthalmidae, except Iphisa, by the presence of two longitudinal rows of ventrals. The new genus differs from Iphisa by having two pairs of enlarged chinshields (one in Iphisa); posterior dorsal scales lanceolate, strongly keeled and not arranged in longitudinal rows (dorsals broad, smooth and forming two longitudinal rows), and lateral scales keeled (smooth). Maximum parsimony, maximum likelihood, and Bayesian phylogenetic analyses based on morphological and molecular data indicate the new species form a clade that is most closely related to Iphisa. We also address several nomenclatural issues and present a revised classification of Gymnophthalmidae.
Ma, Shuai; Wu, Qi; Hu, Yibo; Wei, Fuwen
2018-05-20
The explosive growth in genomic data has provided novel insights into the conflicting signals hidden in phylogenetic trees. Although some studies have explored the effects of the GC content and parsimony informative sites (PIS) on the phylogenetic tree, the effect of the heterogeneity of the GC content at the first/second/third codon position on parsimony informative sites (GC1/2/3 PIS) among different species and the effect of PIS on phylogenetic tree construction remain largely unexplored. Here, we used two different mammal genomic datasets to explore the patterns of GC1/2/3 PIS heterogeneity and the effect of PIS on the phylogenetic tree of genes: (i) all GC1/2/3 PIS have obvious heterogeneity between different mammals, and the levels of heterogeneity are GC3 PIS > GC2 PIS > GC1 PIS; (ii) the number of PIS is positively correlated with the metrics of "good" gene tree topologies, and excluding the third codon position (C3) decreases the quality of gene trees by removing too many PIS. These results provide novel insights into the heterogeneity pattern of GC1/2/3 PIS in mammals and the relationship between GC3/PIS and gene trees. Additionally, it is necessary to carefully consider whether to exclude C3 to improve the quality of gene trees, especially in the super-tree method. Copyright © 2018 Elsevier B.V. All rights reserved.
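The two quantities discussed above can be illustrated with a short sketch. It assumes the usual definition of a parsimony informative site (at least two states, each present in at least two sequences), an in-frame coding alignment stored as equal-length uppercase strings, and hypothetical function names; it does not reproduce the paper's pipeline or its heterogeneity statistics.

```python
from collections import Counter


def is_parsimony_informative(column):
    """True if at least two nucleotides each occur in at least two sequences."""
    counts = Counter(base for base in column if base in "ACGT")
    return sum(1 for n in counts.values() if n >= 2) >= 2


def parsimony_informative_sites(alignment):
    """Zero-based indices of parsimony informative columns."""
    return [
        index
        for index, column in enumerate(zip(*alignment))
        if is_parsimony_informative(column)
    ]


def gc_at_codon_position(sequence, position):
    """GC fraction at codon position 1, 2 or 3 of an in-frame sequence."""
    bases = [b for b in sequence[position - 1::3] if b in "ACGT"]
    return sum(b in "GC" for b in bases) / len(bases) if bases else 0.0
```

Restricting `gc_at_codon_position` to the columns returned by `parsimony_informative_sites` would give a per-species value comparable in spirit to the GC1/2/3 PIS measure, although the exact computation used in the study is not stated in the abstract.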
Nickrent, D L; Parkinson, C L; Palmer, J D; Duff, R J
2000-12-01
A widely held view of land plant relationships places liverworts as the first branch of the land plant tree, whereas some molecular analyses and a cladistic study of morphological characters indicate that hornworts are the earliest land plants. To help resolve this conflict, we used parsimony and likelihood methods to analyze a 6,095-character data set composed of four genes (chloroplast rbcL and small-subunit rDNA from all three plant genomes) from all major land plant lineages. In all analyses, significant support was obtained for the monophyly of vascular plants, lycophytes, ferns (including Psilotum and Equisetum), seed plants, and angiosperms. Relationships among the three bryophyte lineages were unresolved in parsimony analyses in which all positions were included and weighted equally. However, in parsimony and likelihood analyses in which rbcL third-codon-position transitions were either excluded or downweighted (due to apparent saturation), hornworts were placed as sister to all other land plants, with mosses and liverworts jointly forming the second deepest lineage. Decay analyses and Kishino-Hasegawa tests of the third-position-excluded data set showed significant support for the hornwort-basal topology over several alternative topologies, including the commonly cited liverwort-basal topology. Among the four genes used, mitochondrial small-subunit rDNA showed the lowest homoplasy and alone recovered essentially the same topology as the multigene tree. This molecular phylogeny presents new opportunities to assess paleontological evidence and morphological innovations that occurred during the early evolution of terrestrial plants.
Simmons, Mark P; Goloboff, Pablo A
2013-10-01
Empirical and simulated examples are used to demonstrate an artifact caused by undersampling optimal trees in data matrices that consist mostly or entirely of locally sampled (as opposed to globally, for most or all terminals) characters. The artifact is that unsupported clades consisting entirely of terminals scored for the same locally sampled partition may be resolved and assigned high resampling support, despite their being properly unsupported (i.e., not resolved in the strict consensus of all optimal trees). This artifact occurs despite application of random-addition sequences for stepwise terminal addition. The artifact is not necessarily obviated with thorough conventional branch swapping methods (even tree-bisection-reconnection) when just a single tree is held, as is sometimes implemented in parsimony bootstrap pseudoreplicates, and in every GARLI, PhyML, and RAxML pseudoreplicate and search for the most likely tree for the matrix as a whole. Hence GARLI, RAxML, and PhyML-based likelihood results require extra scrutiny, particularly when they provide high resolution and support for clades that are entirely unsupported by methods that perform more thorough searches, as in most parsimony analyses. Copyright © 2013 Elsevier Inc. All rights reserved.
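The notion of a clade being "not resolved in the strict consensus of all optimal trees" can be sketched in a few lines of Python. This is only an illustration under simplifying assumptions: each tree is represented as a set of clades, each clade being a frozenset of terminal names, and the two toy trees are hypothetical. It does not reproduce any of the programs named in the abstract.

```python
def strict_consensus(trees):
    """Return the clades present in every tree.

    Each tree is an iterable of clades; each clade is a frozenset
    of terminal names. At least one tree must be supplied.
    """
    clade_sets = [set(tree) for tree in trees]
    return set.intersection(*clade_sets)


# Two equally optimal trees (hypothetical) that disagree on A, B and C.
tree_1 = {frozenset("AB"), frozenset("ABC"), frozenset("DE")}
tree_2 = {frozenset("AC"), frozenset("ABC"), frozenset("DE")}

consensus = strict_consensus([tree_1, tree_2])

# Clade A+B appears in tree_1 only, so it is absent from the consensus.
print(frozenset("AB") in consensus)   # False
print(frozenset("ABC") in consensus)  # True
```

If a search holds only `tree_1`, clade A+B looks resolved even though the full set of optimal trees does not support it, which is the undersampling effect the abstract describes.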
A phylogeny of robber flies (Diptera: Asilidae) at the subfamilial level: molecular evidence.
Bybee, Seth M; Taylor, Sean D; Riley Nelson, C; Whiting, Michael F
2004-03-01
We present the first formal analysis of phylogenetic relationships among the Asilidae, based on four genes: 16S rDNA, 18S rDNA, 28S rDNA, and cytochrome oxidase II. Twenty-six ingroup taxa representing 11 of the 12 described subfamilies were selected to produce a phylogenetic estimate of asilid subfamilial relationships via optimization alignment, parsimony, and maximum likelihood techniques. Phylogenetic analyses support the monophyly of Asilidae with Leptogastrinae as the most basal robber fly lineage. Apocleinae+(Asilinae+Ommatiinae) is supported as monophyletic. The laphriinae-group (Laphriinae+Laphystiinae) and the dasypogoninae-group (Dasypogoninae+Stenopogoninae+Stichopogoninae+Trigonomiminae) are paraphyletic. These results suggest that current subfamilial classification only partially reflects robber fly phylogeny, indicating the need for further phylogenetic investigation of this group.
ERIC Educational Resources Information Center
Peltier, James W.; Cummins, Shannon; Pomirleanu, Nadia; Cross, James; Simon, Rob
2014-01-01
Students' desire and intention to pursue a career in sales continue to lag behind industry demand for sales professionals. This article develops and validates a reliable and parsimonious scale for measuring and predicting student intention to pursue a selling career. The instrument advances previous scales in three ways. The instrument is…
Optimization of Multilocus Sequence Analysis for Identification of Species in the Genus Vibrio
Gabriel, Michael W.; Matsui, George Y.; Friedman, Robert
2014-01-01
Multilocus sequence analysis (MLSA) is an important method for identification of taxa that are not well differentiated by 16S rRNA gene sequences alone. In this procedure, concatenated sequences of selected genes are constructed and then analyzed. The effects that the number and the order of genes used in MLSA have on reconstruction of phylogenetic relationships were examined. The recA, rpoA, gapA, 16S rRNA gene, gyrB, and ftsZ sequences from 56 species of the genus Vibrio were used to construct molecular phylogenies, and these were evaluated individually and using various gene combinations. Phylogenies from two-gene sequences employing recA and rpoA in both possible gene orders were different. The addition of the gapA gene sequence, producing all six possible concatenated sequences, reduced the differences in phylogenies to degrees of statistical (bootstrap) support for some nodes. The overall statistical support for the phylogenetic tree, assayed on the basis of a reliability score (calculated from the number of nodes having bootstrap values of ≥80 divided by the total number of nodes) increased with increasing numbers of genes used, up to a maximum of four. No further improvement was observed from addition of the fifth gene sequence (ftsZ), and addition of the sixth gene (gyrB) resulted in lower proportions of strongly supported nodes. Reductions in the numbers of strongly supported nodes were also observed when maximum parsimony was employed for tree construction. Use of a small number of gene sequences in MLSA resulted in accurate identification of Vibrio species. PMID:24951781
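The reliability score described in this abstract (the number of nodes with bootstrap values of ≥80 divided by the total number of nodes) reduces to a one-line ratio. The Python sketch below illustrates it; the function name and the example bootstrap values are hypothetical and do not come from the study.

```python
def reliability_score(bootstrap_values, threshold=80):
    """Fraction of nodes whose bootstrap support is at least `threshold`."""
    if not bootstrap_values:
        raise ValueError("at least one node is required")
    strong = sum(1 for value in bootstrap_values if value >= threshold)
    return strong / len(bootstrap_values)


# Hypothetical bootstrap percentages for the eight internal nodes of a tree.
example_nodes = [100, 95, 80, 79, 62, 45, 88, 99]
print(reliability_score(example_nodes))  # 0.625
```

Comparing this score across trees built from different gene concatenations is the kind of comparison the abstract reports; the threshold is kept as a parameter so that other cut-offs can be tried.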
Wang, Houshuai; Fan, Xiaoling; Owada, Mamoru; Wang, Min; Nylin, Sören
2014-01-01
The genus Panolis is a small group of noctuid moths with six recognized species distributed from Europe to East Asia, and best known for containing the widespread Palearctic pest species P. flammea, the pine beauty moth. However, a reliable classification and robust phylogenetic framework for this group of potential economic importance are currently lacking. Here, we use morphological and molecular data (mitochondrial genes cytochrome c oxidase subunit I and 16S ribosomal RNA, nuclear gene elongation factor-1 alpha) to reconstruct the phylogeny of this genus, with a comprehensive systematic revision of all recognized species and a new one, P. ningshan sp. nov. The results of maximum parsimony, maximum likelihood and Bayesian inference analyses of the combined morphological and molecular data sets are highly congruent, resulting in a robust phylogeny and identification of two clear species groups, i.e., the P. flammea species group and the P. exquisita species group. We also estimate the divergence times of Panolis moths using two conventional mutation rates for the arthropod mitochondrial COI gene with a comparison of two molecular clock models, as well as reconstruct their ancestral areas. Our results suggest that 1) Panolis is a young clade, originating from the Oriental region in China in the Late Miocene (6–10 Mya), with an ancestral species in the P. flammea group extending northward to the Palearctic region some 3–6 Mya; 2) there is a clear possibility for a representative of the Palearctic clade to become established as an invasive species in the Nearctic taiga. PMID:24603596
McKenna, Duane D; Farrell, Brian D
2005-10-01
Here, we report the results of a species level phylogenetic study of Cephaloleia beetles designed to clarify relationships and patterns of host plant taxon and tissue use among species. Our study is based on up to 2088bp of mtDNA sequence data. Maximum parsimony, maximum likelihood, and Bayesian methods of phylogenetic inference consistently recover a monophyletic Cephaloleia outside of a basal clade of primarily palm feeding species (the 'Arecaceae-feeding clade'), and C. irregularis. In all three analyses, the 'Arecaceae-feeding clade' includes Cephaloleia spp. with unusual morphological features, and a few species currently placed in other cassidine genera and tribes. All three analyses also recover a clade that includes all Zingiberales feeding Cephaloleia and most Cephaloleia species (the 'Zingiberales-feeding clade'). Two notable clades are found within the 'Zingiberales-feeding clade.' One is comprised of beetles that normally feed only on the young rolled leaves of plants in the families Heliconiaceae and Marantaceae (the 'Heliconiaceae & Marantaceae-feeding clade'). The other is comprised of relative host tissue generalist, primarily Zingiberales feeding species (the 'generalist-feeding clade'). A few species in the 'generalist-feeding clade' utilize Cyperaceae or Poaceae as hosts. Overall, relatively basal Cephaloleia (e.g., the 'Arecaceae clade') feed on relatively basal monocots (e.g., Cyclanthaceae and Arecaceae), and relatively derived Cephaloleia (e.g., the 'Zingiberales-feeding clade') feed on relatively derived monocots (mostly in the order Zingiberales). Zingiberales feeding and specialization on young rolled Zingiberales leaves have each apparently evolved just once in Cephaloleia.
Shaw, A J
2000-05-01
Nucleotide sequence variation in the ITS1-5.8S-ITS2 region of nuclear ribosomal DNA (nrDNA) from 70 populations of Mielichhoferia elongata and M. mielichhoferiana, plus two outgroup species, was analysed using maximum parsimony and maximum likelihood methods. High levels of nucleotide substitution and numerous insertion-deletion events were detected within and between the two species. M. elongata is monophyletic with regard to nrDNA variation, but M. mielichhoferiana is paraphyletic. (M. elongata is nested within it.) A clade within M. mielichhoferiana provides evidence of vicariance, with North American and Scandinavian sister groups of populations. Two major clades are resolved in M. elongata by sequence data that are completely congruent with previous isozyme work. One clade includes populations from both North America and Europe whereas the other is strictly North American. These two clades, resolved by multiple independent loci, clearly represent cryptic species within the morphologically uniform M. elongata. Certain geographical areas, most notably southwestern Colorado in Ouray and San Juan Counties, harbour diverse populations of M. elongata with distinct phylogenetic and phylogeographical histories. Morphologically indistinguishable but phylogenetically distant populations were detected a few metres apart at one site. In contrast, all populations collected over hundreds of kilometres in California belong to a single clade. Arctic North American populations belong to a clade that includes disjunct populations in Alaska, northern Ellesmere Island, and the northeastern USA, but not subarctic Swedish populations, which are more closely related to plants from the Rocky Mountains. Morphological uniformity belies complex infraspecific phylogenetic patterns within M. elongata and M. mielichhoferiana.
Agaricus section Xanthodermatei: a phylogenetic reconstruction with commentary on taxa.
Kerrigan, Richard W; Callac, Philippe; Guinberteau, Jacques; Challen, Michael P; Parra, Luis A
2005-01-01
Agaricus section Xanthodermatei comprises a group of species allied to A. xanthodermus and generally characterized by basidiomata having phenolic odors, transiently yellowing discolorations in some parts of the basidiome, Schaeffer's reaction negative, and mild to substantial toxicity. The section has a global distribution, while most included species have distributions restricted to regions of single continents. Using specimens and cultures from Europe, North America, and Hawaii, we analyzed DNA sequences from the ITS1+2 region of the nuclear rDNA to identify and characterize phylogenetically distinct entities and to construct a hypothesis of relationships, both among members of the section and with representative taxa from other sections of the genus. 61 sequences from affiliated taxa, plus 20 from six (or seven) other sections of Agaricus, and one Micropsalliota sequence, were evaluated under distance, maximum parsimony and maximum likelihood methods. We recognized 21 discrete entities in Xanthodermatei, including 14 established species and 7 new ones, three of which are described elsewhere. Four species from California, New Mexico, and France deserve further study before they are described. Type studies of American taxa are particularly emphasized, and a lectotype is designated for A. californicus. Section Xanthodermatei formed a single clade in most analyses, indicating that the traditional sectional characters noted above are good unifying characters that appear to have arisen only once within Agaricus. Deep divisions within the sequence-derived structure of the section could be interpreted as subsections in Xanthodermatei; however, various considerations led us to refrain from proposing new supraspecific taxa. The nearest neighbors of section Xanthodermatei are putatively in section Duploannulati.
Phylogenetic analysis of Pasteuria penetrans by use of multiple genetic loci.
Charles, Lauren; Carbone, Ignazio; Davies, Keith G; Bird, David; Burke, Mark; Kerry, Brian R; Opperman, Charles H
2005-08-01
Pasteuria penetrans is a gram-positive, endospore-forming eubacterium that apparently is a member of the Bacillus-Clostridium clade. It is an obligate parasite of root knot nematodes (Meloidogyne spp.) and preferentially grows on the developing ovaries, inhibiting reproduction. Root knot nematodes are devastating root pests of economically important crop plants and are difficult to control. Consequently, P. penetrans has long been recognized as a potential biocontrol agent for root knot nematodes, but the fastidious life cycle and the obligate nature of parasitism have inhibited progress on mass culture and deployment. We are currently sequencing the genome of the Pasteuria bacterium and have performed amino acid level analyses of 33 bacterial species (including P. penetrans) using concatenation of 40 housekeeping genes, with and without insertions/deletions (indels) removed, and using each gene individually. By application of maximum-likelihood, maximum-parsimony, and Bayesian methods to the resulting data sets, P. penetrans was found to cluster tightly, with a high level of confidence, in the Bacillus class of the gram-positive, low-G+C-content eubacteria. Strikingly, our analyses identified P. penetrans as ancestral to Bacillus spp. Additionally, all analyses revealed that P. penetrans is surprisingly more closely related to the saprophytic extremophile Bacillus halodurans and Bacillus subtilis than to the pathogenic species Bacillus anthracis and Bacillus cereus. Collectively, these findings strongly imply that P. penetrans is an ancient member of the Bacillus group. We suggest that P. penetrans may have evolved from an ancient symbiotic bacterial associate of nematodes, possibly as the root knot nematode evolved to be a highly specialized parasite of plants.
Phylogeny of haemosporidian blood parasites revealed by a multi-gene approach.
Borner, Janus; Pick, Christian; Thiede, Jenny; Kolawole, Olatunji Matthew; Kingsley, Manchang Tanyi; Schulze, Jana; Cottontail, Veronika M; Wellinghausen, Nele; Schmidt-Chanasit, Jonas; Bruchhaus, Iris; Burmester, Thorsten
2016-01-01
The apicomplexan order Haemosporida is a clade of unicellular blood parasites that infect a variety of reptilian, avian and mammalian hosts. Among them are the agents of human malaria, parasites of the genus Plasmodium, which pose a major threat to human health. Illuminating the evolutionary history of Haemosporida may help us in understanding their enormous biological diversity, as well as tracing the multiple host switches and associated acquisitions of novel life-history traits. However, the deep-level phylogenetic relationships among major haemosporidian clades have remained enigmatic because the datasets employed in phylogenetic analyses were severely limited in either gene coverage or taxon sampling. Using a PCR-based approach that employs a novel set of primers, we sequenced fragments of 21 nuclear genes from seven haemosporidian parasites of the genera Leucocytozoon, Haemoproteus, Parahaemoproteus, Polychromophilus and Plasmodium. After addition of genomic data from 25 apicomplexan species, the unreduced alignment comprised 20,580 bp from 32 species. Phylogenetic analyses were performed based on nucleotide, codon and amino acid data employing Bayesian inference, maximum likelihood and maximum parsimony. All analyses resulted in highly congruent topologies. We found consistent support for a basal position of Leucocytozoon within Haemosporida. In contrast to all previous studies, we recovered a sister group relationship between the genera Polychromophilus and Plasmodium. Within Plasmodium, the sauropsid and mammal-infecting lineages were recovered as sister clades. Support for these relationships was high in nearly all trees, revealing a novel phylogeny of Haemosporida, which is robust to the choice of the outgroup and the method of tree inference. Copyright © 2015 Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Komatitsch, Dimitri; Xie, Zhinan; Bozdaǧ, Ebru; Sales de Andrade, Elliott; Peter, Daniel; Liu, Qinya; Tromp, Jeroen
2016-09-01
We introduce a technique to compute exact anelastic sensitivity kernels in the time domain using parsimonious disk storage. The method is based on a reordering of the time loop of time-domain forward/adjoint wave propagation solvers combined with the use of a memory buffer. It avoids instabilities that occur when time-reversing dissipative wave propagation simulations. The total number of required time steps is unchanged compared to usual acoustic or elastic approaches. The cost is reduced by a factor of 4/3 compared to the case in which anelasticity is partially accounted for by accommodating the effects of physical dispersion. We validate our technique by performing a test in which we compare the Kα sensitivity kernel to the exact kernel obtained by saving the entire forward calculation. This benchmark confirms that our approach is also exact. We illustrate the importance of including full attenuation in the calculation of sensitivity kernels by showing significant differences with physical-dispersion-only kernels.
ESTimating plant phylogeny: lessons from partitioning
de la Torre, Jose EB; Egan, Mary G; Katari, Manpreet S; Brenner, Eric D; Stevenson, Dennis W; Coruzzi, Gloria M; DeSalle, Rob
2006-01-01
Background: While Expressed Sequence Tags (ESTs) have proven a viable and efficient way to sample genomes, particularly those for which whole-genome sequencing is impractical, phylogenetic analysis using ESTs remains difficult. Sequencing errors and orthology determination are the major problems when using ESTs as a source of characters for systematics. Here we develop methods to incorporate EST sequence information in a simultaneous analysis framework to address controversial phylogenetic questions regarding the relationships among the major groups of seed plants. We use an automated, phylogenetically derived approach to orthology determination called OrthologID to generate a phylogeny based on 43 process partitions, many of which are derived from ESTs, and examine several measures of support to assess the utility of EST data for phylogenies. Results: A maximum parsimony (MP) analysis resulted in a single tree with relatively high support at all nodes in the tree despite rampant conflict among trees generated from the separate analysis of individual partitions. In a comparison of broader-scale groupings based on cellular compartment (i.e., chloroplast, mitochondrial or nuclear) or function, only the nuclear partition tree (based largely on EST data) was found to be topologically identical to the tree based on the simultaneous analysis of all data. Despite topological conflict among the broader-scale groupings examined, only the tree based on morphological data showed statistically significant differences. Conclusion: Based on the amount of character support contributed by EST data, which make up a majority of the nuclear data set, and the lack of conflict of the nuclear data set with the simultaneous analysis tree, we conclude that the inclusion of EST data does provide a viable and efficient approach to address phylogenetic questions within a parsimony framework on a genomic scale, if problems of orthology determination and potential sequencing errors can be overcome.
In addition, approaches that examine conflict and support in a simultaneous analysis framework allow for a more precise understanding of the evolutionary history of individual process partitions and may be a novel way to understand functional aspects of different kinds of cellular classes of gene products. PMID:16776834
Phylogenetic relationships, diversification and expansion of chili peppers (Capsicum, Solanaceae).
Carrizo García, Carolina; Barfuss, Michael H J; Sehr, Eva M; Barboza, Gloria E; Samuel, Rosabelle; Moscone, Eduardo A; Ehrendorfer, Friedrich
2016-07-01
Capsicum (Solanaceae), native to the tropical and temperate Americas, comprises the well-known sweet and hot chili peppers and several wild species. So far, only partial taxonomic and phylogenetic analyses have been done for the genus. Here, the phylogenetic relationships between nearly all taxa of Capsicum were explored to test the monophyly of the genus and to obtain a better knowledge of species relationships, diversification and expansion. Thirty-four of approximately 35 Capsicum species were sampled. Maximum parsimony and Bayesian inference analyses were performed using two plastid markers (matK and psbA-trnH) and one single-copy nuclear gene (waxy). The evolutionary changes of nine key features were reconstructed following the parsimony ancestral states method. Ancestral areas were reconstructed through a Bayesian Markov chain Monte Carlo analysis. Capsicum forms a monophyletic clade, with Lycianthes as a sister group, following both phylogenetic approaches. Eleven well-supported clades (four of them monotypic) can be recognized within Capsicum, although some interspecific relationships need further analysis. A few features are useful to characterize different clades (e.g. fruit anatomy, chromosome base number), whereas some others are highly homoplastic (e.g. seed colour). The origin of Capsicum is postulated in an area along the Andes of western to north-western South America. The expansion of the genus has followed a clockwise direction around the Amazon basin, towards central and south-eastern Brazil, then back to western South America, and finally northwards to Central America. New insights are provided regarding interspecific relationships, character evolution, and geographical origin and expansion of Capsicum. A clearly distinct early-diverging clade can be distinguished, centred in western-north-western South America. Subsequent rapid speciation has led to the origin of the remaining clades.
The diversification of Capsicum has culminated in the origin of the main cultivated species in several regions of South to Central America. © The Author 2016. Published by Oxford University Press on behalf of the Annals of Botany Company. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
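For readers unfamiliar with parsimony-based ancestral state reconstruction, the sketch below shows the bottom-up pass of Fitch's algorithm on a rooted binary tree. The abstract does not say which parsimony algorithm or software was used, so Fitch's method is an assumption made here for illustration, and the tree, taxon names and trait states are hypothetical.

```python
def fitch(tree, states):
    """Bottom-up Fitch pass for one unordered character.

    `tree` is a leaf name (str) or a pair (left, right) of subtrees.
    `states` maps each leaf name to its observed state.
    Returns (candidate state set at this node, minimum number of changes).
    """
    if isinstance(tree, str):
        return {states[tree]}, 0
    left, right = tree
    left_set, left_cost = fitch(left, states)
    right_set, right_cost = fitch(right, states)
    shared = left_set & right_set
    if shared:
        # Children agree on at least one state: no extra change needed.
        return shared, left_cost + right_cost
    # Children disagree: keep both options and count one change.
    return left_set | right_set, left_cost + right_cost + 1


# Hypothetical five-taxon tree and a two-state trait.
toy_tree = ((("a", "b"), "c"), ("d", "e"))
toy_states = {"a": "red", "b": "red", "c": "yellow", "d": "yellow", "e": "yellow"}

root_states, changes = fitch(toy_tree, toy_states)
print(root_states, changes)  # {'yellow'} 1
```

A trait that needs many changes relative to its number of states on the accepted tree would be called homoplastic in the sense used in the abstract.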
Consequence Valuing as Operation and Process: A Parsimonious Analysis of Motivation
ERIC Educational Resources Information Center
Whelan, Robert; Barnes-Holmes, Dermot
2010-01-01
The concept of the motivating operation (MO) has been subject to 3 criticisms: (a) the terms and concepts employed do not always overlap with traditional behavior-analytic verbal practices; (b) the dual nature of the MO is unclear; and (c) there is a lack of adequate contact with empirical data. We offer a more parsimonious approach to motivation,…
Subbotin, Sergei A; Ragsdale, Erik J; Mullens, Teresa; Roberts, Philip A; Mundo-Ocampo, Manuel; Baldwin, James G
2008-08-01
The root lesion nematodes of the genus Pratylenchus Filipjev, 1936 are migratory endoparasites of plant roots, considered among the most widespread and important nematode parasites in a variety of crops. We obtained gene sequences from the D2 and D3 expansion segments of 28S rRNA partial and 18S rRNA from 31 populations belonging to 11 valid and two unidentified species of root lesion nematodes and five outgroup taxa. These datasets were analyzed using maximum parsimony and Bayesian inference. The alignments were generated using the secondary structure models for these molecules and analyzed with Bayesian inference under the standard models and the complex model, considering helices under the doublet model and loops and bulges under the general time reversible model. The phylogenetic informativeness of morphological characters is tested by reconstruction of their histories on rRNA based trees using parallel parsimony and Bayesian approaches. Phylogenetic and sequence analyses of the 28S D2-D3 dataset with 145 accessions for 28 species and 18S dataset with 68 accessions for 15 species confirmed among large numbers of geographical diverse isolates that most classical morphospecies are monophyletic. Phylogenetic analyses revealed at least six distinct major clades of examined Pratylenchus species and these clades are generally congruent with those defined by characters derived from lip patterns, numbers of lip annules, and spermatheca shape. Morphological results suggest the need for sophisticated character discovery and analysis for morphology based phylogenetics in nematodes.
Wei Wu; James Clark; James Vose
2010-01-01
Hierarchical Bayesian (HB) modeling allows for multiple sources of uncertainty by factoring complex relationships into conditional distributions that can be used to draw inference and make predictions. We applied an HB model to estimate the parameters and state variables of a parsimonious hydrological model, GR4J, by coherently assimilating the uncertainties from the...
NASA Astrophysics Data System (ADS)
Xu, D.; Agee, E.; Wang, J.; Ivanov, V. Y.
2017-12-01
The increased frequency and severity of droughts in the Amazon region have emphasized the potential vulnerability of the rainforests to heat and drought-induced stresses, highlighting the need to reduce the uncertainty in estimates of regional evapotranspiration (ET) and quantify resilience of the forest. Ground-based observations for estimating ET are resource intensive, making methods based on remotely sensed observations an attractive alternative. Several methodologies have been developed to estimate ET from satellite data, but challenges remain in model parameterization and limited satellite coverage, reducing their utility for monitoring biodiverse regions. In this work, we apply a novel surface energy partition method (Maximum Entropy Production; MEP) based on Bayesian probability theory and nonequilibrium thermodynamics to derive ET time series using satellite data for the Amazon basin. For a large, sparsely monitored region such as the Amazon, this approach has the advantage of using only single-level measurements of net radiation, temperature, and specific humidity. Furthermore, it is not sensitive to the uncertainty of the input data and model parameters. In this first application of MEP theory for a tropical forest biome, we assess its performance at various spatiotemporal scales against diverse field data sets. Specifically, the objective of this work is to test this method using eddy flux data for several locations across Amazonia at sub-daily, monthly, and annual scales and compare the new estimates with those using traditional methods. Analyses of the derived ET time series will contribute to reducing the current knowledge gap surrounding the much debated response of the Amazon Basin region to droughts and offer a template for monitoring the long-term changes in global hydrologic cycle due to anthropogenic and natural causes.
Equally parsimonious pathways through an RNA sequence space are not equally likely
NASA Technical Reports Server (NTRS)
Lee, Y. H.; DSouza, L. M.; Fox, G. E.
1997-01-01
An experimental system for determining the potential ability of sequences resembling 5S ribosomal RNA (rRNA) to perform as functional 5S rRNAs in vivo in the Escherichia coli cellular environment was devised previously. Presumably, the only 5S rRNA sequences that would have been fixed by ancestral populations are ones that were functionally valid, and hence the actual historical paths taken through RNA sequence space during 5S rRNA evolution would have most likely utilized valid sequences. Herein, we examine the potential validity of all sequence intermediates along alternative equally parsimonious trajectories through RNA sequence space which connect two pairs of sequences that had previously been shown to behave as valid 5S rRNAs in E. coli. The first trajectory requires a total of four changes. The 14 sequence intermediates provide 24 apparently equally parsimonious paths by which the transition could occur. The second trajectory involves three changes, six intermediate sequences, and six potentially equally parsimonious paths. In total, only eight of the 20 sequence intermediates were found to be clearly invalid. As a consequence of the position of these invalid intermediates in the sequence space, seven of the 30 possible paths consisted of exclusively valid sequences. In several cases, the apparent validity/invalidity of the intermediate sequences could not be anticipated on the basis of current knowledge of the 5S rRNA structure. This suggests that the interdependencies in RNA sequence space may be more complex than currently appreciated. If ancestral sequences predicted by parsimony are to be regarded as actual historical sequences, then the present results would suggest that they should also satisfy a validity requirement and that, in at least limited cases, this conjecture can be tested experimentally.
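The path and intermediate counts quoted in this abstract follow from simple combinatorics: if two sequences differ at n positions, there are n! orderings of the changes and 2^n − 2 distinct intermediates. The Python sketch below enumerates them and filters out paths that pass through an invalid intermediate. The sequences and the invalid set are placeholders chosen for illustration, not the 5S rRNA variants tested in the study.

```python
from itertools import permutations


def parsimonious_paths(start, end):
    """List every shortest path between two equal-length sequences.

    Each path is a tuple of the intermediate sequences visited,
    excluding `start` and `end` themselves.
    """
    differing = [i for i, (a, b) in enumerate(zip(start, end)) if a != b]
    paths = []
    for order in permutations(differing):
        current = list(start)
        intermediates = []
        # The last change produces `end`, so it is not an intermediate.
        for position in order[:-1]:
            current[position] = end[position]
            intermediates.append("".join(current))
        paths.append(tuple(intermediates))
    return paths


def count_valid_paths(paths, invalid):
    """Count paths that avoid every sequence in `invalid`."""
    return sum(1 for path in paths if not any(seq in invalid for seq in path))


# Placeholder sequences differing at four and at three positions.
four_change_paths = parsimonious_paths("AAAA", "CCCC")
three_change_paths = parsimonious_paths("AAA", "CCC")

print(len(four_change_paths), len({s for p in four_change_paths for s in p}))    # 24 14
print(len(three_change_paths), len({s for p in three_change_paths for s in p}))  # 6 6

# Marking one first-step intermediate as invalid removes 3! = 6 of the 24 paths.
print(count_valid_paths(four_change_paths, {"CAAA"}))  # 18
```

The two trajectories together give the 20 intermediates and 30 paths mentioned in the abstract; how many paths survive depends on where the invalid intermediates sit, which is why only seven of the 30 remained fully valid in the reported experiment.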
Molecular phylogeny of Gavilea (Chloraeinae: Orchidaceae) using plastid and nuclear markers.
Chemisquy, M Amelia; Morrone, Osvaldo
2012-03-01
A phylogenetic analysis is provided for 70% of the representatives of genus Gavilea, as well as for several species of the remaining genera of subtribe Chloraeinae: Bipinnula, Chloraea and Geoblasta. Sequences from the plastid markers rpoC1, matK-trnK and atpB-rbcL and the nuclear marker ITS were analyzed using Maximum Parsimony and Bayesian Inference. Monophyly of subtribe Chloraeinae was confirmed, as well as its position inside tribe Cranichideae. Neither Chloraea nor Bipinnula was recovered as monophyletic. Gavilea turned out to be polyphyletic, with Chloraea chica embedded in the genus, while Gavilea supralabellata was related to Chloraea and might be a hybrid between both genera. Neither of the two sections of Gavilea was monophyletic, and the topologies obtained do not suggest a new division of the genus. Copyright © 2011 Elsevier Inc. All rights reserved.
Dynamical minimalism: why less is more in psychology.
Nowak, Andrzej
2004-01-01
The principle of parsimony, embraced in all areas of science, states that simple explanations are preferable to complex explanations in theory construction. Parsimony, however, can necessitate a trade-off with depth and richness in understanding. The approach of dynamical minimalism avoids this trade-off. The goal of this approach is to identify the simplest mechanisms and fewest variables capable of producing the phenomenon in question. A dynamical model in which change is produced by simple rules repetitively interacting with each other can exhibit unexpected and complex properties. It is thus possible to explain complex psychological and social phenomena with very simple models if these models are dynamic. In dynamical minimalist theories, then, the principle of parsimony can be followed without sacrificing depth in understanding. Computer simulations have proven especially useful for investigating the emergent properties of simple models.
Zhang, Jinju; Li, Zuozhou; Fritsch, Peter W.; Tian, Hua; Yang, Aihong; Yao, Xiaohong
2015-01-01
Background and Aims The phylogeography of plant species in sub-tropical China remains largely unclear. This study used Tapiscia sinensis, an endemic and endangered tree species widely but disjunctly distributed in sub-tropical China, as a model to reveal the patterns of genetic diversity and phylogeographical history of Tertiary relict plant species in this region. The implications of the results are discussed in relation to its conservation management. Methods Samples were taken from 24 populations covering the natural geographical distribution of T. sinensis. Genetic structure was investigated by analysis of molecular variance (AMOVA) and spatial analysis of molecular variance (SAMOVA). Phylogenetic relationships among haplotypes were constructed with maximum parsimony and haplotype network methods. Historical population expansion events were tested with pairwise mismatch distribution analysis and neutrality tests. Species potential range was deduced by ecological niche modelling (ENM). Key Results A low level of genetic diversity was detected at the population level. A high level of genetic differentiation and a significant phylogeographical structure were revealed. The mean divergence time of the haplotypes was approx. 1·33 million years ago. Recent range expansion in this species is suggested by a star-like haplotype network and by the results from the mismatch distribution analysis and neutrality tests. Conclusions Climatic oscillations during the Pleistocene have had pronounced effects on the extant distribution of Tapiscia relative to the Last Glacial Maximum (LGM). Spatial patterns of molecular variation and ENM suggest that T. sinensis may have retreated in south-western and central China and colonized eastern China prior to the LGM. Multiple montane refugia for T. sinensis existing during the LGM are inferred in central and western China. The populations adjacent to or within these refugia of T. sinensis should be given high priority in the development of conservation policies and management strategies for this endangered species. PMID:26187222
Appelhans, M. S.; Smets, E.; Razafimandimbison, S. G.; Haevermans, T.; van Marle, E. J.; Couloux, A.; Rabarison, H.; Randrianarivelojosia, M.; Keßler, P. J. A.
2011-01-01
Background and Aims The Spathelia–Ptaeroxylon clade is a group of morphologically diverse plants that have been classified together as a result of molecular phylogenetic studies. The clade is currently included in Rutaceae and recognized at a subfamilial level (Spathelioideae) despite the fact that most of its genera have traditionally been associated with other families and that there are no obvious morphological synapomorphies for the clade. The aim of the present study is to construct phylogenetic trees for the Spathelia–Ptaeroxylon clade and to investigate anatomical characters in order to decide whether it should be kept in Rutaceae or recognized at the familial level. Anatomical characters were plotted on a cladogram to help explain character evolution within the group. Moreover, phylogenetic relationships and generic limits within the clade are also addressed. Methods A species-level phylogenetic analysis of the Spathelia–Ptaeroxylon clade based on five plastid DNA regions (rbcL, atpB, trnL–trnF, rps16 and psbA–trnH) was conducted using Bayesian, maximum parsimony and maximum likelihood methods. Leaf and seed anatomical characters of all genera were (re)investigated by light and scanning electron microscopy. Key Results With the exception of Spathelia, all genera of the Spathelia–Ptaeroxylon clade are monophyletic. The typical leaf and seed anatomical characters of Rutaceae were found. Further, the presence of oil cells in the leaves provides a possible synapomorphy for the clade. Conclusions The Spathelia–Ptaeroxylon clade is well placed in Rutaceae and it is reasonable to unite the genera into one subfamily (Spathelioideae). We propose a new tribal classification of Spathelioideae. A narrow circumscription of Spathelia is established to make the genus monophyletic, and Sohnreyia is resurrected to accommodate the South American species of Spathelia. The most recent common ancestor of Spathelioideae probably had leaves with secretory cavities and oil cells, haplostemonous flowers with appendaged staminal filaments, and a tracheidal tegmen. PMID:21610209
Negrisolo, Enrico; Kuhl, Heiner; Forcato, Claudio; Vitulo, Nicola; Reinhardt, Richard; Patarnello, Tomaso; Bargelloni, Luca
2010-12-01
Comparative genomics holds the promise to magnify the information obtained from individual genome sequencing projects, revealing common features conserved across genomes and identifying lineage-specific characteristics. To implement such a comparative approach, a robust phylogenetic framework is required to accurately reconstruct evolution at the genome level. Among vertebrate taxa, teleosts represent the second best characterized group, with high-quality draft genome sequences for five model species (Danio rerio, Gasterosteus aculeatus, Oryzias latipes, Takifugu rubripes, and Tetraodon nigroviridis), and several others are in the finishing lane. However, the relationships among the acanthomorph teleost model fishes remain an unresolved taxonomic issue. Here, a genomic region spanning over 1.2 million base pairs was sequenced in the teleost fish Dicentrarchus labrax. Together with genomic data available for the above fish models, the new sequence was used to identify unique orthologous genomic regions shared across all target taxa. Different strategies were applied to produce robust multiple gene and genomic alignments spanning from 11,802 to 186,474 amino acid/nucleotide positions. Ten data sets were analyzed according to Bayesian inference, maximum likelihood, maximum parsimony, and neighbor joining methods. Extensive analyses were performed to explore the influence of several factors (e.g., alignment methodology, substitution model, data set partitions, and long-branch attraction) on the tree topology. Although a general consensus was observed for a closer relationship between G. aculeatus (Gasterosteidae) and Di. labrax (Moronidae) with the atherinomorph O. latipes (Beloniformes) sister taxon of this clade, with the tetraodontiform group Ta. rubripes and Te. nigroviridis (Tetraodontiformes) representing a more distantly related taxon among acanthomorph model fish species, conflicting results were obtained between data sets and methods, especially with respect to the choice of alignment methodology applied to noncoding parts of the genomic region under study. This may limit the use of intergenic/noncoding sequences in phylogenomics until more robust alignment algorithms are developed.
Gervais, Matthew M; Fessler, Daniel M T
2017-01-01
The target article argues that contempt is a sentiment, and that sentiments are the deep structure of social affect. The 26 commentaries meet these claims with a range of exciting extensions and applications, as well as critiques. Most significantly, we reply that construction and emergence are necessary for, not incompatible with, evolved design, while parsimony requires explanatory adequacy and predictive accuracy, not mere simplicity.
NASA Astrophysics Data System (ADS)
Sardet, Laure; Patilea, Valentin
When pricing a specific insurance premium, an actuary needs to evaluate the claims cost distribution for the warranty. Traditional actuarial methods use parametric specifications to model the claims distribution, such as lognormal, Weibull and Pareto laws. Mixtures of such distributions improve the flexibility of the parametric approach and seem to be quite well adapted to capture the skewness, the long tails as well as the unobserved heterogeneity among the claims. In this paper, instead of looking for a finely tuned mixture with many components, we choose a parsimonious mixture model, typically a two- or three-component mixture. Next, we use the mixture cumulative distribution function (CDF) to transform data into the unit interval where we apply a beta-kernel smoothing procedure. A bandwidth rule adapted to our methodology is proposed. Finally, the beta-kernel density estimate is back-transformed to recover an estimate of the original claims density. The beta-kernel smoothing provides an automatic fine-tuning of the parsimonious mixture and thus avoids inference in more complex mixture models with many parameters. We investigate the empirical performance of the new method in the estimation of the quantiles with simulated nonnegative data and the quantiles of the individual claims distribution in a non-life insurance application.
Bastian, S T; Tanaka, K; Anunciado, R V P; Natural, N G; Sumalde, A C; Namikawa, T
2002-04-01
Six flying fox species, genus Pteropus (four from the Philippines), were investigated using complete cytochrome b gene sequences (1140 bp) to infer their evolutionary relationships. The DNA sequences generated via polymerase chain reaction were analyzed using the neighbor-joining, parsimony, and maximum likelihood methods. We estimated that the first evolutionary event among these Pteropus species occurred approximately 13.90 +/- 1.49 MYA. Within this short period of evolutionary time we further hypothesized that the ancestors of the flying foxes found in the Philippines experienced a subsequent diversification forming two clusters in the topology. The first cluster is composed of P. pumilus (Philippine endemic), P. speciosus (restricted to western Mindanao) with P. scapulatus, while the second one comprised P. vampyrus and P. dasymallus species based on the analysis from first and second codon positions. Consistently, all phylogenetic analyses revealed a close association of P. dasymallus with P. vampyrus, contradicting the previous report categorizing P. dasymallus under the subniger species group with P. pumilus, P. speciosus, and P. hypomelanus. The Philippine endemic species (P. pumilus) is closely linked with P. speciosus. The representative samples of P. vampyrus showed a large genetic distance of 1.87%. The large genetic distance between P. dasymallus and P. hypomelanus, P. pumilus and P. speciosus denotes a distinct species group.
Lovette, I.J.; Perez-Eman, J. L.; Sullivan, J.P.; Banks, R.C.; Fiorentino, I.; Cordoba-Cordoba, S.; Echeverry-Galvis, M.; Barker, F.K.; Burns, K.J.; Klicka, J.; Lanyon, Scott M.; Bermingham, E.
2010-01-01
The birds in the family Parulidae, commonly termed the New World warblers or wood-warblers, are a classic model radiation for studies of ecological and behavioral differentiation. Although the monophyly of a 'core' wood-warbler clade is well established, no phylogenetic hypothesis for this group has included a full sampling of wood-warbler species diversity. We used parsimony, maximum likelihood, and Bayesian methods to reconstruct relationships among all genera and nearly all wood-warbler species, based on a matrix of mitochondrial DNA (5840 nucleotides) and nuclear DNA (6 loci, 4602 nucleotides) characters. The resulting phylogenetic hypotheses provide a highly congruent picture of wood-warbler relationships, and indicate that the traditional generic classification of these birds recognizes many non-monophyletic groups. We recommend a revised taxonomy in which each of 14 genera (Seiurus, Helmitheros, Mniotilta, Limnothlypis, Protonotaria, Parkesia, Vermivora, Oreothlypis, Geothlypis, Setophaga, Myioborus, Cardellina, Basileuterus, Myiothlypis) corresponds to a well-supported clade; these nomenclatural changes also involve subsuming a number of well-known, traditional wood-warbler genera (Catharopeza, Dendroica, Ergaticus, Euthlypis, Leucopeza, Oporornis, Parula, Phaeothlypis, Wilsonia). We provide a summary phylogenetic hypothesis that will be broadly applicable to investigations of the historical biogeography, processes of diversification, and evolution of trait variation in this well studied avian group. © 2010 Elsevier Inc.
TeachEnG: a Teaching Engine for Genomics.
Kim, Minji; Kim, Yeonsung; Qian, Lei; Song, Jun S
2017-10-15
Bioinformatics is a rapidly growing field that has emerged from the synergy of computer science, statistics and biology. Given the interdisciplinary nature of bioinformatics, many students from diverse fields struggle with grasping bioinformatic concepts only from classroom lectures. Interactive tools for helping students reinforce their learning would thus be desirable. Here, we present an interactive online educational tool called TeachEnG (acronym for Teaching Engine for Genomics) for reinforcing key concepts in sequence alignment and phylogenetic tree reconstruction. Our instructional games allow students to align sequences by hand, fill out the dynamic programming matrix in the Needleman-Wunsch global sequence alignment algorithm, and reconstruct phylogenetic trees via the maximum parsimony, Unweighted Pair Group Method with Arithmetic mean (UPGMA) and Neighbor-Joining algorithms. With an easily accessible interface and instant visual feedback, TeachEnG will help promote active learning in bioinformatics. TeachEnG is freely available at http://teacheng.illinois.edu. The source code is available from https://github.com/KnowEnG/TeachEnG under the Artistic License 2.0. It is written in JavaScript and compatible with Firefox, Safari, Chrome and Microsoft Edge. © The Author 2017. Published by Oxford University Press. All rights reserved.
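As an illustration of the dynamic programming matrix that the abstract above says students fill out, here is a minimal Python sketch of Needleman-Wunsch global alignment. It is not taken from the TeachEnG source code (which is written in JavaScript); the function name and the scoring values (match +1, mismatch -1, linear gap -1) are assumptions chosen for the example.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-1):
    """Global alignment of strings a and b.

    Returns (score, aligned_a, aligned_b). The scoring scheme is an
    illustrative assumption, not one prescribed by the abstract.
    """
    n, m = len(a), len(b)
    # score[i][j] holds the best score for aligning a[:i] with b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap          # a[:i] aligned against gaps only
    for j in range(1, m + 1):
        score[0][j] = j * gap          # b[:j] aligned against gaps only

    def sub(i, j):
        # Substitution score for pairing a[i-1] with b[j-1].
        return match if a[i - 1] == b[j - 1] else mismatch

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(
                score[i - 1][j - 1] + sub(i, j),  # pair the two characters
                score[i - 1][j] + gap,            # gap in b
                score[i][j - 1] + gap,            # gap in a
            )

    # Trace back from the bottom-right cell to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub(i, j):
            out_a.append(a[i - 1]); out_b.append(b[j - 1])
            i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-")
            i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1])
            j -= 1
    return score[n][m], "".join(reversed(out_a)), "".join(reversed(out_b))
```

Calling `needleman_wunsch("AC", "A")` fills a 3 x 2 matrix and returns one optimal alignment together with its score. Ties between equally good alignments are broken in favour of the diagonal move, which is a design choice of this sketch.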
Phylogenetic reconstruction of South American felids defined by protein electrophoresis.
Slattery, J P; Johnson, W E; Goldman, D; O'Brien, S J
1994-09-01
Phylogenetic associations among six closely related South American felid species were defined by changes in protein-encoding gene loci. We analyzed proteins isolated from skin fibroblasts using two-dimensional electrophoresis and allozymes extracted from blood cells. Genotypes were determined for multiple individuals of ocelot, margay, tigrina, Geoffroy's cat, kodkod, and pampas cat at 548 loci resolved by two-dimensional electrophoresis and 44 allozyme loci. Phenograms were constructed using the methods of Fitch-Margoliash and neighbor-joining on a matrix of Nei's unbiased genetic distances for all pairs of species. Results of a relative-rate test indicate changes in two-dimensional electrophoresis data are constant among all South American felids with respect to a hyena outgroup. Allelic frequencies were transformed to discrete character states for maximum parsimony analysis. Phylogenetic reconstruction indicates a major split occurred approximately 5-6 million years ago, leading to three groups within the ocelot lineage. The earliest divergence led to Leopardus tigrina, followed by a split between an ancestor of an unresolved trichotomy of three species (Oncifelis guigna, O. geoffroyi, and Lynchailurus colocolo) and a recent common ancestor of Leopardus pardalis and L. wiedii. The results suggest that modern South American felids are monophyletic and evolved rapidly after the formation of the Panama land bridge between North and South America.
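To make concrete what a parsimony analysis of discrete character states counts, the following Python sketch scores a single character on a fixed rooted binary tree using Fitch's algorithm. This is a standard textbook method and is offered only as an illustration; the abstract does not say which software or algorithm the authors used, and the tree, taxon labels (`t1` to `t4`) and character states below are hypothetical.

```python
def fitch(tree, states):
    """Fitch small-parsimony pass for one character.

    tree   : a leaf name (str) or a 2-tuple (left_subtree, right_subtree)
    states : dict mapping each leaf name to its observed character state
    Returns (candidate_state_set, minimum_number_of_changes) for the subtree.
    """
    if isinstance(tree, str):
        # A leaf contributes its observed state and no changes.
        return {states[tree]}, 0
    left, right = tree
    left_set, left_changes = fitch(left, states)
    right_set, right_changes = fitch(right, states)
    common = left_set & right_set
    if common:
        # Children agree on at least one state: no extra change needed here.
        return common, left_changes + right_changes
    # Children disagree: take the union and count one state change.
    return left_set | right_set, left_changes + right_changes + 1


# Hypothetical four-taxon tree and one binary character.
example_tree = (("t1", "t2"), ("t3", "t4"))
example_states = {"t1": 0, "t2": 0, "t3": 1, "t4": 1}
root_set, changes = fitch(example_tree, example_states)
```

Summing the change counts over all characters gives the parsimony length of a tree. A maximum parsimony search then compares this length across candidate trees, which is the step this sketch leaves out.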
Mark Welch, David B; Cummings, Michael P; Hillis, David M; Meselson, Matthew
2004-02-10
Rotifers of the asexual class Bdelloidea are unusual in possessing two or more divergent copies of every gene that has been examined. Phylogenetic analysis of the heat-shock gene hsp82 and the TATA-box-binding protein gene tbp in multiple bdelloid species suggested that for each gene, each copy belonged to one of two lineages that began to diverge before the bdelloid radiation. Such gene trees are consistent with the two lineages having descended from former alleles that began to diverge after meiotic segregation ceased or from subgenomes of an alloploid ancestor of the bdelloids. However, the original analyses of bdelloid gene-copy divergence used only a single outgroup species and were based on parsimony and neighbor joining. We have now used maximum likelihood and Bayesian inference methods and, for hsp82, multiple outgroups in an attempt to produce more robust gene trees. Here we report that the available data do not unambiguously discriminate between gene trees that root the origin of hsp82 and tbp copy divergence before the bdelloid radiation and those which indicate that the gene copies began to diverge within bdelloid families. The remarkable presence of multiple diverged gene copies in individual genomes is nevertheless consistent with the loss of sex in an ancient ancestor of bdelloids.
Yap, Fook Choy; Yan, Yap Jin; Loon, Kiung Teh; Zhen, Justina Lee Ning; Kamau, Nelly Warau; Kumaran, Jayaraj Vijaya
2010-10-01
The present investigation was carried out to study the phylogenetic relationships of different breeds of domestic chickens in Peninsular Malaysia, inferred from partial cytochrome b gene information and random amplified polymorphic DNA (RAPD) markers. Phylogenetic analysis using both neighbor-joining (NJ) and maximum parsimony (MP) methods produced three clusters that encompassed Type-I village chickens, the red jungle fowl subspecies and the Japanese Chunky broilers. The phylogenetic analysis also revealed that the majority of the Malaysian commercial chickens were randomly assembled with the Type-II village chickens. In the RAPD assay, phylogenetic analysis using neighbor-joining produced six clusters that were completely distinguished based on the locality of chickens. High levels of genetic variation were observed among the village chickens, the commercial broilers, and between the commercial broilers and layer chickens. In this study, it was found that Type-I village chickens could be distinguished from the commercial chickens and Type-II village chickens at the position of the 27th nucleotide of the 351 bp cytochrome b gene. This study also revealed that RAPD markers were unable to differentiate the type of chickens, but it showed the effectiveness of RAPD in evaluating the genetic variation and the genetic relationships between chicken lines and populations.
A comprehensive molecular phylogeny for the hornbills (Aves: Bucerotidae).
Gonzalez, Juan-Carlos T; Sheldon, Ben C; Collar, Nigel J; Tobias, Joseph A
2013-05-01
The hornbills comprise a group of morphologically and behaviorally distinct Palaeotropical bird species that feature prominently in studies of ecology and conservation biology. Although the monophyly of hornbills is well established, previous phylogenetic hypotheses were based solely on mtDNA and limited sampling of species diversity. We used parsimony, maximum likelihood and Bayesian methods to reconstruct relationships among all 61 extant hornbill species, based on nuclear and mtDNA gene sequences extracted largely from historical samples. The resulting phylogenetic trees closely match vocal variation across the family but conflict with current taxonomic treatments. In particular, they highlight a new arrangement for the six major clades of hornbills and reveal that three groups traditionally treated as genera (Tockus, Aceros, Penelopides) are non-monophyletic. In addition, two other genera (Anthracoceros, Ocyceros) were non-monophyletic in the mtDNA gene tree. Our findings resolve some longstanding problems in hornbill systematics, including the placement of 'Penelopides exharatus' (embedded in Aceros) and 'Tockus hartlaubi' (sister to Tropicranus albocristatus). We also confirm that an Asiatic lineage (Berenicornis) is sister to a trio of Afrotropical genera (Tropicranus [including 'Tockus hartlaubi'], Ceratogymna, Bycanistes). We present a summary phylogeny as a robust basis for further studies of hornbill ecology, evolution and historical biogeography. Copyright © 2013. Published by Elsevier Inc.
Zeng, Xu; Yuan, Zhengrong; Tong, Xin; Li, Qiushi; Gao, Weiwei; Qin, Minjian; Liu, Zhihua
2012-05-01
Oryzoideae (Poaceae) plants have economic and ecological value. However, the phylogenetic position of some plants is not clear, such as Hygroryza aristata (Retz.) Nees. and Porteresia coarctata (Roxb.) Tateoka (syn. Oryza coarctata). Comprehensive molecular phylogenetic studies have been carried out on many genera in the Poaceae. Different DNA sequences, including nuclear and chloroplast sequences, have been extensively employed to determine relationships at both higher and lower taxonomic levels in the Poaceae. The chloroplast DNA ndhF gene and atpB-rbcL spacer were used to construct phylogenetic trees and estimate the divergence time of Oryzoideae, Bambusoideae, Panicoideae, Pooideae and so on. Complete sequences of atpB-rbcL and ndhF were generated for 17 species representing six species of the Oryzoideae and related subfamilies. Nicotiana tabacum L. was the outgroup species. The two DNA datasets were analyzed using Maximum Parsimony and Bayesian analysis methods. The molecular phylogeny revealed that H. aristata (Retz.) Nees was the sister to Chikusichloa aquatica Koidz. Moreover, P. coarctata (Roxb.) Tateoka was in the genus Oryza. Furthermore, the result of the evolution analysis, which was based on the ndhF marker, indicated that the time of origin of Oryzoideae might be 31 million years ago.
Meyer, Karin; Kirkpatrick, Mark
2005-01-01
Principal component analysis is a widely used 'dimension reduction' technique, albeit generally at a phenotypic level. It is shown that we can estimate genetic principal components directly through a simple reparameterisation of the usual linear, mixed model. This is applicable to any analysis fitting multiple, correlated genetic effects, whether effects for individual traits or sets of random regression coefficients to model trajectories. Depending on the magnitude of genetic correlation, a subset of the principal component generally suffices to capture the bulk of genetic variation. Corresponding estimates of genetic covariance matrices are more parsimonious, have reduced rank and are smoothed, with the number of parameters required to model the dispersion structure reduced from k(k + 1)/2 to m(2k - m + 1)/2 for k effects and m principal components. Estimation of these parameters, the largest eigenvalues and pertaining eigenvectors of the genetic covariance matrix, via restricted maximum likelihood using derivatives of the likelihood, is described. It is shown that reduced rank estimation can reduce computational requirements of multivariate analyses substantially. An application to the analysis of eight traits recorded via live ultrasound scanning of beef cattle is given. PMID:15588566
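The parameter counts quoted in the abstract above, k(k + 1)/2 for a full-rank covariance matrix and m(2k - m + 1)/2 for a rank-m one, can be checked with a few lines of Python. The function names are assumptions made for this sketch. The value k = 8 matches the eight traits mentioned in the abstract, while m = 3 is a hypothetical choice, since the abstract does not say how many principal components were fitted.

```python
def full_rank_params(k):
    """Number of distinct elements in a symmetric k x k covariance matrix."""
    return k * (k + 1) // 2


def reduced_rank_params(k, m):
    """Number of parameters when only m principal components are fitted."""
    if not 1 <= m <= k:
        raise ValueError("m must satisfy 1 <= m <= k")
    # m * (2k - m + 1) is always even, so integer division is exact.
    return m * (2 * k - m + 1) // 2


k = 8   # eight traits, as in the application mentioned in the abstract
m = 3   # hypothetical number of principal components
full = full_rank_params(k)          # 36
reduced = reduced_rank_params(k, m)  # 21
```

With these illustrative values the reduced-rank model needs 21 parameters instead of 36. Setting m = k recovers the full-rank count, which is a quick consistency check on the two formulas.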
Molecular phylogeny of the spoonbills (Aves: Threskiornithidae) based on mitochondrial DNA
Chesser, R. Terry; Yeung, Carol K.L.; Yao, Cheng-Te; Tian, Xiu-Hua; Li, Shou-Hsien
2010-01-01
Spoonbills (genus Platalea) are a small group of wading birds, generally considered to constitute the subfamily Plataleinae (Aves: Threskiornithidae). We reconstructed phylogenetic relationships among the six species of spoonbills using variation in sequences of the mitochondrial genes ND2 and cytochrome b (total 1796 bp). Topologies of phylogenetic trees reconstructed using maximum likelihood, maximum parsimony, and Bayesian analyses were virtually identical and supported monophyly of the spoonbills. Most relationships within Platalea received strong support: P. minor and P. regia were closely related sister species, P. leucorodia was sister to the minor-regia clade, and P. alba was sister to the minor-regia-leucorodia clade. Relationships of P. flavipes and P. ajaja were less well resolved: these species either formed a clade that was sister to the four-species clade, or were successive sisters to this clade. This phylogeny is consistent with ideas of relatedness derived from spoonbill morphology. Our limited sampling of the Threskiornithinae (ibises), the putative sister group to the spoonbills, indicated that this group is paraphyletic, in agreement with previous molecular data; this suggests that separation of the Threskiornithidae into subfamilies Plataleinae and Threskiornithinae may not be warranted.
Probable flood predictions in ungauged coastal basins of El Salvador
Friedel, M.J.; Smith, M.E.; Chica, A.M.E.; Litke, D.
2008-01-01
A regionalization procedure is presented and used to predict probable flooding in four ungauged coastal river basins of El Salvador: Paz, Jiboa, Grande de San Miguel, and Goascoran. The flood-prediction problem is sequentially solved for two regions: upstream mountains and downstream alluvial plains. In the upstream mountains, a set of rainfall-runoff parameter values and recurrent peak-flow discharge hydrographs are simultaneously estimated for 20 tributary-basin models. Application of dissimilarity equations among tributary basins (soft prior information) permitted development of a parsimonious parameter structure subject to information content in the recurrent peak-flow discharge values derived using regression equations based on measurements recorded outside the ungauged study basins. The estimated joint set of parameter values formed the basis from which probable minimum and maximum peak-flow discharge limits were then estimated revealing that prediction uncertainty increases with basin size. In the downstream alluvial plain, model application of the estimated minimum and maximum peak-flow hydrographs facilitated simulation of probable 100-year flood-flow depths in confined canyons and across unconfined coastal alluvial plains. The regionalization procedure provides a tool for hydrologic risk assessment and flood protection planning that is not restricted to the case presented herein. © 2008 ASCE.
Phylogenetic study of Class Armophorea (Alveolata, Ciliophora) based on 18S-rDNA data.
da Silva Paiva, Thiago; do Nascimento Borges, Bárbara; da Silva-Neto, Inácio Domingos
2013-12-01
The 18S rDNA phylogeny of Class Armophorea, a group of anaerobic ciliates, is proposed based on an analysis of 44 sequences (out of 195) retrieved from the NCBI/GenBank database. Emphasis was placed on the use of two nucleotide alignment criteria that involved variation in the gap-opening and gap-extension parameters and the use of rRNA secondary structure to orientate multiple-alignment. A sensitivity analysis of 76 data sets was run to assess the effect of variations in indel parameters on tree topologies. Bayesian inference, maximum likelihood and maximum parsimony phylogenetic analyses were used to explore how different analytic frameworks influenced the resulting hypotheses. A sensitivity analysis revealed that the relationships among higher taxa of the Intramacronucleata were dependent upon how indels were determined during multiple-alignment of nucleotides. The phylogenetic analyses rejected the monophyly of the Armophorea most of the time and consistently indicated that the Metopidae and Nyctotheridae were related to the Litostomatea. There was no consensus on the placement of the Caenomorphidae, which could be a sister group of the Metopidae + Nyctotheridae, or could have diverged at the base of the Spirotrichea branch or the Intramacronucleata tree.
Phylogenetic study of Class Armophorea (Alveolata, Ciliophora) based on 18S-rDNA data
da Silva Paiva, Thiago; do Nascimento Borges, Bárbara; da Silva-Neto, Inácio Domingos
2013-01-01
The 18S rDNA phylogeny of Class Armophorea, a group of anaerobic ciliates, is proposed based on an analysis of 44 sequences (out of 195) retrieved from the NCBI/GenBank database. Emphasis was placed on the use of two nucleotide alignment criteria that involved variation in the gap-opening and gap-extension parameters and the use of rRNA secondary structure to orientate multiple-alignment. A sensitivity analysis of 76 data sets was run to assess the effect of variations in indel parameters on tree topologies. Bayesian inference, maximum likelihood and maximum parsimony phylogenetic analyses were used to explore how different analytic frameworks influenced the resulting hypotheses. A sensitivity analysis revealed that the relationships among higher taxa of the Intramacronucleata were dependent upon how indels were determined during multiple-alignment of nucleotides. The phylogenetic analyses rejected the monophyly of the Armophorea most of the time and consistently indicated that the Metopidae and Nyctotheridae were related to the Litostomatea. There was no consensus on the placement of the Caenomorphidae, which could be a sister group of the Metopidae + Nyctotheridae, or could have diverged at the base of the Spirotrichea branch or the Intramacronucleata tree. PMID:24385862
Li, Min; Tian, Ying; Zhao, Ying; Bu, Wenjun
2012-01-01
Heteroptera, or true bugs, are the largest, morphologically diverse and economically important group of insects with incomplete metamorphosis. However, the phylogenetic relationships within Heteroptera are still in dispute and most of the previous studies were based on morphological characters or on a single gene (partial or whole 18S rDNA). Besides, so far, divergence time estimates for Heteroptera rely entirely on the fossil record, while no studies have been performed on molecular divergence rates. Here, for the first time, we used maximum parsimony (MP), maximum likelihood (ML) and Bayesian inference (BI) with multiple genes (18S rDNA, 28S rDNA, 16S rDNA and COI) to estimate phylogenetic relationships among the infraorders, and meanwhile, the Penalized Likelihood (r8s) and Bayesian (BEAST) molecular dating methods were employed to estimate divergence time of higher taxa of this suborder. Major results of the present study included: Nepomorpha was placed as the most basal clade in all six trees (MP trees, ML trees and Bayesian trees of nuclear gene data and four-gene combined data, respectively) with full support values. The sister-group relationship of Cimicomorpha and Pentatomomorpha was also strongly supported. Nepomorpha originated in the early Triassic and the other six infraorders originated in a very short period of time in the middle Triassic. Cimicomorpha and Pentatomomorpha underwent a radiation at family level in the Cretaceous, paralleling the proliferation of the flowering plants. Our results indicated that the higher-group radiations within hemimetabolous Heteroptera were simultaneous with those of holometabolous Coleoptera and Diptera, which took place in the Triassic. While the aquatic habitat was colonized by Nepomorpha already in the Triassic, the Gerromorpha independently adapted to the semi-aquatic habitat in the Early Jurassic.
Patel, Swati; Weckstein, Jason D; Patané, José S L; Bates, John M; Aleixo, Alexandre
2011-01-01
We use the small-bodied toucan genus Pteroglossus to test hypotheses about diversification in the lowland Neotropics. We sequenced three mitochondrial genes and one nuclear intron from all Pteroglossus species and used these data to reconstruct phylogenetic trees based on maximum parsimony, maximum likelihood, and Bayesian analyses. These phylogenetic trees were used to make inferences regarding both the pattern and timing of diversification for the group. We used the uplift of the Talamanca highlands of Costa Rica and western Panama as a geologic calibration for estimating divergence times on the Pteroglossus tree and compared these results with a standard molecular clock calibration. Then, we used likelihood methods to model the rate of diversification. Based on our analyses, the onset of the Pteroglossus radiation predates the Pleistocene, which has been predicted to have played a pivotal role in diversification in the Amazon rainforest biota. We found a constant rate of diversification in Pteroglossus evolutionary history, and thus no support that events during the Pleistocene caused an increase in diversification. We compare our data to other avian phylogenies to better understand major biogeographic events in the Neotropics. These comparisons support recurring forest connections between the Amazonian and Atlantic forests, and the splitting of cis/trans Andean species after the final uplift of the Andes. At the subspecies level, there is evidence for reciprocal monophyly and groups are often separated by major rivers, demonstrating the important role of rivers in causing or maintaining divergence. Because some of the results presented here conflict with current taxonomy of Pteroglossus, new taxonomic arrangements are suggested. Copyright © 2010 Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Gutierrez-Jurado, H. A.; Guan, H.; Wang, J.; Wang, H.; Bras, R. L.; Simmons, C. T.
2015-12-01
Quantification of evapotranspiration (ET) and its partition over regions of heterogeneous topography and canopy poses a challenge using traditional approaches. In this study, we report the results of a novel field experiment design guided by the Maximum Entropy Production model of ET (MEP-ET), formulated for estimating evaporation and transpiration from homogeneous soil and canopy. A catchment with complex terrain and patchy vegetation in South Australia was instrumented to measure temperature, humidity and net radiation at soil and canopy surfaces. Performance of the MEP-ET model to quantify transpiration and soil evaporation was evaluated during wet and dry conditions with independently and directly measured transpiration from sapflow and soil evaporation using the Bowen Ratio Energy Balance (BREB). MEP-ET transpiration shows remarkable agreement with that obtained through sapflow measurements during wet conditions, but consistently overestimates the flux during dry periods. However, an additional term introduced to the original MEP-ET model accounting for higher stomatal regulation during dry spells, based on differences between leaf and air vapor pressure deficits and temperatures, significantly improves the model performance. On the other hand, MEP-ET soil evaporation is in good agreement with that from BREB regardless of moisture conditions. The experimental design allows a plot and tree scale quantification of evaporation and transpiration respectively. This study confirms for the first time that the MEP-ET originally developed for homogeneous open bare soil and closed canopy can be used for modeling ET over heterogeneous land surfaces. Furthermore, we show that with the addition of an empirical function simulating the plants' ability to regulate transpiration, and based on the same measurements of temperature and humidity, the method can produce reliable estimates of ET during both wet and dry conditions without compromising its parsimony.
Li, Juan; Zhu, Jin-long; Lou, Shi-di; Wang, Ping; Zhang, You-sen; Wang, Lin; Yin, Ruo-chun; Zhang, Ping-ping
2018-01-01
Coptotermes suzhouensis (Isoptera: Rhinotermitidae) is a significant subterranean termite pest of wooden structures and is widely distributed in southeastern China. The complete mitochondrial DNA sequence of C. suzhouensis was analyzed in this study. The mitogenome was a circular molecule of 15,764 bp in length, which contained 13 protein-coding genes (PCGs), 22 transfer RNA genes, two ribosomal RNA genes, and an A+T-rich region with a gene arrangement typical of Isoptera mitogenomes. All PCGs were initiated by ATN codons and terminated by complete termination codons (TAA), except COX2, ND5, and Cytb, which ended with an incomplete termination codon T. All tRNAs displayed a typical clover-leaf structure, except for tRNASer(AGN), which did not contain the stem-loop structure in the DHU arm. The A+T content (69.23%) of the A+T-rich region (949 bp) was higher than that of the entire mitogenome (65.60%), and two different sets of repeat units (A+B) were distributed in this region. Comparison of complete mitogenome sequences with those of Coptotermes formosanus indicated that the two taxa have very high genetic similarity. Forty-one representative termite species were used to construct phylogenetic trees by maximum likelihood, maximum parsimony, and Bayesian inference methods. The phylogenetic analyses also strongly supported (BPP, MLBP, and MPBP = 100%) that all C. suzhouensis and C. formosanus samples grouped into one clade with genetic distances between 0.000 and 0.002. This study provides molecular evidence for a more robust phylogenetic position of C. suzhouensis and infers that C. suzhouensis is a synonym of C. formosanus. PMID:29718488
Zhao, Ying; Bu, Wenjun
2012-01-01
Heteroptera, or true bugs, are the largest, morphologically diverse and economically important group of insects with incomplete metamorphosis. However, the phylogenetic relationships within Heteroptera are still in dispute, and most previous studies were based on morphological characters or on a single gene (partial or whole 18S rDNA). Moreover, divergence time estimates for Heteroptera have so far relied entirely on the fossil record, and no studies have been performed on molecular divergence rates. Here, for the first time, we used maximum parsimony (MP), maximum likelihood (ML) and Bayesian inference (BI) with multiple genes (18S rDNA, 28S rDNA, 16S rDNA and COI) to estimate phylogenetic relationships among the infraorders, and the Penalized Likelihood (r8s) and Bayesian (BEAST) molecular dating methods were employed to estimate divergence times of higher taxa of this suborder. Major results of the present study included: Nepomorpha was placed as the most basal clade in all six trees (MP trees, ML trees and Bayesian trees of nuclear gene data and four-gene combined data, respectively) with full support values. The sister-group relationship of Cimicomorpha and Pentatomomorpha was also strongly supported. Nepomorpha originated in the early Triassic and the other six infraorders originated in a very short period of time in the middle Triassic. Cimicomorpha and Pentatomomorpha underwent a radiation at family level in the Cretaceous, paralleling the proliferation of the flowering plants. Our results indicated that the higher-group radiations within hemimetabolous Heteroptera were simultaneous with those of holometabolous Coleoptera and Diptera, which took place in the Triassic. While the aquatic habitat was colonized by Nepomorpha already in the Triassic, the Gerromorpha independently adapted to the semi-aquatic habitat in the Early Jurassic. PMID:22384163
The complexity of selection at the major primate beta-defensin locus.
Semple, Colin A M; Maxwell, Alison; Gautier, Philippe; Kilanowski, Fiona M; Eastwood, Hayden; Barran, Perdita E; Dorin, Julia R
2005-05-18
We have examined the evolution of the genes at the major human beta-defensin locus and the orthologous loci in a range of other primates and mouse. For the first time these data allow us to examine selective episodes in the more recent evolutionary history of this locus as well as the ancient past. We have used a combination of maximum likelihood based tests and a maximum parsimony based sliding window approach to give a detailed view of the varying modes of selection operating at this locus. We provide evidence for strong positive selection soon after the duplication of these genes within an ancestral mammalian genome. Consequently variable selective pressures have acted on beta-defensin genes in different evolutionary lineages, with episodes both of negative, and more rarely positive selection, during the divergence of primates. Positive selection appears to have been more common in the rodent lineage, accompanying the birth of novel, rodent-specific beta-defensin genes. These observations allow a fuller understanding of the evolution of mammalian innate immunity. In both the rodent and primate lineages, sites in the second exon have been subject to positive selection and by implication are important in functional diversity. A small number of sites in the mature human peptides were found to have undergone repeated episodes of selection in different primate lineages. Particular sites were consistently implicated by multiple methods at positions throughout the mature peptides. These sites are clustered at positions predicted to be important for the specificity of the antimicrobial or chemoattractant properties of beta-defensins. Surprisingly, sites within the prepropeptide region were also implicated as being subject to significant positive selection, suggesting previously unappreciated functional significance for this region. 
Identification of these putatively functional sites has important implications for our understanding of beta-defensin function and for novel antibiotic design.
Summers, Mindi M; Messing, Charles G; Rouse, Greg W
2014-11-01
Comatulidae Fleming, 1828 (previously, and incorrectly, Comasteridae A.H. Clark, 1908a), is a group of feather star crinoids currently divided into four accepted subfamilies, 21 genera and approximately 95 nominal species. Comatulidae is the most commonly-encountered and species-rich crinoid group on shallow tropical coral reefs, particularly in the Indo-western Pacific region (IWP). We conducted a molecular phylogenetic analysis of the group with concatenated data from up to seven genes for 43 nominal species spanning 17 genera and all subfamilies. Basal nodes returned low support, but maximum likelihood, maximum parsimony, and Bayesian analyses were largely congruent, permitting an evaluation of current taxonomy and analysis of morphological character transformations. Two of the four current subfamilies were paraphyletic, whereas 15 of the 17 included genera returned as monophyletic. We provide a new classification with two subfamilies, Comatulinae and Comatellinae n. subfamily Summers, Messing, & Rouse, the former containing five tribes. We revised membership of analyzed genera to make them all clades and erected Anneissia n. gen. Summers, Messing, & Rouse. Transformation analyses for morphological features generally used in feather star classification (e.g., ray branching patterns, articulations) and those specifically for Comatulidae (e.g., comb pinnule form, mouth placement) were labile with considerable homoplasy. These traditional characters, in combination, allow for generic diagnoses, but in most cases we did not recover apomorphies for subfamilies, tribes, and genera. New morphological characters that will be informative for crinoid taxonomy and identification are still needed. DNA sequence data currently provides the most reliable method of identification to the species-level for many taxa of Comatulidae. Copyright © 2014 Elsevier Inc. All rights reserved.
Ortiz-Rodriguez, Andrés Ernesto; Ornelas, Juan Francisco; Ruiz-Sanchez, Eduardo
2018-05-01
The predominantly Asian tribe Miliuseae (Annonaceae) includes over 37 Neotropical species that are mainly distributed across Mesoamerica, from southern Mexico to northern Colombia. The tremendous ecological and morphological diversity of this clade, including ramiflory, cauliflory, flagelliflory, and clonality, suggests adaptive radiation. Despite the spectacular phenotypic divergence of this clade, little is known about its phylogenetic and evolutionary history. In this study we used a nuclear DNA marker and seven chloroplast markers, and maximum parsimony, maximum likelihood and Bayesian inference methods to reconstruct a comprehensive time-calibrated phylogeny of tribe Miliuseae, especially focusing on the Desmopsis-Stenanona clade. We also perform ancestral area reconstructions to infer the biogeographic history of this group. Finally, we use ecological niche modeling, lineage distribution models, and niche overlap tests to assess whether geographic isolation and ecological specialization influenced the diversification of lineages within this clade. We reconstructed a monophyletic Miliuseae that is divided into two strongly supported clades: (i) a Sapranthus-Tridimeris clade and (ii) a Desmopsis-Stenanona clade. The colonization of the Neotropics and subsequent diversification of Neotropical Miliuseae seems to have been associated with the expansion of the boreotropical forests during the late Eocene and their subsequent fragmentation and southern displacement. Further speciation within Neotropical Miliuseae out of the Maya block seems to have occurred during the last 15 million years. Lastly, the geographic structuring of major lineages of the Desmopsis-Stenanona clade seems to have followed a climatic gradient, supporting the hypothesis that morphological differentiation between closely related species resulted from both long-term isolation between geographic ranges and adaptation to environmental conditions. Copyright © 2018 Elsevier Inc. All rights reserved.
Hochbach, Anne; Schneider, Julia; Röser, Martin
2015-06-01
To investigate phylogenetic relationships within the grass subfamily Pooideae we studied about 50 taxa covering all recognized tribes, using one plastid DNA (cpDNA) marker (matK gene-3'trnK exon) and for the first time four nuclear single copy gene loci. DNA sequence information from two parts of the nuclear genes topoisomerase 6 (Topo6) spanning the exons 8-13 and 17-19, the exons 9-13 encoding plastid acetyl-CoA-carboxylase (Acc1) and the partial exon 1 of phytochrome B (PhyB) were generated. Individual and nuclear combined data were evaluated using maximum parsimony, maximum likelihood and Bayesian methods. All of the phylogenetic results show Brachyelytrum and the tribe Nardeae as the earliest diverging lineages within the subfamily. The 'core' Pooideae (Hordeeae and the Aveneae/Poeae tribe complex) are also strongly supported, as well as the monophyly of the tribes Brachypodieae, Meliceae and Stipeae (except PhyB). The beak grass tribe Diarrheneae and the tribe Duthieeae are not monophyletic in some of the analyses. However, the combined nuclear DNA (nDNA) tree yields the highest resolution and the best delimitation of the tribes, and provides the following evolutionary hypothesis for the tribes: Brachyelytrum, Nardeae, Duthieeae, Meliceae, Stipeae, Diarrheneae, Brachypodieae and the 'core' Pooideae. Within the individual datasets, the phylogenetic trees obtained from Topo6 exon 8-13 show the most interesting results. The divergent positions of some clone sequences of Ampelodesmos mauritanicus and Trikeraia pappiformis, for instance, may indicate a hybrid origin of these stipoid taxa. Copyright © 2015 Elsevier Inc. All rights reserved.
Molecular phylogenetics and biogeography of the Neotropical redstarts (Myioborus; Aves, Parulinae).
Pérez-Emán, Jorge L
2005-11-01
Montane areas in the Neotropics are characterized by high diversity and endemism of birds and other groups. The avian genus Myioborus (Parulinae) is a group of insectivorous warblers, characteristic of cloud forests, that represents one of the few Parulinae genera (New World warblers) that has radiated substantially in South America. The genus is distributed throughout most montane regions from the southwestern United States to northern Argentina. Here, I use mitochondrial sequences from the cytochrome b, ND2, and ND3 genes to present the first hypothesis of phylogenetic relationship among all Myioborus species level taxa. Phylogenetic reconstructions based on maximum parsimony, maximum likelihood, and Bayesian methods produced similar results and suggest a northern origin for the genus Myioborus with subsequent colonization of the Neotropical Montane Region. The lower-montane species, M. miniatus, is the sister taxon to a clade in which all taxa occupy upper-montane habitats. These "highland" taxa diverged early in the history of the genus and produced two well-defined monophyletic lineages, a Central-northern Andean clade formed by M. albifrons, M. ornatus, and M. melanocephalus, and a Pantepui (table-mountains of southern Venezuela, northern Brazil, and western Guyana) clade consisting of M. castaneocapillus, M. albifacies, and M. cardonai, and probably M. pariae. M. brunniceps, M. flavivertex, and M. torquatus were included in this upper-montane clade but without clear relationships to other taxa. Lack of resolution of nodes defining the upper-montane species clade is likely to result from a period of rapid diversification mediated by geological and climatic events during the Late Pliocene. These results suggest that an interplay of dispersal and vicariance has shaped the current biogeographic patterns of Myioborus.
Live phylogeny with polytomies: Finding the most compact parsimonious trees.
Papamichail, D; Huang, A; Kennedy, E; Ott, J-L; Miller, A; Papamichail, G
2017-08-01
Construction of phylogenetic trees has traditionally focused on binary trees where all species appear on leaves, a problem for which numerous efficient solutions have been developed. Certain application domains though, such as viral evolution and transmission, paleontology, linguistics, and phylogenetic stemmatics, often require phylogeny inference that involves placing input species on ancestral tree nodes (live phylogeny), and polytomies. These requirements, despite their prevalence, lead to computationally harder algorithmic solutions and have been sparsely examined in the literature to date. In this article we prove some unique properties of most parsimonious live phylogenetic trees with polytomies, and their mapping to traditional binary phylogenetic trees. We show that our problem reduces to finding the most compact parsimonious tree for n species, and describe a novel efficient algorithm to find such trees without resorting to exhaustive enumeration of all possible tree topologies. Copyright © 2017 Elsevier Ltd. All rights reserved.
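As a point of reference for the parsimony criterion mentioned in the abstract above, the following is a minimal sketch of Fitch's small-parsimony count for a single character on a fixed rooted binary tree. It is not the authors' algorithm: it does not handle live phylogeny (input species on internal nodes) or polytomies, and the function name `fitch_score` and the nested-tuple tree encoding are assumptions made for illustration only.

```python
def fitch_score(tree, states):
    """Minimum number of state changes for one character (Fitch's algorithm).

    tree   -- rooted binary tree as nested 2-tuples of leaf names
              (hypothetical encoding chosen for this sketch)
    states -- dict mapping each leaf name to its observed state
    """
    changes = 0

    def postorder(node):
        nonlocal changes
        if isinstance(node, str):          # leaf: its state set is the observed state
            return {states[node]}
        left, right = (postorder(child) for child in node)
        common = left & right
        if common:                         # children agree on at least one state
            return common
        changes += 1                       # disagreement costs one change
        return left | right

    postorder(tree)
    return changes


observed = {"A": "G", "B": "G", "C": "T", "D": "T"}
print(fitch_score((("A", "B"), ("C", "D")), observed))  # grouping that matches the states
print(fitch_score((("A", "C"), ("B", "D")), observed))  # grouping that conflicts with them
```

Comparing the two calls shows how the same character costs fewer changes on the tree whose groupings match the observed states, which is the quantity a parsimony search minimizes over topologies.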
A Bayesian Supertree Model for Genome-Wide Species Tree Reconstruction
De Oliveira Martins, Leonardo; Mallo, Diego; Posada, David
2016-01-01
Current phylogenomic data sets highlight the need for species tree methods able to deal with several sources of gene tree/species tree incongruence. At the same time, we need to make most use of all available data. Most species tree methods deal with single processes of phylogenetic discordance, namely, gene duplication and loss, incomplete lineage sorting (ILS) or horizontal gene transfer. In this manuscript, we address the problem of species tree inference from multilocus, genome-wide data sets regardless of the presence of gene duplication and loss and ILS therefore without the need to identify orthologs or to use a single individual per species. We do this by extending the idea of Maximum Likelihood (ML) supertrees to a hierarchical Bayesian model where several sources of gene tree/species tree disagreement can be accounted for in a modular manner. We implemented this model in a computer program called guenomu whose inputs are posterior distributions of unrooted gene tree topologies for multiple gene families, and whose output is the posterior distribution of rooted species tree topologies. We conducted extensive simulations to evaluate the performance of our approach in comparison with other species tree approaches able to deal with more than one leaf from the same species. Our method ranked best under simulated data sets, in spite of ignoring branch lengths, and performed well on empirical data, as well as being fast enough to analyze relatively large data sets. Our Bayesian supertree method was also very successful in obtaining better estimates of gene trees, by reducing the uncertainty in their distributions. In addition, our results show that under complex simulation scenarios, gene tree parsimony is also a competitive approach once we consider its speed, in contrast to more sophisticated models. PMID:25281847
Ternès, Nils; Rotolo, Federico; Michiels, Stefan
2016-07-10
Correct selection of prognostic biomarkers among multiple candidates is becoming increasingly challenging as the dimensionality of biological data becomes higher. Therefore, minimizing the false discovery rate (FDR) is of primary importance, while a low false negative rate (FNR) is a complementary measure. The lasso is a popular selection method in Cox regression, but its results depend heavily on the penalty parameter λ. Usually, λ is chosen using maximum cross-validated log-likelihood (max-cvl). However, this method often has a very high FDR. We review methods for a more conservative choice of λ. We propose an empirical extension of the cvl by adding a penalization term, which trades off between the goodness-of-fit and the parsimony of the model, leading to the selection of fewer biomarkers and, as we show, to the reduction of the FDR without large increase in FNR. We conducted a simulation study considering null and moderately sparse alternative scenarios and compared our approach with the standard lasso and 10 other competitors: Akaike information criterion (AIC), corrected AIC, Bayesian information criterion (BIC), extended BIC, Hannan and Quinn information criterion (HQIC), risk information criterion (RIC), one-standard-error rule, adaptive lasso, stability selection, and percentile lasso. Our extension achieved the best compromise across all the scenarios between a reduction of the FDR and a limited increase of the FNR, followed by the AIC, the RIC, and the adaptive lasso, which performed well in some settings. We illustrate the methods using gene expression data of 523 breast cancer patients. In conclusion, we propose to apply our extension to the lasso whenever a stringent FDR with a limited FNR is targeted. Copyright © 2016 John Wiley & Sons, Ltd.
Inda, Luis A.; Pimentel, Manuel; Chase, Mark W.
2012-01-01
Background and aims Tribe Orchideae (Orchidaceae: Orchidoideae) comprises around 62 mostly terrestrial genera, which are well represented in the Northern Temperate Zone and less frequently in tropical areas of both the Old and New Worlds. Phylogenetic relationships within this tribe have been studied previously using only nuclear ribosomal DNA (nuclear ribosomal internal transcribed spacer, nrITS). However, different parts of the phylogenetic tree in these analyses were weakly supported, and integrating information from different plant genomes is clearly necessary in orchids, where reticulate evolution events are putatively common. The aims of this study were to: (1) obtain a well-supported and dated phylogenetic hypothesis for tribe Orchideae, (2) assess appropriateness of recent nomenclatural changes in this tribe in the last decade, (3) detect possible examples of reticulate evolution and (4) analyse in a temporal context evolutionary trends for subtribe Orchidinae with special emphasis on pollination systems. Methods The analyses included 118 samples, belonging to 103 species and 25 genera, for three DNA regions (nrITS, mitochondrial cox1 intron and plastid rpl16 intron). Bayesian and maximum-parsimony methods were used to construct a well-supported and dated tree. Evolutionary trends in the subtribe were analysed using Bayesian and maximum-likelihood methods of character evolution. Key Results The dated phylogenetic tree strongly supported the recently recircumscribed generic concepts of Bateman and collaborators. Moreover, it was found that Orchidinae have diversified in the Mediterranean basin during the last 15 million years, and one potential example of reticulate evolution in the subtribe was identified. In Orchidinae, pollination systems have shifted on numerous occasions during the last 23 million years. 
Conclusions The results indicate that ancestral Orchidinae were hymenopteran-pollinated, food-deceptive plants and that these traits have been dominant throughout the evolutionary history of the subtribe in the Mediterranean. Evidence was also obtained that the onset of sexual deception might be linked to an increase in labellum size, and the possibility is discussed that diversification in Orchidinae developed in parallel with diversification of bees and wasps from the Miocene onwards. PMID:22539542
Storage Capacity of the Linear Associator: Beginnings of a Theory of Computational Memory
1988-04-27
Issues valuable to future efforts and provides methods for analysis of perceptual/cognitive systems. vii Table of Contents 1. Introduction...not only enables a system to vastly simplify its representation of the environment, but the identification of such symbols in a cognitive system could...subsequently provide a parsimonious theory of cognition (Yes, I know, traditional AI already knows this). Not that the identification would be easy
Van Meter, Kimberly J.; Basu, Nandita B.
2015-01-01
Nutrient legacies in anthropogenic landscapes, accumulated over decades of fertilizer application, lead to time lags between implementation of conservation measures and improvements in water quality. Quantification of such time lags has remained difficult, however, due to an incomplete understanding of controls on nutrient depletion trajectories after changes in land-use or management practices. In this study, we have developed a parsimonious watershed model for quantifying catchment-scale time lags based on both soil nutrient accumulations (biogeochemical legacy) and groundwater travel time distributions (hydrologic legacy). The model accurately predicted the time lags observed in an Iowa watershed that had undergone a 41% conversion of area from row crop to native prairie. We explored the time scales of change for stream nutrient concentrations as a function of both natural and anthropogenic controls, from topography to spatial patterns of land-use change. Our results demonstrate that the existence of biogeochemical nutrient legacies increases time lags beyond those due to hydrologic legacy alone. In addition, we show that the maximum concentration reduction benefits vary according to the spatial pattern of intervention, with preferential conversion of land parcels having the shortest catchment-scale travel times providing proportionally greater concentration reductions as well as faster response times. In contrast, a random pattern of conversion results in a 1:1 relationship between percent land conversion and percent concentration reduction, irrespective of denitrification rates within the landscape. Our modeling framework allows for the quantification of tradeoffs between costs associated with implementation of conservation measures and the time needed to see the desired concentration reductions, making it of great value to decision makers regarding optimal implementation of watershed conservation measures. PMID:25985290
Rajpoot, Ankita; Kumar, Ved Prakash; Bahuguna, Archana; Kumar, Dhyanendra
2017-11-01
Monitor lizards (Varanus species) are widely distributed reptiles listed as endangered in the IUCN red data list. In India, on the basis of morphological and ecological characteristics, they are divided into four species, viz. the Bengal monitor lizard, Yellow monitor lizard, Desert monitor lizard and Water monitor lizard. These four species are listed as Schedule I species in the Indian Wildlife (Protection) Act 1972. This paper is the first attempt to present Forensically Informative Nucleotide Sequencing (FINS) for Indian Varanus based on three mitochondrial genes. The molecular framework will be useful for the identification of Indian Varanus species and of trade products derived from monitors and, as such, has important applications for wildlife management and conservation. Here, we used 14 known individual skin pieces from four species of monitor lizards; partial fragments of three mitochondrial genes (Cyt b, 12S rRNA, and 16S rRNA) were amplified for genetic study. In Cyt b, 12S rRNA and 16S rRNA, we observed 5, 5 and 4 haplotypes; 71, 69, and 43 variable sites; and 90, 89, and 50 parsimony-informative sites within the four species of Indian monitor lizards, respectively. In addition, the nucleotide composition was T 26.4, C 32.8, A 29.2 and G 11.6; T 18.8, C 29.7, A 34.0 and G 17.5; and T 21.7, C 27.3, A 32.5 and G 18.5 in Cyt b, 12S rRNA and 16S rRNA, respectively. The neighbor-joining tree and the maximum parsimony tree of the three mitochondrial genes showed similar results and revealed two major clades in Indian monitor lizards.
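For readers unfamiliar with the site statistics reported in the abstract above, the sketch below counts variable and parsimony-informative sites in a toy alignment, using the standard definitions: a site is variable if it shows at least two different nucleotides, and parsimony-informative if at least two different nucleotides each occur in at least two sequences. The function name `site_summary` and the toy sequences are assumptions for illustration; they are not the authors' data or software.

```python
from collections import Counter


def site_summary(alignment):
    """Return (variable_sites, parsimony_informative_sites) for aligned sequences.

    alignment -- list of equal-length strings; characters other than
                 A, C, G, T (gaps, ambiguity codes) are ignored per column.
    """
    variable = informative = 0
    for column in zip(*alignment):
        counts = Counter(base for base in column if base in "ACGT")
        if len(counts) >= 2:                       # at least two different states
            variable += 1
            shared = sum(1 for n in counts.values() if n >= 2)
            if shared >= 2:                        # two states, each in >= 2 sequences
                informative += 1
    return variable, informative


toy = ["ACGT",
       "ACGA",
       "ATGA",
       "ATCT"]
print(site_summary(toy))
```

By these definitions every parsimony-informative site is also a variable site, so the second count can never exceed the first.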
NASA Astrophysics Data System (ADS)
Luke, Denneko; McLaren, Kurt
2018-05-01
In situ measurements of leaf level photosynthetic response to light were collected from seedlings of ten tree species from a tropical montane wet forest, the John Crow Mountains, Jamaica. A model-based recursive partitioning ('mob') algorithm was then used to identify species associations based on their fitted photosynthetic response curves. Leaf area dark respiration (RD) and light saturated maximum photosynthetic (Amax) rates were also used as 'mob' partitioning variables, to identify species associations based on seedling demographic patterns (from June 2007 to May 2010) following a hurricane (Aug. 2007) and the spatiotemporal distribution patterns of stems in 2006 and 2012. RD and Amax rates ranged from 1.14 to 2.02 μmol (CO2) m-2s-1 and 2.97-5.87 μmol (CO2) m-2s-1, respectively, placing the ten species in the range of intermediate shade tolerance. Several parsimonious species 'mob' groups were formed based on 1) interspecific differences among species response curves, 2) variations in post-hurricane seedling demographic trends and 3) RD rates and species spatiotemporal distribution patterns at aspects that are more or less exposed to hurricanes. The composition of parsimonious groupings based on photosynthetic curves was not concordant with the groups based on demographic trends but was partially concordant with the RD - species spatiotemporal distribution groups. Our results indicated that the influence of photosynthetic characteristics on demographic traits and species distributions was not straightforward. Rather, there was a complex pattern of interaction between ecophysiological and demographic traits, which determined species successional status, post-hurricane response and ultimately, species distribution at our study site.
Molecular phylogeny of the Drusinae (Trichoptera: Limnephilidae): preliminary results
NASA Astrophysics Data System (ADS)
Pauls, S.; Lumbsch, T.; Haase, P.
2005-05-01
We examine the phylogenetic relationships within the subfamily of the Drusinae using molecular markers. Sequence data from two mitochondrial loci (mitochondrial cytochrome oxidase I, mitochondrial ribosomal large subunit) are used to infer the relationships within and among the genera of the Drusinae. Sequence data were generated for 21 taxa from five genera from the subfamily. The molecular data were analyzed using a Bayesian Markov Chain Monte Carlo and a Maximum Parsimony approach for both single gene and combined data sets. Several hypotheses of relationships previously inferred based on morphological characters were tested. The study revealed a very close relationship between Drusus discolor and D. romanicus suggesting that divergence between these two species occurred recently. The relationships inferred by molecular data suggest that larval morphology may be an important taxonomic character, which has often been neglected. The data also indicate that the genera Ecclisopteryx and Drusus are polyphyletic with respect to one another.
[Molecular identification of medicinal plant genus Uncaria in Guizhou].
Gang, Tao; Liu, Tao; Zhu, Ying; Liu, Zuo-Yi
2008-06-01
To analyze the rDNA ITS regions of medicinal plants of the genus Uncaria in Guizhou and construct their phylogenetic tree, in order to supply molecular evidence for the taxonomy and identification of these medicinal plants at the genetic level. The ITS gene fragments of the 4 medicinal plants were PCR amplified and sequenced. The rDNA ITS regions were analyzed using the software ClustalX, BioEdit and PAUP* 4.0 beta 10. The entire sequences of rDNA ITS1, ITS2, and 5.8S rDNA were obtained. The maximum-parsimony tree of the four ITS regions, together with similar sequences from GenBank, was found, with Mitragyna rubrostipulata (AJ492621) and Mitragyna rubrostipulata (AJ605988) designated as outgroup. The 4 medicinal plants are 4 species in the genus Uncaria, and are most similar to Uncaria rhynchophylla.
Taylor, Maria Lucia; Chávez-Tapia, Catalina B; Rojas-Martínez, Alberto; del Rocio Reyes-Montes, Maria; del Valle, Mirian Bobadilla; Zúñiga, Gerardo
2005-09-01
Fourteen Histoplasma capsulatum isolates recovered from infected bats captured in Mexican caves and two human H. capsulatum reference strains were analyzed using random amplification of polymorphic DNA PCR-based and partial DNA sequences of four genes. Cluster analysis of random amplification of polymorphic DNA-patterns revealed differences for two H. capsulatum isolates of one migratory bat Tadarida brasiliensis. Three groups were identified by distance and maximum-parsimony analyses of arf, H-anti, ole, and tub1 H. capsulatum genes. Group I included most isolates from infected bats and one clinical strain from central Mexico; group II included the two isolates from T. brasiliensis; the human G-217B reference strain from USA formed an independent group III. Isolates from group II showed diversity in relation to groups I and III, suggesting a different H. capsulatum population.
Cladistic analysis of Bantu languages: a new tree based on combined lexical and grammatical data
NASA Astrophysics Data System (ADS)
Rexová, Kateřina; Bastin, Yvonne; Frynta, Daniel
2006-04-01
The phylogeny of the Bantu languages is reconstructed by application of the cladistic methodology to the combined lexical and grammatical data (87 languages, 144 characters). A maximum parsimony tree and Bayesian analysis supported some previously recognized clades, e.g., that of eastern and southern Bantu languages. Moreover, the results revealed that Bantu languages south and east of the equatorial forest are probably monophyletic. It suggests an unorthodox scenario of Bantu expansion including (after initial radiation in their homelands and neighboring territories) just a single passage through rainforest areas followed by a subsequent divergence into major clades. The likely localization of this divergence is in the area west of the Great Lakes. It conforms to the view that demographic expansion and dispersal throughout the dry-forests and savanna regions of subequatorial Africa was associated with the acquisition of new technologies (iron metallurgy and grain cultivation).
Towards resolving the complete fern tree of life.
Lehtonen, Samuli
2011-01-01
In the past two decades, molecular systematic studies have revolutionized our understanding of the evolutionary history of ferns. The availability of large molecular data sets, together with efficient computer algorithms, now enables us to reconstruct evolutionary histories with previously unseen completeness. Here, the most comprehensive fern phylogeny to date, representing over one-fifth of the extant global fern diversity, is inferred based on four plastid genes. Parsimony and maximum-likelihood analyses provided mostly congruent results and in general supported the prevailing view on higher-level fern systematics. At a deep phylogenetic level, the position of horsetails depended on the optimality criterion chosen, with horsetails positioned as the sister group either of the Marattiopsida-Polypodiopsida clade or of the Polypodiopsida. The analyses demonstrate the power of using a 'supermatrix' approach to resolve large-scale phylogenies and reveal questionable taxonomies. These results provide a valuable background for future research on fern systematics, ecology, biogeography and other evolutionary studies.
On-line algorithms for forecasting hourly loads of an electric utility
DOE Office of Scientific and Technical Information (OSTI.GOV)
Vemuri, S.; Huang, W.L.; Nelson, D.J.
A method that lends itself to on-line forecasting of hourly electric loads is presented, and the results of its use are compared to models developed using the Box-Jenkins method. The method consists of processing the historical hourly loads with a sequential least-squares estimator to identify a finite-order autoregressive model which, in turn, is used to obtain a parsimonious autoregressive-moving average model. The method presented has several advantages in comparison with the Box-Jenkins method, including much less human intervention, improved model identification, and better results. The method is also more robust in that greater confidence can be placed in the accuracy of models based upon the various measures available at the identification stage.
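The first stage described in this abstract, fitting a finite-order autoregressive model with a sequential least-squares estimator, can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a standard recursive least-squares (RLS) update, and the function names, the `forgetting` factor and the `delta` initialisation are hypothetical choices made for illustration. The later step of deriving a parsimonious ARMA model from the AR fit is not shown.

```python
def rls_ar_fit(series, order, forgetting=1.0, delta=1e6):
    """Estimate AR(order) coefficients one sample at a time (standard RLS).

    Assumed model: y[t] = theta[0]*y[t-1] + ... + theta[order-1]*y[t-order] + noise.
    `forgetting` and `delta` are illustrative tuning values, not from the source.
    """
    theta = [0.0] * order
    # P starts as a large multiple of the identity, i.e. a weak prior on theta.
    P = [[delta if i == j else 0.0 for j in range(order)] for i in range(order)]
    for t in range(order, len(series)):
        x = [series[t - k] for k in range(1, order + 1)]       # lagged values
        Px = [sum(P[i][j] * x[j] for j in range(order)) for i in range(order)]
        denom = forgetting + sum(x[i] * Px[i] for i in range(order))
        gain = [v / denom for v in Px]
        err = series[t] - sum(theta[i] * x[i] for i in range(order))
        theta = [theta[i] + gain[i] * err for i in range(order)]
        # P <- (P - gain * x^T P) / forgetting; x^T P equals Px^T because P is symmetric.
        P = [[(P[i][j] - gain[i] * Px[j]) / forgetting for j in range(order)]
             for i in range(order)]
    return theta


def forecast_next(series, theta):
    """One-step-ahead forecast from the most recent len(theta) observations."""
    return sum(c * series[-k] for k, c in enumerate(theta, start=1))


if __name__ == "__main__":
    # Synthetic, noise-free AR(2) data standing in for hourly loads.
    loads = [1.0, 0.5]
    for _ in range(30):
        loads.append(0.6 * loads[-1] - 0.2 * loads[-2])
    coeffs = rls_ar_fit(loads, order=2)
    print("estimated coefficients:", coeffs)
    print("next-hour forecast:", forecast_next(loads, coeffs))
```

Because each update touches only the newest observation, an estimator of this kind can run on-line as new hourly loads arrive, which is the property the abstract emphasises.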
Remarkable convergent evolution in specialized parasitic Thecostraca (Crustacea)
Pérez-Losada, Marcos; Høeg, Jens T; Crandall, Keith A
2009-01-01
Background The Thecostraca are arguably the most morphologically and biologically variable group within the Crustacea, including both suspension feeders (Cirripedia: Thoracica and Acrothoracica) and parasitic forms (Cirripedia: Rhizocephala, Ascothoracida and Facetotecta). Similarities between the metamorphosis found in the Facetotecta and Rhizocephala suggest a common evolutionary origin, but until now no comprehensive study has looked at the basic evolution of these thecostracan groups. Results To this end, we collected DNA sequences from three nuclear genes [18S rRNA (2,305), 28S rRNA (2,402), Histone H3 (328)] and 41 larval characters in seven facetotectans, five ascothoracidans, three acrothoracicans, 25 rhizocephalans and 39 thoracicans (ingroup) and 12 Malacostraca and 10 Copepoda (outgroup). Maximum parsimony, maximum likelihood and Bayesian analyses showed the Facetotecta, Ascothoracida and Cirripedia each as monophyletic. The better resolved and highly supported DNA maximum likelihood and morphological-DNA Bayesian analysis trees depicted the main phylogenetic relationships within the Thecostraca as (Facetotecta, (Ascothoracida, (Acrothoracica, (Rhizocephala, Thoracica)))). Conclusion Our analyses indicate a convergent evolution of the very similar and highly reduced slug-shaped stages found during metamorphosis of both the Rhizocephala and the Facetotecta. This provides a remarkable case of convergent evolution and implies that the advanced endoparasitic mode of life known from the Rhizocephala and strongly indicated for the Facetotecta had no common origin. Future analyses are needed to determine whether the most recent common ancestor of the Thecostraca was free-living or some primitive form of ectoparasite. PMID:19374762
The phylogeny and evolutionary history of tyrannosauroid dinosaurs.
Brusatte, Stephen L; Carr, Thomas D
2016-02-02
Tyrannosauroids--the group of carnivores including Tyrannosaurus rex--are some of the most familiar dinosaurs of all. A surge of recent discoveries has helped clarify some aspects of their evolution, but competing phylogenetic hypotheses raise questions about their relationships, biogeography, and fossil record quality. We present a new phylogenetic dataset, which merges published datasets and incorporates recently discovered taxa. We analyze it with parsimony and, for the first time for a tyrannosauroid dataset, Bayesian techniques. The parsimony and Bayesian results are highly congruent, and provide a framework for interpreting the biogeography and evolutionary history of tyrannosauroids. Our phylogenies illustrate that the body plan of the colossal species evolved piecemeal, imply no clear division between northern and southern species in western North America as had been argued, and suggest that T. rex may have been an Asian migrant to North America. Over-reliance on cranial shape characters may explain why published parsimony studies have diverged, and filling three major gaps in the fossil record holds the most promise for future work. PMID:26830019
Exponential series approaches for nonparametric graphical models
NASA Astrophysics Data System (ADS)
Janofsky, Eric
Markov Random Fields (MRFs) or undirected graphical models are parsimonious representations of joint probability distributions. This thesis studies high-dimensional, continuous-valued pairwise Markov Random Fields. We are particularly interested in approximating pairwise densities whose logarithm belongs to a Sobolev space. For this problem we propose the method of exponential series which approximates the log density by a finite-dimensional exponential family with the number of sufficient statistics increasing with the sample size. We consider two approaches to estimating these models. The first is regularized maximum likelihood. This involves optimizing the sum of the log-likelihood of the data and a sparsity-inducing regularizer. We then propose a variational approximation to the likelihood based on tree-reweighted, nonparametric message passing. This approximation allows for upper bounds on risk estimates, leverages parallelization and is scalable to densities on hundreds of nodes. We show how the regularized variational MLE may be estimated using a proximal gradient algorithm. We then consider estimation using regularized score matching. This approach uses an alternative scoring rule to the log-likelihood, which obviates the need to compute the normalizing constant of the distribution. For general continuous-valued exponential families, we provide parameter and edge consistency results. As a special case we detail a new approach to sparse precision matrix estimation which has statistical performance competitive with the graphical lasso and computational performance competitive with the state-of-the-art glasso algorithm. We then describe results for model selection in the nonparametric pairwise model using exponential series. The regularized score matching problem is shown to be a convex program; we provide scalable algorithms based on consensus alternating direction method of multipliers (ADMM) and coordinate-wise descent. 
We use simulations to compare our method to others in the literature as well as the aforementioned TRW estimator.
Quasi-continuous stochastic simulation framework for flood modelling
NASA Astrophysics Data System (ADS)
Moustakis, Yiannis; Kossieris, Panagiotis; Tsoukalas, Ioannis; Efstratiadis, Andreas
2017-04-01
Typically, flood modelling in the context of everyday engineering practices is addressed through event-based deterministic tools, e.g., the well-known SCS-CN method. A major shortcoming of such approaches is the neglect of uncertainty, which is associated with the variability of soil moisture conditions and the variability of rainfall during the storm event. In event-based modeling, the sole expression of uncertainty is the return period of the design storm, which is assumed to represent the acceptable risk of all output quantities (flood volume, peak discharge, etc.). On the other hand, the varying antecedent soil moisture conditions across the basin are represented by means of scenarios (e.g., the three AMC types by SCS), while the temporal distribution of rainfall is represented through standard deterministic patterns (e.g., the alternative blocks method). In order to address these major inconsistencies, while simultaneously preserving the simplicity and parsimony of the SCS-CN method, we have developed a quasi-continuous stochastic simulation approach, comprising the following steps: (1) generation of synthetic daily rainfall time series; (2) update of potential maximum soil moisture retention, on the basis of accumulated five-day rainfall; (3) estimation of daily runoff through the SCS-CN formula, using as inputs the daily rainfall and the updated value of soil moisture retention; (4) selection of extreme events and application of the standard SCS-CN procedure for each specific event, on the basis of synthetic rainfall. This scheme requires the use of two stochastic modelling components, namely the CastaliaR model, for the generation of synthetic daily data, and the HyetosMinute model, for the disaggregation of daily rainfall to finer temporal scales. Outcomes of this approach are a large number of synthetic flood events, allowing for expressing the design variables in statistical terms and thus properly evaluating the flood risk.
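Steps (2) and (3) of the scheme above can be sketched as follows. This is a hedged illustration, not the authors' code: it uses the standard SCS-CN relations in metric units, S = 25400/CN - 254 and Q = (P - Ia)^2 / (P - Ia + S) with Ia = 0.2 S, and it assumes that the retention update follows the conventional three-class AMC adjustment of the curve number. The five-day rainfall limits (12.7 mm and 27.9 mm) and all function names are assumptions introduced here; the abstract does not state the update rule actually used. The synthetic rainfall generation of step (1) and the event selection of step (4) are not covered.

```python
def retention_mm(cn):
    """Potential maximum retention S in mm for a curve number 0 < cn <= 100."""
    return 25400.0 / cn - 254.0


def scs_cn_runoff(p_mm, s_mm, ia_ratio=0.2):
    """Direct runoff depth Q (mm) from rainfall depth P (mm) and retention S (mm)."""
    ia = ia_ratio * s_mm                      # initial abstraction
    if p_mm <= ia:
        return 0.0
    return (p_mm - ia) ** 2 / (p_mm - ia + s_mm)


def adjust_cn(cn2, p5_mm, dry_limit=12.7, wet_limit=27.9):
    """Shift the average-condition curve number according to 5-day antecedent rain.

    The limits are illustrative assumptions, not values taken from the abstract.
    """
    if p5_mm < dry_limit:                     # dry antecedent conditions (AMC I)
        return 4.2 * cn2 / (10.0 - 0.058 * cn2)
    if p5_mm > wet_limit:                     # wet antecedent conditions (AMC III)
        return 23.0 * cn2 / (10.0 + 0.13 * cn2)
    return cn2                                # average conditions (AMC II)


def daily_runoff(rain_mm, cn2):
    """Daily runoff series with S updated from the previous five days of rain."""
    runoff = []
    for day, p in enumerate(rain_mm):
        p5 = sum(rain_mm[max(0, day - 5):day])
        s = retention_mm(adjust_cn(cn2, p5))
        runoff.append(scs_cn_runoff(p, s))
    return runoff


if __name__ == "__main__":
    rain = [0.0, 12.0, 20.0, 5.0, 0.0, 40.0, 3.0]   # made-up daily depths in mm
    for day, q in enumerate(daily_runoff(rain, cn2=80.0)):
        print(f"day {day}: rain {rain[day]:5.1f} mm -> runoff {q:6.2f} mm")
```

Run over a long synthetic rainfall series, a loop of this kind yields the daily runoff record from which extreme events would then be selected.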
Kahmann, A; Anzanello, M J; Fogliatto, F S; Marcelo, M C A; Ferrão, M F; Ortiz, R S; Mariotti, K C
2018-04-15
Street cocaine is typically altered with several compounds that increase its harmful health-related side effects, most notably depression, convulsions, and severe damages to the cardiovascular system, lungs, and brain. Thus, determining the concentration of cocaine and adulterants in seized drug samples is important from both health and forensic perspectives. Although FTIR has been widely used to identify the fingerprint and concentration of chemical compounds, spectroscopy datasets are usually comprised of thousands of highly correlated wavenumbers which, when used as predictors in regression models, tend to undermine the predictive performance of multivariate techniques. In this paper, we propose an FTIR wavenumber selection method aimed at identifying FTIR spectra intervals that best predict the concentration of cocaine and adulterants (e.g. caffeine, phenacetin, levamisole, and lidocaine) in cocaine samples. For that matter, the Mutual Information measure is integrated into a Quadratic Programming problem with the objective of minimizing the probability of retaining redundant wavenumbers, while maximizing the relationship between retained wavenumbers and compounds' concentrations. Optimization outputs guide the order of inclusion of wavenumbers in a predictive model, using a forward-based wavenumber selection method. After the inclusion of each wavenumber, parameters of three alternative regression models are estimated, and each model's prediction error is assessed through the Mean Average Error (MAE) measure; the recommended subset of retained wavenumbers is the one that minimizes the prediction error with maximum parsimony. Using our propositions in a dataset of 115 cocaine samples we obtained a best prediction model with average MAE of 0.0502 while retaining only 2.29% of the original wavenumbers, increasing the predictive precision by 0.0359 when compared to a model using the complete set of wavenumbers as predictors. Copyright © 2018 Elsevier B.V. 
All rights reserved.
Using data mining to predict success in a weight loss trial.
Batterham, M; Tapsell, L; Charlton, K; O'Shea, J; Thorne, R
2017-08-01
Traditional methods for predicting weight loss success use regression approaches, which make the assumption that the relationships between the independent and dependent (or logit of the dependent) variable are linear. The aim of the present study was to investigate the relationship between common demographic and early weight loss variables to predict weight loss success at 12 months without making this assumption. Data mining methods (decision trees, generalised additive models and multivariate adaptive regression splines), in addition to logistic regression, were employed to predict: (i) weight loss success (defined as ≥5%) at the end of a 12-month dietary intervention using demographic variables [body mass index (BMI), sex and age]; (ii) percentage weight loss at 1 month; and (iii) the difference between actual and predicted weight loss using an energy balance model. The methods were compared by assessing model parsimony and the area under the curve (AUC). The decision tree provided the most clinically useful model and had good accuracy (AUC 0.720; 95% confidence interval = 0.600-0.840). Percentage weight loss at 1 month (≥0.75%) was the strongest predictor for successful weight loss. Among those individuals losing ≥0.75%, individuals with a BMI ≥27 kg m-2 were more likely to be successful than those with a BMI between 25 and 27 kg m-2. Data mining methods can provide a more accurate way of assessing relationships when conventional assumptions are not met. In the present study, a decision tree provided the most parsimonious model. Given that early weight loss cannot be predicted before randomisation, incorporating this information into a post-randomisation trial design may give better weight loss results. © 2017 The British Dietetic Association Ltd.
Jameson Kiesling, Natalie M; Yi, Soojin V; Xu, Ke; Gianluca Sperone, F; Wildman, Derek E
2015-01-01
The development and evolution of organisms is heavily influenced by their environment. Thus, understanding the historical biogeography of taxa can provide insights into their evolutionary history, adaptations and trade-offs realized throughout time. In the present study we have taken a phylogenomic approach to infer New World monkey phylogeny, upon which we have reconstructed the biogeographic history of extant platyrrhines. In order to generate sufficient phylogenetic signal within the New World monkey clade, we carried out a large-scale phylogenetic analysis of approximately 40 kb of non-genic genomic DNA sequence in a 36 species subset of extant New World monkeys. Maximum parsimony, maximum likelihood and Bayesian inference analysis all converged on a single optimal tree topology. Divergence dating and biogeographic analysis reconstruct the timing and geographic location of divergence events. The ancestral area reconstruction describes the geographic locations of the last common ancestor of extant platyrrhines and provides insight into key biogeographic events occurring during platyrrhine diversification. Through these analyses we conclude that the diversification of the platyrrhines took place concurrently with the establishment and diversification of the Amazon rainforest. This suggests that an expanding rainforest environment rather than geographic isolation drove platyrrhine diversification. Copyright © 2014 Elsevier Inc. All rights reserved.
Yu, Danna; Zhang, Jiayong; Li, Peng; Zheng, Rongquan; Shao, Chen
2015-01-01
The Chinese tiger frog Hoplobatrachus rugulosus is widely distributed in southern China, Malaysia, Myanmar, Thailand, and Vietnam. It is listed in Appendix II of CITES as the only Class II nationally-protected frog in China. The bred tiger frog, known as the Thailand tiger frog, is also identified as H. rugulosus. Our analysis of the Cyt b gene showed high genetic divergence (13.8%) between wild and bred samples of tiger frog. Unexpected genetic divergence of the complete mt genome (14.0%) was also observed between wild and bred samples of tiger frog. Yet, the nuclear genes (NCX1, Rag1, Rhod, Tyr) showed little divergence between them. Despite this and their very similar morphology, the features of the mitochondrial genome, including genetic divergence of other genes, different three-dimensional structures of ND5 proteins, and gene rearrangements, indicate that H. rugulosus may be a cryptic species complex. Using Bayesian inference, maximum likelihood, and maximum parsimony analyses, Hoplobatrachus was resolved as a sister clade to Euphlyctis, and H. rugulosus (BT) as a sister clade to H. rugulosus (WT). We suggest that Thailand tiger frogs (bred type) should be prevented from escaping into wild environments lest they produce hybrids with Chinese tiger frogs (wild type). PMID:25875761
Rothwell, Gar W; Van Atta, Michelle R; Ballard, Harvey E; Stockey, Ruth A
2004-02-01
We test competing hypotheses of relationships among Aroids (Araceae) and duckweeds (Lemnaceae) using sequences of the trnL-trnF spacer region of the chloroplast genome. Included in the analysis were 22 aroid genera including Pistia and five genera of Lemnaceae including the recently segregated genus Landoltia. Aponogeton was used as an outgroup to root the tree. A data set of 522 aligned nucleotides yielded maximum parsimony and maximum likelihood trees similar to those previously derived from restriction site data. Pistia and the Lemnaceae are placed in two separate and well-supported clades, suggesting at least two independent origins of the floating aquatic growth form within the aroid clade. Within the Lemnaceae there is only partial support for the paradigm of sequential morphological reduction, given that Wolffia is sister to Wolffiella+Lemna. As in the results of the restriction site analysis, pantropical Pistia is placed with Colocasia and Typhonium of southeastern Asia, indicative of Old World affinities. Branch lengths leading to duckweed terminal taxa are much longer relative to other ingroup taxa (including Pistia), evidently as a result of higher rates of nucleotide substitutions and insertion/deletion events. Morphological reduction within the duckweeds roughly correlates with accelerated chloroplast genome evolution.
Shin, Seunggwan; Jung, Sunghoon; Menzel, Frank; Heller, Kai; Lee, Heungsik; Lee, Seunghwan
2013-03-01
The phylogeny of the family Sciaridae is reconstructed, based on maximum likelihood, maximum parsimony, and Bayesian analyses of 4809 bp from two mitochondrial (COI and 16S) and two nuclear (18S and 28S) genes for 100 taxa including the outgroup taxa. According to the present phylogenetic analyses, Sciaridae comprise three subfamilies and two genus groups: Sciarinae, Chaetosciara group, Cratyninae, and Pseudolycoriella group+Megalosphyinae. Our molecular results are largely congruent with one of the former hypotheses based on morphological data with respect to the monophyly of genera and subfamilies (Sciarinae, Megalosphyinae, and part of postulated "new subfamily"); however, the subfamily Cratyninae is shown to be polyphyletic, and the genera Bradysia, Corynoptera, Leptosciarella, Lycoriella, and Phytosciara are also recognized as non-monophyletic groups. While the ancestral larval habitat state of the family Sciaridae, based on Bayesian inference, is dead plant material (plant litter+rotten wood), the common ancestors of Phytosciara and Bradysia are inferred to have had a living-plant habitat. Therefore, shifts in larval habitats from dead plant material to living plants may have occurred within the Sciaridae at least once. Based on the results, we discuss phylogenetic relationships within the family, and present an evolutionary scenario of development of larval habitats. Copyright © 2012 Elsevier Inc. All rights reserved.
The early maximum likelihood estimation model of audiovisual integration in speech perception.
Andersen, Tobias S
2015-05-01
Speech perception is facilitated by seeing the articulatory mouth movements of the talker. This is due to perceptual audiovisual integration, which also causes the McGurk-MacDonald illusion, and for which a comprehensive computational account is still lacking. Decades of research have largely focused on the fuzzy logical model of perception (FLMP), which provides excellent fits to experimental observations but also has been criticized for being too flexible, post hoc and difficult to interpret. The current study introduces the early maximum likelihood estimation (MLE) model of audiovisual integration to speech perception along with three model variations. In early MLE, integration is based on a continuous internal representation before categorization, which can make the model more parsimonious by imposing constraints that reflect experimental designs. The study also shows that cross-validation can evaluate models of audiovisual integration based on typical data sets taking both goodness-of-fit and model flexibility into account. All models were tested on a published data set previously used for testing the FLMP. Cross-validation favored the early MLE while more conventional error measures favored more complex models. This difference between conventional error measures and cross-validation was found to be indicative of over-fitting in more complex models such as the FLMP.
Hodge, Jennifer R; Read, Charmaine I; van Herwerden, Lynne; Bellwood, David R
2012-02-01
We examined how peripherally isolated endemic species may have contributed to the biodiversity of the Indo-Australian Archipelago biodiversity hotspot by reconstructing the evolutionary history of the wrasse genus Anampses. We identified three alternate models of diversification: the vicariance-based 'successive division' model, and the dispersal-based 'successive colonisation' and 'peripheral budding' models. The genus was well suited for this study given its relatively high proportion (42%) of endemic species, its reasonably low diversity (12 species), which permitted complete taxon sampling, and its widespread tropical Indo-Pacific distribution. Monophyly of the genus was strongly supported by three phylogenetic analyses: maximum parsimony, maximum likelihood, and Bayesian inference based on mitochondrial CO1 and 12S rRNA and nuclear S7 sequences. Estimates of species divergence times from fossil-calibrated Bayesian inference suggest that Anampses arose in the mid-Eocene and subsequently diversified throughout the Miocene. Evolutionary relationships within the genus, combined with limited spatial and temporal concordance among endemics, offer support for all three alternate models of diversification. Our findings emphasise the importance of peripherally isolated locations in creating and maintaining endemic species and their contribution to the biodiversity of the Indo-Australian Archipelago. Copyright © 2011 Elsevier Inc. All rights reserved.
Cheng, Tian; Liu, Guo-Hua; Song, Hui-Qun; Lin, Rui-Qing; Zhu, Xing-Quan
2016-03-01
Hymenolepis nana, commonly known as the dwarf tapeworm, is one of the most common tapeworms of humans and rodents and can cause hymenolepiasis. Although this zoonotic tapeworm is of socio-economic significance in many countries of the world, its genetics, systematics, epidemiology, and biology are poorly understood. In the present study, we sequenced and characterized the complete mitochondrial (mt) genome of H. nana. The mt genome is 13,764 bp in size and encodes 36 genes, including 12 protein-coding genes, 2 ribosomal RNA genes, and 22 transfer RNA genes. All genes are transcribed in the same direction. The gene order and genome content are completely identical with those of its congener Hymenolepis diminuta. Phylogenetic analyses based on concatenated amino acid sequences of 12 protein-coding genes by Bayesian inference, maximum likelihood, and maximum parsimony showed the division of class Cestoda into two orders and supported the monophylies of both the orders Cyclophyllidea and Pseudophyllidea. Analyses of mt genome sequences also support the monophylies of the three families Taeniidae, Hymenolepididae, and Diphyllobothriidae. This novel mt genome provides a useful genetic marker for studying the molecular epidemiology, systematics, and population genetics of the dwarf tapeworm and should have implications for the diagnosis, prevention, and control of hymenolepiasis in humans.
von Konrat, Matt; de Lange, Peter; Greif, Matt; Strozier, Lynika; Hentschel, Jörn; Heinrichs, Jochen
2012-01-01
Frullania is a large and taxonomically complex genus. A new liverwort species, Frullania knightbridgei sp. nov. from southern New Zealand, is described and illustrated. The new species, and its placement in Frullania subg. Microfrullania, is based on an integrated evidence-based approach derived from morphology, ecology, experimental growth studies of plasticity, as well as sequence data. Diagnostic characters associated with the leaf and lobule cell-wall anatomy, oil bodies, and spore ultra-structure distinguish it from all other New Zealand species of Frullania. A critical comparison is also made between Frullania knightbridgei and morphologically allied species of botanical regions outside the New Zealand region and an artificial key is provided. The new species is similar to some forms of the widespread Australasian species, Frullania rostrata, but has unique characters associated with the lobule and oil bodies. Frullania knightbridgei is remarkably interesting in comparison with the majority of Frullania species, and indeed liverworts in general, in that it is at least partially halotolerant. Maximum parsimony and maximum likelihood analyses of nuclear ribosomal ITS2 and plastidic trnL-trnF sequences from purported related species confirm its independent taxonomic status and corroborate its placement within Frullania subg. Microfrullania. PMID:22287928
Attigala, Lakshmi; Wysocki, William P; Duvall, Melvin R; Clark, Lynn G
2016-08-01
We explored phylogenetic relationships among the twelve lineages of the temperate woody bamboo clade (tribe Arundinarieae) based on plastid genome (plastome) sequence data. A representative sample of 28 taxa was used and maximum parsimony, maximum likelihood and Bayesian inference analyses were conducted to estimate the Arundinarieae phylogeny. All the previously recognized clades of Arundinarieae were supported, with Ampelocalamus calcareus (Clade XI) as sister to the rest of the temperate woody bamboos. Well supported sister relationships between Bergbambos tessellata (Clade I) and Thamnocalamus spathiflorus (Clade VII) and between Kuruna (Clade XII) and Chimonocalamus (Clade III) were revealed by the current study. The plastome topology was tested by taxon removal experiments and alternative hypothesis testing and the results supported the current plastome phylogeny as robust. Neighbor-net analyses showed few phylogenetic signal conflicts, but suggested some potentially complex relationships among these taxa. Analyses of morphological character evolution of rhizomes and reproductive structures revealed that pachymorph rhizomes were most likely the ancestral state in Arundinarieae. In contrast, leptomorph rhizomes either evolved once with reversions to the pachymorph condition or multiple times in Arundinarieae. Further, pseudospikelets evolved independently at least twice in the Arundinarieae, but the ancestral state is ambiguous. Copyright © 2016 Elsevier Inc. All rights reserved.
Ala-Aho, Pertti; Tetzlaff, Doerthe; McNamara, James P; Laudon, Hjalmar; Kormos, Patrick; Soulsby, Chris
2017-07-01
Use of stable water isotopes has become increasingly popular in quantifying water flow paths and travel times in hydrological systems using tracer-aided modeling. In snow-influenced catchments, snowmelt produces a traceable isotopic signal, which differs from original snowfall isotopic composition because of isotopic fractionation in the snowpack. These fractionation processes in snow are relatively well understood, but representing their spatiotemporal variability in tracer-aided studies remains a challenge. We present a novel, parsimonious modeling method to account for the snowpack isotope fractionation and estimate isotope ratios in snowmelt water in a fully spatially distributed manner. Our model introduces two calibration parameters that alone account for the isotopic fractionation caused by sublimation from interception and ground snow storage, and snowmelt fractionation progressively enriching the snowmelt runoff. The isotope routines are linked to a generic process-based snow interception-accumulation-melt model facilitating simulation of spatially distributed snowmelt runoff. We use a synthetic modeling experiment to demonstrate the functionality of the model algorithms in different landscape locations and under different canopy characteristics. We also provide a proof-of-concept model test and successfully reproduce isotopic ratios in snowmelt runoff sampled with snowmelt lysimeters in two long-term experimental catchments with contrasting winter conditions. To our knowledge, the method is the first such tool to allow estimation of the spatially distributed nature of isotopic fractionation in snowpacks and the resulting isotope ratios in snowmelt runoff. The method can thus provide a useful tool for tracer-aided modeling to better understand the integrated nature of flow, mixing, and transport processes in snow-influenced catchments.
Uchoi, Ajit; Malik, Surendra Kumar; Choudhary, Ravish; Kumar, Susheel; Rohini, M R; Pal, Digvender; Ercisli, Sezai; Chaudhury, Rekha
2016-06-01
Phylogenetic relationships of Indian Citron (Citrus medica L.) with other important Citrus species have been inferred through sequence analyses of the rbcL and matK gene regions of chloroplast DNA. The study was based on 23 accessions of Citrus genotypes representing 15 taxa of Indian Citrus, collected from wild, semi-wild, and domesticated stocks. The phylogeny was inferred using the maximum parsimony (MP) and neighbor-joining (NJ) methods. Both MP and NJ trees separated all the 23 accessions of Citrus into five distinct clusters. The chloroplast DNA (cpDNA) analysis based on rbcL and matK sequence data carried out in Indian taxa of Citrus was useful in differentiating all the true species and species/varieties of probable hybrid origin in distinct clusters or groups. Sequence analysis based on the rbcL and matK genes provided unambiguous identification and disposition of true species like C. maxima, C. medica, C. reticulata, and related hybrids/cultivars. The separation of C. maxima, C. medica, and C. reticulata in distinct clusters or sub-clusters supports their distinctiveness as the basic species of edible Citrus. However, the cpDNA sequence analysis of the rbcL and matK genes could not find any clear-cut differentiation between subgenera Citrus and Papeda as proposed in Swingle's system of classification.
A phylogenetic study of Laeliinae (Orchidaceae) based on combined nuclear and plastid DNA sequences
van den Berg, Cássio; Higgins, Wesley E.; Dressler, Robert L.; Whitten, W. Mark; Soto-Arenas, Miguel A.; Chase, Mark W.
2009-01-01
Background and Aims Laeliinae are a neotropical orchid subtribe with approx. 1500 species in 50 genera. In this study, an attempt is made to assess generic alliances based on molecular phylogenetic analysis of DNA sequence data. Methods Six DNA datasets were gathered: plastid trnL intron, trnL-F spacer, matK gene and trnK introns upstream and downstream from matK and nuclear ITS rDNA. Data were analysed with maximum parsimony (MP) and Bayesian analysis with mixed models (BA). Key Results Although relationships between Laeliinae and outgroups are well supported, within the subtribe sequence variation is low considering the broad taxonomic range covered. Localized incongruence between the ITS and plastid trees was found. A combined tree followed the ITS trees more closely, but the levels of support obtained with MP were low. The Bayesian analysis recovered more well-supported nodes. The trees from combined MP and BA allowed eight generic alliances to be recognized within Laeliinae, all of which show trends in morphological characters but lack unambiguous synapomorphies. Conclusions By using combined plastid and nuclear DNA data in conjunction with mixed-models Bayesian inference, it is possible to delimit smaller groups within Laeliinae and discuss general patterns of pollination and hybridization compatibility. Furthermore, these small groups can now be used for further detailed studies to explain morphological evolution and diversification patterns within the subtribe. PMID:19423551
NASA Astrophysics Data System (ADS)
Martinez, Guillermo F.; Gupta, Hoshin V.
2011-12-01
Methods to select parsimonious and hydrologically consistent model structures are useful for evaluating dominance of hydrologic processes and representativeness of data. While information criteria (appropriately constrained to obey underlying statistical assumptions) can provide a basis for evaluating appropriate model complexity, it is not sufficient to rely upon the principle of maximum likelihood (ML) alone. We suggest that one must also call upon a "principle of hydrologic consistency," meaning that selected ML structures and parameter estimates must be constrained (as well as possible) to reproduce desired hydrological characteristics of the processes under investigation. This argument is demonstrated in the context of evaluating the suitability of candidate model structures for lumped water balance modeling across the continental United States, using data from 307 snow-free catchments. The models are constrained to satisfy several tests of hydrologic consistency, a flow space transformation is used to ensure better consistency with underlying statistical assumptions, and information criteria are used to evaluate model complexity relative to the data. The results clearly demonstrate that the principle of consistency provides a sensible basis for guiding selection of model structures and indicate strong spatial persistence of certain model structures across the continental United States. Further work to untangle reasons for model structure predominance can help to relate conceptual model structures to physical characteristics of the catchments, facilitating the task of prediction in ungaged basins.
Friesen, Vicki L.; Baker, Allan J.; Piatt, John F.
1996-01-01
The Alcidae is a unique assemblage of Northern Hemisphere seabirds that forage by "flying" underwater. Despite obvious affinities among the species, their evolutionary relationships are unclear. We analyzed nucleotide sequences of 1,045 base pairs of the mitochondrial cytochrome b gene and allelic profiles for 37 allozyme loci in all 22 extant species. Trees were constructed on independent and combined data sets using maximum parsimony and distance methods that correct for superimposed changes. Alternative methods of analysis produced only minor differences in relationships that were supported strongly by bootstrapping or standard error tests. Combining sequence and allozyme data into a single analysis provided the greatest number of relationships receiving strong support. Addition of published morphological and ecological data did not improve support for any additional relationship. All analyses grouped species into six distinct lineages: (1) the dovekie (Alle alle) and auks, (2) guillemots, (3) brachyramphine murrelets, (4) synthliboramphine murrelets, (5) true auklets, and (6) the rhinoceros auklet (Cerorhinca monocerata) and puffins. The two murres (genus Uria) were sister taxa, and the black guillemot (Cepphus grylle) was basal to the other guillemots. The Asian subspecies of the marbled murrelet (Brachyramphus marmoratus perdix) was the most divergent brachyramphine murrelet, and two distinct lineages occurred within the synthliboramphine murrelets. Cassin's auklet (Ptychoramphus aleuticus) and the rhinoceros auklet were basal to the other auklets and puffins, respectively, and the Atlantic (Fratercula arctica) and horned (Fratercula corniculata) puffins were sister taxa. Several relationships among tribes, among the dovekie and auks, and among the auklets could not be resolved but resembled "star" phylogenies indicative of adaptive radiations at different depths within the trees.
Alstrup, Jan; Jørgensen, Mikkel; Medford, Andrew J; Krebs, Frederik C
2010-10-01
We present a technique that enables the probing of the entire parameter space for each parameter with good statistics through a simple roll-to-roll processing method where gradients of donor, acceptor, and solvent are applied by differentially pumped slot-die coating. We thus demonstrate how the optimum donor-acceptor ratio and device film thickness can be determined with improved accuracy by varying the composition in small steps. We give as an example P3HT-PCBM devices and vary the composition between P3HT and PCBM in steps of 0.5-1% giving 100-200 individual solar cells. The coating experiment itself takes less than 4-8 min and requires 15-30 mg each of donor and acceptor material. The optimum donor-acceptor composition of P3HT and PCBM was found to be a broad maximum centered on a 1:1 ratio. We demonstrate how the optimal thickness of the active layer can be found by the same method and materials usage by variation of the layer thickness in small steps of 1.5-4 nm. Contrary to expectation we did not find oscillatory variation of the device performance with device thickness because of optical interference. We ascribe this to the nature of the solar cell type explored in this example that employs nonreflective or semitransparent printed electrodes. We further found that very thick active layers on the order of 1 μm can be prepared without loss in performance and estimate the active layer thickness could easily approach 4-5 μm while maintaining photovoltaic properties.
Polynomial Supertree Methods Revisited
Brinkmeyer, Malte; Griebel, Thasso; Böcker, Sebastian
2011-01-01
Supertree methods allow the reconstruction of large phylogenetic trees by combining smaller trees with overlapping leaf sets into one, more comprehensive supertree. The most commonly used supertree method, matrix representation with parsimony (MRP), produces accurate supertrees but is rather slow due to the underlying hard optimization problem. In this paper, we present an extensive simulation study comparing the performance of MRP and the polynomial supertree methods MinCut Supertree, Modified MinCut Supertree, Build-with-distances, PhySIC, PhySIC_IST, and super distance matrix. We consider both quality and resolution of the reconstructed supertrees. Our findings illustrate the tradeoff between accuracy and running time in supertree construction, as well as the pros and cons of voting- and veto-based supertree approaches. Based on our results, we make some general suggestions for supertree methods yet to come. PMID:22229028
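As an illustration of the matrix-representation step named in the abstract above, the following is a minimal sketch of the standard binary coding commonly used for MRP: every non-root internal node of a source tree becomes one character, scored 1 for taxa inside that clade, 0 for taxa in the same tree but outside it, and ? for taxa absent from the tree. The nested-tuple tree format and the function names are assumptions made for this example, not anything taken from the paper, and the parsimony search over the resulting matrix (the slow part) is not shown.

```python
def leaf_set(tree):
    """Return the set of taxon names under a node (a str leaf or a tuple of children)."""
    if isinstance(tree, str):
        return frozenset([tree])
    return frozenset().union(*(leaf_set(child) for child in tree))


def clades(tree, is_root=True):
    """Yield the leaf set of every internal node except the root."""
    if isinstance(tree, str):
        return
    if not is_root:
        yield leaf_set(tree)
    for child in tree:
        yield from clades(child, is_root=False)


def mrp_matrix(trees):
    """Build the character matrix: one column per non-root clade of each source tree."""
    taxa = sorted(set().union(*(leaf_set(t) for t in trees)))
    rows = {name: [] for name in taxa}
    for tree in trees:
        present = leaf_set(tree)
        for clade in clades(tree):
            for name in taxa:
                if name not in present:
                    rows[name].append("?")  # taxon missing from this source tree
                else:
                    rows[name].append("1" if name in clade else "0")
    return {name: "".join(states) for name, states in rows.items()}


# Two hypothetical source trees with overlapping leaf sets
source_trees = [(("A", "B"), ("C", "D")), (("B", "C"), "E")]
matrix = mrp_matrix(source_trees)
for taxon, states in matrix.items():
    print(taxon, states)
```

The resulting matrix would then be handed to a parsimony program to search for the supertree; that search is the hard optimization problem the abstract refers to.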
A Pareto-optimal moving average multigene genetic programming model for daily streamflow prediction
NASA Astrophysics Data System (ADS)
Danandeh Mehr, Ali; Kahya, Ercan
2017-06-01
Genetic programming (GP) is able to systematically explore alternative model structures of different accuracy and complexity from observed input and output data. The effectiveness of GP in hydrological system identification has been recognized in recent studies. However, selecting a parsimonious (accurate and simple) model from such alternatives remains an open question. This paper proposes a Pareto-optimal moving average multigene genetic programming (MA-MGGP) approach to develop a parsimonious model for single-station streamflow prediction. The three main components of the approach that take us from observed data to a validated model are: (1) data pre-processing, (2) system identification and (3) system simplification. The data pre-processing ingredient uses a simple moving average filter to diminish the lagged prediction effect of stand-alone data-driven models. The multigene ingredient of the model tends to identify the underlying nonlinear system with expressions simpler than those of classical monolithic GP, and the simplification component then uses a Pareto front plot to select a parsimonious model through an interactive complexity-efficiency trade-off. The approach was tested using the daily streamflow records from a station on Senoz Stream, Turkey. Compared with the efficiency results of stand-alone GP, MGGP, and conventional multiple linear regression prediction models as benchmarks, the proposed Pareto-optimal MA-MGGP model yielded a parsimonious solution, which is of noteworthy importance for practical application. In addition, the approach allows the user to bring human insight into the problem to examine evolved models and pick the best performing programs out for further analysis.
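The pre-processing step in the abstract above relies on a simple moving average filter. A minimal sketch of such a filter follows; the trailing-window form, the function name, and the sample values are assumptions for illustration, since the abstract does not state the window length or alignment used in the study.

```python
def moving_average(series, window):
    """Trailing simple moving average: one output per full window."""
    if window < 1 or window > len(series):
        raise ValueError("window must be between 1 and len(series)")
    running = sum(series[:window])
    smoothed = [running / window]
    for i in range(window, len(series)):
        # Slide the window: add the newest value, drop the oldest
        running += series[i] - series[i - window]
        smoothed.append(running / window)
    return smoothed


# Hypothetical daily streamflow values (not data from the study)
flows = [1, 2, 3, 4, 5]
print(moving_average(flows, 3))  # [2.0, 3.0, 4.0]
```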
NASA Technical Reports Server (NTRS)
Achenbach-Richter, L.; Gupta, R.; Zillig, W.; Woese, C. R.
1988-01-01
The sequence of the 16S ribosomal RNA gene from the archaebacterium Thermococcus celer shows the organism to be related to the methanogenic archaebacteria rather than to its phenotypic counterparts, the extremely thermophilic archaebacteria. This conclusion turns on the position of the root of the archaebacterial phylogenetic tree, however. The problems encountered in rooting this tree are analyzed in detail. Under conditions that suppress evolutionary noise both the parsimony and evolutionary distance methods yield a root location (using a number of eubacterial or eukaryotic outgroup sequences) that is consistent with that determined by an "internal rooting" method, based upon an (approximate) determination of relative evolutionary rates.
A Scalable Approach to Probabilistic Latent Space Inference of Large-Scale Networks
Yin, Junming; Ho, Qirong; Xing, Eric P.
2014-01-01
We propose a scalable approach for making inference about latent spaces of large networks. With a succinct representation of networks as a bag of triangular motifs, a parsimonious statistical model, and an efficient stochastic variational inference algorithm, we are able to analyze real networks with over a million vertices and hundreds of latent roles on a single machine in a matter of hours, a setting that is out of reach for many existing methods. When compared to the state-of-the-art probabilistic approaches, our method is several orders of magnitude faster, with competitive or improved accuracy for latent space recovery and link prediction. PMID:25400487
DOE Office of Scientific and Technical Information (OSTI.GOV)
Leebens-Mack, Jim; Raubeson, Linda A.; Cui, Liying
2005-05-27
While there has been strong support for Amborella and Nymphaeales (water lilies) as branching from basal-most nodes in the angiosperm phylogeny, this hypothesis has recently been challenged by phylogenetic analyses of 61 protein-coding genes extracted from the chloroplast genome sequences of Amborella, Nymphaea and 12 other available land plant chloroplast genomes. These character-rich analyses placed the monocots, represented by three grasses (Poaceae), as sister to all other extant angiosperm lineages. We have extracted protein-coding regions from draft sequences for six additional chloroplast genomes to test whether this surprising result could be an artifact of long-branch attraction due to limited taxon sampling. The added taxa include three monocots (Acorus, Yucca and Typha), a water lily (Nuphar), a ranunculid (Ranunculus), and a gymnosperm (Ginkgo). Phylogenetic analyses of the expanded DNA and protein datasets together with microstructural characters (indels) provided unambiguous support for Amborella and the Nymphaeales as branching from the basal-most nodes in the angiosperm phylogeny. However, their relative positions proved to be dependent on method of analysis, with parsimony favoring Amborella as sister to all other angiosperms, and maximum likelihood and neighbor-joining methods favoring an Amborella + Nymphaeales clade as sister. The maximum likelihood phylogeny supported the latter hypothesis, but the likelihood for the former hypothesis was not significantly different. Parametric bootstrap analysis, single gene phylogenies, estimated divergence dates and conflicting indel characters all help to illuminate the nature of the conflict in resolution of the most basal nodes in the angiosperm phylogeny. Molecular dating analyses provided median age estimates of 161 mya for the most recent common ancestor of all extant angiosperms and 145 mya for the most recent common ancestor of monocots, magnoliids and eudicots.
Whereas long sequences reduce variance in branch lengths and molecular dating estimates, the impact of improved taxon sampling on the rooting of the angiosperm phylogeny together with the results of parametric bootstrap analyses demonstrate how long-branch attraction can mislead genome-scale phylogenetic analyses.
Zhou, Shang-Ming; Lyons, Ronan A.; Brophy, Sinead; Gravenor, Mike B.
2012-01-01
The Takagi-Sugeno (TS) fuzzy rule system is a widely used data mining technique, and is of particular use in the identification of non-linear interactions between variables. However the number of rules increases dramatically when applied to high dimensional data sets (the curse of dimensionality). Few robust methods are available to identify important rules while removing redundant ones, and this results in limited applicability in fields such as epidemiology or bioinformatics where the interaction of many variables must be considered. Here, we develop a new parsimonious TS rule system. We propose three statistics: R, L, and ω-values, to rank the importance of each TS rule, and a forward selection procedure to construct a final model. We use our method to predict how key components of childhood deprivation combine to influence educational achievement outcome. We show that a parsimonious TS model can be constructed, based on a small subset of rules, that provides an accurate description of the relationship between deprivation indices and educational outcomes. The selected rules shed light on the synergistic relationships between the variables, and reveal that the effect of targeting specific domains of deprivation is crucially dependent on the state of the other domains. Policy decisions need to incorporate these interactions, and deprivation indices should not be considered in isolation. The TS rule system provides a basis for such decision making, and has wide applicability for the identification of non-linear interactions in complex biomedical data. PMID:23272108
Bayesian Model Averaging of Artificial Intelligence Models for Hydraulic Conductivity Estimation
NASA Astrophysics Data System (ADS)
Nadiri, A.; Chitsazan, N.; Tsai, F. T.; Asghari Moghaddam, A.
2012-12-01
This research presents a Bayesian artificial intelligence model averaging (BAIMA) method that incorporates multiple artificial intelligence (AI) models to estimate hydraulic conductivity and evaluate estimation uncertainties. Uncertainty in the AI model outputs stems from error in model input as well as non-uniqueness in selecting different AI methods. Using one single AI model tends to bias the estimation and underestimate uncertainty. BAIMA employs Bayesian model averaging (BMA) technique to address the issue of using one single AI model for estimation. BAIMA estimates hydraulic conductivity by averaging the outputs of AI models according to their model weights. In this study, the model weights were determined using the Bayesian information criterion (BIC) that follows the parsimony principle. BAIMA calculates the within-model variances to account for uncertainty propagation from input data to AI model output. Between-model variances are evaluated to account for uncertainty due to model non-uniqueness. We employed Takagi-Sugeno fuzzy logic (TS-FL), artificial neural network (ANN) and neurofuzzy (NF) to estimate hydraulic conductivity for the Tasuj plain aquifer, Iran. BAIMA combined three AI models and produced better fitting than individual models. While NF was expected to be the best AI model owing to its utilization of both TS-FL and ANN models, the NF model is nearly discarded by the parsimony principle. The TS-FL model and the ANN model showed equal importance although their hydraulic conductivity estimates were quite different. This resulted in significant between-model variances that are normally ignored by using one AI model.
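The averaging scheme described in the abstract above can be sketched as follows. This is a minimal illustration under stated assumptions: model weights are taken from the common approximation w_i ∝ exp(-ΔBIC_i / 2) with equal prior model probabilities, which may differ from the exact weighting used in the study, and all function names and numbers are hypothetical.

```python
import math


def bic_weights(bic_values):
    """Approximate model weights from BIC, assuming equal prior model probabilities."""
    best = min(bic_values)
    # Subtracting the minimum keeps the exponentials numerically stable
    raw = [math.exp(-0.5 * (bic - best)) for bic in bic_values]
    total = sum(raw)
    return [r / total for r in raw]


def bma_estimate(means, variances, weights):
    """Return the averaged estimate plus within-model and between-model variance."""
    mean = sum(w * m for w, m in zip(weights, means))
    within = sum(w * v for w, v in zip(weights, variances))
    between = sum(w * (m - mean) ** 2 for w, m in zip(weights, means))
    return mean, within, between


# Hypothetical BIC values and per-model estimates for three models
weights = bic_weights([100.0, 100.0, 120.0])
mean, within, between = bma_estimate([2.0, 4.0, 3.0], [0.1, 0.1, 0.1], weights)
print(weights, mean, within, between)
```

In this made-up example the first two models receive nearly equal weight while the third, with a BIC 20 units higher, is almost discarded, and the disagreement between the two retained models shows up as between-model variance. That mirrors the qualitative pattern the abstract reports for TS-FL, ANN and NF.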
Vertebrate endemism in south-eastern Africa numerically redefines a biodiversity hotspot.
Perera, Sandun J; Procheş, Şerban; Ratnayake-Perera, Dayani; Ramdhani, Syd
2018-02-20
We use numerical methods to explore patterns of vertebrate endemism in south-eastern Africa, refining the boundaries of the intuitively-defined Maputaland-Pondoland-Albany biodiversity hotspot and proposing a zoogeographic regionalisation. An incidence matrix of 300 vertebrate species endemic to south-eastern Africa sensu lato in 37 operational geographic units was used in (a) phenetic cluster analysis (PCA) using the unweighted pair-group method with arithmetic averages (phenetic approach), and (b) parsimony analysis of endemicity (PAE; parsimony approach), in order to numerically evaluate the bioregional delimitations. The analyses provide a valid biogeographical entity 37% larger than the Maputaland-Pondoland-Albany hotspot, but substantially (131%) higher in vertebrate endemicity, viz. the Greater Maputaland-Pondoland-Albany (GMPA) region of vertebrate endemism. South-east Africa is recognised as a dominion in the global zoogeographical area hierarchy, with subordinate units including the GMPA province. Various spatially-based measures of endemism were mapped for vertebrate species restricted to the dominion, i.e. endemic to south-eastern Africa sensu stricto. Areas and centres of endemism detected from PAE and PCA, respectively, within the south-east Africa dominion also support the refined boundary of the GMPA region of endemism, which provides a better spatial conservation priority compared to the Maputaland-Pondoland-Albany hotspot. Reptiles and amphibians are found to be the main drivers of the overall pattern of endemism, while the pattern in freshwater fish is the most distinctive. Our analyses also indicate a good congruence of the centres of endemism across different terrestrial vertebrate taxa.
Bastani, Meysam; Vos, Larissa; Asgarian, Nasimeh; Deschenes, Jean; Graham, Kathryn; Mackey, John; Greiner, Russell
2013-01-01
Background Selecting the appropriate treatment for breast cancer requires accurately determining the estrogen receptor (ER) status of the tumor. However, the standard for determining this status, immunohistochemical analysis of formalin-fixed paraffin embedded samples, suffers from numerous technical and reproducibility issues. Assessment of ER-status based on RNA expression can provide more objective, quantitative and reproducible test results. Methods To learn a parsimonious RNA-based classifier of hormone receptor status, we applied a machine learning tool to a training dataset of gene expression microarray data obtained from 176 frozen breast tumors, whose ER-status was determined by applying ASCO-CAP guidelines to standardized immunohistochemical testing of formalin fixed tumor. Results This produced a three-gene classifier that can predict the ER-status of a novel tumor, with a cross-validation accuracy of 93.17±2.44%. When applied to an independent validation set and to four other public databases, some on different platforms, this classifier obtained over 90% accuracy in each. In addition, we found that this prediction rule separated the patients' recurrence-free survival curves with a hazard ratio lower than the one based on the IHC analysis of ER-status. Conclusions Our efficient and parsimonious classifier lends itself to high throughput, highly accurate and low-cost RNA-based assessments of ER-status, suitable for routine high-throughput clinical use. This analytic method provides a proof-of-principle that may be applicable to developing effective RNA-based tests for other biomarkers and conditions. PMID:24312637
Tarasov, Sergei; Génier, François
2015-01-01
Scarabaeine dung beetles are the dominant dung feeding group of insects and are widely used as model organisms in conservation, ecology and developmental biology. Due to the conflicts among 13 recently published phylogenies dealing with the higher-level relationships of dung beetles, the phylogeny of this lineage remains largely unresolved. In this study, we conduct rigorous phylogenetic analyses of dung beetles, based on an unprecedented taxon sample (110 taxa) and detailed investigation of morphology (205 characters). We provide the description of morphology and thoroughly illustrate the used characters. Along with parsimony, traditionally used in the analysis of morphological data, we also apply the Bayesian method with a novel approach that uses anatomy ontology for matrix partitioning. This approach allows for heterogeneity in evolutionary rates among characters from different anatomical regions. Anatomy ontology generates a number of parameter-partition schemes which we compare using Bayes factor. We also test the effect of inclusion of autapomorphies in the morphological analysis, which hitherto has not been examined. Generally, schemes with more parameters were favored in the Bayesian comparison suggesting that characters located on different body regions evolve at different rates and that partitioning of the data matrix using anatomy ontology is reasonable; however, trees from the parsimony and all the Bayesian analyses were quite consistent. The hypothesized phylogeny reveals many novel clades and provides additional support for some clades recovered in previous analyses. Our results provide a solid basis for a new classification of dung beetles, in which the taxonomic limits of the tribes Dichotomiini, Deltochilini and Coprini are restricted and many new tribes must be described. 
Based on the consistency of the phylogeny with biogeography, we speculate that dung beetles may have originated in the Mesozoic contrary to the traditional view pointing to a Cenozoic origin. PMID:25781019
Salazar, Gerardo A.; Cabrera, Lidia I.; Madriñán, Santiago; Chase, Mark W.
2009-01-01
Background and Aims Phylogenetic relationships of subtribes Cranichidinae and Prescottiinae, two diverse groups of neotropical terrestrial orchids, are not satisfactorily understood. A previous molecular phylogenetic study supported monophyly for Cranichidinae, but Prescottiinae consisted of two clades not sister to one another. However, that analysis included only 11 species and eight genera of these subtribes. Here, plastid and nuclear DNA sequences are analysed for an enlarged sample of genera and species of Cranichidinae and Prescottiinae with the aim of clarifying their relationships, evaluating the phylogenetic position of the monospecific genera Exalaria, Ocampoa and Pseudocranichis and examining the value of various structural traits as taxonomic markers. Methods Approx. 6000 bp of nucleotide sequences from nuclear ribosomal (ITS) and plastid DNA (rbcL, matK-trnK and trnL-trnF) were analysed with cladistic parsimony and Bayesian inference for 45 species/14 genera of Cranichidinae and Prescottiinae (plus suitable outgroups). The utility of flower orientation, thickenings of velamen cell walls, hamular viscidium and pseudolabellum to mark clades recovered by the molecular analysis was assessed by tracing these characters on the molecular trees. Key Results Spiranthinae, Cranichidinae, paraphyletic Prescottia (with Pseudocranichis embedded), and a group of mainly Andean ‘prescottioid’ genera (the ‘Stenoptera clade’) were strongly supported. Relationships among these clades were unresolved by parsimony but the Bayesian tree provided moderately strong support for the resolution (Spiranthinae–(Stenoptera clade-(Prescottia/Pseudocranichis–Cranichidinae))). Three of the four structural characters mark clades on the molecular trees, but the possession of a pseudolabellum is variable in the polyphyletic Ponthieva. Conclusions No evidence was found for monophyly of Prescottiinae and the reinstatement of Cranichidinae s.l. (including the genera of ‘Prescottiinae’) is favoured. Cranichidinae s.l. are diagnosed by non-resupinate flowers. Lack of support from parsimony for relationships among the major clades of core spiranthids is suggestive of a rapid morphological radiation or a slow rate of molecular evolution. PMID:19136493
Côrtes, Ana Luiza A; Rapini, Alessandro; Daniel, Thomas F
2015-06-01
The Tetramerium lineage (Acanthaceae) presents a striking ecological structuring in South America, with groups concentrated in moist forests or in seasonally dry forests. In this study, we investigate the circumscription and relationships of the South American genera as a basis for better understanding historic interactions between dry and moist biomes in the Neotropics. We dated the ancestral distribution of the Tetramerium lineage based on one nuclear and four plastid DNA regions. Maximum parsimony, maximum likelihood, and Bayesian inference analyses were performed for this study using 104 terminals. Phylogenetic divergences were dated using a relaxed molecular clock approach and ancestral distributions obtained from dispersal-vicariance analyses. The genera Pachystachys, Schaueria, and Thyrsacanthus are nonmonophyletic. A dry forest lineage dispersed from North America to South America and reached the southwestern part of the continent between the end of the Miocene and beginning of the Pleistocene. This period coincides with the segregation between Amazonian and Atlantic moist forests that established the geographic structure currently found in the group. The South American genera Pachystachys, Schaueria, and Thyrsacanthus need to be recircumscribed. The congruence among biogeographical events found for the Tetramerium lineage suggests that the dry forest centers currently dispersed throughout South America are relatively old remnants, probably isolated since the Neogene, much earlier than the Last Glacial Maximum postulated by the Pleistocene Arc hypothesis. In addition to exploring the Pleistocene Arc hypothesis, this research also informs evolution in a lineage with numerous geographically restricted and threatened species. © 2015 Botanical Society of America, Inc.
Xing, Rui; Gao, Qing-Bo; Zhang, Fa-Qi; Fu, Peng-Cheng; Wang, Jiu-Li; Yan, Hui-Ying; Chen, Shi-Long
2017-08-01
Floccularia luteovirens, an ectomycorrhizal fungus, is widely distributed on the Qinghai-Tibet Plateau. As an edible fungus, it is famous for its unique flavor. Previous studies have mainly focused on the chemical composition and genetic structure of this species. However, the phylogenetic relationship between genotypes remains unknown. In this study, the genetic variation and phylogenetic relationship between the genotypes of F. luteovirens on the Qinghai-Tibet Plateau were estimated through the analysis of two protein-coding genes (rpb1 and ef-1α) from 398 individuals collected from 24 wild populations. The sample covered the entire range of this species during all the growth seasons from 2011 to 2015. Thirteen genotypes were detected and moderate genetic diversity was revealed. Based on the results of network analysis and of the maximum likelihood (ML), maximum parsimony (MP), and Bayesian inference (BI) analyses, the genotypes H-1, H-4, H-6, H-8, H-10, and H-11 were grouped into one clade. Additionally, relatively high genotype diversity (average h value of 0.722) and unique genotypes were found on the northeast edge of the Qinghai-Tibet Plateau; combined with the results of mismatch analysis and neutrality tests, this indicated that the southeast Qinghai-Tibet Plateau was a refuge for F. luteovirens during the historical geological or climatic events (uplifting of the Qinghai-Tibet Plateau or Last Glacial Maximum). Furthermore, the present distribution of the species on the Qinghai-Tibet Plateau has resulted from recent population expansion. Our findings provide a foundation for the future study of the evolutionary history and the speciation of this species.
Hierarchical Adaptive Regression Kernels for Regression with Functional Predictors.
Woodard, Dawn B; Crainiceanu, Ciprian; Ruppert, David
2013-01-01
We propose a new method for regression using a parsimonious and scientifically interpretable representation of functional predictors. Our approach is designed for data that exhibit features such as spikes, dips, and plateaus whose frequency, location, size, and shape varies stochastically across subjects. We propose Bayesian inference of the joint functional and exposure models, and give a method for efficient computation. We contrast our approach with existing state-of-the-art methods for regression with functional predictors, and show that our method is more effective and efficient for data that include features occurring at varying locations. We apply our methodology to a large and complex dataset from the Sleep Heart Health Study, to quantify the association between sleep characteristics and health outcomes. Software and technical appendices are provided in online supplemental materials.
Taxonomic relationships among Phenacomys voles as inferred by cytochrome b
Bellinger, M.R.; Haig, S.M.; Forsman, E.D.; Mullins, T.D.
2005-01-01
Taxonomic relationships among red tree voles (Phenacomys longicaudus longicaudus, P. l. silvicola), the Sonoma tree vole (P. pomo), the white-footed vole (P. albipes), and the heather vole (P. intermedius) were examined using 664 base pairs of the mitochondrial cytochrome b gene. Results indicate specific differences among red tree voles, Sonoma tree voles, white-footed voles, and heather voles, but no clear difference between the 2 Oregon subspecies of red tree voles (P. l. longicaudus and P. l. silvicola). Our data further indicated a close relationship between tree voles and albipes, validating inclusion of albipes in the subgenus Arborimus. These 3 congeners shared a closer relationship to P. intermedius than to other arvicolids. A moderate association between pomo and albipes was indicated by maximum parsimony and neighbor-joining phylogenetic analyses. Molecular clock estimates suggest a Pleistocene radiation of the Arborimus clade, which is concordant with pulses of diversification observed in other murid rodents. The generic rank of Arborimus is subject to interpretation of data.
de Jong, W W; Zweers, A; Versteeg, M; Dessauer, H C; Goodman, M
1985-11-01
The amino acid sequences of the eye lens protein alpha-crystallin A from many mammalian and avian species, two frog species, and a dogfish have provided detailed information about the molecular evolution of this protein and allowed some useful inferences about phylogenetic relationships among these species. We now have isolated and sequenced the alpha-crystallins of the American alligator and the common tegu lizard. The reptilian alpha A chains appear to have evolved as slowly as those of other vertebrates, i.e., at two to three amino acid replacements per 100 residues in 100 Myr. The lack of charged replacements and the general types and distribution of replacements also are similar to those in other vertebrate alpha A chains. Maximum-parsimony analyses of the total data set of 67 vertebrate alpha A sequences support the monophyletic origin of alligator, tegu, and birds and favor the grouping of crocodilians and birds as surviving sister groups in the subclass Archosauria.
Bunawan, Hamidun; Yen, Choong Chee; Yaakop, Salmah; Noor, Normah Mohd
2017-01-26
The chloroplastic trnL intron and the nuclear internal transcribed spacer (ITS) region were sequenced for 11 Nepenthes species recorded in Peninsular Malaysia to examine their phylogenetic relationship and to evaluate the usage of trnL intron and ITS sequences for phylogenetic reconstruction of this genus. Phylogeny reconstruction was carried out using neighbor-joining, maximum parsimony and Bayesian analyses. All the trees revealed two major clusters, a lowland group consisting of N. ampullaria, N. mirabilis, N. gracilis and N. rafflesiana, and another containing both intermediately distributed species (N. albomarginata and N. benstonei) and four highland species (N. sanguinea, N. macfarlanei, N. ramispina and N. alba). The trnL intron and ITS sequences provided phylogenetically informative characters for deriving a phylogeny of Nepenthes species in Peninsular Malaysia. To our knowledge, this is the first molecular phylogenetic study of Nepenthes species occurring along an altitudinal gradient in Peninsular Malaysia.
2012-01-01
Background Gene duplication and the subsequent divergence in function of the resulting paralogs via subfunctionalization and/or neofunctionalization is hypothesized to have played a major role in the evolution of plant form. The LEAFY HULL STERILE1 (LHS1) SEPALLATA (SEP) genes have been linked with the origin and diversification of the grass spikelet, but it is uncertain 1) when the duplication event that produced the LHS1 clade and its paralogous lineage Oryza sativa MADS5 (OSM5) occurred, and 2) how changes in gene structure and/or expression might have contributed to subfunctionalization and/or neofunctionalization in the two lineages. Methods Phylogenetic relationships among 84 SEP genes were estimated using Bayesian methods. RNA expression patterns were inferred using in situ hybridization. The patterns of protein sequence and RNA expression evolution were reconstructed using maximum parsimony (MP) and maximum likelihood (ML) methods, respectively. Results Phylogenetic analyses mapped the LHS1/OSM5 duplication event to the base of the grass family. MP character reconstructions estimated a change from cytosine to thymine in the first codon position of the first amino acid after the Zea mays MADS3 (ZMM3) domain converted a glutamine to a stop codon in the OSM5 ancestor following the LHS1/OSM5 duplication event. RNA expression analyses of OSM5 co-orthologs in Avena sativa, Chasmanthium latifolium, Hordeum vulgare, Pennisetum glaucum, and Sorghum bicolor followed by ML reconstructions of these data and previously published analyses estimated a complex pattern of gain and loss of LHS1 and OSM5 expression in different floral organs and different flowers within the spikelet or inflorescence. Conclusions Previous authors have reported that rice OSM5 and LHS1 proteins have different interaction partners indicating that the truncation of OSM5 following the LHS1/OSM5 duplication event has resulted in both partitioned and potentially novel gene functions. 
The complex pattern of OSM5 and LHS1 expression evolution is not consistent with a simple subfunctionalization model following the gene duplication event, but there is evidence of recent partitioning of OSM5 and LHS1 expression within different floral organs of A. sativa, C. latifolium, P. glaucum and S. bicolor, and between the upper and lower florets of the two-flowered maize spikelet. PMID:22340849
Verdam, Mathilde G. E.; Oort, Frans J.
2014-01-01
Highlights Application of Kronecker product to construct parsimonious structural equation models for multivariate longitudinal data. A method for the investigation of measurement bias with Kronecker product restricted models. Application of these methods to health-related quality of life data from bone metastasis patients, collected at 13 consecutive measurement occasions. The use of curves to facilitate substantive interpretation of apparent measurement bias. Assessment of change in common factor means, after accounting for apparent measurement bias. Longitudinal measurement invariance is usually investigated with a longitudinal factor model (LFM). However, with multiple measurement occasions, the number of parameters to be estimated increases with a multiple of the number of measurement occasions. To guard against too low ratios of numbers of subjects and numbers of parameters, we can use Kronecker product restrictions to model the multivariate longitudinal structure of the data. These restrictions can be imposed on all parameter matrices, including measurement invariance restrictions on factor loadings and intercepts. The resulting models are parsimonious and have attractive interpretation, but require different methods for the investigation of measurement bias. Specifically, additional parameter matrices are introduced to accommodate possible violations of measurement invariance. These additional matrices consist of measurement bias parameters that are either fixed at zero or free to be estimated. In cases of measurement bias, it is also possible to model the bias over time, e.g., with linear or non-linear curves. Measurement bias detection with Kronecker product restricted models will be illustrated with multivariate longitudinal data from 682 bone metastasis patients whose health-related quality of life (HRQL) was measured at 13 consecutive weeks. PMID:25295016
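The parameter saving described in the abstract above can be illustrated with a generic sketch. It is not the authors' model: the choice of 4 observed variables is hypothetical (only the 13 occasions come from the abstract), the helper names are made up for illustration, and identification constraints (such as fixing one element for scale) are ignored. It builds a Kronecker product in plain Python and compares the number of free elements in an unrestricted symmetric matrix with the number in a Kronecker-structured one.

```python
def kron(a, b):
    """Kronecker product of two matrices given as lists of lists."""
    return [
        [a_ij * b_kl for a_ij in a_row for b_kl in b_row]
        for a_row in a
        for b_row in b
    ]


def n_free_symmetric(n):
    """Number of free elements in an n x n symmetric matrix."""
    return n * (n + 1) // 2


occasions = 13   # from the abstract
variables = 4    # hypothetical value, chosen only for illustration

# Unrestricted symmetric matrix over all occasion-by-variable combinations.
full_count = n_free_symmetric(occasions * variables)

# Kronecker structure: one occasion-level matrix and one variable-level matrix.
restricted_count = n_free_symmetric(occasions) + n_free_symmetric(variables)

print(full_count, restricted_count)
```

Under these assumptions the restricted form needs far fewer free elements, which is the sense in which such models are described as parsimonious.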
Soft-tissue anatomy of the extant hominoids: a review and phylogenetic analysis.
Gibbs, S; Collard, M; Wood, B
2002-01-01
This paper reports the results of a literature search for information about the soft-tissue anatomy of the extant non-human hominoid genera, Pan, Gorilla, Pongo and Hylobates, together with the results of a phylogenetic analysis of these data plus comparable data for Homo. Information on the four extant non-human hominoid genera was located for 240 out of the 1783 soft-tissue structures listed in the Nomina Anatomica. Numerically these data are biased so that information about some systems (e.g. muscles) and some regions (e.g. the forelimb) are over-represented, whereas other systems and regions (e.g. the veins and the lymphatics of the vascular system, the head region) are either under-represented or not represented at all. Screening to ensure that the data were suitable for use in a phylogenetic analysis reduced the number of eligible soft-tissue structures to 171. These data, together with comparable data for modern humans, were converted into discontinuous character states suitable for phylogenetic analysis and then used to construct a taxon-by-character matrix. This matrix was used in two tests of the hypothesis that soft-tissue characters can be relied upon to reconstruct hominoid phylogenetic relationships. In the first, parsimony analysis was used to identify cladograms requiring the smallest number of character state changes. In the second, the phylogenetic bootstrap was used to determine the confidence intervals of the most parsimonious clades. The parsimony analysis yielded a single most parsimonious cladogram that matched the molecular cladogram. Similarly the bootstrap analysis yielded clades that were compatible with the molecular cladogram; a (Homo, Pan) clade was supported by 95% of the replicates, and a (Gorilla, Pan, Homo) clade by 96%. These are the first hominoid morphological data to provide statistically significant support for the clades favoured by the molecular evidence.
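The parsimony criterion mentioned above, choosing the cladogram that requires the fewest character state changes, can be sketched with Fitch's algorithm for a single unordered character on a rooted binary tree. This is a minimal illustration and not the analysis used in the paper: the function name, the nested-tuple tree encoding, and the binary character states below are all invented for the example.

```python
def fitch_score(tree, states):
    """Return (possible_root_states, minimum_changes) for one character.

    tree   -- a leaf name (str) or a pair (left_subtree, right_subtree)
    states -- dict mapping each leaf name to its observed character state
    """
    if isinstance(tree, str):
        return {states[tree]}, 0
    left_set, left_cost = fitch_score(tree[0], states)
    right_set, right_cost = fitch_score(tree[1], states)
    common = left_set & right_set
    if common:
        # Children agree on at least one state: no extra change needed here.
        return common, left_cost + right_cost
    # Children disagree: one change is required on this part of the tree.
    return left_set | right_set, left_cost + right_cost + 1


# Made-up binary character, for illustration only.
states = {"Homo": 1, "Pan": 1, "Gorilla": 0, "Pongo": 0, "Hylobates": 0}

tree_a = (((("Homo", "Pan"), "Gorilla"), "Pongo"), "Hylobates")
tree_b = (((("Homo", "Gorilla"), "Pan"), "Pongo"), "Hylobates")

changes_a = fitch_score(tree_a, states)[1]
changes_b = fitch_score(tree_b, states)[1]
print(changes_a, changes_b)
```

For this invented character the first tree needs one change and the second needs two, so parsimony would prefer the first. A full analysis sums such counts over all characters and searches over candidate trees.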
Mitogenomic analysis of the genus Panthera.
Wei, Lei; Wu, Xiaobing; Zhu, Lixin; Jiang, Zhigang
2011-10-01
The complete sequences of the mitochondrial DNA genomes of Panthera tigris, Panthera pardus, and Panthera uncia were determined using the polymerase chain reaction method. The lengths of the complete mitochondrial DNA sequences of the three species were 16990, 16964, and 16773 bp, respectively. Each of the three mitochondrial DNA genomes included 13 protein-coding genes, 22 tRNA, two rRNA, one O(L)R, and one control region. The structures of the genomes were highly similar to those of Felis catus, Acinonyx jubatus, and Neofelis nebulosa. The phylogenies of the genus Panthera were inferred from two combined mitochondrial sequence data sets and the complete mitochondrial genome sequences, by MP (maximum parsimony), ML (maximum likelihood), and Bayesian analysis. The results showed that Panthera was composed of Panthera leo, P. uncia, P. pardus, Panthera onca, P. tigris, and N. nebulosa, which was included as the most basal member. The phylogeny within Panthera genus was N. nebulosa (P. tigris (P. onca (P. pardus, (P. leo, P. uncia)))). The divergence times for Panthera genus were estimated based on the ML branch lengths and four well-established calibration points. The results showed that at about 11.3 MYA, the Panthera genus separated from other felid species and then evolved into the several species of the genus. In detail, N. nebulosa was estimated to be founded about 8.66 MYA, P. tigris about 6.55 MYA, P. uncia about 4.63 MYA, and P. pardus about 4.35 MYA. All these estimated times were older than those estimated from the fossil records. The divergence event, evolutionary process, speciation, and distribution pattern of P. uncia, a species endemic to the central Asia with core habitats on the Qinghai-Tibetan Plateau and surrounding highlands, mostly correlated with the geological tectonic events and intensive climate shifts that happened at 8, 3.6, 2.5, and 1.7 MYA on the plateau during the late Cenozoic period.
Li, Pui-Sze; Thomas, Daniel C.; Saunders, Richard M. K.
2015-01-01
Taxonomic delimitation of Disepalum (Annonaceae) is contentious, with some researchers favoring a narrow circumscription following segregation of the genus Enicosanthellum. We reconstruct the phylogeny of Disepalum and related taxa based on four chloroplast and two nuclear DNA regions as a framework for clarifying taxonomic delimitation and assessing evolutionary transitions in key morphological characters. Maximum parsimony, maximum likelihood and Bayesian methods resulted in a consistent, well-resolved and strongly supported topology. Disepalum s.l. is monophyletic and strongly supported, with Disepalum s.str. and Enicosanthellum retrieved as sister groups. Although this topology is consistent with both taxonomic delimitations, the distribution of morphological synapomorphies provides greater support for the inclusion of Enicosanthellum within Disepalum s.l. We propose a novel infrageneric classification with two subgenera. Subgen. Disepalum (= Disepalum s.str.) is supported by numerous synapomorphies, including the reduction of the calyx to two sepals and connation of petals. Subgen. Enicosanthellum lacks obvious morphological synapomorphies, but possesses several diagnostic characters (symplesiomorphies), including a trimerous calyx and free petals in two whorls. We evaluate changes in petal morphology in relation to hypotheses of the genetic control of floral development and suggest that the compression of two petal whorls into one and the associated fusion of contiguous petals may be associated with the loss of the pollination chamber, which in turn may be associated with a shift in primary pollinator. We also suggest that the formation of pollen octads may be selectively advantageous when pollinator visits are infrequent, although this would only be applicable if multiple ovules could be fertilized by each octad; since the flowers are apocarpous, this would require an extragynoecial compitum to enable intercarpellary growth of pollen tubes. 
We furthermore infer that the monocarp fruit stalks are likely to have evolved independently from those in other Annonaceae genera and may facilitate effective dispersal by providing a color contrast within the fruit. PMID:26630651
Zhang, Jinju; Li, Zuozhou; Fritsch, Peter W; Tian, Hua; Yang, Aihong; Yao, Xiaohong
2015-10-01
The phylogeography of plant species in sub-tropical China remains largely unclear. This study used Tapiscia sinensis, an endemic and endangered tree species widely but disjunctly distributed in sub-tropical China, as a model to reveal the patterns of genetic diversity and phylogeographical history of Tertiary relict plant species in this region. The implications of the results are discussed in relation to its conservation management. Samples were taken from 24 populations covering the natural geographical distribution of T. sinensis. Genetic structure was investigated by analysis of molecular variance (AMOVA) and spatial analysis of molecular variance (SAMOVA). Phylogenetic relationships among haplotypes were constructed with maximum parsimony and haplotype network methods. Historical population expansion events were tested with pairwise mismatch distribution analysis and neutrality tests. Species potential range was deduced by ecological niche modelling (ENM). A low level of genetic diversity was detected at the population level. A high level of genetic differentiation and a significant phylogeographical structure were revealed. The mean divergence time of the haplotypes was approx. 1·33 million years ago. Recent range expansion in this species is suggested by a star-like haplotype network and by the results from the mismatch distribution analysis and neutrality tests. Climatic oscillations during the Pleistocene have had pronounced effects on the extant distribution of Tapiscia relative to the Last Glacial Maximum (LGM). Spatial patterns of molecular variation and ENM suggest that T. sinensis may have retreated in south-western and central China and colonized eastern China prior to the LGM. Multiple montane refugia for T. sinensis existing during the LGM are inferred in central and western China. The populations adjacent to or within these refugia of T. sinensis should be given high priority in the development of conservation policies and management strategies for this endangered species. © The Author 2015. Published by Oxford University Press on behalf of the Annals of Botany Company. All rights reserved.
Appelhans, M S; Smets, E; Razafimandimbison, S G; Haevermans, T; van Marle, E J; Couloux, A; Rabarison, H; Randrianarivelojosia, M; Kessler, P J A
2011-06-01
The Spathelia-Ptaeroxylon clade is a group of morphologically diverse plants that have been classified together as a result of molecular phylogenetic studies. The clade is currently included in Rutaceae and recognized at a subfamilial level (Spathelioideae) despite the fact that most of its genera have traditionally been associated with other families and that there are no obvious morphological synapomorphies for the clade. The aim of the present study is to construct phylogenetic trees for the Spathelia-Ptaeroxylon clade and to investigate anatomical characters in order to decide whether it should be kept in Rutaceae or recognized at the familial level. Anatomical characters were plotted on a cladogram to help explain character evolution within the group. Moreover, phylogenetic relationships and generic limits within the clade are also addressed. A species-level phylogenetic analysis of the Spathelia-Ptaeroxylon clade based on five plastid DNA regions (rbcL, atpB, trnL-trnF, rps16 and psbA-trnH) was conducted using Bayesian, maximum parsimony and maximum likelihood methods. Leaf and seed anatomical characters of all genera were (re)investigated by light and scanning electron microscopy. With the exception of Spathelia, all genera of the Spathelia-Ptaeroxylon clade are monophyletic. The typical leaf and seed anatomical characters of Rutaceae were found. Further, the presence of oil cells in the leaves provides a possible synapomorphy for the clade. The Spathelia-Ptaeroxylon clade is well placed in Rutaceae and it is reasonable to unite the genera into one subfamily (Spathelioideae). We propose a new tribal classification of Spathelioideae. A narrow circumscription of Spathelia is established to make the genus monophyletic, and Sohnreyia is resurrected to accommodate the South American species of Spathelia. The most recent common ancestor of Spathelioideae probably had leaves with secretory cavities and oil cells, haplostemonous flowers with appendaged staminal filaments, and a tracheidal tegmen.
Koehler, Samantha; Cabral, Juliano S.; Whitten, W. Mark; Williams, Norris H.; Singer, Rodrigo B.; Neubig, Kurt M.; Guerra, Marcelo; Souza, Anete P.; Amaral, Maria do Carmo E.
2008-01-01
Background and Aims Species' boundaries applied within Christensonella have varied due to the continuous pattern of variation and mosaic distribution of diagnostic characters. The main goals of this study were to revise the species' delimitation and propose a more stable classification for this genus. In order to achieve these aims phylogenetic relationships were inferred using DNA sequence data and cytological diversity within Christensonella was examined based on chromosome counts and heterochromatin patterns. The results presented describe sets of diagnostic morphological characters that can be used for species' identification. Methods Phylogenetic studies were based on sequence data of nuclear and plastid regions, analysed using maximum parsimony and maximum likelihood criteria. Cytogenetic observations of mitotic cells were conducted using CMA and DAPI fluorochromes. Key Results Six of 21 currently accepted species were recovered. The results also support recognition of the ‘C. pumila’ clade as a single species. Molecular phylogenetic relationships within the ‘C. acicularis–C. madida’ and ‘C. ferdinandiana–C. neowiedii’ species' complexes were not resolved and require further study. Deeper relationships were incongruent between plastid and nuclear trees, but with no strong bootstrap support for either, except for the position of C. vernicosa. Cytogenetic data indicated chromosome numbers of 2n = 36, 38 and 76, and with substantial variation in the presence and location of CMA/DAPI heterochromatin bands. Conclusions The recognition of ten species of Christensonella is proposed according to the molecular and cytogenetic patterns observed. In addition, diagnostic morphological characters are presented for each recognized species. Banding patterns and chromosome counts suggest the occurrence of centric fusion/fission events, especially for C. ferdinandiana. The results suggest that 2n = 36 karyotypes evolved from 2n = 38 through descendent dysploidy. 
Patterns of heterochromatin distribution and other karyotypic data proved to be a valuable source of information to understand evolutionary patterns within Maxillariinae orchids. PMID:18687799
Sato, Mitsuharu; Miyazaki, Kentaro
2017-01-01
Horizontal gene transfer (HGT) is a ubiquitous genetic event in bacterial evolution, but it seldom occurs for genes involved in highly complex supramolecules (or biosystems), which consist of many gene products. The ribosome is one such supramolecule, but several bacteria harbor dissimilar and/or chimeric 16S rRNAs in their genomes, suggesting the occurrence of HGT of this gene. However, we know little about whether the genes actually experience HGT and, if so, the frequency of such a transfer. This is primarily because the methods currently employed for phylogenetic analysis (e.g., neighbor-joining, maximum likelihood, and maximum parsimony) of 16S rRNA genes assume point mutation-driven tree-shape evolution as an evolutionary model, which is intrinsically inappropriate to decipher the evolutionary history for genes driven by recombination. To address this issue, we applied a phylogenetic network analysis, which has been used previously for detection of genetic recombination in homologous alleles, to the 16S rRNA gene. We focused on the genus Enterobacter, whose phylogenetic relationships inferred by multi-locus sequence alignment analysis and 16S rRNA sequences are incompatible. All 10 complete genomic sequences were retrieved from the NCBI database, in which 71 16S rRNA genes were included. Neighbor-joining analysis demonstrated that the genes residing in the same genomes clustered, indicating the occurrence of intragenomic recombination. However, as suggested by the low bootstrap values, evolutionary relationships between the clusters were uncertain. We then applied phylogenetic network analysis to representative sequences from each cluster. We found three ancestral 16S rRNA groups; the others were likely created through recursive recombination between the ancestors and chimeric descendants. Despite the large sequence changes caused by the recombination events, the RNA secondary structures were conserved. 
Successive intergenomic and intragenomic recombination thus shaped the evolution of 16S rRNA genes in the genus Enterobacter. PMID:29180992
The complexity of selection at the major primate β-defensin locus
Semple, Colin AM; Maxwell, Alison; Gautier, Philippe; Kilanowski, Fiona M; Eastwood, Hayden; Barran, Perdita E; Dorin, Julia R
2005-01-01
Background We have examined the evolution of the genes at the major human β-defensin locus and the orthologous loci in a range of other primates and mouse. For the first time these data allow us to examine selective episodes in the more recent evolutionary history of this locus as well as the ancient past. We have used a combination of maximum likelihood based tests and a maximum parsimony based sliding window approach to give a detailed view of the varying modes of selection operating at this locus. Results We provide evidence for strong positive selection soon after the duplication of these genes within an ancestral mammalian genome. Consequently variable selective pressures have acted on β-defensin genes in different evolutionary lineages, with episodes both of negative, and more rarely positive selection, during the divergence of primates. Positive selection appears to have been more common in the rodent lineage, accompanying the birth of novel, rodent-specific β-defensin genes. These observations allow a fuller understanding of the evolution of mammalian innate immunity. In both the rodent and primate lineages, sites in the second exon have been subject to positive selection and by implication are important in functional diversity. A small number of sites in the mature human peptides were found to have undergone repeated episodes of selection in different primate lineages. Particular sites were consistently implicated by multiple methods at positions throughout the mature peptides. These sites are clustered at positions predicted to be important for the specificity of the antimicrobial or chemoattractant properties of β-defensins. Surprisingly, sites within the prepropeptide region were also implicated as being subject to significant positive selection, suggesting previously unappreciated functional significance for this region. 
Conclusions Identification of these putatively functional sites has important implications for our understanding of β-defensin function and for novel antibiotic design. PMID:15904491
2014-01-01
Background Limited available sequence information has greatly impeded population genetics, phylogenetics and systematics studies in the subclass Acari (mites and ticks). Mitochondrial (mt) DNA is well known to provide genetic markers for investigations in these areas, but complete mt genomic data have been lacking for many Acari species. Herein, we present the complete mt genome of the scab mite Psoroptes cuniculi. Methods P. cuniculi was collected from a naturally infected New Zealand white rabbit from China and identified by morphological criteria. The complete mt genome of P. cuniculi was amplified by PCR and then sequenced. The relationships of this scab mite with selected members of the Acari were assessed by phylogenetic analysis of concatenated amino acid sequence datasets by Bayesian inference (BI), maximum likelihood (ML) and maximum parsimony (MP). Results This mt genome (14,247 bp) is circular and consists of 37 genes, including 13 genes for proteins, 22 genes for tRNA and 2 genes for rRNA. The gene arrangement in the mt genome of P. cuniculi is the same as those of Dermatophagoides farinae (Pyroglyphidae) and Aleuroglyphus ovatus (Acaridae), but distinct from those of Steganacarus magnus (Steganacaridae) and Panonychus citri (Tetranychidae). Phylogenetic analyses using concatenated amino acid sequences of 12 protein-coding genes, with three different computational algorithms (BI, ML and MP), showed the division of subclass Acari into two superorders and supported the monophyly of both superorders (Parasitiformes and Acariformes) and of the three orders Ixodida, Mesostigmata and Astigmata, but rejected the monophyly of the order Prostigmata. Conclusions The mt genome of P. cuniculi represents the first mt genome of any member of the family Psoroptidae. Analysis of mt genome sequences in the present study has provided new insights into the phylogenetic relationships among several major lineages of Acari species. PMID:25052180
Augmenting epidemiological models with point-of-care diagnostics data
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pullum, Laura L.; Ramanathan, Arvind; Nutaro, James J.
2016-04-20
Although adoption of newer Point-of-Care (POC) diagnostics is increasing, there is a significant challenge using POC diagnostics data to improve epidemiological models. In this work, we propose a method to process zip-code level POC datasets and apply these processed data to calibrate an epidemiological model. We specifically develop a calibration algorithm using simulated annealing and calibrate a parsimonious equation-based model of modified Susceptible-Infected-Recovered (SIR) dynamics. The results show that parsimonious models are remarkably effective in predicting the dynamics observed in the number of infected patients and our calibration algorithm is sufficiently capable of predicting peak loads observed in POC diagnostics data while staying within reasonable and empirical parameter ranges reported in the literature. Additionally, we explore the future use of the calibrated values by testing the correlation between peak load and population density from Census data. Our results show that linearity assumptions for the relationships among various factors can be misleading, therefore further data sources and analysis are needed to identify relationships between additional parameters and existing calibrated ones. As a result, calibration approaches such as ours can determine the values of newly added parameters along with existing ones and enable policy-makers to make better multi-scale decisions.
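The calibration idea in the abstract above (fitting an SIR-type model to observed infection counts by simulated annealing) can be illustrated with a minimal sketch. Everything here is an assumption for illustration only: the basic (unmodified) SIR equations, forward-Euler integration, a sum-of-squares cost, a linear cooling schedule, and the hypothetical names `simulate_sir`, `sse` and `anneal`. It is not the authors' model or algorithm.

```python
import math
import random


def simulate_sir(beta, gamma, s0, i0, r0, days, dt=0.1):
    """Integrate the basic SIR equations with forward Euler.

    Returns the number of infected individuals at day 0, 1, ..., days.
    """
    n = s0 + i0 + r0
    s, i, r = float(s0), float(i0), float(r0)
    steps_per_day = int(round(1 / dt))
    infected = [i]
    for _ in range(days):
        for _ in range(steps_per_day):
            new_inf = beta * s * i / n * dt   # S -> I transitions in this step
            new_rec = gamma * i * dt          # I -> R transitions in this step
            s -= new_inf
            i += new_inf - new_rec
            r += new_rec
        infected.append(i)
    return infected


def sse(params, observed, s0, i0, r0):
    """Sum of squared errors between modelled and observed infected counts."""
    beta, gamma = params
    model = simulate_sir(beta, gamma, s0, i0, r0, len(observed) - 1)
    return sum((m - o) ** 2 for m, o in zip(model, observed))


def anneal(observed, s0, i0, r0, start=(0.5, 0.5),
           bounds=((0.01, 2.0), (0.01, 1.0)), iters=2000, t0=1.0, seed=0):
    """Simulated annealing over (beta, gamma); bounds and schedule are illustrative."""
    rng = random.Random(seed)
    current = list(start)
    cur_cost = sse(current, observed, s0, i0, r0)
    best, best_cost = list(current), cur_cost
    for k in range(iters):
        temp = t0 * (1 - k / iters) + 1e-9    # linear cooling, never exactly zero
        # Propose a Gaussian perturbation, clipped to the parameter bounds.
        cand = [min(max(p + rng.gauss(0, 0.05), lo), hi)
                for p, (lo, hi) in zip(current, bounds)]
        cost = sse(cand, observed, s0, i0, r0)
        # Metropolis rule on the relative cost change: always accept improvements,
        # accept worse candidates with a probability that shrinks as temp falls.
        scale = max(cur_cost, 1e-12)
        if cost < cur_cost or rng.random() < math.exp((cur_cost - cost) / (temp * scale)):
            current, cur_cost = cand, cost
            if cost < best_cost:
                best, best_cost = list(cand), cost
    return best, best_cost


if __name__ == "__main__":
    # Synthetic "observations" generated from known parameters, then re-fitted.
    data = simulate_sir(0.4, 0.1, 990, 10, 0, 40)
    params, cost = anneal(data, 990, 10, 0)
    print("fitted beta, gamma:", params, "cost:", cost)
```

Fitting synthetic data generated from known parameters is a convenient sanity check before applying such a routine to real counts.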
Wilkerson, Richard C; Linton, Yvonne-Marie; Fonseca, Dina M; Schultz, Ted R; Price, Dana C; Strickman, Daniel A
2015-01-01
The tribe Aedini (Family Culicidae) contains approximately one-quarter of the known species of mosquitoes, including vectors of deadly or debilitating disease agents. This tribe contains the genus Aedes, which is one of the three most familiar genera of mosquitoes. During the past decade, Aedini has been the focus of a series of extensive morphology-based phylogenetic studies published by Reinert, Harbach, and Kitching (RH&K). Those authors created 74 new, elevated or resurrected genera from what had been the single genus Aedes, almost tripling the number of genera in the entire family Culicidae. The proposed classification is based on subjective assessments of the "number and nature of the characters that support the branches" subtending particular monophyletic groups in the results of cladistic analyses of a large set of morphological characters of representative species. To gauge the stability of RH&K's generic groupings we reanalyzed their data with unweighted parsimony jackknife and maximum-parsimony analyses, with and without ordering 14 of the characters as in RH&K. We found that their phylogeny was largely weakly supported and their taxonomic rankings failed priority and other useful taxon-naming criteria. Consequently, we propose simplified aedine generic designations that 1) restore a classification system that is useful for the operational community; 2) enhance the ability of taxonomists to accurately place new species into genera; 3) maintain the progress toward a natural classification based on monophyletic groups of species; and 4) correct the current classification system that is subject to instability as new species are described and existing species more thoroughly defined. We do not challenge the phylogenetic hypotheses generated by the above-mentioned series of morphological studies. 
However, we reduce the ranks of the genera and subgenera of RH&K to subgenera or informal species groups, respectively, to preserve stability as new data become available. PMID:26226613
Freudenstein, John V; Chase, Mark W
2015-03-01
The largest subfamily of orchids, Epidendroideae, represents one of the most significant diversifications among flowering plants in terms of pollination strategy, vegetative adaptation and number of species. Although many groups in the subfamily have been resolved, significant relationships in the tree remain unclear, limiting conclusions about diversification and creating uncertainty in the classification. This study brings together DNA sequences from nuclear, plastid and mitochondrial genomes in order to clarify relationships, to test associations of key characters with diversification and to improve the classification. Sequences from seven loci were concatenated in a supermatrix analysis for 312 genera representing most of epidendroid diversity. Maximum-likelihood and parsimony analyses were performed on this matrix and on subsets of the data to generate trees and to investigate the effect of missing values. Statistical character-associated diversification analyses were performed. Likelihood and parsimony analyses yielded highly resolved trees that are in strong agreement and show significant support for many key clades. Many previously proposed relationships among tribes and subtribes are supported, and some new relationships are revealed. Analyses of subsets of the data suggest that the relatively high amount of missing data for the full analysis is not problematic. Diversification analyses show that epiphytism is most strongly associated with diversification among epidendroids, followed by expansion into the New World and anther characters that are involved with pollinator specificity, namely early anther inflexion, cellular pollinium stalks and the superposed pollinium arrangement. All tested characters show significant association with speciation in Epidendroideae, suggesting that no single character accounts for the success of this group. 
Rather, it appears that a succession of key features appeared that have contributed to diversification, sometimes in parallel. © The Author 2015. Published by Oxford University Press on behalf of the Annals of Botany Company. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Regression Models for the Analysis of Longitudinal Gaussian Data from Multiple Sources
O’Brien, Liam M.; Fitzmaurice, Garrett M.
2006-01-01
We present a regression model for the joint analysis of longitudinal multiple source Gaussian data. Longitudinal multiple source data arise when repeated measurements are taken from two or more sources, and each source provides a measure of the same underlying variable and on the same scale. This type of data generally produces a relatively large number of observations per subject; thus estimation of an unstructured covariance matrix often may not be possible. We consider two methods by which parsimonious models for the covariance can be obtained for longitudinal multiple source data. The methods are illustrated with an example of multiple informant data arising from a longitudinal interventional trial in psychiatry. PMID:15726666
Kang, Seokha; Sultana, Tahera; Eom, Keeseon S; Park, Yung Chul; Soonthornpong, Nathan; Nadler, Steven A; Park, Joong-Ki
2009-01-15
The complete mitochondrial genome sequence was determined for the human pinworm Enterobius vermicularis (Oxyurida: Nematoda) and used to infer its phylogenetic relationship to other major groups of chromadorean nematodes. The E. vermicularis genome is a 14,010-bp circular DNA molecule that encodes 36 genes (12 proteins, 22 tRNAs, and 2 rRNAs). This mtDNA genome lacks atp8, as reported for almost all other nematode species investigated. Phylogenetic analyses (maximum parsimony, maximum likelihood, neighbor joining, and Bayesian inference) of nucleotide sequences for the 12 protein-coding genes of 25 nematode species placed E. vermicularis, a representative of the order Oxyurida, as sister to the main Ascaridida+Rhabditida group. Tree topology comparisons using statistical tests rejected an alternative hypothesis favoring a closer relationship among Ascaridida, Spirurida, and Oxyurida, which has been supported by most studies based on nuclear ribosomal DNA sequences. Unlike the relatively conserved gene arrangement found for most chromadorean taxa, the E. vermicularis mtDNA gene order is unique, sharing no similarity with that of any other nematode species reported to date. This lack of gene order similarity may represent idiosyncratic gene rearrangements unique to this specific lineage of the oxyurids. To more fully understand the extent of gene rearrangement and its evolutionary significance within the nematode phylogenetic framework, additional mitochondrial genomes representing a greater evolutionary diversity of species must be characterized.
Wu, Zeng-Yuan; Milne, Richard I.; Chen, Chia-Jui; Liu, Jie; Wang, Hong; Li, De-Zhu
2015-01-01
Urticaceae is a family with more than 2000 species, which contains remarkable morphological diversity. It has undergone many taxonomic reorganizations, and is currently the subject of further systematic studies. To gain more resolution in systematic studies and to better understand the general patterns of character evolution in Urticaceae, based on our previous phylogeny including 169 accessions comprising 122 species across 47 Urticaceae genera, we examined 19 diagnostic characters, and analysed these employing both maximum-parsimony and maximum-likelihood approaches. Our results revealed that 16 characters exhibited multiple state changes within the family, with ten exhibiting more than eight changes and three exhibiting between 28 and 40. Morphological synapomorphies were identified for many clades, but the diagnostic value of these was often limited due to reversals within the clade and/or homoplasies elsewhere. Recognition of the four clades comprising the family at subfamily level can be supported by a small number of carefully chosen defining traits for each. Several non-monophyletic genera appear to be defined only by characters that are plesiomorphic within their clades, and more detailed work would be valuable to find defining traits for monophyletic clades within these. Some character evolution may be attributed to adaptive evolution in Urticaceae due to shifts in habitat or vegetation type. This study demonstrated the value of using phylogeny to trace character evolution, and determine the relative importance of morphological traits for classification. PMID:26529598
Molecular systematics of the Middle American genus Hypopachus (Anura: Microhylidae)
Greenbaum, Eli; Smith, Eric N.; de Sá, Rafael O.
2011-01-01
We present the first phylogenetic study on the widespread Middle American microhylid frog genus Hypopachus. Partial sequences of mitochondrial (12S and 16S ribosomal RNA) and nuclear (rhodopsin) genes (1275 bp total) were analyzed from 43 samples of Hypopachus, three currently recognized species of Gastrophryne, and seven arthroleptid, brevicipitid and microhylid outgroup taxa. Maximum parsimony (PAUP), maximum likelihood (RAxML) and Bayesian inference (MrBayes) optimality criteria were used for phylogenetic analyses, and BEAST was used to estimate divergence dates of major clades. Population-level analyses were conducted with the programs NETWORK and Arlequin. Results confirm the placement of Hypopachus and Gastrophryne as sister taxa, but the latter genus was strongly supported as paraphyletic. The African phrynomerine genus Phrynomantis was recovered as the sister taxon to a monophyletic Chiasmocleis, rendering our well-supported clade of gastrophrynines paraphyletic. Hypopachus barberi was supported as a disjunctly distributed highland species, and we recovered a basal split in lowland populations of Hypopachus variolosus from the Pacific versant of Mexico and elsewhere in the Mesoamerican lowlands. Dating analyses from BEAST estimate speciation within the genus Hypopachus occurred in the late Miocene/early Pliocene for most clades. Previous studies have not found bioacoustic or morphological differences among these lowland clades, and our molecular data support the continued recognition of two species in the genus Hypopachus. PMID:21798357
Ding, Hui-Hui; Chao, Yi-Shan; Callado, John Rey; Dong, Shi-Yong
2014-11-01
In this study we provide a phylogeny for the pantropical fern genus Tectaria, with emphasis on the Old World species, based on sequences of five plastid regions (atpB, ndhF plus ndhF-trnL, rbcL, rps16-matK plus matK, and trnL-F). Maximum parsimony, maximum likelihood, and Bayesian inference are used to analyze 115 individuals, representing ca. 56 species of Tectaria s.l. and 36 species of ten related genera. The results strongly support the monophyly of Tectaria in a broad sense, in which Ctenitopsis, Hemigramma, Heterogonium, Psomiocarpa, Quercifilix, Stenosemia, and Tectaridium should be submerged. Such a broadly circumscribed Tectaria is supported by the arising pattern of veinlets and the base chromosome number (x=40). Four primary clades are well resolved within Tectaria, one from the Neotropics (T. trifoliata clade) and three from the Old World (T. subtriphylla clade, Ctenitopsis clade, and T. crenata clade). The Tectaria crenata clade is the largest, including six subclades. Of the genera previously recognized as tectarioid ferns, Ctenitis, Lastreopsis, and Pleocnemia are confirmed to be members of Dryopteridaceae, while Pteridrys and Triplophyllum are supported in Tectariaceae. To infer morphological evolution, 13 commonly used characters are optimized on the resulting phylogenetic trees and, as a result, all are found to be homoplastic in Tectaria. Copyright © 2014 Elsevier Inc. All rights reserved.
Schwartz, Carolyn E; Patrick, Donald L
2014-07-01
When planning a comparative effectiveness study comparing disease-modifying treatments, competing demands influence choice of outcomes. Current practice emphasizes parsimony, although understanding multidimensional treatment impact can help to personalize medical decision-making. We discuss both sides of this 'tug of war'. We discuss the assumptions, advantages and drawbacks of composite scores and multidimensional outcomes. We describe possible solutions to the multiple comparison problem, including conceptual hierarchy distinctions, statistical approaches, 'real-world' benchmarks of effectiveness and subgroup analysis. We conclude that comparative effectiveness research should consider multiple outcome dimensions and compare different approaches that fit the individual context of study objectives.
The systematic component of phylogenetic error as a function of taxonomic sampling under parsimony.
Debry, Ronald W
2005-06-01
The effect of taxonomic sampling on phylogenetic accuracy under parsimony is examined by simulating nucleotide sequence evolution. Random error is minimized by using very large numbers of simulated characters. This allows estimation of the consistency behavior of parsimony, even for trees with up to 100 taxa. Data were simulated on 8 distinct 100-taxon model trees and analyzed as stratified subsets containing either 25 or 50 taxa, in addition to the full 100-taxon data set. Overall accuracy decreased in a majority of cases when taxa were added. However, the magnitude of change in the cases in which accuracy increased was larger than the magnitude of change in the cases in which accuracy decreased, so, on average, overall accuracy increased as more taxa were included. A stratified sampling scheme was used to assess accuracy for an initial subsample of 25 taxa. The 25-taxon analyses were compared to 50- and 100-taxon analyses that were pruned to include only the original 25 taxa. On average, accuracy for the 25 taxa was improved by taxon addition, but there was considerable variation in the degree of improvement among the model trees and across different rates of substitution.
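Several of the abstracts in this listing rely on maximum parsimony. As a minimal sketch of what scoring a tree under parsimony involves, the following implements Fitch's algorithm for a single unordered character on a rooted binary tree. The nested-tuple tree encoding, the function name `fitch_score` and the four-taxon example are assumptions made for illustration; they are not taken from any of the studies listed here.

```python
def fitch_score(tree, states):
    """Fitch's algorithm for one unordered character on a rooted binary tree.

    tree   : a leaf name (str) or a 2-tuple of subtrees
    states : dict mapping each leaf name to its observed character state
    Returns (candidate state set at this node, minimum number of changes below it).
    """
    if isinstance(tree, str):
        return {states[tree]}, 0
    left, right = tree
    left_set, left_cost = fitch_score(left, states)
    right_set, right_cost = fitch_score(right, states)
    common = left_set & right_set
    if common:
        # Children agree on at least one state: no extra change is needed here.
        return common, left_cost + right_cost
    # Children disagree: take the union and count one additional change.
    return left_set | right_set, left_cost + right_cost + 1


if __name__ == "__main__":
    site = {"t1": "A", "t2": "C", "t3": "A", "t4": "C"}
    tree_a = (("t1", "t2"), ("t3", "t4"))
    tree_b = (("t1", "t3"), ("t2", "t4"))
    print(fitch_score(tree_a, site)[1], fitch_score(tree_b, site)[1])
```

For this site, the second topology needs fewer changes than the first, so parsimony prefers it. A full analysis would sum such scores over all characters and search over tree topologies.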
A Basic Bivariate Structure of Personality Attributes Evident Across Nine Languages.
Saucier, Gerard; Thalmayer, Amber Gayle; Payne, Doris L; Carlson, Robert; Sanogo, Lamine; Ole-Kotikash, Leonard; Church, A Timothy; Katigbak, Marcia S; Somer, Oya; Szarota, Piotr; Szirmák, Zsofia; Zhou, Xinyue
2014-02-01
Here, two studies seek to characterize a parsimonious common-denominator personality structure with optimal cross-cultural replicability. Personality differences are observed in all human populations and cultures, but lexicons for personality attributes contain so many distinctions that parsimony is lacking. Models stipulating the most important attributes have been formulated by experts or by empirical studies drawing on experience in a very limited range of cultures. Factor analyses of personality lexicons of nine languages of diverse provenance (Chinese, Korean, Filipino, Turkish, Greek, Polish, Hungarian, Maasai, and Senoufo) were examined, and their common structure was compared to that of several prominent models in psychology. A parsimonious bivariate model showed evidence of substantial convergence and ubiquity across cultures. Analyses involving key markers of these dimensions in English indicate that they are broad dimensions involving the overlapping content of the interpersonal circumplex, models of communion and agency, and morality/warmth and competence. These "Big Two" dimensions (Social Self-Regulation and Dynamism) provide a common-denominator model involving the two most crucial axes of personality variation, ubiquitous across cultures. The Big Two might serve as an umbrella model serving to link diverse theoretical models and associated research literatures. © 2013 Wiley Periodicals, Inc.
NASA Astrophysics Data System (ADS)
Hsu, Shih-Jang
The major purpose of this study was to determine the relative contribution of nine variables in predicting teachers' responsible environmental behavior (REB). The theoretical framework of this study was based on the Hines model, the Hungerford and Volk model, and the environmental literacy framework proposed by the Environmental Literacy Assessment Consortium. A nine-page instrument was administered by mailed questionnaire to 300 randomly selected secondary teachers in Hualien County of Taiwan with a 78.7% response rate. Correlation and stepwise multiple regression analyses were conducted. The following conclusions were drawn: (1) For all the respondents, all the nine environmental literacy variables were significant correlates of REB. These correlates included: perceived knowledge of environmental action strategies (KNOW; r =.46), intention to act (IA; r =.46), perceived skill in using environmental action strategies (SKILL; r =.45), perceived knowledge of environmental problems and issues (KISSU; r =.34), environmental sensitivity (r =.28), environmental responsibility (r =.27), perceived knowledge of ecology and environmental science (r =.27), locus of control (r =.27), and environmental attitudes (r =.21). (2) When only the nine environmental literacy variables were considered, the most parsimonious set of predictors of REB for all the teachers included: (a) KNOW (Rsp2 =.2116); (b) IA (Rsp2 =.0916); and (c) SKILL (Rsp2 =.0205). For the urban teachers, the most parsimonious set of predictors included: (a) IA (Rsp2 =.2559); (b) SKILL (Rsp2 =.0926); and (c) environmental responsibility (Rsp2 =.0219). For the rural teachers, the most parsimonious set of predictors included: (a) KNOW (Rsp2 =.1872); (b) IA (Rsp2 =.0816); and (c) KISSU (Rsp2 =.0318). 
(3) When the environmental literacy variables as well as demographic and experience variables were considered, the most parsimonious set of predictors for all the teachers included: (a) KNOW, (Rsp2 =.2834); (b) IA, (Rsp2 =.0696); (c) area of residence, (Rsp2 =.0174); and (d) SKILL, (Rsp2 =.0163). For the urban teachers, the most parsimonious set of predictors included: (a) IA (Rsp2 =.3199); (b) SKILL (Rsp2 =.0840); (c) major sources of environmental information (Rsp2 =.0432); and (d) membership in environmental organizations, (Rsp2 =.0240). Implications for environmental education program development and instructional practice were presented. Recommendations for further research were also provided.
Stevenson, Jennifer C.; Simubali, Limonty; Mbambara, Saidon; Musonda, Michael; Mweetwa, Sydney; Mudenda, Twig; Pringle, Julia C.; Jones, Christine M.; Norris, Douglas E.
2016-01-01
Southern Zambia is the focus of strategies to create malaria-free zones. Interventions being rolled out include test and treat strategies and distribution of insecticide-treated bed nets that target vectors that host-seek indoors and late at night. In Macha, Choma District, collections of mosquitoes were made outdoors using barrier screens within homesteads or UV bulb light traps set next to goats, cattle, or chickens during the rainy season of 2015. Anopheline mosquitoes were identified to species using molecular methods and Plasmodium falciparum infectivity was determined by ELISA and real-time qPCR methods. More than 40% of specimens caught were identified as Anopheles squamosus Theobald, 1901, of which six were found harboring malaria parasites. A single sample, morphologically identified as Anopheles coustani Laveran, 1900, was also found to be infectious. All seven specimens were caught outdoors next to goat pens. Parasite-positive specimens as well as a subset of An. squamosus specimens from either the same study or archive collections from the same area underwent sequencing of the mitochondrial cytochrome oxidase subunit I gene. Maximum parsimony trees constructed from the aligned sequences indicated the presence of at least two clades of An. squamosus with infectious specimens falling in each clade. The single infectious specimen identified morphologically as An. coustani could not be matched to reference sequences. This is the first report from Zambia of infections in An. squamosus, a species which is described in the literature as displaying exophagic traits. The bionomic characteristics of this species need to be studied further to fully evaluate the implications for indoor-targeted vector control. PMID:27297214
Mishra, Priyanka; Kumar, Amit; Nagireddy, Akshitha; Shukla, Ashutosh K.
2017-01-01
DNA barcoding is used as a universal tool for delimiting species boundaries in taxonomically challenging groups, with different plastid and nuclear regions (rbcL, matK, ITS and psbA-trnH) being recommended as primary DNA barcodes for plants. We evaluated the feasibility of using these regions in the species-rich genus Terminalia, which exhibits various overlapping morphotypes with pantropical distribution, owing to its complex taxonomy. Terminalia bellerica and T. chebula are ingredients of the famous Ayurvedic Rasayana formulation Triphala, used for detoxification and rejuvenation. High demand for extracted phytochemicals as well as the high trade value of several species renders mandatory the need for the correct identification of traded plant material. Three different analytical methods with single and multilocus barcoding regions were tested to develop a DNA barcode reference library from 222 individuals representing 41 Terminalia species. All the single barcodes tested had a lower discriminatory power than the multilocus regions, and the combination of matK+ITS had the highest resolution rate (94.44%). The average intra-specific variations (0.0188±0.0019) were less than the distance to the nearest neighbour (0.106±0.009) with matK and ITS. Distance-based Neighbour Joining analysis outperformed the character-based Maximum Parsimony method in the identification of traded species such as T. arjuna, T. chebula and T. tomentosa, which are prone to adulteration. rbcL was shown to be a highly conservative region with only 3.45% variability between all of the sequences. The recommended barcode combination, rbcL+matK, failed to perform in the genus Terminalia. Considering the complexity of resolution observed with single regions, the present study proposes the combination of matK+ITS as the most successful barcode in Terminalia. PMID:28829803
Xie, Lei; Yang, Zhi-Yun; Wen, Jun; Li, De-Zhu; Yi, Ting-Shuang
2014-08-01
Pistacia L. exhibits a disjunct distribution in Mediterranean Eurasia and adjacent North Africa, eastern Asia, and North to Central America. The spatio-temporal diversification history of Pistacia was assessed to test hypotheses on the Madrean-Tethyan and the Eurasian Tethyan disjunctions through phylogenetic and biogeographic analyses. Maximum parsimony and Bayesian methods were employed to analyze sequences of multiple nuclear and plastid loci of Pistacia species. Bayesian dating analysis was conducted to estimate the divergence times of clades. The likelihood method LAGRANGE was used to infer ancestral areas. The New World species of Pistacia formed a clade sister to the Old World clade in all phylogenetic analyses. The eastern Asian Pistacia weinmannifolia-P. cucphuongensis clade was sister to a clade of the remaining Old World species, which were further resolved into three subclades. Pistacia was estimated to have originated at 37.60 mya (with 95% highest posterior density interval (HPD): 25.42-48.51 mya). A vicariance event in the early Miocene (19.79 mya with 95% HPD: 10.88-30.36 mya) was inferred to account for the intercontinental disjunction between the New World and the Old World species, which is consistent with the Madrean-Tethyan hypothesis. The two Old World eastern Asian-Tethyan disjunctions are best explained by one vicariance event in the early Miocene (15.87 mya with 95% HPD: 8.36-24.36 mya) and one dispersal event in late Miocene (5.89 mya with 95% HPD: 2.68-9.16 mya). The diversification of the Old World Pistacia species was significantly affected by extensive geological and climatic changes in the Qinghai-Tibetan plateau (QTP) and in the Mediterranean region. Copyright © 2014 Elsevier Inc. All rights reserved.
HYDROSCAPE: A SCAlable and ParallelizablE Rainfall Runoff Model for Hydrological Applications
NASA Astrophysics Data System (ADS)
Piccolroaz, S.; Di Lazzaro, M.; Zarlenga, A.; Majone, B.; Bellin, A.; Fiori, A.
2015-12-01
In this work we present HYDROSCAPE, an innovative streamflow routing method based on the travel time approach, and modeled through a fine-scale geomorphological description of hydrological flow paths. The model is designed to be easily coupled with weather forecast or climate models providing the hydrological forcing, while preserving the geomorphological dispersion of the river network, which is kept unchanged independently of the grid size of the rainfall input. This makes HYDROSCAPE particularly suitable for multi-scale applications, ranging from medium size catchments up to the continental scale, and for investigating the effects of extreme rainfall events that require an accurate description of basin response timing. A key feature of the model is its computational efficiency, which allows a large number of simulations to be performed for sensitivity/uncertainty analyses in a Monte Carlo framework. Further, the model is highly parsimonious, involving the calibration of only three parameters: one defining the residence time of hillslope response, one for channel velocity, and a multiplicative factor accounting for uncertainties in the identification of the potential maximum soil moisture retention in the SCS-CN method. HYDROSCAPE is designed with a simple and flexible modular structure, which makes it well suited to massive parallelization, customization according to the specific user needs and preferences (e.g., rainfall-runoff model), and continuous development and improvement. Finally, the possibility to specify the desired computational time step and evaluate streamflow at any location in the domain makes HYDROSCAPE an attractive tool for many hydrological applications, and a valuable alternative to more complex and highly parametrized large scale hydrological models. 
Together with model development and features, we present an application to the Upper Tiber River basin (Italy), providing a practical example of model performance and characteristics.
NASA Astrophysics Data System (ADS)
Aronica, G. T.; Candela, A.
2007-12-01
In this paper a Monte Carlo procedure for deriving frequency distributions of peak flows using a semi-distributed stochastic rainfall-runoff model is presented. The rainfall-runoff model used here is a very simple one, with a limited number of parameters, and requires practically no calibration, resulting in a robust tool for catchments that are partially or poorly gauged. The procedure is based on three modules: a stochastic rainfall generator module, a hydrologic loss module and a flood routing module. In the rainfall generator module the rainfall storm, i.e. the maximum rainfall depth for a fixed duration, is assumed to follow the two-component extreme value (TCEV) distribution whose parameters have been estimated at regional scale for Sicily. The catchment response has been modelled by using the Soil Conservation Service-Curve Number (SCS-CN) method, in a semi-distributed form, for the transformation of total rainfall to effective rainfall and a simple form of the IUH for the flood routing. Here, the SCS-CN method is implemented in probabilistic form with respect to prior-to-storm conditions, allowing the classical iso-frequency assumption between rainfall and peak flow to be relaxed. The procedure is tested on six practical case studies where synthetic FFCs (flood frequency curves) were obtained starting from model variables distributions by simulating 5000 flood events combining 5000 values of total rainfall depth for the storm duration and AMC (antecedent moisture conditions). The application of this procedure showed how the Monte Carlo simulation technique can reproduce the observed flood frequency curves with reasonable accuracy over a wide range of return periods using a simple and parsimonious approach, limited data input and without any calibration of the rainfall-runoff model.
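The hydrologic loss step described in this abstract can be illustrated with a minimal sketch. It uses the standard SCS-CN relations in millimetres (S = 25400/CN - 254, initial abstraction Ia = 0.2 S, Q = (P - Ia)^2 / (P - Ia + S) for P > Ia). Everything else is an assumption made for illustration and is not taken from the paper: the function names, the exponential rainfall sampler (a stand-in for the TCEV distribution), and the three curve numbers representing antecedent moisture classes.

```python
import random


def scs_cn_runoff(rain_mm, curve_number):
    """Effective rainfall (mm) from total rainfall (mm) with the SCS-CN method."""
    s = 25400.0 / curve_number - 254.0   # potential maximum retention S, in mm
    ia = 0.2 * s                         # initial abstraction, standard 0.2*S
    if rain_mm <= ia:
        return 0.0                       # all rainfall is lost before runoff starts
    return (rain_mm - ia) ** 2 / (rain_mm - ia + s)


def simulate_effective_rainfall(n_events, mean_rain_mm, cn_by_amc, seed=0):
    """Monte Carlo sample of effective rainfall depths.

    Rainfall depth is drawn from an exponential distribution (hypothetical
    stand-in for TCEV) and the curve number is drawn at random from the
    supplied antecedent-moisture classes (hypothetical values).
    """
    rng = random.Random(seed)
    depths = []
    for _ in range(n_events):
        rain = rng.expovariate(1.0 / mean_rain_mm)
        cn = rng.choice(cn_by_amc)
        depths.append(scs_cn_runoff(rain, cn))
    return depths


if __name__ == "__main__":
    sample = simulate_effective_rainfall(5000, 40.0, [65, 80, 90])
    print(len(sample), max(sample))
```

Sampling the curve number independently of the rainfall depth is what lets a given storm produce different runoff volumes, which is the sense in which the iso-frequency assumption is relaxed; a flood routing step would then turn each effective rainfall depth into a peak flow.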
Molecular phylogeny and evolutionary timescale for the family of mammalian herpesviruses.
McGeoch, D J; Cook, S; Dolan, A; Jamieson, F E; Telford, E A
1995-03-31
A detailed phylogenetic analysis for mammalian members of the family Herpesviridae, based on molecular sequences is reported. Sets of encoded amino acid sequences were collected for eight well conserved genes that are common to mammalian herpesviruses. Phylogenetic trees were inferred from alignments of these sequence sets using both maximum parsimony and distance methods, and evaluated by bootstrap analysis. In all cases the three recognised subfamilies (Alpha-, Beta- and Gammaherpesvirinae), and major sublineages in each subfamily, were clearly distinguished, but within sublineages some finer details of branching were incompletely resolved. Multiple-gene sets were assembled to give a broadly based tree. The root position of the tree was estimated by assuming a constant molecular clock and also by analysis of one herpesviral gene set (that encoding uracil-DNA glycosylase) using cellular homologues as outgroups. Both procedures placed the root between the Alphaherpesvirinae and the other two subfamilies. Substitution rates were calculated for the combined gene sets based on a previous estimate for alphaherpesviral UL27 genes, where the time base had been obtained according to the hypothesis of cospeciation of virus and host lineages. Assuming a constant molecular clock, it was then estimated that the three subfamilies arose approximately 180 to 220 million years ago, that major sublineages within subfamilies were probably generated before the mammalian radiation of 80 to 60 million years ago, and that speciations within sublineages took place in the last 80 million years, probably with a major component of cospeciation with host lineages.
Chao, Li-Lian; Shih, Chien-Ming
2016-12-01
The genetic identity of the Rhipicephalus sanguineus tick was determined for the first time in Taiwan. The phylogenetic relationships were analyzed by comparing the sequences of mitochondrial 16S ribosomal DNA gene obtained from 32 strains of ticks representing six species of Rhipicephalus, two species of Dermacentor and two outgroup species (Haemaphysalis inermis and Ixodes ricinus). Seven major clades can be easily distinguished by neighbour-joining analysis and were congruent with those obtained by the maximum-parsimony method. All R. sanguineus ticks of Taiwan were genetically affiliated with the tropical lineage group of R. sanguineus sensu lato with highly homogeneous sequence (99.7-100% similarity), and can be discriminated from the temperate lineage group of Rhipicephalus sp. II and R. turanicus with a sequence divergence ranging from 1.7 to 5.2%. In contrast, the nucleotide variations among other Rhipicephalus spp. and other species/genus of ticks compared with the R. sanguineus ticks of Taiwan were measured from 10.6 to 25.5%. Moreover, intra- and inter-species analysis based on the genetic distance (GD) values indicated a lower level (GD < 0.003) within tropical lineage group compared with temperate lineage group (GD > 0.055) of Rhipicephalus, as well as other (GD > 0.129) and outgroup (GD > 0.236) species. Our results provide the first genetic identification of R. sanguineus ticks collected from Taiwan and demonstrate that all these R. sanguineus ticks of Taiwan are affiliated with the tropical lineage group of R. sanguineus sensu lato.
Chao, Li-Lian; Lu, Chun-Wei; Lin, Ying-Fang; Shih, Chien-Ming
2017-04-01
Genetic identity and morphological features of a human-biting tick, Amblyomma testudinarium, were determined for the first time in Taiwan. Morphological features of adult male and female ticks of Am. testudinarium were observed and photographed with a stereomicroscope. The genetic identity was analyzed by comparing the sequences of mitochondrial 16S ribosomal DNA gene obtained from 18 strains of ticks representing 10 species of Amblyomma, and four outgroup species of Dermacentor and Rhipicephalus ticks. Nine major clades could be easily distinguished by neighbour-joining analysis and were congruent with those obtained by the maximum-parsimony method. All these Am. testudinarium ticks collected from Taiwan and Japan were genetically affiliated with a monophyletic group with highly homogeneous sequence (99.8-100% similarity), and can be discriminated from other species of Amblyomma and other genera of ticks (Dermacentor and Rhipicephalus) with a sequence divergence ranging from 6.9 to 23.9%. Moreover, intra- and inter-species analysis based on the genetic distance (GD) values indicated a lower level (GD < 0.003) within the same lineage of Am. testudinarium ticks collected from Taiwan and Japan, as compared with other lineage groups (GD > 0.108) of Amblyomma ticks, as well as outgroup (GD > 0.172) species. Our results provide the first distinguishing features of adult Am. testudinarium ticks and the first genetic identification of Am. testudinarium ticks collected from humans in Taiwan. Seasonal prevalence, host range, and vectorial capacity of this tick species in Taiwan need to be further clarified.
Finding Nemo: molecular phylogeny and evolution of the unusual life style of anemonefish.
Santini, Simona; Polacco, Giovanni
2006-12-30
Anemonefish are a group of 28 species of coral reef fish belonging to the family Pomacentridae, subfamily Amphiprioninae, all characterized by living in symbiosis with sea anemones of several genera. Some anemonefish are specialized to cooperate with a single or few species of sea anemone, being immune to their poisonous tentacles but sensitive to those of other species of sea anemones, while other anemonefish are more generalist and able to live together with a number of different species of sea anemone hosts. Despite the common life style, anemonefish species occur in a variety of colors, body shapes and degrees of dependence on the host. To understand the evolutionary mechanisms responsible for the anemonefish diversification, we studied 23 out of 28 species of anemonefish by analyzing three mitochondrial regions (the cytochrome b gene, the 16S ribosomal RNA gene and the first half of the D-loop, a non-coding, regulatory region) to reconstruct their molecular phylogeny through Bayesian and maximum parsimony approaches. The evolution of specialization was studied by means of character reconstruction methods. This work includes the highest number of anemonefish so far analyzed and particularly some species that had never been studied before. The results support a monophyletic origin for the subfamily Amphiprioninae, in contrast to the current taxonomy, based on morphological characters, that divides anemonefish into two separate genera. Moreover, we formulate some hypotheses concerning the life style and origin of the ancestral anemonefish.
Mucheka, Vimbai T; Lamb, Jennifer M; Pfukenyi, Davies M; Mukaratirwa, Samson
2015-11-30
The aim of this study was to identify and determine the genetic diversity of Fasciola species in cattle from Zimbabwe, the KwaZulu-Natal and Mpumalanga provinces of South Africa and selected wildlife hosts from Zimbabwe. This was based on analysis of DNA sequences of the nuclear ribosomal internal transcribed spacer (ITS1 and 2) and mitochondrial cytochrome oxidase 1 (CO1) regions. The sample of 120 flukes was collected from livers of 57 cattle at 4 abattoirs in Zimbabwe and 47 cattle at 6 abattoirs in South Africa; it also included three alcohol-preserved duiker, antelope and eland samples from Zimbabwe. Aligned sequences (ITS 506 base pairs and CO1 381 base pairs) were analyzed by neighbour-joining, maximum parsimony and Bayesian inference methods. Phylogenetic trees revealed the presence of Fasciola gigantica in cattle from Zimbabwe and F. gigantica and Fasciola hepatica in the samples from South Africa. F. hepatica was more prevalent (64%) in South Africa than F. gigantica. In Zimbabwe, F. gigantica was present in 99% of the samples; F. hepatica was found in only one cattle sample, an antelope (Hippotragus niger) and a duiker (Sylvicapra grimmia). This is the first molecular confirmation of the identity of Fasciola species in Zimbabwe and South Africa. Knowledge of the identity and distribution of these liver flukes at the molecular level will allow disease surveillance and control in the studied areas. Copyright © 2015 Elsevier B.V. All rights reserved.
Springer, Mark S; Signore, Anthony V; Paijmans, Johanna L A; Vélez-Juarbe, Jorge; Domning, Daryl P; Bauer, Cameron E; He, Kai; Crerar, Lorelei; Campos, Paula F; Murphy, William J; Meredith, Robert W; Gatesy, John; Willerslev, Eske; MacPhee, Ross D E; Hofreiter, Michael; Campbell, Kevin L
2015-10-01
The recently extinct (ca. 1768) Steller's sea cow (Hydrodamalis gigas) was a large, edentulous North Pacific sirenian. The phylogenetic affinities of this taxon to other members of this clade, living and extinct, are uncertain based on previous morphological and molecular studies. We employed hybridization capture methods and second generation sequencing technology to obtain >30kb of exon sequences from 26 nuclear genes for both H. gigas and Dugong dugon. We also obtained complete coding sequences for the tooth-related enamelin (ENAM) gene. Hybridization probes designed using dugong and manatee sequences were both highly effective in retrieving sequences from H. gigas (mean=98.8% coverage), as were more divergent probes for regions of ENAM (99.0% coverage) that were designed exclusively from a proboscidean (African elephant) and a hyracoid (Cape hyrax). New sequences were combined with available sequences for representatives of all other afrotherian orders. We also expanded a previously published morphological matrix for living and fossil Sirenia by adding both new taxa and nine new postcranial characters. Maximum likelihood and parsimony analyses of the molecular data provide robust support for an association of H. gigas and D. dugon to the exclusion of living trichechids (manatees). Parsimony analyses of the morphological data also support the inclusion of H. gigas in Dugongidae with D. dugon and fossil dugongids. Timetree analyses based on calibration density approaches with hard- and soft-bounded constraints suggest that H. gigas and D. dugon diverged in the Oligocene and that crown sirenians last shared a common ancestor in the Eocene. The coding sequence for the ENAM gene in H. gigas does not contain frameshift mutations or stop codons, but there is a transversion mutation (AG to CG) in the acceptor splice site of intron 2. 
This disruption in the edentulous Steller's sea cow is consistent with previous studies that have documented inactivating mutations in tooth-specific loci of a variety of edentulous and enamelless vertebrates including birds, turtles, aardvarks, pangolins, xenarthrans, and baleen whales. Further, branch-site dN/dS analyses provide evidence for positive selection in ENAM on the stem dugongid branch where extensive tooth reduction occurred, followed by neutral evolution on the Hydrodamalis branch. Finally, we present a synthetic evolutionary tree for living and fossil sirenians showing several key innovations in the history of this clade including character state changes that parallel those that occurred in the evolutionary history of cetaceans. Copyright © 2015 Elsevier Inc. All rights reserved.
Saarela, Jeffery M.; Wysocki, William P.; Barrett, Craig F.; Soreng, Robert J.; Davis, Jerrold I.; Clark, Lynn G.; Kelchner, Scot A.; Pires, J. Chris; Edger, Patrick P.; Mayfield, Dustin R.; Duvall, Melvin R.
2015-01-01
Whole plastid genomes are being sequenced rapidly from across the green plant tree of life, and phylogenetic analyses of these are increasing resolution and support for relationships that have varied among or been unresolved in earlier single- and multi-gene studies. Pooideae, the cool-season grass lineage, is the largest of the 12 grass subfamilies and includes important temperate cereals, turf grasses and forage species. Although numerous studies of the phylogeny of the subfamily have been undertaken, relationships among some ‘early-diverging’ tribes conflict among studies, and some relationships among subtribes of Poeae have not yet been resolved. To address these issues, we newly sequenced 25 whole plastomes, which showed rearrangements typical of Poaceae. These plastomes represent 9 tribes and 11 subtribes of Pooideae, and were analysed with 20 existing plastomes for the subfamily. Maximum likelihood (ML), maximum parsimony (MP) and Bayesian inference (BI) robustly resolve most deep relationships in the subfamily. Complete plastome data provide increased nodal support compared with protein-coding data alone at nodes that are not maximally supported. Following the divergence of Brachyelytrum, Phaenospermateae, Brylkinieae–Meliceae and Ampelodesmeae–Stipeae are the successive sister groups of the rest of the subfamily. Ampelodesmeae are nested within Stipeae in the plastome trees, consistent with its hybrid origin between a phaenospermatoid and a stipoid grass (the maternal parent). The core Pooideae are strongly supported and include Brachypodieae, a Bromeae–Triticeae clade and Poeae. Within Poeae, a novel sister group relationship between Phalaridinae and Torreyochloinae is found, and the relative branching order of this clade and Aveninae, with respect to an Agrostidinae–Brizinae clade, are discordant between MP and ML/BI trees. 
Maximum likelihood and Bayesian analyses strongly support Airinae and Holcinae as the successive sister groups of a Dactylidinae–Loliinae clade. PMID:25940204
Automatic load forecasting. Final report
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nelson, D.J.; Vemuri, S.
A method which lends itself to on-line forecasting of hourly electric loads is presented and the results of its use are compared to models developed using the Box-Jenkins method. The method consists of processing the historical hourly loads with a sequential least-squares estimator to identify a finite order autoregressive model which in turn is used to obtain a parsimonious autoregressive-moving average model. A procedure is also defined for incorporating temperature as a variable to improve forecasts where loads are temperature dependent. The method presented has several advantages in comparison to the Box-Jenkins method including much less human intervention and improved model identification. The method has been tested using three-hourly data from the Lincoln Electric System, Lincoln, Nebraska. In the exhaustive analyses performed on this data base this method produced significantly better results than the Box-Jenkins method. The method also proved to be more robust in that greater confidence could be placed in the accuracy of models based upon the various measures available at the identification stage.
Borths, Matthew R; Holroyd, Patricia A; Seiffert, Erik R
2016-01-01
Hyaenodonta is a diverse, extinct group of carnivorous mammals that included weasel- to rhinoceros-sized species. The oldest-known hyaenodont fossils are from the middle Paleocene of North Africa and the antiquity of the group in Afro-Arabia led to the hypothesis that it originated there and dispersed to Asia, Europe, and North America. Here we describe two new hyaenodont species based on the oldest hyaenodont cranial specimens known from Afro-Arabia. The material was collected from the latest Eocene Locality 41 (L-41, ∼34 Ma) in the Fayum Depression, Egypt. Akhnatenavus nefertiticyon sp. nov. has specialized, hypercarnivorous molars and an elongate cranial vault. In A. nefertiticyon the tallest, piercing cusp on M1–M2 is the paracone. Brychotherium ephalmos gen. et sp. nov. has more generalized molars that retain the metacone and complex talonids. In B. ephalmos the tallest, piercing cusp on M1–M2 is the metacone. We incorporate this new material into a series of phylogenetic analyses using a character-taxon matrix that includes novel dental, cranial, and postcranial characters, and samples extensively from the global record of the group. The phylogenetic analysis includes the first application of Bayesian methods to hyaenodont relationships. B. ephalmos is consistently placed within Teratodontinae, an Afro-Arabian clade with several generalist and hypercarnivorous forms, and Akhnatenavus is consistently recovered in Hyainailourinae as part of an Afro-Arabian radiation. The phylogenetic results suggest that hypercarnivory evolved independently three times within Hyaenodonta: in Teratodontinae, in Hyainailourinae, and in Hyaenodontinae. Teratodontines are consistently placed in a close relationship with Hyainailouridae (Hyainailourinae + Apterodontinae) to the exclusion of "proviverrines," hyaenodontines, and several North American clades, and we propose that the superfamily Hyainailouroidea be used to describe this relationship. 
Using the topologies recovered from each phylogenetic method, we reconstructed the biogeographic history of Hyaenodonta using parsimony optimization (PO), likelihood optimization (LO), and Bayesian Binary Markov chain Monte Carlo (MCMC) to examine support for the Afro-Arabian origin of Hyaenodonta. Across all analyses, we found that Hyaenodonta most likely originated in Europe, rather than Afro-Arabia. The clade is estimated by tip-dating analysis to have undergone a rapid radiation in the Late Cretaceous and Paleocene; a radiation currently not documented by fossil evidence. During the Paleocene, lineages are reconstructed as dispersing to Asia, Afro-Arabia, and North America. The place of origin of Hyainailouroidea is likely Afro-Arabia according to the Bayesian topologies but it is ambiguous using parsimony. All topologies support the constituent clades–Hyainailourinae, Apterodontinae, and Teratodontinae–as Afro-Arabian and tip-dating estimates that each clade is established in Afro-Arabia by the middle Eocene.
Seiffert, Erik R.
2016-01-01
Hyaenodonta is a diverse, extinct group of carnivorous mammals that included weasel- to rhinoceros-sized species. The oldest-known hyaenodont fossils are from the middle Paleocene of North Africa and the antiquity of the group in Afro-Arabia led to the hypothesis that it originated there and dispersed to Asia, Europe, and North America. Here we describe two new hyaenodont species based on the oldest hyaenodont cranial specimens known from Afro-Arabia. The material was collected from the latest Eocene Locality 41 (L-41, ∼34 Ma) in the Fayum Depression, Egypt. Akhnatenavus nefertiticyon sp. nov. has specialized, hypercarnivorous molars and an elongate cranial vault. In A. nefertiticyon the tallest, piercing cusp on M1–M2 is the paracone. Brychotherium ephalmos gen. et sp. nov. has more generalized molars that retain the metacone and complex talonids. In B. ephalmos the tallest, piercing cusp on M1–M2 is the metacone. We incorporate this new material into a series of phylogenetic analyses using a character-taxon matrix that includes novel dental, cranial, and postcranial characters, and samples extensively from the global record of the group. The phylogenetic analysis includes the first application of Bayesian methods to hyaenodont relationships. B. ephalmos is consistently placed within Teratodontinae, an Afro-Arabian clade with several generalist and hypercarnivorous forms, and Akhnatenavus is consistently recovered in Hyainailourinae as part of an Afro-Arabian radiation. The phylogenetic results suggest that hypercarnivory evolved independently three times within Hyaenodonta: in Teratodontinae, in Hyainailourinae, and in Hyaenodontinae. Teratodontines are consistently placed in a close relationship with Hyainailouridae (Hyainailourinae + Apterodontinae) to the exclusion of “proviverrines,” hyaenodontines, and several North American clades, and we propose that the superfamily Hyainailouroidea be used to describe this relationship. 
Using the topologies recovered from each phylogenetic method, we reconstructed the biogeographic history of Hyaenodonta using parsimony optimization (PO), likelihood optimization (LO), and Bayesian Binary Markov chain Monte Carlo (MCMC) to examine support for the Afro-Arabian origin of Hyaenodonta. Across all analyses, we found that Hyaenodonta most likely originated in Europe, rather than Afro-Arabia. The clade is estimated by tip-dating analysis to have undergone a rapid radiation in the Late Cretaceous and Paleocene; a radiation currently not documented by fossil evidence. During the Paleocene, lineages are reconstructed as dispersing to Asia, Afro-Arabia, and North America. The place of origin of Hyainailouroidea is likely Afro-Arabia according to the Bayesian topologies but it is ambiguous using parsimony. All topologies support the constituent clades–Hyainailourinae, Apterodontinae, and Teratodontinae–as Afro-Arabian and tip-dating estimates that each clade is established in Afro-Arabia by the middle Eocene. PMID:27867761
Aeroelastic Model Structure Computation for Envelope Expansion
NASA Technical Reports Server (NTRS)
Kukreja, Sunil L.
2007-01-01
Structure detection is a procedure for selecting a subset of candidate terms, from a full model description, that best describes the observed output. This is a necessary procedure to compute an efficient system description which may afford greater insight into the functionality of the system or a simpler controller design. Structure computation as a tool for black-box modelling may be of critical importance in the development of robust, parsimonious models for the flight-test community. Moreover, this approach may lead to efficient strategies for rapid envelope expansion which may save significant development time and costs. In this study, a least absolute shrinkage and selection operator (LASSO) technique is investigated for computing efficient model descriptions of nonlinear aeroelastic systems. The LASSO minimises the residual sum of squares by the addition of an $\ell_1$ penalty term on the parameter vector of the traditional $\ell_2$ minimisation problem. Its use for structure detection is a natural extension of this constrained minimisation approach to pseudolinear regression problems which produces some model parameters that are exactly zero and, therefore, yields a parsimonious system description. Applicability of this technique for model structure computation for the F/A-18 Active Aeroelastic Wing using flight test data is shown for several flight conditions (Mach numbers) by identifying a parsimonious system description with a high percent fit for cross-validated data.
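Why an l1 penalty yields parameters that are exactly zero can be seen in the simplest special case. The sketch below assumes orthonormal regressors, an assumption made here for illustration and not stated in the abstract. Under it, minimising (1/2)||y - X theta||^2 + lam * ||theta||_1 reduces to soft-thresholding each ordinary least-squares coefficient. The function names and the numbers are hypothetical.

```python
def soft_threshold(z, lam):
    """Minimiser of 0.5 * (z - theta)**2 + lam * abs(theta) over theta."""
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0  # coefficients within [-lam, lam] are set exactly to zero


def lasso_orthonormal(ols_coefficients, lam):
    """LASSO estimate when the regressors are orthonormal (X^T X = I).

    ols_coefficients holds the ordinary least-squares estimates X^T y.
    """
    return [soft_threshold(z, lam) for z in ols_coefficients]


if __name__ == "__main__":
    # Hypothetical least-squares estimates for four candidate terms.
    estimates = lasso_orthonormal([2.5, -0.3, 0.1, -1.75], lam=0.5)
    print(estimates)  # [2.0, 0.0, 0.0, -1.25]
```

The two small coefficients are removed from the model while the two large ones are kept and shrunk, which is the structure-selection behaviour the abstract refers to. For general, correlated regressors an iterative solver is needed instead of this closed form.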
Beyond technology acceptance to effective technology use: a parsimonious and actionable model.
Holahan, Patricia J; Lesselroth, Blake J; Adams, Kathleen; Wang, Kai; Church, Victoria
2015-05-01
To develop and test a parsimonious and actionable model of effective technology use (ETU). Cross-sectional survey of primary care providers (n = 53) in a large integrated health care organization that recently implemented new medication reconciliation technology. Surveys assessed 5 technology-related perceptions (compatibility with work values, implementation climate, compatibility with work processes, perceived usefulness, and ease of use) and 1 outcome variable, ETU. ETU was measured as both consistency and quality of technology use. Compatibility with work values and implementation climate were found to have differential effects on consistency and quality of use. When implementation climate was strong, consistency of technology use was high. However, quality of technology use was high only when implementation climate was strong and values compatibility was high. This is an important finding and highlights the importance of users' workplace values as a key determinant of quality of use. To extend our effectiveness in implementing new health care information technology, we need parsimonious models that include actionable determinants of ETU and account for the differential effects of these determinants on the multiple dimensions of ETU. © The Author 2015. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
A parsimonious modular approach to building a mechanistic belowground carbon and nitrogen model
NASA Astrophysics Data System (ADS)
Abramoff, Rose Z.; Davidson, Eric A.; Finzi, Adrien C.
2017-09-01
Soil decomposition models range from simple empirical functions to those that represent physical, chemical, and biological processes. Here we develop a parsimonious, modular C and N cycle model, the Dual Arrhenius Michaelis-Menten-Microbial Carbon and Nitrogen Physiology (DAMM-MCNiP), that generates testable hypotheses regarding the effect of temperature, moisture, and substrate supply on C and N cycling. We compared this model to DAMM alone and an empirical model of heterotrophic respiration based on Harvard Forest data. We show that while different model structures explain similar amounts of variation in respiration, they differ in their ability to infer processes that affect C flux. We applied DAMM-MCNiP to explain an observed seasonal hysteresis in the relationship between respiration and temperature and show using an exudation simulation that the strength of the priming effect depended on the stoichiometry of the inputs. Low C:N inputs stimulated priming of soil organic matter decomposition, but high C:N inputs were preferentially utilized by microbes as a C source with limited priming. The simplicity of DAMM-MCNiP's simultaneous representations of temperature, moisture, substrate supply, enzyme activity, and microbial growth processes is unique among microbial physiology models and is sufficiently parsimonious that it could be incorporated into larger-scale models of C and N cycling.
Then, Amy Y.; Hoenig, John M; Hall, Norman G.; Hewitt, David A.
2015-01-01
Many methods have been developed in the last 70 years to predict the natural mortality rate, M, of a stock based on empirical evidence from comparative life history studies. These indirect or empirical methods are used in most stock assessments to (i) obtain estimates of M in the absence of direct information, (ii) check on the reasonableness of a direct estimate of M, (iii) examine the range of plausible M estimates for the stock under consideration, and (iv) define prior distributions for Bayesian analyses. The two most cited empirical methods have appeared in the literature over 2500 times to date. Despite the importance of these methods, there is no consensus in the literature on how well these methods work in terms of prediction error or how their performance may be ranked. We evaluate estimators based on various combinations of maximum age (tmax), growth parameters, and water temperature by seeing how well they reproduce >200 independent, direct estimates of M. We use tenfold cross-validation to estimate the prediction error of the estimators and to rank their performance. With updated and carefully reviewed data, we conclude that a tmax-based estimator performs the best among all estimators evaluated. The tmax-based estimators in turn perform better than the Alverson–Carney method based on tmax and the von Bertalanffy K coefficient, Pauly’s method based on growth parameters and water temperature and methods based just on K. It is possible to combine two independent methods by computing a weighted mean but the improvement over the tmax-based methods is slight. Based on cross-validation prediction error, model residual patterns, model parsimony, and biological considerations, we recommend the use of a tmax-based estimator ($M = 4.899\,t_{\max}^{-0.916}$, prediction error = 0.32) when possible and a growth-based method ($M = 4.118\,K^{0.73} L_{\infty}^{-0.33}$, prediction error = 0.6, length in cm) otherwise.
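The two estimators recommended in this abstract are simple power laws: M = 4.899 * tmax^(-0.916) and M = 4.118 * K^0.73 * Linf^(-0.33), with length in cm. A minimal sketch of how they might be evaluated follows. The coefficients come from the abstract; the function names and the example inputs are hypothetical, and the abstract does not state the units of tmax (years are assumed here).

```python
def mortality_from_tmax(tmax):
    """Natural mortality M from maximum age tmax (assumed to be in years)."""
    return 4.899 * tmax ** -0.916


def mortality_from_growth(k, l_inf_cm):
    """Natural mortality M from von Bertalanffy K and asymptotic length (cm)."""
    return 4.118 * k ** 0.73 * l_inf_cm ** -0.33


if __name__ == "__main__":
    # Hypothetical stock: maximum observed age 20, K = 0.2, L-infinity = 80 cm.
    print(round(mortality_from_tmax(20.0), 3))
    print(round(mortality_from_growth(0.2, 80.0), 3))
```

Following the abstract's recommendation, the tmax-based function would be used whenever a maximum age is available, with the growth-based function as the fallback.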
Soft-tissue anatomy of the extant hominoids: a review and phylogenetic analysis
Gibbs, S; Collard, M; Wood, B
2002-01-01
This paper reports the results of a literature search for information about the soft-tissue anatomy of the extant non-human hominoid genera, Pan, Gorilla, Pongo and Hylobates, together with the results of a phylogenetic analysis of these data plus comparable data for Homo. Information on the four extant non-human hominoid genera was located for 240 out of the 1783 soft-tissue structures listed in the Nomina Anatomica. Numerically these data are biased so that information about some systems (e.g. muscles) and some regions (e.g. the forelimb) are over-represented, whereas other systems and regions (e.g. the veins and the lymphatics of the vascular system, the head region) are either under-represented or not represented at all. Screening to ensure that the data were suitable for use in a phylogenetic analysis reduced the number of eligible soft-tissue structures to 171. These data, together with comparable data for modern humans, were converted into discontinuous character states suitable for phylogenetic analysis and then used to construct a taxon-by-character matrix. This matrix was used in two tests of the hypothesis that soft-tissue characters can be relied upon to reconstruct hominoid phylogenetic relationships. In the first, parsimony analysis was used to identify cladograms requiring the smallest number of character state changes. In the second, the phylogenetic bootstrap was used to determine the confidence intervals of the most parsimonious clades. The parsimony analysis yielded a single most parsimonious cladogram that matched the molecular cladogram. Similarly the bootstrap analysis yielded clades that were compatible with the molecular cladogram; a (Homo, Pan) clade was supported by 95% of the replicates, and a (Gorilla, Pan, Homo) clade by 96%. These are the first hominoid morphological data to provide statistically significant support for the clades favoured by the molecular evidence. PMID:11833653
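Counting the character state changes a cladogram requires, the quantity that parsimony analysis minimises, can be sketched with Fitch's algorithm for a single unordered character on a rooted binary tree. This is a generic illustration and not the software or procedure used in the study. The nested-tuple tree encoding, the function name, and the character states assigned to the five genera are hypothetical.

```python
def fitch_score(tree, states):
    """Return (state_set, changes) for one unordered character.

    tree is a leaf name (str) or a 2-tuple of subtrees; states maps each
    leaf name to its observed character state.
    """
    if isinstance(tree, str):
        return {states[tree]}, 0
    left, right = tree
    left_set, left_changes = fitch_score(left, states)
    right_set, right_changes = fitch_score(right, states)
    shared = left_set & right_set
    if shared:
        # Children agree on at least one state: no extra change needed here.
        return shared, left_changes + right_changes
    # Children disagree: take the union and count one state change.
    return left_set | right_set, left_changes + right_changes + 1


if __name__ == "__main__":
    # Hypothetical binary character shared by Homo and Pan only.
    states = {"Homo": 1, "Pan": 1, "Gorilla": 0, "Pongo": 0, "Hylobates": 0}
    tree_a = ((("Homo", "Pan"), "Gorilla"), ("Pongo", "Hylobates"))
    tree_b = ((("Homo", "Gorilla"), "Pan"), ("Pongo", "Hylobates"))
    print(fitch_score(tree_a, states)[1])  # 1
    print(fitch_score(tree_b, states)[1])  # 2
```

Summing this count over all characters in a taxon-by-character matrix gives the length of a cladogram, and the most parsimonious cladogram is the one with the smallest total. In this toy case the tree grouping Homo with Pan needs one change and the alternative needs two.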
Hopple, J S; Vilgalys, R
1999-10-01
Phylogenetic relationships were investigated in the mushroom genus Coprinus based on sequence data from the nuclear encoded large-subunit rDNA gene. Forty-seven species of Coprinus and 19 additional species from the families Coprinaceae, Strophariaceae, Bolbitiaceae, Agaricaceae, Podaxaceae, and Montagneaceae were studied. A total of 1360 sites was sequenced across seven divergent domains and intervening sequences. A total of 302 phylogenetically informative characters was found. Ninety-eight percent of the average divergence between taxa was located within the divergent domains, with domains D2 and D8 being most divergent and domains D7 and D10 the least divergent. An empirical test of phylogenetic signal among divergent domains also showed that domains D2 and D3 had the lowest levels of homoplasy. Two equally most parsimonious trees were resolved using Wagner parsimony. A character-state weighted analysis produced 12 equally most parsimonious trees similar to those generated by Wagner parsimony. Phylogenetic analyses employing topological constraints suggest that none of the major taxonomic systems proposed for subgeneric classification is able to completely reflect phylogenetic relationships in Coprinus. A strict consensus integration of the two Wagner trees demonstrates the problematic nature of choosing outgroups within dark-spored mushrooms. The genus Coprinus is found to be polyphyletic and is separated into three distinct clades. Most Coprinus taxa belong to the first two clades, which together form a larger monophyletic group with Lacrymaria and Psathyrella in basal positions. A third clade contains members of Coprinus section Comati as well as the genus Leucocoprinus, Podaxis pistillaris, Montagnea arenaria, and Agaricus pocillator. This third clade is separated from the other species of Coprinus by members of the families Strophariaceae and Bolbitiaceae and the genus Panaeolus. Copyright 1999 Academic Press.
Principle of Parsimony, Fake Science, and Scales
NASA Astrophysics Data System (ADS)
Yeh, T. C. J.; Wan, L.; Wang, X. S.
2017-12-01
Considering the difficulty of predicting the exact motions of water molecules, and the scale of our interest (the bulk behavior of many molecules), Fick's law (the diffusion concept) was created to predict the solute diffusion process in space and time. G.I. Taylor (1921) demonstrated that the random motion of the molecules reaches the Fickian regime in less than a second if our sampling scale is large enough to reach the ergodic condition. Fick's law is widely accepted for describing molecular diffusion as such. This fits the definition of the parsimony principle at the scale of our concern. Similarly, the advection-dispersion or convection-dispersion equation (ADE or CDE) has been found quite satisfactory for analysis of concentration breakthroughs of solute transport in uniformly packed soil columns. This is attributed to the fact that the solute is often released over the entire cross-section of the column, so that it has sampled many pore-scale heterogeneities and met the ergodicity assumption. Further, the uniformly packed column contains a large number of stationary pore-size heterogeneities. The solute thus reaches the Fickian regime after traveling a short distance along the column. Moreover, breakthrough curves are concentrations integrated over the column cross-section (the scale of our interest), and they meet the ergodicity assumption embedded in the ADE and CDE. On the contrary, scales of heterogeneity in most groundwater pollution problems evolve as contaminants travel. They are much larger than the scale of our observations and our interests, so that the ergodic and the Fickian conditions are difficult to meet. Upscaling Fick's law for solute dispersion, and deriving universal rules of dispersion for field- or basin-scale pollution migration, are merely misuses of the parsimony principle and lead to a fake science (i.e., the development of theories for predicting processes that cannot be observed).
The appropriate principle of parsimony for these situations dictates mapping large-scale heterogeneities in as much detail as possible and adapting Fick's law for the effects of small-scale heterogeneity resulting from our inability to characterize them in detail.
Whaley, Dana H.; Sheedy, Patrick F.; Peyser, Patricia A.
2010-01-01
Objective: The etiology of breast arterial calcification (BAC) is not well understood. We examined reproductive history and cardiovascular disease (CVD) risk factor associations with the presence of detectable BAC in asymptomatic postmenopausal women. Methods: Reproductive history and CVD risk factors were obtained in 240 asymptomatic postmenopausal women from a community-based research study who had a screening mammogram within 2 years of their participation in the study. The mammograms were reviewed for the presence of detectable BAC. Age-adjusted logistic regression models were fit to assess the association between each risk factor and the presence of BAC. Multiple variable logistic regression models were used to identify the most parsimonious model for the presence of BAC. Results: The prevalence of BAC increased with increased age (p < 0.0001). The most parsimonious logistic regression model for BAC presence included age at time of examination, increased parity (p = 0.01), earlier age at first birth (p = 0.002), weight, and an age-by-weight interaction term (p = 0.004). Older women with a smaller body size had a higher probability of having BAC than women of the same age with a larger body size. Conclusions: The presence or absence of BAC at mammography may provide an assessment of a postmenopausal woman's lifetime estrogen exposure and indicate women who could be at risk for hormonally related conditions. PMID:20629578
Romain, Ahmed Jerôme; Bernard, Paquito; Hokayem, Marie; Gernigon, Christophe; Avignon, Antoine
2016-03-01
This study aimed to test three factorial structures conceptualizing the processes of change (POC) from the transtheoretical model and to examine the relationships between the POC and stages of change (SOC) among overweight and obese adults. Cross-sectional study. This study was conducted at the University Hospital of Montpellier, France. A sample of 289 overweight or obese participants (199 women) was enrolled in the study. Participants completed the POC and SOC questionnaires during a 5-day hospitalization for weight management. Structural equation modeling was used to compare the different factorial structures. The unweighted least-squares method was used to identify the best-fit indices for the five fully correlated model (goodness-of-fit statistic = .96; adjusted goodness-of-fit statistic = .95; standardized root mean residual = .062; normed-fit index = .95; parsimonious normed-fit index = .83; parsimonious goodness-of-fit statistic = .78). The multivariate analysis of variance was significant (p < .001). A post hoc test showed that individuals in advanced SOC used more of both experiential and behavioral POC than those in preaction stages, with effect sizes ranging from .06 to .29. This study supports the validity of the factorial structure of POC concerning physical activity and confirms the assumption that, in this context, people with excess weight use both experiential and behavioral processes. These preliminary results should be confirmed in a longitudinal study. © The Author(s) 2016.
Waits, L P; Sullivan, J; O'Brien, S J; Ward, R H
1999-10-01
The bear family (Ursidae) presents a number of phylogenetic ambiguities as the evolutionary relationships of the six youngest members (ursine bears) are largely unresolved. Recent mitochondrial DNA analyses have produced conflicting results with respect to the phylogeny of ursine bears. In an attempt to resolve these issues, we obtained 1916 nucleotides of mitochondrial DNA sequence data from six gene segments for all eight bear species and conducted maximum likelihood and maximum parsimony analyses on all fragments separately and combined. All six single-region gene trees gave different phylogenetic estimates; however, only for control region data was this significantly incongruent with the results from the combined data. The optimal phylogeny for the combined data set suggests that the giant panda is most basal followed by the spectacled bear. The sloth bear is the basal ursine bear, and there is weak support for a sister taxon relationship of the American and Asiatic black bears. The sun bear is sister taxon to the youngest clade containing brown bears and polar bears. Statistical analyses of alternate hypotheses revealed a lack of strong support for many of the relationships. We suggest that the difficulties surrounding the resolution of the evolutionary relationships of the Ursidae are linked to the existence of sequential rapid radiation events in bear evolution. Thus, unresolved branching orders during these time periods may represent an accurate representation of the evolutionary history of bear species. Copyright 1999 Academic Press.
Eamsobhana, Praphathip; Lim, Phaik Eem; Zhang, Hongman; Gan, Xiaoxian; Yong, Hoi Sen
2010-12-01
The phylogenetic relationships and molecular differentiation of three species of angiostrongylid nematodes (Angiostrongylus cantonensis, Angiostrongylus costaricensis and Angiostrongylus malaysiensis) were studied using the AC primers for a 66-kDa protein gene of A. cantonensis. The AC primers successfully amplified the genomic DNA of these angiostrongylid nematodes. No amplification was detected for the DNA of Ascaris lumbricoides, Ascaris suum, Anisakis simplex, Gnathostoma spinigerum, Toxocara canis, and Trichinella spiralis. The maximum-parsimony (MP) consensus tree and the maximum-likelihood (ML) tree both showed that the Angiostrongylus taxa could be divided into two major clades - Clade 1 (A. costaricensis) and Clade 2 (A. cantonensis and A. malaysiensis) with a full support bootstrap value. A. costaricensis is the most distant taxon. A. cantonensis is a sister group to A. malaysiensis; these two taxa (species) are clearly separated. There is no clear distinction between the A. cantonensis samples from four different geographical localities (Thailand, China, Japan and Hawaii); only some of the samples are grouped ranging from no support or low support to moderate support of bootstrap values. The published nucleotide sequences of A. cantonensis adult-specific native 66kDa protein mRNA, clone L5-400 from Taiwan (U17585) appear to be very distant from the A. cantonensis samples from Thailand, China, Japan and Hawaii, with the uncorrected p-distance values ranging from 26.87% to 29.92%.
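The uncorrected p-distance quoted above is simply the proportion of aligned sites at which two sequences differ. A minimal sketch follows, assuming the sequences are already aligned to equal length and that sites with a gap or ambiguous base in either sequence are skipped (pairwise deletion). That handling is an assumption of this sketch, not something stated in the abstract, and the two short sequences are invented.

```python
def p_distance(seq_a, seq_b, skip="-N?"):
    """Uncorrected p-distance between two aligned sequences.

    Sites where either sequence has a character in `skip` are ignored
    (pairwise deletion; an assumption of this sketch).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differences = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in skip or b in skip:
            continue
        compared += 1
        if a != b:
            differences += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differences / compared


# Invented 10-site alignment: the sequences differ at 2 of 10 sites.
d = p_distance("ACGTACGTAC", "ACGTTCGTAA")
print(f"p-distance = {d:.2%}")
```

Because no substitution model is applied, the p-distance underestimates the true divergence when multiple changes have hit the same site, which matters at values as large as the 26.87% to 29.92% reported here.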
Zhao, Guang-Hui; Jia, Yan-Qing; Cheng, Wen-Yu; Zhao, Wen; Bian, Qing-Qing; Liu, Guo-Hua
2014-07-11
Nematodirus spp. are among the most common nematodes of ruminants worldwide. N. oiratianus and N. spathiger are distributed worldwide as highly prevalent gastrointestinal nematodes, which cause emerging health problems and economic losses. Accurate identification of Nematodirus species is essential to develop effective control strategies for Nematodirus infection in ruminants. Mitochondrial DNA (mtDNA) could provide powerful genetic markers for identifying these closely related species and resolving phylogenetic relationships at different taxonomic levels. In the present study, the complete mitochondrial (mt) genomes of N. oiratianus and N. spathiger from small ruminants in China were obtained using Long-range PCR and sequencing. The complete mt genomes of N. oiratianus and N. spathiger were 13,765 bp and 13,519 bp in length, respectively. Both mt genomes were circular and consisted of 36 genes, including 12 genes encoding proteins, 2 genes encoding rRNA, and 22 genes encoding tRNA. Phylogenetic analyses based on the concatenated amino acid sequence data of all 12 protein-coding genes by Bayesian inference (BI), Maximum likelihood (ML) and Maximum parsimony (MP) showed that the two Nematodirus species (Molineidae) were closely related to Dictyocaulidae. The availability of the complete mtDNA sequences of N. oiratianus and N. spathiger not only provides new mtDNA sources for a better understanding of nematode mt genomics and phylogeny, but also provides novel and useful genetic markers for studying diagnosis, population genetics and molecular epidemiology of Nematodirus spp. in small ruminants.
Hernández-Mena, David Iván; García-Prieto, Luís; García-Varela, Martín
2014-04-01
Parastrigea plataleae n. sp. (Digenea: Strigeidae) is described from the intestine of the roseate spoonbill Platalea ajaja (Threskiornithidae) from four localities on the Pacific coast of Mexico. The new species is mainly distinguished from the other 18 described species of Parastrigea based on the ratio of its hindbody length to forebody length. A principal component analysis (PCA) of 16 morphometric traits for 15 specimens of P. plataleae n. sp., five of Parastrigea cincta and 11 of Parastrigea diovadena previously recorded in Mexico, clearly shows three clusters, which correspond to the three species. DNA sequences of the internal transcribed spacers (ITSs) of ribosomal DNA and the mitochondrial gene cytochrome c oxidase subunit I (cox 1) were used to corroborate this morphological distinction. The genetic divergence estimated among P. plataleae n. sp., P. cincta and P. diovadena ranged from 0.5 to 1.48% for ITSs and from 9.31 to 11.47% for cox 1. Maximum parsimony (MP) and maximum likelihood (ML) analyses were performed on the combined datasets (ITSs+cox 1) and on each dataset alone. All of the phylogenetic analyses indicated that the specimens from the roseate spoonbill represent a clade with strong bootstrap support. The morphological evidence and the genetic divergence in combination with the reciprocal monophyly in all of the phylogenetic trees support the hypothesis that the digeneans found in the intestines of roseate spoonbills represent a new species. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Liu, Guo-Hua; Wang, Yan; Xu, Min-Jun; Zhou, Dong-Hui; Ye, Yong-Gang; Li, Jia-Yuan; Song, Hui-Qun; Lin, Rui-Qing; Zhu, Xing-Quan
2012-12-01
For many years, whipworms (Trichuris spp.) have been described with a relatively narrow range of both morphological and biometrical features. Moreover, there has been insufficient discrimination between congeners (or closely related species). In the present study, we determined the complete mitochondrial (mt) genomes of two whipworms, Trichuris ovis and Trichuris discolor, compared them and then tested the hypothesis that T. ovis and T. discolor are distinct species by phylogenetic analyses (using Bayesian inference, maximum likelihood and maximum parsimony) based on the deduced amino acid sequences of the mt protein-coding genes. The complete mt genomes of T. ovis and T. discolor were 13,946 bp and 13,904 bp in size, respectively. Both mt genomes are circular, and consist of 37 genes, including 13 genes coding for proteins, 2 genes for rRNA, and 22 genes for tRNA. The gene content and arrangement are identical to those of the human and pig whipworms Trichuris trichiura and Trichuris suis. Taken together, these analyses showed genetic distinctiveness and strongly supported the recent proposal that T. ovis and T. discolor are distinct species using nuclear ribosomal DNA and a portion of the mtDNA sequence dataset. The availability of the complete mtDNA sequences of T. ovis and T. discolor provides novel genetic markers for studying the population genetics, diagnostics and molecular epidemiology of T. ovis and T. discolor. Copyright © 2012 Elsevier B.V. All rights reserved.
Wang, Yan; Liu, Guo-Hua; Li, Jia-Yuan; Xu, Min-Jun; Ye, Yong-Gang; Zhou, Dong-Hui; Song, Hui-Qun; Lin, Rui-Qing; Zhu, Xing-Quan
2013-02-01
This study examined sequence variation in three mitochondrial DNA (mtDNA) regions, namely cytochrome c oxidase subunit 1 (cox1), NADH dehydrogenase subunit 5 (nad5) and cytochrome b (cytb), among Trichuris ovis isolates from different hosts in Guangdong Province, China. A portion of the cox1 (pcox1), nad5 (pnad5) and cytb (pcytb) genes was amplified separately from individual whipworms by PCR, and was subjected to sequencing from both directions. The size of the sequences of pcox1, pnad5 and pcytb was 618, 240 and 464 bp, respectively. Although the intra-specific sequence variations within T. ovis were 0-0.8% for pcox1, 0-0.8% for pnad5 and 0-1.9% for pcytb, the inter-specific sequence differences among members of the genus Trichuris were significantly higher, being 24.3-26.5% for pcox1, 33.7-56.4% for pnad5 and 24.8-26.1% for pcytb, respectively. Phylogenetic analyses using combined sequences of pcox1, pnad5 and pcytb, with three different computational algorithms (maximum likelihood, maximum parsimony and Bayesian inference), indicated that all of the T. ovis isolates grouped together with high statistical support. These findings demonstrated the existence of intra-specific variation in mtDNA sequences among T. ovis isolates from different hosts, and have implications for studying molecular epidemiology and population genetics of T. ovis.
Šípek, Petr; Fabrizi, Silvia; Eberle, Jonas; Ahrens, Dirk
2016-08-01
Rose chafers (Cetoniinae) are a large group of flower visitors within the pleurostict Scarabaeidae that are characterized by their distinctive flight mode with nearly closed forewings. Despite their popularity, this is the first study to use molecular data to infer their phylogenetic relationships. We used partial gene sequences for 28S rRNA, cytochrome oxidase I (cox1) and 16S rRNA (rrnL) for 299 species, representing most recognized subfamilies of Scarabaeidae, including 125 species of Cetoniinae. Combined analyses using maximum parsimony, maximum likelihood and Bayesian inferences recovered Cetoniinae as monophyletic in all analyses, with the sister clade composed of Rutelinae and Dynastinae. Rutelinae was always recovered as paraphyletic with respect to Dynastinae. Trichiini sensu lato (s.l.) was recovered as a polyphyletic clade, while Cetoniini s.l. was recovered as paraphyletic. The inferred topologies were also supported by site bootstrapping of the ML trees. With the exception of Cremastochelini, most tribes of Cetoniinae were poly- or paraphyletic, indicating the critical need for a careful revision of rose chafer classification. Analysis of elytral base structure (including 11 scored characters) in the context of phylogeny, revealed a complex, concerted and rapid transformation of the single trait elements linked to a modified flight mode with closed elytra. This appears to be unlinked to the lateral sinuation of the elytra, which originated independently several times at later stages in the evolution of the group. Copyright © 2016 Elsevier Inc. All rights reserved.
Rosas-Valdez, Rogelio; Morrone, Juan J; García-Varela, Martín
2012-08-01
Species of Floridosentis (Acanthocephala) are common parasites of mullets (Mugil spp., Mugilidae) found in tropical marine and brackish water in the Americas. Floridosentis includes 2 species distributed in Mexico, i.e., Floridosentis pacifica, restricted to the Pacific Ocean near Salina Cruz, Oaxaca, and Floridosentis mugilis, distributed along the coast of the Pacific Ocean and the Gulf of Mexico. We sampled 18 populations of F. mugilis and F. pacifica (12 from the Pacific and 6 from the Gulf of Mexico) and sequenced a fragment of the rDNA large subunit to evaluate phylogenetic relationships of populations of Floridosentis spp. from Mexico. Species identification of museum specimens of F. mugilis from the Pacific Ocean was confirmed by examination of morphological traits. Phylogenetic trees inferred with maximum parsimony, maximum likelihood, and Bayesian inference indicate that Floridosentis is monophyletic, comprising 2 major well-supported clades, the first clade corresponding to F. mugilis from the Gulf of Mexico, and the second to F. pacifica from the Pacific Ocean. Genetic divergence between species ranged from 7.68 to 8.60%. Intraspecific divergence ranged from 0.14 to 0.86% for F. mugilis and from 1.72 to 4.49% for F. pacifica. Data obtained from diagnostic characters indicate that specimens from the Pacific Ocean in Mexico have differences in some traits among locations. These results are consistent with the phylogenetic hypothesis, indicating that F. pacifica is distributed in the Pacific Ocean in Mexico with 3 major lineages.
Prince, Mark A.; Connors, Gerard J.; Maisto, Stephen A.; Dearing, Ronda L.
2016-01-01
While past research has demonstrated a positive relationship between the therapeutic alliance (TA) and improved drinking outcomes, specific aspects of the alliance have received less attention. In this study, we examined the association between alliance characteristics during treatment and 4-month follow-up drinking reports. 65 treatment-seeking alcohol dependent clients who participated in 12 weeks of individual outpatient treatment provided weekly TA ratings during treatment and reported on pre-treatment, during treatment, and post-treatment alcohol use. Latent profile analysis was conducted to discern distinct profiles of client and therapist ratings of therapeutic alliance with similar alliance characteristics. TA profiles were based on clients’ and therapists’ mean alliance rating, minimum alliance rating, maximum alliance rating, the range of alliance ratings, and the difference in session number between maximum and minimum alliance ratings. 1- through 4- class models were fit to the data. Model fit was judged by comparative fit indices, substantive interpretability, and parsimony. Wald tests of mean equality determined whether classes differed on follow-up percentage of days abstinent (PDA) at 4 months posttreatment. 3-profile solutions provided the best fit for both client and therapist ratings of the therapeutic alliance. Client alliance rating profiles predicted drinking in the follow-up period, but therapist rating profiles did not. These results suggest that distinct profiles of the therapeutic alliance can be identified and that client alliance rating profiles are associated with frequency of alcohol use following outpatient treatment. PMID:26999350
Zhang, Jian-Qiang; Meng, Shi-Yong; Allen, Geraldine A; Wen, Jun; Rao, Guang-Yuan
2014-08-01
Rhodiola L. (Crassulaceae) is a mid-sized plant genus consisting of about 70 species, with most species distributed on the Qinghai-Tibetan Plateau (QTP) and the adjacent areas, and several species in north-east Asia, Europe, and North America. This study explored the origin and diversification history of Rhodiola and tested the biogeographic relationships between the QTP and other regions of the Northern Hemisphere. We sequenced the nuclear ribosomal internal transcribed spacers and eight plastid DNA fragments representing 55 species of Rhodiola, and reconstructed phylogenetic relationships with maximum parsimony, maximum likelihood and Bayesian inference. Several instances of incongruence between the nuclear and the plastid data sets were revealed, which can best be explained by reticulate evolution. Species of Rhodiola and Pseudosedum form a well-supported clade sister to Phedimus. Dating analysis suggested that the origin and diversification times of this group are largely correlated with the extensive uplifts of the Qinghai-Tibetan Plateau. Ancestral state reconstruction supports the hypothesis that Rhodiola originated on the QTP, and then dispersed to other regions of the Northern Hemisphere. Our findings highlight the importance of the uplifts of the Qinghai-Tibetan Plateau in promoting species diversification and the possible role of reticulate evolution in the diversification process. Our results also suggest the biogeographic significance of QTP as the source area in alpine plant evolution in the Northern Hemisphere. Copyright © 2014 Elsevier Inc. All rights reserved.
Ni, Lianghong; Zhao, Zhili; Xu, Hongxi; Chen, Shilin; Dorje, Gaawe
2016-02-15
Endemic to the Sino-Himalayan subregion, the medicinal alpine plant Gentiana straminea is a threatened species. Genetic and molecular data about it are deficient. Here we report the complete chloroplast (cp) genome sequence of G. straminea, the first sequenced member of the family Gentianaceae. The cp genome is 148,991 bp in length, including a large single copy (LSC) region of 81,240 bp, a small single copy (SSC) region of 17,085 bp and a pair of inverted repeats (IRs) of 25,333 bp. It contains 112 unique genes, including 78 protein-coding genes, 30 tRNAs and 4 rRNAs. The rps16 gene lacks exon2 between trnK-UUU and trnQ-UUG, which is the first rps16 pseudogene found in the nonparasitic plants of the Asterids clade. Sequence analysis revealed the presence of 13 forward repeats, 13 palindrome repeats and 39 simple sequence repeats (SSRs). An entire cp genome comparison study of G. straminea and four other species in Gentianales was carried out. Phylogenetic analyses using maximum likelihood (ML) and maximum parsimony (MP) were performed based on 69 protein-coding genes from 36 species of Asterids. The results strongly supported the position of Gentianaceae as one member of the order Gentianales. The complete chloroplast genome sequence will provide intragenic information for its conservation and contribute to research on the genetic and phylogenetic analyses of Gentianales and Asterids. Copyright © 2015 Elsevier B.V. All rights reserved.
A revision and phylogenetic analysis of Stoiba Spaeth 1909 (Coleoptera, Chrysomelidae)
Shin, Chulwoo; Chaboo, Caroline S.
2012-01-01
Stoiba Spaeth, 1909 is revised with a phylogenetic analysis of 38 adult morphological characters for nine Stoiba species and 11 outgroup species (Mesomphaliini, Ischyrosonychini, and Hemisphaerotini). Four Cuban species of Stoiba were not sampled. Parsimony analysis located the four most parsimonious trees. The strict consensus (CI=0.59, RI=0.78, Steps=83) resolved the monophyly of Stoiba. The monophyly of Stoiba is supported by pale yellow antennae, antennomere VII broader than its length, and rounded basal line of pronotum. An illustrated key to ten species of Stoiba is provided along with a distribution map of 11 species. Stoiba rufa Blake is synonymized with Stoiba swartzii (Thunberg) by a morphological comparison which includes female genitalia. PMID:23129988
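For readers unfamiliar with the tree statistics quoted in this abstract, the sketch below shows how the ensemble consistency index (CI) and retention index (RI) are conventionally computed from step counts: CI = m/s and RI = (g - s)/(g - m), where s is the observed tree length, m the minimum conceivable number of steps and g the maximum. Only s = 83 comes from the abstract. The values m = 49 and g = 204 are hypothetical, chosen here purely so that the arithmetic rounds to the reported indices; they are not reported by the authors.

```python
def consistency_index(min_steps, observed_steps):
    """Ensemble consistency index: CI = m / s."""
    return min_steps / observed_steps


def retention_index(min_steps, max_steps, observed_steps):
    """Ensemble retention index: RI = (g - s) / (g - m)."""
    return (max_steps - observed_steps) / (max_steps - min_steps)


observed = 83          # tree length given in the abstract
hypothetical_min = 49  # assumed for illustration only
hypothetical_max = 204 # assumed for illustration only

print(f"CI = {consistency_index(hypothetical_min, observed):.2f}")
print(f"RI = {retention_index(hypothetical_min, hypothetical_max, observed):.2f}")
```

A CI well below 1 indicates that many characters change more often than their minimum on the tree (homoplasy), while a fairly high RI indicates that much of the potential shared-derived similarity is still retained as support for groups.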
Extended Islands of Tractability for Parsimony Haplotyping
NASA Astrophysics Data System (ADS)
Fleischer, Rudolf; Guo, Jiong; Niedermeier, Rolf; Uhlmann, Johannes; Wang, Yihui; Weller, Mathias; Wu, Xi
Parsimony haplotyping is the problem of finding a smallest size set of haplotypes that can explain a given set of genotypes. The problem is NP-hard, and many heuristic and approximation algorithms as well as polynomial-time solvable special cases have been discovered. We propose improved fixed-parameter tractability results with respect to the parameter "size of the target haplotype set" k by presenting an O*(k^{4k})-time algorithm. This also applies to the practically important constrained case, where we can only use haplotypes from a given set. Furthermore, we show that the problem becomes polynomial-time solvable if the given set of genotypes is complete, i.e., contains all possible genotypes that can be explained by the set of haplotypes.
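To make the problem statement concrete, the sketch below solves tiny instances by exhaustive search. It is emphatically not the fixed-parameter algorithm of the paper: it enumerates all haplotype subsets and is only usable for a handful of sites. It assumes the common encoding in which haplotypes are 0/1 vectors and genotypes are 0/1/2 vectors, with 2 marking a heterozygous site; the function names and the example genotypes are hypothetical.

```python
from itertools import combinations, product


def explains(h1, h2, genotype):
    """True if the haplotype pair (h1, h2) resolves the genotype.

    Homozygous sites (0 or 1) need both haplotypes to carry that allele;
    heterozygous sites (2) need the two haplotypes to differ.
    """
    return all(
        (a == b == s) if s in (0, 1) else (a != b)
        for a, b, s in zip(h1, h2, genotype)
    )


def resolved_by(haplotypes, genotype):
    """True if some pair from the set (a haplotype may pair with itself) explains it."""
    return any(explains(h1, h2, genotype) for h1 in haplotypes for h2 in haplotypes)


def min_haplotype_set(genotypes):
    """Smallest haplotype set explaining all genotypes, by brute force."""
    n_sites = len(genotypes[0])
    candidates = list(product((0, 1), repeat=n_sites))
    for k in range(1, len(candidates) + 1):
        for subset in combinations(candidates, k):
            if all(resolved_by(subset, g) for g in genotypes):
                return subset
    return None


# Invented example with three sites.
genotypes = [(2, 2, 0), (0, 1, 0), (2, 1, 0)]
solution = min_haplotype_set(genotypes)
print(len(solution), solution)
```

The search space grows as the number of subsets of 2^m candidate haplotypes, which is why parameterized algorithms such as the one described in the abstract are of interest.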
Fong, Ted C T; Ho, Rainbow T H
2015-01-01
The aim of this study was to reexamine the dimensionality of the widely used 9-item Utrecht Work Engagement Scale using the maximum likelihood (ML) approach and Bayesian structural equation modeling (BSEM) approach. Three measurement models (1-factor, 3-factor, and bi-factor models) were evaluated in two split samples of 1,112 health-care workers using confirmatory factor analysis and BSEM, which specified small-variance informative priors for cross-loadings and residual covariances. Model fit and comparisons were evaluated by posterior predictive p-value (PPP), deviance information criterion, and Bayesian information criterion (BIC). None of the three ML-based models showed an adequate fit to the data. The use of informative priors for cross-loadings did not improve the PPP for the models. The 1-factor BSEM model with approximately zero residual covariances displayed a good fit (PPP>0.10) to both samples and a substantially lower BIC than its 3-factor and bi-factor counterparts. The BSEM results demonstrate empirical support for the 1-factor model as a parsimonious and reasonable representation of work engagement.
Carnevale, Silvana; Malandrini, Jorge Bruno; Pantano, María Laura; Soria, Claudia Cecilia; Rodrigues-Silva, Rosângela; Machado-Silva, José Roberto; Velásquez, Jorge Néstor; Kamenetzky, Laura
2017-10-15
Fasciola hepatica is a trematode showing genetic variation among isolates from different regions of the world. The objective of this work was to characterize for the first time F. hepatica isolates circulating in different regions of Argentina. Twenty-two adult flukes were collected from naturally infected bovine livers in different areas from Argentina and used for DNA extraction. We carried out PCR amplification and sequence analysis of the ribosomal internal transcribed spacer 1 (ITS1), mitochondrial nicotinamide adenine dinucleotide dehydrogenase subunits 4 and 5 (nad4 and nad5) and mitochondrial cytochrome c oxidase subunit I (cox1) genes as genetic markers. Phylogenies were reconstructed using maximum parsimony algorithm. A total of 6 haplotypes were found for cox1, 4 haplotypes for nad4 and 3 haplotypes for nad5. The sequenced ITS1 fragment was identical in all samples. The analyzed cox1 gene fragment is the most variable marker and is recommended for future analyses. No geographic association was found in the Argentinean samples. Copyright © 2017 Elsevier B.V. All rights reserved.
Baum, D A; Small, R L; Wendel, J F
1998-06-01
The phylogeny of baobab trees was analyzed using four data sets: chloroplast DNA restriction sites, sequences of the chloroplast rpl16 intron, sequences of the internal transcribed spacer (ITS) region of nuclear ribosomal DNA, and morphology. We sampled each of the eight species of Adansonia plus three outgroup taxa from tribe Adansonieae. These data were analyzed singly and in combination using parsimony. ITS and morphology provided the greatest resolution and were largely concordant. The two chloroplast data sets showed concordance with one another but showed significant conflict with ITS and morphology. A possible explanation for the conflict is genealogical discordance within the Malagasy Longitubae, perhaps due to introgression events. A maximum-likelihood analysis of branching times shows that the dispersal between Africa and Australia occurred well after the fragmentation of Gondwana and therefore involved overwater dispersal. The phylogeny does not permit unambiguous reconstruction of floral evolution but suggests the plausible hypothesis that hawkmoth pollination was ancestral in Adansonia and that there were two parallel switches to pollination by mammals in the genus.
Kakiuchi, Nobuko; Atsumi, Toshiyuki; Higuchi, Mari; Kamikawa, Shohei; Miyako, Haruka; Wakita, Yuriko; Ohtsuka, Isao; Hayashi, Shigeki; Hishida, Atsuyuki; Kawahara, Nobuo; Nishizawa, Makoto; Yamagishi, Takashi; Kadota, Yuichi
2015-01-01
Aconite tuber is a representative crude drug for warming the body internally in Japanese Kampo medicine and Chinese traditional medicine. The crude drug is used in major prescriptions for the aged. Varieties of Aconitum plants are distributed throughout the Japanese Islands, especially Hokkaido. With the aim of identifying the medicinal potential of Aconitum plants from Hokkaido, 107 specimens were collected from 36 sites in the summer of 2011 and 2012. Their nuclear DNA region, internal transcribed spacer (ITS), and aconitine alkaloid contents were analyzed. Phylogenetic analysis of ITS by maximum parsimony showed that the majority of the specimens were grouped into one cluster (cluster I), separated from the other cluster (cluster II) consisting of alpine specimens. The aconitine alkaloid content of the tuberous roots of 76 specimens showed 2 aspects: specimens from the same collection site showed similar aconitine alkaloid profiles, and cluster I specimens from different habitats showed various alkaloid profiles. Environmental pressure of each habitat is presumed to have caused the morphology and aconitine alkaloid profile of these genetically similar specimens to diversify.
Molecular identification of hard ticks (Ixodes sp.) infesting rodents in Selangor, Malaysia
NASA Astrophysics Data System (ADS)
Ishak, Siti Nabilah; Shiang, Lim Fang; Taib, Farah Shafawati Mohd; Jing, Khoo Jing; Nor, Shukor Md; Yusof, Muhammad Afif; Sah, Shahrul Anuar Mohd; Sitam, Frankie Thomas; Japning, Jeffrine Rovie Ryan
2018-04-01
This study aims to identify hard ticks (Ixodes sp.) infesting rodents in three different sites in Selangor, Malaysia using a molecular approach. A total of 11 individual ticks infesting four different host species (Rattus tiomanicus, Rattus rattus, Maxomys surifer and Sundamys muelleri) were examined based on their morphological features, followed by molecular identification using the mitochondrial 16S rDNA gene. Confirmation of the species identity was accomplished by using the BLAST program. Clustering analysis based on 16S rDNA sequences was carried out by constructing Neighbour-joining (NJ) and Maximum parsimony (MP) trees using MEGA 7 to clarify the genetic identity of Ixodes sp. Based on morphological features, the ticks could be identified only to genus level, as most of the samples were fully engorged, damaged and lacked morphological characters. However, molecular analysis of samples revealed 99% similarity with Ixodes granulatus from the GenBank database. Thus, the result of this study showed that all these ticks (Ixodes granulatus) were genetically affiliated to a monophyletic group with highly homogeneous sequences.
Livistona palms in Australia: ancient relics or opportunistic immigrants?
Crisp, Michael D; Isagi, Yuji; Kato, Yohei; Cook, Lyn G; Bowman, David M J S
2010-02-01
Eighteen of the 34 species of the fan palm genus Livistona (Arecaceae) are restricted to Australia and southern New Guinea, east of Wallace's Line, an ancient biogeographic boundary between the former supercontinents Laurasia and Gondwana. The remaining species extend from SE Asia to Africa, west of Wallace's Line. Competing hypotheses contend that Livistona is (a) ancient, its current distribution a relict of the supercontinents, or (b) a Miocene immigrant from the north into Australia as it drifted towards Asia. We have tested these hypotheses using Bayesian and penalized likelihood molecular dating based on 4Kb of nuclear and chloroplast DNA sequences with multiple fossil calibration points. Ancestral areas and biomes were reconstructed using parsimony and maximum likelihood. We found strong support for the second hypothesis, that a single Livistona ancestor colonized Australia from the north about 10-17Ma. Spread and diversification of the genus within Australia was likely favoured by a transition from the aseasonal wet to monsoonal biome, to which it could have been preadapted by fire-tolerance. Copyright (c) 2009 Elsevier Inc. All rights reserved.
The Cladophora complex (Chlorophyta): new views based on 18S rRNA gene sequences.
Bakker, F T; Olsen, J L; Stam, W T; van den Hoek, C
1994-12-01
Evolutionary relationships among species traditionally ascribed to the Siphonocladales/Cladophorales have remained unclear due to a lack of phylogenetically informative characters and extensive morphological plasticity resulting in morphological convergence. This study explores some of the diversity within the generic complex Cladophora and its siphonocladalean allies. Twelve species of Cladophora representing 6 of the 11 morphological sections recognized by van den Hoek were analyzed along with 8 siphonocladalean species using 18S rRNA gene sequences. The final alignment consisted of 1460 positions containing 92 phylogenetically informative substitutions. Weighting schemes (EOR weighting, combinatorial weighting) were applied in maximum parsimony analysis to correct for substitution bias. Stem characters were weighted 0.66 relative to single-stranded characters to correct for secondary structural constraints. Both weighting approaches resulted in greater phylogenetic resolution. Results confirm that there is no basis for the independent recognition of the Cladophorales and Siphonocladales. The Siphonocladales is polyphyletic, and Cladophora is paraphyletic. All analyses support two principal lineages, of which one contains predominantly tropical members including almost all siphonocladalean taxa, while the other lineage consists of mostly warm- to cold-temperate species of Cladophora.
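The character weighting described in this abstract can be illustrated with a small sketch. It is not the authors' software or data: the tree, the four-site alignment, and the stem/loop mask below are hypothetical, and only the 0.66 stem weight is taken from the abstract. The sketch counts changes per site with the standard Fitch algorithm on a fixed rooted binary tree and then sums them with per-site weights.

```python
from typing import Dict, List, Set, Tuple, Union

# A rooted binary tree: either a leaf name or a (left, right) pair.
Tree = Union[str, Tuple["Tree", "Tree"]]


def fitch_site(tree: Tree, states: Dict[str, str]) -> Tuple[Set[str], int]:
    """Fitch pass for one site: return (candidate state set, change count)."""
    if isinstance(tree, str):
        return {states[tree]}, 0
    left_set, left_cost = fitch_site(tree[0], states)
    right_set, right_cost = fitch_site(tree[1], states)
    common = left_set & right_set
    if common:
        # Children agree on at least one state: no extra change needed.
        return common, left_cost + right_cost
    # Children disagree: take the union and count one change.
    return left_set | right_set, left_cost + right_cost + 1


def weighted_parsimony(tree: Tree, alignment: Dict[str, str],
                       weights: List[float]) -> float:
    """Sum of per-site Fitch change counts, each multiplied by its weight."""
    total = 0.0
    for i, weight in enumerate(weights):
        states = {taxon: seq[i] for taxon, seq in alignment.items()}
        _, changes = fitch_site(tree, states)
        total += weight * changes
    return total


# Hypothetical toy data (not from the study).
alignment = {"A": "ACGT", "B": "ACGA", "C": "GCTA", "D": "GCTT"}
tree = (("A", "B"), ("C", "D"))
is_stem = [True, True, False, False]          # assumed secondary-structure mask
weights = [0.66 if stem else 1.0 for stem in is_stem]

score = weighted_parsimony(tree, alignment, weights)
print(round(score, 2))
```

Down-weighting stem positions reduces the influence of paired sites, which tend to change together, on the total tree length; the unweighted length of the same toy tree is obtained by passing a weight of 1.0 for every site.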
A Devonian tetrapod-like fish reveals substantial parallelism in stem tetrapod evolution.
Zhu, Min; Ahlberg, Per E; Zhao, Wen-Jin; Jia, Lian-Tao
2017-10-01
The fossils assigned to the tetrapod stem group document the evolution of terrestrial vertebrates from lobe-finned fishes. During the past 18 years the phylogenetic structure of this stem group has remained remarkably stable, even when accommodating new discoveries such as the earliest known stem tetrapod Tungsenia and the elpistostegid (fish-tetrapod intermediate) Tiktaalik. Here we present a large lobe-finned fish from the Late Devonian period of China that disrupts this stability. It combines characteristics of rhizodont fishes (supposedly a basal branch in the stem group, distant from tetrapods) with derived elpistostegid-like and tetrapod-like characters. This mélange of characters may reflect either detailed convergence between rhizodonts and elpistostegids plus tetrapods, under a phylogenetic scenario deduced from Bayesian inference analysis, or a previously unrecognized close relationship between these groups, as supported by maximum parsimony analysis. In either case, the overall result reveals a substantial increase in homoplasy in the tetrapod stem group. It also suggests that ecological diversity and biogeographical provinciality in the tetrapod stem group have been underestimated.
Subspecies identification of captive Orang Utan in Melaka based on D-loop mitochondria DNA
NASA Astrophysics Data System (ADS)
Kamaluddin, Siti Norsyuhada; Yaakop, Salmah; Idris, Wan Mohd Razi; Rovie-Ryan, Jeffrine Japning; Md-Zain, Badrul Munir
2018-04-01
Mitochondrial DNA of Bornean Orang Utan populations suggests that there are three different subspecies (Pongo pygmaeus pygmaeus; Sarawak & Northwest Kalimantan, P. p. wurmbii; Southern West Kalimantan and Central Kalimantan, P. p. morio; East Kalimantan and Sabah). The subspecies of Orang Utans in captivity are difficult to determine through morphological observation. Thus, misidentification by rangers or zoo staff leads to unwanted consequences, especially for Orang Utan conservation efforts. The main objective of this study was to identify the subspecies and the geographic origin of 10 Orang Utans in Zoo Melaka and A' Famosa by using partial mitochondrial D-loop gene sequences. DNA of all individuals was extracted from FTA cards. Data analyses were performed using Maximum Parsimony (MP) and Neighbor Joining (NJ). Molecular phylogeny analysis revealed that all the samples likely belong to one species of Sumatran Orang Utan (P. abelii) and three different subspecies of Bornean Orang Utans (P. p. pygmaeus, P. p. morio, and P. p. wurmbii). The results obtained in this study indirectly help the management of zoos in terms of conservation and visitor education.
Adaptive Covariation between the Coat and Movement Proteins of Prunus Necrotic Ringspot Virus
Codoñer, Francisco M.; Fares, Mario A.; Elena, Santiago F.
2006-01-01
The relative functional and/or structural importance of different amino acid sites in a protein can be assessed by evaluating the selective constraints to which they have been subjected during the course of evolution. Here we explore such constraints at the linear and three-dimensional levels for the movement protein (MP) and coat protein (CP) encoded by RNA 3 of prunus necrotic ringspot ilarvirus (PNRSV). By a maximum-parsimony approach, the nucleotide sequences from 46 isolates of PNRSV varying in symptomatology, host tree, and geographic origin have been analyzed and sites under different selective pressures have been identified in both proteins. We have also performed covariation analyses to explore whether changes in certain amino acid sites condition subsequent variation in other sites of the same protein or the other protein. These covariation analyses shed light on which particular amino acids should be involved in the physical and functional interaction between MP and CP. Finally, we discuss these findings in the light of what is already known about the implication of certain sites and domains in structure and protein-protein and RNA-protein interactions. PMID:16731922
Phylogenetic relationships in Myrceugenia (Myrtaceae) based on plastid and nuclear DNA sequences
Murillo-A., José; Ruiz-P., Eduardo; Landrum, Leslie R.; Stuessy, Tod F.; Barfuss, Michael H.J.
2012-01-01
Myrceugenia is a genus endemic to South America with a disjunct distribution: 12 species occurring mainly in central Chile and approximately 25 in southeastern Brazil. Relationships are reconstructed within Myrceugenia from four plastid markers (partial trnK-matK, rpl32-trnL, trnQ-5′rps16 and rpl16) and two ribosomal nuclear regions (ETS and ITS) using maximum parsimony and Bayesian analyses. Relationships inferred previously from morphological data are not completely consistent with those from molecular data. All molecular analyses support the hypothesis that Myrceugenia is monophyletic, except for M. fernandeziana, which falls outside the genus. Chilean species and Brazilian species form two separate lineages. Chilean species form three early diverging clades, whereas Brazilian species are a strongly supported monophyletic group in a terminal position. Least average evolutionary divergence, low resolution, short branches, and high species diversity found in the Brazilian clade suggest rapid radiation. Geographical distributions and phylogenetic reconstructions suggest that extant Myrceugenia species arose in northern Chile followed by colonization southward and finally to the Juan Fernández Islands and southeastern Brazil. PMID:22155422
Sparse-grid, reduced-basis Bayesian inversion: Nonaffine-parametric nonlinear equations
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chen, Peng, E-mail: peng@ices.utexas.edu; Schwab, Christoph, E-mail: christoph.schwab@sam.math.ethz.ch
2016-07-01
We extend the reduced basis (RB) accelerated Bayesian inversion methods for affine-parametric, linear operator equations which are considered in [16,17] to non-affine, nonlinear parametric operator equations. We generalize the analysis of sparsity of parametric forward solution maps in [20] and of Bayesian inversion in [48,49] to the fully discrete setting, including Petrov–Galerkin high-fidelity (“HiFi”) discretization of the forward maps. We develop adaptive, stochastic collocation based reduction methods for the efficient computation of reduced bases on the parametric solution manifold. The nonaffinity and nonlinearity with respect to (w.r.t.) the distributed, uncertain parameters and the unknown solution is collocated; specifically, by the so-called Empirical Interpolation Method (EIM). For the corresponding Bayesian inversion problems, computational efficiency is enhanced in two ways: first, expectations w.r.t. the posterior are computed by adaptive quadratures with dimension-independent convergence rates proposed in [49]; the present work generalizes [49] to account for the impact of the PG discretization in the forward maps on the convergence rates of the Quantities of Interest (QoI for short). Second, we propose to perform the Bayesian estimation only w.r.t. a parsimonious, RB approximation of the posterior density. Based on the approximation results in [49], the infinite-dimensional parametric, deterministic forward map and operator admit N-term RB and EIM approximations which converge at rates which depend only on the sparsity of the parametric forward map. In several numerical experiments, the proposed algorithms exhibit dimension-independent convergence rates which equal, at least, the currently known rate estimates for N-term approximation. We propose to accelerate Bayesian estimation by first offline construction of reduced basis surrogates of the Bayesian posterior density. The parsimonious surrogates can then be employed for online data assimilation and for Bayesian estimation. They also open a perspective for optimal experimental design.
Li, Lang; Li, Jie; Rohwer, Jens G; van der Werff, Henk; Wang, Zhi-Hua; Li, Hsi-Wen
2011-09-01
The Persea group (Lauraceae) has a tropical and subtropical amphi-Pacific disjunct distribution with most of its members, and it includes two Macaronesian species. The relationships within the group are still controversial, and its intercontinental disjunction has not been investigated with extensive sampling and precise time dating. ITS and LEAFY intron II sequences of 78 Persea group species and nine other Lauraceae species were analyzed with maximum parsimony and Bayesian inference. Divergence time estimation employed a Bayesian Markov chain Monte Carlo method under a relaxed clock. Several traditional genera or subgenera within the Persea group form well-supported monophyletic groups except Alseodaphne and Dehaasia. The divergence time of the Persea group is estimated as ∼55.3 (95% highest posterior density [HPD] 41.4-69.9) million years ago (mya). Two major divergences within the Persea group are estimated as ∼51.9 (95% HPD 38.9-63.9) mya and ∼48.5 (95% HPD 35.9-59.9) mya. Persea can be retained as a genus by the inclusion of Apollonias barbujana and exclusion of a few species that do not fit into the established subgenera. A major revision is recommended for the delimitation between Alseodaphne, Dehaasia, and Nothaphoebe. We suggest that the Persea group originated from the Perseeae-Laureae radiation in early Eocene Laurasia. Its amphi-Pacific disjunction results from the disruption of boreotropical flora by climatic cooling during the mid- to late Eocene. The American-Macaronesian disjunction may be explained by long-distance dispersal.
Phylogeny of the Asian spiny frog tribe Paini (Family Dicroglossidae) sensu Dubois.
Che, Jing; Hu, Jian-sheng; Zhou, Wei-wei; Murphy, Robert W; Papenfuss, Theodore J; Chen, Ming-yong; Rao, Ding-qi; Li, Pi-peng; Zhang, Ya-ping
2009-01-01
The anuran tribe Paini, family Dicroglossidae, is known in this group only from Asia. The phylogenetic relationships and often the taxonomic recognition of species are controversial. In order to stabilize the classification, we used approximately 2100 bp of nuclear (rhodopsin, tyrosinase) and mitochondrial (12S, 16S rRNA) DNA sequence data to infer the phylogenetic relationships of these frogs. Phylogenetic trees reconstructed using Bayesian inference and maximum parsimony methods supported a monophyletic tribe Paini. Two distinct groups (I,II) were recovered with the mtDNA alone and the total concatenated data (mtDNA+nuDNA). The recognition of two genera, Quasipaa and Nanorana, was supported. Group I, Quasipaa, is widespread east of the Hengduan Mountain Ranges and consists of taxa from relatively low elevations in southern China, Vietnam and Laos. Group II, Nanorana, contains a mix of species occurring from high to low elevation predominantly in the Qinghai-Tibetan Plateau and Hengduan Mountain Ranges. The occurrence of frogs at high elevations appears to be a derived ecological condition. The composition of some major species groups based on morphological characteristics strongly conflicts with the molecular analysis. Some possible cryptic species are indicated by the molecular analyses. The incorporation of genetic data from type localities helped to resolve some of the taxonomic problems, although further combined analyses of morphological data from type specimens are required. The two nuDNA gene segments proved to be very informative for resolving higher phylogenetic relationships and more nuclear data should be explored to be more confident in the relationships.
Chang, Melanie L.; Yokoyama, Jennifer S.; Branson, Nick; Dyer, Donna J.; Hitte, Christophe; Overall, Karen L.
2009-01-01
Until recently, canine genetic research has not focused on population structure within breeds, which may confound the results of case–control studies by introducing spurious correlations between phenotype and genotype that reflect population history. Intrabreed structure may exist when geographical origin or divergent selection regimes influence the choices of potential mates for breeding dogs. We present evidence for intrabreed stratification from a genome-wide marker survey in a sample of unrelated dogs. We genotyped 76 Border Collies, 49 Australian Shepherds, 17 German Shepherd Dogs, and 17 Portuguese Water Dogs for our primary analyses using Affymetrix Canine v2.0 single-nucleotide polymorphism (SNP) arrays. Subsets of autosomal markers were examined using clustering algorithms to facilitate assignment of individuals to populations and estimation of the number of populations represented in the sample. SNPs passing stringent quality control filters were employed for explicitly phylogenetic analyses reconstructing relationships between individuals using maximum parsimony and Bayesian methods. We used simulation studies to explore the possible effects of intrabreed stratification on genome-wide association studies. These analyses demonstrate significant stratification in at least one of our primary breeds of interest, the Border Collie. Demographic and pedigree data suggest that this population substructure may result from geographic isolation or divergent selection regimes practiced by breeders with different breeding program goals. Simulation studies indicate that such stratification could result in false discovery rates significant enough to confound genome-wide association analyses. Intrabreed stratification should be accounted for when designing and interpreting the results of case–control association studies using purebred dogs.
Saski, Christopher; Lee, Seung-Bum; Fjellheim, Siri; Guda, Chittibabu; Jansen, Robert K.; Luo, Hong; Tomkins, Jeffrey; Rognli, Odd Arne; Clarke, Jihong Liu
2009-01-01
Comparisons of complete chloroplast genome sequences of Hordeum vulgare, Sorghum bicolor and Agrostis stolonifera to six published grass chloroplast genomes reveal that gene content and order are similar but two microstructural changes have occurred. First, the expansion of the IR at the SSC/IRa boundary that duplicates a portion of the 5′ end of ndhH is restricted to the three genera of the subfamily Pooideae (Agrostis, Hordeum and Triticum). Second, a 6 bp deletion in ndhK is shared by Agrostis, Hordeum, Oryza and Triticum, and this event supports the sister relationship between the subfamilies Erhartoideae and Pooideae. Repeat analysis identified 19–37 direct and inverted repeats 30 bp or longer with a sequence identity of at least 90%. Seventeen of the 26 shared repeats are found in all the grass chloroplast genomes examined and are located in the same genes or intergenic spacer (IGS) regions. Examination of simple sequence repeats (SSRs) identified 16–21 potential polymorphic SSRs. Five IGS regions have 100% sequence identity among Zea mays, Saccharum officinarum and Sorghum bicolor, whereas no spacer regions were identical among Oryza sativa, Triticum aestivum, H. vulgare and A. stolonifera despite their close phylogenetic relationship. Alignment of EST sequences and DNA coding sequences identified six C–U conversions in both Sorghum bicolor and H. vulgare but only one in A. stolonifera. Phylogenetic trees based on DNA sequences of 61 protein-coding genes of 38 taxa using both maximum parsimony and likelihood methods provide moderate support for a sister relationship between the subfamilies Erhartoideae and Pooideae. PMID:17534593
Lagos, Doris M; Voegtlin, David J; Coeur d'acier, Armelle; Giordano, Rosanna
2014-06-01
A phylogeny of the genus Aphis Linnaeus, 1758 was built primarily from specimens collected in the Midwest of the United States. A data matrix was constructed with 68 species and 41 morphological characters with respective character states of alate and apterous viviparous females. Dendrogram topologies of analyses performed using UPGMA (Unweighted Pair Group Method with Arithmetic Mean), Maximum Parsimony and Bayesian analysis of Cytochrome Oxidase I, Elongation Factor 1-α and primary endosymbiont Buchnera aphidicola 16S sequences were not congruent. Bayesian analysis strongly supported most terminal nodes of the phylogenetic trees. The phylogeny was strongly supported by EF1-α, and analysis of COI and EF1-α molecular data combined with morphological characters. It was not supported by single analysis of COI or Buchnera aphidicola 16S. Results from the Bayesian phylogeny show 4 main species groups: asclepiadis, fabae, gossypii, and middletonii. Results place Aphis and species of the genera Protaphis Börner, 1952, Toxoptera Koch, and Xerobion Nevsky, 1928 in a monophyletic clade. Morphological characters support this monophyly as well. The phylogeny shows that the monophyletic clade of the North American middletonii species group belong to the genus Protaphis: P. debilicornis (Gillette & Palmer, 1929), comb. nov., P. echinaceae (Lagos and Voegtlin, 2009), comb. nov., and P. middletonii (Thomas, 1879). The genus Toxoptera should be considered a subgenus of Aphis (stat. nov.). The analysis also indicates that the current genus Iowana Frison, 1954 should be considered a subgenus of Aphis (stat. nov.). © 2013 Institute of Zoology, Chinese Academy of Sciences.
Douglas, P K; Harris, Sam; Yuille, Alan; Cohen, Mark S
2011-05-15
Machine learning (ML) has become a popular tool for mining functional neuroimaging data, and there are now hopes of performing such analyses efficiently in real-time. Towards this goal, we compared accuracy of six different ML algorithms applied to neuroimaging data of persons engaged in a bivariate task, asserting their belief or disbelief of a variety of propositional statements. We performed unsupervised dimension reduction and automated feature extraction using independent component (IC) analysis and extracted IC time courses. Optimization of classification hyperparameters across each classifier occurred prior to assessment. Maximum accuracy was achieved at 92% for Random Forest, followed by 91% for AdaBoost, 89% for Naïve Bayes, 87% for a J48 decision tree, 86% for K*, and 84% for support vector machine. For real-time decoding applications, finding a parsimonious subset of diagnostic ICs might be useful. We used a forward search technique to sequentially add ranked ICs to the feature subspace. For the current data set, we determined that approximately six ICs represented a meaningful basis set for classification. We then projected these six IC spatial maps forward onto a later scanning session within subject. We then applied the optimized ML algorithms to these new data instances, and found that classification accuracy results were reproducible. Additionally, we compared our classification method to our previously published general linear model results on this same data set. The highest ranked IC spatial maps show similarity to brain regions associated with contrasts for belief > disbelief, and disbelief < belief. Copyright © 2010 Elsevier Inc. All rights reserved.
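The forward search mentioned in this abstract (adding ranked components one at a time and watching where accuracy levels off) can be sketched as follows. This is an illustration under stated assumptions, not the authors' pipeline: the nearest-centroid scorer, the toy data, and the function names are hypothetical, and the score is plain training accuracy, whereas a real analysis would use held-out or cross-validated accuracy.

```python
from typing import Callable, List, Sequence, Tuple


def forward_search(ranked: Sequence[int],
                   score: Callable[[List[int]], float],
                   tol: float = 0.0) -> Tuple[List[int], List[float]]:
    """Add ranked features one at a time and record the score after each step.

    Returns the shortest prefix of `ranked` whose score is within `tol`
    of the best score seen, together with the full score curve.
    """
    curve = [score(list(ranked[:k])) for k in range(1, len(ranked) + 1)]
    best = max(curve)
    for k, value in enumerate(curve, start=1):
        if value >= best - tol:
            return list(ranked[:k]), curve
    return list(ranked), curve  # not reached for a non-empty ranking


def centroid_accuracy(X: List[List[float]], y: List[int],
                      features: List[int]) -> float:
    """Training accuracy of a nearest-centroid classifier on chosen features."""
    labels = sorted(set(y))
    centroids = {}
    for lab in labels:
        rows = [x for x, t in zip(X, y) if t == lab]
        centroids[lab] = [sum(r[f] for r in rows) / len(rows) for f in features]
    correct = 0
    for x, t in zip(X, y):
        point = [x[f] for f in features]
        pred = min(labels, key=lambda lab: sum(
            (p - c) ** 2 for p, c in zip(point, centroids[lab])))
        correct += int(pred == t)
    return correct / len(y)


# Hypothetical toy data: three features, already ranked 0, 1, 2.
X = [[0, 0, 0], [0, 0, 1], [3, 0, 0],
     [4, 4, 0], [4, 4, 1], [2, 4, 1]]
y = [0, 0, 0, 1, 1, 1]

selected, curve = forward_search([0, 1, 2],
                                 lambda feats: centroid_accuracy(X, y, feats))
print(selected, curve)
```

The returned prefix is the smallest ranked subset that reaches the plateau of the accuracy curve, which is the sense in which a subset of components can be called parsimonious.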
Knoetze, Rinus; Swart, Antoinette
2014-12-09
A survey was performed to detect the presence of cyst nematodes in the Cape Floristic Region of South Africa. Soil was collected in the rhizosphere of the dominant plant species within blocks of indigenous vegetation and cysts were extracted from them. A total of 81 blocks of indigenous vegetation were sampled as described. Cysts were detected in 7 of these samples, representing 6 different vegetation types. One set of primers was used to amplify the ITS regions from these cysts, including the 5.8S ribosomal gene, as well as short parts of the 18S and 28S ribosomal genes. ITS-rDNA sequences from the indigenous isolates were aligned with selected sequences of other species from the Heteroderidae. Phylogenetic analyses to resolve the relationships between indigenous isolates and selected representatives of the Heteroderidae were conducted using the Maximum Parsimony method. The consensus tree resulting from alignment of the circumfenestrate cysts revealed that isolates SK18, WK1 and WK26 are included in a clade of Globodera species that parasitise non-solanaceous plants, forming a monophyletic group with G. millefolii, G. artemisiae, and an unidentified Globodera sp. from Portugal. In a tree resulting from the alignment of the Heterodera spp., isolates OK14 and WK2 are included in the Afenestrata group, forming a monophyletic group with H. orientalis. This survey unearthed at least four potentially new species of cyst nematodes, which may prove invaluable for the study of the evolution and biogeography of the group.
Wang, Pei; Lu, Yanli; Zheng, Mingmin; Rong, Tingzhao; Tang, Qilin
2011-01-01
Genetic relationship of a newly discovered teosinte from Nicaragua, Zea nicaraguensis with waterlogging tolerance, was determined based on randomly amplified polymorphic DNA (RAPD) markers and the internal transcribed spacer (ITS) sequences of nuclear ribosomal DNA using 14 accessions from Zea species. RAPD analysis showed that a total of 5,303 fragments were produced by 136 random decamer primers, of which 84.86% bands were polymorphic. RAPD-based UPGMA analysis demonstrated that the genus Zea can be divided into section Luxuriantes including Zea diploperennis, Zea luxurians, Zea perennis and Zea nicaraguensis, and section Zea including Zea mays ssp. mexicana, Zea mays ssp. parviglumis, Zea mays ssp. huehuetenangensis and Zea mays ssp. mays. ITS sequence analysis showed the lengths of the entire ITS region of the 14 taxa in Zea varied from 597 to 605 bp. The average GC content was 67.8%. In addition to the insertion/deletions, 78 variable sites were recorded in the total ITS region with 47 in ITS1, 5 in 5.8S, and 26 in ITS2. Sequences of these taxa were analyzed with neighbor-joining (NJ) and maximum parsimony (MP) methods to construct the phylogenetic trees, selecting Tripsacum dactyloides L. as the outgroup. The phylogenetic relationships of Zea species inferred from the ITS sequences are highly concordant with the RAPD evidence that resolved two major subgenus clades. Both RAPD and ITS sequence analyses indicate that Zea nicaraguensis is more closely related to Zea luxurians than the other teosintes and cultivated maize, which should be regarded as a section Luxuriantes species. PMID:21525982
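The summary statistics reported for the ITS alignment (GC content and the number of variable sites) are simple to compute. The sketch below is illustrative only: the three short aligned sequences are invented, the function names are hypothetical, and gaps are ignored when deciding whether a column is variable, which is an assumption rather than the authors' stated rule.

```python
from typing import List


def gc_content(seq: str) -> float:
    """Percentage of G and C among the unambiguous bases of a sequence."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


def variable_sites(aligned: List[str]) -> List[int]:
    """Column indices where more than one nucleotide occurs (gaps ignored)."""
    sites = []
    for i, column in enumerate(zip(*aligned)):
        observed = {b for b in column if b in "ACGT"}
        if len(observed) > 1:
            sites.append(i)
    return sites


# Hypothetical toy alignment (not the Zea ITS data).
toy = ["GCGCAT", "GCGTAT", "GC-CAA"]
print(variable_sites(toy))            # columns that differ among sequences
print(gc_content("GC-CAA"))           # gap is skipped when counting bases

# The per-region counts quoted in the abstract add up to the reported total.
its1, s58, its2 = 47, 5, 26
print(its1 + s58 + its2)
```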
Choosing and Using Introns in Molecular Phylogenetics
Creer, Simon
2007-01-01
Introns are now commonly used in molecular phylogenetics in an attempt to recover gene trees that are concordant with species trees, but there are a range of genomic, logistical and analytical considerations that are infrequently discussed in empirical studies that utilize intron data. This review outlines expedient approaches for locus selection, overcoming paralogy problems, recombination detection methods and the identification and incorporation of LVHs in molecular systematics. A range of parsimony and Bayesian analytical approaches are also described in order to highlight the methods that can currently be employed to align sequences and treat indels in subsequent analyses. By covering the main points associated with the generation and analysis of intron data, this review aims to provide a comprehensive introduction to using introns (or any non-coding nuclear data partition) in contemporary phylogenetics. PMID:19461984
Aeroelastic Model Structure Computation for Envelope Expansion
NASA Technical Reports Server (NTRS)
Kukreja, Sunil L.
2007-01-01
Structure detection is a procedure for selecting a subset of candidate terms, from a full model description, that best describes the observed output. This is a necessary procedure to compute an efficient system description which may afford greater insight into the functionality of the system or a simpler controller design. Structure computation as a tool for black-box modeling may be of critical importance in the development of robust, parsimonious models for the flight-test community. Moreover, this approach may lead to efficient strategies for rapid envelope expansion that may save significant development time and costs. In this study, a least absolute shrinkage and selection operator (LASSO) technique is investigated for computing efficient model descriptions of non-linear aeroelastic systems. The LASSO minimises the residual sum of squares with the addition of an l1 penalty term on the parameter vector of the traditional l2 minimisation problem. Its use for structure detection is a natural extension of this constrained minimisation approach to pseudo-linear regression problems which produces some model parameters that are exactly zero and, therefore, yields a parsimonious system description. Applicability of this technique for model structure computation for the F/A-18 (McDonnell Douglas, now The Boeing Company, Chicago, Illinois) Active Aeroelastic Wing project using flight test data is shown for several flight conditions (Mach numbers) by identifying a parsimonious system description with a high percent fit for cross-validated data.
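To make concrete how an l1 penalty sets some parameters exactly to zero, here is a minimal sketch of LASSO by cyclic coordinate descent. It is not the method or code used in the study: the objective scaling (one half of the residual sum of squares plus lam times the l1 norm), the toy regressor matrix, and all names are assumptions chosen for illustration, and the sketch assumes no regressor column is entirely zero.

```python
from typing import List


def soft_threshold(rho: float, lam: float) -> float:
    """Shrink rho toward zero by lam; values inside [-lam, lam] become 0."""
    if rho > lam:
        return rho - lam
    if rho < -lam:
        return rho + lam
    return 0.0


def lasso_cd(X: List[List[float]], y: List[float], lam: float,
             n_iter: int = 100) -> List[float]:
    """Minimise 0.5 * ||y - Xw||^2 + lam * ||w||_1 by coordinate descent."""
    n, p = len(X), len(X[0])
    w = [0.0] * p
    for _ in range(n_iter):
        for j in range(p):
            rho = 0.0  # correlation of column j with the partial residual
            z = 0.0    # squared norm of column j (assumed non-zero)
            for i in range(n):
                pred_without_j = sum(X[i][k] * w[k] for k in range(p) if k != j)
                rho += X[i][j] * (y[i] - pred_without_j)
                z += X[i][j] ** 2
            w[j] = soft_threshold(rho, lam) / z
    return w


# Hypothetical candidate terms (columns) with an orthogonal toy design.
X = [[1.0, 1.0, 1.0],
     [1.0, -1.0, 1.0],
     [1.0, 1.0, -1.0],
     [1.0, -1.0, -1.0]]
# Output generated as 2.0 * term0 + 0.0 * term1 + 0.1 * term2.
y = [2.1, 2.1, 1.9, 1.9]

w = lasso_cd(X, y, lam=1.0)
selected_terms = [j for j, coef in enumerate(w) if coef != 0.0]
print(w, selected_terms)
```

In this toy case the weak third term is removed along with the irrelevant second one, and the retained coefficient is shrunk below its generating value; both effects are typical of the l1 penalty, and the penalty weight lam controls how aggressive the selection is.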
Mirus, Benjamin B.; Nimmo, J.R.
2013-01-01
The impact of preferential flow on recharge and contaminant transport poses a considerable challenge to water-resources management. Typical hydrologic models require extensive site characterization, but can underestimate fluxes when preferential flow is significant. A recently developed source-responsive model incorporates film-flow theory with conservation of mass to estimate unsaturated-zone preferential fluxes with readily available data. The term source-responsive describes the sensitivity of preferential flow in response to water availability at the source of input. We present the first rigorous tests of a parsimonious formulation for simulating water table fluctuations using two case studies, both in arid regions with thick unsaturated zones of fractured volcanic rock. Diffuse flow theory cannot adequately capture the observed water table responses at both sites; the source-responsive model is a viable alternative. We treat the active area fraction of preferential flow paths as a scaled function of water inputs at the land surface then calibrate the macropore density to fit observed water table rises. Unlike previous applications, we allow the characteristic film-flow velocity to vary, reflecting the lag time between source and deep water table responses. Analysis of model performance and parameter sensitivity for the two case studies underscores the importance of identifying thresholds for initiation of film flow in unsaturated rocks, and suggests that this parsimonious approach is potentially of great practical value.
Carreno, R A; Barta, J R
1998-11-01
The small subunit ribosomal RNA (SSU rRNA) genes of hippoboscid (Ornithoica vicina Walker) and tabanid (Chrysops niger Macquart) Diptera were sequenced to determine their phylogenetic position within the order and to determine whether or not extensive hypervariable regions in this gene are widespread in the Diptera. A parsimony analysis of an alignment containing 8 dipteran sequences produced a single most parsimonious tree that placed O. vicina as sister group to Drosophila melanogaster Meigen. The tabanid Chrysops niger was sister group to the asilomorphan taxa, and the sister group to the Brachycera was a Tipula sp. although this relationship was not supported by bootstrap analysis. The hippoboscid and tabanid sequences contain extensive hypervariable regions in the V2, V4, V6, and V7 regions as do other Diptera. When these regions of the alignment were excluded from the phylogenetic analysis, a single most parsimonious tree was found. This tree had an identical overall topology to the tree obtained from the total data set. The hypervariable regions in parts of the dipteran SSU rRNA genes were more extensive in the nematocerous dipteran sequences used in this study than in the other dipteran representatives; these hypervariable regions may be of more utility in inferring relationship among species and subspecies than at the suprageneric level.
The augmentation algorithm and molecular phylogenetic trees
NASA Technical Reports Server (NTRS)
Holmquist, R.
1978-01-01
Moore's (1977) augmentation procedure is discussed, and it is concluded that the procedure is valid for obtaining estimates of the total number of fixed nucleotide substitutions both theoretically and in practice, for both simulated and real data, and in agreement, for experimentally dense data sets, with stochastic estimates of the divergence, provided the restrictions on codon mutability resulting from natural selection are explicitly allowed for. Tateno and Nei's (1978) critique that the augmentation procedure has a systematic bias toward overestimation of the total number of nucleotide replacements is disputed, and a data analysis suggests that ancestral sequences inferred by the method of parsimony contain a large number of incorrectly assigned nucleotides.
Mariel, Petr; Hoyos, David; Artabe, Alaitz; Guevara, C Angelo
2018-08-15
Endogeneity is an often neglected issue in empirical applications of discrete choice modelling despite its severe consequences in terms of inconsistent parameter estimation and biased welfare measures. This article analyses the performance of the multiple indicator solution method to deal with endogeneity arising from omitted explanatory variables in discrete choice models for environmental valuation. We also propose and illustrate a factor analysis procedure for the selection of the indicators in practice. Additionally, the performance of this method is compared with the recently proposed hybrid choice modelling framework. In an empirical application we find that the multiple indicator solution method and the hybrid model approach provide similar results in terms of welfare estimates, although the multiple indicator solution method is more parsimonious and notably easier to implement. The empirical results open a path to explore the performance of this method when endogeneity is thought to have a different cause or under a different set of indicators. Copyright © 2018 Elsevier B.V. All rights reserved.
Model-based Clustering of High-Dimensional Data in Astrophysics
NASA Astrophysics Data System (ADS)
Bouveyron, C.
2016-05-01
The nature of data in Astrophysics has changed, as in other scientific fields, in the past decades due to the increase in measurement capabilities. As a consequence, data are nowadays frequently of high dimensionality and available in mass or as streams. Model-based techniques for clustering are popular tools which are renowned for their probabilistic foundations and their flexibility. However, classical model-based techniques behave disappointingly in high-dimensional spaces, mainly because of their dramatic over-parametrization. Recent developments in model-based classification overcome these drawbacks and allow efficient classification of high-dimensional data, even in the "small n / large p" situation. This work presents a comprehensive review of these recent approaches, including regularization-based techniques, parsimonious modeling, subspace classification methods and classification methods based on variable selection. The use of these model-based methods is also illustrated on real-world classification problems in Astrophysics using R packages.
The scenario on the origin of translation in the RNA world: in principle of replication parsimony
2010-01-01
Background It is now believed that in the origin of life, proteins should have been "invented" in an RNA world. However, due to the complexity of a possible RNA-based proto-translation system, this evolving process seems quite complicated and the associated scenario remains very blurry. Considering that RNA can bind amino acids with specificity, it has been reasonably supposed that initial peptides might have been synthesized on "RNA templates" containing multiple amino acid binding sites. This "Direct RNA Template (DRT)" mechanism is attractive because it should be the simplest mechanism for RNA to synthesize peptides, thus very likely to have been adopted initially in the RNA world. Then, how this mechanism could develop into a proto-translation system mechanism is an interesting problem. Presentation of the hypothesis Here an explanation to this problem is shown considering the principle of "replication parsimony" --- genetic information tends to be utilized in a parsimonious way under selection pressure, due to its replication cost (e.g., in the RNA world, nucleotides and ribozymes for RNA replication). Because a DRT would be quite long even for a short peptide, its replication cost would be great. Thus the diversity and the length of functional peptides synthesized by the DRT mechanism would be seriously limited. Adaptors (proto-tRNAs) would arise to allow a DRT's complementary strand (called "C-DRT" here) to direct the synthesis of the same peptide synthesized by the DRT itself. Because the C-DRT is a necessary part in the DRT's replication, fewer turns of the DRT's replication would be needed to synthesize definite copies of the functional peptide, thus saving the replication cost. Acting through adaptors, C-DRTs could transform into much shorter templates (called "proto-mRNAs" here) and substitute the role of DRTs, thus significantly saving the replication cost. 
A proto-rRNA corresponding to the small subunit rRNA would then emerge to aid the binding of proto-tRNAs and proto-mRNAs, allowing the reduction of base pairs between them (ultimately resulting in the triplet anticodon/codon pair), thus further saving the replication cost. In this context, the replication cost saved would allow the appearance of more and longer functional peptides and, finally, proteins. The hypothesis could be called "DRT-RP" ("RP" for "replication parsimony"). Testing the hypothesis The scenario described here is open for experimental work at some key scenes, including the compact DRT mechanism, the development of adaptors from aa-aptamers, the synthesis of peptides by proto-tRNAs and proto-mRNAs without the participation of proto-rRNAs, etc. Interestingly, a recent computer simulation study has demonstrated the plausibility of one of the evolving processes driven by replication parsimony in the scenario. Implication of the hypothesis An RNA-based proto-translation system could arise gradually from the DRT mechanism according to the principle of "replication parsimony" --- to save the replication cost of RNA templates for functional peptides. A surprising side deduction along the logic of the hypothesis is that complex, biosynthetic amino acids might have entered the genetic code earlier than simple, prebiotic amino acids, which is opposite to the common sense. Overall, the present discussion clarifies the blurry scenario concerning the origin of translation with a major clue, which shows vividly how life could "manage" to exploit potential chemical resources in nature, eventually in an efficient way over evolution. Reviewers This article was reviewed by Eugene V. Koonin, Juergen Brosius, and Arcady Mushegian. PMID:21110883
Degtjareva, Galina V; Valiejo-Roman, Carmen M; Samigullin, Tahir H; Guara-Requena, Miguel; Sokoloff, Dmitry D
2012-02-01
Phylogenetic relationships in the genus Anthyllis (Leguminosae: Papilionoideae: Loteae) were investigated using data from the nuclear ribosomal internal transcribed spacer regions (ITS) and three plastid regions (psbA-trnH intergenic spacer, petB-petD region and rps16 intron). Bayesian and maximum parsimony (MP) analysis of a concatenated plastid dataset recovered well-resolved trees that are topologically similar, with many clades supported by unique indels. MP and Bayesian analyses of the ITS sequence data recovered trees that have several well-supported topological differences, both among analyses, and to trees inferred from the plastid data. The most substantial of these concerns A. vulneraria and A. lemanniana, whose placement in the parsimony analysis of the ITS data appears to be due to a strong long-branch effect. Analysis of the secondary structure of the ITS1 spacer showed a strong bias towards transitions in A. vulneraria and A. lemanniana, many of which were also characteristic of certain outgroup taxa. This may contribute to the conflicting placement of this clade in the MP tree for the ITS data. Additional conflicts between the plastid and ITS trees were more taxonomically focused. These differences may reflect the occurrence of reticulate evolution between closely related species, including a possible hybrid origin for A. hystrix. The patterns of incongruence between the plastid and the ITS data seem to correlate with taxon ranks. All of our phylogenetic analyses supported the monophyly of Anthyllis (incl. Hymenocarpos). Although they are often taxonomically associated with Anthyllis, the genera Dorycnopsis and Tripodion are shown here to be more closely related to other genera of Loteae. We infer up to six major clades in Anthyllis that are morphologically well-characterized, and which could be recognized as sections. Four of these agree with various morphology-based classifications, while the other two are novel. 
We reconstruct the evolution of several morphological characteristics found only in Anthyllis or tribe Loteae. Some of these characters support major clades, while others show evidence of homoplasy within Anthyllis. Copyright © 2011 Elsevier Inc. All rights reserved.
Common reflection point migration and velocity analysis for anisotropic media
NASA Astrophysics Data System (ADS)
Oropeza, Ernesto V.
An efficient Kirchhoff-style prestack depth migration, called 'parsimonious' migration, was developed a decade ago for isotropic 2D and 3D media. The common-reflection point (CRP) migration velocity analysis (MVA) was developed later for isotropic media. The isotropic parsimonious migration produces incorrect images when the medium is actually anisotropic. Similarly, isotropic CRP MVA produces incorrect inversions when the medium is anisotropic. In this study both parsimonious depth migration and common-reflection point migration velocity analysis are extended for application to 2D tilted transversely isotropic (TTI) media and illustrated with synthetic P-wave data. While the framework of isotropic parsimonious migration may be retained, the extension to TTI media requires redevelopment of each of the numerical components, including calculation of the phase and group velocity for TTI media, development of a new two-point anisotropic ray tracer, and substitution of an initial-angle and anisotropic shooting ray-trace algorithm to replace the isotropic one. The 2D model parameterization consists of Thomsen's parameters (Vpo, epsilon, delta) and the tilt angle of the symmetry axis of the TI medium. The parsimonious anisotropic migration algorithm is successfully applied to synthetic data from a TTI version of the Marmousi-2 model. The quality of the image improves by weighting the impulse response by the calculation of the anisotropic Fresnel radius. The accuracy and speed of this migration make it useful for anisotropic velocity model building. The common-reflection point migration velocity analysis for TTI media for P-waves includes (and inverts for) Vpo, epsilon, and delta. The orientation of the anisotropic symmetry axis has to be constrained. If it is constrained orthogonal to the layer bottom (as it conventionally is), it is estimated at each CRP and updated at each iteration without intermediate picking. 
The extension to TTI media requires development of a new inversion procedure to include Vpo, epsilon, and delta in the perturbations. The TTI CRP MVA is applied to a single layer to demonstrate its feasibility. Errors in the estimation of the orientation of the symmetry axis larger than 5 degrees affect the inversion of epsilon and delta, while Vpo is less sensitive to this parameter. The TTI CRP MVA is also applied to a version of the TTI BP model by layer stripping, so that one group of CRPs is used to do the inversion top to bottom, constraining the model parameters after each previous group of CRPs converges. Vpo, delta and the orientation of the anisotropic symmetry axis (constrained orthogonal to the local reflector orientation) are successfully inverted. Epsilon is less well constrained by the small acquisition aperture in the data.
Zhang, Honghai; Chen, Lei
2011-03-01
The dhole (Cuon alpinus) is the only extant species in the genus Cuon (Carnivora: Canidae). In the present study, the complete mitochondrial genome of the dhole was sequenced. The total length is 16672 base pairs, which is the shortest in Canidae. Sequence analysis revealed that most mitochondrial genomic functional regions were highly consistent among canid animals except the CSB domain of the control region. The difference in length among the Canidae mitochondrial genome sequences is mainly due to the number of short tandem repeat segments in the CSB domain. Phylogenetic analysis was performed on the concatenated data set of 14 mitochondrial genes of 8 canid animals using maximum parsimony (MP), maximum likelihood (ML) and Bayesian inference (BI) methods. The genera Vulpes and Nyctereutes formed a sister group and split first within Canidae, followed by the genus Cuon. The divergence in the genus Canis was the latest. The divergence of domestic dogs after that of Canis lupus laniger is fully supported by all three topologies. Pairwise sequence divergence data of different mitochondrial genes among canid animals were also determined. Except for the synonymous substitutions in protein-coding genes, the control region exhibits the highest sequence divergences. The synonymous rates are approximately two to six times higher than those of the non-synonymous sites except for a slightly higher rate in the non-synonymous substitution between Cuon alpinus and Vulpes vulpes. 16S rRNA genes have a slightly faster sequence divergence than 12S rRNA and tRNA genes. Based on nucleotide substitutions of tRNA genes and rRNA genes, the times since divergence between dhole and other canid animals, and between domestic dogs and three subspecies of wolves were evaluated. The result indicates that Vulpes and Nyctereutes have a close phylogenetic relationship and the divergence of Nyctereutes is a little earlier. 
The Tibetan wolf may be an archaic pedigree within wolf subspecies. The genetic distance between wolves and domestic dogs is less than that among different subspecies of wolves. The domestication of dogs was about 1.56-1.92 million years ago or even earlier.
Simakova, Anastasia V; Vossbrinck, Charles R; Andreadis, Theodore G
2008-11-01
A new genus and species of microsporidia, Andreanna caspii n. gen., n. sp. is described from the mosquito, Ochlerotatus caspius (Pallas) based on ultrastructural morphology, developmental characteristics, and comparative sequence analyses of the small subunit (SSU) ribosomal DNA (rDNA). Parasite development is confined to fat body tissue and infected larvae appear swollen with dull white masses within the thorax and abdomen. Meronts have diplokaryotic nuclei and are delineated by a simple plasmalemma contiguous with the host cell cytoplasm. Merogony occurs by synchronous binary division followed by cytokinesis. Diplokaryotic sporonts undergo meiosis and synchronous nuclear division forming sporogonial plasmodia with two, four and eight nuclei enclosed within a persistent sporophorous vesicle. Cytokinesis of sporogonial plasmodia results in the formation of eight uninucleate spores. The episporontal space of early sporonts is filled with a homogeneous accumulation of electron dense granular inclusions and ovoid vesicles of various dimensions, transforming into an interwoven matrix during the initial phase of sporogenesis. Spores are oval, uninucleate and measure 4.8 ± 0.3 × 3.1 ± 0.4 µm (fixed). The spore wall is 260 µm thick with an irregular exospore consisting of two layers (150-170 µm) and a thinner endospore (90-100 µm). The anchoring disk is well developed and is contiguous with a lamellar polaroplast that occupies the anterior third of the spore and possesses narrower lamellae on the posterior end. The polar filament is gradually tapered and arranged in a single row consisting of six coils ranging from 180 to 150 µm in diameter. The posterior vacuole (posterosome) is moderately sized and filled with a matrix of moderate electron density.
Phylogenetic analysis of SSU rDNA from A. caspii and 30 other species of microsporidia including 11 genera parasitic in mosquitoes using maximum parsimony, neighbor joining and maximum likelihood methods showed A. caspii to be a sister group to the clade containing all of the Amblyospora species, including Culicospora, Edhazardia and Intrapredatorus, as well as Culicosporella and Hyalinocysta, thus providing strong support for establishment of Andreanna as a separate genus.
2014-01-01
Background Lethal amanitas (Amanita section Phalloideae) are a group of wild, fatal mushrooms causing many poisoning cases worldwide. However, the diversity and evolutionary history of these lethal mushrooms remain poorly known due to the limited sampling and insufficient gene fragments employed for phylogenetic analyses. In this study, five gene loci (nrLSU, ITS, rpb2, ef1-α and β-tubulin) with a widely geographic sampling from East and South Asia, Europe, North and Central America, South Africa and Australia were analysed with maximum-likelihood, maximum-parsimony and Bayesian inference methods. Biochemical analyses were also conducted with intention to detect amatoxins and phalloidin in 14 representative samples. Result Lethal amanitas were robustly supported to be a monophyletic group after excluding five species that were provisionally defined as lethal amanitas based on morphological studies. In lethal amanitas, 28 phylogenetic species were recognised by integrating molecular phylogenetic analyses with morphological studies, and 14 of them represented putatively new species. The biochemical analyses indicated a single origin of cyclic peptide toxins (amatoxins and phalloidin) within Amanita and suggested that this kind of toxins seemed to be a synapomorphy of lethal amanitas. Molecular dating through BEAST and biogeographic analyses with LAGRANGE and RASP indicated that lethal amanitas most likely originated in the Palaeotropics with the present crown group dated around 64.92 Mya in the early Paleocene, and the East Asia–eastern North America or Eurasia–North America–Central America disjunct distribution patterns were primarily established during the middle Oligocene to Miocene. Conclusion The cryptic diversity found in this study indicates that the species diversity of lethal amanitas is strongly underestimated under the current taxonomy. 
The intercontinental sister species or sister groups relationships among East Asia and eastern North America or Eurasia–North America–Central America within lethal amanitas are best explained by the diversification model of Palaeotropical origin, dispersal via the Bering Land Bridge, followed by regional vicariance speciation resulting from climate change during the middle Oligocene to the present. These findings indicate the importance of both dispersal and vicariance in shaping the intercontinental distributions of these ectomycorrhizal fungi. PMID:24950598
Koepfli, Klaus-Peter; Deere, Kerry A; Slater, Graham J; Begg, Colleen; Begg, Keith; Grassman, Lon; Lucherini, Mauro; Veron, Geraldine; Wayne, Robert K
2008-01-01
Background Adaptive radiation, the evolution of ecological and phenotypic diversity from a common ancestor, is a central concept in evolutionary biology and characterizes the evolutionary histories of many groups of organisms. One such group is the Mustelidae, the most species-rich family within the mammalian order Carnivora, encompassing 59 species classified into 22 genera. Extant mustelids display extensive ecomorphological diversity, with different lineages having evolved into an array of adaptive zones, from fossorial badgers to semi-aquatic otters. Mustelids are also widely distributed, with multiple genera found on different continents. As with other groups that have undergone adaptive radiation, resolving the phylogenetic history of mustelids presents a number of challenges because ecomorphological convergence may potentially confound morphologically based phylogenetic inferences, and because adaptive radiations often include one or more periods of rapid cladogenesis that require a large amount of data to resolve. Results We constructed a nearly complete generic-level phylogeny of the Mustelidae using a data matrix comprising 22 gene segments (~12,000 base pairs) analyzed with maximum parsimony, maximum likelihood and Bayesian inference methods. We show that mustelids are consistently resolved with high nodal support into four major clades and three monotypic lineages. Using Bayesian dating techniques, we provide evidence that mustelids underwent two bursts of diversification that coincide with major paleoenvironmental and biotic changes that occurred during the Neogene and correspond with similar bursts of cladogenesis in other vertebrate groups. Biogeographical analyses indicate that most of the extant diversity of mustelids originated in Eurasia and mustelids have colonized Africa, North America and South America on multiple occasions. 
Conclusion Combined with information from the fossil record, our phylogenetic and dating analyses suggest that mustelid diversification may have been spurred by a combination of faunal turnover events and diversification at lower trophic levels, ultimately caused by climatically driven environmental changes. Our biogeographic analyses show Eurasia as the center of origin of mustelid diversity and that mustelids in Africa, North America and South America have been assembled over time largely via dispersal, which has important implications for understanding the ecology of mustelid communities. PMID:18275614
Ruhlman, Tracey; Lee, Seung-Bum; Jansen, Robert K; Hostetler, Jessica B; Tallon, Luke J; Town, Christopher D; Daniell, Henry
2006-08-31
Carrot (Daucus carota) is a major food crop in the US and worldwide. Its capacity for storage and its lifecycle as a biennial make it an attractive species for the introduction of foreign genes, especially for oral delivery of vaccines and other therapeutic proteins. Until recently efforts to express recombinant proteins in carrot have had limited success in terms of protein accumulation in the edible tap roots. Plastid genetic engineering offers the potential to overcome this limitation, as demonstrated by the accumulation of BADH in chromoplasts of carrot taproots to confer exceedingly high levels of salt resistance. The complete plastid genome of carrot provides essential information required for genetic engineering. Additionally, the sequence data add to the rapidly growing database of plastid genomes for assessing phylogenetic relationships among angiosperms. The complete carrot plastid genome is 155,911 bp in length, with 115 unique genes and 21 duplicated genes within the IR. There are four ribosomal RNAs, 30 distinct tRNA genes and 18 intron-containing genes. Repeat analysis reveals 12 direct and 2 inverted repeats ≥ 30 bp with a sequence identity ≥ 90%. Phylogenetic analyses of nucleotide sequences for 61 protein-coding genes using both maximum parsimony (MP) and maximum likelihood (ML) were performed for 29 angiosperms. Phylogenies from both methods provide strong support for the monophyly of several major angiosperm clades, including monocots, eudicots, rosids, asterids, eurosids II, euasterids I, and euasterids II. The carrot plastid genome contains a number of dispersed direct and inverted repeats scattered throughout coding and non-coding regions. This is the first sequenced plastid genome of the family Apiaceae and only the second published genome sequence of the species-rich euasterid II clade. Both MP and ML trees provide very strong support (100% bootstrap) for the sister relationship of Daucus with Panax in the euasterid II clade. 
These results provide the best taxon sampling of complete chloroplast genomes and the strongest support yet for the sister relationship of Caryophyllales to the asterids. The availability of the complete plastid genome sequence should facilitate improved transformation efficiency and foreign gene expression in carrot through utilization of endogenous flanking sequences and regulatory elements.
2014-01-01
Background Given that most species that have ever existed on earth are extinct, it stands to reason that the evolutionary history can be better understood with fossil taxa. Bauhinia is a typical genus of pantropical intercontinental disjunction among the Asian, African, and American continents. Geographic distribution patterns are better recognized when fossil records and molecular sequences are combined in the analyses. Here, we describe a new macrofossil species of Bauhinia from the Upper Miocene Xiaolongtan Formation in Wenshan County, Southeast Yunnan, China, and elucidate the biogeographic significance through the analyses of molecules and fossils. Results Morphometric analysis demonstrates that the leaf shapes of B. acuminata, B. championii, B. chalcophylla, B. purpurea, and B. podopetala closely resemble the leaf shapes of the new finding fossil. Phylogenetic relationships among the Bauhinia species were reconstructed using maximum parsimony and Bayesian inference, which inferred that species in Bauhinia species are well-resolved into three main groups. Divergence times were estimated by the Bayesian Markov chain Monte Carlo (MCMC) method under a relaxed clock, and inferred that the stem diversification time of Bauhinia was ca. 62.7 Ma. The Asian lineage first diverged at ca. 59.8 Ma, followed by divergence of the Africa lineage starting during the late Eocene, whereas that of the neotropical lineage starting during the middle Miocene. Conclusions Hypotheses relying on vicariance or continental history to explain pantropical disjunct distributions are dismissed because they require mostly Palaeogene and older tectonic events. We suggest that Bauhinia originated in the middle Paleocene in Laurasia, probably in Asia, implying a possible Tethys Seaway origin or an “Out of Tropical Asia”, and dispersal of legumes. 
Its present pantropical disjunction resulted from disruption of the boreotropical flora by climatic cooling after the Paleocene-Eocene Thermal Maximum (PETM). North Atlantic land bridges (NALB) seem the most plausible route for migration of Bauhinia from Asia to America; and additional aspects of the Bauhinia species distribution are explained by migration and long distance dispersal (LDD) from Eurasia to the African and American continents. PMID:25288346
Meng, Hong-Hu; Jacques, Frédéric Mb; Su, Tao; Huang, Yong-Jiang; Zhang, Shi-Tao; Ma, Hong-Jie; Zhou, Zhe-Kun
2014-08-10
Given that most species that have ever existed on earth are extinct, it stands to reason that the evolutionary history can be better understood with fossil taxa. Bauhinia is a typical genus of pantropical intercontinental disjunction among the Asian, African, and American continents. Geographic distribution patterns are better recognized when fossil records and molecular sequences are combined in the analyses. Here, we describe a new macrofossil species of Bauhinia from the Upper Miocene Xiaolongtan Formation in Wenshan County, Southeast Yunnan, China, and elucidate the biogeographic significance through the analyses of molecules and fossils. Morphometric analysis demonstrates that the leaf shapes of B. acuminata, B. championii, B. chalcophylla, B. purpurea, and B. podopetala closely resemble the leaf shapes of the new finding fossil. Phylogenetic relationships among the Bauhinia species were reconstructed using maximum parsimony and Bayesian inference, which inferred that species in Bauhinia species are well-resolved into three main groups. Divergence times were estimated by the Bayesian Markov chain Monte Carlo (MCMC) method under a relaxed clock, and inferred that the stem diversification time of Bauhinia was ca. 62.7 Ma. The Asian lineage first diverged at ca. 59.8 Ma, followed by divergence of the Africa lineage starting during the late Eocene, whereas that of the neotropical lineage starting during the middle Miocene. Hypotheses relying on vicariance or continental history to explain pantropical disjunct distributions are dismissed because they require mostly Palaeogene and older tectonic events. We suggest that Bauhinia originated in the middle Paleocene in Laurasia, probably in Asia, implying a possible Tethys Seaway origin or an "Out of Tropical Asia", and dispersal of legumes. Its present pantropical disjunction resulted from disruption of the boreotropical flora by climatic cooling after the Paleocene-Eocene Thermal Maximum (PETM). 
North Atlantic land bridges (NALB) seem the most plausible route for migration of Bauhinia from Asia to America; and additional aspects of the Bauhinia species distribution are explained by migration and long distance dispersal (LDD) from Eurasia to the African and American continents.
2012-01-01
Background The twelve-item Self-Report Habit Index (SRHI) is the most popular measure of energy-balance related habits. This measure characterises habit by automatic activation, behavioural frequency, and relevance to self-identity. Previous empirical research suggests that the SRHI may be abbreviated with no losses in reliability or predictive utility. Drawing on recent theorising suggesting that automaticity is the ‘active ingredient’ of habit-behaviour relationships, we tested whether an automaticity-specific SRHI subscale could capture habit-based behaviour patterns in self-report data. Methods A content validity task was undertaken to identify a subset of automaticity indicators within the SRHI. The reliability, convergent validity and predictive validity of the automaticity item subset was subsequently tested in secondary analyses of all previous SRHI applications, identified via systematic review, and in primary analyses of four raw datasets relating to energy‐balance relevant behaviours (inactive travel, active travel, snacking, and alcohol consumption). Results A four-item automaticity subscale (the ‘Self-Report Behavioural Automaticity Index’; ‘SRBAI’) was found to be reliable and sensitive to two hypothesised effects of habit on behaviour: a habit-behaviour correlation, and a moderating effect of habit on the intention-behaviour relationship. Conclusion The SRBAI offers a parsimonious measure that adequately captures habitual behaviour patterns. The SRBAI may be of particular utility in predicting future behaviour and in studies tracking habit formation or disruption. PMID:22935297
Towards a global historical biogeography of Palms
NASA Astrophysics Data System (ADS)
Couvreur, Thomas; Baker, William J.; Frigerio, Jean-Marc; Sepulchre, Pierre; Franc, Alain
2017-04-01
Four mechanisms are at work in the historical biogeography of plants: speciation, extinction, migration, and drift (a sort of neutral speciation). The first three mechanisms are under selection pressure of the environment, mainly the climate and connectivity of land masses. Hence, an accurate history of climate and connectivity or non-connectivity between landmasses, as well as orogenesis processes, can shed new light on the most likely speciation events and migration routes driven by paleogeography and paleoclimatology. Currently, some models exist (like DIVA) to infer the most parsimonious history (in the number of migration events) knowing the speciation history given by phylogenies (extinctions are mostly unknown), in a given setting of climate and landmass connectivity. In a previous project, we have built in collaboration with LSCE a series of paleogeographic and paleoclimatic maps since the Early Cretaceous. We have developed a program, called Aran, which extends DIVA to a time series of varying paleoclimatic and paleogeographic conditions. We apply these new methods and data to unravel the biogeographic history of palms (Arecaceae), a pantropical family of 182 genera and >2600 species whose divergence is dated to the Late Cretaceous (100 My). Based on a robust dated molecular phylogeny and novel paleoclimatic and paleogeographic maps, we will generate an updated biogeographic history of Arecaceae inferred from the most parsimonious history using Aran. We will discuss the results, and put them in context with what is known and needed to provide a global biogeographic history of tropical palms.
Spot the match – wildlife photo-identification using information theory
Speed, Conrad W; Meekan, Mark G; Bradshaw, Corey JA
2007-01-01
Background Effective approaches for the management and conservation of wildlife populations require a sound knowledge of population demographics, and this is often only possible through mark-recapture studies. We applied an automated spot-recognition program (I3S) for matching natural markings of wildlife that is based on a novel information-theoretic approach to incorporate matching uncertainty. Using a photo-identification database of whale sharks (Rhincodon typus) as an example case, the information criterion (IC) algorithm we developed resulted in a parsimonious ranking of potential matches of individuals in an image library. Automated matches were compared to manual-matching results to test the performance of the software and algorithm. Results Validation of matched and non-matched images provided a threshold IC weight (approximately 0.2) below which match certainty was not assured. Most images tested were assigned correctly; however, scores for the by-eye comparison were lower than expected, possibly due to the low sample size. The effect of increasing horizontal angle of sharks in images reduced matching likelihood considerably. There was a negative linear relationship between the number of matching spot pairs and matching score, but this relationship disappeared when using the IC algorithm. Conclusion The software and use of easily applied information-theoretic scores of match parsimony provide a reliable and freely available method for individual identification of wildlife, with wide applications and the potential to improve mark-recapture studies without resorting to invasive marking techniques. PMID:17227581
Modeling time-to-event (survival) data using classification tree analysis.
Linden, Ariel; Yarnold, Paul R
2017-12-01
Time to the occurrence of an event is often studied in health research. Survival analysis differs from other designs in that follow-up times for individuals who do not experience the event by the end of the study (called censored) are accounted for in the analysis. Cox regression is the standard method for analysing censored data, but the assumptions required of these models are easily violated. In this paper, we introduce classification tree analysis (CTA) as a flexible alternative for modelling censored data. Classification tree analysis is a "decision-tree"-like classification model that provides parsimonious, transparent (ie, easy to visually display and interpret) decision rules that maximize predictive accuracy, derives exact P values via permutation tests, and evaluates model cross-generalizability. Using empirical data, we identify all statistically valid, reproducible, longitudinally consistent, and cross-generalizable CTA survival models and then compare their predictive accuracy to estimates derived via Cox regression and an unadjusted naïve model. Model performance is assessed using integrated Brier scores and a comparison between estimated survival curves. The Cox regression model best predicts average incidence of the outcome over time, whereas CTA survival models best predict either relatively high, or low, incidence of the outcome over time. Classification tree analysis survival models offer many advantages over Cox regression, such as explicit maximization of predictive accuracy, parsimony, statistical robustness, and transparency. Therefore, researchers interested in accurate prognoses and clear decision rules should consider developing models using the CTA-survival framework. © 2017 John Wiley & Sons, Ltd.
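The abstract above assesses models with integrated Brier scores. As a rough illustration only, the sketch below computes a Brier score at a single time point and averages it over a time grid. It assumes there is no censoring at all, so it omits the inverse-probability-of-censoring weights that a real survival analysis would need. The function names and the `predict_survival` callback are hypothetical and are not taken from the paper.

```python
def brier_score(event_times, predicted_survival, t):
    """Mean squared gap between observed status at time t and predicted S(t).

    Assumption: every event time is fully observed (no censoring).
    """
    total = 0.0
    for time, surv in zip(event_times, predicted_survival):
        still_event_free = 1.0 if time > t else 0.0
        total += (still_event_free - surv) ** 2
    return total / len(event_times)


def integrated_brier_score(event_times, predict_survival, grid):
    """Trapezoidal average of the Brier score over an increasing time grid.

    predict_survival(t) is a hypothetical callback returning one predicted
    survival probability per individual at time t.
    """
    scores = [brier_score(event_times, predict_survival(t), t) for t in grid]
    area = 0.0
    for i in range(len(grid) - 1):
        area += (grid[i + 1] - grid[i]) * (scores[i] + scores[i + 1]) / 2
    return area / (grid[-1] - grid[0])


if __name__ == "__main__":
    times = [1.0, 3.0, 5.0]
    # A deliberately uninformative model: always predicts 0.5.
    coin_flip = lambda t: [0.5] * len(times)
    print(integrated_brier_score(times, coin_flip, [0.0, 2.0, 4.0]))
```

Lower values indicate better calibrated predictions. A model that always predicts 0.5 scores 0.25 at every time point, which gives a convenient baseline.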
Javadi, Firouzeh; Tun, Ye Tun; Kawase, Makoto; Guan, Kaiyun; Yamaguchi, Hirofumi
2011-08-01
The subgenus Ceratotropis in the genus Vigna is widely distributed from the Himalayan highlands to South, Southeast and East Asia. However, the interspecific and geographical relationships of its members are poorly understood. This study investigates the phylogeny and biogeography of the subgenus Ceratotropis using chloroplast DNA sequence data. Sequence data from four intergenic spacer regions (petA-psbJ, psbD-trnT, trnT-trnE and trnT-trnL) of chloroplast DNA, alone and in combination, were analysed using Bayesian and parsimony methods. Divergence times for major clades were estimated with penalized likelihood. Character evolution was examined by means of parsimony optimization and MacClade. Parsimony and Bayesian phylogenetic analyses on the combined data demonstrated well-resolved species relationships in which 18 Vigna species were divided into two major geographical clades: the East Asia-Southeast Asian clade and the Indian subcontinent clade. Within these two clades, three well-supported eco-geographical groups, temperate and subtropical (the East Asia-Southeast Asian clade) and tropical (the Indian subcontinent clade), are recognized. The temperate group consists of V. minima, V. nepalensis and V. angularis. The subtropical group comprises the V. nakashimae-V. riukiuensis-V. minima subgroup and the V. hirtella-V. exilis-V. umbellata subgroup. The tropical group contains two subgroups: the V. trinervia-V. reflexo-pilosa-V. trilobata subgroup and the V. mungo-V. grandiflora subgroup. An evolutionary rate analysis estimated the divergence time between the East Asia-Southeast Asia clade and the Indian subcontinent clade as 3·62 ± 0·3 million years, and that between the temperate and subtropical groups as 2·0 ± 0·2 million years. The findings provide an improved understanding of the interspecific relationships, and ecological and geographical phylogenetic structure of the subgenus Ceratotropis. 
The Quaternary diversification of the subgenus Ceratotropis implicates its geographical dispersal in the south-eastern part of Asia involving adaptation to climatic conditions after the collision of the Indian subcontinent with the Asian plate. The phylogenetic results indicate that the epigeal germination is plesiomorphic, and the germination type evolved independently multiple times in this subgenus, implying its limited taxonomic utility.
Schmidt-Lebuhn, Alexander N; Aitken, Nicola C; Chuah, Aaron
2017-11-01
Datasets of hundreds or thousands of SNPs (Single Nucleotide Polymorphisms) from multiple individuals per species are increasingly used to study population structure, species delimitation and shallow phylogenetics. The principal software tool to infer species or population trees from SNP data is currently the BEAST template SNAPP which uses a Bayesian coalescent analysis. However, it is computationally extremely demanding and tolerates only small amounts of missing data. We used simulated and empirical SNPs from plants (Australian Craspedia, Asteraceae, and Pelargonium, Geraniaceae) to compare species trees produced (1) by SNAPP, (2) using SVD quartets, and (3) using Bayesian and parsimony analysis with several different approaches to summarising data from multiple samples into one set of traits per species. Our aims were to explore the impact of tree topology and missing data on the results, and to test which data summarising and analyses approaches would best approximate the results obtained from SNAPP for empirical data. SVD quartets retrieved the correct topology from simulated data, as did SNAPP except in the case of a very unbalanced phylogeny. Both methods failed to retrieve the correct topology when large amounts of data were missing. Bayesian analysis of species level summary data scoring the two alleles of each SNP as independent characters and parsimony analysis of data scoring each SNP as one character produced trees with branch length distributions closest to the true trees on which SNPs were simulated. For empirical data, Bayesian inference and Dollo parsimony analysis of data scored allele-wise produced phylogenies most congruent with the results of SNAPP. 
In the case of study groups divergent enough for missing data to be phylogenetically informative (because of additional mutations preventing amplification of genomic fragments or bioinformatic establishment of homology), scoring of SNP data as a presence/absence matrix irrespective of allele content might be an additional option. As this depends on sampling across species being reasonably even and a random distribution of non-informative instances of missing data, however, further exploration of this approach is needed. Properly chosen data summary approaches to inferring species trees from SNP data may represent a potential alternative to currently available individual-level coalescent analyses especially for quick data exploration and when dealing with computationally demanding or patchy datasets. Crown Copyright © 2017. Published by Elsevier Inc. All rights reserved.
Marcussen, Thomas; Heier, Lise; Brysting, Anne K.; Oxelman, Bengt; Jakobsen, Kjetill S.
2015-01-01
Allopolyploidization accounts for a significant fraction of speciation events in many eukaryotic lineages. However, existing phylogenetic and dating methods require tree-like topologies and are unable to handle the network-like phylogenetic relationships of lineages containing allopolyploids. No explicit framework has so far been established for evaluating competing network topologies, and few attempts have been made to date phylogenetic networks. We used a four-step approach to generate a dated polyploid species network for the cosmopolitan angiosperm genus Viola L. (Violaceae Batch.). The genus contains ca 600 species and both recent (neo-) and more ancient (meso-) polyploid lineages distributed over 16 sections. First, we obtained DNA sequences of three low-copy nuclear genes and one chloroplast region, from 42 species representing all 16 sections. Second, we obtained fossil-calibrated chronograms for each nuclear gene marker. Third, we determined the most parsimonious multilabeled genome tree and its corresponding network, resolved at the section (not the species) level. Reconstructing the “correct” network for a set of polyploids depends on recovering all homoeologs, i.e., all subgenomes, in these polyploids. Assuming the presence of Viola subgenome lineages that were not detected by the nuclear gene phylogenies (“ghost subgenome lineages”) significantly reduced the number of inferred polyploidization events. We identified the most parsimonious network topology from a set of five competing scenarios differing in the interpretation of homoeolog extinctions and lineage sorting, based on (i) fewest possible ghost subgenome lineages, (ii) fewest possible polyploidization events, and (iii) least possible deviation from expected ploidy as inferred from available chromosome counts of the involved polyploid taxa. Finally, we estimated the homoploid and polyploid speciation times of the most parsimonious network. 
Homoploid speciation times were estimated by coalescent analysis of gene tree node ages. Polyploid speciation times were estimated by comparing branch lengths and speciation rates of lineages with and without ploidy shifts. Our analyses recognize Viola as an old genus (crown age 31 Ma) whose evolutionary history has been profoundly affected by allopolyploidy. Between 16 and 21 allopolyploidizations are necessary to explain the diversification of the 16 major lineages (sections) of Viola, suggesting that allopolyploidy has accounted for a high percentage—between 67% and 88%—of the speciation events at this level. The theoretical and methodological approaches presented here for (i) constructing networks and (ii) dating speciation events within a network, have general applicability for phylogenetic studies of groups where allopolyploidization has occurred. They make explicit use of a hitherto underexplored source of ploidy information from chromosome counts to help resolve phylogenetic cases where incomplete sequence data hampers network inference. Importantly, the coalescent-based method used herein circumvents the assumption of tree-like evolution required by most techniques for dating speciation events. PMID:25281848
EPR oximetry in three spatial dimensions using sparse spin distribution
NASA Astrophysics Data System (ADS)
Som, Subhojit; Potter, Lee C.; Ahmad, Rizwan; Vikram, Deepti S.; Kuppusamy, Periannan
2008-08-01
A method is presented to use continuous wave electron paramagnetic resonance imaging for rapid measurement of oxygen partial pressure in three spatial dimensions. A particulate paramagnetic probe is employed to create a sparse distribution of spins in a volume of interest. Information encoding location and spectral linewidth is collected by varying the spatial orientation and strength of an applied magnetic gradient field. Data processing exploits the spatial sparseness of spins to detect voxels with nonzero spin and to estimate the spectral linewidth for those voxels. The parsimonious representation of spin locations and linewidths permits an order of magnitude reduction in data acquisition time, compared to four-dimensional tomographic reconstruction using traditional spectral-spatial imaging. The proposed oximetry method is experimentally demonstrated for a lithium octa- n-butoxy naphthalocyanine (LiNc-BuO) probe using an L-band EPR spectrometer.
Yasukochi, Yoshiki; Satta, Yoko
2014-05-02
An extraordinary diversity of amino acid sequences in the peptide-binding region (PBR) of human leukocyte antigen [HLA; human major histocompatibility complex (MHC)] molecules has been maintained by balancing selection. The process of accumulation of amino acid diversity in the PBR for six HLA genes (HLA-A, B, C, DRB1, DQB1, and DPB1) shows that the number of amino acid substitutions in the PBR among alleles does not linearly correlate with the divergence time of alleles at the six HLA loci. At these loci, some pairs of alleles show significantly less nonsynonymous substitutions at the PBR than expected from the divergence time. The same phenomenon was observed not only in the HLA but also in the rat MHC. To identify the cause for this, DRB1 sequences, a representative case of a typical nonlinear pattern of substitutions, were examined. When the amino acid substitutions in the PBR were placed with maximum parsimony on a maximum likelihood tree based on the non-PBR substitutions, heterogeneous rates of nonsynonymous substitutions in the PBR were observed on several branches. A computer simulation supported the hypothesis that allelic pairs with low PBR substitution rates were responsible for the stagnation of accumulation of PBR nonsynonymous substitutions. From these observations, we conclude that the nonsynonymous substitution rate at the PBR sites is not constant among the allelic lineages. The deceleration of the rate may be caused by the coexistence of certain pathogens for a substantially long time during HLA evolution. Copyright © 2014 Yasukochi and Satta.
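Placing substitutions on a fixed tree with maximum parsimony, as the abstract above describes, is commonly done with Fitch's algorithm for a single unordered character. The sketch below is a minimal illustration of that general technique and is not the authors' pipeline. The nested-tuple tree encoding and the example leaf names are assumptions made for the example.

```python
def fitch_score(tree, leaf_states):
    """Return (candidate root states, minimum number of changes).

    tree: a leaf name (str) or a 2-tuple of subtrees (rooted, binary).
    leaf_states: dict mapping each leaf name to its observed state,
                 for example the amino acid at one alignment column.
    """
    if isinstance(tree, str):
        return {leaf_states[tree]}, 0
    left_set, left_cost = fitch_score(tree[0], leaf_states)
    right_set, right_cost = fitch_score(tree[1], leaf_states)
    shared = left_set & right_set
    if shared:
        # The children agree on at least one state: no extra change needed.
        return shared, left_cost + right_cost
    # The children disagree: one change is required on this pair of branches.
    return left_set | right_set, left_cost + right_cost + 1


if __name__ == "__main__":
    # Hypothetical four-allele tree and one alignment column.
    tree = (("allele1", "allele2"), ("allele3", "allele4"))
    column = {"allele1": "S", "allele2": "S", "allele3": "T", "allele4": "S"}
    states, changes = fitch_score(tree, column)
    print(states, changes)
```

Summing the change counts over all columns of an alignment gives the parsimony length of the tree. Mapping each change to a specific branch would need the second, top-down pass of the algorithm, which is left out here.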
Silencing, Positive Selection and Parallel Evolution: Busy History of Primate Cytochromes c
Pierron, Denis; Opazo, Juan C.; Heiske, Margit; Papper, Zack; Uddin, Monica; Chand, Gopi; Wildman, Derek E.; Romero, Roberto; Goodman, Morris; Grossman, Lawrence I.
2011-01-01
Cytochrome c (cyt c) participates in two crucial cellular processes, energy production and apoptosis, and unsurprisingly is a highly conserved protein. However, previous studies have reported for the primate lineage (i) loss of the paralogous testis isoform, (ii) an acceleration and then a deceleration of the amino acid replacement rate of the cyt c somatic isoform, and (iii) atypical biochemical behavior of human cyt c. To gain insight into the cause of these major evolutionary events, we have retraced the history of cyt c loci among primates. For testis cyt c, all primate sequences examined carry the same nonsense mutation, which suggests that silencing occurred before the primates diversified. For somatic cyt c, maximum parsimony, maximum likelihood, and Bayesian phylogenetic analyses yielded the same tree topology. The evolutionary analyses show that a fast accumulation of non-synonymous mutations (suggesting positive selection) occurred specifically on the anthropoid lineage root and then continued in parallel on the early catarrhini and platyrrhini stems. Analysis of evolutionary changes using the 3D structure suggests they are focused on the respiratory chain rather than on apoptosis or other cyt c functions. In agreement with previous biochemical studies, our results suggest that silencing of the cyt c testis isoform could be linked with the decrease of primate reproduction rate. Finally, the evolution of cyt c in the two sister anthropoid groups leads us to propose that somatic cyt c evolution may be related both to COX evolution and to the convergent brain and body mass enlargement in these two anthropoid clades. PMID:22028846
Zhao, Lei; Annie, Ang Shi Hui; Amrita, Srivathsan; Yi, Su Kathy Feng; Rudolf, Meier
2013-10-01
We here present a phylogenetic hypothesis for Sepsidae (Diptera: Cyclorrhapha), a group of schizophoran flies with ca. 320 described species that is widely used in sexual selection research. The hypothesis is based on five nuclear and five mitochondrial markers totaling 8813 bp for ca. 30% of the diversity (105 sepsid taxa) and - depending on analysis - six or nine outgroup species. Maximum parsimony (MP), maximum likelihood (ML), and Bayesian inferences (BI) yield overall congruent, well-resolved, and supported trees that are largely unaffected by three different ways to partition the data in BI and ML analyses. However, there are also five areas of uncertainty that affect suprageneric relationships where different analyses yield alternate topologies and MP and ML trees have significant conflict according to Shimodaira-Hasegawa tests. Two of these were already affected by conflict in a previous analysis that was based on the same genes and a subset of 69 species. The remaining three involve newly added taxa or genera whose relationships were previously resolved with low support. We thus find that the denser taxon sample in the present analysis does not reduce the topological conflict that had been identified previously. The present study nevertheless presents a significant contribution to the understanding of sepsid relationships in that 50 additional taxa from 18 genera are added to the Tree-of-Life of Sepsidae and that the placement of most taxa is well supported and robust to different tree reconstruction techniques. Copyright © 2013 Elsevier Inc. All rights reserved.
Nelson, Randin; Cañate, Raul; Pascale, Juan Miguel; Dragoo, Jerry W; Armien, Blas; Armien, Anibal G; Koster, Frederick
2010-09-01
Choclo virus (CHOV) was described in sigmodontine rodents, Oligoryzomys fulvescens, and humans during an outbreak of hantavirus cardiopulmonary syndrome (HCPS) in 1999-2000 in western Panama. Although HCPS is rare, hantavirus-specific serum antibody prevalence among the general population is high suggesting that CHOV may cause many mild or asymptomatic infections. The goals of this study were to confirm the role of CHOV in HCPS and in the frequently detected serum antibody and to establish the phylogenetic relationship with other New World hantaviruses. CHOV was cultured to facilitate the sequencing of the small (S) and medium (M) segments and to perform CHOV-specific serum neutralization antibody assays. Sequences of the S and M segments found a close relationship to other Oligoryzomys-borne hantaviruses in the Americas, highly conserved terminal nucleotides, and no evidence for recombination events. The maximum likelihood and maximum parsimony analyses of complete M segment nucleotide sequences indicate a close relationship to Maporal and Laguna Negra viruses, found at the base of the South American clade. In a focus neutralization assay acute and convalescent sera from six Panamanian HCPS patients neutralized CHOV in dilutions from 1:200 to 1:6,400. In a sample of antibody-positive adults without a history of HCPS, 9 of 10 sera neutralized CHOV in dilutions ranging from 1:100 to 1:6,400. Although cross-neutralization with other sympatric hantaviruses not yet associated with human disease is possible, CHOV appears to be the causal agent for most of the mild or asymptomatic hantavirus infections, as well as HCPS, in Panama.
Gritsun, T S; Venugopal, K; Zanotto, P M; Mikhailov, M V; Sall, A A; Holmes, E C; Polkinghorne, I; Frolova, T V; Pogodina, V V; Lashkevich, V A; Gould, E A
1997-05-01
The complete nucleotide sequences of two tick-transmitted flaviviruses, Vasilchenko (Vs) from Siberia and louping ill (LI) from the UK, have been determined. The genomes were 10928 and 10871 nucleotides (nt) in length, respectively. The coding strategy and functional protein sequence motifs of tick-borne flaviviruses are present in both Vs and LI viruses. The phylogenies based on maximum likelihood, maximum parsimony and distance analysis of the polyproteins identified Vs virus as a member of the tick-borne encephalitis virus subgroup within the tick-borne serocomplex, genus Flavivirus, family Flaviviridae. Comparative alignment of the 3'-untranslated regions revealed deletions of different lengths essentially at the same position downstream of the stop codon for all tick-borne viruses. Two direct 27 nucleotide repeats at the 3'-end were found only for Vs and LI virus. Immediately following the deletions, a region of 332-334 nt with relatively conserved primary structure (67-94% identity) was observed at the 3'-non-coding end of the virus genome. Pairwise comparisons of the nucleotide sequence data revealed similar levels of variation between the coding region, and the 5' and 3'-termini of the genome, implying an equivalent strong selective control for translated and untranslated regions. Indeed, the predicted folding of the 5' and 3'-untranslated regions revealed patterns of stem and loop structures conserved for all tick-borne flaviviruses, suggesting a purifying selection for preservation of essential RNA secondary structures which could be involved in translational control and replication. The possible implications of these findings are discussed.
NASA Astrophysics Data System (ADS)
Alexander, J. S.; McElroy, B. J.
2015-12-01
Bar forms in wide sandy rivers store sediment, control channel hydraulics, and are fundamental units of riverine ecosystems. Bar form height is often used as a measure of channel depth in ancient fluvial deposits and is also a crucially important measure of habitat quality in modern rivers. In the Great Plains of North America, priority bird species use emergent bars to nest, and sandbar heights are a direct predictor of flood hazard for bird nests. Our current understanding of controls on bar height is limited to few datasets and ad hoc observations from specific settings. We here examine a new dataset of bar heights and explore models of bar growth. We present a bar height dataset from the Platte and Niobrara Rivers in Nebraska, and an unchannelized reach of the Missouri River along the Nebraska-South Dakota border. Bar height data are normalized by flow frequency, and we examine parsimonious statistical models between expected controls (depth, stage, discharge, flow duration, work etc.) and maximum bar heights. From this we generate empirical-statistical models of maximum bar height for wide, sand-bedded rivers in the Great Plains of the United States and rivers of similar morphology elsewhere. Migration of bar forms is driven by downstream slip-face additions of sediment sourced from their stoss sides, but bars also sequester sediment and grow vertically and longitudinally. We explore our empirical data with a geometric-kinematic model of bar growth driven by sediment transport from smaller-scale bedforms. Our goal is to understand physical limitations on bar growth and geometry, with implications for interpreting the rock record and predicting physically-driven riverine habitat variables.
Graf, Daniel L; Jones, Hugh; Geneva, Anthony J; Pfeiffer, John M; Klunzinger, Michael W
2015-04-01
The freshwater mussel family Hyriidae (Mollusca: Bivalvia: Unionida) has a disjunct trans-Pacific distribution in Australasia and South America. Previous phylogenetic analyses have estimated the evolutionary relationships of the family and the major infra-familial taxa (Velesunioninae and Hyriinae: Hyridellini in Australia; Hyriinae: Hyriini, Castaliini, and Rhipidodontini in South America), but taxon and character sampling have been too incomplete to support a predictive classification or allow testing of biogeographical hypotheses. We sampled 30 freshwater mussel individuals representing the aforementioned hyriid taxa, as well as outgroup species representing the five other freshwater mussel families and their marine sister group (order Trigoniida). Our ingroup included representatives of all Australian genera. Phylogenetic relationships were estimated from three gene fragments (nuclear 28S, COI and 16S mtDNA) using maximum parsimony, maximum likelihood, and Bayesian inference, and we applied a Bayesian relaxed clock model calibrated with fossil dates to estimate node ages. Our analyses found good support for monophyly of the Hyriidae and the subfamilies and tribes, as well as the paraphyly of the Australasian taxa (Velesunioninae, (Hyridellini, (Rhipidodontini, (Castaliini, Hyriini)))). The Hyriidae was recovered as sister to a clade comprised of all other Recent freshwater mussel families. Our molecular date estimation supported Cretaceous origins of the major hyriid clades, pre-dating the Tertiary isolation of South America from Antarctica/Australia. We hypothesize that early diversification of the Hyriidae was driven by terrestrial barriers on Gondwana rather than marine barriers following disintegration of the super-continent. Copyright © 2015 Elsevier Inc. All rights reserved.
García-Varela, Martín; García-Prieto, Luís; Rodríguez, Rodolfo Pérez
2011-12-01
The morphology of the males of Neoechinorhynchus schmidti (Acanthocephala: Neoechinorhynchidae) is unknown, because this species was described based exclusively on females. However, recently we collected 2 common slider turtles Trachemys scripta in Centla swamps, Tabasco, Mexico, parasitized by 27 specimens of an acanthocephalan whose females were morphologically identical to N. schmidti. The domains D2 and D3 of the large subunit of the nuclear ribosomal RNA (LSU) of 3 males and 2 females of this material were sequenced. The sequences of both sexes were identical, and based on this result, we described for the first time the morphology of the males of N. schmidti. In addition, 6 sequences of a congeneric species, also parasite of turtles (Neoechinorhynchus emyditoides) were generated in the current research. The 11 sequences of these 2 species were aligned with 13 sequences of another 4 species of the same genus, producing a data set of 24 taxa with 674 nucleotides. The genetic divergence between N. schmidti and N. emyditoides was 4% and intraspecific differences ranged from 0.01 to 0.02%. Pairwise differences between either of these species and 4 other congeners parasitic in fresh and brackish water fishes (Neoechinorhynchus golvani, Neoechinorhynchus roseum, Neoechinorhynchus saginatus, and Neoechinorhynchus sp.) varied from 9.5 to 33%. Maximum likelihood and maximum parsimony analyses show that N. schmidti and N. emyditoides are sister taxa. Bootstrap analysis also indicates that the sister relationship is reliably supported. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
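The divergence percentages quoted in the abstract above are pairwise differences between aligned sequences. The abstract does not say which distance measure was used, so the sketch below assumes the simplest one, the uncorrected p-distance, which is the fraction of compared sites that differ. The function name and the choice to skip gap and ambiguity characters are assumptions made for illustration.

```python
def p_distance(seq_a, seq_b, skip_chars="-N?"):
    """Uncorrected p-distance between two aligned sequences.

    Sites where either sequence has a gap or ambiguity code are ignored
    (an assumption; other conventions exist).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differences = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in skip_chars or b in skip_chars:
            continue
        compared += 1
        if a != b:
            differences += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differences / compared


if __name__ == "__main__":
    # Toy 10-site alignment with one mismatch, i.e. 10 % divergence.
    print(100 * p_distance("ACGTACGTAC", "ACGTACGTAA"))
```

On a 674-nucleotide alignment such as the one described, a 4% divergence under this measure would correspond to roughly 27 differing sites.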
Umboniibacter roseus sp. nov., isolated from coastal seawater.
Sung, Hye-Ri; Kim, Mibang; Shin, Kee-Sun
2015-11-01
A Gram-reaction-negative, non-motile, strictly aerobic, dark pink-pigmented and rod-shaped bacterial isolate, designated 14-121-B13T, was isolated from surface seawater off the coast of the South Sea at Namhae-gun, Republic of Korea. Cells were catalase- and oxidase-positive and required NaCl for growth. Strain 14-121-B13T grew optimally at 30 °C, in the presence of 2 % (w/v) NaCl and at pH 7.5-8.0. Neighbour-joining, maximum-likelihood and maximum-parsimony phylogenetic trees based on 16S rRNA gene sequences showed that strain 14-121-B13T clustered with the type strain of Umboniibacter marinipuniceus, with which it exhibited 96.7 % sequence similarity. The DNA G+C content of strain 14-121-B13T was 48.9 mol%. The major cellular fatty acids were summed feature 3 (C16 : 1ω7c and/or C16 : 1ω6c) and C16 : 0. The major respiratory quinone was ubiquinone Q-7 and the polar lipids detected in strain 14-121-B13T were diphosphatidylglycerol, phosphatidylglycerol, phosphatidylethanolamine, an unidentified aminolipid, unidentified phospholipids, unidentified aminophospholipids and unidentified lipids. Based on the phenotypic, chemotaxonomic and phylogenetic data presented, strain 14-121-B13T is considered to represent a novel species of the genus Umboniibacter, for which the name Umboniibacter roseus sp. nov. is proposed. The type strain is 14-121-B13T ( = DSM 29882T = KCTC 42467T).
Saarela, Jeffery M; Wysocki, William P; Barrett, Craig F; Soreng, Robert J; Davis, Jerrold I; Clark, Lynn G; Kelchner, Scot A; Pires, J Chris; Edger, Patrick P; Mayfield, Dustin R; Duvall, Melvin R
2015-05-04
Whole plastid genomes are being sequenced rapidly from across the green plant tree of life, and phylogenetic analyses of these are increasing resolution and support for relationships that have varied among or been unresolved in earlier single- and multi-gene studies. Pooideae, the cool-season grass lineage, is the largest of the 12 grass subfamilies and includes important temperate cereals, turf grasses and forage species. Although numerous studies of the phylogeny of the subfamily have been undertaken, relationships among some 'early-diverging' tribes conflict among studies, and some relationships among subtribes of Poeae have not yet been resolved. To address these issues, we newly sequenced 25 whole plastomes, which showed rearrangements typical of Poaceae. These plastomes represent 9 tribes and 11 subtribes of Pooideae, and were analysed with 20 existing plastomes for the subfamily. Maximum likelihood (ML), maximum parsimony (MP) and Bayesian inference (BI) robustly resolve most deep relationships in the subfamily. Complete plastome data provide increased nodal support compared with protein-coding data alone at nodes that are not maximally supported. Following the divergence of Brachyelytrum, Phaenospermateae, Brylkinieae-Meliceae and Ampelodesmeae-Stipeae are the successive sister groups of the rest of the subfamily. Ampelodesmeae are nested within Stipeae in the plastome trees, consistent with its hybrid origin between a phaenospermatoid and a stipoid grass (the maternal parent). The core Pooideae are strongly supported and include Brachypodieae, a Bromeae-Triticeae clade and Poeae. Within Poeae, a novel sister group relationship between Phalaridinae and Torreyochloinae is found, and the relative branching order of this clade and Aveninae, with respect to an Agrostidinae-Brizinae clade, are discordant between MP and ML/BI trees. 
Maximum likelihood and Bayesian analyses strongly support Airinae and Holcinae as the successive sister groups of a Dactylidinae-Loliinae clade. Published by Oxford University Press on behalf of the Annals of Botany Company.
Moretzsohn, Márcio C.; Gouvea, Ediene G.; Inglis, Peter W.; Leal-Bertioli, Soraya C. M.; Valls, José F. M.; Bertioli, David J.
2013-01-01
Background and Aims The genus Arachis contains 80 described species. Section Arachis is of particular interest because it includes cultivated peanut, an allotetraploid, and closely related wild species, most of which are diploids. This study aimed to analyse the genetic relationships of multiple accessions of section Arachis species using two complementary methods. Microsatellites allowed the analysis of inter- and intraspecific variability. Intron sequences from single-copy genes allowed phylogenetic analysis including the separation of the allotetraploid genome components. Methods Intron sequences and microsatellite markers were used to reconstruct phylogenetic relationships in section Arachis through maximum parsimony and genetic distance analyses. Key Results Although high intraspecific variability was evident, there was good support for most species. However, some problems were revealed, notably a probable polyphyletic origin for A. kuhlmannii. The validity of the genome groups was well supported. The F, K and D genomes grouped close to the A genome group. The 2n = 18 species grouped closer to the B genome group. The phylogenetic tree based on the intron data strongly indicated that A. duranensis and A. ipaënsis are the ancestors of A. hypogaea and A. monticola. Intron nucleotide substitutions allowed the ages of divergences of the main genome groups to be estimated at a relatively recent 2·3–2·9 million years ago. This age and the number of species described indicate a much higher speciation rate for section Arachis than for legumes in general. Conclusions The analyses revealed relationships between the species and genome groups and showed a generally high level of intraspecific genetic diversity. The improved knowledge of species relationships should facilitate the utilization of wild species for peanut improvement. The estimates of speciation rates in section Arachis are high, but not unprecedented. 
We suggest these high rates may be linked to the peculiar reproductive biology of Arachis. PMID:23131301
The Impact of Missing Data on Species Tree Estimation.
Xi, Zhenxiang; Liu, Liang; Davis, Charles C
2016-03-01
Phylogeneticists are increasingly assembling genome-scale data sets that include hundreds of genes to resolve their focal clades. Although these data sets commonly include a moderate to high amount of missing data, there remains no consensus on their impact to species tree estimation. Here, using several simulated and empirical data sets, we assess the effects of missing data on species tree estimation under varying degrees of incomplete lineage sorting (ILS) and gene rate heterogeneity. We demonstrate that concatenation (RAxML), gene-tree-based coalescent (ASTRAL, MP-EST, and STAR), and supertree (matrix representation with parsimony [MRP]) methods perform reliably, so long as missing data are randomly distributed (by gene and/or by species) and that a sufficiently large number of genes are sampled. When data sets are indecisive sensu Sanderson et al. (2010. Phylogenomics with incomplete taxon coverage: the limits to inference. BMC Evol Biol. 10:155) and/or ILS is high, however, high amounts of missing data that are randomly distributed require exhaustive levels of gene sampling, likely exceeding most empirical studies to date. Moreover, missing data become especially problematic when they are nonrandomly distributed. We demonstrate that STAR produces inconsistent results when the amount of nonrandom missing data is high, regardless of the degree of ILS and gene rate heterogeneity. Similarly, concatenation methods using maximum likelihood can be misled by nonrandom missing data in the presence of gene rate heterogeneity, which becomes further exacerbated when combined with high ILS. In contrast, ASTRAL, MP-EST, and MRP are more robust under all of these scenarios. These results underscore the importance of understanding the influence of missing data in the phylogenomics era. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. 
Post-Boreotropical dispersals explain the pantropical disjunction in Paederia (Rubiaceae)
Nie, Ze-Long; Deng, Tao; Meng, Ying; Sun, Hang; Wen, Jun
2013-01-01
Background and Aims Pantropical intercontinental disjunction is a common biogeographical pattern in flowering plants exhibiting a discontinuous distribution primarily in tropical Asia, Africa and the Americas. Only a few plant groups with this pattern have been investigated at the generic level with molecular phylogenetic and biogeographical methods. Paederia (Rubiaceae) is a pantropical genus of 31 species of woody lianas, with the greatest species diversity in continental Asia and Madagascar and only two species from tropical America. The aim of this study was to reconstruct the biogeographical history of Paederia based on phylogenetic analyses to explore how the genus attained its pantropical distribution. Methods Maximum parsimony and Bayesian inference were used for phylogenetic analyses using sequences of five plastid markers (the rbcL gene, rps16 intron, trnT-F region, atpB-rbcL spacer and psbA-trnH spacer). Biogeographical inferences were based on a Bayesian uncorrelated lognormal relaxed molecular clock together with both Bayesian and likelihood ancestral area reconstructions. Key Results The data suggest an early diverged Asian lineage sister to the clade of the remaining species consisting of a predominantly Asian sub-clade and a primarily Malagasy sub-clade. Paederia is inferred to have originated in the Oligocene in tropical continental Asia. It then reached Africa in the early to middle Miocene, most probably via long-distance dispersal across the Indian Ocean. The two Neotropical species are inferred to have derived independently in the late Miocene from ancestors of Asia and East Africa, respectively. Conclusions The results demonstrate the importance of post-Boreotropical long-distance dispersals (across three major oceans) in shaping the global pantropical disjunction in some plants, such as Paederia, with small, winged diaspores adapted to long-distance dispersal by various agents including wind, ocean currents or birds. 
Overland migration is less likely to explain its palaeotropical disjunction between Asia and Africa. PMID:23478944
2011-01-01
Background The avian family Cettiidae, including the genera Cettia, Urosphena, Tesia, Abroscopus and Tickellia and Orthotomus cucullatus, has recently been proposed based on analysis of a small number of loci and species. The close relationship of most of these taxa was unexpected, and called for a comprehensive study based on multiple loci and dense taxon sampling. In the present study, we infer the relationships of all except one of the species in this family using one mitochondrial and three nuclear loci. We use traditional gene tree methods (Bayesian inference, maximum likelihood bootstrapping, parsimony bootstrapping), as well as a recently developed Bayesian species tree approach (*BEAST) that accounts for lineage sorting processes that might produce discordance between gene trees. We also analyse mitochondrial DNA for a larger sample, comprising multiple individuals and a large number of subspecies of polytypic species. Results There are many topological incongruences among the single-locus trees, although none of these is strongly supported. The multi-locus tree inferred using concatenated sequences and the species tree agree well with each other, and are overall well resolved and well supported by the data. The main discrepancy between these trees concerns the most basal split. Both methods infer the genus Cettia to be highly non-monophyletic, as it is scattered across the entire family tree. Deep intraspecific divergences are revealed, and one or two species and one subspecies are inferred to be non-monophyletic (differences between methods). Conclusions The molecular phylogeny presented here is strongly inconsistent with the traditional, morphology-based classification. The remarkably high degree of non-monophyly in the genus Cettia is likely to be one of the most extraordinary examples of misconceived relationships in an avian genus. 
The phylogeny suggests instances of parallel evolution, as well as highly unequal rates of morphological divergence in different lineages. This complex morphological evolution apparently misled earlier taxonomists. These results underscore the well-known but still often neglected problem of basing classifications on overall morphological similarity. Based on the molecular data, a revised taxonomy is proposed. Although the traditional and species tree methods inferred much the same tree in the present study, the assumption by species tree methods that all species are monophyletic is a limitation in these methods, as some currently recognized species might have more complex histories. PMID:22142197
Diffusion of Innovations in Service Organizations: Systematic Review and Recommendations
Greenhalgh, Trisha; Robert, Glenn; Macfarlane, Fraser; Bate, Paul; Kyriakidou, Olivia
2004-01-01
This article summarizes an extensive literature review addressing the question, How can we spread and sustain innovations in health service delivery and organization? It considers both content (defining and measuring the diffusion of innovation in organizations) and process (reviewing the literature in a systematic and reproducible way). This article discusses (1) a parsimonious and evidence-based model for considering the diffusion of innovations in health service organizations, (2) clear knowledge gaps where further research should be focused, and (3) a robust and transferable methodology for systematically reviewing health service policy and management. Both the model and the method should be tested more widely in a range of contexts. PMID:15595944
Banerjee, Biswanath; Roy, Debasish; Vasu, Ram Mohan
2009-08-01
A computationally efficient pseudodynamical filtering setup is established for elasticity imaging (i.e., reconstruction of shear modulus distribution) in soft-tissue organs given statically recorded and partially measured displacement data. Unlike a regularized quasi-Newton method (QNM) that needs inversion of ill-conditioned matrices, the authors explore pseudodynamic extended and ensemble Kalman filters (PD-EKF and PD-EnKF) that use a parsimonious representation of states and bypass explicit regularization by recursion over pseudotime. Numerical experiments with QNM and the two filters suggest that the PD-EnKF is the most robust performer as it exhibits no sensitivity to process noise covariance and yields good reconstruction even with small ensemble sizes.
Leung, Janet T Y; Shek, Daniel T L
2011-01-01
This paper examines the use of quantitative and qualitative approaches to study the impact of economic disadvantage on family processes and adolescent development. Quantitative research has the merits of objectivity, good predictive and explanatory power, parsimony, precision and sophistication of analysis. Qualitative research, in contrast, provides a detailed, holistic, in-depth understanding of social reality and allows illumination of new insights. With the pragmatic considerations of methodological appropriateness, design flexibility, and situational responsiveness in responding to the research inquiry, a mixed methods approach could be a possibility of integrating quantitative and qualitative approaches and offers an alternative strategy to study the impact of economic disadvantage on family processes and adolescent development.
Traffic offense sentencing processes and highway safety. Volume 1, Summary report
DOT National Transportation Integrated Search
1977-04-01
The history and development of traffic offense sanctions are reviewed. Criteria for traffic offense sanctions are discussed in terms of evenness, economy, appropriateness, rational allocation, effectiveness and parsimony. The framework for developmen...
Shagin, Dmitry A; Barsova, Ekaterina V; Yanushevich, Yurii G; Fradkov, Arkady F; Lukyanov, Konstantin A; Labas, Yulii A; Semenova, Tatiana N; Ugalde, Juan A; Meyers, Ann; Nunez, Jose M; Widder, Edith A; Lukyanov, Sergey A; Matz, Mikhail V
2004-05-01
Homologs of the green fluorescent protein (GFP), including the recently described GFP-like domains of certain extracellular matrix proteins in Bilaterian organisms, are remarkably similar at the protein structure level, yet they often perform totally unrelated functions, thereby warranting recognition as a superfamily. Here we describe diverse GFP-like proteins from previously undersampled and completely new sources, including hydromedusae and planktonic Copepoda. In hydromedusae, yellow and nonfluorescent purple proteins were found in addition to greens. Notably, the new yellow protein seems to follow exactly the same structural solution to achieving the yellow color of fluorescence as YFP, an engineered yellow-emitting mutant variant of GFP. The addition of these new sequences made it possible to resolve deep-level phylogenetic relationships within the superfamily. Fluorescence (most likely green) must have already existed in the common ancestor of Cnidaria and Bilateria, and therefore GFP-like proteins may be responsible for fluorescence and/or coloration in virtually any animal. At least 15 color diversification events can be inferred following the maximum parsimony principle in Cnidaria. Origination of red fluorescence and nonfluorescent purple-blue colors on several independent occasions provides a remarkable example of convergent evolution of complex features at the molecular level.
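As an illustration of how a number of character-state changes can be inferred under the maximum parsimony principle, the following is a minimal sketch of Fitch's small-parsimony count on a rooted binary tree. The tree shape, taxon names and color states are hypothetical and are not taken from the study; the abstract does not say which parsimony implementation the authors used.

```python
def fitch_score(tree, leaf_states):
    """Minimum number of state changes on a rooted binary tree (Fitch's method).

    tree        -- nested 2-tuples whose leaves are taxon names (strings)
    leaf_states -- dict mapping each taxon name to its observed state
    """
    changes = 0

    def visit(node):
        nonlocal changes
        if isinstance(node, str):            # leaf: its state set is the observed state
            return {leaf_states[node]}
        left, right = (visit(child) for child in node)
        common = left & right
        if common:                           # children agree on some state: no change needed
            return common
        changes += 1                         # children disagree: one change on this subtree
        return left | right

    visit(tree)
    return changes


# Hypothetical example: five taxa with three color states.
tree = ((("A", "B"), "C"), ("D", "E"))
colors = {"A": "green", "B": "green", "C": "yellow", "D": "green", "E": "purple"}
print(fitch_score(tree, colors))  # 2
```

With green at the root, one change to yellow and one to purple explain the data, so the count is 2.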
Sun, Xiaoqin; Wei, Yanglian; Qin, Minjian; Guo, Qiaosheng; Guo, Jianlin; Zhou, Yifeng; Hang, Yueyu
2012-03-01
The rDNA ITS region of 18 samples of Changium smyrnioides from 7 areas and of 2 samples of Chuanminshen violaceum were sequenced and analyzed. The amplified ITS region of the samples, including a partial sequence of ITS1 and complete sequences of 5.8S and ITS2, had a total length of 555 bp. After complete alignment, there were 49 variable sites, of which 45 were informative, when gaps were treated as missing data. Samples of C. smyrnioides from different locations could be identified exactly based on the variable sites. The maximum parsimony (MP) and neighbor joining (NJ) trees constructed from the ITS sequences based on Kimura's two-parameter model showed that the genetic distances of the C. smyrnioides samples from different locations were not always related to their geographical distances. A specific primer set for Allele-specific PCR authentication of C. violaceum from Jurong of Jiangsu was designed based on the SNP in the ITS sequence alignment. C. violaceum from the major genuine producing area in Jurong of Jiangsu could be identified exactly and quickly by Allele-specific PCR.
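The two-parameter distance mentioned above can be written as d = -1/2 ln[(1 - 2P - Q) sqrt(1 - 2Q)], where P and Q are the proportions of sites that differ by a transition and by a transversion. The sketch below computes it for one pair of aligned sequences; the function name and the example sequences are hypothetical, and skipping gapped sites is an assumption meant to mirror "gaps were treated as missing data".

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(seq1, seq2):
    """Two-parameter (transition/transversion) distance between two aligned sequences."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    transitions = transversions = compared = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue                          # gap or ambiguity code: treat as missing data
        compared += 1
        if a == b:
            continue
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1                  # A<->G or C<->T
        else:
            transversions += 1                # purine<->pyrimidine
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    # math.log raises ValueError when the sequences are too divergent (argument <= 0).
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))


# Hypothetical pair: 10 sites, one transition (A/G) and one transversion (A/C).
d = k2p_distance("AAAAAAAAAA", "GCAAAAAAAA")
print(round(d, 4))
```

A matrix of such pairwise distances is what a neighbor-joining program takes as input.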
Mélade, Julien; Wieseke, Nicolas; Ramasindrazana, Beza; Flores, Olivier; Lagadec, Erwan; Gomard, Yann; Goodman, Steven M; Dellagi, Koussay; Pascalis, Hervé
2016-04-12
An eco-epidemiological investigation was carried out on Madagascar bat communities to better understand the evolutionary mechanisms and environmental factors that affect virus transmission among bat species in closely related members of the genus Morbillivirus, currently referred to as Unclassified Morbilli-related paramyxoviruses (UMRVs). A total of 947 bats were investigated originating from 52 capture sites (22 caves, 18 buildings, and 12 outdoor sites) distributed over different bioclimatic zones of the island. Using RT-PCR targeting the L-polymerase gene of the Paramyxoviridae family, we found that 10.5% of sampled bats were infected, representing six out of seven families and 15 out of 31 species analyzed. Univariate analysis indicates that both abiotic and biotic factors may promote viral infection. Using generalized linear modeling of UMRV infection overlaid on biotic and abiotic variables, we demonstrate that sympatric occurrence of bats is a major factor for virus transmission. Phylogenetic analyses revealed that all paramyxoviruses infecting Malagasy bats are UMRVs and showed little host specificity. Analyses using the maximum parsimony reconciliation tool CoRe-PA indicate that host-switching, rather than co-speciation, is the dominant macro-evolutionary mechanism of UMRVs among Malagasy bats.
COMPASS: a suite of pre- and post-search proteomics software tools for OMSSA
Wenger, Craig D.; Phanstiel, Douglas H.; Lee, M. Violet; Bailey, Derek J.; Coon, Joshua J.
2011-01-01
Here we present the Coon OMSSA Proteomic Analysis Software Suite (COMPASS): a free and open-source software pipeline for high-throughput analysis of proteomics data, designed around the Open Mass Spectrometry Search Algorithm. We detail a synergistic set of tools for protein database generation, spectral reduction, peptide false discovery rate analysis, peptide quantitation via isobaric labeling, protein parsimony and protein false discovery rate analysis, and protein quantitation. We strive for maximum ease of use, utilizing graphical user interfaces and working with data files in the original instrument vendor format. Results are stored in plain text comma-separated values files, which are easy to view and manipulate with a text editor or spreadsheet program. We illustrate the operation and efficacy of COMPASS through the use of two LC–MS/MS datasets. The first is a dataset of a highly annotated mixture of standard proteins and manually validated contaminants that exhibits the identification workflow. The second is a dataset of yeast peptides, labeled with isobaric stable isotope tags and mixed in known ratios, to demonstrate the quantitative workflow. For these two datasets, COMPASS performs equivalently or better than the current de facto standard, the Trans-Proteomic Pipeline. PMID:21298793
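Protein parsimony is commonly framed as finding a small set of proteins that accounts for every identified peptide. The following is a minimal sketch of a generic greedy set-cover heuristic for that framing. It is not COMPASS's algorithm, which the abstract does not describe, and the function name, protein names and peptide labels are all hypothetical.

```python
def parsimonious_proteins(protein_to_peptides):
    """Greedily pick proteins until every observed peptide is explained.

    protein_to_peptides -- dict mapping protein name -> set of peptide identifiers
    Returns the chosen protein names in the order they were selected.
    """
    uncovered = set().union(*protein_to_peptides.values())
    chosen = []
    while uncovered:
        # Take the protein explaining the most still-unexplained peptides;
        # iterating in sorted order makes ties resolve deterministically.
        best = max(sorted(protein_to_peptides),
                   key=lambda name: len(protein_to_peptides[name] & uncovered))
        chosen.append(best)
        uncovered -= protein_to_peptides[best]
    return chosen


# Hypothetical identifications: P2 and P4 add nothing beyond P1 and P3.
hits = {
    "P1": {"pepA", "pepB", "pepC"},
    "P2": {"pepA", "pepB"},
    "P3": {"pepC", "pepD"},
    "P4": {"pepD"},
}
print(parsimonious_proteins(hits))  # ['P1', 'P3']
```

A greedy cover is not guaranteed to be the smallest possible set, which is one reason real pipelines may apply additional grouping rules.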
Microbial Diversity in Soil, Sand Dune and Rock Substrates of the Thar Monsoon Desert, India.
Rao, Subramanya; Chan, Yuki; Bugler-Lacap, Donnabella C; Bhatnagar, Ashish; Bhatnagar, Monica; Pointing, Stephen B
2016-03-01
A culture-independent diversity assessment of archaea, bacteria and fungi in the Thar Desert in India was made. Six locations in Ajmer, Jaisalmer, Jaipur and Jodhpur included semi-arid soils, arid soils, arid sand dunes, plus arid cryptoendolithic substrates. A real-time quantitative PCR approach revealed that bacteria dominated soils and cryptoendoliths, whilst fungi dominated sand dunes. The archaea formed a minor component of all communities. Comparison of rRNA-defined community structure revealed that substrate and climate rather than location were the most parsimonious predictors. Sequence-based identification of 1240 phylotypes revealed that most taxa were common desert microorganisms. Semi-arid soils were dominated by actinobacteria and alpha proteobacteria, arid soils by chloroflexi and alpha proteobacteria, sand dunes by ascomycete fungi and cryptoendoliths by cyanobacteria. Climatic variables that best explained this distribution were mean annual rainfall and maximum annual temperature. Substrate variables that contributed most to observed diversity patterns were conductivity, soluble salts, Ca(2+) and pH. This represents an important addition to the inventory of desert microbiota, novel insight into the abiotic drivers of community assembly, and the first report of biodiversity in a monsoon desert system.
Stolzer, Maureen; Lai, Han; Xu, Minli; Sathaye, Deepa; Vernot, Benjamin; Durand, Dannie
2012-09-15
Gene duplication (D), transfer (T), loss (L) and incomplete lineage sorting (I) are crucial to the evolution of gene families and the emergence of novel functions. The history of these events can be inferred via comparison of gene and species trees, a process called reconciliation, yet current reconciliation algorithms model only a subset of these evolutionary processes. We present an algorithm to reconcile a binary gene tree with a nonbinary species tree under a DTLI parsimony criterion. This is the first reconciliation algorithm to capture all four evolutionary processes driving tree incongruence and the first to reconcile non-binary species trees with a transfer model. Our algorithm infers all optimal solutions and reports complete, temporally feasible event histories, giving the gene and species lineages in which each event occurred. It is fixed-parameter tractable, with polytime complexity when the maximum species outdegree is fixed. Application of our algorithms to prokaryotic and eukaryotic data shows that use of an incomplete event model has substantial impact on the events inferred and resulting biological conclusions. Our algorithms have been implemented in Notung, a freely available phylogenetic reconciliation software package, available at http://www.cs.cmu.edu/~durand/Notung.
Motriuk-Smith, Dagmara; Seville, R Scott; Quealy, Leah; Oliver, Clinton E.
2011-01-01
The taxonomy of the coccidia has historically been morphologically based. The purpose of this study was to establish if conspecificity of isolates of Eimeria callospermophili from 4 ground-dwelling squirrel hosts (Rodentia: Sciuridae) is supported by comparison of rDNA sequence data and to examine how this species relates to eimerian species from other sciurid hosts. Eimeria callospermophili was isolated from 4 wild caught hosts, i.e., Urocitellus elegans, Cynomys leucurus, Marmota flaviventris, and Cynomys ludovicianus. The ITS1 and ITS2 genomic rDNA sequences were PCR generated, sequenced, and analyzed. The highest intraspecific pairwise distance values of 6.0% in ITS1 and 7.1% in ITS2 were observed in C. leucurus. Interspecific pairwise distance values greater than 5% do not support E. callospermophili conspecificity. Generated E. callospermophili sequences were compared to Eimeria lancasterensis from Sciurus niger and Sciurus niger cinereus, and Eimeria ontarioensis from S. niger. A single well-supported clade was formed by E. callospermophili amplicons in Neighbor Joining and Maximum Parsimony analyses. However, within the clade there was little evidence of host or geographic structuring of the species. PMID:21506777
Progress, pitfalls and parallel universes: a history of insect phylogenetics
Simon, Chris; Yavorskaya, Margarita; Beutel, Rolf G.
2016-01-01
The phylogeny of insects has been both extensively studied and vigorously debated for over a century. A relatively accurate deep phylogeny had been produced by 1904. It was not substantially improved in topology until recently when phylogenomics settled many long-standing controversies. Intervening advances came instead through methodological improvement. Early molecular phylogenetic studies (1985–2005), dominated by a few genes, provided datasets that were too small to resolve controversial phylogenetic problems. Adding to the lack of consensus, this period was characterized by a polarization of philosophies, with individuals belonging to either parsimony or maximum-likelihood camps; each largely ignoring the insights of the other. The result was an unfortunate detour in which the few perceived phylogenetic revolutions published by both sides of the philosophical divide were probably erroneous. The size of datasets has been growing exponentially since the mid-1980s accompanied by a wave of confidence that all relationships will soon be known. However, large datasets create new challenges, and a large number of genes does not guarantee reliable results. If history is a guide, then the quality of conclusions will be determined by an improved understanding of both molecular and morphological evolution, and not simply the number of genes analysed. PMID:27558853
Gleeson, Ricky; Adlard, Robert
2011-10-01
Three new species of Ceratomyxa Thélohan, 1892 are described from the gall-bladders of two species of carcharhinid sharks collected off Heron and Lizard Islands on the Great Barrier Reef, Australia. Ceratomyxa carcharhini n. sp. and C. melanopteri n. sp. are described from Carcharhinus melanopterus (Quoy & Gaimard), and Ceratomyxa negaprioni n. sp. is described from Negaprion acutidens (Rüppell). These species are the first ceratomyxids reported from Australian elasmobranchs, and this is the first paper to formally characterise a novel Ceratomyxa species from an elasmobranch using both morphology and small subunit ribosomal DNA sequence data. Maximum parsimony and Bayesian inference analyses of the SSU rDNA dataset revealed that ceratomyxids from elasmobranchs form a sister clade to that of species infecting marine teleosts and Palliatus indecorus Schulman, Kovaleva & Dubina, 1979. Furthermore, the only sequenced freshwater ceratomyxid, Ceratomyxa shasta Noble, 1950, fell outside the overall marine ceratomyxid clade. These data show that Ceratomyxa, as currently recognised, is polyphyletic and ignites discussion on whether Ceratomyxa should be split. However, further taxon sampling, particularly in freshwater systems, is required to establish relevant biological divisions within the genus.
Evaluation of atpB nucleotide sequences for phylogenetic studies of ferns and other pteridophytes.
Wolf, P
1997-10-01
Inferring basal relationships among vascular plants poses a major challenge to plant systematists. The divergence events that describe these relationships occurred long ago and considerable homoplasy has since accrued for both molecular and morphological characters. A potential solution is to examine phylogenetic analyses from multiple data sets. Here I present a new source of phylogenetic data for ferns and other pteridophytes. I sequenced the chloroplast gene atpB from 23 pteridophyte taxa and used maximum parsimony to infer relationships. A 588-bp region of the gene appeared to contain a statistically significant amount of phylogenetic signal and the resulting trees were largely congruent with similar analyses of nucleotide sequences from rbcL. However, a combined analysis of atpB plus rbcL produced a better resolved tree than did either data set alone. In the shortest trees, leptosporangiate ferns formed a monophyletic group. Also, I detected a well-supported clade of Psilotaceae (Psilotum and Tmesipteris) plus Ophioglossaceae (Ophioglossum and Botrychium). The demonstrated utility of atpB suggests that sequences from this gene should play a role in phylogenetic analyses that incorporate data from chloroplast genes, nuclear genes, morphology, and fossil data.
Phylogeny of mitochondrial DNA clones in tassel-eared squirrels Sciurus aberti.
Wettstein, P J; Lager, P; Jin, L; States, J; Lamb, T; Chakraborty, R
1994-12-01
The tassel-eared squirrel, Sciurus aberti, includes six subspecies which occupy restrictive and apparently identical habitats in Ponderosa pine forests in the south-western United States and Mexico; the strict habitat requirement of this species is based on dietary requirements which are only fulfilled in these forests. To examine evolutionary relationships among certain subspecies of S. aberti, we obtained estimates of nucleotide diversity within subspecies as well as nucleotide divergence between subspecies using mitochondrial DNA (mtDNA) analysis. Restriction site polymorphisms were identified in samples of the four US subspecies: S. a. aberti (Abert), S. a. kaibabensis (Kaibab), S. a. ferreus (Ferreus), and S. a. chuscensis (Chuska). Fourteen mtDNA clones were resolved that were, with one exception, uniquely subspecific. Dendrograms constructed by neighbour-joining and maximum parsimony methods revealed two major assemblages: (1) an Abert/Kaibab group; and (2) a Ferreus/Chuska group. The Abert vs. Ferreus clones exhibited the greatest net nucleotide divergence, with a lineage separation estimate approximating 572,000 years ago assuming a nucleotide substitution rate of 7.15 × 10^-9/year/site. Five out of ten Chuska squirrels shared a clone with one Abert sample; the relative sizes of these two populations and their respective ranges as well as their close proximity support the proposal for relatively recent intermixing of Abert and Chuska populations resulting in what appears to be Abert-to-Chuska migration. Nucleotide diversity within subspecies ranked as Kaibab < Ferreus < Abert < Chuska; the relatively high diversity for the Chuska sample is based on the apparent introgression of Abert mtDNA. The relative diversity exhibited by Kaibab, Ferreus and Abert samples corresponds to the range size of the respective subspecies.
Latvala, Antti; Dick, Danielle M.; Tuulio-Henriksson, Annamari; Suvisaari, Jaana; Viken, Richard J.; Rose, Richard J.; Kaprio, Jaakko
2011-01-01
Objective: A lower level of education often co-occurs with alcohol problems, but factors underlying this co-occurrence are not well understood. Specifically, whether these outcomes share part of their underlying genetic influences has not been widely studied. Educational level also reflects various environmental influences that may moderate the genetic etiology of alcohol problems, but gene–environment interactions between educational attainment and alcohol problems are unknown. Method: We studied the two nonmutually exclusive possibilities of common genetic influences and gene–environment interaction between alcohol problems and low education using a population-based sample (n = 4,858) of Finnish young adult twins (mean age = 24.5 years, range: 22.8–28.6 years). Alcohol problems were assessed with the Rutgers Alcohol Problem Index and self-reported maximum number of drinks consumed in a 24-hour period. Years of education, based on completed and ongoing studies, represented educational level. Results: Educational level was inversely associated with alcohol problems in young adulthood, and this association was most parsimoniously explained by overlapping genetic influences. Independent of this co-occurrence, higher education was associated with increased relative importance of genetic influences on alcohol problems, whereas environmental factors had a greater effect among twins with lower education. Conclusions: Our findings suggest a complex relationship between educational level and alcohol problems in young adulthood. Lower education is related to higher levels of alcohol problems, and this co-occurrence is influenced by genetic factors affecting both phenotypes. In addition, educational level moderates the importance of genetic and environmental influences on alcohol problems, possibly reflecting differences in social-control mechanisms related to educational level. PMID:21388594
Chao, Li-Lian; Yu, Wen-Ching; Shih, Chien-Ming
2017-02-01
Babesia microti was first detected and identified in brown country rats (Rattus losea, Swinhoe) captured from the offshore Kinmen Island of Taiwan. The prevalence of Babesia infection in 283 rodents was screened by polymerase chain reaction (PCR) assay using a piroplasma-conserved primer set (Piro A/B) and the thirty-seven PCR-positive rodents were further examined by PCR using a species-specific primer set (Bab 1/4) targeting the gene encoding the nuclear small-subunit ribosomal RNA (18S rRNA) of Babesia species. B. microti was detected only in Rattus losea with a total infection rate of 9.9% (28/283). Positivity examined by species-specific PCR (9.9%) is higher than that examined by blood smear (4.6%). Sequence and phylogenetic analyses revealed that Babesia species detected in Taiwan were genetically affiliated to the genotypes of B. microti, and can be easily distinguished from other genotypes of Babesia parasites by neighbour-joining and maximum-parsimony methods. Intra- and inter-species analyses also indicate that all these Taiwan species have a lower level of genetic divergence (genetic distance values <0.084) within the genotypes of B. microti, and were genetically more distant to other genotypes (>0.218) of Babesia parasites. This study provides the first evidence of B. microti identified in R. losea in Taiwan, and the high prevalence of Babesia infection in R. losea may imply its possible role as a reservoir host for maintaining an enzootic cycle of Babesia transmission in Kinmen Island. The possible vector tick responsible for the transmission of Babesia infection needs to be further identified. Copyright © 2016 Elsevier GmbH. All rights reserved.
Benefit and cost curves for typical pollination mutualisms.
Morris, William F; Vázquez, Diego P; Chacoff, Natacha P
2010-05-01
Mutualisms provide benefits to interacting species, but they also involve costs. If costs come to exceed benefits as population density or the frequency of encounters between species increases, the interaction will no longer be mutualistic. Thus curves that represent benefits and costs as functions of interaction frequency are important tools for predicting when a mutualism will tip over into antagonism. Currently, most of what we know about benefit and cost curves in pollination mutualisms comes from highly specialized pollinating seed-consumer mutualisms, such as the yucca moth-yucca interaction. There, benefits to female reproduction saturate as the number of visits to a flower increases (because the amount of pollen needed to fertilize all the flower's ovules is finite), but costs continue to increase (because pollinator offspring consume developing seeds), leading to a peak in seed production at an intermediate number of visits. But for most plant-pollinator mutualisms, costs to the plant are more subtle than consumption of seeds, and how such costs scale with interaction frequency remains largely unknown. Here, we present reasonable benefit and cost curves that are appropriate for typical pollinator-plant interactions, and we show how they can result in a wide diversity of relationships between net benefit (benefit minus cost) and interaction frequency. We then use maximum-likelihood methods to fit net-benefit curves to measures of female reproductive success for three typical pollination mutualisms from two continents, and for each system we chose the most parsimonious model using information-criterion statistics. We discuss the implications of the shape of the net-benefit curve for the ecology and evolution of plant-pollinator mutualisms, as well as the challenges that lie ahead for disentangling the underlying benefit and cost curves for typical pollination mutualisms.
Anchored phylogenomics illuminates the skipper butterfly tree of life.
Toussaint, Emmanuel F A; Breinholt, Jesse W; Earl, Chandra; Warren, Andrew D; Brower, Andrew V Z; Yago, Masaya; Dexter, Kelly M; Espeland, Marianne; Pierce, Naomi E; Lohman, David J; Kawahara, Akito Y
2018-06-19
Butterflies (Papilionoidea) are perhaps the most charismatic insect lineage, yet phylogenetic relationships among them remain incompletely studied and controversial. This is especially true for skippers (Hesperiidae), one of the most species-rich and poorly studied butterfly families. To infer a robust phylogenomic hypothesis for Hesperiidae, we sequenced nearly 400 loci using Anchored Hybrid Enrichment and sampled all tribes and more than 120 genera of skippers. Molecular datasets were analyzed using maximum-likelihood, parsimony and coalescent multi-species phylogenetic methods. All analyses converged on a novel, robust phylogenetic hypothesis for skippers. Different optimality criteria and methodologies recovered almost identical phylogenetic trees with strong nodal support at nearly all nodes and all taxonomic levels. Our results support Coeliadinae as the sister group to the remaining skippers, the monotypic Euschemoninae as the sister group to all other subfamilies but Coeliadinae, and the monophyly of Eudaminae plus Pyrginae. Within Pyrginae, Celaenorrhinini and Tagiadini are sister groups, the Neotropical firetips, Pyrrhopygini, are sister to all other tribes but Celaenorrhinini and Tagiadini. Achlyodini is recovered as the sister group to Carcharodini, and Erynnini as sister group to Pyrgini. Within the grass skippers (Hesperiinae), there is strong support for the monophyly of Aeromachini plus remaining Hesperiinae. The giant skippers (Agathymus and Megathymus) once classified as a subfamily, are recovered as monophyletic with strong support, but are deeply nested within Hesperiinae. Anchored Hybrid Enrichment sequencing resulted in a large amount of data that built the foundation for a new, robust evolutionary tree of skippers. The newly inferred phylogenetic tree resolves long-standing systematic issues and changes our understanding of the skipper tree of life. 
These results enhance understanding of the evolution of one of the most species-rich butterfly families.
Tsuda, K; Kikkawa, Y; Yonekawa, H; Tanabe, Y
1997-08-01
To test the hypothesis that the domestic dogs are derived from several different ancestral gray wolf populations, we compared the sequence of the displacement (D)-loop region of the mitochondrial DNA (mtDNA) from 24 breeds of domestic dog (34 individual dogs) and 3 subspecies of gray wolf (Canis lupus lupus, C.l. pallipes and C.l. chanco; 19 individuals). The intraspecific sequence variations within domestic dogs (0.00-3.19%) and within wolves (0.00-2.88%) were comparable to the interspecific variations between domestic dogs and wolves (0.30-3.35%). A repetitive sequence with repeat units (TACACGTA/GCG) that causes the size variation in the D-loop region was also found in both dogs and wolves. However, no nucleotide substitutions or repetitive arrays were specific for domestic dogs or for wolves. These results showed that there is a close genetic relationship between dogs and wolves. Two major clades appeared in the phylogenetic trees constructed by neighbor-joining and by the maximum parsimony method; one clade containing Chinese wolf (C.l. chanco) showed extensive variations while the other showed only slight variation. This showed that there were two major genetic components both in domestic dogs and in wolves. However, neither clades nor haplotypes specific for any dog breed were observed, whereas subspecies-specific clades were found in Asiatic wolves. These results suggested that the extant breeds of domestic dogs have maintained a large degree of mtDNA polymorphisms introduced from their ancestral wolf populations, and that extensive interbreedings had occurred among multiple matriarchal origins.
Zhang, Wangshu; Coba, Marcelo P; Sun, Fengzhu
2016-01-11
Protein domains can be viewed as portable units of biological function that define the functional properties of proteins. Therefore, if a protein is associated with a disease, protein domains might also be associated and define disease endophenotypes. However, knowledge about such domain-disease relationships is rarely available. Thus, identification of domains associated with human diseases would greatly improve our understanding of the mechanism of human complex diseases and further improve the prevention, diagnosis and treatment of these diseases. Based on phenotypic similarities among diseases, we first group diseases into overlapping modules. We then develop a framework to infer associations between domains and diseases through known relationships between diseases and modules, domains and proteins, as well as proteins and disease modules. Different methods including Association, Maximum likelihood estimation (MLE), Domain-disease pair exclusion analysis (DPEA), Bayesian, and Parsimonious explanation (PE) approaches are developed to predict domain-disease associations. We demonstrate the effectiveness of all five approaches via a series of validation experiments, and show the robustness of the MLE, Bayesian and PE approaches to the involved parameters. We also study the effects of disease modularization in inferring novel domain-disease associations. Through validation, the AUC (Area Under the operating characteristic Curve) scores for Bayesian, MLE, DPEA, PE, and Association approaches are 0.86, 0.84, 0.83, 0.83 and 0.79, respectively, indicating the usefulness of these approaches for predicting domain-disease relationships. Finally, we choose the Bayesian approach to infer domains associated with two common diseases, Crohn's disease and type 2 diabetes. The Bayesian approach has the best performance for the inference of domain-disease relationships.
The predicted landscape between domains and diseases provides a more detailed view about the disease mechanisms.
Evolutionary relationships between miRNA genes and their activity.
Zhu, Yan; Skogerbø, Geir; Ning, Qianqian; Wang, Zhen; Li, Biqing; Yang, Shuang; Sun, Hong; Li, Yixue
2012-12-22
The emergence of vertebrates is characterized by a strong increase in miRNA families. MicroRNAs interact broadly with many transcripts, and the evolution of such a system is intriguing. However, evolutionary questions concerning the origin of miRNA genes and their subsequent evolution remain unexplained. In order to systematically understand the evolutionary relationship between miRNA genes and their function, we classified known human miRNAs into eight groups based on their evolutionary ages estimated by the maximum parsimony method. New miRNA genes with new functional sequences accumulated more dynamically in vertebrates than in Drosophila. Different levels of evolutionary selection were observed over miRNA gene sequences with different times of origin. Most genic miRNAs differ from their host genes in time of origin; there is no particular relationship between the age of a miRNA and the age of its host genes; and genic miRNAs are mostly younger than the corresponding host genes. MicroRNAs that originated over different time-scales are often predicted or verified to target the same or overlapping sets of genes, opening the possibility of substantial functional redundancy among miRNAs of different ages. A higher degree of tissue specificity and a lower expression level were found in young miRNAs. Our data showed that compared with protein coding genes, miRNA genes are more dynamic in terms of emergence and decay. Evolution patterns are quite different between miRNAs of different ages. MicroRNA activity is under tight control, with well-regulated expression increased and targeting decreased over time. Our work calls attention to the study of miRNA activity with a consideration of their origin time.
Sha, Li-Na; Fan, Xing; Li, Jun; Liao, Jin-Qiu; Zeng, Jian; Wang, Yi; Kang, Hou-Yang; Zhang, Hai-Qin; Zheng, You-Liang; Zhou, Yong-Hong
2017-09-01
Leymus Hochst. (Triticeae: Poaceae), a group of allopolyploid species with the NsXm genomes, is a perennial genus with diversity in morphology, cytology, ecology, and distribution in the Triticeae. To investigate the genome origin and evolutionary history of Leymus, three unlinked low-copy nuclear genes (Acc1, Pgk1, and GBSSI) and three chloroplast regions (trnL-F, matK, and rbcL) of 32 Leymus species were analyzed with those of 36 diploid species representing 18 basic genomes in the Triticeae. The phylogenetic relationships were reconstructed using Bayesian inference, maximum parsimony, and NeighborNet methods. A time-calibrated phylogeny was generated to estimate the evolutionary history of Leymus. The results suggest that reticulate evolution has occurred in Leymus species, with several distinct progenitors contributing to Leymus. The molecular data in resolution of the Xm-genome lineage resulted in two apparently contradictory results, with one placing the Xm-genome lineage as closely related to the P/F genome and the other splitting the Xm-genome lineage as sister to the Ns-genome donor. Our results suggested that (1) the Ns genome of Leymus was donated by Psathyrostachys, and additional Ns-containing alleles may be introgressed into some Leymus polyploids by recurrent hybridization; (2) the phylogenetic incongruence regarding the resolution of the Xm-genome lineage suggested that the Xm genome of Leymus was closely related to the P genome of Agropyron; (3) both Ns- and Xm-genome lineages served as the maternal donor during the speciation of Leymus species; (4) the Pseudoroegneria, Lophopyrum and Australopyrum genomes contributed to some Leymus species. Copyright © 2017 Elsevier Inc. All rights reserved.
Species delimitation in Trametes: a comparison of ITS, RPB1, RPB2 and TEF1 gene phylogenies.
Carlson, Alexis; Justo, Alfredo; Hibbett, David S
2014-01-01
Trametes is a cosmopolitan genus of white rot polypores, including the "turkey tail" fungus, T. versicolor. Although Trametes is one of the most familiar genera of polypores, its species-level taxonomy is unsettled. The ITS region is the most commonly used molecular marker for species delimitation in fungi, but it has been shown to have low molecular variation in Trametes, resulting in poorly resolved phylogenies and unclear species boundaries, especially in the T. versicolor species complex (T. versicolor sensu stricto, T. ochracea, T. pubescens, T. ectypa). Here we evaluate the performance of three protein-coding genes (TEF1, RPB1, RPB2) for species delimitation and phylogenetic reconstruction in Trametes. We obtained 59 TEF1, 34 RPB1 and 55 RPB2 sequences from 69 individuals, focusing on the T. versicolor complex, and performed phylogenetic analyses with maximum likelihood and parsimony methods. All three protein-coding genes outperformed ITS for separating species in the T. versicolor complex. The multigene phylogenetic analysis shows the highest amount of resolution and supported nodes separating T. ectypa, T. ochracea, T. pubescens and T. versicolor with strong support. In addition, three lineages are resolved in the species complex of T. elegans. The T. elegans complex includes three species: T. elegans (based on material from Puerto Rico, Belize, the Philippines), T. aesculi (from North America) and T. repanda (from Papua New Guinea, the Philippines, Venezuela). The utility of gene markers varies, with TEF1 having the highest PCR and sequencing success rate and RPB1 offering the best backbone resolution for the genus. © 2014 by The Mycological Society of America.
Simon and the Sirens: A Commentary.
Daston, Lorraine
2015-09-01
Even in its extended usage, the concept of bounded rationality bears the birthmark of its origins in economics. First and most obviously, it is about seeking the most efficient (not necessarily the best) means toward a given end, whether that is curing patients or proving theorems. Second, the means are whittled down to the most parsimonious possible, not only acknowledging cognitive limitations but actually imposing them, whether in the form of Morgan's canon, Methodist agnosticism about causes, or Entscheidungsproblem-like restrictions on the acceptable formulation of mathematical proofs. Third, these parsimonious restrictions all tend to minimize the role of reasonable deliberation in rationality, albeit in different ways. As an object of inquiry for the history of science, bounded rationality has great promise. But as a model of the history of science, as one long exercise in bounded rationality, its utility may apply more to future than past science.
Comparison among cognitive diagnostic models for the TIMSS 2007 fourth grade mathematics assessment.
Yamaguchi, Kazuhiro; Okada, Kensuke
2018-01-01
A variety of cognitive diagnostic models (CDMs) have been developed in recent years to help with the diagnostic assessment and evaluation of students. Each model makes different assumptions about the relationship between students' achievement and skills, which makes it important to empirically investigate which CDMs better fit the actual data. In this study, we examined this question by comparatively fitting representative CDMs to the Trends in International Mathematics and Science Study (TIMSS) 2007 assessment data across seven countries. The following two major findings emerged. First, in accordance with former studies, CDMs had a better fit than did the item response theory models. Second, main effects models generally had a better fit than other parsimonious or the saturated models. Related to the second finding, the fit of the traditional parsimonious models such as the DINA and DINO models was not optimal. The empirical educational implications of these findings are discussed.
Huang, Xiao-Lei; Qiao, Ge-Xia; Lei, Fu-Min
2010-01-01
Parsimony analysis of endemicity (PAE) was used to identify areas of endemism (AOEs) for Chinese birds at the subregional level. Four AOEs were identified based on a distribution database of 105 endemic species and using 18 avifaunal subregions as the operating geographical units (OGUs). The four AOEs are the Qinghai-Zangnan Subregion, the Southwest Mountainous Subregion, the Hainan Subregion and the Taiwan Subregion. Cladistic analysis of subregions generally supports the division of China’s avifauna into Palaearctic and Oriental realms. Two PAE area trees were produced from two different distribution datasets (year 1976 and 2007). The 1976 topology has four distinct subregional branches; however, the 2007 topology has three distinct branches. Moreover, three Palaearctic subregions in the 1976 tree clustered together with the Oriental subregions in the 2007 tree. Such topological differences may reflect changes in the distribution of bird species through circa three decades. PMID:20559504
Comparison among cognitive diagnostic models for the TIMSS 2007 fourth grade mathematics assessment
Okada, Kensuke
2018-01-01
A variety of cognitive diagnostic models (CDMs) have been developed in recent years to help with the diagnostic assessment and evaluation of students. Each model makes different assumptions about the relationship between students’ achievement and skills, which makes it important to empirically investigate which CDMs better fit the actual data. In this study, we examined this question by comparatively fitting representative CDMs to the Trends in International Mathematics and Science Study (TIMSS) 2007 assessment data across seven countries. The following two major findings emerged. First, in accordance with former studies, CDMs had a better fit than did the item response theory models. Second, main effects models generally had a better fit than other parsimonious or the saturated models. Related to the second finding, the fit of the traditional parsimonious models such as the DINA and DINO models was not optimal. The empirical educational implications of these findings are discussed. PMID:29394257
More quality measures versus measuring what matters: a call for balance and parsimony
Nelson, Eugene C; Pryor, David B; James, Brent; Swensen, Stephen J; Kaplan, Gary S; Weissberg, Jed I; Bisognano, Maureen; Yates, Gary R; Hunt, Gordon C
2012-01-01
External groups requiring measures now include public and private payers, regulators, accreditors and others that certify performance levels for consumers, patients and payers. Although benefits have accrued from the growth in quality measurement, the recent explosion in the number of measures threatens to shift resources from improving quality to cover a plethora of quality-performance metrics that may have a limited impact on the things that patients and payers want and need (ie, better outcomes, better care, and lower per capita costs). Here we propose a policy that quality measurement should be: balanced to meet the need of end users to judge quality and cost performance and the need of providers to continuously improve the quality, outcomes and costs of their services; and parsimonious to measure quality, outcomes and costs with appropriate metrics that are selected based on end-user needs. PMID:22893696
Minimal metabolic pathway structure is consistent with associated biomolecular interactions
Bordbar, Aarash; Nagarajan, Harish; Lewis, Nathan E; Latif, Haythem; Ebrahim, Ali; Federowicz, Stephen; Schellenberger, Jan; Palsson, Bernhard O
2014-01-01
Pathways are a universal paradigm for functionally describing cellular processes. Even though advances in high-throughput data generation have transformed biology, the core of our biological understanding, and hence data interpretation, is still predicated on human-defined pathways. Here, we introduce an unbiased, pathway structure for genome-scale metabolic networks defined based on principles of parsimony that do not mimic canonical human-defined textbook pathways. Instead, these minimal pathways better describe multiple independent pathway-associated biomolecular interaction datasets suggesting a functional organization for metabolism based on parsimonious use of cellular components. We use the inherent predictive capability of these pathways to experimentally discover novel transcriptional regulatory interactions in Escherichia coli metabolism for three transcription factors, effectively doubling the known regulatory roles for Nac and MntR. This study suggests an underlying and fundamental principle in the evolutionary selection of pathway structures; namely, that pathways may be minimal, independent, and segregated. PMID:24987116
More quality measures versus measuring what matters: a call for balance and parsimony.
Meyer, Gregg S; Nelson, Eugene C; Pryor, David B; James, Brent; Swensen, Stephen J; Kaplan, Gary S; Weissberg, Jed I; Bisognano, Maureen; Yates, Gary R; Hunt, Gordon C
2012-11-01
External groups requiring measures now include public and private payers, regulators, accreditors and others that certify performance levels for consumers, patients and payers. Although benefits have accrued from the growth in quality measurement, the recent explosion in the number of measures threatens to shift resources from improving quality to cover a plethora of quality-performance metrics that may have a limited impact on the things that patients and payers want and need (ie, better outcomes, better care, and lower per capita costs). Here we propose a policy that quality measurement should be: balanced to meet the need of end users to judge quality and cost performance and the need of providers to continuously improve the quality, outcomes and costs of their services; and parsimonious to measure quality, outcomes and costs with appropriate metrics that are selected based on end-user needs.
Matzke, Nicholas J; Irmis, Randall B
2018-01-01
Tip-dating, where fossils are included as dated terminal taxa in Bayesian dating inference, is an increasingly popular method. Data for these studies often come from morphological character matrices originally developed for non-dated, and usually parsimony, analyses. In parsimony, only shared derived characters (synapomorphies) provide grouping information, so many character matrices have an ascertainment bias: they omit autapomorphies (unique derived character states), which are considered uninformative. There has been no study of the effect of this ascertainment bias in tip-dating, but autapomorphies can be informative in model-based inference. We expected that excluding autapomorphies would shorten the morphological branchlengths of terminal branches, and thus bias downwards the time branchlengths inferred in tip-dating. We tested for this effect using a matrix for Carboniferous-Permian eureptiles where all autapomorphies had been deliberately coded. Surprisingly, date estimates are virtually unchanged when autapomorphies are excluded, although we find large changes in morphological rate estimates and small effects on topological and dating confidence. We hypothesized that the puzzling lack of effect on dating was caused by the non-clock nature of the eureptile data. We confirm this explanation by simulating strict clock and non-clock datasets, showing that autapomorphy exclusion biases dating only for the clocklike case. A theoretical solution to ascertainment bias is computing the ascertainment bias correction (Mkparsinf), but we explore this correction in detail, and show that it is computationally impractical for typical datasets with many character states and taxa. Therefore we recommend that palaeontologists collect autapomorphies whenever possible when assembling character matrices.
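The distinction between parsimony-informative characters and autapomorphies that the abstract above relies on can be illustrated with a minimal sketch. It assumes the usual textbook criterion (a character is parsimony-informative when at least two states each occur in at least two taxa); the function name, the string encoding of a character column, and the missing-data symbols are hypothetical choices for illustration and are not taken from the study.

```python
from collections import Counter


def is_parsimony_informative(column, missing="?-"):
    """Return True if a character column is parsimony-informative.

    `column` is a sequence of state symbols, one per taxon (hypothetical
    encoding). Symbols listed in `missing` are ignored. The criterion used
    here: at least two distinct states must each occur in two or more taxa.
    """
    counts = Counter(state for state in column if state not in missing)
    states_seen_twice = sum(1 for n in counts.values() if n >= 2)
    return states_seen_twice >= 2


# A derived state shared by two taxa groups them, so it is informative.
shared_derived = "00011"
# A derived state in a single taxon (autapomorphy) is variable but uninformative.
autapomorphy = "00001"

print(is_parsimony_informative(shared_derived))  # True
print(is_parsimony_informative(autapomorphy))    # False
```

A matrix filtered with such a rule keeps only grouping information, which is the kind of ascertainment bias the abstract describes.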
Modeling nonlinear responses of DOC transport in boreal catchments in Sweden
NASA Astrophysics Data System (ADS)
Kasurinen, Ville; Alfredsen, Knut; Ojala, Anne; Pumpanen, Jukka; Weyhenmeyer, Gesa A.; Futter, Martyn N.; Laudon, Hjalmar; Berninger, Frank
2016-07-01
Stream water dissolved organic carbon (DOC) concentrations display high spatial and temporal variation in boreal catchments. Understanding and predicting these patterns is a challenge with great implications for water quality projections and carbon balance estimates. Although several biogeochemical models have been used to estimate stream water DOC dynamics, model biases are common during both rain and snow melt-driven events. The parsimonious DOC-model, K-DOC, with 10 calibrated parameters, uses a nonlinear discharge and catchment water storage relationship including soil temperature dependencies of DOC release and consumption. K-DOC was used to estimate the stream water DOC concentrations over 5 years for eighteen nested boreal catchments having a total area of 68 km² (varying from 0.04 to 67.9 km²). The model successfully simulated DOC concentrations during base flow conditions, as well as hydrological events, in catchments dominated by organic and mineral soils, reaching NSEs from 0.46 to 0.76. Our semimechanistic model was parsimonious enough to have all parameters estimated using statistical methods. We did not find any clear differences between forest and mire-dominated catchments that could be explained by soil type or tree species composition. However, parameters controlling slow release and consumption of DOC from soil water behaved differently for small headwater catchments (less than 2 km²) than for those that integrate larger areas of different ecosystem types (10-68 km²). Our results emphasize that it is important to account for nonlinear dependencies of both soil temperature and catchment water storage when simulating DOC dynamics of boreal catchments.
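For readers unfamiliar with the NSE values quoted in the abstract above, the following is a minimal sketch of the Nash–Sutcliffe efficiency in its standard form, one minus the ratio of the residual sum of squares to the variance of the observations about their mean. The function name and the toy concentration series are hypothetical; the study's own data and code are not reproduced here.

```python
def nash_sutcliffe(observed, simulated):
    """Nash-Sutcliffe efficiency: 1 - SS_residual / SS_total.

    1.0 means a perfect fit; 0.0 means the model is no better than
    predicting the mean of the observations; negative values are worse.
    """
    if len(observed) != len(simulated) or len(observed) == 0:
        raise ValueError("series must be non-empty and of equal length")
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    if ss_tot == 0:
        raise ValueError("observations are constant; NSE is undefined")
    return 1.0 - ss_res / ss_tot


# Toy DOC concentrations (hypothetical values, mg/L).
obs = [1.0, 2.0, 3.0]
print(nash_sutcliffe(obs, [1.0, 2.0, 3.0]))  # perfect simulation
print(nash_sutcliffe(obs, [2.0, 2.0, 2.0]))  # simulation equal to the observed mean
```

On this scale, the reported range of 0.46 to 0.76 indicates that the model explains a substantial part, but not all, of the observed variance.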
de Carvalho, André Luiz Gomes; de Britto, Marcelo Ribeiro; Fernandes, Daniel Silva
2013-01-01
Based on comprehensive distributional records of the 23 species currently assigned to the lizard genus Tropidurus, we investigated patterns of endemism and area relationships in South America. Two biogeographic methods were applied, Parsimony Analysis of Endemicity (PAE) and Brooks Parsimony Analysis (BPA). Two areas of endemism were detected by PAE: the first within the domains of the semiarid Brazilian Caatinga, which includes seven endemic species, and the second in the region of the Serranía de Huanchaca, eastern Bolivia, in which three endemic species are present. The area cladograms recovered a close relationship between the Atlantic Forest and areas of the South American open corridor. The results revealed a close relationship among the provinces Caatinga (Cerrado, Parana Forest (Pantanal+Chaco)). The uplift of the Brazilian Central Plateau in the Late Pliocene-Early Pleistocene (4-2 Myr BP) has been interpreted as a major event responsible for isolation and differentiation of biotas along these areas. However, we emphasize that without the establishment of a temporal framework concerning the diversification history of Tropidurus it is premature to correlate cladogenetic events with specific time periods or putative vicariant scenarios. The limiting factors hampering the understanding of the biogeographic history of this genus include (1) the absence of temporal references in relation to the diversification of distinct clades within Tropidurus; (2) the lack of an appropriate taxonomic resolution of the species complexes currently represented by widely distributed forms; and (3) the need for a comprehensive phylogenetic hypothesis. We suggest that these three important aspects should be prioritized in future investigations. PMID:23527261
Phylogeny of mycoplasmalike organisms (phytoplasmas): a basis for their classification.
Gundersen, D E; Lee, I M; Rehner, S A; Davis, R E; Kingsbury, D T
1994-01-01
A global phylogenetic analysis using parsimony of 16S rRNA gene sequences from 46 mollicutes, 19 mycoplasmalike organisms (MLOs) (new trivial name, phytoplasmas), and several related bacteria placed the MLOs definitively among the members of the class Mollicutes and revealed that MLOs form a large discrete monophyletic clade, paraphyletic to the Acholeplasma species, within the Anaeroplasma clade. Within the MLO clade resolved in the global mollicutes phylogeny and a comprehensive MLO phylogeny derived by parsimony analyses of 16S rRNA gene sequences from 30 diverse MLOs representative of nearly all known distinct MLO groups, five major phylogenetic groups with a total of 11 distinct subclades (monophyletic groups or taxa) could be recognized. These MLO subclades (roman numerals) and designated type strains were as follows: i, Maryland aster yellows AY1; ii, apple proliferation AP-A; iii, peanut witches'-broom PnWB; iv, Canada peach X CX; v, rice yellow dwarf RYD; vi, pigeon pea witches'-broom PPWB; vii, palm lethal yellowing LY; viii, ash yellows AshY; ix, clover proliferation CP; x, elm yellows EY; and xi, loofah witches'-broom LfWB. The designations of subclades and their phylogenetic positions within the MLO clade were supported by a congruent phylogeny derived by parsimony analyses of ribosomal protein L22 gene sequences from most representative MLOs. On the basis of the phylogenies inferred in the present study, we propose that MLOs should be represented taxonomically at the minimal level of genus and that each phylogenetically distinct MLO subclade identified should represent at least a distinct species under this new genus. PMID:8071198
Sampling and counting genome rearrangement scenarios
2015-01-01
Background: Even for moderate size inputs, there are a tremendous number of optimal rearrangement scenarios, regardless of what the model is and which specific question is to be answered. Therefore giving one optimal solution might be misleading and cannot be used for statistical inference. Statistically well-founded methods are necessary to sample uniformly from the solution space, after which a small number of samples is sufficient for statistical inference. Contribution: In this paper, we give a mini-review of the state of the art of sampling and counting rearrangement scenarios, focusing on the reversal, DCJ and SCJ models. In addition, we give a Gibbs sampler for sampling most parsimonious labelings of evolutionary trees under the SCJ model. The method has been implemented and tested on real-life data. The software package together with example data can be downloaded from http://www.renyi.hu/~miklosi/SCJ-Gibbs/ PMID:26452124
Variable selection with stepwise and best subset approaches
2016-01-01
While purposeful selection is performed partly by software and partly by hand, the stepwise and best subset approaches are performed automatically by software. Two R functions, stepAIC() and bestglm(), are well designed for stepwise and best subset regression, respectively. The stepAIC() function begins with a full or null model, and methods for stepwise regression can be specified in the direction argument with character values “forward”, “backward” and “both”. The bestglm() function begins with a data frame containing explanatory variables and response variables. The response variable should be in the last column. A variety of goodness-of-fit criteria can be specified in the IC argument. The Bayesian information criterion (BIC) usually results in a more parsimonious model than the Akaike information criterion (AIC). PMID:27162786
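The remark that BIC tends to select a more parsimonious model than AIC follows from the penalty terms: AIC = 2k - 2 ln L and BIC = k ln(n) - 2 ln L, so BIC penalizes each extra parameter more heavily whenever n > e^2 (about 7.4 observations). The Python sketch below illustrates this with made-up log-likelihoods for three nested candidate models. It does not call stepAIC() or bestglm(); the model names, log-likelihoods and sample size are assumptions chosen for illustration.

```python
import math


def aic(log_lik, k):
    """Akaike information criterion for a model with k parameters."""
    return 2 * k - 2 * log_lik


def bic(log_lik, k, n):
    """Bayesian information criterion for k parameters and n observations."""
    return k * math.log(n) - 2 * log_lik


# Hypothetical candidates: model name -> (maximized log-likelihood, k).
candidates = {
    "x1": (-150.0, 2),
    "x1+x2": (-148.5, 3),
    "x1+x2+x3": (-148.0, 4),
}
n_obs = 100  # assumed sample size

best_aic = min(candidates, key=lambda m: aic(*candidates[m]))
best_bic = min(candidates, key=lambda m: bic(*candidates[m], n_obs))
print("AIC selects:", best_aic)
print("BIC selects:", best_bic)
```

With these illustrative numbers, AIC prefers the two-predictor model while BIC prefers the single-predictor model, because the small gain in log-likelihood does not offset the ln(n) penalty.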
How Do Students Make Sense of Science?
ERIC Educational Resources Information Center
Linn, Marcia C.; Songer, Nancy Butler
1993-01-01
Eighth graders' ideas about thermodynamics, and their understanding of thermodynamics principles, were assessed before and after they attended a one-semester course on thermodynamics. Results characterized students' views concerning scientific explanations of phenomena, parsimonious versus descriptive explanations, the application of science…
Aliisedimentitalea scapharcae gen. nov., sp. nov., isolated from ark shell Scapharca broughtonii.
Kim, Young-Ok; Park, Sooyeon; Nam, Bo-Hye; Kim, Dong-Gyun; Won, Sung-Min; Park, Ji-Min; Yoon, Jung-Hoon
2015-08-01
A Gram-negative, aerobic, non-spore-forming, motile and ovoid or rod-shaped bacterial strain, designated MA2-16(T), was isolated from ark shell (Scapharca broughtonii) collected from the South Sea, South Korea. Strain MA2-16(T) was found to grow optimally at 30°C, at pH 7.0-8.0 and in the presence of 2.0% (w/v) NaCl. Neighbour-joining, maximum-likelihood and maximum-parsimony phylogenetic trees based on 16S rRNA gene sequences revealed that strain MA2-16(T) clustered with the type strain of Sedimentitalea nanhaiensis. The novel strain exhibited a 16S rRNA gene sequence similarity value of 97.1% to the type strain of S. nanhaiensis. In the neighbour-joining phylogenetic tree based on gyrB sequences, strain MA2-16(T) formed an evolutionary lineage independent of those of other taxa. Strain MA2-16(T) contained Q-10 as the predominant ubiquinone and C18:1 ω7c and 11-methyl C18:1 ω7c as the major fatty acids. The major polar lipids of strain MA2-16(T) were phosphatidylcholine, phosphatidylglycerol, phosphatidylethanolamine, an unidentified aminolipid and an unidentified lipid. The DNA G+C content of strain MA2-16(T) was 57.7 mol% and its DNA-DNA relatedness values with the type strains of S. nanhaiensis and some phylogenetically related species of the genera Leisingera and Phaeobacter were 13-24%. On the basis of the data presented, strain MA2-16(T) is considered to represent a novel genus and novel species within the family Rhodobacteraceae, for which the name Aliisedimentitalea scapharcae gen. nov., sp. nov. is proposed. The type strain is MA2-16(T) (=KCTC 42119(T) =CECT 8598(T)).
Huang, Wei-Yi; Zhao, Guang-Hui; Wei, Shu-Jun; Song, Hui-Qun; Xu, Min-Jun; Lin, Rui-Qing; Zhou, Dong-Hui; Zhu, Xing-Quan
2012-01-01
Complete mitochondrial (mt) genomes and their gene rearrangements are increasingly used as molecular markers for investigating phylogenetic relationships. Contributing to the complete mt genomes of Gastropoda, especially Pulmonata, we determined the mt genome of the freshwater snail Galba pervia, which is an important intermediate host for Fasciola spp. in China. The complete mt genome of G. pervia is 13,768 bp in length. The genome is circular and consists of 37 genes, including 13 protein-coding genes, 2 rRNA genes and 22 tRNA genes. The mt gene order of G. pervia showed a novel arrangement (tRNA-His, tRNA-Gly and tRNA-Tyr change positions and directions) when compared with the mt genomes of Pulmonata species sequenced to date, indicating divergence among different species within the Pulmonata. The 13 protein-coding genes were deduced to encode a total of 3655 amino acids. The most frequently used amino acid is Leu (15.05%), followed by Phe (11.24%), Ser (10.76%) and Ile (8.346%). Phylogenetic analyses using the concatenated amino acid sequences of the 13 protein-coding genes, with three different computational algorithms (maximum parsimony, maximum likelihood and Bayesian analysis), all revealed that the families Lymnaeidae and Planorbidae are two closely related snail families, consistent with previous classifications based on morphological and molecular studies. The complete mt genome sequence of G. pervia showed a novel gene arrangement, and it represents the first sequenced high-quality mt genome of the family Lymnaeidae. These novel mtDNA data provide additional genetic markers for studying the epidemiology, population genetics and phylogeographics of freshwater snails, as well as for understanding the interplay between the intermediate snail hosts and the intra-mollusca stages of Fasciola spp. PMID:22844544
A Radical Solution: The Phylogeny of the Nudibranch Family Fionidae
Cella, Kristen; Ekimova, Irina; Chichvarkhin, Anton; Schepetov, Dimitry; Gosliner, Terrence M.
2016-01-01
Tergipedidae represents a diverse and successful group of aeolid nudibranchs, with approximately 200 species distributed throughout most marine ecosystems and spanning all biogeographical regions of the oceans. However, the systematics of this family remains poorly understood since no modern phylogenetic study has been undertaken to support any of the proposed classifications. The present study is the first molecular phylogeny of Tergipedidae based on partial sequences of two mitochondrial (COI and 16S) genes and one nuclear gene (H3). Maximum likelihood, maximum parsimony and Bayesian analysis were conducted in order to elucidate the systematics of this family. Our results do not recover the traditional Tergipedidae as monophyletic, since it belongs to a larger clade that includes the families Eubranchidae, Fionidae and Calmidae. This newly recovered clade is here referred to as Fionidae, the oldest name for this taxon. In addition, the present molecular phylogeny does not recover the traditional systematic relationships at a generic level, and therefore, systematic changes are required. We recognize the following clades within Fionidae: Calma, Cuthona, Cuthonella, Eubranchus, Fiona, Murmania, Tenellia, Tergipes, Tergiposacca gen. nov., Rubramoena gen. nov. and Abronica gen. nov. The type species of Tergiposacca, T. longicerata nov. sp. is described. The other two new genera have a previously described species as their type species. Most of these taxa, with the exceptions of Eubranchus, Tergipes and Fiona are composed of radically different constituent species from their traditional membership, but appear to be supported by morphological synapomorphies as well as molecular data. Aenigmastyletus, Catriona, Phestilla, Tenellia and Trinchesia are nested within other clades and, thus are here considered as synonyms of the larger clades. 
The phylogenetic position and validity of Myja, Guyvalvoria, Leostyletus and Subcuthona still need to be tested in future studies when material becomes available. PMID:27977703
Aestuariispira insulae gen. nov., sp. nov., a lipolytic bacterium isolated from a tidal flat.
Park, Sooyeon; Park, Ji-Min; Kang, Chul-Hyung; Yoon, Jung-Hoon
2014-06-01
A Gram-stain-negative, non-motile, aerobic, curved-to-spiral-rod-shaped bacterium, designated AH-MY2(T), was isolated from a tidal flat on Aphae island in the sea to the south-west of South Korea, and its taxonomic position was investigated using a polyphasic taxonomic approach. Strain AH-MY2(T) grew optimally at 30 °C, at pH 7.0-8.0 and in the presence of 2.0% (w/v) NaCl. Neighbour-joining, maximum-likelihood and maximum-parsimony phylogenetic trees based on 16S rRNA gene sequences showed that strain AH-MY2(T) clustered with the type strain of Terasakiella pusilla and that this cluster joined the clade comprising the type strains of species of the genus Thalassospira. Strain AH-MY2(T) exhibited 16S rRNA gene sequence similarity values of 90.6% to the type strain of Terasakiella pusilla and of less than 91.0% to the type strains of other species with validly published names. Strain AH-MY2(T) contained Q-10 as the predominant ubiquinone and C(18 : 1)ω7c as the major fatty acid. The major polar lipids detected in strain AH-MY2(T) were phosphatidylglycerol, phosphatidylethanolamine, two unidentified aminolipids and one unidentified glycolipid. The DNA G+C content of strain AH-MY2(T) was 56.0 mol%. The phylogenetic data and differential chemotaxonomic and other phenotypic properties revealed that strain AH-MY2(T) represented a novel genus and species within the family Rhodospirillaceae of the class Alphaproteobacteria, for which the name Aestuariispira insulae gen. nov., sp. nov. is proposed. The type strain of Aestuariispira insulae is AH-MY2(T) ( = KCTC 32577(T) = CECT 8488(T)). © 2014 IUMS.
A molecular phylogeny of the Canidae based on six nuclear loci.
Bardeleben, Carolyne; Moore, Rachael L; Wayne, Robert K
2005-12-01
We have reconstructed the phylogenetic relationships of 23 species in the dog family, Canidae, using DNA sequence data from six nuclear loci. Individual gene trees were generated with maximum parsimony (MP) and maximum likelihood (ML) analysis. In general, these individual gene trees were not well resolved, but several identical groupings were supported by more than one locus. Phylogenetic analysis with a data set combining the six nuclear loci using MP, ML, and Bayesian approaches produced a more resolved tree that agreed with previously published mitochondrial trees in finding three well-defined clades, including the red fox-like canids, the South American foxes, and the wolf-like canids. In addition, the nuclear data set provides novel indel support for several previously inferred clades. Differences between trees derived from the nuclear data and those from the mitochondrial data include the grouping of the bush dog and maned wolf into a clade with the South American foxes, the grouping of the side-striped jackal (Canis adustus) and black-backed jackal (Canis mesomelas), and the grouping of the bat-eared fox (Otocyon megalotis) with the raccoon dog (Nyctereutes procyonoides). We also analyzed the combined nuclear+mitochondrial tree. Many nodes that were strongly supported in the nuclear tree or the mitochondrial tree remained strongly supported in the nuclear+mitochondrial tree. Relationships within the clades containing the red fox-like canids and South American canids are well resolved, whereas the relationships among the wolf-like canids remain largely undetermined. The lack of resolution within the wolf-like canids may be due to their recent divergence and insufficient time for the accumulation of phylogenetically informative signal.
Cho, Myong-Suk; Kim, Chan-Soo; Kim, Seon-Hee; Kim, Ted Oh; Heo, Kyoung-In; Jun, Jumin; Kim, Seung-Chul
2014-11-01
The subgenus Cerasus of the genus Prunus includes several popular ornamental flowering cherries. Of the hundreds of cultivars, P. ×yedoensis ('Somei-yoshino') is the most popular and familiar cultivar in Korea and Japan and is considered to be of hybrid origin. However, the hybrid origin of P. ×yedoensis and its relationship to wild P. yedoensis, naturally occurring on Jeju Island, Korea, are highly controversial. We extensively sampled wild P. yedoensis, cultivated P. ×yedoensis, and numerous individuals from other species belonging to subgenus Cerasus on Jeju Island. Samples from 71 accessions, representing 13 species and one cultivar (P. ×yedoensis), were sequenced for nrDNA ITS/ETS (952 characters) and seven noncoding cpDNA regions (5421 characters) and subjected to maximum parsimony and maximum likelihood analysis. Additive polymorphisms in the ITS/ETS regions were confirmed by cloning amplicons from representative species. The nuclear (ITS/ETS and G3pdh) and cpDNA data, along with several morphological characteristics, provide the first convincing evidence for the hybrid origin of wild P. yedoensis. The maternal parent was determined to be P. spachiana f. ascendens, while the paternal parent was unresolved from the taxonomically complex P. serrulata/P. sargentii clade. The presence of two kinds of ribotypes was confirmed by cloning, and the possible origin of cultivated P. ×yedoensis from wild populations on Jeju Island was also suggested. Bidirectional and multiple hybridization events were responsible for the origin of wild P. yedoensis. Extensive gene flow was documented in this study, suggesting an important role of reticulate evolution in subgenus Cerasus. © 2014 Botanical Society of America, Inc.
Ribeiro, Tatiana Corrêa; Weiblen, Carla; de Azevedo, Maria Isabel; de Avila Botton, Sônia; Robe, Lizandra Jaqueline; Pereira, Daniela Isabel Brayer; Monteiro, Danieli Urach; Lorensetti, Douglas Miotto; Santurio, Janio Morais
2017-03-01
Pythium insidiosum is an important oomycete due to its ability to infect humans and animals. It causes pythiosis, a disease of difficult treatment that occurs more frequently in humans in Thailand and in horses in Brazil. Since cell-wall components are frequently related to host shifts, we decided here to use sequences from the exo-1,3-β-glucanase gene (exo1), which encodes an immunodominant protein putatively involved in cell wall remodeling, to investigate the microevolutionary relationships of Brazilian and Thai isolates of P. insidiosum. After neutrality ratification, the phylogenetic analyses performed through Maximum parsimony (MP), Neighbor-joining (NJ), Maximum likelihood (ML), and Bayesian analysis (BA) strongly supported Thai isolates being paraphyletic in relation to those from Brazil. The structure recovered by these analyses, as well as by Spatial Analysis of Molecular Variance (SAMOVA), suggests the subdivision of P. insidiosum into three clades or population groups, which are able to explain almost 81% of the variation encountered for exo1. Moreover, the two identified Thai clades were almost as strongly differentiated between each other, as they were from the Brazilian clade, suggesting an ancient Asian subdivision. The derived positioning in the phylogenetic tree, linked to the lower diversity values and the recent expansion signs detected for the Brazilian clade, further support this clade as derived in relation to the Asian populations. Thus, although some patterns presented here are compatible with those recovered with different molecular markers, exo1 was revealed to be a good marker for studying evolution in Pythium, providing robust and strongly supported results with regard to the patterns of origin and diversification of P. insidiosum. Copyright © 2016 Elsevier B.V. All rights reserved.
Pereira, Felipe B; Luque, José L
2017-02-01
Genetic and morphological variations in two component populations of Raphidascaris (Sprentascaris) lanfrediae collected in the intestine of Geophagus argyrosticus and G. proximus (Cichlidae) from the States of Pará and Amapá, Brazil, respectively, were explored for the first time. A phylogenetic study including two genes (18S and 28S of the rDNA) plus morphological and life history traits of "anisakid-related" nematodes (Anisakidae, Raphidascarididae) was also performed in order to clarify taxonomic and systematic issues related to these taxa. Gene alignments were subjected to maximum likelihood (ML) and Bayesian inference (BI) analyses, and the combined genetic and morphological datasets were subjected to maximum parsimony (MP) analysis. Despite subtle differences in morphology (mainly in male caudal papillae) and morphometry between specimens of R. (S.) lanfrediae from the two different hosts and from the type material of the species, no genetic variation was found among representatives of the newly collected material. This finding may represent an example of gene-environment interactions, similar to that recently observed for Raphidascaroides brasiliensis. Phylogenetic reconstructions indicated the paraphyly of Anisakidae, represented by two subfamilies, i.e., Anisakinae and Contracaecinae, and the monophyly of Raphidascarididae. Analysis of the combined datasets revealed that some morphological traits may represent apomorphic characters of Raphidascarididae and Anisakidae, whereas others are highly homoplastic and some should be interpreted with care to avoid errors. The results support the premise that taxonomists should consider Anisakidae and Raphidascarididae as separate families, and recognize only two subfamilies of Anisakidae, i.e., Anisakinae and Contracaecinae. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Carvalho-Sobrinho, Jefferson G; Alverson, William S; Alcantara, Suzana; Queiroz, Luciano P; Mota, Aline C; Baum, David A
2016-08-01
Bombacoideae (Malvaceae) is a clade of deciduous trees with a marked dominance in many forests, especially in the Neotropics. The historical lack of a well-resolved phylogenetic framework for Bombacoideae hinders studies in this ecologically important group. We reexamined phylogenetic relationships in this clade based on a matrix of 6465 nuclear (ETS, ITS) and plastid (matK, trnL-trnF, trnS-trnG) DNA characters. We used maximum parsimony, maximum likelihood, and Bayesian inference to infer relationships among 108 species (∼70% of the total number of known species). We analyzed the evolution of selected morphological traits: trunk or branch prickles, calyx shape, endocarp type, seed shape, and seed number per fruit, using ML reconstructions of their ancestral states to identify possible synapomorphies for major clades. Novel phylogenetic relationships emerged from our analyses, including three major lineages marked by fruit or seed traits: the winged-seed clade (Bernoullia, Gyranthera, and Huberodendron), the spongy endocarp clade (Adansonia, Aguiaria, Catostemma, Cavanillesia, and Scleronema), and the Kapok clade (Bombax, Ceiba, Eriotheca, Neobuchia, Pachira, Pseudobombax, Rhodognaphalon, and Spirotheca). The Kapok clade, the most diverse lineage of the subfamily, includes sister relationships (i) between Pseudobombax and "Pochota fendleri" a historically incertae sedis taxon, and (ii) between the Paleotropical genera Bombax and Rhodognaphalon, implying just two bombacoid dispersals to the Old World, the other one involving Adansonia. This new phylogenetic framework offers new insights and a promising avenue for further evolutionary studies. In view of this information, we present a new tribal classification of the subfamily, accompanied by an identification key. Copyright © 2016 Elsevier Inc. All rights reserved.
Thongklang, Naritsada; Nawaz, Rizwana; Khalid, Abdul N; Chen, Jie; Hyde, Kevin D; Zhao, Ruilin; Parra, Luis A; Hanif, Muhammad; Moinard, Magalie; Callac, Philippe
2014-01-01
The genus Agaricus is known for its medicinal and edible species but also includes toxic species that belong to section Xanthodermatei. Previous phylogenetic reconstruction for temperate species, based on sequence data of nuc rRNA gene (rDNA) internal transcribed spacers (ITS), has revealed two major groups in this section and a possible third lineage for A. pseudopratensis. Recent research in Agaricus has shown that classifications need improving with the addition of tropical taxa. In this study we add new tropical collections to section Xanthodermatei. We describe three species from collections made in Pakistan and Thailand and include them in a larger analysis using all available ITS data for section Xanthodermatei. Agaricus bisporiticus sp. nov. and A. fuscopunctatus sp. nov. are introduced based on molecular and morphological studies, whereas A. microvolvatulus is recorded for the first time in Asia. Specimens from Thailand however have a much larger pileus than the type specimens from Congo. In maximum likelihood (ML) and maximum parsimony (MP) phylogenetic analyses these three species cluster with A. pseudopratensis from the Mediterranean area and A. murinocephalus recently described from Thailand. In Agaricus section Xanthodermatei this new group is monophyletic and receives low bootstrap support whereas the two previously known groups receive strong support. Within the new group, the most closely related species share some traits, but we did not find any unifying morphological character; however the five species of the group share a unique short nucleotide sequence. Two putatively toxic species of section Xanthodermatei are now recognized in Pakistan and six in Thailand. © 2014 by The Mycological Society of America.
NASA Astrophysics Data System (ADS)
Almog, Assaf; Garlaschelli, Diego
2014-09-01
The dynamics of complex systems, from financial markets to the brain, can be monitored in terms of multiple time series of activity of the constituent units, such as stocks or neurons, respectively. While the main focus of time series analysis is on the magnitude of temporal increments, a significant piece of information is encoded into the binary projection (i.e. the sign) of such increments. In this paper we provide further evidence of this by showing strong nonlinear relations between binary and non-binary properties of financial time series. These relations are a novel quantification of the fact that extreme price increments occur more often when most stocks move in the same direction. We then introduce an information-theoretic approach to the analysis of the binary signature of single and multiple time series. Through the definition of maximum-entropy ensembles of binary matrices and their mapping to spin models in statistical physics, we quantify the information encoded into the simplest binary properties of real time series and identify the most informative property given a set of measurements. Our formalism is able to accurately replicate, and mathematically characterize, the observed binary/non-binary relations. We also obtain a phase diagram allowing us to identify, based only on the instantaneous aggregate return of a set of multiple time series, a regime where the so-called ‘market mode’ has an optimal interpretation in terms of collective (endogenous) effects, a regime where it is parsimoniously explained by pure noise, and a regime where it can be regarded as a combination of endogenous and exogenous factors. Our approach allows us to connect spin models, simple stochastic processes, and ensembles of time series inferred from partial information.
Weather radar data correlate to hail-induced mortality in grassland birds
Carver, Amber; Ross, Jeremy D.; Augustine, David J.; Skagen, Susan K.; Dwyer, Angela M.; Tomback, Diana F.; Wunder, Michael B.
2017-01-01
Small-bodied terrestrial animals such as songbirds (Order Passeriformes) are especially vulnerable to hail-induced mortality; yet, hail events are challenging to predict, and they often occur in locations where populations are not being studied. Focusing on nesting grassland songbirds, we demonstrate a novel approach to estimate hail-induced mortality. We quantify the relationship between the probability of nests destroyed by hail and measured Level-III Next Generation Radar (NEXRAD) data, including atmospheric base reflectivity, maximum estimated size of hail and maximum estimated azimuthal wind shear. On 22 June 2014, a hailstorm in northern Colorado destroyed 102 out of 203 known nests within our research site. Lark bunting (Calamospiza melanocorys) nests comprised most of the sample (n = 186). Destroyed nests were more likely to be found in areas of higher storm intensity, and distributions of NEXRAD variables differed between failed and surviving nests. For 133 ground nests where nest-site vegetation was measured, we examined the ameliorative influence of woody vegetation, nest cover and vegetation density by comparing results for 13 different logistic regression models incorporating the independent and additive effects of weather and vegetation variables. The most parsimonious model used only the interactive effect of hail size and wind shear to predict the probability of nest survival, and the data provided no support for any of the models without this predictor. We conclude that vegetation structure may not mitigate mortality from severe hailstorms and that weather radar products can be used remotely to estimate potential for hail mortality of nesting grassland birds. These insights will improve the efficacy of grassland bird population models under predicted climate change scenarios.
Amphritea ceti sp. nov., isolated from faeces of Beluga whale (Delphinapterus leucas).
Kim, Young-Ok; Park, Sooyeon; Kim, Doo Nam; Nam, Bo-Hye; Won, Sung-Min; An, Du Hae; Yoon, Jung-Hoon
2014-12-01
A Gram-stain-negative, aerobic, non-spore-forming, non-flagellated and rod-shaped or ovoid bacterial strain, designated RA1(T), was isolated from faeces collected from Beluga whale (Delphinapterus leucas) in Yeosu aquarium, South Korea. Strain RA1(T) grew optimally at 25 °C, at pH 7.0-8.0 and in the presence of 2.0 % (w/v) NaCl. Neighbour-joining, maximum-likelihood and maximum-parsimony phylogenetic trees based on 16S rRNA gene sequences revealed that strain RA1(T) joins the cluster comprising the type strains of three species of the genus Amphritea, with which it exhibited 95.8-96.0 % sequence similarity. Sequence similarities to the type strains of other recognized species were less than 94.3 %. Strain RA1(T) contained Q-8 as the predominant ubiquinone and summed feature 3 (C16 : 1ω7c and/or C16 : 1ω6c), C18 : 1ω7c and C16 : 0 as the major fatty acids. The major polar lipids of strain RA1(T) were phosphatidylethanolamine, phosphatidylglycerol, two unidentified lipids and one unidentified aminolipid. The DNA G+C content of strain RA1(T) was 47.4 mol%. The differential phenotypic properties, together with the phylogenetic distinctiveness, revealed that strain RA1(T) is separated from other species of the genus Amphritea. On the basis of the data presented, strain RA1(T) is considered to represent a novel species of the genus Amphritea, for which the name Amphritea ceti sp. nov. is proposed. The type strain is RA1(T) ( = KCTC 42154(T) = NBRC 110551(T)). © 2014 IUMS.
Molecular phylogeny of extant Holothuroidea (Echinodermata).
Miller, Allison K; Kerr, Alexander M; Paulay, Gustav; Reich, Mike; Wilson, Nerida G; Carvajal, Jose I; Rouse, Greg W
2017-06-01
Sea cucumbers (Holothuroidea) are a morphologically diverse, ecologically important, and economically valued clade of echinoderms; however, the understanding of the overall systematics of the group remains controversial. Here, we present a phylogeny of extant Holothuroidea assessed with maximum parsimony, maximum likelihood, and Bayesian approaches using approximately 4.3kb of mt- (COI, 16S, 12S) and nDNA (H3, 18S, 28S) sequences from 82 holothuroid terminals representing 23 of the 27 widely-accepted family-ranked taxa. Currently five holothuroid taxa of ordinal rank are accepted. We find that three of the five orders are non-monophyletic, and we revise the taxonomy of the groups accordingly. Apodida is sister to the rest of Holothuroidea, here considered Actinopoda. Within Actinopoda, Elasipodida in part is sister to the remaining Actinopoda. This latter clade, comprising holothuroids with respiratory trees, is now called Pneumonophora. The traditional Aspidochirotida is paraphyletic, with representatives from three orders (Molpadida, Dendrochirotida, and Elasipodida in part) nested within. Therefore, we discontinue the use of Aspidochirotida and instead erect Holothuriida as the sister group to the remaining Pneumonophora, here termed Neoholothuriida. We found four well-supported major clades in Neoholothuriida: Dendrochirotida, Molpadida and two new clades, Synallactida and Persiculida. The mapping of traditionally-used morphological characters in holothuroid systematics onto the phylogeny revealed marked homoplasy in most characters demonstrating that further taxonomic revision of Holothuroidea is required. 
Two time-tree analyses, one based on calibrations for uncontroversial crown group dates for Eleutherozoa, Echinozoa and Holothuroidea and another using these calibrations plus four more from within Holothuroidea, showed major discrepancies, suggesting that fossils of Holothuroidea may need reassessment in terms of placing these forms with existing crown clades. Copyright © 2017 Elsevier Inc. All rights reserved.
Daniels, Savel R
2011-11-01
The endemic, monotypic freshwater crab species Seychellum alluaudi was used as a template to examine the initial colonisation and evolutionary history among the major islands in the Seychelles Archipelago. Five of the "inner" islands in the Seychelles Archipelago, including Mahé, Praslin, Silhouette, La Digue and Frégate, were sampled. Two partial mtDNA fragments, 16S rRNA and cytochrome oxidase subunit I (COI), were sequenced for 83 specimens of S. alluaudi. Evolutionary relationships between populations were inferred from the combined mtDNA dataset using maximum parsimony, maximum likelihood and Bayesian inference. Analyses of molecular variance (AMOVA) were used to examine genetic variation among and within clades. A haplotype network was constructed using TCS, while BEAST was employed to date the colonisation and divergence of lineages on the islands. Phylogenetic analyses of the combined mtDNA data set of 1103 base pairs retrieved a monophyletic S. alluaudi group comprising three statistically well-supported monophyletic clades. Clade one was exclusive to Silhouette; clade two included samples from Praslin sister to La Digue, while clade three comprised samples from Mahé sister to Frégate. The haplotype network corresponded to the three clades. Within Mahé, substantial phylogeographic substructure was evident. AMOVA results revealed limited genetic variation within localities, with most variation occurring among localities. Divergence time estimations predated the Holocene sea level regressions and indicated a Pliocene/Pleistocene divergence between the three clades evident within S. alluaudi. The monophyly of each clade suggests that transoceanic dispersal is rare. The absence of shared haplotypes between the three clades, coupled with marked sequence divergence values, suggests the presence of three allospecies within S. alluaudi. Copyright © 2011 Elsevier Inc. All rights reserved.
Phylogeography of the Macaronesian Lettuce Species Lactuca watsoniana and L. palmensis (Asteraceae).
Dias, Elisabete F; Kilian, Norbert; Silva, Luís; Schaefer, Hanno; Carine, Mark; Rudall, Paula J; Santos-Guerra, Arnoldo; Moura, Mónica
2018-02-24
The phylogenetic relationships and phylogeography of two relatively rare Macaronesian Lactuca species, Lactuca watsoniana (Azores) and L. palmensis (Canary Islands), were, until now, unclear. Karyological information for the Azorean species was also lacking. For this study, a chromosome count was performed and L. watsoniana showed 2n = 34. A phylogenetic approach was used to clarify the relationships of the Azorean endemic L. watsoniana and the La Palma endemic L. palmensis within the subtribe Lactucinae. Maximum parsimony, maximum likelihood and Bayesian analyses of a combined molecular dataset (ITS and four chloroplast DNA regions) and molecular clock analyses were performed with the Macaronesian Lactuca species, as well as a TCS haplotype network. The analyses revealed that L. watsoniana and L. palmensis belong to different subclades of the Lactuca clade. Lactuca watsoniana showed a strongly supported phylogenetic relationship with North American species, while L. palmensis was closely related to L. tenerrima and L. inermis, from Europe and Africa. Lactuca watsoniana showed four single-island haplotypes. A divergence time estimation of the Macaronesian lineages was used to examine island colonization pathways. Results obtained with BEAST suggest a divergence of the L. palmensis and L. watsoniana clades c. 11 million years ago; L. watsoniana diverged from its North American sister species c. 3.8 million years ago, and L. palmensis diverged from its sister L. tenerrima c. 1.3 million years ago, probably originating from an African ancestral lineage which colonized the Canary Islands. Divergence analyses with *BEAST indicate a more recent divergence of the L. watsoniana crown, c. 0.9 million years ago. In the Azores, colonization in a stepping-stone, east-to-west dispersal pattern associated with geological events might explain the current distribution range of L. watsoniana.
Delimiting Areas of Endemism through Kernel Interpolation
Oliveira, Ubirajara; Brescovit, Antonio D.; Santos, Adalberto J.
2015-01-01
We propose a new approach for identification of areas of endemism, the Geographical Interpolation of Endemism (GIE), based on kernel spatial interpolation. This method differs from others in being independent of grid cells. The approach estimates the overlap between species distributions through a kernel interpolation of the centroids of species distributions, with areas of influence defined from the distance between the centroid and the farthest point of occurrence of each species. We used this method to delimit areas of endemism of spiders from Brazil. To assess the effectiveness of GIE, we analyzed the same data using Parsimony Analysis of Endemism and NDM and compared the areas identified through each method. The analyses using GIE identified 101 areas of endemism of spiders in Brazil. GIE proved effective in identifying areas of endemism at multiple scales, with fuzzy edges and supported by more synendemic species than the other methods. The areas of endemism identified with GIE were generally congruent with those identified for other taxonomic groups, suggesting that common processes can be responsible for the origin and maintenance of these biogeographic units. PMID:25611971
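The centroid-and-radius idea in the abstract above can be illustrated with a minimal sketch. It is not the authors' implementation: the quadratic kernel shape, the planar (x, y) coordinates, and all function names (`centroid`, `influence_radius`, `kernel`, `endemism_surface`) are assumptions made for illustration only. Each species contributes a kernel centred on the centroid of its occurrence points, with a radius equal to the distance from that centroid to its farthest occurrence; summing the kernels at a location gives a rough measure of overlap between species distributions there.

```python
import math


def centroid(points):
    """Mean x and y of a species' occurrence points (planar coordinates assumed)."""
    n = len(points)
    return (sum(x for x, _ in points) / n, sum(y for _, y in points) / n)


def influence_radius(points, center):
    """Distance from the centroid to the farthest occurrence point."""
    return max(math.dist(p, center) for p in points)


def kernel(d, r):
    """Quadratic kernel (an assumed shape): 1 at the centroid, 0 at distance r and beyond."""
    if r == 0 or d >= r:
        return 0.0
    u = d / r
    return (1 - u * u) ** 2


def endemism_surface(species_points, query):
    """Sum of per-species kernel values at the query location."""
    total = 0.0
    for points in species_points.values():
        c = centroid(points)
        r = influence_radius(points, c)
        total += kernel(math.dist(query, c), r)
    return total


# Hypothetical occurrence data for two species.
occurrences = {
    "species_a": [(0.0, 0.0), (2.0, 0.0)],
    "species_b": [(0.0, 0.0), (0.0, 2.0), (2.0, 0.0), (2.0, 2.0)],
}
score_inside = endemism_surface(occurrences, (1.0, 0.0))
score_outside = endemism_surface(occurrences, (10.0, 10.0))
```

In this sketch a species known from a single point has radius zero and contributes nothing; a real analysis would need an explicit rule for such cases, as well as geographic rather than planar distances.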
Marcussen, Thomas; Heier, Lise; Brysting, Anne K; Oxelman, Bengt; Jakobsen, Kjetill S
2015-01-01
Allopolyploidization accounts for a significant fraction of speciation events in many eukaryotic lineages. However, existing phylogenetic and dating methods require tree-like topologies and are unable to handle the network-like phylogenetic relationships of lineages containing allopolyploids. No explicit framework has so far been established for evaluating competing network topologies, and few attempts have been made to date phylogenetic networks. We used a four-step approach to generate a dated polyploid species network for the cosmopolitan angiosperm genus Viola L. (Violaceae Batch.). The genus contains ca 600 species and both recent (neo-) and more ancient (meso-) polyploid lineages distributed over 16 sections. First, we obtained DNA sequences of three low-copy nuclear genes and one chloroplast region, from 42 species representing all 16 sections. Second, we obtained fossil-calibrated chronograms for each nuclear gene marker. Third, we determined the most parsimonious multilabeled genome tree and its corresponding network, resolved at the section (not the species) level. Reconstructing the "correct" network for a set of polyploids depends on recovering all homoeologs, i.e., all subgenomes, in these polyploids. Assuming the presence of Viola subgenome lineages that were not detected by the nuclear gene phylogenies ("ghost subgenome lineages") significantly reduced the number of inferred polyploidization events. We identified the most parsimonious network topology from a set of five competing scenarios differing in the interpretation of homoeolog extinctions and lineage sorting, based on (i) fewest possible ghost subgenome lineages, (ii) fewest possible polyploidization events, and (iii) least possible deviation from expected ploidy as inferred from available chromosome counts of the involved polyploid taxa. Finally, we estimated the homoploid and polyploid speciation times of the most parsimonious network. 
Homoploid speciation times were estimated by coalescent analysis of gene tree node ages. Polyploid speciation times were estimated by comparing branch lengths and speciation rates of lineages with and without ploidy shifts. Our analyses recognize Viola as an old genus (crown age 31 Ma) whose evolutionary history has been profoundly affected by allopolyploidy. Between 16 and 21 allopolyploidizations are necessary to explain the diversification of the 16 major lineages (sections) of Viola, suggesting that allopolyploidy has accounted for a high percentage (between 67% and 88%) of the speciation events at this level. The theoretical and methodological approaches presented here for (i) constructing networks and (ii) dating speciation events within a network have general applicability for phylogenetic studies of groups where allopolyploidization has occurred. They make explicit use of a hitherto underexplored source of ploidy information from chromosome counts to help resolve phylogenetic cases where incomplete sequence data hamper network inference. Importantly, the coalescent-based method used herein circumvents the assumption of tree-like evolution required by most techniques for dating speciation events. © The Author(s) 2014. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.